vits chinese, tts chinese, tts mandarin

Last update: Dec 14, 2022

Related tags

Text Data & NLP tts

Overview

vits实现的中文TTS

this is the copy of https://github.com/jaywalnut310/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Espnet连接：github.com/espnet/espnet/tree/master/espnet2/gan_tts/vits

coqui-ai/TTS连接：github.com/coqui-ai/TTS/tree/main/recipes/ljspeech/vits_tts

如果有侵权行为，请联系我，我将删除项目

If there is infringement, please contact me and I will delete the item

基于VITS 实现 16K baker TTS 的流程记录

apt-get install espeak

pip install -r requirements.txt

cd monotonic_align

python setup.py build_ext --inplace

将16K标贝音频拷贝到./baker_waves/，启动训练

python train.py -c configs/baker_base.json -m baker_base

两张1080卡，训练两天，基本可以使用了

测试

python vits_strings.py

上面的模型训练出来后存在，明显停顿的问题

原因：

1，本来已经在音素后面强插边界了，VITS又强插边界了，具体是配置参数："add_blank": true

2，可能影响，随机时长预测，具体配置参数：use_sdp=True

vits chinese, tts chinese, tts mandarin

Related tags

Overview

基于VITS 实现 16K baker TTS 的流程记录

将16K标贝音频拷贝到./baker_waves/，启动训练

测试

Owner

AmorTX

Python module (C extension and plain python) implementing Aho-Corasick algorithm

Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].

Silero Models: pre-trained speech-to-text, text-to-speech models and benchmarks made embarrassingly simple

CoSENT、STS、SentenceBERT

⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡

Large-scale Knowledge Graph Construction with Prompting

Implementation of "Adversarial purification with Score-based generative models", ICML 2021

Hostapd-mac-tod-acl - Setup a hostapd AP with MAC ToD ACL

Vad-sli-asr - A Python scripts for a speech processing pipeline with Voice Activity Detection (VAD)

This repository is home to the Optimus data transformation plugins for various data processing needs.

Japanese synonym library

Switch spaces for knowledge graph embeddings

多语言降噪预训练模型MBart的中文生成任务

Prompt-learning is the latest paradigm to adapt pre-trained language models (PLMs) to downstream NLP tasks

[KBS] Aspect-based sentiment analysis via affective knowledge enhanced graph convolutional networks

NeoDays-based tileset for the roguelike CDDA (Cataclysm Dark Days Ahead)

Shared code for training sentence embeddings with Flax / JAX

A text augmentation tool for named entity recognition.

Conversational text Analysis using various NLP techniques

Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2021).