This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

Last update: Dec 05, 2022

Overview

Non-autoregressive Deep Learning-Based TTS Template

This is a template for the Non-autoregressive TTS model. It contains

Data Preprocessing Pipeline
Data Loader
Model / Trainer
Logger, Postprocessing (logging, synthesizing, plotting, etc..)

How to use it?

Clone the repository.

git clone https://github.com/keonlee9420/Deep-Learning-TTS-Template
cd Deep-Learning-TTS-Template

Replace all MYMODEL strings in this repo with your model name and also rename the file model/MYMODEL.py.
Build your model on model/ and check train.py and synthesize.py.
Use README_template.md for the README.md file of your project.
Feel free to add /img for your model architecture and tensorboard examples. It would also be nice to show your model's output audio in /demo.
Don't forget to update requirements.txt and /config of your project.

Citation

@misc{lee2021deep_learning_tts_template,
  author = {Lee, Keon},
  title = {Deep-Learning-TTS-Template},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/keonlee9420/Deep-Learning-TTS-Template}}
}

References

ming024's FastSpeech2

You might also like...

Pytorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

DiffSinger - PyTorch Implementation PyTorch implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension). Status

152 Jan 2, 2023

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

GradTTS Unofficial Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech" (arxiv) About this repo This is an unoffic

103 Dec 23, 2022

A non-linear, non-parametric Machine Learning method capable of modeling complex datasets

Fast Symbolic Regression Symbolic Regression is a non-linear, non-parametric Machine Learning method capable of modeling complex data sets. fastsr aim

3 Jun 22, 2022

A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection

Confluence: A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection 1. 介绍用以替代 NMS，在所有 bbox 中挑选出最优的集合。 NMS 仅考虑了 bbox 的得分，然后根据 IOU 来

44 Sep 15, 2022

This project uses Template Matching technique for object detecting by detection of template image over base image.

Object Detection Project Using OpenCV This project uses Template Matching technique for object detecting by detection the template image over base ima

7 May 29, 2022

This project uses Template Matching technique for object detecting by detection of template image over base image

Object Detection Project Using OpenCV This project uses Template Matching technique for object detecting by detection the template image over base ima

4 Nov 16, 2021

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

English | 简体中文 Why Non-Euclidean Geometry Considering these simple graph structures shown below. Nodes with same color has 2-hop distance whereas 1-ho

123 Dec 12, 2022

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

53 Dec 29, 2022

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

One model to speak them all 🌎 Audio Language Text ▷ Chinese 人人生而自由，在尊严和权利上一律平等。 ▷ English All human beings are born free and equal in dignity and rig

60 Nov 14, 2022

This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

Related tags

Overview

Non-autoregressive Deep Learning-Based TTS Template

How to use it?

Citation

References

You might also like...

Pytorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

A non-linear, non-parametric Machine Learning method capable of modeling complex datasets

A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection

This project uses Template Matching technique for object detecting by detection of template image over base image.

This project uses Template Matching technique for object detecting by detection of template image over base image

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

Releases(v1.0.0)

v1.0.0(Jun 15, 2021)

Owner

Keon Lee

SAGE: Sensitivity-guided Adaptive Learning Rate for Transformers

Experiments and examples converting Transformers to ONNX

Code and models for "Rethinking Deep Image Prior for Denoising" (ICCV 2021)

Official code of the paper "ReDet: A Rotation-equivariant Detector for Aerial Object Detection" (CVPR 2021)

Interpretable and Generalizable Person Re-Identification with Query-Adaptive Convolution and Temporal Lifting

Object classification with basic computer vision techniques

[AAAI22] Reliable Propagation-Correction Modulation for Video Object Segmentation

A resource for learning about ML, DL, PyTorch and TensorFlow. Feedback always appreciated :)

PyTorch implementation of paper “Unbiased Scene Graph Generation from Biased Training”

Luminous is a framework for testing the performance of Embodied AI (EAI) models in indoor tasks.

TensorFlow implementation of "A Simple Baseline for Bayesian Uncertainty in Deep Learning"

Official implementation of Unfolded Deep Kernel Estimation for Blind Image Super-resolution.

Official PyTorch implementation of the paper "Graph-based Generative Face Anonymisation with Pose Preservation" in ICIAP 2021

The original implementation of TNDM used in the NeurIPS 2021 paper (no longer being updated)

Demo code for paper "Learning optical flow from still images", CVPR 2021.

a Lightweight library for sequential learning agents, including reinforcement learning

A modified version of DeepMind's Alphafold2 to divide CPU part (MSA and template searching) and GPU part (prediction model)

A tf.keras implementation of Facebook AI's MadGrad optimization algorithm

Supplementary code for the experiments described in the 2021 ISMIR submission: Leveraging Hierarchical Structures for Few Shot Musical Instrument Recognition.

QKeras: a quantization deep learning library for Tensorflow Keras