lbqin / SpeechSynthesis

A survey of speech synthesis

Text-to-Speech Synthesis

Materials related to speech synthesis with deep learning

Lectures & Seminars

Dataset

Korean Corpus

WaveNet

Paper

Articles

Source Code

Multi-GPU

WaveNet takes so long to train that multi-GPU training appears to be necessary to get usable results. Links to related code are collected here.

Fast WaveNet

Paper

Articles

Source Code

Parallel WaveNet

Paper

Articles

Source Code

WaveRNN

Paper

Deep Voice

Paper

Deep Voice 2

Paper

Deep Voice 3

Paper

Source Code

Tacotron

Paper

Source Code

Tacotron 2

Paper

Articles

Source Code

HybridNet

ClariNet

Articles

Demo

Voice Cloning

Paper

Speed-Up Strategy

  • Fast Generation for Convolutional Autoregressive Models (2017.04) - Prajit Ramachandran et al.
    • This technique was applied to WaveNet and PixelCNN++ models, with reported speedups of up to 21 times and 183 times, respectively. These figures are the maximum improvement for a specific situation, so the speedup in a real environment may be smaller than expected.
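
The idea behind this kind of speedup can be illustrated with a toy example. The sketch below is not the paper's code: it assumes a stack of dilated causal convolutions with kernel size 2, scalar weights, and a tanh activation, and all names (`make_layers`, `naive_step`, `CachedGenerator`) are hypothetical. The naive version recomputes the whole stack over the full history for every new sample, while the cached version keeps one queue of past inputs per layer and does a constant amount of work per layer per sample.

```python
from collections import deque

import numpy as np


def make_layers(num_layers, seed=0):
    """Toy layers (assumption): (w_past, w_now, dilation), dilation doubling per layer."""
    rng = np.random.default_rng(seed)
    return [(rng.normal(), rng.normal(), 2 ** i) for i in range(num_layers)]


def naive_step(history, layers):
    """Recompute every layer over the full history and return the newest output."""
    h = np.asarray(history, dtype=float)
    for w_past, w_now, d in layers:
        # Shift the layer input right by d steps, padding with zeros (causal).
        past = np.concatenate([np.zeros(d), h])[: len(h)]
        h = np.tanh(w_past * past + w_now * h)
    return h[-1]


class CachedGenerator:
    """Keeps, for each layer, a queue of that layer's last d inputs."""

    def __init__(self, layers):
        self.layers = layers
        # Queues start with zeros, matching the zero padding in naive_step.
        self.queues = [deque([0.0] * d, maxlen=d) for _, _, d in layers]

    def step(self, x):
        h = float(x)
        for (w_past, w_now, d), queue in zip(self.layers, self.queues):
            past = queue[0]   # this layer's input from d steps ago
            queue.append(h)   # maxlen drops the oldest entry automatically
            h = np.tanh(w_past * past + w_now * h)
        return h


if __name__ == "__main__":
    layers = make_layers(num_layers=4)
    inputs = np.random.default_rng(1).normal(size=32)
    generator = CachedGenerator(layers)
    for t, x in enumerate(inputs):
        # Both paths should produce the same sample at every step.
        assert np.isclose(naive_step(inputs[: t + 1], layers), generator.step(x))
```

In this sketch the inputs are a fixed random sequence rather than samples fed back from the model, which keeps the comparison between the two paths simple. The per-sample cost of the cached path depends on the number of layers, not on the length of the history.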

About

A survey of speech synthesis

License:Apache License 2.0