nlp deep-learning tts style-transfer voice-cloning tacotron tacotron-2 prosody style-tokens

Text to Speech(TTS)/Style Transfer/Voice Cloning Landscape

Reddit Posts:

Samples from github:

Samples	Pretrained Models	Code	Paper	Output Quality
Baidu's Deep Voice samples(official)	--	--	--	D
Baidu's Deep Voice 3 samples(official)	--	--	1710.07654	B
Google Tacotron2 samples(official)	--	--	1712.05884	A
Google tacotron + style transfer sample(official)	--	--	1803.09047	A
NVIDIA's waveglow	Download Model	Code	1811.00002	A
NVIDIA's tacotron2 + waveglow	Download Model	Code	--	A
Griffin-Lim	--	--	--	A
Deepmind Neural Discrete Representation Learning samples(official)	--	--	1711.00937	B
r9y9's wavenet vocoder Tacotron2(189k iterations)	(Download Tacotron2 model) - (Download wavenet model(1000k iterations)) - (Get models)	--	1712.05884 and 1611.09482	B
dhgrs's implementation of Neural Discrete Representation Learning samples	Download Model	Code	1711.00937	D
mazzzystar's Tacotron-WaveRNN samples(730k iterations)	Get Model	Code	--	A
syang1993's tacotron + style transfer samples(200k iterations)	Model ErnstTmp(232k iter)	--	1803.09047 and 1803.09017	C
keithito's tacotron samples(414k iterations)	Get model	--	--	D
rayhane's Tacotron2 samples(6k4 steps(whatever that means))	--	--	--	D
Kyubyong's tacotron on LJ dataset(200k iterations)	Download model	--	--	D
Kyubyong's tacotron on nick dataset(215k iterations)	--	--	--	D
Kyubyong's tacotron on web dataset(183k iterations)	Download model	--	--	D
Kyubyong's expressive tacotron(420k iterations)	--	Code	1803.09047	D
Kyubyong's dc-tts on LJ dataset(800k iterations)	Get model	--	--	D
Kyubyong's dc-tts on nick dataset(800k iterations)	--	--	--	D
Kyubyong's dc-tts kate(800k iterations)	--	--	--	D
andabi's deep voice conversion	--	--	--	D
Facebook Loop samples(official)	Get model	--	--	D
mazzzystar's randomCNN voice transfer	--	--	1712.08363	D

Work in progress:

If I missed your output sample/demo in this consolidation, just add and send a pull request. I will be more than happy to add it. Thanks!

Codelabs:

https://github.com/tugstugi/dl-colab-notebooks

Product Demos:

Lyrebird samples(official)
Lyrebird Demo(official)
Google Duplex Demo(official)
Adobe Voco Demo(official)
Voice Cloning Toolbox(official)

Related Works:

https://github.com/tensorflow/magenta

Arxiv-sanity

Support:

If you want the good work to continue please support us on

About

nlp deep-learning tts style-transfer voice-cloning tacotron tacotron-2 prosody style-tokens