Text to Speech(TTS)/Style Transfer/Voice Cloning Landscape
Reddit Posts:
- [N] Baidu AI Can Clone Your Voice in Seconds
- [R] Expressive Speech Synthesis with Tacotron
- [D] Realtime Neural Voice Style Transfer Feasibility and Implications
- [D] Is there an implementation of Neural Voice Cloning?
- [D] Are the hyper-realistic results of Tacotron-2 and Wavenet not reproducible?
- [P] Voice Style Transfer: Speaking like Kate Winslet
Samples from github:
Work in progress:
- https://github.com/ErnstTmp is implementing https://arxiv.org/abs/1807.06736
- https://github.com/nii-yamagishilab/self-attention-tacotron
- https://github.com/nii-yamagishilab/tacotron2
If I missed your output sample/demo in this consolidation, just add and send a pull request. I will be more than happy to add it. Thanks!
Codelabs:
Product Demos:
- Lyrebird samples(official)
- Lyrebird Demo(official)
- Google Duplex Demo(official)
- Adobe Voco Demo(official)
- Voice Cloning Toolbox(official)
Related Works:
Arxiv-sanity
Support:
If you want the good work to continue please support us on