There are 14 repositories under tacotron topic.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
DeepMind's Tacotron-2 Tensorflow implementation
Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.
⏩ Generating speech in a single forward pass without any attention!
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Machine learning based speech synthesis Electron app, with voices from specific characters from video games
Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2
A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
Phoneme multilingual(Russian-English) voice cloning based on
Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
PyTorch implementation of Tacotron speech synthesis model.
:computer: :robot: A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech :speaker:
Tensorflow Implementation of Expressive Tacotron
Speech Recognition Using Tacotron
A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型,适用于英语、普通话/中文、日语、韩语、俄语和藏语(当前已测试)。
Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
Tensorflow implementation of Chinese/Mandarin TTS (Text-to-Speech) based on Tacotron-2 model.
A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
VAE Tacotron 2, an alternative of GST Tacotron
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.
Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09017.pdf)
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Multilingual text processing API for cleaning, IPA word phonemization, tokenization, translating into sequence of character IDs for easy stacking with neural Text-to-Speech models.
Google's TPGST reimplementation.
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.