tacotron

There are 17 repositories under tacotron topic.

coqui-ai / TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
deep-learning glow-tts hifigan melgan multi-speaker-tts python pytorch speaker-encoder speaker-encodings speech speech-synthesis tacotron text-to-speech tts tts-model vocoder voice-cloning voice-conversion voice-synthesis
Language:Python 39006
mozilla / TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
dataset-analysis deep-learning gantts glow-tts melgan multiband-melgan python pytorch speaker-encoder speech tacotron tacotron2 tensorflow2 text-to-speech tts vocoder
Language:Jupyter Notebook 9758
keithito / tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
tacotron tensorflow speech-synthesis python machine-learning tts
Language:Python 2973
Rayhane-mamah / Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
tacotron tensorflow paper python speech-synthesis text-to-speech wavenet
Language:Python 2309
fatchord / WaveRNN
WaveRNN Vocoder + TTS
wavernn pytorch neural-vocoder speech-synthesis tts tacotron text-to-speech
Language:Python 2154
DanRuta / xVA-Synth
Machine learning based speech synthesis Electron app, with voices from specific characters from video games
voice-synthesis tacotron machine-learning electron skyrim fallout elder-scrolls speech-synthesis
Language:JavaScript 617
MycroftAI / mimic2
Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.
deep-learning neural-network machine-learning tacotron speech-synthesis
Language:Python 583
spring-media / ForwardTacotron
⏩ Generating speech in a single forward pass without any attention!
deep-learning axelspringerai text-to-speech python pytorch tacotron tts forwardtacotron
Language:Python 579
google / tacotron
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
machine-learning tts speech audio prosody tacotron
Language:HTML 541
MycroftAI / mimic-recording-studio
Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2
tts tts-engine mimic tacotron recording-studio docker microphone voice mycroft mycroftai hacktoberfest
Language:JavaScript 505
ranchlai / mandarin-tts
Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets
tts tts-chinese tts-hanzi tacotron pytorch fastspeech2 aishell3 multi-speaker
Language:Python 472
Emotional-Text-to-Speech / dl-for-emo-tts
:computer: :robot: A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech :speaker:
deep-learning emotional-tts speech-synthesis tacotron dc-tts affective-computing tts lj-speech tacotron-models ravdess
Language:Jupyter Notebook 445
gia-guar / JARVIS-ChatGPT
A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.
ai chat-gpt-3 ibm-watson jarvis-ai openai python tacotron speech-recognition tts chatgpt chatgpt-api elevenlabs pytorch stt
Language:Python 406
Multi-Tacotron-Voice-Cloning
vlomme / Multi-Tacotron-Voice-Cloning
Phoneme multilingual(Russian-English) voice cloning based on
deep-learning pytorch tensorflow tts voice-cloning g2p tacotron wavernn russian
Language:Python 394
syang1993 / gst-tacotron
A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
expressive-tacotron expressive-speech-synthesis tacotron global-style-tokens gst-tacotron
Language:Python 368
KinglittleQ / GST-Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
gst-tacotron pytorch tts tacotron
Language:Python 366
r9y9 / tacotron_pytorch
PyTorch implementation of Tacotron speech synthesis model.
speech-synthesis pytorch tacotron python speech
Language:Jupyter Notebook 309
atomicoo / FCH-TTS
A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型，适用于英语、普通话/中文、日语、韩语、俄语和藏语（当前已测试）。
tts english tibetan mandarin japanese russian dctts tacotron fastspeech korean chinese melgan pytorch
Language:Python 264
NTT123 / vietTTS
Vietnamese Text to Speech library
tts-engines deep-learning tacotron vocoder hifi-gan vietnam vietnamese text-to-speech
Language:Python 226
soobinseo / Tacotron-pytorch
Pytorch implementation of Tacotron
tacotron text-to-speech tts pytorch
Language:Python 206
Kyubyong / expressive_tacotron
Tensorflow Implementation of Expressive Tacotron
speech-to-text speech-synthesis tacotron
Language:Python 196
Kyubyong / tacotron_asr
Speech Recognition Using Tacotron
speech-recognition speech-to-text tacotron speech
Language:Python 163
BogiHsu / Tacotron2-PyTorch
Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
tacotron2-pytorch tacotron2 tacotron pretrained-models reduction-factor pytorch tts text-to-speech ljspeech
Language:Python 147
karim23657 / Persian-tts-coqui
Persian/Farsi text to speech(TTS) training using coqui tts
farsi farsi-datasets persian persian-dataset persian-language text-to-speech tts coqui coqui-ai coqui-tts dataset deep-learning glow-tts hifigan speech speech-synthesis tacotron tts-model vits persian-tts
Language:Jupyter Notebook 140
atomicoo / tacotron2-mandarin
Tensorflow implementation of Chinese/Mandarin TTS (Text-to-Speech) based on Tacotron-2 model.
tensorflow tacotron tacotron2 chinese mandarin tts
Language:Python 131
ide8 / tacotron2
Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow
tacotron tacotron2 tacotron2-pytorch waveglow tts multispeaker emotions nvidia
Language:Jupyter Notebook 128
bshall / Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
speech-synthesis text-to-speech tacotron pytorch tts attention-mechanism
Language:Python 114
ttaoREtw / Tacotron-pytorch
A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
deep-learning text-to-speech tacotron end-to-end speech-synthesis pytorch seq2seq
Language:Python 110
rishikksh20 / vae_tacotron2
VAE Tacotron 2, an alternative of GST Tacotron
tacotron-2 vae-tacotron tacotron speech-synthesis tts variational-autoencoder
Language:Python 88
BridgetteSong / ExpressiveTacotron
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.
tacotron gst-tacotron vae-tacotron gmvae-tacotron forward-attention-tacotron durian non-attentive-tacotron gmm-tacotron
Language:Python 74
acetylSv / GST-tacotron
Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09017.pdf)
speech-synthesis gst-tacotron tacotron global-style-tokens
Language:Python 61
everydaycodings / MimicMania
MimicMania is a web application that allows you to generate speech and clone voices using text-to-speech technology. With MimicMania, you can create custom voices in a variety of languages and use them for a range of applications, from voiceovers to chatbots.
cloning jspeech python streamlit tacotron text-to-speech tts voice-cloning hacktoberfest
Language:Python 59
rishikksh20 / gmvae_tacotron
Gaussian Mixture VAE Tacotron
tacotron tacotron-2 tts
Language:Python 53
kaituoxu / Tacotron2
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
text-to-speech speech-synthesis tacotron pytorch
Language:Python 52
keonlee9420 / Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
text-to-speech tts tacotron tacotron2 pytorch speech-synthesis autoregressive single-speaker multi-speaker robustness efficiency comprehensive neural-tts mel-gan hifi-gan reduction-factor diagonal-guided-attention deep-learning
Language:Python 48
ishandutta2007 / Text-to-Speech-Landscape
nlp deep-learning tts style-transfer voice-cloning tacotron tacotron-2 prosody style-tokens
44

tacotron

coqui-ai / TTS

mozilla / TTS

keithito / tacotron

Rayhane-mamah / Tacotron-2

fatchord / WaveRNN

DanRuta / xVA-Synth

MycroftAI / mimic2

spring-media / ForwardTacotron

google / tacotron

MycroftAI / mimic-recording-studio

ranchlai / mandarin-tts

Emotional-Text-to-Speech / dl-for-emo-tts

gia-guar / JARVIS-ChatGPT

vlomme / Multi-Tacotron-Voice-Cloning

syang1993 / gst-tacotron

KinglittleQ / GST-Tacotron

r9y9 / tacotron_pytorch

atomicoo / FCH-TTS

NTT123 / vietTTS

soobinseo / Tacotron-pytorch

Kyubyong / expressive_tacotron

Kyubyong / tacotron_asr

BogiHsu / Tacotron2-PyTorch

karim23657 / Persian-tts-coqui

atomicoo / tacotron2-mandarin

ide8 / tacotron2

bshall / Tacotron

ttaoREtw / Tacotron-pytorch

rishikksh20 / vae_tacotron2

BridgetteSong / ExpressiveTacotron

acetylSv / GST-tacotron

everydaycodings / MimicMania

rishikksh20 / gmvae_tacotron

kaituoxu / Tacotron2

keonlee9420 / Comprehensive-Tacotron2

ishandutta2007 / Text-to-Speech-Landscape