There are 53 repositories under tts topic.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.
🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,还可能是首个支持脑机交互的开源智能音箱项目。
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Lingvo
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
an open-source implementation of sequence-to-sequence based speech processing engine
Multi-source Translation
Управление Яндекс.Станцией и другими колонками с Алисой из Home Assistant
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
A Python/Pytorch app for easily synthesising human voices
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Augmentative and Alternative Communication (AAC) communication system with text-to-speech for the browser
⏩ Generating speech in a single forward pass without any attention!
一个可以录制 Microsoft Edge 浏览器的语音合成(TTS)语音并输出为 .wav 音频的(windows平台)工具。
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.