geneing's repositories
WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)
AndroidTTS
TTS Service for Android
AntennaPod
A podcast manager for Android
Bert-VITS2
vits2 backbone with multilingual-bert
espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
HiFi-GAN-1
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
lean-lora
Tune LLaMA on Lean data
lean4_tutorials
Some Lean tutorials
NeMo-text-processing
NeMo text processing for ASR and TTS
ParallelWaveGAN
Unofficial Parallel WaveGAN implementation with Pytorch
piper
A fast, local neural text to speech system
piper-phonemize
C++ library for converting text to phonemes for Piper
pocket-casts-android
Pocket Casts Android 🎧
sherpa-onnx
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift
StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese)
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
vits2_pytorch
unofficial vits2-TTS implementation in pytorch