geneing's repositories
AndroidTTS
TTS Service for Android
MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
AntennaPod
A podcast manager for Android
Bert-VITS2
vits2 backbone with multilingual-bert
epub-kotlin-toolkit
A toolkit for ebooks, audiobooks and comics written in Kotlin
espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
hexgrad-kokoro
https://hf.co/hexgrad/Kokoro-82M
kaldifst
Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files
lean-lora
Tune LLaMA on Lean data
lean4_tutorials
Some Lean tutorials
LibreraReader
Book Reader for Android
NeMo-text-processing
NeMo text processing for ASR and TTS
piper
A fast, local neural text to speech system
piper-phonemize
C++ library for converting text to phonemes for Piper
pocket-casts-android
Pocket Casts Android 🎧
SesameAILabs_csm
A Conversational Speech Generation Model
sherpa-onnx
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift
StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
stylish-tts
High quality text-to-speech based on StyleTTS 2.
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
vits2_pytorch
unofficial vits2-TTS implementation in pytorch