spicysama's starred repositories
GPT-SoVITS
As little as 1 minute of voice data is enough to train a good TTS model (few-shot voice cloning).
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
LxgwWenKai
An open-source Chinese font derived from Fontworks' Klee One.
Bert-VITS2
vits2 backbone with multilingual-bert
fish-speech
Brand new TTS solution
VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
TensorFlowTTS
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
emotional-vits
An emotion-controllable speech synthesis model based on VITS that requires no emotion annotations.
Style-Bert-VITS2
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.
text-labeler
A simple svs labeling tool