Chunfeng Wang's starred repositories
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Bert-VITS2
vits2 backbone with multilingual-bert
fish-speech
Brand new TTS solution
HierSpeechpp
The official implementation of HierSpeech++
versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
libriheavy
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
tts-scores
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
Bridge-TTS
Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491).
naturalspeech3_facodec
FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
ChildAugment
Codes for LPC Segmental Warping Perturbations (LPC-SWP) and Formant Energy Bandwidth (FEP-BWP) Perturbations