Wangzhen's starred repositories
tts-frontend-dataset
TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization
WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
HierSpeechpp
The official implementation of HierSpeech++
Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
fish-speech
Brand new TTS solution
Prosody_Prediction
Predict prosody labels for Chinese sentences.
TTS-TextAnalyzer
TTS Text Analyzer
chinese_speech_pretrain
chinese speech pretrained models
voice-changer
リアルタイムボイスチェンジャー Realtime Voice Changer
Meta-voicebox
Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC