dragnDriver's starred repositories
chinese-poetry
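The most comprehensive database of classical Chinese poetry 🧶: nearly 14,000 poets of the Tang and Song dynasties, close to 55,000 Tang poems and 260,000 Song poems, plus 21,050 ci poems by 1,564 ci poets of the Song period.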
fish-speech
Brand new TTS solution
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Bert-VITS2
VITS2 backbone with multilingual-bert
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc.
so-vits-svc-Deployment-Documents
So-VITS-SVC Local Deployment/Training/Inference/Usage Help Document
emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
naturalspeech
A fully working PyTorch implementation of NaturalSpeech (Tan et al., 2022)
project-NN-Pytorch-scripts
see README
XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
CoMoSpeech
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
UnitSpeech
An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"
light-speed
A modified VITS that uses ground-truth phoneme durations for better robustness
Bert-VITS2-Cook-Book
Documentation for Bert-VITS2