GUO HOUJIAN's starred repositories
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
parler-tts
Inference and training library for high-quality TTS models.
HierSpeechpp
The official implementation of HierSpeech++
naturalspeech3_facodec
FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
SpeechCLIP
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
fish-speech
Brand new TTS solution
midi-visualizer
Visualize MIDIs as piano tutorials
emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
DOL-CHS-Chemistry
DOL-CHS-Chemistry:欲都孤儿中文社区民间整合包
miipher2.0
Reimplementation of Miipher
Bert-VITS2
vits2 backbone with multilingual-bert
Degrees-of-Lewdity-Chinese-Localization
Degrees of Lewdity 游戏的授权中文社区本地化版本
ConsistencyVC-voive-conversion
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
multilingual_genshin_speech_from_fandom
Download multilingual speech and text from https://genshin-impact.fandom.com/wiki/Genshin_Impact_Wiki
QuickVC-VoiceConversion
QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion
consistency_models
Official repo for consistency models.