ColinSnow's starred repositories
LivePortrait
Bring portraits to life!
MoneyPrinterTurbo
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
tts-generation-webui
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
paper-reading
深度学习经典、新论文逐段精读
Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
metavoice-src
Foundational model for human-like, expressive TTS
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Bert-VITS2
vits2 backbone with multilingual-bert
EasyBertVits2
文章から感情豊かな音声を生成する Bert-VITS2 を簡単に使えます。
fish-speech
Brand new TTS solution