Kevin Wang's repositories
Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
gpt-sovits
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Retrieval-based-Voice-Conversion-New
Voice data <= 10 mins can also be used to train a good VC model!
AICoverGen
A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.
emotion2vec
Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
fish-speech
Brand new TTS solution
VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Advanced-RVC-Inference
Advanced RVC Inference for quicker and effortless model downloads
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
HairFastGAN
Official Implementation for "HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach"
KevinWang676
My profile
KevinWang676.github.io
Personal website
MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
RWKV-Runner
A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.
test-repo
RVC Inference with multiple model and huggingface support
ttts
Train the next generation of TTS systems.