jasonwongw's starred repositories
facefusion
Next generation face swapper and enhancer
Emote-hack
Emote Portrait Alive - using ai to reverse engineer code from white paper. (abandoned)
whisper-vits-svc
Core Engine of Singing Voice Conversion & Singing Voice Clone
avatar_ernerf
Just a suturing monster project.
AI-Song-Cover-RVC
All in One Version : Youtube WAV Download, Separating Vocal, Splitting Audio, Training, and Inference Using Google Colab
DynamiCrafter
[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
AI-Vtuber
AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊天。它使用TTS技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声;指令协同SD画图。
wav2lip-576x576
This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital human videos.
ColossalAI
Making large AI models cheaper, faster and more accessible
vits_chinese
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
fay-android
app会常驻手机后台,你可以随时随地保持与Fay数字人的沟通。
python_rtmpstream
python库,实现推送实时rtmp音视频流
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time