huangxin168's starred repositories
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
PhotoMaker
PhotoMaker
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
clone-voice
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
TikTokDownloader
TikTok 主页/合辑/直播/视频/图集/原声;抖音主页/视频/图集/收藏/直播/原声/合集/评论/账号/搜索/热榜数据采集工具
threestudio
A unified framework for 3D content generation.
fish-speech
Brand new TTS solution
Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
DemoFusion
Let us democratise high-resolution generation! (CVPR 2024)
speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
One-Shot_Free-View_Neural_Talking_Head_Synthesis
Pytorch implementation of paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing"
BakedAvatar
Pytorch Code for "BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis"
gaussian-head
Official repository for 'GaussianHead: High-fidelity Head Avatars with Learnable Gaussian Derivation'
INSTA-pytorch
INSTA - Instant Volumetric Head Avatars [Demo]
Face-Upscalers-ONNX
ONNX-Powered Inference for State-of-the-Art Face Upscalers
Portrait-Talker
Talking head animation