huangxin168

huangxin168

Geek Repo

Github PK Tool:Github PK Tool

huangxin168's starred repositories

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:32097Issues:274Issues:1063

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:29407Issues:188Issues:966

OpenVoice

Instant voice cloning by MyShell.

Language:PythonLicense:MITStargazers:27362Issues:206Issues:206

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonLicense:BSD-3-ClauseStargazers:10177Issues:102Issues:143

PhotoMaker

PhotoMaker

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:8691Issues:97Issues:125

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonLicense:Apache-2.0Stargazers:6876Issues:57Issues:143

clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频

Language:PythonLicense:NOASSERTIONStargazers:6806Issues:37Issues:121

TikTokDownloader

TikTok 主页/合辑/直播/视频/图集/原声;抖音主页/视频/图集/收藏/直播/原声/合集/评论/账号/搜索/热榜数据采集工具

Language:PythonLicense:GPL-3.0Stargazers:6679Issues:46Issues:226

threestudio

A unified framework for 3D content generation.

Language:PythonLicense:Apache-2.0Stargazers:5958Issues:82Issues:314

fish-speech

Brand new TTS solution

Language:PythonLicense:NOASSERTIONStargazers:5917Issues:54Issues:248

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language:PythonLicense:MITStargazers:4461Issues:76Issues:182

AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

Language:PythonLicense:MITStargazers:3839Issues:86Issues:94

Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Language:PythonLicense:Apache-2.0Stargazers:2951Issues:36Issues:143

DemoFusion

Let us democratise high-resolution generation! (CVPR 2024)

Language:Jupyter NotebookStargazers:1938Issues:34Issues:42

dreamtalk

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Language:PythonLicense:MITStargazers:1492Issues:30Issues:46

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language:PythonLicense:MITStargazers:870Issues:23Issues:32

One-Shot_Free-View_Neural_Talking_Head_Synthesis

Pytorch implementation of paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing"

Language:PythonLicense:NOASSERTIONStargazers:737Issues:26Issues:78

INSTA

INSTA - Instant Volumetric Head Avatars [CVPR2023]

Language:CLicense:NOASSERTIONStargazers:420Issues:20Issues:42

MOSNet

Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

Language:PythonLicense:NOASSERTIONStargazers:318Issues:10Issues:10

BakedAvatar

Pytorch Code for "BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis"

Language:PythonLicense:MITStargazers:286Issues:15Issues:15

gaussian-head

Official repository for 'GaussianHead: High-fidelity Head Avatars with Learnable Gaussian Derivation'

Language:PythonLicense:MITStargazers:242Issues:22Issues:25

SpeechMOS

Easy-to-Use Speech MOS predictors

Language:PythonLicense:MITStargazers:182Issues:7Issues:13

INSTA-pytorch

INSTA - Instant Volumetric Head Avatars [Demo]

Language:CLicense:NOASSERTIONStargazers:128Issues:5Issues:17

havatar

[TOG 2023] HAvatar: High-fidelity Head Avatar via Facial Model ConditionedNeural Radiance Field

AvatarMAV

A PyTorch implementation of "AvatarMAV: Fast 3D Head Avatar Reconstruction Using Motion-Aware Neural Voxels"

Language:PythonLicense:MITStargazers:92Issues:15Issues:9

Face-Upscalers-ONNX

ONNX-Powered Inference for State-of-the-Art Face Upscalers

Language:PythonLicense:NOASSERTIONStargazers:71Issues:5Issues:8

LipFD

This repository contains the codes of "Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-syncing DeepFakes".

Portrait-Talker

Talking head animation

NPVA

The official implementation of "Neural Point-based Volumetric Avatar: Surface-guided Neural Points for Efficient and Photorealistic Volumetric Head Avatar". SIGGRAPH ASIA 2023

Language:PythonStargazers:15Issues:5Issues:0