Beast code in Giters

huangxin168's starred repositories

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | 32097 274 1063

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | 29407 188 966

OpenVoice

Instant voice cloning by MyShell.

Language: Python | License: MIT | 27362 206 206

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language: Python | License: BSD-3-Clause | 10177 102 143

PhotoMaker

Language: Jupyter Notebook | License: NOASSERTION | 8691 97 125

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language: Python | License: Apache-2.0 | 6876 57 143

clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio

Language: Python | License: NOASSERTION | 6806 37 121

TikTokDownloader

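Data collection tool for TikTok profiles/collections/livestreams/videos/image sets/original sounds, and for Douyin profiles/videos/image sets/favorites/livestreams/original sounds/collections/comments/accounts/search/trending lists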

Language: Python | License: GPL-3.0 | 6679 46 226

threestudio

A unified framework for 3D content generation.

Language: Python | License: Apache-2.0 | 5958 82 314

fish-speech

Brand new TTS solution

Language: Python | License: NOASSERTION | 5917 54 248

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python | License: MIT | 4461 76 182

AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

Language: Python | License: MIT | 3839 86 94

Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Language: Python | License: Apache-2.0 | 2951 36 143

TTS

Language: Java | 2633 41 105

DemoFusion

Let us democratise high-resolution generation! (CVPR 2024)

Language: Jupyter Notebook | 1938 34 42

dreamtalk

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Language: Python | License: MIT | 1492 30 46

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language: Python | License: MIT | 870 23 32

One-Shot_Free-View_Neural_Talking_Head_Synthesis

Pytorch implementation of paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing"

Language: Python | License: NOASSERTION | 737 26 78

INSTA

INSTA - Instant Volumetric Head Avatars [CVPR2023]

Language: C | License: NOASSERTION | 420 20 42

MOSNet

Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

Language: Python | License: NOASSERTION | 318 10 10

BakedAvatar

Pytorch Code for "BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis"

Language: Python | License: MIT | 286 15 15

gaussian-head

Official repository for 'GaussianHead: High-fidelity Head Avatars with Learnable Gaussian Derivation'

Language: Python | License: MIT | 242 22 25

SpeechMOS

Easy-to-Use Speech MOS predictors

Language: Python | License: MIT | 182 7 13

INSTA-pytorch

INSTA - Instant Volumetric Head Avatars [Demo]

Language: C | License: NOASSERTION | 128 5 17

havatar

[TOG 2023] HAvatar: High-fidelity Head Avatar via Facial Model Conditioned Neural Radiance Field

Language: Python | 116 11 10

AvatarMAV

A PyTorch implementation of "AvatarMAV: Fast 3D Head Avatar Reconstruction Using Motion-Aware Neural Voxels"

Language: Python | License: MIT | 92 15 9

Face-Upscalers-ONNX

ONNX-Powered Inference for State-of-the-Art Face Upscalers

Language: Python | License: NOASSERTION | 71 5 8

LipFD

This repository contains the codes of "Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-syncing DeepFakes".

Language: Python | 61 3 7

Portrait-Talker

Talking head animation

Language: Python | 28 6 1

NPVA

The official implementation of "Neural Point-based Volumetric Avatar: Surface-guided Neural Points for Efficient and Photorealistic Volumetric Head Avatar". SIGGRAPH ASIA 2023

Language: Python | 15 50