xy-gao's starred repositories

hello-algo

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing

Language:JavaLicense:NOASSERTIONStargazers:90908Issues:513Issues:203

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:30146Issues:190Issues:1003

GPT_API_free

Free ChatGPT API Key,免费ChatGPT API,支持GPT4 API(免费),ChatGPT国内可用免费转发API,直连无需代理。可以搭配ChatBox等软件/插件使用,极大降低接口使用成本。国内即可无限制畅快聊天。

Language:PythonLicense:MITStargazers:19692Issues:106Issues:255

AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:11068Issues:163Issues:239

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonLicense:BSD-3-ClauseStargazers:10274Issues:102Issues:143

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonLicense:Apache-2.0Stargazers:6201Issues:70Issues:234

Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6016Issues:52Issues:141

threestudio

A unified framework for 3D content generation.

Language:PythonLicense:Apache-2.0Stargazers:6009Issues:82Issues:320

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonLicense:Apache-2.0Stargazers:4367Issues:61Issues:176

dreamgaussian

[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation

Language:PythonLicense:MITStargazers:3815Issues:47Issues:144

metavoice-src

Foundational model for human-like, expressive TTS

Language:PythonLicense:Apache-2.0Stargazers:3591Issues:76Issues:120

audio2photoreal

Code and dataset for photorealistic Codec Avatars driven from audio

Language:PythonLicense:NOASSERTIONStargazers:2640Issues:30Issues:52

Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Language:Jupyter NotebookLicense:MITStargazers:2638Issues:33Issues:96

SadTalker-Video-Lip-Sync

本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。

consistent_depth

We estimate dense, flicker-free, geometrically consistent depth from monocular video, for example hand-held cell phone video.

Language:PythonLicense:MITStargazers:1610Issues:57Issues:69

GeneFacePlusPlus

GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code

Language:PythonLicense:MITStargazers:1389Issues:30Issues:205

SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Language:PythonLicense:NOASSERTIONStargazers:1139Issues:63Issues:200

ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Language:PythonLicense:MITStargazers:930Issues:16Issues:152

DINet

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

RAD-NeRF

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Language:PythonLicense:MITStargazers:866Issues:30Issues:94

Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Language:PythonLicense:MITStargazers:825Issues:24Issues:67

VividTalk

VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior

gsgen

[CVPR 2024] Text-to-3D using Gaussian Splatting

Language:PythonLicense:MITStargazers:742Issues:12Issues:41

MVDream

Multi-view Diffusion for 3D Generation

Language:PythonLicense:MITStargazers:741Issues:21Issues:32

LucidDreamer

Official implementation of "LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching"

Language:PythonLicense:MITStargazers:722Issues:23Issues:35

GaussianDreamer

GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)

Language:PythonLicense:Apache-2.0Stargazers:604Issues:15Issues:40

DINet_optimized

An optimized pipeline for DINet reducing inference latency for up to 60% 🚀. Kudos for the authors of the original repo for this amazing work.