Shanshan Zhao's starred repositories

face_recognition

The world's simplest facial recognition api for Python and the command line

Language:PythonLicense:MITStargazers:52061Issues:1565Issues:1325

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:17214Issues:156Issues:265

MoneyPrinterTurbo

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Language:PythonLicense:MITStargazers:13049Issues:123Issues:297

Waifu2x-Extension-GUI

Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.

Language:C++License:NOASSERTIONStargazers:12233Issues:140Issues:416

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonLicense:Apache-2.0Stargazers:5841Issues:70Issues:213

infinigen

Infinite Photorealistic Worlds using Procedural Generation

Language:PythonLicense:BSD-3-ClauseStargazers:5081Issues:80Issues:201

ComfyUI-Workflows-ZHO

我的 ComfyUI 工作流合集 | My ComfyUI workflows collection

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:2334Issues:41Issues:0

MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Language:PythonLicense:NOASSERTIONStargazers:1854Issues:32Issues:79

DynamiCrafter

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language:PythonLicense:Apache-2.0Stargazers:1803Issues:23Issues:77

Awesome-Video-Diffusion-Models

[Arxiv] A Survey on Video Diffusion Models

dreamtalk

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Language:PythonLicense:MITStargazers:1402Issues:29Issues:44

Awesome-Talking-Face

📖 A curated list of resources dedicated to talking face.

Hotshot-XL

✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL

Language:PythonLicense:Apache-2.0Stargazers:964Issues:13Issues:42

RAD-NeRF

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Language:PythonLicense:MITStargazers:837Issues:30Issues:92

ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Language:PythonLicense:MITStargazers:823Issues:15Issues:144

MotionDirector

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Language:PythonLicense:Apache-2.0Stargazers:717Issues:33Issues:32

animate-anything

Fine-Grained Open Domain Image Animation with Motion Guidance

Language:PythonLicense:MITStargazers:598Issues:15Issues:48

DiffTalk

[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"

MiniGPT4-video

Official code for MiniGPT4-video

Language:PythonLicense:BSD-3-ClauseStargazers:401Issues:9Issues:26

Vlogger

[CVPR2024] Make Your Dream A Vlog

Language:PythonLicense:Apache-2.0Stargazers:378Issues:10Issues:15

CelebV-Text

(CVPR 2023) CelebV-Text: A Large-Scale Facial Text-Video Dataset

DFRF

[ECCV2022] The implementation for "Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis".

Language:PythonLicense:MITStargazers:330Issues:10Issues:37

WorldDreamer

WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens

awesome-video-generation

A collection of awesome video generation studies.

Language:TeXLicense:MITStargazers:103Issues:0Issues:0

T2VScore

T2VScore: Towards A Better Metric for Text-to-Video Generation

ConDaFormer

[NeurIPS'23] ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding

Language:PythonStargazers:4Issues:0Issues:0