Beast code in Giters

Shanshan Zhao's starred repositories

face_recognition

The world's simplest facial recognition api for Python and the command line

Language: Python | MIT | 52061 1565 1325

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | Apache-2.0 | 17214 156 265

MoneyPrinterTurbo

Generate high-definition short videos with one click using large AI models (LLMs).

Language: Python | MIT | 13049 123 297

Video, Image and GIF upscale/enlarge (Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.

Language: C++ | NOASSERTION | 12233 140 416

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language: Python | Apache-2.0 | 5841 70 213

infinigen

Infinite Photorealistic Worlds using Procedural Generation

Language: Python | BSD-3-Clause | 5081 80 201

ComfyUI-Workflows-ZHO

My ComfyUI workflows collection

GPL-3.0 | 3475 24 7

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language: Python | AGPL-3.0 | 2334 410

MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Language: Python | NOASSERTION | 1854 32 79

DynamiCrafter

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language: Python | Apache-2.0 | 1803 23 77

Awesome-Video-Diffusion-Models

[Arxiv] A Survey on Video Diffusion Models

1403 43 12

dreamtalk

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Language: Python | MIT | 1402 29 44

awesome-talking-head-generation

1185 73 1

Awesome-Talking-Face

📖 A curated list of resources dedicated to talking face.

MIT | 1089 67 4

Hotshot-XL

✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL

Language: Python | Apache-2.0 | 964 13 42

RAD-NeRF

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Language: Python | MIT | 837 30 92

ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Language: Python | MIT | 823 15 144

DragNUWA

MIT | 733 22 27

MotionDirector

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Language: Python | Apache-2.0 | 717 33 32

animate-anything

Fine-Grained Open Domain Image Animation with Motion Guidance

Language: Python | MIT | 598 15 48

DiffTalk

[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"

Language: Python | 411 46 32

MiniGPT4-video

Official code for MiniGPT4-video

Language: Python | BSD-3-Clause | 401 9 26

Vlogger

[CVPR2024] Make Your Dream A Vlog

Language: Python | Apache-2.0 | 378 10 15

CelebV-Text

(CVPR 2023) CelebV-Text: A Large-Scale Facial Text-Video Dataset

Language: Python | 366 13 23

DFRF

[ECCV2022] The implementation for "Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis".

Language: Python | MIT | 330 10 37

WorldDreamer

WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens

MIT | 149 17 3

awesome-video-generation

A collection of awesome video generation studies.

Language: TeX | MIT | 10300

T2VScore

T2VScore: Towards A Better Metric for Text-to-Video Generation

72 7 2

flow-matching

Language: Jupyter Notebook | 44 1 1

ConDaFormer

[NeurIPS'23] ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding

Language: Python | 400

sshan-zhao