Shanshan Zhao's starred repositories
face_recognition
The world's simplest facial recognition api for Python and the command line
MoneyPrinterTurbo
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
Waifu2x-Extension-GUI
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
ComfyUI-Workflows-ZHO
我的 ComfyUI 工作流合集 | My ComfyUI workflows collection
PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Awesome-Video-Diffusion-Models
[Arxiv] A Survey on Video Diffusion Models
Awesome-Talking-Face
📖 A curated list of resources dedicated to talking face.
Hotshot-XL
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
MotionDirector
MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
MiniGPT4-video
Official code for MiniGPT4-video
CelebV-Text
(CVPR 2023) CelebV-Text: A Large-Scale Facial Text-Video Dataset
WorldDreamer
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
awesome-video-generation
A collection of awesome video generation studies.
ConDaFormer
[NeurIPS'23] ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding