Luozhou Wang's starred repositories
Awesome-Video-Datasets
Video datasets
diffusion-forcing
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Defect_Spectrum
Defect Spectrum: A Granular Look of Large-Scale Defect Datasets with Rich Semantics [ECCV2024]
SEED-Story
SEED-Story: Multimodal Long Story Generation with Large Language Model
FilmRemoval
[CVPR 2024] Official Implementation of Learning to Remove Wrinkled Transparent Film with Polarized Prior
HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
mvdream_diffusers
A unified diffusers implementation for MVDream and ImageDream
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
diffusion-motion-transfer
Official Pytorch Implementation for "Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer""
MotionDirector
MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
LLaMA2-Accessory
An Open-source Toolkit for LLM Development
magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
LucidDreamer
Official implementation of "LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching"
langchain-gpt4free
LangChain x gpt4free
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models