Show Lab's repositories
Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
MotionDirector
MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
DragAnything
[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation
Awesome-MLLM-Hallucination
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
videollm-online
VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)
sparseformer
(ICLR 2024, CVPR 2024) SparseFormer
Efficient-CLS
[ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video
VisInContext
Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning
Tune-An-Ellipse
[CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want
DynVideo-E
This is the project page for DynVideo-E.
GUI-Narrator
Repository of GUI Action Narrator