Shawn J.'s starred repositories
ShareGPT4Video
An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
Awesome-Parameter-Efficient-Transfer-Learning
Collection of awesome parameter-efficient fine-tuning resources.
VideoMamba
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
all-in-one
[CVPR2023] All in One: Exploring Unified Video-Language Pre-training
Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
Ant-Multi-Modal-Framework
Research Code for Multimodal-Cognition Team in Ant Group
awesome-video-text-retrieval
A curated list of deep learning resources for video-text retrieval.
CLIP_benchmark
CLIP-like model evaluation