Zeyuan Chen's starred repositories
single-video-curation-svd
Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper.
MagicDance
[ICML 2024] MagicPose (also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
instaloader
Download pictures (or videos) along with their captions and other metadata from Instagram.
PySceneDetect
🎥 Python and OpenCV-based scene cut/transition detection program & library.
visualwebarena
VisualWebArena is a benchmark for multimodal agents.
Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
HD-VG-130M
The HD-VG-130M Dataset
DynamiCrafter
[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
opencompass
OpenCompass is an LLM evaluation platform supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
VLMEvalKit
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting GPT-4v, Gemini, QwenVLPlus, 50+ HF models, and 20+ benchmarks
motionshop
Project page for replacing the human motion in a video with a virtual 3D human
Awesome-Video-Datasets
Video datasets
instagrapi
🔥 The fastest and most powerful Python library for Instagram Private API 2024
DeepLabCut
Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans