wtchong's starred repositories
VLM_survey
Collection of AWESOME vision-language models for vision tasks
TencentPretrain
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
FreestyleNet
[CVPR 2023 Highlight] Freestyle Layout-to-Image Synthesis
TrailBlazer
TrailBlazer: Trajectory Control for Diffusion-Based Video Generation
Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
MotionDirector
MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
Transformer-in-Computer-Vision
A paper list of some recent Transformer-based CV works.
Video-Motion-Customization
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models (CVPR 2024)
Tracking-Anything-with-DEVA
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
paper-gestalt
Deep Paper Gestalt
aot-benchmark
An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch
Awesome-Visual-Transformer
A collection of papers on transformers for vision. Awesome Transformer with Computer Vision (CV)