cwt000297

0

followers

following

stars

wtchong's starred repositories

VLM_survey

Collection of AWESOME vision-language models for vision tasks

TencentPretrain

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

Language:PythonNOASSERTION100600

Peekaboo

Interactive Video Generation via Masked-Diffusion

Language:PythonMIT6400

FreestyleNet

[CVPR 2023 Highlight] Freestyle Layout-to-Image Synthesis

Language:PythonMIT13900

PLACE

PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis (CVPR2024 Highlight)

Language:Python2600

TrailBlazer

TrailBlazer: Trajectory Control for Diffusion-Based Video Generation

Language:PythonMIT8700

LVDM

LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation

Language:PythonMIT43000

EchoReel

An innovative method designed to augment the capabilities of existing video diffusion models

Language:Python1900

Tune-A-Video

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Language:PythonApache-2.0417400

MotionDirector

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Language:PythonApache-2.077200

Transformer-in-Computer-Vision

A paper list of some recent Transformer-based CV works.

Video-Motion-Customization

VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models (CVPR 2024)

Language:PythonApache-2.015900

mamba

Mamba SSM architecture

Language:PythonApache-2.01214000

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Language:PythonMIT4278200

Tracking-Anything-with-DEVA

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Language:PythonNOASSERTION119800

paper-gestalt

Deep Paper Gestalt

aot-benchmark

An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch

Language:PythonBSD-3-Clause59500

detr

End-to-End Object Detection with Transformers

Language:PythonApache-2.01324400

Awesome-Visual-Transformer

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

OC_SORT

[CVPR2023] The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.

Language:PythonMIT73700

mvit

Code Release for MViTv2 on Image Recognition.

Language:PythonApache-2.037900