wtchong (cwt000297)

wtchong's starred repositories

VLM_survey

Collection of AWESOME vision-language models for vision tasks

Stargazers: 2100 | Issues: 0

TencentPretrain

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

Language: Python | License: NOASSERTION | Stargazers: 1006 | Issues: 0

Peekaboo

Interactive Video Generation via Masked-Diffusion

Language: Python | License: MIT | Stargazers: 64 | Issues: 0

FreestyleNet

[CVPR 2023 Highlight] Freestyle Layout-to-Image Synthesis

Language: Python | License: MIT | Stargazers: 139 | Issues: 0

PLACE

PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis (CVPR2024 Highlight)

Language: Python | Stargazers: 26 | Issues: 0

TrailBlazer

TrailBlazer: Trajectory Control for Diffusion-Based Video Generation

Language: Python | License: MIT | Stargazers: 87 | Issues: 0

LVDM

LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation

Language: Python | License: MIT | Stargazers: 430 | Issues: 0

EchoReel

An innovative method designed to augment the capabilities of existing video diffusion models

Language: Python | Stargazers: 19 | Issues: 0

Tune-A-Video

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Language: Python | License: Apache-2.0 | Stargazers: 4174 | Issues: 0

MotionDirector

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Language: Python | License: Apache-2.0 | Stargazers: 772 | Issues: 0

Transformer-in-Computer-Vision

A paper list of some recent Transformer-based CV works.

Stargazers: 1038 | Issues: 0

Video-Motion-Customization

VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models (CVPR 2024)

Language: Python | License: Apache-2.0 | Stargazers: 159 | Issues: 0

mamba

Mamba SSM architecture

Language: Python | License: Apache-2.0 | Stargazers: 12140 | Issues: 0

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Language: Python | License: MIT | Stargazers: 42782 | Issues: 0

Tracking-Anything-with-DEVA

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Language: Python | License: NOASSERTION | Stargazers: 1198 | Issues: 0

paper-gestalt

Deep Paper Gestalt

Stargazers: 438 | Issues: 0

aot-benchmark

An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch

Language: Python | License: BSD-3-Clause | Stargazers: 595 | Issues: 0

detr

End-to-End Object Detection with Transformers

Language: Python | License: Apache-2.0 | Stargazers: 13244 | Issues: 0

Awesome-Visual-Transformer

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

Stargazers: 3324 | Issues: 0

OC_SORT

[CVPR2023] The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.

Language: Python | License: MIT | Stargazers: 737 | Issues: 0

mvit

Code Release for MViTv2 on Image Recognition.

Language: Python | License: Apache-2.0 | Stargazers: 379 | Issues: 0