AndyTang15's starred repositories
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
DynamicViT
[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
Bringing-Old-Films-Back-to-Life
Bringing Old Films Back to Life (CVPR 2022)
segment-caption-anything
[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloading the trained model checkpoints, and example notebooks / gradio demo that show how to use the model.
transductive-vos.pytorch
a transductive approach for video object segmentation
bnv_fusion
This repository implements our CVPR2022 paper "BNV-Fusion: Dense 3D Reconstruction using Bi-level Neural Volume Fusion"
ManiGaussian
ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation
FineDiving
FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment
OrdinalCLIP
[NeurIPS 2022] OrdinalCLIP: Learning Rank Prompts for Language-Guided Ordinal Regression
AoT_Dataset
CVPR18: Learning and Using the Arrow of Time