Multimedia Computing Group, Nanjing University (MCG-NJU)

Multimedia Computing Group, Nanjing University

MCG-NJU

Geek Repo

Location:Nanjing

Home Page:mcg.nju.edu.cn

Github PK Tool:Github PK Tool

Multimedia Computing Group, Nanjing University's repositories

VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language:PythonLicense:NOASSERTIONStargazers:1339Issues:16Issues:120

MixFormer

[CVPR 2022 Oral & TPAMI 2024] MixFormer: End-to-End Tracking with Iterative Mixed Attention

Language:PythonLicense:MITStargazers:448Issues:7Issues:107

SparseBEV

[ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos

Language:PythonLicense:MITStargazers:339Issues:9Issues:82

SparseOcc

[ECCV 2024] Fully Sparse 3D Occupancy Prediction & RayIoU Evaluation Metric

Language:PythonLicense:Apache-2.0Stargazers:236Issues:6Issues:47

CamLiFlow

[CVPR 2022 Oral & TPAMI 2023] Learning Optical Flow and Scene Flow with Bidirectional Camera-LiDAR Fusion

MeMOTR

[ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking

Language:PythonLicense:MITStargazers:144Issues:5Issues:20

MixFormerV2

[NeurIPS 2023] MixFormerV2: Efficient Fully Transformer Tracking

Language:PythonLicense:MITStargazers:137Issues:10Issues:40

MOTIP

Multiple Object Tracking as ID Prediction

Language:PythonLicense:Apache-2.0Stargazers:88Issues:6Issues:29

MixSort

[ICCV2023] MixSort: The Customized Tracker in SportsMOT

Language:PythonLicense:MITStargazers:71Issues:5Issues:11

SGM-VFI

[CVPR 2024] Sparse Global Matching for Video Frame Interpolation with Large Motion

BIVDiff

[CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models

Language:PythonLicense:Apache-2.0Stargazers:52Issues:2Issues:4

PointTAD

[NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points

Language:PythonLicense:Apache-2.0Stargazers:38Issues:4Issues:6

CoMAE

[AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets

TemporalPerceiver

[T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection

Language:PythonLicense:Apache-2.0Stargazers:34Issues:1Issues:1

VFIMamba

VFIMamba: Video Frame Interpolation with State Space Models

Language:PythonLicense:Apache-2.0Stargazers:34Issues:1Issues:0

PDPP

[CVPR 2023 Hightlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos

EVAD

[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement

Language:PythonLicense:NOASSERTIONStargazers:26Issues:2Issues:4

DEQDet

[ICCV 2023] Deep Equilibrium Object Detection

Language:Jupyter NotebookStargazers:21Issues:3Issues:2

MGMAE

[ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding

Language:PythonLicense:MITStargazers:20Issues:2Issues:2

Dynamic-MDETR

[TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding

Language:PythonLicense:Apache-2.0Stargazers:14Issues:0Issues:0

SPLAM

[ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model

Language:PythonLicense:MITStargazers:14Issues:2Issues:1

AMD

[CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models

SportsHHI

[CVPR 2024] SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos

Language:PythonStargazers:11Issues:0Issues:0

StageInteractor

[ICCV 2023] StageInteractor: Query-based Object Detector with Cross-stage Interaction

Language:PythonLicense:Apache-2.0Stargazers:9Issues:2Issues:0

VLG

VLG: General Video Recognition with Web Textual Knowledge (https://arxiv.org/abs/2212.01638)

Language:PythonStargazers:8Issues:1Issues:0

DGN

[IJCV 2023] Dual Graph Networks for Pose Estimation in Crowded Scenes

Language:PythonStargazers:7Issues:2Issues:0

ViT-TAD

[CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos

Language:PythonStargazers:7Issues:0Issues:0

PRVG

[CVIU 2024] End-to-end dense video grounding via parallel regression

Language:PythonLicense:Apache-2.0Stargazers:6Issues:0Issues:0

VideoEval

VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model

Language:PythonStargazers:6Issues:0Issues:0

LogN

[IJCV 2024] Logit Normalization for Long-Tail Object Detection

Language:PythonLicense:Apache-2.0Stargazers:4Issues:1Issues:0