Multimedia Computing Group, Nanjing University (MCG-NJU)

Location:Nanjing

Home Page:mcg.nju.edu.cn

Multimedia Computing Group, Nanjing University's repositories

VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language:Python | License:NOASSERTION | Stargazers:1186 | Issues:17 | Issues:110

MixFormer

[CVPR 2022 Oral & TPAMI 2024] MixFormer: End-to-End Tracking with Iterative Mixed Attention

Language:Python | License:MIT | Stargazers:414 | Issues:7 | Issues:104

TDN

[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition

Language:Python | License:Apache-2.0 | Stargazers:362 | Issues:10 | Issues:71

EMA-VFI

[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation

Language:Python | License:Apache-2.0 | Stargazers:290 | Issues:2 | Issues:24

SparseBEV

[ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos

Language:Python | License:MIT | Stargazers:252 | Issues:9 | Issues:62

CamLiFlow

[CVPR 2022 Oral & TPAMI 2023] Learning Optical Flow and Scene Flow with Bidirectional Camera-LiDAR Fusion

MixFormerV2

[NeurIPS 2023] MixFormerV2: Efficient Fully Transformer Tracking

Language:Python | License:MIT | Stargazers:113 | Issues:10 | Issues:30

MeMOTR

[ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking

Language:Python | License:MIT | Stargazers:112 | Issues:5 | Issues:13

SportsMOT

[ICCV 2023] SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes

MultiSports

[ICCV 2021] MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions

Language:Python | License:NOASSERTION | Stargazers:95 | Issues:7 | Issues:27

MMN

[AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding

Language:Python | License:MIT | Stargazers:86 | Issues:6 | Issues:7

LinK

[CVPR 2023] LinK: Linear Kernel for LiDAR-based 3D Perception

MixSort

[ICCV 2023] MixSort: The Customized Tracker in SportsMOT

Language:Python | License:MIT | Stargazers:56 | Issues:3 | Issues:7

BasicTAD

BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection

Language:Python | License:Apache-2.0 | Stargazers:46 | Issues:3 | Issues:15

DDM

[CVPR 2022] Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection

Language:Python | License:MIT | Stargazers:46 | Issues:2 | Issues:10

STMixer

[CVPR 2023] STMixer: A One-Stage Sparse Action Detector

VideoMAE-Action-Detection

[NeurIPS 2022 Spotlight] VideoMAE for Action Detection

Language:Python | License:NOASSERTION | Stargazers:37 | Issues:2 | Issues:5

PointTAD

[NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points

Language:Python | License:Apache-2.0 | Stargazers:35 | Issues:4 | Issues:4

TemporalPerceiver

[TPAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection

Language:Python | License:Apache-2.0 | Stargazers:31 | Issues:1 | Issues:1

CoMAE

[AAAI 2023] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets

PDPP

[CVPR 2023 Highlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos

DEQDet

[ICCV 2023] Deep Equilibrium Object Detection

Language:Jupyter Notebook | Stargazers:19 | Issues:3 | Issues:2

EVAD

[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement

Language:Python | License:NOASSERTION | Stargazers:17 | Issues:2 | Issues:3

MGMAE

[ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding

Language:Python | License:MIT | Stargazers:15 | Issues:2 | Issues:2

MOTIP

Multiple Object Tracking as ID Prediction

Language:Python | Stargazers:10 | Issues:0 | Issues:0

APP-Net

[TIP] APP-Net: Auxiliary-point-based Push and Pull Operations for Efficient Point Cloud Recognition

Language:Python | Stargazers:9 | Issues:2 | Issues:0

StageInteractor

[ICCV 2023] StageInteractor: Query-based Object Detector with Cross-stage Interaction

Language:Python | License:Apache-2.0 | Stargazers:9 | Issues:2 | Issues:0

VLG

VLG: General Video Recognition with Web Textual Knowledge (https://arxiv.org/abs/2212.01638)

Language:Python | Stargazers:8 | Issues:1 | Issues:0
Language:Python | Stargazers:6 | Issues:1 | Issues:0

DGN

[IJCV 2023] Dual Graph Networks for Pose Estimation in Crowded Scenes

Language:Python | Stargazers:6 | Issues:2 | Issues:0