Multimedia Computing Group, Nanjing University's repositories
MixFormerV2
[NeurIPS 2023] MixFormerV2: Efficient Fully Transformer Tracking
TemporalPerceiver
[T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
Dynamic-MDETR
[TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
StageInteractor
[ICCV 2023] StageInteractor: Query-based Object Detector with Cross-stage Interaction