Linhui Xiao's starred repositories
Awesome-Visual-Dialog
A curated publication list on visual dialog
CVPR2022-FTCL
[CVPR 2022] Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization
CVPR2023-OWTAL
[CVPR 2023] Cascade Evidential Learning for Open-world Weakly-supervised Temporal Action Localization
ECCV2022-DELU
[ECCV 2022] Dual-Evidential Learning for Weakly-supervised Temporal Action Localization
CVPR2023-CMPAE
[CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception
ICLR2024-REDL
[ICLR 2024 Spotlight] R-EDL: Relaxing Nonessential Settings of Evidential Deep Learning
Awesome-Visual-Grounding
A Survey on Open Visual Grounding
awesome-described-object-detection
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently; pull requests welcome.
MAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
flash-attention
Fast and memory-efficient exact attention
Awesome-Mamba-Papers
Awesome papers related to Mamba.
MultiModalMamba
A novel implementation that fuses ViT with Mamba into a fast, agile, and high-performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.
SceneGraphParser
A Python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations).