Linhui Xiao (linhuixiao)



Company: UCAS, Chinese Academy of Sciences

Location: Beijing, China

Home Page: https://scholar.google.com/citations?hl=zh-CN&user=4rTE4ogAAAAJ

Twitter: @xiao_linhui


Linhui Xiao's starred repositories

DynRefer

DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution

Language: Python | License: Apache-2.0 | Stargazers: 31 | Issues: 0

Books

My book list

License: MIT | Stargazers: 449 | Issues: 0

Awesome-Visual-Dialog

A curated publication list on visual dialog

Stargazers: 11 | Issues: 0

CVPR2022-FTCL

[CVPR 2022] Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization

Language: Python | License: MIT | Stargazers: 46 | Issues: 0

CVPR2023-OWTAL

[CVPR 2023] Cascade Evidential Learning for Open-world Weakly-supervised Temporal Action Localization

Language: Python | License: MIT | Stargazers: 8 | Issues: 0

ECCV2022-DELU

[ECCV 2022] Dual-Evidential Learning for Weakly-supervised Temporal Action Localization

Language: Python | License: MIT | Stargazers: 38 | Issues: 0

CVPR2023-CMPAE

[CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception

Language: Python | License: MIT | Stargazers: 33 | Issues: 0

ICLR2024-REDL

[ICLR 2024 Spotlight] R-EDL: Relaxing Nonessential Settings of Evidential Deep Learning

Language: Python | License: MIT | Stargazers: 30 | Issues: 0

Awesome-Visual-Grounding

A Survey on Open Visual Grounding

License: Apache-2.0 | Stargazers: 2 | Issues: 0

awesome-described-object-detection

A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests welcomed.

Stargazers: 160 | Issues: 0

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stargazers: 54752 | Issues: 0

Libra

Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)

Language: Python | License: Apache-2.0 | Stargazers: 38 | Issues: 0

HiVG

Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.

Language: Python | License: Apache-2.0 | Stargazers: 26 | Issues: 0

COMM

PyTorch code for the paper "From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models"

License: MIT | Stargazers: 180 | Issues: 0

MAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

Language: Python | Stargazers: 2571 | Issues: 0

FlagAI

FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale models.

Language: Python | License: Apache-2.0 | Stargazers: 3814 | Issues: 0

s4

Structured state space sequence models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2289 | Issues: 0

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 12661 | Issues: 0
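
A minimal usage sketch, assuming the flash_attn package from this repository is installed on a CUDA machine; it uses the flash_attn_func entry point, which operates on half-precision tensors shaped (batch, seqlen, nheads, headdim). The shapes and settings below are illustrative assumptions, not part of the original listing.

# Minimal sketch: fused exact attention via flash_attn (assumes a CUDA GPU
# and the flash_attn package from this repository).
import torch
from flash_attn import flash_attn_func

batch, seqlen, nheads, headdim = 2, 1024, 8, 64
q = torch.randn(batch, seqlen, nheads, headdim, dtype=torch.float16, device="cuda")
k = torch.randn_like(q)
v = torch.randn_like(q)

# Exact attention computed without materializing the seqlen x seqlen score matrix.
out = flash_attn_func(q, k, v, dropout_p=0.0, causal=True)
print(out.shape)  # torch.Size([2, 1024, 8, 64])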

mamba

Mamba SSM architecture

Language: Python | License: Apache-2.0 | Stargazers: 11963 | Issues: 0
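
As a minimal sketch of what the architecture looks like in code, the snippet below instantiates a single Mamba block following the repository README; the mamba_ssm package, CUDA availability, and the hyperparameter values shown are assumptions of this example.

# Minimal sketch: one selective state-space (Mamba) block applied to a
# (batch, length, dim) sequence. Assumes the mamba_ssm package is installed.
import torch
from mamba_ssm import Mamba

batch, length, dim = 2, 64, 256
x = torch.randn(batch, length, dim, device="cuda")

block = Mamba(
    d_model=dim,  # model dimension
    d_state=16,   # SSM state expansion factor
    d_conv=4,     # local convolution width
    expand=2,     # block expansion factor
).to("cuda")

y = block(x)               # output keeps the (batch, length, d_model) shape
assert y.shape == x.shape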

Awesome-Mamba-Papers

Awesome Papers related to Mamba.

Stargazers: 1005 | Issues: 0

MultiModalMamba

A novel implementation fusing ViT with Mamba into a fast, agile, and high-performance multi-modal model. Powered by Zeta, the simplest AI framework ever.

Language: Python | License: MIT | Stargazers: 422 | Issues: 0

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language: Python | License: Apache-2.0 | Stargazers: 2673 | Issues: 0

r-mae

PyTorch implementation of R-MAE: https://arxiv.org/abs/2306.05411

Language: Python | License: NOASSERTION | Stargazers: 106 | Issues: 0

DataOptim

A collection of visual instruction tuning datasets.

Language: Python | License: MIT | Stargazers: 73 | Issues: 0

NExT-Chat

The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".

Language: Python | License: Apache-2.0 | Stargazers: 191 | Issues: 0

SCLIP

Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference

Language: Python | Stargazers: 102 | Issues: 0

SceneGraphParser

A python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations).

Language: Python | License: MIT | Stargazers: 523 | Issues: 0
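
To illustrate the kind of output this toolkit produces, here is a minimal sketch assuming the package is importable as sng_parser with a spaCy English model installed; the 'entities'/'relations' dictionary layout follows the project's documentation, and the caption is an illustrative assumption.

# Minimal sketch: parse one caption into a symbolic scene graph.
import sng_parser

caption = "A woman is playing the piano in the room."
graph = sng_parser.parse(caption)

# The result is a plain dict: 'entities' are noun phrases with modifiers,
# 'relations' are subject-relation-object triples indexing into 'entities'.
for rel in graph["relations"]:
    subj = graph["entities"][rel["subject"]]["head"]
    obj = graph["entities"][rel["object"]]["head"]
    print(subj, "--", rel["relation"], "->", obj)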