There are 1 repository under visual-grounding topic.
awesome grounding: A curated list of research papers in visual grounding
[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
paper list of robotic grasping and some related works
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests welcomed.
Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"
A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.
Referring Video Object Segmentation / Multi-Object Tracking Repo
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
[ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects
[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
[ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds
Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases
A list of research papers on knowledge-enhanced multimodal learning
Codebase for "Learning to ground medical text in a 3D human atlas (CoNLL 2020)".
Code used to train probing classifiers in the attribute prediction task
Utilizing a transformer-based object detector for the task of 3D visual grounding.
Explore new research topics, visual grounding
Helper tools for extracting and projecting ENet features to ScanNet pointclouds.
TransformerVG - 3D Visual Grounding with Transformers
[EMNLP 22] Extending Phrase Grounding with Pronouns in Visual Dialogues.
Shortened version of the final exam for the Deep Learning course of the University of Trento in 2023.