linhuixiao / awesome-open-vocabulary-object-detection

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Awesome-Open-Vocabulary-Object-DetectionAwesome

- Recent papers (from 2021)

Keywords

img.: image   | vid.: video   | 3d.: 3D   | obj.: object detection   | sem.: semantic segmentation   | ins.: instance segmentation   | pan.: panoptic segmentation


2022

  • [NeurIPS] GLIPv2: Unifying Localization and Vision-Language Understanding. [pytorch] [img., obj.]
  • [NeurIPS] Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection. [pytorch] [img., obj.]
  • [ECCV] Open Vocabulary Object Detection with Pseudo Bounding-Box Labels. [pytorch] [img., obj.]
  • [ECCV] Exploiting Unlabeled Data with Vision and Language Models for Object Detection. [pytorch] [img., obj.]
  • [ECCV] Simple Open-Vocabulary Object Detection with Vision Transformers. [jax] [img., obj.]
  • [ECCV] Open-Vocabulary DETR with Conditional Matching. [pytorch] [img., obj.]
  • [ECCV] PromptDet: Towards Open-Vocabulary Detection Using Uncurated Images. [pytorch] [img., obj.]
  • [ECCV] A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-Language Model. [pytorch] [img., sem.]
  • [ECCV] Scaling Open-Vocabulary Image Segmentation with Image-Level Labels. [img., sem.]
  • [CVPR] Learning To Prompt for Open-Vocabulary Object Detection With Vision-Language Model. [pytorch] [img., obj.]
  • [CVPR] Grounded Language-Image Pre-training. [pytorch] [img., obj.]
  • [CVPR] Open-Vocabulary One-Stage Detection With Hierarchical Visual-Language Knowledge Distillation. [pytorch] [img., obj.]
  • [CVPR] RegionCLIP: Region-Based Language-Image Pretraining. [pytorch] [img., obj.]
  • [CVPR] Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling. [pytorch] [img., ins.]
  • [ACMM] Rethinking Open-World Object Detection in Autonomous Driving Scenarios. [img., obj.]
  • [GCPR] Localized Vision-Language Matching for Open-vocabulary Object Detection. [pytorch] [img., obj.]
  • [TPAMI] Learning to Overcome Noise in Weak Caption Supervision for Object Detection. [img., obj.]
  • [Arxiv] P3OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection. [img., obj.]
  • [Arxiv] F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models. [img., obj.]
  • [Arxiv] Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization. [pytorch] [img., obj.]
  • [Arxiv] Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models. [pytorch] [img., obj.]
  • [Arxiv] Learning Object-Language Alignments for Open-Vocabulary Object Detection. [pytorch] [img., obj.]
  • [Arxiv] Open-Vocabulary Panoptic Segmentation with MaskCLIP. [img., pan.]
  • [Arxiv] Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP. [pytorch][img., sem.]
  • [Arxiv] Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning. [img., sem.]
  • [Arxiv] Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation.[pytorch] [img., ins.]
  • [Arxiv] Open-Vocabulary 3D Detection via Image-level Class and Debiased Cross-modal Contrastive Learning. [3d., obj.]

2021

  • [ICLR] Open-vocabulary Object Detection via Vision and Language Knowledge Distillation. [pytorch] [img., obj.]
  • [CVPR] Open-Vocabulary Object Detection Using Captions. [pytorch] [img., obj.]

About