open-vocabulary-segmentation

There are 20 repositories under the open-vocabulary-segmentation topic.

IDEA-Research / Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
3d-whole-body-pose-estimation automatic-labeling-system caption data-generation image-editing open-vocabulary-detection open-vocabulary-segmentation speech
Language:Jupyter Notebook 16910
roboflow / notebooks
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM 2, Florence-2, PaliGemma 2, and Qwen2.5VL.
automatic-labeling-system computer-vision deep-learning deep-neural-networks google-colab image-classification image-segmentation machine-learning object-detection open-vocabulary-detection open-vocabulary-segmentation paligemma pytorch qwen tutorial vlm yolov5 yolov8 zero-shot-classification zero-shot-detection
Language:Jupyter Notebook 8403
roboflow / awesome-openai-vision-api-experiments
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
chatgpt computer-vision openai classification clip zero-shot grounding-dino open-vocabulary-detection open-vocabulary-segmentation segment-anything
Language:Python 1685
hkchengrex / Tracking-Anything-with-DEVA
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
deep-learning object-tracking open-vocabulary-segmentation video-editing video-object-segmentation video-segmentation open-vocabulary-video-segmentation open-world-video-segmentation iccv2023
Language:Python 1427
FoundationVision / GLEE
[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
foundation-model object-detection open-world tracking open-vocabulary-detection open-vocabulary-segmentation open-vocabulary-video-segmentation referring-expression-comprehension referring-expression-segmentation video-instance-segmentation video-object-segmentation zero-shot-object-detection referring-video-object-segmentation interactive-segmentation segment-anything
Language:Python 1152
NVlabs / ODISE
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
deep-learning instance-segmentation panoptic-segmentation pytorch semantic-segmentation diffusion-models text-image-retrieval zero-shot-learning open-vocabulary open-vocabulary-segmentation open-world-classification open-world-object-detection open-vocabulary-semantic-segmentation
Language:Python 926
SkalskiP / awesome-foundation-and-multimodal-models
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
blip clip computer-vision foundational-models grounding-dino image-captioning llava multimodal nlp open-vocabulary-detection open-vocabulary-segmentation segment-anything zero-shot-detection
Language:Python 634
segments-ai / panoptic-segment-anything
Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
open-vocabulary-detection open-vocabulary-segmentation segmentation
Language:Jupyter Notebook 404
wanghao9610 / OV-DINO
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
fundation-models object-detection open-vocabulary-detection open-vocabulary-segmentation open-world ov-dino zero-shot-object-detection
Language:Python 367
Kunhao-Liu / 3D-OVS
[NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation
3d nerf open-vocabulary-segmentation
Language:Python 118
hustvl / MaskAdapter
[CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"
clip open-vocabulary open-vocabulary-segmentation open-vocabulary-semantic-segmentation segment-anything segmentation vision-language-model zero-shot zero-shot-segmentation
Language:Python 83
CVRP-SOLE / SOLE
[ICLR 2025] Official code of "Segment any 3D Object with Language"
3d-instance-segmentation open-vocabulary-segmentation replica scannet scannet200 scene-understanding segment-anything segment-anything-model
Language:Python 51
chenxi52 / FrozenSeg
Open-Vocabulary Panoptic Segmentation
clip open-vocabulary-segmentation open-vocabulary-semantic-segmentation panoptic-segmentation segment-anything segmentation instance-segmentation multi-modal-learning open-vocabulary vision-and-language zero-shot
Language:Python 23
clownrat6 / OpenVIS
Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.
open-vocabulary-segmentation open-vocabulary-video-segmentation video-instance-segmentation
Language:Python 22
lartpang / OVCamo
(ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation
camouflage-detection camouflage-images camouflaged-object-detection camouflaged-target-detection open-vocabulary open-vocabulary-detection open-vocabulary-segmentation
Language:Python 22
lorebianchi98 / Talk2DINO
Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation"
clip computer-vision dinov2 open-vocabulary-segmentation unsupervised-open-vocabulary-segmentation
Language:Python 22
HVision-NKU / MaskCLIPpp
Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"
open-vocabulary-segmentation vision-language-model clip image-segmentation
Language:Python 21
PhucNDA / Open3DSceneUnderstanding
[ICCVW23] VinAI-3DIS Metadata repo of OpenSUN3D
3d-understanding instance-segmentation open-vocabulary-segmentation
Language:Jupyter Notebook 4
katsunori-waragai / zed-gsam
grounded-segment-anything with ZED SDK
open-vocabulary-segmentation segment-anything segmentation zed-camera
Language:Python 0
macorisd / open-object-classification-TLS
A computer vision pipeline in Python for open-vocabulary object detection and segmentation in images, without relying on a predefined, limited set of categories such as those in datasets like COCO.
computer-vison open-vocabulary-detection open-vocabulary-segmentation
Language:Python