There are 3 repositories under open-vocabulary-segmentation topic.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API π₯
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
ποΈ + π¬ + π§ = π€ Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
[NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation
Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.
[ICCVW23] VinAI-3DIS Metadata repo of OpenSUN3D