There are 27 repositories under the segment-anything topic.
Ultralytics YOLO
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Segment Anything for Stable Diffusion WebUI
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)
Efficient vision foundation models for high-resolution generation and perception.
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
Effortless AI-assisted data labeling, supported by YOLO, Segment Anything (SAM and SAM2), and MobileSAM.
Images to inference with no labeling (use foundation models to train supervised models).
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API
Tracking and collecting papers/projects/others related to Segment Anything.
Segment-Anything + 3D. Let's lift anything to 3D.
Labeling tool with SAM (Segment Anything Model); supports SAM, SAM2, sam-hq, MobileSAM, EdgeSAM, etc. An interactive semi-automatic image annotation tool.
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
Collect the latest CVPR (Conference on Computer Vision and Pattern Recognition) results, including papers, code, and demo videos. Recommendations from everyone are welcome!
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts
[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.
MetaSeg: Packaged version of the Segment Anything repository
Segment Anything in 3D with NeRFs (NeurIPS 2023)
Based on GroundingDino and SAM, use semantic strings to segment any element in an image. The comfyui version of sd-webui-segment-anything.
SSSegmentation: An Open Source Supervised Semantic Segmentation Toolbox Based on PyTorch.
Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scenarios
A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation
A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT
The official implementation of SAGA (Segment Any 3D GAussians)
Medical SAM 2: Segment Medical Images As Video Via Segment Anything Model 2
Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,efficientdet,edgenext,efficientformer,efficientnet,eva,fasternet,fastervit,fastvit,flexivit,gcvit,ghostnet,gpvit,hornet,hiera,iformer,inceptionnext,lcnet,levit,maxvit,mobilevit,moganet,nat,nfnets,pvt,swin,tinynet,tinyvit,uniformer,volo,vanillanet,yolor,yolov7,yolov8,yolox,gpt2,llama2, alias kecam
Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
[Open-Source Project] Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with several downstream tasks, e.g., Text Removal and Text Inpainting