There are 23 repositories under segment-anything topic.
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Segment Anything for Stable Diffusion WebUI
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything, MobileSAM!!
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
Images to inference with no labeling (use foundation models to train supervised models).
Segment-Anything + 3D. Let's lift anything to 3D.
Tracking and collecting papers/projects/others related to Segment Anything.
EfficientViT is a new family of vision models for efficient high-resolution vision.
收集 CVPR 最新的成果,包括论文、代码和demo视频等,欢迎大家推荐!Collect the latest CVPR (Conference on Computer Vision and Pattern Recognition) results, including papers, code, and demo videos, etc., and welcome recommendations from everyone!
Labeling tool with SAM(segment anything model),supports SAM, sam-hq, MobileSAM EdgeSAM etc.交互式半自动图像标注工具
Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
MetaSeg: Packaged version of the Segment Anything repository
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
Segment Anything in 3D with NeRFs (NeurIPS 2023)
SSSegmentation: An Open Source Supervised Semantic Segmentation Toolbox Based on PyTorch.
Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scenarios
Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,efficientdet,edgenext,efficientformer,efficientnet,eva,fasternet,fastervit,fastvit,flexivit,gcvit,ghostnet,gpvit,hornet,hiera,iformer,inceptionnext,lcnet,levit,maxvit,mobilevit,moganet,nat,nfnets,pvt,swin,tinynet,tinyvit,uniformer,volo,vanillanet,yolor,yolov7,yolov8,yolox,gpt2,llama2, alias kecam
A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with serval downstream tasks, e.g., Text Removal and Text Inpainting
AI-First Process Automation with Large [Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
The official implementation of SAGA (Segment Any 3D GAussians)
The implementation of the technical report: "Customized Segment Anything Model for Medical Image Segmentation"
Based on GroundingDino and SAM, use semantic strings to segment any element in an image. The comfyui version of sd-webui-segment-anything.
Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"