shouhengmzh's starred repositories

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:14675Issues:115Issues:383

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:10419Issues:63Issues:219

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language:PythonLicense:Apache-2.0Stargazers:6727Issues:49Issues:207

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Language:PythonLicense:MITStargazers:6395Issues:62Issues:135

Segment-Everything-Everywhere-All-At-Once

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Language:PythonLicense:Apache-2.0Stargazers:4297Issues:59Issues:145

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:PythonLicense:GPL-3.0Stargazers:4279Issues:39Issues:423

Depth-Anything-V2

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Language:PythonLicense:Apache-2.0Stargazers:3189Issues:30Issues:134
Language:PythonLicense:BSD-3-ClauseStargazers:3187Issues:39Issues:170

SegFormer

Official PyTorch implementation of SegFormer

Language:PythonLicense:NOASSERTIONStargazers:2464Issues:31Issues:150

YOLOP

You Only Look Once for Panopitic Driving Perception.(MIR2022)

Language:PythonLicense:MITStargazers:1894Issues:31Issues:198

glomap

GLOMAP - Global Structured-from-Motion Revisited

Language:C++License:BSD-3-ClauseStargazers:1262Issues:21Issues:59

OMG-Seg

OMG-LLaVA and OMG-Seg codebase

Language:PythonLicense:NOASSERTIONStargazers:1205Issues:23Issues:43

MaskDINO

[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

Language:PythonLicense:Apache-2.0Stargazers:1147Issues:34Issues:108

PoseLib

Minimal solvers for calibrated camera pose estimation

Language:C++License:BSD-3-ClauseStargazers:862Issues:24Issues:44

hierarchical-3d-gaussians

Official implementation of the SIGGRAPH 2024 paper "A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets"

Language:PythonLicense:NOASSERTIONStargazers:845Issues:16Issues:60

OpenSeeD

[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"

Language:PythonLicense:Apache-2.0Stargazers:636Issues:21Issues:37

Grounded-SAM-2

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:624Issues:6Issues:35

mseg-semantic

An Official Repo of CVPR '20 "MSeg: A Composite Dataset for Multi-Domain Segmentation"

Language:PythonLicense:MITStargazers:459Issues:14Issues:30

flowsam

Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi Xie, Andrew Zisserman

Language:PythonLicense:Apache-2.0Stargazers:255Issues:4Issues:12

YOSO

Code release for paper "You Only Segment Once: Towards Real-Time Panoptic Segmentation" [CVPR 2023]

Language:PythonLicense:MITStargazers:247Issues:3Issues:25

wild-gaussians

WildGaussians: 3D Gaussian Splatting In the Wild

Language:PythonLicense:NOASSERTIONStargazers:244Issues:8Issues:18

SEA-RAFT

[ECCV2024 Oral] SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow

Language:PythonLicense:BSD-3-ClauseStargazers:228Issues:7Issues:14

Downstream-Dinov2

Downstream-Dino-V2: A GitHub repository featuring an easy-to-use implementation of the DINOv2 model by Facebook for downstream tasks such as Classification, Semantic Segmentation and Monocular depth estimation.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:177Issues:3Issues:11

Linfer

基于TensorRT的C++高性能推理库,Yolov10, YoloPv2,Yolov5/7/X/8,RT-DETR,单目标跟踪OSTrack、LightTrack。

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:141Issues:4Issues:22

BoTSORT-cpp

C++ implementation of BoT-SORT MOT algorithm with Re-ID and Camera Motion Compensation

NGD-SLAM

NGD-SLAM: Towards Real-Time SLAM for Dynamic Environments without GPU.

Language:C++License:GPL-3.0Stargazers:80Issues:3Issues:4

seg-dinov2

Fine-tuning dino v2 for semantic segmentation task on MSCOCO.

Language:PythonLicense:NOASSERTIONStargazers:16Issues:2Issues:2
Stargazers:6Issues:0Issues:0
Language:C++Stargazers:2Issues:0Issues:0