Wu Xiaodong's starred repositories

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:46858Issues:305Issues:662

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language:PythonLicense:NOASSERTIONStargazers:35650Issues:996Issues:188

StableLM

StableLM: Stability AI Language Models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:15842Issues:200Issues:76

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:14838Issues:114Issues:385

leetcode_101

LeetCode 101:和你一起你轻松刷题(C++)

Deep-Learning-Interview-Book

深度学习面试宝典(含数学、机器学习、深度学习、计算机视觉、自然语言处理和SLAM等方向)

FastSAM

Fast Segment Anything

Language:PythonLicense:AGPL-3.0Stargazers:7376Issues:57Issues:203

MobileSAM

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4709Issues:44Issues:123

Segment-Everything-Everywhere-All-At-Once

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Language:PythonLicense:Apache-2.0Stargazers:4320Issues:59Issues:147

rtabmap

RTAB-Map library and standalone application

Language:C++License:NOASSERTIONStargazers:2733Issues:95Issues:1139

Semantic-Segment-Anything

Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).

Language:PythonLicense:Apache-2.0Stargazers:2116Issues:19Issues:58

Anything-3D

Segment-Anything + 3D. Let's lift anything to 3D.

Language:PythonLicense:MITStargazers:1542Issues:35Issues:15

Pointcept

Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)

Language:PythonLicense:MITStargazers:1512Issues:19Issues:296

UNINEXT

[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval

Language:PythonLicense:MITStargazers:1491Issues:99Issues:56

Awesome-CLIP

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

SegmentAnything3D

[ICCV'23 Workshop] SAM3D: Segment Anything in 3D Scenes

Language:PythonLicense:MITStargazers:961Issues:15Issues:48

omni3d

Code release for "Omni3D A Large Benchmark and Model for 3D Object Detection in the Wild"

Language:PythonLicense:NOASSERTIONStargazers:705Issues:22Issues:51

ov-seg

This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:678Issues:12Issues:31

OpenSeeD

[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"

Language:PythonLicense:Apache-2.0Stargazers:640Issues:21Issues:37

openscene

[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies

Language:PythonLicense:Apache-2.0Stargazers:633Issues:19Issues:88

ucasproposal

LaTeX Proposal Template for the University of Chinese Academy of Sciences

Segment-Any-Point-Cloud

[NeurIPS'23 Spotlight] Segment Any Point Cloud Sequences by Distilling Vision Foundation Models

segment-anything-annotator

We developed a python UI based on labelme and segment-anything for pixel-level annotation. It support multiple masks generation by SAM(box/point prompt), efficient polygon modification and category record. We will add more features (such as incorporating CLIP-based methods for category proposal and VOS methods for video datasets

Language:PythonLicense:GPL-3.0Stargazers:343Issues:1Issues:33

laion-3d

Collect large 3d dataset and build models

PLA

(CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR2024) RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding

Language:PythonLicense:Apache-2.0Stargazers:254Issues:14Issues:53

PointCLIP_V2

[ICCV 2023] PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning

Language:PythonLicense:MITStargazers:223Issues:10Issues:29
Language:PythonLicense:MITStargazers:222Issues:11Issues:54
Language:PythonLicense:NOASSERTIONStargazers:151Issues:13Issues:15

Virtual-Multi-View-Fusion

An Elegant PyTorch Implementation of ECCV'2020: Virtual Multi View Fusion for 3D Semantic Segmentation.

Language:PythonStargazers:1Issues:1Issues:0