wangb's starred repositories

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:44606Issues:294Issues:640

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonLicense:NOASSERTIONStargazers:7739Issues:75Issues:481

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language:PythonLicense:Apache-2.0Stargazers:5913Issues:43Issues:158

EditAnything

Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

Language:PythonLicense:Apache-2.0Stargazers:3157Issues:39Issues:57

SensorsCalibration

OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous Driving

Language:C++License:Apache-2.0Stargazers:2103Issues:48Issues:149

thread-pool

Thread pool implementation using c++11 threads

Language:C++License:MITStargazers:1060Issues:34Issues:15

Transformer-in-Computer-Vision

A paper list of some recent Transformer-based CV works.

CVPR2023-3D-Occupancy-Prediction

CVPR2023-Occupancy-Prediction-Challenge

Language:PythonLicense:MITStargazers:744Issues:19Issues:57

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

MonoScene

[CVPR 2022] "MonoScene: Monocular 3D Semantic Scene Completion": 3D Semantic Occupancy Prediction from a single image

Language:PythonLicense:Apache-2.0Stargazers:656Issues:12Issues:94

FB-BEV

Official PyTorch implementation of FB-BEV & FB-OCC - Forward-backward view transformation for vision-centric autonomous driving perception

Language:PythonLicense:NOASSERTIONStargazers:558Issues:29Issues:37

OpenOccupancy

[ICCV 2023] OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception

Language:PythonLicense:Apache-2.0Stargazers:534Issues:13Issues:47

StreamPETR

[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Language:PythonLicense:NOASSERTIONStargazers:486Issues:13Issues:198

Sparse4D

Sparse4D v1 & v2

Language:PythonLicense:MITStargazers:323Issues:12Issues:55

onnx-tool

A parser, editor and profiler tool for ONNX models.

Language:PythonLicense:MITStargazers:321Issues:6Issues:61

OccFormer

[ICCV 2023] OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction

Language:PythonLicense:Apache-2.0Stargazers:293Issues:11Issues:17

SparseBEV

[ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos

Language:PythonLicense:MITStargazers:283Issues:9Issues:69

UniTR

[ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"

Language:PythonLicense:Apache-2.0Stargazers:248Issues:9Issues:21

DriveDreamer

DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving

DDQ

Dense Distinct Query for End-to-End Object Detection (CVPR2023)

Language:PythonLicense:Apache-2.0Stargazers:237Issues:9Issues:20

bevdet-tensorrt-cpp

BEVDet implemented by TensorRT, C++; Achieving real-time performance on Orin

CUDA-FastBEV

TensorRT deploy and PTQ/QAT tools development for FastBEV, total time only need 6.9ms!!!

Language:PythonLicense:MITStargazers:191Issues:3Issues:30

UniScene

Official implementation of our RAL'24 paper: Multi-Camera Unified Pre-training for Autonomous Driving

Language:PythonLicense:MITStargazers:190Issues:10Issues:8

3D-deformable-attention

[ICCV 2023] Official implementation of the paper "DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting"

Language:PythonLicense:NOASSERTIONStargazers:136Issues:3Issues:9

BasicCUDA

A tutorial for CUDA&PyTorch

Language:C++Stargazers:88Issues:1Issues:0

MV2D

Code for "Object as Query: Lifting any 2D Object Detector to 3D Detection"

FBMNet

Multi-Modal 3D Object Detection by Box Matching

layer_norm_expressivity_role

Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL'2023)

Language:PythonStargazers:43Issues:6Issues:0

AAA

This repository is an official implementation of PETR series

Language:PythonLicense:NOASSERTIONStargazers:8Issues:0Issues:0