wangb's starred repositories
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
SensorsCalibration
OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous Driving
thread-pool
Thread pool implementation using c++11 threads
Transformer-in-Computer-Vision
A paper list of some recent Transformer-based CV works.
CVPR2023-3D-Occupancy-Prediction
CVPR2023-Occupancy-Prediction-Challenge
LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
OpenOccupancy
[ICCV 2023] OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
StreamPETR
[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
DriveDreamer
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
bevdet-tensorrt-cpp
BEVDet implemented by TensorRT, C++; Achieving real-time performance on Orin
CUDA-FastBEV
TensorRT deploy and PTQ/QAT tools development for FastBEV, total time only need 6.9ms!!!
3D-deformable-attention
[ICCV 2023] Official implementation of the paper "DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting"
layer_norm_expressivity_role
Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL'2023)