bytemaster's starred repositories

MiniGPT-4

Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language: Python | License: BSD-3-Clause | Stargazers: 25122 | Issues: 219 | Issues: 450

cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Language: TypeScript | License: MIT | Stargazers: 11734 | Issues: 184 | Issues: 3947

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Language: Jupyter Notebook | License: MIT | Stargazers: 7504 | Issues: 92 | Issues: 146

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Language: Python | License: MIT | Stargazers: 6237 | Issues: 60 | Issues: 129

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language: Python | License: GPL-3.0 | Stargazers: 5602 | Issues: 78 | Issues: 141

GroundingDINO

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language: Python | License: Apache-2.0 | Stargazers: 5524 | Issues: 37 | Issues: 282

OpenPCDet

OpenPCDet Toolbox for LiDAR-based 3D Object Detection.

Language: Python | License: Apache-2.0 | Stargazers: 4440 | Issues: 70 | Issues: 1404

ByteTrack

[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box

Language: Python | License: MIT | Stargazers: 4422 | Issues: 43 | Issues: 354

kalibr

The Kalibr visual-inertial calibration toolbox

Language: C++ | License: NOASSERTION | Stargazers: 4148 | Issues: 145 | Issues: 567

lora-scripts

LoRA & Dreambooth training scripts & GUI using kohya-ss's trainer, for diffusion models.

Language: Python | License: AGPL-3.0 | Stargazers: 4031 | Issues: 25 | Issues: 395

LightGlue

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Language: Python | License: Apache-2.0 | Stargazers: 3127 | Issues: 50 | Issues: 100

UniAD

[CVPR 2023 Best Paper] Planning-oriented Autonomous Driving

Language: Python | License: Apache-2.0 | Stargazers: 3046 | Issues: 34 | Issues: 169

Torch-Pruning

[CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs

Language: Python | License: MIT | Stargazers: 2418 | Issues: 33 | Issues: 323

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language: Python | License: MIT | Stargazers: 2309 | Issues: 28 | Issues: 225

SensorsCalibration

OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous Driving

Language: C++ | License: Apache-2.0 | Stargazers: 2172 | Issues: 48 | Issues: 152

EVA

EVA Series: Visual Representation Fantasies from BAAI

Language: Python | License: MIT | Stargazers: 2067 | Issues: 31 | Issues: 150
Language: Python | License: Apache-2.0 | Stargazers: 2067 | Issues: 128 | Issues: 54

autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Language: Python | License: Apache-2.0 | Stargazers: 1661 | Issues: 19 | Issues: 92

apriltag

AprilTag is a visual fiducial system popular for robotics research.

Language: C | License: NOASSERTION | Stargazers: 1458 | Issues: 47 | Issues: 200

BEVDet

Official code base of the BEVDet series.

Language: Python | License: Apache-2.0 | Stargazers: 1321 | Issues: 37 | Issues: 344

Lidar_AI_Solution

A project demonstrating Lidar-related AI solutions, including three GPU-accelerated Lidar/camera DL networks (PointPillars, CenterPoint, BEVFusion) and the related libs (cuPCL, 3D SparseConvolution, YUV2RGB, cuOSD).

Language: Python | License: NOASSERTION | Stargazers: 1184 | Issues: 17 | Issues: 247

BoT-SORT

BoT-SORT: Robust Associations Multi-Pedestrian Tracking

Language: Jupyter Notebook | License: MIT | Stargazers: 833 | Issues: 12 | Issues: 93

FB-BEV

Official PyTorch implementation of FB-BEV & FB-OCC - Forward-backward view transformation for vision-centric autonomous driving perception

Language: Python | License: NOASSERTION | Stargazers: 586 | Issues: 30 | Issues: 40

VIMER

Repository of visual pre-training foundation models

Language: Python | License: Apache-2.0 | Stargazers: 398 | Issues: 16 | Issues: 82

BEVFormer_tensorrt

BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).

Language: Python | License: Apache-2.0 | Stargazers: 370 | Issues: 4 | Issues: 70

bevdet-tensorrt-cpp

BEVDet implemented with TensorRT and C++, achieving real-time performance on Orin

YOLOv8-3D

YOLOv8-3D is a low-code, simple 2D and 3D bounding box object detection and tracking tool, Python 3.10