bytemaster's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language: Python | License: MIT | Stargazers: 165055 | Issues: 1561 | Issues: 2408

MiniGPT-4

Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language: Python | License: BSD-3-Clause | Stargazers: 25179 | Issues: 222 | Issues: 452

cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Language: TypeScript | License: MIT | Stargazers: 11268 | Issues: 182 | Issues: 3796

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Language: Jupyter Notebook | License: MIT | Stargazers: 7528 | Issues: 92 | Issues: 146

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language: Python | License: Apache-2.0 | Stargazers: 5781 | Issues: 37 | Issues: 287

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language: Python | License: GPL-3.0 | Stargazers: 5624 | Issues: 78 | Issues: 141

ByteTrack

[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box

Language: Python | License: MIT | Stargazers: 4511 | Issues: 43 | Issues: 358

OpenPCDet

OpenPCDet Toolbox for LiDAR-based 3D Object Detection.

Language: Python | License: Apache-2.0 | Stargazers: 4499 | Issues: 71 | Issues: 1412

kalibr

The Kalibr visual-inertial calibration toolbox

Language: C++ | License: NOASSERTION | Stargazers: 4216 | Issues: 146 | Issues: 571

lora-scripts

LoRA & Dreambooth training scripts and GUI for diffusion models, using kohya-ss's trainer.

Language: Python | License: AGPL-3.0 | Stargazers: 4144 | Issues: 26 | Issues: 401

LightGlue

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Language: Python | License: Apache-2.0 | Stargazers: 3185 | Issues: 50 | Issues: 101

UniAD

[CVPR'23 Best Paper Award] Planning-oriented Autonomous Driving

Language: Python | License: Apache-2.0 | Stargazers: 3148 | Issues: 34 | Issues: 171

Torch-Pruning

[CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs

Language: Python | License: MIT | Stargazers: 2475 | Issues: 34 | Issues: 333

InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Language: Python | License: MIT | Stargazers: 2423 | Issues: 35 | Issues: 259

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language: Python | License: MIT | Stargazers: 2369 | Issues: 29 | Issues: 226

SensorsCalibration

OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous Driving

Language: C++ | License: Apache-2.0 | Stargazers: 2241 | Issues: 48 | Issues: 157

EVA

EVA Series: Visual Representation Fantasies from BAAI

Language: Python | License: MIT | Stargazers: 2132 | Issues: 31 | Issues: 154

autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Language: Python | License: Apache-2.0 | Stargazers: 1715 | Issues: 19 | Issues: 93

apriltag

AprilTag is a visual fiducial system popular for robotics research.

Language: C | License: NOASSERTION | Stargazers: 1488 | Issues: 47 | Issues: 201

BEVDet

Official code base of the BEVDet series.

Language: Python | License: Apache-2.0 | Stargazers: 1343 | Issues: 37 | Issues: 350

Lidar_AI_Solution

A project demonstrating Lidar-related AI solutions, including three GPU-accelerated Lidar/camera DL networks (PointPillars, CenterPoint, BEVFusion) and the related libraries (cuPCL, 3D SparseConvolution, YUV2RGB, cuOSD).

Language: Python | License: NOASSERTION | Stargazers: 1214 | Issues: 17 | Issues: 248

BoT-SORT

BoT-SORT: Robust Associations Multi-Pedestrian Tracking

Language: Jupyter Notebook | License: MIT | Stargazers: 851 | Issues: 12 | Issues: 96

Language: Python | License: MIT | Stargazers: 749 | Issues: 20 | Issues: 70

FB-BEV

Official PyTorch implementation of FB-BEV & FB-OCC - Forward-backward view transformation for vision-centric autonomous driving perception

Language: Python | License: NOASSERTION | Stargazers: 606 | Issues: 29 | Issues: 41

VIMER

Repository of visual pre-training foundation models

RTM3D

The official PyTorch Implementation of RTM3D and KM3D for Monocular 3D Object Detection

Language: Python | License: MIT | Stargazers: 450 | Issues: 46 | Issues: 65

PersFormer_3DLane

[ECCV2022 Oral] Perspective Transformer on 3D Lane Detection

Language: Python | License: Apache-2.0 | Stargazers: 417 | Issues: 15 | Issues: 124

BEVFormer_tensorrt

BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).

Language: Python | License: Apache-2.0 | Stargazers: 388 | Issues: 4 | Issues: 72

bevdet-tensorrt-cpp

BEVDet implemented with TensorRT in C++, achieving real-time performance on Orin