northeastsquare

bytemaster's starred repositories

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonBSD-3-Clause25122 219 450

cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Language:TypeScriptMIT11734 184 3947

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Language:Jupyter NotebookMIT7504 92 146

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Language:PythonMIT6237 60 129

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language:PythonGPL-3.05602 78 141

GroundingDINO

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonApache-2.05524 37 282

OpenPCDet

OpenPCDet Toolbox for LiDAR-based 3D Object Detection.

Language:PythonApache-2.04440 70 1404

ByteTrack

[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box

Language:PythonMIT4422 43 354

kalibr

The Kalibr visual-inertial calibration toolbox

Language:C++NOASSERTION4148 145 567

lora-scripts

LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.

Language:PythonAGPL-3.04031 25 395

LightGlue

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Language:PythonApache-2.03127 50 100

UniAD

[CVPR 2023 Best Paper] Planning-oriented Autonomous Driving

Language:PythonApache-2.03046 34 169

Torch-Pruning

[CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs

Language:PythonMIT2418 33 323

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language:PythonMIT2309 28 225

SensorsCalibration

OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous Driving

Language:C++Apache-2.02172 48 152

EVA

EVA Series: Visual Representation Fantasies from BAAI

Language:PythonMIT2067 31 150

autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Language:PythonApache-2.01661 19 92

apriltag

AprilTag is a visual fiducial system popular for robotics research.

Language:CNOASSERTION1458 47 200

BEVDet

Official code base of the BEVDet series .

Language:PythonApache-2.01321 37 344

A project demonstrating Lidar related AI solutions, including three GPU accelerated Lidar/camera DL networks (PointPillars, CenterPoint, BEVFusion) and the related libs (cuPCL, 3D SparseConvolution, YUV2RGB, cuOSD,).

Language:PythonNOASSERTION1184 17 247

autoware.universe

Language:C++Apache-2.0871 44 822

BoT-SORT

BoT-SORT: Robust Associations Multi-Pedestrian Tracking

Language:Jupyter NotebookMIT833 12 93

FB-BEV

Official PyTorch implementation of FB-BEV & FB-OCC - Forward-backward view transformation for vision-centric autonomous driving perception

Language:PythonNOASSERTION586 30 40

VIMER

视觉预训练基础模型仓库

Language:Python487 21 44

DAIR-V2X

Language:PythonApache-2.0398 16 82

BEVFormer_tensorrt

BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).

Language:PythonApache-2.0370 4 70

bev_lane_det

229 11 31

bevdet-tensorrt-cpp

BEVDet implemented by TensorRT, C++； Achieving real-time performance on Orin

Language:C++209 4 27

YOLOv8-3D

YOLOv8-3D is a LowCode, Simple 2D and 3D Bounding Box Object Detection and Tracking , Python 3.10

Language:Python111 4 12