northeastsquare

bytemaster's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonMIT165055 1561 2408

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonBSD-3-Clause25179 222 452

cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Language:TypeScriptMIT11268 182 3796

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Language:Jupyter NotebookMIT7528 92 146

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonApache-2.05781 37 287

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language:PythonGPL-3.05624 78 141

ByteTrack

[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box

Language:PythonMIT4511 43 358

OpenPCDet

OpenPCDet Toolbox for LiDAR-based 3D Object Detection.

Language:PythonApache-2.04499 71 1412

kalibr

The Kalibr visual-inertial calibration toolbox

Language:C++NOASSERTION4216 146 571

lora-scripts

LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.

Language:PythonAGPL-3.04144 26 401

LightGlue

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Language:PythonApache-2.03185 50 101

UniAD

[CVPR'23 Best Paper Award] Planning-oriented Autonomous Driving

Language:PythonApache-2.03148 34 171

Torch-Pruning

[CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs

Language:PythonMIT2475 34 333

InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Language:PythonMIT2423 35 259

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language:PythonMIT2369 29 226

SensorsCalibration

OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous Driving

Language:C++Apache-2.02241 48 157

EVA

EVA Series: Visual Representation Fantasies from BAAI

Language:PythonMIT2132 31 154

autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Language:PythonApache-2.01715 19 93

apriltag

AprilTag is a visual fiducial system popular for robotics research.

Language:CNOASSERTION1488 47 201

BEVDet

Official code base of the BEVDet series .

Language:PythonApache-2.01343 37 350

A project demonstrating Lidar related AI solutions, including three GPU accelerated Lidar/camera DL networks (PointPillars, CenterPoint, BEVFusion) and the related libs (cuPCL, 3D SparseConvolution, YUV2RGB, cuOSD,).

Language:PythonNOASSERTION1214 17 248