Bo Jiang (rb93dett)

rb93dett

Geek Repo

Company:HUST | Intern at Horzion Robotics

Location:Beijing

Home Page:https://www.cnblogs.com/RB26DETT/

Github PK Tool:Github PK Tool

Bo Jiang's starred repositories

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5659Issues:46Issues:73

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:PythonLicense:GPL-3.0Stargazers:3952Issues:38Issues:367

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language:PythonLicense:Apache-2.0Stargazers:2585Issues:32Issues:96

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonLicense:Apache-2.0Stargazers:1568Issues:21Issues:85

waymax

A JAX-based simulator for autonomous driving research.

Language:PythonLicense:NOASSERTIONStargazers:801Issues:14Issues:54

Awesome-LLM4AD

A curated list of awesome LLM for Autonomous Driving resources (continually updated)

awesome-CARLA

👉 CARLA resources such as tutorial, blog, code and etc https://github.com/carla-simulator/carla

GaussianObject

Code for "GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting"

DriveLM

[ECCV 2024] DriveLM: Driving with Graph Visual Question Answering

Language:HTMLLicense:Apache-2.0Stargazers:716Issues:21Issues:69

MEGABYTE-pytorch

Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch

Language:PythonLicense:MITStargazers:600Issues:10Issues:13

LMDrive

[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:546Issues:17Issues:61

DrivingDiffusion

Layout-Guided multi-view driving scene video generation with latent diffusion model

Language:PythonLicense:MITStargazers:515Issues:18Issues:10

VAD

[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:505Issues:27Issues:68

DriveAGI

[Incl. GenAD, CVPR 2024 Highlight] Embracing Foundation Models into Autonomous Agent and System

Language:PythonLicense:Apache-2.0Stargazers:466Issues:24Issues:6

tuplan_garage

[CoRL'23] Parting with Misconceptions about Learning-based Vehicle Motion Planning

Language:PythonLicense:NOASSERTIONStargazers:451Issues:21Issues:39

Matte-Anything

[Image and Vision Computing (Vol.147 Jul. '24)] Interactive Natural Image Matting with Segment Anything Models

Language:PythonLicense:MITStargazers:436Issues:8Issues:21

Driving-with-LLMs

PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving"

Language:PythonLicense:Apache-2.0Stargazers:367Issues:16Issues:23

JudgeLM

An open-sourced LLM judge for evaluating LLM-generated answers.

Language:PythonLicense:Apache-2.0Stargazers:278Issues:7Issues:16

VMA

A general map auto annotation framework based on MapTR, with high flexibility in terms of spatial scale and element type

Language:PythonLicense:MITStargazers:178Issues:13Issues:14

Symphonies

[CVPR 2024] Symphonies (Scene-from-Insts): Symphonize 3D Semantic Scene Completion with Contextual Instance Queries

Language:PythonLicense:MITStargazers:142Issues:9Issues:23

forecast-mae

[ICCV'2023] Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders

Language:Jupyter NotebookStargazers:142Issues:4Issues:17

SparseTrack

Official PyTorch implementation of SparseTrack (the new version of code will come soon)

Language:PythonLicense:MITStargazers:125Issues:7Issues:31

ELM

[ECCV 2024] Embodied Understanding of Driving Scenarios

HDRFlow

[CVPR 2024] Real-Time HDR Video Reconstruction

Language:PythonLicense:MITStargazers:99Issues:4Issues:2

ProRes

ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration

MIM4D

MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning

WeakSAM

WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition

Language:PythonStargazers:28Issues:3Issues:0

CircuitFormer

[NeurIPS 2023] CircuitFormer: Circuit as Set of Points