HaoCheng's repositories

QCNet

[CVPR 2023] Query-Centric Trajectory Prediction

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

3D-Box-Segment-Anything

We extend Segment Anything to 3D perception by combining it with VoxelNeXt.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

aot-benchmark

An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

Collaborative_Perception

This repository is a paper digest of recent advances in collaborative / cooperative / multi-agent perception for V2I / V2V / V2X autonomous driving scenario.

Stargazers:0Issues:0Issues:0

ConditionalDETR

This repository is an official implementation of the ICCV 2021 paper "Conditional DETR for Fast Training Convergence". (https://arxiv.org/abs/2108.06152)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DAB-DETR

[ICLR 2022] Official implementation of the paper "DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR"

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DeepAccident

Code for the benchmark - DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving.

Stargazers:0Issues:0Issues:0

Deformable-DETR

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DriveLM

DriveLM: Driving with Graph Visual Question Answering

License:Apache-2.0Stargazers:0Issues:0Issues:0

FastSAM

Fast Segment Anything

License:Apache-2.0Stargazers:0Issues:0Issues:0

futr3d

Code for paper: FUTR3D: a unified sensor fusion framework for 3d detection

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

Grounded-Segment-Anything

Marrying Grounding DINO with Segment Anything & Stable Diffusion & Tag2Text & BLIP & Whisper & ChatBot - Automatically Detect , Segment and Generate Anything with Image, Text, and Audio Inputs

License:Apache-2.0Stargazers:0Issues:0Issues:0

H-Deformable-DETR

[CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".

License:MITStargazers:0Issues:0Issues:0

l5kit

L5Kit - https://woven.toyota

Stargazers:0Issues:0Issues:0

LAformer

Official PyTorch Implementation of "LAformer: Trajectory Prediction for Autonomous Driving with Lane-Aware Scene Constraints"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

MaskDINO

[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mile

PyTorch code for the paper "Model-Based Imitation Learning for Urban Driving".

License:MITStargazers:0Issues:0Issues:0

MobileSAM

This is the offiicial code for Faster Segment Anything (MobileSAM) project that makes SAM lightweight

License:Apache-2.0Stargazers:0Issues:0Issues:0

PF-Track

Implementation of PF-Track

License:NOASSERTIONStargazers:0Issues:0Issues:0

Segment-and-Track-Anything

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

simple_bev

A Simple Baseline for BEV Perception

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Sparse4D

Sparse4D v1 & v2

License:MITStargazers:0Issues:0Issues:0

UniAD

[CVPR 2023 Award Candidate] Planning-oriented Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

License:MITStargazers:0Issues:0Issues:0

Video-Swin-Transformer

This is an official implementation for "Video Swin Transformers".

License:Apache-2.0Stargazers:0Issues:0Issues:0

Vim

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

VoxelNeXt

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)

License:Apache-2.0Stargazers:0Issues:0Issues:0