ywfwyht's repositories
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXt, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
bevfusion
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
ConvNeXt-V2
Code release for ConvNeXt V2 model
fastai
The fastai deep learning library
FasterTransformer
Transformer-related optimization, including BERT and GPT
FasterViT
Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
FB-BEV
Official PyTorch implementation of FB-BEV & FB-OCC - Forward-backward view transformation for vision-centric autonomous driving perception
GANet
A Keypoint-based Global Association Network for Lane Detection. Accepted by CVPR 2022
K-Radar
4D Radar Object Detection for Autonomous Driving in Various Weather Conditions
LATR
[ICCV2023 Oral] LATR: 3D Lane Detection from Monocular Images with Transformer
Lidar_AI_Solution
A project demonstrating Lidar-related AI solutions, including three GPU-accelerated Lidar/camera DL networks (PointPillars, CenterPoint, BEVFusion) and the related libs (cuPCL, 3D SparseConvolution, YUV2RGB, cuOSD).
LMDrive
LMDrive: Closed-Loop End-to-End Driving with Large Language Models
nuscenes-devkit
The devkit of the nuScenes dataset.
Occ4cast
Occ4cast: LiDAR-based 4D Occupancy Completion and Forecasting
occupancy-for-nuscenes
3D occupancy
oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
onnx-modifier
A tool to modify onnx models in a visualization fashion, based on Netron and Flask.
OpenOccupancy
[ICCV 2023] OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
OpenPCDet
OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
Paddle3D
A 3D computer vision development toolkit based on PaddlePaddle. It supports point-cloud object detection, segmentation, and monocular 3D object detection models.
PersFormer_3DLane
[ECCV2022 oral] Perspective Transformer on 3D Lane Detection
ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
SurroundOcc
[ICCV 2023] SurroundOcc: Multi-camera 3D Occupancy Prediction for Autonomous Driving
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference.
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
VitTRTPlugin
ViT (Vision Transformer) TensorRT plugin