Xiaodong Yang (xiaodongyang)

Company: NVIDIA Research

Location: Santa Clara, CA

Home Page: http://xiaodongyang.org

Xiaodong Yang's starred repositories

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language: Python | License: Apache-2.0 | Stargazers: 31172 | Issues: 310 | Issues: 898

LLM101n

LLM101n: Let's build a Storyteller

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 13015 | Issues: 114 | Issues: 975

mit-deep-learning

Tutorials, assignments, and competitions for MIT Deep Learning related courses.

Language: Jupyter Notebook | License: MIT | Stargazers: 10107 | Issues: 642 | Issues: 11

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language: Python | License: MIT | Stargazers: 8286 | Issues: 558 | Issues: 128

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language: Python | License: Apache-2.0 | Stargazers: 8008 | Issues: 56 | Issues: 1493

pytorch-OpCounter

Count the MACs / FLOPs of your PyTorch model.

Language: Python | License: MIT | Stargazers: 4812 | Issues: 30 | Issues: 169

nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Language: Python | License: Apache-2.0 | Stargazers: 4425 | Issues: 26 | Issues: 83

BEVFormer

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Language: Python | License: Apache-2.0 | Stargazers: 3167 | Issues: 69 | Issues: 263

matplotlib-cheatsheet

Matplotlib 3.1 cheat sheet.

Language: Python | License: BSD-2-Clause | Stargazers: 2896 | Issues: 92 | Issues: 2

SensorsCalibration

OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous Driving

Language: C++ | License: Apache-2.0 | Stargazers: 2284 | Issues: 48 | Issues: 161

bevfusion

[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

Language: Python | License: Apache-2.0 | Stargazers: 2195 | Issues: 42 | Issues: 607

Transformers-Recipe

🧠 A study guide to learn about Transformers

BEVDet

Official code base of the BEVDet series.

Language: Python | License: Apache-2.0 | Stargazers: 1360 | Issues: 37 | Issues: 350

data-centric-AI

A curated, but incomplete, list of data-centric AI resources.

SegmenTron

Supports PointRend, Fast_SCNN, HRNet, Deeplabv3_plus (xception, resnet, mobilenet), ContextNet, FPENet, DABNet, EdaNet, ENet, Espnetv2, RefineNet, UNet, DANet, HRNet, DFANet, HardNet, LedNet, OCNet, EncNet, DuNet, CGNet, CCNet, BiSeNet, PSPNet, ICNet, FCN, and deeplab.

Language: Python | License: Apache-2.0 | Stargazers: 695 | Issues: 15 | Issues: 66

nuplan-devkit

The devkit of the nuPlan dataset.

Language: Python | License: NOASSERTION | Stargazers: 653 | Issues: 20 | Issues: 331

pillarnext

PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds (CVPR 2023)

Language: Python | License: NOASSERTION | Stargazers: 178 | Issues: 6 | Issues: 22

LiDAR_snow_sim

LiDAR snowfall simulation

Language: Python | License: NOASSERTION | Stargazers: 171 | Issues: 11 | Issues: 30

simtrack

Exploring Simple 3D Multi-Object Tracking for Autonomous Driving (ICCV 2021)

Language: Python | License: NOASSERTION | Stargazers: 166 | Issues: 15 | Issues: 33

VehicleX

VehicleX: Simulating Content Consistent Vehicle Datasets with Attribute Descent (ECCV 2020, TPAMI 2023)

CODD

Cooperative Driving Dataset: a dataset for multi-agent driving scenarios

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 135 | Issues: 6 | Issues: 6

Learning-to-See-Moving-Objects-in-the-Dark

Learning to See Moving Objects in the Dark. ICCV 2019

Language: Python | License: MIT | Stargazers: 131 | Issues: 8 | Issues: 8

pillar-motion

Self-Supervised Pillar Motion Learning for Autonomous Driving (CVPR 2021)

Language: Python | License: NOASSERTION | Stargazers: 119 | Issues: 10 | Issues: 18

distill-bev

DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)

info-ground

Learning phrase grounding from captioned images through InfoNCE bound on mutual information

Language: Python | License: NOASSERTION | Stargazers: 71 | Issues: 4 | Issues: 6

MANTRA-CVPR20

Official PyTorch code for MANTRA - Memory Augmented Neural Trajectory Predictor (CVPR 2020)

Language: Python | License: NOASSERTION | Stargazers: 68 | Issues: 3 | Issues: 11

gedepth

GEDepth: Ground Embedding for Monocular Depth Estimation (ICCV 2023)

Language: Python | License: NOASSERTION | Stargazers: 53 | Issues: 5 | Issues: 8

tip

Transcendental Idealism of Planner: Evaluating Perception from Planning Perspective for Autonomous Driving (ICML 2023)

Language: Python | License: NOASSERTION | Stargazers: 20 | Issues: 8 | Issues: 1