Yao Zhou's starred repositories

RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Language:PythonStargazers:736Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:21048Issues:0Issues:0

FusionAD

An open source autonomous driving stack by San Jose State University Autonomous Driving Team

Language:C++License:MITStargazers:40Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:54851Issues:0Issues:0

PF-Track

Implementation of PF-Track

Language:PythonLicense:NOASSERTIONStargazers:189Issues:0Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:46008Issues:0Issues:0

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:PythonLicense:MITStargazers:3494Issues:0Issues:0

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:137214Issues:0Issues:0

deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:12993Issues:0Issues:0

cross_view_transformers

Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral)

Language:PythonLicense:MITStargazers:519Issues:0Issues:0

spconv

Spatial Sparse Convolution Library

Language:PythonLicense:Apache-2.0Stargazers:1813Issues:0Issues:0

VISTA

This repo presents you the official code of "VISTA: Boosting 3D Object Detection via Dual Cross-VIew SpaTial Attention"

Language:PythonLicense:MITStargazers:126Issues:0Issues:0

dd3d

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

Language:PythonLicense:MITStargazers:459Issues:0Issues:0

Neighborhood-Attention-Transformer

Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022

Language:PythonLicense:MITStargazers:1027Issues:0Issues:0

BEVFormer

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Language:PythonLicense:Apache-2.0Stargazers:3129Issues:0Issues:0

DN-DETR

[CVPR 2022 Oral] Official implementation of DN-DETR

Language:PythonLicense:Apache-2.0Stargazers:531Issues:0Issues:0

YOLOX

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Language:PythonLicense:Apache-2.0Stargazers:9213Issues:0Issues:0

sparse-detr

PyTorch Implementation of Sparse DETR

Language:PythonLicense:Apache-2.0Stargazers:157Issues:0Issues:0

BoxeR

Code release for "BoxeR: Box-Attention for 2D and 3D Transformers"

Language:PythonLicense:MITStargazers:137Issues:0Issues:0

detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Language:PythonLicense:Apache-2.0Stargazers:29628Issues:0Issues:0

uniformer-pytorch

Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, debuted in ICLR 2022

Language:PythonLicense:MITStargazers:96Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:29944Issues:0Issues:0

mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Language:PythonLicense:NOASSERTIONStargazers:7040Issues:0Issues:0

DeFCN

End-to-End Object Detection with Fully Convolutional Network

Language:PythonLicense:Apache-2.0Stargazers:491Issues:0Issues:0

bundle-adjusting-NeRF

BARF: Bundle-Adjusting Neural Radiance Fields 🤮 (ICCV 2021 oral)

Language:PythonLicense:MITStargazers:778Issues:0Issues:0

mtl

Unofficial implementation of: Multi-task learning using uncertainty to weigh losses for scene geometry and semantics

Language:PythonLicense:BSD-2-ClauseStargazers:529Issues:0Issues:0

multi-task-refinenet

Multi-Task (Joint Segmentation / Depth / Surface Normas) Real-Time Light-Weight RefineNet

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:195Issues:0Issues:0

visualDet3D

Official Repo for Ground-aware Monocular 3D Object Detection for Autonomous Driving / YOLOStereo3D: A Step Back to 2D for Efficient Stereo 3D Detection

Language:PythonLicense:Apache-2.0Stargazers:362Issues:0Issues:0
Language:PythonLicense:MITStargazers:128Issues:0Issues:0

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonLicense:Apache-2.0Stargazers:31013Issues:0Issues:0