Beast code in Giters

Yao Zhou's starred repositories

RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Language:Python73600

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.02104800

FusionAD

An open source autonomous driving stack by San Jose State University Autonomous Driving Team

Language:C++MIT4000

llama

Inference code for Llama models

Language:PythonNOASSERTION5485100

PF-Track

Implementation of PF-Track

Language:PythonNOASSERTION18900

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.04600800

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:PythonMIT349400

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonAGPL-3.013721400

deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Language:Jupyter NotebookApache-2.01299300

cross_view_transformers

Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral)

Language:PythonMIT51900

spconv

Spatial Sparse Convolution Library

Language:PythonApache-2.0181300

VISTA

This repo presents you the official code of "VISTA: Boosting 3D Object Detection via Dual Cross-VIew SpaTial Attention"

Language:PythonMIT12600

dd3d

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

Language:PythonMIT45900

Neighborhood-Attention-Transformer

Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022

Language:PythonMIT102700

BEVFormer

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Language:PythonApache-2.0312900

DN-DETR

[CVPR 2022 Oral] Official implementation of DN-DETR

Language:PythonApache-2.053100

YOLOX

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Language:PythonApache-2.0921300

sparse-detr

PyTorch Implementation of Sparse DETR

Language:PythonApache-2.015700

BoxeR

Code release for "BoxeR: Box-Attention for 2D and 3D Transformers"

Language:PythonMIT13700

detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Language:PythonApache-2.02962800

uniformer-pytorch

Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, debuted in ICLR 2022

Language:PythonMIT9600

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT2994400

mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Language:PythonNOASSERTION704000

DeFCN

End-to-End Object Detection with Fully Convolutional Network

Language:PythonApache-2.049100

bundle-adjusting-NeRF

BARF: Bundle-Adjusting Neural Radiance Fields 🤮 (ICCV 2021 oral)

Language:PythonMIT77800

mtl

Unofficial implementation of: Multi-task learning using uncertainty to weigh losses for scene geometry and semantics

Language:PythonBSD-2-Clause52900

multi-task-refinenet

Multi-Task (Joint Segmentation / Depth / Surface Normas) Real-Time Light-Weight RefineNet

Language:Jupyter NotebookNOASSERTION19500

visualDet3D

Official Repo for Ground-aware Monocular 3D Object Detection for Autonomous Driving / YOLOStereo3D: A Step Back to 2D for Efficient Stereo 3D Detection

Language:PythonApache-2.036200

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonApache-2.03101300

zhouyao4321