Jiazhi Yang's repositories
BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
centerformer
Implementation for CenterFormer: Center-based Transformer for 3D Object Detection (ECCV 2022)
DiffusionDet
PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
dino
PyTorch code for training Vision Transformers with the self-supervised learning method DINO
DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
HAL
Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs" at AAAI 2020.
LLaMA-Adapter
Fine-tuning LLaMA to follow instructions within 1 hour and with only 1.2M parameters
lora
Using Low-Rank Adaptation (LoRA) to quickly fine-tune diffusion models.
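The entry above describes fine-tuning diffusion models with low-rank adaptation. A minimal sketch of the core LoRA idea, a frozen base weight plus a trainable low-rank update, is shown below; the class name `LoRALinear` and all hyperparameters are illustrative, not taken from the repository:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Sketch of Low-Rank Adaptation (LoRA): the frozen pretrained weight W
    is augmented with a trainable low-rank update B @ A, so only
    rank * (d_in + d_out) parameters are trained instead of d_in * d_out."""

    def __init__(self, base: nn.Linear, rank: int = 4, alpha: float = 1.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False          # freeze the pretrained layer
        d_out, d_in = base.weight.shape
        # A starts as small noise, B as zeros, so the update begins at zero
        # and the adapted model initially matches the pretrained one.
        self.A = nn.Parameter(torch.randn(rank, d_in) * 0.01)
        self.B = nn.Parameter(torch.zeros(d_out, rank))
        self.scale = alpha / rank

    def forward(self, x):
        return self.base(x) + (x @ self.A.T @ self.B.T) * self.scale

layer = LoRALinear(nn.Linear(16, 8), rank=4)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
# only the rank-4 factors train: 4*16 + 8*4 = 96 parameters,
# versus 16*8 + 8 = 136 frozen parameters in the base layer
```

Because `B` is initialized to zero, the wrapped layer reproduces the base layer exactly until training moves the adapters, which is what makes LoRA safe to bolt onto a pretrained model.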
MAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
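The entry above implements masked autoencoder (MAE) pre-training, whose key step is masking a large random fraction of image patches before the encoder runs. A minimal sketch of that random-masking step follows; the function name, shapes, and mask ratio are illustrative, not taken from the repository:

```python
import torch

def random_masking(patches: torch.Tensor, mask_ratio: float = 0.75):
    """Sketch of MAE-style random masking: keep a random subset of patch
    tokens and return the bookkeeping needed to restore patch order later.

    patches: (B, N, D) batch of N patch embeddings of dimension D.
    Returns (visible, mask, restore):
      visible: (B, keep, D) patches the encoder actually sees
      mask:    (B, N) with 0 = kept, 1 = masked (for the reconstruction loss)
      restore: (B, N) indices that undo the shuffle when decoding
    """
    B, N, D = patches.shape
    keep = int(N * (1 - mask_ratio))
    noise = torch.rand(B, N)              # one random score per patch
    shuffle = noise.argsort(dim=1)        # random permutation of patches
    restore = shuffle.argsort(dim=1)      # inverse permutation
    kept_idx = shuffle[:, :keep]
    visible = torch.gather(
        patches, 1, kept_idx.unsqueeze(-1).expand(-1, -1, D)
    )
    mask = torch.ones(B, N)
    mask.scatter_(1, kept_idx, 0)         # mark the kept positions with 0
    return visible, mask, restore
```

With the default 75% mask ratio, the encoder processes only a quarter of the tokens, which is the main source of MAE's pre-training speedup.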
Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
mmdetection
OpenMMLab Detection Toolbox and Benchmark
nerfvis
NeRF visualization library under construction
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
OpenSelfSup
Self-Supervised Learning Toolbox and Benchmark
PolyLoss
Source code for Universal Weighting Metric Learning for Cross-Modal Matching, accepted at CVPR 2020.
Position-Focused-Attention-Network
Position Focused Attention Network for Image-Text Matching
Proxy-Anchor-CVPR2020
Official PyTorch Implementation of Proxy Anchor Loss for Deep Metric Learning, CVPR 2020
pyllama
LLaMA: Open and Efficient Foundation Language Models
ResNeSt
ResNeSt: Split-Attention Networks
SCAN
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
setup
Set up a new machine without sudo!
ST-P3
[ECCV 2022] ST-P3, an end-to-end vision-based autonomous driving framework built on spatial-temporal feature learning.
Stable-Pix2Seq
A full-fledged version of Pix2Seq
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
TCP
[NeurIPS 2022] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.
transfuser
[PAMI'22] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving, [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving
UniAD
Goal-oriented Autonomous Driving
YOLOX
YOLOX is a high-performance anchor-free YOLO that exceeds YOLOv3-v5, with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO support. Documentation: https://yolox.readthedocs.io/