SheffieldCao

Xu CAO's repositories

BEVFormer

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Language:PythonApache-2.0000

CAT-Seg

Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"

Language:Python000

clip-interrogator

Image to prompt with BLIP and CLIP

Language:PythonMIT000

Co-DETR

[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training

Language:PythonMIT000

DiffIR

This project is the official implementation of 'Diffir: Efficient diffusion model for image restoration', ICCV2023

Language:Jupyter Notebook000

fromage

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

Language:Jupyter NotebookApache-2.0000

Lite-Mono

Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation

Language:PythonMIT000

mmdet-learning

Language:PythonApache-2.0000

Far3D

[AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection

NOASSERTION000

mmagic

OpenMMLab Image and Video Restoration, Editing and Generation Toolbox

Language:Jupyter NotebookApache-2.0000

mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

Language:PythonApache-2.0000

mmsegmentation

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Language:PythonApache-2.0000

Multimodal-GPT

Apache-2.0000

Occ3D

MIT000

Occ3DBaseline

CVPR2023-Occupancy-Prediction-Challenge

MIT000

ODISE

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Language:PythonNOASSERTION000

ov-seg

This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.

Language:Jupyter NotebookNOASSERTION000

OVO-Open-Vocabulary-Occupancy

Language:PythonApache-2.0000

PolarFormer

[AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers

MIT000

SAN

Open-vocabulary Semantic Segmentation

Language:PythonMIT000

Semantic-Segment-Anything

Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).

Apache-2.0000

sheffield.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

MIT000

SheffieldCao

Config files for my GitHub profile.

000

stable-dreamfusion

A pytorch implementation of text-to-3D dreamfusion, powered by stable diffusion.

Apache-2.0000

SurroundOcc

Multi-camera 3D Occupancy Prediction for Autonomous Driving

Language:PythonApache-2.0000

UniAD

[CVPR 2023 Best Paper] Planning-oriented Autonomous Driving

Language:PythonApache-2.0000

ViT-Adapter

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

Language:PythonApache-2.0000

VLDet

[ICLR 2023] PyTorch implementation of VLDet （https://arxiv.org/abs/2211.14843）

000

VoxFormer

Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]

Language:PythonNOASSERTION000

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

NOASSERTION000