annopackage / Awesome-BEV-Perception-Multi-Cameras

Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird-Eye-View, such as DETR3D, BEVDet, BEVFormer

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Awesome BEV Perception from Multi-Cameras

ECCV 2020

  • Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D [paper] [Github]

CoRL 2021

  • DETR3D: 3D Object Detection from Multi-view Images via 3D-to-2D Queries [paper] [Github]

ICCV 2021

  • FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras [paper] [Github]

CVPR 2022

  • CVT: Cross-view Transformers for real-time Map-view Semantic Segmentation [paper] [Github]

ICRA 2022

ACMM 2022

  • Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection [paper]

ECCV 2022

  • BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers [paper] [Github]
  • PETR: Position Embedding Transformation for Multi-View 3D Object Detection [paper][Github]
  • ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning [paper][Github]
  • SpatialDETR: Robust Scalable Transformer-Based 3D Object Detection from Multi-View Camera Images with Global Cross-Sensor Attention[paper] [Github]

CoRL 2022

  • LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation [paper] [Github]

WACV 2023

  • BEVSegFormer: Bird’s Eye View Semantic Segmentation From Arbitrary Camera Rigs [paper]

Arxiv 2022

  • BEVDet: High-Performance Multi-Camera 3D Object Detection in Bird-Eye-View [paper] [Github]
  • BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object Detection [paper]
  • PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images [paper][Github]
  • M2BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation [paper]
  • BEVerse: Unified Perception and Prediction in Birds-Eye-View for Vision-Centric Autonomous Driving [paper] [Github]
  • PolarDETR: Polar Parametrization for Vision-based Surround-View 3D Detection[paper] [Github]
  • PolarFormer: Multi-camera 3D Object Detection with Polar Transformers[paper] [Github]
  • CrossDTR: Cross-view and Depth-guided Transformers for 3D Object Detection[paper] [Github]
  • Real4d [Github]
  • Inspur-MASTER-3D [Github]
  • BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection [paper][Github]
  • A Simple Baseline for BEV Perception Without LiDAR [paper] [Github]
  • BEVStereo: Enhancing Depth Estimation in Multi-view 3D Object Detection with Dynamic Temporal Stereo [paper] [Github]
  • STS: Surround-view Temporal Stereo for Multi-view 3D Detection [paper]

HD Map Construction

  • HDMapNet: An Online HD Map Construction and Evaluation Framework [paper] [Github]
  • MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction [paper] [Github]

Multi-sensor fusion

  • FUTR3D: A Unified Sensor Fusion Framework for 3D Detection [paper] [Github]
  • BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework [paper] [Github]
  • Unifying Voxel-based Representation with Transformer for 3D Object Detection [paper] [Github]
  • BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation [paper] [Github]

Survey

  • Vision-Centric BEV Perception: A Survey [paper] [Github]
  • BEVPerception-Survey-Recipe [Github]

others

  • Focal Sparse Convolutional Networks for 3D Object Detection [paper] [Github]
  • Voxel Field Fusion for 3D Object Detection [paper] [Github]
  • Scaling up Kernels in 3D CNNs [paper] [Github]

About

Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird-Eye-View, such as DETR3D, BEVDet, BEVFormer