qcraftai/distill-bev

3d-object-detection bev knowledge-distillation lidar multi-camera nuscenes point-cloud self-driving autonomous-driving distillation cross-modal multi-modal

DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation

Zeyu Wang, Dingwen Li, Chenxu Luo, Cihang Xie, Xiaodong Yang
DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation, ICCV 2023

Get Started

Installtion

Please refer to INSTALL to set up the environment and install the dependencies (see more details in Dockerfile).

Data Preparation

Please follow the instructions in DATA.

Training and Evaluation

Please follow the instructions in RUN.

Main Results

BEVDepth and BEVFormer are respectively used in this repo as CNNs and Transformers based students to exemplify the improvement by DistillBEV.

Backbone: ResNet-50 pre-trained on ImageNet-1K

Teacher	Student	mAP	NDS	Checkpoints
-	BEVDepth	36.4	48.4	[Google Drive] [Baidu Cloud]
CenterPoint	BEVDepth	39.0	50.6	[Google Drive] [Baidu Cloud]
MVP	BEVDepth	40.3	51.0	[Google Drive] [Baidu Cloud]

Teacher	Student	mAP	NDS	Checkpoints
-	BEVFormer	32.3	43.4	[Google Drive] [Baidu Cloud]
CenterPoint	BEVFormer	35.9	46.8	[Google Drive] [Baidu Cloud]
MVP	BEVFormer	36.7	47.6	[Google Drive] [Baidu Cloud]

Backbone: ResNet-101 pre-trained on ImageNet-1K

Teacher	Student	mAP	NDS	Checkpoints
-	BEVDepth	40.7	52.2	[Google Drive] [Baidu Cloud]
CenterPoint	BEVDepth	43.6	53.6	[Google Drive] [Baidu Cloud]
MVP	BEVDepth	45.0	54.6	[Google Drive] [Baidu Cloud]

Teacher	Student	mAP	NDS	Checkpoints
-	BEVFormer	34.9	46.0	[Google Drive] [Baidu Cloud]
CenterPoint	BEVFormer	37.4	48.2	[Google Drive] [Baidu Cloud]
MVP	BEVFormer	38.2	49.1	[Google Drive] [Baidu Cloud]

Citation

Please cite the following paper if this repo helps your research:

@InProceedings{Wang_2023_ICCV,
    author    = {Wang, Zeyu and Li, Dingwen and Luo, Chenxu and Xie, Cihang and Yang, Xiaodong},
    title     = {DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation},
    booktitle = {IEEE/CVF International Conference on Computer Vision (ICCV)},
    year      = {2023},
}

Acknowledgement

We thank the authors for the multiple great open-sourced repos, including MMDetection3D, CenterPoint, BEVDet, BEVDepth and BEVFormer.

License

Copyright (C) 2023 QCraft. All rights reserved. Licensed under the CC BY-NC-SA 4.0 (Attribution-NonCommercial-ShareAlike 4.0 International). The code is released for academic research use only. For commercial use, please contact business@qcraft.ai.

About

DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)

3d-object-detection bev knowledge-distillation lidar multi-camera nuscenes point-cloud self-driving autonomous-driving distillation cross-modal multi-modal

Languages

Language:Python 87.4%Language:C++ 6.8%Language:Cuda 4.7%Language:Shell 1.0%Language:Dockerfile 0.0%