Bencheng Liao1,2,3 *, Shaoyu Chen1,3 *, Xinggang Wang1 📧, Tianheng Cheng1,3, Qian Zhang3, Wenyu Liu1, Chang Huang3
1 School of EIC, HUST, 2 Institute of Artificial Intelligence, HUST, 3 Horizon Robotics
(*) equal contribution, (📧) corresponding author.
ArXiv Preprint (arXiv 2208.14437)
openreview ICLR'23, accepted as ICLR Spotlight
May. 12th, 2023
: MapTR now support various bevencoder, such as BEVFormer encoder and BEVFusion bevpool. Check it out!Apr. 20th, 2023
: Extending MapTR to a general map annotation framework (paper), with high flexibility in terms of spatial scale and element type.Mar. 22nd, 2023
: By leveraging MapTR, VAD (paper, code) models the driving scene as fully vectorized representation, achieving SoTA end-to-end planning performance!Jan. 21st, 2023
: MapTR is accepted to ICLR 2023 as Spotlight Presentation!Nov. 11st, 2022
: We release an initial version of MapTR.Aug. 31st, 2022
: We released our paper on Arxiv. Code/Models are coming soon. Please stay tuned! ☕️
We present MapTR, a structured end-to-end framework for efficient online vectorized HD map construction.
We propose a unified permutation-based modeling approach,
ie, modeling map element as a point set with a group of equivalent permutations, which avoids the definition ambiguity of map element and eases learning.
We adopt a hierarchical query embedding scheme to flexibly encode structured map information and perform hierarchical bipartite matching for map element learning. MapTR achieves the best performance and efficiency among existing vectorized map construction approaches on nuScenes dataset. In particular, MapTR-nano runs at real-time inference speed (
Results from the paper
Method | Backbone | BEVEncoder | Lr Schd | mAP | FPS | memroy |
---|---|---|---|---|---|---|
MapTR-nano | R18 | GKT | 110ep | 44.2 | 25.1 | 11907M (bs 24) |
MapTR-tiny | R50 | GKT | 24ep | 50.3 | 11.2 | 10287M (bs 4) |
MapTR-tiny | R50 | GKT | 110ep | 58.7 | 11.2 | 10287M (bs 4) |
Notes:
- FPS is measured on NVIDIA RTX3090 GPU with batch size of 1 (containing 6 view images).
- All the experiments are performed on 8 NVIDIA GeForce RTX 3090 GPUs.
Results from this repo. FPSs are much higher.
Method | Backbone | BEVEncoder | Lr Schd | mAP | FPS | memroy | Config | Download |
---|---|---|---|---|---|---|---|---|
MapTR-nano | R18 | GKT | 110ep | 46.3 | 48.2 | 11907M (bs 24) | config | model / log |
MapTR-tiny | R50 | GKT | 24ep | 50.0 | 18.4 | 10287M (bs 4) | config | model / log |
MapTR-tiny | R50 | GKT | 110ep | 59.3 | 18.4 | 10287M (bs 4) | config | model / log |
MapTR-tiny | Camera & LiDAR | GKT | 24ep | 62.7 | 6.0 | 11858M (bs 4) | config | model / log |
MapTR-tiny | R50 | bevpool | 24ep | 50.1 | 17.2 | 9817M (bs 4) | config | model / log |
MapTR-tiny | R50 | bevformer | 24ep | 48.7 | 18.1 | 10219M (bs 4) | config | model / log |
sunny.cloudy_short.mp4
rainy_short.mp4
night_short.mp4
e2e_planning.mp4
- centerline detection & topology support
- multi-modal checkpoints
- multi-modal code
- lidar modality code
- argoverse2 dataset
- Nuscenes dataset
- MapTR checkpoints
- MapTR code
- Initialization
MapTR is based on mmdetection3d. It is also greatly inspired by the following outstanding contributions to the open-source community: BEVFusion, BEVFormer, HDMapNet, GKT, VectorMapNet.
If you find MapTR is useful in your research or applications, please consider giving us a star 🌟 and citing it by the following BibTeX entry.
@inproceedings{MapTR,
title={MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction},
author={Liao, Bencheng and Chen, Shaoyu and Wang, Xinggang and Cheng, Tianheng, and Zhang, Qian and Liu, Wenyu and Huang, Chang},
booktitle={International Conference on Learning Representations},
year={2023}
}