Different evaluation results on nuScenes val
imyyf opened this issue · comments
We used your BeMapNet-SwinT checkpoint to evaluate on nuScenes val, but got results that differ from those on your GitHub page.
Ours: mAP@EASY 70, mAP@HARD 55.3
Yours: mAP@EASY 67, mAP@HARD 49.1
Our test settings and environment:
python==3.8, torch==1.9.1, CUDA==11.4, mmcv-full==1.4.0, pillow==8.1.2, numpy==1.21.6, detectron2(build from source)
We test on a single A6000.
Similar here. We get higher results than those reported in the paper with ResNet-50 (ours: 62, paper: 59.8). The code clearly performs better than the paper reports. Can you explain such a large performance gap?
The performance differences between GPUs arise from numerical errors in the matmul calculations when using the float32 tensor type.
Quick fix: modify the line in tools/evaluation/cd.py to `dist = torch.cdist(source_pc.type(torch.float64), target_pc.type(torch.float64))`. With this change we obtain consistent evaluation results that align with the paper.