How to make imVoxelNet support multi-classes in nuScenes dataset?

Question

How to make imVoxelNet support multi-classes in nuScenes dataset?

XinchaoGou opened this issue 2 years ago · comments

Hi. @filaPro
Thanks for sharing the code.
I noticed that your original paper only mentioned results of "car" in nuScenes.
I want to see how it performs under multi-classes. I modified this line to make the network output support 10 classes.
https://github.com/saic-vul/imvoxelnet/blob/3512e89ca98e48aebb21a4c9e9fbe5037220b3a4/configs/imvoxelnet/imvoxelnet_nuscenes.py#L26

I modified it to num_classes=10,
But still I only get results for single class "car". The other classes are all 0 for mAP. Did you tired this before? Can you help me?

Danila Rukhovich · Answer 1 · Wed Dec 15 2021 15:08:16 GMT+0800 (China Standard Time)

Hi @XinchaoGou ,

We also removed other classes from NuScenesMultiViewDataset. To revert them back you also need to comment these lines.

Also the performance will be rather poor if we tried only single-scale head for car detection on outdoor KITTI and nuScenes datasets. However for 10 classes it will be much better with 3 scales as in PointPillars.

XinchaoGou · Answer 2 · Thu Dec 16 2021 11:13:49 GMT+0800 (China Standard Time)

Hi @filaPro Thanks a lot for your help!
I modified the code as you suggested. The experiment is running. By the way, have you ever ran this experiment? Could you share your result for multi-classes as reference?

Danila Rukhovich · Answer 3 · Thu Dec 16 2021 15:18:09 GMT+0800 (China Standard Time)

Probably I tried something like this, but for sure don't have any logs now.

XinchaoGou · Answer 4 · Fri Dec 17 2021 16:26:46 GMT+0800 (China Standard Time)

Hi @filaPro , I have the result. but the scores for other classes are very low. What could be the cause? Is it due to the size of anchor? Do you have any suggestions to improve the performance for multi-classes?

Danila Rukhovich · Answer 5 · Fri Dec 17 2021 18:14:56 GMT+0800 (China Standard Time)

Yes, current anchors are well enogh only for cat category. You need much more scales and anchors for all classes of nuScenes dataset. For better performance you probably need to redesign neck a bit to return feature maps in 3 levels.

XinchaoGou · Answer 6 · Fri Dec 17 2021 19:19:24 GMT+0800 (China Standard Time)

Thanks a lot for your help!