wvangansbeke / Sparse-Depth-Completion

Predict dense depth maps from sparse and noisy LiDAR frames guided by RGB images. (Ranked 1st place on KITTI) [MVA 2019]

Home Page: https://arxiv.org/pdf/1902.05356.pdf


Why are the confidence map and guidance map able to correct mistakes?

jiangwei221 opened this issue · comments

Hi,
Looking forward to the release!

I have some questions about the paper.

  1. I don't understand why the confidence (uncertainty) map and guidance map are able to correct mistakes in the ground truth. In a general setting, guided or unguided, I would expect a CNN to handle a small amount of error in the ground truth on its own.

  2. How many channels does the guidance map have (one of the global net's outputs)? The figure says 1216x256x1; did you try increasing the number of channels, and how did that perform?

  3. As for the ERFNet pretrained on the Cityscapes dataset, what was the setting for the pretraining? And did you try depth completion without pretraining?

Thank you!

Hi,

  1. It all depends on the receptive field. The global network can detect global changes due to its large receptive field, whereas the local network (with a small receptive field) only needs to perform some sort of interpolation. It is important to know that the input LiDAR frame contains local mistakes which are hard to detect with a small network. The global network can inform the local network which LiDAR points seem inconsistent.

  2. You can use more than one channel if you want, but I don't think it matters that much.

  3. I downloaded a pretrained network that was accessible from the ERFNet GitHub page, but I don't know if it is still available. Pretraining worked slightly better in my case, though.

Hope this helps.
Kind regards,
Wouter.
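To make point 1 concrete, here is a minimal sketch of confidence-weighted late fusion: each branch predicts a depth map plus a per-pixel confidence map, and a pixel-wise softmax over the confidences decides how much each branch contributes. All names and values below are illustrative assumptions, not the authors' exact implementation.

```python
import numpy as np

def fuse_predictions(depth_global, conf_global, depth_local, conf_local):
    """Confidence-weighted late fusion of two depth predictions (a sketch).

    A pixel-wise softmax over the two confidence maps yields fusion
    weights, so a branch that distrusts a pixel (e.g. a spurious LiDAR
    point) contributes less to the final depth at that pixel.
    """
    # Stack confidences: shape (2, H, W), then softmax over the branch axis.
    c = np.stack([conf_global, conf_local])
    w = np.exp(c - c.max(axis=0, keepdims=True))   # subtract max for stability
    w /= w.sum(axis=0, keepdims=True)
    return w[0] * depth_global + w[1] * depth_local

# Toy example: the local branch copied a spurious LiDAR point (50 m),
# but assigns it low confidence, so the fused value leans toward the
# global estimate (10 m) at that pixel.
depth_global = np.full((2, 2), 10.0)            # global depth estimate (m)
depth_local = np.array([[10.0, 10.0],
                        [10.0, 50.0]])          # local estimate, one outlier
conf_global = np.zeros((2, 2))                  # neutral global confidence
conf_local = np.array([[ 3.0,  3.0],
                       [ 3.0, -3.0]])           # low confidence at outlier
fused = fuse_predictions(depth_global, conf_global, depth_local, conf_local)
```

At the outlier pixel the local weight is softmax(0, -3) ≈ 0.047, so the fused depth stays close to 10 m instead of jumping to 50 m.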

Thanks for the detailed explanation! It's very helpful!