Orthographic mask in depth rectification network

Question

Orthographic mask in depth rectification network

RohanChacko opened this issue 4 years ago · comments

Hi,
Interesting work. The supplementary section mentions that the front-view depth rectification network outputs a 1D rectified depth image and a 1D binary mask of the orthographic view. How does the mask help in learning depth image? Would I get the same results if I train the network without predicting the orthographic mask?

Lizhen Wang · Answer 1 · Sun Sep 13 2020 22:12:43 GMT+0800 (China Standard Time)

Thank you for your interest! As we can not directly get the orthographic mask from the input perspective RGBD images, I think it might be necessary to predict the orthographic mask. Moreover, the orthographic masks act as prior information of input images without background during the training of our discriminators. But if you train your images only in the perspective view, I think you might generally get similar results.

Rohan Chacko · Answer 2 · Sun Sep 13 2020 22:31:32 GMT+0800 (China Standard Time)

Thanks for the clarification!