paper question

Question

paper question

LemonWade opened this issue a year ago · comments

Thank you very much for the work you've done. May I ask a question? Can I interpret your work as being based on multi-view? I'm curious about the primary difference between your approach and multi-view. If someone were to emulate your work using just multi-view for their experiments, would they outperform you?

Sorry for the disturbance and thank you in advance.

Ankit Goyal · Answer 1 · Thu Aug 10 2023 00:40:35 GMT+0800 (China Standard Time)

Hi, Thanks for your interest in our work.

I am not sure if I understand the question correctly. Would you please explain what do mean by “Can I interpret your work as being based on multi-view? I'm curious about the primary difference between your approach and multi-view.” Specifically, what do you mean by multi-view.

By multi-view, do you mean a network with direct multi-view image input and no re-rendering? If so, there are two sets of disadvantages to this:

Maintaining a setup with five cameras positioned at different angles (shown in Fig 3.) is hard. On the other hand, our current system can work with even with one RGBD sensor (as done in our real world experiments) and hence easier to use.
Directly using multi-view images without any re-rendering would prevent us from using orthographic projection, 3D augmentation and point correspondence. All these significantly boost performance (Table 2 Left) but require re-rendering.

Let me know if I understood your question correctly and if this helps.

Zzy · Answer 2 · Thu Aug 10 2023 09:01:30 GMT+0800 (China Standard Time)

This is exactly the answer I was looking for. Thank you!

FinnJob · Answer 3 · Thu Aug 10 2023 09:26:19 GMT+0800 (China Standard Time)

Thank you for your excellent work!
I would like to inquire about how the occlusion problem is addressed when there is only one RGBD sensor in the system.
Thanks!

Ankit Goyal · Answer 4 · Fri Aug 11 2023 01:46:29 GMT+0800 (China Standard Time)

Hi @FinnJob,

Thanks for the kind words. We didn't face any occlusion issues on the tasks we tested on. I suppose with more cluttered scenes this could be an issue, but the framework is flexible enough to allow adding an additional camera if need be.

FinnJob · Answer 5 · Fri Aug 11 2023 08:58:31 GMT+0800 (China Standard Time)

Hi @FinnJob,

Thanks for the kind words. We didn't face any occlusion issues on the tasks we tested on. I suppose with more cluttered scenes this could be an issue, but the framework is flexible enough to allow adding an additional camera if need be.

Thanks, it helps a lot!