TRI-ML / dd3d

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

How to train on two datasets with different camera intrinsics together

junjunxia opened this issue

Hi,

Thank you very much for releasing the source code!

I have three basic questions:

  1. Can I train on two datasets (with different camera intrinsics) in one mini-batch by simply passing different intrinsic parameters to the loss function?
  2. In the paper, you mention that images can be resized during training; in which situations do we need to resize images, other than for pre-training?
  3. Can the V2-99 backbone be used in a real-time (more than 25 fps) scenario?

Thanks

Hello @junjunxia, thanks for your interest.

  1. Yes, you should be able to train with two datasets of different resolutions by passing the intrinsics parameters for each dataset item (see the first sketch after this list). There may of course be some degradation due to the domain gap, but DD3D is fairly robust to resolution differences.
  2. Resizing the image is part of the data augmentation, which is generally known to help train a better model, especially if your training data is small (see the second sketch below).
  3. Unfortunately not. V2-99 is a pretty heavy backbone, and it probably won't run in real time.
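
For point 1, here is a minimal sketch of how per-sample intrinsics can travel through a mixed-dataset loader. This is not the actual DD3D data pipeline; the `MonoDataset` class, the `intrinsics` key, and the example calibration values are illustrative assumptions. The point is only that each sample carries its own 3x3 matrix, so a mini-batch may freely mix cameras.

```python
import torch
from torch.utils.data import ConcatDataset, DataLoader, Dataset


class MonoDataset(Dataset):
    """Toy monocular dataset whose every sample carries its own 3x3 intrinsics."""

    def __init__(self, num_images, intrinsics, image_hw):
        self.num_images = num_images
        self.K = torch.as_tensor(intrinsics, dtype=torch.float32)  # 3x3 camera matrix
        self.image_hw = image_hw

    def __len__(self):
        return self.num_images

    def __getitem__(self, idx):
        h, w = self.image_hw
        return {
            "image": torch.rand(3, h, w),   # stand-in for a real image
            "intrinsics": self.K.clone(),   # per-sample calibration
        }


def collate(batch):
    # Keep a list of dicts so each sample's intrinsics stay attached to its image.
    return batch


# Two hypothetical sources with different calibration (values are made up).
K_a = [[721.5, 0.0, 609.6], [0.0, 721.5, 172.9], [0.0, 0.0, 1.0]]
K_b = [[1266.4, 0.0, 816.3], [0.0, 1266.4, 491.5], [0.0, 0.0, 1.0]]

ds_a = MonoDataset(100, K_a, image_hw=(384, 1280))
ds_b = MonoDataset(100, K_b, image_hw=(384, 1280))
loader = DataLoader(ConcatDataset([ds_a, ds_b]), batch_size=8,
                    shuffle=True, collate_fn=collate)

batch = next(iter(loader))
for sample in batch:
    # The 3D head / loss would consume sample["intrinsics"] together with the
    # predictions, so mixed-camera batches need no special handling.
    print(sample["intrinsics"][0, 0].item())  # fx differs depending on the source
```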
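
For point 2, a minimal sketch of resize augmentation for monocular 3D detection: when the image is rescaled, the intrinsics must be rescaled by the same factors so the 2D-3D projection stays consistent. The `resize_with_intrinsics` helper and the interpolation choice are assumptions for illustration, not DD3D's actual augmentation code.

```python
import torch
import torch.nn.functional as F


def resize_with_intrinsics(image, K, scale):
    """Resize a CHW image tensor by `scale` and rescale the 3x3 intrinsics K to match."""
    _, h, w = image.shape
    new_h, new_w = int(round(h * scale)), int(round(w * scale))
    resized = F.interpolate(image.unsqueeze(0), size=(new_h, new_w),
                            mode="bilinear", align_corners=False).squeeze(0)

    K_new = K.clone()
    K_new[0, 0] *= new_w / w   # fx scales with width
    K_new[1, 1] *= new_h / h   # fy scales with height
    K_new[0, 2] *= new_w / w   # cx scales with width
    K_new[1, 2] *= new_h / h   # cy scales with height
    return resized, K_new


image = torch.rand(3, 384, 1280)
K = torch.tensor([[721.5, 0.0, 609.6],
                  [0.0, 721.5, 172.9],
                  [0.0, 0.0, 1.0]])
aug_image, aug_K = resize_with_intrinsics(image, K, scale=0.75)
print(aug_image.shape, aug_K[0, 0].item())  # smaller image, proportionally smaller fx
```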