resizing images for feature visualization

Question

kahnchana opened this issue 3 years ago

Hi, I'm really interested in this work, and was looking at the feature visualization section.

In this code, how do you feed larger images to the model (e.g. 512×512 images to a ViT trained at 384×384)? Do you make any modifications?

YuanLi · Answer 1 · Sun May 09 2021 12:45:49 GMT+0800 (China Standard Time)

Hi,

You can interpolate the position embedding for a different image size with the function here.

Or use T2T-ViT directly as described in the usage section; the interpolation is already built into the 'load_for_transfer_learning' function.
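
For readers who want to see what interpolating the position embedding involves, here is a minimal pure-Python sketch. It is not the repository's function: the name `resize_pos_embed`, its parameters, the single leading class token, and the choice of bilinear interpolation with aligned corners are all assumptions made for illustration. It also assumes a 16-pixel patch stride, so a 384 input gives a 24×24 token grid and a 512 input gives a 32×32 grid.

```python
from typing import List

Vector = List[float]


def resize_pos_embed(
    pos_embed: List[Vector],
    old_grid: int,
    new_grid: int,
    num_prefix_tokens: int = 1,
) -> List[Vector]:
    """Bilinearly resize a square grid of position embeddings.

    Hypothetical helper for illustration only. `pos_embed` holds
    `num_prefix_tokens` leading tokens (assumed: one class token)
    followed by old_grid * old_grid patch tokens in row-major order.
    """
    prefix = pos_embed[:num_prefix_tokens]
    grid_tokens = pos_embed[num_prefix_tokens:]
    if len(grid_tokens) != old_grid * old_grid:
        raise ValueError("pos_embed length does not match old_grid")

    def at(row: int, col: int) -> Vector:
        return grid_tokens[row * old_grid + col]

    def lerp(a: Vector, b: Vector, w: float) -> Vector:
        return [(1.0 - w) * x + w * y for x, y in zip(a, b)]

    # Map new grid indices onto old grid coordinates so that the
    # corner tokens of both grids coincide (aligned corners).
    scale = (old_grid - 1) / (new_grid - 1) if new_grid > 1 else 0.0

    resized: List[Vector] = []
    for i in range(new_grid):
        y = i * scale
        y0 = min(int(y), old_grid - 1)
        y1 = min(y0 + 1, old_grid - 1)
        wy = y - y0
        for j in range(new_grid):
            x = j * scale
            x0 = min(int(x), old_grid - 1)
            x1 = min(x0 + 1, old_grid - 1)
            wx = x - x0
            top = lerp(at(y0, x0), at(y0, x1), wx)
            bottom = lerp(at(y1, x0), at(y1, x1), wx)
            resized.append(lerp(top, bottom, wy))

    # Prefix tokens (e.g. the class token) carry no spatial position,
    # so they are copied through unchanged.
    return prefix + resized


# Usage: adapt a 384-input embedding (24x24 grid) to 512 inputs (32x32 grid).
embed_dim = 4  # tiny dimension, just for the example
pos_384 = [[0.0] * embed_dim for _ in range(1 + 24 * 24)]
pos_512 = resize_pos_embed(pos_384, old_grid=24, new_grid=32)
```

In practice the embedding is a tensor and the resize is done with the framework's own interpolation routine; the loop above only spells out the idea that patch tokens are treated as a 2D grid and resampled while the class token is left alone.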

Kanchana Ranasinghe · Answer 2 · Sun May 09 2021 15:29:28 GMT+0800 (China Standard Time)

Thanks a lot for the info. Got it working.