kakaobrain / miro

Official PyTorch implementation of MIRO (ECCV 2022)

Question about applying MIRO to ViT

dltkddn0525 opened this issue · comments

Hello, thanks for sharing your great work.

Currently, I'm trying to apply MIRO to a ViT architecture and have some questions about it. According to the paper, you applied MIRO to CLIP ViT, and I found in the appendix that you tuned only lambda for such non-main experiments. Does that mean you used a learning rate of 5e-05, no dropout, and no weight decay with the Adam optimizer? If not, could you please share the algorithm-agnostic hyperparameters you used for the CLIP ViT experiments?

Also, I was wondering whether you have ever applied MIRO to an ImageNet-pretrained ViT (for example, torchvision.models.vit_b_16) instead of CLIP.

Yes, and no.
For the CLIP ViT experiments, we used a learning rate of 5e-5, no dropout, and no weight decay.
Also, we have not applied MIRO to ImageNet-pretrained ViT.
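For reference, those algorithm-agnostic settings correspond to an Adam optimizer configured roughly as follows. This is a minimal sketch, not the repository's actual training code; the placeholder model stands in for the CLIP ViT-based featurizer/classifier.

```python
import torch

# Placeholder network standing in for the CLIP ViT-based model;
# in the actual code the parameters come from the algorithm's networks.
model = torch.nn.Linear(512, 10)

# Hyperparameters reported above for the CLIP ViT experiments:
# learning rate 5e-5, weight decay 0 (dropout is disabled in the
# network itself, not in the optimizer).
optimizer = torch.optim.Adam(
    model.parameters(),
    lr=5e-5,
    weight_decay=0.0,
)
```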

Thanks for the kind reply!