ImageNet V2 evaluation

Question

samuelstevens opened this issue 9 months ago

One of the proposed benefits of WIT-400M and LAION-400M is that models trained on them are very robust to distribution shift. This is typically measured by comparing zero-shot accuracy on ImageNet with zero-shot accuracy on shifted variants such as ImageNet V2 and ImageNet-R.

Did you evaluate the MetaCLIP models on distribution shifts of ImageNet? Even evaluating on ImageNet V2 alone would give a good idea of the models' robustness. Thanks!
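
For concreteness, here is a rough sketch of the comparison I have in mind: compute top-1 accuracy on the original test set and on the shifted one, then look at the difference. The helper names (`top1_accuracy`, `robustness_gap`) and the tiny prediction lists are made up for illustration and are not real model outputs or part of any MetaCLIP code.

```python
from typing import Sequence


def top1_accuracy(predictions: Sequence[int], labels: Sequence[int]) -> float:
    """Fraction of examples whose predicted class equals the true class."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        raise ValueError("need at least one example")
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def robustness_gap(acc_original: float, acc_shifted: float) -> float:
    """Accuracy lost when moving from the original test set to the shifted one."""
    return acc_original - acc_shifted


# Toy, hypothetical class ids standing in for zero-shot predictions.
imagenet_preds, imagenet_labels = [0, 1, 2, 3], [0, 1, 2, 0]
v2_preds, v2_labels = [0, 1, 0, 3], [0, 1, 2, 0]

acc_in = top1_accuracy(imagenet_preds, imagenet_labels)
acc_v2 = top1_accuracy(v2_preds, v2_labels)
gap = robustness_gap(acc_in, acc_v2)
print(f"ImageNet: {acc_in:.2%}, ImageNet V2: {acc_v2:.2%}, gap: {gap:.2%}")
```

A smaller gap at the same ImageNet accuracy would suggest a more robust model.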

Sam commented 8 months ago

Thanks!

Hu Xu · Answer 1 · Wed Nov 08 2023 10:58:07 GMT+0800 (China Standard Time)

We report ImageNet-variant evaluations, averaged, in Table 8 of the appendix. For ImageNet V2, MetaCLIP gets: L14-400M: 69.8%, L14-1B: 72.5%, L14-2.5B: 72.6% (vs. OpenAI CLIP L14-400M: 69.8%, OpenCLIP L14-400M: 65.4%).
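
To make the comparison easier to read, the numbers in this reply can be tabulated as differences in percentage points. This is only a small arithmetic sketch: the accuracies are copied from the reply, while the `deltas` helper and the dictionary layout are hypothetical. Note that the 1B and 2.5B rows compare against baselines trained on 400M pairs, so only the 400M row is a like-for-like comparison.

```python
# ImageNet V2 zero-shot top-1 accuracy (%), copied from the reply above.
metaclip = {"L14-400M": 69.8, "L14-1B": 72.5, "L14-2.5B": 72.6}
baselines = {"OpenAI CLIP L14-400M": 69.8, "OpenCLIP L14-400M": 65.4}


def deltas(model_acc: float, baseline_accs: dict) -> dict:
    """Percentage-point difference of one model against each baseline."""
    # Round to one decimal to match the precision of the reported numbers.
    return {name: round(model_acc - acc, 1) for name, acc in baseline_accs.items()}


for name, acc in metaclip.items():
    for baseline, diff in deltas(acc, baselines).items():
        print(f"MetaCLIP {name} vs {baseline}: {diff:+.1f} points")
```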