asyml / vision-transformer-pytorch

PyTorch version of Vision Transformer (ViT) with pretrained models. This is part of the CASL (https://casl-project.github.io/) and ASYML projects.

Home Page: https://asyml.io/

Image resolution for training

a-maiti opened this issue

What image resolution was used for training on ImageNet? The paper says 224, but it seems 384 was used in this code?

Never mind, got it. It is correct.