ImageNet100 very low accuracy

Question

ImageNet100 very low accuracy

Evgeneus opened this issue 3 years ago · comments

Dear authors,

I would like to run T2T on ImageNet100 on 2 gpus. But I have gotten just 8.5 in top-1 accuracy after 200 epochs! Also the train loss is high. Do you know what can be a reason for that?

I changed the number of classes in the train file (to match 100 classes)
running script:
OMP_NUM_THREADS=16 CUDA_VISIBLE_DEVICES=0,1 bash distributed_train.sh 2 /data/datasets/imagenet-100/ --model T2t_vit_14 -b 128 --lr 1e-3 --weight-decay .03 --cutmix 0.0 --reprob 0.25 --img-size 224
some outputs:
epoch,train_loss,eval_loss,eval_top1,eval_top5 194,4.363854191519997,4.067602333831787,8.519999993896484,26.46000007324219 195,4.340610720894554,4.064138192749024,8.59999998779297,26.379999963378907

YuanLi · Answer 1 · Wed Mar 24 2021 17:23:15 GMT+0800 (China Standard Time)

Hi,

We also trained our T2T-ViT on other datasets like CIFAR100 from scratch, and got reasonable results (77%-80%). So I am not sure why your training not work on ImageNet100 without enough information.

You can also borrow some training method from our transfer learning or other implementations like this one, which only train 60 epoches but still achieve accuracy > 70%.