AssertionError happens when loading the model in the workflow of “finetuning-then-linprobing”.

Question

AssertionError happens when loading the model in the workflow of “finetuning-then-linprobing”.

uk9921 opened this issue a year ago · comments

I used the script main_finetune.py to finetune the pretrained model, and the process went very smoothly. However, when I tried to load the finetuned model and train a linear probe task, I got this AssertionError:
File "main_linprobe.py", line 203, in main assert set(msg.missing_keys) == {'head.weight', 'head.bias', 'fc_norm.weight', 'fc_norm.bias'}
I printed the msg.missing_keys and got msg.missing_keys = []

So, I wonder if we need to assert the missing keys when we try to load the finetuned model?

Kuqs · Answer 1 · Thu May 18 2023 20:35:00 GMT+0800 (China Standard Time)

Here is my training args

    main_linprobe.py \
    --batch_size 128 \
    --model vit_large_patch16 \
    --global_pool \
    --finetune ${PRETRAIN_CHKPT} \
    --epochs 90 \
    --blr 0.05 \
    --weight_decay 0.0 \
    --output_dir ${OUTPUT_DIR} \
    --data_path ${IMAGENET_DIR} \
    --dist_eval

Tianhong Li · Answer 2 · Thu May 18 2023 22:25:27 GMT+0800 (China Standard Time)

The linear probing script is designed to load a pre-trained model (without the linear classification head and norm layer before it). Therefore, we have an assertion there to make sure the loaded model does not have those parameters. If you want to linear probe a model with the classification head, I think you can simply comment out the assertion.

Kuqs · Answer 3 · Thu May 18 2023 23:52:58 GMT+0800 (China Standard Time)

Thank you for your reply, I followed your suggestion and achieve a higher top1 acc than directly loading the pre-trained model.