huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Home Page: https://huggingface.co/docs/timm

[BUG] reg_token not working for ViT models

Tgaaly opened this issue

Describe the bug
If I try to create a 'vit_base_patch16_384' model, for example, and set the argument reg_token=4 (to add 4 register tokens to the model, per this paper: https://arxiv.org/pdf/2309.16588.pdf), the model fails to instantiate with a size-mismatch error raised from timm/layers/pos_embed.py, line 45. I hope I'm not missing something; I understand if this is not a supported feature yet.

To Reproduce
Steps to reproduce the behavior:

  1. Create a vit_base_patch16_384 model and pass the argument reg_token=4 (see the sketch below).
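
For concreteness, a minimal sketch of the failing call. Note the register-count argument is spelled `reg_tokens` on timm's VisionTransformer in recent versions; the exact parameter name may differ across releases:

```python
import timm

# Sketch of the reported failure (argument name assumed to be reg_tokens).
# With pretrained=True this raises a size-mismatch error: the checkpoint's
# position embedding was saved without register tokens, so it no longer
# lines up with the modified architecture.
model = timm.create_model('vit_base_patch16_384', pretrained=True, reg_tokens=4)
```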

Expected behavior
I would expect the model to be built/instantiated correctly.

@Tgaaly you can't use pretrained=True when adding reg tokens to an existing model def, since it changes the model architecture. It would be possible to add extra code to allow it, but I hacked it in and tried it, and there was a pretty big drop in performance, so I don't feel the extra code/maintenance overhead is warranted.

The current intent is to allow training/defining new models with reg tokens enabled. There are weights for the dinov2 ones, and I have a few smaller ViTs being trained right now with reg tokens.
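
A sketch of that supported path, assuming the `reg_tokens` argument and the DINOv2 register model names currently registered in timm:

```python
import timm

# Option 1: define a fresh model with register tokens (no checkpoint loaded)
# and run your own pretraining/training.
model = timm.create_model('vit_base_patch16_384', pretrained=False, reg_tokens=4)

# Option 2: load weights that were pretrained *with* registers, e.g. the
# DINOv2 register variants.
model = timm.create_model('vit_base_patch14_reg4_dinov2.lvd142m', pretrained=True)
```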

Also, I feel using a reg-token backbone that hasn't been pretrained with reg tokens would sort of defeat the purpose... it's a fairly fundamental change, and you'd want to do the pretraining with them.

Ah, that's right, makes sense. Thank you so much for your responses.