Training token model.... where is the regression token?

Question

Training token model.... where is the regression token?

JavierUrenaPhDProjects opened this issue a year ago · comments

JavierUrenaPhDProjects commented a year ago

So im playing with this model around to see exactly how it works at code level, and as far as I know the 'token' model uses a regression token to the input sequence Z0 for the counting, creating a size of HW/K² + 1 input in the regression head (being K the number of patches, HW the dimensions of the image). But i am not able to recognize the explicit difference between the 'token' and 'gap' regression heads inputs in the code.

Could you give me more explanation on how this "regression token" is created and where? and what it is exactly? the paper does not give much enough information about it...

Dingkang Liang · Answer 1 · Thu Jun 15 2023 17:32:22 GMT+0800 (China Standard Time)

Please see the ViT for more detail (class token)