Attention maps rectangular input
lucaswannen opened this issue · comments
lucaswannen commented
Hi,
How can I get the attention map for rectangular images? It seems that in every case the attention output has square dimensions.
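A note on why the output looks square: self-attention weights form a token-by-token matrix, so each head gives a square (1 + n_patches) x (1 + n_patches) matrix regardless of the image's aspect ratio. The rectangular map usually comes from taking one row, typically the CLS row, and reshaping it to the patch grid. Below is a minimal sketch of that reshape in plain Python. It assumes that the CLS token sits at index 0 and that patches are flattened in row-major order. The helper names `patch_grid_shape` and `cls_attention_to_grid` are hypothetical and not part of any library.

```python
def patch_grid_shape(image_h, image_w, patch_size):
    """Return (rows, cols) of the patch grid for an image_h x image_w input."""
    if image_h % patch_size or image_w % patch_size:
        raise ValueError("image dimensions must be divisible by patch_size")
    return image_h // patch_size, image_w // patch_size


def cls_attention_to_grid(attn, image_h, image_w, patch_size):
    """Reshape the CLS row of one head's attention matrix into a patch grid.

    attn is a square nested list of size (1 + n_patches) x (1 + n_patches).
    Assumption: the CLS token is at index 0 and patches are in row-major order.
    """
    rows, cols = patch_grid_shape(image_h, image_w, patch_size)
    n_tokens = 1 + rows * cols
    if len(attn) != n_tokens or any(len(row) != n_tokens for row in attn):
        raise ValueError("attention matrix does not match the patch grid")
    # Attention from CLS to every patch, skipping the CLS-to-CLS entry.
    cls_to_patches = attn[0][1:]
    # Cut the flat list of patch weights into `rows` rows of `cols` entries.
    return [cls_to_patches[r * cols:(r + 1) * cols] for r in range(rows)]


# Example: a 64 x 96 image with 32-pixel patches has a 2 x 3 grid, so 7 tokens.
n_tokens = 7
uniform = [[1.0 / n_tokens] * n_tokens for _ in range(n_tokens)]
grid = cls_attention_to_grid(uniform, 64, 96, 32)
print(len(grid), len(grid[0]))  # 2 3
```

With a tensor instead of nested lists, the same step would be slicing the CLS row and reshaping it to (rows, cols). The resulting grid can then be upsampled to the image size for overlaying.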