SHI-Labs / Neighborhood-Attention-Transformer

Neighborhood Attention Transformer, arXiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arXiv 2022


No position encoding? Could you explain some of your thoughts?

laisimiao opened this issue · comments


Thank you for your interest.
It is common practice in hierarchical vision transformers that use local attention with relative positional biases (e.g., Swin) not to use absolute positional encoding, and we simply followed that convention.
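To illustrate the idea the answer refers to, here is a minimal numpy sketch of a relative positional bias for 1-D neighborhood attention. The names (`kernel_size`, `bias_table`, `neighborhood_bias`) are illustrative assumptions, not the repository's actual API: the point is only that the bias depends on the relative offset between query and key, so no absolute positional encoding is needed.

```python
import numpy as np

# Hedged sketch, not the repo's implementation: a learnable bias table
# with one entry per relative offset in [-(k-1), k-1], i.e. 2k-1 entries.
kernel_size = 7
rng = np.random.default_rng(0)
bias_table = rng.standard_normal(2 * kernel_size - 1)

def neighborhood_bias(kernel_size, bias_table):
    # Each query attends to kernel_size neighbors centered on itself.
    # The bias is looked up by relative offset only, so every query
    # position shares the same bias vector (translation invariance).
    offsets = np.arange(kernel_size) - kernel_size // 2  # e.g. -3..3
    return bias_table[offsets + (kernel_size - 1)]       # shape (kernel_size,)

# Add the bias to raw attention logits of shape (num_queries, kernel_size).
attn_logits = rng.standard_normal((10, kernel_size))
attn_logits = attn_logits + neighborhood_bias(kernel_size, bias_table)
```

Because the lookup ignores absolute position, the same bias vector is broadcast across all query rows, which is exactly why an additional absolute positional encoding is unnecessary in this setup.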

Closing this due to inactivity. If you still have questions feel free to open it back up.