rayguan97 / M3DETR

Code base for M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers

Questions on positional embeddings

DianCh opened this issue · comments

Hi! Can you help with a few questions on positional embeddings:

  1. Did you apply any positional embedding in the Transformer attention?
  2. If so, how did you design the positional embeddings for the different types of representations and the multi-scale features, and how did you apply them (i.e., how should the formula be written)?
  3. If not, what was the consideration behind that choice? Why is it not needed?

Thank you! I look forward to your reply and the code release.

Thanks for your interest in our work. We apologize for the late reply.

We don't use positional encoding in our attention module. We believe that positional embeddings would very likely improve performance, but our work mainly focuses on using the transformer as a fusion method and on making the whole architecture work for the 3D object detection task.
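
For illustration only, below is a minimal PyTorch sketch of how a learned per-representation embedding could be added to the fused tokens before the attention step. This is not taken from the M3DETR code; the module name `FusionAttentionWithPE` and parameters such as `n_sources` and `source_embed` are made up for this example, and it simply uses `nn.MultiheadAttention` as a stand-in for the fusion transformer.

```python
import torch
import torch.nn as nn


class FusionAttentionWithPE(nn.Module):
    """Hypothetical sketch: self-attention over fused tokens from several
    representations/scales, with an optional learned embedding per token source.
    Not the M3DETR implementation."""

    def __init__(self, d_model=256, n_heads=4, n_sources=3):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        # One learned embedding per feature source (e.g., raw points, voxels, BEV).
        self.source_embed = nn.Embedding(n_sources, d_model)

    def forward(self, tokens, source_ids, use_pe=True):
        # tokens:     (B, N, d_model) fused features from all representations/scales
        # source_ids: (N,) integer id of the representation each token came from
        if use_pe:
            tokens = tokens + self.source_embed(source_ids)  # broadcast over batch
        out, _ = self.attn(tokens, tokens, tokens)
        return out


if __name__ == "__main__":
    B, N, d = 2, 6, 256
    x = torch.randn(B, N, d)
    ids = torch.tensor([0, 0, 1, 1, 2, 2])  # which representation each token belongs to
    block = FusionAttentionWithPE(d_model=d, n_heads=4, n_sources=3)
    y_pe = block(x, ids)                    # with the extra source embedding
    y_plain = block(x, ids, use_pe=False)   # as in the paper: no positional encoding
    print(y_pe.shape, y_plain.shape)
```

With `use_pe=False` the block reduces to plain attention over the fused features, which matches the setting described above; the embedding path is only a guess at one way positional or source information could be injected.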