regarding Motion-aware Layer Normalization

Question

regarding Motion-aware Layer Normalization

RashoAli opened this issue 7 months ago · comments

Dear author,

Thank you for the great work and the very interesting approach. I am trying to understand the "Motion-aware Layer Normalization" part of the model:

Why does the model calculate 'beta' and 'gamma'? What do these parameters represent? And why add and multiply these parameters with the queries afterward?
For pose, time, and velocity: Are these parameters extracted from the detected queries, or are they part of the ground truth (GT)?

exiawsh · Answer 1 · Thu Jan 11 2024 15:40:06 GMT+0800 (China Standard Time)

@RashoAli beta and gamma are learnable parameters that implicitly containing the motion information.
pose and time are record in the nusc dataset. velocity is from the prediction results.