lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch


MAE decoder pos_emb

dnecho opened this issue

Is it necessary to add pos_emb to decoder_tokens?

decoder_tokens = decoder_tokens + self.decoder_pos_emb(unmasked_indices)
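For context, here is a minimal sketch (not the exact repository code) of the decoder step being asked about, assuming a learned decoder_pos_emb = nn.Embedding(num_patches, decoder_dim) and that unmasked_indices / masked_indices hold the patch positions kept and dropped by the encoder:

import torch
import torch.nn as nn

num_patches, decoder_dim = 64, 512
decoder_pos_emb = nn.Embedding(num_patches, decoder_dim)

batch, num_unmasked, num_masked = 2, 16, 48
decoder_tokens = torch.randn(batch, num_unmasked, decoder_dim)          # projected encoder outputs
mask_tokens = torch.randn(1, 1, decoder_dim).expand(batch, num_masked, -1)

unmasked_indices = torch.arange(num_unmasked).expand(batch, -1)
masked_indices = torch.arange(num_unmasked, num_patches).expand(batch, -1)

# the line under discussion: add positional information to the *unmasked* decoder tokens,
# even though they already saw positional embeddings on the encoder side
decoder_tokens = decoder_tokens + decoder_pos_emb(unmasked_indices)

# mask tokens carry no content at all, so they need positional information regardless
mask_tokens = mask_tokens + decoder_pos_emb(masked_indices)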

@dnecho ohh yes, I'm actually not so sure about that - you may be right that it isn't necessary for the unmasked tokens

@dnecho it probably wouldn't hurt to keep it the way it is

@dnecho the other thing to experiment with is reusing the positional embeddings from the original encoder side
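One rough sketch of that experiment, under the assumption that the encoder exposes a pos_embedding parameter of shape 1 x num_patches x encoder_dim (the projection layer enc_to_dec_pos below is a hypothetical name, not the repository's): reuse the encoder's positional embedding on the decoder side, projecting it down when the two widths differ, instead of learning a separate decoder_pos_emb.

import torch
import torch.nn as nn

encoder_dim, decoder_dim, num_patches, batch = 768, 512, 64, 2

# encoder-side positional embedding, shared with the decoder
encoder_pos_embedding = nn.Parameter(torch.randn(1, num_patches, encoder_dim))

# project encoder positions into the decoder width
enc_to_dec_pos = nn.Linear(encoder_dim, decoder_dim)

unmasked_indices = torch.arange(16).expand(batch, -1)
decoder_tokens = torch.randn(batch, 16, decoder_dim)

shared_pos = enc_to_dec_pos(encoder_pos_embedding)[0]   # num_patches x decoder_dim
decoder_tokens = decoder_tokens + shared_pos[unmasked_indices]

Whether the shared embedding helps reconstruction quality versus a separately learned decoder embedding would have to be verified empirically.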