Sense-X / MixMIM

MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

About 'Mix Embedding' in paper.

lcxrocks opened this issue · comments

In your paper, you mentioned that you 'add two mix embeddings to the visual tokens to implicitly differentiate the two mixing groups' and 'use different mix embeddings for the 4 stages of the encoder', which is in section 2.3. However, it appears to me that there isn't such mix embedding in the function forward_encoder of models_mixmim.py. So, if it's my silly mistake that I didn't find it, would you kindly point it out to me? Thanks!