yuweihao / MambaOut

MambaOut: Do We Really Need Mamba for Vision? (CVPR 2025)

Repository from GitHub https://github.com/yuweihao/MambaOut

The arch figure doesn't conform to the code

kytimmylai opened this issue

I found that in the architecture figure of the Gated CNN block, the concat step is missing, and the gate branch and the conv branch should share the same linear layer. Adjusting the figure this way would bring it closer to what the code computes at inference.

Hi @kytimmylai,

  • The conv can be a standard depthwise convolution or a depthwise convolution on only part of the channels (as in InceptionNeXt), controlled by the conv ratio.
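  • The gate branch and the conv branch do not share the same linear layer. The code uses one linear followed by a split (linear + split), which is equivalent to a separate linear layer per branch.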
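
For anyone reading along, here is a minimal PyTorch sketch of the two points above, written from the description in this thread rather than copied from the repository. The class name `GatedCNNBlockSketch`, the helper `separate_linears`, and the default values are illustrative assumptions. It shows the linear + split, the depthwise conv applied to only `conv_ratio * dim` channels, and the concat before gating.

```python
import torch
import torch.nn as nn


class GatedCNNBlockSketch(nn.Module):
    """Illustrative gated CNN block (hypothetical name and defaults)."""

    def __init__(self, dim, expansion_ratio=8 / 3, kernel_size=7, conv_ratio=1.0):
        super().__init__()
        hidden = int(expansion_ratio * dim)
        conv_channels = int(conv_ratio * dim)
        self.norm = nn.LayerNorm(dim)
        # One linear emits both branches: `hidden` gate channels followed by
        # `hidden` value channels. Each output channel has its own weights.
        self.fc1 = nn.Linear(dim, 2 * hidden)
        self.act = nn.GELU()
        # gate | value channels left untouched | value channels sent to the conv
        self.split_sizes = (hidden, hidden - conv_channels, conv_channels)
        # groups == channels makes this a depthwise convolution.
        self.conv = nn.Conv2d(conv_channels, conv_channels, kernel_size,
                              padding=kernel_size // 2, groups=conv_channels)
        self.fc2 = nn.Linear(hidden, dim)

    def forward(self, x):  # x: [B, H, W, C]
        shortcut = x
        g, i, c = torch.split(self.fc1(self.norm(x)), self.split_sizes, dim=-1)
        # Conv2d expects [B, C, H, W], so permute in and back out.
        c = self.conv(c.permute(0, 3, 1, 2)).permute(0, 2, 3, 1)
        # Concat the untouched and convolved channels, then apply the gate.
        x = self.fc2(self.act(g) * torch.cat((i, c), dim=-1))
        return x + shortcut


def separate_linears(fc1, hidden):
    """Rebuild the fused linear as two independent linears (hypothetical helper)."""
    gate = nn.Linear(fc1.in_features, hidden)
    value = nn.Linear(fc1.in_features, hidden)
    with torch.no_grad():
        gate.weight.copy_(fc1.weight[:hidden])
        gate.bias.copy_(fc1.bias[:hidden])
        value.weight.copy_(fc1.weight[hidden:])
        value.bias.copy_(fc1.bias[hidden:])
    return gate, value
```

With `dim=8` and `conv_ratio=0.5`, the split sizes come out as `(21, 17, 4)`, so only 4 of the 21 value channels pass through the depthwise conv. Calling `separate_linears(block.fc1, 21)` gives two layers whose concatenated outputs match `block.fc1(x)` up to floating-point tolerance, which is why a single linear followed by a split does not mean the branches share weights.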