[FEATURE] Feature only support for Twins-PVT and Mvitv2

Question

[FEATURE] Feature only support for Twins-PVT and Mvitv2

L-Reichardt opened this issue 7 months ago · comments

Both are pyramid networks and can be used for multi-scale feature extraction, but to my knowledge do not support it like similar architectures such as PVT or Swin.

Ross Wightman · Answer 1 · Sat Feb 17 2024 00:31:48 GMT+0800 (China Standard Time)

@L-Reichardt the efficient mechanism for feature extraction relies on sequential stack at the stage level of the pyramid network, many pure vit / vit-hybrid need nn.ModuleList (and have extra args) or have extra root level modules in the model that can't be sequentialized... I have a very rough draft for another approach that'd address these but have another project in the way right now...