I want to use pretrained VITMAE to embed patches without any masks which inturn I plan to feed into a decoder, Can someone point me in the correct direction ?

Question

aishik11 opened this issue 2 years ago · comments