ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
shuxjweb opened this issue a year ago · comments
I download the pre-trained model "ViT-L-14.pt"x and its feature is 768. However, the vision_width in yaml file is set 1024. This is different.