Question on zero-shot inference with ViT based model

Question

Question on zero-shot inference with ViT based model

hanguniverse opened this issue a year ago · comments

Hello, I try to use it for zero-shot detection with ground truth based on ViT model, but I couldn't find any instructions on how to use ViT, as this framework seems to only support resnet model, even on zero-shot branch, can you help me check this issue? Thank you