dandelin / ViLT

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

Visualizations for VQA dataset

aurooj opened this issue

Hi,
I want to produce visualizations for VQA like the ones shown in demo.py.

Could you tell me what needs to be changed in demo.py to make it work for VQA?

For instance, I see that loss_names has a value of 0.5 for 'mlm'. What value should be set for VQA?
Looking forward to your help.

Best,
a