dandelin / ViLT

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

About SBU Caption dataset

4fee8fea opened this issue

Hi @dandelin,
Thanks for your great work and for making it public!

We want to follow your work, but the SBU Caption dataset has become an obstacle: its URL is no longer accessible.

Could you please share a file containing the [url - caption] pairs?

Thanks in advance!