cambridgeltl / visual-spatial-reasoning

[TACL'23] VSR: A probing benchmark for spatial undersranding of vision-language models.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

cambridgeltl/visual-spatial-reasoning Stargazers