[CVPR 2024] The official implementation of the paper "Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding"
Home Page: https://arxiv.org/abs/2312.00081