Question about BLIPvqa code with reproduce scores of shape, texture

Question

Question about BLIPvqa code with reproduce scores of shape, texture

ofzlo opened this issue 9 months ago · comments

Hi, thank you for sharing your excellent work.

I'm interested in reproducing the B-VQA scores presented in your paper, specifically those in tables 2, 3, and 4. While reviewing the /BLIPvqa_eval/BLIP_vqa.py code, I noticed that it primarily uses 'color' as a parameter. I'd like to inquire whether it can be adapted to work with 'shape' and 'texture' as well. Additionally, I'm curious if making this change would have any impact on the score production.

Retr0573 · Answer 1 · Mon Mar 18 2024 13:51:07 GMT+0800 (China Standard Time)

I'm curious about it too, do you have any understanding now?

Yoorhim Cho · Answer 2 · Mon Mar 18 2024 15:22:43 GMT+0800 (China Standard Time)

@Retr0573
I apologize for not being able to provide a more detailed explanation, as it was quite some time ago, but I believe it was not an issue because 'color', 'shape', and 'texture' were using the same JSON file.