paper

Question

paper

cdancette opened this issue 4 years ago · comments

https://proceedings.neurips.cc/paper/2020/file/1fd6c4e41e2c6a6b092eb13ee72bce95-Paper.pdf

Robik Shrestha · Answer 1 · Sat Jan 23 2021 00:08:16 GMT+0800 (China Standard Time)

Hi, this is a fantastic list of references!

Is there any chance you will consider adding our work too (https://www.aclweb.org/anthology/2020.acl-main.727.pdf)? Our study basically shows that the visual grounding methods (HINT/SCR) improve accuracy on VQA-CP through regularization effects as opposed to improving visual grounding. Example: SCR with irrelevant cues can achieve 49.2% accuracy (comparable to the accuracy achieved with relevant cues).

We also present a regularizer designed to simply degrade the training accuracy. It happens to achieve 48.9% accuracy, providing further evidence as to how the improvements stem from the regularization effects.

Code for HINT/SCR/regularizer.

Corentin Dancette · Answer 2 · Sat Jan 23 2021 03:50:24 GMT+0800 (China Standard Time)

@erobic Thank you for your kind words !

I forgot to add your paper, but I'll add this soon ! It looks very interesting. Btw I am familiar with your other papers on VQA, and I find them very interesting ! Thanks for your work.

Robik Shrestha · Answer 3 · Sat Jan 23 2021 08:13:38 GMT+0800 (China Standard Time)

Oh, that's so great to hear! I am familiar with many of your works too, especially RUBi and the wonderful bootstrap framework.
Thanks a lot for considering the ACL paper! :-)

Corentin Dancette · Answer 4 · Sun Jan 24 2021 22:02:38 GMT+0800 (China Standard Time)

@erobic I just added it under the name ESR (embarassingly Simple Regularizer)

Robik Shrestha · Answer 5 · Mon Jan 25 2021 00:33:52 GMT+0800 (China Standard Time)

Great, thank you for including it!

ESR is really just to showcase how OOD benchmarks can be hacked without any core improvement.
Here are the rest of the fields for it: Yes/No: 69.8, Num: 11.3, Others: 47.8, No valset.

Thanks again!