Inconsistent SPICE scores

Question

brandbir opened this issue 3 years ago

In pycocoevalcap/spice/spice.py, line 29 sorts the img_ids before the SPICE metrics are generated. The resulting metrics are then extracted and mapped to image ids in pycocoevalcap/eval.py by the setImgToEvalImgs() function, which uses an unsorted img_ids list. Because one list is sorted and the other is not, the per-image SPICE metrics are mapped to the wrong image ids. The overall SPICE metric is unaffected, since the problem is only a misalignment between the computed per-image metrics and their image ids. For consistency, imgIds should not be sorted in pycocoevalcap/spice/spice.py.
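
To illustrate the misalignment, here is a minimal sketch. It does not use the real pycocoevalcap code: `score_images` and `map_scores` are hypothetical stand-ins for the scorer and for a positional mapping step, and the scores are made-up integers. It only shows what happens when scores are produced in sorted order but paired with ids in the original order.

```python
def score_images(img_ids, sort_ids):
    """Hypothetical scorer: returns one score per image, in the order it processed them."""
    order = sorted(img_ids) if sort_ids else list(img_ids)
    # Each result remembers which image it was really computed for.
    return [{"image_id": i, "score": i * 10} for i in order]


def map_scores(img_ids, scores):
    """Hypothetical mapping step: pairs ids and scores by position only."""
    return {img_id: score for img_id, score in zip(img_ids, scores)}


img_ids = [3, 1, 2]  # unsorted, as the caller holds them

misaligned = map_scores(img_ids, score_images(img_ids, sort_ids=True))
aligned = map_scores(img_ids, score_images(img_ids, sort_ids=False))

# Ids whose attached score was actually computed for a different image.
wrong = [i for i, s in misaligned.items() if s["image_id"] != i]
print("misattributed ids:", wrong)

# The set of scores is the same either way, so the average does not change.
mean_misaligned = sum(s["score"] for s in misaligned.values()) / len(misaligned)
mean_aligned = sum(s["score"] for s in aligned.values()) / len(aligned)
print(mean_misaligned == mean_aligned)
```

In this toy case every per-image score ends up attached to a different image when the scorer sorts, while the mean is identical, which matches the behaviour described above.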

Yiwei Guo · Answer 1 · Sat May 14 2022 14:19:17 GMT+0800 (China Standard Time)

I think this problem still exists (as of 2022-05-14), and it caused some time-consuming trouble in my experiments. Thanks for pointing this out, @brandbir.