A question about evaluate.

Question

A question about evaluate.

tomtang110 opened this issue 5 years ago · comments

Hi
I run my results in the eval() in hotpot_evaluate_v1.py, however, the result may be not the same with your scores in leaderboard. Could you tell me the correct function to evaluate?

Peng Qi · Answer 1 · Tue Jul 09 2019 02:20:45 GMT+0800 (China Standard Time)

Hi, are you sure you're using our script on a complete output file (an example of predictions on the dev set can be found on the website)? Our script should print out a JSON object containing various metrics, and your output looks very different from it.