A question about evaluate.
tomtang110 opened this issue
tomtang110 commented
Peng Qi commented
Hi, are you sure you're using our script on a complete output file (an example of predictions on the dev set can be found on the website)? Our script should print out a JSON object containing various metrics, and your output looks very different from it.
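
One quick way to rule out an incomplete output file is to compare the question IDs in the predictions against those in the gold file before running the evaluation script. The following is only a minimal sketch: it assumes the gold file is a JSON list of objects with an `_id` field and the predictions file is a JSON object whose `answer` entry maps each ID to an answer string. Those layouts, and the helper names `load_json` and `missing_ids`, are assumptions for illustration and are not taken from this thread or from the script itself.

```python
import json


def load_json(path):
    """Read a JSON file from disk."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def missing_ids(gold, prediction):
    """Return the gold question IDs that have no predicted answer.

    Assumed layout (hypothetical): `gold` is a list of dicts with an "_id" key,
    and `prediction` is a dict whose "answer" entry maps IDs to answer strings.
    """
    answers = prediction.get("answer", {})
    return sorted(ex["_id"] for ex in gold if ex["_id"] not in answers)


# Example with in-memory data; with real files, pass load_json(...) results instead.
gold = [{"_id": "q1"}, {"_id": "q2"}, {"_id": "q3"}]
prediction = {"answer": {"q1": "yes", "q3": "Paris"}}
print(missing_ids(gold, prediction))
```

If the returned list is non-empty, the predictions file does not cover every question, which may explain output that differs from the expected JSON object of metrics.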