princeton-nlp / ALCE

[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627

The human evaluation results

HuuuNan opened this issue

HuuuNan commented:

Hi there,
The work is great, and thank you for sharing your code.
Would you share the human evaluation results from Section 6 and the script used to "evaluate the accuracy of your automatic metrics by treating the human annotations as gold labels"?
Thanks a lot.

gaotianyu1350 commented:

Hi,

Thanks for your interest in our work! We have added the detailed human evaluation results to the human_eval folder.

HuuuNan commented:

@gaotianyu1350 Thanks for your reply and the update. Could you please also share the script used to "evaluate the accuracy of your automatic metrics by treating the human annotations as gold labels", i.e., the script that computes the citation recall, citation precision, insufficient-citation, and irrelevant-citation metrics? That would be helpful for understanding your work correctly.

@howard-yen can you help share the script? Thanks!

howard-yen commented:

Hi @HuuuNan, I updated the human_eval directory with the script. Thanks for your interest in our project!
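
For readers who want the gist before opening the repo: below is a minimal sketch of the kind of agreement check being requested, i.e., scoring an automatic citation judgment against human annotations treated as gold labels. The file name (annotations.json), field names (human_support, auto_support), and JSON schema are hypothetical illustrations for this sketch, not the actual format used by the released human_eval scripts.

```python
# Minimal sketch: measure how often an automatic citation judgment agrees
# with human annotations, treating the human labels as gold.
# NOTE: the file name and field names below are assumptions for illustration;
# consult the human_eval scripts for the authors' actual schema and computation.
import json

def metric_accuracy(examples):
    """Accuracy of the automatic citation judgment against human gold labels."""
    correct = total = 0
    for ex in examples:
        for cit in ex["citations"]:
            gold = cit["human_support"]  # 1 if the human judged the citation as supporting
            pred = cit["auto_support"]   # 1 if the automatic metric judged it as supporting
            correct += int(gold == pred)
            total += 1
    return correct / total if total else 0.0

if __name__ == "__main__":
    # "annotations.json" is a hypothetical file of per-example annotations
    with open("annotations.json") as f:
        examples = json.load(f)
    print(f"Automatic-metric accuracy vs. human gold labels: {metric_accuracy(examples):.3f}")
```

The same loop could be extended to confusion-matrix counts to separate the two error directions (citations the humans rejected but the metric accepted, and vice versa); see the released human_eval scripts for the authors' actual computation.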