[Bug] When evaluating on the C-Eval dataset with the data under the test directory, the final score is very low, close to 0

Question

13416157913 opened this issue 3 months ago · comments

I'm evaluating with the officially supported tasks/models/datasets.

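When evaluating on the C-Eval dataset, I selected the test data under the test directory, but the final score is very low, close to 0.
For C-Eval evaluation, do I need to take the model's answers and submit them to the official C-Eval website myself to get the score?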

Songyang Zhang · Answer 1 · Fri Mar 08 2024 13:47:37 GMT+0800 (China Standard Time)

Please submit the results to the official C-Eval website to get the test accuracy. We only have the answers for the val set.
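
For readers who land here with the same problem: since the test split ships without gold answers, local scoring has nothing to compare against, and the predictions have to be packaged for upload instead. Below is a minimal sketch of that packaging step, not the tool's own export. The function name `build_submission`, the example subject names, and the JSON layout (subject name mapped to question index mapped to answer letter) are assumptions for illustration; please check the submission instructions on the C-Eval website for the exact format it expects.

```python
import json

VALID_CHOICES = {"A", "B", "C", "D"}


def build_submission(predictions):
    """Convert {subject: [letter, ...]} into {subject: {"0": letter, ...}}.

    The target layout is an assumption about the upload format,
    not something confirmed in this thread.
    """
    submission = {}
    for subject, letters in predictions.items():
        for letter in letters:
            # Reject anything that is not a single multiple-choice letter.
            if letter not in VALID_CHOICES:
                raise ValueError(f"unexpected answer {letter!r} in {subject}")
        # Question indices are stored as string keys, in prediction order.
        submission[subject] = {str(i): letter for i, letter in enumerate(letters)}
    return submission


# Hypothetical model outputs, already reduced to one letter per question.
predictions = {
    "computer_network": ["A", "C", "B"],
    "high_school_physics": ["D", "A"],
}

submission = build_submission(predictions)
submission_json = json.dumps(submission, ensure_ascii=False, indent=2)
print(submission_json)
```

The resulting JSON string can be written to a file and uploaded; for a quick local sanity check with a real score, run the val split instead, since its answers are available.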

13416157913 · Answer 2 · Fri Mar 08 2024 15:37:08 GMT+0800 (China Standard Time)

Thanks for your reply.