OpenLMLab / LEval

[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Except for GSM100, are the other datasets evaluated in 0-shot?

zhimin-z opened this issue · comments

I'd like to confirm the leaderboard configuration.

Yes, we did not add in-context examples for the other tasks. If you want few-shot evaluation, we suggest modifying our inference code under the Baselines folder. The leaderboard has not been updated; please refer to our paper for the up-to-date results!
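A minimal sketch of how one might add few-shot examples when modifying an inference script (this is an illustration, not L-Eval's actual code; the function and field names `input`/`output` are assumptions):

```python
# Hypothetical helper (not from the L-Eval codebase): prepend up to k solved
# examples to a query to turn a 0-shot prompt into a few-shot one.
def build_few_shot_prompt(examples, query, k=0):
    """With k=0 this is a plain 0-shot prompt; k>0 prepends k examples."""
    parts = []
    for ex in examples[:k]:
        parts.append(f"Question: {ex['input']}\nAnswer: {ex['output']}")
    parts.append(f"Question: {query}\nAnswer:")
    return "\n\n".join(parts)

# 0-shot, as used for most tasks on the leaderboard: no examples prepended.
zero_shot = build_few_shot_prompt([], "What is 2 + 2?", k=0)

# few-shot: one in-context example is prepended before the query.
few_shot = build_few_shot_prompt(
    [{"input": "What is 1 + 1?", "output": "2"}],
    "What is 2 + 2?",
    k=1,
)
```

The resulting string would then be passed to whatever model call the baseline script already makes.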