[SST-2] Different test.tsv for original and k-shot data

Question

skull8888888 opened this issue 3 years ago

Hello, I noticed that test.tsv for the k-shot data contains 872 sentences, whereas the one in the original folder contains 1820 for the SST-2 task. Is this the intended behavior? Perhaps to save on computation time?

Tianyu Gao · Answer 1 · Tue Jun 01 2021 21:08:03 GMT+0800 (China Standard Time)

Hi,

Thanks for pointing this out. For SST-2, we use the original development set as our test set, which is why the number of instances differs. You can also check out Appendix A in the paper for details.

skull8888888 · Answer 2 · Tue Jun 01 2021 21:29:32 GMT+0800 (China Standard Time)

So when I train the model, which test set does it use: the one in k-shot/SST-2/ or the one in original?

Tianyu Gao · Answer 3 · Tue Jun 01 2021 21:42:46 GMT+0800 (China Standard Time)

In our framework, the one in k-shot will be used.
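
For anyone who wants to verify the counts discussed above, here is a minimal sketch that counts the data rows in each test.tsv. The directory layout (`data/original/SST-2/` and a seed subfolder such as `data/k-shot/SST-2/16-13/`) and the presence of a header row are assumptions for illustration and are not confirmed in this thread; adjust the paths to your local checkout.

```python
from pathlib import Path


def count_rows(tsv_path, has_header=True):
    """Count non-empty data rows in a TSV file, optionally skipping one header line."""
    with open(tsv_path, encoding="utf-8") as f:
        n = sum(1 for line in f if line.strip())
    return max(n - (1 if has_header else 0), 0)


if __name__ == "__main__":
    # Hypothetical paths: the actual layout of your data directory may differ.
    candidates = {
        "original": Path("data/original/SST-2/test.tsv"),
        "k-shot": Path("data/k-shot/SST-2/16-13/test.tsv"),
    }
    for name, path in candidates.items():
        if path.exists():
            print(f"{name}: {count_rows(path)} rows ({path})")
        else:
            print(f"{name}: file not found at {path}")
```

If the k-shot file was built from the original development set, as described above, its row count should match that of the original dev.tsv rather than the original test.tsv.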