the files number of TUT 2017 dataset

Question

the files number of TUT 2017 dataset

Zth9730 opened this issue 6 months ago · comments

The Table 2 in the paper says that TUT 2017 contains 6.3k files and includes training, testing, and validation sets. But when I download TUT 2017, there are only 312*15=4680 files and there is only development set. May I ask why there are 6.3k number of files here and how it is divided into training testing validation set?

Soham · Answer 1 · Fri Jan 19 2024 08:02:48 GMT+0800 (China Standard Time)

Hi @Zth9730, TUT 2017 has a total of 6.3k files. It's divided into development and evaluation and can be obtained from the below links:

Development set: https://zenodo.org/records/400515
Evaluation set: https://zenodo.org/records/1040168

Table 2 in the paper shows full dataset statistics. For Pengi's zero-shot evaluation, we use only the evaluation set to ensure our numbers are comparable with other zero-shot and supervised benchmarks. I hope this helps!

TianHao Zhang · Answer 2 · Mon Jan 22 2024 10:41:18 GMT+0800 (China Standard Time)

Thanks a lot !!!