this small project, will help you to create datasets for audio related tasks, major code is taken from KT crawler git project
Repository from Github https://github.comsaurabhvyas/dataset_creatorRepository from Github https://github.comsaurabhvyas/dataset_creator