santi-pdp / pase

Problem Agnostic Speech Encoder

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

input file to segment the dataset

MittalShruti opened this issue · comments

Hi, how is the following file generated?
https://raw.githubusercontent.com/santi-pdp/pase/master/data/libri_all_tr.lst

This is the input file while calling the /data/prep/prepare_segmented_dataset_libri.py file.

It seems this is just a different audio file type .Flac, and the .lst file contains the list of audio files to be processed. Not sure why didn't we use train_scp/test_scp here though.