Using custom data for model training and testing

Question

Using custom data for model training and testing

leandermaben opened this issue 2 years ago · comments

I need to train and evaluate the ConvTasNet model on speech enhancement for a custom dataset.
How should I go about formatting the data, and what changes need to be made in the run.sh file.

Pariente Manuel · Answer 1 · Mon Apr 25 2022 02:10:52 GMT+0800 (China Standard Time)

Hey, thanks for the issue !

Have you had a look at already implemented speech enhancement dataset, how do they look like ?

Leander Maben · Answer 2 · Mon Apr 25 2022 04:05:20 GMT+0800 (China Standard Time)

Hi, thanks for the response.
By dataset, I don't mean the dataset class. I mean I want to use my own data ( clean and noisy .wav files) for training instead of wham or any of the other ones.
I was hoping to do this directly using the ConvTasNet recipe by just changing the parameters in run.sh. Would this be possible?

Pariente Manuel · Answer 3 · Mon Apr 25 2022 13:56:02 GMT+0800 (China Standard Time)

Not directly, not. Unless you create the json file that the WHAM dataset expects for your data, which is quite simple.

Otherwise, you need to create a dataset class for your own data, that's why I suggest you to look at how the datasets are implemented.

Leander Maben · Answer 4 · Mon Apr 25 2022 19:55:10 GMT+0800 (China Standard Time)

Sure, I will look into it.
Thank You.