maum-ai / voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Home Page: http://swpark.me/voicefilter


Training setting problem

Morank88 opened this issue

Hi,

Thank you for publishing your code!
I am encountering a training problem. As an initial phase, I have tried to train on only 1000 samples from the LibriSpeech train-clean-100 dataset. I am using the default configuration as published in your VoiceFilter repo; the only difference is that I use a batch size of 6 due to memory limitations. Is it possible that the problem is related to the small batch size?
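In case it clarifies what I mean, the workaround I had in mind for the memory limit is plain gradient accumulation, sketched generically below. This is not this repo's trainer; the model, loss, and data are placeholders standing in for the VoiceFilter network and its spectrogram batches:

```python
import torch
import torch.nn as nn

# Generic gradient-accumulation sketch, not this repo's trainer.
model = nn.Linear(601, 601)
criterion = nn.MSELoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

accum_steps = 2          # effective batch = 6 * accum_steps = 12
optimizer.zero_grad()
for step in range(100):
    mixed = torch.randn(6, 601)    # micro-batch of 6, as in my setup
    target = torch.randn(6, 601)
    loss = criterion(model(mixed), target) / accum_steps  # average over micro-batches
    loss.backward()                # gradients add up across micro-batches
    if (step + 1) % accum_steps == 0:
        optimizer.step()           # one update per effective batch
        optimizer.zero_grad()
```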

Another question is related to the generation of the training and testing sets. I noticed that there is an option to use a VAD when generating the training set, but by default it is not used. What is the best practice: to use the VAD or not?
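For reference, by VAD I mean silence filtering of the kind sketched below. This uses librosa's energy-based splitter purely as a stand-in, not necessarily what the repo's VAD option does, and the file name and thresholds are placeholders:

```python
import numpy as np
import librosa

# Energy-based silence check as a stand-in for a VAD.
wav, sr = librosa.load("speaker_utterance.flac", sr=16000)
intervals = librosa.effects.split(wav, top_db=20)  # non-silent (start, end) pairs
voiced = np.concatenate([wav[s:e] for s, e in intervals]) if len(intervals) else wav
speech_ratio = len(voiced) / len(wav)
if speech_ratio < 0.6:  # arbitrary cutoff for illustration
    print("mostly silence -- would skip this clip when building the training set")
```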

I appreciate your help!

I managed to make some progress. I am now training on data from LibriSpeech train-clean-100 and train-clean-360 and testing on dev-clean. After 40k steps the SDR has reached only ~5. Is it possible that this is related to the batch size I am using (6)?
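For reference, I am measuring SDR with mir_eval's BSS eval, which I assume matches what the repo reports. A standalone sketch with synthetic placeholder signals, where `clean` stands in for the ground-truth target speech and `estimated` for the model output:

```python
import numpy as np
from mir_eval.separation import bss_eval_sources

# Standalone SDR check on synthetic placeholder signals.
sr = 16000
clean = np.random.randn(sr * 3)
estimated = clean + 0.1 * np.random.randn(sr * 3)  # a fairly good "estimate"
sdr, sir, sar, _ = bss_eval_sources(clean[np.newaxis, :],
                                    estimated[np.newaxis, :])
print(f"SDR: {sdr[0]:.2f} dB")
```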

Another question: what is the learning rate policy? Did you fix it at 1e-3 throughout the whole training, or did you update it?
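To make the question concrete, the two policies I have in mind look roughly like this in PyTorch. The model is a placeholder, and the decay step size and factor are illustrative numbers, not taken from this repo:

```python
import torch
import torch.nn as nn

# Placeholder model; decay numbers below are illustrative only.
model = nn.Linear(601, 601)

# Policy (a): Adam fixed at 1e-3 for the whole run.
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# Policy (b): the same optimizer plus a stepped decay,
# e.g. halve the LR every 100k steps.
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=100_000, gamma=0.5)

loss = model(torch.randn(4, 601)).pow(2).mean()
loss.backward()
optimizer.step()      # one training step...
scheduler.step()      # ...then advance the schedule
print(optimizer.param_groups[0]["lr"])  # still 1e-3 until 100k steps have elapsed
```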

Thanks.

Here are the TensorBoard results:
[TensorBoard screenshot]

Hi Morank88, did you get any improvement in your SDR? The SDR I am getting is much worse than even yours:
[TensorBoard screenshot]

I've tried a number of runs. At first I used a smaller batch size to fit on a lesser graphics card, but the run above used the as-downloaded batch size on an Amazon EC2 instance with an NVIDIA V100. The only differences are that I set the test sample count to 1000 (it's 100 in the code, but a comment in the README mentioned 1000; maybe I'll change it back to the 100 as downloaded and run again), and that I have some likely more up-to-date Python libraries (I couldn't seem to find a compatible torch 1.0.1, for example). Any suggestions?

Thanks in advance,

Nat


Hi, can you share your settings? I am running into the same situation. Thanks!

I also have the same question.
The best result I got is an SDR of 8.

Here are the TensorBoard results: [TensorBoard screenshot]

I am running into the same problem as you. How did you solve it?