Different batch size settings lead to different results
guggugg opened this issue
Hello, and thank you for sharing this code. I trained it on the LibriMix dataset. With batch_size=1 the loss decreases normally, but with batch_size=4 the loss is always positive. What could be the problem? Could you give me some help?
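For context, LibriMix separation models are commonly trained with a negative SI-SNR objective. The exact loss used here isn't stated in this thread, so the sketch below is only an assumption about what "always positive" might mean under that objective: once the model separates better than 0 dB SI-SNR the loss goes negative, so a loss that never drops below zero suggests the estimates are still worse than the targets energy-wise. The helper name `neg_si_snr_loss` is hypothetical, not this repo's API.

```python
import torch

def neg_si_snr_loss(est, ref, eps=1e-8):
    """Negative SI-SNR (a common objective for LibriMix separation).

    est, ref: (batch, time) waveforms. Positive values mean the error
    energy exceeds the target energy (SI-SNR below 0 dB).
    """
    # Zero-mean both signals so the measure ignores DC offsets.
    est = est - est.mean(dim=-1, keepdim=True)
    ref = ref - ref.mean(dim=-1, keepdim=True)
    # Project the estimate onto the reference to get the target component.
    dot = torch.sum(est * ref, dim=-1, keepdim=True)
    s_target = dot * ref / (torch.sum(ref ** 2, dim=-1, keepdim=True) + eps)
    e_noise = est - s_target
    si_snr = 10 * torch.log10(
        torch.sum(s_target ** 2, dim=-1) / (torch.sum(e_noise ** 2, dim=-1) + eps) + eps
    )
    return -si_snr.mean()

# Random estimates give roughly 0 dB or worse, so the loss stays positive;
# estimates close to the reference give a strongly negative loss.
ref = torch.randn(4, 16000)
print(neg_si_snr_loss(torch.randn(4, 16000), ref))               # typically > 0
print(neg_si_snr_loss(ref + 0.01 * torch.randn(4, 16000), ref))  # well below 0
```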
Thanks for reaching out. What does "drops normally" mean? With a batch size of 1 the results should be worse, since there is no augmentation when each batch contains a single example (see the sketch below), but I do not understand the "always positive" part.
If you are using a trainer from another repo, I can't possibly know what goes wrong.
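The thread does not say which augmentation is meant, so the following is only a hypothetical illustration of the kind of within-batch augmentation that degenerates at batch_size=1: dynamic remixing, where sources are shuffled across batch items and re-summed into new mixtures. With a single item per batch, every permutation is the identity and no new mixtures are ever created. The function name `remix_batch` and the tensor shapes are assumptions, not this repo's code.

```python
import torch

def remix_batch(sources):
    """Hypothetical within-batch dynamic remixing.

    sources: (batch, n_src, time) clean source waveforms. Each source slot
    is shuffled independently along the batch dimension, then the shuffled
    sources are re-summed into new mixtures. With batch == 1 the shuffle is
    always the identity, so the augmentation has no effect.
    """
    batch, n_src, _ = sources.shape
    remixed = torch.stack(
        [sources[torch.randperm(batch), i] for i in range(n_src)], dim=1
    )
    mixtures = remixed.sum(dim=1)
    return mixtures, remixed

# batch_size=4 yields fresh source combinations; batch_size=1 cannot.
sources = torch.randn(4, 2, 16000)
mixtures, targets = remix_batch(sources)
print(mixtures.shape, targets.shape)  # (4, 16000) and (4, 2, 16000)
```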