Small batchsize

Question

Small batchsize

sudalvxin opened this issue 4 years ago · comments

When bs=32 and epoch>40, an error may occurs at "loss = criterion(**loss_args)" : RuntimeError: stack expects a non-empty TensorList

Karsten Roth · Answer 1 · Mon Jun 29 2020 20:18:47 GMT+0800 (China Standard Time)

Thanks for pointing this out! Could you tell me what loss/mining method you used?

Zhanxuan Hu · Answer 2 · Mon Jun 29 2020 21:16:30 GMT+0800 (China Standard Time)

Thanks for your replay. The loss function is MultiSim.

--

Karsten Roth · Answer 3 · Mon Jun 29 2020 21:19:31 GMT+0800 (China Standard Time)

Can you give me the full error message? I think its due to the semihard masking in the MultiSim loss which is not able to retrieve valid candidates in a small batch, but I just want to make sure. If that's the case I should be able to fix it quite easily.

Zhanxuan Hu · Answer 4 · Mon Jun 29 2020 21:28:30 GMT+0800 (China Standard Time)

Sorry, the error message has been delete.

Zhanxuan Hu · Answer 5 · Mon Jun 29 2020 21:41:50 GMT+0800 (China Standard Time)

Besides, have you test XBM~(Cross-Batch Memory for Embedding Learning) using this code? I introduce XBM in your framework, but the improvements on MS loss is very limited.

Karsten Roth · Answer 6 · Mon Jun 29 2020 21:42:52 GMT+0800 (China Standard Time)

Then I assume its the masking that's causing the issue here - you can try and increase the loss_multisimilarity_margin parameter to avoid generating instance-free masks or use larger batchsize - I'll try to include some catch mechanism.

Karsten Roth · Answer 7 · Mon Jun 29 2020 21:43:12 GMT+0800 (China Standard Time)

*mistakenly closed

Zhanxuan Hu · Answer 8 · Mon Jun 29 2020 21:43:57 GMT+0800 (China Standard Time)

Then I assume its the masking that's causing the issue here - you can try and increase the loss_multisimilarity_margin parameter to avoid generating instance-free masks or use larger batchsize - I'll try to include some catch mechanism.

ok, thanks.

Karsten Roth · Answer 9 · Mon Jun 29 2020 21:44:27 GMT+0800 (China Standard Time)

Besides, have you test XBM~(Cross-Batch Memory for Embedding Learning) using this code? I introduce XBM in your framework, but the improvements on MS loss is very limited.

I know the XBM paper, but I haven't tested it with this framework specifically - Have you tested it with the constrastive loss and resnet?

Zhanxuan Hu · Answer 10 · Tue Jun 30 2020 08:24:09 GMT+0800 (China Standard Time)

Besides, have you test XBM~(Cross-Batch Memory for Embedding Learning) using this code? I introduce XBM in your framework, but the improvements on MS loss is very limited.

I know the XBM paper, but I haven't tested it with this framework specifically - Have you tested it with the constrastive loss and resnet?

I have tested XBN with constrastive loss on SOP and CUB. The result demonstrate that XBM is useful for large scale data. Besides, the performance of XBM is often effected by two parameters.

Karsten Roth · Answer 11 · Thu Jul 02 2020 20:19:53 GMT+0800 (China Standard Time)

I'm closing this for now, feel free to open another issue if anything similar re-occurs :).