PSA Do not train with multiple gpus

Question

PSA Do not train with multiple gpus

mlinmg opened this issue 9 months ago · comments

Just for anyone interested in training this model. Do not use multiple gpus, it shows massive performance dump.
For instance, I was getting 1.49s/i with two gpus at batch size 12, and with one gpu I get 5.53it/s

Marco · Answer 1 · Thu Oct 26 2023 18:08:49 GMT+0800 (China Standard Time)

I thought It might help someone in the future