dandelin / ViLT

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"


Pre-training Time

haoshuai714 opened this issue · comments

Thanks for your great code!
In your paper, the pre-training experiments require 64 V100 GPUs.
How long did the pre-training take with 64 V100 GPUs?
Thank you!

Hi @haoshuai714

https://tensorboard.dev/experiment/mNHxDM08R6eHKeU0JHn5vg/#scalars

This is the log of MLM+ITM pre-training with 64 V100 GPUs.