Time cost and memory cost of FastMETRO
MooreManor opened this issue · comments
Hello! FastMETRO is nice work. I'd like to know the training time and memory cost of the two FastMETRO versions.
With --per_gpu_train_batch_size 16 and the mixed datasets, how long does one training epoch take for each of the two FastMETRO versions on your cards, and how much GPU memory does each use on a single card?
Hello,
We conduct single-node distributed training on a machine with 4 NVIDIA V100 GPUs (16GB RAM each). We set --per_gpu_train_batch_size to 16 and --num_workers to 4.
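For reference, a single-node launch with these settings might look like the sketch below. The launcher and script name are assumptions based on common PyTorch practice, not confirmed in this thread; only --per_gpu_train_batch_size and --num_workers are the values quoted above.

```shell
# Hypothetical launch command: the script path is an assumption, not the
# confirmed entry point. One process per GPU on a 4-GPU node.
python -m torch.distributed.launch --nproc_per_node=4 \
    src/tools/run_fastmetro.py \
    --per_gpu_train_batch_size 16 \
    --num_workers 4
```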
The time and memory costs for training on the mixed datasets in our environment are:
[ FastMETRO-S-R50 ]
- Training time for each epoch: ~ 0.6 hours
- Memory used in each GPU: ~ 5 GB
[ FastMETRO-L-H64 ]
- Training time for each epoch: ~ 1.5 hours
- Memory used in each GPU: ~ 14 GB
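For quick reference, the settings above imply an effective global batch size and an approximate per-epoch GPU-hour cost. This is simple arithmetic from the reported numbers, not an official figure:

```python
# Derived from the numbers reported in this thread.
num_gpus = 4        # NVIDIA V100 GPUs on one node
per_gpu_batch = 16  # --per_gpu_train_batch_size

# Global batch size per optimization step across all GPUs.
effective_batch = num_gpus * per_gpu_batch
print(effective_batch)  # 64

# Approximate GPU-hours per training epoch for each variant.
gpu_hours_s_r50 = num_gpus * 0.6  # FastMETRO-S-R50: ~2.4 GPU-hours
gpu_hours_l_h64 = num_gpus * 1.5  # FastMETRO-L-H64: ~6.0 GPU-hours
print(gpu_hours_s_r50, gpu_hours_l_h64)
```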
Thanks for your interest in our work!!
Please reopen this issue if you need more help with this.