salesforce / ctrl

Conditional Transformer Language Model for Controllable Generation

Home page: https://arxiv.org/abs/1909.05858


How to fine-tune with a lower-memory fp16 version on P100 GPUs?

xurongqiang opened this issue

When fine-tuning, the fp32 version OOMs, so I need a lower-memory fp16 version. How should I modify the training.py script?
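
For reference, the kind of change involved is wrapping the optimizer with TensorFlow's automatic mixed-precision graph rewrite (available since TF 1.14, which this repo targets). The sketch below is a minimal illustration, not the script's actual code: the toy variables `w` and `loss` stand in for the real graph that training.py builds, and the optimizer choice is only an example. Note also that TF's rewrite is tuned for Tensor Core GPUs (Volta and newer), so it may decline to cast ops on a P100.

```python
import tensorflow as tf

# Toy graph standing in for the real model; training.py builds its own loss.
w = tf.Variable(1.0)
loss = tf.square(w - 3.0)

opt = tf.train.AdagradOptimizer(learning_rate=0.1)

# Wrap the optimizer so eligible ops run in fp16, with dynamic loss scaling
# to keep small gradients from underflowing. Master weights stay in fp32.
opt = tf.train.experimental.enable_mixed_precision_graph_rewrite(
    opt, loss_scale="dynamic"
)

train_op = opt.minimize(loss)  # then use train_op exactly as before
```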

This is quite delicate and doesn't quite seem to work out of the box. I'm going to need more time to look into this.

The background to this problem is that we have a large number of P100 machines, but they cannot run the fp32 version. Thank you for working on this.

Is there any update on this? We have the same problem, unfortunately.

Yes, same here for us. Both the huggingface port and this repo hit the same OOM error when running on a free Google Colab GPU such as the P100. Any fix or workaround yet?
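
On the huggingface side, one workaround to try is PyTorch's automatic mixed precision (PyTorch 1.6+). The sketch below is a minimal illustration under stated assumptions, not a confirmed fix: the model and tokenizer classes come from the transformers library, while the training loop, data, and hyperparameters are placeholders. Because optimizer state stays in fp32, the full 1.6B-parameter CTRL may still exceed a 16 GB P100 even with this change.

```python
import torch
from torch.cuda.amp import GradScaler, autocast
from transformers import CTRLLMHeadModel, CTRLTokenizer

device = torch.device("cuda")
tokenizer = CTRLTokenizer.from_pretrained("ctrl")
model = CTRLLMHeadModel.from_pretrained("ctrl").to(device)
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)
scaler = GradScaler()  # dynamic loss scaling for stable fp16 gradients

# Placeholder training data: one batch built from a single example,
# prefixed with a CTRL control code ("Links").
input_ids = tokenizer(
    "Links An example sentence.", return_tensors="pt"
).input_ids.to(device)

model.train()
for step in range(10):  # placeholder loop; substitute a real DataLoader
    optimizer.zero_grad()
    with autocast():  # forward pass runs in fp16 where numerically safe
        loss = model(input_ids, labels=input_ids).loss
    scaler.scale(loss).backward()
    scaler.step(optimizer)
    scaler.update()
```

This mainly reduces activation memory; if it is still not enough, gradient accumulation with a smaller batch size or gradient checkpointing are the usual next steps.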

The problem still persists, unfortunately. Fine-tuning doesn't really work with Colab resources.