Is there any way that I can use learning rate warm-up during training?
shamanez opened this issue
I am using this repo for:
- Continual Pre-training
- SFT
- DPR
For stage 1, I want to use a learning rate warm-up.
Hello @shamanez, the scripts in this repo are mostly focused on chat models, so for continual pretraining I recommend checking out the transformers script: https://github.com/huggingface/transformers/blob/main/examples/pytorch/language-modeling/run_clm.py
There you can specify e.g. `warmup_ratio` in `TrainingArguments` to get the warmup.
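For reference, a minimal sketch of what that could look like; the output directory and hyperparameter values below are placeholder assumptions, not from this thread:

```python
from transformers import TrainingArguments

# Placeholder values for illustration; tune them for your own run.
training_args = TrainingArguments(
    output_dir="clm-continual-pretrain",  # hypothetical output path
    learning_rate=2e-5,
    num_train_epochs=3,
    warmup_ratio=0.1,  # warm the LR up over the first 10% of training steps
    # warmup_steps=500,  # or use an absolute step count instead of a ratio
)
```

Since run_clm.py builds its `TrainingArguments` with `HfArgumentParser`, the same setting can also be passed on the command line, e.g. `--warmup_ratio 0.1` (or `--warmup_steps 500`).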