Is the learning rate during warmup steps 0? Or is it a % of the base LR? Or is it gradually increased from 0 to the base LR? I understand all three are methods used in ML.
Answered by laksjdjf, Apr 19, 2023
It's gradually increased from 0 to the base LR.
See the Hugging Face optimizer schedules documentation: https://huggingface.co/docs/transformers/main_classes/optimizer_schedules#schedules
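As a minimal sketch of what "gradually increased" means in practice (not this trainer's actual configuration), here is the `get_constant_schedule_with_warmup` helper from the linked `transformers` docs; the base LR of 1e-4, the 100 warmup steps, and the dummy parameter are illustrative assumptions:

```python
import torch
from transformers import get_constant_schedule_with_warmup

# Dummy parameter and optimizer, purely for illustration.
param = torch.nn.Parameter(torch.zeros(1))
optimizer = torch.optim.AdamW([param], lr=1e-4)  # base LR (assumed value)

# During warmup the scheduler multiplies the base LR by step / num_warmup_steps,
# so the effective LR starts at 0 and rises linearly to the base LR.
scheduler = get_constant_schedule_with_warmup(optimizer, num_warmup_steps=100)

for step in range(5):
    optimizer.step()
    scheduler.step()
    # Prints 1e-6, 2e-6, 3e-6, ... climbing toward the base LR of 1e-4.
    print(step, scheduler.get_last_lr())
```

The same pattern applies to the other warmup variants in that module (e.g. linear or cosine schedules), which differ only in what happens after the warmup phase ends.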