How do you use cosine in simpletuner?? #1193
Answered by bghira
Deathawaits4 asked this question in Q&A
For some reason the learning rate in SimpleTuner drops to such low values in the first 400 steps (I have 7000 steps) that it's basically nonexistent. Any ideas? I also don't understand the warmup steps: the schedule starts at the full learning rate, drops down, goes back up, and only then behaves normally. Shouldn't the LR warmup start at a low value and gradually increase to the target LR?
Answered by bghira on Dec 8, 2024
Are you using polynomial? You would probably want constant_with_warmup then.
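For reference, here is a minimal sketch of the shape a constant-with-warmup schedule typically has: the multiplier ramps linearly from 0 to 1 over the warmup steps and then stays at 1. The function name `constant_with_warmup_multiplier` and the values (a 1e-4 target LR, 400 warmup steps) are hypothetical, chosen only for illustration; this is not SimpleTuner's own code.

```python
def constant_with_warmup_multiplier(step: int, warmup_steps: int) -> float:
    """LR multiplier: linear ramp from 0 to 1 over warmup_steps, then constant at 1."""
    if step < warmup_steps:
        return step / max(1, warmup_steps)
    return 1.0


base_lr = 1e-4      # hypothetical target learning rate
warmup_steps = 400  # hypothetical warmup length

for step in (0, 100, 200, 400, 7000):
    lr = base_lr * constant_with_warmup_multiplier(step, warmup_steps)
    print(f"step {step:>5}: lr = {lr:.2e}")
```

This is the "start low, rise to the target, then hold" behaviour the question expects from a warmup.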
With these schedulers, lr_warmup_steps actually defines the interval between cycles.
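To illustrate why a cycle interval can look like a broken warmup, here is a generic cosine-annealing-with-restarts sketch in which a single `cycle_steps` value sets the cycle length. It shows the general technique only and is not SimpleTuner's actual scheduler; `cosine_restart_lr`, `cycle_steps`, and the numeric values are hypothetical.

```python
import math


def cosine_restart_lr(step: int, cycle_steps: int, lr_max: float, lr_min: float = 0.0) -> float:
    """Cosine annealing from lr_max toward lr_min, restarting every cycle_steps steps."""
    pos = step % cycle_steps  # position inside the current cycle
    return lr_min + 0.5 * (lr_max - lr_min) * (1 + math.cos(math.pi * pos / cycle_steps))


lr_max = 1e-4      # hypothetical peak learning rate
cycle_steps = 400  # hypothetical: the value that would be passed as lr_warmup_steps

for step in (0, 200, 399, 400, 600):
    print(f"step {step:>4}: lr = {cosine_restart_lr(step, cycle_steps, lr_max):.2e}")
```

With a cycle that is short relative to the total run, the rate in this sketch starts at its maximum, decays to nearly zero by the end of the first cycle, and then jumps back up, which resembles the start-high, drop, rise pattern described in the question rather than a warmup ramp.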