Sudden divergence
Neural networks often have narrow stability windows for learning rates.
A small increase can push updates beyond the region where gradients are meaningful, especially in deep or transformer-based models, causing the loss to explode or become NaN within a few steps.
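A cheap safeguard is to check the loss for NaN or inf before backpropagating, so a diverging run stops instead of silently corrupting the weights. This is a minimal sketch; `model`, `optimizer`, `loss_fn`, and `batch` are placeholder names, not anything from the question.

```python
import torch

def training_step(model, optimizer, loss_fn, batch):
    inputs, targets = batch
    optimizer.zero_grad()
    loss = loss_fn(model(inputs), targets)
    # Halt as soon as the loss is no longer finite, rather than
    # letting broken gradients propagate into the weights.
    if not torch.isfinite(loss):
        raise RuntimeError(f"Loss diverged: {loss.item()}")
    loss.backward()
    optimizer.step()
    return loss.item()
```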
Roll back to the last stable rate and introduce a scheduler instead of tuning by hand. Warm-up schedules are especially important for large models.
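One common pattern is linear warm-up followed by cosine decay, which can be built with PyTorch's `LambdaLR`. The model, base learning rate, `warmup_steps`, and `total_steps` below are assumed values for illustration, not numbers from your setup.

```python
import math
import torch

model = torch.nn.Linear(512, 512)  # stand-in model
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)

warmup_steps, total_steps = 1_000, 100_000

def lr_lambda(step):
    # Linear warm-up from 0 to the base rate, then cosine decay to 0.
    if step < warmup_steps:
        return step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return 0.5 * (1.0 + math.cos(math.pi * progress))

scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lr_lambda)
# Call scheduler.step() once per optimizer step inside the training loop.
```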
Also verify that mixed-precision training isn’t amplifying numerical errors.
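If mixed precision is in play, the usual PyTorch pattern is autocast plus a gradient scaler, which keeps small gradients from underflowing in float16 and skips optimizer steps when gradients overflow. Again a sketch with placeholder names (`model`, `optimizer`, `loss_fn`).

```python
import torch

scaler = torch.cuda.amp.GradScaler()

def amp_step(model, optimizer, loss_fn, inputs, targets):
    optimizer.zero_grad()
    with torch.cuda.amp.autocast():
        loss = loss_fn(model(inputs), targets)
    scaler.scale(loss).backward()   # scale the loss before backward
    scaler.step(optimizer)          # unscales gradients, skips step on inf/NaN
    scaler.update()                 # adjusts the scale factor dynamically
    return loss.item()
```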
Common mistakes:
Using the same learning rate across architectures
Disabling gradient clipping (see the sketch after this list)
Increasing rate without adjusting batch size
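Gradient-norm clipping in particular is a one-line addition before the optimizer step. The model, optimizer, and `max_norm=1.0` below are illustrative choices, not values from the question.

```python
import torch

model = torch.nn.Linear(512, 512)
optimizer = torch.optim.SGD(model.parameters(), lr=1e-2)

inputs = torch.randn(32, 512)
loss = model(inputs).pow(2).mean()

loss.backward()
# Rescale gradients so their global norm never exceeds max_norm.
torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
optimizer.step()
```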
When in doubt, stability beats speed.