
[Feature]: Experiment with alternative schedulers for scheduler_g and scheduler_d #945

Open
Bebra777228 opened this issue Jan 10, 2025 · 5 comments

@Bebra777228
Contributor

Description

In the train.py file, there is a section for initializing schedulers. Have you tried replacing the ExponentialLR class with a different scheduler, such as CosineAnnealingLR or ReduceLROnPlateau, or any other scheduler?

I believe it would be beneficial to experiment with different scheduler classes. There might be a better option than ExponentialLR that could improve the training process.
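For reference, a minimal sketch of what the scheduler setup in train.py roughly looks like and how the class could be swapped (the model, optimizer, and hyperparameter values below are illustrative placeholders, not the exact code or settings in the repository):

```python
import torch
from torch import nn

# Placeholder models and hyperparameters; in train.py these come from the
# actual generator/discriminator and the training config.
net_g, net_d = nn.Linear(8, 8), nn.Linear(8, 8)
optim_g = torch.optim.AdamW(net_g.parameters(), lr=2e-4)
optim_d = torch.optim.AdamW(net_d.parameters(), lr=2e-4)
total_epochs = 500  # illustrative

# Current behaviour: exponential decay of the learning rate each epoch.
scheduler_g = torch.optim.lr_scheduler.ExponentialLR(optim_g, gamma=0.999875)
scheduler_d = torch.optim.lr_scheduler.ExponentialLR(optim_d, gamma=0.999875)

# Possible alternative: cosine annealing towards a small floor over the run.
# scheduler_g = torch.optim.lr_scheduler.CosineAnnealingLR(optim_g, T_max=total_epochs, eta_min=1e-6)
# scheduler_d = torch.optim.lr_scheduler.CosineAnnealingLR(optim_d, T_max=total_epochs, eta_min=1e-6)
```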


@Bebra777228 Bebra777228 added enhancement New feature or request feature labels Jan 10, 2025
@blaisewf
Member

“I believe it would be beneficial to experiment with different scheduler classes” — can you share any results from this? We haven't tested it on our end yet.

@Bebra777228
Contributor Author

Unfortunately, I haven't conducted any tests and I don't have any results. However, when I was reviewing the code and saw that you had set up the option to choose different optimizers, I had an idea to try changing the learning rate scheduler as well.

In the lr_scheduler.py file, I found a variety of different schedulers. I started reading the descriptions of each one, and two particularly caught my attention: CosineAnnealingLR and ReduceLROnPlateau.

Of course, I didn't review all the available options; there might be something better. But out of the ones I read, these two stood out to me. Yes, the descriptions might be a bit exaggerated, but why not give them a try? :)


short descriptions (unofficial)

ExponentialLR

Gradually decreases the learning rate following an exponential function with each epoch.

CosineAnnealingLR

Smoothly decreases the learning rate following a cosine curve, which helps stabilize training.

ReduceLROnPlateau

Reduces the learning rate when the model's performance metric stops improving, helping to avoid getting stuck in local minima.
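To make the differences concrete, here is a sketch of how each of the three would be constructed and stepped in PyTorch (all attached to one optimizer purely for illustration; the hyperparameter values are placeholders, not recommendations). Note that ReduceLROnPlateau is the odd one out: its step() call needs the monitored metric.

```python
import torch
from torch import nn

model = nn.Linear(8, 8)                                  # placeholder model
optim = torch.optim.AdamW(model.parameters(), lr=2e-4)

# ExponentialLR: lr <- lr * gamma after every epoch.
exp_sched = torch.optim.lr_scheduler.ExponentialLR(optim, gamma=0.999)

# CosineAnnealingLR: lr follows a cosine curve from the initial lr down to
# eta_min over T_max epochs.
cos_sched = torch.optim.lr_scheduler.CosineAnnealingLR(optim, T_max=500, eta_min=1e-6)

# ReduceLROnPlateau: lr is multiplied by `factor` after `patience` epochs
# without improvement of the monitored metric.
plateau_sched = torch.optim.lr_scheduler.ReduceLROnPlateau(
    optim, mode="min", factor=0.5, patience=10
)

# Per-epoch stepping:
#   exp_sched.step()              # or cos_sched.step()
#   plateau_sched.step(val_loss)  # needs a validation/mel loss value each epoch
```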

@blaisewf
Member

could you try it and share your impressions?

@Bebra777228
Contributor Author

Sure, I'll give it a try 👌

@AznamirWoW
Contributor

AdamW may benefit from a warmup scheduler; RAdam does not need one.
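For example, a linear warmup could be chained in front of the existing exponential decay with SequentialLR. This is only a sketch under assumed names and values (warmup_epochs, the placeholder model, and the learning rates are illustrative), not the project's current code:

```python
import torch
from torch import nn

model = nn.Linear(8, 8)                                  # placeholder model
optim = torch.optim.AdamW(model.parameters(), lr=2e-4)   # AdamW, as discussed

warmup_epochs = 5  # illustrative
# Ramp the lr linearly from 1% of the base value up to the base value...
warmup = torch.optim.lr_scheduler.LinearLR(optim, start_factor=0.01, total_iters=warmup_epochs)
# ...then hand over to the exponential decay the training loop already uses.
decay = torch.optim.lr_scheduler.ExponentialLR(optim, gamma=0.999)
scheduler = torch.optim.lr_scheduler.SequentialLR(
    optim, schedulers=[warmup, decay], milestones=[warmup_epochs]
)

# for epoch in range(num_epochs):
#     train_one_epoch(...)
#     scheduler.step()
```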
