Fixing extended trace failure for Adam and AdaMax and generalising `alpha` parameter to accept callable object (scheduler) #1115

kishore-nori · 2024-10-30T02:03:37Z

This is a small PR for Adam and AdaMax methods (thanks a lot for adding them to Optim.jl) to

fix issue Extended trace for Adam fails #1096 (when using extended_trace=true) by adding alpha to the respective State structs, so that common_trace! works out of the box.
generalise alpha (and effected update_state!) for both Adam and AdaMax such that they can accept a callable object - a function or in particular a scheduler. This helps in using the functionality from ParameterSchedulers.jl seamlessly, without adding it as a dependency.

…heduler)

…ion (scheduler)

pkofod · 2024-10-30T13:32:27Z

I am fine with these changes and thank you for fixing the original error. I do have to ask you to update the docstrings of the types to reflect the new feature and also add tests for the bug that was fixed as well as the new feature.

Thanks!

codecov · 2024-10-30T13:42:44Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 85.75%. Comparing base (77501f4) to head (9d2a5f2).
Report is 3 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1115      +/-   ##
==========================================
+ Coverage   85.26%   85.75%   +0.49%     
==========================================
  Files          45       45              
  Lines        3502     3518      +16     
==========================================
+ Hits         2986     3017      +31     
+ Misses        516      501      -15

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

… constructors

…ha constructors

…nded_trace=true case

…Adam and AdaMax

kishore-nori · 2024-10-30T15:50:08Z

Thank you for checking the PR and for the comments. I have added:

details in the docstrings of Adam and AdaMax for scheduled alpha constructor
added tests for Adam and AdaMax, covering the extended_trace=true option that was failing (Extended trace for Adam fails #1096)
added tests for scheduled Adam and AdaMax convergence, and tests covering the extended_trace=true option to check if the alpha values are correct.

pkofod · 2024-11-01T12:53:06Z

Thank you

kishore-nori · 2024-11-01T15:13:02Z

I am glad, would be happy to know if you recommend any corrections or changes to make it merge ready.

Is it possible to run the CI workflow tests to check if the tests work fine? (locally they ran successfully)

pkofod · 2024-11-12T12:38:14Z

I am glad, would be happy to know if you recommend any corrections or changes to make it merge ready.

Is it possible to run the CI workflow tests to check if the tests work fine? (locally they ran successfully)

Running now :)

pkofod · 2024-11-12T13:27:15Z

Seem to fail, not sure why it failed in a newton test.

kishore-nori · 2024-11-12T13:44:56Z

Thanks a lot for running the CI. It unclear to me as well as to why Newton tests failed, and that too on MacOS. I ll investigate and get back.

pkofod · 2024-11-12T17:17:24Z

Not sure what happened to be honest.. This time it worked

kishore-nori · 2024-11-12T23:59:43Z

That's great! Thank you for re-running. Where can I check the earlier CI failure? (It somehow disappeared after new the CI runs)

kishore-nori · 2024-11-13T02:11:53Z

Ok I found the earlier CI runs to trace the failure: CI says Paraboloid Diagonal failed, which I think is the below test:

https://github.com/JuliaNLSolvers/OptimTestProblems.jl/blob/c1fba66c90b44934d13cd11b2502573f1c11fab8/src/optim_tests/multivariate/quad_transforms.jl#L107-L142

I have a feeling guardseed(0) failed to accomplish what it is supposed to and the random matrix used in the test could have been a "bad" one. But I am not sure if this problem could come from architecture or MacOS, or even "just" a failure of guardseed. However, shouldn't be related to the changes in this PR.

some related issues:

JuliaLang/julia#42752 and JuliaLang/julia#51225

pkofod · 2024-11-13T10:19:25Z

Thank you for looking into it. We'll have to see if it reappears but it is not relevant to the resolution of this pr.

kishore-nori added 3 commits October 30, 2024 12:36

adding alpha to state and generalizing alpha to accept a function (sc…

89efb0f

…heduler)

removing unused variables

ed269f0

adding alpha to AdaMax state and generalizing alpha to accept a funct…

5e5de78

…ion (scheduler)

kishore-nori added 4 commits October 31, 2024 02:03

updating the docstring for Adam to add description of scheduled alpha…

3b752fd

… constructors

updating the docstring for AdaMax to add description of scheduled alp…

f55fa3a

…ha constructors

adding tests for scheduled Adam and AdaMax, which covers testing exte…

8a8aa2d

…nded_trace=true case

adding default constant alpha case tests for extended_trace=true for …

b848abf

…Adam and AdaMax

Merge branch 'master' into kn/adam-gen-fix

9d2a5f2

pkofod closed this Nov 12, 2024

pkofod reopened this Nov 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixing extended trace failure for Adam and AdaMax and generalising `alpha` parameter to accept callable object (scheduler) #1115

Fixing extended trace failure for Adam and AdaMax and generalising `alpha` parameter to accept callable object (scheduler) #1115

kishore-nori commented Oct 30, 2024 •

edited

Loading

pkofod commented Oct 30, 2024

codecov bot commented Oct 30, 2024 •

edited

Loading

kishore-nori commented Oct 30, 2024

pkofod commented Nov 1, 2024

kishore-nori commented Nov 1, 2024

pkofod commented Nov 12, 2024

pkofod commented Nov 12, 2024

kishore-nori commented Nov 12, 2024

pkofod commented Nov 12, 2024

kishore-nori commented Nov 12, 2024 •

edited

Loading

kishore-nori commented Nov 13, 2024

pkofod commented Nov 13, 2024 •

edited

Loading

Fixing extended trace failure for Adam and AdaMax and generalising alpha parameter to accept callable object (scheduler) #1115

Are you sure you want to change the base?

Fixing extended trace failure for Adam and AdaMax and generalising alpha parameter to accept callable object (scheduler) #1115

Conversation

kishore-nori commented Oct 30, 2024 • edited Loading

pkofod commented Oct 30, 2024

codecov bot commented Oct 30, 2024 • edited Loading

Codecov Report

kishore-nori commented Oct 30, 2024

pkofod commented Nov 1, 2024

kishore-nori commented Nov 1, 2024

pkofod commented Nov 12, 2024

pkofod commented Nov 12, 2024

kishore-nori commented Nov 12, 2024

pkofod commented Nov 12, 2024

kishore-nori commented Nov 12, 2024 • edited Loading

kishore-nori commented Nov 13, 2024

pkofod commented Nov 13, 2024 • edited Loading

Fixing extended trace failure for Adam and AdaMax and generalising `alpha` parameter to accept callable object (scheduler) #1115

Fixing extended trace failure for Adam and AdaMax and generalising `alpha` parameter to accept callable object (scheduler) #1115

kishore-nori commented Oct 30, 2024 •

edited

Loading

codecov bot commented Oct 30, 2024 •

edited

Loading

kishore-nori commented Nov 12, 2024 •

edited

Loading

pkofod commented Nov 13, 2024 •

edited

Loading