How to update the learning rate during Flux training in a better manner?

Think of opa in your first code example as a constructor for an ADAM object. If you invoke opa repeatedly, you construct a new ADAM object every time, which is slow and also discards the optimiser state (Adam's per-parameter moment estimates) accumulated so far. Instead of doing that, in your training loop do opt.eta = new_learning_rate (eta is the learning-rate field of Flux's ADAM; check the Flux source if your version differs), which does not reconstruct the entire ADAM object but only changes a single field value inside the existing one.
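
For concreteness, here is a minimal sketch of the difference (the 0.001 starting rate and the 1/epoch schedule are placeholders, not taken from your code):

```julia
using Flux

# Anti-pattern: calling the constructor every iteration. Each call
# allocates a fresh ADAM and resets its internal moment estimates.
for epoch in 1:10
    opt = ADAM(0.001 / epoch)   # new object every time
    # ... Flux.train!(loss, ps, data, opt) ...
end

# Better: construct once, then mutate the learning-rate field in place.
opt = ADAM(0.001)
for epoch in 1:10
    opt.eta = 0.001 / epoch     # updates one field of the existing object
    # ... Flux.train!(loss, ps, data, opt) ...
end
```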

Note that this assumes ADAM as defined in Flux is a mutable struct; if it were immutable, you could not assign to its fields like this. No worries though: I checked the Flux source code and it is mutable.
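
To illustrate the distinction in plain Julia (MutableOpt and ImmutableOpt are made-up names for this sketch, not Flux types):

```julia
# Made-up types for illustration; these are not Flux types.
mutable struct MutableOpt
    eta::Float64
end

struct ImmutableOpt
    eta::Float64
end

m = MutableOpt(0.001)
m.eta = 0.01        # fine: fields of a mutable struct can be reassigned in place

i = ImmutableOpt(0.001)
# i.eta = 0.01      # errors: fields of an immutable struct cannot be changed
```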

For your second question: compute the new learning rate in whichever way you want, then assign it to the learning-rate field of your opt object as described above.
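
For example, here is a sketch with a hypothetical exponential decay schedule (the model, data, and decay factor are all placeholders, not from your code):

```julia
using Flux

# Placeholder model and data, just to make the sketch runnable.
model = Dense(10, 1)
data  = [(rand(Float32, 10), rand(Float32, 1)) for _ in 1:100]
loss(x, y) = Flux.Losses.mse(model(x), y)
ps = Flux.params(model)

opt        = ADAM(0.001)   # constructed once, outside the loop
initial_lr = 0.001
decay      = 0.9           # hypothetical per-epoch decay factor

for epoch in 1:20
    opt.eta = initial_lr * decay^(epoch - 1)   # any schedule you like
    Flux.train!(loss, ps, data, opt)
end
```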
