I am running a deep reinforcement learning (DRL) algorithm with Adam, and I want the learning rate to decay over time. As a minimal example, consider
using Flux, ParameterSchedulers
model = Chain(Dense(5, 10, gelu), Dense(10, 10, gelu), Dense(10, 1, softplus))
ps = Flux.params(model)
# gs is the gradient of the loss with respect to ps (see the sketch below)
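For reference, gs is obtained roughly like this; the loss, x, and y below are hypothetical stand-ins for my actual DRL loss and data:
x, y = rand(Float32, 5, 32), rand(Float32, 1, 32)  # dummy batch
loss(x, y) = Flux.Losses.mse(model(x), y)           # placeholder loss
gs = Flux.gradient(() -> loss(x, y), ps)            # Zygote implicit gradients (a Grads object)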
At the update step, I want the learning rate to decay, for example
sched = Sequence(Exp(λ = 1f-7, γ = 1000^(1/100)) => 100, Exp(λ = 1f-4, γ = 0.99) => 100)
optimiser = Scheduler(sched, ADAM())
Flux.update!(optimiser, ps, gs)
Unfortunately, this gives the error
ERROR: Optimisers.jl cannot be used with Zygote.jl's implicit gradients, `Params` & `Grads`
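For what it's worth, the schedule itself appears to give the decay I intend when evaluated on its own (assuming the callable-schedule interface from the ParameterSchedulers.jl docs); it is only the combination with Flux.update! that fails:
sched.(1:5)      # first few learning rates of the first Exp phase, starting from 1f-7
sched.(101:105)  # after 100 steps, the second Exp phase starting at 1f-4 takes over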
I can get Flux.update! to work only if I drop the scheduler and use a plain optimiser,
optimiser = Flux.Optimise.ADAM()
but then I am unable to use ParameterSchedulers.jl to vary the learning rate.
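In other words, the only loop that runs for me looks like the sketch below (reusing the hypothetical loss and data from above), with the learning rate stuck at a constant value:
optimiser = Flux.Optimise.ADAM(1f-4)
for step in 1:200
    gs = Flux.gradient(() -> loss(x, y), ps)
    Flux.update!(optimiser, ps, gs)  # runs, but the learning rate never decays
end
How can I make the ParameterSchedulers.jl schedule drive the learning rate of ADAM inside this kind of update loop?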