How to retrieve gradient value in custom Flux training loop?

Hello,

I have a few questions about custom training loops in Flux. According to the documentation, I should write something like this:

using Flux
using Flux.Optimise: update!

function my_custom_train!(loss, ps, data, opt)
  ps = Params(ps)
  for d in data
    gs = gradient(ps) do
      training_loss = loss(d...)
      # Insert whatever code you want here that needs the training loss, e.g. logging
      return training_loss
    end
    # Insert whatever code you want here that needs the gradient,
    # e.g. logging it with TensorBoardLogger.jl as a histogram so you can see if it is becoming huge
    update!(opt, ps, gs)
    # Here you might like to check validation set accuracy, and break out to do early stopping
  end
end
  1. If the ps argument is Flux.params(my_model), like in the Flux.train! method, is the ps = Params(ps) line redundant?
  2. gs seems to be an instance of Zygote.Grads; how can I retrieve the gradient values in the loop? I tried gs[ps] and gs[my_model] without any success.
  1. Yes, it shouldn’t be needed.
  2. The Grads type contains the fields grads and params, so you can access the raw gradients as gs.grads (an IdDict keyed by the parameter arrays).
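For example, with a throwaway Dense model and random data (a minimal sketch; indexing gs with a parameter array is usually more convenient than reading gs.grads directly):

using Flux

model = Dense(2, 1)
ps = Flux.params(model)
x, y = rand(Float32, 2, 4), rand(Float32, 1, 4)

gs = gradient(() -> Flux.Losses.mse(model(x), y), ps)

gs.grads       # IdDict mapping each parameter array to its gradient
W = first(ps)  # e.g. the weight matrix of the Dense layer
gs[W]          # gradient of the loss with respect to W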
  1. If the ps argument is Flux.params(my_model), like in the Flux.train! method, is the ps = Params(ps) line redundant?

Yes.
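You can verify this directly (a minimal sketch; the Dense layer is just a placeholder model):

using Flux

model = Dense(2, 1)
ps = Flux.params(model)
ps isa Flux.Params  # true: Flux.params already returns a Zygote.Params,
                    # so wrapping it again with Params(ps) is redundant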

  2. gs seems to be an instance of Zygote.Grads; how can I retrieve the gradient values in the loop? I tried gs[ps] and gs[my_model] without any success.
for p in ps
  println(gs[p])  # gradient of the loss with respect to parameter p
end
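If you want a single scalar to log each step, you can also reduce over the parameters. A sketch reusing gs and ps from the loop above (it assumes every parameter took part in the loss, so no gs[p] is nothing):

grad_norm = sqrt(sum(p -> sum(abs2, gs[p]), ps))  # global gradient norm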

Thank you for your help! This is exactly what I was looking for.