Gradient of Flux model wrt to weights

marius311 · May 19, 2021, 8:34pm

On newer versions of Zygote/Flux, you would have gotten the error message:

julia> grad = Flux.gradient(x -> model(x), some_input) # Gradient evaluated at the inputs
ERROR: output an array, so the gradient is not defined. Perhaps you wanted jacobian.

and indeed Flux.jacobian(x -> model(x), some_input) works. The terminology is generally that the derivative of a scalar function (like a loss) is the “gradient” and the derivative of a vector function is the “jacobian,” so you just need the latter (and to upgrade to the latest version). Of course, if in the end you do have a loss function, you’ll just want to do a gradient rather than explicitly calculating the intermediate jacobian.

Topic		Replies	Views
Problem on model and gradient descend in Flux General Usage	18	182	October 27, 2024
How to obtain the gradients of intermediate variables with Flux Machine Learning question , flux	11	1312	March 24, 2022
Flux loss: Gradient wrt input leads to empty gradient wrt parameters or to "can't differentiate foreigncall" Machine Learning flux , forwarddiff , diffeqflux	3	551	April 8, 2022
ERROR: Output should be scalar; gradients are not defined for output General Usage question	0	448	January 3, 2021
Simple Flux model not learning Machine Learning flux	4	1077	October 21, 2019

Gradient of Flux model wrt to weights

Related topics