In Zygote, why (model)->... in gradient

jaynick · February 9, 2020, 3:42am

In this (helpful!) post,
https://kiranshila.com/index.php/2020/02/04/teaching-myself-machine-learning-with-julia-part-1/

The (zygote) gradient call is

g = gradient(model -> mse(model.(x),y), model)

I tried instead doing

g = gradient(() -> mse(model.(x),y), model)

and it did not work.

What are the arguments to gradient supposed to be?
Does not seem to be specified in the documentation.

baggepinnen · February 9, 2020, 5:57am

the function gradient must know what you would like to take the gradient with respect to. Typically, you’d like to take the gradient wrt the parameters of the model, which Zygote let’s you do by simply passing the entire model.

You could also take the gradient wrt, for instance, the input x if you would like.

jaynick · February 9, 2020, 5:11pm

restating, your reply clarifies that giving ‘model’ as the explicit argument of the function tells zygote to take the gradient with respect to the parameters in model.

But what then is the purpose and meaning of the ‘model’ parameter that is passed gradient, this one
g = gradient(model → mse(model.(x),y), model)

Probably a different example would be better to explain. Suppose the function has two arguments,
h(x,y) = x+y
How do you get the gradient with respect to y?

baggepinnen · February 10, 2020, 6:15am

What you write means the following
g = gradient(z-> mse(z.(x),y), model)
You take the gradient of the anonymous function with respect to its input in the point model

baggepinnen · February 10, 2020, 6:16am

gradient(y->h(x, y), y)

Topic		Replies	Views
Help with Zygote and parameters New to Julia zygote	6	1501	July 1, 2020
Need some help in understanding zygote gradient Machine Learning	2	413	September 7, 2022
Zygote: treating a model output as a constant Machine Learning zygote	2	740	February 4, 2020
Strange behavior of differentiating constant functions in Zygote New to Julia	1	329	December 23, 2019
Differentiating implicit parameters using Zygote in complex hierarchical models New to Julia question , differentiation , flux	0	937	January 16, 2019

In Zygote, why (model)->... in gradient

Related topics