Flux.jl: params() and gradient() ocnfusion

Alex-ZA · August 21, 2021, 11:45am

Hello everyone,

I apologise if this is not the area to ask Flux questions. If not, please direct to the correct place.

I have been going through the Flux.jl documentation and came across this code.

julia> x = [2, 1];

julia> y = [2, 0];

julia> gs = gradient(params(x, y)) do
         f(x, y)
       end
Grads(...)

julia> gs[x]
2-element Vector{Int64}:
 0
 2

julia> gs[y]
2-element Vector{Int64}:
  0
 -2

I understand how the do end block works, but I am struggling to understand the use and function of params. I cannot find good documentation on it. If I missed it, please point me towards it!

Thank you for your time.

ToucheSir · August 21, 2021, 3:51pm

Welcome!

I actually want to turn this question around and ask you what you found to be deficient with the existing explanation in the paragraph above that code block. This feedback could be useful for improving the docs themselves.

Alex-ZA · August 23, 2021, 7:45am

Hi @ToucheSir, thank you for your reply! I accept your turn around

One would be able to figure out how params() works given the context of that specific example, but I would just like to see some documentation on the function itself. Unless I am missing something and being unreasonable?

DrChainsaw · August 23, 2021, 10:37am

I’m not an authority on this, but my guess is that params returns a Params struct and this is used for dispatch to disambigue the “normal” use of gradient which assumes that any arguments after the first shall be used as inputs to the first argument (which is always a function).

One possible reason why this is not very well described is that it is considered a temporary and somewhat ugly patch to bridge how Flux used gradients to update parameters in the pre-Zygote days. I think it has survied a fair bit longer than anyone intended/hoped it would.

As for the function params, it basically just traveres the model structure and searches for arrays with numbers using Functors.jl and puts them in a Params struct.

ToucheSir · August 23, 2021, 4:13pm

@DrChainsaw hit the nail on the head here. That said, we should add some documentation for this function, even if it just points to the tutorial. Do you mind filing an issue?

Topic		Replies	Views
Understanding Flux.jl use of `gradient` and `params` Machine Learning flux	4	3539	October 2, 2021
Calling Flux.params() inside gradient changes output? Machine Learning flux , zygote	2	356	September 28, 2021
Why calculating gradients from Params is different than doing it directly? General Usage flux	5	720	April 28, 2020
How is differentiation of implicit parameters implemented in Flux.jl? Internals & Design question , differentiation	0	459	March 7, 2021
Lifting a Julia function into a Flux "layer" Machine Learning flux	7	2205	May 29, 2019

Flux.jl: params() and gradient() ocnfusion

Related topics