Flux/Zygote: Gradient with respect to inputs and implicit parameters (in 2021)

Tomas_Pevny · November 23, 2021, 12:28pm

I think that your solution is the only solution at the moment.
I also think that the overhead would be small. In the union, you essentially create a shallow copy of IdDict and that should be pretty fast, in comparison of the price of the gradient.

You can check it out by yourself. Do few iterations where you will just take gradient with respect to parameters (no union) and then of your solutions. The preformance diff will be small.

Topic		Replies	Views
Differentiating implicit parameters using Zygote in complex hierarchical models New to Julia question , differentiation , flux	0	948	January 16, 2019
Why calculating gradients from Params is different than doing it directly? General Usage flux	5	761	April 28, 2020
Calling Flux.params() inside gradient changes output? Machine Learning flux , zygote	2	377	September 28, 2021
Understanding Flux.jl use of `gradient` and `params` Machine Learning flux	4	3656	October 2, 2021
Lux (And Flux), "parallel" Network Input. When Input is flat, Zygote gradient works, when input is not flat it doesn't Machine Learning flux , zygote , lux	10	746	February 5, 2024

Flux/Zygote: Gradient with respect to inputs and implicit parameters (in 2021)

Related topics