Manually giving gradients in Flux.jl

Hi,
I’m solving a Q-learning problem.
I’m trying to compute the gradient of the TD error. That is,

TD-error = Q^{\theta}(x, u) - (r(x, u) + \min_{v} Q^{\theta}(x^{+}, v))

where \theta = (\theta_1, \ldots, \theta_N) denotes the network parameters, and x, u, and x^{+} are the state, control, and successor state, respectively.

The difficulty comes from the \min. Writing u^{*}(x^{+}, \theta) = \arg\min_{v} Q^{\theta}(x^{+}, v), the chain rule gives

\frac{\partial Q^{\theta}(x^{+}, u^{*}(x^{+}, \theta))}{\partial \theta_i} = \frac{\partial Q^{\theta}}{\partial \theta_i} + \frac{\partial Q^{\theta}}{\partial u} \frac{\partial u^{*}}{\partial \theta_i}

I have been using Flux.jl to get gradients automatically.
The first term, \frac{\partial Q^{\theta}}{\partial \theta_i}, would be easy; it comes from

gs_theta = gradient(params(theta)) do
    Q(x_next, u_star, theta)
end

as shown in the Flux.jl manual.
The second term, \frac{\partial Q^{\theta}}{\partial u}, would also be easy, using

gs_u = gradient(params(u_star)) do
    Q(x_next, u_star, theta)
end

However, the last term, \frac{\partial u^{*}}{\partial \theta_i}, involves differentiating through the \min, which does not seem to be supported automatically by Flux.jl.
It may be achievable with DiffOpt.jl, giving, say, gs_u_star_theta_i for each parameter \theta_i, and the resulting gradient would then be gs_theta + gs_u * [gs_u_star_theta_1, ..., gs_u_star_theta_N].
So I would like to manually tell Flux.jl that “the gradient is the above equation”.
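
In code, what I have in mind is roughly the following (an untested sketch; J_u_star_theta[i] is a placeholder of mine for the Jacobian \frac{\partial u^{*}}{\partial \theta_i} that would come from DiffOpt.jl, and theta is assumed to be a collection of parameter arrays):

using Flux

ps = Flux.params(theta...)                                  # collect the parameter arrays θ₁, …, θ_N

gs_theta = gradient(() -> Q(x_next, u_star, theta), ps)     # ∂Q/∂θᵢ with u* held fixed
gs_u     = gradient(u -> Q(x_next, u, theta), u_star)[1]    # ∂Q/∂u evaluated at u = u*

# merge: total gradient w.r.t. θᵢ is ∂Q/∂θᵢ + (∂u*/∂θᵢ)ᵀ (∂Q/∂u)
total = IdDict()
for (i, p) in enumerate(ps)
    J = J_u_star_theta[i]               # Jacobian ∂u*/∂θᵢ, size length(u_star) × length(p)
    total[p] = gs_theta[p] .+ reshape(J' * gs_u, size(p))
end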

My questions are:

  1. How can I manually give a gradient in Flux.jl?
  2. How can I merge gradients appropriately, for example, the gs_u_star_theta_i’s?

The above description may be unclear. Please leave any comments :slight_smile:

Your θ is the network parameter, but where is your nn?

Q(x, u, theta) is itself a network.
More precisely, in my case Q is constructed as a function of nn(x, theta) and u to make sure that Q is convex in u.
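
As a toy illustration of what I mean (not my exact architecture; nx and nu stand for the state and control dimensions), Q can be built so that the u-dependent part has positive coefficients:

using Flux

nn = Chain(Dense(nx, 64, relu), Dense(64, nu + 1))   # maps x to the coefficients (a, b)

function Q(x, u, nn)
    out = nn(x)
    a = softplus.(out[1:nu])       # positive coefficients ⇒ Q is convex in u
    b = out[end]
    return sum(a .* u .^ 2) + b
end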

Flux relies on Zygote for autodiff. Zygote now uses ChainRules for the actual set of math rules that define the basic gradient transformations. The preferred way to define custom gradients now is through ChainRulesCore’s rrule:

https://juliadiff.org/ChainRulesCore.jl/stable/#frule-and-rrule

(also see Custom Adjoints · Zygote for the alternative legacy method and a bit more verbose explanation than what I just gave)
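
For instance, a custom rule around the inner minimisation could look roughly like this (an untested sketch; min_q, solve_inner_min, and manual_grad_theta are placeholder names for your own functions, and the pullback is where you would plug in the gradient from your equation / DiffOpt.jl):

using ChainRulesCore

# min_q(theta, x_next) is assumed to return min_v Q^θ(x⁺, v)
function ChainRulesCore.rrule(::typeof(min_q), theta, x_next)
    u_star = solve_inner_min(theta, x_next)      # your inner minimisation (e.g. via DiffOpt.jl)
    y = Q(x_next, u_star, theta)
    function min_q_pullback(ȳ)
        # your manual gradient ∂Q/∂θ + (∂Q/∂u)(∂u*/∂θ), as in the equation above
        ∂theta = ȳ .* manual_grad_theta(theta, x_next, u_star)
        return (NoTangent(), ∂theta, NoTangent())
    end
    return y, min_q_pullback
end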
