Ignore part of the gradient calculation inside differential equation model

facusapienza · November 9, 2022, 11:22pm

I am trying to differentiate the solution of a differential equation using DifferentialEquation with respect to some parameters but ignoring parts of the calculation of the gradient for being redundant or computationally expensive to compute. However, I cannot manage to ignore parts of the forward model when computing the gradient using Zygote. I am including next a MWE.

We can compute the gradient of the solution of a simple ODE with respect of the vector parameter p as follows

using DifferentialEquations
using Zygote, SciMLSensitivity
using Plots 
using DiffEqFlux
using ChainRulesCore
using Zygote: @ignore

p = [0.1, 0.2]

function dynamics(du, u, p, t)
    du[1] = - p[1] * u[1] + p[2]
end

dp = Zygote.gradient(p -> solve(ODEProblem(dynamics,
                                           [10.0],
                                           (0.0,10.0),
                                           tstops=[4.0], 
                                           p), Tsit5()).u[end][1], p)

which results in the final calculation of dp=([-42.072752200991175, 6.321205292676615],). Now, I would like to consider a case in which the dependency of the solution with one of the parameters, let say p[2] is ignored. Zygote allows ignoring certain computations of the gradient by using the macro @ignore, for example in the following example:

using Zygote: @ignore

function foo(x)
    y = @ignore x
    return y*x
end

where the computed gradient gives the formula f'(x) = x instead of f'(x) = 2x. However, running the previous example with the ignore macro inside dynamics() leads to the same numerical value of the gradient

function dynamics2(du, u, p, t)
    offset = @ignore p[2]
    du[1] = - p[1] * u[1] + offset
end

dp2 = Zygote.gradient(p -> solve(ODEProblem(dynamics2,
                                           [10.0],
                                           (0.0,10.0), 
                                           p), Tsit5()).u[end][1], p)

where dp2 = ([-42.072752200991175, 6.321205292676615],).

Does anyone knows if @ignore is supported for differential equations? There is a chance I am also missing something about the behavior of @ignore, but my understanding is that this command should ignore the dependency of certain parts of the code at the moment of applying AD.

Thank you!

Topic		Replies	Views
Ignore derivatives in ReverseDiff Machine Learning zygote , reversediff , autodiff , chainrulescore	0	376	April 8, 2023
Taking variable as independent with Zygote (Automatic differentiation) General Usage zygote , autodiff	1	465	February 10, 2022
Functionality of torch.nograd General Usage	1	217	January 2, 2023
Zero gradients with Zygote vs correct gradients with ReverseDiff using DiffEqFlux Machine Learning zygote , reversediff , diffeqflux	4	1403	January 24, 2022
Ignore_derivatives of entire module with ChainRules.jl Performance	1	335	February 28, 2023

Ignore part of the gradient calculation inside differential equation model

Related topics