Do any of the auto differentiation packages support specifying known derivatives by hand?

Some of the functions I’m trying to do AD over involve integrals, and rather than let the AD fight through my quadrature functions, I want to just use the fundamental theorem of calculus to tell the AD that, essentially,

d/dx quadgk(f, xmin, x) = f(x)

as this shows up in several places throughout the derivation. Is this type of thing supported in any of the AD packages?

Sure, most, if not all, reverse-mode AD tools allow this. In Flux it's done with the @grad macro, if I'm not mistaken.
http://fluxml.ai/Flux.jl/stable/internals/tracker.html#Custom-Gradients-1

Wow, that was awesomely easy. I think I did it right, although this is my first time using Flux:

using QuadGK
using Flux
using Flux.Tracker: TrackedReal, @grad, data, track, gradient

# untracked endpoints: just do the quadrature
myquad(f, a,              b)              = quadgk(f, a, b)[1]
# tracked endpoints: hand off to Tracker so the custom gradient below is used
myquad(f, a::TrackedReal, b::TrackedReal) = track(myquad, f, a, b)
myquad(f, a             , b::TrackedReal) = track(myquad, f, a, b)
myquad(f, a::TrackedReal, b             ) = track(myquad, f, a, b)
# adjoint from the fundamental theorem of calculus: ∂/∂a ∫_a^b f = -f(a), ∂/∂b ∫_a^b f = f(b)
@grad myquad(f, a, b) = myquad(f, data(a), data(b)), Δ -> (nothing, -Δ*f(a), Δ*f(b))

f(x) = 2*myquad(x->x^2,x,0)
f′(x) = gradient(f, x)[1]
f′′(x) = gradient(f′, x)[1]

f(1) # -0.6666666666666666
f′(1.) # -2.0 (tracked)
f′′(1.) # -4.0 (tracked)

I wonder whether type stability is possible, but I should probably just go read the Flux docs…

I am curious whether the other tools have something like this; I was not able to find it in ForwardDiff.jl/ReverseDiff.jl.

I started something like this in https://github.com/JuliaDiff/ForwardDiff.jl/pull/165 but it got a bit stale.

This should be dead easy with ForwardDiff too. You essentially need to do this:

using DualNumbers

f(x) = x^3
df(x) = 3x^2   # hand-written derivative

# chain rule: value from the real part, known derivative scaled by the incoming dual part
f(x::Dual) = dual(f(realpart(x)), df(realpart(x)) * dualpart(x))

f(dual(1, 1))  # value 1, derivative 3

but IIRC it's a little more tricky, as ForwardDiff allows propagating multiple ɛs at once.
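
For concreteness, here's a minimal sketch of how that might look with ForwardDiff itself, overloading ForwardDiff.Dual directly so that all ɛs (partials) are propagated at once (mycube/dmycube are made-up names, and this skips the tag/nested-dual subtleties):

using ForwardDiff
using ForwardDiff: Dual, value, partials

mycube(x::Real)  = x^3    # primal function
dmycube(x::Real) = 3x^2   # its hand-written derivative

# chain rule: evaluate at the primal value and scale *all* carried partials at once
mycube(d::Dual{T}) where {T} = Dual{T}(mycube(value(d)), dmycube(value(d)) * partials(d))

ForwardDiff.derivative(mycube, 2.0)  # 12.0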

With AutoGrad.jl it's a macro, a bit like Flux's @grad:

@primitive f(x),dy,y  dy .* fgrad(value(x))
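
To make that snippet self-contained, something like the following should work (my own sketch with placeholder names f/fgrad, untested against the current AutoGrad.jl):

using AutoGrad

f(x) = x^3        # plain method used for the forward value
fgrad(x) = 3x^2   # hand-written derivative

# register f as a primitive with the given gradient expression
@primitive f(x),dy,y  dy .* fgrad(value(x))

gradf = grad(f)   # grad returns the gradient w.r.t. the first argument
gradf(2.0)        # 12.0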

It is indeed quite easy for derivatives, but for Jacobians and gradients, you need to do a bit more work (to propagate the partials correctly).
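
To illustrate what "propagating the partials correctly" means for a gradient, here is a sketch extending the one-argument ForwardDiff overload above to a function of two variables (again my own illustration with made-up names; a complete version would also need the mixed Dual/Real methods):

using ForwardDiff
using ForwardDiff: Dual, value, partials

g(x, y)    = x^2 * y    # primal function
dgdx(x, y) = 2x * y     # hand-written partial derivatives
dgdy(x, y) = x^2

# chain rule: combine the partials carried by *both* arguments
function g(x::Dual{T}, y::Dual{T}) where {T}
    vx, vy = value(x), value(y)
    Dual{T}(g(vx, vy), dgdx(vx, vy) * partials(x) + dgdy(vx, vy) * partials(y))
end

ForwardDiff.gradient(v -> g(v[1], v[2]), [2.0, 3.0])  # [12.0, 4.0]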