Higher order derivatives in Flux

crinders · May 22, 2018, 3:08pm

I’ll let the Flux.jl people answer, but in the worst case you will have to write the ReverseDiff results into the tracked variable’s gradient data field, e.g.,

X.grad[:] = ReverseDiff.hessian(f, X.data) * X.data + ReverseDiff.gradient(f, X.data)

if your objective is $(\nabla f)^T x$, whose gradient with respect to $x$ is the Hessian of $f$ times $x$ plus the gradient of $f$. I don’t know whether that has any nasty side effects.
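
For reference, the right-hand side of that assignment can be checked by differentiating the objective term by term. This is only a short sketch, assuming f is twice continuously differentiable so that its Hessian is symmetric; the names g (the objective) and H (the Hessian of f) are introduced here for illustration and are not from the original post.

$$
\begin{aligned}
g(x) &= (\nabla f(x))^T x = \sum_i \frac{\partial f}{\partial x_i}(x)\, x_i \\
\frac{\partial g}{\partial x_j}(x) &= \sum_i \frac{\partial^2 f}{\partial x_j\, \partial x_i}(x)\, x_i + \frac{\partial f}{\partial x_j}(x) \\
\nabla g(x) &= H(x)\, x + \nabla f(x), \qquad H_{ji}(x) = \frac{\partial^2 f}{\partial x_j\, \partial x_i}(x)
\end{aligned}
$$

The first term corresponds to ReverseDiff.hessian(f, X.data) * X.data and the second to ReverseDiff.gradient(f, X.data).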
