Is there an efficient way to compute the Hessian of a NN?

ChrisRackauckas · July 30, 2019, 4:24pm

Reverse mode is going to give columns, and I don’t think you need that. Using double forward mode will be the fastest here. You’ll need to mapchildren to remove the tracker information (or use the Flux#zygote branch) and then just forward diff (or use a hyperdual)

If you do want to Forward-over-Reverse for Hess-vec products though, it is implemented in SparseDiffTools.jl

https://github.com/JuliaDiffEq/SparseDiffTools.jl#jacobian-vector-and-hessian-vector-products

but note that our tests don’t show that using Zygote here is the fastest yet

Topic		Replies	Views
Efficiently computing Hessians of Neural Networks output with respect to inputs Performance question , flux , pde , hessian	1	261	July 16, 2023
Hessian matrix of a neural network vetor output Specific Domains physics , zygote , neural-network	3	440	July 16, 2022
Autograd in Flux General Usage	14	1286	March 5, 2021
How to compute hessian Machine Learning	1	866	August 26, 2019
Batched gradients and hessians with Flux Machine Learning question	10	203	January 25, 2025

Is there an efficient way to compute the Hessian of a NN?

Related topics