Calculating hessian of a NN w.r.t params

stash-196 · February 24, 2021, 5:15am

How would you calculate a hessian of a Neural Network w.r.t. it’s parameters?

For instance, a hessian of the loss function below

using Flux: Chain, Dense, σ, crossentropy, params
using Zygote
model = Chain(
    x -> reshape(x, :, size(x, 4)),
    Dense(2, 5),
    Dense(5, 1),
    x -> σ.(x)
)
n_data = 5
input = randn(2, 1, 1, n_data)
target = randn(1, n_data)
loss = model -> Flux.crossentropy(model(input), target)

I can get a gradient w.r.t parameters in two ways…

Zygote.gradient(model -> loss(model), model)

or

grad = Zygote.gradient(() -> loss(model), params(model))
grad[params(model)[1]]

However, I can’t find a way to get a hessian w.r.t its parameters. (I want to do something like Zygote.hessian(model -> loss(model), model), but I can’t)

Topic		Replies	Views
Hessian matrix of ML model General Usage flux , zygote , forwarddiff , reversediff	9	2242	April 28, 2021
Hessian inside a Flux loss function Machine Learning question	3	794	February 26, 2021
Apply hessian of neural network to array Machine Learning	2	383	March 20, 2021
Fast Hessian and Gradient for PINNS using Enzyme/Zygote Performance question , flux , zygote , enzyme , hessian	0	359	July 23, 2023
Errors when trying to compute hessian of flux neural net and jacobian of jacobian with zygote Machine Learning flux , zygote	1	375	July 4, 2022

Calculating hessian of a NN w.r.t params

Related topics