How to add norm of gradient to a loss function?

AlexLewandowski · October 16, 2021, 9:02pm

I think what I am trying to do used to be possible in Zygote, based on the thread here: Gradient of gradient - #8 by martenlienen

The suggestion:

using Flux
net = Dense(10, 1)
x = randn(10, 128)  # dims, batch

function pred(x, net)
    y, pullback = Zygote.pullback(net, x)
    grads = pullback(ones(size(y)))[1]
    return grads
end

gradient(() -> sum(pred(x, net)), params(net)

Now throws the same error:

ERROR: Mutating arrays is not supported -- called copyto!(::Matrix{Float64}, _...)

Based on the comments in the linked thread, this used to work fine.

Topic		Replies	Views
How to do L2 regularization with new Flux and Zygote Machine Learning flux	2	2136	December 29, 2019
Clipping gradients with Zygote/Flux Machine Learning	2	1214	June 10, 2019
M. learning with regularization using Flux is too slow? Performance question , flux	5	420	February 9, 2024
Gradient error in Flux model inputs Machine Learning question , flux , zygote	5	1321	January 13, 2021
How to use gradient of neural network as the loss function? Machine Learning question	13	2735	March 23, 2021

How to add norm of gradient to a loss function?

Related topics