Zygote - parametrize matrix such that gradient is only performed on selected coefficients

jgbrasier · June 4, 2021, 2:50pm

I was wondering if it was possible to parametrize matrices such that zygote’s gradient would only be performed on the selected coefficients of the matrix.

Some dummy code would look like:

theta = randn(2)
W = [theta[1] theta[2]; 0.0 theta[1]]
b = randn(100)
x = randn(2, 100)

gs = gradient(w -> sum(w*x+b), W)

However the gradient would only be performed on the values of theta, keeping W[2, 1] set to 0.0 for the entire gradient descent. Also this would imply that the gradients for theta[1] and theta[2] be equal at all times.
Am I completely going at this wrong way? or is it just not possible in julia?

baggepinnen · June 5, 2021, 5:34am

You have the right idea, but you need to take the gradient w.r.t. theta instead of W, just wrap the function that accepts W in a function that accepts theta instead.

Topic		Replies	Views
Flux.params of a matrix implemented as a struct Machine Learning zygote	11	979	May 17, 2021
Help with Zygote and parameters New to Julia zygote	6	1503	July 1, 2020
Mutate Zygote Gradients with a Custom Mask Before Update? Machine Learning flux , zygote	1	572	March 25, 2021
Can't use Zygote.gradient on Symbolics.jl Matrix Functions General Usage zygote , symbolic	3	631	April 13, 2021
In Zygote, why (model)->... in gradient Machine Learning	4	780	February 10, 2020

Zygote - parametrize matrix such that gradient is only performed on selected coefficients

Related topics