Instatiate a tracked array? or dynamically store slices of a tracked array? (Not sure what to call this)

tue · August 2, 2019, 4:17am

I have some code where I compute what essentially boils down to the following:

di = zeros(n,knn)
for i=1:n
    di[i,:] = d[idx[i]]
end

Now the problem is that d is a tracked array, and hence it cannot be put inside di which is not tracked.
But how would I go about actually doing such an operation?

johnh · August 2, 2019, 7:53am

I seem to remember in Mike Innes talk at Juliacon that tracked arrays are no longer needed in the latest branch of FLuxML and/or Zygote. I could very well be blathering.

Talk here: JuliaCon 2019 | Differentiate All The Things! | Mike Innes - YouTube

freemint · August 2, 2019, 8:35am

When you look at the GitHub page. There are still performance regressions so it is not save to pick Zygote under all circumstances for all i know.

mcabbott · August 2, 2019, 10:59am

What is d? Here’s one possible interpretation, you can’t mutate a tracked array but you can construct one by indexing:

d = param(rand(2,3)) # TrackedMatrix

idx = [1,3,2,1];
di2 = d[:,idx] # Tracked

di1 = zeros(2,4); # almost your code
for i=1:4
    di1[:,i] = d[:,idx[i]].data
end
di1 # same numbers, but not tracked
di3 = d.data[:,idx] # ditto

Tamas_Papp · August 2, 2019, 11:11am

AFAIK Zygote is still considered experimental. Flux is a very robust alternative in the meantime.

tue · August 2, 2019, 6:37pm

x is the input data #cifar10 images for instance
y = F(x,theta) #F is the neural network, and y is the output of x through the network
d is built from the output data, hence:
d=g(y)
and is used to create my regularization.

I need to be able to track all steps in the creation of the regularization, since this is one of the things we are testing in the research article I’m currently writing.

Regarding your suggestion for the tracking mcabbott, I’m just worried about two things:

will it preserve the chain of tracking?, it is fundamental that the tracking goes all the way back to the data.
What exactly happens when you just overwrite a tracked parameter like what you are suggesting? is the rand value in some way influential or is that no longer tracked?

mcabbott · August 2, 2019, 7:30pm

Yes, di2 is tracked. You can check it’s working by calling back!(di2[1,1]) and seeing that d.grad is nonzero. (Or by wrapping this up in a function and calling gradient.)

di1 is not tracked, it reads just the .data part, which is an ordinary array. But it should have the same numbers as di2. If you were ever to write into the .data part of a tracked array, then strange things will happen, don’t do this! You have to find ways to work without writing into a fresh array with a loop, i.e. like di2 instead. (Or else you have to write a gradient for this step yourself.)

d=rand(...) is just a convenient way of making some numbers to try out. Really this should be your g(y).

Topic		Replies	Views
Mutating array in gradients? New to Julia flux , zygote	4	488	September 8, 2020
How to force using Tracker (not Zygote) in Flux 0.10 Machine Learning	1	455	January 29, 2020
Collect model outputs into multi-dimensional array Machine Learning flux	5	455	December 16, 2021
Flux.params of a matrix implemented as a struct Machine Learning zygote	11	973	May 17, 2021
On AD of matrix, from Tracker to Zygote General Usage	3	885	January 22, 2020

Instatiate a tracked array? or dynamically store slices of a tracked array? (Not sure what to call this)

Related topics