Why zip the data argument to the `Flux.train!` function?

I have been playing with the model zoo autoencoder code. I have a question about this line of code:

@epochs 10 Flux.train!(loss, params(m), zip(data), opt, cb = evalcb)

(permalink)

What is zip(data) doing to the data, and what type does Flux.train! expect for the data argument?

I am confused because data is an array of 60 batches, and zip has only the one argument, data. If I pass data directly, training stalls and does nothing. I suspect it has to do with the fact that zip(data) is an iterator, and maybe that is what Flux.train! needs for efficiency, but since I can’t find documentation for train!, I have no clue.

Any hints appreciated. Thanks!

julia> zz = zip([1,2,3], ["a","b","c"])
Base.Iterators.Zip{Tuple{Array{Int64,1},Array{String,1}}}(([1, 2, 3], ["a", "b", "c"]))

julia> for (x, y) in zz
           @show x, y
       end
(x, y) = (1, "a")
(x, y) = (2, "b")
(x, y) = (3, "c")
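
The single-argument form works the same way, except each element comes out as a 1-tuple:

julia> for t in zip([1, 2, 3])
           @show t
       end
t = (1,)
t = (2,)
t = (3,)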

You’re right: the zipped object can be iterated over, and each element is a tuple pairing an input with an output of your ML model.

And in the example you linked, it looks like zip is mainly there to satisfy the iterator-of-tuples shape that train! expects, not to zip an input and a label together.


Thank you for your answer, jling. I guess I always have to give train! an iterator for the data argument, then.

Maybe here’s another way to think about it (or at least, here’s how I think about it):

The data argument to train! just has to be an iterable of tuples that are splatted to loss. train! pretty much just does this:

for datapoint in data
    loss(datapoint...)
end

(Of course, it’s actually slightly fancier than that, since it takes the gradient and updates the parameters and all that stuff, but the training loop itself is quite straightforward, I think; the source is here, in case you’d like to take a look!)
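
To make that concrete, here is a rough sketch of that fancier loop in the implicit-params style this thread uses (model, loss, data, and opt are assumed to come from your own setup; this is a simplification, not the actual source):

using Flux

ps = Flux.params(model)                  # parameters to update
for datapoint in data
    gs = Flux.gradient(ps) do           # gradient with respect to those parameters
        loss(datapoint...)               # splat the tuple into the loss, just like above
    end
    Flux.Optimise.update!(opt, ps, gs)   # one optimizer step
end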

So, if you’ve defined your loss function to look like this:

function loss(x, y)
    # ...
end

…then you can pass in a vector of (x, y) tuples to train! as the data argument. It could just as easily be loss(a, b, c), if you calculate your model’s loss that way, in which case you’d want to pass in (a, b, c) tuples.
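
For example, a minimal end-to-end sketch (toy sizes, random data, and hypothetical names, just to show the shape of the data argument):

using Flux

model = Dense(4, 2)
loss(x, y) = Flux.mse(model(x), y)

xs = [rand(Float32, 4) for _ in 1:10]   # 10 toy inputs
ys = [rand(Float32, 2) for _ in 1:10]   # 10 matching targets
data = collect(zip(xs, ys))             # a Vector of (x, y) tuples

Flux.train!(loss, Flux.params(model), data, Descent(0.1))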

I hope that helps! :smiley:

This case is interesting in that — since it’s an autoencoder — the data is itself the label. That means that the loss function can be defined with just one argument.

A more typical use case of Flux might do something like:

loss(x, y) = Flux.mse(model(x), y)
Flux.train!(loss, params(model), zip(features, labels), opt)

Here features is the vector of all the input data, and labels is the corresponding vector of known outputs. Zipping them together combines the two vectors into a single iterable in which each datapoint sits in the same tuple as its label.

You could define an autoencoder with the loss definition above just by zipping data with itself, zip(data, data), or you could do as the model zoo does: recognize that a single argument is sufficient, in which case the one-argument zip is just a cute way of putting each element of the data vector into a 1-tuple that can be splatted into loss.
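
For concreteness, the two equivalent formulations might look something like this (a toy sketch with made-up layer sizes and random data):

using Flux

m = Chain(Dense(8, 3), Dense(3, 8))       # a toy autoencoder
data = [rand(Float32, 8) for _ in 1:10]   # each input is also its own target
opt = ADAM()

# Two-argument loss, zipping the data with itself:
loss2(x, y) = Flux.mse(m(x), y)
Flux.train!(loss2, Flux.params(m), zip(data, data), opt)

# One-argument loss, as in the model zoo; zip(data) yields 1-tuples like (x,):
loss1(x) = Flux.mse(m(x), x)
Flux.train!(loss1, Flux.params(m), zip(data), opt)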


Oh I see, yes that’s a good way to put it - I notice now that the loss function has only one argument, because of this particular case (loss(x) = mse(m(x), x)).

Thanks a lot!

Yes, that helps - and thank you also for linking the source code. It’s a good idea to go look, especially in Julia, where the source is often relatively concise and readable.
