Training gets different results when using Flux.train() inside function

This probably has to do with your loss(x, y) definition. I suspect the m in the function body is not referencing the m created inside all_the_code but some other m in global scope. In any case, it is better to explicitly pass the model into the loss function, because depending on the scoping of m, you may be using a global variable, which can cause performance issues (and bugs like this one!). Instead, you should define

loss(x, y, m) = Flux.mse(m(x), y)

Then, when you call train!, you can use a closure over m:

Flux.train!((x, y) -> loss(x, y, m), ps, datatrain, opt, cb = throttle(evalcb, time_show))

This will close over the m in the scope where Flux.train! is called, so unless you are doing something really unusual, it should reference the m you expect.
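
Putting it together, here is a minimal self-contained sketch of that pattern, using the same implicit-params train! API as above. The layer sizes, random data, ADAM optimizer, and the 5-second throttle are placeholder assumptions, not your actual setup:

using Flux
using Flux: throttle

function all_the_code()
    # hypothetical stand-in data: 10 features, 100 samples
    xtrain = rand(Float32, 10, 100)
    ytrain = rand(Float32, 1, 100)
    datatrain = [(xtrain, ytrain)]

    m = Chain(Dense(10, 32, relu), Dense(32, 1))

    # the model is an explicit argument, so nothing depends on a global m
    loss(x, y, m) = Flux.mse(m(x), y)

    ps = Flux.params(m)
    opt = ADAM()
    time_show = 5
    evalcb() = @show loss(xtrain, ytrain, m)

    for epoch in 1:100
        Flux.train!((x, y) -> loss(x, y, m), ps, datatrain, opt,
                    cb = throttle(evalcb, time_show))
    end
    return m
end

The key point is that loss only ever sees the m it is handed, so moving the code in and out of a function no longer changes which model gets trained.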