How can I go about updating scalar parameters in Flux.jl?

Hey everyone and happy holidays,

I’ve been playing around with Flux.jl, as I would like to base my new Bayesian deep learning package on this framework. While running the examples from the documentation, I noticed something I couldn’t wrap my head around.

The first example in Flux’s “Basic usage” documentation that uses the update! function looks something like this:

using Flux
using Flux.Tracker
using Flux.Tracker: update!

W, b = param(rand(2, 5)), param(rand(2))

predict(x) = W*x .+ b
loss(x, y) = sum((y .- predict(x)).^2)

x, y = rand(5), rand(2) # Dummy data
pars = Params([W, b])
grads = Tracker.gradient(() -> loss(x, y), pars)

update!(W, -0.1*grads[W])
loss(x, y)

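For completeness, the same pieces compose into a full gradient-descent loop. This is my own extension of the snippet above, reusing only the gradient and update! calls it already demonstrates:

for _ in 1:100
    gs = Tracker.gradient(() -> loss(x, y), pars)
    for p in (W, b)
        update!(p, -0.1 * gs[p]) # step each parameter against its gradient
    end
end
loss(x, y) # noticeably smaller after training
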
All of this works as expected. So far so good. Now, however, going back to the earlier example in the same section, I would like to update the parameters of that model, and that’s where I fail. The following code shows the issue:

using Flux
using Flux.Tracker
using Flux.Tracker: update!

W, b = param(2), param(3)

predict(x) = W*x + b
loss(x, y) = sum((y - predict(x))^2)

x, y = 4, 15
pars = Params([W, b])
grads = Tracker.gradient(() -> loss(x, y), pars)

update!(W, -0.1*grads[W])
loss(x, y)

The error I get is:

ERROR: MethodError: no method matching copyto!(::Float64, ::Base.Broadcast.Broadcasted{Base.Broadcast.DefaultArrayStyle{0},Tuple{},typeof(+),Tuple{Float64,Float64}})

so there’s a broadcasting error somewhere, as far as I can see, but no broadcasting is done in my functions in the second snippet. A simple answer could be that Flux doesn’t support updating scalar parameters, but that seems counterintuitive, since the manual shows an example of calculating gradients of a function parameterized by scalars. Did anyone run into this, or did I make a mistake somewhere?
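
For reference, taking gradients of scalar-parameterized functions works fine on its own; this is the example from earlier in the same docs page:

f(x) = 3x^2 + 2x + 1
df(x) = Tracker.gradient(f, x)[1]
df(2) # 14.0 (tracked)

A workaround that sidesteps the error is to store the scalars in one-element arrays, which brings us back to the territory of the first example. The W[1]/b[1] indexing is my own adaptation, not something from the docs:

W, b = param([2.0]), param([3.0])

predict(x) = W[1]*x + b[1]
loss(x, y) = sum((y - predict(x))^2)

x, y = 4, 15
pars = Params([W, b])
grads = Tracker.gradient(() -> loss(x, y), pars)

update!(W, -0.1*grads[W]) # works: grads[W] is a one-element array here
loss(x, y)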

Does grads[W] return a vector or a scalar?

grads[W] returns

-32.0 (tracked)

i.e. a scalar value.
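
If I had to guess, update! broadcasts the step into the tracked value’s data field with .+=, which works when data is an Array but has no copyto! method when it is a plain Float64, which matches the error above. A fix would presumably add a non-broadcasting method for scalar tracked values, something along these lines (purely a sketch of the idea, not actual Flux code; it assumes TrackedReal is mutable and that its accumulated gradient can be reset by plain assignment):

function Tracker.update!(x::Tracker.TrackedReal, Δ)
    x.data += Tracker.data(Δ)   # plain assignment instead of broadcasting .+=
    Tracker.tracker(x).grad = 0 # reset the accumulated gradient
    return x
end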

I have a fix in this PR. Thanks for reporting.
