Very simple Flux model refusing to converge

jClugstor · July 26, 2023, 6:48pm

I’m attempting to do something very simple for now. I just want to do a nonlinear curve fitting of sine using Flux.jl. So far, the model is completely refusing to converge, and is giving some very strange results. I must be doing something wrong because I’ve done this before and it worked very well.

using Flux, Plots, Statistics

timespan = 0:0.5:4*pi
out_dat = sin.(timespan)
plot(timespan,out_dat)


hidden = 5

dat = [([x],y) for (x,y) in zip(timespan,out_dat)]


model = Flux.Chain(
        Flux.Dense(1 => hidden,relu),
        Flux.Dense(hidden => 1))	

opt_state = Flux.setup(Adam(), model)
	
loss(mod,x,y) = Flux.Losses.mse(mod(x), y)
mean([loss(model,x...) for x in dat])

meanerr = 100
i = 0
while meanerr > 0.1
    i = i+1
    Flux.train!(loss, model, dat, opt_state)
    if i%10 == 0
        println(i)
        meanerr = mean([loss(model,x...) for x in dat])
        println(meanerr)
    end
end




NNresult = vcat(model.([[t] for t in timespan])...)

plot(NNresult, seriestype = :scatter)

As you can see the problem is very simple. But when I try to run the code, even after literally thousands of epochs the convergence is terrible. For example after 6000 epochs I get the following output

I’ve tried it with all kinds of different settings, activation functions, and number of hidden neurons. It’s probably something simple I’m missing, so if anyone is able to spot anything wrong I would be very grateful. Thanks

jClugstor · July 26, 2023, 8:13pm

It looks like I just needed to add some more layers, which surprised me, because I thought I had done it before with just one layer. Oh well.

Sevi · July 27, 2023, 7:39am

According to the Universal Approximation Theorem one layer should indeed be enough to fit virtually anything – if the number of nodes in that layer is “large enough”… which I guess 5 nodes isn’t

jClugstor · July 27, 2023, 12:00pm

I actually tried it with many different numbers of hidden neurons. Yeah, that’s why it surprised me that such a simple thing would be having such trouble converging, so I thought I was doing something wrong. Turns out that training stuff just turns out to be difficult.

Topic		Replies	Views
Simple Flux model not learning Machine Learning flux	4	1077	October 21, 2019
Nonlinear fit with Flux Machine Learning flux	2	954	January 10, 2021
Flux.jl changes in api General Usage	2	208	March 17, 2023
Why the Loss function does not decrease significantly in Flux.jl Machine Learning	2	331	February 2, 2023
Problems with Flux Machine Learning	2	1595	March 14, 2018

Very simple Flux model refusing to converge

Related topics