Need some help with FluxOpt

stochastic_guy · January 20, 2022, 4:56am

I am trying to use FluxOptTools (https://github.com/baggepinnen/FluxOptTools.jl) to train a Flux model with Optim. I followed the example in the README, and it works as intended. But when I adapted it slightly to my case, it stopped working properly: the code runs without errors, but the parameters of the model are not getting updated. Can anyone please help me figure out what I am doing wrong?
Here is my code:

using Pkg
Pkg.activate(".")
Pkg.add(["DataFrames", "RDatasets","Flux", "FluxOptTools", "Zygote", "Optim", "LossFunctions"])
using DataFrames
using RDatasets
using Flux, Zygote, Optim, FluxOptTools, Statistics
using LossFunctions
diabetes = dataset("MASS", "Pima.te")
y_df = diabetes[!,:Type] .== "Yes"
X_df = diabetes[!, Not(:Type)]
# Converting X and y into matrices and vectors 
y = vec(y_df)'
X = Matrix(Matrix(X_df)')

m      = Chain(Dense(7,20),    
                Dense(20,50),
                Dense(50,10),
                Dense(10,1,sigmoid)
                )
loss() = mean(value(PerceptronLoss(),m(X),y))

Zygote.refresh()
pars   = Flux.params(m) # Initializing parameters 
initial_par = pars
lossfun, gradfun, fg!, p0 = optfuns(loss, pars)
res = Optim.optimize(Optim.only_fg!(fg!), p0, LBFGS() ,Optim.Options(show_trace=true))
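
A side note on checking whether the parameters changed: in Julia, `initial_par = pars` binds a second name to the same object and does not copy it. Assuming the optimizer writes new values into the parameter arrays in place (an assumption about FluxOptTools, not something confirmed in this thread), comparing `initial_par` with `pars` afterwards would always report "no change". A minimal sketch with a plain array standing in for the model parameters (the values are hypothetical):

```julia
# A plain array stands in for the model parameters (hypothetical values).
w = [0.5, -1.0, 2.0]

alias    = w            # second name for the same array, no copy is made
snapshot = deepcopy(w)  # independent copy taken before "training"

# Simulate an optimizer that updates the parameters in place.
w .-= 0.1

println(alias === w)    # true: the alias is the very same array
println(alias == w)     # true: so this comparison always says "unchanged"
println(snapshot == w)  # false: the snapshot reveals the update
```

For the model above, something like `initial_par = [copy(p) for p in pars]` should give a snapshot that can be compared against `pars` after the call to `Optim.optimize`.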

contradict · January 20, 2022, 6:59pm

Since y >= 0 and m(X) > 0, their product is always >= 0, so PerceptronLoss, which is max(0, -y*m(X)), is always zero, and so is its gradient. That is why the parameters never move. Maybe L1HingeLoss would be more appropriate for this problem? I tried this:

loss() = value(L1HingeLoss(), y, m(X), AggMode.Mean())

But that results in NaN in the convergence measures and I couldn’t figure out how to fix that.
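
The argument about PerceptronLoss can be checked by hand without any packages beyond the standard library. The sketch below writes the perceptron and hinge losses directly as functions of the agreement `y * ŷ` (these are hand-written stand-ins, not the LossFunctions implementations), uses hypothetical sigmoid outputs, and also shows the effect of recoding the labels from 0/1 to -1/+1, which is the coding margin-based losses are normally defined for. It is not a confirmed fix for the NaN issue mentioned above.

```julia
using Statistics: mean

# Margin-based losses written by hand as functions of the agreement a = y * ŷ.
perceptron(a) = max(0, -a)
hinge(a)      = max(0, 1 - a)

ŷ   = [0.9, 0.2, 0.7, 0.4]  # hypothetical sigmoid outputs, all in (0, 1)
y01 = [1, 0, 1, 0]          # labels coded as 0/1, as in the original post
ypm = 2 .* y01 .- 1         # the same labels recoded as -1/+1

loss01 = mean(perceptron.(y01 .* ŷ))  # agreement is never negative here
losspm = mean(perceptron.(ypm .* ŷ))  # negative agreement for the -1 labels
hingepm = mean(hinge.(ypm .* ŷ))

println(loss01)       # 0.0: constant in the model output, so the gradient is zero
println(losspm > 0)   # true: the loss now depends on the model output
println(hingepm > 0)  # true
```

With 0/1 labels the perceptron loss is identically zero for any positive output, which matches the behavior in the original post: the optimizer has nothing to descend on, so it returns the starting point.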
