Hi all,
I’m trying to see whether I can recover the weights of a logistic regression from synthetically generated data, but for some reason the weights optimized with MLJ.jl come out with a large error.
To create the data, I’m using a sigmoid with no intercept, a decision threshold of 0.5, and 3 features:
using MLJ, MLJLinearModels, LinearAlgebra

sigmoid(x) = 1.0 / (1.0 + exp(-x))

function generate_y(true_weights::Vector{Float64}, U::Matrix{Float64})
    n_time_points = size(U, 2)  # Length of the time series
    y = [sigmoid(dot(true_weights, U[:, t])) >= 0.5 for t in 1:n_time_points]
    return y
end
function recover_weights(y::Vector{Bool}, U::Matrix{Float64})
    # Convert U to an MLJ table (rows = time points, columns = features)
    U_table = MLJ.table(permutedims(U))
    # Convert the binary target y into a categorical variable
    y_categorical = coerce(y, Multiclass)
    # Define the logistic regression model
    logistic_model = LogisticClassifier(lambda=0.02, fit_intercept=false)
    # Create a machine to fit the model
    mach = machine(logistic_model, U_table, y_categorical)
    # Fit the model to recover the weights
    fit!(mach)
    # Get the recovered weights
    recovered_weights = fitted_params(mach).coefs
    return recovered_weights
end
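One note in case I’m misreading the output: fitted_params(mach).coefs seems to come back as a vector of feature => value pairs rather than a plain vector, so I strip out the numeric values with a small helper (the name is mine):

# Helper (my own naming): turn the feature => value pairs returned by
# fitted_params(mach).coefs into a plain Float64 vector
coef_values(coef_pairs) = Float64[last(p) for p in coef_pairs]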
When running it with:
true_weights = [0.2, -0.07, 0.005]
# Generate a random control matrix U with 3 features at each time point
n_time_points = 1000
U = rand(3, n_time_points)  # 3 × n_time_points, entries uniform in [0, 1)
y = generate_y(true_weights, U)
recovered_weights = recover_weights(y, U)
I’m getting weights that are far from the true ones (I do get a negative value for the second one, which is encouraging).
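For concreteness, this is how I’m comparing them, just printing the two vectors side by side (no rescaling or normalization):

# Side-by-side comparison of true vs. recovered weights,
# using the coef_values helper defined above
display(hcat(true_weights, coef_values(recovered_weights)))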
Is there a way to improve this efficiently? Can I specify the 0.5 decision threshold in MLJ, or should I use another optimizer?
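For example, would explicitly picking a solver like this be the right direction? (I’m guessing at the solver keyword from the MLJLinearModels docstring, so please correct me if that isn’t the right knob.)

# Sketch only: same model, but with an explicitly chosen solver.
# Assumes LogisticClassifier exposes a `solver` keyword and that
# MLJLinearModels.LBFGS() is a valid choice for it.
logistic_model = LogisticClassifier(lambda=0.02, fit_intercept=false,
                                    solver=MLJLinearModels.LBFGS())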
Any ideas?
Thank you!!