Fitting a multiple input Flux.jl model with learning networks in MLJ.jl

Hey there, I have tried to use MLJ.jl to fit a Flux model that takes multiple inputs. While the Flux.jl documentation demonstrates how to build such a model, I unfortunately cannot make it work within MLJ.

Here is an MWE, attempting to make use of learning networks.

using MLJ
using Flux, MLJFlux
using UnPack

# Defining custom Flux model
struct PowerModelNN{NN1, NN2}
    nn1::NN1
    nn2::NN2
end
  
Flux.@functor PowerModelNN

function (m::PowerModelNN)(xs::Tuple)
    a, pred = xs
    return m.nn1(pred) .* a .^ m.nn2(pred)
end
(m::PowerModelNN)(xs...) = m(xs)
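
# For example, the model can be called directly on a tuple of matrices
# (feature counts here match the synthetic data generated below):
#   m_check = PowerModelNN(Chain(Dense(2, 1)), Chain(Dense(2, 1)))
#   m_check(rand(Float32, 1, 5), rand(Float32, 2, 5))  # -> 1×5 output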

# Defining custom composite model
mutable struct MultipleInputCompositeModel <: MLJ.DeterministicNetworkComposite
    fsa_model
    fspred_model
    std_model
    sar_model
end

import MLJBase
function MLJBase.prefit(m::MultipleInputCompositeModel, verbosity::Int, X, y)
    @unpack fsa_model, fspred_model, std_model, sar_model = m

    Xs = source(X)
    ys = source(y)
    
    as = MLJ.transform(machine(fsa_model, Xs), Xs)
    preds = MLJ.transform(machine(fspred_model, Xs), Xs)
    pred_std = MLJ.transform(machine(std_model, preds), preds)

    # Here is the bit which is failing
    sar_machine = machine(sar_model, (as, pred_std), ys)
    ŷ = predict(sar_machine, (as, pred_std))
    return (; predict = ŷ)
end

# Building multiple input model and machine
NN = MLJ.@load NeuralNetworkRegressor pkg=MLJFlux

# pred has two features (x1 and x2 below), hence the input size of 2
nn1 = Chain(Dense(2, 10), Dense(10, 1, relu, bias=false))
nn2 = Chain(Dense(2, 10), Dense(10, 1, relu, bias=false))
builder = MLJFlux.@builder PowerModelNN(nn1, nn2)

mdl = MultipleInputCompositeModel(FeatureSelector(features=[:a]),
                                  FeatureSelector(features=[:a], ignore=true),
                                  Standardizer(),
                                  NN(builder=builder, loss=Flux.poisson_loss))

# Generating synthetic data
using DataFrames
X = DataFrame(:a => rand(100), :x1 => randn(100), :x2 => randn(100))
y = exp.(2 * X.x1 .+ X.x2) .* X.a .^ exp.(X.x2)

# Fitting the machine
cm = machine(mdl, X, y)
fit!(cm)

I get the error "Mixing concrete data with Node training arguments is not allowed." I believe this comes from the fact that one cannot use a tuple of Node objects as an argument to a model or machine. Does anyone have an idea of a solution to make this work?

Cheers!


I haven’t tried to reproduce this, but it sounds like you just want to join (horizontally concatenate) the two tables represented by the nodes as and pred_std, yes?

If we assume the tables are DataFrames with non-intersecting column names, then you can try replacing (as, pred_std) with hcat(as, pred_std), which will hopefully work, because hcat is overloaded to work for nodes out of the box (the long-hand form is node(hcat, as, pred_std)).
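
A minimal sketch of what that change to prefit might look like (untested, and assuming sar_model is happy with the joined table):

# inside prefit, replacing the failing lines:
Xjoined = hcat(as, pred_std)   # a node; long-hand: node(hcat, as, pred_std)
sar_machine = machine(sar_model, Xjoined, ys)
ŷ = predict(sar_machine, Xjoined)
return (; predict = ŷ)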

I don’t know the latest recommendation for general tables, but you can follow this link to get a solution: Add method to horizontally concatenate two (or more) tables of possibly different type · Issue #30 · JuliaData/TableOperations.jl · GitHub
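
For what it’s worth, one generic approach for arbitrary Tables.jl tables (my own sketch, assuming disjoint column names, not an official API) is to merge their column-table representations:

using Tables
# horizontally concatenate two tables by merging their
# NamedTuple-of-vectors forms (column names must not clash)
table_hcat(t1, t2) = merge(Tables.columntable(t1), Tables.columntable(t2))
# inside the network: node(table_hcat, as, pred_std)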


Thanks for the swift reply!

Unfortunately, your solution does not work, as the Flux model NN should take a tuple as an argument, not a single table. This is required because the entries as and pred_std are processed differently inside the Flux model.

Based on your suggestion, I tried node(tuple, as, pred_std), but unfortunately this does not work either.


Sorry, I indeed misunderstood your problem.

Your current approach will not work because you are violating an assumption about how MLJFlux.NeuralNetworkRegressor works. The Flux model m created by the builder can only be called on matrices (and vectors), not tuples of matrices.

In more detail, the X in a call such as machine(NeuralNetworkRegressor(...), X, y) must be a matrix or a table with, say, p columns. This X is converted to a vector of p × b matrices, where b is the batch size, and in training m is called on each of these matrices.
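
Roughly, the conversion looks like this (a paraphrase to illustrate the idea, not the actual source):

using Tables
b = 32                                      # illustrative batch size
Xmat = Tables.matrix(X, transpose = true)   # table -> p × n matrix
batches = [Xmat[:, idx] for idx in Iterators.partition(1:size(Xmat, 2), b)]
# in training, m is called on each of these p × b (or smaller) matrices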

In case you are curious, the relevant sections of the code base are MLJFlux's implementation of MLJModelInterface.fit and the MLJFlux.collate function, which performs this conversion.


Ok, thanks a lot for pointing me to these pieces of code. I’ll just try to overload MLJModelInterface.fit and MLJFlux.collate for a new MultiInputNeuralNetworkRegressor. I’ll post the solution once I find it, and maybe open a PR if relevant.
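
For anyone following along, here is the core idea I plan to try, sketched in plain Julia (the function name and signature are placeholders of mine, not MLJFlux API): batch each input table separately, then zip the batches into tuples, so that the Flux model sees (a_batch, pred_batch) at each training step.

using Tables
# placeholder sketch, not MLJFlux API
function collate_multi(Xa, Xpred, batch_size)
    A = Tables.matrix(Xa, transpose = true)     # pa × n
    P = Tables.matrix(Xpred, transpose = true)  # pp × n
    idxs = Iterators.partition(1:size(A, 2), batch_size)
    return [(A[:, i], P[:, i]) for i in idxs]
end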