Hi,
I’m taking one of my first steps into MLJ by writing a simple script that can predict diverse fields for my research. I know what I want to do can be achieved using simpler interpolation methods or Flux.jl, but I want to do it using MLJ, just for me to learn and explore the ecosystem.
Here is my goal: I have a set of Array{Union{Missing, Float32}, 2}
, each one associated with a triplet [p1, p2, p3]
of parameters. I want to train the model on this set, and obtain a prediction function that takes any triplet of parameters as input and returns “the best matching” array.
My issue is that there is no model matching my data, according to models(matching(X))
where X
is a Vector{Vector{Float64}}
(I removed the missing
value from the original arrays).
In summary, my dataset has the following schema
:
┌────────┬────────────────────────────┬─────────────────┐
│ names │ scitypes │ types │
├────────┼────────────────────────────┼─────────────────┤
│ param │ AbstractVector{Continuous} │ Vector{Float64} │
│ field │ AbstractVector{Continuous} │ Vector{Float64} │
└────────┴────────────────────────────┴─────────────────┘
I can easily remove the vector structure of param
and make 3 different columns, but I want to keep the field as a whole.
How else would you format such data so that MLJ proposes compatible models?
Thanks a lot,
L.