I tried to read through the Adding Models for General Use · MLJ article, but I can’t find any information on how I can expose the hyper-parameters in my model to MLJ.jl so that I can use MLJ’s infrastructure to do CV. Can someone give some info here so it’s easily searchable for future reference?
Hello, to interface with MLJ you basically need to tick the following boxes:
- import MLJBase in your package
- write a constructor for your model that meets the requirements for MLJ (basically a mutable struct which contains the hyperparameters of your model) and that is a subtype of either `Probabilistic` or `Deterministic` (in your case, a binary classifier without scores goes under `Deterministic`),
- write a `clean!` method which checks that the hyperparameters passed meet the constraints of your model,
- write a `fit`, a `fitted_params` and a `predict` method,
- add metadata using the `metadata_pkg` and `metadata_model` functions.
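To make the checklist concrete, here is a minimal sketch of such an interface. Everything below is hypothetical for illustration: `MyBinaryClassifier`, its `threshold` and `max_iter` hyperparameters, and the trivial "majority class" learner are all made up, and the `metadata_pkg`/`metadata_model` keyword names can vary slightly across MLJBase versions, so check the docstrings of your installed version:

```julia
import MLJBase

# Hypothetical model: holds only hyperparameters (no learned parameters).
mutable struct MyBinaryClassifier <: MLJBase.Deterministic
    threshold::Float64
    max_iter::Int
end

# Keyword constructor that validates the hyperparameters via clean!.
function MyBinaryClassifier(; threshold=0.5, max_iter=100)
    model = MyBinaryClassifier(threshold, max_iter)
    message = MLJBase.clean!(model)
    isempty(message) || @warn message
    return model
end

# clean! resets invalid hyperparameters and returns a warning string.
function MLJBase.clean!(model::MyBinaryClassifier)
    warning = ""
    if !(0 <= model.threshold <= 1)
        warning *= "threshold must lie in [0, 1]; resetting to 0.5. "
        model.threshold = 0.5
    end
    if model.max_iter < 1
        warning *= "max_iter must be positive; resetting to 100. "
        model.max_iter = 100
    end
    return warning
end

# fit returns (fitresult, cache, report); the fitresult holds the learned
# parameters. Here "training" is just picking the majority class of y.
function MLJBase.fit(model::MyBinaryClassifier, verbosity::Int, X, y)
    labels = unique(y)
    fitresult = labels[argmax([count(==(c), y) for c in labels])]
    return fitresult, nothing, NamedTuple()
end

MLJBase.fitted_params(::MyBinaryClassifier, fitresult) = (majority_class = fitresult,)

# predict consumes the fitresult produced by fit.
MLJBase.predict(::MyBinaryClassifier, fitresult, Xnew) =
    fill(fitresult, MLJBase.nrows(Xnew))

# Metadata (keyword names shown here follow older MLJBase releases; newer
# versions may use e.g. input_scitype/target_scitype instead of input/target):
MLJBase.metadata_pkg(MyBinaryClassifier,
    name = "MyPackage", uuid = "", url = "",
    julia = true, license = "MIT", is_wrapper = false)
MLJBase.metadata_model(MyBinaryClassifier,
    input = MLJBase.Table(MLJBase.Continuous),
    target = AbstractVector{<:MLJBase.Finite},
    descr = "majority-class baseline (illustration only)")
```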
To help you with this I would recommend considering:
- any of the examples in MLJModels.jl (for instance the XGBoost interface)
- the MLJLinearModels package which shows maybe more explicitly how you can write an interface from an external package
You’ll see that these interfaces all follow essentially the same pattern, so it should be reasonably easy to adapt to your case.
Note: with respect to writing a constructor + `clean!` method, you can also use the `@mlj_model` macro, which does some of the work for you; again, see the examples in MLJModels, for instance the interface for NearestNeighbors.jl.
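For example, the macro can generate the keyword constructor and the `clean!` logic from default values and constraints declared on the fields (`MyClassifier` and its fields are made up here; `_` stands for the field value in a constraint):

```julia
using MLJBase: @mlj_model, Deterministic

# Hypothetical model: defaults and constraints are declared inline, and
# @mlj_model writes the keyword constructor and validation for you.
@mlj_model mutable struct MyClassifier <: Deterministic
    threshold::Float64 = 0.5::(0 ≤ _ ≤ 1)
    max_iter::Int = 100::(_ > 0)
end

MyClassifier()                  # uses the defaults
MyClassifier(threshold = 2.0)   # constraint violated: warns and falls back to 0.5
```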
My question is more about CV and hyperparameter tuning. Each model type's hyperparameters are different, so how do I tell MLJ which hyperparameters to tune? That's the part I'm not sure about at the moment.
Once you have a working interface, the HP tuning via CV is done through a `TunedModel`; either see the docs or look at this tutorial using XGBoost for an example.
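A short sketch of what that looks like, assuming MLJ and DecisionTree.jl are installed (the `@load` semantics differ slightly between MLJ versions, so adjust to yours):

```julia
using MLJ

X, y = @load_iris                         # built-in demo dataset
Tree = @load DecisionTreeClassifier pkg=DecisionTree verbosity=0
tree = Tree()

# tune max_depth over 1:10 with 5-fold CV
r = range(tree, :max_depth, lower=1, upper=10)
tuned = TunedModel(model=tree,
                   tuning=Grid(resolution=10),
                   resampling=CV(nfolds=5),
                   range=r,
                   measure=cross_entropy)

mach = machine(tuned, X, y) |> fit!
fitted_params(mach).best_model            # the winning hyperparameter setting
```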
Edit:
- "how do I tell which are the hp to tune": MLJ considers all fields of the model struct to be hyperparameters that can be tuned
- "each type are a bit different": that's handled automatically; there are two scenarios: either the HP is numeric, in which case you specify a `lower=` and an `upper=` and MLJ works out an appropriate sampling that matches the type of the HP, or it's not numeric, in which case you specify a `values=[...]` (e.g. if the HP is a symbol, a string or a metric)
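The two scenarios side by side, using a made-up model with a numeric `lambda` and a non-numeric `solver` field (substitute your own model's fields):

```julia
using MLJ
import MLJBase

# Dummy model for illustration only.
mutable struct Dummy <: MLJBase.Deterministic
    lambda::Float64
    solver::Symbol
end
model = Dummy(1.0, :lbfgs)

# Numeric HP: give lower/upper (optionally a scale); MLJ infers an
# appropriate sampling from the field type (here Float64).
r1 = range(model, :lambda, lower=1e-3, upper=10.0, scale=:log10)

# Non-numeric HP: give the explicit values to try.
r2 = range(model, :solver, values=[:lbfgs, :newton])
```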
Yes, to emphasise the point already made by @tlienart, in MLJ a “model” struct only contains hyperparameters and not learned parameters. The learned parameters are part of the output of the model `fit` method you must implement (and labeled `fitresult` in the docs), and part of the input of the `predict` (or `transform`, etc.) method.
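That separation can be seen in a toy `Deterministic` model (the name and hyperparameter below are made up): the struct holds only the hyperparameter, while the learned quantity travels from `fit` to `predict` inside the `fitresult`:

```julia
import MLJBase

# Hypothetical model: `shrinkage` is the hyperparameter; the shrunken
# mean of y is the learned parameter, living in the fitresult.
mutable struct MeanScaler <: MLJBase.Deterministic
    shrinkage::Float64
end

function MLJBase.fit(model::MeanScaler, verbosity::Int, X, y)
    fitresult = model.shrinkage * sum(y) / length(y)  # learned parameter
    return fitresult, nothing, NamedTuple()           # (fitresult, cache, report)
end

MLJBase.fitted_params(::MeanScaler, fitresult) = (mean = fitresult,)

MLJBase.predict(::MeanScaler, fitresult, Xnew) =
    fill(fitresult, MLJBase.nrows(Xnew))
```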
Yeah. Implemented a first cut here https://github.com/xiaodaigh/JLBoost.jl#mljjl-integrations