Reposting this question from a Slack channel.
The question of regularization for neural networks is a bit
complicated, and I’m no expert. It has frequently been observed that
increasing the total number of weights (increasing complexity) does
not necessarily lead to over-fitting, but the phenomenon is poorly
understood in general. In this paper Belkin et
al. (2019) introduce specific examples of networks where a “double
descent risk curve” should be expected (so no over-fitting). However,
in this preprint Nichani et al. (2020) argue that while increasing network width may not
lead to over-fitting, increasing depth can still lead to over-fitting.
Returning to the question, the regularization options in MLJFlux/Flux are listed below (rough sketches of each option follow the list):

- Early stopping: you end training when an out-of-sample error begins to deteriorate. MLJ's `IteratedModel` wrapper is useful for automating this. See the Boston or MNIST examples here.
- Add `Dropout` layers to your Flux model (aka chain), via the `builder` hyper-parameter of your MLJFlux model. See the Normalization & Regularization section of the Flux manual.
- Add L1/L2 weight-penalty regularization by specifying appropriate values of the hyper-parameters `lambda` (strength of regularization) and `alpha` (the L2/L1 mix) of your MLJFlux model. If `alpha=0` there is only L2 regularization; if `alpha=1`, only L1.
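
Here's a minimal sketch of the early-stopping option, assuming a `NeuralNetworkRegressor` trained on synthetic data from `make_regression`. The builder, measure, and control values are placeholders, and the exact `IteratedModel` keywords can vary a little between MLJ versions, so check the MLJIteration docs:

```julia
using MLJ
import MLJFlux

# Illustrative data only; substitute your own table / target:
X, y = make_regression(500, 10)

NeuralNetworkRegressor = @load NeuralNetworkRegressor pkg=MLJFlux
model = NeuralNetworkRegressor(builder=MLJFlux.Short(n_hidden=32), epochs=1)

# Wrap the model so training proceeds one epoch at a time and stops
# when the holdout loss stops improving:
iterated_model = IteratedModel(model=model,
                               resampling=Holdout(fraction_train=0.8),
                               measure=rms,
                               controls=[Step(1),            # one epoch per control cycle
                                         Patience(5),        # stop after 5 consecutive deteriorations
                                         NumberLimit(200)],  # hard cap on the number of cycles
                               retrain=true)                 # retrain on all data once stopped

mach = machine(iterated_model, X, y)
fit!(mach)
```

The `controls` vector above is just one reasonable combination; the MLJ manual's section on controlling iterative models lists everything available.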
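
For dropout, the quickest route is the built-in `MLJFlux.Short` builder, which already has a `dropout` hyper-parameter. The sketch below shows the more general custom-builder route instead; the `DropoutMLP` name, layer widths, and dropout rate are made up for illustration, and the `build` signature with the `rng` argument (and the `Dense(in => out, ...)` form) assumes reasonably recent MLJFlux/Flux versions:

```julia
using MLJ
import MLJFlux, Flux

# A custom builder whose `build` method returns a Flux chain containing Dropout layers.
mutable struct DropoutMLP <: MLJFlux.Builder
    n1::Int          # width of first hidden layer
    n2::Int          # width of second hidden layer
    dropout::Float64 # dropout probability
end

function MLJFlux.build(b::DropoutMLP, rng, n_in, n_out)
    init = Flux.glorot_uniform(rng)
    return Flux.Chain(
        Flux.Dense(n_in => b.n1, Flux.relu, init=init),
        Flux.Dropout(b.dropout),
        Flux.Dense(b.n1 => b.n2, Flux.relu, init=init),
        Flux.Dropout(b.dropout),
        Flux.Dense(b.n2 => n_out, init=init),
    )
end

# Pass the builder to the MLJFlux model via the `builder` hyper-parameter:
NeuralNetworkRegressor = @load NeuralNetworkRegressor pkg=MLJFlux
model = NeuralNetworkRegressor(builder=DropoutMLP(64, 32, 0.3), epochs=50)
```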
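
The weight-penalty option is just a matter of setting two hyper-parameters on the MLJFlux model; the values below are arbitrary placeholders:

```julia
using MLJ
import MLJFlux

NeuralNetworkRegressor = @load NeuralNetworkRegressor pkg=MLJFlux

# Elastic-net style penalty: `lambda` scales the penalty, `alpha` mixes L2 and L1.
model = NeuralNetworkRegressor(builder=MLJFlux.Short(n_hidden=32),
                               epochs=50,
                               lambda=0.01,  # overall regularization strength
                               alpha=0.0)    # 0.0 => pure L2, 1.0 => pure L1
```

Like any other hyper-parameters, `lambda` and `alpha` can also be wrapped in a `TunedModel` and searched over, typically with `lambda` on a log scale.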