Custom layer in Lux

rkube · July 30, 2024, 4:38pm

Hi,
I’m trying to implement a custom layer in Lux that is similar to a linear layer but needs to apply a user-defined mask to the weights. It needs to compute

h(x) = g(b + (W ⊙ M)(x))

where g is an activation function, b the bias vector, W the weight matrix and M the user-defined mask.

Some things that are unclear to me after reading the documentation:

The weight matrix in my layer is user-definable and may need to be updated. Should the matrix be defined in the struct definition of the layer or in the parameters?
My implementation uses explicit function definitions as in the tutorial linked above, but there is also the @compact macro. What exactly does this macro do?
The definition of Dense calls Lux._vec and Lux._getproperty. How are these different from vec and getproperty?
What is the rationale for using F1 and F2 as type parameters in the tutorial?

```julia
struct Linear{F1, F2} <: LuxCore.AbstractExplicitLayer
    in_dims::Int
    out_dims::Int
    init_weight::F1
    init_bias::F2
end
```

avikpal · July 31, 2024, 2:42am

The weight matrix in my layer is user-definable and may need to be updated. Should the matrix be defined in the struct definition of the layer or in the parameters?

In the parameters. Model structs should never contain mutable elements. See Migrating from Flux to Lux | Lux.jl Docs.
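A minimal sketch of how the masked layer from the question could follow this advice. The name `MaskedDense`, the `init_mask` field, and the choice to keep the mask in the layer state (on the assumption that it is not trained) are illustrative and not from the thread. The struct holds only sizes and initializer functions, while every array is created by `initialparameters` or `initialstates`. The `LayerBase` alias is there only because newer Lux releases rename `AbstractExplicitLayer` to `AbstractLuxLayer`.

```julia
using Lux, Random

# Assumption: pick whichever abstract layer type the installed Lux provides.
const LayerBase = isdefined(Lux, :AbstractLuxLayer) ?
                  Lux.AbstractLuxLayer : Lux.AbstractExplicitLayer

# Hypothetical layer; the struct stores no arrays, only sizes and functions.
struct MaskedDense{F, F1, F2, F3} <: LayerBase
    activation::F
    in_dims::Int
    out_dims::Int
    init_weight::F1
    init_bias::F2
    init_mask::F3   # assumption: a function (rng, dims...) -> mask array
end

# Trainable arrays go into the parameters.
function Lux.initialparameters(rng::AbstractRNG, l::MaskedDense)
    return (weight = l.init_weight(rng, l.out_dims, l.in_dims),
            bias = l.init_bias(rng, l.out_dims, 1))
end

# Assumption: the mask is fixed during training, so it is kept in the state.
function Lux.initialstates(rng::AbstractRNG, l::MaskedDense)
    return (mask = l.init_mask(rng, l.out_dims, l.in_dims),)
end

# h(x) = g(b + (W ⊙ M) x), applied column-wise to a batch x.
function (l::MaskedDense)(x::AbstractMatrix, ps, st::NamedTuple)
    y = l.activation.((ps.weight .* st.mask) * x .+ ps.bias)
    return y, st
end

rng = Random.default_rng()
layer = MaskedDense(tanh, 3, 2,
                    (rng, dims...) -> randn(rng, Float32, dims...),
                    (rng, dims...) -> zeros(Float32, dims...),
                    (rng, dims...) -> Float32[1 0 1; 0 1 1])
ps, st = Lux.setup(rng, layer)
x = rand(rng, Float32, 3, 4)
y, _ = layer(x, ps, st)
```

Since the state is a plain NamedTuple, a different mask can be swapped in later by building a new state such as `(mask = new_mask,)` and passing it to the call, without touching the layer struct.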

My implementation uses explicit function definitions as in the tutorial linked above, but there is also the @compact macro. What exactly does this macro do?

See Utilities | Lux.jl Docs for a detailed description. Essentially, it automatically writes all the boilerplate code needed for state handling and for defining initialparameters / initialstates. See this tutorial for an example showcasing both kinds of layers.

The definition of Dense calls Lux._vec and Lux._getproperty. How are these different from vec and getproperty?

Mostly an implementation detail. _getproperty takes Val as input and returns nothing if no such field is present in the struct. _vec allows nothing as input.
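The behavior described here can be sketched with two stand-in functions. `maybe_vec` and `maybe_getproperty` are hypothetical names written for illustration; they are not Lux's actual source, only an approximation of what the reply describes.

```julia
# Hypothetical stand-ins for illustration; not Lux's internal definitions.

# Like vec, but passes `nothing` through instead of throwing a MethodError.
maybe_vec(x::AbstractArray) = vec(x)
maybe_vec(::Nothing) = nothing

# Like getproperty, but takes the name as a Val and returns `nothing`
# when the property is missing.
function maybe_getproperty(x, ::Val{name}) where {name}
    return hasproperty(x, name) ? getproperty(x, name) : nothing
end

ps = (weight = ones(2, 3),)                 # parameters without a bias
w = maybe_getproperty(ps, Val(:weight))     # the 2×3 weight matrix
b = maybe_getproperty(ps, Val(:bias))       # nothing, no error
flat_b = maybe_vec(b)                       # still nothing
flat_w = maybe_vec(w)                       # 6-element vector
```

This kind of helper lets a layer handle an optional bias without branching at every call site.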

What is the rationale for using F1 and F2 as type parameters in the tutorial?

Just to specialize on the functions.
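A small plain-Julia illustration of what specializing on the functions means. The struct names `Specialized` and `Unspecialized` are made up for this sketch. Every Julia function has its own concrete type, so a type parameter records exactly which initializer is stored, while a field annotated as `Function` only has an abstract type.

```julia
# Field types are parameters, so each instance records the exact function types.
struct Specialized{F1, F2}
    init_weight::F1
    init_bias::F2
end

# Field types are the abstract type Function.
struct Unspecialized
    init_weight::Function
    init_bias::Function
end

s = Specialized(randn, zeros)
u = Unspecialized(randn, zeros)

# The parametric struct carries the concrete function types in its own type.
s_type = typeof(s)   # Specialized{typeof(randn), typeof(zeros)}

concrete_field = isconcretetype(fieldtype(typeof(s), :init_weight))
abstract_field = isconcretetype(fieldtype(Unspecialized, :init_weight))
```

Here `concrete_field` is `true` and `abstract_field` is `false`. With concrete field types the compiler can generate code specific to the stored initializers instead of dispatching at run time.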
