(Flux/Lux) Custom Layers as Functions of Other Layers

Hey everyone. For some experiments, I want to set up a network where some of the weights are not trained, but instead are given as functions of other (trained) weights in my network.

For a simple example, take the NN in the image below: I would like to be able to force W3 = 2*W1 and W4 = 3*W2, and then train W1 and W2 normally.

[image: diagram of a small network with weights W1, W2, W3, W4]

Is this possible within the SciML environment? I’m quite new to Julia and SciML as a whole, so I honestly wouldn’t even know how to begin. The Flux page on Custom Layers, opaque as I find it to be, doesn’t seem to consider this possibility.

I think that in this case re-using the same layer objects (each one appearing twice) could work?
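Something like this, I mean (a minimal sketch; the sizes and activations are just placeholders). Flux collects parameters by object identity, so the two occurrences of shared below refer to the very same weight matrix:

using Flux

shared = Dense(10 => 10, relu)                    # one layer object
m = Chain(shared, Dense(10 => 10, tanh), shared)  # the same object appears twice

m(rand32(10))  # runs; gradients for shared's weights accumulate across both uses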


While that could possibly work, it does not seem to generalize to cases where the mapping is not just the identity, which is what actually interests me. I’ll edit the question to better reflect that!

Is your diagram doing something like this?

using Flux

struct Diamond{T}  # store two weight matrices
    W1::T
    W2::T
end

Flux.@functor Diamond  # make sure Flux can see them

function (d::Diamond)(A)  # write out the forward pass
    B = d.W1 * A
    C = d.W2 * A
    D1 = 2 * d.W1 * B  # re-uses W1 scaled by 2, i.e. W3 = 2*W1 is never stored separately
    D2 = 2 * d.W2 * C  # likewise for W2
    D1 + D2  # assume D is the sum of the two branches
end

m = Chain(Dense(10=>10, relu), Diamond(randn32(10,10), randn32(10,10)))

m(rand32(10))  # it runs

It would be fine to have, say, D1 = 2 * (d.W1 .^ 2) * B, or some other function of W1, before using it a second time.
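Continuing from the snippet above, a quick sketch of how you might check that only W1 and W2 carry gradients (the scaled copies are re-derived on every forward pass); the loss here is just an arbitrary placeholder:

x = rand32(10)
grads = Flux.gradient(m) do model
    sum(model(x))  # placeholder loss, just to get a scalar
end
grads[1].layers[2].W1  # gradient w.r.t. W1, accumulated over both of its uses in the forward pass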


I’m not sure what it would look like in Flux, but in Lux you could just pass identical or modified versions of the parameter objects to both layers; see the sketch below.
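Roughly like this, assuming a recent Lux version; the layer sizes, the factor of 2, and the loss are placeholders I made up. Since Lux keeps parameters in an explicit NamedTuple, you can rebuild that tuple inside the loss so that the second layer’s weight is a function of the first layer’s, and gradients then flow back only to the stored parameters:

using Lux, Random, Zygote

rng = Random.default_rng()
model = Chain(Dense(10 => 10, relu), Dense(10 => 10))
ps, st = Lux.setup(rng, model)  # ps is a plain NamedTuple: (layer_1 = ..., layer_2 = ...)
x = rand(Float32, 10)

function loss(ps_free)
    # layer_2's weight is derived from layer_1's on the fly; it is never stored or trained itself
    tied = (layer_1 = ps_free.layer_1,
            layer_2 = (weight = 2 .* ps_free.layer_1.weight, bias = ps_free.layer_2.bias))
    y, _ = model(x, tied, st)
    return sum(abs2, y)  # placeholder loss
end

grads = Zygote.gradient(loss, ps)[1]  # grads.layer_1.weight accumulates both uses (direct and through the 2*W1-style tie)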