Reusing exact same layer and parameters

crinders · May 22, 2021, 2:17am

I want to reuse the exact same layer in a network. But I can’t figure out whether my naive approach will do that. My toy architecture is

using Flux
D1 = Dense(2,2)
D2 = Dense(2,2)
NaiveReuse = Chain(D1, Parallel(vcat,Chain(D2,Parallel(vcat, D1, identity)), identity))

The output of params(NaiveReuse) is

Params([Float32[-1.1226765 0.9502689; 0.6875402 0.4517343], Float32[0.0, 0.0], Float32[-0.18272986 -0.16167739; 0.46781456 1.2025808], Float32[0.0, 0.0]])

but I’m having trouble interpreting that. It looks like only two matrices are being stored. Am I correct to assume that the parameters for D1 will be reused and properly updated by Zygote during training? If not, how would I go about that?

ablaom · May 22, 2021, 3:56am

The Dense and other layer constructors use Random.GLOBAL_RNG in initialisation by default. So your naive approach won’t work. There is a user interface point for specifying the RNG. This may be in the docs somewhere, but see https://github.com/FluxML/Flux.jl/pull/1292 .

Or, you can just do D2 = deepcopy(D1), unless you want to avoid deep copies for some memory/performance reason.

ablaom · May 22, 2021, 8:11am

Or perhaps I misunderstood. Do you want the the weights to be coupled?

crinders · May 22, 2021, 2:27pm

Anthony,

Yes, I want the weights to be coupled.
As in
\sigma(D1+\sigma(D1+D2)) instead of \sigma(D1+\sigma(D1^\prime+D2)) if D1 and D2 were variables,i.e., I wouldn’t want the former to be implicitly changed to the latter.

Topic		Replies	Views
Flux.params does not recognize parameters with `x -> layer(x)` syntax Machine Learning flux	4	1160	September 18, 2020
Flux: Resample model with different initializations Machine Learning flux	9	929	July 9, 2021
How to initialize a subset of params of a NN New to Julia flux	2	796	January 16, 2020
Flux: How to create a custom multi-layer model with some parameters shared across layers? Machine Learning question	2	732	July 7, 2021
Re-using layers in Flux.jl: how to train a multi-layer model sharing a common LSTM layer and separate dense layers? Machine Learning	1	618	June 15, 2022

Reusing exact same layer and parameters

Related topics