What are "transversal" layers in NNs called?

Hello, is there a known concept in neural networks of a “transversal” or “cross-section” layer? I can’t find anything relevant using these two words…
Its defining characteristic is that its activation function takes as input the entire output of the previous layer, and the layer’s output size is given by the output dimension of that activation function.

I implemented a generic one in my own NN library, to use for classification (with the softmax activation function) or for pooling the neurons of the previous layer (max, avg, …). A sketch of the idea is below.
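For concreteness, the idea is roughly this (a hypothetical sketch in Julia; the names are illustrative, not my actual implementation): a “weightless” layer whose only job is to apply a whole-vector function to the previous layer’s output.

# Hypothetical sketch, not the actual library code.
struct WholeVectorLayer{F}
    f::F                        # any function mapping R^N -> R^M
end

# Forward pass: `f` sees the entire output of the previous layer at once,
# and the layer's output size is whatever `f` returns.
(l::WholeVectorLayer)(x::AbstractVector) = l.f(x)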

However I can’t find how such kind of layer is known in the literature on in standard NN libraries as Flux or KNET…

What is the name of your layer in your library?

I’ll try to guess.

  1. Inception or one derived from it?
  2. The layer used in DenseNet?

Maybe one of its elements corresponds to what other libraries, like Keras, call a Concatenate layer.

I called it “VectorFunctionLayer”.
Not sure… from what I understood, that kind of layer merges different layers, like two separate NNs merging into one, while mine stays within the simple single-chain model.
Another difference is that that layer works on n-dimensional tensors, while I remain in 1D.
Finally, in Keras each merging operation is a different kind of layer, while here in VectorFunctionLayer the layer can be associated with any R^N → R^M activation function, like softmax or pool1d.
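To make “any R^N → R^M activation function” concrete, these are the kinds of plain functions I plug in (hand-written sketches, not the library versions):

# R^N -> R^N: softmax over the whole previous-layer output (classification head)
softmax_fn(x) = exp.(x .- maximum(x)) ./ sum(exp.(x .- maximum(x)))

# R^N -> R^(N÷2): 1D max pooling with window 2 and stride 2
pool1d_max(x) = [maximum(x[i:i+1]) for i in 1:2:length(x)-1]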


You don’t want to act on multiple layers, but just want to apply a vector function?
Or am I making a mistake and misunderstanding?

In Flux you can simply do this (and those layers do not have a specific name):

model = Chain(
    Dense(28^2, 200), my_vector_function, 
    Dense(200, 100), my_vector_function,    # change 200 to the output size of my_vector_function
    Dense(100, 10), softmax
)
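For instance, picking a size-preserving function for my_vector_function (just an illustration; any vector-to-vector function you define works), the whole chain runs end to end:

using Flux

# Any R^N -> R^M function works here; this one just rescales and keeps the size.
my_vector_function(x) = x ./ (sum(abs.(x)) + eps(eltype(x)))

model = Chain(
    Dense(28^2, 200), my_vector_function,
    Dense(200, 100), my_vector_function,
    Dense(100, 10), softmax
)

y = model(rand(Float32, 28^2))    # 10-element vector of class probabilities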

It seems simple, so I can’t tell if I’m missing something in the translation from English. In case I am making a mistake, I apologize.


Oh, I see… basically in Flux you can chain “Layer” objects (like Dense(200, 100)) directly with plain functions (and perhaps other kinds of objects).
In BetaML I can chain only “Layers”, hence I introduced a “weightless” layer to “support” the function; in the end it is the same…

Thank you
/Antonello

Yes, supporting plain old Julia functions as layers is an explicit design goal of Flux. Layer structs should only be required when one wants to keep track of internal parameters/state that should have a gradient.
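For example, a minimal custom layer with its own trainable parameter might look like this (a rough sketch; the exact macro and parameter-collection APIs vary across Flux versions):

using Flux

# A layer with its own trainable parameter: a learned elementwise scaling.
struct ScaleLayer{T}
    s::T                          # trainable scale vector
end
ScaleLayer(n::Integer) = ScaleLayer(ones(Float32, n))

(l::ScaleLayer)(x) = l.s .* x     # forward pass

Flux.@functor ScaleLayer          # let Flux track `s` as a trainable parameter

layer = ScaleLayer(10)
ps = Flux.params(layer)           # contains the 10-element scale vector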
