Using Flux I would like to apply an activation function to all but the first output unit. More precisely, I would like to constrain all but the first unit to be non-negative. How can I achieve that? Thanks.

Since in Flux, all layers and activation functions are just functions, this is straightforward to implement.

Let’s say we’re working with a `Dense` layer with a `relu` activation. Usually you would construct your layer as `Dense(nin, nout, relu)`, which applies `relu` to every output of the `Dense` layer.

We can write a custom activation layer that applies a regular activation function to all but the first output as follows:

```
struct PartialActivation
    activationfn
end

(pa::PartialActivation)(xs) = map((i, x) -> i == 1 ? x : pa.activationfn(x), eachindex(xs), xs)
```

This will apply the `activationfn` to all but the first element.
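As a quick sanity check on a vector, here is a self-contained sketch (with a stand-in `relu` so it runs without loading Flux; in a real model you would use Flux’s `relu`):

```
relu(x) = max(zero(x), x)  # stand-in for Flux's relu, just for this sketch

struct PartialActivation
    activationfn
end

(pa::PartialActivation)(xs) = map((i, x) -> i == 1 ? x : pa.activationfn(x), eachindex(xs), xs)

pa = PartialActivation(relu)
pa([-1.0, -2.0, 3.0])  # -> [-1.0, 0.0, 3.0]: first element untouched, the rest clamped at zero
```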

To use it in a model, you will have to switch from something that probably looks like

```
Chain(..., Dense(10, 10, relu), ...)
```

to

```
Chain(..., Dense(10, 10), PartialActivation(relu), ...)
```

Hope this helps and feel free to ask questions!

Thanks, but it does not seem to work for arrays with more than one dimension. In my case Flux expects 2-dimensional arrays (number of outputs × batch size), so the function should be applied to all but the first row.

I am also considering circumventing the problem by adding a sufficiently large number to the targets/responses so that the transformation can be applied to all output variables.

Since my previous posting three years ago, I have avoided the problem by adding a sufficiently large number to the targets whenever needed, but I would now like to keep the targets as they are.

Is there a better way than the following?

```
s(x::AbstractArray{Float32,2}) = vcat(x[[1], :], softplus.(x[2:end, :]))
Chain(...., Dense(10, 10), s)
```

Looks pretty clean to me. You could save an allocation with `x[1:1, :]` instead of `x[[1], :]`, and by converting the indexing to views (though I’m not sure about GPU compatibility there), but that’s very minor.
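For reference, a sketch of that view-based variant (with a stand-in scalar `softplus` so it runs without Flux; in practice you would broadcast NNlib’s `softplus`):

```
softplus(x) = log1p(exp(x))  # stand-in for NNlib.softplus, just for this sketch

# First row passes through unchanged; softplus is broadcast over the remaining rows.
# The views avoid copying the two slices; `vcat` still materialises the final array.
s(x::AbstractMatrix{Float32}) = vcat(view(x, 1:1, :), softplus.(view(x, 2:end, :)))

s(Float32[1 2; 0 0])  # first row stays [1 2]; second row becomes softplus(0) ≈ 0.693 in both columns
```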