I need to apply different activation functions to different outputs of a Flux layer. This requires a custom layer. I can modify an existing custom layer so that its only job is to apply a per-output activation function as shown:
function (a::Nonneg)(x::AbstractArray)
    x_out = x # or a.W * x .+ a.b
    # a.σ comes from the layer's struct; softplus is hard-coded to output 1
    return vcat(softplus.(x_out[1:1, :]), a.σ.(x_out[2:2, :]), a.σ.(x_out[3:3, :]))
end
Here, σ is the activation function that's part of the layer's struct (hence a.σ), and softplus is hard-coded to a particular output to enforce non-negativity. But I would really rather not hard-code any of the functions, or the (multiple) indices at which they are applied. What's a good approach to this?
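For concreteness, here is a minimal sketch of the kind of thing I'm imagining: a layer that just stores one activation function per output row and applies them in its forward pass. (MultiAct and fs are names I made up for illustration; they're not from Flux.)

using Flux  # provides softplus and σ

# Hypothetical layer: fs holds one activation function per output row.
struct MultiAct{F<:Tuple}
    fs::F
end

# Apply fs[i] to row i of x; selectdim keeps any trailing (batch) dimensions,
# so this should work for matrices and, I think, higher-dimensional arrays.
function (m::MultiAct)(x::AbstractArray)
    @assert size(x, 1) == length(m.fs) "need one activation per output row"
    rows = [f.(selectdim(x, 1, i:i)) for (i, f) in enumerate(m.fs)]
    return reduce(vcat, rows)
end

I used a tuple rather than an array for fs mainly for type stability, but I'm not sure that's the right call.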
There was a similar topic a few months back that used a struct to apply an activation function only to a (hard-coded) subset of the outputs, but it reportedly didn't work for arrays of dimension larger than one (as I've confirmed). There might be a simple fix. If so, then maybe one can replace the struct's activationfn with an array of activation functions?
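If there is such a fix, then usage with the sketch above might look something like this (again, just what I have in mind, not tested code):

m = MultiAct((softplus, σ, σ))   # one activation per output, nothing hard-coded in the method
y2 = m(randn(Float32, 3, 5))     # 2-d input: 3 outputs × batch of 5
y3 = m(randn(Float32, 3, 5, 7))  # and, if selectdim/vcat behave as I expect, 3-d input too

Does an approach along these lines seem reasonable, or is there a more idiomatic way?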