Hi,
Can I use a different activation function for each output of my network?
using Lux
net = Chain(Dense(3, 10, tanh), Dense(10, 3, tanh))
I know that output 2 can only be positive, but outputs 1 and 3 can be negative. Is it possible to define a particular activation function for each output of the network?

outputs 1 and 3 → tanh
output 2 → relu

If so, how can I do that?
You can probably do this with a custom layer.
struct ChannelActivations{T} <: Lux.AbstractLuxLayer  # AbstractExplicitLayer on Lux < 1.0
    activations::T
end
# Convenience constructor: ChannelActivations(tanh, relu, tanh) stores the functions as a tuple.
ChannelActivations(args...) = ChannelActivations(args)
function (a::ChannelActivations)(x::AbstractMatrix, ps, st::NamedTuple)
    # Apply the i-th activation elementwise to the i-th row (output channel).
    # Each result is reshaped to 1×N so vcat rebuilds a matrix; without the
    # reshape, vcat of plain vectors would flatten the output into one long vector.
    y = reduce(vcat, [reshape(f.(v), 1, :) for (f, v) in zip(a.activations, eachrow(x))])
    return y, st
end
# The layer is stateless and has no trainable parameters.
Lux.parameterlength(::ChannelActivations) = 0
Lux.statelength(::ChannelActivations) = 0
# No activation on the second Dense leaves it linear; the per-output
# activations are applied by the ChannelActivations layer that follows.
net = Chain(Dense(3, 10, tanh), Dense(10, 3), ChannelActivations(tanh, relu, tanh))
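For completeness, here is a minimal usage sketch (assuming Lux ≥ 1.0, where relu is re-exported from NNlib) that sets up the network, runs a batch through it, and checks that the relu output is nonnegative while the tanh outputs stay in [-1, 1]:

using Lux, Random

rng = Random.default_rng()
ps, st = Lux.setup(rng, net)

x = randn(rng, Float32, 3, 16)          # batch of 16 inputs
y, _ = net(x, ps, st)

@assert size(y) == (3, 16)
@assert all(y[2, :] .>= 0)              # output 2 went through relu
@assert all(abs.(y[[1, 3], :]) .<= 1)   # outputs 1 and 3 went through tanh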