Allocating OnlineStats on the stack

mattwigway · February 27, 2024, 4:19pm

I’m using OnlineStats to calculate a likelihood function, so I’m creating and destroying a LogSumExp for each row. LogSumExps, like all online stats, are mutable structs, so heap-allocated in general, but I thought that since the LogSumExp is not visible outside the function the optimizer would get rid of those allocations as discussed here. However, the code in the MWE below prints 0.000000 seconds (1 allocation: 112 bytes). Since this happens once per row, it’s a significant performance penalty, any suggestions?

MWE:

using OnlineStats

function logsumexp(vals)
    result = LogSumExp()

    for val in vals
        fit!(result, val)
    end

    x = value(result)

    return x
end

function main()
    logsumexp([1, 2, 3, 4, 5, 6])
    @time logsumexp([1, 2, 3, 4, 5, 6])
end

main()

Actual example code: DiscreteChoiceModels.jl/src/mnl.jl at main · mattwigway/DiscreteChoiceModels.jl · GitHub

Krastanov · February 27, 2024, 4:25pm

If I understand correctly, you are running your logsumexp multiple times inside of a loop, thus being penalized by its allocations.

The options I can think of are:

Rework your logsumexp to take a pre-allocated buffer LogSumExp and zero it out each time at the start of the function.
Try this experimental bumper allocator GitHub - MasonProtter/Bumper.jl: Bring Your Own Stack (it will probably require some modification of how logsumexp is set up).

I do not have domain knowledge here, so I might be missing a more obvious domain-specific solution.

mattwigway · February 27, 2024, 4:26pm

So, in my example code, I just realized the allocation is coming from creating the Vector. That’s not the case in the actual code, I will work on making a better MWE.

Topic		Replies	Views
Prevent huge number of allocations mutating columns of arrays Performance	15	546	September 19, 2023
Why is the function evaluation with more allocations faster? Performance	6	821	April 11, 2021
Controlling memory allocation using external struct as placeholder General Usage question	5	869	March 11, 2017
Allocations in advent of code Performance memory-allocation	9	594	December 9, 2021
Weird allocations General Usage question , memory-allocation	2	314	September 25, 2020

Allocating OnlineStats on the stack

Related topics