Getting rid of memory allocations in nested functions

p-gw · January 31, 2023, 9:09am

Hi,

I am trying to minimize memory allocations of a function in my package.
However, it displays some mysterious behaviour, at least to me.
Since there are many experts here, maybe somebody can shed some light on what’s going on.

Here is a MWE of the problem:

I have an outer! function that modifies a vector x by calling an inner function on each element of x. In addition, the user passes a function f as an argument, which outer! forwards to inner:

function outer!(x, f=identity)
    for i in eachindex(x)
        x[i] += inner(f)
    end
end

function inner(f)
    y = zero(Float64)
    for i in 1:10
        y += f(1)
    end
    return y
end

If I call this version it results in a lot of allocations:

x = zeros(1000)
@benchmark outer!($x)

BenchmarkTools.Trial: 10000 samples with 1 evaluation.
 Range (min … max):  63.021 μs …   9.655 ms  ┊ GC (min … max): 0.00% … 99.07%
 Time  (median):     82.074 μs               ┊ GC (median):    0.00%
 Time  (mean ± σ):   84.586 μs ± 207.426 μs  ┊ GC (mean ± σ):  5.42% ±  2.21%

  ▄▂█▆▃▄▃▃      ▄▆▆▆▅▅▄▃▃▂▂▁▁▁                                 ▂
  ██████████▇▇▇████████████████████▇▇▇▇▇▇▇▇▆▆▄▆▅▃▅▄▅▂▅▄▄▂▄▃▅▄▄ █
  63 μs         Histogram: log(frequency) by time       138 μs <

 Memory estimate: 54.52 KiB, allocs estimate: 3489.

Note that this only happens if inner has the for loop. If I define inner without the loop, all allocations vanish.

However, if I just evaluate f in outer! once, the allocations also vanish:

function outer!(x, f=identity)
    f(1)  # evaluate f once
    for i in eachindex(x)
        x[i] += inner(f)
    end
end

@benchmark outer!($x)

BenchmarkTools.Trial: 10000 samples with 1000 evaluations.
 Range (min … max):  71.814 ns …  5.371 μs  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     72.181 ns              ┊ GC (median):    0.00%
 Time  (mean ± σ):   76.442 ns ± 67.196 ns  ┊ GC (mean ± σ):  0.00% ± 0.00%

  █▄▂▂▁  ▁                                                    ▁
  ██████████████▇▇▇▆▇▆▆▇▆▆▅▆▅▆▆▅▆▅▅▄▅▄▄▅▅▅▅▆▅▄▄▄▄▅▄▄▄▄▃▄▄▄▁▄▄ █
  71.8 ns      Histogram: log(frequency) by time       135 ns <

 Memory estimate: 0 bytes, allocs estimate: 0.

I am trying to understand what leads to the memory allocations in this case and why evaluating f a single time gets rid of them. I’m fine with leaving f(1) in my code, but it seems like a rather hacky solution and I’m sure there is a better way of solving this problem.

Hopefully somebody can help me here.

ademonts · January 31, 2023, 9:22am

When passing functions as arguments, it’s better to add a type parameter to the signature so that the method specializes on whatever function f is:

function outer2!(x; f::F=identity) where F
    for i in eachindex(x)
        x[i] += inner(f)
    end
end

julia> @btime outer!($x)
  45.900 μs (3489 allocations: 54.52 KiB)

julia> @btime outer2!($x)
  52.487 ns (0 allocations: 0 bytes)
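
For completeness, the same type-parameter trick should also work if f stays a positional argument, as in the original post. Here is a minimal sketch. The names outer3! and measure_allocs are made up for illustration, and the allocations are measured with Base's @allocated instead of BenchmarkTools:

```julia
# Same inner as in the original post.
function inner(f)
    y = zero(Float64)
    for i in 1:10
        y += f(1)
    end
    return y
end

# Hypothetical variant: f remains positional, and the type parameter F
# asks Julia to compile a specialized method for each concrete function type.
function outer3!(x, f::F=identity) where {F}
    for i in eachindex(x)
        x[i] += inner(f)
    end
    return x
end

# Hypothetical helper: measuring inside a function keeps the
# non-constant global x out of the measurement.
measure_allocs(x) = @allocated outer3!(x)

x = zeros(1000)
measure_allocs(x)           # warm-up call, so compilation is not measured
allocs = measure_allocs(x)  # bytes allocated by the second call
println("bytes allocated: ", allocs)
```

If the specialization kicks in as intended, the reported number should drop to zero, but this depends on the Julia version, so it is worth checking on your own setup.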

p-gw · January 31, 2023, 9:29am

Well, that was a fast solution… Many thanks!

j-fu · January 31, 2023, 9:31am

See also the (admittedly slightly cryptic) remarks in the Julia Performance Tips.
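
As far as I understand those remarks, Julia heuristically avoids specializing on Function arguments that are only passed through to another function, but it does specialize when the argument is called inside the method or when the signature carries a type parameter. A rough sketch of the three cases from this thread follows. The names passthrough, called_once, specialized and apply_n are invented for illustration:

```julia
# Does the actual work; f is called here, so this method specializes on f.
function apply_n(f, n)
    s = 0.0
    for i in 1:n
        s += f(i)
    end
    return s
end

# Case 1: f is only forwarded. Julia may compile this method for a generic
# Function, so the call to apply_n can end up as a dynamic dispatch.
passthrough(f, n) = apply_n(f, n)

# Case 2: f is called once inside the method, which counts as "used",
# so the heuristic no longer applies (the f(1) workaround from the first post).
function called_once(f, n)
    f(1)
    return apply_n(f, n)
end

# Case 3: the type parameter F requests specialization explicitly.
specialized(f::F, n) where {F} = apply_n(f, n)

println(passthrough(identity, 10))
println(called_once(identity, 10))
println(specialized(identity, 10))
```

All three return the same value. They differ only in how the compiler treats f, which is why the allocation counts in the benchmarks above can differ while the results do not.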
