Running small functions bundled in an outer function taking twice the time of running them separately

Tamas_Papp · April 2, 2020, 9:12am

I suspect it has to do with some of the data fitting into a CPU cache with the individual benchmarks, but only a higher level cache for the combined ones. I can replicate the phenomenon on 1.4, with multiple runs giving somewhat inconsistent timings.

In any case, making x and v 10x larger resolves the inconsistency for me (you may have to increase it more if you have a recent desktop CPU, mine is a puny laptop CPU with little cache) eg

julia> @btime toPolar!($x)
  4.520 ms (0 allocations: 0 bytes)

julia> @btime toCartesian!($x)
  1.284 ms (0 allocations: 0 bytes)

julia> @btime move!($x, $v, $T)
  148.501 μs (0 allocations: 0 bytes)

julia> @btime outerFunction!($x, $v, $T)
  5.503 ms (0 allocations: 0 bytes)

what the relevant benchmark is depends on your data size I guess.

Also, cf

Topic		Replies	Views
Very strange performance issue of a simple code General Usage performance	4	581	June 16, 2019
10x slowdown when passing function as argument Performance	15	2270	June 4, 2020
Surprising runtime behaviour when wrapping functions Performance question	4	415	September 14, 2021
Apparent mismatch of run times during summation Performance question	5	377	February 2, 2022
Performance: tuple unpacking General Usage	8	1964	October 5, 2017

Running small functions bundled in an outer function taking twice the time of running them separately

Related topics