Identical functions, repeated benchmarks show systematic differences

Having read through this thread again, I’ve noticed that in basically all benchmarks posted here the setup looked something like this:

list = rand(n)
# ...
@benchmark [...] setup=(x=copy(list)) # or deepcopy(list)

This will skew the benchmark heavily: it is always the same list being sorted, so the branch predictor in your CPU will learn the patterns in that data. After a few repetitions the comparison branches become highly predictable, and the measured times end up optimistically low.
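One way to avoid this is to regenerate the input inside setup and force one evaluation per sample with evals=1, so every measurement sees fresh random data. A minimal sketch follows; sort! and n are hypothetical stand-ins for the elided expression and input size above:

using BenchmarkTools

n = 10_000  # hypothetical input size

# setup runs before each sample; with evals=1 every evaluation gets its own
# freshly generated input, so the branch predictor cannot learn a fixed data
# pattern across repetitions. sort! stands in for the benchmarked code.
@benchmark sort!(x) setup=(x = rand($n)) evals=1

evals=1 also matters for mutating functions like sort!: without it, later evaluations within a sample would operate on already-sorted data, which skews the measurement in a different way.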

For that reason alone, I don’t think the benchmarks posted here support the conclusions drawn (not to mention that they still don’t account for the code being placed at different memory locations, which can also influence timings). Statistics computed over statistics are meaningless if the underlying measurements are wonky in the first place.
