Making @benchmark outputs statistically meaningful and actionable

Profile-guided optimization (PGO) for Julia user code seems relevant here. Were Julia to gain profile-guided recompilation as an option for Julia packages, as mentioned here, BenchmarkTools could presumably stay simple, because LLVM would take care of the complexity. That is, BenchmarkTools would just need to recompile with PGO, which should (once enough PGO-based optimizations are implemented) make some of the issues mentioned here irrelevant.

No, nice doesn’t cut it. To prevent interruption you’d need to run Julia with real-time priority, perhaps using chrt. You’d also probably need to fiddle with kernel options for real-time scheduling (by default Linux reserves some CPU time for non-realtime processes, so you’d need to turn that off), and possibly with other scheduler options. See this for a start: Real-Time group scheduling — The Linux Kernel documentation
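For example, something like this (a sketch, assuming Linux with chrt from util-linux; bench.jl is a placeholder script, and both commands need root or CAP_SYS_NICE):

```julia
# Sketch: remove the CPU-time reserve Linux keeps for non-realtime tasks
# (kernel.sched_rt_runtime_us defaults to 950000 of every 1000000 us;
# -1 lifts the limit), then launch the benchmark under SCHED_FIFO.
run(`sudo sysctl -w kernel.sched_rt_runtime_us=-1`)
run(`sudo chrt --fifo 50 $(Base.julia_cmd()) bench.jl`)
```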

You might also want to fiddle with kernel and/or CPU options that control the power-saving/performance trade-offs, such as the frequency governor and turbo boost.
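For instance (a sketch, assuming Linux with the cpupower utility installed and an intel_pstate-driven CPU; the sysfs path differs for other drivers):

```julia
# Sketch: pin the frequency governor to "performance" and disable turbo
# boost, so the clock frequency stays roughly constant during runs.
run(`sudo cpupower frequency-set -g performance`)
run(pipeline(`echo 1`, `sudo tee /sys/devices/system/cpu/intel_pstate/no_turbo`))
```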

You might also want to reserve some cores for Julia.
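A sketch of that (assuming Linux; taskset only pins the process, so for real exclusivity you’d also boot with something like isolcpus=2,3 on the kernel command line):

```julia
# Sketch: restrict the benchmark process to cores 2 and 3 with taskset(1).
# Combine with isolcpus at boot so nothing else runs on those cores.
run(`taskset -c 2,3 $(Base.julia_cmd()) bench.jl`)
```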

If you go down this route, make sure not to damage your system. EDIT: if you forbid the kernel from preempting your process, I think you need to make that process yield to the kernel of its own volition, by doing explicit sleeps every so often. Not sure whether that’s doable without modifying BenchmarkTools, and maybe even the Julia runtime.
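A rough idea of what that voluntary yielding could look like in a hand-rolled timing loop (purely hypothetical; this is not how BenchmarkTools measures):

```julia
# Hypothetical timing loop for a process running under SCHED_FIFO:
# time small batches, then sleep briefly so the kernel and other
# tasks get a chance to run between batches.
function timed_batches(f; batches = 100, batchsize = 10_000)
    per_call = Float64[]
    for _ in 1:batches
        t0 = time_ns()
        for _ in 1:batchsize
            f()
        end
        push!(per_call, (time_ns() - t0) / batchsize)
        sleep(0.001)  # voluntary yield to the kernel
    end
    return per_call
end
```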

There are tons of other things that can affect performance. For example, plugging in a laptop’s charger usually improves performance compared to running on battery power, because more power is available to the CPU.

I’m aware of this one. All my tests were done with my laptop plugged in.

A couple of updates to this thread:

  1. Somebody replied to my Stabilizer issue, saying:

“there are problems with stabilizer that I don’t know how to solve with reasonable effort. Check other open issues before proceeding further ahead.”

This suggests that Stabilizer may not be a good solution to randomizing memory layout.

  2. Prof. Berger replied to my email, and shared this document on the design of a benchmark suite for evaluating a couple of web-stack libraries. It’s not particularly relevant or helpful for us, except that it lists the following requirements for mitigating measurement bias caused by memory layout:
    a) Shuffling allocator
    b) Stack padding
    c) Shuffling linker
    d) Randomizing environment variables (see the sketch after this list)
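Of these, (d) is the only one that’s easy to approximate from the Julia side without toolchain changes. A minimal sketch, assuming BenchmarkTools is installed (the variable name PAD, the padding sizes, and the benchmark expression are arbitrary placeholders):

```julia
# Sketch: rerun a benchmark in fresh Julia processes whose environments
# carry a random-length dummy variable, shifting the initial
# environment/stack layout between runs (requirement (d) above).
script = "using BenchmarkTools; print(minimum(@benchmark sin(0.5)).time)"
bench(pad) = addenv(`$(Base.julia_cmd()) -e $script`, "PAD" => pad)
times = [parse(Float64, read(bench("x" ^ rand(0:4096)), String)) for _ in 1:10]
```

The spread of times across these runs then gives a rough sense of how much layout alone moves the measurement.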

There were also links to some helpful academic papers at the bottom of the page.

I don’t know enough about Julia’s compilation process to know where and how these randomizations could be implemented.
