I see noticeably higher latency in 1.11-rc3 than in 1.10.5. When working locally it isn’t noticeable, but when using Julia through SSH it is quite pronounced.
More info: it’s worst right after starting the REPL and gets smoother over time, as if functions were compiling in the background.
Starting Julia with --trace-compile=stderr prints a message for each method that gets compiled; replacing stderr with a file name writes that output to the file instead. Could you try comparing what gets compiled on REPL startup with and without SSH?
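Something like this, for example (the file names are just placeholders):

```
# locally
julia --trace-compile=compiles_local.jl

# over SSH
julia --trace-compile=compiles_ssh.jl
```

Diffing the two files afterwards should show whether the SSH session triggers extra compilation.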
Julia takes a bit longer to start, and there’s a lag when I start typing and when I press enter. It’s also present when printing to stdout.
There’s also a substantial slowdown, especially when precompiling packages: it takes longer to precompile on the HPC cluster than it takes to precompile and train the model on my laptop. The only explanation I can think of is file I/O, since that may be the only place where there’s a big difference between the cluster and my laptop.
Another cluster has a Xeon Silver (probably slower single-core performance than my laptop), but it’s always slow whether I’m using the more powerful EPYC or the Xeon.
Maybe Julia somehow fails to recognize the architecture precisely? If so, it should be possible to regain (most of) the performance using the --cpu-target option to julia, I think.
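Something along these lines, for example (the znver2 name below is just a guess for an EPYC node; native is usually the first thing to try, and I think the same value can also be set via the JULIA_CPU_TARGET environment variable so package precompilation picks it up):

```
# ask Julia to target the host CPU's full feature set
julia --cpu-target=native

# or name the microarchitecture explicitly, e.g. for a Zen 2 EPYC node
julia --cpu-target=znver2
```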
So it seems that the performance problem disappeared after I deleted the cache. Even without specifying the target CPU it’s still fast. No idea why that might be; maybe I was still using package caches compiled under rc2/rc1.
Is there a way to know which target CPU was detected by Julia?
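(In case it’s useful: I believe Sys.CPU_NAME reports the CPU name Julia/LLVM detected for the host, and versioninfo(verbose=true) prints detailed CPU information as well; I’m not sure either is exactly what codegen targets, but it’s a start.)

```julia
# CPU name as detected for the host machine
println(Sys.CPU_NAME)

# verbose versioninfo prints detailed CPU information as well
using InteractiveUtils
versioninfo(verbose=true)
```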