Startup time of 1000 packages – 53% slower in Julia 1.12 vs 1.10

jeez, @jules , give me a minute :smiley:

I modified the script to make sure the csv file is saved as well. Finished doing that 3 minutes ago, came to discourse to post that the aggregate data is now also available on the repo, and see that this was already announced 2 minutes ago by you.

I wish my students were as magically efficient and quick as this community (they are great anyway though)

9 Likes

PRs to the master branch that make ttfx_snippets_vis.jl better (called in make_and_commit_plots.sh) would be appreciated.

Haha I didn’t know you had just set it up a minute ago, thanks for making that data available!

1 Like

Here is the detailed runner info:

Summary

  • Ubuntu 24.04 LTS
  • 128 GB RAM
  • AMD Ryzen 7 5700G

1 Like

I was thinking something like this:

Code
using CSV, DataFrames, Downloads, Statistics
using CairoMakie  # or GLMakie

# Download the aggregated TTFX data and read it into a DataFrame
pdd = CSV.read(Downloads.download("https://raw.githubusercontent.com/JuliaEcosystemBenchmarks/julia-ecosystem-benchmarks/refs/heads/jeb_logs/data/Julia-TTFX-Snippets/ttfx_snippets_data.csv"), DataFrame)

# Summary statistics of the precompile time per package and Julia version
pddg = combine(groupby(pdd, [:package_name, :julia_version]), :precompile_time .=> [minimum, mean, median, maximum])

# Pick one package and order the rows by Julia version
pgn = pddg[pddg.package_name .== "BaseDirs", :]
sort!(pgn, :julia_version)

# Bar plot of the mean precompile time, with min-max whiskers on top
barplot(pgn.precompile_time_mean, label = "Mean",
    axis = (; xticks = (1:nrow(pgn), pgn.julia_version), xticklabelrotation = π/4,
        xlabel = "Julia version", ylabel = "Precompile time (seconds)"))
rangebars!(1:nrow(pgn), pgn.precompile_time_minimum, pgn.precompile_time_maximum,
    color = :red, linewidth = 1, whiskerwidth = 10, label = "Min-max range")
axislegend()
current_figure()
7 Likes

That is great, here are a few comments:

  • The named versions alpha, lts, etc., probably need to be filtered out, or changed to include only the last handful of results, because lts includes all versions that were historically lts, not just the current lts (same for the other named versions)
  • Median might be better than mean because there are a ton of extreme outliers (a rough sketch of both points is below)
  • If someone gets around to submitting this as a PR it will be accepted quickly and will be present in the autogenerated plots (see ttfx_snippets_vis.jl)
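
To illustrate the first two bullets, here is a minimal sketch building on the snippet above; the exact labels used for the named versions are an assumption, so check them with unique(pddg.julia_version) first:

# Keep only concrete version numbers and plot the median instead of the mean
named = ["alpha", "beta", "rc", "lts", "stable", "nightly"]  # assumed label values
pgn = pddg[(pddg.package_name .== "BaseDirs") .&& .!(pddg.julia_version .∈ Ref(named)), :]
sort!(pgn, :julia_version)
barplot(pgn.precompile_time_median, label = "Median",
    axis = (; xticks = (1:nrow(pgn), pgn.julia_version), xticklabelrotation = π/4,
        xlabel = "Julia version", ylabel = "Median precompile time (seconds)"))
current_figure()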
3 Likes

Maybe it would make sense to store commit hashes or something for the changing specifiers? Otherwise it gets a bit complicated to map them back, I guess.

1 Like

That is available deep in the logs and can be added to the csv (at some point).

The discussion seems to be burying the lead a bit - from the plot it looks like 1.12 does not have a major regression compared to 1.10, correct?

3 Likes

If I look at my CI jobs:

They need 26 min with Julia 1.10 and 31.5 min with Julia 1.12, a difference of about 21%. But this is, of course, only one measurement, not necessarily representative.

1 Like

And when I do the same comparison with GMT, it takes ~12-13 minutes with 1.10 and 24-25 with 1.11, 1.12, and nightly.

And this despite significant improvements achieved through substantial reductions in invalidations on 1.12.

1 Like

Did you create an issue for this regression on GitHub?

I just switched over our benchmark CI to 1.12. With 1.12, import shows a 10% improvement, precompile is a slight regression, and TTFX (compiling an empty kernel) takes 55% longer.

1 Like

For DifferentiationInterface, the regression is clearly visible in my current CI runs (commit JuliaDiff/DifferentiationInterface.jl@7a87b5f, "chore: switch to Runic formatting from JuliaFormatter (#871)"):

  • SimpleFiniteDiff backend:
    • 1.10: 58m
    • 1.11: 1h19
    • 1.12: 1h27
  • ForwardDiff backend:
    • 1.10: 43m
    • 1.11: 59m
    • 1.12: 55m
3 Likes

Not particularly this one, but for example #59494, where, with the very helpful contributions of Kristoffer and Tim, the TTFP of the plot function dropped to the amazing:

julia> @time using GMT
  0.707362 seconds (1.11 M allocations: 64.533 MiB, 5.06% gc time, 3.01% compilation time)

julia> @time @eval plot(rand(5,2))
  0.002897 seconds (721 allocations: 34.281 KiB)

This is very likely the fastest TTFP of a plot() function in Julia. But unfortunately this does not propagate, and a call that is almost identical in the functions it calls (it differs by only one line)

julia> @time @eval bar(1:5, (20, 35, 30, 35, 27), width=0.5, color=:lightblue, limits=(0.5,5.5,0,40))
  0.625434 seconds (2.13 M allocations: 108.216 MiB, 6.82% gc time, 98.59% compilation time)

compiles again. Sure, I can add this one too to the precompile workload, but just that extra line adds 500 kB to the precompiled cache. The size of the precompiled cache is what stops me from adding much more to the PrecompileTools workload. I have code with a couple hundred LOC that adds ~10 MB to the precompile cache. I really don’t understand what happens in this land, only that the caches are huge.
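
For reference, a generic sketch of how such an extra call ends up in a PrecompileTools workload (hypothetical module and stand-in functions, not GMT's actual precompile file); every call compiled inside @compile_workload also grows the on-disk cache, which is exactly the trade-off described above:

module MyPlots  # hypothetical package standing in for GMT

using PrecompileTools

# Stand-ins for the real plotting entry points
plot(data; kwargs...) = size(data)
bar(x, y; kwargs...) = length(x)

@setup_workload begin
    data = rand(5, 2)
    @compile_workload begin
        plot(data)                                  # already covered by the workload
        bar(1:5, (20, 35, 30, 35, 27), width=0.5,   # the extra call that adds to the cache
            color=:lightblue, limits=(0.5, 5.5, 0, 40))
    end
end

end # module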

2 Likes

What you describe here is probably not directly related to the compile times of packages. I am still looking for a simple, small, reproducible test case for the compile time that shows an increase of more than 20%. Then we would have the chance to create an issue that might get addressed.

This is likely influenced by how you specialize on kwargs in your code.
In Makie we try to convert the kwargs to untyped dicts as early in the call chain as possible, so that the inner functions don’t specialize on the kwargs.
I’m not sure if that helps with GMT, but without it I’m sure you’ll get larger precompile times with slightly different kwargs!
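
A minimal sketch of that pattern with hypothetical functions (not Makie's or GMT's actual code): the exported entry point collects the kwargs into an untyped Dict right away, so the inner functions see a single concrete type and are not recompiled for every new kwarg combination.

function myplot(data; kwargs...)
    # Collect the keyword arguments into an untyped Dict at the outermost call
    attrs = Dict{Symbol,Any}(kwargs)
    return _myplot_inner(data, attrs)
end

function _myplot_inner(data, attrs::Dict{Symbol,Any})
    # Compiles once per data type, regardless of which kwargs were passed
    width = get(attrs, :width, 1.0)
    color = get(attrs, :color, :black)
    return (; width, color, n = length(data))
end

myplot(rand(5); width = 0.5, color = :lightblue)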

2 Likes

I think that what you mean is more or less what I do here

_common_plot_xyz() is the big function that does most of the parsing work. I still have to keep a Dict{Symbol, Any} because the input can take different forms. But one thing that puzzles me is that other functions that rely almost entirely on calls to this main function still compile sub-functions that should already have been precompiled (at least they do not show up as invalidations with SnoopCompile).
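
One way to see which sub-functions really are compiled at call time, independently of invalidations, is to run Julia with the --trace-compile flag; a rough sketch of such a session (the calls and the printed precompile statement are purely illustrative):

$ julia --trace-compile=stderr
julia> using GMT

julia> plot(rand(5,2))    # covered by the precompile cache: ideally nothing is printed

julia> bar(1:5, (20, 35, 30, 35, 27), width=0.5)
precompile(Tuple{...})

Every precompile(...) statement printed for the second call corresponds to a method that was not covered by the package's precompile cache and had to be compiled on the spot.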

Not just startup times are increasing. The runtimes are getting worse too. I have a couple of tests in GeoStatsFunctions.jl that are now failing in Julia v1.12:

CompositeVariogram: Test Failed at /home/runner/work/GeoStatsFunctions.jl/GeoStatsFunctions.jl/test/theoretical/composite.jl:102
  Expression: #= /home/runner/work/GeoStatsFunctions.jl/GeoStatsFunctions.jl/test/theoretical/composite.jl:102 =# @elapsed(sill(γ)) < 1.0e-5
   Evaluated: 0.007924943 < 1.0e-5

I didn’t change a single line of code :worried:

P.S.: the elapsed test is placed after a warmup call, so it is not measuring compilation.
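
For context, the pattern behind that test is roughly this (a sketch with a placeholder function, not the actual GeoStatsFunctions.jl code):

using Test

f(x) = sum(abs2, x)   # placeholder for the real sill(γ) call
x = rand(100)

f(x)                            # warmup call: compilation happens here
@test @elapsed(f(x)) < 1.0e-5   # the timed call therefore measures pure runtime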

2 Likes

Oh, that’s quite severe, someone should look into it.