Possible Performance Regression for Loops on 1.9?

jacobxk · February 3, 2023, 3:17pm

There is a function in Bogumił Kamiński’s book Julia for Data Analysis(page 7) with the purpose to show off the optimizations done by Julia compiler.

function sum_n(n) 
    s = 0
    for i in 1:n
        s += i
    end
    return s 
end

In the book, the code shown superb results in runtime, like 0.000001 seconds, where running the Julia v1.7. However, I tested the same function in both v1.8.3, v1.8.5 and v1.9.0, the results were not the case, which yielded about one second. Thus, I am wondering that what happened from v1.7 to v1.8 and above on the loops?

TheCedarPrince · February 3, 2023, 3:32pm

Could you please re-run your function after using the package BenchmarkTools.jl? Like this:

using BenchmarkTools

@btime sum_n(1_000_000_000)

And post back the results?

P.S. Also, welcome to the Julia community! Thanks for bringing this up!

Oscar_Smith · February 3, 2023, 3:32pm

This is very odd. The @code_native is showing that the loop can be folded, but when run in the global scope, it’s running the loop.

filchristou · February 3, 2023, 3:33pm

A small remark. First of all and as a rule of thumb, you should benchmark with @time on the second call of the function. That is because the first time you call the function it will also compile it and essentially @time will also include compilation time, which is something you typically are not interested in.
Also using BenchmarkTools.jl as mentioned above will help a deal.

Oscar_Smith · February 3, 2023, 3:35pm

This isn’t compilation time. Running @time repeatedly shows it taking 1.7 seconds, but running @btime on it takes nanoseconds (and this is easy to see since @btime runs faster than @time which shouldn’t be possible).

Oscar_Smith · February 3, 2023, 3:40pm

Ok, this appears to be something weird @time is doing. just running sum_n is fast.

jacobxk · February 3, 2023, 3:43pm

Thanks for the info. Here is the results.

While there were almost equivalent performance in terms of running @btime, but still the difference about a nanosecond. In the book, Bogumił explained that the consecutive sum would be optimized as to n(n+1)/2. I am not sure it is still the case in v1.8.0 or above. How can I check it, given that I don’t know assembly language.

jacobxk · February 3, 2023, 3:46pm

I checked with a much larger number, and the result implies that the n(n+1)/2 still the case?

jacobxk · February 3, 2023, 3:48pm

Yes, if I conduct @time sum_n(1_000_000_000_000), there is a demanding CPU loading and no expected results shown after a long time.

TheCedarPrince · February 3, 2023, 3:56pm

Hm. Thanks for posting this back! This is helpful.

I cannot explain why it is apparently slower by a single nanosecond – perhaps a regression somewhere on 1.9 that hasn’t been caught yet? – but I’d say the speeds are still comparable as you mentioned. I am sure a speed hacker like @Oscar_Smith or @Elrod could probably comment more. Just looking at this though from my perspective, this does seem to be a minor regression on 1.9.

P.S. I updated the title of your post to make it a bit more precise for other Julians to see/understand.

jacobxk · February 3, 2023, 3:58pm

Thanks and please go ahead.

nilshg · February 3, 2023, 5:33pm

There was some change to time to avoid constant folding and similar instances of the compiler defeating the benchmark iirc recently

vchuravy · February 3, 2023, 6:23pm

The issue is Spurious performance regression in Julia 1.8 vs 1.7 for `@time` in top-level-scope · Issue #47561 · JuliaLang/julia · GitHub

There is a proposed fix, but it is not yet ready for prime time.

Topic		Replies	Views
Simple code run @time 1.7, 1.8, 1.9 difference New to Julia	7	445	April 28, 2023
Shouldn't 1.8.0 be faster than Julia 1.7? Performance	30	2544	September 16, 2022
Speed up in Julia Performance	11	915	September 1, 2020
Trouble understanding how time() works New to Julia	3	400	April 6, 2020
How much is it normal that @time differs in time? New to Julia	20	2372	June 28, 2017

Possible Performance Regression for Loops on 1.9?

Related topics