Why is this code run-time dispatch/slow?

I made a minimal reproducible example for a performance issue I am having with my code:

mutable struct Array_container
    array_field::Array{Float64}
end

function foo(a::Array_container, b::Array_container)
    a.array_field[1] - b.array_field[1]
end

function run_it_n_times(n::Int64)
    a = Array_container([1.1])
    b = Array_container([2.2])
    for i in (1:n)
        foo(a, b)
    end
end

I run

@profview run_it_n_times(10000000)

and observe that the call to foo() is flagged as “run-time dispatch”, since it’s shown in red (source: GitHub - timholy/ProfileView.jl: Visualization of Julia profiling data). See below for the image.

Specifically, the docs say that it is “dynamic dispatch, run-time method lookup, or a virtual call”. I don’t see any obvious type instability (I am a beginner though), or any other issues addressed in the Performance Tips section of the manual. What is wrong with this code and how do I fix its performance? Thank you!

I think the red in this case is the allocation of an array to store the result of foo, not any runtime dispatch.

Thank you for the reply! I don’t think that’s the problem in this case. The following code does not show those red bars:

function foobar(a::Array{Float64}, b::Array{Float64})
    a[1] - b[1]
end

function run_foobar_n_times(n::Int64)
    for i in (1:n)
        foobar([1.1], [2.2])
    end
end

Using @benchmark to illustrate this:

julia> using BenchmarkTools

julia> mutable struct Array_container
           array_field::Array{Float64}
       end

julia> function foo(a::Array_container, b::Array_container)
           a.array_field[1] - b.array_field[1]
       end;

julia> function bar(a::Array{Float64}, b::Array{Float64})
           a[1] - b[1]
       end;

julia> @benchmark foo(a, b) setup=((a,b)=(Array_container([rand()]), Array_container([rand()])))
BenchmarkTools.Trial: 
  memory estimate:  32 bytes
  allocs estimate:  2
  --------------
  minimum time:     50.809 ns (0.00% GC)
  median time:      51.581 ns (0.00% GC)
  mean time:        61.539 ns (11.28% GC)
  maximum time:     47.769 μs (99.86% GC)
  --------------
  samples:          10000
  evals/sample:     987

julia> @benchmark bar(a, b) setup=((a,b)=([rand()], [rand()]))
BenchmarkTools.Trial: 
  memory estimate:  0 bytes
  allocs estimate:  0
  --------------
  minimum time:     2.211 ns (0.00% GC)
  median time:      2.233 ns (0.00% GC)
  mean time:        2.273 ns (0.00% GC)
  maximum time:     30.894 ns (0.00% GC)
  --------------
  samples:          10000
  evals/sample:     1000

I don’t know if that would be called type instability but:

julia> Array{Float64}
Array{Float64,N} where N

julia> Array{Float64, 1}
Array{Float64,1}

Specifying the dimensionality of array_field inside the mutable struct makes a great difference (though I have no explanation for it, sorry):

julia> using BenchmarkTools

julia> mutable struct Array_container
           array_field::Array{Float64, 1}
       end

julia> function foo(a::Array_container, b::Array_container)
           a.array_field[1] - b.array_field[1]
       end;

julia> function bar(a::Array{Float64, 1}, b::Array{Float64, 1})
           a[1] - b[1]
       end;

julia> @benchmark foo(a, b) setup=((a,b)=(Array_container([rand()]), Array_container([rand()])))
BenchmarkTools.Trial: 
  memory estimate:  0 bytes
  allocs estimate:  0
  --------------
  minimum time:     4.130 ns (0.00% GC)
  median time:      4.181 ns (0.00% GC)
  mean time:        4.286 ns (0.00% GC)
  maximum time:     50.718 ns (0.00% GC)
  --------------
  samples:          10000
  evals/sample:     1000

julia> @benchmark bar(a, b) setup=((a,b)=([rand()], [rand()]))
BenchmarkTools.Trial: 
  memory estimate:  0 bytes
  allocs estimate:  0
  --------------
  minimum time:     2.641 ns (0.00% GC)
  median time:      2.656 ns (0.00% GC)
  mean time:        2.808 ns (0.00% GC)
  maximum time:     62.966 ns (0.00% GC)
  --------------
  samples:          10000
  evals/sample:     1000

Abstract field types in structs are a performance trap. Use concrete, possibly parameterized, types for the fields.

Also, do you need Array_container to be a mutable struct? That could also affect performance. Since the wrapped array is already mutable, you don’t need the outer type to be mutable also, in most cases.
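As a small illustration of that point (the name ImmutableContainer here is just for this sketch, not from the code above): an immutable struct wrapping a Vector still lets you mutate the array’s contents, only the field binding itself is fixed.

```julia
# Immutable outer struct; the field cannot be reassigned,
# but the Vector it holds is still mutable.
struct ImmutableContainer
    array_field::Vector{Float64}
end

c = ImmutableContainer([1.1])
c.array_field[1] = 9.9        # fine: mutates the array, not the struct
# c.array_field = [2.2]       # would error: setfield!: immutable struct cannot be changed
```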

BTW, the convention for type names is to use CapitalCase not underscores, so ArrayContainer.


Without fully parametrizing the Array, it remains a UnionAll type, which is not concrete (although the performance tips in the docs speak about abstract types, I think non-concrete would be a better term).

The explanation is that the compiler cannot know the memory layout of a UnionAll type, so it has to compile code for checking the runtime type (which is concrete). That check is the dynamic dispatch in the OP.
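A quick way to check this at the REPL (a small sketch, not specific to the code above) is isconcretetype:

```julia
# Array{Float64} leaves the dimension parameter N free, so it is a UnionAll
# type; the compiler cannot know its memory layout from the type alone.
isconcretetype(Array{Float64})     # false
Array{Float64} isa UnionAll        # true

# Fixing N makes it concrete, so field accesses can be fully compiled.
isconcretetype(Array{Float64, 1})  # true
```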

You can use Vector{Float64} for one-dimensional arrays, it looks nicer.


Thank you very much for the answers and explanations, my Julia codebase performance finally matches that of C++!

Should something along these lines be put in the Avoid fields with abstract containers section of the docs? The docs skirt this issue but never seem to address it directly (unless I’m missing it), and other beginners may have similar points of confusion.

Perhaps the simple addition of the following to that section of the docs:
=============================

struct MyContainer
    a::Array{Float64}
end

is bad: Array{Float64} is still not a concrete type, since the dimensionality is not specified. Instead, consider using something like:

struct MyContainer
    a::Array{Float64, 1}
end

(replace 1 with n for an n-dimensional array) or

struct MySimpleContainer{A<:AbstractVector}
    a::A
end

=============================
This addition to that section may help a lot of people. In my case, it would have saved me quite a few hours.
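To see why the parameterized version works, one can check (a quick sketch using the MySimpleContainer definition from the suggestion) that the constructed type is fully concrete:

```julia
struct MySimpleContainer{A<:AbstractVector}
    a::A
end

# The type parameter A is filled in with the concrete array type at
# construction time, so the compiler knows the field's exact layout.
c = MySimpleContainer([1.0, 2.0])
typeof(c)                    # MySimpleContainer{Vector{Float64}}
isconcretetype(typeof(c))    # true
```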


Our performance tips should probably mention that Array isn’t Vector and that can be a performance issue. I’m not sure why but a ton of newcomers use Array when they really want Vector or Matrix. Maybe it’s from Matlab where those aren’t distinct?

I think it’s that, and also that Array is typically what gets printed at the REPL.

Huh, I just tried replacing all Array{Float64, 1} with Vector{Float64} and it’s not making any noticeable performance difference in my case. What really helped was specifying the dimension. This could just be my use case though, perhaps in other cases using Vector instead makes a big difference?

Vector{T} is an alias for Array{T,1}. They are exactly the same type. The problem only arises when writing Array makes you forget a parameter and therefore end up with non-concrete field types.
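This identity is easy to verify at the REPL:

```julia
# Vector and Matrix are just aliases; comparing with === shows they are
# literally the same types, not merely similar ones.
Vector{Float64} === Array{Float64, 1}  # true
Matrix{Float64} === Array{Float64, 2}  # true
```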


Slightly off-topic, but I was digging through the Base sources to figure out how Array is defined exactly to further understand the behaviour above. But base/array.jl does not contain a definition of Array as far as I can tell, nor do any of the other julia source files. I see the Array type being referenced at the top of base/Base.jl, even before base/array.jl is included. Is the Array type defined in C code?

I think so, yes. It’s one of the few things defined that way, possibly because arrays are needed very early on in the chain, when almost none of the machinery for calling and compiling julia exists.


Coming from C++ this is a real mind-bender. The conceptually equivalent construct in C++ would be a template Array<T,N>, but there is no way to reference Array as a type without filling in all of the parameters.


On a project with friends we discussed the style we would use for the code, and most of them preferred Array{Float64, 1} over Vector{Float64}. I think it is because they are used to writing Array{Float64, 1}: they started out that way before they knew Vector existed.

Among the many cool new things in Julia master:

julia> rand(3)
3-element Vector{Float64}:
 0.24215660775997416
 0.5386524854737231
 0.5880707121465651

julia> typeof(ans)
Vector{Float64} (alias for Array{Float64, 1})