Why these simple codes have so much difference in performace?

Brian1 · November 13, 2022, 11:12am

struct TestStruct
    a::Int
    b::Int
end

get_field1(x::Vector{TestStruct}, y::S) = getfield.(x, y)
get_field2(x::Vector{TestStruct}, y::S) = [getfield(x_, y) for x_ in x]
get_field3(x::Vector{TestStruct}, y::Val{T}) where T = getfield.(x, T)
get_field4(x::Vector{TestStruct}, y::Val{T}) where T = [getfield(x_, T) for x_ in x]

test_struct_vec = [TestStruct(1, 1) for i = 1 : 1_000_000];

@time :get_fields1 get_field1(test_struct_vec, :a);
@time :get_fields2 get_field2(test_struct_vec, :a);
@time :get_fields3 get_field3(test_struct_vec, Val(:a));
@time :get_fields4 get_field4(test_struct_vec, Val(:a));

get_fields1: 0.015035 seconds (1.00 M allocations: 38.147 MiB)
get_fields2: 0.017935 seconds (1.00 M allocations: 38.147 MiB)
get_fields3: 0.030592 seconds (1.00 M allocations: 38.147 MiB)
get_fields4: 0.001574 seconds (2 allocations: 7.629 MiB)

As can be seen above, get_fields4 is much faster than the others. What is the course of this difference?

jules · November 13, 2022, 12:57pm

For the first two to be fast, the symbol needs to be constant propagated so the compiler compiles a special variant where sym == :a, just like you force it when you explicitly use Val. But this doesn’t happen if the symbol is not constant when it is used, which is the case because you time a top level expression. If you wrap the call with a normal symbol into an outer function, then inside that function the compiler will know the symbol to be constant, which should make constant propagation happen and match the speed of variant 4.

Why 3 is slower I’m not immediately sure, I’m just used to broadcasting having the occasional problem with type inference or constant propagation.

Topic		Replies	Views
Performance benefits of writing getters for struct fields Performance	2	104	October 31, 2024
Allocation depending on value of typed field in struct New to Julia question , performance , memory-allocation	4	494	March 23, 2022
Passing struct vs struct fields as function arguments Performance memory-allocation , data_structures , struct	5	1073	September 2, 2021
Mutable structs with all constant fields outperform immutable structs for equality comparison Performance struct	1	161	December 4, 2024
Understanding specialized methods for field access General Usage question , type , struct	7	352	February 7, 2023

Why these simple codes have so much difference in performace?

Related topics