Passing struct vs struct fields as function arguments

fipelle · September 2, 2021, 2:59pm

Hi,

I have noticed that passing struct fields as function arguments generally results in better performance compared to passing the struct directly. For instance,

using BenchmarkTools;
using Random;

struct MyStruct
    A::Vector{Float64}
    B::Matrix{Float64}
end

Random.seed!(1);
something = MyStruct(randn(100), randn(100,100));

function test1(A::Vector{Float64}, B::Matrix{Float64})
    return B*A;
end

call_test1(something::MyStruct) = test1(something.A, something.B);

function test2(something::MyStruct)
    return something.B*something.A;
end

@inline function test3(something::MyStruct)
    return something.B*something.A;
end

julia> @btime call_test1($something);
  2.331 μs (1 allocation: 896 bytes)

julia> @btime test2($something);
  2.517 μs (1 allocation: 896 bytes)

julia> @btime test3($something);
  2.598 μs (1 allocation: 896 bytes)

julia> @benchmark call_test1($something)
BenchmarkTools.Trial: 10000 samples with 9 evaluations.
 Range (min … max):  2.428 μs … 94.531 μs  ┊ GC (min … max): 0.00% … 95.36%
 Time  (median):     2.899 μs              ┊ GC (median):    0.00%
 Time  (mean ± σ):   2.978 μs ±  1.030 μs  ┊ GC (mean ± σ):  0.30% ±  0.95%

          ▁▅▇█▃                                               
  ▁▂▄▅▆▆▅▆██████▆▆▆▆▅▃▂▂▂▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁ ▂
  2.43 μs        Histogram: frequency by time        4.86 μs <

 Memory estimate: 896 bytes, allocs estimate: 1.

julia> @benchmark test2($something)
BenchmarkTools.Trial: 10000 samples with 9 evaluations.
 Range (min … max):  2.664 μs … 95.310 μs  ┊ GC (min … max): 0.00% … 95.29%
 Time  (median):     3.091 μs              ┊ GC (median):    0.00%
 Time  (mean ± σ):   3.363 μs ±  1.503 μs  ┊ GC (mean ± σ):  0.27% ±  0.95%

    ▂▇█▆▁                                                     
  ▁▃█████▅▃▂▂▂▂▂▃▃▂▂▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁ ▂
  2.66 μs        Histogram: frequency by time        7.51 μs <

 Memory estimate: 896 bytes, allocs estimate: 1.

julia> @benchmark test3($something)
BenchmarkTools.Trial: 10000 samples with 9 evaluations.
 Range (min … max):  2.379 μs … 97.867 μs  ┊ GC (min … max): 0.00% … 96.19%
 Time  (median):     3.214 μs              ┊ GC (median):    0.00%
 Time  (mean ± σ):   3.586 μs ±  1.513 μs  ┊ GC (mean ± σ):  0.26% ±  0.96%

  ▁▃ ▃▆██▇▆▆▅▆▆▅▃▁  ▁▁▁▁                                     ▂
  █████████████████████████▇██▆▇██▇▆▇▇▇▆▇▆▇▆▇▇▆▇▆▆▆▆▆▆▆▆▆▆▅▅ █
  2.38 μs      Histogram: log(frequency) by time     8.93 μs <

 Memory estimate: 896 bytes, allocs estimate: 1.

Can someone explain why? Any link to the appropriate documentation (if available) would be appreciated.

lmiq · September 2, 2021, 3:13pm

my two cents is that that is only benchmarking noise:

julia> @btime call_test1($something);
  2.614 μs (1 allocation: 896 bytes)

julia> @btime test2($something);
  2.808 μs (1 allocation: 896 bytes)

julia> @btime test3($something);
  2.821 μs (1 allocation: 896 bytes)

julia> @btime call_test1($something);
  2.805 μs (1 allocation: 896 bytes)

julia> @btime test2($something);
  2.879 μs (1 allocation: 896 bytes)

julia> @btime test3($something);
  2.765 μs (1 allocation: 896 bytes)

julia> @btime call_test1($something);
  2.735 μs (1 allocation: 896 bytes)

julia> @btime test2($something);
  3.511 μs (1 allocation: 896 bytes)

julia> @btime test3($something);
  3.441 μs (1 allocation: 896 bytes)

julia> @btime call_test1($something);
  3.692 μs (1 allocation: 896 bytes)

julia> @btime test2($something);
  2.986 μs (1 allocation: 896 bytes)

julia> @btime test3($something);
  3.778 μs (1 allocation: 896 bytes)

rdeits · September 2, 2021, 3:15pm

Yeah, I get exactly the same performance for all three:

julia> @btime call_test1($something);
  2.579 μs (1 allocation: 896 bytes)

julia> @btime test2($something);
  2.525 μs (1 allocation: 896 bytes)

julia> @btime test3($something);
  2.552 μs (1 allocation: 896 bytes)

which makes sense–I wouldn’t expect there to be any difference.

fipelle · September 2, 2021, 3:21pm

Thank you both. I agree with you, I was not expecting to find any differences. However, running many times @btime I keep getting the output I posted above (in relative terms). Can it be something specific to my laptop?

Sukera · September 2, 2021, 3:43pm

@btime reports the minimal time - under load, your laptop may throttle its speed, making your benchmark appear slower. Since your timings seem to get slower and slower, I’d guess during the first benchmark the CPU is running with a higher clockspeed than with subsequent benchmarks.

What CPU do you have?

fipelle · September 2, 2021, 4:01pm

Thanks. I have an Intel(R) Core™ i7-1060NG7 CPU @ 1.20GHz.

Topic		Replies	Views
Time cost for using structs Performance benchmarktools	2	946	February 27, 2022
Unexpected performance outcome. Does accessing struct members cause allocation? General Usage performance	18	284	October 16, 2024
Function performance when passed as an argument vs. passed in a struct Performance question , performance	8	3796	November 4, 2017
Changing array inside mutable struct passed to function, how to improve performance? Performance question	15	408	June 27, 2023
On the performance of function calls that depends on a variable Performance metaprogramming	12	953	February 24, 2021

Passing struct vs struct fields as function arguments

Related topics