Static vectors of vectors slow?

rokke · December 24, 2025, 8:49am

first, I introduce some very similar ways to represent some boring data, with reasonable benchmarks:

julia> a=SVector{3, MVector{3,Int}}([MVector{3,Int}([1,1,1]),MVector{3,Int}([1,1,1]),MVector{3,Int}([1,1,1])])
3-element SVector{3, MVector{3, Int64}} with indices SOneTo(3):
 [1, 1, 1]
 [1, 1, 1]
 [1, 1, 1]

julia> struct mysvec
           one::MVector{3,Int}
           two::MVector{3,Int}
           thr::MVector{3,Int}
       end

julia> Base.:+(a::mysvec, b::mysvec) = a.one+b.one, a.two+b.two, a.thr+b.thr

julia> b=mysvec([1,1,1], [1,1,1], [1,1,1])
mysvec([1, 1, 1], [1, 1, 1], [1, 1, 1])

julia> c=[[1,1,1],[1,1,1],[1,1,1]]
3-element Vector{Vector{Int64}}:
 [1, 1, 1]
 [1, 1, 1]
 [1, 1, 1]

julia> @benchmark $a+$a
BenchmarkTools.Trial: 10000 samples with 998 evaluations per sample.
 Range (min … max):  14.578 ns …  11.243 μs  ┊ GC (min … max):  0.00% … 99.60%
 Time  (median):     29.597 ns               ┊ GC (median):     0.00%
 Time  (mean ± σ):   37.294 ns ± 262.228 ns  ┊ GC (mean ± σ):  18.37% ±  2.63%

     ▁█▇                                                        
  ▁▂▄███▇▄▃▃▃▂▂▃▃▂▁▁▂▃▃▄▅▆▅▅▃▃▂▂▂▁▂▁▂▂▃▄▆▆▆▆▅▄▄▃▃▃▃▃▃▃▂▂▂▂▁▂▁▁ ▃
  14.6 ns         Histogram: frequency by time         53.1 ns <

 Memory estimate: 96 bytes, allocs estimate: 3.

julia> @benchmark $b+$b
BenchmarkTools.Trial: 10000 samples with 998 evaluations per sample.
 Range (min … max):  18.183 ns …  11.171 μs  ┊ GC (min … max):  0.00% … 99.42%
 Time  (median):     32.191 ns               ┊ GC (median):     0.00%
 Time  (mean ± σ):   40.879 ns ± 265.808 ns  ┊ GC (mean ± σ):  18.01% ±  2.81%

     ▄█▅                                                        
  ▁▅█████▆▄▅▆▅▄▂▂▂▃▄▅███▆▄▃▃▂▂▂▂▂▂▃▅▆▇▆▆▆▆▅▅▅▅▄▃▃▂▂▂▂▂▂▂▁▁▁▁▁▁ ▃
  18.2 ns         Histogram: frequency by time         58.7 ns <

 Memory estimate: 96 bytes, allocs estimate: 3.

julia> @benchmark $c+$c
BenchmarkTools.Trial: 10000 samples with 984 evaluations per sample.
 Range (min … max):   50.980 ns …  13.368 μs  ┊ GC (min … max):  0.00% … 98.93%
 Time  (median):     113.817 ns               ┊ GC (median):     0.00%
 Time  (mean ± σ):   147.831 ns ± 539.619 ns  ┊ GC (mean ± σ):  22.46% ±  6.22%

                         ▁▄▆█▆▅▄▄▃▁                              
  ▂▁▂▂▂▃▄▇▇▇██▆▅▄▄▄▅▅▅▅▆▆███████████▇▆▅▅▅▅▅▆▅▆▆▄▄▃▃▃▃▃▄▃▃▂▂▂▃▃▂ ▄
  51 ns            Histogram: frequency by time          190 ns <

 Memory estimate: 320 bytes, allocs estimate: 8.

however, if I release one condition, the static vector becomes slower than a normal vector:

julia> a=SVector{3, MVector}([MVector{3,Int}([1,1,1]),MVector{4,Int}([1,1,1,1]),MVector{3,Int}([1,1,1])])
3-element SVector{3, MVector} with indices SOneTo(3):
 [1, 1, 1]
 [1, 1, 1, 1]
 [1, 1, 1]

julia> struct mysvec2
           one::MVector{3,Int}
           two::MVector{4,Int}
           thr::MVector{3,Int}
       end

julia> Base.:+(a::mysvec2, b::mysvec2) = a.one+b.one, a.two+b.two, a.thr+b.thr

julia> b=mysvec2([1,1,1], [1,1,1,1], [1,1,1])
mysvec([1, 1, 1], [1, 1, 1, 1], [1, 1, 1])

julia> c=[[1,1,1],[1,1,1,1],[1,1,1]]
3-element Vector{Vector{Int64}}:
 [1, 1, 1]
 [1, 1, 1, 1]
 [1, 1, 1]

julia> @benchmark $a+$a
BenchmarkTools.Trial: 10000 samples with 200 evaluations per sample.
 Range (min … max):  362.120 ns …  64.566 μs  ┊ GC (min … max): 0.00% … 98.97%
 Time  (median):     450.567 ns               ┊ GC (median):    0.00%
 Time  (mean ± σ):   477.896 ns ± 882.369 ns  ┊ GC (mean ± σ):  2.60% ±  1.40%

                 ▁██▇▆▅▂▂                                        
  ▂▂▁▁▁▂▂▂▁▂▂▂▂▃▅█████████▇▇▇▇▇▇▆▆▅▄▄▄▄▃▃▃▄▃▃▃▃▃▂▂▂▃▂▂▂▃▃▂▂▂▂▂▂ ▄
  362 ns           Histogram: frequency by time          609 ns <

 Memory estimate: 192 bytes, allocs estimate: 6.

julia> @benchmark $b+$b
BenchmarkTools.Trial: 10000 samples with 998 evaluations per sample.
 Range (min … max):  16.423 ns …  12.970 μs  ┊ GC (min … max):  0.00% … 99.50%
 Time  (median):     28.776 ns               ┊ GC (median):     0.00%
 Time  (mean ± σ):   41.415 ns ± 315.749 ns  ┊ GC (mean ± σ):  21.19% ±  2.81%

      ▆█▅     ▁▄▅▄▂                                             
  ▁▁▂▅███▅▄▆▆▆█████▆▅▃▃▂▂▂▂▂▂▂▁▂▂▂▂▃▄▆▆▆▄▄▄▃▃▂▂▂▂▂▂▁▁▁▁▁▁▁▁▁▁▁ ▃
  16.4 ns         Histogram: frequency by time           68 ns <

 Memory estimate: 112 bytes, allocs estimate: 3.

julia> @benchmark $c+$c
BenchmarkTools.Trial: 10000 samples with 983 evaluations per sample.
 Range (min … max):   57.537 ns …  15.545 μs  ┊ GC (min … max):  0.00% … 98.95%
 Time  (median):     109.860 ns               ┊ GC (median):     0.00%
 Time  (mean ± σ):   147.736 ns ± 629.855 ns  ┊ GC (mean ± σ):  24.53% ±  5.75%

                      ▁▄▆██▇▅▂                                   
  ▁▁▁▁▂▂▃▅▅▅▅▄▅▄▄▅▅▅▆▇████████▇▅▃▃▃▂▂▂▂▂▂▂▂▂▂▂▂▂▂▃▂▂▂▂▂▁▂▁▂▂▁▁▁ ▃
  57.5 ns          Histogram: frequency by time          193 ns <

 Memory estimate: 336 bytes, allocs estimate: 8.

I included the struct version to show that there is a theoretical gain to be made, but I would like to just treat it like a normal vector without having to define a bunch of struct operations which would almost certainly be less efficient than normal vector indexing/iterating methods. what is the correct way to make a SVector like this?

GunnarFarneback · December 24, 2025, 9:40am

As far as I know you can’t. Since the elements are of different type, the SVector needs to package them as an abstract type (in this case MVector without parameters), and when indexing the code has to dynamically see what type was retrieved.

You could consider using an SVector{3, Vector{Int}} or a tuple of your MVectors instead, but it depends on how you’re going to use them whether it makes sense.

rokke · December 24, 2025, 10:01am

nice, a SVector{3, Vector{Int}} is a bit faster than a default vector; but it still loses to the struct. my use case is mostly adding them all together like in the benchmark I’m testing, or sometimes just a specific column like a[2].+=b[2].

julia> d=SVector{3, Vector{Int}}([[1,1,1],[1,1,1,1],[1,1,1]])                  
3-element SVector{3, Vector{Int64}} with indices SOneTo(3):                    
 [1, 1, 1]                                                                     
 [1, 1, 1, 1]                                                                  
 [1, 1, 1]                                                                     

julia> @benchmark $d+$d
BenchmarkTools.Trial: 10000 samples with 990 evaluations per sample.
 Range (min … max):   44.966 ns …  18.776 μs  ┊ GC (min … max):  0.00% … 99.46%
 Time  (median):      74.091 ns               ┊ GC (median):     0.00%
 Time  (mean ± σ):   105.874 ns ± 557.036 ns  ┊ GC (mean ± σ):  25.05% ±  4.85%

          ▁▃▃▂▂▂▂▂▆▇█▆▃                                           
  ▁▁▁▂▂▃▆███████████████▆▄▃▂▂▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▂▂▃▃▃▃▂▂▂▂▂▂▂▂▂▂▂▁▁ ▃
  45 ns            Histogram: frequency by time          150 ns <

 Memory estimate: 256 bytes, allocs estimate: 6.

matthias314 · December 24, 2025, 12:11pm

You could use SmallVector or MutableSmallVector from SmallCollections.jl. They are like SVector (= FixedVector in SmallCollections.jl), but with a length up to some predefined capacity.

julia> using SmallCollections, Chairmarks

julia> b = mysvec2([1,1,1], [1,1,1,1], [1,1,1]);
julia> c = FixedVector{3,SmallVector{4,Int}}([[1,1,1], [1,1,1,1], [1,1,1]]);
julia> d = FixedVector{3,MutableSmallVector{4,Int}}([[1,1,1], [1,1,1,1], [1,1,1]]);

julia> @b $b+$b, $c+$c, $d+$d
(29.692 ns (3 allocs: 112 bytes), 5.175 ns, 5.991 ns)

By default, the sum of two MutableSmallVectors is a SmallVector (immutable). That’s why there are no allocations. You could change this manually:

julia> @b map(MutableSmallVector, $d+$d)
24.289 ns (3 allocs: 144 bytes)

EDIT: If you modify whole vectors, but not individual entries, then something like MutableFixedVector{3,SmallVector{4,Int}} might be the better choice. Also, if you can get by with Int8 or Int16 instead of Int, then this would be faster if your vectors get longer.

Topic		Replies	Views
Usage of arrays of static arrays New to Julia performance , staticarrays	16	1241	February 22, 2023
SVector vs Vec usage: Why do I have an 8x speedup in a simple example? Performance	7	1085	August 17, 2019
Performance regression with StaticArrays? Performance question , staticarrays	5	500	January 27, 2023
Fastest way to convert an MVector/NTuple to a different sized SVector/NTuple? Performance staticarrays	8	492	February 27, 2023
Constructing SVector with a loop General Usage question	17	1107	May 20, 2025

Static vectors of vectors slow?

Related topics