Sub-arrays of static arrays

TimHargreaves · April 25, 2025, 2:20pm

From a previous step in my computations, I have a static array and I would now like to perform an operation, say a QR decomposition, on a sub-array. This is performance critical code so I would like to squeeze as much speed out of it as possible—ideally being roughly the same speed as performing the operation directly on a static array the same size as the sub-array.

I’ve played around with a few ideas:

julia> A = @SMatrix rand(4, 4);

# Baseline comparison
julia> A_sub = SMatrix{3, 3}(A[1:3, 1:3]);

julia> 1e9 * @belapsed qr($A_sub)
20.519558676028083

# Views are not efficient
julia> 1e9 * @belapsed qr(@view ($A)[1:3, 1:3])
880.5961538461538

# Constructing a new SMatrix is better but still a bit slower than the target
julia> 1e9 * @belapsed qr(SMatrix{3, 3}($A[1:3, 1:3]))
50.49797160243408

# We can manually construct a new matrix and we then get the speed we desired. Can we automate this?
julia> 1e9 * @belapsed qr(SMatrix{3, 3}(($A)[1,1], ($A)[2,1], ($A)[3,1], 
                              ($A)[1,2], ($A)[2,2], ($A)[3,2],
                              ($A)[1,3], ($A)[2,3], ($A)[3,3]))
22.024072216649948

The last results seems to suggest what I’m after is possible, but I’m not sure how to generalise this. Any thoughts would be appreciated.

Mason · April 25, 2025, 2:40pm

You could do

julia> A[SOneTo(3), SOneTo(3)]
3×3 SMatrix{3, 3, Float64, 9} with indices SOneTo(3)×SOneTo(3):
 0.679889  0.422904   0.851232
 0.248234  0.805128   0.460355
 0.584804  0.0535163  0.903487

i.e.

qr(A[SOneTo(3), SOneTo(3)])

mbauman · April 25, 2025, 2:45pm

And that’s also exactly what @view A[SOneTo(3), SOneTo(3)] returns, too. Since static arrays are immutable, there’s no meaningful distinction between views and getindex. You just need to give it static indices so it can know exactly how big the sub-array needs to be.

TimHargreaves · April 25, 2025, 2:57pm

Ah fantastic, thank you both! That gives the target speed

1e9 * @belapsed qr(($A)[SOneTo(3), SOneTo(3)])
21.837349397590362

Does this only work for unit ranges starting from one? I can’t seem to find any variants for say i:j where i, j are known at compile time.

Mason · April 25, 2025, 3:00pm

Yeah, there really should be a static range type. For now though you can just write

function srange(i,j;kwargs...) 
    r = range(i,j;kwargs...)
    SVector{length(r)}(r)
end

and then e.g.

qr(A[srange(2, 4), srange(1, 3)])

DNF · April 25, 2025, 5:29pm

This one seems a bit more robust with respect to performance:

 function srange_(start, step, len)
    t = ntuple(i->(i-1)*step + start, len)
    return SVector(t)
end

It seems fast in most cases, but if you really need it you can provide len as a Val. This also works for float ranges.

mikmoore · April 25, 2025, 5:53pm

There is the internal StaticArrays.SUnitRange(start, stop). It may have some pitfalls or missing features and (since it isn’t part of the interface) might someday break. But for now it works. And I would hope that eventually something like this gets made public interface.

Topic		Replies	Views
SMatrix static slice General Usage question , staticarrays	3	316	December 6, 2023
Custom StaticArray matrix type constructors from sub-arrays of a larger matrix type New to Julia question , staticarrays	5	555	July 12, 2022
Static matrix with size 400 New to Julia staticarrays	7	461	May 6, 2024
Recent developments in StaticArrays.jl land Package Announcements performance , array , linearalgebra , arrays	12	3718	March 11, 2021
StaticArrays:construct a SMatrix from three vectors General Usage question	9	1300	October 19, 2017

Sub-arrays of static arrays

Related topics