Fastest way to calculate eigenvectors of 4x4 matrix

Hi guys,

I am trying to run a model which simulates a long time series. For each time step, I need to solve for the eigenvalues and eigenvectors of one matrix. This matrix is only 4×4, but the computation time matters because of the huge number of time-step iterations in the for-loop…

The matrix is sparse; it only has values on the diagonal and the sub-/super-diagonals (i.e., it is tridiagonal).

For now I am using:

e_val = eigvals(A);
e_vec = eigvecs(A);

from LinearAlgebra. However, profiling the whole model shows that these two lines:

  1. create a lot of allocations, because eigvals and eigvecs allocate new arrays on every call;
  2. take a lot of time…

So, is there any way to make this a bit faster, with ZERO allocations? I don’t want to use eigvals! because it modifies the original matrix… I could pass a preallocated array prealloc_e_val for the eigenvalues. Basically, what I want is:
e_val = eigvals(A, prealloc_e_val)
with the fastest possible speed…
Could you please help me with this? Thanks!

Have you tried using StaticArrays?

Not yet… but could you please be more specific? I am quite new to Julia…

Why? What do you do with these at each timestep? (Maybe there is a way to avoid computing them entirely. And since your matrix is non-symmetric, the whole idea of diagonalizing it is often suspect, e.g. what would you do if the matrix were defective?)

The StaticArrays.jl package has optimized methods for working with small fixed-size vectors and matrices. If you have zillions of 4x4 matrices, they will be much more efficient if represented by the SMatrix type from that package.

However, eigenvectors/values of non-symmetric matrices are not currently supported by StaticArrays.jl.


Given that this is a tridiagonal 4x4, it feels like there should be an analytical solution; is that being too optimistic?
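
For what it’s worth, the characteristic polynomial of a tridiagonal matrix obeys the three-term recurrence p_k(\lambda) = (a_k - \lambda) p_{k-1}(\lambda) - b_{k-1} c_{k-1} p_{k-2}(\lambda), so the 4x4 case reduces to a quartic. A sketch of that route (my own illustration; Polynomials.jl is used only to hold and root the quartic, and note that roots itself falls back on a companion-matrix eigensolve, so a real speedup would need a hand-written quartic solver, which is numerically delicate):

using Polynomials

# Eigenvalues of a tridiagonal matrix from the characteristic-polynomial
# recurrence. a = diagonal, b = superdiagonal, c = subdiagonal.
function tridiag_charpoly_eigvals(a, b, c)
    λ = Polynomial([0.0, 1.0], :λ)
    p_prev = Polynomial([1.0], :λ)   # p_0
    p      = a[1] - λ                # p_1
    for k in 2:length(a)
        p, p_prev = (a[k] - λ) * p - b[k-1] * c[k-1] * p_prev, p
    end
    roots(p)   # for a 4x4 this is a quartic
end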

Note that StaticArrays.jl does not have specialized methods for eigenvalues/eigenvectors of 4x4 matrices and falls back on allocating and re-wrapping, so it’s actually slightly slower than working with a regular array.

julia> using StaticArrays, LinearAlgebra, BenchmarkTools

julia> let A = rand(4, 4)
           As = SMatrix{4, 4}(A)
           @btime eigvecs($A)
           @btime eigvecs($As)
       end;
  4.100 μs (30 allocations: 5.81 KiB)
  4.129 μs (32 allocations: 6.02 KiB)

I don’t know about zero allocations, but it’s possible to use LAPACK directly from Julia, e.g. geev! to find the eigenvalues and the left and right eigenvectors. This might be somewhat faster than calling eigvals/eigvecs. For symmetric matrices, there’s syev!. There are still some allocations, though.

using LinearAlgebra

prealloc = similar(A)  # allocate once, outside the loop
prealloc .= A          # copy A so the original isn't destroyed
W, VL, VR = LAPACK.geev!('N', 'V', prealloc)   # see ?LAPACK.geev!

It is also possible to call the actual unwrapped LAPACK routines via @ccall. Since they don’t allocate, all the workspace must be preallocated. It’s a bit tricky, but absolutely possible. There are quite a few arguments; see the reference: LAPACK: dgeev

Here’s how to get zero allocations with a direct LAPACK call:

using LinearAlgebra

const dgeev = let
    preallocA = fill(0.0, 4, 4)
    WR = fill(0.0, 4)  # real part of eigval
    WI = fill(0.0, 4)  # imag part
    VL = fill(0.0, 4, 4)  # left eigvecs
    VR = fill(0.0, 4, 4)  # right eigvecs
    LWORK = Int32(64)  # workspace size (dgeev needs ≥ 4N for this call)
    WORK = Vector{Float64}(undef, LWORK)
    INFO = Ref(Int32(0))  # error/status flag
    
    function mydgeev(A)
        preallocA .= A  # work on a copy; dgeev_ destroys its input
        @ccall "liblapack".dgeev_(
            Ref(Int8('N'))::Ptr{Cchar},   # JOBVL: no left eigvecs
            Ref(Int8('V'))::Ptr{Cchar},   # JOBVR: compute right eigvecs
            Ref{Int32}(size(A, 2))::Ptr{Cint},   # N: order of the matrix
            preallocA::Ptr{Float64},             # A (overwritten)
            Ref{Int32}(size(A, 1))::Ptr{Cint},   # LDA
            WR::Ptr{Float64},
            WI::Ptr{Float64},
            VL::Ptr{Float64},
            Ref{Int32}(size(VL, 1))::Ptr{Cint},  # LDVL
            VR::Ptr{Float64},
            Ref{Int32}(size(VR, 1))::Ptr{Cint},  # LDVR
            WORK::Ptr{Float64},
            Ref(LWORK)::Ptr{Cint},               # LWORK
            INFO::Ptr{Cint}
        )::Cvoid
        INFO[] == 0 || error("dgeev_ failed, INFO = ", INFO[])
        return WR, WI, VR
    end
end

A = rand(4,4)
dgeev(A)

The @ccall to "liblapack".dgeev_ might need a different library name on your machine. “liblapack” is the name of the LAPACK library which is linked into Julia, and “dgeev_” with the trailing underscore is a naming convention used by most Fortran compilers. The call arguments are standardized. It’s a bit tricky because LAPACK is written in/for Fortran, where all arguments are passed by reference.

The call to dgeev(A) does not allocate. The outputs (WR, WI, VR) are overwritten by the next call to dgeev, so if you store them in some structure, make sure to copy them.

Complex eigenvectors may come in conjugate pairs. See the docs (linked above) for how they are coded in VR.
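
Concretely, per the dgeev docs: if WI[j] is zero, the j-th eigenvector is simply VR[:, j]; if WI[j] > 0, columns j and j+1 hold the real and imaginary parts of a conjugate pair. A small (allocating) helper to unpack them, in case you ever need the complex vectors explicitly (the name and buffer handling are my own):

# Reconstruct complex eigenvectors from dgeev's packed real output.
function unpack_eigvecs(WR, WI, VR)
    n = length(WR)
    V = zeros(ComplexF64, n, n)
    j = 1
    while j <= n
        if WI[j] == 0                       # real eigenvalue
            V[:, j] = VR[:, j]
            j += 1
        else                                # conjugate pair
            V[:, j]     = VR[:, j] .+ im .* VR[:, j+1]
            V[:, j + 1] = VR[:, j] .- im .* VR[:, j+1]
            j += 2
        end
    end
    return V
end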

Note that the dgeev above is not thread safe. If this is a parallel loop, the preallocated buffers (between let and function mydgeev) must be task-local, e.g. stored in Base.task_local_storage() with a get!.
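
A hypothetical sketch of that pattern (the helper name and the NamedTuple of buffers are mine):

# Each task lazily creates its own buffer set on first use, so concurrent
# calls from Threads.@threads / Threads.@spawn never share state.
function task_local_dgeev(A)
    bufs = get!(task_local_storage(), :dgeev_buffers) do
        (A = fill(0.0, 4, 4), WR = fill(0.0, 4), WI = fill(0.0, 4),
         VL = fill(0.0, 4, 4), VR = fill(0.0, 4, 4),
         WORK = Vector{Float64}(undef, 64), INFO = Ref(Int32(0)))
    end
    bufs.A .= A
    # ... the same @ccall as above, but reading/writing bufs.WR, bufs.VR,
    # etc. instead of the buffers captured in the let block ...
    return bufs.WR, bufs.WI, bufs.VR
end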

If your matrix is complex, you must call zgeev_ instead, with different arguments.


I have been hesitating to point this out, because I think the bars over your variables mean your matrix is probably complex, and that negative sign in the (4,3) position looks like it could be a deal breaker even if a lot of other things are real and positive. But a real tridiagonal T is diagonally similar to a symmetric tridiagonal if t_{j, j+1} t_{j+1, j} > 0 for each sub/super-diagonal pair. Something similar holds for a complex matrix if the diagonal is real and the products of the sub/super diagonals are real and positive. On the off chance that this is the case for your matrix, it does open the possibility of using some simpler algorithms that might be faster.
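
On that off chance, here is a minimal sketch of the similarity transform (my own helper, assuming every product t_{j,j+1} t_{j+1,j} is strictly positive): the symmetric matrix S = D T D^{-1} shares T's eigenvalues, and eigenvectors of T are recovered as D^{-1} v.

using LinearAlgebra

# Symmetrize a real tridiagonal T by a diagonal similarity S = D*T*inv(D).
function symmetrize_tridiagonal(T::Tridiagonal)
    n = size(T, 1)
    d = ones(n)                                        # diagonal of D
    for j in 1:n-1
        d[j+1] = d[j] * sqrt(T[j, j+1] / T[j+1, j])
    end
    ev = [sqrt(T[j, j+1] * T[j+1, j]) for j in 1:n-1]  # symmetric off-diagonal
    return SymTridiagonal(diag(T), ev), d
end

T = Tridiagonal([2.0, 3.0, 1.0], [5.0, 4.0, 6.0, 7.0], [0.5, 2.0, 3.0])
S, d = symmetrize_tridiagonal(T)
λ = eigvals(S)       # same eigenvalues as T, via a fast symmetric solver
X = eigvecs(S) ./ d  # eigenvectors of T: divide row i by d[i]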


Sounds like a feature which should be in FastLapackInterface.jl, which is a great tool for such cases.

I can see geevx!() and syevr!(). So maybe it’s all already there?

You’re right, I wasn’t aware of that package: LAPACK · FastLapackInterface.jl
It seems to do the job with a preallocated workspace.
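
For reference, a sketch of the documented usage with EigenWs and geevx! (double-check the exact call signature and return layout against the package docs):

using LinearAlgebra, FastLapackInterface

A = rand(4, 4)

# Allocate the workspace once, outside the time loop
# (rvecs = true reserves space for right eigenvectors).
ws = EigenWs(A, rvecs = true)

# Inside the loop: reuse the workspace. geevx! overwrites its input,
# so pass a copy (or a preallocated buffer) if A must survive.
t = LAPACK.geevx!(ws, 'N', 'N', 'V', 'N', copy(A))
# t is a tuple whose entries include the real and imaginary parts of the
# eigenvalues and the right eigenvectors; see the docs for the layout.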

Hi @stevengj, thanks for your reply! For each time step, I need to solve an ODE. I derived the analytical solution, which needs the eigenvalues and eigenvectors… (mostly the exp operation on the matrix…)

Hi @Oscar_Smith, thanks for your idea! Yes, I have also been thinking about this, but I didn’t find a solution after googling… mostly people talk about symmetric matrices…

Hi @mstewart, thanks for your points! It does make sense; however, the bar just means the average between one item and another… nvm. But the final solution does include complex numbers in the eigenvalues and eigenvectors, so the final solution of the ODE always takes the real part…

You should just use exp(A * Δt) for matrix exponentials, not eigenvectors, despite what you may have learned in school.

As explained in the famous paper “Nineteen Dubious Ways to Compute the Exponential of a Matrix”, using eigenvectors is a bad way to exponentiate non-Hermitian/non-normal matrices, because it can be very inaccurate if the matrix happens to be close to defective.
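
A small illustration (my own toy example, not from the paper): for a nearly defective matrix the eigenvector basis is ill-conditioned, and the eigendecomposition route loses about eight digits while exp does not:

using LinearAlgebra

# Nearly defective: eigenvalues 1 and 1 + 1e-8, nearly parallel eigenvectors.
A = [1.0  1.0
     0.0  1.0 + 1e-8]

Λ, V = eigen(A)
E_eig = V * Diagonal(exp.(Λ)) / V   # exponential via eigendecomposition
E_exp = exp(A)                      # scaling-and-squaring algorithm

norm(E_eig - E_exp) / norm(E_exp)   # ≈ 1e-8: about eight digits lost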

That being said, if you are solving a system of ODEs, then there are probably much better methods available. See DifferentialEquations.jl.


Meanwhile, I am also curious to see a better solution for solving the ODE in the above post… I wrote out the analytical solution and hard-coded it. However, most of the time is spent in the eigenvalue and eigenvector calculation (a simple @profview shows this…), so I am wondering: is there any other way to avoid the eigenvector calculation? I tried the numerical packages suggested by ChatGPT; however, they are slower than the analytical one…

It looks like you are trying to write down the solution for an ODE for just one “timestep”, i.e. for just \Delta t. You should take a step backwards and write down the equations you are actually solving for all t and send those equations to an ODE solver. (A good algorithm will probably not take fixed timestep sizes.)

But it’s hard to give you specific advice without more context. (This is a classic XY problem.)

Yes, that’s true. However, this ODE interacts with a complex model system which consists of empirical and other analytical equations… it’s a huge model, and I don’t think there is a chance to put the equations for the whole time span into an ODE solver, because:

  1. the states simulated by this ODE affect, and are affected by, the states from other sub-models (that’s what I meant by “interacted”);
  2. currently the big model is written for this single time step, so there is a for-loop wrapper outside, and people can run it by specifying the range of time steps…

So you just need to exponentiate the matrix? In that case, StaticArrays.jl is definitely your friend.

julia> using LinearAlgebra, StaticArrays, BenchmarkTools

julia> let A = rand(4, 4)
           As = SMatrix{4, 4}(A)
           @btime exp($A)
           @btime exp($As)
       end;
  1.499 μs (18 allocations: 1.58 KiB)
  183.229 ns (0 allocations: 0 bytes)

This is way faster than eigensolving.

Also, if what you really want is to calculate exp(t * A) * v, then ExponentialUtilities.jl has tools for doing that even more efficiently than calculating the full matrix exponential:

julia> using ExponentialUtilities, StaticArrays, BenchmarkTools

julia> let A = rand(4, 4)
           As = SMatrix{4, 4}(A)
           
           v = rand(4)
           vs = SVector{4}(v)
           
           t = 1e-3

           # calculate exp(t * A) * v
           @btime expv($t, $A, $v)     
           @btime expv($t, $As, $vs)
       end;
  1.904 μs (27 allocations: 2.22 KiB)
  146.822 ns (0 allocations: 0 bytes)


It’s not clear to me what a lot of this means, and I think the actual problem being solved is still pretty underspecified, but I’ll just mention that DifferentialEquations.jl does support building a differential equation and then remake-ing it with a different time-span.

E.g. you could do something like

using OrdinaryDiffEq, StaticArrays

function f(u, p, t)
    (; a, b, c) = p # p stores parameters a, b, and c
    # build up some potentially time-dependent matrix
    A = SA[a*sin(b*t)   b+1
           b-1          c*cos(b*t)]
    A * u
end

# initial condition
u0 = SA[1.0
        0.0]

p = (;a=1.0, b=2.0, c=3.0)

# the timespan we solve over
tspan = (0.0, 1.0)

# construct an ODE problem
prob = ODEProblem(f, u0, tspan, p)

and then solve it:

julia> sol = solve(prob, Tsit5())
retcode: Success
Interpolation: specialized 4th order "free" interpolation
t: 12-element Vector{Float64}:
 0.0
 0.0009990009990009992
 0.010989010989010992
 0.04404476553655739
 0.10104982246948335
 0.17604841169104612
 0.27278951309861316
 0.3924925450833525
 0.5393213697913134
 0.7202827501778557
 0.9573407118994137
 1.0
u: 12-element Vector{SVector{2, Float64}}:
 [1.0, 0.0]
 [1.000002496505656, 0.0010005003304639334]
 [1.0003039254717578, 0.011173253868269296]
 [1.0049882162890587, 0.04715623487012128]
 [1.0273635343800853, 0.11882725033265978]
 [1.088385494639565, 0.23564608045793892]
 [1.2325610793753434, 0.4328632171924387]
 [1.546107921614314, 0.7620787066366294]
 [2.2130677873858833, 1.2954463943225067]
 [3.6169890726518927, 2.0777267620889894]
 [6.617024114295522, 3.043862542508724]
 [7.290497064992269, 3.189883166408041]

You can then remake this problem to create a new problem that starts where that previous one left off and solve it:

julia> prob2 = remake(prob; 
                       tspan=(1.0, 1.5),
                       u0 = sol[end])
ODEProblem with uType SVector{2, Float64} and tType Float64. In-place: false
timespan: (1.0, 1.5)
u0: 2-element SVector{2, Float64} with indices SOneTo(2):
 7.290497064992269
 3.189883166408041

julia> sol2 = solve(prob2, Tsit5())
retcode: Success
Interpolation: specialized 4th order "free" interpolation
t: 5-element Vector{Float64}:
 1.0
 1.0858050494318707
 1.2208993653350446
 1.3703727135505457
 1.5
u: 5-element Vector{SVector{2, Float64}}:
 [7.290497064992269, 3.189883166408041]
 [8.742818021206942, 3.454946787443764]
 [11.206831027521389, 3.811693893096692]
 [13.96922356812029, 4.1722693109505045]
 [16.172373463833523, 4.507967667146217]