Most efficient way of computing B'* inv(A)* B for A large

PetrKryslUCSD · September 30, 2020, 12:02am

Is this model reduction?

Joris_Pinkse · September 30, 2020, 12:04am

For the sake of completeness, `B' * inv(A) * B` does better than I’d thought: 0.45 seconds on the second run.

PetrKryslUCSD · September 30, 2020, 12:08am

Can Tullio incorporate symmetry?

Joris_Pinkse · September 30, 2020, 12:10am

@PetrKryslUCSD I’m using it for an optimization problem in many variables. I can simplify it by writing it as a nested problem whose inner nest is a set of high-dimensional problems. The Hessian of the outer problem then becomes a sum over terms of the form C - B'A^{-1}B, where the A’s are the Hessians of the inner problems.
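
To spell out where a term of this form can come from, here is a sketch under assumed notation (the function $f$ and the variables $x$, $y$ are illustrative and not from the post). Suppose the inner problem minimizes $f(x, y)$ over $y$ for fixed $x$, with solution $y^*(x)$, and the outer objective is $g(x) = f(x, y^*(x))$. Differentiating the inner first-order condition gives the sensitivity of $y^*$, and the envelope theorem gives the outer gradient:

$$
\nabla_y f(x, y^*(x)) = 0
\;\Longrightarrow\;
f_{yx} + f_{yy}\, \frac{\partial y^*}{\partial x} = 0
\;\Longrightarrow\;
\frac{\partial y^*}{\partial x} = -f_{yy}^{-1} f_{yx},
$$

$$
\nabla g = f_x,
\qquad
\nabla^2 g = f_{xx} + f_{xy}\, \frac{\partial y^*}{\partial x}
= f_{xx} - f_{xy}\, f_{yy}^{-1} f_{yx}.
$$

With $C = f_{xx}$, $B = f_{yx}$ and $A = f_{yy}$ (the inner Hessian), this is $C - B^T A^{-1} B$.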

Joris_Pinkse · September 30, 2020, 12:11am

Good question; I don’t know. Doesn’t look like it.

dpo · September 30, 2020, 2:14am

It looks like you may be trying to solve a system with

\begin{bmatrix} A & B \\ B^T & -C \end{bmatrix}.

If C is also positive definite, you can factorize the above matrix directly while taking advantage of its structure using https://github.com/JuliaSmoothOptimizers/LDLFactorizations.jl.
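
For reference, the block matrix above factors as follows whenever $A$ is invertible (a standard block $LDL^T$ identity, written out here as a sketch):

$$
\begin{bmatrix} A & B \\ B^T & -C \end{bmatrix}
=
\begin{bmatrix} I & 0 \\ B^T A^{-1} & I \end{bmatrix}
\begin{bmatrix} A & 0 \\ 0 & -(C + B^T A^{-1} B) \end{bmatrix}
\begin{bmatrix} I & A^{-1} B \\ 0 & I \end{bmatrix}.
$$

The lower-right block of the middle factor is $-(C + B^T A^{-1} B)$, so the same product $B^T A^{-1} B$ appears, though with a plus sign rather than the minus sign in $C - B^T A^{-1} B$.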

Joris_Pinkse · September 30, 2020, 2:14pm

Not quite. But yes, partitioned inverses are related.

baggepinnen · September 30, 2020, 2:48pm

If you’re looking for the in-place version of the backsolve, it’s `ldiv!()`.
More generally, the syntax to write into an existing matrix D is `D .= ...`.
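
A minimal sketch of both forms, using a small made-up SPD matrix (the values of `A` and `B` are illustrative assumptions, not from the thread):

```julia
using LinearAlgebra

A = [4.0 1.0; 1.0 3.0]      # small SPD matrix (illustrative values)
B = [1.0 2.0; 3.0 4.0]
D = similar(B)              # preallocated output buffer

F = cholesky(A)
ldiv!(D, F, B)              # writes A \ B into D without allocating a new result
solved = copy(D)            # keep a copy so the result can be inspected later

D .= 2 .* B                 # broadcast assignment: overwrites the contents of D in place
```

Note that `D .= F \ B` also fills `D` in place, but the right-hand side `F \ B` is still computed into a temporary first; `ldiv!` avoids that temporary.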

Joris_Pinkse · September 30, 2020, 2:49pm

Ha, I knew about ldiv, but thought .= was only for assigning scalars, thanks!

Joris_Pinkse · September 30, 2020, 3:42pm

Tried another run between some major contenders, with B of size 3_000 by 10. The difference between the first four contenders is small. I’m surprised at the gain from using MKL (my computer has an AMD processor).

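```julia
using LinearAlgebra, Tullio

# A (SPD), B, and the preallocated outputs C and D are globals.
# cholesky(A) gives A = U'U, so B' * inv(A) * B = D' * D with D = U' \ B.

function one()
    D .= cholesky(A).U' \ B
    @tullio C[i,j] = D[k,i] * D[k,j]
end

function two()
    ldiv!(D, cholesky(A).U', B)
    @tullio C[i,j] = D[k,i] * D[k,j]
end

function three()
    D .= cholesky(A).U' \ B
    C .= D' * D
end

function four()
    ldiv!(D, cholesky(A).U', B)
    C .= D' * D
end

function five()
    C .= B' * inv(cholesky(A)) * B
end
```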

With OpenBLAS (one line per function, one() through five()):

61.508 ms (40 allocations: 68.89 MiB)
61.744 ms (36 allocations: 68.67 MiB)
61.966 ms (15 allocations: 68.89 MiB)
61.603 ms (11 allocations: 68.67 MiB)
142.128 ms (15 allocations: 137.56 MiB)

With MKL:

34.641 ms (40 allocations: 68.89 MiB)
34.490 ms (36 allocations: 68.67 MiB)
34.084 ms (15 allocations: 68.89 MiB)
33.951 ms (11 allocations: 68.67 MiB)
129.186 ms (15 allocations: 137.56 MiB)
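
As a sanity check on the identity that the Cholesky-based variants rely on, the sketch below compares `D' * D` against a reference that never forms `inv(A)`. The sizes and the random test data are assumptions chosen for illustration; they are much smaller than the 3_000 by 10 case timed above.

```julia
using LinearAlgebra

n, m = 200, 5                      # small illustrative sizes
X = randn(n, n)
A = X' * X + n * I                 # random SPD matrix (assumed test data)
B = randn(n, m)

U = cholesky(Symmetric(A)).U       # A == U' * U
D = U' \ B                         # triangular solve with the lower factor U'
C_chol = D' * D                    # equals B' * inv(A) * B
C_ref  = B' * (A \ B)              # reference computed with a general solve

@assert C_chol ≈ C_ref
```

The solve has to use `U'` (equivalently the lower factor `L`): with `A = U'U`, `inv(A) = inv(U) * inv(U')`, so `B' * inv(A) * B = (U' \ B)' * (U' \ B)`.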

PetrKryslUCSD · September 30, 2020, 4:19pm

What if you do the loops by hand with LoopVectorization, taking the symmetry into account? Can you halve the fastest time?
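
A rough sketch of what such a hand-written loop could look like for the `D' * D` step, assuming real-valued matrices. The function name `gram_upper!` is hypothetical, and the sketch uses Base's `@simd` rather than LoopVectorization to stay dependency-free; it has not been benchmarked against the variants above.

```julia
# Fill C = D' * D by computing only the upper triangle and mirroring it,
# which roughly halves the multiply-adds compared with the full double loop.
# Assumes real element types (no conjugation is applied).
function gram_upper!(C::AbstractMatrix, D::AbstractMatrix)
    n, m = size(D)
    @assert size(C) == (m, m)
    @inbounds for j in 1:m
        for i in 1:j
            s = zero(eltype(C))
            @simd for k in 1:n
                s += D[k, i] * D[k, j]
            end
            C[i, j] = s
            C[j, i] = s            # mirror into the lower triangle
        end
    end
    return C
end

D = randn(50, 4)
C = zeros(4, 4)
gram_upper!(C, D)
```

With B only 10 columns wide, the Cholesky factorization of the 3_000 by 3_000 matrix A likely dominates the run time, so savings in this step alone may not show up in the totals.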
