Batched LU solves (or Factorizations) with Sparse Matrices

schev · November 9, 2023, 10:02pm

Let’s say I have a sparse set of matrices A1, A2, ... An, each of type CuSparseMatrixCSC{Float32, Int32} along with associated RHS vectors b1, b2, ... bn. Is there some watch to do a batch LU solve on the GPU with CUDA.jl?

If the matrices are dense, getrf_batched! works fine to get the LU factors. Is there some way to do batch solves/factorizations with sparse matrices @maleadt?

Oscar_Smith · November 9, 2023, 10:47pm

how big are the a matrices? you might just want to treat them as dense

schev · November 10, 2023, 12:02am

The matrices are large network matrices (e.g., 10k x 10k), and they are extremely sparse (e.g., 99.99%). So leaving them dense is not an option, sadly!

ChrisRackauckas · November 10, 2023, 5:15am

LinearSolve.jl’s SimpleGMRES has optimizations for batched solving and is compatible with GPUs. That’s likely to be the best option here, though if you can supply a good preconditioner that would be helpful.

schev · November 10, 2023, 4:57pm

Thanks @ChrisRackauckas! Big fan of LinearSolve.jl. Unfortunately, I am solving linear systems within the context of a nonlinear optimization problem (interior point method), where the associated matrices sequentially have worse and worse condition numbers as convergence is approached (10^20 even). Thus, iterative methods (GMRES et al.) are well-known to cause slow convergence or divergence, so they are not a great choice here. If there is a GPU-batched LU/QR solver in LinearSolve.jl, I would love to hear about it. If not, maybe I can help develop it in the future.

ChrisRackauckas · November 10, 2023, 9:02pm

I see. MKLPardisoFactorize should do batching I think?

amontoison · April 15, 2024, 5:37am

@schev I suggest to try CUDSS.jl (GitHub - exanauts/CUDSS.jl).

It provides an interface to the new sparse linear solvers of NVIDIA.

They don’t provide a routine for solving batched sparse linear systems but you can create one big sparse block diagonal matrix diag(A1, A2, …, An) with the right-hand side [b1; b2; …; bn].

Topic		Replies	Views
Solving Sparse Linear Systems fast Performance sparse , linearsolve	11	5050	June 23, 2022
Increasing the solution speed of sparse linear system General Usage	14	507	May 19, 2024
Sparse GPU linear solve from documentation fails Numerics	2	136	June 17, 2025
Sparse LU factorization on GPU GPU linearalgebra , factorization	12	545	November 2, 2024
Cannot solve Ax=B on GPU with A CuSparseMatrixCSC New to Julia cuda , sparse	9	1101	May 20, 2021

Batched LU solves (or Factorizations) with Sparse Matrices

Related topics