Solving A\b in parallel

wsshin · March 27, 2018, 5:14pm

I have an access to a large shared-memory machine. It has memory in the order of TB, and has 64 CPU cores.

If I want to perform A\b for a square matrix A and column vector b on this shared-memory machine using multiple CPU cores, what is the workflow? A is stored in Julia’s sparse matrix format.

stabbles · March 27, 2018, 5:30pm

Did A\b not work? What kind of matrix is A? How sparse is it? Is it symmetric positive definite? Non-symmetric?

wsshin · March 27, 2018, 5:37pm

A\b works, but how do we make sure it runs on N CPU cores?

ChrisRackauckas · March 27, 2018, 5:51pm

Check htop. You’ll see that it’s already multithreaded.

wsshin · March 27, 2018, 6:07pm

So you mean I don’t need to do something like julia -np N scriptname.jl to make A\b in scriptname.jl use N CPU cores?

ChrisRackauckas · March 27, 2018, 6:08pm

In fact, don’t do that. Adding processes sets the number of BLAS threads to 1, which is what you don’t want.

kristoffer.carlsson · March 27, 2018, 6:46pm

You can also try out https://github.com/JuliaSparse/Pardiso.jl/. I’ve gotten significantly higher performance with it than with the solvers coming with julia on high core machines.

pasha · March 27, 2018, 8:01pm

To clarify what is only really implied here, and please correct me if I’m wrong: linear algebra ops are automatically parallel if you have BLAS set up right (run versioninfo() and look for libopenblas). BLAS operations use a different kind of parallel computing than Julia native parallel as described in the parallel doc

kristoffer.carlsson · March 27, 2018, 8:17pm

Solving a sparse linear system will not use BLAS directly, although it is likely used by the sparse solver.

wsshin · March 28, 2018, 1:50am

@kristoffer.carlsson, I am testing Pardiso.jl, but the performance enhancement with the number of CPU cores is a bit disappointing. The same problem is solved in about 40 seconds with 2 cores, and about 30 seconds with 16 cores. Is this typical, or you think something is wrong?

ChrisRackauckas · March 28, 2018, 2:19am

How big is the matrix?

wsshin · March 28, 2018, 2:32am

It is million by million.

Also, do you have an example with set_solver!(ps, ITERATIVE_SOLVER)? I am solving a system with slowly evolving matrix, so I would like to perform factorization only once by using ITERATIVE_SOLVER, but not sure where to call this. Don’t see a good example in the PARDISO documentation, either.

RaulDurand · July 22, 2021, 1:01am

So, let’s suppose I am using multiprocessing and solving sparse matrix systems. Do I have to set manually the number of BLAS threads for each process?

RoyiAvital · August 21, 2021, 10:25am

Did you find a way?
I also want to use Pardiso.jl in the same manner. Factorize once, use many times (For different RHS).

I thought iterative mode means the way the system is solved not use it many times.

Topic		Replies	Views
High-dimensional A\b when running julia -p N Performance	7	767	October 21, 2020
Solving Sparse Linear Systems fast Performance sparse , linearsolve	11	5078	June 23, 2022
Is it possible to parallelize matrix division Performance	10	672	October 1, 2023
Is the linear system solver \ also multi threaded in Julia as in Matlab? And how to “multithread” it in Julia? Performance linearalgebra , sparse	33	2453	August 29, 2024
Is there an easy way to parallelise matrix multiplication? Performance	8	4722	April 19, 2019

Solving A\b in parallel

Related topics