Efficient multiplication of an out-of-memory sparse matrix and an in-memory dense vector

Say I have a really huge sparse matrix A and an equally huge dense vector v, and I need to compute A * v. The structure of A is simple, like a -1 -1 4 -1 -1 stencil in 2D for example (in practice it’s a bit more complicated), so I want to compute A * v without storing A itself in memory (which would be a real waste), only v; the effect of A on a given part of v would be computed on the fly. I also need to use all my CPU cores as efficiently as possible, make use of SIMD, keep cache locality, and minimize memory fetches. Is there a Julia package that can help with this kind of problem?
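To make that concrete, here is roughly the kind of on-the-fly application I mean for the 5-point case (a bare serial sketch; `apply_A!` and the 2D grid layout are just for illustration, my real operator is messier):

```julia
# v is stored as an n×n grid; y = A*v is computed point by point,
# so A itself is never built. Interior points only; boundaries are
# handled separately in my real code.
function apply_A!(y::Matrix, v::Matrix)
    n = size(v, 1)
    @inbounds for j in 2:n-1, i in 2:n-1
        y[i, j] = 4v[i, j] - v[i-1, j] - v[i+1, j] - v[i, j-1] - v[i, j+1]
    end
    return y
end
```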

If I understand your problem correctly, you might be better off using imfilter from the JuliaImages stack, which will avoid allocating a matrix A.
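For the 5-point stencil you describe it would look roughly like this (the array sizes and variable names here are just placeholders):

```julia
using ImageFiltering

# the -1 -1 4 -1 -1 stencil as a small centered kernel;
# imfilter applies it directly to the grid, so A is never allocated
kern = centered([ 0 -1  0;
                 -1  4 -1;
                  0 -1  0])

v = rand(1000, 1000)   # the "vector", kept in its natural 2D layout
w = imfilter(v, kern)  # cache-friendly stencil application
```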

What about a matrix-free-style function, if you know how to evaluate it?

What about a matrix-free-style function, if you know how to evaluate it?

Ah, you mean something like LinearMaps.jl, yes? That does seem like the perfect solution. However, I guess I would need to take care of the parallelization myself when defining the map function. I was hoping to find a specific solution that has already been optimised for me, but I guess that is too much to ask, since the optimization tricks probably depend on A.
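For the record, something along these lines is what I’d end up writing (only a sketch: `threaded_mul!` is my own name, and the parallelization is just a naive `Threads.@threads` over columns):

```julia
using LinearMaps

const n = 4096                # grid is n×n, so the vector has n^2 entries

# matrix-free y = A*x for the 5-point stencil, threaded over columns
function threaded_mul!(y::AbstractVector, x::AbstractVector)
    X = reshape(x, n, n)
    Y = reshape(y, n, n)
    fill!(Y, 0)
    Threads.@threads for j in 2:n-1
        @inbounds for i in 2:n-1
            Y[i, j] = 4X[i, j] - X[i-1, j] - X[i+1, j] - X[i, j-1] - X[i, j+1]
        end
    end
    return y
end

A = LinearMap(threaded_mul!, n^2; ismutating = true, issymmetric = true)
v = rand(n^2)
w = A * v   # calls threaded_mul! under the hood; A is never stored
```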

If I understand your problem correctly, you might be better off using imfilter from the JuliaImages stack, which will avoid allocating a matrix A.

This might be closer to what I was looking for, yes! I’ll look into it.

Thanks to both

Well, you have DiffEqOperators for stencils if you don’t want to bother writing the for loops over each piece of your distributed vector, and you can use DistributedArrays for managing the vector itself.
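Roughly like this for the 1D case (a sketch using DiffEqOperators’ CenteredDifference and Dirichlet0BC; check the docs for the exact multi-dimensional composition):

```julia
using DiffEqOperators

N  = 1_000
dx = 1.0

# lazy, matrix-free 2nd-derivative stencil (2nd-order accurate) plus
# zero-Dirichlet boundary handling; nothing N×N is ever materialized
D2 = CenteredDifference(2, 2, dx, N)
bc = Dirichlet0BC(Float64)

u  = rand(N)
du = D2 * bc * u   # applies the stencil on the fly
```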

Ah, yes! Very nice. Unfortunately my “stencil” is an arbitrary, complicated “hopping” matrix on the discrete lattice. I’m not sure one can add completely arbitrary stencil coefficients with DiffEqOperators. I’ll look into it.

DiffEqOperators handles this case now. It composes operators of different dimensions and does a cache-optimized convolution call. We’ll get the docs updated.


Excellent! Looking forward to the docs, thanks!