Fastest way to fill a Sparse Matrix?

Alejandro_Quiaro_San · April 21, 2022, 5:07pm

Hello guys, I am wondering which should be the right strategy to fill values in a big sparse matrix.

I have a variable L, which is an n x m matrix, and is sparse. An outer loop runs on the row index, and just a few values of the row are non zero. The non zero values are contained in a vector d, which changes size in every iterations. The pseudo code would be something like this:

L=zeros(n,m)
for i in 1:m
    for j in 1:n
         L[i,j]=d[j]
    end
end

Only 5% of the values are different from zero. If I initialize the matrix L as a Sparse Array (using spzeros), the code becomes slow (I found in another post that the best way to operate with sparse arrays is by just using the Is, Js, ans Vs vectors and build the matrix using the function sparse).

If a try to build a vector, only with the values of d (to then build the matrix L using the function sparse), since the size of d is unknown in each iteration, I have to initialize d as an empty vector that changes size in each loop, making it run slow once again.

For this particular case I can’t find a way to take advantage of the sparse arrays to make the code more efficient.

Any thoughts? Thanks

goerch · April 21, 2022, 5:56pm

This is how I understand it. I tried to translate your problem to use sparse

using BenchmarkTools
using SparseArrays

function test(ds)
    is = Int[]
    js = Int[]
    vs = Float64[]
    for j = 1:size(ds, 2)
        d = ds[:, j]
        dis, dvs = findnz(d)
        for (i, v) in zip(dis, dvs)
            push!(is, i)
            push!(js, j)
            push!(vs, v)
        end
    end
    L = sparse(is, js, vs, size(ds, 1), size(ds, 2))
    @assert L == ds
    L
end

@btime test(ds) setup=(ds=sprand(10000, 10000, 0.05))

Does this help?

Alejandro_Quiaro_San · April 21, 2022, 9:20pm

Yes, It helped a lot! Thank you so much! the key was in the function:

push!

I was using

vcat & hcat

Which seems to run much more slowly. Thank again for your support.

goerch · April 21, 2022, 9:29pm

Yep, cat is used to concatenate Arrays and does more work. But be careful with the order of loops and accesses to your sparse matrix, because it is in (C)ompressed(S)parse(C)olumn format.

Topic		Replies	Views
Sparse matrix 700x slower than full New to Julia sparse	7	1069	February 24, 2021
Efficient Initialization of huge sparse arrays New to Julia	6	3165	September 15, 2019
How to speed up creating a sparse matrix? Performance	2	536	May 5, 2020
Efficient way for assigning a massive array New to Julia array	6	687	January 27, 2020
Huge sparse array construction General Usage sparse	9	869	April 12, 2020

Fastest way to fill a Sparse Matrix?

Related topics