Asymmetric speed of in-place `sparse*dense` matrix product

carstenbauer · November 7, 2018, 4:03pm

For completeness, one can still fix it locally by defining

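using SparseArrays
import LinearAlgebra.mul!

# C = X * A for dense X and CSC-sparse A, computed in place.
function mul!(C::StridedMatrix, X::StridedMatrix, A::SparseMatrixCSC)
    mX, nX = size(X)
    nX == A.m || throw(DimensionMismatch())
    size(C) == (mX, A.n) || throw(DimensionMismatch())  # the @inbounds loop below relies on this
    fill!(C, zero(eltype(C)))
    rowval = A.rowval
    nzval = A.nzval
    # For every row of X and column of A, sum over the stored entries of that column.
    @inbounds for multivec_row=1:mX, col = 1:A.n, k=A.colptr[col]:(A.colptr[col+1]-1)
        C[multivec_row, col] += X[multivec_row, rowval[k]] * nzval[k]
    end
    C
end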

Benchmark with this definition loaded:

julia> @btime $C = $A*$B;
  19.478 μs (2 allocations: 78.20 KiB)

julia> @btime $C = $B*$A;
  22.261 μs (2 allocations: 78.20 KiB)

julia> @btime mul!($C,$A,$B);
  16.077 μs (0 allocations: 0 bytes)

julia> @btime mul!($C,$B,$A);
  18.241 μs (0 allocations: 0 bytes)
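
As a sanity check for a kernel like this, here is a minimal sketch that writes the same dense-times-sparse product under a separate name and compares it against the dense result. The function name `dense_times_sparse!` is hypothetical (it is not from this thread or from the standard library), and the sketch uses the exported accessors `rowvals`, `nonzeros`, and `nzrange` instead of the struct fields. It also moves the row loop innermost so that `C` and `X` are traversed down columns; that is a different loop order from the definition above and is not benchmarked here.

```julia
using SparseArrays, LinearAlgebra

# Hypothetical helper: C = X * A for dense X and CSC-sparse A, in place.
function dense_times_sparse!(C::StridedMatrix, X::StridedMatrix, A::SparseMatrixCSC)
    mX, nX = size(X)
    nX == size(A, 1) || throw(DimensionMismatch("size(X, 2) must equal size(A, 1)"))
    size(C) == (mX, size(A, 2)) || throw(DimensionMismatch("C has the wrong size"))
    fill!(C, zero(eltype(C)))
    rv = rowvals(A)
    nz = nonzeros(A)
    # Outer loops walk the stored entries of A column by column;
    # the inner loop runs down one column of C and one column of X.
    @inbounds for col in 1:size(A, 2), k in nzrange(A, col)
        r = rv[k]
        v = nz[k]
        for i in 1:mX
            C[i, col] += X[i, r] * v
        end
    end
    return C
end

# Compare against the dense product on small random inputs.
X = rand(4, 6)
A = sprand(6, 5, 0.3)
C = zeros(4, 5)
dense_times_sparse!(C, X, A)
@assert C ≈ X * Matrix(A)
```

Using a separate function name avoids overriding a `LinearAlgebra.mul!` method for types the package does not own, at the cost of not being picked up automatically by `X * A`.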
