Julia vs NumPy broadcasting

Do we need batched matrix multiplication? At least if @mikmoore's version is correct, the loop is doing a repeated matrix-vector multiplication x[a, j] = V[a,b] * yj[b] in a loop over j, whose batched version is a single matrix-matrix multiplication, x[a, j] = V[a,b] * ys[b, j]. For me that’s 20x quicker:

using LinearAlgebra

function step_reference3(A::Matrix, B::Vector, F::Real, n::Int)  # my adaptation of @mikmoore's version
    λ, V = eigen(A)
    BF_transformed = V \ (B * F)  # 4-element Vector{Float64}
    ys = expm1.(λ .* (1:n)') ./ λ .* BF_transformed  # ys[b, j] = expm1(λ[b] * j) / λ[b] * BF_transformed[b]
    x = V * ys  # x[a, j] = V[a,b] * ys[b, j]
end
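For comparison, the per-step loop being replaced looks roughly like this (a sketch only, not @mikmoore's exact code, and the function name is mine): every iteration builds a fresh small yj and does one matrix-vector product, so every step allocates.

function step_loop_sketch(A::Matrix, B::Vector, F::Real, n::Int)  # hypothetical sketch of the non-batched loop
    λ, V = eigen(A)
    BF_transformed = V \ (B * F)
    x = similar(BF_transformed, length(B), n)
    for j in 1:n
        yj = expm1.(λ .* j) ./ λ .* BF_transformed  # small vector allocated each iteration
        x[:, j] = V * yj  # matrix-vector product, another small allocation
    end
    x
end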

@btime step_reference($A, $B, $F, $n)  #  2.602 ms (58011 allocations: 3.29 MiB)
@btime step_reference2($A, $B, $F, $n)  # 3.401 ms (45033 allocations: 2.22 MiB)
@btime step_reference3($A, $B, $F, $n)  # 155.708 μs (41 allocations: 326.47 KiB)
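(The inputs A, B, F and n here come from earlier in the thread; to rerun the comparison yourself, a stand-in setup along these lines is enough. The values below are placeholders, not the original system.)

using LinearAlgebra, BenchmarkTools
A = randn(4, 4)  # placeholder 4×4 system matrix
B = randn(4)     # placeholder input vector
F = 1.0          # placeholder scalar
n = 10_000       # placeholder number of steps
@btime step_reference3($A, $B, $F, $n);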

While loops are quick in Julia, loops that allocate small vectors on every iteration are expensive. (The above SMatrix version didn’t run for me, on a quick attempt.)
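To make the allocation point concrete, here is a hypothetical micro-benchmark (names and sizes are mine, not from the thread): taking the matrix-vector products one column at a time allocates two small vectors per iteration, whereas the single matrix-matrix product allocates once for the whole result.

using LinearAlgebra, BenchmarkTools
V = randn(4, 4); Y = randn(4, 10_000)
percol(V, Y) = [V * Y[:, j] for j in axes(Y, 2)]  # the slice and the product each allocate a small vector per column
batched(V, Y) = V * Y  # one allocation for the whole result
@btime percol($V, $Y);
@btime batched($V, $Y);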
