Parallell TensorKit

Nellatur · April 13, 2023, 12:58am

Based on Please read: make it easier to help you - #2

Avoid extending the original question that has already been answered with many new questions. It is better to start a new thread.

Let me start a new thread than from TensorKit efficiency

For the following code


using TensorKit
using BenchmarkTools 

na = nb = nc = nd = 12
A = rand(na, nb)
B = rand(nb, nc, nd)
C = zeros(na, nc, nd)


function tensorkit_contraction(A,B,C)
    @tensor  C[a,c,d] := A[a,b] * B[b,c,d]
end 

@btime tensorkit_contraction(A,B,C)

name as 1.jl. I tried to run julia 1.jl -t 1 and julia 1.jl -t 2 (julia 1.8.5)
I got
3.900 μs (1 allocation: 13.62 KiB) and 3.837 μs (1 allocation: 13.62 KiB)
Looks similar. Is there any simple way to parallel tensorkit?

carstenbauer · April 13, 2023, 3:48am

Don’t know about TensorKit.jl but Tullio.jl typically has good parallel performance. Maybe you want to try it out.

jmair · April 13, 2023, 8:26am

I think your arguments here should be swapped around. It should be julia -t 1 1.jl instead. You can check by printing out Threads.nthreads().

Also, since these are linear algebra operations, it will likely use BLAS threads for a lot of the parallelism, which are separate from Julia threads. This can be set with

using LinearAlgebra;
BLAS.set_num_threads(1)

for example.

It’s hard to know which type of parallelism a library will use without looking at the docs/source. But you can experiment to find out by setting Julia threads and BLAS threads separately.

Nellatur · April 20, 2023, 6:36am

Thanks. I tried

using TensorKit
using Tullio
using BenchmarkTools

na = nb = nc = nd = 12
A = rand(na, nb)
B = rand(nb, nc, nd)
C = zeros(na, nc, nd)



function tensorkit_contraction(A,B,C)
    @tullio   C[a,c,d] := A[a,b] * B[b,c,d]
end

@btime tensorkit_contraction(A,B,C)

but did not get speed up by julia -t 1 1.jl or julia -t 2 1.jl

carstenbauer · April 20, 2023, 6:41am

Your problem size seems too small to gain anything from multithreading. Try for larger na, nb, nc, nd.

Nellatur · April 20, 2023, 6:48am

Thanks! na = nb = nc = nd = 120
julia -t1 1.jl 171 ms
julia -t 2 1.jl 85 ms

Topic		Replies	Views
TensorKit efficiency Numerics tensors	11	1316	April 12, 2023
Parallelization efficiency for @tensor @tullio Numerics	5	204	June 30, 2023
Improving performance of tensor contractions Performance question , blas , tensors	7	1646	June 14, 2019
Tensor contractions very inefficient in julia Performance tensors	13	2535	December 18, 2020
How do I implement the fastest tensor contractions Performance question , tensors , tensor-contraction , tensoroperations	11	853	April 18, 2024

Parallell TensorKit

Related topics