LoopVectorization: Best way to have a multi and single threaded version?

jw3126 · May 15, 2023, 6:51am

Sometimes when I use LoopVectorization, I want to have both a single-threaded and a multi-threaded variant of a function available. For instance:

using LoopVectorization

function add!(out, x, y; thread)
    if thread
        add_multi_thread!(out, x, y)
    else
        add_single_thread!(out, x, y)
    end
end

function add_single_thread!(out, x, y)
    @turbo thread=false for i in eachindex(out, x, y)
        out[i] = x[i] + y[i]
    end
    out
end

function add_multi_thread!(out, x, y)
    @turbo thread=true for i in eachindex(out, x, y)
        out[i] = x[i] + y[i]
    end
    out
end

x = randn(10)
y = randn(10)
out1 = randn(10)
out2 = randn(10)
add!(out1, x, y, thread=true)
add!(out2, x, y, thread=false)
@assert out1 ≈ out2

What is a good way to reduce code duplication here? Should I use @eval? What if I also want to control the number of threads at runtime? Would I need to use @generated?
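For illustration, the @eval variant might look roughly like the sketch below. It assumes that a value interpolated with `$` is spliced in before `@turbo` expands, so the macro sees a literal `thread=true` or `thread=false`. The loop variable `threaded` is a name made up for this example.

```julia
using LoopVectorization

# Generate both variants from a single loop body.
# `threaded` is a hypothetical name; its value is interpolated into the
# expression, so @turbo receives a literal Bool when it expands.
for (fname, threaded) in ((:add_single_thread!, false), (:add_multi_thread!, true))
    @eval function $fname(out, x, y)
        @turbo thread=$threaded for i in eachindex(out, x, y)
            out[i] = x[i] + y[i]
        end
        out
    end
end
```

The add! dispatcher above can stay unchanged. This only removes the duplication for values known when the methods are defined; on its own it does not give runtime control over the number of threads.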
