Looking for code to solve (surely common) 'embarrassing parallelism' multithreading use-case

pbayer · January 18, 2021, 3:18pm

Yes, I see you are right!

using Base.Threads, BenchmarkTools

# a silly function that takes some time and returns the id of the thread it ran on
f(n) = (sum((i for i in 1:n) .^ 2); threadid())

# draw one "*" per finished task in the slot of the thread that ran it
function show_load(threads)
    res = fill("", nthreads())
    foreach(i -> res[i] *= "*", threads)
    res
end

Then:

julia> @btime f(2_000)
  1.174 μs (2 allocations: 31.50 KiB)
1

julia> fetch.(map((_->Threads.@spawn f(2000)), 1:nthreads()))
8-element Array{Int64,1}:
 3
 2
 4
 5
 6
 7
 8
 1

If every task gets the same load, all threads are employed. Even with an unbalanced load the distribution is quite good, as long as there are many more tasks than threads (M >> N, where M is the number of tasks and N is nthreads()):

julia> show_load(fetch.(map(_->(Threads.@spawn f(rand(1:2000))), 1:500)))
8-element Array{String,1}:
 "****************************************************"
 "****************************************************************************************"
 "**************************************************************************"
 "*********************************"
 "********************************************************"
 "***********************************************************"
 "***********************************************************************"
 "*******************************************************************"
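If you prefer numbers to star bars, the same experiment can be tallied per thread id. This is only a minimal sketch: `work` and `load_counts` are hypothetical names introduced here, `work` uses `sum(abs2, 1:n)` instead of the broadcast in `f`, and a `Dict` is used so that the code makes no assumption about how thread ids relate to `nthreads()`.

```julia
using Base.Threads

# Toy workload mirroring f above: some arithmetic, then the id of the executing thread.
work(n) = (sum(abs2, 1:n); threadid())

# Hypothetical helper: map thread id => number of tasks that finished there.
function load_counts(ids)
    counts = Dict{Int,Int}()
    for id in ids
        counts[id] = get(counts, id, 0) + 1
    end
    return counts
end

# 500 tasks with random sizes, as in the show_load call above.
tasks = [Threads.@spawn work(rand(1:2000)) for _ in 1:500]
counts = load_counts(fetch.(tasks))

for id in sort!(collect(keys(counts)))
    println("thread ", id, ": ", counts[id], " tasks")
end
```

The exact counts vary from run to run and depend on how many threads Julia was started with.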

At first I had a different impression, because in my applications the tasks usually start by reading from a channel. In that case the load is very imbalanced (simulated here with a yield() before the work):

g(n) = (yield(); f(n))

julia> show_load(fetch.(map(_->(Threads.@spawn g(rand(1:2000))), 1:500)))
8-element Array{String,1}:
 "*"
 "*************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************"
 "*"
 "*"
 "*"
 "*"
 "*"
 "*"
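To check this with a real channel read instead of `yield()`, something like the following sketch could be used. It is an assumption about what such an application looks like, not my actual code: `work` and `run_from_channel` are hypothetical names, and every task simply blocks on `take!` before doing its work. Scheduler behaviour has changed between Julia releases, so the distribution you see may differ from the one above.

```julia
using Base.Threads

# Toy workload mirroring f above: some arithmetic, then the id of the executing thread.
work(n) = (sum(abs2, 1:n); threadid())

# Hypothetical driver: every task blocks on take! before it does any work.
function run_from_channel(ntasks)
    jobs = Channel{Int}(ntasks)   # buffered, so put! does not block here
    tasks = [Threads.@spawn work(take!(jobs)) for _ in 1:ntasks]
    foreach(_ -> put!(jobs, rand(1:2000)), 1:ntasks)
    ids = fetch.(tasks)
    close(jobs)
    return ids
end

ids = run_from_channel(500)

# Tally finished tasks per thread id.
counts = Dict{Int,Int}()
for id in ids
    counts[id] = get(counts, id, 0) + 1
end

for id in sort!(collect(keys(counts)))
    println("thread ", id, ": ", counts[id], " tasks")
end
```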
