I would like to experiment with parallelizing an embarrassingly parallel computation on an iterable, using the new multithreading support in Julia 1.3. I need `map` and `mapreduce`.
I would prefer to leave return-type determination to Julia, so I'd rather not preallocate.
Do I just need to `@spawn` and then `fetch`, as in

```julia
using Base.Threads: @spawn, threadid, nthreads  # `fetch` is exported from Base

@show nthreads()

function ploop(f, itr)
    map(fetch, map(i -> @spawn(f(i)), itr))
end

function pmapreduce(f, op, itr)
    mapreduce(fetch, op, map(i -> @spawn(f(i)), itr))
end

f(i) = (@show threadid(); sleep(rand()); Float64(i))

ploop(f, 1:10)
pmapreduce(f, +, 1:10)
```
(The motivation for this approach is that, in practice, `f` itself can use threads, and the whole thing hopefully composes neatly.)
A robust `pmapreduce` should include a lower threshold below which it falls back to serial execution. That threshold is related to the ratio between the cost of calling `f` and the cost of spawning a task. For example, if `f(x) = x + 1`, then spawning a task is approximately 5000 times more expensive than one call (obviously this is the worst case, but the ratio between the time for one operation and the time to spawn a task can give a good approximation).
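That fallback idea can be sketched as follows. The name `tmapreduce` and the cutoff value are illustrative choices here, not measured or standard:

```julia
using Base.Threads: @spawn

# Hypothetical threshold-based wrapper: run serially when the collection
# is too small to amortize the task-spawn overhead, otherwise spawn one
# task per element as in pmapreduce above. The default cutoff of 2^10 is
# a placeholder; in practice it should come from benchmarking f.
function tmapreduce(f, op, itr; cutoff = 2^10)
    length(itr) < cutoff && return mapreduce(f, op, itr)
    mapreduce(fetch, op, map(i -> @spawn(f(i)), itr))
end

tmapreduce(x -> x + 1, +, 1:100)              # small input: serial path
tmapreduce(x -> x + 1, +, 1:5000; cutoff=10)  # large input: spawns tasks
```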
Spawning one task per operation seems excessive, but that strategy pays off if your operation is expensive enough to overcome the task-spawning overhead (by how much? 10 times?).
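The usual way to sidestep that overhead is to spawn one task per chunk rather than per element, so each task does enough work to pay for its spawn cost. A minimal sketch, assuming `op` is associative and `itr` is nonempty and indexable (the name `chunked_mapreduce` is made up for illustration):

```julia
using Base.Threads: @spawn, nthreads

# Hypothetical chunked variant: split the input into ~ntasks contiguous
# chunks, reduce each chunk serially inside its own task, then combine
# the per-chunk results. Assumes op is associative and itr is nonempty.
function chunked_mapreduce(f, op, itr; ntasks = nthreads())
    chunks = collect(Iterators.partition(itr, cld(length(itr), ntasks)))
    tasks  = map(chunk -> @spawn(mapreduce(f, op, chunk)), chunks)
    mapreduce(fetch, op, tasks)
end

chunked_mapreduce(x -> x^2, +, 1:1000)  # sum of squares of 1:1000
```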
ThreadTools.jl is currently implemented with somewhat expensive computations in mind. There is a benchmark in the README that indicates roughly where the overhead starts becoming excessive.