Dubious code: @spawn psort!(v, lo, mid)

Sijun · December 2, 2019, 11:40am

Here is the code snippet taken from Announcing composable multi-threaded parallelism in Julia


import Base.Threads.@spawn

# sort the elements of `v` in place, from indices `lo` to `hi` inclusive
function psort!(v, lo::Int=1, hi::Int=length(v))

   # omitted above
    mid = (lo+hi)>>>1                 # find the midpoint

    half = @spawn psort!(v, lo, mid)  # task to sort the lower half; will run
    psort!(v, mid+1, hi)              # in parallel with the current call sorting
                                      # the upper half
    wait(half)                        # wait for the lower half to finish
    temp = v[lo:mid]

What’s dubious is this:
half = @spawn psort!(v, lo, mid)

When psort!(v) is spawned, the entire array that v points to must be copied to the remote process. so it is necessary to fetch(half) and copy the result to the local v[lo:mid].

Actually the above program never ends and keep spinning. More strangely, even if I put v[lo:mid] .= fetch(half), it never ends.

I think the following simple example confirms my understanding; data must be fetched from @spawn.

using Base.Threads, Distributed

arr = zeros(10)
@everywhere function update!(arr::AbstractArray)
    for i = 1:length(arr)
        arr[i] = i
    end
end

r = @spawn update!(arr)
wait(r)
println(arr)

the result of which is all-zero array without fetch.

What mistake am I making? or the example is really flawed?

kristoffer.carlsson · December 2, 2019, 11:56am

Threads.@spawn doesn’t use multiple processes (distributed memory), it uses shared memory and there is no need to copy any data between tasks spawned in the same process.

Sijun · December 2, 2019, 12:03pm

Ah, I see. Indeed there are two @spawn: Threads.@spawn and Distributed.@spawn. The latter is deprecated. Thank you for the clarification (It’s till strange why the program never terminated on my PC)

Topic		Replies	Views
Why do use @spawnat and fetch? Julia at Scale	3	2551	May 9, 2019
Assignment not effective in `@spawn` in function General Usage multithreading	4	379	December 26, 2020
How to synchronize and share data correctly between processes in a loop? General Usage question	12	353	February 15, 2023
Spawn-fetch usage General Usage parallel	16	3565	December 7, 2017
How to Maximize CPU Utilization - @spawn Assigning to Busy Workers - Use pmap Instead Julia at Scale parallel , distributed	17	3027	November 17, 2021

Dubious code: @spawn psort!(v, lo, mid)

Related topics