A better approach is to use a library that has an explicit notion of remote data, such as Dagger.jl. The example above could be rewritten as:
```julia
using Distributed, ClusterManagers
addprocs(40)
println(nworkers())
@everywhere using Dagger

# `summark` is an object which points to the result of `myid()` on all
# workers in the cluster
summark = Dagger.@shard myid()
map(println, summark)

@everywhere myprint(i, s) = println("$(myid()), i = $i, summark = $s")
@sync for i = 1:100
    Dagger.@spawn myprint(i, summark)
end
```
This approach is better because:
- There is no weirdness with global variables (instead there's just one local variable which points to other "global" variables)
- `Dagger.@shard` is explicitly built for this purpose, and you will always get the right value for whichever worker the code runs on
- You don't need to express your logic in a for loop; you can use whatever control flow patterns make sense for you
Note that with this example, you’re not guaranteed a perfectly even distribution of prints across the cluster, but generally Dagger will tend to balance the tasks evenly over time.
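To illustrate the "no for loop required" point, here's a sketch of the same pattern driven by a comprehension instead, with results collected via `fetch`. The `work` helper and the worker count of 4 are just for illustration; the rest uses Dagger's eager `@spawn`/`fetch` API:

```julia
using Distributed
addprocs(4)  # small pool just for this sketch
@everywhere using Dagger

# hypothetical helper: report which worker ran the task, plus a computed value
@everywhere work(i) = (myid(), i^2)

# spawn tasks from a comprehension rather than a for loop;
# each `Dagger.@spawn` returns a task handle immediately
tasks = [Dagger.@spawn work(i) for i in 1:10]

# `fetch` blocks until each task completes and returns its result
results = fetch.(tasks)
```

Each element of `results` is a `(worker id, value)` pair, and as above, Dagger decides worker placement for you, balancing tasks over time rather than guaranteeing a perfectly even split.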