I am having a problem that I think is represented well by this StackExchange discussion.
As far as I understand that discussion, one needs to be careful when using `Distributed` to balance two issues:
- Each worker needs to be able to see all of the necessary code.
- If packages need to be precompiled, one can run into a race condition if every worker tries to compile at once.
I was running into this problem with a more complicated codebase, so I put together a simple example to isolate the issues.
One solution suggested in the StackExchange post is to do something like this:

```julia
using Distributed
using DataFrames
@everywhere using DataFrames
```

The idea, as I understand it, is that the first `using` triggers precompilation serially, and the `@everywhere using` then loads the code on each worker.
I have verified that this works as expected for me.
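For completeness, the full sequence I tested looks like this (a sketch assuming workers are added with `addprocs` rather than the `-p` flag; the two should be equivalent):

```julia
using Distributed
addprocs(2)                    # same effect as starting with `julia -p 2`
using DataFrames               # precompiles/loads serially on the master
@everywhere using DataFrames   # then loads the compiled package on each worker
```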
However, suppose that instead of `DataFrames`, I want to load my very simple module called `Run`:
```julia
# Run.jl
module Run

function runsims()
    println("hello!")
end

end
```
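(For `using Run` to find this file in the first place, the directory containing `Run.jl` has to be on the load path; a minimal sketch of that setup, assuming `Run.jl` sits in the current working directory:)

```julia
# Hypothetical setup: make the directory holding Run.jl visible to `using`
# on this process (workers do not automatically inherit this change).
push!(LOAD_PATH, @__DIR__)
using Run
Run.runsims()    # prints "hello!"
```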
I start a Julia process with no options and run this code:
```julia
using Distributed
using Run
@everywhere using Run
```
No problems.
However, if I start with `julia -p 1` and try to run the same code, I get this error after `using Run`:
```
ERROR: On worker 2:
ArgumentError: Package Run not found in current path:
- Run `import Pkg; Pkg.add("Run")` to install the Run package.
```
Why does this error occur?
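One diagnostic that might help pin this down (a sketch; I have not dug into the output deeply) is comparing what the master and the worker each see as their load path and working directory:

```julia
using Distributed   # started as `julia -p 1`
@show LOAD_PATH pwd()                          # what the master sees
remotecall_fetch(() -> (LOAD_PATH, pwd()), 2)  # what worker 2 sees
```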
If I try this:

```julia
using Distributed
import Pkg
Pkg.add("Run")
using Run
@everywhere using Run
```
then I also get an error:

```
ERROR: LoadError: The following package names could not be resolved:
 * Run (not found in project, manifest or registry)
Please specify by known `name=uuid`.
```
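My understanding is that `Pkg.add` only resolves names through a registry, so an unregistered local module can never be installed that way. If `Run` were laid out as a real package (a `Project.toml` plus `src/Run.jl`), something like `Pkg.develop` would presumably be the registry-free route (a hypothetical sketch; the path is made up):

```julia
using Distributed
import Pkg
# Hypothetical: requires Run to be a package directory
# (Project.toml + src/Run.jl), not a bare Run.jl file.
Pkg.develop(path="path/to/Run")
using Run
@everywhere using Run
```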
Am I just using modules the wrong way? I have not had problems with this kind of approach until trying to parallelize with `Distributed`.