Distributed workflow for MCMC

I would like to set up an MCMC workflow using Distributed.

I have a Julia script that does the following:

  1. load all packages
  2. load the data
  3. run an MCMC chain, with index i, for i in 1:5
  4. save the result in some_chain_$i.jld2
  5. done! send the user a message.

I would like to parallelize step 3 with Distributed. Is there a tutorial that would get me started? I have never used this package before, so apologies if not all of my questions make sense.

I am running everything on a single server which I fully control, so processes are local. Is it sufficient to just use addprocs(5) with the local manager?

Is it enough if I load packages using @everywhere?
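For concreteness, here is roughly what I have in mind for the setup (the package names are just the ones from my script, and this is only a sketch):

```julia
using Distributed
addprocs(5)  # local manager: spawn 5 worker processes on this machine

# load packages on the master *and* on every worker
@everywhere using DynamicHMC, JLD2
```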

For the core computation, can I just do something like this:

remotes = [remotecall(my_mcmc_runner, w, data, logdensity) for w in workers()]
map(fetch, remotes)

(using workers() rather than 1:5, since after addprocs(5) the worker ids are 2:6 and pid 1 is the master)

to automatically finish when all tasks are done?

Slight tangent, but IIUC Pigeons.jl is essentially a distributed MCMC engine: Custom MCMC · Pigeons.jl

Thanks, but that’s not what I want to do, I want to use DynamicHMC.jl.

I just need help with Distributed, as explained above.

MCMC is just the context, to indicate the kind of parallelism.

Not a direct answer, but consider using DistributedNext.jl instead. Much faster to start a worker and other nice improvements.
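As far as I know it is meant as a drop-in replacement for Distributed, so assuming the same API, the only change in the sketch above would be the package you load:

```julia
using DistributedNext  # instead of `using Distributed`

addprocs(5)  # same call as with Distributed, just with faster worker startup
@everywhere using DynamicHMC  # @everywhere etc. work the same way
```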

To answer these questions: Yes to all of them.

Perhaps consider controlling the RNGs of your workers. It is unlikely that two of them would realistically start with the same seed, but seeding them explicitly is of course beneficial for reproducibility nonetheless.
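A minimal sketch of explicit per-worker seeding (assuming your sampler draws from the default global RNG of the Random stdlib):

```julia
using Distributed

@everywhere using Random

# give each worker a distinct, fixed seed so runs are reproducible
for (k, w) in enumerate(workers())
    remotecall_wait(Random.seed!, w, 1234 + k)
end
```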

My understanding is that pmap does basically all you need under the hood, so you should just try that as a first pass.
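A sketch of the whole workflow with pmap, reusing the function and file names from the question; the sampler body and the data/logdensity definitions are placeholders, not working DynamicHMC code:

```julia
using Distributed
addprocs(5)

@everywhere using JLD2  # plus your MCMC packages, e.g. DynamicHMC

@everywhere function my_mcmc_runner(i, data, logdensity)
    chain = nothing  # placeholder: run your MCMC chain with index i here
    jldsave("some_chain_$i.jld2"; chain)
    return i
end

data = randn(100)                    # placeholder data
logdensity = x -> -sum(abs2, x) / 2  # placeholder log-density

# pmap farms the 5 chains out to the workers and returns
# only once all of them have finished
pmap(i -> my_mcmc_runner(i, data, logdensity), 1:5)
```

Note that pmap also handles load balancing for you, so it generalizes nicely if you later want more chains than workers.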