Spent last week fighting with issues on a slurm cluster, and having finally figured it out, I wanted to share the result (and a warning): I had been parallelizing through a slurm batch script with this call: julia --machinefile $SLURM_NODEFILE indiv_array.jl (For full script, see here ) Turns out…

Issues with machinefile and SLURM

vchuravy December 20, 2017, 7:23pm 2

Using srun should be fine since that is the correct way of starting a job.

The workflow should work something like this:

salloc | sbatch # create resources.
julia> addprocs(SlurmManager(2)) # SlurmManager should inherit the outside allocation.

[Ann] julia in parallel batch mode: job schedulers, etc

How to get started with distributed memory parallel programming?

Topic		Replies	Views
Configure Slurm File for Parallelization Julia at Scale parallel , cluster , distributed , slurm	4	123	October 13, 2024
SLURM manager: one node with multiple tasks General Usage slurm	2	154	December 28, 2024
Debugging possible issue with `machinefile` option on SLURM system Julia at Scale	9	1711	December 19, 2017
Julia on Slurm Cluster Specific Domains question	5	569	March 13, 2021
SlurmClusterManagers timeout Julia at Scale slurm	2	41	June 19, 2025