Running Julia in a SLURM Cluster

marius311 · September 3, 2021, 8:44pm

My preferred way is something like this:

#!/usr/bin/env sh
#SBATCH -N 10
#SBATCH -n 8
#SBATCH -o %x-%j.out
#=
srun julia $(scontrol show job $SLURM_JOBID | awk -F= '/Command=/{print $2}')
exit
# =#

using MPIClusterManagers
MPIClusterManagers.start_main_loop(MPI_TRANSPORT_ALL)

println(workers()) # should have 80 workers here across 10 nodes (controlled by -n and -N above)

You put this in myscript.jl and then sbatch myscript.jl.

This is using A neat Julia/SLURM trick and MPIClusterManagers.jl.

ClusterMangers’s ElasticManager is also quite useful for dynamically hooking up workers to e.g. a Jupyter session, if you prefer the interactive workflow.

Topic		Replies	Views
Using Julia on Cluster New to Julia question	2	626	March 6, 2023
Using multi-node Distributed.jl in a slurm cluster Julia at Scale	1	185	January 29, 2025
Distributed Computing with Slurm and Julia Julia at Scale	9	3517	February 10, 2022
I am unable to run a simple distributed.jl code on my slurm cluster Julia at Scale parallel , distributed , slurm	11	640	February 10, 2024
How to parallel Julia on multiple nodes on HPC (slurm)? Julia at Scale question	11	3576	May 20, 2020

Running Julia in a SLURM Cluster

Related topics