A good way of running Julia MPI jobs with Slurm on HPC platforms

For some reason, the Julia MPI jobs that I submit fail with an error about
the project not being instantiated/activated. I run them as

mpiexec julia --project=~/a64fx/FinEtoolsDDParallel.jl/examples ~/a64fx/FinEtoolsDDParallel.jl/examples/heat/Poisson2D_cg_mpi_driver.jl

and when I run just julia --project ~/a64fx/FinEtoolsDDParallel.jl/examples/ on its own, it works fine.
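
One thing I have not ruled out (this is just a guess on my part, not something I have verified) is that the ~ fails to expand when the command runs inside the batch script; with the path spelled out via $HOME, the invocation would be

# Same invocation, with the project path written out via $HOME instead of ~
mpiexec julia --project=$HOME/a64fx/FinEtoolsDDParallel.jl/examples \
    $HOME/a64fx/FinEtoolsDDParallel.jl/examples/heat/Poisson2D_cg_mpi_driver.jl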

How do you guys run Julia on HPC?

For me, Distributed.jl was always enough. But I only had embarrassingly parallel problems anyway, and MPI always looked too complicated for that :sweat_smile:
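
For that kind of workload, a minimal sketch (a toy example, nothing from a real job) looks like this:

# Toy embarrassingly parallel run with Distributed.jl: -p starts 8 local
# worker processes, @everywhere defines the work function on all of them,
# and pmap farms the inputs out across the workers.
julia -p 8 -e 'using Distributed;
    @everywhere f(x) = x^2;
    println(pmap(f, 1:100))'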

On NERSC, one of my recent Slurm scripts looks like this (I cd to the project directory, but one could replace the --project=. with a specific path):

#!/bin/bash
#SBATCH -A mp107d
#SBATCH --qos=regular
#SBATCH -C cpu
#SBATCH -t 1:00:00
#SBATCH --nodes=32
#SBATCH --ntasks-per-node=16
#SBATCH --cpus-per-task=16
#SBATCH -J big_filter
#SBATCH --exclusive

cd YOUR_PROJECT_DIR
# System MPI and parallel HDF5 for MPI.jl / HDF5.jl to bind against
module load cray-mpich
module load cray-hdf5-parallel
export JULIA_NUM_THREADS=8   # Julia threads within each process
which julia

# Instantiate and precompile serially, once, before the parallel launch,
# and print some diagnostics into the job log.
julia --project=. -e \
    'using Pkg; using InteractiveUtils;
     Pkg.instantiate(); Pkg.precompile(); Pkg.status(); versioninfo();
     using MPI; println("MPI: ", MPI.identify_implementation());'

# mpiexecjl consumes --project and passes the remaining flags (here
# --cpu-bind=cores) through to the underlying launcher.
mpiexecjl --project=. --cpu-bind=cores julia fft_filter_6144.jl
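
The mpiexecjl wrapper on the last line ships with MPI.jl; if you do not have it yet, it can be installed once with (this setup step is not shown in the script above):

# One-time setup: puts the mpiexecjl wrapper into the depot's bin directory
julia --project=. -e 'using MPI; MPI.install_mpiexecjl()'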

On my university cluster, I need to set some environment variables/flags to get a multi-node MPI setup to work. Here is a template for the Slurm script I use for distributed training of a neural network.

#!/bin/bash
#SBATCH --nodes=2
#SBATCH --ntasks=80
#SBATCH --cpus-per-task=1
#SBATCH --exclusive
#SBATCH --time=4:00:00
#SBATCH --mem=0

module load julia/1.9.2
module load mpi/openmpi-4.1.5

...

# --mca btl_tcp_if_include eth0 restricts Open MPI's TCP transport (BTL) to
# the eth0 interface, so it does not pick one the nodes cannot reach each
# other on.
~/.julia/bin/mpiexecjl --project=$PROJECT --mca btl_tcp_if_include eth0 -n 80 julia --project=$PROJECT script.jl
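
Depending on how Slurm and Open MPI were built on a given cluster, launching straight through srun can also work; a sketch, assuming Slurm was compiled with PMI/PMIx support (I have not tested this variant on my cluster):

# Let srun do the process launching instead of mpiexecjl
srun -n 80 julia --project=$PROJECT script.jl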

Thank you to all who responded. The setup was weird in some way: when I removed the depot, things changed. Not for the (much) better, though.

Now all the processes try to precompile and it is a huge mess of stale lock files and such. How do you handle that?

I would like to have a single depot on the Lustre file system, with each process using that depot. So now I need to prevent each process from trying to precompile stuff…
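
Concretely, what I am aiming for (a sketch with hypothetical paths, not something I have working yet) is:

# One shared depot on Lustre for every rank; instantiate and precompile
# once, serially, so the MPI ranks only ever read the compile caches.
export JULIA_DEPOT_PATH=/lustre/$USER/.julia
julia --project=$PROJECT -e 'using Pkg; Pkg.instantiate(); Pkg.precompile()'
mpiexecjl --project=$PROJECT -n $SLURM_NTASKS julia --project=$PROJECT script.jl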

So you have a separate Julia depot on all nodes?

Using MPI is a pain! Could not agree more.

I figured out one way to run the sim with MPI, only to be stopped by a weird error: Ookami: MPI error opal_libevent2022_evthread_use_pthreads · Issue #835 · JuliaParallel/MPI.jl · GitHub

Edit: Looks like there is a solution.