Hi, I am running some distributed code using ClusterManagers.jl
’s addprocs_slurm
command to spawn processes after submitting an sbatch script on our cluster’s login node. I can happily rename the output file for the “head” process and even capture error logs, however, I cannot figure out how to determine the naming pattern of the spawned processes output files, they all appear as job0001.out, job0002.out etc. and I’d like to be able to change this if possible so I can run two large experiments at once without them overwriting each other.
Are you using the master branch? I think now it also uses the task id in the file names, so you may run multiple tasks at once