Question About New GC Threads

I am excited to start using Julia 1.10.0, since I have a set of simulations that are allocation-heavy (many small allocations) and I believe the GC is the current bottleneck.

I want to make sure I am doing this right. To start, I think it makes sense to split the total available threads between GC and compute. To that end I am running the following.

My run script contains the following lines (summarized):

```shell
# runscript.slurm

#SBATCH --ntasks-per-node=12

GC_THREADS=6
COMPUTE_THREADS=6

main="my_simulation.jl"

export OMP_NUM_THREADS=$COMPUTE_THREADS

julia --project=@. --gcthreads=$GC_THREADS $main
```

The idea here is that I don’t want the GC threads to compete with the BLAS threads, right? So it wouldn’t make sense to set both to 12 (the total thread count in this example).
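One thing worth double-checking in a script like this: OMP_NUM_THREADS only affects BLAS, so nothing here sets Julia’s own compute threads (julia defaults to a single thread). If the simulation itself is multithreaded (Threads.@threads and friends), you would also pass --threads. A sketch, assuming the same 6/6 split as above:

```shell
GC_THREADS=6
COMPUTE_THREADS=6

export OMP_NUM_THREADS=$COMPUTE_THREADS

# --threads sets Julia's compute threads; --gcthreads sets the GC threads
julia --project=@. \
      --threads=$COMPUTE_THREADS \
      --gcthreads=$GC_THREADS \
      my_simulation.jl
```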

Also: I can double-check the number of BLAS threads with BLAS.get_num_threads(). Is there a similar function for checking the number of GC threads?

Try this:

--gcthreads=6,1

For me it gives a significant improvement.
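For context, in 1.10 the flag takes the form --gcthreads=N,M: N is the number of parallel mark threads, and M (0 or 1) toggles a concurrent sweep thread, so 6,1 means six mark threads plus one concurrent sweeper:

```shell
# 6 parallel GC mark threads, 1 concurrent sweep thread
julia --project=@. --gcthreads=6,1 my_simulation.jl
```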

And I don’t think the BLAS threads compete with the GC threads, because the GC stops everything else while it runs, but I might be wrong.

Threads.ngcthreads() (although it’s not public API).
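For example, you can sanity-check both counts from within a session (with the caveat that Threads.ngcthreads is internal, so it may change between Julia versions):

```julia
using LinearAlgebra  # provides the BLAS module

BLAS.get_num_threads()   # number of BLAS threads (public API)
Threads.ngcthreads()     # number of GC threads (internal API)
```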

Note that, while this works (because we explicitly check for it), you should set OPENBLAS_NUM_THREADS instead, because Julia’s OpenBLAS is built with pthreads, not OpenMP threads.

Thanks!

I was using OMP_NUM_THREADS because my cluster has both Intel and AMD CPUs, and my run script is built so that on AMD it won’t use MKL (it defaults to OpenBLAS). Since a run could end up on either OpenBLAS or MKL depending on where Slurm dispatches it, I thought OMP_NUM_THREADS would cover both cases. Sorry if that’s completely wrong or bad practice (it was just my naive first attempt). If you have any recommendation on how to do this better, please let me know.

I suppose I could constrain Slurm to use only Intel CPUs and then use MKL_NUM_THREADS?
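Another option might be to export both variables, on the assumption that each BLAS library reads only its own variable, so whichever backend actually loads picks up its setting and the other export is harmless. A sketch (COMPUTE_THREADS=6 stands in for the value from the run script above):

```shell
COMPUTE_THREADS=6

# Each library reads only its own variable, so exporting both
# pins the thread count for whichever backend actually loads.
export OPENBLAS_NUM_THREADS=$COMPUTE_THREADS
export MKL_NUM_THREADS=$COMPUTE_THREADS
```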

Good to know, I will try that today.

Regarding whether the threads compete, that would also be good to know.

I cannot answer that question, but you should know that MKL also works nicely on AMD CPUs. Just benchmark whether OpenBLAS or MKL is faster for your use case.
Running MTK simulations multi-threaded only works with MKL, in my experience.

Hm, yes, I suppose I will have to. I thought I had read somewhere that MKL deliberately ran slower if it detected an AMD CPU.

This is the relevant link (it might be outdated): How to circumvent Intel's AMD discrimination in MKL from v1.7 onwards?