Best strategy to choose the number of gc threads on a cluster

jishnub · December 14, 2023, 4:02am

I have multithreaded code that I am running on a cluster using Julia v1.10, and it allocates many temporary arrays. The computation is quite linear-algebra heavy, and I would like memory to be freed regularly. Assuming that I need 10 threads to carry out my calculation, would a good strategy be to request 10+n CPUs and start Julia with julia -t 10 --gcthreads n,1? So with 15 CPUs, this would be julia -t 10 --gcthreads 5,1. Would having free CPUs for the GC threads help, or is this unnecessary? Also, would the 1 dedicated thread for the concurrent sweep phase make a difference? In that case, would requesting 16 CPUs be a better idea?

Sorry about the vague question, but I am just looking for general guidelines that I can play around with.
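As a starting point for experimenting, here is a minimal sketch for confirming what a given command line actually produced. The helper name thread_report is made up for illustration; Threads.nthreads, Threads.ngcthreads (available from Julia 1.10) and Sys.CPU_THREADS are the real queries. Note that Sys.CPU_THREADS may reflect the whole node rather than the CPUs the scheduler granted, so treat it only as a rough check.

```julia
# Run with e.g.:  julia -t 10 --gcthreads 5,1 check_threads.jl
# `thread_report` is a hypothetical helper name, not part of Julia.
function thread_report()
    return (
        compute_threads = Threads.nthreads(),    # threads in the default pool (-t)
        gc_threads      = Threads.ngcthreads(),  # configured GC threads (Julia >= 1.10)
        cpu_threads     = Sys.CPU_THREADS,       # logical CPUs Julia detects
    )
end

report = thread_report()
println(report)
```

Comparing compute_threads and gc_threads against the number of CPUs in the job request shows whether the run is oversubscribed before any timing is done.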

Oscar_Smith · December 14, 2023, 4:12am

For this you should likely use julia -t 15 --gcthreads 8,1. Julia doesn’t have a concurrent GC, so when the GC is running, the regular threads aren’t running, and there is little point in keeping cores idle just for GC threads. That said, if you are seeing a substantial amount of time spent in GC, it might be worth reducing the number of temporary arrays needed (e.g. via preallocation). Especially once you start scaling up threads (e.g. running on 30-60 cores), GC can become an issue if not mitigated, since garbage collection (pretty much inherently) doesn’t achieve perfect scaling. On the other hand, if your arrays are relatively large, I would expect garbage collection to be pretty quick.
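One way to see whether GC time is substantial, and whether preallocation helps, is to compare the gctime and bytes fields returned by @timed. The following is only a sketch: the function names, the matrix size and the iteration count are arbitrary assumptions for illustration, not taken from the original workload.

```julia
using LinearAlgebra

# Allocates a fresh n×n result matrix on every iteration.
function allocating(A, B, iters)
    s = 0.0
    for _ in 1:iters
        C = A * B
        s += C[1, 1]
    end
    return s
end

# Reuses a single preallocated buffer through the in-place mul!.
function preallocated(A, B, iters)
    C = similar(A)
    s = 0.0
    for _ in 1:iters
        mul!(C, A, B)
        s += C[1, 1]
    end
    return s
end

n, iters = 200, 50          # arbitrary sizes, chosen only for illustration
A = rand(n, n)
B = rand(n, n)

# Warm up so that compilation is not included in the measurements.
allocating(A, B, 1)
preallocated(A, B, 1)

r1 = @timed allocating(A, B, iters)
r2 = @timed preallocated(A, B, iters)

println("allocating:   ", r1.time, " s total, ", r1.gctime, " s in GC, ", r1.bytes, " bytes")
println("preallocated: ", r2.time, " s total, ", r2.gctime, " s in GC, ", r2.bytes, " bytes")
```

Running the same script with different --gcthreads settings and looking at how gctime changes gives a direct, if rough, answer for a specific workload.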

johnh · December 14, 2023, 9:45am

One thing I would flag up here - think about thread pinning. You don’t want the OS scheduler moving your threads between cores.

BTW you say 15 CPUs - does that mean 15 CPU cores or 15 CPU sockets?
Indeed anything not a power of 2 gives me the willies…

GitHub - carstenbauer/ThreadPinning.jl: Readily pin Julia threads to CPU processors
