Garbage collection not aggressive enough on Slurm Cluster

Salmon · May 27, 2021, 10:57am

That’s strange. I am also using Julia on a Slurm cluster and have never encountered this issue. On the contrary, when I lower the amount of memory per CPU, I typically get close to full memory efficiency, while GC time increases.
Unfortunately, I cannot help directly with this problem as I am no expert, but maybe the difference between our use cases will help identify the issue?
My setup uses Threads across all cores of a node and distributes equivalent jobs over workers via pmap. As a result, I do not have that many distinct workers. Do you know whether your problem depends on the number of workers, i.e., is there still a memory leak if you only use one worker?
In any case, I hope you find a solution,
Salmon

Edit: To clarify, I only call pmap() once per program, so if the issue is that memory is not freed after a worker has completely finished, this might be why it works in my case.
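
The kind of setup described above could be sketched roughly as follows. This is only a minimal illustration, not the actual program: the worker count, the thread count, and the `run_job` function are all hypothetical placeholders, and on a real cluster the workers would normally come from the Slurm allocation rather than a local `addprocs` call. It also shows one way to compare peak memory per worker, which might help check whether the problem depends on the number of workers.

```julia
using Distributed

# Assumption: 2 local workers with 2 threads each, purely for illustration.
# Setting the count to 1 would give the single-worker comparison.
addprocs(2; exeflags="--threads=2")

# Hypothetical job: a threaded sum of squares over 1:n.
# Each chunk is handled by its own task, so all threads of a worker are used.
@everywhere function run_job(n::Int)
    chunks = Iterators.partition(1:n, cld(n, Threads.nthreads()))
    tasks = [Threads.@spawn sum(abs2, c) for c in chunks]
    return sum(fetch, tasks)
end

# A single pmap call distributes the equivalent jobs over the workers.
results = pmap(run_job, [10, 100, 1000])

# Peak resident memory (in bytes) reported by each worker process.
peak_rss = Dict(w => remotecall_fetch(Sys.maxrss, w) for w in workers())

# Shut the workers down once the single pmap call is done.
rmprocs(workers())
```

Running this with one worker and then with several, and comparing `peak_rss`, would be one rough way to see whether memory use grows with the number of workers.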
