Running multiple instances of an already parallel external program in parallel

Tetrakai · June 30, 2024, 6:26pm

I have a situation like below:

Julia

nrep = 3
Threads.@threads for i in 1:nrep
    run(Cmd(`./external_program`; dir=path))  # start the program in `path`, no shell needed
end

external_program.cpp

#pragma omp parallel for num_threads(10)
for (int i = 0; i < 10; i++){
     // do stuff
}

Also, my CPU has 32 cores (64 hardware threads), and Julia is running with 32 threads.

Everything works fine, but when nrep*num_threads > 32 I get much more overhead than when I run the equivalent nested parallel loops entirely in Julia.

I’m not sure what exactly is going on in the background, but I’m guessing the latter case is composable while my problem is not.

Is that understanding correct? Is there a way to address this?

abraemer · June 30, 2024, 6:42pm

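Yes, Julia has composable multithreading: nested parallel loops in Julia all share one thread pool, whereas each external process starts its own OpenMP threads without knowing about the others.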
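As a rough illustration of what composable means here, the sketch below nests tasks inside tasks. `inner_work` and `nested_sum` are hypothetical names, and the arithmetic is only a stand-in for the real workload. However many tasks get spawned, they are all scheduled onto the threads Julia was started with, so the nesting does not multiply the thread count.

```julia
# Hypothetical stand-in for the work done in the inner loop.
inner_work(i, j) = i * j

function nested_sum(nrep, ninner)
    # Outer level: one task per repetition.
    outer = map(1:nrep) do i
        Threads.@spawn begin
            # Inner level: more tasks, scheduled on the same thread pool.
            inner = map(1:ninner) do j
                Threads.@spawn inner_work(i, j)
            end
            sum(fetch, inner)
        end
    end
    return sum(fetch, outer)
end

total = nested_sum(3, 10)
```

With an external OpenMP program there is no shared scheduler like this, which is why the thread budget has to be managed by hand.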

To fix this, you either need to control both the number of threads each external program uses and the number of instances you start, so that you avoid oversubscription, or just do everything in Julia and don’t waste brain cycles on the thread logistics.
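For the first option, a minimal sketch could look like the following. It assumes the numbers from the question (32 cores, 10 OpenMP threads per instance), and `max_concurrent` and `run_throttled` are made-up helper names, not part of any library. Since `num_threads(10)` is hardcoded in the pragma, the only thing left to control from Julia is how many instances run at once, which `asyncmap` with `ntasks` can do.

```julia
# Assumed values taken from the question, adjust for your machine.
const TOTAL_CORES = 32
const THREADS_PER_INSTANCE = 10

# How many instances fit without requesting more threads than cores.
max_concurrent(cores, per_instance) = max(1, cores ÷ per_instance)

# Run `nrep` copies of `cmd`, never more than `limit` at the same time.
# `run` only waits on the child process, so plain tasks are enough here.
function run_throttled(cmd::Cmd, nrep::Integer; limit::Integer)
    asyncmap(1:nrep; ntasks = limit) do _
        run(cmd)
    end
    return nothing
end

limit = max_concurrent(TOTAL_CORES, THREADS_PER_INSTANCE)

# Usage with the program from the question (`path` and `nrep` as defined there):
# run_throttled(Cmd(`./external_program`; dir = path), nrep; limit = limit)
```

If the program did not hardcode its thread count, you could instead pass `OMP_NUM_THREADS` to each instance with `addenv(cmd, "OMP_NUM_THREADS" => "...")` and trade threads per instance against the number of instances.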
