I didn’t find any recent thread on this topic, and since the documentation suggests development in this area is very active, I was wondering whether there is any updated information.
I run a simulation model that takes a couple of different parameters, and I would like to loop over different values of two of them in parallel on my laptop (MacBook M1, if that matters). In simplified form, my code looks something like this:
convergence_time = zeros(10, 10)
for a in 1:10
    for b in 1:10
        res = simulate_model(a, b)
        convergence_time[a, b] = res.t_conv
    end
end
Now I tried to parallelize this by using Threads.@threads on both for loops:
convergence_time = zeros(10, 10)
Threads.@threads for a in 1:10
    Threads.@threads for b in 1:10
        res = simulate_model(a, b)
        convergence_time[a, b] = res.t_conv
    end
end
which did speed up the calculation, but I am wondering whether that is currently the best way to go about it.
The details will very much depend on what simulate_model(a,b) does. If you’re solving an ODE, consider an ensemble problem in DifferentialEquations.jl using EnsembleThreads(), as sketched below (though if each simulation is sufficiently short, running them serially might work out faster). Otherwise, you can of course collapse your nested loops into for a in 1:10, b in 1:10, which may let the compiler simplify things more readily, or at least saves the overhead of spawning threads in the inner loop ten times. Finally, for M1-based machines, the Apple Silicon native Julia (1.8 or newer) is faster and less buggy, and it also gives you a dynamic thread scheduler by default, which may make better use of your compute resources than the static scheduler in 1.7.
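For instance, here is a minimal sketch of the ensemble approach. The right-hand side f!, the initial condition, and the way (a, b) enter as parameters are all placeholders, since the actual model isn’t shown:

using DifferentialEquations

# Placeholder ODE standing in for whatever simulate_model integrates.
function f!(du, u, p, t)
    a, b = p
    du[1] = -a * u[1] + b
end

base_prob = ODEProblem(f!, [1.0], (0.0, 100.0), (1.0, 1.0))

# One (a, b) pair per trajectory.
params = vec([(a, b) for a in 1:10, b in 1:10])
prob_func(prob, i, repeat) = remake(prob; p = params[i])

ensemble = EnsembleProblem(base_prob; prob_func = prob_func)
# Solve all 100 trajectories, distributing them across threads.
sols = solve(ensemble, Tsit5(), EnsembleThreads(); trajectories = length(params))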
Nesting Threads.@threads like this creates too much scheduling overhead on current Julia versions. In any case, it should be sufficient to parallelize only the outer loop unless you have more than 10 cores.
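Concretely, that would look like the sketch below, reusing simulate_model from the question. Each iteration writes to a distinct entry of convergence_time, so there is no data race:

convergence_time = zeros(10, 10)
Threads.@threads for a in 1:10  # parallelize only the outer loop
    for b in 1:10               # inner loop runs serially on each thread
        res = simulate_model(a, b)
        convergence_time[a, b] = res.t_conv
    end
end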
From the Julia 1.8 release notes: Threads.@threads now defaults to a new :dynamic schedule option which is similar to the previous behavior except that iterations will be scheduled dynamically to available worker threads rather than pinned to each thread. This behavior is more composable with (possibly nested) @spawn and @threads loops (#43919, #44136).
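On 1.8 or newer you can also request the schedule explicitly and check how many threads the session was started with; a minimal sketch (assuming Julia was launched with e.g. julia --threads=auto):

# How many worker threads this session has.
Threads.nthreads()

# The schedule can be passed explicitly; :dynamic is the default on Julia >= 1.8.
Threads.@threads :dynamic for a in 1:10
    println("iteration $a ran on thread $(Threads.threadid())")
end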