Lack of improvement from distributed pmap, understanding a simple example

ma-y · October 28, 2024, 6:40pm

Thank you so much for your kind reply, this is very helpful.

If we consider the setup with BLAS.set_num_threads(1) (@everywhere), what is the reason that we can’t scale efficiently to 4 workers and that there are almost no gain from going from 4 to 8?

Is it because it’s a memory bound problem or is there something else I am missing (communication is deliberately kept to a minimum in the example, but I do need to create arrays in the problem that matters for me)?

Topic		Replies	Views
Struggling with pmap New to Julia parallel	8	1081	September 5, 2019
Multithreading and pmap Julia at Scale	8	2844	January 5, 2019
Pmap usage Performance question , parallel	1	386	December 13, 2020
Why is the parallel map so slow? General Usage parallel , optimization , pmap	2	3319	May 10, 2020
Pmap use of processor cores Julia at Scale question , pmap , load-balancing	13	2290	June 12, 2019

Lack of improvement from distributed pmap, understanding a simple example

Related topics