"Textbook" use of pmap but strange execution times

mgiugliano · July 26, 2019, 4:15pm

I have 28 cores and 56 threads and I call julia with “-p auto”.
When starting from a (non-parallel) example,

@time for x in [10;ones(Int, 10)]
sleep(x/10); println(x)
end

its execution time (~2s) makes perfect sense.

However, the execution time of the following (equivalent?) parallelized code does not:

@time res = pmap(x -> (sleep(x/10); println(x)), [10;ones(Int, 10)]);

In fact, this (reproducibly) takes 7 repeated (identical) executions, before converging to ~1 (i.e. 3.2s, 1.8s, 1.8s, 1.8s, 1.8s, 1.8, 1.1, 1.1,…).

I know Julia compiles functions upon first use (i.e. the 3.2s), but… why should I wait seven repeated executions, before getting the intended parallelization?

Topic		Replies	Views
Using Julia with @parallel pmap or blank makes no difference in speed. Julia at Scale	3	863	March 22, 2018
Simple Parallel Examples for Embarrassingly Simple Problems Julia at Scale	29	7350	April 23, 2021
Poor speed gain using `pmap` Performance parallel , pmap	17	1194	August 6, 2021
unexpected pmap behaviour New to Julia performance , parallel , regex	0	476	March 4, 2019
For loop in function and multiplication of larger matrices, slow speed in parallel Performance performance , parallel , loops	3	1301	November 22, 2019

"Textbook" use of pmap but strange execution times

Related topics