Clarification on pmap master-worker model

juliohm · December 18, 2017, 3:05am

In the master-worker model with pmap, what is the correct way of checking for parallel vs. serial execution?

if nprocs() > 1
  # do task in parallel
else
  # fallback to serial execution
end

or

if nprocs() > 2
  # do task in parallel
else
  # fallback to serial execution
end

Can the master process perform work without overhead, or should I add 1 for safety when querying nprocs()? Assume this will run on an HPC cluster without inter-node SSH communication.

vchuravy · December 18, 2017, 3:10am

Without inter-node communication the master process shouldn’t do any work.

nprocs() > 1 checks for distributed execution, but you can also just query nworkers().
If nprocs() == 2 there is only a single worker, so you won’t have parallel execution, but you still have a distributed setup.
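A minimal sketch of the two checks side by side, assuming Julia 0.7 or later, where these functions live in the Distributed standard library (at the time of this thread they were still exported from Base). The variable names `distributed` and `parallel` are only illustrative:

```julia
using Distributed  # assumption: Julia >= 0.7; not needed on 0.6

# nprocs() counts every process, including the master (process 1).
# nworkers() counts worker processes; with no workers added, the master
# is counted as the sole worker, so nworkers() is 1 rather than 0.
println("nprocs()   = ", nprocs())
println("nworkers() = ", nworkers())
println("workers()  = ", workers())

# At least one worker besides the master exists.
distributed = nprocs() > 1

# More than one worker exists, so tasks can actually run side by side.
parallel = nworkers() > 1
```

Note that both a session with no workers and a session with exactly one worker report nworkers() == 1, which is why nprocs() > 1 is the check that separates them.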

juliohm · December 18, 2017, 3:14am

Thank you @vchuravy, very helpful information. From your answer, I understood that most use cases should be using nworkers() instead of nprocs():

if nworkers() > 1
  # do task in parallel
end

I will fix my packages accordingly.

vchuravy · December 18, 2017, 3:17am

I normally just use nprocs() > 1, since the question for me is most often not "should I do this in parallel?" but "should I execute my program on the workers()?", even if only one worker is available.
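As a rough illustration of that approach, the sketch below dispatches to the workers whenever any exist and otherwise falls back to a serial map. The helper name `maybe_pmap` is hypothetical and not part of any package; it assumes Julia 0.7 or later for the `using Distributed` line:

```julia
using Distributed  # assumption: Julia >= 0.7; not needed on 0.6

# Hypothetical helper: run f over xs on the workers when at least one
# worker is available, otherwise run serially on the master process.
function maybe_pmap(f, xs)
    if nprocs() > 1
        return pmap(f, xs)   # executed on workers(), even if there is only one
    else
        return map(f, xs)    # serial fallback on the master
    end
end

squares = maybe_pmap(x -> x^2, 1:5)
```

To require genuinely parallel execution instead, the condition could be swapped for nworkers() > 1, as discussed earlier in the thread.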
