Overhead of Distributed module vs. Threads

RonK · March 7, 2019, 5:25pm

It’s been a long, long time since I last used Julia and I need to update code written for version 0.4 which used parallelization primitives which seem to have since been moved into their own module, Distributed. While looking at the documentation, I see that there is a new experimental module Threads which if I understand correctly is designed for a single compute node with multiple cores. My question is how much extra overhead is there in continuing to use the Distributed module when anyway the code most likely will only be run on a single compute node with multiple cores? If each remotecall_fetch(…) requires multiple seconds to finish its computation, then am I correct to assume that the extra overhead will be insignificant?

zgornel · March 7, 2019, 5:57pm

The answer to the question is program dependent. Threading generally damages the scalability and clarity of the code. For simple programs and operations it can improve performance but if overheads of seconds are large, you may not need threading at all… There were some very interesting videos from an older juliacon (2016 i think) where some people from Intel were discussing this. I guess the best way is to give it a try with both parallel and threaded implementations and check out the complexity implications of both.

Topic		Replies	Views
Why might I be seeing a large overhead for multiprocessing? Performance	8	2357	October 1, 2020
First post - what am I doing wrong - distributed benchmark show no improvement New to Julia question , benchmark , distributed	2	352	September 22, 2021
Is the best number of threads used in parallel computing by using distribute 4? Performance parallel	4	1388	June 11, 2020
An embarrassingly parallel problem: threads or MPI? Performance parallel , multithreading , mpi	14	3860	June 10, 2021
What type of code is `threads` good for? General Usage	12	1687	July 26, 2019

Overhead of Distributed module vs. Threads

Related topics