Using Distributed computing in JULIA for UNet

chuma2 · August 29, 2022, 3:51am

Hi
I have a UNet training for Machine Learning which works on my laptop (14 core) but its extremely slow.
I’m trying to explore ways of speeding it up and came across Distributed library.
So I’ve only added the following three lines in my code:

using Distributed

addprocs(exeflags=`--project=$(Base.active_project())`)
..
rmprocs(workers)

Also, It doesn’t identify pmap() function which I’ve called like this :

train_batch_input_files, train_batch_target_files = pmap(grab_random_files, train_dataset, batch_size)
I’m not sure If i’m actually using distributed computing correctly or not?

jmair · August 29, 2022, 8:18am

It looks like you are just trying to load the files in parallel? If you are loading from disk, this will likely be bottlenecked by your disk speed, and not how many cores you have.

pmap is likely being defined, but maybe it is called with the wrong types. The main way to use pmap is:

some_func(x) = x^2

results = pmap(some_func, [1, 2, 3])
# results = [1, 4, 9]

So that each element in the array is mapped using the supplied function into a result, similarly to how one uses broadcasting.

In your example, the pmap should return an array of results, with each element being the return type of your supplied function. In this case it looks like a tuple. This means you will get an array of tuples as the return type, so I don’t think you can simply destructure that. Secondly, if the batch size parameter is fed into your function, but is not an array, you can create an anonymous function which wraps this parameter:

results = pmap(x->grab_random_files(x, batch_size), train_dataset)

If you are trying to get your code to run faster, I would recommend profiling the code first to see which parts are taking the most time, and focus on optimising them first.

EDIT: Use the @everywhere macro to load any non base functions used in your mapping functions. I usually have a separate file with all necessary function definitions and have an @everywhere include("functions.jl") before I use pmap.

chuma2 · August 30, 2022, 6:47am

So I tried

but it throws an error for

‘status’ not defined

. How do I use pmap then?

chuma2 · August 30, 2022, 11:19pm

So instead of using pmap, i used @sync and @distributed before the for loop. In the call to environment i used @everywhere for all process to have access to all using files and now the run time has reduced. from 145 to 110 seconds.

jmair · August 31, 2022, 5:22am

Since this is just on a single machine, multithreading is likely a better fit here as it has very little overhead (Multi-Threading · The Julia Language). Here you would just replace the pmap and not need the @everything:

Threads.@threads for file in input_files
    Jaws.transfer(file, pwd())
end

Distributed uses different processes with separate memory so one needs to load all the libraries on each process, whereas multithreading uses shared memory. If I am on a single machine, I tend to try multithreading first, and only move to distributed if there is a particular benefit.

Just make sure you have multiple threads available with Threads.nthreads(). A sensible value for this is the number of logical cores in your CPU (14). The docs tells you how to change this.

Topic		Replies	Views
Distributed nested in Pmap Julia at Scale	2	1110	February 22, 2019
Lack of improvement from distributed pmap, understanding a simple example New to Julia distributed , pmap	6	135	October 29, 2024
Understanding message passing with pmap Performance	3	435	June 1, 2022
Distributed parallel loops Julia at Scale parallel , distributed	0	369	December 2, 2023
How to use distributed and pmap across GPU cores GPU question , cuda , distributed , pmap	2	927	April 1, 2022

Using Distributed computing in JULIA for UNet

Related topics