I am using Julia PyPlot to do postprocessing for simulation data. Here is the sample script for my parallel work:
using Distributed
@everywhere using PyCall
@everywhere matplotlib = pyimport("matplotlib")
@everywhere matplotlib.use("Agg")
@everywhere using PyPlot, Glob
@everywhere function process(filename::String, dir::String=".")
    np = pyimport("numpy")
    # readdata is defined elsewhere (my own reader for the simulation output);
    # `time` is assumed to be set during the postprocessing step below.
    filehead, data, filelist = readdata(filename, dir=dir, verbose=false)
    # Postprocessing...
    plt.savefig("$(time).png")
    println("finished saving $(time)!")
end
# Define path and filenames
dir = ".";
filename = "y*.out";
# Find filenames
# ......
# Processing
@distributed for filename in filenames
    println("filename: $(filename)")
    process(filename, dir)
end
In this way, I find all the filenames on one processor, distribute the names among the workers, and do the plotting and saving on each worker using @distributed. Since there is no dependency between processing different files, this seems to work.
I am wondering if there are better ways to do this, say, using @threads or a Channel? Any ideas are appreciated!
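For comparison, here is a minimal sketch of the same workflow using pmap instead of @distributed; it blocks until all files are processed and load-balances across the workers. It assumes process is defined @everywhere as above and that the output files match the y*.out pattern:

using Distributed, Glob

filenames = glob("y*.out", ".")
# pmap blocks until every file is done and balances the work across workers.
pmap(f -> process(f, "."), filenames)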
Later I encountered an issue when running this script from the command line, but not in the REPL. In REPL mode everything looks fine and the plots are saved in PNG format. However, if I just type
julia process.jl
then the function process is never executed, and no error message is returned. What is wrong here? I feel like the scheduled task in the queue is never executed.
The driver script is sending the tasks off to the workers and exiting immediately, because the @distributed for loop returns immediately. This in turn kills the workers immediately. It doesn't happen in the REPL because the REPL stays alive after you run each command.
You need to have your driver script wait for all the workers to finish, for example by doing a dummy reduce.
The docs for @distributed say that if you give it a reducer it’ll wait for the workers, so that it can compute the reduction. So you can have each worker return something, say 1, and use + as the reducer.
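As a sketch, either of the following keeps the driver script alive until every file has been handled (filenames, dir, and process are as in the original script):

# Dummy reduction: each iteration returns 1 and (+) is the reducer,
# so the call blocks until all workers have finished.
total = @distributed (+) for filename in filenames
    process(filename, dir)
    1
end

# Alternative: wrap the reducer-less loop in @sync so the script waits on it.
@sync @distributed for filename in filenames
    process(filename, dir)
end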
Yeah, it only works for v1.3.
I have a small benchmark in the Readme indicating roughly where the overhead caused by my implementation strategy starts becoming noticeable.
If I understand the single global lock on libuv correctly, it means that interacting with libuv is done on a privileged thread, but the actual I/O can happen in parallel? If not, then threads wouldn't help for I/O-heavy tasks…