I’m trying to parallelize my code and find it useful to measure the elapsed CPU time of a single process. However, I find that when using @time, the output is garbled. Here is an MWE, minimal.jl:
using Distributed

@everywhere module minimal

using Distributed

function do_something(x)
    @time x = 5 + x
    return x
end

function main()
    n = 5
    futures = Array{Any}(fill(NaN, n))
    # start processes
    for i = 1:n
        futures[i] = @spawn do_something(i)
    end
    # collect answers
    for i = 1:n
        result = fetch(futures[i])
        @show result
    end
end

end # module

minimal.main()
Then, even with no parallel execution, starting Julia simply as julia, I get:
julia> include("minimal.jl");
00000.....000000000000000000000000000000 seconds seconds seconds seconds seconds
result = 6
result = 7
result = 8
result = 9
result = 10
And similarly when using multiple processes, e.g. starting Julia as julia -p 2.
I was wondering what @time does differently from, say, println(), which does not have this problem? And how can I measure serial execution time?
What’s happening here is that @time uses Base.time_print, which uses @printf, which issues multiple print invocations, one for each part of the output. IO can cause a task switch, and all the tasks are writing to stdout together, so the pieces end up interleaved. (Furthermore, they compete with the REPL task, which is also writing to stdout!)
To reproduce the behavior:
$ julia -e 'using Distributed; for i=1:10; @spawn (print(stdout, "a") ; print(stdout, "b")) ; end ; sleep(1)'
aaaaaaaaaabbbbbbbbbb
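Conversely (an illustrative sketch along the same lines, not from the original snippet): if each task emits its whole output as a single print call, the string goes out in one write and the pieces stay together:

$ julia -e 'using Distributed; for i=1:10; @spawn print(stdout, "ab") ; end ; sleep(1)'
abababababababababab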
You could use @timed and manage the formatting of the output yourself, gathering this information back to the main task and doing the IO there.
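Here is a minimal sketch of that approach, assuming Julia ≥ 1.5, where @timed returns a named tuple with value and time fields (on older versions, destructure the plain tuple instead). The function and variable names are just illustrative:

using Distributed

# Each task measures itself with @timed and returns the timing
# data instead of printing it.
@everywhere function do_something(x)
    stats = @timed 5 + x   # named tuple: (value, time, bytes, gctime, ...)
    return stats
end

function main()
    n = 5
    futures = [@spawn do_something(i) for i = 1:n]
    # Fetch all results, then do ALL the IO from this one task,
    # so no two tasks ever write to stdout concurrently.
    for f in futures
        stats = fetch(f)
        println("result = ", stats.value, " (", stats.time, " seconds)")
    end
end

main()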
Thank you @c42f and @ffevotte! I guess I was mostly stumped that @time behaved differently from println(). The explanation makes sense, though. Thanks again!