Displaying amount of data sent to worker

raminammour · October 10, 2018, 8:46pm

Hello,

I am trying to produce an example that shows the subtle differences in writing code in a distributed setting. Here is a contrived example:

function bla1(rho,nc1)

    t1=@fetchfrom workers()[1]  Sys.free_memory()
    
    pids= workers()[1:nc1]
    ss=zeros(nc1)
    
    @sync for i::Int in 1:nc1
        @async ss[i]=@fetchfrom pids[i] s=sum(view(rho,1:1,1:1)).*myid()
    end
    t2=@fetchfrom workers()[1] Sys.free_memory()
    
    @show (t1-t2)/2^30
    sum(ss)
end

 function bla2(rho,nc1)


    t1=@fetchfrom workers()[1] Sys.free_memory()
    
    ss=zeros(nc1)
    
    pids= workers()[1:nc1]
    
    @sync for i::Int in 1:nc1
        rhoI=view(rho,1:1,1:1)
        @async ss[i]=@fetchfrom pids[i] sum(rhoI).*myid()
    end
    t2=@fetchfrom workers()[1] Sys.free_memory()
    @show (t1-t2)/2^30
    sum(ss)
end

The punchline is that bla1 sends the whole array rho, and bla2 only sends one entry. I can show that effect with:

@everywhere GC.gc_enable(false) #scary
rho=rand(10^7)
@time bla1(rho,3)
@tme bla2(rho,3)

(t1 - t2) / 2 ^ 30 = 0.22303009033203125
  0.131072 seconds (506 allocations: 32.297 KiB)

(t1 - t2) / 2 ^ 30 = 0.0
  0.002824 seconds (526 allocations: 32.016 KiB)

I feel that my solution is rather hacky, especially turning off garbage collection. What is the “Julian” way of doing it? Couldn’t figure it out with @time,@allocated …

Thanks!

Topic		Replies	Views
Distributed: Passing views of an array for read access to workers (using pmap) General Usage question , performance , parallel , distributed , views	8	500	January 31, 2024
Overhead in passing data to worker processes General Usage parallel	4	844	June 11, 2018
Scaling of @threads for "embarrassingly parallel" problem Performance threads	29	1956	January 20, 2023
Probable data race condition causing problems when trying to parallelize a loop used to populate an array Performance distributed	14	191	August 4, 2024
Memory Increase with DistributedArrays in Loop General Usage question , memory , distributed	2	273	January 8, 2024

Displaying amount of data sent to worker

Related topics