Bug in DistributedArrays? Pushing to first local part pushes to all local parts

jsbryan4 · April 8, 2022, 1:20am

I found what I believe to be a bug in DistributedArrays. I am trying to push to the first local part of an array but it ends up pushing to all of the local parts.

using Distributed
addprocs(4)
@everywhere using DistributedArrays

a = dfill([], 10)

s = @spawnat 2 push!(localpart(a)[1], 999)

a

I would expect the a to be [[999], [], [], [] ...], but it actually is [[999], [999], [999], [], ...]. This is unexpected and against my intuition for how it should behave.

skleinbo · April 8, 2022, 7:28am

This behavior is not specific to Distributed, but the way (d)fill operates. From the docstring

If x is an object reference, all elements will refer to the same object:

  julia> A = fill(zeros(2), 2);

  julia> A[1][1] = 42; # modifies both A[1][1] and A[2][1]

  julia> A
  2-element Vector{Vector{Float64}}:
   [42.0, 0.0]
   [42.0, 0.0]
end

Create the array locally first and then distribute

a = distribute([ [] for _ in 1:10 ])

There is no additional overhead since dfill allocates locally first anyway.

Topic		Replies	Views
DistributedArrays: unexpected behavior before modifying localpart Julia at Scale	1	569	November 29, 2017
How to access the localpart of a distributed array? Julia at Scale	8	1715	October 28, 2017
Distributed arrays in for loop General Usage parallel	2	70	August 28, 2024
Adding vs multiplying matrices with DistributedArrays General Usage distributed	7	609	May 25, 2021
Unable to call a distributed array in a parallel subprocess General Usage parallel , distributed	1	474	April 7, 2020

Bug in DistributedArrays? Pushing to first local part pushes to all local parts

Related topics