How to distribute array by column?

Carol · May 10, 2019, 8:59pm

Background:

using Distributed
@everywhere using DistributedArrays
addprocs(4)

A=[1 1 2 2; 
   1 1 2 2; 
   3 3 4 4;
   3 3 4 4]
adist = distribute(A)

I found A is distributed in a checkerboard block way, which means

fetch( @spawnat 2 localpart(adist))

will return

[1 1;
 1,1]

But I want A to be distributed by column, which means

fetch( @spawnat 2 localpart(adist))

will return

[1,
 1,
 3,
 3]

How can I define the way of distribution? (eg. only by column)

Thanks,

mbauman · May 10, 2019, 9:48pm

You can just specify it with the dist keyword argument to specify the number of partitions per dimension:

julia> adist = distribute(A; dist=(1, nworkers()))
4×4 DArray{Int64,2,Array{Int64,2}}:
 1  1  2  2
 1  1  2  2
 3  3  4  4
 3  3  4  4

julia> fetch( @spawnat 2 localpart(adist))
4×1 Array{Int64,2}:
 1
 1
 3
 3

(As an aside, note that you want to @everywhere using after addprocsing)

Carol · May 10, 2019, 10:17pm

I tested that

adist2 = distribute(A; dist=(4, 1))  #by row
adist3 = distribute(A; dist=(1, 4))  #by column
adist4 = distribute(A; dist=(2, 2))  #checkerboard

The documentation said “dist optionally specifies a vector or tuple of the number of partitions in each dimension”.

But I still feel confused about the two parameters in dist(parameter1, parameter2)

Could you please explain with more detail? Thanks for your time.

mbauman · May 10, 2019, 10:22pm

The dist tuple has as many elements as there are dimensions in A. It specifies how many partitions (or splits) should be used within each dimension.

The first dimension is rows — and there we don’t want any partitions, so we use a 1.

The second dimension is the columns — and there we want as many partitions as there are workers.

Carol · May 10, 2019, 10:23pm

I got it. Thank you very much!

Topic		Replies	Views
Distributed § Dimensional Arrays General Usage distributed	2	364	October 29, 2020
Why do use @spawnat and fetch? Julia at Scale	3	2546	May 9, 2019
Adding vs multiplying matrices with DistributedArrays General Usage distributed	7	609	May 25, 2021
How to access the localpart of a distributed array? Julia at Scale	8	1713	October 28, 2017
DistributedArrays: basic element-wise vector operation is slow / fails Julia at Scale question	1	791	December 18, 2018

How to distribute array by column?

Related topics