Non-rectangular arrays in Julia

LaurentPlagne · April 29, 2018, 7:39am

Hi,

I have to manipulate non-rectangular multidimensional arrays.
For example, let a be a 2D array of floats with two columns, where each column has a different length:
length(a[:,1]) == n1 and length(a[:,2]) == n2, with n1 != n2.

I wonder if it would be a good idea to create my own array type MultiArray <: AbstractArray
but it would break the AbstractArray interface because the size method would not return a simple tuple…

Maybe a new EvenMoreAbstractArray with a shape function that returns a generalized form of size…

Any hints?

felix · April 29, 2018, 7:43am

Interesting, do you definitely need to represent it as a 2D array?

If yes, perhaps padding your data would be easier (you could pad it with garbage, and just record the lengths of the real data elsewhere).

If not, would an array of (variable-length) arrays work?
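
A minimal sketch of the two options, assuming small hand-written columns. The names `cols`, `lens`, `padded` and `col2` are made up for illustration, and NaN is just one possible choice of padding value:

```julia
# Array of (variable-length) arrays: each column keeps its own length.
cols = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0, 7.0, 8.0]]

# Padding: store everything in one rectangular matrix and record
# the real length of each column separately.
lens = length.(cols)                             # [3, 5]
padded = fill(NaN, maximum(lens), length(cols))  # 5×2 matrix filled with NaN
for (j, c) in enumerate(cols)
    padded[1:lens[j], j] .= c                    # copy the real data
end

# Only the first lens[j] rows of column j are meaningful.
col2 = view(padded, 1:lens[2], 2)
```

Padding wastes memory when the lengths differ a lot, which is the trade-off to weigh against the simpler indexing.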

fredrikekre · April 29, 2018, 7:51am

Seems like https://github.com/mbauman/RaggedArrays.jl is what you are looking for. It might need to be updated for Julia 0.6.

LaurentPlagne · April 29, 2018, 7:52am

Hi felix, thank you for this instantaneous reply!

The array-of-arrays solution is OK in my case (padding is not, because the lengths can be very different).
My only concern is that the indexing will be inhomogeneous. If a is a 4D array built as a 2D array of (differently sized) 2D arrays, the indexing syntax will be:
a[i,j][k,l]
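
One way the nested indexing could be hidden behind a single call is a thin wrapper. This is only a sketch: `BlockOfBlocks` is a hypothetical type, not taken from any package, and it deliberately does not subtype AbstractArray, since `size` would not be a simple tuple:

```julia
# Hypothetical wrapper around a 2D array of 2D blocks of different sizes.
struct BlockOfBlocks{T}
    blocks::Matrix{Matrix{T}}
end

# The first two indices select the block, the last two index into it.
Base.getindex(b::BlockOfBlocks, i::Int, j::Int, k::Int, l::Int) =
    b.blocks[i, j][k, l]
Base.setindex!(b::BlockOfBlocks, v, i::Int, j::Int, k::Int, l::Int) =
    (b.blocks[i, j][k, l] = v)

# Block (i, j) has size i×j, so the blocks are not all the same shape.
blocks = [zeros(i, j) for i in 1:2, j in 1:3]
a = BlockOfBlocks(blocks)

a[2, 3, 1, 2] = 7.0                      # homogeneous syntax
a[2, 3, 1, 2] == a.blocks[2, 3][1, 2]    # same element as a[i,j][k,l]
```

For hot loops it is probably still better to pull out the inner block once (`blk = a.blocks[i, j]`) and work on the plain Matrix.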

LaurentPlagne · April 29, 2018, 7:58am

Thank you for the RaggedArrays link!

Tamas_Papp · April 29, 2018, 9:05am

I have a package for something similar:
https://github.com/tpapp/RaggedData.jl

It also supports ingestion of data with an ex ante unknown number of elements per column.

LaurentPlagne · April 29, 2018, 9:55am

Thanks!
Actually, for my application I deal with ragged (I learned a new word) arrays of rectangular arrays.
I need the inner rectangular arrays to be really fast. I guess that the first solution (array of arrays) will be more efficient…

Tamas_Papp · April 29, 2018, 10:00am

It depends. For my application, mapping into a flat vector was the most efficient, because it uses the least memory: I was memory-constrained and I have lots of small vectors, with e.g. 5–100 elements each. I think the same approach can be extended to arrays. But make sure you profile and benchmark. Also, I am seeing a lot of speedups on v0.7 compared to v0.6.
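
A rough sketch of the flat-vector idea, assuming a simple offsets layout. This is not the RaggedData.jl API; `lens`, `offsets`, `data` and `column` are illustrative names:

```julia
# All columns are stored back to back in one flat vector.
lens = [3, 5, 2]                  # length of each column
offsets = cumsum([1; lens])       # [1, 4, 9, 11]: column j starts at offsets[j]
data = collect(1.0:sum(lens))     # flat storage with 10 elements

# Non-copying view of column j.
column(data, offsets, j) = view(data, offsets[j]:offsets[j+1]-1)

c2 = column(data, offsets, 2)     # elements 4.0 through 8.0
```

Only one allocation is needed for the data, at the cost of an extra offset lookup per column access.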

LaurentPlagne · April 29, 2018, 10:05am

Thank you for the tips. I will experiment with the different options.

chakravala · April 29, 2018, 10:47am

Why not use an Array{<:Array{<:Any,1},1} for that?
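
For instance, a small sketch of how that type could be used for dispatch (`total_length` is just an illustrative helper):

```julia
a = [rand(3), rand(5)]             # Vector{Vector{Float64}}
a isa Array{<:Array{<:Any,1},1}    # true

# Accepts any vector of vectors, whatever the element type.
total_length(cols::Array{<:Array{<:Any,1},1}) = sum(length, cols)

total_length(a)                    # 8
```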

LaurentPlagne · April 29, 2018, 11:12am

Yes, I guess that is what felix proposed (an array of arrays). I think I will go that way.
