Should `reshape` have an option for row-major order?

tparker · November 7, 2017, 7:17am

Currently reshape can only rearrange the raw linearly ordered data into Julia’s default column-major order (or colexicographical order, for higher-dimensional arrays). Would it be worth adding an optional keyword argument that reshapes the data in row major (or generally lexicographical) order? This would be convenient e.g. for interfacing with data generated by or for other programming languages that use the row-major convention (e.g. here). It’s fairly easy to reshape an array into an n \times m row-major matrix by reshaping it into an m \times n column-major matrix and then taking the transpose, but this gets cumbersome for higher-dimensional arrays.

(Note that I’m only suggesting changing the visible behavior of reshape, not the way that the resulting array is stored internally.)

Tamas_Papp · November 7, 2017, 7:32am

permutedims?

tparker · November 7, 2017, 7:43am

Okay, fair, I guess permutedims(reshape(A, p, o, n, m), [4, 3, 2, 1]) isn’t that much more cumbersome than reshape(A, m, n, o, p, ordering = rowmajor) would be. Maybe my suggestion isn’t necessary.

carstenbauer · November 7, 2017, 9:47am

I think it is still a valid point. Although permutedims(reshape(A, p, o, n, m), [4, 3, 2, 1]) is nice, it is conceptually different from the order="row" version in that one still has to “think” for a second.

I would, besides order="row" and order="col", even suggest to follow numpy’s example and add something like order="C" and order="F" or order="Fortran" etc. for most common languages. I found those options really convenient.

Tamas_Papp · November 7, 2017, 10:17am

The key about reshape is that it does not rearrange the data, it just presents it differently. It even shares structure:

julia> A = ones(4)
4-element Array{Float64,1}:
 1.0
 1.0
 1.0
 1.0

julia> B = reshape(A, 2, :)
2×2 Array{Float64,2}:
 1.0  1.0
 1.0  1.0

julia> B[2,2] = 3.0
3.0

julia> A
4-element Array{Float64,1}:
 1.0
 1.0
 1.0
 3.0

What you are asking for is a useful operation (especially if interfacing with languages that use row-major), but it may be better to give it a different name.

tparker · November 7, 2017, 4:01pm

I agree that “rearrange the data” was a poor choice of works in my OP. But couldn’t the current implementation of reshape and my proposed “row-major reshape” simply be two different ways to “present the data differently”? I don’t see why they are conceptually different enough to require two different names.

nalimilan · November 7, 2017, 4:18pm

As @Tamas_Papp showed, currently reshape always returns a view on the same data. The behavior you request (changing to row-major order) requires allocating a new copy. This is a significant change which deserves a separate function IMHO.

Tamas_Papp · November 7, 2017, 4:21pm

Not efficiently, no. In principle a “view” could be created for permuted dimensions, but it is vastly more complicated than a reshape.

tparker · November 7, 2017, 5:49pm

You can’t change to row-major order just by changing the array strides? For example, if your raw data has six elements, then (schematically) setting rowstride = 1, columnstride = 3 gives a column-major 3x2 matrix, rowstride = 1, columnstride = 2 gives a column-major 2x3 matrix, rowstride = 2, columnstride = 1 gives a row-major 3x2 matrix, and rowstride = 1, columnstride = 3 gives a row-major 2x3 matrix.

Tamas_Papp · November 7, 2017, 6:25pm

reshape also preserves linear indexing, ie

reshape(A, ...)[i] == A[i]

for all i in 1:length(A).

Again, what you are asking for is a perfectly reasonable feature, but you should call it something else.

carstenbauer · November 7, 2017, 6:34pm

numpy:

This will be a new view object if possible; otherwise, it will be a copy. Note there is no guarantee of the memory layout (C- or Fortran- contiguous) of the returned array.

tim.holy · November 7, 2017, 6:42pm

PermutedDimsArray does create a view; permutedims is the “eager” version which creates a copy.

@carstenbauer, Julia developers have tended to view “it might be A or it might be B depending on circumstance” as less than desirable; for example if you call those functions and then set elements in the resulting array, what will happen? Consequently among Julia devs there has been a drive to be 100% predictable, and it’s currently viewed as a bug when it’s not.

carstenbauer · November 7, 2017, 6:45pm

@tim.holy Yes, I know and couldn’t agree more. I like this viewpoint very much. I was more citing this for completeness. I looked it up because I was curious how numpy handles this issue.

carstenbauer · November 8, 2017, 6:03pm

So, we could either do something similar to this

change_major_order(X::AbstractArray, sizes...=size(X)...) = permutedims(reshape(X, sizes...), length(sizes):-1:1)

which performs the multidimensional transpose (as proposed above), or go with functions like row_major(X, sizes...), col_major(X, sizes...) which always brings X into row/column-major order.

In any case, we should certainly mention in the doc that none of this changes the actual structure in memory.

Tamas_Papp · November 8, 2017, 6:21pm

I am not sure what you mean here. permutedims does rearrange the elements (of the copy). To see this, call vec on the result.

carstenbauer · November 8, 2017, 6:34pm

Sorry, what I meant was that the final Array is always ~~row-major~~ column-major order in the sense that Julia always uses ~~row-major~~ column-order internally. If I switch to ~~column-major~~ row-major order by using change_major_order the Array will look/be like expected, what has been a row before is now a column, but internally I still have to iterate over ~~rows first~~ columns to be fast.

julia> A = rand(2,2)
2×2 Array{Float64,2}:
 0.48992   0.135993
 0.856454  0.625905

julia> A_row_order = change_major_order(A)
2×2 Array{Float64,2}:
 0.48992   0.856454
 0.135993  0.625905

julia> A_row_order[1] = 123
123

julia> A
2×2 Array{Float64,2}:
 0.48992   0.135993
 0.856454  0.625905

It does produce a copy but ~~A_col_order~~ A_row_order is still stored in column-major order. I am still fast if I iterate over ~~rows first~~ columns.

ScottPJones · November 8, 2017, 6:47pm

Actually, it’s the opposite, Julia is always column-major (Fortran) order internally.

carstenbauer · November 8, 2017, 6:51pm

Argh, thanks, too much switching between orders. But my statement holds
I update my post to not confuse anyone else.

tparker · November 8, 2017, 8:05pm

@carstenbauer Sorry to nitpick, but for completeness you should edit post #16 to change both mentions of “iterate over rows” to “iterate over columns”. Also, note that your proposal in post #14 isn’t quite right; you need to reverse the desired sizes before the transpose so that they end up right after the transpose. So it would need to be something more like

change_major_order(X::AbstractArray, size...=size(X)...) = permutedims(reshape(X, reverse([size...])...), length(size):-1:1)

carstenbauer · November 8, 2017, 8:20pm

No need to be sorry, in fact, I should be sorry
By “iterate over rows first” I was thinking of two for loops and the inner one should be the row index. But indeed, this is iterating columns. Changed it. Your second point is also right.

But, overlooking my incompetence, what about change_major_order(X) vs row_major(X) and col_major(X)?

Topic		Replies	Views
Is there a way to reshape a Vector to a row-major AbstractMatrix? Performance	4	2722	October 12, 2020
Row major arrays? General Usage	2	766	February 21, 2018
Why column major? General Usage question , array , linearalgebra , column-major	59	19241	February 17, 2024
Optimal column to row major conversion Performance question , arrays , row-major	7	900	June 7, 2023
What are the pros and cons of row/column major ordering? Internals & Design arrays , column-major , row-major	10	3137	February 13, 2024

Should `reshape` have an option for row-major order?

Related topics