A better way to assign to [:, :, ... k]?

tuckermcclure · September 2, 2017, 1:37am

I’m writing a utility that collects a user’s n-dimensional matrices over time and stores them in an n+1 dimensional array. Something like this:

storage[:, :, k] = input

But of course I can’t literally type those colons, because the input could be anything (as long as it’s consistent from call to call). So I’ve written this to store the input matrix:

num_input_dims = length(size(input))
storage[repeat([:], num_input_dims), k] = input

For what it’s worth, in my case storage is actually part of an HDF5 file with a space preallocated for all of the data the user will pass in. The syntax for [:, :, :, k] is what the HDF5 package uses for input, and I don’t think strides will work for it.

My syntax above just feels a little silly, and I’m wondering if there’s a better way.

rdeits · September 2, 2017, 1:52am

I think slicedim() does what you’re looking for.

ChrisRackauckas · September 2, 2017, 2:12am

EllipsisNotation.jl at least gives nice syntax: storage[..,k]. In was fine for performance, but when we made it more generic it doesn’t infer properly anymore so you may want to use an older version, or help us finish it up .

https://github.com/ChrisRackauckas/EllipsisNotation.jl

bramtayl · September 2, 2017, 2:47am

Lispy tuple programming!

replace_last(t, i) = first(t), replace_last(Base.tail(t), i)...
replace_last(t::Tuple{T}, i) where T = (i,)

test = rand(5, 5, 5)
test_indices = indices(test)
test[replace_last(test_indices, 1)...]

Should be super-performant.

DNF · September 2, 2017, 5:54am

Why can’t you use a (pre-allocated) vector of n-dimensional matrices?

tuckermcclure · September 2, 2017, 5:06pm

Thanks, but if I understand correctly, this would be for reading, and I need to write with setindex! (it’s the method the HDF5 package has writing to data in files). Have I missed something?

This is more elegant than what I had. My code now looks like (taken from EllipsisNotation.jl):

colons = (Colon() for i in 1:length(size(input)))
storage[colons..., k] = input

Nice. I’m going with @ChrisRackauckas’s suggestion for clarity, but I wonder if this recursive implementation is any faster?

One could; it just won’t work for my application, where storage is (must be) a multidimensional array.

Thanks all. You helped me retire a TODO in my code, find a bunch of new functions, and figure out why @bramtayl’s second replace_last method would be interpreted as the base case.

ChrisRackauckas · September 2, 2017, 5:12pm

Only if you hit the splatting penalty, i.e. have more than 16 dimensions. The issue with EllipsisNotation.jl is a little bit different and has to do with inferrability when it generalized. If it infers correctly, any implementation should be about the same speed.

kristoffer.carlsson · September 2, 2017, 5:14pm

FWIW assigning to the last index is slow(er) since Julia is column major.

Edit. This is nonsense.

Elrod · September 2, 2017, 9:52pm

By assigning to

storage[:, :, k]

Isn’t he effectively assigning through the first two axis? That is, I thought the above would basically sort of be equivalent to:

for in 1:size(storage,2)
  for j in 1:size(storage,1)
    storage[j, i, k]
  end
end

in terms of memory access pattern, and that the whole block from both cases should be contiguous.
Am I mistaken, or did I misinterpret something?

kristoffer.carlsson · September 3, 2017, 5:21am

Yes, sorry. I don’t know what I was thinking…

Topic		Replies	Views
Indexing into multidimensional array for general n dimensions New to Julia question	2	281	December 29, 2020
How to implement macro for assign array of arbitrary dims General Usage metaprogramming	6	562	April 16, 2019
Assign a value for one dimension of multi-dimensional array New to Julia arrays	4	434	June 30, 2023
Indexing multidimensional arrays of arbitrary dimension Performance indexing , array	12	1559	July 19, 2019
Indexing array based on a vector including Colon() General Usage question	3	810	November 29, 2018

A better way to assign to [:, :, ... k]?

Related topics