Undef array with customized index

linwaytin · May 1, 2021, 12:33pm

I’m curious whether there is a convenient way to define an array with custom indices.
In Fortran, we have real, dimension(2:6) :: array.
I know that in Julia we can use Array{Float64}(undef, 10) to declare a 10-element array with undefined contents.
We can then wrap it in an OffsetArray to get indices starting from any integer.
Is there an easy way, like array(Int, -3:3, -5:5) or Int[-3:3, -5:5], to get an array with custom indices?

Henrique_Becker · May 1, 2021, 1:42pm

I am not sure if a helper function exists, but it is not hard to write one. For example, the one-liner below works if you always use ranges to specify the axes:

oarray(::Type{T}, dims...) where {T} = OffsetArray(Array{T}(undef, length.(dims)...), dims...)
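A minimal usage sketch of the one-liner above, assuming OffsetArrays.jl is installed. The variable name A and the stored values are made up for illustration; the fill! call is there because undef leaves the contents arbitrary.

```julia
using OffsetArrays

# Helper from the post above: each argument is a range describing one axis.
oarray(::Type{T}, dims...) where {T} = OffsetArray(Array{T}(undef, length.(dims)...), dims...)

A = oarray(Int, -3:3, -5:5)   # contents are uninitialized at this point
fill!(A, 0)                   # give every element a defined value

A[-3, -5] = 1                 # first element of the array
A[3, 5] = 2                   # last element of the array

size(A)                       # (7, 11)
axes(A, 1)                    # index range of the first dimension, -3:3
```

Since the helper only wraps a plain Array, the usual functions such as size, axes and fill! should work on the result unchanged.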

linwaytin · May 1, 2021, 2:08pm

@Henrique_Becker Thanks, this is pretty close to what I want.
I hope this kind of syntax can get support on the language level in the future.

linwaytin · May 1, 2021, 2:23pm

Also, it seems OffsetArray is slower than a standard array when accessing elements.
Language-level support should allow the compiler to optimize this away.

sostock · May 1, 2021, 3:33pm

OffsetArrays already defines an undef constructor:

julia> OffsetArray{Int}(undef, -3:3, -5:3)
7×9 OffsetArray(::Matrix{Int64}, -3:3, -5:3) with eltype Int64 with indices -3:3×-5:3:
...

linwaytin · May 1, 2021, 6:19pm

I didn’t know this. Thanks!

Both answers are great.

lmiq · May 1, 2021, 7:28pm

Is it really slower? That sounds strange to me. I would like to know more about that.

In parallel, note that Julia offers many ways to iterate over the elements of an array that do not depend on how its indices are set. With those, interfacing with arrays from other languages is less error-prone.

for i in eachindex(v)

for (i,el) in pairs(v)

for el in v

for i in axes(v)[1]
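The loop headers above can be written out as complete loops. This is only a sketch, assuming OffsetArrays.jl is installed; the vector v and the function name sum_by_index are made up for illustration.

```julia
using OffsetArrays

v = OffsetArray([10, 20, 30], -1:1)   # valid indices are -1, 0 and 1

# Sum the elements by index without assuming where the indices start.
function sum_by_index(v)
    s = zero(eltype(v))
    for i in eachindex(v)
        s += v[i]
    end
    return s
end

sum_by_index(v)   # 60

# pairs yields each index together with its element.
for (i, el) in pairs(v)
    println("v[", i, "] = ", el)
end

# A hard-coded `for i in 1:length(v)` would fail here, because v[2] is out of bounds.
```

The same function also accepts an ordinary Vector, which is the point of writing loops this way.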

linwaytin · May 1, 2021, 8:02pm

What I meant is a simple element access like the following.

X = rand(10,10)
XO = OffsetArray(X, -5:4, -5:4)
@btime X[3, 3]   #  14.729 ns (1 allocation: 16 bytes)
@btime XO[3, 3]  #  17.698 ns (1 allocation: 16 bytes)

I’m still using Julia 1.5.4. Maybe this is the reason.

rdeits · May 2, 2021, 5:31am

Your benchmark isn’t measuring what you think: it’s dominated by the fact that X and XO are non-constant global variables. See the BenchmarkTools.jl README (GitHub - JuliaCI/BenchmarkTools.jl: A benchmarking framework for the Julia language) for more info.

Fixing this gives quite different results (on Julia 1.5.3):


julia> @btime ($X)[3, 3]
  1.753 ns (0 allocations: 0 bytes)
0.34069137833211705

julia> @btime ($XO)[3, 3]
  2.841 ns (0 allocations: 0 bytes)
0.7901133830907208

Updating to Julia 1.6.1 makes OffsetArrays even faster:

julia> @btime ($X)[3, 3]
  1.593 ns (0 allocations: 0 bytes)
0.41741256881369937

julia> @btime ($XO)[3, 3]
  2.250 ns (0 allocations: 0 bytes)
0.4126414953734878

What do you actually mean by “language-level support”? Most of Julia is implemented in Julia already, so there is no difference between “language-level” code and user code in terms of performance.

Skoffer · May 2, 2021, 6:29am

The recommended way is axes(v, 1)

jishnub · May 2, 2021, 8:04am

While there is a small overhead in “raw” indexing when accessing an individual element, this often does not matter much in practical applications where you access a large number of elements in a loop (see this issue in OffsetArrays.jl). Such differences, when they exist, are perhaps due to the bounds-checking being sub-optimal (there is another issue in OffsetArrays where we are trying to improve this).

Here is an example:

julia> using OffsetArrays

julia> f(x) = sum(xi for xi in x);

julia> g(x) = sum(x[i] for i in eachindex(x));

julia> g2(x) = sum((@inbounds x[i]) for i in eachindex(x));

julia> A = ones(2000,2000);

julia> AO = ones(1:2000, 1:2000);

julia> @btime f($A);
  6.686 ms (0 allocations: 0 bytes)

julia> @btime f($AO);
  6.754 ms (0 allocations: 0 bytes)

julia> @btime g($A);
  6.738 ms (0 allocations: 0 bytes)

julia> @btime g($AO);
  6.791 ms (0 allocations: 0 bytes)

julia> @btime g2($A);
  6.727 ms (0 allocations: 0 bytes)

julia> @btime g2($AO);
  6.724 ms (0 allocations: 0 bytes)

We see that with bounds-checking turned off, there is no performance gap anymore.

To answer the original question, you should be able to use the OffsetArray constructor to initialize an undefined array.

julia> OffsetArray{Float64}(undef, 2:3, 4:5)
2×2 OffsetArray(::Matrix{Float64}, 2:3, 4:5) with eltype Float64 with indices 2:3×4:5:
 0.0  0.0
 0.0  0.0

Another useful function is similar, which chooses an appropriate array type for you depending on the axes.

julia> similar(Array{Float64}, 2:3, 4:5)
2×2 OffsetArray(::Matrix{Float64}, 2:3, 4:5) with eltype Float64 with indices 2:3×4:5:
 6.94499e-310  6.94499e-310
 6.94499e-310  0.0
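The values printed above are whatever happened to be in memory, since undef and similar do not initialize the elements. As a sketch of the Fortran declaration from the original question (real, dimension(2:6) :: array), assuming OffsetArrays.jl is installed and using the made-up name a:

```julia
using OffsetArrays

# Rough counterpart of Fortran's `real, dimension(2:6) :: array`.
a = OffsetArray{Float64}(undef, 2:6)

fill!(a, 0.0)    # initialize explicitly; undef contents are arbitrary
a[2] = 1.5       # first valid index
a[6] = 2.5       # last valid index

firstindex(a), lastindex(a)   # (2, 6)
length(a)                     # 5
```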

linwaytin · May 2, 2021, 3:29pm

@rdeits Thanks, could you please explain why you use ($X) in the benchmark?
I can see the difference (no allocation) but I don’t understand why.

I thought proper language support might make the OffsetArrays more efficient.
I might be wrong, but what causes the different access time?
The difference is more than 0.5ns in your benchmark.
Is it from the bounds checking?

linwaytin · May 2, 2021, 3:32pm

@jishnub Thanks, I understand that in practice the overhead is not important.
Thank you for pointing out there are other ways to achieve what I want.

linwaytin · May 2, 2021, 3:34pm

@Skoffer Could you please elaborate on how to do this?

Skoffer · May 2, 2021, 4:18pm

I only meant that instead of axes(v)[1] one should use axes(v, 1)
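A small sketch of the two spellings side by side, assuming OffsetArrays.jl is installed; the matrix A and the values written into it are made up for illustration.

```julia
using OffsetArrays

A = OffsetArray(zeros(3, 4), 0:2, -1:2)   # rows 0:2, columns -1:2

# Both forms return the same index range for the first dimension.
axes(A, 1) == axes(A)[1]   # true

# Loop over each dimension's own axis; the inner loop runs down a column.
for j in axes(A, 2), i in axes(A, 1)
    A[i, j] = i + 10j
end

A[0, -1]   # -10.0
A[2, 2]    # 22.0
```

One practical difference is that axes(A, d) also accepts a d larger than ndims(A) and returns a length-one range, while axes(A)[d] throws an error in that case.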

linwaytin · May 2, 2021, 9:19pm

@Skoffer Thanks.

rdeits · May 3, 2021, 4:31am

Check the BenchmarkTools documentation here for an explanation of what’s going on: GitHub - JuliaCI/BenchmarkTools.jl: A benchmarking framework for the Julia language
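In short, and only as a sketch assuming BenchmarkTools.jl is installed: X below is a non-constant global, so Julia cannot assume its type inside the benchmarked expression. Writing $X splices the value into the benchmark, so only the indexing itself is timed. No timings are shown here because they depend on the machine and the Julia version.

```julia
using BenchmarkTools

X = rand(10, 10)    # non-constant global: its type could change at any time

@btime X[3, 3]      # also times the dynamic lookup of the untyped global X
@btime ($X)[3, 3]   # the value of X is interpolated, so only the indexing is timed
```

Wrapping the access in a function that takes the array as an argument should have a similar effect, since arguments have known types inside the function.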

linwaytin · May 3, 2021, 2:17pm

@rdeits Thanks! I didn’t know that.
