Status of AxisArrays.jl

ufechner7 · September 12, 2019, 8:22am

Hello,
What is the status of this package?
Even though there was some activity lately it looks as if it lacks a little bit of love an care…
Can it already be used in a production system, or are a lot of hidden bugs to be expected?
Uwe

mauro3 · September 12, 2019, 8:41am

There is also the new DimensionalData.jl, announced recently ANN: DimensionalData.jl and GeoData.jl.

ufechner7 · September 12, 2019, 9:10am

Thanks for the hint!

tobias.knopp · September 12, 2019, 9:15am

Is somebody aware of a package that does not encode the axes in its type?

oxinabox · September 12, 2019, 9:20am

See also WIP: The Plan · Issue #1 · JuliaCollections/AxisArraysFuture · GitHub
and the two packages that do each of the main halves and are designed around being composable:

https://github.com/invenia/IndexedDims.jl/blob/master/src/IndexedDims.jl (WIP)
GitHub - invenia/NamedDims.jl: For working with dimensions of arrays by name (less WIP, but still not fully happy. Am using in production though)

There is a lot going on in the area, and I think that is great.
Many options/parts to shake out the best way.

@davidanthoff’s GitHub - davidavdav/NamedArrays.jl: Julia type that implements a drop-in replacement of Array with named dimensions

Note that if you want zero overhead at runtime (like NamedDims does),
you basically need to encode the axes in the type, so that they can be compiled out of existance,
during specialization.

oxinabox · September 12, 2019, 9:27am

You can use AxisArrays in production, Invenia does.
But they are very touchy and so can be frustrating to work with.

Any production system should have good enough integration tests that your confident that you are not hitting bugs (of course you are wrong, there are always bugs, but you want to minimize how often that happens)
As such any package can be used in production, the question is how frustrating will it be.

AxisArrays does not have hidden bugs that will sneak past you tests.
It has obvious bugs (/missing features),
like a ton of operations dropping the axes,
and missing overloads,
and confusing notation for how to work with things that are indexed with different Int ranges.
Which will easily be caught by your tests.

And it itself does still prevent certain categories of coding mistakes

tobias.knopp · September 12, 2019, 9:38am

Its not fully clear to me, why this needs to be like that. The axis are just metadata and indexing could be done on the raw array. Its clear that permutedims and slicing require some additional things but things can keep type stable as long as one uses traditional indexing.

What I am looking for is a way to represent a tomographic data (3D) that has some center, some pixelspacing, and some rotation matrix in space.

My issue with AxisArrays is that I, for instance, cannot change the center, or the rotation matrix without creating a new type.

Raf · September 12, 2019, 10:54am

DimensionalData.jl will definitely have more bugs than AxisArrays.jl, its only at 0.1.0! But they will also be fixed promptly if you post an issue.

@tobias.knopp I’m wondering what the negative consequences are for you of “creating a new type”? do you mean you have problem with creating a new struct instance with a different type than the original?

DimensionalData.jl is mostly functional so the objects are frequently rebuilt. The compiler can elide the allocations most of the time anyway.

tobias.knopp · September 12, 2019, 11:39am

For instance I have written a Gtk based data viewer that can display 4D tomographic data. It looks something like:

  struct DataViewer
    data:: ???
  end

With a simple Array, I can do

  struct DataViewer
    data::Array{Float32,4}
  end

With an AxisArray this is much more complicated. And no,

  struct DataViewer{T}
    data::T
  end

is not an option, because data can be changed at runtime.

oxinabox · September 12, 2019, 11:42am

You could just do:

mutable struct DataViewer
    data::AbstractArray
end

The performance implictations are not as bad as you might think,
and are kinda similar to the ones that one avoids by putting the axes into the types.

tobias.knopp · September 12, 2019, 11:42am

To get somewhat more concrete, here is the type that I am using

ImageMeta{Float32,5,AxisArray{Float32,5,Array{Float32,5},Tuple{Axis{:color,UnitRange{Int64}},Axis{:x,StepRangeLen{Quantity{Float64,𝐋,Unitful.FreeUnits{(mm,),𝐋,nothing}},Base.TwicePrecision{Quantity{Float64,𝐋,Unitful.FreeUnits{(mm,),𝐋,nothing}}},Base.TwicePrecision{Quantity{Float64,𝐋,Unitful.FreeUnits{(mm,),𝐋,nothing}}}}},Axis{:y,StepRangeLen{Quantity{Float64,𝐋,Unitful.FreeUnits{(mm,),𝐋,nothing}},Base.TwicePrecision{Quantity{Float64,𝐋,Unitful.FreeUnits{(mm,),𝐋,nothing}}},Base.TwicePrecision{Quantity{Float64,𝐋,Unitful.FreeUnits{(mm,),𝐋,nothing}}}}},Axis{:z,StepRangeLen{Quantity{Float64,𝐋,Unitful.FreeUnits{(mm,),𝐋,nothing}},Base.TwicePrecision{Quantity{Float64,𝐋,Unitful.FreeUnits{(mm,),𝐋,nothing}}},Base.TwicePrecision{Quantity{Float64,𝐋,Unitful.FreeUnits{(mm,),𝐋,nothing}}}}},Axis{:time,StepRangeLen{Quantity{Float64,𝐓,Unitful.FreeUnits{(s,),𝐓,nothing}},Base.TwicePrecision{Quantity{Float64,𝐓,Unitful.FreeUnits{(s,),𝐓,nothing}}},Base.TwicePrecision{Quantity{Float64,𝐓,Unitful.FreeUnits{(s,),𝐓,nothing}}}}}}},Dict{String,Any}}

oxinabox · September 12, 2019, 11:43am

what a glorious type.
Godlike and powerful.
Huge like a mountain.

tobias.knopp · September 12, 2019, 11:47am

AbstractArray is, what I am currently doing. My issue is that my data viewer has a compile time of more than 16 seconds. second call is 0.2 s. Therefore my hope is that making data DataViewer concrete will make compile time (seems to be basically inference) smaller.

tobias.knopp · September 12, 2019, 11:50am

Powerful it is, and I like AxisArrays design actually. But then, please write a simple function that changes the center of an AxisArray that has spatial dimensions in the first three dims. Not obvious how to do that.

tobias.knopp · September 12, 2019, 12:20pm

By the way: Is there an easy way for stripping the units from my type, while first converting things to SI of course (e.g. convert ms to s and so on). This would make the type much shorter in many cases.

Zach_Christensen · September 12, 2019, 12:41pm

For those who are working on these new array interfaces I’ve been working on a AbstractIndices.jl package. I don’t know if it will ever make it out the door though because I have a lot of other obligations. The heart of it is this file https://github.com/Tokazama/AbstractIndices.jl/blob/master/src/abstractindex.jl. The goal was to make something that relied on very little unique internal behavior so that any changes to performance in base (in terms of sorting, indexing, and maybe even multithreading) would just come along with it.

The idea is that the only obstacle to making any AbstractVector into an index is knowing how to transform the user input into the index for the to_index function in base. Once that’s done most indexing behavior is taken care of by to_axes in base.

The basic idea of how it works can be seen in the examples found here https://github.com/Tokazama/AbstractIndices.jl/blob/master/src/asindex.jl. It isn’t currently optimized for performance and I haven’t figured out the show method. Feel free to take whatever you want or let me know if you want some help implementing it in one of your packages. My only goal here is to have something that’s very flexible and maintainable.

tobias.knopp · September 12, 2019, 12:50pm

I should mention that coordination across packages is extremely important here. In the end, all what I want is

A = load("myimage.nii")
B = load("mydicomdata.dcm")
DataViewer(A)
DataViewer(B)

Zach_Christensen · September 12, 2019, 12:55pm

I can almost guarantee that this won’t be too much of an issue for the images interface as I have been specifically concerned with this very issue while rewriting the NIfTI package. If you take a look at some of the recent additions to ImageCore.jl, you’ll see that I’ve started implementing a minimal trait based interface that will hopefully allow compatibility across different array paradigms. It basically comes down to returning a NamedTuple for the image based traits. I imagine this would be possible no matter what the community converges on.

Raf · September 12, 2019, 1:06pm

So the problem is using a fixed-type mutable container of an immutable, functionally updated object that may change type in some of those updates. It would be interesting to see how using ::AbstractArray works to know how much of a problem this really is.

Raf · September 12, 2019, 1:29pm

AbstractIndices looks interesting, funny how many of us have been writing similar things at the same time. I wonder if it’s possible to end up with one package that fits all of our requirements.

Topic		Replies	Views
The fate of DimensionalArrays / AxisArrays in Julia, and which to actually use Specific Domains	7	3352	May 19, 2022
Converting between AxisArray like packages? General Usage	18	1448	February 27, 2021
Indexing by names, current favorites in the package space Data question , indexing , arrays	7	190	June 6, 2025
Including Named Dimensions in Base Internals & Design	2	1583	June 21, 2021
[ANN] AxisIndices Package Announcements package , announcement	15	1491	March 16, 2020

Status of AxisArrays.jl

Related topics