How exactly are Julia Arrays Implemented?

astrobc1 · January 5, 2023, 6:13pm

I’m curious how exactly Julia arrays are implemented.

julia> t = Array{Float64, 2}
Matrix{Float64} (alias for Array{Float64, 2})

julia> isabstracttype(t)
false

julia> isprimitivetype(t)
false

julia> isstructtype(t)
true

julia> fieldnames(t)
()

Where is the data stored if there are no fields?

Oscar_Smith · January 5, 2023, 6:18pm

They are implemented in C. It would be nice to move more of the complexity to the Julia side eventually, but no one has gotten around to doing it yet.

mbauman · January 5, 2023, 6:26pm

In C:

github.com/JuliaLang/julia/blob/321c5f55165643cff299ae7d270cfb3d5df73779/src/julia.h#L172-L189

    JL_EXTENSION typedef struct {
        JL_DATA_TYPE
        void *data;
        size_t length;
        jl_array_flags_t flags;
        uint16_t elsize;  // element size including alignment (dim 1 memory stride)
        uint32_t offset;  // for 1-d only. does not need to get big.
        size_t nrows;
        union {
            // 1d
            size_t maxsize;
            // Nd
            size_t ncols;
        };
        // other dim sizes go here for ndims > 2

        // followed by alignment padding and inline data, or owner pointer
    } jl_array_t;
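
Although fieldnames shows nothing, most of what this struct holds can be observed from Julia through ordinary Base functions. The following is a minimal sketch and not part of the original post. It assumes only documented Base functions (pointer, unsafe_load, strides, GC.@preserve) and reads the element buffer directly, which should correspond to the `data` field above.

```julia
# A 2×3 matrix; Julia stores it column-major in one contiguous buffer.
A = [1.0 3.0 5.0;
     2.0 4.0 6.0]

p = pointer(A)                      # address of the first element
n = length(A)                       # total element count (cf. `length`)
elbytes = sizeof(eltype(A))         # bytes per element (cf. `elsize`)

# Read the raw buffer element by element. GC.@preserve keeps A alive
# while we work with the bare pointer.
vals = GC.@preserve A [unsafe_load(p, i) for i in 1:n]

println(vals)        # elements in memory order
println(size(A))     # dimension sizes (cf. `nrows`, `ncols`)
println(strides(A))  # element strides per dimension
```

Reading through a raw pointer is only for inspection here; ordinary indexing is the safe way to access the same memory.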

See also: What is special about Array, String, and Symbol?

uniment · January 5, 2023, 7:21pm

Would there be major benefits to be had? I imagine array dimensions would move into Julia’s type system—would this be valuable for dispatch or bounds checking?

Oscar_Smith · January 5, 2023, 9:48pm

There would be two main benefits.

1. Faster empty array construction. Right now, constructing a Float64[] takes about 20ns, and it should take more like 6ns (the difference is because we aren’t able to constant-propagate some of the information across the language barrier).
2. The implementation would involve adding a C type for a fixed-size mutable buffer, which would be a better fit for the lists used inside other data structures (e.g. Dict). Those currently carry some memory overhead from the multi-dimensionality and resizing that Arrays support.
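
For anyone who wants to check the construction cost on their own machine, here is a rough timing sketch. It is not from the original post, the helper name `ns_per_empty_vector` is made up for illustration, and the numbers depend heavily on hardware and Julia version. The compiler may also optimize part of the loop away, so treat the result as a ballpark figure rather than a proper benchmark.

```julia
# Hypothetical helper: average nanoseconds to construct one Float64[].
function ns_per_empty_vector(n::Int)
    sink = 0
    t0 = time_ns()
    for _ in 1:n
        v = Float64[]        # the construction being timed
        sink += length(v)    # touch the result so it is actually used
    end
    elapsed = time_ns() - t0
    return elapsed / n, sink
end

ns_per_empty_vector(10)                       # warm-up call to trigger compilation
ns, sink = ns_per_empty_vector(1_000_000)
println("about ", round(ns; digits=1), " ns per Float64[]")
```

A dedicated benchmarking package would give more trustworthy numbers than this hand-rolled loop.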

uniment · January 23, 2023, 5:33am

This seems like something that could have been mildly more ergonomic if Base Arrays were implemented in Julia (docstring for StaticArrays.jl SMatrix):

SMatrix{S1, S2}(mat::Matrix)

  Construct a statically-sized matrix of dimensions S1 × S2 using the data from mat. The parameters S1 and S2 are
  mandatory since the size of mat is unknown to the compiler (the element type may optionally also be specified).

stevengj · January 23, 2023, 7:31pm

No, the distinction between arrays with a runtime size (e.g. the built-in Array type) and a static/compile-time size (StaticArrays) is a semantic choice that has nothing to do with whether it is implemented in pure Julia. We don’t want Array to have a static size (part of the type), because that severely limits what you can do with it. StaticArrays are great but are much more specialized.
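
To make the runtime-size point concrete, here is a small sketch using only Base (it is an illustration added for clarity, not code from the post). Arrays of different sizes share one type, only the element type and number of dimensions are type parameters, and a Vector can grow in place without changing type.

```julia
A = zeros(2, 3)
B = zeros(4, 5)

# Different sizes, same concrete type: the size is a runtime property.
println(typeof(A) === typeof(B))    # both are Matrix{Float64}

# The number of dimensions IS a type parameter, so it can be read
# from the type alone, without an instance.
println(ndims(Matrix{Float64}))

# A Vector can be resized in place; its type never changes.
v = Float64[]
T_before = typeof(v)
push!(v, 1.0, 2.0, 3.0)
println(length(v), " elements, type unchanged: ", typeof(v) === T_before)

# Reshaping to another size with the same element count also keeps the type.
C = reshape(A, 3, 2)
println(typeof(C) === typeof(A))
```

If the size were a type parameter, each of these operations would have to produce a value of a different type, which is the limitation described above.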

uniment · January 24, 2023, 2:07am

Thanks, I see. I had given myself the impression that, as multidimensional arrays’ size is immutable, it might be something that’d be desirable to track in the type system or dispatch on. But I am a dilettante here.

Oscar_Smith · January 25, 2023, 1:29am

It is stored in the type system, but we also store it in the datatype (because it is faster to get that way and the sizes work out to make it basically free).

mkitti · January 25, 2023, 3:00am

If you are interested in viewing Julia’s C array structures, I recently wrote a wrapper in Undefs.jl:

github.com/mkitti/Undefs.jl/blob/main/src/JLArrays.jl

module JLArrays

export JLArray, isptrarray

struct JLArray{O}
    data::Ptr{Nothing}
    length::Csize_t
    flags::UInt16
    elsize::UInt16
    offset::UInt32
    nrows::Csize_t
    ncols::Csize_t
    other::O
end
@static if VERSION ≥ v"1.4.0"
    function Base.getproperty(jl_array::JLArray, s::Symbol)
        s == :maxsize && return getfield(jl_array, :ncols)
        flags = getfield(jl_array, :flags)
        s == :how       ?  flags & 0b0000000000000011       : # 2
        s == :ndims     ? (flags & 0b0000011111111100) >> 2 : # 9

(This file has been truncated.)
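
The two masks visible in the excerpt are enough to decode the `how` and `ndims` bit fields by hand. The sketch below is a standalone illustration, not the Undefs.jl implementation: the helper names `decode_how` and `decode_ndims` are made up, and it assumes the flag strings printed in the demonstration in this post are written most-significant bit first. Under that assumption the decoded values agree with the printed `how` and `ndims` lines.

```julia
# Hypothetical helpers using the same masks as the excerpt above.
decode_how(flags::UInt16)   = flags & 0b0000000000000011          # low 2 bits
decode_ndims(flags::UInt16) = (flags & 0b0000011111111100) >> 2   # next 9 bits

# Flag strings as printed in the demonstration (assumed MSB first).
flags_A = parse(UInt16, "1000100000001000"; base = 2)   # the 5×6 matrix
flags_B = parse(UInt16, "1100100000000111"; base = 2)   # the vec(A) view

println("A: how = ", Int(decode_how(flags_A)), ", ndims = ", Int(decode_ndims(flags_A)))
println("B: how = ", Int(decode_how(flags_B)), ", ndims = ", Int(decode_ndims(flags_B)))
```

The remaining bits (pooled, ptrarray, and so on) are decoded in the part of the file that is cut off here, so they are left out of this sketch.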

Here’s a demonstration.

julia> using Undefs: JLArray

julia> A = Array{Int}(undef, 5, 6)
5×6 Matrix{Int64}:
 0  0  0  0  0  0
 0  0  0  0  0  0
 0  0  0  0  0  0
 0  0  0  0  0  0
 0  0  0  0  0  0

julia> jla = JLArray(A)
JLArray{Nothing}:
   data: Ptr{Nothing} @0x00007f8300821800
 length: 30
  flags: 1000100000001000
        how: 0 (data is inlined, or a foreign pointer we don't manage)
      ndims: 2
     pooled: true
   ptrarray: false
   isshared: false
  isaligned: true
 elsize: 8
 offset: 0
  nrows: 5
  ncols: 6
  other: nothing

julia> B = vec(A);

julia> jla_B = JLArray(B)
JLArray{Nothing}:
   data: Ptr{Nothing} @0x00007f8300821800
 length: 30
  flags: 1100100000000111
        how: 3 (has a pointer to the object that owns the data)
      ndims: 1
     pooled: true
   ptrarray: false
   isshared: true
  isaligned: true
 elsize: 8
 offset: 0
  nrows: 30
maxsize: 30
  other: nothing

julia> jla = JLArray(A)
JLArray{Nothing}:
   data: Ptr{Nothing} @0x00007f8300821800
 length: 30
  flags: 1100100000001000
        how: 0 (data is inlined, or a foreign pointer we don't manage)
      ndims: 2
     pooled: true
   ptrarray: false
   isshared: true
  isaligned: true
 elsize: 8
 offset: 0
  nrows: 5
  ncols: 6
  other: nothing
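
The identical data pointers and the isshared flag flipping to true reflect that vec(A) reuses A's memory instead of copying it. This can be confirmed without Undefs.jl; the following short sketch uses only Base functions and is an added illustration rather than part of the original demonstration.

```julia
A = zeros(Int, 5, 6)
B = vec(A)                  # 30-element Vector sharing A's buffer

# Both arrays point at the same memory.
println(pointer(A) == pointer(B))

# Writing through B is visible through A.
B[1] = 42
println(A[1, 1])

# The shapes differ even though the storage is shared.
println(size(A), " vs ", size(B))
```

Use copy(vec(A)) instead if an independent buffer is needed.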
