Pre-allocating differently typed arrays for multiple dispatch

Adriel · November 27, 2018, 12:40am

I have a function that allocates a large matrix H each time.

function f(x::Vector{T}) where T <: Real
    H = Matrix{Complex{T}}(undef, 2, 3)
    # do stuff...
    H
end

I want pre-allocate H to speed up my code, but sometimes T is Float64, sometimes it’s a dual number or maybe something else.

I’m guessing this is a fairly common problem, but I didn’t know how to search for it. Is there a “best practice” way of pre-allocating arrays for each type and dispatching for the appropriate one?

kristoffer.carlsson · November 27, 2018, 12:54am

Typically you know T outside the big loop (or something) and then you can just allocate it there and pass it into f.

JaredCrean2 · November 27, 2018, 12:55am

The only way I know of is to make a type with the same static parameters, eg.

struct fData{T}
  H::Matrix{Complex{T}}

  function fData{T}() where {T}
    H = Matrix{Complex{T}}(undef, 2, 3)
    return new(H)
  end
end

function f(data::fData{T}, x::Vector{T}) where T <: Real
  H = data.h
  # do stuff
end

# in the caller, T must be known from somewhere
data = fData{T}()
x = rand(n)  # however x gets created
for i=1:1000
  f(x, data)
end

This approach effectively pushes the problem out one level, because data is allocated outside the for loop. Its gets more difficult the more levels you have though.

rdeits · November 27, 2018, 2:19am

We had some discussion about this issue over at Conveniently avoiding allocating new MechanismState objects · Issue #363 · JuliaRobotics/RigidBodyDynamics.jl · GitHub . Perhaps that kind of approach would work for you?

tkoolen · November 27, 2018, 6:36am

Yeah, that approach has been working quite well. I’ve since abstracted out the key parts into an AbstractTypeDict:

github.com

JuliaRobotics/RigidBodyDynamics.jl/blob/94b14f7dd22941918921180d9b20605a868f0638/src/caches.jl#L1-L64


      
          abstract type AbstractTypeDict end
          function valuetype end
          function makevalue end
          
          
function Base.getindex(c::C, ::Type{T}) where {C<:AbstractTypeDict, T}
              ReturnType = valuetype(C, T)
              key = (objectid(T), Threads.threadid())
              @inbounds for i in eachindex(c.keys)
                  if c.keys[i] === key
                      return c.values[i]::ReturnType
                  end
              end
              value = makevalue(c, T)::ReturnType
              push!(c.keys, key)
              push!(c.values, value)
              value::ReturnType
          end
          
          
"""
          $(TYPEDEF)

This file has been truncated. show original

You could just copy the AbstractTypeDict type, make a concrete subtype and implement:

a constructor
a valuetype method
a makevalue method

(Just pattern match StateCache or one of the other subtypes in that file).

sairus7 · November 27, 2018, 9:40am

function f(x::Vector{T}) where T <: Real
    H = similar(x)
    f!(H, x)
    H
end

function f!(out::Vector{T}, in::Vector{T}) where T <: Real
    ... # your code here
end

Edit: sorry, didn’t notice H has a different type.
You can use eltype(x) in caller script or function - and replace similar(x) with the type you need.

Tamas_Papp · November 27, 2018, 9:45am

I see this pattern a lot, but it implicitly relies on T being closed under f. This is a reasonable assumption, but can quickly break down when relying eg on multiple AD packages at the same time.

The fundamental problem is that in general, it is very hard to preallocate for an operation

d = f(a, b, c)

as

f!(d, a, b, c)

when a, b and c can be sufficiently generic, because predicting the type of d can be tricky.

My current approach to this problem is to

initially avoid it, not think about preallocation,
the first line of optimizations is to use immutables, eg StaticArrays, and hope for the best,
if I decide I really need it, use some kind of a heuristic to determine the element type.

For the last one, something like

T = typeof(g(one(eltype(a)), one(eltybe(b)), one(eltype(c)))
d = Matrix{T}(undef, I_know, the_dimensions)

where g somehow reflects the elementary operations for what is going on in f. But this can be brittle as the <: Real interface is not formally specified and packages can deviate from expectations.

Topic		Replies	Views
Preallocate an array with the element type, which is unknow before running? New to Julia	2	275	August 2, 2022
Allocation free alternative to typeof(similar(...) General Usage	3	573	November 10, 2018
Unexpected Allocations in Julia 0.5 General Usage	4	573	December 5, 2016
How to predict the type of a view(A, ...) General Usage	6	751	September 20, 2018
Initialise array without specific types Performance question	25	1992	April 30, 2021

Pre-allocating differently typed arrays for multiple dispatch

Related topics