This function seems to allocate memory only if the argument types are not annotated in the definition

Joaquin_Rodriguez · February 14, 2024, 2:48am

I am defining a custom data type that behaves like a vector (in the linear algebra sense). I defined it as a structure with a 6-element StaticVector that holds the underlying data:

using StaticArrays

struct MyVectorType
    data::SVector{6, Float64}
end

One operation that I need is the cross product between instances of MyVectorType, which is defined by a particular combination of cross products between 3-component sub-vectors of the underlying data:

import LinearAlgebra: ×

function cross_product1(a::MyVectorType, b::MyVectorType)
    #split the vectors in two 3-component parts:
    a1 = a.data[SVector(1,2,3)]
    a2 = a.data[SVector(4,5,6)]
    b1 = b.data[SVector(1,2,3)]
    b2 = b.data[SVector(4,5,6)]

    #compute two combinations of cross products:
    part1 = a1 × b1
    part2 = a1 × b2 + a2 × b1

    #concatenate the two parts to form a new 6-component vector:
    return MyVectorType( (part1..., part2...) )
end

This function has been tested to make sure that it doesn’t allocate any memory:

#two random vectors:
a = MyVectorType(rand(6))
b = MyVectorType(rand(6))

using BenchmarkTools
@btime cross_product1(a,b);
5.357 ns (0 allocations: 0 bytes)

However, I have noted something very strange: if I remove the type annotations from the arguments in the function definition, the funtion now allocates memory!!

function cross_product2(a, b)
    #split the vectors in two 3-component parts:
    a1 = a.data[SVector(1,2,3)]
    a2 = a.data[SVector(4,5,6)]
    b1 = b.data[SVector(1,2,3)]
    b2 = b.data[SVector(4,5,6)]

    #compute two combinations of cross products:
    part1 = a1 × b1
    part2 = a1 × b2 + a2 × b1

    #concatenate the two parts to form a new 6-component vector:
    return MyVectorType( (part1..., part2...) )
end

@btime cross_product2(a,b);
20.113 ns (1 allocation: 64 bytes)

How can this possibly be?
The definitions of cross_product1 and cross_product2 are exactly the same, I just copy-pasted and removed the type annotations. And the function calls are also exactly the same, so the type annotations should not make a difference.

The amount of memory allocated is the same across many runs, so its not a compilation thing.

Can anyone explain this?

Oscar_Smith · February 14, 2024, 2:59am

This is absolutely bizarre. I can reproduce it.

jar1 · February 14, 2024, 3:16am


julia> let a=MyVectorType(rand(6)), b = MyVectorType(rand(6))
         (@allocations cross_product1(a,b)), (@allocations cross_product2(a,b))
       end
(0, 0)

julia> versioninfo()
Julia Version 1.10.0
Commit 3120989f39b (2023-12-25 18:01 UTC)

Build Info:
  Official https://julialang.org/ release
Platform Info:
  OS: Linux (x86_64-linux-gnu)
  CPU: 24 × AMD Ryzen 9 3900XT 12-Core Processor
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-15.0.7 (ORCJIT, znver2)
  Threads: 17 on 24 virtual cores

PetrKryslUCSD · February 14, 2024, 3:24am

See the same (reproduce allocations) on
julia> versioninfo()
Julia Version 1.10.0
Commit 3120989f39b (2023-12-25 18:01 UTC)
Build Info:
Official https://julialang.org/ release
Platform Info:
OS: macOS (arm64-apple-darwin22.4.0)
CPU: 24 × Apple M2 Ultra
WORD_SIZE: 64
LIBM: libopenlibm
LLVM: libLLVM-15.0.7 (ORCJIT, apple-m1)
Threads: 1 on 16 virtual cores

Dan · February 14, 2024, 3:29am

Allocation disappear when adding $s:

julia> @btime cross_product2(a,b);
  14.953 ns (1 allocation: 64 bytes)

julia> @btime cross_product2($a,$b);
  2.376 ns (0 allocations: 0 bytes)

Julia 1.10
OS: Linux (x86_64-linux-gnu)
CPU: 16 × 12th Gen Intel(R) Core™ i5-1240P

Joaquin_Rodriguez · February 14, 2024, 1:41pm

Allocation disappear when adding $s

This is good. So maybe the allocation is not from the function itself but from julia figuring out the types of the global variables?

The allocation also disappears if the types of the global variables are fixed:

c::MyVectorType = MyVectorType(rand(6))
d::MyVectorType = MyVectorType(rand(6))
@btime cross_product2(c,d);
  3.578 ns (0 allocations: 0 bytes)

Topic		Replies	Views
Extra allocation with `T::DataType`? Performance	17	595	August 19, 2022
Why does this function allocate memory? Performance	6	406	November 22, 2022
Unexpected memory allocation behavior General Usage memory-allocation	6	749	January 2, 2022
Memory allocations when returning vectors General Usage array , memory-allocation	15	1491	June 6, 2018
Impact of specifying input types on function performance, when referencing arguments from Vector{Any} Performance	16	608	November 10, 2022

This function seems to allocate memory only if the argument types are not annotated in the definition

Related topics