Can the overhead of `myT{<:T}` compared to `myT{T}`, where `T` is a concrete type, be avoided?

frankwswang · November 16, 2024, 9:58pm

I understand that Vector{<:Float64} is not the same as Vector{Float64} as it allows the bottom type Union{} to be its element type and is (therefore) not a concrete type:

julia> Vector{Union{}} <: Vector{<:Float64}
true

julia> isconcretetype(Vector{<:Float64})
false

However, in practice, the user cannot construct an instance of Union{}.

Thus, I would imagine that any UnionAll of a composite type MyT{T}, MyT{<:T}, such that T is a concrete type, should add no overhead compared to MyT{T}. For example, Vector{<:Float64} should be as efficient as Vector{Float64}.

Unfortunately, this is not the case as of Julia 1.11.1:

julia> v1 = Vector{Float64}[rand(100) for _ in 1:10];

julia> v2 = Vector{<:Float64}[rand(100) for _ in 1:10];

julia> @benchmark mapreduce(sum, +, $v1)
BenchmarkTools.Trial: 10000 samples with 987 evaluations.
 Range (min … max):  51.570 ns … 102.128 ns  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     52.280 ns               ┊ GC (median):    0.00%
 Time  (mean ± σ):   53.195 ns ±   3.458 ns  ┊ GC (mean ± σ):  0.00% ± 0.00%

   ██       ▁▁                                                 ▁
  ███▇▇▇▇▇█████▇▇▇▇▆▇▆▆▇▇▇▆▆▆▇▄▆▅▅▆▆▅▆▅▄▄▄▄▄▅▆▆▅▆▅▆▅▅▅▅▅▆▅▅▆▅▇ █
  51.6 ns       Histogram: log(frequency) by time      70.9 ns <

 Memory estimate: 0 bytes, allocs estimate: 0.

julia> @benchmark mapreduce(sum, +, $v2)
BenchmarkTools.Trial: 10000 samples with 858 evaluations.
 Range (min … max):  138.811 ns … 620.629 ns  ┊ GC (min … max): 0.00% … 57.99%
 Time  (median):     140.909 ns               ┊ GC (median):    0.00%
 Time  (mean ± σ):   154.184 ns ±  38.408 ns  ┊ GC (mean ± σ):  2.36% ±  7.44%

  █▅▄▃▂▂▁▁     ▁                  ▂▂                            ▁
  ███████████████▇█▇▇▇▇▇▇▇██▇▆▆▅████▇▃▅▃▁▁▃▁▄▁▁▄▁▁▃▁▁▁▁▄▆▆▆▇▆▅▅ █
  139 ns        Histogram: log(frequency) by time        340 ns <

 Memory estimate: 160 bytes, allocs estimate: 10.

Can this performance degradation be fixed as the compiler gets more optimized/“smarter,” or are there some fundamental reasons that prevent it from happening? Thanks!

matthias314 · November 16, 2024, 10:56pm

That’s true, but you can construct a Vector{Union{}}:

julia> v = Union{}[]
Union{}[]

julia> typeof(v)
Vector{Union{}} (alias for Array{Union{}, 1})

julia> v isa Vector{<:Float64}
true

julia> Vector{<:Float64}[v, [1.0, 2.0]]
2-element Vector{Vector{<:Float64}}:
 Union{}[]
 [1.0, 2.0]

gdalle · November 16, 2024, 11:20pm

I’m curious, do you have a realistic setting where you would manipulate a Vector{<:Float64} that is not a Vector{Float64}. It seems to me that this overhead will never be faced in practice (@matthias314’s example is rather contrived).

Benny · November 17, 2024, 11:05am

Worse, they can have a non-zero length e.g. Vector{Union{}}(undef, 3) despite not containing any instances. isempty also returns false.

T where T<:Float64 and Union{Union{}, Float64} do simplify to Float64, so Vector{T where T<:Float64} simplifies to Vector{Float64} even though Vector{T} where T<:Float64 does not.

Topic		Replies	Views
Why isn't `Vector{<:Float64}` reduced to `Vector{Float64}`? General Usage type , parametric-types	16	418	July 28, 2024
Why are unions of concrete types not concrete? General Usage question , type	6	377	March 2, 2023
Union type confusion General Usage	6	159	May 16, 2024
Is Union{Missing, Float64} a concrete type? Performance	3	634	August 5, 2021
Make typeof(T::Type) = Type{T}? Performance question	0	321	June 13, 2020

Can the overhead of `myT{<:T}` compared to `myT{T}`, where `T` is a concrete type, be avoided?

Related topics