Performance issue with use of eltype()?

anon94023334 · September 7, 2017, 6:07pm

I am at a loss to explain the performance difference between these two functions. Can someone help?

julia> function foo(a::AbstractVector)
           T = eltype(a)
           c = Set{T}[Set{T}() for x in a]
           return length(c)
       end
foo (generic function with 1 method)

julia> function bar(a::Vector{T}) where T
         c = Set{T}[Set{T}() for x in a]
         return length(c)
       end
bar (generic function with 1 method)

julia> a = rand(1:100_000, 2_000_000);

julia> eltype(a)
Int64

julia> @benchmark foo($a)
BenchmarkTools.Trial:
  memory estimate:  961.30 MiB
  allocs estimate:  10000004
  --------------
  minimum time:     3.654 s (14.18% GC)
  median time:      3.797 s (17.07% GC)
  mean time:        3.797 s (17.07% GC)
  maximum time:     3.939 s (19.76% GC)
  --------------
  samples:          2
  evals/sample:     1

julia> @benchmark bar($a)
BenchmarkTools.Trial:
  memory estimate:  961.30 MiB
  allocs estimate:  10000003
  --------------
  minimum time:     283.377 ms (0.00% GC)
  median time:      983.086 ms (65.25% GC)
  mean time:        1.080 s (68.63% GC)
  maximum time:     2.720 s (87.05% GC)
  --------------
  samples:          6
  evals/sample:     1

julia> foo(a) == bar(a)
true

yuyichao · September 7, 2017, 6:17pm

Please don’t cross post or at least link to the other places you post to.

It’s unrelated to eltype but closure capture variable. See my comment in https://github.com/JuliaLang/julia/issues/23618

anon94023334 · September 7, 2017, 6:18pm

@yuyichao this is a separate issue. Notice that the generator is explicit with Set{T}[...] in both functions, unlike the github issue I posted.

yuyichao · September 7, 2017, 6:19pm

Didn’t notice that although it’s actually still the same issue. The explicitly specified type hides the type instability on the final value but not in the loop.

anon94023334 · September 7, 2017, 6:20pm

Neither foo nor bar is warning of any type instability.

mauro3 · September 7, 2017, 6:21pm

This is fast for any abstract array, in case that is an issue:

function foobar(a::AV) where AV<:AbstractVector{T} where T
                c = Set{T}[Set{T}() for x in a]
                return length(c)
end

yuyichao · September 7, 2017, 6:28pm

The type instability is hiden in the (not inlined in this case) implementation of comprehension.

If you want to see it, you’ll need to look into the line that implements comprehention

      c::Array{Set{Int64},1} = $(Expr(:invoke, MethodInstance for copy!(::Array{Set{Int64},1}, ::Base.Generator{Array{Int64,1},getfield(Main, Symbol("##1#2")){DataType}}), :(Base.copy!), :($(Expr(:foreigncall, :(:jl_alloc_array_1d), Array{Set{Int64},1}, svec(Any, Int64), :(:ccall), 2, Array{Set{Int64},1}, :((Base.select_value)((Base.slt_int)(SSAValue(4), 0)::Bool, 0, SSAValue(4))::Int64)))), SSAValue(2)))::Array{Set{Int64},1}

And show the code_warntype of that. You can get a hint about it from ::Base.Generator{Array{Int64,1},getfield(Main, Symbol("##1#2")){DataType}} showing that the closure is only parametrized for {DataType} and not {Type{Int64}}.

anon94023334 · September 7, 2017, 6:28pm

Ah, cool. I understand. Thanks.

Topic		Replies	Views
`eltype` could be smarter Internals & Design	8	630	May 4, 2022
Should Generators finally be given eltype Internals & Design	25	2295	February 7, 2019
`TypeVar` or `eltype`, which is faster? Performance question	1	346	August 1, 2020
Type instability in list comprehensions General Usage repl , type-stability , comprehension	46	778	August 9, 2024
Can `eltype()` deduce the element type of a generator? Internals & Design data	30	2927	December 11, 2019

Performance issue with use of eltype()?

Related topics