Why is identity on an Any vector so much slower when broadcasting than when mapped?

numberMX · March 5, 2023, 7:30am

In the REPL I was experimenting with a function that returns Vector{Any}, and I wanted to see how I could make Julia “automatically” change the type to a fixed type without having to specify it myself. I noticed that even though the end result is the same, the performance can vary widely depending on how I do it.

I found a simple example that shows an extreme difference: (I’ll suppress the outputs of the vectors because they’re long)

julia> s = Any[i for i in 1:10000]

# 3 different ways to do it:
julia> [i for i in s] == map(identity, s) == identity.(s)
true
 
# Now to time them:
julia> using BenchmarkTools

julia> @btime [i for i in s]
  5.217 μs (5 allocations: 78.22 KiB)
10000-element Vector{Int64}:

julia> @btime map(identity, s)
  5.493 μs (6 allocations: 78.23 KiB)
10000-element Vector{Int64}:

julia> @btime identity.(s)
  107.125 μs (9502 allocations: 226.83 KiB)
10000-element Vector{Int64}:

Woah. The listcomp and the map are similar, but the broadcast is 19.5 times slower than the map.

Why is the broadcast so much slower here? (If it helps, I’m using Julia 1.8.5.)
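
To make "the end result is the same" concrete, here is a minimal sketch on a shorter vector. The names `a`, `b`, `c` and `mixed` are made up for this illustration. It only checks the element types of the results and says nothing about speed.

```julia
# Shorter stand-in for the 10000-element vector above.
s = Any[i for i in 1:10]

a = [i for i in s]      # comprehension
b = map(identity, s)    # map
c = identity.(s)        # broadcast

# All three give equal contents and the same narrowed element type.
println(a == b == c)
println(eltype(a), " ", eltype(b), " ", eltype(c))

# With mixed element types the result is only narrowed to a common supertype.
mixed = Any[1, 2.0]
println(eltype(identity.(mixed)))
```

Note that in the mixed case the values themselves are left untouched; only the container's element type changes.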

Raf · March 5, 2023, 12:30pm

Just to confuse you a little more, here is what I get on Julia 1.9 on my laptop:

julia> @btime [i for i in s];
  8.824 μs (5 allocations: 78.22 KiB)

julia> @btime map(identity, s);
  12.602 μs (21 allocations: 78.94 KiB)

julia> @btime identity.(s);
  10.903 μs (13 allocations: 78.56 KiB)

hexaeder · March 5, 2023, 12:56pm

Can’t time it right now but there is also an option along those lines

function narrow_type(A::AbstractArray)
    isconcretetype(eltype(A)) && return A
    elt = mapreduce(typeof, promote_type, A)
    convert.(elt, A)
end
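
For illustration, this is roughly how the function behaves on small inputs. The input vectors are made up for the example, and the function body is repeated only so that the snippet runs on its own.

```julia
function narrow_type(A::AbstractArray)
    isconcretetype(eltype(A)) && return A      # nothing to narrow
    elt = mapreduce(typeof, promote_type, A)   # common promoted type of all elements
    convert.(elt, A)                           # convert every element to that type
end

ints  = narrow_type(Any[1, 2, 3])   # all Int, so the promoted type is Int
mixed = narrow_type(Any[1, 2.0])    # Int and Float64 promote to Float64

concrete = [1.0, 2.0]
same = narrow_type(concrete)        # already concrete: the input itself is returned

println(typeof(ints))
println(typeof(mixed))
println(same === concrete)
```

Two things to keep in mind with this approach: it converts values (the `1` in the mixed vector becomes `1.0`), whereas `identity.(Any[1, 2.0])` keeps the values and yields a `Vector{Real}`; and since `mapreduce` is called without an `init`, an empty `Any[]` would throw rather than return an empty array.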

numberMX · March 6, 2023, 4:15pm

Ah, so it seems to be fixed in 1.9 then - that’s good news! The differing number of allocations between the 3 methods is interesting though, wonder what causes that. And also it seems that listcomps still reign supreme for now.

Raf · March 6, 2023, 4:47pm

Yeah, and map seems to be relatively slower?

DNF · March 6, 2023, 4:49pm

It used to be that one couldn’t trust the allocation estimates without using variable interpolation. Is that improved?

Raf · March 6, 2023, 5:04pm

It seems that not interpolating just adds 1 allocation for the comprehension and for map, but 2 for the broadcast.

julia> @btime [i for i in $s];
  8.197 μs (4 allocations: 78.20 KiB)

julia> @btime map(identity, $s);
  10.869 μs (20 allocations: 78.92 KiB)

julia> @btime identity.($s);
  10.891 μs (11 allocations: 78.53 KiB)

And the time difference with map disappears too.

numberMX · March 7, 2023, 8:16am

hexaeder:

Can’t time it right now but there is also an option along those lines
function narrow_type(A::AbstractArray)
    isconcretetype(eltype(A)) && return A
    elt = mapreduce(typeof, promote_type, A)
    convert.(elt, A)
end

Interesting, thanks - learning about some new functions here (hadn’t heard of isconcretetype before). I tried this out, and it works, but unfortunately it’s slower than the broadcast:

julia> @btime narrow_type(s)
  1.528 ms (9502 allocations: 226.84 KiB)
10000-element Vector{Int64}:
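
To see which of the two steps the time goes into, one could time them separately. This is a rough sketch using only `@elapsed` from Base; `find_type` and `convert_all` are hypothetical helper names that just wrap the two expressions from `narrow_type`. For real measurements `@btime` with an interpolated `$s` would be the better tool.

```julia
s = Any[i for i in 1:10000]

# Step 1: find a common element type (same expression as in narrow_type).
find_type(A) = mapreduce(typeof, promote_type, A)
# Step 2: convert every element to that type.
convert_all(elt, A) = convert.(elt, A)

# The first calls also compile, so run both once before timing.
elt = find_type(s)
convert_all(elt, s)

t_find    = @elapsed find_type(s)
t_convert = @elapsed convert_all(elt, s)

println("type search: ", t_find, " s")
println("conversion:  ", t_convert, " s")
```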
