Custom flatten version is faster than `flatten()`

affans · October 8, 2018, 11:58pm

Consider an array of matrices i.e. a = [rand(3, 3) for i = 1:1000]. I have two ways to “flatten” this array:

function naive_flatten(a)
    ret = Array{Float64}(undef, 0)
    @inbounds for i=1:length(a) 
        append!(ret, vec(a[i]))
    end
end

using BenchmarkTools; using Base.Iterators; 
@btime aa = naive_flatten($a)
## built in function
@btime aa = collect(flatten($a))

  61.487 μs (2012 allocations: 222.64 KiB)
  204.570 μs (18015 allocations: 819.22 KiB)

Why is there such a big difference? My function shouldn’t even do well since I am not preallocating anything…

gpapo · October 9, 2018, 9:07am

I think that the problem is the current implementation of Iterators.flatten does not take advantage of many informations: first of all it treats arrays in a generic way, as we didn’t know their actual length, in fact if you see

https://github.com/JuliaLang/julia/blob/0d713926f85dfa3e4e0962215b909b8e47e94f48/base/iterators.jl#L880-L897

flatten_iteratorsize does not specialize on Union{HasLength, HasShape} unless the iterator element type is a tuple or a number.

I don’t know if there is a deeper motivation about that.

mschauer · October 9, 2018, 10:53am

This is not so much about flatten_iteratorsize but about the performance difference between

julia> function naive_flatten2(a)
           ret = Array{Float64}(undef, 0)
           for ai in a
               for aij in ai
                   push!(ret, aij)
               end
           end
           ret
       end
naive_flatten2 (generic function with 1 method)

julia> function naive_flatten3(a)
           ret = Array{Float64}(undef, 0)
           for ai in Iterators.flatten(a)
               push!(ret, ai)
           end 
           ret
       end
naive_flatten3 (generic function with 1 method)

julia> @btime aa = naive_flatten2($a);
  100.138 μs (14 allocations: 256.70 KiB)

julia> @btime aa = naive_flatten3($a);
  239.802 μs (18015 allocations: 819.22 KiB)

affans · January 2, 2019, 3:55am

Is this something worth opening an issue for?

carstenbauer · January 2, 2019, 9:38am

FWIW, I’d think it is.

Topic		Replies	Views
length(Iterators.flatten(...)) not working General Usage question	7	1300	April 7, 2019
Extreme memory usage seemingly caused by using the functions `map` and `Iterators.flatten` Performance memory , memory-allocation	2	701	November 4, 2018
Concatenating iterables without allocating memory Performance	34	7315	January 16, 2020
Iterating over several arrays New to Julia	14	331	March 24, 2021
Optimal way to flatten array General Usage array	5	883	April 9, 2021

Custom flatten version is faster than `flatten()`

Related topics