Transducer allocations

astadmistry · October 11, 2021, 5:03pm

I’m trying to use transducers for data manipulation, such as computing a rolling simple moving average. The code works fine, but why does it allocate memory when I’ve already preallocated the output? (See the last two functions.)

using DataFrames
using Statistics: mean  # provides `mean`, used in sma! below
using BenchmarkTools
using Transducers
using Transducers: SIMDFlag, GetIndex, ZipSource, SetIndex, _map!

function _prepare_map(xf, dest, src, simd)
    # isexpansive(xf) && error("map! only supports non-expanding transducer")
    # TODO: support Dict
    indices = eachindex(dest, src)

    rf = reducingfunction(
        opcompose(ZipSource(opcompose(GetIndex{true}(src), xf)), SetIndex{true}(dest)),
        (::Vararg) -> nothing,
        simd = simd)

    return rf, indices, dest
end

function Base.map!(xf::Transducer, dest::AbstractArray, src::AbstractArray;
    simd::SIMDFlag = Val(false))
    _map!(_prepare_map(xf, dest, src, simd)...)
    return dest
end


function sma!(len, vec_in, vec_out)
    # map!(xf, dest, src) takes the destination first, then the source
    map!(opcompose(Consecutive(len, step=1), Map(mean)), vec_out, vec_in)
end

function main()
    N = 10^5
    df = DataFrame(:data => ones(N))
    sma_length = 10
    df[!,"data"] = 1:N
    df[!,"sma"] .= 0.
    
    sma!(sma_length, df.data, df.sma)
    df[!,"sma"] .= 0.
    @btime sma!($sma_length, $df.data, $df.sma)
end

main()

492.581 ms (299493 allocations: 19.83 MiB)

tkf · October 12, 2021, 7:40am

It’s an inference failure. I played with it in Cthulhu a bit but I couldn’t find out exactly where the compiler gives up. But I’d point out ZipSource and Consecutive are very complex transducers. So, the inference failure is (disappointing but) not surprising.

Meanwhile, if you “just” need to implement sma! on vectors, I think it’d be much less painful to write raw loops. Transducers like Consecutive become strictly necessary only when they are used within other non-trivial processing.
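
For illustration, the raw-loop approach could look roughly like the sketch below. The name `sma_loop!` and its argument order are hypothetical (they are not part of Transducers.jl), and the sketch assumes that positions before the first full window should simply be left untouched. It keeps a running sum, so it should not need any temporary arrays, although the running sum can accumulate floating-point rounding error on long inputs.

```julia
# Hypothetical helper, not part of Transducers.jl: trailing simple moving
# average written as a plain loop over a running sum.
function sma_loop!(out::AbstractVector, x::AbstractVector, len::Integer)
    length(out) == length(x) || throw(DimensionMismatch("out and x must have the same length"))
    len >= 1 || throw(ArgumentError("len must be positive"))
    first_full = firstindex(x) + len - 1   # first index with a complete window
    acc = zero(eltype(out))                # running sum of the current window
    @inbounds for i in eachindex(out, x)
        acc += x[i]                        # newest sample enters the window
        if i > first_full
            acc -= x[i - len]              # oldest sample leaves the window
        end
        if i >= first_full
            out[i] = acc / len             # window is full: store its mean
        end
    end
    return out
end

# Usage: a window of 3 over 1.0, 2.0, ..., 10.0
x = collect(1.0:10.0)
out = zeros(length(x))
sma_loop!(out, x, 3)   # out[1:2] stay 0.0, out[3] == 2.0, out[10] == 9.0
```

To check whether a version like this allocates on your machine, you could wrap the call in a function and benchmark it with `@btime`, interpolating the arguments as in the original post.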
