[ANN] MapUnroll.jl: Type stable, efficient map-like iterations

Alec_Loudenback · July 16, 2025, 3:39am

MapUnroll.jl will soon (end of this week) be registered in General if all goes well.

Copying the whole readme since it’s not very long:

Quickstart

To avoid issues and ensure performance with maps that use intermediate variables. Users should turn this pattern:

function simulate(n)
    x = 0.0

    map(1:n) do i
        x += exp(i)
        (timestep=i,state=x)
    end

end

Into this:

using MapUnroll

function simulate_unroll(n)
    out = UndefVector{Union{}}(n)
    x = 0.0

    @unroll 2 for i ∈ 1:n
        x += exp(i)
        out = setindex!!(out, (timestep=i,state=x), i)
    end
    out
end

Explanation

This package addresses situations where you would like to map over a collection and return a concretely typed array that depends on some intermediate variables. For example:

function simulate(n)
    x = 0.

    map(1:n) do t
        x += exp(i)
        (timestep=t,state=x)
    end

end

In the above code, x is effectively a global variable with respect to the closure created to the inner map. This means that x get’s “boxed” and is wrapped in a mutable container for use within the map’s loop.

Another potential problem is that map does not guarantee execution in sequential order, meaning that our simulation could end up being calculated out-of-order.

An alternative is to write a for loop. However, the user then needs to take care to create the appropriate output container. For simple types this may work, but for complex types we would prefer that the compiler infer what the output eltype of our output vector should be.

@unroll addresses this by ‘unrolling’ the loop, or making the first couple (default N=2) iterations occur before the for loop actually begins, thus letting the compiler calculate the type of the object that will be placed into the output vector.

Then, from MicroCollections.jl (UndefVector) and BangBang.jl (setindex!!), the output container can be efficiently expanded by the compiler to return type stable and performant code. UndefVector and setindex!! are re-exported from MapUnroll.jl for convenience.

Comparing the two versions of the simulation above:

The original simulate does not avoid boxing the intermediate variable:

julia> @code_warntype simulate(100)
...
Locals
  #15::var"#15#16"
  x::Core.Box
Body::Vector
1 ─       (x = Core.Box())
│   %2  = x::Core.Box
│         Core.setfield!(%2, :contents, 0.0)
│   %4  = Main.map::Core.Const(map)
│   %5  = Main.:(var"#15#16")::Core.Const(var"#15#16")
│   %6  = x::Core.Box
│         (#15 = %new(%5, %6))
│   %8  = #15::var"#15#16"
│   %9  = (1:n)::Core.PartialStruct(UnitRange{Int64}, Any[Core.Const(1), Int64])
│   %10 = (%4)(%8, %9)::Vector
└──       return %10

However simulate_unroll avoids this and has faster performance as a result:

julia> using BenchmarkTools
julia> @btime simulate(100)
  9.167 μs (407 allocations: 11.12 KiB)
julia> @btime simulate_unroll(100)
  233.583 ns (2 allocations: 1.62 KiB)

Credit

The original @unroll macro was developed by Mason Protter on the Julia Zulip.

Sevi · July 16, 2025, 11:26am

Sounds pretty useful!

Could you briefly clarify how this differs from what @unroll fom Unrolled.jl does?

My understanding is that MapUnroll.jl is not specifically restricted to create statically unrolled loops, but essentially provides a convenience for computing arbitrary element types for the final output container. The size etc. of the result does not need to be known at compile time and also dynamic dispatch might still be an issue if e.g. looping over abstractly typed collection. Is that correct?

aplavin · July 16, 2025, 1:25pm

The recommended approach is to turn this into

function simulate(n)
    x = Ref(0.0)
    map(1:n) do i
        x[] += exp(i)
        (timestep=i,state=x[])
    end
end

Is MapUnroll even more performant?

Alec_Loudenback · July 16, 2025, 5:07pm

Yes, the point of MapUnroll is to have a convenient way to have a dynamic output container that avoids type instability or the user needing to specify what might be a fairly complex output kind.

I hadn’t seen Unrolled.jl, but it appears that package:

is focused on statically sized collections (e.g. tuples or static arrays) and fully unrolling them.
Does not do anything with dynamic arrays.
Requires the user to specify the type of the output container.

Alec_Loudenback · July 16, 2025, 5:10pm

Using Ref was mentioned briefly in the associated Zulip discussion, but the point was raised that map does not explicitly guarantee execution order while the logic inside the map example above assumes sequential calculation. Therefore MapUnroll.jl has @unroll to force the user to use a for loop, which has a guaranteed order of execution.

bertschi · July 16, 2025, 9:04pm

Isn’t that what fold and friends are for?

function sim(n)
    accumulate((x,i) -> (timestep = i, state = x.state + exp(i)),
               1:n;
               init = (; state = 0.0))
end

aplavin · July 16, 2025, 10:25pm

That would be a useful piece of information to have in the package README.
That is, the package focus isn’t in making this calculation fast (because it is already fast with Ref + map), but in forcing in-order execution for potential fancy collections where map is not in-order.

Topic		Replies	Views
Loop unrolling for type stability New to Julia question	6	403	December 12, 2022
Unrolling loops over tuples - why so hard? Performance tuple , unrolling	14	1361	September 10, 2023
Efficient iteration with fixed size array types without `@generated` functions Performance	9	201	September 26, 2024
Why isn't a simple map type stable? General Usage	2	579	August 26, 2021
@foreach item in vector General Usage macros , unrolling	2	1156	October 1, 2021

[ANN] MapUnroll.jl: Type stable, efficient map-like iterations

Quickstart

Explanation

Credit

Related topics