Preventing broadcast fusing

c42f · February 3, 2019, 2:39am

All excellent points and I’d probably do the same in production code.

The identity trick is manual loop “unfusing” and forces the allocation of a temporary array, exactly like making the temporary explicitly on a separate line. Deciding on which parts of a broadcast would better be materialized early, stored and reused in a separate loop is an optimization which depends on pretty high level knowledge of the cost of memory allocation, memory bandwidth vs the cost of redoing some computation in the inner loop. And these things also completely depend on the size of the arrays involved. I don’t expect the julia compiler to do this any time soon and it also seems at odds with the simple definition of broadcasting which we have now as a fully fused operation.

If I understand LICM correctly it’s a much more local optimization which given a loop structure decides which parts may be hoisted out as loop invariants. It would definitely help here (if it’s not done already), as it allows the following transformation

for i=1:n
    for j=1:n
        for k=1:n
            out[i,j,k] = exp(x[i]) * exp(y[j]) * exp(z[k])
        end
    end
end

to

for i=1:n
    ex = exp(x[i])
    for j=1:n
        ey = exp(y[j])
        for k=1:n
            out[i,j,k] = ex * ey * exp(z[k])
        end
    end
end

But even so, you’ve still got O(n^3) invocations of exp, whereas allocating temporary storage and splitting the loops brings you down to O(n).

Topic		Replies	Views
Confusion on performance when using the broadcasting macro @. vs explicit . operators Performance	7	165	March 27, 2025
Blog post: Loop fusion and vectorization in Julia 0.6 Internals & Design announcement , broadcast	28	8401	May 4, 2017
Performance of simple broadcasting operations with many arguments Performance performance , broadcast	15	1592	November 29, 2021
Bad performance in simple array broadcast operations Performance broadcast	5	1076	October 12, 2019
Optimising function for broadcast Performance broadcast	9	477	October 4, 2022

Preventing broadcast fusing

Related topics