Question on semantics of loops, maps, and broadcast

klaff · November 4, 2019, 6:38pm

I’ve been playing with writing things in different forms, for example the following:

function f_by_loop!(data,x)
    @inbounds for i in eachindex(data)
        data[i] = min(x,data[i])
    end
end

vs. data .= min.(x, data) or map!(y -> min(y, x), data, data), where, for example, data = randn(100_000_000) and x = 1.0.

These do the same thing, and once I added @inbounds to the loop, all three performed similarly.
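As a quick sanity check, the three forms can be compared on a small array. This is only a sketch: the array size is reduced from the one in the post, and the `return data` line and the variable names `a`, `b`, `c` are additions for illustration.

```julia
# Loop version from the post, with an explicit return added for convenience.
function f_by_loop!(data, x)
    @inbounds for i in eachindex(data)
        data[i] = min(x, data[i])
    end
    return data
end

data = randn(1_000)   # much smaller than the 100_000_000 used in the post
x = 1.0

a = f_by_loop!(copy(data), x)                 # explicit loop
b = copy(data); b .= min.(x, b)               # in-place broadcast
c = copy(data); map!(y -> min(y, x), c, c)    # in-place map

# All three should hold the same values, each capped at x.
println(a == b == c)
```

Each variant works on its own copy, so the comparison is not affected by the in-place updates.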

The broadcast form is definitely the most concise and I’m guessing the least error prone.

My question is: Is broadcast also preferable to the manual loop because it does not unnecessarily specify an order of operations, making it easier for the compiler to optimize?

rdeits · November 4, 2019, 7:41pm

In my experience, the advantages of broadcasting are:

Loop fusion (More Dots: Syntactic Loop Fusion in Julia), which can also be achieved with manual loops, just more verbosely.
Support for efficient operations on more esoteric containers. For example, broadcasting over a sparse array can avoid unnecessarily traversing all of the zero entries:

julia> using SparseArrays, BenchmarkTools

julia> function f_loop!(y, x)
         for i in eachindex(y)
           @inbounds y[i] = x[i]
         end
       end
f_loop! (generic function with 1 method)

julia> x = sprand(100, 100, 0.001);

julia> @btime y .= $x setup=(y = similar(x));
  37.082 ns (0 allocations: 0 bytes)

julia> @btime f_loop!(y, x) setup=(y = similar(x))
  60.394 μs (0 allocations: 0 bytes)

There’s an efficient way to iterate over only the nonzeros in a sparse array, but broadcasting is smart enough to do that for you.
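For reference, a manual loop over only the stored entries might look like the sketch below. It assumes the matrix is a `SparseMatrixCSC` (the type `sprand` returns) and uses `rowvals`, `nonzeros`, and `nzrange` from SparseArrays; the function name `sum_stored` is hypothetical and just sums the stored values as a stand-in for real work.

```julia
using SparseArrays

# Visit only the stored entries of a CSC sparse matrix, column by column.
function sum_stored(A::SparseMatrixCSC)
    rows = rowvals(A)      # row index of each stored entry
    vals = nonzeros(A)     # value of each stored entry
    total = zero(eltype(A))
    for j in axes(A, 2)            # columns
        for k in nzrange(A, j)     # positions of column j's stored entries
            i = rows[k]            # entry (i, j) has value vals[k]
            total += vals[k]
        end
    end
    return total
end

x = sprand(100, 100, 0.01)
println(sum_stored(x) ≈ sum(x))
```

The work here scales with the number of stored entries rather than with the full 100 × 100 size, which is the same saving the broadcast version gets automatically.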

Broadcasting’s extensibility also enables clever packages like https://github.com/tkoolen/TypeSortedCollections.jl, which allows broadcasting across heterogeneous containers.
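On the loop fusion point, here is a minimal sketch of what fusion buys you. The expression and the name `fused_loop!` are made up for illustration; the idea is that a chain of dotted operations runs as one pass with no temporary arrays, which a hand-written loop can also do, just more verbosely.

```julia
x = randn(1_000)
y = similar(x)

# Dotted form: the multiply, sin, and add fuse into a single loop writing into y.
y .= 2 .* x .+ sin.(x)

# Hand-written equivalent of the fused broadcast above.
function fused_loop!(out, x)
    @inbounds for i in eachindex(out, x)
        out[i] = 2 * x[i] + sin(x[i])
    end
    return out
end

z = fused_loop!(similar(x), x)
println(y ≈ z)
```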

non-Jedi · November 4, 2019, 7:42pm

I’m not entirely sure what you mean by “preferable” (some options: more performant, considered better style, etc.), but with loops please do note that the @simd annotation can be used to indicate to the compiler that the loop iterations are independent and can be reordered.
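A sketch of how that annotation could be applied to the loop from the original post. The function name `f_by_loop_simd!` and the `return data` line are additions for illustration; whether `@simd` changes the generated code for this particular loop is not something this sketch demonstrates.

```julia
# Loop from the original post with @simd added. @simd asserts that the
# iterations are independent, so the compiler is free to reorder them.
function f_by_loop_simd!(data, x)
    @inbounds @simd for i in eachindex(data)
        data[i] = min(x, data[i])
    end
    return data
end

data = randn(1_000)
expected = min.(1.0, data)    # reference result, computed before mutating data
f_by_loop_simd!(data, 1.0)
println(data == expected)
```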

mancellin · November 4, 2019, 9:34pm

Note also that, even though map and broadcast overlap almost exactly for functions of a single argument, they can mean different things for functions of two or more arguments.

Compare

map((x,y) -> x + y, ones(1, 4), ones(4))

and

broadcast((x,y) -> x + y, ones(1, 4), ones(4))
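To make the difference concrete, the two calls can be inspected side by side. This is a sketch with variable names (`a`, `b`, `m`, `bc`) added for illustration: map pairs the elements up in iteration order, while broadcast expands the 1×4 row against the length-4 column.

```julia
a = ones(1, 4)   # 1×4 matrix (a row)
b = ones(4)      # length-4 vector (a column)

m  = map((x, y) -> x + y, a, b)        # pairs elements one-to-one
bc = broadcast((x, y) -> x + y, a, b)  # expands singleton dimensions

println(length(m))   # 4 results, one per pair
println(size(bc))    # (4, 4): every row/column combination
```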

klaff · November 5, 2019, 2:14am

Good point about the dimensional magic of broadcast.

klaff · November 5, 2019, 2:18am

Thanks,

I figured out that the loop version was using SIMD instructions once I applied @inbounds. I believe @simd wasn’t necessary because the example I chose involves no reduction.
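For contrast, here is a sketch of a loop that does involve a reduction, where `@simd` matters more: floating-point addition is not associative, so without the annotation the compiler has to keep the additions in order. The function name `sum_simd` is hypothetical.

```julia
# Floating-point sum: a reduction over the accumulator s. @simd permits
# reordering the additions, which is what allows the sum to be vectorized.
function sum_simd(v)
    s = zero(eltype(v))
    @inbounds @simd for i in eachindex(v)
        s += v[i]
    end
    return s
end

v = randn(10_000)
# Reordering can change rounding slightly, so compare approximately.
println(sum_simd(v) ≈ sum(v))
```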

klaff · November 5, 2019, 3:10am

Thanks for the link - that was helpful.
