Newbie question about loop fusion and broadcasting

cwindolf · June 11, 2020, 4:52pm

Dear JuliaLang,

I’m excited to get started with Julia as a long-time Python user, tired of typing np.array. One thing that is really exciting to me is the ability to avoid writing my own ufuncs, especially avoiding the slow but easy to write frompyfunc ones.

One thing that is confusing me right now is the difference between, say, * and .*. Say I let x = randn(10), and I want to compute the outer product: x * x', aka x .* x'. These are of course the same, but from what I can tell they have different performance characteristics when combined with other broadcasted operations – the .* form can be fused with other broadcasted operations while * cannot.

However, to me, it seems like both of these are doing broadcasting, and that there is no room to interpret them differently from a mathematical point of view. So, is there an intuition for what underlying factor makes the difference for loop fusion? Is there just some loop fusion switch that gets turned on only when a . is present?

Thanks!

klaff · June 11, 2020, 5:03pm

It turns out they are both doing broadcasting, but not by a trick of syntax.

If you use @edit x * x', you can see that the called method of * explicitly calls broadcast.

From adjtrans.jl line 272:

# vector * Adjoint/Transpose-vector
*(u::AbstractVector, v::AdjOrTransAbsVec) = broadcast(*, u, v)

mbauman · June 11, 2020, 5:06pm

Is there just some loop fusion switch that gets turned on only when a . is present?

Yes, that’s precisely it. Broadcast’s loop fusion is a syntax thing. While Julia’s compiler could maybe someday get smart enough to look inside the * method to see that it’s “just” a broadcast — and thus can avoid allocating its temporary, it requires a rather complex escape analysis to make sure that this intermediate isn’t stored/used/re-used/changed/etc and a complex function analysis to ensure that it actually is doing something that is broadcast-compatible.

On the other hand, just looking at an expression and combining the .s is a very simple pass.

klaff · June 11, 2020, 5:07pm

I should add that a .* b is syntax for broadcast(*, a, b). See More Dots: Syntactic Loop Fusion in Julia for a good explanation.

cwindolf · June 11, 2020, 5:55pm

Thanks all for the interesting responses! I can tell this is a nice community. The “More Dots” article was especially helpful.

To make sure I’m getting it: that article says that dot-calls are sugar for broadcast. But, if I’m understanding right, it also seems to indicate that dots are explicitly required in order for loop fusion to kick in. So, although (as @klaff writes) x .* x' and x * x' both end up mapping onto calls to broadcast, only the former will trigger loop fusion (if there were more vectorized operations involved). Is that the right idea?

rdeits · June 11, 2020, 6:01pm

Yup!

mbauman · June 11, 2020, 6:07pm

Yes, that’s exactly right. For example: v .* v' .+ 2 is effectively the same as:

vt = v'
broadcast((x, xt)->x*xt + 2, v, vt)

Had I written this as v * v' .+ 2, then we’d first compute the result of * (regardless of the implementation of *) and then do the broadcasted addition, effectively:

temp = v * v'
broadcast(x->x+2, temp)

The consecutively-connected dots are how we define what goes into that inner “kernel” (the anonymous function in the cartoon equivalence above). To make this even more clear, you can compare the result of zeros(5) .+ rand() vs. zeros(5) .+ rand.().

cwindolf · June 11, 2020, 6:10pm

Whoah… no way…

klaff · June 11, 2020, 7:23pm

You’re welcome! But also, you asked a very astute question. I thought I understood dot fusion but didn’t really understand the purely syntactical nature of it.

klaff · June 12, 2020, 3:47pm

This is still making my brain hurt so here’s a question:
I think
zeros(5) .+ rand() is equivalent to
(+).(zeros(5), rand()) is equivalent to
broadcast(+, zeros(5), rand()).

Is this the right way to think about this one?
zeros(5) .+ rand.() is equivalent to
(+).(zeros(5),rand.()) is equivalent to
(x -> x+rand()).(zeros(5)) is equivalent to [Edit - added this line]
broadcast(x -> x+rand(), zeros(5))?

Topic		Replies	Views
Vectorized math Performance	12	667	February 17, 2021
Unnecessary materialize in broadcast? Performance broadcast	9	1733	August 28, 2018
Dot notation General Usage	5	8758	October 22, 2020
Broadcast and .* not the same General Usage	2	953	December 23, 2017
@. weird behavior New to Julia	10	636	February 13, 2020

Newbie question about loop fusion and broadcasting

Related topics