Element-wise vector multiplication and fusing dot

aaowens · December 31, 2016, 4:27pm

I thought the last section, “The importance of higher-order inlining”, was very interesting. I’ve been wondering whether there is a tension between loop fusion and SIMD optimizations. It seemed to me that A .* B .+ C could end up faster than f(a, b, c) = a*b+c; f.(A, B, C) if the first version used SIMD. I thought the second version wouldn’t be able to do that, but now I see how inlining solves the problem.
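To make the comparison concrete, here is a minimal sketch of the two spellings I mean. The array length and the names fused, via_f, and out are just placeholders I picked for illustration.

```julia
# Scalar kernel from the post; the arrays below are arbitrary test data.
f(a, b, c) = a * b + c

A = rand(1000)
B = rand(1000)
C = rand(1000)

# Dotted operators: both dots fuse into one loop over the elements.
fused = A .* B .+ C

# Broadcasting the user-defined scalar function over the same arrays.
via_f = f.(A, B, C)

# Variant that writes into a pre-allocated output instead of allocating.
out = similar(A)
out .= f.(A, B, C)
```

Both forms compute a*b+c per element, so the results should agree. Whether they run at the same speed is the part that depends on f being inlined into the broadcast loop.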

One thing I’m still unclear about: if f contains some operations without SIMD support (erf?), will SIMD occur at all, or is it impossible? I’d think the loop could operate chunk by chunk, using SIMD for the operations that support it and finishing the rest serially, but that is probably tricky.
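I guess one way to check this empirically is to look at the generated LLVM IR for vector types. Below is a rough sketch, not a definitive test. The helpers apply! and has_vector_ops are hypothetical names of my own, and slow is only a stand-in (marked @noinline) for a call the compiler cannot see into, not a real erf.

```julia
using InteractiveUtils  # standard library; provides code_llvm

# Hypothetical stand-in for an opaque call such as erf: @noinline keeps
# the compiler from inlining it into the loop body.
@noinline slow(x) = x + 1.0

# A kernel that does inline, for comparison.
fast(x) = 2.0 * x + 1.0

# Hand-written element-wise loop, roughly what a fused broadcast lowers to.
function apply!(out, f, A)
    @inbounds @simd for i in eachindex(out, A)
        out[i] = f(A[i])
    end
    return out
end

# Heuristic: does the optimized IR mention vector types like <4 x double>?
function has_vector_ops(f)
    io = IOBuffer()
    code_llvm(io, apply!, Tuple{Vector{Float64}, typeof(f), Vector{Float64}})
    return occursin(r"<\d+ x double>", String(take!(io)))
end

A = rand(64)
out = similar(A)
apply!(out, fast, A)

vectorized_fast = has_vector_ops(fast)
vectorized_slow = has_vector_ops(slow)
```

The two flags depend on the CPU and the Julia/LLVM version, so I would treat them as a hint about what the compiler did rather than as proof either way.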
