When shall I use `BLAS.axpy!` and when `muladd`?

chobbes · December 3, 2017, 1:46am

Can’t tell much difference between the two. Any insights? Thanks.

stevengj · December 3, 2017, 1:51am

I don’t understand the comparison. `axpy!` is for vectors, while `muladd` is for scalars, although of course you can write `x .= muladd.(x, y, z)` to apply it elementwise to vectors, similar to `axpy!`.
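To make the comparison concrete, here is a minimal sketch assuming a current Julia version, where `BLAS` lives in the `LinearAlgebra` standard library (when this thread was written it was under `Base.LinAlg`). The variable names are only for illustration. `BLAS.axpy!(a, x, y)` overwrites `y` with `a*x + y`, and the broadcast line performs the same update with the scalar `muladd`:

```julia
using LinearAlgebra  # provides the BLAS submodule in current Julia versions

a = 2.0
x = [1.0, 2.0, 3.0]
y = [10.0, 20.0, 30.0]

# Level-1 BLAS call: overwrites its last argument with a*x + y
y_blas = copy(y)
BLAS.axpy!(a, x, y_blas)

# Fused broadcast: the same update written with the scalar muladd,
# applied elementwise and stored in place
y_bcast = copy(y)
y_bcast .= muladd.(a, x, y_bcast)

# The two results should agree (up to rounding in the general case)
println(y_blas ≈ y_bcast)
```

In general the two versions may differ in the last bit, since `muladd` is allowed to use a fused multiply-add.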

I would say that 99.9% of code should not be calling low-level BLAS functions directly. If a BLAS-1 function like axpy! is performance-critical for you, you probably need to re-think your code anyway.

chobbes · December 3, 2017, 1:58am

Oh, sorry. I just realized that muladd is for scalars. What is the high-level substitute for `BLAS.axpy!` then? Can you give me a pointer?

Why is calling low-level BLAS functions considered a bad idea? Sorry if my question sounds stupid… I really appreciate it!

ChrisRackauckas · December 3, 2017, 4:32am

Low-level (BLAS-1) calls such as axpy! are usually memory-bound rather than compute-bound, so calling them directly usually doesn’t give a performance advantage over plain Julia code (that’s not true of higher-level BLAS, though). muladd is generic, can fuse with other broadcast operations, and will compile to an FMA instruction on processors where it should, so it’s a great option here.
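As a rough illustration of the "generic" point, here is a sketch of a hand-written loop. `my_axpy!` is a hypothetical helper name, not something from Base or LinearAlgebra. Because it only relies on `muladd`, it accepts element types that the BLAS routine does not support, such as `Rational`:

```julia
# Hypothetical helper: generic in-place update y = a*x + y
function my_axpy!(a, x::AbstractVector, y::AbstractVector)
    length(x) == length(y) || throw(DimensionMismatch("x and y must have equal length"))
    @inbounds @simd for i in eachindex(x, y)
        # muladd may lower to a hardware FMA for floats;
        # for other number types it falls back to a*x[i] + y[i]
        y[i] = muladd(a, x[i], y[i])
    end
    return y
end

# Works for floating-point vectors...
yf = my_axpy!(2.0, [1.0, 2.0], [10.0, 20.0])

# ...and also for exact rational arithmetic
yr = my_axpy!(2, [1//2, 1//3], [1//1, 1//1])
```

For ordinary use, the one-line broadcast `y .= muladd.(a, x, y)` does the same job without a helper function.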

chobbes · December 3, 2017, 5:08am

Thanks for the explanation! Got it now.

rveltz · December 3, 2017, 2:07pm

I thought BLAS would give threading for free as compared to broadcasting muladd. Am I wrong?

antoine-levitt · December 3, 2017, 2:22pm

It does. It might or might not be useful, depending on the architecture/BLAS/weather; see the thread “Using axpy!”. It also might have an overhead for small sizes.
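Since the outcome depends on the machine, the simplest approach is to measure. The following is a sketch only: `time_axpy` is a hypothetical helper, and `BLAS.get_num_threads` is assumed to be available (it exists in Julia 1.6 and later). No timings are shown here because they vary from system to system.

```julia
using LinearAlgebra

# Hypothetical helper: time both variants for vectors of length n,
# taking the best of several runs to reduce noise and hide compilation time
function time_axpy(n; reps = 5)
    x, y = rand(n), rand(n)
    t_blas  = minimum(@elapsed(BLAS.axpy!(2.0, x, y)) for _ in 1:reps)
    t_bcast = minimum(@elapsed(y .= muladd.(2.0, x, y)) for _ in 1:reps)
    return (blas = t_blas, broadcast = t_bcast)
end

println("BLAS threads: ", BLAS.get_num_threads())

# Compare a small and a large problem size
for n in (100, 1_000_000)
    println(n, " => ", time_axpy(n))
end
```

For more careful measurements, a dedicated benchmarking package is a better choice than `@elapsed`.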
