Why is BLAS dot product so much faster than Julia loop?

dlakelan · August 15, 2020, 3:03pm

I had a very similar question, and it got a lot of nice answers.

My favorite by far was to use the @tullio macro (from Tullio.jl) to avoid coding loops at all and just write the sum in Einstein index notation:

return @tullio s := x[i]*y[i]
