May I summarize what we learnt here?
- The original post had Python doing `a*b`, which is not a matrix multiplication, so the comparison was not correct. That was changed to `a@b`, and the reported performance difference is much smaller (~30%), not "twice".
- The remaining difference in performance is somewhat system dependent. If I copy/paste the original codes now, I get the same time for both on my machine (~950 ms). Still, the benchmarks oscillate a bit, because they call BLAS with multi-threading in the background, and other programs can compete for processor usage. There may also be an issue concerning the number of threads launched by the BLAS routine.
- That holds even considering that in Python the line `tmp2[...] = t@tmp1` does not allocate a new array, while the line `tmp2[...] = t*tmp1` does allocate a new array in Julia.
- Avoiding that specific allocation in Julia requires a more verbose syntax, `mul!(@view(tmp2[...]), t, tmp1)`. It might improve performance slightly (maybe 10%), but the timings vary for the same reasons as above.
- Avoiding other allocations can readily make the Julia code run 2x faster than the original one. That can probably be done in Python as well.
- More advanced modifications and a 32-bit representation of the matrices can make the code 50x faster than the original one (Elrod's batch version), but that is advanced indeed.
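To make the first point concrete, here is a minimal NumPy sketch (the small matrices are made up for illustration) showing that `*` is elementwise while `@` is the matrix product:

```python
import numpy as np

a = np.array([[1.0, 2.0], [3.0, 4.0]])
b = np.array([[5.0, 6.0], [7.0, 8.0]])

elementwise = a * b  # multiplies entry by entry (Hadamard product)
matmul = a @ b       # true matrix product (rows times columns)

print(elementwise)  # [[ 5. 12.]
                    #  [21. 32.]]
print(matmul)       # [[19. 22.]
                    #  [43. 50.]]
```

In Julia, by contrast, `a*b` already *is* the matrix product (elementwise is `a .* b`), which is how the two benchmarks ended up computing different things.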
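On the allocation point, a minimal NumPy sketch of the two assignment styles (array names follow the discussion, but shapes are made up here): `tmp2[...] = t @ tmp1` still builds a temporary for the product before copying it into the slice, whereas passing the slice as `out=` writes the result directly into the preallocated buffer, which is the closest Python analogue of Julia's `mul!(@view(tmp2[...]), t, tmp1)`:

```python
import numpy as np

rng = np.random.default_rng(0)
t = rng.random((4, 4))
tmp1 = rng.random((4, 4))
tmp2 = np.zeros((2, 4, 4))  # preallocated output stack (illustrative shape)

# Allocating form: t @ tmp1 creates a temporary array,
# whose contents are then copied into the existing slice.
tmp2[0] = t @ tmp1

# Non-allocating form: BLAS writes the product straight into the buffer.
np.matmul(t, tmp1, out=tmp2[1])
```

Both slices end up with the same values; only the intermediate allocation differs, which matters mostly when the line sits inside a hot loop.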
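As a rough illustration of why the 32-bit representation helps (a sketch, not Elrod's actual batch version): `float32` halves the memory per element, so the same matrices move half as many bytes through cache and memory, and SIMD units process twice as many elements per instruction:

```python
import numpy as np

rng = np.random.default_rng(1)
a64 = rng.random((256, 256))          # default float64
a32 = a64.astype(np.float32)          # same values, half the bytes

print(a64.nbytes)  # 524288
print(a32.nbytes)  # 262144
```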
Finally, I congratulate all for the very pleasant and civilized conversation! 