Benchmark MATLAB & Julia for Matrix Operations

RoyiAvital · October 13, 2019, 9:53pm

Again, I think both of your systems are memory bound, since there is no way a single thread in GEMM is as fast as 4 threads.
You need to understand that.
It is no surprise that on my machine, with quad-channel memory, you see scaling with threading (though modern CPUs can get better bandwidth than my machine even in a dual-channel configuration).

Also, 7 runs are very stable. When I built this I tried many numbers, and actually even 5 is great.
You need to understand that @btime doesn't do anything magical (nor does MATLAB's timeit(), which internally just does multiple runs using tic() and toc()). It calls the same CPU timers.
I prefer to do that manually, and as you can see in @jling's answer above, since I'm not doing it in global scope, the results are correct and reasonable.
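
To make the manual approach concrete, here is a minimal sketch of what timing by hand inside a function can look like. It is not the actual benchmark code discussed in this thread: the helper names `time_runs` and `bench_matmul`, the matrix size, and the warm-up call are all assumptions made for illustration.

```julia
using LinearAlgebra  # provides mul!

# Hypothetical helper (not the benchmark code from this thread):
# call `f()` `num_runs` times and return every run time in seconds.
function time_runs(f, num_runs::Int = 7)
    f()  # warm-up call, so JIT compilation is not part of the timings
    run_times = zeros(num_runs)
    for i in 1:num_runs
        start_ns = time_ns()  # system timer, nanosecond resolution
        f()
        run_times[i] = (time_ns() - start_ns) / 1e9
    end
    return run_times
end

# Everything lives inside a function, so no non-constant globals are timed.
function bench_matmul(n::Int = 500)
    A = randn(n, n)
    B = randn(n, n)
    C = similar(A)
    return time_runs(() -> mul!(C, A, B))
end

run_times = bench_matmul()
println("min: ", minimum(run_times), " s, max: ", maximum(run_times), " s")
```

If the spread between the minimum and maximum is small, a handful of runs is already enough, which is the point about 5 or 7 runs being stable.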

Again, it is you who has to explain how the most optimized function in history, GEMM, which scales beautifully with threads, shows no scaling in your tests. Are you suggesting that the OpenBLAS people created a function 4 times faster than the Intel guys (who spent hundreds of work-years on this)? Come on…
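
One way to check the scaling claim on a given machine is to time the same GEMM call while changing only the number of BLAS threads. The following is a rough sketch under assumptions: the helper names `gemm_time` and `gemm_scaling`, the matrix size, and the thread counts are illustrative, and a larger `n` is likely more representative on a real system.

```julia
using LinearAlgebra  # provides mul! and the BLAS module

# Hypothetical helper: best-of-`num_runs` time (seconds) for an n-by-n GEMM
# with the BLAS library restricted to `num_threads` threads.
function gemm_time(n::Int, num_threads::Int; num_runs::Int = 7)
    BLAS.set_num_threads(num_threads)
    A = randn(n, n)
    B = randn(n, n)
    C = similar(A)
    mul!(C, A, B)  # warm-up call, excluded from the timings
    best = Inf
    for _ in 1:num_runs
        t = @elapsed mul!(C, A, B)
        best = min(best, t)
    end
    return best
end

# Print time and approximate GFLOPS (2n^3 floating point operations per GEMM)
# for each thread count. Note that the BLAS thread count stays at the last value.
function gemm_scaling(n::Int = 1000, thread_counts = (1, 2, 4))
    for nt in thread_counts
        t = gemm_time(n, nt)
        gflops = 2 * n^3 / t / 1e9
        println(nt, " thread(s): ", t, " s, about ", round(gflops, digits = 1), " GFLOPS")
    end
end

gemm_scaling()
```

If the 1-thread and 4-thread times come out nearly identical, that is the result that needs explaining, for example by checking whether memory bandwidth is the limit.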

You need to find better arguments to back up those results.
