Julia matrix-multiplication performance

stevengj · October 30, 2022, 12:02pm

The base case is completely unrelated to the cache size. You want the base case to be just large enough that the recursion overhead is negligible in comparison, but much smaller than any cache.

The criterion should therefore probably be m * n * p <= something, since the relevant factor is the cost of the base case, which scales like $\Theta(mnp)$. But since I was mostly looking at square matrices, it didn't matter much exactly how we implemented the criterion, as long as we did a little tuning of the cutoff value.
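To make the criterion concrete, here is a minimal sketch of a recursive multiply whose base case is chosen by `m * n * p <= cutoff`. It is not the code discussed in this thread: the names `rec_mul!`, `base_mul!`, and `BASE_CUTOFF` are hypothetical, the cutoff value is a placeholder you would have to tune by benchmarking, and halving the largest dimension is just one common way to do the recursion.

```julia
using LinearAlgebra  # only for the reference product A * B in the check below

# Hypothetical placeholder: threshold on m*n*p, to be tuned by benchmarking.
const BASE_CUTOFF = 16^3

# Base case: C += A*B with plain loops (column-major friendly loop order).
function base_mul!(C, A, B)
    m, n = size(A)
    p = size(B, 2)
    @inbounds for j in 1:p, k in 1:n
        b = B[k, j]
        for i in 1:m
            C[i, j] += A[i, k] * b
        end
    end
    return C
end

# Recursive case: C += A*B, halving the largest of the three dimensions.
# Assumes cutoff >= 1, so the dimension being split always has length >= 2.
function rec_mul!(C, A, B; cutoff = BASE_CUTOFF)
    m, n = size(A)
    p = size(B, 2)
    # Base-case test on the work m*n*p, not on any single dimension.
    if m * n * p <= cutoff
        return base_mul!(C, A, B)
    end
    if m >= max(n, p)        # split the rows of A and C
        h = m ÷ 2
        rec_mul!(view(C, 1:h, :), view(A, 1:h, :), B; cutoff = cutoff)
        rec_mul!(view(C, h+1:m, :), view(A, h+1:m, :), B; cutoff = cutoff)
    elseif n >= p            # split the inner dimension; both halves add into C
        h = n ÷ 2
        rec_mul!(C, view(A, :, 1:h), view(B, 1:h, :); cutoff = cutoff)
        rec_mul!(C, view(A, :, h+1:n), view(B, h+1:n, :); cutoff = cutoff)
    else                     # split the columns of B and C
        h = p ÷ 2
        rec_mul!(view(C, :, 1:h), A, view(B, :, 1:h); cutoff = cutoff)
        rec_mul!(view(C, :, h+1:p), A, view(B, :, h+1:p); cutoff = cutoff)
    end
    return C
end

# Quick correctness check on a non-square problem.
A = rand(37, 53); B = rand(53, 29)
C = zeros(37, 29)
rec_mul!(C, A, B)
@assert C ≈ A * B
```

With a product-based test, a long thin problem and a square one hit the base case at roughly the same amount of work, so the recursion overhead is amortized about equally in both. A test on a single dimension would give the same behavior only for near-square matrices.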
