How to improve performance in a function that repeatedly defines and multiplies matrices

Uranium238 · January 6, 2024, 2:22pm

The main purpose of this thread was to find ways to optimize the serial code rather than focusing on parallelization either by Threads or using Distributed . With the code finally running faster than the serial Fortran equivalent on my personal PC , I think the goal (at least in terms of programming optimization of the serial code) has been achieved. Hence I am marking this threads as solved. Huge thanks to @ufechner7 @DNF @nilshg @abraemer and others for their constant support.

Further questions deal with parallelizing the code and running it on supercomputers. I have opened a new thread to carry on the further discussion.

Topic		Replies	Views
How to improve the scaling of Julia code aimed at multi-node parallelization? Julia at Scale linearalgebra , distributed	38	595	August 14, 2024
Julia code becomes slower on running on supercomputers and does not scale well when parallelizing with Base.Threads Julia at Scale fortran , parallel , linearalgebra , threads	73	2031	January 22, 2024
How to convert a thread-parallelized code into a core-parallelized code? Julia at Scale multithreading , linearalgebra , distributed , threads , matrix	3	308	May 19, 2024
Probable data race condition causing problems when trying to parallelize a loop used to populate an array Performance distributed	14	191	August 4, 2024
Speeding up the multiplying, adding, subtracting of 3D matrices Numerics question	16	730	June 24, 2023

How to improve performance in a function that repeatedly defines and multiplies matrices

Related topics