Again on reaching optimal parallel scaling

j_u · December 18, 2021, 2:20am

Interesting thread, I will read with interest, thx.

tkf · December 18, 2021, 2:38am

Just to be clear, this particular comment was referring to the sequential loops written with @floop, not the parallel loops. See the BlockVector speedup w.r.t. iterate in [RFC/ANN] FLoops.jl: fast generic for loops (foldl for humans™) (note: back then there was no parallel @floop). You can also manually write nested loops quite easily in this case anyway, and you don’t need it to be generic over collection types. So I probably shouldn’t have shoehorned in a FLoops advertisement.
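
A minimal sketch of the "manually write nested loops" point, assuming a block-structured collection can be modelled as a plain Vector of Vectors. The names `blocks`, `sum_flat` and `sum_nested` are hypothetical and do not come from the thread or from FLoops.jl; only Base Julia is used. It contrasts a flat, element-by-element iteration with a hand-written outer loop over blocks and inner loop over each block:

```julia
# Hypothetical stand-in for a block-structured vector: a Vector of Vectors.
blocks = [collect(1.0:3.0), collect(4.0:5.0), [6.0]]

# Generic path: lazily flatten the blocks and visit one element at a time.
function sum_flat(blocks)
    s = 0.0
    for x in Iterators.flatten(blocks)
        s += x
    end
    return s
end

# Manual path: outer loop over blocks, inner loop over a block's indices.
# The inner loop runs over a contiguous array, so it can be annotated
# with @simd.
function sum_nested(blocks)
    s = 0.0
    for block in blocks
        @inbounds @simd for i in eachindex(block)
            s += block[i]
        end
    end
    return s
end
```

Both functions compute the same sum. Whether the nested version is actually faster depends on the block sizes and the Julia version, so it is worth benchmarking both on the real data.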
