Again on reaching optimal parallel scaling

tkf · December 18, 2021, 2:38am

Just be clear, this particular comment was referring to the sequential loops written with @floop, not the parallel loops. See the BlockVector speedup w.r.t iterate in [RFC/ANN] FLoops.jl: fast generic for loops (foldl for humans™) (Note: back then there was no parallel @floop). You can also manually write nested loop quite easily in this case anyway and you don’t need it to be generic over collection type. So probably I shouldn’t have shoehorned FLoops advertisement

Topic		Replies	Views
Scaling of @threads for "embarrassingly parallel" problem Performance threads	29	1963	January 20, 2023
Huge performance fluctuations in parallel benchmark: insights? Performance parallel , multithreading , benchmarktools	52	2629	December 1, 2021
Julia code becomes slower on running on supercomputers and does not scale well when parallelizing with Base.Threads Julia at Scale fortran , parallel , linearalgebra , threads	73	2042	January 22, 2024
How to achieve perfect scaling with Threads (Julia 1.7.1) Performance multithreading	33	2443	January 13, 2023
Garbage collection and threading Performance memory-allocation	17	1948	December 20, 2023

Again on reaching optimal parallel scaling

Related topics