Yeah, with @threads there always seems to be some reason to manually split the workload over the threads.
- In this case it’s because you want each thread to reduce into its own local accumulator before combining the threads’ results. A similar issue comes up in Parallel is very slow - #17 by Elrod.
- Another case: Parallelizing for loop in the computation of a gradient - #18 by saschatimme (perhaps not as frequently encountered as the first, but it still comes up).
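To make the first bullet concrete, here’s a minimal sketch of the manual pattern being described: split the indices into one chunk per thread, let each thread reduce its chunk into a private accumulator, and combine the partial results at the end. The name `threaded_sum` is just for illustration, not an existing API.

```julia
using Base.Threads

# Hypothetical helper illustrating manual per-thread reduction with @threads.
function threaded_sum(xs)
    # One contiguous chunk of indices per thread.
    chunks = collect(Iterators.partition(eachindex(xs), cld(length(xs), nthreads())))
    partials = zeros(eltype(xs), length(chunks))
    @threads for c in 1:length(chunks)
        acc = zero(eltype(xs))      # thread-local accumulator
        for i in chunks[c]
            acc += xs[i]
        end
        partials[c] = acc           # each iteration writes only its own slot
    end
    return sum(partials)            # serial combine of the per-chunk results
end
```

All of this chunking and combining is boilerplate the user has to write by hand, which is exactly the annoyance above.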
Maybe @threads in its current form is just not the right abstraction? At the very least, a mapreduce-style version of @threads, similar to @parallel’s reduction form, would be pretty useful I think.
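For the sake of discussion, a mapreduce-style threaded helper could look something like the sketch below (written against `Threads.@spawn`, so it assumes a recent Julia; `tmapreduce` is a made-up name, not something in Base). It hides the chunking/combining boilerplate behind the same `(f, op)` interface as `mapreduce`:

```julia
using Base.Threads

# Hypothetical mapreduce-style counterpart to @threads: apply f to each
# element and reduce with op, parallelizing over one chunk per thread.
function tmapreduce(f, op, xs)
    chunks = collect(Iterators.partition(eachindex(xs), cld(length(xs), nthreads())))
    # Reduce each chunk on its own task...
    tasks = map(chunks) do chunk
        Threads.@spawn mapreduce(i -> f(xs[i]), op, chunk)
    end
    # ...then combine the per-task partial results.
    return mapreduce(fetch, op, tasks)
end

tmapreduce(abs2, +, 1:10)  # same result as mapreduce(abs2, +, 1:10)
```

Note `op` is assumed associative here, same as for `@parallel (op)` reductions.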