Parallel assembly of a finite element sparse matrix

PetrKryslUCSD · March 16, 2023, 2:39am

Interesting! I implemented an alternative based on threads. This works quite well; at least the irregularities (slow tasks mixed in with fast tasks) are gone: all threads take about the same time.

Here is the parallel speedup for the thread implementation:

[Graph: parallel speedup of the thread implementation]

Note that this covers only the computation of the conductivity matrix and its assembly into the COO format.
Because the conversion into the CSC format is not parallelized, the overall speedup is quite a bit worse.
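
To make the two stages concrete, here is a minimal sketch of the pattern, not the code from the repository. It assumes a toy 1-D mesh of two-node elements, and `element_matrix` and `assemble_coo` are hypothetical names. Each chunk of elements writes into its own disjoint slice of preallocated COO buffers, so no locking is needed; the final `sparse` call is the serial COO to CSC conversion.

```julia
using SparseArrays
using Base.Threads

# Hypothetical stand-in for an element conductivity matrix:
# a 1-D two-node element with unit conductivity and unit length.
element_matrix() = [1.0 -1.0; -1.0 1.0]

function assemble_coo(nelems::Int, nchunks::Int = nthreads())
    nn = 2                                  # degrees of freedom per element
    rows = Vector{Int}(undef, nelems * nn^2)
    cols = similar(rows)
    vals = Vector{Float64}(undef, nelems * nn^2)
    # Split the elements into contiguous chunks, one per loop iteration.
    bounds = round.(Int, range(0, nelems; length = nchunks + 1))
    Threads.@threads for c in 1:nchunks
        Ke = element_matrix()
        for e in (bounds[c] + 1):bounds[c + 1]
            dofs = (e, e + 1)               # element e connects nodes e and e+1
            p = (e - 1) * nn^2              # this element's offset in the buffers
            for j in 1:nn, i in 1:nn
                p += 1
                rows[p] = dofs[i]
                cols[p] = dofs[j]
                vals[p] = Ke[i, j]
            end
        end
    end
    return rows, cols, vals
end

rows, cols, vals = assemble_coo(5)
# Serial step: COO triplets -> CSC matrix; duplicate entries are summed.
K = sparse(rows, cols, vals, 6, 6)
```

Start Julia with several threads (for example `julia -t 8`) for the loop to run in parallel. Only `assemble_coo` benefits from the extra threads; the `sparse` call runs on a single thread.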

Machine used in the above graph:

CPU: 2x Intel Xeon E5 2670
Cores: 8
Code name: Sandy Bridge-EP/EX
Package: Socket 2011 LGA
Technology: 32 nm
Specification: Intel Xeon CPU E5-2670 0 @ 2.60GHz
L1 data cache: 8 x 32 KBytes
L1 instruction cache: 8 x 32 KBytes
L2 unified cache: 8 x 256 KBytes
L3 unified cache: 20480 KBytes
Memory: 255 GB DDR3

This machine was running WSL2 under Windows 10.

Test problem: linear heat conduction, 343000 quadratic serendipity elements with 3x3x3 Gauss quadrature.

Open question: what is wrong with the task-based implementation? Why are some tasks much slower than others in the same batch?
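
One way to start narrowing this down is to record, for every task in a batch, how long it ran and on which thread. The following is only a measurement sketch under assumptions: `work` is a hypothetical stand-in for assembling one chunk of elements, and `timed_batch` is a made-up helper name. It does not explain the slowdown; it only shows whether the slow tasks correlate with particular threads.

```julia
using Base.Threads

# Hypothetical workload standing in for the assembly of one chunk of elements.
function work(n)
    s = 0.0
    for k in 1:n
        s += sin(k)
    end
    return s
end

# Spawn `ntasks` tasks and return, for each one, its index, the thread it
# reported, and the elapsed wall-clock time in seconds.
function timed_batch(ntasks::Int, n::Int)
    tasks = map(1:ntasks) do t
        Threads.@spawn begin
            t0 = time_ns()
            r = work(n)
            (task = t, thread = threadid(), seconds = (time_ns() - t0) / 1e9, result = r)
        end
    end
    return fetch.(tasks)
end

timed_batch(2, 10)    # warm-up, so that compilation is not included in the timings
for rec in timed_batch(nthreads(), 10^7)
    println("task ", rec.task, " on thread ", rec.thread, ": ",
            round(rec.seconds; digits = 4), " s")
end
```

The reported thread id is the one the task was running on when it called `threadid()`; tasks can migrate between threads, so treat it as approximate.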

References:
The task loop: https://github.com/PetrKryslUCSD/FinEtoolsHeatDiff.jl/blob/d041cd06035547e7bdb1422a94daf006594f1393/examples/steady_state/3-d/Poisson_examples.jl#L336
The thread loop: https://github.com/PetrKryslUCSD/FinEtoolsHeatDiff.jl/blob/d041cd06035547e7bdb1422a94daf006594f1393/examples/steady_state/3-d/Poisson_examples.jl#L479
