Trying to understand parallel performance in Turing

mikkoku · February 23, 2022, 5:09pm

Since more than 50% of the time is spent in GC, one possible reason for the performance difference between Threads and Distributed could be that Distributed allows GC to run in parallel while GC in any thread stops all threads. The memory allocations could also be a limiting factor in the parallelization speedup. This should not be a problem in a bigger model where more time is spent in computation instead of GC.

Topic		Replies	Views
Turing.jl coin flipping example with large number of data points Probabilistic programming turing	4	1206	April 26, 2022
Within-chain parallelization with Turing.jl Probabilistic programming turing	6	473	September 1, 2023
How can I solve "no method matching Chains" in Turing - Julia Probabilistic programming turing	4	1005	August 6, 2019
GPU and Thread-Parallel Support for Turing.jl Probabilistic programming	1	1839	April 13, 2021
Sampling percentage indicator New to Julia question , turing	2	232	June 23, 2023

Trying to understand parallel performance in Turing

Related topics