Exactly, that is why I mentioned the `sin` function above. With the code below, in which the *non-random* function is simplified even further, I get:

## Code

```
function sum_rand_serial(n)
    s = 0.
    for i in 1:n
        s += rand()
    end
    s
end

function sum_rand_parallel(n)
    nthreads = Threads.nthreads()
    s = zeros(nthreads)
    n_per_thread = n ÷ nthreads
    Threads.@threads for i in 1:nthreads
        for j in 1:n_per_thread
            s[i] += rand()
        end
    end
    sum(s)
end

function sum_notrand_serial(n)
    s = 0.
    for i in 1:n
        s += (1/i)^2
    end
    s
end

function sum_notrand_parallel(n)
    nthreads = Threads.nthreads()
    s = zeros(nthreads)
    n_per_thread = n ÷ nthreads
    Threads.@threads for i in 1:nthreads
        for j in 1:n_per_thread
            s[i] += (1/((i-1)*n_per_thread + j))^2
        end
    end
    sum(s)
end
```

```
julia> @btime sum_notrand_serial(10_000)
  10.056 μs (0 allocations: 0 bytes)
1.6448340718480652

julia> @btime sum_notrand_parallel(10_000)
  5.489 μs (22 allocations: 3.11 KiB)
1.6448340718480643

julia> @btime sum_rand_serial(10_000)
  33.064 μs (0 allocations: 0 bytes)
5053.117071086374

julia> @btime sum_rand_parallel(10_000)
  43.689 μs (22 allocations: 3.11 KiB)
5008.220633345147
```

Now the *non-random* case is about 3 times faster than the random case, and it still scales much better: the parallel non-random version runs nearly twice as fast as the serial one, while the parallel random version is actually slower than its serial counterpart.

But, as I mentioned, that is fixed if one explicitly modifies the loop of the random version to avoid accessing the `s[i]` variable in the inner loop, using a temporary local variable instead. That makes it clear that the problem is the access to memory.

What is not clear is why the same does not happen with some (many?) *non-random* functions. Apparently the compiler is smarter with those and does this automatically, because explicitly avoiding the access to `s[i]` has no effect on performance in these cases (I have seen other cases where it does):

## Code with temporary variable to accumulate

```
function sum_rand_serial(n)
    s = 0.
    for i in 1:n
        s += rand()
    end
    s
end

function sum_rand_parallel(n)
    nthreads = Threads.nthreads()
    s = zeros(nthreads)
    n_per_thread = n ÷ nthreads
    Threads.@threads for i in 1:nthreads
        ts = 0.
        for j in 1:n_per_thread
            ts += rand()
        end
        s[i] = ts
    end
    sum(s)
end

function sum_notrand_serial(n)
    s = 0.
    for i in 1:n
        s += (1/i)^2
    end
    s
end

function sum_notrand_parallel(n)
    nthreads = Threads.nthreads()
    s = zeros(nthreads)
    n_per_thread = n ÷ nthreads
    Threads.@threads for i in 1:nthreads
        ts = 0.
        for j in 1:n_per_thread
            ts += (1/((i-1)*n_per_thread + j))^2
        end
        s[i] = ts
    end
    sum(s)
end
```

```
julia> @btime sum_notrand_parallel(10_000)
  5.470 μs (22 allocations: 3.11 KiB)
1.6448340718480643

julia> @btime sum_rand_parallel(10_000)
  14.686 μs (22 allocations: 3.11 KiB)
5041.680075280128
```

Ah, @stevengj, indeed, increasing the cost of the inner operation reduces this relative lag. I think it is quite clear what is happening. What is not clear to me is why the compiler is smarter in one case than in the other and avoids that memory access.
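For completeness, another way to attack the same problem, besides the temporary local variable, is to pad the accumulator array so that each thread's slot lives on its own cache line. This is a sketch I have not benchmarked here, and it assumes 64-byte cache lines (so 8 `Float64`s per line); the function name `sum_rand_parallel_padded` is just illustrative:

```
# Sketch: avoid false sharing by spacing the per-thread accumulators
# one cache line apart, instead of using a local temporary.
# Assumes 64-byte cache lines: 8 * sizeof(Float64) == 64 bytes.
function sum_rand_parallel_padded(n)
    nthreads = Threads.nthreads()
    pad = 8                          # Float64 slots per cache line
    s = zeros(pad * nthreads)        # only every `pad`-th slot is used
    n_per_thread = n ÷ nthreads
    Threads.@threads for i in 1:nthreads
        k = pad * (i - 1) + 1        # first slot of thread i's own line
        for j in 1:n_per_thread
            s[k] += rand()
        end
    end
    sum(s)
end
```

The trade-off is wasted memory (one cache line per thread) in exchange for keeping the convenient `s[i] += ...` pattern; the local-temporary version is usually preferable anyway, since it avoids the memory traffic entirely.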