Generators vs loops vs broadcasting: Calculate PI via Monte Carlo Sampling

lmiq · June 20, 2021, 4:50pm

For small N, you can encode the size in a type parameter so the compiler knows it at compile time. The function then specializes on that size and can be very fast and non-allocating:

# original
julia> function f(N)
           r = rand(N,2)
           cnt = count(r[:,1].^2 + r[:,2].^2 .<= 1)
           pi = cnt/N * 4
       end
f (generic function with 2 methods)

julia> @btime f(10)
  285.348 ns (8 allocations: 1.14 KiB)
4.0

# now the trick
julia> using StaticArrays

julia> struct MyInteger{N}
         i::Int
       end

julia> function f(n::MyInteger{N}) where N
         r = rand(SMatrix{N,2,Float64})
         cnt = count(r[:,1].^2 + r[:,2].^2 .<= 1)
         pi = cnt/N * 4
       end
f (generic function with 2 methods)

julia> n = MyInteger{10}(10)
MyInteger{10}(10)

julia> @btime f($n)
  43.885 ns (0 allocations: 0 bytes)
2.4
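
The wrapper struct is not strictly needed: Base's `Val` type already carries a value as a type parameter. Below is a minimal sketch of the same trick, not a benchmarked replacement. The name `estimate_pi_static` is made up for illustration, and the `SMatrix` is swapped for a plain tuple built with `ntuple`, so the example runs without StaticArrays.

```julia
# Sketch: N travels in the type (Val{N}), so each N gets its own specialized method.
# `estimate_pi_static` is a hypothetical name, not part of any package.
function estimate_pi_static(::Val{N}) where {N}
    # Tuple of N Bools; its length is known to the compiler.
    hits = ntuple(_ -> rand()^2 + rand()^2 <= 1, Val(N))
    return 4 * count(hits) / N
end

estimate_pi_static(Val(10))  # some multiple of 0.4 between 0.0 and 4.0
```

As with `MyInteger{N}`, this only pays off when N is small and fixed in the code. Building `Val(n)` from a value known only at run time forces a dynamic dispatch and a fresh compilation for every new n.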

For large N, the good thing about writing loops is that you can parallelize them:

julia> using FLoops

julia> function f(N)
           @floop for i in 1:N
               if rand()^2 + rand()^2 <= 1
                   @reduce(cnt += 1)
               end
           end
           pi = cnt/N * 4
       end
f (generic function with 1 method)

julia> @btime f(100000)
  317.370 μs (111 allocations: 5.75 KiB)
3.13936
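
For comparison, the same reduction can be sketched with Base threading only, without FLoops. This is an illustrative variant under assumptions, not what `@floop` does internally: `count_hits`, `estimate_pi_threaded` and the `ntasks` keyword are made-up names, and the work is simply split into one chunk per task.

```julia
using Base.Threads: @spawn, nthreads

# Hypothetical helper: serial hit count over n samples.
function count_hits(n)
    cnt = 0
    for _ in 1:n
        cnt += (rand()^2 + rand()^2 <= 1)  # Int + Bool stays an Int
    end
    return cnt
end

# Hypothetical parallel driver: one task per chunk, then sum the partial counts.
function estimate_pi_threaded(N; ntasks = nthreads())
    base, extra = divrem(N, ntasks)
    # The first `extra` chunks take one more sample, so the chunk sizes sum to N.
    tasks = [@spawn count_hits(base + (k <= extra)) for k in 1:ntasks]
    return 4 * sum(fetch, tasks) / N
end

estimate_pi_threaded(100_000)
```

Either version only runs in parallel if Julia was started with more than one thread, e.g. `julia -t 4`. With a single thread both fall back to serial execution.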
