How to sample from MvNormal without allocating?

markmbaum · March 27, 2023, 3:40am

Even using the in-place method rand! to sample from a MvNormal distribution seems to allocate some memory and be orders of magnitude slower than the univariate case. I only have a two dimensional case, so I was hoping to find a way to optimize but can’t seem to find out how.

Suggestions?

oxinabox · March 27, 2023, 4:13am

There is no obvious reason to me that this should be allocating.
Here is the code for reference if anyone wants to go digging

github.com

JuliaStats/Distributions.jl/blob/ec68da3a8d4a4776367f2d7ca5ec2d4666e29c78/src/multivariate/mvnormal.jl#L276-L290


      
          function _rand!(rng::AbstractRNG, d::MvNormal, x::VecOrMat)
              unwhiten!(d.Σ, randn!(rng, x))
              x .+= d.μ
              return x
          end
          
          
# Workaround: randn! only works for Array, but not generally for AbstractArray
          function _rand!(rng::AbstractRNG, d::MvNormal, x::AbstractVector)
              for i in eachindex(x)
                  @inbounds x[i] = randn(rng, eltype(x))
              end
              unwhiten!(d.Σ, x)
              x .+= d.μ
              return x
          end

github.com

JuliaStats/PDMats.jl/blob/fff131e11e23403931a42f5bfb3384f0d2b114c9/src/generics.jl#L40C10-L43


      
          function unwhiten!(r::AbstractVecOrMat, a::AbstractMatrix, x::AbstractVecOrMat)
              v = _rcopy!(r, x)
              lmul!(chol_lower(cholesky(a)), v)
          end

cholesky on a PDMat should be nonallocating as it does that upfront during construction

markmbaum · March 27, 2023, 4:25am

A simple example:

using BenchmarkTools, Distributions, Random
using Random: rand!

X = MvNormal([2 1; 1 3])
y = zeros(2)
rng = Xoshiro()
@btime rand!($rng, $X, $y);

which produces

  133.021 ns (2 allocations: 96 bytes)

It seems to matter that the distribution is a ZeroMeanFullNormal but I’m not sure why.

sethaxen · April 3, 2023, 3:10pm

There seem to be two sources of allocations. The first is PDMats.chol_lower called here. Not certain why this allocates; maybe because the result is a typeunion? This allocation is 16 bytes.

The second is this line, where the mean μ is added to the result via broadcast. The mean in this case is a FillArrays.Zeros, and it seems that broadcasted makes a copy:https://github.com/JuliaArrays/FillArrays.jl/blob/c3b38add861d475aadc66a112e045d7e0db31372/src/fillbroadcast.jl#L205. This results in an 80 byte allocation. Seems like this could be improved in FillArrays.

sethaxen · April 11, 2023, 12:01pm

Seems when I posted this, @jishnub had already begun working on a PR to fix this: don't materialize when broadcasting Zeros with Vector by jishnub · Pull Request #211 · JuliaArrays/FillArrays.jl · GitHub

markmbaum · August 15, 2023, 2:29pm

Seems like this has improved. Running the same tiny example

using BenchmarkTools, Distributions
using Random: Xoshiro, rand!

X = MvNormal([2 1; 1 3])
y = zeros(2)
rng = Xoshiro()
@btime rand!($rng, $X, $y);

69.586 ns (1 allocation: 16 bytes)

Topic		Replies	Views
Reducing allocations for repeatedly generating MvNormal distributions Performance	2	448	May 16, 2020
Is this normal things about MvNormal? General Usage question , distributions	5	87	September 3, 2024
How to sample data without using external packages? General Usage question , memory-allocation , random	12	528	March 1, 2022
Fastest way to sample from MVN, changing parameters Julia at Scale statistics , distributions , gaussian-process , probablistic	7	111	May 3, 2025
Sampling predictive with MvNormal in Turing Probabilistic programming turing	2	193	June 7, 2024

How to sample from MvNormal without allocating?

Related topics