Type instability with Dirichlet distribution in Turing

I’m fitting a marginalized Gaussian mixture model with Turing, and the sampler runs very slowly. Looking at the output of `@code_warntype`, it appears that drawing the mixture weights from a Dirichlet distribution introduces a type instability (when I replace the weight vector with a constant, the sampler runs much faster, so that seems to be the culprit). Is this a bug? Any ideas for a workaround? Thanks!
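For readers less familiar with the term: a type instability means the compiler cannot infer a concrete type for a value, so it falls back to dynamic dispatch. A minimal, Turing-independent sketch of the kind of function `@code_warntype` flags:

```julia
# Return type depends on a runtime value, so inference can only
# conclude Union{Float64, Int64} -- this is what @code_warntype
# highlights in red.
unstable(flag) = flag ? 1 : 1.0

# Always returns Float64; inference succeeds with a concrete type.
stable(flag) = flag ? 1.0 : 2.0

# @code_warntype unstable(true)   # Body::Union{Float64, Int64}
# @code_warntype stable(true)     # Body::Float64
```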

MWE:

```julia
using Turing

@model MarginalizedGMM(x, K, ::Type{T}=Vector{Float64}) where {T} = begin
    N = length(x)
    μ = T(undef, K)
    σ = T(undef, K)
    for i in 1:K
        μ[i] ~ Normal(0, 5)
        σ[i] ~ Gamma()
    end
    w ~ Dirichlet(K, 1.0)
    # w = T([0.75, 0.25])  # Way faster with this line instead of ↑
    for i in 1:N
        x[i] ~ Distributions.UnivariateGMM(μ, σ, Categorical(w))
    end
    return (μ::T, σ::T, w::T)
end
```


```julia
x = [randn(150) .- 2; randn(50) .+ 2]
gmm = MarginalizedGMM(x, 2)
varinfo = Turing.VarInfo(gmm)
spl = Turing.SampleFromPrior()
@code_warntype gmm.f(varinfo, spl, Turing.DefaultContext(), gmm)

chn = sample(gmm, NUTS(100, 0.65), 1000)
```
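One workaround sometimes suggested while an issue like this is open (a sketch only; the helper name `stickbreak` is my invention, not a Turing API): construct the weights deterministically from K − 1 draws in (0, 1) via stick-breaking, so the weight vector’s element type follows the element type of the draws (e.g. dual numbers under AD) rather than being pinned to `Vector{Float64}` by the Dirichlet:

```julia
# Stick-breaking transform: map K-1 values in (0, 1) to a length-K
# probability vector. The element type T propagates from the input.
function stickbreak(v::AbstractVector{T}) where {T<:Real}
    K = length(v) + 1
    w = Vector{T}(undef, K)
    remaining = one(T)
    for k in 1:(K - 1)
        w[k] = v[k] * remaining   # take a fraction of the remaining stick
        remaining -= w[k]
    end
    w[K] = remaining              # leftover mass goes to the last component
    return w
end

stickbreak([0.75])                # -> [0.75, 0.25], sums to 1
```

If I recall the stick-breaking representation correctly, drawing `v[k] ~ Beta(1, K - k)` reproduces the uniform `Dirichlet(K, 1.0)` prior on `w`; whether this actually removes the instability in the model above I haven’t verified.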

Please open an issue so it doesn’t get forgotten. I will take a look when I get some time.

And by the way, it’s natural for the model to be faster with the Dirichlet draw replaced by a constant: with `w ~ Dirichlet(K, 1.0)` you are also accumulating the Dirichlet’s logpdf and differentiating it with respect to the transformed parameters, which that constant line avoids. But the type instability is what concerns me.
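For concreteness, the extra term in question is the Dirichlet log-density evaluated at the current weights. For this particular prior it happens to be zero at interior points, since `Dirichlet(2, 1.0)` is uniform on the simplex, but the sampler still has to accumulate and differentiate it:

```julia
using Distributions

# Log-density of the mixture-weight prior at the constant used in the MWE.
# Dirichlet(2, 1.0) is the uniform distribution on the 2-simplex, so the
# log-density is 0 at any interior point -- but the AD pass still sees it.
logpdf(Dirichlet(2, 1.0), [0.75, 0.25])   # -> 0.0
```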

Issue opened: https://github.com/TuringLang/Turing.jl/issues/1276

And yeah, I wouldn’t be surprised to see it run somewhat slower when it also has to sample the weights, but with the type instability the slowdown is ~25x, which is what caught my attention.
