Recommended sampler for data with Categorical likelihood

mthelm85 · August 16, 2022, 12:22pm

The NUTS sampler takes a really long time in this MWE:

using Distributions
using Turing

d = Categorical([0.7, 0.29, 0.01])
N = 100
Y = rand(d, N)

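@model function test(Y)
    # Prior over the three category probabilities
    p ~ Dirichlet([2,2,2])
    # Iterate over the data passed in instead of the non-const global N
    for i in eachindex(Y)
        Y[i] ~ Categorical(p)
    end
end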

m = test(Y)
chain = sample(m, NUTS(), 1_000)

My real problem has thousands of observations, so I’m going to need a different sampler. I’m hoping someone can suggest one for this kind of problem. I’ve been spoiled by NUTS just working (for the most part) without any tuning.
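One possible way to keep NUTS here, offered as a sketch rather than a tested recommendation: the Categorical likelihood depends on the data only through the per-category counts, so the loop over observations can be collapsed into a single Multinomial term. The posterior for `p` is unchanged because the multinomial coefficient does not depend on `p`. The names `category_counts` and `test_counts` are hypothetical and not part of the original model, and the sketch has not been timed against the loop version.

```julia
using Distributions
using Turing

# Hypothetical helper: number of observations in each category 1..K.
category_counts(Y, K) = [count(==(k), Y) for k in 1:K]

@model function test_counts(counts)
    K = length(counts)
    # Same prior as Dirichlet([2,2,2]) when K == 3
    p ~ Dirichlet(K, 2.0)
    # One likelihood term for the whole data set instead of one per observation
    counts ~ Multinomial(sum(counts), p)
end

d = Categorical([0.7, 0.29, 0.01])
Y = rand(d, 100)
counts = category_counts(Y, ncategories(d))

chain = sample(test_counts(counts), NUTS(), 1_000)
```

With this formulation the work per log-density evaluation depends on the number of categories rather than on the number of observations, which is the part that matters when the data set grows to thousands of rows.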

opera_malenky · August 16, 2022, 9:36pm

Just to provide another data point, you can see that Dirichlet is still much, much slower than using Beta for an equivalent model:

@model function test(Y)
    # Assumes Y holds two-category data (values 1 and 2 only)
    p ~ Dirichlet([2,2])
    for i in eachindex(Y)
        Y[i] ~ Categorical(p)
    end
end

m = test(Y)
chain = sample(m, NUTS(), 1_000)


@model function test2(Y)
    # Note: Beta(2,2), not Beta(1,1), is the exact counterpart of Dirichlet([2,2])
    p ~ Beta(1,1)
    for i in eachindex(Y)
        Y[i] ~ Categorical([p, 1-p])
    end
end

m2 = test2(Y)
chain = sample(m2, NUTS(), 1_000)

# m (Dirichlet) takes 70.5s (10.7 ESS/s)
# m2 (Beta) takes 0.4s (1060 ESS/s)

No idea why.
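Whatever the cause of the slowdown, a Dirichlet prior with a Categorical likelihood is conjugate, so the exact posterior is available as a baseline for checking either chain: it is a Dirichlet whose concentration is the prior concentration plus the per-category counts. The sketch below uses the three-category setup from the first post. The variable names are illustrative, and it only applies when `p` is the sole unknown, as in these MWEs.

```julia
using Distributions

# Prior concentration, as in Dirichlet([2,2,2])
alpha = [2.0, 2.0, 2.0]

Y = rand(Categorical([0.7, 0.29, 0.01]), 100)
counts = [count(==(k), Y) for k in eachindex(alpha)]

# Conjugate update: posterior concentration = prior concentration + counts
posterior = Dirichlet(alpha .+ counts)

post_mean = mean(posterior)      # exact posterior mean of p
draws = rand(posterior, 1_000)   # 3 x 1000 matrix, one draw per column
```

Comparing `post_mean` with the mean of `p` in a NUTS chain run on the same `Y` gives a quick sanity check that the sampler is targeting the right distribution.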
