How to integrate out latent variables/ intermediate values in Turing model

Hi all, my generative model has the following form.
P(y |\theta) = \int P(y|x) P(x|\theta) dx
I want to get the posterior of θ given fixed data y, but unfortunately I don't have an analytical solution for the integral over x. As a result, I'm implementing the model in Turing with the following structure and using sampling to get a posterior on θ:

@model function my_func(y)
    θ ~ prior_dist()
    x ~ dist1(θ)
    y ~ dist2(x)
end

However, this makes the sampling space very high-dimensional, since all of the x values end up in the posterior as well. Is there a way to integrate out a latent variable in Turing to avoid this problem?

Is dist1 a high-dimensional multivariate distribution?

Unless you have an analytic form, there isn't going to be an easy way to do high-dimensional integration.

Indeed, dist1 is a high-dimensional multivariate distribution. In my case it's something along the lines of MvNormal(0, θ). However, there are some transformations in between in my actual model, so I couldn't integrate analytically.
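(For reference, it is the nonlinear transformation that blocks the closed form here. In the purely linear-Gaussian case, the marginal is analytic; a sketch, assuming a hypothetical linear map A and observation noise σ²:

x \sim \mathcal{N}(0, \Sigma_\theta), \quad y = Ax + \varepsilon, \quad \varepsilon \sim \mathcal{N}(0, \sigma^2 I)
\;\Longrightarrow\; y \mid \theta \sim \mathcal{N}(0, A \Sigma_\theta A^\top + \sigma^2 I)

so with a nonlinear transform in place of A, one is pushed toward numerical approaches like the one below.)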

Is it perhaps possible to marginalize out x by sampling x within each Turing sampling step, assuming that the following Monte Carlo approximation is correct?
P(y|\theta) = \int P(y|x) P(x|\theta) dx \approx \frac{1}{N} \sum_{i=1}^{N} P(y|x_i), \quad x_i \sim P(x|\theta)
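A minimal sketch of that estimator in plain Julia, on a toy one-dimensional model where the exact marginal is known (assumed densities, not the real model: x | θ ~ Normal(0, θ) and y | x ~ Normal(x, 1), so y | θ ~ Normal(0, sqrt(θ² + 1)) exactly). The average of P(y|x_i) is taken in log space with log-sum-exp to avoid underflow:

```julia
using Random

# log density of Normal(μ, σ) evaluated at z
lognormal(z, μ, σ) = -0.5 * ((z - μ) / σ)^2 - log(σ) - 0.5 * log(2π)

# Monte Carlo estimate of log P(y | θ) = log ∫ P(y|x) P(x|θ) dx:
# draw x_i ~ P(x | θ), then average P(y | x_i) in log space.
function log_marginal_mc(y, θ; N = 200_000)
    logw = Vector{Float64}(undef, N)
    for i in 1:N
        x = θ * randn()                 # x_i ~ Normal(0, θ)
        logw[i] = lognormal(y, x, 1.0)  # log P(y | x_i)
    end
    m = maximum(logw)                   # log-sum-exp for numerical stability
    return m + log(sum(exp.(logw .- m))) - log(N)
end

Random.seed!(1)
est   = log_marginal_mc(1.0, 2.0)
exact = lognormal(1.0, 0.0, sqrt(2.0^2 + 1))  # closed form for the toy model
```

With enough samples the estimate tracks the closed form closely; inside MCMC this plug-in estimate of the likelihood is noisy, which is worth keeping in mind.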

I think you’re trying to do something like:

theta ~ MyPrior()
x = rand(MvNormal(0, theta))
xprime = transform(x)
inclp = sum(logpdf(Ydist(xp), y) for xp in xprime)
@addlogprob!(inclp)

Which you can definitely try… see what you think.


Thanks so much for this solution! I was only introduced to @addlogprob! the other day and didn't realize you could use it like this.