Need help with Variational Inference

Farlein · June 5, 2020, 3:08am

Hi there,

I have been recently attracted to variational inference. I like the idea to link Bayesian modeling with optimization. I like its uni modal approximation to the parameters in a model (by assuming the variational posterior is a normal distribution). I have read the document on variational inference on the Turing website, Variational Inference. I try to understand the codes in AdvancedVI.jl/src at master · TuringLang/AdvancedVI.jl · GitHub, but have difficult time with them.

Here is a simple scenario. Suppose I have a data set with 100 observations and 2 variables, x and y, where x is a continuous predictor and y is a binary output. I want to use a logistic regression with a single predictor x to predict y and use variational inference to estimate the distribution of parameter z for x. The prior of z is assumed to be a standard normal distribution.

@model logistic_regression(x,y,100) = begin
    intercept ~ Normal(0,1)
    z ~ Normal(0,1)

    for i = i:100
        v = logistic(intercept + z*x[i])
        y[i] ~ Bernoulli(v)
    end    
end;

According to the document above, we need to maximize ELBO(q) =

Σ_k=1 ^mΣ_i=1 ⁿ(log(p(x_i,z_k))/m + H(q(z))

, in order to estimate parameters in a model. I want to understand how ELBO is calculated with the above model. I have not tried to understand the optimization part yet.

Please let me know if the following is right.
log(p(x_i,z_k)) = log(p(x_i|z_k)p(z_k)) = InvLogit(Intercept+z_k*x_i)*exp(-z_k²/2)/sqrt(2π), where x_i is sampled from the data set, and z_k is sampled from q_μ,σ = N(μ,σ²).

In Turing, is log(p(x_i,z_k)) calculated using the two functions in Turing.jl/VariationalInference.jl at master · TuringLang/Turing.jl · GitHub?

function make_logjoint(model::Model; weight = 1.0)
    # setup
    ctx = DynamicPPL.MiniBatchContext(
        DynamicPPL.DefaultContext(),
        weight
    )
    varinfo_init = Turing.VarInfo(model, ctx)

    function logπ(z)
        varinfo = VarInfo(varinfo_init, SampleFromUniform(), z)
        model(varinfo)

        return getlogp(varinfo)
    end

    return logπ
end

function logjoint(model::Model, varinfo, z)
    varinfo = VarInfo(varinfo, SampleFromUniform(), z)
    model(varinfo)

    return getlogp(varinfo)
end

In https://github.com/TuringLang/Turing.jl/blob/master/src/variational/objectives.jl, the objective seems to be calculated using a function elbo,

function (elbo::ELBO)(
    rng::AbstractRNG,
    alg::VariationalInference,
    q,
    model::Model,
    num_samples;
    weight = 1.0,
    kwargs...
)
    return elbo(rng, alg, q, make_logjoint(model; weight = weight), num_samples; kwargs...)
end

I do not understand how ELBO is calculated from these several lines of code .

The entropy part seems to be addressed in Turing.jl/advi.jl at master · TuringLang/Turing.jl · GitHub, right?

    if q isa TransformedDistribution
        res += entropy(q.dist)
    else
        res += entropy(q)
    end

In sum, would someone give me some instruction to read the Turing code on variational inference?

Thanks,
Chuan

Topic		Replies	Views
How to obtain maximized ELBO value from Turing to use as approximation to model evidence? Modelling & Simulations question , turing	5	449	October 23, 2022
Finding MAP estimate in Turing? Statistics bayesian-inference	1	975	June 20, 2020
Posterior prediction in Turing Probabilistic programming turing	4	502	April 13, 2023
Bayesian logistic regression with Turing.jl Probabilistic programming turing , monte-carlo	29	4461	May 18, 2021
Variational Inference for Multinomial output Probabilistic programming turing	0	378	July 22, 2020

Need help with Variational Inference

Related topics