Minibatch ADVI?

spragud2 · February 21, 2023, 11:40pm

Hey!

I am trying to train a last-layer bayesian approximation for a neural network. The issue I am running in to is that ADVI tries to take the gradient of the entire dataset… obviously not going to work.

Can the ADVI algorithm be tweaked to calculate the gradient on a minibatch, presumably by scaling the output by the size of the total dataset?

This feature is built in to pymc but i really am not a fan of the syntax.

Topic		Replies	Views
Taking gradients of minibatch gradient with Flux General Usage flux , zygote	0	512	October 28, 2020
Amortized Hierarchical Variational Model Probabilistic Programming flux , turing , machine-learning	4	462	September 17, 2020
[Review wanted] Adam gradient descent with complicated gradient to fit multivariate gamma convolutions Performance review	2	695	January 7, 2022
Speeding up per-sample gradients? Machine Learning question , autodiff	16	1065	February 2, 2024
Gradient of a gradient of a FastChain General Usage question , zygote , ad , neural-network	19	2292	February 8, 2022

Minibatch ADVI?

Related topics