Is there a library function to fit an identity-link NB1 regression model?

jacques1 · January 11, 2023, 9:19pm

I would like to build an identity link negative binomial 1 glm model for some data (where negative binomial 1 means that the dispersion parameter is of the form (1 + theta)*mean). This means I need to simultaneously control both the link function form and the dispersion parameter form.

Is there a library/function I can use to control both?

More generally, I’d be happy if I could specify both a link function and an arbitrary form of the dispersion, only needing to supply the dispersion parameter theta.

p-gw · January 12, 2023, 9:36am

I’m not sure if there is anything pre-packaged to fit something like this, but I’m sure this can easily be written with Turing.

palday · February 28, 2023, 7:51pm

GLM.jl has support for negative binomial regression.

lrnv · February 28, 2023, 10:01pm

Although I am not sure they allow for over dispersion. You might have to use the joint modeling technique : fit another linear model for the overdispersion parameter and then re-run the original one taking it into account, as described in McCullagh & Nelder 1989 “Generalized liner model”. I think the joint dispersion is chapter 8 but I am not sure.

jacques1 · February 28, 2023, 10:19pm

So far as I can tell, this does not let me choose between an NB1 and NB2 (or more generally NB-P) model. These are described in Hilbe’s Negatvie Binomial Regression book.

jacques1 · February 28, 2023, 10:23pm

It is possible to specify such a model in Turing, and I ended up doing so. The unfortunate part is that the time to sample the Turing model accurately on my data is much greater than with, say negbinfit in the GLM library. There are more general advantages to Turing’s Bayesian philosophy of course, but the speed difference can be orders of magnitude, such that it is impractical. The best work-around that I found was to just get the MAP estimate using optim.jl as a substitute for the MLE, but this shortcut is necessarily less accurate and without the benefit of a full model (error bars, etc).

jacques1 · February 28, 2023, 10:29pm

negbifit.jl does estimate the dispersion. Instead of conducting auxiliary regression for theta, as you describe and which I have seen in some other implementations, it uses an MLE loop method to refine theta, constantly fitting new NB glms with fixed thetas until they reach some optimal loglikelihood.

This implementation is just NB2, and having spent time with the source code it would be a bit painful to modify to work with NB-1/NB-P, since the derivatives for the MLE and the loglikelihood for the stopping criteria are hard-coded for NB2 regression.

simsurace · February 28, 2023, 10:50pm

There are different types of NB likelihoods in GPLikelihoods.jl. They are just mappings from reals to a distribution, so should be able to be used for non-GP models.

p-gw · March 1, 2023, 8:37am

Of course I don’t know the scale of your problem.
Just note that you can get the parameter variance-covariance matrix via vcov from MLE/MAP estimates in Turing.

Topic		Replies	Views
Turing's negative binomial regression with horseshoe prior failing sometimes Statistics question , turing	1	118	November 18, 2024
Bayesian regression with parametrized basis functions in Turing.jl Probabilistic programming question , turing , gaussian-process	7	898	April 19, 2022
[ANN] TuringGLM.jl: A simple @formula macro for specifying Bayesian models in Turing.jl Package Announcements	8	1400	August 19, 2022
Normal Inverse Gaussian distribution in Turing Probabilistic programming	0	537	September 7, 2021
Turing: "no method matching cholenksy" Probabilistic programming	2	962	September 29, 2019

Is there a library function to fit an identity-link NB1 regression model?

Related topics