Turing.jl/MCMC: Chains getting stuck

jacobusmmsmit · March 10, 2022, 4:30pm

I want ask for advice or insight as to why my MCMC chains could be stuck even when using HMC on a uni-modal posterior.

I’m doing parameter calibration of pedestrian models (systems of non-linear ODEs) using Turing.jl. If we generate synthetic data using the model, then inferring the parameters from this data over a short time period (<1s) is a well-posed and well-conditioned problem, as there is little time for the pedestrians’ trajectories to be “chaotically” altered. However, extending the inference to times greater than a few seconds, the chains entirely fail to explore the parameter space and get completely stuck. That is to say, they still accept proposals, it’s just that these proposals are very close to each other, so the chain never moves.

I am confused as to why this would happen, given that I am using HMC (NUTS), which was specifically created to address the issue of non-exploration. The posterior in question looks like this:

Which doesn’t strike me as particularly difficult to identify. That said, I’m so new to MCMC that there could be something obvious that I’m missing. One thing I haven’t done is check the identifiability of my parameters using the relevant SciML tools.

Here’s what an example trace plot looks like for my problem:

wc4wc4wc4 · March 10, 2022, 4:49pm

I think that a MWE would be beneficial, otherwise it’s quite difficult to help.

jacobusmmsmit · March 10, 2022, 5:00pm

I apologise for not having done so already. I will do my best, but I’ve been advised not to share too much of the code I’m using so I don’t know how useful it will ultimately be.

wc4wc4wc4 · March 10, 2022, 5:04pm

Then simulate som data, if you cannot reveal the real data. I think that would make more people able to help.

jacobusmmsmit · March 10, 2022, 5:49pm

What sort of data would be useful? I have saved some chains saved, and some data from the output of my ODE.

Edit: I think I misinterpreted your comment: It’s not the data that is not shareable, but the code itself cannot be shared publicly due to how the automatic code plagiarism detection works. I can try to come up with a MWE but even then I’m not sure how useful it will be because the setup and model are quite particular.

PeetoomHeida · March 10, 2022, 8:29pm

Sorry for being off topic, but can you share the code to make the visualization of the posterior? I haven’t been able to figure out how to do so.

jacobusmmsmit · March 10, 2022, 8:53pm

No worries, the code for creating the visualisations can be found here:
https://turing.ml/dev/docs/using-turing/sampler-viz

There are a few things you need to change (names of variables) to get it to work for your model/chain, but it’s relatively straightforward.

jacobusmmsmit · March 10, 2022, 9:03pm

I can’t believe this, but the chains getting stuck seemed to be an issue of type-instability. After I checked the code with @code_warntype I discovered that there were some global variables over which I had defined closures. Hence the return type of my Turing model was Any.

I think this had an effect on the \epsilon heuristic for the leapfrog integrator. I might submit an issue on the Turing repo to find out where this came from.

The speedup after fixing this was about 30 times btw…

Never use global variables, kids.

16/04 Edit: This did not fix the problem entirely, however it made it possible to run the simulations faster and diagnose possible problems. I think the chains getting stuck is an issue of the \epsilon heuristic being tricked by a very rough posterior.

Sahil_Khan · January 30, 2025, 12:30pm

I have similar issue posted here.

So, what exactly solved your issue?

jacobusmmsmit · January 30, 2025, 12:47pm

My method had a part that was undifferentiable and I was using NUTS which relies on gradients. It was never going to work in the first place and the fact that it even did a little bit is surprising.

Topic		Replies	Views
Turing NUTS chains getting stuck at the parameter bounds General Usage turing , differentialequation	43	1227	August 20, 2022
Identical samples during mcmc run in Turing.jl with NUTS algorithm Modelling & Simulations question , turing , mcmcchain	0	94	January 29, 2025
Sampling gives chains which converge to two different distributions in ODE example of TuringTutorials Probabilistic Programming	4	244	January 11, 2023
Turing.jl - NUTS gets stuck in "The current proposal will be rejected... isfinite.((θ, r, ℓπ, ℓκ)) = (true, true, false, true)" Probabilistic Programming turing	7	2757	November 22, 2022
Turing.jl Warning: The current proposal will be rejected due to numerical error(s). isfinite.((θ, r, ℓπ, ℓκ)) = (true, false, false, false) Modelling & Simulations turing	4	453	January 29, 2023

Turing.jl/MCMC: Chains getting stuck

Related topics