Depending on what exactly you need, at a glance it looks like its the type of model that MUSE might be applicable to (theoretically, and also in practice since there’s a Turing interface). I gave an example here which gave big speedups over HMC, the docs are pretty good too, and show big speedups over VI as well. Please don’t hesitate to get in touch if you try it out and/or run into anything!