Hi all, I’ve seen the perf benchmark or sampling for Turing vs Stan but I was wondering if we have benchmarks for Pyro, pymc3, tensorflow-probabiliy with one chain and with multiple chains (since tfp seems to benefit from vectorizing chains?).
I think extending the Turing vs Stan benchmarks here should be relatively straight-forward. Also, we have a GSoC project for this so it may be worked on soon.
Note that there’s a third benchmark which needs an update but Stan currently fails to be able to estimate it well, so for now we’re leaving that one off (we’ll need to decrease the timespan or something to make it easier to estimate for Stan).