[ANN] HiddenMarkovModels.jl: when did HMMs get so fast?

gdalle · June 11, 2023, 4:43pm

What the **** is an HMM?

Skip to the good bits below if you’re in a rush

Hidden Markov Models (HMMs for short) are a statistical modeling framework that is ubiquitous in signal processing, bioinformatics and plenty of other fields. They capture the distribution of an observation sequence (Y_t) by assuming the existence of a latent state sequence (X_t) such that:

(X_t) follows a (discrete time, discrete space) Markov chain
for each t, the distribution of Y_t is entirely determined by the value of X_t

Imagine we are given an observation sequence (Y_t) and a parametric family of HMMs \{p_\theta\}. Following this famous tutorial, we can list three fundamental problems, each of which has a solution that relies on dynamic programming:

Problem	Description	Solution
Evaluation	Compute the likelihood of the observation sequence p_\theta(Y) for a fixed \theta	Forward algorithm
Decoding	Compute the most likely state sequence \arg\max_X p_\theta(X \mid Y) for a fixed \theta	Viterbi algorithm
Learning	Find the best parameter \arg\max_\theta p_\theta(Y)	Baum-Welch (EM) algorithm

Who are the competitors?

Of course, you saw me coming a mile away: there are already some Python packages that provide HMM functionalities. The main two are:

hmmlearn – this one seems dormant
pomegranate – this one does plenty of other things, it truly is an amazing toolkit, and its maintainer was very friendly when I asked for help

There is also a Julia package called HMMBase.jl, developed by @maxmouchet a while ago. But for my PhD research, I found myself wanting things that none of these implementations provided. So I rolled up my sleeves and created HiddenMarkovModels.jl.

What follows is not a criticism of past efforts: if anything, it is an hommage. HMMBase.jl in particular was my main source of inspiration, and much of my code comes from it. I have discussed it with @maxmouchet, and we agreed to declare my package the official successor to HMMBase.jl. Since the interface is very similar, users shouldn’t struggle too much with the transition.

As a side note, one can always rephrase an HMM as a probabilistic program and throw some MCMC or variational inference at it. But at least in the basic case I describe, it will be much slower than dedicated routines. That is why I don’t include packages such as Turing.jl in the comparison.

What can the package do?

To get a feel for the package, check out its documentation at https://gdalle.github.io/HiddenMarkovModels.jl/. Here is a feature comparison between HMMBase.jl and HiddenMarkovModels.jl:

Feature	HMMBase.jl	HiddenMarkovModels.jl
Number types	`Float64`	anything
Observation types	`Number` or `Vector`	anything
Observation distributions	from Distributions.jl	satisfying DensityInterface.jl
Priors / structures	no	customizable
Autodiff	no	forward mode (for now)
Multiple sequences	no	parallelized

Reaching this level of generality was my main motivation. Indeed, my doctoral research involved differentiating the likelihood of HMMs with multiple sequences of Poisson process observations. This kind of funky combination is now possible with very little additional work from the user.

How reliable is it?

The package follows all the best practices that I’m aware of:

quality checks with Aqua.jl
correctness and type stability checks with JET.jl (shoutout to @aviatesk as always)
coherence checks against HMMBase.jl
integration checks with third-party packages like SparseArrays, StaticArrays.jl and ForwardDiff.jl
documentation with Documenter.jl
benchmarks compatible with PkgBenchmark.jl

Does it run fast?

In addition to the features, there is a number of tricks that I used to speed up the code: avoiding allocations, calling linear algebra subroutines, multithreading across sequences, etc.
The results are pretty amazing on small- to medium-scale problems. HiddenMarkovModels.jl blows HMMBase.jl out of the water, and compares quite favorably to hmmlearn (which has a NumPy backend) and pomegranate (which has a PyTorch backend), even though I ran it on a single thread.
Complete reproducible benchmarks with their explanations can be found at https://gdalle.github.io/HiddenMarkovModels.jl/dev/benchmarks/, they’re run automatically before each docs build.

Where will it go next?

The package is currently awaiting registration (see this PR to the general registry).

Here are some of the things that I would like to work on soon-ish:

specification and testing for an AbstractHMM interface, perhaps with Interfaces.jl
numerical stability in large-dimensional settings with sparse transitions
reverse mode autodiff with ChainRules.jl
SIMD optimization with LoopVectorization.jl or Tullio.jl
spectral estimation methods
input-output HMMs in my other package ControlledMarkovModels.jl (don’t look at it yet, it’s ugly and unmaintained!)

Contributors are welcome!
In the long run, I will probably transfer this package to JuliaStats, but for now I’d like to keep control until things are stabilized.

I’ve read everything, what now?

If you’ve made it this far, congrats. Take a break. Treat yourself to a fruit juice. Chill.

And whenever you’re ready, you can run the command below, then look at the tutorial to start playing around: https://gdalle.github.io/HiddenMarkovModels.jl/dev/tutorial/.

pkg> add https://github.com/gdalle/HiddenMarkovModels.jl

If you have any suggestions or run into trouble, feel free to open an issue or use this thread! Also, a little star on the repo is always appreciated if you find it useful

camilogarciabotero · June 14, 2023, 3:59pm

This is so amazing. Thanks for working on this and creating an excellent framework for working with HMMs. I just wanted to point out that there is another package called MarkovModels.jl that seems to have neat implementations of some of the algorithms (e.g. Viterbi) that apparently scales well.

jacobusmmsmit · June 14, 2023, 6:31pm

Are you gonna add the Aqua badge to the repo?

gdalle · June 15, 2023, 7:02am

Wow, I missed that one in my SoTA review. Might add it to the benchmarks sometime! And I should specify that my package is mostly geared towards middle-scale uses on the CPU, I made absolutely zero effort to be GPU-friendly

gdalle · June 15, 2023, 7:02am

I didn’t even remember there was such a badge! Is there a JET badge?

DanielVandH · June 15, 2023, 7:15am

Works (from Logo or Readme badge similar to Aqua's · Issue #375 · aviatesk/JET.jl · GitHub)

gdalle · February 22, 2024, 8:19pm

There is now a new version v0.4.1 which supports most of the stuff listed here! Go try it out

gdalle · April 5, 2024, 8:47pm

Update: version 0.5 is registered and has been turned into a JOSS paper!

NeroBlackstone · May 2, 2024, 6:29pm

Amazing!
Do you have plan to support High-Order HMM?

gdalle · May 2, 2024, 7:01pm

Do you mean HMMs where the dependency is on the previous k states? If so, no, but you can probably trick the package into doing what you want by encoding k states into a single integer and using a sparse transition matrix.
Some discussion of this workaround can be found in this issue, and I can also help you if you’re stuck

MikeAinOz · May 4, 2024, 9:18pm

This looks like great work, congratulations, I did notice that the tutorial link ending “/dev/tutorial” above doesn’t work and gives a 404:+1:

apo383 · May 5, 2024, 3:50am

Doesn’t that violate the Markov property? I thought Markov models were defined as dependent on previous state alone.

gdalle · May 5, 2024, 5:13am

The first message was written with a previous version of the package, where the documentation was structured differently. Now there is more than one tutorial, hence the broken link Here’s the first one:

gdalle · May 5, 2024, 5:13am

It is indeed a generalization of the Markov property to depend on the past k states, but as I have hinted before, you can reformulate such models to depend only on the last state by changing what you put in the state. So it’s not really more generic

adannenberg · October 6, 2024, 4:32pm

Question: Does the package support continuous-valued state variables in addition to discrete-valued state variables? The examples and documentation seem limited to discrete-valued state variables (finite-size vector of probabilities for initial values of state variable, evolution in terms of a set of transition probabilities rather than a transition matrix and noise covariance matrix).

gdalle · October 6, 2024, 5:55pm

No, if you’re thinking SSMs and things like Kalman filter, that’s not supported by HiddenMarkovModels.jl. But there are other packages in Julia to do that, although I’m not familiar with them.

adannenberg · October 6, 2024, 6:07pm

Quel dommage! But thank you for the rapid response. Maybe the package should be renamed DiscreteHiddenMarkovModels or, less jokingly, the documentation should make more explicit the fact that the package is for HMMs in which the state variable is discrete. I’ll keep browsing… Thanks again.

gdalle · October 6, 2024, 7:39pm

Sounds like a good idea! Do you want to open a docs PR?

adannenberg · October 6, 2024, 9:30pm

Sure, if you’d like. I haven’t done that before, though, so just let me know the steps please. Also, just out of curiosity, since you’re the package author and we’ve already communicated what will be the benefit?

gdalle · October 6, 2024, 10:17pm

Of course I can do the docs PR myself, so don’t stress about it. But putting myself in the shoes of someone who has never contributed to a package before (as seems to be your case?), I think it’s nice to be encouraged to do it, especially for something low-stakes like updating documentation. It was meant as an invitation and an offer to guide you through it, not an obligation.

Topic		Replies	Views
[ANN] HMMBase.jl - A lightweight and efficient Hidden Markov Model abstraction Package Announcements statistics	17	5314	February 26, 2024
Hidden Markov Model with fixed parameters Probabilistic Programming turing , hmm	12	185	March 9, 2025
Problem with installing installing HiddenMarkovModels.jl New to Julia	13	779	June 24, 2021
[ANN] BetaML.jl.. yet an other (simple) Machine Learning Package Package Announcements package , announcement , machine-learning	16	3340	May 15, 2024
EM algorithm for HMM-GLM models in Julia? Probabilistic Programming glm , hmm , markov-decision	5	167	September 6, 2024