[ANN] CovarianceEstimation.jl -- fast and lightweight covariance estimation

mateuszbaran · January 25, 2019, 10:23am

Hello,

CovarianceEstimation.jl is a new package for estimating covariance of given samples. It currently focuses on linear shrinkage methods (but has one nonlinear shrinkage algorithm) and is written in pure Julia.

There already is a similar package, CovarianceMatrices.jl but it calculates covariance of coefficient of regression models. CovarianceEstimation.jl is also aiming to be more lightweight.

I would like to thank Thibaut Lienart for his valuable contributions. Thanks to his work the package turned out to be significantly faster than scikit-learn and corpcor.

Kind regards
Mateusz Baran

tk3369 · January 26, 2019, 8:38pm

I’m not a statistician… why would one estimate covariance instead of running the standard cov function?

mateuszbaran · January 26, 2019, 10:08pm

I’m not a statistician either but the issue is that (especially when you have more features than observations), standard estimators implemented by cov are quite poor. There are other estimators that, by assuming certain properties of analyzed random variables, can return an answer that is closer to true covariance. For example, you can use such estimators to improve the predictive power of LDA.

tlienart · January 27, 2019, 2:46am

The standard “canonical” covariance estimator is known to be badly ill conditioned in a large number of use cases, indeed for instance when the number of samples is around or under the number of features, and generally when the number of features is large. And so for instance if you want to recover the precision matrix (inverse of the cov) which is useful to estimate dependence structures, the estimator will blow up.

There’s a simple benchmark in the docs showing a plot of the MSE from the generating covariance matrix (“ground truth”) to the recovered one where you can see that the estimators we implemented get significantly lower MSE than the canonical estimator.

Finally it’s worth noting that linear shrinkage estimators (and even in some cases the non linear one we implemented) have the same computational complexity as the base estimator and so are pretty much just as quick to get (it’s worth being noted because the literature on covariance estimators is very messy and a number of methods are completely impractical from a computational perspective…)

kai · April 17, 2021, 2:10am

Can I use it to estimate regularized precision matrix?

mateuszbaran · April 18, 2021, 10:14am

I think you can just invert the covariance matrix estimated using CovarianceEstimation.jl for that.

Topic		Replies	Views
Realised covariance estimator General Usage question	1	227	April 24, 2021
Covariance matrix estimation using HF data General Usage	0	286	April 16, 2021
[ANN] Linear Regression v0.7-alpha Package Announcements statistics , regression	18	2103	December 6, 2021
Variance-Covariance matrix with missing data Performance statistics , missing-values	5	1198	June 25, 2022
HighFrequencyCovariance.jl - Algorithms for efficiently estimating covariance matrices with high frequency financial data Package Announcements statistics , time-series , finance	1	1025	January 17, 2021

[ANN] CovarianceEstimation.jl -- fast and lightweight covariance estimation

Related topics