Correlation

gideonsimpson · November 3, 2017, 5:47am

I have an array of two vectors and I want to compute the correlation of the components. Currently, I am computing this as

mean([X[j]*X[j]' for j=1:n_batch_iters])

which gives the desired result, but I was curious if this is the most efficient approach.

juliohm · November 3, 2017, 6:47am

You question is not clear, please give the dimensions of this array and what correlation you are interested in. In any case, try to use Julia’s cor(x,y) function if you can.

gideonsimpson · November 5, 2017, 2:10am

Each X[j] is itself an array of length d and there are a total of N of them. Essentially, what I want to compute, is

\frac{1}{N}\sum_{n=1}^N X_n^{(i)} X_n^{(j)}

for each (i,j) pair, with X^{(i)}_n corresponding to the i-th component of X[n]

ChrisRackauckas · November 5, 2017, 2:35am

The covariance of a random variable with itself is the variance, so var(X)?

gideonsimpson · November 6, 2017, 11:17pm

var does not appear to take an array of arrays as its arguments.

juliohm · November 7, 2017, 12:25am

So what do you do in this case?

You can start with a matrix instead of an array of arrays and use var directly
or
You can stick with your original implementation (which assumes zero mean vectors)

gideonsimpson · November 8, 2017, 4:35am

Any thoughts on the efficiency of one method over the other?

dpsanders · November 8, 2017, 6:17am

Try it and see?

Topic		Replies	Views
How to compute the correlation between elements in an array of arrays? New to Julia question	4	870	March 10, 2020
The equivalent of R's var() for matrix? Statistics	2	572	August 27, 2018
Whats the easiest way to create correlation matrices in Julia? New to Julia question , statistics	5	4294	November 5, 2021
Cross-correlation function in Julia vs. Python Statistics question , statistics	12	8624	March 21, 2018
The most efficient way to calculate the pairwise correlation between rows of a large Matrix{Float64} Performance performance	10	799	May 25, 2023

Correlation

Related topics