Inconsistent CPU utilisation in @threads loops

So I was trying to do exactly what was asked in Question on TaskLocalValue.
Say I have a matrix M and I need to calculate eigenvectors of matrices M .+ i for i in 1:100 with 8 cores. But matrices M .+ i are large and I cannot preallocate a hundred of those; I create task-local buffers m instead (and on each iteration I would then do m .= M .+ i). This does the job, but I observe the same core-stopping behaviour as in the original MWE. Anyway, I am not too concerned now that I know that the correct way to do this is to use Channels :smile: