Thanks for this package! I don’t know anything about vectorization but I learned a lot from this thread.
I’m heavily using scatter/gather operations in my package FixedEffects
(here is the gather part and here is the scatter part) For now, I decompose the vector into nthreads part and do the scatter/gather in each thread.
I guess that one alternative would be to vectorize these parts. Do you think it could benefit from it? I have just tried to experiment but @avx
does not seem to support scatter yet.