in general something as simple as adding 2 vectors won’t benefit from multithreading (you’ll be bottle-necked by memory speed). Also, you’ll get roughly 3x better performance from a .+= b
which will update a
to have b
added to it rather than allocate a new array and fill it with the sum.