How to perform parallel vector addition?

in general something as simple as adding 2 vectors won’t benefit from multithreading (you’ll be bottle-necked by memory speed). Also, you’ll get roughly 3x better performance from a .+= b which will update a to have b added to it rather than allocate a new array and fill it with the sum.