Matrix-vector multiplication: impact of column-major vs row-major (M4 Max)

Are you benchmarking in global scope? Can you try putting the calls in a function?
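
In case it helps, here is a rough sketch of what I mean, assuming this is Julia and that you are timing a plain `A * x` style product. The helper name `time_matvec!`, the size `n`, and the repetition count are all made up for illustration; nothing here is taken from your code.

```julia
using LinearAlgebra  # standard library; provides mul! and transpose

# Hypothetical helper: time `reps` in-place products y = A*x inside a function,
# so that A, x and y are local variables with concrete types rather than
# untyped globals.
function time_matvec!(y, A, x; reps = 100)
    mul!(y, A, x)  # warm-up call so compilation is not included in the timing
    t = @elapsed for _ in 1:reps
        mul!(y, A, x)
    end
    return t / reps  # average seconds per product
end

n = 2_000                    # illustrative size, not from the thread
A = rand(n, n)               # Julia arrays are stored column-major
At = transpose(rand(n, n))   # lazy transpose: same shape, row-major layout
x = rand(n)
y = zeros(n)

println("column-major: ", time_matvec!(y, A, x), " s")
println("row-major:    ", time_matvec!(y, At, x), " s")
```

If you already use BenchmarkTools.jl, interpolating the arguments (`@btime mul!($y, $A, $x)`) should have a similar effect, since it also avoids timing accesses to non-constant globals.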