Strange performance of a loop

RoyiAvital · July 20, 2018, 7:03pm

I think that prior to Sandy Bridge if you accessed Aligned Data using the non aligned load it wouldn’t be efficient.
In modern CPU’s if the data is aligned it doesn’t matter if you use the load which assumes alignment or not.
But I still think accessing unaligned data is slower than aligned data.

But my point is different.
We must make sure the length of the data allocated it a multiplication of 16 Bytes (For SSE) / 32 Bytes (For AVX) / 64 Byte (For AVX512).

The tricky part is dealing with 1D / 2D / 3D / Etc… arrays.

Topic		Replies	Views
Arithmetic performance of expression Performance	11	371	October 4, 2022
The cost of size() in for-loops Performance	11	1809	July 20, 2018
Evaluating the "for" condition Performance question	10	489	June 17, 2021
Seemingly large permformance regression on "^" power operator in >=1.6 General Usage bug , performance	2	409	April 28, 2021
Matrix exponential slower in Julia 0.7 / 1.0? Performance	7	2640	September 12, 2018

Strange performance of a loop

Related topics