Strange performance of a loop

RoyiAvital · July 20, 2018, 3:57pm

kristoffer.carlsson:

The alignment is set here

https://github.com/JuliaLang/julia/blob/9ed628780776f59bd5e52fa3d7f8244d615a0301/src/julia_internal.h#L252-L253

and depends on the size of the array. Large arrays are already 64 bytes aligned:
julia> Int(pointer(rand(32))) % 64
48

julia> Int(pointer(rand(1024))) % 64
0

What you show is only the address of the 1st element.
What I suggest is making sure any Loop with vectorization won’t have “Anomaly” to take care of.

I meant something like Intel IPP.

If we define 1D array it will be padded to have size which is multiplication of 16 / 32 / 64 Bytes.
If you define 2D array it will be padded with rows which are also multiplication of 16 / 32 / 64 Bytes.

This way all loops will be able to be unrolled and vectorized with no issues about taking care of edge cases.

Topic		Replies	Views
Arithmetic performance of expression Performance	11	363	October 4, 2022
The cost of size() in for-loops Performance	11	1787	July 20, 2018
Evaluating the "for" condition Performance question	10	481	June 17, 2021
Seemingly large permformance regression on "^" power operator in >=1.6 General Usage bug , performance	2	406	April 28, 2021
Matrix exponential slower in Julia 0.7 / 1.0? Performance	7	2629	September 12, 2018

Strange performance of a loop

Related topics