Understanding the performance and overhead of a vector of SOA vs a vector of AOS for SIMD and the effect of push!

f.ij · June 23, 2023, 11:13am

My apologies, it seems I have been a bit sloppy. I was trying to write this up quickly and in between a lot of other activities. The first function above, getEFactor of course should return the float efac. Also, I pushed the latest version of my code, everything should be working immediately. The file code.jl contains all the relevant functions, and the tests can be run after importing that file, which should reproduce my findings directly.

I’m really hoping someone could give me some insight on why there is such a big discrepancy between the performance of the getEFactor function and the performance of the actual simulation loop. For high connectivity, most of the time of a single iteration of the loop should be taken by evaluating this function, so I would expect to see a similar performance ratio, which as seen from my results, is not the case at all.

Topic		Replies	Views
Bad performance looping over SOAs with SIMD General Usage	3	357	July 10, 2023
Struct of Arrays (SoA) vs Array of Structs (AoS) Performance performance	30	9676	March 12, 2022
Performance scaling of broadcasting compared to looping Performance question , optimization , loops , broadcasting	19	738	March 21, 2023
Poor performance of SIMD vectorization in the latest version of Julia (v1.11.2) Performance performance	19	818	January 8, 2025
LoopVectorization: @turbo performs worse than @inbounds on trivial loop New to Julia question , simd , loopvectorization	9	2099	August 28, 2021

Understanding the performance and overhead of a vector of SOA vs a vector of AOS for SIMD and the effect of push!

Related topics