| Topic | Replies | Views | Date |
|---|---|---|---|
| Different `@code_llvm` output on macos and x86 | 4 | 86 | December 8, 2023 |
| Optimizing Direct 2D Convolution Code | 14 | 242 | November 23, 2023 |
| Understanding the performance and overhead of a vector of SOA vs a vector of AOS for SIMD and the effect of push! | 1 | 215 | June 23, 2023 |
| Is this a valid use of simd? | 2 | 179 | June 16, 2023 |
| Optimizing Direct 1D Convolution Code | 21 | 814 | April 28, 2023 |
| `@turbo` producing different (and wrong) results compared to `@inbounds @simd` | 3 | 251 | March 30, 2023 |
| Question on multithreading/vectorizing loops | 9 | 530 | March 22, 2023 |
| Major performance boost when precaching random inputs to `exp`? | 8 | 652 | September 25, 2022 |
| Vectorize but break early? | 3 | 342 | September 20, 2022 |
| LoopVectorization: @turbo performs worse than @inbounds on trivial loop | 9 | 1673 | August 28, 2021 |
| PaddedViews very slow | 7 | 450 | August 25, 2021 |
| Why is this @simd loop faster than a while loop even if it has longer assembly? | 6 | 1165 | August 1, 2021 |
| SIMD Complex Numbers | 19 | 1735 | July 22, 2021 |
| LoopVectorization.jl vmap gives an error ::VectorizationBase.Vec{4, Int64} | 17 | 720 | July 22, 2021 |
| A simple SIMD.jl loop that is slower than a vanilla `@inbounds @simd` | 8 | 1333 | June 27, 2021 |
| Why is this small `@inline` function much slower than an equivalent macro? | 2 | 741 | June 26, 2021 |
| How to do SIMD code with wide-register accumulators (@simd vs LoopVectorization.jl vs SIMD.jl) | 11 | 2378 | June 22, 2021 |
| How to choose vec size in SIMD.jl | 5 | 476 | June 21, 2021 |
| Julia alignas: is there a way to specify the alignment of Julia objects in memory? | 15 | 1511 | March 20, 2021 |
| SIMD and compiled code | 5 | 777 | December 19, 2020 |
| Threads.@spawn performance | 7 | 957 | October 22, 2020 |
| Fast floating point quantisation / rounding | 3 | 639 | May 17, 2020 |
| Division isn't auto-vectorized when function inlined | 12 | 926 | March 11, 2020 |
| Relative performance discrepancy across cpus even with --cpu-target set | 5 | 911 | August 14, 2019 |
| Loop vectorization depends on number of terms in stencil | 1 | 466 | July 30, 2019 |
| Why are vectorized operations faster in julia? | 3 | 6423 | April 1, 2019 |
| @simd with array views | 5 | 1064 | January 4, 2019 |
| Is it possible to use @simd with multiple variable loop? | 5 | 789 | September 4, 2017 |
| Exponentially long compile times in generated functions that disappear after manual inlining | 4 | 979 | July 8, 2017 |
| How to compile AVX instructions | 1 | 1203 | May 14, 2017 |