| Topic | Replies | Views | Date |
|---|---|---|---|
| AMD Rome vs Intel Xeon shows bad scaling with threads for AMD | 20 | 1824 | March 13, 2022 |
| Same code run multiple times gives wildly different timings | 39 | 1811 | February 24, 2022 |
| Speeding up Matrix multiplication involving dot and hadamard product | 11 | 1464 | February 9, 2022 |
| Performance of naive convolution against Python Numpy | 20 | 2732 | February 4, 2022 |
| Outperformed by Matlab | 54 | 4113 | November 23, 2021 |
| How to control threads in combination of LoopVectorization and @spawn | 6 | 648 | February 1, 2022 |
| Evaluating @view macro before @tturbo | 1 | 373 | January 27, 2022 |
| Computational performance 3D finite difference stencil for different vectorization methods and precision | 1 | 488 | January 13, 2022 |
| Recommended way to use something like `evalpoly` with `@turbo` from LoopVectorization.jl | 2 | 514 | November 18, 2021 |
| 2D nested loop gives big performance hit in interpolation routine while it should not | 2 | 375 | November 17, 2021 |
| Parallelization of for loop | 3 | 1008 | October 31, 2021 |
| Gradient evaluation with ForwardDiff and LoopVectorization | 2 | 883 | October 24, 2021 |
| Sparse matrix coefficient computation with ForwardDiff and LoopVectorization | 0 | 533 | October 17, 2021 |
| LoopVectorization does not support functions with kwargs? | 9 | 1207 | October 12, 2021 |
| Clustering using matrix decomp code performance tips | 5 | 623 | October 2, 2021 |
| Multiple Outputs (Tuple) from IfElse with VectorizationBase | 3 | 515 | September 16, 2021 |
| Product of two symmetric matrices: LoopVectorization.jl vs LinearAlgebra | 9 | 946 | August 31, 2021 |
| LoopVectorization: @turbo performs worse than @inbounds on trivial loop | 9 | 2009 | August 28, 2021 |
| How to use threads in a reduction with LoopVectorization? | 3 | 641 | August 23, 2021 |
| Efficient use of @turbo for linear algebra operations (LoopVectorization.jl) | 6 | 3644 | August 21, 2021 |
| Julia Beginner (from Python): Numba outperforms Julia in rewrite. Any tips to improve performance? | 56 | 5802 | August 18, 2021 |
| LoopVectorization triggers segfault or deadlock for complex finite difference stencil | 1 | 457 | July 27, 2021 |
| How to improve performance of sum() | 19 | 4545 | July 19, 2021 |
| LoopVectorization.jl vmap gives an error ::VectorizationBase.Vec{4, Int64} | 17 | 952 | July 22, 2021 |
| Fastest way of contracting arrays | 8 | 697 | July 10, 2021 |
| LoopVectorization almost doubles execution time? | 6 | 654 | July 9, 2021 |
| I just decided to migrate from Python+Fortran to Julia as Julia was faster in my test | 37 | 6954 | June 25, 2021 |
| @turbo speeds routine, slows down everything else | 16 | 2554 | June 5, 2021 |
| Problem with @avx and LoopVectorization: UndefVarError | 3 | 713 | June 2, 2021 |
| Accelerate Non-linear function evaluation | 17 | 1281 | April 6, 2021 |