|
How to implement performance regression tests?
|
|
5
|
198
|
September 23, 2025
|
|
Apple M4 Max AMX Linear Algebra performance versus CPU and GPU
|
|
2
|
401
|
September 22, 2025
|
|
Multi-threaded Array building
|
|
22
|
443
|
September 20, 2025
|
|
OhMyThreads, ChunkSplitters, and Cost Estimates
|
|
2
|
263
|
September 16, 2025
|
|
Preventing Enzyme from differentiating through constant computations
|
|
1
|
99
|
September 16, 2025
|
|
Find the max element of a numeric vector iteratively with mask and early break
|
|
10
|
291
|
September 16, 2025
|
|
Identical method redefinition suspiciously optimizes runtime and allocations
|
|
25
|
616
|
September 15, 2025
|
|
Praise: CUDA.allowscalar(false) is great
|
|
0
|
159
|
September 13, 2025
|
|
Taking power with float exponent x^y is slower than exp(y*log(x))?
|
|
8
|
366
|
September 11, 2025
|
|
Strange slowdown with @threads :greedy with BigInts
|
|
7
|
157
|
September 9, 2025
|
|
Error in program for coupled PDE
|
|
0
|
76
|
September 8, 2025
|
|
`log` calling `fma_emulated` on hardware that doesn't need it
|
|
2
|
99
|
September 7, 2025
|
|
Why does `Union{Int,...}` in a `Vector` not cause allocations but `Union{Float64,...}` does?
|
|
8
|
195
|
September 3, 2025
|
|
Multi-threaded processing of a Dict
|
|
8
|
232
|
September 1, 2025
|
|
SIMD.jl - vload() continuous blocks from higher-dimensional arrays?
|
|
4
|
148
|
August 31, 2025
|
|
Using control flow in Reactant
|
|
4
|
128
|
August 29, 2025
|
|
Large execution time jumps
|
|
2
|
128
|
August 29, 2025
|
|
Using Reactant with Lux and Enzyme to speed up training in physics context
|
|
16
|
304
|
August 28, 2025
|
|
Performance of `exp(A)` for 9x9 anti-Hermitian matrix: Julia vs. PyTorch vs. MATLAB (CPU & GPU)
|
|
29
|
1244
|
August 28, 2025
|
|
Latex fonts on axis and labels in Makie
|
|
7
|
167
|
August 27, 2025
|
|
Dagger not fully utilizing CPU cores
|
|
5
|
223
|
August 21, 2025
|
|
When does it make sense to use `Base.MultiplicativeInverses`
|
|
6
|
136
|
August 19, 2025
|
|
ODBC querying Snowflake is super slow
|
|
3
|
137
|
August 19, 2025
|
|
Need perf help on paralel var
|
|
4
|
81
|
August 19, 2025
|
|
Abusing `convert` as an alternative to `Union{Nothing, Int64}`
|
|
15
|
436
|
August 19, 2025
|
|
Regular println vs Core.stdout
|
|
10
|
298
|
August 19, 2025
|
|
DistributedNext FYI
|
|
0
|
105
|
August 19, 2025
|
|
Using particle filter to estimate the state through the observations of another parameter that depends on state
|
|
1
|
82
|
August 19, 2025
|
|
Thread safe memoization
|
|
17
|
277
|
August 18, 2025
|
|
Parallel Processing and Eigenvalue calculation
|
|
2
|
125
|
August 18, 2025
|