Too much garbage collection for a simple vector addition operation | 12 | 630 | July 29, 2023
X * y + z does not automatically use FMA instruction | 31 | 2590 | July 27, 2023
Help with speeding up this code | 31 | 1493 | July 26, 2023
Challenge: Can you beat Python and C++ in Int4 Matrix-Vector Multiply Op? | 10 | 1515 | July 25, 2023
Understanding memory usage in Julia | 20 | 2984 | July 25, 2023
Squeezing max performance out of findlast | 0 | 260 | July 24, 2023
Solving ODE inside a double for loop | 9 | 290 | July 24, 2023
Having issues speeding up code with multithreading | 19 | 612 | July 16, 2023
Fast Hessian and Gradient for PINNS using Enzyme/Zygote | 0 | 359 | July 23, 2023
Runtime of program using a large amount of memory stalls | 4 | 295 | July 22, 2023
Push! vs. pushfirst! performance issue with BitArrays | 1 | 201 | July 20, 2023
`@fastmath` is not applied to macros | 5 | 416 | July 20, 2023
Multithreading using more CPUs than expected | 11 | 560 | July 20, 2023
Optimizing nested loops with conditional inside | 10 | 482 | July 19, 2023
Better ways to deal with CompatHelper and compatibility upgrades | 1 | 242 | July 18, 2023
Optimizing dinucleotides count in a DNA sequence type `LongDNA` | 21 | 843 | July 17, 2023
Define multiple methods or one method with union types? | 4 | 410 | July 17, 2023
Efficiently computing Hessians of Neural Networks output with respect to inputs | 1 | 261 | July 16, 2023
Inline function returns tuple of mixed type, assigned to a tuple of variables in the caller | 2 | 191 | July 14, 2023
In-place matrix operations slower? | 9 | 405 | July 14, 2023
Comparing Julia structs | 8 | 1463 | July 13, 2023
Performance Warning when Solving Parameterized ODE | 2 | 426 | July 13, 2023
Broadcasting Heisen-allocations | 2 | 258 | July 13, 2023
Is there a function to make an abstractly-typed variable "more" concrete? | 11 | 446 | July 11, 2023
Eliminate runtime dispatch for repeated @async calls | 0 | 238 | July 11, 2023
Efficient approach to multiply three matrices (M1*M2*M3) and two vectors and a matrix (x*M*y) | 18 | 5862 | July 10, 2023
Help to reduce memory allocations in a function | 2 | 230 | July 10, 2023
Task/thread-local caches/buffers | 12 | 579 | July 9, 2023
Shared data between processes | 0 | 190 | July 8, 2023
Low rank factorized AbstractMatrix | 18 | 935 | July 7, 2023