|
Challenge: Can you beat Python and C++ in Int4 Matrix-Vector Multiply Op?
|
|
10
|
1625
|
July 25, 2023
|
|
Squeezing max performance out of findlast
|
|
0
|
272
|
July 24, 2023
|
|
Solving ODE inside a double for loop
|
|
9
|
371
|
July 24, 2023
|
|
Having issues speeding up code with multithreading
|
|
19
|
726
|
July 16, 2023
|
|
Fast Hessian and Gradient for PINNS using Enzyme/Zygote
|
|
0
|
382
|
July 23, 2023
|
|
Runtime of program using a large amount of memory stalls
|
|
4
|
339
|
July 22, 2023
|
|
Push! vs. pushfirst! performance issue with BitArrays
|
|
1
|
218
|
July 20, 2023
|
|
`@fastmath` is not applied to macros
|
|
5
|
470
|
July 20, 2023
|
|
Multithreading using more CPUs than expected
|
|
11
|
679
|
July 20, 2023
|
|
Optimizing nested loops with conditional inside
|
|
10
|
574
|
July 19, 2023
|
|
Better ways to deal with CompatHelper and compatibility upgrades
|
|
1
|
266
|
July 18, 2023
|
|
Optimizing dinucleotides count in a DNA sequence type `LongDNA`
|
|
21
|
983
|
July 17, 2023
|
|
Define multiple methods or one method with union types?
|
|
4
|
468
|
July 17, 2023
|
|
Efficiently computing Hessians of Neural Networks output with respect to inputs
|
|
1
|
287
|
July 16, 2023
|
|
Inline function returns tuple of mixed type, assigned to a tuple of variables in the caller
|
|
2
|
216
|
July 14, 2023
|
|
In-place matrix operations slower?
|
|
9
|
486
|
July 14, 2023
|
|
Comparing Julia structs
|
|
8
|
1633
|
July 13, 2023
|
|
Performance Warning when Solving Parameterized ODE
|
|
2
|
465
|
July 13, 2023
|
|
Broadcasting Heisen-allocations
|
|
2
|
282
|
July 13, 2023
|
|
Is there a function to make an abstractly-typed variable "more" concrete?
|
|
11
|
522
|
July 11, 2023
|
|
Eliminate runtime dispatch for repeated @async calls
|
|
0
|
256
|
July 11, 2023
|
|
Efficient approach to multiply three matrices (M1*M2*M3) and two vectors and a matrix (x*M*y)
|
|
18
|
6396
|
July 10, 2023
|
|
Help to reduce memory allocations in a function
|
|
2
|
248
|
July 10, 2023
|
|
Task/thread-local caches/buffers
|
|
12
|
703
|
July 9, 2023
|
|
Shared data between processes
|
|
0
|
202
|
July 8, 2023
|
|
Low rank factorized AbstractMatrix
|
|
18
|
1074
|
July 7, 2023
|
|
Mapreduce performance and dispatch
|
|
0
|
230
|
July 7, 2023
|
|
1.10 alpha significantly slower to start REPL
|
|
9
|
715
|
July 7, 2023
|
|
Customize a random function to sample 3 out of a list of 4097 real numbers
|
|
5
|
297
|
July 7, 2023
|
|
Help me improve this simple function
|
|
12
|
349
|
July 5, 2023
|