|
Sparse matrix multiplication for Metal
|
|
15
|
441
|
July 31, 2025
|
|
DiffEqGPU Trajectory Failure Handling and Heterogeneous Trajectories
|
|
4
|
134
|
July 22, 2025
|
|
Does AMDGPU.jl support integrated graphics?
|
|
3
|
266
|
July 19, 2025
|
|
Kernel with dynamic parallelism seems to be calling CPU functions
|
|
4
|
164
|
July 19, 2025
|
|
Out of dynamic GPU memory?
|
|
8
|
1575
|
July 16, 2025
|
|
Batched Hessian-Vector Product (on the GPU)
|
|
0
|
52
|
July 1, 2025
|
|
Relation between KernelAbstractions and Adapt
|
|
1
|
114
|
June 30, 2025
|
|
Cannot manage to use CUDA.atomic_add!
|
|
4
|
93
|
June 30, 2025
|
|
Heterogeneous random seeding
|
|
1
|
71
|
June 25, 2025
|
|
AMDGPU on AI HX370 versioninfo() crashes
|
|
4
|
327
|
June 8, 2025
|
|
CUDA | custom structs
|
|
3
|
162
|
June 6, 2025
|
|
Why is my GPU kernel an order of magnitude slower than my CPU function?
|
|
8
|
313
|
June 4, 2025
|
|
KernelAbstractions.get_backend(::BitArray) causes StackOverflowError
|
|
1
|
42
|
June 2, 2025
|
|
Porting cuda example to rocm amdgpu
|
|
0
|
54
|
June 2, 2025
|
|
`check-bounds=no` causes illegal memory access when using `rand()` in CUDA kernel
|
|
3
|
127
|
May 31, 2025
|
|
Current state of Metal.jl for ML and SciML
|
|
2
|
280
|
May 30, 2025
|
|
CUDA suddenly crashes with check-bounds=no, used to work fine
|
|
1
|
92
|
May 30, 2025
|
|
Unusually Slow First Device-to-Host Copy on A100 GPU
|
|
6
|
247
|
May 27, 2025
|
|
AdaptiveCpp integration?
|
|
9
|
382
|
May 20, 2025
|
|
Ragged Tensors with generic GPU code
|
|
2
|
107
|
May 20, 2025
|
|
GPU is slower than CPU for findall on a CuArray
|
|
2
|
212
|
May 14, 2025
|
|
CUDA | nested loops kernel
|
|
5
|
219
|
May 12, 2025
|
|
Warning: Package cuDNN not found in current path
|
|
5
|
1040
|
May 8, 2025
|
|
Errors reported during Pkg.test("CUDA")
|
|
6
|
130
|
April 28, 2025
|
|
Getting GPU info
|
|
4
|
1404
|
April 24, 2025
|
|
CUDA(.jl) memory errors for very large kernels
|
|
24
|
670
|
April 22, 2025
|
|
Val{N} + LinearIndices Causes Massive Compile-Time Unrolling
|
|
2
|
106
|
April 20, 2025
|
|
Possible to design a compiler for Raspberry Pi GPU?
|
|
0
|
112
|
April 19, 2025
|
|
How to avoid "unsupported dynamic function invocation" in CUDA with nested gradients
|
|
2
|
88
|
April 15, 2025
|
|
Is there a way to use @allowscalar in a heterogeneous manner using KernelAbstractions?
|
|
3
|
98
|
April 10, 2025
|