|
Porting cuda example to rocm amdgpu
|
|
0
|
62
|
June 2, 2025
|
|
`check-bounds=no` causes illegal memory access when using `rand()` in CUDA kernel
|
|
3
|
157
|
May 31, 2025
|
|
Current state of Metal.jl for ML and SciML
|
|
2
|
440
|
May 30, 2025
|
|
CUDA suddenly crashes with check-bounds=no, used to work fine
|
|
1
|
115
|
May 30, 2025
|
|
Unusually Slow First Device-to-Host Copy on A100 GPU
|
|
6
|
295
|
May 27, 2025
|
|
AdaptiveCpp integration?
|
|
9
|
535
|
May 20, 2025
|
|
Ragged Tensors with generic GPU code
|
|
2
|
142
|
May 20, 2025
|
|
GPU is slower than CPU for findall on a CuArray
|
|
2
|
229
|
May 14, 2025
|
|
CUDA | nested loops kernel
|
|
5
|
275
|
May 12, 2025
|
|
Warning: Package cuDNN not found in current path
|
|
5
|
1108
|
May 8, 2025
|
|
Errors reported during Pkg.test("CUDA")
|
|
6
|
165
|
April 28, 2025
|
|
Getting GPU info
|
|
4
|
1449
|
April 24, 2025
|
|
CUDA(.jl) memory errors for very large kernels
|
|
24
|
815
|
April 22, 2025
|
|
Val{N} + LinearIndices Causes Massive Compile-Time Unrolling
|
|
2
|
117
|
April 20, 2025
|
|
Possible to design a compiler for Raspberry Pi GPU?
|
|
0
|
125
|
April 19, 2025
|
|
How to avoid "unsupported dynamic function invocation" in CUDA with nested gradients
|
|
2
|
105
|
April 15, 2025
|
|
Is there a way to use @allowscalar in a heterogeneous manner using KernelAbstractions?
|
|
3
|
129
|
April 10, 2025
|
|
With ParallelStencil, is it possible to launch multiple kernels and sync later?
|
|
7
|
204
|
April 9, 2025
|
|
Track function on profiler to CUDA documentation
|
|
1
|
71
|
April 3, 2025
|
|
Optimize loss calculation on gpu
|
|
0
|
65
|
March 28, 2025
|
|
CUDA.jl write to global memory in PTX
|
|
4
|
149
|
March 27, 2025
|
|
Calculate associated Legendre polynomials on the GPU
|
|
3
|
148
|
March 27, 2025
|
|
Lightweight dependency for GPU programming
|
|
7
|
336
|
March 27, 2025
|
|
@inbounds slower
|
|
8
|
492
|
March 25, 2025
|
|
I32 indexing
|
|
8
|
567
|
March 24, 2025
|
|
Floating point exceptions on the gpu
|
|
1
|
132
|
March 24, 2025
|
|
Unable to use AMDGPU.jl on RX6600
|
|
13
|
445
|
March 19, 2025
|
|
Adapt BroadcastStyle for CUDA
|
|
1
|
114
|
March 18, 2025
|
|
Moving ahead with CUDA support
|
|
2
|
333
|
March 17, 2025
|
|
How to benchmark a function that uses KernelAbstractions kernels?
|
|
4
|
221
|
March 17, 2025
|