|
Q: No SubArray type required for passing partial multidimensional CuArrays?
|
|
4
|
100
|
November 4, 2025
|
|
Efficient lookup of UInt indices in large GPU arrays
|
|
0
|
77
|
November 3, 2025
|
|
Understanding and optimizing Enzyme.jl Reverse AD on CUDA
|
|
5
|
243
|
October 25, 2025
|
|
Error in oneAPI.jl tests
|
|
3
|
164
|
October 18, 2025
|
|
CUDA.jl calling kernels in parallel?
|
|
1
|
170
|
October 11, 2025
|
|
Mixing CUDA.jl with external GPU compute (OpenMM / DLPack.jl)
|
|
0
|
74
|
October 9, 2025
|
|
Dense Matrix sparse binary vector product
|
|
2
|
119
|
October 7, 2025
|
|
CUDA | Avoid divide by zero in kernel using assume()
|
|
10
|
352
|
October 7, 2025
|
|
Slow matrix multiplication in CUBLAS.gemm_strided_batched with ComplexF64
|
|
1
|
213
|
October 7, 2025
|
|
KernelAbstractions + Enzyme - how to do GPU-side autodiff?
|
|
1
|
220
|
September 25, 2025
|
|
RCCL wrapping
|
|
4
|
226
|
September 20, 2025
|
|
CUDA.jl: Unexpected `mapreduce` error: threads per block exceed GPU limit (640 > 512
|
|
9
|
368
|
September 18, 2025
|
|
CUDA.jl: Warning about loading library from system path
|
|
4
|
273
|
August 30, 2025
|
|
cuSOLVER: two calls to cusolverDnDgesvdj_bufferSize, one via Juila, the other via CUDA yield (very) different results
|
|
2
|
126
|
August 22, 2025
|
|
What is the correct way to use multiple GPUs in Slurm cluster?
|
|
0
|
405
|
August 20, 2025
|
|
Trying to parallelize using CUSOLVERRF.jl with @threads
|
|
7
|
247
|
August 19, 2025
|
|
Metal.jl does not speed up FFT
|
|
8
|
2266
|
August 13, 2025
|
|
UndefVarError: cuda_version in Google Colab with CUDA.jl
|
|
2
|
98
|
August 12, 2025
|
|
Using getrf_batched to find matrix inverses
|
|
2
|
140
|
August 7, 2025
|
|
Sparse matrix multiplication for Metal
|
|
15
|
686
|
July 31, 2025
|
|
DiffEqGPU Trajectory Failure Handling and Heterogeneous Trajectories
|
|
4
|
181
|
July 22, 2025
|
|
Does AMDGPU.jl support integrated graphics?
|
|
3
|
372
|
July 19, 2025
|
|
Kernel with dynamic parallelism seems to be calling CPU functions
|
|
4
|
246
|
July 19, 2025
|
|
Out of dynamic GPU memory?
|
|
8
|
1639
|
July 16, 2025
|
|
Batched Hessian-Vector Product (on the GPU)
|
|
0
|
73
|
July 1, 2025
|
|
Relation between KernelAbstractions and Adapt
|
|
1
|
158
|
June 30, 2025
|
|
Cannot manage to use CUDA.atomic_add!
|
|
4
|
144
|
June 30, 2025
|
|
Heterogeneous random seeding
|
|
1
|
109
|
June 25, 2025
|
|
AMDGPU on AI HX370 versioninfo() crashes
|
|
4
|
436
|
June 8, 2025
|
|
CUDA | custom structs
|
|
3
|
217
|
June 6, 2025
|