| About the GPU category | 0 | 2338 | November 2, 2016 |
| CUDA package having issues due to different versions of CUDA toolkit and NVIDIA driver CUDA version | 11 | 195 | January 4, 2026 |
| Converting result of round or floor as Int in Metal | 10 | 299 | December 31, 2025 |
| Why Atomix.@atomic b[] += a[i] works and b[] = b[] + a[i] does not | 6 | 148 | December 18, 2025 |
| Failed to precompile CUDA | 14 | 112 | December 16, 2025 |
| CUDA test failure | 8 | 74 | December 9, 2025 |
| Block/Tile-Based GPU Programming (not Scratch) | 3 | 535 | December 8, 2025 |
| KernelAbstractions + CUDA + Reactant - how to get minimal working example | 12 | 263 | November 30, 2025 |
| Potential issues with implementation of ziggurat algorithm | 0 | 53 | November 26, 2025 |
| Error in testset gpuarrays/linalg/core | 2 | 40 | November 25, 2025 |
| Help with performance issue when upgrading CUDA | 5 | 91 | November 24, 2025 |
| InvalidIRError when running AcceleratedKernels.sum on a GPU SubArray (CuArray view) | 2 | 39 | November 20, 2025 |
| Looking for Windows ARM tester | 6 | 175 | November 19, 2025 |
| Latest CUDA.jl version 5.8.3 fails to install on NVIDIA Jetson Orin with Jetpack 6.2.1+b38 | 6 | 266 | November 12, 2025 |
| GPU memory issue on AMDGPU | 4 | 170 | November 10, 2025 |
| How to call ssyevd | 2 | 109 | November 9, 2025 |
| Wrapping CUDA.jl with juliacall | 4 | 164 | November 7, 2025 |
| Q: No SubArray type required for passing partial multidimensional CuArrays? | 4 | 57 | November 4, 2025 |
| Efficient lookup of UInt indices in large GPU arrays | 0 | 59 | November 3, 2025 |
| Understanding and optimizing Enzyme.jl Reverse AD on CUDA | 5 | 189 | October 25, 2025 |
| Error in oneAPI.jl tests | 3 | 114 | October 18, 2025 |
| CUDA.jl calling kernels in parallel? | 1 | 123 | October 11, 2025 |
| Mixing CUDA.jl with external GPU compute (OpenMM / DLPack.jl) | 0 | 54 | October 9, 2025 |
| Dense Matrix sparse binary vector product | 2 | 92 | October 7, 2025 |
| CUDA \| Avoid divide by zero in kernel using assume() | 10 | 260 | October 7, 2025 |
| Slow matrix multiplication in CUBLAS.gemm_strided_batched with ComplexF64 | 1 | 140 | October 7, 2025 |
| KernelAbstractions + Enzyme - how to do GPU-side autodiff? | 1 | 118 | September 25, 2025 |
| RCCL wrapping | 4 | 171 | September 20, 2025 |
| CUDA.jl: Unexpected `mapreduce` error: threads per block exceed GPU limit (640 > 512 | 9 | 292 | September 18, 2025 |
| CUDA.jl: Warning about loading library from system path | 4 | 177 | August 30, 2025 |