|
GPU slower than CPU for simple benchmarks
|
|
7
|
474
|
September 23, 2024
|
|
Where to run CI on GPUs?
|
|
3
|
200
|
September 20, 2024
|
|
Caveats to reusing CuArray memory by changing .dims?
|
|
6
|
200
|
September 18, 2024
|
|
CUDA and NVTX fail to precompile on cluster
|
|
7
|
252
|
September 16, 2024
|
|
Extra memory allocation when using closure with CUDA
|
|
2
|
119
|
September 15, 2024
|
|
Any function like `push!` for `CuArray`
|
|
2
|
184
|
September 8, 2024
|
|
AMDGPUBackend is missing
|
|
5
|
230
|
September 8, 2024
|
|
Slice the type CuSparseMatrixCSC matrix
|
|
2
|
102
|
September 8, 2024
|
|
Batch matrix/vector operations with CUDA.jl
|
|
5
|
644
|
September 4, 2024
|
|
Testing GPU compatability in CI
|
|
2
|
153
|
September 4, 2024
|
|
Writing a Metal Kernel
|
|
9
|
807
|
September 1, 2024
|
|
Need a basic example on using custom structs in CUDA.jl with Adapt.jl
|
|
2
|
388
|
August 31, 2024
|
|
Float16 with AMDGPU
|
|
10
|
394
|
August 30, 2024
|
|
Source code annotation using NVTX in CUDA.jl
|
|
2
|
118
|
August 28, 2024
|
|
Matrix Multiplication Using oneAPI.jl Fails on Second Invocation
|
|
3
|
195
|
August 26, 2024
|
|
Questions about CUDA.dot() function
|
|
4
|
663
|
August 25, 2024
|
|
Synchronize streams in CUDA.jl
|
|
11
|
1008
|
August 23, 2024
|
|
Most efficient way to find cholesky decomposition of slices of a 3D array in KernelAbstractions
|
|
4
|
169
|
August 21, 2024
|
|
Putting obj files on the GPU with Metal.jl
|
|
0
|
65
|
August 20, 2024
|
|
Invalid LLVM IR error using CUDA
|
|
3
|
357
|
August 17, 2024
|
|
Raspberry Pi AI Kit
|
|
1
|
254
|
August 15, 2024
|
|
Different results with different macros for profiling
|
|
3
|
164
|
August 6, 2024
|
|
Using views of CuArray with CUDA-aware MPI is extremely slow
|
|
14
|
499
|
August 5, 2024
|
|
Delays shown in Nsight Systems between HtoD memcopy and kernel launch when using CUDA.jl
|
|
9
|
339
|
July 31, 2024
|
|
Test a package with a CUDA dependency through GitHub Actions CI
|
|
2
|
342
|
July 31, 2024
|
|
Dynamic parallelism slow in CUDA.jl
|
|
1
|
130
|
July 25, 2024
|
|
CUDA: CUDA driver not found
|
|
24
|
830
|
July 24, 2024
|
|
Variable scoping issue when using multiple GPUs in CUDA.jl
|
|
1
|
67
|
July 17, 2024
|
|
Suggestion: abstraction for integrated GPUs?
|
|
7
|
293
|
July 16, 2024
|
|
Metal Kernel 3D indices
|
|
4
|
251
|
July 14, 2024
|