Different results with different macros for profiling | 3 | 125 | August 6, 2024
Using views of CuArray with CUDA-aware MPI is extremely slow | 14 | 307 | August 5, 2024
Delays shown in Nsight Systems between HtoD memcopy and kernel launch when using CUDA.jl | 9 | 221 | July 31, 2024
Test a package with a CUDA dependency through GitHub Actions CI | 2 | 175 | July 31, 2024
Dynamic parallelism slow in CUDA.jl | 1 | 79 | July 25, 2024
CUDA: CUDA driver not found | 24 | 485 | July 24, 2024
Variable scoping issue when using multiple GPUs in CUDA.jl | 1 | 41 | July 17, 2024
Suggestion: abstraction for integrated GPUs? | 7 | 203 | July 16, 2024
Metal Kernel 3D indices | 4 | 205 | July 14, 2024
External functions in GPU ODE example | 7 | 228 | July 11, 2024
Passing `::Type{InconcreteType}` to CUDA kernel | 2 | 92 | July 11, 2024
CUDA.compute_sanitizer() method not defined | 1 | 79 | July 11, 2024
Rubbish values from GPU kernel | 5 | 214 | July 8, 2024
Basic NN forward passage: CuArray fine, oneAPI array: | 0 | 65 | June 28, 2024
Does a function like relu need a kernel? When do you need to write a GPU kernel rather than "just" using CuArray? | 3 | 184 | June 26, 2024
CUDA: unsupported dynamic function invocation for closure | 3 | 324 | June 23, 2024
Running CUDA.jl with JULIA_DEPOT_PATH read-only | 8 | 181 | June 21, 2024
Enzyme CUDA dynamic memory | 12 | 417 | June 17, 2024
Passing a structure with constructor on GPU | 2 | 134 | June 17, 2024
CUB wrapper ccall overhead on Windows | 10 | 149 | June 15, 2024
Calling repeat with a CUDA array changes the state of the random number generator | 2 | 90 | June 14, 2024
Difference between GPUArrays.jl and KernelAbstractions.jl | 4 | 291 | June 12, 2024
Scalar Indexing error when performing mul!(A, B, C) with A a view of a Matrix | 3 | 152 | June 11, 2024
Slicing a CuArray in a kernel | 1 | 129 | June 6, 2024
Simple CUDA kernel on matrix slower than running GPU | 8 | 534 | June 3, 2024
Fresh CUDA and LuxCUDA error ERROR: could not load symbol "cublasLtMatmulDescCreate": | 14 | 438 | May 31, 2024
AMDGPU.jl and AMD Instinct MI300A APUs | 1 | 257 | May 28, 2024
Activating environment triggers error in CUDA | 5 | 217 | May 28, 2024
Why is this CUDA interpolation kernel scaling poorly? | 1 | 295 | May 8, 2024
Scalar indexing GPU problem in Flux.jl model | 4 | 341 | May 8, 2024