Do a function like relu need a kernel ? When you need to write a GPU kernel rather than "just" using CuArray?
|
|
3
|
162
|
June 26, 2024
|
CUDA: unsupported dynamic function invocation for closure
|
|
3
|
206
|
June 23, 2024
|
Running CUDA.jl with JULIA_DEPOT_PATH read-only
|
|
8
|
153
|
June 21, 2024
|
Enzyme Cuda dynamic memory
|
|
12
|
402
|
June 17, 2024
|
Passing a structure with constructor on GPU
|
|
2
|
124
|
June 17, 2024
|
CUB wrapper ccall overhead on Windows
|
|
10
|
135
|
June 15, 2024
|
Calling repeat with a CUDA array changes the state of the random number generator
|
|
2
|
86
|
June 14, 2024
|
Difference between GPUArrays.jl and KernelAbstractations.jl
|
|
4
|
265
|
June 12, 2024
|
Scalar Indexing error when performing mul!(A, B, C) with A a view of a Matrix
|
|
3
|
149
|
June 11, 2024
|
Slicing a CuArray in a kernel
|
|
1
|
121
|
June 6, 2024
|
Simple CUDA kernel on matrix slower than running GPU
|
|
8
|
514
|
June 3, 2024
|
Fresh CUDA and LuxCUDA error ERROR: could not load symbol "cublasLtMatmulDescCreate":
|
|
14
|
346
|
May 31, 2024
|
AMDGPU.jl and AMD Instinct MI300A APUs
|
|
1
|
227
|
May 28, 2024
|
Activating environment triggers error in CUDA
|
|
5
|
212
|
May 28, 2024
|
Why is this CUDA interpolation kernel scaling poorly?
|
|
1
|
282
|
May 8, 2024
|
Scalar indexing GPU problem in Flux.jl model
|
|
4
|
269
|
May 8, 2024
|
Unable to use local CUDA runtime toolkit
|
|
2
|
235
|
May 8, 2024
|
Adding at specific CuArray position
|
|
6
|
178
|
May 6, 2024
|
How to sort a GPU StructArray vector by a subfield and update location of other elements in place?
|
|
0
|
121
|
April 29, 2024
|
Can we accelerate functions from other packages using juliaGPU package?
|
|
2
|
205
|
April 29, 2024
|
Help using CUDA to exponentiate Matrix
|
|
3
|
303
|
April 27, 2024
|
Any qualifier in CUDA.jl like `__device__` in CUDA/C++?
|
|
6
|
265
|
April 26, 2024
|
Multiple GPUs with CuStateVec in CUDA.jl
|
|
1
|
119
|
April 25, 2024
|
Edited video of the course “GPU Programming with Julia - 2nd edition” at the Swiss National Supercomputing Centre (CSCS) now available!
|
|
2
|
252
|
April 23, 2024
|
Using CUDA hangs with P100 GPU
|
|
2
|
288
|
April 23, 2024
|
Adjoint/transpose wrapper question
|
|
18
|
456
|
April 21, 2024
|
Why is it consuming and not freeing GPU memory?
|
|
5
|
363
|
April 18, 2024
|
Batched LU solves (or Factorizations) with Sparse Matrices
|
|
6
|
540
|
April 15, 2024
|
Updating CUDA seems to have completely broken the package
|
|
3
|
411
|
April 14, 2024
|
Newbie question regarding GPU with metal.jl
|
|
0
|
163
|
April 11, 2024
|