Best practices to reduce startup time for CUDA.jl? | 2 | 128 | December 25, 2022
GPU kernel? | 9 | 208 | December 22, 2022
Question about garbage collection, AD, and CUDA | 1 | 112 | December 19, 2022
Prioritising GPU Primitives from Vendor-Specialised Libraries | 1 | 108 | December 15, 2022
Easiest way to get CUDA.jl up and running | 1 | 163 | December 9, 2022
InvalidIRError with CuArray broadcast when complex values are involved in Bessel functions | 2 | 93 | December 8, 2022
CUDA adapter for FFTW plan | 0 | 110 | December 7, 2022
CuPy CuFFT ~2x faster than CUDA.jl CuFFT | 10 | 867 | November 30, 2022
CUDA How to limit the gpu memory available? | 7 | 269 | November 25, 2022
Correct implementation of CuArray's slicing operations | 2 | 133 | November 23, 2022
CUDA.jl - A Clear Example of Dynamic Parallelism | 6 | 860 | November 18, 2022
Continuous callback does not work correctly with Ensemble GPUArray() | 0 | 77 | November 14, 2022
Fast tile search | 6 | 230 | November 11, 2022
Using GPU with NeuralPDE on M1 Mac | 2 | 147 | November 11, 2022
Launch_configuration() equivalent for AMDGPU.jl | 4 | 218 | November 10, 2022
KernelAbstractions is slower than CUDA | 8 | 591 | November 10, 2022
Problem with running Julia 1.6.7 with CUDA 11.7 | 8 | 267 | October 14, 2022
Cub library functions in cuda.jl | 5 | 182 | October 11, 2022
Comprehensive list of functions (maths, etc) available in CUDA.jl | 3 | 241 | October 10, 2022
Rotr90 of a CUDA.CuArray | 5 | 150 | October 6, 2022
Neural Nets training with multiple Chains Lux.jl and CUDA.jl | 5 | 251 | October 5, 2022
Create static vector of variable lenght in gpu kernel | 2 | 142 | September 27, 2022
Calling Flux.gpu downloads CUDNN artifact every first call in REPL/script | 6 | 150 | September 27, 2022
Demo: Example of CUDA/OpenGL interop in Julia | 0 | 307 | September 26, 2022
Kernel fails when number of blocks exceeds number of SM's (?) | 3 | 133 | September 26, 2022
Vulkan.jl crashes with ERROR_INCOMPATIBLE_DRIVER | 0 | 130 | September 24, 2022
Dreaded CuArray only supports element types that are stored inline | 10 | 420 | September 22, 2022
Failed to compile PTX code when using NSight on Win11 | 1 | 122 | September 22, 2022
CSR matrices in CUDA.jl | 2 | 193 | September 21, 2022
Custom CUDA kernel for nontrivial sum calculations | 2 | 199 | September 20, 2022