Custom Flux layer looking weird upon profiling
|
|
1
|
255
|
May 3, 2023
|
Render Pipeline in Metal.jl
|
|
9
|
948
|
April 30, 2023
|
multiple-GPUs per process
|
|
3
|
336
|
April 27, 2023
|
KernelAbstractions.get_backend keyword arguments
|
|
1
|
219
|
April 26, 2023
|
Question about coalesced read and write to the global memory using CUDA.jl 2D grid
|
|
1
|
777
|
April 20, 2023
|
Efficient CuArray shift/rotation
|
|
2
|
1203
|
April 20, 2023
|
GPU performance issues with an ML-from-scratch tutorial
|
|
7
|
456
|
April 17, 2023
|
Type instability with CuVector inside struct
|
|
2
|
209
|
April 14, 2023
|
CUTENSOR not available
|
|
7
|
830
|
April 13, 2023
|
Questions about using CUDA.jl for GPU concurrent programming: Computational results cannot be obtained when overlapping GPU and CPU operations
|
|
2
|
426
|
April 12, 2023
|
Indexing adjoints of CuArrays
|
|
4
|
294
|
April 10, 2023
|
CUDA.jl crashes if a 4d FFT is asked
|
|
2
|
535
|
April 7, 2023
|
UndefVarError: libcuda_original_version not defined
|
|
1
|
297
|
April 4, 2023
|
Complete and incomplete sparse cholesky factorization
|
|
6
|
487
|
April 4, 2023
|
Indexing in GPU kernel
|
|
2
|
443
|
March 31, 2023
|
Apple M1 GPU from Julia?
|
|
20
|
5836
|
March 31, 2023
|
Sm90 (H100) support for cuda.jl
|
|
3
|
516
|
March 30, 2023
|
Dealing with views and cuda array wrappers
|
|
2
|
287
|
March 29, 2023
|
GPU sum closure throwing an error
|
|
3
|
460
|
March 28, 2023
|
(relative) newbie, looking for suggestions
|
|
3
|
239
|
March 28, 2023
|
MethodError: no method matching gemm!, It looks like |>gpu cannot manage arrays resulting from view and reshape
|
|
0
|
333
|
March 26, 2023
|
LoadError: CUDA runtime not found
|
|
11
|
2467
|
March 25, 2023
|
CUDA Format of __nvvm__reflect function not recognized
|
|
12
|
651
|
March 23, 2023
|
Search in CUDA vector
|
|
5
|
460
|
March 23, 2023
|
LoadError: Could not find any suitable device for this configuration
|
|
2
|
335
|
March 20, 2023
|
MPI RMA with CUDA
|
|
2
|
311
|
March 17, 2023
|
AMDGPU.jl status
|
|
11
|
1805
|
March 16, 2023
|
Is accumulate() with '+' producing the wrong results?
|
|
1
|
295
|
March 13, 2023
|
Advice for optimising code for GPU
|
|
3
|
357
|
March 13, 2023
|
How do I avoid atomics in this kind of code on GPU?
|
|
2
|
225
|
March 11, 2023
|