Custom CUDA kernel for nontrivial sum calculations
|
|
2
|
831
|
September 20, 2022
|
GPU-Kernel Profiling
|
|
1
|
421
|
September 20, 2022
|
Broadcast question
|
|
2
|
415
|
September 19, 2022
|
Mapslices very slow
|
|
3
|
808
|
September 13, 2022
|
BitVector Adapt GPUs
|
|
25
|
861
|
September 11, 2022
|
Usage of CUDA.CUFFT.cufftPlanMany
|
|
1
|
821
|
August 30, 2022
|
Converting an image rotation demo from JuliaCon 2021 (AMD -> NVIDIA)
|
|
3
|
398
|
August 29, 2022
|
Error using CuIterator with Flux.train!
|
|
2
|
450
|
August 29, 2022
|
Custom backpropagation rule on GPU
|
|
2
|
414
|
August 29, 2022
|
Simple CuArray conversion, reverse, and transpose taking too long?
|
|
3
|
719
|
August 29, 2022
|
Enzyme.jl plus parallelstencil.jl?
|
|
5
|
596
|
August 22, 2022
|
Best way to use CuSparseMatrixBSR
|
|
7
|
1037
|
August 21, 2022
|
AMDGPU.jl has made such amazing progress over the last year!
|
|
16
|
3427
|
August 18, 2022
|
How to run many copies of a random function in parallel on GPU?
|
|
5
|
768
|
August 11, 2022
|
Help setting up CUDA.jl and cuTENSORS
|
|
1
|
853
|
August 9, 2022
|
Significant CUDA.jl memory allocations outside of main pool?
|
|
2
|
1414
|
August 6, 2022
|
Strange issues with stride loops as in CUDA.Random
|
|
1
|
315
|
August 3, 2022
|
Broadcasting an expression-evaluator function
|
|
2
|
502
|
August 2, 2022
|
Custom random sampling kernels
|
|
16
|
1665
|
July 26, 2022
|
How to generate a reusable static LLVM IR module?
|
|
4
|
708
|
July 25, 2022
|
LinearAlgebra./ breaks CuArray
|
|
4
|
579
|
July 23, 2022
|
Performance of PencilFFTs with CuArray
|
|
1
|
416
|
July 21, 2022
|
Cuda-memcheck reports over 1300 errors with 4 lines of julia code with CUDA.jl
|
|
2
|
710
|
July 20, 2022
|
CUDA.jl mystery : VSCode + Julia extension works fine but commandline run fails
|
|
5
|
382
|
July 21, 2022
|
Inplace array modification performances
|
|
4
|
628
|
July 20, 2022
|
CUDA.jl unsupported call to an unknown function, unsupported dynamic function invocation
|
|
3
|
1257
|
July 11, 2022
|
GPU sparse matrix-vector product with DoubleFloats.jl
|
|
3
|
479
|
July 11, 2022
|
Test of AMDGPU fails on 5900HX - hipErrorNoBinaryForGpu
|
|
2
|
1036
|
July 9, 2022
|
Metal.jl : Seamless GPU acceleration for Julia based physics using unified processing units?
|
|
1
|
1267
|
July 1, 2022
|
CUDA.jl - Better GPU but Worse Performance
|
|
10
|
1668
|
June 29, 2022
|