Writing fast stencil computation kernels that work on both CPUs and GPUs
|
|
3
|
2155
|
January 29, 2019
|
`sync_threads()` doesn't seem to work within a loop
|
|
8
|
1378
|
January 27, 2019
|
MNIST GPU CuArrays error
|
|
23
|
3039
|
January 22, 2019
|
Casting, annotations and numeric types for CUDAnative
|
|
5
|
1432
|
January 21, 2019
|
Error of view on CuArrays with discrete indices
|
|
0
|
771
|
January 17, 2019
|
CuArray dot fuction error
|
|
2
|
610
|
January 16, 2019
|
Efficiency when handling jobs larger than VRAM
|
|
3
|
1057
|
January 15, 2019
|
Help wanted: CUDA error: an illegal memory access was encountered
|
|
1
|
4430
|
January 11, 2019
|
Is it possible to index a CuArray with a CuArray?
|
|
1
|
829
|
January 11, 2019
|
Presentation on effective use of CUDAnative/CuArrays
|
|
7
|
2367
|
January 3, 2019
|
GitLab CI for Julia GPU packages
|
|
9
|
2297
|
December 21, 2018
|
GPU kernel optimization (GPU vs CPU)
|
|
3
|
1490
|
December 14, 2018
|
Timing square function in CUDA
|
|
4
|
1634
|
December 11, 2018
|
CUDAnative is awesome!
|
|
12
|
5894
|
December 3, 2018
|
Multiprecision arithmetic in CUDA
|
|
1
|
1493
|
November 28, 2018
|
How much faster is GPU compare to CPU
|
|
16
|
25483
|
November 24, 2018
|
Strange behavior of `mapreduce`
|
|
2
|
688
|
November 16, 2018
|
CUDAnative support for Float16
|
|
5
|
1338
|
November 15, 2018
|
How to time CUDA Event
|
|
3
|
1019
|
November 15, 2018
|
cuArrays vs CUDANative
|
|
3
|
1346
|
November 14, 2018
|
Freeing memory in the GPU with CUDAdrv / CUDAnative / CuArrays
|
|
8
|
3026
|
November 13, 2018
|
How to set the diagonal part of a GPU Array
|
|
6
|
1327
|
November 12, 2018
|
Performance of view with cuArrays
|
|
11
|
2650
|
November 11, 2018
|
Is is possible to merge multiple kernels in CUDAnative to minimize launch overhead and execution overhead?
|
|
12
|
1585
|
November 11, 2018
|
Is there any plan for GPU linear algebra?
|
|
3
|
1943
|
October 25, 2018
|
Support for Complex-valued CuArray
|
|
2
|
713
|
October 12, 2018
|
Can not use CuArray on Julia 0.7
|
|
3
|
775
|
October 10, 2018
|
Support for Sparse Matrices on GPU (CUSPARSE)
|
|
1
|
1110
|
September 22, 2018
|
Optimizing the use of Blocks, Threads vs. Array Indexing
|
|
15
|
3196
|
September 21, 2018
|
Package use, CUDA stream support, etc
|
|
5
|
1442
|
September 13, 2018
|