Optimizing the use of Blocks, Threads vs. Array Indexing | 15 | 3306 | September 21, 2018
Package use, CUDA stream support, etc | 5 | 1489 | September 13, 2018
Flux four errors in Julia v0.7 none in v0.6.4 | 1 | 710 | September 13, 2018
Computing eigenvalues/eigenvectors using GPU? | 9 | 5109 | September 11, 2018
CUDAnative question: "recursion not currently supported error" when running reduce.jl example | 1 | 867 | September 11, 2018
CuArray and Optim | 9 | 3411 | September 4, 2018
GPUArrays, 64-32bit conversions, and Cassete.jl | 8 | 816 | August 30, 2018
LLVM crash when running Flux and CuArray examples in julia 0.7 | 13 | 1673 | August 21, 2018
Flux: GPU slower than CPU? | 7 | 2224 | August 10, 2018
CLBlast, a tuned OpenCL BLAS library | 6 | 1543 | August 9, 2018
CUDAdrv cannot find __host__ __device__ functions | 5 | 1735 | July 29, 2018
What is the recommended type <: Integer to use when doing index arithmetics? | 3 | 1494 | July 24, 2018
Packing structs for OpenCL | 1 | 2169 | July 23, 2018
Sequence of warp and how to avoid divergence when folding shared memory in a reduction kernel | 3 | 1218 | July 19, 2018
Constant Memory? | 11 | 2686 | July 18, 2018
Generic Kernels for CLArrays | 1 | 651 | July 16, 2018
Load JULIA via "julia -p 8" and failed to load CUDAnative library | 3 | 873 | July 13, 2018
Calling CUBLAS GEMM in Julia 0.6 | 4 | 1469 | July 6, 2018
Optimizing column reduce with CUDAnative | 5 | 1950 | July 6, 2018
What is the optimal way of updating CuArray? | 7 | 1542 | July 5, 2018
What is the maximal number of arguments a CUDAnative kernel can take? argc = 16 yields "Error: invalid kernel call; too many arguments" | 6 | 1965 | July 4, 2018
Store CuArrays on a mutable struct? | 5 | 1462 | July 2, 2018
Can I change the nvcc location in CUDAnative? | 5 | 1594 | June 29, 2018
Mapping ThreadIdx().x to a 5D array? | 8 | 1241 | June 15, 2018
Strange behaviour of @cuprintf? | 3 | 837 | June 13, 2018
Problem with CUDAintrinsic pow: pow(y[1,1],2.0)? | 2 | 609 | June 13, 2018
Best way to call an OpenCL kernel with arguments of type CLArray | 6 | 2094 | June 4, 2018
LLVM LoadError: Permission Denied (EACCES) | 4 | 1400 | May 18, 2018
Initializing @cuStaticSharedMem array? | 3 | 1369 | May 12, 2018
Stack overflow on cuda | 9 | 2623 | May 9, 2018