Hi all,
Is it correct that scalar indexing such as Out[1] on a GPU array is not allowed when using KernelAbstractions.jl alone, and that GPUArrays.@allowscalar is needed for it? Should GPUArrays always be loaded together with KernelAbstractions in such cases? Here is a minimal example:
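using KernelAbstractions
using CUDA
using GPUArrays

backend = CUDABackend()
# Allocate a 1-element Float64 array on the GPU
Out = KernelAbstractions.zeros(backend, Float64, 1)
# Explicitly permit scalar indexing for this single read
GPUArrays.@allowscalar Out[1]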