Strange behavior of CuDeviceArrays

fedoroff · June 8, 2021, 12:36pm

This is an example of the use case:

using CUDA

struct Foo{T}
    x :: T
end

function kernel(y, src)
    id = (blockIdx().x - 1) * blockDim().x + threadIdx().x
    stride = blockDim().x * gridDim().x
    for i=id:stride:length(y)
        y[i] = src.x[i]
    end
    return nothing
end

N = 10
y = CUDA.zeros(N)

a = Foo(CUDA.ones(N))
# @cuda threads=N kernel(y, a)
# KernelError: passing and using non-bitstype argument

b = Foo(cudaconvert(CUDA.ones(N)))
@cuda threads=N kernel(y, b)
@show y
# y = Float32[1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0]

With manually converted CuDeviceArrays it is possible to pass custom structures into kernels. Please tell me if there are other methods to do the same.
If later in the code I wil want to access x field of Foo, I will have a read only memory error.

Topic		Replies	Views
Arrays of arrays and arrays of structures in CUDA kernels cause random errors GPU gpu , cuda	21	2967	October 21, 2021
Store CuArrays on a mutable struct? GPU	5	1479	July 2, 2018
Passing a wrapped array to a kernel GPU	2	564	May 27, 2020
Are there any way to copy a Dict or a custom datatype which contains an Array to GPU by CUDA? GPU question	6	1776	November 17, 2020
Allocating different arrays on multiple GPUs GPU	8	1117	September 30, 2021

Strange behavior of CuDeviceArrays

Related topics