Unexpected behavior of CUDA kernel

Hello all. I hit an unexpected error while writing a CUDA kernel.
Somehow the variable b seems to cause a conflict:
when I remove b = sum(A), the function works.
Is this expected behavior or a bug?

using CUDA

function main()
    function kernel(A, B)
        i = threadIdx().x
        b = B[i]
        A[i] = b
        return nothing
    end

    A = CUDA.zeros(1)
    B = CUDA.zeros(1)
    CUDA.@cuda kernel(A, B)

    b = sum(A)
end
main()
ERROR: GPU compilation of MethodInstance for (::var"#kernel#11")(::CuDeviceArray{Float32, 3, 1}, ::CuDeviceArray{Float32, 3, 1}) failed
KernelError: passing and using non-bitstype argument

Argument 1 to your kernel function is of type var"#kernel#11", which is not isbits:
  .b is of type Core.Box which is not isbits.
    .contents is of type Any which is not isbits.

Adding b = sum(A) (or any other assignment of the form b = ...) turns the b inside the kernel into a closed-over variable, which gets boxed as a Core.Box. That is what the CUDA compiler rejects, because the kernel closure is no longer isbits. If you rename the outer variable, e.g. c = sum(A), it should work.
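A minimal sketch of a fixed version (my rewrite, not from the original post; only the host-side variable is renamed, everything else is unchanged):

using CUDA

function main()
    function kernel(A, B)
        i = threadIdx().x
        b = B[i]       # this b is purely local to the kernel
        A[i] = b
        return nothing
    end

    A = CUDA.zeros(1)
    B = CUDA.zeros(1)
    CUDA.@cuda kernel(A, B)

    c = sum(A)         # different name, so kernel no longer closes over a boxed b
    return c
end

main()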

This REPL example illustrates the capture behavior a bit:

julia> function f()
         function g()
           return b
         end
         b = 1
         return g
       end
f (generic function with 1 method)

julia> gg = f(); gg()
1
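
You can also see the boxing directly (my own REPL sketch, not part of the original reply): the captured b becomes a Core.Box field on the closure, which is exactly why the CUDA compiler reports the kernel argument as non-isbits.

julia> fieldtypes(typeof(gg))
(Core.Box,)

julia> isbitstype(typeof(gg))
false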

Thanks. I did not know Julia could capture a variable that is assigned after the function definition.
