@cuDynamicSharedMem : allocating beforehand?

drjoke · January 2, 2018, 5:18am

I am getting: ERROR: LoadError: CUDA error: an illegal memory access was encountered (code #700, ERROR_ILLEGAL_ADDRESS) with the following code:

using CUDAdrv, CUDAnative

function kernel(x)
    i = threadIdx().x
    shared = @cuDynamicSharedMem(Int64,1)
    if i == 1
        shared[1] = 255
    end
    sync_threads()
    x[i] = shared[1]
    return nothing
end

d_x = CuArray{Int64,1}(10)
@cuda (1, 10) kernel(d_x)
x = Array(d_x)
println(x)

The error probably occurs as soon as I try

shared[1] = 255

The source file CUDAnative.jl/src/device/intrinsics/memory_shared.jl says:

Dynamic shared memory also needs to be allocated beforehand, when calling the kernel.

Yet I cannot find an example of how to do this.

drjoke · January 2, 2018, 6:00am

Changing to @cuStaticSharedMem fixed all errors.

maleadt · January 2, 2018, 8:41am

This is by design: dynamic shared memory in CUDAnative.jl works the same way as dynamic shared memory in CUDA, i.e., you need to specify how many bytes to allocate at the launch site: @cuda (blocks, threads[, shmem[, stream]]) kernel(args). If you use static shared memory, you specify the element type and the number of elements inside the kernel, so the amount of memory can be deduced.
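As a minimal sketch, here is the original example with only the launch changed. It assumes the CUDAnative.jl API used in this thread (the positional @cuda (blocks, threads, shmem) form quoted above) and needs a CUDA-capable GPU to run. The names threads and shmem are introduced here for illustration; shmem is the byte count for the single Int64 that the kernel requests.

```julia
using CUDAdrv, CUDAnative

function kernel(x)
    i = threadIdx().x
    # Backed by the dynamic shared memory requested at launch (see shmem below).
    shared = @cuDynamicSharedMem(Int64, 1)
    if i == 1
        shared[1] = 255
    end
    sync_threads()   # wait until thread 1 has written before anyone reads
    x[i] = shared[1]
    return nothing
end

threads = 10
shmem = 1 * sizeof(Int64)   # bytes, not elements: one Int64

d_x = CuArray{Int64,1}(threads)
@cuda (1, threads, shmem) kernel(d_x)   # (blocks, threads, shmem)
x = Array(d_x)
println(x)
```

Without the third launch parameter, presumably no dynamic shared memory is reserved for the block, so the write to shared[1] lands outside any allocation, which would explain the ERROR_ILLEGAL_ADDRESS above. If the kernel needed n elements, the byte count would be n * sizeof(Int64).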
