Two CuDeviceArrays inside one kernel

Shirakumo · July 29, 2022, 1:03pm

Hello everyone,

I was wondering if I can achieve something like this:

function test(output, output2, input)

    # Set up shared memory cache for this current block.
    cache1 = @cuDynamicSharedMem(Int64, (10,10,3))
    cache2 = @cuDynamicSharedMem(Int64, (5,5))

end

I have two small arrays which I compute on the device so I want them to be there for faster access. However, I’m not sure if I can use shared memory for this because in my example cache2 overwrites cache1. Is there any way to have two separate arrays which are shared among one thread block? I tried to read about CuDeviceArrays but can’t find any example how to use them. I’ll really appreciate your help.

Greetings

Topic		Replies	Views
Working with shared memory as one or more variables, what is a good approach? GPU	2	321	January 30, 2023
Trying to understand the use of shared memory on GPUs GPU	3	2220	May 25, 2021
Allocating different arrays on multiple GPUs GPU	8	1036	September 30, 2021
Is it possible to use CuStaticSharedArray(T, n) with n const? GPU cuda , sharedarrays	2	65	February 11, 2025
ANN: CUDAnative 3.0 and CuArrays 2.0 Package Announcements	3	851	March 29, 2020

Two CuDeviceArrays inside one kernel

Related topics