It would be nice to have “thread local storage” implemented at some point. It looks like it might be necessary for recursive kernel calls. I’m not really sure as I’m still rather new to CUDA programming.

Thread local storage is not implemented

maleadt February 22, 2021, 12:53pm 5

StaticArrays are implemented as structs containing tuples, so it’ll be using registers and not PTX’s local memory. You can inspect generated code using @device_code_ptx.

2 Likes

Array of structures vs. Structures of arrays

Topic		Replies	Views
Local thread memory in GPU using StaticArrays GPU question , gpu , cuda	4	6250	January 26, 2020
Use of static array in a kernel function GPU	1	1547	January 4, 2021
CUDAnative dynamic allocation GPU question , cudanative	5	1810	March 4, 2020
CUDA.jl - Sub-Vector Indexing Problem Inside CUDA Kernel GPU cuda , error , cuarrays , error-message , staticarrays	2	1241	March 28, 2022
CUDA.@cuStaticSharedMem returning a Structure of Arrays GPU	0	329	July 26, 2021

Thread local storage is not implemented

Related topics