Usage of CUDA.Const

Marco_Lombardi · November 1, 2024, 10:03pm

I am unsure how to use the CUDA.Const function to mark an array as constant during the execution of a CUDA kernel. From the few examples I saw, it looks like I just need to define a variable within the kernel that “shadow” the original array, as in cxs = CUDA.Const(xs), and then I can directly use the new variable cxs. I tried that and it works, but surprisingly I do not see any benefit from its usage, although I would expect a significant one (the same kernel, rewritten with shared memory for xs, is almost twice as fast).

Am I missing something or is this expected?

maleadt · November 4, 2024, 4:34pm

With Const we emit loads that use a different GPU cache level (read-only texture cache instead of L1/L2), which used to be much faster. Modern hardware has much faster L1/L2 caches, resulting in ldg being much less relevant than it used to be. So depending on your hardware, this is expected.

Topic		Replies	Views
Constant Memory? GPU	11	2591	July 18, 2018
Is it possible to use CuStaticSharedArray(T, n) with n const? GPU cuda , sharedarrays	2	65	February 11, 2025
Improving cache behavior with ldg() GPU cudanative	6	2666	February 6, 2020
I don't understand why it is slower with CuStaticSharedArray New to Julia gpu , cuda , sharedarrays , cudajl	9	281	March 17, 2025
Correct usage of shared memory？ GPU	5	848	January 20, 2024

Usage of CUDA.Const

Related topics