@cuda threads and blocks confusion

maleadt · February 8, 2021, 2:53pm

The API has recently been simplified: https://github.com/JuliaGPU/CUDA.jl/blob/4eb99b9f53acfc02a01f92d4a0a2b219bf8994cc/src/indexing.jl#L32-L36

But yes, it’s best to use the occupancy API. You need to extend this yourself to multiple dimensions, the occupancy API only works with 1D threads/blocks. Alternatively, just convert the linear thread index to an appropriate 2D one in your kernel.

Topic		Replies	Views
CUDA: blockdimensions and launch_configuration New to Julia question	0	177	April 17, 2024
How do I make sure that GPU functions use the maximum potential config for performance? GPU	3	318	January 16, 2023
Error when implementing multidimensional kernel GPU	6	635	November 27, 2023
The most general way to estimate the optimal arguments for @cuda macro Performance gpu , cudanative	6	1776	April 6, 2021
Synchronizing Cuda kernels GPU	5	2451	September 20, 2019

@cuda threads and blocks confusion

Related topics