Question about coalesced read and write to the global memory using CUDA.jl 2D grid

The general rule of thumb is that consecutive threads should access consecutive memory locations; see CuArray is Row Major or Column Major? - #2 by maleadt for a similar question.

1 Like