Efficient CuArray shift/rotation

stevengj · September 6, 2022, 4:59pm

The most common way to do this in parallel computing is not by shifting the array, but by using “ghost cells”: make redundant copies of boundary data in a communication step, after which the operations on each “chunk” of the array can run in parallel. See e.g. Principles of Parallel Programming by Lin and Snyder, quoted in the thread “Seemingly unnecessary allocation within for loop” (post #9 by ptoche), and the PartitionedArrays.jl package (fverdugo/PartitionedArrays.jl on GitHub: vectors and sparse matrices partitioned into pieces for parallel distributed-memory computations), though this is not GPU-based.
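To make the idea concrete, here is a minimal CPU-only sketch of ghost cells for a periodic 3-point sum on a 1D array. The function name `ghost_cell_stencil`, the chunk layout, and the choice of stencil are all assumptions made for illustration; they are not taken from the book or from PartitionedArrays.jl, and a GPU or distributed version would replace the plain loops with kernels or message passing.

```julia
# Periodic 3-point sum computed chunk by chunk using ghost cells.
# `ghost_cell_stencil` is a hypothetical helper written for this example.
function ghost_cell_stencil(a::AbstractVector, nchunks::Integer)
    n = length(a)
    n % nchunks == 0 || throw(ArgumentError("length(a) must be divisible by nchunks"))
    m = n ÷ nchunks  # interior cells per chunk

    # Each chunk holds m interior cells plus one ghost cell on each side.
    chunks = [zeros(eltype(a), m + 2) for _ in 1:nchunks]
    for c in 1:nchunks
        chunks[c][2:m+1] .= @view a[(c-1)*m+1 : c*m]
    end

    # Communication step: copy boundary values into the neighbours' ghost cells.
    for c in 1:nchunks
        left  = mod1(c - 1, nchunks)  # periodic wrap-around
        right = mod1(c + 1, nchunks)
        chunks[c][1]   = chunks[left][m+1]  # last interior cell of the left neighbour
        chunks[c][m+2] = chunks[right][2]   # first interior cell of the right neighbour
    end

    # Compute step: every chunk now reads only its own data,
    # so this outer loop could run in parallel.
    out = similar(a)
    for c in 1:nchunks
        g = chunks[c]
        for i in 1:m
            out[(c-1)*m + i] = g[i] + g[i+1] + g[i+2]
        end
    end
    return out
end

a = collect(1.0:12.0)
result = ghost_cell_stencil(a, 3)

# Same computation expressed with whole-array shifts, for comparison.
reference = circshift(a, 1) .+ a .+ circshift(a, -1)
```

Only the two boundary values per chunk are copied in the communication step, whereas the `circshift` version allocates and moves two full copies of the array.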
