CUDA race conditions

trasor · January 30, 2024, 4:21pm

Hi together, i try to paralize some of my code with Cuda kernels, but somehow produced race conditions:

The kernel looks somewhat like:

function updateB!(inputs)
x = threadIdx().x + (blockIdx().x-1) * blockDim().x
if x >= 2 && x <= nx-2
dA[ x ] = (A[ x+1 ] - A[ x ])
B[ x ] += scalar * dA[ x ]
end
return nothing
end

it has probably something to do that i update B[ x ] at the end, and each thread try to write something into that array. However, i was not able to find a solution for this

Topic		Replies	Views
Cuda kernel help Performance	2	48	March 29, 2025
Kernel fails when number of blocks exceeds number of SM's (?) GPU	3	398	September 26, 2022
GPU Synchronization Issue - using KernelAbstraction GPU question	5	420	December 13, 2023
Simple kernel not working GPU	10	1183	July 12, 2020
CUDAnative: Using second and third dims in the kernel GPU cudanative	2	874	January 31, 2017

CUDA race conditions

Related topics