CUDA.@atomic only works in kernel context.
This usage looks OK, so try in a regular kernel where you can use @device_code_warntype to look at what’s wrong instead of our broadcast implementation here just bailing out.
CUDA.@atomic only works in kernel context.
This usage looks OK, so try in a regular kernel where you can use @device_code_warntype to look at what’s wrong instead of our broadcast implementation here just bailing out.