Source code annotation using NVTX in CUDA.jl

NVTX is a CPU library, you cannot use it in kernels.