You have to use `CUDA.@profile external=true`. The regular `CUDA.@profile` uses the internal profiler, which clashes with Nsight. I hope to auto-detect that in the future, but for now you have to be explicit.
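For illustration, a minimal sketch of what that could look like. The arrays and the matrix multiplication are placeholders, not anything from your code, and this assumes Julia was started under Nsight Systems (e.g. with `nsys launch julia`) on a machine with a working CUDA GPU:

```julia
using CUDA

# Placeholder workload: any GPU code you want to profile goes here.
a = CUDA.rand(Float32, 1024, 1024)
b = CUDA.rand(Float32, 1024, 1024)

# Warm up first so compilation does not end up in the trace.
c = a * b

# external=true disables the internal profiler and only marks the
# region for an external tool such as Nsight to capture.
CUDA.@profile external=true begin
    CUDA.@sync c = a * b
end
```

Without `external=true`, the same block would try to use the internal profiler while Nsight is already attached, which is where the clash comes from.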