How can I display a CuArray without copying it from the GPU to the CPU?

I want to display a CuArray (actually an image) to see how it changes during the computation, but I cannot figure out how to do it.