Unfortunately no: I have access to another Cray machine, but unfortunately was never able to get MPI.jl to work at all on it (in that case it segfaulted when dlopen-ing the MPI library).
The only thing I can think of is to check that Julia and MPICH are using the same CUDA version?