Gradient on GPU 70X slower than on CPU

I don’t know much about @functor, but I am defining my model as

struct MyModel
    ...
end

Flux.@functor MyModel

and creating a GPU-compatible model by

m = MyModel() |> gpu

Is there anything else I need to do?