Second order gradient with Lux, Zygote, CUDA, Enzyme

Lux swaps in ForwardDiff for nested AD with Zygote (see "Nested AD with Lux etc"), and that will fail here unless the function being differentiated has a `ForwardDiff.Dual` overload.
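To make "a `ForwardDiff.Dual` overload" concrete, here is a minimal sketch of the general pattern. It is not the code from this thread: `my_op` is a hypothetical stand-in for whatever function only accepts plain floats, and its derivative `3v^2` is written by hand purely for illustration.

```julia
using ForwardDiff
using ForwardDiff: Dual, value, partials

# Hypothetical stand-in for an op that only accepts plain floats
# (e.g. a wrapper around a kernel). Without the method below, calling it
# on a Dual throws a MethodError, because Dual <: Real but is not an AbstractFloat.
my_op(x::AbstractFloat) = x^3

# Dual overload: evaluate the primal on the underlying value and
# propagate the partials with the analytic derivative d/dx x^3 = 3x^2.
function my_op(x::Dual{T}) where {T}
    v = value(x)
    return Dual{T}(my_op(v), 3v^2 * partials(x))
end

d = ForwardDiff.derivative(my_op, 2.0)
```

With the overload in place ForwardDiff can push dual numbers through `my_op`; the real function in your case would need the same treatment, with its own derivative rule.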