Hessian Vector Products on GPU using ForwardDiff, Zygote, and Flux

I haven’t dug into the details of your case, but it certainly sounds like that’s the right track. I’m not sure it’s using the fused sigmoid rule, but it could be, in which case that would need an frule.