yuyichao and Keno’s comments in this thread may help; I found them very interesting and informative.
Zygote is focused on machine learning, where 1 microsecond of overhead per operation is negligible in the face of gigantic array operations. I’ve often found ForwardDiff to be a better choice when doing lots of small operations (especially with StaticArrays, which it plays nicely with). Have you considered it?
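In case it helps, here is a minimal sketch of what that could look like. The objective `f` and the input `x` are made up for illustration and are not taken from your code; the only library calls used are `ForwardDiff.gradient` and the `SVector` constructor from StaticArrays.

```julia
using ForwardDiff
using StaticArrays

# Hypothetical small objective, chosen only for illustration.
# Its gradient is simply x, which makes the result easy to check.
f(x) = sum(abs2, x) / 2

# A small, fixed-size input (assumed; substitute your own data).
x = SVector(1.0, 2.0, 3.0)

# With an SVector input, ForwardDiff returns the gradient as an SVector,
# so no heap-allocated array is needed for the result.
g = ForwardDiff.gradient(f, x)
```

If you want to compare the per-call overhead yourself, timing both packages on your actual function (for example with BenchmarkTools) is probably the most reliable way to decide.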