Optim.jl with autodiff and complex numbers

Dandan · August 18, 2019, 6:26pm

Hello!

I would like to use Optim.jl with a target function defined on a complex Stiefel manifold (unitary matrices) of some large size.
The Stiefel manifold is already implemented in Optim.jl. The problem is that I can’t use autodiff because of the complex numbers, and the optimization performs badly without the gradient.
On the other hand, unitary matrices can be represented as arrays of real numbers (of double the size), so autodiff will work. But in that case I lose the unitary constraints (so I’ll have to reimplement them, and I’m not sure how).

Is there any simple workaround for this?

simeonschaub · August 18, 2019, 6:41pm

Yes, unfortunately there’s currently no good way to do complex AD in Julia. Hopefully a solution based on ChainRules.jl will be ready soon. Out of interest: You’re probably differentiating a complex norm, so are non-holomorphic functions a concern, or is this not a problem in your case?

Dandan · August 18, 2019, 6:58pm

Thanks for the reply.
The target function is real, so it’s not holomorphic for sure. But I don’t think this is a problem.

antoine-levitt · August 18, 2019, 7:19pm

Cool! I’m the one that added complex and Stiefel for Optim, so glad it’s useful! (Out of curiosity, what’s your application?) Definitely use complex Optim, but hand-code your gradient. If you want to use autodiff, you’ll want reverse-mode diff (forward mode is about as expensive as finite differences). E.g. Zygote supports complex->real gradients (see https://github.com/FluxML/Zygote.jl/issues/29), but there are currently no reverse-diff packages that are mature enough if your objective function is even moderately complicated.
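A minimal sketch of what a hand-coded gradient could look like, using only the standard library. The objective `f(X) = real(tr(X' * A * X))` with a Hermitian `A` is a made-up example, not the objective from this thread, and the names `f`, `g!`, `A` and `fd_gradient` are hypothetical. It assumes the convention that the gradient of a complex-to-real function is the derivative with respect to the real part plus `im` times the derivative with respect to the imaginary part, which gives `2 * A * X` here. The finite-difference check is only there to catch sign or factor-of-two mistakes.

```julia
using LinearAlgebra

n, k = 6, 2                        # illustrative sizes only
B = randn(ComplexF64, n, n)
A = (B + B') / 2                   # Hermitian, so the objective below is real

# Hypothetical objective C^(n×k) -> R
f(X) = real(tr(X' * A * X))

# Hand-coded gradient, written in place in the g!(G, X) style
function g!(G, X)
    G .= 2 .* (A * X)
    return G
end

# Central finite differences along the real and imaginary directions
# of each entry; used only to validate g!
function fd_gradient(f, X; h = 1e-6)
    G = zeros(ComplexF64, size(X))
    for i in eachindex(X)
        E = zeros(ComplexF64, size(X))
        E[i] = h
        d_re = (f(X + E) - f(X - E)) / (2h)
        d_im = (f(X + im * E) - f(X - im * E)) / (2h)
        G[i] = d_re + im * d_im
    end
    return G
end

# Starting point with orthonormal columns (a point on the Stiefel manifold)
X = Matrix(qr(randn(ComplexF64, n, k)).Q)[:, 1:k]
G = g!(similar(X), X)
err = norm(G - fd_gradient(f, X))  # should be tiny if g! is right
```

Assuming the interface shown in the Optim.jl manifold documentation, such a pair would then be passed along the lines of `optimize(f, g!, X, ConjugateGradient(manifold = Optim.Stiefel()))`.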

simeonschaub · August 18, 2019, 7:19pm

By real function, do you mean R->R/R->C or do you mean C->R?

antoine-levitt · August 18, 2019, 7:20pm

I think he means C^n → R, with gradients as in Optim.jl
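For reference, the gradient convention for a function $f : \mathbb{C}^n \to \mathbb{R}$, as the Optim.jl documentation describes it (the choice of norm in the remainder term is an assumption), can be written as:

```latex
f(x + h) = f(x) + \operatorname{Re}\left(g^{*} h\right) + O\left(\lVert h \rVert^{2}\right),
\qquad
g = \frac{\partial f}{\partial \operatorname{Re} x} + i\,\frac{\partial f}{\partial \operatorname{Im} x}.
```

Here $g^{*}$ is the conjugate transpose of $g$, so $g$ is the ordinary real gradient with the real and imaginary components packed into one complex vector.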

Dandan · August 18, 2019, 7:44pm

Yeah, it works in low dimensions, which is cool. In large dimensions the running time is just too long (without the gradient). I use it for numerical experiments related to Zauner’s conjecture in quantum science.

Thanks for the tips, I now realize forward diff won’t help anyway.
Are there any other automatic ways to compute the gradient of Julia code? Or is doing it by hand the best option right now?

Yes, the function is just C^n → R.

antoine-levitt · August 18, 2019, 7:53pm

Cool!

Hand-coded gradients are always best if you can manage. Otherwise you can always try your hand at reverse diff. There are a few competing packages (Zygote, Tracker, Nabla and Yota are the names that seem to recur). You can also use a hybrid approach: either use autodiff for specific parts of the code (e.g. if you have nasty scalar functions you can just take the forward diff) but drive the computation yourself, or let autodiff take the driver’s seat and provide adjoints for the subfunctions that it can’t AD (trickier, should hopefully be helped by ChainRules & friends at some point).
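A rough sketch of the first hybrid variant, under stated assumptions: the scalar function `φ` and the objective `f` are made up for illustration, and the tiny `Dual` type is a hand-rolled stand-in for what a forward-mode package such as ForwardDiff.jl does far more generally (it only supports the three operations that `φ` needs). Forward mode supplies the scalar derivative, while the outer chain rule for the complex argument is written by hand.

```julia
# Minimal dual number: a value together with its derivative
struct Dual
    val::Float64
    der::Float64
end
Base.:+(a::Real, b::Dual) = Dual(a + b.val, b.der)
Base.:*(a::Dual, b::Dual) = Dual(a.val * b.val, a.der * b.val + a.val * b.der)
Base.log(a::Dual) = Dual(log(a.val), a.der / a.val)

# Forward-mode derivative of a scalar function φ at t
derivative(φ, t::Real) = φ(Dual(t, 1.0)).der

# Hypothetical "nasty" scalar function, written generically so that it
# accepts both Float64 and Dual arguments
φ(t) = log(1.0 + t * t)

# Objective C^n -> R that applies φ to each squared modulus
f(z) = sum(φ(abs2(zk)) for zk in z)

# Hand-driven outer gradient: by the chain rule each component is
# 2 * φ'(|z_k|^2) * z_k, with φ' supplied by forward mode
function g!(G, z)
    for k in eachindex(z)
        G[k] = 2 * derivative(φ, abs2(z[k])) * z[k]
    end
    return G
end

z = ComplexF64[1 + 2im, -0.5im, 0.3]
G = g!(similar(z), z)
```

Forward mode is cheap in this role because each call differentiates a function of a single real variable, so the cost does not grow with the dimension of `z`.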

Dandan · August 18, 2019, 8:03pm

Thanks!
