I have a rather hard nonlinear optimization problem: 12 variables, and I know the value of the function should be zero. How do I find the combination of 12 values that gives a very low residual?
So far I have tried Optim.jl and NLopt.jl. BFGS(linesearch=LineSearches.BackTracking(order=3)) gives the fastest result, but it is not accurate.
NLopt with :LN_BOBYQA works better, but it is very slow, and it also does not always converge.
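For reference, my Optim.jl call looks roughly like this (the objective here is just a placeholder standing in for my real 12-variable residual function):

```julia
using Optim, LineSearches

# Placeholder objective: stands in for the real 12-variable residual norm.
f(x) = sum(abs2, x .- 1.0)

x0 = zeros(12)
res = optimize(f, x0,
               BFGS(linesearch = LineSearches.BackTracking(order = 3)),
               Optim.Options(g_tol = 1e-12))
```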
So my preferred optimizer would have the following properties:
- do global optimization if it is stuck in a local minimum, but only then
- use gradient-based search otherwise
- use multiple cores
- terminate only if the norm of the residual is below a given threshold
One thing that is always worth trying is a simple multistart approach: run a gradient-based local optimizer multiple times from different random initial guesses, and keep the best solution. That is of course embarrassingly parallel.
And you can get a feeling for whether the global minimum was found by counting how many times the best solution was reached.
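A minimal sketch of such a multistart loop with Optim.jl, using threads for the parallelism (the objective f, the box bounds lo/hi, and the number of starts are all placeholders):

```julia
using Optim

# Multistart sketch: many independent local BFGS runs from random starting
# points inside the box [lo, hi], run in parallel over threads; keep the best.
function multistart(f, lo, hi; nstarts = 100)
    results = Vector{Any}(undef, nstarts)
    Threads.@threads for i in 1:nstarts
        x0 = lo .+ rand(length(lo)) .* (hi .- lo)  # random point in the box
        results[i] = optimize(f, x0, BFGS())
    end
    return results[argmin([minimum(r) for r in results])]
end
```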
For 12 variables I would use a smarter multistart approach like MLSL that has heuristics to avoid repeatedly searching the same local optimum. (In higher dimensions these techniques don’t work so well.)
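In NLopt.jl that might look roughly like this (the objective, the bounds, and all tolerances below are placeholders, not recommendations):

```julia
using NLopt

# Placeholder objective with an in-place gradient, as NLopt expects.
function myobj(x, grad)
    if length(grad) > 0
        grad .= 2 .* (x .- 1.0)
    end
    return sum(abs2, x .- 1.0)
end

opt = Opt(:G_MLSL_LDS, 12)          # MLSL with low-discrepancy start points
opt.lower_bounds = fill(-5.0, 12)
opt.upper_bounds = fill(5.0, 12)
opt.min_objective = myobj
opt.stopval = 1e-10                 # stop once the residual is small enough

local_opt = Opt(:LD_LBFGS, 12)      # gradient-based local searches
local_opt.xtol_rel = 1e-4           # fairly loose tolerance for local runs
opt.local_optimizer = local_opt

minf, minx, ret = optimize(opt, zeros(12))
```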
Note that the tolerance of the local optimizer is a crucial hyper-parameter in a multistart algorithm. You want it low enough that it gives a decent estimate of each local optimum, but not so low that it wastes a lot of time resolving poor local optima very accurately. You can always “polish” the solution afterwards by re-running a local optimizer at a lower tolerance from the best local optimum found.
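The polishing step is then just one more local run from the best point, at a tighter tolerance (reusing the hypothetical multistart function from the sketch above):

```julia
best     = multistart(f, lo, hi)
polished = optimize(f, Optim.minimizer(best), BFGS(),
                    Optim.Options(g_tol = 1e-14))
```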
Sounds like it (since “I know the result of the function should be zero”): in such cases it is usually better to use a root-finding algorithm (e.g. multidimensional Newton) directly.
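For example, a minimal sketch with NLsolve.jl (the residual below is just a stand-in for the real system of 12 equations):

```julia
using NLsolve

# Placeholder residual: the real problem would fill F with the 12 equations.
function residual!(F, x)
    F .= x .^ 2 .- 1.0      # simple system with a root at x = ones(12)
end

sol = nlsolve(residual!, fill(0.5, 12), method = :newton)
sol.zero                    # estimate of the root
```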
I do not disagree, but I think that algorithms which do not take advantage of some structure of the problem to guide the search always fall short of their promises. When the surface is simple, all of them work, and when it is too rough, none of them do. Usually I start with a simple random search, and if that is not enough, I try to understand the bounds and properties of each variable to design a smarter search for the specific function I am dealing with (in chemistry this often has a very clear structural meaning from which one can get insights).
Probably the important message is to always couple a global search with a gradient-based local optimizer when that is possible. Your notes on the choice of the convergence parameters for that are spot on.
I tried to exploit what I know about the problem. In particular, the sum of the lengths of the segments is tightly constrained, let’s say between l_min and 1.002 * l_min. But I have not found an elegant way to exploit that knowledge yet.
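For illustration only, one crude (certainly not elegant) option would be a quadratic penalty on the total length, added to the residual norm; residual_norm, seglengths, l_min, and the weight w below are all placeholders:

```julia
# Quadratic penalty: zero inside [l_min, 1.002 * l_min], growing outside,
# added to the residual norm with a tunable weight w.
function penalized(x; w = 1e3)
    s = sum(seglengths(x))                 # hypothetical helper
    p = max(0.0, s - 1.002 * l_min)^2 + max(0.0, l_min - s)^2
    return residual_norm(x) + w * p
end
```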
Very accurate, very fast. I use a straight line as the starting point and then let the solver modify the x and y values until it finds an equilibrium. I had to treat the positions as the unknowns and use the accelerations and velocities as the residual, which was not so clear to me in the beginning; that is why I used the norm of the residual first. But now I have a perfect solution, thank you!