Nonlinear System of Equations with Bounds/Constraints on Unknowns

dpo · April 15, 2020, 1:01am

@bielim Glad to help! KNITRO is a mature solver that contains many people’s many years of research and efforts. It also contains several algorithms (some of the interior-point variety, including one that is quite similar to IPOPT, and some of the active-set variety). If you don’t specify an algorithm, KNITRO chooses one for you automatically at the beginning (you should see this if you inspect the output). You’ll probably want to try each algorithm and identify the one that’s best for your problem.

Using our interface to KNITRO, you can specify a starting point with

stats = knitro(model, x0=[0.5; 0.5; 0.5; 0.5; 0.5; 0.5])

or

stats = knitro(model, x0=model.meta.x0)  # use starting point specified in the model

By default, we decided to let KNITRO compute its own. The interior-point algorithms will reject your starting point if it doesn’t strictly satisfy the bound constraints.

I’m quite glad to see that TRON isn’t doing too badly either (at least with the NLS model). It’s a projected-direction method. I would have to investigate a little to determine why it has trouble solving the other model.

Feel free to open issues on our repos if you have difficulties. Also feel free to ask questions on our Slack channel: https://optimizers-workspace.slack.com/

jlperla · April 15, 2020, 2:33am

Try running the tuner to see if it helps

m = Model(with_optimizer(KNITRO.Optimizer, tuner = 1))

It might find an even better algorithm

bielim · April 16, 2020, 1:57am

This is beautiful, @saschatimme! I’ve been trying the approach you posted with different values of the parameter M – for a single parameter, homotopy continuation is slower than the other methods I tried, but the call to monodromy_solve is a one-off cost that I’m happy to pay given that I’ll have to repeat the calculation for many different values of M (and the subsequent iterations are pretty fast!).

As an aside, I’d like to express my appreciation for the amazing Nextjournal documentation – really neat and user-friendly!

When trying different values for M, I always got four real solutions. Two of them are actually the same, because the solution is symmetric in the sense that the two components of the mixture distribution the solution describes can be relabeled while leaving the mixture pdf invariant. I’m only interested in the mixture pdf, so the “double presence” of solutions is not really a problem - picking either one of the solutions is fine. (But I saw that HomotopyContinuation’s GroupActions would probably be able to filter the solutions down to the ones that are truly different, and I will give this a try.)

However, I also tried to incorporate the equality constraints on the weights (w_1+w_2= 1) into the system of equations by replacing all instances of w_2 with 1-w_1, and when I do that (now dealing with a system of 6 equations for 5 unknowns), the solver doesn’t find a real solution anymore.
I guess I could just pick the solution that satisfies the constraint on the weights most accurately, but I was wondering why directly incorporating it into the system doesn’t seem to work, and if there is a way I can include the constraint in the problem setup rather than after the solution has been computed.

saschatimme · April 16, 2020, 5:12am

Happy to hear that you find this approach helpful and that the documentation is helpful as well

When you have a more equations than unknowns, then you need a very special set of equations such that you still have solutions. Geometrically, your 6 equations in 6 unkowns already only result in 18 points, and now you try to intersect these with the hyperplane 1=w_1 + w_2. So these 18 points have to be quite special such that at least one lies on this plane. Obviously, you know for theoretical reasons that this is the case here. However, I assume your measure are not exact, right?
This perturbation in the measure then would yield solutions which are a little bit away from the hyperplane. When you look at the 4 real solutions for your example above, then you notice that w_1 and w_2 don’t sum up to 1 exactly

julia> 0.31831615942330055 + 0.6949587215517059
1.0132748809750065

julia> 0.6421990255933143 + 0.38742674206054184
1.0296257676538563

If this constraint is important to be satisfied, maybe you could use the obtained solutions coming from HomotopyContinuation.jl as a starting point for a least square optimization routine.

bielim · April 16, 2020, 7:58am

many people’s many years of research and efforts

So no magic bullet, just the magic of hard work

I see - I had assumed that the starting value x0 given as input to the Model (e.g., model = ADNLSModel(c, x0, 7, lvar=zeros(6))) would automatically be used as the starting point for knitro as well.

I’ll keep experimenting and will post any issues I encounter!

bielim · April 16, 2020, 8:06am

m = Model(with_optimizer(KNITRO.Optimizer, tuner = 1))

Apparently with_optimizer has been deprecated and replaced by Model(optimizer_with_attributes(KNITRO.Optimizer, "tuner" => 1))

I tried this – the Knitro Tuner decided that the fastest solution would be found by a solver that involves “algorithm 3”, which based on the user manual is an Active-Set algorithm. But the tuning did not lead to an improvement in the time to solution.

jlperla · April 16, 2020, 1:14pm

Yeah, you never know. Your problem is sufficiently small that there may not be much difference.

The other two very useful options for knitro to keep in mind in the future are: (1) ms_enable = 1 which turns on multi-start, and is huge time save if you have ugly, non-convex equations; and (2) honorbnds = 1 which was necessary for me when my box-bounds defined tight constraints on when my equations could be evaluated.

bielim · April 16, 2020, 7:01pm

That makes sense, thanks for the explanation! Yes, as you point out, for theoretical reasons this overdetermined system should have a unique solution (up to flipping the labels of the two mixture components), but I understand that numerical inaccuracies can stand in the way of finding that solution.

I’ll test if using the homotopy continuation solution as an extremely educated inital guess for a subsequent optimization algorithm is too computationally expensive. As a simpler alternative, picking the solution that best satisfies the constraint, followed maybe by “re-normalizing” the weights (w_{1, \textrm{normalized}} = \frac{w_1}{w_1+w_2}, w_{2, \textrm{normalized}} = \frac{w_2}{w_1+w_2}) such that the constraint is exactly satisfied may already be good enough an approximation for my purpose.

bielim · April 16, 2020, 7:10pm

On a different note: Is it possible to accept more than one reply as solutions? I tried to mark both @dpo’s and @saschatimme’s replies, but when marking the second one the first one gets unmarked.

Tamas_Papp · April 17, 2020, 5:03am

Unfortunately, it isn’t possible. Just mark the one you prefer.

CeterisPartybus · February 8, 2025, 6:12am

This is great. However, I updated the syntax of the solution from @dpo to the current package syntax and tried to replicate the result. But when I run the code to IPOPT fails.

Any idea what I did wrong?

Here is what I did:

using NLPModels, NLPModelsIpopt, ADNLPModels

function c(x)
    M = [2.250, 9.675, 57.263, 427.219, 3836.109, 40234.852]
    F = [ M[1] - x[5]*x[3]*x[1] - x[6]*x[4]*x[2] ;
          M[2] - x[5]*x[3]^2*(x[1]+1)*x[1] - x[6]*x[4]^2*(x[2]+1)*x[2] ;
          M[3] - x[5]*x[3]^3*(x[1]+2)*(x[1]+1)*x[1] - x[6]*x[4]^3*(x[2]+2)*(x[2]+1)*x[2] ;
          M[4] - x[5]*x[3]^4*(x[1]+3)*(x[1]+2)*(x[1]+1)*x[1] - x[6]*x[4]^4*(x[2]+3)*(x[2]+2)*(x[2]+1)*x[2] ;
          M[5] - x[5]*x[3]^5*(x[1]+4)*(x[1]+3)*(x[1]+2)*(x[1]+1)*x[1] - x[6]*x[4]^5*(x[2]+4)*(x[2]+3)*(x[2]+2)*(x[2]+1)*x[2] ;
          M[6] - x[5]*x[3]^6*(x[1]+5)*(x[1]+4)*(x[1]+3)*(x[1]+2)*(x[1]+1)*x[1] - x[6]*x[4]^6*(x[2]+5)*(x[2]+4)*(x[2]+3)*(x[2]+2)*(x[2]+1)*x[2] ]
          #x[5] + x[6] - 1]  # left this out
    return F
end


# define a model with derivatives computed by ForwardDiff
# bounds on the variables are included
f(x) = 0.0
x0 = [0.5; 0.5; 0.5; 0.5; 0.5; 0.5];
lcon = zeros(6)
ucon = zeros(6)
lvar = zeros(6)
uvar = [Inf; Inf; Inf; Inf; Inf; Inf]
model = ADNLPModel(f, x0, lvar, uvar, c, lcon, ucon)
stats = ipopt(model)
x = stats.solution

And the output:


******************************************************************************
This program contains Ipopt, a library for large-scale nonlinear optimization.
 Ipopt is released as open source code under the Eclipse Public License (EPL).
         For more information visit https://github.com/coin-or/Ipopt
******************************************************************************

This is Ipopt version 3.14.17, running with linear solver MUMPS 5.7.3.

Number of nonzeros in equality constraint Jacobian...:       36
Number of nonzeros in inequality constraint Jacobian.:        0
Number of nonzeros in Lagrangian Hessian.............:       10

Total number of variables............................:        6
                     variables with only lower bounds:        6
                variables with lower and upper bounds:        0
                     variables with only upper bounds:        0
Total number of equality constraints.................:        6
Total number of inequality constraints...............:        0
        inequality constraints with only lower bounds:        0
   inequality constraints with lower and upper bounds:        0
        inequality constraints with only upper bounds:        0

iter    objective    inf_pr   inf_du lg(mu)  ||d||  lg(rg) alpha_du alpha_pr  ls
   0  0.0000000e+00 4.02e+04 1.00e+00  -1.0 0.00e+00    -  0.00e+00 0.00e+00   0
   1r 0.0000000e+00 4.02e+04 9.99e+02   4.6 0.00e+00    -  0.00e+00 2.43e-08R 10
   2r 0.0000000e+00 4.02e+04 9.95e+02   4.6 1.97e+04    -  7.63e-14 1.01e-16F  1
   3r 0.0000000e+00 3.53e+04 8.34e+05   3.2 2.90e+04    -  6.56e-04 1.37e-03h  5
   4r 0.0000000e+00 3.57e+04 6.22e+07   3.2 1.73e+02   8.0 2.85e-01 1.93e-03f  3
   5r 0.0000000e+00 3.57e+04 4.77e+08   3.2 2.10e+02   9.3 1.96e-02 9.63e-04f  4
   6r 0.0000000e+00 3.57e+04 5.75e+09   3.2 2.27e+02  10.7 5.93e-02 5.11e-04f  5
   7r 0.0000000e+00 3.57e+04 1.27e+11   3.2 2.35e+02  12.0 9.98e-03 5.32e-04f  5
   8r 0.0000000e+00 3.57e+04 2.94e+12   3.2 2.43e+02  13.3 6.99e-01 5.60e-04h  5
   9r 0.0000000e+00 3.57e+04 6.81e+13   3.2 2.49e+02  14.6 1.03e-02 5.92e-04h  5
iter    objective    inf_pr   inf_du lg(mu)  ||d||  lg(rg) alpha_du alpha_pr  ls
  10r 0.0000000e+00 3.57e+04 1.58e+15   3.2 2.55e+02  16.0 6.32e-02 6.28e-04h  5
  11r 0.0000000e+00 3.57e+04 3.65e+16   3.2 2.60e+02  17.3 2.08e-02 6.67e-04h  5
  12r 0.0000000e+00 3.57e+04 8.42e+17   3.2 2.64e+02  18.6 1.00e+00 7.11e-04h  5
  13r 0.0000000e+00 4.01e+04 2.27e+21   3.2 2.67e+02  20.0 1.59e-02 1.21e-02w  1
WARNING: Problem in step computation; switching to emergency mode.
  14r 0.0000000e+00 3.57e+04 1.93e+19   3.2 2.67e+02  20.0 1.59e-02 7.57e-04h  5
WARNING: Problem in step computation; switching to emergency mode.
  15r 0.0000000e+00 3.57e+04 1.93e+19   3.2 2.67e+02  20.0 0.00e+00 0.00e+00R  1
WARNING: Problem in step computation; switching to emergency mode.
Cannot call restoration phase at point that is almost feasible for the restoration NLP (violation 0.000000e+00).
Abort in line search due to no other fall back.
Step computation in the restoration phase failed.

Number of Iterations....: 15

                                   (scaled)                 (unscaled)
Objective...............:   0.0000000000000000e+00    0.0000000000000000e+00
Dual infeasibility......:   1.0229860218632143e+19    1.0229860218632143e+19
Constraint violation....:   3.5687352884718770e+04    3.5687352884718770e+04
Variable bound violation:   0.0000000000000000e+00    0.0000000000000000e+00
Complementarity.........:   1.8998141258422573e+10    1.8998141258422573e+10
Overall NLP error.......:   3.5687352884718770e+04    1.0229860218632143e+19


Number of objective function evaluations             = 76
Number of objective gradient evaluations             = 3
Number of equality constraint evaluations            = 80
Number of inequality constraint evaluations          = 0
Number of equality constraint Jacobian evaluations   = 18
Number of inequality constraint Jacobian evaluations = 0
Number of Lagrangian Hessian evaluations             = 16
Total seconds in IPOPT                               = 2.650

EXIT: Restoration Failed!
6-element Vector{Float64}:
 0.3589160598971504
 0.3589158882964768
 1.5695199650801184
 1.5695185285845537
 1.6849627578465884
 1.6849627521539774

Topic		Replies	Views
Solvers fail on nonlinear problem (which has solutions) Optimization (Mathematical) jump	30	3024	January 26, 2019
Calling Mathematica into Julia to symbolically solve a system of non-linear equations Specific Domains question , nonlinear , symbolics , mathematica	18	1274	May 11, 2022
Domain error using nlsolve New to Julia	27	2968	May 10, 2020
Solve Constrained Nonlinear System (?) General Usage question	9	2020	May 7, 2020
Differences between NLsolve and Optim in solving system of equations Optimization (Mathematical)	24	3038	November 16, 2022

Nonlinear System of Equations with Bounds/Constraints on Unknowns

Related topics