Techniques to improve resolution of a system of nonlinear equations

mleprovost · October 20, 2020, 10:09pm

Hello,

I would like to solve a set of N = 300 independent 1D nonlinear equations of the type f_i(x_i) = a_i, where the a_i are given and each function f_i is a univariate, strictly increasing bijection. Each nonlinear equation has a unique solution. So far, I am using NLsolve.jl with the Newton method to solve the equations simultaneously. I am already using the fact that the Jacobian is diagonal (all the problems are independent).

However the f_i have some bumps (see Figure) that are sometimes challenging for the Newton method if the initial condition is far from the root. How can I improve the convergence?
Is there a way to exploit that the Jacobian has only strictly positive values?

[Figure: monotone]

dpsanders · October 20, 2020, 10:57pm

If the problems are independent, why don’t you just solve each equation separately?

ChrisRackauckas · October 20, 2020, 11:17pm

Yes, since it’s 1D and monotonically increasing you can use a bracketing method like bisection if you find one value to the left and one value to the right.
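A minimal sketch of that idea, assuming f is strictly increasing and a lies in its range. The helper names `find_bracket` and `bisect`, the doubling search for the bracket, and the test function are illustrative assumptions, not taken from any package.

```julia
# Hypothetical helper: walk outward from x0 with doubling steps until
# f(lo) <= a <= f(hi). Assumes f is strictly increasing and a is in its range.
function find_bracket(f, a, x0; step = 1.0, maxiter = 60)
    lo = hi = float(x0)
    s = step
    for _ in 1:maxiter
        f(lo) <= a && break
        lo -= s
        s *= 2
    end
    s = step
    for _ in 1:maxiter
        f(hi) >= a && break
        hi += s
        s *= 2
    end
    return lo, hi
end

# Plain bisection for f(x) = a on [lo, hi], using only the sign of f(mid) - a.
function bisect(f, a, lo, hi; xtol = 1e-12, maxiter = 200)
    for _ in 1:maxiter
        mid = (lo + hi) / 2
        if f(mid) < a
            lo = mid      # root is to the right of mid
        else
            hi = mid      # root is at or to the left of mid
        end
        hi - lo <= xtol && break
    end
    return (lo + hi) / 2
end

f(x) = x^3 + x                        # illustrative increasing function
lo, hi = find_bracket(f, 10.0, 0.0)
root = bisect(f, 10.0, lo, hi)        # solves x^3 + x = 10
println(root)
```

Because f is monotone, the sign of f(mid) - a is enough to decide which half contains the root, so no derivative is needed.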

Oscar_Smith · October 21, 2020, 12:21am

Doesn’t Newton’s method converge faster than bisection (at least once you’re close)?

ChrisRackauckas · October 21, 2020, 12:47am

Bisection is much more stable though. The best thing to do would be to use a regula falsi (false position) method like the one in https://github.com/JuliaComputing/NonlinearSolve.jl, which combines bracketing for stability with a secant-style interpolation step for acceleration.
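A minimal sketch of a false-position iteration, for readers who want to see the mechanics. This is the Illinois variant (a common safeguard against one endpoint getting stuck), written from scratch; it is not the NonlinearSolve.jl implementation, and the function name `illinois` and the test problem are assumptions.

```julia
# Illinois-modified regula falsi for g(x) = 0 on a bracket with g(lo) < 0 < g(hi).
function illinois(g, lo, hi; ftol = 1e-12, maxiter = 200)
    lo, hi = float(lo), float(hi)
    glo, ghi = g(lo), g(hi)
    glo == 0 && return lo
    ghi == 0 && return hi
    side = 0                      # -1: lo was replaced last, +1: hi was replaced last
    x = lo
    for _ in 1:maxiter
        # Intersection of the chord through (lo, glo) and (hi, ghi) with zero.
        x = (lo * ghi - hi * glo) / (ghi - glo)
        gx = g(x)
        abs(gx) <= ftol && return x
        if gx < 0
            lo, glo = x, gx
            side == -1 && (ghi /= 2)   # hi kept twice in a row: halve its weight
            side = -1
        else
            hi, ghi = x, gx
            side == 1 && (glo /= 2)    # lo kept twice in a row: halve its weight
            side = 1
        end
    end
    return x
end

g(x) = atan(100x) - 0.5           # illustrative steep, increasing function
root = illinois(g, -1.0, 1.0)
println(root)
```

The iterate always stays inside the bracket, which is what gives the method its stability; the chord step is what makes it faster than bisection once the bracket is small.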

mleprovost · October 21, 2020, 12:54am

The main reason is that the Newton method is much faster to converge than the bisection method.

I have added a line search and that seems to solve the problem.
Thank you all for your recommendations.

I will check NonlinearSolve.jl

ChrisRackauckas · October 21, 2020, 12:56am

For reference, if you’re not familiar with the method: Regula falsi - Wikipedia

Tamas_Papp · October 21, 2020, 1:50pm

Bisection is much more robust, and is derivative-free.

For problems with “bumps” like this, plain vanilla Newton’s method can easily diverge. There are safeguards against this, but why bother when you have bisection?

Newton’s method (in its original, simple form) is not a practical general method for univariate or multivariate problems. It is useful when you can prove reasonable convergence analytically, which happens for globally “nice” problems, which are quite rare.

dpsanders · October 21, 2020, 2:51pm

I thought the standard (?) method for 1d functions was Brent’s method, which combines bisection and Newton to give a method that is guaranteed to converge (if I remember correctly). I believe this is implemented in the Roots.jl package.

If you need guarantees then there’s IntervalRootFinding.jl

Tamas_Papp · October 21, 2020, 3:10pm

Not quite: Brent’s method combines bisection with the secant method (and inverse quadratic interpolation), so it does not need derivatives directly.

Brent’s method can be better than bisection for some problems, but I think that first we should establish whether the problem requires a multivariate solver. If the problem is separable, the major efficiency gain will be from that, and the choice of univariate method is of secondary importance compared to it.

mleprovost · October 21, 2020, 3:25pm

Yes, the problem is separable (all the equations are independent), but my code works best if I perform the evaluation of all the f_i(x_i) at once. Also, the computation of the Jacobian is less expensive than the evaluation of the function.

ctkelley · October 21, 2020, 4:56pm

Start with bisection and turn on Newton when things start looking good (i.e., all the |f|s are small). This is the algorithm HP used years ago for scalar equations in their programmable calculators.
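A rough sketch of that strategy for a single equation f(x) = a, assuming f is strictly increasing and a bracket [lo, hi] is known. The name `bisect_then_newton`, the switching threshold `switch_tol`, and the fallback to bisection when a Newton step leaves the bracket are assumptions made for illustration; this is not a reconstruction of the HP algorithm.

```julia
# Bisection until the residual is small, then Newton steps that must stay
# inside the current bracket. Assumes f is strictly increasing with df > 0.
function bisect_then_newton(f, df, a, lo, hi;
                            switch_tol = 1e-2, ftol = 1e-12, maxiter = 200)
    x = (lo + hi) / 2
    for _ in 1:maxiter
        r = f(x) - a
        abs(r) <= ftol && return x
        # Keep the bracket up to date using the sign of the residual.
        if r < 0
            lo = x
        else
            hi = x
        end
        xn = x - r / df(x)                       # candidate Newton iterate
        if abs(r) < switch_tol && lo < xn < hi
            x = xn                               # residual is small: trust Newton
        else
            x = (lo + hi) / 2                    # otherwise take a bisection step
        end
    end
    return x
end

f(x) = atan(100x)
df(x) = 100 / (1 + (100x)^2)
root = bisect_then_newton(f, df, 0.5, -1.0, 1.0)
println(root)
```

The bracket check means a bad Newton step can never move the iterate away from the root, while the final iterations still get Newton's fast local convergence.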

Tamas_Papp · October 22, 2020, 8:52am

There is no technical reason that should prevent you from implementing a coordinate-wise bisection or other method; it is just a matter of bookkeeping, and the algorithm is the same.
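A minimal sketch of the bookkeeping, assuming a hypothetical batched evaluator `F` that takes the vector of all x_i and returns the vector of all f_i(x_i) in one call, and assuming each [lo_i, hi_i] brackets its root. The name `bisect_all` and the three-equation example are illustrative.

```julia
# Coordinate-wise bisection for F(x)[i] = a[i], with one batched call to F
# per iteration. Assumes each f_i is strictly increasing and
# F(lo)[i] <= a[i] <= F(hi)[i] for every i.
function bisect_all(F, a, lo, hi; xtol = 1e-12, maxiter = 200)
    lo, hi = float.(lo), float.(hi)          # work on copies
    for _ in 1:maxiter
        mid = (lo .+ hi) ./ 2
        below = F(mid) .< a                  # true where the root is right of mid
        lo = ifelse.(below, mid, lo)
        hi = ifelse.(below, hi, mid)
        maximum(hi .- lo) <= xtol && break
    end
    return (lo .+ hi) ./ 2
end

scales = [1.0, 10.0, 100.0]
F(x) = atan.(scales .* x)                    # stands in for the batched f_i
a = fill(0.5, 3)
x = bisect_all(F, a, fill(-10.0, 3), fill(10.0, 3))
println(x)
```

Each coordinate shrinks its own bracket independently, so a hard equation does not slow down the easy ones; the only coupling is that all coordinates share the same number of iterations.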

ctkelley · October 22, 2020, 10:28am

This is wise advice. You can use a scalar bisection + Newton-Armijo solver (or roll your own) and loop over the functions. In that way you are likely to converge for most of them and will be able to identify any corner cases for special attention. The line search is really important for problems like this.

One problem with using Newton-Armijo for systems to solve this problem is that the hardest of the equations will govern the line search for all of them. Take this example, please

f1(x) = atan(x); f2(x) = atan(10x); f3(x) = atan(100x);

with x0=1 as the initial iterate for all three equations.

f3 = 0 is a much harder problem for Newton-Armijo than f1 = 0, which is easy. Solving as a system will make all three problems appear to be hard.
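To see the difference in difficulty, here is a minimal sketch of a scalar Newton iteration with a simple Armijo backtracking line search, applied to each of the three equations separately from x0 = 1. The function name `newton_armijo`, the step-halving rule, and the parameter `alpha = 1e-4` are assumptions chosen for illustration. In this sketch f1 should accept full Newton steps, while f2 and f3 need the step to be cut back repeatedly.

```julia
# Scalar Newton with Armijo backtracking for f(x) = 0.
# Returns the final iterate and the total number of step-length halvings.
function newton_armijo(f, df, x0; ftol = 1e-10, alpha = 1e-4, maxiter = 100)
    x = float(x0)
    halvings = 0
    for _ in 1:maxiter
        fx = f(x)
        abs(fx) <= ftol && break
        d = -fx / df(x)                      # full Newton step
        lambda = 1.0
        # Halve the step until |f| decreases sufficiently (Armijo condition).
        while abs(f(x + lambda * d)) > (1 - alpha * lambda) * abs(fx) && lambda > 1e-12
            lambda /= 2
            halvings += 1
        end
        x += lambda * d
    end
    return x, halvings
end

scales = (1.0, 10.0, 100.0)                  # f1, f2, f3 from the example above
results = map(scales) do c
    newton_armijo(x -> atan(c * x), x -> c / (1 + (c * x)^2), 1.0)
end
for (c, (x, h)) in zip(scales, results)
    println("atan($(c)x): root ≈ $x, step halvings = $h")
end
```

Solved one at a time, each equation only pays for its own line search, which is the point of looping over the functions instead of treating them as one system.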
