Formation
and examination of a few new iterative methods for nonlinear equation numerical
solutions
Rohit Kumar Pandey1*,
Dr. Jaya Kushwah2
1 Research Scholar,
Dept. of Mathematics, Sardar Patel University Balaghat M.P.
rkpandeyjii@gmail.com
2 Associate Professor,
Sardar Patel University, Balaghat, M.P.
Abstract
The extensive use of numerical solutions to nonlinear
equations in fields as diverse as physics, engineering, economics, and more
makes this a vital topic in scientific computing and applied mathematics.
Although efficient, classical methods such as Newton-Raphson and Secant methods
have drawbacks include being sensitive to initial estimations, diverging in
some circumstances, and requiring more computing power to evaluate derivatives.
Our goal in this research was to find better iterative solutions to nonlinear
equations of the type f(x)=0 that are more resilient and have a faster
convergence rate. We came up with a few new approaches and tested them out. By
adding weight functions and higher-order correction terms to conventional
schemes, the newly created approaches are able to adapt and expand upon them.
Proofs of convergence order, error estimates, and stability criteria are all
part of the extensive theoretical analysis that is given. These techniques
improve computing efficiency by achieving higher-order convergence (up to
fourth and sixth order) without increasing the number of function or derivative
evaluations each iteration. The performance of the suggested approaches is
compared to current techniques through a series of numerical tests performed on
benchmark nonlinear equations. These novel iterative approaches show good
promise as instruments for addressing complicated nonlinear problems in
real-world applications, thanks to their enhanced convergence behaviour.
Keywords: Formation,
Iterative Methods, Nonlinear Equation, Numerical Solutions
1. INTRODUCTION
When describing occurrences on a grand
scale in terms of physical phenomena, such as laws of nature, mathematical
models showing relationships between variables are frequently employed. It is
essential to obtain the solution of a model in order to investigate the
associated physical process and predict its future evolution. It is typically
not feasible to obtain the exact or analytical form of answers due to the
inherent non-linearity among the variables in the developed models. The study
of nonlinear issues presents a greater challenge compared to linear ones. It
should be noted that algebraic solutions are usually not discovered for
polynomial equations of arbitrary type with coefficients of degree five or
above, as stated in the Abel-Ruffini theorem. But remember that the precise
responses aren't always what you desire. Just by numerically approximating the
answers to a certain precision, we can gain a deeper grasp of the underlying
physical process (Acton, F.
S. 2020).
Numerical analysis is a subfield of mathematics
concerned with determining optimal numerical solutions to a wide range of
issues. The usual method for solving nonlinear equations is to use the concept
of iterative theory. In mathematics, iterative techniques are procedures that,
given a sufficiently accurate initial estimate, provide successive estimates
that, taken together, may converge to a solution. Convergence of generated
sequences and the construction of iterative algorithms are the principal foci
of mathematical analysis. It is critical to develop fast and efficient
iterative algorithms since the complexity of modern mathematical problems is
increasing exponentially. Univariate nonlinear equations or multivariate
systems of nonlinear equations might provide a mathematical representation of
these problems. Using numerical approaches, we can solve the following
nonlinear problems (Canale, R. P. 2006):
v
Isentropic supersonic
flow problem
Waves compressing and expanding in flow channels are common
occurrences in fluid dynamics. A differential equation governs the examination
of the uniform two-dimensional isentropic supersonic fluid flow around a sharp
convex corner of a frictionless surface:

In where δ is the specific heat ratio of the fluid, M is the
Mach number, and δ is the flow deflection angle. From an initial Mach
number M1 to a final Mach number M2, the flow deflection angle δ that
expands the fluid may be obtained by integrating the above equation (Li, W.
2006).

Where
Given the values of M1, γ, and δ,
the estimate of M2 is a specific nonlinear issue in this scenario.
Nevertheless, it appears that solving this problem algebraically is not
possible in this case.
v
Three-body problem under
gravitational field
For a tiny mass object (m) subjected to the gravitational pull of
two large bodies (M1 and M2) in orbit around it, the Lagrange points in space
represent the sites of equilibrium in the field of celestial mechanics.
Lagrange points are locations where the centripetal force needed to propel a
tiny object around its orbit is exactly equivalent to the gravitational forces
exerted by two big masses. As an example, a satellite can maintain a
synchronous orbit with Earth by harnessing the centripetal force provided by
the Sun and Earth's gravitational pull. Equation that follows from this model
is (Basu, D. 2008).:

The situation where G stands for the gravitational constant, R for
the distance between Earth and the Sun, r for the distance between the
satellite and Earth, ϋ for the angular velocity, and M1, M2, and m for the
masses of the Sun, Earth, and the satellite, respectively. Then, using the
Kepler’s third law: ![]()
the
given problem leads to solve the following quintic polynomial equation:
(1)
Where c1 = ±(M1 + M2), c2 = ±3(M1 + M2)R, c3 = ±3(M1 + M2)R2, c4 =
(M2 ∓ M2)R3, c5 = 2M2R4, c6 = M2R5. Two distinct solutions to the
provided problem are represented by the +/- symbol in this case. The Lagrange
points can be determined by calculating the quintic equation (1) given the
specific values of the masses M1, M2, m, R, and r, as well as the angular
velocity ό.
2. SOME BASIC CONCEPTS
The basic concepts that are relevant to this
study are presented here.
2.1 Root or zero
A given function is denoted as f: R → R. A
root of an equation is a number that, when multiplied by itself, gives the
equation f(x) = 0. To rephrase, the zero of the function f(x) is x = x. When
the x-axis intersects the graph of y= f(x) on the xy-plane, we say that this
point is a geometric zero of f(x). Rather than limiting oneself to the domain
of real numbers, the concept of root or zero may be easily extended to
functions defined over the complex domain C (Barati, A.
2007).
2.2 Multiple zeros
“A solution
is
a zero of multiplicity m of f(x) if, for
where
In particular, if the given function is m
times continuously differentiable in the interval
then
it has a zero of multiplicity m at x∗ ∈ (a,b) if and only if ”

If m = 1, the given zero is particularly called the simple zero.
2.3 Intermediate value property
Assume that
is a continuous function on the interval [a,b] for every x. Refer
to Figure 1 for reference, and find an integer s such that f(a) < s <
f(b). Then, for any x in the range [a,b], f(x) equals s. One important
consequence is that it confirms the presence of a zero of f(x) in the interval
[a,b] if the condition is true for s = 0 (Hernández, M. A. 2000).

Figure 1: Intermediate value property
2.4 Fixed point iteration
The function's fixed point is a numerical value denoted as x*.
The application of a mapping f to x* results
in x* being mapped onto it. It should be borne in mind realize that locating a
function's zeros is analogous to locating the fixed points of a related
function. Specifically, let's say that φ is the function that provides the
solutions to the equation f(x) = 0, where φ(x) = x − f(x). From that
point on, the zero x of f(x) is determined to be the fixed point of χ(x).
Consider any point x0. Afterwards, the technique for fixed-point iteration is stated
as follows:
![]()
where φ is called iteration function. This
finding proves that there are enough variables for an iterative sequence to
converge. ![]()
“In accordance with Theorem 1.2.1, let x be the
interval I value of the zero of the function f(x). In the set I, there are continuous
functions φ(x) and φ 0 (x) where the definition of φ(x) is such
that x = φ(x) is equivalent to f(x) = 0. In the case when every integer x
in I has an absolute value less than 1, the sequence of approximations
{xk}k≥0 will eventually converge to the root x, provided that x0 is an
integer (Schnabel, R. B. 2013).”
2.5 Order of convergence
An iterative sequence
"converges" to an element x* of rank
p ∈ R + when

for all values of c > 0. The asymptotic error
constant is the name given to the constant c in this context. Linear
convergence, quadratic convergence, and cubic convergence are terms used to
describe the aforementioned sequence for p=1, 2, and 3, respectively. Furthermore, let
be the errors in corresponding iterations,
then the following relation:

known as the error equation. Keep in mind that finding the error
equation is the first step in figuring out the order of convergence, p. E.
Schroder offers an alternative definition of order of convergence. An iteration
function φ(x), corresponding to the iterative scheme
is said to
possess convergence order p ∈ Z +, if the sequence
converges to
and the following holds:
![]()
Here,
will be the asymptotic error constant.
3. DETERMINATION OF INITIAL APPROXIMATION
The convergence of iterative techniques is strongly affected by
the initial approximation of the solution, so determining it is of the utmost
importance. Below are some of the methods that may be used to choose good first
estimations (Ben-Israel, A. 2017).
3.1 Geometrical approach
By drawing a graph of the function f(x) and finding its
intersection with the x-axis, we may obtain an approximation of the solution to
the equation f(x) = 0. Here we get a very approximate root, as this is the
value at which f(x) = 0. The lack of precision makes graphical methods for
computing the roots almost useless. On the other hand, numerical approaches can
use the preliminary estimates from graphical methods as their starting points.
The bisection method and other bracketing techniques can be used to an interval
that contains the answer.
3.2 Trial and error approach
Here, we take a wild guess at x and see if our estimated root can
be better estimated by seeing whether f(x) is near zero. If not, we make
another wild guess and see if our updated root estimate holds up. The method is
iterated until an estimate is made that yields a f(x) value around zero.”
3.3 Incremental search approach
Beginning at one end of the region of interest, this approach
evaluates functions at tiny intervals across the region. It is presumed that a
root is within the increment whenever the function's sign changes. As a
starting point for the bracketing methods, the values of x at the start and end
of increment would be useful.
3.4 Sigmoid function approach
Took a time-saving technique to choose a first guess that was
close enough to the target solution. Let us pretend that there is a single
solution to the function f(x) on the interval [a,b] and that it is continuous
on that interval. The function
is called the sigmoid transform of the function
f(x) for some β > 0. Then, for the sufficiently large values of β,
the following formula provides approximation to the solution of f(x) = 0 as
(Momani, S. 2015):
(3.1)
Where f(a)f(b) < 0 and sgn(t) is the signum function which is
defined by

In equation (3.1), any rule for numerical
integration can be used to evaluate the integral.
4. IM FOR MULTIPLE ROOTS
A solution x ∗ of f(x) = 0 is a root of
multiplicity m if and only if
![]()
Then f(x) can be expressed as
(4.1)
Where g(x) is a continuous function such that g (x∗) 6= 0.
The numerical methods for locating multiple zeros of functions can
be broadly classified in two categories in accordance with:
1.
Multiplicity ‘m’ is known,
2.
Multiplicity ‘m’ is not known.
Case (i): When m is known, the
well-known Newton method (1.3.2) generates a sequence of iterates
for
suitably chosen initial approximation x0, wherein the error equation is given
by
(4.2)
Where ek = xk −x∗ is the error at k th
iteration. For m > 1, the sequence {xk} converges to x∗ with the rate of a geometric progression with common ratio
This slows down the convergence order of
Newton method from quadratic to linear. To retain the quadratic convergence of
Newton method, proposed ¨ a modified Newton method given by.
(4.3)
The method (1.8.3) converges quadratically for all m ∈ N.
Case (ii): When m is not known,
demonstrated that Newton method as well as secant method converge linearly for
the case of multiple roots. They suggested to define a new function,

Since,
, therefore u(x) can be expressed as
(4.4)
Where
Since, g(x∗ ) ≠ 0, it directly
follows that
![]()
Hence, x* is a straightforward zero of u. All the concepts
pertaining to simple zeroes may be used if u and its derivatives are
substituted for f and its derivatives in any IM. Take the Newton technique as
an example; when we swap out f for u, we get
(4.5)
With respect to the number of zeros in f, the scheme given by
(1.8.5) will always be quadratically convergent. An extra assessment of f 00
results in a more arduous technique for computing the iterates, which is the
single theoretical shortcoming of this method. Using the weight function
approach, researchers have recently constructed several higher Order IM for
many roots. In the most basic scenario, think of a one-point Newton-like
approach with an extra weight function φ functioning as (Wheatley, P. O. 2008).
(4.6)
Where the function φ is directly dependent on the variable m.
The highest possible order may be achieved by evaluating the criteria for the
function φ in the context of procedure (1.8.6). Using the right weight
functions in the next stages to create the higher-order methods, the given IM
may be easily expanded to a multipoint methodology.
5. FRACTIONAL IM
Using operators with derivatives of any order, fractional calculus
expands the idea of elementary calculus to a broader theoretical framework.
More accurately describing the spatial non-locality and temporal history of complex
physical events are models based on fractional derivatives than standard
mathematics. A new field of study is the use of fractional calculus to
iteratively solve nonlinear equations. The iterative methods that are created
by using the idea of fractional order derivatives are collectively referred to
as fractional IM.
5.1 Caputo and Riemann-Liouville fractional derivatives
Several definitions of the fractional order derivative may be
found in published works. Among the most common categories of explanations are:
1.
Caputo fractional derivative,
2.
Riemann-Liouville fractional
derivative.
For given a ∈ R and the function f : [a,∞) → R, the fractional derivative
of f of order α ∈ R which is denoted as Fα and is defined as follows:

where n is an integer such that n−1 <
α ≤ n. Apart from these, some other definitions are also available
in literature. However, defined in any manner, all of these might not lead to
the same results even for the sufficiently smooth functions. Indeed, for a
constant function f(x) = c, it turns out that ![]()
Recent years have seen the development of
iterative approaches that use the Caputo or Riemann-Liouville derivatives in
place of the conventional derivatives. Nevertheless, to fully resolve the
provided integrals, it is necessary to evaluate special functions such as Gamma
and Mittag-Leffler functions. Therefore, solving nonlinear equations
efficiently using these approaches is not only wasteful, but may be impossible
in some instances.
5.2 Conformable fractional derivative
Put forward a well-behaved and straightforward
definition, the conformable fractional derivative. A function f: [a,∞)
→ R is defined as the conformable derivative of order α for all a ∈ R and α ∈ (0,1).
(5.1)
The function f is referred to as α-differentiable if the
provided limit exists. Take note that, specifically for α = 1, Eq. (6.1)
simplifies to the definition of ordinary derivative. Notably, unlike the Caputo
and Riemann-Liouville derivatives, conformable derivatives inherit most of the
fundamental features of ordinary derivatives, such as the product rule,
quotient rule, chain rule, mean value theorems, etc (Gerlach, J. 2014).
Recent work has suggested an iterative strategy for determining
the zeros of function f(x) using the notion of conformable fractional
derivative (1.9.1). Specifically, the scheme may be expressed as follows for
each α ∳ (0,1] and a ∈ R:
(5.2)
On which f may be expressed as a function that is
α-differentiable. Keep in mind that, when α = 1, the Newton technique
is reduced to the formulation (6.2). Here we lay out the circumstances in which
the proposed method yields the quadratic convergence order (Sharma, 2007).
6. SYSTEMS OF NONLINEAR EQUATIONS
Here we have a generic system of n
equations with n unknowns, written in the form,
(6.1)
as fi maps a vector (x1, x2,..., xn)T from the
n-dimensional space R n onto the real line R, where 1 ≤ i < n. If
there is a nonlinear function in the system, we say that it is nonlinear. By
using the compact notation (f1, f2,..., fn) T, we can define a vector function
F that maps R n into R n and use it to describe the system of n equations with
n unknowns.
![]()
Thus, system (1.10.1) reduces to the form,
(6.2)
Where x = (x1, x2,..., xn) T and the functions f1(x), f2(x),...,
fn(x) represent the coordinate functions of F(x). The vector x ∗ = (x ∗ 1 , x ∗ 2 ,..., x ∗ n ) T which satisfies,
![]()
Is known as the solution or root of the equation F(x) = 0. Keep in
mind that the domain R n is just used for simplicity's sake, but the concept
may be generalized to the case of complex domain Cn in a similar way.
6.1 Differentiation of vector-valued functions
For all natural numbers from 1 to n, let L (R n )
be the space of bounded linear mappings. The following matrix form is given for
the first Frechet's 1 derivative of F(x), which is a linear operator F 0 (x) ∈ L (R n), for every x ∈ R n.
(6.3)
Where
is the partial derivative of fi at x,
evaluated with regard to x j. The Jacobian matrix is the usual name for the
integral of F0 (x) in Equation (7.3).
Typically, the k-linear function, described below, is the k-th
derivative of F at x ∈ R n.
Explanation : An open convex set Ω is defined by F : Ω
⊆ R n → R n, where F is a sufficiently differentiable
function. A k-linear function F(k) (x) is the kth Frechet derivative at x'∈ Ω
, such that the following holds for any
vectors
:
1.
![]()
2.
![]()
3.
for any permutation ![]()
In view of the above definition, the following notation can be
used for convenience:
1.
![]()
2.
for any v ∈ R n and p ∈ N.
6.2 Generalized Taylor series
The proposed theorem defines the generalized Taylor series
expansion of a vector-valued function that is sufficiently differentiable:
Assumption: The mapping F: Ω ⊆ R n →
R n is k-times, let's pretend. If the function is Frechet differentiable in a
closed convex set Ω ∈ R n, then the given outcome
is true for every x, t ∈ Ω.

Observe that the term
for each i, is well-defined in view of the
Definition.
6.3 Generalized divided difference
You may apply the generalized divided difference for vector-valued
functions F: Ω ⊆ R n → R n to the case
of scalar-valued functions, as stated in Sec. 1.2.9. The operator for divided
differences is described as a mapping F[·,·]: Ω×Ω → L (R n ).
(6.4)
Moreover, if F is Frechet-differentiable, then the first-order
divided difference can be ´ defined as (Alsaedi,
A. 2015).
(6.5)
However, for the computational purposes, the divided difference
operator F[x, y] can be considered as the approximation of Jacobian matrix F 0
(x), as the operator ![]()
which is being defined as the
divided difference matrix with elements,
(6.6)
Wherein ![]()
7. CONCLUSION
All things considered, a major step forward in
computational mathematics has been the development and study of novel iterative
methods for numerically solving nonlinear equations. The goal of these methods
is to make the convergence process faster, more stable, and more
computationally efficient than the Newton-Raphson and Secant methods, which are
traditional approaches. The recently suggested approaches frequently use hybrid
systems, adaptive parameters, or higher-order derivatives to improve accuracy without
drastically raising computing cost. Theoretical research and numerical tests
have proven that these approaches converge quicker with fewer iterations than
traditional methods, especially for equations that are sensitive to initial
assumptions or derivative assessment, and for which standard methods fail.
Convergence features, including convergence order and error bounds, have been
studied and verified to be resilient and applicable to a wide variety of
nonlinear situations. Applying such sophisticated methods can be especially
useful in domains where nonlinear models are common, such as engineering,
physics, and the applied sciences.
References
1.
Acton, F. S.
(2020). Numerical methods that work (Vol. 2). American
Mathematical Soc.
2.
Chapra, S. C., & Canale, R. P. (2006). Numerical methods
for engineers. Tata McGraw-Hill.
3.
Chen, J., & Li, W. (2006).
On new exponential quadratically convergent iterative formulae. Applied
Mathematics and Computation, 180(1), 242-246.
4.
Basu, D. (2008). From third to fourth-order variant of Newton’s
method for simple roots. Applied Mathematics and Computation, 202(2),
886–892.
5.
Darvishi, M. T., & Barati,
A. (2007). A third-order Newton-type method to solve system of nonlinear
equations. Applied Mathematics and Computation, 187(2), 630–635.
6.
Hernández, M. A.
(2000). Second-derivative-free variant of Chebyshev’s method for nonlinear
equations. Journal of Optimization Theory and Applications, 104(3),
501–515.
7.
Hernández, M. A.
(2001). Chebyshev’s approximation algorithms and applications. Computers
and Mathematics with Applications, 41(3–4), 433–445.
8.
Dennis, J. E., & Schnabel, R. B. (2013). Numerical methods
for unconstrained optimization and nonlinear equations. Prentice Hall.
9.
Ben-Israel, A. (2017).
Newton’s method with modified functions. Contemporary Mathematics, 204, 39-50.
10.
El-Ajou, A., Arqub, O. A.,
& Momani, S. (2015). Approximate analytical solution of the nonlinear
fractional KdV–Burgers equation: a new iterative algorithm. Journal of Computational
Physics, 293, 81-95.
11.
Gerald, F. C., & Wheatley, P. O. (2008). Applied numerical
analysis. Pearson Education.
12.
Gerlach, J. (2014).
Accelerated convergence in Newton’s method. SIAM Review, 36(2),
272–276.
13.
El-Ajou, A.,
Arqub, O. A., Momani, S., Baleanu, D., & Alsaedi, A. (2015). A novel
expansion iterative method for solving linear partial differential equations of
fractional order. Applied Mathematics and Computation, 257,
119-133.
14.
Adomian, G. (2014). Solving frontier problems of physics: The
decomposition method. Kluwer Academic Publishers.
15.
Joshi, M. C., & Moudgalya, K. M. (2004). Optimization
theory and practice. Narosa Publishing House.