Formation and examination of a few new iterative methods for nonlinear equation numerical solutions

 

Rohit Kumar Pandey1*, Dr. Jaya Kushwah2

1 Research Scholar, Dept. of Mathematics, Sardar Patel University Balaghat M.P.

rkpandeyjii@gmail.com

2 Associate Professor, Sardar Patel University, Balaghat, M.P.

Abstract

The extensive use of numerical solutions to nonlinear equations in fields as diverse as physics, engineering, economics, and more makes this a vital topic in scientific computing and applied mathematics. Although efficient, classical methods such as Newton-Raphson and Secant methods have drawbacks include being sensitive to initial estimations, diverging in some circumstances, and requiring more computing power to evaluate derivatives. Our goal in this research was to find better iterative solutions to nonlinear equations of the type f(x)=0 that are more resilient and have a faster convergence rate. We came up with a few new approaches and tested them out. By adding weight functions and higher-order correction terms to conventional schemes, the newly created approaches are able to adapt and expand upon them. Proofs of convergence order, error estimates, and stability criteria are all part of the extensive theoretical analysis that is given. These techniques improve computing efficiency by achieving higher-order convergence (up to fourth and sixth order) without increasing the number of function or derivative evaluations each iteration. The performance of the suggested approaches is compared to current techniques through a series of numerical tests performed on benchmark nonlinear equations. These novel iterative approaches show good promise as instruments for addressing complicated nonlinear problems in real-world applications, thanks to their enhanced convergence behaviour.

Keywords: Formation, Iterative Methods, Nonlinear Equation, Numerical Solutions

1. INTRODUCTION

When describing occurrences on a grand scale in terms of physical phenomena, such as laws of nature, mathematical models showing relationships between variables are frequently employed. It is essential to obtain the solution of a model in order to investigate the associated physical process and predict its future evolution. It is typically not feasible to obtain the exact or analytical form of answers due to the inherent non-linearity among the variables in the developed models. The study of nonlinear issues presents a greater challenge compared to linear ones. It should be noted that algebraic solutions are usually not discovered for polynomial equations of arbitrary type with coefficients of degree five or above, as stated in the Abel-Ruffini theorem. But remember that the precise responses aren't always what you desire. Just by numerically approximating the answers to a certain precision, we can gain a deeper grasp of the underlying physical process (Acton, F. S. 2020).

Numerical analysis is a subfield of mathematics concerned with determining optimal numerical solutions to a wide range of issues. The usual method for solving nonlinear equations is to use the concept of iterative theory. In mathematics, iterative techniques are procedures that, given a sufficiently accurate initial estimate, provide successive estimates that, taken together, may converge to a solution. Convergence of generated sequences and the construction of iterative algorithms are the principal foci of mathematical analysis. It is critical to develop fast and efficient iterative algorithms since the complexity of modern mathematical problems is increasing exponentially. Univariate nonlinear equations or multivariate systems of nonlinear equations might provide a mathematical representation of these problems. Using numerical approaches, we can solve the following nonlinear problems (Canale, R. P. 2006):

v   Isentropic supersonic flow problem

Waves compressing and expanding in flow channels are common occurrences in fluid dynamics. A differential equation governs the examination of the uniform two-dimensional isentropic supersonic fluid flow around a sharp convex corner of a frictionless surface:

In where δ is the specific heat ratio of the fluid, M is the Mach number, and δ is the flow deflection angle. From an initial Mach number M1 to a final Mach number M2, the flow deflection angle δ that expands the fluid may be obtained by integrating the above equation (Li, W. 2006).

Where   Given the values of M1, γ, and δ, the estimate of M2 is a specific nonlinear issue in this scenario. Nevertheless, it appears that solving this problem algebraically is not possible in this case.

v   Three-body problem under gravitational field

For a tiny mass object (m) subjected to the gravitational pull of two large bodies (M1 and M2) in orbit around it, the Lagrange points in space represent the sites of equilibrium in the field of celestial mechanics. Lagrange points are locations where the centripetal force needed to propel a tiny object around its orbit is exactly equivalent to the gravitational forces exerted by two big masses. As an example, a satellite can maintain a synchronous orbit with Earth by harnessing the centripetal force provided by the Sun and Earth's gravitational pull. Equation that follows from this model is (Basu, D. 2008).:

The situation where G stands for the gravitational constant, R for the distance between Earth and the Sun, r for the distance between the satellite and Earth, ϋ for the angular velocity, and M1, M2, and m for the masses of the Sun, Earth, and the satellite, respectively. Then, using the Kepler’s third law:  the given problem leads to solve the following quintic polynomial equation:

                                      (1)

Where c1 = ±(M1 + M2), c2 = ±3(M1 + M2)R, c3 = ±3(M1 + M2)R2, c4 = (M2 M2)R3, c5 = 2M2R4, c6 = M2R5. Two distinct solutions to the provided problem are represented by the +/- symbol in this case. The Lagrange points can be determined by calculating the quintic equation (1) given the specific values of the masses M1, M2, m, R, and r, as well as the angular velocity ό.

2. SOME BASIC CONCEPTS

The basic concepts that are relevant to this study are presented here.

2.1 Root or zero

A given function is denoted as f: R → R. A root of an equation is a number that, when multiplied by itself, gives the equation f(x) = 0. To rephrase, the zero of the function f(x) is x = x. When the x-axis intersects the graph of y= f(x) on the xy-plane, we say that this point is a geometric zero of f(x). Rather than limiting oneself to the domain of real numbers, the concept of root or zero may be easily extended to functions defined over the complex domain C (Barati, A. 2007).

2.2 Multiple zeros

A solution  is a zero of multiplicity m of f(x) if, for   where  In particular, if the given function is m times continuously differentiable in the interval  then it has a zero of multiplicity m at x (a,b) if and only if

If m = 1, the given zero is particularly called the simple zero.

2.3 Intermediate value property

Assume that is a continuous function on the interval [a,b] for every x. Refer to Figure 1 for reference, and find an integer s such that f(a) < s < f(b). Then, for any x in the range [a,b], f(x) equals s. One important consequence is that it confirms the presence of a zero of f(x) in the interval [a,b] if the condition is true for s = 0 (Hernández, M. A. 2000).

Figure 1: Intermediate value property

2.4 Fixed point iteration

The function's fixed point is a numerical value denoted as x*.  The application of a mapping f to x* results in x* being mapped onto it. It should be borne in mind realize that locating a function's zeros is analogous to locating the fixed points of a related function. Specifically, let's say that φ is the function that provides the solutions to the equation f(x) = 0, where φ(x) = x − f(x). From that point on, the zero x of f(x) is determined to be the fixed point of χ(x). Consider any point x0. Afterwards, the technique for fixed-point iteration is stated as follows:

where φ is called iteration function. This finding proves that there are enough variables for an iterative sequence to converge.

In accordance with Theorem 1.2.1, let x be the interval I value of the zero of the function f(x). In the set I, there are continuous functions φ(x) and φ 0 (x) where the definition of φ(x) is such that x = φ(x) is equivalent to f(x) = 0. In the case when every integer x in I has an absolute value less than 1, the sequence of approximations {xk}k≥0 will eventually converge to the root x, provided that x0 is an integer (Schnabel, R. B. 2013).

2.5 Order of convergence

An iterative sequence  "converges" to an element x* of rank p R + when

for all values of c > 0. The asymptotic error constant is the name given to the constant c in this context. Linear convergence, quadratic convergence, and cubic convergence are terms used to describe the aforementioned sequence for p=1, 2, and 3, respectively. Furthermore, let  be the errors in corresponding iterations, then the following relation:

known as the error equation. Keep in mind that finding the error equation is the first step in figuring out the order of convergence, p. E. Schroder offers an alternative definition of order of convergence. An iteration function φ(x), corresponding to the iterative scheme   is said to possess convergence order p Z +, if the sequence  converges to  and the following holds:

Here,  will be the asymptotic error constant.

3. DETERMINATION OF INITIAL APPROXIMATION

The convergence of iterative techniques is strongly affected by the initial approximation of the solution, so determining it is of the utmost importance. Below are some of the methods that may be used to choose good first estimations (Ben-Israel, A. 2017).

3.1 Geometrical approach

By drawing a graph of the function f(x) and finding its intersection with the x-axis, we may obtain an approximation of the solution to the equation f(x) = 0. Here we get a very approximate root, as this is the value at which f(x) = 0. The lack of precision makes graphical methods for computing the roots almost useless. On the other hand, numerical approaches can use the preliminary estimates from graphical methods as their starting points. The bisection method and other bracketing techniques can be used to an interval that contains the answer.

3.2 Trial and error approach

Here, we take a wild guess at x and see if our estimated root can be better estimated by seeing whether f(x) is near zero. If not, we make another wild guess and see if our updated root estimate holds up. The method is iterated until an estimate is made that yields a f(x) value around zero.

3.3 Incremental search approach

Beginning at one end of the region of interest, this approach evaluates functions at tiny intervals across the region. It is presumed that a root is within the increment whenever the function's sign changes. As a starting point for the bracketing methods, the values of x at the start and end of increment would be useful.

3.4 Sigmoid function approach

Took a time-saving technique to choose a first guess that was close enough to the target solution. Let us pretend that there is a single solution to the function f(x) on the interval [a,b] and that it is continuous on that interval. The function  is called the sigmoid transform of the function f(x) for some β > 0. Then, for the sufficiently large values of β, the following formula provides approximation to the solution of f(x) = 0 as (Momani, S. 2015):

                                                  (3.1)

Where f(a)f(b) < 0 and sgn(t) is the signum function which is defined by

In equation (3.1), any rule for numerical integration can be used to evaluate the integral.

4. IM FOR MULTIPLE ROOTS

A solution x of f(x) = 0 is a root of multiplicity m if and only if

Then f(x) can be expressed as

                                                                                    (4.1)

Where g(x) is a continuous function such that g (x) 6= 0.

The numerical methods for locating multiple zeros of functions can be broadly classified in two categories in accordance with:

1.               Multiplicity ‘m’ is known,

2.               Multiplicity ‘m’ is not known.

Case (i): When m is known, the well-known Newton method (1.3.2) generates a sequence of iterates  for suitably chosen initial approximation x0, wherein the error equation is given by

                                                                          (4.2)

Where ek = xk −x is the error at k th iteration. For m > 1, the sequence {xk} converges to x with the rate of a geometric progression with common ratio  This slows down the convergence order of Newton method from quadratic to linear. To retain the quadratic convergence of Newton method, proposed ¨ a modified Newton method given by.

                                                        (4.3)

The method (1.8.3) converges quadratically for all m N.

Case (ii): When m is not known, demonstrated that Newton method as well as secant method converge linearly for the case of multiple roots. They suggested to define a new function,

Since,  , therefore u(x) can be expressed as

                               (4.4)

Where  Since, g(x ) ≠ 0, it directly follows that

Hence, x* is a straightforward zero of u. All the concepts pertaining to simple zeroes may be used if u and its derivatives are substituted for f and its derivatives in any IM. Take the Newton technique as an example; when we swap out f for u, we get

                             (4.5)

With respect to the number of zeros in f, the scheme given by (1.8.5) will always be quadratically convergent. An extra assessment of f 00 results in a more arduous technique for computing the iterates, which is the single theoretical shortcoming of this method. Using the weight function approach, researchers have recently constructed several higher Order IM for many roots. In the most basic scenario, think of a one-point Newton-like approach with an extra weight function φ functioning as (Wheatley, P. O. 2008).

                                           (4.6)

Where the function φ is directly dependent on the variable m. The highest possible order may be achieved by evaluating the criteria for the function φ in the context of procedure (1.8.6). Using the right weight functions in the next stages to create the higher-order methods, the given IM may be easily expanded to a multipoint methodology.

5. FRACTIONAL IM

Using operators with derivatives of any order, fractional calculus expands the idea of elementary calculus to a broader theoretical framework. More accurately describing the spatial non-locality and temporal history of complex physical events are models based on fractional derivatives than standard mathematics. A new field of study is the use of fractional calculus to iteratively solve nonlinear equations. The iterative methods that are created by using the idea of fractional order derivatives are collectively referred to as fractional IM.

5.1 Caputo and Riemann-Liouville fractional derivatives

Several definitions of the fractional order derivative may be found in published works. Among the most common categories of explanations are:

1.            Caputo fractional derivative,

2.            Riemann-Liouville fractional derivative.

For given a R and the function f : [a,∞) → R, the fractional derivative of f of order α R which is denoted as Fα and is defined as follows:

where n is an integer such that n−1 < α ≤ n. Apart from these, some other definitions are also available in literature. However, defined in any manner, all of these might not lead to the same results even for the sufficiently smooth functions. Indeed, for a constant function f(x) = c, it turns out that

Recent years have seen the development of iterative approaches that use the Caputo or Riemann-Liouville derivatives in place of the conventional derivatives. Nevertheless, to fully resolve the provided integrals, it is necessary to evaluate special functions such as Gamma and Mittag-Leffler functions. Therefore, solving nonlinear equations efficiently using these approaches is not only wasteful, but may be impossible in some instances.

5.2 Conformable fractional derivative

Put forward a well-behaved and straightforward definition, the conformable fractional derivative. A function f: [a,∞) → R is defined as the conformable derivative of order α for all a R and α (0,1).

                                          (5.1)

The function f is referred to as α-differentiable if the provided limit exists. Take note that, specifically for α = 1, Eq. (6.1) simplifies to the definition of ordinary derivative. Notably, unlike the Caputo and Riemann-Liouville derivatives, conformable derivatives inherit most of the fundamental features of ordinary derivatives, such as the product rule, quotient rule, chain rule, mean value theorems, etc (Gerlach, J. 2014).

Recent work has suggested an iterative strategy for determining the zeros of function f(x) using the notion of conformable fractional derivative (1.9.1). Specifically, the scheme may be expressed as follows for each α (0,1] and a R:

                             (5.2)

On which f may be expressed as a function that is α-differentiable. Keep in mind that, when α = 1, the Newton technique is reduced to the formulation (6.2). Here we lay out the circumstances in which the proposed method yields the quadratic convergence order (Sharma, 2007).

6. SYSTEMS OF NONLINEAR EQUATIONS

Here we have a generic system of n equations with n unknowns, written in the form,

                                                                                 (6.1)

as fi maps a vector (x1, x2,..., xn)T from the n-dimensional space R n onto the real line R, where 1 ≤ i < n. If there is a nonlinear function in the system, we say that it is nonlinear. By using the compact notation (f1, f2,..., fn) T, we can define a vector function F that maps R n into R n and use it to describe the system of n equations with n unknowns.

Thus, system (1.10.1) reduces to the form,

                                                                                                               (6.2)

Where x = (x1, x2,..., xn) T and the functions f1(x), f2(x),..., fn(x) represent the coordinate functions of F(x). The vector x = (x 1 , x 2 ,..., x n ) T which satisfies,

Is known as the solution or root of the equation F(x) = 0. Keep in mind that the domain R n is just used for simplicity's sake, but the concept may be generalized to the case of complex domain Cn in a similar way.

6.1 Differentiation of vector-valued functions

For all natural numbers from 1 to n, let L (R n ) be the space of bounded linear mappings. The following matrix form is given for the first Frechet's 1 derivative of F(x), which is a linear operator F 0 (x) L (R n), for every x R n.

                                                      (6.3)

Where  is the partial derivative of fi at x, evaluated with regard to x j. The Jacobian matrix is the usual name for the integral of F0 (x) in Equation (7.3).

Typically, the k-linear function, described below, is the k-th derivative of F at x R n.

Explanation : An open convex set Ω is defined by F : Ω R n → R n, where F is a sufficiently differentiable function. A k-linear function F(k) (x) is the kth Frechet derivative at x' , such that the following holds for any vectors :

1.           

2.           

3.             for any permutation

In view of the above definition, the following notation can be used for convenience:

1.           

2.             for any v R n and p N.

6.2 Generalized Taylor series

The proposed theorem defines the generalized Taylor series expansion of a vector-valued function that is sufficiently differentiable:

Assumption: The mapping F: Ω R n → R n is k-times, let's pretend. If the function is Frechet differentiable in a closed convex set Ω R n, then the given outcome is true for every x, t Ω.

Observe that the term  for each i, is well-defined in view of the Definition.

6.3 Generalized divided difference

You may apply the generalized divided difference for vector-valued functions F: Ω R n → R n to the case of scalar-valued functions, as stated in Sec. 1.2.9. The operator for divided differences is described as a mapping F[·,·]: Ω×Ω → L (R n ).

                                     (6.4)

Moreover, if F is Frechet-differentiable, then the first-order divided difference can be ´ defined as (Alsaedi, A. 2015).

                               (6.5)

However, for the computational purposes, the divided difference operator F[x, y] can be considered as the approximation of Jacobian matrix F 0 (x), as the operator which is being defined as the divided difference matrix with elements,

              (6.6)

Wherein

7. CONCLUSION

All things considered, a major step forward in computational mathematics has been the development and study of novel iterative methods for numerically solving nonlinear equations. The goal of these methods is to make the convergence process faster, more stable, and more computationally efficient than the Newton-Raphson and Secant methods, which are traditional approaches. The recently suggested approaches frequently use hybrid systems, adaptive parameters, or higher-order derivatives to improve accuracy without drastically raising computing cost. Theoretical research and numerical tests have proven that these approaches converge quicker with fewer iterations than traditional methods, especially for equations that are sensitive to initial assumptions or derivative assessment, and for which standard methods fail. Convergence features, including convergence order and error bounds, have been studied and verified to be resilient and applicable to a wide variety of nonlinear situations. Applying such sophisticated methods can be especially useful in domains where nonlinear models are common, such as engineering, physics, and the applied sciences.

References

1.                  Acton, F. S. (2020). Numerical methods that work (Vol. 2). American Mathematical Soc.

2.                  Chapra, S. C., & Canale, R. P. (2006). Numerical methods for engineers. Tata McGraw-Hill.

3.                  Chen, J., & Li, W. (2006). On new exponential quadratically convergent iterative formulae. Applied Mathematics and Computation, 180(1), 242-246.

4.                  Basu, D. (2008). From third to fourth-order variant of Newton’s method for simple roots. Applied Mathematics and Computation, 202(2), 886–892.

5.                  Darvishi, M. T., & Barati, A. (2007). A third-order Newton-type method to solve system of nonlinear equations. Applied Mathematics and Computation, 187(2), 630–635.

6.                  Hernández, M. A. (2000). Second-derivative-free variant of Chebyshev’s method for nonlinear equations. Journal of Optimization Theory and Applications, 104(3), 501–515.

7.                  Hernández, M. A. (2001). Chebyshev’s approximation algorithms and applications. Computers and Mathematics with Applications, 41(3–4), 433–445.

8.                  Dennis, J. E., & Schnabel, R. B. (2013). Numerical methods for unconstrained optimization and nonlinear equations. Prentice Hall.

9.                  Ben-Israel, A. (2017). Newton’s method with modified functions. Contemporary Mathematics, 204, 39-50.

10.              El-Ajou, A., Arqub, O. A., & Momani, S. (2015). Approximate analytical solution of the nonlinear fractional KdV–Burgers equation: a new iterative algorithm. Journal of Computational Physics, 293, 81-95.

11.              Gerald, F. C., & Wheatley, P. O. (2008). Applied numerical analysis. Pearson Education.

12.              Gerlach, J. (2014). Accelerated convergence in Newton’s method. SIAM Review, 36(2), 272–276.

13.              El-Ajou, A., Arqub, O. A., Momani, S., Baleanu, D., & Alsaedi, A. (2015). A novel expansion iterative method for solving linear partial differential equations of fractional order. Applied Mathematics and Computation257, 119-133.

14.              Adomian, G. (2014). Solving frontier problems of physics: The decomposition method. Kluwer Academic Publishers.

15.              Joshi, M. C., & Moudgalya, K. M. (2004). Optimization theory and practice. Narosa Publishing House.