Module

for

The Improved Newton Method

     

MATHEMATICAL EXPOSITION
An Improved Newton's Method
by
John H. Mathews
The AMATYC Review, Vol. 10, No. 2, Spring, 1989, pp. 9-14.

Introduction

    
Newton's method is used to locate roots of the equation  [Graphics:Images/NewtonImprovedMod_gr_1.gif].  The Newton-Raphson iteration formula is:
(1)        [Graphics:Images/NewtonImprovedMod_gr_2.gif]

Given a starting value  [Graphics:Images/NewtonImprovedMod_gr_3.gif],   the sequence  [Graphics:Images/NewtonImprovedMod_gr_4.gif]  is computed using:  

(2)        [Graphics:Images/NewtonImprovedMod_gr_5.gif]    for    [Graphics:Images/NewtonImprovedMod_gr_6.gif]

provided that  [Graphics:Images/NewtonImprovedMod_gr_7.gif].  

Computer Programs  Newton-Raphson Method  Newton-Raphson Method  

Mathematica Subroutine (Newton-Raphson Iteration).

[Graphics:Images/NewtonImprovedMod_gr_8.gif]

    If the value  [Graphics:Images/NewtonImprovedMod_gr_9.gif]  is chosen close enough to the root p, then the sequence generated in (2) will converge to the root p.  Sometimes the speed at which  [Graphics:Images/NewtonImprovedMod_gr_10.gif]  converges is fast (quadratic) and at other times it is slow (linear).  To distinguish these two cases we make the following definitions.

 

Definition 1 (Order of Convergence)  Assume that  [Graphics:Images/NewtonImprovedMod_gr_11.gif] converges to  p,  and set  [Graphics:Images/NewtonImprovedMod_gr_12.gif].    If two positive constants  [Graphics:Images/NewtonImprovedMod_gr_13.gif]  exist, and  

    
[Graphics:Images/NewtonImprovedMod_gr_14.gif]

then the sequence is said to converge to  p  with
order of convergence R.  The number  A  is called the asymptotic error constant.  The cases  [Graphics:Images/NewtonImprovedMod_gr_15.gif]  are given special  consideration.

(3)      If  [Graphics:Images/NewtonImprovedMod_gr_16.gif], and    [Graphics:Images/NewtonImprovedMod_gr_17.gif]  

       then the convergence of  
[Graphics:Images/NewtonImprovedMod_gr_18.gif]  is called quadratic.

(4)       If  [Graphics:Images/NewtonImprovedMod_gr_19.gif], and    [Graphics:Images/NewtonImprovedMod_gr_20.gif]  

       then the convergence of  
[Graphics:Images/NewtonImprovedMod_gr_21.gif]  is called linear.

The mathematical characteristic for determining which case occurs is the "multiplicity" of the root p.

 

Definition 2 (Order of a Root)  If  [Graphics:Images/NewtonImprovedMod_gr_22.gif]  can be factored as

(5)        [Graphics:Images/NewtonImprovedMod_gr_23.gif]    where  m  is a positive integer,

and  
[Graphics:Images/NewtonImprovedMod_gr_24.gif]  is continuous at  [Graphics:Images/NewtonImprovedMod_gr_25.gif]  and  [Graphics:Images/NewtonImprovedMod_gr_26.gif],  then we say that  [Graphics:Images/NewtonImprovedMod_gr_27.gif]  has a root of order  m  at  [Graphics:Images/NewtonImprovedMod_gr_28.gif].

A root of order  
[Graphics:Images/NewtonImprovedMod_gr_29.gif]  is often called a simple root, and if  [Graphics:Images/NewtonImprovedMod_gr_30.gif]  it is called a multiple root.  A root of order  [Graphics:Images/NewtonImprovedMod_gr_31.gif]  is sometimes called a double root, and so on.  

 

Theorem 1 (Convergence Rate for Newton-Raphson Iteration)  Assume that Newton-Raphson iteration (2) produces a sequence  [Graphics:Images/NewtonImprovedMod_gr_32.gif] that converges to the root  p  of the function  [Graphics:Images/NewtonImprovedMod_gr_33.gif].  

(6)        If  p  is a simple root, then convergence is quadratic and   

        
[Graphics:Images/NewtonImprovedMod_gr_34.gif]  for  k  sufficiently large.

(7)        If  p  is a multiple root of order  m,  then convergence is linear and  
        
        
[Graphics:Images/NewtonImprovedMod_gr_35.gif]  for  k  sufficiently large.

Proof  Newton-Raphson Method  Newton-Raphson Method  

 

    There are two common ways to use Theorem 1 and gain quadratic convergence at multiple roots. We shall call these methods A and B (see Mathews, 1987, p. 72 and Ralston & Rabinowitz, 1978, pp. 353-356).  

 

Method A.  Accelerated Newton-Raphson

    Suppose that  p  is a root of order  [Graphics:Images/NewtonImprovedMod_gr_36.gif] .  Then the accelerated Newton-Raphson formula is:  

(8)        [Graphics:Images/NewtonImprovedMod_gr_37.gif].  
    
Let the starting value  
[Graphics:Images/NewtonImprovedMod_gr_38.gif]  be close to  p, and compute the sequence  [Graphics:Images/NewtonImprovedMod_gr_39.gif]  iteratively;  

(9)        [Graphics:Images/NewtonImprovedMod_gr_40.gif]    for    [Graphics:Images/NewtonImprovedMod_gr_41.gif]  

Then the sequence generated by (9) will converge quadratically to  p.  

Proof  Accelerated & Modified Newton-Raphson  Accelerated & Modified Newton-Raphson  

 

Computer Programs  Accelerated & Modified Newton-Raphson  Accelerated & Modified Newton-Raphson  

Mathematica Subroutine (Accelerated Newton-Raphson Iteration).

[Graphics:Images/NewtonImprovedMod_gr_42.gif]

    On the other hand, if  [Graphics:Images/NewtonImprovedMod_gr_43.gif]  then one can show that the function  [Graphics:Images/NewtonImprovedMod_gr_44.gif]  has a simple root at  [Graphics:Images/NewtonImprovedMod_gr_45.gif].  
Derivation

 

    Using  [Graphics:Images/NewtonImprovedMod_gr_48.gif]  in place of  [Graphics:Images/NewtonImprovedMod_gr_49.gif]  in formula (1) yields Method B.

 

Method B.  Modified Newton-Raphson

    Suppose that  p  is a root of order  
[Graphics:Images/NewtonImprovedMod_gr_50.gif] .  Then the modified Newton-Raphson formula is:  
    
(10)        [Graphics:Images/NewtonImprovedMod_gr_51.gif]  

Derivation

 

Let the starting value  [Graphics:Images/NewtonImprovedMod_gr_54.gif]  be close to  p, and compute the sequence  [Graphics:Images/NewtonImprovedMod_gr_55.gif]  iteratively;  

(11)        [Graphics:Images/NewtonImprovedMod_gr_56.gif]    for    [Graphics:Images/NewtonImprovedMod_gr_57.gif]  

Then the sequence generated by (11) converges quadratically to  p.  

Proof  Accelerated & Modified Newton-Raphson  Accelerated & Modified Newton-Raphson  

 

Computer Programs  Accelerated & Modified Newton-Raphson  Accelerated & Modified Newton-Raphson  

Mathematica Subroutine (Modified Newton-Raphson Iteration).

[Graphics:Images/NewtonImprovedMod_gr_58.gif]

Limitations of Methods A and B

    Method A has the disadvantage that the order  m  of the root must be known a priori.  Determining  m  is often laborious because some type of mathematical analysis must be used.  It is usually found by looking at the values of the higher derivatives of  [Graphics:Images/NewtonImprovedMod_gr_59.gif].  That is,  [Graphics:Images/NewtonImprovedMod_gr_60.gif]  has a root of order  m  at  [Graphics:Images/NewtonImprovedMod_gr_61.gif]  if and only if  

(12)        [Graphics:Images/NewtonImprovedMod_gr_62.gif].
    
Dodes (1978, pp. 81-82) has observed that in practical problems it is unlikely that we will know the multiplicity.  However, a constant  
m  should be used in (8) to speed up convergence, and it should be chosen small enough so that  [Graphics:Images/NewtonImprovedMod_gr_63.gif]   does not shoot to the wrong side of  p.  Rice (1983, pp. 232-233) has suggested a way to empirically find  m.  If  [Graphics:Images/NewtonImprovedMod_gr_64.gif]  is a good approximation to  p  and [Graphics:Images/NewtonImprovedMod_gr_65.gif], and [Graphics:Images/NewtonImprovedMod_gr_66.gif] somewhat distant from  [Graphics:Images/NewtonImprovedMod_gr_67.gif]  then  m  can be determined by the calculation:

        
[Graphics:Images/NewtonImprovedMod_gr_68.gif].  

    Method B has a disadvantage, it involves three functions  [Graphics:Images/NewtonImprovedMod_gr_69.gif].  Again, the laborious task of finding the formula for  [Graphics:Images/NewtonImprovedMod_gr_70.gif]  could detract from using Method B.  Furthermore, Ralston and Rabinowitz (1978, pp. 353-356) have observed that  [Graphics:Images/NewtonImprovedMod_gr_71.gif]  will have poles at points where the zeros of  [Graphics:Images/NewtonImprovedMod_gr_72.gif]  are not roots of [Graphics:Images/NewtonImprovedMod_gr_73.gif].  Hence,  [Graphics:Images/NewtonImprovedMod_gr_74.gif]  may not be a continuous function.

 

The New Method C.  Adaptive Newton-Raphson

    The adaptive Newton-Raphson method incorporates a linear search method with formula (8).  Starting with  [Graphics:Images/NewtonImprovedMod_gr_75.gif],  the following values are computed:  

(13)        [Graphics:Images/NewtonImprovedMod_gr_76.gif]    for    [Graphics:Images/NewtonImprovedMod_gr_77.gif].  

Our task is to determine the value  
m  to use in formula (13), because it is not known a priori.  First, we take the derivative of  [Graphics:Images/NewtonImprovedMod_gr_78.gif]  in formula (5), and obtain:  

(14)        [Graphics:Images/NewtonImprovedMod_gr_79.gif]  

When (5) and (14) are substituted into formula (1) we have  
[Graphics:Images/NewtonImprovedMod_gr_80.gif]  which can be simplified to get  

        [Graphics:Images/NewtonImprovedMod_gr_81.gif].

This enables us to rewrite (13) as  

(15)        [Graphics:Images/NewtonImprovedMod_gr_82.gif].  

    We shall assume that the starting value  [Graphics:Images/NewtonImprovedMod_gr_83.gif]  is close enough to  p  so that  

(16)        [Graphics:Images/NewtonImprovedMod_gr_84.gif],    where    [Graphics:Images/NewtonImprovedMod_gr_85.gif].  

The iterates  [Graphics:Images/NewtonImprovedMod_gr_86.gif]  in (15) satisfy the following:  

(17)        [Graphics:Images/NewtonImprovedMod_gr_87.gif]    for    [Graphics:Images/NewtonImprovedMod_gr_88.gif]     

If we subtract  p  from both sides of (17) then  

        
[Graphics:Images/NewtonImprovedMod_gr_89.gif]

and the result after simplification is:  

(18)        [Graphics:Images/NewtonImprovedMod_gr_90.gif].  

Since  
[Graphics:Images/NewtonImprovedMod_gr_91.gif],  the iterates  [Graphics:Images/NewtonImprovedMod_gr_92.gif]  get closer to  p  as  j  goes from  [Graphics:Images/NewtonImprovedMod_gr_93.gif],  which is manifest by the inequalities:

(19)        [Graphics:Images/NewtonImprovedMod_gr_94.gif].  

The values  [Graphics:Images/NewtonImprovedMod_gr_95.gif]  are shown in Figure 1.  

    Notice that if the iteration (15) was continued for  
[Graphics:Images/NewtonImprovedMod_gr_96.gif]  then  [Graphics:Images/NewtonImprovedMod_gr_97.gif]  could be larger than  [Graphics:Images/NewtonImprovedMod_gr_98.gif].  This is proven by using the derivatives in (12) and the Taylor polynomial approximation of degree  m  for  [Graphics:Images/NewtonImprovedMod_gr_99.gif]   expanded about  [Graphics:Images/NewtonImprovedMod_gr_100.gif]:  

(20)        [Graphics:Images/NewtonImprovedMod_gr_101.gif].  
[Graphics:Images/NewtonImprovedMod_gr_102.gif]
Figure 1.     The values  [Graphics:Images/NewtonImprovedMod_gr_103.gif]  for the "linear search" obtained by using formula (15)  
                   near a "double root"  p  (of order  
[Graphics:Images/NewtonImprovedMod_gr_104.gif]).   Notice that  [Graphics:Images/NewtonImprovedMod_gr_105.gif].  

If  [Graphics:Images/NewtonImprovedMod_gr_106.gif]  is closer to  p  than  [Graphics:Images/NewtonImprovedMod_gr_107.gif]  then (19) and (20) imply that  [Graphics:Images/NewtonImprovedMod_gr_108.gif],   hence we have:  

(21)        [Graphics:Images/NewtonImprovedMod_gr_109.gif].  

Therefore, the way to computationally determine  
m  is to successively compute the values  [Graphics:Images/NewtonImprovedMod_gr_110.gif]  using formula (13) for  [Graphics:Images/NewtonImprovedMod_gr_111.gif]  until we arrive at  [Graphics:Images/NewtonImprovedMod_gr_112.gif].

 

The New Adaptive Newton-Raphson Algorithm

Start with  
[Graphics:Images/NewtonImprovedMod_gr_113.gif],  then we determine the next approximation [Graphics:Images/NewtonImprovedMod_gr_114.gif]as follows:  

        
[Graphics:Images/NewtonImprovedMod_gr_115.gif]    

 

    Observe that the above iteration involves a linear search in either the interval  [Graphics:Images/NewtonImprovedMod_gr_116.gif]  when  [Graphics:Images/NewtonImprovedMod_gr_117.gif]  or in the interval   [Graphics:Images/NewtonImprovedMod_gr_118.gif]  when  [Graphics:Images/NewtonImprovedMod_gr_119.gif].   In the algorithm, the value  [Graphics:Images/NewtonImprovedMod_gr_120.gif]  is stored so that unnecessary computations are avoided.  After the point  [Graphics:Images/NewtonImprovedMod_gr_121.gif]  has been found, it should replace  [Graphics:Images/NewtonImprovedMod_gr_122.gif]  and the process is repeated.  

Mathematica Subroutine (Adaptive Newton-Raphson Iteration).

[Graphics:Images/NewtonImprovedMod_gr_123.gif]

Example 1.  Use the value  [Graphics:Images/NewtonImprovedMod_gr_124.gif]  and compare Methods A,B and C for finding the double root  [Graphics:Images/NewtonImprovedMod_gr_125.gif]  of the equation  [Graphics:Images/NewtonImprovedMod_gr_126.gif].
Solution 1.

 

Behavior at a Triple Root

    When the function has a triple root, then one more iteration for the linear search in (13) is necessary.  The situation is shown in Figure 2.
[Graphics:Images/NewtonImprovedMod_gr_161.gif]
Figure 2.  The values  [Graphics:Images/NewtonImprovedMod_gr_162.gif]  for the "linear search" obtained by using formula (15)  
                   near a "triple root"  p  (of order  
[Graphics:Images/NewtonImprovedMod_gr_163.gif]).   Notice that  [Graphics:Images/NewtonImprovedMod_gr_164.gif].  

 

Example 2.  Use the value  [Graphics:Images/NewtonImprovedMod_gr_165.gif]  and compare Methods A,B and C for finding the triple root  [Graphics:Images/NewtonImprovedMod_gr_166.gif]  of the equation  [Graphics:Images/NewtonImprovedMod_gr_167.gif].
Solution 2.

 

Example 3.  Use the value  [Graphics:Images/NewtonImprovedMod_gr_191.gif]  and compare Methods A,B and C for finding the quadruple root  [Graphics:Images/NewtonImprovedMod_gr_192.gif]  of the equation  [Graphics:Images/NewtonImprovedMod_gr_193.gif].
Solution 3.

 

Acknowledgement

    This module uses Mathematica instead of Pascal, and the content is that of the article:

John Mathews, An Improved Newton's Method, The AMATYC Review, Vol. 10, No. 2, Spring, 1989, pp. 9-14.

 

References

1.  Dodes, 1. A.,  Numerical analysis for computer science, 1978, New York: North-Holland.
2.  Mathews, J. H.,  Numerical methods for computer science, engineering and mathematics, 1987, Englewood Cliffs, NJ: Prentice-Hall.
3.  Ralston, A. & Rabinowitz, P.,  A first course in numerical analysis, second ed., 1978, New York: McGraw-Hill.
4.  Rice, J. R.,  Numerical methods, software and analysis: IMSL reference edition, 1983, New York: McGraw-Hill.

 

Related Material

Animations ( Newton's Method  Newton's Method ).  

 

Research Experience for Undergraduates

Newton-Raphson Method  Newton-Raphson Method  Internet hyperlinks to web sites and a bibliography of articles.  

 

Download this Mathematica Notebook The Improved Newton Method

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

(c) John H. Mathews 2004