Theorem (Newton-Raphson Theorem).  Assume that [Graphics:Images/Newton'sMethodProof_gr_7.gif] and there exists a number [Graphics:Images/Newton'sMethodProof_gr_8.gif], where [Graphics:Images/Newton'sMethodProof_gr_9.gif].  If   [Graphics:Images/Newton'sMethodProof_gr_10.gif], then there exists a [Graphics:Images/Newton'sMethodProof_gr_11.gif] such that the sequence [Graphics:Images/Newton'sMethodProof_gr_12.gif] defined by the iteration  

    [Graphics:Images/Newton'sMethodProof_gr_13.gif]    for    [Graphics:Images/Newton'sMethodProof_gr_14.gif]  

will converge to [Graphics:Images/Newton'sMethodProof_gr_15.gif] for any initial approximation  [Graphics:Images/Newton'sMethodProof_gr_16.gif].  

        

[Graphics:Images/Newton'sMethodProof_gr_17.gif]

Proof.

The geometric construction of  [Graphics:../Images/Newton'sMethodProof_gr_86.gif]  does not help in understanding why  [Graphics:../Images/Newton'sMethodProof_gr_87.gif]  needs to be close to  p  or why the continuity of  [Graphics:../Images/Newton'sMethodProof_gr_88.gif]  is essential.  
Our analysis starts the Taylor polynomial of degree n = 1 and its remainder term  

        
[Graphics:../Images/Newton'sMethodProof_gr_89.gif]  

where  c  lies somewhere between  [Graphics:../Images/Newton'sMethodProof_gr_90.gif]  and  x.  Substituting  x = p  into this equation and using the fact that  [Graphics:../Images/Newton'sMethodProof_gr_91.gif]  produces  

        
[Graphics:../Images/Newton'sMethodProof_gr_92.gif].  

If  [Graphics:../Images/Newton'sMethodProof_gr_93.gif]  is close enough to  p,  then the last term on the right side will be small compared to the sum of the first two terms.  Hence it can be neglected and we can use the approximation  

        
[Graphics:../Images/Newton'sMethodProof_gr_94.gif].  

Solving for  p  in this equation, we get  [Graphics:../Images/Newton'sMethodProof_gr_95.gif].    This is used to define the next approximation  [Graphics:../Images/Newton'sMethodProof_gr_96.gif]  to the root  

        
[Graphics:../Images/Newton'sMethodProof_gr_97.gif].  

When  [Graphics:../Images/Newton'sMethodProof_gr_98.gif]  is used in place of  [Graphics:../Images/Newton'sMethodProof_gr_99.gif]  in this equation, the general rule is established.  

        
[Graphics:../Images/Newton'sMethodProof_gr_100.gif].  

    For most applications this is all that needs to be understood.  However, to fully comprehend what is happening, we need to consider the fixed-point iteration theorem, and apply it in our situation.  The Newton-Raphson formula is

        
[Graphics:../Images/Newton'sMethodProof_gr_101.gif].  

Newton's method is merely fixed point iteration using the function  [Graphics:../Images/Newton'sMethodProof_gr_102.gif],  starting with [Graphics:../Images/Newton'sMethodProof_gr_103.gif] and using the iteration  

        [Graphics:../Images/Newton'sMethodProof_gr_104.gif].  

Let us recall the important theorem about fixed point iteration.

Theorem (Second Fixed Point Theorem). Assume that the following hypothesis hold true.  
(a)    [Graphics:../Images/Newton'sMethodProof_gr_105.gif] is a fixed point of a function [Graphics:../Images/Newton'sMethodProof_gr_106.gif],
(b)    [Graphics:../Images/Newton'sMethodProof_gr_107.gif],
(c)    [Graphics:../Images/Newton'sMethodProof_gr_108.gif] is a positive constant,
(d)    [Graphics:../Images/Newton'sMethodProof_gr_109.gif], and
(e)    [Graphics:../Images/Newton'sMethodProof_gr_110.gif]  for all  [Graphics:../Images/Newton'sMethodProof_gr_111.gif].  
Then we have the following conclusions.
(i).    If [Graphics:../Images/Newton'sMethodProof_gr_112.gif]  for all   [Graphics:../Images/Newton'sMethodProof_gr_113.gif],  then the iteration  [Graphics:../Images/Newton'sMethodProof_gr_114.gif]  will converge to the
    unique fixed point [Graphics:../Images/Newton'sMethodProof_gr_115.gif].  In this case, [Graphics:../Images/Newton'sMethodProof_gr_116.gif] is said to be an attractive fixed point.  
(ii).    If [Graphics:../Images/Newton'sMethodProof_gr_117.gif]  for all   [Graphics:../Images/Newton'sMethodProof_gr_118.gif],  then the iteration  [Graphics:../Images/Newton'sMethodProof_gr_119.gif]  will not converge to [Graphics:../Images/Newton'sMethodProof_gr_120.gif].  
    In this case, [Graphics:../Images/Newton'sMethodProof_gr_121.gif] is said to be a repelling fixed point and the iteration exhibits local divergence.  

The key in the proof for convergence in Newton's method is in the analysis of  [Graphics:../Images/Newton'sMethodProof_gr_122.gif]:  

        
[Graphics:../Images/Newton'sMethodProof_gr_123.gif].

By hypothesis,  [Graphics:../Images/Newton'sMethodProof_gr_124.gif],  thus  [Graphics:../Images/Newton'sMethodProof_gr_125.gif].  
Because  [Graphics:../Images/Newton'sMethodProof_gr_126.gif],  and  [Graphics:../Images/Newton'sMethodProof_gr_127.gif]  is continuous,  for any value [Graphics:../Images/Newton'sMethodProof_gr_128.gif],  with  [Graphics:../Images/Newton'sMethodProof_gr_129.gif]
it is possible to find a  [Graphics:../Images/Newton'sMethodProof_gr_130.gif]  so that the hypothesis  (i).  |[Graphics:../Images/Newton'sMethodProof_gr_131.gif],  is satisfied on  [Graphics:../Images/Newton'sMethodProof_gr_132.gif].  

Therefore, a sufficient condition for  
[Graphics:../Images/Newton'sMethodProof_gr_133.gif]  to initialize a convergent sequence  [Graphics:../Images/Newton'sMethodProof_gr_134.gif],  which converges to the root [Graphics:../Images/Newton'sMethodProof_gr_135.gif]  of  [Graphics:../Images/Newton'sMethodProof_gr_136.gif]  is that  [Graphics:../Images/Newton'sMethodProof_gr_137.gif]  and that  [Graphics:../Images/Newton'sMethodProof_gr_138.gif]  be chosen so that  

        
[Graphics:../Images/Newton'sMethodProof_gr_139.gif]  for all  [Graphics:../Images/Newton'sMethodProof_gr_140.gif].  

Q. E. D.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

(c) John H. Mathews 2004