Berkeley MATH 128A - Error Bounds Associated with Newton’s Iteration

Unformatted text preview:

ErrBnd Math. 128 B 28 Jan. 1997 Error Bounds Associated with Newton’s Iteration W. Kahan Math. Dept. Univ. of Calif. Berkeley CA 94720-3840 Suppose we seek a root z of the equation f(z) = o where f(x) is a continuously differentiable vector-valued function of the vector x in a normed space. Newton’s iteration replaces a guess x by x - f’(x)-1f(x) repeatedly in the hope of convergence to z . Is there some neighborhood around z within which iteration must converge to z ? And if convergence is assured, yet we cannot iterate forever; at some point we shall accept an approximate z0 despite f(z0) /= o . How close is z0 to z ? Claim 1: If f is twice differentiable about z , and if for ~~~~~~~~ some positive δ we find that || f’(x)-1f"(y) || < 2/δ as x and y range independently throughout a ball || x - z || < δ , then Newton’s iteration will converge to the root z from every starting point x in that ball. Moreover, convergence is at least quadratic. This claim merely reassures us that, under normal circumstances, Newton’s iteration will converge rapidly to a root z from any starting point close enough to z ; under normal circumstances f’(z)-1 does exist and f"(x) does stay bounded at all x near z , so a positive radius δ has to exist. Proof of Claim 1: According to Newton’s divided difference formula, f(z) = f(x) + f’(x)(z-x) + ∆|2f((z,x,x))(z-x)(z-x) where |∆2f is the second divided difference of f and, according to Hermite’s integral representation for divided differences, |∆2f((a,b,c)) = The uniformly weighted average of f"(y)/2 as y runs over the triangle whose vertices are a, b, c . If x lies in the ball ||x-z|| < δ , so does the degenerate triangle whose vertices are z, x, x , and therefore so does y , and therefore ||f’(x)-1f"(y)/2|| < 1/δ , and therefore ||f’(x)-1|∆2f((z,x,x))|| = || Average of f’(x)-1f"(y)/2 ...|| _< Average of ||f’(x)-1f"(y)/2|| ... < 1/δ . Newton’s iteration replaces x by New(x) = x - f’(x)-1f(x) ; by using Newton’s divided difference formula with f(z) = o we find New(x) - z = f’(x)-1|∆2f((z,x,x))(x-z)(x-z) , whence follows that ||New(x) - z||/δ _< ||f’(x)-1|∆2f((z,x,x))|| ||x-z||2/δ < (||x-z||/δ)2 . Therefore, starting the iteration xn+1 := New(xn) at any x0 in the ball leads to a sequence {xn} that stays in the ball and log(||xn+1-z||/δ) < 2 log(||xn-z||/δ) < ... < 2n+1 log(||x0-z||/δ) --> -∞ as n --> ∞ , as claimed. Who can wait until n --> ∞ ? Instead we shall accept some xn as good enough; call it z0 . How close is it to a desired root z ? 1ErrBnd Math. 128 B 28 Jan. 1997 Claim 2: If we can find some positive δ > || f’(x)-1f(z0) || ~~~~~~~~ throughout the ball || x - z0 || < δ , then at least one root z lies in that ball. Proof: Define a trajectory x = y(τ) by solving the initial value problem y(0) = z0 , d y(τ)/dτ = -f’(y(τ))-1f(y(τ)) for all τ _> 0 . Our first task is to discover for how large an interval 0 _< τ < T the trajectory stays inside the ball || y(τ) - z0 || < δ . We know that the trajectory exists inside the ball for some sufficiently small T > 0 ; let T be the largest value for which the trajectory stays inside the ball throughout 0 _< τ < T . Along the trajectory we find that d f(y(τ))/dτ = f’(y(τ)) dy(τ)/dτ = = f’(y(τ)) (-f’(y(τ))-1f(y(τ)) ) = -f(y(τ)) , whence follows that f(y(τ)) = e-τf(z0) . Therefore the length of the trajectory from τ = 0 up to τ = T is ∫oT || dy(τ)/dτ || dτ = ∫oT || f’(y(τ))-1f(y(τ)) || dτ = ∫oT || f’(y(τ))-1f(z0) || e-τ dτ < ∫oT δ e-τ dτ = (1 - e-T) δ < δ . If T were finite we would observe that || y(T) - z0 || could not exceed the length of the trajectory from τ = 0 to τ = T , so y(T) would have to lie strictly inside the ball, so T could not be the largest such value. Therefore the trajectory y(τ) lies in the ball for all τ _> 0 and has finite length less than the radius δ of the ball. Therefore a limit z exists inside the ball such that y(τ) --> z and f(y(τ)) = e-τf(z0) --> f(z) = o as τ --> ∞ . Thus is claim 2 proved. Comment 1 : The same conclusion could be drawn from a different hypothesis δ > ||f’(x)-1|| ||f(z0)|| . Besides being stronger (harder to satisfy), this hypothesis is affected by scaling; replacing f(x) by Lf(x) for any invertible linear operator L alters the stronger hypothesis but does not affect the root z , nor Newton’s iteration, nor the Claims proved above. And yet the stronger hypothesis is the one more often applied: Given a constant σ > ||f’(x)-1|| throughout a region known to be farther than δ from all roots other than z , we infer that ||z0-z|| < δ wherever ||f(z0)|| < δ/σ . Comment 2 : Claim 2 concerns "at least one root" instead of "just one root" because the ball may contain more than one root. For example, let x be a complex variable, so that ||x|| = |x| ; and let f(x) = 1 + exp(x) , so that z = _+ιπ . Choose δ = 2π and z0 = 3.13ι . Then f’(x)-1f(z0) = exp(-x)(1 + exp(3.13ι)) , so |f’(x)-1f(z0)| = 2 exp(-Re(x)) cos(1.565) .


View Full Document

Berkeley MATH 128A - Error Bounds Associated with Newton’s Iteration

Download Error Bounds Associated with Newton’s Iteration
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Error Bounds Associated with Newton’s Iteration and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Error Bounds Associated with Newton’s Iteration 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?