
2.
 Suppose
 A €
 R
nxn
 is
 banded
 with half-bandwidth
 p.
 Determine
 the
 exact
number
 of
 arithmetic operations required
 to
 factor
 A
 into
 LU.
Exercises
1.
 Suppose
 A 6
 R
nxn
.
 Determine
 the
 exact number
 of
 arithmetic operations
required
 for the
 computation
 of A = LU via
 Gaussian elimination.
 Further
count
 the
 number
 of
 operations required
 to
 compute
 L
-1
b
 and
 U~
1
L~
1
b.
Verify
 the
 results given
 in the
 text.
 The
 following
 formulas
 will
 be
 useful:
486
Chapter
 10.
 More about
 finite
 element methods
methods when
 n
 is
 very large, making Gaussian elimination
 too
 expensive.
In
 such
 a
 case,
 an
 iterative method must give
 a
 reasonable approximation
 in
much
 less
 than
 n
 iterations,
 or it
 also
 is too
 expensive.
 The CG
 algorithm
is
 useful
 precisely because
 it can
 give very good results
 in a
 relatively small
number
 of
 iterations.
The
 rate
 of
 convergence
 of CG is
 related
 to the
 condition number
 of A,
 which
is
 denned
 as the
 ratio
 of the
 largest eigenvalue
 of A to the
 smallest. When
 the
condition number
 is
 relatively small
 (that
 is,
 when
 A is
 well-conditioned),
 CG
 will
converge
 rapidly.
 The
 algorithm also works
 well
 when
 the
 eigenvalues
 of A are
clustered into
 a few
 groups.
 In
 this case, even
 if the
 largest eigenvalue
 is
 much
larger
 than
 the
 smallest,
 CG
 will
 perform
 well.
 The
 worst case
 for CG is a
 matrix
A
 whose eigenvalues
 are
 spread
 out
 over
 a
 wide range.
10.2.7
 Preconditioned
 CG
It is
 often
 possible
 to
 replace
 a
 matrix
 A
 with
 a
 related matrix whose eigenvalues
are
 clustered,
 and for
 which
 CG
 will
 converge quickly. This technique
 is
 called
preconditioning,
 and it
 requires
 that
 one find a
 matrix
 M
 (the
 preconditioned
 that
 is
somehow
 similar
 to A (in
 terms
 of its
 eigenvalues)
 but is
 much simpler
 to
 invert.
 At
each step
 of the
 preconditioned conjugate gradient (PCG) algorithm,
 it is
 necessary
to
 solve
 an
 equation
 of the
 form
 Mq = r.
Preconditioners
 can be
 found
 in
 many
 different
 ways,
 but
 most
 require
 an
intimate knowledge
 of the
 matrix
 A. For
 this reason, there
 are few
 general-purpose
methods.
 One
 method
 that
 is
 often
 used
 is to
 define
 a
 preconditioner
 from
 an
incomplete
 factorization
 of A. An
 incomplete factorization
 is a
 factorization (like
Cholesky)
 in
 which
 fill-in is
 limited
 by fiat.
 Another method
 for
 constructing
 pre-
conditioners
 is to
 replace
 A by a
 simpler matrix (perhaps arising
 from
 a
 simpler
PDE)
 that
 can be
 inverted
 by FFT
 methods.
486 
Chapter 10.  More 
about 
finite element methods 
methods when  n 
is 
very large,  making Gaussian elimination too expensive. 
In such a case, an iterative method must give a reasonable approximation in 
much less 
than 
n  iterations, or 
it 
also 
is 
too 
expensive.  The CG  algorithm 
is 
useful precisely because it can give  very good results in a relatively small 
number of iterations. 
The 
rate 
of convergence of CG 
is 
related to 
the 
condition number of A, which 
is  defined  as  the ratio of the largest eigenvalue of 
A 
to 
the 
smallest.  When the 
condition number 
is 
relatively small 
(that 
is, when A 
is 
well-conditioned), CG will 
converge  rapidly.  The algorithm also  works 
well 
when 
the 
eigenvalues  of A  are 
clustered into a 
few 
groups.  In this case,  even  if 
the 
largest eigenvalue 
is 
much 
larger 
than 
the 
smallest, CG will perform well.  The worst case for  CG 
is 
a matrix 
A  whose eigenvalues are spread out over a wide range. 
10.2.7  Preconditioned 
CG 
It 
is often possible 
to 
replace a matrix A  with a related matrix whose eigenvalues 
are clustered,  and for  which  CG  will  converge quickly.  This technique 
is 
called 
preconditioning, and it requires 
that 
one find a matrix M  (the preconditioner) 
that 
is 
somehow similar 
to 
A  (in terms of its eigenvalues) 
but 
is 
much simpler to invert.  At 
each step of 
the 
preconditioned conjugate gradient (PCG) algorithm, it is necessary 
to solve an equation of the form 
Mq 
= 
r. 
Preconditioners can be found  in many different  ways, 
but 
most require an 
intimate knowledge of the matrix 
A. 
For this reason, there are 
few 
general-purpose 
methods.  One  method 
that 
is  often  used 
is 
to define  a  preconditioner from  an 
incomplete factorization of 
A. 
An incomplete factorization 
is 
a factorization  (like 
Cholesky) in which fill-in is limited by fiat.  Another method for  constructing pre-
conditioners 
is 
to 
replace A  by  a simpler matrix (perhaps arising from  a simpler 
PDE) 
that 
can be inverted by 
FFT 
methods. 
Exercises 
1.  Suppose A  E 
Rnxn. 
Determine the exact number of arithmetic operations 
required for 
the 
computation of A  = 
LU 
via Gaussian elimination.  Further 
count 
the 
number of operations required 
to 
compute 
L-1b 
and 
U-1L-1b. 
Verify the results given in the text.  The following formulas will  be useful: 
ti = 
n(n 
+ 1), 
i=l 
2 
ti
2 
= 
n(n+1)(2n+1). 
i=l 
6 
2.  Suppose A  E 
Rnxn 
is banded with half-bandwidth p.  Determine 
the 
exact 
number of arithmetic operations required 
to 
factor A into L 
U.