0% found this document useful (0 votes)
95 views97 pages

dòng chảy ricci

This honors thesis examines Hamilton's Ricci flow, a method for evolving the metric tensor of a Riemannian manifold to make it "rounder". The document provides an introduction to differential geometry concepts needed to understand the Ricci flow. It then introduces the Ricci flow equation and techniques for analyzing it, such as the maximum principle and derivative estimates. As an application, it proves Hamilton's 1982 theorem that closed 3-manifolds with positive Ricci curvature are spherical space forms. It concludes with a discussion of how the Ricci flow was used to prove the Poincare and Geometrization conjectures.

Uploaded by

Rin Tohsaka
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
95 views97 pages

dòng chảy ricci

This honors thesis examines Hamilton's Ricci flow, a method for evolving the metric tensor of a Riemannian manifold to make it "rounder". The document provides an introduction to differential geometry concepts needed to understand the Ricci flow. It then introduces the Ricci flow equation and techniques for analyzing it, such as the maximum principle and derivative estimates. As an application, it proves Hamilton's 1982 theorem that closed 3-manifolds with positive Ricci curvature are spherical space forms. It concludes with a discussion of how the Ricci flow was used to prove the Poincare and Geometrization conjectures.

Uploaded by

Rin Tohsaka
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 97

The University of Melbourne,

Department of Mathematics and Statistics

Hamilton’s Ricci Flow

Nick Sheridan

Supervisor: Associate Professor Craig Hodgson


Second Reader: Professor Hyam Rubinstein

Honours Thesis, November 2006.


Abstract

The aim of this project is to introduce the basics of Hamilton’s Ricci Flow. The Ricci flow is a pde
for evolving the metric tensor in a Riemannian manifold to make it “rounder”, in the hope that one
may draw topological conclusions from the existence of such “round” metrics. Indeed, the Ricci flow
has recently been used to prove two very deep theorems in topology, namely the Geometrization
and Poincaré Conjectures. We begin with a brief survey of the differential geometry that is needed
in the Ricci flow, then proceed to introduce its basic properties and the basic techniques used
to understand it, for example, proving existence and uniqueness and bounds on derivatives of
curvature under the Ricci flow using the maximum principle. We use these results to prove the
“original” Ricci flow theorem – the 1982 theorem of Richard Hamilton that closed 3-manifolds
which admit metrics of strictly positive Ricci curvature are diffeomorphic to quotients of the round
3-sphere by finite groups of isometries acting freely. We conclude with a qualitative discussion of
the ideas behind the proof of the Geometrization Conjecture using the Ricci flow.
Most of the project is based on the book by Chow and Knopf [6], the notes by Peter Topping
[28] (which have recently been made into a book, see [29]), the papers of Richard Hamilton (in
particular [9]) and the lecture course on Geometric Evolution Equations presented by Ben Andrews
at the 2006 ICE-EM Graduate School held at the University of Queensland. We have reformulated
and expanded the arguments contained in these references in some places. In particular, the proof
of Theorem 7.19 is original, based on a suggestion by Gerhard Huisken. We also diverge from the
existing references by emphasising the analogy between the techniques applied to the Ricci flow
and those applied to the curve-shortening flow, which we feel helps clarify the important ideas
behind the technical details of the Ricci flow. Chapter 6 is based on [6, Chap. 6, 7], but we have
significantly reformulated the material and elaborated on the proofs. We feel that our organization
is easier to follow than Chow and Knopf’s book. The attempt to motivate the compactness result
in Section 8.1 is also original.

1
Acknowledgements: Thanks to my supervisor Craig Hodgson for being so generous with
his time and for always explaining things in the best possible way. Thanks also to Ben Andrews
and Gerhard Huisken for the help they gave me at the ICE-EM Graduate School in July of this
year and afterwards, and thank you to ICE-EM and the University of Queensland for running this
excellent program.

2
Contents

1 Riemannian Geometry 6
1.1 Vectors, Tensors and Metrics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.2 The Covariant Derivative . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.3 The Lie Derivative . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.4 Curvature . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1.5 Topology from Geometry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
1.6 Scaling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
1.7 Time-evolving metrics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

2 Introduction to the Ricci Flow 24


2.1 The Prehistory of the Ricci Flow – Geometrization . . . . . . . . . . . . . . . . . . 24
2.2 Hamilton’s Ricci Flow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
2.3 Special Solutions of the Ricci Flow . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
2.4 Pinching . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28

3 The Maximum Principle 30


3.1 The Heat Equation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
3.2 A Scalar Maximum Principle on Manifolds . . . . . . . . . . . . . . . . . . . . . . . 31
3.3 A Maximum Principle for Vector Bundles . . . . . . . . . . . . . . . . . . . . . . . 33

4 Curve-Shortening Flow 36
4.1 Steepest Descent Flow for Length . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
4.2 Short Time Existence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
4.3 Finite Time Singularity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
4.4 Curvature Explodes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
4.5 Grayson’s Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40

5 Short Time Existence and Uniqueness of the Ricci Flow 42


5.1 The Linearization of the Ricci Tensor . . . . . . . . . . . . . . . . . . . . . . . . . 42
5.2 The DeTurck Trick . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43

6 Derivative Estimates and Curvature Explosion at Singularities 46


6.1 Evolution of Geometric Quantities Under the Ricci Flow . . . . . . . . . . . . . . . 46
6.2 Evolution Equations for Derivatives of Curvature . . . . . . . . . . . . . . . . . . . 47
6.3 The Bernstein-Bando-Shi Estimates . . . . . . . . . . . . . . . . . . . . . . . . . . 49
6.4 Curvature Explodes at Finite-time Singularities . . . . . . . . . . . . . . . . . . . . 51

7 3-Manifolds With Positive Ricci Curvature 58


7.1 The Plan of Attack . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
7.2 Existence and Finite-time Explosion of Curvature . . . . . . . . . . . . . . . . . . . 59
7.3 Setting the Scene for the Maximum Principle – The Uhlenbeck Trick . . . . . . . . 60
7.4 Local Curvature Pinching from the Maximum Principle . . . . . . . . . . . . . . . 63
7.5 Global Curvature Pinching . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
7.6 Normalized Ricci Flow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
7.7 Convergence of the Normalized Flow . . . . . . . . . . . . . . . . . . . . . . . . . . 68

3
7.8 Details . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70

8 Singularities in the Ricci Flow 82


8.1 Blowing Up at Singularities – Heuristics . . . . . . . . . . . . . . . . . . . . . . . . 82
8.2 Convergence of Manifolds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
8.3 Blowing Up at Singularities – Results . . . . . . . . . . . . . . . . . . . . . . . . . 85
8.4 Perelman’s F- and W-functionals . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87

A Existence Theory for Parabolic PDEs 90


A.1 Linear Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90
A.2 Linearization of Nonlinear PDEs . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91

B Differential Geometry Formulae 93

4
5
Chapter 1

Riemannian Geometry

We will sweep through the basics of Riemannian geometry in this chapter, with a focus on the
concepts that will be important for the Ricci flow later. Most proofs will be neglected for brevity, as
the main point of much of the chapter is to establish conventions. The particularly useful formulae
have been collected in Appendix B. We also give a flavour in Section 1.5 of how geometry can
be used to draw topological conclusions. The material in this chapter is presented in much more
detail in many texts – for example, [14] presents the basics of manifolds, tangent vectors, tensors
and the Lie derivative. The book [18] is an excellent place to learn the theory of curvature.

1.1 Vectors, Tensors and Metrics


A topological space Mn is an n-manifold if it looks like Euclidean space (Rn ) near each point.
The formal definition has some other technical conditions as well, to avoid certain pathologies that
may arise:
Definition 1.1. A topological space Mn is a topological n-manifold if:

1. For each p ∈ Mn there is an open neighbourhood U of p and a function ϕ : U → Rn that is a


homeomorphism onto an open subset of Rn . The pair (U, ϕ) is called a coordinate chart.
We will frequently write ϕ(q) = (x1 (q), x2 (q), . . . , xn (q)). These xi (q) are referred to as local
coordinates for Mn .

2. Mn is Hausdorff.
3. Mn is paracompact (see [26, Vol. I, App. A, Chap. 1] for a discussion of this condition).
We will usually write M for a generic manifold, and Mn for an n-manifold if the dimension
is of particular relevance. In this project we will deal with smooth manifolds, which have more
structure than topological manifolds. In order to define what a smooth manifold is, we must first
define the concept of a smooth function between subsets of Euclidean space.
Definition 1.2. A function f : U → Rm , where U is an open subset of Rn , is called smooth or
C ∞ if all of its partial derivatives exist and are continuous on U .
Now we can define the concept of a smooth manifold.

Definition 1.3. Given two coordinate charts (U, ϕ) and (V, ψ) on a manifold M, with U ∩ V 6= ∅,
we call the map ψ ◦ ϕ−1 : ϕ(U ∩ V ) → ψ(U ∩ V ) a transition map. Note that each transition map
is a homeomorphism from an open set in Rn to another open set in Rn . We make the definitions:
1. M is called smooth or C ∞ if all the transition maps are smooth.

2. M is orientable if all the transition maps are orientation-preserving.


It is now possible to define the the concept of a smooth map between manifolds.

6
Definition 1.4. Let f : M → N , where M and N are smooth manifolds. f is called smooth if,
for every pair of coordinate charts (U, ϕ) of M and (V, ψ) of N , the function

ψ ◦ f ◦ ϕ−1 : ϕ(U ∩ f −1 (V )) → ψ(f (U ) ∩ V )

is smooth.
As a special case we can set N = R, which has a natural smooth manifold structure. The set
of all smooth real-valued functions f : M → R is denoted C ∞ (M ).
Two topological manifolds are equivalent if they are homeomorphic. The notion of equivalence
on smooth manifolds is a bit more subtle.
Definition 1.5. Two smooth manifolds M, N are equivalent if there exists a smooth function
f : M → N which has a smooth inverse. We will call such a function f a diffeomorphism1 and
say that M and N are diffeomorphic.
In 3 dimensions, topological and smooth manifolds are essentially equivalent. That is, any
topological manifold is homeomorphic to a unique smooth manifold, and vice versa. This result
is not true for higher dimensions. However most of the results in this project will deal with 3-
manifolds, so it is of interest to note that we do not lose any generality by assuming that our
3-manifolds are smooth.
Now that we know what a manifold is, we would like to define a tangent vector to our manifold
M at a point p ∈ M.
Definition 1.6. A tangent vector to a smooth manifold M at a point p ∈ M is a derivation,
that is, an R-linear function X : C ∞ (M) → R satisfying the product rule:

X(f g) = X(f )g(p) + f (p)X(g).

The set of all tangent vectors to an n-manifold Mn at p forms an n-dimensional vector space
Tp Mn .
We note that this definition is related to the more intuitive notion of a tangent vector as a
velocity vector of a curve γ : (−, ) → M in the manifold. If γ(0) = p then we associate to the
velocity vector of γ at p the derivation X ∈ Tp M, where
d
X(f ) = f (γ(t))|t=0 .
dt
We then write X = γ̇(0).
If (xi ) is a local coordinate system about p on an n-manifold Mn , then the set of derivations
{∂/∂xi , i = 1, 2, . . . , n} forms a basis for Tp Mn . To avoid unseemly typesetting nightmares we
will often write ∂i for ∂/∂xi if there can be no confusion about the coordinate system being used.
The set of all tangent vectors at all points of Mn forms a (2n)-manifold known as the tangent
bundle, and denoted T Mn .
A vector field on a manifold M is a smoothly-varying choice of tangent vector at each point
p ∈ M. Here “smoothly-varying” means X(f ) ∈ C ∞ (M) for any f ∈ C ∞ (M).
We note in passing that there is some additional structure to T M on top of the vector space
structure on Tp M: given two vector fields X, Y on M, we can form their Lie bracket [X, Y ],
defined by
[X, Y ]f = X(Y (f )) − Y (X(f ))
(it turns out that the [X, Y ] defined in this way is a tangent vector in the sense of being a derivation
as described above, although this is not immediately obvious).
The example of tangent vectors is a specific case of a more general construction on a manifold,
known as a vector bundle. The idea is that one associates a vector space to each point of
the manifold M, then glues these vector spaces together so as to get a new, higher-dimensional
manifold.
1 A diffeomorphism is usually defined to be a differentiable map with a differentiable inverse. The distinction

is important when dealing with manifolds of varying levels of differentiability, but in this project we will deal
exclusively with smooth manifolds. Thus the slight abuse of terminology will not cause any confusion.

7
Definition 1.7. A k-dimensional vector bundle is a manifold E (the total space) together with a
manifold M (the base space) and a surjective map π : E → M (the projection) such that

1. For each p ∈ M, the set Ep := π −1 (p) (the fibre of E over p) has a k-dimensional vector
space structure.
2. For each p ∈ M there is an open neighbourhood U of p and a smooth diffeomorphism ϕ :
π −1 (U ) → U ×Rk (a local trivialization) such that ϕ takes each fibre Ep to the corresponding
fibre {p} × Rk by a linear isomorphism.

A section of E is a map F : M → E such that π ◦ F = IdM . The space of sections of E is denoted


C ∞ (E).
The tangent bundle is an n-dimensional vector bundle with base space M and projection defined
by π(X) = p if X ∈ Tp M. A vector field is a section of the tangent bundle.
Another important example of a vector bundle is the dual bundle to the tangent bundle, known
as the cotangent bundle, T ∗ M. The fibre Tp∗ M = (Tp M)∗ consists of all linear functionals acting
on the vector space Tp M (the covectors or 1-forms at p). Given a local coordinate system (xi ),
i = 1, . . . , n about p on an n-manifold Mn , the set of covectors {dxi , i = 1, 2, . . . , n} (where
dxi (X) := X(xi )) forms a basis for Tp∗ Mn .
This method of constructing new vector bundles from old can be generalized. Let V be the
category of finite-dimensional real vector spaces. Given vector bundles E 1 , E 2 , . . . , E k over M and
a covariant functor T : V × V × . . . × V → V , it is possible to form a unique vector bundle
E = T (E 1 , E 2 , . . . , E k ) over M having fibres Ep = T (Ep1 , Ep2 , . . . , Epk ) (see [20], Chapter 3). The
cotangent bundle arises in the case k = 1, E 1 = T M, and T (V ) = V ∗ .
In this way we can form the tensor product of vector bundles, by making T the tensor product
functor on vector spaces. We define a kl -tensor field to be a section of


k copies l copies
z }| { z }| {
∗ ∗ ∗
Tlk (M)
:= T M ⊗ T M ⊗ . . . ⊗ T M ⊗ T M ⊗ T M ⊗ . . . ⊗ T M .

Given a local coordinate system (xi ) about p ∈ M, we can express any kl -tensor field F in the


coordinate system as

F = Fij11...i
...jl
k
(p)∂j1 ⊗ . . . ⊗ ∂jl ⊗ dxi1 ⊗ . . . ⊗ dxik . (1.1)

In this equation we sum over each index jp , iq that is repeated twice, once raised and once lowered –
this is known as the Einstein summation convention. We will almost always use this coordinate
representation of tensors because it makes technical calculations easier a lot of the time, and we
will often write Fij11...i
...jl
k
when we mean F .
Definition 1.8. Given a map between manifolds φ : M → N , we define the derivative map
between the corresponding tangent spaces, φ∗ : Tp M → Tf (p) N by

(φ∗ V )(f ) = V (f ◦ φ)

for V ∈ Tp M and f ∈ C ∞ (N ). Defining φ∗ (A⊗B) := φ∗ (A)⊗φ∗ (B), we can extend this definition
to apply to all k0 -tensors.
In a similar way we can define the derivative map between the corresponding cotangent spaces,
φ∗ : Tf∗(p) N → Tp∗ M, by
(φ∗ ω)(V ) = ω(φ∗ V )
for V ∈ Tp M, ω ∈ Tf∗(p) N . By a similar method to above we can extend φ∗ to apply to all
k

0 -tensors.

Given a tensor Fij11...i


...jl
k
∈ Tlk (M) we can take the trace over one raised and one lowered index
as follows:
(trF )ji22...i
...jl
k
pj2 ...jl
= Fpi 2 ...ik

8
k−1
to get an element of Tl−1 (M). Note the use of the Einstein summation convention: the index p
is summed over. Obviously the trace depends on which indices you choose to trace over – here
we traced over the j1 and i1 indices. Although it is not immediately obvious, the resulting tensor
does not depend on the local coordinate system you are working in.
A k-form on M is a section of ∧k T ∗ M, i.e. a k0 -tensor field that is completely antisymmetric
in all its indices. A k-vector field on M is a section of ∧k T M.
Definition 1.9. For a 20 -tensor A we write A > 0 (A ≥ 0) if


A(V, V ) > 0 (A(V, V ) ≥ 0)

for all V ∈ T M, V 6= 0. We can similarly write A > B (A ≥ B) if A − B > 0 (A − B ≥ 0).


Definition 1.10. A Riemannian metric on a smooth manifold M is a smoothly-varying inner
product on the tangent space at each point of M, i.e. a 20 -tensor field which is symmetric and


positive definite at each point of M. We will usually write g for a Riemannian metric, and gij for
its coordinate representation. Given such a g, there is an induced norm on each Tp M which we
write p
|X|g := g(X, X) (1.2)
for X ∈ Tp M. A manifold together with a Riemannian metric, (M, g), is called a Riemannian
manifold.
When there is only one metric under consideration, we will usually neglect the subscript g in
equation (1.2), but there will be some situations in the study of the Ricci flow where we will need
to distinguish between the norms induced by different metrics. We will sometimes use the notation
hX, Y i for g(X, Y ).
Note that a Riemannian metric is not actually a metric (although we will frequently say “metric”
rather than “Riemannian metric” for brevity, in contexts where no confusion is possible). It can
be thought of as an “infinitesimal metric”. In fact any Riemannian metric g on a manifold M
induces a bona fide metric on M, as we will now see:
Definition 1.11. Given a Riemannian metric g we can define the length of a piecewise C 1 curve
γ : [0, 1] → M by
Z 1p
`(γ) := g(γ̇(t), γ̇(t))dt
0

where γ̇(t) := dγ/dt.


This allows us to define a metric d on M induced by the metric g:

d(p, q) := inf{`(γ) : γ is a piecewise C 1 curve in M starting at p and ending at q}.

We will sometimes use the metric space notation for a ball: if (M, g) is a Riemannian manifold,
p ∈ M and r > 0, then
B(p, r) := {q ∈ M : d(p, q) < r}
where d is the metric induced by g.
Finally, we say that a map φ : M → N between Riemannian manifolds (M, g) and (N , h) is
an isometry if it is a diffeomorphism and φ∗ h = g. In this case we say that the two Riemannian
manifolds are isometric.
It is not hard to see that an isometry between Riemannian manifolds in the sense just defined
is also a metric-space isometry between the manifolds, if we view them as metric spaces with the
metrics induced by the corresponding Riemannian metrics.
We note that any smooth manifold, by virtue of its paracompactness, admits a smooth Rie-
mannian metric. Thus we do not lose any generality by tackling topological properties of smooth
manifolds from the point of view of Riemannian geometry.
Given a Riemannian manifold (M, g) and a manifold N embedded in M (a submanifold of
M), there is an induced Riemannian metric ḡ on N defined by restricting g to Tp N at each point
p ∈ N.

9
Given the metric gij , which is a positive-definite symmetric matrix at each point of M, we define
the metric inverse g ij to be the inverse matrix at each point, satisfying g ij gjk = δki where δki is
the Kronecker delta. Any inner product on a vector space gives a natural isomorphism V ∼ = V ∗,
[ [ [ j
via X 7→ X , where X (Y ) = hX, Y i. In coordinates, (X )i = gij X . In general, we can lower an
ijk
index i on a tensor Fpq , for example, by setting
jk mjk
Fipq := gim Fpq .
k+1
This takes us from an element of Tlk (M) to something in Tl−1 (M). We can similarly raise an
index using g ij . Using the metric inverse we can also define a norm on the space of tensors, for
example
ijk 2
|Fpq |g := gi1 i2 gj1 j2 gk1 k2 g p1 p2 g q1 q2 Fpi11 qj11k1 Fpi22 qj22k2 .
We will make frequent use of the very convenient ∗-notation in later chapters. Given two
tensors A, B, the expression A ∗ B means “some linear combination of traces of A ⊗ B with
coefficients that do not depend on A or B”. For example if we have A = Aklm s
ij , B = Bpqr , then
A ∗ B might represent
j
17Akil lrs k
ij Blqr − n!Aqi Bisr

where n is the dimension of the manifold, or

Aijk l
ij Bklm .

The meaning is obviously very broad, and is of most use when we want to obtain bounds on
complicated combinations of tensor quantities (as we will in later chapters). The most useful
property of this notation is that, for any given expression of the form A ∗ B, there is a constant C
which does not depend on A or B such that

|A ∗ B| ≤ C|A||B|

by the Cauchy-Schwarz inequality. As a particular case that we will use frequently, we have
Lemma 1.1. If A is an n × n matrix then
1
|A|2 ≥ (trA)2 .
n
The ∗-notation can obviously be extended to multiple ∗-products like A ∗ B ∗ . . . ∗ Z or powers
A∗n := A ∗ A ∗ . . . ∗ A. We also define ∗(A, B, . . . , Z) to mean any combination of ∗-products of
any powers of A, B, . . . , Z, for example

B ∗3 ∗ Z + A∗2 ∗ Z ∗4 .

In later chapters we will use the notation

∗ (Ai ) := ∗(A1 , A2 , . . . , An ).
1≤i≤n

1.2 The Covariant Derivative


We can differentiate scalar functions on a manifold M without any problem: to find the rate of
change of a function f in the direction of the tangent vector X, we simply calculate X(f ). We
ought also to be able to differentiate vector fields, or more generally a section Y of an arbitrary
vector bundle, in the direction of a given tangent vector X.
Definition 1.12. Given a vector bundle E over M, a connection in E is a map

∇ : C ∞ (T M) × C ∞ (E) → C ∞ (E)

with the following properties:

10
1. ∇X Y is linear over C ∞ (M) in X.

2. ∇X Y is linear over R in Y .
3. ∇ satisfies the product rule:

∇X (f Y ) = X(f )Y + f ∇X Y.

We call ∇(X, Y ) the covariant derivative of Y in the direction X. We usually write ∇X Y


rather than ∇(X, Y ).
It is possible to calculate ∇X Y (p) if we are given Xp and the values of Y along a curve γ :
(−, ) → M such that γ(0) = p and γ 0 (0) = Xp . A section Y of E defined along a curve γ in M
is said to be parallel along γ if ∇γ̇(t) Y = 0 along γ.

A connection on the vector bundle E is completely specified by its Christoffel symbols Γkij
in a local coordinate system (xi ) with a local basis (Ej ) for E, defined by:

∇∂i Ej = Γkij Ek .

As a very important special case, we can consider connections on the bundles Tlk (M).
Lemma 1.2. Given a connection ∇ on the tangent bundle T M, we can define connections on all
of the tensor bundles Tlk (M) (which we will also denote ∇) satisfying:
1. ∇ is the given connection on T M.

2. For a scalar function f , ∇X f = X(f ).


3. ∇X (F ⊗ G) = (∇X F ) ⊗ G + F ⊗ (∇X G).
4. ∇X commutes with all traces:
∇X (trY ) = tr(∇X Y )
for all traces (over any indices) of the tensor Y .
If F is a kl -tensor field on M which is given in local coordinates by equation (1.1), we will write


the coordinate form of the covariant derivative ∇F as

(∇X F ) := (∇p Fij11...i


...jl
k
)∂j1 ⊗ . . . ⊗ ∂jl ⊗ dxi1 ⊗ . . . ⊗ dxik X p .

We can write down the coordinate form of ∇ explicitly:


l
X k
X
∇p Fij11...i
...jl
k
= ∂p Fij11...i
...jl
k
+ Fij11...i
...q...jl js
k
Γpq − Fij11...q...i
...jl
Γq .
k pis
(1.3)
s=1 s=1

In the second term of the above equation, the upper q index occupies the position normally occupied
by js , and in the third term the lower q index occupies that position normally occupied by is .
Note that equation (1.3) can be expressed using the ∗-notation as

∇F = ∂F + Γ ∗ F. (1.4)

We can generalize this observation:


Lemma 1.3. Let ∇m F denote the mth iterated covariant derivative of F and ∂ m F denote the
coordinate expression
(∂ m F )i1 ...im := ∂i1 ...im F

11
in some local coordinate system2 (xi ) defined in a coordinate patch U on the manifold M. Then
 
m−1


X
∇m F = ∂ m F + ∂ j Γ  ∗ ∇i F


i=0 j ≤m−1

in U .
Proof. The result follows from equation (1.4) by induction.
Although there are many possible connections on the tangent bundle T M, if M is equipped
with a Riemannian metric then there is one in particular that has more geometric significance.
Lemma 1.4. Given a Riemannian metric gij on M, there is a unique connection ∇ on T M that
satisfies
1. X(g(Y, Z)) = g(∇X Y, Z) + g(Y, ∇X Z) (∇ is compatible with g). This is equivalent to
∇g = 0 where ∇g is defined by Lemma 1.2.
2. The torsion tensor of ∇,

τ (X, Y ) := ∇X Y − ∇Y X − [X, Y ]

is identically 0.
This connection is known as the Levi-Civita connection of the metric g. Its Christoffel symbols
are given in local coordinates by
1 kl
Γkij = g (∂i gjl + ∂j gil − ∂l gij ). (1.5)
2
Using the Levi-Civita connection we can define the Laplacian:
Definition 1.13. The Laplacian3 is a family of operators

4 : C ∞ (Tlk M) → C ∞ (Tlk M)

(where (M, g) is a Riemannian manifold) defined by

4F := g ij ∇i ∇j F,

where ∇ is the Levi-Civita connection of the metric g. If F has the coordinate form given in
equation (1.1), we will write
4Fij11...i
...jl
k

for
(4F )ji11...i
...jl
k
.
If we are given a connection on T M (we will usually be using the Levi-Civita connection for
some metric gij ) we can define geodesics to be the paths that you could move along in the manifold
“without feeling any force”. That is, a path γ : [0, 1] → M is a geodesic if ∇γ̇(t) γ̇(t) = 0 at each
point. Given an initial point and velocity (or equivalently an element of T M), there is a unique
geodesic in M setting off from that point with the given initial velocity. If M is complete (in
particular if it is closed, as it will be for all of our applications), the geodesic will exist for all time.
2 The coordinate expression ∂ m F does not represent any coordinate-independent tensor field in the way that

∇m F does. We regard ∂ m F as representing the tensor field


∂i1 ...im F dxi1 ⊗ . . . ⊗ dxim ,
defined only in U . We similarly represent by Γ the tensor field
Γkij dxi ⊗ dxj ⊗ ∂k ,
also defined only in U .
3 There exist other, non-equivalent definitions of the Laplacian in different settings. For that reason the Laplacian

defined here is sometimes called the rough Laplacian. We will only ever use the rough Laplacian in this project,
so we simply call it the “Laplacian”.

12
B A

Figure 1.1: The injectivity radius: inj(A) is small, inj(B) is large.

Definition 1.14. Suppose M is complete and has no boundary. Given v ∈ Tp M, let γv : [0, 1] →
M be the unique geodesic in M that starts at p with initial velocity vector v. That is, γv (0) = p
and γp0 (0) = v. We define the exponential map exp : T M → M by exp(v) = γv (1). We denote
by expp the exponential map restricted to Tp M. The exponential map is smooth.
Because expp is locally a diffeomorphism at the origin of Tp M, it is possible to choose an open
neighbourhood U of p that is diffeomorphic via expp to an open set in Tp M = Rn , by the inverse
function theorem. We choose coordinates (xi ), i = 1, . . . , n on Tp M so that the basis vectors
{∂i : i = 1, . . . , n} are orthonormal with respect to the metric g at p. We then have a coordinate
chart (U, (expp )−1 ). These coordinates are called normal coordinates about p and have some
properties which make them very convenient for doing calculations:
Lemma 1.5. In normal coordinates about p, we have
1. gij = δij at p.

2. If v ∈ Rn then the curve γv (t) = tv is a geodesic for as long as it exists.


3. ∂k gij = 0 and Γkij = 0 at p. Thus

∇k Fij11...i
...jl
k
= ∂k Fij11...i
...jl
k

at p, by the formula (1.3).


Definition 1.15. The injectivity radius inj(p) at a point p ∈ M is defined by

inj(p) := sup{r > 0 : expp : B(0, r) → M is injective}.

The injectivity radius of a manifold M with metric g is defined by

inj(M, g) := inf{inj(p) : p ∈ M}.

One can interpret the injectivity radius at p ∈ M as follows: imagine there is a flash of light
at p at some time. Light rays propagate in all directions – the injectivity radius is the smallest
distance one of these rays has to travel before it collides with another light ray. The injectivity
radius is a crucial concept in the study of the convergence of manifolds, as we shall see in Chapter
8. Where the injectivity radius is small (as at A in Figure 1.1), the manifold is close to “pinching
off” as described in Section 2.4.

13
1.3 The Lie Derivative
The covariant derivative gives one way of differentiating tensor fields. We will now look at a different
way, which can be defined purely from the manifold structure of M, without any reference to a
Riemannian metric or the extra structure of a connection.
Given a vector field X on a manifold M, we define a time-dependent family of diffeomorphisms
of M to itself, ϕt : M → M for t ∈ (−, ), such that ϕ0 is the identity and
d
ϕt = X
dt
at each point (the existence of such ϕt follows from basic existence theorems for differential equa-
tions – see [5, Sec. 6-2] for this argument and other details relating to the Lie derivative). This is
to be interpreted as a “flow”
 of the manifold in the direction of the vector field X. We now define
the derivative of some kl -tensor field F in the direction of X as the change in F when we move
a small step in the direction of X – but we need to have some way of comparing the value of F at
the point a little step away with that at the original point. Rather than making this comparison
using a “connection” as before, we make it by pushing the value of F at the translated point back
to the original point using the diffeomorphism ϕt .
We define
(∗ϕt )Fp (X1 , . . . , Xk , ω 1 , . . . , ω l ) := Fϕt (p) (ϕt∗ (X1(p) ), . . . , ϕt∗ (Xk(p) ),
(ϕ−1 ∗ 1 −1 ∗ l
t ) (ω(p) ), . . . , (ϕt ) (ω(p) )).

Note that (∗ϕt )Fp ∈ Tlk Mp for all t. It is now possible to define the Lie derivative of F in the
direction X as  
d
LX F = (∗ϕt )F
dt t=0
Lemma 1.6. The Lie derivative is well-defined. It obeys similar conditions to those satisfied by
the covariant derivative as outlined in Lemma 1.2, with one important exception:
1. For a scalar function f , LX f = X(f ).
2. If Y is a vector field then LX Y = [X, Y ].
3. LX (F ⊗ G) = (LX F ) ⊗ G + F ⊗ (LX G).
4. LX commutes with all traces:
LX (trY ) = tr(LX Y )
for all traces (over any indices) of the tensor Y .
Proof. See [5, Sec. 6-2].
Lemma 1.7. On a Riemannian manifold (M, g), we have
(LX g)ij = ∇i Xj + ∇j Xi ,
where ∇ denotes the Levi-Civita connection of the metric g, for any vector field X.
Proof. Let ω be the 1-form dual to the vector field X, ω(Y ) = hX, Y i. Using the product rule
(from Lemma 1.6) and the metric compatibility and torsion-free conditions on the Levi-Civita
connection we have
LX g(Y, Z) = X(g(Y, Z)) − g(LX Y, Z) − g(Y, LX Z)
= h∇X Y, Zi + hY, ∇X Zi − h[X, Y ], Zi − hY, [X, Z]i
= h∇X Y − [X, Y ], Zi + hY, ∇X Z − [X, Z]i
= h∇Y X, Zi + hY, ∇Z Xi
= Y hX, Zi − hX, ∇Y Zi + ZhY, Xi − h∇Z Y, Xi
= Y (ω(Z)) − ω(∇Y Z) + Z(ω(Y )) − ω(∇Z Y )
= (∇Y ω) (Z) + (∇Z ω) (Y )

14
which is the coordinate-free way of expressing the identity we wanted. Note that we used the
product rule again to get the last line.

1.4 Curvature
We start with a description of curvature for 2-manifolds. Given a Riemannian 2-manifold (M2 , g),
we would like to define some quantity that describes “how curved” the surface is at a certain point
p ∈ M2 . Consider two tangent vectors v1 , v2 ∈ Tp M2 , with |v1 | = |v2 | = 1 (with respect to g).
Let γi : [0, 1] → M2 , i = 1, 2 be two geodesics in M2 that start at p, with γ˙i (0) = vi . Let us
define f : R+ → R+ so that f (r) is the distance from γ1 (r) to γ2 (r) along the circle with centre
p and radius r (where the radius is measured with respect to the metric d on M2 induced by
the Riemannian metric g). If θ is the angle between v1 and v2 then it is clear that for small r,
f (r) ∼ θr, i.e. f 0 (0) = θ (see Figure 1.2).

1
(r)
r
f(r)

r
2
(r)

1
(r)
r
f(r)
r
2
(r)

Figure 1.2: Top: geodesics in a 2-manifold of positive curvature converge. Bottom: geodesics in a
2-manifold of negative curvature diverge.

Curvature deals with second-order effects. We define the Gaussian curvature, K, of the

15
surface at p so that  
f (r) K 2
∼θ 1− r
r 6
for small r. See Figure 1.2 for pictures of manifolds with positive curvature (K > 0) and negative
curvature (K < 0). Euclidean space R2 is flat (K = 0).
The generalization to higher-dimensional manifolds of this concept of curvature is not at all
obvious, and was first achieved by Riemann. His idea was that the curvature, intuitively, is the
obstruction to the “flatness” of a manifold. In other words, non-zero curvature at p is what stops
us from choosing coordinates in which the metric is the Euclidean, flat metric gij = δij in a
neighbourhood of p. Normal coordinates about p are, in some sense, “as close as we can get” to a
Euclidean metric in a neighbourhood of p: we know from Lemma 1.5 that in normal coordinates
about p, gij (p) = δij (p) and ∂k gij (p) = 0. However, in general, the second- and higher-order
derivatives of the metric at p may be non-zero.
This motivates the definition of the 40 Riemann curvature tensor, which holds all of


the information about the second-order derivatives of g. It is defined to be the 40 -tensor with


coordinates Ripqj such that, in normal coordinates about p,


1
gij (x) = δij + Ripqj xp xq + (third- and higher-order terms in x). (1.6)
3
The factor of 1/3 is introduced so that this definition agrees with the alternative one that we
introduce next.
Since Riemann’s original work, it has emerged that the Riemann curvature tensor can be defined
in another way, using the Levi-Civita connection ∇. The 31 Riemann curvature tensor is a

3

1 -tensor which can be defined for vector fields X, Y, Z and a 1-form ω by

Rm(X, Y, Z, ω) := ω(R(X, Y )Z) (1.7)

where

R(X, Y )Z = ∇2 Z(X, Y ) − ∇2 Z(Y, X)


= ∇X (∇Y Z) − ∇Y (∇X Z) − ∇[X,Y ] Z.

Note here that


(∇2 Z)(X, Y ) 6= ∇X (∇Y Z).
In fact,
(∇2 Z)(X, Y ) = ∇X (∇Y Z) − ∇∇X Y Z.
There is a distinction because, by ∇2 Z, we really mean the covariant derivative of the 11 -tensor


∇Z. Thus ∇2 Z is a 21 -tensor.




It is definitely not obvious (but can easily be checked) that this is in fact a 31 -tensor (in


particular that, for example, Rm(X, Y, f Z, ω) = f Rm(X, Y, Z, ω) for any f ∈ C ∞ (M)). We
l
represent it in terms of local coordinates as Rijk . The 31 curvature tensor Rijk l
defined by
4

equation (1.7) is obtained from the 0 curvature tensor Rijkl defined by equation (1.6) by raising
the final index.
The coordinates of Rm can be explicitly calculated by applying equation (1.3) with the Levi-
Civita connection ∇:
Lemma 1.8. The Riemann curvature tensor has the explicit form
l
Rijk = ∂i Γljk − ∂j Γlik + Γpjk Γlip − Γpik Γljp .

Rm has many symmetries, which can be proven by manipulating the defining equation (1.7):
Lemma 1.9. (Symmetries of the curvature tensor) The Riemann curvature tensor has the
following properties:
1. Rijkl = Rklij = −Rjikl = −Rijlk

16
2. The first Bianchi identity:
Rijkl + Rjkil + Rkijl = 0

3. The second Bianchi identity:

∇p Rijkl + ∇i Rjpkl + ∇j Rpikl = 0 (1.8)

Proof. See [18, Prop. 7.4].


The first of these symmetries allows us to view Rm as a section of the bundle

∧2 T ∗ M ⊗S ∧2 T ∗ M

of symmetric bilinear forms on the space of 2-vectors. If φ = φij ∂i ∧ ∂j and ψ = ψ ij ∂i ∧ ∂j are


2-vectors then we define the action of the curvature operator at p, Rp : ∧2 Tp M ⊗ ∧2 Tp M → R,
by
R(φ, ψ) = Rijkl φij ψ lk
(all evaluated at p). Note that the curvature operator is defined on 2-vectors by the antisymmetry
in the first two and in the last two indices of Rm, and is symmetric because of the symmetry
Rijkl = Rklij .
We can now explain the relationship with Gaussian curvature. If there is a 2-plane element
φ ∈ ∧2 Tp M representing a 2-dimensional subspace Π of Tp M, we can imagine the 2-dimensional
submanifold of M that is the image under the exponential map expp of Π (near p). The Gaussian
curvature of this 2-dimensional submanifold at p is called the sectional curvature of M associated
with Π, and denoted K(Π).

Lemma 1.10. If Π is a 2-plane in Tp M spanned by the vectors X, Y ∈ Tp M, and φ = X ∧ Y ,


then
R(φ, φ)
K(Π) = ,
|φ|2
where |φ|2 = gik gjl φij φkl .

Proof. See [18, Prop. 8.8].


Because a rank 4 tensor is a bit awkward to work with, we define some simpler quantities by
l
taking the trace of Rijk :

Definition 1.16. The Ricci curvature tensor, Rc, is the 20 -tensor with coordinate expression


p
Rij := Rpij .

It is symmetric: Rij = Rji .


The scalar curvature is the trace of the Ricci tensor,

R := g ij Rij .

On a 2-manifold, it is equal to twice the Gaussian curvature.

The Ricci and scalar curvatures can be interpreted in terms of sectional curvatures:
Lemma 1.11. Let X ∈ Tp M be a unit vector. Suppose that X is contained in some orthonormal
basis for Tp M. Rc(X, X) is then the sum of the sectional curvatures of planes spanned by X and
other elements of the basis.
Given an orthonormal basis for Tp M, the scalar curvature at p is the sum of all sectional
curvatures of planes spanned by pairs of basis elements.
Proof. See [18, end of Chap. 8].
The Ricci and scalar curvatures can also be expressed in terms of the curvature operator.

17
Lemma 1.12. On a 3-dimensional Riemannian manifold (M3 , g), let us diagonalize the curvature
operator R with respect to a basis {e2 ∧ e3 , e3 ∧ e1 , e1 ∧ e2 } of ∧2 T M3 , where {e1 , e2 , e3 } is an
orthonormal basis of T M3 (this is possible because R is symmetric). Suppose that, with respect to
this basis, R is a diagonal matrix with entries λ1 , λ2 , λ3 down the diagonal. Then with respect to
the basis {e1 , e2 , e3 }, the Ricci tensor takes the form
 
λ + λ3 0 0
1 2
Rc = 0 λ3 + λ1 0  (1.9)
2
0 0 λ1 + λ2

and the scalar curvature is


R = λ1 + λ2 + λ3 . (1.10)
Proof. Note that for i 6= j, |ei ∧ ej |2 = |ei ⊗ ej − ej ⊗ ei |2 = 12 + 12 = 2 (if we chose a different
norm on ∧2 T M3 the result of Lemma 1.10 would change). Thus, by the result of Lemma 1.11,
using Lemma 1.10 to calculate the sectional curvatures, we have
1
Rc(e1 , e1 ) = (R(e1 ∧ e2 , e1 ∧ e2 ) + R(e1 ∧ e3 , e1 ∧ e3 )) (1.11)
2
1
= (λ3 + λ2 ) .
2
Similarly, Rc(e2 , e2 ) = (λ1 + λ3 )/2 and Rc(e3 , e3 ) = (λ1 + λ2 )/2. This accounts for the diagonal
entries of the matrix of Rc.
We now need to show that the off-diagonal matrix elements of Rc are zero, for example that
Rc(e1 , e2 ) = 0. We have

Rc(e1 + e2 , e1 + e2 ) − Rc(e1 , e1 ) − Rc(e2 , e2 )


Rc(e1 , e2 ) = ,
2
hence it suffices to show that

Rc(e1 + e2 , e1 + e2 ) = Rc(e1 , e1 ) + Rc(e2 , e2 ). (1.12)

To do this we note that  


e1 + e2 e1 − e2
√ , √ , e3
2 2
is also an orthonormal basis for Tp M3 . We can re-apply formula (1.11) for this new basis to
calculate Rc(e1 + e2 , e1 + e2 ). The result is as in equation (1.12), hence Rc has the stated form.
The formula for R follows by taking the trace of Rc.
Definition 1.17. The Einstein tensor on a Riemannian n-manifold (Mn , g) is the tensor
1
Eij := Rij − Rgij .
n
It is also known as the traceless part of the Ricci tensor. A metric g is called Einstein if its
Einstein tensor is identically 0.
In dimension 3, the Einstein tensor has particular significance. We record here a result which
shows that, for 3-dimensional manifolds, the Riemann curvature tensor can be expressed quite
simply in terms of the Einstein tensor and the scalar curvature.
Lemma 1.13. On a 3-manifold,
R
Rm = (g g) + E g,
4
where denotes the Kulkarni-Nomizu product of symmetric tensors:

(P Q)ijkl = Pil Qjk + Pjk Qil − Pik Qjl − Pjl Qik .

18
Proof. This result follows from the decomposition of Rm, which holds on any n-manifold:
R 1
Rm = (g g) + (E g) + W, (1.13)
2(n − 1)(n − 2) n−2
where W is the so-called “Weyl tensor”, which is defined by equation (1.13). The Weyl tensor
vanishes in dimension 3 (see [9, Sec. 8] for a simple proof using the symmetries of W ), from which
the result follows.
In dimension 3, Einstein metrics are very special.
Lemma 1.14. An Einstein metric on a manifold of dimension n ≥ 3 has constant scalar curvature.
If n = 3, the metric has constant sectional curvature.
Proof. Consider the second Bianchi identity (1.8). If we raise the indices k, l we obtain
∇p Rij kl + ∇i Rjp kl + ∇j Rpi kl = 0.
Taking the trace over the indices i, l and using the symmetries of the curvature tensor we obtain
∇p Rjk − ∇i Rjp ik − ∇j Rpk = 0.
Taking the trace over the indices j, k now gives us
∇p R = ∇i Rpi + ∇i Rpi ,
or equivalently
1
g ij ∇i Rjp = ∇p R.
2
Thus we have
   
1 1 1 1 1
g ij ∇i Ejp = g ij ∇i Rjp − Rgjp = ∇p R − ∇p R = − ∇p R (1.14)
n 2 n 2 n
(using ∇g = 0). If the metric is Einstein then Ejp = 0, so
 
1 1
− ∇p R = 0.
2 n
Because n 6= 2 this means ∇R = 0, hence R is constant.
If n = 3, we can apply formula (1.9) from Lemma 1.12 and deduce that
   
λ + λ3 0 0 C 0 0
1 2 R
Rc = 0 λ3 + λ1 0  = g =  0 C 0 , (1.15)
2 3
0 0 λ1 + λ2 0 0 C
where C = 31 R is a constant over the whole manifold. Therefore the curvature eigenvalues λi are
all equal to the constant value C, from which it follows that the manifold has constant sectional
curvature.
If a metric has constant sectional curvature, we call it a constant curvature metric. There
are three essentially distinct possibilities: the value of the sectional curvatures can be positive, zero
or negative. We have the following examples:
Definition 1.18. Constant Curvature Metrics
Euclidean n-space, En := Rn with the standard metric, has constant sectional curvature 0.
The n-dimensional sphere of radius R,
SnR = {x ∈ Rn+1 : |x| = R}
with the metric induced as a submanifold of En+1 , has constant sectional curvature 1/R2 .
The hyperbolic n-space of radius R, HnR , is the open ball of radius R in Rn with the metric
4R4 δij
gij (x) = .
(R2 − |x|2 )2
It has constant sectional curvature −1/R2 (there are other equivalent ways of representing this
Riemannian manifold).

19
See [18, end of Chap. 8] for more details, including a proof that these spaces have constant
curvature.
Using the theory of Jacobi fields (see [18, Chap. 10]), one can prove many comparison theorems,
as well as results about constant curvature metrics. For example one can prove that the local
structure of a metric with constant sectional curvature C is unique (see [18, Prop. 10.9]) – we will
see a more general result in Theorem 1.16. One can also prove the following, which will come in
handy in Chapter 7:
Theorem 1.15. The Bishop-Günther Volume Comparison Theorem. Let us denote by
Vnk (r) the volume of the ball of radius r in the complete, simply-connected n-dimensional space
of constant sectional curvature k (this will be either SnR , En or HnR ). Suppose that (Mn , g) is a
Riemannian n-manifold, and p ∈ Mn . Then
1. If there is a constant a > 0 such that Rc ≥ (n − 1)ag then
Vol(B(p, r)) ≤ Vna (r).

2. If there is a constant b such that all sectional curvatures of (Mn , g) are bounded above by b,
and the exponential map is injective on the ball B(p, r), then
Vol(B(p, r)) ≥ Vnb (r).

Proof. See [8, Theorem 3.101].

1.5 Topology from Geometry


The central idea of the Geometrization Conjecture is that the topology of a manifold and the type
of geometry that the manifold can have are intimately related. Here we will record some of the
theorems that relate the geometric structure of a manifold to its topology.
Theorem 1.16. Let (Mn , g) be a complete, simply-connected Riemannian n-manifold with con-
stant sectional curvature C. Then Mn is isometric to one of En (if C = 0), SnR (if C = 1/R2 ) or
HnR (if C = −1/R2 ).
Proof. See [18, Theorem 11.12].
In particular, any simply-connected manifold with constant non-positive sectional curvature is
diffeomorphic to Rn , and any simply-connected manifold with constant positive sectional curvature
is diffeomorphic to Sn .
This also allows us to characterize non-simply-connected manifolds of constant curvature – by
applying Theorem 1.16 to the universal covering space, we see that any of these manifolds must
be a quotient of Rn , Sn or Hn by a discrete group of isometries acting freely.
We can also derive topological information from bounds on the curvatures, as the following
theorems show:
Theorem 1.17. (Myers’ Theorem) Suppose (Mn , g) is a complete, connected Riemannian n-
manifold whose Ricci tensor satisfies
Rc ≥ (n − 1)Hg
for some constant H. Then Mn is compact with finite fundamental group and diameter at most
1
πH − 2 .
Proof. See [18, Theorem 11.8].
Theorem 1.18. (The Sphere Theorem) We say that the Riemannian manifold (M, g) is
strictly δ-pinched (for some δ > 0) if there is a constant K > 0 such that all of the sectional
curvatures of M lie in the interval (δK, K].
If Mn is a complete, simply-connected and strictly 41 -pinched n-manifold then Mn is homeo-
morphic to Sn .
Proof. See [3].

20
1.6 Scaling
We will be interested later on (when we deal with the normalized Ricci flow in Chapter 7) in how
various geometric quantities scale when the metric is scaled by a constant factor C.

Lemma 1.19. If g̃ = Cg are two Riemannian metrics on an n-manifold Mn , related by a scaling


factor C, then the various geometric quantities scale as follows:
1. g̃ ij = C −1 g ij .
2. Γ̃kij = Γkij .
l l
3. R̃ijk = Rijk .

4. R̃ijkl = CRijkl .

5. R̃ij = Rij .

6. R̃ = C −1 R.

7. The volume elements: dµ̃ = C n/2 dµ.

1.7 Time-evolving metrics


When our metrics depend on time, as in the Ricci flow, we will want to know how the various
geometric quantities evolve when the metric evolves.
Lemma 1.20. Suppose that gij (t) is a time-dependent Riemannian metric, and


gij (t) = hij (t).
∂t
Then the various geometric quantities evolve according to the following equations:
1. Metric inverse:
∂ ij
g = −hij = −g ik g jl hkl . (1.16)
∂t
2. Christoffel symbols:
∂ k 1
Γij = g kl (∇i hjl + ∇j hil − ∇l hij ). (1.17)
∂t 2
3. Riemann curvature tensor:
 
∂ l 1 ∇i ∇j hkp + ∇i ∇k hjp − ∇i ∇p hjk
R = g lp . (1.18)
∂t ijk 2 −∇j ∇i hkp − ∇j ∇k hip + ∇j ∇p hik

4. Ricci tensor:
∂ 1
Rij = g pq (∇q ∇i hjp + ∇q ∇j hip − ∇q ∇p hij − ∇i ∇j hqp ). (1.19)
∂t 2

5. Scalar curvature:

R = −4H + ∇p ∇q hpq − hpq Rpq (1.20)
∂t
where H = g pq hpq .
6. Volume element:
∂ H
dµ = dµ. (1.21)
∂t 2

21
7. Volume of manifold: Z Z
d H
dµ = dµ. (1.22)
dt M M 2
8. Total scalar curvature on a closed manifold M:
Z Z  
d 1 ij
Rdµ = RH − h Rij dµ. (1.23)
dt M M 2

Proof. Proof of (1.16): We have g ij gjk = δki = constant. Differentiating,

∂t (g ij gjk ) = 0
⇒ (∂t g )gjk + g ij (∂t gjk ) = 0
ij

⇒ ∂t g ij = −hij .

Proof of (1.17): We use formula (1.5), which yields


1
∂t Γkij = (∂t g kl )(∂i gjl + ∂j gil − ∂l gij )
2
1
+ g kl (∂i ∂t gjl + ∂j ∂t gil − ∂l ∂t gij ).
2
Now we work in normal coordinates about a point p. By Lemma 1.5 we have ∂i gjk = 0 at p, and
∂i A = ∇i A at p for any tensor A. Hence
1 kl
∂t Γkij (p) = g (∇i hjl + ∇j hil − ∇l hij ) (p).
2
Now although the Christoffel symbols are not the coordinates of a tensor quantity, their derivative
is.4 Hence both sides of this equation are the coordinates of tensorial quantities, so it does not mat-
ter what coordinates we evaluate them in. In particular, the equation is true for any coordinates,
not just normal coordinates, and about any point p.
Proof of (1.18): We use the result of Lemma 1.8. Each term can be expressed using the
result of formula (1.17):
l
Rijk = ∂i Γljk − ∂j Γlik + Γpjk Γlip − Γpik Γljp
l
⇒ ∂t Rijk = ∂i (∂t Γljk ) − ∂j (∂t Γlik )
+ (∂t Γpjk )Γlip + Γpjk (∂t Γlip ) − (∂t Γpik )Γljp − Γpik (∂t Γljp ).

Once again we work in normal coordinates so that Γkij (p) = 0. This gives us
l
(p) = ∇i ∂t Γljk (p) − ∇j ∂t Γlik (p).
 
∂t Rijk

Once again, both sides are tensors, so the equation holds in any coordinates. Plugging the result
of formula (1.17) in on the rhs yields the result.
Proof of (1.19): This follows from formula (1.18) by taking the trace over the indices i, l.
Proof of (1.20): This follows from the formulae (1.19) and (1.16):

= ∂t g ij Rij

∂t R
= (∂t g ij )Rij + g ij (∂t Rij )
 
1 pq
= −hij Rij + g ij g (∇q ∇i hjp + ∇q ∇j hip − ∇q ∇p hij − ∇i ∇j hqp )
2
= −4H + ∇ ∇ hpq − hpq Rpq
p q

4 This is true because the difference between the Christoffel symbols of two connections is a tensor. Thus, by

taking a fixed connection with Christoffel symbols Γ̃kij , we have


“ ”
∂t Γkij = ∂t Γkij − Γ̃kij

and the right hand side is clearly a tensor.

22
(recall that ∇g = 0 and 4 = g ij ∇i ∇j ).
Proof of (1.21): We will use the formula

dµ = det gij dx1 ∧ dx2 ∧ . . . ∧ dxn


p

(see [14, Sec. 7.5] on the volume form).


First we need to calculate the variation in the determinant of a matrix det A, when the matrix
A itself varies. Because A will end up being a metric, we may assume that it is symmetric and
hence we canQ choose a basis in which A is diagonalized with eigenvalues λi 6= 0. Then Aij = λi δij ,
and det A = i λi . If we then vary the entry Aij , the determinant will not change unless i = j. If
i = j, we have Q 
∂ det A ∂ j λ j 1 Y 1
= = λj = det A.
∂Aii ∂λi λi j λi

Therefore by the chain rule,


n  
d X ∂ det A dAij
det A =
dt i,j=0
∂Aij dt
n
X 1 dAij
= δij det A
i,j=0
λ i dt
 
ij dAij
= A−1 det A
dt

where we have observed that (A−1 )ij = δ ij /λi and we are now using the Einstein summation
convention. This formula manifestly does not depend on the basis we choose as traces are basis-
independent.
It now follows by the chain rule that

∂t dµ = ∂t det gij dx1 ∧ . . . ∧ dxn


p

1
= p g ij hij det gij dx1 ∧ . . . ∧ dxn
2 det gij
H
= dµ
2
where H = g ij hij .
Proof of (1.22): This follows from formula (1.21) by taking the derivative under the integral
sign.
Proof of (1.23): This follows from formulae (1.21) and (1.20):
Z Z

Rdµ = (∂t R)dµ + R(∂t dµ)
∂t M
ZM  
1
= −4H + ∇p ∇q hpq − hpq Rpq + RH dµ
M 2
Z  
1
= RH − hpq Rpq dµ.
M 2

Here we have used Stokes’ Theorem (see [14, Sec. 7.5]) to get rid of the first two terms in the
integral because they were expressible as the divergence of a vector field on M, and M has no
boundary.

23
Chapter 2

Introduction to the Ricci Flow

The main aim of this project is to introduce the basics of Hamilton’s Ricci flow program, which is
aimed at proving Thurston’s Geometrization Conjecture and consequently the Poincaré Conjecture.
In this chapter we will present the context for the Ricci flow: what is it, what are the problems
that it is intended to solve, and why might it be expected to solve them? In the process we will also
see some simple solutions to the Ricci flow and try to gain a bit of intuition about its behaviour. A
lot of space in this introduction is devoted to material that is not elaborated (or is elaborated very
little) in the main body of the thesis – namely the description of the Geometrization Conjecture
and the section on pinching. That is because the introduction is intended to serve as a “big picture”
guide to the reasons we study the Ricci flow, so that the reader understands, when encountering
the formidable technical details of Chapter 7, why it is all worthwhile.

2.1 The Prehistory of the Ricci Flow – Geometrization


The Poincaré Conjecture was one of the iconic unsolved problems of 20th century mathematics.
Around 1900, Poincaré asked if a simply-connected closed 3-manifold is necessarily the 3-sphere
S3 . After many years of topological difficulties, William Thurston made promising progress in
the 1970s. He proposed, not only a way of approaching the Poincaré Conjecture, but a far more
general conjecture that, if proven, would lead to a classification of all compact 3-manifolds. We
will outline, extremely vaguely, Thurston’s conjecture.
It was already known by the time Thurston came onto the scene that compact orientable 3-
manifolds could be decomposed into simpler manifolds using the “connected sum” decomposition.
Figure 2.1 shows the connect sum operation # for 2-manifolds. For 3-manifolds, the definition
is analogous: one cuts out a 3-ball from each manifold then glues them together along the S2
boundaries created.
If a manifold can not be nontrivially decomposed using the connected sum decomposition then
it is called prime. Hellmuth Kneser showed in 1929 that every compact orientable 3-manifold can
be decomposed into a finite number of prime “factors” (see [16]), and John Milnor showed in 1962
that this decomposition is unique (see [19]). Thus, to classify 3-manifolds it suffices to classify
prime 3-manifolds.
The connected sum decomposition involves cutting 3-manifolds along 2-spheres; the next step
is to cut them along 2-tori. Thurston conjectured that any compact, orientable, prime 3-manifold
can be decomposed by some finite number of embedded tori into pieces so that each piece has
one of eight fundamental, highly symmetric geometric structures (of which the constant-curvature
metrics on S3 , H3 , R3 are three). Note the analogy with the 2-dimensional case: any compact,
orientable 2-manifold is either S2 (which can be given a metric of constant positive curvature),
a torus T 2 (which can be given a metric with zero curvature), or can be decomposed using the
connected sum into tori with holes (which can be given metrics of constant negative curvature)
(see [22, Theorem 77.5] for the topological classification and [27, Chap. 1] for a discussion about
the existence of geometric structures).
It might not be obvious that the geometric structure will give us useful topological information,
but we saw in Chapter 1 how it is possible to make progress towards classifying constant curvature

24
#

Figure 2.1: Connect sum for 2-manifolds: cut a hole in each of M and N , then glue them together
along the boundary circles created to get the connected sum M #N .

metrics in any dimension – an example of the way that a knowledge of the geometric structure of
a manifold can lead to knowledge of the topological structure. Thurston proved that this decom-
position worked for a certain class of 3-manifolds (the so-called Haken manifolds), but not in full
generality. If his conjectured classification of 3-manifolds, known as “Thurston’s Geometrization
Conjecture”, were proven for all 3-manifolds, then the Poincaré Conjecture would follow (see [2,
Sec. 7.7]).

2.2 Hamilton’s Ricci Flow


Enter Richard Hamilton. Hamilton published a groundbreaking paper ([9]) in 1982, introducing
the concept of the Ricci flow. If you have a Riemannian manifold M with metric g0 , the Ricci flow
is a pde that evolves the metric tensor:

g(t) = −2Rc(g(t))
∂t
g(0) = g0

where Rc(g(t)) denotes the Ricci curvature of the metric g(t).


The idea is to try to evolve the metric in some way that will make the manifold “rounder
and rounder”. We hope that the metric will evolve towards one of Thurston’s eight fundamental

25
geometric structures, and that the decomposition by spheres and tori will somehow emerge nat-
urally. In choosing what should go on the right hand side of the equation of the Ricci flow, we
know that it should be a rank-2 tensor, symmetric (so that the metric g remains symmetric), and
it should involve the curvature somehow – the Ricci curvature tensor is the obvious choice. The
minus sign makes the Ricci flow a heat-type (parabolic) equation (as we shall see in Chapter 5),
so it is expected to “average out” the curvature. This should make the metric rounder in the way
that we want.
A characteristic property of heat-type equations is the maximum principle, which we will see
in Chapter 3. We will use the maximum principle in Chapter 7 to prove quantitatively that this
rounding of the metric does indeed happen in one specific case. The main theorem we will prove
is the one proved by Hamilton in the paper that introduced the Ricci flow:
Theorem 2.1. (Hamilton, 1982) Let M3 be a closed 3-manifold which admits a Riemannian
metric with strictly positive Ricci curvature. Then M3 also admits a metric of constant positive
curvature.
In particular, by Theorem 1.16, any simply-connected closed 3-manifold which admits a
metric of strictly positive Ricci curvature is diffeomorphic to the 3-sphere. We are certainly starting
to get into the territory of the Poincaré Conjecture with this result!
More specifically, we will see that if the initial metric g0 has strictly positive Ricci curvature
then the manifold M3 will shrink to a point in finite time under the Ricci flow. But if we dilate the
metric by a time-dependent factor so that the volume remains constant, the problem of shrinking
to a point is removed. Furthermore, we can show that the rescaled metric converges uniformly
to the desired metric of constant positive curvature on M3 . This process of “blowing up” the
manifold when it is becoming singular is a crucial one in the Ricci flow program. Because it is
rather complicated and can get lost in the technical details, we have provided an analogous but
much simpler argument for the “curve-shortening flow” in Chapter 4 in the hope that this will
make it easier to understand what is going on in later chapters.
We note that the Uniformization Theorem (any Riemannian metric on a closed 2-manifold is
conformal to one of constant curvature) can be proved using the Ricci flow as well (see [6, Chap.
5] for the bulk of the argument, and [4] for the remainder). Once again we encounter problems
with singularities: the manifold will shrink to a point in some cases and become arbitrarily large
in others. As before, the solution is to rescale so that the volume is constant. When we do
this, the rescaled metric converges smoothly to a smooth metric of constant curvature. Note
that the Uniformization Theorem is the closest thing to a 2-dimensional analogue of Thurston’s
Geometrization Conjecture, so the fact that the Ricci flow solves it in this way is another strong
indicator that we’re heading in the right direction.
In 2002 and 2003, Grisha Perelman posted three papers ([23, 25, 24]) on arXiv.org which claimed
to have completed Hamilton’s work towards using the Ricci flow to prove the Geometrization
Conjecture. More recently several authors have posted papers elaborating on the technical details
of Perelman’s papers ([2, 21]). The proof seems to have been accepted by the mathematical
community, wrapping up the 100-year history of the Poincaré Conjecture (and the younger but no
less important Geometrization Conjecture).

2.3 Special Solutions of the Ricci Flow


In this section we will exhibit some of the special solutions of the Ricci flow. The first thing, of
course, is to know what they do on the spaces of constant curvature.
On an n-dimensional sphere of radius r (where n > 1), the metric is given by g = r2 ḡ where ḡ
is the metric on the unit sphere. The sectional curvatures are all 1/r2 . Thus for any unit vector
v, the result of Lemma 1.11 tells us that Rc(v, v) = (n − 1)/r2 . Therefore
n−1
Rc = g = (n − 1)ḡ,
r2

26
so the Ricci flow equation becomes an ode:

∂t g = −2Rc
∂ 2
⇒ ∂t (r ḡ) = −2(n − 1)ḡ
d(r 2 )
⇒ dt = −2(n − 1).
We have the solution q
r(t) = R02 − 2(n − 1)t,
where R0 is the initial radius of the sphere. The manifold shrinks to a point as t → R02 /(2(n − 1)).
Similarly, for hyperbolic n-space Hn (where n > 1), the Ricci flow reduces to the ode
d(r2 )
= 2(n − 1)
dt
which has the solution q
r(t) = R02 + 2(n − 1)t.
So the solution expands out to infinity.
Of course the flat metric on En has zero Ricci curvature, so it does not evolve at all under the
Ricci flow. There are other non-trivial Riemannian manifolds with vanishing Ricci curvature (the
metric is flat, i.e. locally isometric to Euclidean space, if and only if the Riemann curvature tensor
vanishes). These metrics can be regarded as the “fixed points” of the Ricci flow. However, we
ought really to regard Sn and Hn as honorary fixed points of the flow – even though the metric
was changing under the flow, it only ever changed by a rescaling of the metric.
Even more generally, one can regard as “generalized fixed points” of the Ricci flow those
manifolds which change only by a diffeomorphism and a rescaling under the Ricci flow. Let
(Mn , g(t)) be a solution of the Ricci flow, and suppose that ϕt : Mn → Mn is a time-dependent
family of diffeomorphisms (with ϕ0 = id) and σ(t) is a time-dependent scale factor (with σ(0) = 1).
If we then have
g(t) = σ(t)ϕ∗t g(0)
then the solution (Mn , g(t)) is called a Ricci soliton. Taking the derivative of this equation and
evaluating at t = 0 yields
∂ dσ(t) ∗ ∂
g(t) = ϕt g(0) + σ(t) ϕ∗t g(0)
∂t dt ∂t
−2Rc(g(0)) = σ 0 (0)g(0) + LV g(0),
where V = dϕt /dt, by the definition of the Lie derivative given in Section 1.3. Let us set σ 0 (0) = 2λ.
We can now use the result of Lemma 1.7 to write this in coordinates as
−2Rij = 2λgij + ∇i Vj + ∇j Vi .
As a special case we can consider the case that V is the gradient vector field of some scalar
function f on Mn , i.e. Vi = ∇i f . The equation then becomes
Rij + λgij + ∇i ∇j f = 0. (2.1)
Such solutions are known as gradient Ricci solitons. A gradient Ricci soliton is called shrinking
if λ < 0, static if λ = 0, and expanding if λ > 0. The gradient Ricci solitons play a role in
motivating the definition of Perelman’s F- and W-functionals which we will see in Section 8.4.
One example of a gradient Ricci soliton is Hamilton’s cigar soliton: Let Σ2 = R2 with the
standard coordinates and
δij
gij = .
1 + x2 + y 2
Because the metric is rotationally symmetric, the calculation of the Ricci curvature tensor and
the covariant derivative is particularly simple (by the rotational symmetry we can embed Σ2 in
R3 as a manifold of revolution – see [18, Ex. 8.2] for the calculation of the Gaussian curvature of
manifolds of revolution). We can show that
Rij + ∇i ∇j f = 0
where f (x, y) = (x2 + y 2 )/2, hence (Σ2 , g) is a static gradient Ricci soliton.

27
S1 x I

Figure 2.2: A neck in a 2-manifold.

Figure 2.3: A neck “pinching off” in a 2-manifold. This diagram is intended to illustrate by lower-
dimensional analogy what a neckpinch in a 3-manifold is like – the Ricci flow on 2-manifolds does
not give rise to neckpinches.

2.4 Pinching
We now describe how the connected sum decomposition arises out of the Ricci flow. Firstly we
note that, if we have a product Riemannian manifold with a product Riemannian metric, then
under the Ricci flow each factor in the product evolves independently.
Now consider a “neck” shaped like S1 × I (I is an interval) between two parts of a 2-manifold,
as shown in Figure 2.2. We expect the metric on the neck to be close to a product metric; the I
factor has no curvature so it will not change, and the S1 factor also has no intrinsic curvature so it
will not change. In contrast, when we have an analogous neck shaped like S2 × I in a 3-manifold,
we have seen in Section 2.3 that the 2-sphere will shrink to a point in finite time. Therefore we
expect that in some situations, the manifold will “pinch off” at such a neck (see Figure 2.3 for
the 2-dimensional analogue of such a neckpinch). Thus the Ricci flow can actually perform the
connected sum decomposition for us! The details of how this pinching off actually happens (how
one ought to perform “surgery” on one’s manifold) are very tricky.
We will discuss the pinching process in more detail in Chapter 8.
The torus decomposition arises in a different way. After we have have performed the connected-
sum decomposition by surgery on the neckpinches, we consider the evolution of the remaining
pieces. Some will be quotients of S3 (which shrink to a point) or products S2 × S1 (which shrink
down to the S1 factor). Both of these cases satisfy the Geometrization Conjecture, so it is the
remaining pieces that concern us. They will exist without singularities as t → ∞.
We expect these pieces to be made up of pieces with the fundamental geometries of Thurston,
glued together along torus boundaries (if the Geometrization Conjecture is true). Some pieces
will be hyperbolic, and we have seen in Section 2.3 that hyperbolic metrics will tend to expand.
The metric on the torus necks, on the other hand, will be close to a product metric on T 2 × I.
Recall that under the Ricci flow on a product metric, each factor will evolve independently. Both

28
factors have zero curvature, so we expect the neck to remain static, with the hyperbolic pieces of
the manifold expanding around it. If we rescale the flow so that the volume remains constant, the
torus neck will become very thin, in the same way that the S2 × I neck becomes very thin before
a neckpinch. This is how the torus decomposition arises.
Under the rescaling, the manifold splits along tori into hyperbolic pieces with cusps and “col-
lapsing” pieces (with injectivity radius shrinking to 0). The collapsing pieces can be identified as
so-called “graph manifolds”, which are known to satisfy the Geometrization Conjecture. For a
more detailed description of how this decomposition arises, see [1]. For the full story on the proof
of the Geometrization Conjecture see [2].

29
Chapter 3

The Maximum Principle

The maximum principle is the key tool in understanding many parabolic partial differential
equations. It appears in many guises, but it always essentially expresses the fact that parabolic
or “heat-type” pdes will “average out” the values of whatever quantity is evolving. It is cru-
cial to understanding the Ricci flow (which will be shown in Chapter 5 to be parabolic modulo
reparametrization), where it can be used to put bounds on the curvature of the metric. In some
situations we will need more refined estimates than can be obtained by applying the maximum
principle to scalar quantities related to curvature, so we must apply the maximum principle to
tensor quantities like the curvature operator. The question of what it means for a tensor quantity
to “average out” naturally arises.
In this chapter we first motivate the idea of the maximum principle using the example of the
heat equation on the real line. We generalize this situation to a scalar function obeying a heat-type
equation on an arbitrary Riemannian manifold (rather than just R), then finally generalize to an
arbitrary section of a vector bundle over a manifold (rather than just the trivial R-bundle whose
sections are the scalar functions).

3.1 The Heat Equation


The simplest parabolic pde is the heat equation, describing the time evolution of the temperature
distribution u(x, t) on an infinite 1-dimensional bar:

∂u ∂2u
= . (3.1)
∂t ∂x2
The maximum principle says that the temperature of the hottest point on the bar is a non-increasing
function of time, and the temperature of the coldest point on the bar is a non-decreasing function
of time.
This can be seen intuitively from the fundamental or Green’s function solution of the heat
equation, which is the temperature distribution due to a delta-function distribution at time t = 0.
That is, it solves the pde

∂u ∂2u
=
∂t ∂x2
u(x, 0) = δ(x).

The solution is 2
exp(− x )
u(x, t) = √ 4t .
4πt
It can be seen in Figure 3.1 that at each time the temperature distribution is Gaussian, and
the distribution becomes flatter and wider as time evolves. This is what we expect of heat flow:
the heat spreads out as time evolves, any relatively hot region becomes cooler, and the maximum
temperature on the bar drops over time. Any solution of the heat equation can be expressed as

30
1

0.8

0.6

0.4

0.2

-4 -2 2 4

Figure 3.1: The evolution of the temperature distribution due to a pointlike source at time t = 0.
The solution becomes flatter and wider as time increases.

a weighted integral of such fundamental solutions, by the method of Green’s functions. Thus it
is plausible that the “averaging out” behaviour of the fundamental solution is characteristic of all
solutions of the heat equation.
One can see why the maximum principle is true in a different way. At a point of maximum
temperature (at a given time), the temperature distribution u must have a local maximum. Thus
∂ 2 u/∂x2 ≤ 0. It follows by the heat equation (3.1) that, at the hottest point of the bar,

∂u ∂2u
= ≤ 0,
∂t ∂x2
so the temperature is nonincreasing. Therefore one would expect the maximum temperature on
the bar to drop over time.

3.2 A Scalar Maximum Principle on Manifolds


We now generalize to scalar functions on an arbitrary manifold, and we also take reaction terms
into account. This and the subsequent section are based on [6, Chap. 4].
Let us work on a closed manifold M with a Riemannian metric g(t) that varies with time. In
this section we will consider pdes of the form
∂u
= 4g(t) u + hX(t), ∇ui + F (u), (3.2)
∂t
where u : M×[0, T ) → R is a time-dependent real-valued function on M, X(t) is a time-dependent
vector field on M and F : R → R. We will see many pdes of this broad type – they consist of a
Laplacian term 4g(t) u and the reaction terms hX(t), ∇ui + F (u). We call such pdes heat-type
equations, because of the analogy with the heat equation (3.1).
In practice, we will apply the maximum principle to heat-type equations with such fearsomely
complicated reaction terms that it is impossible to keep track of them all. The best we can hope
to do, in most cases, is bound them. Incorporating bounds on the reaction terms into the heat-
type equation will give rise to differential inequalities, rather than equalities. For this reason, the
results stated and proved in this chapter relate to differential inequalities, rather than equalities
as in equation (3.2).
Before showing how to deal with the reaction term F (v) we will prove the maximum principle
in the simpler case that F = 0.
Lemma 3.1. Let (M, g(t)) be a closed manifold with a time-dependent Riemannian metric g(t).
Suppose that u : M × [0, T ) → R is initially nonpositive (i.e. u(x, 0) ≤ 0 for all x ∈ M) and that

31
it satisfies the differential inequality
∂u
≤ 4g(t) u + hX(t), ∇ui
∂t
at all points (x, t) ∈ M × [0, T ) where u(x, t) > 0. Then u(x, t) ≤ 0 for all x ∈ M and t ∈ [0, T ).
Note the rather strange condition that the inequality is satisfied only at points and times
where something has “gone wrong” (in the sense that u(x, t) has become positive). The condition
is designed so that we can apply this lemma in the proof of the more general scalar maximum
principle (Theorem 3.2).
Proof. The proof will be based on the idea outlined at the end of Section 3.1.
Given  > 0, let us set v = u − (1 + t). Note that v (x, 0) ≤ − < 0. We will show that for
any  > 0, v < 0 on M × [0, T ). From the fact that  > 0 is arbitrary it will follow that u is
non-positive.
First we compute the evolution equation (or evolution inequality) of v , which follows from that
of u:

v ≤ 4u + hX, ∇ui − 
∂t
= 4v + hX, ∇v i −  (3.3)
at any point x and time t where u(x, t) > 0. In particular, because v < u, the inequality holds at
any (x, t) such that v (x, t) = 0. Note that 4v = 4u and ∇v = ∇u because −(1 + t) has no
spatial dependence.
Now suppose, for a contradiction, that v (x, t) ≥ 0 for some (x, t) ∈ M×[0, T ). By compactness,
there exists a first time t0 ∈ [0, T ) and point x0 ∈ M at which v hits 0. That is, v (x0 , t0 ) = 0
but v (x, t) < 0 for any t < t0 . It follows that
∂v
(x0 , t0 ) ≥ 0.
∂t
Furthermore, x0 must be a local maximum of v (·, t0 ), so
∇v (x0 , t0 ) = 0
4v (x0 , t0 ) ≤ 0.
We can plug these inequalities into equation (3.3), which holds at (x0 , t0 ) because v (x0 , t0 ) = 0.
We obtain:
∂v
0≤ (x0 , t0 ) ≤ 4v (x0 , t0 ) + hX(t0 ), ∇v (x0 , t0 )i −  ≤ − < 0,
∂t
which is a contradiction.
Therefore, v (x, t) < 0 for all (x, t) ∈ M × [0, T ) and for all  > 0. Hence u(x, t) ≤ 0 for all
(x, t) ∈ M × [0, T ), as required.
We can now prove the main result of this section, which shows how to deal with the reaction
term F (u) in equation (3.2).
Theorem 3.2. The Scalar Maximum Principle. Let (M, g(t)) be a closed manifold with a
time-dependent Riemannian metric g(t). Suppose that u : M × [0, T ) → R satisfies
∂u
≤ 4g(t) u + hX(t), ∇ui + F (u)
∂t
u(x, 0) ≤ C for all x ∈ M,
for some constant C, where X(t) is a time-dependent vector field on M and F : R → R is locally
Lipschitz. Suppose that φ : R → R is the solution of the associated ode, which is formed by
neglecting the Laplacian and gradient terms:

= F (φ)
dt
φ(0) = C.

32
Then
u(x, t) ≤ φ(t)
for all x ∈ M and t ∈ [0, T ) such that φ(t) exists.
The theorem essentially tells us that our upper bound grows no faster than we would expect
from the reaction term F (u).
Proof. Let us set v = u − φ. We know that v(x, 0) ≤ 0 for all x ∈ M, and we desire to show that
v(x, t) ≤ 0 for all (x, t) ∈ M × [0, T ). To do this, we fix an arbitrary τ ∈ [0, T ) and show that
v ≤ 0 on [0, τ ], for any τ ∈ [0, T ). First note that
∂v ∂(u − φ)
=
∂t ∂t
≤ 4u + hX, ∇ui + F (u) − F (φ)
= 4v + hX, ∇vi + (F (u) − F (φ)). (3.4)

Note that ∇u = ∇v and 4u = 4v because φ depends only on t. We now want to deal with the
last term on the rhs.
Because M×[0, τ ] is compact, there exists a constant C (dependent on τ ) such that |u(x, t)| ≤ C
and |φ(t)| ≤ C on M × [0, τ ]. Because F is locally Lipschitz and the interval [−C, C] is compact,
there exists C1 (also dependent on τ ) such that |F (x) − F (y)| ≤ C1 |x − y| for all x, y ∈ [−C, C].
Therefore, because u, φ ∈ [−C, C], |F (u) − F (φ)| ≤ C1 |u − φ| = C1 |v| on M × [0, τ ]. Plugging this
into the evolution equation (3.4) for v, we obtain
∂v
≤ 4v + hX, ∇vi + C1 |v|.
∂t
Now let w = e−C1 t v. Then we have
∂w
≤ e−C1 t (4v + hX, ∇vi + C1 |v| − C1 v)
∂t
= 4w + hX, ∇wi + C1 (|w| − w). (3.5)

We are going to apply Lemma 3.1 to the function w. Because v(x, 0) ≤ 0 for all x ∈ M, we have
w(x, 0) ≤ 0 for all x ∈ M. Furthermore, if w ≥ 0 then |w| = w, so the differential inequality (3.5)
gives us
∂w
≤ 4w + hX, ∇wi
∂t
at any point (x, t) ∈ M × [0, τ ] such that w(x, t) ≥ 0. Hence, by Lemma 3.1, w(x, t) ≤ 0 for all
(x, t) ∈ M × [0, τ ].
It follows that v(x, t) ≤ 0, and hence u(x, t) ≤ φ(t) for all (x, t) ∈ M × [0, τ ], for any τ ∈ [0, T ).
Therefore u(x, t) ≤ φ(t) for any (x, t) ∈ M × [0, T ).

3.3 A Maximum Principle for Vector Bundles


In [10], Hamilton showed how to generalize the maximum principle to apply to sections of a vector
bundle (of which scalar functions are a special case). There is a bit of a subtlety in this case though
– a heat-type equation has a Laplacian term in it, but what exactly do we mean by a Laplacian
in this situation? Let π : E → M be a vector bundle over M with a fixed bundle metric h, ∇(t) ¯ a
smooth time-dependent family of connections on E compatible with h, and g(t) a time-dependent
Riemannian metric on M. Now to define the Laplacian of a section ϕ we need to take two covariant
derivatives of ϕ then take the trace. However the first covariant derivative is ∇ϕ¯ ∈ C ∞ (T ∗ M ⊗ E)
(compare Definition 1.12).
There is a problem because ∇ϕ ¯ is not a section of E, so we cannot simply take the second
¯
covariant derivative using ∇(t). To resolve this, we define a connection ∇(t)ˆ on the vector bundle

E ⊗ T M by the product rule:
ˆ X (ϕ ⊗ ξ) ≡ ∇
∇ ¯ X ϕ ⊗ ξ + ϕ ⊗ ∇X ξ

33
Figure 3.2: A 2-dimensional vector bundle over a 1-manifold. The set K (shaded) is not invariant
under parallel translation. The initial section (in black) is contained in K, but averages out to to
a section (shown in grey) that is not contained in K.

where ϕ ∈ C ∞ (E), ξ ∈ C ∞ (T ∗ M), and ∇ denotes the Levi-Civita connection of the metric g(t).
We can now define the Laplacian: if we use the index a for sections of E and indices i, j for sections
of T ∗ M (so that an element H of E ⊗ T ∗ M might have coordinates Hia with respect to some
basis) then
ˆ i∇
(4̂ϕ)a := g ij ∇ ¯ j ϕa

(compare this with Definition 1.13).


According to our earlier discussion, the maximum principle should somehow express the ten-
dency of solutions of heat-type equations to “average out”. What exactly does this mean for a
section of a vector bundle? We would expect a vector in Rk that is getting averaged out to remain
inside any convex set that initially contains it. So rather than proving that the lower bound of
our solution is preserved, as we did for a scalar function, we prove that our solution stays inside
convex sets. The set should also be closed and invariant under parallel translation. The reason
for the latter can be seen from Figure 3.2 which shows a section of a 2-dimensional vector bundle
over a 1-dimensional manifold getting averaged out. The section is initially contained inside the
set K, which is closed and convex but not invariant under parallel translation, but the averaging
out takes it outside K.

Theorem 3.3. Let π : E → M be a vector bundle over M with a fixed bundle metric h, ∇(t) ¯ a
smooth time-dependent family of connections on E compatible with h, and g(t) a time-dependent
Riemannian metric on M. Let K be a subset of E that is closed and convex in each fibre, and
invariant under parallel translation. Let F : E × [0, T ) → E be a continuous map that is fibre-
preserving, and Lipschitz in each fibre. Let α(t) be a time-dependent section of E that satisfies the

34
conditions

α = 4̂α + F (α)
∂t
α(0) ∈ K.

Let Kx = π −1 (x) ∩ K. Suppose that every solution of the ode

da
= F (a)
dt
a(0) ∈ Kx

remains in Kx . Then the solution α(t) of the pde remains in K.


That is, if the reaction term does not force it out, then the solution will remain in K. Note
that the behaviour of the ode is much simpler than the behaviour of the original pde because we
need only consider each fibre individually – the solution of the ode is a time-dependent vector in
some vector space rather than a time-dependent section of some vector bundle.

35
Chapter 4

Curve-Shortening Flow

This chapter is based on the lectures on Geometric Evolution Equations given by Ben Andrews at
the ICE-EM Graduate School in July 2006.
When Hamilton introduced the Ricci flow and used it to prove Theorem 2.1, he introduced
quite a general method of dealing with geometric evolution equations. This method has since
been applied to other flows, such as the mean curvature flow (mcf) and the curve-shortening flow
(csf) (see [30] for a description of both). Although these settings for Hamilton’s method of proof
emerged after Hamilton’s original Ricci flow paper, the csf in particular is much simpler than the
Ricci flow. Many of the concepts central to the theory of the Ricci flow presented in Chapters 5,
6, 7 and 8 find simpler analogies in the theory of the csf.
In this chapter we will outline the basic theory of the csf, with the aim of providing a
“blueprint” for subsequent development of the theory of the Ricci flow. We will not prove all
of the results that we state – those proofs and those results that assist in building intuition for the
methods of proof employed in later chapters are emphasised.

4.1 Steepest Descent Flow for Length


Given a smooth immersion X0 : R/Z → R2 (i.e. a smooth immersion of a circle into the plane),
we can evolve it by taking X : R/Z × [0, T ) → R2 such that

∂X
(u, t) = −κN(u, t) (4.1)
∂t
X(u, 0) = X0 (u).

Here, κ is the curvature of our curve, and N is the unit normal vector to it, defined by

∂2X
−κN =
∂s2
where s is the arclength parameter (see Figure 4.1). The book [15, Chap. 1] is a good reference
for the theory of curves in space.
One might wonder where this definition comes from. The idea is to look for the way to move a
curve in the plane that gives the greatest decrease in total length for the smallest total “movement”
– we say that the csf is the “steepest descent flow for length”.
Lemma 4.1. Denote the length of the curve u 7→ X(u, t) at time t by
Z 1
L(t) = |Xu |du.
0

Consider an arbitrary variation of the curve X0 (u) = X(u, 0) with

∂X
(u, t) = V (u, t),
∂t

36
N

-κN

Figure 4.1: The curve-shortening flow moves a curve in the direction of its curvature vector.

which is normalized so that Z L


|V |2 ds = 1
0

where s is the arclength along X0 and L = L(0). Then the rate at which the length decreases,
−dL/dt, is maximised for the csf, i.e.

V ∝ −κN.

Proof. We note that


 1

∂t |Xu | = ∂t hXu , Xu i 2
1
= hXu , ∂t Xu i
|Xu |
1
= hXu , ∂u ∂t Xi
|Xu |
 
∂u V
= Xu ,
|Xu |
= hXu , ∂s V i.

37
Hence
Z 1
dL(t)
= ∂t |Xu |du
dt 0
Z L
ds
= hXu , ∂s V i
0 |Xu |
Z L
= h∂s X, ∂s V ids
0
Z L
L
= [h∂s X, V i]0 − h∂ss X, V ids
0
Z L
= hκN, V i
0
! 21 ! 21
Z L Z L
2 2
≥ − κ ds |V | ds .
0 0

We have integrated by parts, and the boundary terms vanished by periodicity. The last line follows
from the Cauchy-Schwarz inequality, so equality holds if and only if V ∝ −κN. Thus the csf is
exactly the variation that gives the fastest decrease in length.

Example: The shrinking circle. If our initial curve X0 is a circle of radius r0 centred at the
origin, then the solution will take the form X(u, t) = r(t)(cos(u), sin(u)). We have N = X/r, κ =
1/r, so the csf equation becomes
dr 1
=−
dt r
which has the solution
1
X(u, t) = (r02 − 2t) 2 (cos(u), sin(u)). (4.2)
p
2

So the circle shrinks to a point at a finite time t = r0 /2 = r0 / 2.

4.2 Short Time Existence


Before anything else we must check that this system has a solution for a short time. To do this
we will use the existence and uniqueness theorem for parabolic pdes (Theorem A.1). We need to
check if our system is strongly parabolic before we can apply this theorem.
∂x
If we set X(u) = (x(u), y(u)) and use the notation xu = ∂u , then the csf equation becomes

yu2
    
∂ x 1 −xu yu xuu
=
∂t y (x2u + yu2 )2 −xu yu x2u yuu
For the system to be strongly parabolic, the matrix on the rhs ought to be positive-definite. It
clearly isn’t (it is even singular), so we seem to be in trouble.
Although it appears that our system is not parabolic, all that is going on is that the parabolic
nature of the system is being hidden from us by a poor parametrization of our solution. Suppose,
for the moment, that we have a solution of the csf given by X : R/Z × [0, T ) → R2 . Consider a
time-dependent family of diffeomorphisms from the circle to itself:
ϕ : R/Z × [0, T ) → R/Z.
Then we can define X̃(u, t) = X(ϕ(u, t), t). X̃ is just a time-dependent reparametrization of
X. In particular, it won’t change how the curve “looks”, i.e. its image in R2 .
The evolution equation satisfied by X̃ is then
∂ ∂ ∂
X̃(u, t) = X(ϕ(u, t), t) + Xu ϕ(u, t)
∂t ∂t ∂t
= −κN + Xu V
     
1 yu [yu −xu ] xuu xu
= + V
(x2u + yu2 )2 −xu yuu yu

38
where we define

ϕ(u, t) = V (ϕ(u, t), t). (4.3)
∂t
We are free to choose how to reparametrize our curve – that is, we may choose V and then
solve the differential equation (4.3) for ϕ(u, t). We choose V so that our system of csf with
time-dependent reparametrization is strongly parabolic:
 
1 [xu yu ] xuu
V = 2 . (4.4)
(xu + yu2 )2 yuu

Then we have, setting X̃ = (x̃, ỹ),


       
∂ x̃ 1 yu [yu −xu ] xu [xu yu ] xuu
= +
∂t ỹ (x2u + yu2 )2 −xu yu yuu
  
1 1 0 xuu
= .
(xu + yu ) 0
2 2 1 yuu

We can now express xu , xuu , yu , yuu in terms of x̃u , x̃uu , ỹu , ỹuu using the chain rule. This gives
us a second-order differential equation for (x̃, ỹ), which is strongly parabolic because the matrix
multiplying (x̃uu , ỹuu ) is positive definite. By Theorem A.1 there exists a solution X̃ = (x̃, ỹ) on
some time interval [0, T ), and the solution is unique for as long as it exists. Using this solution it is
possible to reconstruct ϕ(u, t) from (4.3) – because we know x̃, ỹ this is a simple ode. Once we know
the reparametrization used, we can find X(u, t) = X̃(ϕ−1 (u, t), t) and show that it is a solution of
the csf. Therefore, although the csf is not a strongly parabolic system, by reparametrizing it we
can show that it has a solution on some time interval [0, T ), and it is not difficult to further prove
that the solution is unique for as long as it exists. Because of this uniqueness we may without
ambiguity choose

T = sup{τ : X(u, t) exists and is smooth for t ∈ [0, τ )}.

In this situation, we call [0, T ) a maximal time interval – it is the largest time interval on which
the solution exists (note that T could be ∞).
Theorem 4.2. For any smooth immersion X0 : R/Z → R2 , there is a unique smooth solution

X : R/Z × [0, T ) → R2

satisfying the csf and with


X(u, 0) = X0 (u)
on a maximal time interval [0, T ).

4.3 Finite Time Singularity


Theorem 4.3. The Avoidance Principle. If X, Y are solutions of the csf on [0, T ), and
X(u, 0) 6= Y (v, 0) for all u, v, then X(u, t) 6= Y (v, t) for all u, v, for all t ∈ [0, T ). That is, if the
curves do not intersect initially, they will not intersect at any later time.
This theorem can be proven using a version of the maximum principle to show that the smallest
distance between the two curves is a nondecreasing function of time.
Theorem 4.4. The csf exists for only a finite time before becoming singular. That is, the maximal
time T mentioned in Theorem 4.2 is finite.
Proof. Given any X0 , we can take a circle that surrounds it. Running the csf on the circle and the
curve X0 , the Avoidance Principle shows that the solution X can never move outside the evolving
circle. As we showed in (4.2), the circle shrinks to a point in a finite time, “crushing” our solution
X and forcing it to become singular in finite time. Therefore the maximal time T mentioned in
Theorem 4.2 must be finite.

39
4.4 Curvature Explodes
Theorem 4.5. Suppose we have a solution of the csf on a maximal time interval [0, T ), where
T < ∞ by Theorem 4.4. Then

lim sup{|κ(u, t)| : u ∈ R/Z} = ∞.


t→T

We will give an outline of the proof. The basic idea is to assume the curvature were bounded
as t → T , and show that the maps Xt (·) = X(·, t) must then converge so nicely as t → T that it is
possible to extend the solution past T , contradicting the assumption that [0, T ) was the maximal
time interval on which the solution existed.
By differentiating equation (4.1) one can obtain an evolution equation for the curvature:

∂κ ∂2κ
= + κ3 .
∂t ∂s2
This is a heat-type equation. From it we can derive evolution equations for the derivatives of
the curvature ∂ n κ/∂sn , which will also be heat-type equations. Assuming an upper bound on κ,
one can use the maximum principle (in the form of Theorem 3.2) to prove upper bounds on the
derivatives of κ. These bounds are analogous to the bounds obtained in Chapter 6 for the Ricci
flow.
From the bounds on the derivatives of the curvature, it is possible to prove bounds on the
derivatives of X. Once can show that all of the quantities
k l
∂ ∂
∂uk ∂tl X

are bounded above. This means that all derivatives of X are equicontinuous, so by the Ascoli-Arzela
theorem, for any k, l > 0 there is a sequence of times (ti ) such that the maps

∂k ∂l
X(·, ti )
∂uk ∂tl
converge uniformly. Because the time-derivatives are also bounded, this means that the maps

∂k ∂ l
X(·, t)
∂uk ∂tl
converge uniformly. Therefore the map X(·, t) and all its derivatives converge uniformly to some
smooth map X(·, T ). We can then use Theorem 4.2 to find a solution X̄ to the csf with the initial
curve X̄0 (·) = X(·, T ), defined on some time interval [0, ). The solution X̄ can now be “glued onto
the end of X” to extend the solution X past T . Defining X(·, t) = X̄(·, t − T ) for t ∈ [T, T + )
(and leaving X unchanged for t ∈ [0, T )), we obtain a smooth solution1 X to the csf with the
given initial data, which is defined on the time interval [0, T + ). This contradicts the maximality
of T . Hence the original assumption that the curvature was bounded was incorrect.

4.5 Grayson’s Theorem


We have seen that our curve becomes singular in finite time; in fact it can be shown that an
embedded circle will shrink to a point and become round as it approaches the maximal time T .
What does “become round” mean though, when our solution is shrinking to a point? To make
sense of this, we “blow up” our solution – we dilate it by a time-dependent factor so that the area
enclosed by the curve is constant. Then the solution will not shrink to a point, but will stay of the
same size, and “becomes round” simply means that the curve converges to a circle.
1 The solution has smooth u-derivatives at t = T because of the uniform convergence of all derivatives of X(·, t)

to X(·, T ) as t → T . The time-derivatives can be expressed in terms of the space derivatives using the equation of
the csf, so they too are smooth.

40
Theorem 4.6. (Grayson) Given any embedded circle X0 , the solution to the csf X(u, t) with
X(u, 0) = X0 (u) will remain embedded and will shrink to a point x as t approaches the maximal
time of existence T . Furthermore, if A(t) denotes the area enclosed by the curve X(·, t) at time t,
then the curve r
π
X̃(u, t) = (X(u, t) − x)
A(t)
converges exponentially to a unit circle as t → T .

Proof. Once again, we will not give a complete proof but will outline one of the main concepts,
because it is analogous to the blowup technique that we will introduce for the Ricci flow in Chapter
8. See [30, Chap. 1 and Sec. 5.2] for the full proof of this theorem.
If the initial curve X0 is convex, one can prove using derivative estimates that the curvature
converges uniformly to 1, and all derivatives of curvature are bounded. Once again the Ascoli-
Arzela theorem can be employed to show that a limit exists, and that the limit is a smooth
embedded curve with curvature 1, hence the unit circle (see [30, Chap. 1] for the details). This
result is known as the Gage-Hamilton Theorem (a corresponding result for the mean curvature
flow was proven by Gerhard Huisken).
Grayson proved that any initial curve will eventually become convex under the csf, from which
the result follows by the Gage-Hamilton Theorem. We will outline the proof presented in [30, Sec.
5.2]. We use the result of Theorem 4.5, which says that the curvature explodes as we approach
some singular time T < ∞. This means we may choose points pn ∈ R/Z and times tn that converge
to T such that
|κ(pn , tn )| = sup |κ(p, t)|,
p∈R/Z,t≤tn

and define Mn := |κ(pn , tn )|, so that limn→∞ Mn = ∞. We then define rescaled flows Xn by
   
t
Xn (p, t) := Mn X pn + p, tn + 2 − X(pn , tn ) .
Mn

It is not hard to see that these rescaled flows are solutions of the csf defined for t ∈ [−Mn2 tn , Mn2 (T −
tn )), with Xn (0, 0) = 0 and |κn (p, t)| ≤ |κn (0, 0)| = 1 for t ∈ [−Mn tn , 0]. This technique is known
as “blowing up” at a singularity; we rescale the flow so that the curvature no longer diverges.
We saw in the discussion of Theorem 4.5 that it is possible to deduce bounds on derivatives of
the curvature from a bound on the curvature, using the maximum principle. Moreover, one can
get bounds on the derivatives of the maps Xn : R × [−Mn tn , 0] (considering Xn as a periodic map
on the real line) from the bounds on derivatives of the curvature. Hence, by the Ascoli-Arzela
theorem there is a subsequence of Xn that converges uniformly in all C k norms on all compact
subsets of R × (−∞, 0] to a limit solution X∞ : R × (−∞, 0] → R2 . This limit solution satisfies
X∞ (0, 0) = 0 and κ∞ (p, t) = κ∞ (0, 0) = 1 for all t ≤ 0.
After considerable work, one can show that this limit of rescaled solutions is convex with
compact image for all t ∈ (−∞, 0], from which it follows that some Xn is eventually convex and
hence the csf X(·, t) is eventually convex for large enough t, as required.

41
Chapter 5

Short Time Existence and


Uniqueness of the Ricci Flow

Before we can do anything with the Ricci flow, we must show that a solution exists for a short
time. We would like to apply the short-time existence and uniqueness theorem for parabolic pdes
(Theorem A.1) to the system


g(t) = −2Rc(g(t))
∂t
g(0) = g0 .

We need to check if the system is strongly parabolic. This chapter is based on [6, Chap. 3] and
[28, Chap. 5].

5.1 The Linearization of the Ricci Tensor


The first thing to do is to work out the linearization of the Ricci tensor in the sense described in
Appendix A. Recall that, if we have a time-dependent metric tensor gij (t) (we are not assuming
that this time-dependent metric satisfies the Ricci flow or any other specific equation), we define
the linearization of the Ricci tensor, D[Rc] : C ∞ (T ∗ M ⊗S T ∗ M) → C ∞ (T ∗ M ⊗S T ∗ M) so that
 
∂gij ∂
D[Rc] = Rc(gij (t)).
∂t ∂t

Lemma 5.1. The linearization of the Ricci tensor is given by


1 pq
D[Rc](h)ij = g (−∇p ∇q hij − ∇i ∇j hpq + ∇q ∇i hjp + ∇q ∇j hip ).
2
Proof. This follows from equation (1.19). We have interchanged the indices p and q in places,
which we can do because the metric inverse g pq is symmetric.
To check if the system is strongly parabolic we must compute the principal symbol (see Ap-
pendix A for the definition). The symbol can be computed easily from the linearization:

Corollary 5.2. The principal symbol of the differential operator −2Rc (as a function of the metric
g) is:
σ̂[−2Rc](ϕ)(h)ij = g pq (ϕp ϕq hij + ϕi ϕj hpq − ϕq ϕi hjp − ϕq ϕj hip ).
Now we recall that the Ricci flow is strongly parabolic if there exists δ > 0 such that for all
covectors ϕ 6= 0 and all (symmetric) hij 6= 0,

hσ̂[−2Rc](ϕ)(h), hi > δ|ϕ|2 |h|2

42
or, by Corollary 5.2,

g pq (ϕp ϕq hij + ϕi ϕj hpq − ϕq ϕi hjp − ϕq ϕj hip )hij > δϕk ϕk hrs hrs .

However, if we choose hij = ϕi ϕj , it is clear that the LHS of this equation is 0, so the inequality
can not hold. Therefore the Ricci flow is not strongly parabolic.

5.2 The DeTurck Trick


Because the Ricci flow is not strongly parabolic, we can not immediately apply Theorem A.1.
Hamilton, in his original paper [9], used the Nash-Moser inverse function theorem to prove short-
time existence and uniqueness. Shortly afterwards, Dennis DeTurck introduced (in [7]) a much
simpler way of proving the short-time existence of the Ricci flow. In this section we show how one
does the “DeTurck Trick”.
We first rewrite the linearization of the Ricci tensor so that it is possible to see which terms
are making it non-parabolic.
Lemma 5.3. The linearization of −2Rc is equal to

D[−2Rc](h)ij = g pq ∇p ∇q hij + ∇i Vj + ∇j Vi + lower-order terms in h (5.1)

where
1
Vi = g pq ( ∇i hpq − ∇q hpi ).
2
Proof. From Lemma 5.1 we have

D[−2Rc](h)ij = g pq (∇p ∇q hij + ∇i ∇j hpq − ∇q ∇i hjp − ∇q ∇j hip ). (5.2)

Applying the formula (B.3) for commuting covariant derivatives, we obtain


r r
∇q ∇i hjp = ∇i ∇q hjp − Rqij hrp − Rqip hjm
= ∇i ∇q hjp + lower-order terms in h.

Lower-order terms in h make no contribution to the principal symbol. Thus, as far as the principal
symbol is concerned, covariant derivatives commute. We can therefore rearrange equation (5.2) by
commuting covariant derivatives to give
1 1
D[−2Rc](h)ij = g pq ∇p ∇q hij + ∇i ( ∇j hpq − ∇q hjp ) + ∇j ( ∇i hpq − ∇q hip )
2 2
+ lower-order terms in h
= g pq ∇p ∇q hij + ∇i Vj + ∇j Vi + lower-order terms in h

(recalling that ∇g = 0).


The lower-order terms in h make no contribution to the principal symbol. The first term in
equation (5.1) is a good term (we can identify it as a Laplacian, which has a strictly positive
principal symbol), but the terms in V are bad: they make the Ricci flow non-parabolic.
The situation is analogous to that for the curve-shortening flow described in Section 4.2: the
flow is not parabolic because we have a bad parametrization of our manifold. To obtain a parabolic
system, we must introduce a time-dependent reparametrization.
In analogy with the curve-shortening flow, we seek a system

ḡt = P (ḡt ) (5.3)
∂t
that is parabolic, and a time-dependent diffeomorphism ϕt from our manifold M to itself, with
ϕ0 = id, so that the metric
gt = ϕ∗t ḡt

43
is a solution to the Ricci flow. We can compute
∂ ∂ (ϕ∗t ḡt )
gt =
∂t ∂t
!
∂ ϕ∗t+s ḡt+s
=
∂s
s=0
!
∂ ϕ∗t+s ḡt
 
∗ ∂ḡt+s
= ϕt +
∂s s=0 ∂s
s=0
= ϕ∗t P (ḡt ) + ϕ∗t L ∂ϕt ḡt .
∂t

Here we have used the product rule and then the rule for the Lie derivative on tensors (see Section
1.3). We choose ϕt to satisfy

ϕt = W (t)
∂t
ϕ0 = id

for some time-dependent vector field W (t) (this system will have a solution ϕt for as long as W (t)
exists – see [6, Lem. 3.15]). The problem now reduces to finding a differential operator P such
that the system (5.3) is strongly parabolic, and a time-dependent vector field W (t) such that, if
we define a 1-parameter family of diffeomorphisms that satisfies (5.4), then

ϕ∗t P (ḡt ) + ϕ∗t LW (t) ḡt = −2Rc(ϕ∗t ḡt ) = −2ϕ∗t Rc(ḡt )

(using the diffeomorphism invariance of the Ricci tensor). This is equivalent to

P (ḡt ) = −2Rc(ḡt ) − LW (t) ḡt .

Note that the choice of W (t) corresponds to the choice of V in Section 4.2.
Now by Lemma 1.7 we have

(LW ḡt )ij = ∇i Wj + ∇j Wi .

Therefore we can use Lemma 5.3 to write the linearization of P as

D[P ](h) = g pq ∇p ∇q hij + ∇i Vj + ∇j Vi + lower-order terms in h (5.4)


−D[∇i Wj + ∇j Wi ](h). (5.5)

Recall that the first term in equation (5.4) is a good term: if the other second-order terms cancelled
then the linearization would satisfy the condition for parabolicity. So our aim is to choose W in
such a way that the second-order part of the term (5.5) cancels the second-order part of the terms
in V . This will happen if the principal part of the linearization of Wi is equal to that of Vi 1 .
We defined V by
1
Vi = g pq ( ∇i hpq − ∇p hqi )
2
1
= − g pq (∇p hqi + ∇q hpi − ∇i hpq ).
2
We recall from formula (1.17) that
∂  1
D[Γkij ](h) = Γkij (g(t)) = g kl (∇i hjl + ∇j hil − ∇l hij ),
∂t 2

where hij = ∂t gij . This looks pretty similar to the form of V , so we might try

Wi = −g pq gij Γjpq .
1 Note that, as far as the principal symbol is concerned, a covariant derivative ∇ is the same as a coordinate
i
derivative ∂i by formula (1.3). Therefore, if the principal symbol of the linearization of Wi is equal to that of Vi
then the principal symbol of the linearization of ∇j Wi is equal to that of ∇j Vi .

44
We need to be careful though – this expression depends on the coordinates we choose, because the
Christoffel symbol Γ is not a tensor. However, the difference between two connections is a tensor,
so we can fix a constant connection with Christoffel symbols Γ̃kij and define
 
Wi = −g pq gij Γjpq − Γ̃jpq ,

which is the coordinate form of some coordinate-independent vector field. The fixed background
connection, because it is independent of the metric, will make no contribution to the symbol of W ,
so the principal symbol of Wi will be equal to that of Vi and all of the second-order terms other
than the first in equation (5.4) will cancel.
Thus, making this choice of W , the principal symbol of the linearization of P is just

σ̂(D[P ])(ϕ)(h)ij = g pq ϕp ϕq hij ,

so we have
hσ̂(D[P ])(ϕ)(h), hi = |ϕ|2 |h|2 .
Thus the Ricci-DeTurck flow defined by

ḡij = P (ḡ) = −2R̄ij + ∇i Wj + ∇j Wi
∂t
is strongly parabolic, and therefore has a solution for a short time by Theorem A.1.
For as long as this solution exists, the vector field W (t) exists, and the time-dependent dif-
feomorphisms ϕt can be obtained by solving the ode (5.4) with initial condition ϕ0 =id. Once
we know the ϕt exist, the above calculations show that the pullback metrics gt = ϕ∗t ḡt satisfy the
Ricci flow equation. Thus, the Ricci flow has a solution for a short time. In fact the solution is
also unique (see [6, Sec. 4.4]), so we have the following result:
Theorem 5.4. Given a smooth Riemannian metric g0 on a closed manifold M, there exists a
maximal time interval [0, T ) such that a solution g(t) of the Ricci flow, with g(0) = g0 , exists and
is smooth on [0, T ), and this solution is unique.

45
Chapter 6

Derivative Estimates and


Curvature Explosion at
Singularities

In this chapter we will use the maximum principle in some creative ways to obtain bounds on
derivatives of the curvature and metric evolving under the Ricci flow. We will then use our
estimates to show that the curvature must explode1 as we approach a finite-time singularity in the
Ricci flow, in analogy with Theorem 4.5 for the curve-shortening flow. This result is a key part of
the proof of Theorem 2.1 that we will present in Chapter 7. The derivative estimates themselves
are also vital to the proof of the compactness result presented in Chapter 8. Our exposition is
based on [6, Chap. 6,7] (with the exception of Corollary 6.13 which is based on results in [28] and
Section 6.2 which is based on [9, Sec. 13]), with significant reformulation of many of the arguments.

6.1 Evolution of Geometric Quantities Under the Ricci Flow


To apply maximum principle arguments to the curvature, we need to know what the equations
describing the evolution of curvature quantities under the Ricci flow are. We have already done
the bulk of the calculations in Lemma 1.20: the evolution equations for the Ricci flow follow by
substituting hij = −2Rij into those formulae.
Lemma 6.1. Suppose that gij (t) is a solution of the Ricci flow:


gij = −2Rij .
∂t
Then the various geometric quantities evolve according to the following equations:
1. Metric inverse:
∂ ij
g = 2Rij .
∂t
2. Christoffel symbols:
∂ k
Γ = −g kl (∇i Rjl + ∇j Ril − ∇l Rij ).
∂t ij
3. Riemann curvature tensor:

Rijkl = 4Rijkl + 2(Bijkl − Bijlk + Bikjl − Biljk )
∂t
−(Rip Rpjkl + Rjp Ripkl + Rkp Rijpl + Rlp Rijkp ),
1 Some books on the Ricci flow use the words “blow up” here – unfortunately this expressive nomenclature is

already employed to describe our method of examining singularities as in Chapter 8, so we substitute “explode”.
All it means is that the curvature goes to +∞, as detailed in Theorem 6.7.

46
where
q p
Bijkl ≡ −Rpij Rqlk .
See Definition 1.13 for the definition of the Laplacian 4.
4. Ricci tensor:

Rij = 4Rij + 2Rpijq Rpq − 2Rip Rpj .
∂t
5. Scalar curvature:

R = 4R + 2|Rc|2 .
∂t
6. Einstein tensor on a 3-manifold:
∂ 26
|E|2 = 4|E|2 − 2|∇E|2 − 8Rij Rjk Rki + R|Rc|2 − 2R3 .
∂t 3

7. Volume element:

dµ = −Rdµ.
∂t
8. Volume of manifold: Z Z

dµ = − Rdµ.
∂t M M

9. Total scalar curvature (on a closed manifold):


Z Z

−R2 + 2|Rc|2 dµ.

Rdµ =
∂t M M

Proof. Most of these follow easily from Lemma 1.20, but some require a bit of extra fiddling to put
them in a nice form. The proof of the equation for the Riemann curvature tensor in particular is
rather lengthy and is contained (along with those for Rc and R) in [6, Sec. 6.1]. The main technique
is to use the identity (B.3) to commute covariant derivatives, then use the Bianchi identities.
The proof of the evolution equation for |E|2 in dimension 3 can be found in [6, Cor. 6.39]. It
can be proven using the formula
1
|E|2 = |Rc|2 − R2 ,
3
the evolution equations for Rc and R given above, and the result of Lemma 1.13 to express the
Riemann curvature tensor in terms of the Ricci tensor. We will not use this result in the current
chapter, but it comes in handy in Chapter 7.

6.2 Evolution Equations for Derivatives of Curvature


We aim to obtain bounds on the derivatives of the curvature, i.e. on quantities of the form |∇k Rm|2
(here the k is not a raised index, it indicates the kth iterated covariant derivative). Our hope is
to apply the maximum principle of Theorem 3.2 to get such bounds, but before we can apply this
theorem we must have some pdes that describe the evolution of the quantities.
Lemma 6.2. If A is a tensor quantity that satisfies a heat-type evolution equation:

A = 4A + F
∂t
under the Ricci flow (where F is a tensor of the same type as A), then the square of its norm
satisfies the heat-type equation

|A|2 = 4|A|2 − 2|∇A|2 + F ∗ A + Rc ∗ A∗2 .
∂t

47
Proof. We use gt to denote the metric at time t.
 
∂ ∂ ∂gt
gt (A, A) = 2gt A, A + (A, A)
∂t ∂t ∂t
= 2gt (4A + F, A) + Rc ∗ A∗2
= 4|A|2 − 2|∇A|2 + F ∗ A + Rc ∗ A∗2
where the final term in the second line comes from the derivative of the metric (which is −2Rc for
the Ricci flow) and we have used the identity
4|A|2 = 2h4A, Ai + 2|∇A|2 .

Lemma 6.3. If A is a tensor quantity that satisfies a heat-type evolution equation



A = 4A + F
∂t
under the Ricci flow (where F is a tensor of the same type as A), then its covariant derivative
satisfies a heat-type equation

∇A = 4(∇A) + ∇F + Rm ∗ ∇A + ∇Rc ∗ A.
∂t
Proof. Recall from formula (1.3) that ∇A has the form
∇A = ∂A + f (Γ, A)
where f (Γ, A) is some expression of the form Γ ∗ A, which depends on the type of the tensor A.
Also, by Lemma 6.1 we have
∂t Γ = g −1 ∗ ∇Rc.


It follows that
∂t ∇A = ∂t ∂A + ∂t f (Γ, A)
= ∂∂t A + f (Γ, ∂t A) + f (∂t Γ, A) (by the product rule)
= ∇(∂t A) + f (g −1 ∗ ∇Rc, A) (because ∂t A is a tensor of the same type as A)
= ∇(4A + F ) + ∇Rc ∗ A
= (4∇A + Rm ∗ ∇A + ∇Rc ∗ A) + ∇F + ∇Rc ∗ A
= 4∇A + ∇F + Rm ∗ ∇A + ∇Rc ∗ A.
We used the formula
[∇, 4]A := ∇4A − 4∇A = Rm ∗ ∇A + ∇Rc ∗ A,
which follows from careful use of formula (B.3) followed by the second Bianchi identity (1.8) (see
[6, p. 227]).
Now we note that the formula for the evolution of the Riemann curvature tensor under the
Ricci flow given in Lemma 6.1 translates into ∗-notation as

Rm = 4Rm + Rm∗2 . (6.1)
∂t
This allows us to use the preceding lemmas to calculate evolution equations for covariant derivatives
of the metric.
Lemma 6.4. The evolution equation for the kth iterated covariant derivative of the Riemann
curvature tensor under the Ricci flow is:
k
∂ k X
∇ Rm = 4∇k Rm + ∇j Rm ∗ ∇k−j Rm.
∂t j=0

48
Proof. We prove the result by induction. The evolution equation for Rm given above in equation
(6.1) is the base case k = 0. We assume the relation holds true for a given k and apply Lemma 6.3
with A = ∇k Rm and
Xk
F = ∇j Rm ∗ ∇k−j Rm.
j=0

This tells us that



∇∇k Rm = 4(∇∇k Rm) + ∇F + Rm ∗ ∇(∇k Rm) + ∇Rc ∗ ∇k Rm. (6.2)
∂t
It is clear that all of the reaction terms on the rhs are of the form ∇i Rm∗∇j Rm where i+j = k+1,
hence
k+1
∂ k+1 k+1
X
∇ Rm = 4∇ Rm + ∇j Rm ∗ ∇k+1−j Rm,
∂t j=0

completing the inductive step.


Corollary 6.5. The square of the norm of the kth covariant derivative of the Riemann curvature
tensor satisfies the heat-type equation
k
∂ k X
|∇ Rm|2 = 4|∇k Rm|2 − 2|∇k+1 Rm|2 + ∇j Rm ∗ ∇k−j Rm ∗ ∇k Rm. (6.3)
∂t j=0

Proof. We simply apply Lemma 6.2 with A = ∇k Rm and


k
X
F = ∇j Rm ∗ ∇k−j Rm,
j=0

by the result of Lemma 6.4. The result is


∂ k
|∇ Rm|2 = 4|∇k Rm|2 − 2|∇k+1 Rm|2 + F ∗ ∇k Rm + Rc ∗ (∇k Rm)∗2 .
∂t
It is clear that all the terms on the left hand side other than the first two are of the form ∇i Rm ∗
∇j Rm ∗ ∇k Rm where i + j = k, and hence
k
∂ k X
|∇ Rm|2 = 4|∇k Rm|2 − 2|∇k+1 Rm|2 + ∇j Rm ∗ ∇k−j Rm ∗ ∇k Rm
∂t j=0

as required.

6.3 The Bernstein-Bando-Shi Estimates


We now apply the maximum principle to the evolution equations derived in the last section to get
bounds on the derivatives of the curvature. We aim to obtain these bounds under the assumption
that the curvature itself is bounded above by some constant, |Rm| < K. There are two problems we
face in trying to apply the maximum principle to the evolution equation we derived for the covariant
derivatives of the curvature: firstly, we can not guarantee any initial conditions on the derivatives
of the curvature if we are only given bounds on the curvature, and secondly the evolution equation
has some terms in it which we are not sure how to control (namely the terms of the summation in
equation (6.3)).
We bypass the first problem by proving time-dependent upper bounds that diverge at t = 0
(thus they only give us useful information after t = 0). We will see how to bypass the second in
the process of the proof.

49
Theorem 6.6. The Bernstein-Bando-Shi Estimates. Let (Mn , g(t)) be a solution of the
Ricci flow on a closed n-manifold. Then for each α > 0 and m ∈ N, there exists a constant Cm
depending only on m, n and max{α, 1} such that if
α
|Rm(x, t)|g(t) ≤ K for all t ∈ [0, K ],
then
Cm K
|∇m Rm(x, t)|g(t) ≤ α
for all t ∈ (0, K ].
tm/2
Proof. We prove the result by induction on m. For m = 0 the result is true by hypothesis, with
C0 = 1. Assume the result is true for all p ≤ m − 1. The result of Corollary 6.5 tells us that
m
∂ m X
|∇ Rm|2 = 4|∇m Rm|2 − 2|∇m+1 Rm|2 + ∇j Rm ∗ ∇m−j Rm ∗ ∇m Rm
∂t j=0
m
X
≤ 4|∇m Rm|2 − 2|∇m+1 Rm|2 + cmj |∇j Rm||∇m−j Rm||∇m Rm|
j=0
 
m−1
X Cj Cm−j  2 m
≤ 4|∇m Rm|2 − 2|∇m+1 Rm|2 +  cmj j/2 (m−j)/2 K |∇ Rm|
j=1
t t
m 2
+ (cm0 + cmm )K|∇ Rm|
00
0 Cm
≤ 4|∇m Rm|2 − 2|∇m+1 Rm|2 + Cm K|∇m Rm|2 + m/2
K 2 |∇m Rm|
t
α 0 00
for t ∈ (0, K ], where Cm , Cm are constants depending only on m and n. We have used the inductive
hypothesis to go from the second to the third line. We can complete the square (in the variable
|∇m Rm|) on the right hand side then use the inequality (a + b)2 /2 ≤ a2 + b2 to obtain
K2
 
∂ m 2 m 2 m+1 2 m 2
|∇ Rm| ≤ 4|∇ Rm| − 2|∇ Rm| + C̄m K |∇ Rm| + m (6.4)
∂t t
for some constant C̄m .
To get the desired bound, we need to find an upper bound on tm |∇m Rm|2 . This quantity
clearly has an upper bound at t = 0 because it is equal to 0. So to apply the maximum principle
and get an upper bound we need only show that the reaction terms in its evolution equation cause
it to decrease. The problem we have is that this quantity satisfies the differential inequality
∂ m m
(t |∇ Rm|2 ) ≤ 4(tm |∇m Rm|2 ) − 2tm |∇m+1 Rm|2 + (6.5)
∂t
(C̄m Kt + m)tm−1 |∇m Rm|2 + C̄m K 3 , (6.6)
and the reaction terms (6.6) are not negative. This is the “second difficulty” outlined at the start
of this section.
To fix this problem, we make use of the term −2|∇k+1 Rm|2 in the evolution equation of
Corollary 6.5. By adding the right amount of tm−1 |∇m−1 Rm|2 (which we know by our inductive
hypothesis is bounded above by a constant) we can cancel off the unruly reactionary terms involving
tm |∇m Rm|2 . In so doing, we will introduce new unruly terms in tm−1 |∇m−1 Rm|2 – so we will
need to add the right amount of the next derivative down and so on. We define
m−1
X
G = tm |∇m Rm|2 + αmj tj |∇j Rm|2 ,
j=0

where we will determine the constants αmj so that all the terms cancel as we want them to.
By equation (6.5), we have

G ≤ 4G + (C̄m Kt + m)tm−1 |∇m Rm|2 + C̄m K 3
∂t
m−1
X
αmj −2tj |∇j+1 Rm|2 + (C̄j Kt + j)tj−1 |∇j Rm|2 + C̄j K 3 .

+
j=0

50
By the inductive hypothesis there are numbers Dj depending only on j, n for 1 ≤ j ≤ m − 1 such
that
C̄j Ktj |∇j Rm|2 + C̄j K 3 ≤ Dj K 3
α
for all t ∈ (0, K ]. Hence we have


G ≤ 4G + (C̄m Kt + m)tm−1 |∇m Rm|2 + C̄m K 3 )
∂t
m−1
X
αmj −2tj |∇j+1 Rm|2 + jtj−1 |∇j Rm|2 + Dj K 3

+
j=0

= 4G + (C̄m Kt + m − 2αm,m−1 )tm−1 |∇m Rm|2


m−1
X
+ {jαmj − 2αm,j−1 } tj−1 |∇j Rm|2
j=0
m−1
X
+C̄m K 3 + αmj Dj K 3 .
j=0

Now we choose the αmj so that the terms in this equation cancel: choose αm,m−1 such that

0 = C̄m α + m − 2αm,m−1 ≥ C̄m Kt + m − 2αm,m−1 ,

where the second step follows because we are working with t ∈ (0, α/K]. Now define αm,m−2 ,
αm,m−3 , . . . , αm0 in that order, at each step setting

jαmj − 2αm,j−1 = 0.

If we now define
m−1
X
Bm := C̄m + αmj Dj
j=0

then our evolution can be written as



G ≤ 4G + Bm K 3 .
∂t
The reaction term is simply a constant, so it gives linear growth at worst. Because G = αm0 |Rm|2 ≤
αm0 K 2 at t = 0, the scalar maximum principle (Theorem 3.2) tells us that

G ≤ αm0 K 2 + Bm K 3 t ≤ (αm0 + Bm α)K 2 := Cm


2
K2
α
for t ∈ (0, K ], where Cm is a constant depending only on m, n and max{α, 1}.
Therefore, we have r
m G Cm K  αi
|∇ Rm| ≤ ≤ for t ∈ 0, ,
tm tm/2 K
as required.

6.4 Curvature Explodes at Finite-time Singularities


This section will be devoted to proving that, if the Ricci flow becomes singular in finite time, the
curvature must explode as we approach the singular time T .
Theorem 6.7. If g0 is a smooth metric on a compact manifold M, the Ricci flow with g(0) = g0
has a unique solution g(t) on a maximal time interval t ∈ [0, T ) where T ≤ ∞. If T < ∞ then
 
lim sup |Rm(x, t)| = ∞.
t→T x∈M

51
The proof that the curvature must explode as t → T follows the same outline as the proof
of the corresponding result for the curve-shortening flow, Theorem 4.5. That is, we assume that
|Rm|g is bounded above by a constant K, and show that the metric g(t) converges smoothly to a
smooth metric g(T ). It is then possible to use the short-time existence result of Theorem 5.4, with
initial metric g(T ), to extend the solution past T . This contradicts the choice of T as the maximal
time such that the Ricci flow exists on [0, T ).
A key element of the proof is the following theorem and its corollary, which show that a limit
metric g(T ) exists and is continuous:

Theorem 6.8. Let M be a closed manifold and g(t) a smooth time-dependent metric on M,
defined for t ∈ [0, T ). If there exists a constant C < ∞ such that
Z T

g(x, t)
∂t dt ≤ C (6.7)
0 g(t)

for all x ∈ M, then the metrics g(t) converge uniformly as t → T to a continuous metric g(T )
such that
e−C g(x, 0) ≤ g(x, T ) ≤ eC g(x, 0).
Note that this means g(T ) is uniformly equivalent to g(0).

Proof. Let x ∈ M, t0 ∈ [0, T ), V ∈ Tx M be arbitrary. Then


  Z t0
log g(x,t0 ) (V, V ) = ∂ 

log g(x,t) (V, V ) dt
g(x,0) (V, V )
0 ∂t
Z
t0 ∂ g (V, V )
∂t (x,t)
= dt

0 g(x,t) (V, V )
Z t0  
∂ V V
≤ ∂t g(x,t) |V |
, dt
g(t) |V |g(t)

0
Z t0

≤ g(x, t)
∂t dt
0 g(t)
≤ C

where the penultimate step follows as |A(U, U )| ≤ |A|g for any 2-tensor A and unit vector U .
Exponentiating both sides of this inequality gives us:

e−C g(x,0) (V, V ) ≤ g(x,t0 ) (V, V ) ≤ eC g(x,0) (V, V ),

for any V . Thus


e−C g(x, 0) ≤ g(x, t0 ) ≤ eC g(x, 0), (6.8)
so the metrics g(t) are uniformly equivalent.
Hence, we have
Z T

dt ≤ C 0

g(x, t) (6.9)
∂t
0 g(0)

for some C 0 > 0. Note the difference from equation (6.7): the norm is taken with respect to a
constant metric g(0) rather than the time-dependent one g(t).
We now define Z T

g(x, T ) = g(x, 0) + g(x, t)dt.
0 ∂t
This integral exists as the metrics are smooth and the integrand is absolutely integrable with
respect to the norm induced by g(0), by equation (6.9). Now
Z T

|g(x, t) − g(x, T )|g(0) ≤ ∂t g(x, t)
→0
t g(0)

52
as t → T for each x ∈ M. Because M is compact, this convergence is uniform on M. Hence
g(t) → g(T ) uniformly, so g(T ) is continuous. By taking the limit of equation (6.8) we can show
that
e−C g(x, 0) ≤ g(x, T ) ≤ eC g(x, 0),
hence that g(T ) is positive definite.
Therefore, the metrics g(t) converge uniformly to a continuous Riemannian metric g(T ) which
is uniformly equivalent to g(0).
Corollary 6.9. Let (M, g(t)) be a solution of the Ricci flow on a closed manifold. If |Rm|g is
bounded on [0, T ) (where T < ∞) then g(t) converges uniformly as t → T to a continuous metric
g(T ) which is uniformly equivalent to g(0).

Proof. Any bound on |Rm|g implies one on |Rc|g , and hence on | ∂t g|g (by the equation of the
Ricci flow). The integral (6.7) is then an integral of a bounded quantity over a finite interval, and
is hence bounded. Thus Theorem 6.8 applies.
We have now got a foothold: we have shown that there is a limit metric g(T ), and it is
continuous. We now want to show that this metric is smooth, because we need this if we are to use
the short-time existence result of Theorem 5.4 to extend our solution past T . To do this, we need
to make sure the spatial derivatives of g near the limit time T are not exploding – we need bounds
on them. The first step is to bound the derivatives of the curvature, in analogy with the outlined
proof of Theorem 4.5. We do this via the Bernstein-Bando-Shi derivative estimates (Theorem 6.6),
which give bounds on the derivatives of the curvature under the assumption of bounded curvature.
We recall that the Bernstein-Bando-Shi estimates are completely useless at t = 0 (as one might
expect, for bounds on an arbitrary curvature tensor will not tell us anything about its derivatives
– it is only after a period of Ricci flowing that the derivatives start to be brought under control).
That is fine for us because we are interested in derivative estimates near t = T . We can use
Theorem 6.6 to get such estimates, by considering our Ricci flow as starting at some time shortly
before T . This gives us

Corollary 6.10. (of Theorem 6.6) Let (Mn , g(t)) be a solution of the Ricci flow on a compact
n-manifold. If there exist β, K > 0 such that

|Rm(x, t)|g(t) ≤ K for all t ∈ [0, T ],

where T > β/K, then for each m ∈ N there exists a constant Bm depending only on m, n and
min{β, 1} such that
m
h i
|∇m Rm(x, t)|g(t) ≤ Bm K 1+ 2 for all t ∈ min{β,1}
K ,T .

Proof. We use the result of Theorem 6.6. Let β1 = min{β, 1}. Now, given a time t0 ∈ [β1 /K, T ],
we consider the Ricci flow as starting at the time T0 = t0 − β1 /K. Applying Theorem 6.6 to this
Ricci flow, with α = β1 , tells us that
Cm K
|∇m Rm| ≤
(t − T0 )m/2

where Cm depends only on m, n and min{α, 1}. Hence at t = t0 we have

Cm K Cm 1+m/2
|∇m Rm| ≤ = K ,
β1 m/2
(K) β1m

from which the result follows.

Using this bound on the derivatives of curvature near t = T , we can now proceed to bound the
derivatives of the metric g near t = T . The following corollary and its proof are a reformulation of
the argument found in [6, Prop. 6.48].

53
Corollary 6.11. Let (Mn , g(t)) be a solution of the Ricci flow on a closed n-manifold, and let
(xi ), i = 1, . . . , n be a local coordinate system defined on some coordinate chart U ⊂ Mn . If there
exists K > 0 such that
|Rm(x, t)|g(t) ≤ K for all t ∈ [0, T )
0
then for each m ∈ N there exist constants Cm , Cm depending only on the chosen coordinate chart
such that
|∂ m g(x, t)| ≤ Cm
and
0
|∂ m Rc(x, t)| ≤ Cm
for all (x, t) ∈ U × [0, T ), where the norms are taken with respect to the Euclidean metric in the
coordinate system (xi ).
We should explain some of the notation used. By ∂ m g we mean the m+2

0 -tensor field, defined
only in the coordinate chart U , which has coordinates

∂i1 . . . ∂im gpq

with respect to the coordinate system (xi ). The Euclidean metric, which is also defined only in U ,
is the metric which has coordinates δij with respect to the coordinate system (xi ).
Proof. Throughout this proof, we will treat the Christoffel symbols Γkij as the coordinates, with
respect to the coordinate system (xi ), of a tensor Γ (Γ is defined only in U ).
Note that by Corollary 6.10, for each m ∈ N there is a uniform upper bound on |∇m Rc| on a
time interval (β/K, T ). There is also an upper bound on the same quantity on the interval [0, β/K]
because the interval is compact, so the quantity is bounded for all m:

|∇m Rc| ≤ Dm (6.10)

for all x ∈ Mn and t ∈ [0, T ), where Dm is some constant depending on m and the particular Ricci
flow (Mn , g(t)).
We will now prove by strong induction that there exist constants Pm , Qm , Rm for each m ∈
N ∪ {0} such that
1. |∂ m−1 Γ| ≤ Pm (we only prove this for m ≥ 1);
2. |∂ m Rc| ≤ Qm ;
3. |∂ m g| ≤ Rm
for all t ∈ [0, T ).
For the base case m = 0, (2) follows from the bound |Rm| ≤ K and (3) follows from Corollary
6.9.
Assume (1) – (3) are satisfied for all m ≤ p − 1. We will prove they are true for m = p, starting
with (1). Note that there is some constant C such that |∂ p−1 Γ| ≤ C at t = 0, because the manifold
Mn is compact.
Bounds on |∂ m g| imply bounds on |∂ m (g −1 )| by differentiating the formula g ij gjk = δki , m
times. Recalling the formula for ∂t Γ from Lemma 6.1, we can prove:

∂t ∂ p−1 Γ = ∂ p−1 (∂t Γ)


= ∂ p−1 (g −1 ∗ ∇Rc)
p−1
X
= ∂ p−i−1 (g −1 ) ∗ ∂ i ∇Rc.
i=0

We have bounds on the derivatives of g −1 , so we just need to bound the other terms. For i ≤ p − 1
we have, by Lemma 1.3 (recalling the notation from the end of Section 1.1),

∂ i ∇Rc = ∇i+1 Rc + ∗ ∂ j Γ, ∂ k Rc .
0≤j ≤i−1


k≤i

54
Note that we have a bound on ∇i+1 Rc by equation (6.10), and an inductive upper bound on all
of the other terms on the right hand side, because j ≤ i − 1 ≤ p − 2 and k ≤ i ≤ p − 1. Hence
|∂ i ∇Rc| ≤ C for i ≤ p − 1, therefore
|∂t ∂ p−1 Γ| ≤ C
for some constant C (this C is different from the previous one – there are so many constant uniform
bounds at play in this and subsequent arguments that we will refer to most of them as ‘C’ for
convenience, and hope that this does not cause too much confusion). Thus |∂ p−1 Γ| is bounded
at t = 0 and experiences, at worst, linear growth on the finite time interval [0, T ). Therefore
|∂ p−1 Γ| ≤ Pp for some constant Pp . This completes the proof of (1).
We can now compute (again by Lemma 1.3)

∂ p Rc = ∇p Rc +
j ≤p−1
∗ ∂ j Γ, ∂ k Rc .


k ≤p−1

All terms on the right hand side are bounded inductively (with the exception of ∂ p−1 Γ, which we
just showed was bounded as part of the inductive step), so we have |∂ p Rc| ≤ Qp for some constant
Qp . This completes the proof of (2).
Finally,
|∂t ∂ p g| = | − 2∂ p Rc| ≤ C
by the second part of the inductive step, so because we are on a finite time interval, |∂ p g| ≤ Rp
for some constant Rp . This proves (3), completing the inductive step.
Therefore the result is true by induction.
Corollary 6.12. The metric g(T ) of Corollary 6.9 is smooth, and the metrics g(t) converge
uniformly in every C k norm to g(T ) as t → T .
Proof. To show that g(T ) is smooth we must take derivatives with respect to some system of
coordinates which we can only choose arbitrarily, so take some coordinate patch U of Mn . We
have, from the Ricci flow equation,
Z T
gij (x, T ) = gij (x, t) − 2 Rij (x, τ )dτ
t

for any t ∈ [0, T ).


∂ |α|
Now if α is any multi-index (see Definition A.1), then Corollary 6.11 tells us that ∂xα gij and
∂ |α|
∂xα Rij are uniformly bounded on U × [0, T ), and hence that we may take the derivative under the
integral sign:
Z T  |α|
∂ |α|
 |α|  
∂ ∂
gij (x, T ) = gij (x, t) − 2 Rij (x, τ )dτ (6.11)
∂xα ∂xα t ∂xα
for any x ∈ U . In particular, the lhs of the above equation exists for all α, so g(T ) is smooth.
Now we show that the convergence is uniform in every C m norm, in the following sense: we
can choose coordinate charts covering M, such that for any multi-index α, and any  > 0, there
exists δ > 0 such that |α|
∂ |α|


g
∂xα ij
(x, T ) − α
gij (x, t) < (6.12)
∂x
g(x,T )

in any of the chosen coordinate charts, for any t ∈ [T − δ, T ) and x ∈ Mn .


Because Mn is compact, we can choose a finite set of coordinate charts such that the closed
unit balls of the coordinate charts cover Mn . Because the closed unit balls are compact, the
Euclidean metric on each is equivalent to g(T ). Because there are finitely many coordinate charts,
the Euclidean metrics are uniformly equivalent to g(T ). Thus it suffices to prove that equation
(6.12) holds if we take the norm with respect to one of the Euclidean metrics at each point x,
rather than with respect to g(x, T ).
0
By Corollary 6.11, for each of the coordinate charts we have chosen there exists Cm such that
m 0
|∂ Rc| ≤ Cm with respect to the Euclidean norm in that chart. If we choose C to be the largest

55
0
of these Cm where m = |α| (C finite because there are only finitely many coordinate charts), then
equation (6.11) gives us
Z 
|α| T
∂ |α| ∂ |α|


g (x, T ) − g (x, t) = 2 R (x, τ )dτ

ij ij ij

∂xα ∂xα
t ∂xα


Z T
≤ 2 |∂ m Rc(x, τ )| dτ
t
≤ 2C(T − t).

It follows that g(t) → g(T ) uniformly in any C m norm as t → T .


We now prove Theorem 6.7. We assume, for a contradiction, that |Rm(x, t)|g is bounded above
by K. It follows from Corollaries 6.9 and 6.12 that the metrics g(t) converge uniformly in any C k
norm to a smooth metric g(T ).
Because g(T ) is smooth, it is possible to find a solution to the Ricci flow with initial metric
g(T ), by the result of Theorem 5.4. Thus our solution to the Ricci flow can be extended past t = T ,
as we did for the curve-shortening flow in the proof of Theorem 4.5. This extension is smooth,
because all spatial derivatives are continuous at t = T (by the convergence of g(t) in any C k norm).
It follows that all space-time derivatives are continuous at t = T because the Ricci flow equation
allows us to write time derivatives of quantities related to the metric in terms of space derivatives
of those quantities, and the space-derivatives have been shown to be continuous. Therefore, the
solution can be extended past the time t = T , so the time T could not have been maximal. This
is a contradiction, so the original assumption that |Rm(x, t)|g is bounded must be incorrect. This
completes the proof of Theorem 6.7. 
Corollary 6.13. Suppose that (M, g(t)) is a solution of the Ricci flow on a closed manifold, such
that |Rm| ≤ K at t = 0. Then there exists a positive constant b depending only on K and the
dimension n of the manifold such that the Ricci flow exists for all t ∈ [0, b).
Furthermore, there exists a constant C depending only on n such that
K
|Rm| ≤
1 − 21 CKt

for as long as the rhs exists.


Proof. We will first prove the second statement, then deduce the first.
Let u = |Rm|2 . By Corollary 6.5 with k = 0, we have

∂u
≤ 4u + Cu3/2
∂t
for some C > 0 depending only on n. By hypothesis, |Rm|2 ≤ K 2 at t = 0. The situation is perfect
for applying the maximum principle (Theorem 3.2). We need only solve the associated ode:


= Cφ3/2 , φ(0) = K 2 .
dt
The solution is  2
K
φ(t) = .
1 − 12 CKt
It follows by Theorem 3.2 that
 2
K
u≤ ,
1 − 21 CKt
from which it follows that
K
|Rm| ≤
1 − 21 CKt
as required.

56
Now let b = 1/(CK). We then have

|Rm(x, t)| ≤ 2K

for all t ∈ [0, b) and x ∈ M, and b depends only on K and the dimension of the manifold. Because

lim sup |Rm(x, t)| = ∞,


t→T x∈M

where T is the maximal time of existence of the Ricci flow, we must have b < T , hence the Ricci
flow exists for all t ∈ [0, b).

57
Chapter 7

3-Manifolds With Positive Ricci


Curvature

Hamilton’s first major achievement using the Ricci flow was the following, proved in [9].
Theorem 7.1. Let M3 be a closed 3-manifold which admits a smooth Riemannian metric with
strictly positive Ricci curvature. Then M3 also admits a smooth metric of constant positive cur-
vature.
In particular, if M3 is simply-connected, it follows from Theorem 1.16 that M3 is diffeomorphic
to S3 .
The proof in [9] was substantially improved by the introduction of the maximum principle that
we have called Theorem 3.3, in [10]. This improved proof is also presented in Chapter 6 of [6], and
it is this proof that we present in this chapter (we base our exposition on Chow and Knopf’s with
the exception of the proof of Theorem 7.19, the outline of which was suggested to us by Gerhard
Huisken). We will present a more “modern” proof (as presented in [28, 2, 21]) in Section 8.3, based
on Perelman’s recent work.
Because the proofs of some of the results we will use require lengthy calculations which distract
from the main course of the argument, we will relegate these details to Section 7.8.

7.1 The Plan of Attack


To prove Theorem 7.1 we will consider the Ricci flow on the closed 3-manifold M3 , starting from
a metric with strictly positive Ricci curvature. In this situation, the metric will become rounder
and rounder under the Ricci flow, but also smaller and smaller. The manifold shrinks to a point
in finite time, and its shape (locally) approaches that of a 3-sphere as we get closer to this time.
We want to take the limit as this finite time is approached, and show that the limit has constant
positive sectional curvature, but we are thwarted by the manifold shrinking to a point as the limit
time is approached. To overcome this problem, we rescale the manifold (and also time) so that the
volume of the manifold is constant, in analogy with the rescaling of the curve-shortening flow in
Theorem 4.6 to keep the enclosed area constant.
This rescaled metric will not shrink to a point. It will exist for all time, and in fact we can
show that it will converge smoothly to a smooth limit metric. While it is converging, its sectional
curvatures are getting exponentially closer and closer together, so that the limit metric has constant
positive sectional curvature. Thus the limit metric is exactly the one we need to prove Theorem
7.1.
Here is a brief outline of the proof of Hamilton’s theorem that we will present:

1. Prove the Ricci flow has a solution for a short time, and this solution is unique. Thus we
can consider a solution on a maximal time interval [0, T ). Note the analogy with Theorem
4.2 for the csf. In addition, T < ∞, in analogy with Theorem 4.4 for the csf. It follows by
Theorem 6.7 that the curvature explodes as t → T , in analogy with Theorem 4.5 for the csf.

58
2. Show that the sectional curvatures must get close to each other as the curvature explodes.

3. Rescale time and the metric to get a solution to the normalized (volume-preserving) Ricci
flow,
R
∂ 2 Mn RdV
g = −2Rc + R g
∂t n Mn dV
(in this case we have a 3-manifold, so n = 3). Show that this solution exists for all time, and
converges as t → ∞ to a metric of constant curvature.

7.2 Existence and Finite-time Explosion of Curvature


By Theorem 5.4, the Ricci flow has a unique solution on a maximal time interval [0, T ). We
now show that in the case under consideration, namely when the initial Ricci curvature is strictly
positive, T is finite.

Theorem 7.2. Let (M, g(t)) be a solution of the Ricci flow on a compact manifold, defined for
t ∈ [0, T ). If the metric g(0) has strictly positive scalar curvature (in particular, if it has strictly
positive Ricci curvature) then g(t) becomes singular in finite time, i.e. T < ∞.
Proof. Because M is compact and the scalar curvature at time 0 is strictly positive, it is bounded
below by some ρ > 0. Using the result for the evolution of the scalar curvature in Lemma 6.1 we
have
∂ 2
R = 4R + 2|Rc|2 ≥ 4R + R2
∂t n
(here we have used Lemma 1.1). We can now apply the scalar maximum principle (Theorem 3.2).
The solution of the ode
dφ 2
= φ2
dt n
with φ(0) = ρ is
1
φ(t) = 1 2t .
ρ − n

Thus by Theorem 3.2,


R(x, t) ≥ φ(t).
But φ(t) clearly diverges to +∞ in finite time, hence R(x, t) becomes singular and so the solution
g(t) becomes singular in finite time.

Corollary 7.3. The curvature explodes as t → T :


 
lim sup |Rm(x, t)| = ∞ (7.1)
t→T x∈M3
 
lim sup |Rc(x, t)| = ∞ (7.2)
t→T x∈M3

Proof. By Theorem 7.2, the maximal time T is finite; by Theorem 6.7 the curvature must explode
as we approach this singularity. This proves (7.1). It now follows from Lemma 1.13 (which applies
because the manifold is 3-dimensional) that there exists a constant C such that |Rm| ≤ C|Rc|,
from which (7.2) follows.

59
7.3 Setting the Scene for the Maximum Principle – The
Uhlenbeck Trick
Having seen that the curvature explodes as the maximal time T is approached, we now show that
the metric g(t) becomes “rounder” when the curvature becomes large. To do this, we will apply
the maximum principle for vector bundles (Theorem 3.3) to the Riemann curvature tensor.
Recall, from Lemma 6.1, that under the Ricci flow the Riemann curvature tensor evolves
according to

Rijkl = 4Rijkl + 2(Bijkl − Bijlk + Bikjl − Biljk ) (7.3)
∂t
−(Rip Rpjkl + Rjp Ripkl + Rkp Rijpl + Rlp Rijkp ), (7.4)

where
q p
Bijkl ≡ −Rpij Rqlk . (7.5)
Naı̈vely we might try to apply Theorem 3.3 to Rijkl as a section of the vector bundle of 4-tensors.
However, the reaction terms are so hideous that we will not be able to say anything useful about
the ode that we need to solve to apply Theorem 3.3. Furthermore, Theorem 3.3 can’t deal with
bundle metrics that depend on time. The way forward is what is known as the Uhlenbeck trick.
The idea is to fix the initial tangent bundle with its initial metric, then to evolve the isometry
between this fixed bundle and the tangent bundle with its time-evolving metric. Suppose that we
have a solution g(t) of the Ricci flow on M. Let (V, h) be a vector bundle over M with metric h
such that
u0 : (V, h) → (T M, g0 )
is a bundle isometry. We evolve the isometry u(t) by
∂ i
u = Rli ula (7.6)
∂t a
where the uia are the components of the isometry with respect to some local bases of V and T M.
We will use indices a, b, . . . on the vector bundle V and i, j, . . . on T M to distinguish them more
clearly.
Lemma 7.4. u(t) remains an isometry.
Proof. u(t) : (V, h) → (T M, g(t)) is an isometry as long as h is the pullback of g(t) via u(t), i.e.
h = u(t)∗ g(t). Because h is constant and u(0) is an isometry by definition, it suffices to show that
u(t)∗ g(t) does not change in time.

∂ ∂ i j
(u(t)∗ g(t))ab = (u u gij )
∂t ∂t a b
= Rli ula ujb gij + uia Rlj ulb gij + uia ujb (−2Rij )
= 0

as required, so u(t) remains an isometry.


Therefore we can consider the behaviour of the pullback of Rm to V , u∗ Rm, rather than Rm
itself.
The equation describing the evolution of (u∗ Rm)abcd ≡ Rabcd is then the same as that for
Rijkl but without the irritating terms (7.4) (the fact that the Uhlenbeck trick gives the evolution
equation in such a nice form, as well as providing the framework that allows us to use the tensor

60
maximum principle, is a little mysterious – Hamilton refers to it as “magic”):
∂ ∂ i j k l
Rabcd = (u u u u Rijkl )
∂t ∂t a b c d
i m j k l
= (Rm ua )ub uc ud Rijkl + uia (Rm
j m k l
ub )uc ud Rijkl
+uia ujb (Rm uc )ud Rijkl + uia ujb ukc (Rm
k m l l
um
d )Rijkl
+uia ujb ukc uld [4Rijkl + 2(Bijkl − Bijlk + Bikjl − Biljk )
−(Rip Rpjkl + Rjp Ripkl + Rkp Rijpl + Rlp Rijkp )]
= 4Rabcd + 2(Babcd − Babdc + Bacbd − Badbc ) (7.7)

where Babcd is defined by equation (7.5).


Recall that we can view Rm as the curvature operator R, which is a section of the bundle

E = ∧2 T ∗ Mn ⊗S ∧2 T ∗ Mn

of symmetric bilinear forms on the space ∧2 T M3 of 2-vectors. This viewpoint was outlined in
Section 1.4.
In 3 dimensions, ∧2 T M3 has dimension 3, so each fibre of this bundle E is naturally isomorphic
to the vector space of 3 × 3 self-adjoint (i.e. symmetric) matrices. We can diagonalize these with
respect to some orthonormal basis {e1 ∧ e2 , e2 ∧ e3 , e3 ∧ e1 } of ∧2 T M3 , where {e1 , e2 , e3 } is an
orthonormal basis of T M3 .
Now we would like to apply the vector bundle maximum principle (Theorem 3.3) to R, viewed
as a section of E. We must consider the ode corresponding to the pde describing the evolution of
Rm (which is given by equation (7.7)), namely
d
Qabcd = 2(Babcd (Q) − Babdc (Q) + Bacbd (Q) − Badbc (Q))
dt
where Babcd (Q) is defined by analogy with equation (7.5).
In 3 dimensions this ode has a particularly convenient form. Choose a basis {ei } of T M3x so
that Q is diagonal with eigenvalues λ1 , λ2 , λ3 down the diagonal, then the equation is (see [6, Sec.
6.3, 6.4]):    2 
λ 0 0 λ1 + λ2 λ3 0 0
d  1
0 λ2 0  =  0 λ22 + λ3 λ1 0 .
dt
0 0 λ3 0 0 λ23 + λ1 λ2
In particular, the matrix Q will remain diagonal – this does not happen for higher dimensions,
because the matrix on the rhs will not be diagonal (there are other useful decompositions in the
n = 4 case though – see [10]). Therefore, in three dimensions the three parameters λ1 , λ2 , λ3
completely describe Q, and we may represent Q as a point λ = (λ1 , λ2 , λ3 ) moving in R3 according
to the ode    2 
λ λ + λ2 λ3
d  1   21
 3
(λ1 + λ32 + λ33 )

λ2 = λ2 + λ3 λ1  =∇ + λ1 λ2 λ3 . (7.8)
dt 3
λ3 λ23 + λ1 λ2
We note that the initial value of λ also tells us all about the initial Ricci and scalar curvatures, by
the formulae (1.9) and (1.10):
 
λ2 + λ3 0 0
1
Rc =  0 λ3 + λ1 0  (7.9)
2
0 0 λ1 + λ2

and
R = λ1 + λ2 + λ3 . (7.10)
We can visualize the behaviour of the ode (7.8) by visualizing the evolution of λ(t). The ode
is homogeneous, so we can scale any solution to get another solution. We consider “projecting”
the solution λ(t) onto the unit sphere to get a path µ(t) = λ(t)/|λ(t)|. Given µ(0), the evolution
of µ(t) of course depends on which solution λ(t) we choose, satisfying λ(0) = |λ(0)|µ(0). However,

61
z
S2x R

S3

S2x R S2x R y
x

Figure 7.1: The axes and the region Rc > 0 (shaded) on the unit sphere in R3 . The points √
corresponding to S3 and S2 × R are labelled. Note that S3 corresponds to the point (1, 1, 1)/ 3,
not the origin.

Figure 7.2: The paths µ(t) followed by the solutions to equation (7.8) on the 2-sphere.

62
the different paths µ(t) will only differ by a rescaling of time, by the homogeneity of the ode.
Hence we can unambiguously consider the paths that the solution follows on the unit 2-sphere,
if we do not take into account the speed (this procedure corresponds to our freedom to scale the
manifold without changing its shape).
Figure 7.2 shows the paths µ(t) followed on the sphere by the solutions to the ode (7.8). Figure
7.1 shows the axes, and the region corresponding to Rc > 0 is shaded grey (the diagram becomes
incoherent when these two are superimposed so we’ve split them up for clarity). √
The solution√ is stationary at the points (±1, 0, 0), (0, ±1, 0), (0, 0, ±1), (1, 1, 1)/ 3 and
(−1, −1, −1)/ 3. We recall that the eigenvalues of the curvature operator R correspond to sec-
tional curvatures. Thus the points (1, 0, 0), (0, 1, 0), (0, 0, 1) correspond to a manifold with positive
curvature in one 2-plane and zero curvature in orthogonal 2-planes – we can identify these points
with the manifold S2 × R, or quotients thereof if M is not simply connected. Similarly, we identify
2
the points√ (−1, 0, 0), (0, −1, 0), (0, 0, −1) with the manifold H × R or quotients thereof. The point
(1, 1, 1)/ 3 corresponds to a manifold of constant positive
√ sectional curvature, which we iden-
tify with S3 or quotients thereof, and (−1, −1, −1)/ 3 corresponds to a manifold with constant
negative sectional curvature, which we identify with H3 or quotients thereof.
We can see from Figure 7.2 that the paths are flowing towards S3 , but that some of the ones
flowing from outside the set of metrics with positive Ricci curvature get “caught” on the naughty
S2 × Rs. The set of metrics of strictly positive Ricci curvature is the largest convex set (recall that
we need convexity for Theorem 3.3 to apply) that is symmetric in λ1 , λ2 , λ3 and does not contain
the S2 ×R points. Thus we can see that the Ricci flow is expected to converge towards S3 whenever
it starts off with strictly positive Ricci curvature, but no stronger condition can be imposed.
It is interesting to note that, if we look at the sphere from the opposite direction, the arrows
are all reversed, the S3 is replaced by H3 and the S2 × Rs are replaced by H2 × Rs. Of course the
region shown in Figure 7.2 is all that is important to the proof of Theorem 7.1.
If one chooses λ1 (0) ≥ λ2 (0) ≥ λ3 (0) then it is easy to show that this condition is preserved
under the evolution equation (7.8):

d
log(λ1 − λ2 ) = (λ1 + λ2 − λ3 )
dt
and
d
log(λ2 − λ3 ) = (λ2 + λ3 − λ1 ).
dt

7.4 Local Curvature Pinching from the Maximum Principle


We would now like to obtain quantitative results showing that this convergence towards constant
sectional curvature does actually happen. In this section we will assume that (M3 , g(t)) is a
solution of the Ricci flow on a closed Riemannian 3-manifold with initially strictly positive Ricci
curvature.
We first use Theorem 3.3 to prove that, if the eigenvalues of R are initially close together, they
will remain so.
Lemma 7.5. There exist constants C < ∞ and  > 0 depending only on the initial metric such
that
λ1
≤C
λ3 + λ2
(where λ1 ≥ λ2 ≥ λ3 are the eigenvalues of the curvature operator) and

Rc ≥ g.

We have diverged slightly from Chow and Knopf’s formulation of this lemma (namely by
including the second condition) to avoid a technicality that arises in the proof.
Proof. See Section 7.8.

63
Corollary 7.6. The scalar curvature explodes as t → T . That is, if we define Rmax (t) :=
supx∈M R(x, t), then
lim Rmax (t) = ∞.
t→T

Proof. By the result of Lemma 7.5, Rc remains strictly positive under the Ricci flow. Therefore,
if we diagonalize Rc with respect to some orthonormal basis, with eigenvalues a, b, c > 0, then

|Rc|2 = a2 + b2 + c2 < (a + b + c)2 = R2 ,

from which the result follows by Corollary 7.3.


Corollary 7.7. There is a constant β > 0 such that

Rc ≥ 2β 2 Rg.

Proof. We use Lemma 7.5 and formulae (1.9) and (1.10) to deduce that
λ2 + λ3
Rc ≥ g ≥ 2β 2 (λ1 + λ2 + λ3 )g = 2β 2 Rg,
2
for some β > 0.
We now show that the pinching together of the eigenvalues actually gets better as the scalar
curvature goes to +∞.
Theorem 7.8. There exist positive constants δ < 1 and C̄ (depending only on g0 ) such that

λ1 − λ3 C̄
≤ δ,
R R
where R = λ1 + λ2 + λ3 is the scalar curvature.
Proof. See Section 7.8.
Theorem 7.8 shows that, at each point, the sectional curvatures get “pinched” together as the
curvature explodes (λ1 − λ3 is the greatest difference between any two eigenvalues). Furthermore
the left hand side of this estimate is scale-invariant – so even when we rescale the metric by some
factor, this bound tells us that the eigenvalues (i.e. the sectional curvatures) will be close together.
Corollary 7.9. There exist positive constants B, δ̄ such that
|E|2
≤ BR−δ̄
R2
where E is the Einstein tensor, Eij = Rij − 13 Rgij .
Proof. See Section 7.8.
This quantity |E|2 measures how far away the metric is from being an Einstein metric. By
Lemma 1.14, when |E|2 = 0 we have Rij = Cgij where C is constant over the whole manifold. In
the case of 3 dimensions, it follows that the metric has constant sectional curvature (see Lemma
1.14). Thus bounds on |E|2 are very useful.

7.5 Global Curvature Pinching


The previous section dealt with the pinching of sectional curvatures at a point. We know they
pinch together if the scalar curvature explodes at that point, but we only know that the curvature
explodes somewhere on our manifold as we approach the singular time – that is not enough to
conclude that the sectional curvatures pinch together everywhere.
However we recall that, by Corollary 7.9,
|E|2
≤ BR−δ̄
R2

64
for some B, δ̄ > 0. We recall the result of Lemma 1.14, which says that if E = 0 then R is constant.
Thus we would expect that, when we have a bound on |E|2 (as we do, by Corollary 7.9) everywhere
on the manifold, the scalar curvature R might be close to being constant. So it is reasonable to
expect that we will be able to obtain a bound on |∇R| from our pinching result. This bound would
be very useful to have because it will allow us to compare values of R at different places in M.
Because we already know that R is exploding somewhere on M we will be able to show that it
is getting large everywhere, and hence that the sectional curvatures are getting close everywhere.
One way of bounding |∇R| is:

Theorem 7.10. Let (M3 , g(t)) be a solution of the Ricci flow on a closed 3-manifold with initially
strictly positive Ricci curvature. There exist β̄, γ > 0 depending only on the initial metric such
that for any β ∈ [0, β̄] there exists C such that

|∇R|2
≤ βR−γ + CR−2 .
R3
Proof. See Section 7.8.

Using this estimate we can get global bounds on the variation of the curvatures:
Theorem 7.11. Let (M3 , g(t)) be a solution of the Ricci flow on a closed 3-manifold with ini-
tially strictly positive Ricci curvature, defined for t ∈ [0, T ). Then there exist constants C, γ > 0
depending only on initial data such that
Rmin −γ
≥ 1 − CRmax .
Rmax
Note that this means Rmin /Rmax → 1 as t → T , because Rmax → ∞ as t → T by Corollary 7.6. It
follows that R → ∞ uniformly as t → T .
Proof. See Section 7.8.

Now recall the discussion at the start of this section. Because R → ∞ as t → T , the curvature
should be getting uniformly pinched.
Corollary 7.12. Let λ1 (x, t) ≥ λ2 (x, t) ≥ λ3 (x, t) denote the eigenvalues of the curvature operator
at (x, t), then for any  ∈ (0, 1) there exists T ∈ [0, T ) such that

min λ3 (x, t) ≥ (1 − ) max3 λ1 (y, t) > 0


x∈M3 y∈M

for all t ∈ [T , T ). Note that this means the metric will eventually attain positive sectional curvature
everywhere.
Proof. See Section 7.8.

7.6 Normalized Ricci Flow


We now know that the Ricci flow becomes singular in finite time T , that the curvature explodes
as we approach time T , and that the sectional curvatures get pinched together as the curvature
explodes. We want a metric of constant sectional curvature on our manifold, so the idea is to take
the limit of our flow as we approach the time T . The problem is that the manifold is shrinking
down to a point at time T , just as was the case for the csf. In analogy with the csf, where we
rescaled the flow so that the area enclosed by the curve was constant, we consider a rescaling of
our manifold so that its volume is constant.
Let g̃(t) = ψ(t)g(t) be a rescaling of our Ricci flow metric g(t) on Mn , with ψ(0) = 1. We use a
tilde to distinguish quantities that refer to the metric g̃, for example R̃ := R(g̃) denotes the scalar
curvature of the metric g̃. Let us choose ψ(t) so that the volume (in fact the volume element) of

65
the manifold with respect to g̃ is constant. Using the result of Lemma 1.19 for the scaling of the
volume elements we have
Vol(g̃(t)) = Vol(g̃(0))
⇒ ψ(t)n/2 Vol(g(t)) = Vol(g(0))
 2
dµ(t) − n
R
⇒ ψ(t) = R Mn
n dµ(0) M

We then have, from Lemma 6.1,


R − n2 −1 d R
d 2 M n dµ(t) dtR Mn dµ(t)
ψ(t) = − R
dt n Mn
dµ(0) Mn
dµ(0)
R
2 ψ(t) n Rdµ(t)
= R  RM
n R Mn dµ(t) Mn
dµ(0)
Mn
dµ(0)
2r
= ψ(t)
n
2r̃ 2
= (ψ(t)) (7.11)
n
where the average scalar curvature, r, is defined by
R
n Rdµ
r := RM ,
Mn

and the normalized average scalar curvature, r̃, is defined by


R
n R̃dµ r
r̃ := RM =
Mn
dµ ψ(t)

by the result of Lemma 1.19.


From this we can compute the evolution equation for g̃:
 
∂ ∂ d
g̃ = g ψ(t) + g ψ(t)
∂t ∂t dt
2r̃ 2
= −2ψ(t)Rc(g(t)) + (ψ(t)) g
 n 
2r̃
= ψ(t) −2Rc(g̃(t)) + g̃
n

We define a rescaling of time to get rid of the ψ(t) terms in this evolution equation:
Z t
τ= ψ(u)du.
0

The evolution equation is then


∂ ˜ + 2r̃ g̃.
g̃ = −2Rc (7.12)
∂τ n
This is the equation of the normalized Ricci flow. It differs from the ordinary Ricci flow by a
rescaling of time and the metric, so that the metric has constant volume and the problem of the
manifold shrinking to a point as we approach the singular time is eliminated. Because we have
only rescaled the Ricci flow solution, the results that we have proven so far for the unnormalized
Ricci flow translate without too much pain to results for the normalized Ricci flow using the results
of Lemma 1.19. The issue of the curvature exploding is also dealt with by the following lemma:

Lemma 7.13. R̃max is bounded, for the normalized Ricci flow on a closed 3-manifold with initially
strictly positive Ricci curvature.

66
Proof. We use Corollary 7.7 to deduce that
˜ = Rc ≥ 2β 2 Rg ≥ 2β 2 Rmin g = 2β 2 R̃min g̃
Rc

for some β > 0. It follows from Myers’ Theorem (Theorem 1.17) that L̃, the diameter of M with
respect to the metric g̃, satisfies
π
L̃ ≤ p .
β R̃min
Now we can also get a lower bound for L̃ from the fact that the volume of our manifold is constant
with respect to g̃, by the first part of the Bishop-Günther volume comparison theorem (Theorem
˜ = Rc ≥ 0, we have (in the notation of Theorem 1.15):
1.15). Because Rc
4π 3
Ṽ = Vol(B(p, L̃)) ≤ V30 (L̃) = L̃
3

where Ṽ is the volume of M with respect to g̃, which we know is equal to a constant, which we
call Ṽ0 . Combining these gives us
! 13
3Ṽ0 π
≤ L̃ ≤ p .
4π β R̃min

This gives us an upper bound for R̃min . Finally, it follows from the first part of Theorem 7.11 and
scaling invariance that there exists some C > 0 such that

R̃min Rmin 1
= ≥ .
R̃max Rmax C

Hence the upper bound for R̃min implies one for R̃max , and the proof is complete.
We can now show that our rescaling of time has in fact taken us to an infinite time interval.
Theorem 7.14. If (M3 , g(t)) is a solution of the Ricci flow on a closed 3-manifold with initially
strictly positive Ricci curvature then the corresponding normalized Ricci flow exists for all time.
Proof. First we show that
Z T
Rmax (t)dt = ∞,
0

where Rmax (t) is the maximum scalar curvature of the metric g(t) and [0, T ) is the maximal time
interval on which the unnormalized Ricci flow exists. This can be done by setting
 Z t 
f (t) = exp 2 Rmax (u)du Rmax (0),
0

which then satisfies


df
= 2Rmax (t)f
dt
and hence, using the evolution equation for R given in Lemma 6.1,

(R − f ) = 4R + 2|Rc|2 − 2Rmax f
∂t
≤ 4(R − f ) + 2Rmax (R − f )

(using |Rc|2 ≤ R2 , which follows as Rc is positive). Now we have (R − f ) ≤ 0 initially, as


f (0) = Rmax (0). This condition is preserved by the ode obtained by ignoring the Laplacian term
in the above equation, as
d
φ(t) = 2Rmax (t)φ(t)
dt

67
is equivalent to
d
log(|φ(t)|) = 2Rmax (t)
dt
from which we can see that φ never changes sign. Hence the condition (R − f ) ≤ 0 is preserved,
by the scalar maximum principle. By Corollary 7.6, Rmax → ∞ as t → T , so f → ∞ as t → T .
Recalling the definition of f , this means that
Z t
lim Rmax (u)du = ∞.
t→T 0

Now the corresponding integral for the normalized flow will be the same:
Z T̃ Z T
R̃max (τ )dτ = r(t)dt = ∞
0 0

(if we define [0, T̃ ) to be the maximal time interval on which the normalized Ricci flow exists).
This follows easily from the scaling results of Lemma 1.19 as R̃max dτ = (ψ −1 Rmax )(ψdt) = rdt. So
the integral for the normalized flow must also diverge – but the integrand is bounded by Lemma
7.13. Hence the region of integration must be infinite, so T̃ = ∞, i.e. the normalized Ricci flow
exists for all time.

7.7 Convergence of the Normalized Flow


Now we will prove that the normalized Ricci flow converges as τ → ∞ to a smooth metric g̃∞ of
constant positive sectional curvature. We will use the notations

g̃∞ = g̃(∞) := lim g̃(τ ).


τ →∞

The proof will be very similar to that of Theorem 6.7, and the key element will again be Theorem
6.8. To show that g̃(∞) exists and is continuous, we must show that there exists some C < ∞
such that Z ∞

g̃ dτ < C.
∂τ
0 g̃

Using the normalized Ricci flow equation (7.12), that is equivalent to showing that the integral
Z ∞
˜ − r̃ g̃ dτ

Rc (7.13)
0
3

is bounded. We know from Corollary 7.9 that

Rc − 1 Rg 2

3
≤ CR−δ .
R2
This is a bound on exactly the right sort of quantity for the unnormalized Ricci flow (Theorem 7.11
tells us that as t → ∞, r and R approach one another uniformly). We need a bound that holds
for the normalized Ricci flow though, and one that will force the integral (7.13) to be bounded
too. The easiest way to do this is to prove the integrand is bounded by a decaying exponential (a
constant bound will not suffice as it did for Corollary 6.12, as the region of integration is infinite).
We will need a few preliminary results.
Lemma 7.15. If (M3 , g̃(τ )) is a solution of the normalized Ricci flow on a closed 3-manifold with
initially strictly positive Ricci curvature then there exists  > 0 such that R̃ ≥  for all τ > 0.
Proof. See Section 7.8.

We are now ready to prove the key estimate that allows us to prove the integral (7.13) is
bounded.

68
Theorem 7.16. If (M3 , g̃(τ )) is a solution of the normalized Ricci flow on a closed 3-manifold
with initially strictly positive Ricci curvature then there exist constants C, δ > 0 such that

Ẽ ≤ Ce−δτ .

Proof. See Section 7.8.


To apply Theorem 6.8 we must prove an exponential bound on |Rc ˜ − r̃/3g̃|. The previous
˜
theorem gives us a bound on |Ẽ| = |Rc − R̃/3g̃| which is almost what we want. We just need to
go from R̃ to r̃. So we need to show that the difference |R̃ − r̃| is exponentially bounded. We will
actually prove something a bit stronger:
Lemma 7.17. There exist constants C, δ > 0 such that
R̃max − R̃min < Ce−δτ .
This Lemma comes from [9, Lemma 17.4].
Proof. See 7.8.
This allows us to prove the following:
Theorem 7.18. If (M3 , g̃(τ )) is a solution of the normalized Ricci flow with initially strictly
positive Ricci curvature, then g̃(τ ) exists for all τ ∈ [0, ∞) and converges uniformly as τ → ∞ to
a continuous metric g̃(∞).
Proof. By Theorem 7.16 and Lemma 7.17 we have
Z ∞ Z ∞
∂g̃
˜ − r̃ g̃ dτ

dτ = Rc
∂τ 3
0 0
Z ∞
˜ R̃ R̃ − r̃
≤ Rc − g̃ + g̃ dτ
0 3 3
Z ∞
< Ce−δτ dτ < ∞
0

where we have amalgamated the two exponential bounds into one. It follows by Theorem 6.8 that
g̃(τ ) converges uniformly to a continuous metric g̃(∞) as τ → ∞.
The next step is to show that this convergence is smooth. This is important because we
want our limit metric to be smooth, and also because we want the curvatures of the normalized
flow to converge to the corresponding curvatures of the limit metric (as curvature quantities are
all essentially second-order derivatives of the metric). That will allow us to conclude that the
curvature pinching results we have proven for the flow lead to similar results for the limit metric,
and hence that the limit metric has constant curvature.
Theorem 7.19. The limit metric g̃∞ := g̃(∞) of Theorem 7.18 is smooth, and the convergence
of g̃(τ ) to g̃∞ as τ → ∞ is uniform in every C m norm.
Proof. See Section 7.8.
We are now finally in a position to prove Theorem 7.1. We show that the limit metric g̃∞ has
the properties required.
Theorem 7.20. The limit metric g̃∞ is a smooth metric with constant positive sectional curvature.
Proof. By Theorem 7.19, g̃(τ ) converges to g̃∞ in the C 0 , C 1 and C 2 norms. Because all curvature
quantities are combinations of “0th-order”, 1st-order and 2nd-order derivatives of the metric, this
means that we can take the limit to show that the Einstein tensor of the metric g̃∞ vanishes:

Ẽ∞ = lim Ẽ(τ ) ≤ lim Ce−δτ = 0

τ →∞ τ →∞

by the result of Theorem 7.16. Therefore g̃∞ is an Einstein metric, so by Lemma 1.14 g̃∞ has
constant (positive) sectional curvature.

69
7.8 Details
Proof. (of Lemma 7.5) Note that such a C and  will exist at time t = 0 by the compactness of the
manifold M3 : λ1 /(λ2 + λ3 ) is a continuous, positive function at t = 0 so it has an upper bound,
and the eigenvalues of Rc are (λ2 + λ3 )/2, (λ3 + λ1 )/2, (λ1 + λ2 )/2, all of which are strictly positive
at time t = 0 so they have a strictly positive lower bound. Thus it suffices to show that these
conditions are preserved under evolution by the Ricci flow.
We use the maximum principle in the form of Theorem 3.3, on the vector bundle E, with the
set
K ≡ {Q ∈ E : λ1 (Q) − C(λ3 (Q) + λ2 (Q)) ≤ 0 and λ2 + λ3 ≥ 2}.
Note that because λ1 ≥ λ2 ≥ λ3 , the last condition states exactly that Rc ≥ g by formula (1.9).
This set is invariant under parallel translation and convex in each fibre. Convexity follows because
the functions

f (Q) = λ1 (Q)
= max (Q(U, U ))
|U |=1

and

h(Q) = −(λ2 (Q) + λ3 (Q))


= max (−Q(V, V ) − Q(W, W ))
|V | = |W | = 1
hV, W i = 0

are convex, so the function j(Q) = f (Q) + Ch(Q) is convex, therefore the set

K = h−1 ((−∞, −]) ∩ j −1 ((−∞, 0])

is convex.
We just need to show that the solution of the associated ode stays in K. That is, if
λ1
≤C
λ2 + λ3
and
λ2 + λ3 ≥ 2
initially, then this condition remains true under the evolution (7.8). This can be shown by proving
first that
d
(λ2 + λ3 ) = λ22 + λ23 + λ1 (λ2 + λ3 ) ≥ 0,
dt
from which it follows that the condition λ2 + λ3 ≥ 2 is preserved, then

λ2 (λ3 − λ1 ) + λ23 (λ2 − λ1 )


 
d λ1
log = 2 ≤0
dt λ3 + λ2 λ1 (λ3 + λ2 )

by the evolution equation (7.8), because λ2 + λ3 ≥  > 0 and hence λ1 > 0 also. This is the sole
reason for including the condition Rc ≥ g – to show that the rhs of this equation does not become
singular.
Thus all the hypotheses of Theorem 3.3 are satisfied, so the solution to the pde stays in K, i.e.
the conditions
λ1
≤C
λ2 + λ3
and
Rc ≥ g
are preserved.

70
Proof. (of Theorem 7.8) The proof is very similar to that of Lemma 7.5. It suffices to show that
there exist C̄, δ such that
λ1 − λ3 C̄
≤ . (7.14)
λ2 + λ3 (λ2 + λ3 )δ
By compactness, given δ ∈ (0, 1) we can choose such a C̄ at time t = 0, because λ2 + λ3 > 0
everywhere by the positive Ricci curvature condition. We show that this condition is preserved
using the maximum principle (Theorem 3.3) with the same vector bundle E as we used in the proof
of Lemma 7.5, but a different K:

K ≡ {Q ∈ E : [λ1 (Q) − λ3 (Q)] − C̄[λ2 (Q) + λ3 (Q)]1−δ ≤ 0}

Once again, the set K is invariant under parallel translation and convex in each fiber by a
similar argument to that in the proof of Lemma 7.5.
To show that the solution of the ode stays inside K we compute, from equation 7.8:
 
d λ1 − λ3 1
log ≤ δλ1 − (1 − δ)(λ3 + λ2 ).
dt (λ2 + λ3 )1−δ 2
Now, by Lemma 7.5, it is possible to choose δ small enough that this is always non-positive, so
λ1 − λ3
(λ2 + λ3 )1−δ
is non-increasing, hence the inequality (7.14) is preserved by the ode. Thus by the maximum
principle it is also preserved by the pde.
Proof. (of Corollary 7.9) Using the formulae (1.9) and (1.10) we have
 
λ2 + λ3 − 2λ1 0 0
1
E=  0 λ3 + λ1 − 2λ2 0 , (7.15)
6
0 0 λ1 + λ2 − 2λ3

hence
|E|2 (λ2 + λ3 − 2λ1 )2 + (λ3 + λ1 − 2λ2 )2 + (λ1 + λ2 − 2λ3 )2
=
R2 36(λ1 + λ2 + λ3 )2
(λ1 − λ2 ) + (λ2 − λ3 )2 + (λ3 − λ1 )2
2
=
12(λ1 + λ2 + λ3 )2
3(λ3 − λ1 )2

12(λ1 + λ2 + λ3 )2
≤ BR−δ̄

by the result of Theorem 7.8, with δ̄ = 2δ (the penultimate step follows as λ1 − λ3 is the greatest
difference between two eigenvalues).
Proof. (of Theorem 7.10) We provide an outline of the proof without showing all of the excruciating
details. A more complete outline can be found in [6, Section (6.6)].
Using the evolution equation for R under the Ricci flow, given in Lemma 6.1, it is possible to
deduce that
 2
∂ |∇R|2 |∇R|2
    
∇R
= 4 − 2R ∇
∂t R R R
|∇R|2 4
−2 |Rc|2 + h∇R, ∇|Rc|2 i
R2 R
 2
|∇R|2
  
∇R
≤ 4 − 2R ∇
R R
|∇R|2 √
−2 2
|Rc|2 + 8 3|∇Rc|2
R

71
where the last line follows from the Cauchy-Schwarz inequality and Lemma 1.1 (to prove |∇R|2 ≤
3|∇Rc|2 ):
4 4
h∇R, ∇|Rc|2 i |∇R| ∇|Rc|2


R R
|Rc|
≤ 8|∇R||∇Rc|
√ R
≤ 8 3|∇Rc|2

(using |Rc| < R because Rc > 0, as in Corollary 7.6). We would like to use the maximum principle
(Theorem 3.2) to show that the quantity |∇R|2 /R does not increase. All reaction terms on the
right hand side of our evolution equation are negative with the exception of the last one in |∇Rc|2 .
We need to deal with this meddlesome customer. We note that
1
|∇E|2 ≥ |∇Rc|2 (7.16)
37
where E is the Einstein tensor (which we know ought to be falling away nicely as our manifold
gets rounder). To see why this is true, we combine Lemma 1.1 with equation (1.14) to get
1 j 1 1
|∇k Eij |2 ≥ |∇ Eij |2 = · |∇R|2 .
3 3 36
Equation (7.16) now follows easily:
1
|∇Rc|2 − |∇R|2 = |∇E|2
3
1 1
≥ |∇R|2
3 36
which can be rearranged to obtain
1 1
|∇Rc|2 − |∇R|2 ≥ |∇Rc|2
3 37
where |∇E|2 is exactly equal to the lhs.
Therefore our evolution equation becomes
 2
|∇R|2 |∇R|2
    
∂ ∇R
≤ 4 − 2R ∇
∂t R R R
|∇R|2 √
−2 2
|Rc|2 + 8 3 · 37|∇E|2 .
R
We are going to try to get rid of the bad final term of this equation by adding on some multiple
of |E|2 . The evolution equation for |E|2 under the Ricci flow is given in Lemma 6.1. It is

∂|E|2
= 4|E|2 − 2|∇E|2 + W (7.17)
∂t
where
26
W = R|Rc|2 − 8Rij Rjk Rki − 2R3 . (7.18)
3
The second term on the right hand side of equation (7.17) interests us: we can use it to cancel
off the recalcitrant |∇E|2 term in the evolution equation for |∇R|2 /R. We set

|∇R|2 37 √
V := + (8 3 + 1)|E|2
R 2
then calculate

V ≤ 4V − |∇Rc|2 + C1 W
∂t

72
(reusing the inequality (7.16)) where C1 is a positive constant.
It is possible to prove that
50
W ≤ R|E|2 (7.19)
3
purely algebraically; as Rc is positive and symmetric we can diagonalize it with positive real
eigenvalues a, b, c. Each term in the inequality can be expressed in terms of a, b and c, and we are
left with an easy homogeneous inequality in three real variables. Equality occurs when a = b = c,
i.e. for an Einstein metric (this proof is significantly simpler than that presented in [6]). We also
have
|∇R|2 ≤ 3|∇Rc|2
by Lemma 1.1.
Hence we have

V ≤ 4V − |∇Rc|2 + C2 R|E|2
∂t
1
≤ 4V − |∇R|2 + C3 R3−2γ
3
by the result of Corollary 7.9, for some constants C and γ > 0. Now we set

U := V − βR2−γ

where β is to be chosen. It is not difficult to calculate from the result of Lemma 6.1 that
∂ 2−γ
R = 4(R2−γ ) − (2 − γ)(1 − γ)R−γ |∇R|2 + 2(2 − γ)R1−γ |Rc|2 ,
∂t
hence that
 
∂ 1
U ≤ 4U + β(2 − γ)(1 − γ)R−γ − |∇R|2 (7.20)
∂t 3
+C3 R3−2γ − 2β(2 − γ)R1−γ |Rc|2 .

Now by Theorem 7.8 we have Rc ≥ g, hence R ≥ 3 for some  > 0. Thus we may choose
β̄ > 0 such that
(3)γ
β̄ < ,
3(2 − γ)(1 − γ)
so that for any β ∈ [0, β̄] the second term in the evolution equation (7.20) is non-positive. For the
rest of the terms, we use the inequality 3|Rc|2 ≥ |R|2 to obtain

C3 R3−2γ − 2β(2 − γ)R1−γ |Rc|2 ≤ C3 R3−2γ − C4 R3−γ

where C3 , C4 are constants. For large R this term is dominated by the R3−γ term, which is
negative. As it does not diverge at R = 0, there is a uniform upper bound C5 for this term, hence
we have

U ≤ 4U + C5 .
∂t
Thus the growth in U is at worst linear, by the maximum principle (Theorem 3.2). By Theorem
7.2 the time interval we are considering is finite, hence we have a uniform upper bound C for U .
Therefore
|∇R|2
≤ V = U + βR2−γ ≤ C + βR2−γ
R
from which the result follows.
Proof. (of Theorem 7.11) Our main tool here will be Theorem 7.10, which allows us to compare
the curvature at different points on the manifold. By Corollary 7.6, we also have Rmax → ∞ as

73
t → T . Thus for t sufficiently close to T , Theorem 7.10 tells us that there exist positive constants
A, α such that
3/2−α
|∇R| ≤ ARmax .
Thus we have (for t sufficiently close to T )
Z
3/2−α
|R(x) − R(y)| ≤ |∇R|ds ≤ ARmax d(x, y) (7.21)
γ

(here γ is a minimizing geodesic connecting x and y). Let us choose x(t) such that Rmax (t) = R(x, t)
(we can do this as M is compact), and define
1
L(t) := p
 Rmax (t)
where  > 0. Then for all y ∈ B(x(t), L(t)) we have, by equation (7.21):
 
3/2−α A −α
R(y) ≥ R(x) − ARmax L ≥ Rmax 1 − Rmax . (7.22)

Recall that Rmax → ∞ as t → T . Therefore, given δ > 0, for t sufficiently close to T we have

R(y) ≥ (1 − δ)Rmax (7.23)

for all y ∈ B(x(t), L(t)).


This will prove the theorem if we can show that B(x(t), L(t)) is all of M for t sufficiently close
to T . We will prove this using Myers’ Theorem (Theorem 1.17). Define N = B(x(t), L(t)). We
have the estimate Rc ≥ 2β 2 Rg for some β > 0 from Corollary 7.7. Hence, by equation (7.23), for
any δ > 0
Rc ≥ 2β 2 (1 − δ)Rmax g
in N for t close enough to T . By the proof of Myers’ Theorem (see [18, Theorem 11.8]), we have
π 1 1
diam(N ) ≤ p < √ < √ =L
β (1 − δ)Rmax δ Rmax  Rmax

for δ chosen sufficiently small. Because N is defined to be B(x(t), L(t)) but has diameter < L,
if there are any points in M that lie outside N then M must be disconnected. We know that it
isn’t, hence N must be all of M, from which the result follows by equation (7.22).
Proof. (of Corollary 7.12) We apply Theorem 7.8:

λ3 ≥ λ1 − C̄(λ1 + λ2 + λ3 )1−δ ≥ λ1 (1 − 3C̄R−δ ) (7.24)


at each point x ∈ M. Now it follows from Theorem 7.11 that R → ∞ uniformly as t → T . Thus,
if we are given η > 0, for times t close enough to T we have

λ3 (x, t) ≥ (1 − η)λ1 (x, t) (7.25)


−δ
(by equation (7.24), as R → 0)
1−η
≥ R(x, t)
3
(as 3λ1 ≥ λ1 + λ2 + λ3 = R)
(1 − η)2
≥ R(y, t)
3
(as Rmax /Rmin → 1)
(1 − η)2
≥ (λ1 (y, t) + 2(1 − η)λ1 (y, t))
3
(as λ2 + λ3 ≥ 2λ3 ≥ 2(1 − η)λ1 , reusing inequality (7.25))
≥ (1 − η)3 λ1 (y, t).

The result now follows by taking the supremum over all x, y ∈ M .

74
Proof. (of Lemma 7.15) We note that the proof of this Lemma presented in [6] is incorrect. The
proof we present is based on [9, Lemma 16.7].
We use the following lemma, which is used to prove the Sphere Theorem (Theorem 1.18):
Lemma 7.21. If M is a simply connected manifold of dimension ≥ 3 whose sectional curvatures
are pinched between K and 41 K for some constant K (M is “ 41 -pinched”), then the injectivity

radius of M is at least π/ K.

Proof. See [3], Theorem 5.10.


Now by Corollary 7.12,
λ3 (x, t)
→1
λ1 (y, t)
as t → T , uniformly for all x, y ∈ M. It follows by scaling invariance and the fact that λ1 ≥ λ2 ≥ λ3
that
λ̃i
→1
λ̃j
uniformly as τ → ∞ for all i, j.
Therefore our manifold is eventually as pinched as we care to make it – in particular, it will
eventually be 14 -pinched, with the constant K equal to some multiple of R̃min by the result of
Theorem 7.11. We may now apply Lemma 7.21 to the universal cover N of M. The volume of
N is at least some multiple of the cube of the injectivity radius ρ(N ) by the second part of the
Bishop-Günther Volume Comparison Theorem (Theorem 1.15), because we have a uniform upper
bound on sectional curvatures by Lemma 7.13. Hence we have
 3
0 3 0 π −3
Vol(N ) ≥ C ρ(N ) ≥ C √ ≥ C R̃max
2
. (7.26)
K
Note that, because the Ricci tensor of M is bounded below by Corollary 7.7, the fundamental
group of M is finite by Myers’ theorem (Theorem 1.17). Furthermore the volume of M is constant
under the normalized Ricci flow. Hence

Vol(N ) = |π1 (M)|Vol(M) = constant.

Combining this with equation (7.26) gives us a lower bound on R̃max . The result of Theorem 7.11
now gives a uniform lower bound on R̃min , as required.
We will need, in the proofs of many of the remaining theorems, to use maximum principle
arguments similar to those employed in Chapter 6 and early in this chapter. For this we must
know the evolution equation satisfied by the quantity we are considering under the normalized
Ricci flow – luckily there is a very simple way of going from the unnormalized evolution equation
to the normalized one. We need to introduce the concept of the degree of a tensor. If P is some
tensor involving the metric and the curvature for the unnormalized flow, then by the results of
Lemma 1.19, the same quantity calculated for the normalized flow (which is just a dilation of the
unnormalized flow by a factor of ψ, as in Section 7.6) will be related to P by a rule of the form
P̃ = ψ n P . We call n the degree of P . For example, by Lemma 1.19, the scalar curvature R has
degree −1.
Lemma 7.22. If P satisfies
∂P
= 4P + Q
∂t
for the unnormalized flow on a 3-manifold, and P has degree n, then Q has degree n − 1 and for
the normalized equation,
∂ P̃ 2r̃n
= 4̃P̃ + Q̃ + P̃ .
∂τ 3

75
Proof. Note that Q has degree n − 1 because ∂τ /∂t = ψ so ∂P/∂t has degree n − 1. We also have
4 = ψ 4̃, so 4P has degree n − 1, thus Q must have degree n − 1 too.
We now have P̃ = ψ n P and the formula (7.11) for dψ/dt, so

∂ P̃ ∂(ψ n P )
= ψ −1
∂τ ∂t
n−1 ∂P dψ
= ψ + nψ n−2 P
∂t dt  
n−1 n−2 2r̃ 2
= ψ (4P + Q) + nψ ψ P
3
2r̃n
= 4̃P̃ + Q̃ + P̃
3
as required.
Proof. (of Theorem 7.16) This proof is a simplification of that presented in [6].
Because |Ẽ|2 ≤ (λ̃1 − λ̃3 )2 /4 as shown in the proof of Corollary 7.9, it suffices for us to prove a
decaying exponential bound on (λ˜1 − λ˜3 ). In fact we will prove that there exist constants C, δ > 0
such that
λ˜1 − λ˜3 ≤ Ce−δτ (λ˜2 + λ˜3 ).
The result will then follow from Lemma 7.13, which gives a uniform upper bound on (λ˜2 + λ˜3 ).
We again use the Uhlenbeck trick, this time for the normalized Ricci flow equation. We replace
the isometry evolution equation (7.6) by
∂ i r̃
u = R̃li ula − uia .
∂τ a 2
˜ is then (using Lemmas 6.1 and 7.22)
The evolution equation for u∗ Rm

R̃abcd = 4̃R̃abcd + 2(B̃abcd − B̃abdc + B̃acbd − B̃adbc ) − r̃R̃abcd .
∂t
This leads to the analogue of equation (7.8) for the behaviour of the ode corresponding to the
Ricci flow pde:
 ˜   ˜2 
λ1 λ1 + λ˜2 λ˜3 − r̃λ˜1
d  ˜   2
λ2 =  λ˜2 + λ˜3 λ˜1 − r̃λ˜2  . (7.27)


λ˜3 ˜ 2
˜ ˜
λ3 + λ1 λ2 − r̃λ3 ˜

We now apply the vector bundle maximum principle (Theorem 3.3) exactly as we did in the proofs
of Lemmas 7.5 and 7.8, with the time-dependent pinching set1

K(τ ) ≡ {Q ∈ E : eδτ (λ˜1 (Q) − λ˜3 (Q)) − C(λ˜3 (Q) + λ˜2 (Q)) ≤ 0}

for some C, α, δ to be chosen. From equation (7.27) we obtain


! 2 2
d λ˜1 − λ˜3 λ˜2 + λ˜3
log e δt ˜ ˜
= δ − (λ2 − λ3 ) −
dt (λ˜2 + λ˜3 ) λ˜2 + λ˜3
1
≤ δ − (λ˜2 + λ˜3 )
2
where we have used the fact that λ˜2 − λ˜3 remains non-negative under the normalized Ricci flow,
which follows from equation (7.27) in the same way as it did from equation (7.8) for the unnor-
malized flow.
By Lemma 7.15, we can also choose  > 0 such that

2 ≤ λ˜1 + λ˜2 + λ˜3 ≤ (1 + B)(λ˜2 + λ˜3 )


1 Our Theorem 3.3 does not cater for time-dependent pinching sets, but the extension is simple – see [6, Theorem

4.9].

76
for some B > 0, where the last step follows by Lemma 7.5 as, by scale-invariance,

λ˜1 λ1
= ≤ B.
˜
λ2 + λ3˜ λ 2 + λ3

We can now choose δ > 0 small enough that



δ− < 0.
1+B
Combining these choices yields
!
d λ˜1 − λ˜3 1
log eδτ ≤ δ − (λ˜2 + λ˜3 )
dt (λ˜2 + λ˜3 ) 2

≤ δ−
1+B
≤ 0.

Hence the ode remains inside K(τ ) for appropriately chosen C, δ > 0. K(τ ) is closed, convex and
invariant under parallel translation by analogous methods to those used in the proofs of Lemma
7.5 and Theorem 7.8, so the result follows from the analogue of Theorem 3.3 for time-dependent
pinching sets ([6, Theorem 4.9]).
Proof. (of Lemma 7.17) Because we have a uniform upper bound on the diameter of our manifold
from Myers’ Theorem (as noted in the proof of Lemma 7.13), it suffices to prove an exponential
bound on |∇ ˜ R̃|. This is achieved by a maximum principle argument similar to that of Theorem
7.10. Let
|∇R|2
G= + α|E|2 ,
R2
for some α > 0 chosen later. G then has degree −2, and using similar methods to Theorem 7.10
can be shown, for appropriately chosen α > 0, to satisfy an evolution equation of the form

G ≤ 4G + βR|E|2
∂t
for some β > 0 (see [9, Lemma 17.4]). Thus by Lemma 7.22 we have

∂ 4r̃
G̃ ≤ 4̃G̃ + β R̃|Ẽ|2 − G̃
∂τ 3
≤ 4̃G̃ + Ce−δτ − δ G̃

for some C, δ > 0, by the result of Theorem 7.16, and also using the fact that R̃ is bounded below
by Lemma 7.15 to bound r̃ > 3δ/4 for some δ > 0. Therefore

∂  δτ   
e G̃ − Cτ ≤ 4̃ eδτ G̃ − Cτ ,
∂τ
 
so by the maximum principle eδτ G̃ − Cτ ≤ C from which it follows that G̃ is exponentially
˜ R̃|2 is also exponentially decaying,
decaying. Hence, as R̃ is bounded above by Lemma 7.13, |∇
and the result follows.
Proof. (of Theorem 7.19) The proof we present is different from Hamilton’s original proof in [9],
which relied on integral estimates to bound derivatives of the metric. We instead use the maximum
principle to do the same thing.
As we did in the proof of Corollary 6.12, we must show that in any coordinate patch U , we
have Z ∞
|∂τ ∂ k g|dτ < ∞
0

77
for any k ∈ N. From the equation for the normalized Ricci flow, we see that it suffices to prove

|∂ k F | ≤ Ce−δτ

where
F = Rc˜ − 1 r̃g̃.
3
Combining Theorem 7.16 with Lemma 7.17 (as we did in the proof of Theorem 7.18) we see
that |F | is exponentially bounded. The most important step in proving that its derivatives are also
exponentially bounded will be to prove that its covariant derivatives are exponentially bounded.
Because r̃ is constant (in space, not time) and ∇g̃ ˜ = 0, this is equivalent to proving that the
˜
covariant derivatives of Rc are exponentially bounded with respect to g̃(τ ) (recall that the metrics
g̃(τ ) are uniformly equivalent to g̃(0), so this is equivalent to proving that the derivatives are
bounded with respect to a fixed metric):
Lemma 7.23. There exist positive constants Ck , δk such that
˜ k Rc|
|∇ ˜ ≤ Ck e−δk τ

for all k ≥ 1.

Proof. The proof will follow very similar lines to the proof of Theorem 6.6, and will also use some
of the ideas from the proof of Theorem 7.10. In particular, the proof will be by strong induction
on k, the Einstein tensor E will again be very useful, and C, δ will denote different constants in
just about every equation – they are just generic upper bounds.
˜ ≤ |Rc
We note first of all that |Rc| ˜ − 1/3R̃g̃| + |R̃|/3 ≤ C by the results of Theorem 7.16 and
Lemma 7.13.
Now let us begin the induction. We do not need a base case, because there is a bit of monkey
business going on at k = 0 – which is expected because we know that |Rc| ˜ is not actually expo-
nentially bounded. Assume that |∇ ˜ ≤ Ce−δτ for all 1 ≤ j ≤ k − 1, for some positive constants
˜ j Rc|
C, δ. In the k = 1 case this is a null assumption, hence true.
By Lemma 1.13, Rm can be expressed as a linear combination of traces of Rc (because we are
in dimension 3). It now follows by Lemma 6.4 that ∇k Rc satisfies a heat-type equation which is
basically that given in Lemma 6.4 but with Rc substituted for Rm. It then follows by Lemma 6.2
that |∇k Rc|2 satisfies a heat-type evolution equation of the form
k
∂ k X
|∇ Rc|2 = 4|∇k Rc|2 − 2|∇k+1 Rc|2 + ∇j Rc ∗ ∇k−j Rc ∗ ∇k Rc.
∂t j=0

This is the evolution equation for the unnormalized flow; the equation for the normalized flow will
be the same thing except for an extra term of the form (2nr̃/3)|∇ ˜ k Rc|
˜ 2 , by Lemma 7.22. This
term will be absorbed into the j = 0 term of the summation in the evolution equation though, as
it is bounded by a term of the form Rc ˜ ∗∇˜ k Rc
˜ ∗∇˜ k Rc
˜ (note that by the result of Theorem 7.11,
r̃ is uniformly bounded above by CR for some constant C > 0).
˜ k Rc|
Therefore we have the heat-type equation satisfied by |∇ ˜ 2 under the normalized Ricci flow,
and we can simplify it using the inductive hypothesis:
k
X
˜ k Rc|
∂t |∇ ˜ 2 ≤ ˜ k Rc|
4̃|∇ ˜ 2 − 2|∇
˜ k+1 Rc|
˜ 2+ ˜ j Rc
∇ ˜ ∗∇
˜ k−j Rc
˜ ∗∇
˜ k Rc
˜
j=0
˜ k Rc|
≤ 4̃|∇ ˜ 2 − 2|∇
˜ k+1 Rc|
˜ 2 + Ce−δτ + Rc
˜ ∗ (∇ ˜ k Rc)
˜ ∗2
˜ Rc|
≤ 4̃|∇ k ˜ − 2|∇
2 ˜ k+1 ˜ + Ce
Rc| 2 −δτ ˜ Rc|
+ Bk |∇ k ˜ 2.

We have amalgamated the exponential bounds that came from the inductive hypothesis, and used
the uniform upper bound on |∇ ˜ Rc|.
˜ This is not quite enough though – the reaction terms here
would lead to exponential growth. We use the same trick as we did in the proof of Theorem 6.6:
we add on multiples of the lower-order terms to get rid of the bad reaction terms. However we

78
˜ 2 we will get something that is not exponentially
must be careful: if we add on a multiple of |Rc|
bounded. We must instead add on a multiple of |Ẽ|2 , which is exponentially bounded.
Recall the formula (7.17), and combine it with the inequality (7.19) to get
50
∂t |E|2 ≤ 4|E|2 − 2|∇E|2 + R|E|2 .
3
Now apply equation (7.16) and Lemma 7.22 to obtain
2 ˜ ˜ 2 50 4r̃
∂τ |Ẽ|2 ≤ 4̃|Ẽ|2 −
|∇Rc| + R̃|Ẽ|2 − |Ẽ|2
37 3 3
2 2 ˜ ˜ 2 −δτ
≤ 4̃|Ẽ| − |∇Rc| + Ce
37
(we used the bounds R̃ ≤ C and |Ẽ| ≤ Ce−δτ to get the last term).
Now, let us define
k−1
X
˜ k Rc|
V := |∇ ˜ 2 + αk0 |Ẽ|2 + ˜ j Rc|
αkj |∇ ˜ 2,
j=1

where the αkj are constants that we will choose carefully. V satisfies the evolution equation:
k−1
X
∂τ V ˜ k Rc|
≤ 4̃V + Ce−δτ + (Bk − 2αk,k−1 )|∇ ˜ 2+ ˜ j Rc|
(αkj Bj − 2αk,j−1 )|∇ ˜ 2
j=1
 
2 ˜ Rc|
˜ 2
+ αk1 B1 − αk0 |∇
37
where we have grouped all of the exponentially decaying terms together.
Let us choose the αkj such that:

Bk − 2αk,k−1 ≤ −1
αkj Bj − 2αk,j−1 ≤ −αkj for 2 ≤ j ≤ k − 1
2
αk1 B1 − αk0 ≤ −αk1 .
37
The evolution equation for V then becomes

∂τ V ≤ 4̃V + Ce−δτ − V.

We have an upper bound on V at time τ = 0 by the compactness of M. The ode associated with
this pde will lead to exponential decay:

= Ce−δτ − φ

has the solution
C −δτ
φ(τ ) = Be−τ +
e .
1−δ
Hence, by the maximum principle (Theorem 3.2), V is exponentially bounded. So
˜ k Rc|
|∇ ˜ 2 ≤ V ≤ Ck e−δk τ

as required.
Now we have to go from bounds on |∇ ˜ k Rc|
˜ 2 to bounds on |∂ k Rc|
˜ 2 . As we did in Corollary 6.11
we treat the Christoffel symbols as a tensor in some coordinate chart U . To go from covariant
derivatives to coordinate derivatives we will need bounds on the derivatives of the Christoffel
symbols.
Lemma 7.24. In a fixed coordinate chart U , for all k ∈ N we have a uniform upper bound

|∂ k−1 Γ̃| ≤ Ck .

79
Proof. We will prove the result by strong induction. Assume the result is true for all 1 ≤ k ≤ m−1.
As in the previous lemma, no base case is required.
We can now do the inductive step. We can calculate ∂τ Γ̃ from formula (1.17) in Lemma 1.20,
with hij = −2R̃ij + (2r̃/3)g̃ij for the normalized Ricci flow. The result is

˜ i R̃jl + ∇
∂τ Γ̃kij = −g̃ kl (∇ ˜ j R̃il − ∇
˜ l R̃ij ) (7.28)

˜ = 0 and ∇r̃
(recalling that ∇g̃ ˜ = 0). It now follows by Lemma 1.3 and equation (7.28) that
 
m−1


X  
 ˜i
∂ m−1 ∂τ Γ̃ =  ∂ j Γ̃  ∗ ∇ (∂τ Γ̃)
i=0 j ≤m−2
 
m


X  
 ˜i ˜
= ∂ j Γ̃  ∗ ∇ Rc


i=1 j ≤m−2

By the inductive hypothesis we have a uniform bound on each of the terms ∂ j Γ̃ and by Lemma
˜ i Rc.
7.23 we have a decaying exponential bound on the terms ∇ ˜ Thus we have

|∂τ ∂ m−1 Γ̃| ≤ Ce−δτ .

It follows that Z ∞
|∂ m−1 Γ̃| ≤ |∂ m−1 Γ̃|t=0 + Ce−δτ dτ ≤ Cm
0
for some positive constant Cm . This completes the inductive step and hence the proof.
We now have, by Lemma 1.3,
 
k

X
|∂ k F | = ˜ i F ≤ Ce−δτ
∂j Γ  ∗ ∇
  


i=0 j ≤k−1

for all k ∈ N. We have used the bounds |∂ j Γ̃| ≤ C for j ≤ m − 1 from Lemma 7.24 and the bounds
˜ i F |2 ≤ Ce−δτ from Lemma 7.23 and Theorem 7.16.
|∇
Hence, because we have
Z ∞
∂g̃
g̃∞ (x) = g̃(x, 0) − 2 dτ
0 ∂τ
Z ∞
= g̃(x, 0) − 2 F (x, τ )dτ,
0

we may conclude that for any multi-index α, we can take the derivative under the integral sign:
Z ∞ |α|
∂ |α| ∂ |α| ∂
α
g̃∞ (x) = α
g̃(x, 0) − 2 F (x, τ )dτ,
∂x ∂x 0 ∂xα

where the integral is finite by the result proved above:


Z ∞ |α| Z ∞ Z ∞
∂ |α|
F (x, τ )dτ ≤ |∂ F (x, τ )|dτ ≤ Ce−δτ dτ < ∞.
0 ∂xα 0 0

Therefore g̃∞ has derivatives of all orders (it is smooth) and it is clear that the convergence is
uniform in any C k norm, in the same way that it followed for Corollary 6.12.

80
81
Chapter 8

Singularities in the Ricci Flow

This chapter is based on [28, Chap. 6, 7, 8].

8.1 Blowing Up at Singularities – Heuristics


It is time to step back and try to see where the achievements of Chapter 7 take us, in the context
of an attack on Thurston’s Geometrization Conjecture. We saw that, in the specific case of strictly
positive initial Ricci curvature, the Ricci flow has a singularity in finite time, as Rmax → ∞
as t → T . In fact we proved more than this; we showed that there is a singularity occurring
simultaneously everywhere in the manifold (this is a consequence of Theorem 7.11). We were
able to rescale the manifold by a time-dependent factor and show that the rescaled flow converged
smoothly to the geometric structure that we desired (namely a constant-curvature metric). We also
rescaled time, but that was more a matter of convenience because it made the evolution equation
(the normalized Ricci flow) easier to deal with.
The Uniformization Theorem for 2-manifolds can be proven using the Ricci flow in a similar
way. Once more one introduces the normalized Ricci flow, and shows that it converges smoothly
to a constant-curvature metric (see [6, Chap. 5]).
The normalized Ricci flow can not bypass all of our singularity problems however. The two
cases we have mentioned so far are special, because the singularity in the Ricci flow happens
everywhere simultaneously. We saw heuristically in Section 2.4 that singularities can, in general,
be expected to develop along some subset of our manifold (for example in neckpinches). In this
case, a rescaling to make the volume constant would not be expected to remove the singularity, it
would make the manifold unbounded. We need a new technique. We would like to somehow blow
our manifold up at the singularity. The aim of this chapter is to describe how this is done in
broad terms, with very little detail. There is a strong analogy with the blowup idea introduced in
the proof of Grayson’s theorem (Theorem 4.6) for the csf.
Consider a Ricci flow (M, g(t)) with a localised singularity at time T . We might have something
like the neckpinch discussed in Section 2.4 (see Figure 8.1). The curvature is exploding near some
subset of M as t → T . Let us choose times ti such that ti → T , and points xi ∈ M such that
|Rm(g(ti ))|(xi ) = supM |Rm(g(ti ))|.
We then have |Rm(g(ti ))|(xi ) → ∞. If we rescale our manifold by a factor of |Rm(g(ti ))|(xi )
at each time ti then (by the result of Lemma 1.19) the curvature will no longer explode as we
approach T , it will always have a maximum of 1. We would like to know what the limit of these
manifolds as i → ∞ is. That will tell us about the nature of the singularity, and what it means
topologically. This is the notion of “blowing up” at a singularity. We now need to know what it
means to take a “limit of manifolds” – this is a subtle question.
Consider, for example, the case of the neckpinch that we have already described. As we blow
the manifold up, the neck itself will remain roughly the same size, but the rest of the manifold
(which was not shrinking as we approached the singularity but is now getting dilated by the large
factors needed to keep the neck “open”) will get pushed out to infinity. Thus we expect the limit
of these blowups to be an infinite cylinder S2 × R – we completely lose the original topological

82
Curvature explodes

Figure 8.1: Curvature becomes large as the manifold gets close to pinching.

information. Despite all of the manifolds in the flow being compact and homeomorphic to each
other, the limit manifold is neither compact nor homeomorphic to the manifolds in the flow.
Note also that the limit depends on “where we are looking”. For example, suppose we had two
neckpinches happening simultaneously in different places. The limit should be well-defined, i.e. it
can’t be both the blowup of the first neckpinch and the blowup of the second neckpinch. Thus
when we blow up, we need to blow up about a specified point. We define the notion of a pointed
Riemannian manifold (M, g, p) as a Riemannian manifold (M, g) together with a distinguished
point p ∈ M.

8.2 Convergence of Manifolds


The theory of limits and convergence of manifolds was developed by Gromov, Cheeger, Peters,
Greene and Wu among others, building on the idea of Hausdorff convergence in metric spaces.
The definition we use is as follows:
Definition 8.1. A sequence (Mi , gi , pi ) of smooth, complete, pointed Riemannian manifolds is
said to converge smoothly to the smooth, complete, pointed manifold (M, g, p) as i → ∞ if the
following are true:
1. There is a sequence of compact sets Ki ⊂ M such that any compact subset of M is contained
in Ki for sufficiently large i, and such that p ∈ int(Ki ) for all i (we say that the sequence of
sets (Ki ) exhausts M).
2. There exist smooth maps φi : Ki → Mi that are diffeomorphic onto their image and such
that φi (p) = pi for all i.
3. φ∗i gi → g smoothly as i → ∞, in the sense that the convergence is uniform in any C k norm
on compact subsets of M.
Note that the points pi tell us “where to look”, as explained at the end of Section 8.1.
Hamilton proved in [11] the following compactness theorem for pointed Riemannian manifolds:
Theorem 8.1. Suppose that (Mi , gi , pi ) is a sequence of complete, smooth, pointed Riemannian
manifolds of dimension n such that

83
Figure 8.2: Non-smooth points can develop if curvature is not controlled.

Figure 8.3: A neck losing a dimension because its injectivity radius was not controlled.

1. For all r > 0 and k ∈ N, we have

sup sup |∇k Rm(gi )| < ∞;


i∈N Bgi (pi ,r)

2. The injectivity radius at the points pi is uniformly bounded away from zero:

inf inj(Mi , gi , pi ) > 0.


i

Then some subsequence of the sequence (Mi , gi , pi ) converges smoothly to a complete, smooth,
pointed Riemannian manifold of dimension n.
Note that both conditions are necessary – the first to avoid situations where the limit manifold
is not smooth, as in Figure 8.2. The second avoids such monstrosities as dimensional reduction,
see Figure 8.3.
Hamilton extended this compactness result to deal with limits of Ricci flows. In this way, the
limit when we blow up our singularity will not only be a pointed Riemannian manifold, it will be
a pointed solution of the Ricci flow.
Definition 8.2. Let (Mi , gi (t), pi ) be a sequence of pointed smooth manifolds with time-evolving
metrics defined for on some time interval t ∈ (a, b). If (M, g(t), p) is a pointed smooth mani-
fold with a time-evolving metric for t ∈ (a, b), we say that (Mi , gi (t), pi ) converges smoothly to
(M, g(t), p) as i → ∞ if the following conditions are satisfied:
1. There is a sequence of compact sets Ki ⊂ M that exhausts M, such that p ∈ int(Ki ) for all
i.

84
2. There is a sequence of smooth maps φi : Ki → Mi which are diffeomorphisms onto their
image, such that φi (p) = pi .
3. φ∗i gi (t) → g(t) smoothly in both space and time, in the sense that the convergence is uniform
in any C k norm on any compact subset of M × (a, b).
The compactness theorem for manifolds (Theorem 8.1), can be extended to a compactness
theorem for Ricci flows (this also first appeared in [11])
Theorem 8.2. Let (Mi , gi (t), pi ) be a sequence of n-dimensional pointed solutions of the Ricci
flow defined for t ∈ (a, b), where 0 ∈ (a, b). Suppose that
1.
sup |Rm(gi (t))|(x) < ∞;
i∈N,(x,t)∈Mi ×(a,b)

2.
inf inj(Mi , gi (0), pi ) > 0.
i

That is, the curvature is uniformly bounded above and the injectivity radius at pi is uniformly
bounded away from zero at t = 0. Then there exists a marked solution of the Ricci flow (M, g(t), p)
defined for t ∈ (a, b) such that a subsequence of (Mi , gi (t), pi ) converges smoothly to (M, g(t), p).
Note that we do not need control over all derivatives of the curvature as we did for Theorem
8.1 because of the Bernstein-Bando-Shi estimates (Theorem 6.6) which give us bounds on all
derivatives of the curvature in terms of a bound on the curvature. Of course these estimates rely
crucially on the fact that the metrics gi (t) satisfy the Ricci flow equation.

8.3 Blowing Up at Singularities – Results


Let us now apply these compactness results to the blowup of finite-time singularities as described
in Section 8.1. Suppose that a solution of the Ricci flow (M, g(t)) exists on a maximal time interval
t ∈ [0, T ). By the result of Theorem 6.7, the maximum value of |Rm| on M explodes to +∞ as
t → T . Thus we may choose points pi ∈ M and times ti ∈ [0, T ) such that ti → T and

|Rm|(pi , ti ) = sup |Rm|(x, t) → ∞. (8.1)


x∈M,t∈[0,ti ]

Let us define Mi := |Rm|(pi , ti ). Note the analogy with the proof of Grayson’s Theorem (Theorem
4.6).
We now blow up the Ricci flows as described in Section 8.1: define
 
t
gi (t) = Mi g ti + .
Mi

This is merely a parabolic1 rescaling of time and the metric in addition to a translation in time,
and it is not difficult to see that gi (t) is a solution of the Ricci flow defined on the time interval
[−M ti , M (T − ti )). Furthermore, for t ≤ 0 we have
  
|Rm g ti + Mt i | Mi
|Rm(gi (t))| = ≤ =1
Mi Mi
by the way we defined Mi in equation (8.1). By Corollary 6.13, there exists b > 0 (depending only
on the dimension of the manifold) such that, for all i, gi (t) is defined for t ∈ (−M ti , b), and

sup |Rm(gi (t))|(x) < ∞. (8.2)


i∈N,(x,t)∈Mi ×(−M ti ,b)

1A √
parabolic rescaling of a system scales time by a factor α and length by a factor α – it is a characteristic
of heat-type evolution equations that a parabolic rescaling of any solution is another solution.

85
Because the lower limits −M ti of the time intervals tend to −∞, we seek to define a limit flow
for t ∈ (−∞, b) (for any −B < 0 arbitrarily large, the flows gi (t) will eventually be defined on
(−B, b) so the limit can be defined). Now equation (8.2) shows that the first condition of Theorem
8.2 is satisfied, so the only thing we need in order to pass to a limit flow is to uniform bound
the injectivity radii of the manifolds (M, gi (t)) away from zero, for all i ∈ N and t ∈ (−M ti , b).
This turned out to be a major stumbling block for quite a while, but was resolved by Perelman in
[23]. We will briefly discuss some of Perelman’s ideas in Section 8.4. An accessible exposition of
Perelman’s proof of the injectivity radius bounds is contained in [28, Chap. 8]. Using Perelman’s
injectivity radius bounds, we can apply Theorem 8.2. The result is (passing to a subsequence of
(M, gi (t), pi )):
Theorem 8.3. Suppose that (M, g(t)) is a solution of the Ricci flow defined on a maximal time
interval [0, T ), where T < ∞. Then there exist points pi ∈ M and times ti ∈ [0, T ), ti → T such
that
Mi := |Rm|(pi , ti ) = sup |Rm|(x, t) → ∞.
x∈M,t∈[0,ti ]

If we now define  
t
gi (t) = Mi g ti +
Mi
then there exists b > 0 such that (M, gi (t), pi ) converges to a Ricci flow (N , g∞ (t), p∞ ) defined for
t ∈ (−∞, b). Furthermore |Rm(g∞ (0))|(p∞ ) = 1 and |Rm(g∞ (t))| ≤ 1 for t ≤ 0.
One can now seek to classify the possible singularities that one can obtain from such a limiting
argument. Note that the limit flow is defined for t ∈ (−∞, b) – such solutions are called ancient.
Ancient solutions are very special. If we run the Ricci flow equation in reverse we get a non-
parabolic system that tends to develop singularities. Thus we do not expect to be able to extend
an arbitrary Ricci flow on some finite interval (a, b) to an ancient one defined on (−∞, b). Because
ancient solutions are so special, we can hope to classify them despite the fact that classification of
arbitrary Ricci flows is a hopelessly complicated task.
Hamilton went a long way towards performing this classification in [12], showing that (if injec-
tivity radius bounds were satisfied) the limit of the singularity has to be either a 3-sphere S3 or
quotient thereof (which shrinks to a point in finite time) or a cylinder S2 × R, which corresponds
to a neckpinch (see Figure 8.4 for the 2-dimensional analogue). He was unable to rule out a third
possible limit, namely Σ2 × R where Σ2 is Hamilton’s “cigar soliton” (described in Section 2.3),
but Perelman has done away with this difficulty. “Degenerate neckpinches”, in which one of the
halves of the manifold shrinks to a point at the same time as the neck becomes singular (producing
a sort of cusp in the manifold) are also possible.
The neckpinches, as described in Section 2.4, correspond to connected sum decompositions of
the manifold (although some trivial decompositions may occur). The idea is to identify when a
neckpinch is happening, and to perform the connected sum decomposition “manually” – that is,
we cut out the neck part and seal the ends off with discs, glued on smoothly (compare Figure
2.3). This is the process known as “surgery”. The flow can then continue. It might have any
number of singularities that are either quotients of 3-spheres shrinking to a point in finite time, or
products S2 × S1 that shrink down to the S1 factor in finite time, or neckpinches which we must
deal with by surgery. Perelman showed that the surgery times are locally finite (i.e., in any finite
time interval there are only a finite number of times at which surgery occurs). We are left over
with the solutions that have no finite-time singularity, which Perelman showed (building on work
by Hamilton in [13]) must satisfy the Geometrization Conjecture (under rescaling the manifold will
split along tori into expanding hyperbolic pieces with cusps and collapsing “graph manifolds”, as
discussed in Section 2.4).
To show the power of Theorem 8.3 we provide an alternative proof of Theorem 7.1.
Proof. (of Theorem 7.1) We consider a Ricci flow on a closed 3-manifold with initially strictly
positive Ricci curvature. We showed in Theorem 7.2 that the flow must have a singularity in finite
time, hence Theorem 8.3 applies. Let us say that the maximal time interval of existence is [0, T ).
Thus there exist points pi ∈ M, times ti → T and rescaled flows gi (t) defined for t ∈ (−ai , b)

86
Figure 8.4: The limit of a manifold that gets close to pinching is a cylinder.

(where b > 0 and ai → +∞) such that the pointed, rescaled flows (M, gi (t), pi ) converge smoothly
to a Ricci flow (N , g∞ (t), p∞ ) defined for t ∈ (−∞, b).
We use the result of Corollary 7.9, which says that for the Ricci flow (M, g(t)),

|E(g)|2 ≤ CR(g)2−δ

(where E is the Einstein tensor) for some positive constants C, δ. Now the flows gi (t) are just
rescalings of g(t) by the respective factors Mi (defined in Theorem 8.3, with the property that
Mi → ∞), hence one can deduce from the results of Lemma 1.19 that

Mi2 |E(gi )|2 ≤ CMi2−δ R(gi )2−δ


⇒ |E(gi )|2 ≤ CR(gi )2−δ Mi−δ .

Taking the limit as i → ∞ and evaluating at t = 0, we have

|E(g∞ (0))|2 ≤ CR(g∞ (0))2−δ lim Mi−δ = 0


i→∞

because Mi → ∞.
Therefore g∞ (0) is an Einstein metric. By Lemma 1.14, it has constant sectional curvature.
Because R(gi (t)) > 0 for all i, in the limit we must have R(g∞ (0)) ≥ 0 hence the sectional
curvatures are non-negative. Furthermore |Rm(g∞ (0))| = 1 so the sectional curvatures are strictly
positive.
It follows that the Ricci curvature tensor of g∞ (0) on N is bounded below and hence, by
Myers’ Theorem (Theorem 1.17) that N is compact. Therefore, by the definition of convergence
of manifolds, N must in fact be diffeomorphic to M. Thus the metric g∞ (0) induces a metric of
constant, strictly positive sectional curvature on M, as required.
Note that, although we needed the pinching results of Section 7.4 for this proof, we did not
need to prove any of the subsequent global pinching or convergence results – so significantly less
work is required than in Chapter 7, if we assume Theorem 8.3!

8.4 Perelman’s F- and W-functionals


After people had tried unsuccessfully for some time to find a way of bounding the injectivity radius
so that Theorem 8.3 could be obtained, Perelman came along and cleared things up (in [23]). We
will not give much of an outline of his argument (an accessible exposition is contained in [28, Chap.

87
6,8]) but will present one of the key ideas behind his proof – that the Ricci flow can be formulated
as a gradient flow.
We return to the analogy of the csf. Recall Lemma 4.1, which showed that the csf was a
steepest-descent flow for length. Can the Ricci flow be similarly formulated as the steepest-descent
flow for some functional? The simplest approaches do not work. In analogy with the length
functional for the csf the simplest functional we might consider is the volume of the manifold
with the metric g – but we know that some manifolds increase in volume and some decrease in
volume under the Ricci flow (specifically, hyperbolic manifolds increase in volume while manifolds
of positive sectional curvature decrease in volume – see Section 2.3), soR that will not work. The
second-simplest functional we might try is the total scalar curvature, Rdµ. The corresponding
gradient flow is, by formula (1.23) in Lemma 1.20,
∂g 1
= −Rc + Rg.
∂t 2
This is similar to the Ricci flow and looks a bit like the unnormalized Ricci flow but is not parabolic
(and cannot be made parabolic using methods like the DeTurck trick) so it is completely useless
as an evolution equation.
What clues do we have to the nature of the correct functional? One property it ought to have
is that it should obviously be constant on fixed points of the Ricci flow. But recall from Section 2.3
that there exist “generalized fixed points” of the Ricci flow, the so-called Ricci solitons on which
the Ricci flow is only changing the manifold by reparametrization and scaling. We hypothesize
that the functional we seek should also be constant on gradient Ricci solitons (gradient solitons are
much easier to deal with than general solitons). For the moment we will consider gradient solitons
that do not involve any rescaling of the metric, so the constant λ that appears in equation (2.1) is
0. Thus we expect our functional to be constant if there exists a scalar function f : M → R such
that Rij + ∇i ∇j f = 0.
This gives us a clue that our functional should not depend on g alone, but should also depend
on a scalar function f . In fact we define Perelman’s F-functional:
Z
R + |∇f |2 e−f dµ,

F(g, f ) :=
M

defined for Riemannian metrics g and scalar functions f . To work out what its gradient flow is
we must calculate how it evolves under an arbitrary variation in the metric g and the function f :
∂g/∂t = h and ∂f /∂t = k.
Lemma 8.4. The variation of F(g, f ) is given by:
Z   
d 1 ij
F(g, f ) = hij (−Rij − ∇i ∇j f ) + (24f − |∇f |2 + R) h gij − k e−f dµ.
dt M 2
We will not prove this result, but it follows from the results and methods used in Lemma 1.20.
Perelman now defined f so that the volume form e−f dµ is preserved. By formula (1.21) of
Lemma 1.20, we have  
∂ −f 1 ij
e dµ = g hij − k e−f dµ.
∂t 2
So, if we determine the evolution of f by
∂f 1 ∂
= g ij gij
∂t 2 ∂t
and define the (now constant) “modified volume form” dξ := e−f dµ, then the result of Lemma 8.4
is: Z
d
F(g, f ) = hij (−Rij − ∇i ∇j f )dξ.
dt M
Hence the steepest descent flow for this functional is the one with hij = −Rij − ∇i ∇j f , i.e.

gij = −Rij − ∇i ∇j f.
∂t

88
This is exactly the Ricci flow, with a rescaling of time (as we have −Rij rather than −2Rij on
the rhs) and a time-dependent reparametrisation moving along the gradient vector field of f (as
outlined in the discussion of Ricci solitons in Section 2.3). So the gradient flow of this functional
is equivalent to the Ricci flow. We also note that if the solution is stationary under this flow,
it satisfies the gradient Ricci soliton equation (2.1) with no rescaling term, as we have argued it
should.
We would like to extend the F-functional in some way so that the functional is also constant on
gradient solitons that include rescaling. This is the idea behind the W-functional. We introduce a
positive real-valued scaling parameter τ > 0 and define
Z
τ (R + |∇f |2 ) + f − n udµ

W(g, f, τ ) :=
Mn

where n is the dimension of the manifold and

u := (4πτ )−n/2 e−f .

The arguments g, f, τ are said to be compatible if


Z
udµ = 1.
Mn

We would like an evolution equation for W in the style of Lemma 8.4. The analogous result is

Lemma 8.5. If g, f, τ evolve according to


∂g
= −2Rc
∂t
∂τ
= −1
∂t
∂f n
= −4f + |∇f |2 − R +
∂t 2τ
then Z
d 1
W(g, f, τ ) = 2τ |Rij + ∇i ∇j f − gij |2 udµ.
dt Mn 2τ
In particular, W is nondecreasing in time and remains constant if and only if
1
Rij + ∇i ∇j f − gij = 0,

that is, if and only if g is a shrinking gradient Ricci soliton. The condition that g, f, τ are compatible
is preserved under this evolution. Note that the evolution equation for f is a backwards heat
equation (because of the − sign in front of the Laplacian). Backwards heat equations are not
parabolic, and generally can not be expected to have a solution even for a short time. However, if
we reverse time we get a normal heat-type equation, which does have a short-time solution. Thus,
rather than specifying initial conditions on f , we can only specify final conditions.
Perelman was able to use the W-functional to prove bounds on the volume of balls of given
radius as the metric evolves under the Ricci flow, by defining f so that e−f is a bump function
with support inside the ball (W then has the form of an integral over the ball, which can be related
to the ball’s volume). He showed that the volume of such balls will not collapse, from which it is
possible to show that their injectivity radius is bounded below, which is exactly the sort of result
we needed to prove Theorem 8.3. See [28, Chap. 8] for the full argument.

89
Appendix A

Existence Theory for Parabolic


PDEs

In this Appendix we will outline the main existence results for pdes that we will use. This material
is based on [6, Sec 3.2] and [28, Chap. 4].
Consider a vector bundle π : E → M, where M is some Riemannian manifold, with some
bundle metric h. We will be interested in pdes describing the evolution of some time-dependent
section of E, u : M × [0, T ) → E, having the form

∂u
= L(u) (A.1)
∂t
u(x, 0) = u0 (x) (A.2)

where L : C ∞ (E) → C ∞ (E) is some differential operator.

A.1 Linear Theory


In this section we will use the multi-index notation for partial derivatives.

Definition A.1. A multi-index α is an n-tuple of non-negative integers (α1 , α2 , . . . , αn ). We


define
Xn
|α| = αi .
i=1

If f is a function of n variables x1 , x2 , . . . , xn then we define

∂ |α| f
∂ α f := .
∂xα α2
1 ∂x2
1
. . . ∂xα
n
n

We first consider the case that L is a linear differential operator. That is, using multi-index
notation,
X
L(u) = Lα ∂ α u (A.3)
|α|≤k

where k is the order of L and Lα ∈ Hom(E, E).


For example, we might consider a second-order linear differential operator acting on scalar
functions on Rn (so M = Rn , E = M × R, k = 2). This will have the form
X X
L̄(u) = aij ∂ i ∂ j u + bi ∂ i u + cu.
i,j i

90
If ϕ is some covector field (i.e. ϕ ∈ C ∞ (T ∗ M)) then we define the total symbol of L in the
direction ϕ to be the bundle homomorphism σ[L](ϕ) : E → E
X Y
σ[L](ϕ)(u) = Lα (u) ϕαj .
|α|≤k j

In the case of the example L̄ we gave above, the total symbol in the direction ϕ is
X X
σ[L̄](ϕ)(u) = aij ϕi ϕj u + bi ϕi u + cu.
i,j i

We define the principal symbol of L in the direction ϕ to be the bundle homomorphism


σ̂[L](ϕ) : E → E that comes only from the highest order derivative terms in L:
X Y
σ̂[L](ϕ)(u) = Lα (u) ϕαj .
|α|=k j

The principal symbol of the example L̄ in the direction ϕ is thus:


X
σ̂[L̄](ϕ)(u) = aij ϕi ϕj u.
i,j

The operator L is said to be elliptic if σ̂[L](ϕ) is a vector bundle isomorphism whenever ϕ 6= 0.


For our example L̄ this means that X
aij ϕi ϕj 6= 0
i,j

whenever ϕ 6= 0, i.e. that [aij ] is a non-singular matrix.


The system A.1 is said to be strongly parabolic1 if there exists δ > 0 such that at each point
of the manifold M, for all covectors ϕ 6= 0 and elements u 6= 0 of E,
hσ̂[L](ϕ)(u), ui > δ|ϕ|2 |u|2 .
In the order-2 example we considered above, this is saying that
X
aij ϕi ϕj > δ|ϕ|2
i,j

for all ϕ 6= 0, i.e. that the matrix [aij ] is positive-definite.

A.2 Linearization of Nonlinear PDEs


We will mainly deal with non-linear pde’s, where L is not given by the convenient form of A.3.
What does it mean for a non-linear pde to be parabolic? We need to define the linearization of
the non-linear operator L. The linearization is defined in analogy with the derivative of a function
f : Rn → Rm , which is the linear map Df : Rn → Rm defined to “best approximate” f at each
point. That is, if we have curve x : [0, 1] → Rn satisfying
x(0) = p
0
x (0) = v
then
d
Df (v) = f (x(t))
dt t=0
In a similar way we define the linearization of a non-linear operator L on a vector bundle E. If
u : [0, 1] → C ∞ (E) is a time-dependent section of E such that
u(0) = u0
u0 (0) = v.
1 We will often just say “parabolic” when we mean “strongly parabolic”

91
We define the linearization of L at u0 to be the linear map D[L] : C ∞ (E) → C ∞ (E) so that

d
D[L](v) = L(u(t))
dt t=0

(if it exists).
We say that the system A.1 is strongly parabolic at u0 if the system
∂u
= D[L](u) (A.4)
∂t
u(x, 0) = u0 (x) (A.5)

is strongly parabolic in the sense described previously. We will only be interested in the case where
D[L] has the form X
D[L] = L̃α ∂ α
|α|≤k

with some finite k.


Theorem A.1. If the system A.1 is strongly parabolic at u0 then there exists a solution on some
time interval [0, T ), and the solution is unique for as long as it exists.

Practically all existence proofs in this project will rely on this theorem. See [17] for the proof.

92
Appendix B

Differential Geometry Formulae

1. Christoffel symbols, for the Levi-Civita connection of the metric gij :

1 kl
Γkij = g (∂i gjl + ∂j gil − ∂l gij )
2
2. In normal coordinates about the point p:
(a) γV (t) = (tV 1 , tV 2 , . . . , tV n ) is a geodesic
(b) gij (p) = δij
(c) Γkij (p) = 0, ∂i gjk (p) = 0
3. Covariant derivative:

l
X k
X
∇p Fij11...i
...jl
k
= ∂p Fij11...i
...jl
k
+ Fij11...i
...q...jl js
k
Γpq − Fij11...q...i
...jl
Γq
k pis
(B.1)
s=1 s=1

4. Riemann curvature tensor:

[∇i , ∇j ]X l ≡ Rijk
l
Xk

5. Symmetries of the Riemann curvature tensor:

Rijkl = Rklij = −Rjikl = −Rijlk

6. First Bianchi identity:

Rijkl + Rjkil + Rkijl = 0

7. Second Bianchi identity:

∇p Rijkl + ∇i Rjpkl + ∇j Rpikl = 0 (B.2)

8. Ricci and scalar curvatures:

m
Rij ≡ Rmjk , R ≡ Rii

9. Contracted second Bianchi identity:

1
∇j Rij = ∇i R
2

93
10. Commuting covariant derivatives:

l
X k
X
[∇p , ∇q ]Fij11...i
...jl
k
= js
Rpqm Fij11...i
...m...jl
k
− m
Rpqi F j1 ...jl
s i1 ...m...ik
(B.3)
s=1 s=1

11. Coordinate form of the Riemann curvature tensor:


l
Rijk = ∂i Γljk − ∂j Γlik + Γpjk Γlip − Γpik Γljp

94
Bibliography

[1] Michael T. Anderson. Geometrization of 3-manifolds via the Ricci flow. Notices of the AMS,
51:184–193, 2004.
[2] Huai-Dong Cao and Xi-Ping Zhu. A complete proof of the Poincaré and geometrization
conjectures – application of the Hamilton-Perelman theory of the Ricci flow. Asian J. Math.,
10, No. 2:165–492, 2006.
[3] J. Cheeger and D. Ebin. Comparison theorems in Riemannian Manifolds. North-Holland,
1975.
[4] XiuXiong Chen, Peng Lu, and Gang Tian. A note on uniformization of Riemann surfaces by
Ricci flow. http://arXiv.org/math.DG/0505163, 2005.
[5] S.S. Chern, W.H. Chen, and K.S. Lam. Lectures on Differential Geometry. World Scientific,
2000.
[6] Bennett Chow and Dan Knopf. The Ricci Flow: An Introduction. AMS Math. Surveys and
Monographs, 2004.

[7] Dennis M. DeTurck. Deforming metrics in the direction of their Ricci tensors. Journal of
Differential Geometry, 18:157–162, 1983.
[8] S. Gallot, D. Hulin, and J. Lafontaine. Riemannian Geometry. Springer-Verlag, 1987.
[9] Richard Hamilton. Three-manifolds with positive Ricci curvature. J. Diff. Geo., 17:255–306,
1982.
[10] Richard Hamilton. Four-manifolds with positive curvature operator. J. Diff. Geo., 24:153–179,
1986.
[11] Richard Hamilton. A compactness property for solutions of the Ricci flow. American Journal
of Mathematics, 117:545–572, 1992.
[12] Richard Hamilton. The formation of singularities in the Ricci flow. Surveys in Differential
Geometry, 2:1–136, 1995.
[13] Richard Hamilton. Non-singular solutions of the Ricci flow on three-manifolds. Comm. Anal.
Geom., 7:695–729, 1999.

[14] Noel J. Hicks. Notes on Differential Geometry. Van Nostrand, 1965.


[15] Wilhelm Klingenberg. A Course in Differential Geometry. Springer-Verlag, 1978.
[16] Hellmuth Kneser. Geschloßene flächen in dreidimensionalen mannigfaltigkeiten. Jahresbericht
der Deutschen Mathematischen Verein, 38:248–260, 1929.

[17] O. A. Ladyženskaja, V. A. Solonnikov, and N. N. Uralćeva (Translated from the Russian by


S. Smith). Linear and quasilinear equations of parabolic type. Vol. 23 AMS, 1967.
[18] John M. Lee. Riemannian Manifolds: An Introduction to Curvature. Springer-Verlag, 1997.

95
[19] John Milnor. A unique factorization theorem for 3-manifolds. Amer. J. Math., 84:1–7, 1962.

[20] John Milnor and James Stasheff. Characteristic Classes. Princeton University Press, 1974.
[21] John W. Morgan and Gang Tian. Ricci flow and the Poincaré conjecture.
http://arXiv.org/math.DG/0607607v1, 2006.
[22] James R. Munkres. Topology, 2nd Ed. Prentice Hall, 2000.

[23] Grisha Perelman. The entropy formula for the Ricci flow and its geometric applications.
http://arXiv.org/math.DG/0211159v1, 2002.
[24] Grisha Perelman. Finite extinction time for the solutions to the Ricci flow on certain three-
manifolds. http://arXiv.org/math.DG/0307245v1, 2002.
[25] Grisha Perelman. Ricci flow with surgery on three-manifolds.
http://arXiv.org/math.DG/0303109v1, 2003.
[26] Michael Spivak. A Comprehensive Introduction to Differential Geometry. Publish or Perish,
1979.
[27] William P. Thurston. Three-Dimensional Geometry and Topology, Vol. 1. Princeton University
Press, 1997.
[28] Peter Topping. Lectures on the Ricci Flow.
http://www.maths.warwick.ac.uk/~topping/RFnotes.html, 2005.
[29] Peter Topping. Lectures on the Ricci Flow. Cambridge University Press, 2006.

[30] Xi-Ping Zhu. Lectures on Mean Curvature Flows. International Press, 2002.

96

You might also like