dòng chảy ricci
dòng chảy ricci
Nick Sheridan
The aim of this project is to introduce the basics of Hamilton’s Ricci Flow. The Ricci flow is a pde
for evolving the metric tensor in a Riemannian manifold to make it “rounder”, in the hope that one
may draw topological conclusions from the existence of such “round” metrics. Indeed, the Ricci flow
has recently been used to prove two very deep theorems in topology, namely the Geometrization
and Poincaré Conjectures. We begin with a brief survey of the differential geometry that is needed
in the Ricci flow, then proceed to introduce its basic properties and the basic techniques used
to understand it, for example, proving existence and uniqueness and bounds on derivatives of
curvature under the Ricci flow using the maximum principle. We use these results to prove the
“original” Ricci flow theorem – the 1982 theorem of Richard Hamilton that closed 3-manifolds
which admit metrics of strictly positive Ricci curvature are diffeomorphic to quotients of the round
3-sphere by finite groups of isometries acting freely. We conclude with a qualitative discussion of
the ideas behind the proof of the Geometrization Conjecture using the Ricci flow.
Most of the project is based on the book by Chow and Knopf [6], the notes by Peter Topping
[28] (which have recently been made into a book, see [29]), the papers of Richard Hamilton (in
particular [9]) and the lecture course on Geometric Evolution Equations presented by Ben Andrews
at the 2006 ICE-EM Graduate School held at the University of Queensland. We have reformulated
and expanded the arguments contained in these references in some places. In particular, the proof
of Theorem 7.19 is original, based on a suggestion by Gerhard Huisken. We also diverge from the
existing references by emphasising the analogy between the techniques applied to the Ricci flow
and those applied to the curve-shortening flow, which we feel helps clarify the important ideas
behind the technical details of the Ricci flow. Chapter 6 is based on [6, Chap. 6, 7], but we have
significantly reformulated the material and elaborated on the proofs. We feel that our organization
is easier to follow than Chow and Knopf’s book. The attempt to motivate the compactness result
in Section 8.1 is also original.
1
Acknowledgements: Thanks to my supervisor Craig Hodgson for being so generous with
his time and for always explaining things in the best possible way. Thanks also to Ben Andrews
and Gerhard Huisken for the help they gave me at the ICE-EM Graduate School in July of this
year and afterwards, and thank you to ICE-EM and the University of Queensland for running this
excellent program.
2
Contents
1 Riemannian Geometry 6
1.1 Vectors, Tensors and Metrics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.2 The Covariant Derivative . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.3 The Lie Derivative . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.4 Curvature . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1.5 Topology from Geometry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
1.6 Scaling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
1.7 Time-evolving metrics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
4 Curve-Shortening Flow 36
4.1 Steepest Descent Flow for Length . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
4.2 Short Time Existence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
4.3 Finite Time Singularity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
4.4 Curvature Explodes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
4.5 Grayson’s Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
3
7.8 Details . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
4
5
Chapter 1
Riemannian Geometry
We will sweep through the basics of Riemannian geometry in this chapter, with a focus on the
concepts that will be important for the Ricci flow later. Most proofs will be neglected for brevity, as
the main point of much of the chapter is to establish conventions. The particularly useful formulae
have been collected in Appendix B. We also give a flavour in Section 1.5 of how geometry can
be used to draw topological conclusions. The material in this chapter is presented in much more
detail in many texts – for example, [14] presents the basics of manifolds, tangent vectors, tensors
and the Lie derivative. The book [18] is an excellent place to learn the theory of curvature.
2. Mn is Hausdorff.
3. Mn is paracompact (see [26, Vol. I, App. A, Chap. 1] for a discussion of this condition).
We will usually write M for a generic manifold, and Mn for an n-manifold if the dimension
is of particular relevance. In this project we will deal with smooth manifolds, which have more
structure than topological manifolds. In order to define what a smooth manifold is, we must first
define the concept of a smooth function between subsets of Euclidean space.
Definition 1.2. A function f : U → Rm , where U is an open subset of Rn , is called smooth or
C ∞ if all of its partial derivatives exist and are continuous on U .
Now we can define the concept of a smooth manifold.
Definition 1.3. Given two coordinate charts (U, ϕ) and (V, ψ) on a manifold M, with U ∩ V 6= ∅,
we call the map ψ ◦ ϕ−1 : ϕ(U ∩ V ) → ψ(U ∩ V ) a transition map. Note that each transition map
is a homeomorphism from an open set in Rn to another open set in Rn . We make the definitions:
1. M is called smooth or C ∞ if all the transition maps are smooth.
6
Definition 1.4. Let f : M → N , where M and N are smooth manifolds. f is called smooth if,
for every pair of coordinate charts (U, ϕ) of M and (V, ψ) of N , the function
is smooth.
As a special case we can set N = R, which has a natural smooth manifold structure. The set
of all smooth real-valued functions f : M → R is denoted C ∞ (M ).
Two topological manifolds are equivalent if they are homeomorphic. The notion of equivalence
on smooth manifolds is a bit more subtle.
Definition 1.5. Two smooth manifolds M, N are equivalent if there exists a smooth function
f : M → N which has a smooth inverse. We will call such a function f a diffeomorphism1 and
say that M and N are diffeomorphic.
In 3 dimensions, topological and smooth manifolds are essentially equivalent. That is, any
topological manifold is homeomorphic to a unique smooth manifold, and vice versa. This result
is not true for higher dimensions. However most of the results in this project will deal with 3-
manifolds, so it is of interest to note that we do not lose any generality by assuming that our
3-manifolds are smooth.
Now that we know what a manifold is, we would like to define a tangent vector to our manifold
M at a point p ∈ M.
Definition 1.6. A tangent vector to a smooth manifold M at a point p ∈ M is a derivation,
that is, an R-linear function X : C ∞ (M) → R satisfying the product rule:
The set of all tangent vectors to an n-manifold Mn at p forms an n-dimensional vector space
Tp Mn .
We note that this definition is related to the more intuitive notion of a tangent vector as a
velocity vector of a curve γ : (−, ) → M in the manifold. If γ(0) = p then we associate to the
velocity vector of γ at p the derivation X ∈ Tp M, where
d
X(f ) = f (γ(t))|t=0 .
dt
We then write X = γ̇(0).
If (xi ) is a local coordinate system about p on an n-manifold Mn , then the set of derivations
{∂/∂xi , i = 1, 2, . . . , n} forms a basis for Tp Mn . To avoid unseemly typesetting nightmares we
will often write ∂i for ∂/∂xi if there can be no confusion about the coordinate system being used.
The set of all tangent vectors at all points of Mn forms a (2n)-manifold known as the tangent
bundle, and denoted T Mn .
A vector field on a manifold M is a smoothly-varying choice of tangent vector at each point
p ∈ M. Here “smoothly-varying” means X(f ) ∈ C ∞ (M) for any f ∈ C ∞ (M).
We note in passing that there is some additional structure to T M on top of the vector space
structure on Tp M: given two vector fields X, Y on M, we can form their Lie bracket [X, Y ],
defined by
[X, Y ]f = X(Y (f )) − Y (X(f ))
(it turns out that the [X, Y ] defined in this way is a tangent vector in the sense of being a derivation
as described above, although this is not immediately obvious).
The example of tangent vectors is a specific case of a more general construction on a manifold,
known as a vector bundle. The idea is that one associates a vector space to each point of
the manifold M, then glues these vector spaces together so as to get a new, higher-dimensional
manifold.
1 A diffeomorphism is usually defined to be a differentiable map with a differentiable inverse. The distinction
is important when dealing with manifolds of varying levels of differentiability, but in this project we will deal
exclusively with smooth manifolds. Thus the slight abuse of terminology will not cause any confusion.
7
Definition 1.7. A k-dimensional vector bundle is a manifold E (the total space) together with a
manifold M (the base space) and a surjective map π : E → M (the projection) such that
1. For each p ∈ M, the set Ep := π −1 (p) (the fibre of E over p) has a k-dimensional vector
space structure.
2. For each p ∈ M there is an open neighbourhood U of p and a smooth diffeomorphism ϕ :
π −1 (U ) → U ×Rk (a local trivialization) such that ϕ takes each fibre Ep to the corresponding
fibre {p} × Rk by a linear isomorphism.
k copies l copies
z }| { z }| {
∗ ∗ ∗
Tlk (M)
:= T M ⊗ T M ⊗ . . . ⊗ T M ⊗ T M ⊗ T M ⊗ . . . ⊗ T M .
Given a local coordinate system (xi ) about p ∈ M, we can express any kl -tensor field F in the
coordinate system as
F = Fij11...i
...jl
k
(p)∂j1 ⊗ . . . ⊗ ∂jl ⊗ dxi1 ⊗ . . . ⊗ dxik . (1.1)
In this equation we sum over each index jp , iq that is repeated twice, once raised and once lowered –
this is known as the Einstein summation convention. We will almost always use this coordinate
representation of tensors because it makes technical calculations easier a lot of the time, and we
will often write Fij11...i
...jl
k
when we mean F .
Definition 1.8. Given a map between manifolds φ : M → N , we define the derivative map
between the corresponding tangent spaces, φ∗ : Tp M → Tf (p) N by
(φ∗ V )(f ) = V (f ◦ φ)
for V ∈ Tp M and f ∈ C ∞ (N ). Defining φ∗ (A⊗B) := φ∗ (A)⊗φ∗ (B), we can extend this definition
to apply to all k0 -tensors.
In a similar way we can define the derivative map between the corresponding cotangent spaces,
φ∗ : Tf∗(p) N → Tp∗ M, by
(φ∗ ω)(V ) = ω(φ∗ V )
for V ∈ Tp M, ω ∈ Tf∗(p) N . By a similar method to above we can extend φ∗ to apply to all
k
0 -tensors.
8
k−1
to get an element of Tl−1 (M). Note the use of the Einstein summation convention: the index p
is summed over. Obviously the trace depends on which indices you choose to trace over – here
we traced over the j1 and i1 indices. Although it is not immediately obvious, the resulting tensor
does not depend on the local coordinate system you are working in.
A k-form on M is a section of ∧k T ∗ M, i.e. a k0 -tensor field that is completely antisymmetric
in all its indices. A k-vector field on M is a section of ∧k T M.
Definition 1.9. For a 20 -tensor A we write A > 0 (A ≥ 0) if
positive definite at each point of M. We will usually write g for a Riemannian metric, and gij for
its coordinate representation. Given such a g, there is an induced norm on each Tp M which we
write p
|X|g := g(X, X) (1.2)
for X ∈ Tp M. A manifold together with a Riemannian metric, (M, g), is called a Riemannian
manifold.
When there is only one metric under consideration, we will usually neglect the subscript g in
equation (1.2), but there will be some situations in the study of the Ricci flow where we will need
to distinguish between the norms induced by different metrics. We will sometimes use the notation
hX, Y i for g(X, Y ).
Note that a Riemannian metric is not actually a metric (although we will frequently say “metric”
rather than “Riemannian metric” for brevity, in contexts where no confusion is possible). It can
be thought of as an “infinitesimal metric”. In fact any Riemannian metric g on a manifold M
induces a bona fide metric on M, as we will now see:
Definition 1.11. Given a Riemannian metric g we can define the length of a piecewise C 1 curve
γ : [0, 1] → M by
Z 1p
`(γ) := g(γ̇(t), γ̇(t))dt
0
We will sometimes use the metric space notation for a ball: if (M, g) is a Riemannian manifold,
p ∈ M and r > 0, then
B(p, r) := {q ∈ M : d(p, q) < r}
where d is the metric induced by g.
Finally, we say that a map φ : M → N between Riemannian manifolds (M, g) and (N , h) is
an isometry if it is a diffeomorphism and φ∗ h = g. In this case we say that the two Riemannian
manifolds are isometric.
It is not hard to see that an isometry between Riemannian manifolds in the sense just defined
is also a metric-space isometry between the manifolds, if we view them as metric spaces with the
metrics induced by the corresponding Riemannian metrics.
We note that any smooth manifold, by virtue of its paracompactness, admits a smooth Rie-
mannian metric. Thus we do not lose any generality by tackling topological properties of smooth
manifolds from the point of view of Riemannian geometry.
Given a Riemannian manifold (M, g) and a manifold N embedded in M (a submanifold of
M), there is an induced Riemannian metric ḡ on N defined by restricting g to Tp N at each point
p ∈ N.
9
Given the metric gij , which is a positive-definite symmetric matrix at each point of M, we define
the metric inverse g ij to be the inverse matrix at each point, satisfying g ij gjk = δki where δki is
the Kronecker delta. Any inner product on a vector space gives a natural isomorphism V ∼ = V ∗,
[ [ [ j
via X 7→ X , where X (Y ) = hX, Y i. In coordinates, (X )i = gij X . In general, we can lower an
ijk
index i on a tensor Fpq , for example, by setting
jk mjk
Fipq := gim Fpq .
k+1
This takes us from an element of Tlk (M) to something in Tl−1 (M). We can similarly raise an
index using g ij . Using the metric inverse we can also define a norm on the space of tensors, for
example
ijk 2
|Fpq |g := gi1 i2 gj1 j2 gk1 k2 g p1 p2 g q1 q2 Fpi11 qj11k1 Fpi22 qj22k2 .
We will make frequent use of the very convenient ∗-notation in later chapters. Given two
tensors A, B, the expression A ∗ B means “some linear combination of traces of A ⊗ B with
coefficients that do not depend on A or B”. For example if we have A = Aklm s
ij , B = Bpqr , then
A ∗ B might represent
j
17Akil lrs k
ij Blqr − n!Aqi Bisr
Aijk l
ij Bklm .
The meaning is obviously very broad, and is of most use when we want to obtain bounds on
complicated combinations of tensor quantities (as we will in later chapters). The most useful
property of this notation is that, for any given expression of the form A ∗ B, there is a constant C
which does not depend on A or B such that
|A ∗ B| ≤ C|A||B|
by the Cauchy-Schwarz inequality. As a particular case that we will use frequently, we have
Lemma 1.1. If A is an n × n matrix then
1
|A|2 ≥ (trA)2 .
n
The ∗-notation can obviously be extended to multiple ∗-products like A ∗ B ∗ . . . ∗ Z or powers
A∗n := A ∗ A ∗ . . . ∗ A. We also define ∗(A, B, . . . , Z) to mean any combination of ∗-products of
any powers of A, B, . . . , Z, for example
B ∗3 ∗ Z + A∗2 ∗ Z ∗4 .
∗ (Ai ) := ∗(A1 , A2 , . . . , An ).
1≤i≤n
∇ : C ∞ (T M) × C ∞ (E) → C ∞ (E)
10
1. ∇X Y is linear over C ∞ (M) in X.
2. ∇X Y is linear over R in Y .
3. ∇ satisfies the product rule:
∇X (f Y ) = X(f )Y + f ∇X Y.
A connection on the vector bundle E is completely specified by its Christoffel symbols Γkij
in a local coordinate system (xi ) with a local basis (Ej ) for E, defined by:
∇∂i Ej = Γkij Ek .
As a very important special case, we can consider connections on the bundles Tlk (M).
Lemma 1.2. Given a connection ∇ on the tangent bundle T M, we can define connections on all
of the tensor bundles Tlk (M) (which we will also denote ∇) satisfying:
1. ∇ is the given connection on T M.
In the second term of the above equation, the upper q index occupies the position normally occupied
by js , and in the third term the lower q index occupies that position normally occupied by is .
Note that equation (1.3) can be expressed using the ∗-notation as
∇F = ∂F + Γ ∗ F. (1.4)
11
in some local coordinate system2 (xi ) defined in a coordinate patch U on the manifold M. Then
m−1
∗
X
∇m F = ∂ m F + ∂ j Γ ∗ ∇i F
i=0 j ≤m−1
in U .
Proof. The result follows from equation (1.4) by induction.
Although there are many possible connections on the tangent bundle T M, if M is equipped
with a Riemannian metric then there is one in particular that has more geometric significance.
Lemma 1.4. Given a Riemannian metric gij on M, there is a unique connection ∇ on T M that
satisfies
1. X(g(Y, Z)) = g(∇X Y, Z) + g(Y, ∇X Z) (∇ is compatible with g). This is equivalent to
∇g = 0 where ∇g is defined by Lemma 1.2.
2. The torsion tensor of ∇,
τ (X, Y ) := ∇X Y − ∇Y X − [X, Y ]
is identically 0.
This connection is known as the Levi-Civita connection of the metric g. Its Christoffel symbols
are given in local coordinates by
1 kl
Γkij = g (∂i gjl + ∂j gil − ∂l gij ). (1.5)
2
Using the Levi-Civita connection we can define the Laplacian:
Definition 1.13. The Laplacian3 is a family of operators
4 : C ∞ (Tlk M) → C ∞ (Tlk M)
4F := g ij ∇i ∇j F,
where ∇ is the Levi-Civita connection of the metric g. If F has the coordinate form given in
equation (1.1), we will write
4Fij11...i
...jl
k
for
(4F )ji11...i
...jl
k
.
If we are given a connection on T M (we will usually be using the Levi-Civita connection for
some metric gij ) we can define geodesics to be the paths that you could move along in the manifold
“without feeling any force”. That is, a path γ : [0, 1] → M is a geodesic if ∇γ̇(t) γ̇(t) = 0 at each
point. Given an initial point and velocity (or equivalently an element of T M), there is a unique
geodesic in M setting off from that point with the given initial velocity. If M is complete (in
particular if it is closed, as it will be for all of our applications), the geodesic will exist for all time.
2 The coordinate expression ∂ m F does not represent any coordinate-independent tensor field in the way that
defined here is sometimes called the rough Laplacian. We will only ever use the rough Laplacian in this project,
so we simply call it the “Laplacian”.
12
B A
Definition 1.14. Suppose M is complete and has no boundary. Given v ∈ Tp M, let γv : [0, 1] →
M be the unique geodesic in M that starts at p with initial velocity vector v. That is, γv (0) = p
and γp0 (0) = v. We define the exponential map exp : T M → M by exp(v) = γv (1). We denote
by expp the exponential map restricted to Tp M. The exponential map is smooth.
Because expp is locally a diffeomorphism at the origin of Tp M, it is possible to choose an open
neighbourhood U of p that is diffeomorphic via expp to an open set in Tp M = Rn , by the inverse
function theorem. We choose coordinates (xi ), i = 1, . . . , n on Tp M so that the basis vectors
{∂i : i = 1, . . . , n} are orthonormal with respect to the metric g at p. We then have a coordinate
chart (U, (expp )−1 ). These coordinates are called normal coordinates about p and have some
properties which make them very convenient for doing calculations:
Lemma 1.5. In normal coordinates about p, we have
1. gij = δij at p.
∇k Fij11...i
...jl
k
= ∂k Fij11...i
...jl
k
One can interpret the injectivity radius at p ∈ M as follows: imagine there is a flash of light
at p at some time. Light rays propagate in all directions – the injectivity radius is the smallest
distance one of these rays has to travel before it collides with another light ray. The injectivity
radius is a crucial concept in the study of the convergence of manifolds, as we shall see in Chapter
8. Where the injectivity radius is small (as at A in Figure 1.1), the manifold is close to “pinching
off” as described in Section 2.4.
13
1.3 The Lie Derivative
The covariant derivative gives one way of differentiating tensor fields. We will now look at a different
way, which can be defined purely from the manifold structure of M, without any reference to a
Riemannian metric or the extra structure of a connection.
Given a vector field X on a manifold M, we define a time-dependent family of diffeomorphisms
of M to itself, ϕt : M → M for t ∈ (−, ), such that ϕ0 is the identity and
d
ϕt = X
dt
at each point (the existence of such ϕt follows from basic existence theorems for differential equa-
tions – see [5, Sec. 6-2] for this argument and other details relating to the Lie derivative). This is
to be interpreted as a “flow”
of the manifold in the direction of the vector field X. We now define
the derivative of some kl -tensor field F in the direction of X as the change in F when we move
a small step in the direction of X – but we need to have some way of comparing the value of F at
the point a little step away with that at the original point. Rather than making this comparison
using a “connection” as before, we make it by pushing the value of F at the translated point back
to the original point using the diffeomorphism ϕt .
We define
(∗ϕt )Fp (X1 , . . . , Xk , ω 1 , . . . , ω l ) := Fϕt (p) (ϕt∗ (X1(p) ), . . . , ϕt∗ (Xk(p) ),
(ϕ−1 ∗ 1 −1 ∗ l
t ) (ω(p) ), . . . , (ϕt ) (ω(p) )).
Note that (∗ϕt )Fp ∈ Tlk Mp for all t. It is now possible to define the Lie derivative of F in the
direction X as
d
LX F = (∗ϕt )F
dt t=0
Lemma 1.6. The Lie derivative is well-defined. It obeys similar conditions to those satisfied by
the covariant derivative as outlined in Lemma 1.2, with one important exception:
1. For a scalar function f , LX f = X(f ).
2. If Y is a vector field then LX Y = [X, Y ].
3. LX (F ⊗ G) = (LX F ) ⊗ G + F ⊗ (LX G).
4. LX commutes with all traces:
LX (trY ) = tr(LX Y )
for all traces (over any indices) of the tensor Y .
Proof. See [5, Sec. 6-2].
Lemma 1.7. On a Riemannian manifold (M, g), we have
(LX g)ij = ∇i Xj + ∇j Xi ,
where ∇ denotes the Levi-Civita connection of the metric g, for any vector field X.
Proof. Let ω be the 1-form dual to the vector field X, ω(Y ) = hX, Y i. Using the product rule
(from Lemma 1.6) and the metric compatibility and torsion-free conditions on the Levi-Civita
connection we have
LX g(Y, Z) = X(g(Y, Z)) − g(LX Y, Z) − g(Y, LX Z)
= h∇X Y, Zi + hY, ∇X Zi − h[X, Y ], Zi − hY, [X, Z]i
= h∇X Y − [X, Y ], Zi + hY, ∇X Z − [X, Z]i
= h∇Y X, Zi + hY, ∇Z Xi
= Y hX, Zi − hX, ∇Y Zi + ZhY, Xi − h∇Z Y, Xi
= Y (ω(Z)) − ω(∇Y Z) + Z(ω(Y )) − ω(∇Z Y )
= (∇Y ω) (Z) + (∇Z ω) (Y )
14
which is the coordinate-free way of expressing the identity we wanted. Note that we used the
product rule again to get the last line.
1.4 Curvature
We start with a description of curvature for 2-manifolds. Given a Riemannian 2-manifold (M2 , g),
we would like to define some quantity that describes “how curved” the surface is at a certain point
p ∈ M2 . Consider two tangent vectors v1 , v2 ∈ Tp M2 , with |v1 | = |v2 | = 1 (with respect to g).
Let γi : [0, 1] → M2 , i = 1, 2 be two geodesics in M2 that start at p, with γ˙i (0) = vi . Let us
define f : R+ → R+ so that f (r) is the distance from γ1 (r) to γ2 (r) along the circle with centre
p and radius r (where the radius is measured with respect to the metric d on M2 induced by
the Riemannian metric g). If θ is the angle between v1 and v2 then it is clear that for small r,
f (r) ∼ θr, i.e. f 0 (0) = θ (see Figure 1.2).
1
(r)
r
f(r)
r
2
(r)
1
(r)
r
f(r)
r
2
(r)
Figure 1.2: Top: geodesics in a 2-manifold of positive curvature converge. Bottom: geodesics in a
2-manifold of negative curvature diverge.
Curvature deals with second-order effects. We define the Gaussian curvature, K, of the
15
surface at p so that
f (r) K 2
∼θ 1− r
r 6
for small r. See Figure 1.2 for pictures of manifolds with positive curvature (K > 0) and negative
curvature (K < 0). Euclidean space R2 is flat (K = 0).
The generalization to higher-dimensional manifolds of this concept of curvature is not at all
obvious, and was first achieved by Riemann. His idea was that the curvature, intuitively, is the
obstruction to the “flatness” of a manifold. In other words, non-zero curvature at p is what stops
us from choosing coordinates in which the metric is the Euclidean, flat metric gij = δij in a
neighbourhood of p. Normal coordinates about p are, in some sense, “as close as we can get” to a
Euclidean metric in a neighbourhood of p: we know from Lemma 1.5 that in normal coordinates
about p, gij (p) = δij (p) and ∂k gij (p) = 0. However, in general, the second- and higher-order
derivatives of the metric at p may be non-zero.
This motivates the definition of the 40 Riemann curvature tensor, which holds all of
the information about the second-order derivatives of g. It is defined to be the 40 -tensor with
where
It is definitely not obvious (but can easily be checked) that this is in fact a 31 -tensor (in
particular that, for example, Rm(X, Y, f Z, ω) = f Rm(X, Y, Z, ω) for any f ∈ C ∞ (M)). We
l
represent it in terms of local coordinates as Rijk . The 31 curvature tensor Rijk l
defined by
4
equation (1.7) is obtained from the 0 curvature tensor Rijkl defined by equation (1.6) by raising
the final index.
The coordinates of Rm can be explicitly calculated by applying equation (1.3) with the Levi-
Civita connection ∇:
Lemma 1.8. The Riemann curvature tensor has the explicit form
l
Rijk = ∂i Γljk − ∂j Γlik + Γpjk Γlip − Γpik Γljp .
Rm has many symmetries, which can be proven by manipulating the defining equation (1.7):
Lemma 1.9. (Symmetries of the curvature tensor) The Riemann curvature tensor has the
following properties:
1. Rijkl = Rklij = −Rjikl = −Rijlk
16
2. The first Bianchi identity:
Rijkl + Rjkil + Rkijl = 0
∧2 T ∗ M ⊗S ∧2 T ∗ M
Definition 1.16. The Ricci curvature tensor, Rc, is the 20 -tensor with coordinate expression
p
Rij := Rpij .
R := g ij Rij .
The Ricci and scalar curvatures can be interpreted in terms of sectional curvatures:
Lemma 1.11. Let X ∈ Tp M be a unit vector. Suppose that X is contained in some orthonormal
basis for Tp M. Rc(X, X) is then the sum of the sectional curvatures of planes spanned by X and
other elements of the basis.
Given an orthonormal basis for Tp M, the scalar curvature at p is the sum of all sectional
curvatures of planes spanned by pairs of basis elements.
Proof. See [18, end of Chap. 8].
The Ricci and scalar curvatures can also be expressed in terms of the curvature operator.
17
Lemma 1.12. On a 3-dimensional Riemannian manifold (M3 , g), let us diagonalize the curvature
operator R with respect to a basis {e2 ∧ e3 , e3 ∧ e1 , e1 ∧ e2 } of ∧2 T M3 , where {e1 , e2 , e3 } is an
orthonormal basis of T M3 (this is possible because R is symmetric). Suppose that, with respect to
this basis, R is a diagonal matrix with entries λ1 , λ2 , λ3 down the diagonal. Then with respect to
the basis {e1 , e2 , e3 }, the Ricci tensor takes the form
λ + λ3 0 0
1 2
Rc = 0 λ3 + λ1 0 (1.9)
2
0 0 λ1 + λ2
18
Proof. This result follows from the decomposition of Rm, which holds on any n-manifold:
R 1
Rm = (g g) + (E g) + W, (1.13)
2(n − 1)(n − 2) n−2
where W is the so-called “Weyl tensor”, which is defined by equation (1.13). The Weyl tensor
vanishes in dimension 3 (see [9, Sec. 8] for a simple proof using the symmetries of W ), from which
the result follows.
In dimension 3, Einstein metrics are very special.
Lemma 1.14. An Einstein metric on a manifold of dimension n ≥ 3 has constant scalar curvature.
If n = 3, the metric has constant sectional curvature.
Proof. Consider the second Bianchi identity (1.8). If we raise the indices k, l we obtain
∇p Rij kl + ∇i Rjp kl + ∇j Rpi kl = 0.
Taking the trace over the indices i, l and using the symmetries of the curvature tensor we obtain
∇p Rjk − ∇i Rjp ik − ∇j Rpk = 0.
Taking the trace over the indices j, k now gives us
∇p R = ∇i Rpi + ∇i Rpi ,
or equivalently
1
g ij ∇i Rjp = ∇p R.
2
Thus we have
1 1 1 1 1
g ij ∇i Ejp = g ij ∇i Rjp − Rgjp = ∇p R − ∇p R = − ∇p R (1.14)
n 2 n 2 n
(using ∇g = 0). If the metric is Einstein then Ejp = 0, so
1 1
− ∇p R = 0.
2 n
Because n 6= 2 this means ∇R = 0, hence R is constant.
If n = 3, we can apply formula (1.9) from Lemma 1.12 and deduce that
λ + λ3 0 0 C 0 0
1 2 R
Rc = 0 λ3 + λ1 0 = g = 0 C 0 , (1.15)
2 3
0 0 λ1 + λ2 0 0 C
where C = 31 R is a constant over the whole manifold. Therefore the curvature eigenvalues λi are
all equal to the constant value C, from which it follows that the manifold has constant sectional
curvature.
If a metric has constant sectional curvature, we call it a constant curvature metric. There
are three essentially distinct possibilities: the value of the sectional curvatures can be positive, zero
or negative. We have the following examples:
Definition 1.18. Constant Curvature Metrics
Euclidean n-space, En := Rn with the standard metric, has constant sectional curvature 0.
The n-dimensional sphere of radius R,
SnR = {x ∈ Rn+1 : |x| = R}
with the metric induced as a submanifold of En+1 , has constant sectional curvature 1/R2 .
The hyperbolic n-space of radius R, HnR , is the open ball of radius R in Rn with the metric
4R4 δij
gij (x) = .
(R2 − |x|2 )2
It has constant sectional curvature −1/R2 (there are other equivalent ways of representing this
Riemannian manifold).
19
See [18, end of Chap. 8] for more details, including a proof that these spaces have constant
curvature.
Using the theory of Jacobi fields (see [18, Chap. 10]), one can prove many comparison theorems,
as well as results about constant curvature metrics. For example one can prove that the local
structure of a metric with constant sectional curvature C is unique (see [18, Prop. 10.9]) – we will
see a more general result in Theorem 1.16. One can also prove the following, which will come in
handy in Chapter 7:
Theorem 1.15. The Bishop-Günther Volume Comparison Theorem. Let us denote by
Vnk (r) the volume of the ball of radius r in the complete, simply-connected n-dimensional space
of constant sectional curvature k (this will be either SnR , En or HnR ). Suppose that (Mn , g) is a
Riemannian n-manifold, and p ∈ Mn . Then
1. If there is a constant a > 0 such that Rc ≥ (n − 1)ag then
Vol(B(p, r)) ≤ Vna (r).
2. If there is a constant b such that all sectional curvatures of (Mn , g) are bounded above by b,
and the exponential map is injective on the ball B(p, r), then
Vol(B(p, r)) ≥ Vnb (r).
20
1.6 Scaling
We will be interested later on (when we deal with the normalized Ricci flow in Chapter 7) in how
various geometric quantities scale when the metric is scaled by a constant factor C.
4. R̃ijkl = CRijkl .
5. R̃ij = Rij .
6. R̃ = C −1 R.
∂
gij (t) = hij (t).
∂t
Then the various geometric quantities evolve according to the following equations:
1. Metric inverse:
∂ ij
g = −hij = −g ik g jl hkl . (1.16)
∂t
2. Christoffel symbols:
∂ k 1
Γij = g kl (∇i hjl + ∇j hil − ∇l hij ). (1.17)
∂t 2
3. Riemann curvature tensor:
∂ l 1 ∇i ∇j hkp + ∇i ∇k hjp − ∇i ∇p hjk
R = g lp . (1.18)
∂t ijk 2 −∇j ∇i hkp − ∇j ∇k hip + ∇j ∇p hik
4. Ricci tensor:
∂ 1
Rij = g pq (∇q ∇i hjp + ∇q ∇j hip − ∇q ∇p hij − ∇i ∇j hqp ). (1.19)
∂t 2
5. Scalar curvature:
∂
R = −4H + ∇p ∇q hpq − hpq Rpq (1.20)
∂t
where H = g pq hpq .
6. Volume element:
∂ H
dµ = dµ. (1.21)
∂t 2
21
7. Volume of manifold: Z Z
d H
dµ = dµ. (1.22)
dt M M 2
8. Total scalar curvature on a closed manifold M:
Z Z
d 1 ij
Rdµ = RH − h Rij dµ. (1.23)
dt M M 2
∂t (g ij gjk ) = 0
⇒ (∂t g )gjk + g ij (∂t gjk ) = 0
ij
⇒ ∂t g ij = −hij .
Once again we work in normal coordinates so that Γkij (p) = 0. This gives us
l
(p) = ∇i ∂t Γljk (p) − ∇j ∂t Γlik (p).
∂t Rijk
Once again, both sides are tensors, so the equation holds in any coordinates. Plugging the result
of formula (1.17) in on the rhs yields the result.
Proof of (1.19): This follows from formula (1.18) by taking the trace over the indices i, l.
Proof of (1.20): This follows from the formulae (1.19) and (1.16):
= ∂t g ij Rij
∂t R
= (∂t g ij )Rij + g ij (∂t Rij )
1 pq
= −hij Rij + g ij g (∇q ∇i hjp + ∇q ∇j hip − ∇q ∇p hij − ∇i ∇j hqp )
2
= −4H + ∇ ∇ hpq − hpq Rpq
p q
4 This is true because the difference between the Christoffel symbols of two connections is a tensor. Thus, by
22
(recall that ∇g = 0 and 4 = g ij ∇i ∇j ).
Proof of (1.21): We will use the formula
where we have observed that (A−1 )ij = δ ij /λi and we are now using the Einstein summation
convention. This formula manifestly does not depend on the basis we choose as traces are basis-
independent.
It now follows by the chain rule that
1
= p g ij hij det gij dx1 ∧ . . . ∧ dxn
2 det gij
H
= dµ
2
where H = g ij hij .
Proof of (1.22): This follows from formula (1.21) by taking the derivative under the integral
sign.
Proof of (1.23): This follows from formulae (1.21) and (1.20):
Z Z
∂
Rdµ = (∂t R)dµ + R(∂t dµ)
∂t M
ZM
1
= −4H + ∇p ∇q hpq − hpq Rpq + RH dµ
M 2
Z
1
= RH − hpq Rpq dµ.
M 2
Here we have used Stokes’ Theorem (see [14, Sec. 7.5]) to get rid of the first two terms in the
integral because they were expressible as the divergence of a vector field on M, and M has no
boundary.
23
Chapter 2
The main aim of this project is to introduce the basics of Hamilton’s Ricci flow program, which is
aimed at proving Thurston’s Geometrization Conjecture and consequently the Poincaré Conjecture.
In this chapter we will present the context for the Ricci flow: what is it, what are the problems
that it is intended to solve, and why might it be expected to solve them? In the process we will also
see some simple solutions to the Ricci flow and try to gain a bit of intuition about its behaviour. A
lot of space in this introduction is devoted to material that is not elaborated (or is elaborated very
little) in the main body of the thesis – namely the description of the Geometrization Conjecture
and the section on pinching. That is because the introduction is intended to serve as a “big picture”
guide to the reasons we study the Ricci flow, so that the reader understands, when encountering
the formidable technical details of Chapter 7, why it is all worthwhile.
24
#
Figure 2.1: Connect sum for 2-manifolds: cut a hole in each of M and N , then glue them together
along the boundary circles created to get the connected sum M #N .
metrics in any dimension – an example of the way that a knowledge of the geometric structure of
a manifold can lead to knowledge of the topological structure. Thurston proved that this decom-
position worked for a certain class of 3-manifolds (the so-called Haken manifolds), but not in full
generality. If his conjectured classification of 3-manifolds, known as “Thurston’s Geometrization
Conjecture”, were proven for all 3-manifolds, then the Poincaré Conjecture would follow (see [2,
Sec. 7.7]).
25
geometric structures, and that the decomposition by spheres and tori will somehow emerge nat-
urally. In choosing what should go on the right hand side of the equation of the Ricci flow, we
know that it should be a rank-2 tensor, symmetric (so that the metric g remains symmetric), and
it should involve the curvature somehow – the Ricci curvature tensor is the obvious choice. The
minus sign makes the Ricci flow a heat-type (parabolic) equation (as we shall see in Chapter 5),
so it is expected to “average out” the curvature. This should make the metric rounder in the way
that we want.
A characteristic property of heat-type equations is the maximum principle, which we will see
in Chapter 3. We will use the maximum principle in Chapter 7 to prove quantitatively that this
rounding of the metric does indeed happen in one specific case. The main theorem we will prove
is the one proved by Hamilton in the paper that introduced the Ricci flow:
Theorem 2.1. (Hamilton, 1982) Let M3 be a closed 3-manifold which admits a Riemannian
metric with strictly positive Ricci curvature. Then M3 also admits a metric of constant positive
curvature.
In particular, by Theorem 1.16, any simply-connected closed 3-manifold which admits a
metric of strictly positive Ricci curvature is diffeomorphic to the 3-sphere. We are certainly starting
to get into the territory of the Poincaré Conjecture with this result!
More specifically, we will see that if the initial metric g0 has strictly positive Ricci curvature
then the manifold M3 will shrink to a point in finite time under the Ricci flow. But if we dilate the
metric by a time-dependent factor so that the volume remains constant, the problem of shrinking
to a point is removed. Furthermore, we can show that the rescaled metric converges uniformly
to the desired metric of constant positive curvature on M3 . This process of “blowing up” the
manifold when it is becoming singular is a crucial one in the Ricci flow program. Because it is
rather complicated and can get lost in the technical details, we have provided an analogous but
much simpler argument for the “curve-shortening flow” in Chapter 4 in the hope that this will
make it easier to understand what is going on in later chapters.
We note that the Uniformization Theorem (any Riemannian metric on a closed 2-manifold is
conformal to one of constant curvature) can be proved using the Ricci flow as well (see [6, Chap.
5] for the bulk of the argument, and [4] for the remainder). Once again we encounter problems
with singularities: the manifold will shrink to a point in some cases and become arbitrarily large
in others. As before, the solution is to rescale so that the volume is constant. When we do
this, the rescaled metric converges smoothly to a smooth metric of constant curvature. Note
that the Uniformization Theorem is the closest thing to a 2-dimensional analogue of Thurston’s
Geometrization Conjecture, so the fact that the Ricci flow solves it in this way is another strong
indicator that we’re heading in the right direction.
In 2002 and 2003, Grisha Perelman posted three papers ([23, 25, 24]) on arXiv.org which claimed
to have completed Hamilton’s work towards using the Ricci flow to prove the Geometrization
Conjecture. More recently several authors have posted papers elaborating on the technical details
of Perelman’s papers ([2, 21]). The proof seems to have been accepted by the mathematical
community, wrapping up the 100-year history of the Poincaré Conjecture (and the younger but no
less important Geometrization Conjecture).
26
so the Ricci flow equation becomes an ode:
∂
∂t g = −2Rc
∂ 2
⇒ ∂t (r ḡ) = −2(n − 1)ḡ
d(r 2 )
⇒ dt = −2(n − 1).
We have the solution q
r(t) = R02 − 2(n − 1)t,
where R0 is the initial radius of the sphere. The manifold shrinks to a point as t → R02 /(2(n − 1)).
Similarly, for hyperbolic n-space Hn (where n > 1), the Ricci flow reduces to the ode
d(r2 )
= 2(n − 1)
dt
which has the solution q
r(t) = R02 + 2(n − 1)t.
So the solution expands out to infinity.
Of course the flat metric on En has zero Ricci curvature, so it does not evolve at all under the
Ricci flow. There are other non-trivial Riemannian manifolds with vanishing Ricci curvature (the
metric is flat, i.e. locally isometric to Euclidean space, if and only if the Riemann curvature tensor
vanishes). These metrics can be regarded as the “fixed points” of the Ricci flow. However, we
ought really to regard Sn and Hn as honorary fixed points of the flow – even though the metric
was changing under the flow, it only ever changed by a rescaling of the metric.
Even more generally, one can regard as “generalized fixed points” of the Ricci flow those
manifolds which change only by a diffeomorphism and a rescaling under the Ricci flow. Let
(Mn , g(t)) be a solution of the Ricci flow, and suppose that ϕt : Mn → Mn is a time-dependent
family of diffeomorphisms (with ϕ0 = id) and σ(t) is a time-dependent scale factor (with σ(0) = 1).
If we then have
g(t) = σ(t)ϕ∗t g(0)
then the solution (Mn , g(t)) is called a Ricci soliton. Taking the derivative of this equation and
evaluating at t = 0 yields
∂ dσ(t) ∗ ∂
g(t) = ϕt g(0) + σ(t) ϕ∗t g(0)
∂t dt ∂t
−2Rc(g(0)) = σ 0 (0)g(0) + LV g(0),
where V = dϕt /dt, by the definition of the Lie derivative given in Section 1.3. Let us set σ 0 (0) = 2λ.
We can now use the result of Lemma 1.7 to write this in coordinates as
−2Rij = 2λgij + ∇i Vj + ∇j Vi .
As a special case we can consider the case that V is the gradient vector field of some scalar
function f on Mn , i.e. Vi = ∇i f . The equation then becomes
Rij + λgij + ∇i ∇j f = 0. (2.1)
Such solutions are known as gradient Ricci solitons. A gradient Ricci soliton is called shrinking
if λ < 0, static if λ = 0, and expanding if λ > 0. The gradient Ricci solitons play a role in
motivating the definition of Perelman’s F- and W-functionals which we will see in Section 8.4.
One example of a gradient Ricci soliton is Hamilton’s cigar soliton: Let Σ2 = R2 with the
standard coordinates and
δij
gij = .
1 + x2 + y 2
Because the metric is rotationally symmetric, the calculation of the Ricci curvature tensor and
the covariant derivative is particularly simple (by the rotational symmetry we can embed Σ2 in
R3 as a manifold of revolution – see [18, Ex. 8.2] for the calculation of the Gaussian curvature of
manifolds of revolution). We can show that
Rij + ∇i ∇j f = 0
where f (x, y) = (x2 + y 2 )/2, hence (Σ2 , g) is a static gradient Ricci soliton.
27
S1 x I
Figure 2.3: A neck “pinching off” in a 2-manifold. This diagram is intended to illustrate by lower-
dimensional analogy what a neckpinch in a 3-manifold is like – the Ricci flow on 2-manifolds does
not give rise to neckpinches.
2.4 Pinching
We now describe how the connected sum decomposition arises out of the Ricci flow. Firstly we
note that, if we have a product Riemannian manifold with a product Riemannian metric, then
under the Ricci flow each factor in the product evolves independently.
Now consider a “neck” shaped like S1 × I (I is an interval) between two parts of a 2-manifold,
as shown in Figure 2.2. We expect the metric on the neck to be close to a product metric; the I
factor has no curvature so it will not change, and the S1 factor also has no intrinsic curvature so it
will not change. In contrast, when we have an analogous neck shaped like S2 × I in a 3-manifold,
we have seen in Section 2.3 that the 2-sphere will shrink to a point in finite time. Therefore we
expect that in some situations, the manifold will “pinch off” at such a neck (see Figure 2.3 for
the 2-dimensional analogue of such a neckpinch). Thus the Ricci flow can actually perform the
connected sum decomposition for us! The details of how this pinching off actually happens (how
one ought to perform “surgery” on one’s manifold) are very tricky.
We will discuss the pinching process in more detail in Chapter 8.
The torus decomposition arises in a different way. After we have have performed the connected-
sum decomposition by surgery on the neckpinches, we consider the evolution of the remaining
pieces. Some will be quotients of S3 (which shrink to a point) or products S2 × S1 (which shrink
down to the S1 factor). Both of these cases satisfy the Geometrization Conjecture, so it is the
remaining pieces that concern us. They will exist without singularities as t → ∞.
We expect these pieces to be made up of pieces with the fundamental geometries of Thurston,
glued together along torus boundaries (if the Geometrization Conjecture is true). Some pieces
will be hyperbolic, and we have seen in Section 2.3 that hyperbolic metrics will tend to expand.
The metric on the torus necks, on the other hand, will be close to a product metric on T 2 × I.
Recall that under the Ricci flow on a product metric, each factor will evolve independently. Both
28
factors have zero curvature, so we expect the neck to remain static, with the hyperbolic pieces of
the manifold expanding around it. If we rescale the flow so that the volume remains constant, the
torus neck will become very thin, in the same way that the S2 × I neck becomes very thin before
a neckpinch. This is how the torus decomposition arises.
Under the rescaling, the manifold splits along tori into hyperbolic pieces with cusps and “col-
lapsing” pieces (with injectivity radius shrinking to 0). The collapsing pieces can be identified as
so-called “graph manifolds”, which are known to satisfy the Geometrization Conjecture. For a
more detailed description of how this decomposition arises, see [1]. For the full story on the proof
of the Geometrization Conjecture see [2].
29
Chapter 3
The maximum principle is the key tool in understanding many parabolic partial differential
equations. It appears in many guises, but it always essentially expresses the fact that parabolic
or “heat-type” pdes will “average out” the values of whatever quantity is evolving. It is cru-
cial to understanding the Ricci flow (which will be shown in Chapter 5 to be parabolic modulo
reparametrization), where it can be used to put bounds on the curvature of the metric. In some
situations we will need more refined estimates than can be obtained by applying the maximum
principle to scalar quantities related to curvature, so we must apply the maximum principle to
tensor quantities like the curvature operator. The question of what it means for a tensor quantity
to “average out” naturally arises.
In this chapter we first motivate the idea of the maximum principle using the example of the
heat equation on the real line. We generalize this situation to a scalar function obeying a heat-type
equation on an arbitrary Riemannian manifold (rather than just R), then finally generalize to an
arbitrary section of a vector bundle over a manifold (rather than just the trivial R-bundle whose
sections are the scalar functions).
∂u ∂2u
= . (3.1)
∂t ∂x2
The maximum principle says that the temperature of the hottest point on the bar is a non-increasing
function of time, and the temperature of the coldest point on the bar is a non-decreasing function
of time.
This can be seen intuitively from the fundamental or Green’s function solution of the heat
equation, which is the temperature distribution due to a delta-function distribution at time t = 0.
That is, it solves the pde
∂u ∂2u
=
∂t ∂x2
u(x, 0) = δ(x).
The solution is 2
exp(− x )
u(x, t) = √ 4t .
4πt
It can be seen in Figure 3.1 that at each time the temperature distribution is Gaussian, and
the distribution becomes flatter and wider as time evolves. This is what we expect of heat flow:
the heat spreads out as time evolves, any relatively hot region becomes cooler, and the maximum
temperature on the bar drops over time. Any solution of the heat equation can be expressed as
30
1
0.8
0.6
0.4
0.2
-4 -2 2 4
Figure 3.1: The evolution of the temperature distribution due to a pointlike source at time t = 0.
The solution becomes flatter and wider as time increases.
a weighted integral of such fundamental solutions, by the method of Green’s functions. Thus it
is plausible that the “averaging out” behaviour of the fundamental solution is characteristic of all
solutions of the heat equation.
One can see why the maximum principle is true in a different way. At a point of maximum
temperature (at a given time), the temperature distribution u must have a local maximum. Thus
∂ 2 u/∂x2 ≤ 0. It follows by the heat equation (3.1) that, at the hottest point of the bar,
∂u ∂2u
= ≤ 0,
∂t ∂x2
so the temperature is nonincreasing. Therefore one would expect the maximum temperature on
the bar to drop over time.
31
it satisfies the differential inequality
∂u
≤ 4g(t) u + hX(t), ∇ui
∂t
at all points (x, t) ∈ M × [0, T ) where u(x, t) > 0. Then u(x, t) ≤ 0 for all x ∈ M and t ∈ [0, T ).
Note the rather strange condition that the inequality is satisfied only at points and times
where something has “gone wrong” (in the sense that u(x, t) has become positive). The condition
is designed so that we can apply this lemma in the proof of the more general scalar maximum
principle (Theorem 3.2).
Proof. The proof will be based on the idea outlined at the end of Section 3.1.
Given > 0, let us set v = u − (1 + t). Note that v (x, 0) ≤ − < 0. We will show that for
any > 0, v < 0 on M × [0, T ). From the fact that > 0 is arbitrary it will follow that u is
non-positive.
First we compute the evolution equation (or evolution inequality) of v , which follows from that
of u:
∂
v ≤ 4u + hX, ∇ui −
∂t
= 4v + hX, ∇v i − (3.3)
at any point x and time t where u(x, t) > 0. In particular, because v < u, the inequality holds at
any (x, t) such that v (x, t) = 0. Note that 4v = 4u and ∇v = ∇u because −(1 + t) has no
spatial dependence.
Now suppose, for a contradiction, that v (x, t) ≥ 0 for some (x, t) ∈ M×[0, T ). By compactness,
there exists a first time t0 ∈ [0, T ) and point x0 ∈ M at which v hits 0. That is, v (x0 , t0 ) = 0
but v (x, t) < 0 for any t < t0 . It follows that
∂v
(x0 , t0 ) ≥ 0.
∂t
Furthermore, x0 must be a local maximum of v (·, t0 ), so
∇v (x0 , t0 ) = 0
4v (x0 , t0 ) ≤ 0.
We can plug these inequalities into equation (3.3), which holds at (x0 , t0 ) because v (x0 , t0 ) = 0.
We obtain:
∂v
0≤ (x0 , t0 ) ≤ 4v (x0 , t0 ) + hX(t0 ), ∇v (x0 , t0 )i − ≤ − < 0,
∂t
which is a contradiction.
Therefore, v (x, t) < 0 for all (x, t) ∈ M × [0, T ) and for all > 0. Hence u(x, t) ≤ 0 for all
(x, t) ∈ M × [0, T ), as required.
We can now prove the main result of this section, which shows how to deal with the reaction
term F (u) in equation (3.2).
Theorem 3.2. The Scalar Maximum Principle. Let (M, g(t)) be a closed manifold with a
time-dependent Riemannian metric g(t). Suppose that u : M × [0, T ) → R satisfies
∂u
≤ 4g(t) u + hX(t), ∇ui + F (u)
∂t
u(x, 0) ≤ C for all x ∈ M,
for some constant C, where X(t) is a time-dependent vector field on M and F : R → R is locally
Lipschitz. Suppose that φ : R → R is the solution of the associated ode, which is formed by
neglecting the Laplacian and gradient terms:
dφ
= F (φ)
dt
φ(0) = C.
32
Then
u(x, t) ≤ φ(t)
for all x ∈ M and t ∈ [0, T ) such that φ(t) exists.
The theorem essentially tells us that our upper bound grows no faster than we would expect
from the reaction term F (u).
Proof. Let us set v = u − φ. We know that v(x, 0) ≤ 0 for all x ∈ M, and we desire to show that
v(x, t) ≤ 0 for all (x, t) ∈ M × [0, T ). To do this, we fix an arbitrary τ ∈ [0, T ) and show that
v ≤ 0 on [0, τ ], for any τ ∈ [0, T ). First note that
∂v ∂(u − φ)
=
∂t ∂t
≤ 4u + hX, ∇ui + F (u) − F (φ)
= 4v + hX, ∇vi + (F (u) − F (φ)). (3.4)
Note that ∇u = ∇v and 4u = 4v because φ depends only on t. We now want to deal with the
last term on the rhs.
Because M×[0, τ ] is compact, there exists a constant C (dependent on τ ) such that |u(x, t)| ≤ C
and |φ(t)| ≤ C on M × [0, τ ]. Because F is locally Lipschitz and the interval [−C, C] is compact,
there exists C1 (also dependent on τ ) such that |F (x) − F (y)| ≤ C1 |x − y| for all x, y ∈ [−C, C].
Therefore, because u, φ ∈ [−C, C], |F (u) − F (φ)| ≤ C1 |u − φ| = C1 |v| on M × [0, τ ]. Plugging this
into the evolution equation (3.4) for v, we obtain
∂v
≤ 4v + hX, ∇vi + C1 |v|.
∂t
Now let w = e−C1 t v. Then we have
∂w
≤ e−C1 t (4v + hX, ∇vi + C1 |v| − C1 v)
∂t
= 4w + hX, ∇wi + C1 (|w| − w). (3.5)
We are going to apply Lemma 3.1 to the function w. Because v(x, 0) ≤ 0 for all x ∈ M, we have
w(x, 0) ≤ 0 for all x ∈ M. Furthermore, if w ≥ 0 then |w| = w, so the differential inequality (3.5)
gives us
∂w
≤ 4w + hX, ∇wi
∂t
at any point (x, t) ∈ M × [0, τ ] such that w(x, t) ≥ 0. Hence, by Lemma 3.1, w(x, t) ≤ 0 for all
(x, t) ∈ M × [0, τ ].
It follows that v(x, t) ≤ 0, and hence u(x, t) ≤ φ(t) for all (x, t) ∈ M × [0, τ ], for any τ ∈ [0, T ).
Therefore u(x, t) ≤ φ(t) for any (x, t) ∈ M × [0, T ).
33
Figure 3.2: A 2-dimensional vector bundle over a 1-manifold. The set K (shaded) is not invariant
under parallel translation. The initial section (in black) is contained in K, but averages out to to
a section (shown in grey) that is not contained in K.
where ϕ ∈ C ∞ (E), ξ ∈ C ∞ (T ∗ M), and ∇ denotes the Levi-Civita connection of the metric g(t).
We can now define the Laplacian: if we use the index a for sections of E and indices i, j for sections
of T ∗ M (so that an element H of E ⊗ T ∗ M might have coordinates Hia with respect to some
basis) then
ˆ i∇
(4̂ϕ)a := g ij ∇ ¯ j ϕa
Theorem 3.3. Let π : E → M be a vector bundle over M with a fixed bundle metric h, ∇(t) ¯ a
smooth time-dependent family of connections on E compatible with h, and g(t) a time-dependent
Riemannian metric on M. Let K be a subset of E that is closed and convex in each fibre, and
invariant under parallel translation. Let F : E × [0, T ) → E be a continuous map that is fibre-
preserving, and Lipschitz in each fibre. Let α(t) be a time-dependent section of E that satisfies the
34
conditions
∂
α = 4̂α + F (α)
∂t
α(0) ∈ K.
da
= F (a)
dt
a(0) ∈ Kx
35
Chapter 4
Curve-Shortening Flow
This chapter is based on the lectures on Geometric Evolution Equations given by Ben Andrews at
the ICE-EM Graduate School in July 2006.
When Hamilton introduced the Ricci flow and used it to prove Theorem 2.1, he introduced
quite a general method of dealing with geometric evolution equations. This method has since
been applied to other flows, such as the mean curvature flow (mcf) and the curve-shortening flow
(csf) (see [30] for a description of both). Although these settings for Hamilton’s method of proof
emerged after Hamilton’s original Ricci flow paper, the csf in particular is much simpler than the
Ricci flow. Many of the concepts central to the theory of the Ricci flow presented in Chapters 5,
6, 7 and 8 find simpler analogies in the theory of the csf.
In this chapter we will outline the basic theory of the csf, with the aim of providing a
“blueprint” for subsequent development of the theory of the Ricci flow. We will not prove all
of the results that we state – those proofs and those results that assist in building intuition for the
methods of proof employed in later chapters are emphasised.
∂X
(u, t) = −κN(u, t) (4.1)
∂t
X(u, 0) = X0 (u).
Here, κ is the curvature of our curve, and N is the unit normal vector to it, defined by
∂2X
−κN =
∂s2
where s is the arclength parameter (see Figure 4.1). The book [15, Chap. 1] is a good reference
for the theory of curves in space.
One might wonder where this definition comes from. The idea is to look for the way to move a
curve in the plane that gives the greatest decrease in total length for the smallest total “movement”
– we say that the csf is the “steepest descent flow for length”.
Lemma 4.1. Denote the length of the curve u 7→ X(u, t) at time t by
Z 1
L(t) = |Xu |du.
0
∂X
(u, t) = V (u, t),
∂t
36
N
-κN
Figure 4.1: The curve-shortening flow moves a curve in the direction of its curvature vector.
where s is the arclength along X0 and L = L(0). Then the rate at which the length decreases,
−dL/dt, is maximised for the csf, i.e.
V ∝ −κN.
37
Hence
Z 1
dL(t)
= ∂t |Xu |du
dt 0
Z L
ds
= hXu , ∂s V i
0 |Xu |
Z L
= h∂s X, ∂s V ids
0
Z L
L
= [h∂s X, V i]0 − h∂ss X, V ids
0
Z L
= hκN, V i
0
! 21 ! 21
Z L Z L
2 2
≥ − κ ds |V | ds .
0 0
We have integrated by parts, and the boundary terms vanished by periodicity. The last line follows
from the Cauchy-Schwarz inequality, so equality holds if and only if V ∝ −κN. Thus the csf is
exactly the variation that gives the fastest decrease in length.
Example: The shrinking circle. If our initial curve X0 is a circle of radius r0 centred at the
origin, then the solution will take the form X(u, t) = r(t)(cos(u), sin(u)). We have N = X/r, κ =
1/r, so the csf equation becomes
dr 1
=−
dt r
which has the solution
1
X(u, t) = (r02 − 2t) 2 (cos(u), sin(u)). (4.2)
p
2
√
So the circle shrinks to a point at a finite time t = r0 /2 = r0 / 2.
yu2
∂ x 1 −xu yu xuu
=
∂t y (x2u + yu2 )2 −xu yu x2u yuu
For the system to be strongly parabolic, the matrix on the rhs ought to be positive-definite. It
clearly isn’t (it is even singular), so we seem to be in trouble.
Although it appears that our system is not parabolic, all that is going on is that the parabolic
nature of the system is being hidden from us by a poor parametrization of our solution. Suppose,
for the moment, that we have a solution of the csf given by X : R/Z × [0, T ) → R2 . Consider a
time-dependent family of diffeomorphisms from the circle to itself:
ϕ : R/Z × [0, T ) → R/Z.
Then we can define X̃(u, t) = X(ϕ(u, t), t). X̃ is just a time-dependent reparametrization of
X. In particular, it won’t change how the curve “looks”, i.e. its image in R2 .
The evolution equation satisfied by X̃ is then
∂ ∂ ∂
X̃(u, t) = X(ϕ(u, t), t) + Xu ϕ(u, t)
∂t ∂t ∂t
= −κN + Xu V
1 yu [yu −xu ] xuu xu
= + V
(x2u + yu2 )2 −xu yuu yu
38
where we define
∂
ϕ(u, t) = V (ϕ(u, t), t). (4.3)
∂t
We are free to choose how to reparametrize our curve – that is, we may choose V and then
solve the differential equation (4.3) for ϕ(u, t). We choose V so that our system of csf with
time-dependent reparametrization is strongly parabolic:
1 [xu yu ] xuu
V = 2 . (4.4)
(xu + yu2 )2 yuu
We can now express xu , xuu , yu , yuu in terms of x̃u , x̃uu , ỹu , ỹuu using the chain rule. This gives
us a second-order differential equation for (x̃, ỹ), which is strongly parabolic because the matrix
multiplying (x̃uu , ỹuu ) is positive definite. By Theorem A.1 there exists a solution X̃ = (x̃, ỹ) on
some time interval [0, T ), and the solution is unique for as long as it exists. Using this solution it is
possible to reconstruct ϕ(u, t) from (4.3) – because we know x̃, ỹ this is a simple ode. Once we know
the reparametrization used, we can find X(u, t) = X̃(ϕ−1 (u, t), t) and show that it is a solution of
the csf. Therefore, although the csf is not a strongly parabolic system, by reparametrizing it we
can show that it has a solution on some time interval [0, T ), and it is not difficult to further prove
that the solution is unique for as long as it exists. Because of this uniqueness we may without
ambiguity choose
In this situation, we call [0, T ) a maximal time interval – it is the largest time interval on which
the solution exists (note that T could be ∞).
Theorem 4.2. For any smooth immersion X0 : R/Z → R2 , there is a unique smooth solution
X : R/Z × [0, T ) → R2
39
4.4 Curvature Explodes
Theorem 4.5. Suppose we have a solution of the csf on a maximal time interval [0, T ), where
T < ∞ by Theorem 4.4. Then
We will give an outline of the proof. The basic idea is to assume the curvature were bounded
as t → T , and show that the maps Xt (·) = X(·, t) must then converge so nicely as t → T that it is
possible to extend the solution past T , contradicting the assumption that [0, T ) was the maximal
time interval on which the solution existed.
By differentiating equation (4.1) one can obtain an evolution equation for the curvature:
∂κ ∂2κ
= + κ3 .
∂t ∂s2
This is a heat-type equation. From it we can derive evolution equations for the derivatives of
the curvature ∂ n κ/∂sn , which will also be heat-type equations. Assuming an upper bound on κ,
one can use the maximum principle (in the form of Theorem 3.2) to prove upper bounds on the
derivatives of κ. These bounds are analogous to the bounds obtained in Chapter 6 for the Ricci
flow.
From the bounds on the derivatives of the curvature, it is possible to prove bounds on the
derivatives of X. Once can show that all of the quantities
k l
∂ ∂
∂uk ∂tl X
are bounded above. This means that all derivatives of X are equicontinuous, so by the Ascoli-Arzela
theorem, for any k, l > 0 there is a sequence of times (ti ) such that the maps
∂k ∂l
X(·, ti )
∂uk ∂tl
converge uniformly. Because the time-derivatives are also bounded, this means that the maps
∂k ∂ l
X(·, t)
∂uk ∂tl
converge uniformly. Therefore the map X(·, t) and all its derivatives converge uniformly to some
smooth map X(·, T ). We can then use Theorem 4.2 to find a solution X̄ to the csf with the initial
curve X̄0 (·) = X(·, T ), defined on some time interval [0, ). The solution X̄ can now be “glued onto
the end of X” to extend the solution X past T . Defining X(·, t) = X̄(·, t − T ) for t ∈ [T, T + )
(and leaving X unchanged for t ∈ [0, T )), we obtain a smooth solution1 X to the csf with the
given initial data, which is defined on the time interval [0, T + ). This contradicts the maximality
of T . Hence the original assumption that the curvature was bounded was incorrect.
to X(·, T ) as t → T . The time-derivatives can be expressed in terms of the space derivatives using the equation of
the csf, so they too are smooth.
40
Theorem 4.6. (Grayson) Given any embedded circle X0 , the solution to the csf X(u, t) with
X(u, 0) = X0 (u) will remain embedded and will shrink to a point x as t approaches the maximal
time of existence T . Furthermore, if A(t) denotes the area enclosed by the curve X(·, t) at time t,
then the curve r
π
X̃(u, t) = (X(u, t) − x)
A(t)
converges exponentially to a unit circle as t → T .
Proof. Once again, we will not give a complete proof but will outline one of the main concepts,
because it is analogous to the blowup technique that we will introduce for the Ricci flow in Chapter
8. See [30, Chap. 1 and Sec. 5.2] for the full proof of this theorem.
If the initial curve X0 is convex, one can prove using derivative estimates that the curvature
converges uniformly to 1, and all derivatives of curvature are bounded. Once again the Ascoli-
Arzela theorem can be employed to show that a limit exists, and that the limit is a smooth
embedded curve with curvature 1, hence the unit circle (see [30, Chap. 1] for the details). This
result is known as the Gage-Hamilton Theorem (a corresponding result for the mean curvature
flow was proven by Gerhard Huisken).
Grayson proved that any initial curve will eventually become convex under the csf, from which
the result follows by the Gage-Hamilton Theorem. We will outline the proof presented in [30, Sec.
5.2]. We use the result of Theorem 4.5, which says that the curvature explodes as we approach
some singular time T < ∞. This means we may choose points pn ∈ R/Z and times tn that converge
to T such that
|κ(pn , tn )| = sup |κ(p, t)|,
p∈R/Z,t≤tn
and define Mn := |κ(pn , tn )|, so that limn→∞ Mn = ∞. We then define rescaled flows Xn by
t
Xn (p, t) := Mn X pn + p, tn + 2 − X(pn , tn ) .
Mn
It is not hard to see that these rescaled flows are solutions of the csf defined for t ∈ [−Mn2 tn , Mn2 (T −
tn )), with Xn (0, 0) = 0 and |κn (p, t)| ≤ |κn (0, 0)| = 1 for t ∈ [−Mn tn , 0]. This technique is known
as “blowing up” at a singularity; we rescale the flow so that the curvature no longer diverges.
We saw in the discussion of Theorem 4.5 that it is possible to deduce bounds on derivatives of
the curvature from a bound on the curvature, using the maximum principle. Moreover, one can
get bounds on the derivatives of the maps Xn : R × [−Mn tn , 0] (considering Xn as a periodic map
on the real line) from the bounds on derivatives of the curvature. Hence, by the Ascoli-Arzela
theorem there is a subsequence of Xn that converges uniformly in all C k norms on all compact
subsets of R × (−∞, 0] to a limit solution X∞ : R × (−∞, 0] → R2 . This limit solution satisfies
X∞ (0, 0) = 0 and κ∞ (p, t) = κ∞ (0, 0) = 1 for all t ≤ 0.
After considerable work, one can show that this limit of rescaled solutions is convex with
compact image for all t ∈ (−∞, 0], from which it follows that some Xn is eventually convex and
hence the csf X(·, t) is eventually convex for large enough t, as required.
41
Chapter 5
Before we can do anything with the Ricci flow, we must show that a solution exists for a short
time. We would like to apply the short-time existence and uniqueness theorem for parabolic pdes
(Theorem A.1) to the system
∂
g(t) = −2Rc(g(t))
∂t
g(0) = g0 .
We need to check if the system is strongly parabolic. This chapter is based on [6, Chap. 3] and
[28, Chap. 5].
Corollary 5.2. The principal symbol of the differential operator −2Rc (as a function of the metric
g) is:
σ̂[−2Rc](ϕ)(h)ij = g pq (ϕp ϕq hij + ϕi ϕj hpq − ϕq ϕi hjp − ϕq ϕj hip ).
Now we recall that the Ricci flow is strongly parabolic if there exists δ > 0 such that for all
covectors ϕ 6= 0 and all (symmetric) hij 6= 0,
42
or, by Corollary 5.2,
g pq (ϕp ϕq hij + ϕi ϕj hpq − ϕq ϕi hjp − ϕq ϕj hip )hij > δϕk ϕk hrs hrs .
However, if we choose hij = ϕi ϕj , it is clear that the LHS of this equation is 0, so the inequality
can not hold. Therefore the Ricci flow is not strongly parabolic.
where
1
Vi = g pq ( ∇i hpq − ∇q hpi ).
2
Proof. From Lemma 5.1 we have
Lower-order terms in h make no contribution to the principal symbol. Thus, as far as the principal
symbol is concerned, covariant derivatives commute. We can therefore rearrange equation (5.2) by
commuting covariant derivatives to give
1 1
D[−2Rc](h)ij = g pq ∇p ∇q hij + ∇i ( ∇j hpq − ∇q hjp ) + ∇j ( ∇i hpq − ∇q hip )
2 2
+ lower-order terms in h
= g pq ∇p ∇q hij + ∇i Vj + ∇j Vi + lower-order terms in h
43
is a solution to the Ricci flow. We can compute
∂ ∂ (ϕ∗t ḡt )
gt =
∂t ∂t
!
∂ ϕ∗t+s ḡt+s
=
∂s
s=0
!
∂ ϕ∗t+s ḡt
∗ ∂ḡt+s
= ϕt +
∂s s=0 ∂s
s=0
= ϕ∗t P (ḡt ) + ϕ∗t L ∂ϕt ḡt .
∂t
Here we have used the product rule and then the rule for the Lie derivative on tensors (see Section
1.3). We choose ϕt to satisfy
∂
ϕt = W (t)
∂t
ϕ0 = id
for some time-dependent vector field W (t) (this system will have a solution ϕt for as long as W (t)
exists – see [6, Lem. 3.15]). The problem now reduces to finding a differential operator P such
that the system (5.3) is strongly parabolic, and a time-dependent vector field W (t) such that, if
we define a 1-parameter family of diffeomorphisms that satisfies (5.4), then
Note that the choice of W (t) corresponds to the choice of V in Section 4.2.
Now by Lemma 1.7 we have
Recall that the first term in equation (5.4) is a good term: if the other second-order terms cancelled
then the linearization would satisfy the condition for parabolicity. So our aim is to choose W in
such a way that the second-order part of the term (5.5) cancels the second-order part of the terms
in V . This will happen if the principal part of the linearization of Wi is equal to that of Vi 1 .
We defined V by
1
Vi = g pq ( ∇i hpq − ∇p hqi )
2
1
= − g pq (∇p hqi + ∇q hpi − ∇i hpq ).
2
We recall from formula (1.17) that
∂ 1
D[Γkij ](h) = Γkij (g(t)) = g kl (∇i hjl + ∇j hil − ∇l hij ),
∂t 2
∂
where hij = ∂t gij . This looks pretty similar to the form of V , so we might try
Wi = −g pq gij Γjpq .
1 Note that, as far as the principal symbol is concerned, a covariant derivative ∇ is the same as a coordinate
i
derivative ∂i by formula (1.3). Therefore, if the principal symbol of the linearization of Wi is equal to that of Vi
then the principal symbol of the linearization of ∇j Wi is equal to that of ∇j Vi .
44
We need to be careful though – this expression depends on the coordinates we choose, because the
Christoffel symbol Γ is not a tensor. However, the difference between two connections is a tensor,
so we can fix a constant connection with Christoffel symbols Γ̃kij and define
Wi = −g pq gij Γjpq − Γ̃jpq ,
which is the coordinate form of some coordinate-independent vector field. The fixed background
connection, because it is independent of the metric, will make no contribution to the symbol of W ,
so the principal symbol of Wi will be equal to that of Vi and all of the second-order terms other
than the first in equation (5.4) will cancel.
Thus, making this choice of W , the principal symbol of the linearization of P is just
so we have
hσ̂(D[P ])(ϕ)(h), hi = |ϕ|2 |h|2 .
Thus the Ricci-DeTurck flow defined by
∂
ḡij = P (ḡ) = −2R̄ij + ∇i Wj + ∇j Wi
∂t
is strongly parabolic, and therefore has a solution for a short time by Theorem A.1.
For as long as this solution exists, the vector field W (t) exists, and the time-dependent dif-
feomorphisms ϕt can be obtained by solving the ode (5.4) with initial condition ϕ0 =id. Once
we know the ϕt exist, the above calculations show that the pullback metrics gt = ϕ∗t ḡt satisfy the
Ricci flow equation. Thus, the Ricci flow has a solution for a short time. In fact the solution is
also unique (see [6, Sec. 4.4]), so we have the following result:
Theorem 5.4. Given a smooth Riemannian metric g0 on a closed manifold M, there exists a
maximal time interval [0, T ) such that a solution g(t) of the Ricci flow, with g(0) = g0 , exists and
is smooth on [0, T ), and this solution is unique.
45
Chapter 6
In this chapter we will use the maximum principle in some creative ways to obtain bounds on
derivatives of the curvature and metric evolving under the Ricci flow. We will then use our
estimates to show that the curvature must explode1 as we approach a finite-time singularity in the
Ricci flow, in analogy with Theorem 4.5 for the curve-shortening flow. This result is a key part of
the proof of Theorem 2.1 that we will present in Chapter 7. The derivative estimates themselves
are also vital to the proof of the compactness result presented in Chapter 8. Our exposition is
based on [6, Chap. 6,7] (with the exception of Corollary 6.13 which is based on results in [28] and
Section 6.2 which is based on [9, Sec. 13]), with significant reformulation of many of the arguments.
∂
gij = −2Rij .
∂t
Then the various geometric quantities evolve according to the following equations:
1. Metric inverse:
∂ ij
g = 2Rij .
∂t
2. Christoffel symbols:
∂ k
Γ = −g kl (∇i Rjl + ∇j Ril − ∇l Rij ).
∂t ij
3. Riemann curvature tensor:
∂
Rijkl = 4Rijkl + 2(Bijkl − Bijlk + Bikjl − Biljk )
∂t
−(Rip Rpjkl + Rjp Ripkl + Rkp Rijpl + Rlp Rijkp ),
1 Some books on the Ricci flow use the words “blow up” here – unfortunately this expressive nomenclature is
already employed to describe our method of examining singularities as in Chapter 8, so we substitute “explode”.
All it means is that the curvature goes to +∞, as detailed in Theorem 6.7.
46
where
q p
Bijkl ≡ −Rpij Rqlk .
See Definition 1.13 for the definition of the Laplacian 4.
4. Ricci tensor:
∂
Rij = 4Rij + 2Rpijq Rpq − 2Rip Rpj .
∂t
5. Scalar curvature:
∂
R = 4R + 2|Rc|2 .
∂t
6. Einstein tensor on a 3-manifold:
∂ 26
|E|2 = 4|E|2 − 2|∇E|2 − 8Rij Rjk Rki + R|Rc|2 − 2R3 .
∂t 3
7. Volume element:
∂
dµ = −Rdµ.
∂t
8. Volume of manifold: Z Z
∂
dµ = − Rdµ.
∂t M M
Proof. Most of these follow easily from Lemma 1.20, but some require a bit of extra fiddling to put
them in a nice form. The proof of the equation for the Riemann curvature tensor in particular is
rather lengthy and is contained (along with those for Rc and R) in [6, Sec. 6.1]. The main technique
is to use the identity (B.3) to commute covariant derivatives, then use the Bianchi identities.
The proof of the evolution equation for |E|2 in dimension 3 can be found in [6, Cor. 6.39]. It
can be proven using the formula
1
|E|2 = |Rc|2 − R2 ,
3
the evolution equations for Rc and R given above, and the result of Lemma 1.13 to express the
Riemann curvature tensor in terms of the Ricci tensor. We will not use this result in the current
chapter, but it comes in handy in Chapter 7.
47
Proof. We use gt to denote the metric at time t.
∂ ∂ ∂gt
gt (A, A) = 2gt A, A + (A, A)
∂t ∂t ∂t
= 2gt (4A + F, A) + Rc ∗ A∗2
= 4|A|2 − 2|∇A|2 + F ∗ A + Rc ∗ A∗2
where the final term in the second line comes from the derivative of the metric (which is −2Rc for
the Ricci flow) and we have used the identity
4|A|2 = 2h4A, Ai + 2|∇A|2 .
It follows that
∂t ∇A = ∂t ∂A + ∂t f (Γ, A)
= ∂∂t A + f (Γ, ∂t A) + f (∂t Γ, A) (by the product rule)
= ∇(∂t A) + f (g −1 ∗ ∇Rc, A) (because ∂t A is a tensor of the same type as A)
= ∇(4A + F ) + ∇Rc ∗ A
= (4∇A + Rm ∗ ∇A + ∇Rc ∗ A) + ∇F + ∇Rc ∗ A
= 4∇A + ∇F + Rm ∗ ∇A + ∇Rc ∗ A.
We used the formula
[∇, 4]A := ∇4A − 4∇A = Rm ∗ ∇A + ∇Rc ∗ A,
which follows from careful use of formula (B.3) followed by the second Bianchi identity (1.8) (see
[6, p. 227]).
Now we note that the formula for the evolution of the Riemann curvature tensor under the
Ricci flow given in Lemma 6.1 translates into ∗-notation as
∂
Rm = 4Rm + Rm∗2 . (6.1)
∂t
This allows us to use the preceding lemmas to calculate evolution equations for covariant derivatives
of the metric.
Lemma 6.4. The evolution equation for the kth iterated covariant derivative of the Riemann
curvature tensor under the Ricci flow is:
k
∂ k X
∇ Rm = 4∇k Rm + ∇j Rm ∗ ∇k−j Rm.
∂t j=0
48
Proof. We prove the result by induction. The evolution equation for Rm given above in equation
(6.1) is the base case k = 0. We assume the relation holds true for a given k and apply Lemma 6.3
with A = ∇k Rm and
Xk
F = ∇j Rm ∗ ∇k−j Rm.
j=0
as required.
49
Theorem 6.6. The Bernstein-Bando-Shi Estimates. Let (Mn , g(t)) be a solution of the
Ricci flow on a closed n-manifold. Then for each α > 0 and m ∈ N, there exists a constant Cm
depending only on m, n and max{α, 1} such that if
α
|Rm(x, t)|g(t) ≤ K for all t ∈ [0, K ],
then
Cm K
|∇m Rm(x, t)|g(t) ≤ α
for all t ∈ (0, K ].
tm/2
Proof. We prove the result by induction on m. For m = 0 the result is true by hypothesis, with
C0 = 1. Assume the result is true for all p ≤ m − 1. The result of Corollary 6.5 tells us that
m
∂ m X
|∇ Rm|2 = 4|∇m Rm|2 − 2|∇m+1 Rm|2 + ∇j Rm ∗ ∇m−j Rm ∗ ∇m Rm
∂t j=0
m
X
≤ 4|∇m Rm|2 − 2|∇m+1 Rm|2 + cmj |∇j Rm||∇m−j Rm||∇m Rm|
j=0
m−1
X Cj Cm−j 2 m
≤ 4|∇m Rm|2 − 2|∇m+1 Rm|2 + cmj j/2 (m−j)/2 K |∇ Rm|
j=1
t t
m 2
+ (cm0 + cmm )K|∇ Rm|
00
0 Cm
≤ 4|∇m Rm|2 − 2|∇m+1 Rm|2 + Cm K|∇m Rm|2 + m/2
K 2 |∇m Rm|
t
α 0 00
for t ∈ (0, K ], where Cm , Cm are constants depending only on m and n. We have used the inductive
hypothesis to go from the second to the third line. We can complete the square (in the variable
|∇m Rm|) on the right hand side then use the inequality (a + b)2 /2 ≤ a2 + b2 to obtain
K2
∂ m 2 m 2 m+1 2 m 2
|∇ Rm| ≤ 4|∇ Rm| − 2|∇ Rm| + C̄m K |∇ Rm| + m (6.4)
∂t t
for some constant C̄m .
To get the desired bound, we need to find an upper bound on tm |∇m Rm|2 . This quantity
clearly has an upper bound at t = 0 because it is equal to 0. So to apply the maximum principle
and get an upper bound we need only show that the reaction terms in its evolution equation cause
it to decrease. The problem we have is that this quantity satisfies the differential inequality
∂ m m
(t |∇ Rm|2 ) ≤ 4(tm |∇m Rm|2 ) − 2tm |∇m+1 Rm|2 + (6.5)
∂t
(C̄m Kt + m)tm−1 |∇m Rm|2 + C̄m K 3 , (6.6)
and the reaction terms (6.6) are not negative. This is the “second difficulty” outlined at the start
of this section.
To fix this problem, we make use of the term −2|∇k+1 Rm|2 in the evolution equation of
Corollary 6.5. By adding the right amount of tm−1 |∇m−1 Rm|2 (which we know by our inductive
hypothesis is bounded above by a constant) we can cancel off the unruly reactionary terms involving
tm |∇m Rm|2 . In so doing, we will introduce new unruly terms in tm−1 |∇m−1 Rm|2 – so we will
need to add the right amount of the next derivative down and so on. We define
m−1
X
G = tm |∇m Rm|2 + αmj tj |∇j Rm|2 ,
j=0
where we will determine the constants αmj so that all the terms cancel as we want them to.
By equation (6.5), we have
∂
G ≤ 4G + (C̄m Kt + m)tm−1 |∇m Rm|2 + C̄m K 3
∂t
m−1
X
αmj −2tj |∇j+1 Rm|2 + (C̄j Kt + j)tj−1 |∇j Rm|2 + C̄j K 3 .
+
j=0
50
By the inductive hypothesis there are numbers Dj depending only on j, n for 1 ≤ j ≤ m − 1 such
that
C̄j Ktj |∇j Rm|2 + C̄j K 3 ≤ Dj K 3
α
for all t ∈ (0, K ]. Hence we have
∂
G ≤ 4G + (C̄m Kt + m)tm−1 |∇m Rm|2 + C̄m K 3 )
∂t
m−1
X
αmj −2tj |∇j+1 Rm|2 + jtj−1 |∇j Rm|2 + Dj K 3
+
j=0
Now we choose the αmj so that the terms in this equation cancel: choose αm,m−1 such that
where the second step follows because we are working with t ∈ (0, α/K]. Now define αm,m−2 ,
αm,m−3 , . . . , αm0 in that order, at each step setting
jαmj − 2αm,j−1 = 0.
If we now define
m−1
X
Bm := C̄m + αmj Dj
j=0
51
The proof that the curvature must explode as t → T follows the same outline as the proof
of the corresponding result for the curve-shortening flow, Theorem 4.5. That is, we assume that
|Rm|g is bounded above by a constant K, and show that the metric g(t) converges smoothly to a
smooth metric g(T ). It is then possible to use the short-time existence result of Theorem 5.4, with
initial metric g(T ), to extend the solution past T . This contradicts the choice of T as the maximal
time such that the Ricci flow exists on [0, T ).
A key element of the proof is the following theorem and its corollary, which show that a limit
metric g(T ) exists and is continuous:
Theorem 6.8. Let M be a closed manifold and g(t) a smooth time-dependent metric on M,
defined for t ∈ [0, T ). If there exists a constant C < ∞ such that
Z T
∂
g(x, t)
∂t dt ≤ C (6.7)
0 g(t)
for all x ∈ M, then the metrics g(t) converge uniformly as t → T to a continuous metric g(T )
such that
e−C g(x, 0) ≤ g(x, T ) ≤ eC g(x, 0).
Note that this means g(T ) is uniformly equivalent to g(0).
where the penultimate step follows as |A(U, U )| ≤ |A|g for any 2-tensor A and unit vector U .
Exponentiating both sides of this inequality gives us:
for some C 0 > 0. Note the difference from equation (6.7): the norm is taken with respect to a
constant metric g(0) rather than the time-dependent one g(t).
We now define Z T
∂
g(x, T ) = g(x, 0) + g(x, t)dt.
0 ∂t
This integral exists as the metrics are smooth and the integrand is absolutely integrable with
respect to the norm induced by g(0), by equation (6.9). Now
Z T
∂
|g(x, t) − g(x, T )|g(0) ≤ ∂t g(x, t)
→0
t g(0)
52
as t → T for each x ∈ M. Because M is compact, this convergence is uniform on M. Hence
g(t) → g(T ) uniformly, so g(T ) is continuous. By taking the limit of equation (6.8) we can show
that
e−C g(x, 0) ≤ g(x, T ) ≤ eC g(x, 0),
hence that g(T ) is positive definite.
Therefore, the metrics g(t) converge uniformly to a continuous Riemannian metric g(T ) which
is uniformly equivalent to g(0).
Corollary 6.9. Let (M, g(t)) be a solution of the Ricci flow on a closed manifold. If |Rm|g is
bounded on [0, T ) (where T < ∞) then g(t) converges uniformly as t → T to a continuous metric
g(T ) which is uniformly equivalent to g(0).
∂
Proof. Any bound on |Rm|g implies one on |Rc|g , and hence on | ∂t g|g (by the equation of the
Ricci flow). The integral (6.7) is then an integral of a bounded quantity over a finite interval, and
is hence bounded. Thus Theorem 6.8 applies.
We have now got a foothold: we have shown that there is a limit metric g(T ), and it is
continuous. We now want to show that this metric is smooth, because we need this if we are to use
the short-time existence result of Theorem 5.4 to extend our solution past T . To do this, we need
to make sure the spatial derivatives of g near the limit time T are not exploding – we need bounds
on them. The first step is to bound the derivatives of the curvature, in analogy with the outlined
proof of Theorem 4.5. We do this via the Bernstein-Bando-Shi derivative estimates (Theorem 6.6),
which give bounds on the derivatives of the curvature under the assumption of bounded curvature.
We recall that the Bernstein-Bando-Shi estimates are completely useless at t = 0 (as one might
expect, for bounds on an arbitrary curvature tensor will not tell us anything about its derivatives
– it is only after a period of Ricci flowing that the derivatives start to be brought under control).
That is fine for us because we are interested in derivative estimates near t = T . We can use
Theorem 6.6 to get such estimates, by considering our Ricci flow as starting at some time shortly
before T . This gives us
Corollary 6.10. (of Theorem 6.6) Let (Mn , g(t)) be a solution of the Ricci flow on a compact
n-manifold. If there exist β, K > 0 such that
where T > β/K, then for each m ∈ N there exists a constant Bm depending only on m, n and
min{β, 1} such that
m
h i
|∇m Rm(x, t)|g(t) ≤ Bm K 1+ 2 for all t ∈ min{β,1}
K ,T .
Proof. We use the result of Theorem 6.6. Let β1 = min{β, 1}. Now, given a time t0 ∈ [β1 /K, T ],
we consider the Ricci flow as starting at the time T0 = t0 − β1 /K. Applying Theorem 6.6 to this
Ricci flow, with α = β1 , tells us that
Cm K
|∇m Rm| ≤
(t − T0 )m/2
Cm K Cm 1+m/2
|∇m Rm| ≤ = K ,
β1 m/2
(K) β1m
Using this bound on the derivatives of curvature near t = T , we can now proceed to bound the
derivatives of the metric g near t = T . The following corollary and its proof are a reformulation of
the argument found in [6, Prop. 6.48].
53
Corollary 6.11. Let (Mn , g(t)) be a solution of the Ricci flow on a closed n-manifold, and let
(xi ), i = 1, . . . , n be a local coordinate system defined on some coordinate chart U ⊂ Mn . If there
exists K > 0 such that
|Rm(x, t)|g(t) ≤ K for all t ∈ [0, T )
0
then for each m ∈ N there exist constants Cm , Cm depending only on the chosen coordinate chart
such that
|∂ m g(x, t)| ≤ Cm
and
0
|∂ m Rc(x, t)| ≤ Cm
for all (x, t) ∈ U × [0, T ), where the norms are taken with respect to the Euclidean metric in the
coordinate system (xi ).
We should explain some of the notation used. By ∂ m g we mean the m+2
0 -tensor field, defined
only in the coordinate chart U , which has coordinates
with respect to the coordinate system (xi ). The Euclidean metric, which is also defined only in U ,
is the metric which has coordinates δij with respect to the coordinate system (xi ).
Proof. Throughout this proof, we will treat the Christoffel symbols Γkij as the coordinates, with
respect to the coordinate system (xi ), of a tensor Γ (Γ is defined only in U ).
Note that by Corollary 6.10, for each m ∈ N there is a uniform upper bound on |∇m Rc| on a
time interval (β/K, T ). There is also an upper bound on the same quantity on the interval [0, β/K]
because the interval is compact, so the quantity is bounded for all m:
for all x ∈ Mn and t ∈ [0, T ), where Dm is some constant depending on m and the particular Ricci
flow (Mn , g(t)).
We will now prove by strong induction that there exist constants Pm , Qm , Rm for each m ∈
N ∪ {0} such that
1. |∂ m−1 Γ| ≤ Pm (we only prove this for m ≥ 1);
2. |∂ m Rc| ≤ Qm ;
3. |∂ m g| ≤ Rm
for all t ∈ [0, T ).
For the base case m = 0, (2) follows from the bound |Rm| ≤ K and (3) follows from Corollary
6.9.
Assume (1) – (3) are satisfied for all m ≤ p − 1. We will prove they are true for m = p, starting
with (1). Note that there is some constant C such that |∂ p−1 Γ| ≤ C at t = 0, because the manifold
Mn is compact.
Bounds on |∂ m g| imply bounds on |∂ m (g −1 )| by differentiating the formula g ij gjk = δki , m
times. Recalling the formula for ∂t Γ from Lemma 6.1, we can prove:
We have bounds on the derivatives of g −1 , so we just need to bound the other terms. For i ≤ p − 1
we have, by Lemma 1.3 (recalling the notation from the end of Section 1.1),
∂ i ∇Rc = ∇i+1 Rc + ∗ ∂ j Γ, ∂ k Rc .
0≤j ≤i−1
k≤i
54
Note that we have a bound on ∇i+1 Rc by equation (6.10), and an inductive upper bound on all
of the other terms on the right hand side, because j ≤ i − 1 ≤ p − 2 and k ≤ i ≤ p − 1. Hence
|∂ i ∇Rc| ≤ C for i ≤ p − 1, therefore
|∂t ∂ p−1 Γ| ≤ C
for some constant C (this C is different from the previous one – there are so many constant uniform
bounds at play in this and subsequent arguments that we will refer to most of them as ‘C’ for
convenience, and hope that this does not cause too much confusion). Thus |∂ p−1 Γ| is bounded
at t = 0 and experiences, at worst, linear growth on the finite time interval [0, T ). Therefore
|∂ p−1 Γ| ≤ Pp for some constant Pp . This completes the proof of (1).
We can now compute (again by Lemma 1.3)
∂ p Rc = ∇p Rc +
j ≤p−1
∗ ∂ j Γ, ∂ k Rc .
k ≤p−1
All terms on the right hand side are bounded inductively (with the exception of ∂ p−1 Γ, which we
just showed was bounded as part of the inductive step), so we have |∂ p Rc| ≤ Qp for some constant
Qp . This completes the proof of (2).
Finally,
|∂t ∂ p g| = | − 2∂ p Rc| ≤ C
by the second part of the inductive step, so because we are on a finite time interval, |∂ p g| ≤ Rp
for some constant Rp . This proves (3), completing the inductive step.
Therefore the result is true by induction.
Corollary 6.12. The metric g(T ) of Corollary 6.9 is smooth, and the metrics g(t) converge
uniformly in every C k norm to g(T ) as t → T .
Proof. To show that g(T ) is smooth we must take derivatives with respect to some system of
coordinates which we can only choose arbitrarily, so take some coordinate patch U of Mn . We
have, from the Ricci flow equation,
Z T
gij (x, T ) = gij (x, t) − 2 Rij (x, τ )dτ
t
55
0
of these Cm where m = |α| (C finite because there are only finitely many coordinate charts), then
equation (6.11) gives us
Z
|α| T
∂ |α| ∂ |α|
∂
g (x, T ) − g (x, t) = 2 R (x, τ )dτ
ij ij ij
∂xα ∂xα
t ∂xα
Z T
≤ 2 |∂ m Rc(x, τ )| dτ
t
≤ 2C(T − t).
∂u
≤ 4u + Cu3/2
∂t
for some C > 0 depending only on n. By hypothesis, |Rm|2 ≤ K 2 at t = 0. The situation is perfect
for applying the maximum principle (Theorem 3.2). We need only solve the associated ode:
dφ
= Cφ3/2 , φ(0) = K 2 .
dt
The solution is 2
K
φ(t) = .
1 − 12 CKt
It follows by Theorem 3.2 that
2
K
u≤ ,
1 − 21 CKt
from which it follows that
K
|Rm| ≤
1 − 21 CKt
as required.
56
Now let b = 1/(CK). We then have
|Rm(x, t)| ≤ 2K
for all t ∈ [0, b) and x ∈ M, and b depends only on K and the dimension of the manifold. Because
where T is the maximal time of existence of the Ricci flow, we must have b < T , hence the Ricci
flow exists for all t ∈ [0, b).
57
Chapter 7
Hamilton’s first major achievement using the Ricci flow was the following, proved in [9].
Theorem 7.1. Let M3 be a closed 3-manifold which admits a smooth Riemannian metric with
strictly positive Ricci curvature. Then M3 also admits a smooth metric of constant positive cur-
vature.
In particular, if M3 is simply-connected, it follows from Theorem 1.16 that M3 is diffeomorphic
to S3 .
The proof in [9] was substantially improved by the introduction of the maximum principle that
we have called Theorem 3.3, in [10]. This improved proof is also presented in Chapter 6 of [6], and
it is this proof that we present in this chapter (we base our exposition on Chow and Knopf’s with
the exception of the proof of Theorem 7.19, the outline of which was suggested to us by Gerhard
Huisken). We will present a more “modern” proof (as presented in [28, 2, 21]) in Section 8.3, based
on Perelman’s recent work.
Because the proofs of some of the results we will use require lengthy calculations which distract
from the main course of the argument, we will relegate these details to Section 7.8.
1. Prove the Ricci flow has a solution for a short time, and this solution is unique. Thus we
can consider a solution on a maximal time interval [0, T ). Note the analogy with Theorem
4.2 for the csf. In addition, T < ∞, in analogy with Theorem 4.4 for the csf. It follows by
Theorem 6.7 that the curvature explodes as t → T , in analogy with Theorem 4.5 for the csf.
58
2. Show that the sectional curvatures must get close to each other as the curvature explodes.
3. Rescale time and the metric to get a solution to the normalized (volume-preserving) Ricci
flow,
R
∂ 2 Mn RdV
g = −2Rc + R g
∂t n Mn dV
(in this case we have a 3-manifold, so n = 3). Show that this solution exists for all time, and
converges as t → ∞ to a metric of constant curvature.
Theorem 7.2. Let (M, g(t)) be a solution of the Ricci flow on a compact manifold, defined for
t ∈ [0, T ). If the metric g(0) has strictly positive scalar curvature (in particular, if it has strictly
positive Ricci curvature) then g(t) becomes singular in finite time, i.e. T < ∞.
Proof. Because M is compact and the scalar curvature at time 0 is strictly positive, it is bounded
below by some ρ > 0. Using the result for the evolution of the scalar curvature in Lemma 6.1 we
have
∂ 2
R = 4R + 2|Rc|2 ≥ 4R + R2
∂t n
(here we have used Lemma 1.1). We can now apply the scalar maximum principle (Theorem 3.2).
The solution of the ode
dφ 2
= φ2
dt n
with φ(0) = ρ is
1
φ(t) = 1 2t .
ρ − n
Proof. By Theorem 7.2, the maximal time T is finite; by Theorem 6.7 the curvature must explode
as we approach this singularity. This proves (7.1). It now follows from Lemma 1.13 (which applies
because the manifold is 3-dimensional) that there exists a constant C such that |Rm| ≤ C|Rc|,
from which (7.2) follows.
59
7.3 Setting the Scene for the Maximum Principle – The
Uhlenbeck Trick
Having seen that the curvature explodes as the maximal time T is approached, we now show that
the metric g(t) becomes “rounder” when the curvature becomes large. To do this, we will apply
the maximum principle for vector bundles (Theorem 3.3) to the Riemann curvature tensor.
Recall, from Lemma 6.1, that under the Ricci flow the Riemann curvature tensor evolves
according to
∂
Rijkl = 4Rijkl + 2(Bijkl − Bijlk + Bikjl − Biljk ) (7.3)
∂t
−(Rip Rpjkl + Rjp Ripkl + Rkp Rijpl + Rlp Rijkp ), (7.4)
where
q p
Bijkl ≡ −Rpij Rqlk . (7.5)
Naı̈vely we might try to apply Theorem 3.3 to Rijkl as a section of the vector bundle of 4-tensors.
However, the reaction terms are so hideous that we will not be able to say anything useful about
the ode that we need to solve to apply Theorem 3.3. Furthermore, Theorem 3.3 can’t deal with
bundle metrics that depend on time. The way forward is what is known as the Uhlenbeck trick.
The idea is to fix the initial tangent bundle with its initial metric, then to evolve the isometry
between this fixed bundle and the tangent bundle with its time-evolving metric. Suppose that we
have a solution g(t) of the Ricci flow on M. Let (V, h) be a vector bundle over M with metric h
such that
u0 : (V, h) → (T M, g0 )
is a bundle isometry. We evolve the isometry u(t) by
∂ i
u = Rli ula (7.6)
∂t a
where the uia are the components of the isometry with respect to some local bases of V and T M.
We will use indices a, b, . . . on the vector bundle V and i, j, . . . on T M to distinguish them more
clearly.
Lemma 7.4. u(t) remains an isometry.
Proof. u(t) : (V, h) → (T M, g(t)) is an isometry as long as h is the pullback of g(t) via u(t), i.e.
h = u(t)∗ g(t). Because h is constant and u(0) is an isometry by definition, it suffices to show that
u(t)∗ g(t) does not change in time.
∂ ∂ i j
(u(t)∗ g(t))ab = (u u gij )
∂t ∂t a b
= Rli ula ujb gij + uia Rlj ulb gij + uia ujb (−2Rij )
= 0
60
maximum principle, is a little mysterious – Hamilton refers to it as “magic”):
∂ ∂ i j k l
Rabcd = (u u u u Rijkl )
∂t ∂t a b c d
i m j k l
= (Rm ua )ub uc ud Rijkl + uia (Rm
j m k l
ub )uc ud Rijkl
+uia ujb (Rm uc )ud Rijkl + uia ujb ukc (Rm
k m l l
um
d )Rijkl
+uia ujb ukc uld [4Rijkl + 2(Bijkl − Bijlk + Bikjl − Biljk )
−(Rip Rpjkl + Rjp Ripkl + Rkp Rijpl + Rlp Rijkp )]
= 4Rabcd + 2(Babcd − Babdc + Bacbd − Badbc ) (7.7)
E = ∧2 T ∗ Mn ⊗S ∧2 T ∗ Mn
of symmetric bilinear forms on the space ∧2 T M3 of 2-vectors. This viewpoint was outlined in
Section 1.4.
In 3 dimensions, ∧2 T M3 has dimension 3, so each fibre of this bundle E is naturally isomorphic
to the vector space of 3 × 3 self-adjoint (i.e. symmetric) matrices. We can diagonalize these with
respect to some orthonormal basis {e1 ∧ e2 , e2 ∧ e3 , e3 ∧ e1 } of ∧2 T M3 , where {e1 , e2 , e3 } is an
orthonormal basis of T M3 .
Now we would like to apply the vector bundle maximum principle (Theorem 3.3) to R, viewed
as a section of E. We must consider the ode corresponding to the pde describing the evolution of
Rm (which is given by equation (7.7)), namely
d
Qabcd = 2(Babcd (Q) − Babdc (Q) + Bacbd (Q) − Badbc (Q))
dt
where Babcd (Q) is defined by analogy with equation (7.5).
In 3 dimensions this ode has a particularly convenient form. Choose a basis {ei } of T M3x so
that Q is diagonal with eigenvalues λ1 , λ2 , λ3 down the diagonal, then the equation is (see [6, Sec.
6.3, 6.4]): 2
λ 0 0 λ1 + λ2 λ3 0 0
d 1
0 λ2 0 = 0 λ22 + λ3 λ1 0 .
dt
0 0 λ3 0 0 λ23 + λ1 λ2
In particular, the matrix Q will remain diagonal – this does not happen for higher dimensions,
because the matrix on the rhs will not be diagonal (there are other useful decompositions in the
n = 4 case though – see [10]). Therefore, in three dimensions the three parameters λ1 , λ2 , λ3
completely describe Q, and we may represent Q as a point λ = (λ1 , λ2 , λ3 ) moving in R3 according
to the ode 2
λ λ + λ2 λ3
d 1 21
3
(λ1 + λ32 + λ33 )
λ2 = λ2 + λ3 λ1 =∇ + λ1 λ2 λ3 . (7.8)
dt 3
λ3 λ23 + λ1 λ2
We note that the initial value of λ also tells us all about the initial Ricci and scalar curvatures, by
the formulae (1.9) and (1.10):
λ2 + λ3 0 0
1
Rc = 0 λ3 + λ1 0 (7.9)
2
0 0 λ1 + λ2
and
R = λ1 + λ2 + λ3 . (7.10)
We can visualize the behaviour of the ode (7.8) by visualizing the evolution of λ(t). The ode
is homogeneous, so we can scale any solution to get another solution. We consider “projecting”
the solution λ(t) onto the unit sphere to get a path µ(t) = λ(t)/|λ(t)|. Given µ(0), the evolution
of µ(t) of course depends on which solution λ(t) we choose, satisfying λ(0) = |λ(0)|µ(0). However,
61
z
S2x R
S3
S2x R S2x R y
x
Figure 7.1: The axes and the region Rc > 0 (shaded) on the unit sphere in R3 . The points √
corresponding to S3 and S2 × R are labelled. Note that S3 corresponds to the point (1, 1, 1)/ 3,
not the origin.
Figure 7.2: The paths µ(t) followed by the solutions to equation (7.8) on the 2-sphere.
62
the different paths µ(t) will only differ by a rescaling of time, by the homogeneity of the ode.
Hence we can unambiguously consider the paths that the solution follows on the unit 2-sphere,
if we do not take into account the speed (this procedure corresponds to our freedom to scale the
manifold without changing its shape).
Figure 7.2 shows the paths µ(t) followed on the sphere by the solutions to the ode (7.8). Figure
7.1 shows the axes, and the region corresponding to Rc > 0 is shaded grey (the diagram becomes
incoherent when these two are superimposed so we’ve split them up for clarity). √
The solution√ is stationary at the points (±1, 0, 0), (0, ±1, 0), (0, 0, ±1), (1, 1, 1)/ 3 and
(−1, −1, −1)/ 3. We recall that the eigenvalues of the curvature operator R correspond to sec-
tional curvatures. Thus the points (1, 0, 0), (0, 1, 0), (0, 0, 1) correspond to a manifold with positive
curvature in one 2-plane and zero curvature in orthogonal 2-planes – we can identify these points
with the manifold S2 × R, or quotients thereof if M is not simply connected. Similarly, we identify
2
the points√ (−1, 0, 0), (0, −1, 0), (0, 0, −1) with the manifold H × R or quotients thereof. The point
(1, 1, 1)/ 3 corresponds to a manifold of constant positive
√ sectional curvature, which we iden-
tify with S3 or quotients thereof, and (−1, −1, −1)/ 3 corresponds to a manifold with constant
negative sectional curvature, which we identify with H3 or quotients thereof.
We can see from Figure 7.2 that the paths are flowing towards S3 , but that some of the ones
flowing from outside the set of metrics with positive Ricci curvature get “caught” on the naughty
S2 × Rs. The set of metrics of strictly positive Ricci curvature is the largest convex set (recall that
we need convexity for Theorem 3.3 to apply) that is symmetric in λ1 , λ2 , λ3 and does not contain
the S2 ×R points. Thus we can see that the Ricci flow is expected to converge towards S3 whenever
it starts off with strictly positive Ricci curvature, but no stronger condition can be imposed.
It is interesting to note that, if we look at the sphere from the opposite direction, the arrows
are all reversed, the S3 is replaced by H3 and the S2 × Rs are replaced by H2 × Rs. Of course the
region shown in Figure 7.2 is all that is important to the proof of Theorem 7.1.
If one chooses λ1 (0) ≥ λ2 (0) ≥ λ3 (0) then it is easy to show that this condition is preserved
under the evolution equation (7.8):
d
log(λ1 − λ2 ) = (λ1 + λ2 − λ3 )
dt
and
d
log(λ2 − λ3 ) = (λ2 + λ3 − λ1 ).
dt
Rc ≥ g.
We have diverged slightly from Chow and Knopf’s formulation of this lemma (namely by
including the second condition) to avoid a technicality that arises in the proof.
Proof. See Section 7.8.
63
Corollary 7.6. The scalar curvature explodes as t → T . That is, if we define Rmax (t) :=
supx∈M R(x, t), then
lim Rmax (t) = ∞.
t→T
Proof. By the result of Lemma 7.5, Rc remains strictly positive under the Ricci flow. Therefore,
if we diagonalize Rc with respect to some orthonormal basis, with eigenvalues a, b, c > 0, then
Rc ≥ 2β 2 Rg.
Proof. We use Lemma 7.5 and formulae (1.9) and (1.10) to deduce that
λ2 + λ3
Rc ≥ g ≥ 2β 2 (λ1 + λ2 + λ3 )g = 2β 2 Rg,
2
for some β > 0.
We now show that the pinching together of the eigenvalues actually gets better as the scalar
curvature goes to +∞.
Theorem 7.8. There exist positive constants δ < 1 and C̄ (depending only on g0 ) such that
λ1 − λ3 C̄
≤ δ,
R R
where R = λ1 + λ2 + λ3 is the scalar curvature.
Proof. See Section 7.8.
Theorem 7.8 shows that, at each point, the sectional curvatures get “pinched” together as the
curvature explodes (λ1 − λ3 is the greatest difference between any two eigenvalues). Furthermore
the left hand side of this estimate is scale-invariant – so even when we rescale the metric by some
factor, this bound tells us that the eigenvalues (i.e. the sectional curvatures) will be close together.
Corollary 7.9. There exist positive constants B, δ̄ such that
|E|2
≤ BR−δ̄
R2
where E is the Einstein tensor, Eij = Rij − 13 Rgij .
Proof. See Section 7.8.
This quantity |E|2 measures how far away the metric is from being an Einstein metric. By
Lemma 1.14, when |E|2 = 0 we have Rij = Cgij where C is constant over the whole manifold. In
the case of 3 dimensions, it follows that the metric has constant sectional curvature (see Lemma
1.14). Thus bounds on |E|2 are very useful.
64
for some B, δ̄ > 0. We recall the result of Lemma 1.14, which says that if E = 0 then R is constant.
Thus we would expect that, when we have a bound on |E|2 (as we do, by Corollary 7.9) everywhere
on the manifold, the scalar curvature R might be close to being constant. So it is reasonable to
expect that we will be able to obtain a bound on |∇R| from our pinching result. This bound would
be very useful to have because it will allow us to compare values of R at different places in M.
Because we already know that R is exploding somewhere on M we will be able to show that it
is getting large everywhere, and hence that the sectional curvatures are getting close everywhere.
One way of bounding |∇R| is:
Theorem 7.10. Let (M3 , g(t)) be a solution of the Ricci flow on a closed 3-manifold with initially
strictly positive Ricci curvature. There exist β̄, γ > 0 depending only on the initial metric such
that for any β ∈ [0, β̄] there exists C such that
|∇R|2
≤ βR−γ + CR−2 .
R3
Proof. See Section 7.8.
Using this estimate we can get global bounds on the variation of the curvatures:
Theorem 7.11. Let (M3 , g(t)) be a solution of the Ricci flow on a closed 3-manifold with ini-
tially strictly positive Ricci curvature, defined for t ∈ [0, T ). Then there exist constants C, γ > 0
depending only on initial data such that
Rmin −γ
≥ 1 − CRmax .
Rmax
Note that this means Rmin /Rmax → 1 as t → T , because Rmax → ∞ as t → T by Corollary 7.6. It
follows that R → ∞ uniformly as t → T .
Proof. See Section 7.8.
Now recall the discussion at the start of this section. Because R → ∞ as t → T , the curvature
should be getting uniformly pinched.
Corollary 7.12. Let λ1 (x, t) ≥ λ2 (x, t) ≥ λ3 (x, t) denote the eigenvalues of the curvature operator
at (x, t), then for any ∈ (0, 1) there exists T ∈ [0, T ) such that
for all t ∈ [T , T ). Note that this means the metric will eventually attain positive sectional curvature
everywhere.
Proof. See Section 7.8.
65
the manifold with respect to g̃ is constant. Using the result of Lemma 1.19 for the scaling of the
volume elements we have
Vol(g̃(t)) = Vol(g̃(0))
⇒ ψ(t)n/2 Vol(g(t)) = Vol(g(0))
2
dµ(t) − n
R
⇒ ψ(t) = R Mn
n dµ(0) M
We define a rescaling of time to get rid of the ψ(t) terms in this evolution equation:
Z t
τ= ψ(u)du.
0
Lemma 7.13. R̃max is bounded, for the normalized Ricci flow on a closed 3-manifold with initially
strictly positive Ricci curvature.
66
Proof. We use Corollary 7.7 to deduce that
˜ = Rc ≥ 2β 2 Rg ≥ 2β 2 Rmin g = 2β 2 R̃min g̃
Rc
for some β > 0. It follows from Myers’ Theorem (Theorem 1.17) that L̃, the diameter of M with
respect to the metric g̃, satisfies
π
L̃ ≤ p .
β R̃min
Now we can also get a lower bound for L̃ from the fact that the volume of our manifold is constant
with respect to g̃, by the first part of the Bishop-Günther volume comparison theorem (Theorem
˜ = Rc ≥ 0, we have (in the notation of Theorem 1.15):
1.15). Because Rc
4π 3
Ṽ = Vol(B(p, L̃)) ≤ V30 (L̃) = L̃
3
where Ṽ is the volume of M with respect to g̃, which we know is equal to a constant, which we
call Ṽ0 . Combining these gives us
! 13
3Ṽ0 π
≤ L̃ ≤ p .
4π β R̃min
This gives us an upper bound for R̃min . Finally, it follows from the first part of Theorem 7.11 and
scaling invariance that there exists some C > 0 such that
R̃min Rmin 1
= ≥ .
R̃max Rmax C
Hence the upper bound for R̃min implies one for R̃max , and the proof is complete.
We can now show that our rescaling of time has in fact taken us to an infinite time interval.
Theorem 7.14. If (M3 , g(t)) is a solution of the Ricci flow on a closed 3-manifold with initially
strictly positive Ricci curvature then the corresponding normalized Ricci flow exists for all time.
Proof. First we show that
Z T
Rmax (t)dt = ∞,
0
where Rmax (t) is the maximum scalar curvature of the metric g(t) and [0, T ) is the maximal time
interval on which the unnormalized Ricci flow exists. This can be done by setting
Z t
f (t) = exp 2 Rmax (u)du Rmax (0),
0
67
is equivalent to
d
log(|φ(t)|) = 2Rmax (t)
dt
from which we can see that φ never changes sign. Hence the condition (R − f ) ≤ 0 is preserved,
by the scalar maximum principle. By Corollary 7.6, Rmax → ∞ as t → T , so f → ∞ as t → T .
Recalling the definition of f , this means that
Z t
lim Rmax (u)du = ∞.
t→T 0
Now the corresponding integral for the normalized flow will be the same:
Z T̃ Z T
R̃max (τ )dτ = r(t)dt = ∞
0 0
(if we define [0, T̃ ) to be the maximal time interval on which the normalized Ricci flow exists).
This follows easily from the scaling results of Lemma 1.19 as R̃max dτ = (ψ −1 Rmax )(ψdt) = rdt. So
the integral for the normalized flow must also diverge – but the integrand is bounded by Lemma
7.13. Hence the region of integration must be infinite, so T̃ = ∞, i.e. the normalized Ricci flow
exists for all time.
The proof will be very similar to that of Theorem 6.7, and the key element will again be Theorem
6.8. To show that g̃(∞) exists and is continuous, we must show that there exists some C < ∞
such that Z ∞
∂
g̃ dτ < C.
∂τ
0 g̃
Using the normalized Ricci flow equation (7.12), that is equivalent to showing that the integral
Z ∞
˜ − r̃ g̃ dτ
Rc (7.13)
0
3
Rc − 1 Rg 2
3
≤ CR−δ .
R2
This is a bound on exactly the right sort of quantity for the unnormalized Ricci flow (Theorem 7.11
tells us that as t → ∞, r and R approach one another uniformly). We need a bound that holds
for the normalized Ricci flow though, and one that will force the integral (7.13) to be bounded
too. The easiest way to do this is to prove the integrand is bounded by a decaying exponential (a
constant bound will not suffice as it did for Corollary 6.12, as the region of integration is infinite).
We will need a few preliminary results.
Lemma 7.15. If (M3 , g̃(τ )) is a solution of the normalized Ricci flow on a closed 3-manifold with
initially strictly positive Ricci curvature then there exists > 0 such that R̃ ≥ for all τ > 0.
Proof. See Section 7.8.
We are now ready to prove the key estimate that allows us to prove the integral (7.13) is
bounded.
68
Theorem 7.16. If (M3 , g̃(τ )) is a solution of the normalized Ricci flow on a closed 3-manifold
with initially strictly positive Ricci curvature then there exist constants C, δ > 0 such that
Ẽ ≤ Ce−δτ .
where we have amalgamated the two exponential bounds into one. It follows by Theorem 6.8 that
g̃(τ ) converges uniformly to a continuous metric g̃(∞) as τ → ∞.
The next step is to show that this convergence is smooth. This is important because we
want our limit metric to be smooth, and also because we want the curvatures of the normalized
flow to converge to the corresponding curvatures of the limit metric (as curvature quantities are
all essentially second-order derivatives of the metric). That will allow us to conclude that the
curvature pinching results we have proven for the flow lead to similar results for the limit metric,
and hence that the limit metric has constant curvature.
Theorem 7.19. The limit metric g̃∞ := g̃(∞) of Theorem 7.18 is smooth, and the convergence
of g̃(τ ) to g̃∞ as τ → ∞ is uniform in every C m norm.
Proof. See Section 7.8.
We are now finally in a position to prove Theorem 7.1. We show that the limit metric g̃∞ has
the properties required.
Theorem 7.20. The limit metric g̃∞ is a smooth metric with constant positive sectional curvature.
Proof. By Theorem 7.19, g̃(τ ) converges to g̃∞ in the C 0 , C 1 and C 2 norms. Because all curvature
quantities are combinations of “0th-order”, 1st-order and 2nd-order derivatives of the metric, this
means that we can take the limit to show that the Einstein tensor of the metric g̃∞ vanishes:
Ẽ∞ = lim Ẽ(τ ) ≤ lim Ce−δτ = 0
τ →∞ τ →∞
by the result of Theorem 7.16. Therefore g̃∞ is an Einstein metric, so by Lemma 1.14 g̃∞ has
constant (positive) sectional curvature.
69
7.8 Details
Proof. (of Lemma 7.5) Note that such a C and will exist at time t = 0 by the compactness of the
manifold M3 : λ1 /(λ2 + λ3 ) is a continuous, positive function at t = 0 so it has an upper bound,
and the eigenvalues of Rc are (λ2 + λ3 )/2, (λ3 + λ1 )/2, (λ1 + λ2 )/2, all of which are strictly positive
at time t = 0 so they have a strictly positive lower bound. Thus it suffices to show that these
conditions are preserved under evolution by the Ricci flow.
We use the maximum principle in the form of Theorem 3.3, on the vector bundle E, with the
set
K ≡ {Q ∈ E : λ1 (Q) − C(λ3 (Q) + λ2 (Q)) ≤ 0 and λ2 + λ3 ≥ 2}.
Note that because λ1 ≥ λ2 ≥ λ3 , the last condition states exactly that Rc ≥ g by formula (1.9).
This set is invariant under parallel translation and convex in each fibre. Convexity follows because
the functions
f (Q) = λ1 (Q)
= max (Q(U, U ))
|U |=1
and
are convex, so the function j(Q) = f (Q) + Ch(Q) is convex, therefore the set
is convex.
We just need to show that the solution of the associated ode stays in K. That is, if
λ1
≤C
λ2 + λ3
and
λ2 + λ3 ≥ 2
initially, then this condition remains true under the evolution (7.8). This can be shown by proving
first that
d
(λ2 + λ3 ) = λ22 + λ23 + λ1 (λ2 + λ3 ) ≥ 0,
dt
from which it follows that the condition λ2 + λ3 ≥ 2 is preserved, then
by the evolution equation (7.8), because λ2 + λ3 ≥ > 0 and hence λ1 > 0 also. This is the sole
reason for including the condition Rc ≥ g – to show that the rhs of this equation does not become
singular.
Thus all the hypotheses of Theorem 3.3 are satisfied, so the solution to the pde stays in K, i.e.
the conditions
λ1
≤C
λ2 + λ3
and
Rc ≥ g
are preserved.
70
Proof. (of Theorem 7.8) The proof is very similar to that of Lemma 7.5. It suffices to show that
there exist C̄, δ such that
λ1 − λ3 C̄
≤ . (7.14)
λ2 + λ3 (λ2 + λ3 )δ
By compactness, given δ ∈ (0, 1) we can choose such a C̄ at time t = 0, because λ2 + λ3 > 0
everywhere by the positive Ricci curvature condition. We show that this condition is preserved
using the maximum principle (Theorem 3.3) with the same vector bundle E as we used in the proof
of Lemma 7.5, but a different K:
Once again, the set K is invariant under parallel translation and convex in each fiber by a
similar argument to that in the proof of Lemma 7.5.
To show that the solution of the ode stays inside K we compute, from equation 7.8:
d λ1 − λ3 1
log ≤ δλ1 − (1 − δ)(λ3 + λ2 ).
dt (λ2 + λ3 )1−δ 2
Now, by Lemma 7.5, it is possible to choose δ small enough that this is always non-positive, so
λ1 − λ3
(λ2 + λ3 )1−δ
is non-increasing, hence the inequality (7.14) is preserved by the ode. Thus by the maximum
principle it is also preserved by the pde.
Proof. (of Corollary 7.9) Using the formulae (1.9) and (1.10) we have
λ2 + λ3 − 2λ1 0 0
1
E= 0 λ3 + λ1 − 2λ2 0 , (7.15)
6
0 0 λ1 + λ2 − 2λ3
hence
|E|2 (λ2 + λ3 − 2λ1 )2 + (λ3 + λ1 − 2λ2 )2 + (λ1 + λ2 − 2λ3 )2
=
R2 36(λ1 + λ2 + λ3 )2
(λ1 − λ2 ) + (λ2 − λ3 )2 + (λ3 − λ1 )2
2
=
12(λ1 + λ2 + λ3 )2
3(λ3 − λ1 )2
≤
12(λ1 + λ2 + λ3 )2
≤ BR−δ̄
by the result of Theorem 7.8, with δ̄ = 2δ (the penultimate step follows as λ1 − λ3 is the greatest
difference between two eigenvalues).
Proof. (of Theorem 7.10) We provide an outline of the proof without showing all of the excruciating
details. A more complete outline can be found in [6, Section (6.6)].
Using the evolution equation for R under the Ricci flow, given in Lemma 6.1, it is possible to
deduce that
2
∂ |∇R|2 |∇R|2
∇R
= 4 − 2R ∇
∂t R R R
|∇R|2 4
−2 |Rc|2 + h∇R, ∇|Rc|2 i
R2 R
2
|∇R|2
∇R
≤ 4 − 2R ∇
R R
|∇R|2 √
−2 2
|Rc|2 + 8 3|∇Rc|2
R
71
where the last line follows from the Cauchy-Schwarz inequality and Lemma 1.1 (to prove |∇R|2 ≤
3|∇Rc|2 ):
4 4
h∇R, ∇|Rc|2 i |∇R| ∇|Rc|2
≤
R R
|Rc|
≤ 8|∇R||∇Rc|
√ R
≤ 8 3|∇Rc|2
(using |Rc| < R because Rc > 0, as in Corollary 7.6). We would like to use the maximum principle
(Theorem 3.2) to show that the quantity |∇R|2 /R does not increase. All reaction terms on the
right hand side of our evolution equation are negative with the exception of the last one in |∇Rc|2 .
We need to deal with this meddlesome customer. We note that
1
|∇E|2 ≥ |∇Rc|2 (7.16)
37
where E is the Einstein tensor (which we know ought to be falling away nicely as our manifold
gets rounder). To see why this is true, we combine Lemma 1.1 with equation (1.14) to get
1 j 1 1
|∇k Eij |2 ≥ |∇ Eij |2 = · |∇R|2 .
3 3 36
Equation (7.16) now follows easily:
1
|∇Rc|2 − |∇R|2 = |∇E|2
3
1 1
≥ |∇R|2
3 36
which can be rearranged to obtain
1 1
|∇Rc|2 − |∇R|2 ≥ |∇Rc|2
3 37
where |∇E|2 is exactly equal to the lhs.
Therefore our evolution equation becomes
2
|∇R|2 |∇R|2
∂ ∇R
≤ 4 − 2R ∇
∂t R R R
|∇R|2 √
−2 2
|Rc|2 + 8 3 · 37|∇E|2 .
R
We are going to try to get rid of the bad final term of this equation by adding on some multiple
of |E|2 . The evolution equation for |E|2 under the Ricci flow is given in Lemma 6.1. It is
∂|E|2
= 4|E|2 − 2|∇E|2 + W (7.17)
∂t
where
26
W = R|Rc|2 − 8Rij Rjk Rki − 2R3 . (7.18)
3
The second term on the right hand side of equation (7.17) interests us: we can use it to cancel
off the recalcitrant |∇E|2 term in the evolution equation for |∇R|2 /R. We set
|∇R|2 37 √
V := + (8 3 + 1)|E|2
R 2
then calculate
∂
V ≤ 4V − |∇Rc|2 + C1 W
∂t
72
(reusing the inequality (7.16)) where C1 is a positive constant.
It is possible to prove that
50
W ≤ R|E|2 (7.19)
3
purely algebraically; as Rc is positive and symmetric we can diagonalize it with positive real
eigenvalues a, b, c. Each term in the inequality can be expressed in terms of a, b and c, and we are
left with an easy homogeneous inequality in three real variables. Equality occurs when a = b = c,
i.e. for an Einstein metric (this proof is significantly simpler than that presented in [6]). We also
have
|∇R|2 ≤ 3|∇Rc|2
by Lemma 1.1.
Hence we have
∂
V ≤ 4V − |∇Rc|2 + C2 R|E|2
∂t
1
≤ 4V − |∇R|2 + C3 R3−2γ
3
by the result of Corollary 7.9, for some constants C and γ > 0. Now we set
U := V − βR2−γ
where β is to be chosen. It is not difficult to calculate from the result of Lemma 6.1 that
∂ 2−γ
R = 4(R2−γ ) − (2 − γ)(1 − γ)R−γ |∇R|2 + 2(2 − γ)R1−γ |Rc|2 ,
∂t
hence that
∂ 1
U ≤ 4U + β(2 − γ)(1 − γ)R−γ − |∇R|2 (7.20)
∂t 3
+C3 R3−2γ − 2β(2 − γ)R1−γ |Rc|2 .
Now by Theorem 7.8 we have Rc ≥ g, hence R ≥ 3 for some > 0. Thus we may choose
β̄ > 0 such that
(3)γ
β̄ < ,
3(2 − γ)(1 − γ)
so that for any β ∈ [0, β̄] the second term in the evolution equation (7.20) is non-positive. For the
rest of the terms, we use the inequality 3|Rc|2 ≥ |R|2 to obtain
where C3 , C4 are constants. For large R this term is dominated by the R3−γ term, which is
negative. As it does not diverge at R = 0, there is a uniform upper bound C5 for this term, hence
we have
∂
U ≤ 4U + C5 .
∂t
Thus the growth in U is at worst linear, by the maximum principle (Theorem 3.2). By Theorem
7.2 the time interval we are considering is finite, hence we have a uniform upper bound C for U .
Therefore
|∇R|2
≤ V = U + βR2−γ ≤ C + βR2−γ
R
from which the result follows.
Proof. (of Theorem 7.11) Our main tool here will be Theorem 7.10, which allows us to compare
the curvature at different points on the manifold. By Corollary 7.6, we also have Rmax → ∞ as
73
t → T . Thus for t sufficiently close to T , Theorem 7.10 tells us that there exist positive constants
A, α such that
3/2−α
|∇R| ≤ ARmax .
Thus we have (for t sufficiently close to T )
Z
3/2−α
|R(x) − R(y)| ≤ |∇R|ds ≤ ARmax d(x, y) (7.21)
γ
(here γ is a minimizing geodesic connecting x and y). Let us choose x(t) such that Rmax (t) = R(x, t)
(we can do this as M is compact), and define
1
L(t) := p
Rmax (t)
where > 0. Then for all y ∈ B(x(t), L(t)) we have, by equation (7.21):
3/2−α A −α
R(y) ≥ R(x) − ARmax L ≥ Rmax 1 − Rmax . (7.22)
Recall that Rmax → ∞ as t → T . Therefore, given δ > 0, for t sufficiently close to T we have
for δ chosen sufficiently small. Because N is defined to be B(x(t), L(t)) but has diameter < L,
if there are any points in M that lie outside N then M must be disconnected. We know that it
isn’t, hence N must be all of M, from which the result follows by equation (7.22).
Proof. (of Corollary 7.12) We apply Theorem 7.8:
74
Proof. (of Lemma 7.15) We note that the proof of this Lemma presented in [6] is incorrect. The
proof we present is based on [9, Lemma 16.7].
We use the following lemma, which is used to prove the Sphere Theorem (Theorem 1.18):
Lemma 7.21. If M is a simply connected manifold of dimension ≥ 3 whose sectional curvatures
are pinched between K and 41 K for some constant K (M is “ 41 -pinched”), then the injectivity
√
radius of M is at least π/ K.
Combining this with equation (7.26) gives us a lower bound on R̃max . The result of Theorem 7.11
now gives a uniform lower bound on R̃min , as required.
We will need, in the proofs of many of the remaining theorems, to use maximum principle
arguments similar to those employed in Chapter 6 and early in this chapter. For this we must
know the evolution equation satisfied by the quantity we are considering under the normalized
Ricci flow – luckily there is a very simple way of going from the unnormalized evolution equation
to the normalized one. We need to introduce the concept of the degree of a tensor. If P is some
tensor involving the metric and the curvature for the unnormalized flow, then by the results of
Lemma 1.19, the same quantity calculated for the normalized flow (which is just a dilation of the
unnormalized flow by a factor of ψ, as in Section 7.6) will be related to P by a rule of the form
P̃ = ψ n P . We call n the degree of P . For example, by Lemma 1.19, the scalar curvature R has
degree −1.
Lemma 7.22. If P satisfies
∂P
= 4P + Q
∂t
for the unnormalized flow on a 3-manifold, and P has degree n, then Q has degree n − 1 and for
the normalized equation,
∂ P̃ 2r̃n
= 4̃P̃ + Q̃ + P̃ .
∂τ 3
75
Proof. Note that Q has degree n − 1 because ∂τ /∂t = ψ so ∂P/∂t has degree n − 1. We also have
4 = ψ 4̃, so 4P has degree n − 1, thus Q must have degree n − 1 too.
We now have P̃ = ψ n P and the formula (7.11) for dψ/dt, so
∂ P̃ ∂(ψ n P )
= ψ −1
∂τ ∂t
n−1 ∂P dψ
= ψ + nψ n−2 P
∂t dt
n−1 n−2 2r̃ 2
= ψ (4P + Q) + nψ ψ P
3
2r̃n
= 4̃P̃ + Q̃ + P̃
3
as required.
Proof. (of Theorem 7.16) This proof is a simplification of that presented in [6].
Because |Ẽ|2 ≤ (λ̃1 − λ̃3 )2 /4 as shown in the proof of Corollary 7.9, it suffices for us to prove a
decaying exponential bound on (λ˜1 − λ˜3 ). In fact we will prove that there exist constants C, δ > 0
such that
λ˜1 − λ˜3 ≤ Ce−δτ (λ˜2 + λ˜3 ).
The result will then follow from Lemma 7.13, which gives a uniform upper bound on (λ˜2 + λ˜3 ).
We again use the Uhlenbeck trick, this time for the normalized Ricci flow equation. We replace
the isometry evolution equation (7.6) by
∂ i r̃
u = R̃li ula − uia .
∂τ a 2
˜ is then (using Lemmas 6.1 and 7.22)
The evolution equation for u∗ Rm
∂
R̃abcd = 4̃R̃abcd + 2(B̃abcd − B̃abdc + B̃acbd − B̃adbc ) − r̃R̃abcd .
∂t
This leads to the analogue of equation (7.8) for the behaviour of the ode corresponding to the
Ricci flow pde:
˜ ˜2
λ1 λ1 + λ˜2 λ˜3 − r̃λ˜1
d ˜ 2
λ2 = λ˜2 + λ˜3 λ˜1 − r̃λ˜2 . (7.27)
dτ
λ˜3 ˜ 2
˜ ˜
λ3 + λ1 λ2 − r̃λ3 ˜
We now apply the vector bundle maximum principle (Theorem 3.3) exactly as we did in the proofs
of Lemmas 7.5 and 7.8, with the time-dependent pinching set1
K(τ ) ≡ {Q ∈ E : eδτ (λ˜1 (Q) − λ˜3 (Q)) − C(λ˜3 (Q) + λ˜2 (Q)) ≤ 0}
4.9].
76
for some B > 0, where the last step follows by Lemma 7.5 as, by scale-invariance,
λ˜1 λ1
= ≤ B.
˜
λ2 + λ3˜ λ 2 + λ3
Hence the ode remains inside K(τ ) for appropriately chosen C, δ > 0. K(τ ) is closed, convex and
invariant under parallel translation by analogous methods to those used in the proofs of Lemma
7.5 and Theorem 7.8, so the result follows from the analogue of Theorem 3.3 for time-dependent
pinching sets ([6, Theorem 4.9]).
Proof. (of Lemma 7.17) Because we have a uniform upper bound on the diameter of our manifold
from Myers’ Theorem (as noted in the proof of Lemma 7.13), it suffices to prove an exponential
bound on |∇ ˜ R̃|. This is achieved by a maximum principle argument similar to that of Theorem
7.10. Let
|∇R|2
G= + α|E|2 ,
R2
for some α > 0 chosen later. G then has degree −2, and using similar methods to Theorem 7.10
can be shown, for appropriately chosen α > 0, to satisfy an evolution equation of the form
∂
G ≤ 4G + βR|E|2
∂t
for some β > 0 (see [9, Lemma 17.4]). Thus by Lemma 7.22 we have
∂ 4r̃
G̃ ≤ 4̃G̃ + β R̃|Ẽ|2 − G̃
∂τ 3
≤ 4̃G̃ + Ce−δτ − δ G̃
for some C, δ > 0, by the result of Theorem 7.16, and also using the fact that R̃ is bounded below
by Lemma 7.15 to bound r̃ > 3δ/4 for some δ > 0. Therefore
∂ δτ
e G̃ − Cτ ≤ 4̃ eδτ G̃ − Cτ ,
∂τ
so by the maximum principle eδτ G̃ − Cτ ≤ C from which it follows that G̃ is exponentially
˜ R̃|2 is also exponentially decaying,
decaying. Hence, as R̃ is bounded above by Lemma 7.13, |∇
and the result follows.
Proof. (of Theorem 7.19) The proof we present is different from Hamilton’s original proof in [9],
which relied on integral estimates to bound derivatives of the metric. We instead use the maximum
principle to do the same thing.
As we did in the proof of Corollary 6.12, we must show that in any coordinate patch U , we
have Z ∞
|∂τ ∂ k g|dτ < ∞
0
77
for any k ∈ N. From the equation for the normalized Ricci flow, we see that it suffices to prove
|∂ k F | ≤ Ce−δτ
where
F = Rc˜ − 1 r̃g̃.
3
Combining Theorem 7.16 with Lemma 7.17 (as we did in the proof of Theorem 7.18) we see
that |F | is exponentially bounded. The most important step in proving that its derivatives are also
exponentially bounded will be to prove that its covariant derivatives are exponentially bounded.
Because r̃ is constant (in space, not time) and ∇g̃ ˜ = 0, this is equivalent to proving that the
˜
covariant derivatives of Rc are exponentially bounded with respect to g̃(τ ) (recall that the metrics
g̃(τ ) are uniformly equivalent to g̃(0), so this is equivalent to proving that the derivatives are
bounded with respect to a fixed metric):
Lemma 7.23. There exist positive constants Ck , δk such that
˜ k Rc|
|∇ ˜ ≤ Ck e−δk τ
for all k ≥ 1.
Proof. The proof will follow very similar lines to the proof of Theorem 6.6, and will also use some
of the ideas from the proof of Theorem 7.10. In particular, the proof will be by strong induction
on k, the Einstein tensor E will again be very useful, and C, δ will denote different constants in
just about every equation – they are just generic upper bounds.
˜ ≤ |Rc
We note first of all that |Rc| ˜ − 1/3R̃g̃| + |R̃|/3 ≤ C by the results of Theorem 7.16 and
Lemma 7.13.
Now let us begin the induction. We do not need a base case, because there is a bit of monkey
business going on at k = 0 – which is expected because we know that |Rc| ˜ is not actually expo-
nentially bounded. Assume that |∇ ˜ ≤ Ce−δτ for all 1 ≤ j ≤ k − 1, for some positive constants
˜ j Rc|
C, δ. In the k = 1 case this is a null assumption, hence true.
By Lemma 1.13, Rm can be expressed as a linear combination of traces of Rc (because we are
in dimension 3). It now follows by Lemma 6.4 that ∇k Rc satisfies a heat-type equation which is
basically that given in Lemma 6.4 but with Rc substituted for Rm. It then follows by Lemma 6.2
that |∇k Rc|2 satisfies a heat-type evolution equation of the form
k
∂ k X
|∇ Rc|2 = 4|∇k Rc|2 − 2|∇k+1 Rc|2 + ∇j Rc ∗ ∇k−j Rc ∗ ∇k Rc.
∂t j=0
This is the evolution equation for the unnormalized flow; the equation for the normalized flow will
be the same thing except for an extra term of the form (2nr̃/3)|∇ ˜ k Rc|
˜ 2 , by Lemma 7.22. This
term will be absorbed into the j = 0 term of the summation in the evolution equation though, as
it is bounded by a term of the form Rc ˜ ∗∇˜ k Rc
˜ ∗∇˜ k Rc
˜ (note that by the result of Theorem 7.11,
r̃ is uniformly bounded above by CR for some constant C > 0).
˜ k Rc|
Therefore we have the heat-type equation satisfied by |∇ ˜ 2 under the normalized Ricci flow,
and we can simplify it using the inductive hypothesis:
k
X
˜ k Rc|
∂t |∇ ˜ 2 ≤ ˜ k Rc|
4̃|∇ ˜ 2 − 2|∇
˜ k+1 Rc|
˜ 2+ ˜ j Rc
∇ ˜ ∗∇
˜ k−j Rc
˜ ∗∇
˜ k Rc
˜
j=0
˜ k Rc|
≤ 4̃|∇ ˜ 2 − 2|∇
˜ k+1 Rc|
˜ 2 + Ce−δτ + Rc
˜ ∗ (∇ ˜ k Rc)
˜ ∗2
˜ Rc|
≤ 4̃|∇ k ˜ − 2|∇
2 ˜ k+1 ˜ + Ce
Rc| 2 −δτ ˜ Rc|
+ Bk |∇ k ˜ 2.
We have amalgamated the exponential bounds that came from the inductive hypothesis, and used
the uniform upper bound on |∇ ˜ Rc|.
˜ This is not quite enough though – the reaction terms here
would lead to exponential growth. We use the same trick as we did in the proof of Theorem 6.6:
we add on multiples of the lower-order terms to get rid of the bad reaction terms. However we
78
˜ 2 we will get something that is not exponentially
must be careful: if we add on a multiple of |Rc|
bounded. We must instead add on a multiple of |Ẽ|2 , which is exponentially bounded.
Recall the formula (7.17), and combine it with the inequality (7.19) to get
50
∂t |E|2 ≤ 4|E|2 − 2|∇E|2 + R|E|2 .
3
Now apply equation (7.16) and Lemma 7.22 to obtain
2 ˜ ˜ 2 50 4r̃
∂τ |Ẽ|2 ≤ 4̃|Ẽ|2 −
|∇Rc| + R̃|Ẽ|2 − |Ẽ|2
37 3 3
2 2 ˜ ˜ 2 −δτ
≤ 4̃|Ẽ| − |∇Rc| + Ce
37
(we used the bounds R̃ ≤ C and |Ẽ| ≤ Ce−δτ to get the last term).
Now, let us define
k−1
X
˜ k Rc|
V := |∇ ˜ 2 + αk0 |Ẽ|2 + ˜ j Rc|
αkj |∇ ˜ 2,
j=1
where the αkj are constants that we will choose carefully. V satisfies the evolution equation:
k−1
X
∂τ V ˜ k Rc|
≤ 4̃V + Ce−δτ + (Bk − 2αk,k−1 )|∇ ˜ 2+ ˜ j Rc|
(αkj Bj − 2αk,j−1 )|∇ ˜ 2
j=1
2 ˜ Rc|
˜ 2
+ αk1 B1 − αk0 |∇
37
where we have grouped all of the exponentially decaying terms together.
Let us choose the αkj such that:
Bk − 2αk,k−1 ≤ −1
αkj Bj − 2αk,j−1 ≤ −αkj for 2 ≤ j ≤ k − 1
2
αk1 B1 − αk0 ≤ −αk1 .
37
The evolution equation for V then becomes
∂τ V ≤ 4̃V + Ce−δτ − V.
We have an upper bound on V at time τ = 0 by the compactness of M. The ode associated with
this pde will lead to exponential decay:
dφ
= Ce−δτ − φ
dτ
has the solution
C −δτ
φ(τ ) = Be−τ +
e .
1−δ
Hence, by the maximum principle (Theorem 3.2), V is exponentially bounded. So
˜ k Rc|
|∇ ˜ 2 ≤ V ≤ Ck e−δk τ
as required.
Now we have to go from bounds on |∇ ˜ k Rc|
˜ 2 to bounds on |∂ k Rc|
˜ 2 . As we did in Corollary 6.11
we treat the Christoffel symbols as a tensor in some coordinate chart U . To go from covariant
derivatives to coordinate derivatives we will need bounds on the derivatives of the Christoffel
symbols.
Lemma 7.24. In a fixed coordinate chart U , for all k ∈ N we have a uniform upper bound
|∂ k−1 Γ̃| ≤ Ck .
79
Proof. We will prove the result by strong induction. Assume the result is true for all 1 ≤ k ≤ m−1.
As in the previous lemma, no base case is required.
We can now do the inductive step. We can calculate ∂τ Γ̃ from formula (1.17) in Lemma 1.20,
with hij = −2R̃ij + (2r̃/3)g̃ij for the normalized Ricci flow. The result is
˜ i R̃jl + ∇
∂τ Γ̃kij = −g̃ kl (∇ ˜ j R̃il − ∇
˜ l R̃ij ) (7.28)
˜ = 0 and ∇r̃
(recalling that ∇g̃ ˜ = 0). It now follows by Lemma 1.3 and equation (7.28) that
m−1
∗
X
˜i
∂ m−1 ∂τ Γ̃ = ∂ j Γ̃ ∗ ∇ (∂τ Γ̃)
i=0 j ≤m−2
m
∗
X
˜i ˜
= ∂ j Γ̃ ∗ ∇ Rc
i=1 j ≤m−2
By the inductive hypothesis we have a uniform bound on each of the terms ∂ j Γ̃ and by Lemma
˜ i Rc.
7.23 we have a decaying exponential bound on the terms ∇ ˜ Thus we have
It follows that Z ∞
|∂ m−1 Γ̃| ≤ |∂ m−1 Γ̃|t=0 + Ce−δτ dτ ≤ Cm
0
for some positive constant Cm . This completes the inductive step and hence the proof.
We now have, by Lemma 1.3,
k
∗
X
|∂ k F | = ˜ i F ≤ Ce−δτ
∂j Γ ∗ ∇
i=0 j ≤k−1
for all k ∈ N. We have used the bounds |∂ j Γ̃| ≤ C for j ≤ m − 1 from Lemma 7.24 and the bounds
˜ i F |2 ≤ Ce−δτ from Lemma 7.23 and Theorem 7.16.
|∇
Hence, because we have
Z ∞
∂g̃
g̃∞ (x) = g̃(x, 0) − 2 dτ
0 ∂τ
Z ∞
= g̃(x, 0) − 2 F (x, τ )dτ,
0
we may conclude that for any multi-index α, we can take the derivative under the integral sign:
Z ∞ |α|
∂ |α| ∂ |α| ∂
α
g̃∞ (x) = α
g̃(x, 0) − 2 F (x, τ )dτ,
∂x ∂x 0 ∂xα
Therefore g̃∞ has derivatives of all orders (it is smooth) and it is clear that the convergence is
uniform in any C k norm, in the same way that it followed for Corollary 6.12.
80
81
Chapter 8
82
Curvature explodes
Figure 8.1: Curvature becomes large as the manifold gets close to pinching.
information. Despite all of the manifolds in the flow being compact and homeomorphic to each
other, the limit manifold is neither compact nor homeomorphic to the manifolds in the flow.
Note also that the limit depends on “where we are looking”. For example, suppose we had two
neckpinches happening simultaneously in different places. The limit should be well-defined, i.e. it
can’t be both the blowup of the first neckpinch and the blowup of the second neckpinch. Thus
when we blow up, we need to blow up about a specified point. We define the notion of a pointed
Riemannian manifold (M, g, p) as a Riemannian manifold (M, g) together with a distinguished
point p ∈ M.
83
Figure 8.2: Non-smooth points can develop if curvature is not controlled.
Figure 8.3: A neck losing a dimension because its injectivity radius was not controlled.
2. The injectivity radius at the points pi is uniformly bounded away from zero:
Then some subsequence of the sequence (Mi , gi , pi ) converges smoothly to a complete, smooth,
pointed Riemannian manifold of dimension n.
Note that both conditions are necessary – the first to avoid situations where the limit manifold
is not smooth, as in Figure 8.2. The second avoids such monstrosities as dimensional reduction,
see Figure 8.3.
Hamilton extended this compactness result to deal with limits of Ricci flows. In this way, the
limit when we blow up our singularity will not only be a pointed Riemannian manifold, it will be
a pointed solution of the Ricci flow.
Definition 8.2. Let (Mi , gi (t), pi ) be a sequence of pointed smooth manifolds with time-evolving
metrics defined for on some time interval t ∈ (a, b). If (M, g(t), p) is a pointed smooth mani-
fold with a time-evolving metric for t ∈ (a, b), we say that (Mi , gi (t), pi ) converges smoothly to
(M, g(t), p) as i → ∞ if the following conditions are satisfied:
1. There is a sequence of compact sets Ki ⊂ M that exhausts M, such that p ∈ int(Ki ) for all
i.
84
2. There is a sequence of smooth maps φi : Ki → Mi which are diffeomorphisms onto their
image, such that φi (p) = pi .
3. φ∗i gi (t) → g(t) smoothly in both space and time, in the sense that the convergence is uniform
in any C k norm on any compact subset of M × (a, b).
The compactness theorem for manifolds (Theorem 8.1), can be extended to a compactness
theorem for Ricci flows (this also first appeared in [11])
Theorem 8.2. Let (Mi , gi (t), pi ) be a sequence of n-dimensional pointed solutions of the Ricci
flow defined for t ∈ (a, b), where 0 ∈ (a, b). Suppose that
1.
sup |Rm(gi (t))|(x) < ∞;
i∈N,(x,t)∈Mi ×(a,b)
2.
inf inj(Mi , gi (0), pi ) > 0.
i
That is, the curvature is uniformly bounded above and the injectivity radius at pi is uniformly
bounded away from zero at t = 0. Then there exists a marked solution of the Ricci flow (M, g(t), p)
defined for t ∈ (a, b) such that a subsequence of (Mi , gi (t), pi ) converges smoothly to (M, g(t), p).
Note that we do not need control over all derivatives of the curvature as we did for Theorem
8.1 because of the Bernstein-Bando-Shi estimates (Theorem 6.6) which give us bounds on all
derivatives of the curvature in terms of a bound on the curvature. Of course these estimates rely
crucially on the fact that the metrics gi (t) satisfy the Ricci flow equation.
Let us define Mi := |Rm|(pi , ti ). Note the analogy with the proof of Grayson’s Theorem (Theorem
4.6).
We now blow up the Ricci flows as described in Section 8.1: define
t
gi (t) = Mi g ti + .
Mi
This is merely a parabolic1 rescaling of time and the metric in addition to a translation in time,
and it is not difficult to see that gi (t) is a solution of the Ricci flow defined on the time interval
[−M ti , M (T − ti )). Furthermore, for t ≤ 0 we have
|Rm g ti + Mt i | Mi
|Rm(gi (t))| = ≤ =1
Mi Mi
by the way we defined Mi in equation (8.1). By Corollary 6.13, there exists b > 0 (depending only
on the dimension of the manifold) such that, for all i, gi (t) is defined for t ∈ (−M ti , b), and
1A √
parabolic rescaling of a system scales time by a factor α and length by a factor α – it is a characteristic
of heat-type evolution equations that a parabolic rescaling of any solution is another solution.
85
Because the lower limits −M ti of the time intervals tend to −∞, we seek to define a limit flow
for t ∈ (−∞, b) (for any −B < 0 arbitrarily large, the flows gi (t) will eventually be defined on
(−B, b) so the limit can be defined). Now equation (8.2) shows that the first condition of Theorem
8.2 is satisfied, so the only thing we need in order to pass to a limit flow is to uniform bound
the injectivity radii of the manifolds (M, gi (t)) away from zero, for all i ∈ N and t ∈ (−M ti , b).
This turned out to be a major stumbling block for quite a while, but was resolved by Perelman in
[23]. We will briefly discuss some of Perelman’s ideas in Section 8.4. An accessible exposition of
Perelman’s proof of the injectivity radius bounds is contained in [28, Chap. 8]. Using Perelman’s
injectivity radius bounds, we can apply Theorem 8.2. The result is (passing to a subsequence of
(M, gi (t), pi )):
Theorem 8.3. Suppose that (M, g(t)) is a solution of the Ricci flow defined on a maximal time
interval [0, T ), where T < ∞. Then there exist points pi ∈ M and times ti ∈ [0, T ), ti → T such
that
Mi := |Rm|(pi , ti ) = sup |Rm|(x, t) → ∞.
x∈M,t∈[0,ti ]
If we now define
t
gi (t) = Mi g ti +
Mi
then there exists b > 0 such that (M, gi (t), pi ) converges to a Ricci flow (N , g∞ (t), p∞ ) defined for
t ∈ (−∞, b). Furthermore |Rm(g∞ (0))|(p∞ ) = 1 and |Rm(g∞ (t))| ≤ 1 for t ≤ 0.
One can now seek to classify the possible singularities that one can obtain from such a limiting
argument. Note that the limit flow is defined for t ∈ (−∞, b) – such solutions are called ancient.
Ancient solutions are very special. If we run the Ricci flow equation in reverse we get a non-
parabolic system that tends to develop singularities. Thus we do not expect to be able to extend
an arbitrary Ricci flow on some finite interval (a, b) to an ancient one defined on (−∞, b). Because
ancient solutions are so special, we can hope to classify them despite the fact that classification of
arbitrary Ricci flows is a hopelessly complicated task.
Hamilton went a long way towards performing this classification in [12], showing that (if injec-
tivity radius bounds were satisfied) the limit of the singularity has to be either a 3-sphere S3 or
quotient thereof (which shrinks to a point in finite time) or a cylinder S2 × R, which corresponds
to a neckpinch (see Figure 8.4 for the 2-dimensional analogue). He was unable to rule out a third
possible limit, namely Σ2 × R where Σ2 is Hamilton’s “cigar soliton” (described in Section 2.3),
but Perelman has done away with this difficulty. “Degenerate neckpinches”, in which one of the
halves of the manifold shrinks to a point at the same time as the neck becomes singular (producing
a sort of cusp in the manifold) are also possible.
The neckpinches, as described in Section 2.4, correspond to connected sum decompositions of
the manifold (although some trivial decompositions may occur). The idea is to identify when a
neckpinch is happening, and to perform the connected sum decomposition “manually” – that is,
we cut out the neck part and seal the ends off with discs, glued on smoothly (compare Figure
2.3). This is the process known as “surgery”. The flow can then continue. It might have any
number of singularities that are either quotients of 3-spheres shrinking to a point in finite time, or
products S2 × S1 that shrink down to the S1 factor in finite time, or neckpinches which we must
deal with by surgery. Perelman showed that the surgery times are locally finite (i.e., in any finite
time interval there are only a finite number of times at which surgery occurs). We are left over
with the solutions that have no finite-time singularity, which Perelman showed (building on work
by Hamilton in [13]) must satisfy the Geometrization Conjecture (under rescaling the manifold will
split along tori into expanding hyperbolic pieces with cusps and collapsing “graph manifolds”, as
discussed in Section 2.4).
To show the power of Theorem 8.3 we provide an alternative proof of Theorem 7.1.
Proof. (of Theorem 7.1) We consider a Ricci flow on a closed 3-manifold with initially strictly
positive Ricci curvature. We showed in Theorem 7.2 that the flow must have a singularity in finite
time, hence Theorem 8.3 applies. Let us say that the maximal time interval of existence is [0, T ).
Thus there exist points pi ∈ M, times ti → T and rescaled flows gi (t) defined for t ∈ (−ai , b)
86
Figure 8.4: The limit of a manifold that gets close to pinching is a cylinder.
(where b > 0 and ai → +∞) such that the pointed, rescaled flows (M, gi (t), pi ) converge smoothly
to a Ricci flow (N , g∞ (t), p∞ ) defined for t ∈ (−∞, b).
We use the result of Corollary 7.9, which says that for the Ricci flow (M, g(t)),
|E(g)|2 ≤ CR(g)2−δ
(where E is the Einstein tensor) for some positive constants C, δ. Now the flows gi (t) are just
rescalings of g(t) by the respective factors Mi (defined in Theorem 8.3, with the property that
Mi → ∞), hence one can deduce from the results of Lemma 1.19 that
because Mi → ∞.
Therefore g∞ (0) is an Einstein metric. By Lemma 1.14, it has constant sectional curvature.
Because R(gi (t)) > 0 for all i, in the limit we must have R(g∞ (0)) ≥ 0 hence the sectional
curvatures are non-negative. Furthermore |Rm(g∞ (0))| = 1 so the sectional curvatures are strictly
positive.
It follows that the Ricci curvature tensor of g∞ (0) on N is bounded below and hence, by
Myers’ Theorem (Theorem 1.17) that N is compact. Therefore, by the definition of convergence
of manifolds, N must in fact be diffeomorphic to M. Thus the metric g∞ (0) induces a metric of
constant, strictly positive sectional curvature on M, as required.
Note that, although we needed the pinching results of Section 7.4 for this proof, we did not
need to prove any of the subsequent global pinching or convergence results – so significantly less
work is required than in Chapter 7, if we assume Theorem 8.3!
87
6,8]) but will present one of the key ideas behind his proof – that the Ricci flow can be formulated
as a gradient flow.
We return to the analogy of the csf. Recall Lemma 4.1, which showed that the csf was a
steepest-descent flow for length. Can the Ricci flow be similarly formulated as the steepest-descent
flow for some functional? The simplest approaches do not work. In analogy with the length
functional for the csf the simplest functional we might consider is the volume of the manifold
with the metric g – but we know that some manifolds increase in volume and some decrease in
volume under the Ricci flow (specifically, hyperbolic manifolds increase in volume while manifolds
of positive sectional curvature decrease in volume – see Section 2.3), soR that will not work. The
second-simplest functional we might try is the total scalar curvature, Rdµ. The corresponding
gradient flow is, by formula (1.23) in Lemma 1.20,
∂g 1
= −Rc + Rg.
∂t 2
This is similar to the Ricci flow and looks a bit like the unnormalized Ricci flow but is not parabolic
(and cannot be made parabolic using methods like the DeTurck trick) so it is completely useless
as an evolution equation.
What clues do we have to the nature of the correct functional? One property it ought to have
is that it should obviously be constant on fixed points of the Ricci flow. But recall from Section 2.3
that there exist “generalized fixed points” of the Ricci flow, the so-called Ricci solitons on which
the Ricci flow is only changing the manifold by reparametrization and scaling. We hypothesize
that the functional we seek should also be constant on gradient Ricci solitons (gradient solitons are
much easier to deal with than general solitons). For the moment we will consider gradient solitons
that do not involve any rescaling of the metric, so the constant λ that appears in equation (2.1) is
0. Thus we expect our functional to be constant if there exists a scalar function f : M → R such
that Rij + ∇i ∇j f = 0.
This gives us a clue that our functional should not depend on g alone, but should also depend
on a scalar function f . In fact we define Perelman’s F-functional:
Z
R + |∇f |2 e−f dµ,
F(g, f ) :=
M
defined for Riemannian metrics g and scalar functions f . To work out what its gradient flow is
we must calculate how it evolves under an arbitrary variation in the metric g and the function f :
∂g/∂t = h and ∂f /∂t = k.
Lemma 8.4. The variation of F(g, f ) is given by:
Z
d 1 ij
F(g, f ) = hij (−Rij − ∇i ∇j f ) + (24f − |∇f |2 + R) h gij − k e−f dµ.
dt M 2
We will not prove this result, but it follows from the results and methods used in Lemma 1.20.
Perelman now defined f so that the volume form e−f dµ is preserved. By formula (1.21) of
Lemma 1.20, we have
∂ −f 1 ij
e dµ = g hij − k e−f dµ.
∂t 2
So, if we determine the evolution of f by
∂f 1 ∂
= g ij gij
∂t 2 ∂t
and define the (now constant) “modified volume form” dξ := e−f dµ, then the result of Lemma 8.4
is: Z
d
F(g, f ) = hij (−Rij − ∇i ∇j f )dξ.
dt M
Hence the steepest descent flow for this functional is the one with hij = −Rij − ∇i ∇j f , i.e.
∂
gij = −Rij − ∇i ∇j f.
∂t
88
This is exactly the Ricci flow, with a rescaling of time (as we have −Rij rather than −2Rij on
the rhs) and a time-dependent reparametrisation moving along the gradient vector field of f (as
outlined in the discussion of Ricci solitons in Section 2.3). So the gradient flow of this functional
is equivalent to the Ricci flow. We also note that if the solution is stationary under this flow,
it satisfies the gradient Ricci soliton equation (2.1) with no rescaling term, as we have argued it
should.
We would like to extend the F-functional in some way so that the functional is also constant on
gradient solitons that include rescaling. This is the idea behind the W-functional. We introduce a
positive real-valued scaling parameter τ > 0 and define
Z
τ (R + |∇f |2 ) + f − n udµ
W(g, f, τ ) :=
Mn
We would like an evolution equation for W in the style of Lemma 8.4. The analogous result is
89
Appendix A
In this Appendix we will outline the main existence results for pdes that we will use. This material
is based on [6, Sec 3.2] and [28, Chap. 4].
Consider a vector bundle π : E → M, where M is some Riemannian manifold, with some
bundle metric h. We will be interested in pdes describing the evolution of some time-dependent
section of E, u : M × [0, T ) → E, having the form
∂u
= L(u) (A.1)
∂t
u(x, 0) = u0 (x) (A.2)
∂ |α| f
∂ α f := .
∂xα α2
1 ∂x2
1
. . . ∂xα
n
n
We first consider the case that L is a linear differential operator. That is, using multi-index
notation,
X
L(u) = Lα ∂ α u (A.3)
|α|≤k
90
If ϕ is some covector field (i.e. ϕ ∈ C ∞ (T ∗ M)) then we define the total symbol of L in the
direction ϕ to be the bundle homomorphism σ[L](ϕ) : E → E
X Y
σ[L](ϕ)(u) = Lα (u) ϕαj .
|α|≤k j
In the case of the example L̄ we gave above, the total symbol in the direction ϕ is
X X
σ[L̄](ϕ)(u) = aij ϕi ϕj u + bi ϕi u + cu.
i,j i
91
We define the linearization of L at u0 to be the linear map D[L] : C ∞ (E) → C ∞ (E) so that
d
D[L](v) = L(u(t))
dt t=0
(if it exists).
We say that the system A.1 is strongly parabolic at u0 if the system
∂u
= D[L](u) (A.4)
∂t
u(x, 0) = u0 (x) (A.5)
is strongly parabolic in the sense described previously. We will only be interested in the case where
D[L] has the form X
D[L] = L̃α ∂ α
|α|≤k
Practically all existence proofs in this project will rely on this theorem. See [17] for the proof.
92
Appendix B
1 kl
Γkij = g (∂i gjl + ∂j gil − ∂l gij )
2
2. In normal coordinates about the point p:
(a) γV (t) = (tV 1 , tV 2 , . . . , tV n ) is a geodesic
(b) gij (p) = δij
(c) Γkij (p) = 0, ∂i gjk (p) = 0
3. Covariant derivative:
l
X k
X
∇p Fij11...i
...jl
k
= ∂p Fij11...i
...jl
k
+ Fij11...i
...q...jl js
k
Γpq − Fij11...q...i
...jl
Γq
k pis
(B.1)
s=1 s=1
[∇i , ∇j ]X l ≡ Rijk
l
Xk
m
Rij ≡ Rmjk , R ≡ Rii
1
∇j Rij = ∇i R
2
93
10. Commuting covariant derivatives:
l
X k
X
[∇p , ∇q ]Fij11...i
...jl
k
= js
Rpqm Fij11...i
...m...jl
k
− m
Rpqi F j1 ...jl
s i1 ...m...ik
(B.3)
s=1 s=1
94
Bibliography
[1] Michael T. Anderson. Geometrization of 3-manifolds via the Ricci flow. Notices of the AMS,
51:184–193, 2004.
[2] Huai-Dong Cao and Xi-Ping Zhu. A complete proof of the Poincaré and geometrization
conjectures – application of the Hamilton-Perelman theory of the Ricci flow. Asian J. Math.,
10, No. 2:165–492, 2006.
[3] J. Cheeger and D. Ebin. Comparison theorems in Riemannian Manifolds. North-Holland,
1975.
[4] XiuXiong Chen, Peng Lu, and Gang Tian. A note on uniformization of Riemann surfaces by
Ricci flow. http://arXiv.org/math.DG/0505163, 2005.
[5] S.S. Chern, W.H. Chen, and K.S. Lam. Lectures on Differential Geometry. World Scientific,
2000.
[6] Bennett Chow and Dan Knopf. The Ricci Flow: An Introduction. AMS Math. Surveys and
Monographs, 2004.
[7] Dennis M. DeTurck. Deforming metrics in the direction of their Ricci tensors. Journal of
Differential Geometry, 18:157–162, 1983.
[8] S. Gallot, D. Hulin, and J. Lafontaine. Riemannian Geometry. Springer-Verlag, 1987.
[9] Richard Hamilton. Three-manifolds with positive Ricci curvature. J. Diff. Geo., 17:255–306,
1982.
[10] Richard Hamilton. Four-manifolds with positive curvature operator. J. Diff. Geo., 24:153–179,
1986.
[11] Richard Hamilton. A compactness property for solutions of the Ricci flow. American Journal
of Mathematics, 117:545–572, 1992.
[12] Richard Hamilton. The formation of singularities in the Ricci flow. Surveys in Differential
Geometry, 2:1–136, 1995.
[13] Richard Hamilton. Non-singular solutions of the Ricci flow on three-manifolds. Comm. Anal.
Geom., 7:695–729, 1999.
95
[19] John Milnor. A unique factorization theorem for 3-manifolds. Amer. J. Math., 84:1–7, 1962.
[20] John Milnor and James Stasheff. Characteristic Classes. Princeton University Press, 1974.
[21] John W. Morgan and Gang Tian. Ricci flow and the Poincaré conjecture.
http://arXiv.org/math.DG/0607607v1, 2006.
[22] James R. Munkres. Topology, 2nd Ed. Prentice Hall, 2000.
[23] Grisha Perelman. The entropy formula for the Ricci flow and its geometric applications.
http://arXiv.org/math.DG/0211159v1, 2002.
[24] Grisha Perelman. Finite extinction time for the solutions to the Ricci flow on certain three-
manifolds. http://arXiv.org/math.DG/0307245v1, 2002.
[25] Grisha Perelman. Ricci flow with surgery on three-manifolds.
http://arXiv.org/math.DG/0303109v1, 2003.
[26] Michael Spivak. A Comprehensive Introduction to Differential Geometry. Publish or Perish,
1979.
[27] William P. Thurston. Three-Dimensional Geometry and Topology, Vol. 1. Princeton University
Press, 1997.
[28] Peter Topping. Lectures on the Ricci Flow.
http://www.maths.warwick.ac.uk/~topping/RFnotes.html, 2005.
[29] Peter Topping. Lectures on the Ricci Flow. Cambridge University Press, 2006.
[30] Xi-Ping Zhu. Lectures on Mean Curvature Flows. International Press, 2002.
96