Differential Geometry: Paulo Ventura Araújo
Differential Geometry: Paulo Ventura Araújo
Differential
Geometry
Differential Geometry
Paulo Ventura Araújo
Differential Geometry
Paulo Ventura Araújo
Department of Mathematics
Faculdade de Ciências da Universidade do Porto
Porto, Portugal
Translation from the Portuguese language edition: “Geometria Diferencial” by Paulo Ventura
Araújo, © Paulo Ventura Araújo 1998. Published by IMPA. All Rights Reserved.
© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland
AG 2024
This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher,
whether the whole or part of the material is concerned, specifically the rights of reprinting, reuse of
illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and
transmission or information storage and retrieval, electronic adaptation, computer software, or by
similar or dissimilar methodology now known or hereafter developed.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this
publication does not imply, even in the absence of a specific statement, that such names are exempt
from the relevant protective laws and regulations and therefore free for general use.
The publisher, the authors, and the editors are safe to assume that the advice and information in this
book are believed to be true and accurate at the date of publication. Neither the publisher nor the
authors or the editors give a warranty, expressed or implied, with respect to the material contained
herein or for any errors or omissions that may have been made. The publisher remains neutral with
regard to jurisdictional claims in published maps and institutional affiliations.
Cover illustration: Image created from two figured to place above the title.
This Springer imprint is published by the registered company Springer Nature Switzerland AG
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland
This book is based on the lecture notes of the course Differential Geometry taught
at the Faculty of Sciences of the University of Porto in the academic years 1992–
93 and 1993–94. Students from different courses and with different mathematical
backgrounds attended the course and, consequently, its prerequisites were reduced
to Linear Algebra, Calculus (of one and several variables), and the study of curves
up to the Frenet trihedron. Furthermore, we avoided the introduction of certain
technical apparatus, such as Tensor Calculus, in order to insist instead on results
with accessible geometric content whose proofs, although possibly long, used more
elementary means.
That said, one understands why we have restricted ourselves to the study of curves
and surfaces in Euclidean space. But, in our opinion, even for students pursuing a
scientific career, this is the right approach for a first study of Differential Geometry,
grounding intuition and motivating the problems that arise in higher dimensions.
Although the idea was to reproduce, in order and content, the course notes, the
notes grew and included subjects not discussed in the lectures. There is a risk, when
teaching Differential Geometry at an introductory level, that the harvest of interesting
results will not compensate for the work spent in digesting definitions and assimilating
techniques. The digressions in this text may lead the student to discover some of the
richness of Differential Geometry which, by the imperative of bureaucratic “realism”,
is so often absent from the classroom.
The exercises included in the text are rarely routine, although few are really
difficult, and were chosen on the assumption that a good exercise, with a medium
level of difficulty, should reward the students’ effort by teaching them something.
Among the books we consulted, Manfredo do Carmo’s [6] deserves to be high-
lighted: some of the exercises and the structuring of some subjects come from there.
But several exercises are original, and the selection of themes and the composition of
the proofs reflect personal taste and work.
We now give some hints on how to use the book: Sections 1.1 – 1.3 cover subjects
probably already known to the student, and may be omitted in well-prepared classes.
Chapters 2–4 cover a basic course in Differential Geometry. Sections 3.3, 4.4 may be
omitted if time is tight. If time permits, a choice of topics from Chapter 1 (sections
v
vi Preface
1.4 to 1.8) and Chapter 5 can be made; the interdependence between the various
sections of these chapters is indicated at the beginning of each chapter.
1 Differentiable Curves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1 Velocity and Arc Length . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Acceleration, Curvature and the Frenet Trihedron . . . . . . . . . . . . . . . . 5
1.3 Planar Curves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.4 Contact of Curves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
1.5 Convex Curves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1.6 Curves of Constant Width . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
1.7 Theorem of the Four Vertices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
1.8 The Isoperimetric Inequality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
2 Regular Surfaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
2.1 Definition and Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
2.2 Change of Parameters, Level Surfaces . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
2.3 Differentiable Functions on Surfaces, Tangent Space . . . . . . . . . . . . . 42
2.4 Orientability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
2.5 Areas, Lengths, and Angles: The First Fundamental Form . . . . . . . . . 51
vii
viii Contents
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183
Chapter 1
Differentiable Curves
The first three sections of this chapter contain the basics on curves, and, because of
their brevity, are rather a review of concepts and results, gathered in a form we will
use later; in the last five sections we will deal with subjects whose inclusion in the
course is optional. Section 1.5 should be read before 1.6 and 1.7, but otherwise, and
except for a few exercises, sections 1.4 – 1.8 are independent of each other.
In the space R 𝑛 we will denote vectors by the symbols v, w and points by lowercase
consonants 𝑝, 𝑞. This space is equipped with a canonical Euclidean structure in which
the inner product of two vectors is the sum of the products of their components of
equal√index, i.e. ⟨v, w⟩ = 𝑣 1 𝑤 1 + ⋯ + 𝑣 𝑛 𝑤 𝑛 . The norm or length of a vector is given by
∣v∣ = ⟨v, v⟩, and the angle between the nonzero vectors v and w is the only number
𝜃 ∈ [0, 𝜋] such that
⟨v, w⟩
cos 𝜃 = .
∣v∣ ∣w∣
The distance between the points 𝑝 and 𝑞 is defined as the length of the vector 𝑝 − 𝑞.
A parametrized curve is a continuous function 𝛼∶ 𝐼 → R 𝑛 on an interval of R in
Euclidean 𝑛-dimensional space, and its trace is the image of that function. Writing
𝛼(𝑡) = (𝑥 1 (𝑡), . . . , 𝑥 𝑛 (𝑡)), the functions 𝑥𝑖 are the component functions of 𝛼. We say
that 𝛼 is of class 𝐶 ∞ if each of its component functions has continuous derivatives of
all orders (if 𝛼 is defined, for example, on a closed interval [𝑎, 𝑏], then we require the
existence of all right-hand side derivatives at 𝑎 and left-hand side derivatives at 𝑏).
The velocity vector of the curve is 𝛼′ (𝑡) = (𝑥 1′ (𝑡), . . . , 𝑥 𝑛′ (𝑡)) and, when nonzero,
points in the direction tangent to the curve at time 𝑡. Regular curves are those whose
velocity vector never vanishes and therefore have a well-defined tangent direction at
each instant.
From now on, unless otherwise stated, by curve we mean a regular parametrized
curve of class 𝐶 ∞ .
z y
y
x
Figure 1.1
One of the first questions to ask about a curve is how to compute its length. Assume
that the curve 𝛼 is defined on a closed interval [𝑎, 𝑏]. Take an arbitrary partition
𝑎 = 𝑡0 < 𝑡1 < ⋯ < 𝑡 𝑚 = 𝑏 of that interval. The sum
𝑚
∑ ∣𝛼(𝑡𝑖 ) − 𝛼(𝑡𝑖−1 )∣
𝑖=1
gives the length of the polygonal line obtained by replacing, for each 1 ≤ 𝑖 ≤ 𝑚,
the trace of the curve in the interval [𝑡 𝑖−1 , 𝑡 𝑖 ] by the line segment joining 𝛼(𝑡𝑖−1 )
with 𝛼(𝑡𝑖 ) (see Fig. 1.2). The narrower and more numerous are the intervals of the
partition, the better the sum should approximate the length of the curve. In Exercise 3
of this section we show that the limit of these sums, as the maximum of the differences
𝑡𝑖 − 𝑡 𝑖−1 tends to zero, is given by the integral ∫𝑎 ∣𝛼′ (𝑡)∣ 𝑑𝑡, and this is how the length
𝑏
a (t 5 )
a (t0 ) a (t 4 )
a (t1 ) a (t 3 )
a (t 2 )
Figure 1.2
Our first result reassures us as to the correctness of the definition just given, by
showing that the line is the shortest path between two points:
Proposition 1.1.1 Let 𝛼∶ [𝑎, 𝑏] → R 𝑛 be a curve. Then 𝑙(𝛼) ≥ ∣𝛼(𝑏) − 𝛼(𝑎)∣, and
equality holds if and only if the trace of 𝛼 is a line segment.
vector whose coordinates are the integrals of the components of 𝛼′ (𝑡). Forming the
inner product of both sides with the vector 𝛼(𝑏) − 𝛼(𝑎), we obtain
𝑏
∣𝛼(𝑏) − 𝛼(𝑎)∣2 = ∫ ⟨𝛼′ (𝑡), 𝛼(𝑏) − 𝛼(𝑎)⟩ 𝑑𝑡
𝑎
𝑏
≤∫ ∣𝛼′ (𝑡)∣∣𝛼(𝑏) − 𝛼(𝑎)∣ 𝑑𝑡 = 𝑙(𝛼)∣𝛼(𝑏) − 𝛼(𝑎)∣,
𝑎
from which, simplifying, we obtain the desired inequality. For equality to hold, the
Cauchy-Schwarz inequality
will have to reduce to an equality for all 𝑡 ∈ [𝑎, 𝑏], which happens if and only if 𝛼′ (𝑡)
is a positive multiple of 𝛼(𝑏) − 𝛼(𝑎) — i.e., if and only if the trace of 𝛼 is the line
segment joining 𝛼(𝑎) to 𝛼(𝑏). ◻
The value 𝑣(𝑡) = ∣𝛼′ (𝑡)∣ is the scalar velocity of the curve 𝛼 at time 𝑡, and
𝑡
𝑆(𝑡) = ∫ 𝑣(𝑟) 𝑑𝑟
𝑎
is the arc length function. When the scalar velocity is constant, the arc length is
proportional to time. We recall that the curve 𝛼˜ is a reparametrization of 𝛼 if 𝛼˜ = 𝛼 ○ ℎ,
for some increasing diffeomorphism ℎ∶ 𝐽 → 𝐼 between intervals of R . (If ℎ were
4 1 Differentiable Curves
decreasing, then we would obtain a curve with the same trace as 𝛼 but with reversed
orientation — that is, traversed in the opposite direction.) Let us now show, by
defining a suitable function ℎ, that it is possible to reparameterize 𝛼 so that 𝛼 ○ ℎ has
constant scalar velocity, equal to 1.
Since 𝑆 ′ (𝑡) = 𝑣(𝑡) > 0, the function 𝑆 is increasing and sends [𝑎, 𝑏] diffeomor-
phically onto the interval [0, 𝑙(𝛼)]. Defining ℎ as the inverse of 𝑆, and putting
˜
𝛼(𝑠) = 𝛼 ○ ℎ(𝑠), we have
1 ′
𝛼˜ ′ (𝑠) = ℎ′ (𝑠)𝛼′ (ℎ(𝑠)) = 𝛼 (𝑡),
𝑣(𝑡)
where 𝑡 = ℎ(𝑠), and from this we see that ∣𝛼˜ ′ (𝑠)∣ = 1 for all 𝑠 in [0, 𝑙(𝑎)].
We say that a curve is parametrized by arc length when it is traversed with constant
scalar velocity equal to 1, regardless of whether the time at which the parameterization
starts is zero.
Exercises
1. Let 𝛼∶ [𝑎, 𝑏] → R 𝑛 and 𝛽∶ [𝑐, 𝑑] → R 𝑛 be two regular, injective curves with the
same trace. Show that the function 𝛽−1 ○ 𝛼∶ [𝑎, 𝑏] → [𝑐, 𝑑] is differentiable and its
derivative never vanishes.
2. Let 𝛼(𝑡) = (𝑒 𝑏𝑡 cos 𝑡, 𝑒 𝑏𝑡 sin 𝑡), where 𝑏 is a negative constant and 𝑡 ∈ R .
(a) Sketch the trace of 𝛼.
(b) Check that 𝛼 has finite length on [𝑡0 , +∞[ and compute it.
3. Let 𝛼(𝑡) = (𝑥 1 (𝑡), . . . , 𝑥 𝑛 (𝑡)), 𝑡 ∈ [𝑎, 𝑏] be a curve of class 𝐶 1 . Show that
(a) given 𝜀 > 0, there exists 𝛿 > 0 such that for all 1 ≤ 𝑘 ≤ 𝑛,
𝑥 𝑘 (𝑡) − 𝑥 𝑘 (𝑠)
∣𝑡 − 𝑠∣ < 𝛿 ⇒ ∣ − 𝑥 ′𝑘 (𝑡)∣ < 𝜀;
𝑡−𝑠
(b) for the 𝜀 and 𝛿 just obtained, if 𝑎 = 𝑡0 < 𝑡1 < ⋯ < 𝑡 𝑚 = 𝑏 is a partition of [𝑎, 𝑏]
such that 𝑡𝑖 − 𝑡 𝑖−1 < 𝛿 for all 1 ≤ 𝑖 ≤ 𝑚, then
𝑚 𝑚 √
∣∑ ∣𝛼(𝑡𝑖 ) − 𝛼(𝑡𝑖−1 )∣ − ∑ ∣𝛼′ (𝑡𝑖 )∣(𝑡 𝑖 − 𝑡 𝑖−1 )∣ < 𝑛(𝑏 − 𝑎)𝜀;
𝑖=1 𝑖=1
where 𝑡 0 < 𝑡 1 < ⋯ < 𝑡 𝑚 is some partition of [𝑎, 𝑏], is finite. We call this supremum
the length of 𝛼.
(a) Show that, for curves of class 𝐶 1 , this new definition of length is equivalent to
the previous one.
(b) Does a rectifiable curve necessarily have derivatives at all points?
(c) Is it true that the graph of any continuous and monotone function [𝑎, 𝑏] → R
is a rectifiable curve?
(d) Consider the Weierstrass example of a continuous function [0, 1] → R that is
nowhere differentiable (see [24], chapter 23, Theorem 5), and find out whether the
graph of this function is rectifiable.
In this section we only consider curves in three dimensional Euclidean space. In the
previous section we learned how to compute the length of a curve with the help of its
velocity or first derivative. This calculation does not exhaust the analysis of a curve,
since it tells us nothing about the shape it can take, and does not distinguish a line
from a circle. To proceed we will also have to take into account the second derivative.
Given a curve 𝛼∶ 𝐼 → R 3 , the tangent unit vector to 𝛼 is 𝝉(𝑡) = 𝑣(𝑡) 1
𝛼′ (𝑡). If
˜
𝛼(𝑟) = 𝛼 ○ ℎ(𝑟) is a reparametrization of 𝛼 then the unit vector tangent to 𝛼˜ is given
by 𝝉˜ = 𝝉 ○ ℎ. Hence, defining 𝑣˜ (𝑟) = ∣𝛼˜ ′ (𝑟)∣, we have
If we put
1
𝑘(𝑡) = ∣𝝉 ′ (𝑡)∣
𝑣(𝑡)
and denote by 𝑘˜ the analogous function for 𝛼,˜ then we see that 𝑘˜ = 𝑘 ○ ℎ. The quantity
𝑘(𝑡) is the curvature of 𝛼 at the point 𝛼(𝑡). The preceding calculations show that the
curvature does not depend on the way the curve is traversed but only on the point
at which it is computed, and does not change even when we reverse its orientation.
We can thus assume, whenever convenient, that the parameter of the curve is the arc
length. In this case we simply have 𝑘(𝑠) = ∣𝛼′′ (𝑠)∣.
A line, for example, has zero constant curvature: in fact, when parametrized with
constant scalar velocity, its second derivative vanishes. But the converse is also true,
since the condition 𝑘(𝑠) ≡ 0 implies that 𝛼′′ (𝑠) ≡ 0, which shows that there exist
constants 𝑝 and v such that 𝛼(𝑠) ≡ 𝑝 + 𝑠v.
Since 𝝉(𝑡) has constant norm, it is orthogonal to its derivative. In fact, by
differentiating the equality ⟨𝝉(𝑡), 𝝉(𝑡)⟩ = 1, we obtain 2⟨𝝉 ′ (𝑡), 𝝉(𝑡)⟩ = 0. Thus, when
6 1 Differentiable Curves
𝑘(𝑡) ≠ 0, the vector 𝝉 ′ (𝑡) points in a direction normal to the curve, the so-called
principal normal. In these cases we can define the unit vector
1
n(𝑡) = 𝝉 ′ (𝑡)
∣𝝉 ′ (𝑡)∣
The value 1/𝑘(𝑡) is the curvature radius at the point 𝛼(𝑡). The osculating plane is
the plane parallel to 𝝉(𝑡) and n(𝑡) that passes through 𝛼(𝑡).
The curvature measures the variation of the direction of the curve, but it does not
determine the form of the curve: both the circumference and the helix, for instance,
have constant curvature, that of the former being equal to the inverse of the radius;
and while the circumference is a planar curve, in the helix the osculating plane varies
from point to point. What we lack then is to measure the variation of the osculating
plane — or, to put it differently, how far a curve deviates from being planar.
Continuing to assume that 𝑘(𝑡) ≠ 0, the bi-normal vector b(𝑡) is defined as the
only vector such that (𝝉(𝑡), n(𝑡), b(𝑡)) is a direct orthonormal trihedron — that is,
an ordered triplet of unit vectors, orthogonal to each other, such that the 3 × 3-matrix
whose columns are these vectors in the same order has positive determinant. Even
simpler, we have b(𝑡) = 𝝉(𝑡) × n(𝑡), where × denotes the vector product on R 3 .
n
b
b
n
Figure 1.3
The trihedron (𝝉(𝑡), n(𝑡), b(𝑡)) is the Frenet trihedron. Each of the vectors of the
trihedron is orthogonal to its derivative, so that each derivative is expressible as a
linear combination of the other two vectors. We will next study the coefficients of
this linear combination. We have
1.2 Acceleration, Curvature and the Frenet Trihedron 7
b′ (𝑡) =
𝑑
(𝝉(𝑡) × n(𝑡))
𝑑𝑡
= 𝝉 ′ (𝑡) × n(𝑡) + 𝝉(𝑡) × n′ (𝑡)
= 𝝉(𝑡) × n′ (𝑡)
(because 𝝉 ′ and n are collinear for each 𝑡). From this equality it follows that b′ , which
we already knew to be orthogonal to b, is also orthogonal to 𝝉, and is therefore a
multiple of n. The torsion of 𝛼 at the point 𝛼(𝑡) is the value 𝜈(𝑡) defined by the
equality
1 ′
b (𝑡) = 𝜈(𝑡)n(𝑡);
𝑣(𝑡)
the factor 1/𝑣(𝑡) again ensures that the torsion is independent of the parameterization.
Note that torsion can assume negative values — although, like curvature, it does not
depend on the orientation of the curve.
Differentiating the product n(𝑡) = b(𝑡) × 𝝉(𝑡), and using the formulas seen above
for b′ (𝑡) and 𝝉 ′ (𝑡), we find that
1 ′
n (𝑡) = −𝑘(𝑡)𝝉(𝑡) − 𝜈(𝑡)b(𝑡).
𝑣(𝑡)
Proposition 1.2.1 Assume that the curvature of 𝛼 never vanishes. Then 𝛼 is a planar
curve if and only if it has zero constant torsion.
Proof If 𝛼 is all contained in the plane 𝑆, the vectors 𝝉(𝑡) and n(𝑡) are parallel
to 𝑆 and linearly independent, and the product b(𝑡) = 𝝉(𝑡) × n(𝑡) is a unit vector
orthogonal to 𝑆; therefore b(𝑡) can only take two distinct values and, since it varies
continuously, it must be constant — from which it follows that 𝜈 ≡ 0.
Conversely, if 𝜈 ≡ 0 then b(𝑡) is a nonzero constant vector b and for all 𝑡 holds
⟨b, 𝛼′ (𝑡)⟩ = 0. Therefore there exists a constant 𝑐 such that ⟨b, 𝛼(𝑡)⟩ ≡ 𝑐, and this
equality shows that the curve 𝛼 is planar. ◻
Let us now consider a curve 𝛼(𝑠), where the parameter 𝑠 ∈ [𝑎, 𝑏] is the arc
length, and whose curvature 𝑘(𝑠) never vanishes. We ask ourselves to what extent the
8 1 Differentiable Curves
functions 𝑘(𝑠) and 𝜈(𝑠) determine the curve 𝛼. They do not do so completely, since
any rigid movement of R 3 transforms 𝛼 into another curve with the same curvature
and torsion functions. (By rigid movement we mean the composition of a translation
with a linear mapping that transforms the canonical basis of R 3 into some direct
orthonormal trihedron.) But this is the only freedom:
Theorem 1.2.2 Let 𝛼, 𝛼∶ ˜ [𝑎, 𝑏] → R 3 be curves parametrized by arc length and with
nonzero curvature at all points. If we have 𝑘(𝑠) = 𝑘(𝑠)
˜ and 𝜈(𝑠) = 𝜈(𝑠)
˜ for all 𝑠 on
[𝑎, 𝑏], then there exists a rigid movement 𝐿∶ R → R 3 such that 𝛼 ≡ 𝐿 ○ 𝛼.
3
˜
Proof Let 𝐿 1 be the linear mapping that transforms the trihedron (𝝉(𝑎),˜ ñ(𝑎), b̃(𝑎))
into the trihedron (𝝉(𝑎), n(𝑎), b(𝑎)), 𝐿 2 the translation that maps 𝐿 1 (𝛼(𝑎))˜ into
𝛼(𝑎), and 𝐿 = 𝐿 2 ○ 𝐿 1 . Then the curve 𝛼0 = 𝐿 ○ 𝛼˜ has at the start time 𝑠 = 𝑎 the same
Frenet trihedron as 𝛼 and satisfies 𝛼0 (𝑎) = 𝛼(𝑎). Identifying by the subindex 0 the
vectors and quantities concerning the curve 𝛼0 , we define the function
1
𝛿(𝑠) = (∣𝝉(𝑠) − 𝝉 0 (𝑠)∣2 + ∣n(𝑠) − n0 (𝑠)∣2 + ∣b(𝑠) − b0 (𝑠)∣2 ) .
2
We know that 𝛿(𝑎) = 0; furthermore, we have
From this, using Frenet’s formulas and the fact that 𝑘 0 = 𝑘 and 𝜈0 = 𝜈, we easily
obtain 𝛿′ ≡ 0, therefore also 𝛿 ≡ 0. In particular we have 𝛼0′ = 𝛼′ — and, since
𝛼0 (𝑎) = 𝛼(𝑎), we conclude that 𝛼0 = 𝛼. ◻
Note 1.2.3 To complement Theorem 1.2.2, we will now show the following result:
for any differentiable functions 𝑘, 𝜈∶ [𝑎, 𝑏] → R (with 𝑘 strictly positive), there exists
a curve 𝛼(𝑠), parametrized by arc length, whose curvature and torsion functions are
precisely 𝑘(𝑠) and 𝜈(𝑠). We make use of the Theorem of Existence and uniqueness
of solutions of differential equations that we will state in Section 3.3; it deserves
mention that this approach provides another proof of Theorem 1.2.2. We keep the
above proof though as it is more elementary.
Once functions 𝑘(𝑠) and 𝜈(𝑠) are fixed, Frénet’s formulas can be viewed as
a non-autonomous equation of the form 𝑋 = v(𝑠, 𝑋), where 𝑋 = (𝝉, n, b) ∈ R 9
and where v∶ [𝑎, 𝑏] × R 9 → R 9 is differentiable. Take any direct orthonormal
trihedron (𝝉(𝑎), n(𝑎), b(𝑎)). Then there exists 𝜀 > 0 such that the solution 𝑋(𝑠) =
(𝝉(𝑠), n(𝑠), b(𝑠)) with this initial condition is defined for [𝑎, 𝑎 + 𝜀]. But since
𝑑
{∣𝝉∣2 + ∣n∣2 + ∣b∣2 } = 2 (⟨𝝉, 𝑘n⟩ + ⟨n, −𝑘𝝉 − 𝜈b⟩ + ⟨b, 𝜈n⟩) = 0,
𝑑𝑡
√
we see that 𝑋(𝑠) stays in the compact set {𝑋 ∈ R 9 ∶ ∣𝑋∣ = 3}, and is therefore defined
on the entire interval [𝑎, 𝑏] (see Theorem 3 on p. 17 of [23]). By differentiating, we
obtain the equalities
1.3 Planar Curves 9
– from which it follows that ⟨𝝉, n⟩, ⟨n, b⟩, ⟨𝝉, b⟩, ∣𝝉∣2 , ∣n∣2 and ∣b∣2 are constant func-
tions, equal to 0, 0, 0, 1, 1 and 1 respectively, since, as it is easily seen, these constants
constitute a solution, with the same initial condition, of the same differential equation
defined in R 6 . This proves that (𝝉(𝑠), n(𝑠), b(𝑠)) is an orthonormal, necessarily
direct trihedron for all 𝑠 ∈ [𝑎, 𝑏]. To conclude we take for 𝛼(𝑠) any primitive of
𝝉(𝑠), e.g. 𝛼(𝑠) = ∫𝑎 𝝉(𝑡) 𝑑𝑡: we verify without difficulty that (𝝉(𝑠), n(𝑠), b(𝑠)) is
𝑠
the Frenet trihedron of 𝛼(𝑠), and that 𝑘(𝑠) and 𝜈(𝑠) are its curvature and torsion. ◻
Exercise
5. Show that if we permit curves whose curvature vanishes at some point then
the conclusion of Theorem 1.2.2 holds — that is, there exists a pair of curves
˜ [𝑎, 𝑏] → R 3 parametrized by arc length such that their curvature and torsion
𝛼, 𝛼∶
functions coincide whenever they are defined, but which cannot be transformed into
each other by a rigid movement.
Hint: look for an example that also shows that in Proposition 1.2.1 the assumption
that the curvature is positive is essential.
We will deal in this section with planar curves, more specifically with curves in R 2 .
To simplify the calculations, we only consider curves parametrized by arc length: the
formulas we obtain are easily adapted to any other parameterizations.
Consider a curve 𝛼∶ 𝐼 → R 2 . Then there exists, for each 𝑠 ∈ 𝐼, a single vector n(𝑠)
such that the pair (𝝉(𝑠), n(𝑠)) forms a direct orthonormal or positively oriented
dihedron: if 𝝉(𝑠) = (𝑥 ′ (𝑠), 𝑦 ′ (𝑠)) then n(𝑠) = (−𝑦 ′ (𝑠), 𝑥 ′ (𝑠)). As before, we know
that the vectors 𝝉 ′ (𝑠) and 𝝉(𝑠) are orthogonal; but in this case we can conclude that
𝝉 ′ (𝑠) is a multiple of n(𝑠). The curvature of 𝛼 at the point 𝛼(𝑠) is the number 𝐾(𝑠)
such that 𝝉 ′ (𝑠) = 𝑘(𝑠)n(𝑠).
This curvature can take negative values and is therefore sometimes called signed
curvature; its absolute value is equal to the curvature defined in the previous section.
Whenever we talk about the curvature of a curve in R 2 we will be referring to the
signed curvature.
10 1 Differentiable Curves
𝝉 ′ = 𝑘n
n′ = −𝑘𝝉.
Since 𝝉(𝑠) is a unit vector, it describes the position vector of a point on S 1 (circle
with radius 1 centered at the origin). Denoting by 𝜑(𝑠) the oriented angle that 𝝉(𝑠)
makes with the positive part of the 𝑥-axis, we have 𝝉(𝑠) = (cos 𝜑(𝑠), sin 𝜑(𝑠)) and
n(𝑠) = (− sin 𝜑(𝑠), cos 𝜑(𝑠)). From this we obtain
Figure 1.4
Note 1.3.1 We must make one caveat: the value 𝜑(𝑠) of the angle is only determined
up to an integer multiple of 2𝜋, and it is not clear that we can make a choice for each
𝑠 such that the resulting function is differentiable or even continuous. The way to
solve this problem is to take advantage of the formula (1.1) to define 𝜑(𝑠): fixing
𝑠0 ∈ 𝐼 and a number 𝜑0 such that 𝝉(𝑠0 ) = (cos 𝜑0 , sin 𝜑0 ), we put
𝑠
𝜑(𝑠) = 𝜑0 + ∫ 𝑘(𝑡) 𝑑𝑡.
𝑠0
The function 𝜑 is obviously differentiable and all that remains to be shown is that
1
𝛿(𝑠) = ∣𝝉(𝑠) − (cos 𝜑(𝑠), sin 𝜑(𝑠))∣2
2
is identically zero. Let us put 𝝉(𝑠) = (𝑥 ′ (𝑠), 𝑦 ′ (𝑠)): by Frenet’s formulas, we have
𝑥 ′′ (𝑠) = −𝑘(𝑠)𝑦 ′ (𝑠) and 𝑦 ′′ (𝑠) = 𝑘(𝑠)𝑥 ′ (𝑠). Therefore
𝛿′ (𝑠) = ⟨𝝉 ′ (𝑠) − 𝜑′ (𝑠)(− sin 𝜑(𝑠), cos 𝜑(𝑠)), 𝝉(𝑠) − (cos 𝜑(𝑠), sin 𝜑(𝑠))⟩
= (𝑘(𝑠) − 𝜑′ (𝑠))(𝑦 ′ (𝑠) cos 𝜑(𝑠) − 𝑥 ′ (𝑠) sin 𝜑(𝑠)) = 0,
since 𝜑′ (𝑠) = 𝑘(𝑠). From this, and since 𝛿(𝑠0 ) = 0, we conclude that 𝛿 ≡ 0.
We further stress that any other continuous choice of the angle between 𝝉(𝑠)
and the 𝑥-axis is of the form 𝜑(𝑠) + 2𝑛𝜋, for some constant 𝑛 ∈ Z (since the
difference between two choices is continuous and only takes values in the discrete set
{2𝑛𝜋 ∶ 𝑛 ∈ Z }).
We end this section with a result that is the version for planar curves of Theo-
rem 1.2.2 and the note 1.2.3; its proof is kept as an exercise (it is possible to prove
directly, without using the theorem on solutions of differential equations, the existence
of 𝛼, and for uniqueness adapt the proof of Theorem 1.2.2)
Theorem 1.3.2 Given a differentiable function 𝑘∶ [𝑎, 𝑏] → R , there exists some curve
𝛼∶ [𝑎, 𝑏] → R 2 whose curvature at 𝛼(𝑠) is 𝑘(𝑠). Any other curve with the same
curvature function is the composite of 𝛼 with some rigid plane movement.
It follows in particular from Theorem 1.3.2 that the only planar curves with
nonzero constant curvature are circles or arcs of circumference. A direct proof of this
fact is certainly possible and is an exercise well worth to be done.
Exercises
6. (a) Consider a curve 𝛼(𝑡) = (𝑥(𝑡), 𝑦(𝑡)) not necessarily parametrized by arc length,
and put, as usual, 𝑣(𝑡) = ∣𝛼′ (𝑡)∣ and 𝝉(𝑡) = 𝑣(𝑡)
1
𝛼′ (𝑡). Prove each of the following
formulas for the curvature of 𝛼:
1 1 1
𝑘 = ⟨𝝉 ′ , n⟩ = 2 ⟨𝛼′′ n⟩ = 3 ⟨𝛼′′ , 𝑣n⟩
𝑣 𝑣 𝑣
𝑥 ′ 𝑦 ′′ − 𝑥 ′′ 𝑦 ′
= .
((𝑥 ′ )2 + (𝑦 ′ )2 )3/2
𝑥2 𝑦2
(b) Show that the curvature of the ellipse given by the equation 𝑎2
+ 𝑏2
= 1,
parametrized by 𝛼(𝑡) = (𝑎 cos 𝑡, 𝑏 sin 𝑡), is given by
𝑎𝑏
𝑘(𝑡) = 3/2
.
(𝑎 2 sin2 𝑡 + 𝑏 2 cos2 𝑡)
12 1 Differentiable Curves
8. Let 𝛼∶ ]𝑎, 𝑏[→ R 2 be a regular curve parametrized by arc length. Given 𝑠0 ∈ ]𝑎, 𝑏[
and 𝑝 ∈ R 2 ∖ {𝛼(𝑠0 )}, let 𝐶 be the circle with center 𝑝 and radius ∣𝛼(𝑠0 ) − 𝑝∣.
(a) Show that 𝐶 is tangent to 𝛼 at the point 𝛼(𝑠0 ) if and only if 𝑝 is a point of the
normal to 𝛼 at 𝛼(𝑠0 ).
(b) Assume 𝑝 = 𝛼(𝑠0 ) + 𝜆n(𝑠0 ) and consider the function 𝜌(𝑠) = ∣𝛼(𝑠) − 𝑝∣2 .
Show that if 𝜆𝑘(𝑠0 ) > 1 then 𝑠0 is a strict local maximum of 𝜌 and if 𝜆𝑘(𝑠0 ) < 1 then
𝑠0 is a strict local minimum.
(c) Let D ⊆ R 2 be a circle with radius 𝑅 such that 𝛼(]𝑎, 𝑏[) is contained in the
closed disk bounded by D. Conclude that at the instants 𝑠0 at which 𝛼(𝑠0 ) ∈ D, one
has ∣𝑘(𝑠0 )∣ ≥ 𝑅1 .
1
lim ∣𝛼(𝑠
˜ 0 + 𝑠) − 𝛼(𝑠0 + 𝑠)∣ = 0.
𝑠→0 𝑠 𝑛
1.4 Contact of Curves 13
Considering the Taylor polynomial expansion, it is easily seen that this condition is
equivalent to the condition that the derivatives of 𝛼 and 𝛼˜ up to order 𝑛 coincide at 𝑠0 .
In particular, since 𝛼′ (𝑠0 ) = 𝝉(𝑠0 ) and 𝛼′′ (𝑠0 ) = 𝑘(𝑠0 )n(𝑠0 ), we conclude that two
planar curves have second-order contact at a point 𝑝 if and only if they are tangent
to 𝑝 and had equal curvature there (of course when we speak of tangency here we
require that the velocity vectors of the two curves are identical — i.e., point in the
same direction); when the curvature is nonzero, this is equivalent to these two curves
having the same center of curvature at 𝑝. From this we conclude that the only circle
which has second-order contact with 𝛼 at a point of nonzero curvature is the one
with center at the center of curvature of 𝛼 at that point and radius equal to the radius
of curvature (of course there is no circle with second-order contact at a point of zero
curvature).
The definition of 𝑛-order contact we have given is perhaps not the most natural
nor the easiest to handle, since it depends on a special parameterization of the two
curves. To improve the situation, we start by defining Δ(𝑡) as the distance between
the points 𝛼(𝑠0 + 𝑠) and 𝛼(𝑠˜ 0 + 𝑠˜) given by the condition
Δ(𝑡) is thus the length of the line segment bounded by the intersections with 𝛼 and 𝛼˜
of the line orthogonal to 𝝉(𝑠0 ) and the (oriented) distance 𝑡 from 𝛼(𝑠0 ) (see Fig. 1.5).
~
(t, f(t))
D (t)
n (s0)
(t, f(t))
t (s0)
Figure 1.5
Proposition 1.4.1 The curves 𝛼 and 𝛼˜ have 𝑛-order contact at the point 𝛼(𝑠0 ) =
˜ 0 ) if and only if lim 𝑡1𝑛 Δ(𝑡) = 0.
𝛼(𝑠
𝑡→0
Proof We first assume that 𝛼 and 𝛼˜ have 𝑛-order contact — i.e., that 𝛼(𝑠0 ) = 𝛼(𝑠 ˜ 0)
and 𝛼(𝑖) (𝑠0 ) = 𝛼˜ (𝑖) (𝑠0 ) for 1 ≤ 𝑖 ≤ 𝑛, — and we define functions 𝑔 and 𝑔˜ by
Note that 𝑔(0) = 𝑔(0)˜ = 0 and 𝑔 (𝑖) (0) = 𝑔˜ (𝑖) (0) for 1 ≤ 𝑖 ≤ 𝑛, and that, since
the first derivative at 0 of these functions is nonzero, both have local inverses in a
neighborhood of 0, which we will denote by 𝑔 −1 and 𝑔˜ −1 . Let us now put
and note that, for 𝑡 near 0, one has Δ(𝑡) = ∣ 𝑓 (𝑡) − 𝑓˜(𝑡)∣ (see Fig. 1.5.). Moreover,
the first 𝑛 derivatives of 𝑓 and 𝑓˜ at 0 coincide, since at this point also the first 𝑛
derived from 𝐹 are identical to those from 𝐹, ˜ and those from 𝑔 −1 to those from 𝑔˜ −1 .
It follows, as desired, that lim 𝑡 𝑛 Δ(𝑡) = 0.
1
𝑡→0
We now prove the converse implication. To simplify the notation, we assume that
𝑠0 = 0, take 𝛼(0) = 𝛼(0)
˜ for the origin of the coordinates, and further assume that
the tangent and normal vectors at this point are (1, 0) and (0, 1) respectively.
With the above notation, let 𝛽(𝑡) = 𝛼 ○ 𝑔 −1 (𝑡), 𝛽(𝑡)
˜ = 𝛼˜ ○ 𝑔˜ −1 (𝑡): we then have
𝛽(𝑡) = (𝑡, 𝑓 (𝑡)) and 𝛽(𝑡)
˜ = (𝑡, 𝑓˜(𝑡)), and by hypothesis 𝑓 and 𝑓˜ are functions whose
derivatives up to the 𝑛th order coincide at 0. Since 𝛼 and 𝛼˜ are parametrized by arc
length, we have
𝑡 𝑡 √
𝑔 −1 (𝑡) = ∫ ∣𝛽′ (𝑟)∣ 𝑑𝑟 = ∫ 1 + [ 𝑓 ′ (𝑟)]2 𝑑𝑟,
0 0
𝑡 𝑡 √
𝑔˜ −1 (𝑡) = ∫ ∣ 𝛽˜′ (𝑟)∣ 𝑑𝑟 = ∫ 1 + [ 𝑓˜′ (𝑟)]2 𝑑𝑟,
0 0
and these formulas show that the derivatives of 𝑔 −1 and 𝑔˜ −1 at the point 0 coincide at
least to the same order as the derivatives of 𝑓 and 𝑓˜ at the same point — that is, to
the order 𝑛. Since 𝛼 = 𝛽 ○ 𝑔 and 𝛼˜ = 𝛽˜ ○ 𝑔, we conclude that 𝛼(𝑖) (0) = 𝛼˜ (𝑖) (0) for
1 ≤ 𝑖 ≤ 𝑛, which means that 𝛼 and 𝛼˜ have 𝑛-order contact in 𝛼(0) = 𝛼(0).˜
Exercises
10. Assume that two planar curves 𝛼 and 𝛼, ˜ not necessarily parametrized by arc
length, touch at the point 𝑝 = 𝛼(0) = 𝛼(0).
˜ Show that if lim 𝑡1𝑛 ∣𝛼(𝑡)
˜ − 𝛼(𝑡)∣ = 0, then
𝑡→0
the curves have 𝑛-order contact at 𝑝.
11. With the same terminology as in Proposition 1.4.1, but under the assumption
that 𝑠0 = 0 and 𝜑(𝑠0 ) = 0, show that 𝑓 ′ (𝑡) = 𝑡𝑔𝜑(𝑠) and 𝑓 ′′ (𝑡) = cos3 𝜑(𝑠) , where
𝑘(𝑠)
𝑠 = 𝑔 −1 (𝑡). Conclude that if 𝑘(0) > 0 then there exists 𝜀 > 0 such that, on an
appropriate coordinate system, the trace of 𝛼∣[−𝜀, 𝜀] is the graph of a convex function.
1.5 Convex Curves 15
12. Let 𝐼 be an open interval such that 𝑜 ∈ 𝐼 and let 𝛼∶ 𝐼 → R 3 be a curve parametrized
by arc length whose curvature at 0 is nonzero.
(a) Show that
1 1 1 1
𝛼(𝑠) = 𝛼(0) + (𝑠 − 𝑘 2 𝑠3 ) 𝝉 + ( 𝑘 𝑠2 + 𝑘 ′ 𝑠3 ) n − 𝑘 𝜈𝑠3 b + 𝑜(𝑠3 ),
6 2 6 6
where the quantities 𝑘, 𝜈 and the vectors 𝝉, n, b are computed at 0.
(b) Conclude that if 𝜈(0) > 0 then, when 𝑠 reaches the instant 0, the curve crosses
the osculating plane at 𝛼(0) from top to bottom (the “top part” is the one pointed to
by b(0)).
13. Using the notation and assumptions from Exercise 12, show that the plane
that contains the points 𝛼(0), 𝛼(ℎ0 ) and 𝛼(ℎ1 ), for ℎ1 < 0 < ℎ0 , converges to the
osculating plane at 𝛼(0) when ∣ℎ0 ∣ + ∣ℎ1 ∣ → 0.
Hint: Using the Taylor expansion — just up to the second order — show that
Continuing with planar curves parametrized by arc length, let us talk about simple
closed curves and characterize those that are convex. We say that a curve 𝛼∶ [𝑎, 𝑏] →
R 2 is closed if 𝛼(𝑎) = 𝛼(𝑏); if its periodic extension, defined by 𝛼(𝑠 + 𝑛(𝑏 − 𝑎)) =
𝛼(𝑠) for 𝑠 ∈ [𝑎, 𝑏] and 𝑛 ∈ Z , is differentiable (i.e., 𝐶 ∞ ), the curve is closed regular;
and if the curve has no self-intersections — that is, if its restriction on [𝑎, 𝑏[ is
injective — we say it is simple.
In this section all closed curves are regular; and, where necessary, we consider
them defined in R by periodic extension.
We recall from Section 1.3 that there is a continuous choice 𝜑(𝑠) of the angle that
the vector 𝝉(𝑠) makes with the positive part of the 𝑥-axis. Since 𝝉(𝑏) = 𝝉(𝑎), the
difference 𝜑(𝑏) − 𝜑(𝑎) is an integer multiple of 2𝜋, which, by note 1.3.1, does not
depend on the choice of 𝜑(𝑠). We call rotation index of the closed curve 𝛼 the integer
ℜ(𝛼) = 21𝜋 (𝜑(𝑏) − 𝜑(𝑎)); ℜ(𝛼) thus counts the number of turns that its tangent
vector 𝝉(𝑠) makes in the unit circle when the point 𝛼(𝑠) completes one turn around
the curve. Since 𝜑′ (𝑠) = 𝑘(𝑠), the rotation index can be given in integral form
1 𝑏
ℜ(𝛼) = ∫ 𝑘(𝑠) 𝑑𝑠.
2𝜋 𝑎
A closed curve 𝛼∶ [𝑎, 𝑏] → R 2 is called convex if, for every 𝑠0 ∈ [𝑎, 𝑏], the curve is all
on the same side of the tangent line to 𝛼 at the point 𝛼(𝑠0 ) — that is, if the function
16 1 Differentiable Curves
ℎ(𝑠) = ⟨𝛼(𝑠) − 𝛼(𝑠0 ), n(𝑠0 )⟩ does not change sign. Our next result characterizes
these curves.
Figure 1.6
Theorem 1.5.1 A closed curve is convex if and only if its curvature does not change
sign and its rotation index is ±1. Any convex curve is simple.
Proof (i) We begin by showing that any closed curve with non-negative curvature
at all its points and rotation index 1 is convex. Let 𝛼∶ [𝑎, 𝑏] → R 2 be such a curve.
Then the function 𝜑(𝑠) is non-decreasing and 𝜑(𝑏) = 𝜑(𝑎) + 2𝜋.
Given 𝑠0 ∈ [𝑎, 𝑏], we want to prove that the function ℎ(𝑠) = ⟨𝛼(𝑠) − 𝛼(𝑠0 ), n(𝑠0 )⟩
does not change sign. Otherwise ℎ(𝑠) reaches a positive maximum and a negative
minimum at points 𝑠1 , 𝑠2 ∈ [𝑎, 𝑏] ∖ {𝑠0 }, and at each of these points the tangent line
is parallel to the tangent line at 𝛼(𝑠0 ). Hence, there exist 𝑖 ≠ 𝑗 ∈ {0, 1, 2} such that
𝜑(𝑠𝑖 ) = 𝜑(𝑠 𝑗 ) and, 𝜑 being non-decreasing, this is only possible if 𝜑 is constant in
the interval between 𝑠𝑖 and 𝑠 𝑗 . This means that the curve contains the line segment
from 𝛼(𝑠𝑖 ) to 𝛼(𝑠 𝑗 ), and therefore the tangent lines at these points coincide — which
is absurd given the way they were chosen. Therefore ℎ does not change sign and 𝛼 is
convex.
(ii) We assume now and until the end of the proof that 𝛼∶ [𝑎, 𝑏] → R 2 is a convex
curve. Let us first see that 𝑘(𝑠) does not change sign. Consider the function of two
variables (𝑠, 𝑡) ∈ [𝑎, 𝑏] × [𝑎, 𝑏] defined by
By hypothesis, for each 𝑡 ∈ [𝑎, 𝑏] the function ℎ𝑡 given by ℎ𝑡 (𝑠) = 𝐻(𝑠, 𝑡) has constant
sign — that is, the restriction of 𝐻 to each horizontal line segment [𝑎, 𝑏]×{𝑡} is either
non-negative or non-positive. What we want for now is to prove that the function 𝐻
itself does not change sign.
Assume, instead, that there exist (𝑠0 , 𝑡 0 ), (𝑠1 , 𝑡 1 ) ∈ [𝑎, 𝑏] × [𝑎, 𝑏] such that
𝐻(𝑠0 , 𝑡 0 ) < 0 < 𝐻(𝑠1 , 𝑡 1 ); and let us agree that 𝑡0 < 𝑡 1 . Consider the set
and therefore 𝐴 contains the interval [𝑡0 , 𝑡 0 + 𝛿]. Since 𝑡1 ∉ 𝐴, 𝑡2 = sup(𝐴) lies in
]𝑡 0 , 𝑡 1 [. Since 𝑡 1 ∈ 𝐴, we have 𝐻(𝑠, 𝑡 1 ) ≤ 0 for all 𝑠 ∈ [𝑎, 𝑏]. If for all 𝑠 this inequality
were strict, then the same argument as above would show that there exists 𝛿 > 0 such
that [𝑡 2 , 𝑡 2 + 𝛿] ⊂ 𝐴, which contradicts the definition of 𝑡 2 . We then conclude that
𝐻(𝑠, 𝑡 2 ) = 0 for all 𝑠 ∈ [𝑎, 𝑏] — which is absurd because it means that the curve 𝛼 is
all contained in a straight line.
So we have proved that 𝐻 does not change sign. That the curvature does not either
is now immediate: when 𝑘(𝑡0 ) > 0, the function ℎ𝑡0 (𝑠) has a strict local minimum for
𝑠 = 𝑡 0 (since ℎ′𝑡0 (𝑡0 ) = 0 and ℎ′′𝑡0 (𝑡0 ) = 𝑘(𝑡 0 )) and therefore 𝐻(𝑠, 𝑡 0 ) > 0 for 𝑠 near 𝑡 0 ;
and, conversely, when 𝑘(𝑡0 ) < 0 one has 𝐻(𝑠, 𝑡 0 ) < 0 for 𝑠 near 𝑡0 .
(iii) Let us now prove that 𝛼 is simple. Let us assume, to the contrary, that it has
some self-intersection which, changing if necessary the initial point, we suppose to
take place at 𝛼(𝑎). Then there exists 𝑥 ∈ ]𝑎, 𝑏[ such that 𝛼(𝑐) = 𝛼(𝑎).
The function 𝐻(𝑠, 𝑡) vanishes for (𝑠, 𝑡) = (𝑎, 𝑐) and, by (ii), reaches at that point
a local extremum. Thus ⟨𝝉(𝑎), n(𝑐)⟩ = 𝜕𝐻 𝜕𝑠 ∣ = 0, and therefore 𝝉(𝑎) = ±𝝉(𝑐). If
(𝑎,𝑐)
it were 𝝉(𝑎) = −𝝉(𝑐), we would have 𝐻(𝑠, 𝑎) = −𝐻(𝑠, 𝑐) for all 𝑠 ∈ [𝑎, 𝑏], which is
impossible by (ii). Therefore we have 𝝉(𝑎) = 𝝉(𝑐).
To simplify the notation, we assume that 𝛼(𝑎) = 𝛼(𝑐) = (0, 0). Let us put, (as in
the proof of 1.4.1) for 𝑠 ≥ 0,
– where the functions 𝑓 and 𝑓˜ are defined on some interval [0, 𝜀], 𝜀 > 0. The graphs
of 𝑓 and 𝑓˜ are portions of the trace of 𝛼: in fact, putting 𝑠 = 𝑔 −1 (𝑡), 𝑠˜ = 𝑔˜ −1 (𝑡), we
can write
In each of these products, and since n(𝑎) = n(𝑐), the second factor is positive for 𝑠,
𝑠˜ sufficiently small; hence, if it were 𝑓 (𝑡) ≠ 𝑓˜(𝑡) for some 𝑡 ∈ [0, 𝜀], 𝐻(𝑎 + 𝑠, 𝑐 + 𝑠˜)
and 𝐻(𝑐 + 𝑠˜, 𝑎 + 𝑠) would have opposite signs, in contradiction to (ii). We thus have
𝑓 (𝑡) = 𝑓˜(𝑡) for all 𝑡 ∈ [0, 𝜀] — and from this, since 𝛼 is parametrized by arc length,
we conclude that there exists 𝛿 > 0 such that 𝛼(𝑎 + 𝑠) = 𝛼(𝑐 + 𝑠) for all 𝑠 ∈ [0, 𝛿]. A
18 1 Differentiable Curves
trivial argument now proves that for all 𝑠 ≥ 0 one has 𝛼(𝑎 + 𝑠) = 𝛼(𝑐 + 𝑠), and this
says that when the curve returns to the starting point, it repeats the same path from
then on. The given hypothesis thus leads us to conclude that 𝛼∣[𝑎,𝑏] gives more than
one turn to the same closed curve. Assuming this does not happen, such a 𝑐 ∈ ]𝑎, 𝑏[
with 𝛼(𝑐) = 𝛼(𝑎) does not exist and the curve is simple.
(iv) Let us now show that ℜ(𝛼) = ±1. Assuming that 𝛼 is positively oriented,
and putting 𝜑(𝑏) = 𝜑(𝑎) + 2𝑛𝜋, we have 𝑛 = ℜ(𝛼) ≥ 1, and we want to see that
𝑛 = 1. We can assume, without loss of generality, that 𝑘(𝑎) > 0. Take 𝑐 ∈ [𝑎, 𝑏]
such that 𝜑(𝑐) = 𝜑(𝑎) + 2𝜋: then 𝐻(𝑎, 𝑐) = −𝐻(𝑐, 𝑎), and it follows by (ii) that
𝐻(𝑎, 𝑐) = 0, which means that the tangent lines at 𝛼(𝑎) and 𝛼(𝑐) coincide. The
function 𝜆(𝑠) = 𝐻(𝑐, 𝑎 + 𝑠) reaches a minimum at 0, and so
whence it follows, since 𝑘(𝑎) > 0 and the points 𝛼(𝑐) and 𝛼(𝑎) lie on a straight line
parallel to 𝝉(𝑎), that 𝛼(𝑐) = 𝛼(𝑎). Since 𝛼 is simple, we must have be 𝑐 = 𝑏 and
therefore ℜ(𝛼) = 1. ◻
It is important to note that any simple curve, whether convex or not, has rotation
index ±1: this is what the rotation index theorem says, the proof of which we give in
the Appendix to Chap. 4, but of which a special case is given in Exercise 16 below. If
we already had this result, the proof of 1.5.1 would be somewhat simplified; another
simplification would be to suppress step (iii) if, as some authors do, we already
required in the definition that a convex curve be simple.
One result we will not prove, but of which we will make important use, not always
explicit, is the Jordan curve theorem. This theorem states that any simple closed
curve divides the plane into two disjoint connected open subsets of which it is a
common boundary. (For a proof of the theorem in the differentiable case, and its
generalization to higher dimensions, see [15]; for the topological version we suggest
[17], which also includes a proof of Schönflies’ theorem: the region bounded by a
simple closed curve is homeomorphic to an open disc).
To finish this section we mention that a convex curve of nonzero curvature at all
its points is usually called strictly convex. In this case 𝜑(𝑠) is strictly monotone and
therefore every tangent line touches the curve at a single point.
Exercises
14. Show that if a line intersects a closed convex curve then one and only one of the
following cases occurs: either the line is tangent to the curve, or it intersects the curve
at exactly two points.
15. Let 𝛼 be a closed, simple, regular curve, Ω be the open set bounded by 𝛼, and
Ω = Ω ∪ 𝛼 be the closure of 𝛼. Show that the following conditions are equivalent:
(Suggestion for (i) ⇒ (ii): Show that for each 𝑠0 , the image of the function 𝑠
(𝑠 ≠ 𝑠0 ) ↦ (𝛼(𝑠) − 𝛼(𝑠0 ))/∣𝛼(𝑠) − 𝛼(𝑠0 )∣ is contained in a semicircle.)
16. (a) Let H∶ [0, 1] × [𝑎, 𝑏] → R 2 be a differentiable mapping such that every
𝛼𝑠 = H(𝑠, ⋅ ) is a regular closed curve. Show that ℜ(𝛼𝑠 ) is constant. (Exercise 7 may
be helpful.)
(b) Let 𝛼∶ [𝑎, 𝑏]→R 2 be a regular closed curve and 𝑝 ∈ R 2 ∖ 𝛼([𝑎, 𝑏]) such that
each half-line with origin at 𝑝 intersects 𝛼 exactly once, and this intersection is
transverse (i.e., the half-line is not tangent to 𝛼 at the point of intersection). Prove
that ℜ(𝛼) = ±1.
(c) Now assume only that all intersections of 𝛼 with half-lines 𝑟 of origin 𝑝 are
transversal. Show that the cardinal of the set 𝑇(𝑟) = {𝑡 ∈ [𝑎, 𝑏[∶ 𝛼(𝑡) ∈ 𝑟} is the same
for all such half-lines.
In this section, we explore the varying width of a planar curve. The width of the curve
in any given direction is the narrowest distance between two lines perpendicular to
that direction that can contain the curve. This means that the width of a curve is not
necessarily the same in all directions. Remarkably, besides the circle, there are other
convex curves that have a constant width regardless of direction; and the perimeter of
such curves is equal to that of the circle of the same width.
We will deal in this section with the width of a planar curve. The width of a curve
in a given direction is the minimal width among the strips that contain the curve and
are bounded by lines orthogonal to that direction.
Given a closed curve 𝛼∶ [𝑎, 𝑏] → R 𝑛 we define, for v ∈ S1 , ℎ(v) = max ⟨𝛼(𝑠), v⟩.
𝑎≤𝑠≤𝑏
Since the maximum of ⟨𝛼(𝑠), v⟩ is only reached at points 𝑠 such that ⟨𝝉(𝑠), v⟩ = 0,
ℎ(v) is the maximum among the (oriented) distances from the origin to the tangent
lines to 𝛼 that are orthogonal to v. The width of 𝛼 in the direction of v is L(v) =
ℎ(v) + ℎ(−v).
If 𝛼 is a convex curve, then there are exactly two tangent lines to 𝛼 that are
orthogonal to v (although each of them may be tangent to 𝛼 at more than one point),
and L(v) is the distance between these lines (see Fig. 1.7). For example, the width of
a circle is, in all directions, equal to its diameter.
Proposition 1.6.1 In any closed curve the diameter and maximum width are equal.
20 1 Differentiable Curves
L( v ) v
Figure 1.7
1.7
Proof We denote the diameter of 𝛼 by 𝐷 = max{∣𝛼(𝑠) − 𝛼(𝑡)∣ ∶ 𝑠, 𝑡 ∈ [𝑎, 𝑏], and by
𝔏 = max{L(v) ∶ v ∈ S1 } the maximum width of 𝛼.
Let us check that 𝐷 ≤ 𝔏. Consider the function D(𝑠, 𝑡) = ∣𝛼(𝑠) − 𝛼(𝑡)∣. This
function reaches its maximum 𝐷 on a pair of points (𝑠0 , 𝑡 0 ) such that
𝜕D 𝜕D
∣ = ∣ = 0,
𝜕𝑠 (𝑠0 ,𝑡0 ) 𝜕𝑡 (𝑠0 ,𝑡0 )
This shows that the tangent lines at 𝛼(𝑠0 ) and 𝛼(𝑡 0 ) are parallel, both being orthogonal
to the line segment joining 𝛼(𝑠0 ) to 𝛼(𝑡 0 ). Moreover the curve is completely contained
in the strip between these tangent lines, otherwise the maximum distance between
distinct points of 𝛼 would exceed 𝐷. Thus the width of the strip, which is 𝐷, is also
equal to L(n(𝑠0 )), and thus 𝐷 ≤ 𝔏.
Let us now deal with the opposite inequality. Given v ∈ S1 , let 𝑠0 , 𝑡 0 ∈ [𝑎, 𝑏] be
such that ℎ(v) = ⟨𝛼(𝑠0 ), v)⟩ and ℎ(−v) = ⟨𝛼(𝑡0 ), −v⟩. Then
and therefore 𝐷 ≥ L(v). Since this inequality holds for all v, it follows that 𝐷 ≥ 𝔏. ◻
Let us now assume that the curve 𝛼 is convex and has constant width 𝔏. By 1.6.1,
also its diameter is equal to 𝔏. Let us now see that this diameter is realized by many
pairs of points on the curve.
Fixing 𝑠0 ∈ [𝑎, 𝑏], let 𝛼(𝑠1 ) be a point such that 𝝉(𝑠1 ) = −𝝉(𝑠0 ). We know from
the analysis done in Section 1.5 that the curve is contained in the strip bounded by
tangent lines at 𝛼(𝑠0 ) and 𝛼(𝑠1 ). Hence these tangent lines are at a distance 𝔏 from
each other, and therefore ∣𝛼(𝑠0 ) − 𝛼(𝑠1 )∣ ≥ 𝔏. Since the diameter of 𝛼 is 𝔏, it must
be ∣𝛼(𝑠0 ) − 𝛼(𝑠1 )∣ = 𝔏, an equality that is only possible if the line segment between
1.6 Curves of Constant Width 21
𝛼(𝑠0 ) and 𝛼(𝑥 1 ) is orthogonal to the tangent lines to 𝛼 at these points. Furthermore,
there is no other point 𝛼(̃ 𝑠1 ) such that ∣𝛼(𝑠0 ) − 𝛼(𝑠˜1 )∣ = 𝔏, for the proof of 1.6.1
shows that 𝛼(̃ 𝑠1 ) would also be on the normal to 𝛼 at 𝛼(𝑠0 ).
We conclude that for every point 𝑝 of a convex curve of constant width 𝔏, there is
a single point 𝑝̃ of the curve at the maximum distance 𝔏 from 𝑝, and 𝑝̃ is situated
on the normal to 𝛼 at 𝑝. We call this point 𝑝̃ the antipode of 𝑝. Assuming that the
curve is positively oriented (and therefore has non-negative curvature at all points),
our conclusion translates into 𝑝̃ = 𝑝 + 𝔏n(𝑝). We stress that “being antipodal to” is a
reflexive relation, and that two tangent lines to 𝛼 that are parallel and distinct meet 𝛼
at points that are antipodes of each other.
Consider now the circle C with center 𝑝̃ and radius 𝔏. Such a circle is tangent to
𝛼 at the point 𝑝; and all other points of 𝛼 are contained in the interior of the disk
bounded by C. Exercise 8 then says that the absolute value of the curvature of 𝛼 at 𝑝
is greater than or equal to 1/𝔏. We thus conclude that any convex curve of constant
width is strictly convex.
Example 1.6.2 At this point it is good to wonder about the existence of constant-width
convex curves that are not circles. The above discussion suggests that such a curve is
determined by knowing the arc between two antipodal points 𝑝 and 𝑝̃: the remaining
segment is found by marking, from each point of this arc, a distance of 𝔏 along the
normal.
a(t-c)
(-1,0) (1,0)
b (t)
Figure 1.8
Proof Assume that a certain function 𝑓 satisfies the equation 𝜑 ○ 𝑓 (𝑠) = 𝜑(𝑠) + 𝜋.
Then 𝝉( 𝑓 (𝑠)) = (cos(𝜑 ○ 𝑓 (𝑠)), sin((𝜑 ○ 𝑓 (𝑠))) = −𝝉(𝑠), and therefore the points
𝛼(𝑠) and 𝛼( 𝑓 (𝑠)) are antipodes of each other — that is, 𝛼 ̃(𝑠) = 𝛼( 𝑓 (𝑠)) just as we
intend.
This means that we just have to find 𝑓 such that 𝜑○ 𝑓 (𝑠) = 𝜑(𝑠)+𝜋. Such a function
is given by 𝑓 (𝑠) = 𝜑−1 (𝜑(𝑠) + 𝜋), which is differentiable and has positive derivative.
Furthermore, we have 𝑓 (𝑠+𝐿) = 𝜑−1 (𝜑(𝑠+𝐿)+𝜋) = 𝜑−1 ({𝜑(𝑠)+𝜋}+2𝜋) = 𝑓 (𝑠)+𝐿,
as we want. ◻
We now finish the proof of 1.6.3. Differentiating the equality 𝛼(𝑠) + 𝔏n(𝑠) =
𝛼( 𝑓 (𝑠)), we obtain {1−𝔏𝑘(𝑠)} 𝝉(𝑠) = 𝑓 ′ (𝑠)𝝉( 𝑓 (𝑠)) — and from this, as 𝝉( 𝑓 (𝑠)) =
−𝝉(𝑠), yields 𝑓 ′ (𝑠) = −1 + 𝔏𝑘(𝑠). Finally, using 1.6.4, and since the rotation index
of 𝛼 is 1, we have
𝐿 𝐿
𝐿 = 𝑓 (𝐿) − 𝑓 (0) = ∫ 𝑓 ′ (𝑠) 𝑑𝑠 = −𝐿 + 𝔏 ∫ 𝑘(𝑠) 𝑑𝑠 = −𝐿 + 2𝜋𝔏.
0 0
Note 1.6.5 We cannot omit the simplest example of a non-circular curve of constant
width: Reuleaux’s triangle, which is formed by three arcs, each centered at one of the
vertices of an equilateral triangle ABC and radius equal to the side of the triangle. Its
perimeter 𝐿 and width 𝔏 are also related by 𝐿 = 𝜋𝔏, but the proof of 1.6.3 does not
⌢ ⌢ ⌢
cover this case: the antipode of each point of the arc 𝐵𝐶 (resp. 𝐶 𝐴, 𝐴𝐵) is the point
𝐴 (resp. 𝐵, 𝐶).
C A
Figure 1.9
constant width 𝔏 + 2𝑑. In Fig. 1.9 we show a curve parallel to the Reuleaux triangle.
Each “corner” of 𝛼 is replaced by an arc of a circle at 𝛼𝑑 . Therefore 𝛼𝑑 already has a
well-defined tangent vector 𝝉 𝑑 (𝑠) at each point; the angle 𝜑 𝑑 (𝑠) that 𝝉 𝑑 (𝑠) makes
with the 𝑥-axis is a continuous, strictly increasing function, which is piecewise 𝐶 1 ;
and the antipode mapping is already a bijection of the curve onto itself. The proof of
1.6.3 can easily be adapted to show that the perimeter of 𝛼𝑑 is
There are numerous results on constant width curves: [7] contains a careful
discussion of the topic (and its generalization to higher dimensions) and an extensive
bibliography. In Section 5.5 we give some results on constant width surfaces.
Exercises
17. (a) Check that the function 𝑓 (𝑥) = 𝑒 −1/𝑥(1−𝑥) if 0 < 𝑥 < 1, 𝑓 (𝑥) = 0 otherwise it
𝑥 1
is 𝐶 ∞ , and that 𝐹(𝑥) = ∫ 𝑓 (𝑡) 𝑑𝑡/ ∫ 𝑓 (𝑡) 𝑑𝑡 satisfies the conditions: 𝐹(𝑥) = 0
0 0
for 𝑥 < 0, 𝐹 is strictly increasing on [0, 1], 𝐹(𝑥) = 1 for 𝑥 ≥ 1.
(b) Given 𝑎 < 𝑏 and 𝑦 1 , 𝑦 2 ∈ R , show that there exists a nonconstant 𝑔∶ R → R of
class 𝐶 ∞ such that 𝑔(𝑥) = 𝑦 1 ∀ 𝑥 ∈ [−∞, 𝑎], and 𝑔(𝑥) = 𝑦 2 ∀ 𝑥 ∈ [𝑏, +∞[.
18. Let 𝛼∶ [𝑎, 𝑏] → R 2 be a closed curve and L(v) be the corresponding “width
function”. Show that:
(a) L is continuous and therefore there exists max L(v);
v∈S
(b) if 𝛼 is regular and strictly convex then L is differentiable (i.e., the function
𝜃 → L(cos 𝜃, sin 𝜃) is differentiable).
19. Show that the function of Lemma 1.6.4 is unique but for the addition of a constant.
(c) if each pair of antipodal points divide 𝛼 into two arcs of equal length then 𝛼 is a
circle.
21. Convex curves of constant width 𝔏 are characterized by the fact that all the
rectangles that circumscribe them are squares of side 𝔏. In this exercise we prove a
generalization of Barbier’s Theorem: if 𝛼 is a regular curve strictly convex such that
all rectangles that surround it have perimeter 4𝔏, then the perimeter of 𝛼 is 𝜋𝔏.
1.7 Theorem of the Four Vertices 25
Using the same notation as in the proof of 1.6.3, denote by 𝐿 the perimeter of the
curve and by 𝛼 ̃(𝑠) the only point of 𝛼 at which the tangent vector is −𝝉(𝑠). Show
that:
̃(𝑠) = 𝛼( 𝑓 (𝑠)) and
(a) there exists a differentiable function 𝑓 ∶ R → R such that 𝛼
𝑓 (𝑠 + 𝐿) = 𝑓 (𝑠) + 𝐿;
(b) there exist differentiable functions 𝜆, 𝜂 such that
where 𝜂(𝜃) denotes 𝜂(𝜑−1 (𝜃)). Note that 𝜂(𝜃) is periodic of period 2𝜋 and that the
𝜋
assumption about 𝛼 translates to 𝜂(𝜃) + 𝜂 (𝜃 + ) = 2𝔏. Finally, we can write
2
2𝜋 2𝜋 𝜋
4𝐿 = ∫ 𝜂(𝜃) 𝑑𝜃 + ∫ 𝜂 (𝜃 + ) 𝑑𝜃
0 0 2
2𝜋 𝜋
=∫ {𝜂(𝜃) + 𝜂 (𝜃 + )} 𝑑𝜃 = 4𝜋𝔏.
0 2
22. Modify 1.6.2 to give examples of curves that satisfy the hypothesis of Exercise 21
but do not have constant width.
We now give the four-vertex theorem, which states that the curvature of any closed
convex curve has at least four critical points (this result is also valid for nonconvex
closed planar curves, but we will not prove it in such generality). This result is best
possible: a non-circular ellipse has exactly four vertices, which are its points of
intersection with the axes (see Exercise 6).
Let 𝛼∶ [𝑎, 𝑏] → R 2 be a regular closed curve, and 𝑘(𝑠) its curvature function. A
vertex of 𝛼 is a point 𝛼(𝑠0 ) such that 𝑘 ′ (𝑠0 ) = 0. This definition does not depend on
the parameterization and so we assume that 𝑠 is the arc length.
Theorem 1.7.1 Any closed convex curve has at least four vertices.
Proof We can assume that 𝑘(𝑠) has a finite number of critical points, because
otherwise there is nothing to be shown. The function 𝑘(𝑠) attains some maximum
26 1 Differentiable Curves
and some minimum — which, changing the starting point if necessary, we suppose
happen at the points 𝑠 = 𝑎 and 𝑠 = 𝑠0 ∈]𝑎, 𝑏[. By applying a rotation or translation we
can ensure that both points 𝛼(𝑎) and 𝛼(𝑠0 ) are on the 𝑥-axis.
Let us check that there are no other points of 𝛼 on the 𝑥-axis: for if there were
another one — let it be 𝑝 — then the tangent line to 𝛼 at the one of the three points
𝛼(𝑎), 𝛼(𝑠0 ) and 𝑝 which lies between the other two is the 𝑥-axis; otherwise there
would be points of 𝛼 on opposite sides of this tangent line, in contradiction to the
convexity of 𝛼. It follows that the tangent line at 𝛼(𝑎) and 𝛼(𝑠0 ) is also the horizontal
axis, and that (as in the proof of 1.5.1) the trace of 𝛼∣[𝑎,𝑠 ] is a line segment. This
0
however contradicts our assumption that 𝛼 has a finite number of vertices.
Putting 𝛼(𝑠) = (𝑥(𝑠), 𝑦(𝑠)), we then have that 𝑦(𝑠) never vanishes on the
intervals ]𝑎, 𝑠0 [ and ]𝑠0 , 𝑏[, taking on the second interval a sign opposite to the
one it takes on the first; and the same is true of 𝑘 ′ (𝑠) if we assume that 𝛼 has at
most two vertices. Under this assumption the function 𝑘 ′ (𝑠)𝑦(𝑠) then has constant
𝑏
sign, vanishing only at 𝑎, 𝑥0 and 𝑏, and so ∫ 𝑘 ′ (𝑠)𝑦(𝑠) 𝑑𝑠 ≠ 0. But, writing
𝑎
(𝑥 ′ (𝑠), 𝑦 ′ (𝑠)) = (cos 𝜑(𝑠), sin 𝜑(𝑠)), and using integration by parts and the equality
𝜑′ (𝑠) = 𝑘(𝑠), we have
𝑏 𝑏 𝑏
𝑘 ′ (𝑠)𝑦(𝑠) 𝑑𝑠 = 𝑘(𝑠)𝑦(𝑠)∣𝑎 − ∫ 𝑘(𝑠)𝑦 ′ (𝑠) 𝑑𝑠 = − ∫ 𝑘(𝑠)𝑦 ′ (𝑠) 𝑑𝑠
𝑏
∫
𝑎 𝑎 𝑎
𝑏
𝜑′ (𝑠) sin 𝜑(𝑠) 𝑑𝑠 = cos 𝜑(𝑠)∣𝑎 = 0.
𝑏
= −∫
𝑎
This contradiction shows that 𝑘 ′ (𝑠) changes sign on some intervals ]𝑎, 𝑠0 [ and ]𝑠0 , 𝑏[.
Since in each of them 𝑘 ′ (𝑠) has the same sign near the endpoints, we conclude that
𝑘 ′ (𝑠) changes sign at least twice in such an interval, which proves the theorem. ◻
The four-vertex theorem is still valid for non-convex curves. For a very elegant
geometric proof that also covers this generalization, we suggest [21].
The isoperimetric inequality states that, among all planar curves with a given perimeter,
the circumference encompasses the largest area. The proof we give (by A. Hurwitz,
1902) makes essential use of the theory of Fourier series (see [11] for an introduction
to this theory). An elementary proof appears in [18], and [9] contains a generalization
of the isoperimetric inequality for convex bodies in dimensions greater than two.
and equality holds if and only if there exist 𝑎 and 𝑏 such that 𝑓 (𝑡) = 𝑎 cos 𝑡 + 𝑏 sin 𝑡.
Proof Be
𝑎0 ∞
𝑓 (𝑡) ∼ + ∑ (𝑎 𝑛 cos 𝑛𝑡 + 𝑏 𝑛 sin 𝑛𝑡)
2 𝑛=1
the Fourier series expansion of 𝑓 . Since 𝑓 ′ (𝑡) is continuous, its expansion is obtained
from that of 𝑓 (𝑡) by term-by-term differentiation, thus
∞
𝑓 ′ (𝑡) ∼ ∑ (𝑛 𝑏 𝑛 cos 𝑛𝑡 − 𝑛 𝑎 𝑛 sin 𝑛𝑡).
𝑛=1
2𝜋
Since ∫0 𝑓 (𝑡) 𝑑𝑡 = 𝜋 𝑎 0 , our hypothesis yields 𝑎 0 = 0. Using Parseval’s formula,
we have
2𝜋 ∞
1 2
∫ 𝑓 (𝑡) 𝑑𝑡 = ∑ (𝑎 2𝑛 + 𝑏 2𝑛 ),
𝜋 0 𝑛=1
2𝜋 ∞
1
𝑓 ′ (𝑡) 𝑑𝑡 = ∑ 𝑛2 (𝑎 2𝑛 + 𝑏 2𝑛 ).
2
∫
𝜋 0 𝑛=1
It follows that
2𝜋 2𝜋 ∞
𝑓 ′ (𝑡) 𝑑𝑡 − ∫
2 2
∫ 𝑓 (𝑡) 𝑑𝑡 = ∑ 𝜋(𝑛2 − 1)(𝑎 2𝑛 + 𝑏 2𝑛 ) ≥ 0,
0 0 𝑛=1
and equality only holds if 𝑎 𝑛 = 𝑏 𝑛 = 0 for all 𝑛 > 1. Since continuous functions are
determined by their Fourier expansion, this is equivalent to 𝑓 (𝑡) = 𝑎 1 cos 𝑡 + 𝑏 1 sin 𝑡.
◻
Theorem 1.8.2 (Isoperimetric inequality) Let 𝛼 be a simple regular closed curve of
perimeter 𝐿, bounding a region Ω of area 𝐴. Then
𝐿2
𝐴≤ ,
4𝜋
and equality holds only when 𝛼 is a circle.
We can rescale the figure using a homothety, so there is no loss of generality
if we suppose that 𝐿 = 2𝜋, and therefore 𝛼(𝑠) = (𝑥(𝑠), 𝑦(𝑠)), 𝑠 ∈ [0, 2𝜋]. With a
2𝜋
translation, we can achieve ∫0 𝑥(𝑠) 𝑑𝑠 = 0. Furthermore, we assume that 𝛼(𝑠) runs
through the boundary of Ω in the counterclockwise direction. Applying Green’s
theorem
𝜕𝑄 𝜕𝑃
(∫ 𝑃 𝑑𝑥 + 𝑄 𝑑𝑦 = ∬ ( − ) 𝑑𝑥𝑑𝑦)
𝜕Ω Ω 𝜕𝑥 𝜕𝑦
to the vector field (𝑃, 𝑄) = (0, 𝑥), we obtain
2𝜋
𝐴=∫ 𝑥𝑦 ′ 𝑑𝑠,
0
28 1 Differentiable Curves
The second integral of this sum is non-negative and, by Wirtinger’s lemma, so is the
first. We thus conclude, as desired, that 𝐴 ≤ 𝜋. To achieve equality both integrals
have to be zero, so 𝑦 ′ (𝑠) = 𝑥(𝑠) and 𝑥(𝑠) = 𝑎 cos 𝑠 + 𝑏 sin 𝑠. We thus have
Exercises
of a periodic function of period 2𝜋, integrable on [0, 2𝜋] (a class that includes
bounded functions with a finite number of discontinuities), are defined by
1 2𝜋
𝑎0 = ∫ 𝑓 (𝑡) 𝑑𝑡,
𝜋 0
1 2𝜋
𝑎𝑛 = ∫ 𝑓 (𝑡) cos 𝑛𝑡 𝑑𝑡,
𝜋 0
1 2𝜋
𝑏𝑛 = ∫ 𝑓 (𝑡) sin 𝑛𝑡 𝑑𝑡.
𝜋 0
Show that:
+∞ +∞
(a) if 𝑓 is of class 𝐶 1 and (𝑎 ′𝑛 )𝑛=0 and (𝑏 ′𝑛 )𝑛=1 are the Fourier coefficients of 𝑓 ′ ,
then 𝑎 ′0 = 0 and 𝑎 ′𝑛 = 𝑛 𝑏 𝑛 and 𝑏 ′𝑛 = −𝑛 𝑎 𝑛 for 𝑛 ≥ 1 (use integration by parts);
Figure 1.10
24. Consider a straight line 𝑟 in the plane and a flexible string C of length 𝐿. By
placing C in the plane so that its ends are on 𝑟, we obtain a figure bounded by 𝑟 and by
C and whose area depends on the shape we give the string (see figure above). Show
that the figure of maximum area among all those so obtained is a semicircle based on
𝑟.
25. Given two points 𝑝 and 𝑞 in the plane and a flexible string C of length 𝐿 > ∣𝑝 − 𝑞∣,
determine the figure of largest area among those bounded by C and by the line segment
𝑝𝑞.
26. Let 𝛼 be a convex closed curve, piecewise 𝐶 1 , of perimeter 𝐿, bounding a region
Ω of area 𝐴. Let 𝑟 1 and 𝑟 2 be two parallel lines at a distance 𝑑 from each other such
that both touch 𝛼, and 𝛼 is contained in the strip bounded by the them. Consider an
orthonormal Cartesian coordinate system whose vertical axis is 𝑟 1 and whose origin
is the midpoint of the line segment 𝑟 1 ∩ 𝛼 (a line segment which may contain a single
point). There thus exist functions piecewise 𝐶 1 𝑓 , 𝑔∶ [0, 𝑑] → R such that:
r1 r2
(t,f(t))
0
W
(t,g(t))
Figure 1.11
30 1 Differentiable Curves
In this chapter we introduce regular surfaces, the object of all our further study,
defining them as those subsets of R 3 that can be described locally by two independent
parameters. We introduce notions such as tangent space, differentiable function and
diffeomorphism, and consider the problems of orientability and the measurement of
quantities (areas, lengths . . .) on surfaces.
We all have an intuitive notion of what a surface is, and any attempt to describe that
notion would inevitably fall into redundancy. We accept, however, that the plane is
the simplest surface of all, and that a good way to construct models of others is by
gluing together various pieces of paper. Our definition of surface is the mathematical
elaboration of this idea.
A subset 𝑆 of R 3 is called a regular surface if, for each 𝑝 ∈ 𝑆, there exist an open
neighborhood 𝑉 ⊆ R 3 of 𝑝, an open subset 𝑈 ⊆ R 2 , and a bijection Φ∶ 𝑈 → 𝑉 ∩ 𝑆
with the following properties:
i. Φ is of class 𝐶 ∞ ;
ii. Φ is a homeomorphism (i.e., its inverse Φ−1 ∶ 𝑉 ∩ 𝑆 → 𝑈 is continuous);
iii. for all 𝑞 ∈ 𝑈 the Jacobian matrix 𝐽Φ(𝑞) has rank two.
F
F -1 (p) Fv
p
Fu
u S
Figure 2.1
defined on the disk {(𝑢, 𝑣)∶ 𝑢 2 +𝑣 2 < 1}. With a few more analogous parameterizations
(how many are needed?) we can cover the whole sphere, which therefore is a surface.
2.1 Definition and Examples 33
p
q
j
y
Figure 2.2
has length sin 𝜃, and so is nonzero: therefore Ψ𝜑 and Ψ 𝜃 are linearly independent.
We mention that some authors use, in spherical coordinates, the latitude 𝜃̃ =
− 𝜃 (𝜃̃ ∈ ]− , [) instead of the colatitude, thus obtaining
𝜋 𝜋 𝜋
2 2 2
̃
Ψ(𝜑, ̃ = (cos 𝜃̃cos 𝜑, cos 𝜃̃sin 𝜑, sin 𝜃).
𝜃) ̃
⎛ ⎞ √
𝑢 = 𝑓 −1 𝑣 = 𝛼−1 ( 𝑥 2 + 𝑦 2 , 𝑂, 𝑧),
𝑦
√ ,
⎝ 𝑥 + 𝑥2 + 𝑦2 ⎠
1/2p 1/p
Figure 2.3
Let 𝛼(𝑢) = (𝑥(𝑢), 𝑦(𝑢)), 𝑢>0, be a simple regular curve of class 𝐶 ∞ that includes
the vertical line segment {0} × ]1, 1] and the graph of the function sin 1𝑥 , 𝑥 ∈ ]0, 𝜋1 ];
this curve accumulates in the neighborhood of {0} × ] −1, 1[ (see Fig. 2.3). The
trace 𝑆 of the parametrized surface Φ(𝑢, 𝑣) = (𝑥(𝑢), 𝑦(𝑢), 𝑣) is not a surface: if it
were, each 𝑝 ∈ 𝑆 would have arbitrarily small neighborhoods 𝑉 in R 3 with 𝑉 ∩ 𝑆
homeomorphic to disks; but that does not happen if 𝑝 ∈ {0} × ] −1, 1[ × R , because
𝑉 ∩ 𝑆 has infinitely many connected components for all sufficiently small 𝑉.
This means that not all parametrized surfaces define regular surfaces. But it is
also not easy, and in some cases not even possible, to describe regular surfaces as
2.1 Definition and Examples 35
parametrized surfaces (i.e., as the image of one single function Φ). There is in general
no reason to privilege a particular (even global) parameterization on a given surface.
In conclusion: surface for us means regular surface, and only in the exercises we
will mention parametrized surfaces.
Exercises
28. For each 𝑎 ∈ R , the polar coordinates Φ(𝜌, 𝜑) = (𝜌 cos 𝜑, 𝜌 sin 𝜑), with 𝜌 > 0
and 𝜑 ∈ ]𝑎 − 𝜋, 𝑎 + 𝜋[, define a parameterization of R 2 that excludes a half-line. (We
consider R 2 as a surface by identifying it with the plane R 2 × {0} ⊆ R 3 .)
29. Consider a helix parametrized by (cos 𝑡, sin 𝑡, 𝑡) (𝑡 ∈ R ). The helicoid is the set
formed by all (horizontal) lines connecting each point of the 𝑧-axis with the point of
the helix at the same height (see Fig. 2.4). Show that the helicoid is a regular surface.
Figure 2.4
36 2 Regular Surfaces
p (p)
Figure 2.5
Figure 2.6
2.2 Change of Parameters, Level Surfaces 37
32. The tractrix is the planar curve obtained as follows: let us fix a line (let it be
the 𝑧-axis); the distance from any point 𝑝 on the curve to the point of intersection
𝑝 ′ of the tangent line to the curve at 𝑝 with the fixed line is constant, equal to
𝐶 > 0. Parametrize the tractrix using the angle 𝑡 in Fig. 2.7 as a parameter (note that
𝑡 ∈ ] 𝜋2 , 𝜋[). [The surface of revolution obtained from the tractrix around the 𝑧-axis is
the pseudosphere].
p´
Figure 2.7
38 2 Regular Surfaces
Proposition 2.2.1 The coordinate change Φ−1 ○ Ψ∶ Φ−1 (𝑊) → Ψ−1 (𝑊) is a diffeo-
morphism.
Proof It suffices to show that Φ−1 ○Ψ is differentiable, because the same argument
proves the differentiability of its inverse Ψ−1 ○Φ. We write
𝑓 −1
𝑢, ̃
(̃ 𝑣 ) ↦ Ψ(̃ 𝑣 ) = (𝑥, 𝑦, 𝑧) ↦ (𝑥, 𝑦) z→ (𝑢, 𝑣) = Φ−1 ○ Ψ(̃
𝑢, ̃ 𝑢, ̃
𝑣 ).◻
F
Y
Y -1(W)
F -1(W)
U ~
F -1 Y U
Figure 2.8
Let us point out that the foregoing proof establishes a more general fact than 2.2.1:
if 𝛼 is a differentiable function defined on an open subset of R 𝑛 (a curve, for example)
2.2 Change of Parameters, Level Surfaces 39
Example 2.2.3 Proposition 2.2.2 gives us a criterion to show that certain sets are not
surfaces, which we illustrate with the cone
√
C = {(𝑥, 𝑦, 𝑧) ∈ R 3 ∶ 𝑧 = 𝑥 2 + 𝑦 2 }.
If C is a surface, there exists an open subset 𝑉 of R 3 containing the point (0, 0, 0)
such that 𝑉 ∩ 𝑆 is the graph of a differentiable function ℎ. But ℎ can only be a
function of (𝑥, 𝑦), because none of the projections of C√on the other coordinate planes
contains a neighborhood of (0, 0). Thus ℎ(𝑥, 𝑦) = 𝑥 2 + 𝑦 2 and this function is
not differentiable at (0, 0). Therefore, C is not a surface. [But the origin is the only
problematic point: C/{(0, 0, 0)} is a surface]. ◻
𝜕𝑓 𝜕𝑓 𝜕𝑓
∇𝑓 = ( , , ),
𝜕𝑥 𝜕𝑦 𝜕𝑧
𝜕𝑓
The condition (𝑝) ≠ 0 guarantees that in a neighborhood 𝑈 of 𝑝 ∈ 𝑓 −1 ({𝑎}),
𝜕𝑧
𝑓 is strictly monotone along vertical segments, and therefore each of these segments
intersects 𝑓 −1 ({𝑎}) at most one point. In fact, the proof of the next proposition
consists essentially in showing that under these circumstances 𝑓 −1 ({𝑎}) ∩ 𝑈 is the
graph of a differentiable function of (𝑥, 𝑦).
𝜕𝑓
Proof Assuming that (𝑝) ≠ 0, we define
𝜕𝑧
𝐹(𝑥, 𝑦, 𝑧) = (𝑥, 𝑦, 𝑓 (𝑥, 𝑦, 𝑧)).
𝜕𝑓
Since det 𝐽𝐹(𝑝) = (𝑝) ≠ 0, the function 𝐹 is invertible in a neighborhood of
𝜕𝑧
𝑝: so there exist open subsets 𝑈, 𝑊 ⊆ R 3 such that 𝑝 ∈ 𝑈 ⊆ 𝑉, and 𝐹 sends 𝑈
diffeomorphically onto 𝑊. Let us now note that the inverse 𝐺∶ 𝑊 → 𝑈 of 𝐹∣𝑈 has
the form 𝐺(𝑥, 𝑦, 𝑧) = (𝑥, 𝑦, 𝑔(𝑥, 𝑦, 𝑧)) and that, for (𝑥, 𝑦, 𝑧) ∈ 𝑈, all the following
equalities are equivalent:
𝑓 (𝑥, 𝑦, 𝑧) = 𝑎,
𝐹(𝑥, 𝑦, 𝑧) = (𝑥, 𝑦, 𝑎),
(𝑥, 𝑦, 𝑧) = 𝐺(𝑥, 𝑦, 𝑎),
𝑧 = 𝑔(𝑥, 𝑦, 𝑎).
The equivalence between the first and last subset of these equalities shows that
𝑈 ∩ 𝑓 −1 ({𝑎}) is the graph of the differentiable function ℎ(𝑥, 𝑦) = 𝑔(𝑥, 𝑦, 𝑎), whose
domain is the open subset 𝑅 = {(𝑥, 𝑦) ∈ R 2 ∶ (𝑥, 𝑦, 𝑎) ∈ 𝑊} — and this concludes the
proof that 𝑓 −1 ({𝑎}) is a surface. ◻
The sets 𝑓 −1 ({𝑎}) are the level sets of 𝑓 and, when 𝑎 is a regular value, they are
also called level surfaces. Taking for example
𝑥 2 𝑦2 𝑧2
𝑓 (𝑥, 𝑦, 𝑧) = + + ,
𝑎2 𝑏2 𝑐2
we see that the ellipsoid
𝑥 2 𝑦2 𝑧2
+ + =1
𝑎2 𝑏2 𝑐2
is a regular surface, since ∇ 𝑓 (𝑥, 𝑦, 𝑧) is nonzero for all (𝑥, 𝑦, 𝑧) ≠ (0, 0, 0). In general,
any non-degenerate quadric in R 3 is a regular surface, since, for an appropriate
orthonormal basis, it has equation 𝑥 + 𝜀2 𝑦 2 + 𝜀3 𝑧2 = 0 if it is a paraboloid, or
𝜀1 𝑥 2 + 𝜀2 𝑦 2 + 𝜀3 𝑧 2 = 1, with (𝜀1 , 𝜀 2 , 𝜀 3 ) ≠ (0, 0, 0), if it is a hyperboloid or an
ellipsoid.
2.2 Change of Parameters, Level Surfaces 41
Note 2.2.5 A level surface is not necessarily connected, as shown by the two-leaf
hyperboloid defined by the equation 𝑧 2 − 𝑥 2 − 𝑦 2 = 1. For the benefit of the reader
unfamiliar with the concept, we give here the definition of connectedness and a brief
discussion: a set 𝐴 ⊆ R 𝑛 is called connected if it cannot be split, i.e., if there are no
disjoint open subsets 𝑉 and 𝑊 of R 𝑛 such that 𝐴 ∩ 𝑉 ≠ ∅ ≠ 𝐴 ∩ 𝑊 and 𝐴 ⊆ 𝑉 ∪ 𝑊.
The hyperboloid above, for example, is not connected because it admits the splitting
𝑉 = {(𝑥, 𝑦, 𝑧) ∈ R 3 ∶ 𝑧 > 0} and 𝑊 = {(𝑥, 𝑦, 𝑧) ∈ R 3 ∶ 𝑧 < 0}. The connected subsets of
R are the intervals; the balls in R 𝑛 (disks in R 2 ) are connected. Connectedness is a
topological property, in the sense that the image of a connected set under a continuous
function is still connected.
To put it suggestively, a surface is connected when it is made up of a single
chunk. A useful connectivity criterion for surfaces (and for open subsets of R 𝑛 )
is the following: 𝑆 is connected if and only if, for every 𝑝 and 𝑞 on 𝑆, there exists
a piecewise differentiable curve 𝛼∶ [𝑎, 𝑏] → 𝑆 such that 𝛼(𝑎) = 𝑝 and 𝛼(𝑏) = 𝑞.
[Proof: if 𝑉 and 𝑊 split 𝑆, then there is no curve in 𝑆 that joins 𝑝 ∈ 𝑆 ∩𝑉 to 𝑞 ∈ 𝑆 ∩𝑊,
because the trace of a curve, being a continuous image of an interval, is connected.
On the other hand, if 𝑆 is connected and 𝑝 ∈ 𝑆, consider the set 𝑅 = {𝑞 ∈ 𝑆: there
exists a curve in 𝑆 from 𝑝 to 𝑞}. Given 𝑞 ∈ 𝑆, let (𝐷, Φ) be a parameterization in the
neighborhood of 𝑞, where 𝐷 ⊆ R 2 is an open disk. Any 𝑟 ∈ Φ(𝐷) can be joined to 𝑞
by a curve in 𝑆: the image under Φ of the line segment [Φ−1 (𝑞), Φ−1 (𝑟)]. Thus, if
𝑞 ∈ 𝑅 (resp. 𝑞 ∈ 𝑆/𝑅) then Φ(𝐷) ⊆ 𝑅 (resp. Φ(𝐷) ⊆ 𝑆/𝑅). Therefore 𝑅 and 𝑆/𝑅 are
open subsets of 𝑆 and, since 𝑆 is connected, one of them, necessarily 𝑆/𝑅, is empty.
Therefore 𝑆 = 𝑅, which proves what we wanted. ] ◻
Now that we have a method for establishing that a set is a surface without using
any parameterization, our next proposition states that if 𝑆 is a surface, anything that
appears to be a parameterization of 𝑆 is indeed so.
Proof One just has to check the continuity of the inverse Φ−1 ∶ Φ(𝑈) → 𝑈. Given
(𝑢 0 , 𝑣 0 ) ∈ 𝑈, the point Φ(𝑢 0 , 𝑣 0 ) has, by 2.2.2, an open neighborhood 𝑉 in R 3 such
that 𝑉 ∩ 𝑆 is the graph of a function that we assume to depend on (𝑥, 𝑦). Thus,
𝑉 ∩ 𝑆 = {(𝑥, 𝑦, ℎ(𝑥, 𝑦))∶ (𝑥, 𝑦) ∈ 𝑅}, where 𝑅 is an open subset of R 2 ; and, taking an
open disk 𝐷 ⊆ 𝑈 centered at (𝑢 0 , 𝑣 0 ) and such that Φ(𝐷) ⊆ 𝑉, the restriction of Φ to
𝐷 can be written as Φ(𝑢, 𝑣) = (𝑥(𝑢, 𝑣), 𝑦(𝑢, 𝑣), ℎ(𝑥(𝑢, 𝑣), 𝑦(𝑢, 𝑣))). We then have
42 2 Regular Surfaces
𝜕𝑥 𝜕ℎ 𝜕𝑦 𝜕ℎ
Φ𝑢 = (1, 0, )+ (0, 1, ),
𝜕𝑢 𝜕𝑥 𝜕𝑢 𝜕𝑦
𝜕𝑥 𝜕ℎ 𝜕𝑦 𝜕ℎ
Φ𝑣 = (1, 0, )+ (0, 1, ),
𝜕𝑣 𝜕𝑥 𝜕𝑣 𝜕𝑦
𝜕𝑥 𝜕𝑦 𝜕𝑥 𝜕𝑦 𝜕ℎ 𝜕ℎ
Φ𝑢 × Φ𝑣 = { − } (1, 0, ) × (0, 1, )
𝜕𝑢 𝜕𝑣 𝜕𝑣 𝜕𝑢 𝜕𝑥 𝜕𝑦
𝜕(𝑥, 𝑦) 𝜕ℎ 𝜕ℎ
= (− ,− , 1)
𝜕(𝑢, 𝑣) 𝜕𝑥 𝜕𝑦
𝜕(𝑥, 𝑦)
– and from this, since Φ𝑢 ×Φ𝑣 is nonzero, it follows that ≠ 0. We can therefore
𝜕(𝑢, 𝑣)
assume, shrinking 𝐷 if necessary, that 𝜋 ○ Φ∣𝐷 [where 𝜋∶ R 3 → R 2 is the projection
on the first two coordinates] is a diffeomorphism onto its image, which is then an open
subset 𝐸 of R 2 . Thus, Φ(𝐷) = {(𝑥, 𝑦, ℎ(𝑥, 𝑦))∶ (𝑥, 𝑦) ∈ 𝐸} is an open neighborhood
of Φ(𝑢 0 , 𝑣 0 ) in 𝑆, and the restriction Φ−1 ∣Φ(𝐷) is continuous, because it is given by
−1
the composite (𝜋 ○ Φ∣𝐷 ) ○ 𝜋 of continuous functions. Thus Φ−1 is continuous on
Φ(𝑢 0 , 𝑣 0 ). ◻
Exercises
33. Show that any surface is locally a level surface. Given 𝑝 ∈ 𝑆, there exist an
open neighborhood 𝑉 of 𝑝 in R 3 and a differentiable function 𝑓 ∶ 𝑉 → R such that
𝑆 ∩ 𝑉 = 𝑓 −1 ({0}) and 0 is a regular value of 𝑓 .
34. Show that if two surfaces 𝑆1 and 𝑆2 intersect transversely at 𝑝 then there exists
an open neighborhood 𝑉 of 𝑝 (in R 3 ) such that 𝑆1 ∩ 𝑆2 ∩ 𝑉 is the trace of a regular
curve. (We say that 𝑆1 and 𝑆2 intersect transversely at 𝑝 if 𝑇 𝑝 𝑆1 ≠ 𝑇 𝑝 𝑆2 .)
The results of the previous section prepared the setting to do Differential Calculus on
surfaces; and we can now, in this section, explain what is a differentiable function in
such a context. The derivatives of such functions are defined not on the surface but
on its tangent spaces, a concept that we also introduce here.
Let 𝑆1 and 𝑆2 be two surfaces. A mapping 𝑓 ∶ 𝑆1 → 𝑆2 is called differentiable if its
expression in local coordinates is differentiable: more precisely, if there exist, for each
𝑝 ∈ 𝑆1 , parameterizations (𝑈, Φ) of 𝑆1 and (𝑉, Ψ) of 𝑆2 in the neighborhoods of 𝑝
and 𝑓 (𝑝), respectively, such that Ψ−1 ○ 𝑓 ○ Φ is differentiable. Similarly, a function
𝑓 ∶ 𝑆1 → R is called differentiable if every point of 𝑆1 has a parametrized neighborhood
(𝑈, Φ) such that 𝑓 ○ Φ is differentiable. A diffeomorphism is a differentiable bijection
𝑓 ∶ 𝑆1 → 𝑆2 whose inverse is also differentiable.
2.3 Differentiable Functions on Surfaces, Tangent Space 43
̃ = Φ∣
Ψ ]𝑢 1 +𝑢0 − 𝜋,𝑢1 +𝑢0 + 𝜋[×]𝑣1 +𝑣0 − 𝜋,𝑣1 +𝑣0 + 𝜋[
We now deal with the tangent space to a surface 𝑆 at a point 𝑝, which we denote
by 𝑇 𝑝 𝑆. We define 𝑇 𝑝 𝑆 to be the set of velocity vectors, at the point 𝑝, of the curves
whose graph is in 𝑆:
In practice, what we did was to write the curve 𝛼 in local coordinates: if 𝛼(𝑡) =
Φ(𝑢(𝑡), 𝑣(𝑡)) then 𝛼 is differentiable if and only if so are both functions 𝑢(𝑡) and
𝑣(𝑡); and the chain rule provides the equality 𝛼′ (𝑡) = 𝑢 ′ (𝑡)Φ𝑢 + 𝑣 ′ (𝑡)Φ𝑣 , which
shows that at each point of Φ(𝑈) the tangent space is generated by the vectors Φ𝑢
and Φ𝑣 .
Example 2.3.3 The tangent space to the level surface 𝑆 𝑎 = 𝑓 −1 ({𝑎}) at point 𝑝 is the
orthogonal complement of the line generated by ∇ 𝑓 (𝑝). In fact, if 𝛼∶ ]−𝜀, 𝜀[ → 𝑆 𝑎
satisfies 𝛼(0) = 𝑝 then 𝑓 ○ 𝛼(𝑡) = 𝑎 for all 𝑡 ∈ ]−𝜀, 𝜀[, so that ⟨∇ 𝑓 (𝑝), 𝛼′ (0)⟩ =
( 𝑓 ○ 𝛼)′ (0) = 0 — which shows that ∇ 𝑓 (𝑝) is orthogonal to 𝑇 𝑝 𝑆 𝑎 . ◻
From (*) and (**) it follows that 𝐷 𝑓 𝑝 (𝛼′ (0)) is well-defined, not depending on 𝛼
but only on 𝛼′ (0); and that furthermore 𝐷 𝑓 𝑝 is linear and has matrix 𝐽 𝑓̃Φ−1 ( 𝑝) with
respect to the given bases. ◻
We end this section with two results which are the transpositions of the inverse
mapping theorem and the chain rule to the context of surfaces. The proof of the first
one is left as an (easy) exercise.
Proof The verification that 𝑔 ○ 𝑓 is differentiable is left to the reader. As for the
second statement, let us take u ∈ 𝑇 𝑝 𝑆1 and a curve 𝛼 such that 𝛼′ (0) = u, and let
us put 𝛽 = 𝑓 ○ 𝛼, 𝛾 = 𝑔 ○ 𝑓 ○ 𝛼, v = 𝛽′ (0), w = 𝛾 ′ (0). We then have 𝐷 𝑓 𝑝 (u) = v
(because 𝛽 = 𝑓 ○ 𝛼), 𝐷𝑔 𝑓 ( 𝑝) (v) = w (because 𝛾 = 𝑔 ○ 𝛽), 𝐷(𝑔 ○ 𝑓 ) 𝑝 (u) = w (because
𝛾 = (𝑔 ○ 𝑓 ) ○ 𝛼), and therefore 𝐷(𝑔 ○ 𝑓 ) 𝑝 (u) = (𝐷𝑔 𝑓 ( 𝑝) ○ 𝐷 𝑓 𝑝 )(u). ◻
Exercises
35. Consider the function 𝑓 ∶ R 3 → R given by 𝑓 (𝑥, 𝑦, 𝑧) = 2𝑥 2 − 𝑦 2 − 𝑧2 . Determine
the equations of the planes which √are tangent to the surface 𝑓 −1 ({1}) and parallel to
the plane given by the equation 2 2𝑥 + 𝑦 + 𝑧 = 0.
36. (a) Show that the paraboloid 𝑧 = 𝑥 2 + 𝑦 2 is diffeomorphic to the plane.
(b) Show that the sphere S2 and the ellipsoid
𝑥 2 𝑦2 𝑧2
+ + =1
𝑎2 𝑏2 𝑐2
are diffeomorphic.
37. Let 𝑉 be a neighborhood of the origin in R 2 and let Φ∶ 𝑉 → R 3 given by
Φ(𝑢, 𝑣) = 𝑓 (𝑢) + 𝑔(𝑣) be a parameterization of a regular surface 𝑆. Show that the
tangent planes to 𝑆 along the curve Φ(𝑢, 0) are all parallel to the same line.
38. A differentiable mapping 𝑓 ∶ 𝑆1 → 𝑆2 is called a local diffeomorphism if each point
𝑝 ∈ 𝑆1 has a neighborhood 𝑊 in 𝑆1 such that 𝑓 ∣𝑊 ∶ 𝑊 → 𝑓 (𝑊) is a diffeomorphism.
Show that if 𝑓 is a local diffeomorphism then 𝐷 𝑓 𝑝 is a linear isomorphism for all
𝑝 ∈ 𝑆1 .
39. Show that if all normal lines to a connected surface pass through the same point,
then that surface is contained in a sphere.
46 2 Regular Surfaces
40. (a) Given 0 < 𝑟 1 < 𝑟 2 and 𝜑0 , consider a function 𝑔∶ R → R that is 𝐶 ∞ , monotone
and such that 𝑔(𝑥) = 𝜑0 for 𝑥 ≤ 𝑟 1 , and 𝑔(𝑥) = 0 for 𝑥 ≥ 𝑟 2 (see ex. 17). Let
ℎ∶ R 2 → R 2 be the mapping that sends the point with polar coordinates (𝜌, 𝜑) to the
point with coordinates (𝜌, 𝜑 + 𝑔(𝜌)). Show that ℎ is a 𝐶 ∞ diffeomorphism. How
does ℎ behave in {𝑝 ∈ R 2 ∶ ∣𝑝∣ ≤ 𝑟 1 } and {𝑝 ∈ R 2 ∶ ∣𝑝∣ ≥ 𝑟 2 }?
(b) Let (𝑈, Φ) be a parameterization of 𝑆 such that 𝑈 contains the closed disk
with radius 𝑟 2 centered at the origin. Show that Φ ○ ℎ ○ Φ−1 ∶ Φ(𝑈) → Φ(𝑈) extends
to a diffeomorphism of 𝑆.
(c) Show that if 𝑆 is connected then for any two points of 𝑆 there exists a
diffeomorphism of 𝑆 that sends one of these points to the other one.
41. Define explicitly a differentiable mapping T 2 → S2 that is surjective.
42. Let 𝑆 = {(𝑥, 𝑦, 𝑧) ∈ R 3 ∶ 𝑥 ≠ 0, 𝑧 = 𝑥 𝑓 (𝑦/𝑥)}, where 𝑓 ∶ R → R is a 𝐶 ∞ function.
Show that 𝑆 is a regular surface, and that all tangent planes to 𝑆 pass through the
origin.
43. Consider the mapping
2.4 Orientability
A surface is orientable when it is possible to distinguish its top from the bottom, so
that an observer placed on it can distinguish left from right. This approach works
when the observer is three-dimensional and has an idea of the position of the surface
in space; it is more intricate to explain how two-dimensional beings whose universe
is the surface will know whether it is orientable or not.
Given two linearly independent vectors v and w in R 3 , the trihedron (v, w, 𝑁),
where
1
𝑁= (v × w),
∣v × w∣
forms a positively oriented basis of R 3 , meaning that the matrix whose columns
are (in the same order) these vectors has positive determinant. The unit vector 𝑁 is
orthogonal to the plane Π generated by v and w, introducing an orientation in Π as
follows: a basis (v1 , w1 ) of Π is called positively oriented if the triplet (v1 , w1 , 𝑁) is
a positively oriented basis of R 3 ; in other words, if
1
𝑁= (v1 × w1 ).
∣v1 × w1 ∣
2.4 Orientability 47
We thus recognize that Π has exactly two orientations, one induced by 𝑁 and the
other one by −𝑁.
We say that the surface 𝑆 is orientable if it is possible to choose, for each 𝑝 ∈ 𝑆, an
orientation on 𝑇 𝑝 𝑆 that varies continuously with 𝑝 — more precisely, if there exists a
continuous function 𝑁∶ 𝑆 → S2 such that, for each 𝑝, 𝑁(𝑝) is orthogonal to 𝑇 𝑝 𝑆. We
call such a field of normal vectors 𝑁 an orientation of 𝑆.
For example, level surfaces are orientable, because the vector field
1
𝑁(𝑝) = ∇ 𝑓 (𝑝)
∣∇ 𝑓 (𝑝)∣
Figure 2.9
which satisfies the relations e1 ′ (𝜃) = e2 (𝜃), e2 ′ (𝜃) = −e1 (𝜃), e1 (𝜃) × e2 (𝜃) = e3 (𝜃),
e2 (𝜃) × e3 (𝜃) = e1 (𝜃), e3 (𝜃) × e1 (𝜃) = e2 (𝜃). Writing
𝜃 𝜃
Φ(𝜃, 𝑡) = (2 − 𝑡 sin ) e1 (𝜃) + 𝑡 cos e3 (𝜃),
2 2
we easily conclude that
𝜃 𝜃 𝜃 𝑡
Φ 𝜃 × Φ𝑡 = (2 − 𝑡 sin ) [cos e1 (𝜃) + sin e3 (𝜃)] + e2 (𝜃).
2 2 2 2
𝑢, ̃
Φ(𝑢, 𝑣) = Ψ(̃ 𝑣) (*)
for (𝑢, 𝑣) ∈ Φ−1 (𝑊) and (̃ 𝑣 ) = Ψ−1 ○ Φ(𝑢, 𝑣). By differentiation of (*) we obtain
𝑢, ̃
the two equalities
𝜕̃
𝑢 𝜕̃
𝑣 𝜕̃
𝑢 𝜕̃
𝑣
Φ𝑢 = Ψ𝑢̃ + Ψ̃𝑣 , Φ𝑣 = Ψ𝑢̃ + Ψ̃𝑣 ,
𝜕𝑢 𝜕𝑢 𝜕𝑣 𝜕𝑣
from which
1 1
have Φ𝑢 × Φ𝑣 = 𝑁∣Φ(𝑈) or Φ𝑢 × Φ𝑣 = −𝑁∣Φ(𝑈) . In the first
∣Φ𝑢 × Φ𝑣 ∣ ∣Φ𝑢 × Φ𝑣 ∣
hypothesis, we take (𝑈 𝛼 , Φ 𝛼 ) = (𝑈, Φ); in the second one, we take 𝑈 𝛼 = {(𝑢, 𝑣) ∈
R 2 ∶ (𝑢, −𝑣) ∈ 𝑈} and Φ 𝛼 (𝑢, 𝑣) = Φ(𝑢, −𝑣); in each case Φ 𝛼 induces on Φ 𝛼 (𝑈 𝛼 )
the same orientation as 𝑁. The atlas (𝑈 𝛼 , Φ 𝛼 ) 𝛼∈I obtained in this way covers 𝑆 and,
by the equivalence of conditions a) and b) above, is oriented.
Given now an oriented atlas (𝑈 𝛼 , Φ 𝛼 ) 𝛼∈I , we define an orientation 𝑁∶ 𝑆 → S2 by
requiring that its restriction to each open subset Φ 𝛼 (𝑈 𝛼 ) is the orientation induced
by Φ 𝛼 . By (**), there is no ambiguity in the definition of 𝑁; and, since 𝑁∣Φ (𝑈 ) is
𝛼 𝛼
continuous and (Φ 𝛼 (𝑈 𝛼 )) 𝛼∈I is a covering of 𝑆 by open sets, 𝑁 is continuous. ◻
Proof Let 𝑁 and −𝑁 be the two orientations of 𝑆. We define two oriented atlases A1
and A2 of 𝑆 as follows: A1 (resp. A2 ) includes all parameterizations (𝑈, Φ) of 𝑆 such
that Φ induces in Φ(𝑈) the orientation 𝑁∣Φ(𝑈) (resp. −𝑁∣Φ(𝑈) ). Thus, any oriented
atlas is included either in A1 or in A2 , so that we can assume A = A1 . Furthermore,
det 𝐽(Ψ−1 ○ Φ) < 0 whenever (𝑈, Φ) ∈ A1 and (𝑉, Ψ) ∈ A2 .
The set 𝑓 (A1 ) of the parameterizations (𝑈, 𝑓 ○ Φ) such that (𝑈, Φ) ∈ A1
is an oriented atlas of 𝑆, so that it defines on 𝑆 one of the orientations 𝑁 or
2.5 Areas, Lengths, and Angles: The First Fundamental Form 51
−𝑁. In the first hypothesis, 𝑓 (A1 ) ⊆ A1 and therefore det 𝐽(Ψ−1 ○ 𝑓 ○ Φ) > 0
whenever (𝑈, Φ), (𝑉, Ψ) ∈ A1 ; in the second hypothesis, 𝑓 (A1 ) ⊆ A2 and thence
det 𝐽(Ψ−1 ○ 𝑓 ○ Φ) < 0 for (𝑈, Φ), (𝑉, Ψ) ∈ A1 . This proves (ii).
To prove (i), we take (𝑈, Φ) in A1 and set 𝑉 = {(𝑢, 𝑣) ∈ R 2 ∶ (𝑢, −𝑣) ∈ 𝑈},
Ψ(𝑢, 𝑣) = Φ(𝑢, −𝑣). Then (𝑉, Ψ) belongs to A2 , since the Jacobian of Ψ−1 ○Φ(𝑢, 𝑣) =
(𝑢, −𝑣) is negative; and, for the same reason, the parameterizations (𝑈, 𝑓 ○ Φ) and
(𝑉, 𝑓 ○ Ψ) cannot belong both to A1 or both to A2 . Therefore 𝑓 (A1 ) and 𝑓 (A2 )
define distinct orientations, which proves (i). ◻
belongs to some oriented atlas of S2 , since its domain is connected; and, since the
Jacobian of Φ−1 ○ 𝑓 ○ Φ(𝑢, 𝑣) = (−𝑢, 𝑣) is negative, we conclude that 𝑓 reverses
orientation.
Exercises
44. Find out whether the antipodal mapping ℎ∶ S2 → S2 given by ℎ(𝑥, 𝑦, 𝑧) =
(−𝑥, −𝑦, −𝑧) preserves orientation or not.
45. Consider the Möbius strip M parametrized by Φ(𝜃, 𝑡) = ((2 − 𝑡 sin 2𝜃 ) cos 𝜃, (2 −
𝑡 sin 2𝜃 ) sin 𝜃, 𝑡 cos 2𝜃 ). Show that if the circumference 𝑡 = 0 is removed from M , then
the resulting surface is still connected but is then orientable.
46. Let 𝑓 ∶ 𝑆1 → 𝑆2 be a local diffeomorphism. Check whether the following statements
are true:
(a) if 𝑆2 is orientable then 𝑆1 is orientable;
(b) if 𝑆1 is orientable and 𝑓 is surjective then 𝑆2 is orientable.
47. Let 𝑆 be a connected orientable surface and let 𝑓 ∶ 𝑆 → 𝑆 be a diffeomorphism. Is
it true that 𝑓 ○ 𝑓 preserves orientation?
Any surface 𝑆 ⊆ R 3 inherits from the ambient space a notion of size that can be
used to measure the area of regions and the length of curves in 𝑆. This metric
structure, which we now introduce, enriches the concept of surface and enables a
finer classification than that by diffeomorphisms.
The first fundamental form of 𝑆 at 𝑝 ∈ 𝑆 is the quadratic form 𝐼 𝑝 ∶ 𝑇 𝑝 𝑆 → R +
defined by 𝐼 𝑝 (v) = ⟨v, v⟩ 𝑝 , where ⟨⋅ , ⋅⟩ 𝑝 is the restriction to 𝑇 𝑝 𝑆 of the usual inner
product on R 3 .
If Φ(𝑢, 𝑣) is a parameterization of 𝑆 and 𝛼(𝑡) = Φ(𝑢(𝑡), 𝑣(𝑡)) is a differentiable
curve, we have
52 2 Regular Surfaces
where 𝐸, 𝐹 and 𝐺 are the so-called coefficients of the first fundamental form
for the parameterization Φ(𝑢, 𝑣), defined by 𝐸(𝑢, 𝑣) = 𝐼Φ(𝑢,𝑣) (Φ𝑢 ), 𝐹(𝑢, 𝑣) =
⟨Φ𝑢 , Φ𝑣 ⟩Φ(𝑢,𝑣) , 𝐺(𝑢, 𝑣) = 𝐼Φ(𝑢,𝑣) (Φ𝑣 ). The above calculations show that the length
of 𝛼(𝑡), 𝑡 ∈ [𝑎, 𝑏], is given by
𝑏√
2 2
𝑙(𝛼) = ∫ 𝐸 𝑢 ′ (𝑡) + 2𝐹 𝑢 ′ (𝑡)𝑣 ′ (𝑡) + 𝐺 𝑣 ′ (𝑡) 𝑑𝑡.
𝑎
Therefore it is possible to compute the length of any curve in 𝑆 knowing only the first
fundamental form (and hence its coefficients in any parameterization) without further
reference to the ambient space.
We point out that the matrix of the quadratic form 𝐿 Φ(𝑢,𝑣) relative to the basis
𝐸 𝐹
(Φ𝑢 , Φ𝑣 ) of 𝑇Φ(𝑢,𝑣) 𝑆 is 𝑀 = [ ]: so if v = 𝑎Φ𝑢 + 𝑏Φ𝑣 and w = 𝑐Φ𝑢 + 𝑑Φ𝑣 , the
𝐹𝐺
𝑐
inner product of v and w is given by the matrix product [𝑎, 𝑏] 𝑀 [ ] = 𝐸 𝑎𝑐 + 𝐹(𝑎𝑑 +
𝑑
𝑏𝑐) + 𝐺 𝑏𝑑.
Examples 2.5.1 If v and w are orthonormal vectors and 𝑝 ∈ R 3 then the parameteri-
zation Ψ(𝑢, 𝑣) = 𝑝 + 𝑢v + 𝑣w of the plane parallel to v and w which passes through
𝑝 has coefficients 𝐸 = 1, 𝐹 = 0, 𝐺 = 1. On the other hand, the coefficients of the
parameterization Φ(𝑢, 𝑣) = ((2 + cos 𝑣) cos 𝑢, (2 + cos 𝑣) sin 𝑢, sin 𝑣) of T 2 are 𝐸 = 1,
2
𝐹 = 0 and 𝐺 = (2 + cos 𝑣) . ◻
The first fundamental form also allows one to compute the angle between two
nonzero vectors v, w ∈ 𝑇 𝑝 𝑆: this (non-oriented) angle is the only 𝜃 ∈ [0, 𝜋] such that
1
cos 𝜃 = √ ⟨v, w⟩ 𝑝 ;
𝐼 𝑝 (v)𝐼 𝑝 (w)
in local coordinates, writing v = 𝑎Φ𝑢 + 𝑏Φ𝑣 and w = 𝑐Φ𝑢 + 𝑑Φ𝑣 , we have
If the surface 𝑆 is oriented and Φ(𝑢, 𝑣) is compatible with the orientation, we can
assign a sign to the angles: the oriented angle ∠(v, w) (from v to w) is the only
𝜃 ∈ ] − 𝜋, 𝜋] such that equality (*) is satisfied and such that it is negative when
𝑎𝑑 − 𝑏𝑐 < 0, non-negative when 𝑎𝑑 − 𝑏𝑐 ≥ 0. [Since the oriented angles are defined
up to integer multiples of 2𝜋, the representatives of the angle ∠(v, w) are thus all
numbers of the form 𝜃 + 2𝑘 𝜋, 𝑘 ∈ Z .]
The angle between two curves 𝛼(𝑡) and 𝛽(𝑠) in 𝑆 at an intersection point
𝛼(𝑡 0 ) = 𝛽(𝑠0 ) is, by definition, the angle between the velocity vectors 𝛼′ (𝑡 0 ) and
2.5 Areas, Lengths, and Angles: The First Fundamental Form 53
𝛽′ (𝑠0 ). For instance, it follows from the formulas deduced above that the
√angle between
the coordinate curves of the parameterization Φ(𝑢, 𝑣) is arccos (𝐹/ 𝐸𝐺) ∈ ]0, 𝜋[.
When 𝐹 = 0 the coordinate curves intersect each other orthogonally; in this case
we say that Φ(𝑢, 𝑣) is an orthogonal parameterization. The above examples 2.5.1
are orthogonal parameterizations; in fact, as we showed in Section 3.3, any surface
admits orthogonal parameterizations.
We finally deal with the measurement of areas. If Δ ⊆ 𝑆 is a region contained in a
single coordinate system (𝑈, Φ), its area is defined by the integral
∬ ∣Φ𝑢 × Φ𝑣 ∣ 𝑑𝑢 𝑑𝑣
Φ−1 (Δ)
— if such an integral exists (and it certainly exists when Δ is open or closed and the
closure of Φ−1 (Δ) is a compact set contained in 𝑈). If Δ is not contained in a single
parametrized neighborhood, we can write it as a disjoint, finite or countable union of
regions Δ𝑛 whose areas we can compute, and add up the results.
In Section 2.4 we deduced the formula Φ𝑢 × Φ𝑣 = (det 𝐽(𝜓 −1 ○ Φ)(𝑢,𝑣) )Ψ𝑢̃ × Ψ̃𝑣 .
Therefore, if we have Δ ⊆ Φ(𝑈) ∩ Ψ(𝑈), the equality
∬ ∣Φ𝑢 × Φ𝑣 ∣ 𝑑𝑢 𝑑𝑣
Φ−1 (Δ)
for the calculation of the area of Δ, let us cover Φ−1 (Δ) with a fine lattice of horizontal
and vertical lines, and let 𝑅𝑖, 𝑗 = [𝑢 𝑖 , 𝑢 𝑖+1 ] × [𝑣 𝑗 , 𝑣 𝑗+1 ] be any rectangle of this lattice,
whose intersection with Φ−1 (Δ) be non-empty. Then the integral in question is the
limit, as the maximum diameter of the 𝑅𝑖, 𝑗 tends to zero, of the sums
— where each summand gives the area of the parallelogram of sides (𝑢 𝑖+1 − 𝑢 𝑖 )Φ𝑢
and (𝑣 𝑗+1 − 𝑣 𝑗 )Φ𝑣 . The sides of this parallelogram are tangent to Φ(𝑢 𝑖 , 𝑣 𝑗 ) and have
lengths approximating the sides of the “rectangle” Φ(𝑅𝑖, 𝑗 ).
We now want to express the area using the coefficients 𝐸, 𝐹 and 𝐺. From the
identity
∣Φ𝑢 × Φ𝑣 ∣2 + ⟨Φ𝑢 , Φ𝑣 ⟩2 = ∣Φ𝑢 ∣2 ∣Φ𝑣 ∣2 ,
we obtain
54 2 Regular Surfaces
√
∣Φ𝑢 × Φ𝑣 ∣ = 𝐸𝐺 − 𝐹 2 ,
and therefore the area of Δ is given by the integral
√
∬ 𝐸𝐺 − 𝐹 2 𝑑𝑢 𝑑𝑣.
Φ−1 (Δ)
We again point out that it follows from this formula that the notion of area depends
only on the knowledge of the first fundamental form.
Example 2.5.2 The coefficients of the first fundamental form of the spherical coordi-
nates (𝜑, 𝜃) in S2 (example 2.1.1 D) are 𝐸 = sin2 𝜃, 𝐹 = 0 and 𝐺 = 1. The area of S2
is then given by the integral
𝜋 𝜋
∫ (∫ sin 𝜃 𝑑𝜃) 𝑑𝜑 = 4𝜋.
−𝜋 0
It is also interesting to note that the area of the spindle between the meridians 𝜑 = 0
and 𝜑 = 𝜑0 is equal to 2𝜑0 ; more generally, any spindle of amplitude 𝜑0 (bounded
by two maximal semicircles of S2 that intersect at an angle 𝜑0 ∈ ]0, 𝜋[) has area
2𝜑0 . This allows us to deduce Girard’s formula, which gives the area of a spherical
triangle (which is the figure inside S2 bounded by three maximal circular arcs) as a
function of its interior angles.
Assume that such a triangle T has vertices 𝐴, 𝐵, 𝐶 and interior angles 𝜑1 , 𝜑2 , 𝜑3
and denote by 𝛼, 𝛽, 𝛾 the maximal circles containing respectively the pairs of points
𝐵 and 𝐶, 𝐴 and 𝐶, 𝐴 and 𝐵. The antipodes 𝐴, ̃ 𝐵,
̃ and 𝐶̃ of the vertices of T form a
triangle 𝐶̃ bounded by arcs of the same maximal circles 𝛼, 𝛽 and 𝛾 (see Fig. 2.10).
Because they are antipodes of each other, T and T̃ have the same area (see Exercise
51 in this section).
The two maximal circles 𝛽 and 𝛾 define two spindles of amplitude 𝜑1 in S2 ; one
of them contains T and the other one T̃ . We denote by Δ1 the union of these two
spindles, and define analogously (using the pairs 𝛼 and 𝛾, 𝛼 and 𝛽) the regions Δ2
and Δ3 . The union of the Δ𝑖 covers S2 , but each point of 𝑇 ∪ T̃ is counted three times,
since, for 𝑖 ≠ 𝑗, we have Δ𝑖 ∩ Δ 𝑗 = T ∪ T̃ . Thus,
3
∑ area(Δ𝑖 ) = area(S ) + 2[area(T ) + area(T̃ )]
2
𝑖=1
and therefore
1 3
area(T ) = {∑ area(Δ𝑖 ) − area(S2 )}
4 𝑖=1
= 𝜑1 + 𝜑2 + 𝜑3 − 𝜋.
We can describe our conclusion by saying that the area of a spherical triangle is
proportional to its spherical excess (with proportionality constant equal to the square
2.5 Areas, Lengths, and Angles: The First Fundamental Form 55
We end this section by defining what is meant by the integral of a real function
defined on a surface: given a function 𝑓 ∶ 𝑆 → R , a parameterization (𝑈, Φ) of 𝑆, and
a region Δ ⊆ Φ(𝑈), the integral of 𝑓 along Δ is
√
∫ 𝑓 𝑑𝜎 = ∬ 𝑓 ○ Φ(𝑢, 𝑣) 𝐸𝐺 − 𝐹 2 𝑑𝑢 𝑑𝑣.
Δ Φ−1 (Δ)
~ j1 ~
C B
j2 j3
a
B C
g ~
A
D1
j1
A
j1
D3 D2
j2 j3
B
j2 C j3
D2 D1 D3
Figure 2.10
56 2 Regular Surfaces
In particular, we have just defined what is meant by ∫𝑆 𝑓 𝑑𝜎. Again it can be shown
that these definitions do not depend on the parameterizations
√ we use. The “quantity”
𝑑𝜎, which in local coordinates is written 𝐸𝐺 − 𝐹 2 𝑑𝑢 𝑑𝑣, is usually called area
element of the surface.
Exercises
48. Obtain the coefficients of the first fundamental form for: (i) the helicoid (choose a
parameterization); (ii) the sphere (parametrized by the inverse of the stereographic
projection); (iii) a surface of revolution (parametrized as in example 2.1.1 E).
49. The coordinate curves of the parameterization Φ(𝑢, 𝑣) constitute a Chebyshev
net if the opposite sides of any quadrangle formed by them have the same lengths.
𝜕𝐸 𝜕𝐺
Show that this happens if and only if ≡ ≡ 0.
𝜕𝑣 𝜕𝑢
50. Find all curves of the cylinder {(𝑥, 𝑦, 𝑧) ∈ R 3 ∶ 𝑥 2 + 𝑦 2 = 1} that intersect the
generatrices (vertical lines) at a constant angle.
51. We say that a diffeomorphism of one surface onto another (or of an open subset
of one surface onto an open subset of another surface) preserves area if the area of
each open set is equal to that of its image.
(a) Let Ψ∶ 𝑈 → 𝑆1 and Φ∶ 𝑈 → 𝑆2 be parameterizations of two surfaces, 𝐸, 𝐹, 𝐺
̃ 𝐹,
and 𝐸, ̃ 𝐺̃ the coefficients of the first fundamental form for Ψ and Φ, respectively.
̃ − 𝐹̃2
Show that Ψ ○ Φ−1 preserves areas if and only if the functions 𝐸𝐺 − 𝐹 2 and 𝐸̃𝐺
are identical.
(b) Show that the antipodal mapping S2 → S2 , (𝑥, 𝑦, 𝑧) ↦ (−𝑥, −𝑦, −𝑧), preserves
areas.
(c) The Archimedes projection sends each point 𝑝 of S2 (except the north and
south poles) to the point of intersection of the circumscribing vertical cylinder with
the half-line 𝑞 𝑝 whose origin is the point 𝑞 on the 𝑧-axis at the same height as 𝑝.
Show that this mapping preserves areas.
(d) Find a mapping of an open subset of T 2 into the plane that preserves areas.
52. Let 𝑈 be a connected open subset of R 2 and let ℎ∶ 𝑈 → R be a differentiable
function. Consider the surface 𝑆 = {(𝑥, 𝑦, 𝑧) ∈ R 3 ∶ (𝑥, 𝑦) ∈ 𝑈, 𝑧 = ℎ(𝑥, 𝑦)}. Show
that:
(a) the mapping 𝜋∶ 𝑆 → 𝑈 given by 𝜋(𝑥, 𝑦, 𝑧) = (𝑥, 𝑦) is a diffeomorphism;
(b) 𝜋 decreases area (for any open subset 𝑊 ⊆ 𝑆 the area of 𝜋(𝑊) is ≤ to the area
of 𝑊);
(c) if 𝜋 preserves areas then 𝑆 is contained in a horizontal plane.
53. The gnomonic projection PΠ ∶ S2 ∖𝛾Π → Π sends each point on the sphere (except
the points on a certain maximal circle 𝛾Π ) into a tangent plane Π by projecting it
from the center of the sphere. Show that:
2.5 Areas, Lengths, and Angles: The First Fundamental Form 57
(a) two points have the same image for PΠ if and only if they are antipodes, and
the restriction PΠ to each of the hemispheres of S2 ∖𝛾Π is a diffeomorphism;
(b) the maximal circles of S2 are transformed into the lines of Π;
(c) the two drawings of Fig. 2.10 are related by a gnomonic projection.
Chapter 3
The Geometry of the Gauss Map
In this chapter we deal with the extrinsic geometry of the surface, by defining
quantities (curvatures) that express how the surface is embedded in its ambient space.
The main tool for this study is the normal vector field to the surface; hence, we shall
deal only with oriented surfaces.
The next proposition, whose complete proof can be found in numerous Linear Algebra
texts, gathers the essentials about symmetric endomorphisms.
Proposition 3.1.1 Let 𝐸 be a space with inner product ⟪ ⋅, ⋅ ⟫, B = (e1 , . . . , e𝑛 ) an
orthonormal basis of 𝐸, and 𝐿∶ 𝐸 → 𝐸 an endomorphism. Then:
(i) 𝐿 is symmetric if and only if its matrix with respect to the basis B is symmetric;
(ii) if 𝐿 is symmetric, 𝐸 has an orthonormal basis formed by eigenvectors of 𝐿.
Proof (i) Since B is orthonormal, the matrix of 𝐿 in this basis is 𝑀 = (𝑎 𝑖 𝑗 )1≤𝑖, 𝑗≤𝑛
given by 𝑎 𝑖 𝑗 = ⟪e𝑖 , 𝐿(e 𝑗 )⟫. We thus observe that if 𝐿 is symmetric then 𝑎 𝑖 𝑗 = 𝑎 𝑗𝑖 for
all 𝑖, 𝑗, a condition, that expresses the symmetry of 𝑀. Conversely, if 𝑀 is symmetric
then ⟪e𝑖 , 𝐿(e 𝑗 )⟫ = ⟪𝐿(e𝑖 ), e 𝑗 ⟫ for all 1 ≤ 𝑖, 𝑗 ≤ 𝑛, and it follows that the symmetry
condition (3.2) is verified for any two vectors that are linear combinations of the e𝑖 ;
but every vector of 𝐸 is such a linear combination, and therefore 𝐿 is symmetric.
(ii) We give the proof only in the case 𝑛 = 2, the only one we will need. Let
𝑎𝑐
𝑀 =[ ]
𝑐𝑏
symmetric because of the symmetry of 𝜁. Let 𝐿 be the linear mapping whose matrix
with respect to B is 𝑀: by 3.1.1 (i), 𝐿 is symmetric. A simple calculation shows that
𝜁(v, w) = ⟪v, 𝐿(w)⟫. By 3.1.1 there exists an orthonormal basis C of 𝐸 formed by
eigenvectors of 𝐿, and C is the sought basis. ◻
Proposition 3.1.3 (i) The normal curvature 𝑘 𝑛 (𝛼, 𝑠) at 𝛼(𝑠) depends only on the
tangent direction to the curve at instant 𝑠: more precisely, if 𝛼 and 𝛽 are curves in 𝑆
tangent to each other at 𝛼(𝑠0 ) = 𝛽(𝑡0 ) = 𝑝 0 then 𝑘 𝑛 (𝛼, 𝑠0 ) = 𝑘 𝑛 (𝛽, 𝑡0 ).
(ii) The set of normal curvatures at 𝑝 0 is the interval [𝑘 1 (𝑝 0 ), 𝑘 2 (𝑝 0 )]. If
𝑘 1 (𝑝 0 ) < 𝑘 2 (𝑝 0 ), then the minimum and maximum of these normal curvatures are
the principal curvatures at 𝑝 0 , which occur precisely in the principal directions
associated with 𝑘 1 (𝑝 0 ) and 𝑘 2 (𝑝 0 ).
Proof Let us put 𝑁(𝑠) = 𝑁 ○ 𝛼(𝑠). Differentiating the equality ⟨𝛼′ (𝑠), 𝑁(𝑠)⟩ = 0,
we obtain ⟨𝛼′′ (𝑠), 𝑁(𝑠)⟩ + ⟨𝛼′ (𝑠), 𝑁 ′ (𝑠)⟩ = 0, and from this we get 𝑘 𝑛 (𝛼, 𝑠) =
⟨𝛼′ (𝑠), −𝑁 ′ (𝑠)⟩ = ⟨𝛼′ (𝑠), −𝐷𝑁 𝑝 (𝛼′ (𝑠))⟩, where we let 𝑝 = 𝛼(𝑠). This equality
shows that 𝑘 𝑛 (𝛼, 𝑠) only depends on 𝛼′ (𝑠) ∈ 𝑇 𝑝 𝑆 and proves statement (i).
Let us now fix 𝑝 0 = 𝛼(𝑠0 ) and let (v1 , v2 ) be an orthonormal basis of 𝑇 𝑝0 𝑆
consisting of eigenvectors of −𝐷𝑁 𝑝0 . Putting 𝛼′ (𝑠0 ) = 𝑎 v1 +𝑏 v2 , we have 𝑎 2 +𝑏 2 = 1;
furthermore,
𝑘 1 = 𝑘 1 (𝑎 2 + 𝑏 2 ) ≤ 𝑘 𝑛 (𝛼, 𝑠0 ) ≤ 𝑘 2 (𝑎 2 + 𝑏 2 ) = 𝑘 2 ,
From which it follows that the normal curvatures cover the entire interval [𝑘 1 , 𝑘 2 ] and
that if 𝑘 1 < 𝑘 2 then the minimum is only reached for 𝛼′ (𝑠0 ) = ±v1 and the maximum
for 𝛼′ (𝑠0 ) = ±v2 . ◻
62 3 The Geometry of the Gauss Map
As we have said, the normal curvature 𝑘 𝑛 (𝛼, 𝑠) gives the component of the
curvature vector 𝛼′′ (𝑠0 ) of 𝛼 in the direction of the normal 𝑁 ○ 𝛼(𝑠) to the surface.
If these vectors are collinear, i.e., if the principal normal to the curve 𝛼 at instant 𝑠
points in the direction of the normal to the surface at 𝛼(𝑠), then the absolute value of
𝑘 𝑛 (𝛼, 𝑠) is equal to the curvature of 𝛼 at that point.
Given an arbitrary direction v ∈ 𝑇 𝑝 𝑆 with ∣v∣ = 1, there exists at least one curve
that passes through 𝑝 with velocity v, and whose principal normal at 𝑝 points in the
direction of 𝑁(𝑝): the intersection of 𝑆 with the plane that passes through 𝑝 and is
parallel to the vectors 𝑁(𝑝) and v (see Fig. 3.1 and Exercise 34 in Section 2.2). A
curve obtained this way is called a normal section of 𝑆 at 𝑝.
Figure 3.1
Note that any point on the surface belongs to one and only one of the first four
classes, but that the umbilical points can be either elliptical or planar. We further note
that although the sign of the principal curvatures depends on the orientation of 𝑆,
the Gaussian curvature does not change when one changes the orientation and the
classification of the points we gave above does not depend on the orientation chosen.
Hence, since any surface is locally orientable (any parametrized neighborhood has
an orientation induced by the parameterization), the classification given extends also
to points on non-orientable surfaces.
In the next section we study the meaning of these definitions with the help of local
coordinates. Now we give some examples.
Examples 3.1.4 A. All points in a plane are planar: in fact, the normal sections are
straight lines, and therefore have zero curvature. This means that all normal curvatures
(and hence both principal curvatures) of the plane are zero.
B. The normal sections of a sphere with radius 𝑟 are maximal circles whose curvature
is 1/𝑟. This means that at each point, the absolute values of all normal curvatures
are equal to 1/𝑟, and therefore the two principal curvatures are equal (if they were
−1/𝑟 and 1/𝑟, some normal curvature would be zero, which is not the case) and
have absolute value 1/𝑟. All points on the sphere are therefore umbilical, and their
Gaussian curvature is constant and positive, equal to 1/𝑟 2 .
Of course, the analysis of the signs of the normal curvatures could be replaced, in
this example, by a simple calculation. But with this analysis we illustrate a useful
principle: at the point 𝑝 ∈ 𝑆 there is some direction in which the normal curvature
vanishes (called an asymptotic direction) if and only if 𝐾(𝑝) ≤ 0.
C. Consider the hyperbolic paraboloid {(𝑥, 𝑦, 𝑧)∶ 𝑧 = 𝑥 2 − 𝑦 2 } with the orientation
induced by the parameterization Φ(𝑢, 𝑣) = (𝑢, 𝑣, 𝑢 2 − 𝑣 2 ), i.e.
1
𝑁(𝑢, 𝑣) = √ (−2𝑢, 2𝑣, 1).
1 + 4𝑢 2 + 4𝑣 2
At the point 𝑂 = Φ(0, 0) we have
The principal curvatures at 𝑂 are thus 2 and −2, the principal directions are those of
the vectors Φ𝑢 = (1, 0, 0) and Φ𝑣 = (0, 1, 0), and the Gaussian curvature is negative,
equal to −4.
D. Let 𝐸 be the ellipsoid
𝑥 2 𝑦2 𝑧2
+ + = 1,
𝑎2 𝑏2 𝑐2
where 𝑎 ≥ 𝑏 ≥ 𝑐 > 0. We use the symmetric bilinear form 𝜁∶ R 3 × R 3 → R given by
𝑥𝑥 ′ 𝑦𝑦 ′ 𝑧𝑧′
𝜁((𝑥, 𝑦, 𝑧), (𝑥 ′ , 𝑦 ′ , 𝑧 ′ )) = + 2 + 2
𝑎2 𝑏 𝑐
64 3 The Geometry of the Gauss Map
𝑥 2 𝑦2 𝑧2
Q(𝑥, 𝑦, 𝑧) = + + ⋅
𝑎2 𝑏2 𝑐2
Given 𝑝 0 ∈ E and a plane Π passing through the origin (0, 0, 0), let us study the
intersection of E with the plane Π0 = 𝑝 0 + Π.
Note that, for every vector w ∈ R 3 , we have
1 1
2
∣w∣2 ≤ Q(w) ≤ 2 ∣w∣2 . (3.3)
𝑎 𝑐
Given now w ∈ Π (so that 𝑝 0 + w ∈ Π0 ), we have
̃ 𝑥 v1 + ̃ ̃𝑥2 ̃𝑦2
Q(̃ 𝑦 v2 ) = 2 + ,
𝑎̃ ̃
𝑏2
𝑎̃𝑟 𝑎̃ ̃
𝑏𝑟 ̃
𝑏
= and =
(̃
𝑏 𝑟)
2 ̃
𝑏2𝑟 (̃
𝑎 𝑟)
2 𝑎̃2 𝑟
(see Exercise 6, in Section 1.3). Since F ⊆ E and the diameter of E is 2𝑎, we have
𝑎̃𝑟 ≤ 𝑎. Hence, the curvature of F is, at all points, greater than or equal to
̃
𝑏 ̃
𝑏 𝑐
≥ ≥ 2⋅
𝑎̃ 𝑟 𝑎 𝑎̃ 𝑎
2
𝑐
We thus prove that the normal curvatures of E are not less than in absolute value.
𝑎2
Hence (see example B) the Gaussian curvature of E is positive at all points, being
bounded by 𝑐2 /𝑎 4 .
We now want to obtain an upper bound for the principal curvatures of E, and for
this we need a lower bound for the diameter of its normal sections. The normal line
to E at the point (𝑥 0 , 𝑦 0 , 𝑧 0 ) ∈ E, given by
𝑥0 𝑦 0 𝑧0
𝑡 ↦ (𝑥 0 , 𝑦 0 , 𝑧 0 ) + 𝑡 ( , , ),
𝑎2 𝑏2 𝑐2
intersects E for 𝑡 = 0 and for 𝑡 = 𝑡 0 , where
𝑥2 𝑦02 𝑧02
−2 ( 𝑎04 + 𝑏4
+ 𝑐4
)
𝑡0 = ⋅
𝑥02 𝑦02 𝑧02
𝑎6
+ 𝑏6
+ 𝑐6
The length of the line segment between the two intersections is then
3/2
√ 𝑥2 𝑦02 𝑧02
𝑥02 𝑦 20 𝑧20 2( 𝑎04 + 𝑏4
+ 𝑐4
)
∣𝑡0 ∣ + + =
𝑎4 𝑏4 𝑐4 𝑥02 𝑦02 𝑧02
𝑎6
+ 𝑏6
+ 𝑐6
√
𝑥02 𝑦 20 𝑧 20
≥ 2𝑐 2
+ +
𝑎4 𝑏4 𝑐4
√
2𝑐2 𝑥02 𝑦 20 𝑧 20 2𝑐2
≥ + + = ⋅
𝑎 𝑎2 𝑏2 𝑐2 𝑎
Thus, under the assumption that F = E ∩ Π0 is a normal section of E, we have
𝑐2 1 𝑎̃ 𝑎
𝑎̃𝑟 ≥ , or ≤ 2 . The curvature of F is then not greater than
𝑎 𝑟 𝑐
𝑎̃ 𝑎̃2 𝑎 𝑎3
≤ ≤ 4,
̃
𝑏2𝑟 ̃ 𝑏2 𝑐2 𝑐
and so the absolute values of the normal curvatures of E are also not greater than
𝑎 3 /𝑐4 . In conclusion: at any point 𝑝 ∈ E we have
𝑐2 𝑎6
≤ 𝐾(𝑝) ≤ ⋅
𝑎4 𝑐8
66 3 The Geometry of the Gauss Map
Note that when E is a sphere then 𝑎 = 𝑏 = 𝑐, and both these inequalities become
the equality already seen in example B. It should however be made clear that it is
possible, by more ingenious methods than ours, to obtain an explicit expression for
the curvature of the points of E. This example was intended to show that we can
estimate (in this case obtain lower and upper bounds) the Gaussian curvature without
computing it explicitly. ◻
In examples 3.1.4 A, B we observed that all points on the plane and the sphere are
umbilical; we end the section by showing that these are the only surfaces with such a
property.
Proposition 3.1.5 Let 𝑆 be a connected surface whose points are all umbilical. Then
𝑆 is contained in a sphere or in a plane.
Proof Our hypothesis implies the existence of a function 𝜆∶ 𝑆→R such that, for every
𝑝∈𝑆, 𝐷𝑁 𝑝 is a homothety of ratio 𝜆(𝑝). Let us take a parameterization (𝑈, Φ) of 𝑆,
with 𝑈 connected, and put 𝑁(𝑢, 𝑣) = 𝑁 ○ Φ(𝑢, 𝑣) and 𝜆(𝑢, 𝑣) = 𝜆 ○ Φ(𝑢, 𝑣). We then
have
𝑁𝑢 = 𝜆(𝑢, 𝑣)Φ𝑢
(3.4)
𝑁 𝑣 = 𝜆(𝑢, 𝑣)Φ𝑣
and these equalities ensure that 𝜆(𝑢, 𝑣) is differentiable, for we deduce from them
that
⟨𝑁𝑢 , Φ𝑢 ⟩ ⟨𝑁 𝑣 , Φ𝑣 ⟩
𝜆(𝑢, 𝑣) = = ⋅
∣Φ𝑢 ∣2 ∣Φ𝑣 ∣2
By differentiation of (3.4) we obtain
𝑁𝑢𝑣 = 𝜆 𝑣 Φ𝑢 + 𝜆Φ𝑢𝑣
𝑁 𝑣𝑢 = 𝜆𝑢 Φ𝑣 + 𝜆Φ𝑣𝑢
𝜆 𝑣 Φ𝑢 − 𝜆𝑢 Φ𝑣 = 0,
an equality which is only possible when 𝜆 𝑣 and 𝜆𝑢 are identically zero. Thus, the
function 𝜆(𝑢, 𝑣) is constant on 𝑈, and therefore 𝜆∶ 𝑆 → R is locally constant, hence
(since 𝑆 is connected) constant, equal to 𝜆 ∈ R .
If 𝜆 = 0 then by (3.4) the normal vector 𝑁 is constant, and the function 𝑝 ↦ ⟨𝑁, 𝑝⟩
is locally constant, hence constant, on 𝑆, which means that 𝑆 is contained in a plane
{𝑝 ∈ R 3 ∶ ⟨𝑁, 𝑝⟩ = 𝑎}, for some 𝑎 ∈ R .
1
If 𝜆 ≠ 0 then, again by (3.4), the mapping 𝑆 → R 3 given by 𝑝 ↦ 𝑝 − 𝑁(𝑝) is
𝜆
locally constant, hence constant. Denoting by 𝑞 0 this constant, we have, for all 𝑝 in 𝑆,
∣𝑝 − 𝑞 0 ∣ = ∣ 𝜆1 𝑁(𝑝)∣ = ∣𝜆∣
1
, and therefore 𝑆 is contained in the sphere with center 𝑞 0
and radius 1
∣𝜆∣ . ◻
Exercises
3.2 The Second Fundamental Form 67
54. What is the region on the sphere covered by the image of the Gauss map of the
surface given by the equation: (i) 𝑧 = 𝑥 2 +𝑦 2 ; (ii) 𝑥 2 +𝑦 2 −𝑧 2 = 1; (iii) 𝑥 2 +𝑦 2 = cosh2 𝑧.
55.(a) Compute the principal curvatures at (0, 0, 0) of each of the following surfaces:
(i) 𝑧 = 𝑥 2 +𝑦 3 ; (ii) 𝑧 = 𝑥 2 +𝑦 4 ; (iii) 𝑧 = 𝑥 3 −3𝑥𝑦 2 . Sketch the surface (iii), indicating
the region that lies above the plane 𝑧 = 0.
(b) Conclude that when 𝑝 0 is a parabolic(or planar)pointof 𝑆 the following two
cases are possible: (i) there exists a neighborhood of 𝑝 0 in 𝑆 that lies entirely on the
same side of the tangent plane to 𝑆 at 𝑝 0 ; (ii) any neighborhood of 𝑝 0 in 𝑆 contains
points on both sides of the tangent plane.
56. (a) Show that at a point on a surface, the arithmetic mean of the normal curvatures
in two orthogonal directions is equal to the mean curvature at that point.
(b) Show that the mean curvature at 𝑝 ∈ 𝑆 is given by 𝜋1 ∫0 𝑘 𝑛 (𝜃) 𝑑𝜃, where
𝜋
𝑘 𝑛 (𝜃) is the normal curvature at 𝑝 in the direction that makes an angle 𝜃 with a fixed
principal direction.
57. Let 𝑆 be an oriented regular surface, and suppose that 𝑝 0 ∈ 𝑆 is a maximum of
the function 𝑓 ∶ 𝑆 → R , 𝑓 (𝑝) = ∣𝑝∣2 . Show that:
(a) the line segment [𝑂, 𝑝 0 ] is orthogonal to 𝑆 at 𝑝 0 ;
(b) the Gaussian curvature of 𝑆 at 𝑝 0 is greater than or equal to 1/ 𝑓 (𝑝 0 ) (use
Exercise 8, in Section 1.3);
(c) if 𝑆 is compact, then 𝑆 has some point with positive Gaussian curvature.
58. Show that if a surface is tangent to a plane along a regular curve then the points
on that curve are parabolic or planar.
59. Let 𝑝 be a hyperbolic point of 𝑆, and assume there exists a neighborhood 𝑈 of 𝑝
in 𝑆 such that (𝑝 + 𝑇 𝑝 𝑆) ∩ 𝑈 is the union of two regular curves that intersect at 𝑝.
Show that the tangent line at 𝑝 to each of these curves defines an asymptotic direction
in 𝑇 𝑝 𝑆.
Using local coordinates, we will in this section continue the study of Gaussian
curvature, obtaining explicit formulas to compute it and a better understanding of its
geometric meaning. The tool is again a quadratic form, now related to the Gaussian
normal mapping.
We observed in Section 3.1 that 𝐷𝑁 𝑝 ∶ 𝑇 𝑝 𝑆 → 𝑇 𝑝 𝑆 is a symmetric linear mapping
with respect to the inner product ⟨⋅, ⋅⟩ 𝑝 on 𝑇 𝑝 𝑆, which means that the bilinear
form (v, w) ↦ ⟨v, −𝐷𝑁 𝑝 (w)⟩ is symmetric. The second fundamental form at
𝑝 ∈ 𝑆 is the quadratic form associated with this symmetric bilinear form, i.e.
Π 𝑝 (v) = ⟨v, −𝐷𝑁 𝑝 (v)⟩.
From the proof of 3.1.3 it follows that the normal curvature at 𝑝 in the direction
of v is precisely Π 𝑝 (v) when v∣ = 1 [if v ≠ 0 is not a unit vector, then that normal
curvature is given by Π 𝑝 (v)/∣v∣2 = Π 𝑝 (v)/𝐼 𝑝 (v)] and that the principal curvatures
68 3 The Geometry of the Gauss Map
are the maximum and minimum of the set {Π 𝑝 (v)∶ v ∈ 𝑇 𝑝 𝑆, ∣v∣ = 1}. This means that
at each point of the surface the second fundamental form gathers all information
about normal curvatures, principal curvatures and Gaussian curvature.
𝑒 𝑓
Given a parameterization Φ(𝑢, 𝑣), we want to determine the matrix ( ) of
𝑓 𝑔
ΠΦ(𝑢,𝑣) relative to the base (Φ𝑢 , Φ𝑣 ) of 𝑇Φ(𝑢,𝑣) 𝑆. The entries 𝑒, 𝑓 , 𝑔 of this matrix,
which are functions of (𝑢, 𝑣), are called the coefficients of the second fundamental
form in the coordinates (𝑢, 𝑣), and are computed by the formulas
For computational purposes, formulas that do not involve the derivatives of 𝑁 are
in general easier to handle. Once we have computed 𝑒, 𝑓 , 𝑔 it is easy to compute
the normal curvature of a curve 𝛼(𝑡) = Φ(𝑢(𝑡), 𝑣(𝑡)) expressed in local coordinates:
writing 𝛼′ (𝑡) = 𝑢 ′ (𝑡)Φ𝑢 + 𝑣 ′ (𝑡)Φ𝑣 , we have
𝑒𝑢 ′2 + 2 𝑓 𝑢 ′ 𝑣 ′ + 𝑔𝑣 ′2
𝑘 𝑛 (𝑡) =
∣𝛼′ (𝑡)∣2
(3.5)
𝑒𝑢 ′2 + 2 𝑓 𝑢 ′ 𝑣 ′ + 𝑔𝑣 ′2
= ⋅
𝐸𝑢 ′2 + 2𝐹𝑢 ′ 𝑣 ′ + 𝐺𝑣 ′2
𝑎 11 𝑎 12
Let us now determine the matrix [ ] of −𝐷𝑁Φ(𝑢,𝑣) relative to the basis
𝑎 21 𝑎 22
(Φ𝑢 , Φ𝑣 ): the entries of the matrix are determined by the equalities
3.2 The Second Fundamental Form 69
−𝑁𝑢 = 𝑎 11 Φ𝑢 + 𝑎 21 Φ𝑣
−𝑁 𝑣 = 𝑎 12 Φ𝑢 + 𝑎 22 Φ𝑣 .
Forming the inner product of each of these equalities with Φ𝑢 and with Φ𝑣 we obtain
𝑒 = 𝑎 11 𝐸 + 𝑎 21 𝐹 𝑓 = 𝑎 11 𝐹 + 𝑎 21 𝐺
𝑓 = 𝑎 12 𝐸 + 𝑎 22 𝐹 𝑔 = 𝑎 12 𝐹 + 𝑎 22 𝐺,
𝑒 𝑓 𝐸 𝐹 𝑎 11 𝑎 12
[ ]=[ ][ ],
𝑓 𝑔 𝐹 𝐺 𝑎 21 𝑎 22
i.e.
−1
𝑎 11 𝑎 12 𝐸 𝐹 𝑒 𝑓
[ ]=[ ] [ ]
𝑎 21 𝑎 22 𝐹𝐺 𝑓 𝑔
(3.6)
1 𝐺 −𝐹 𝑒 𝑓
= [ ][ ]⋅
𝐸𝐺 − 𝐹 2 −𝐹 𝐸 𝑓 𝑔
From this we derive explicit formulas for the Gaussian curvature and mean curvature:
𝑒𝑔 − 𝑓 2
𝐾 ○ Φ(𝑢, 𝑣) = det ( −𝐷𝑁Φ(𝑢,𝑣) ) = (3.7)
𝐸𝐺 − 𝐹 2
1 1
𝐻 ○ Φ(𝑢, 𝑣) = tr ( −𝐷𝑁Φ(𝑢,𝑣) ) = (𝑎 11 + 𝑎 22 )
2 2
𝐺𝑒 − 2𝐹 𝑓 + 𝐸𝑔 (3.8)
= ;
2(𝐸𝐺 − 𝐹 2 )
and also for the principal curvatures 𝑘 1 and 𝑘 2 , which are the eigenvalues of the
matrix [𝑎 𝑖 𝑗 ]: √ √
𝑘 1 = 𝐻 − 𝐻 2 − 𝐾, 𝑘 2 = 𝐻 + 𝐻 2 − 𝐾. (3.9)
has coefficients
−𝑧 𝜌 𝜌𝑧 − 𝜌𝑧
𝑒 =√ , 𝑓 = 0, 𝑔= √ ,
𝜌2 + 𝑧2 𝜌2 + 𝑧2
𝐸=𝜌 ,
2
𝐹 = 0, 𝐺 = 𝜌2 + 𝑧2 .
It then follows from (3.6) that the matrix [𝑎 𝑖 𝑗 ] is diagonal. The principal directions
at the non-umbilical points are thus those of Φ𝑢 and Φ𝑣 , the tangent lines to the
meridians and the parallels. The principal curvatures are
70 3 The Geometry of the Gauss Map
𝑒 −𝑧 𝑔 𝜌𝑧 − 𝜌𝑧
= √ and =
𝐸 𝜌 𝜌2 + 𝑧2 𝐺 (𝜌 2 + 𝑧2 )3/2
−𝜌(𝜌 2 + 𝑧2 ) −𝜌
𝐾= 2
= .
𝜌(𝜌 2 + 𝑧2 ) 𝜌
Using this formula we will now determine the surfaces of revolution of constant
curvature; to do this we simply solve the differential equation 𝜌 + 𝐾 𝜌 = 0, with 𝐾
constant. Leaving the cases 𝐾 = 0 and 𝐾 < 0 as an exercise, let us deal with the case
𝐾 > 0; to simplify the formulas we let 𝐾 = 1.
The general solution of the equation 𝜌 + 𝜌 = 0 is of the form 𝜌(𝑣) = 𝐶 cos(𝑣 + 𝐵),
where 𝐶 and 𝐵 are constants. We can choose 𝐵 = 0, since the solutions we obtain
with 𝐵 ≠ 0 correspond only to a translation of the domain; so we are reduced to
the solutions 𝜌𝐶 (𝑣) = 𝐶 cos 𝑣. Since we want 𝜌 > 0, we take 𝐶 > 0 and restrict 𝑣
to an interval of the form ] − 𝑣 0 , 𝑣 0 [. Putting 𝑧𝐶 (0) = 0 and integrating the equality
𝜌𝐶2
+ 𝑧𝐶
2
= 1, we obtain
𝑣 √
𝑧𝐶 (𝑣) = ∫ 1 − 𝐶 2 sin2 𝑡 𝑑𝑡
0
Figure 3.2
𝜕ℎ 𝜕ℎ 1 𝜕2 ℎ 𝜕2 ℎ 𝜕2 ℎ
ℎ(𝑢, 𝑣) = ℎ(0, 0) + 𝑢+ 𝑣 + ( 2 𝑢 2 +2 𝑢𝑣 + 2 𝑣 2 ) + 𝑅(𝑢, 𝑣),
𝜕𝑢 𝜕𝑣 2 𝜕𝑢 𝜕𝑢𝜕𝑣 𝜕𝑣
𝑅(𝑢, 𝑣)
lim = 0.
(𝑢,𝑣)→(00) 𝑢2 + 𝑣 2
𝜕ℎ 𝜕ℎ
A quick calculation shows that ℎ(0, 0) = = = 0 and that the second-order
𝜕𝑢 𝜕𝑣
derivatives are nothing else but the coefficients at 𝑝 0 of the second fundamental form:
i.e.,
72 3 The Geometry of the Gauss Map
𝜕2 ℎ 𝜕2 ℎ 𝜕2 ℎ
= 𝑒, = 𝑓, = 𝑔.
𝜕𝑢 2 𝜕𝑢𝜕𝑣 𝜕𝑣 2
This leaves us with
1
ℎ(𝑢, 𝑣) = (𝑒𝑢 2 + 2 𝑓 𝑢𝑣 + 𝑔𝑣 2 ) + 𝑅(𝑢, 𝑣). (3.10)
2
Let us put Q(𝑢, 𝑣) = 𝑒𝑢 2 + 2 𝑓 𝑢𝑣 + 𝑔𝑣 2 . By (3.7), 𝐾(𝑝 0 ) and 𝑒𝑔 − 𝑓 2 have the same
sign. If 𝑒𝑔 − 𝑓 2 > 0 then 𝑒, 𝑔 are both positive or both negative: let us now show
that in this case there exists 𝛿 > 0 such that ℎ(𝑢, 𝑣) is nonzero and has constant sign
(positive if 𝑒, 𝑔 > 0, negative otherwise) whenever 0 < ∣(𝑢, 𝑣)∣ < 𝛿.
Assuming then that 𝑒𝑔 − 𝑓 2 > 0 and that 𝑒, 𝑔 are positive, we can write
𝑓 2 𝑒𝑔 − 𝑓 2 2 𝑒𝑔 − 𝑓 2 2
Q(𝑢, 𝑣) = 𝑒(𝑢 + 𝑣) + 𝑣 ≥ 𝑣 ,
𝑒 𝑒 𝑒
and analogously,
𝑒𝑔 − 𝑓 2 2
Q(𝑢, 𝑣) ≥ 𝑢 .
𝑔
From these inequalities we obtain Q(𝑢, 𝑣) ≥ 2𝑀(𝑢 2 + 𝑣 2 ), where
𝑒𝑔 − 𝑓 2 1 1
𝑀= min { , } > 0.
4 𝑔 𝑒
ℎ(𝑢, 𝑣) 𝑅(𝑢, 𝑣)
≥𝑀− 2 >0
𝑢2 + 𝑣 2 𝑢 + 𝑣2
whenever 0 < ∣(𝑢, 𝑣)∣ < 𝛿, provided that 𝛿 > 0 is chosen sufficiently small. The case
where 𝑒, 𝑔 are negative is treated analogously, and the first statement is thus proved.
If 𝑒𝑔 − 𝑓 2 < 0 then we can write Q(𝑢, 𝑣) as a product 𝔏1 (𝑢, 𝑣)𝔏2 (𝑢, 𝑣) of linearly
independent linear forms 𝔏𝑖 ∶ R 2 → R . The two lines 𝔏𝑖 (𝑢, 𝑣) = 0 divide the plane
into four sectors: in two of these sectors 𝔏1 and 𝔏2 have equal signs, and in the other
two they have opposite signs. There are therefore unit vectors (𝑢 0 , 𝑣 0 ), (𝑢 1 , 𝑣 1 ) ∈ R 2
such that 𝑎 0 = Q(𝑢 0 , 𝑣 0 ) < 0 < Q(𝑢 1 , 𝑣 1 ) = 𝑎 1 . Now observe that
for 𝑖 = 0.1. This shows that ℎ(𝑢, 𝑣) takes positive and negative values for (𝑢, 𝑣)
arbitrarily close to the origin and proves (ii). ◻
Let us now study the contact between two surfaces. Assume that 𝑆1 and 𝑆2 are
tangent at 𝑝 0 , and denote 𝜋∶ R 2 → 𝑝 0 + 𝑇 𝑝0 𝑆1 the orthogonal projection on the
3.2 The Second Fundamental Form 73
ℎ1 (𝑢, 𝑣) − ℎ2 (𝑢, 𝑣)
lim = 0.
(𝑢,𝑣)→(0,0) 𝑢2 + 𝑣 2
Expanding the functions ℎ𝑖 into Taylor polynomials, we see that this happens if and
only ℎ1 and ℎ2 and their partial derivatives up to second order are equal at (0, 0);
but, since we have ℎ𝑖 (0, 0) = 0 and
𝜕ℎ𝑖 𝜕ℎ𝑖
∣ = ∣ =0
𝜕𝑢 (0,0) 𝜕𝑣 (0,0)
for 𝑖 = 1, 2, this condition boils down to
𝜕 2 ℎ1 𝜕 2 ℎ2
∣(0,0) = ∣ ,
𝜕𝑢 2 𝜕𝑢 2 (0,0)
𝜕 2 ℎ1 𝜕 2 ℎ2
∣(0,0) = ∣ , (3.11)
𝜕𝑢𝜕𝑣 𝜕𝑢𝜕𝑣 (0,0)
𝜕 2 ℎ1 𝜕 2 ℎ2
∣ = ∣ ⋅
𝜕𝑣 2 (0,0) 𝜕𝑣 2 0,0)
Now if 𝑆1 and 𝑆2 are oriented such that their normal vectors both coincide with 𝑁0
at the point 𝑝 0 , then the second-order derivatives of ℎ1 and ℎ2 are the coefficients
of the second fundamental form at 𝑝 0 of the parameterizations Φ(𝑢, 𝑣) and Ψ(𝑢, 𝑣).
Since Φ𝑢 ∣ 𝑝 = Ψ𝑢 ∣ 𝑝 = w1 and Φ𝑣 ∣ 𝑝 = Ψ𝑣 ∣ 𝑝 = w2 , the equality of the coefficients
0 0 0 0
implies that the second fundamental forms of the two surfaces coincide at the point
𝑝 0 . We thus proved half of the following proposition.
74 3 The Geometry of the Gauss Map
Proposition 3.2.3 Let 𝑆1 and 𝑆2 be two oriented surfaces tangent at 𝑝 0 whose normal
vectors coincide at that point. Then the following two conditions are equivalent:
(i) 𝑆1 and 𝑆2 have contact of order ≥ 2 at 𝑝 0 ;
(ii) the restrictions on 𝑇 𝑝0 𝑆1 = 𝑇 𝑝0 𝑆2 of the second fundamental forms of 𝑆1 and
𝑆2 coincide.
An asymptotic line is a regular curve whose velocity vector defines at each point
an asymptotic direction — that is, it is a curve whose normal curvature is constant
and equal to zero. It follows from the argument in example 3.1.4 B that at all points
on an asymptotic line the Gaussian curvature is nonpositive. In local coordinates,
formula (3.5) implies that a regular curve Ψ(𝑢(𝑡), 𝑣(𝑡)) is asymptotic if and only if
𝑒(𝑢 ′ ) + 2 𝑓 𝑢 ′ 𝑣 ′ + 𝑔(𝑣 ′ ) ≡ 0.
2 2
Exercises
60. Define orientations for 𝑆1 = {(𝑥, 𝑦, 𝑧)∶ 𝑧 = 𝑥 2 − 𝑦 2 } and 𝑆2 = {(𝑥, 𝑦, 𝑧)∶ 𝑧 =
𝑥 3 − 3𝑥𝑦 2 }, and then determine for each of these surfaces:
(a) the Gaussian curvature and the mean curvature at each point;
(b) the points where the mean curvature vanishes.
(c) Do any of the answers to the above questions depend on the chosen orientations?
61. (a) Show that the Gaussian curvature of the Möbius strip M (with the parameteri-
zation given in 2.4.2) is given by
3.2 The Second Fundamental Form 75
−2
1 𝜃 2
𝐾(𝜃, 𝑡) = −( 𝑡 2 + (2 − 𝑡 sin ) ) .
4 2
(b) Show that if 𝑆 is a surface of strictly positive curvature at all its points, then
𝑆 is orientable. Show further that it is possible to choose this orientation such that
all principal curvatures are positive. Hint: For every point 𝑝 0 of 𝑆, let 𝑈 be the
neighborhood given by 3.2.2 (i) and 𝑁(𝑝 0 ) the normal vector pointing to the side
where 𝑈 lies; then 𝑁∶ 𝑆 → S2 is an orientation of 𝑆.
62. Show that the pseudosphere (Exercise 32) has constant negative curvature.
63. (a) Show that the only surfaces of revolution with zero constant curvature are the
cylinder, cone and plane.
(b) Show that any surface of revolution of constant curvature 𝐾 = −1 is, up to
reparametrization, of the form Φ(𝑢, 𝑣) = (𝜌(𝑣) cos 𝑢, 𝜌(𝑣) sin 𝑢, 𝑧(𝑣)), where the
generating curve 𝑣 ↦ (𝜌(𝑣), 0, 𝑧(𝑣)) is of one of the following three types:
√
(i) 𝜌(𝑣) = 𝐴 cosh 𝑣, 𝑧(𝑣) = ∫0 1 − 𝐴2 sinh2 𝑡 𝑑𝑡, 𝐴 > 0;
𝑣
𝑣√
(ii) 𝜌(𝑣) = 𝐴𝑒 −𝑣 , 𝑧(𝑣) = ∫0 1 − 𝐴2 𝑒 −2𝑡 𝑑𝑡, 𝐴 > 0;
√
(iii) 𝜌(𝑣) = 𝐴 sinh 𝑣, 𝑧(𝑣) = ∫0 1 − 𝐴2 cosh2 𝑡 𝑑𝑡, 0 < 𝐴 < 1;
𝑣
In each of the cases find the domain of 𝑣 and sketch the generating curve.
(c) Show that the surface of type (ii) is the pseudosphere.
64. Let 𝑝 0 ∈ 𝑆 be such that 𝐾(𝑝 0 ) ≠ 0, and let (𝑈, Φ) be a parametrized neighborhood
of 𝑝 0 where 𝐾 has constant sign. Show that:
(a) 𝑁𝑢 × 𝑁 𝑣 = 𝐾(𝑢, 𝑣)(Φ𝑢 × Φ𝑣 );
(b) if 𝑉 ⊆ Φ(𝑈), then the area of 𝑁(𝑉) ⊆ S2 is given by ∫ ∣𝐾∣ 𝑑𝜎;
𝑉
area of𝑁 (𝑉)
(c) ∣𝐾(𝑝 0 )∣ is the limit of the ratio area of𝑉 as the diameter of its neighborhood
𝑉 tends to zero.
66. Let 𝑝 0 be a point shared by the surfaces 𝑆1 and 𝑆2 . Show that the following
assertions are equivalent:
(i) 𝑆1 and 𝑆2 have contact of order ≥ 2 at 𝑝 0 .
(ii) There exist parameterizations Φ(𝑢, 𝑣) and Ψ(𝑢, 𝑣) of 𝑆1 and 𝑆2 such that
𝑝 0 = Φ(0, 0) = 𝜓(0, 0) and
(a) Show that the hyperbolic paraboloid 𝑧 = 𝑥 2 −𝑦 2 and the hyperboloid 𝑥 2 +𝑦 2 −𝑧2 =
1 are ruled surfaces, and find out whether the equation 𝑒 𝑥 = 𝑧 + 𝑦 2 defines a ruled
surface;
(b) Show that the tangent planes to the ruled surface 𝑆 intersect 𝑆 along a line,
which is an asymptotic line; conclude that the curvature of 𝑆 is ≤ 0 at all points.
68. With the notation of Exercise 67, let 𝑆 be a ruled surface without planar points.
Show that the following conditions are equivalent:
i. 𝑆 has curvature 0 at all its points.
ii. The lines 𝜆 ↦ 𝛼(𝑡) + 𝜆v(𝑡) are lines of curvature of 𝑆.
iii. For every 𝑡 ∈ 𝐼, the vector v′ (𝑡) is a linear combination of v(𝑡) and 𝛼′ (𝑡).
69. Given a point 𝑝 on a compact surface 𝑆 ⊆ R 3 , we define the diameter of 𝑆 at 𝑝 by
𝑑(𝑝) = max {∣𝑞 − 𝑝∣∶ 𝑞 ∈ 𝑆}. Assume that 𝑆 has constant diameter 𝑑 — that is, that
𝑑(𝑝) = 𝑑 for every point 𝑝 of 𝑆. Show that:
(a) for every 𝑝 in 𝑆 there exists a single point 𝑓 (𝑝) in 𝑆 such that ∣ 𝑓 (𝑝) − 𝑝∣ = 𝑑;
(b) 𝑓 is an involutive diffeomorphism of 𝑆 (i.e., 𝑓 ○ 𝑓 = id);
(c) 𝑆 has strictly positive curvature, and therefore [Exercise 61-b)] admits an
orientation for which all principal curvatures are positive;
(d) if 𝑘 2 (𝑝) ≥ 𝑘 1 (𝑝) denote the principal curvatures of 𝑆 then
1 1
+ = 𝑑.
𝑘 1 (𝑝) 𝑘 2 ( 𝑓 (𝑝))
A vector field assigns to each point in a surface a vector in the tangent space of the
surface to that point. If this assignment is made in a sufficiently regular way, the
vector field can be interpreted as a velocity field, and so it determines certain curves
(trajectories) on the surface. With this approach we can establish in this section the
existence of curves and parameterizations satisfying certain requirements (such as
asymptotic lines, lines of curvature, and orthogonal parameterizations).
A vector field of class 𝐶 𝑘 (𝑘 ≥ 1) in an open subset 𝑉 ⊆ R 𝑛 is a mapping
v∶ 𝑉 → R 𝑛 of class 𝐶 𝑘 , and a trajectory (or integral curve) of v is a curve 𝜑∶ 𝐼 → 𝑉
such that 𝜑′ (𝑡) = v(𝜑(𝑡)). In other words, a trajectory is a curve whose velocity
at each point is the vector v assigned to that point. The fundamental theorem of
differential equations, which we now state, asserts the existence and uniqueness of the
trajectory passing through each 𝑝 ∈ 𝑉 at a given instant; for its proof we suggest [23].
At a point 𝑝 where the vector field v vanishes, one trajectory of v that passes
through 𝑝 is the constant curve 𝜑(𝑡) = 𝑝; the theorem guarantees that this is the only
trajectory that passes through 𝑝. This means that the singularities of the vector field
Figure 3.3
v correspond to the constant trajectories, and the trajectories of the other points are
regular curves: for if the derivative of 𝜑(𝑡) vanishes at 𝑡 0 , then 𝜑(𝑡 0 ) is a singularity
of v and the only trajectory through 𝜑(𝑡 0 ) is the constant one: so a non-constant
trajectory cannot have points of zero velocity.
The mapping 𝜑(𝑡, 𝑝) in 3.3.1 is called the local flow of the vector field v. A
frequent way to write the differential equation associated with a given field is in the
form
𝑋 = v(𝑋)
where 𝑋 = (𝑥 1 , . . . , 𝑥 𝑛 ) is a point of 𝑉. A differential equation of this type is called
autonomous. A non-autonomous equation is one where the right-hand side depends
explicitly on the independent variable 𝑡, i.e. of the form
neighborhood of 𝑝 0 ) such that, for every 𝑝 ∈ 𝑈, the function 𝑡 ↦ 𝜑(𝑡, 𝑝) is the only
solution of 3.1.2 with 𝜑(𝑡 0 , 𝑝) = 𝑝 defined in the interval ]𝑡 0 − 𝜀, 𝑡 0 + 𝜀[.
This result for non-autonomous equations, although seemingly more general
than 3.3.1, can in fact be easily deduced from it. To this end we consider the
autonomous equation
𝑌 = w(𝑌 ), (3.13)
where we put 𝑌 = (𝑡, 𝑋) ∈ 𝐼 ×𝑉, and where the vector field w∶ 𝐼 ×𝑉 ⊆ R 𝑛+1 → R 𝑛+1 is
given by w(𝑡, 𝑋) = (1, v(𝑡, 𝑋)). If 𝜓(𝑠) is the solution of (3.13) with 𝜓(0) = (𝑡0 , 𝑝),
then we have 𝜓(𝑠) = (𝑡 0 + 𝑠, 𝜑̃(𝑠)), and therefore 𝑡 = 𝑡0 + 𝑠. Putting 𝜑(𝑡) = 𝜑̃(𝑡 − 𝑡0 ),
we recognize that 𝜑(𝑡) is the solution of (3.12) with initial condition 𝜑(𝑡0 ) = 𝑝
if and only if 𝜓(𝑠) = (𝑡0 + 𝑠, 𝜑̃(𝑠)) is a solution of (3.13) with initial condition
𝜓(0) = (𝑡 0 , 𝑝).
Returning to surfaces, a vector field in an open subset 𝑉 ⊆ 𝑆 is a mapping
v such that v(𝑝) ∈ 𝑇 𝑝 𝑆 for every 𝑝 ∈ 𝑉. Using local coordinates, we can write
v ○ Φ(𝑢, 𝑣) = 𝛼(𝑢, 𝑣)Φ𝑢 + 𝛽(𝑢, 𝑣)Φ𝑣 for certain functions 𝛼 and 𝛽. We say that v is
of class 𝐶 𝑘 if every point of 𝑉 has a parametrized neighborhood Φ(𝑢, 𝑣) in which
the coordinate functions 𝛼 and 𝛽 of v are of class 𝐶 𝑘 (of course then the coordinate
functions of v in any parameterization are of class 𝐶 𝑘 ). Unless otherwise stated, our
vector fields are 𝐶 ∞ .
The equation 𝑋 = v(𝑋), when 𝑋 is contained in the image of Φ(𝑢, 𝑣), is equivalent
to the equation (𝑢, 𝑣) = (𝛼(𝑢, 𝑣), 𝛽(𝑢, 𝑣)) defined in an open subset of R 2 : in fact,
𝜑(𝑡) is a trajectory of the latter equation if and only if Φ ○ 𝜑(𝑡) is a trajectory of
the former. Theorem 3.3.1 is then carried over without changes to surfaces, since it
concerns local properties of trajectories.
Example 3.3.2 As an example of the application of these ideas, let us verify that
through each non-umbilical point 𝑝 pass exactly two lines of curvature, corresponding
to the two principal directions at 𝑝: to prove this, it suffices to show that there are two
differentiable vector fields w1 and w2 , defined in a neighborhood of 𝑝, such that, for
every 𝑞, the vectors w1 (𝑞) and w2 (𝑞) define the two principal directions in 𝑞: the
two lines of curvature through 𝑝 are then the integral curves of w1 and w2 that pass
through 𝑝. (The precise choice of the w𝑖 does not matter, since any other suitable
vector fields would have the same trajectories up to reparametrization.)
Let us consider a parameterization Φ(𝑢, 𝑣) in a neighborhood of the non-umbilical
point 𝑝 = Φ(0, 0). By the formulas (3.6)–(3.9), §2, the entries 𝑎 𝑖 𝑗 of the matrix of
−𝐷𝑁Φ(𝑢,𝑣) relative to the basis (Φ𝑢 , Φ𝑣 ) are differentiable functions of (𝑢, 𝑣), and
so are the principal curvatures 𝑘 1 < 𝑘 2 in a neighborhood of (0, 0). The eigenvectors
of −𝐷𝑁Φ(𝑢,𝑣) are 𝛼𝑖 Φ𝑢 + 𝛽𝑖 Φ𝑣 , where
𝑎 11 − 𝑘 𝑖 𝑎 12 𝛼 0
[ ][ 𝑖 ] = [ ] , 𝑖 = 1, 2. (3.14)
𝑎 21 𝑎 22 − 𝑘 𝑖 𝛽𝑖 0
What happens in the above example is that the principal directions do not define
vector fields, but direction fields: a direction field is a mapping D, defined in an open
subset 𝑈 ⊆ 𝑆, such that, for every 𝑝 ∈ 𝑈, D(𝑝) is a one-dimensional linear subspace
of 𝑇 𝑝 𝑆. The direction field D is called differentiable if every point of 𝑈 has an open
neighborhood 𝑉 in which a differentiable vector field v is defined such that v(𝑝)
generates D(𝑝) for all 𝑝 ∈ 𝑉. All direction fields we consider are differentiable. An
integral curve of the direction field is a curve whose velocity at each point has the
same direction as the field at that point.
Locally, the study of direction fields amounts to that of vector fields, and so it
readily follows that the integral curve of a direction field through a given point exists
and is unique (up to reparametrization). But, by example of the “hay fork” in Fig. 3.4,
we see that there are direction fields which globally do not arise from a vector field: it
is impossible to choose a coherent (continuous) orientation for all the trajectories
shown in the picture.
Figure 3.4
In local coordinates the direction fields are given by a linear relation 𝐴(𝑢, 𝑣)𝛼 +
𝐵(𝑢, 𝑣)𝛽 = 0 between the coordinates 𝛼 and 𝛽 of a vector in the basis (Φ𝑢 , Φ𝑣 ), so
that the integral curves have the form Φ(𝑢(𝑡), 𝑣(𝑡)), where 𝐴(𝑢, 𝑣)𝑢 ′ + 𝐵(𝑢, 𝑣)𝑣 ′ ≡ 0.
For example, the asymptotic directions at a point of negative Gaussian curvature
satisfy the equation 𝑒𝛼2 + 2 𝑓 𝛼𝛽 + 𝑔𝛽2 = 0. Since 𝑒𝑔 − 𝑓 2 < 0, the left-hand side of
80 3 The Geometry of the Gauss Map
the equation is the product of two linear factors: so if 𝑒 ≠ 0 then 𝑒𝛼2 + 2 𝑓 𝛼𝛽 + 𝑔𝛽2 =
𝑒(𝛼 − 𝜆1 𝛽)(𝛼 − 𝜆2 𝛽) with 𝜆1 ≠ 𝜆2 . Hence, the two asymptotic directions at each
point are given by the relations 𝛼 − 𝜆1 𝛽 = 0, 𝛼 − 𝜆2 𝛽 = 0, which define two direction
fields. The two families of asymptotic lines are the line integrals of these vector fields,
having equations 𝑢 ′ − 𝜆1 𝑣 ′ = 0, 𝑢 ′ − 𝜆2 𝑣 ′ = 0.
We say that two direction fields D1 and D2 , defined in the same open subset
𝑈 ⊆ 𝑆, are independent if, for every 𝑝 ∈ 𝑈, the linear subspaces D1 (𝑝) and D2 (𝑝) of
𝑇 𝑝 𝑆 are distinct. Our next result, the most important one in this section, establishes
the existence of parameterizations in which tangent lines to coordinate curves have
directions fixed beforehand.
and such that the coordinate curves 𝑥 = constant are line integrals of D2 ; the curves
𝑡 = constant of such a parameterization coincide with the curves 𝑣 = constant of
Φ(𝑢, 𝑣). We then have
𝜕𝜑
Ψ𝑥 = Φ𝑢 ,
𝜕𝑥
𝜕𝜑
Ψ𝑡 = Φ𝑢 + Φ 𝑣 ,
𝜕𝑡
3.3 Vector Fields 81
𝜕𝜑
= 𝛼(𝜑(𝑡, 𝑥), 𝑡). (3.15)
𝜕𝑡
Consider now the differential equation
The next two results are an immediate consequence of 3.3.4 and the preceding
discussion.
Exercises
70. Show that in a neighborhood of a non-umbilical point (hyperbolic point) the
coordinate curves of Φ(𝑢, 𝑣) are lines of curvature (asymptotic lines) if and only if
the coefficients of this parameterization satisfy the condition 𝐹 ≡ 0 ≡ 𝑓 (𝑒 ≡ 0 ≡ 𝑔).
71. Consider T 2 parametrized by
⎛ 𝑡 𝑑𝑥 ⎞ 𝜋 𝜋
(𝑢(𝑡), 𝑣(𝑡)) = 𝑢 0 + ∫ √ , 𝜋 + 𝑡 , 𝑡 ∈ ]− , [
⎝ 0 cos 𝑥(2 − cos 𝑥) ⎠ 2 2
or
⎛ 𝑡 𝑑𝑥 ⎞ 𝜋 𝜋
(𝑢(𝑡), 𝑣(𝑡)) = 𝑢 0 − ∫ √ , 𝜋 + 𝑡 , 𝑡 ∈ ]− , [ ⋅
⎝ 0 cos 𝑥(2 − cos 𝑥) ⎠ 2 2
(c) Show that lim𝜋 𝛼(𝑡) and lim𝜋 𝛼(𝑡) exist and are points of the parallels 𝑣 = 𝜋
2
𝑡→− 2 𝑡→ 2
and 𝑣 = 3𝜋
2 . Deduce that all asymptotic lines of T 2 have finite length.
72. Let 𝑆 be a surface of constant zero Gaussian curvature and let 𝑝 be a parabolic
point of 𝑆. Show that there exists a line segment through 𝑝 which is entirely contained
in 𝑆.
Chapter 4
The Intrinsic Geometry of Surfaces
sent by the conformal mapping 𝑓 into curves that intersect at 𝑓 (𝑝) at the same angle.
(Note that we are referring to non-oriented angles).
We now give a brief interlude of linear algebra showing several characterizations
of linear maps which preserve angles.
Lemma 4.1.1 Let (𝐸 1 , ⟨⋅, ⋅⟩1 ) and (𝐸 2 , ⟨⋅, ⋅⟩2 ) be spaces of the same dimension n
equipped with an inner product and let 𝐿∶ 𝐸 1 → 𝐸 2 be a linear isomorphism. Then
the following conditions are equivalent:
i. 𝐿 preserves angles;
ii. 𝐿 is a similarity — that is, there exists 𝜆 > 0 such that ∣𝐿(v)∣2 = 𝜆∣v∣1 for all
v ∈ 𝐸1 ;
iii. there exists 𝜆 > 0 such that ⟨𝐿(v), 𝐿(w)⟩2 = 𝜆2 ⟨v, w⟩1 for all v, w ∈ 𝐸 1 ;
iv. there exist 𝜆 > 0 and a basis (v1 , . . . , v𝑛 ) of 𝐸 1 such that ⟨𝐿(v𝑖 ), 𝐿(v 𝑗 )⟩2 =
𝜆2 ⟨v𝑖 , v 𝑗 ⟩1 for all 1 ≤ 𝑖, 𝑗 ≤ 𝑛.
Proof The equivalence between iii. and iv. is simple to verify, and ii. is included for
informational purposes only; finally, to show that ii. ⇔ iii. may be left as an exercise.
i. ⇒ iii. Let (e1 , . . . , e𝑛 ) be an orthonormal basis of 𝐸 1 . The vectors
𝐿(e1 ), . . . , 𝐿(e𝑛 ) are then pairwise orthogonal. Given 𝑖 ≠ 𝑗, the angle 𝜃 ∈ [0, 𝜋]
between e𝑖 and e𝑖 + e 𝑗 is given by
⟨e𝑖 , e𝑖 + e 𝑗 ⟩1 1
cos 𝜃 = =√
∣e𝑖 ∣1 ∣e𝑖 + e 𝑗 ∣1 2
so that 𝜃 = 𝜋
4. Since 𝐿 preserves angles, we also have
1
∣𝐿(e 𝑗 )∣2 = √ ∣𝐿(e𝑖 + e 𝑗 )∣2 ,
2
and so we deduce that ∣𝐿(e𝑖 )∣2 = ∣𝐿(e 𝑗 )∣2 for all 1 ≤ 𝑖, 𝑗 ≤ 𝑛. We now easily conclude
that iii. holds for 𝜆 = ∣𝐿(e𝑖 )∣2.
iii. ⇒. i. Just note that if iii. holds then we have, for all vectors v, w ∈ 𝐸 1
By iv., condition (4.1) only needs to be checked for the vectors of a given basis of
𝑇 𝑝 𝑆. This has the consequence that in local coordinates Φ(𝑢, 𝑣) the condition that 𝑓
is conformal is equivalent to the combination of the three equalities
2
⟨( 𝑓 ○ Φ)𝑢 , ( 𝑓 ○ Φ)𝑢 ⟩ = 𝜆(𝑢, 𝑣) ⟨Φ𝑢 , Φ𝑢 ⟩
2
⟨( 𝑓 ○ Φ)𝑢 , ( 𝑓 ○ Φ)𝑣 ⟩ = 𝜆(𝑢, 𝑣) ⟨Φ𝑢 , Φ𝑣 ⟩
2
⟨( 𝑓 ○ Φ)𝑣 , ( 𝑓 ○ Φ)𝑣 ⟩ = 𝜆(𝑢, 𝑣) ⟨Φ𝑣 , Φ𝑣 ⟩,
that express that the coefficients of the first fundamental form of the parameterization
Φ(𝑢, 𝑣) of 𝑆1 are proportional to those of the parametrization Ψ(𝑢, 𝑣) = 𝑓 ○Φ(𝑢, 𝑣) of
𝑆2 . (There is a certain abuse of language here, since only locally can it be guaranteed
that Ψ is a parameterization.) It further follows from these formulas that 𝜆∶ 𝑈 → R +
is differentiable, because its expression 𝜆(𝑢, 𝑣) in local coordinates is differentiable.
Example 4.1.2 Let us verify that the stereographic projection (Exercise 30, Section 2.1)
𝜋∶ S1 ∖ {(0, 0, 1)} → R 2 is conformal. We recall that
2𝑥 2𝑦
𝜋(𝑥, 𝑦, 𝑧) = ( , );
1−𝑧 1−𝑧
so, using local coordinates
√ √
Φ(𝑢, 𝑣) = ( 1 − 𝑣 2 cos 𝑢, 1 − 𝑣 2 sin 𝑢, 𝑣) , 𝑣 ∈ ]−1, 1[ ,
1
𝐸 = 1 − 𝑣2, 𝐹 = 0, 𝐺= ,
1 − 𝑣2
and those of Ψ(𝑢, 𝑣) are
4(1 + 𝑣) 4
𝐸= , 𝐹 = 0, 𝐺= ⋅
1−𝑣 3
(1 − 𝑣) (1 + 𝑣)
Noting that
86 4 The Intrinsic Geometry of Surfaces
2 2 2
𝐸 = 𝜆(𝑢, 𝑣) 𝐸, 𝐹 = 𝜆(𝑢, 𝑣) 𝐹, 𝐺 = 𝜆(𝑢, 𝑣) 𝐺,
1
tg 𝛼 = , 𝛽 = 2𝜋 sin 𝛼,
𝑘
and also 𝑈 = {(𝜌 cos 𝜑, 𝜌 sin 𝜑) ∈ R 2 ∶ 𝜌 > 0, 0 < 𝜑 < 𝛽}. (See Fig. 4.1.)
a b
Figure 4.1
𝜌𝛽 2𝜋𝜑 𝜌𝛽 2𝜋𝜑
( cos ( ), sin ( ) , 𝜌 cos 𝛼) ∶
2𝜋 𝛽 2𝜋 𝛽
and
𝜌𝛽 2𝜋𝜑 𝜌𝛽 2𝜋𝜑
𝑓 ○ Φ(𝜌, 𝜑) = ( cos ( ), sin ( ) , 𝜌 cos 𝛼)
2𝜋 𝛽 2𝜋 𝛽
have the same coefficients: 𝐸 = 1, 𝐹 = 0, 𝐺 = 𝜌 2 . ◻
88 4 The Intrinsic Geometry of Surfaces
Exercises
73. The inversion of the plane with respect to the circle with center 𝑂 and radius 𝑟
is the mapping that sends each point 𝑝 ≠ 𝑂 to the point 𝑝 ′ of the half-line 𝑂 𝑝 that
verifies the condition 𝑂 𝑝 ′ 𝑂 𝑝 = 𝑟 2 . Show that the inversion is a conformal mapping
and that it reverses orientation of the plane.
1 ̃
̃ =
𝑔 ′ (𝜃) ̃ = log tg ( 𝜋 + 𝜃 )]
[and therefore 𝑔(𝜃)
̃
cos 𝜃 4 2
𝑢, ̃
Ψ(̃ 𝑣 cos 𝑢̃, ̃
𝑣 ) = (̃ 𝑢 ).
𝑣 sin cos 𝑢, 𝑎̃
Figure 4.2
78. (a) Let 𝐸(̃ 𝑣 ) be 𝐶 ∞ functions defined on a compact interval 𝐼. Show that
𝑣 ), 𝐺(̃
there exists a parameterization
𝑢, ̃
Ψ(̃ 𝑣 ) = (𝜌(̃
𝑣 ) cos(𝑎̃
𝑢 ), 𝜌(̃
𝑣 ) sin(𝑎̃
𝑢 ), 𝑧(̃
𝑣 ))
79. Construct an isometry between the surface defined by the equation 𝑧 = 𝑦 2 and the
plane 𝑧 = 0 which sends the parabolas 𝑧 = 𝑦 2 , 𝑥 = 𝑥 0 to the lines 𝑧 = 0, 𝑥 = 𝑥0 .
In this section we give a necessary condition for two surfaces to be locally isometric,
from which it will follow, for example, the non-existence of a local isometry between
the plane and the sphere. In the proof we make use of Christoffel symbols, which are
functions of local parameters that are invariant under isometries.
Given a parameterization Φ(𝑢, 𝑣) of an oriented surface 𝑆, we consider at each point
the trihedron (Φ𝑢 , Φ𝑣 , 𝑁), where 𝑁(𝑢, 𝑣) is the normal field to 𝑆 compatible with
the orientation. [It suffices that 𝑁 be a local orientation, since all the considerations
we make are local in character — and therefore all the results in this section hold for
non-orientable surfaces.] We can express the second-order derivatives of Φ in terms
of this trihedron:
Φ𝑢𝑢 = Γ11
1
Φ𝑢 + Γ112
Φ𝑣 + 𝜆1 𝑁,
Φ𝑢𝑣 = Γ12
1
Φ𝑢 + Γ12
2
Φ𝑣 + 𝜆2 𝑁,
(4.2)
Φ𝑣𝑢 = 1
Γ21 Φ𝑢 + Γ21
2
Φ𝑣 + 𝜆3 𝑁,
Φ𝑣𝑣 = Γ22
1
Φ𝑢 + Γ22
2
Φ𝑣 + 𝜆4 𝑁.
The 𝜆𝑖 in (4.2) are just the coefficients of the second fundamental form: indeed, by the
formulas in Section 3.2, we have 𝜆1 = ⟨Φ𝑢𝑢 , 𝑁⟩ = 𝑒, 𝜆2 = 𝜆3 = ⟨Φ𝑢𝑣 , 𝑁⟩ = 𝑓 , 𝜆4 =
⟨Φ𝑣 , 𝑁⟩ = 𝑔. The coefficients Γ𝑖𝑘𝑗 are the Christoffel symbols of the parameterization
Φ(𝑢, 𝑣), and it follows from the definition that they are symmetric with respect to the
lower indices: thus, Γ𝑖𝑘𝑗 = Γ 𝑘𝑗𝑖 . To compute them, we form the inner product of the
equalities (4.2) with Φ𝑢 and Φ𝑣 , obtaining
⎧ 1
⎪
⎪Γ 𝐸 + Γ112
𝐹 = ⟨Φ𝑢𝑢 , Φ𝑢 ⟩ = 21 𝐸 𝑢
⎨ 11
⎪Γ11
⎪
1
𝐹 + Γ11
2
𝐺 = ⟨Φ𝑢𝑢 , Φ𝑣 ⟩ = 𝐹𝑢 − 12 𝐸 𝑣
⎩
⎧
⎪
⎪Γ12
1
𝐸 + Γ12
2
𝐹 = ⟨Φ𝑢𝑣 , Φ𝑢 ⟩ = 12 𝐸 𝑣
⎨ 1 (4.3)
⎪
⎪ Γ 𝐹 + Γ12
2
𝐺 = ⟨Φ𝑢𝑣 , Φ𝑣 ⟩ = 12 𝐺 𝑢
⎩ 12
⎧
⎪
⎪Γ22
1
𝐸 + Γ22
2
𝐹 = ⟨Φ𝑣𝑣 , Φ𝑢 ⟩ = 𝐹𝑣 − 12 𝐺 𝑢
⎨ 1
⎪
⎪ Γ 𝐹 + Γ22
2
𝐺 = ⟨Φ𝑣 , Φ𝑣 ⟩ = 21 𝐺 𝑣 .
⎩ 22
Each of the equation systems in (4.3) has determinant 𝐸𝐺 − 𝐹 2 > 0, which shows that
they suffice for computing the Christoffel symbols, and that these are functions of the
coefficients 𝐸, 𝐹, 𝐺 and their derivatives. In particular, and since local isometries
preserve 𝐸, 𝐹, 𝐺, they also preserve the coefficients Γ𝑖𝑘𝑗 — more precisely, if 𝑓 is a local
4.2 Gauss’s Theorema Egregium 91
(Φ𝑢𝑢 )𝑣 = (Γ11
1
)𝑣 Φ𝑢 + (Γ11
2
)𝑣 Φ𝑣 + Γ11
1
Φ𝑢𝑣 + Γ11
2
Φ𝑣𝑣 + 𝑒 𝑣 𝑁 + 𝑒𝑁 𝑣 ,
(Φ𝑢𝑣 )𝑢 = (Γ12
1
)𝑢 Φ𝑢 + (Γ12
2
)𝑢 Φ𝑣 + Γ12
1
Φ𝑢𝑢 + Γ12
2
Φ𝑢𝑣 + 𝑓𝑢 𝑁 + 𝑓 𝑁𝑢 .
Using again (4.2) and expressing 𝑁 𝑣 and 𝑁𝑢 in terms of Φ𝑢 and Φ𝑣 , we can write
each of the terms of (4.5) as a linear combination of the vectors Φ𝑢 , Φ𝑣 , 𝑁. Equating
the coefficients of Φ𝑣 in the two linear combinations, we obtain
(Γ11
2
)𝑣 + Γ11
1 2
Γ12 + Γ11
2 2
Γ22 − 𝑒𝑎 22 = (Γ12
1
)𝑢 + Γ12
1 2
Γ11 + Γ12
2 2
Γ12 − 𝑓 𝑎 21 . (4.6)
Combining (4.6) and (4.7), we obtain the following expression for the Gaussian
curvature:
1
𝐾= ((Γ11
2
)𝑣 − (Γ12
2
)𝑢 + Γ12
1
(Γ11
1
− Γ12
2
) + Γ11
1
(Γ22
2
− Γ12
1
)). (4.8)
𝐸
Although its utility is questionable, formula (4.8) has this consequence: it is possible to
compute the Gaussian curvature knowing only the coefficients of the first fundamental
form and their derivatives. In the calculation of the principal curvatures 𝑘 1 and 𝑘 2 ,
the Gauss map and its derivative (i.e., the second fundamental form) are crucial
92 4 The Intrinsic Geometry of Surfaces
Exercises
81. Let 𝑆1 = {(𝑥, 𝑦, 𝑧)∶ 𝑧 = 0} and 𝑆2 = {(𝑥, 𝑦, 𝑧)∶ 𝑥 2 + 𝑦 2 = 𝑧}.
(a) Check that 𝑓 ∶ 𝑆1 → 𝑆2 given by 𝑓 (𝑥, 𝑦, 0) = (𝑥, 𝑦, 𝑥 2 + 𝑦 2 ) is not an isometry.
(b) Show that 𝑆1 and 𝑆2 are not locally isometric.
82. Consider the surfaces 𝑆1 and 𝑆2 given by
and
Ψ(𝑢, 𝑣) = (𝑢 cos 𝑣, 𝑢 sin 𝑣, 𝑣) (𝑢, 𝑣 ∈ R ).
−1
Show that Ψ ○ Φ ∶ 𝑆1 → 𝑆2 is not a local isometry, although the curvature of 𝑆2 at
Ψ(𝑢, 𝑣) is equal to that of 𝑆1 at Φ(𝑢, 𝑣).
83. Consider T 2 with the usual parameterization. Show that any isometry of T 2 sends
the equator 𝑣 = 0 to itself (use Exercise 71-a) in Section 3.3).
84. Show that a conformal mapping that preserves areas (see Exercise 71, in Section
2.5) is necessarily a local isometry (and therefore all maps of the Earth’s surface have
to choose one of two defects: either they distort areas, or they distort shapes).
All the concepts we define in this section belong to intrinsic geometry, in the sense
that they are invariant under local isometries. The method, used here repeatedly, to
establish the intrinsic character of a certain concept consists of showing that it is a
function solely of the coefficients 𝐸, 𝐹, 𝐺 and their derivatives — which is the case
whenever we can express it by means of these quantities and the Christoffel symbols
only.
4.3 Covariant Derivative, Parallel Transport, Geodesic Curvature 93
Example 4.3.1 Consider a unit vector field v in S2 ∖ {(0, 0, 1), (0, 0, −1)} having
constant angle 𝜋4 with the parallels of the sphere. In spherical coordinates
we put
1 1
v(𝜑, 𝜃) = √ (−Φ 𝜃 + Φ𝜑 )
2 sin 𝜃
1
= √ (−cos 𝜃 cos 𝜑 − sin 𝜑, −sin 𝜃 sin 𝜑 + cos 𝜑, sin 𝜃).
2
Let us compute the covariant derivative of v along the parallel 𝜃 = 𝜃 0 , parametrized
by 𝛼 𝜃0 (𝑡) = Φ(𝑡, 𝜃 0 ). We now have
1
v′ (𝑡) = √ (cos 𝜃 0 sin 𝑡 − cos 𝑡, −sin 𝜃 0 cos 𝑡 − sin 𝑡, 0),
2
𝐷v
and we obtain (𝑡) by subtracting from v′ (𝑡) its normal component:
𝑑𝑡
Ignoring the normal component, the components of v′ (𝑡) with respect to Φ𝑢 and
𝐷v
Φ𝑣 give the expression of the covariant derivative (𝑡) in local coordinates. If
𝑑𝑡
we look at the coefficients of Φ𝑢 and Φ𝑣 , we see that they have “intrinsic character”
and are therefore preserved by isometries. To formalize this observation, let us
take a mapping 𝑓 ∶ 𝑈 → 𝑆2 that is a diffeomorphism onto the image, and consider
the vector field u = 𝐷 𝑓 (v) which is the transfer of v by 𝑓 , defined on 𝑓 (𝑈) by
𝐷u
u( 𝑓 (𝑝)) = 𝐷 𝑓 𝑝 (v(𝑝)). Let us denote by the covariant derivative of u along the
𝑑𝑡
curve 𝛽(𝑡) = 𝑓 ○ 𝛼(𝑡). We claim that if 𝑓 is an isometry, then
𝐷u 𝐷v
= 𝐷 𝑓 𝛼(𝑡) ( )⋅ (4.10)
𝑑𝑡 𝑑𝑡
In fact, using coordinates Φ(𝑢, 𝑣) on 𝑆 and Ψ(𝑢, 𝑣) = 𝑓 ○ Φ(𝑢, 𝑣) on 𝑆2 , we see by
𝐷u 𝐷v
(4.9) that the expressions of with respect to the basis (Ψ𝑢 , Ψ𝑣 ) and of with
𝑑𝑡 𝑑𝑡
respect to the basis (Φ𝑢 , Φ𝑣 ) have the same coefficients, which proves (4.10).
𝐷v
Another property of the covariant derivative of v along 𝛼 is that at each instant
𝑑𝑡
it only depends on the velocity vector of the curve at that instant.
𝐷v
Proposition 4.3.2 (𝑡) only depends on 𝛼′ (𝑡) = 𝑢 ′ (𝑡)Φ𝑢 + 𝑣 ′ (𝑡)Φ𝑣 and not on
𝑑𝑡
the curve 𝛼.
𝑎 ′ (𝑡) =
𝜕𝑎 ′ 𝜕𝑎 ′
𝑢 (𝑡) + 𝑣 (𝑡),
𝜕𝑢 𝜕𝑣
𝑏 ′ (𝑡) =
𝜕𝑏 ′ 𝜕𝑏 ′
𝑢 (𝑡) + 𝑣 (𝑡),
𝜕𝑢 𝜕𝑣
we recognize that the functions 𝑢(𝑡), 𝑣(𝑡) only enter in (4.9) through the value they
and their first derivatives 𝑢 ′ and 𝑣 ′ take at time 𝑡. ◻
𝐷v
We further observe that the expression of is linear in 𝑢 ′ , 𝑣 ′ . Thanks to
𝑑𝑡
Proposition 4.3.2, for every 𝑝 ∈ 𝑈, a linear mapping 𝐷v 𝑝 ∶ 𝑇 𝑝 𝑆 → 𝑇 𝑝 𝑆 is defined as
follows: given w ∈ 𝑇 𝑝 𝑆, let 𝛼(𝑡) be a curve in 𝑈 such that 𝛼(𝑡 0 ) = 𝑝 and 𝛼′ (𝑡0 ) = w;
then 𝐷v 𝑝 (w) is the covariant derivative of v along 𝛼 computed at 𝑡0 . Intuitively,
𝐷v 𝑝 (w) is a kind of directional derivative of v in the direction of w.
Given a field v in 𝑈 ⊆ 𝑆, the divergence of v is the mapping Div v∶ 𝑈 ⊆ 𝑆 → R
such that, at every 𝑝 ∈ 𝑈, its value Div v(𝑝) is the trace of the linear mapping 𝐷v 𝑝 .
Example 4.3.3 If v(𝑢, 𝑣) = ( 𝑓1 (𝑢, 𝑣), 𝑓2 (𝑢, 𝑣)) is a vector field on an open subset
𝑈 ⊆ R 2 , its covariant derivative along a curve 𝛼∶ 𝐼 → 𝑈 is the usual derivative
(v ○ 𝛼)′ (𝑡), and therefore the linear mapping 𝑑v 𝑝 is just the derivative of v as the
4.3 Covariant Derivative, Parallel Transport, Geodesic Curvature 95
𝜕 𝑓1 𝜕 𝑓2
and its trace ∣ + ∣ is the divergence of v computed at 𝑝. ◻
𝜕𝑢 𝑝 𝜕𝑣 𝑝
Proposition 4.3.4 Divergence is invariant under isometries: if 𝑓 ∶ 𝑈 ⊆ 𝑆 → 𝑆2 is
an isometry into its image and v is a vector field on 𝑈, then Div(𝐷 𝑓 v)( 𝑓 (𝑝)) =
Div v(𝑝).
Proof We put u = 𝐷 𝑓 v, and start by rewriting formula (4.10) using the linear
mappings
𝐷u 𝑓 ( 𝑝) ∶ 𝑇 𝑓 ( 𝑝) 𝑆2 → 𝑇 𝑓 ( 𝑝) 𝑆2 and 𝐷v 𝑝 ∶ 𝑇 𝑝 𝑆 → 𝑇 𝑝 𝑆.
Let us fix a curve 𝛼(𝑡) such that 𝛼(0) = 𝑝, and let us put 𝛽(𝑡) = 𝑓 ○ 𝛼(𝑡), w1 = 𝛼′ (0)
𝐷u 𝐷v
and w2 = 𝛽′ (0). Then we have (0) = 𝐷u 𝑓 ( 𝑝) (w2 ) and (0) = 𝐷v 𝑝 (w1 ) =
𝑑𝑡 𝑑𝑡
−1
𝐷v 𝑝 (𝐷 𝑓 𝑝 (w2 )). By (4.10) we can write
i.e.
𝐷u 𝑓 ( 𝑝) = 𝐷 𝑓 𝑝 ○ 𝐷v 𝑝 ○ 𝐷 𝑓 𝑝−1 , (4.11)
which means that 𝐷u 𝑓 ( 𝑝) and 𝐷v 𝑝 are conjugate linear mappings, and therefore
have the same trace. ◻
We will revisit the notion of divergence in the next section, where we prove a
theorem that adapts the divergence theorem known from vector calculus to surfaces.
For now, we return to the covariant derivative.
The reader must have noticed that in the computation of the covariant derivative
of v along 𝛼, only the way v is defined on the trace of the curve plays a role, and so it
is not necessary that the field be defined on other points. We thus define a vector field
along a regular curve 𝛼(𝑡) as a function v(𝑡) such that, for every 𝑡, the vector v(𝑡)
belongs to 𝑇𝛼(𝑡) 𝑆; if v is differentiable (i.e., if, by writing v(𝑡) = 𝑎(𝑡)Φ𝑢 + 𝑏(𝑡)Φ𝑣 ,
the functions 𝑎(𝑡) and 𝑏(𝑡) are differentiable), we can as before compute the covariant
derivative of v along 𝛼.
Lemma 4.3.5 If v(𝑡), w(𝑡) are two vector fields along the curve 𝛼(𝑡) then
𝑑 𝐷v 𝐷w
⟨v, w⟩ = ⟨ , w⟩ + ⟨v, ⟩⋅
𝑑𝑡 𝑑𝑡 𝑑𝑡
𝐷𝝉1 𝑑 1
⟨𝝉1 (𝑡), (𝑡)⟩ = ( ∣𝝉1 ∣2 ) = 0,
𝑑𝑡 𝑑𝑡 2
𝐷𝝉1
and so the vectors (𝑡) and 𝝉2 (𝑡) are collinear. The geodesic curvature 𝑘 𝑔 (𝑡) of
𝑑𝑡
𝛼 at the point 𝛼(𝑡) is defined by the equality
From formula (4.15) it follows that the sign of the geodesic curvature is changed
when we reverse orientation of the curve or change the orientation of the surface. In
our next proposition we gather some simple properties of the geodesic curvature.
Proposition 4.3.6 (i) The geodesic curvature is invariant under local isometries that
preserve orientation — that is, if 𝑓 ∶ 𝑈 ⊆ 𝑆1 → 𝑆2 is such a mapping and 𝛼∶ 𝐼 → 𝑈 is
a regular curve, then the geodesic curvatures of 𝛼 in 𝑆1 (computed at 𝛼(𝑡)) and of
𝑓 ○ 𝛼 in 𝑆2 (computed at 𝑓 ○ 𝛼(𝑡)) are equal.
(ii) If 𝛼(𝑡) is a regular curve in 𝑆, then
2 2 2
𝑘(𝑡) = 𝑘 𝑔 (𝑡) + 𝑘 𝑛 (𝑡) ,
where 𝑘, 𝑘 𝑛 , 𝑘 𝑔 are respectively the curvature, the normal curvature and the geodesic
curvature of 𝛼.
Proof The derivative 𝐷 𝑓 transforms the “moving” orthonormal frame (𝝉1 (𝑡), 𝝉2 (𝑡))
that appears in the calculation (4.12) of the geodesic curvature of 𝛼 into another
“moving” orthonormal frame 𝝉1 (𝑡), 𝝉2 (𝑡). Furthermore, 𝝉1 is the unit tangent vector
field along the curve 𝑓 ○ 𝛼 and, if 𝑓 preserves orientation, this second frame is also
positively oriented. Statement (i) is then an immediate consequence of (4.10).
To prove (ii), it suffices to note that from the decomposition
𝝉1 ′ = + (𝝉1 ′ )
𝐷𝝉1 𝑛
𝑑𝑡
into the tangential and normal components, it follows that
𝐷𝝉1 2 𝑛 2
∣𝝉1 ′ ∣2 = ∣ ∣ + ∣(𝝉1 ′ ) ∣ ,
i.e. 𝑑𝑡
∣𝛼′ (𝑡)∣2 𝑘(𝑡) = ∣𝛼′ (𝑡)∣2 (𝑘 𝑔 (𝑡) + 𝑘 𝑛 (𝑡) ).
2 2 2
◻
Now let v be any vector field along 𝛼. We say that v is parallel along 𝛼 if its
covariant derivative along 𝛼 is constantly zero. Assume that 𝛼 is defined on [𝑎, 𝑏],
that w1 ∈ 𝑇𝛼(𝑎) 𝑆, and let v be a parallel vector field along 𝛼 such that v(𝑎) = w1 .
Then the vector w2 = v(𝑏) in 𝑇𝛼(𝑏) 𝑆 is called the parallel transport of w1 along 𝛼
from 𝛼(𝑎) to 𝛼(𝑏). To justify the use of the definite article the, we have to show that
there is only one parallel vector field along 𝛼 with initial position w1 ; this is done in
the next proposition. However, we note first that by formula (4.13) the fact that v is
parallel along 𝛼 does not depend on the parameterization of the curve. Therefore the
notion of parallel transport is also independent of the parameterization.
𝑑 2 𝐷v
∣v∣ = 2⟨ , v⟩ = 0,
𝑑𝑡 𝑑𝑡
and so ∣v∣ is constant, and the same holds for ∣w∣ and for the inner product ⟨v, w⟩.
Therefore each of these vector fields is either always or never zero. Assuming they are
⟨v, w⟩
both nonzero, the angle 𝜃(𝑡) = ∠(v(𝑡), w(𝑡)) is constant, since cos 𝜃(𝑡) = ⋅
∣v∣ ∣w∣
(ii) It follows from (i) that two parallel vector fields v and u such that v(𝑎) = u(𝑎)
must be identical, since they both have constant norm equal to ∣v(𝑎)∣, and the angle
between them is always zero. Thus there exists at most one parallel vector field v(𝑡)
such that v(𝑎) = w1 .
Assuming that w1 ≠ 0, it remains to prove the existence of v. For this, consider the
vector fields 𝝉1 (𝑡) = ∣𝛼′1(𝑡)∣ 𝛼′ (𝑡) and 𝝉2 = 𝑁 × 𝝉1 . We write v(𝑡) in the form
Since v is parallel, v has constant norm 𝑟 = ∣w1 ∣. We can thus try to find 𝜑(𝑡) such
that 𝑎(𝑡) = 𝑟 cos 𝜑(𝑡) and 𝑏(𝑡) = 𝑟 sin 𝜑(𝑡). Note that
, 𝝉2 ⟩ = −𝑘 𝑔 ∣𝛼′ ∣
𝐷𝝉2 𝐷𝝉1
⟨ , 𝝉1 ⟩ = −⟨
𝑑𝑡 𝑑𝑡
𝐷𝝉2
and therefore = −𝑘 𝑔 ∣𝛼′ ∣𝝉1 . Noting that the covariant derivative obeys the usual
𝑑𝑡
rules of differentiation, we then have
1 𝐷v 𝐷𝝉1 𝐷𝝉2
= −𝜑 sin 𝜑 𝝉1 + cos 𝜑 + 𝜑 cos 𝜑 𝝉2 + sin 𝜑
𝑟 𝑑𝑡 𝑑𝑡 𝑑𝑡
= (𝜑 + 𝑘 𝑔 ∣𝛼′ ∣)(− sin 𝜑 𝝉1 + cos 𝜑 𝝉2 ),
so that v is a parallel vector field along 𝛼 if and only if 𝜑′ (𝑡) = −𝑘 𝑔 (𝑡)∣𝛼′ (𝑡)∣. To
finish, it is therefore sufficient to define 𝜑(𝑡) = 𝜑0 − ∫𝑎 𝑘 𝑔 (𝑠)∣𝛼′ (𝑠)∣ 𝑑𝑠, where 𝜑0
𝑡
Observations and Examples 4.3.8 A. Every local isometry 𝑓 sends a parallel vector
field along 𝛼 to a parallel vector field along 𝑓 ○ 𝛼. This is a consequence of formula
(4.10).
B. In the plane a vector field is parallel along a certain curve if and only if it is
constant along that curve, which shows that in this case the parallel transport only
depends on the initial and final points of the curve, not on the path covered. This
property is shared (locally) by those surfaces which are locally isometric to the plane,
such as the cylinder and the cone: thus, each point of these surfaces has an open
neighborhood 𝑈 such that the parallel transport along any curve contained in 𝑈
depends only on the initial and final points of the curve. Later on we will conclude
that it is only on surfaces of zero curvature that this holds true.
4.3 Covariant Derivative, Parallel Transport, Geodesic Curvature 99
C. If the surfaces 𝑆1 and 𝑆2 are tangent along the regular curve 𝛾(𝑡), the covariant
derivative of a vector field v along 𝛾 is the same computed with respect to 𝑆1 or 𝑆2
(since the normal component we subtract from v′ (𝑡) is the same in both cases), and
therefore also the parallel transport along 𝛾 is the same on the two surfaces. Using
this observation, let us determine the parallel transport on the sphere S2 along the
parallel of colatitude 𝜃 0 , which we denote by 𝛾.
Consider the cone of revolution ℭ tangent to the sphere along 𝛾. The angle that
the generatrices of ℭ make with the axis is 𝜋2 − 𝜃 0 , and therefore (see example 4.1.3
D) ℭ is isometric to a planar region 𝑈 that defines an angle 𝛽 = 2𝜋 cos 𝜃 0 (see Figure
4.3). By such an isometry, the parallel transport along 𝛾 corresponds to the parallel
transport in 𝑈 along the arc with center 𝑂 and radius equal to tg 𝜃 0 . However, in
𝑈 parallel transport is simply translation; but a vector w1 which is tangent to the
generatrix of the cone is translated to a vector w2 which makes an angle of 𝛽 with
that same generatrix (the generatrices of ℭ correspond in 𝑈 to the half-lines of origin
𝑂, and the two half-lines bounding 𝑈 are identified with the same generatrix).
In conclusion: parallel transport in S2 along one complete turn of the parallel
𝜃 = 𝜃 0 makes each vector rotate through an angle of 2𝜋 cos 𝜃 0 at the end of a complete
turn. ◻
We end this section by defining one of the most important concepts of Differential
Geometry: a geodesic of the surface 𝑆 is a regular curve 𝛼(𝑡) on 𝑆 whose geodesic
curvature is constantly zero. By
p -q
2 0
w1
U
w2
b q0
w1 b w2 b
Figure 4.3
(4.12), this is equivalent to the unit vector field 𝝉1 = ∣𝛼1′ ∣ 𝛼′ tangent to 𝛼 being parallel.
This simply means that at points where the curvature of 𝛼 is nonzero, the principal
normal to the curve is also normal to the surface.
We leave the in-depth study of geodesics to a later section; to finish this one, we
give some simple examples.
100 4 The Intrinsic Geometry of Surfaces
Examples 4.3.9 Geodesics on the plane are simply straight lines, since in the plane
geodesic curvature is the same as signed curvature. On the sphere any maximal circle
is a geodesic, because the principal normal to a maximal circle passes through the
center of the sphere, and therefore coincides with the normal to the sphere itself (later
on we will see that there are no other geodesics on the sphere). More generally, the
meridians of any surface of revolution are geodesics. ◻
Exercises
85. Consider on the sphere S2 the meridian 𝜑 = 0, which we denote by 𝛾, and the
points N = (0, 0, 1) and 𝑆 = (0, 0, −1). We define a mapping 𝐿 𝛾 ∶ 𝑇N S2 → 𝑇S S2 as
follows: 𝐿 𝛾 (v) is the parallel transport of the vector v from 𝑁 to S along 𝛾.
(a) Compute 𝐿 𝛾 (1, 0, 0) and 𝐿 𝛾 (0, 1, 0).
(b) Show that 𝐿 𝛾 is linear.
(c) Does 𝐿 𝛾 depend on the choice of meridian?
86. (a) Describe all the geodesics of the cylinder 𝑥 2 + 𝑦 2 = 1.
(b) Compare the geodesic curvatures of the same helix in the cylinder and in the
helicoid.
87. (a) Let 𝑆 = {𝛼(𝑡) + 𝜆v(𝑡)∶ 𝑇 ∈ 𝐼, 𝜆 ∈ R } be a ruled surface. Show that the
lines 𝑟 𝑡 (𝜆) = 𝛼(𝑡) + 𝜆v(𝑡) are all geodesics, and find out whether the curves
𝑐 𝜆 (𝑡) = 𝛼(𝑡) + 𝜆v(𝑡) are geodesics.
(b) Show that on any surface an asymptotic line that is also a geodesic is necessarily
a (segment of) straight line.
88. Show that on the surface of revolution
the parallel 𝑣 = 𝑣 0 is a geodesic if and only if the tangent line to the generating curve
at 𝑣 0 is parallel to the axis of revolution.
√
89. (a) Check that in the cone C 𝑘 = {(𝑥, 𝑦, 𝑧)∶ 𝑧 ≠ 0, 𝑧 = 𝑘 𝑥 2 + 𝑦 2 } there is always
some geodesic connecting each pair of points.
√
(b) The generatrices 𝑡 ↦ 𝑡(𝑥 0 , 𝑦 0 , 𝑘 𝑥02 + 𝑦 20 ) are geodesics of C 𝑘 that are not
defined for 𝑡 ≤ 0. Show that any other geodesic of C 𝑘 parametrized with constant
scalar velocity extends to all values of the parameter.
90. Let E be the intersection of the cylinder 𝑥 2 + 𝑦 2 = 1 with a plane which passes
through the 𝑥-axis and makes an angle 𝜃 with the 𝑥𝑦 plane. Show that E is an ellipse,
and compute the absolute value of the geodesic curvature of E (relative to the cylinder)
at the points where E intersects its axes.
91. Show that the geodesic curvature of the curve 𝛼 at 𝑝 ∈ 𝑆 is equal to the curvature
at 𝑝 of the curve obtained by projecting 𝛼 orthogonally on 𝑝 + 𝑇 𝑝 𝑆.
4.4 The Divergence Theorem. First Variation of Area 101
In this section we give a simple expression for the divergence of a vector field and
prove the divergence theorem. We also prove a formula for the variation of the area
of a surface when it undergoes a perturbation induced by a vector field (and mention
how this is related to minimal surfaces). These results are used in Section 5.4 in
the proof of the Minkowski integral formulas (which in turn we use to establish the
rigidity of the sphere and in the proof of Theorem 5.5.3) and otherwise are not used in
the remainder of this book. This being said, we must acknowledge that some subjects
treated here do not pertain to intrinsic geometry, but we could find no better place to
fit them in.
Let v be a vector field on an open subset 𝑈 of an oriented surface 𝑆. In the previous
section we defined the linear mapping 𝐷v 𝑝 as follows: given w ∈ 𝑇 𝑝 𝑆, 𝐷v 𝑝 (w)
is the covariant derivative of v, computed at 𝑝, along any curve in 𝑆 that passes
through 𝑝 with velocity w. Using coordinates Φ(𝑢, 𝑣), the vectors 𝐷vΦ(𝑢,𝑣) (Φ𝑢 )
𝜕v 𝜕v
and 𝐷vΦ(𝑢,𝑣) (Φ𝑣 ) are then the tangential components of and respectively —
𝜕𝑢 𝜕𝑣
that is, if we write
𝜕v
= 𝑏 11 Φ𝑢 + 𝑏 21 Φ𝑣 + 𝜆1 𝑁
𝜕𝑢 (4.16)
𝜕v
= 𝑏 12 Φ𝑢 + 𝑏 22 Φ𝑣 + 𝜆2 𝑁,
𝜕𝑣
then the matrix of 𝐷vΦ(𝑢,𝑣) relative to the basis (Φ𝑢 , Φ𝑣 ) of 𝑇Φ(𝑢,𝑣) 𝑆 is
𝑏 11 𝑏 12
𝐵=[ ]⋅
𝑏 21 𝑏 22
Our goal is to obtain an explicit formula for divergence. Using (4.16), we have
𝜕v 𝜕v
× Φ𝑣 + Φ𝑢 × = (𝑏 11 + 𝑏 22 )(Φ𝑢 × Φ𝑣 ) + 𝜆1 (𝑁 × Φ𝑣 ) + 𝜆2 (Φ𝑢 × 𝑁),
𝜕𝑢 𝜕𝑣
and therefore
𝜕v 𝜕v
⟨ × Φ𝑣 + Φ𝑢 × , 𝑁⟩ = (𝑏 11 + 𝑏 22 )⟨Φ𝑢 × Φ𝑣 , 𝑁⟩
𝜕𝑢 𝜕𝑣 (4.17)
√
= (Div v) 𝐸𝐺 − 𝐹 2 ,
𝑎 11 𝑎 12
[ ]
𝑎 21 𝑎 22
of −𝐷𝑁Φ(𝑢,𝑣) relative to the basis (Φ𝑢 , Φ𝑣 ) to express the vectors 𝑁𝑢 and 𝑁 𝑣 (see
(3.6) in Section 3.2), we have
Φ𝑣 × 𝑁𝑢 + 𝑁 𝑣 × Φ𝑢 = −Φ𝑣 × (𝑎 11 Φ𝑢 + 𝑎 21 Φ𝑣 ) − (𝑎 12 Φ𝑢 + 𝑎 22 Φ𝑣 ) × Φ𝑢
= (𝑎 11 + 𝑎 22 )(Φ𝑢 × Φ𝑣 )
√
= 2𝐻 𝐸𝐺 − 𝐹 2 𝑁 (4.18)
√
⟨v, 𝑁 × Φ𝑢 ⟩ = 𝛽 𝐸𝐺 − 𝐹 2 (4.20)
𝜕v 𝜕v 𝜕v 𝜕v
⟨ × Φ𝑣 + Φ𝑢 × , 𝑁⟩ = ⟨ , Φ𝑣 × 𝑁⟩ + ⟨ , 𝑁 × Φ𝑢 ⟩
𝜕𝑢 𝜕𝑣 𝜕𝑢 𝜕𝑣
𝜕 𝜕
= ⟨ −v, (Φ𝑣 × 𝑁) + (𝑁 × Φ𝑢 )⟩ +
𝜕𝑢 𝜕𝑣
𝜕 𝜕
+ ⟨v, Φ𝑣 × 𝑁⟩ + ⟨v, 𝑁 × Φ𝑢 ⟩
𝜕𝑢 𝜕𝑣
= −⟨v, Φ𝑣 × 𝑁𝑢 + 𝑁 𝑣 × Φ𝑢 ⟩ +
𝜕 𝜕
+ ⟨v, Φ𝑣 × 𝑁⟩ + ⟨v, 𝑁 × Φ𝑢 ⟩
𝜕𝑢 𝜕𝑣
√
= − 𝐸𝐺 − 𝐹 2 ⟨v, 2𝐻𝑁⟩ +
𝜕 √ 𝜕 √
+ (𝛼 𝐸𝐺 − 𝐹 2 ) + (𝛽 𝐸𝐺 − 𝐹 2 ),
𝜕𝑢 𝜕𝑣
where we apply (4.18), (4.19), and (4.20). Of course, when the vector field v is tangent
to 𝑆, the first summand of the last expression vanishes. Combining this formula with
(4.17), we obtain the desired formula for divergence:
Proposition 4.4.1 Let v be a (not necessarily tangent) vector field on an open subset
𝑈 covered by a parameterization Φ(𝑢, 𝑣) compatible with the orientation of 𝑆. Then:
(i) if we denote by v⊺ the tangential component of v, then we have
1
, 𝑁⟩ = −⟨v, 2𝐻𝑁⟩ + Div(v⊺ );
𝜕v 𝜕v
√ ⟨ × Φ𝑣 + Φ𝑢 ×
𝐸𝐺 − 𝐹 2 𝜕𝑢 𝜕𝑣
4.4 The Divergence Theorem. First Variation of Area 103
where 𝛾(𝑠) is the boundary of Ω and 𝝉2 (𝑠) ∈ 𝑇𝛾(𝑠) 𝑆 is the unit vector orthogonal to
𝛾 ′ (𝑠) that points to the interior of Ω.
Proof We assume that 𝛾(𝑠) is parametrized by arc length and runs in the positive
direction — i.e., in such a way that Ω is always to the left of 𝛾 (in other words, we
have 𝝉2 (𝑠) = 𝑁(𝑠) × 𝛾 ′ (𝑠)) (see Fig. 4.4). Of course, 𝝉2 is not defined at the vertices
of 𝛾, but this happens only for a finite number of values of 𝑠, which do not affect the
integration.
It is sufficient to prove the theorem under the hypothesis that the closure of Ω is
contained in some parametrized neighborhood. For if this is not the case, we can
𝑘
decompose Ω into a finite number of sufficiently small polygonal regions (Ω𝑖 )𝑖=1 .
Supposing that (4.21) holds for the Ω𝑖 , and
104 4 The Intrinsic Geometry of Surfaces
t2
t1
Figure 4.4
– since any “edge” (segment of some 𝛾𝑖 ) inside Ω is run through twice, with the
integrand function ⟨v, −𝝉2 𝑖 ⟩ taking up opposite signs in both instances, and therefore
we are left with only the integrals relative to the edges that are part of 𝛾.
We can thus assume that Ω is covered by the parameterization Φ(𝑢, 𝑣), and we
put 𝛾(𝑠) = Φ(𝑢(𝑠), 𝑣(𝑠)) for every 𝑠 ∈ [0, 𝐿]. Furthermore, we require that Φ(𝑢, 𝑣)
be compatible with the orientation of 𝑆. We thus have, with the notation of 4.4.1 and
using Green’s theorem
𝜕 √ 𝜕 √
∫ Div v 𝑑𝜎 = ∬ { (𝛼 𝐸𝐺 − 𝐹 2 ) + (𝛽 𝐸𝐺 − 𝐹 2 )} 𝑑𝑢𝑑𝑣
Ω Φ−1 (Ω) 𝜕𝑢 𝜕𝑣
𝐿 √
=∫ 𝐸𝐺 − 𝐹 2 (𝛼(𝑠)𝑣 ′ (𝑠) − 𝛽(𝑠)𝑢 ′ (𝑠)) 𝑑𝑠.
0
(Note that, since Φ is compatible with the orientation of 𝑆, the curve (𝑢(𝑠), 𝑣(𝑠))
moves along the boundary of Φ−1 (Ω) in the positive direction, and we have used this
fact when we applied Green’s theorem.) To simplify the latter integral, we form the
vector product of the two equalities
obtaining
The divergence theorem can be generalized to regions bounded by more than one
closed curve, making the sum of all integrals relative to each curve constituting the
boundary of Ω appear in the right-hand side of (4.21). However, it is more interesting,
in the case of a compact surface 𝑆, to consider the integral of Div v over the whole
𝑘
surface. Choosing a “polygonal decomposition” (Ω𝑖 )𝑖=1 of 𝑆, and applying 4.4.2 to
each of the Ω𝑖 , what happens, when adding up the results, is that all the terms vanish,
because now none of the edges is run through only once. Thus we have just proved (i)
of the corollary below.
(i) ∫𝑆 Div v 𝑑𝜎 = 0;
To prove (ii) we can assume that 𝑆 is connected, because in any case its connected
components are compact. Hence, if Div v has no zeros, then it has constant sign and
therefore ∫𝑆 Div v 𝑑𝜎 ≠ 0, in contradiction to (i). Therefore Div v has some zero,
which proves (ii).
Figure 4.5
Proposition 4.4.4 There exists 𝜂 > 0 such that, for all 𝑡 ∈ ]−𝜂, 𝜂[, 𝑆𝑡 is a compact
surface.
𝜕(𝑥, 𝑦)
det(𝐽𝐹(0,0,0) ) = det(𝐽(𝐺 0 )(0,0) ) = ∣ ≠ 0.
𝜕(𝑢, 𝑣) (0,0)
Hence, by the inverse mapping theorem, there exists an open neighborhood 𝑈 × ]−𝜀, 𝜀[
of (0, 0, 0) such that the restriction of 𝐹 to this neighborhood is a diffeomorphism
−1
onto its image. The local inverse of 𝐹 has the form 𝐹 −1 (𝑥, 𝑦, 𝑡) = ((𝐺 𝑡 ) (𝑥, 𝑦), 𝑡),
and so, for every 𝑡 ∈ ]−𝜀, 𝜀[, the mapping 𝐺 𝑡 ∣𝑈 has a differentiable inverse, which
guarantees that it is injective and its Jacobian matrix has rank two. Since 𝐿 𝑡 ○Φ(𝑢, 𝑣) =
4.4 The Divergence Theorem. First Variation of Area 107
(𝐺 𝑡 (𝑢, 𝑣), 𝑧 𝑡 (𝑢, 𝑣)), the same holds for 𝐿 𝑡 ○ Φ∣𝑈 . The parameterization (𝑈, Φ) thus
satisfies the conditions of Claim 1.
Claim 2. For ∣𝑡∣ sufficiently small, 𝐿 𝑡 is injective.
By compactness of 𝑆, we can find 𝜀 > 0 and parameterizations (𝑈𝑖 , Φ𝑖 ), with
𝑖 = 1, . . . , 𝑘, which cover 𝑆 and such that each of them satisfies Claim 1 for the given 𝜀.
Next, we take 𝛿 > 0 such that, for every 𝑝 ∈ 𝑆, the ball 𝐵 𝛿 (𝑝, 𝑆) = {𝑞 ∈ 𝑆∶ ∣𝑞 − 𝑝∣ < 𝛿}
is contained in one of the open subsets Φ𝑖 (𝑈𝑖 ), and we put 𝑀 = max ∣v(𝑝)∣ and
𝑝∈𝑆
𝜂 = min { 4𝑀𝛿
, 𝜀}}. We claim that 𝐿 𝑡 is injective for ∣𝑡∣ < 𝜂. In fact, if ∣𝑞 − 𝑝∣ < 𝛿 it
is clear that 𝐿 𝑡 (𝑝) ≠ 𝐿 𝑡 (𝑞), since in this case 𝑝 and 𝑞 both belong to some Φ𝑖 (𝑈𝑖 )
and, by Claim 1, the restriction of 𝐿 𝑡 to Φ𝑖 (𝑈𝑖 ) is injective. If ∣𝑞 − 𝑝∣ ≥ 𝛿 then
Proof We can consider a polygonal decomposition (Ω 𝑗 )1≤ 𝑗≤𝑟 of 𝑆 such that the
closure of each Ω 𝑗 is contained in the image of one of the parameterizations (𝑈𝑖 , Φ𝑖 )
defined in the proof of 4.4.4. Thus, for ∣𝑡∣ < 𝜂, the sets Ω𝑡𝑗 = 𝐿 𝑡 (Ω 𝑗 ) form a
decomposition of 𝑆𝑡 and each of them is covered by some parameterization
and therefore
𝐴′𝑗 (0) = ∬
𝜕
∣(𝐿 𝑡 ○ Φ𝑖 )𝑢 × (𝐿 𝑡 ○ 𝜙𝑖 )𝑣 ∣ 𝑑𝑢𝑑𝑣.
𝑊𝑗 𝜕𝑡 ∣𝑡=0
Now we have
𝜕v 𝜕v 𝜕v 𝜕v
∣(𝐿 𝑡 ○ Φ𝑖 )𝑢 × (𝐿 𝑡 ○ Φ𝑖 )𝑣 ∣ = ∣(Φ𝑖𝑢 × Φ𝑖𝑣 ) + 𝑡 ( × Φ𝑖𝑣 + Φ𝑖𝑢 × ) + 𝑡2 ( × )∣ .
𝜕𝑢 𝜕𝑣 𝜕𝑢 𝜕𝑣
Therefore, using 4.4.1 (i), we have
Observation 4.4.6 If 𝑆 is not compact but the vector field v has compact support (the
support of v is the closure in 𝑆 of the set {𝑝 ∈ 𝑆∶ v(𝑝) ≠ 0}), then the sets 𝑆𝑡 are
still surfaces for ∣𝑡∣ small and, suitably interpreted, formula 4.4.5 remains valid. We
choose a compact region Ω ⊆ 𝑆 whose boundary is made up of a finite number of
piecewise regular closed curves and whose closure contains the support of v, and
we take a polygonal decomposition (Ω 𝑗 )1≤ 𝑗≤𝑟 of Ω. We denote by 𝐴(𝑡) the area of
𝐿 𝑡 (Ω) ⊆ 𝑆𝑡 . Proceeding as in 4.4.5, and using the divergence theorem, we obtain
region Ω ⊆ 𝑆 as above, the area of Ω is less than or equal to that of any other portion
̃ of the surface such that 𝜕 Ω
Ω ̃ = 𝜕Ω. Then 𝑆 is a minimal surface: indeed, denoting
by 𝐴(𝑡) the area of 𝐿 𝑡 (Ω) (for a variation with support contained in Ω), 𝐴(𝑡) has
a minimum at 0 and therefore 𝐴′ (0) = 0. This implies that for every vector field v
with compact support we have − ∫ ⟨v, 2𝐻𝑁⟩ 𝑑𝜎 = 0 — which is only possible with
𝑆
𝐻 ≡ 0.
Not all minimal surfaces minimize area in the sense just stated, but the name has
stuck. Besides the plane, the reader can verify (using formula (3.8) in the Section 3.2)
that the helicoid and the catenoid are minimal surfaces (Exercise 76). In the exercises
of this section we prove that besides the plane the catenoid is the only minimal surface
of revolution.
For more examples and an in-depth study of minimal surfaces we recommend
Osserman’s book [19].
Exercises
92. (a) Compute the divergence of the vector field v(𝜑, 𝜃) in S2 of example 4.3.1. For
that example, directly compute ∫S2 Div v 𝑑𝜎. Could you use corollary 4.4.3?
(b) Let 𝑆 be a compact surface, let 𝐹 ⊆ 𝑆 be a finite set, and let v be a field of
tangent vectors defined on 𝑆 ∖ 𝐹 such that the set sup {∣v(𝑝)∣∶ 𝑝 ∈ 𝑆 ∖ 𝐹} is bounded.
Show that ∫𝑆 Div v 𝑑𝜎 = 0.
93. If v is a tangent field to 𝑆 and 𝑓 ∶ 𝑆 → R a differentiable function, then
1 𝜕𝑓 𝜕𝑓
∇𝑓 = √ 𝑁 ×(
Φ𝑢 − Φ𝑣 )
𝐸𝐺 − 𝐹2 𝜕𝑣 𝜕𝑢
1 𝜕𝑓 𝜕𝑓 𝜕𝑓 𝜕𝑓
= {(− 𝐹 + 𝐺) Φ𝑢 + (− 𝐸 − 𝐹) Φ𝑣 } .
𝐸𝐺 − 𝐹 2 𝜕𝑣 𝜕𝑢 𝜕𝑣 𝜕𝑢
(b) Conclude that the vector field w = 𝑁 × ∇ 𝑓 has zero divergence. Conversely,
show that if Div w ≡ 0 then v = w × 𝑁 is locally a gradient field (i.e., each point of 𝑆
has a neighborhood 𝑈 such that v∣𝑈 is the gradient of some function 𝑈 → R ).
(c) Show that a vector field v on 𝑆 is a gradient field if and only if, for every
curve piecewise differentiable 𝛼∶ [𝑎, 𝑏] → 𝑆, the line integral ∫𝑎 ⟨v ○ 𝛼(𝑡), 𝛼′ (𝑡)⟩ 𝑑𝑡
𝑏
95. Let Φ(𝑢, 𝑠) = (𝜌(𝑠) cos 𝑢, 𝜌(𝑠) sin 𝑢, 𝑧(𝑠)) be a parameterization of a surface of
revolution 𝑆, where 𝑠 is the arc length of the generating curve (i.e., 𝜌 2 + 𝑧 2 ≡ 1) and
𝜌(𝑠) > 0. Assume that 𝑆 is not a plane (so that 𝑧 is not constantly zero). Show that:
(a) 𝑆 is a minimal surface if and only if
𝑧
= 𝜌𝑧 − 𝜌𝑧 (apply 3.2.1)
𝜌
(b) assuming that 𝑧(𝑠0 ) ≠ 0, there exists 𝜀 > 0 such that, for 𝑠 ∈ ]𝑠0 − 𝜀, 𝑠0 + 𝜀[,
the above equation is equivalent to
𝑑2 1 2
𝜌𝜌 + 𝜌 2 = 1 ⇔ ( 𝜌 )=1
𝑑𝑠2 2
√
⇔ ∃ 𝐴, 𝐵 ∈ R ∶ 𝜌(𝑠) = 𝑠2 + 𝐴𝑠 + 𝐵;
The Gauss-Bonnet Theorem is one of the deepest results in the Differential Geometry
of surfaces establishing an unexpected connection between the Euler characteristic
of a compact surface (a purely topological concept) and its Gaussian curvature.
Moreover, it provides a general context for a seemingly rather particular result as
Girard’s formula for spherical triangles (example 2.5.2).
We work with an oriented surface 𝑆. Let Ω be a polygonal region (as defined
in Section 4.4) and 𝛼(𝑠), 𝑠 ∈ [0, 𝐿], a parameterization of 𝜕Ω by arc length with
positive orientation. Let 0 = 𝑠0 < 𝑠1 < ⋯ < 𝑠 𝑘 = 𝐿 be the 𝑘 vertices of 𝛼; 𝛾𝑖 ∈ [−𝜋, 𝜋],
for 𝑖 = 1, . . . , 𝑘 − 1, the oriented angle of 𝛼′ (𝑠𝑖− ) and 𝛼′ (𝑠𝑖+ ); and 𝛾 𝑘 ∈ [−𝜋, 𝜋] the
angle between 𝛼′ (𝑠−𝑘 ) and 𝛼′ (𝑠0+ ). These angles 𝛾𝑖 are called the exterior angles of
Ω.
4.5 The Gauss-Bonnet Theorem 111
p
gj
- W
-p
W
gi
a´ (si+ ) +
a´ (s-i )
W
Figure 4.6
(Note: if 𝛼′ (𝑠𝑖+ ) = −𝛼′ (𝑠𝑖− ), we have to decide which of the values 𝜋 or −𝜋 we choose
for 𝛾𝑖 . We take 𝛾𝑖 = 𝜋 when, for 𝑠 → 𝑠𝑖− , the trace of 𝛼∣[𝑠 ,𝑠 +𝜀] (for small 𝜀 > 0) stays,
𝑖 𝑖
like Ω, to the left of the curve; otherwise, we take 𝛾𝑖 = −𝜋. We can make this criterion
rigorous with the help of local coordinates, but Fig. 4.6 is more illuminating).
Let us state the first version of our theorem:
Let w(𝑠), 𝑠 ∈ [0, 𝐿], be a unit vector field along 𝛼: the vector field w is therefore
continuous, and is parallel along each regular arc 𝛼([𝑠𝑖−1 , 𝑠𝑖 ]). Let 𝜑𝑖 (𝑠) (𝑠 ∈
[𝑠𝑖−1 , 𝑠𝑖 ]) be a continuous choice of the angle between 𝛼′ (𝑠) and w(𝑠). The proof of
Proposition 4.3.7 (ii) shows that
Further, let 𝜓(𝑠), 𝑠 ∈ [0, 𝐿], be a choice of the angle between v1 and w(𝑠). By the
equality between oriented angles
𝐿 𝑘
𝜓(𝐿) − 𝜓(0) + ∫ 𝑘 𝑔 (𝑠) 𝑑𝑠 + ∑ 𝛾𝑖 = 2𝜋.
0 𝑖=1
In view of this formula, the next lemma concludes the proof of the Gauss-Bonnet
theorem.
Lemma 4.5.2
𝜓(𝐿) − 𝜓(0) = ∫ 𝐾 𝑑𝜎. (4.25)
Ω
Proof Let v2 = 𝑁 ×v1 , so that the pair (v1 , v2 ) forms at each point a direct orthonormal
𝐷v1
frame. Since v1 is a unit vector field, is orthogonal to v1 , and is therefore collinear
𝑑𝑠
𝐷v1 𝐷v1
with v2 . We thus have = 𝑎(𝑠)v2 , where 𝑎(𝑠) = ⟨ , v2 ⟩ = ⟨v1 ′ , v2 ⟩ − ⟨v1 , v2 ′ ⟩,
𝑑𝑠 𝑑𝑠
from which we conclude that
Hence
𝐿
𝜓(𝐿) − 𝜓(0) = − ∫ ⟨v1 ′ (𝑠), v2 (𝑠)⟩ 𝑑𝑠
0
𝐿
, v2 ⟩𝑢 ′ (𝑠) + ⟨ , v2 ⟩𝑣 ′ (𝑠)) 𝑑𝑠
𝜕v1 𝜕v1
= −∫ (⟨
0 𝜕𝑢 𝜕𝑣
𝜕 𝜕v1 𝜕 𝜕v1
= −∬ { ⟨ , v2 ⟩ − ⟨ , v2 ⟩} 𝑑𝑢𝑑𝑣
Φ−1 (Ω) 𝜕𝑢 𝜕𝑣 𝜕𝑣 𝜕𝑢
(by Green’s theorem)
𝜕v1 𝜕v2 𝜕v1 𝜕v2
=∬ {⟨ , ⟩−⟨ , ⟩} 𝑑𝑢𝑑𝑣.
Φ−1 (Ω) 𝜕𝑢 𝜕𝑣 𝜕𝑣 𝜕𝑢
𝜕v1 𝜕v2
𝑁𝑢 = ⟨𝑁𝑢 , v1 ⟩v1 + ⟨𝑁𝑢 , v2 ⟩v2 = −⟨𝑁, ⟩v1 − ⟨𝑁, ⟩v2 ,
𝜕𝑢 𝜕𝑢
𝜕v1 𝜕v2
𝑁 𝑣 = −⟨𝑁, ⟩v1 − ⟨𝑁, ⟩v2 ,
𝜕𝑣 𝜕𝑣
If we write
𝜕v1 𝜕v1 𝜕v2 𝜕v2
= 𝑎v2 + ⟨𝑁, ⟩𝑁, = 𝑏v1 + ⟨𝑁, ⟩𝑁,
𝜕𝑢 𝜕𝑢 𝜕𝑣 𝜕𝑣
and analogously
𝜕v1 𝜕v2 𝜕v1 𝜕v2
⟨ , ⟩ = ⟨𝑁, ⟩⟨𝑁, ⟩⋅
𝜕𝑣 𝜕𝑢 𝜕𝑣 𝜕𝑢
Together with (4.28), the last two equalities give
114 4 The Intrinsic Geometry of Surfaces
and therefore
𝜕v1 𝜕v2 𝜕v1 𝜕v2
⟨ , ⟩−⟨ , ⟩ = ⟨𝑁𝑢 × 𝑁 𝑣 , 𝑁⟩
𝜕𝑢 𝜕𝑣 𝜕𝑣 𝜕𝑢
√
= 𝐾⟨Φ𝑢 × Φ𝑣 , 𝑁⟩ = 𝐾 𝐸𝐺 − 𝐹 2 .
This concludes the proof of (4.27) and hence that of the lemma. ◻
Observations 4.5.3 A. The difference 𝜓(𝐿) − 𝜓(0) in Lemma 4.5.2 is the angle
between the initial and final positions of a vector that is carried in parallel along a
closed curve. If 𝐾 has constant (nonzero) sign on Ω, Lemma 4.5.2 shows that this
angle is nonzero and becomes smaller as the region bounded by the curve becomes
smaller. It follows that a surface has constant zero curvature if parallel transport along
any curve on the surface depends only on the starting and ending points of the curve
(and not on the path t covered). The reader is invited to elaborate the argument in the
exercises in this section.
1 𝜕 𝜕v2 𝜕 𝜕v2
𝐾=√ { ⟨v1 , ⟩− ⟨v1 , ⟩} (4.29)
𝐸𝐺 − 𝐹2 𝜕𝑢 𝜕𝑣 𝜕𝑣 𝜕𝑢
1 𝜕 𝐷v2 𝜕 𝐷v2
=√ { ⟨v1 , ⟩− ⟨v1 , ⟩} , (4.30)
𝐸𝐺 − 𝐹2 𝜕𝑢 𝜕𝑣 𝜕𝑣 𝜕𝑢
and formula (4.30) gives another proof of Gauss’s Theorema Egregium, for it
expresses 𝐾 via intrinsic quantities: in fact, any isometry transforms (v1 , v2 ) into
another orthonormal dihedron and, by (4.10) in Section 4.3, preserves the covariant
derivative. We can further apply (4.29) to express 𝐾 as a function of the coefficients
𝐸, 𝐹 and 𝐺. For simplicity, we assume that Φ(𝑢, 𝑣) is an orthogonal parameterization
(𝐹 ≡ 0), so v1 = √1 Φ𝑢 and v2 = √1 Φ𝑣 . We then have
𝐸 𝐺
𝜕v2 𝜕 1 1 𝜕v2 𝜕 1 1
= ( √ ) Φ𝑣 + √ Φ𝑣 , = ( √ ) Φ𝑣 + √ Φ𝑣𝑢 ,
𝜕𝑣 𝜕𝑣 𝐺 𝐺 𝜕𝑢 𝜕𝑢 𝐺 𝐺
so that
𝜕v2 1 −1 −𝐺 𝑢
⟨v1 , ⟩= √ ⟨Φ𝑢 , Φ𝑣𝑣 ⟩ = √ ⟨Φ𝑢𝑣 , Φ𝑣 ⟩ = √ ,
𝜕𝑣 𝐸𝐺 𝐸𝐺 2 𝐸𝐺
𝜕v2 1 𝐸𝑣
⟨v1 , ⟩= √ ⟨Φ𝑢 , Φ𝑢𝑣 ⟩ = √ ⋅
𝜕𝑢 𝐸𝐺 2 𝐸𝐺
Finally, by (4.29), we have
4.5 The Gauss-Bonnet Theorem 115
−1 𝜕 𝐺𝑢 𝜕 𝐸𝑣
𝐾= √ { (√ )+ (√ )} , (4.31)
2 𝐸𝐺 𝜕𝑢 𝐸𝐺 𝜕𝑣 𝐸𝐺
which is the promised formula. ◻
Figure 4.7
Each interior edge of Ω is run through twice in opposite directions (the geodesic
curvature showing up with opposed sign), and by summing the above formulas for
𝑗 = 1, . . . , 𝐹 only the edges that make up 𝜕Ω are left. We thus obtain
116 4 The Intrinsic Geometry of Surfaces
𝐹 3
∫ 𝐾 𝑑𝜎 + ∫ 𝐾𝑔 (𝑠) 𝑑𝑠 + ∑ ∑ 𝛾 𝑙𝑗 = 2𝜋𝐹. (4.32)
Ω 𝜕Ω 𝑗=1 𝑙=1
● if 𝑣 ∈ V1 then
● if 𝑣 ∈ V2 then
— because each edge is counted twice (once for each one of its endpoints). Replacing
in (4.32) and using Lemma 4.5.4 below, we obtain
𝑘
∫ 𝐾 𝑑𝜎 + ∫ 𝑘 𝑔 (𝑠) 𝑑𝑠 + ∑ 𝛾𝑖 = 2𝜋(𝑉 − 𝐴 + 𝐹) = 2𝜋,
Ω 𝜕Ω 𝑖=1
Proof We have to show that for every triangulation of a polygonal region we have
𝑉 − 𝐴 + 𝐹 = 1. We proceed by induction on the number of faces 𝐹. If 𝐹 = 1, then
there are three vertices and three edges and the formula is true. If 𝐹 > 1, then let Δ 𝑗
̃ = Ω ∖ Δ 𝑗 is a
be a face where at least one of the edges is part of 𝜕Ω, and such that Ω
polygonal region. Consider in Ω ̃ the triangulation induced by the triangulation of Ω.
Denoting by 𝑉,̃ 𝐴̃ and 𝐹̃ the numbers of vertices, edges and faces of the triangulation
̃ we have:
of Ω,
̃ = 𝐴−1, 𝐹̃ = 𝐹−1;
̃ = 𝑉, 𝐴
● if 𝜕Ω contains a single edge of Δ 𝑗 , then 𝑉
● if 𝜕Ω contains two edges of Δ 𝑗 , then 𝑉 ̃
̃ = 𝑉−1, 𝐴 = 𝐴−2, 𝐹̃ = 𝐹−1.
̃ ̃ ̃
In both cases, 𝑉 − 𝐴 + 𝐹 = 𝑉 − 𝐴 + 𝐹 and the proof by induction is complete. ◻
There is a special case of the Gauss-Bonnet formula that is worthy to note: if the
boundary of Ω consists of geodesic arcs (𝑘 𝑔 ≡ 0), we are left with
𝑘
∫ 𝐾 𝑑𝜎 + ∑ 𝛾𝑖 = 2𝜋,
Ω 𝑖=1
When 𝑘 = 3, Ω is called a geodesic triangle, and we have just obtained the promised
generalization of Girard’s formula:
Corollary 4.5.5 The difference between the sum of the interior angles of a geodesic
triangle Δ and 𝜋 is given by the integral, extended to Δ, of the Gaussian curvature:
(𝜂1 + 𝜂2 + 𝜂3 ) − 𝜋 = ∫ 𝐾 𝑑𝜎.
Δ
Proof Writing down the Gauss-Bonnet formula for each of the faces of a triangulation
T and summing them up, the integrals of the geodesic curvatures all cancel, because
118 4 The Intrinsic Geometry of Surfaces
each edge is run through twice in opposite directions. Furthermore, the vertices of T
are all inside 𝑆 (of type V1 ) and the sum of the interior angles adjacent to each of
them is 2𝜋. By the above calculations, we are left with
∫ 𝐾 𝑑𝜎 = 2𝜋(𝑉 − 𝐴 + 𝐹),
𝑆
The integral ∫𝑆 𝐾 𝑑𝜎 is called the total curvature of 𝑆. The equation now obtained
shows that 𝜒(𝑆) is welldefined, independent of the triangulation of 𝑆 chosen to
compute it. Furthermore, 𝜒(𝑆) is invariant under diffeomorphisms (because a
diffeomorphism 𝑓 ∶ 𝑆 → 𝑆2 maps any triangulation T of 𝑆 to another triangulation
𝑓 (T ) of 𝑆2 with the same number of vertices, edges and faces). We thus obtain the
following result.
Corollary 4.5.7 Any two diffeomorphic compact surfaces have the same total curva-
ture.
For example, any surface 𝑆 diffeomorphic to the sphere has total curvature 4𝜋.
This is not surprising if the curvature of 𝑆 is positive at all points (as in the ellipsoid),
because in this situation we will see later on that 𝑁∶ 𝑆 → S2 is a diffeomorphism,
and ∫𝑆 𝐾 𝑑𝜎 is nothing but the area of the image of 𝑆 under 𝑁 (cf. Exercise 64 in
Section 3.2), which in this case is S2 . But if we call to mind that 𝑆 can also have
regions of negative curvature we more readily appreciate the strength of the result.
Figure 4.8
The Euler characteristic (and hence the total curvature) of the torus is zero; that of
the double torus is −2. That of the 𝑛-torus (𝑛 ≥ 1) is 2 − 2𝑛. Together with the sphere,
and up to diffeomorphisms, this list exhausts all orientable compact surfaces (see
Fig. 4.8); this is a classical result whose proof you can find, for example, in [17]. In
particular, among the compact surfaces, only those which are diffeomorphic to the
sphere have non-negative total curvature.
4.5 The Gauss-Bonnet Theorem 119
Example 4.5.8 A non-compact surface may have finite total curvature. Consider, on
a surface of revolution 𝑆 given by 𝜌 = 𝜌(𝑧), 𝑧 ∈ R , the region Ω(𝑧0 , 𝑧 1 ) bounded
by the two parallels 𝑧 = 𝑧 0 and 𝑧 = 𝑧1 (𝑧0 < 𝑧1 ). We can break Ω(𝑧 0 , 𝑧 1 ) into two
“four-sided polygons” by two meridians and add up the two resulting Gauss-Bonnet
formulas. Since there are four vertices and the sum of the two exterior angles adjacent
to each vertex is 𝜋, we obtain
∫ 𝐾 𝑑𝜎 + ∫ 𝑘 𝑔 𝑑𝑠 + 4𝜋 = 4𝜋,
Ω(𝑧0 ,𝑧1 ) 𝜕Ω(𝑧0 ,𝑧1 )
i.e.,
∫ 𝐾 𝑑𝜎 = ∫ −𝑘 𝑔 𝑑𝑠.
Ω(𝑧0 ,𝑧1 ) 𝜕Ω(𝑧0 ,𝑧1 )
z1
z0
Figure 4.9
−𝜌(𝑧0 ) 𝜌(𝑧 1 )
𝑘 𝑔 (𝑧 0 ) = √ , 𝑘 𝑔 (𝑧1 ) = √ ,
2 2
𝜌(𝑧0 ) 1 + 𝜌(𝑧0 ) 𝜌(𝑧 1 ) 1 + 𝜌(𝑧1 )
and therefore
120 4 The Intrinsic Geometry of Surfaces
⎛ 𝜌(𝑧1 ) 𝜌(𝑧0 ) ⎞
∫ 𝐾 𝑑𝜎 = −2𝜋 ⎜ √ −√ ⎟⋅
Ω(𝑧0 ,𝑧1 ) ⎝ 1 + 𝜌(𝑧1 )2 1 + 𝜌(𝑧0 ) ⎠
2
∫ 𝐾 𝑑𝜎 = 𝑧1lim
→+∞ ∫
𝐾 𝑑𝜎
𝑆 Ω(𝑧0 ,𝑧1 )
𝑧0 →−∞
It would not be difficult to compute directly ∫Ω(𝑧0 ,𝑧1 ) 𝐾 𝑑𝜎 (see example 3.2.1) —
but, besides illustrating a use of the Gauss-Bonnet Theorem, this method is applicable
to surfaces other than surfaces of revolution (e.g., to those non-compact surfaces 𝑆
for which 𝑆 ∖ ℭ is a surface of revolution for some compact ℭ ⊆ 𝑆). ◻
Exercises
96. Assume that 𝑆 has non-positive curvature at all its points. Show that if two
geodesics start from the same point in 𝑆, they cannot meet again in such a way that
their traces constitute the boundary of a simple region of 𝑆 (in particular, no closed
geodesic of 𝑆 can be the boundary of a simple region).
97. Show that if 𝛾 is a regular closed simple curve in S2 then ∣∫𝛾 𝑘 𝑔 𝑑𝑠∣ < 2𝜋.
98. Let 𝑝 ∈ 𝑆 be such that 𝐾(𝑝) > 0, and let (𝑈, Φ) be a parameterization such
that 𝑝 ∈ Φ(𝑈) and 𝐾 ○ Φ(𝑢, 𝑣) > 0 for all (𝑢, 𝑣) ∈ 𝑈. Consider a family of circles
(C𝑟 )0<𝑟< 𝛿 such that:
i. each C𝑟 has radius 𝑟;
ii. for every 𝑟, Φ−1 (𝑝) ∈ C𝑟 and the closed disk bounded by C𝑟 is contained in 𝑈;
iii. if 𝑟 < ̃
𝑟 then C𝑟 is inside C̃𝑟 .
Further denote by 𝜓𝑟 ∈ [−𝜋, 𝜋] the oriented angle between the initial and final
positions of a vector carried parallel from 𝑝 to 𝑝 along the closed curve Φ(C𝑟 ) in 𝑆.
(a) Show that there exists 𝜀 ∈ ]0, 𝛿[ such that
0<𝑟 <̃
𝑟 < 𝜀 ⇒ 0 < ∣𝜓𝑟 ∣ < ∣𝜓̃𝑟 ∣ < 𝜋.
(b) Conclude that any surface on which parallel transport depends only on the
initial and final points of the curve has constant zero curvature:
Give an example of a surface with constant zero curvature that does not have this
property.
4.6 Minimizing Properties of Geodesics 121
Figure 4.10
99. Is it true that two non-compact diffeomorphic surfaces necessarily have the same
total curvature?
1 ′ 1 ′
𝝉1 = 𝛼 = (𝑢 Φ𝑢 + 𝑣 ′ Φ𝑣 ).
𝑚 𝑚
From formula (4.9), we obtain
122 4 The Intrinsic Geometry of Surfaces
𝐷𝝉1 1
= {(𝑢 ′′ + 𝑢 ′2 Γ111
+ 2𝑢 ′ 𝑣 ′ Γ12
1
+ 𝑣 ′2 Γ22
1
)Φ𝑢
𝑑𝑡 𝑚
+ (𝑣 ′′ + 𝑢 ′2 Γ11
2
+ 2𝑢 ′ 𝑣 ′ Γ12
2
+ 𝑣 ′2 Γ22
2
)Φ𝑣 }
𝐷𝝉1
— and, if 𝛼 is a geodesic, then ≡ 0, so that
𝑑𝑡
⎧
⎪
⎪𝑢 ′′ + 𝑢 ′2 Γ11
1
+ 2𝑢 ′ 𝑣 ′ Γ12
1
+ 𝑣 ′2 Γ22
1
= 0,
⎨ ′′ (4.33)
⎪
⎪ + ′2
Γ 2
+ 2𝑢 ′ ′
Γ 2
+ ′2
Γ 2
=
22 0.
⎩
𝑣 𝑢 11 𝑣 12 𝑣
𝑡 ↦ 𝜑̃(𝑡; 𝑢, 𝑣, 𝑎, 𝑏)
is the only solution of the system (4.35) with initial condition 𝜑̃(0; 𝑢, 𝑣, 𝑎, 𝑏) =
(𝑢, 𝑣, 𝑎, 𝑏).
4.6 Minimizing Properties of Geodesics 123
𝑑
𝜑(0; 𝑢, 𝑣, 𝑎, 𝑏) = (𝑢, 𝑣), 𝜑(𝑡; 𝑢, 𝑣, 𝑎, 𝑏)∣𝑡=0 = (𝑎, 𝑏).
𝑑𝑡
In summary, determining (𝑢(0), 𝑣(0)) and (𝑢 ′ (0), 𝑣 ′ (0)) completely determines
a solution (𝑢(𝑡), 𝑣(𝑡)) of (4.33). The solutions depend differentiably on these initial
conditions. Let us now define
𝐷Φ∶ 𝑈 × R 2 → ⋃ ({𝑝} × 𝑇 𝑝 𝑆)
𝑝∈Φ(𝑈)
(4.36)
(𝑢, 𝑣, 𝑎, 𝑏) ↦ (Φ(𝑢, 𝑣), (𝑎Φ𝑢 + 𝑏Φ𝑣 )∣(𝑢,𝑣) )
and it follows from our analysis that for every (𝑝, v) ∈ 𝐵 𝛿 (𝑉), the curve 𝑡 ↦ 𝛾(𝑡; 𝑝, v)
is the only parametrized geodesic that at time 0 passes through 𝑝 with velocity v; the
constant parametrized geodesics are those of the form 𝛾(𝑡; 𝑝, 0). Furthermore, for
𝜆 ∈ R we have
𝛾(𝜆𝑡; 𝑝, v) = 𝛾(𝑡; 𝑝, 𝜆v), (4.37)
because the two terms of (4.37) are parametrized geodesics satisfying the same initial
conditions: at time 0, they both pass through 𝑝 with velocity 𝜆v. It follows that, up to
reparametrization, there is exactly one geodesic whose tangent line at a given point
has a given direction.
Observation 4.6.1 At this point we can already state that there are no geodesics
on the sphere other than the maximal circles (see example 4.3.9), since through
each point passes a maximal circle tangent to each given direction. The sphere thus
has the particularity that all its geodesics are closed. (A non-constant parametrized
geodesic 𝛾(𝑡) is called closed if it is periodic — that is, if there exists 𝑇 > 0 such
that 𝛾(𝑡 + 𝑇) = 𝛾(𝑡) for all 𝑡 ∈ R . A necessary and sufficient condition for 𝛾 to be
124 4 The Intrinsic Geometry of Surfaces
closed is that there exist 𝑡1 < 𝑡 2 such that 𝛾(𝑡 1 ) = 𝛾(𝑡2 ) and 𝛾 ′ (𝑡 1 ) = 𝛾 ′ (𝑡 2 ) — i.e., 𝛾
is closed if and only if it passes again through the same point with the same velocity
vector.) Surprisingly, there are other surfaces with the same property, as shown in [4].
◻
Let us now apply (4.37) to show that, choosing ∣v∣ sufficiently small, 𝛾(𝑡; 𝑝, v)
is defined for ∣𝑡∣ < 2. In fact, since 𝛾(𝑡; 𝑝, v) = 𝛾( 2𝜀 𝑡; 𝑝, 2𝜀 v), we can state that, for
∣ 2𝜀 v∣ < 𝛿 (i.e., for ∣v∣ < 𝜀2𝛿 ), 𝛾(𝑡; 𝑝, v) is defined whenever ∣ 2𝜀 𝑡∣ < 𝜀 — i.e., whenever
∣𝑡∣ < 2. In summary:
For every 𝑝 0 ∈ 𝑆 there exist 𝜂 > 0 and a neighborhood 𝑉 of 𝑝 0 such that whenever
𝑝 ∈ 𝑉 and v ∈ 𝐵 𝜂 (𝑝) = {v ∈ 𝑇 𝑝 𝑆∶ ∣v∣ < 𝜂} the geodesic 𝑡 ↦ 𝛾(𝑡; 𝑝, v) is defined for
𝑡 ∈ ] −2, 2[.
Given 𝑝 ∈ 𝑆, the exponential mapping exp 𝑝 is defined by exp 𝑝 (v) = 𝛾(1; 𝑝, v).
By the above reasoning, there exists some 𝜂 > 0 such that exp 𝑝 is defined on 𝐵 𝜂 (𝑝);
one can choose such an 𝜂 suitable for all points in a neighborhood of 𝑝.
The geodesic 𝑡 ↦ 𝛾(𝑡; 𝑝, v) has constant scalar velocity ∣v∣, and therefore its arc
length in the interval [0, 1] is also ∣v∣. The geometric meaning of the exponential
mapping is therefore as follows: exp 𝑝 (v) is the point that travels a distance of ∣v∣ on
the geodesic that begins at 𝑝 and whose direction and orientation is given by v. Note
that, by (4.37), we have exp 𝑝 (𝑡v) = 𝛾(1; 𝑝, 𝑡v) = 𝛾(𝑡; 𝑝, v) — which means that the
geodesics starting from 𝑝 are the image of the lines (or line segments) in 𝑇 𝑝 𝑆 that
pass through the origin under exp 𝑝 .
To make full use of the exponential mapping, we need the next proposition:
Proposition 4.6.2 Given 𝑝 0 ∈ 𝑆, there exist 𝛿 > 0 and an open neighborhood 𝑊 ⊆ 𝑆
of 𝑝 0 such that, for all 𝑝 ∈ 𝑊, exp 𝑝 ∣ 𝐵 ( 𝑝) is a diffeomorphism onto the image.
𝛿
Proof We know that there exist 𝜂 > 0 and an open neighborhood 𝑉 ⊆ 𝑆 of 𝑝 0 such
that, for all 𝑝 on 𝑉, the exponential mapping exp 𝑝 is defined on 𝐵 𝜂 (𝑝). We can thus
consider the differentiable mapping
𝐹∶ 𝐵 𝜂 (𝑉) Ð→ 𝑆 × 𝑆
(𝑝, v) z→ (𝑝, exp 𝑝 (v)).
(A caveat: both 𝐵 𝜂(𝑉) and 𝑆 × 𝑆 are spaces of dimension four, since 𝑆 × 𝑆 is the
product of two spaces of dimension two and 𝐵 𝜂 (𝑉) can be identified, via 𝐷Φ as
defined in (4.36), with an open subset of R 4 . Our proof can be made rigorous by
−1
applying the inverse mapping theorem to (Φ × Φ) ○ 𝐹 ○ 𝐷Φ, which is the expression
of 𝐹 in “local coordinates”.)
The tangent spaces to 𝐵 𝜂(𝑉) at (𝑝 0 , 0) and to 𝑆 ×𝑆 at (𝑝 0 , 𝑝 0 ) = 𝐹(𝑝 0 , 0) coincide:
both are 𝑇 𝑝0 𝑆 × 𝑇 𝑝0 𝑆. Given v ∈ 𝑇 𝑝0 𝑆, let 𝛼 be a curve in 𝑉 such that 𝛼(0) = 𝑝 0 and
𝛼′ (0) = v. Then 𝑡 ↦ (𝛼(𝑡), 0) is a curve in 𝐵 𝜂 (𝑉) that passes through (𝑝 0 , 0) with
velocity (v, 0). Thus
𝑑 𝑑
𝐷𝐹( 𝑝0 ,0) (v, 0) = 𝐹(𝛼(𝑡), 0)∣𝑡=0 = (𝛼(𝑡), 𝛼(𝑡))∣𝑡=0 = (v, v).
𝑑𝑡 𝑑𝑡
4.6 Minimizing Properties of Geodesics 125
𝜂
On the other hand, if w ∈ 𝑇 𝑝0 𝑆 ∖ {0} then the curve 𝑡 ↦ (𝑝 0 , 𝑡w), ∣𝑡∣ < , is
∣w∣
contained in 𝐵 𝜂 (𝑉) and passes through (𝑝 0 , 0) with velocity (0, w), and therefore
𝑑 𝑑
𝐷𝐹( 𝑝0 ,0) (0, w) = 𝐹(𝑝 0 , 𝑡w)∣𝑡=0 = (0, exp 𝑝0 (𝑡w)∣𝑡=0 ) = (0, w).
𝑑𝑡 𝑑𝑡
We thus conclude that
– which shows that 𝐷𝐹( 𝑝0 ,0) is a linear isomorphism. The inverse mapping theorem
then guarantees that the restriction of 𝐹 to some neighborhood of (𝑝 0 , 0) in 𝐵 𝜂 (𝑉)
is a diffeomorphism onto the image, and we can choose such a neighborhood of the
form 𝐵 𝛿 (𝑊), where 𝑊 is an open subset of 𝑆 and 𝛿 > 0; it is easily verified that these
choices of 𝑊 and 𝛿 satisfy the desired condition. ◻
The coordinates Ψ(𝜌, 𝜑) are known as geodesic polar coordinates, and we call
Φ(𝑢, 𝑣) geodesic Cartesian coordinates. Of course, these coordinates depend on the
choice of the basis (v1 , v2 ). If we want them to respect the orientation of 𝑆, it suffices
that (v1 , v2 ) are positively oriented. In the case of geodesic polar coordinates, we
obtain different parameterizations by restricting 𝜑 to intervals of length 2𝜋, and each
of these parameterizations excludes a radial geodesic (𝜑 = constant). However, since
the excluded radial geodesic is arbitrary, the conclusions we draw with one of these
parameterizations are valid throughout the “punctured disc” 𝐷 𝛿 (𝑝) ∖ {𝑝}.
The next lemma says, among other things, that Ψ(𝜌, 𝜑) is an orthogonal parame-
terization. Geometrically, this means that the radial geodesics (𝜑 = constant) starting
from 𝑝 and the circumference geodesics (𝜌 = constant) with center 𝑝 intersect
orthogonally (see Fig. 4.11.).
TpS expp
S p
Figure 4.11
since 𝜌 ↦ exp 𝑝 (𝜌v𝝋 ) is a parametrized geodesic, and so has constant scalar velocity.
𝐷Ψ𝜌
Furthermore, denoting by the derivative covariant of Ψ𝜌 along 𝜌 ↦ Ψ(𝜌, 𝜑) —
𝜕𝜌
which is zero because Ψ𝜌 is the velocity field of a parametrized geodesic — we have
𝜕 𝐷Ψ𝜌
𝐹𝜌 = ⟨Ψ𝜌 , Ψ𝜑⟩ = ⟨ , Ψ𝜑 ⟩ + ⟨Ψ𝜌 , Ψ𝜑𝜌 ⟩
𝜕𝜌 𝜕𝜌
𝜕 1
= ( ∣Ψ𝜌 ∣2 ) = 0 (because 𝐸 ≡ 1).
𝜕𝜑 2
We thus conclude that 𝐹 does not depend on 𝜌. Since
we have
∣𝐹∣ ≤ 𝜌∣𝐷(exp 𝑝 )𝜌v𝝋 (v𝝋 )∣ ∣𝐷(exp 𝑝 )𝜌v𝝋 (w𝝋 )∣,
which, together with the facts that 𝐷(exp 𝑝 )𝑂 is the identity (as we saw in the proof
of Proposition 4.6.2) and v𝝋 and w𝝋 are unit vectors, implies lim 𝐹 = 0 and therefore
𝜌→0
𝐹 ≡ 0.
It remains to prove the last two equalities. For this let us consider the geodesic
Cartesian coordinates Φ(𝑢, 𝑣), whose coefficients we denote by 𝐸, 𝐹, 𝐺. Noting that
for (𝑢, 𝑣) ≠ (0, 0), we have 𝑢 = 𝜌 cos 𝜑, 𝑣 = 𝜌 sin 𝜑 and that at (0, 0) the coefficients
𝐸, 𝐹, 𝐺 are equal to 1, 0, 1 (because Φ𝑢 ∣(0, 0) = v1 and Φ𝑣 ∣(0,0) = v2 ), we have
4.6 Minimizing Properties of Geodesics 127
√ √ √
√ 𝜕(𝑢, 𝑣) 2 2
𝐺 = 𝐸𝐺 − 𝐹 2 = ∣ ∣ 𝐸𝐺−𝐹 = 𝜌 𝐸𝐺−𝐹
𝜕(𝜌, 𝜑)
√
and therefore lim 𝐺 = 0. Finally, differentiating the last equality, we obtain
𝜌→0
√ √
√ 2 𝜕 2
( 𝐺)𝜌 = 𝐸𝐺−𝐹 +𝜌 ( 𝐸𝐺−𝐹 )
𝜕𝜌
√ √ √
2 𝜕 2 𝜕 2
= 𝐸𝐺 −𝐹 + 𝜌{ ( 𝐸 𝐺 − 𝐹 ) cos 𝜑 + ( 𝐸 𝐺 − 𝐹 ) sin 𝜑} ,
𝜕𝑢 𝜕𝑣
and thus √
√ 2
lim ( 𝐺)𝜌 = lim 𝐸 𝐺 − 𝐹 = 1. ◻
𝜌→0 (𝑢,𝑣)→(0,0)
Example 4.6.4 Let us make explicit a geodesic coordinate system in S2 with ori-
gin at the North Pole N = (0, 0, 1). Fixing the orthonormal basis v1 = (1, 0, 0),
v2 = (0, 1, 0), the geodesic 𝜑 = constant is the meridian that forms angle 𝜑
with the plane 𝑦 = 0; its parameterization by arc length is 𝑠 (0 ≤ 𝑠 < 𝜋) ↦
(sin 𝑠 cos 𝜑, sin 𝑠 sin 𝜑, cos 𝑠), and we find the point Ψ(𝜌, 𝜑) at distance 𝜌 from
N . Thus Ψ(𝜌, 𝜑) = (sin 𝜌 cos 𝜑, sin 𝜌 sin 𝜑, cos 𝜌) — which shows that these
geodesic coordinates are just the spherical coordinates. Let us also point out that
S2 ∖ {(0, 0, −1)} = 𝐷 𝜋 (N ) is a normal neighborhood of N , because it is the
diffeomorphic image of the disk 𝐵 𝜋 (N ) under expN . ◻
We can now almost specify under what conditions a geodesic describes a shortest
path on the surface. But first, let us define the intrinsic distance on the surface, which
is the distance “experienced” by those who move on it. For a connected surface 𝑆
and 𝑝, 𝑞 ∈ 𝑆, we define 𝑑(𝑝, 𝑞) to be the least of the lengths of the curves in 𝑆 that
connect 𝑝 to 𝑞:
The intrinsic distance 𝑑 is a true distance, since it enjoys the following three properties:
(i) 𝑑(𝑝, 𝑞) ≥ 0 and 𝑑(𝑝, 𝑞) = 0 if and only if 𝑝 = 𝑞 (positivity);
(ii) 𝑑(𝑝, 𝑞) = 𝑑(𝑞, 𝑝) (symmetry);
(iii) 𝑑(𝑝, 𝑞) ≤ 𝑑(𝑝, 𝑟) + 𝑑(𝑟, 𝑞) (triangular inequality).
These properties are easily verified; we prove property (iii) via an example. If
𝛼1 ∶ [𝑎, 𝑏] → 𝑆 is a curve from 𝑝 to 𝑟, and 𝛼2 ∶ [𝑐, 𝑑] → 𝑆 another from 𝑟 to 𝑞, then
their juxtaposition 𝛼1 ∗ 𝛼2 , defined on [𝑎, 𝑏 + 𝑑 − 𝑐] by
⎧
⎪
⎪𝛼1 (𝑡) if 𝑎 ≤ 𝑡 ≤ 𝑏
(𝛼1 ∗ 𝛼2 )(𝑡) = ⎨
⎪
⎪ 𝛼 (𝑡 + 𝑐 − 𝑏) if 𝑏 ≤ 𝑡 ≤ 𝑏 + 𝑑 − 𝑐,
⎩ 2
128 4 The Intrinsic Geometry of Surfaces
Proof Take geodesic polar coordinates Ψ(𝜌, 𝜑) centered at 𝑝, and let (𝜌0 , 𝜑0 ) be
such that Ψ(𝜌0 , 𝜑0 ) = 𝑞. Given a curve 𝛼∶ [𝑎, 𝑏] → 𝑆, piecewise regular, such that
𝛼(𝑎) = 𝑝, 𝛼(𝑏) = 𝑞, our goal is to show that 𝑙(𝛼) > 𝜌0 , unless 𝛼 is a reparametrization
of the radial geodesic 𝜌 (0 ≤ 𝜌 ≤ 𝜌0 ) ↦ Ψ(𝜌, 𝜑0 ).
We first deal with the case where 𝛼([𝑎, 𝑏]) ⊆ 𝐷 𝛿 (𝑝). We may assume, truncating
the curve if necessary, that 𝛼(𝑡) ≠ 𝑝 for all 𝑡 ∈ ]𝑎, 𝑏]. Under this hypothesis, there
exist functions 𝜌(𝑡) and 𝜑(𝑡), piecewise differentiable, such that 𝜑(𝑏) = 𝜑0 and
𝛼(𝑡) = Ψ(𝜌(𝑡), 𝜑(𝑡)) for 𝑡 ∈ ]𝑎, 𝑏]. Using Lemma 4.6.3, we then have
𝑏√
𝑙(𝛼) = ∫ 𝐸 𝜌 2 + 2𝐹 𝜌𝜑 + 𝐺𝜑2 𝑑𝑡
𝑎
𝑏√ 𝑏√ 𝑏
=∫ 𝜌 2 + 𝐺𝜑2 𝑑𝑡 ≥ ∫ 𝜌 2 𝑑𝑡 ≥ ∫ 𝜌 𝑑𝑡 = 𝜌0 ,
𝑎 𝑎 𝑎
and equality holds only if 𝜑 ≡ 0 (i.e., if 𝜑 is constant) and 𝜌 ≥ 0, which implies that 𝛼
is the radial geodesic from 𝑝 to 𝑞.
If 𝛼([𝑎, 𝑏]) is not contained in 𝐷 𝛿 (𝑝) then, given 0 < 𝑟 < 𝛿, let 𝑡𝑟 be the first
point such that 𝛼(𝑡𝑟 ) belongs to the geodesic circumference 𝜌 = 𝑟. By the above
conclusion, we have 𝑙(𝛼) ≥ 𝑙(𝛼∣[𝑎,𝑡 ] ) ≥ 𝑟. This inequality, valid for all 𝑟 < 𝛿, implies
𝑟
that 𝑙(𝛼) ≥ 𝛿 > 𝜌0 . ◻
Observations 4.6.7 A. From the proof of Proposition 4.6.6 it also follows that, for
every point 𝑞 ∈ 𝑆 ∖ 𝐷 𝛿 (𝑝), we have 𝑑(𝑝, 𝑞) ≥ 𝛿. In short, if we put, for arbitrary
𝛿 > 0, 𝐷 𝛿 (𝑝) = {𝑞 ∈ 𝑆∶ 𝑑(𝑝, 𝑞) < 𝛿}, this agrees with our previous definition when
𝐷 𝛿 (𝑝) is a normal neighborhood of 𝑝.
4.6 Minimizing Properties of Geodesics 129
B. We should stress that we can only guarantee locally that geodesics are minimizing.
On the cylinder 𝑥 2 + 𝑦 2 = 1, for example, the geodesic 𝛼(𝑡) = (cos 𝑡, sin 𝑡, 𝑡) does
not minimize the distance between 𝛼(0) = (1, 0, 0) and 𝛼(2𝜋) = (1, 0, 2𝜋): the line
segment {(1, 0)} × [0, 2𝜋] is shorter. ◻
Lemma 4.6.9 Let 𝑓 ∶ [𝑎, 𝑏[→ R be a continuous function, differentiable on ]𝑎, 𝑏[,
and such that lim+ 𝑓 ′ (𝑥) exists and is finite. Then 𝑓 is differentiable at 𝑎 and
𝑥→𝑎
𝑓 ′ (𝑎) = lim+ 𝑓 ′ (𝑥).
𝑥→𝑎
with initial conditions 𝑥(0) = 0 and 𝑥(0) = 1. There are three cases to consider:
√ 1 √ 1 √
i. if 𝐾 < 0 then 𝐺 = √ sinh( −𝐾 𝜌), and therefore 𝐺 = − sinh2 ( −𝐾 𝜌);
√ −𝐾 𝐾
ii. if 𝐾 = 0 then 𝐺 = 𝜌, 𝐺 = 𝜌 2 ;
130 4 The Intrinsic Geometry of Surfaces
√ 1 √ 1 √
iii. if 𝐾 > 0 then𝐺 = √ sin( 𝐾 𝜌), 𝐺 = sin2 ( 𝐾 𝜌).
𝐾 𝐾
We thus conclude that if 𝐾 is constant then the coefficients 𝐸, 𝐹, 𝐺 of the
coordinates Ψ(𝜌, 𝜑) only depend on 𝐾. As a consequence we have the following
result.
Theorem 4.6.10 Any two surfaces of equal constant Gaussian curvature are locally
isometric.
Proof Assume that 𝑆1 and 𝑆2 have the same constant curvature. Given 𝑝 ∈ 𝑆1 and
𝑞 ∈ 𝑆2 , let 𝛿 > 0 be such that 𝐷 𝛿 (𝑝) and 𝐷 𝛿 (𝑞) are normal neighborhoods of
𝑝 and 𝑞. Fixing orthonormal bases (v1 , v2 ) on 𝑇 𝑝 𝑆1 and (w1 , w2 ) on 𝑇𝑞 𝑆2 , let
𝐿∶ 𝑇 𝑝 𝑆1 → 𝑇𝑞 𝑆2 be the linear isometry such that 𝐿(v𝑖 ) = w𝑖 (𝑖 = 1, 2). We shall see
−1
that 𝑓 = exp𝑞 ○𝐿 ○ exp 𝑝 ∣ 𝐵 ( 𝑝) is an isometry of 𝐷 𝛿 (𝑝) onto 𝐷 𝛿 (𝑞): in fact, 𝑓 is
𝛿
clearly a diffeomorphism. Furthermore, the geodesic coordinates Ψ(𝜌, 𝜑) in 𝐷 𝛿 (𝑝)
̃
associated with (v1 , v2 ) are sent by 𝑓 to the geodesic coordinates Ψ(𝜌, 𝜑) in 𝐷 𝛿 (𝑞)
associated with (w1 , w2 ). From what we have seen above, Ψ and Ψ ̃ have the same
coefficients, and therefore 𝑓 ∣𝐷 ( 𝑝)∖{ 𝑝} = Ψ ̃ ○ Ψ−1 is an isometry. Thus, and since
𝛿
𝐷 𝑓𝑞 = 𝐿, we see that 𝐷 𝑓𝑟 is a linear isometry for all 𝑟 ∈ 𝐷 𝛿 (𝑝), which completes
the proof. ◻
Exercises
100. (a) Check that the differential equations of the geodesics of a surface of revolution
parametrized by
Φ(𝑢, 𝑣) = (𝜌(𝑣) cos 𝑢, 𝜌(𝑣) sin 𝑢, 𝑧(𝑣))
are
⎧
⎪ ′′ 2𝜌 ′ ′
⎪
⎪𝑢 + 𝜌 𝑢 𝑣 = 0,
⎪
⎨ 𝜌𝜌 + 𝑧𝑧
⎪
⎪
⎪
𝜌𝜌
𝑣 ′′ − 2 2 𝑢 ′2 + 2 2 𝑣 ′2 = 0
⎪
⎩ 𝜌 +𝑧 𝜌 +𝑧
— where 𝜌, 𝑧, etc. denote the derivatives with respect to 𝑣, and 𝑢 ′ , 𝑣 ′ , etc. the
derivatives with respect to the curve parameter.
(b) Use these equations to conclude again that any meridian 𝑢 = constant is a
geodesic, and that the parallel 𝑣 = 𝑣 0 is a geodesic if and only if the tangent line to
the generating curve at 𝑣 0 is parallel to the 𝑧-axis.
(c) Show that if 𝛾(𝑠) = Φ(𝑢(𝑠), 𝑣(𝑠)) is a geodesic parametrized by arc length
then (𝜌(𝑠)) 𝑢 ′ (𝑠) is constant. Check that (𝜌(𝑠)) 𝑢 ′ (𝑠) = 𝜌(𝑠) cos 𝜃(𝑠), where
2 2
4.6 Minimizing Properties of Geodesics 131
𝜌(𝑠) = 𝜌 ○ 𝑣(𝑠) is the radius of the parallel where 𝛾 lies at time 𝑠 and 𝜃(𝑠) is the
angle of intersection of the curve with that parallel. (The equality 𝜌 cos 𝜃 = constant
is the Clairaut Equation, and plays a fundamental role in the study of geodesics on
surfaces of revolution.)
101. (In this exercise use the Clairaut Equation from Exercise 100.) Consider the
geodesic 𝛾 that starts from a point 𝑝 on the upper half (𝑧 > 0) of the hyperboloid of
revolution 𝑥 2 + 𝑦 2 − 𝑧2 = 1 and makes angle 𝜃 given by cos 𝜃 = 𝜌1 with the parallel
(with radius 𝜌) passing through 𝑝. Put 𝛾(𝑠) = (𝜌(𝑠) cos 𝜑(𝑠), 𝜌(𝑠) sin 𝜑(𝑠), 𝑧(𝑠)),
𝛾(0) = 𝑝. Show that:
(a) while 𝛾(𝑠) stays in the upper half 𝑧 > 0, however 𝑧′ (𝑠) ≠ 0;
(b) if 𝑧′ (0) < 0 then
(c) Show that if 𝐺 is of the form (4.40) then the Gaussian curvature along each
geodesic circumference 𝜌 = constant is constant.
(d) Conclude that the only oriented connected surfaces on which any geodesic
circumference has constant geodesic curvature are surfaces of constant Gaussian
curvature.
111. Show that a conformal mapping 𝑓 ∶ 𝑆1 → 𝑆2 that sends geodesics of 𝑆1 to
geodesics of 𝑆2 is necessarily a similarity — i.e., there exists 𝜆 > 0 such that
∣𝐷 𝑓 𝑝 (v)∣ = 𝜆∣v∣ for all 𝑝 ∈ 𝑆1 and v ∈ 𝑇 𝑝 𝑆1 .
112. Let 𝑝 be a point of 𝑆 and 𝛼(𝑢), ∣𝑢∣ < 𝜀, a regular curve in 𝑆 such that 𝛼(0) = 𝑝.
Choose along 𝛼 a unit vector field w(𝑢) of tangent vectors orthogonal to 𝛼′ (𝑢) and
write Φ(𝑢, 𝑣) = 𝛾(𝑣 − 𝑣 0 ; 𝛼(𝑢), w(𝑢)) (i.e., Φ(𝑢, ⋅) is the geodesic that at time 𝑣 0
passes through 𝛼(𝑢) with velocity w(𝑢)). Show that:
(a) Φ(𝑢, 𝑣) is a parameterization in a neighborhood of 𝑝 (the coordinates Φ(𝑢, 𝑣)
obtained this way are called semi-geodesic);
(b) Φ(𝑢, 𝑣) is an orthogonal parameterization;
(c) geodesic polar coordinates are an example of semi-geodesic coordinates.
113. Let 𝑆 be a connected oriented surface and let 𝑓 ∶ 𝑆 → 𝑆 be a local isometry. Show
that:
(a) if there exists 𝑝 ∈ 𝑆 such that 𝑓 (𝑝) = 𝑝 and 𝐷 𝑓 𝑝 = id ∣𝑇 𝑆 then 𝑓 is the identity
𝑝
(Hint: what are the geodesics that start from 𝑝 mapped to?);
4.6 Minimizing Properties of Geodesics 133
(b) if 𝑓 is not the identity and if there exists a regular curve 𝛼∶ 𝐼 → 𝑆 such that
𝑓 ○ 𝛼 = 𝛼 then 𝑓 reverses orientation of 𝑆;
(c) the curve 𝛼 in b) is a geodesic.
115. In this exercise we show that any compact surface has a triangulation (a fact
used when establishing the global version of the Gauss-Bonnet theorem). Fix 𝛿 > 0
so that, for all 𝑝 ∈ 𝑆, 𝐷 𝛿 (𝑝) is a normal neighborhood of 𝑝, and consider a family
(R 𝑘 )1≤ 𝑗≤𝑘 of geodesic triangles such that:
(a) Show that any two edges of two distinct R 𝑗 are either disjoint, or intersect at a
single point, or intersect along an arc common to both. Hence, the intersections of
the R 𝑗 form a finite number of regions Ω𝑙 .
(b) By properly triangulating each region Ω𝑙 , obtain a triangulation of 𝑆.
This shows that 𝑠 ↦ (𝜆 1 (𝑠), 𝜆 2 (𝑠)) is a mapping of [𝑎, 𝑏] into the unit circle S1 . If
the vector fields considered are differentiable, then also this mapping is differentiable,
and therefore (see note 1.3.1 and Exercise 7 in Section 1.3) there exists a differentiable
mapping 𝜑(𝑠) such that (𝜆1 (𝑠), 𝜆 2 (𝑠)) = (cos 𝜑(𝑠), sin 𝜑(𝑠)) — that is, such that
It is this function 𝜑(𝑠) that we call the continuous choice of the oriented angle
between v1 (𝑠) and w(𝑠). As we noted in note 1.3.1, any other choice of the same
angle is the sum of 𝜑(𝑠) with an integer multiple of 2𝜋.
For later use, it is convenient to obtain a description like the one we saw above for
functions [𝑎, 𝑏] → S1 that are not necessarily differentiable:
A.1 Lemma. Let 𝐹∶ [𝑎, 𝑏] → S1 be a continuous function. Then there is a lift of 𝐹,
that is, a continuous function 𝜑∶ [𝑎, 𝑏] → R such that 𝐹(𝑠) = (cos 𝜑(𝑠), sin 𝜑(𝑠)) for
all 𝑠 on [𝑎, 𝑏]. Every other lift of 𝐹 is the sum of 𝜑 with a constant integer multiple
of 2𝜋.
Proof Consider the mapping Π(𝑡) = (cos 𝑡, sin 𝑡) which wraps the line R into
S1 . Note that the restriction of Π to any interval [𝑡 1 , 𝑡 2 ] with 𝑡2 − 𝑡1 < 2𝜋 is a
homeomorphism onto its image, since [𝑡 1 , 𝑡 2 ] is compact and Π∣[𝑡 ,𝑡 ] is continuous
1 2
and injective. What we are looking for is a continuous function 𝜑 such that 𝐹 = Π ○ 𝜑.
The idea is to restrict 𝐹 to small intervals where we can apply a local inverse of Π to
both sides of this equality.
By uniform continuity of 𝐹, there exists 𝛿 > 0 such that for ∣𝑠 − 𝑡∣ < 𝛿, the
points 𝐹(𝑠) and 𝐹(𝑡) are never diametrically opposite in S1 . If we take a partition
𝑠0 < 𝑠1 < ⋯ < 𝑠 𝑘 of [𝑎, 𝑏] with 𝑠𝑖 − 𝑠𝑖−1 < 𝛿, then, for every 1 ≤ 𝑖 ≤ 𝑘, the image
𝐹([𝑠𝑖−1 , 𝑠𝑖 ]) is contained in a semi-circle. Let us now define 𝜑 recursively, starting
at the interval [𝑠0 , 𝑠1 ]. By construction, there exists an interval 𝐽1 , of amplitude
𝜋, such that the arc Π(𝐽1 ) contains 𝐹([𝑠0 , 𝑠1 ]): thus, for 𝑠 ∈ [𝑠0 , 𝑠1 ], we define
−1
𝜑(𝑠) = Π∣ 𝐽 ○ 𝐹(𝑠). Assuming we have defined 𝜑(𝑠) for all 𝑠 ∈ [𝑠0 , 𝑠𝑖−1 ], we take 𝐽𝑖 ,
1
of amplitude 𝜋, such that 𝜑(𝑠𝑖−1 ) ∈ 𝐽𝑖 and 𝐹([𝑠𝑖−1 , 𝑠𝑖 ]) ⊆ Π(𝐽𝑖 ), for 𝑠 ∈ [𝑠𝑖−1 , 𝑠𝑖 ], we
define 𝜑(𝑠) = Π∣−1
𝐽𝑖 ○ 𝐹(𝑠). This ends the construction of 𝜑. It is clear that 𝐹 = Π ○ 𝜑.
Since 𝜑 is continuous (because it is continuous on every interval [𝑠𝑖−1 , 𝑠𝑖 ]), 𝜑 is a lift
of 𝐹.
Regarding uniqueness of 𝜑 (minus a constant), the proof is given in note 1.3.1. ◻
Proof Let 𝜃(𝑡) be a lifting of the mapping 𝑡 ↦ 𝐹(0, 𝑡), whose existence is guaranteed
by A.1. Again, using A.1, let us next take, for every 𝑡 ∈ [𝑐, 𝑑], a lifting 𝑠 ↦ 𝜑(𝑠, 𝑡) of
𝑠 ↦ 𝐹(𝑠, 𝑡) that satisfies 𝜑(0, 𝑡) = 𝜃(𝑡). The function 𝜑 obtained this way satisfies
the equality 𝐹 = Π ○ 𝜑, and it remains to show that 𝜑 is continuous. By construction,
4.6 Minimizing Properties of Geodesics 135
its restriction to each of the horizontal segments [𝑎, 𝑏] × {𝑡}, and to the vertical line
segment {0} × [𝑐, 𝑑], is continuous.
Given 𝜀 > 0 with 𝜀 < 𝜋2 , the uniform continuity of 𝐹 gives us 𝛿 > 0 such that
the angle between 𝐹(𝑠, 𝑡) and 𝐹(̃ 𝑠, ̃
𝑡 ) has amplitude 𝜀 whenever ∣(𝑠, 𝑡) − (̃ 𝑠, ̃
𝑡 )∣ < 𝛿.
Thus, given (𝑠0 , 𝑡 0 ), and under the assumption that ∣(𝑠, 𝑡) − (𝑠0 , 𝑡 0 )∣ < 𝛿, we can write
where 𝑘(𝑠, 𝑡) is an integer and ∣𝜀(𝑠, 𝑡)∣ < 𝜀. If we show that 𝑘(𝑠, 𝑡) = 0, the
continuity of 𝜑 on (𝑠0 , 𝑡 0 ) follows. Now, since 𝜑∣{0}×[𝑐,𝑑] is continuous, we have
∣𝜑(0, 𝑡) − 𝜑(0, 𝑡 0 )∣ < 𝜀 < 𝜋2 . Furthermore, since the difference 𝜑(⋅, 𝑡) − 𝜑(⋅, 𝑡 0 ) is
continuous and takes neither of the values ± 𝜋2 , it takes values in the interval ]− 𝜋2 , 𝜋2 [;
we conclude that ∣𝜑(𝑠, 𝑡) − 𝜑(𝑠, 𝑡 0 )∣ < 𝜋2 . But as also ∣𝜑(𝑠, 𝑡 0 ) − 𝜑(𝑠0 , 𝑡 0 )∣ < 𝜋2 , it
follows that
𝜋 > ∣𝜑(𝑠, 𝑡) − 𝜑(𝑠0 , 𝑡 0 )∣ = ∣𝜀(𝑠, 𝑡) + 2𝜋𝑘(𝑠, 𝑡)∣ ≥ 2𝜋∣𝑘(𝑠, 𝑡)∣ − ∣𝜀(𝑠, 𝑡)∣,
𝛼(𝑠) − 𝛼(𝑡)
𝐹(𝑠, 𝑡) = ;
∣𝛼(𝑠) − 𝛼(𝑡)∣
in the other cases, we put 𝐹(𝑠, 𝑠) = 𝛼′ (𝑠) and 𝐹(𝑎, 𝑏) = −𝐹(𝑏, 𝑎) = 𝛼′ (𝑎). Denoting
by 𝜑(𝑠, 𝑡) a lifting of 𝐹(𝑠, 𝑡), the restriction of 𝜑 to the diagonal {(𝑠, 𝑠)∶ 𝑡𝑜 ≤ 𝑠 ≤ 𝑏}
is a lifting of 𝑠 ↦ 𝛼′ (𝑠), and therefore the rotation index is
1
ℜ(𝛼) = (𝜑(𝑏, 𝑏) − 𝜑(𝑎, 𝑎)).
2𝜋
Suppose that the initial point 𝛼(𝑎) is chosen so that the curve 𝛼 is all on the same
side of its tangent line at that point (such a choice is always possible: see for example
Section 1.6). Then the image of 𝐹(𝑠, 𝑎), for 𝑠 ∈ [𝑎, 𝑏], is contained in a semi-circle,
and therefore its lift 𝜑(𝑠, 𝑎) covers at most an interval of amplitude 𝜋. But since
𝐹(𝑎, 𝑎) = −𝐹(𝑏, 𝑎), we have
where 𝜀 = 1 if 𝛼 is positively oriented, and 𝜀 = −1 otherwise. Since 𝐹(𝑏, 𝑡), 𝑡 ∈ [𝑎, 𝑏],
runs exactly along the curve at S1 diametrically opposite to 𝐹(𝑠, 𝑎), we also have
136 4 The Intrinsic Geometry of Surfaces
Adding these two equalities, we obtain ℜ(𝛼) = 𝜀, which concludes the proof. ◻
Before we move to a generalization of Theorem A.3, we note that the above proof
works under the assumption that 𝛼 is only of class 𝐶 1 .
We will now show that A.3 remains valid for curves on surfaces, provided
such curves are contained in parametrized neighborhoods. Let 𝛼(𝑠), for 𝑠 ∈ [𝑎, 𝑏],
be a closed, simple, regular curve of class 𝐶 1 , contained in the image of the
parameterization Φ(𝑢, 𝑣) that we assume to be compatible with the orientation of the
1
surface. Let v1 = √ Φ𝑢 , and let 𝜃(𝑠) be a continuous choice of the oriented angle
𝐸
between v1 and 𝛼′ (𝑠). We claim that if 𝛼 is positively oriented, then 𝜃(𝑏) − 𝜃(𝑎) = 2𝜋.
The idea, of course, is to apply Theorem A.3 to the planar curve 𝛽 = Φ−1 ○ 𝛼.
If Φ were an isothermal parameterization, our statement would be an immediate
consequence of A.3, but it is unnecessary to invoke such a strong result as the
existence of isothermal parameterizations. Let 𝜑(𝑠) be a continuous choice of the
𝛽′ (𝑠)
angle between (1, 0) and 𝝉(𝑠) = ′ . For every instant 𝑠, and since Φ preserves
∣𝛽 (𝑠)∣
orientation and 𝐷Φ𝛽(𝑠) sends (1, 0) and 𝝉(𝑠) to vectors that are positive multiples of
v1 and 𝛼′ (𝑠), the dihedra ((1, 0), 𝝉(𝑠)) and (v1 , 𝛼′ (𝑠)) are both positively oriented
or both negatively oriented. Hence, 𝜑(𝑠) − 𝜃(𝑠) ≠ ±𝜋 for all 𝑠; choosing 𝜃(𝑎) and
𝜑(𝑎) with ∣𝜑(𝑎) − 𝜃(𝑎)∣ < 𝜋, we also have ∣𝜑(𝑏) − 𝜃(𝑏)∣ < 𝜋. It follows that
is 𝜀.
Let us denote by 𝜃 𝜀 (𝑠) a continuous choice of the angle between v1 and 𝛼′𝜀 (𝑠). It
is clear then that each of the differences
𝜃 𝜀 (𝑏 𝑖𝜀 ) − 𝜃 𝜀 (𝑎 𝑖𝜀 )
comes arbitrarily close to 𝜃 𝑖 (𝑠𝑖 ) − 𝜃 𝑖 (𝑠𝑖−1 ), taking 𝜀 sufficiently small, and in this
way also
𝜃 𝜀 (𝑎 𝑖+1
𝜀
) − 𝜃 𝜀 (𝑏 𝑖𝜀 ), for 𝑖 = 1, . . . , 𝑘 − 1,
and 𝜃 𝜀 (𝑎 1𝜀 ) − 𝜃 𝜀 (0) + 𝜃 𝜀 (𝐿 𝜀 ) − 𝜃 𝜀 (𝑏 𝑘𝜀 ), for 𝑖 = 𝑘,
lie close to 𝛾𝑖 for 𝑖 = 1, . . . , 𝑘. We thus conclude that
2𝜋 = 𝜃 𝜀 (𝐿 𝜀 ) − 𝜃 𝜀 (0)
𝑘
= {𝜃 𝜀 (𝐿 𝜀 ) − 𝜃 𝜀 (𝑏 𝑘𝜀 )} + ∑ {𝜃 𝜀 (𝑏 𝑖𝜀 ) − 𝜃 𝜀 (𝑎 𝑖𝜀 )} +
𝑖=1
𝑘−1
+ ∑ {𝜃 𝜀 (𝑎 𝑖+1
𝜀
) − 𝜃 𝜀 (𝑏 𝑖𝜀 )} + {𝜃 𝜀 (𝑎 1𝜀 ) − 𝜃 𝜀 (0)}
𝑖=1
is arbitrarily close to
𝑘 𝑘
∑ {𝜃 𝑖 (𝑠𝑖 ) − 𝜃 𝑖 (𝑠𝑖−1 )} + ∑ 𝛾𝑖 ,
𝑖=1 𝑖=1
Exercises
116. Lemma A.2 applies not only to rectangles, but also, obviously, to any regions that
are homeomorphic to rectangles. For example, any continuous function 𝐹∶ D2 → S1 ,
where D2 is the closed disk {𝑝 ∈ R 2 ∶ ∣𝑝∣ ≤ 1}, has a lifting. Using this fact, we next
give a proof of Brouwer’s fixed point theorem: any continuous function 𝑓 ∶ D2 → D2
has some fixed point (i.e., a point 𝑝 such that 𝑓 (𝑝) = 𝑝).
138 4 The Intrinsic Geometry of Surfaces
𝑓 (𝑝) − 𝑝
𝐹(𝑝) = ,
∣ 𝑓 (𝑝) − 𝑝∣
1
Δ(𝛼) = (𝜃(𝑏) − 𝜃(𝑎)),
2𝜋
where 𝜃 is a continuous choice of the oriented angle of w and 𝛼′ (𝑡).
(a) Compute Δ(𝛼) for the curve 𝑡 ↦ Φ(𝑡, 𝑛𝑡), where 𝑛 ∈ Z is a constant and
𝑡 ∈ [0, 2𝜋].
(b) Does the result of (a) depend on the vector field w?
Chapter 5
The Global Geometry of Surfaces
Global geometry deals with those results that concern the surface as a whole. In
Chapter 4 we have already seen examples of global theorems, such as the divergence
theorem (Corollary 4.4.3.i) and the Gauss-Bonnet theorem (Theorem 4.5.6). Another
example is the sphere theorem that we will present in this chapter: any compact surface
of constant curvature in R 3 is a sphere. As we already observed, the assumption
that the curvature is constant is insufficient, and a global condition (in this case, the
compactness of the surface) is needed to draw such a conclusion.
Compact surfaces are inextensible, in the following sense: if 𝑆1 and 𝑆2 are
connected surfaces such that 𝑆1 is compact and 𝑆1 ⊆ 𝑆2 then 𝑆1 = 𝑆2 . The global
results must naturally deal with inextensible surfaces. In Section 5.1 we will define
the notion of a complete surface, which is a sufficient but not necessary condition for
a surface to be inextensible.
This chapter includes a mixed bag of topics unusual in introductory texts of
Differential Geometry: for instance, a Blaschke formula for surfaces of constant width
(Theorem 5.5.3) and the description of all complete surfaces of constant non-negative
curvature (Theorems 5.7.7 and 5.7.10). To help the readers in their choice of topics,
we mention that Sections 5.1 and 5.2 form the main body of the chapter, from which
two independent branches emerge, one consisting of 5.4 and 5.5 and the other one of
5.3, 5.6 and 5.7.
All surfaces in this chapter are connected.
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024 139
P. V. Araújo, Differential Geometry, https://doi.org/10.1007/978-3-031-62384-4_5
140 5 The Global Geometry of Surfaces
it follows that (𝛾(𝑠 𝑛 ))𝑛≥1 is a Cauchy sequence in R 3 . Since 𝑆 is closed, the limit 𝑝
of this sequence is a point of 𝑆. We can then choose 𝜂 > 0 and a neighborhood 𝑈
of 𝑝 in 𝑆 such that, for all 𝑞 ∈ 𝑈, the radial geodesics starting from 𝑞 have length at
least 𝜂. We fix 𝑛 so that 𝑠 𝑛 > 𝑠0 − 𝜂2 and 𝛾(𝑠 𝑛 ) ∈ 𝑈, and let 𝑞 = 𝛾(𝑠 𝑛 ), w = 𝛾 ′ (𝑠 𝑛 ).
The geodesic 𝛾̃(𝑡) = 𝛾(𝑡; 𝑞, w) is defined for 𝑡 ∈ ]−𝜂, 𝜂[, and 𝛾(𝑠) = 𝛾̃(𝑠 − 𝑠 𝑛 ). Thus
𝛾(𝑠) is extensible to the interval [0, 𝑠 𝑛 + 𝜂[ ⊇ [0, 𝑠0 + 𝜂2 [. ◻
In particular, it follows from the above proposition that all compact surfaces are
complete. However, we observe that there are complete surfaces other than those
closed in R 3 :
Example 5.1.2 Let 𝑆 be the surface given by
Figure 5.1
5.1 Complete Surfaces 141
The intrinsic distance 𝑑 in 𝑆 and the distance ∣ ∣ in R 3 are equivalent, in the sense
that a sequence (𝑝 𝑛 ) in 𝑆 converges to 𝑝 ∈ 𝑆 for one of these distances if and only if
it converges for the other one. However, this example shows that they do not have to
be uniformly equivalent: there can be a sequence of points on 𝑆 that is Cauchy for ∣ ∣
but is not so for 𝑑. ◻
The major advantage of complete surfaces is that there exist minimizing geodesics
between any two points on them. Before we prove this, we note that it follows from the
triangle inequality (see Section 4.6) that, for every 𝑝 ∈ 𝑆, the mapping 𝑞 ↦ 𝑑(𝑝, 𝑞)
is continuous (since ∣𝑑(𝑝, 𝑞) − 𝑑(𝑝, 𝑟)∣ ≤ 𝑑(𝑞, 𝑟) ∀ 𝑞, 𝑟 ∈ 𝑆) — and therefore its
restriction to any compact 𝑆 attains a maximum and a minimum.
𝛿
𝑙(𝛼) = 𝑙(𝛼∣[𝑎,𝑡 ] ) + 𝑙(𝛼∣[𝑡 ) ≥ 𝑑(𝑝, 𝛼(𝑡 0 )) + 𝑑(𝛼(𝑡0 ), 𝑞) ≥ + 𝑑(𝑟, 𝑞),
0 0 ,𝑏] 2
which implies 𝑙 ≥ 2𝛿 + 𝑑(𝑟, 𝑞). Together with (5.1), this gives 𝑑(𝑟, 𝑞) = 𝑙 − 2𝛿 . Let
𝛾(𝑠) be the radial geodesic, parametrized by arc length, such that 𝛾(0) = 𝑝 and
𝛾( 2𝛿 ) = 𝑟. Note that for 𝑠 ≥ 0 we have 𝑑(𝛾(𝑠), 𝑞) ≥ 𝑑(𝑝, 𝑞) − 𝑑(𝑝, 𝛾(𝑠)) ≥ 𝑙 − 𝑠.
Defining
𝐼 = {𝑠 ∈ [0, 𝑙]∶ 𝑑(𝛾(𝑠), 𝑞) = 𝑙 − 𝑠},
the preceding inequality shows that 𝑠 ∈ 𝐼 if and only if 𝑠 ≥ 0 and 𝑑(𝛾(𝑠), 𝑞) ≤ 𝑙 − 𝑠.
It follows that 𝐼 is a (necessarily closed) interval: in fact, if 𝑠 ∈ 𝐼, and 0 ≤ 𝑡 < 𝑠, then
𝑑(𝛾(𝑡), 𝑞) ≤ 𝑑(𝛾(𝑡), 𝛾(𝑠)) + 𝑑(𝛾(𝑠), 𝑞) ≤ (𝑠 − 𝑡) + (𝑙 − 𝑠) = 𝑙 − 𝑡, and therefore 𝑡 ∈ 𝐼.
We have already seen that [0, 2𝛿 ] ⊆ 𝐼; let us now show that 𝐼 = [0, 𝑙]. To this end, it
suffices to show that if 𝑠0 ∈ ]0, 𝑙[ is in 𝐼 then also 𝑠0 + 𝜂 ∈ 𝐼 for some 𝜂 > 0 — because
then necessarily sup 𝐼 = 𝑙. Let us fix a normal neighborhood 𝐷 2𝜂 (𝛾(𝑠0 )) of 𝛾(𝑠0 )
so that 𝑑(𝛾(𝑠0 ), 𝑞) > 𝜂, and let ̃ 𝑟 be a point of S1 (𝛾(𝑠0 ), 𝜂) at the shortest possible
distance from 𝑞: the above argument shows that
and therefore
142 5 The Global Geometry of Surfaces
The restriction 𝛾∣[𝑠 −𝜂,𝑠 ] , followed by the radial geodesic from 𝛾(𝑠0 ) to ̃
𝑟 , is a
0 0
piecewise regular curve from 𝛾(𝑠0 − 𝜂) to ̃ 𝑟 whose length is exactly 2𝜂. By (5.3) the
curve is minimizing and therefore a geodesic
q
p
g r~
g (s0 )
Figure 5.2
(Proposition 4.6.8), so that it coincides with 𝛾∣[𝑠 −𝜂,𝑠 +𝜂] . It follows that 𝛾(𝑠0 + 𝜂) = ̃
𝑟
0 0
and, by (5.2), 𝑠0 + 𝜂 ∈ 𝐼. We then have 𝐼 = [0, 𝑙]; in particular, 𝑑(𝛾(𝑙), 𝑞) = 0 and
therefore 𝛾(𝑙) = 𝑞. Thus, and since 𝑑(𝑝, 𝑞) = 𝑙 as well, 𝛾∣[0,𝑙] is a minimizing
geodesic from 𝑝 to 𝑞. ◻
If we read the above proof carefully, we obtain: if 𝑝 ∈ 𝑆 is such that all geodesics
starting from 𝑝 extend to all values of the parameter then, for every point 𝑞 of 𝑆,
there is some minimizing geodesic from 𝑝 to 𝑞. This observation lets us easily assert
that any complete surface 𝑆 is inextensible: if 𝑆 is contained in another surface 𝑆1
then 𝑆1 = 𝑆. In fact, 𝑆 is necessarily open in 𝑆1 , and therefore the geodesics of 𝑆 are
also geodesics of 𝑆1 . Let us fix 𝑝 ∈ 𝑆: the geodesics of 𝑆1 that start from 𝑝, because
they are geodesics of 𝑆, are defined for all values of the parameter. This means that,
given 𝑞 ∈ 𝑆1 , there exists some minimizing geodesic 𝛾 from 𝑝 to 𝑞. But 𝛾 ⊆ 𝑆 and
therefore 𝑞 ∈ 𝑆 — which shows that 𝑆1 ⊆ 𝑆.
√ 5.1.4 Besides the punctured plane, also the cone C given by the equation
Example
𝑧 = 𝑥 2 + 𝑦 2 (𝑧 > 0) is a non-complete surface, since the generating lines are
geodesics that are not defined for all parameter values. Nevertheless, between every
pair of points of C there exists a minimizing geodesic (Exercise 89 of Section 4.3) —
and C is inextensible. To prove the latter statement let us suppose, to the contrary,
that there exists a connected surface 𝑆 such that C ⊆ 𝑆 but C ≠ 𝑆. Let 𝑝 be a point on
the boundary of C in 𝑆: 𝑝 does not belong to C because C is open in 𝑆. Let (𝑝 𝑛 )𝑛≥1
be a sequence of points of C that converges to 𝑝.
Lemma Given 𝜀 > 0, there exists 𝑛0 such that 𝑝 𝑛 is below the plane 𝑧 = 𝜀 for all
𝑛 ≥ 𝑛0 .
5.1 Complete Surfaces 143
Exercises
119. (a) Show that 𝑆 is a complete surface if and only if (𝑆, 𝑑) (where 𝑑 is the
intrinsic distance on 𝑆) is a complete metric space. (Complete means that any Cauchy
sequence converges. Try to prove the stronger statement that any bounded sequence
has a convergent subsequence.)
(b) Show that if there exists 𝑝 ∈ 𝑆 such that any geodesic passing through 𝑝 is
defined in R , then 𝑆 is complete.
120. If 𝑆 is a non-compact, complete surface, and 𝑝 a point of 𝑆, then there exists a
geodesic 𝛾(𝑠) of 𝑆 such that 𝛾(0) = 𝑝 and that minimizes the distance between 𝑝
and 𝛾(𝑠) for all 𝑠 ∈ R .
121. Let 𝑆 be a complete surface, 𝑝 a point of 𝑆, and v ∈ 𝑇 𝑝 𝑆 a unit vector. Write
𝛾(𝑠) = exp 𝑝 (𝑠v), and suppose that there exists 𝑠 > 0 such that 𝛾∣[0,𝑠] does not
minimize the distance between 𝑝 and 𝛾(𝑠). Consider the set 𝐼 = {𝑠 ≥ 0∶ 𝑑(𝑝, 𝛾(𝑠)) =
𝑠}. Show that:
(a) 𝐼 is a closed interval [0, 𝑠0 ];
(b) for 𝑠 > 𝑠0 there exists a geodesic 𝛾̃ of length 𝑠 connecting 𝑝 to 𝛾(𝑠);
(c) for 0 < 𝑠 < 𝑠0 , 𝛾∣[0,𝑠] is the only minimizing geodesic connecting 𝑝 with 𝛾(𝑠);
(d) for 𝑠 = 𝑠0 two cases could apply:
● there is another minimizing geodesic connecting 𝑝 with 𝛾(𝑠0 );
● 𝑠0 v is not a regular point of exp 𝑝 .
122. Consider in T 2 , parametrized by
the points 𝑝 = Φ(0, 0), 𝑞 = Φ(𝑢 0 , 𝑣 0 ) and 𝑟 = Φ(𝑢 0 , 𝜋), where 0 < 𝑢 0 < 𝜋, for
0 < 𝑣 0 < 𝜋. Let 𝛾(𝑠) = Φ(𝑢(𝑠), 𝑣(𝑠)), for 𝑠 ∈ [0, 𝑎], be a minimizing geodesic from
𝑝 to 𝑞 with 𝑢(0) = 𝑣(0) = 0. Show that:
(a) 0 ≤ 𝑢(𝑠) ≤ 𝑢 0 , 0 ≤ 𝑣(𝑠) < 𝜋 for all 𝑠 ∈ [0, 𝑎];
(b) there are two and only two minimizing geodesics from 𝑝 to 𝑟 (use the Clairaut
Equation, Exercise 100, to conclude that there is only one such geodesic in the region
0 ≤ 𝑢 ≤ 𝑢 0 , 0 ≤ 𝑣 ≤ 𝜋);
(c) if 𝛾̃(𝑠) is a geodesic such that 𝛾̃(0) = 𝑝 and 𝛾̃(𝑏) = 𝑟, then 𝛾̃(2𝑏) = Φ(2𝑢 0 , 0);
(d) if 𝑢𝜋0 is rational then the geodesic 𝛾̃ is periodic, otherwise it is dense in T 2 ;
144 5 The Global Geometry of Surfaces
(f) there exist in T 2 geodesics that are neither periodic nor dense.
Use the Clairaut Equation to show that the sum of the interior angles of Δ is greater
than 𝜋. Can you draw the same conclusion from the Gauss-Bonnet theorem?
124. Let 𝑆 be a connected surface such that for every 𝑝 on 𝑆, there exists an isometry
𝜉 𝑝 ∶ 𝑆 → 𝑆 with 𝜉 𝑝 (𝑝) = 𝑝 and 𝐷(𝜉 𝑝 ) 𝑝 = − id. Show that:
√
125. Let C be the cone 𝑧 = 𝑥 2 + 𝑦 2 , 𝑧 > 0. Show that there is no mapping 𝑓 ∶ C → 𝑆
such that 𝑆 is a complete surface and 𝑓 an isometry onto the image (i.e., there is no
isometric embedding of C into a complete surface).
5.2 Coverings
The notion of a covering is one of the most fruitful in topology — but, to save time,
we will restrict ourselves to coverings of surfaces, though they prove useful in more
general topological spaces. The theory developed here will allow us to show that a
good number of surfaces are (globally) images of local diffeomorphisms of standard
surfaces like the plane or the sphere.
f
U
Figure 5.3
It follows from the definition that any covering is a locally surjective diffeomor-
phism, but it is worthy to note that not all locally surjective diffeomorphisms are
coverings. An example is the restriction of Φ to the square ]−2𝜋, 2𝜋[ × ]−2𝜋, 2𝜋[:
in Figure 5.4, the preimage of the marked open subset 𝑈 joins nine disjoint open
subsets, but only one of them is surjectively sent onto 𝑈 under Φ. ◻
Figure 5.4
146 5 The Global Geometry of Surfaces
Proof Let us first show that 𝑓 is surjective. Since 𝑓 is a local diffeomorphism, its
image 𝑓 (𝑆1 ) is open in 𝑆2 . Given a sequence ( 𝑓 (𝑝 𝑛 ))𝑛≥1 on 𝑓 (𝑆1 ) that converges
to 𝑞 ∈ 𝑆2 , if 𝑝 ∈ 𝑆1 is an accumulation point of (𝑝 𝑛 )𝑛≥1 (which exists because 𝑆1 is
compact), then 𝑓 (𝑝) = 𝑞. Thus 𝑓 (𝑆1 ) is also closed in 𝑆2 and, since 𝑆2 is connected,
necessarily 𝑓 (𝑆1 ) = 𝑆2 .
We now observe that each point 𝑞 of 𝑆2 has a finite number of preimages, otherwise
they would accumulate in the neighborhood of some point of 𝑆1 , in contradiction
to 𝑓 being a local diffeomorphism. Let 𝑝 1 , . . . , 𝑝 𝑘 be the preimages of 𝑞, and let us
choose open neighborhoods 𝑉𝑖 of 𝑝 𝑖 such that each 𝑓 ∣𝑉 is a diffeomorphism onto the
𝑖
image and 𝑉𝑖 ∩ 𝑉 𝑗 is empty if 𝑖 ≠ 𝑗.
𝑘
Claim: There is 𝜀 > 0 such that 𝑓 −1 ({𝑟 ∈ 𝑆2 ∶ ∣𝑟 − 𝑞∣ < 𝜀}) ⊆ ⋃ 𝑉𝑖 .
𝑖=1
We say that a differentiable mapping 𝑓 ∶ 𝑆1 → 𝑆2 lifts curves if, for every curve
𝛼∶ [𝑎, 𝑏] → 𝑆2 and 𝑝 ∈ 𝑆1 such that 𝑓 (𝑝) = 𝛼(𝑎), there exists a single curve
̃∶ [𝑎, 𝑏] → 𝑆1 such that 𝛼
𝛼 ̃(𝑡) = 𝛼(𝑡) for all 𝑡 ∈ [𝑎, 𝑏]. That is, for
̃(𝑎) = 𝑝 and 𝑓 ○ 𝛼
every preimage 𝑝 of the final point 𝛼(𝑎) of 𝛼, there exists a single curve in 𝑆1 that
starts from 𝑝 and whose image under 𝑓 is 𝛼. Note that any mapping that lifts curves
is necessarily surjective.
𝐻∶ [0, 1] × [𝑎, 𝑏] → 𝑆2
such that 𝐻(0, 𝑡) = 𝛼0 (𝑡), 𝐻(1, 𝑡) = 𝛼1 (𝑡) for all 𝑡 ∈ [𝑎, 𝑏], and 𝐻(𝑠, 𝑎) = 𝑞 0 ,
𝐻(𝑠, 𝑏) = 𝑞 1 for all 𝑠 ∈ [0, 1]. Letting 𝛼𝑠 = 𝐻(𝑠, ⋅), the family of curves (𝛼𝑠 )0≤𝑠≤1 is
called a continuous deformation of 𝛼0 with fixed endpoints. Note that if 𝛼0 is closed
then all 𝛼𝑠 curves are also closed.
Proof It suffices to show that, given 𝑠0 ∈ [0, 1], there exists 𝛿 > 0 such that if
∣𝑠 − 𝑠0 ∣ < 𝛿, for all 𝑠 ∈ [0, 1] then 𝛼̃𝑠 (𝑏) = 𝛼 ̃𝑠0 (𝑏). Let 𝑎 = 𝑡 0 < 𝑡1 < ⋯ < 𝑡𝑙 = 𝑏 be a
partition of [𝑎, 𝑏] such that 𝛼 ̃𝑖 , where for each 𝑖:
̃𝑠0 ([𝑡 𝑖−1 , 𝑡 𝑖 ]) ⊆ 𝐷
where the 𝑉 𝑗 are the connected components. Each 𝑉 𝑗 is open: in fact, if 𝑝 ∈ 𝑉 𝑗 and
𝑊 ⊆ 𝑓 −1 (𝑉) is a neighborhood of 𝑝 diffeomorphic to a disk, then 𝑊 ⊆ 𝑉 𝑗 .
Let us now verify that 𝑓 ∣𝑉 ∶ 𝑉 𝑗 → 𝑉 is bijective. Take 𝑝 ∈ 𝑉 𝑗 and let 𝑞 = 𝑓 (𝑝).
𝑗
Given 𝑟 ∈ 𝑉, let 𝛼∶ [𝑎, 𝑏] → 𝑉 be a curve with 𝛼(𝑎) = 𝑞 and 𝛼(𝑏) = 𝑟, and let 𝛼 ̃ be
̃(𝑏) is in the same component 𝑉 𝑗
the lifting of 𝛼 that starts from 𝑝. By definition, 𝛼
as 𝑝 and 𝑓 (̃𝛼(𝑏)) = 𝑟. Therefore 𝑓 ∣𝑉 is surjective. Regarding injectivity, assume
𝑗
that 𝑝̃, 𝑝̂ ∈ 𝑉 𝑗 have the same image 𝑞, and 𝛼̃0 ∶ [𝑎, 𝑏] → 𝑉 𝑗 is a curve from 𝑝̃ to 𝑝̂, and
𝛼0 = 𝑓 ○ 𝛼 ̃0 . This curve 𝛼0 is then closed. Since 𝑉 is diffeomorphic to a disk, there
exists a continuous deformation (𝛼𝑠 )0≤𝑠≤1 of 𝛼0 with fixed endpoints such that the
trace of each 𝛼𝑠 is in 𝑉 and 𝛼1 is the constant curve equal to 𝑞. Denoting by 𝛼 ̃𝑠 the
lifting of 𝛼𝑠 that starts from 𝑝̃, Lemma 5.2.5 guarantees that the curves 𝛼 ̃𝑠 all have
the same endpoints 𝑝̃ and 𝑝̂. But 𝛼 ̃1 is constant, and therefore 𝑝̃ = 𝛼̃1 (𝑎) = 𝛼
̃1 (𝑏) = 𝑝̂.
Thus 𝑓 ∣𝑉 is injective.
𝑗
In conclusion: each component 𝑉 𝑗 is diffeomorphically sent into 𝑉, and therefore
𝑉 is an evenly covered neighborhood of 𝑞 0 . This shows that 𝑓 is a covering and
concludes the proof of Proposition 5.2.4.
5.3 Complete Surfaces of Non-Positive Curvature 149
In this proof we used that, for every open subset 𝑈 diffeomorphic to a disk, every
closed curve in 𝑈 is homotopic (with fixed endpoints) to a constant curve by a
homotopy that only takes values in 𝑈. A set 𝑈 with such a property is called simply
connected. Examples of simply connected surfaces are the plane and the sphere (recall
that S2 minus one point is diffeomorphic to the plane via stereographic projection).
If 𝑆2 is simply connected and 𝑓 ∶ 𝑆1 → 𝑆2 a covering then 𝑓 −1 (𝑆2 ) has only one
connected component, which is all of 𝑆1 — and, by the proof of Proposition 5.2.4, 𝑓
is injective, hence a diffeomorphism. This proves our next result.
We now have gathered all necessary tools about coverings and it is good to start at
once to make interesting use of it.
In fact, such a surface is orientable (e.g. Example 61-b in Section 3.2), and the
hypothesis implies that the normal mapping 𝑁∶ 𝑆 → S2 is a local diffeomorphism,
hence (by Proposition 5.2.2 and the compactness of 𝑆) a covering, hence (by
Proposition 5.2.6 and S2 being simply connected) a diffeomorphism.
In Section 5.4 we will discuss in more depth the compact surfaces of positive
curvature in R 3 (called ovals) and show that ovals are convex (in the sense that they
bound convex regions of R 3 ).
Exercises
̃ 𝑡) = 𝛼
126. Show that in Lemma 5.2.5 the function defined by 𝐻(𝑠, ̃𝑠 (𝑡) is continuous.
127. Let 𝑆1 and 𝑆2 be two connected surfaces and 𝑓 ∶ 𝑆1 → 𝑆2 a covering. Further, let
ℑ( 𝑓 ) be the set of diffeomorphisms 𝑔∶ 𝑆1 → 𝑆1 such that 𝑓 ○ 𝑔 = 𝑔. Show that:
(a) ℑ( 𝑓 ) is a group with respect to composition of functions;
(b) if 𝑆1 is simply connected, then for every pair of points 𝑝 0 , 𝑝 1 in 𝑆1 such that
𝑓 (𝑝 0 ) = 𝑓 (𝑝 1 ), there exists a unique 𝑔 ∈ ℑ( 𝑓 ) such that 𝑔(𝑝 0 ) = 𝑝 1 ;
(c) if 𝑆1 is not simply connected, then the property in (b) may fail.
Both these inequalities hold, as does (5.5), up to the first 𝜌0 > 0 where 𝐺(𝜌0 , 𝜑) = 0.
But (5.6) ensures that such a 𝜌0 does not exist, and therefore the inequalities hold
for all 𝜌 ≥ 0. Since 𝐺(𝜌, 𝜑) is nonzero for all 𝜌 > 0, it follows that exp 𝑝 is a local
diffeomorphism. This proves one part of the following theorem:
Proof It suffices to show that for every 𝑟 ∈ 𝑇 𝑝 𝑆, and vector v, we have ∣𝐷(exp 𝑝 )𝑟 (v)∣ ≥
∣v∣, since then
𝑏 𝑏
𝑙(𝛽) = ∫ ∣𝛽′ (𝑡)∣ 𝑑𝑡 ≤ ∫ ∣𝐷(exp 𝑝 )𝛽(𝑡) (𝛽′ (𝑡))∣ 𝑑𝑡 = 𝑙(exp 𝑝 ○𝛽).
𝑎 𝑎
Since 𝐷(exp 𝑝 )𝑂 is the identity, we can assume that 𝑟 = 𝜌v𝝋 with 𝜌 > 0. The vectors
v𝝋 and w𝝋 form an orthonormal basis of 𝑇 𝑝 𝑆, and the vectors 𝐷(exp 𝑝 )𝑟 (v𝝋 ) and
𝐷(exp 𝑝 )𝑟 (w𝝋 ) are also orthogonal (𝐹 ≡ 0). Moreover,
5.3 Complete Surfaces of Non-Positive Curvature 151
∣𝐷(exp 𝑝 )𝑟 (𝜆v𝝋 + 𝜇w𝝋 )∣2 = 𝜆2 ∣𝐷(exp 𝑝 )𝑟 (v𝝋 )∣2 + 𝜇2 ∣𝐷(exp 𝑝 )𝑟 (w𝝋 )∣2
≥ 𝜆2 + 𝜇2 = ∣𝜆v𝝋 + 𝜇w𝝋 ∣2 .◻
∣̃
𝛼(𝑡) − 𝛼 𝛼∣[𝑠,𝑡] ) ≤ 𝑙(𝛼∣[𝑠,𝑡] )
̃(𝑠)∣ ≤ 𝑙(̃
— and therefore
lim ∣̃ ̃(𝑠)∣ = 0,
𝛼(𝑡) − 𝛼
𝑠,𝑡→̃
𝑡−
Exercises
128. Let 𝑆 be a complete surface, of non-positive curvature, and 𝑝 a point of 𝑆.
(a) Show that the equations of geodesics in geodesic polar coordinates are
1 𝐺𝜌 𝐺𝜑 2
𝜌− 𝐺 𝜌 𝜑2 = 0, 𝜑+ 𝜌𝜑 + 𝜑 = 0.
2 𝐺 2𝐺
(b) Let 𝛾(𝑠) be a geodesic of 𝑆 that does not pass through 𝑝, and denote by
𝛾̃∶ R → 𝑇 𝑝 𝑆 a lifting of 𝛾. Show that the function 𝜌(𝑠) = ∣̃
𝛾 (𝑠)∣ is convex (i.e., 𝜌 ≥ 0)
and has at most one local minimum.
(c) Assume now that 𝑆 is simply connected. Show that the trace of any geodesic
of 𝑆 is a closed set on 𝑆 and that, given a geodesic 𝛾(𝑠) that does not pass through 𝑝,
there exists a single point of 𝛾 at the minimal distance from 𝑝.
(d) Compare the result of (c) with the case of the sphere. Give an example of a
complete surface of non-positive curvature where not all geodesics are closed sets.
129. Let 𝑆 be a complete surface of constant curvature 𝐾 > 0. Using polar geodesic
𝜋
coordinates, show that any minimizing geodesic of 𝑆 has length ≤ √ . Conclude
𝐾
that 𝑆 is compact.
The oval surfaces (or simply ovals) are the compact surfaces in R 3 with positive
curvature at all points. Such surfaces are, as we saw in Section 5.2 (Theorem 5.2.7),
5.4 Ovals (First Part): The Rigidity of the Sphere 153
Proposition 5.4.2 The sets C and D are connected open, C is convex and bounded, 𝑆
is the boundary of both sets C and D, and R 3 ∖ 𝑆 = C ∪ D.
Proof Each of the sets D 𝑝 = {𝑞 ∈ R 3 ∶ ⟨𝑁(𝑝), 𝑞 − 𝑝⟩ < 0}, for 𝑝 ∈ 𝑆, is open, and D
is the union of them all. Therefore D is open, and it is easily seen to be connected.
The set C is the intersection of the convex sets C 𝑝 = {𝑞 ∈ R 3 ∶ ⟨𝑁(𝑝), 𝑞 − 𝑝⟩ > 0}
(𝑝 ∈ 𝑆), and is therefore convex; and any convex set is connected.
Given 𝑞 ∈ C, let us consider 𝛿 = min ⟨𝑁(𝑝), 𝑞 − 𝑝⟩ > 0. If ∣𝑟 − 𝑞∣ < 𝛿 then we have,
𝑝∈𝑆
for every 𝑝 ∈ 𝑆,
— which shows that 𝑝 𝑡 ∈ C for 𝑡 ∈ ]0, 𝜀[. Thus 𝑝 0 ∈ 𝜕C, and therefore 𝑆 ⊆ 𝜕C.
We now show that R 3 ∖ (C ∪ D) ⊆ 𝑆. Indeed, if 𝑞 ∈ R 3 ∖ (C ∪ D) then there exists
some 𝑝 0 ∈ 𝑆 such that ⟨𝑁(𝑝 0 ), 𝑞 − 𝑝 0 ⟩ = 0. If 𝑞 ≠ 𝑝 0 then we can consider a curve
𝛼∶ ]− 𝜂, 𝜂[ → 𝑆 such that 𝛼(0) = 𝑝 0 and 𝛼′ (0) = 𝐷𝑁 −1 𝑝0 (𝑞 − 𝑝 0 ), where
The next result is used immediately afterwards in the proof of the rigidity of the
sphere, and in the proof of a Blaschke formula in the next section (Theorem 5.5.3).
Both formulas (5.7) and (5.8) are valid for every compact surface 𝑆 ⊆ R 3 , and
indeed we prove (5.7) [but not (5.8)] in this generality. We will make use of the results
of Section 4.4, in particular of Theorem 4.4.5 (first variation of area).
Let us consider the vector field on 𝑆 given by v(𝑝) = 𝑝 − 𝑝 0 , and let 𝑆𝑡 be the
variation of 𝑆 induced by v. Since 𝑆𝑡 is the image of 𝑆 under the homothety with
center 𝑝 0 and ratio 1 + 𝑡, its area is 𝐴(𝑡) = (1 + 𝑡) 𝐴, which implies 𝐴′ (0) = 2𝐴.
2
1
𝐻 𝑡 ∣Φ𝑢𝑡 × Φ𝑡𝑣 ∣ = − ⟨Φ𝑢𝑡 × 𝑁 𝑣𝑡 + 𝑁𝑢𝑡 × Φ𝑡𝑣 , 𝑁 𝑡 ⟩. (5.9)
2
We now look at the derivative of the second term in (5.9). We have
𝑑
⟨Φ𝑡 × 𝑁 𝑣𝑡 + 𝑁𝑢𝑡 × Φ𝑡𝑣 , 𝑁 𝑡 ⟩∣𝑡=0
𝑑𝑡 𝑢
𝑑 𝑑
=⟨Φ𝑢 × 𝑁 𝑣 + 𝑁𝑢 × Φ𝑣 , 𝑁 𝑡 ∣𝑡=0 ⟩ + ⟨ {Φ𝑢𝑡 × 𝑁 𝑣𝑡 + 𝑁𝑢𝑡 × Φ𝑡𝑣 }∣𝑡=0 , 𝑁⟩
𝑑𝑡 𝑑𝑡
𝜕v 𝜕v 𝑑 𝑑
=⟨ × 𝑁 𝑣 + 𝑁𝑢 × , 𝑁⟩ + ⟨Φ𝑢 × ( 𝑁 𝑣𝑡 ∣𝑡=0 ) + ( 𝑁𝑢𝑡 ∣𝑡=0 ) × Φ𝑣 , 𝑁⟩ (5.10)
𝜕𝑢 𝜕𝑣 𝑑𝑡 𝑑𝑡
𝜕w 𝜕w
⟨Φ𝑢 × + × Φ𝑣 , 𝑁⟩ = (Div w) ∣Φ𝑢 × Φ𝑣 ∣. (5.11)
𝜕𝑣 𝜕𝑢
Regarding the left-hand side in (5.9), we note that 𝑁∶ 𝑆 → S2 induces in S2 the orien-
tation for which the mean curvature is negative, equal to −1. Using Proposition 4.4.1
we obtain
where Div∗ (v⊺ ) indicates the divergence of v⊺ as the tangent vector field to S2
— which is fine since 𝑁 is a diffeomorphism and the tangent spaces 𝑇 𝑝 𝑆 and
𝑇𝑁 ( 𝑝) S2 are parallel. Now the total mean curvature of Φ𝑡 (𝑈) ⊆ 𝑆𝑡 is 𝑀Φ (𝑡) =
∫𝑈 𝐻 ∣Φ𝑢 × Φ𝑣 ∣ 𝑑𝑢 𝑑𝑣, so that
𝑡 𝑡 𝑡
𝑀Φ′ (0) = ∫
𝑑
{𝐻 𝑡 ∣Φ𝑢𝑡 × Φ𝑡𝑣 ∣}∣𝑡=0 𝑑𝑢 𝑑𝑣. (5.13)
𝑈 𝑑𝑡
Combining formulas (5.9)-(5.13) and using Corollary 4.4.3, we finally obtain
1 1
𝑀 ′ (0) = − ∫ Div w 𝑑𝜎 − ∫ Div∗ (v⊺ ) 𝑑𝜎 − ∫ 𝐾⟨v, 𝑁⟩ 𝑑𝜎
2 𝑆 2 S2 𝑆
= − ∫ 𝐾⟨v, 𝑁⟩ 𝑑𝜎. ◻
𝑆
Proof Being compact, 𝑆 has some point of positive curvature (see Exercise 57 in
Section 3.1), and therefore 𝐾 > 0. 𝑆 is then an oval, and we can fix the orientation
𝑁∶ 𝑆 → S2 such that ⟨𝑁(𝑝), 𝑞 − 𝑝⟩ > 0 for every pair of points 𝑝 ≠ 𝑞 on 𝑆. With this
orientation we have√ 𝐻(𝑝) > 0 at every point 𝑝 ∈ 𝑆.
Let us put 𝑘 = 𝐾. By the inequality between the arithmetic and geometric means,
we have 𝑘 𝐻(𝑝) − 𝐾 ≥ 0, and the equality holds at 𝑝 if and only if 𝑝 is an umbilical
point. Let 𝑝 0 ∈ R 3 (whose existence is guaranteed by Proposition 5.4.2) be such that
⟨𝑁(𝑝), 𝑝 0 − 𝑝⟩ > 0 for all 𝑝 ∈ 𝑆, and let us put
Δ0 = ∫ (𝑘 𝐻 − 𝐾) 𝑑𝜎 = 𝑘 𝑀 − 𝑘 2 𝐴,
𝑆
Δ1 = ∫ (𝑘 𝐻 − 𝐾)⟨𝑁, 𝑝 0 − 𝑝⟩ 𝑑𝜎
𝑆
= 𝑘 ∫ 𝐻⟨𝑁, 𝑝 0 − 𝑝⟩ 𝑑𝜎 − ∫ 𝐾⟨𝑁, 𝑝 0 − 𝑝⟩ 𝑑𝜎
𝑆 𝑆
= 𝑘𝐴 − 𝑀 (by Minkowski’s formulas).
We thus have Δ0 = −𝑘Δ1 , but their definitions ensure that Δ0 and Δ1 are both non-
negative. Hence, we have Δ0 = Δ1 = 0, and therefore the integrand function 𝑘 𝐻 − 𝐾
is constantly zero. All points of 𝑆 are therefore umbilical and, by Proposition 3.1.5,
𝑆 is a sphere. ◻
In particular, all surfaces in R 3 that are isometric to the sphere are themselves
spheres, and hence the theorem speaks of the rigidity of the sphere. The theorem has
the following generalization (see [13]): if 𝑓 ∶ 𝑆1 → 𝑆2 is an isometry between two oval
surfaces then 𝑓 is the restriction of an isometry of R 3 (which, as is known, is the
composite of a linear isometry with a translation).
We now give another result of the same kind as Theorem 5.4.5.
5.5 Ovals: Areas and Volumes; Surfaces of Constant Width 157
We choose the orientation 𝑁 and the point 𝑝 0 as above, and notice that 𝐻 2 − 𝐾 ≥ 0,
with equality only at the umbilical points. Then
= 𝐻 𝐴 − ∫ 𝐻 𝑑𝜎 = 0
𝑆
— and from this, since the integrand function is non-negative, it follows that 𝐻 2 −𝐾 ≡ 0,
and therefore 𝑆 is a sphere.
Let us point out that Theorem 5.4.6 remains valid without the assumption that 𝑆
has positive curvature: any compact surface 𝑆 ⊆ R 3 with constant mean curvature is
a sphere. An accessible proof of this result appears in Osserman’s paper [20].
Exercise
130. Show that the open subset C bounded by the oval surface 𝑆 (Proposition 5.4.2) is
given by {(1 − 𝑡)𝑝 + 𝑡𝑞∶ 𝑝, 𝑞 ∈ 𝑆, 0 < 𝑡 < 1}.
In this section we prove a number of formulas involving the area, the total mean
curvature, and the volume bounded by an oval surface. Some of these formulas
concern convex bodies in R 3 — which include not only solids bounded by oval
surfaces but also by convex polyhedra. As we noted in Section 1.1, the length of a
regular curve 𝛾 can be computed by considering polygonal lines inscribed in 𝛾 with
increasing numbers of segments. The generalization to surfaces is not so obvious:
there are approximations of the cylindrical surface C = {𝑥 2 + 𝑦 2 = 1, 0 ≤ 𝑧 ≤ 1} by
polyhedra whose vertices are all in C but whose areas do not converge to that of C
(see Exercise 131).
The solution, for a compact convex surface 𝑆, is to approximate 𝑆 by convex
polyhedra: if P1 and P2 are convex polyhedra such that P1 ≺ 𝑆 ≺ P2 , where the sign
≺ means “is in”, then we have the expected inequality of areas
Knowing that we can find P1 and P2 such that P1 ≺ 𝑆 ≺ P2 and with 𝐴(P2 ) − 𝐴(P1 )
as small as we want, it follows from (5.14) that
Formula (5.15) suggests that for certain formulas involving areas of convex surfaces,
it is sufficient to prove them for polyhedra; moreover, it provides a definition of area
that does not depend on whether the surface in question is regular or not.
The interested reader can find the details of this construction in several books on
convexity (e.g., [9]). For now, we will make use of (5.15) to prove a Cauchy formula.
Let 𝑆 be a compact convex surface. Given 𝑝 ∈ S2 , let us denote by 𝐴𝑆 (𝑝) the area of
the orthogonal projection of 𝑆 onto 𝑇 𝑝 S2 .
Theorem 5.5.1 (Cauchy)
The area of 𝑆 is given by
1
𝐴= ∫ 𝐴𝑆 (𝑝) 𝑑𝜎. (5.16)
𝜋 S2
Proof Let 𝐹 be a polygon included in a plane Π, and let v be a unit vector orthogonal
to Π. Let 𝜃 ∈ [0, 𝜋] be the angle between v and Op (𝑝 ∈ S2 ) ; the area of the
orthogonal projection of 𝐹 onto 𝑇 𝑝 S2 is 𝐴𝐹 (𝑝) = 𝐴(𝐹)∣ cos 𝜃∣. Using spherical
coordinates (𝜑, 𝜃) relative to an orthonormal frame in which the third vector is v, we
then have
2𝜋 𝜋
∫ 𝐴𝐹 (𝑝) 𝑑𝜎 = ∫ (∫ 𝐴(𝐹)∣ cos 𝜃∣ sin 𝜃 𝑑𝜃) 𝑑𝜑
S2 0 0
𝜋
= 2𝜋 𝐴(𝐹) ∫ ∣ cos 𝜃∣ sin 𝜃 𝑑𝜃 = 2𝜋 𝐴(𝐹).
0
Thus
1
𝐴(𝐹) = ∫ 𝐴𝐹 (𝑝) 𝑑𝜎. (5.17)
2𝜋 S2
Now let P be a convex polyhedron and 𝐹1 , . . . , 𝐹𝑘 its faces. Except for the points on
the boundary (which are negligible), each point of the orthogonal projection of P
in the plane 𝑇 𝑝 S2 is the image of exactly two points of P, belonging to two distinct
faces. We then have
1 𝑘
𝐴P (𝑝) = ∑ 𝐴𝐹𝑖 (𝑝);
2 𝑖=1
and, using (5.17), we obtain
𝑘
1 1 𝑘
𝐴(P) = ∑ 𝐴(𝐹𝑖 ) = ∫ 2 ( ∑ 𝐴𝐹𝑖 (𝑝)) 𝑑𝜎
𝜋 S 2 𝑖=1
𝑖=1 (5.18)
1
= ∫ 𝐴P (𝑝) 𝑑𝜎.
𝜋 S2
Noting that if P1 ≺ 𝑆 ≺ P2 then 𝐴P1 (𝑝) ≤ 𝐴𝑆 (𝑝) ≤ 𝐴P2 (𝑝) for every 𝑝 ∈ S2 , it
follows from (5.18) that
1
𝐴(P1 ) ≤ ∫ 𝐴𝑆 (𝑝) 𝑑𝜎 ≤ 𝐴(P2 ),
𝜋 S2
5.5 Ovals: Areas and Volumes; Surfaces of Constant Width 159
As in the case of curves (see Section 1.6), we say that an oval surface 𝑆 has
constant curvature 𝔏 if, for every pair of parallel planes tangent to 𝑆 at two distinct
points, the distance between them is equal to 𝔏.
Corollary 5.5.2 If 𝑆 has constant curvature 𝔏 and area 𝐴 then
𝐴 ≤ 𝜋 𝔏2 ,
Sketch of the proof. The orthogonal projection of 𝑆 onto the plane 𝑇 𝑝 S2 , which we
denote by 𝑆(𝑝), is bounded by a curve of constant width 𝔏, having (by Theorem 1.6.3)
perimeter 𝜋𝔏. By the isoperimetric inequality (Theorem 1.8.2), the area of 𝑆(𝑝)
satisfies the inequality
𝜋𝔏2
𝐴𝑆 (𝑝) ≤ (5.19)
4
— with equality only in the case where 𝑆(𝑝) is a disk with radius 𝔏/2. Combining
(5.19) and (5.16), we obtain
1 𝜋𝔏2
𝐴≤ ∫ 𝑑𝜎 = 𝜋𝔏2 ,
𝜋 S2 4
with equality if and only if every orthogonal projection 𝑆(𝑝) of 𝑆 is a disk with radius
𝔏/2 — and this last condition implies that 𝑆 is a sphere (see Exercise 132). ◻
Let us further consider an oval surface 𝑆, and on it the field of normal vectors 𝑁
pointing into 𝑆. The variation of 𝑆 induced by −𝑁 is, as we have already defined,
the family 𝑆𝑡 = {𝑝 − 𝑡𝑁(𝑝)∶ 𝑝 ∈ 𝑆}. But in this case, the surfaces 𝑆𝑡 are parallel to
𝑆, either surrounding it (for 𝑡 > 0) or being surrounded by it (for 𝑡 < 0) at a fixed
distance equal to ∣𝑡∣. Furthermore 𝑆𝑡 is a surface for all 𝑡 ≥ 0 (Exercise 135), though
not for all 𝑡 < 0.
Given a parameterization Φ(𝑢, 𝑣) of 𝑆, let us put
Now we have
For 𝑡 > 0 (and for negative 𝑡 near 0), the expression 1 + 2𝑡𝐻 + 𝑡 2 𝐾 is positive. As a
consequence, and taking absolute values on both sides of (5.20), we conclude that
the area of 𝑆𝑡 is given by
where 𝐴 and 𝑀 are the area and the total mean curvature of 𝑆 (where we recall
that as 𝑆 is diffeomorphic to the sphere its total curvature ∫𝑆 𝐾 𝑑𝜎 is 4𝜋). Formula
(5.21) thus expresses the remarkable fact that the area of a surface parallel to 𝑆 is a
polynomial function of its distance to 𝑆.
𝜕Ψ 𝜕Ψ 𝜕Ψ
∣ det 𝐽Ψ∣ = ∣⟨ × , ⟩∣ = ∣⟨Φ𝑢𝑠 × Φ𝑠𝑣 , 𝑁⟩∣ = ∣Φ𝑢𝑠 × Φ𝑠𝑣 ∣,
𝜕𝑢 𝜕𝑣 𝜕𝑠
where 𝐴(Φ𝑠 ) denotes the area of Φ𝑠 (𝑈). Using (5.22) and (5.21), we finally obtain
𝑡 4𝜋 3
𝑉(𝑡) = 𝑉 + ∫ 𝐴(𝑠) 𝑑𝑠 = 𝑉 + 𝐴𝑡 + 𝑀𝑡 2 + 𝑡 , (5.23)
0 3
and here it is, the promised formula, which is also valid for negative 𝑡 near 0. There
is a formula analogous to (5.23) for the area of planar regions bounded by parallel
curves as well, which we will give in Exercise 133.
To conclude the section, we again assume that 𝑆 has constant width 𝔏. As in the
case of curves, the antipode A(𝑝) of 𝑝 ∈ 𝑆 is given by
A(𝑝) = 𝑝 + 𝔏𝑁(𝑝),
— a formula that ensures that the expression 1 − 2𝐻𝔏 + 𝐾𝔏2 never vanishes. In terms
of the principal curvatures 𝑘 1 , 𝑘 2 , we have
since both factors are non-negative. It then follows from (5.24) that for every
continuous function 𝑓 ∶ 𝑆 → R we have
The formula (5.25) has interesting consequences. For example, taking 𝑓 ≡ 1, and
using the fact that the total curvature of 𝑆 is 4𝜋, we obtain
— that is, the total mean curvature of a surface of constant width 𝔏 is 𝑀 = 2𝜋𝔏.
For another application of (5.25), we will prove a Blaschke formula that relates
the area 𝐴 of a surface of constant width to the volume 𝑉 of the region bounded by it.
Theorem 5.5.3 (Blaschke)
If 𝑆 has constant width 𝔏 then 𝑉 = 1
2 𝐴𝔏 − 13 𝜋𝔏3 .
Proof Let 𝑝 0 be a point of the region Ω bounded by 𝑆. The volume of Ω is given by
1
𝑉= ∫ ⟨𝑁(𝑝), 𝑝 0 − 𝑝⟩ 𝑑𝜎(𝑝)
3 𝑆
(this formula corresponds to regarding Ω as a union of “infinitesimal cones” of vertex
𝑝 0 and base in 𝑆, and is a particular case of the divergence theorem in R 3 — see e.g.
[16], p. 493). By (5.25), denoting by 𝑝̃ the antipode of 𝑝, we also have
1
𝑉= ∫ ⟨𝑁( 𝑝̃), 𝑝 0 − 𝑝̃⟩(1 − 2𝐻𝔏 + 𝐾𝔏 ) 𝑑𝜎(𝑝).
2
3 𝑆
Since 𝑁( 𝑝̃) = −𝑁(𝑝) and 𝑝̃ = 𝑝 + 𝔏𝑁(𝑝), we obtain from this, using Minkowski’s
formulas (Theorem 5.4.3) and the fact that 𝑀 = 2𝜋𝔏,
1
𝑉= ∫ (−⟨𝑁(𝑝), 𝑝 0 − 𝑝⟩ + 𝔏)(1 − 2𝐻𝔏 + 𝐾𝔏 ) 𝑑𝜎(𝑝)
2
3 𝑆
1 2 1
= −𝑉 + 𝐴𝔏 + (∫ 𝐻⟨𝑝 0 − 𝑝, 𝑁⟩ 𝑑𝜎) 𝔏− (∫ 𝐾⟨𝑝 0 − 𝑝, 𝑁⟩ 𝑑𝜎) 𝔏2
3 3 𝑠 3 𝑆
1 2 1 2
= −𝑉 + 𝐴𝔏 + 𝐴𝔏 − 𝑀𝔏2 = −𝑉 + 𝐴𝔏 − 𝜋𝔏3 ,
3 3 3 3
which concludes the proof of the theorem. ◻
Theorem 5.5.3 shows that among surfaces with a certain constant width 𝔏 those
with the largest (or smallest) area are also those that enclose the largest (or smallest)
volume. From Corollary 5.5.2 it then follows that the maximal volume is that of the
sphere. The problem of finding the surface of a given constant width with minimal
area (or minimal volume) is still open (see [7]).
Exercises
162 5 The Global Geometry of Surfaces
(b) Compute lim 𝐴(𝑛𝑟 , 𝑛) for 𝑟 = 1, 2, 3. What conclusion can you draw?
𝑛→+∞
Figure 5.5
132. Let 𝑆 be an oval surface and 𝑟 > 0 such that the orthogonal projection of 𝑆 onto
each plane P ⊆ R 3 is a disk with radius 𝑟. Show that:
(a) for every plane P, the surface 𝑆 is inscribed in a straight circular cylinder C of
height 2𝑟 whose base has radius 𝑟 and is parallel to P;
(b) if 𝛾 is the equator of C then 𝛾 ⊆ 𝑆 and the normals to C and to 𝑆 along 𝛾
coincide;
(c) each normal section of 𝑆 is a circle with radius 𝑟, and therefore 𝑆 is a sphere.
133. Let 𝛼 be a convex, regular, closed planar curve. Denote by 𝛼𝑡 the curve parallel
to 𝛼 at distance 𝑡 from 𝛼, and by 𝔘(𝑡) the area of the region bounded by 𝛼𝑡 . Show
that
𝔘(𝑡) = 𝔘 + 𝑙(𝛼)𝑡 + 𝜋𝑡 2 [where 𝔘 = 𝔘(0)].
Hint: verify that 𝑙(𝛼𝑡 ) = 𝑙(𝛼) + 2𝜋𝑡 and 𝔘(𝑡) = 𝔘 + ∫0 𝑙(𝛼𝑠 ) 𝑑𝑠.
𝑡
5.6 Abstract Surfaces. The Hyperbolic Plane 163
Note: since any convex curve can be approximated by a regular convex curve, the
obtained formula is valid for any convex curve.
134. Let 𝑆 be an oval surface, and denote by 𝑙 𝑆 (𝑝) the perimeter of the orthogonal
projection of 𝑆 onto 𝑇 𝑝 S2 . Show that the total mean curvature of 𝑆 is given by
1
𝑀= ∫ 𝑙 𝑆 (𝑝) 𝑑𝜎.
2𝜋 S2
Hint: use Exercise 133 and Theorem 5.5.1 to obtain an expression for the area 𝐴(𝑡)
of the surface 𝑆𝑡 parallel to 𝑆, and compare (5.21) with the formula obtained.
135. Let 𝑆 be an oval surface and 𝑁 be the normal field pointing into 𝑆. Show that,
for all 𝑡 ≥ 0, the surface 𝑆𝑡 = {𝑝 − 𝑡𝑁(𝑝)∶ 𝑝 ∈ 𝑆} is diffeomorphic to 𝑆.
All measures intrinsic to a surface depend, as we know, on the inner product defined
on the tangent space (i.e., the first fundamental form) — which, so far, has simply been
the usual scalar product restriction on R 3 . But nothing prevents us from considering
other inner products, obtaining in this way surfaces whose metric structure is not
inherited from R 3 . The next step is to dispense with the ambient space, defining
abstract surfaces that are not even diffeomorphic to surfaces of R 3 .
Let 𝑈 ⊆ R 2 be an open and connected region, and let 𝐸, 𝐹, 𝐺∶ 𝑈 → R be
∞
𝐶 -functions such that
● 𝐸 and 𝐺 are strictly positive;
● 𝐸𝐺 − 𝐹 2 > 0 at all points of 𝑈.
For every point (𝑢, 𝑣) ∈ 𝑈 we define an inner product ⟨⋅, ⋅⟩(𝑢,𝑣) on 𝑇(𝑢,𝑣)𝑈 = R 2 as
follows:
If 𝛼∶ [𝑎, 𝑏] → 𝑈 is a regular curve then, in this inner product, the norm of 𝛼′ (𝑡) is
given by
√ √
2 2
𝐼 𝛼(𝑡) (𝛼′ (𝑡)) = 𝑢 ′ (𝑡) 𝐸(𝛼(𝑡)) + 2𝑢 ′ (𝑡)𝑣 ′ (𝑡)𝐹(𝛼(𝑡)) + 𝑣 ′ (𝑡) 𝐺(𝛼(𝑡)),
The inner product thus imposed on 𝑈 determines therefore all intrinsic measures:
lengths of curves, angles between vectors, areas of regions. We say that 𝑈 has been
equipped with a Riemannian metric. We can define, for such a metric, the notions of
Gaussian curvature, covariant derivative, geodesic — and, under the assumption that
𝑈 is oriented, that of geodesic curvature. To this end, we make use of the formulas in
Chapter 4 that express such notions using 𝐸, 𝐹, 𝐺, Christoffel’s symbols, and their
derivatives [such as (4.8), (4.9), (4.31)].
cos(𝑢 + 𝑣)
𝐾(𝑢, 𝑣) = ⋅
{2 + cos(𝑢 + 𝑣)}
This is a conformal metric, in the sense that angles are measured as usual in R 2 ,
since 𝐹 ≡ 0 and 𝐸 = 𝐺. Another particularity is that with this metric R 2 has finite
area and is not a complete surface (the proof of this statement is Exercise 136). ◻
To avoid pathologies (see the appendix to volume I of [25] for a catalogue of them)
we further impose certain conditions on the topology of the surface 𝑆, namely: any
two distinct points of 𝑆 have disjoint open neighborhoods (𝑆 is Hausdorff ) and there
exists a subset of 𝑆 which is dense and countable (𝑆 is separable).
By Proposition 2.2.1, and since the mentioned topological requirements are
satisfied by any subset of R 𝑛 , any regular connected surface of R 3 is an abstract
surface. What we have done was to require the coordinate changes on 𝑆 to be
𝐶 ∞ -diffeomorphisms, which allows us to define all sorts of notions using local
coordinates. Note that a surface does not determine an atlas uniquely: we can add to
or take away from a given atlas a few mappings, and as long as the remaining ones
still cover 𝑆 and coordinate changes remain 𝐶 ∞ , we obtain several different atlases;
5.6 Abstract Surfaces. The Hyperbolic Plane 165
what matters is that they all define the same differentiable structure on 𝑆, and thus the
same abstract surface.
Given an abstract surface 𝑆, it is not easy, now that we have no ambient space, to
define a tangent space 𝑇 𝑝 𝑆. Our approach, instead of defining it, is to explain how to
work with it in local coordinates. So if we have a curve that in local coordinates is
written 𝛼(𝑡) = Φ(𝑢(𝑡), 𝑣(𝑡)), we would still like it to be
𝜕̃
𝑢 𝜕̃
𝑣
Φ𝑢 = Ψ𝑢̃ + Ψ̃𝑣 ,
𝜕𝑢 𝜕𝑢
𝜕̃
𝑢 𝜕̃
𝑣
Φ𝑣 = Ψ𝑢̃ + Ψ̃𝑣 ;
𝜕𝑣 𝜕𝑣
and, replacing in (5.26), we obtain
𝛼′ (𝑡) = (
𝜕̃𝑢 ′ 𝜕̃𝑢 ′ 𝑣 ′
𝜕̃ 𝑣 ′
𝜕̃
𝑢 (𝑡) + 𝑣 (𝑡)) Ψ𝑢̃ + ( 𝑢 (𝑡) + 𝑣 (𝑡)) Ψ̃𝑣
𝜕𝑢 𝜕𝑣 𝜕𝑢 𝜕𝑣
= 𝑢̃′ (𝑡)Ψ𝑢̃ + ̃
𝑣 ′ (𝑡)Ψ̃𝑣
— which shows that (5.26) is compatible with the change of coordinates. Hence for us
the tangent space to 𝑆 at the point Φ(𝑢, 𝑣) is simply the space generated by the two
independent vectors Φ𝑢 and Φ𝑣 . The transition matrix from the basis (Ψ𝑢̃, Ψ̃𝑣 ) to
(Φ𝑢 , Φ𝑣 ) is the Jacobian of Ψ−1 ○ Φ.
A Riemannian metric on an abstract surface 𝑆 is given by an inner product ⟨⋅, ⋅⟩ 𝑝
in the tangent space 𝑇 𝑝 𝑆 for every 𝑝 ∈ 𝑆. In local coordinates Φ(𝑢, 𝑣), the matrix
𝐸 𝐹
of this inner product with respect to the basis (Φ𝑢 , Φ𝑣 ) is [ ], and we usually
𝐹𝐺
̃ 𝐹,
require that 𝐸, 𝐹, 𝐺 are 𝐶 ∞ functions; the coefficients 𝐸, ̃ 𝐺 ̃ of other coordinates
Ψ(̃𝑢, ̃
𝑣 ) are obtained from these by the relation
𝐸̃ 𝐹̃
[ ̃ ̃] = {𝐽(Φ−1 ○ Ψ)}⊺ [ ] 𝐽(Φ−1 ○ Ψ),
𝐸 𝐹
𝐹𝐺 𝐹𝐺
𝑢, ̃
(𝑢, 𝑣) ∼ (̃ 𝑣)
if and only if both differences 𝑢 − 𝑢̃ and 𝑣 − ̃ 𝑣 are integers, and let [𝑢, 𝑣] denote
the equivalence class of the pair (𝑢, 𝑣). Let R 2 / ∼ be the set of equivalence classes
and Π∶ R 2 → R 2 / ∼ the quotient mapping, given by Π(𝑢, 𝑣) = [𝑢, 𝑣]. We define a
topology on R 2 / ∼ by 𝑈 ⊆ R 2 / ∼ being open if and only if Π −1 (𝑈) is an open subset
of R 2 . Thus R 2 / ∼ is a surface, since a restriction of Π to all squares of the form
]𝑢 0 , 𝑢 0 + 1[×]𝑣 0 , 𝑣 0 + 1[ constitutes an atlas of R 2 / ∼.
Figure 5.6
𝑢, ̃
(𝑢, 𝑣) ≃ (̃ 𝑣 ) ⇔ ∃𝑛 ∈ Z ∶ 𝑢̃ = 𝑢 + 𝑛 and ̃
𝑣 = (−1) 𝑣
𝑛
𝐸 ≡ 1, 𝐹 ≡ 0, 𝐺 ≡ sinh2 𝜌 (5.27)
5.6 Abstract Surfaces. The Hyperbolic Plane 167
— and we can take the coefficients of this metric in Cartesian coordinates. However,
this is not the best model of D, because the metric defined this way is not conformal.
Let 𝑈 be the open disk with radius 𝜆 (0 < 𝜆 ≤ +∞) centered at the origin, and
𝑓 ∶ [0, 𝜆[ → [0, +∞[ a differentiable, strictly monotone function such that 𝑓 (0) = 0,
and consider the mapping Ψ∶ 𝑈 → R 2 which transforms the polar coordinate point
(𝜌, 𝜑) into the coordinate point ( 𝑓 (𝜌), 𝜑). We want to find 𝑓 so that the Riemannian
metric induced by Ψ on 𝑈, from the metric defined by (5.27) on R 2 , is conformal. If
𝑝 has coordinates (𝜌, 𝜑) then, letting v𝝋 = (cos 𝜑, sin 𝜑) and w𝝋 = (−sin 𝜑, cos 𝜑),
we have
𝑓 (𝜌)
𝐷Ψ 𝑝 (v𝝋 ) = 𝑓 ′ (𝜌)v𝝋 , 𝐷Ψ 𝑝 (w𝝋 ) = w𝝋 (5.28)
𝜌
— by which we obtain
⟨v𝝋 , v𝝋 ) 𝑝 = [ 𝑓 ′ (𝜌)]2 ,
⟨v𝝋 , w𝝋 ) 𝑝 = 0,
(5.29)
𝐺( 𝑓 (𝜌), 𝜑) sinh2 ( 𝑓 (𝜌))
⟨w𝝋 , w𝝋 ) 𝑝 = = ⋅
𝜌2 𝜌2
For the metric (5.29) to be conformal, necessarily
sinh( 𝑓 (𝜌))
𝑓 ′ (𝜌) = ⋅
𝜌
The general solution of this equation is 𝑓 (𝜌) = 2tgh−1 (𝑐𝜌), where 𝑐 is a real constant,
and is defined for 𝜌 ∈ [0, ∣𝑐∣
1
[. We take 𝑐 = 1, so that 𝜆 = 1 and 𝑈 is the unit disk.
In this case 𝑓 ′ (𝜌) = 2
1−𝜌2
, and therefore (5.29) defines in Cartesian coordinates the
metric
4
𝑑𝑠2 = (𝑑𝑢 2 + 𝑑𝑣 2 ). (5.30)
{1 − (𝑢 2 + 𝑣 2 )}2
From now on, let D denote the disk 𝑢 2 + 𝑣 2 < 1 (or the disk ∣𝑧∣ < 1 in the complex
plane C) equipped with the Riemannian metric (5.30). D is usually called Poincaré’s
disk. We next give some of its properties:
(1) D has constant curvature equal to −1. We verify this using formula (4.31) in
Section 4.5. To obtain a surface of constant curvature 𝐾 < 0, we would take, on the
same disk, the metric
4
𝑑𝑠2 = 2
(𝑑𝑢 2 + 𝑑𝑣 2 ).
∣𝐾∣(1 − (𝑢 2 + 𝑣 2 ))
(2) The diameters of D are geodesic. This can be seen by checking that its geodesic
curvature is zero, but we prefer to show that the diameters are minimizing curves.
Indeed, if 𝛼(𝑡) = (𝑢(𝑡), 𝑣(𝑡)) is a curve from 𝛼(0) = (0, 0) to 𝛼(1) = (𝑢 1 , 0) (𝑢 1 > 0)
then
168 5 The Global Geometry of Surfaces
√
1√
2 2
1 2 𝑢 ′ (𝑡) + 𝑣 ′ (𝑡)
𝑙(𝛼) = ∫ 𝐼 𝛼(𝑡) (𝛼′ (𝑡)) 𝑑𝑡 = ∫ 2 2
𝑑𝑡
0 0 1 − [𝑢(𝑡) + 𝑣(𝑡) ] (5.31)
1 2∣𝑢 ′ (𝑡)∣ 1 2𝑢 ′ (𝑡) 1 + 𝑢1
≥∫ 𝑑𝑡 ≥ ∫ 𝑑𝑡 = log ( ),
0 1 − 𝑢(𝑡)
2 0 1 − 𝑢(𝑡)
2 1 − 𝑢1
and equality holds if and only if 𝑣(𝑡) = 0 and 𝑢 ′ (𝑡) ≥ 0 for all 𝑡 ∈ [0, 1] — i.e., if and
only if the trace of 𝛼 is the line segment [0, 𝑢 1 ] × {0}, which is therefore a minimizing
curve. The diameter ] − 1, 1[ × {0} is thus a geodesic of D. Since rotations around
the origin are isometries of D, all other diameters are also geodesics.
It follows from (5.31) that geodesics starting from the origin have infinite length,
and therefore (see Exercise 119) D is complete. Furthermore, the intrinsic distance
between 0 and 𝑧 ∈ D is
1 + ∣𝑧∣
𝑑D (0, 𝑧) = log ( )⋅
1 − ∣𝑧∣
(3) Let 𝑎, 𝑏 ∈ C be such that ∣𝑎∣ > ∣𝑏∣, and consider the mapping
𝑎𝑧 + 𝑏
ℎ 𝑎,𝑏 (𝑧) = ⋅
𝑏𝑧 + 𝑎
The reader may check that ℎ 𝑎,𝑏 sends D bijectively onto itself. More remarkable is
that ℎ 𝑎,𝑏 is an isometry of D. Indeed, for 𝑧 ∈ D and 𝑤 ∈ C, we have (abbreviating
ℎ 𝑎,𝑏 to ℎ)
Now given 𝑧0 ∈ D,
𝑧 − 𝑧0
ℎ(𝑧) =
1 − 𝑧0 𝑧
is an isometry that maps 𝑧0 to 0, which allows us to deduce a formula for the distance
between 𝑧0 and another point 𝑧1 ∈ D:
5.6 Abstract Surfaces. The Hyperbolic Plane 169
𝑧1 − 𝑧0
𝑑D (𝑧 0 , 𝑧 1 ) = 𝑑D (ℎ(𝑧 0 ), ℎ(𝑧1 )) = 𝑑D (0, )
1 − 𝑧0 𝑧1
(5.33)
∣1 − 𝑧0 𝑧1 ∣ + ∣𝑧1 − 𝑧0 ∣
= log ( )⋅
∣1 − 𝑧0 𝑧1 ∣ − ∣𝑧1 − 𝑧0 ∣
𝑎𝑧 + 𝑏
𝑓 𝑀 (𝑧) = ,
𝑐𝑧 + 𝑑
we have 𝑓 𝑀1 𝑀2 = 𝑓 𝑀1 ○ 𝑓 𝑀2 . We hence obtain a group homomorphism Sl(2, C) → 𝔐
(where Sl(2, C) is the multiplicative group of the complex 2 × 2 matrices with
determinant equal to 1).
We are now interested in the geometric properties of these transformations. Writing
𝑎𝑧 + 𝑏 𝑎 𝑏 − 𝑎𝑑 𝑎𝑧 + 𝑏 𝑎 𝑏
= + 𝑐
(if 𝑐 ≠ 0), = 𝑧+ ,
𝑐𝑧 + 𝑑 𝑐 𝑐(𝑧 + 𝑑𝑐 ) 𝑑 𝑑 𝑑
● translations 𝑧↦𝑧+𝜁 (𝜁 ∈ C)
1
● inversions 𝑧↦
𝑧
Transformations of the first three types have known properties: in particular, they
transform straight lines into straight lines and circles into circles. Regarding the
inversions, we have the following:
● if 𝑟 is a line passing through 0, its inverse is still a line. If 𝑟 does not pass through
0, its inverse is a circle passing through 0;
● the inverse of a circle C is a straight line if 0 ∈ C, otherwise it is also a circle.
Figure 5.7
(5) If “lines” mean geodesics, Poincaré’s disc provides a model of the non-
Euclidean geometry of Lobachevski and Bolyai. In this geometry all the axioms of
Euclidean geometry are valid, except the axiom of parallels: in D, through a point
outside a “line” 𝑟, pass infinitely many “lines” that do not intersect 𝑟 — and not only
one, as in the Euclidean case.
Exercises
136. Show that R 2 , with the metric of example 5.6.1.B, has finite area and is not
complete.
In the remaining exercises in this section we work with the hyperbolic plane D.
5.6 Abstract Surfaces. The Hyperbolic Plane 171
137. Consider two geodesics 𝛾1 (𝑠) and 𝛾2 (𝑠) which, at time 𝑠 = 0, start from 𝑧0 ∈ D
in directions that make an angle 𝜃 with each other. Show that
exists.
138. Check that for all 𝑧0 , 𝑧 1 ∈ D, the set {𝑧 ∈ D∶ 𝑑D (𝑧, 𝑧 0 ) = 𝑑D (𝑧, 𝑧1 )} is a geodesic.
139. (a) Show that the hyperbolic circle S(𝑧 0 ; 𝑟) = {𝑧 ∈ D∶ 𝑑D (𝑧, 𝑧 0 ) = 𝑟} is also a
Euclidean circle.
(b) Check that any equilateral triangle in D (geodesic triangle with all sides equal)
can be inscribed into a hyperbolic circle, but that this is no longer true for every
geodesic triangle.
140. (a) Check that 𝑔(𝑧) = − 2𝑖 + 𝑧−𝑖
1
sends D into the half-plane {𝑧 ∈ C∶ ℑ𝑚(𝑧) > 0}.
(b) Let H be the half-plane {𝑧 ∈ C∶ ℑ𝑚(𝑧) > 0} with the metric given by
2
𝐸 = 𝐺 = 1/ℑ𝑚(𝑧) and 𝐹 ≡ 0. Show that 𝑔 is an isometry of D on H .
(c) Show that the geodesics of H are the vertical semicircles ℜ𝑒(𝑧) = constant
and the half circumferences with center on the axis ℑ𝑚(𝑧) = 0.
(d) Show that the isometries of H that preserve orientation are the functions
𝑎𝑧 + 𝑏
𝑧↦ ,
𝑐𝑧 + 𝑑
where 𝑎, 𝑏, 𝑐, 𝑑 are real numbers such that 𝑎𝑑 − 𝑏𝑐 = 1 (you can use Proposition
5.7.3 of the next section).
(e) Denote by ℑ(D) and ℑ(H ) the groups of the isometries of D and H that
preserve orientation. Note that ℑ(H ) = 𝑔 ○ ℑ(D) ○ 𝑔 −1 , and therefore ℑ(D) and
ℑ(H ) are conjugate subgroups of the group 𝔐 of Möbius transformations.
𝑎𝑏 𝑎𝑧 + 𝑏
141. Check that 𝑀 = [ ] ↦ 𝑓 𝑀 (𝑧) = is a homomorphism of Sl(2, C) on
𝑐𝑑 𝑐𝑧 + 𝑑
𝔐, and that the kernel of this homomorphism is {𝐼, −𝐼}. Conclude that 𝑓 𝑀 = 𝑓 𝑁 if
and only if 𝑀 = ±𝑁.
142. Two elements 𝑓 , 𝑔 ∈ 𝔐 are conjugate if there exists ℎ ∈ 𝔐 such that 𝑓 = ℎ○𝑔○ℎ−1 .
(a) Given 𝑓 ∈ 𝔐, define 𝜏( 𝑓 ) = ∣trace(𝑀)∣ if 𝑓 = 𝑓 𝑀 for 𝑀 ∈ Sl(2, C). Show that
if 𝑓 , 𝑔 ≠ id then 𝑓 and 𝑔 are conjugate if and only if 𝜏( 𝑓 ) = 𝜏(𝑔).
(b) Let Γ = ℑ(D) or Γ = ℑ(H ). Show that if 𝑓 , 𝑔 ∈ Γ are conjugate by an element
of 𝔐, then they are conjugate by some element of Γ.
143. Denote by Γ the group ℑ(D) or ℑ(H ), and let 𝑓 ∈ Γ. Show that:
(a) if√𝜏( 𝑓 ) > 2 then 𝑓 is conjugate to 𝑧 ↦ 𝜆𝑧 in H , where 𝜆 > 0 satisfies
𝜏( 𝑓 ) = 𝜆 + √1 ;
𝜆
(b) if 𝜏( 𝑓 ) = 2 and 𝑓 ≠ id then ℎ is conjugate to 𝑧 ↦ 𝑧 + 1 in H ;
(c) if 𝜏( 𝑓 ) < 2 then 𝑓 is conjugate to 𝑧 ↦ 𝑒 𝑖 𝜃 𝑧 on D, where 2 cos ( 2𝜃 ) = 𝜏( 𝑓 ).
Note: 𝑓 ≠ id is called hyperbolic, parabolic or elliptic according to whether a), b) or
c) of Ex. 143 holds.
172 5 The Global Geometry of Surfaces
In this section we will study the isometry groups of the complete and simply connected
surfaces of constant curvature, and show that the other complete surfaces of equal
curvature are obtained from these as quotients by a certain subgroup of the isometry
group. This approach allows us to describe all complete surfaces of non-negative
constant curvature.
We begin with a lemma that will be applied repeatedly:
Proof Given v ∈ 𝑇 𝑝 𝑆1 , let us consider the geodesic 𝛾(𝑡) = exp 𝑝 (𝑡v). Then 𝑓 ○ 𝛾 and
𝑔 ○ 𝛾 are parametrized geodesics with the same initial conditions, since 𝑓 ○ 𝛾(0) =
𝑔 ○ 𝛾(0) = 𝑓 (𝑝) and ( 𝑓 ○ 𝛾)′ (0) = 𝐷 𝑓 𝑝 (v) = 𝐷𝑔 𝑝 (v) = (𝑔 ○ 𝛾)′ (0) — and therefore
𝑓 ○𝛾(𝑡) = 𝑔○𝛾(𝑡) whenever 𝛾(𝑡) is defined. We thus conclude that 𝑓 ○ exp 𝑝 = 𝑔○ exp 𝑝 ,
which implies that 𝑓 and 𝑔 coincide in a neighborhood of 𝑝. This shows that the set
𝑈 = {𝑞 ∈ 𝑆1 ∶ 𝑓 (𝑞) = 𝑔(𝑞), 𝐷 𝑓𝑞 = 𝐷𝑔𝑞 } is non-empty and open. But its definition
ensures that 𝑈 is also closed, and therefore, 𝑆1 being connected, 𝑈 = 𝑆1 . ◻
Proof The case 𝐾 ≤ 0 was treated in Theorem 5.3.4. Only the case 𝐾 > 0 is left.
Given 𝑝 ∈ 𝑆 𝐾 , the mapping exp 𝑝 sends the disk 𝐵 𝜋/√𝐾 (𝑝) diffeomorphically into
𝑆 𝐾 ∖{−𝑝}. Thus, if 𝑞 is a point of 𝑆 and 𝐿∶ 𝑇 𝑝 𝑆 𝑘 → 𝑇𝑞 𝑆 a linear isometry, the mapping
𝑔∶ 𝑆 𝐾 ∖{−𝑝} → 𝑆 defined by 𝑔 = exp𝑞 ○ 𝐿 ○ exp−1 𝑝 is a local isometry. Let us now
̃
take 𝑝̃ ∈ 𝑆 𝐾 ∖{𝑝, −𝑝}, and let 𝑞̃ = 𝑔( 𝑝̃), 𝐿 = 𝐷𝑔 ̃𝑝 and ℎ∶ 𝑆 𝐾 ∖{− 𝑝̃} → 𝑆 be the
mapping given by ℎ = exp𝑞̃ ○ ̃ 𝐿 ○ exp−1
𝑝 . The mapping ℎ is also a local isometry, and
̃
furthermore 𝑔( 𝑝̃) = ℎ( 𝑝̃) and 𝐷𝑔 ̃𝑝 = 𝐷ℎ ̃𝑝 . By Lemma 5.7.1, 𝑔 and ℎ coincide on
the intersection of their domains, and we can thus define a local isometry 𝑓 ∶ 𝑆 𝐾 → 𝑆
by 𝑓 (𝑟) = 𝑔(𝑟) if 𝑟 ≠ −𝑝, and 𝑓 (𝑟) = ℎ(𝑟) if 𝑟 ≠ − 𝑝̃. Since 𝑆 𝐾 is compact, by
Proposition 5.2.2 this local isometry is a covering. ◻
5.7 Complete Surfaces of Constant Curvature 173
The following result is a consequence of Lemma 5.7.1 and the proofs of Theo-
rems 5.3.4 and 5.7.2.
Since the composite and inverse of isometries are still isometries, the set of
isometries of a given surface forms a group. By the above proposition, the group of
isometries of 𝑆 𝐾 , which we denote by ℑ𝐾 , is exceptionally large.
Let us now assume that 𝑓 ∶ 𝑆 𝐾 → 𝑆 is an isometric covering. Given 𝑞 ∈ 𝑆, let
𝑝 0 and 𝑝 1 be two of the preimages of 𝑞, and 𝐿 𝑖 (𝑖 = 0, 1) the linear isometry
𝐷 𝑓 𝑝𝑖 ∶ 𝑇 𝑝𝑖 𝑆 𝐾 → 𝑇𝑞 𝑆. By Proposition 5.7.3, there exists an isometry 𝑔∶ 𝑆 𝐾 → 𝑆 𝐾 such
that 𝑔(𝑝 0 ) = 𝑝 1 and 𝐷𝑔 𝑝0 = 𝐿 1−1 ○ 𝐿 0 . Now 𝑓 ○ 𝑔∶ 𝑆 𝐾 → 𝑆 is a local isometry such
that
𝑓 ○ 𝑔(𝑝 0 ) = 𝑞 = 𝑓 (𝑝 0 ),
𝐷( 𝑓 ○ 𝑔) 𝑝0 = 𝐷 𝑓 𝑝1 ○ 𝐷𝑔 𝑝0 = 𝐿 1 ○ (𝐿 1−1 ○ 𝐿 0 ) = 𝐷 𝑓 𝑝0
The first statement (apart from the uniqueness of 𝑔, which is easy) has already
been proved. Regarding the second, let us take an evenly covered neighborhood 𝑈
of 𝑓 (𝑝) and let 𝑉 be the component of 𝑓 −1 (𝑈) that contains 𝑝. If 𝑔 ∈ ℑ𝐾 ( 𝑓 )∖{id}
and 𝑞 ∈ 𝑉 then 𝑞 ≠ 𝑔(𝑞), since the identity is the only element of ℑ𝐾 ( 𝑓 ) with some
fixed point. Since 𝑓 (𝑞) = 𝑓 (𝑔(𝑞)), necessarily 𝑔(𝑞) ∉ 𝑉. Thus 𝑔(𝑉) ∩ 𝑉 = ∅, which
proves the statement.
It deserves mention that what we call discontinuous action is called proper
discontinuous action by most authors, who reserve the former name for a weaker
condition, of which we will make no use. We further say that a subgroup Γ of ℑ𝐾 is
discrete if its action on 𝑆 𝐾 is discontinuous (this terminology is also not the usual
one).
The covering group completely determines the surface in the following sense: if
we have two isometric coverings 𝑓𝑖 ∶ 𝑆 𝐾 → 𝑆𝑖 (𝑖 = 1, 2) such that ℑ𝐾 ( 𝑓1 ) = ℑ𝐾 ( 𝑓2 )
then 𝑆1 and 𝑆2 are isometric. We can further ask which subgroups of ℑ𝐾 are covering
groups. The answer is simple:
Consider, for example, the case of the sphere 𝑆 𝐾 given by the equation 𝑥 2 + 𝑦 2 +𝑧2 =
1
𝐾 (𝐾 > 0). The isometries of 𝑆 𝐾 are the restrictions of the linear isometries of
R , and correspond to the orthogonal 3 × 3 matrices over R — the group of which
3
Figure 5.8
More generally, we have the following result (the proof of which is Exercise 145
of this section):
Theorem 5.7.7 The only complete surfaces of constant curvature 𝐾 > 0 are, up to
isometry, the sphere and the projective plane.
5.7 Complete Surfaces of Constant Curvature 175
Proof In view of Proposition 5.7.5, it suffices to show that the only discrete subgroups
of ℑ𝐾 ≃ 𝑂(3, R ) are {𝐼, −𝐼} and the trivial group {𝐼}. Let Γ be such a subgroup:
given 𝐴 ∈ Γ, the isometry 𝐴 has some real eigenvalue 𝜆, which is necessarily 1 or −1.
If 𝜆 = 1, then 𝐴 has some fixed point in 𝑆 𝐾 , which implies that 𝐴 = 𝐼. If 𝜆 = −1 then
𝐴2 has the eigenvalue 𝜆2 = 1 and therefore 𝐴2 = 𝐼. The eigenvalues of 𝐴 are thus all
equal to 1 or −1; but none of them can be 1, and therefore 𝐴 = −𝐼. We thus have
Γ = {𝐼} or Γ = {𝐼, −𝐼}. ◻
c b
d a
c b
d a
Figure 5.9
The surface 𝑆 𝐾 /Γ is, in this case, the double torus. We would obtain the 𝑛-torus,
𝑛 ≥ 2, from a regular polygon P𝑛 of 4𝑛 sides and sum of angles equal to 2𝜋 (𝑛 = 1
is impossible because, by the Gauss-Bonnet theorem, the sum of the angles of any
four-sided polygon in the hyperbolic plane is < 2𝜋). A theorem of Poincaré ensures
176 5 The Global Geometry of Surfaces
that the elements of Γ that identify the sides of P𝑛 generate Γ. The interested reader
can find details in [10], [2] and [3].
To conclude this section and the book, we will determine all discrete subgroups of
the group ℑ0 of the isometries of R 2 — which is equivalent, by Proposition 5.7.5, to
determining all complete surfaces of constant zero curvature.
Lemma 5.7.8 The isometries of R 2 without fixed points are translations and glide
reflections (reflection in an axis 𝑟 followed by a translation in the direction of 𝑟).
Translations preserve orientation, while glide reflections reverse it.
Proof Every isometry of R 2 is of the form 𝑓 (𝑝) = 𝐿(𝑝) + v, where 𝐿 is a linear
isometry and v a vector. If 𝐼 − 𝐿 were an isomorphism then 𝑓 would have some fixed
point, and so for 𝑓 to have no fixed points, 𝐿 must have eigenvalue 1. Let (e1 , e2 ) be
an orthonormal basis of R 2 such that 𝐿(e1 ) = e1 . Since 𝐿 is an isometry, necessarily
𝐿(e2 ) = ±e2 . If 𝐿(e2 ) = e2 then 𝐿 is the identity and 𝑓 a translation. If 𝐿(e2 ) = −e2 ,
then 𝐿 is the reflection in the straight line generated by e1 . Letting v = 𝛼e1 + 𝛽e2 , we
recognize that 𝑓 is the composite of the reflection in the line 𝜆 ↦ 𝜆e1 + 𝛽2 e2 with the
translation associated with the vector 𝛼e1 (in particular, 𝑓 has no fixed points if and
only if 𝛼 ≠ 0).
Regarding the orientation, just note that it is preserved by 𝑓 if and only if it is by
𝐿. ◻
Statement (v) says that 𝐿(Γ𝑇 ) ⊆ Γ𝑇 . Applying 𝐿 to both sides, we obtain the
opposite inclusion Γ𝑇 ⊆ 𝐿(Γ𝑇 ); we thus conclude that Γ𝑇 is invariant under 𝐿.
Moreover, (iii) and (iv) say that Γ𝑅 is the result of a translation on Γ𝑇 : for all v ∈ Γ𝑅
we have Γ𝑅 = {v} + Γ𝑇 . This means that to obtain all the glide reflections of Γ, we
just need to compose one of these transformations with each of the translations of Γ.
Let (e1 , v2 ) be an orthonormal basis of R 2 such that 𝐿(e1 ) = e1 , 𝐿(e2 ) = −e2 . An
important observation, contained in the proof of Lemma 5.7.8, is that if 𝛼e1 + 𝛽e2 ∈ Γ𝑅
then 𝛼 ≠ 0. With these remarks in mind, let us now look at the forms that the group Γ
can take.
If Γ𝑇 is of the form {𝑛v1 ∶ 𝑛 ∈ Z }, then v1 is an eigenvector of 𝐿, which is collinear
with e1 or with e2 . Take v = 𝛼e1 + 𝛽e2 ∈ Γ𝑅 : then 𝛼 ≠ 0 and, by (i), 2𝛼e1 = v + 𝐿(v)
belongs to Γ𝑇 . We thus have v1 = 𝜆e1 , where we suppose 𝜆 > 0. Adding to v, if
necessary, an integer multiple of v1 , we can assume that 0 < 𝛼 < 𝜆. Since 2𝛼e1 is an
integer multiple of v1 , it follows that 𝜆 = 2𝛼 and v1 = v + 𝐿(v), which means that the
translation associated with v1 is the composite of 𝑓 (𝑝) = 𝐿(𝑝) + v with itself. Since
Γ𝑅 = {v} + Γ𝑇 , we conclude that Γ is the cyclic group generated by 𝑓 (𝑝). Therefore:
If Γ contains elements that reverse orientation and Γ𝑇 is one-dimensional, then Γ
is generated by a single glide reflection. In this case R 2 /Γ is a Möbius strip.
178 5 The Global Geometry of Surfaces
The case where Γ𝑇 is two-dimensional is left. Since 𝐿(Γ𝑇 ) = Γ𝑇 , there are two
possibilities:
(a) Γ𝑇 = {𝑛v1 + 𝑚𝐿(v1 )∶ 𝑛, 𝑚 ∈ Z };
(b) Γ𝑇 = {𝑛𝜆 1 e1 + 𝑚𝜆2 e2 ∶ 𝑛, 𝑚 ∈ Z }.
Suppose (a). Writing v1 = 𝜆e1 + 𝜂e2 (𝜆 > 0, 𝜂 ≠ 0), we have Γ𝑇 ∩⟨e1 ⟩ = {2𝑛𝜆e1 ∶ 𝑛 ∈
Z }. Take v = 𝛼e1 + 𝛽e2 ∈ Γ𝑅 : we may suppose, by adding to v a multiple of v1 , that
0 < 𝛼 < 𝜆. But 2𝛼e1 = v + 𝐿(v) is in Γ𝑇 ∩ ⟨e1 ⟩, implying 𝛼 = 𝑛𝜆, for some 𝑛 ∈ Z ,
which is absurd.
Thus (b) must be true. Repeating the argument above, we see that there exists
v = 𝛼e1 + 𝛽e2 ∈ Γ𝑅 such that 2𝛼 = 𝜆1 , and therefore v+𝐿(v) = 𝜆1 e1 . As Γ𝑅 = {v}+Γ𝑇 ,
the group Γ is generated by 𝑓 (𝑝) = 𝐿(𝑝) + v and the translation associated with the
vector 𝜆2 e2 . To summarize:
f(p)
p+ l2e2
p p+ ae1
Figure 5.10
Exercises
145. Prove Proposition 5.7.6. Hint: if 𝑆 𝐾 /Γ is orientable and an orientation is fixed
on it then the quotient mapping 𝑆 𝑘 → 𝑆 𝐾 /Γ induces an orientation on 𝑆 𝐾 .
146. (a) Let Γ1 and Γ2 be discrete subgroups of ℑ𝐾 . Show that 𝑆 𝐾 /Γ1 and 𝑆 𝐾 /Γ2 are
isometric if and only if Γ1 and Γ2 are conjugate subgroups.
(b) Give an example of two tori of equal area and constant zero curvature that are
not isometric.
147. Show that if Γ is a discrete subgroup of ℑ(D), then Γ ∖{id} contains only
hyperbolic isometries (see, in Section 5.6, exercises 140, 143 and 144).
148. (a) Consider on the torus of revolution T 2 ⊆ R 3 , given by Φ(𝑢, 𝑣) = ((2 +
cos 𝑣) cos 𝑢, (2+cos 𝑣) sin 𝑢, sin 𝑣), the equivalence relation ∼ that identifies (𝑥, 𝑦, 𝑧)
and (−𝑥, −𝑦, −𝑧). Show that T 2 / ∼ is a Klein bottle.
(b) If ≃ is the equivalence relation that identifies (𝑥, 𝑦, 𝑧) and (−𝑥, −𝑦, 𝑧), what
surface is T 2 / ≃?
References
1. E. Barbier: Note sur le problème de l’aiguille et le jeu du joint couvert. J. Math. Pures Appl.
(2) 5 (1860).
2. A.F. Beardon: The Geometry of Discrete Groups. Springer-Verlag 1983.
3. A.F. Beardon: An introduction to Hyperbolic Geometry [in Bedford, Keane, Series (eds.):
Ergodic Theory, Symbolic Dynamics, and Hyperbolic Spaces]. Oxford U.P. 1991.
4. A.L. Besse: Manifolds all of whose Geodesics are Closed. Springer-Verlag 1978.
5. J.H. Cadwell: Topics in Recreational Mathematics. Cambridge University Press 1966.
6. M.P. do Carmo: Differential Geometry of Curves and Surfaces. Prentice-Hall 1976.
7. C.D. Charkerian, H. Groemer: Convex bodies of constant width [in P.M. Gruber, J.M. Wills
(eds.): Convexity and its applications]. Birkhäuser 1983.
8. S.S. Chern: Curves and Surfaces in Euclidean Space [in S.S. Chern (ed.): Global Differential
Geometry]. Math. Association of America 1988.
9. H.G. Eggleston: Convexity. Cambridge University Press 1958.
10. R. Fenn: What is the Geometry of a Surface? Am. Math. Monthly 90 (1983)
11. D.G. Figueiredo: Análise de Fourier e equações diferenciais parciais. Projeto Euclides, IMPA
1977.
12. A. Goetz: Introduction to Differential Geometry. Addison-Wesley 1970.
13. W. Klingenberg: A course in Differential Geometry. Springer-Verlag 1978.
14. E. Kreyszig: Introduction to Differential Geometry and Riemannian Geometry. University of
Toronto Press 1968.
15. E.L. Lima: Duas novas demonstrações do teorema de Jordan-Brouwer no caso diferenciável.
Matemática Universitária 4 (1986).
16. E.L. Lima: Curso de Análise, vol. 2 (3a edição). Projeto Euclides, IMPA 1989.
17. E.E. Moise: Geometric Topology in dimensions 2 and 3. Springer-Verlag 1977.
18. I. Niven: Maxima and minima without Calculus. Math. Association of America 1981.
19. R. Osserman: A survey of Minimal Surfaces. Dover Publications 1986.
20. R. Osserman: Curvature in the Eighties. Am. Math. Monthly 97 (1990).
21. R. Osserman: The four-or-more vertex theorem. Am. Math. Monthly 92 (1985).
22. L.A. Santaló: Integral Geometry [in S.S. Chern (ed.): Global Differential Geometry]. Math.
Association of America 1988.
23. J. Sotomayor: Lições de equações diferenciais ordinárias. Projeto Euclides, IMPA 1979.
24. M. Spivak: Calculus. Benjamin 1967.
25. M. Spivak: A comprehensive introduction to Differential Geometry, vol. I-IV. Publish or Perish
1970-75.
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024 181
P. V. Araújo, Differential Geometry, https://doi.org/10.1007/978-3-031-62384-4
Index
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024 183
P. V. Araújo, Differential Geometry, https://doi.org/10.1007/978-3-031-62384-4
184 Index
gnomonic, 57 Theorem
Pseudo-sphere, 36, 75 of Barbier, 22
of Brouwer’s fixed point, 136
Radius of curvature, 6 of divergence, 102, 103, 161 – 161
Glide reflection, 176 of Gauss, 93
Fundamental region, 175 of Gauss-Bonnet, 109, 116
Polygonal region, 102 of Hadamard, 148, 151
of the four vertices, 26
Normal Section, 61 of the rigidity of the sphere, 156
Second fundamental form, 67, 74
of the rotation index, 134, 136
Similarity, 86, 87, 131
Torsion, 7, 8, 14
Poincaré Semi-plane, 171
Torus, 36, 43, 82, 94, 143, 144, 166, 177, 179
Christoffel Symbols, 91
Double Torus, 117, 175
Conjugate Subgroups, 171, 180
Trace of a curve, 1
Sphere, 33, 63, 70
Tractrix, 36
geodesics, 122
isometries, 174 – 175 Trajectory of a vector field, 76
rigidity theorem, 156 Möbius Transformation, 169
parallel transport, 100 Parallel Transport, 99, 100, 112, 119
Tangent Space, 44, 165 – 166 Triangulation, 113, 131
Surfaces Geodesic triangle, 115, 130, 143, 171
abstract, 164 Reuleaux triangle, 23
complete, 139, 140, 142 Frenet’s triangle, 6
connected, 40
convex, 152 Regular Value, 40
of constant curvature, 70, 75, 123, 128, 130, Variation
131, 156, 172, 175, 179 of area, 106
of non-positive curvature, 150, 151 normal, 103
inextensible, 141 of a surface, 103, 159
of constant width, 75 – 76, 159 160 – 161 Velocity, 2
minimal, 107, 108 scalar, 3
level, 40, 42 Volume
orientable, 46 – 47, 49, 75, 175 between parallel surfaces, 160
oval, 148, 152
bounded by a surface of constant width, 161
parallel, 160
parameterized, 34
ruled, 75, 102 Width, 19
regular, 31, 40 Constant Width
of revolution, 34, 70, 75, 102, 128 of a curve, 19 – 24
simply connected, 148, 151, 151 of a surface, 71 – 76, 159, 160 – 161
Spindle, 55 Wirtinger’s lemma, 27