Lecture Notes For MA455 Manifolds: David Mond March 7, 2008
Lecture Notes For MA455 Manifolds: David Mond March 7, 2008
MA455  Manifolds
David  Mond
March  7,  2008
Contents
1   Foundations   3
1.1   First  denitions   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   3
1.2   Submersions  and  Immersions .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   15
1.3   Finding  Regular  Equations  .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   25
1.4   Images  of  Immersions  .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   28
1.5   Transversality   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   33
2   Sards  theorem  and  the  density  of  transversality   37
2.1   Tangent  and  normal  bundles   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   37
2.2   Tubular  neighbourhoods   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   41
2.3   Sards  theorem   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   43
2.4   The  construction  of  transverse  perturbations   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   46
2.5   The  stability  of  transverse  intersections   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   50
3   Oriented  Intersection  Theory   52
4   Applications  of  Oriented  Intersection  Numbers   74
4.1   Vector  elds  on  spheres  .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   75
4.2   Linking  Numbers   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   76
4.3   Self-intersection  and  the  Euler  Characteristic  .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   81
4.4   Brouwers  Fixed  Point  Theorem  .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   82
4.5   The  Fundamental  Theorem  of  Algebra   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   83
5   Abstract  Manifolds   85
5.1   Denition  and  examples   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   85
5.2   An  important  example  -  the  Grassmanian   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   95
5.3   Complex  manifolds   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   97
5.4   Quotient  spaces  as  manifolds  .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   100
1
5.5   Submanifolds   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   104
5.6   Embedding  manifolds  in  Euclidean  space   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   104
5.7   Further  Structure  on  Abstract  Manifolds   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   107
5.7.1   A  rst  approach  to  tangent  spaces .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   108
5.7.2   A  second  approach  to  tangent  spaces   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   110
5.7.3   A  third  approach  to  tangent  spaces   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   111
5.8   You  dont  have  to  choose  .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   112
6   Dierential   Forms  and  Integration  on  Manifolds   113
6.1   First  examples .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   113
6.2   Determinants,  volume  and  change  of  variable  in  multiple  integration  .   .   .   .   .   117
6.3   Dierential  forms   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   121
6.4   Integration  of  dierential  forms   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   125
6.5   Stokess  Theorem  .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   132
6.6   Concluding  Remarks   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   136
7   Appendices   137
7.1   Appendix  A:  Linear  Maps  and  Matrices .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   137
7.2   Appendix  B:  Some  Topics  in  Linear  Algebra   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   139
7.3   Appendix  C:  Derivatives   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   140
7.4   Appendix  D:  Quotient  spaces  in  topology  .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   144
2
1   Foundations
Check  the  Appendices  at  the  end  of  these  notes  for  revision  of  basic  notions  of  linear  alge-
bra  and  of  the  derivative  of  a  smooth  map.
1.1   First  denitions
Manifolds,  as  we  will  rst  encounter  them,  are  certain  subsets  of  Euclidean  space  R
n
.   Later
we will introduce a more sophisticated notion of Manifold, which does not require an ambient
space  to  live  in.
Example  1.1.   Choose  real   numbers  0  <  b  <  a.   The  torus   T  =  T
ab
    R
3
is  shown  in  the
following  diagram.   It  is  the  set  of  points  a  distance  b  from  the  circle  C
a
  (not  shown)  in  the
x
1
x
2
  plane  with  centre  0  and  radius  a.
P
1
x
x
x
1
2
3
P
Each  point  P   T  is  determined  by  the  two  angles  
1
, 
2
  shown;  in  terms  of  these  angles  its
co-ordinates  in  the  ambient  R
3
are
((a + b cos 
2
) cos 
1
, (a + b cos 
2
) sin 
1
, b sin 
2
).
This formula species a bi-periodic map  from the 
1
2
-plane to the torus.   In fact the torus
is the image of the square [0, 2][0, 2] under , or indeed of any square [c, c+2][d, d+2].
3
A  smooth  map  taking  an  open  set   in  the  plane  onto  an  open  set   in  a  surface  is  called  a
(smooth)  parametrisation of  that  part  of  the  surface.
We can also dene  T by an equation:   for any point P   R
3
 x
3
axis there is a unique
closest  point  P
on  C
a
.   Then  P    T  if   and  only  if   |P   P
|  =  b.   If   P  has  co-ordinates
(x
1
, x
2
, x
3
)  then  P
has  co-ordinates
_
  ax
1
_
x
2
1
 + x
2
2
,
  ax
2
_
x
2
1
 + x
2
2
, 0
_
so  T  is  the  set  of  points  (x
1
, x
2
, x
3
)  satisfying
_
x
1
  ax
1
_
x
2
1
 + x
2
2
_
2
+
_
x
2
  ax
2
_
x
2
1
 + x
2
2
_
2
+ x
2
3
  = b
2
.
Notice  that  this  equation  only  makes  sense  in  R
3
 x
3
axis.   2
Example  1.2.   The  sphere  S
2
 R
3
is  the  set  of  points  dened  by  the  equation
x
2
1
 + x
2
2
 + x
2
3
  = 1.
The  north  and  south  poles  N  and  S  are  the  points  (0, 0, 1)  and  (0, 0, 1).   Stereographic
projection  
N
  :  S
2
 N    R
2
is  dened  by  the  following  diagram.   That  is,   
N
(P)  is  the
point  where  the  line  joining  N  to  P  meets  the  equatorial  plane.
N
S
N
P
   (P)
Evidently  
N
(P) =  P  + NP  for  some  constant  .   The  requirement  that  
N
(P)  lie  in  the
equatorial  plane  determines  the  value  of  :   it  is  x
3
/(1 x
3
).   Hence
N
(P) = (x
1
, x
2
, x
3
) +
  x
3
1 x
3
(x
1
, x
2
, x
3
1) =
_
  x
1
1 x
3
,
  x
2
1 x
3
_
(we  ignore  the  last  coordinate,  0,  of  
N
(P)  in  order  to  regard  s
N
(P)  as  a  point  in  R
2
).
Stereographic  projection  
S
  :  S
2
 S    R
2
is  dened  by  a  similar  procedure,   resulting  in
the  formula
S
(x
1
, x
2
, x
3
) =
_
  x
1
1 +x
3
,
  x
2
1 +x
3
_
4
Each is a homeomorphism from an open set in S
2
to the plane.   Such maps are called charts,
in line with the old nautical term:   they are planar representations  of parts of the surface (in
this  case  the  sphere).   2
Charts  and  parametrisations  are  mutually  inverse  procedures:
Example  1.1.   (again).Given  (x
1
, x
2
, x
3
)    T,   we  can  nd  (
1
, 
2
)  such  that  (x
1
, x
2
, x
3
)  =
(
1
, 
2
)  by  means  of  the  formula
1
  =
_
  arctan x
2
/x
1
  if  x
1
  > 0   ,   arctan x
2
/x
1
 +    if  x
1
  < 0
/2 arctanx
1
/x
2
  if  x
2
  > 0   ,   /2 arctan x
1
/x
2
  if  x
2
  < 0
and  a  slightly  more  complicated  formulae  for  
2
.   Each  point  on  the  torus  is  in  the  interior
of  the  domain  of  a  map  which  gives  (
1
, 
2
)  in  terms  of  smooth  functions  of  the  coordinates
(x
1
, x
2
, x
3
).   Each  is  a  homeomorphism  from some  open  set  in  T  to  an open  set  in  the  plane.
But  these  charts  do  not  piece  together  to  give  a  smooth  map  from  all  of  the  torus  to  some
region  in  R
2
.
Example  1.2.   (again) Exercise:   Find a  formula  for the inverse  of 
N
  : S
2
 N    R
2
.   2
Remarks  on  these  examples
1.   The  charts  weve  seen  in  the  examples  are  dened  in  terms  of   ambient  coordinates,
even  though  were  only  interested  in  their  behaviour  on  the  surfaces  in  question.   As
functions  in  the  ambient  co-ordinates,   they  are  generally  dened  on  open  sets  in  R
3
,
not  on  all  of   R
3
.
2.   In general we need several charts to cover the whole surface.   Their domains will overlap,
so  we  will  have  several  dierent  planar  representations  of  the  same  portion  of  surface.
In order to extract unambiguous information from these charts we need to understand
how  the picture  in one  chart is  transformed  when  we  view  the  same  portion of  surface
in  another.
Example 1.2.  (yet again):   the composite 
S
1
N
  (which maps 
N
(domain(
N
)domain(
S
)) =
R
2
 0  to  
S
(domain(
N
)  domain(
S
)) =  R
2
 0)  is  easily  seen  to  be  the  map
(y
1
, y
2
)  
  1
y
2
1
 + y
2
2
(y
1
, y
2
).
This  is  inversion  in  the  unit  circle  in  the  language  of  classical   geometry.   It  has  the  inter-
esting  property that it preserves  angles,  in the  sense  that if  the smooth curves  C
1
  and  C
2
  in
R
2
 0  meet  at the  point Q,  then  their  images  under  
S
 
1
N
  meet  with  the  same  angle at
S
 
1
N
  (Q).   A map from the plane to the plane with this property is called conformal.   You
can  easily  check  this  property  here  by  dierentiating  
S
   
1
N
  ,   and  checking  that  for  each
point  Q  and  any  two  vectors  u, v, there  is  a  constant  C(Q)  (depending  on  Q)  such  that
d
Q
(
S
 
1
N
  )(u)  d
Q
(
S
 
1
N
  )(v) = C(Q)u  v,
5
from  which  the  property  of  preserving  angles  follows  easily.
We  all  know  how  to  measure  the  angle  between  smooth  curves  on  the  sphere,  of  course.
But  now  imagine  that we  do  not (e.g.   the  sphere  is  to  big  for us  to  climb  onto).   A  sensible,
consistent  way of assigning an angle to the intersection  of two curves  C
1
, C
2
  on the sphere
would  be  to  measure  the  angle  between  their  images  in  either   of   the  charts  
S
, 
N
.   The
crucial  point  is  that  it  would  not  matter  which  we  use,  because  since  
S
 
1
N
  is  conformal,
we  would  get  the  same  answer  either  way.   2
Denition  1.3.   (i)   Let   U     R
n
be   open.   A  map  f   :   U    R
p
is   smooth  if   its   partial
derivatives  of  all  orders  exist  and  are  continuous.
(ii)  If   U, V     R
n
are  open  then  f   :   U    V   is  a  dieomorphism  if   it  is  smooth  and  has  a
smooth  inverse.
We  need  openness  of  U  in  (i)  to  be  able  to  speak  of  partial  derivatives:   since
f
x
i
(a) =   lim
h    0
f(a + he
i
) f(a)
h
(where  e
i
  is  the  ith  standard  basis  vector),  we  need  a + he
i
  to  be  in  U,  for  all  a  and  small
enough  h,  in  order  to  dene  the  limit,  and  this  is  exactly  what  openness  of  U  guarantees.
Denition  1.3.   (continued)  (iii)  Let  X    R
n
.   A  map  f   :   X    R
p
is  smooth  (new)  if
for  each  x  X  there  is  a  neighbourhood  U  of  x  in  R
n
and  a  smooth  map  (in  the  old  sense)
F  : U    R
p
such  that  on  U   X,  f  and  F  coincide.
(iv)  If   X    R
n
and  Y     R
p
then  a  map  f   :   X    Y   is  a  dieomorphism  (new)  if   it  is
smooth  (in  the  sense  of  (iii))  and  has  a  smooth  inverse  (also  in  the  sense  of  (iii)).
We  will  refer  to  maps  which  are  smooth  in  the  sense  of  (i)  and  (iii)  as  smooth  (old)  and
smooth (new),  until it has become clear that our new denition does not clash with the old.
Proposition  1.4.   (i)  The  composite  of  two  smooth  (new)  maps  is  smooth  (new).
(ii)  If the  domain  of  a  smooth  (new)  map  is  in  fact  open  in  a  Euclidean  space,  then  the  map
is  smooth  (old).
Proof   If X
  f
  Y
  g
  R
q
are smooth (new),  with X, Y  in  R
n
and  R
p
respectively,  sup-
pose  x  X.   By  denition  of  smoothness  (new)  of  f,  there  is  a  neighbourhood  U  of  x  in  R
n
and  a  local  smooth  extension  F  : U    R
p
of  f    that  is,  a  smooth  (old)  map  such  that  on
U X, f  and F  coincide.   Similarly, by the smoothness (new) of g, there is a neighbourhood
V   of  f(x)  in  R
p
on  which  g  has  a  local  smooth  extension  G.   Then  G  F  is  a  local  smooth
extension  of  g  f  on  U,  so  g  f  is  smooth  (new).
The  second  statement  is  obvious  from  the  denition    take  F  = f.   2
The  purpose  of part (ii) here  is  to make  clear that the new  denition  never  clashes  with  the
old.
6
Example  1.5.   The  torus  T  R
3
is  dieomorphic  to  S
1
S
1
 R
4
.
Proof
P
1
P
2
2
=(x  ,y  )
2
2
1
1
1
 (x  ,y  )
=
The  map  R
2
  T  introduced  in  Example  1.1  used  the  trigonometric  functions  of   the  two
angles   
1
, 
2
.   These   can  be  expressed  in  terms   of   the  cartesian  coordinates   of   the  point
(P
1
, P
2
)  S
1
S
1
:   cos 
1
  = x
1
, sin 
1
  = y
1
, cos 
2
  = x
2
, sin 
2
  = y
2
.   Substituting  these  in  the
formula  for  the  parametrisation,  we  get  a  map  S
1
S
1
  T  dened  by
(x
1
, y
1
, x
2
, y
2
) ((a + bx
2
)x
1
, (a + bx
2
)y
1
, by
2
).
Exercise  Find  its  inverse  and  check  that  it  is  smooth.   2
Denition  1.6.   (i) M   R
n
is a smooth  manifold  of  dimension m if for all x  M  there is a
neighbourhood  U  of  x  in  M,  an  open  set  V   R
m
,  and  a  dieomorphism   : U    V .   Such
a  map    is  called  a  chart  on  M  around  x.
(ii)  A  collection  of  charts  whose  domains  together  cover  all  of  M  is  called  an  atlas  on  M
Example  1.7.   (i)  T  R
3
is a smooth manifold of dimension  2.   The locally dened  inverses
to  the  periodic  parametrisation  give  charts.
(ii)  S
2
 R
3
is  a  manifold  of  dimension  2  -  use  the  two  stereographic  projections  as  charts.
(iii)  If  M   R
M
and  N   R
N
are  manifolds  of  dimension  m  and  n  then  M  N   R
M+N
is
a  manifold  of  dimension  m + n.
(iv)   If   U    R
n
is   open,   and  f   :   U    R
p
is   smooth  (old)   then  the  graph  of   f,   (x, y)   
U    R
p
:   y   =  f(x)  is   a  manifold  of   dimension  n.   For   the   restriction  to  gr(f)   of   the
projection  V   R
p
  V   is  smooth  with  smooth  inverse  the  map  x   (x, f(x)),   and  is  thus
a  dieomorphism.   In  this  case  we  need  only  one  chart  to  get  an  atlas.
In  fact, we  will  see  later  that every  n-dimensional  manifold  M   R
N
is  locally the  graph
of   a  smooth  map  mapping  an  open  set  in  some  co-ordinate  n-plane  to  its  complementary
N  n-plane.
(v)  If,  in  (iv),  f  is  not  smooth,  then  the  graph  of  f  is  not  a  smooth  manifold  (Exercise).
(vi) Although this might seem to be stretching  a point, a 0-dimensional manifold is a collec-
tion  of  discrete  points.   For  we  (perhaps  articially)  declare  that  R
0
= 0  is  a  single  point,
7
and  it  follows  that  the  only  non-empty  open  set   in  R
0
is   R
0
itself.   Thus,   a  set   of   points
X    R
n
is  a  0-dimensional   manifold  if  each  point  x    X  has  a  neighbourhood  in  X  which
is  dieomorphic  to  R
0
.   This  means  precisely  that  each  point  has  a  neighbourhood  in  X  of
which  it  is  the  only  member,  and  thus  that  X  is  a  collection  of  discrete  points.
2
Remark  1.8.   Suppose  that  
1
  : U
1
    V
1
  and  
2
  : U
2
    V
2
  are  charts  on  the  manifold  M,
and that their domains, U
1
 and U
2
 have non-empty intersection.   Then there is a commutative
diagram
U
U
1
2
1
1
o
2
M
in  which  the  map  
2
 
1
1
  : 
1
(U
1
 U
2
)    
2
(U
1
 U
2
)  is  known  a  the  crossover  map  from
the  chart  
1
  to  the  chart  
2
.   It  is  a  consequence  of  1.4(ii)  that  all  such  crossover  maps  are
automatically smooth (old).   In the more abstract version of the denition of manifold, which
we  will  meet  in  Chapter  5,  and  which  you  may  already  have  come  across  elsewehere,   we  do
not insist  that our manifolds be contained in some Euclidean  space  R
n
, and we  then have to
insist,  as  an  explicit  hypothesis,  that  in  any  atlas,  all  the  crossover  maps  must  be  smooth.
Almost  everything  one can prove  about manifolds  at the outset makes  use of the Inverse
Function  Theorem:
Theorem  1.9.   Let  U  be  an  open  set  in  R
n
and  f  : U    R
n
a  smooth  map.   If  the  derivative
d
x
f  :   R
n
  R
n
of  f  is  an  isomorphism  then  there  are  neighbourhoods  U
1
  of  x  in  R
n
and  V
1
of  f(x)  in  R
n
such  that  f  : U
1
    V
1
  is  a  dieomorphism.   2
Recall   that   the  derivative   of   f   at   x,   d
x
f,   (when  it   exists)   is   a  linear   map  such  that
f(x) + d
x
f(v)  is  a  reasonable  approximation  to  f(x + v).   More  precisely,  it  is  the  unique
linear  map  with  the  property  that
f(x + v) = f(x) + d
x
f(v) +|v|E(x, v)
for some error term E  such that E    0 as v    0.   A sucient  condition for the existence  of
the  derivative  is  the  existence  and  continuity  of  the  rst  order  partial  derivatives  of  each  of
the component functions of f.   Thus every smooth map on an open set in  R
n
has a derivative
at  every  point.
8
Recall  also  the  Chain  Rule  for  derivatives:
Theorem  1.10.   d
x
(g  f) = d
f(x)
g  d
x
f.   2
And  remember  that  in  the  case  of  a  function  f  of  1  variable,  the  notion  of  derivative,  in
this  sense,  is  linked  to  the  old  notation  f
(t)  as  follows:
f
(t) =   lim
h    0
f(t + h) f(t)
h
  =   lim
h    0
f(t) + d
t
f(h) +[h[E(t, h)     f(t)
h
  =
=   lim
h    0
hd
t
f(1) +[h[E(t, h)
h
  = d
t
f(1).
A  similar  argument  shows  that  for  a  function  f  of  n  variables,
f
x
j
(x) = d
x
f(e
j
),
where  e
j
  is   the  jth  basis   vector   (0, . . ., 0, 1, 0, . . ., 0)   (with  1  in  the  jth  place.)   Because
of   this,   we  have  the  following  useful   formula:   suppose  that    :   R    U    R
n
is  a  smooth
parametrised  curve  and  f  :   U    R
p
is  a  smooth  (old)  map.   Then  if   (0)  =  x,   and  we  let
v = 
(0),  then
d
x
f(v) = d
x
f(
(0)) = d
x
f(d
0
(1)) = d
0
(f   )(1) = (f   )
(0)
which  we  will  use  a  lot  in  what  follows.
In order to use the inverse  function theorem we need rst a notion of derivative for maps
between  manifolds.
Denition/Proposition 1.11.  Let M
m
 R
n
be a manifold, and x  M.   The tangent space
to  M  at  x,  denoted  T
x
M,  is  the  space  dened  by  the  following  two  equivalent  formulae:
(i)  Let   : U    V   R
m
be  a  chart  on  M  around  x,  and  let  a = (x).   Then
T
x
M  = d
a
(
1
)(R
m
).
Note  that  
1
is  smooth  (old)  and  so  has  an  already-dened  derivative.
(ii)
T
x
M  = 
(0) = (
1
 )
(0) = d
a
(
1
)(
(0)).
9
Thus  the  set  dened  by  the  second  formula  is  contained  in  the  set  dened  by  the  rst.
Conversely,   if   v    R
m
,   we  can  choose  a  curve    :   R, 0    V, a  such  that  
(0)  =  v:   e.g.
(t) = a + tv.   If  we  denote  by    the  composite  
1
 ,  then  we  have
d
a
(
1
)(v) = d
a
(
1
)(
(0)) = (
1
 )
(0) =
(0).
It  follows  that  the  set   dened  by  the  rst   formula  is  contained  in  the  set   dened  by  the
second.   2
Corollary  1.12.   T
x
M  is  an  m-dimensional  vector  space.
Proof   That  it  is  a  vector  space  follows  from  the  rst  formulation  of  the  denition.   Here
is  why  it  is  m-dimensional:   choose  a  local   smooth  extension    to    around  x.   That  is,   
is  dened  and  smooth  (old)  on  some  neighbourhood  of   x  in  R
n
,   and  on  U   M    and  
coincide.   Then
  
1
=   
1
= 1
V
,
so
d
x
  d
a
(
1
) = d
a
1
V
  = 1
R
m
.
Hence  d
a
(
1
)  is  injective,  and  its  image  is  an  m-dimensional  vector  space.   2
Example 1.13.  (i) If U   R
k
is open then U  is a smooth manifold.   It is easy to see, by either
of  the  two  versions  of  the  denition  of  the  tangent  space,   that  for  each  x    U,   T
x
U  =  R
k
.
To  apply  the  rst  version  of  the  denition,   just  take,   as  chart  on  U,   the  identity  map.   To
apply the second  version,  let v  be any vector  in  R
k
and dierentiate the curve  (t) = x +tv.
(ii)  If  V   R
n
is  a  real  vector  space  then  it  is  linearly  isomorphic  to  some  R
k
,  and  is  thus  a
k-dimensional  manifold.   The  last  argument  in  (i)  shows  that  for  all  x  V ,  T
x
V  = V .
(iii) Suppose that the surface S   R
3
is dened  by an equation h = 0 (e.g.   in the case of the
sphere  S
2
,  h(x
1
, x
2
, x
3
) = x
2
1
 + x
2
2
 + x
2
3
1).   Suppose  also  that  at  x  S,  d
x
h ,= 0.   Then
T
x
S  = ker d
x
h.
For  since  d
x
h ,=  0,  both  sides  in  this  equation  are  2-dimensional  vector  spaces;   and  the  left
hand side is contained in the right, for the obvious reason that if   :   R, 0    S, x is any curve
then  h    is  constant,  so  that  0 = (h  )
(0) = d
x
h(
(0)).   2
Denition/Proposition  1.14.   Let  f  :   M    N  be  a  smooth  map  between  manifolds.   If
x  M,  the  derivative  of  f  at  x  is  the  linear  map
d
x
f  : T
x
M    T
f(x)
N
dened  by  any  one  of  the  following  equivalent  formulations:
10
(i) Let v  T
x
M.   Choose a curve  in M  such that 
(0) = v.   Then d
x
f(v) = d
x
f(
(0)) =
(f   )
(0).
(ii)  Choose  a  smooth  (old)  extension  F  of  f  around  x.   Then
d
x
f  = d
x
F
|TxM
.
Again,  each  denition  on  its  own  is  decient,   in  this  case  because  each  involves  an  element
of  choice.   However,
Claim  The  two  versions  of  the  denition  coincide.
Proof   d
x
F(
(0)) = (F )
(0) = (f )
(0)  =  v,   and
(ii)  is  independent  of   the  choice  of   smooth  local   extension  of   f,   so  both  are  unambiguous
denitions.   Note also that version (ii) makes clear that if the manifold M  is actually an open
set  in  a  Euclidean  space  R
n
,  (so  that  for  a  map  with  domain  M,  smooth  (new)  is  the  same
as smooth (old)), then our new  denition of  derivative  d
x
f  is the same as the old denition.
Example  1.15.   Consider  the  map  f  : S
1
  S
1
dened,  using  a  complex  co-ordinate  z,  by
f(z) = z
2
.   In  real  co-ordinates  x, y,  this  is
f(x, y) = (x
2
y
2
, 2xy).
This  formula  uses  ambient  coordinates,   and  therefore  denes  a  smooth  extension  of   f   (in
fact  to  all   of   R
2
).   Thus  d
(x,y)
f   is  the  restriction  to  T
(x,y)
S
1
of   the  linear  map  with  matrix
(with  respect  to  the  standard  basis  of  the  ambient  space)
_
  2x   2y
2y   2x
_
.
Consider  the  curve    in  S
1
(in  fact,  parametrising  all  of  S
1
,  of  course!)   (t) = (cos  t, sin  t).
The  vector  
(t)) =
_
  2 cos  t   2 sin  t
2 sin  t   2 cos  t
__
  sin  t
cos  t
_
=
_
  4 cos  t sin  t
2 cos
2
t 2 sin
2
t
_
=
_
  2 sin  2t
2 cos  2t
_
= 2
(2t).
That  is,  the  image  under  d
(x,y)
f  of  the  unit  generator  of  T
(x,y)
S
1
is  twice  the  unit  generator
of  T
f(x,y)
S
1
.   Of  course,  this  is  not  surprising    we  know  that z
2
goes round  the  circle  twice
as  fast  as  z  does.   2
Proposition  1.16.   Chain  rule  for  smooth  maps  between  manifolds
If  M
  f
  N
  g
  P  are  smooth  maps,  then
d
x
(g  f) = d
f(x)
g  d
x
f.
11
Proof   Obvious,  using  denition  (i)  of  the  derivative.   2
Example  1.17.   A Lie  group is a manifold G which is also a group, for which the operations
of
multiplication:
_
p : GG    G
(g
1
, g
2
) g
1
g
2
and  inversion:
_
i : G    G
g g
1
are  smooth  maps.   We  can  give  only  rather  few  examples  at  this  stage:   they  are
1.   Gl(n, R),  the  set  of  invertible  n n  real  matrices,  under  matrix  multiplication.
2.   R
2
 0,  under  complex  multiplication,  and  its  subgroup  S
1
,  the  unit  circle.
3.   R
4
0 under quaternionic multiplication, which is dened as follows:   write (x
0
, x
1
, x
2
, x
3
) 
R
4
as  x
0
1 + x
1
i + x
2
j + x
3
k  and  then  dene
1i = i,   1j = j 1k = k
i
2
= j
2
= k
2
= 1
ij = k,   jk = i,   ki = j
with each of the last three products changing sign if the order of the factors is reversed.
Extend  by  linearity   to  all   of   R
4
 0,   so,   back  in  the  usual   coordinate  notation,   we
get
(x
0
, x
1
, x
2
, x
3
)  (y
0
, y
1
, y
2
, y
3
) =
(x
0
y
0
x
1
y
1
x
2
y
2
x
3
y
3
, x
0
y
1
+x
1
y
0
+x
2
y
3
x
3
y
2
, x
0
y
2
+x
2
y
0
+x
3
y
1
x
1
y
3
, x
0
y
3
+x
3
y
1
+x
1
y
2
x
2
y
1
).
It is straightforward to check that if x, y  Q then |xy| = |x||y|; as a consequence,
the  set  of  unit  quaternions,  x  Q : |x| = 1,  is  a  Lie  subgroup.   As  a  manifold,  this
set  is  just  the  sphere  S
3
.
The  following  proposition  provides  a  good  example  of  the  exibility  of  the  denition  of
the  derivative.
Proposition  1.18.   Let  e  denote  the  neutral   element  of  the  Lie  group  G.   Then
(a)  the  derivative  d
e
p : T
e
GT
e
G    T
e
G  of  the  group  multiplication  is  just  addition:
d
e
p( x
1
,  x
2
) =   x
1
 +  x
2
.
(b)  The  derivative  d
e
i : T
e
G    T
e
G  of  the  inversion  map  i  is  just  multiplication  by  1:
d
e
i( x) =  x.
12
Proof   (i) We will use part (i) of the denition of the derivative, but we have to apply it with
a little ingenuity.   To calculate d
e
p( x
1
,  x
2
), we might look for a curve   = (
1
, 
2
) :   R    GG
such  that  (0) = (e, e)  and  (
1
(0), 
2
(0)) = ( x
1
,  x
2
).   Then  by  denition  of  the  derivative,
d
e
p( x
1
,  x
2
) = (p  )
(0).
Now  p       is   just   the   product   of   the   two  components   
1
  and  
2
  of   .   If   we   had  some
information  about  the  value  of  this  product,   we  might  be  able  to  use  it  to  nd  (p  )
(0).
Unfortunately,   without  detailed  information  on  the  particular  Lie  group  G,   we  cannot  in
general   say  anything  useful   about   
1
2
.   But   there  are  cases   where   the  fact   that   G  is   a
group  is   enough.   If,   for   example,    x
1
  =  0  then  we  can  choose  
1
(t)   =  e  for   all   t.   Then
p(
1
(t), 
2
(t)) = 
2
(t)  for  all  t,  so  d
e
p(
1
(0), 
2
(0)) = 
2
(t).   In  other  words,
d
e
p(0,  x
2
) =   x
2
.
Similarly,
d
e
p( x
1
, 0) =   x
1
.
Now  it  is  clear  how  to  calculate  d
e
p( x
1
,  x
2
):
since  the  derivative  is  linear,  we  know  that
d
e
p( x
1
,  x
2
) = d
e
p( x
1
, 0) +d
e
p(0,  x
2
),
and  by  what  has  just  been  said,
d
e
p( x
1
, 0) +d
e
p(0,  x
2
) =   x
1
 +  x
2
.
(ii)  To  calculate  d
e
i( x))  we  use  the  result  we  have  just  proved.   Suppose  that    is  a  curve  in
G  with  (0) = e  and  
(0),
(0)
_
and  by  (i)  this  is  equal  to
(i  )
(0) +
(0),
i.e.   to
d
e
i( x) +  x.
It  follows  that  d
e
i( x) =  x.   2
Theorem  1.19.   (Inverse  Function  Theorem  for  Manifolds).   Suppose  that   f  :   M
m
  N
m
is  a  smooth  map  between  manifolds,   and  suppose  also  that  d
x
f  :  T
x
M    T
f(x)
N  is  an  iso-
morphism.   Then  there  are  neighbourhoods  W
1
  of   x  in  M  and  W
2
  of   f(x)  in  N  such  that
f(W
1
) = W
2
  and  f  : W
1
    W
2
  is  a  dieomorphism.
13
Proof   We  draw  the  standard  diagram
2
U
U
M
N f
f
o o
1
h := 
V
V
1
2
1
2
in which 
1
  and 
2
  are charts on M  and N  around x and f(x) respectively.   Write a = 
1
(x).
Typical Step  1:   use  charts  to  turn  the  hypothesis  on  the  smooth  (new) map  into  a  hypothesis
on  a  smooth  (old)  map
Commutativity  of  this  diagram  implies  commutativity  of  the  diagram
T
x
M
  dxf
   T
f(x)
N
d
x
1
     d
f(x)
2
R
m
  dah
   R
m
by  the chain rule 1.16.   The vertical  maps here  are isomorphisms,  and therefore  because  d
x
f
is  an  isomorphism,  so  is  d
a
h.
Typical  Step  2:   Apply  an  old  theorem
By  the  inverse  function  theorem  (old),   there  are  neighbourhoods  Z
1
  of   a  in  R
m
and  Z
2
  of
h(a)  in  R
m
,  such  that  h : Z
1
    Z
2
  is  a  dieomorphism.   Call  its  inverse  g.
Typical  Step  3:   Use  charts  to  turn  the  conclusion  about  the  smooth  (old)  map  into  a  conclu-
sion  about  the  smooth  (new)  map.
Let   W
1
  =  
1
1
  (Z
1
), W
2
  =  
1
2
  (Z
2
).   Then  f(W
1
)   =  W
2
  and  f   :   W
1
    W
2
  has   inverse
1
1
   g  
2
,  and  so  is  a  dieomorphism.   2
Remark  1.20.   The   inverse   function  theorem  can  be  viewed  as   saying  that   if   d
x
f   is   an
isomorphism  then  with  respect   to  suitable  coordinates   on  M  around  x  and  on  N  around
y  = f(x),  f  is  nothing  but  the  identity  map.   To  see  why,  consider  the  diagram  in  the  proof
of  1.19.   If  we  add  to  it  the  extra  maps  indicated  with  dashed  lines,
14
h
U
2
N
U
1
M
V
V
2
identity map
2
V
2
V
2
f
f
o o
1
1
h := 
and  then  rub  out  some  of  the  maps  and  spaces,  we  are  left  with
h
~
=
U
2
N
U
1
M
V
2
id
e
n
tity
 m
a
p
2
f
V
2
The  expression  of   f   with  respect  to  the  new  coordinates  on  M  provided  by  the  chart
1
  = h  
1
,  and  the  old  coordinates  on  N,  is  simply  the  identity.
1.2   Submersions  and  Immersions
We  begin  with  a  geometrical  example.
15
Example   1.21.   Consider   a  curved  and  twisted  piece  of   wire.   Suppose  it   does   not   have
any  kinks,   and  does  not  touch  itself.   If   we  neglect   its  thickness,   we  can  think  of   it  as  a
one-dimensional  manifold  contained  in  R
3
.
By looking at the wire from dierent positions, we see a number of qualitatively dierent
local   pictures,   some  of   which  are  shown  below.   The  local   pictures  on  the  rst  row  can
be  seen  from  all   points  in  space,   or  at  least  from  the  points  of   an  open  set;   those  on  the
second  row  can  only  be  seen  from  the  points  of   certain  surfaces.   Other  more  complicated
local  pictures  may  be  seen  from  certain  curves,  or  even  from  isolated  points.
All of the local pictures shown  are easy  to understand except  possibly  the last, the cusp.
How can a smooth curve appear to acquire a sharp point when viewed from certain positions?
And  from  which  positions  do  we  see  such  a  view?
If   you  have  a  piece  of   wire  to  hand,   even  a  loosely  coiled  spring,   it  is  a  good  idea  to
experiment  at  this  point,  turning  it  over  and  examining  it  from  dierent  points  of  view.
It is not hard to nd, by experiment, that we see a cusp by looking at the curve along one
of its tangent lines.   The set of all the tangent lines together make up the tangent developable
surface  of  the  curve,  and  this  is  the  set  of  viewpoints  from  which  the  cusp  may  be  seen.   To
explain  this,  we  simplify  the  geometry  of  vision  as  shown  in  the  following  diagram.
16
Retina
Pupil
Object
Image
If  the  eye  is  situated  at  the  origin  of  coordinates  and  the  retina  is  the  plane  y  = h,
then  the  point  on  the  object  with  coordinates  (x, y, z)  has  image
h
y
(x, y, z).
Taking  coordinates  (x, z)  on  the  retinal   plane,   our  simplied  version  of   vision  is  thus  the
map
V  : (x, y, z) h(x/y, z/y),
which  is  well  dened,  and  smooth,  on  R
3
 y = 0.   Its  derivative  d
(x,y,z)
V   has  matrix
h
_
  1/y   x/y
2
0
0   z/y
2
1/y
_
Now suppose that   : (a, b)    R
3
parametrises the curve C.   The image of C is parametrised
by  V   .   Its  derivative
(V   )
(t) = d
(t)
V (
(t))
= h
_
1
(t)
2
(t)
  
  
1
(t)
2
(t)
2
(t)
2
  ,
 
3
(t)
2
(t)
  
  
3
(t)
2
(t)
2
(t)
2
_
can  be  expressed  as
h
2
(t)
2
_
1
(t)   
2
(t)
1
(t)   
2
(t)
3
(t)   
2
(t)
3
(t)   
2
(t)
_
.
Since we are assuming that 
2
(t) ,= 0, the vanishing of the two determinants in this expression
is  equivalent  to  (t)  and  
2
V
2
2
1
U
U
M
N f
f
o o
1
h := 
1
2
1
V
Since  h = H  standard  immersion,  we  let  W
2
  = 
1
2
  (Z
2
)  in  N  and  take  as  new  chart  on  N
the  map
  
2
  =  H
1
 
2
  :  W
2
    Z
1
.   It  follows  that
  
2
 f   
1
1
  =  standard  immersion,   as
required.
W
2
s
t
a
n
d
a
r
d
 
i
m
m
e
r
s
i
o
n
V  x L
1
1
Z
H
1
=
~
1
2
U
M
N f
1
1
V
20
2
An  analogous  result  is  true  for  submersions.
Theorem  1.27.   Local   normal   form  for  a  submersion.   If   f   :   M    N  is  a  submersion  at
x    M  then  there  are  charts  
1
  on  M  around  x  and  
2
  on  N  around  y  =  f(x)  such  that
2
 f   
1
1
  is  the  standard  submersion.
Proof   Begin  with  any  two  charts  
1
  :   U
1
    V
1
  and  
2
  :   U
2
    V
2
  on  M  and  N  around
x  and  y  respectively.   Write  a  =  
1
(x), b  =  
2
(y).   Because  f   is  a  submersion  at  x,   so  is
h := 
2
 f   
1
1
  (this  is  Typical  Step  1).
Typical Step  2:   Let K   R
m
be the kernel of d
a
h, and let  : V
1
    K  be a linear projection.
As  d
a
h  is  surjective,  dimK  = mn.   Dene  H  : V
1
    V
2
L  by
H(x) = (h(x), (x)).
Then  d
a
H  is  injective,   and  hence  an  isomorphism:   for  if  d
a
H(v)  =  0  then  d
a
h(v)  =  0  and
d
a
(v) = 0, so in particular v  K; but this means d
a
(v) = v, since   is a linear projection.
Hence  v = 0.   It  follows  from  the  inverse  function  theorem  (old)  that  H  is  a  dieomorphism
from  some  neighbourhood  Z
1
  of  a  in  V
1
  to  some  neighbourhood  Z
2
  of  H(a)  in  V
2
K.
We  note  that  h  =  standard  submersion   H.   So  take  as  new  chart  on  M  around  x  the
map
  
1
  := H  
1
  : 
1
1
  (Z
1
)    Z
2
.
1
~
1
~ Z
1
Z
2
H
s
t
a
n
d
a
r
d
 s
u
b
m
e
r
s
i
o
n
1
= H 
W
2
1
U
U
M
N f
f
o o
1
h := 
V
1
2
2
1
V
21
Thus,
2
 f  
  
1
1
  = standard  submersion
by  the  commutativity  of  all  these  diagrams  of  mappings.   2
We  will  explore  the  consequences  of  1.26  and  1.27  in  the  rest  of  this  chapter.
Denition  1.28.   Let  f  :   M    N  be  a  smooth  map  of  manifolds.   The  point  x    M  is  a
regular  point  of  f  if  d
x
f  is  surjective,   and  a  critical   point  otherwise.   The  point  y    N  is  a
regular  value  if  every  preimage  x  f
1
(y)  is  a  regular  point,  and  a  critical   value  otherwise.
Thus,   y  is  a  critical   value  if  and  only  if  it  is  the  image  of  a  critical   point.   In  particular,   if
f
1
(y) =   then  y  is  a  regular  value.
Corollary  1.29.   (of  1.27)  If  f  :  M
m
  N
n
is  a  smooth  map  of  manifolds  and  y    N  is  a
regular  value  with  f
1
(y)  ,=    then  f
1
(y)  is  a  manifold  of   dimension  m n,   and  for  any
x  f
1
(y),  T
x
f
1
(y) = ker  d
x
f.
Proof   Let  x  f
1
(y).   Then  f  is  a  submersion  at  x.   Coose  charts  as  in  1.27.   Denote  the
standard  submersion  R
m
  R
n
by  s.   Evidently  s
1
(b)  V
1
  =  (b  R
mn
)  V
1
,   and  this
is  dieomorphic  to  an  open  set  in  R
mn
(just  project  to  lose  the  b).   Moreover  
1
  denes  a
dieomorphism f
1
(y) U
1
 s
1
(b) V
1
.   It follows that x has a neighbourhood f
1
(y) U
1
in  f
1
(y)   which  is   dieomorphic   to  an  open  set   in   R
mn
.   Thus   f
1
(y)   is   a  manifold  of
dimension  mn.
s = standard submersion
N
b
x
a
y
f   (y)
s  (b)
1
1
1
U
M
f
V
V
1
1
2
U
2
For the statement about the tangent space, let   be a smooth curve in f
1
(y) with (0) = x.
Then  f     is  constant,  so  d
x
f(
(0)) = (f )
(0) = 0, and
(0)  ker  d
x
f.   Thus
T
x
f
1
(y)  ker  d
x
f.
22
Both  ker  d
x
f  and  T
x
f
1
(y)  have  dimension  mn,  so  they  must  be  equal.   2
Example  1.30.  (i) Consider the function f  :   R
3
  R, f(x
1
, x
2
, x
3
) = x
2
1
+x
2
2
x
2
3
.   The only
critical  point of f  is  (0, 0, 0), so  the  only critical  value  is  0.   If t < 0  or t > 0  then  f
1
(t) is  a
smooth  manifold  of  dimension  2,  by  Theorem  1.29.   But  f
1
(0) is  not a  smooth manifold  at
the  critical  point  of  f.
t=0 t<0 t>0
If  we think of t as time and view  this sequence  of pictures with time in reverse,  it resem-
bles a drip of water in the moments  before and after it separates from the water on the edge
of  the  tap  and  falls.
(ii) The matrix group O(n) is the set of nn real matrices A satisfying A
t
A = I
n
.   Thus, O(n)
is the preimage of the point I
n
  under the map Mat
nn
(R)    Mat
nn
(R), where Mat
nn
(R) is
the space of real nn matrices.   Thinking of Mat
nn
(R) as the same as  R
n
2
, and thus a man-
ifold, we can ask whether I
n
  is a regular value of f.   The answer  is clearly no, since if it were
a  regular  value  then  its  preimage  O(n)  would  have  dimension  equal   to  dimension(source)-
dimension(target)= 0.   As O(n) plainly is not just a collection of isolated points, this cannot
be  the  case.   In  fact,   however,   for  every  A  the  matrix  A
t
A  is  symmetric,   and  so  we  can  re-
designate f  as a map Mat
nn
(R)    Sym
n
(R), where Sym
n
(R) is the space of nn symmetric
real matrices.   Now I
n
  is a regular value.   To see  this  we  rst compute  the derivative  d
A
f, or
rather, its value on a matrix B   T
A
Mat
nn
(R) (note that since Mat
nn
(R) is a vector space,
its  tangent  space  at  any  point  is  canonically  identied  with  Mat
nn
(R)  itself).   We  have
d
A
f(B) =   lim
h    0
f(A+ hB) f(A)
h
  =   lim
h    0
(A+ hB)
t
(A + hB) A
t
A
h
=   lim
h    0
h(A
t
B + B
t
A) + h
2
B
t
B
h
  = A
t
B + B
t
A.
We  have  to  show  that  for  each  A    O(n)  and  each  matrix  S    T
In
Sym
n
(R),   there  exists
a  matrix  B  such  that   d
A
f(B)   =  S.   Sym
n
(R)   is   once  again  a  vector   space,   so  S  here  is
any  symmetric  matrix.   Solutions  to  this  equation  will   not  be  unique  -  in  fact  we  expect  a
1/2n(n  1)  dimensional   ane  space  of  solutions.   But  here  is  one:   let  B  =  1/2AS.   Then
A
t
B  =  1/2A
t
AS  =  1/2S,   and  B
t
A  =  1/2S
t
A
t
A  =  1/2S
t
=  1/2S.   Hence  A
t
B + B
t
A  =  S
23
as  required.   This  shows  that  I
n
  is  a  regular  value  of  f,  and  thus  that  O(n)  is  a  manifold  of
dimension  equal  to  n
2
dimSym
n
(R) = n(n 1)/2.
The  group O(n) is  contained  in the  group  Gl(n, R) of  invertible  n n matrices,  which  is
an open set in the Euclidean  space Mat
nn
(R).   The operations of matrix multiplication  and
matrix inversion  are smooth maps Gl(n, R) Gl(n, R)    Gl(n, R) and Gl(n, R)    Gl(n, R).
Therefore  they  are  smooth  (new)  on  O(n),  and  O(n)  is  a  Lie  group.
2
Corollary  1.31.   of  1.26  If   M
m
  N
n
are  manifolds  then  for  each  point  x    M  there  is  a
neighbourhood  U  of  x  in  N  and  a  smooth  map  g  : U    R
nm
such  that  0  is  a  regular  value
of  g  and  M   U  = g
1
(0).
Proof   Apply  1.26  to  the  inclusion  f  :   M    N  to  get  charts  
1
  :   U
1
    V
1
    R
m
on  M
around x and 
2
  : U
2
    V
2
  R
n
on N  around f(x) = x, so  that 
2
 f  
1
1
  is the standard
immersion  i.   Now  i(V
1
)  V
2
 (R
m
0),  but  this  inclusion  is  not  necessarily  an  equality.
On  the  other  hand,  because  V
1
  is  open  in  R
m
,  i(V
1
) is  open in  R
m
0,  and  so  there  exists
an  open  set  V
2
  in  R
n
,  contained  in  V
2
,  such  that  i(V
1
) = V
2
  (R
nm
0).
  1
V
2
V
2
i(V  )
1
R  x {0}
m
In V
2
, i(V
1
) is the set of point where x
m+1
, . . ., x
n
  all vanish.   So take U  = 
1
2
  (V
  
2
), and take,
as  g  : U    R
nm
,  the  map  (x
m+1
 
2
, . . ., x
n
 
2
).   The  maps
(x
1
, . . ., x
n
) (x
m+1
, . . ., x
n
)
and
y  U  
2
(y)
are  both  submersions,  and  hence  so  is  their  composite,  g.   2
1
Editorial comment:   Actually this is the trivial statement that the subspace topology on R
m
0  R
n
coincides  with  the  topology  that  R
m
0  inherits  from  its  identication  with  R
m
.
24
The  components  of  a  map  g  :   U    R
nm
such  that  M   U  =  g
1
(0)  are  equations  for
M  in  U;   if   in  addition  0  is  a  regular  value  of   g,   then  g  is  a  set  of   regular  local   equations
for  M  in  U.   So  1.31  establishes  that  every  submanifold  M   N  has  regular  local  equations
everywhere.
Remark  1.32.   Let M
m
 N
n
.   The number of regular equations needed to dene M  locally
in  N  is  n m.   If  n m > 1,  it  is  possible  to  dene  M  with  fewer  equations  -  for  example,
if   g
1
, . . ., g
nm
  are  equations  dening  M  in  the  open  set  U    N,   then  the  single  equation
G := g
2
1
 +   + g
2
nm
  also  denes  M  in  U.   But  is  not  regular:   0  is  not  a  regular  value  of  G.
This  is an immediate  consequence  of 1.29:   if  0 were  a regular value then  G
1
(0) would  have
dimension  n 1, instead  of m.   It is instructive  to  see,  by  applying  the chain  rule to d
x
G for
x  M   U,  why  no  such  point  x  can  be  a  regular  point  of  G.
1.3   Finding  Regular  Equations
How  does  one  nd  regular  equation?   It  can  be  very  hard.   Here  I  give  some  relatively  easy
examples  and  exercises.   Mostly,   nding  equations  is  a  question  of  paying  careful   attention
to  the  description  of  the  submanifold  M  for  which  one  wants  to  nd  equations.
Example  1.33.   (1)  Let  (0, 0) ,= (a, b)  R
2
,  and  let
M  = (u, v)  R
2
: (x, y)   is  orthogonal  to   (a, b).
It  is  easy  to  nd  an  equation  for  M:
(x, y)  M      (a, b)  (x, y) = 0      ax + by  = 0.
The  map  g  :   R
2
  R  dened  by  g(x, y)  =  ax + by  is  a  regular  equation  for  M  in  all   of   R
2
,
because  the  matrix  [d
(x,y)
g]  is  equal   to  [a  b]   (as  g  is  linear,   it  is  equal   to  its  own  derivative
at  every  point),   and  [a  b]   denes  a  surjective  linear  map  (remember  that  we  are  assuming
(a, b) ,= (0, 0).)
(2)  Now  let  M  = (u, v)  R
2
: (u, v)   is  parallel  to   (x, y). How  to  nd  equation(s)  for M?
First approach The vector (b, a) is orthogonal to (a, b), so (x, y) is parallel to (a, b) if it is or-
thogonal to (b, a).   Thus, M  = (x, y) : bx+ay  = 0 and we can take g(x, y) = bx+ay.
Similar  reasoning  to  (i)  shows  g  is  a  regular  equation  for  M  in  all  of   R
2
.
Second approach Two vectors in  R
2
are parallel if they are linearly dependent.   Linear depen-
dence  of two  vectors  in  R
2
(and in general of n vectors in  R
n
) is detected  by the vanishing  of
a  determinant:
M  = (x, y) : det
_
  a   b
x   y
_
= 0.
As  the  determinant  here  is  equal   to  bx + ay,   we  have  arrived  at  the  same  conclusion  as
with  the  rst  approach.   Perhaps  this  approach  is  better.   Its  easier  to  see  how  to  generalise
25
it  to  higher  dimensions.
(3)  Let
M  = (a, b, x, y)  R
4
: (a, b) ,= (0, 0),   (x, y)   is  parallel  to   (a, b).
Is M  a manifold?  The condition for membership is the same as before:   that the determinant
should  be  zero.   But  now  we  are  treating  a  and  b  as  variables  as  well.   So  if  we  take
g(a, b, x, y) = det
_
  a   b
x   y
_
as before, is g  a regular equation for M?  Notice that in the denition of M  we stipulate that
(a, b) ,= (0, 0), so we only need to think of the behaviour of g  in U  := (R
2
(, 0)) R
2
.   We
have
[d
(a,b,x,y)
g] = [y, x, b, a]
and this is a surjective linear map for every (a, b, x, y)  U.   In fact we see that it is surjective
if  (a, b, x, y) ,= (0, 0, 0, 0), so  g  denes  a  manifold  in  R
4
0.
(4)   In  all   of   the   examples   weve   looked  at   so  far,   one   equation  was   enough.   If   M  has
codimension  greater  than  1  then  more  equations  will  be  needed.   Let  0 ,= (a, b, c)  R
3
,  and
let
M  = (x, y, z)  R
3
: (x, y, z)   is  parallel  to   (a, b, c).
Clearly M  is a line in 3-space, so will need two equations in the neighbourhood of every point.
But  which  two  equations?   The  condition  that  the  two  vectors  be  parallel   is  equivalent  to
the  the  matrix
  _
  a   b   c
x   y   z
_
having  rank  1.   This  means  that  all  of  its  2  2  minors  must  vanish.   But  it  has  three  2  2
minors.   Which two should we choose?  The answer is that it depends on the vector (a, b, c).   If
(a, b, c) = (1, 2, 3) then the three minors, in the order det(col 2, col 3), det(col 1, col 3), det(col 1, col 2),
are
2z 3y, z 3x, y 2x
and  in  fact  any  two  of  these  will  do,  as  if  both  are  zero  then  so  is  the  third.   For  example
y 2x = 
1
3
(2z 3y) +
  2
3
(z 3x).
Exercise  1.34.   (1)  Assume  (a, b, c) ,= 0  and  let
M  = (x, y, z)  R
3
: (x, y, z)   is  parallel  to   (a, b, c).
Show that if a ,= 0 then det(col 2, col 3) can be written as a linear combination of det(col 1, col 2)
and  det(col   1, col   3),   and  deduce  that   M  has,   as   regular   equations,   det(col   1, col   3)   and
det(col  1, col  2).   Show  that  if  a = 0  then  these  are  no  longer  regular  equations  for  M.   Find
26
regular  equations  if  b ,= 0,  and  if  c ,= 0.
(2)  Let
M  = (a, b, c, x, y, z)  R
6
: rank
_
  a   b   c
x   y   z
_
= 1.
Find  regular  equations  for  M.   Note  that
_
  0   0   0
0   0   0
_
,= M,
so  we  only  need  be  concerned  with  equations  in  R
6
 0.   Note  also  that  by  (i),   there  is
no  set  of  regular  equations  which  is  good  for  all  of   R
6
 0.   Instead,   you  will  have  to  nd
dierent  sets  of  regular  equations  for  dierent  open  subsets  of   R
6
0.
Example  1.35.   (1)  Let  (t)  =  (cos  t, sin  t, t).   The  image  of    is  a  helix.   Its  easy  to  nd
regular  equations  for  it:   since  
3
(t) = t,  we  can  take  the  equations
x
1
cos  x
3
,   x
2
sin  x
3
.   (1.1)
(2)  Now  let  (t)  =  (cos  t, sin  t, t
3
)  The  image  of    (sketch  it!)   is  once  again  a  manifold;   in
the next subsection we will give conditions on immersions to guarantee that their images are
manifolds.   But  now  nding  regular  equations  is  not  so  easy.   Near  any  point  on  the  curve
where  x
3
 ,= 0,  we  can  recover  the  parameter  t  as  (x
3
)
1
3
,  and  then  take,  as  equations
x
1
cos (x
1
3
3
),   x
2
sin (x
1
3
3
).   (1.2)
These  are  smooth  functions  on  R
3
 x
3
  =  0,   and  give  regular  equations  there  (Exercise
0.4(1).   But  what  about  points  where  x
3
  =  0?   The  function  (x
3
)
1
3
  is  not  smooth  at  x
3
  =  0.
There  is   such  a  point   on  the  curve,   namely  (0)   =  (1, 0, 0).   Fortunately  when  t   is   near
0  we  can  recover   it  from  sin  t,   using  the  arcsin  function,   which  denes   a  dieomorphism
(1, 1)    (/2, /2).   Thus  in  the  neighbourhood  of  (1, 0, 0) we  can  use  the  equations
x
1
cos(arcsin  x
2
),   x
3
(arcsin  x
2
)
3
.   (1.3)
Exercise  1.36.   (1)  Show  that  the  equations  (1.1),  (1.2)  and  (1.3)  are  all  regular  equations
(for  the  manifolds  in  question)  where  they  claim  to  be.
(2) Is the image of the parametrisation (t) = (cos  t, sin  t, t
2
) a manifold?  If it is, nd regular
equations  for  it.
(3) Find regular equations for the image of the map R    R
3
given by f(t) = (cos  t, sin  t, cos  t sin  t).
(4)  Find  equations  for  the  image  of  the  map  R    R
3
given  by  g(t) = (cosh  t, sinh  t, e
t
).
27
(5)  Find  regular  equations  for  the  set
(a, b, c, d, w, x, y, z)  R
8
0 : rank
_
  a   b   c   d
w   x   y   z
_
= 1.
(6)  For  each  xed  k  p  n,  the  set
A  Mat
pn
(R) : rank  A = k
is  a  manifold  of   codimension  (p  k)(n  k).   Hints  for  a  proof   can  be  found  in  Exercises
I   number   18.   Exercises   1.34(2)   and  1.36(4)   above  give  alternative  proofs  for   the  case  of
matrices of rank 1 in the space of 23 matrices, and matrices of rank 1 in the space of 24
matrices.
To  do:   Find  regular  equations  for  this  manifold  when  p = 3, n = 3, k = 1.
1.4   Images  of  Immersions
The  proof  of  theorem  on  the  preimage  of  a  regular  value  (1.29) amounts  to  little  more  than
the  following  two  facts:   rst,   the  obvious   fact  that   the  preimage  of   any  point   under   the
standard  submersion  R
n+k
  R
n
is  a  manifold  of   dimension  k  (in  fact  an  ane  subspace
of   R
n+k
),   and  second,   the  local   normal   form  for  submersions  1.27,   that  that  a  submersion
locally  looks  just  like  the  standard  submersion.
Since dual versions  of these  two statements  hold for immersions   that the image of the
standard  immersion  R
m
  R
m+k
is  a  manifold,  and  the  local  normal  form  for an  immersion
1.26    one  might  imagine  that  a  dual  version  of  1.29  would  hold,  and  that the  image  of  an
immersion  is  a  manifold.   But  life  is  more  interesting  than  that:
Example  1.37.   There is an immersion  twisting  a circle into a gure 8 as shown (which  one
can  easily  describe  in  co-ordinates    Exercise).
f
So the image of an immersion is not necessarily  a manifold.   Perhaps we have to ask that
the  immersion  be  1-1  in  order  to  guarantee  that  the  image  is  a  manifold?   In  fact  this  is
neither  sucient,  as  the  next  diagram  shows,  nor  necessary,  as  we  shall  see  later.
28
1
1 1
1
f
1
1 1
1 1/2
1/2 1/2
1/2
Open interval (1,1)
11
immersion
figure 8
Here the injective immersion f  twists the open interval (1, 1) into a gure 8.   If the interval
were  the  closed  interval  [1, 1]  then  the  points  1, 0  and  1  would  all  have  the  same  image.
But  1  and  1  are  not  there,  and  so  f  is  injective.   2
It  turns  out  that  the  problem  with  the  map  in  this  example  is  that  it  is  not  open  onto
its  image,  as  you  can  see  from  the  lower  drawing    the  image  of  (1/2, 1/2)  is  not  open  in
f(1, 1).
Denition  1.38.   Let  f  : X    Y   be  a  map  of  topological  spaces.
(i)  f  is  open  (or  an  open  map)  if  for  every  subset  U  open  in  X,  f(U)  is  open  in  Y .
(ii)  f  is  open  onto  its  image  if  for  every  subset  U  open  in  X,  f(U)  is  open  in  f(X).
Proposition  1.39.   (i)  If  f  : M    N  is  a  submersion  (with  M, N  manifolds),  then  it  is  an
open  map.
(ii)  If  M
m
 N
m
is  an  inclusion  of  manifolds of  the  same  dimension, then  M  is  open  in  N.
Proof   Exercise.   For   (i),   use   the   local   normal   form  for   a  submersion    the   standard
submersion  is  certainly  open.   For  (ii),  apply  (i)  to  the  inclusion  M  N.   2
Theorem  1.40.   Suppose   that   f   :   M
m
  N
n
is   an  immersion  which   is   open  onto   its
image   f(M).   Then  f(M)   is   a   manifold  of   dimension  m,   and  for   each  x     M,   d
x
f   :
T
x
M    T
f(x)
f(M)  is  an  isomorphism.
29
Proof   Let  y   f(M).   We  have  to  nd  a  neighbourhood  W  of  y  in  f(M)  which  is  dieo-
morphic  to  some  open  set  in  R
m
.   Choose  x    f
1
(y),   and  choose  charts  
1
  :   U
1
    V
1
  on
M  around  x,  and  
2
  : U
2
    V
2
  on  N  around  y,  as  in  theorem  on  the  local  normal  form  for
an  immersion,   1.26.   Then  
2
 f   
1
1
  is  the  standard  immersion  i.   The  image  of  i  in  V
2
  is
certainly  a  manifold:   it  is  dieomorphic  to  the  open  set  V
1
  in  R
m
,   since  i  :   V     i(V )  has
a  smooth  inverse,   namely  the  projection  which  forgets  the  last  n m  co-ordinates.   And  
2
restricts  to  a  dieomorphism  f(U
1
)    i(V
1
).   So  what  more  do  we  need?  We  need  f(U
1
)  to
be  a  neighbourhood  of y  in  f(M).   But  this  is  guaranteed  by  the  hypothesis  that  f  be  open
onto  its  image.
The  last  statement  holds  because  d
x
f  :  T
x
M    T
f(x)
f(M)  is  a  linear  injection,   and  its
source  and  target  have  the  same  dimension.   2
Example  1.41.   (i)  Consider  the  map  S
1
  R
2
given,   with  respect  to  complex  co-ordinate
z,  by  z z
n
(n  Z  0.   Its  an easy  matter to check  that f  is  an immersion.   In Example
1.15  we  do  this  rather  painstakingly  for  the  case  n  =  2;   using  the  holomorphic  derivative
f
(z) = nz
n1
gives  a  much  quicker  way.   Since  the  image  of  f  is  S
1
,  that  f  is  open  onto  its
image  follows  from  Proposition  1.39  above.
(ii)  More  interesting  example:   consider  the  map  S
2
  R
6
given  by  all  the  quadratic  mono-
mials:
f(x
1
, x
2
, x
3
) = (x
2
1
, x
2
2
, x
2
3
, x
1
x
2
, x
1
x
3
, x
2
x
3
).
Since  this  map  identies  antipodal  points  (i.e.   f(x)  =  f(x)),   it  passes  to  the  quotient  to
dene  a  continuous  map
f  : S
2
/     f(S
2
).
where  S
2
/    is  the  quotient  of  S
2
by  the  equivalence  relation  x    x,   with  the  standard
quotient  topology.   Since  f  only  identies  antipodal  points,
  
f  is  a  1-1  continuous  map  onto
f(S
2
).   As  S
2
/   is  the  continuous  image  of  a  compact  space,  it  is  compact.   It  follows  that
f
(x) = (x, f(x)).   Its  image  is  the  graph  of  f.   
f
  has  smooth  inverse  (x, y)  x,  and  so  is
an  embedding.   Its  derivative  at  x  is  the  map
 x ( x, d
x
f( x)),
so  by  Theorem  1.40,
T
(x,f(x))
graph(f) = ( x, d
x
f( x)) :   x  T
x
M.
2
Our  last  result  in  this  direction  is  an  odd-looking  technical   result  whose  purpose  is  not
readily  apparent  yet:
Theorem  1.50.   Suppose  f  : M
m
  N
m
is  a  smooth  map  of  manifolds, and  that  X   M  is
a  submanifold  (possibly  of  lower  dimension  than  M).   If
1.   f  is  1-1  on  X,  and
2.   d
x
f  : T
x
M    T
f(x)
N  is  an  isomorphism  for  all   x  X,
then  there  is  a  neighbourhood  U  of  Xin  M  such  that  f  : U    f(U)  is  a  dieomorphism.
Proof   I  give  the  proof   only  for  X  compact,   where  it  is  substantially  easier  than  in  the
general  case.
By  hypothesis  2.   and  the inverse  function  theorem,  f  maps  some neighbourhood U
x
  of  each
point  x    X  dieomorphically  to  a  neighbourhood  V
f(x)
  of  f(x)  in  N.   The  union  U
1
  of  all
of  the  U
x
  is  a  neighbourhood  of  X,  and  at  each  point  x  U
1
,  d
x
f  is  an  isomorphism.
I   claim  that   there   is   another   neighbourhood  U
2
  of   X  in  M,   on  which  f   is   1-1.   For
suppose  that  there  is  not.   Then  in  every  neighbourhood  of  X  there  exist  points  a ,= b  such
that  f(a)  =  f(b).   In  particular,   in  U
(n)
:=  
xX
B(x, 1/n)  there  are  distinct  points  a
n
, b
n
32
such  that  f(a
n
)  =  f(b
n
).   Now  the  set  U
1
has  compact  closure  (in  the  ambient  Euclidean
space  of  M),  simply  because  X,  being  compact,  is  bounded.   Hence  so  does U
(1)
U
(1)
,  and
so  the  sequence  (a
n
, b
n
),  which  is  contained  in  U
(1)
U
(1)
,  has  a  subsequence  converging  to
some  point  (a, b)  in  the  closure  of  U
(1)
U
(1)
.   By  replacing  (a
n
, b
n
)  by  this  subsequence  we
can  suppose  that  a
n
    a, b
n
    b  as  n   .
So  far  we  are  only  guaranteed  that  a  and  b  lie  in  the  closure  of   U
(1)
.   But  in  fact  they
must  lie  in  X;   for  suppose,   for  example,   that  a  /   X.   As  X  is  compact,   there  exists    >  0
such  that B(a, ) X  = .   Then  for  n > 2/, |a
n
a| > /2, making  it  impossible  for (a
n
)
to  converge  to  a.
Now  f(a)   =  f(b),   by  continuity.   So  as   f   is   1-1  on  X,   a   =  b.   But   because   f   is   a
dieomorphism  on  some  neighbourhood  U
a
  of  a = b  in  M,  f  is  1-1  on  U
a
.   This  contradicts
the  assumption  that  a
n
  ,=  b
n
  and  f(a
n
) =  f(b
n
)  for  all  n,  since  all  but  nitely  many  of  the
a
n
  and  b
n
  are  in  U
a
.
We   conclude   that   f   must   indeed  be   1-1  on  some   neighbourhood  U
2
  of   X.   Now  let
U  =  U
1
 U
2
.   Then  f  : U    N  is  a  1-1,  open  immersion  (for  openness  use  1.39(i)).   Hence
f  : U    f(U)  is  a  dieomorphism.   2
1.5   Transversality
Given  f   :   M    N  and  Z    N,   when  is  f
1
(Z)  a  manifold?   If   Z  is  just  a  point  z
0
,   the
answer is given by 1.29:   when z
0
  is a regular value of f.   Building on this result, we now give
a  condition  for  f
1
(Z)  to  be  a  manifold.
Denition  1.51.   (i)  If  M  and  Z  are  both  submanifolds  of  N,  we  say  that  M  is  transverse
to  Z  at  x  M   Z,  written  M    Z  at  x,  if
T
x
M  + T
x
Z  = T
x
N,
and  M  is  transverse  to  Z  if  this  holds  for  all  x  M   Z.
(ii)  If   Z    N  and  now  f   is  a  map  from  M    N,   we  say  f   is  transverse  to  Z  at   x    M
(written  f    Z  at  x)  if
d
x
f (T
x
M) + T
f(x)
Z  = T
f(x)
N.   (1.4)
f   is   transverse   to  Z  if   it   is   transverse   to  Z  at   every  x    f
1
(Z).   (So  in  particular,   if
f
1
(Z) = ,  f  is  transverse  to  Z.)
In  fact  (i)  is  the  special  case  of  (ii)  in  which  f  : M    N  is  the  inclusion.
It  is  important  to  note  that  the  relation  of  transversality  of  two  submanifolds  of  N  involves
three  manifolds:   the  two  which  are  transverse  to  one  another,   or  not    M  and  Z  in  the
denition  just given   and the ambient  manifold N.   For example,  if M  is the x-axis and Z
33
is the y-axis then M  and N  meet transversely in N  =  R
2
, for at their point O of intersection,
we  have
T
O
M  + T
O
Z  = Sp(1, 0) + Sp(0, 1) =  R
2
= T
O
R
2
.
However,   if  we  now  think  of   R
2
,   and  therefore  also  M  and  Z,   as  contained  in  R
3
,   then  M
and  Z  are  no  longer  transverse.   Indeed,  no  two  1-manifolds  meeting  at  a  point  x  in  R
3
can
be  transverse  in  R
3
  T
x
M  + T
x
Z  has  dimension  at  most  2,   and  thus  cannot  be  equal   to
T
x
R
3
=  R
3
.   For  this  reason,  the  notation  M    Z  should  ideally  be  replaced  by  something
like  (M    Z)
N
  to  emphasize  the  role  of  the  ambient  space.
On the other hand, part (ii) of the denition of transversality does not have this ambigu-
ity, since the manifold N  is now present in the data, as the target of the map f.   Nevertheless,
this  is  an  issue  in  which  one  must  be  careful.
N=R
3
N=R
3
N=R
3
Z
Z Z
p
In  the  picture  on  the  left,   M    Z;   for  at  each  point  where  they  meet,   their  tangent  planes
are  distinct,  and  any  two  distinct  2-dimensional  subspaces  of   R
3
together  generate  all  of   R
3
.
In  the  middle  picture,   M  and  Z  are  not  transverse,   since  at  the  point  p  where  they  meet,
T
p
M  = T
p
Z.   In  the  right-hand  picture  again  M    Z,  this  time  because  their  intersection  is
empty.
Theorem  1.52.   If  f  : M    N  is  transverse  to  Z  then  f
1
(Z),  if  not  empty,  is a  manifold,
and
dimM    dimf
1
(Z) = dimN     dimZ.
Moreover,
T
x
f
1
(Z) = (d
x
f)
1
(T
f(x)
Z).
Proof   Let  x    f
1
(Z).   By  1.31  there  is  a  neighbourhood  U  of   z  in  N  on  which  Z  has
regular equations  g   that  is,  a  smooth map  g  : U    R
c
such  that Z U  = g
1
(0) and 0  is
a  regular  value  of  g.   Here  c = dimN  dimZ.   Then  f
1
(U)  is  a  neighbourhood  of  x  in  M,
and
f
1
(Z)  f
1
(U) = f
1
(Z  U) = f
1
(g
1
(0)) = (g  f)
1
(0).
Now  we  show  that  0  is  a  regular  value  of  g  f.   By  applying  d
f(x)
g  to  the  equality  (1.4)  we
obtain
d
x
(g  f)(T
x
M) + d
f(x)
g(T
f(x)
Z) = d
f(x)
g(T
f(x)
N)   (1.5)
34
Since T
f(x)
Z  = ker d
f(x)
g, the second term on the left hand side is 0.   And since 0 is a regular
value  of   g,   d
f(x)
g  is  an  epimorphism  and  the  right  hand  side  is  equal   to  R
c
.   We  conclude
that  0  is  a  regular  value  of  g  f.   Since  f
1
(Z)  f
1
(U) = (g  f)
1
(0), f
1
(Z)  f
1
(U)  is
a  manifold.   Being  a  manifold  is  a  local  property,  so  f
1
(Z)  is  a  manifold.
The   second  statement   follows   from  the   fact   that   f
1
(Z)   is   locally  the   preimage   of   a
regular  value  of  a  map  into  R
c
.
For  the  third  statement,  we  have
T
x
f
1
(Z) = ker d
x
(g  f) = ker d
f(x)
g  d
x
f  = (d
x
f)
1
(ker d
f(x)
g) = (d
x
f)
1
(T
f(x)
Z),
1.29.   2
Let   M    N  be   manifolds.   The   codimension  of   M  in  N  is   dened  to  be   dimN  
dimM.   Thus,  part (ii) of the previous theorem says that if f    Z  and f
1
(Z) ,= , then the
codimension  in  M  of  f
1
(Z)  is  equal  to  the  codimension  in  N  of  Z.
Remark  1.53.   (i) The above  proof makes  clear that 0 is a regular value of g  f  if  and only
if  equation  (1.4)  holds  for  all  x  in  f
1
(Z  U).
(ii)  The  equality  in  (1.4)  can  be  rewritten  as  surjectivity  of  the  composite
T
x
M
  dxf
  T
f(x)
N  
  T
f(x)
N
T
f(x)
Z
 .
The  kernel   of  the  composite  is  (d
x
f)
1
(T
f(x)
Z).   The  rank-nullity  theorem  of  linear  algebra
tells  us  that  a  linear  map    :   V
1
    V
2
  is  surjective  if   and  only  if   the  dimension  of   ker   is
dimV
1
 dimV
2
.   So  the  composite  map  T
x
M    T
f(x)
N/T
f(x)
Z  is  surjective  if   and  only  if
(d
x
f)
1
(T
f(x)
Z)  is  as  small as  possible,  with  dimension  dimM  (dimN  dimZ).
(iii)   Suppose   y   is   a  regular   value   of   the   map  g   :   N    P   and  Z  :=  g
1
(y)   ,=  .   Since
T
z
Z  = ker d
z
g  for  each  z  Z,  d
z
g  gives  rise  to  an  isomorphism
T
z
N
T
z
Z
   T
g(z)
P.
Example  1.54.   We  give  three  examples  which  illustrate  the  denition  of  transversality  in
action.   The  rst  uses  it  directly;  the  second  uses  it via  the  interpretation  given  in  the  proof
of  Theorem  1.52,  that  f    Z  if  and  only  if  regular  equations  for  Z,  when  composed  with  f,
become  regular  equations  for  f
1
(Z).
(i)  Suppose  S    R
n
 R
p
is  a  manifold,   and  let    :   S    R
p
be  projection.   Then  for  each
point  y  R
p
,
_
R
n
y
_
    S  at  (x, y)      d
(x,y)
 : T
(x,y)
S    T
y
R
p
=  R
p
is  surjective.
For,   by  the  denition  of   transversality,   the  LHS  means  that  for  all   ( x,  y)    T
(x,y)
R
n
 R
p
,
there  exist  ( x
1
,  y
1
)  T
(x,y)
_
R
p
y
_
  and  ( x
2
,  y
2
)  T
(x,y)
S  such  that
( x
1
,  y
1
) + ( x
2
,  y
2
) = ( x,  y).
35
As  T
(x,y)
R
n
 y  =  R
p
 0,    y
1
  =  0,   and  so  we  must  have   y
2
  =   y.   Since  d
(x,y)
( x
2
,  y)  =   y,
and   y  was  arbitrary,  this  shows  that  d
(x,y)
  : T
(x,y)
S    T
y
R
p
is  surjective.
The argument in the opposite direction is equally simple.   Suppose d
(x,y)
 is surjective.   Then
given  ( x,  y)    T
(x,y)
R
n
 R
p
,   there  exists  a  vector  of  the  form  ( x
1
,  y)    T
(x,y)
S.   The  vector
( x   x
1
, 0)  is  in  T
(x,y)
R
n
y.   Thus  ( x,  y)  is  the  sum  of  a  vector  in  T
(x,y)
S  and  a  vector  in
T
(x,y)
R
n
y.
The geometry of the situation can be appreciated in the following  picture, in which n = p =
dimS  =  1.   The  points  where  y  R  fails  to  be  transverse  to  S  are  precisely  the  critical
points  of    : S    R.
R
R
0
1
2
y
y
y
S
 {y }xR
 {y } x R {y }x R
2
0
1
(ii)  Now  suppose  that  F  :   R
m
 R
p
  R
n
is  a  smooth  map,   Z    R
n
is  a  submanifold,   and
F    Z.   For  each  y    R
p
,   denote  by  f
y
  :   R
m
  R
n
the  map  f
y
(x)  =  F(x, y).   Then  for  each
(x, y)  F
1
(Z)  R
m
R
p
,
_
R
m
y
_
    F
1
(Z)  at  (x, y)      f
y
    Z  at  x.
To  see  this,  suppose  that  codimZ  = c  and  g
1
, . . ., g
c
  are  regular  dening  equations  for  Z  in
some  neighbourhood  U  of  F(x, y)  in  R
n
,   and  let  g  :   R
n
  R
c
be  the  map  with  component
functions  g
1
, . . ., g
c
.   Then  by  what  we  saw  in  the  proof  of  Theorem  1.52,  the  transversality
of  F  to  Z  means  that  0  is  a  regular  value  of g  F,  i.e.   g
1
 F, . . ., g
c
 F  are  regular  dening
equations  for  F
1
(Z)  in  F
1
(U).   Moreover,  by  the  same  argument,
f
y
    Z  at  x      d
x
(g  f
y
)  is  surjective.   (1.6)
Denote  by  i
y
  :   R
m
  R
m
R
p
sending  x  to  (x, y).   We  have
_
R
m
y
_
    F
1
(Z)  at  (x, y)      i
y
    F
1
(Z)  at  x.
Since  the  g
i
 F  are  regular  equations  for  F
1
(Z),  the  argument  of  1.52  gives
i
y
    F
1
(Z)  at  x      d
x
(g  F   i
y
)  is  surjective.   (1.7)
36
But  f
y
  = F   i
y
,  so  the  right  hand  sides  of  (1.6)  and  (1.7)  are  the  same.
Exercise  1.55.   (i)  Suppose  M  and  P  are  manifolds,   and  S    M   P  is  a  manifold.   Let
  :  S    P  be  the  restriction  of  the  standard  projection  M   P    P.   Show  that  if  y    P
then
_
M  y
_
    S      y  is  a  regular  value  of  .
(ii)  Suppose  that  F  : M  P    N  is  a  smooth  map,  Z   N  is  a  manifold,  and  F    Z.   For
each  y  P,  denote  by  f
y
  : M    N  the  map  f
y
(x) = F(x, y).   Show  that
f
y
    Z      M  y    F
1
(Z).
(iii)  Show,  in  the  situation  of  (ii),  that
f
y
    Z      y  is  a  regular  value  of    : F
1
(Z)    P.
2   Sards  theorem  and  the  density  of  transversality
2.1   Tangent  and  normal   bundles
Let  M   R
n
be  a  manifold.   We  dene  the  tangent  bundle  TM  to  be  the  set
TM  = (x, v)  M  R
n
: v   T
x
M,
and  denote  the  projection  TM    M  sending  (x, v)  to  x  by  p.
Proposition  2.1.   Let  M
m
 R
n
be  a  manifold.   Then  TM  is  a  manifold  of  dimension  2m,
and  the  projection  p : TM    M  is  a  submersion.
Proof   Suppose   that      :   U    V      R
m
is   a   chart   on   M.   Then   p
1
(U)   is   open   in
TM.   We   show  that   there   is   a  dieomorphism  p
1
(U)     V     R
m
.   As   TM  is   covered
by  open  sets   of   the   form  p
1
(U),   these   dieomorphisms   provide   an  atlas   for   TM.   For
each  x    U,   T
x
M  =  d
(x)
(
1
)(R
m
).   It   follows  that   the  map  V   R
m
  p
1
(U)   sending
(y, v) (
1
(y), d
y
1
(v)) is a smooth surjection.   It has the smooth inverse  T dened  by
T(x, v) = ((x), d
x
(v)),  and  is  thus  a  dieomorphism.
The   map  p   :   TM    M  is   a  submersion,   because   it   is   a  composite   of   submersions;   the
diagram
p
1
(U)
p
 V  R
m
pr
 V
commutes,  so  p = 
1
 pr  T.   2
37
Remark  2.2.   By  juxtaposing  the  diagram  in  this  proof  with  the  commutative  diagram
V  R
m
1
id
 U  R
m
pr
 U
we  obtain  the  cummutative  diagram
p
1
(U)
p
 U  R
m
pr
U
id
 U
more  succinctly  displayed  as
p
1
(U)
p
A
A
A
A
A
A
A
 U R
m
pr
|
|
|
|
|
|
|
|
U
(2.1)
in  which  (check  this!)   restriction  of   the  horizontal   arrow  to  each  bre  p
1
(x)  =  T
x
M  is  a
linear  isomorphism  T
x
M    x R
m
.
Let  M
m
  R
n
be  a  manifold.   The  normal   space  to  M  at  x,   
x
(M, R
n
),   is  the  orthogonal
complement  (T
x
M)
to  T
x
M  in  R
n
.   We  dene  the  normal   bundle  of   M  in  R
n
,   denoted  by
(M, R
n
),  by
(M, R
n
) = (x, v)  R
n
R
n
: x  M, v   (T
x
M)
,
More  generally,   let  M    N    R
n
be  manifolds.   The  normal   bundle  to  M  in  N,   which  we
denote  by  (M, N),  is  the  set
(x, v)  M  T
x
N  : v  (T
x
M)
.
We  denote  the  projections  (M, R
n
)    M  by  and  (M, N)    M  by  p.
Proposition  2.3.   Let  M   R
n
be  a  manifold  of  dimension  m,  and  write  c = n m.
(i)  If   in  the  open  set   U    R
n
M  has  regular  equations  g  :   U    R
c
then  for  each  x    U,
g
1
(x), . . ., g
c
(x)  form  a  basis  for  
x
(M, R
n
).
(ii)  (M, R
n
)  is  a  manifold  of  dimension  n.
Proof   (i)  Recall  that  g
i
(x)  is  the  (unique)  vector  in  R
n
such  that  for  every  vector  v,
g
i
(x)  v = d
x
g
i
(v)
(in  this   connection,   see   Exercise   2.6(6)   below).   Since   T
x
M  =  ker d
x
g,   we   have   T
x
M  =
i
 ker d
x
g
i
  = 
i
(g
i
(x))
= Spg
1
(x), . . ., g
c
(x)
.   Therefore
(T
x
M)
= Spg
1
(x), . . ., g
c
(x)
= Spg
1
(x), . . ., g
c
(x).
38
For  each  x    M,   the  vectors  g
1
(x), . . ., g
c
(x)  are  linearly  independent,   as  they  are  the
rows  of  the  matrix  of  d
x
g,  which  has  rank  c.   Hence  they  form  a  basis  for  
x
(M, R
n
).
(ii) Let x  M  and let U  be a neighbourhood of x in  R
n
on which M  has regular equations
g  : U    R
c
.   Write  U   M  = U
0
.   Then  p
1
(U
0
) = (M, R
n
)  (U   R
n
)  is  open  in  (M, R
n
),
and  by  (i)
(M, R
n
)  (U  R
n
) = (x, v) : x  M, v   Spg
1
(x), . . ., g
c
(x).
There  is  a  smooth  surjection  
U
0
  : U
0
R
c
  p
1
(U
0
)  given  by
(x, t
1
, . . ., t
c
) (x, t
1
g
1
(x) +   + t
c
g
c
(x)).
Claim:   
U
0
  is  a  dieomorphism.   To  show  this  I  will   exhibit  a  smooth  inverse.   The  inverse
is  the  following:
(x, v) (x, coecients  of  v  with  respect  to  the  basis  g
1
(x), . . ., g
c
(x)  of  
x
(M, R
n
)).
The expression  of a vector in terms of the vectors of a basis depends smoothly on the vector
(this  is  essentially  Cramers  rule).   Hence  
1
U
0
  is  smooth.   Because  U
0
R
c
is  a  manifold,  so
is  p
1
(U
0
).   Clearly,   every  point  in  (M, R
n
)  has  a  neighbourhood  of   this  form.   Therefore
(M, R
n
)  is  a  manifold.   2
Exercise  2.4.   Show  that  there  is  a  commutative  diagram
p
1
(U
0
)
  
p
C
C
C
C
C
C
C
C
U
0
R
c
pr
|
|
|
|
|
|
|
|
U
0
(2.2)
in  which  the  restriction  of  the  horizontal   map  to  each  bre  p
1
(x)  =  
x
(M, R
n
)  is  a  linear
isomorphism  
x
(M, R
n
)    x R
c
.
Both  TM  and  (M, R
n
),   or   rather   the  maps   TM
  p
   M  and  (M, R
n
)
  p
   M,   are
examples   of   vector  bundles  over   M    maps  X
  p
  M  characterised  by  the  existence  of
diagrams  like  (2.1)  and  (2.2).
Proposition 2.5.  Suppose that M
m
 N
n
 R
p
are manifolds.   Then (M, N) is a manifold
of  dimension  n.
Proof   (M, N) is the intersection  (in N R
p
) of (M, R
p
) and TN.   I leave  as an exercise
(2.6(3)  below)  the  proof  that  this  intersection  is  transverse.   The  corollary  follows.   2
Exercise  2.6.   1.   The  normal  bundle  of  the  circle  S
1
in  R
2
,  (S
1
, R
2
)  is  the  set
(x, v)  S
1
R
2
: v   (T
x
S
1
)
 = (x, v)  S
1
R
2
: v  = tx  for  some  t  R.
39
(i)  Find  regular  equations  for  (S
1
, R
2
)  in  R
2
R
2
.
(ii)  Show  (by  an  explicit  formula)  that  there  is  a  commutative  diagram
(S
1
, R
2
)
  
p
E
E
E
E
E
E
E
E
S
1
R
pr
.}
}
}
}
}
}
}
}
S
1
in  which  the  restriction  of  the  horizontal   map  to  each  bre  p
1
(x)  =  
x
(S
1
, R
2
)  is  a  linear
isomorphism  
x
(S
1
, R
2
)    x R.   Find also  an explicit  formula  for the  inverse  to  the  hor-
izontal  map.
(iii)  Generalise  (ii)  to  S
n
 R
n+1
.
(2)  The  tangent  bundle  of  S
n
is  the  set
(x, v)  S
n
R
n+1
: v  T
x
S
n
 = (x, v)  S
n
R
n+1
:   v  x = 0.
(i)  Find  regular  equations  for  TS
n
in  R
n+1
R
n+1
(ii)  Show  (by  an  explicit  formula)  that  there  is  a  commutative  diagram
TS
1   
p
=
=
=
=
=
=
=
  S
1
R
pr
.|
|
|
|
|
|
|
|
S
1
in  which  the  restriction  of   the  horizontal   map  to  each  bre  p
1
(x)  =  T
x
S
1
is  a  linear  iso-
morphism  T
x
S
1
  x R.   Find  also  an  explicit  formulae  for  the  inverse  to  the  horizontal
dieomorphism.   Hint:   the  inverse  may  be  easier;   at  each  point  (x, y)   S
1
,   T
(x,y)
S
1
is  gen-
erated  by  the  vector  (y, x).
Part  (ii)  of   (2)  does  not   generalise  to  S
n
for  all   n;   in  particular,   it  fails  for  S
2
,   though  it
is  true  for  S
3
.   This  is  a  consequence  of   a  rather  dramatic  theorem  called  the  Hairy  Ball
Theorem  which  we  will  prove  in  Section  3.
(3) Show,  by  making  a  calculation  for a  suitable  example  (e.g.   S
1
 R
2
),  that in  general  for
a  manifold  M,
T
(x,v)
TM  ,= T
x
M  T
v
(T
x
M).
(4)  Complete  the  proof  of  2.5  by  showing  that  TN  and  (M, R
p
)  are  transverse  in  N  R
p
.
It  is  worth  noting  that  since  both  TN  and  (M, R
p
)  are  contained  in  N   R
p
,   so  are  their
tangent  spaces  at  each  point  contained  in  the  tangent  space  to  N   R
p
.   It  follows  that  the
sum  of  these  two  spaces  is  at  best  equal   to  the  tangent  space  to  N  R
p
.   Thus,   for  example,
it   would  be  a  mistake   here  to  try  to  prove  that   TN  and  (M, R
p
)   are  transverse   in  the
bigger  space  R
p
 R
p
  they  cannot  possibly  be!   Moral:   before  embarking  on  a  proof  that
two  manifolds  are  transverse,  it  is  important  to  make  sure  you  choose  the  smallest  ambient
40
manifold  possible    i.e.   that  contains  them  both.
(5)  Let  M  be  a  smooth  curve  in  R
2
and  let  (x, v)    TM.   Under  what  circumstances  does
equality  hold  in  (3)  above?
(6)  Let  N    R
p
be  a  manifold,   and  let  f  :   M    R  be  a  smooth  function.   How  might  one
dene  a  gradient  vector  f(x)  T
x
M  at  each  point  x  M?   2
2.2   Tubular  neighbourhoods
Vector bundles  play  an important role in the theory  of manifolds.   For now,  we  use  (M, N)
to  construct  a  tubular  neighbourhood of  M  in  N.
Denition  2.7.   Let  M    N    R
n
be  manifolds.   A  tubular  neighbourhood  of   M  in  N  is
a  neighbourhood  U  of   M  in  N  together  with  a  smooth  submersion    :   U    M  such  that
| M
  = 1
M
.
Theorem  2.8.   Tubular  neighbourhood  theorem.   Let  M   N   R
n
be  manifolds.   Then
(i)  M  has  a  tubular  neighbourhood  U
M
  in  R
n
.
(ii)  M  has  a  tubular  neighbourhood  U
M,N
  in  N.
Proof.   Consider   the   exponential   map  of   the   normal   bundle   n  :   (M, R
n
)    R
n
given  by
n(x, v) = x + v.   I  claim  that
1.   At  every  point  (x, 0)  of  M  0  (M, R
n
),  d
(x,0)
n  is  an  isomorphism,  and
2.   n  is  1 1  on  M  0.
The  second  of  these  is  obvious.   The  rst  holds,  because
1.   T
(x,0)
(M, R
n
)   =  T
x
M   (T
x
M)
M,N
  : U
M
   N    M.   All  that  is  lacking  is  that  this  is  a  submersion.   It  is  a  submersion  at
each point of M,  since 
| M
  is the identity.   The set of points where any map is a submersion
41
is  open  (this  follows  directly  from  the  local  normal  form  for  a  submersion,  1.27).   So  M  has
a  neighbourhood  inside  U
M
   N  on  which  
|
  is  a  submersion.   We  take  this,   together  with
the  restriction  of  ,  as  our  tubular  neighbourhood  U
M,N
.   2
Remark  2.9.   In  the  tubular  neighbourhood  of  M  in  R
n
constructed  in  the  proof,  for  each
point u  U  the line  (u) u  lies in  T
(u)
M
u
).   In  fact  I
claim  that  if  M  is  closed  in  R
n
,  and  the  map  n  of  the  proof  of  the  Tubular  Neighbourhood
Theorem  2.25  denes  a  deomorphism  from a  neighbourhood W  of M 0  in (M, R
n
) to
a  neighbourhood  U  of  M  in  R
n
,   then  (u)  is  the  closest  point  to  u  on  M,   the  point  where
u
  attains  its  absolute  minimum.   This  is  left  as  an  exercise.
Exercise  2.10.   Suppose  M   R
n
and U  is  a  neighbourhood  of M  in  R
n
.   Show  that if M  is
compact  then  there  exists   > 0  such  that
_
xM
B(x, )  U.
Show,   by  an  example,   that  if   M  is  not  compact,   such  an    may  not  exist.   (In  fact  there
exists  a  strictly  positive  function    : M    R  such  that
_
xM
B(x, (x))  U,
but   to  prove   this   one   needs   (I   think)   to  use   partitions   of   unity,   which  we   have   not   yet
introduced.)
Exercise 2.11.  (i) Write down an explicit formula for the composite S
1
R    (S
1
, R
2
)
  n
R
2
(where the map S
1
R    (S
1
, R) is the dieomorphism you found in Exercise 2.6(1)(ii),
and  n  is  the  exponential  map  used  in  the  proof  of  2.8).
(ii)  Find  a  neighbourhood  of  W  of  S
1
 0  in  (S
1
, R
2
),   and  a  neighbourhood  U  of  S
1
in
R
2
,  such  that  n : W    U  is  a  dieomorphism.   You  may  nd  it  helpful  to  replace  (S
1
, R
2
)
by  S
1
R  (because  you  can  draw  it) and  use  the  composite  map  you  found  in  (i)  instead  of
n.
(iii)  Write  down  a  formula  for  the  projection    : U    S
1
that  you  get  as  the  composite
U
  n
1
  W    S
1
0 = S
1
.
It  should  be  something  very  familiar!
(iv)  Generalise  the  answer  to  (iii)  to  S
n
 R
n+1
.   2
Remark  2.12.   The dieomorphism W   U
M
  in the proof of 2.8(i) shows that the way that
M  sits  inside   R
n
is  just  the  same  (up  to  dieomorphism)  as  the  way  that  M  =  M   0
sits  inside  the  normal   bundle  space  (M, R
n
).   Our  cheap  proof   of   part  (ii)  of   2.8  avoided
42
the  construction  of  an  exponential  map  (M, N)    N,  and  thus  did  not  give  us  a  dieo-
morphism  from  a  neighbourhood  of   M   0  in  (M, N)  to  a  neighbourhood  of   M  in  N.
However,   it  is  possible  to  show  the  exixtence  of  such  a  dieomorphism,   using  the  existence
of   tubular  neighbourhoods  of   M  in  R
n
and  of   N  in  R
n
in  a  simple  variant  of   the  proof   of
2.8(i)  -  see  Exercises  II.
2.3   Sards  theorem
Tubular  neighbourhoods  have  many  uses  in  dierential   topology.   The  rst  use  we  will   put
them to, is as a means of constructing perturbations of maps.   It will be extremely important
to be able to perturb a given map f  : M    N  an arbitrarily small amount so that it behaves
well with respect to some predetermined criterion.   The main example of this will be in our
treatment  of  intersection  theory:   given  a  map  f  :   M    N  and  a  submanifold  Z  of  N,   we
would like to be able to perturb f  an arbitrarily small amount so that it becomes transverse
to  Z.   And  we  would  like  the  perturbed  map  to  be  homotopic  to  f.   In  the  construction  of
such perturbations and homotopies, the linear structure of the ambient space  R
n
is extremely
useful:   the  easiest  way  to  deform  or  perturb  a  map  f  :  M    N  is  to  add  to  it  some  other
map  p(x)  (  for  small,   p  for  perturbation).   The  problem  is  that  in  a  general  manifold
N  we   have   no  additive   structure,   so  the  addition  f(x) +  p(x)   can  only  be  done  in  the
ambient  R
n
.   The role of the tubular neighbourhood U
N
   R
n
and its projection   : U
N
    N
is  to  allow  us  to  project  the  perturbed  map  f(x) + p(x)  back  into  N,   to  get  a  new  map
f
  : M    N.
Theorem 2.13.  Weak Transversality Theorem Given a smooth map of manifolds f  : M    N,
and  a  submanifold  Z  of   N,   then  for  every    >  0  there  exists  a  smooth  map  f
  :   M    N,
homotopic  to  f,  such  that  for  all  x  M,  |f(x) f
  is  transverse
to  Z.
Here the distance we are using is the Euclidean distance, in the ambient Euclidean space
of  N.   The  proof  of  2.13  relies  on  two  ingredients:   the  rst  is  Sards  theorem,  a  moderately
deep  theorem  of   analysis  whose  proof   we  will   skip  (though  see  Milnors  little  book  for  an
accessible  treatment).   The  second  is  a  very  clever  lemma  of  Rene  Thom,  which  puts  Sards
theorem  to  spectacular  use.
Theorem  2.14.   Sards  Theorem  Let   f   :   M    N  be  a  smooth  map  of   smooth  manifolds.
The  set  of  critical  values  of  f  has  measure  0.   2
Notice  rst  that  it  says  that  the  set  of  critical  values  has  measure  zero  (we  will   explain
what that means in a moment),  not the set of critical points.   The latter set can be very  big,
as  in  the  picture  below,  which  shows  the  graph  of  a  smooth  function  f  :   R    R.
43
y=f(x)
c
a
b
The  set  of  critical  points  is  the  interval  [a, b],  whereas  the  set  of  critical  values  has  just  one
point,  c.
Denition  2.15.   The  set   X    R
m
has  measure  zero  in  R
m
(or   just   measure  zero  if   the
ambient   space   is   clear)   if   for   every     >  0  it   is   possible   to  nd  a  covering   of   X  by  an
enumerable  set  of  rectangles  R
j
jN
  such  that
j
vol(R
j
) < .
Here  rectangle  means  a  subset  of   R
m
of  the  form  [a
1
, b
1
]      [a
m
, b
m
],  and  the  volume
vol(R)  of  such  a  rectangle  is  (b
1
a
1
)    (b
m
a
m
).   A  rectangle  is  by  denition  closed
and  bounded,  and  hence  compact.
Note  that  the  notion  of   measure  zero  is  relative:   the  real   interval   [0, 1]   does  not  have
measure  zero  in  R,  but  if  we  consider  [0, 1]  as  a  subset  of   R
2
by  means  of  the  inclusion  of   R
as  R 0,  it  does  have  measure  zero  in  R
2
.
Proposition  2.16.   1.   The  union  of  a  countable  set  of  sets  X
k
   R
m
of  measure  zero  in
R
m
has  measure  zero  in  R
m
.
2.   If X   R
m
has measure zero in  R
m
and f  :   R
m
  R
m
is smooth, then f(X) has measure
zero  in  R
m
.   Ditto  if   f   is  dened  only  on  some  open  set   U    R
m
,   replacing  f(X)  by
f(X  U).
3.   No  open  subset  of   R
m
can  have  measure  zero  in  R
m
.
Proof   :   1.   This  is  easy  to  prove  directly  from  the  denition  (just  nd  a  countable  cover
of  X
k
  with  total  volume  /2
k
,  and  put  these  covers  together).
2.   This  requires  a  little more  work.   Since  this is  Analysis,  I feel entitled  to  be a  bit sketchy:
Step  1 A smooth map is locally Lipschitz:   for every compact set K   U, there is a constant
C  such  that  for  x
1
, x
2
   K,  |f(x
1
) f(x
2
)|  C|x
1
 x
2
|.   We  call  C  a  Lipschitz  constant
44
for  f  on  K.
Step  2  Any  open  set  U    R
m
is  union  of  a  countable  collection  of  rectangles
  
U
k
.   And  any
set  X   R
m
is  the  union  of  the  countable  collection  X 
  
U
k
.
Step 3 Suppose that f  has Lipschitz constant C  on the rectangle R.   Then f(R) is contained
in  a  rectangle  of  volume  less  than  or  equal  to  C
m
vol(R).
Step  4  Suppose  that  f   has  Lipschitz  constant  C
k
  on
  
U
k
.   Given    >  0,   choose  a  cover  of
X 
  
U
k
  by  rectangles  contained  in
  
U
k
  whose  total  volume  is  less  than
2
k
(C
k
)
n
.
Step 5 Then f(X) is contained in a countable union of rectangles of total volume < .   Thus
f(X)  has  measure  zero.
3.   Any  open  set  contains  a  rectangle  R.   If  the  countable  union  of  rectangles  R
k
  contains  R
then
k
 vol(R
k
)  vol(R).   2
Denition  2.16.   (continued).   Let  M
m
  R
n
be  a  manifold.   We  say  the  set  X    M  has
measure  zero  in  M  if  for  every  chart    :  U    V   on  M,   (U   X)  has  measure  zero  in  R
m
.
If  a property  is held  by all points of M  except  for the members  of some  set of measure  zero,
we  say  that  the  property  holds  for  almost  all   x  M.
In order to check  that X  has measure  zero in M  it is enough to check  that (U X) has
measure  zero  in  R
m
for  each  of  a  collection  of  charts    whose  domains  together  cover  M.   I
leave  the  proof  of  this  as  an  exercise  (use  (i)  and  (ii)  of  2.16).
The  use  we  will   make  of  the  property  of   having  measure  zero  will   generally  go  via  the
following  rather  weak  consequence:
Proposition  2.17.   If  X   M  has  measure  zero  then  M   X  is  dense  in  M.
Proof   If  not,   then  some  open  subset  of  M  is  contained  in  X.   Intersecting  this  set  with
the  domain  of  some  chart,   we  can  transport  it  to  R
m
,   where  it  is  open.   But  no  non-empty
open  set  in  R
n
can  have  measure  zero.   2
Example  2.18.   Applications  of  Sards  Theorem
1.   If  f  :   M
m
  N
n
is  a  smooth  map  and  m  <  n,   then  the  set  of  critical   values  of  f  is
exactly the image of M.   hence, by Sards Theorem, the image of f  has measure zero in
N.   In particular, the image of a smooth map  R    R
2
has measure zero in  R
2
.   Contrast
this  with  the  Peano  curve,   a  surjective  continuous  map  from  the  unit  interval   to  the
unit  square.
2.   Sards Theorem has an application in classication problems.   For example, any triple
of straight lines through (0, 0) in the plane can be transformed by a linear isomorphism
to  a  given  xed  triple,  say  the  triple  x
1
  = 0  x
2
  = 0  x
1
  = x
2
.
45
Proposition  2.19.  The corresponding statement is no longer true  if we replace triples
of  lines  through  (0, 0)  by  quadruples.
Proof   Let   X
1
  =  x
1
  =  0  x
2
  =  0  x
1
  =  x
2
  x
1
  =  x
2
.   The   set   of
quadruples of straight lines through (0, 0) contains the set Q of quadruples of the form
x
2
  =  ax
1
  x
2
  =  bx
1
  x
2
  =  cx
1
  x
2
  =  dx
1
.   There  is  evidently  a  bijection
s : Q    R
4
.   The  set  Gl
2
(R)  of  invertible  matrices
_
  
11
  
12
21
  
22
_
is  also  a  4-dimensional   manifold.   The  subset   G
of   Gl
2
(R)  such  that  A(X
1
)     Q  is
easily  identiable    it   is   the  open  set   consisting  of   those  matrices   in  which  
12
  ,=
0, 
11
 + 
12
  ,=  0, 
11
 
12
  ,=  0, 
11
  ,=  0.   The  composite  of  the  map  G
  Q  with  the
map  s : Q    R
4
is  smooth.   We  will  call  it  f.   Because  both  the  source  and  the  target
of f  have the same dimension, 4, one might hope that f  would be surjective  showing
that every  quadruple  in Q is isomorphic  to the quadruple  X
1
.   However,  the derivative
d
A
f  : T
A
G
  T
f(A)
R
4
is  never  surjective.   For  every  matrix  in  the  1-parameter  family
of matrices A(t) = (1 +t)A : t  R takes X
1
  to the same quadruple of lines;  in other
words  f(A(t)) = f(A(0))  for  all  t,  and  hence  d
A
f(A
(0)) = 0. As A
(0) = A ,= 0,  d
A
f
has  non-trivial  kernel   and  therefore  is  not  surjective.   We  conclude  by  Sards  theorem
that  the  image  of  f  has  measure  zero  in  R
4
  only  rather  special   quadruples  of  lines
are  isomorphic  to  the  quadruple  X
1
.   2
2
Classication  problems  of  this  type  increase  in  complexity  when  we  go  up  a  dimension.
Clearly  not  every  quadruple  of   planes  through  (0, 0, 0)  in  R
3
is  linearly  isomorphic  to  the
quadruple  X
0
  =  x
1
  =  0  x
2
  =  0  x
3
  =  0  x
1
 + x
2
 + x + 3  =  0:   each  triple  of
the  four  planes  making  up  X
0
  intersect  only  at  (0, 0, 0),  and  the  same  must  be  true  of  any
quadruple  of  planes  to  which  X
0
  is  linearly  isomorphic.
Exercise  2.20.  (i) Show that X
0
  is linearly isomorphic to any quadruple of planes with this
property.
(ii)   Let   X
1
  =  X
0
   x
1
  + 2x
2
  + 3x
3
  =  0.   Each  triple  of   the  ve  planes   making  up  X
1
meets  only  at  (0, 0, 0).   Let  G  be  the  set  of  quintuples  of  planes  with  this  property.   Is  every
quintuple  in  G  linearly  isomorphic  to  X
1
?   2
2.4   The  construction  of  transverse  perturbations
Now  we  turn  to  our  main  theme,  the  density  of  transversality.
46
Theorem  2.21.   Thoms  Transversality  lemma  Suppose  that  F  :  M   P    N  is  a  smooth
map,   that  Z    N,   and  that  F    Z.   For  y    P,   denote  by  f
y
  the  map  M    N  dened  by
f
y
(x) = F(x, y).   Then  for  almost  all  y  P,  f
y
    Z.
Proof   The  key  to  the  proof  is  the  following  lemma:
Lemma  2.22.   Under  these  circumstances,  f
y
    Z  if  and  only  if  y  is  a  regular  value  of  the
projection    : F
1
(Z)    P.
Proof of lemma:   This is Exercise 1.55(iii), and most of the details were covered in Example
1.54 in the case where M  =  R
m
, P  =  R
p
and N  =  R
n
, so we give just the outline of the proof
now.
Step  1:
y  is  a  regular  value  of     
_
M  y
_
    F
1
(Z).
Step  2:
_
M y
_
    F
1
(Z)      f
y
    Z.
Details  of  steps  1  and  2  are  given  in  Example  1.54(ii)  and  (i)  respectively.
Proof   of   Theorem:   By  Sards  theorem,   almost   all   points   y     P  are  regular   values   of
|
  : F
1
(Z)    P.   Hence,  by  the  lemma,  for  almost  all  y,  f
y
    Z   2
Now  we   can  prove   the   elementary   transversality   theorem.   We   want   to   perturb  f   :
M    N  an  arbitrarily  small   amount   to  make  it   transverse   to  Z.   To  do  this,   we  try  to
deform  f  by  adding  to  it  some  map  depending  on  new  variables.
Example 2.23.  Suppose that M   R
n
and that f  : M    R.   One way we might get a family
F  of functions, deforming f, is by taking  R
n
as parameter space and dening F  : MR
n
  R
by
F(x, a) = f(x) + a  x.
Clearly F(x, 0) = f(x), or, in other words, f  = f
0
.   We can think of F  as a family of functions
M    R,  parametrised  by  a  R
n
.   2
In this example the target space  R is a vector space and so we can add on to f(x) the term
a  x.   When  the  target  space  is  not  R  but  some  manifold  N  without  any  additive  structure,
we  have  to  be  more  ingenious.   This  is  where  the  tubular  neighbourhood  theorem  comes  in.
Example   2.24.   Suppose  that   f   :   M    N  is   a  smooth  map,   and  that   U
N
  is   a  tubular
neighbourhood of N  in its ambient Euclidean space  R
p
.   Consider the map G : MR
p
  R
p
dened  by  G(x, a) = f(x) + a.   In  general  G(x, a)  does  not  lie  in  N.   However,  suppose  that
U
N
  is  a  tubular  neighbourhood  of  N  in  R
p
,   with  projection    :  U
N
    N.   If  G(x, a)    U
N
then by  applying    we  get a  point in N.   For example,  suppose  that N  is compact,  and that
  >  0  is  so  small   that  U
N
    
yN
B(y, ).   Then  for  x    M  and  a    B(0, ),   G(x, a)    U
N
.
Thus  we  can  dene  a  family
F  : M  B(0, )   N,
47
deforming  f,  by
F(x, a) = (f(x) + a).
If N  is not compact, we can make use of a function   : N    R as in Exercise  2.10, such that
_
yN
B(y, (y))  U
N
,
and  dene  F  : M  B(0, 1)    N  by
F(x, a) = (f(x) + (f(x))a).
These  two  deformations  are  by  no  means  the  only  ways  to  deform  a  given  function  or  map,
but  they  illustrate  a  general  approach.   In  fact  they  do  more:   2
Theorem  2.25.   Let   f   :   M    N  be  a  smooth  map,   let   Z  be  a  submanifold  of   N,   and  let
F  : M  B(0, 1)    N  be  the  family  constructed  in  Example  2.24.   Then
(i)  F  is  a  submersion;
(ii) for almost all a  B(0, 1),  the  map  f
a
  : M    N  dened by  f
a
(x) = F(x, a) is  transverse
to  Z.
Proof   We   already   know  that      :   U
N
    N  is   a   submersion,   by   denition  of   tubular
neighbourhood.   To   prove   that   F   is   a   submersion   it   is   enough  to   show  that   the   map
G  :   M   B(0, 1)    U
N
,   G(x, u)   =  f(x) +  (f(x))a,   is   a  submersion.   But   this   is   obvi-
ous,  if  we  look  closely:   with  respect  to  the  variable  a,  G  is  just  a  translation  plus  a  dilation
by  a  non-zero  scale  factor  (f(x)).   So,  for  any  v  R
p
,  we  have
d
(x,a)
G
_
0,
  1
(f(x))
v
_
= v.
This  shows  that  F  =   G  is  a  submersion.
(ii)   As   F   is   a  submersion,   it   is   transverse   to  Z.   It   follows   by  2.21  that   for   almost   all
a  B(0, 1),  f
a
    Z.   2
Corollary  2.26.   There  exists  a  map  arbitrarily  close  to  f,   and  homotopic  to  it,   which  is
transverse  to  Z.
Proof   By 2.25 and 2.16(iii), in every neighbourhood of 0 in B(0, 1) there are points a such
that  f
a
    Z.   Choose  any  such  a,  as  close  as  to  0  as  is  desired,  and,  using  F  as  in  the  proof
of  2.25,  dene  a  homotopy  H  : M  [0, 1]    N  by  H(x, t) = F(x, ta).  Then  H(x, 0) = f(x)
and  H(x, 1) = f
a
(x).   2
In  fact  2.26  contains  the  Weak  Transversality  Theorem  2.13.   So  we  have  achieved  the
goal  of  this  section.
There  are  in  fact  many  circumstances  where  transversality  theorems  rather  like  2.25  are
used.   Ours  is  the  Weak  Transversality  Theorem  because  it  is  the  easiest,   only  addressing
48
the behaviour of f  with respect to a single submanifold.   The full strength version is Thoms
Transversality  Theorem.   It   assures   us   that   by  an  arbitrarily  small   perturbation  we   can
deform a given map to behave  well in any one of a potentially unlimited  number of dierent
ways.   For example,  we  can  deform  it  so  that  all  the  critical  points  of  the  deformed  map  are
non-degenerate;   or,   if   we  begin  with  a  map  f   :   M
2
  N
3
,   we  can  deform  it  so  that  in  a
neighbourhood of each of its singular points the image is dieomorphic to one of the singular
surfaces  shown  here:
point double
Whitney umbrella triplepoint
The  method  always  uses  the  idea  that  transversality  is  dense,   but  it  is  an  auxiliary  map
(the  jet  extension  map  of  f)  that  is  required  to  be  transverse  to  certain  submanifolds  of  an
auxiliary  space,  the  jet  space.   You  might  gain  at  least  some  idea  of  what  this  is  about  from
Exercise   2.27.   The   Whitney   umbrella   is   the   image   of   the   map   R
2
  R
3
f(x
1
, x
2
)   =
(x
1
, x
2
2
, x
1
x
2
).   For   each  point   x    R
2
,   the  derivative   d
x
f   is   a  linear   map   R
2
  R
3
.   We
can view  this  family of  linear  maps as  a  (smooth) map df  :   R
2
  L(R
2
, R
3
),  where  L(R
2
, R
3
)
is  the  space  of  linear  maps  from  R
2
to  R
3
.   Show
1.   f  is  an  immersion  except  at  0.
2.   d
0
f  lies  in  the  submanifold
1
of  L(R
2
, R
3
)  consisting  of  linear  maps  of  rank  1.
3.   df  :  R
2
  L(R
2
, R
3
)  is  transverse  to
1
.
4.   If  g  :   R
2
  R
3
is  a  polynomial  map  such  that  g(0) = 0  and  d
0
g  has  rank  1,  then  after
a  change  of  co-ordinates  in  R
2
and  R
3
,  the rst  degree  part of f  can  be  brought to  the
form  (x
1
, x
2
) (x
1
, 0, 0).
5.   (Harder)  If   also  dg  
1
,   then  after  a  further  change  of   coordinates  in  source  and
target, the second-degree part of g can be brought to the form (x
1
, x
2
) (x
1
, x
2
2
, x
1
x
2
).
2
In  the  1940s   Hassler   Whitney  proved  a  number   of   theorems   about   the   dimension  of
Euclidean  spaces  in  which  a  manifold  might  be  embedded  or  immersed.
49
Theorem  2.28.   If  M
m
is  a  smooth  manifold  then  there  exists  an  embedding  M    R
2m+1
.
Theorem  2.29.   If  M
m
is  a  smooth  manifold  then  there  exists  an  embedding  M    R
2m
.
Theorem  2.30.   If  M
m
is  a  smooth  manifold  then  there  exists  an  immersion  M    R
2m1
.
The  rst   of   these  is  very  easy  to  prove,   using  Sards  Theorem  with  a  little  ingenuity,
whereas  the  second  and  third  are  much  harder.   In  fact  the  rst  can  be  strengthened,   if  we
allow  some  terms  we  have  not  dened:   in  the  space  of  smooth,   proper  maps  M    R
2m+1
,
embeddings  form  an  open  dense  set.   See  Exercises  II  for  a  guide  to  the  construction  of   a
proof.   In  contrast,   in  the  image  of   a  map  f   :   M
m
  R
2m
,   we  may  encounter   transverse
self-intersections,   where  two  parts  of  the  image  coming  from  distant  parts  of  M  meet  one-
another  transversely,   as  in  Example  1.37.   To  remove  such  a  self-intersection  would  require
a  large  modication  of  the  map.   So  there  is  no  embedding  nearby,  and  embeddings  are  not
dense in the space of proper maps M    R
2n
.   Because of this, the proof of Whitneys second
theorem  is  much  harder  than  the  rst.
A  similar  phenomenon,   the  persistance  of  transverse  intersection,   explains  the  diculty
of   the  third  of   these  theorems.   For  example,   a  map  from  M
2
  R
221
=  R
3
may  have
non-immersive  points  like  the  Whitney  umbrella  discussed  in  Exercise  2.27.   Because  the
derivative  of  such  a  map  is  transverse  to  the  set  of  linear  maps  of  rank  1,  the  points  where
its  meets  with  this  set  (i.e.   the  non-immersive  points  of  the  map)  cannot  be  removed  by  an
arbitrarily small perturbation.   So immersions  are not dense  in the space of maps  M
2
  R
3
.
2.5   The  stability  of  transverse  intersections
Suppose  that  f  :   M    N  is  smooth,   and  transverse  to  the  submanifold  Z  of  N.   Suppose
that  there  is  a  smooth  family  of  maps  F  : M  P    N  deforming  f,  i.e.   there  is  a  y
0
    P
such  that  f
y
0
  = f  (notation  as  in  Theorem  2.21).   We  would  like  to  show  that  for  y  in  some
neighbourhood  of  y
0
  in  P,
1.   f
y
  continues  to  be  transverse  to  Z,  and
2.   f
1
y
  (Z)  is  dieomorphic  to  f
1
(Z).
Neither  is  true  in  quite  the  simple  way  one  would  like.   In  the  following  picture  N  =  R
3
,
M  =  R  and  Z  is  an  open  interval  situated  along  the  vertical  axis.
 Image of M
Z
50
As   both  M  and  Z  are  1-dimensional,   the  only  way  they  can  be  transverse   is   by  not
meeting.   In the picture,  they do not meet.   But if we move the image of M  up an arbitrarily
small  amount,  they  will  meet,  and  transversality  will  fail.   It is  clear  that we  need  to  ask  for
Z  to  be  closed  in  N,  otherwise  this  sort  of  situation  can  always  be  manufactured.
Now  interchange  M  and  Z.   Now  Z  is  closed,  but  transversality  will  still  fail  if  we  move
M  an arbitrarily small amount.   So we  need to impose some kind  of closedness  on the image
of  M  also.   The  simplest  thing  to  ask  for  is  that  f  : M    N  be  proper.
Exercise  2.31.   Suppose  that  F  : M P    N  is  a  smooth  map  and  Z  is  a  submanifold  of
N.   For  each  y  P  we  denote  by  f
y
  the  map  M    N  dened  by  f
y
(x) = F(x, y).   Let
B  := (x, y)  M  P  : f
y
  is  not  transverse  to  Z  at  x.
(a)  Find  B  when
(i)  M  = (0, )  R,   P  =  R,   N  =  R
3
and  Z  = 0 R 0  N,  and  F  : M  P    N  is
dened  by  F(x, t) = (x t, 0, 0)
(ii) M  =  R,   N  =  R
3
,   P  =  R and Z  = (0, ) (0, 0)   N, and F  : MP    N  is dened
by  F(x, t) = (t, x, 0).
(b)  Check  that  in  both  cases,   the  set  y    P  :  f
y
  is  not  tranverse  to  Z  is  not  closed  in  P.
Note  that  this  set  is  just  the  projection  to  P  of  the  set  B.   2
Exercise  2.32.   (i)  Suppose  that  Z    N  is  closed  in  N,   and  F  :   M   P    N  is  smooth.
Show  that  the  set  B  dened  in  the  previous  exercise  is  closed  in  M P.   Hint:   let  C  be  the
complement  of  B.   Because  Z  is  closed  in  N,   for  every  point  w    N  there  is  a  neighbour-
hood  U  of  w  in  N  in  which  Z  has  a  regular  equation  g.
  3
Then  C   F
1
(U)  =  (x, y)  
F
1
(U) : g  f
y
  is  submersion  at  x  F
1
(U  Z),  and  thus  is  open  in  F
1
(U).   Every  point
(x, y)  M  P  lies  in  such  an  F
1
(U),  so  C  is  open.
(ii) Show that if Z  is closed in N  and M  is compact then the restriction to B of the projection
M  P    P  is  proper,  and  conclude  that  the  set
y  P  : f
y
    Z
is  open  in  P.   2
Proving  the  second  part   of   the  stability  of   transverse   intersections,   that   they  remain
essentially   unchanged  when  they  are   perturbed,   is   more   subtle.   How  do  we   show  that
3
This  is  not  the  case  if   Z  is  not  closed.   For  example,   the  point  (0, 0, 0)  in  Exercise  2.31(ii)  has  no  such
neighbourhood.   If   Z  is  closed  then  points  on  Z  have  such  neighbourhoods,   by  Corollary  1.31,   and  points
not  on  Z  have neighbourhoods whose intersection with  Z  is empty, in  which case as regular equation we can
take  any  equation  which  is  never  satised.
51
f
1
y
1
  (Z)  f
1
y
2
  (Z)  if  y
1
  is  near  y
2
  and  f
y
1
    Z?   As  yet  we  have  no  tools  for  producing  such
dieomorphisms.   In  fact  there  is  a  theorem  that  assures  us  that  precisely  this  is  the  case,
but  its  proof  is  beyond  the  scope  of  these  notes
4
.
Theorem  2.33.   Ehresmanns   Fibration  Theorem  Suppose  that   f   :   X    P   is   a  proper
submersion.   Then  f  is  a  locally  trivial   bre  bundle.
A  locally  trivial   bre  bundle  is   a  map    :   X    P  with  the  property  that   over   some
neighbourhood  of  each  point  in  P,    is  dieomorphic  to  the  projection  of  a  product.   More
precisely,   for   each  y     P   there   is   a  neighbourhood  U  of   y   in  P   and  a  dieomorphism
1
(U)
  
U
  U 
1
(y)  such  that  the  diagram
1
(U)
  
U
   U  
1
(y)
      
1
U
where  
1
  is  projection  to  the  rst  factor,  is  commutative.
Exercise  2.34.   Suppose  that  F  : M  P    N  is  smooth,  and  that  f
y
0
    Z.   Suppose  that
M  is  compact  and  Z  is  closed  in  N.   Prove  that  there  is  a  neighbourhood  U  of  y  in  P  such
that  
|
  : F
1
(Z)  (M  U)    U  is  a  locally  trivial  bre  bundle,  and  in  particular  that  for
all  y  U,  f
y
 Z  and  f
1
y
  (Z)  f
1
y
  (Z).   2
3   Oriented  Intersection  Theory
Consider  the  following  pictures:
I
II
IV
III
D
D
C
C
D
C
D
C
2
1
We  see  several   pairs  of  curves  on  the  torus   T.   I  want  to  assign  an  intersection  index  to
each  pair  of  curves.   In  the  simplest  case,  where  the  two  curves  meet  transversely,   I  want  to
do  it  essentially  by  counting  intersection  points,   as  in  pictures  I,   III  and  IV.   But  I  want  it
4
A  sketch  of   a  proof   is  given  in  Section  4  of   my  Lecture  Notes  for  MA5N0  Cohomology,   Connections,
Curvature  and  Characteristic  Classes.
52
to  be  dened  for  every  pair  of   curves  -   even  the  pair  in  II,   which  meet  at  innitely  many
points.   This  last  seems  problematic  -  until   we  recall   that  we  have  just  proved  that  we  can
deform  one  of   the  curves  an  arbitrarily  small   amount,   in  a  homotopy,   so  that  it  becomes
transverse to the other.   So in picture II, we could deform C  to C
1
, as shown in III, and then
count  intersection  points.   But  we  could  also  deform  it  to  C
2
,   as  in  IV,   and  then  count  
getting  a  dierent  answer.   The  way  to  make  the  answers  the  same  in  III and  IV  is  to  count
each  intersection  point  with  a  sign    i.e.   as  1    and  then  add.   We  have  to  have  some
convention  to  count  with  signs;  this  is  what  orientation  is  about.
In  the  case  of  curves  meeting  on  a  surface,  its  not  too  hard.   We  agree  on
1.   an  orientation  for  C  and  for  D
2.   an  order  in  which  we  calculate  the  intersection    i.e.   C  and  D  or  D  and  C,  and
3.   a rule which determines, at every point on the surface, a positive direction of rotation.
Then  the  procedure  for  assigining  a  sign  to  each  transverse  intersection  point  is  as  follows:
Rule  3.1.   let  v
1
  and  v
2
  be  forward  pointing  tangent  vectors   to  C  and  D  at  x.   Because
C    D  at x,  v
1
  and v
2
  are not parallel.   Hence  there  is a  shortest  rotation taking v
1
  to  v
2
.   If
this  is  in  the  positive  direction,  count  +1.   If  it  is  in  the  negative  direction,  count  1.
The  reason  this   rule  is   so  good  is   that   where  two  intersection  points   can  be  brought
together  and  annihilate  one  another  in  a  homotopy,   then  it  turns  out  that  they  must  have
opposite  sign, and so  the sum  of  the  oriented  intersection  numbers  is  the same  before and
after  the  homotopy.   This  will  be  shown  below.
An  orientation  on  a  curve  is  unproblematic    its  just  an  arrow,  a  direction,   and  there
are two possibilities.   But what about on a surface, or in higher dimensions still?  The general
denition is initially a little surprising, and we have to work to reconcile it with our intuition.
Denition  3.2.   (i)  Let  V   be  a  vector  space  over   R.   Say  two  ordered  bases  v
1
, . . ., v
m
  and
w
1
, . . ., w
m
  are  equivalent  if  the  change  of  basis  matrix
  5
from  one  to  the  other  has  positive
determinant.   There  are  thus  two  equivalence  classes  of   bases.   An  orientation  of   V   is  the
choice   of   one   of   these   two,   called  the   class   of   positive   bases.   So  there   are   two  possible
orientations  of   V .   Note  that  to  specify  an  orientation  it  is  enough  to  declare  one  ordered
basis  to  be  positive.
Example  3.3.   1.   On  R
n
we  dene  the  standard  orientation  by  declaring  that  the  basis
v
1
, . . ., v
n
  is  positive  if  and  only  if  det(v
1
, . . ., v
n
)  is  positive.   In  all  of  what  follows,  we
will  orient  R
n
in  this  way.
5
If  E  = e
1
, . . ., e
m
  and  E
= e
1
, . . ., e
m
  are ordered bases for the  same vector space  V ,  the  change of basis
matrix  [I]
E
E
   has  as  its  jth  column  the  expression  of  the  vector  e
j
  with  respect  to  the  basis  E
.   The  crucial
property of this  matrix is that if  v  V  and we denote by [v]
E
  and [v]
E
   the  columns of coecients of  v  with
respect  to  bases  E  and  E
,  then  [v]
E
   = [I]
E
E
 [v]
E
.
53
2.   Let V  be any 3-dimensional vector space and let E  := e
1
, e
2
, e
3
 be any basis.   As ordered
bases,   E  and  E
:=  e
3
, e
1
, e
2
  are  dierent:   however,   the  change  of  basis  matrices  [I]
E
E
and  [I]
E
E
  are
_
_
0   0   1
1   0   0
0   1   0
_
_
  and
_
_
0   1   0
0   0   1
1   0   0
_
_
with determinant  1, so  the bases  E  and E
is
the  ordered  basis  e
1
, e
3
, e
2
,  then  the  change  of  basis  matrices  [I]
E
E
   and  [I]
E
E
  are  both
equal  to
_
_
1   0   0
0   0   1
0   1   0
_
_
with  determinant  1.   So  E  and  E
are  inequivalent.
3.   If   V   is  a  1-dimensional   space,   an  orientation,   in  the  sense  just  dened,   allows  us  to
draw  an  arrow:   it  points  in  the  direction  of   the  (unique)   vector  of   a  positive  basis.
Conversely,   an  arrow  allows  us  to  determine  an  orientation:   a  basis  v  is  positive  if  v
points  in  the  direction  of  the  arrow.
4.   If   V   is  a  2-dimensional   space,   an  orientation  allows  us  to  determine  a  positive  di-
rection  of  rotation  at  each  point:   it  is  the  direction  of  shortest  rotation  from  the  rst
vector   of   a  positive   ordered  basis   to  the  second.   This   choice  of   direction  does   not
depend  on  the  positive  ordered  basis  chosen.   For  let  E  =  v
1
, v
2
  and  F  =  w
1
, w
2
  be
two  ordered  bases.   Let  R  be  a  rotation  taking  w
1
  to  a  positive  multiple  of   v
1
.   The
basis  RF  :=  Rw
1
, Rw
2
  is  equivalent  to  the  basis  w
1
, w
2
,   since  a  rotation  has  positive
determinant.   So
E     F      E     RF.
The  change  of  basis  matrix  [I]
E
RF
  has  the  form
_
  
11
  
12
0   
22
_
.
As  
11
  > 0,  this  matrix  has  positive  determinant  if  and  only  if  
22
  > 0.   But  it  is  clear
from the  diagram below  that  this  is  also  the  condition  that  the  shortest  rotation from
Rw
1
  to  Rw
2
  should  be  in  the  same  direction  as  the  shortest  rotation  from  v
1
  to  v
2
.
54
v
2
v
1
v
1
v
2
Rw
2
Rw
1
Here E  and RF are inequivalent.
From  now  on  we  will  use  basis  to  denoteordered  basis.   2
Denition  3.2.   (continued) (ii) Let W
1
  and W
2
  be oriented vector spaces.   An isomorphism
  :  W
1
    W
2
  preserves  orientation  if  for  some  (and  hence  for  any)  positive  basis  e
1
, . . ., e
n
of  W
1
,   (e
1
), . . ., (e
n
)  is  a  positive  basis  for  W
2
.   If    does  not  preserve  orientation,   then  it
reverses  it.
Exercise  3.4.   (a)  Show  that    preserves  orientation  if  and  only  if  for  some  (and  hence  for
any)  positive  bases  E
1
  of  W
1
  and  E
2
  of  W
2
,  det[]
E
1
E
2
  > 0.
(b) If W
1
  = W
2
  then show that the sign of det[]
E
E
  is independent of the choice of basis E.   2
Denition  3.2.   (continued)(iii)   Let   V
1
  and  V
2
  be   open  sets   in   R
m
.   A  dieomorphism
h : V
1
    V
2
 preserves orientation at a point y in V
1
 if d
y
h :  R
m
  R
m
preserves orientation.   If
h preserves orientation at every point y  V
1
 then we simply say that it preserves orientation.
Now,   nally,   we  are  in  a  position  to  dene  an  orientation  on  a  manifold.   Essentially,
an  orientation  on  a  manifold  M  is  an  orientation  for  each  tangent  space  T
x
M,   but  which
must, in some sense, vary continuously from one tangent space point to another.   There are
several  equivalent  ways  of  making  this  vague  requirement  precise.
For the rst, think of an orientation on each tangent space as a rule for deciding whether
a  basis   of   that   space   is   positive   or   negative.   We   dont   know  yet   what   it   means   to  say
that  a  rule  varies  continuously  from  one  tangent  space  to  another.   Less  problematic  is  the
notion  of   a  continuous   family  of   bases.   For  example,   for  each  point  x  in  the  hemisphere
(x
1
, x
2
, x
3
)  S
2
: x
3
  > 0,  the  vectors
v
1
(x) = (x
3
, 0, x
1
), v
2
(x) = (0, x
3
, x
2
)
form  a  basis  for  T
x
S
2
.   Evidently  v
1
  and  v
2
  vary  continuously,  so  it  is  reasonable  to  say  that
the  basis  E(x)  consisting  of v
1
(x), v
2
(x)  is  a  continuous  family  of  bases.   An  orientation  of  a
manifold  is  an  orientation  for  each  tangent  space  T
x
M,   which  knit  together  in  the  sense
that they all give the same sign to each basis in a continuous family of bases.   More precisely,
Denition  3.5.   Let  M   R
n
be  a  manifold.
(i) A tangent vector eld (usually abbreviated to vector eld) on U   M  is a map v : U    R
n
55
such  that  for  every  x  U,  v(x)  T
x
M.
(ii)  A  continuous  family  of   bases,   or  tangent  frame  eld,   on  a  set  U    M  is  an  m-tuple  of
continuous tangent vector elds on U, v
1
, . . ., v
m
, such that for every x  U, v
1
(x), . . ., v
m
(x)
is  a  basis  for  T
x
M.
Example  Suppose    :  U    V   is  a  chart  on  M
m
.   Let  e
1
, . . ., e
m
  be  the  standard  basis  for
R
m
.   For  each  j  and  for  each  x  U,  let
v
j
(x) := (d
x
)
1
(e
j
).
Then  v
j
  is  a  tangent  vector  eld  on  U.
Moreover,  since  e
1
, . . ., e
m
  are a  basis for  R
m
, v
1
(x), . . ., v
m
(x) is a basis  for T
x
M  for each
x  U.   Thus  v
1
, . . ., v
m
  is  a  continuous  (even  smooth)  tangent  frame  eld  on  U.
(iii) An orientation of a manifold M  is a choice of orientation for each T
x
M  with the following
property:   whenever  U    M  is  connected  and  v
1
, . . ., v
m
  is  a  continuous  tangent  frame  eld
on  U,   then  either  for  all  x  in  U,   v
1
(x), . . ., v
m
(x)  is  a  positive  basis  for  T
x
M,   or  for  all  x  in
U  v
1
(x), . . ., v
m
(x)  is  a  negative  basis  for  T
x
M.
Remark  3.6.   If v
1
, . . ., v
m
  and v
1
, . . ., v
m
  are tangent frame elds on the connected open set
U   M,  let  E(x)  denote  the  basis  v
1
(x), . . ., v
m
(x)  for  T
x
M,  and  let  E
1
(x), . . ., v
m
(x).   The  determinant  det[I]
E(x)
E
(x)
  varies  smoothly,   and  so  always  has  the  same
sign  on  U.   Hence,  we  need  only  check  that  (iii)  holds  for  one  frame  eld  on  U    if  it  does,
then  it  holds  for  every  frame  eld.
Consider  once  again  the  sphere  S
2
 R
3
.   We  dene  an  orientation  on  each  T
x
S
2
by  the
following  rule:
the  basis  v
1
, v
2
  of  T
x
S
2
is  positive  if  the  basis  x, v
1
, v
2
  of   R
3
is  positive
6
.
Let  us  see  that  this  meets  the  requirement  of   3.5(iii).   Suppose  that  U    S
2
is  connected
and  that  v
1
, v
2
  are  continuous  vector  elds  on  U  such  that  for  all   x    U,   v
1
(x), v
2
(x)  is  a
basis  for T
x
S
2
.   Then  the function det(x, v
1
(x), v
2
(x)) is continuous,  and nowhere  zero  on U.
Hence  it  has  the  same  sign  everywhere  on  U.   If  it  is  everywhere  positive,   then  v
1
(x), v
2
(x)
is  a  positive  basis  of  T
x
S
2
for  all  x  in  U.   If  it  is  everywhere  negative,   then  v
1
(x), v
2
(x)  is  a
negative  basis  of  T
x
S
2
for  all  x  in  U.   Thus,  we  have  succeeded  in  orienting  S
2
.
Not  every  manifold  can  be  oriented.   Below  we  will   see  that  the  Mobius  strip  cannot.   We
say  M  is  orientable  if  it  can  be  oriented,  and  non-orientable otherwise.
Exercise  3.7.   Here  is  a  slight  variant  on  the  preceding  denition.   It  is  based  on  the  intu-
itively  appealing  idea  that  tangent  spaces  at  nearby  points  are  themselves  close  enough  to
one  another  that  orthogonal  projection  from  one  to  the  other  is  an  isomorphism.
6
with  respect  to  the  standard  orientation  dened  in  3.3(1)
56
(i) Show that if M   R
n
is smooth, then for each x
0
  M, there is a connected neighbourhood
U
x
0
  of  x
0
  such  that  for  all  x  U
x
0
,  the  orthogonal  projection
x
0
  : T
x
M    T
x
0
M
is  an  isomorphism.
(ii)  Show  that  an  assignment  of  orientations  to  each  tangent  space  T
x
M  satises  the  com-
patibility  condition  of  3.5(iii)  if  and  only  if  for  all  x
0
    M  and  x    U
x
0
,   each  of  the  linear
isomorphisms  
x
0
  : T
x
M    T
x
0
M  preserves  orientation.   2
Proposition  3.8.  (i) Let M
m
be a manifold.   Then M  is orientable if and only if there is an
atlas  
: U
  R
m
  such  that  whenever  U
(U
(U
)
preserves  orientation.
We  will  call  an  atlas  with  this  property  an  orienting  atlas.
Proof   Suppose  rst  that  M  is  oriented.   Take  any  atlas  
: U
  in  which  all   of
the U
1
(e
1
), . . ., d
x
1
(e
m
)  form  a  frame  eld,   E
, on U
.   Thus,
either  they  are  a  positive  basis  for  T
x
M  for  all   x    U
by R
,  where  R  :   R
m
  R
m
is  a  reection.   Now
all  of  the  frame  elds  E
]
E
E
  is  the  identity  matrix
I
n
.   Since
[d
(x)
(
  )]
E
E
  = [d
x
]
E
E
  [I]
E
E
[d
x
]
1
= [I]
E
E
it follows that
  : T
x
M    T
x
R
m
pre-
serves  orientation.   Because  all   of   the  crossover  maps  preserve  orientation,   this  instruction
is  unequivocal.   Now  each  frame  eld  E
N
  : S
2
N    R
2
and 
S
  : S
2
S    R
2
discussed  in Example  1.2.   The  crossover  map
S
 
1
N
  :   R
2
0    R
2
0 is inversion  i in the unit circle,  given given in coordinates by
i(x
1
, x
2
)  =  (
  x
1
x
2
1
+x
2
2
,
  x
2
x
2
1
+x
2
2
).   This  map  does  not  preserve  orientation:   its  derivative  at  (x
1
, x
2
)
has  matrix
1
x
2
1
 + x
2
2
_
  x
2
1
x
2
2
  2x
1
x
2
2x
1
x
2
  x
2
2
x
2
1
_
with  negative  determinant.   So  
N
, 
S
  is  not  an  orienting  atlas.   If  we  compose  
N
  with  a
reection such as r(x
1
, x
2
) = (x
1
, x
2
), we get a new atlas, r 
N
, 
S
 which is an orienting
57
atlas.
The fact that inversion in the unit circle reverses orientation can be understood geometrically:
if   x  grows  radially,   x/|x|  shrinks  radially.   If   x  moves  anticlockwise  around  the  origin,   so
does x/|x|.   So  the  image, under  the derivative  d
x
i,  of the positive  basis shown,  is  as shown
in  the  following  diagram.
i(x)
x
1
2
1
2
Philosophical Remark 3.10.  Formally, the way we obtain an orientation from an orienting
atlas  in  the  second  paragraph  of  this  proof  has  some  similarity  with  our  denition  of  angle
on the sphere S
n
, in the third part of Example 1.2.   In both cases we dene something on the
manifold by means of charts.   For this denition to be unequivocal,  we have to show that we
get  the  same  answer  whichever  chart  we  use.   In  Example  1.2,   we  dened  angle  between
two  curves  on  the  sphere  S
n
by  using  stereographic  projection  (
N
  or  
S
)  to  transport  the
two  curves  to  R
n
,   where  we  measure  the  angle  between  them  in  the  usual   way.   This  made
sense  because  the  crossover  dieomorphisms  
N
   
1
S
  and  
S
 
1
N
  preserve  angles,  so  that
the answer we get applying 
N
  is the same as the answer we get applying 
S
.   Of course, there
we  couldnt  use  just  any  charts:   our  denition  relies  on  the  choice  of  two  particular  charts.
We say that our atlas 
N
, 
S
 equips S
n
with a conformal structure, a structure in which we
can  measure  angles.   Returning  to  orientation:   once  we  have  an  atlas  in  which  all  crossover
dieomorphisms   preserve  orientation,   we  can  use  the  charts   to  orient   the  manifold  in  an
unequivocal   way.   We  might  say  that  such  an  atlas  equips  M  with  an  oriented  structure,
though  in  practice  the  term  is  not  used.
Example  3.11.   (i)  Suppose  that  f  :   R
m+1
  R  has  t  R  as  regular  value,  and  f
1
(t) ,= .
Then  we  can  orient  M  :=  f
1
(t)  as  follows:   the  basis  v
1
, . . ., v
m
  of   T
x
M  is  positive  if   the
basis f(x), v
1
, . . ., v
m
  of  R
m+1
is positive.   Note that this generalises the method used above
to  orient  S
2
.
(ii)   More   generally,   if   f   :   R
m+c
  R
c
has   t   as   a  regular   value   and  f
1
(t)   ,=  0  then  we
can  orient   M  :=  f
1
(t)   as   follows:   the   basis   v
1
, . . ., v
m
  of   T
x
M  is   positive   if   the   basis
f
1
(x), . . ., f
c
(x), v
1
, . . ., v
m
  of   R
m+c
is  positive.
(iii) The  Mobius  strip is  not orientable.   To  see  this,  think  of  it as  obtained from a rectangle
58
by  gluing  together  opposite  ends  after  a  half-twist,  as  shown  here.   Suppose  it  is  orientable.
Because  of   the  half   twist,   the  smooth  frame  eld  shown  on  the  connected  set   U
2
  in  (iii)
must  appear  as  shown  in  (ii)  when  U
2
  is  depicted  on  M.   But  now  we  have  a  contradiction:
the  frame  elds   on  U
1
  and  U
2
  must   each  be  everywhere   positive  or  everywhere   negative.
However,  the bases  they provide in T
x
M  are equivalent  for x in the right hand overlap of U
1
and  U
2
,  but  inequivalent  for  x  in  the  left  hand  overlap.
1
2
1
2
1
2
1
2 1
2
1
2
1
2
1
2
1
2
1
2
1
2
1
2
1
2
U
1
U
1
1
2
1
2
1
2
1
2
1
2
1
2
1
2
1
2
1
2
1
2 1
2
U 
2
(iii)
(ii)
(i)
U
2
U
2
(iv) The non-orientability of the Mobius strip has an interesting consequence for the manifold
M  we   obtained  as   the   image   of   the   quadratic   map  S
2
  R
6
in  Example   1.41(ii):   it   is
impossible  to  dene it by a  single set  of regular equations.   That is,  it is  impossible  to nd a
set U  open in  R
6
containing M  on which there is a smooth map g to  R
4
such that M  = g
1
(0)
and  0  is  a  regular  value  of  g.   For  I  claim  that  M  contains  a  Mobius  strip,   and  is  therefore
non-orientable.   The  claim  is  essentially  proved  by  the  following  picture.
59
.
For since  f  identies  antipodal points  on the sphere,  we get all of M  as the image of  the
closed  upper  hemisphere.   The  map  f  is  1-1 on  the  interior  of  the  hemisphere,  and  identies
antipodal  boundary  points.   Therefore  the  image  of  the  strip  shown  is  a  Mobius  strip  in  M.
Because  M  is therefore not orientable,  by (ii) above it cannot be globally dened  by regular
equations.
(vi) Product Orientation If M  and N  are oriented manifolds, the product MN  acquires
an  orientation  as  follows.   First,  for  (x, y)  M  N,
T
(x,y)
(M  N) = T
x
M  T
y
N.
We  determine  an  orientation  of  T
(x,y)
(M  N)  by  taking  a  positive  basis  E  = v
1
, . . ., v
m
  for
T
x
M  and  a  positive  basis  F  = w
1
, . . ., w
n
  for  T
y
N  and  declaring  that  the  basis
(v
1
, 0), . . ., (v
m
, 0), (0, w
1
), . . ., (0, w
n
)
(which we denote by (E, F)) is a positive basis for T
(x,y)
MN.   If we chose dierent positive
bases  E
for  T
x
M  and  F
for  T
y
N,  then  the  change  of  basis  matrix  [I]
(E,F)
(E
,F
)
  satises
[I]
(E,F)
(E
,F
)
  =
_
  [I]
E
E
   0
0   [I]
F
F
_
and  thus
det[I]
(E,F)
(E
,F
)
  = det[I]
E
E
 det[I]
F
F
   > 0.
So  (E, F)  and  (E
, F
such that if v
1
, v
2
  is a positive basis then n(x), v
1
, v
2
  is a positive basis for  R
3
.
Equally,   n(x)  =  v
1
 v
2
/|v
1
 v
2
|.   Reciprocally,   if  S  is  two-sided,   then  selecting  one  of
the  two  sides,   we  dene  a  nowhere-vanishing  normal  vector  eld  by  choosing  at  each  point
the  unit  normal  pointing  into  the  chosen  side.   This  can  then  be  used  to  orient  the  surface,
playing  the  same  role  as  the  gradient  vector  f  in  Example  3.11(i).
Recall  that  a  zero-dimensional  manifold  is  just  a  collection  of  discrete  points.
Denition  3.14.   An  orientation  of   a  zero-dimensional   manifold  is  simply  the  assignation
of  a  sign,  +  or  ,  to  each  point.
Thus,   exactly  as  for  a  connected  positive-dimensional   orientable  manifold,   a  connected  0-
dimensional   manifold  has  two  possible  orientations.   This  denition  of   orientation  is  a  bit
dicult  to  reconcile  propter  hoc with  the  denition  of  orientation  for a  positive  dimensional
manifold,  but  we  will  shortly  see  that  it  ts  beautifully  into  the  intersection  theory  we  will
develop.
Now, returning  to the original theme  of this chapter,  we will dene the oriented intersection
number   of   two  compact   oriented  submanifolds   of   complementary  dimension  which  meet
transversely  in  an  oriented  manifold.
Denition  3.15.  (i) Suppose U  and V  are complementary subspaces of a nite dimensional
real   vector  space  W.   Suppose  that  all   three  spaces  are  oriented.   If  u
1
, . . ., u
m
  is  a  positive
basis for U  and v
1
, . . ., v
nm
  is a positive basis for V , then we say that U  and V  have positive
intersection  if  u
1
, . . ., u
m
, v
1
, . . ., v
nm
  is  a  positive  basis  for  W,   and  negative  intersection  if
it  is  a  negative  basis.
(ii)  Suppose  that  M
m
and  Z
nm
are  oriented  submanifolds   of   the  oriented  manifold  N
n
,
with  M  compact   and  Z  closed.   Suppose  also  that  M    Z.   Then  we  dene  the  oriented
intersection  number  of  M  and  Z  at  x,  (M   Z)
x
,  as  follows:
(M   Z)
x
  =
_
  +1   if  T
x
M  and  T
x
Z  have  positive  intersection  in  T
x
N
1   if  T
x
M  and  T
x
Z  have  negative  intersection  in  T
x
N
  (3.1)
We  dene  the  oriented  intersection  number  of  M  and  Z  in  N,  (M   Z)
N
,  as
(M   Z)
N
  =
xMZ
(M   Z)
x
.
(iii)  A  mild  extension  of  the  previous  denition:   we  do  not  suppose  that  M
m
is  a  subman-
ifold  of   N
n
,   but   simply  that   f   :   M    N  is   a  smooth  map  which  is   transverse   to  Z
nm
61
(with  M, N  and  Z  oriented,   and  M  compact  and  Z  closed  as  before).   As  the  dimensions
are  complementary,   for  each  x    f
1
(Z)  the  derivative  d
x
f   :   T
x
M    T
f(x)
N  must  be  an
injection.   We  dene
(f   Z)
x
  =
_
  +1   if  d
x
f(T
x
M)  and  T
f(x)
Z  have  positive  intersection  in  T
f(x)
N
1   if  d
x
f(T
x
M)  and  T
f(x)
Z  have  negative  intersection  in  T
f(x)
N
  (3.2)
We  dene  the  oriented  intersection  number  of  f  and  Z,  (f   Z)
N
,  as
(f   Z)
N
  =
xf
1
(Z)
(f   Z)
x
.
Observe  that  (ii)  is  the  special   case  of   (iii)  in  which  f   is  the  inclusion  of   the  submanifold
M  into  N.
Remark 3.16.  Recall that an oriented 0-dimensional manifold is just a collection of discrete
points  with  a  sign  (+  or )  attached  to  each  one.   In  the  circumstance  described  in 3.15(ii),
X    Z  acquires   a  natural   orientation  -   simply  assign  to  each  point   the   sign  dened  by
(3.1).   Thus,  for  example  in  the  following  picture,  in  which  R
2
has  the  standard  orientation,
C
1
C
2
  = P  +QR+S, and (C
1
 C
2
)
R
2  = 0.   Note that C
2
C
1
  = P Q+RS; with
this  convention,  intersection  has  become  non-commutative!
1
2
1
S
P
R
Q
C
C
2
By  means  of  a  similar  convention,  using  (3.2),  we  can  orient  f
1
(Z)  in  3.15(iii).
Exercise  3.17.   (i)  Show,   as  we  must  do  in  order  for  the  last  parts  of   Denition  3.15  to
make  sense,  that  with  the  given  hypotheses  f
1
(Z)  is  nite.
(ii)   Equally  necessary,   show  also  that   in  part   (i)  of   Denition  3.15,   the  sign  of   the  basis
u
1
, . . ., u
m
,   v
1
, . . ., v
nm
  of   W  is  independent   of   which  positive  bases   u
1
, . . ., u
m
  of   U  and
v
1
, . . ., v
nm
  of  V   we  choose.   One  way  to  do  this  is  the  following:   suppose  that  E
U
  and  E
U
are  two  bases  for  U,   and  E
V
  and  E
V
  are  two  bases  for  V .   Denote  by  E  the  basis  for  W
obtained  by  putting  together  E
U
  and  E
V
  (as  described  in  part  (i)  of  the  denition),  and  by
E
U
  and  E
V
.   Express  the  matrix  [I]
E
E
   in
terms  of  the  matrices  [I]
E
U
E
U
and  [I]
E
V
E
V
.   2
62
It  remains  to  extend  our  denitions  to  cover  the  case  where  M  and  Z  are  not  transverse.
As  I  said  earlier,  the  idea  is  to  move  M  in  a  homotopy,  to  a  position  where  it  is  transverse
to  Z,   and  then  use  denition  3.15(ii).   In  fact  the  language  of   (iii)  allows  a  more  exible
description.   In  these  terms,   we  dene  an  oriented  intersection  number  (f   Z)
N
  even  when
f   is  not  transverse  to  Z,   by  perturbing  f   in  a  homotopy  to  a  map  f
1
  which  is  transverse
to  Z,  calculating  (f
1
 Z)
N
  using  part  (iii)  of  Denition  3.15,  and  then  dening  (f   Z)
N
  to
be  (f
1
   Z)
N
.   But   for  this  to  make  sense,   we  have  to  show  that   it  doesnt   matter  which
transverse  perturbation  of f  we  take.   The  relation  of  homotopy  is transitive,  so  if f
1
  and f
2
are  both  homotopic  to  f,  then  f
1
  is  homotopic  to  f
2
,  and  thus  it  will  be  enough  to  show
Theorem  3.18.   If   f
1
  :   M    N  is   homotopic  to  f
2
  :   M    N,   and  both  f
1
  and  f
2
  are
transverse  to  Z,  then  (f
1
 Z)
N
  = (f
2
 Z)
N
.
The  proof of this will  occupy some  time.   As a rst step,  recall that a homotopy between
f
1
  : M    N  and f
2
  : M    N  is a smooth map F  : M[0, 1]    N  such that F(x, 0) = f
1
(x)
and  F(x, 1)   =  f
2
(x).   In  particular,   its   domain  is   not   a  manifold.   We  begin  by  slightly
enlarging  the  denition  of  manifold  to  incorporate  such  objects.
Denition  3.19.   (i)  H
m
is  the  half-space  (x
1
, . . ., x
m
)    R
m
:  x
m
    0.   The  boundary  of
H
m
,  H
m
,  is  the  set  (x
1
, . . ., x
m
)  R
m
: x
m
  = 0.
(ii)  A  subset  M    R
n
is  an  m-dimensional   manifold  with  boundary  if  every  point  in  M  has
a  neighbourhood  dieomorphic  to  an  open  set  in  H
m
.   If  M
m
is  an  m-dimensional  manifold
with boundary, the boundary of M, which we denote by M, is the set of points x  M  such
that  in  some  chart   : U    H
m
,  (x)  H
m
.
Example  3.20.   (i)  The  closed  unit  ball
  
B
m
:=  x    R
m
:   |x|    1  is  an  m-dimensional
manifold with boundary,  with 
 
B
m
= S
m1
.   The  intervals [a, b]  and [a, b) are 1-dimensional
manifolds  with  boundary.   [a, b] = a, b,  and  [a, b) = a.
(ii) Every manifold is a manifold-with-boundary (though with empty boundary), since every
open  set  in  R
m
is  dieomorphic  to  an  open  set  in  H
m
.   On  the  other  hand,  a  manifold  with
boundary  is  only  a  manifold  if  its  boundary  is  empty.
(iii)Exercise  If   M
m
is  a  smooth  manifold  and  f   :   M    R  has  t
0
  as  regular   value,   then
f
1
((, t
0
])  and  f
1
([t
0
, ))  are  both  manifolds  with  boundary,   and  each  has  boundary
equal   to  f
1
(t
0
).   Hint:   f
1
((, t
0
)  is  open  in  M,   so  each  point  of   f
1
((, t
0
)  has  a
neighbourhood  dieomorphic  to  an  open  set  in  R
m
,   and  thus  to  an  open  set  in  H
m
.   For
points  in  f
1
(t
0
),  use  the  local  normal  form  of  a  submersion.
(iii)  Since   R
m
 H
n
=  H
m+n
,   the  cartesian  product  of  an  m-dimensional   manifold  and  an
n-dimensional  manifold  with  boundary  is  an  (m + n)-dimensional  manifold  with  boundary.
In  particular  this  applies  to  M  [0, 1].   2
63
Proposition  3.21.   (i)  Suppose  that   M
m
is   a  manifold  with  boundary  and  that   for  some
chart    :  U    V     H
m
,   (x)    H
m
.   Then  for  every  other  chart    on  M  whose  domain
contains  x,  (x)  H
m
.
(ii)  M,  if  not  empty,  is  a  manifold  of  dimension  m1.
Proof   (i)  If  for  some  chart  ,  (x)  is  not  in  H
m
,  then  there  is  a  neighbourhood  of  (x)
in H
m
which  is  in  fact open in  R
m
.   By  the inverse  function theorem,  if U  is open  in  R
m
and
 : U    V   R
m
is  a  dieomorphism  then  V  is  open  in  R
m
.   It  follows  that  for any  chart  ,
(x)  has  a  neighbourhood  in  H
m
which  is  open  in  R
m
.   Hence  (x)  /  H
m
.
(ii)  For  any  chart    :   U    V     H
m
,   (U   M)  =  V   H
m
(we  are  using  (i)  here).   As
|UM
  has inverse  (
1
)
|V H
m, and V  H
m
is an open set in  R
m1
, each point in M  has
a  neighbourhood  dieomorphic  to  an  open  set  in  R
m1
.   2
Notice  that  we  have  shown  quite  clearly  that  M  is  a  manifold  without  boundary.
Example  3.22.   (i)  If  M  is  a  manifold  and  N  is  a  manifold  with  boundary  then  M  N  is
a  manifold  with  boundary,  and  (M  N) = M  N.
(ii) If both M  and N  are manifolds with (non-empty) boundary then MN  is not a manifold
with  boundary.   Consider  for  example  the  product  of  two  closed  intervals.   2
If M  is an oriented manifold with boundary then M  inherits an orientation from M, the
boundary  orientation.   The  construction  of   the  boundary  orientation  is  very  similar  to  the
construction  of  an  orientation  using  the  gradient  vector  of  a  dening  equation  in  Example
3.11(i).
Denition  3.23.   (i)  Let  y  H
m
.   We  set
T
y
H
m
= T
y
R
m
T
out
y
  H
n
= (v
1
, . . ., v
n
)  T
y
H
n
: v
n
  < 0.
(ii)  If  M
m
is  a  manifold  with  boundary  and  x   M,   let    be  a  chart  on  M  around  x  and
let  (x)  =  y.   Dene  T
x
M  =  (d
y
1
)(T
y
H
m
)  and  T
out
x
  M  =  (d
y
1
)(T
out
y
  H
m
).   We  call   the
vectors  of  T
out
x
  M  outward  pointing  tangent  vectors.
Note that here 
1
is not smooth (old) at the points of V H
m
(where the domain of 
1
contains no set open in  R
m
), so we are cheating a little when we speak of its derivative d
y
1
.
In  order  to  speak  of  d
y
1
we  really  need  to  make  use  of  a  local  smooth  (old)  extension  of
1
on  the  ambient   R
m
,   and  use  its  derivative
  7
at  y.   This  is  OK,   provided  this  derivative
7
Of course, in practice  
1
will usually be given in terms of local ambient coordinates, and so we will not
have  to  struggle  to  nd  such  a  local  extension.
64
is  independent  of  the  choice  of  local  extension.   In  fact  this  is  easy  to  see.   For  every  vector
v  T
y
R
m
=  R
m
,  either  v  or  v  is  equal  to
lim
t     0+
(t) (0)
t
  (3.3)
(one  sided  limit),  for some  smooth  curve    : (, )    R
m
taking  0  to  y  and  taking  [0, ) to
V .   Thus if F  is any smooth (old) extension of 
1
around y, then either  d
y
F(v) or d
y
F(v)
is  equal  to
lim
t     0
F((t)) F((0))
t
  =   lim
t     0+
F((t)) F((0))
t
  =   lim
t     0+
1
((t)) 
1
((0))
t
and thus is independent of choice of F.   As d
y
F(v) = d
y
F(v), for every vector v  the value
of  d
y
F(v)  does  not  depend  on  the  choice  of  F.
In fact this still leaves the question of whether our denition is independent of the choice
of   chart  ,   exactly  as  in  the  denition  of   T
x
M  for  a  manifold  (i.e.   without  boundary),   in
1.11.   But this can be dealt with by the same argument as in 1.11, adapted by the trick weve
just  used  to  deal  with  half-spaces.
With  the denition  of the  outward pointing  tangent  vectors,  once again there is  an issue
of  well-denedness:   this  is  resolved  by
Lemma  3.24.   Suppose  that  x  M  and  that    is  a  chart  on  M  around  x,  with  (x) = y 
H
m
.   Then
(d
y
1
)(T
out
y
  H
m
) = 
1  S
1
.   In  the  left  hand  picture,   we  see
that (v, 0) is  an  outward-pointing  vector.   Moreover,  (v, 0), (0, w) is  a  positive  basis  for
T
(0,y)
[0, 1]  S
1
:   it  is  equivalent  to  the  positive  basis  (v, 0), (0, w),  since  the  change  of  basis
matrix  is  the  diagonal  matrix  with  both  diagonal  entries  1,  and  has  positive  determinant.
Hence  (0, w)  is  a  positive  basis  for  T
(0,y)
S
1
.   On  the  other  hand,   by  a  similar  argument,
(0, w) is  a  positive  basis  for  T
(1,y)
([0, 1] S
1
),  for  now  (v, 0) is  an  outward  pointing  vector.
(1,y)
(0,w)
(v,0)
(0,w)
(v,0)
(0,y)
Boundary orientation at (0,y)
Boundary orientation at (1,y)
2
Notice that the arrows on the right and left hand edges of the cyclinder point  in  opposite
directions.   This  is  a  general  phenomenon.
If M  is any manifold, then ([0, 1]M) = 0M
1M.   Let i
0
  : M    ([0, 1]M)
and  i
1
  :   M    ([0, 1]  M)  be  the  maps  i
0
(x)  =  (0, x),   i
1
(x)  =  (1, x).   Evidently  i
0
  is  a
66
dieomorphism  onto  0 M  and  i
1
  is  a  dieomorphism  onto  1 M.
Proposition  3.27.   Let   M  be  an  oriented  manifold  (without   boundary)  and  let   [0, 1]   have
the  usual   orientation,   in  which  1    T
t
[0, 1]   is  a  positive  basis  for  all   t.   Let  [0, 1]  M  have
the  product   orientation,   and  then  give  ([0, 1]  M)   =  0  M
1  M  the  boundary
orientation.   Then  i
1
  preserves  orientation  and  i
0
  reverses  orientation.
Proof   Let v
1
, . . ., v
m
  be a positive basis for T
x
M.   By denition of the product orientation,
(1, 0), (0, v
1
), . . ., (0, v
m
)   is  a  positive  basis  for   T
(t,x)
[0, 1] M  for  all   t.   (3.4)
We  have
T
out
(0,x)
 ([0, 1] M) = T
out
0
  [0, 1] T
x
M.
So
(1, 0)  T
out
(0,x)
([0, 1] M).
Thus
d
x
i
0
(v
1
), . . ., d
x
i
0
(v
m
) = (0, v
1
), . . ., (0, v
m
)   is  a  negative  basis  for   T
(0,x)
([0, 1] M);
by  denition  of  the  boundary  orientation,  it  would  only  be  positive  if
(1, 0), (0, v
1
), . . ., (0, v
m
)   were  a  positive  basis  for  T
(0,x)
_
[0, 1] M
_
,
and  this  contradicts  (3.4).   Hence  i
0
  reverses  orientation.
On  the  other  hand
(1, 0)  T
out
(1,x)
([0, 1] M),
so,  again  by  (3.4),
d
x
i
1
(v
1
), . . ., d
x
i
1
(v
m
) = (0, v
1
), . . ., (0, v
m
)   is  a   positive  basis  for   T
(1,x)
([0, 1] M).
Hence  i
1
  preserves  orientation.   2
Remark  3.28.   The  previous  result  can  be  restated  as
_
[0, 1] M
_
 = 1 M  0 M.   (3.5)
Here  we  orient  0 M  and 1 M  via  their natural identication  with  M,  and  interpret
the  minus  sign  to  indicate  opposite  orientation.   A  special  case  of  all this  is  the  statement
that
[a, b] = b a,   (3.6)
in  which  a  and  b  are  not  to  be  regarded  as  numbers  but  as  points  in  the  manifold  R,   and
the  minus  sign  refers  to  the orientation  of the  point a,  as in  Denition 3.14.   From (3.5) and
(3.6)  we  see  that  in  this  arithmetic  of  manifolds,   if  M  is  a  manifold  (without  boundary)
then
_
[0, 1] M
_
=
_
[0, 1]
_
M.   (3.7)
67
Now  weve  dened  the  boundary  orientation,  we  can  state  and  prove  the  main  theorem
on oriented intersection numbers, from which great consequences, and in particular Theorem
3.18,  will  promptly  follow.
Theorem  3.29.   Suppose  that  W
m+1
is  a  manifold  with  boundary,  F  : W    N
n
is  smooth
and  Z
nm
is   a  submanifold  of   the  manifold  N,   with  W  compact,   Z  closed  in  N,   and  all
manifolds  oriented.   Give  W  the  boundary  orientation,  and  denote  by  F  the  restriction  of
F  to  W.   Suppose  that  F    Z.   Then  (F   Z)
N
  = 0.
Before  proving  this,  we  re-state  3.18  as
Corollary  3.30.   Suppose  that  f
0
  and  f
1
  are  smooth  maps  from  M
m
  N
n
,  both  transverse
to  Z
mn
 N.   Suppose  that  M  is  compact  and  Z  is  closed  in  N,  and  that  all  manifolds  are
oriented.   Then
(f
0
 Z)
N
  = (f
1
 Z)
N
.
Proof of corollary:   Let F  : [0, 1]    N  be a homotopy from f
0
 to f
1
.   Note that f
0
  = F i
0
and  f
1
  = F   i
1
.   Apply  3.29,  taking  W  = [0, 1] M.   We  have
0 = (F   Z)
N
  = (F
| {1}M
   Z)
N
  + (F
| {0}M
   Z)
N
.
Since  i
1
  : M    1 M  is  an  orientation  preserving  dieomorphism,
(F
| {1}M
   Z)
N
  = ((F   i
1
)  Z)
N
  = (f
1
 Z)
N
.
Since  i
0
  : M    0 M  is  an  orientation  reversing  dieomorphism,
(F
| {0}M
   Z)
N
  = ((F   i
0
)  Z)
N
  = (f
0
 Z)
N
.
Therefore
(f
1
 Z)
N
  (f
0
 Z)
N
  = 0.
2
Proof  of  3.29:   Since  F  is  already transverse  to Z,  it is  possible  to perturb  F  so  that
it  becomes  transverse  to  Z  without  changing  F.   The  proof  of  this  is  a  simple  modication
of   the  proof   of   the  Weak  Transversality  Theorem  2.13.   It  involves  one  technique  we  have
not  yet  described,   and  will   be  left  as  an  exercise  when  this  technique  (gluing  maps  using
partitions  of  unity)  is  introduced.
Now  we  need
Lemma  3.31.   Suppose  F  :   W
p
  N
n
is  a  smooth  map,   and  Z  is  a  submanifold  of   N  of
codimension  k.   If   F    Z  and  F    Z,   then  F
1
(Z)  is  a  p  k-dimensional   manifold  with
boundary,  and
(F
1
(Z)) = (F)
1
(Z).
68
Proof  of  lemma  Exercise  (see  Exercises  III)   2
By the lemma, F
1
(Z) is a compact 1-dimensional manifold with boundary equal to F
1
(Z)
W  =  (F)
1
(Z).   Every  compact   connected  1-dimensional   manifold  with  boundary  is
dieomorphic  either   to  [0, 1]   (if   it   has  non-empty  boundary)   or   to  S
1
(if   its   boundary  is
empty)
  8
.   Thus  every  component  of  F
1
(Z)  is  either  a  circle  or  an  interval.   The  situation
is  illustrated  in  the  following  picture.
I
2
W
I
1
W 
x
x
1
0
C
C
2
1
The  key  step  in  the  proof  is  to  show  that  at  points  x
0
  and  x
1
  of  F
1
(Z)  W  which  lie
on  the  same  connected  component  of  F
1
(Z)  (e.g.   as  shown  in  the  picture),  we  have
(F   Z)
x
0
  = (F   Z)
x
1
.
This is carried out by dening  a preimage  orientation on F
1
(Z), and showing that with
respect  to  this  orientation,  for  every  x  F
1
(Z)  W,
(F
1
(Z)  W)
x
  = (F   Z)
x
  (3.8)
Once  this  is  done,  the  theorem  will  follow.   For  observe  that  if  I  is  a  component  of  F
1
(Z)
with  I  W  = x
0
, x
1
,  then  however  I  is  oriented,
(I  W)
x
0
  = (I  W)
x
1
.
Suppose, for example, I  is oriented, as is I
1
  in the picture, so that the positive direction runs
from  x
0
  to  x
1
.   Let  v
1
  be  a  positive  basis  for  T
x
0
I,  and  v
1
  a  positive  basis  for  T
x
1
I.   Because
8
Though extremely plausible, this statement (the classication of compact 1-manifolds with boundary)
requires  proof:   see  e.g.   the  Appendix  to  Dierential  Topology  by  Guillemin  and  Pollack.   Its  not  hard.
69
F
1
(Z)    W  (this  is  a  consequence  of  having  F    Z),   neither  vector  is  tangent  to  W.
In  fact,   v
0
    T
out
x
0
  W  and  v
1
    T
out
x
1
  W.   It   follows,   by  denition  of   oriented  intersection
numbers  and  of  boundary  orientation,  that
(I  W)
x
0
  = 1,   (I  W)
x
1
  = +1.   (3.9)
If  the  opposite  orientation  is  chosen  for  I,   then  both  signs  are  reversed,   but  in  either  case
the  two  intersections  have  opposite  signs.
Thus,
(F   Z)
N
  =
xF
1
(Z)
(F   Z)
x
  =
xF
1
(Z)W
(F
1
(Z)  W)
x
.   (3.10)
The points of F
1
(Z)W  come in pairs, at opposite ends of the same connected component
of  F
1
(Z).   By  (3.9),   the  contributions  of  two  such  points  to  the  sum  in  (3.10)  cancel   one
another  out.   Thus,  (F   Z) = 0.
It remains  to dene  preimage  orientation and prove  (3.8).   We do  this in a  separate proposi-
tion,  which  follows.   2
To  dene  the  preimage  orientation,   we  introduce  the  notion  of   short   exact   sequence  of
vector  spaces.
Denition  3.32.   (i)  A  sequence  of  linear  maps  between  vector  spaces
       V
k1
    V
k
    V
k+1
      
is  exact   if   the  kernel   of   each  map  is  equal   to  the  image  of   its  predecessor.   A  short   exact
sequence  is  an  exact  sequence  of  the  form
0    A    B    C    0.
Since  the  domain  of  the  rst  map  is  0,  so  is  its  image,  so  the  second  map  is  injective.   Simi-
larly,  the  kernel  of  the  last  map  is  all  of  C,  so  the  third  map  must  be  surjective.
Example  3.33.  Exact sequences occur naturally in the theory of manifolds.   If y  is a regular
value  of  the  smooth  map  f  : M    N,  and  f
1
(y) ,= ,  then  for  each  x  f
1
(y),
0    T
x
f
1
(y)    T
x
M
  dxf
  T
y
N    0
is  exact.   2
(ii)  Now  suppose  that  in  the  short  exact  sequence   0    A
  i
  B
  j
  C    0,   all   three
spaces  are oriented.   We dene  the  sign of  the sequence  as follows.   Let E  be a  positive  basis
for  A,   and  let  F  =  c
1
, . . ., c
p
  be  a  positive  basis  for  C.   Choose  vectors  b
1
, . . ., b
p
    B  such
that  j(b
k
)  =  c
k
  for  k  =  1, . . ., p,   and  write  F
=  b
1
, . . ., b
p
.   Then  i(E), F
is  a  basis  for  B.
Call  the  sequence  positive  if  E, F
i(E),F
  can be  transformed to
the  identity  matrix  by  adding  multiples  of  the  rst  m  rows  to  the  last  p.   Hence  it  has
determinant  +1.
2.   If  F
1
  and  F
2
  are  bases  for  C,   then  the  matrix  of  change  of  basis  [I]
F
1
F
2
,   applied  to  the
vectors of F
1
, transforms them into vectors which dier from those of F
2
 by elements of
ker(j).   Hence,  as  in  the  previous  case,  the  bases  i(E), F
1
  and  i(E), F
2
  are  equivalent.
3.   If   E
1
  and  E
2
  are  equivalent   bases   of   A,   then  the   bases   i(E
1
), F
and  i(E
2
), F
are
equivalent.
Thus  the  denition  of  positive  sequence  is  unambiguous.
(iii)  Given  a  short  exact  sequence  in  which  two  of  the  three  spaces  are  oriented,   there  is  a
unique  orientation  of  the  third  making  the  sequence  positive.   Thus,  a  short  exact  sequence,
together  with  orientations  on  any  two  of  its  members,  can  be  used  to  orient  the  third.
(iv)  Suppose  f  : M    N  has  y  as  regular  value,  and  f
1
(y) ,= .   Here  is  how  we  dene  the
preimage  orientation  on f
1
(y).   Let  x  f
1
(y), and  write  f(x) = y.   There  is  a  short  exact
sequence
0    T
x
f
1
(y)    T
x
M
  dxf
  T
f(x)
N    0.
Use  the  orientations  on  T
x
M  and  T
f(x)
N  to  orient  T
x
f
1
(y) according  to  the  recipe  in  (iii).
(v)   Now  suppose  f   :   M    N  is   transverse  to  Z,   and  x    f
1
(Z).   Let   y   =  f(x).   The
procedure  for  orienting  T
x
f
1
(Z)  is  only  a  little  more  complicated.   There  are  short  exact
sequences
0    T
y
Z    T
y
N    T
y
N/T
y
Z    0
and
0    T
x
f
1
(Z)    T
x
M    T
y
N/T
y
Z    0.
Exactness  of the rst is  trivial; exactness  of  the second  is a consequence  of equation  (1.4) in
the  denition  of  transversality,  together  with  the  fact  that  T
x
f
1
(Z) = (d
x
f)
1
(T
y
Z).
Given  orientations   of   T
x
M,   T
y
Z  and  T
y
N,   we   use   the   rst   short   exact   sequence   to  ori-
ent  T
y
N/T
y
Z  as  explained  in  (iii),   and  then  use  the  second  short  exact  sequence  to  orient
T
x
f
1
(Z).
This  last  denition  might  seem  frighteningly  non-implementable  (and  therefore,   for  the
operationally  inclined
9
,   incomprehensible).   In  fact  it  can  be  used  in  practice,   but  for  our
purposes  it  can  perfectly  well  remain  purely  theoretical,  because  of  the  following  result.
9
such  as  me
71
Proposition 3.34.  If F  : W    N  and Z   N  are as in 3.29, and we give W  the boundary
orientation,  and  F
1
(Z)  the  preimage  orientation,  then  for  x  F
1
(Z),
(F   Z)
x
  = (F
1
(Z)  W)
x
.
Proof   Because T
x
F
1
(Z) and T
x
(W) are transverse,  for dimensional reasons their inter-
section  is  0.   Thus  every  vector  v   T
x
W  can  be  written  uniquely  as  a  sum  v  = v
1
 + v
2
  with
v
1
   T
x
F
1
(Z)  and  v
2
   T
x
(W).   Denote  by  p  the  map  v  =  v
1
 + v
2
    v
2
.   Thus  there  is  a
short  exact  sequence
0    T
x
F
1
(Z)    T
x
W
  p
  T
x
(W)    0;   (3.11)
and we can re-state our denition of oriented intersection number (3.15) as:   (F
1
(Z)W)
x
  =
1  according  to  whether  the  exact  sequence  (3.11) is  positive  or  negative.
There  is  an  isomorphism  (induced  by  d
x
(F))
d : T
x
(W)    T
y
N/T
y
Z;
we  have
(F   Z)
x
  = 1
according  to  whether
  
d  preserves  or  reverses  orientation.
Suppose  we  compose  the  map  p : T
x
W    T
x
W  with
  
d.   We  obtain  the  exact  sequence
0    T
x
F
1
(Z)    T
x
W    T
y
N/T
y
Z    0.   (3.12)
which  is   what   we   use,   following  Example   3.33(v),   to  determine   the   transverse   preimage
orientation  of   F
1
(Z):   each  tangent   space  T
x
F
1
(Z)  is   oriented  so  that   this   sequence  is
positive.   The  two  short  exact  sequences  are  shown  together  in  the  following  diagram.
0   (3.11)
T
x
W
m
m
m
m
m
m
m
m
0
  
 T
x
F
1
(Z)
  
 T
x
W
p   
k
k
k
k
k
k
k
dp
S
S
S
S
S
S
T
y
N/T
y
Z
Q
Q
Q
Q
Q
Q
Q
0   (3.12)
Because  of  the  way  in  which  (3.12)  is  obtained  from  (3.11)  by  inserting  the  isomorphism
  
d,
which  may  preserve  or  reverse  orientation,  we  have
1 = sign(3.12) = sign(3.11)      sign(
d)
72
= sign(3.11)      (F   Z)
x
.
It  follows  that
sign(3.11)   =   (F   Z)
x
,
in  other  words
(F
1
(Z)  W)
x
  = (F   Z)
x
.
2
This   proposition   completes   the   proof   of   Theorem  3.29,   and   thus   of   the   homotopy-
invariance   of   transverse   intersection  numbers.   Now  that   we   have   done   this,   we   can  le-
gitimately  dene  non-transverse  oriented  intersection  numbers,   as  described  after  Exercise
3.17.   We  state  this  as  a  formal  denition:
Denition  3.35.   (i)  If   f   :   M
m
  N
n
is  smooth,   Z
nm
is  a  submanifold  of   N,   all   three
manifolds are oriented,  and M  is compact and Z  is closed in N, then we dene the intersec-
tion  number  of  f  and  Z  in  N,   (f   Z)
N
  by  nding  f
1
  homotopic  to  f  and  transverse  to  Z,
and  setting  (f   Z)
N
  = (f
1
 Z)
N
.
(ii)  If,   in  (i),   M  is  actually  a  submanifold  of  N  and  i  :  M    N  is  the  inclusion,   we  dene
the  intersection  number  of  M  and  Z  in  N,  (M   Z)
N
,  by
(M   Z)
N
  = (i  Z)
N
.
Having  made  this  denition,   we  can  reststate  3.29  without  the  hypothesis  that  F  be
transverse  to  Z:
Theorem  3.36.   Suppose  that  W
m+1
is  a  manifold  with  boundary,  F  : W    N
n
is  smooth
and  Z
nm
is   a  submanifold  of   the  manifold  N,   with  W  compact,   Z  closed  in  N,   and  all
manifolds  oriented.   Give  W  the  boundary  orientation,  and  denote  by  F  the  restriction  of
F  to  W.   Then  (F   Z)
N
  = 0.   2
Proposition  3.37.   In  the  situation  of  3.35(ii),
(M   Z)
N
  = (1)
dimM  dimZ
(Z  M)
N
.
Proof   The  proof  is  easy  if  M    Z.   Suppose  x  M   Z,  v
1
, . . ., v
m
  is  a  basis  of  T
x
M  and
w
1
, . . ., w
d
  is a basis of T
x
Z.   Then by the assumptions on dimension and transversality,  both
v
1
, . . ., v
m
, w
1
, . . ., w
d
and
w
1
, . . ., w
d
, v
1
, . . ., v
m
73
are  bases  of  T
x
N.   Each  basis  can  be  obtained  from  the  other  by  md  transpositions,   and  it
follows that the signs of the two bases are related by a factor of (1)
md
.   Since this holds for
each  point  x  M   Z,  we  have
(M   Z)
N
  =
xMZ
(M   Z)
x
  = (1)
md
xMZ
(Z  M)
x
  = (1)
md
(Z  M)
N
.
In  case  M  is  not  transverse  to  Z,   there  is  a  perturbation  i
1
  of  i  which  is  transverse  to  Z,
and  which  is  still   an  embedding  (  i
1
  can  be  chosen  as  close  as  we  like  to  i,   and  being  an
embedding  is  an  open  condition).   Now  apply  the  previous  argument  to  the  image  M
1
  of  i
1
.
2
We  end  this  section  with  one  minor  extension  of  the  main  theorem:
Proposition  3.38.   The  oriented  intersection  number  (f   Z)
N
  of  f  :  M    N  and  Z    N
is  invariant  under  homotopies  of  the  inclusion  of  Z  in  N.
Proof   (f   Z)
N
  equal  to  (M   Z  graph(f))
MN
.   A  homotopy  of  the  embedding  of  Z  in
N  induces  a  homotopy  of  the  embedding  of  M  Z  in  M  N.   Now  apply  3.30.   2
Its  time  to  harvest  some  of  the  consequences  of  our  hard  work!
4   Applications  of  Oriented  Intersection  Numbers
Let  M
n
and  N
n
be  oriented  manifolds  of   the  same  dimension,   with  M  compact,   and  let
f  : M    N  be  a  smooth  map.   For  a  regular  value  y  of  f,  dene
deg(f; y) =
xf
1
(y)
sgn(x),
where
sgn(x) =
_
  1   if  d
x
f  preserves  orientation
1   if  d
x
f  reverses  orientation
Proposition  4.1.   For  any  two  regular  values  y
0
, y
1
  of  f,
deg(f; y
0
) = deg(f; y
1
).
Proof   deg(f; y)  is  equal   to  the  oriented  intersection  number  in  M   N  of  M   y  and
the   graph  of   f,   gr(f)   (Exercise).   Choose   a  path     in  N  joining  y
0
  and  y
1
.   The   map
 : [0, 1]  M    M  N  dened  by  (t, x) = (x, (t))  is  a  homotopy  between  the  inclusion
i
0
  of  M  in  M  N  as  M  y
0
  and  the  inclusion  i
1
  of  M  as  M y
1
.   Hence
deg(f; y
0
) = (M y
0
  gr(f)) = (i
0
 gr(f)) = (i
1
 gr(f)) = (M y
1
  gr(f)) = deg(f; y
1
)
by  the  homotopy  invariance  of  oriented  intersection  numbers.   2
74
Example  4.2.   The  following  diagram  shows  a  curve  C  wrapping  around  the  circle.   Radial
projection from the centre of the circle denes a map  : C    S
1
.   Two regular values, y
1
 and
y
2
, are shown, as are their preimages in C.   Whereas y
1
 has three preimages in C, y
2
  has only
one.   The  sign  of  each  preimage  is  indicated  in  the  diagram  with  a  +1  or  a 1.   It  is  easy  to
determine visually:   rst determine the direction of movement of (x) as x moves through the
preimage  point in the  positive  direction  on C, then  compare  this with  the  positive  direction
on  S
1
.
1
1
2
y
y
C
1
+1 1
By  inspection,
deg(; y
1
) = 1 = deg(; y
2
),
bearing  out  what  is  proved  in  4.1.
In  the  light  of  4.1  we  can  make  the  folowing  denition:
Denition  4.3.   Let   M
n
and  N
n
be  oriented  manifolds   of   the   same  dimension,   with  M
compact,  and  let  f  : M    N  be  a  smooth  map.   Dene  deg(f)  to  be  deg(f; y) for  any  regular
value  y.
By  the  homotopy-invariance  of  oriented  intersection  numbers,  we  have
Proposition  4.4.   Homotopic  maps  have  the  same  degree.   2
4.1   Vector  elds  on  spheres
It  is  easy  to  construct  a  nowhere  vanishing  vector  eld
  10
on  S
1
,  as  in  the  left-hand  picture.
N
S
10
We  mean  tangent vector elds.   It  is  usual to  omit  the  word tangent,  unless  other types  of vector elds
  e.g.   normal  vector  elds  are  also  being  discussed.
75
The  eld  drawn  is  given  by  v(x
1
, x
2
) = (x
2
, x
1
).  A  similar  formula,
v(x
1
, x
2
, x
3
, x
4
) = (x
2
, x
1
, x
3
, x
4
),
gives  a  nowhere-vanishing  vector  eld  on  S
3
,   and  the  same  idea  gives  a  nowhere  vanishing
vector  eld  on  S
n
for every  odd n.   Does there  exist  a  nowhere-vanishing  vector-eld  on  S
2
?
Its  easy  to  nd  a  vector  eld  with  just  two  zeros:   a  vector  eld  vanishing  only  at  N  and  S
is shown in the right-hand picture.   This question can be recast as a problem in hairdressing:
given a sphere with a hair growing from every point, is it possible to comb them all in such a
way  that all the hairs  lie at on the surface  of the  sphere?  As  you will  observe  on the heads
of  your  friends,  this  problem  is  hard  to  solve  even  on  a  hemisphere.
Theorem  4.5.   If   n  is  even,   every  vector  eld  on  the  n-sphere  S
n
must   vanish  at   least   at
one  point.
Proof
Step  1:   If  n  is  even,  the  antipodal  map  a : S
n
  S
n
has  degree  1.   (Exercise).
Step  2:   If v  is a nowhere vanishing  vector eld on S
n
, then the antipodal map a : S
n
  S
n
is  homotopic  to  the  identity  map.   To  construct  a  homotopy,   we  rst  obtain  a  unit  vector
eld   v   by  dividing  v   by  its   length.   Then  we   dene   F   :   [0, 1]   S
n
  S
n
by  F(t, x)   =
cos(t)x + sin(t) v(x).   Clearly  F(0, x) = x, F(1, x) = x,  and  F(t, x)  S
n
for  all  t.   Since
the  identity  map  has  degree  1,  and  degree  is  a  homotopy-invariant,  n  cannot  be  even.   2
Exercise  4.6.   How  many  everywhere  independent  vector  elds  can  you  nd  on  S
3
?   Can
you  nd  a  frame  eld  dened  on  all  of  S
3
?   2
4.2   Linking  Numbers
Consider  the  two  pairs  of  loops  of  wire  shown  in  the  following  picture.
(i)
(ii)
Its  clear  that  the  two  in  the  left  cannot  be  separated,  whereas  the  two  on  the  right  can
be.   Can  we  give  a  mathematical   description  of   the  way  they  sit  relative  to  one  another,
which  accounts  for  this  dierence?  Using  the  degree  of  a  suitable  smooth  map,  we  can.
Let  C
1
  and  C
2
  be  disjoint  smooth  closed  curves  in  R
3
.   Dene  a  map  C
1
C
2
    S
2
by
(x
1
, x
2
) 
  x
2
x
1
|x
2
x
1
|
.
76
x
2
x
2
f(    ,    ) x
1
1
x
y=
If C
1
  and C
2
 are oriented, we can calculate the degree of f, by giving C
1
C
2
  the product
orientation,  and  giving  S
2
its  usual  orientation  as  the  boundary  of  the  unit  ball.   Dene  the
linking  number  of  C
1
  and  C
2
,  (C
1
, C
2
),  by
(C
1
, C
2
) = deg(f).
To  see  that  this  does  capture  something  of   the  dierence  between  (i)  and  (ii)  in  the  rst
picture,   we  calculate  the  two  linking  numbers.   Remarkably  (we  cannot  say  this  without  a
glow  of  satisfaction)  the  techniques  we  have  developed  make  this  very  simple.   To  calculate
the  degree  of   the  map  f,   we  have  to  select  a  point  y  in  S
2
and  count  its  preimages,   with
their  sign.   The  point  (x
1
, x
2
)  in  C
1
 C
2
  is  a  preimage  of  y  if  the  line  from  x
1
  to  x
2
  points
in  the  direction  of  y.   While  it  looks  a  little  dicult  to  count  the  preimages  of  the  point  y
in  the  picture,  if  instead  we  choose  as  y  the  unit  vector  orthogonal  to  the  plane  of  the  page
and  pointing  towards  us,   then  the  preimages  of  y  are  simply  pairs  (x
1
, x
2
)  where  the  curve
C
2
  appears  to  cross  over  the  curve  C
1
.   In  the  picture  below,   there  is  just  one  such  point,
which  I  have  ringed.
2
f(   (t))
C
C
1
2
f(   (t)) 
1
d  f(v  ,0)
d  f(0,v  )
1
2
x
x
Now we compute its sign.   If v
1
 and v
2
 are positive bases for T
x
1
C
1
 and T
x
2
C
2
 then (v
1
, 0), (0, v
2
)
is a  positive  basis for T
(x
1
,x
2
)
C
1
C
2
.   Suppose that 
1
, 
2
  are parametrisations  of C
1
  and C
2
around  x
1
  and  x
2
  such  that  
1
(0) = v
1
,   
2
(0) = v
2
.   Then  (v
1
, 0)  and  (0, v
2
)  are  the  tangent
vectors  to  the  curves   
1
(t) = (
1
(t), 0)  and   
2
(t) = (0, 
2
(t)).  Moreover
d
(x
1
,x
2
)
f(v
1
, 0) = (f    
1
)
(0),   d
(x
1
,x
2
)
f(0, v
2
) = (f    
2
)
(0).
We only have to decide whether d
(x
1
,x
2
)
f(v
1
, 0), d
(x
1
,x
2
)
f(0, v
2
) is a positive or a negative basis
for  T
y
S
2
,   so  we  can  use  a  fairly  rough  and  ready  approach  (which  will   be  justied  below).
77
Imagine  a  unit  pencil  pointing  from  x
1
  to  
2
(t).   Its  easy  to  see  when  its  transplanted  and
placed  with  the  back  end  at  the  centre  of   the  unit  ball,   then  its  tip  draws  a  curve  on  the
sphere  something  like  the  one  shown,   and  with  the  orientation  shown.   Similarly,   f    
1
(t)
appears  on  the  sphere  as  shown.   The  right  hand  picture  shows  the  tangent  vectors  to  these
two  curves,  and  since  they  make  a  positive  basis  for  T
y
S
2
,  we  conclude  that  (C
1
, C
2
) = 1.
When we calculate d
(x
1
,x
2
)
f(0, v
2
), we move the tip of the pencil in the direction of v
2
.   From
this  it  follows  that  d
(x
1
,x
2
)
f(0, v
2
)  is  a  positive  multiple  of  the  orthogonal  projection  of  v
2
  to
the  plane  T
f(x
1
,x
2
)
S
2
(which  is  equal   to  (x
2
 x
1
)
).   When  we  calculate  d
(x
1
,x
2
)
f(0, v
2
),   we
move the other end of the pencil in the direction of v
1
.   From this it follows that d
(x
1
,x
2
)
f(v
1
, 0)
is  a  negative  multiple  of  the  orthogonal  projection  of  v
2
  to  the  plane  T
f(x
1
,x
2
)
S
2
.
These   two  observations   lead  to  the  following  method  for   computing  the  linking  numbers
of   two  curves   C
1
  and  C
2
  by  counting  apparent   crossings   of   their   orthogonal   projections
(C
1
), (C
2
)  to  a  plane  P.   Let  n  be  a  unit  vector  orthogonal  to  P.   Then  n  determines  an
orientation of P   v
1
, v
2
  is a positive basis if n, v
1
, v
2
  is a positive basis for  R
3
.   Suppose that
x
1
   C
1
, x
2
   C
2
  and (x
1
) = (x
2
), so  that at this point, which  we call p,  (C
1
) and (C
2
)
cross.
  Call   this  point  p  a  crossing  of   C
2
  over  C
1
,   or  overcrossing,   if   the  vector  x
2
 x
1
  is  a
positive multiple of n (because (x
1
) = (x
2
), x
2
x
1
 must be a multiple of n, although
possibly  negative).   Note  that  overcrossings  and  undercrossings  are  interchanged  if  we
interchange  the  order  of  C
1
  and  C
2
.
  Call   an  overcrossing  transverse  if  
|C
1
  and  
|C
2
  are  immersions  at  x
1
  and  x
2
,   so  that
(C
1
) and (C
2
) are manifolds in some neighbourhood of p, and if moreover (C
1
) and
(C
2
)  are  transverse  at  p.
  Call   the   overcrossing   p  positive   if   d
x
1
(v
1
), d
x
2
(v
2
)   is   a  negative   basis   for   P,   and
negative  if   d
x
1
(v
1
), d
x
2
(v
2
)  is  a  positive  basis  for  P,   and  write  sgn(p)  =  1  if   p  is  a
positive  overcrossing  and 1  if  p  is  a  negative  overcrossing.
Proposition 4.7.  (i) If (C
1
) and (C
2
) are transverse to one another at each overcrossing,
then  the  vector  n  is  a  regular  value  of  the  map  f.
(ii)  In  this  case,
(C
1
, C
2
) =
p
sgn(p)   (4.1)
where  the  sum  is  over  overcrossings.
Example   4.8.   The  linking  number   of   the  two  curves   in  picture  (ii)   on  page  76  is   0,   in
whichever  order  we  place  them  and  however  we  orient  them.   There  are  two  overcrossings,
and  they  have  opposite  sign.   2
Exercise  4.9.   What  happens  to  (C
1
, C
2
)  if  we  interchange  C
1
  and  C
2
?   2
78
Exercise  4.10.   Find  the  linking  numbers  of  the  following  pair  of  curves.
C C
1 2
2
Exercise  4.11.   Consider  the  map  S
1
  S
1
S
1
dened  by   (p, q).   Composing  with
the  dieomorphism  S
1
 S
1
  T  described  in  1.5,   we  get  a  curve  C  on  the  torus   T.   It  is
known  as  a  (p, q)  curve  (because  it  winds  around  the  torus  p  times  in  one  direction  and  q
times  in  the  other).   Let  C()  be  the  image  on  T  of  the  curve  S
1
  S
1
 S
1
given  by    to
(p + , q)  for   > 0.
(i)  Show  that  if   > 0  is  small  then  C  C() = .
(ii)  For small  ,  nd  (C, C()).   2
Exercise  4.12.   (i)  Show  that  if   C
1
  and  C
2
  are  disjoint  closed  oriented  curves  in  R
3
then
(C
1
, C
2
)  is  unchanged  if  we  deform  C
1
  or  C
2
  in  a  homotopy  during  which  the  two  curves
remain  at  all  times  disjoint.
(ii)  We  say  that  we  can  separate  C
1
  and  C
2
  if  it  possible,   by  means  of  a  homotopy  during
which  the  curves  remain  at  all   times  disjoint,   to  place  them  on  opposite  sides  of   a  plane.
Show  that if  it is possible  to separate  C
1
  and C
2
  then (C
1
, C
2
) = 0.   Hint:   suppose  they  are
separated by the plane H.   Then it is possible to choose a vector y  S
2
such that f
1
(y) = 
(where  f  is  the map C
1
C
2
    S
2
whose  degree  we measure  as our denition  of (C
1
, C
2
)).
2
Remark  4.13.   One  might   ask,   given  that   there  is   a  simple   algorithm  to  determine   the
linking  number  of  two  closed  curves,   why  do  we  need  the  rather  sophisticated  denition  in
terms  of  the  degree  of  a  smooth  map?  After  all,  the  very  denition  of  degree  took  rather  a
lot of work.   Some authors do prefer to dene the linking number by means of the algorithm.
But  they  then  have  the  problem  of  showing  that
(i) (C
1
, C
2
) is an invariant of the two  curves,  and does not depend on the planar projection
chosen,  and
(ii)  (C
1
, C
2
) does  not change  if  we  move  the  curves  without  passing  one  through the  other.
In  fact   both  can  be  shown  by  a  completely  dierent   procedure  from  ours.   Consider   the
following sequence of pictures.   It begins with one plane projection of a trefoil knot, and ends
with  another.   There  are  two  points  at  which  something  topologically  non-trivial   happens,
79
ringed  with  a  dashed  circle.
The  local  views  in  the  neighbourhood  of  the  special  points  immediately  before,  during  and
after  these  changes  are  shown  (as  moves  I   and  III)  in  the  following  table  of   Reidemeister
moves.   It  is  possible  to  show  that  given  any  two  transverse  planar  projections  of   the  pair
of   curves  C
1
, C
2
,   one  projection  can  be  deformed  to  the  other  through  a  sequence  of   such
moves.   Similarly,   if   the   pair   C
1
, C
2
  can  be   deformed  to  the   pair   C
1
, C
2
  without   at   any
moment   passing   through  one   another,   the   deformation  can  be   represented  by  means   of
planar  projections  which  undergo  only  these  same  changes.
I II
III
None  of  the  three  Reidemeister  moves  change  the  value  of  the  sum  in  (4.1).   In  the  case  of
the  move  of  type  I,  this  is  because  the  overcrossing  it introduces  (or cancels)  is  of  one curve
over  itself,  and therefore  does not contribute  to (4.1).   In the  case of  the type  II move,  if  the
two  crossings  shown  in the  picture  do appear in (4.1) then they  do  so  with  opposite sign,  as
in  the  following  picture  on  the  left,  and  again  do  not  change  its  value.
Possible  sequence   Impossible  sequence
Exercise  4.14.   Show  that  a  type  III  move  does  not  change  the  sum  (4.1).
80
4.3   Self-intersection  and  the  Euler  Characteristic
Let   N
2m
be  an  oriented  manifold  of   even  dimension  and  let   M
m
be  a  compact   oriented
submanifold.   We can dene the self-intersection number (M M)
N
  by the procedure of 3.35:
we  imagine  given  two  copies  of   M,   and  then,   leaving  one  xed,   we  perturb  the  other,   in
a  homotopy,   so  that  it  becomes  transverse  to  the  rst.   For  example,   the  self-intersection
number  of  the  curve  C   T  is  zero,  since  it  can  be  slid  right  o  itself.
C
C
Let  M  be  a  compact  oriented  manifold.   Let    M M  be the  diagonal.   Give  M M
the  product  orientation,   and  orient    by  the  dieomorphism  M      sending  x  to  (x, x).
The  Euler  characteristic  (M)  is  the  self-intersection  number  of    in  M  M,  (  )
MM
.
Proposition  4.15.   If   (M)  ,=  0,   and  the  map  f   :   M    M  is  homotopic  to  the  identity
map,  then  f  must  have  a  xed-point.
Proof   Exercise.
From  4.15  we  can  deduce  a  weak  version  of  the  Poincare-Hopf  theorem.
Theorem  4.16.   If  (M)  ,=  0  then  every  tangent  vector  eld  on  M  must  have  at  least  one
zero.
Proof   Let  U
M
  be  a  tubular  neighbourhood  of  M.   Given  a  tangent  vector  v  on  M,  dene
a  homotopy  F  : [0, ] M    M,  by
F(t, x) = (x + tv(x)).
For  small enough and t  [0, ],then x+tv(x)  U
M
  and thus F  is well-dened.   If v  has no
zero  then f
t
  has no  xed  point (see  the Exercise  below).   But f
t
  is homotopic to the identity
map.   2
Exercise  4.17.   Show  that if M  is compact, oriented  and of  odd dimension  then (M) = 0.
Hint:   use  3.37.   2
Exercise  4.18.   Show  that,  as  claimed  in  the  proof  of  4.16,  the  map  f
t
(x) = F(t, x)  has  no
xed  point.   Hint:   the  picture  should  be  clear  enough:
81
(x+tv(x))
x
v(x)
x+tv(x)
How  to  make  this  into  a  precise  argument?   For  which  points  x
  U
M
  is  (x
)  equal   to  x?
2
The full strength Poincare Hopf Theorem gives more precise information about the zeros
of  a  vector  eld,  with  only  isolated  zeros,  on  a  compact  oriented  manifold.   First,  the  index
ind
x
(v)  of  an  isolated  zero  x  of  a  vector  eld  v  is  dened;  then
Theorem  4.19.   If  v  is  a  vector  eld  with  only  isolated  zeros  on  the  compact  oriented  man-
ifold  M,  then
x  a  zero  of   v
ind
x
(v) = (M).
2
We  will   not  prove  this  theorem  here;   some  of  the  details  of  the  proof,  and  in  particular
the  denition  of  the  index  of  an  isolated  zero,  are  given  in  Exercises  4.
4.4   Brouwers  Fixed  Point  Theorem
Let  Y   X.   A  retraction  X    Y  is  a  continuous  map  such  that  for  all  y  Y, r(y) = y.
Proposition  4.20.   There  can  be  no  smooth  retraction  from  a  compact   oriented  manifold
with  boundary,  W,  onto  its  boundary.
Proof   Exercise Hint:   if r  is a smooth retraction, what is the degree of r?  Use Theorem
3.29   2
Theorem  4.21.   Brouwers  Fixed  Point  Theorem  Let   f   :
  
B
n
  
B
n
be  a  continuous  map
(where
  
B
n
is  the  unit  ball  in  R
n
).   Then  f  has  a  xed  point.
Proof   Suppose  rst  that  f  is  smooth.   Dene  a  smooth  map  g  :
  
B
n
  S
n1
by
x the  point  where  the  line  from  f(x)  to  x,  continued,  meets  S
n1
.
Evidently  this  is  a  retraction  from
  
B
n
to  
 
B
n
,  contradicting  4.20.
82
g(x)
x
f(x)
Now  consider  the  case  where  f  is  merely  continuous,   and  has  no  xed  point.   As
  
B
n
is
compact,   there  exists     >  0  such  that   for  all   x  
  
B
n
,   |x  f(x)|    .   Choose  smooth
f  :
  
B
n
  
B
n
such  that  |
: U
  is  called  an
atlas  for  M.
(iii)  An  atlas  
: U
is smooth if whenever U
(U
(U
)
is  smooth.
(iv)  If  M  is  a  manifold  equipped  with  a  smooth  atlas  A,  then  a  map  f  : M    R
k
is  smooth
with  respect  to  A  at  a  point  x  M  if  f   
1
is smooth, where
  A  is  a  chart  dened  on
some  neighbourhood of x.   And  a  map f  :   R
k
  M  is  smooth  with  respect  to  A  at y  R
k
if
is an open cover V
.
85
from M  to  R
k
without reference to the charts of a smooth atlas, because M  is not embedded
in  any  bigger  space  already  equipped  with  a  notion  of  dierentiability.
Example  5.2.   We  dene  an  atlas  on  RP
n
=  S
n
/    as  follows.   Each  (n + 1)-dimensional
vector subspace  P   R
n+1
divides S
n
into  two open hemispheres.   Let H  be one of them.   As
H  contains  no  pairs  of  antipodal  points,  the  map  q
|
  : H    RP
n
is  1-1;  since  q  is  continuous
and  open  (see  Example  1.41(ii)),  it  follows  that  q(H)  is  open  in  RP
n
and  q  : H    q(H)  is  a
homeomorphism.   Also H  is homeomorphic to an open set in  R
n
:   for instance, the orthogonal
projection  R
n+1
  P  maps  H  homeomorphically  to  the open unit ball B  in P.   We take  the
composite
q(H)
  q
1
  H
  
  B
as  a  chart  on  RP
n
,  and  call  it  .   As  hemispheres  H  cover  S
n
,  their  images  q(H)  cover   RP
n
,
so  we  have  an  atlas.
This  atlas  is  smooth.   This  is  most  easily  seen  with  a  diagram  for  the  case  n  =  1.   We
take  two  hyperplanes  P
1
  and  P
2
,  and two  hemispheres  (semicircles  in  the  picture)  that they
determine,  H
1
  and  H
2
.
2
H
1
H
2
P
1
P
Denote   by  
1
  :   H
1
    B
1
  and  
2
  :   H
2
    B
2
  the   orthogonal   projections   used  in  the
construction  of  the  charts.   We  have
i
_
q(H
1
)  q(H
2
)
_
= 
i
(H
1
 H
2
)
for  i = 1, 2, and
2
 
1
1
  = 
2
 
1
1
  ;
the  diagram  below  shows  the  composite
2
 
1
1
  : 
1
(H
1
 H
2
)    
2
(H
1
 H
2
).
Because  of  its  geometrical  origin,  its  easy  to  believe  that  this  map  is  smooth.
86
In  fact   a  precise   argument   is   straightforward.   After   an  ambient   rotation,   P
1
  is   the
subspace  x
n+1
  = 0,  and  
1
1
  : B
1
    H
1
  is  given  by  the  formula
(x
1
, . . ., x
n
)  
_
x
1
, . . ., x
n
,
_
1 (x
2
1
 +   + x
2
n
)
_
.
As  
2
  is  a  linear  projection,  the  composite  
2
 
1
1
  is  smooth.   2
Example  5.3.   Now  here  is  another  atlas  on  RP
n
.   This  time  we  use  the  fact  that   RP
n
can
also  be  thought  of  as  the  quotient  of   R
n+1
 0  by  the  equivalence  relation  x 
1
  x  where
  R0.   The equivalence classes here are just straight lines through zero, and each meets
S
n
in two antipodal points, so there is a natural bijection between the set of these equivalence
classes,   R
n+1
/ 
1
  and the quotient of S
n
by the antipodal identication.   It follows that there
are mutually inverse  continuous  maps between  the two  quotients,  and they can be identied
with  one  another.   Thus,  we  think  of   RP
n
ss  the  space  of  lines    through  0  in  R
n+1
.
We will denote the equivalence class in RP
n
of a point (x
1
, . . ., x
n+1
) in R
n+1
by [x
1
, . . ., x
n+1
].
Thus  for  any   ,= 0,
[x
1
, . . ., x
n+1
] = [x
1
, . . ., x
n+1
].   (5.1)
For  a  point  P  =  [x
1
, . . ., x
n+1
]    RP
n
,   the  x
i
  are  its  so-called  homogeneous  coordinates.   Of
course,   they  are  not  co-ordinates  at  all:   the  value  of  x
i
  is  not  well   dened,   by  (5.1).   What
is  well  dened  is  the  ratio  x
i
/x
j
,  provided  x
j
  ,= 0.   We  dene
U
j
  = [x
1
, . . ., x
n+1
]  RP
n
: x
j
  ,= 0.
If  P   U
j
,  by  dividing  its  homogeneous  co-ordinates  through  by  x
j
  we  get  a  representation
in  which  the  j-th  homogeneous  coordinate  is  equal   to  1.   Every  point  in  U
j
  has  a  unique
representation  of  this  form.   We  use  this  to  dene  a  bijection  
j
  : U
j
    R
n
:
j
([x
1
, . . ., x
n+1
]) =
_
x
1
x
j
, . . .,
 x
n+1
x
j
_
where  the  resulting  1  in  the  j-th  place  is  omitted.
The  inverse  of  this  map  has  a  geometrical  interpretation  which  deserves  mention,  and  is
illustrated  in  the  following  diagram,  in  which  we  take  j  = n + 1.
87
1 (
y
)
=
R
n
x {1}
O
S
n
y
[
y
,
1
]
n
+
1
For each y  R
n
, there is a unique line through 0 in  R
n+1
containing the point (y, 1).   This
line is of course the linear span of (y, 1), and is thus the equivalence  class of (y, 1) under 
1
.
It  already  has  a  name:   [y, 1].   It  is  the  image  of   y  under  
1
n+1
.   Notice  that  every  line  not
contained  in  the  horizontal   plane   R
n
 0  meets   R
n+1
 1,   and  is  thus  in  the  image  of
1
n+1
.
The crossover maps in this atlas are easier to compute than in the previous atlas:   if i < j
then  
j
(U
i
 U
j
) = (y
1
, . . ., y
n
)  R
n
: y
i
 ,= 0,  and
i
 
1
j
  (y
1
, . . ., y
n
) = 
i
_
[y
1
, . . ., y
j1
, 1, y
j
, . . ., y
n
]
_
=
_
y
1
y
i
, . . .,
 y
j1
y
i
,
  1
y
i
,
 y
j
y
i
, . . .,
 y
n
y
i
_
  (5.2)
where the 1 in the i-th place is omitted.   This is obviously smooth, since y
i
 ,= 0 in its domain.
To  distinguish  this  atlas  from  the  one  introduced  in  Example  5.2,  we  will  call  it  /
.   2
Lemma  5.4.   The  two  atlases  /  and  /
on  RP
n
dened  in  Examples  5.2  and  5.3  determine
the  same  criterion  of  smoothness.
Proof   Denote  charts  in  /  by  
P
  and  charts  in  /
by  
j
.   It  is  enough  to  show  that  each
composite  
P
   
1
j
  and  
j
 
1
P
  ,  where  dened,  is  smooth.   For then  e.g.   if  f  :  RP
n
  R
k
is
smooth  with  respect  to  /,  we  have
f   
1
j
  = (f   
1
P
  )  (
P
   
1
j
  )
and  so  f  is  smooth  with  respect  to  /
.
Showing  smoothness   of   
P
    
1
j
  is   straightforward,   though  it   may  be  guided  by  the
following  picture.
88
B
n+1
P
1
(y)) [
y
,
1
]
P
  (
R
n
x {1}
H
(y,1)
(y,1)
P
   
1
j
  (y) =   q
1
)(q((y, 1)) = ((y, 1))
as   shown.   In  fact     =  1/|(y, 1)|,   and  so  depends   smoothly  on  y,   and  thus   the   map
R
n
  H  is  smooth.   As  the  composite  of  this  map  with  the  linear  projection  ,  
P
   
1
n+1
  is
also  smooth.   2
Denition  5.5.   Two  smooth  atlases  /  and  /
are  equivalent.
(2)  Every  chart  in  /  is  smooth  with  respect  to  /
,  and  vice-versa.
(3)  / /
is  preferable  to  /.   There  are  two  reasons:   a  reason  of   convenience,   that  the  crossover
maps are much simpler in /
allows  us  to  endow  RP
n
with  a  geometric  structure  as  well  as  a  criterion  for  smoothness.
Suppose  that  L  R
n
is  any  ane  subspace  (i.e.   a  translation  of  a  linear  subspace).
Exercise  5.8.   In  this  case  
j
 
1
i
  (L)  is  also  an  ane  subspace,  of  the  same  dimension  as
L.   2
Hint:   rst  read  on  for  a  few  lines.
In view of 5.8, we can speak of lines and planes in  RP
n
, and indeed of higher dimensional
linear  subspaces.   The  word  linear  is  in  quotes  because  RP
n
is  not  a  vector  space.   In  fact
we  make  a  slightly  wider  denition:
Denition  5.9.   A  k-dimensional linear  subspace  of   RP
n
is  the  space  obtained  as  the  image,
under the quotient map  R
n+1
0    RP
n
, of a k +1-dimensional vector subspace V  of   R
n+1
.
Since  any  V
k+1
is  linearly  isomorphic  to  R
k+1
,   and  the  image  [V ]  of  V   is  just  the  set  of
lines  through  0  in  V ,   it  is  itself  a  projective  space  of  dimension  k.   The  great  virtue  of  the
charts  
j
   /
C
C
C
C
C
C
C
C
X
g
{
{
{
{
{
{
{
{
RP
2
commute,  where  p : S
2
  RP
2
is  the  quotient  map  p(x
1
, x
2
, x
3
) = [x
1
, x
2
, x
3
].
90
The  procedure  is  simple.   We  want  g  to  map  (x
2
1
, x
2
2
, x
2
3
, x
1
x
2
, x
1
x
3
, x
2
x
3
)  to  the  point  in  RP
2
represented  in  homogeneous   coordinates  by  [x
1
, x
2
, x
3
].   Denote  the  coordinates  on  R
6
by
y
1
, . . ., y
6
.   If  x
1
 ,= 0  then  [x
1
, x
2
, x
3
] = [x
2
1
, x
1
x
2
, x
1
x
3
].   so  we  can  dene  g  by  the  formula
g(y
1
, . . ., y
6
) = [y
1
, y
4
, y
5
].
This  is  valid  provided  y
1
,= 0,  because  f
1
(y
1
 ,= 0) = x
1
 ,= 0.
If  y
1
  = 0  we  cannot  use  this  formula  to  dene  g;  instead,  we  use
g(y
1
, . . ., y
6
) =
_
_
_
[y
4
, y
2
, y
6
]   if y
2
 ,= 0
[y
5
, y
6
, y
3
]   if y
3
 ,= 0
Every  point  in  the  image  of  f  lies  in  at  least  one  of  y
1
  ,= 0,   y
2
 ,= 0  or  y
3
  ,= 0,  so  g  is
well  dened.   Composition  of g  with  any  of  the three  standard  charts  on  RP
2
gives  a  smooth
map,  so  g  is  smooth.
Exercise  5.11.   (i)  Show  that  g  is  a  dieomorphism  by  nding  a  smooth  inverse.
(ii)
D
D
D
D
D
D
D
D
M
RP
2
z
z
z
z
z
z
z
z
commute.
Example  5.12.   (i)   RP
3
is  homeomorphic  to  the  Lie  group  SO(3).   There  is  a  simple  argu-
ment  to  prove  this.
Step  1:   RP
3
is  the  image  of  a  closed  hemisphere  H    S
3
under  the  quotient  map  q;  under
q,   the  only  identications  in  H  which  take  place  are  those  of  antipodal  points  on  H.   Or-
thogonal   projection  to  the  equatorial   hyperplane  determines  a  homeomorphism  (but  not  a
dieomorphism)    : H    B, where  B  is a  closed  ball in  R
3
.   Thus  q  
1
maps  B  onto  RP
3
,
in  the  process  identifying  antipodal  points  on  B.   This  map  passes  to  the  quotient  to  give
a map  in fact a homeomorphism  from the quotient space  B/  to  RP
3
, where  is the
equivalence  relation  which  identies  antipodal  points  on  B.
Step  2:   SO(3) is  also  homeomorphic  to  B/ .   For SO(3)  is  the  set  of  rotations  about axes
in  R
3
.   A  point  x  B  determines  such  a  rotation;  unless  it  is  the  centre  of  the  ball,  0,  x  lies
on a unique line 
x
  through 0, which we can take as our axis of rotation.   We use the norm of
x to determine the angle of rotation, and we use the position of x on the line 
x
  to determine
a  sense  of  rotation.   In  detail:   provided  x ,= 0,  it  determines  an  orientation  of  the  plane  x
,
and  a  rotation  through  |x|  in  the  positive  sense  in  this  plane.   As  R
3
= 
x
 x
,  we  get  a
91
well  dened rotation about 
x
.   If  x = 0, it does not matter that we have  not dened  an axis
of rotation, since we intend only to rotate through 0 radians, and a rotation through 0 about
one  axis  is  the  same  as  a  rotation  through  0  about  any  other.   Finally,   a  rotation  through
  in  one  sense  is  the  same  thing  as  a  rotation  about    in  the  opposite  sense.   So  antipodal
points  on  B  determine  the  same  rotation.   Thus,  we  have  a  surjection  B    SO(3) which  is
1-1  except  that  it  identies  antipodal  points  on  B.
Together,  Steps  1  and  2  show  that  there  is  a  homeomorphism  RP
3
  SO(3).
(ii)  In  fact  RP
3
and  SO(3)  are  dieomorphic.   Rather  than  trying  to  improve  the  map  in  (i),
we  use  a  dierent  approach.   First  we  dene  a  smooth  map    :   S
3
  SO(3),   and  then  we
show  that  it  passes  to  the  quotient  to  dene  a  smooth  map  RP
3
  SO(3).   To  dene  ,   we
think  of  S
3
as  the  multiplicative  group  of  unit  quaternions:
(a, b, c, d) a + bi + cj + dk
and look for an orthogonal representation of the unit quaternions on R
3
.   There is an ingenious
way  of  doing  this.   We  identify  points  in  R
3
with  quaternions  with  zero  real  part:
(x, y, z) xi + yj + zk.
and  then  let  a + bi + cj + dk  act  on  the  space  of  such  quaternions  by  conjugation:
xi + yj + zk (a + bi + cj + dk)
1
(xi + yj + zk)(a + bi + cj + dk)   (5.3)
(note  that  the  right  hand  side  of  (5.3)  has  zero  real  part).   As  a  linear  map  in  (x, y, z),  this
map  has  matrix
_
_
a
2
+ b
2
c
2
d
2
2(ad + bc)   2(bd ac)
2(bc ad)   a
2
b
2
+ c
2
d
2
2(cd + ab)
2(ac + bd)   2(cd ab)   a
2
b
2
c
2
+ d
2
_
_
.   (5.4)
It  is  not  completely  obvious  that  this  matrix  is  in  SO(3),   although  it  is  straightforward  to
check  that it is.   Note that since    is  smooth and a group homomorphism,  it is obvious  that
the  image  is  a  3-dimensional  compact  subgroup  of  Gl
3
(R).
Exercise  5.13.   (i)  Show  that  (5.4)  is  indeed  the  matrix  of  the  linear  map  (5.3).
(ii)  Check  that  (5.4)  SO(3).
(iii)  Show  that  the  map  S
3
  SO(3)  dened  by  sending  (a, b, c, d)  S
3
to  the  matrix  (5.4)
passes  to  the  quotient  to  dene  a  smooth  map  RP
3
  SO(3).
(iv)  Show  that  the  map  RP
3
  SO(3)  you  found  in  (iii)  is  a  a  dieomorphism.
Exercise  5.14.   Write  down  an  explicit  formula  for  the  map  B
3
  RP
3
of  Example  5.12(i)
Step  1,   with  respect  to  the  standard  coordinates  on  B
3
and  homogeneous  coordinates  on
RP
3
.   Is  it  smooth?
92
Exercise 5.15.  The map f  : B
3
  SO(3) described in Example 5.12(i) above can be written
explicitly in coordinates, though it is not absolutely straightforward to do so.   As a rst step,
it is relatively easy to nd the restriction of f  to the boundary S
2
of B
3
.   Recall that f(x, y, z)
consists  of  a  rotation through  an  angle  of |(x, y, z)|  about  the  line  spanned  by  (x, y, z).   If
|(x, y, z)| = 1 then this rotation is just multiplication by 1 on the orthogonal complement
(x, y, z)
.   If  v
1
, v
2
  is  any  basis  of  (x, y, z)
.
Using  this  basis,  compute  the  matrix  f(x, y, z).
(ii)  Check  that,  after  simplication,  the  expression  you  have  obtained  is  valid  even  if  y  = 0.
Can  you  explain  why?
(iii)  Show  that  the  map  S
2
  SO(3)  you  have  obtained  passes  to  the  quotient  to  dene  a
smooth  map  RP
2
  SO(3).
(iv)  Find  explicit  formula(e)  for  its  inverse.
Example  5.16.  We can play the same game with the group Gl
n
(R) as with  R
n+1
0.   That
is,  we  identify  two  matrices  A
1
  and  A
2
  if  one  is  a  scalar  multiple  of  the  other.   The  quotient
of  Gl
n
(R)  by  this  equivalence  relation  is  called  PGl
n
(R).   We  denote  the  equivalence  class  of
a  matrix  A  by  [A].   If  we  think  of  Mat
nn
(R)  as  R
n
2
,   then  clearly  Gl
n
(R)    R
n
2
 0.   The
equivalence  relation  we  have  just  dened  on  Gl
n
(R)  is  the  same  as  the  quivalence  relation
used  to  obtain  RP
n
2
1
from  R
n
2
 0.   Thus   PGl
n
(R)  is  an  (open)  subset  of   RP
n
2
1
,   and  in
particular  a  manifold  of  dimension  n
2
 1.   It  is  also  a  group:   A    B  if  there  exists    ,=  0
such  that I
n
A = B, and thus the equivalence  classes  are the cosets  of the normal subgroup
S  of  Gl
n
(R)  consisting  of  scalar  matrices
S  := I
n
  : 0 ,=   R.
So  PGl
n
(R)  is  the  quotient  of  Gl
n
(R)  by  S.   What  this  amounts  to  is  that  if  A
=  
1
A  and
B
= 
2
B then A
= 
1
2
AB, so that the group operation in Gl
n
(R) (multiplication) passes
to  the  quotient  to  dene  a  group  operation  in  PGl
n
(R).   2
Proposition  5.17.   PGl
n
(R)  is  a  Lie  group.
93
Proof   Denote  the  product  operation  on  PGl
n
(R)  by  m,  and  the  operation  of  inversion  by
i.   The  chart  domains  in  PGl
n
(R)  are  the  sets
U
ij
  := [A]  PGl
n
(R) : a
ij
  ,= 0,
and  the  chart  
ij
  is  dened  by
ij
([A]) =
_
_
_
_
a
11
/a
ij
  a
12
/a
ij
       
1n
/a
ij
                 
                 
a
n1
/a
ij
  a
n2
/a
ij
       
nn
/a
ij
_
_
_
_
.
The  matrix  on  the  right  has  a  1  in  the  i, j   position:   we  regard  it  as  an  element  of   R
n
2
1
.
To  ease  problems  of  notation,   we  denote  this  copy  of   R
n
2
1
by  V
ij
.   Elements  of  V
ij
  will   be
written  in  the  form  (x) = (x
pq
 m (
1
ij
  , 
1
k
 )
takes  the  form
((x), (y)) 
_
 x
p
y
q
_
.
This  is  evidently  smooth  wherever  the  denominator  is  non-zero,  i.e.   on
(
ij
k
)
_
m
1
(U
pq
)
_
,
as  required.
I  leave  to  the  reader  the  task  of  verifying  that inversion  i also  determines  a  smooth  map
PGl
n
(R)    PGl
n
(R).   2
In  Example  2.18(2)  above,   we  used  Sards  theorem  to  show  that  not  all   quadruples  of
lines  through (0, 0) in  R
2
are linearly  isomorphic  to  one another,  by  looking  at the  way  that
the  group  Gl
2
(R)  acts  on  quadruples  of  lines.   We  can  now  think  of  this  slightly  dierently.
The  space Q of quadruples  of distinct  lines  though (0, 0) is an open set in (RP
1
)
4
.   Thus  it is
a manifold of dimension  4.   The  action of Gl
2
(R) on Q is smooth, and passes  to the quotient
to  dene  a  smooth  action  of   PGl
2
(R)  on  Q.   The  equivalence  class  of  a  given  quadruple  (say
the quadruple X
1
  of Example 2.18) is the image of the group  PGl
2
(R) under the smooth map
[A] A  X
1
.   Since  dimPGl
2
(R) = 3  and  dimQ = 4,  this  image  has  measure  zero  in  Q,  by
Sards  Theorem.
94
5.2   An  important  example  -  the  Grassmanian
Now we give another example of abstract manifold, generalising  RP
n
in an obvious way.   The
Grassmanian  G
k,n
  is  the  space  of   all   k-dimensional   linear   subspaces   of   R
n
.   In  particular
G
1,n+1
  =  RP
n
.   To  save  breath,   we  call   the  elements  of  G
k,n
  k-planes  in  R
n
.   At  the  outset,
G
k,n
  doesnt  even  have  a  topology,   though  most  likely  we  would  agree  that  we  have  an  in-
tuitive  notion  of  the  relative  closeness  of  dierent  linear  subspaces  to  one  another.   We  give
it  a  topology  and  a  k(n  k)-dimensional   smooth  structure  at  the  same  time,   by  dening
bijections   between  certain  subsets   of   G
k,n
  and  R
k(nk)
,   and  then  declaring  that   their   do-
mains  are  open  and  that  the  bijections  are  homeomorphisms,   and  calling  them  charts.   For
this  to  dene  a  topology,   the  crossover   maps   between  the  bijections   must   be  homeomor-
phisms.   This  will  follow  from  the  rst  thing  we  prove,   which  will  be  that  they  are  actually
dieomorphisms.   This  is  the  serious  part  of  the  task.   Everything  else  is  just  what  we  de-
ne  it  to  be;   but  the  fact  that  it  is  possible  to  choose  rather  natural   charts  on  G
k,n
  such
that all the crossover maps are smooth, is a fact of nature, and not just a matter of denition.
The  smooth  structure  we  will   dene  has  the  property  that  if   v
1
(t), . . ., v
k
(t)  are  vectors  in
R
n
,  depending  smoothly  on a  real  variable t,  and are  independent  for all t,  then  the  k-plane
they  span,  P(t),  will   also  vary  smoothly  in  G
k,n
.   Indeed,   it  is  the  unique  smooth  structure
with  this  property.
Construction  of  the  bijections   Choose  a  basis  e
1
, . . ., e
n
  for  R
n
.   We  will   dene  a  chart
on  G
k,n
  for  each  subset  I  of  1, 2, . . ., n  containing  k  elements.   Let  I  be  such  a  subset,   let
J  be  its  complement  in  1, . . ., n,  and  let  W
J
  = Spe
j
  : j   J.  We  take,  as  the  domain  of
our  chart,  the  set
U
I
  = V   G
k,n
  : V   W
J
  = 0.
For  example,   if  n  =  3,   k  =  2,   and  I  =  1, 2  then  U
I
  is  the  set  of  2-planes  which  meet  the
x
3
-axis  only  at  0.
Lemma   5.18.   Let   I   =  i
1
, . . ., i
k
,   let   V     G
n,k
  and  let   v
1
  =  (v
11
, . . ., v
1,n
), . . .,  v
k
  =
(v
k1
, . . ., v
kn
)  be  a  basis  of  V .   Let  M  be  the  n k  matrix
_
_
_
_
v
11
       v
k1
            
            
v
1n
       v
kn
_
_
_
_
Then
(i) V   U
I
  if and only if the kk  submatrix M
I
  of M, formed by rows i
1
, . . ., i
k
, is invertible.
(ii)  Each  V   U
I
  has  a  unique  basis  v
I
1
, . . ., v
I
k
  for  which  M
I
  is  the  k k  identity  matrix.   2
For each V   U
I
, denote by  M
I
(V ) the matrix of 5.18 for the basis v
I
1
, . . ., v
I
k
  of 5.18(ii).   We
dene  
I
  : U
I
    Mat
(nk)k
   R
k(nk)
by  
I
(V ) = M
I
J
(V ),  where  M
I
J
(V )  is  the  (n k) k
95
matrix  formed  by  rows  j
1
, . . ., j
nk
  of  M
I
(V ).   We  recover  the  matrix  M
I
(V )  from  M
I
J
(V )
by interspersing,  among  the rows of M
I
J
(V ), the k  rows  of the k k  identity  matrix, so  that
in the resulting  n k  matrix they occupy  positions i
1
, . . ., i
k
.   Applying  this procedure  to an
arbitrary  (n  k)  k  matrix  A  gives  us  a  n  k  matrix  which  we  denote  by  
I
(A),   whose
columns  span  a  k-plane  V ;  this  k-plane  is  
1
I
  (A).
Now  we  consider  the  crossover  maps
I
1
 
1
I
2
  : 
I
2
(U
I
1
 U
I
2
)    
I
1
(U
I
1
 U
I
2
),
where  I
1
  and  I
2
  are  two  dierent  k-element  subsets  of  1, . . ., n.
Lemma  5.19.   (i)  Let  V   U
I
1
.   Then  V   U
I
2
  also  if  and  only  if  M
I
1
I
2
(V )  is  invertible.
(ii)  If  V   U
I
1
 U
I
2
  then  M
I
2
(V ) = M
I
1
(V )(M
I
1
I
2
(V ))
1
.
(iii)  
I
1
 
1
I
2
  is  a  dieomorphism.
Proof   (i)   is   just   5.18(i).   For   (ii),   simply  observe  that   in  the  product   M
I
1
(V )(M
I
1
I
2
)
1
,
the  k  k  submatrix  formed  by  the  rows  corresponding  to  I
2
  is  the  k  k  identity  matrix
(as  it  is  just  (M
I
1
I
2
(V ))
1
M
I
1
I
2
(V )).   Thus  M
I
1
(V )(M
I
1
I
2
(V ))
1
=  M
I
2
(V ),  since  this  property
characterises  M
I
2
(V ).
(iii) Given an (nk)k matrix A, we obtain 
I
1
(
1
I
2
  (A)) by rst forming the matrix 
I
2
(A),
then right-multiplying it by the inverse of its submatrix consisting of its i
1
st, . . . , i
k
th rows,
and  then  throwing  away  those  rows  (which  are  now  the  rows  of   the  identity  matrix)  from
the  product.   In  symbols,
I
1
 
1
I
2
  (A) =
_
I
2
(A)
1
I
1
_
I
2
(A)
_
_
J
where  J  is  the  complement  of  I.   This  map  is  clearly  smooth.   Since  it  has  inverse  
I
2
 
1
I
1
  ,
which  is  also  smooth,  by  the  same  argument,  it  is  a  dieomorphism.   2
We give  G
k,n
  a topology by  declaring  that each  of the  U
I
  is  open, and  that each  of  the  
I
  is
a  homeomorphism.   In  other  words,  the  open  sets  in  G
k,n
  are,  rst,  the  preimages  under  the
maps  
I
  of  open  sets  in  Mat
(nk)k
,  and  second,   arbitrary  unions  of  these  preimages.   This
procedure does indeed dene a topology on G
k,n
.   Notice that we need to know that 
I
1
1
I
2
is  a  homeomorphism  for  this  to  work:   for  it  establishes  that  a  subset  of  G
k,n
  which  is  open
with  respect  to  one  chart  is  also  open  with  respect  to  any  other.
So  far,  all  of  our  construction  has  been  with  respect  to  a  xed  basis  E  of   R
n
.   Is  the  choice
of   basis   important?   If   we   use   a  dierent   basis,   do  we   get   the   same   smooth  structure?
Let  E
I
   : U
I
     Mat
(nk)k
  the  chart  corresponding  to  I
.
96
Proposition 5.20.  Each such map 
I
   : U
I
     Mat
(nk)k
  is a dieomorphism, with respect
to  the  smooth  structure  dened  using  the  charts  
I
  constructed  using  the  basis  E.
Proof   This  is  a  consequence  of  the  next  lemma.   2
Lemma  5.21.   The  map
1
I
   : 
I
 (U
I
  U
I
)    
I
(U
I
  U
I
)
is  smooth.
Proof   Exercise.   Use  the  fact  that  if  v
1
, . . ., v
k
  is  a  basis  for  the  k-plane  V ,  and  M
is  the
matrix of 5.18, whose columns are the expression of the vectors v
1
, . . ., v
k
  with respect to the
basis E
, then M
= [I]
E
E
 M, where  [I]
E
E
   is the change of basis  matrix (see  Appendix  1, 7.1).
2
Now it is clear, by 5.20, that we can include all maps 
I
   as charts in our atlas for G
k,n
.   The
choice  of  basis  E  is  completely  immaterial.
It  remains  to  show  that  G
k,n
  is  Hausdor  and  second  countable.   Second  countability  is  an
immediate  consequence  of the fact that a nite number of chart domains U
I
  cover  G
k,n
, and
that each  of  them  is  itself  second  countable.   To  show  that G
k,n
  is  Hausdor,  observe  that if
V
1
  and  V
2
  are  any  two  k-planes,  there  is  an  (n k)-plane  W  complementary  to  them  both.
Choose  a  basis  E
for  R
n
such  that  W  =  Spe
k+1
, . . ., e
n
,  and  let  I
= 1, . . ., k.   Then  V
1
and  V
2
  both belong  to U
I
 ,  which  is  homeomorphic,  via  
I
,  to  Mat
(nk)k
.   As  Mat
(nk)k
  is
Hausdor,  so  is  U
I
,  and  we  may  choose  disjoint  neighbourhoods  O
1
,  O
2
  of  V
1
  and  V
2
,  open
in  U
I
 .   As  U
I
   is  open  in  G
k,n
,  so  are  O
1
  and  O
2
.
Exercise  5.22.   Show that if v
1
(t), . . . ,  v
k
(t) are vectors  in  R
n
, each depending  smoothly on
a  parameter  t,  and  are  linearly  independent  for  all  t,  then  the  map
R    G
k,n
sending  t  to  Spv
1
(t), . . ., v
k
(t)  is  smooth.
Exercise  5.23.   The  projective  space   RP
n
is  the  special   case  k  =  1  of   the  Grassmannian
G
k,n+1
.   We  have  described  in  detail   how  to  construct  charts  on  RP
n+1
and  on  G
k,n+1
.   Do
they  give  the  same  charts  when  k = 1?   2
5.3   Complex  manifolds
Example  5.24.   The  complex  projective  space  CP
n
is  dened  in  close  analogy  to  RP
n
:   it  is
the  space  of   complex  lines  through  0  in  C
n+1
,   and  thus  the  quotient  of   C
n+1
 0  by  the
equivalence  relation
(z
1
, . . ., z
n+1
)  (z
1
, . . ., z
n+1
)
97
for      C  0.   An  atlas  
j
  :   U
j
    C
n
  is  dened  by  exactly  the  same  procedure  as  for
RP
n
,   and  the  crossover  maps  are  given  by  the  same  formula  (5.2),   with  respect  to  complex
coordinates on  C
n
=  R
2n
.   The crossover maps are not just smooth, but actually holomorphic
  that is,  dierentiable,  in  the  complex  sense,  with  respect  to  each  complex  variable  y
j
.   In
particular,  the  derivative  d
y
(
j
 
1
i
  ) :   R
2n
=  C
n
  C
n
=  R
2n
is  complex-linear.   2
Denition  5.25.   Identify  C
n
with  R
2n
in  some  xed  way,  e.g.
(x
1
 + iy
1
, . . ., x
n
 + iy
n
)    (x
1
, y
1
, x
2
, y
2
, . . ., x
n
, y
n
).
An  even  dimensional   smooth  manifold  M
2n
equipped  with  an  atlas  A  is  an  n-dimensional
complex  manifold  if  all   the  crossover  maps  of  charts  in  A  are  holomorphic,   with  respect  to
this  identication.
The  Grassmanian  G
k,n
(C)  of  k-dimensional   complex  subspaces  of   C
n
is  a  complex  man-
ifold  of  (complex)  dimension  k(n  k).   For  the  linear  algebra  involved  in  the  construction
of  the  bijections  
I
  works  over  any  eld,   not  just   R,   and  inspection  of  the  crossover  maps
I
1
 
1
I
2
  shows  that  they  are  in  fact  rational  maps  (each  component  function  is  a  quotient
of  polynomials  in  the  k(n k)  complex  variables)  and  therefore  holomorphic.
Projective   spaces   play  a  central   role   in  algebraic   geometry.   They  are   natural   homes   to
many  manifolds  that  do  not  naturally  embed  in  Euclidean  space.   For  example,   there  is  a
well-known  embedding,  due  to  Pl ucker,  of  the  Grassmannian  G
k,n
  into  a  suitable  projective
space   (in  fact,   RP
n
k
1
)   -see   5.27  below.   For   complex  manifolds   the   advantage   is   even
more  dramatic:   a  compact  complex  manifold  of   dimension  >  0  cannot  be  embedded  as  a
complex submanifold of  C
N
for any N.   The reason is simply that if MC
N
is a holomorphic
embedding then each complex coordinate function on  C
N
restricts to a holomorphic function
f
j
  on M.   As M  is compact, [f
j
[ must  attain its maximum  value somewhere  on M.   Let   be
a  chart  around  this  point.   Because  [f
j
 
1
[  has  a  local  maximum,  the  maximum  modulus
principal of  the theory of complex  variables implies  that f
j
 
1
is constant.   So  by analytic
continuation  f
j
  =  z
j
  is  locally  constant  on  all   of  M,   and  thus  M  is  a  collection  of  discrete
points.
On  the  other  hand,   very  many  compact  complex  manifolds  do  embed  as  complex  sub-
manifolds  of   CP
n
  for  example,  the  complex  Grassmanian,  via  the  Pl ucker  embedding.
Exercise 5.26.  (1) Recall that SO(n) is the Lie group of nn orthogonal real matrices with
positive  determinant.   Let  V
0
  = Spe
1
, . . ., e
k
  R
n
,  and  dene  a  map  f  : SO(n)    G
k,n
  by
f(A) = A V
0
;  that  is,  f(A)  is  the  k-plane  to  which  V
0
  is  mapped  by  A.   Show
(i)  f  is  surjective;
(ii)  f  is  smooth;
(iii)  G
k,n
  is  compact.
(2)  Do  the  same  with  the  complex  Grassmanian  G
k,n
(C),  using  U(n)  in  place  of  SO(n).
98
Exercise  5.27.   In  this  exercise  we  dene  and  study  the  Pl ucker   embedding  of   G
k,n
  in  a
certain  projective  space.
(1) There are
_
n
k
_
k-element subsets of 1, . . ., n.   Dene a map P  : (R
n
)
k
  R
n
k
as follows:
given (v
1
, . . ., v
k
)  (R
n
)
k
, form the nk matrix M  as in 5.18.   Map (v
1
, . . ., v
k
) to the k-tuple
consisting  of  the  determinants  of  all  the  distinct  k k  submatrices  of  M.   Show
1.   v
1
, . . ., v
k
  are  linearly  independent  if  and  only  if  P(v
1
, . . ., v
k
) ,= 0.
2.   P(v
1
, . . ., v
i
, . . ., v
j
 + v
i
, . . ., v
k
) = P(v
1
, . . ., v
k
).
3.   If  v
1
, . . ., v
k
  span  the  same  k-plane  as  w
1
, . . ., w
k
  then  P(v
1
, . . ., v
k
)  is  a  scalar  multiple
of  P(w
1
, . . ., w
k
).
4.   P  passes   to  the  quotient   to  give  rise  to  a  smooth  map  p  :   G
k,n
    RP
n
k
1
.   More
precisely,   if   (R
n
)
k
0
  is   the   open  subset   of   (R
n
)
k
consisting  of   linearly  independent   k-
tuples  of  vectors,  there  is  a  smooth  map  p  making  the  diagram
(R
n
)
k
  P
   R
n
k
0
   
G
k,n
p
   RP
n
k
1
commute, where the left vertical arrow takes (v
1
, . . ., v
k
) to Spv
1
, . . ., v
k
 and the right
vertical  arrow  is  the  usual  quotient  map.
5.   p  is  injective.
6.   p  is  dierentiable.
7.   p  is  an  embedding  (a  dieomorphism  onto  its  image).
(2) Check  that if the constructions are made with  C in place of  R then we obtain a holomor-
phic  embedding  of  G
k,n
(C)  into  CP
n
k
1
.
For  more  information  on  the  Pl ucker  embedding,  see  e.g.   [4].
Terminology  A  smooth  manifold  is  really  a  pair,   consisting  of  a  topological  manifold  and
a  smooth  structure.   It  becomes  slightly  tedious  to  refer  always  to  smooth  manifolds  in  this
way,  however,  and  so  we  will  usually  leave  the  smooth  structure  unnamed,  and  refer  simply
to  a  smooth  manifold  M.   This  is,   after  all,   what  we  do  when  we  refer  to  a  group  G:   we
mean  a  pair  consisting  of   a  set  together  with  a  multiplication.   When  we  speak  of  smooth
manifolds  in  this  way,   we  will   speak  of   smooth  charts,   meaning  charts  which  are  smooth
dieomorphisms  with  respect  to  the  given  (but  unnamed)  smooth  structure.   However,   one
must  be  careful:   the  same  manifold  (as  topological  space)  may  have  many  dierent  smooth
99
structures.   This  phenomenon  was  rst  described  by  Milnor  in  the  1960s:   he  showed  that
there  are  28  dierent  smooth  structures  on  the  sphere  S
7
.
5.4   Quotient  spaces  as  manifolds
Weve already observed the versatility of the notion of the quotient of a topological space by
an  equivalence  relation.   The  abstract  denition  of  manifold  explored  in  this  chapter  allows
us to recover some of this versatility  in the case where the equivalence relation is induced by
a  smooth  group  action.
Let  M  be  a  manifold  and  let  G  be  a  group  of  dieomorphisms  of  M.   Dene  a  relation
  on  M  by
x
1
   x
2
  if  there  exists    G  such  that  (x
1
) = x
2
.
Its   easy  to  check  that   this   is   an  equivalence   relation.   We  denote  the  set   of   equivalence
classes  by  M/G,  and  the  quotient  map  M    M/G  by  q.   Its  useful  to  write  q(x) = [x].
Example  5.28.   1.   M  = S
2
, G = id
S
2, antipodal  map. Thus x
1
   x
2
  if x
1
  = x
2
.   This
is  the  case  studied  in  Example  1.41(ii).   The  quotient  is  RP
2
.
2.   M  =  R
n+1
 0,   G  =  group  of  scalar  maps  x    x  :      R
.   So  x
1
    x
2
  if  there
exists  a  non-zero    R  such  that  x
1
  = x
2
.   The  quotient  is  RP
n
.
3.   M  =  R
2
,   G  =  group  of   translations  T
m,n
(x, y)  =  (x + m, y  + n)  with  m, n    Z.   So
(x
1
, y
1
) = (x
2
, y
2
)  if  (x
1
x
2
, y
1
y
2
)  Z Z.
4.   M  =  R
2
,  G  is  the  group  generated  by  the  two  maps
1
(x, y) = (x, y + 1),   
2
(x, y) = (x + 1, 1 y).
In each  of the rst three examples,  the abstract structure  of the groupG is evident:   it is,
respectively,   Z/2  (in  its  incarnation  as  1),   R
and  R
2
/Z  Z
respectively.
Proposition  5.29.   If  X  is  a  topological  space  and  G  is  a  group  of  homeomorphisms  of  X,
then  the  quotient  map  q : X    X/G  is  an  open  map.
Proof   By  denition  of  the  quotient  topology,  a  subset  W  of  X/G  is  open  if  its  preimage
in  X,   q
1
(W),   is  open.   Let  U    X  be  open.   We  want  to  show  that  q(U)  is  open  in  X/G.
Now  q
1
(q(U))  =  
G
(U).   As  each    is  a  homeomorphism,   each  set  (U)  is  open.   So
q
1
(q(U))  is  a  union  of  open  sets,  and  therefore  open.   2
When  is  M/G,   with  the  quotient  topology,   a  smooth  manifold?   More  precisely,   when
can  we  construct,  in  a  natural  way,  a  smooth  structure  on  M/G?  The  smooth  structure  on
100
M/G should evidently  bear some relation to the smooth structure on M, so for example  the
map  q  should  be  smooth  with  respect  to  the  smooth  structures  on  M  and  on  M/G.   It  is
also  desirable  that  the  following  usinversal  property  should  hold:
every time there is a smooth map f  : M    N  such that f(x
1
) = f(x
2
) if x
1
   x
2
, then there
exists  a  unique  smooth  map
  
f  : M/G/    N  (the  quotient  map)  making  the  diagram
M
f
E
E
E
E
E
E
E
E
E
q
M/G
 N
commutative.
It  is  one  of   the  features  of   the  denition  of   quotient   topological   space  that  the  corre-
sponding  property  holds,  with  continuous  in  place  of  smooth.
The  notion of  manifold is more  delicate  than that of topological space.   It turns out that
not  every  quotient  M/G  is  a  manifold.
Example  5.30.   M  =  R,  G = 1  acting  by  multiplication.   So  x
1
   x
2
  if  x
1
  = x
2
.   The
quotient  space  is  homeomorphic  to  the  half-space   R
0
  (exercise),   and  the  point  [0]   has  no
neighbourhood  in  R/G  homeomorphic  to  an  open  set  in  R.
Denition 5.31.  The group G of homeomorphisms of the topological space X  acts properly
discontinuously  if   for   every   x     X  there   is   a  neighbourhood  W  of   x  in  X  such  that
(W)  W  =   if    G  and   ,= id
X
.
In  Examples  5.28(i),(iii),(iv),  G  acts  properly  discontinuously.   In  Examples  5.28(ii)  and
5.30  it  does  not.
Proposition  5.32.   If   the  group  G  of   homeomorphisms  of   X  acts  properly  discontinuously
then  q : X    X/G  is  a  local  homeomorphism.
Proof   The  statement   means  that   for  all   x    X,   there  exists   a  neighbourhood  W  of   x
such  that  q
|
  : W    q(W)  is  a  homeomorphism.   Because  q  is  continuous  and  open,  its  only
necessary  to show  that there  is a  neighbourhood W  of  x on  which  q  is  1-1.   But this  holds  if
W  is  as  in  Denition  5.31.   2
We  impose  one  further  condition:
Denition  5.33.   The  group  G  of   homeomorphisms  of   the  topological   space  X  acts  with
property  (*)  if  for  every  x
1
, x
2
  X  with  x
1
  not  equivalent  to  x
2
,  there  are  neighbourhoods
U
1
, U
2
  of  x
1
, x
2
  such  that
for  all  
1
, 
2
 G,   
1
(U
1
)  
2
(U
2
) = .   (5.5)
By taking  
1
  = 
2
  = id
X
, we  see  that  X  must  be very  nearly  Hausdor for it to  be possible
for  a  group  of  dieomorphisms  to  act  with  property  (*):   inequivalent  points,  at  least,  must
have  disjoint  neighbourhoods.
101
Proposition  5.34.   If  G  acts  on  X  with  property  (*)  then  X/G  is  Hausdor.
Proof   Let   [x
1
]   ,=  [x
2
]     X/G.   Then  the  points   x
1
, x
2
  in  X  are  inequivalent.   Choose
neighbourhoods  U
1
  and  U
2
  of  x
1
  and  x
2
  satisfying  (5.5).   Then  q(U
1
)  and  q(U
2
)  are  disjoint
neighbourhoods  of  [x
1
]  and  [x
2
].   2
Proposition  5.35.   (i)  If  X  is  2nd  countable  then  X/G  is  2nd  countable.
(ii)  If  X  is  paracompact  then  X/G  is  paracompact.
Proof   (i)   2nd  countability  just   means   having  a  countable   dense   set.   This   property  is
obviously  inherited  by  a  quotient.
(ii)  In   progress.   2
Let  the  group  G  act  properly  discontinuously  on  the  manifold  M.   The  inverses  to  the
local   homeomorphisms  of  Proposition  5.32,   followed  by  charts  on  M,   make  up  an  atlas  on
M/G,  so  at  least  M/G  is  a  topological  manifold.   Of  course,  we  have  to  shrink  the  open  set
W  until   it  lies  in  the  domain  of   a  chart  on  M,   but  such  shrunken  Ws  still   cover  M,   and
their  images  in  M/G  cover  M/G.
 } 
(q    )
1
|W
(q    )
1
|W
R
m
M
q
M/G
o
Theorem  5.36.   If  M  is  a  smooth  manifold  and  G  is a  group  of  dieomorphisms of M,  and
acts  properly  discontinuously  on  M,  then  the  atlas  just  constructed  is  smooth.
Proof   Suppose  that  W
1
q
  q(W
1
)  and  W
2
q
  q(W
2
)  are  homeomorphisms,  and  that
q(W
1
) q(W
2
) ,= .  Write U
1
  = q(W
1
), U
2
  = q(W
2
).   Let  
1
  : W
1
    V
1
  and  
2
  : W
2
    V
2
  be
charts  on  M.   We  have  to  show  that
(
2
 q
1
|W
2
)  (
1
 q
1
|W
1
)
1
102
is smooth on its  domain of  denition,  
1
(q
1
|W
1
(U
1
U
2
)).   In the following  diagram, this  map
is  marked  .
(q     )
1
|W
(q     )
1
|W
V
V
2
1
2
1
1 2
We  begin  by  showing  that
q
1
|W
2
 (q
|
W
1
) : q
1
|W
1
(U
1
 U
2
)    q
1
W
2
(U
1
 U
2
)
is  smooth.   This  is  the  curved  arrow  marked    in  the  diagram.   Suppose  x
1
   q
1
|W
1
(U
1
 U
2
),
and  let  x
2
  be  its  image  under  q
1
W
2
 q
|W
1
.  As  [x
1
] = [x
2
]  in  M/G,  there  is  a    G  such  that
(x
1
)  =  x
2
.   I  claim  that  in  some  neighbourhood  of   x
1
,   q
1
|W
2
 q
|W
1
  coincides  with  .   From
this  it  follows  at  once  that  q
1
|W
2
 q
|W
1
  is  smooth  at  x
1
,  which  is  all  we  need  to  prove.
In  fact  the  claim  is  easy  to  prove.   The  required  neighbourhood  of  x
1
  is  just  
1
(W
2
)  W
1
.
For  if  x
  
1
(W
2
)  W
1
,   then  both  (x
)  and  q
1
|W
2
 q
W
1
(x
)  lie  in  W
2
,   and  both  have  the
same  image  under  q.   As  q  is  1-1  on  W
2
,  they  must  coincide.
Now  let 
1
    V
1
  R
m
and 
2
  : W
2
    V
2
  R
m
be charts  on M.   Then  the the fact that
the  map  ,  q
1
|W
2
 q
W
1
,  is  smooth  means  nothing  other  than  that  the  map    is  smooth:   for
  = 
2
   
1
1
  ,
and  by  the  denition  of   smoothness  of   a  map  between  manifolds  (Denition  5.1(iv)),     is
smooth  if  and  only  if    is.   2
103
5.5   Submanifolds
Now  we  make  a  denition  which  places  our  earlier  notion  of  manifold  in  a  proper  relation
to  the  new  notion.   We  begin  by  using  a  smooth  structure  on  a  smooth  manifold  to  dene
a  notion  of  smoothness  for  maps  dened  on  arbitrary  subsets  of  the  manifold.   This  might
seem  familiar.
Denition  5.37.   (i)  Let   N
n
be  a  smooth  manifold,   and  let   X  be  a  subset   of   N.   A  map
f  : X    R
p
is smooth  (newer) if for each x  X  there  is a  neighbourhood U  of x in  N  and a
map F  : U    R
p
, smooth with respect to the smoothness criterion on N, such that on U X,
f  and  F  coincide
12
.
(ii)  If   M  and  N  are  smooth  manifolds  and  X    M, Y     N,   then  a  map  f  :   X    Y   is  a
dieomorphism  if  it  is  smooth  (newer)  and  has  a  smooth  (newer)  inverse.
(iii)   Let   N  be   a  smooth  manifold.   A  subset   X    N  is   a  smooth  submanifold  of   N  of
dimension  m  if  for  all   x   M  there  is  a  neighbourhood  U  of  x  in  M,   an  open  set  V    R
m
,
and  a  dieomorphism  (newer)   : U    V .   is  called  a  chart  on  M  around  x.
Of  course,   this  is  just  a  repeat  of  denitions  1.3  and  1.6.   And  what  we  now  see  is  that
the  smooth  manifolds  we  dened  in  Section  1  are  in  fact  smooth  submanifolds  of   R
n
.
5.6   Embedding  manifolds  in  Euclidean  space
Although  we  have  gone  to  some  trouble  to  give  a  denition  of  manifold  without  an  ambient
space,   in  fact  every  smooth  manifold  is  dieomorphic  to  a  submanifold  of  some  Euclidean
space  R
N
.   We will give  a proof only for compact  manifolds.   The  proof is not really  dicult,
though it can seem fussy  and complicated.   The central idea  is that each chart  : U    V  
R
m
embeds at least part of M  into  R
m
.   If M  has a nite atlas 
j
  : U
j
    V
j
  : j  = 1, . . ., N,
we try to piece these embeddings  together to get an embedding into V
1
   V
N
.   The rst
problem is that each chart 
j
  is dened  only on a subset  of M.   To construct our embedding
of   M,   we  have  to  extend  the  domain  of   each  chart  to  all   of   M.   This  is  done  using  bump
functions.   A bump function on a manifold M  is a smooth, non-negative function which takes
the value 1  on a  prescribed  region and the value 0  on another.   A typical  application of such
a function is as follows:   suppose we are given a function f  dened on some open set U   M,
and  taking  values  in  R.   We  wish  to  extend  it  to  a  function  dened  on  all  of  M.   As  posed,
the problem may be insoluble, since f(x) may tend to innity as x approaches the boundary
of  U.   However,   if  we  are  allowed  to  alter  f  in  the  complement  of  some  closed  set  W    U,
then  we  can  use  a  bump  function,   : M    R,  satisfying
(x) =
_
  1   if  x  W
0   if  x  /  U
  .
12
We  will  retain  the  adjective  newer  for  at  least  one  paragraph.
104
The  function  f   coincides  with  f   in  W,   and  (f)(x)  now  tends  to  0  as  x  approaches  the
boundary  of  U.   Thus  we  can  extend  it  to  a  smooth  function
  
f  : M    R  dened  by
f(x) =
_
  (f)(x)   if  x  W
0   if  x  /  U
  .   (5.6)
We  will  use  bump  functions  in  very  much  this  way,  to  extend  charts  on  M  so  that  they  are
dened  on  all  of  M.   We  will  need  only  a  very  simple  kind  of  bump  function:
Lemma   5.38.   Let   B(a, R)   be  a  ball   in   R
m
,   with  and  let   0  <  .   There  exists   a  smooth
function   :  R
m
  [0, 1]  such  that
(x) =
_
  1   if  x  B(0, R)
0   if  x  /  B(0, R + )
In order not to interrupt the discussion, I will leave some details to Exercises V, and indicate
here  only  the  outline  of  a
Proof
The  function
f(x) =
_
  e
1/x
if  x ,= 0
0   if  x = 0
is  everywhere  smooth,   and  all   its  derivatives  vanish  at  0.   Thanks  to  this  last  property,   we
can  use  it  toglue  together  arbitrary  smooth  functions.   For  example,  we  can  glue  together
the function f, on the positive  half of the real line,  with the function 0  on the negative  half,
to  get  the  smooth  function
g(x) =
_
  e
1/x
2
if  x > 0
0   if  x  0
With a bit of ingenuity  we  can use this  idea  to produce,  for any interval [a, b] and any   > 0
a  smooth  bump  function    :   R    [0, 1]   taking  the  value  1  in  [a, b]   and  the  value  0  outside
(a , b + ).
c d b a
1
To  prove  the  lemma,   take  a  bump  function    :   R    [0, 1]   such  that  (t)  =  1  for  t    R
2
,
(t) = 0  for  t  (R + )
2
.   Dene   :   R
m
  [0, 1]  by  (x) = (|x a|
2
).   2
Theorem  5.39.   Suppose  that   M  is  a  smooth  compact   manifold.   Then  for  some  T   there
exists  an  embedding  f  of  M  into  R
T
.
105
Proof   We  approach  the  conclusion  by  iterated  approximation.
Step  1  Because  we  want  to  use  the  elementary  bump  functions  constructed  in  the  lemma,
we   rst   construct   an  atlas   whose   domains   are   all   dieomorphic   to  Euclidean  balls.   Let
: U
  there  exists  R
a
  > 0  such  that
B(a, 2R
a
)  V
.   (5.7)
The  open  balls  B(a, R
a
)  cover  V
,  so  their  preimages  
1
j
  (B(a, R
a
))  cover  U
.
The  collection  of   all   of   these   balls,   for   all   ,   covers   M,   so  as   M  is   compact   there  is
a  nite  subcover.   Denote  the  members  of   this  subcover  by  U
1
, . . ., U
N
,   and  for  each  i   let
U
i
  = 
1
  (B(a, 2R
a
))  for  the  appropriate  chart  
.   Denote  by  
j
  the  restriction  to  U
j
 ,  and
to  U
j
,  of  
.   Then  
j
  : U
j
    B(a
j
, R
j
)  is  an  atlas,  equivalent  to  the  original  atlas.
i
B(a,2R )
a
B(a,R )
a
M
U
U
i
V
U
Step 2 Now we extend its charts to all of M.   Choose 0 <   < R
a
 and dene a bump function
j
  :   R
m
  [0, 1]   such  that  
j
(x)  =  1  for  x    B(a, R
a
)  and  
j
(x)  =  0  for  x  /   B(a, R
a
 + ).
Write  
j
  = 
j
 
. Then
j
(x) =
_
  1   if  x  U
j
0   if  x  is  outside  some  closed  neighbourhood  of  U
j
  contained  in  U
j
The  map  
j
j
  coincides  with  
j
  = 
on U
j
,  and  is  identically  0  outside  some  closed  neigh-
bourhood of U
j
  contained  in  U
j
 .   It can  thus  be extended  smoothly  to  all of M,  by  giving  it
the  value  0  outside  U
j
 .
Step  3:   We  dene  a  smooth  map  f  : M    R
Nm
by
f(x) =
_
1
(x)
1
(x), . . ., 
N
(x)
N
(x)
_
.   (5.8)
This  is  very  nearly  good  enough.   It  is  an  immersion,   because  
j
j
  =  
j
  on  U
j
,   
j
  is  an
immersion  (indeed  a  dieomorphism),   and  the  U
j
  cover  M.   Now  we  have  to  ensure  that  it
is  injective.
Clearly  
j
j
  is  injective  on  U
j
,   and  so  f  is  injective  on  U
j
.   We  have  therefore  to  devise  a
way  of  distinguishing,  with  our  map,  between  points  in  U
j
  and  points  not  in  U
j
.   Keeping  a
record  of  the  values  of  the  
j
, j  = 1, . . ., N  is  almost  good  enough:   redene  f  by
f(x) = (
1
1
(x), . . ., 
N
N
(x), 
1
(x), . . ., 
N
(x)).
106
With  only  a  tiny  modication  to  our  constructions,   this  map  is  1-1.   The  point  is  to  en-
sure  that  if   the  last  N  components  of   f  agree  on  x
1
  and  x
2
  then  x
1
  and  x
2
  must  both  lie
in  the  same  U
j
.   So  modify  
j
  so  that  it  takes  the  value  1  only  on  U
j
.   Because  
  trans-
forms  U
j
    U
j
  into  a  pair  of   concentric  balls,   this  can  indeed  be  arranged.   Now,   as  the
U
j
  cover  M,   x
1
  must  lie  in  some  U
j
.   So  
j
(x
1
)  =  1.   If   f(x
1
)  =  f(x
2
)  then  we  must  have
j
(x
2
) = 1 also, so x
2
  U
j
, and now it follows from the injectivity of 
j
j
  on U
j
  that x
1
  = x
2
.
To  spare  the  notation,  write  T  = mN  + N.
Step  4:   As  M  is  compact  and  f   is  1-1  and  continuous,   it  is  a  homeomorphism  onto  its
image.   I  claim  that  its  image  is  a  manifold  and  its  inverse  is  smooth.   Suppose  y   f(M)  is
the  image  of  x  U
j
.   Then  V
j
  :=  f(U
j
)  is  a  neighbourhood  of  y  in  f(M).   By  1.48,  applied
to  f   
1
j
  ,  V
j
  is  a  submanifold  of   R
T
,  and  f   
1
j
  : 
j
(U
j
)    U
j
  is  a  dieomorphism.   This
shows  that  f(M)  is  a  submanifold  of   R
T
.   Moreover,  on  V
j
,  f
1
= 
1
j
   (f   
1
j
  )
1
,  and  so
is  smooth.   Since  the  V
  
j
  cover  f(M),  this  completes  the  proof.   2
For later use,  we now give a rather sophisticated  consequence  of the existence  of elemen-
tary  bump  functions.
Theorem  5.40.   Let  M  be  a  smooth  manifold,  and  let  U
  on  M  such  that
(i)  for  each  ,  the  support  of  
a
  is  contained  in  U
(x) = 1
(iii)  Local   niteness:   every  x    M  has  a  neighbourhood  U  such  that   for  all   except  a  nite
number  of  ,  
   0  on  U.
The  collection  
.
We  will  not  prove  the  theorem  here.   Proofs  can  be  found  in  e.g.   Guillemin  and  Pollack,
Dierential  Topology, and  Madsen  and  Tornehave,  From  Calculus  to  Cohomology
5.7   Further  Structure  on  Abstract  Manifolds
Let  M
m
and  N
n
be  smooth  manifolds,  let  f  : M    N  be  a  smooth  map,  let  x  M  and  let
y  = f(x)  N.   Let    be  a  chart  on  M  around  x,  and  let    be  a  chart  on  N  around  y,  with
(x) = w, (y) = z.   Since  the composite   f  
1
is a  smooth (old) map, we  can consider
its  derivative
d
w
(  f   
1
) :   R
m
  R
n
.   (5.9)
107
If we choose dierent  charts 
and
1
)
dw(f
1
)
R
n
dz(
1
)
R
m
d
w
 (
f
1
)
R
n
The  vertical   maps  are  isomorphisms,   so  the  ranks  of   the  top  and  bottom  horizontal   maps
are  equal.   Thus,   we  can  unambiguously  dene  f   to  be  a  submersion,   immersion  or  local
dieomorphism  at  x  according  to  whether  the  derivative  (5.9)  is  surjective,   injective  or  an
isomorphism.
The proofs of the Inverse Function Theorem for manifolds, 1.19, and of the Local Normal
Forms   for   submersions   and  immersions,   1.27  and  1.26,   go  through  unchanged,   and  their
consequences  1.29, 1.31, 1.40 and 1.48, follow.   With a bit of ingenuity, we can even dene
transversality  by  means  of  charts,   and  recover  1.52  for  maps  transverse  to  submanifolds  of
abstract  manifolds.
It seems  curious that we can refer to what is really a property of the derivative  its surjec-
tivity,   injectivity  or  isomorphiusm    before  we  even  have  a  notion  of  tangent  space  T
x
M,
let  alone  of  the  derivative  d
x
f.
So  do  we  really  need  a  notion  of  tangent  vector  and  tangent  space  for  abstract  manifolds?
After  all,   we  managed  to  prove  a  signicant  theorem,   that  a  compact  abstract  manifold  is
dieomorphic  to  a  submanifold  of  Euclidean  space,  without  needing  tangent  spaces.
However,  we will need them.   Several dierent approaches are possible.   Here I will sketch
two.
5.7.1   A  rst  approach  to  tangent  spaces
Let  us  be  guided  by  the  principle  that  what  is  important  about  a  mathematical   object  is
not  what  it  is,   but  what  it  does.   When  we  studied  smooth  submanifolds  of   R
n
,   their  tan-
gent  spaces  were  made  up  of  vectors  which  already  existed  in  the  ambient  space.   What  did
they do?  They dierentiated functions  that is, we could dierentiate functions along them.
Let  M  be  a  submanifold  of   R
N
,   x    M  and  v    T
x
M.   If   f   is  a  smooth  function  dened
on  some  neighbourhood  of  the  point  x,  we  can  measure  the  derivative  of  f  along  the  vector
v,   d
x
f(v).   Because  now  we  are  thinking  of  v  as  the  subject  of  the  action  we  are  trying  to
describe, and f  as its object, we switch their roles in the notation and rewrite d
x
f(v) as v  f.
The  sum  and  product  rules  for  dierentiation  lead  easily  to  the  following  properties:
108
1.   v  (f  + g) = v  f  + v  g
2.   v  (f) =  v  f
3.   v  (fg) = f(x) v  g + g(x) v  f.
The  rst  two  of  these  properties  express  the  fact  that  v  is  an  R-linear  operator
C
(M)
x
    R,
where  C
(M)
x
  is  the  set  of   smooth  functions  dened  on  some  neighbourhood  of   x.   The
third  property  is  known  as  the  Leibniz  property.   Any  map  C
x
  (M)    R  with  these  three
properties  is  called  a  derivation.   We  have  seen  that  every  v  T
x
M  determines  a  derivation
v.   In  fact  the  converse  is  true:   every  derivation  arises  in  this  way.   For  a  moment,   denote
the  set  of  derivations  C
x
  (M)    R  by  D
x
M.
Proposition  5.41.   Let  M
m
be  a  submanifold  of   R
N
.
(i)  D
x
M  is  a  real   vector  space  of  dimension  m.
(ii)  The  map  T
x
M    D
x
M  sending  v   T
x
M  to  v  is  an  isomorphism.
Proof   (i)  First  consider  the  case  M  =  R
m
,   and  let      D
a
M.   By  the  Leibniz  property,
we  have  (1) = (1 1) = 1(1) + 1(1) = 2(1) (where  inside  the  brackets  1  is  the  constant
function  with  value  1).   Hence  (1) = 0.   By  the  linearity  property  (2),  it  follows  that  if  h  is
any  constant  function  then  (h) = 0.
I  claim  that  the  values  (x
j
),   for  j  =  1, . . . , m,   completely  determine  .   To  see  this,   we
need  a  minor  result,  the  so-called  Hadamards  lemma,  proved  in  Exercises  IV:
Lemma  5.42.   Every  function  f   C
a
  (R
m
)  can  be  written  in  the  form
f(x) = f(a) +
j
(x
j
a
j
)g
j
(x)   (5.10)
for  some  functions  g
j
   C
a
  (R
m
).
In  fact  this  is  really  a  form  of  Taylors  Theorem.   In  particular,   by  dierentiating  with
respect  to  x
j
  and  taking  x = a,  we  see  that  g
j
(a) = f/x
j
(a).
Let  f     C
a
  (R
m
).   By  applying  the  lemma  twice,   once  to  f   and  then  to  the  g
j
  in  the
resulting  equation  (5.10),  we  get  the  expression
f(x) = f(a) +
j
g
j
(a)(x
j
 a
j
) +
i,j
(x
i
a
i
)(x
j
a
j
)g
ij
(x).   (5.11)
The  functions  (x
1
 a
1
), . . ., (x
m
 a
m
)  all   vanish  at  a,   so  by  the  Leibniz  property,   ((x
i
a
i
)(x
j
   a
j
))  =  0.   Indeed,   so  also  is  ((x
i
 a
i
)(x
j
   a
j
)g
ij
)  equal   to  0.   Since  (f(a))  =  0,
from  (5.11)  we  get
(f) =
j
g
j
(a)(x
j
a
j
) =
j
f/x
j
(a)(x
j
).   (5.12)
109
Now  if  v = ((x
1
), . . ., (x
m
))  R
m
= T
a
R
m
,  then
v  f  = d
a
f(v) =
j
(x
j
)f/x
j
(a) = (f).   (5.13)
This  proves  (ii)  for  R
m
,  and  it  follows  for  arbitrary  smooth  manifold  M  by  taking  charts.
Since  D
a
M  is  clearly  a  vector  space,  and the  map  T
x
M    D
x
M  is  a  linear  injection,  (i)
follows  also.   2
The  identication  of T
x
M  with  D
x
M  is  so  natural that  D
x
M  is  best  regarded  simply  as
an  alternative  way  of  viewing  T
x
M.   Its  advantage  is  that  it  is  dened  as  soon  as  we  have  a
notion  of  smooth  function.   We  do  not  need  an  ambient  space.
Denition  5.43.   (i)  Let  M  be  a  smooth  manifold.   We  dene  T
x
M  to  be  D
x
M.   A  tangent
vector  to  M  at  x  is  an  element  of  D
x
M.
(ii) If f  : M    N  is a smooth map of smooth manifolds, then d
x
f  : T
x
M  = D
x
M    D
f(x)
N  =
T
f(x)
N  is  dened  as  follows:   for  each  g   C
f(x)
N,  g  f   C
x
  (M).   Given  v   D
x
M,  dene
d
x
f(v)  D
f(x)
N  by
d
x
f(v)  g  = v  (g  f).
In  fact  this  way  of  thinking  of  tangent  vectors  is  rather  fruitful.   It  leads  us  immediately
to  the  possibility  of  viewing  a  vector  eld  v  as  a  rule  for  transforming  each  function  f  into
a  new  function  v  f,  whose  value  at  x  is  d
x
f(v(x)).   Viewing  v  as  an  operator
C
(M) C
(M)
in  this  way,  it  satises
1.   v  (f  + g) = v  f  + v  g
2.   v  (f) = v  f
3.   v  (fg) = f v  g + g v  f.
This  in  turn  leads  to  the  crucially  important  notions  of  the  Lie  bracket  of  two  vector  elds,
and  the  Lie  algebra  of  a  Lie  group.   These  are  explored  in  Exercises  V.
5.7.2   A  second  approach  to  tangent  spaces
For  a  physicist,   there  is  no  reality  beyond  the  measurements  (for  example  of  the  positions
and  velocities  of  a  number  of  objects)  he  or she  takes.   The  measurements  taken  by  another
observer will be dierent, but one set of measurements  can be transformed into the other by
agreed rules of transformation.   Each set of measurements might be thought of as a chart, but
the physicist  has no psychological need for an absolute object of which the charts are partial
views.   The  only  important thing  is  to  know  how  to  transform  one  set  of  measurements  into
another.
110
The  parallels   with  the  notion  of   manifold,   and  of   smooth  atlas,   are  clear.   They  give
rise  to  a  viewpoint  on  the  notion  of   manifold  in  which  a  manifold  is  its  charts  (and  their
transformation  rules).   The   reality  of   the   manifold  is   expressed  in  its   local   co-ordinates
(i.e.   via  its  charts).   Indeed,   we  had  no  diculty  speaking  of  immersions  and  submersions
of   abstract   manifolds,   by  means   of   charts.   We   did  not   need  a  notion  of   tangent   space
independent  of  the  charts.
This  view  encourages  the  denition  of  the  tangent  space  T
x
M  as  being  T
(x)
R
m
,  for  any
chart    around  x,  subject  to  the  identication  that if  v
1
  T
1
(x)
R
m
and  v
2
  T
2
(x)
R
m
,  then
v
1
  and v
2
  are the same if d
1
(x)
(
2
 
1
1
  )(v
1
) = v
2
.   This is made precise with the following
slightly  ponderous  denition:
Denition 5.44.  (i) Let M  be a manifold with smooth atlas  A.   Then T
x
M  is the quotient of
the disjoint union
T
(x)
R
m
(over all 
(x)
R
m
if   d
(x)
(
  )(v
1
) = v
2
.
(ii)  If   f  :   M
m
  N
n
is  a  smooth  map  of   smooth  manifolds,   then  d
x
f  :   T
x
M    T
f(x)
N  is
dened  as  follows:   T
x
M  is  identied  with  T
(x)
R
m
,   and  T
f(x
N  is  identied  with  T
(f(x))
R
n
,
where  and   are charts on M  and N  around x and f(x) respectively.   Then d
x
f  is identied
with  d
(x)
(  f   
1
).
Thus,  the  familiar  diagram
T
x
M
dx
dxf
 T
f(x)
N
d
f(x)
R
m
  d
(x)
(f
1
)
R
n
reappears,  this  time  with  the  vertical  maps  as  identications rather  than  isomorphisms.
5.7.3   A  third  approach  to  tangent  spaces
In  Section  1  we  dened  tangent  vectors  to  M  as  the  derivatives  of  curves  in  M.   This  led  to
an  especially  convenient  description  of  the  derivative  of  a  map  f  : M    N:
d
x
f(
(0)) = (f
(0)).
This  motivates  a  third  view  of  T
x
M.   Each  smooth  parametrised  curve    :  (R, 0)   (M, x)
gives   us   a  way  of   dierentiating   functions   f   :   (M, x)    (R, 0),   by  associating   to  f   the
derivative (f )
(0) = (f   
2
)
(0).   (5.14)
111
Proposition 5.45.  (i) 
1
 
2
  if and only if for some chart  on M  around x, (
1
)
(0) =
(  
2
)
(0).   2
When  M  is  a  smooth  submanifold  of  a  Euclidean  space  R
N
,  we  can  identify  an  equivalence
class   of   curve   germs      :   (R, 0)   (M, x)   with  their   common  tangent   vector   
(0)     R
N
.
When M  is an abstract smooth manifold, we take such an equivalence class as our denition
of  tangent  vector:
Denition   5.46.   (i)   T
x
M  is   the   set   of   equivalence   classes   of   smooth   curve   germs      :
(R, 0)    (M, x)  by  the  equivalence  relation  of  (5.14).
(ii)   If   f   :   M    N  is   a  smooth  map  of   smooth  manifolds,   then  d
x
f   :   T
x
M    T
f(x)
N  is
dened  by  the  formula
d
x
f([]) = [f   ].
The  map  d
x
 : T
x
M    T
(x)
R
m
is a  bijection,  and its  target is  naturally identied  with  R
m
.
Using  d
x
  we  can  give  T
x
M  the  structure  of  an  m-dimensional  vector  space.   This  structure
is  independent  of  the  choice  of  chart  ,   by  the  smoothness  of  each  crossover  maps  and  the
consequent  linearity  of  its  derivative.
5.8   You  dont  have  to  choose
All   three  views  of   T
x
M  are  equivalent.   Each  has  its  advantages  in  certain  contexts.   The
approaches  we  have  used  stem  from  the  three  approaches  to  the  derivative  we  described  in
Section  1.
112
6   Dierential  Forms  and  Integration  on  Manifolds
6.1   First  examples
In this chapter we begin to study analysis on manifolds.   There are two reasons for doing this.
The rst is that manifolds, such as the universe, are home to many objects  forces, electric
and  magnetic  elds,   ows,   etc  etc    which  are  well-modelled  by  the  dierential   forms  we
will   shortly  dene.   The  second,   slightly  less  obvious,   is  that  the  methods  of  analysis,   and
in  particular  the  integration  of   dierential   forms,   lead  to  new  approaches  to  questions  we
have already been thinking about, such as the degrees of smooth maps, oriented intersection
numbers,  etc.
We  begin  with  some  physical   motivation  which  leads  to  the  denition  of  dierential   forms.
If  the  physics  doesnt  help  you  appreciate  the  mathematics,  ignore  it.
Example  6.1.   In  physics  one  often  meets  expressions  like
_
  b
a
F((t))  
(t)dt   (6.1)
where F  is a vector eld on  R
3
and   : [a, b]    R
2
is a parametrisation of a smooth curve  C.
This  integral   is  a  measurement  of  the  amount  of  work  done  in  moving  an  object  along  the
trajectory   while it is subject  to the force F.   In so called conservative  systems,  the force F
is  the  gradient of  a  potential  function  f  (e.g.   gravitational potential  or electrical  potential);
then  the  integral  can  be  written
_
  b
a
f((t))  
(t)dt   (6.2)
But notice that the gradient vector f  is a way of considering the derivative d
x
f  as a vector,
whereas  d
x
f  is  really  a  covector  -  something  which  eats  vectors  and  spits  out  numbers.   We
can  rewrite  (6.2)  as
_
  b
a
(d
(t)
f)(
(t))dt.   (6.3)
Written  like  this,   it  is  clearly  the  integral   of   the  derivative  of   f   ,   and  thus  is  equal   to
f((b)) f((a)). This expresses  a version of the law of conservation of energy:   in a conser-
vative  system,   the  work  done  is  equal   to  the  change  in  the  value  of  the  potential   from  one
end  of  the  trajectory  to  the  other.
The  derivative  d
(x
1
,x
2
)
f  has  matrix
_
f
x
1
,
  f
x
2
_
with respect to the basis of  R
2
dual to the coordinate functions x
1
, x
2
.   It is useful to develop
a  notation  which  avoids  having  to  write  a  matrix.
113
Dene dx
i
  as the covector (i.e.   linear form on  R
m
) whose matrix with respect to the basis
dual  to  the  coordinates  x
1
, . . ., x
m
  is
_
  0        0   1   0        0
(s))ds = 
_
  b
a
(
(t))dt.   (6.4)
The  proof  is  very  simple:   both  parametrisations  of  C  are  dieomorphisms,  so  we  can  write
  =   (
1
 ).
Write  
1
   =: h,  so    =   h.   Then
_
  b
a
(
(t))dt =
_
  b
a
(h(t))h
(t)
_
dt =
_
  b
a
(h(t))
_
h
(t)dt
Now  use  integration  by  substitution;  write  s = h(t)  (as  indeed  it  is);  then  ds = h
(t)dt,  and
_
  b
a
(h(t))
_
h
(t)dt =
_
  h(b)
h(a)
(
(s))ds =
_
_
_
_
d
c
  (
(s))ds   if  h is  increasing
_
c
d
  (
(s))ds   if  h is  decreasing.
Thus
_
d
c
  (
(s))ds is equal to
_
b
a
  (
(t))dt
if  h  reverses  orientation.
This  allows  us  to  dene  the  integral  of  a  1-form over  a  compact  oriented  curve  in  R
n
,
_
C
 :
13
I  am  not  suggesting  that  force  is  really  a  covector rather  than  a  vector,  merely  that  in  our  integral  this
is  how  it  is  functioning
114
Denition  6.2.   Let  C  be  a  compact  oriented  curve  (i.e.   a  1-manifold) in  R
n
and  let    be  a
1-form  on  R
n
.   Choose  an  orientation-preserving  dieomorphism    : [a, b]    C.   Then
_
C
  :=
_
  b
a
(
(t))dt.
The  calculation  we  have  just  made  shows  that  the  integral  is  independent  of  the  choice
of  orientation-preserving  dieomorphism  (parametrisation),  and  thus  is  well-dened.
Returning to the example in which the 1-form was the derivative of some function,   = df:
suppose  that  C  begins  at  P  and  ends  at  Q;  then  we  have
_
C
df  = f(Q) f(P).   (6.5)
The  points  P  and  Q  together  make  up  the  boundary  of  C,   C.   Thus  (6.5)  can  be  written,
somewhat  tendentiously,  as
_
C
df  =
_
C
f.   (6.6)
It is not clear at this point why we should write f(Q)f(P) as an integral.   This will become
clear later.   One thing  that is clear,  however,  is that if P  = Q (i.e.   if  C  is a  closed  curve),  or
in  other  words,  if  C  has  no  boundary,  then
_
C
 df  = 0.   2
Example  6.3.   A  less  trivial  example  of  a  1-form  on  R
2
 0  is  provided  by  the  derivative
of  the polar coordinate function    shown.   The  angle   is not a  well  dened  smooth function
on  all  of   R
2
 0.   In  any  sector  of  angle  less  than  2,  there  is  a  smooth  determination  of  ,
but  it  is  not  unique,  and  determinations  of    in  overlapping  regions  will  in  general  dier.
(x,y)
  [/4,5/4]   [3/4,9/4]
However,  any two determinations 
1
, 
2
  of   dier by a multiple of 2.   It follows that the
derivatives  d
1
  and  d
2
  are  equal.   Thus,  they  piece  together  to  dene  a  single,  well-dened,
smooth 1-form on  R
2
 0, which  we denote  by d.   It is easy to calculate:   if x > 0 then one
determination  of    is  as  arctan(y/x);  thus
d =
  y
x
2
  
  1
1 + (y/x)
2
dx +
  1
x
 
  1
1 + (y/x)
2
dy  =
  ydx + xdy
x
2
+ y
2
  .   (6.7)
115
If   y   >  0  then  another   determination  of     is   as   arccotan(x/y);   you  should  check  that   its
derivative  has  the  same  formula  (6.7).
What  is  the  signicance  of   this  1-form?   If   C  is  an  oriented  curve  in  R
2
 0,   what  is
the  geometrical   meaning  of
_
C
 d?   If   C  is  a  curve  contained  in  a  sector  in  which    is  well
dened,  then  by  (6.5),
_
C
 d  measures  the  dierence  between  the  value  of    at one  end  of C
and  the  other.   But  what  if  C  is  not  contained  in  such  a  sector?  What  about
_
S
1
 d?
Exercise  6.4.   Calculate
_
S
1
 d.   2
2
Example  6.5.   Let X  be the velocity eld of a uid ow in a region U  of  R
3
.   For each point
x  U,  each  pair  of  vectors  u, v,  and  each  positive  real  ,  imagine  a  parallelogram  P
  based
at x and spanned by u and v.   Note that the area of P
, divided by 
2
, is independent of .
1
2
x
v
v
Let
(v
1
, v
2
) = lim
     0
uid  ow  through  P
2
  .   (6.8)
We give this a sign:   ow through P
  is  
2
|u v|.
The unit normal vector in the positive direction is uv/|uv|.   Thus the uid ow through
P
  is  approximated  by
X(x) 
  u v
|u v|
Area  of  P
  = 
2
X(x)  u v.
As      0  this  approximation  improves,  and  indeed  (one  can  check  that)
(u, v) = X(x)  u v.
Let  u = (u
1
, u
2
, u
3
)  and  v = (v
1
, v
2
, v
3
);  then  this  last  expression  is  equal  to
X
1
(x)(u
2
v
3
u
3
v
2
) + X
2
(x)(u
3
v
1
u
1
v
3
) + X
3
(x)(u
1
v
2
u
2
v
2
)
116
= X
1
(x)
u
2
  u
3
v
2
  v
3
X
2
(x)
u
1
  u
3
v
1
  v
3
+ X
3
(x)
u
1
  u
2
v
1
  v
2
.   (6.9)
For each  x  U,  this  is  a  bilinear  form   it  is  linear  in  each  of u  and  v    and  moreover  it is
alternating:   if  we  interchange  the  order  of  u  and  v,  it  changes  sign.
Denition 6.6.  A 2-form on  R
m
is a family of bilinear alternating maps depending smoothly
on  x  R
m
.
Thus    dened  in  (6.8)  is  a  smooth  2-form  on  R
3
.   2
Fussy  remark  6.7.   In  (6.9),   the  subindices   of   the  X
j
  are  rather   neatly  ordered.   Each
determinant   u
i
v
j
   u
j
v
i
  involves   all   of   the  subindices   except   one,   and  the  subindex  of   its
coecient   X
k
  is   the  missing  index.   In  fact   this   is   the  most   symmetric   way  of   assigning
subindices   to  the   variables   and  coecients   in  this   formula.   Taking  care   of   this   kind  of
symmetry can be very helpful in understanding, or even noticing, the structure of complicated
calculations,  and  in  getting  them  right,  and  it  is  worth  while  spending  a  little  time  on  it.
6.2   Determinants,   volume  and  change  of   variable  in  multiple  in-
tegration
Exercise 6.8.  The appearance of alternating multilinear forms when we are considering area
or  volume  is  natural  and  inevitable.   Let  P(u, v)  be  the  parallelogram  spanned  by  vectors  u
and  v,  and  let  [P(u, v)[ be  its  area.   The  formula
Area  = base    height
makes clear that if we add a multiple of u to v, the area of the parallelogram they span does
not  change:
[P(u, v + u)[ = [P(u, v)[.
A
B
C D
u
v  v+   u
Show  that  if  b : V  V     R  is  a  bilinear  form  on  the  vector-space  V ,  then  the  following
are  equivalent:
1.   for  all  u, v  V   and    R,  b(u + v, v) = b(u, v)
2.   for  all  v   V ,  b(v, v) = 0
3.   for  all  u, v  V ,  b(v, u) = b(u, v).
117
2
Now  consider  the  following  sequence  of  pictures:
u
u
v
P(u,v)
v
u= u+ av
v=v+ bu
We transform the parallelogram P(u, v) spanned by vectors u, v  into a rectangle P(u
, v
)
with  its  edges  along  the  coordinate  axes.   Denote  the  area  of  P(u, v) by  [P(u, v)[.
[P(u
, v
)[ = [P(u
, v)[.
It  is  clear,   in  fact,   that  by  adding  to  either  vector  a  multiple  of  the  other,   we  do  not  alter
the  area  of  the  parallelogram  P(u, v).
In  this  respect,   [P(u, v)[  behaves  identically  to  det[u, v],   where  [u, v]  is  the  matrix  with
columns u and v.   The coincidence does not end there:   since, in our diagram, the two vectors
u
and v
, v
)[ = det[u
, v
].
Because  the  process  of   transforming  u, v  to  u
, v
det[u
1
, . . . , u
n
]
.
Here  we  mean,   by  P(u
1
, . . ., u
n
)  the  parallelipiped  spanned  by  u
1
, . . ., u
n
.   The  theorem
can  easily  be  proved  by  essentially  the  same  procedure  as  for  the  2-dimensional   case.   We
know that det[u
1
, . . . , u
n
] is unaltered by adding to one vector a multiple of one of the others,
118
and  show  that  [P(u
1
, . . . , u
n
)[   has  the  same  property.   To  do  this  without  a  drawing,   one
needs  a precise  denition  of  P(u
1
, . . . , u
n
) and  of P(u
1
, . . . , u
i
 +u
j
, . . . , u
n
); equipped  with
such  a  precise  denition,  it  is  easy  to  show  that  the  regions
P(u
1
, . . . , u
n
)  P(u
1
, . . . , u
i
 + u
j
, . . . , u
n
)
and
P(u
1
, . . . , u
i
 + u
j
, . . . , u
n
)  P(u
1
, . . . , u
n
)
are   congruent   (by  a  translation,   in  fact,   just   as   in  the   pictures   above,   where   these   two
regions were the triangles that are lost and gained in each of the two transformations).   From
this  it  follows  that  P(u
1
, . . . , u
n
)  has  the  same  volume  as  P(u
1
, . . . , u
i
 + u
j
, . . . , u
n
).   By
operations of this type it is possible to transform u
1
, . . ., u
n
 into vectors u
1
, . . ., u
n
 lying along
the  coordinate  axes.   Then
[P(u
1
, . . . , u
n
)[ = [P(u
1
, . . . , u
n
)[ =
det[u
1
, . . . , u
n
]
det[u
1
, . . . , u
n
]
.
Further  details  of  the  steps  in  this  argument  are  given  in  Exercises  V.   2
Theorem  6.9  is  the  key  to  the  proof  of   the  change-of-variable  formula  in  multiple  inte-
gration.   We  will  need  this  formula  when  we  prove  a  higher  dimensional  analogue  of  (6.4).
Theorem 6.10.  Suppose that R is a region in  R
m
, h : R    h(R)  R
m
is a dieomorphism,
and  f  : R    R  is  an  integrable  function.   Then
_
h(R)
f  =
_
R
(f   h)[J(h)[
where  J(h) = det[dh].
Sketch  proof.
R
x
u
v
h
h(R)
R
i h(x)
d  h(v)
d  h(u)
x
x
h(R )
i
119
The  diagram shows  a  partition of R  (drawn  as a  rectangle,  though this  is  not necessary)
into  rectangles  R
i
.   For  any  function  g  on  R,   the  integral
_
R
g  is  dened  by  means  of  such
partitions.   If  the  supremum  of  g  on  R
i
  is  achieved  at  x
i
,
i
g(x
i
)vol(R
i
)   (6.10)
is an approximation from above to the value of
_
R
g; the integral itself  is the inmum,  over
all partitions, of  these  approximations  from above.   (Of course  this is not how one  calculates
it,  in  general.)   Now  consider  the  function  g  = f   h.   Given  any  partition  of  R,  by  means  of
the  dieomorphism  h  we  obtain  a  partition  of  h(R)  (though  not  into  rectangles).   Suppose
that on R
i
, f  h achieves  its supremum  at x
i
.   Let  y
i
 = h(x
i
).   Then  on h(R
i
), f  achieves  its
supremum  at  y
i
.   Consider  the  sum
i
f(y
i
)vol(h(R
i
))   (6.11)
where  y
i
  =  h(x
i
).   This   is   an  approximation  from  above  of
_
h(R)
f.   Suppose  R
i
  =  x
i
  +
P(v
1
, . . ., v
m
).   If  R
i
  is  very  small,
h(R
i
)
  approx
=   y
i
 + P
_
d
x
i
h(v
1
), . . ., d
x
i
h(v
m
)
_
.   (6.12)
The  parallelipiped  on  the  right  has  volume  [ det[d
x
i
h(v
1
), . . ., d
x
i
h(v
m
)][.   If  v
i
  = 
i
e
i
,  then
det
_
d
x
i
h(v
1
), . . ., d
x
i
h(v
m
)
det
_
1
d
x
i
h(e
1
), . . ., 
m
d
x
i
h(e
m
)
=
= 
1
  
m
det
_
 h
x
1
, . . .,
  h
x
m
= vol(R
i
)[ det[d
x
h][.
Therefore
i
f(y
i
)vol(h(R
i
))
  approx
=
i
f(h(x
i
))vol(R
i
) [  det[d
x
i
h] [.   (6.13)
When  we  take  the  inmum  over  all   choices  of   partitions  R
i
  of   R,   on  the  left  hand  side
we  get
_
h(R)
f;   on  the  right  hand  side  we  get
_
R
(f   h)[ det[df][.   The  inmum  of  the  set  of
approximations  from  above  is  approached  by  taking  ner  and  ner  partitions,   and  as  the
partition  gets  ner,   the  approximation  (6.13)  becomes  closer.   In  the  limit,   the  two  inma
coincide.   2
Exercise 6.11.  Suppose that A : (R
m
)
k
  R is k-linear and alternating.   Show that A shares
the  following  properties  with  det:
(i)  If  v
i
  = v
j
  for  some  i ,= j  then  A(v
1
, . . ., v
k
) = 0.   Ditto  if  v
i
  = v
j
.
(ii)   A(v
1
, . . ., v
i
 + v
j
, . . ., v
k
) = A(v
1
, . . ., v
i
, . . ., v
k
).   (Hint:   use  linearity  in  the  i-th  argu-
ment  to  write  the  left  hand  side  as  a  sum  of  two  terms.)
(iii)   For   any  permutation    of   k   objects,   A(v
(1)
, . . ., v
(k)
)   =  sign()A(v
1
, . . ., v
k
)   (Hint:
write    as  the  composite  of  a  sequence  of  transpositions.)   2
120
Exercise  6.12.   Suppose  that  A  :   (R
m
)
m
  R  is  m-linear  and  alternating.   Show  that  A  is
a  scalar  multiple  of  det.   Hint:   The  properties  listed  in  the  previous  exercise  enable  you  to
reduce  any  m-tuple  of  vectors  v
1
, . . ., v
m
  of   R
m
to  something  like  row  echelon  form  without
altering  the  value  of   A(v
1
, . . ., v
m
).   This  is  essentially  what  we  did  to  prove  Theorem  6.9,
although  that  proof  used  only  property  (ii)  of  6.11,  and  moreover  contained,  as  a  last  step,
the  calculation  that  the  scalar  is  1.   2
6.3   Dierential   forms
We  are  ready  for  the  denition  of  dierential  forms.
Denition  6.13.   A  dierential   k-form  on  R
m
is   a  smooth  family  of   alternating  k-linear
maps  R
m
   R
m
  R.   Smoothness  here  means  the  following:   a  k-form    on  U   R
m
is
smooth  if,  whenever  v
1
, . . ., v
k
  are  smooth  vector  elds  on  U,  then  the  function  (v
1
, . . ., v
k
)
is  smooth.
The  set  of  all   k-forms  dened  on  an  open  set  U    R
m
is  denoted  
k
(U).   It  is  useful   (and
consistent)  to  regard functions  on U  as 0-forms, and  thus  denote  the  set  of all  smooth func-
tions  on  U  as  
0
(U).
Remark  6.14.   There  is  a  minor  issue  of  notation  here.   Suppose  that  U  is  an  open  set  in
R
m
and      
k
(U).   For  each  point  x    U,   we  get  a  k-linear  alternating  form  (R
m
)
k
  R.
We  could  in  priniciple  denote  it  by  (x),  since  it  depends  on  x.   Since  it  is  to  be  evaluated
on  k-tuples  of  vectors  v
1
, . . ., v
k
  in  R
m
,  we  would  then  have  to  write
(x)(v
1
, . . ., v
k
).
The universally observed convention is to drop reference to x, and simply write (v
1
, . . ., v
k
).
This  is  slightly  uncomfortable  when  rst  encountered.   Later,  we  will  work  with  dierential
forms  on  manifolds.   If      
k
(M),  then  for  each  x  M  we  get  a  k-linear  alternating  map
(T
x
M)
k
  R.   Here,   the  tangent  space  comes  with  the  label   x,   and  so  the  uncomfortable
imprecision  is  removed.   But  in  practice  it  will  not  be  a  problem  even  when  we  are  working
with  k-forms  on  open  sets  in  Euclidean  spaces.
We have already seen what is in some sense the most important example, the determinant,
which  is  a  (constant)  m-form  on  R
m
  constant  in  the  sense  that  it  doesnt  vary  with  x.
Lemma  6.15.   If   U    R
m
,   then  every      
m
(U)  is  of   the  form    =  Ldet,   where  L  is  a
smooth  function  on  U.
Proof   This  is  just  Exercise  6.12  again.   At  each  point,    is  a  scalar  multiple  of  det.   The
form  is  smooth  if  and  only  if  the  scalar  varies  smoothly  with  x.   Thus,     =  Ldet  for  some
smooth  function  L.   2
We  introduce  now  some  notation  which  makes  representing  forms  simpler.
121
Denition  6.16.   Suppose  that   
1
  and  
2
  are  two  1-forms.   Dene  a  2-form  
1
 
2
  (this
reads  
1
  wedge  
2
)  by  the  formula
1
 
2
(u, v) = 
1
(u)
2
(v) 
2
(u)
1
(v).   (6.14)
It is very easy to check that 
1
2
 is indeed alternating and bilinear, and varies smoothly
with  x  provided  
1
  and  
2
  do  so.   For  example,
dx
1
 dx
2
 (u, v) = dx
1
(u)dx
2
(v) dx
2
(u)dx
1
(v) = u
1
v
2
u
2
v
1
.
And  (6.9)  can  be  rewritten  as
(X
1
dx
2
 dx
3
 + X
2
dx
3
dx
1
 + X
3
dx
1
dx
2
)(u, v).
As  another example,  observe  that if e
1
, . . ., e
m
  form a basis  for the m-dimensional  vector
space V , and if 
1
, . . ., 
m
  is the dual basis for the dual vector space V
(so that 
j
(e
i
) = 
ij
),
then  for  i ,= j,
j
(e
i
, e
j
)   =   1
j
(e
j
, e
i
)   =   1
j
(e
s
, e
t
)   =   0  if  i, j , = s, t
.
Lemma  6.17.   (i)  Let   e
1
, . . ., e
m
  be  a  basis  for  the  m-dimensional   vector  space  V ,   and  let
1
. . ., 
m
  be  the  dual  basis  for  V
1i<jm
ij
 
i
j
.
where  the  
ij
  are  real   numbers.
(ii)  Let  U   R
m
and    
2
(U).   Then    can  be  written  uniquely  in  the  form
  =
1i<jm
L
ij
dx
i
dx
j
,
where  the  L
ij
  are  smooth  functions  on  U.
Proof   (i)  Let  A  be  an  alternating  bilinear  map  V  V     R.   We  have
A(u, v) = A(
i
u
i
e
i
,
j
v
j
e
j
) =
i,j
u
i
v
j
A(e
i
, e
j
)
by  bilinearity,  and
i,j
u
i
v
j
A(e
i
, e
j
) =
i<j
u
i
v
j
A(e
i
, e
j
) +
i>j
u
i
v
j
A(e
i
, e
j
).
122
Interchange  the  summation indices  i and j  in the second  summand  on the right:   it becomes
i<j
 u
j
v
i
A(e
j
, e
i
),  and  so  is  equal  to  
i<j
 u
j
v
i
A(e
i
, e
j
),  since  A  is  alternating.   Hence
A(u, v) =
i<j
(u
i
v
j
u
j
v
i
)A(e
i
, e
j
) =
i<j
A(e
i
, e
j
)
i
j
 (u, v)
This  shows  that
A =
i<j
A(e
i
, e
j
) 
i
j
.
(ii) By (i), at each x  U,  is a (unique) linear combination of the constant 2-forms dx
i
dx
j
for 1  i < j   m.   In order that  be a smooth 2-form, these coecients must vary smoothly
with  x,  giving  the  functions  L
ij
.   2
It   will   be  useful   to  dene   the   wedge   product   of   k-forms   and  -forms.   Unfortunately
it  cannot  be  dened  by  means  of   as  simple  a  formula  as  (6.14),   for  two  reasons;   rst,   we
have  to  ensure  that  the  object  we  dene  really  is  alternating,  and  second,  we  would  like  the
associative  law  to  hold:   we  want
1
(
2
3
) = (
1
2
)
3
,
for  forms  
1
, 
2
, 
3
  of  any  degree.   The  result  of  these  requirements  is  a  rather  complicated
formula.   Before  stating  it,   let  me  just  point  out,   in  reassurance,   that  it  encompasses   the
most  important  example,  namely  det:   it  will  turn  out  that  on  R
m
,
det = dx
1
  dx
m
,
and  thus  the  wedge  product  in  the  end  brings  about  an  important  simplication.
Denition  6.18.   Let  U   R
m
and  let  
1
  
k
(U)  and  
2
  
(U). Dene
1
 
2
  
k+
(U)
by
(
1
2
)(v
1
, . . ., v
k+
) =
  1
k!!
S
k+
sign()
1
_
v
(1)
, . . ., v
(k)
_
2
_
v
(k+1)
, . . ., v
(k+)
_
.
For  example,  let  us  evaluate  dx
1
(dx
2
dx
3
)  on  the  triple  of  vectors  (u, v, w).   We  have
dx
1
 (dx
2
dx
3
) (u, v, w) =
1
2!
_
dx
1
(u)
_
dx
2
dx
3
 (v, w)
_
dx
1
(u)
_
dx
2
dx
3
 (w, v)
_
+ dx
1
(v)
_
dx
2
dx
3
 (w, u)
_
dx
1
(v)
_
dx
2
dx
3
 (u, w)
_
+dx
1
(w)
_
dx
2
dx
3
 (u, v)
_
dx
1
(w)
_
dx
2
dx
3
 (v, u)
__
= dx
1
(u)
dx
2
dx
3
 (v, w) dx
2
dx
3
 (w, v)
2!
123
dx
1
(v)
dx
2
dx
3
 (w, u) dx
2
dx
3
 (u, w)
2!
+dx
1
(w)
dx
2
dx
3
 (u, v) dx
2
dx
3
 (v, u)
2!
Each of the fractions in this expression is a dierence of 2! terms.   Because the form dx
i
dx
j
is  alternating,  the  second  is  (1)  times  the  rst.   Thus
dx
1
 (dx
2
dx
3
) (u, v, w)
= dx
1
(u) dx
2
dx
3
 (v, w)
+dx
1
(v) dx
2
dx
3
 (w, u)
+dx
1
(w) dx
2
dx
3
 (u, v)
=
u
1
  v
1
  w
1
u
2
  v
2
  w
2
u
3
  v
3
  w
3
1
(
2
3
)(u, v, w) =
1
(u)   
1
(v)   
1
(w)
2
(u)   
2
(v)   
2
(w)
3
(u)   
3
(v)   
3
(w)
Permutation  of the  
i
  results  in  the same  sign-changes  on both sides  of  this  expression,  and
from  this,  associativity  of  the  wedge  product,  at  least  among  1-forms,  follows.
Exercise  6.19.   (i)  Suppose  that   
1
, . . ., 
k
  are  1-forms,   and  v
1
, . . ., v
k
  are  vectors.   Show
that
1
(
2
(  
k
)  ) (v
1
, . . ., v
k
) =
1
(v
1
)        
1
(v
k
)
            
k
(v
1
)        
k
(v
k
)
(6.15)
Hint:   induction on k.   The form of the induction step is suggested by the previous calculation.
(ii)  Conclude  that  the  parentheses  on  the  left  hand  side  of  (6.15)  are  unnecessary.
2
Exercise  6.20.   Let   U  be  an  open  subset   of   R
m
.   Show  that   every  form      
k
(U)  can  be
written  uniquely  as
  =
1i
1
<<i
k
m
L
i
1
i
k
dx
i
1
  dx
i
k
,
where  the  L
i
1
i
k
  are  smooth  functions  on  U.   Hint:   this  is  a  straightforward  generalisation
of   6.17.   Notice,   however,   that  we  have  used  6.19  to  dispense  with  any  parentheses  in  the
expression  dx
i
1
  dx
i
k
.   2
124
Exercise  6.21.   Prove  that  if  
1
  
k
(U), 
2
  
(U)  and  
3
  
m
(U)  then
1
(
2
3
) = (
1
2
)
3
.
That   is,   the  wedge  product   of   forms  of   arbitrary  degree  is  associative.   Hint:   use  6.20  and
6.19(ii).   2
Exercise  6.22.   Let  
1
  
k
(U)  and  
2
  
2
  = (1)
k
1
.
2
6.4   Integration  of  dierential   forms
Just   as   we  could  integrate  1-forms  on  1-manifolds,   we  can  integrate  k-forms   on  R
m
over
compact   oriented  k-dimensional   submanifolds   of   R
m
.   Let  us  rst   dene  the  integral   of   a
k-form    over   a  submanifold  Z  which  is   dieomorphic  to  a  region  R    R
k
with  smooth
boundary.   Suppose  that
  : R    Z
is  an orientation-preserving  dieomorphism,  and  suppose  that   
k
(U)  for some  open set
U  containing  Z.   Let  e
1
  =  (1, . . ., 0), . . ., e
k
  =  (0, . . ., 1)  be  the  standard  basis  for   R
k
.   Then
for  each  x    R,   we  apply    to  the  k-tuple  of  vectors  d
x
(e
1
), . . ., d
x
(e
k
).   This  gives  us  a
number.   We  dene
_
Z
   by
_
Z
  :=
_
R
_
d
x
(e
1
), . . ., d
x
(e
k
)
_
  (6.16)
Note  that  the  vector  d
x
(e
j
)  can  also  be  writen  as  /x
j
.
The integral over the product region R can of course be evaluated as an iterated integral
_
    
_
  
_
d
x
(e
1
), . . ., d
x
(e
k
)
_
dx
1
. . .dx
k
.
Here,   the  symbols  dx
j
  make  another  appearance.   In  some  sense  they  are  the  same  as  the
1-forms  we  discussed  in  Example  6.1,  but  here  their  only  role  is  to  guide  us  in  the  order  of
integration  (rst  integrate  with  respect  to  x
1
,  then  with  respect  to  x
2
,  etc.)
In  order  that  (6.16)  can  be  said  to  dene
_
Z
 ,  we  need
Lemma  6.23.   If   : R
_
d
x
(e
1
), . . ., d
x
(e
k
)
_
=
_
R
_
d
y
(e
1
), . . ., d
y
(e
k
)
_
.   (6.17)
125
This is exactly what we showed in the 1-dimensional case, in the discussion leading up to 6.2.
And  essentially  the  same  proof  works  here,  although because  we  allow  k  > 1  we  have  to  use
the formula for a change of variable in multiple integration, 6.10, where in the 1-dimensional
case  we  used  integration  by  substitution.
Proof   Because  and  are both dieomorphisms, so is 
1
  := h, and we have   = h.
Then
_
d
x
(e
1
), . . ., d
x
(e
k
)
_
= 
_
d
y
(d
x
h(e
1
)), . . ., d
y
(d
x
h(e
k
))
_
,   (6.18)
where  y  = h(x),  by  the  chain  rule.   Write
d
x
h(e
i
) =
ij
e
j
  (6.19)
(so  
ij
  = h
j
/x
i
).
Lemma  6.24.   If  A :   R
k
    R
k
  R  is  any  alternating  k-linear  map,  and  v
1
, . . ., v
k
  are
any  k  vectors  in  R
m
,  then
A
_
1j
v
j
, . . .,
kj
v
j
_
=
11
       
k1
1k
       
kk
A(v
1
, . . ., v
k
).
Proof   This is really a small modication of Exercise 6.12.   Dene a new map B  : (R
k
)
k
  R
by
B
_
_
_
_
11
1k
_
_
,    ,
_
_
k1
kk
_
_
_
_
= A
_
1j
v
1
, . . .,
j
a
kj
v
j
_
.
Because  A  is  alternating,   so  is  B.   Hence,   by  6.12,   B  =   det  for  some      R.   To  nd  the
value  of   ,   we  evaluate  B  on  some  k-tuple  of   vectors  where  we  already  know  the  answer.
Because  B  =  det,  we  have
 =  det
_
_
_
_
1
0
_
_
,    ,
_
_
0
1
_
_
_
_
= B
_
_
_
_
1
0
_
_
,    ,
_
_
0
1
_
_
_
_
= A(v
1
, . . ., v
k
).
where  the  last  equality  is  by  the  denition  of  B.   Hence
A
_
1j
v
j
, . . .,
kj
v
j
_
= B
_
_
_
_
11
1k
_
_
,    ,
_
_
k1
kk
_
_
_
_
= A(v
1
, . . ., v
k
)
11
       
k1
1k
       
kk
.
2
Now  we  continue  the  proof  of  6.23.   By  (6.18)  and  (6.19),
_
d
x
(e
1
), . . ., d
x
(e
k
)
_
= (
1j
d
y
(e
j
), . . .,
kj
d
y
(e
j
)
_
.
126
Apply  Lemma  6.24  with  A =   and  v
j
  = d
y
(e
j
)  for  j  = 1, . . ., k.   We  get
(
1j
d
y
(e
j
), . . .,
kj
d
y
(e
j
)
_
=
11
       
k1
1k
       
kk
_
d
y
(e
1
), . . ., d
y
(e
k
)
_
.
Since 
ij
  = h
j
/x
i
, the determinant here is the jacobian determinant of the dieomorphism
h,  J(h).   Thus
(
1j
d
y
(e
j
), . . .,
kj
d
y
(e
j
)
_
= J(h)  
_
d
y
(e
1
), . . ., d
y
(e
k
)
_
and
  _
R
_
d
x
(e
1
), . . ., d
x
(e
k
)
_
=
_
R
J(h)  
_
d
y
(e
1
), . . ., d
y
(e
k
)
_
  (6.20)
where,  again,  y  = h(x).   But  the  right  hand  side  here  is  just
_
R
_
d
y
(e
1
), . . ., d
y
(e
k
)
_
by  the  change-of-variables  formula  6.10.   2
The  expression  
_
d
x
(e
1
), . . ., d
x
(e
k
)
_
  appearing  in  the  denition  (6.16)  of
_
Z
   is  rather
clumsy.   Here  is  a  more  exible  notation.
Denition  6.25.   Suppose  that  U  and  V   are  open  sets  in  Euclidean  spaces,    
k
(U),  and
h  :   V     U  is  a  smooth  map.   Dene  a  new  k-form,   h
()    
k
(V )  (the  pull   back  of     by
h),  by
h
()(v
1
, . . ., v
k
) = (d
x
h(v
1
), . . ., d
x
h(v
k
)).
Note  that   if     is   a  0-form    i.e.   a  smooth  function  f     then  this   denition  says   that
h
() = f   h.
It  is,   once  again,   slightly  uncomfortable  that  the  x  appears  on  the  right  hand  side  and
not  on  the  left.   See  Remark  6.14  for  an  expression  of  regret.
Example  6.26.   (i)  Let  f  :   R
m
  R,  and  let  t  be  a  coordinate  on  R.   Then  for  v   R
m
,
f
(dt)(v) = dt(d
x
f(v)) = dt(
i
f
x
i
v
i
) =
i
f
x
i
v
i
  =
i
f
x
i
dx
i
  (v).   (6.21)
Thus f
(dt) = df.   The third equality on the previous line is also slightly uncomfortable.   The
expressions on either side of it are dierent because on the left we are thinking of d
x
f(v) as a
vector  in  T
f(x)
R,  while  on the  right  we  are  thinking  of d
x
f(v) as a  number  in  R.   The  1-form
dt  intervenes  to  turn  the  vector  into  a  number.   Identifying  T
f(x)
R  with  R  and  thinking  of
127
d
x
f(v) as  a  number,  we  cause  occasional  notational  confusion.
(ii)  Let  f  :   R
n
  R
2
.   Then  for  vectors  v
1
, v
2
  R
n
,  we  have
f
(dx
1
dx
2
)(v
1
, v
2
) = dx
1
dx
2
 (d
x
f(v
1
), d
x
f(v
2
))
= dx
1
(d
x
f(v
1
)) dx
2
(d
x
f(v
2
)) dx
1
(d
x
f(v
2
)) dx
2
(d
x
f(v
1
))
= d
x
f
1
(v
1
)d
x
f
2
(v
2
) d
x
f
1
(v
2
)d
x
f
2
(v
1
)
= df
1
df
2
  (v
1
, v
2
)
So f
(dx
1
dx
2
) = df
1
df
2
.   In view of (i), this can also be written f
(dx
1
dx
2
) = f
(dx
1
)f
(dx
2
).
2
Slightly  less  immediate,   but  nonetheless   easy  to  prove  directly  from  the  denitions  of
wedge  product  and  pull-back,  is  the  more  general  result
Lemma  6.27.
f
(
1
2
) = f
(
1
) f
(
2
).
2
It  is  an  immediate  consequence  of  the  chain  rule  that
(g  f)
() = f
(g
()).
The  denition  of  pull-back  enables  us  to  write  the  integral  in  (6.16)  as
_
R
()(e
1
, . . ., e
k
)   (6.22)
Now  the k-form 
()(e
1
, . . ., e
k
) = fdx
1
  dx
k
(e
1
, . . ., e
k
) = f.
Denition  6.28.   (i)  Let  R  be  a  region  in  R
k
.   We  dene
_
R
fdx
1
  dx
k
  =
_
R
f(x)dx
1
  dx
k
.
Lemma  6.23  can  now  be  restated  as
Proposition  6.29.   Suppose   that   R
1
  and  R
2
  are   regions   in   R
k
and  h   :   R
1
    R
2
  is   an
orientation-preserving  dieomorphism.   Then  for  any  k-form    on  R
2
,
_
R
1
h
() =
_
R
2
.   (6.23)
And  in  the   light   of   this,   the   denition,   in  (6.16),   of   the   integral   of   a  k-form  over   a
k-dimensional  oriented  submanifold  of   R
m
.   can  be  rewritten  as
128
Denition 6.30.  (i) Let Z  be an oriented submanifold of  R
N
, dieomorphic, by an orientation-
preserving  dieomorphism,  to  a  region  R  in  R
k
.   Let    be  a  k  form  dened  on  some  set  U
containing  Z.   Choose  an  orientation-preserving  dieomorphism    : R    Z.   Then
_
Z
  :=
_
R
().   (6.24)
The  proof   that  this  does  not  depend  on  the  choice  of   R  or    now  runs  as  follows:   if
  : R
1
    Z  and   : R
2
    Z  are  orientation-preserving  dieomorphisms,   then  
1
   =: h
is  an  orientation  preserving  dieomorphism  R
1
    R
2
.   So
_
R
1
() =
_
R
1
(  h)
() =
_
R
1
h
()) =
_
R
2
(),
where  the  last  equality  is  (6.23)  applied  to  the  form  
().
Denition  6.30.   (continued)  (ii)  Let  Z  be  any  compact  submanifold  of   R
N
,  and  let    be  a
k-form  dened  on  some  set  U  containing  Z.   Choose  a  nite  atlas 
i
  : U
i
    V
i
{i=1,...,n}
  for
Z,  and  a  partition  of  unity  
i
  subordinate  to  the  open  cover  U
i
.   (Recall  that  this  means
a  collection  of   smooth  functions  
i
  :   Z    [0, 1]   such  that   supp(
i
)    U
i
,   and  such  that   for
all  x  Z,
i
(x) = 1.)   Dene
_
Z
  =
i
_
U
i
i
.   (6.25)
Exercise 6.31.  Show that the value of the right hand side of (6.25) is independent of the atlas
and partition of unity chosen.   Actually this is quite easy; if 
j
  : U
j
    V
{j=1,...,s}
 is another
atlas and 
j
  is a partition of unity subordinate to the cover U
j
, then U
i
U
{i=1,...,r, j=1,...,s}
is  also  an  open  cover  and  
i
j
  is  a  partition  of  unity  subordinate  to  it.   2
Remark  6.32.   This  is  a  one  of  those  denitions  which  may  seem  utterly  impractical,  since
although  partitions  of  unity  exist,   one  rarely  gets  ones  hands  on  one  explicitly.   There  are
two  answers  to  this  objection.   The  rst  is  that  in  many  of  the  cases  where  we  really  want
to  calculate  some  integral
_
Z
 ,   we  can  use  a  map    :   R    Z  which  is  a  dieomorphism
outside a set of measure zero, and then integrate 
i
1
<<i
k
L
i
1
i
k
dx
i
1
  dx
i
k
where  each  L
i
1
i
k
  is  a  smooth  function  on  U.   Then
d =
i
1
<<i
k
dL
i
1
i
k
dx
i
1
  dx
i
k
.
Notice  that  this  coincides  with  the  already  existing  notion  of  the  derivative  when  k  = 0
and  our  k-form    is  just  a  smooth  function.
Example  6.34.   (i)  d(dx
i
1
  dx
i
k
) = d(1dx
i
1
  dx
i
k
) = d1dx
i
1
  dx
i
k
  = 0.
(ii)  If    = x
1
dx
2
  then  d = dx
1
dx
2
.   So  for  example  d(x
1
dx
2
x
2
dx
1
) = 0.
(iii)  If    = f
1
dx
2
dx
3
 + f
2
dx
3
dx
1
 + f
3
dx
1
dx
2
  then
d  =
_
f
1
x
1
+
  f
2
x
2
+
  f
3
x
3
_
dx
1
dx
2
dx
3
.   (6.26)
(iv)  If    = f
1
dx
1
 + f
2
dx
2
 + f
3
dx
3
  then
d  =
_
f
2
x
1
  f
1
x
2
_
dx
1
dx
2
 +
_
f
1
x
3
  f
3
x
1
_
dx
3
dx
1
 +
_
f
3
x
2
f
2
x
3
_
dx
2
dx
3
.   (6.27)
Classically,  one  meets  the  expressions
div(f
1
, f
2
, f
3
) =
  f
1
x
1
+
  f
2
x
2
+
  f
3
x
3
the  divergence of  (f
1
, f
2
, f
3
))  and
curl(f
1
, f
2
, f
3
) =
_
f
2
x
1
  f
1
x
2
,
  f
1
x
3
  f
3
x
1
,
  f
3
x
2
f
2
x
3
_
130
(the  curl,  or  rotational,  of  (f
1
, f
2
, f
3
).)   The  divergence  gures  in  Gausss  Theorem
_
R
f   n =
_
R
div(f)
where  R  is  a  region  in  R
3
and  n  the  unit  outward  pointing  normal   to  R,   and  the  curl   in
Stokess  Theorem
  _
S
f  =
_
S
curl(f)
where S  is an oriented surface in  R
3
, and the integral on the left is a contour integral, i.e the
integral over  S  of the 1-form f
1
dx
1
 +f
2
dx
2
 +f
3
dx
3
.   Both theorems are special cases  of the
general  version  of  Stokess  theorem  we  prove  below.
(v)  If    = df  for  some  function  f,  then  d = 0.   For
d(df) =
j
d
_
f
x
j
_
dx
j
  =
i,j
2
f
x
i
x
j
dx
i
dx
j
=
i<j
_
  
2
f
x
i
x
j
  
2
f
x
j
x
i
_
dx
i
dx
j
,
and  this  is  equal  to  zero,  by  the  equality  of  mixed  partial  derivatives.
(vi)  If    is  the  1-form  d,   then  d  =  0,   even  though  d  is  not  globally  the  derivative  of  a
function.   It is enough that this should be true locally for the calculation in (v) to go through.
(vii)  Exercise If    
k
(U) for any  k,  then  d(d) = 0.   The  proof of  this  is  a  generalisation
of   the  calculation  in  (v),   involving  equality  of   mixed  partial   derivatives.   It   requires  only
good book-keeping  to  keep  track  of  the  signs.
(viii)  Exercise  If  
1
  
k
(U)  then
d(
1
2
) = (d
1
)
2
 + (1)
k
1
d
2
.
2
The  exterior  derivative  and  the  pull-back  of  forms  are  related  in  a  very  gratifying  way:
Proposition  6.35.   If  U
1
  and  U
2
  are  open  in  R
n
and  R
p
respectively,  and  h : U
1
    U
2
  is  a
smooth  map,  then  for    
k
(U
2
),
d(h
()) = h
(d).
131
Proof   We   have   already  seen,   in  6.26(i),   that   this   is   true   for   0-forms   (functions).   To
prove   it   for   k   >  0,   by  the   R-linearity  of   f
(d) = h
_
d(Ldx
i
1
  dx
i
k
)
_
= h
(dLdx
i
1
  dx
i
k
)
= h
(dL)dh
i
1
  dh
i
k
  = d(h
(L))dh
i
1
  dh
i
k
= d
_
h
(L)dh
i
1
  dh
i
k
_
= d(h
()).
For the penultimate equality we have used 6.34(viii), taking 
1
  = L and 
2
  = dh
i
1
  dh
i
k
(so  that  d
2
  = 0).   2
6.5   Stokess  Theorem
The  nal   theorem  we  will   prove  is   Stokess   Theorem,   which  provides   a  fundamental   link
between
d : forms    forms
 d
  and
    : Manifolds    Manifolds
M  M
There  is  a  suggestive  similarity  between  the  operators  d  and  :
d(d) = 0   and   (M) = .
On  the   other   hand,   d  raises   degree,   whereas      lowers   dimension.   But   they  t   together
beautifully:
Theorem  6.36.   Let  M
m
 R
N
be  a  compact  oriented  manifold  with  boundary  M,  and  let
  
m1
(M).   Then
_
M
d =
_
M
.
Corollary  6.37.   If   M
m
is  a  compact   oriented  manifold  without   boundary  and    =  d  
m
(M)  then
_
M
   = 0.   2
Pre-proof  of  Stokess  Theorem When m = 1, Stokess theorem is just the fundamen-
tal theorem of calculus - provided that we supply the appropriate interpretation for the term
boundary  orientation  in  this  case    compare  (6.5)  and  (6.6).
Lets look in detail at the case M  = [a
1
, b
1
] [a
2
, b
2
]. Admittedly,  this is not a manifold with
boundary  in  the  sense  we  have  dened,  because  of  its  corners.   But  as  an  example  it  is  very
instructive.
132
a b
c
d
1
2
(ii)
(iii)
(i)
(iv)
The orientation chosen in M  induces the boundary orientation shown by the arrows.   Let
  = pdx + qdy.   Then  d = (q/x p/y)dxdy, and
_
M
d  =
_
[a,b][c,d]
(q/x p/y)dxdy
By  denition  of  the  integral  of  an  m-form  over  a  region  in  R
m
,  this  is  just
_
[a,b][c,d]
(q/x p/y)dxdy
We  can  evaluate  this  multiple  integral   as  an  iterated  integral  in  two  ways:   integrating  rst
with  respect  to  x  and  then  with  respect  to  y  or  vice  versa.   It  is  most  convenient  to  use  one
order  for  the  rst  term  and  the  other  for  the  second:
_
M
d  =
_
  d
c
_
  b
a
q
x
(x, y)dxdy 
_
  b
a
_
  d
c
p
y
(x, y)dy dx
By  the  fundamental  theorem  of  calculus,
=
_
  b
a
q
x
(x, y)dx = q(b, y) q(a, y)
and
=
_
  d
c
p
y
(x, y)dy = p(x, d) p(x, c).
So
_
M
d =
_
  d
c
_
q(b, y) q(a, y)
_
dy 
_
  b
a
_
p(x, d) p(x, c)
_
dx   (6.28)
133
In  the   picture   I   have   divided  up  M  into  four   parts,   marked  (i)     (iv).   They  can  be
parametrised  as  follows:
(i)   
1
  : [a, b]    R
2
1
(x) = (x, c)
(ii)   
2
  : [c, d]    R
2
2
(y) = (b, y)
(iii)   
3
  : [a, b]    R
2
3
(x) = (x, d)
(iv)   
4
  : [c, d]    R
2
2
(y) = (a, y)
Of  these,  
1
  and  
2
  preserve  orientation  and  
3
  and  
4
  reverse  it.   Hence
_
M
  =
_
(i)
 +
_
(ii)
 +
_
(iii)
 +
_
(iv)
=
_
  b
a
(
1
(x))dx +
_
  d
c
(
2
(y))dy 
_
  b
a
(
3
(x))dx 
_
  d
c
(
4
(y))dy
Because   the   
i
  have   such  a  simple   form,   their   derivatives   are   all   equal   to  basis   vectors:
1
(x) = 
3
(x) = (1, 0)  and  
2
(y) = 
4
(y) = (0, 1).   Thus
_
M
  =
_
  b
a
p((g
1
(x))dx +
_
  d
c
q(
2
(y))dy 
_
  b
a
p(
3
(x))dx 
_
  d
c
q(
4
(y))dy
=
_
  b
a
p(x, c)dx +
_
  d
c
q(b, y)dy 
_
  b
a
p(x, d)dx 
_
  d
c
q(a, y)dy.
=
_
  b
a
_
p(x, c) p(x, d)
_
dx +
_
  d
c
_
q(b, y) q(a, y)
_
dy.
And  this  is  the  same  as  (6.28).
A  calculation  very  similar   to  this  one  works  in  m  dimensions,   and  is  in  some  ways   more
satisfying  (to me) than the proof which  follows,  in which  the rather simple  idea at the heart
of Stokess Theorem is muddied by the presence of the bump functions of a partition of unity.
I  suggest  it  as  an  exercise.
Proof  of  Stokess  Theorem:   Let  M  be  a compact  oriented  manifold with  boundary.   Let
i
  :   U
i
    V
i
{i=1,...,T}
  be  an  atlas  of   orientation-preserving  charts,   with  each  V
i
  an  open
subset  of  the  half-space  H
m
.   Let  
i
  be  a  partition  of  unity  subordinate  to  the  open  cover
U
i
.   Then
_
M
d  =
_
M
d
_
_
=
i
_
U
i
d(
i
) =
i
_
V
i
1
i
_
d(
i
)
_
=
i
_
V
i
d
_
1
i
  (
i
)
_
134
and  similarly
_
M
  =
i
_
V
i
1
i
  ().
From  a  comparison  of  these  two  expressions,   it  is  clear  that  it  is  enough  to  prove,   for  any
compactly  supported  (m1)-form    and  open  set  V   in  H
m
,  that
_
V
d =
_
V
.
We  have
  =
j
L
j
dx
1
  
dx
j
  dx
m
,
so
d  =
j
L
j
x
j
dx
j
dx
1
  
dx
j
  dx
m
  =
j
(1)
j1
L
j
x
j
dx
1
  dx
m
.
Choose R > 0 such that supp()  [R, R]
m
V .   Because supp()  V  [R, R]
m
, we can
write
  _
V
d =
_
[R,R]
m1
[0,R]
j
(1)
j1
L
j
x
j
dx
1
  dx
m
It is convenient  to integrate the j-th summand rst with respect to x
j
.   For j  < m, this gives
(1)
j1
_
[R,R]
m2
[0,R]
_
_
  R
R
L
j
x
j
dx
j
_
dx
1
  
dx
j
  dx
m
.
But  here  the  inner  integral  is  zero,  since  by  the  fundamental  theorem  of  calculus  it  is  equal
to
L
j
(x
1
, . . ., R, . . ., x
m
) L
j
(x
1
, . . ., R, . . ., x
m
)
and  L
j
  is  identically  zero  outside  (and  therefore  on  the  boundary  of)  [R, R]
m
.   For  j  = m,
we  get
(1)
m1
_
[R,R]
m1
_
_
  R
0
L
m
x
m
dx
m
_
dx
1
. . .dx
m1
and,  once  again  because    vanishes  outside  [R, R]
m
,  this  is  equal  to
= (1)
m
_
[R,R]
m1
L
m
(x
1
, . . ., x
m1
, 0)dx
1
. . .dx
m1
.
Finally,   because  dx
m
(v)  =  0  for  every  vector  v  tangent  to  V ,   if  j  <  m  the  (m  1)  form
dx
1
  
dx
j
  dx
m
 is identically zero on V , and so on V   reduces to L
m
dx
1
  dx
m1
.
The   boundary   orientation   of   R
m1
  0   =  H
m
is   (1)
m1
times   the   orientation   of
R
m1
0  coming  from  its  identication  with  R
m1
.   Therefore
_
V
 =
_
V
d.
2
135
6.6   Concluding  Remarks
For  reasons  of  time,   our  treatment  of  dierential   forms  on  manifolds  only  covers  manifolds
embedded in Euclidean space, and so is seriously incomplete.   Moreover, even here, we spoke
only  of   dierential   forms   dened  on  open  sets   of   the   ambient   Euclidean  space.   This   is
ne  when  we  are  concerned  only  with  integrating  forms  over  submanifolds,   but  forms  play
another,  less  obvious  role,  in  the  study  of  cohomology,  and  for this  one  needs  to  make  sense
of  the  idea  of  a  dierential   form  on  a  manifold  M,   without  reference  to  an  ambient  space.
The  space  of  all  smooth  k-forms  on  a  manifold  M  is  denoted  
k
(M),   and  the  collection  of
spaces  and  linear  maps
0    
0
(M)
  d
  
1
(M)
  d
    
  d
  
n
(M)    0
(where  n =  dimM)  is  known  as  the  de  Rham  complex  of  M.   From  this  complex  important
information about the cohomology of M  can be extracted by purely algebraic procedures.   A
good place to read about this is in the nal chapters of [3], though the treatment there is still
restricted  to  manifolds  embedded  in  Euclidean  space.   More  advanced  treatments,   covering
abstract manifolds as well,  and developing  de Rham cohomology, can be found in the classic
text [1] and in [5].   The book of Conlon, [2], is, despite its name, more encyclopaedic,  though
a  very  useful  reference.
136
7   Appendices
7.1   Appendix  A:  Linear  Maps  and  Matrices
Let  U  be  an  n-dimensional  real  vector  space.   A  choice  of  basis  determines  a  bijection
Linear  maps  U    U  n n  real  matrices
and  an  isomorphism
Invertible  linear  maps  U    U  Gl(n, R),
but  since  in  general  one  basis  is  no  better  than  another,  there  are  many  equally  good  bijec-
tions  and  isomorphisms.   More  generally,  if  U  and  V   are  vector  spaces  of  dimensions  n  and
p  respectively  then  choices  of  basis  E  for  U  and  F  for  V   determine  a  bijection
Linear  maps  U    V     p n  real  matrices.
What  is  the  relation  between  dierent  bijections  obtained  in  this  way?  To  sort  this  all  out,
we introduce some notation which may seem rather ponderous at rst, but works like a Rolls
Royce  once  you  put  it  on  the  road.
Remember that a choice of basis gives us a way of converting vectors in our n-dimensional
space  U  into  n-tuples  of  numbers.   Let  E  = e
1
, . . ., e
n
  be  a  basis  for  U.   If  u  U  is  equal  to
1
e
1
 +   + 
n
e
n
,  then  we  write
[u]
E
  =
_
n
_
_
.
That  is,  [u]
E
  is  the  expression  of  u  with  respect  to  the  basis  E,  as  a  column  vector.
Suppose   that   U  and  V   are   vector   spaces   with  bases   E  =  e
1
, . . ., e
n
  and  F   =  f
1
, . . ., f
m
respectively,  and let T  : U    V  be a linear map.   Let us denote the matrix of T  with respect
to  the  bases  E  in  the  source,  U,  and  F  in  the  target  V   by
[T]
E
F
.
By  denition,
[T]
E
F
  =
_
  [T(e
1
)]
F
       [T(e
n
)]
F
  (7.1)
 its columns are the expressions of the vectors T(e
1
), . . ., T(e
n
) with respect to the basis F.
Multiplication  of  matrices  is  dened  so  that  multiplication  by  [T]
E
F
  converts  the  expression
for  a  vector  u  with  respect  to  the  basis  E,  into  the  expression  for  T(u)  with  respect  to  the
basis  F.   That  is,
[T(u)]
F
  = [T]
E
F
[u]
E
.
137
Once  you  know  the  rule  for  multiplying  a  column  vector  by  a  matrix,   this  last  equality  is
actually  obvious:
[T]
E
F
_
n
_
_
= 
1
 column  1  of  [T]
E
F
  +     + 
n
 column  n  of  [T]
E
F
= 
1
[T(e
1
)]
F
  +       + 
n
[T(e
n
)]
F
= [
1
T(e
1
) +   + 
n
T(e
n
)]
F
= [T(
1
e
1
 +   + 
n
e
n
)]
F
= [T(u)]
F
.
From this everything  else  follows.   For example,  suppose  that  U
  S
  V  and V
  T
  W  are
linear  maps,  and  let  E  be  a  basis  for  U,  F  a  basis  for  V   and  G  a  basis  for  W.   Then
[T]
F
G
  [S]
E
F
  = [T]
F
G
_
  [S(e
1
)]
F
       [S(e
n
)]
F
;
since
[T]
F
G
[v]
F
  = [T(v)]
G
for  any  v   V ,  in  particular
[T]
F
G
[S(e
i
)]
F
  = [T(S(e
i
))]
G
and  thus
[T]
F
G
[S]
E
F
  =
_
  [T(S(e
1
))]
G
       [T(S(e
n
))]
G
= [T   S]
E
G
.
The  equality  we  have  just  deduced,
[T]
F
G
[S]
E
F
  = [T   S]
E
G
,   (7.2)
expresses  the  fundamental   relation  between  multiplication  of   matrices  and  composition  of
linear   transformations;   it   gures   as  Theorem  7.3  in  Derek  Holts   Linear   Algebra  Lecture
Notes.
The  matrix  for  a  change  of  basis  can  also  be  conveniently  represented  in  this  way.   Suppose
that E  and F  are now both bases for the same space U.   Denote the identity  map from U  to
U  by  I.   Then  of  course  [I]
E
E
  and  [I]
F
F
  are  both  identity  matrices:   matrices  with  1s  down
the  diagonal  and  zeros  everywhere  else:   for  example
[I]
E
E
  =
_
  [I(e
1
)]
E
       [I(e
n
)]
E
=
_
  [e
1
]
E
       [e
n
]
E
138
=
_
_
1
0
  
0
  
0
  
0
1
_
_
.
On  the  other  hand,  [I]
E
F
  and  [I]
F
E
  are  not  identity  matrices;  for  example
[I]
E
F
  =
_
  [e
1
]
F
       [e
n
]
F
and  of  course  [e
i
]
F
  is  not  the  i-th  column  of  the  identity  matrix  unless  e
i
  = f
i
.
However,  [I]
E
F
  and  [I]
F
E
  are  mutually  inverse:
[I]
E
F
[I]
F
E
  = [I  I]
F
F
  = [I]
F
F
is  the  identity  matrix.   And  in  particular  if  T  : U    U  we  have
[I]
F
E
  [T]
F
F
  [I]
E
F
  = [I  T]
F
E
  [I]
E
F
= [I  F   I]
E
E
  = [T]
E
E
.
In  other  words,
[T]
E
E
  =
_
  [I]
E
F
_
1
[T]
F
F
  [I]
E
F
,
and  [T]
E
E
  is  the  conjugate  of  [T]
F
F
  by  [I]
E
F
.
7.2   Appendix  B:  Some  Topics  in  Linear  Algebra
Experience  in  previous  years   exams  shows  that  students  often  nd  diculties  with  basic
linear  algebra  questions  of  the  following  type:
Exercise 7.1.  Let W  be a vector space, and let U  and V  be subspaces of W.   In the product
space  W  W,  let  D  be  the  diagonal,  D = (w, w) : w  W.   Show  that
U  + V  = W   if  and  only  if   (U  V ) + D = W  W.
Exercise  7.2.   Let  U, V   and  W  be  vector  spaces,   and  let  S  :  U    W  and  T  :  V     W  be
linear  maps.   Let
X  = (u, v)  U V  : S(u) = T(v).
Let  p : X    U  be  the  restriction  of  the  usual  projection  U  V     U.
(a)  Show  that  p  is  injective  if  and  only  if  T  is  injective.
Now  suppose   also  that   S(U) + T(V )   =  W  (i.e.   the  set   S(u) + T(v)   :   u    U, v     V 
is  all  of  W).
(b)  Show  that  p  is  surjective  if  and  only  if  T  is  surjective.
139
Try  these  questions  now!   Neither  involves  much  more  than  basic  set  theory.
The  notion  of  quotient  space  is  very  useful.   Students  tend  to  meet  it  only  in  group  theory
(quotient  groups)  and  not  in  linear  algebra  (quotient  spaces)  though  the  latter  is  really  a
special  case  of  the  former.
Let V  be a vector space and let W  be a subspace.   Declare vectors v
1
, v
2
  V  to be equivalent
modulo  W  if  their  dierence  v
1
 v
2
  lies  in  W.   This  is  easily  checked  to  be  an  equivalence
relation.   The  quotient space  V/W  is dened  to  be the  set of equivalence  classes.   It is useful
to  denote  the  equivalence  class  of  v  by  [v].
Exercise  7.3.   (i)  Show  that  in  these  circumstances   [v]   =  v  + W(=  v  + w  :   w    W).
Because  the expression  v +W  is more transparent than [v] (for a start, it tells you what the
equivalence  relation  is),  it  is  more  usual.
(ii)   Let   V   =  R
2
and  let   W  be  the  subspace  (x
1
, x
2
)     R
2
:   x
1
  =  x
2
.   Make  a  drawing
showing  W  and  showing  the  cosets  v + W  for  v  = (1, 0), (0, 1) and  (1, 2).
(iii)  Let  U  =  W
=  (x
1
, x
2
)    R
2
:  x
1
  =  x
2
.   Add  U  to  your  drawing  for  (ii),   and  then
nd  a  natural  bijection  U    V/W.
(iv)  Show  that  the  operation  of  addition  dened  in  the  quotient  V/W  (in  general,   not  just
in  the  previous  example)  by
(v
1
 + W) + (v
2
 + W) = (v
1
 + v
2
) + W
(or,  in  the  other  notation,  by  [v
1
] + [v
2
] = [v
1
 + v
2
])  is  well  dened.
(v)  How  is  multiplication  by  scalars  dened  in  V/W?
(vi)  Is  the  bijection  you  found  in  (iii)  an  isomorphism?
(vii)  Suppose  that  e
1
, . . ., e
n
  is  a  basis  for  V ,  and  that  the  rst  k  of  them,  e
1
, . . ., e
k
,  form  a
basis  for  the  subspace  W   V .   Show  that  e
k+1
 + W, . . ., e
n
 + W  is  then  a  basis  for  V/W.
(viii)  Let  V   be  a  vector  space  with  subspaces  U  and  W.   Under  what  circumstances  is  the
map  U    V/W  dened  by
u u + W
(a)  injective
(b)  surjective
(c)  an  isomorphism?
7.3   Appendix  C:  Derivatives
The  other  thing  that  people  struggle  with  is  the  notion  of  the  derivative  of  a  smooth  map.
This is the key denition in the course MA225 Dierentiation, but unfortunately many people
clearly only feel comfortable with the notion of derivative they learned about at A-level.   The
derivative  of   a  smooth  map  f   :   R
n
  R
p
at  a  point  x    R
n
is  not  a  number,   but  a  linear
map  d
x
f  :   R
n
  R
p
,  which  depends  on  x.   It  is  an  easy  consequence  of  the  denition  of  the
derivative  that
d
x
f( x) =   lim
h    0
f(x + h x) f(x)
h
  .   (7.3)
140
Incidentally,   the  statement   that  f   is  dierentiable  at  x  is  little  more  than  the  statement
that  the  limit  in  (7.3)  depends  linearly  on  the  vector    x.   It  is  not  hard  to  nd  examples  of
(non-dierentiable)  maps  f  and  points  x  such  that  the  limit  in  (7.3)  exists  for  all    x,  but  is
not  linear  in   x.   However,   in  this  course,   and  in  the  rest  of  this  appendix,  we  are  concerned
only  with  maps   which  are  everywhere   dierentiable.   For   completeness   I   give  the  formal
denition:
Denition  7.4.   Let  U  be  an  open  set  in  R
n
.   The  map  f  :  U    R
p
is  dierentiable  at  x  if
there  is  a  linear  map  L :  R
n
  R
p
such  that
f(x +  x) = f(x) + L( x) +| x|E(x,  x),
where  the  error  term  E  tends  to  0  as   x  tends  to  zero.   In  other  words,  f  is  dierentiable
at  x  if  there  is  a  linear  map  L :  R
n
  R
p
such  that
lim
 x    0
f(x +  x) 
_
f(x) + L( x)
_
| x|
  = 0.
In  this  case  the  linear  map  L  is  called  the  derivative  of  f  at  x,  and  denoted  by  d
x
f.
Exercise  7.5.   Show  that  if  f  is  dierentiable  at  x  then  (7.3)  holds  for  all  vectors   x.
Theorem  7.6.   (i)  If   all   rst   order  partial   derivatives  of   f  :   R
n
  R  exist  in  some  neigh-
bourhood  of  the  point  x,  and  are  all   continuous  at  x,  then  f  is  dierentiable  at  x.
(ii)  The  map  f  :   R
n
  R
p
is  dierentiable  at  x  if  and  only  if  all   of  its  component  functions
f
i
  :   R
n
  R  are  dierentiable  at  x.   2
Exercise  7.7.   Suppose that f  is dierentiable at x.   Show, using (7.1) and (7.3) above, that
in  the  matrix  of  d
x
f  with  respect  to  the  usual   bases  of   R
n
and  R
p
,   the  entry  in  row  i  and
column j  is the partial derivative of the ith component f
i
  with respect to x
j
,
  f
i
x
j
, evaluated
at  x.
In many  cases  it is  not practicable  to calculate  all these  partial derivatives,  and you  have  to
make use of the notion of the derivative itself, as a linear map.   The simplest way to describe
the derivative  d
x
f  is  to give  a formula  for d
x
f( x), where  x is the point we  are calulating  the
derivative  and   x  is  the  vector  to  which  we  are  applying  it.   It  is  really  important  that  you
have  a  good  understanding  of   this  notion.   It  is  one  of   those  ideas  that  ultimately  makes
mathematicss  simpler   rather  than  more  complicated,   although  it  may  cost  some  eort  to
master it at rst.   It ties together into a single concept all np rst order partials of f, and,
for  example,  makes  remembering  the  chain  rule  absolutely  straightforward:
Theorem  7.8.   The  chain  rule  If   U    R
n
and  V     R
p
are  open  sets,   and  f  :  U    V   and
g  :  V     R
q
are  dierentiable  at  x  and  at  f(x)  respectively,   then  g  f  is  dierentiable  at  x
and
d
x
(g  f) = d
f(x)
g  d
x
f.
141
The  statement  of  the  chain  rule  in  terms  of  partial  derivatives  is  then  an  easy  consequence
of   (7.2)  in  Appendix  A,   whereas  a  statement   of   the  chain  rule  purely  in  terms  of   partial
derivatives  is  quite  complicated  and  easy  to  memorise  wrongly.
The  derivative  plays  a  central  role  in  the  subject  of  dierentiable  manifolds.
Exercise  7.9.   1) Find the derivative at (x, y) of the map f  :  R
3
R
3
  R
3
sending  (x, y) to
the vector product xy.   Find the derivative means:   give an explicit formula for d
(x,y)
f(a, b).
Important:   dont write everything in coordinates.   Expand the expression (x+ha) (y +hb)
using the fact that the vector product is bilinear (i.e.   linear in each of its arguments).   Write
d
(x,y)
f(a, b)  as  a  sum  of  vector  products.
2)   Ditto,   for   the   map  g   :   R
n
  R
n
  R  sending   (x, y)   to   the   scalar   product   x    y   :=
x
1
y
1
 +   + x
n
y
n
.   Important:   write  your  answer  as  a  sum  of  scalar  products.
3)  Find  the  derivative  of   the  map  Mat
22
(R)    R  sending  the  2  2  matrix  M  to  det M.
Here  Mat
22
(R)  is  the  vector  space  of  all   2  2  matrices.   Important:   write  d
A
 det(B)  as  a
sum  of  determinants.
4) Find the derivative of the map det : Mat
33
(R)    R sending the 33 matrix M  to det M.
Essential:   write  d
A
 det(B)  as  a  sum  of  determinants.
5)  On  the  basis  of  (3)  and  (4),  guess  a  formula  for  d
A
 det(B)  for  the  general  case  where  we
consider  det : Mat
nn
(R)    R.   Can  you  prove  your  formula?
6)  What  is  the  kernel   of  d
I
 det,   where  I  is  the  identity  matrix?   If  youve  done  (3)  and  (4)
but  not  (5),  do  this  exercise  just  for  2  2  matrices  and  3  3  matrices,   and  then  guess  the
answer  for  n n  matrices.
7)  Let
p : Mat
np
(R) Mat
pq
(R)    Mat
nq
(R)
be  the  product  map,  p(X, Y ) = XY.  Write  down  a  formula  for
d
(X,Y )
p(A, B).
8)  Inside  Mat
nn
(R)  is  the  open  set  Gl
n
(R)  consisting  of  invertible  matrices.   Consider  the
inversion  map  i : Gl
n
(R)    Gl
n
(R)  sending  A  to  A
1
.
i)  Find  d
I
i  when  n  =  2  (where  I   is  the  identity  matrix).   That  is,   nd  an  expressions  for
d
I
i(A),   where  A  is  a  matrix.   You  must  write  your  anwer  in  terms  of   matrices.   How  does
this  compare  with  the  case  n = 1?
ii) What is the  answer  for n = 3?  For general  n?  Guesswork,  on the basis  of examples,  is  as
good a  route  as  any  for  arriving  at  the  answer,  but  of  course  a  proof  is  eventually  required.
142
iii)  There  is  a  very  simple  proof  which  makes  clever  use  of  the  chain  rule,  by  applying  it  to
the  composition
Gl
n
(R)
  j
  Gl
n
(R) Gl
n
(R)
  iid
  Gl
n
(R) Gl
n
(R)
  p
  Gl
n
(R),   (7.4)
where  j(X)  =  (X, X),   i  id  is  the  map  (X, Y )    (X
1
, Y ),   and  p  is  the  product  map  of
part  (7)  of  this  exercise.   The  composite  of  the  maps  in  (7.4)  maps  everything  to  I,
X  (X, X)
  i1
  (X
1
, X)
  p
  X
1
X  = I
so  its  derivative  at  any  point  (matrix)  X    Gl
n
(R)  is  zero.   Use  the  chain  rule,   and  your
knowledge  of  d
(X,Y )
p,   to  obtain  (and  indeed  prove)  a  simple  formula  for  d
I
i(A)  and,   more
generally,  for  d
X
i(A).   How  does  your  answer  to  the  last  part  compare  with  the  case  n = 1?
What  is  the  crucial  dierence?
8.   Consider  the  map  f  :   Mat
nn
(R)    Mat
nn
R  sending  X    X
3
.   What  is  d
X
f(A)?   Be
careful!   Find  your  answer  by  carefully  computing  the  limit  (7.3).
9.   The  exponential  map  for  n n  matrices  is  dened  by  the  same  formula  as  when  n = 1:
exp(X) = I + X +
  X
2
2!
  +
  X
3
3!
  +  .
Find  d
X
 exp(A).   Find  d
I
 exp(A).
At  a  more  basic  level,   it  is  also  extremely  important  that  you  have  a  clear  idea  of   how  to
write down the matrix of a linear map L : V     W  with respect to chosen bases of the vector
spaces  V   and  W.
8)  Let  A  Mat
nn
(R)  and  B   Mat
pp
(R).   Dene  linear  maps
R
A
  : Mat
np
(R)    Mat
np
(R)
and
L
B
  : Mat
np
(R)    Mat
np
(R)
by  R
A
(X) = XA,  L
B
(X) = BX.
i)  In  the  case  where  n  =  2  and  p  =  1,   choose  a  basis  for  Mat
np
(R)  and  write  down  the
matrices  of  R
A
  and  L
B
  with  respect  to  this  basis.
ii)  Ditto  when  n = p = 2.
iii) What is the relation between  the determinants  of the matrices you have  found in (i) and
(ii)  and  det  A  and  det  B?
iv)  Find  the  answer  to  (iii)  for  general  n, p.
v)  Suppose  that  det  A  =  0.   What  is  the  kernel   of  R
A
?   If  det  B  =  0,   what  is  the  kernel   of
L
B
?
143
7.4   Appendix  D:  Quotient  spaces  in  topology
The circle S
1
is obtained from the interval [0, 1] by joining the end-points.  What does this
mean?  In mathematics, gluing and joining are carried out by means of the quotient topology.
If X  is a topological space and  is an equivalence  relation on X, then the quotient of X  by
,  denoted  X/ ,  is  the  topological  space  dened  as  follows:
  its  points  are  the  equivalence  classes  of  points  in  X.   Thus,  equivalent  points  in  X  are
identied  with  one  another  or  glued  together  in  X/  .
  the map sending x  X to its equivalence class [x]  X/ is denoted by q : X    X/ .
  a  subset  U   X/   is  open  if  and  only  if  q
1
(U)  is  open  in  X.
Example  7.10.   To  glue  the  endpoints  of  [0, 1]  to  one  another,  we  dene  a  suitable  equiva-
lence  relation.   Simply  declare
x  x
if
_
_
_
x = x
,   or
x = 0, x
= 1,   or
x
= 0, x = 1
This  might  seem  articial.   It  is!   We  are  creating  a  topological  space  to  suit  our  purpose.
The topology on X/  (the quotient topology) is dened precisely in order for the following
proposition  to  hold:
Proposition 7.11.  Let  be an equivalence relation on the topological space X  and let X/ 
be  the  quotient  space.   Then
1.   q : X    X/    is  continuous.
2.   Suppose  f  :   X    Y   is  a  continuous  map  with  the  property  that   for  all   x, x
  X,   if
x    x
E
E
E
E
E
E
E
E
E
X/ 
 Y
commutes    that  is,  such  that
  
f   q  = f.
  14
The condition we impose on f  in order that
  
f  be dened is often summarised by saying that
f  respects the  equivalence relation .   The existence of
  
f  is described by saying that f  passes
to  the  quotient  to  dene
  
f.   The  map
  
f  is  the  key  object  in  this  game.
14
We  do  not  prove  this  here,  though  its  very  easy.   At  this  point  its  more  important  to  learn  to  use  the
quotient topology than to prove its properties.   Its also often true that proving that an object or construction
has  certain  properties  becomes  easier  after  one  has  become  familiar  with  using  it.
144
Exercise  7.12.   Show  that if  there  exists  a  map
  
f  : X/     Y  such  that
  
f  q = f, then f
must  respect  the  equivalence  relation.
Example  7.13.  In Example 7.10 we constructed the space [0, 1]/ .   Now we will see that it
is homeomorphic to S
1
.   This is fundamental - but easy, because we have set things up right.
First, dene f  : [0, 1]    Y , taking care that f  respects  the equivalence  relation.   In this case
all that this requirement means is that f(0) = f(1).   A natural choice for f  is the exponential
map  f(x)  =  exp(2ix).   Because  f  respects  the  equivalence  relation,   we  have  a  continuous
map
  
f  :   X/       S
1
.   Now  we  show  that
  
f  is  1-1.   Suppose  that
  
f([x
1
])  =
  
f([x
2
]).   Then
f(x
1
)  =  f(x
2
),   i.e.   e
2ix
1
=  e
2ix
2
.   Therefore  e
2i(x
1
x
2
)
=  1,   and  so  x
1
 x
2
  is  an  integer.
The only way this can happen when x
1
, x
2
  [0, 1] is if x
1
  and x
2
  are the opposite endpoints.
Thus  [x
1
] = [x
2
],  and
  
f  is  1-1.
Exercise  7.14.   Suppose  that  f  :  X    Y   respects  the  equivalence  relation    on  X.   Show
that  if   f   identies  (i.e.   maps  to  the  same  image)  only  points  which  are  equivalent  to  one
another,  then
  
f  is  1-1.
To  conclude  that  [0, 1]/   is  homeomorphic  to  S
1
is  now  easy.   Because  q  is  continuous,
q([0, 1]/  )   is   compact.   As   S
1
  R
2
,   it   is   Hausdor.   As   f   is   surjective,   so  is
  
f.   Thus
f  : [0, 1]/     S
1
is  a  continuous  bijection  from  a  compact  space  to  a  Hausdor  space.   It
follows  that  it  is  a  homeomorphism
15
Exercise 7.15.  On the unit square [0, 1][0, 1], dene an equivalence relation by identifying
boundary  points  as  indicated  by  the  arrows  in  (i)  of  the  following  diagram.
(i) (ii) (iii)
 for each y  [0, 1], the boundary point (0, y) is identied with (1, y), and for each x  [0, 1]
the  boundary  point  (x, 0)  is  identied  with  (x, 1).   Parts  (ii)  and  (iii)  of  the  diagram  below
give an intuitive image of how as a result of these identications, the square becomes a torus:
rst we glue the upper and lower edges, forming a cylinder.   In the process, the left and right
hand edges have become the boundary circles of the cylinder.   Then we glue these two circles,
forming  a  torus.
15
Since  the  domain  is  compact,   any  closed  subset  is  compact  too,   so  its  image  in  the  target  is  compact,
and  therefore closed (a  consequence of the  targets being Hausdor).   So the  map  sends closed sets  to  closed
sets,  and  being  1-1,  must  send  open  sets  to  open  sets.   So  its  inverse  is  continuous.
145
To  do:   Show  that the quotient
_
[0, 1] [0, 1]
_
/  is homeomorphic  to  the product S
1
S
1
.
Hint:   use  the  idea  of  Example  7.13.
Exercise 7.16.  Let S
+
 be the closed upper hemisphere in S
2
.   Dene an equivalence relation
1
  on  S
+
  by  identifying  diametrically  opposite  points  on  the  boundary.   That  is,
x 
1
  x
if
_
  x = x
,   or
x, x
 S
+
  and  x = x
if
_
  x = x
,   or
x = x
q
1
q
2
i
K
K
K
K
K
K
K
K
K
K
  S
2
q
2
S
+
/ 
1
   _ _ _
  S
2
/ 
2
Exercise  7.17.   On the unit disc D, dene an equivalence  relation 
3
  by the same recipe as
1
  in  Exercise  7.16:   diametrically  opposite  points  on  the  boundary  are  identied  with  one
another.   Show  that  D/ 
3
  is  homeomorphic  to  S
+
/ 
1
  and  thus  to  S
2
/ 
2
.
Exercise  7.18.   Let L
2
  be the set of straight lines through the origin in  R
3
.   Show that there
is  a  natural  bijection  L
2
    S
2
/ 
2
.
Exercise  7.19.   (i)  Show  that  the  map  q  : S
2
  S
2
/ 
2
  is  not  only  continuous  but  open.
(ii)  Let  X  be  a  topological  space  and  let  G  be  a  group  acting  continuously  on  X
  16
.   Dene
an  equivalence  relation  on  X  by
x  x
.
Show  that  q : X    X/   is  not  only  continuous  but  open.
Exercise  7.20.   Prove  Proposition  7.11.
Underlying  all   of  this  discussion  is  the  deeper  issue  of  how  we  dene  structures  on  the
objects we wish to study.   One way, followed to a very large extent in these notes (in the rst
four  chapters,  to  be  precise),  is  to  make  use  of  already-existing  structures  by  describing  our
objects  as sub-objects of pre-existing  and well  understood  objects.   The  manifolds  studied  in
the  rst  four  chapters  are  all  embedded  in  some  R
n
,  and  our  notion  of  smoothness  relies  on
the  pre-existing  notion  of   the  smoothness  of   a  map  dened  on  an  open  set  in  R
n
.   By  the
16
That  is,   for  each  g    G  there  is  a  continuous  map  X    X,   (necessarily  a  homeomorphism),   which  we
denote  by  x g  x,  such  that  for  all  x  X,  and  g
1
, g
2
   G,  g
2
 (g
1
 x) = (g
1
g
2
)  x.
146
same token, in group theory is is often easier to study some group as a subgroup of a known
group  (e.g.   as  a  subgroup  of  Gl
n
(R)  or  of  the  group  of  isometries  of  the  plane  or  3-space)
than  as  an  abstract  group,   dened,   for  example,   by  means  of   generators  and  relations.
This  is  why  the  theory  of  (linear)  representations  of  groups is  such  a  powerful  tool in  group
theory.
The alternative approach, often regarded as more abstract, is to view the objects we want
to  describe  as  quotients  of  familiar  objects.   For  example,   Z/m  is  the  quotient  of   Z  by  the
equivalence relation in which n  n
if nn