Szamuely T. Galois Groups and Fundamental Groups (CUP, 2008) (ISBN 0521888506) (276s) - MA
Szamuely T. Galois Groups and Fundamental Groups (CUP, 2008) (ISBN 0521888506) (276s) - MA
FUNDAMENTAL GROUPS
Tamás Szamuely
2
Foreword
Ever since the concepts of the Galois group and the fundamental group
emerged in the course of the nineteenth century, mathematicians have been
aware of the strong analogies between the two notions. In its early formula-
tion Galois theory studied the effect of substitutions on roots of a polynomial
equation; in the language of group theory this is a permutation action. On
the other hand, the fundamental group made a first, if somewhat disguised,
appearance in the study of solutions of differential equations in a complex
domain. Given a local solution of the equation in the neighbourhood of a
base point, one obtains another solution by analytic continuation along a
closed loop: this is the monodromy action.
Leaving the naı̈ve idea of substituting solutions, the next important ob-
servation is that the actions in question come from automorphisms of objects
that do not depend on the equations any more but only on the base. In the
context of Galois theory the automorphisms are those of a separable closure
of the base field from which the coefficients of the equation are taken. For
differential equations the analogous role is played by a universal cover of the
base domain. The local solutions, which may be regarded as multi-valued
functions in the neighbourhood of a base point, pull back to single-valued
functions on the universal cover, and the monodromy action is the effect of
composing with its topological automorphisms.
In fact, the two situations are not only parallel but closely interrelated.
If our complex domain D is just the complex plane minus finitely many
points, a local solution of a linear holomorphic differential equation that
becomes single-valued on a cover with finite fibres lies in a finite algebraic
extension of the field C(t) of meromorphic functions on D. We actually
obtain a one-to-one correspondence between finite extensions of C(t) and
finite covers of D, provided that we allow finitely many exceptional points
called branch points. This opens the way for developing a unified theory
within the category of algebraic curves, first over the complex numbers and
then over a general base field. Algebraic geometers working in the 1950s
realized that the theory generalizes without much effort to higher-dimensional
algebraic varieties satisfying the normality condition.
3
4
Acknowledgments
Tamás Szamuely
6
Contents
3 Riemann Surfaces 73
3.1 Basic Concepts . . . . . . . . . . . . . . . . . . . . . . . . . . 73
3.2 Local Structure of Holomorphic Maps . . . . . . . . . . . . . . 75
3.3 Relation with Field Theory . . . . . . . . . . . . . . . . . . . 79
3.4 The Absolute Galois Group of C(t) . . . . . . . . . . . . . . . 85
3.5 An Alternate Approach: Patching Galois Covers . . . . . . . . 90
3.6 Topology of Riemann Surfaces . . . . . . . . . . . . . . . . . . 93
7
8 CONTENTS
Bibliography 265
Index 272
Chapter 1
9
10 Chapter 1. Galois Theory of Fields
For the proof, see Lang [48], Chapter V, Corollary 2.6 and Theorem 2.8
or van der Waerden [106], §72.
Thus henceforth when speaking of algebraic extensions of k we may (and
often shall) assume that they are embedded in a fixed algebraic closure k̄.
Lemma 1.1.6 Let L|k be a finite extension of degree n. Then L has at most
n distinct k-algebra homomorphisms to k̄, with equality if and only if L|k is
separable.
1.1 Algebraic Field Extensions 11
Corollary 1.1.7 Given a tower L|M |k of finite field extensions, the exten-
sion L|k is separable if and only if L|M and M |k are.
Proof: The proof of (1) ⇒ (2) was given in Lemma 1.2.2 above. The
implication (2) ⇒ (3) follows from the fact that each σ ∈ Gal (ks |k) must
map α ∈ L to a root of its minimal polynomial. Finally, for (3) ⇒ (1) pick
α ∈ L \ k. As ks is Galois over k (Example 1.2.3), we find σ ∈ Gal (ks |k)
with σ(α) 6= α. By (3), this σ preserves L, so its restriction to L yields an
element of Aut(L|k) which does not fix α.
14 Chapter 1. Galois Theory of Fields
Using the proposition it is easy to prove the main results of Galois theory
for finite Galois extensions.
M 7→ H := Aut(L|M ) and H 7→ M := LH
Lemma 1.2.6 A finite extension L|k is Galois if and only if it is the splitting
field of an irreducible separable polynomial f ∈ k[x].
Examples 1.2.9
1. Let m > 2 be an integer and ω a primitive m-th root of unity. The
extension Q(ω)|Q is Galois, being the splitting field of the minimal
polynomial of ω, the m-th cyclotomic polynomial Φm . Indeed, all other
roots of Φm are powers of ω, and hence are contained in Q(ω). The
degree of Φm is φ(m), where φ denotes the Euler function. The Galois
group is isomorphic to (Z/mZ)× , the group of units in the ring Z/mZ.
When m is a prime power, it is known to be cyclic.
4. When k does not contain a primitive m-th root of unity, we may not get
a Galois extension.
√ For instance, take k = Q, m = 3 and a ∈ Q× \Q×3 .
We√define 3 a to be the unique√ real cube root of a.√ The extension
Q( 3 a)|Q is nontrivial because 3 a ∈ /√
Q, but Aut(Q( 3 a)|Q)√ is trivial.
Indeed, an automorphism
√ √in Aut(Q( 3
a)|Q) must send√
3
a to a root
3
of x − a in Q( a), but a is the only one, since Q( a) ⊂√R and
3 3 3
the other two roots are complex. Thus the extension Aut(Q( 3 a)|Q) √
is not Galois. The splitting field of x3 − a is generated over Q by 3 a
and a primitive third root of unity ω that has degree 2 over Q, so it
has degree 6 over Q.
Our final example gives an application of the above ideas outside the
scope of Galois theory in the narrow sense.
1.3 Infinite Galois Extensions 17
σ1 = x1 + x2 · · · + xn ,
σ2 = x1 x2 + x1 x3 + · · · + xn−1 xn ,
..
.
σn = x1 x2 · · · xn .
But by definition K is also the splitting field of f over the field k(σ1 , . . . , σn ).
As k(σ1 , . . . , σn ) ⊂ K Sn and [K : K Sn ] = n!, Remark 1.2.8 shows that
K Sn = k(σ1 , . . . , σn ).
With a little commutative algebra one can say more. The xi , being
roots of f , are in fact integral over the subring k[σ1 , . . . , σn ] ⊂ k(σ1 , . . . , σn )
(see Section 4.1 for basic facts and terminology). Therefore the subring
k[x1 , . . . , xn ]Sn = k[x1 , . . . , xn ] ∩ K Sn of k[x1 , . . . , xn ] is an integral ring ex-
tension of k[σ1 , . . . , σn ]. But as K ⊃ k(σ1 , . . . , σn ) is a finite extension con-
taining n algebraically independent elements, the σi must be algebraically
independent over k. Thus k[σ1 , . . . , σn ] is isomorphic to a polynomial ring;
in particular, it is integrally closed in its fraction field K Sn . It follows that
k[x1 , . . . , xn ]Sn = k[σ1 , . . . , σn ]. This is the main theorem of symmetric poly-
nomials: every symmetric polynomial in n variables over k is a polynomial
in the σi . For more traditional proofs, see [48], Chapter IV, Theorem 6.1, or
[106], §33.
Remark 1.2.12 The above example also shows that each finite group G
occurs as the Galois group of some Galois extension. Indeed, we may embed
G in a symmetric group Sn for suitable n and then consider its action on the
transcendental extension K|k of the above example. The extension K|K G
will then do. However, we shall see in the next section that the analogous
statement is false for most infinite G.
extension it will no longer be true that all subgroups of the Galois group
arise as the subgroup fixing some subextension M |k. The first example of
a subgroup that does not correspond to some subextension was found by
Dedekind, who, according to Wolfgang Krull, already had the feeling that
“die Galoissche Gruppe gewissermaßen eine stetige Mannigfaltigkeit bilde”.
It was Krull who then cleared up the question in his classic paper [47]; we
now describe a modern version of his theory.
Let K|k be a possibly infinite Galois extension. The first step is the
observation that K is a union of finite Galois extensions of k. More precisely:
notation when it is clear from the context. Also, we shall often loosely say
that lim Gα is the inverse limit of the groups Gα , without special reference
←
to the inverse system.
Plainly, this notion is not specific to the category of groups and one can
define the inverse limit of sets, rings, modules, even of topological spaces in
an analogous way.
We now come to the key definition.
Examples 1.3.4
2. Given a group G, the set of its finite quotients can be turned into an
inverse system as follows. Let Λ be the index set formed by the normal
subgroups of finite index partially ordered by the following relation:
Uα ≤ Uβ ⇔ Uα ⊃ Uβ . For each pair Uα ≤ Uβ of normal subgroups we
have a quotient map φαβ : G/Uβ → G/Uα . The inverse limit of this
system is called the profinite completion of G, customarily denoted by
b There is a canonical homomorphism G → G.
G. b
3. Take G = Z in the previous example. Then Λ is just the set Z>0 , since
each subgroup of finite index is generated by some positive integer
m. The partial order is induced by the divisibility relation: m|n iff
mZ ⊃ nZ. The completion Z b is usually called zed hat (or zee hat in
b is also a ring, with multiplication induced by that
the US). In fact, Z
of the Z/mZ’s.
Proof: Only the isomorphism statement Qneeds a proof. For this, define a
group homomorphism φ : Gal (K|k) → Gal (L|k) (where the product is
over all finite Galois subextensions L|k) by sending a k-automorphism σ of
K to the direct product of its restrictions to the various subfields L indexing
the product. That σ(L) ⊂ L for all such L follows from Proposition 1.2.4.
The map φ is injective, since if an automorphism σ does not fix an element
α of ks , then its restriction to a finite Galois subextension containing k(α)
is nontrivial (as we have already remarked, such an extension always exists).
On the other hand, the Main Theorem of Galois theory assures that the
image of φ is contained in lim Gal (L|k). It is actually all of lim Gal (L|k),
← ←
which is seen as follows: take an element (σL ) of lim Gal (L|k) and define a
←
k-automorphism σ of K by putting σ(α) = σL (α) with some finite Galois L
containing k(α). The fact that σ is well defined follows from the fact that
by hypothesis the σL form a compatible system of automorphisms; finally, σ
maps to (σL ) ∈ lim Gal (L|k) by construction.
←
Examples 1.3.7
1.2.9 (2) that this is a Galois extension. It is the union of the chain
of finite subextensions Q(µp ) ⊂ Q(µp2 ) ⊂ Q(µp3 ) ⊂ . . . , where µpr is
the group of pr -th roots of unity. As mentioned in Example 1.2.9 (1),
one has Gal (Q(µpr )|Q) ∼ = (Z/pr Z)× . It follows that Gal (Q(µp∞ )|Q)
is lim(Z/pr Z)× = Z× p , the group of units in the ring Zp . This group is
←
known to be isomorphic to Z/(p−1)Z×Zp for p > 2, and to Z/2Z×Z2
for p = 2 (see e.g. [90], Chapter II, Theorem 2).
Similarly, one obtains that the Galois group of the extension Q(µ)|Q
b × , where Q(µ) is the extension of Q generated by all
is isomorphic to Z
roots of unity.
Lemma 1.3.8 Let (Gα , φαβ ) be an inverse system of groups equipped with
the discrete topology. The inverse limit lim Gα is a closed topological subgroup
Q ←
of the product Gα .
Q
Proof: Take an element g = (gα ) ∈ Gα . If g ∈
/ lim Gα , we have to
←
show that it has an open neighbourhood which does not meet lim Gα . By
←
assumptionQ for some α and β we must have φαβ (gβ ) 6= gα . Now take the
subset of Gα consisting of all elements with α-th component gα and β-th
component gβ . It is a suitable choice, being open (by the discreteness of
the Gα and by the definition of topological product) and containing g but
avoiding lim Gα .
←
Proof: Recall that finite discrete groups are compact, and so is a product of
compact groups, by Tikhonov’s theorem ([66], Theorem 37.3). Compactness
22 Chapter 1. Galois Theory of Fields
of the inverse limit then follows from the lemma, as closed subspaces of
compact spaces are compact. Complete disconnectedness follows from the
construction. For the second statement, note that each open subgroup U
is closed since its complement is a disjoint union of cosets gU which are
themselves open (the map U 7→ gU being a homeomorphism in a topological
group); by compactness of G, these must be finite in number. Conversely, a
closed subgroup of finite index is open, being the complement of the finite
disjoint union of its cosets which are also closed.
Remarks 1.3.10
2. One may ask whether all subgroups of finite index in a profinite group
are actually open. This is false already for the absolute Galois group
of Q (see Exercise 4). However, it has been conjectured for a long
time whether the property holds for those profinite groups that are
topologically finitely generated (i.e. contain a dense finitely generated
subgroup). This conjecture was recently proven by Nikolov and Segal
[71].
We may now state and prove the main theorem of Galois theory for
possibly infinite extensions. Observe first that if L is a subextension of a
Galois extension K|k, then K is also a Galois extension of L and Gal (K|L)
is naturally identified with a subgroup of Gal (K|k).
Gal (M |L) by the natural projection Gal (K|k) → Gal (M |k). Since the
projection is continuous and Gal (M |k) has the discrete topology, UL is open.
We claim that UL = Gal (K|L). Indeed, we have UL ⊂ Gal (K|L), for each
element of UL fixes L; on the other hand, the image of Gal (K|L) by the
projection Gal (K|k) → Gal (M |k) is contained in Gal (M |L), whence the
reverse inclusion. Now if L|k is an arbitrary subextension of K|k, write
it as a union of finite subextensions Lα |k. By what we have just proven,
each Gal (K|Lα ) is an open subgroup of Gal (K|k), hence it is also closed
by Corollary 1.3.9. Their intersection is precisely Gal (K|L) which is thus a
closed subgroup; its fixed field is exactly L, for K is Galois over L.
Conversely, given a closed subgroup H ⊂ G, it fixes some extension L|k
and is thus contained in Gal (K|L). To show equality, let σ be an element of
Gal (K|L), and pick a fundamental open neighbourhood UM of the identity
in Gal (K|L), corresponding to a Galois extension M |L. Now H ⊂ Gal (K|L)
surjects onto Gal (M |L) by the natural projection; indeed, otherwise its im-
age in Gal (M |L) would fix a subfield of M strictly larger than L according
to finite Galois theory, which would contradict our assumption that each el-
ement of M \ L is moved by some element of H. In particular, some element
of H must map to the same element in Gal (M |L) as σ. Hence H contains
an element of the coset σUM and, as UM was chosen arbitrarily, this implies
that σ is in the closure of H in Gal (K|L). But H is closed by assumption,
whence the claim. The assertion about finite extensions follows from the
above in view of Corollary 1.3.9.
Finally, the relation between Galois subextensions and normal subgroups
is proven exactly as in the finite case.
Remark 1.3.12 To see that the Galois theory of infinite extensions is really
different from the finite case we must exhibit non-closed subgroups in the
Galois group. We have already seen such a subgroup in the last two examples
of 1.3.4: the absolute Galois group of a finite field is isomorphic to Ẑ, which
in turn contains Z as a non-trivial dense (hence non-closed) subgroup; there
are in fact many copies of Z embeddded in Ẑ.
The original example of Dedekind was very similar: he worked with the
extension Q(µp∞ )|Q. However, he did not determine the Galois group it-
self (profinite groups were not yet discovered at the time); he just showed
the existence of a non-closed subgroup. His proof was generalised in Krull
[47] to establish the existence of non-closed subgroups in the Galois group
of any infinite extension as follows. First one shows that given a non-trivial
Galois extension K2 |K1 , each automorphism of K1 may be extended to an
automorphism of K2 in at least two ways. From this one infers by taking an
infinite chain of non-trivial Galois subextensions of an infinite Galois exten-
24 Chapter 1. Galois Theory of Fields
sion L|k that Gal (L|k) is uncountable. By the same argument, all infinite
closed subgroups of Gal (L|k) are uncountable, hence countable subgroups in
an infinite Galois group are never closed.
Remark 1.3.13 The absolute Galois group is a rather fine invariant for
fields of finite type. In 1995 Florian Pop proved the following remarkable
theorem: Let K, L be two infinite fields that are finitely generated over their
prime field. Fix separable closures Ks , Ls of K and L, respectively, and
∼
assume there exists a continuous isomorphism Φ : Gal (Ks |K) → Gal (Ls |L)
of profinite groups. Then there exist purely inseparable extensions K ′ |K, L′ |L
∼
with K ′ ∼= L′ . Moreover, there is an isomorphism φ : L′ Ls → K ′ Ks of
separable closures such that Φ(g) = φ−1 ◦ g ◦ φ for all g ∈ Gal (K ′ Ks |K ′ ).
Here recall that an algebraic extension is called purely inseparable if all
of its separable subextensions are trivial. Of course, in characteristic 0 such
extensions are trivial, and one has K = K ′ , L = L′ . In fact, already the
case L = K of Pop’s theorem is interesting: it shows that every continuous
automorphism of the absolute Galois group comes from a field automorphism.
The first nontrivial case of this theorem, that of finite Galois extensions of Q,
was proven by J. Neukirch [68]. Already this special case is quite surprising:
for instance, it shows that if p and q are different primes, then the absolute
√ √
Galois groups of Q( p) and Q( q) cannot be isomorphic. For more on this
fascinating topic, see [77] and [101].
1. The identity functor is the functor idC on any category C which leaves
all objects and morphisms fixed.
Definition 1.4.5 If F and G are two functors with same domain C1 and
target C2 , a morphism of functors Φ between F and G is a collection of
morphisms ΦA : F (A) → G(A) in C2 for each object A ∈ C1 such that for
every morphism φ : A → B in C1 the diagram
ΦA
F (A) −−−→ G(A)
F (φ)y
G(φ)
y
BΦ
F (B) −−−→ G(B)
We can now give one of the notions which will be ubiquitous in what
follows.
1.4 Interlude on Category Theory 27
Definition 1.4.7 Two categories C1 and C2 are equivalent if there exist two
functors F : C1 → C2 and G : C2 → C1 , and two isomorphisms of functors
∼ ∼
Φ : F ◦ G → idC2 and Ψ : G ◦ F → idC1 . In this situation we say that the
functor G is a quasi-inverse for F (and F is a quasi-inverse for G).
If we can actually find F and G with F ◦ G = idC2 and G ◦ F = idC1 ,
we say that C1 and C2 are isomorphic. Finally, we say that C1 and C2 are
anti-equivalent (resp. anti-isomorphic) if C1 is equivalent (resp. isomorphic)
to C2op .
One sees that equivalence of categories has all properties that equivalence
relations on sets have, i.e. it is reflexive, symmetric and transitive. Also,
the seemingly asymmetric definition of anti-equivalence is readily seen to be
symmetric.
In practice, when one has to establish an equivalence of categories it often
turns out that the construction of one functor is easy but that of the quasi-
inverse is rather cumbersome. The following general lemma enables us to
make do with the construction of only one functor in concrete situations.
Before stating it, we introduce some terminology.
Lemma 1.4.9 Two categories C1 and C2 are equivalent if and only if there
exists a functor F : C1 → C2 which is fully faithful and essentially surjective.
Proof: For the proof of the ‘if’ part fix for each object V of C2 an isomor-
∼
phism iV : F (A) → V with some object A of C1 . Such an isomorphism exists
by the second condition. Define a functor G : C2 → C1 by sending each
object V in C2 to the A fixed above, and each morphism φ : V → W to
−1 −1
G(φ) = FAB (iW ◦ φ ◦ iV ) for φ ∈ Hom(V, W ), where B = G(W ) and FAB
is the bijection appearing in the definition of fully faithfulness. The maps
∼ ∼
iV : F (G(V )) → V induce an isomorphism Φ : F ◦G → idC2 by construction.
∼
To construct an isomorphism Ψ : G ◦ F → idC1 , we first need functorial maps
ΨA : G(F (A)) → A for each A in C1 . By fully faithfulness of F it is enough
28 Chapter 1. Galois Theory of Fields
to construct maps F (ΨA ) : F (G(F (A)) → F (A), and we may take as F (ΨA )
the unique preimage of idF (A) by ΦF (A) . A similar construction yields a map
A → G(F (A)) that is an inverse to ΨA . As the construction is functorial in
A, we obtain an isomorphism of functors.
For the ‘only if’ part assume there exist a functor G : C2 → C1 and
∼ ∼
isomorphisms of functors Φ : F ◦ G → idC2 and Ψ : G ◦ F → idC1 . Essen-
tial surjectivity is immediate: given an object C of C2 , it is isomorphic to
F (G(C)) via Φ. For fully faithfulness fix any two objects A, B of C1 and
consider the sequence of maps
induced respectively by FAB , GF (A),F (B) and Ψ. Their composite is the iden-
tity, which implies that FAB is a bijection.
Note that in the above proof the construction of G depended on the
axiom of choice: we had to pick for each V an isomorphism iV . Different
choices define different G’s but the categories are equivalent with any choice.
This suggests that the notion of equivalence of categories means that “up to
isomorphism the categories have the same objects and morphisms” but this
does not mean at all that there are bijections between objects and morphisms.
In order to stress this point we discuss an example that expresses a basic fact
from linear algebra in the language of category theory.
Example 1.4.10 Consider a field k and the category Vecf k of finite dimen-
sional k-vector spaces (with linear maps as morphisms). We show that this
category is equivalent to a category C that we define as follows. The objects
of C are to be the nonnegative integers, and the set of morphisms between
two integers n, m > 0 is to be the set of all n by m matrices with entries in
k; here the identity morphisms are given by the identity matrices and com-
position by multiplication of matrices. There are also canonical morphisms
0 → n and n → 0 for each n ≥ 0. (Notice that this is an example of a
category where morphisms are not set-theoretic maps.)
To show the asserted equivalence, we introduce an auxiliary subcategory
C ′ and show it is equivalent to both categories. This category C ′ is to be
the full subcategory of Vecf k spanned by the standard vector spaces k n . The
equivalence of C ′ and Vecf k is immediate from the criterion of the previous
lemma applied to the inclusion functor of C ′ to Vecf k : the first condition
is tautological as we took C ′ to be a full subcategory and the second holds
because any finite dimensional vector space is isomorphic to some k n .
We now show the equivalence of C ′ and C. Define a functor F : C ′ → C
by sending k n to n and a morphism k n → k m to its matrix with respect to
1.4 Interlude on Category Theory 29
the standard bases. The fact that F is indeed a functor hides a non-trivial
result of linear algebra, namely that the matrix of the composition of two
linear maps φ and ψ is the product of the matrix of φ with that of ψ. One
constructs an inverse functor G by reversing this procedure. It is immediate
to check that in this case F ◦G and G◦F are actually equal to the appropriate
identity functors.
We close this brief overview with a very important notion due to Groth-
endieck. It will not be used until the next chapter.
Definition 1.4.11 Let C be a category. A functor F from C to the category
Sets is representable if there is an object C ∈ C and an isomorphism of
functors F ∼= Hom(C, ).
Recall that the latter functor sends an object A to the set of morphisms
from C to A. There is also an analogous notion for contravariant functors.
The object C is called the representing object.
The following well-known lemma is of pivotal importance. Observe that
if C and D are objects of C, every morphism D → C induces a morphism of
functors Hom(C, ) → Hom(D, ) via composition.
Lemma 1.4.12 (Yoneda Lemma) If F and G are functors C → Sets
represented by objects C and D, respectively, every morphism Φ : F → G of
functors is induced by a unique morphism D → C as above.
Proof: Assume C and D both represent F , and apply the Yoneda Lemma
to the identity map of F .
We shall encounter several interesting examples of representable functors
from Chapter 2 onwards.
30 Chapter 1. Galois Theory of Fields
Lemma 1.5.1 The above left action of Gal (k) on Homk (L, ks ) is continuous
and transitive, hence Homk (L, ks ) as a Gal (k)-set is isomorphic to the left
coset space of some open subgroup in Gal (k). For L Galois over k this coset
space is in fact a quotient by an open normal subgroup.
Proof: This follows from the Qprevious theorem in view of the remark that
given a decomposition A = Li into a product of fields and an element
φ ∈ Homk (A, ks ), the map φ induces the injection of exactly one Li in ks .
Indeed, if φ(Li ) 6= 0, then being a field, Li injects in ks , and on the other
hand, a product Li × Lj cannot inject in ks since ks has no zero-divisors.
Thus Homk (A, ks ) decomposes into the disjoint union of the Homk (Li , ks );
this is in fact its decomposition intoQGal (k)-orbits. For a similar reason,
given another étale k-algebra A′ = L′j , a morphism A → A′ identifies
with a collection of morphisms Li → L′j , one for each i, and these in turn
correspond bijectively to morphisms of the corresponding Gal (k)-sets by the
previous theorem.
1. A is étale.
3. A ⊗k k̄ is reduced.
1.5 Finite Étale Algebras 33
Proof: The ‘only if’ part is obvious. To prove the ‘if’ part, by decomposing
a finite dimensional F -algebra A into a finite direct product of indecompos-
able F -algebras we may assume that A is indecomposable itself. Notice that
under this restriction A can have no idempotent elements other than 0 and
1; indeed, if e 6= 0, 1 were an idempotent then A ∼ = Ae × A(1 − e) would
be a nontrivial direct product decomposition since e(1 − e) = e − e2 = 0
by assumption. The lemma will follow if we show that every nonzero ele-
ment x ∈ A is invertible and thus A is a field. Since A is finite dimensional
over F , the descending chain of ideals (x) ⊃ (x2 ) ⊃ . . . (xn ) ⊃ . . . must
stabilise and thus for some m we must have xn = xn+1 y with an appropriate
y. By iterating this formula we get xn = xn+i y i for all positive integers i,
in particular xn = x2n y n . Thus xn y n = (xn y n )2 , i. e. xn y n is an idempo-
tent. By what has been said above there are two cases. If xn y n = 0, then
xn = (xn )(xn y n ) = 0 which is a contradiction since x 6= 0 by assumption and
A is reduced. Otherwise, xn y n = 1 and thus x is invertible.
Proof of Proposition 1.5.6. The derivation of the third condition from the
second is immediate; actually, the lemma applied to A ⊗k k̄ shows that they
are equivalent. We therefore only have to prove the equivalence of the first
two conditions. To see that (1) implies (2) we may restrict to finite separable
34 Chapter 1. Galois Theory of Fields
[To see this, observe that given a k-algebra homomorphism A → k̄, tensoring
by k̄ and composing by the multiplication map gives a k̄-homomorphism
A ⊗k k̄ → k̄ ⊗k k̄ → k̄; on the other hand the natural inclusion k → k̄
induces a k-homomorphism A ∼ = A ⊗k k → A ⊗k k̄ which composed by
homomorphisms A ⊗k k̄ → k̄ gives a map from the set on the right hand side
to that on the left which is clearly inverse to the previous construction.] The
assumption now implies that the set on the right hand side has dimk̄ (A ⊗k k̄)
elements. But dimk̄ (A ⊗k k̄) = dimk A, whence the claim.
Exercises
1. Show that an inverse limit of nonempty finite discrete sets (for any directed
Q For such an inverse system {Xα : α ∈ Λ}
index set) is nonempty. [Hint:
consider the subsets Xλµ ⊂ Xα consisting of the sequences (xα ) satisfying
φλµ (xµ ) = xλ for a fixed pair λ ≤ µ and use the compactness of the topo-
logical product of the Xα . We shall go through this argument in a more
general case in Lemma 3.4.12.]
(a) Show that Gal (E|Q) is uncountable and has uncountably many sub-
groups of index 2.
(b) Deduce that there are uncountably many subgroups of index 2 in
Gal (Q|Q) that are not open.
Verify that this action satisfies the criterion of the previous exercise.]
[Remark: The statement does not hold if one requires K to be a separable
closure of k. For instance, Artin and Schreier showed in [3] that among the
nontrivial finite groups only Z/2Z can arise as an absolute Galois group.]
7. Let k be a field, and A a finite étale k-algebra equipped with an action
of a finite group G via k-algebra automorphisms; we call such algebras G-
algebras. We moreover say that A is Galois with group G if dimk (A) equals
the order of G and AG = k.
(a) Show that the finite étale k-algebra corresponding to ∆(S) via The-
orem 1.5.4 is isomorphic to k × k if Gal (k) acts on ∆(S) by even
permutations, and is a degree 2 field extension of k otherwise.
(b) If A is the finite étale k-algebra corresponding to S via Theorem 1.5.4,
denote the k-algebra of (a) by ∆(A). Show that when A ∼ = k[x]/(f ) for
a polynomial f without multiple roots, then ∆(A) ∼ = k[x]/(x2 − d(f )),
[Remark: The k-algebra ∆(A) is called the discriminant of the finite étale k-
algebra A. For a description of ∆(A) in the general case, see [45], Proposition
18.24.]
Chapter 2
In the last section we saw that when studying extensions of some field it is plausi-
ble to conceive the base field as a point and a finite separable extension (or, more
generally, a finite étale algebra) as a finite discrete set of points mapping to this
base point. Galois theory then equips the situation with a continuous action of
the absolute Galois group which leaves the base point fixed. It is natural to try
to extend this situation by taking as a base not just a point but a more general
topological space. The role of field extensions would then be played by certain con-
tinuous surjections, called covers, whose fibres are finite (or, even more generally,
arbitrary discrete) spaces. We shall see in this chapter that under some restrictions
on the base space one can develop a topological analogue of the Galois theory of
fields, the part of the absolute Galois group being taken by the fundamental group
of the base space.
In the second half of the chapter we give a reinterpretation of the main theo-
rem of Galois theory for covers in terms of locally constant sheaves. Esoteric as
these objects may seem to the novice, they stem from reformulating in a modern
language very classical considerations from analysis, such as the study of local
solutions of holomorphic differential equations. In fact, the whole concept of the
fundamental group arose from Riemann’s study of the monodromy representa-
tion for hypergeometric differential equations, a topic we shall briefly discuss at
the end of the chapter. Our exposition therefore traces history backwards, but
hopefully reflects the intimate connection between differential equations and the
fundamental group.
2.1 Covers
We start with the basic definitions.
37
38 Chapter 2. Fundamental Groups in Topology
In the literature the terms ‘covering space’ and ‘covering’ are also used for what
we call a cover; we shall stick to the above terminology. Note an easy consequence
of the definition: if p : Y → X is a cover, the map p is always surjective.
Example 2.1.2 Take a nonempty discrete topological space I and form the topo-
logical product X × I. The first projection X × I → X turns X × I into a space
over X which is immediately seen to be a cover. It is called the trivial cover.
Trivial covers may at first seem very special but as the next proposition shows,
every cover is locally a trivial cover.
Proof: The ‘if’ part follows from the previous example and ` the ‘only if’ part
is easily seen as follows: given a decomposition p−1 (V ) ∼
= Ui for some index
i∈I
In the notation of the previous proof, the set I is the fibre of p over the points
of V . The proof shows that the points of X over which the fibre of p equals I
form an open subset of X. Thus making I vary yields a decomposition of X into
a disjoint union of open subsets. In particular:
Notice that this does not mean at all that the cover is trivial. Indeed, let us
give an example of a non-trivial cover with a connected base.
Definition 2.1.6 Let G be a group acting continuously from the left on a topo-
logical space Y . The action of G is even if each point y ∈ Y has some open
neighborhood U such that the open sets gU are pairwise disjoint for all g ∈ G.
This terminology is that of Fulton [26]. Older texts use the much more awkward
term ‘properly discontinuous’. Now recall that if a group G acts from the left on
a topological space Y , one may form the quotient space G\Y whose underlying
set is by definition the set of orbits under the action of G and the topology is the
finest one that makes the projection Y → G\Y continuous.
Proof: The map pG is surjective and moreover each x ∈ G\Y has an open
neighbourhood of the form V = pG (U ) with a U as in Definition 2.1.6. This V is
readily seen to satisfy the condition of Definition 2.1.1.
Example 2.1.8 With this tool at hand, one can give lots of examples of covers.
3. For an integer n > 1 denote by µn the group of n-th roots of unity. Multi-
plying by elements of µn defines an even action on C∗ := C \ {0}, whence a
cover pn : C∗ → C∗ /µn . In fact, the map z 7→ z n defines a natural home-
omorphism of C∗ /µn onto C∗ (even an isomorphism of topological groups)
and via this homeomorphism pn becomes identified with the cover C∗ → C∗
given by z 7→ z n . Note that this map does not extend to a cover C → C;
this phenomenon will be studied further in Chapter 3.
Conversely, we have:
Proof: Notice first that we may naturally view G as a subgroup of Aut(Y |(G\Y )).
Now given an element φ in the latter group, look at its action on an arbitrary point
y ∈ Y . Since the fibres of pG are precisely the orbits of G we may find g ∈ G with
φ(y) = gy. Applying Lemma 2.2.1 to the automorphism φ ◦ g −1 we get g = φ.
Remark 2.2.6 Note the similarity of the above definition with that of a Galois
extension of fields. This analogy is further confirmed by remarking that the cover
pG in Proposition 2.2.4 is Galois.
Proof: Indeed, the underlying set of Aut(Y |X)\Y is by definition the set of
orbits of Y under the action of Aut(Y |X) and so the map p is one-to-one precisely
when each such orbit is equal to a whole fibre of p, i.e. when Aut(Y |X) acts
transitively on each fibre.
Remark 2.3.1 The above convention for composition of paths is the one used
by Deligne in his fundamental works [12] and [13]. It differs from the convention
of many textbooks: most authors define the composition by first going through f
and then through g. However, several reasons speak for our convention. One is
that it is parallel to the usual convention for composition of functions. For another
particularly pregnant one, see Remark 2.6.3 below.
Proof: For the first statement, note first that uniqueness follows from Propo-
sition 2.2.2 applied with X, Y and Z replaced by our current X, [0, 1] and Y .
Existence is immediate in the case of a trivial cover. To reduce the general case
to this, for each x ∈ f ([0, 1]) choose some open neighbourhood Vx satisfying the
condition in the definition of a cover. The sets f −1 (Vx ) form an open covering of
the interval [0, 1] from which we may extract a finite subcovering since the interval
is compact. We may then choose a subdivision 0 = t0 ≤ t1 ≤ · · · ≤ tn = 1 of [0, 1]
such that each closed interval [ti , ti+1 ] is contained in some f −1 (Vx ), hence the
cover is trivial over each f ([ti , ti+1 ]). We can now construct fe inductively: given
a lifting fei of the path f restricted to [t0 , ti ] (the case i = 0 being trivial), we may
construct a lifting of the restriction of f to [ti , ti+1 ] starting from fei (ti ); piecing
this together with fi gives fi+1 .
For statement (2) we first show that given a homotopy h : [0, 1] × [0, 1] → X
with h(0, t) = f (t) and h(1, t) = g(t), there is a lifting e h : [0, 1] × [0, 1] → Y with
e e e e
p◦ h = h, h(0, t) = f (t) and h(1, t) = ge(t). The construction is similar to that for f :
first choose a sufficiently fine subdivision of [0, 1] × [0, 1] into small subsquares Sij
so that over each h(Sij ) the cover is trivial. That this may be done is assured by
a well-known fact from the topology of compact metric spaces called Lebesgue’s
lemma (see e.g. Munkres [66], Lemma 27.5; we have used a trivial case of it
above). Then proceed by piecing together liftings over each subsquare, moving
“serpent-wise” from the point (0, 0) (for which we put e h(0, 0) = y) towards the
point (1, 1). Note that by uniqueness of path lifting it is sufficient to find a local
lifting which coincides with the previous one at the left corner of the side where
two squares meet; they will then coincide over the whole of the common side.
Again by uniqueness of path lifting we get successively that the path t 7→ e h(0, t) is
fe (since both are liftings of f starting from y), the path s 7→ e h(s, 0) is the constant
path [0, 1] → {y} and that t 7→ e h(1, t) is none but ge. Finally, s 7→ e h(s, 1) is a path
e
joining f (1) and ge(1) which lifts the constant path [0, 1] → {f (1)}; by uniqueness
it must coincide with the constant path [0, 1] → {fe(1)}, whence fe(1) = ge(1).
We can now construct the promised left action of π1 (X, x) on the fibre p−1 (x).
Construction 2.3.3 Given y ∈ p−1 (x) and α ∈ π1 (X, x) represented by a path
f : [0, 1] → X with f (0) = f (1) = x, we define αy := fe(1), where fe is the unique
lifting fe to Y with fe(0) = y given by part (1) of the lemma above. By part (2)
of the lemma αy does not depend on the choice of f , and it lies in p−1 (x) by
construction. This is indeed a left action of π1 (X, x) on p−1 (x): (α • β)y = α(βy)
for α, β ∈ π1 (X, x). It is called the monodromy action on the fibre p−1 (x).
46 Chapter 2. Fundamental Groups in Topology
Here local simply connectedness means that each point has a basis of simply
connected open neighbourhoods.
The proof of this classification result relies on two crucial facts. The first one
uses the notion of representable functors introduced in Definition 1.4.11.
Theorem 2.3.5 For a connected and locally simply connected topological space X
ex → X.
and a base point x ∈ X the functor Fibx is representable by a cover X
The cover X ex depends on the choice of the base point x, and comes equipped
with a canonical point in the fibre π −1 (x) called the universal element. Let us spell
this out in detail. By definition, cover maps from π : X ex → X to a fixed cover
p : Y → X correspond bijectively (and in a functorial way) to points of the fibre
ex itself is a cover of X via π, we have a canonical
p−1 (x) ⊂ Y . In particular, since X
e ∼
isomorphism Fibx (Xx ) = HomX (X ex , X
ex ), where HomX denotes the set of maps
of spaces over X. Via this isomorphism the identity map of X ex corresponds to a
canonical element x −1
e in the fibre π (x); this is the universal element. Now for an
arbitrary cover p : Y → X and element y ∈ π −1 (x) the cover map πy : X ex → Y
corresponding to y via the isomorphism Fibx (Y ) ∼ = HomX (X ex , Y ) maps x
e to y by
commutativity of the diagram
∼
=
ex , X
HomX (X ex ) −−−→ Fibx (X
ex )
y y
∼
=
ex , Y ) −−−→ Fibx (Y )
HomX (X
set HomX (Xex , Y ) onto itself for each cover Y . In this way we obtain a right action
on HomX (X ex , Y ) ∼
= Fibx (Y ) from the left action of Aut(X ex |X) on X
ex . We would
like to compare it with the monodromy action, which is a left action. To this end
we introduce the following notion:
Definition 2.3.6 For a group G the opposite group Gop is the group with the
same underlying set as G but with multiplication defined by (x, y) 7→ yx.
Note that (Gop )op = G, and moreover G is canonically isomorphic to Gop via
ex |X) on X
the map g 7→ g −1 . This being said, the above right action of Aut(X ex
ex |X) .
becomes a left action of Aut(X op
We postpone the proof of the above two theorems to the next section, and
prove Theorem 2.3.4 assuming their validity.
Proof of Theorem 2.3.4: The proof is strictly parallel to that of Theorem
1.5.4: we check that the functor satisfies the conditions of Lemma 1.4.9. For full
faithfulness we have to show that given two covers p : Y → X and q : Z → X,
each map φ : Fibx (Y ) → Fibx (Z) of π1 (X, x)-sets comes from a unique map
Y → Z of covers of X. For this we may assume Y , Z are connected and consider
the map πy : X ex → Y corresponding to a fixed y ∈ Fibx (Y ). By Theorem 2.2.10
the map πy realizes Y as the quotient of X ex by the stabilizer Uy = Aut(X
ex |Y ) of
∼ ex be the inverse map. Since Uy injects into the stabilizer of
y; let ψy : Y → Uy \X
φ(y) via φ, the natural map πz : X ex → Z corresponding to φ(y) induces a map
e
Uy \Xx → Z by passing to the quotient; composing it with ψy gives the required
map Y → Z. For essential surjectivity we have to show that each left π1 (X, x)-set
S is isomorphic to the fibre of some cover of X. For S transitive we may take
the quotient of Xex by the action of the stabilizer of some point; in the general
case we decompose S into its π1 (X, x)-orbits and take the disjoint union of the
corresponding covers.
Remark 2.3.8 If we compare the above theorems with Theorem 1.5.4, we see that
the cover X ex plays the role of a separable closure ks ; the choice of x corresponds
to the choice of the separable closure. The fundamental group is the counter-
part of the absolute Galois group. The functor inducing the equivalence is A 7→
Hom(A, ks ) in the case of fields (it is contravariant), and Y 7→ HomX (X ex , Y ) ∼
=
Fibx (Y ) in the topological case.
48 Chapter 2. Fundamental Groups in Topology
We now state a corollary of Theorem 2.3.4 that is even closer to Theorem 1.5.4
in its formulation and will be invoked in subsequent chapters. First a definition:
call a cover Y → X finite if it has finite fibres; for connected X these have the
same cardinality, called the degree of X.
Corollary 2.3.9 For X and x as in the Theorem 2.3.4, the functor Fibx induces
an equivalence of the category of finite covers of X with the category of finite
continuous left π1\(X, x)-sets. Connected covers correspond to finite π1\
(X, x)-sets
with transitive action and Galois covers to coset spaces of open normal subgroups.
Here π1\
(X, x) denotes the profinite completion of π1 (X, x) (Example 1.3.4 (2)).
We need a well-known lemma from group theory.
Proof: Consider the natural left representation ρH of G on the left coset space
of H, and take N := ker(ρH ). It is of finite index in G as [G : H] is finite, and it
is contained in H as it fixes H considered as a coset.
by composing the homotopy class of f with the homotopy class of some path
g : [0, 1] → X with g(0) = y and g([0, 1]) ⊂ U . Notice that since U is assumed
to be simply connected, two such g having the same endpoints have the same
homotopy class. Thus in more picturesque terms, U e is obtained by “continuing
homotopy classes of paths arriving at y to other points of U ”. This indeed gives a
basis of open neighbourhoods of ye, for given two neighbourhoods U eye and Veye, their
eye ∩ Veye contains W
intersection U fye with some simply connected neighbourhood W of
y contained in U ∩ V ; one also sees immediately that π is continuous with respect
to this topology. The inverse image by π of a simply connected neighbourhood of
a point y will be the disjoint union of the open sets Ueye for all inverse images ye of y,
so we have obtained a cover of X. Finally note that there is a “universal element”
e of the fibre π −1 (x) corresponding to the homotopy class of the constant path.
x
ex is connected.
Lemma 2.4.2 The space X
Next we prove:
50 Chapter 2. Fundamental Groups in Topology
ex → X is Galois.
Proposition 2.4.3 The cover π : X
For the proof we need some auxiliary statements.
Lemma 2.4.4 A cover of a simply connected and locally path-connected space is
trivial.
Here ‘locally path-connected’ means that each point has a basis of path-
connected open neighbourhoods.
Proof of Theorem 2.3.7: Everything was proven above except for the last
statement concerning the monodromy action. By Theorem 2.3.5 each point y of
the fibre corresponds to a morphism of covers πy : X ex → Y and the proof shows
that πy maps points of X ex represented by paths f to points of the form fe(1) where
fe : [0, 1] → Y is the lifting of f with fe(0) = y; in particular y, being the image
of the class of the constant path c : [0, 1] → {x}, corresponds to the constant
path [0, 1] → {y}. By the proof of the previous lemma an element of π1 (X, x)
represented by a path fα acts on c by mapping it to the class of fα . Hence the
action of α on p−1 (x) maps y to feα (1), where feα : [0, 1] → Y is the canonical
lifting of fα starting from y. This is the monodromy action.
We now examine the dependence of the fundamental group on the choice of the
base point. Assume given a path-connected and locally simply connected space X
ey → X
and two base points x, y ∈ X. Pick a path f from x to y. There is a map X ex
induced on homotopy classes by the map g 7→ g • f (here g is a path starting from
ey ). It only depends on the homotopy class of f and is
y representing a point of X
an isomorphism of spaces over X, the inverse coming from composition with f −1 .
ey → X
Proof: An isomorphism λ : X ex takes the distinguished element ye ∈ X
ey
ex . It must also lie above y, and hence is
y) ∈ X
of the fibre over y to an element λ(e
the homotopy class of a path from x to y. The reader will check that it induces
the isomorphism λ in the manner described above.
∼
Proposition 2.4.6 it corresponds to an isomorphism λop : π1 (X, y) → π1 (X, x). If
λ is induced by the homotopy class of a path f in the above construction, then λop
corresponds at the level of homotopy classes of paths to the map g 7→ f −1 • g • f .
We thus recover the familiar dependence of the fundamental group on the base
point. Changing the path f to f1 changes λop to its composite with the inner
automorphism of π1 (X, x) induced by the class of f1−1 • f , so λop is uniquely
determined up to an inner automorphism of π1 (X, x).
Most textbooks neglect the role of the base point explained above and call any
cover isomorphic to some X ex a universal cover of X. The next proposition shows
that one can easily detect universal covers in practice.
cover of X, then X e′ ∼ ex ∼
= π1 (X ′ , x′ )\X =Xex with some point x′ ∈ X ′ , by Lemma
2.4.10 and Theorem 2.3.7.
Example 2.4.11 Since Rn is simply connected for any n, we see that the first
two examples in 2.1.8 actually give universal covers of the circle and of linear
tori, respectively. On the other hand, the third example there does not give a
universal cover since C∗ is not simply connected. However, the complex plane C,
being a 2-dimensional R-vector space, is simply connected and the exponential
map C → C∗ , z 7→ exp(z) is readily seen to be a cover. Hence C is the universal
cover of C∗ .
2.4 The Universal Cover 53
Example 2.4.12 Let us spell out in more detail an example similar to that of
C∗ ; it will serve in the next chapter. Let Ḋ be the punctured complex disc
{z ∈ C : z 6= 0, |z| < 1}. As in the previous example, the exponential map z 7→
exp(z) restricted to the left half plane L = {z ∈ C : Re z < 0} furnishes a uni-
versal cover of Ḋ. The automorphism groups of fibres are isomorphic to Z, the
action of n ∈ Z being given via translation by 2nπi. Thus if we let Z act on L
via translation by multiplies of 2πi, the disc Ḋ becomes the quotient of L by this
action. But then by Theorem 2.2.10 each connected cover of Ḋ is isomorphic to a
quotient of L by a subgroup of Z. These subgroups are 0 and the subgroups kZ
for integers k ≥ 1; the corresponding covers of Ḋ are L and Ḋ itself via the map
z 7→ z k . If we choose a base point x ∈ Ḋ, we obtain an isomorphism π1 (Ḋ, x) ∼
= Z.
A path representative of a generator is given by a circle around 0 going through
x, oriented counterclockwise.
The previous considerations showed that one way to eliminate the role of the
base point is to work up to non-canonical isomorphism. A better way is to consider
all possible base points at the same time, as we explain next.
We first need to recall the notion of the fibre product Y ×X Z of two spaces
p : Y → X and q : Z → X over X. By definition it is the subspace of Y × Z
consisting of points (y, z) satisfying p(y) = q(z). It is equipped with natural
projections qY : Y ×X Z → Y and pZ : Y ×X Z → Z making the diagram
Y q
Y ×X Z −−−→ Y
pZ y
p
y
q
Z −−−→ X
commute. In fact, it satisfies a universal property: it represents the set-valued
functor on the category of spaces over X that maps a space S → X to the set of
pairs of morphisms (φ : S → Y, ψ : S → Z) over X satisfying p ◦ φ = q ◦ ψ. In case
p : Y → X is a cover, then so is pZ : Y ×X Z → Z and moreover the fibre p−1 Z (z)
−1
over z ∈ Z is canonically isomorphic to p (q(z)). Indeed, using Proposition 2.1.3
it suffices to check these properties in the case when Y → X is a trivial cover,
which is straightforward. The cover pZ : Y ×X Z → Z is called the pullback of
p : Y → X along q and is also denoted by q ∗ Y → Z. Now we come to:
Remarks 2.4.14
1. Fix a pair of points (x, y) ∈ X × X. Pulling back the cover X e → X × X via
the inclusion map {(x, y)} → X × X we get back the space X ex,y viewed as
a cover of the one point space {(x, y)}. Notice that it carries a natural right
action Xex,y × π1 (X, y) → X ex,y by the fundamental group π1 (X, y) coming
from composition of paths. This action is in fact simply transitive: given
two paths f, g : [0, 1] → X with f (0) = g(0) = x and f (1) = g(1) = y, we
may map f to g by composing with the closed path g • f −1 around y. A
topological space equipped with a continuous simply transitive action of a
topological group G is sometimes called a G-torsor. In our case we thus
obtain discrete π1 (X, y)-torsors Xex,y and we may view X ex as a continuous
family of π1 (X, y)-torsors. The space X e is then a continuous 2-parameter
family of torsors.
2. An interesting cover of X is given by the pullback of X e → X × X via the
diagonal map ∆ : X → X × X sending x ∈ X to (x, x) ∈ X × X. The
fibre of the resulting cover Xe∆ over a point x ∈ X is precisely π1 (X, x); in
particular, it carries a group structure. Thus Xe∆ encodes the fundamental
groups of X with respect to varying base points. For some of its properties
see Exercise 6.
Remarks 2.5.2
V Φ
F(V ) −−−→ G(V )
ρF
G
UV y yρU V
UΦ
F(U ) −−−→ G(U )
commutes.
Example 2.5.3 The basic example to bear in mind is that of continuous real-
valued functions defined locally on open subsets of X; in this case, the maps ρU V
are given by restriction of functions to some open subset.
The category of sheaves of sets (or abelian groups, etc.) on a space X is defined
as the full subcategory of the corresponding presheaf category. This means that a
morphism of sheaves is just a morphism of the underlying presheaves.
Examples 2.5.5
1. If D is a connected open subset of C, we define the sheaf of holomorphic
functions on D to be the sheaf of rings O whose sections over some open
subset U ⊂ D are the complex functions holomorphic on U . This construc-
tion carries over to any complex manifold. One defines the sheaf of analytic
functions on some real analytic manifold, or the sheaf of C ∞ functions on a
C ∞ manifold in a similar way.
2. Let S be a topological space (or abelian group, etc.) and X another topolog-
ical space. Define a sheaf FS on X by taking FS (U ) to be the set (abelian
group, etc.) of continuous functions U → S for all nonempty open U ⊂ X.
As in the case of real-valued functions, this is a sheaf.
In fact, as we shall instantly show, these are very familiar objects. Assume
henceforth that all spaces are locally connected. First a definition that will ulti-
mately explain the use of the terminology ‘section’ for sheaves.
2.5 Locally Constant Sheaves and Their Classification 57
We call the sheaf FY the sheaf of local sections of the space p : Y → X over X.
Proof: The sheaf axioms follow from the fact that the sections over U are con-
tinuous functions U → Y and hence satisfy the patching properties. Now assume
p : Y → X is a cover. Given a point x ∈ X, take a connected open neighbourhood
V of x over which the cover is trivial, i.e. isomorphic to V × F where F is the
fibre over x. The image of a section V → Y is a connected open subset mapped
isomorphically onto V by p, hence it must be one of the connected components of
p−1 (V ). Thus sections over V correspond bijectively to points of the fibre F and
the restriction of FY to V is isomorphic to the constant sheaf defined by F . We
get the constant sheaf if and only if we may take V to be the whole connected
component of X containing x in the above argument.
Thus for instance Example 2.1.5 which shows a simple example of a nontriv-
ial cover also gives, via the preceding proposition, a simple example of a locally
constant but non-constant sheaf.
Given a morphism φ : Y → Z of covers of X, it induces a natural morphism
FY → FZ of locally constant sheaves by mapping a local section s : U → Y of
p : Y → X to φ ◦ s. To see that we indeed obtain a local section of q : Z → X over
U in this way, it is enough to show by the sheaf axioms that there is a covering of
U by open subsets Ui such that (φ ◦ s)|Ui is a local section over Ui for all i. But
this holds if we choose each Ui to be so small that both covers are trivial over Ui .
Thus the rule Y 7→ FY is a functor.
Theorem 2.5.9 The above functor induces an equivalence between the category
of covers of X and that of locally constant sheaves on X.
For the proof of the theorem we construct a functor in the reverse direction;
the construction will in fact work for an arbitrary presheaf on X. We first need a
definition:
58 Chapter 2. Fundamental Groups in Topology
In case F is a sheaf of abelian groups (or rings, etc.), the stalk Fx carries a
natural structure of abelian group (ring, etc.) For example, if F is the sheaf of
continuous real-valued functions on X, then Fx is the ring of ‘germs of continuous
functions’ at x.
Remark 2.5.11 The above definition is a special case of a more general construc-
tion. Namely, a (filtered) direct system of sets (Sα , φαβ ) consists of sets Sα indexed
by a directed partially ordered set Λ together with maps φαβ : Sα → Sβ for all
α ≤ β. The direct limit of the system is defined as the quotient of the disjoint
union of the Sα modulo the equivalence relation in which sα ∈ Sα and sβ ∈ Sβ are
equivalent if φαγ (sα ) = φβγ (sβ ) for some γ ≥ α, β. In our case Λ consists of the
open neighbourhoods of x with the partial order in which U ≤ V means V ⊂ U .
This partially ordered set is directed since for any U , V we have U, V ≤ U ∩V . The
sets F(U ) form a direct system indexed by Λ whose direct limit is Fx . Again, one
can define direct limits of systems of abelian groups (rings, etc.) in an analogous
manner.
Now that we have proven the theorem, we may combine it with Theorem 2.2.10
to obtain:
With this definition we can state the following corollary of Theorem 2.5.15:
Remark 2.6.3 This is a point where the reader may appreciate the advantage
of the convention we have chosen for the multiplication rule in π1 (X, x) in the
previous chapter. With our convention, the monodromy representation is indeed
a homomorphism for the usual multiplication rule of matrices in GL(n, C). Had
we defined the composition of paths ‘the other way round’, we would be forced to
use here an unorthodox matrix multiplication of columns by rows instead of rows
by columns.
The following example shows where to find local systems ‘in nature’. It uses
the straightforward notion of a subsheaf of a sheaf F: it is a sheaf whose sections
over each open set U form a subset (subgroup, subspace etc.) of F(U ).
of local solutions of (2.1) over U are again local solutions, they form a C-vector
space S(U ). Moreover, by a classical theorem due to Cauchy (see [23], Theorem
11.2 or any basic text on differential equations) each point of D has an open
neighbourhood U where the C-vector space S(U ) has a finite basis x1 , . . . , xn .
The second sheaf axiom then shows that the local solutions of the equation (2.1)
form a subsheaf S ⊂ On ; this is a sheaf of complex vector spaces. Since the
restrictions of the above xi to smaller open sets still form a basis for the solutions,
we conclude that the sheaf S is a complex local system of dimension n.
A similar observation can be made when one considers solutions of a system
of n linear differential equations in n variables, given in matrix form by y ′ = Ay,
where A is an n × n matrix of holomorphic functions and y = (y1 , . . . , yn ) is an n-
tuple of variables. In fact, by a classical trick the solutions of (2.1) are the same is
the solutions of the linear system in the n variables y1 = y, y2 = y ′ , . . . , yn = y (n−1)
given by yi′ = yi+1 for 1 ≤ i ≤ n − 1 and yn′ = −a1 yn − · · · − an y1 .
Remark 2.6.5 According to Corollary 2.6.2, the local system S of the above ex-
ample is uniquely determined by an n-dimensional left representation of π1 (X, x),
where x is a point of D (notice that D is locally simply connected). Let us describe
this representation explicitly. Take a closed path f : [0, 1] → D representing an el-
ement γ ∈ π1 (X, x) and take an element s ∈ Sx which is, in classical terminology, a
germ of a (vector-valued) holomorphic function satisfying the equation (2.1). Now
s is naturally a point of the fibre over x of the cover pS : DS → D associated with
S by Theorem 2.5.9. By definition of the monodromy action, the element γ acts
on s as follows: s is mapped to the element fe(1) of the fibre p−1 S (x) = Sx , where
e
f is the unique lifting of f to DS . By looking at the construction of the unique
lifting in the proof of Lemma 2.3.2, one can make this even more explicit as follows.
There exist open subsets U1 , . . . , Uk of D such that the f −1 (U1 ), . . . , f −1 (Uk ) give
an open covering of [0, 1] and S is constant over each Ui . There are moreover
sections si ∈ S(Ui ) such that the restrictions of si and si+1 to Ui ∩ Ui+1 coincide
for all 1 ≤ i ≤ k − 1, and such that s1 (resp. sk ) maps to s (resp. γs) in Sx .
Classically this is expressed by saying that γs is the analytic continuation of the
holomorphic function germ s along the path f representing γ.
Notice that the existence and the uniqueness of γs are guaranteed by the fact
that S is a locally constant sheaf and hence DS → D is a cover. Had we worked
with the bigger sheaf On instead of S, the analytic continuation of an arbitrary
germ may not have been possible.
Example 2.6.6 Let us work out the simplest nontrivial case of the above theory
in detail. Take as D an open disc in the complex plane centered around 0, of
radius 1 < R ≤ ∞, with the point 0 removed. We choose 1 as base point for the
fundamental group of D. We study the local system of solutions of the first order
differental equation
y′ = f y (2.2)
62 Chapter 2. Fundamental Groups in Topology
m = exp (F− (1))(exp (F+ (1)))−1 = exp (F− (1) − F− (−1) + F+ (−1) − F+ (1))
Z
= exp ( f ) = exp (2πi Res 0 f )
γ
by the Residue Theorem (see e.g. Rudin [80], Theorem 10.42), where Res (f )
denotes the residue of f at 0. So we have expressed m in terms of the function f
occurring in the equation (2.2).
Remark 2.6.7 One sees from the above example that for any one-dimensional
monodromy representation on the punctured open disc D we may find a differential
equation of type (2.2) whose local system has the given monodromy; if m is the
image of γ, one may take, for example, y ′ (z) = µz −1 y(z) with µ ∈ C satisfying
exp (2πiµ) = m. This equation has the additional virtue that the coefficient µz −1
has only a simple pole at 0.
We can generalize this to higher dimension. An n-dimensional representation of
the punctured open disc D of radius R is again determined by the image of γ which
is a matrix M ∈ GLn (C). To find a system of n differential equations y ′ = Ay
with the above monodromy representation and with coefficients holomorphic on D
2.7 The Riemann–Hilbert Correspondence 63
except for a simple pole at the origin, we may take A = Bz −1 , where B ∈ GLn (C)
is a matrix with exp (2πiB) = M . Here exp (2πiB) is defined by the absolutely
convergent power series
∞
X (2πiB)i
exp (2πiB) := .
i!
i=0
Given M ∈ GLn (C), one constructs B with exp (2πiB) = M using the Jordan
decomposition of M (see [23], 11.8 for details).
Remark 2.7.1 Locally free sheaves on D are related to holomorphic vector bun-
dles on D. The latter are defined as complex manifolds E equipped with a holomor-
phic surjection p : E → D (see Remark 3.1.2 below for these concepts) such that
p−1 (z) has the structure of a complex vector space for each z ∈ D, and moreover
each z ∈ D has an open neighbourhood V ⊂ D for which there is an isomorphism
p−1 (V ) ∼
= V × Cn of complex manifolds over V inducing vector space isomor-
phisms on the fibres. Given a holomorphic vector bundle p : E → D, its sheaf of
holomorphic sections HE is given over U ⊂ D by the holomorphic maps s : U → E
satisfying p ◦ s = idU . The restriction of HE to an open subset V ⊂ D where
p−1 (V ) ∼
= V × Cn is isomorphic to On |V , therefore HE is a locally free sheaf. This
64 Chapter 2. Fundamental Groups in Topology
Here the tensor product E ⊗O Ω1D is defined in the obvious way, by the rule
U 7→ E(U ) ⊗O(U ) Ω1D (U ). Holomorphic connections form a category: a morphism
(E, ∇) → (E ′ , ∇′ ) is a morphism of O-modules φ : E → E ′ making the diagram
∇
E −−−→ E ⊗O Ω1D
φy
φ⊗id
y
∇′
E ′ −−−→ E ′ ⊗O Ω1D
commute.
By the remarks preceding Definition 2.7.2, we can actually identify each entry ωij
of Ω with a 1-form of the shape fij dz, so ‘division of Ω by dz’ yields a matrix [fij ]
with entries in O(D). Hence setting f = (f1 , . . . , fn ) and A = −[fij ] we see that
∇(f ) = 0 if and only if f satisfies the system of differential equations y ′ = Ay.
Lemma 2.7.4 The sheaf E ∇ is a local system of dimension equal to the rank of E.
We can now prove a proposition that belongs to the family of statements that
usually go by the name of ‘Riemann–Hilbert correspondence’ in the literature.
(or integrable). This means the following. Write Ω1X for the sheaf of holomorphic
1-forms on the complex manifold X, and Ω2X for its second exterior power. For
each connection (E, ∇) one may define a map ∇1 : E ⊗O Ω1X → E ⊗O Ω2X by setting
∇1 (s ⊗ ω) = ∇(s) ∧ ω + s ⊗ dω. Here d : Ω1X → Ω2X is the usual differential given
by d(f dzi ) = df ∧ dzi , where dz1 , . . . , dzn are free generators of Ω1X in the neigh-
bourhood of a point. One then says that the connection is flat if the composite
map ∇1 ◦∇ is trivial. The Riemann–Hilbert correspondence for complex manifolds
asserts that the category of holomorphic flat connections on a complex manifold
X is equivalent to that of complex local systems on X. For a proof, see e.g. [110],
Proposition 9.11.
Combined with Corollary 2.6.2, Proposition 2.7.5 implies that every finite di-
mensional representation of π1 (D, x) for some base point x is the monodromy rep-
resentation of some linear system of holomorphic differential equations. Assume
now that D is a complement of finitely many points in C. Then one may consider
a more subtle question, namely realizing representations of the fundamental group
as monodromy representations of linear systems satisfying additional restrictions
on the local behaviour of the coefficients around the missing points.
Traditionally, this problem is studied over the projective line P1 (C). The
precise definition of P1 (C) as a Riemann surface will be given in Example 3.1.3
(2), but it can be easily introduced in an ad hoc way: it is just the complex plane
with a point ∞ added at infinity. A system of complex open neighbourhoods
of ∞ is given by the complements of closed discs around 0, and a function f is
holomorphic around ∞ if z 7→ f (z −1 ) is holomorphic around 0. Thus we may
extend the notion of sheaves of O-modules to P1 (C). One difference should be
noted, however: whereas over a domain D ⊂ C all locally free sheaves are free,
over P1 (C) this is not so any more.
Consider a finite set S = {x1 , . . . , xm } of points of P1 (C), and set henceforth
D := P1 (C) \ S. We may safely assume xm = ∞. Define an extension Ω1 (S) of
Ω1D to an O-module on P1 (C) as follows: the sections of Ω1 (S) are P1 (C)-valued
functions that restrict to sections of Ω1D over open subsets contained in D, and in
a neighbourhood of a xi can be written in the form f dz with f having at most
a simple pole at xi (for xi = ∞ one of course requires a representation of the
form f d(z −1 )). The sheaf Ω1 (S) is called of the sheaf of 1-forms with logarithmic
poles along S. The name is explained by writing f = g(z − xi )−1 with a function
g holomorphic around xi : we then obtain f dz = g (z − xi )−1 d(z − xi ), and we
recognize the logarithmic derivative of z − xi on the right hand side.
A connection with logarithmic poles along S is a pair (E, ∇), where E is a locally
free sheaf on P1 (C), and ∇ : E → E ⊗O Ω1 (S) is a C-linear map satisfying (2.3).
As in Example 2.7.3, in the case when E is free a connection with logarithmic poles
corresponds to a linear system y ′ = Ay of differential equations where the entries
of the matrix A are holomorphic functions on D that have at worst a simple pole
at the xi . Such linear systems are called Fuchsian.
2.7 The Riemann–Hilbert Correspondence 67
The proof uses a general construction for patching sheaves that will also serve
later.
F(U ) := {(si )i∈I : si ∈ Fi (U ∩ Ui ) and θij (si |U ∩Ui ∩Uj ) = sj |U ∩Ui ∩Uj for all i 6= j}.
The F(U ) together with the obvious restriction maps form a presheaf F, and the
sheaf axioms for the Fi imply that F is in fact a sheaf. Its restrictions over the
Ui yield the Fi by construction. The verification of the isomorphism statement is
left to the readers.
One says that F is obtained by patching (or gluing) the Fi together. Of course
the lemma also holds for sheaves with additional structure (sheaves of groups,
rings, etc.) In particular, patching locally free sheaves on a complex domain yields
a locally free sheaf. Also, patching locally constant sheaves on a topological space
results in a locally constant sheaf.
Proof of Proposition 2.7.7: Take small open discs Di ⊂ P1 (C) around each
xi that do not meet, and write n for the rank of the connection (E, ∇). We
know from Remark 2.6.7 that for each i there exists an n × n system y ′ = Ai y
of linear differential equations having a simple pole at xi and holomorphic else-
where. Write (Ei , ∇i ) for the connection given by Ei = O|nDi and ∇i ((f1 , . . . , fn )) =
(df1 , . . . , dfr )+Ai (f1 , . . . , fn ). We can cover each open set D∩Di = Di \{xi } by two
simply connected open subsets Ui+ and Ui− as in Example 2.6.6. Over each Ui+
and Ui− the locally constant sheaves E ∇ and Ei∇i are both trivial of dimension n by
Lemma 2.4.4. Consequently, we may patch the locally free sheaves E ∼ = E ∇ ⊗C O
and Ei ∼ ∇
= Ei ⊗C O together using the above construction. The restrictions of E
i
and Ei to the Ui+ and Ui− are both equipped with the trivial connection map since
they correspond to trivial local systems, so the connections also patch.
Proof: Combine the proposition with Proposition 2.7.5 and Corollary 2.6.2.
Remarks 2.7.10
1. In his fundamental work [12], Deligne proved a vast generalization of the
above results. He considered a complex algebraic variety U , and introduced
the notion of regular connections on the associated analytic space U an . In
the case when U is a Zariski open subset in a smooth projective variety X
whose complement S is a divisor with normal crossings (which means that
its irreducible components are smooth of codimension 1 and meet trans-
versely), a regular connection is a holomorphic flat connection on U an that
extends to a meromorphic connection on X with logarithmic poles along
S. The extension is then actually an algebraic connection, which in our
very special case corresponds to the fact that the functions occurring in the
construction of our connections are meromorphic over P1 (C) and hence are
rational functions. Deligne showed that all holomorphic flat connections on
U an are regular, and consequently every finite dimensional representation of
the fundamental group of U an can be constructed in a purely algebraic way.
For a nice introduction to these ideas, see [40].
2. Corollary 2.7.9 does not say that every representation ρ : π1 (D, x) → GLn (C)
is the monodromy representation of a Fuchsian system of linear differential
equations, because the sheaf E we constructed above is not necessarily free.
The question whether every such ρ is the monodromy representation of a
Fuchsian system is usually called the Riemann–Hilbert problem in the lit-
erature, though it is due neither to Riemann nor to Hilbert. However, the
21st problem on Hilbert’s famous list, which he did not formulate completely
rigorously, may be interpreted in this way.
The fundamental group of D = P1 (C) \ S is known to have a presentation
of the form hγ1 , . . . , γm | γ1 · · · γn = 1i, where γi is the class of a closed
path going through z that turns around xi . Thus the representation ρ is
determined by the image of the γi in GLn (C), i.e. by a system of m invertible
matrices M1 , . . . , Mm ∈ GLn (C) satisfying M1 · · · Mm = 1.
It was believed for a long time that Pljemelj gave a positive answer to the
question as early as 1908. However, the last step of his argument contains
a gap, and only works under the additional assumption that one of the ma-
trices Mi is diagonalizable; see [37], Theorem 18.6 for a proof of Pljemelj’s
result. At the end of the 1980’s Bolibrukh came up with a series of coun-
terexamples showing that the answer can be negative in general, already for
n = 3. For expositions of the problem and of Bolibrukh’s counterexamples,
see the Bourbaki seminar by Beauville [4] as well as the book of Ilyashenko
and Yakovenko [37].
One way to eliminate the mistake from Pljemelj’s proof is to allow apparent
singularities for the system y ′ = Ay. This means that the entries of A may
2.7 The Riemann–Hilbert Correspondence 69
have poles outside the xi , but the associated monodromy matrix should be
the identity. Pljemelj’s theorem then immediately yields a positive answer:
one simply adds one more xi with the identity as monodromy matrix and
applies the theorem. Thus if Hilbert meant that apparent singularities are
allowed, then Pljemelj already solved the problem.
y ′ = Bz −1 y,
(1 + a + b)z − c ′ ab
y ′′ + y + y=0
z(z − 1) z(z − 1)
studied previously by Euler, Gauss, Kummer and others; the general equation can
be reduced to this form by suitable transformations.
Riemann computed (and also defined!) the monodromy representation associ-
ated with such equations. The results are quite complicated. For instance, under
the additional assumption that none of the differences α − α′ , β − β ′ , γ − γ ′ are
integers, one obtains for the monodromy representation ρ
" #
exp (2πiα) 0
ρ(γ0 ) = ,
0 exp (2πiα′ )
τ exp (2πiβ)−exp (2πiβ ′ ) exp (2πiβ)−exp (2πiβ ′ )
τ −1 τ −1
ρ(γ1 ) = ,
τ exp (2πiβ ′ )−τ exp (2πiβ) τ exp (2πiβ ′ )−exp (2πiβ)
τ −1 τ −1
These formulae are taken from Section 3.1 of Varadarajan’s expository paper [108],
where the degenerate cases are also discussed. See also [5] for an introduction to
hypergeometric equations.
Exercises
1. Show that if a finite group G acts on a Hausdorff space Y such that none of
its elements has a fixed point, the action is even and thus it yields a Galois
cover Y → G\Y for connected Y .
(a) Show that there is a natural injective homomorphism π1 (Y, y) → π1 (X, x).
(b) Viewing π1 (Y, y)op as a subgroup of Aut(X ex |X) via the isomorphism
e ∼ ex ∼
Aut(Xx |X) = π1 (X, x) , establish an isomorphism π1 (Y, y)op \X
op
=Y.
e e → G → 1,
1 → π1 (G, e)op → G
6. Let Xe∆ → X be the cover of the connected and locally simply connected
space X introduced in Remark 2.4.14 (2).
(a) Verify that the π1 (X, x)-space corresponding to Xe∆ via Theorem 2.3.4
is π1 (X, x) acting on itself via inner automorphisms.
(b) Let f : [0, 1] → X be a path with endpoints x = f (0) and y = f (1).
The pullback f ∗ Xe∆ is a trivial cover of [0, 1] by simply connectedness
e∆,x →∼ e
of [0, 1], whence an isomorphism of fibres X X∆,y . Verify that
under the identifications X∆,x = π1 (X, x) and X∆,y ∼
e ∼ e = π1 (X, y) this
isomorphism corresponds to the isomorphism of fundamental groups
∼
π1 (X, x) → π1 (X, y) induced by g 7→ f • g • f −1 .
product Π ×X Π with respect to the map s on the left and t on the right.
The groupoid structure is given by three morphisms of spaces over X × X:
a multiplication map m : Π ×X Π → Π, a unit map e : X → Π (where X is
considered as a space over X × X by means of the diagonal map), and an
inverse map ι : Π → Π which satisfies t ◦ ι = s, s ◦ ι = t. These are subject
to the following conditions:
9. Let S be a set having at least two elements, and let X be a topological space.
Define a presheaf FS of sets on X by setting FS (U ) = S for all nonempty
open sets U ⊂ X and ρU V = idS for all open inclusions V ⊂ U . Give a
necessary and sufficient condition on X for FS to be a sheaf.
Riemann Surfaces
We shall often refer to the equivalence class of atlases occurring in the above
definition as the complex structure on the Riemann surface.
73
74 Chapter 3. Riemann Surfaces
In the second and third examples we obtain compact Riemann surfaces; the
other two are non-compact. In fact, one may define a compact version of the
last example by considering smooth projective plane curves: these are to be the
closed subsets of the complex projective plane P2 (C) arising as the locus of zeros
of some homogeneous polynomial F ∈ C[x, y, z] such that the partial derivatives
∂x F, ∂y F, ∂z F have no common zero. The complex structure is defined in a similar
way as above, or by means of a covering by smooth affine curves.
It follows from the definition of equivalence between atlases that the above
definition does not depend on the atlases chosen. Riemann surfaces together with
holomorphic maps form a category.
We define a holomorphic function on an open subset U ⊂ X to be a holomor-
phic map U → C, where C is equipped with its usual complex structure. For
instance, a complex chart f : U → C is a holomorphic map.
commutes with an appropriate positive integer ey that does not depend on the choice
of the complex charts.
Definition 3.2.2 The integer ey of the proposition is called the ramification index
or branching order of φ at y. The points y with ey > 1 are called branch points.
We denote the set of branch points of φ by Sφ .
Corollary 3.2.4 The fibres of φ and the set Sφ are discrete closed subsets of Y .
Proof: The proposition implies that each point of Y has a punctured open
neighbourhood containing no branch points where φ is finite-to-one.
Now we restrict our attention to proper maps. Recall that a continuous map of
locally compact topological spaces is proper if the preimage of each compact subset
is compact. If the spaces in question are moreover Hausdorff spaces, then a proper
map is closed, i.e. maps closed subsets to closed subsets. This follows from the
easy fact that in a locally compact Hausdorff space a subset is closed if and only if
its intersection with every compact subset is closed. Note that Riemann surfaces
are locally compact Hausdorff spaces (since they are locally homeomorphic to open
subsets of C).
Examples 3.2.5
3. In the next chapter we shall see that if X and Y are smooth complex affine
plane curves and φ : Y → X is a finite algebraic morphism, then φ is proper
as a map of Riemann surfaces.
We can now state the main topological properties of proper holomorphic maps.
Proof: Finiteness of fibres follows from the previous corollary, since discrete
subsets of a compact space are finite. Surjectivity of φ holds because φ(Y ) is open
in X (by Corollary 3.2.3) as well as closed (as φ is proper), and X is connected.
For the last statement note that by Proposition 3.2.1 each of the finitely many
preimages of x ∈ X \φ(Sφ ) has an open neighbourhood mapping homeomorphically
onto some open neighbourhood of x; the intersection of these is a distinguished
open neighbourhood of x as in the definition of a cover.
We call a proper surjective map of locally compact Hausdorff spaces that re-
stricts to a finite cover outside a discrete closed subset a finite branched cover.
Its degree is by definition the degree of the cover obtained by restriction. With
this terminology the proposition says that proper holomorphic maps of Riemann
surfaces give rise to finite branched covers.
We now state a theorem that will show in particular that a proper holomorphic
map is determined by its topological properties. First some notation: given a
connected Riemann surface X and a discrete closed subset S ⊂ X, we denote by
HolX,S the category of Riemann surfaces Y equipped with a proper holomorphic
map Y → X whose branch points all lie above points in S. A morphism in this
category is a holomorphic map compatible with the projections onto X.
The proof will be in several steps. The following lemma essentially handles the
case S = ∅.
78 Chapter 3. Riemann Surfaces
The following proposition shows that the functor of Theorem 3.2.7 is essentially
surjective.
Proposition 3.2.9 Assume given a connected Riemann surface X, a discrete
closed set S of points of X and a finite connected cover φ′ : Y ′ → X ′ , where
X ′ := X\S. There exists a Riemann surface Y containing Y ′ as an open subset and
a proper holomorphic map φ : Y → X such that φ|Y ′ = φ′ and Y ′ = Y \ φ−1 (S).
two Riemann surfaces Y and Z equipped with proper holomorphic maps φY and
φZ onto X with no branch points above S and a morphism of covers ρ′ : Y ′ → Z ′
over X ′ with Y ′ = Y \ φ−1 ′ −1
Y (S) and Z = Z \ φZ (S), there is a unique holomorphic
map ρ : Y → Z over X extending ρ . We know from Lemma 2.2.11 that ρ′ : Y ′ →
′
Proof: As Aut(Y |X) acts transitively on all fibres not containing branch points,
the first statement follows from the continuity of automorphisms. Most of the sec-
ond statement then results from the automorphism property. Finally, the assertion
about stabilizers follows from the fact that an automorphism fixing y must stabi-
lize a sufficiently small open neighbourhood of y over which φ is isomorphic to the
cover z 7→ z e of the open disc.
Proof: For a nonzero f ∈ M(X) the function 1/f will be seen to give an element
of M(X) once we show that the zeros of f form a discrete closed subset. Indeed,
if it is not discrete, then it has a limit point x. Composing f with any complex
chart containing x we get a holomorphic function on some complex domain whose
set of zeros has a limit point. By the Identity Principle of complex analysis ([80],
Theorem 10.18) this implies that the function is identically 0, so f is 0 in some
neighbourhood of x. Now consider the set of those points y of X for which f
vanishes identically in a neighbourhood of y. This set is open by definition, but it
is also closed for it contains all of its boundary points by the previous argument.
Since X is connected, this implies f = 0, a contradiction.
The proof uses nontrivial analytic techniques and cannot be given here. See
e.g. [23], Corollary 14.13.
Remark 3.3.4 The theorem easily follows from the following seemingly weaker
statement: given a compact Riemann surface X and two points x1 , x2 ∈ X, there
exists a meromorphic function f on X holomorphic at the xi with f (x1 ) 6= f (x2 ).
Indeed, the function f1 := f − f (x2 ) satisfies f1 (x1 ) 6= 0 but f1 (x2 ) = 0, and there
is a similar function for x2 . If n points x1 , . . . , xn are given, we obtain by induction
functions fi with fi (xi ) 6= 0 but fi (xj ) = 0 for i 6= j. The theorem then follows by
taking a suitable linear combination.
has d distinct complex roots (namely the f (yi )), whence we must have n = d. If
by chance one of the ai happens to have a pole in x, observe that by continuity all
points x′ in a sufficiently small open neighbourhood of x still have the property
that they are not images of branch points, and moreover f is holomorphic and
takes distinct values at all of their preimages. We may choose x′ as above so that
all the ai are holomorphic at x′ and apply the argument with x′ instead of x.
Next observe that with f as above we have M(Y ) ∼ = M(X)(f ). Indeed, if g is
another element of M(Y ), we have M(X)(f, g) = M(X)(h) with some function
h ∈ M(Y ) according to the theorem of the primitive element. In particular we
have M(X)(f ) ⊂ M(X)(h), but since h should also have degree at most d over
M(X) by the lemma, this inclusion must be an equality, i.e. g ∈ M(X)(f ).
According to the proposition and the remarks preceding it, the rule Y 7→ M(Y )
gives a contravariant functor from the category of compact Riemann surfaces map-
ping holomorphically onto a fixed connected compact Riemann surface X to the
category of finite étale algebras over M(X).
Remark 3.3.9 One may replace the use of the implicit function theorem by using
ideas from residue calculus in complex analysis as in [23], Corollary 8.8.
Remark 3.3.13 The corollary is often summarized in the concise statement that
‘compact Riemann surfaces are algebraic’. In fact, it can be shown that compact
Riemann surfaces are always holomorphically isomorphic to smooth projective
complex algebraic curves, and that holomorphic maps between them all come
from algebraic morphisms. Here, however, one has to allow smooth curves in
projective 3-space as well, not just the plane curves mentioned at the beginning
of this chapter. For a vast generalization in higher dimension, see Theorem 5.7.4
and the subsequent discussion.
where each generator γi can be represented by a closed path around the point xi
passing through x. For n > 1 this group is isomorphic to the free group Fn−1
on n − 1 generators (send γi to a free generator fi of Fn−1 for i < n and γn
to (f1 · · · fn−1 )−1 ). In particular, every finite group arises as a finite quotient of
π1 (P1 (C) \ {x1 , . . . , xn }, x) for sufficiently large n. Since the field M(P1 (C)) is
isomorphic to C(t), Theorem 3.4.1 implies the corollary:
Corollary 3.4.4 Every finite group occurs as the Galois group of some finite Ga-
lois extension L|C(t).
We have proven more than what is stated in the corollary, namely that every
finite group G that can be generated by n − 1 elements arises as the automorphism
group of a Galois branched cover p : Y → P1 (C) ramified above at most n points
x1 , . . . , xn . We note for later use the following complement.
a finite disjoint union of open sets over which p is isomorphic to the branched cover
z 7→ z ej of the unit disc. Modifying y if necessary we may assume x ∈ Ui \ {xi }.
Denote by Vi the component of p−1 (Ui ) containing y, and by yi the unique preim-
age of xi in Vi . By continuity the stabilizer of yi in G must map Vi onto itself,
so it is a subgroup of the cyclic group Aut(Vi |Ui ). The latter group is generated
by the image of γi in G by Example 2.4.12. The action of γi stabilizes yi , so the
stabilizer of yi is the whole of Aut(Vi |Ui ).
88 Chapter 3. Riemann Surfaces
Corollary 3.4.4 describes the finite quotients of Gal (C(t)|C(t)), but does not
determine the profinite group itself. As the corollary indicates, it should be free
in an approprate sense. Here is a formal definition.
Definition 3.4.6 Let X be a set, and let F (X) be the free group with basis X.
The free profinite group Fb(X) with basis X is defined as the inverse limit formed by
the natural system of quotients F (X)/U , where U ⊂ F (X) is a normal subgroup
of finite index containing all but finitely many elements of X.
Remarks 3.4.7
Gal (C(t)|C(t)) ∼
= Fb(C)
of the absolute Galois group of C(t) with the free profinite group on the set C of
complex numbers.
π1 (XT , x0 ) → π1 (XS , x0 ) for some base point x0 after taking profinite completion,
so in particular λST (γx ) = 1 for x ∈ T \ S. The groups Gal (KXS |C(t)) together
with the maps λST form an inverse system indexed by the system of finite subsets
of C partially ordered by inclusion. The inverse limit is Gal (C(t)|C(t)), because
by Theorem 3.3.7 and Proposition 3.2.6 every finite subextension of C(t)|C(t) is
contained in KXS for sufficiently large S, so the intersection of the open normal
subgroups Gal (C(t)|KXS ) must be trivial. The theorem now follows from the
purely group-theoretic proposition below.
Proposition 3.4.9 Let X be a set, and S the system of finite subsets S ⊂ X
partially ordered by inclusion. Let (GS , λST ) be an inverse system of profinite
groups indexed by S satisfying the following conditions:
1. The λST are surjective for all S ⊂ T .
2. Each GS has a system {gx : x ∈ S} of elements so that the map Fb(S) → GS
induced by the inclusion S → GS is an isomorphism, and moreover for every
S ⊂ T we have λST (gx ) = 1 for x ∈
/ S.
Then lim GS is isomorphic to Fb(X).
←
Proof: We check the property of Remark 3.4.7 (2). First observe that there is
a natural injection ı̂ : X → lim Fb(S) sending x ∈ X to (xS )S∈S , where xS = x
←
for x ∈ S and xS = 1 otherwise. It generates a dense subgroup in lim Fb(S), so
←
given a map λ : X → G with G finite, an extension lim Fb(S) → G must be unique
←
if exists. But since G is finite, we must have λ(x) = 1 for all but finitely many
x ∈ X, so λ factors through the image of ı̂(X) in some quotient Fb(S), which is
none but S. The existence then follows from the freeness of Fb(S).
Lemma 3.5.1 Let X be a locally connected topological space, and let {Ui : i ∈ I}
be an open covering of X. Assume given covers pi : Yi → Ui for i ∈ I, together
with isomorphisms
∼ −1
θij : p−1
i (Ui ∩ Uj ) → pj (Ui ∩ Uj )
3.5 An Alternate Approach: Patching Galois Covers 91
for each pair i 6= j satisfying θjk ◦ θij = θik over Ui ∩ Uj ∩ Uk . Then there exists a
cover p : Y → X with p−1 (Ui ) isomorphic to Yi as a space over Ui for each i ∈ I.
It is unique up to unique isomorphism.
If moreover X and Y are connected and the Yi are Galois covers of X with
group G, then p : Y → X is also a Galois cover of X with group G.
Proof: In Construction 2.7.8 we have already seen how to patch locally constant
sheaves together. Therefore the dictionary between covers and locally constant
sheaves (Theorem 2.5.9) yields the first part of the lemma. For the second part
notice first that in the above construction automorphisms of the covers pi : Yi → X
also patch together to automorphisms of X; in particular we obtain an injective
map G → Aut(Y |X). Since each Aut(Yi |X) is transitive on the fibres of pi , so is
Aut(Y |X). By the connectedness assumption on Y we thus obtain that it is Galois
over X. Finally, restriction to the Yi shows that Y /G ∼
= X, whence it follows that
G∼ = Aut(Y |X).
We also need another topological construction, that of induced covers. This
concept is analogous to that of induced representations in algebra.
Notice that we do not claim that all branch points lie above the xi . In fact, the
construction will yield n other points y1 , . . . , yn above which the cover is branched.
Still, by the same arguments as in the previous section, the proposition implies
Corollary 3.4.4, i.e. that every finite group arises as a Galois group over C(t).
yi z ki + xi
z 7→ fi (z) := .
z ki + 1
It induces a holomorphic map φi : P1 (C) → P1 (C) mapping 0 to xi and ∞ to
yi . Since fi is the composite of the map z 7→ z ki with a complex automorphism of
P1 (C), the branch points of φi are 0 and ∞, lying above xi and yi , respectively.
The map φi : φ−1 i (Di ) → Di is by construction a Galois branched cover with
group Gi ; it restricts to a Galois cover over D̈i := Di \ {xi , yi }. Let Yi → D̈i be the
G-cover obtained by inducing this cover from Gi to G as in Construction 3.5.2.
For each i denote by Bi the closed disc of radius ri /2 around xi ; it contains yi
in its interior. Since P1 (C) \ Bi is simply connected, the restriction of the Galois
cover φi : P1 (C) \ {0, ∞} → P1 (C) \ {xi , yi } to P1 (C) \ Bi is trivial. Therefore the
restriction of the G-cover Yi → D̈i to the annulus Di \ Bi is trivial as well, being
induced from a trivial cover. Now denote by U the open subset of P1 (C) obtained
by removing all the Bi , and consider the trivial G-cover G × U → U . By what we
have just said, the restrictions of Yi → D̈i and G × U → U to D̈i ∩ U = Di \ Bi are
both trivial G-covers, and are therefore isomorphic. We can now apply Lemma
3.5.1 to the covering of X := P1 (C) \ {x1 , . . . , xn , y1 , . . . yn } formed by the D̈i and
U and patch the covers Yi → D̈i and G × U → U together in a G-cover Y ′ → X
(notice that the triple intersections are empty). It extends to a finite branched
cover Y → X by Theorem 3.2.7.
As the statement about the gi results from the construction, it remains to show
that Y → X is a Galois branched cover, which boils down to the connectedness
of Y ′ . Write Yi = (G/Gi ) × φ−1 i (D̈i ) using the definition of induced covers. If ē
denotes the class of the unit element e ∈ G in G/Gi , we see from the construction
that the action of gi ∈ Gi maps the component {ē} × φ−1 i (D̈i ) of Yi onto itself.
−1
Since the component {ē} × φi (D̈i ) meets the component {e} × U of G × U in Y ′ ,
we conclude applying gi that it also meets {gi } × U . It follows that {e} × U and
{gi } × U lie in the same connected component of Y ′ . Iterating the argument we
find that {e} × U and {g} × U lie in the same component of Y ′ for an arbitrary
n
product g = g1s1 · · · gns of the gi . But the gi generate G by assumption, hence all
components of G × U lie in the same component of Y ′ . This shows that Y ′ has
only one connected component.
3.6 Topology of Riemann Surfaces 93
Remark 3.5.4 The advantage of the above method is that it works in a more
general setting, that of complete valued fields. For instance, the same argument
can be used to prove the following interesting theorem originally due to D. Harbater
[31]: Every finite group arises as the Galois group of a finite Galois extension of
L|Qp (t), with Qp algebraically closed in L. Here Qp is the field of p-adic numbers,
i.e. the fraction field of the ring Zp encountered in Example 1.3.4 (4).
2. What the theorem really uses about the topology of compact Riemann sur-
faces is that they are orientable topological manifolds of dimension 2. The
topological manifold structure is obtained by considering the complex charts
as homeomorphisms of some neighbourhood of each point with an open sub-
set of R2 . Orientability can be expressed in this case by remarking that if
fi : Ui → C (i = 1, 2) are some complex charts on our Riemann surface,
then the map f1−1 ◦ f2 regarded as a real differentiable map from R2 to R2
has a positive Jacobian determinant at each point; this is a consequence of
the Cauchy-Riemann equations.
The second representation of tori with g holes in Remark 3.6.2 (1) makes it
possible to compute the fundamental group of a compact Riemann surface of genus
g. Here it is convenient to take as a base point the point x where all the closed
paths coming from the ai , bi , a−1 −1
i , bi meet. The homotopy classes of these paths
then generate the fundamental group. More precisely, one proves:
For the proof see e.g. [26], Proposition 17.6. It uses the definition of the
fundamental group in terms of closed paths and the van Kampen theorem.
Remark 3.6.4 By the same method that proves the theorem one can also de-
termine the fundamental group of the complement of n + 1 points x0 , . . . , xn in a
compact Riemann surface X of genus g. Here one has to add one generator γi for
each xi represented by a closed path going through x and turning around xi . We
get a presentation of π1 (X \ {x1 , . . . , xn }, x) by
a fixed point (Exercise 2). For C, one can prove that the only even action on it
with compact quotient is the one by Z2 as in the second example of Chapter 2,
Example 2.1.8, so the quotient is a torus C/Λ; this is in accordance with the case
g = 1 of the theorem. All other compact Riemann surfaces are thus quotients of
the open unit disc D by some even group action. Poincaré studied such actions
and showed that they come exactly from transformations mapping ai to a−1 i and
bi to b−1
i in a 4g-gon with sides labelled as above; the only difference is that in this
case the sides of the polygon are not usual segments but circular arcs inscribed
into the unit disc, for he worked in the model of the hyperbolic plane named after
him.
2. A triangulation of the complex torus C/Λ may be easily obtained from its
description as a square with opposite edges identified. Divide first the square
into nine subsquares by dividing each edge in three, and then divide each
subsquare in two triangles by the diagonal from the upper left to the lower
right corner. After identification of the edges of the original square these
induce a triangulation of the torus.
We now prove:
where the sum is over the branch points of φ and ey is the ramification index at
the branch point y ∈ Y .
Exercises 97
Now it is a known topological fact that the Euler characteristic of a torus with
g holes is 2 − 2g (see [26], p. 244; the cases g = 0, 1 may be read off from the above
example). Hence the proposition implies:
Proof: Otherwise the right hand side of the Riemann-Hurwitz formula would
be non-negative and the left hand side −2.
Remark 3.6.14 Combining the last corollary with Theorem 3.3.7 one obtains the
following purely algebraic fact: Every subfield C ⊂ K ⊂ C(t) with [C(t) : K] < ∞
is isomorphic to C(t). This is in fact true for an arbitrary field k in place of C and
is known as Lüroth’s Theorem (see [106], §73). Using the techniques of subsequent
chapters it is possible to extend the above proof to the general case.
Exercises
where n is the cardinality of the fibres not containing branch points, and ey
is the ramification index at y ∈ Y .
98 Chapter 3. Riemann Surfaces
(a) Show that there exists a unique rational function f ∈ C(t) such that
φ = φf as in the proof of Proposition 3.3.11.
(b) Relate the branch points of φf to zeros and poles of f , and determine
the ramification indices.
(c) Show that φf is a holomorphic automorphism of P1 (C) if and only if
f = (at + b)(ct + d)−1 with some a, b, c, d ∈ C satisfying ad − bc 6= 0.
[Hint: Observe that if φf is an automorphism, f can only have a single
zero and pole, and these must be of order 1.]
(d) Deduce that a holomorphic automorphism of P1 (C) has one or two
fixed points.
Dn = hσ, τ | σ n = τ 2 = 1, στ = τ σ n−1 i.
Show that every complex torus X = C/Λ has a Galois branched cover
Y → X with group Dn with exactly n branch points, all lying above the
same point of X.
In the previous chapter the Riemann Existence Theorem created a link between
the category of compact connected Riemann surfaces and that of finite extensions
of C(t). This hints at a possibility of developing a theory of the fundamental
group in a purely algebraic way. We shall now present such a theory for curves
over an arbitrary perfect base field, using a modest amount of algebraic geometry.
Over the complex numbers the results will be equivalent to those of the previous
chapter, but a new and extremely important feature over an arbitrary base field
k will be the existence of a canonical quotient of the algebraic fundamental group
isomorphic to the absolute Galois group of k. In fact, over a subfield of C we shall
obtain an extension of the absolute Galois group of the base field by the profinite
completion of the topological fundamental group of the corresponding Riemann
surface over C. This interplay between algebra and topology is a source for many
powerful results in recent research. Among these we shall discuss applications to
the inverse Galois problem, Belyi’s theorem on covers of the projective line minus
three points and some advanced results on ‘anabelian geometry’ of curves.
Reading this chapter requires no previous acquaintance with algebraic geome-
try. We shall, however, use some standard results from commutative algebra that
we summarize in the first section. The next three sections contain foundational
material, and the discussion of the fundamental group itself begins in Chapter 5.4.
101
102 Chapter 4. Fundamental Groups of Algebraic Curves
in B. In the special case when A is an integral domain and B is the fraction field of
A one says that A is integrally closed. The basic properties of integral extensions
may be summarized as follows.
All these facts are proven in [48], Chapter VII, §1, or [2], Chapter 5, 5.1–5.10.
Statement (1) is [48], Chapter VII, Proposition 2.1, and (2) results from (1).
Statement (3) is proven in [48], Chapter VII, Proposition 2.5. Finally, statement
(4) results from [69], Theorem 9.12 (and the discussion preceding it).
We now state an important result concerning the finiteness of integral closure.
Facts 4.1.4 Let A be an integral domain with fraction field K, and let L|K be a
finite extension. Assume that either
For the proof of part a), see e.g. [2], Corollary 5.17; for b), see [18], Corollary
13.13.
An integral domain A is called a Dedekind ring if A is Noetherian (i.e. all of
its ideals are finitely generated), integrally closed, and all nonzero prime ideals
in A are maximal. Basic examples of Dedekind rings are polynomial rings in one
variable over a field, the ring Z of integers and, more generally, the integral closure
of Z in a finite extension of Q.
Recall that the localization AP means the fraction ring of A with respect to
the multiplicatively closed subset A \ P , which in our case is the subring of the
fraction field of A constisting of fractions with denominator in A \ P . For a proof,
see e.g. [2], Theorem 9.3 and Corollary 9.4.
Note that in view of Facts 4.1.1 (2), (4) and 4.1.4 a) the integral closure of a
Dedekind ring in a finite separable extension of its fraction field is again a Dedekind
ring. We then have the following consequence of the above facts.
Proposition 4.1.6 Let A be a Dedekind ring with fraction field K, and let B be
the integral closure of A in a finite separable extension L|K. For a nonzero prime
ideal P ⊂ A consider the decomposition P B = Qe11 · · · Qerr in B. Then
r
X
ei [κ(Qi ) : κ(P )] = [L : K].
i=1
104 Chapter 4. Fundamental Groups of Algebraic Curves
Proof: By the Chinese Remainder Theorem and Fact 4.1.5 (1) we have an iso-
morphism
B/P B ∼
= B/Qe11 ⊕ · · · ⊕ B/Qerr . (4.1)
Since each Qi generates a principal ideal (qi ) in the localization BQi by Fact
∼
4.1.5 (2), the map b 7→ qij b induces isomorphisms κ(Qi ) = B/Qi → Qji /Qj+1 i
for all 0 ≤ j ≤ ei − 1. It follows that the left hand side of the formula of the
proposition equals the dimension of the κ(P )-vector space B/P B. Choose elements
t1 . . . , tn ∈ B whose images B/P B form a basis of this vector space. By a form
of Nakayama’s lemma ([48], Chapter X, Lemma 4.3) the ti generate the finitely
generated AP -module B ⊗A AP , hence they generate P the K-vector space L. It
remains to see that there is no nontrivial relation ai ti = 0 with ai ∈ K. Assume
there is one. By multiplying with a suitable generator of the principal ideal P AP
we may assume the ai lie in AP and not all of them are in P AP . Reducing
modulo P AP we then obtain a nontrivial relation with coefficients in κ(P ), which
is impossible.
The integers ei in formula (4.1) are called the ramification indices at the prime
ideals Qi lying above P . As it will turn out later, it is by no means accidental that
we are using the same terminology as for branched topological covers.
Proof: It is enough to verify the second statement for the inertia subgroup at
Q1 , the rest then follows from Proposition 4.1.3 (2). Let K1 be the subfield of L
fixed by DQ1 , A1 the integral closure of A in K1 and P1 := Q1 ∩A1 . Since Q1 is the
only maximal ideal of B above P1 by construction, the formula of the proposition
reads |DQ1 | = e1 · [κ(Q1 ) : κ(P1 )]. On the other hand, Proposition 4.1.3 (3) implies
|DQ1 | = |IQ1 | · [κ(Q1 ) : κ(P1 )] (note that the extension κ(Q1 )|κ(P1 ) is separable,
being a subextension of κ(Q1 )|κ(P )). The statement follows by comparing the two
equalities.
Before leaving this topic, we collect some facts about local Dedekind rings.
For a proof, see e.g. [57], Theorem 11.2. Such rings are called discrete valuation
rings in the literature.
Proof: The intersection of the ideals (tn ) is 0 (this follows, for instance, from
the fact that A is a principal ideal domain). Thus for a 6= 0 there is a unique n ≥ 0
with a ∈ (tn ) \ (tn+1 ), whence (1). By (1), if x is an element of the fraction field,
we may write x = utn with a unit u and t ∈ Z, whence (2). For statement (3),
assume b ∈ B is not a unit. Then b ∈ A, for otherwise we would have b−1 ∈ A ⊂ B
by (2), which is impossible. It moreover follows that b ∈ (t) (otherwise it would be
a unit), so using (1) we see that t cannot be a unit in B. It follows that non-units
in A are non-units in B, from which we conclude by (2) that the units of B lie in
A.
Corollary 4.1.10 Every nonzero a ∈ K can be written as a = utn with some unit
u ∈ A and n ∈ Z. Here n does not depend on the choice of the generator t.
See [48], Chapter VIII, Theorem 2.1 for a proof. Notice that in the situation
above k[x1 , . . . , xd ] is isomorphic to a polynomial ring.
Proof: Let P ⊂ A be a nonzero prime ideal, and use the normalization lemma
to write A as an integral extension of the polynomial ring k[x]. The prime ideal
P ∩ k[x] of k[x] is nonzero, because if t is a nonzero element of P , it satisfies a
monic polynomial equation tn + an−1 tn−1 + · · · + a0 = 0 with ai ∈ k[x]. Here we
may assume a0 is nonzero, but it is an element of P ∩k[x] by the equation. Now all
nonzero prime ideals in k[x] are generated by irreducible polynomials and hence
they are maximal. Thus P is maximal by Fact 4.1.1 (4).
Corollaries 4.1.13 and 4.1.14 are weak forms of Hilbert’s Nullstellensatz. Here
is a statement that may be considered as a strong form. Recall that the radical
√
I of an ideal I in a ring A consists of the elements f ∈ A satisfying f m ∈ I with
an appropriate m > 0.
This follows from [48], Chapter IX, Theorem 1.5 in view of the previous corol-
lary. See also [18], Corollary 13.12 combined with Corollary 2.12.
Definition 4.2.1 The subset X := V (I) ⊂ An (k) is called the affine closed set
defined by I.
Remark 4.2.2 According to the Hilbert Basis Theorem there exist finitely many
polynomials f1 , . . . , fm ∈ k[x1 , . . . , xn ] with I = (f1 , . . . , fm ). Therefore
Example 4.2.3 Let us look at the simplest examples. For n = 1 each ideal in
k[x] is generated by a single polynomial f ; since k algebraically closed, f factors
in linear factors x − ai with some ai ∈ k. The affine closed set we obtain is a finite
set of points corresponding to the ai .
For n = 2, m = 1 we obtain the locus of zeros of a single two-variable polyno-
mial f in A2 : it is a plane curve. In general it may be shown that an affine closed
set in A2 is always the union of a plane curve and a finite set of points.
The following lemma is an easy consequence of the definition; its proof is left
to the readers.
108 Chapter 4. Fundamental Groups of Algebraic Curves
The last two properties imply that the affine closed sets may be used to define
the closed subsets in a topology on An (note that An = V (0), ∅ = V (1)). This
topology is called the Zariski topology on An , and affine closed sets are equipped
with the induced topology. A basis for the Zariski topology is given by the open
subsets of the shape D(f ) := {P ∈ An : f (P ) 6= 0}, where f ∈ k[x1 , . . . , xn ] is a
fixed polynomial. Indeed, each closed subset V (I) is the intersection of subsets of
the form V (f
√ ).
If I = I, then by Fact 4.1.16 it is the intersection of the maximal ideals
containing it. These are of the form (x1 − a1 , . . . , xn − an ) with some ai ∈ k ac-
cording to Corollary 4.1.14, therefore I consists precisely of those f ∈ k[x1 , . . . , xn ]
that vanish at all P ∈ V (I). Thus in this case the ideal I and the set X = V (I)
determine each other, and we call X an affine variety.
Example 4.2.6 If we look back at the examples in 4.2.3, we see that for n = 1
the affine closed set we obtain is a variety if and only if the ai are distinct, and it
is integral if and only if it is a single point, i.e. f = x − ai .
The affine plane curve X = V (f ) ⊂ A2 is an integral variety if and only if f is
irreducible.
We may use the notion of regular functions to define morphisms of affine vari-
eties, and hence obtain a category.
Proof: For fully faithfulness, let x̄1 , . . . , x̄m be the coordinate functions on X.
Then φ∗ 7→ (φ∗ (x̄1 ), . . . , φ∗ (x̄m )) defines an inverse for the map φ 7→ φ∗ . For essen-
tial surjectivity, simply write a finitely generated reduced k-algebra as a quotient
A∼ = k[x1 , . . . , xn ]/I. Then X = V (I) is a good choice.
To proceed further, we define regular functions and morphisms on open subsets
of an integral affine variety X. First, the function field K(X) of X is the fraction
field of the integral domain O(X). By definition, an element of K(X) may be
represented by a quotient of polynomials f /g with g ∈ / I, with two quotients f1 /g1
and f2 /g2 identified if f1 g2 − f2 g1 ∈
/ I.
Next let P be a point of X. We define the local ring OX,P at P as the subring
of K(X) consisting of functions that have a representative with g(P ) 6= 0. It is the
same as the localization of O(X) by the maximal ideal corresponding to P . One
thinks of it as the ring of functions ‘regular at P ’. For an open subset U ⊂ X we
define the ring of regular functions on U by
\
OX (U ) := OX,P ,
P ∈U
the intersection being taken inside K(X). The following lemma shows that for
U = X this definition agrees with the previous one.
T
Lemma 4.2.11 For an integral affine variety X one has O(X) = OX,P .
P ∈X
110 Chapter 4. Fundamental Groups of Algebraic Curves
T
Proof: To show the nontrivial inclusion, pick f ∈ OX,P , and choose for each
P
P a representation f = fP /gP with gP ∈ / P . By our assumption on the gP none
of the maximal ideals of O(X) contains the ideal I := hgP : P ∈ Xi ⊂ O(X),
so Fact 4.1.16 implies I = O(X). In particular, there exist P1 , . . . , Pr ∈ X with
1 = gP1 hP1 + . . . gPr hPr with some hPi ∈ O(X). Thus
r
X r
X r
X
f= f gPi hPi = (fPi /gPi )gPi hPi = fPi hPi ∈ O(X),
i=1 i=1 i=1
as required.
Remark 4.2.13 We can give a geometric meaning to this algebraic notion as fol-
lows. First note that since O(Ank ) = k[x1 , . . . , xn ], affine n-space has dimension n,
as expected. Next, let X be an integral affine variety of dimension n. The Noether
Normalization Lemma (Fact 4.1.11) together with Proposition 4.2.10 shows that
there is a surjective morphism φ : X → An so that moreover O(X) is a finitely
generated k[x1 , . . . , xn ]-module. The latter property implies that φ has finite fi-
bres. Indeed, if P = (a1 , . . . , an ) is a point in An and MP = (x1 − a1 , . . . , xn − an )
the corresponding maximal ideal in k[x1 , . . . , xn ], then O(X)/MP O(X) is a fi-
nite dimensional k-algebra, and as such has finitely many maximal ideals. Their
preimages in O(X) correspond to the finitely many points in φ−1 (P ). Thus n-
dimensional affine varieties are ‘finite over An ’.
Integral affine varieties of dimension 1 are called integral affine curves. The
following lemma shows that their Zariski topology is particularly simple.
Lemma 4.2.14 All proper Zariski closed subsets of an integral affine curve are
finite.
Proof:
√ Quite generally, in a Noetherian ring every proper ideal I satisfying
I = I is an intersection of finitely many prime ideals (a consequence of primary
decomposition; see e.g. [2], Theorem 7.13). Therefore the lemma follows from
Corollary 4.1.12.
We now show that in characteristic 0 every normal affine curve is locally iso-
morphic to one as in the above example.
Proof: The local ring OX,P is a discrete valuation ring, so its maximal ideal
is principal, generated by an element t. Since we are in characteristic 0, by the
theorem of the primitive element we find s ∈ K(X) such that K(X) = k(t, s).
Replacing s by stm for m sufficiently large if necessary, we may assume s ∈ OX,P .
Taking the minimal polynomial of s over k(t) and multiplying by a common de-
nominator of the coefficients we find an irreducible polynomial f ∈ k[x, y] such that
f (t, s) = 0 and moreover the fraction field of the ring k[x, y]/(f ) is isomorphic to
K(X). It follows that the map (t, s) 7→ (x, y) defines an isomorphism of K(X)
onto the function field of the plane curve V (f ) ⊂ A2k . If we choose U ⊂ X so that
t, s ∈ OX (U ), then the above map defines a morphism ρ : U → V (f ). Conversely,
the map x 7→ t, y 7→ s defines a morphism V (f ) → X that is an inverse to ρ on
ρ(U ); in particular, ρ(U ) is open in V (f ). We conclude that X and V (f ) contain
the isomorphic open subsets U and ρ(U ), with U containing P .
112 Chapter 4. Fundamental Groups of Algebraic Curves
Remarks 4.2.19
1. The only place in the above proof where we used the characteristic 0 assump-
tion is where we applied the theorem of the primitive element. But if t is a
generator of the maximal ideal of a normal point as in the above proof, the
extension K(X)|k(t) is always separable (see e.g. [100], Proposition II.1.4),
and hence the theorem applies. Thus the proposition extends to arbitrary
characteristic.
Remarks 4.2.21
We now give the general definition of integral affine curves. This will be a
special case of the definition of affine schemes to be discussed in the next chapter,
but there are some simplifying features.
is the set of prime ideals of A, and we equip it with the topology in which the
open subsets are X and those that do not contain a given ideal I ⊂ A. Note
that all nonempty open subsets contain the point (0); it is called the generic point
of X. The other points come from maximal ideals and hence are closed as one-
point subsets; we call them closed points. By the same argument as in Corollary
4.2.14 the open subsets in X are exactly the subsets whose complement is a finite
(possibly empty) set of closed points.
Given a point P in X, we define the local ring OX,P as the localization AP ;
note that for P = (0) we obtain OX,(0) = K(X), the fraction field of A. Finally,
we put \
OX (U ) := OX,P ,
P ∈U
Next we introduce morphisms for the curves just defined. They are to be
morphisms of ringed spaces, whose general definition is as follows.
instead of OX (X). This is in accordance with the notation of the previous chapter,
and we may also call O(X) the coordinate ring if X.
For affine curves X = Spec (A) and Y = Spec (B) a morphism φ : X → Y in-
duces a ring homomorphism φ♯X : A → B given by O(X) → (φ∗ OY )(X) = O(Y ).
Associating a morphism of curves with a ring homomorphism is a bit more com-
plicated.
Proof: For a prime ideal P ⊂ B the ideal ρ−1 (P ) ⊂ A is a prime ideal (indeed,
the map A/(ρ−1 (P )) → B/P is injective, hence A/(ρ−1 (P )) is an integral domain).
This defines a map of sets Spec (ρ) : Y → X that is easily seen to be a continuous
map of topological spaces. But in our situation we can say more. There are two
cases.
Case 1: ρ is injective. In this case A is a subring of B via ρ, and moreover by
Corollary 4.1.15 if P is a maximal ideal in B, then ρ−1 (P ) = P ∩ A is a maximal
ideal in A. Of course, we have ρ−1 ((0)) = (0).
Case 2: ρ is not injective. As we are dealing with curves, the ideal M := ker(ρ) is
maximal in A, and so ρ−1 (P ) = M for all prime ideals P ⊂ B. This corresponds
to a ‘constant morphism’ Y → {M }.
We define morphisms of sheaves Spec (ρ)♯ : OX → Spec (ρ)∗ OY in each case.
In Case 1 we have an inclusion of function fields K(X) ⊂ K(Y ) and also of
localizations A(P ∩A) ⊂ BP for each maximal ideal P ⊂ B. By taking intersections
this defines maps OX (U ) → OY (Spec (ρ)−1 (U )) for each open set U ⊂ X; for
U = X we get ρ : A → B by the same argument as in Lemma 4.2.11. In Case 2
we define OX (U ) → OY (Spec (ρ)−1 (U )) to be the composite
∼
OX (U ) → AM → AM /M AM → A/M → B
if M ∈ U , and to be 0 otherwise. The reader will check that this indeed yields a
morphism of sheaves.
Proposition 4.3.6 The assignments A 7→ Spec (A), ρ 7→ Spec (ρ) and X 7→ O(X),
φ 7→ φ♯X yield mutually inverse contravariant functors between the category of in-
tegral domains finitely generated and of transcendence degree 1 over a field, and
that of integral affine curves.
Note that the conclusion here is stronger than in Proposition 4.2.10, because
here we say that the two categories are actually anti-isomorphic: there is an arrow-
reversing bijection between objects and morphisms. In Proposition 4.2.10 this was
116 Chapter 4. Fundamental Groups of Algebraic Curves
only true up to isomorphism, because an affine variety as defined there may have
several embeddings in affine spaces.
We now discuss an important construction related to extensions of the base
field.
Construction 4.3.7 Let X = Spec (A) be an integral affine curve over a field k,
and L|k a field extension for which the tensor product A⊗k L is an integral domain.
Then the integral affine curve XL = Spec (A ⊗k L) is defined. We call the resulting
curve over L the base change of X to L. There is a natural morphism XL → X
corresponding by the previous proposition to the map A → A ⊗k L sending a to
a ⊗ 1.
Assume that A ⊗k k̄ is an integral domain for an algebraic closure k̄|k; in this
case X is called geometrically integral. Then A ⊗k L is an integral domain for all
algebraic extensions L|k, so that the above assumption on L is satisfied. Thus for a
fixed algebraic extension L|k the rule X 7→ XL defines a functor from the category
of geometrically integral affine k-curves to that of integral affine L-curves.
We say that an integral affine curve is normal if its local rings are integrally
closed. As in the previous section, this is equivalent to requiring that the coordinate
ring O(X) is integrally closed.
We now prove an analogue of Theorem 3.3.7 for integral affine curves. For
this we have to restrict the morphisms under the consideration. We say that a
morphism φ : Y → X of integral affine curves is finite if O(Y ) becomes a finitely
generated O(X)-module via the map φ♯X : O(X) → O(Y ). A finite morphism has
finite fibres, by the same argument as in Remark 4.2.13. This property is shared
by proper holomorphic maps of Riemann surfaces.
and in Case 1 we may apply Fact 4.1.1 (4). An example of a non-finite mor-
phism is given by the inclusion A1k \ {0} → A1k , corresponding to the natural ring
homomorphism k[x] → k[x, x−1 ] (over any field k).
Theorem 4.3.10 Let X be an integral normal affine curve. The rule Y 7→ K(Y ),
φ 7→ φ∗ induces an anti-equivalence between the category of normal integral affine
curves equipped with a finite morphism φ : Y → X and that of finite field exten-
sions of the function field K(X).
Proof: For essential surjectivity take a finite extension L|K(X), and apply Fact
4.1.4 b) with A = O(X). It implies that the integral closure B of O(X) in L is a
finitely generated k-algebra, which is also integrally closed by Fact 4.1.1 (2). As L
is finite over K(X), it is still of transcendence degree 1. Applying Proposition 4.3.6
to the ring extension O(X) ⊂ B we obtain an integral affine curve Y = Spec (B)
and a morphism φ : Y → X inducing the ring inclusion O(X) ⊂ B above. Again
by Fact 4.1.4 b) the morphism φ is finite. Fully faithfulness is proven by a similar
argument as in Theorem 3.3.7.
The affine curve Y constructed in the first part of the proof is called the
normalization of X in L.
Examples 4.3.11
an isomorphism. As in the proof of Proposition 4.2.18 one sees that the condition
∼
φ∗ : K(X) → K(Y ) can be rephrased by saying that φ is an isomorphism over
an open subset. So the criterion becomes: X is normal if and only if every finite
surjective morphism Y → X inducing an isomorphism over an open subset is in
fact an isomorphism.
Lemma 4.4.1 The local rings of an integral normal affine curve X are exactly
the discrete valuation rings R with fraction field K(X) containing O(X).
Proof: If R is such a ring, the intersection of its maximal ideal M with O(X) is
nonzero, for otherwise the restriction of the projection R → R/M to O(X) would
be injective, and the field R/M would contain K(X), which is absurd. Thus
M ∩ O(X) is a maximal ideal in O(X), and R contains the local ring OX,P . But
then by Proposition 4.1.9 (3) we have R = OX,P .
Example 4.4.2 The rational function field k(x) is the function field of the affine
line A1k over k; we have O(A1k ) = k[x]. But k(x) is also the fraction field of the
ring k[x−1 ], which we may view as the coordinate ring of another copy of A1k
with coordinate function x−1 . By Proposition 4.1.9 (3) every discrete valuation
ring R ⊃ k with fraction field k(x) contains either x or x−1 , and hence by the
preceding discussion R is a local ring of one of the two copies of A1k . In fact, there
is only one localization of k[x−1 ] that does not contain x: the localization at the
ideal (x−1 ). Thus there is only one discrete valuation ring R as above that is not
a local ring on the first copy of A1k ; it corresponds to the ‘point at infinity’. The
whole discussion is parallel to the construction of the complex structure on the
Riemann surface P1 (C) in Example 3.1.3 (2): there we took a copy of C around 0,
another copy around ∞, and outside these two points we identified the two charts
4.4 Proper Normal Curves 119
via the isomorphism z 7→ z −1 . Thus we may regard the discrete valuation rings
R ⊃ k with fraction field k(x) as the local rings of the projective line over k.
We can generalize the example as follows. Given a normal integral affine curve
X, we may use the Noether Normalization Lemma (Fact 4.1.11) to find a regular
function f ∈ O(X) such that O(X) is a finitely generated module over k[f ]. Let
X − be the normal affine curve corresponding to the integral closure of k[f −1 ] in
K(X). By Lemma 4.4.1 and the above example every discrete valuation ring R ⊃ k
with fraction field K(X) is a local ring of either X or X − . Moreover, there are only
finitely many R that are not local rings of X, namely the localizations of O(X − ) at
the finitely many maximal ideals lying above (f −1 ) ⊂ k[f −1 ]. Informally speaking,
we may view the set of the above R as the local rings on a ‘curve’ obtained by
‘gluing X and X − together’.
We now give a formal definition that is independent of the choice of the function
f above.
Construction 4.4.3 Let k be a field, and K|k a finitely generated field extension
of transcendence degree 1. Let X K be the set of discrete valuation rings with
fraction field K containing k. Endow X K with the topology in which the proper
closed subsets K
T are the finite subsets. Define aKsheaf of rings on X by the formula
O (U ) = R∈U R for an open subset U ⊂ X . We call the ringed space (X K , OK )
K
Remark 4.4.4 It can be shown that a proper normal curve comes from a pro-
jective curve in the same way as its affine open subsets come from affine curves.
We explain the necessary notions very briefly over an algebraically closed field
k. One identifies points of projective n-space Pnk over k with (n + 1)-tuples
(a0 , . . . , an ) ∈ k n+1 \ {(0, . . . , 0)}, modulo the equivalence relation identifying two
(n + 1)-tuples (a0 , . . . , an ) and (b0 , . . . , bn ) if there exists λ ∈ k × with ai = λbi for
all i. A projective variety X over k is then a subset of some Pn (k) given by the
locus of common zeros of a finite system of homogeneous polynomials. If these
polynomials generate a prime ideal I(X) in k[x0 , . . . , xn ] we say that the variety
is integral. The subring of the fraction field of k[x0 , . . . , xn ]/I(X) that can be
represented by quotients of homogeneous polynomials of the same degree is the
function field K(X) of X. The local ring OX,P of a point P ∈ X is the subring
of K(X) consisting of elements that can be represented by a function with non-
vanishing denominator at P ; the sheaf OX is defined as in the affine case. The
120 Chapter 4. Fundamental Groups of Algebraic Curves
Proof: Given a finite extension L|K, we may use the discussion after Example
4.4.2 to cover X K (resp. X L ) by two open subsets X+ K and X K (resp. X L and
− +
X− ) arising as normalizations of two overlapping copies of A1k in K (resp. L).
L
From now on we drop the annoying superscript K from the notation when
discussing proper normal curves.
A morphism φ : Y → X of proper normal curves is finite if for all affine
open subsets U ⊂ X the preimage φ−1 (U ) ⊂ Y is affine, and moreover φ∗ O(U )
is a finitely generated O(U )-module. The restriction of φ to each φ−1 (U ) may be
identified with the finite morphism of affine curves corresponding to the k-algebra
homomorphism O(U ) → φ∗ O(U ). It thus follows from Remark 4.3.9 that a finite
morphism is always surjective. Conversely, we have:
Proof: Let U ⊂ X be an affine open subset. The points of φ−1 (U ) are the
discrete valuation rings R with fraction field L containing O(U ). Since each R is
integrally closed, it contains the integral closure B of O(U ) which is none but the
coordinate ring of the normalization V of U in L by the proof of Theorem 4.3.10.
Lemma 4.4.1 then allows us to identify V with φ−1 (U ), so the latter is indeed an
affine open subset finite over U .
Corollary 4.4.8 Mapping an integral proper normal curve over k to its function
field induces an anti-equivalence between the category of integral proper normal
curves with finite surjective morphisms and that of finitely generated field exten-
sions of k having transcendence degree 1.
the field κ(P ) = A/P . It is étale over an open subset U ⊂ X if it is étale over all
P ∈ U.
Proof: This follows from the above discussion, together with the facts that φ
induces inclusions of discrete valuation rings OX,P ⊂ OY,Pi for each i, and OY,Pi
is the localization of B by Pi .
Remark 4.5.3 The κ(P )-algebra B/P B should be interpreted as the ring of reg-
ular functions on the fibre of φ over the point P . The fact that some ei above is
greater than 1 means that there are nilpotent functions on the fibre.
In sheaf-theoretic language, one can check that B/P B ∼ = (φ∗ OY )P ⊗OX,P κ(P ),
where (φ∗ OY )P is the stalk of the direct image sheaf φ∗ OY at P . In this interpre-
tation, the separability of φ means that (φ∗ OY )(0) is a separable field extension of
OX,(0) = K(X), and therefore an étale K(X)-algebra.
The above abstract notions are enlightened by the following key example.
Example 4.5.4 Consider the map ρn : A1C → A1C given by x → xn for some
n > 0. The coordinate ring of A1 over C is C[x], and the morphism corresponding
to ρn by Proposition 4.3.6 is given by the inclusion C[xn ] → C[x]. A closed point
a ∈ A1 (C) corresponds to the maximal ideal Ma = (x − a). To check whether ρn
is étale over the point a, we take a primitive n-th root of unity ω and an n-th root
√
n
a of a, and compute
L C[x]/(x − ω i √
n−1
n ∼
n
a) ∼
= Cn a 6= 0
C[x]/(x − a)C[x] = i=0
C[x]/(xn ) a = 0.
In the first case we indeed obtain a finite étale algebra of dimension n over C.
On the other hand, for a = 0 we obtain a C-algebra containing nilpotents, which
therefore cannot be étale. The nilpotent functions on the fibre over 0 reflect the
property that the fibre is degenerate.
4.5 Finite Branched Covers of Normal Curves 123
Remark 4.5.5 The above example is the algebraic analogue of the local branch-
ing behaviour of the morphism z → z n of Riemann surfaces. In fact, the same
argument shows more generally that if k is algebraically closed and ρf : A1k → A1k
is the morphism coming from the k-homomorphism k[x] → k[x] mapping x to
f ∈ k[x], then ρf is étale above a point P if and only if every preimage Q of P
satisfies f ′ (Q) 6= 0.
We can generalize this comparison with the theory over C as follows. Assume
given a finite morphism φ : Y → X of integral normal affine curves over C
coming from a ring homomorphism A → B, and equip X and Y with the complex
structures defined in Construction 4.2.20. Let P ∈ X be a closed point, and
consider the decomposition P B = P1e1 · · · Prer discussed above.
Proposition 4.5.6 With notations as above, the integer ei is the same as the
ramification index at Pi of φ considered as a holomorphic map. In particular, φ as
an algebraic map is étale above P if and only if as a holomorphic map it restricts
to a cover over a complex neighbourhood of P .
Remark 4.5.7 Comparing the above proposition with Theorem 3.2.7 we see that
a finite morphism of normal affine curves must be proper as a holomorphic map.
Example 4.5.8 A key example occurring over a non-closed field is the following.
Assume that X is a geometrically integral affine curve over a field k (see Construc-
tion 4.3.7). Given a finite separable extension L|k, the base change morphism
XL → X is finite and étale over the whole of X. Indeed, it is finite as L|k is finite,
and moreover it is étale over each P ∈ X, because O(XL )/P O(XL ) ∼ = κ(P ) ⊗k L,
which is indeed a product of finite separable field extensions of κ(P ).
C/P C ∼
= C ⊗A κ(P ) ∼
= C ⊗B (B ⊗A κ(P )). (4.2)
In the proof of the proposition we shall only use the first statement of the
lemma, which holds without the assumption of normality.
Proof of Proposition 4.5.9: We keep the notation A = O(X), B = O(Y ) from
the previous proof. As φ is finite, viewing A as a subring of B we find finitely many
elements f1 , . . . , fr ∈ B integral over A such that B = A[f1 , . . . , fr ]. Consider the
chain
A ⊂ A[f1 ] ⊂ A[f1 , f2 ] ⊂ · · · ⊂ A[f1 , . . . , fr−1 ] ⊂ B
and the chain of morphisms of curves corresponding to it via Proposition 4.3.6.
By induction on r using Lemmas 4.2.14 and 4.5.10 we reduce to the case r = 1, i.e.
B = A[t]/(F ) with a monic polynomial F ∈ A[t] satisfying F (f1 ) = 0. Here F is
also the minimal polynomial of f1 over K(X), so its derivative F ′ must be prime
to F in the ring K(X)[t], since f1 is contained in the separable extension K(Y ) of
K(X). We then find polynomials G1 , G2 ∈ K(X)[t] satisfying G1 F + G2 F ′ = 1.
Multiplying with a common denominator g ∈ A of the coefficients of G1 and G2
we obtain polynomials H1 = gG1 , H2 = gG2 ∈ A[t] with H1 F + H2 F ′ = g. We
claim that U = D(g) ⊂ X is a good choice. Indeed, assume P is a maximal ideal
in A with g ∈/ P . The image F̄ of F in κ(P )[t] cannot have multiple roots in an
algebraic closure of κ(P ), because reducing H1 F + H2 F ′ = g mod P we obtain
4.5 Finite Branched Covers of Normal Curves 125
We now extend the above notions to integral proper normal curves. Note first
that the notion of a separable morphism carries over immediately to the proper
case, as it only depends on the function fields. Then define a finite separable
morphism φ : Y → X of proper normal curves to be étale above a point P of X if
there is an affine open subset U ⊂ X containing P such that the finite morphism
of affine curves φ−1 (U ) → U induced by φ is étale above P . Lemma 4.5.2 implies
that this definition does not depend on the choice of U . If φ is étale above all P ,
we say that φ is a finite étale morphism.
Using this definition of étaleness we can extend the above theory from the affine
to the proper case by choosing affine open coverings of proper normal curves. In
126 Chapter 4. Fundamental Groups of Algebraic Curves
Proposition 4.5.13 Let X be an integral proper normal curve over C with func-
tion field K. Both of the first two categories below are anti-equivalent to the third
one:
3. Finite extensions of K.
a) If a finite subextension L|K comes from a curve étale over U , then so does
every subfield L ⊃ L′ ⊃ K.
Definition 4.6.3 In the situation of the proposition we define the algebraic fun-
damental group π1 (U ) of U to be the Galois group Gal (KU |K).
Theorem 4.6.4 Let X be an integral proper normal curve over a perfect field k,
and let U ⊂ X be a nonempty open subset. The category of proper normal curves
Y equipped with a finite separable morphism φ : Y → X étale over U is equivalent
to the category of finite continuous left π1 (U )-sets.
128 Chapter 4. Fundamental Groups of Algebraic Curves
Corollary 4.6.5 The category of normal affine curves V equipped with a finite
étale morphism V → U is equivalent to the category of finite continuous left π1 (U )-
sets.
Remark 4.6.6 In the situation of the above corollary let A e be the integral closure
of A := O(U ) in KU . For a finite subextension K(V ) of KU |K(X) coming from
a finite étale cover V → U we have O(V ) = A e ∩ K(V ), and for each maximal
e
ideal M ⊂ A the intersection M ∩ O(V ) defines a closed point of O(V ). Although
the k-algebra A e is not finitely generated, it is the union of the finitely generated
k-algebras O(V ), and the set U e of its maximal ideals may be identified with the
inverse limit of the natural inverse system formed by the closed points of each V .
We may even equip U e with the inverse limit topology, and view it as an affine ‘pro-
algebraic curve’ by defining a sheaf of rings OUe on it in the usual way: localizing
Ae by a maximal ideal Q e⊂A e we get the local ring O e e of the pro-point Q, e and
U ,Q
for an open subset Ve ⊂ U e we let O e (Ve ) be the intersection of the rings O e e for
U U ,Q
Qe ∈ Ve . We obtain a ‘pro-étale cover’ of U which is the algebraic analogue of the
universal cover in topology. Note that it carries a natural action by π1 (U ).
If we embed U in a proper normal curve X, we may perform a similar con-
struction for X by taking an affine open cover and considering the normalizations
of its elements in finite subextensions of KU |K. We obtain a ringed space (X, e O e)
X
which is a ‘profinite branched cover’ of X ‘pro-étale over U ’. It carries a natural
action of π1 (U ), but it also has ‘pro-points at infinity’ lying above points in X \ U .
The action of π1 (U ) on the latter captures a lot of information about U .
In the case k = C our discussion was completely parallel with the theory for
Riemann surfaces discussed in the previous chapter. This yields the following
important structure theorem for the algebraic fundamental group.
Theorem 4.6.7 Let X be an integral proper normal curve over C, and let U ⊂ X
be an open subset. The algebraic fundamental group π1 (U ) is isomorphic to the
4.6 The Algebraic Fundamental Group 129
where n is the number of points of X lying outside U , and g is the genus of the
compact Riemann surface associated with X.
Remark 4.6.8 Let γi be one of the above generators, and let G be a finite quotient
of π1 (U ) corresponding to a finite Galois branched cover Y → X. By Proposition
3.4.5 γi generates the cyclic stabilizer of a point Qi of Y lying above a point
Pi ∈ X \ U . If we make G vary among the finite quotients of π1 (U ), we obtain
a coherent system of points Qi which define a point Q e of the profinite branched
e
cover X of Remark 4.6.6 that lies above Pi . By construction, its stabilizer IQe
under the action of π1 (U ) is the procyclic subgroup generated by γi . In particular,
b
it is isomorphic to Z.
A surprising fact is that the above presentation for π1 (U ) holds over an ar-
bitrary algebraically closed field of characteristic 0. To derive it we first have to
discuss base change for proper normal curves.
We can now state the following nontrivial theorem whose proof we leave to the
next chapter (see Remark 5.7.8).
2. The same theorems show that π1 (P1k \ {0, ∞}) ∼ = Z.b Thus for each n > 0
there is a unique isomorphism class of finite Galois covers of P1k with group
Z/nZ that are étale outside 0 and ∞. Such a cover is given by the normal-
ization of A1k in the cyclic Galois extension of k(t) defined by the equation
xn = t.
3. The group π1 (P1k \ {0, 1, ∞}) is the free profinite group on two generators.
Thus every finite group that may be generated by two elements is the Galois
group of a finite Galois cover of P1k étale outside 0, 1 and ∞. It is known from
the classification of finite simple groups that all of them can be generated
by two elements – hence all of them arise as quotients of π1 (P1k \ {0, 1, ∞}).
Ks of the function fields of all base changes XL , with L|k finite. The morphisms
XL → X are finite étale for all L|k by the same argument as in Example 4.5.8.
It follows that K k̄ is contained in the subfield KU ⊂ Ks of Proposition 4.6.1
for all open subsets U ⊂ X. By construction there is a canonical isomorphism
Gal (K(XL )|k(X)) ∼ = Gal (L|k) for each L, where K(XL ) is the function field of
XL , whence an isomorphism Gal (K k̄|K) ∼ = Gal (k̄|k). In conclusion, Gal (k̄|k)
arises as a quotient of π1 (U ) for all open U ⊂ X.
Proof: By the above discussion it remains to identify the kernel of the map
π1 (U ) → Gal (K k̄|K) with π1 (Uk̄ ). A finite quotient G of the latter group corre-
sponds to a finite Galois extension K0 of K k̄ that is the function field of a finite
Galois branched cover Y0 → Xk̄ étale over Uk̄ . Let f ∈ (K k̄)[t] be a minimal poly-
nomial for this field extension. We may find a finite extension L|k contained in k̄ so
that the coefficients of f lie in KL and the finite extension L0 |KL defined by f is
Galois with group G. By construction we have L0 k̄ = K0 , and moreover L0 is the
function field of a finite branched cover Y → XL so that Yk̄ ∼ = Y0 . It then follows
from the definition of étaleness that Y must be étale over UL . As XL is finite étale
above X, the composite map Y → XL → X realizes Y as a finite branched cover
of X étale above U . Therefore L0 ⊂ KU , where KU is as in Proposition 4.6.1.
Moreover, we have L0 ∩ K k̄ = KL, which allows us to identify G = Gal (L0 |KL)
with a finite quotient of ker(π1 (U ) → Gal (K k̄|K)). On the other hand, as L0 was
shown to be a subfield of KU , so is the composite L0 k̄ = K0 , and therefore by
making G vary among the finite quotients of π1 (Uk̄ ) we see that KUk̄ ⊂ KU . It
follows that there is a surjection from Gal (KU |K k̄) = ker(π1 (U ) → Gal (K k̄|K))
onto Gal (KUk̄ |K k̄) = π1 (Uk̄ ). We have just seen that for each finite quotient G
of the latter we may find a finite quotient of the former mapping isomorphically
onto G via the above surjection, which shows that the map is an isomorphism.
1→N →G→Γ→1
Applying the above to the exact sequence of the proposition we obtain a con-
tinuous homomorphism
The group π1 (Uk̄ ) is called the geometric fundamental group of U , and ρU the outer
Galois action on the geometric fundamental group.
e introduced in Remark
We now investigate the action of π1 (U ) on the space X
e e
4.6.6 above. Let Q be a pro-point of X lying above a point P of X, and DQe its
e = O e e /QO
stabilizer in π1 (U ). The residue field κ(Q) e e e is an algebraic closure
X,Q X,Q
of κ(P ) = OX,P /P OX,P . By the same argument as before the statement of Fact
4.1.3, we have a homomorphism DQe → Gal (κ(P )|κ(P )), and one sees using Fact
e
4.1.3 (3) that it is surjective. Its kernel I e is called the inertia group at Q.
Q
By the previous proposition (and its proof) we may view X e as a profinite Galois
branched cover of Xk̄ with Galois group π1 (Uk̄ ), a normal subgroup in π1 (U ).
Proof: The subgroups in question consist of those elements of the stabilizer DQe
that are in the kernel of the natural projection DQe → Gal (k̄|k).
e be a pro-point of U
Proof: Let Q e lying above a k-rational point of U , and Q̄ its
image in Uk̄ . By definition of π1 (Uk̄ ) it must have trivial stabilizer in π1 (Uk̄ ). The
lemma then implies that IQe is trivial, so DQe ⊂ π1 (U ) maps isomorphically onto
Gal (k̄|k) by the projection π1 (U ) → Gal (k̄|k), whence the required splitting.
Example 4.7.4 Let us consider the case U = P1k \ {0, ∞}. We have seen that
π1 (Uk̄ ) ∼
=Zb is commutative, hence we have a true action of Gal (k̄|k) on π1 (U ), not
k̄
4.7 The Outer Galois Action 133
just an outer action. For each n > 0 the quotient π1 (Uk̄ )/nπ1 (Uk̄ ) ∼
= Z/nZ can be
identified with the Galois group of the extension Kn |k̄(t) defined by the equation
xn −t by Example 4.6.12 (2). Moreover, the action of Gal (k̄|k) on π1 (Uk̄ )/nπ1 (Uk̄ )
is the one coming from the extension of profinite groups
Remark 4.7.5 In the situation of Lemma 4.7.2 the action of Gal (k̄|k) on IQe ∼
=Zb
coming from the extension
is also given by the cyclotomic character. Indeed, each degree n finite extension
Kn |K k̄ contained in the fixed field of IQe is given by an n-th root of a local param-
eter at the unique point Q of Xk̄ above P , by Kummer theory and the fact that
the ramification index at the unique point of the curve corresponding to Kn lying
above Q must be n. This being said, the same argument as above applies.
Theorem 4.7.6 (Belyi) Let X be an integral proper normal curve defined over
an algebraically closed field k of characteristic 0. There exists a morphism X → P1k
étale over P1k \ {0, 1, ∞} if and only if X can be defined over Q.
Here the condition that X may be defined over Q means that there exists a
curve X0 defined over Q such that X comes from X0 by base change from Q to k.
134 Chapter 4. Fundamental Groups of Algebraic Curves
Proof: The ‘only if’ part follows from Theorem 4.6.10 (to be proven in the next
chapter). Belyi proved the ‘if’ part as follows. In any case there is a morphism
p : X → P1 defined over Q and étale above the complement of a finite set S of
closed points. The idea is to compose p with suitable morphisms P1 → P1 that
reduce the size of S.
In a first step, we reduce to the case where S consists of Q-rational points. For
this, let P be a point for which the degree n = [κ(P ) : Q] is maximal among the
points in S. Choose a minimal polynomial f ∈ Q(t) for a generator of the extension
κ(P )|Q and consider the map φf : P1 → P1 attached to f . Since f has coefficients
in Q, this map is defined over Q, and as it is given by a polynomial, it restricts to
a map A1 → A1 that we denote in the same way. By Remark 4.5.5 the map φf
is étale outside the φf (Sf ), where Sf is the set of points Q ∈ AQ 1 with f ′ (Q) = 0.
Therefore the composite φf ◦ p is étale outside the set S ′ = φf (S) ∪ {∞} ∪ φf (Sf ).
Now ∞ has degree 1 over Q; the points of Sf , and hence of φf (Sf ) have degree at
most n − 1; finally, those in φf (S) have degree at most n. But since φf (P ) = 0,
there are strictly less points of degree exactly n in S ′ than in S. Replacing p by
φf ◦ p and S by S ′ we may continue this procedure until we arrive at n = 1.
So assume all points in S are Q-rational. If S consists of at most three points,
we are done by composing with an automorphism of P1 ; otherwise we may assume
S contains 0, 1, ∞ and at least one more Q-rational point α. The idea again is
to compose p with a map φf : P1 → P1 associated with a well-chosen rational
function f . This time we seek f in the form xA (x − 1)B with some nonzero
integers A, B. Outside 0, 1 and ∞ it restricts to a morphism of affine curves
A1Q \{0, 1} → A1Q corresponding to the homomorphism Q[x] → Q[x][(x(x−1))−1 ]
sending x to f . As above, φf ◦p will be étale outside φf (S) together with the images
of those points in A1 \ {0, 1} where the derivative f ′ vanishes. These are given by
the equation AxA−1 (x − 1)B + BxA (x − 1)B−1 = 0, or else A(x − 1) + Bx = 0.
We therefore have only one such point, namely x = A/(A + B). So if we choose A
and B so that α = A/(A + B), then φf ◦ p will be étale everywhere outside φf (S).
But φf (S) contains strictly less points than S, because φf ({0, 1, ∞}) ⊂ {0, ∞}.
We may then continue the procedure until φf (S) has at most 3 elements.
Proof: We use the shorthands U for P1Q \ {0, 1, ∞} and U for UQ . Assume
that ρU has a nontrivial kernel, fixing a (possibly infinite) extension L|Q. The
representation ρUL : Gal (L) → Out (π1 (U ))) is trivial. Recall that ρUL comes
from the short exact sequence
via the conjugation action of π1 (UL ) on π1 (U ). The triviality of ρUL means that
every automorphism of π1 (U ) induced by conjugating with an element x ∈ π1 (UL )
equals the conjugation automorphism by an element y ∈ π1 (U ). This implies that
y −1 x is in the centralizer C of π1 (U ) in π1 (UL ), and therefore the latter group is
generated by C and π1 (U ). But by Example 4.6.12 (3) the group π1 (U ) is a free
profinite group on two generators, and as such has trivial center. Hence C and
π1 (U ) have trivial intersection, which implies that π1 (UL ) is actually their direct
product. Whence a quotient map π1 (UL ) → π1 (U ) which is a retraction of the
natural inclusion. Considering finite continuous π1 (U )-sets and applying Theorem
4.6.4 we conclude that every finite étale cover of U comes by base change from a
cover of UL . By Belyi’s theorem this then means that every integral proper normal
curve defined over Q̄ can in fact be defined over L. But there are counter-examples
to this latter assertion; see the facts below.
4A3
j(E) := 1728 ∈L
4A3 + 27B 2
Remarks 4.7.9
136 Chapter 4. Fundamental Groups of Algebraic Curves
1. The significance of Theorem 4.7.7 lies in the fact that it embeds the group
Gal (Q|Q), which is of arithmetic nature, in the outer automorphism group
of a group coming from topology. It is the starting point of Grothendieck’s
fascinating theory of dessins d’enfants; see [85] for a comprehensive intro-
duction.
Fact 4.8.2 Consider a finite regular Galois extension K|Q(t) with Galois group
G. Let xm + am−1 xm−1 + · · · + a0 be a minimal polynomial for this extension,
with ai ∈ Q(t). There exist infinitely many α ∈ Q such that none of the ai has
denominator vanishing at α, and xm + am−1 (α)xm−1 + · · · + a0 (α) ∈ Q[x] defines
a Galois extension of Q with group G.
This is a somewhat sharpened form of Hilbert’s Irreducibility Theorem. (The
original form only states that there are infinitely many α for which the polynomial
xm + am−1 (α)xm−1 + · · · + a0 (α) remains irreducible.) For proofs, see e.g. Serre’s
books [91] or [93]. They show that the α can in fact be chosen to be integers, or
even prime numbers.
Problem 4.8.1 is largely open at the present day and is the subject of intense
research. Towards the end of the 1970’s Belyi, Fried, Matzat and later Thompson
independently developed a method based on the theory of the algebraic fundamen-
tal group that yields a positive solution for many of the finite simple groups. We
now explain the basic idea of the construction, relying largely on the exposition of
Serre in [93].
The starting point is that, as we have seen in Corollary 3.4.4, every finite group
G occurs as a Galois group over C(t). More precisely, we have shown that if G
4.8 Application to the Inverse Galois Problem 137
Lemma 4.8.3 In the above situation let S ⊂ Hom(N, G) be a subset stable by the
actions of G and Γ, such that moreover G acts freely and transitively on S. Then
every φ ∈ S extends to a continuous homomorphism φ̃ : Γ → G.
We would like to apply the lemma in the situation where N = π(n) and
Γ = Π(n), so we have to specify a subset S ⊂ Hom(π(n), G) with the required
138 Chapter 4. Fundamental Groups of Algebraic Curves
For rational conjugacy classes the stability of the set S by the action of Π(n)
boils down to:
Proof: By Remark 4.6.8 and Lemma 4.7.2 each γi topologically generates the
inertia subgroup IQei ⊂ π(n) of a point Q e i of the profinite branched cover Pg 1 of
C
Remark 4.6.6 above the point Pi . By definition of inertia subgroups, for every
σ ∈ Π(n) the conjugate σγi σ −1 lies in the inertia subgroup Iσ(Qei ) of another point
σ(Qe i ) above Pi . Since Pi is rational over Q, there is a unique point Qi of X lying
k̄
above Pi . Hence both IQei and Iσ(Qei ) are stabilizers of points above Qi in π(n), and
as such they are conjugate in π(n) as well. Thus φ(IQei ) and φ(Iσ(Qei ) ) are conjugate
cyclic subgroups in G, with respective generators φ(γi ) and φ(σγi σ −1 ). Therefore
there is m ∈ Z prime to the order of G and g ∈ G with φ(σγi σ −1 ) = gφ(γi )m g −1 .
As Ci is a rational conjugacy class, this shows that φ(σγi σ −1 ) ∈ Ci , as required.
We can summarize the result of the above discussion in the following theorem.
4.8 Application to the Inverse Galois Problem 139
Theorem 4.8.7 Let G be a finite group with trivial center, and assume there
exists a rigid system C1 , . . . , Cn of rational conjugacy classes in G. For each n-
tuple P1 , . . . , Pn of Q-rational points on P1Q there is a surjection
π1 (P1Q \ {P1 , . . . , Pn }) ։ G
such that moreover the image of each canonical generator γi lies in Ci . In partic-
ular, G is the Galois group of a Galois extension of Q(t) regular over Q.
Remark 4.8.8 The role played by rationality in the above proof can be made
more precise as follows. Denoting by N the order of G, there is a natural action
of (Z/N Z)× on G (as a set) via (m, γ) 7→ γ m ; this also induces an action of
(Z/N Z)× on the set of conjugacy classes of G. Rationality of a class Ci then means
that Ci is preserved by the action of (Z/N Z)× . Now observe that (Z/N Z)× ∼ =
Gal (Q(µN )|Q), where as usual µN denotes the group of N -th roots of unity.
Remark 4.7.5 then implies that the integer m at the end of the proof of Lemma
4.8.6 can be recovered as the image of σ by the composite map
χ b×
Π(n) → Gal (Q|Q) → Z ։ (Z/N Z)× ,
Example 4.8.9 The criterion of Theorem 4.8.7 is satisfied for many of the spo-
radic finite simple groups. For instance, Thompson has verified that the Monster
has a rigid system of three rational conjugacy classes of orders 2, 3 and 29, re-
spectively. Such a verification is of course far from trivial and often relies on the
classification of finite simple groups. Concerning Thompson’s theorem and further
results about sporadic groups, see Section II.9 of [56].
We now present an example where the criterion of Theorem 4.8.7 does not
apply directly but the variant of Remark 4.8.8 does.
Example 4.8.10 Let p be an odd prime such that 2 is not a square modulo p. We
construct a rigid triple of conjugacy classes in the finite simple group PSL2 (Fp ).
140 Chapter 4. Fundamental Groups of Algebraic Curves
Each element of order p in PSL2 (Fp ) comes from a matrix in SL2 (Fp ) having an
eigenvalue 1, so since it has determinant 1, its upper triangular form is
" #
1 a
Ea :=
0 1
for some a ∈ F× p . It follows that order p elements in PSL2 (Fp ) fall into two
conjugacy classes pA and pB, depending on whether a is a square in Fp or not.
Another calculation shows that the order 2 elements form a single conjugacy class
2A. We contend that (2A, pA, pB) is a rigid triple of conjugacy classes in PSL2 (Fp ).
The matrices
" # " # " #
1 −1 1 1 1 0
M1 = , M2 = , M3 =
2 −1 0 1 −2 1
represent elements of 2A, pA, pB, respectively, the last one because it is conjugate
to E2 and 2 is not a square in Fp by assumption. Their product is the identity
matrix, and the pair (M2 , M3 ) generates SL2 (Fp ), because their powers are exactly
the 2 × 2 elementary matrices (see [48], Chapter XIII, Lemma 8.1). To conclude
it suffices to show that given a triple (N1 , N2 , N3 ) of matrices representing a gen-
erating triple from 2A × pA × pB, the pair (N2 , N3 ) is conjugate in SL2 (Fp ) to
(M2 , M3 ). For this notice that both N2 and N3 have a 1-dimensional eigenspace
corresponding to the eigenvalue 1, and they are not the same since otherwise the
Ni would not generate. Hence we may choose a basis of F2p consisting of an eigen-
vector of N2 and an eigenvector of N3 . Passing to this basis the matrices become
" # " #
1 a 1 0
N2′ = , N3′ = .
0 1 b 1
Many of the finite simple groups have been realized as Galois groups over Q(t)
by a variant of the rigidity method like in the above example. For the state of the
art in 1998, see the book of Malle and Matzat [56].
4.9 A Survey of Advanced Results 141
For k = C we have seen this in Theorem 4.6.7, and the case of a general
k of characteristic 0 follows using Theorem 4.6.10. The main contribution of
Grothendieck was in positive characteristic; we shall discuss it in more detail in
Theorem 5.7.13 and Remark 5.7.16 (4) of the next chapter.
Remark 4.9.2 It is an interesting question whether one can prove Theorem 4.9.1
without recourse to transcendental techniques. It would be enough to show that
the finite quotients are exactly the finite groups that can be generated by 2g + n
elements satisfying a relation as above. Indeed, it can be proven in a purely
′
algebraic way that for fixed N > 0 the group π1 (X)(p ) has only finitely many
quotients of order N (Lang–Serre [51], Abhyankar [1]). But a profinite group
having this property is determined up to isomorphism by its finite quotients (see
Lemma 5.7.7 in the next chapter).
Results in this direction are quite scarce. Borne and Emsalem have observed
in a recent preprint [7] that the methods of Serre [92] can be used to determine
142 Chapter 4. Fundamental Groups of Algebraic Curves
′
the finite solvable quotients of π1 (P1 \ {0, 1, ∞})(p ) , and hence to describe the
′
maximal pro-solvable quotient of π1 (P1 \ {0, 1, ∞})(p ) in a purely algebraic way;
see also [54] for further results in this direction. Also, Wingberg used methods of
Galois cohomology and class field theory to show that for a normal curve U defined
over Fp the maximal pro-ℓ quotient of π1 (U ) has a presentation as in Theorem
4.9.1 for all primes ℓ 6= p (see [70], Theorem 10.1.2).
Of course, the theorem is only interesting for those G that actually arise as the
fundamental group of some curve as above.
Now that we know the maximal prime-to-p quotient of the fundamental group,
we may ask for the structure of its maximal pro-p quotient in characteristic p > 0.
The answer is radically different in the proper and the affine cases.
In order to be able to state the result for proper curves, we need to recall
some facts from algebraic geometry. With a proper normal curve over a field k
one associates its Jacobian variety, which is an abelian variety, i.e. a projective
group variety over k (these are known to be commutative). The p-rank of an
abelian variety A over an algebraically closed field k of characteristic p > 0 is the
dimension of the Fp -vector space given by the kernel of the multiplication-by-p
map on the k-points of A. It is a nonnegative integer bounded by dim A.
Theorem 4.9.4 For an integral normal curve X over an algebraically closed field
k of characteristic p > 0 the group π1 (X)(p) is a free pro-p group. It is of finite
rank equal to the p-rank of the Jacobian variety of X if X is proper, and of infinite
rank equal to the cardinality of k if X is affine.
Here one may define a free pro-p group of rank r as the maximal pro-p quo-
tient of the free profinite group of rank r. The proper case was first proven by
Shafarevich in [94] using the classical theory of Hasse-Witt matrices. Nowadays
the theorem can be quickly derived using methods of étale cohomology; see the
chapters by Gille and Bouw in [8] or [10], Theorem 1.9 (which only deals with the
proper case).
Observe that Theorems 4.9.1 and 4.9.4 do not elucidate completely the struc-
ture of the fundamental group of an integral normal curve over an algebraically
4.9 A Survey of Advanced Results 143
closed field of positive characteristic; this is still unknown at the present day. The
theorems give, however, a good description of its maximal abelian quotient: this
group is the direct sum of its maximal prime-to-p and pro-p quotients, and hence
the previous two theorems together suffice to describe it.
By the previous theorem every finite p-group arises as a quotient of the funda-
mental group of a normal affine curve over an algebraically closed field of character-
istic p > 0. More generally, we have the following important theorem, previously
known as Abhyankar’s Conjecture.
Raynaud proved the crucial case X = A1k in his paper [79]. He constructed
Galois covers of the affine line with prescribed group G satisfying the condition
of the theorem by combining three different methods. The first came from an
earlier paper [92] by Serre that handled the case of solvable G, and contained
an inductive statement for extensions of groups satisfying the conclusion of the
theorem by solvable groups. The second method used a patching technique like
the one we encountered in Chapter 3, but in the setting of rigid analytic geometry.
Finally, the third method exploited the theory of semi-stable curves. Soon after
Raynaud’s work Harbater reduced the general case to the case of the affine line in
[32]; another proof for this reduction was given by Pop [76]. See also the chapters
by Chambert-Loir and Saı̈di in [8].
In the remainder of this section k will denote a perfect field, k̄ a fixed algebraic
closure, and X an integral proper normal curve over k. For an open subcurve
U ⊂ X we introduced after Proposition 4.7.1 an outer Galois representation
The theorem was proven by Tamagawa [102] in the important special case
when k is a number field and U is affine. He first proved a result over finite fields
(see Theorem 4.9.8 below), and then used a specialization argument to obtain the
statement in the number field case. Mochizuki first extended Tamagawa’s theorem
to proper curves over number fields in [61] using techniques from logarithmic ge-
ometry, and then in [62] proved the general theorem stated above in a completely
different way, exploiting p-adic Hodge theory. For a survey of this second method,
see the Bourbaki lecture [22] by Faltings.
In fact, Mochizuki proved much more. In order to formulate the general result
of his paper [62] we need to set up some notation. First, for U and p as in the
theorem above consider the maximal pro-p-quotient π1 (Uk̄ )(p) of the geometric
fundamental group π1 (Uk̄ ). The kernel N of the projection π1 (Uk̄ ) → π1 (Uk̄ )(p) is
preserved by all automorphisms of π1 (Uk̄ ), hence it is normal in π1 (U ). We denote
(p)
the quotient π1 (U )/N by π1 (U ); it is an extension of Gal (k̄|k) by the pro-p-group
π1 (Uk̄ )(p) .
Given profinite groups G1 , G2 equipped with continuous surjections Gi ։ G
onto a third profinite group G, we denote by Hom∗G (G1 , G2 ) the set of continuous
homomorphisms G1 → G2 that are compatible with the projections onto G up to
an inner automorphism of G. Composition with inner automorphisms of G2 equips
this set with a left action by G2 ; we denote the quotient by HomOut G (G1 , G2 ). We
shall see in the next chapter (Remark 5.5.3 and the subsequent discussion) that a
morphism φ : U → U ′ of curves over a perfect field k induces a homomorphism on
fundamental groups that is well-defined up to an inner automorphism of π1 (U ′ ),
(p) (p)
whence a well-defined element in HomOut (π (U ), π1 (U ′ )). If moreover φ
Gal (k̄|k) 1
is dominating, i.e. has Zariski dense image, then the induced homomorphism
of fundamental groups is known to have open image. Accordingly, given curves
U, U ′ over a perfect field k, denote by Homdom k (U, U ′ ) the set of k-morphisms
Out, open (p) (p)
U → U ′ with dense image, and by HomGal (k̄|k) (π1 (U ), π1 (U ′ )) the subset of
(p) (p)
HomOut (π (U ), π1 (U ′ )) coming from homomorphisms with open image. We
Gal (k̄|k) 1
may now state:
is a bijection.
4.9 A Survey of Advanced Results 145
As mentioned above, there are also results for curves over finite fields. Here
they are:
Isom(U, U ′ ) → Isom(π1 (U ), π1 (U ′ ))
is a bijection.
Here on the left hand side we have the set of isomorphisms between U and
U′ as schemes, regardless of the F-structure, and on the right hand side the set of
continuous isomorphisms between π1 (U ) and π1 (U ′ ). Note that here we are not
considering Gal (k̄|k)-isomorphisms between π1 (UF ) and π1 (UF′ ) as before; in fact,
over a finite field F the outer Galois action on π1 (UF ) is ‘encoded’ in π1 (U ).
The theorem was proven in the affine case by Tamagawa in [102] using class
field theory, and in the proper case by Mochizuki [63] as a consequence of his theory
of cuspidalizations. Using Tamagawa’s affine result and a specialization technique,
Stix [98] proved an analogue of Theorem 4.9.6 for non-constant hyperbolic curves
over a finitely generated field of positive characteristic.
We conclude this survey by stating the main open question in the area, the
famous Section Conjecture of Grothendieck. Let k be a perfect field, and X an
integral proper normal curve over k. Recall from Proposition 4.7.1 that there is an
exact sequence
p
1 → π1 (Xk̄ ) → π1 (X) → Gal (k̄|k) → 1
X(k) → HomOut
Gal (k̄|k) (Gal (k̄|k), π1 (X)).
In other words, each conjugacy class of sections should come from a unique
k-rational point of X. The proof of injectivity is not hard and was already known
to Grothendieck, but as of today, the issue of surjectivity is widely open.
146 Chapter 4. Fundamental Groups of Algebraic Curves
Exercises
(a) Show that the normalization of A1k in the finite extension of k(t) defined
by xp −x−f is an étale Galois cover of A1k whose Galois group is Z/pZ;
it is called an Artin–Schreier cover of A1k .
(b) Conclude that the maximal abelian quotient of exponent p of π1 (A1k )
is an infinite dimensional Fp -vector space of dimension equal to the
cardinality of k.
5. Give an example showing that the converse statement in Lemma 4.5.10 fails
if one does not assume that Y is normal.
[Hint: Take X = A1k , Y = V (x22 − x31 + x21 ) ⊂ A2k , Z the normalization of Y
and φ : (x1 , x2 ) 7→ x1 .]
Exercises 147
(a) Verify that the morphism φf : X → P1k defined in Section 4.4 satisfies
φf (P ) = f (P ) for all closed points P ∈ X where f ∈ OX,P , and
φf (P ) = ∞ otherwise.
(b) Show that φf is étale exactly above those points whose preimages sat-
isfy (f ′ /f )(P ) 6= 0.
8. Prove that when G is a finite group with trivial center having a rigid system
(C1 , . . . , Cn ) of not necessarily rational conjugacy classes, the conclusion of
Theorem 4.8.7 holds with Q replaced by Q(µ), where µ is the group of all
complex roots of unity.
9. Verify that for n ≥ 3 the three conjugacy classes in the symmetric group Sn
consisting of 2–cycles, (n − 1)–cycles and n–cycles, respectively, form a rigid
system of rational conjugacy classes. [Remark: It then follows from Theorem
4.8.7 that Sn occurs as a Galois group over Q(t). For a more elementary
proof, see [93], Section 4.4.]
10. Let G be a finite group with trivial center possessing a rigid system of
three rational conjugacy classes, and let H ⊂ G be a subgroup of index 2.
(Example: G = Sn , H = An , n ≥ 3.) Show there exists a Galois extension
of Q(t) regular over Q with group H.
[Hint: Apply Theorem 4.8.7 to get a Galois extension K|Q(t) with group
G corresponding to a morphism étale outside three Q-rational points, and
set L := K H . Use the Riemann–Hurwitz formula (Corollary 3.6.12) to show
that the morphism X → P1Q corresponding to L|Q(t) is étale outside exactly
two points, and conclude that L ∼
= Q(u) for some u ∈ K(X).]
(a) Let s be a section of the natural projection π1 (X) → Gal (k̄|k), and
denote by D the image of s in π1 (X); it has a natural action on KX .
Prove the equivalence of the following statements about a k-rational
point P of X:
148 Chapter 4. Fundamental Groups of Algebraic Curves
Though the theory of the previous chapter is sufficient for many applications,
a genuine understanding of the algebraic fundamental group only comes from
Grothendieck’s definition of the fundamental group for schemes. His theory en-
compasses the classification of finite covers of complex algebraic varieties of any
dimension, Galois theory for extensions of arbitrary fields and even notions coming
from arithmetic such as specialization modulo a prime. Moreover, it is completely
parallel to the topological situation and clarifies the role of base points and univer-
sal covers – these have been somewhat swept under the carpet in the last chapter.
In his original account in [29] Grothendieck adopted an axiomatic viewpoint and
presented his constructions within the context of ‘Galois categories’. Here we
choose a more direct approach, emphasizing the parallelism with topology. The
background from algebraic geometry that is necessary for the basic constructions
will be summarized in the first section. However, the proofs of some of the deeper
results discussed towards the end of the chapter will require more refined tech-
niques.
Definition 5.1.1 The prime spectrum Spec (A) of A is the topological space
whose points are prime ideals of A and a basis of open sets is given by the sets
149
150 Chapter 5. Fundamental Groups of Schemes
for all f ∈ A.
For this definition to be correct, one must verify that the system of the sets
D(f ) is closed under finite intersections. This holds because it follows from the
definition that D(f ) ∩ D(g) = D(f g) for all f, g ∈ A.
We next define a sheaf of rings on X = Spec (A) as follows. Recall that for a
nonzero element f ∈ A the notation Af stands for the ring of fractions of A with
denominators in the multiplicatively closed subset {1, f, f 2 , . . . } ⊂ A. See ([2],
Chapter 3) for the definition of rings of fractions in the case when A may contain
zero-divisors.
Proof: See [64], Section II.1 (or [34], Proposition II.2.2 for a direct construction).
Definition 5.1.3 The ringed space (X, OX ) defined above is the affine scheme
associated with A. The sheaf OX is called its structure sheaf.
The structure sheaf OX has the special property that its stalk at a point P is
a local ring, namely the localization AP . This follows from the fact that AP is the
direct limit of the fraction rings Af for all f ∈/ P . A ringed space satisfying the
above property is called a locally ringed space.
Recall that a morphism (X, F) → (Y, G) of ringed spaces is a pair (φ, φ♯ ), where
φ : X → Y is a continuous map and φ♯ : G → φ∗ F is a morphism of sheaves.
Given a point P ∈ X, there is an induced map φP : Gφ(P ) → FP on stalks given
by passing to the direct limit of the maps G(U ) → φ∗ F(U ) over all open subsets
U containing φ(P ). If (X, F) and (Y, G) are locally ringed spaces, we say that φ is
a local homomorphism if the preimage of the maximal ideal of FP is the maximal
ideal of Gφ(P ) . (In the case of proper normal curves considered in Section 4.4 this
condition is automatically fulfilled by the morphisms considered there.)
The category of locally ringed spaces is defined to be the category whose objects
are locally ringed spaces and whose morphisms are local homomorphisms.
We define the category of schemes (resp. affine schemes) as the full subcategory
of that of locally ringed spaces spanned by schemes (resp. affine schemes). This
means that a morphism of schemes is a morphism of locally ringed spaces.
Given a ring homomorphism φ : A → B, there is a canonically defined mor-
phism Spec (φ) : Spec (B) → Spec (A) of schemes ([34], II.2.3 (b) or [64], II.2).
5.1 The Vocabulary of Schemes 151
The continuous map Spec (φ) sends a prime ideal P ∈ Spec (B) to φ−1 (P ), and
the morphism Spec (φ)♯ of sheaves is the unique one that is given over a basic open
set D(f ) ⊂ Spec (A) by the natural ring homomorphism Af → Bφ(f ) . Thus the
rule A 7→ (Spec (A), OSpec (A) ) is a contravariant functor from the category of rings
to that of schemes.
1. If k is a field, then the underlying space of Spec (k) consists of a single point,
and the stalk of the structure sheaf at this point is k.
3. If A is a discrete valuation ring with fraction field K, then X = Spec (A) has
one closed point corresponding to the maximal ideal M , and one non-closed
point corresponding to the ideal (0). There are two open subsets, namely
(0) and X. The rings of sections of OX over these open subsets are K and
A, respectively.
4. The affine scheme Spec (Z) has one closed point corresponding to each prime
number p, and a non-closed point corresponding to (0). The nonempty open
subsets all contain (0) and their complement in Spec (Z) is finite (possibly
empty). The ring of sections of the structure sheaf over an open subset U is
the ring of rational numbers with denominator divisible only by the primes
lying outside U . This example has a direct generalization to affine schemes
of the form Spec (A) with A a Dedekind ring.
We now generalize a few other notions from the previous chapter to schemes.
Definition 5.1.7 A scheme X is called reduced (resp. integral) if for all open
subsets U ⊂ X the ring OX (U ) has no nilpotent elements (resp. no zero-divisors).
Lemma 5.1.8 A scheme X is integral if and only if it is reduced and its underlying
space is irreducible.
There are topological properties of schemes coming from the topology of the
underlying space. Thus for instance we say that a scheme is connected or that it
is quasi-compact (meaning that from each open covering we may extract a finite
subcovering) if the underlying space is. Similarly, properties of local rings define
algebraic restrictions. A scheme X is thus said to be locally Noetherian if the stalks
OX,P of its structure sheaf are Noetherian local rings; if moreover X is integral,
then we say that X is Noetherian. The scheme X is normal if the stalks OX,P are
integrally closed domains, and regular if the OX,P are regular local rings. (Recall
that a Noetherian local ring A with maximal ideal M and residue field κ = A/M
is said to be regular if dimκ M/M 2 = dim A.) A regular scheme is integral if and
only if it is connected.
Next we mention some important examples of morphisms of not necessarily
affine schemes.
Examples 5.1.10
5.1 The Vocabulary of Schemes 153
Proof: See [34], Theorem II.3.3. In the case when X = Spec (A), Y = Spec (B)
and Z = Spec (C) are all affine, then Y ×X Z is Spec (B ⊗A C). The general case
is handled by a patching procedure.
Example 5.1.14 When X = Spec (A) with A a discrete valuation ring, a mor-
phism Y → X has two fibres. One is over the generic point of X called the generic
fibre; it is an open subscheme in X and may be empty. The other fibre is the one
over the closed point; it is closed in X and is usually called the special fibre.
The notion of fibre product also allows us to define the diagonal map ∆ : Y →
Y ×X Y coming from a morphism of schemes Y → X; it is induced by the identity
map of Y in both coordinates. In the affine case X = Spec (A), Y = Spec (B) it
is a closed immersion coming from the surjection B ⊗A B → B induced by the
multiplication map (b1 , b2 ) 7→ b1 b2 . However, this is not always so in the general
case ([34], Example 4.0.1), so we record it as a definition:
Examples 5.1.17
Next a general notion for schemes that generalizes the concept of modules over
a ring.
Examples 5.1.19 Here are two natural situations where OX -modules arise.
156 Chapter 5. Fundamental Groups of Schemes
The next proposition gives a means for constructing OX -modules over affine
schemes out of modules over the ring of global sections.
Proof: The proof is similar to that of Lemma 5.1.2. See [64], Section III.1 or
[34], Proposition II.5.1 for a direct approach.
Remark 5.1.22 It can be shown that for an affine scheme X = Spec (A) the
functor M → M f establishes an equivalence between the category of A-modules
and that of quasi-coherent sheaves on X. See [34], Corollary II.5.5 or [64], III.1,
Corollary to Proposition 1.
We now return to the first example in 5.1.19 and investigate the question of
determining whether a morphism φ : X → Y yields a quasi-coherent sheaf φ∗ OX
on Y . This is not true in general, but imposing restrictions on φ yields sufficient
conditions.
Here is such a condition: a morphism φ : X → Y of schemes is affine if Y
has a covering by affine open subsets Ui = Spec Ai such that for each i the open
subscheme φ−1 (Ui ) of X is affine as well. By definition, finite morphisms (Example
5.1.10 (2)) are affine. The following lemma is then almost tautological for F = OX ,
and the general case can be reduced to it; we leave it as an exercise to the readers.
Remark 5.1.25 Nakayama’s lemma ([48], Chapter X, Lemma 4.3) is a key tool
in the study of coherent sheaves. For instance, it can be used to show that if the
stalk of a coherent sheaf is 0 at a point P , then the sheaf restricts to 0 in an open
neighbourhood of P . Similarly, if the stalk at P is a free OX,P -module, then the
sheaf is locally free in a neighbourhood of P . See the discussion at the end of §III.2
of [64].
Example 5.1.27 Assume that B arises as the quotient of the polynomial ring
A[x1 , . . . , xn ] by an ideal (f1 , . . . , fm ). Then Ω1B|A is the quotient of the free B-
module generated by the dxi modulo the submodule generated by the elements
P
i ∂i fj dxi (1 ≤ j ≤ m), where ∂i denotes the partial derivative with respect to
xi . This follows immediately from the above construction.
It follows from the construction that for an A-algebra A′ one has an isomor-
phism Ω1B⊗A A′ |A′ ∼
= Ω1B|A ⊗A A′ ; this is the base change property of differentials.
Also, for a multiplicative subset S of B one has Ω1BS |A ∼ = Ω1B|A ⊗B BS ; this is the
localization property of differentials. It follows from the latter property that the
quasi-coherent sheaf Ω e 1 on Spec (B) satisfies Ω e 1 (D(f )) = Ω1
B|A B|A Bf |A for all basic
open sets D(f ) ⊂ Spec (B).
This defines the sheaf of relative differential forms for morphisms of affine
schemes. To extend the definition to general morphisms we need an algebraic
lemma. Let A → B be a ring homomorphism, and let I be the kernel of the multi-
plication map m : B ⊗A B → B sending b1 ⊗ b2 to b1 b2 . Then since multiplication
by elements of I is trivial on I/I 2 , there is a natural (B ⊗A B)/I-module on I/I 2 .
This is in fact a B-module structure since (B ⊗A B)/I ∼ = B by surjectivity of m.
Definition 5.1.29 The sheaf of relative differentials ΩX|Y is the OX -module de-
∼
fined by pulling back the O∆(X) -module (I/I 2 ) via the isomorphism X → ∆(X).
Using Example 5.1.27 one shows that for k algebraically closed the condition
on ΩX|Spec (k) is equivalent to saying that for an affine open subset of the form
U = Spec (k[x1 , . . . , xn ]/(f1 , . . . , fm )) the Jacobian determinant of the fi has rank
n − d at all closed points of U ; see [34], Theorem II.8.15. Thus we have defined
an algebraic analogue of the concept of a complex submanifold of Cn . For k
perfect there is also a purely algebraic characterization going back to Zariski: X
is smooth over k if and only if it is a regular scheme (see [34], Theorem II.8.15 for
k algebraically closed; the proof of the general case is similar).
The other application is the differential characterization of finite étale algebras.
Proof: The ‘only if’ part is an easy application of Example 5.1.27 and the prim-
itive element theorem. The ‘if’ part is a bit more difficult; see e.g. [18], Corollary
16.16.
Remarks 5.2.2
1. In view of Remark 5.1.24, to define a finite locally free morphism is the same
as defining a quasi-coherent OX -algebra A that is locally free of finite rank
as an OX -module. Finite étale morphisms correspond to OX -algebras such
that AP ⊗OX,P κ(P ) is a finite étale κ(P )-algebra for all P ∈ X.
3. The image of a finite and locally free morphism φ is both open and closed.
Indeed, it is closed because φ is finite, hence proper, and it is open because
by local freeness φ∗ OX has nonzero stalks over an open subset of S.
Remarks 5.2.3 Here are some properties of finite étale morphisms that are more
or less immediate from the definition.
160 Chapter 5. Fundamental Groups of Schemes
We next check that the definition of finite étale covers given above generalizes
the one for normal curves used in the previous chapter. To see this, note first that
the separability assumption made in Definition 4.5.1 corresponds to étaleness of
the fibre over the generic point. Thus it suffices to prove the following lemma.
Proof: We may assume that X = Spec (B) and S = Spec (A) are affine, and
φ comes from a ring homomorphism λ : A → B. At a point P ∈ S the stalk
of φ∗ OX is the spectrum of the localization Bλ(P ) , which is a finitely generated
AP -module. Choose elements t1 , . . . , tn ∈ BP whose images modulo P BP form a
basis of the κ(P )-vector space BP /P BP . By Nakayama’s lemma ([48], Chapter
X, Lemma 4.3) they generate BP over AP . It then suffices to see that the ti are
linearly independent
P over the fraction field K of AP . If not, there is a nontrivial
relation ai ti = 0 with ai ∈ K. As S is normal of dimension one, the local ring
AP is a discrete valuation ring (Proposition 4.1.9), so P AP is a principal ideal. By
multiplying with a suitable power of a generator of P AP we may assume that all
the ai lie in AP and not all of them are in P AP . But then reducing modulo P we
obtain a nontrivial relation among the ti in BP /P BP , a contradiction.
Remark 5.2.5 There are other important examples of finite étale covers X → S
with S normal of dimension one. For instance, when X = Spec (B) and S =
Spec (A) are affine and the morphism comes from an inclusion A ⊂ B, then A and
B are Dedekind rings and the points in the fibre over P ∈ S correspond to the
factors Pi in the decomposition P B = P1e1 . . . Prer (see Facts 4.1.5). In particular,
XP is étale if and only if all ei are equal to 1.
This is particularly interesting when K ⊂ L are finite extensions of Q, and A
(resp. B) is the integral closure of Z in K (resp. B). The morphism Spec (B) →
Spec (A) is finite (Fact 4.1.4) and surjective (Fact 4.1.1 (4)). The case when it
is a finite étale cover corresponds in classical parlance to an unramified extension
of number fields K ⊂ L. According to a famous theorem of Minkowski (see e.g.
[69], Chapter III, Theorem 2.18) the scheme Spec (Z) has no nontrivial finite étale
covers. This is the arithmetic analogue of the simply connectedness of the complex
affine line.
5.2 Finite Étale Covers of Schemes 161
Example 5.2.6 Let S = Spec (A) and X = Spec (B) be affine, where B =
A[x]/(f ) with a monic polynomial f ∈ A[x] of degree d. As B is freely gen-
erated as an A-module by the images of 1, x, x2 , . . . xd−1 in B, the morphism
φ : Spec (B) → Spec (A) is finite and locally free. If moreover (f, f ′ ) = (1) in
A[x], then φ is finite étale. Indeed, if P ∈ S, then the fibre XP is the spectrum
of B ⊗A κ(P ) ∼= κ(P )[x]/(f¯), where f¯ is the image of f in κ(P )[x]. It has only
simple roots by our assumption on f ′ , so this is a finite étale κ(P )-algebra.
See [59], Proposition I.3.5 for a generalization without the assumption that φ
is finite and locally free.
Proof: To show that (1) ⇔ (2) we may assume that X and S are affine, and
then by the localisation and base change properties of differentials and Nakayama’s
lemma we are reduced to showing that for all P ∈ S the fibre XP is the spectrum of
a finite étale κ(P )-algebra if and only if Ω1XP |Spec (κ(P )) = 0. This is the differential
characterization of finite étale algebras (Proposition 5.1.31).
Next we show (2) ⇒ (3). As φ is finite, hence separated, the diagonal morphism
∆ : X → X ×S X is a closed immersion. It corresponds to a coherent sheaf of
ideals I on X ×S X. The restriction of the quotient I/I 2 to ∆(X) is isomorphic
to Ω1X/S , so it is 0 by assumption (2). It follows using Nakayama’s lemma that the
stalk of I is trivial at all points of X. As I is coherent, we obtain that I = 0 on
an open subset of X ×S X, so ∆(X) is both open and closed in X ×S X.
Finally, for (3) ⇒ (1) let s̄ : Spec (Ω) → S be a geometric point. Taking the
base change of the morphism ∆ : X → X ×S X over S via s̄, we obtain a morphism
∆s̄ from the geometric fibre Xs̄ = Spec (Ω) ×S X to
(Xs̄ ) ×S X = (Spec (Ω) ×S X) ×Spec (Ω) Spec (Ω) ×S X = Xs̄ ×Spec (Ω) Xs̄ .
162 Chapter 5. Fundamental Groups of Schemes
As property (3) behaves well with respect to base change, ∆s̄ is an isomorphism
of Xs̄ onto an open and closed subscheme of Xs̄ ×Spec (Ω) Xs̄ . The geometric fi-
bre Xs̄ is the spectrum of a finite dimensional Ω-algebra, and as such it has
finitely many points, all having residue field Ω because Ω is algebraically closed.
If t̄ : Spec (Ω) → Xs̄ is such a point, then by taking yet another base change, this
time by t̄, we obtain a morphism Spec (Ω) → Xs̄ , which is again an isomorphism
onto an open and closed subscheme of Xs̄ . As Spec (Ω) is connected, this must
be a connected component. We conclude that Xs̄ as a scheme is a finite disjoint
union of points, as required.
Remark 5.2.8 The equivalence (1) ⇔ (2) of the proposition gives a criterion for
checking étaleness in practice. Assume φ : X → S is finite and locally free (as
we have seen, the latter property is automatic for S normal of dimension one),
and moreover each point of S has an affine open neighbourhood U = Spec (A)
such that φ−1 (U ) = Spec (B), with B = A[x1 , . . . , xn ]/(f1 , . . . , fn ) for some monic
polynomials fi . If over all these open subsets the Jacobian determinant det(∂j fi )i,j
maps to a unit in B, then Ω1X|S = 0, and so φ is a finite étale morphism. This is
the Jacobian criterion for étaleness. (The converse is also true; see [59], Corollary
I.3.16.)
Using the previous proposition we now prove that finite étale covers are locally
trivial in an appropriate sense. Call a finite étale cover φ : X → S trivial if X as a
scheme over S is isomorphic to a finite disjoint union of copies of S, and the map
φ restricts to the identity on each component.
Proof: To prove the ‘if’ part, we first show that φ must be finite and locally
free. As ψ is locally free, each point of S is contained in an affine open subset
U = Spec (A) over which ψ restricts to a morphism Spec (C) → Spec (A) with an
A-algebra C that is finitely generated and free as an A-module. If φ restricts to
Spec (B) → Spec (A) over U , then the base change over Spec (B) is of the form
Spec (B ⊗A C) → Spec (B). Here B ⊗A C is a finitely generated free C-module by
assumption, so it is also a finitely generated and free A-module. As on the other
hand it is isomorphic to a finite direct sum of copies of B, this is only possible if
B is finitely generated and free over A.
Next let s̄ : Spec (Ω) → Y be a geometric point of Y ; by composition with ψ
it yields a geometric point of S. The geometric fibres X(s̄◦ψ) and (X ×S Y )s̄ are
isomorphic, and the latter is a finite direct sum of copies of Spec (Ω) by assumption.
As ψ is surjective, this shows that all fibres of φ are étale.
5.2 Finite Étale Covers of Schemes 163
Now to the ‘only if’ part. As S is connected, we see as in topology that all fibres
of φ have the same cardinality n. Following Lenstra [52], we use induction on n,
the case n = 1 being trivial. For n > 1 we consider the base change X ×S X → X.
By part (3) of the previous proposition the diagonal map ∆ induces a section of
this map, and in fact X ×S X is the disjoint union of ∆(X) with some open and
closed subscheme X ′ . As the inclusion map X ′ → X ×S X and the projection
X ×S X → X are both finite and étale (the latter by Remark 5.2.3 (2)), we see
from Remark 5.2.3 (1) that so is their composite X ′ → X. By construction, it
has fibres of cardinality n − 1, so the inductive hypothesis yields a finite, locally
free and surjective morphism ψ ′ : Y → X such that X ′ ×X Y is isomorphic to
the disjoint union of (n − 1) copies of Y . But then (X ×S X) ×X Y ∼ = X ×S Y is
the disjoint union of n copies of Y . It remains to notice that the composite map
ψ := ψ ′ ◦ φ is also finite, locally free and surjective, being the composite of two
such maps.
Remark 5.2.10 In topology covers are characterized by the property that they
become trivial after restricting to sufficiently small open subsets. Notice that the
restriction of a cover Y → X above an open subset U ⊂ X is none but the fibre
product Y ×X U → U in the category of topological spaces. This explains the
analogy of the above proposition with the topological situation.
In fact the proposition says that finite étale covers are locally trivial for the
Grothendieck topology where covering families are given by surjective finite locally
free morphisms. For a discussion of Grothendieck topologies, see e.g. [59] or [109].
We conclude this section with some facts related to special schemes. Among
these normal schemes will play a prominent role, so we first summarize a few
known facts about them.
Facts 5.2.11
1. Recall that we defined a scheme to be normal if its local rings are integrally
closed domains. In the locally Noetherian case there is a useful algebraic
criterion for this:
Serre’s Normality Criterion. A reduced Noetherian local ring is normal
if and only if for each prime ideal P of height 1 the localization AP is a
discrete valuation ring, and each prime divisor of a principal ideal is of
height 1.
Here the height of a prime ideal P is by definition the dimension of the
localization AP . For a proof of the criterion, see [57], Theorem 23.8.
3). In the affine case S = Spec (A) it is the spectrum of the integral closure
of A in L (compare with the proof of Theorem 4.3.10); in the general case it
is obtained by patching normalizations of affine open subschemes together.
1. S is reduced.
2. S is regular.
Proof: In all three cases we may assume X = Spec (B) and S = Spec (A) are
affine.
Assume first S is reduced. Let P be a point of S, which we view as a prime ideal
of A. By assumption the localization AP has no nilpotents, and we shall prove that
the same holds for CP = B⊗A AP . This will suffice, as the local rings of points of X
lying above P are localizations of CP , and φ is surjective. Let P1 , . . . , Pr be the set
of minimal prime ideals of AP . Their intersection
Q is the ideal of nilpotents in AP ,
so by assumption the natural map AP → (AP /Pi ) is injective. But CP is a free
AP -module (because φ isQ locally free), hence by tensoring with CP we again obtain
an injective map CP → (CP /Pi CP ). It thus suffices to show that the C/Pi CP
have no nilpotents. But each Spec (CP /Pi CP ) is finite étale over Spec (AP /Pi )
by Remark 5.2.3 (2), so by replacing AP with AP /Pi and CP with CP /Pi CP we
reduce to the case where AP is an integral domain. Denoting by KP its fraction
field, we obtain that Spec (CP ⊗AP KP ) → Spec (KP ) is a finite étale morphism
(again by Remark 5.2.3 (2)), and that the natural map CP → CP ⊗AP KP induced
by the inclusion AP → KP is injective (again because φ is locally free). But then
CP has no nilpotents because CP ⊗AP KP is a direct product of fields.
Now return to the case where X = Spec (B) and S = Spec (A) are affine. As
φ is a finite morphism, they have the same dimension, and if Q is a prime ideal
of B with Q ∩ A = P , then P and Q must have the same height. Moreover, it
follows from the definition of étaleness that the image of the maximal ideal MP of
OS,P generates the maximal ideal MQ of OX,Q . From this we conclude that the
regularity of S implies the regularity of X, and also that if MP is principal, then
so is MQ . This shows that both conditions of Serre’s normality criterion recalled
above are preserved, whence the proposition in the last two cases.
The strongest result about finite étale covers of a regular scheme is the follow-
ing.
5.3 Galois Theory for Finite Étale Covers 165
Here the codimension of P means the dimension of the local ring OX,P . The
name ‘purity’ comes from the following reformulation: for φ : X → S a finite
surjective morphism of integral schemes with X normal and S regular, the closed
subscheme Z ⊂ S over which φ is not étale (the ‘branch locus’) must be of pure
codimension 1, i.e. each irreducible component of Z must have codimension 1.
The statement is of local nature, therefore it is enough to prove it when X is the
spectrum of a regular local ring. In this case it is a difficult piece of commutative
algebra. See [67], Theorem 41.1 for Nagata’s original proof, and [30], X.3.4 for a
proof by Grothendieck.
Proof: For the first statement, note again that the diagonal morphism X →
X ×S X coming from φ is a closed immersion as φ is separated. In particular,
it is a finite morphism. Taking fibre products over X with Y via ψ we obtain a
morphism Γψ : Y → Y ×S X (the ‘graph’ of ψ) that is again finite. The second
projection p2 : Y ×S X → X is also finite, being the base change of φ ◦ ψ : Y → S
by φ : X → S. Thus p2 ◦ Γψ = ψ is finite.
If φ is moreover étale, then by Proposition 5.2.7 the diagonal morphism X →
X ×S X is an isomorphism of X onto an open and closed subscheme (a union of
connected components) of X ×S X. As such, it is certainly a finite étale morphism,
hence so is Γψ : Y → Y ×S X by Remark 5.2.3 (2). As above, p2 : Y ×S X → X
is finite étale as well, and hence so is p2 ◦ Γψ = ψ by Remark 5.2.3 (1).
Proof: By passing to the fibre product Z ×S X → Z and using the base change
property of étale morphisms (Remark 5.2.3 (2)) we may assume S = Z. Then we
have to prove that if two sections of a finite étale cover X → S of a connected
scheme S coincide at a geometric point, then they are equal. This follows from the
proposition because each such section, being an isomorphism of S onto a connected
component of X, is determined by the image of a geometric point.
Proposition 5.3.6 The ringed space G\X constructed above is a scheme, the
morphism π is affine and surjective, and φ factors as φ = ψ ◦ π with an affine
morphism ψ : G\X → S.
Proof: We may assume, using the affineness assumption on φ, that X = Spec (B)
and S = Spec (A) are affine, and φ comes from a ring homomorphism λ : A → B.
Then it suffices to show that the ringed space G\X is isomorphic to the spectrum
of B G , the ring of G-invariants of B. This will imply the claim including the
assertions on π, because
Q B is integral over B G (as every b ∈ B is a root of the
monic polynomial (x − σ(b)) ∈ B G [x], where σ runs over the elements of G),
and therefore Fact 4.1.1 (4) implies the surjectivity of π.
To identify the underlying space of G\X with XG := Spec (B G ) it is enough
to identify them as sets, as a closed subset V (I) ⊂ X induces the closed subset
V (I G ) ⊂ XG . As we have just seen the surjectivity of π : X → XG , we have to
show that the fibres of the map X → XG coming from the inclusion B G → B
are the G-orbits of Spec (B). Assume that two G-orbits {σ(P ) : σ ∈ G} and
{σ(Q) : σ ∈ G} of points of B lie above the same point P G ∈ Spec (B G ). As the
fibre Xκ(P G ) is zero-dimensional, the σ(P ) and σ(Q) induce maximal ideals σ(P )
T T
and σ(Q) of the ring B = B ⊗B G κ(P G ) with σ(P ) = σ(Q) = 0. But using
the Chinese Remainder Theorem we find b̄ ∈ B with b̄ ∈ σ(P ) and b̄ ∈ / σ(Q) for
all σ ∈ G, which is a contradiction.
Finally, to show (π∗ OX )G ∼= OXG , notice that the first sheaf is quasi-coherent,
being the kernel of the morphism of quasi-coherent sheaves
M
π∗ O X → π∗ OX , s 7→ (. . . , σ(s) − s, . . . ).
σ∈G
Thus it is enough to check the isomorphism on the rings of sections over XG , which
are B G in both cases.
168 Chapter 5. Fundamental Groups of Schemes
In [29] Grothendieck proved this under the additional assumption that the
schemes are locally Noetherian. We give a proof due to Lenstra [52] that works in
general.
Proof: Thanks to Lemma 5.3.2 (2) it suffices to prove that G\X → S is a finite
étale cover. Apply Proposition 5.2.9 to obtain a base change X ×S Y → Y such that
X ×S Y is a finite disjoint union of copies of Y , i.e. X ×S Y ∼
= F × Y for a finite set
F . There is a natural action of G on X ×S Y coming by base change from its action
of X, which yields an isomorphism G\(X ×S Y ) ∼ = (G\F ) × Y . Now observe that
G\(X ×S Y ) ∼ = (G\X)× S Y . Indeed, there is a natural map X ×S Y → (G\X)×S Y
that is constant on G-orbits, whence a map G\(X ×S Y ) → (G\X) ×S Y . To see
that it induces an isomorphism we may argue over a small affine neighbourhood
U = Spec (A) of each point of S, with preimages Spec (B) and Spec (C) in X and
∼
Y , respectively. There the isomorphism to be proven translates to B G ⊗A C →
(B ⊗A C)G , which holds for U sufficiently small, as C is then a free A-module
with trivial G-action. We thus obtain that (G\X) ×S Y ∼ = (G\F ) × Y , which is
again a finite disjoint union of copies of Y . We may then conclude by applying
Proposition 5.2.9 in the other direction.
X
π - Z
@
@ ψ
φ
@
R ?
@
S
Proof: Fix a geometric point s̄ : Spec (Ω) → S, and let F = {x̄1 , . . . , x̄n } be the
finite set of Spec (Ω)-points of the geometric fibre Xs̄ . An ordering of the x̄i induces
a canonical geometric point x̄ of the n-fold fibre product X n = X×S · · ·×S X, giving
x̄i in the i-th component. Let P be the connected component of X n containing
the image of x̄, and let π : P → X be the map induced by the first projection of
X n to X. Using Remarks 5.2.3 we see that P is a finite étale cover of S via φ ◦ π.
Next we show that each point in the geometric fibre Ps̄ can be represented by
an n-tuple (x̄σ(1) , . . . , x̄σ(n) ) for some permutation σ of the x̄i . Indeed, each point
of Xs̄n corresponds to an element of F n , so we only have to show that the points
concentrated on P have distinct coordinates. But by Proposition 5.2.7 (3) the
diagonal image ∆(X) of X in X ×S X is open and closed, and therefore so is its
inverse image by a projection πij mapping X n to the (i, j)-components. As P is
−1 −1
connected, πij (∆(X)) ∩ P 6= ∅ would imply πij (∆(X)) ⊃ P , which is impossible
−1
because x̄ hits P away from any of the πij (∆).
Now to show that P is Galois over S, remark that each permutation σ of
the x̄i induces an S-automorphism φσ of X n by permuting the components. If
φσ ◦ x̄ ∈ Ps̄ , then φσ (P ) ∩ P 6= ∅ and so φσ ∈ Aut(P |S) by connectedness of P .
Thus Aut(P |S) acts transitively on one geometric fibre, from which we conclude
as in Remark 2.2.8 that P is Galois.
Finally, if q : Q → X is an S-morphism with Q Galois, choose a preimage ȳ of
x̄1 . As q is a surjective morphism of covers by Proposition 5.3.8, after composing
with appropriate elements of Aut(Q|S) we get n maps q = q1 , . . . , qn : Q → X
such that qi ◦ ȳ = x̄i . Whence an S-morphism Q → X n that factors through P ,
for it maps ȳ to x̄ and Q is connected. This concludes the proof.
To close this section, we briefly discuss another point of view on Galois covers
that is often useful. First some definitions.
and
id×i
G - G ×S G
@
@ e◦p
i×id
@ mult
@
? R ?
@
mult-
G ×S G G.
We say that G is finite if the structure morphism p : G → S is finite. Similarly,
G is a finite flat (resp. finite étale) group scheme if p : G → S is finite locally free
(resp. finite étale).
Examples 5.3.11
1. Let S be a scheme, Γ a finite group of order n. We define the constant group
scheme ΓS → S corresponding to Γ over S to be the disjoint union of n
copies of S indexed by Γ and equipped with the projection map given by
the identity on each component. The group operation is induced by that on
Γ. (Explicitly, a point (P1 , P2 ) of ΓS ×S ΓS lies in some component indexed
by a pair (g1 , g2 ) ∈ Γ × Γ, and on that component may be identified with a
point P ∈ S. We define P1 · P2 to be the point P on the component of ΓS
indexed by g1 g2 . The inverse is given similarly.) This is a finite étale group
scheme over S.
2. Assume S = Spec (k) for a field k. In this case a finite étale group scheme
G → S is of the form G = Spec (A) with a finite étale k-algebra A. Hence
it corresponds via Theorem 1.5.2 to a finite set equipped with a continuous
action of Gal (ks |k). The fibre of G over the geometric point given by an
algebraic closure of ks carries a group structure coming from that of G; it
is compatible with the Galois action. Conversely, Galois-equivariant group
operations on a finite continuous Gal (ks |k)-set come from a group scheme
structure on the corresponding finite étale k-scheme, by a similar reasoning
as in the proof of Theorem 2.5.15. Thus the category of finite étale k-group
schemes is equivalent to the category of finite groups Γ carrying a continuous
Gal (ks |k)-action. Here the constant group scheme ΓSpec (k) corresponds to
Γ with trivial Galois action. For a general finite étale group scheme G there
5.3 Galois Theory for Finite Étale Covers 171
is always a finite separable field extension L|k such that G ×S Spec (L) is
isomorphic to the constant group scheme ΓSpec (L) .
Proof: For the ‘only if’ part, take Y = X. For the ‘if’ part, note first that
since G ×S Y → Y and Y → S are finite, locally free and surjective, the same
must be true of X → S. Also, the map G ×S X → X ×S X corresponds via
Remark 5.1.24 to a morphism λ : (φ∗ OX ) ⊗OS (φ∗ OX ) → (φ∗ OX ) ⊗OS (φ∗ OG ).
We have G ×S Y ∼ = X ×S Y by assumption, and therefore the map (ρ, idX , idY ) :
G ×S X ×S Y → X ×S X ×S Y is an isomorphism. This means that λ becomes an
isomorphism after tensoring with φ∗ OY . But φ∗ OY is locally free, so λ must be
an isomorphism.
Remark 5.3.14 One defines torsors under arbitrary flat group schemes and states
a lemma exactly as above, except that ‘finite, locally free and surjective’ must be
replaced by ‘flat, locally of finite presentation and surjective’ everywhere. However,
the proof of the general lemma requires flat descent theory. See [59], Section III.4.
Proof: Statement (1) follows from Proposition 5.2.9 and Lemma 5.3.13. To
prove (2), one first takes a separable extension L|k for which Gk ×Spec (k) Spec (L)
is a constant group scheme, and then applies statement (1) to Y ×Spec (k) Spec (L) →
S ×Spec (k) Spec (L); the conclusion follows by another application of Proposition
5.2.9.
By the preceding discussion, there is a natural left action of π1 (S, s̄) on Fibs̄ (X)
for each finite étale S-scheme X, and therefore Fibs̄ takes its values in the category
of left π1 (S, s̄)-sets. The main theorem is now the following.
In [29] this was proven under the additional assumption that S is locally
Noetherian, because Proposition 5.3.7 was only proven in that case.
Example 5.4.3 The theorem contains as a special case the case S = Spec (k) for
a field k. Here a finite étale S-scheme X is the spectrum of a finite étale k-algebra.
For a geometric point s̄ the fibre functor maps a connected cover X = Spec (L)
to the underlying set of Spec (L ⊗k Ω), which is a finite set indexed by the k-
algebra homomorphisms L → Ω. The image of each such homomorphism lies in
the separable closure ks of k in Ω via the embedding given by s̄. So finally we
obtain that Fibs̄ (X) ∼= Homk (L, ks ) for all X = Spec (L), and therefore π1 (S, s̄) ∼
=
Gal (ks |k). Thus in this case the theorem is equivalent to Theorem 1.5.2.
Note that although in the above example the fibre functor is identified with
the functor X 7→ Hom(Spec (ks ), X), this does not mean that the fibre functor is
representable, for ks is not a finite étale k-algebra. However, it is the union of
its finite Galois subextensions which already are. Passing to the associated affine
schemes, this motivates the following definition.
Notice that in the above definition the inverse limit of the Pα may not exist
in C, but the direct limit of the Hom(Pα , X) does in the category of sets. Recall
that by definition the direct limit of a direct system (Sα , φαβ ) of sets is the disjoint
union of the sets Sα modulo the equivalence relation where sα ∈ Sα is equivalent
to sβ ∈ Sβ if φαγ (sα ) = φαγ (sβ ) for some γ ≥ α, β.
In Example 5.4.3 the fibre functor was pro-representable by the inverse system
of spectra of finite (Galois) extensions contained in ks . We now prove that this
holds in general.
Proposition 5.4.6 Under the assumptions of the theorem the fibre functor Fibs̄
is pro-representable.
Proof: Take the index set Λ to be the set of all finite étale Galois covers Pα → S,
and define Pα ≤ Pβ if there is a morphism Pβ → Pα . This partially ordered set
is directed, because if Pα , Pβ ∈ Λ, we may apply Proposition 5.3.9 to a connected
component Z of the fibre product Pα ×S Pβ to obtain Pγ ∈ Λ together with maps
Pγ → Z → Pα , Pγ → Z → Pβ .
The objects of the inverse system will be the Pα themselves, so we next have
to define the morphisms φαβ . If Pα ≤ Pβ in the above ordering, then by definition
there exists a morphism φ : Pβ → Pα over S. This φ is in general not unique, so
we ‘rigidify’ the situation as follows. For each Pα ∈ Λ we fix an arbitrary element
pα ∈ Fibs̄ (Pα ). Since Pβ → S is a Galois cover, we find by Corollary 5.3.3 a unique
S-automorphism λ of Pβ so that φ ◦ λ maps pβ to pα . Defining φαβ := φ ◦ λ we
obtain an inverse system (Pα , φαβ ) such that moreover for each α ≤ β we have
Fibs̄ (φαβ )(pβ ) = pα .
As in Remark 5.4.5 above, for every X in FetS and every Pα ∈ Λ there is a
natural map Hom(Pα , X) → Fibs̄ (X) sending φ ∈ Hom(Pα , X) to Fibs̄ (φ)(pα ).
These maps are compatible with the transition maps in the inverse system defined
above, whence a functorial map lim Hom(Pα , X) → Fibs̄ (X). To conclude the
→
proof we have to construct a functorial inverse to this map. To do so, we may
assume X is connected (otherwise take disjoint unions), and consider the Galois
closure π : P → X given by Proposition 5.3.9. Here P is one of the Pα ∈ Λ
by definition, and since it is Galois, for each x̄ ∈ Fibs̄ (X) we find a unique S-
automorphism λ as above so that Fibs̄ (π ◦ λ) maps the distinguished element
5.4 The Algebraic Fundamental Group in the General Case175
pα ∈ Fibs̄ (Pα ) to x̄. Now sending x̄ to the class of π ◦ λ in lim Hom(Pα , X) yields
→
the required inverse.
An inspection of the above proof shows that the maps φαβ in the system pro-
representing the fibre functor Fibs̄ depend on the choice of the system of geometric
points (pα ); however, once such a system (pα ) is fixed, the pro-representing system
becomes unique. This fact will be crucial for the proof of the next corollary.
Corollary 5.4.7 Every automorphism of the functor Fibs̄ comes from a unique
automorphism of the inverse system (Pα , φαβ ) constructed in the proof above.
Before stating the next corollary, recall from Chapter 2 that the opposite group
of a group G is the group Gop with the same underlying set but multiplication
defined by (x, y) 7→ yx.
Corollary 5.4.8 The automorphism groups Aut(Pα )op form an inverse system
whose inverse limit is isomorphic to π1 (S, s̄).
surjectivity, leaving the details for fully faithfulness to the readers. Given a finite
continuous left π1 (S, s̄)-set E, we may assume that the π1 (S, s̄)-action is transitive
by decomposing E in orbits. The stabilizer U of a point x ∈ E is an open subgroup
of π1 (S, s̄), and therefore contains an open normal subgroup Vα arising as the
kernel of a projection π1 (S, s̄) → Aut(Pα |S)op , since the Vα form a basis of open
neighbourhoods of 1 in π1 (S, s̄). Let U be the image of U in Aut(Pα |S)op , and let
op
X be the quotient of Pα by the action of U constructed in Lemma 5.3.7. Then
E∼ = Fibs (X).
Finally we make the link with the theory of the previous chapter. In fact, it
generalizes to an arbitrary integral normal scheme S.
Proof: A finite étale cover X → S is normal by Proposition 5.3.2 (3). Its generic
fibre is a finite product of finite separable extensions Li |K, so by uniqueness of
normalization (Fact 5.2.11 (2)) X must be the disjoint union of the normalizations
of S in the Li . This being said, one proves exactly as in Theorem 4.6.4 that KS |K
is a Galois extension, and that the category of finite étale S-schemes is equivalent
to that of finite continuous left Gal (KS |K)-sets, except that for the composition
and base change properties of finite étale covers one applies Remarks 5.2.3 as in
the proof of Lemma 3.4.2. By construction, the equivalence is induced by the fibre
functor at the geometric point s̄.
set Λ and objects Pα , only the transition morphisms may be different. Denote them
by φαβ and ψαβ , respectively. Proving the proposition amounts to constructing
an isomorphism of the inverse system (Pα , φαβ ) onto the system (Pα , ψαβ ), i.e. a
system of automorphisms λα ∈ Aut(Pα |S) transforming the maps φαβ to the ψαβ .
Assume given a pair α ≤ β in Λ and an automorphism λβ ∈ Aut(Pβ |S). Consider
the distinguished elements pα ∈ Fibs̄ (Pα ) and pβ ∈ Fibs̄ (Pβ ). By construction we
have Fibs̄ (φαβ )(pβ ) = pα . On the other hand, set p′α := Fibs̄ (ψαβ )(λβ (pβ )). As
Pα is Galois, there is a unique automorphism λα ∈ Aut(Pα |S) mapping pα to p′α .
Corollary 5.3.3 applied with z̄ = pα , φ1 = ψαβ ◦ λβ and φ2 = λα ◦ φαβ then implies
that the diagram of S-morphisms
λβ
Pβ −−−→ Pβ
φαβ y
ψ
y αβ
α λ
Pα −−−→ Pα
commutes. Define ραβ : Aut(Pβ |S) → Aut(Pα |S) to be the map sending each
λβ ∈ Aut(Pβ |S) to the λα defined as above. It is a map of sets but in general not
a group homomorphism. We thus obtain an inverse system (Aut(Pα |S), ραβ ) of
finite sets. The inverse limit of such a system is nonempty (see Corollary 3.4.12).
An element of it defines the required isomorphism of (Pα , φαβ ) onto (Pα , ψαβ ).
Corollary 5.5.2 Let S be a connected scheme. For any two geometric points
s̄ : Spec (Ω) → S and s̄′ : Spec (Ω′ ) → S there exists a continuous isomorphism of
∼
profinite groups π1 (S, s̄′ ) → π1 (S, s̄).
∼
Proof: An isomorphism λ : Fibs̄ → Fibs̄′ of fibre functors induces an isomor-
phism of their automorphism groups via φ 7→ λ−1 ◦ φ ◦ λ. Continuity of this
isomorphism with respect to the profinite structure follows from the construction
in the proof above.
φ∗ : π1 (S ′ , s̄′ ) → π1 (S, s)
Proposition 5.5.4
1. The map φ∗ is trivial if and only if for every connected finite étale cover
X → S the base change X ×S S ′ is a trivial cover (i.e. isomorphic to a finite
disjoint union of copies of S ′ ).
2. The map φ∗ is surjective if and only if for every connected finite étale cover
X → S the base change X ×S S ′ is connected as well.
Proof: A finite étale cover is trivial if and only if it corresponds to a finite set
with trivial action of the fundamental group. This immediately gives the ‘only if’
part of the first statement. For the ‘if’ part assume that im (φ∗ ) is nontrivial, and
choose an open subgroup of π1 (S, s̄) not containing the whole of im (φ∗ ). Then the
action of π1 (S ′ , s̄′ ) on the coset space π1 (S, s̄)/U induced by φ∗ is nontrivial, so
the finite étale cover corresponding to U \π1 (S, s̄) pulls back to a nontrivial cover
of S ′ .
In the second statement the ‘only if’ part follows from the fact that connected
covers correspond via the fibre functor to sets with transitive π1 (S, s̄)-action. For
the ‘if’ part assume that im (φ∗ ) is not the whole of π1 (S, s̄). It is then a proper
closed subgroup by compactness of π1 (S ′ , s̄′ ), so we find a nontrivial open subgroup
U ⊂ π1 (S, s̄) containing it (see Lemma 5.5.7 (1) below for a stronger statement).
Then π1 (S ′ , s̄′ ) acts trivially on the coset space U \π1 (S, s̄) via φ∗ , which means
that the connected cover corresponding to U \π1 (S, s̄) pulls back to a trivial cover
of S ′ (different from S ′ ).
Proposition 5.5.5 Let U ⊂ π1 (S, s̄) be an open subgroup, and let X → S be the
connected cover corresponding to the coset space U \π1 (S, s̄), together with the base
point x̄ described above.
The subgroup U contains the image of φ∗ if and only if the finite étale cover
X ×S S ′ → S ′ has a section S ′ → X ×S S ′ sending s̄ to x̄.
Proof: For the first statement, given g ∈ G \ H we shall find an open subgroup
of G containing H but not g. To do so, by closedness of H we first pick an open
normal subgroup N ⊂ G with gN ∩ H = ∅ (such an N exists, because the gN of
this type form a basis of open neighbourhoods of g). Denoting by p : G → G/N
the natural projection, the open subgroup p−1 (p(H)) ⊂ G will do.
For statement (2) use the closedness of V ′ in G and part (1) to write both
H and V ′ as the intersection of the open subgroups of G containing them. Since
[H : V ′ ] is finite, we find finitely many open subgroups V1 , . . . , Vn in G containing
V ′ but not H such that V ′ = V1 ∩ · · · ∩ Vn ∩ H. Thus V := V1 ∩ · · · ∩ Vn will do.
Proof of Proposition 5.5.6: For the ‘only if’ part let X be as in the statement
of the proposition, and let U ⊂ π1 (S, s̄) be an open subgroup with X ∼ = U \π1 (S, s̄).
By choosing an appropriate geometric base point we may identify the component
Xi ⊂ X ×S S ′ with the coset space U ′′ \π1 (S ′ , s̄′ ), for some open subgroup U ′′ ⊂
π1 (S ′ , s̄′ ). Note that U ′′ must contain ker(φ∗ ), as ker(φ∗ ) stabilizes the base point
by construction. To say that there is a morphism Xi → X ′ is then equivalent to
the inclusion U ′′ ⊂ U ′ , but U ′′ contains ker(φ∗ ) as we just said, and we are done.
180 Chapter 5. Fundamental Groups of Schemes
Corollary 5.5.8 The map φ∗ is injective if and only if for every connected finite
étale cover X ′ → S ′ there exists a finite étale cover X → S and a morphism
Xi → X ′ over S ′ , where Xi is a connected component of X ×S S ′ .
In particular, if every connected finite étale cover X ′ → S ′ is of the form
X ×S S ′ → S ′ for a finite étale cover X → S, then φ is injective.
Proof: This follows from the proposition since the intersection of the open sub-
groups of π1 (S ′ , s̄′ ) is trivial.
ψ φ
Corollary 5.5.9 Let S ′′ → S ′ → S be a a sequence of morphisms of connected
schemes, and let s̄, s̄′ , s̄′′ be geometric points of S, S ′ and S ′′ , respectively, satisfying
s̄ = φ ◦ s̄′ and s̄′ = ψ ◦ s̄′′ . The sequence
ψ∗ φ∗
π1 (S ′′ , s̄′′ ) → π1 (S ′ , s̄′ ) → π1 (S, s̄)
Remark 5.5.10 In part (2) of the above criterion it suffices to consider finite
Galois covers. This is because ker(φ∗ ), being a closed normal subgroup of π1 (S ′ , s̄′ ),
is the intersection of the open normal subgroups containing it.
Lemma 5.6.2 Given a finite étale cover Y → X, there is a finite extension L|k
contained in ks and a finite étale cover YL of XL := X ×Spec (k) Spec (L) so that
Y ∼= YL ×Spec (L) Spec (ks ).
Similarly, elements of Aut(Y |X) come from Aut(YL |XL ) for L sufficiently
large.
Proof: We prove the first statement, the proof of the second one being similar.
Since X is quasi-compact by assumption, it has a finite covering by open affine
subschemes. Let Ui = Spec (Ai ) (1 ≤ i ≤ n) be a finite affine open covering of X
so that the preimage in Y of each U i := Spec (Ai ⊗k ks ) is an affine open subscheme
of the form Spec (B i ), where B i is a finitely generated Ai ⊗k ks -module. Each B i
then arises as a quotient of some polynomial ring (Ai ⊗k ks )[x1 , . . . , xm ] by an ideal
generated by finitely many polynomials f1 , . . . , fr . The finitely many coefficients
involved in the fj generate a finitely generated ks -subalgebra A′i ⊂ Ai ⊗k ks , itself
a quotient of a polynomial ring over ks by an ideal generated by finitely many
polynomials g1 , . . . , gs . If L|k is the finite extension generated by the coefficients
of the gl , then the coefficients of the fj are contained in Ai ⊗k L, and hence
Bi ∼= ((Ai ⊗k L)[x1 , . . . , xm ]/(f1 , . . . , fr )) ⊗L ks . Moreover, such an isomorphism
holds for all i for L sufficiently large. Similarly, one sees that the isomorphisms
showing the compatibility of the Y Ui over the overlaps Ui ∩ Uj can be defined by
equations involving only finitely many coefficients. Thus an extension L that is so
large that it contains all the coefficients involved satisfies the requirements of the
lemma.
Proof of Proposition 5.6.1: Injectivity of the map π1 (X, x̄) → π1 (X, x̄) follows
from the criterion of Corollary 5.5.8 in view of the lemma. Similarly, the surjectiv-
ity of π1 (X, x̄) → Gal (ks |k) follows from Proposition 5.5.4 (2) and our assumption
that X is geometrically connected.
It thus remains to prove exactness in the middle. For this we apply the criterion
of Corollary 5.5.9, complemented by Remark 5.5.10. Condition (1) is straightfor-
ward: each finite étale cover of the form XL → X, with L|k a finite extension,
yields a trivial cover when pulled back to X. For condition (2), assume Y → X
is a finite étale Galois cover such that Yks → X has a section. As X is integral,
182 Chapter 5. Fundamental Groups of Schemes
Remarks 5.6.3
1. Using the proposition we may extend the definition the outer Galois rep-
resentation ρX : Gal (ks |k) → Out(π1 (X, x̄)) of Section 4.7 to an arbitrary
quasi-compact and geometrically integral scheme over a field k.
2. The proposition also enables us to give another, slightly more precise formu-
lation of Grothendieck’s Section Conjecture discussed at the end of Section
4.9. Given a k-rational point y : Spec k → X, it induces by functoriality a
map σy : Gal (k̄|k) → π1 (X, ȳ) for a geometric point ȳ lying above y. This
is not quite a section of the exact sequence of the proposition because of the
difference of base points. But by Remark 5.5.3 a path from ȳ to x̄ induces
∼
an isomorphism λ : π1 (X, ȳ) → π1 (X, x̄) that is uniquely determined up
to inner automorphism of π1 (X, x̄). The composite λ ◦ σy is then a section
of the exact sequence uniquely determined up to conjugation. The Section
Conjecture predicts that in the case when X is a smooth projective curve
of genus at least 2 and k is finitely generated over Q this construction es-
tablishes a bijection between k-rational points of X and conjugacy classes
of sections.
is exact.
For the notion of a flat morphism, see Remark 5.2.2 (2). The main tool in the
proof is the Stein factorization theorem. Let us recall the statement.
5.6 The Homotopy Exact Sequence and Applications 183
Proof of Proposition 5.6.4: We first prove surjectivity of the map π1 (X, x̄) →
π1 (S, s̄) by means of the criterion of Proposition 5.5.4 (2). Assume given a con-
nected finite étale cover ρ : Y → S. We have to show that ρX : Y ×S X → X is
again connected. By our assumption on φ we have φ∗ OX = OS , from which we con-
clude φ∗ (ρX∗ OX×S Y ) = ρ∗ OY . Were X ×S Y disconnected, the sheaf ρX∗ OX×S Y
would decompose as a nontrivial direct product of sheaves of OX -algebras, cor-
responding to the regular functions on different components. But then the same
would be true of ρ∗ OY , contradicting the connectedness of Y .
Next we show exactness in the middle by checking the conditions of Corollary
5.5.9. Condition (1) is automatic, so we turn to condition (2). Take a connected
finite étale cover X ′ → X, and consider the Stein factorization of the composite
map X ′ → X → S. We obtain a finite morphism ψ : S ′ → S and a morphism
φ′ : X ′ → S ′ with φ′∗ OX ′ = OS ′ . As we assumed φ flat, we may apply Corollary
III.12.9 of [34]. It first shows that ψ∗ OS ′ = (ψ ◦ φ′ )∗ OX ′ is locally free, i.e. ψ
is a locally free morphism. Secondly, it shows that for each point P of S we
have (ψ∗ OS ′ )P ⊗OX,P κ(P ) = ((ψ ◦ φ′ )∗ OX ′ )P ⊗OX,P κ(P ) ∼ = OXP′ (XP′ ), where
XP′ = X ′ ×S Spec (κ(P )). But since φ had proper geometrically integral fibres,
here OXP′ (XP′ ) ⊗κ(P ) κ(P ) must be a finite direct product of copies of κ(P ). We
conclude that ψ : S ′ → S is a finite étale cover; it is connected because so is X ′
and φ′ is surjective by the condition φ′∗ OX ′ = OS ′ . Consider the natural map
X ′ → X ×S S ′ given by the universal property of the fibre product. It is a map
of finite étale covers of X. Assuming, as in condition (2) of Corollary 5.5.9, that
the base change of X ′ → X to Xs̄ has a section implies that the base change of
X ×S S ′ has a section as well. This shows that the finite étale cover X ′ → X ×S S ′
must be of degree 1, i.e. an isomorphism.
Proof: The map (5.2) may be inserted in the commutative diagram of exact
sequences
1 −−−→ π1 (X, x̄) −−−→ π1 (X × Y, (x̄, ȳ)) −−−→ π1 (Y, ȳ) −−−→ 1
y y y
1 −−−→ π1 (X, x̄) −−−→ π1 (X, x̄) × π1 (Y, ȳ) −−−→ π1 (Y, ȳ) −−−→ 1.
Here the upper row comes from the previous proposition applied to the morphism
X × Y → Y and the geometric point ȳ. Injectivity on the left comes from the fact
that the projection X × Y → Y yields a section of the inclusion map X → X × Y
(where X is identified with the fibre Xȳ ) sending (x̄, ȳ) to x̄. The outer vertical
maps are identity maps, hence the middle one must be an isomorphism.
We conclude this section by two applications of Corollary 5.6.6, taken from the
classic paper of Lang and Serre [51]. The first one is:
In the next section we shall give another proof of the proposition due to Pop
that is more natural but works under somewhat different assumptions. The proper-
ness assumption in the proposition is essential in positive characteristic; see Exer-
cise 8.
The second application of Corollary 5.6.6 concerns abelian varieties. Recall
that an abelian variety over a field k means a proper geometrically integral group
scheme over k; it is necessarily commutative by [65], §II.4.
Yi ∼
= Yi′ ∼
= Y for i = 1, 2. This yields a map Y × Y → Y ′ , whence also a map
mY : Y × Y → Y by composing with the projection Y ′ → Y . Fix a point 0Y of
Y in the fibre above 0. Modifying mY by an automorphism of Y if necessary we
may assume mY (0Y , 0Y ) = 0Y .
We contend that mY is a commutative group operation on Y with 0Y as
null element. By construction this will also imply that Y is an abelian variety
(being proper over k) and that φ is a group homomorphism. To see that mY is
commutative, consider the automorphism σY : Y × Y given by permuting the
factors. The maps mY and mY ◦ σY : Y × Y → Y coincide at the point (0Y , 0Y ),
so they are the same by Corollary 5.3.3. Similar reasonings show that mY is
associative with null element 0Y . The existence of an inverse is equivalent to the
property that the product map (idY , mY ) : Y ×Y → Y ×Y is an isomorphism. We
know that (idA , mA ) is an isomorphism. Therefore from the commutative diagram
(idY ,mY )
Y × Y −−−−−−→ Y × Y
φ×φy
φ×φ
y
(idA ,mA )
A × A −−−−−−→ A × A
we infer that the composite (φ × φ) ◦ (idY , mY ) : Y × Y → A × A is an étale
Galois cover with group G × G. But so is φ × φ, so (idY , mY ) must indeed be an
isomorphism.
We can now completely determine the fundamental group of the abelian variety
A. Recall that given a prime number ℓ, the ℓ-adic Tate module Tℓ (A) of A is defined
as the inverse limit of the inverse system formed by the ℓr -torsion subgroups ℓr A(k)
for all r > 0. Here A(k) denotes the group of k-rational points of A (i.e. the group
of sections of the structure morphism A → Spec (k)). The direct product of the
Tℓ (A) for all ℓ is the full Tate module T (A) of A; it is also the inverse limit of the
inverse system of n-torsion subgroups n A(k) partially ordered by divisibility.
Theorem 5.6.10 Let A be an abelian variety over an algebraically closed field.
The fundamental group of A is commutative, and there are natural isomorphisms
Y
π1 (A) ∼
= T (A) ∼
= Tℓ (A).
ℓ
We have omitted the base point from the notation as it plays no role in the
commutative case (Remark 5.5.3).
Proof: By Theorem II.8.18 in [34] one may find H with X ∩ H smooth. Then
Remark III.7.9.1 of op. cit. shows the connectedness of X ∩ H for this H.
The same argument carries over mutatis mutandis to to the closed immersion
Y ×X (H ∩ X) ֒→ Y and shows the connectedness of Y ×X (H ∩ X).
Proof of Proposition 5.7.1: Since by Remark 5.2.3 (2) the fibre product
Y ×X (H ∩ X) in the lemma is a connected finite étale cover of H ∩ X, we obtain
from Proposition 5.5.4 (2) a surjection π1 (H ∩ X, x̄) ։ π1 (X, x̄) for a geomet-
ric point x̄ of H ∩ X. As H ∩ X is smooth and projective of strictly smaller
dimension than X, we may apply induction on dimension to obtain a surjection
π1 (C, x̄) ։ π1 (X, x̄), where C ⊂ X is a smooth projective curve obtained by cut-
ting X with a linear subspace in Pn , and x̄ is a geometric point of C. Over k = C
we know from Theorem 4.6.7 that π1 (C, x̄) is topologically finitely generated, hence
so is π1 (X, x̄).
Remark 5.7.3 The key point in the above proof was the surjectivity of the map
π1 (H ∩ X, x̄) → π1 (X, x̄). In fact, when X has dimension at least 3, the Lef-
schetz hyperplane theorem states that this map is actually an isomorphism for any
hyperplane H (and X ∩ H is connected).
Over k = C there is a simple topological proof by Andreotti and Frenkel (see
[60], §7). In the general case there is a proof by Grothendieck in [30]. He constructs
an equivalence of categories between FetX and FetX∩H in three steps. First he
takes an open subscheme U ⊂ X containing X ∩H. As X has dimension at least 3,
each codimension 1 closed subset meets X∩H and hence the complement of U must
be of codimension ≥ 2. Corollary 5.2.14 then implies an equivalence between FetX
and FetU . Next he considers the formal completion X b of X along X ∩ H ([34],
Section II.9) and defines finite étale covers for formal schemes. He then proves
a nontrivial algebraization theorem asserting the equivalence of FetU and FetXb .
Finally, the equivalence between FetXb and FetX∩H results by a generalization of
Exercise 7. For a simplified account of Grothendieck’s proof, see [33], Chapter IV.
5.7 Structure Theorems for the Fundamental Group 189
Theorem 5.7.4 Let X be a connected scheme of finite type over C. The functor
(Y → X) 7→ (Y an → X an ) induces an equivalence of the category of finite étale
covers of X with that of finite topological covers of X an . Consequently, for every
C-point x̄ : Spec (C) → X this functor induces an isomorphism
π1top\
∼
(X an , x̄) → π1 (X, x̄)
where on the left hand side we have the profinite completion of the topological
fundamental group of X with base point im (x̄).
The hard part of the theorem is the essential surjectivity of the functor. As in
the theory of Riemann surfaces, one first proves that every finite topological cover
p : Y → X an can be equipped with a canonical structure of analytic space for
which p becomes a proper holomorphic map. Next there is a reduction to the case
where X is normal. One then has to use a deep theorem, first proven by Grauert
and Remmert, which says that for normal X every proper analytic map Z → X an
with finite fibres is isomorphic to Y an → X an for a finite surjective morphism
of schemes Y → X. In the case when X is a projective variety over C this is
part of Serre’s famous GAGA theorems. The proof given in [29], Exposé XII does
not use the theorem of Grauert and Remmert, but needs Hironaka’s resolution of
singularities.
Remark 5.7.5 Despite the above theorem, there is in general a big difference be-
tween the topological and algebraic fundamental groups. Toledo [104] constructed
a smooth projective variety X over C such that the intersection of all subgroups
of finite index in π1top (X, x̄) is a free group of infinite rank. In other words, the
completion map π1top (X, x̄) → π1 (X, x̄) has a free kernel of infinite rank. It is
an open question of Serre whether there exist examples with π1 (X, x̄) = {1} but
π1top (X, x̄) 6= {1}.
190 Chapter 5. Fundamental Groups of Schemes
Lemma 5.7.7 Let G1 be a profinite group that has only finitely many open normal
subgroups of index N for every integer N > 0. Assume that G2 is another profinite
group such that for every open normal subgroup U ⊂ G1 there is an open normal
subgroup V ⊂ G2 with G1 /U ∼ = G2 /V , and vice versa. Then:
Proof: For part (1) denote by UN (resp. VN ) the intersection of all open normal
subgroups of index at most N in G1 (resp. G2 ). By assumption [G1 : UN ] is finite.
If V ⊂ G2 is an open normal subgroup that is an intersection of open normal
subgroups of index at most N , there is an open normal subgroup U ⊃ UN in G1
with G1 /U ∼= G2 /V . Thus G2 /V is a quotient of G1 /UN . Now let W ⊂ G2 be an
open normal subgroup of maximal index such that G2 /W arises as a quotient of
G1 /UN . By the previous discussion G2 /(W ∩ V ) is again a quotient of G1 /UN for
every open normal V ⊂ G2 of index at most N . It follows from the maximality
assumption that the inclusion W ∩V ⊂ W must be an equality, and hence VN = W .
In particular, [G2 : VN ] is finite. As the finite groups G1 /UN and G2 /VN have the
same quotients, the natural surjection G1 /UN ։ G2 /VN is an isomorphism.
∼
Now consider the finite set XN of isomorphisms G1 /UN → G2 /VN for each
N > 0. The XN form a natural inverse system indexed by the positive integers,
∼
because composing a given isomorphism G1 /UN +1 → G2 /VN +1 with the projection
G2 /VN +1 → G2 /VN induces an isomorphism G1 /UN → G2 /VN . Thus we obtain an
inverse system of nonempty finite sets, and the inverse limit is nonempty by Lemma
∼
3.4.12. An element of lim XN is a compatible system of isomorphisms G1 /UN →
←
∼
G2 /VN , so it induces a continuous isomorphism lim G1 /UN → lim G2 /VN . But
← ←
these inverse limits are G1 and G2 , respectively, because the UN are cofinal in the
system of open normal subgroups of G1 , and similarly for the VN .
5.7 Structure Theorems for the Fundamental Group 191
Remark 5.7.8 Though it works under more restrictive assumptions than that
of Proposition 5.6.7, the above proof has several advantages. To begin with, it
avoids the use of the difficult Proposition 5.6.4, and in particular the properness
assumption that was crucial in its proof. Instead, all we need here is the fact that
π1 (XC , x̄C ) is topologically finitely generated. In particular, the above proof also
works for a normal affine curve X over an algebraically closed field of characteristic
0 (by virtue of Theorem 4.6.7), and we obtain a result used in the previous chapter.
On the other hand, if one is only interested in proving Proposition 5.6.7 for
proper normal schemes but not Corollary 5.7.6, even the transcendental input is
superfluous. In fact, in order to be able to apply the lemma, all one needs is the
fact that π1 (X, x̄) has only finitely many open normal subgroups of index N for
every integer N > 0. But, as already noted in the previous chapter, this was proven
in a purely algebraic way for proper normal schemes (in arbitrary characteristic)
by Lang and Serre ([51], th. 4).
1. The natural map π1 (Xs , ȳ) → π1 (X, ȳ) induced by the map Xs → X is an
isomorphism.
This is proven in [29], Exposé X using deep algebraization techniques for formal
schemes. We refer to Section 8.5.C of [36] for an excellent exposition.
If k is algebraically closed and s = s̄, we may consider the composite map
∼ ∼
sp : π1 (Xη̄ , x̄) → π1 (X, x̄) → π1 (X, ȳ) → π1 (Xs̄ , ȳ) (5.4)
where the middle isomorphism is a non-canonical one coming from Corollary 5.5.2,
the first map is the one in part (2) of the theorem, and the last one is the inverse of
the isomorphism of part (1). It is called a specialization map for the fundamental
group associated with φ. As it depends on the choice of a path from x̄ to ȳ, it is
unique up to inner automorphism.
Concerning the specialization map the main result is the following.
where the superscripts (p′ ) denote the maximal prime-to-p quotients of the profinite
groups involved.
5.7 Structure Theorems for the Fundamental Group 193
Proof of Theorem 5.7.10: In view of Theorem 5.7.9 (1) we have to show that
the map π1 (Xη̄ , x̄) → π1 (X, x̄) induces an isomorphism on maximal prime-to-p
quotients.
We first show that the map π1 (Xη̄ , x̄) → π1 (X, x̄) itself is surjective. By
Proposition 5.5.4 (2) we have to check that for each connected finite étale cover
π : Y → X the base change Yη̄ := Y ×X η̄ remains connected. The scheme
Yη̄ is the inverse limit of the schemes Yη′ := Y ×S Spec (K ′ ), where K ′ |K is a
finite extension contained in the algebraic closure of K corresponding to η̄ ′ . As
an inverse limit of connected topological spaces is again connected, it suffices to
check the connectedness of the Yη′ . But Yη′ is the generic fibre (so in particular an
open subscheme) of Y ′ = Y ×S S ′ , where S ′ is the spectrum of the normalization
A′ of A in K ′ , so therefore it is enough to check the connectedness of Y ′ . If Y ′
were the disjoint union of two nonempty closed subsets Z1 and Z2 , one of them,
say Z1 , would contain the special fibre Ys . Indeed, Ys is the same as Xs (since
s = s̄) and Xs is connected by assumption. But then the image of Z2 by the map
194 Chapter 5. Fundamental Groups of Schemes
Remark 5.7.12 In fact, one may define the specialization map in the following
more general situation: S is a locally noetherian scheme, X → S is a proper
morphism with connected geometric fibres, and s0 , s1 are scheme-theoretic points
of S such that s0 lies in the closure of s1 in S. Fix geometric points s̄i lying
above the si and geometric points x̄i of the corresponding geometric fibres. We
then have a specialization homomorphism π1 (Xs̄1 , x̄1 ) → π1 (Xs̄0 , x̄0 ) defined as
follows. Replace S by the spectrum of the local ring OZ,s0 of s0 on the closure Z
of s1 in S; this does not affect the geometric fibres Xs̄i . Let A be the completion
of the localization of the integral closure of OZ,s0 in its fraction field K0 by a
prime ideal of height 1. It is a complete discrete valuation ring containing OZ,s0 ,
hence the specialization map (5.4) is defined for X ×S Spec (A) → Spec (A). The
geometric special fibre Xs̄0 is preserved, and the natural map Xη̄ → Xs̄0 induces
5.7 Structure Theorems for the Fundamental Group 195
Grothendieck’s main motivation for developing the above theory was to prove
Theorem 4.9.1 in positive characteristic. Let us restate it in the proper case.
As already remarked in the previous chapter, for k = C this follows from the
topological theory and the Riemann existence theorem, and then for general k of
characteristic 0 it follows from Proposition 5.6.7. When k has positive character-
istic, the proof proceeds in three steps.
1. First one uses the fact that there exists a discrete valuation ring A with fraction
field K of characteristic 0 and residue field κ isomorphic to k ([57], Theorem 29.1).
Here A may be assumed complete by taking its completion.
2. Then one uses the existence of a smooth proper scheme X → Spec (A) with
Xs ∼= X and Xη a smooth proper curve over K. This can be proven in several
ways. The original approach of Grothendieck was to extend X first to a formal
Spec (A)-scheme and use an algebraization theorem; see [36], Theorem 8.5.19.
Another approach is to use the classification of curves. For instance, one may
embed X in projective space using the third power of the canonical sheaf, and then
identify its isomorphism class with a k-point [X] of the Hilbert scheme denoted
by Hg0 in [15]. According to Corollaries 1.7 and 1.9 of that paper the scheme Hg0
is smooth over Spec (Z), and therefore a form of Hensel’s lemma (see e.g. [64],
§III.5) can be applied to extend [X] to an A-valued point. (This argument works
for g ≥ 2, but the cases g = 0, 1 are easily handled directly.)
3. Finally, one applies Theorem 5.7.10.
Proof: Like Proposition 5.7.1, this is proven by reduction to the case of curves
treated in the theorem above.
Combining the arguments involved in the above proof with techniques of flat
descent theory, Grothendieck proved in ([29], X.2.10) that the corollary holds for
196 Chapter 5. Fundamental Groups of Schemes
Remarks 5.7.16
1. It follows from the definition that π1t (U, ū) is a quotient of π1 (U, ū); in par-
ticular, it is a profinite group. When ū is given by an algebraic closure K(U )
of the function field K(U ), we may identify π1t (U, ū) with the Galois group
Gal (KUt |K(Y )), where KUt is the compositum in K(U ) of the function fields
of tamely ramified connected finite étale covers of U . This makes sense,
because composita of tamely ramified extensions of discrete valuation rings
are again tamely ramified ([69], Chapter II, Corollary 7.9).
3. The above definition of tameness is taken from Kerz and Schmidt [43]. They
show that if there exists a regular proper X containing U such that the
complement X \ U is a so-called normal crossing divisor, then Y → U is
tamely ramified if and only if it is tamely ramified along X \ U . The latter
condition is the definition of tameness adopted in [29]. In [43] it is also
5.8 The Abelianized Fundamental Group 197
shown that a finite étale cover Y → U is tamely ramified if and only if the
base change X ×S C → C is a tamely ramified cover of C for every regular
integral separated scheme of finite type over S that has dimension 1. This
implies in particular that the notion of tameness is preserved by base change.
Torsion elements in Pic (X) of order prime to the characteristic of K give rise
to finite étale covers of X via the following construction.
From now on we write π1ab (X) for the maximal abelian profinite quotient of
the fundamental group π1 (X, x̄) with respect to some geometric base point x̄. It
can also be described as the quotient of π1 (X, x̄) by the closure of its commutator
subgroup. According to Remark 5.5.3 it does not depend on the choice of the
geometric base point.
Here for an abelian group A the notation m A stands for the m-torsion subgroup,
and continuous homomorphisms are considered.
5.8 The Abelianized Fundamental Group 199
Keeping the assumptions under which the above facts hold, we have for each
m > 0 a commutative diagram
0 −−−→ Pic0X (k) −−−→ PicX (k) −−−→ N S(X) −−−→ 0
m m m
y y y
0 −−−→ Pic0X (k) −−−→ PicX (k) −−−→ N S(X) −−−→ 0
where the vertical maps are given by multiplication by m, the left one being
surjective. We deduce an exact sequence on m-torsion subgroups
Assuming m prime to the characteristic, we may combine the above with Propo-
sition 5.8.3 and obtain an exact sequence
since in this case OX (X) = k is divisible. Dualizing the above sequence of Z/mZ-
modules and bearing in mind the facts recalled in Remark 5.6.11 we obtain:
0 → Hom(m N S(X), Z/mZ) → π1ab (X)/mπ1ab (X) → Hom(m Pic0X (k), Z/mZ) → 0.
where µm is the group of m-th roots of unity in k. It is called the Weil pairing.
Proof: By choosing a primitive ℓr -th root of unity in k for all r > 0 we obtain
isomorphisms µℓr ∼ = Z/ℓr Z. Hence we may identify the group ℓr AlbX (k) with
Hom(ℓr PicX (k), Z/ℓr Z). The statement then follows from applying Corollary 5.8.8
0
with m = ℓr for all r > 0, and passing to the inverse limit in the resulting inverse
system of exact sequences of finite abelian groups.
Remark 5.8.11 The exact sequence of the corollary also holds for ℓ = p. In
fact, an argument of Raynaud in flat cohomology (see [59], Proposition III.4.16)
implies that there are isomorphisms Hom(π1ab (X), Z/mZ) ∼ = Hom(µm , Pic X ) for
all m > 0. The rest of the proof then goes through. Consequently, the whole
torsion subgroup of π1ab (X) is finite, and the quotient is isomorphic to the full
Tate module T (AlbX ).
On the other hand, if we only assume X to be smooth and quasi-projective,
Serre’s theory of generalized Albanese varieties implies an analogous exact sequence
for the abelianized tame fundamental group. See [97], Proposition 4.4.
We now investigate the abelianized fundamental group over more general bases.
First assume X is a proper smooth geometrically integral scheme of finite type over
an arbitrary perfect base field k. The homotopy exact sequence
Recall the canonical outer action of Gal (k̄|k) on π1 (X, x̄): it is induced by the
action of π1 (X, x̄) on itself via conjugation. It thus yields a well-defined action of
of Gal (k̄|k) on π1ab (X). Moreover, if we equip π1ab (X) with the trivial Gal (k̄|k)-
action, the map π1ab (X) → π1ab (X) is compatible with the action of Gal (k̄|k).
It thus factors through the maximal quotient π1ab (X)Gal (k̄|k) of π1ab (X) invariant
under the action of Gal (k̄|k), the coinvariants of the Galois action. We obtain an
exact sequence
Remark 5.8.12 In the case when X has a k-rational point the exact sequence
(5.5) has a section, so we obtain a semidirect product decomposition
π1 (X, x̄) ∼
= π1 (X, x̄) ⋊ Gal (k̄|k).
5.8 The Abelianized Fundamental Group 203
In fact, the p-part of the kernel is also finite, but we shall not prove this; see
Remark 5.8.15 below.
Proof: It suffices to show that the coinvariants of the action of Gal (F|F) on the
Tate module Tℓ (AlbX̄ ) are finite for all ℓ 6= p and trivial for all but finitely many
ℓ. As Gal (F|F) ∼ b topologically generated by the Frobenius automorphism
= Z,
p
F : x 7→ x , and the Galois action is continuous, we may identify Tℓ (AlbX̄ )Gal (F|F)
with the cokernel of the Zℓ -linear automorphism F − id of Tℓ (AlbX̄ ). Now recall
from Remark 5.6.11 that Tℓ (AlbX̄ ) is a finitely generated free Zℓ -module. It follows
that the map
has finite kernel (resp. cokernel) if and only if the induced map
Proof: First we treat the case S = Spec (K). As above, we have to show that
the coinvariants of the action of Gal (K|K) on π1ab (X) are finite. By a similar
argument as in the proof of Lemma 5.6.2 we find an integrally closed domain A
finitely generated over Z with fraction field K such that X → S extends to a proper
smooth morphism X → Spec (A) with generic fibre X and all fibres geometrically
integral. Pick a point Q of codimension 1 in Spec (A) and a normal closed point P
on the closure Y of Q in Spec (A). Let B be the completion of the local ring OY,P ;
it is a complete discrete valuation ring with residue field κ(P ). Denote by p the
residue characteristic, and fix geometric points η̄ (resp. s̄) lying above the generic
(resp. closed) point of Spec (B). Theorem 5.7.10 implies that the specialization
′ ′
map sp : π1 (Xη̄ )ab,(p ) → π1 (Xs̄ )ab,(p ) on maximal abelian prime-to-p quotients is
an isomorphism. The geometric point η factors as η : Spec (Ω) → Spec (K) →
Spec (K), where K is an algebraic closure of K in Ω. Denoting by ηK the geometric
point Spec (K) → Spec (A), we obtain from Proposition 5.6.7 an isomorphism
′ ∼ ′
π1 (Xη̄ )ab,(p ) → π1 (Xη̄K )ab,(p ) through which the map sp factors. We conclude
′ ∼ ′
that there is an isomorphism π1 (Xη̄K )ab,(p ) → π1 (Xs̄ )ab,(p ) .
∧
Denote by K ∧ the fraction field of B. The Galois group Gal (K |K ∧ ) may
′
be identified with a subgroup D ⊂ Gal (K|K), and its action on π1 (Xη̄ )ab,(p )
′
corresponds to the action of D on π1 (Xη̄K )ab,(p ) . It follows that there exists a
surjective map
ab,(p′ ) ab,(p′ )
π1 (Xη̄ ) ∧ ։ π1 (Xη̄K )Gal (K|K) . (5.7)
Gal (K |K ∧ )
∧ ′
But Gal (K |K ∧ ) acts on π1 (Xη̄ )ab,(p ) via its quotient π1 (Spec (B), η̄) because
′ ∼ ′
of the isomorphism π1 (Xη̄ )ab,(p ) → π1 (XSpec (B) )ab,(p ) factoring the specialization
∧
isomorphism sp, where B is the integral closure of B in K . On the other hand,
the choice of a path between s̄ and η̄ yields a map
Gal (κ(P )|κ(P )) ∼
= π1 (Spec (κ(P ), s̄) → π1 (Spec (B), η̄).
Putting all the above together, we conclude that there is a surjective map
ab,(p′ ) ab,(p′ )
π1 (Xs̄ ) ։ π1 (Xη̄ ) ∧ . (5.8)
Gal (κ(P )|κ(P )) Gal (K |K ∧ )
But the first group here is finite, as we saw in the proof of the previous propo-
sition. We conclude from the surjections (5.7) and (5.8) that the coinvariants of
′
Gal (K|K) on π1 (X)ab,(p ) are finite. Working with another closed point of residue
characteristic different from p we obtain that the coinvariants on π1 (X)ab,(p) are
finite as well. This concludes the proof of the case S = Spec (K).
To treat the general case, fix a geometric point x̄ of X and consider the com-
mutative diagram
φ
−→ π1ab (Spec (K)) −−−→ 0
π1ab (Xx̄ ) −−−→ π1ab (X ×S Spec (K)) −−K∗
y id y y
φ∗
π1ab (Xx̄ ) −−−→ π1ab (X) −−−→ π1ab (S) −−−→ 0
5.8 The Abelianized Fundamental Group 205
whose exact rows result from Proposition 5.6.4 after abelianization. A diagram
chase shows that there is a surjective map ker(φK∗ ) ։ ker(φ∗ ), and we reduce to
the special case proven above.
Remark 5.8.15 If we only assume that the field K is finitely generated over its
prime field, the above proof shows that the maximal prime-to-p quotient of ker(φ∗ )
is finite, where p is the characteristic of K. An additional argument sketched in
[41] shows that the p-part is finite as well. Katz and Lang also proved the finiteness
of the maximal prime-to-p quotient of ker(φ∗ ) under the following assumptions: S
is normal, φ is smooth and surjective with geometrically integral generic fibre, and
K is finitely generated over the prime field, of characteristic p ≥ 0.
We close this section with a theorem of Lang about the abelianized fundamen-
tal group of a separated scheme of finite type over Z. When X is such a scheme,
the residue field κ(P ) of every closed point P is finite by Corollary 4.1.14. The
fundamental group of Spec (κ(P )) is the absolute Galois group of κ(P ). It is iso-
morphic to Z,b and a topological generator is given by the Frobenius automorphism
FP of a fixed algebraic closure κ(P ).
Given a geometric point x̄ of X, the morphism Spec (κ(P )) → X induces a
continuous homomorphism Gal (κ(P )|κ(P )) → π1 (X, x̄). Its definition involves
the choice of a path between the geometric points Spec (κ(P )) and x̄ of X, hence
it is only defined up to inner automorphism. The element FP thus gives rise to a
conjugacy class of elements in π1 (X, x̄) called Frobenius elements.
When we compose with the projection π1 (X, x̄) → π1ab (X), the resulting map
Gal (κ(P )|κ(P )) → π1ab (X) becomes well-defined, and each FP maps to a unique
Frobenius element in π1ab (X).
Facts 5.8.17 Given a separated scheme X of finite type over Z, its zeta function
ζX is defined by the Euler product
Y 1
ζX (s) =
P ∈X0
(1 − NP )−s
(where X is the spectrum of the ring of integers in an algebraic number field; see
e.g. [50], Chapter VIII, Theorem 5), using estimates on the number of points on
varieties over number fields. When X is actually of finite type over a finite field,
much more is true: ζX is a rational function (Dwork’s theorem), there are simple
poles at s = 1 and s = d, and for X smooth and projective the other poles are
described by Deligne’s theorem (ex Weil conjecture). See e.g. [59], Section VI.12.
Remark 5.8.18 Using a more powerful analytic result one obtains a non-commuta-
tive generalization of the theorem. Namely, a consequence of the generalized Cheb-
otarev density theorem ([89], Theorem 6) can be stated as follows. Assume given a
finite étale Galois cover Y → X of integral separated schemes of finite type over Z.
Then each subset C of the Galois group G that is stable by conjugation contains
the image of a conjugacy class of Frobenius elements in π1 (X, x̄). Consequently,
the union D of all conjugacy classes of Frobenius elements in π1 (X, x̄) meets every
single coset of every open normal subgroup, and D is therefore dense in π1 (X, x̄).
Note that this gives a stronger result even for π1ab (X): already the subset of the
Frobenius elements is dense, not just the subgroup they generate.
Remark 5.8.19 Lang’s theorem is the starting point of unramified class field
theory for schemes of finite type over Z. We review here the main statements. Let
X be a regular integral proper scheme over Z. Denote by Z0 (X) the free abelian
group with basis the closed points of X; it is called the group of zero-cycles on
X. By the discussion above, functoriality of the abelianized fundamental group
induces a well-defined homomorphism
In the case when the map X → Spec (Z) factors through the spectrum of
a finite field F, the map (5.9) factors through the quotient CH0 (X), yielding a
reciprocity map ρX : CH0 (X) → π1ab (X). Moreover, the natural degree map
CH0 (X) → Z induced by sending a zero-cycle to the sum of its coefficients sits in
a commutative diagram
ρ
X
CH0 (X) −−−→ π1ab (X)
y y
Z −−−→ Gal (F|F).
Under the assumptions that X is projective and geometrically integral, the reci-
procity map ρX induces an isomorphism on the kernels of the vertical maps, which
are moreover finite abelian groups.
In the case when the map X → Spec (Z) is surjective, one works with a slight
g
modification of the reciprocity map. Namely, one considers the quotient π ab (X) of
1
π1ab (X) that classifies finite étale abelian Galois covers Y → X with the property
that for each R-valued point Spec (R) → X the base change Y ×X Spec (R) is
a finite disjoint union of copies of Spec (R). From (5.9) one then derives a map
CH (X) → π gab (X) that is actually an isomorphism of finite groups.
0 1
The above statements are due to Lang, Bloch, Kato and Saito. See [78] for a
detailed survey, as well as [83], [84] for generalizations to the tame fundamental
group.
Exercises
1. Let A be a commutative ring with unit. Prove that Spec (A) is a connected
affine scheme if and only if A contains no idempotent other than 0 and 1.
2. Let G be a profinite group, and let F be the forgetful functor from the
category of finite continuous G-sets to the category of sets mapping a G-set
to its underlying set. Prove that Aut(F ) ∼= G. [Hint: Begin with the case
of finite G.]
3. Show that for a connected scheme S the category of inverse systems (Pα , φαβ )
indexed by Λ, with Λ and the Pα as in the proof of Proposition 5.4.6, is
equivalent to the category of fibre functors Fibs̄ at geometric points of S.
π1 (X, x̄) ∼
= lim π1 (XL , x̄),
←
the inverse limit being taken over all finite Galois extensions of k con-
tained in k̄.
(b) For each L|k as above construct an exact sequence
(c) Give another proof of Proposition 5.6.1 using the above two statements.
7. Let A be a complete local ring with maximal ideal M and residue field k
∼
(recall that completeness means A → lim A/M i ). We say that an A-algebra
←
B is finite étale if the induced morphism of schemes Spec (B) → Spec (A) is.
(a) Show that Spec (B) is connected if and only if Spec (B ⊗A k) is.
[Hint: Observe that the natural maps Spec (B/M i B) → Spec (B/M B)
are identity maps on the underlying topological spaces, and apply Ex-
ercise 1.]
(b) Show that for every finite separable field extension L|k there is a finite
étale A-algebra B with B ⊗A k ∼= L.
(c) Conclude that the natural morphism π1 (Spec (k), s̄) → π1 (Spec (A), s̄)
is an isomorphism for a geometric point s̄ lying above the closed point of
Spec (A). In particular, π1 (Spec (A), s̄) = {1} if k is separably closed,
and π1 (Spec (A), s̄) ∼
=Zb if k is finite.
(d) Conclude that the natural functor FetSpec (A) → FetSpec (k) induces an
equivalence of categories.
[Remark: The statements of this exercise hold more generally for so-called
Henselian local rings. See [59], §I.4.]
Exercises 209
defines a finite étale cover of A1K that does not arise by base change from a
finite étale cover of A1k .
9. Check that if k is an algebraically closed field, then π1 (Pnk , x̄) = {1} for all
n > 0 and all geometric points x̄.
(a) Let X be a proper regular integral scheme over a field k, and let
ρ : X 99K Y be a k-rational map to a normal scheme Y of finite type
over k. Show that ρ induces a well-defined map π1 (X, x̄) → π1 (Y, ρ ◦ x̄)
for every geometric point x̄ of X for which ρ ◦ x̄ is defined.
[Hint: Use Zariski–Nagata purity and the fact that under the above as-
sumptions ρ is defined outside a closed subset of codimension at least 2.]
(b) Conclude that a birational map between proper regular k-schemes in-
duces an isomorphism on their fundamental groups.
11. Let X be a proper normal integral scheme over an algebraically closed field
k for which there exists a rational map Pnk 99K X with dense image such
that the induced extension of function fields k(Pn )|k(X) is separable. Prove
that the fundamental group of S is finite.
[Hint: Use Zariski–Nagata purity to show that the function field of every
connected finite étale cover of X can be embedded in the finite extension
k(Pn )|k(X).]
Remark. An X having the property of the exercise is called separably uni-
rational. In fact, every separably unirational smooth proper k-scheme has
trivial fundamental group. In characteristic 0 this is an old result due to
Serre [88] who used Hodge theory. The general case was settled only recently
by Kollár using a clever deformation trick and a powerful theorem of de Jong
and Starr (see [11], Corollaire 3.6). The argument works more generally for
so-called separably rationally connected schemes.
210 Chapter 5. Fundamental Groups of Schemes
Chapter 6
The theory of the last chapter established an equivalence between the category
of finite étale covers of a connected scheme and the category of finite continuous
permutation representations of its algebraic fundamental group. We shall now
study a linearization of this concept, also due to Grothendieck and developed in
detail by Saavedra [81] and Deligne [14]. The origin is a classical theorem from the
theory of topological groups due to Tannaka and Krein: they showed that one may
recover a compact topological group from the category of its continuous unitary
representations. In Grothendieck’s algebraic context the group is a linear algebraic
group, or more generally an affine group scheme, and one studies the category of
finite dimensional representations. The key features that enable one to reconsti-
tute the group are the tensor structure on this category and the forgetful functor
that sends a representation to its underlying vector space. Having abstracted the
conditions imposed on the category of representations, one gets a theorem stating
that a category with certain additional structure is equivalent to the category of
finite dimensional representations of an affine group scheme. This can be applied
in several interesting situations. We shall discuss in some detail the theory of dif-
ferential Galois groups, and also Nori’s fundamental group scheme that creates a
link between the algebraic fundamental group and Tannakian theory.
We only treat so-called neutral Tannakian categories, but the reader familiar
with Grothendieck’s descent theory will have no particular difficulty afterwards in
studying the general theory of [14]. Non-commutative generalizations have also
been developed in connection with quantum groups; as samples of a vast literature
we refer to the books of Chari–Pressley [9] and Majid [55].
211
212 Chapter 6. Tannakian Fundamental Groups
Remark 6.1.2 In the last chapter we defined a group scheme over k as a group
object in the category of k-schemes, i.e. a k-scheme G together with k-morphisms
m : G × G → G (‘multiplication’), e : Spec (k) → G (‘unit’) and i : G → G (‘in-
verse’) subject to the usual group axioms. These morphisms induce a group struc-
ture on the set G(S) := Homk (S, G) of k-morphisms into G for each k-scheme S.
Therefore the contravariant functor S 7→ Homk (S, G) on the category of k-schemes
represented by G is in fact group-valued. Restricting it to the full subcategory of
affine k-schemes we obtain a covariant functor R 7→ Homk (Spec (R), G). Propo-
sition 5.1.5 shows that when G = Spec (A) is itself affine, this is none but the
functor above.
inverse i : G → G ↔ antipode ι : A → A
The group axioms imply compatibility conditions for ∆, ε and ι by the uniqueness
statement of the Yoneda lemma. Below we indicate diagram translations of the
associativity, unit and inverse axioms for groups on the left hand side, and the
corresponding compatibility conditions on A on the right hand side. They are
called the coassociativity, counit and antipode (or coinverse) axioms, respectively.
id×mult id⊗∆
G×G×G - G×G A⊗A⊗A A⊗A
6 6
mult×id mult ∆⊗id ∆
? ?
G×G
mult - G A⊗A
∆
A
6.1 Affine Group Schemes and Hopf Algebras 213
id×e id⊗ε
G - G×G A A⊗A
@ 6@
I
@ 6
e×id @ id
mult ε⊗id @
id
∆
@ @
? @
R ?
@ @
mult-
A⊗A
∆
G×G G A
id×i m◦(id⊗ι)
G - G×G A A⊗A
@ 6@
I
@ 6
i×id @ c
mult m◦(ι⊗id) @
γ
∆
@ @
? @
R ?
@ @
mult-
A⊗A
∆
G×G G A
In the two last diagrams c is the constant map G → {e} on the left hand side, γ
the composite A → k → A and m : A ⊗k A → A the algebra multiplication on the
right hand side. To see that they indeed correspond, observe that m corresponds
to the diagonal map G → G × G by the Yoneda Lemma.
A not necessarily commutative k-algebra equipped with the above additional
structure and satisfying the three axioms is called a Hopf algebra. Hopf algebras
coming from affine group schemes are always commutative, but interesting non-
commutative Hopf algebras arise, for instance, in the theory of quantum groups.
Remark 6.1.3 In calculations it is often useful to write down the HopfPalgebra ax-
ioms explicitly for concrete elements. For instance, if we write P∆(a) = P ai ⊗ bi for
the comultiplication map, then the counit
P axiom says
P a = ε(a )b
i i = ai ε(bi ),
and the antipode axiom says ε(a) = ι(ai )bi = ai ι(bi ). Patient readers will
write out the coassociativity axiom.
Tautologically, the category of commutative Hopf algebras over k is anti-
equivalent to that of affine group schemes over k. Here are some basic examples
of affine group schemes and their Hopf algebras.
Examples 6.1.4
1. The functor R 7→ Ga (R) mapping a k-algebra R to its underlying additive
group R+ is an affine group scheme with coordinate ring k[x], in view of the
functorial isomorphism R+ ∼ = Homk (k[x], R). The comultiplication map on
k[x] is given by ∆(x) = 1 ⊗ x + x ⊗ 1, the counit is the zero map, and the
antipode is induced by x 7→ −x.
2. Similarly, the functor R 7→ Gm (R) sending a k-algebra R to the subgroup
R× of invertible elements is an affine group scheme with coordinate ring
k[x, x−1 ], because an invertible element in R corresponds to a k-algebra
homomorphism k[x, x−1 ] → R. On the coordinate ring the comultiplication
map is induced by ∆(x) = x ⊗ x, the counit sends x to 1, and the antipode
is induced by x 7→ x−1 .
214 Chapter 6. Tannakian Fundamental Groups
4. Here is a link with the classical theory of linear algebraic groups. Assume
k is algebraically closed. Observe that in the previous example we real-
ized GLn (k) as a closed subvariety of affine n2 + 1-space; in particular, it
inherits a Zariski topology. Let G(k) ⊂ GLn (k) be a closed topological sub-
group. By Proposition 4.2.10 this inclusion corresponds to a surjective map
O(GLn ) → O(G) of coordinate rings. Here the k-algebra O(GLn ) is just the
algebra A of the previous example; in particular, it carries a Hopf algebra
structure. But then the quotient map O(GLn ) → O(G) induces a Hopf alge-
bra structure on O(G), because the Hopf algebra structure on A corresponds
to maps GLn (k) × GLn (k) → GLn (k), k → GLn (k) and GLn (k) → GLn (k)
via Proposition 4.2.10, and G(k) is a subgroup in GLn (k). The affine group
scheme associated with the Hopf algebra O(G) is the group scheme deter-
mined by the linear algebraic group G(k).
As concrete examples, one may associate affine group schemes with the other
classical groups SLn , On , etc. The construction generalizes to general base
fields: an affine group scheme G embeds as a closed subgroup scheme in
GLn if there is a morphism of group-valued functors such that the induced
map O(GLn ) → O(G) on Hopf algebras is surjective. Thus Hopf algebra
quotients of O(GLn ) correspond to closed subgroup schemes in general.
Let us now forget about the k-algebra structure on Hopf algebras for a while.
We then obtain the following more general notion.
In this definition the maps ∆ and ι are only assumed to be maps of k-vector
spaces. Coalgebras over k form a category: morphisms are defined as k-linear
maps compatible with the k-coalgebra structure.
We now define right comodules over a coalgebra by dualizing the notion of left
modules over a k-algebra B. Observe that to give a unitary left B-module is to
6.1 Affine Group Schemes and Hopf Algebras 215
where ǫ : k → B is the natural map sending 1 to the unit element of B. The first
diagram here corresponds to the axiom (b1 b2 )v = b1 (b2 v) for bi ∈ B and v ∈ V ,
and the second to 1 · v = v. The dual notion for k-coalgebras is the following.
commute.
X
ε(ai )mi = m. (6.3)
i
Proof: For (1), note first that by the k-linearity of ρ : M → M ⊗k A the k-linear
span of finitely many subcomodules of M is again a subcomodule. Therefore it
is enough to prove the case n = 1 of the first statement.
P Fix a (possibly infinite)
k-basis B of A. For m ∈ M we may write ρ(m) = P mi ⊗ ai with mi ∈ M and
ai ∈ B (finite sum). Therefore (ρ ⊗ idA )(ρ(m)) = i ρ(mi ) ⊗ ai .POn the other
comodule axiom we must have (ρ ⊗ idA )(ρ(m)) = i mi ⊗ ∆(ai ).
hand, by the firstP
Writing ∆(ai ) = j,k λijk (aj ⊗ ak ), we obtain (after changing running indices)
X X X
ρ(mk ) ⊗ ak = mi ⊗ λijk (aj ⊗ ak ),
k i j,k
for all k. This implies that the k-span of m and the mi is a finite dimensional
subcomodule of M .
To prove (2) we again reduce to the case n = 1. By part (1) for fixed a ∈ A we
find a finite dimensional k-subspace N ⊂ A containing P a with ∆(N ) ⊂ N ⊗k A.
Fix a k-basis e1 , . . . , en of N , and write ∆(e
Pi ) = j j ⊗ cij with some cij ∈ A.
e
By formula (6.4) above we have ∆(cij ) = l clj ⊗ cil , therefore the k-span of the
finitely many elements ej and cij is a subcoalgebra containing a.
Proposition 6.1.9 The above construction gives a bijection between right comod-
ules over the commutative Hopf algebra A and left representations of the corre-
sponding affine group scheme G.
Combining the above corollary with Proposition 6.1.8 is often useful in calcu-
lations. Here is such an application that will be needed later.
is exact.
Proof: The first comodule axiom implies that the sequence is a complex, and
the second
P axiom implies the injectivity of ρ. It remains to see that each element
α = mi ⊗ ai in the kernel of ρ ⊗ idA − idM ⊗ ∆ is in the image of ρ. Using
Proposition 6.1.8 we find a finite dimensional subcomodule M ′ ⊂ M and a finite
dimensional subcoalgebra A′ ⊂ A with α ∈ M ′ ⊗k A′ . Thus we reduce to the
case when A and M are finite dimensional over k. Taking k-linear duals we then
obtain a finite dimensional k-algebra B = A∗ , a left B-module N = M ∗ , and the
sequence becomes
ρ∗
B ⊗k B ⊗k N → B ⊗k N −→ N → 0.
Here ρ∗ is the map giving the left B-module structure on N , whereas the unnamed
map is the difference of the maps b ⊗ b′ ⊗ n 7→ b(b′ ⊗ n) and b ⊗ b′ ⊗ n 7→ (bb′ ) ⊗ n.
Exactness of this sequence is a tautology, and the dual exact sequence is (6.5) by
Corollary 6.1.11.
In other words, for each k-vector space V there are functorial isomorphisms
∼
Hom(A, V ) → Hom(ω, ω ⊗ V ). (6.6)
Nρ id⊗Φ id⊗ε⊗id ∼
N → N ⊗ B −→B N ⊗k B ⊗k V −→ N ⊗k k ⊗k V → N ⊗k V,
220 Chapter 6. Tannakian Fundamental Groups
where we have omitted the ω’s to ease notation. To prove it, notice first that the
map idN ⊗ ∆ : N ⊗k B → N ⊗k B ⊗k B defines a right B-comodule structure
on the k-vector space N ⊗k B. The fact that N is a B-comodule means precisely
that ρN : N → N ⊗k B is a morphism of B-comodules. As Φ is a morphism of
functors, we have a commutative diagram
N ρ id⊗ε ∼
=
N −−−→ N ⊗k B −−−→ N ⊗k k −−−→ N
ΦN y
Φ N ⊗k B y
ΦN y
ρ ⊗id id⊗ε⊗id ∼
=
N ⊗k V −−N−−→ N ⊗k B ⊗k V −−−−−→ N ⊗k k ⊗k V −−−→ N ⊗k V
where the composites of the maps in the horizontal lines are identity maps by the
second comodule axiom. It then suffices to see that the second vertical map equals
idN ⊗ ΦB . This holds because choosing a k-basis of N identifies N ⊗k B as a
B-comodule with a finite direct sum of copies of B, and Φ commutes with direct
sums.
Proposition 6.2.3 The underlying k-vector space of A⊗k A represents the functor
V 7→ Hom(ω⊗ω, ω⊗ω⊗V ) on the category Veck . In particular, we have a bijection
∼
Hom(A ⊗k A, A) → Hom(ω ⊗ ω, ω ⊗ ω ⊗ A).
Proof: We have just seen that each k-linear map A ⊗k A → V induces a k-linear
map ω(M ) ⊗k ω(N ) → ω(M ) ⊗k ω(N ) ⊗k V functorial in V for each pair (M, N )
of objects of Comodf A . This gives a morphism of functors Hom(A ⊗k A, ) →
Hom(ω⊗ω, ω⊗ω⊗ ). The proof that it is an isomorphism is completely analogous
to the proof of the previous proposition.
where the map in the middle is the one defining the comodule structure of M ⊗m N .
As we have seen in the proof of Proposition 6.2.1, it is induced by the morphism
of functors ω → ω ⊗ A corresponding to the identity map of A.
Corollary 6.2.4
We now turn to the unit element e for the multiplication in Hopf algebras. It
is determined by the morphism k → A sending 1 to e that we also denote by e. As
in the definition of a Hopf algebra we imposed that the counit map is a k-algebra
homorphism, we now have to require dually that e : k → A be compatible with
the coalgebra structures on k an A, where k is equipped with the comultiplication
sending 1 to 1 ⊗ 1. This holds if and only if ∆(e) = e ⊗ e, which is also precisely
the condition for e to equip k with a right comodule structure. Now we have:
∼ M ρ ⊗e id⊗m
M → M ⊗k k −→ M ⊗ A ⊗k A → M ⊗k A.
τV,W : V ∗ ⊗k W → Hom(V, W )
φ⊗ι ∼
M → M ⊗k A −→ k ⊗k A → A,
Here the tensor products are equipped with the A-comodule structure coming
from the multiplication of A, and k with the one coming from the unit element
1 ∈ A.
Proof: This is just calculation with the axioms. We first check that ρ∗ defines an
A-comodule structure on M ∗ . The first diagram in (6.1) for M ∗ can be rewritten
as
ρ∗
M∗ −−−→ Hom(M, A)
ρ∗ y
y
Hom(M, A) −−−→ Hom(M, A ⊗k A)
where the right vertical map is induced by ∆ and the bottom map is defined
similarly as ρ∗ , with A in place of k. With the notation of Remark 6.1.7 the
∗
P of the upper and right maps in the diagram maps φ ∈ M to the map
composite
m 7→ P i,l φ(mi )ι(ail ) ⊗ ι(bl ), and the composite of the left and bottom maps to
m 7→ i,j φ(mij )ι(cj ) ⊗ ι(ai ). Equality of the two
P follows from equation (6.2) of
Remark 6.1.7. Finally the counit axiom φ(m) = i φ(m)ε(ι(ai )) follows from (6.3)
and the identity ε ◦ ι = ε, which is an easy consequence of the antipode axiom.
224 Chapter 6. Tannakian Fundamental Groups
M ⊗k M ∗
ǫ - k
ρ⊗ρ∗ e
? ?
M ⊗k A ⊗k Hom(M, A) - A ⊗k A m - A
as required. The proof for compatibility with the coevaluation map is similar.
id⊗δ =
ω(M ) −→ω(M ) ⊗k ω(M )∗ ⊗k ω(M ) →
id⊗ρM ∗ ⊗id
→ ω(M ) ⊗k ω(M ∗ ) ⊗k ω(M ) −→
∼
=
→ ω(M ) ⊗k ω(M ∗ ) ⊗k A ⊗k ω(M ) →
=
→ ω(M ) ⊗k ω(M ∗ ) ⊗k ω(M ) ⊗k A →
ǫ⊗id
→ ω(M ) ⊗k ω(M )∗ ⊗k ω(M ) ⊗k A −→ ω(M ) ⊗k A.
6.2 Categories of Comodules 225
Here the equality ω(M ∗ ) = ω(M )∗ that we used twice is of course a tautology,
but we wrote it out explicitly, because for later use it is important to note that ω
transforms duals in Comodf A (which exist by assumption) to duals in Vecf k .
By Proposition 6.2.1 the resulting morphism of functors ω → ω ⊗ A yields a
map of comodules ι : A → A. It remains to show that it satisfies the antipode
axiom, i.e. that the composite map
∆ id⊗ι m
A → A ⊗k A −→ A ⊗k A → A
is the identity. Again by Proposition 6.2.1 this is equivalent to saying that for
all M in Comodf A with comodule structure map ρM : ω(M ) → ω(M ) ⊗k A the
composite map
M ρ id⊗∆ id⊗id⊗ι
ρ1 : ω(M ) → ω(M ) ⊗k A −→ ω(M ) ⊗k A ⊗k A −→
id⊗m
−→ω(M ) ⊗k A ⊗k A −→ ω(M ) ⊗k A
equals ρM . This we shall check in two steps. First we shall show that ρ1 equals
the composite map
id⊗∆ id⊗id⊗ι
ω(M ) ⊗k A −−−→ ω(M ) ⊗k A ⊗k A −−−−−→ ω(M ) ⊗k A ⊗k A
commutes: the first square by the comodule axiom, and the second by construction.
Therefore ρ1 equals the composite map
by the very definition of ι. Thus to show ρ′1 = ρ2 it remains to note the commu-
tativity of the diagram
ǫ⊗id
ω(M ) ⊗k ω(M )∗ ⊗k ω(M ) ⊗k A −−−→ ω(M ) ⊗k A
ρM ⊗idy
id⊗ρ ⊗id
y M
ǫ⊗id
ω(M ) ⊗k ω(M )∗ ⊗k ω(M ) ⊗k A ⊗k A −−−→ ω(M ) ⊗k A ⊗k A
id⊗my yid⊗m
ǫ⊗id
ω(M ) ⊗k ω(M )∗ ⊗k ω(M ) ⊗k A −−−→ ω(M ) ⊗k A
which follows from the compatibility of the evaluation map ǫ with the comodule
structure and the fact that the composite of the right vertical maps is the identity,
the map m being a morphism of comodules.
Finally, to show that ρ2 = ρM , note that by definition of the comodule structure
on M ∗ ⊗m M and the compatibility of ω with tensor products the map ρ2 equals
the composite
id⊗δ ∼ ρM ⊗m M ∗ ⊗id
ω(M ) −→ω(M ) ⊗k ω(M ∗ ⊗m M ) → (ω(M ⊗m M ∗ ) ⊗ ω(M ) −→
ǫ⊗id
→ ω(M ⊗m M ∗ ) ⊗k ω(M ) ⊗k A −→ ω(M ) ⊗k A.
Using the compatibility of ǫ with the comodule structure we may rewrite this map
as the composite
id⊗δ ǫ⊗id ρ
ω(M ) −→ ω(M ⊗m M ∗ ⊗m M ) −→ ω(M ) →
M
ω(M ) ⊗k A,
but the composite of the first two maps is the identity by the first diagram in (6.8)
(we have dropped tensorizations with k throughout to ease notation).
Φ Φ
? ?
X ⊗ ((Y ⊗ Z) ⊗ W ) (X ⊗ Y ) ⊗ (Z ⊗ W )
HH
HH
id⊗Φ
HH Φ
HH
j
X ⊗ (Y ⊗ (Z ⊗ W ))
Remark 6.3.1 For each object X in C there exist canonical functorial isomor-
1 : 1⊗X → ∼ 1 : X ⊗ 1 → ∼
phisms αX X and βX X; in particular, the functors
X 7→ 1 ⊗ X and X 7→ X ⊗ 1 induce category equivalences of C with itself. To
1 , start with the isomorphism ν ⊗ id : 1 ⊗ 1 ⊗ X → ∼
construct αX X 1 ⊗ X, and then
define αX as the morphism 1 ⊗ X → X that induces ν ⊗ idX via tensoring by 1
on the left. Such an αX 1 exists and is unique as the functor X 7→ 1 ⊗ X is fully
In what follows we shall assume that all tensor categories under consideration
have a unit object. Such tensor categories are often called monoidal categories; this
terminology goes back to MacLane. (There is no universally accepted terminology
concerning tensor categories in the literature; here we are adopting that of [42].)
A commutativity constraint on a tensor category is an isomorphism Ψ of func-
tors from C × C to C given on a pair (X, Y ) of objects by
∼
ΨX,Y : X ⊗ Y → Y ⊗ X
such that ΨY,X ◦ ΨX,Y = idX⊗Y for all X, Y . A tensor category C is commutative
if there is a commutativity constraint on C so that the diagram
X ⊗ (Y ⊗ Z)
Q
id⊗Ψ Q
QΦ
Q
+ QQ
s
X ⊗ (Z ⊗ Y ) (X ⊗ Y ) ⊗ Z
Φ Ψ
? ?
(X ⊗ Z) ⊗ Y Z ⊗ (X ⊗ Y )
Q
Q
Q
Ψ⊗id Q Φ
QQ
s
+
(Z ⊗ X) ⊗ Y
commutes for each triple (X, Y, Z) of objects in C. This compatibility is called the
hexagon axiom.
A tensor functor between two tensor categories C and C ′ is a functor F : C → C ′
together with an isomorphism Λ of functors from C × C to C ′ given on a pair (X, Y )
of objects of C by
∼
ΛX,Y : F (X ⊗ Y ) → F (X) ⊗ F (Y )
such that moreover for a triple (X, Y, Z) of objects of C the diagram
commute. Here the isomorphisms are the inverses of the canonical isomorphisms
α1 and β 1 constructed in Remark 6.3.1. The reader will recognize the formalism of
dual vector spaces discussed in the previous section. Note that while the preceding
axioms are satisfied by the usual tensor product of vector spaces, here we have
to restrict to finite dimensional spaces in order to get examples. The following
general lemma implies that up to isomorphism the k-linear dual V ∗ of a finite
dimensional vector space V is the only dual object of V in the tensor category of
finite dimensional vector spaces.
Proof: We fix X and ǫ, and show that X ∗ represents the contravariant functor
Z 7→ Hom(X ⊗ Z, 1), from which the uniqueness statements will follow. For
representability we show that for each map φ : X ⊗Z → 1 we can find ν : Z → X ∗
making the diagram
id⊗ν
X ⊗Z - X ⊗ X∗
φ ǫ
?
+
1
commute. Define ν as the composite
∼ δ⊗id id ∗ ⊗φ ∼
Z
Z → 1 ⊗ Z −−−−→ X ∗ ⊗ X ⊗ Z −−X−−→ X ∗ ⊗ 1 → X ∗
(with some δ as in the definition of a dual). Commutativity of the diagram then
holds because the first diagram in (6.9) commutes.
∼
The map φ 7→ φt induces a bijection Hom(X, Y ) → Hom(Y ∗ , X ∗ ).
A morphism of tensor functors between two tensor categories is a morphism of
functors F → G compatible with the isomorphisms ΛX,Y for F and G occurring
in the definition of tensor functors, and for which the composite isomorphism
∼
1′ ∼
= F (1) → G(1) ∼ = 1′ is the identity of 1′ . An isomorphism of tensor functors
is a morphism as above that has a two-sided inverse that is again a morphism of
tensor functors. We shall need the following lemma later.
Having introduced all this terminology, we can now summarize the discussion
of the previous section in the following theorem.
Theorem 6.3.4 Let A be a coalgebra over a field k, and ω the forgetful functor
from the category Comodf A of right A-comodules finite dimensional over k to the
category Vecf k of finite dimensional k-vector spaces.
Assume that there is a tensor category structure on Comodf A for which ω
becomes a tensor functor when Vecf k carries its usual tensor structure.
of R-modules. We can thus define set-valued functors End(ω) (resp. End⊗ (ω),
Aut⊗ (ω)) on the category of k-algebras by sending R to the set of R-linear functor
morphisms (resp. tensor functor morphisms, tensor functor isomorphisms) ω⊗R →
ω ⊗ R. The functor Aut⊗ (ω) is actually group-valued.
Proof: The proof is in three steps. First we show that we have functorial iso-
morphisms End(ω)(R) ∼ = HomModR (A ⊗k R, R), where A is the Hopf algebra of
G. Note that here we are considering R-module homomorphisms. In fact, we
have HomModR (A ⊗k R, R) ∼ = HomVeck (A, R), the isomorphism being induced by
composition with the map A → A ⊗k R given by a 7→ a ⊗ 1. Similarly, we obtain
HomR (ω ⊗ R, ω ⊗ R) ∼ = Homk (ω, ω ⊗ R). Now we can conclude by Proposition
6.2.1, with the additional remark that it also holds with Veck replaced by the cat-
egory of underlying vector spaces of k-algebras, because all we needed in the proof
is that A is an object of the category.
Next we check that elements of End(ω)(R) lying in End⊗ (ω)(R) correspond to
k-algebra homomorphisms A → R. By definition, a morphism ΦR : ω ⊗R → ω ⊗R
is a morphism of tensor functors if the diagram
Φ
R
ω( ⊗ ) ⊗ R −−−→ ω( ⊗ ) ⊗ R
y y
Φ ⊗Φ
ω( ) ⊗ ω( ) ⊗ R −−R−−−→
R
ω( ) ⊗ ω( ) ⊗ R
commutes, where the vertical maps are induced by the isomorphisms ΛX,Y in the
definition of tensor categories. By Propositions 6.2.1 and 6.2.3 the maps ΦR and
ΦR ⊗ ΦR correspond to k-linear maps λ : A → R and λ ◦ m : A ⊗k A → R,
respectively, where m : A ⊗k A → R is the multiplication map of A. The diagram
thus expresses the fact that λ is a k-algebra homomorphism.
Finally, Lemma 6.3.3 implies that the natural morphism Aut⊗ (ω) → End⊗ (ω)
of functors is an isomorphism.
The proposition shows that one may recover an affine group scheme from the
tensor category of its representations. This is the algebraic analogue of the classical
Tannaka–Krein theorem on topological groups ([35], §30).
Now let G and G′ be affine group schemes. Given a group scheme homo-
morphism φ : G → G′ , every finite dimensional representation of G′ yields a
representation of G via composition with φ. In this way we obtain a tensor func-
tor φ∗ : RepG′ → RepG satisfying ω ◦ φ∗ = ω ′ , where ω ′ is the forgetful functor on
RepG′ .
232 Chapter 6. Tannakian Fundamental Groups
There is also a dual notion of an amalgamated sum that the reader will formulate.
This being said, an additive category is a category A in which each pair of
objects has a product, and moreover the sets Hom(A, B) carry the structure of
an abelian group so that the composition map (φ, ψ) 7→ φ ◦ ψ is Z-bilinear. If
moreover the Hom(A, B) are k-vector spaces over a fixed field k and composition
of maps is k-bilinear, we speak of a k-linear additive category.
Note that in an additive category each set Hom(A, B) has a zero element, i.e.
there is a canonical morphism 0 : A → B between A and B whose composite
with other morphisms is again 0. This allows us to define the kernel ker(φ) of a
morphism φ : A → B (if it exists) as the fibre product of the morphisms φ and
0 : A → B over B. Dually, the cokernel coker (φ) of φ is the amalgamated sum
of φ and 0 over A, or equivalently, the kernel of φ in the opposite category of A.
6.4 Second Interlude on Category Theory 233
Assume that ker(φ) exists. We then define the coimage coim(φ) of φ as the cokernel
of the natural map ker(φ) → A (if it exists). Similarly, assuming the existence of
coker (φ), we define im (φ) as the kernel of the natural map B → coker (φ) (if it
exists). Note that if φ has an image and a coimage, it induces a natural map
coim(φ) → im (φ).
Plainly, the categories of abelian groups, modules over a fixed ring or abelian
sheaves over a topological space are abelian, and the category of vector spaces over
a field k is a k-linear abelian category.
In an abelian category it is customary to speak of direct products and direct
sums instead of products and coproducts. Also, one says that φ : A → B is a
monomorphism (resp. epimorphism) if the morphism ker(φ) → A (resp. B →
coker (φ)) is the zero morphism. We say abusively that A′ is a subobject (resp. A′′
is a quotient) of A if there is a monomorphism A′ → A (resp. an epimorphism
A → A′′ ). We shall often use the notations A′ ⊂ A and A/A′ for subobjects
and quotients. An object A is simple if for each subobject φ : A′ → A the
monomorphism φ is either 0 or an isomorphism. A composition series of an object
A, if it exists, is a descending series A = F 0 ⊃ F 1 ⊃ F 2 ⊃ · · · of subobjects such
that the quotients F i /F i+1 are simple.
We say that A has finite length if it has a finite composition series. One proves
as in the case of abelian groups that in this case every chain of subobjects in A
can be refined to a composition series. Thus A is both Noetherian and Artinian,
i.e. all ascending and descending series of subobjects in A stabilize. Moreover,
all composition series of A are finite of the same length, and the finite set of the
isomorphism classes of the F i /F i+1 is the same up to permutation.
The usual notion of an exact sequence carries over without change to abelian
categories. A functor between abelian categories is said to be exact if it takes
short exact sequences to short exact sequences; there are also the usual weaker
properties of left and right exactness that the appropriate Hom-functors enjoy.
An object P of an abelian category A is projective if the functor Hom(P, ) is
exact. As the Hom-functor is always left exact, this is equivalent to requiring that
given an epimorphism A ։ B, each map P → B can be lifted to a map P → A.
The reader will define the dual notion of injective objects.
An object G of A is a generator if the functor Hom(G, ) is faithful. This
amounts to saying that for each nonzero morphism φ : A → B in A there is
a morphism G → A such that the composite G → A → B is again nonzero.
In the case when G is projective, this is moreover equivalent to the condition
Hom(G, A) 6= 0 for all A 6= 0 (for the nontrivial implication, use the projectivity
of G to lift a nonzero morphism G → im (φ) to a morphism G → A).
234 Chapter 6. Tannakian Fundamental Groups
In the next section we shall need the following variant of a theorem of Mitchell
and Freyd (for the original result, see [24], Exercise 4F).
Proposition 6.4.2 Let A be an abelian category such that every object of A has
finite length. Assume that A has a projective generator P . Then the functor
Hom(P, ) induces an equivalence of A with the category Modf End(P ) of finitely
generated right End(P )-modules.
Lemma 6.4.3 Under the assumptions of the proposition for each A in A there is
an epimorphism P ⊕r → A from a finite direct power of P .
Proof: The left action of End(P ) on P induces a right End(P )-module structure
on Hom(P, A) for each A via composition of maps. To see that we obtain a finitely
generated module, consider an epimorphism P ⊕r → A as in the lemma. Applying
the functor Hom(P, ) and noting the isomorphism End(P )⊕r ∼ = Hom(P, P ⊕r ) we
⊕r
obtain a surjection End(P ) ։ Hom(P, A).
Next we show that Hom(P, ) is fully faithful. As P is a generator by assump-
tion, this boils down to showing that every morphism φ : Hom(P, A) → Hom(P, B)
comes from a morphism A → B. Consider epimorphisms P ⊕r → A, P ⊕s → B
given by the lemma. Applying the functor Hom(P, ) we obtain a diagram
with exact rows. As End(P )⊕r is a free, hence projective End(P )-module, there
is a map ψ : End(P )⊕r → End(P )⊕s of free End(P )-modules making the diagram
commute. Here ψ is defined by multiplication with an r × s matrix of elements
in End(P ). But such a matrix defines a morphism ψ̄ : P ⊕r → P ⊕s that gives
rise to ψ after applying the functor Hom(P, ). By construction, the composite
ψ
map End(P )⊕r → End(P )⊕s → Hom(P, B) annihilates the kernel of the map
End(P )⊕r → Hom(P, A). Therefore by exactness of the functor Hom(P, ) the
ψ̄
composite P ⊕r → P ⊕s → B annihilates the kernel of P ⊕r → A, i.e. induces a map
A → B. This is the map we were looking for.
6.4 Second Interlude on Category Theory 235
Proposition 6.4.4 Keep the assumptions of the previous proposition, and assume
moreover that B is a full subcategory of A closed under subobjects, quotients and
finite direct sums. There exist a uniquely determined ideal I ⊂ End(P ) and an
equivalence of categories between B and the category Modf End(P )/I under which
the inclusion functor B → A becomes identified with the functor Modf End(P )/I →
Modf End(P ) described above.
Remark 6.5.2 The term ‘k-linear abelian tensor category’ involves a compatibil-
ity condition relating the tensor and abelian category structures on C. It means
that the tensor operation
should be k-bilinear with respect to the k-vector space structures on the Hom-sets
involved. This also explains the presence of the condition End(1) ∼ = k. Namely,
∼
the isomorphism 1 ⊗ X → X induces a map End(1) → End(X) that endows the
group Hom(X, Y ) with the structure of an End(1)-module via composition, and
similarly for endomorphisms of Y . We require that the isomorphism End(1) ∼ =k
transforms these to the k-linear structure on Hom(X, Y ). These requirements are
of course satisfied in tensor categories of k-vector spaces.
Given an affine group scheme G over k, the category RepG of finite dimensional
representations of G with its usual tensor structure and the forgetful functor as
fibre functor is a neutral Tannakian category. Conversely, we have:
Proof: Finite direct sums of projective objects are again projective, so we only
have to exhibit a nonzero morphism P → X ′ for each nonzero object X ′ of hXi.
By definition the composition factors of X ′ lie in S, so in particular there is an
epimorphism X ′ → S with [S] ∈ S. We may define an epimorphism P → S
by taking it to be φS on PS and extending by 0 on the other components. By
projectivity of P it lifts to a nonzero morphism P → X ′ , as required.
α∗ : Hom(S, T ) → Hom(E, T )
Let us now return to the k-linear category hXi of Proposition 6.5.5. Let S be
a simple object in hXi, and E → S an essential extension. The following lemma
gives a criterion for E to be projective.
Proof: The right hand side of (6.10) is additive for short exact sequences of the
form
0 → Y ′ → Y → Y ′′ → 0. (6.11)
dimk Homk (E, Y ) ≤ dimk Homk (E, Y ′ ) + dimk Homk (E, Y ′′ ) (6.12)
by left exactness of the Hom-functor; it is an equality for all short exact sequences
(6.11) if and only if the Hom-functor is exact, i.e. E is projective. By the previous
lemma (6.10) holds with equality for all simple objects Y in hXi. For arbitrary Y
we may consider a composition series and conclude from the previous arguments
that (6.10) holds for Y ; moreover it holds with equality for all Y if and only if E is
projective. This shows the first statement of the lemma as well as the equivalence
of (1) and (2). The implication (2) ⇒ (3) is obvious. To prove the converse,
note that given a short exact sequence (6.11), we infer from (6.12) and (6.10) that
there is equality in (6.10) for Y if and only if there is equality for Y ′ and Y ′′ , and
moreover equality holds in (6.12). This shows that equality in (6.10) for Y = X
implies equality for all finite direct powers of X, as well as for their subquotients.
Hom(Pi+1 , X/F i ) is injective. But the dimension of the first k-vector space here
is ℓS (X/F i ) dimk End(S) by the inductive hypothesis, and that of the second is
at most ℓS (X/F i ) dimk End(S) by (6.10) applied with E = Pi+1 and Y = X/F i .
This is only possible if
∼
Hom(Pi , X/Fi ) → Hom(Pi+1 , X/Fi ). (6.14)
From the diagram above we infer that each φj gives rise to a map ψj : Qj → X/F i+1
by base change. As the φj form a k-basis of Homk (Pi , X/F i ), composing the ψj
with the epimorphisms Pi+1 → Qj induces a homomorphism Hom(Pi , X/F i ) →
Hom(Pi+1 , X/F i+1 ) whose composite with the inverse isomorphism of (6.14) yields
the required retraction.
All in all, we obtain that the last map in the exact sequence
is surjective. The dimension of the last term is ℓS (X/F i ) dimk End(S) by the
inductive hypothesis and (6.14), and that of the first is ℓS (F i /F i+1 ) dimk End(S)
by Lemma 6.5.7. The required formula for i + 1 follows.
This completes Step 1 of the proof of Theorem 6.5.4 outlined above, with
R = End(P ). We now turn to Step 2, and show that M = ω(P ) is a good choice
for the R-module M required there. For one thing, there is indeed a natural
left End(P )-module structure on ω(P ), the multiplication End(P ) × ω(P ) → ω(P )
being defined for a pair (φ, a) by ω(φ)(a). The statement of Step 2 is then:
Proposition 6.5.10 Via the category equivalence of the previous corollary the
functor ω becomes isomorphic to the functor mapping a right End(P )-module N
to the underlying k-vector space of N ⊗End(P ) ω(P ).
Proof: For each object A in hXi the rule φ ⊗ x 7→ ω(φ)(x) defines a natural
map Hom(P, A) ⊗End(P ) ω(P ) → ω(A) that is moreover functorial in A. It is
tautologically an isomorphism for A = P and, being compatible with finite direct
sums, for A = P ⊕r for all r > 0. Given an arbitrary object A, we choose an
epimorphism λ : P ⊕r → A with the benediction of Lemma 6.4.3, and consider the
commutative diagram
Hom(P, K) ⊗ ω(P ) −−−→ Hom(P, P ⊕r ) ⊗ ω(P ) −−−→ Hom(P, A) ⊗ ω(P ) −−−→ 0
∼
y y= y
ω(K) −−−→ ω(P ⊕r ) −−−→ ω(A) −−−→ 0
6.5 Neutral Tannakian Categories 241
where K := ker(λ) and tensor products are taken over End(P ). The lower row
is exact by exactness of ω (it is even exact on the left), and the upper row by
projectivity of P and right exactness of the tensor product. We have just seen
that the middle vertical map is surjective, which implies the surjectivity of the
map on the right. As this holds for all A, we get surjectivity of the left vertical
map as well. But then the injectivity of the middle vertical map implies the
injectivity of the one on the right.
We now turn to Step 3 of the proof of Theorem 6.5.4, and use the notation
R := End(P ) and M := ω(P ) from now on. Note first that the tensor product
A := M ∗ ⊗R M makes sense as a k-vector space, because the dual k-vector space
M ∗ carries a right R-module structure induced from that of M . Our next task is
to define a k-coalgebra structure on A.
Quite generally, given a right R-module N , we have a natural map
idN ⊗R M ⊗ δM : N ⊗R M → N ⊗R M ⊗k M ∗ ⊗R M, (6.15)
This completes Step 3 in the proof of Theorem 6.5.4, so it remains to give some
details on the limit procedure of Step 4, which will complete the proof.
Proof of Theorem 6.5.4: Write the category C as a union of the full subcat-
egories hXi for each object X. The system of these subcategories is partially
ordered by inclusion; the partial order is directed, because given two objects X
and Y , both hXi and hY i are full subcategories of hX ⊕ Y i. By Proposition 6.5.5
each hXi is equivalent to a module category Modf R , and by Proposition 6.4.4 a
full subcategory hY i ⊂ hXi corresponds to the category Modf R/I for some ideal
I ⊂ R. From Proposition 6.5.12 we obtain a further equivalence of Modf R and
Comodf A , where A = M ∗ ⊗R M . In particular, by Proposition 6.5.10 we obtain
an A-comodule structure on ω(X). In what follows we write AX in place of A in
order to emphasize the dependence of A on X. The coalgebra AY corresponding
to the full subcategory hY i ∼
= Modf R/I is
Remark 6.5.13 We now briefly discuss the theory of general Tannakian cate-
gories, for which we have to use the language of schemes. Assume that C is, as
before, a rigid k-linear abelian tensor category whose unit 1 satisfies End(1) ∼
= k.
A general (non-neutral) fibre functor is an exact faithful tensor functor ω on C
with values in the k-linear abelian tensor category LFS of locally free sheaves on
a k-scheme S. The pair (C, ω) defines a Tannakian category.
Given a morphism φ : T → S, the functor ω induces a fibre functor φ∗ ω with
values in LFT via X 7→ ω(X)⊗OS OT . Define the functor Aut⊗ k (ω) on the category
of schemes over S ×k S by sending T → S ×k S to the set of OT -linear isomorphisms
∼
of tensor functors π2∗ ω → π1∗ ω, where π1 , π2 : T → S are the composite maps of
T → S ×k S with the two projections S ×k S → S. It can be shown that this
6.5 Neutral Tannakian Categories 243
Lemma 6.5.16 Let K be a field, and G a closed subgroup scheme of GLn over K.
Denote by W the n-dimensional representation of G corresponding to its action on
K n via the inclusion G → GLn . The full Tannakian subcategory hW i⊗ ⊂ RepG is
the whole of RepG .
Remark 6.5.17 As Repf k (Γ) is the direct limit of the full subcategories hV i⊗ ,
the proposition implies the following description of Γalg . Consider pairs (G, φ),
where G is a closed subgroup of GLn (k) for some n, and φ : Γ → G is a morphism
with dense image. The system of the (G, φ) carries a natural partial order where
(G, φ) ≥ (G′ , φ′ ) if there is a morphism λ : G → G′ of algebraic groups with
λ ◦ φ = φ′ . Notice that if λ exists, it is unique by our assumption that φ has dense
image. Thus the (G, φ) together with the λ form a natural inverse system whose
inverse limit is precisely Γalg .
consider Repf π1 (X,x) with its Tannakian structure described in Example 6.5.14.
Consequently, the Tannakian fundamental group of LSX is isomorphic to the al-
gebraic hull of π1 (X, x) over C. Moreover, by Proposition 6.5.15 the Tannakian
fundamental group of each full Tannakian subcategory hLi⊗ of LSX is isomorphic
to the Zariski closure of the image of ρL,x : π1 (X, x) → GLn (C), where ρL,x is the
monodromy representation of L at the point x.
The dual connection to (E, ∇) is (E ∗ , ∇∗ ), the locally free sheaf E ∗ being given by
U 7→ Hom(E|U , O|U ) and ∇∗ over φ ∈ Hom(E|U , O|U ) by
Finally, the fibre functor is given by (E, ∇) 7→ Ex∇ . One checks that the category
equivalence (E, ∇) 7→ E ∇ of Proposition 2.7.5 between ConnD and LSD is compat-
ible with the tensor structures and fibre functors, so the Tannakian fundamental
group of ConnD with respect to the above fibre functor is again the algebraic hull
of π1 (D, x) over C.
The above construction generalizes to arbitrary complex manifolds if one con-
siders flat connections as in Example 2.7.6.
Notice a subtlety in the above argument: it is a priori not obvious at all that
ConnD is a C-linear abelian category, which is part of the definition of Tannakian
categories. We deduced this fact from its equivalence with LSD .
The next example is, in some sense, a differential analogue of Theorem 3.3.7.
We relate holomorphic connections on D to algebraically defined objects over C(z),
the field of meromorphic functions D.
Example 6.6.3 Consider the field C(t) equipped with its usual derivation f 7→ f ′ .
A differential module over C(t) is a finite dimensional C(t)-vector space V together
with a C-linear map ∇ : V → V satisfying
A shortcoming of the above example was that we introduced the fibre functor
on h(V, ∇)i⊗ in a transcendental way, by comparison with the holomorphic theory.
It is natural to ask whether this is possible using a direct construction. After
all, the fibre functor is given by local horizontal sections of the connection, i.e.
solutions of the corresponding differential equation. These are not elements of
C(z) any more, but typically exponential functions. By adjoining the solution
functions to C(z) we obtain a large transcendental extension which, however, can
be studied in a purely algebraic way. This is the purpose of differential Galois
theory, a vast subject of which we now explain the rudiments.
Let K be a field of characteristic 0. A derivation on K is an additive function
∂ : f 7→ f ′ satisfying the Leibniz rule (f g)′ = f ′ g + f g ′ . The pair (K, ∂) is called a
248 Chapter 6. Tannakian Fundamental Groups
differential field. The kernel ker(∂) is a subfield of K; we shall call it the constant
field of (K, ∂) and denote it by k. We define a differential module over (K, ∂) by
direct generalization of the above special case: it is a pair (V, ∇) consisting of a
finite dimensional K-vector space V and a k-linear map ∇ : V → V satisfying
(6.16). The subset V ∇ := ker(∇) is a k-subspace of V ; its elements are called
horizontal vectors. As above, the category DiffmodK of differential modules over
K is a rigid k-linear abelian tensor category, the tensor structure being defined
by (6.17) and (6.18). Given an extension of differential fields (L, ∂) ⊃ (K, ∂) (i.e.
a field extension L|K together with a derivation extending ∂) and a differential
module (V, ∇) over K, there is a natural notion of base change (V ⊗K L, ∇L ),
where ∇L has the same connection matrix as ∇ in a basis of V .
We can now give the formal definition of the extension generated by the local
solutions.
ωL : (W, ∇) 7→ (W ⊗K L)∇L .
Remark 6.6.5 In the last section of [14] Deligne and Bertrand show that Picard–
Vessiot extensions for (V, ∇) correspond bijectively to neutral fibre functors on
h(V, ∇)i⊗ . The proof is not very hard, and uses a construction akin to Lemma
6.7.17 below.
On the other hand, Corollary 6.20 of the same paper implies the much deeper
fact that over an algebraically closed field k a neutral fibre functor always ex-
ists on a rigid abelian k-linear tensor category of the form hXi⊗ that has some
(non-neutral) fibre functor into a category Vecf K for an extension K ⊃ k. As
6.6 Differential Galois Groups 249
the forgetful functor on h(V, ∇)i⊗ is such a fibre functor, we obtain that Picard–
Vessiot extensions always exist for differential modules over differential fields with
algebraically closed constant field (they may not exist in general). In [107], Propo-
sitions 1.20 and 1.22 a simple direct proof is given for this fundamental fact. There
it is also proven that if a Picard–Vessiot extension exists, it is unique up to an
isomorphism of differential fields.
Definition 6.6.6 Assume that there exists a Picard–Vessiot extension (L, ∂) for
(V, ∇). The differential Galois group scheme Gal (V, ∇) of (V, ∇) is defined as the
Tannakian fundamental group of the category h(V, ∇)i⊗ equipped with the above
fibre functor ωL .
Lemma 6.6.7 The functor Gal ∂ (L|K) defines an affine group scheme over k.
Now assume given a differential module (V, ∇) over K, and let L be a Picard–
Vessiot extension for (V, ∇). Observe that if σ : L → L is a K-automorphism of
L commuting with ∂, then the induced automorphism of V ⊗K L preserves the
space (V ⊗K L)∇L of horizontal vectors. This holds for each object of h(V, ∇)i⊗ , so
the fibre functor ωL takes its values not only in the category of finite dimensional
k-vector spaces, but actually in the category of finite dimensional representations
of Gal ∂ (L|K)(k). For each object (W, ∇) of h(V, ∇)i⊗ we have a functor on the
category of k-algebras given by R 7→ (W ⊗K L⊗K R)∇L⊗K R , where the base change
∇L⊗K R is defined in a straightforward way. This shows that ωL induces a functor
from h(V, ∇)i⊗ to the category RepGal ∂ (L|K) of representations of the affine group
scheme Gal ∂ (L|K). It is moreover a tensor functor as ωL is.
We give a proof following [107]. One additional algebraic fact is needed which
does not result from the Tannakian theory.
Fact 6.6.9 Let (K, ∂) be a differential field with algebraically closed constant
field k, and (L, ∂) a Picard–Vessiot extension for the differential module (V, ∇)
over K. Then the fixed field for the action of Gal ∂ (L|K)(k) on L is exactly K.
Thus Picard–Vessiot extensions play the role of Galois extensions in the differential
theory.
For proofs, see [107], Theorem 1.27 (3) or [39], Theorem 5.7.
Proof of Proposition 6.6.8: For fully faithfulness, assume given two objects
(V1 , ∇) and (V2 , ∇) of h(V, ∇)i⊗ . By the definition of duals in a rigid tensor cat-
egory we have Hom((V1 , ∇), (V2 , ∇)) ∼ = Hom(K, (V1∗ ⊗K V2 , ∇)), and similarly for
the corresponding representations of Gal ∂ (L|K), so we may assume that (V1 , ∇) is
K equipped with the trivial connection. Elements of Hom(K, (V2 , ∇)) correspond
bijectively to horizontal elements of V2 . As k is algebraically closed, elements in the
image of Hom(K, (V2 , ∇)) by ωL correspond to elements of (V2 ⊗K L)∇L invariant
by Gal ∂ (L|K)(k). But by the above fact the Gal ∂ (L|K)(k)-invariant elements in
V2 ⊗k L are exactly the elements of V2 , so we again obtain the horizontal elements
of V2 .
For essential surjectivity, note that by property (3) of Picard–Vessiot extensions
the functor ωL maps (V, ∇) to a faithful representation of Gal ∂ (L|K), or in other
6.7 Nori’s Fundamental Group Scheme 251
words the induced map Gal ∂ (L|K) → GLn (k) is injective. We then conclude by
Lemma 6.5.16.
When k is algebraically closed, the group Gal ∂ (L|K)(k) of k-points of the affine
group scheme Gal (M, ∇) ∼ = Gal ∂ (L|K) is called the differential Galois group of
(M, ∇). In this case the above considerations imply an analogue of the main
theorem of Galois theory (Theorem 1.2.5) in the differential context.
Theorem 6.6.10 Let (K, ∂) be a differential field with algebraically closed con-
stant field k, and (L, ∂) a Picard–Vessiot extension for the differential module
(V, ∇) over K. The rule H(k) 7→ LH(k) induces a bijection between closed sub-
groups of Gal ∂ (L|K)(k) and differential subfields of (L, ∂) containing (K, ∂). Here
closed normal subgroups correspond to Picard–Vessiot extensions of (K, ∂), and the
associated differential Galois group is (Gal ∂ (L|K)/H)(k).
Proof: If (M, ∂) is an intermediate differential field between (K, ∂) and (L, ∂),
then (L, ∂) is a Picard–Vessiot extension for the base change (VM , ∇M ). This
identifies Gal ∂ (L|M )(k) with a closed subgroup H(k) of Gal ∂ (L|K)(k) (as it is
closed in GLn (k) with the same GLn as for Gal ∂ (L|K)(k)), and by Fact 6.6.9 we
have LH(k) = M . Similarly, a closed subgroup H(k) ⊂ Gal ∂ (L|K)(k) fixes some
M with Gal (VM , ∇M )(k) ∼ = H(k).
Given a closed normal subgroup scheme H ⊂ Gal ∂ (L|K), fix a representation
ρ in RepGal ∂ (L|K) with kernel H. (Such a representation exists; see e.g. [111],
§16.1.) The full subcategory hρi⊗ is equivalent to RepGal ∂ (L|K)/H by Lemma
6.5.16 on the one hand, and to h(W, ∇)i⊗ for some (W, ∇) in h(V, ∇)i⊗ on the
other hand. As we have already noted, (L, ∂) satisfies defining properties (1) and
(2) of Picard–Vessiot extensions for (W, ∇), therefore by defining property (3) it
contains a Picard–Vessiot extension (M, ∂) for (W, ∇). By construction we have
Gal (W, ∇) ∼= Gal (V, ∇)/H, and Gal ∂ (L|M )(k) ∼ = H(k) by the first part of the
proof.
n
M
f (F) := (F ⊗i )⊕ai .
i=0
Definition 6.7.2 A locally free sheaf E is called finite if there exist polynomials
f 6= g with non-negative integer coefficients such that f (E) ∼
= g(E).
Here and below the term ‘locally free sheaf’ will always mean ‘locally free sheaf
of finite rank’.
To obtain another characterization of finite locally free sheaves, call a locally
free sheaf indecomposable if it is not isomorphic to a direct sum of nonzero locally
free sheaves. For a general locally free sheaf E write I(E) for the set of isomorphism
classes of indecomposable locally free sheaves E ′ for which there exists a locally
free E ′′ with E ∼
= E ′ ⊕ E ′′ .
Proof: Quite generally, for a coherent sheaf F the set of isomorphism classes
of coherent sheaves that are isomorphic to a direct summand of F and cannot be
decomposed in a nontrivial direct sum of two coherent sheaves is finite. This is
a consequence of the Krull–Remak–Schmidt theorem for coherent sheaves which
is proven in the same way as the classical theorem for modules (see [48], Chapter
X, Theorem 7.5). Now if E has only finitely many nonisomorphic indecomposable
direct summands as a coherent sheaf, then it has the corresponding property as a
locally free sheaf.
Denote by S(E) the union of the finite sets I(E ⊗i ) for all i > 0.
Proposition 6.7.4 A locally free sheaf E is finite if and only if S(E) is a finite
set.
Proof: The ‘if’ part is proven using the well-known ‘determinant trick’. Write
L for the free abelian group generated by isomorphism classes of indecomposable
locally free sheaves on S. Those elements in L that have nonnegative coefficients
correspond to isomorphism classes of locally free sheaves, and L carries a structure
of a commutative ring induced by tensor product. Consider the free additive
subgroup LE ⊂ L with basis S(E). It is preserved by the Z-linear map mE given
by multiplication with the class of E. The characteristic polynomial χ is a monic
polynomial in Z[x] with χ(mE ) = 0. Writing χ = f − g, where f and g have
nonnegative coefficients, we obtain polynomials with f (E) = g(E).
Conversely, if there exist such f and g, then plugging the class of E in L into
f − g ∈ Z[x] we obtain 0. Writing d for the degree of f − g, this implies that each
indecomposable direct summand of E ⊗d must be a direct summand of some E ⊗j
for j < d. Applying the above to the polynomials (f − g)xi for i > 0 and using
induction on i, we obtain that each element of S(E) may be represented by a direct
summand of some E ⊗j for j < d. We then conclude by the previous lemma.
Proof: This follows from the proposition because the tensor powers L⊗i are
invertible for all i > 0, and invertible sheaves are indecomposable.
Corollary 6.7.6 The category of finite sheaves is stable under taking direct sums,
direct summands, tensor products of two sheaves and duals.
Here recall that the dual E ∨ of a locally free sheaf E on S is the sheaf U 7→
HomOU (E|U , OS |U ), where the Hom-group means homomorphisms of OU -modules.
It is again a locally free sheaf of the same rank. The easy proof of the corollary is
left to the readers.
254 Chapter 6. Tannakian Fundamental Groups
We now see that the category of finite locally free sheaves on S is a rigid
commutative tensor category with unit OS . If moreover S is equipped with a
morphism S → Spec (k) for a field k and has a k-rational point s : Spec (k) → S,
the rule E 7→ s∗ E yields a faithful tensor functor to the category Vecf k . (Recall
that if Spec (A) ⊂ S is an affine open subset containing the image of s and E = E e
∗
for an A-module E, the pullback s E is just given by the k-vector space E ⊗A k
on the one-point space Spec (k), where the map A → k corresponds to s.) Our
category thus satisfies all requirements for a neutral Tannakian category except
that we don’t know yet whether it is abelian. It will turn out to be abelian when k
is of characteristic 0 and S is proper and integral, but this is not obvious to prove.
Nori’s idea was to use the theory of semi-stable sheaves, so we need first to recall
some facts about these.
Facts 6.7.7 Let S be a scheme, and E aL locally free sheaf on S. Consider the dual
sheaf E ∨ and the OS -algebra S(E ∨ ) := i≥0 (E ∨ )⊗i . Let V(E) → S be the affine
morphism corresponding to S(E ∨ ) via Remark 5.1.24; it is called the vector bundle
associated with E. For an open subset U ⊂ S where E|U is a free sheaf of rank n
we have V(E) ×S U ∼ = U × An ; this is because the tensor algebra of a free module
is a polynomial ring. We have worked with the dual sheaf E ∨ in order to obtain
a covariant functor E 7→ V(E). It is also possible to define a vector bundle as an
affine morphism satisfying such a local triviality condition; see e.g. [34], Exercise
II.5.18. In the literature locally free sheaves are often called vector bundles.
A locally free subsheaf E ′ ⊂ E is called a subbundle if the corresponding mor-
phism V(E ′ ) → V(E) is a closed immersion. In this case E ′ is a direct summand
in E with locally free complement ([95], Chapter VI, §1.3, Proposition), so the
quotient E/E ′ is locally free; it is called a quotient bundle. A sufficient condition
for the kernel and the image of a morphism φ : E → F of locally free sheaves to
be subbundles is that for all geometric points s̄ : Spec (Ω) → S the Ω-linear maps
s̄∗ E → s̄∗ F have the same rank ([53], Proposition 1.7.2).
Assume now that C is an integral proper normal curve over a field, and E is a
locally free sheaf of rank r on C. The determinant det(E) (which is by definition
the r-th exterior power of E) is an invertible sheaf, so by [34], Propositions II.6.11
and II.6.13 it corresponds to a divisor D on C. The degree d of D is called the
degree of E, and the quotient µ(E) := d/r the slope of E. For locally free sheaves
E and F on C one has
A quick way to prove this formula is to identify the degree of a locally free sheaf
with the degree of its first Chern class ([27], Theorem 3.2 (f) and Remark 3.2.3
(c)), and then to use the multiplicativity of the Chern character ([27], Example
3.2.3). See also Exercise 9 for a direct argument.
Also, using the Riemann–Roch formula for vector bundles ([53], Theorems 2.6.3
6.7 Nori’s Fundamental Group Scheme 255
and 2.6.9) one can calculate the degree of a locally free sheaf E of rank r as
Proposition 6.7.8 A finite locally free sheaf E on an integral proper normal curve
is semi-stable of slope 0.
Proof: Denoting by r the rank of F, for (1) it suffices to show that the degrees of
rank r′ subbundles F ′ ⊂ F are bounded for all 1 ≤ r′ ≤ r. But by the Riemann–
Roch formula (6.20) the degree of such an F ′ equals χ(F ′ ) − r′ χ(OC ). As the
Euler–Poincaré characteristic χ is additive for short exact sequences, we see that
χ(F ′ ) ≤ χ(F), whence statement (1).
For (2) consider a subbundle G ⊂ F1 ⊕ F2 , and denote by j : Spec (K(C)) → C
the inclusion of the generic point of C. The pullback j ∗ G decomposes as a di-
rect sum j ∗ G ∼
= G1′ ⊕ G2′ , where the Gi′ are locally free sheaves on Spec (K(C))
corresponding to subspaces of the vector spaces j ∗ Fi . The j∗ (Gi′ ) are locally
free subsheaves of the Fi , hence there exist subbundles j∗ (Gi′ ) ⊂ Gi ⊂ Fi that
have the same rank as the j∗ (Gi′ ) (see [53], p. 73). It then suffices to prove
µ(G1 ⊕ G2 ) ≤ max(µ(G1 ), µ(G2 )), which is an easy calculation.
256 Chapter 6. Tannakian Fundamental Groups
Remark 6.7.12 The point where the use of semi-stability was crucial in the con-
struction was in the innocent-looking last part of the above proof. Indeed, for the
purpose of embedding the category of finite locally free sheaves in an abelian cat-
egory, the category of coherent sheaves on S is just as handy; it moreover carries
a commutative tensor structure with unit. However, for duals to exist we need
to know that the objects are locally free sheaves, and this is also needed for the
construction of the fibre functor.
Definition 6.7.13 Let S be a proper integral scheme over a field k that has a
k-rational point s : Spec (k) → S. The fundamental group scheme of S with base
point s is the affine k-group scheme corresponding via Theorem 6.5.3 to the neutral
Tannakian category EFS and the fibre functor s∗ . We denote it by π1N (S, s) (the
superscript N stands for ‘Nori’).
258 Chapter 6. Tannakian Fundamental Groups
Proposition 6.7.14 The group scheme π1N (S, s) is an inverse limit of finite k-group
schemes.
Proof: Let F be a finite locally free sheaf on S. Consider the direct sum Fe of a
system of representatives of S(F ⊕ F ∨ ); by Proposition 6.7.4 this is a finite direct
sum. As before, denote by hFi e the full abelian subcategory of EFS generated
e
by the object F. It is stable by tensor product and duals because by definition
S(F ⊕ F ∨ ) contains all indecomposable direct summands of tensor products of the
e thus equals the full Tannakian subcategory
form F ⊗i ⊗ (F ∨ )⊗j . The category hFi
hFi⊗ of EFS , and as such is equivalent to the category RepG for an affine group
scheme G. The group scheme G is finite, because we know from Step 1 of the
proof of Theorem 6.5.4 that hFi e is equivalent to the category of finite dimensional
comodules over a finite dimensional k-coalgebra RF which must be the underlying
coalgebra of the Hopf algebra of G. It remains to note that EFS is the direct limit
of the full Tannakian subcategories hFi⊗ , as each object E is a subquotient of a
finite locally free sheaf F, and is therefore contained in hFi⊗ .
Remark 6.7.15 The arguments in the above proof also imply that when k is of
characteristic 0, every essentially finite locally free sheaf over S is finite. Indeed,
it follows from a theorem of Cartier (see [111], Theorem 11.4) that in characteristic
0 every finite group scheme is étale. On the other hand, for a finite étale group
scheme G the regular representation k[G] is manifestly semisimple (i.e. a direct
sum of irreducible representations), hence each object in the category RepG is
semisimple. Applying this to the category hFi e of the above proof which is equiva-
lent to RepG for some finite G, we see that each object is a direct sum of objects
in S(F ⊕ F ∨ ), and therefore a finite locally free sheaf by Corollary 6.7.6.
The next theorem shows why π1N (S, s) is called a fundamental group scheme.
To state it, introduce the category FtorsS,s of triples (G, X, x), where G is a finite
group scheme over k, X is a left G-torsor over S and x is a k-rational point in the
fibre of X above s. A morphism (G, X, x) → (G′ , X ′ , x′ ) in this category is given
by a pair of morphisms φ : G → G′ , ψ : X → X ′ such that the G-action on X is
compatible with the G′ -action on X ′ via (φ, ψ), and moreover ψ(x) = x′ .
Note that not all G-torsors over S have a k-point in the fibre above s. The proof
will show that every G-torsor X gives rise to a homomorphism φX : π1N (S, s) → G
but one may recover X from φX only after securing a k-point in the fibre. We
need the following general lemma.
6.7 Nori’s Fundamental Group Scheme 259
Lemma 6.7.17 Let G be a finite k-group scheme, and consider the neutral Tan-
nakian category RepG of its finite dimensional representations, together with a
fibre functor ω : RepG → Vecf k .
Given a non-neutral fibre functor η with values in the category LFS of locally free
sheaves on S, the functor Hom⊗ (η, ωS ) given by T 7→ Hom⊗ (η ⊗OS OT , ω ⊗k OT )
on the category of schemes over S is representable by a left G-torsor over S.
Proof: By Lemma 6.3.3 the subfunctor Isom⊗ (η, ωS ) ⊂ Hom⊗ (η, ωS ) given by
tensor isomorphisms equals Hom⊗ (η, ωS ). Therefore for each T → S the natural
map
Aut⊗ (ωS ) × Hom⊗ (η, ωS ) → Hom⊗ (η, ωS ) × Hom⊗ (η, ωS ), (g, φ) 7→ (φ, g ◦ φ)
Proof of Theorem 6.7.16: A G-torsor X gives rise to a finite locally free sheaf
EX on S by Lemma 6.7.1. As we have seen in the proof of Proposition 6.7.14,
the full Tannakian subcategory hEX i⊗ is equivalent to the category RepG for some
260 Chapter 6. Tannakian Fundamental Groups
finite group scheme G. Consider the fibre functor on RepG given by composing the
∼
equivalence RepG → hEX i⊗ with the restriction of the functor s∗ to hEX i⊗ . The in-
clusion functor hEX i⊗ → EFS then corresponds to a group scheme homomorphism
φX : π1N (S, s) → G by Corollary 6.3.6 and Proposition 6.7.11.
Conversely, a homomorphism φ : π1N (S, s) → G induces a tensor functor
∗
φ : RepG → EFS , whence a non-neutral fibre functor η with values in LFS
by composition with the inclusion of categories EFS → LFS , and a neutral fibre
functor ω by composing with s∗ . Applying the lemma we obtain a G-torsor Xφ
over S. However, for φ = φX the G-torsor Xφ is not necessarily isomorphic to X.
Indeed, the construction shows that we would get X back if instead of the above ω
we applied the lemma to the forgetful fibre functor F sending an object of RepG to
its underlying k-vector space. By fixing an isomorphism ω ∼ = F of fibre functors we
can thus define a quasi-inverse to the functor of the previous paragraph. But such
an isomorphism exists if and only if there is a k-rational point in the fibre of X
above s. Indeed, the affine k-scheme Isom⊗ (η, ωS ) has a canonical k-rational point
corresponding to s∗ , so an isomorphism ω ∼= F of functors yields a k-rational point
of Isom⊗ (η, FS ) ∼
= X above s. Conversely, such a point defines a trivialization of
the G-torsor Xs over k, whence an isomorphism of functors ω ∼ = F.
Over an algebraically closed base field we may compare Nori’s fundamental
group scheme to Grothendieck’s algebraic fundamental group.
Here ‘maximal pro-étale quotient’ means the inverse limit of those quotients
of π1N (S, s) that are finite étale group schemes over k.
Proof: From Proposition 5.3.16 (1) and the fact that k is algebraically closed we
conclude that a torsor under a finite étale k-group scheme G is one and the same
thing as a finite étale Galois cover with group G(k). In the proof of Proposition
5.4.6 we turned the set of finite étale Galois covers of S into an inverse system
(Pα , φαβ ) by specifying a point pα in the geometric fibre Fibs̄ (Pα ) and defining
φαβ to be the unique morphism Pβ → Pα sending pβ to pα . The group π1 (S, s̄)
was the automorphism group of this inverse system.
On the other hand, we have seen that each Gα -torsor Pα defines an object
of EFS . The full Tannakian subcategory hPα i⊗ ⊂ EFS is equivalent to the cat-
egory RepGα . Let F be the neutral fibre functor on RepGα given by the forget-
ful functor, and let η be the non-neutral fibre functor given by the composite
∼
RepGα → hPα i⊗ → EFS → LFS . By Lemma 6.7.17 the functor Hom⊗ (η, FS )
on RepGα is representable by a Gα -torsor over S and, as we have noted in the
previous proof, this Gα -torsor is none but Pα . Thus fixing points pα as above
turns the functors Hom⊗ (η, FS ) on the various RepGα into an inverse system as
6.7 Nori’s Fundamental Group Scheme 261
well. Passing to automorphisms defines an inverse system for the Gα whose inverse
limit lim Gα is an affine group scheme over k whose group of k-points is π1 (S, s̄)
←
by construction. But lim Gα is also the maximal pro-étale quotient of π1N (S, s) by
←
Theorem 6.7.16.
Proof: This follows from the proposition above, Proposition 6.7.14 and the fact,
already recalled in Remark 6.7.15, that in characteristic 0 all finite group schemes
are étale.
Remarks 6.7.21
1. Nori established in [73] a number of properties of π1N (S, s). For instance,
given two k-rational points s and t, one has π1 (S, s) ×k k̄ ∼
= π1 (S, t) ×k k̄ (we
have omitted the ‘Spec ’ to ease notation). Also, one has an isomorphism
of group schemes π1 (S ×k L) ∼ = π1 (S, s) ×k L for a separable algebraic ex-
tension L|k. However, as Mehta and Subramanian have shown in [58], the
corresponding property does not hold for extensions of algebraically closed
fields, at least if one allows S to be singular. In the same paper they checked
that π1N (S, s) is compatible with direct products. This can be used to adapt
the Lang–Serre arguments on the étale fundamental group of an abelian va-
riety A, and obtain that π1N (A, 0) is the inverse limit of the kernels of the
multiplication-by-m maps m : A → A considered as finite group schemes.
Nori proved this earlier in [74] by a direct method.
2. In their recent paper [21], Esnault and Hai develop a generalization of Nori’s
theory for smooth geometrically connected schemes S over a field k of char-
acteristic 0, without assuming the existence of a k-point. They consider
the category FConnS of locally free sheaves on S equipped with the struc-
ture of a flat connection, and then define a connection (E, ∇) to be finite
if f ((E, ∇)) = g((E, ∇)) for some polynomials f 6= g. They show that in
the proper case each object of Nori’s category EFS carries the structure of
a flat connection lying in FConnS . But in the absence of a k-rational point
the category FConnS has a priori only a non-neutral Tannakian structure.
If ω is a non-neutral fibre functor coming from a k̄-valued geometric point
of S, the general Tannakian theory (Remark 6.5.13) shows that Aut⊗ k (ω)
is representable by a groupoid scheme Π over k̄. Esnault and Hai identify
the sections of the groupoid map s : Π → Spec (k̄) with elements of the
étale fundamental group π1 (S ⊗k k̄) and show that neutral fibre functors on
FConnS correspond to sections of the projection π1 (S) → Gal (k̄|k). This
yields a Tannakian reinterpretation of Grothendieck’s Section Conjecture:
262 Chapter 6. Tannakian Fundamental Groups
Exercises
1. Let V and W be infinite dimensional vector spaces. Show that the natural
embedding V ∗ ⊗k W ∗ → (V ⊗k W )∗ sending a tensor product φ ⊗ ψ of
functions to the function given by v ⊗ w 7→ φ(v)ψ(w) is not surjective.
[Hint: Choose infinite sequences of linearly independent vectors v1 , v2 , . . .
and w1 , w2 , . . . in V and W , respectively. Define a k-linear function on
V ⊗k W by sending vi ⊗ wj to δij (Kronecker delta), extending linearly to
the span S of the vi ⊗ wj , and extending by 0 outside S. Show that this
function is not in the image of V ∗ ⊗k W ∗ .]
(a) Show that the set End(ω) of functor morphisms ω → ω carries a natural
k-algebra structure.
(b) Verify that the map B 7→ End(ω) induced by mapping b ∈ B to the
functorial collection Φb of multiplication-by-b maps V → V on the un-
derlying space of each finitely generated B-module V is an isomorphism
of k-algebras.
(c) State and prove a dual statement for a finite dimensional k-coalgebra
A and the category of finite dimensional k-vector spaces equipped with
a right A-comodule structure.
(a) Given a differential module (V, ∇) over K, show that a system of el-
ements of V ∇ is k-linearly independent if and only if it is K-linearly
independent.
(b) Let f1 , . . . , fn be elements of K, and consider the Wronskian matrix
(i−1) (i−1)
W (f1 , . . . , fn ) = [fj ]1≤i,j≤n , where fj denotes the (i − 1)-st
(0)
derivative of fj and fj = fj by convention. Show that W (f1 , . . . , fn )
is invertible if and only if the fj are k-linearly independent. [Hint:
Define a differential module structure on K n for which the vectors
(fj , fj′ , . . . , fjn−1 ) are horizontal.]
10. Let S be a proper integral scheme over a field k, and let s : Spec (k) → S
be a k-rational point. Consider the category of pairs (G, X) with G a finite
group scheme over k and X → S a left GS -torsor such that the fibre Xs over
s has a k-rational point. Let Fs be the functor on this category sending a
pair (G, X) to the set of k-rational points of the fibre Xs . Show that Fs is
a pro-representable functor.
Bibliography
[3] Emil Artin, Otto Schreier, Eine Kennzeichnung der reell abgeschlossenen
Körper, Abh. Math. Sem. Hamburg 5 (1927), 225–231.
[8] Jean-Benoı̂t Bost, François Loeser, Michel Raynaud (eds.), Courbes semi-
stables et groupe fondamental en géométrie algébrique, Progress in Mathe-
matics, vol. 187, Birkhäuser, Basel, 2000.
265
266 BIBLIOGRAPHY
[15] Pierre Deligne, David Mumford, The irreducibility of the space of curves of
given genus, Publ. Math. IHES 36 (1969), 75–109.
[16] Adrien Douady, Détermination d’un groupe de Galois. C. R. Acad. Sci. Paris
258 (1964), 5305–5308.
[17] Adrien and Régine Douady, Algèbre. Théories galoisiennes, Dunod, Paris,
1979.
[18] David Eisenbud, Commutative Algebra with a View toward Algebraic Geom-
etry, Graduate Texts in Mathematics, vol. 150, Springer-Verlag, New York,
1995.
[19] Noam D. Elkies, ABC implies Mordell, Internat. Math. Res. Notices (1991),
99–109.
[20] Hélène Esnault, Phùng Hô Hai, Packets in Grothendieck’s section conjecture,
Adv. Math. 218 (2008), 395–416.
[21] Hélène Esnault, Phùng Hô Hai, The fundamental groupoid scheme and ap-
plications, preprint arXiv:math/0611115.
[24] Peter Freyd, Abelian Categories, Harper & Row, New York, 1964.
[31] David Harbater, Galois coverings of the arithmetic line, in Number Theory
(New York, 1984–1985), Lecture Notes in Math., vol. 1240, Springer-Verlag,
Berlin, 1987, 165–195.
[32] David Harbater, Abhyankar’s conjecture on Galois groups over curves. In-
vent. Math. 117 (1994), 1–25.
[38] André Joyal, Ross Street, An introduction to Tannaka duality and quantum
groups, in Category theory (Como, 1990), Lecture Notes in Math., vol. 1488,
Springer-Verlag, Berlin, 1991, pp. 413–492.
[41] Nicholas M. Katz, Serge Lang, Finiteness theorems in geometric class field
theory, L’Enseign. Math. 27 (1981), 285–314
[44] Steven Kleiman, The Picard scheme, in B. Fantechi et al., Fundamental Alge-
braic Geometry, Mathematical Surveys and Monographs, vol. 123, American
Mathematical Society, Providence, 2005, 237–329.
[50] Serge Lang, Algebraic Number Theory, second edition, Springer–Verlag, New
York, 1994.
[51] Serge Lang, Jean-Pierre Serre, Sur les revêtements non ramifiés des variétés
algébriques, Amer. J. Math. 79 (1957), 319–330.
[52] Hendrik W. Lenstra, Jr, Galois Theory of Schemes, course notes available
from the server of the Universiteit Leiden Mathematics Department.
[54] Max Lieblich, Martin Olsson, Generators and relations for the étale funda-
mental group, to appear in Quart. J. of Pure and Appl. Math.
BIBLIOGRAPHY 269
[60] John Milnor, Morse Theory, Annals of Mathematics Studies, vol. 51, Prince-
ton University Press, 1963.
[61] Shinichi Mochizuki, The profinite Grothendieck conjecture for closed hyper-
bolic curves over number fields, J. Math. Sci. Univ. Tokyo 3 (1996), 571–627.
[62] Shinichi Mochizuki, The local pro-p anabelian geometry of curves. Invent.
Math. 138 (1999), 319–423.
[64] David Mumford. The Red Book of Varieties and Schemes, Lecture Notes in
Mathematics 1358, Springer-Verlag, Berlin, 1988.
[66] James Munkres, Topology, second edition, Prentice Hall, Englewood Cliffs,
2000.
[68] Jürgen Neukirch, Kennzeichnung der p-adischen und der endlichen algebrais-
chen Zahlkörper, Invent. Math. 6 (1969), 296–314.
[71] Nikolay Nikolov, Dan Segal, On finitely generated profinite groups I. Strong
completeness and uniform bounds, Ann. of Math. 165 (2007), 171–238.
270 BIBLIOGRAPHY
[73] Madhav Nori, The fundamental group-scheme, Proc. Indian Acad. Sci.
Math. Sci. 91 (1982), 73–122.
[75] Florian Pop, 1/2 Riemann existence theorem with Galois action, in G. Frey
and J. Ritter (eds.) Algebra and Number Theory, de Gruyter, Berlin, 1994.
[76] Florian Pop, Étale Galois covers of affine smooth curves. The geometric case
of a conjecture of Shafarevich. On Abhyankar’s conjecture, Invent. Math. 120
(1995), 555–578.
[78] Wayne Raskind, Abelian class field theory of arithmetic schemes, in B. Ja-
cob, A. Rosenberg (eds.) K-Theory and Algebraic Geometry: Connections
with Quadratic Forms and Division Algebras, Proc. Symp. Pure Math., vol.
58/1, American Mathematical Society, Providence, 1995, 85–187.
[80] Walter Rudin, Real and Complex Analysis, third edition, McGraw-Hill, New
York, 1987.
[82] Peter Schauenburg, Tannaka duality for arbitrary Hopf algebras, Algebra
Berichte, vol. 66, Verlag Reinhard Fischer, Munich, 1992.
[83] Alexander Schmidt, Tame class field theory for arithmetic schemes. Invent.
Math. 160 (2005), 527–565.
[84] Alexander Schmidt, Michael Spieß, Singular homology and class field theory
for varieties over finite fields, J. reine angew. Math. 527 (2000), 13–36.
[85] Leila Schneps, Pierre Lochak (eds.), Geometric Galois actions 1, London
Mathematical Society Lecture Note Series, vol. 242. Cambridge University
Press, 1997.
BIBLIOGRAPHY 271
[93] Jean-Pierre Serre, Topics in Galois Theory, Jones and Bartlett, Boston,
1992.
[97] Michael Spieß, Tamás Szamuely, On the Albanese map for smooth quasi-
projective varieties, Math. Ann. 325 (2003), 1–17.
[98] Jakob Stix, Projective anabelian curves in positive characteristic and descent
theory for log étale covers, Bonner Mathematische Schriften, vol. 354, Bonn,
2002.
[101] Tamás Szamuely, Groupes de Galois de corps de type fini [d’après Pop],
Séminaire Bourbaki, exposé 923, Astérisque 294 (2004), 403–431.
[102] Akio Tamagawa, The Grothendieck conjecture for affine curves, Compositio
Math. 109 (1997), 135–194.
[105] Karl-Heinz Ulbrich, On Hopf algebras and rigid monoidal categories, Israel
J. Math. 72 (1990), 252–256.
[106] Bartel Leendert van der Waerden, Algebra I (7te Auflage) II (5te Auflage),
Springer-Verlag, Berlin, 1966–67. English translation: Springer-Verlag, New
York, 1991.
[107] Marius van der Put, Michael F. Singer, Galois Theory for Linear Differen-
tial Equations, Grundlehren der mathematischen Wissenschaften, vol. 328,
Springer-Verlag, Berlin, 2003.
[109] Angelo Vistoli, Grothendieck topologies, fibered categories and descent the-
ory, in B. Fantechi et al., Fundamental Algebraic Geometry, Mathematical
Surveys and Monographs, vol. 123, American Mathematical Society, Provi-
dence, 2005, 1–104.
[110] Claire Voisin, Hodge Theory and Complex Algebraic Geometry I, Cambridge
Studies in Advanced Mathematics, vol. 76, Cambridge University Press,
2002.
[112] André Weil, Généralisation des fonctions abeliennes, J. Math. Pures Appl.
17 (1938), 47–87.
Index
273
274 INDEX
Tamagawa’s theorem
on curves over Fp , 142
Tamagawa–Mochizuki theorem, 144
tame fundamental group, 196
Tannakian category
general, 242
neutral, 236
Tannakian fundamental group, 237
Tate module, 187
tensor category, 226
tensor functor, 228
morphism of, 230
torsor
under group scheme, 171
under topological group, 54
triangulation, 95
trivial cover, 38
universal cover, 52
unramified class field theory, 206
vector bundle
holomorphic, 63
on scheme, 254