(Cambridge Studies in Advanced Mathematics) Kiran S. Kedlaya-P-Adic Differential Equations (Cambridge Studies in Advanced Mathematics) - Cambridge University Press (2010) PDF
(Cambridge Studies in Advanced Mathematics) Kiran S. Kedlaya-P-Adic Differential Equations (Cambridge Studies in Advanced Mathematics) - Cambridge University Press (2010) PDF
C A M B R I D G E S T U D I E S I N A DVA N C E D M AT H E M AT I C S 1 2 5
Editorial Board
B . B O L L O B S , W. F U LTO N , A . K ATO K , F. K I RWA N ,
P. S A R NA K , B . S I M O N , B . TOTA RO
978-0-511-74472-3
eBook (EBL)
ISBN-13
978-0-521-76879-5
Hardback
Contents
Preface
0
0.1
0.2
0.3
0.4
Part I
1
1.1
1.2
1.3
1.4
1.5
2
2.1
2.2
2.3
3
3.1
3.2
Introductory remarks
Why p-adic differential equations?
Zeta functions of varieties
Zeta functions and p-adic differential equations
A word of caution
Notes
Exercises
Tools of p-adic Analysis
Norms on algebraic structures
Norms on abelian groups
Valuations and nonarchimedean norms
Norms on modules
Examples of nonarchimedean norms
Spherical completeness
Notes
Exercises
Newton polygons
Introduction to Newton polygons
Slope factorizations and a master factorization theorem
Applications to nonarchimedean field theory
Notes
Exercises
Ramification theory
Defect
Unramified extensions
page xiii
1
1
3
5
7
8
9
11
13
13
16
17
25
28
31
33
35
35
38
41
42
43
45
46
47
v
vi
3.3
3.4
4
4.1
4.2
4.3
4.4
4.5
Contents
Tamely ramified extensions
The case of local fields
Notes
Exercises
Matrix analysis
Singular values and eigenvalues (archimedean case)
Perturbations (archimedean case)
Singular values and eigenvalues (nonarchimedean case)
Perturbations (nonarchimedean case)
Horns inequalities
Notes
Exercises
6
6.1
6.2
6.3
6.4
6.5
6.6
6.7
6.8
6.9
7
7.1
49
52
53
54
55
56
60
62
68
71
72
74
75
77
77
80
81
84
85
87
87
90
91
91
93
93
95
102
104
105
107
109
112
114
116
117
118
119
Contents
7.2
7.3
7.4
7.5
Part III
8
8.1
8.2
8.3
8.4
8.5
8.6
9
9.1
9.2
9.3
9.4
9.5
9.6
9.7
9.8
9.9
9.10
10
10.1
10.2
10.3
10.4
10.5
10.6
10.7
10.8
vii
120
123
126
127
129
130
133
135
136
138
140
143
144
147
149
150
151
152
153
154
155
157
158
160
162
162
164
165
166
168
168
169
170
172
174
176
180
181
viii
11
11.1
11.2
11.3
11.4
11.5
11.6
11.7
11.8
11.9
12
12.1
12.2
12.3
12.4
12.5
12.6
12.7
12.8
13
13.1
13.2
13.3
13.4
13.5
13.6
13.7
Contents
Notes
Exercises
Variation of generic and subsidiary radii
Harmonicity of the valuation function
Variation of Newton polygons
Variation of subsidiary radii: statements
Convexity for the generic radius
Measuring small radii
Larger radii
Monotonicity
Radius versus generic radius
Subsidiary radii as radii of optimal convergence
Notes
Exercises
Decomposition by subsidiary radii
Metrical detection of units
Decomposition over a closed disc
Decomposition over a closed annulus
Decomposition over an open disc or annulus
Partial decomposition over a closed disc or annulus
Modules solvable at a boundary
Solvable modules of rank 1
Clean modules
Notes
Exercises
p-adic exponents
p-adic Liouville numbers
p-adic regular singularities
The Robba condition
Abstract p-adic exponents
Exponents for annuli
The p-adic Fuchs theorem for annuli
Transfer to a regular singularity
Notes
Exercises
182
183
184
185
186
189
190
191
193
195
197
198
199
200
201
202
203
207
209
210
211
212
214
216
216
218
218
221
222
223
225
231
234
237
238
241
14
14.1
243
243
Contents
14.2
14.3
14.4
14.5
14.6
15
15.1
15.2
15.3
15.4
16
16.1
16.2
16.3
16.4
16.5
Twisted polynomials
Difference-closed fields
Difference algebra over a complete field
Hodge and Newton polygons
The DieudonnManin classification theorem
Notes
Exercises
Frobenius modules
A multitude of rings
Frobenius lifts
Generic versus special Frobenius lifts
A reverse filtration
Notes
Exercises
Frobenius modules over the Robba ring
Frobenius modules on open discs
More on the Robba ring
Pure difference modules
The slope filtration theorem
Proof of the slope filtration theorem
Notes
Exercises
ix
246
247
248
254
256
258
260
262
262
264
266
269
271
272
273
273
275
277
279
281
284
286
289
17
17.1
17.2
291
291
17.3
17.4
17.5
18
18.1
18.2
18.3
18.4
18.5
294
296
298
298
299
300
301
301
302
306
308
310
19
19.1
19.2
19.3
19.4
20
20.1
20.2
20.3
20.4
20.5
21
21.1
21.2
21.3
21.4
21.5
Contents
Notes
Exercises
Galois representations and differential modules
Representations and differential modules
Finite representations and overconvergent differential
modules
The unit-root p-adic local monodromy theorem
Ramification and differential slopes
Notes
Exercises
The p-adic local monodromy theorem
Statement of the theorem
An example
Descent of sections
Local duality
When the residue field is imperfect
Notes
Exercises
The p-adic local monodromy theorem: proof
Running hypotheses
Modules of differential slope 0
Modules of rank 1
Modules of rank prime to p
The general case
Notes
Exercises
Part VI
22
22.1
22.2
22.3
23
23.1
23.2
23.3
Areas of Application
PicardFuchs modules
Origin of PicardFuchs modules
Frobenius structures on PicardFuchs modules
Relationship to zeta functions
Notes
Rigid cohomology
Isocrystals on the affine line
Crystalline and rigid cohomology
Machine computations
Notes
310
311
313
314
316
318
321
323
325
326
326
328
329
332
333
335
337
338
338
339
341
342
343
343
344
345
347
347
348
349
350
352
352
353
354
355
Contents
24
24.1
24.2
24.3
24.4
24.5
xi
357
357
359
361
362
363
364
References
Notation
Index
365
374
376
Preface
This book is an outgrowth of a course, taught by the author at MIT during fall 2007, on p-adic ordinary differential equations. The target audience
was graduate students with some prior background in algebraic number theory, including exposure to p-adic numbers, but not necessarily with any
background in p-adic analytic geometry (of either the Tate or Berkovich
flavors).
Custom would dictate that ordinarily this preface would continue with an
explanation of what p-adic differential equations are, and why they matter. Since we have included a whole chapter on this topic (Chapter 0), we
will devote this preface instead to a discussion of the origin of the book, its
general structure, and what makes it different from previous books on the
subject.
The subject of p-adic differential equations has been treated in several previous books. Two that we used in preparing the MIT course, and to which
we make frequent reference in the text, are those of Dwork, Gerotto, and
Sullivan [80] and of Christol [42]. Another existing book is that of Dwork [78],
but it is not a general treatise; rather, it focuses in detail on hypergeometric
functions.
However, this book develops the theory of p-adic differential equations in
a manner that differs significantly from most prior literature. Key differences
include the following.
We limit our use of cyclic vectors. This requires an initial investment in
the study of matrix inequalities (Chapter 4) and lattice approximation
arguments (especially Lemma 8.6.1), but it pays off in significantly
stronger results.
We introduce the notion of a Frobenius descendant (Chapter 10). This
complements the older construction of Frobenius antecedents, particularly in dealing with certain boundary cases where the antecedent
method does not apply.
xiii
xiv
Preface
As a result, we end up with some improvements of existing results, including the following. (Some of these can also be found in an upcoming book of
Christol [46], whose development we learned about only after this book was
mostly complete.)
We refine the Frobenius antecedent theorem of Christol and Dwork
(Theorem 10.4.2).
We extend some results of Christol and Dwork, on the variation of the
generic radius of convergence, to subsidiary radii (Theorem 11.3.2).
We extend Youngs geometric interpretation of subsidiary generic radii
of convergence beyond the range of applicability of Newton polygons
(Theorem 11.9.2).
We give quantitative versions of the ChristolMebkhout decomposition
theorem for differential modules on an annulus that are applicable even
when the modules are not solvable at a boundary (Theorems 12.2.2
and 12.3.1).
We give a somewhat simplified treatment of the theory of p-adic
exponents (Theorems 13.5.5, 13.5.6, and 13.6.1).
We sharpen the bound in the Christol transfer theorem to a disc containing a regular singularity with exponents in Z p (Theorem 13.7.1).
We give a general version of the DieudonnManin classification
theorem for difference modules over a complete nonarchimedean field
(Theorem 14.6.3).
We give improvements on the ChristolDworkRobba effective bounds
for solutions of p-adic differential equations (Theorems 18.2.1 and
18.5.1) and some related bounds that apply in the presence of a
Frobenius structure (Theorem 18.3.3). The latter can be used to recover
a theorem of Chiarellotto and Tsuzuki concerning the logarithmic
growth of solutions of differential equations with Frobenius structure
(Theorem 18.4.5).
We state a relative version of the p-adic local monodromy theorem,
formerly Crews conjecture (Theorem 20.1.4), and describe in detail
how it may be derived either from the p-adic index theory of Christol
and Mebkhout, which we treat in detail in Chapter 13, or from the slope
theory for Frobenius modules of Kedlaya, which we only sketch, in
Chapter 16.
Some of the new results are relevant in theory (in the study of higherdimensional p-adic differential equations, largely in the context of the
semistable reduction problem for overconvergent F-isocrystals, for which
see [138] and [143]) or in practice (in the explicit computation of solutions
of p-adic differential equations, e.g., for the machine computation of zeta
Preface
xv
functions of particular varieties, for which see [139]). There is also some relevance, entirely outside number theory, to the study of flat connections on
complex analytic varieties (see [144]).
Although some applications involve higher-dimensional p-adic analytic
spaces, this book treats exclusively p-adic ordinary differential equations. In
joint work with Liang Xiao [145], we have developed some extensions to
higher-dimensional spaces.
Each individual chapter of this book exhibits the following basic structure.
Before the body of the chapter, we give a brief introduction explaining what
is to be discussed and often setting some running notations or hypotheses.
After the body of the chapter, we typically include a section of afternotes, in
which we provide detailed references for results in that chapter, fill in historical
details, and add additional comments. (This practice is modeled on that in [94],
although we do not carry it out quite as fully.) Note that we have a habit of
attributing to various authors slightly stronger versions of their theorems than
the ones they originally stated; to avoid complicating the discussion in the
text, we resolve these misattributions in the afternotes instead. At the end of
a chapter we typically include a few exercises; a fair number of these request
proofs of results which are stated and used in the text but whose proofs pose
no unusual difficulties.
The chapters themselves are grouped into several parts, which we now
describe briefly. (Chapter 0, being introductory, does not fit into this grouping.)
Part I is preliminary, collecting some basic tools of p-adic analysis. However, it also includes some facts of matrix analysis (the study of the variation
of numerical invariants attached to matrices as a function of the matrix entries)
which may not be familiar to the typical reader.
Part II introduces some formalism of differential algebra, such as differential
rings and modules, twisted polynomials, and cyclic vectors, and applies these
to fields equipped with a nonarchimedean norm.
Part III begins the study of p-adic differential equations in earnest, developing some basic theory for differential modules on rings and annuli, including
the ChristolDwork theory of variation of the generic radius of convergence
and the ChristolMebkhout decomposition theory. We also include a treatment of p-adic exponents, culminating in the ChristolMebkhout structure
theorem for p-adic differential modules on an annulus satisfying the Robba
condition (i.e., having intrinsic generic radius of convergence everywhere
equal to 1).
Part IV introduces some formalism of difference algebra, and presents (without full proofs) the theory of slope filtrations for Frobenius modules over the
Robba ring.
xvi
Preface
Part V introduces the concept of a Frobenius structure on a p-adic differential module, to the point of stating the p-adic local monodromy theorem
and sketching briefly the proof techniques using either p-adic exponents or
Frobenius slope filtrations. We also discuss effective convergence bounds for
solutions of p-adic differential equations.
Part VI consists of a series of brief discussions of several areas of application of the theory of p-adic differential equations. These are somewhat more
didactic, and much less formal, than in the other parts; they are meant primarily
as suggestions for further reading.
The following diagram indicates the logical dependencies of the chapters. To
keep the diagram manageable, we have grouped together some chapters (13
and 912) and omitted Chapter 0 and the chapters of Part VI. The reader should
be aware that there is one forward reference, from Chapter 13 to Chapter 18,
but the graph remains acyclic. (There are some additional forward references
between Chapters 1 and 2, but these should not cause any difficulty.)
GFED
@ABC
13 o
H
/.-,
()*+
4I O
/.-,
()*+
()*+
/.-,
()*+
/.-,
7I O
5 o
6O o
/.-,
()*+
HIJK
ONML
89:;
?>=<
912 o
8 @_ o @
13
>^ >>
@@
F
>>
@@
>>
@@@@
>>
>>
@
>>
89:;
89:;
?>=<
89:;
15 o nn6 ?>=<
16
14 o ?>=<
>>
n nn
>>
n
n
>>
nn
n
n
>>
nnnnn
>
n
no
?>=<
89:;
?>=<
89:;
89:;
?>=<
?>=<
89:;
89:;
?>=<
o
o
o
19
18
17
20
21
As noted above we have not assumed that the reader is familiar with rigid
analytic geometry and so have phrased all statements more concretely in terms
of rings and modules. Although we expect that the typical reader has at least
a passing familiarity with p-adic numbers, for completeness we include a
rapid development of the algebra of complete rings and fields in the first few
chapters of the book. This development, when read on its own, may appear
somewhat idiosyncratic; its design is justified by the reuse of some material in
later chapters.
Preface
xvii
0
Introductory remarks
Introductory remarks
Introductory remarks
another way to write this, which makes it look more like a typical zeta
function, is
X (T ) =
(1 T deg(x) )1 ,
x
where x runs over the Galois orbits of X (Fq ) and deg(x) is the size of the orbit
x. (If you prefer algebro-geometric terminology, you may run x over closed
points of the scheme X , in which case deg(x) denotes the degree of the residue
field of x over Fq .)
Example 0.2.4. For X = E one can verify that
X (T ) =
1 aq ()T + qT 2
,
(1 T )(1 qT )
P1 (T ) P2d1 (T )
,
P0 (T ) P2d (T )
where each Pi (T ) has integer coefficients, satisfies Pi (0) = 1, and has all
roots in C on the circle |T | = q i/2 .
Proof. The proof of this theorem is a sufficiently massive undertaking that
even a reference is not reasonable here; instead, we give [107, Appendix C] as
a source of references. (Another useful exposition is [178].)
Remark 0.2.6. It is worth pointing out that the first complete proof of
Theorem 0.2.5 used the fact that for any prime = p one has
(1)i Trace(F n , Heti (X, Q )),
#X (Fq n ) =
i
where Heti (X, Q ) is the i-th tale cohomology group of X (or rather, the base
change of X to Fq ) with coefficients in Q . This is an instance of the Lefschetz
trace formula in tale cohomology.
a(a + 1) (a + i)b(b + 1) (b + i)
i=0
c(c + 1) (c + i)i!
zi
(0.3.2.1)
(0.3.2.2)
Set
(z) = F(1/2, 1/2; 1; z).
Over C, is related to an elliptic integral, for instance, by the formula
d
2 /2
(0 < < 1).
() =
0
1 sin2
Introductory remarks
(One can extend this to complex , but care needs to be taken with the branch
cuts.) This elliptic integral can be viewed as a period integral for the curve E ,
i.e., one is integrating some meromorphic differential form on E around some
loop (or more properly, around some homology class).
Let p be an odd prime. We now try to interpret (z) as a function of a
p-adic variable rather than a complex variable. Be aware that this means that z
can take any value in a field with a norm extending the p-adic norm on Q, not
just in Q p itself. (For the moment, you can imagine z running over a completed
algebraic closure of Q p .)
Lemma 0.3.3. The series (z) converges p-adically for |z| < 1.
Proof. Exercise.
Dwork discovered that a closely related function admits a sort of analytic
continuation.
Definition 0.3.4. Define the Igusa polynomial
H (z) =
( p1)/2
i=0
( p 1)/2 2 i
z.
i
Modulo p, the roots of H (z) are the values of F p for which E is a supersingular elliptic curve, i.e., for which aq () 0 (mod p). (In fact, the roots
of H (z) all belong to F p2 , by a theorem of Deuring; see [200, Theorem V.3.1].)
Dworks analytic continuation result is the following.
i
Theorem 0.3.5 (Dwork). There exists a series (z) =
i=0 Pi (z)/H (z) ,
with each Pi (z) Q p [z], converging uniformly for those z satisfying |z| 1
and |H (z)| = 1 and such that
(z) = (1)( p1)/2
(z)
(z p )
a1
).
That is, the quantity u is the unit root (meaning the root of valuation 0) of
the polynomial T 2 aq ()T +q occurring (up to reversal) in the zeta function.
Proof. See [213, 7].
xn
n!
n=0
and the denominators hurt us rather than helping. In fact, the series only converges for |x| < p1/( p1) (assuming that we are normalizing in such a way
that | p| = p 1 ). For comparison, note that the logarithmic series
log
xn
1
=
1x
n
n=1
Introductory remarks
Notes
For detailed notes on the topics discussed in Section 0.1, see the notes for the
chapters referenced.
We again mention [107, Appendix C] and [178] as starting points for further
reading about the Weil conjectures.
The notion of an analytic function specified in terms of a uniform limit
of rational functions with poles prescribed to certain regions is the original
such notion, introduced by Krasner. For this book, we will restrict our consideration of p-adic analysis to working with complete rings in this fashion,
without attempting to introduce any notion of nonarchimedean analytic geometry. However, it must be noted that it is much better in the long run to work
in terms of analytic geometry; for example, it is prohibitively difficult to deal
with partial differential equations without doing so.
That said, there are several ways to develop a theory of analytic spaces over a
nonarchimedean field. The traditional method is Tates theory of rigid analytic
spaces, so-called because one develops everything rigidly by imitating the
theory of schemes in algebraic geometry but using rings of convergent power
series instead of polynomials. The canonical foundational reference for rigid
geometry is the book of Bosch, Gntzer, and Remmert [31], but novices may
find the text of Fresnel and van der Put [93] or the lecture notes of Bosch [30]
more readable. A more recent method, which in some ways is more robust,
is Berkovichs theory of nonarchimedean analytic spaces (commonly called
Berkovich spaces), as introduced in [19] and further developed in [20]. For
both points of view, see also the lecture notes of Conrad [59].
Dworks original analysis of the Legendre family of elliptic curves using
the associated hypergeometric equation (this analysis expands earlier work of
Tate) appears in [74, 8]. The treatment in [213] is more overtly related to
p-adic cohomology.
The family of hypergeometric equations with a, b, c QZ p is rich enough
that one could devote an entire book to the study of its p-adic properties.
Indeed, Dwork did exactly this; the result was [78].
It is possible to resurrect in part the fundamental theorem of ordinary differential equations in the p-adic setting. The best results in that direction seem
Exercises
Exercises
The reader new to p-adic numbers should postpone doing these exercises until
he or she has read Part I.
(1) Prove directly from the definition that the series F(a, b; c; z) converges
p-adically for |z| < 1 whenever a, b, c are rational numbers with
denominators not divisible by p. This implies Lemma 0.3.3.
(2) Using the fact that (z) satisfies the hypergeometric equation, write down
a nontrivial differential equation with coefficients in Q(z) satisfied by the
function (z).
(3) Check that the usual formula
lim inf |an |1/n
n
n
for the radius of convergence of the power series
n=0 an z still works
over a nonarchimedean field. That is, the series converges when |z| is less
than this radius and diverges when |z| is greater than this radius.
(4) Show that in the previous exercise, just like in the archimedean case, a
power series over a nonarchimedean field can either converge or diverge at
a value of z for which |z| equals the radius of convergence.
(5) Check that (as claimed in Example 0.4.1), under the normalization
| p| = p 1 , the exponential series exp(z) over Q p has radius of convergence p 1/( p1) , while the logarithm series log(1 z) has radius of
convergence 1.
(6) Show that, over Q p , while a power series in z which converges for |z| 1
may have an antiderivative which only converges for |z| < 1, its derivative
still converges for |z| 1. This is the reverse of what happens over an
archimedean field.
Part I
Tools of p-adic Analysis
1
Norms on algebraic structures
In this chapter, we recall some basic facts about norms (absolute values),
primarily of the nonarchimedean sort, on groups, rings, fields, and modules.
We also briefly discuss the phenomenon of spherical completeness, which is
peculiar to the nonarchimedean setting. Our discussion is not particularly comprehensive; the reader new to nonarchimedean analysis is directed to [191] for
a fuller treatment.
Several proofs in this chapter make forward references to Chapter 2.
There should be no difficulty in verifying the absence of circular
references.
Convention 1.0.1. In this book, a ring means a commutative ring unless commutativity is suppressed explicitly by describing the ring as not necessarily
commutative or implicitly by its usage in certain phrases, e.g., a ring of
twisted polynomials (Definition 5.5.1).
Notation 1.0.2. For R a ring, we denote by R the multiplicative group of
units of R.
14
|g|2 c2 |g|1 ;
this implies topological equivalence but the reverse implication does not
necessarily hold.
Definition 1.1.3. Let G be an abelian group equipped with a seminorm.
A Cauchy sequence in G under | | is a sequence {xn }
n=0 in G such that,
for any > 0, there exists an integer N such that, for all integers m, n N ,
|xm xn | < . We say the sequence {xn }
n=0 is convergent if there exists x G
such that, for any > 0 there exists an integer N such that, for all integers
n N , |x xn | < ; in this case, the sequence is automatically Cauchy, and
we say that x is a limit of the sequence. If G is separated under | |, then limits
are unique when they exist. We say G is complete under | | if every Cauchy
sequence has a unique limit.
Theorem 1.1.4. Let G be an abelian group equipped with a norm | |. Then
there exists an abelian group G , equipped with a norm | | under which it is
complete, and an isometric homomorphism : G G with dense image.
This is standard, so we only sketch the proof.
15
Proof. We take the set of Cauchy sequences in G and declare two sequences
{xn }
n=0 , {yn }n=0 to be equivalent if the sequence x 0 , y0 , x 1 , y1 , . . . is also
Cauchy. This is easily shown to be an equivalence relation; let G be the set
of equivalence classes. It is then straightforward to construct the group operation (termwise addition) and the norm on G (the limit of the norms of the
terms of the sequence). The map takes g G to the constant sequence
g, g, . . .
Definition 1.1.5. With the notation of Theorem 1.1.4, we call G the completion of G; the group G , equipped with the norm | | and the homomorphism
, is functorial in G. That is, any continuous homomorphism G H extends
uniquely to a continuous homomorphism G H between the completions;
in particular, G is unique up to unique isomorphism. Note that one can also
define the completion even if G is only equipped with a seminorm, but only by
first quotienting by the kernel of the seminorm; in that case, the map from G
to its completion need not be injective.
Definition 1.1.6. If R is a not necessarily commutative ring and | | is a seminorm on its additive group, we say that | | is submultiplicative if the following
additional condition holds.
(c) For f, g R, | f g| | f ||g|.
We say that | | is multiplicative if the following additional condition holds.
(c ) For f, g R, | f g| = | f ||g|.
The completion of a ring R equipped with a submultiplicative seminorm
admits a natural ring structure, because the termwise product of two Cauchy
sequences is again Cauchy.
Lemma 1.1.7. Let F be a field equipped with a multiplicative norm. Then the
completion of F is also a field.
1
bounded below away from 0, from which it follows easily that { f n }n=0 is also
a Cauchy sequence. This proves that every nonzero element of the completion
of F has a multiplicative inverse, as desired.
Proposition 1.1.8. Two multiplicative norms | |, | | on a field F are topologically equivalent if and only if there exists c > 0 such that |x| = |x|c for
all x F.
Proof. Exercise, or see [80, Lemma I.1.2].
16
(1.1.9.1)
17
o F = { f F : v( f ) 0},
m F = { f F : v( f ) > 0},
F = o F /m F .
Note that o F is a local ring (the valuation ring of F), m F is the maximal ideal
of o F , and F is a field (the residue field of F).
It is worth noting that there are comparatively few archimedean (i.e., not
nonarchimedean) absolute values on fields.
Theorem 1.2.4 (Ostrowski). Let F be a field equipped with a norm | |. Then
| | fails to be nonarchimedean if and only if the sequence |1|, |2|, |3|, . . . is
unbounded. In that case, F is isomorphic to a subfield of C equipped with the
restriction of the usual absolute value.
Proof. Exercise, or see [191, 2.1.6] and [191, 2.2.4], respectively.
( f R, x M );
18
(cb R).
bB
n
Ai j m 1,i ;
i=1
n
Ai j a2, j
(i = 1, . . . , n)
j=1
and so
n
n
max{|a1,i |}
|Ai j | max{|a2, j |}.
i
i=1 j=1
This inequality, together with the corresponding one with the bases reversed
(involving the matrix A1 ), implies the claim.
Corollary 1.3.4. Let R
S be an isometric inclusion of rings equipped
with multiplicative seminorms. Let M be a finite free R-module. Let | | M
be a seminorm on M that is compatible with R, which is the restriction of a
supremum-equivalent seminorm on M R S that is compatible with S. Then
| | M is supremum-equivalent.
Proof. Put N = M R S, and let | | N be a supremum-equivalent norm on
N , compatible with S, whose restriction to M equals | | M . Pick any basis B
19
inf
a1 ,...,ak R
inf
a1 ,...,ak R
{|a1 m 1 + + an m n | M }
= |ak+1 m k+1 + + an m n | M1
|ak+1 m k+1 + + an m n | M
c2 max{|ak+1 |, . . . , |an |}.
Thus | | M1 is equivalent to the supremum norm defined by the images of
m k+1 , . . . , m n in M1 , proving the desired result.
In general, even over a field, not every compatible norm on a vector space
need be supremum-equivalent; see the exercises. However, such supremumequivalence is true for complete fields.
Theorem 1.3.6. Let F be a field complete for a norm | |, and let V be a
finite-dimensional vector space over F. Then any two norms on V compatible
with F are metrically equivalent.
Proof. In the archimedean case, apply Theorem 1.2.4 to deduce that F = R
or F = C, then use compactness of the unit ball. In the nonarchimedean case,
we proceed as follows. (See [80, Theorem I.3.2] for a different proof.)
20
We then have
|a1 m 1 + + an m n |V max{|a1 m 1 |V , |a2 m 2 + + an m n |V }
max{|a1 m 1 |V , |a2 m 2 + + an m n |V }
min{|m 1 |, c2 } max{|ai |},
i
(x V ).
21
Proof. We induct on n, with trivial base case n = 0. For n > 0, pick any
nonzero m 1 V , and put V1 = V /Fm 1 . Using (a) or (b), we can rescale m 1
by an element of F to force 1 |m 1 |V c2/3 .
Equip V1 with the quotient seminorm ||V1 induced by ||V . By Lemma 1.3.5,
| |V1 is again supremum-equivalent. Moreover, in case (b) the infimum in
(1.1.9.1) is always achieved, i.e., for every x1 V1 , there exists x V lifting
x1 with |x|V = |x1 |V1 . Hence V1 again satisfies (b).
We may now apply the induction hypothesis to V1 to produce a basis
m 2,1 , . . . , m n,1 of V1 defining a supremum norm | |V1 for which
c1/3 |x1 |V1 |x1 |V1 c1/3 |x1 |V1
(x1 V1 ).
22
ai m i converges to m.
F such that the series i=1
(b) With notation as in (a), the function | |V defined by
|m|V = sup{|ai m i |V }
i
rn
|m n |V
rn+1
(n > 1).
rn
rn+1
max{|am n |V , |m|V };
the same holds if |am n |V = |m|V since in that case |am n + m|V = max
{|am n |V , |m|V }.
By induction on n, we deduce that, for a1 , . . . , an F,
|a1 m 1 + + an m n |V
r1
rn+1
max{|a1 m 1 |V , . . . , |an m n |V }
r1 max{|a1 m 1 |V , . . . , |an m n |V }.
Combining this with the evident inequality
|a1 m 1 + + an m n |V max{|a1 m 1 |V , . . . , |an m n |V },
23
we conclude that on n Vn the seminorms | |V and | |V are metrically equivalent. Consequently, | |V extends by continuity to a function on V which is
metrically equivalent to | |V and hence is a norm (which is evidently compatible with F). This assertion will imply both (a) and (b) as soon as we have
established the existence aspect of (a), which we will do now.
Given m V , by hypothesis there exists a sequence x1 , x2 , . . . of elements
ai, j m i , where
of n Vn converging to m. For j = 1, 2, . . . write x j = i=1
is
a
Cauchy
sequence, for
only finitely many ai, j are nonzero. Since {x j }
j=1
each > 0 there exists N such that, for j, j N , |x j x j |V . Since | |V
is metrically equivalent to | |V , for each > 0 there also exists N such that,
for j, j N , |x j x j |V .
On the one hand this implies that, for each fixed i, the sequence {ai, j }
j=1
is Cauchy. Since F is complete, this sequence has a limit ai . On the other
hand, for j = N there exists some i 0 such that ai, j = 0 for all i i 0 . If we
write xi, j =
h=i+1 ah, j m j , for all j N and all i i 0 we have |xi, j |V
and hence |xi, j |V . For fixed i, xi, j converges to m a1 m 1
ai m i as j ; hence, for all i i 0 , |m a1 m 1 ai m i |V so
i=1
As in the case of the quotient seminorm, it is clear that the product seminorm is
a seminorm, but it is not clear whether it is compatible with R unless R happens
to be a field. Moreover for norms | | M and | | N , it is not clear whether the
product seminorm is a norm. However, if M and N are finite free R-modules
equipped with supremum-equivalent norms then the product seminorm will
be supremum-equivalent, which forces it to be a norm. See also the following
lemma.
24
ai m i , with ai
element can be uniquely written as a convergent series i=1
F, and | |V is equivalent to the norm | |V defined by
ai m i = sup{|ai m i |V }
(ai F).
i
i=1
More precisely, we have c||V ||V ||V for some c > 0. Let j : V F
s
This gives a norm by the following argument. Let
k=1 yk z k be any
representation of x V F W with yk V and z k W , so that
j,W (x) =
s
j (yk )z k .
k=1
s
j (yk )z k ,
k=1
s
For x =
k=1 yk z k V F W and any positive integer N , we can
express x also as
m 1 1,W (x)+ +m N N ,W (x)+
s
k=1
25
s
c sup |m j |V
j (yk )z k
j
k=1
so that
|x|V F W c|x|V F W .
That is, the product seminorm is equivalent to | |V F W and so is a norm.
s
In the general case, suppose on the contrary that x =
j=1 m j n j
V F W had product seminorm 0. This would mean that we can find a sequence
i
m i, j n i, j , and so
xi V F W in which each xi can be represented as sj=1
lim max{|m i, j |V |n i, j |W } = 0.
Then the same data would be available if we replaced V by the closure of the
F-subspace spanned by the m j and the m i, j , and similarly for W . We may thus
apply the previous case to obtain a contradiction.
i
i ci t F((t)), vt ( f ) is the least i for which ci = 0. This exponentiates to
give a t-adic norm under which F((t)) is complete and discretely valued. (See
Example 1.5.8 for a variation on this construction.)
Before introducing our next example, we make a more general definition for
later use.
26
Definition 1.4.3. Let R be a ring equipped with a nonarchimedean submultiplicative (semi)norm | |. For 0, define the -Gauss (semi)norm | | on
the polynomial ring R[T ] by
i
Pi T = max{|Pi | i };
i
i
27
Remark 1.4.7. When converting the p-adic norm into a valuation, it is common practice to use base- p logarithms. We have instead opted to keep the
factor of log p visible when we take logarithms. One may liken this practice
to using metric units rather than normalizing some dimensioned constants to 1
(e.g., the speed of light).
Just as the only archimedean norm on Q is the usual one, every nontrivial
nonarchimedean norm on Q is essentially a p-adic norm, again by a theorem
of Ostrowski.
Theorem 1.4.8 (Ostrowski). Any nontrivial nonarchimedean norm on Q is
equivalent to the p-adic norm for some prime p.
Proof. See [191, 2.2.4].
To equip extensions of Q p with norms, we use the following result. As usual,
when E is a finite extension of a field F we write [E : F] for the degree of the
field extension, i.e., the dimension of E as an F-vector space.
Theorem 1.4.9. Let F be a complete nonarchimedean field. Then any finite
extension E of F admits a unique extension of | | to a norm on E (under
which E is also complete).
Proof. We only prove uniqueness now; existence will be established in
Section 2.3. Let ||1 and ||2 be two extensions of || to norms on E. Then these
in particular give norms on E viewed as an F-vector space; by Theorem 1.3.6,
these norms are metrically equivalent; that is, there exist c1 , c2 > 0 such that
|x|1 c1 |x|2 ,
|x|2 c2 |x|1
(x E).
They are also both metrically equivalent to the supremum norm for some basis
of E over F, under which E is evidently complete.
We now use the extra information that | |1 and | |2 are multiplicative
(because they are norms on E as a field in its own right). For any positive
integer n, we may substitute x n in place of x in the previous inequalities and
then take nth roots, to obtain
1/n
|x|1 c1 |x|2 ,
1/n
|x|2 c2 |x|1
(x E).
28
Because of the uniqueness in Theorem 1.4.9, it also follows that any algebraic extension E of F, finite or not, inherits a unique extension of | |.
However, if [E : F] = then E is not complete, so we may prefer to use its
completion instead. For instance, if F = Q p , we define C p to be the completion of an algebraic closure of Q p . One might worry that this may launch us
into an endless cycle of completion and algebraic closure, but fortunately this
does not occur.
Theorem 1.4.11. Let F be an algebraically closed nonarchimedean field.
Then the completion of F is also algebraically closed.
For the proof, see Section 2.3.
29
residue field. (In fact, this is not difficult to prove using Zorns lemma.) The
equivalence of this condition with spherical completeness was then proved by
Kaplansky [118, Theorem 4]. See also [214, p. 151].
One can also prove the result more directly; for instance, the case F =
Q p is explained in detail in [191, 3]. For the case F = K ((t)), see
Example 1.5.8.
One benefit of the hypothesis of spherical completeness is that it can
simplify the construction of quotient norms.
Lemma 1.5.4. Let F be a spherically complete nonarchimedean field, let V
be a finite-dimensional vector space over F, and let | |V be a norm on V compatible with F (and which must be supremum-equivalent by Theorem 1.3.6).
Let V be a quotient of V , and let | |V be the quotient norm on V induced by
| |V . Then, for every x V , there exists x V lifting x with |x|V = |x |V .
Proof. We first treat the case where dim F (V ) = dim F (V ) 1. In this case,
we can choose m 1 V so that V = V /Fm 1 . Given x V , start with any
lift x0 of x to V . Any other lift of x to V can be written uniquely as x0 + am 1
for some a F.
For > 0, let B be the set of a F such that |x0 + am 1 |V |x |V + .
By the definition of |x |V , B is nonempty. Pick any a B and define
r (a, ) = sup {|b a|}.
bB
Then on the one hand B is contained in the closed ball of radius r (a, ) centered at a. On the other hand, for any r < r (a, ) there exists b B with
r |b a|, so
r |m 1 |V |b a||m 1 |V
max{|x0 + am 1 |V , |x0 + bm 1 |V }
|x |V + .
By taking limits, we may deduce that r (a, )|m 1 |V |x |V + . Hence, for
any c F with |c a| r (a, ),
|x0 + cm 1 |V max{|x0 + am 1 |V , |(c a)m 1 |V }
|x |V +
and so c B .
We conclude that B must equal the closed ball of radius r (a, ) centered at
a. As decreases, the B form a decreasing family of closed balls in F. Since
30
Notes
31
j
Q
y =
jQ y j t be two elements of K ((t )). We would like to define their
product to be
xy =
xi y j t k .
kQ
i, jQ:i+ j=k
Notes
The concept of a real valuation is a special case of Krulls notion of a valuation
(sometimes called a Krull valuation for emphasis), in which the role of the real
numbers is replaced by an arbitrary totally ordered group. For instance, on the
polynomial ring k[x, y] one can define a degree function taking monomials to
elements of Z Z. If we then equip the latter with the lexicographic ordering
(i.e., we compare pairs in their first component, using the second component
32
only to break a tie in the first component), we may then define a valuation
taking each polynomial to the lowest degree of any of its monomials.
While not all concepts from archimedean analysis generalize as nicely to
Krull valuations as to real valuations, Krull valuations are important in algebraic geometry; indeed, they were originally advocated by Zariski as a key
component of a rigorous foundation for the study of algebraic varieties. They
were later ousted from this role by the more flexible theory of schemes, but
they continue to play an important role in the study of birational properties of
varieties and schemes, particularly in questions about the resolution of singularities by blowups. See [187] for a full account of the theory of valuations and
[113] for discussion of some points concerning Zariskis use of valuations in
algebraic geometry.
A sequence m 1 , m 2 , . . . as in the statement of Lemma 1.3.8(a) is called a
Schauder basis for V . For more discussion, see [31, 2.7.2]. Over a complete
archimedean field (i.e., R or C), the statement of Lemma 1.3.8 becomes false,
as there are Banach spaces admitting countable dense subsets (i.e., Banach
spaces which are separable) but not admitting Schauder bases. The first example is due to Enflo [85]. For a general discussion of the problem of constructing
various sorts of Banach-space bases in the archimedean setting, see [159,
Part I] or [37].
For a direct proof of Theorem 1.4.11 in the case of the completed algebraic
closure of Q p , see [191, 3.3.3].
The fact that the completion of an infinite extension of Q p fails to be
spherically complete if it is not discretely valued (see Example 1.5.2 and the
exercises) is a special case of a more general fact, namely, that any nonarchimedean metric space which admits a countable dense subset and whose
metric takes values which are dense in R+ fails to be spherically complete
[193, Theorem 20.5].
The condition of spherical completeness is quite important in nonarchimedean functional analysis, as it is needed for the HahnBanach theorem to
hold. (By contrast, the nonarchimedean version of the open mapping theorem
requires only completeness of the field.) For an expansion of this remark we
recommend [195]; an older reference is [214].
Condition (c) of Proposition 1.5.7 is due to Ostrowski [179]; it is the
definition used by Kaplansky in [118].
Example 1.5.8 was originally due to Hahn [100]. It was later generalized
independently by Malcev and by Neumann to the case where the ordered
abelian group Q is replaced by a possibly nonabelian group. Then it was used
by Poonen [182] to describe the spherical completion of an arbitrary complete
nonarchimedean field, even in the mixed-characteristic case. For instance, one
Exercises
33
Exercises
(1) Prove Proposition 1.1.8. (Hint: first note that |x| < 1 if and only if the
sequence x, x 2 , . . . converges to 0.)
(2) Prove Ostrowskis theorem (Theorem 1.2.4).
(3) Give an example to show that, even for a finite-dimensional vector space
V over a complete nonarchimedean field F, the requirement that a norm
on | |V compatible with F must satisfy the strong triangle inequality is not superfluous; that is, condition (b) in Definition 1.3.1 is not a
consequence of the rest of the definition. (Hint: use a modification of a
supremum norm.)
(4) Let M be a module over a nonarchimedean field F. Prove that any two
norms on M compatible with F are topologically equivalent if and only
if they are metrically equivalent.
(5) Use Remark 1.4.10 to give an example showing that the statement of
Theorem 1.3.6 may fail if F is not complete.
(6) Prove that the valuation ring o F of a nonarchimedean field is noetherian
if and only if F is trivially or discretely valued.
(7) Use Theorem 1.4.9 to prove that, for any field F, any nonarchimedean
norm | | on F, and any extension of E, there exists at least one extension
of | | to a norm on E. (Hint: reduce to the cases where E is a finite
extension or a purely transcendental extension.)
(8) Here is a more exotic variation of the t-adic valuation. Let F be a field,
and choose 1 , . . . , n R.
(a) Prove that on the rational function field F(t1 , . . . , tn ) there is a valuation v such that v( f ) = 0 for all f F and v(ti ) = i for
i = 1, . . . , n. (Hint: you may construct it by iterating the definition
of Gauss valuations.)
(b) Prove that if 1 , . . . , n are linearly independent over Q, the valuation v is uniquely determined by (a).
(c) Prove that if 1 , . . . , n are not linearly independent over Q, the valuation v is not uniquely determined by (a). (Hint: try for a starter
the case n = 2, 1 = 2 = 1.)
(9) Let E be the completion of an infinite extension of Q p which is not discretely valued. Let 1 , 2 , . . . E be any sequence of elements such
that |1 |, |2 |, . . . form a strictly decreasing sequence with positive limit.
34
(10)
(11)
(12)
(13)
is decreasing, but its intersection is empty. (Hint: note that if the intersection were nonempty then it would contain an element algebraic over Q p ,
since such elements are dense in E.) Deduce as a corollary that E is not
spherically complete.
Prove that any finite-dimensional vector space over a spherically complete nonarchimedean field equipped with a compatible norm is also
spherically complete. (Hint: use Lemma 1.5.5.)
Prove Proposition 1.5.7.
Prove that a subset S of R is well-ordered (contains no infinite decreasing
sequence) if and only if every nonempty subset of S has a least element.
Check the unproved assertions at the end of Example 1.5.8. (Hint: reduce
both (a) and (b) to the fact that, given any sequence of pairs (i, j)
Q2 for which i + j is nonincreasing, one can pass to a subsequence in
which one component is nonincreasing. Reduce (c) to checking that x
K ((t Q )) has an inverse whenever x 1 has positive t-adic valuation, then
use a geometric series.)
2
Newton polygons
In this chapter, we recall the traditional theory of Newton polygons for polynomials over a nonarchimedean field. In the process, we introduce a general
framework which will allow us to consider Newton polygons in a wider range
of circumstances; it is based on a version of Hensels lemma that applies in not
necessarily commutative rings. As a first application, we fill in a few missing
proofs from Chapter 1.
i
tiplicative (semi)norm | |. For > 0 and P =
i Pi T R[T ], define
the width of P under the -Gauss norm | | as the difference between the
maximum and minimum values of i for which maxi {|Pi | i } is achieved.
Proposition 2.1.2. Let R be a ring equipped with a nonarchimedean multiplicative seminorm | |. For > 0 and P, Q R[T ] the following results
hold.
(a) We have |P Q| = |P| |Q| . That is, | | is multiplicative.
(b) The width of P Q under | | equals the sum of the widths of P and Q
under | | .
Proof. For {P, Q}, let j , k be the minimum and maximum values of i
for which maxi {|i | i } is achieved. Write
35
36
Newton polygons
PQ =
(P Q)i T i =
Pg Q h T i .
g+h=i
(i < j P + j Q ),
(i = j P + j Q ),
(i > j P + j Q ).
(i < k P + k Q ),
(i = k P + k Q ),
(i > k P + k Q ).
n
Pi T i R[T ], draw the set of points
Given a polynomial P(T ) = i=0
{(i, v(Pi )) : i = 0, . . . , n; v(Pi ) < +} R2 ,
then form the lower convex hull of these points. (That is, take the intersection of every closed half-plane which contains all the points and lies above
some nonvertical line.) The boundary of this region is called the Newton polygon of P. The slopes of P are the slopes of this open polygon, viewed as
a multiset in which each slope r counts with multiplicity equal to the horizontal width of the segment of the Newton polygon of slope r (or equal to
zero if there is no such segment); the latter can also be interpreted as the
width of P under | |er . (In the case where this multiset has cardinality
less than deg(P), we include + with sufficient multiplicity to make up the
shortfall.)
Example 2.1.4. For R = Q p equipped with the p-adic norm | | p , the
Newton polygon of the polynomial T 3 + pT 2 + pT + p 3 T 3 has vertices
(3, 0), (1, log p), (0, 3 log p). Its slopes are 12 log p with multiplicity 2 and
2 log p with multiplicity 1.
37
P(T ) =
Tp 1
T
p h1
p1
T ip
h1
i=0
Q(T ) =
(T + 1) p 1
(T
h1
+ 1) p
T p + 1p 1
T
p h1
h1
+ 1p
= T ( p1) p
h1
so all the coefficients of Q except for its leading coefficient are divisible by p.
Moreover Q(0) = p, so the Newton polygon of Q is a straight line segment
with endpoints (( p 1) p h1 , 0) and (0, log p). We conclude that
|1 ph | p = p p
h+1 /( p1)
(2.1.6.1)
38
Newton polygons
Remark 2.1.7. The data of the Newton polygon is equivalent to the data of
the -Gauss norms for all . This amounts to the statement that the graph of
a convex continuous function is determined by the positions of its supporting
lines of all slopes; this holds because the graph is the lower boundary of the
intersection of the closed halfspaces bounded below by the supporting lines.
For an example of the use of this observation, see the proof of Theorem 11.2.1.
39
(u U, v V ).
Before proving Theorem 2.2.2, let us see how it implies Theorem 2.2.1.
Proof of Theorem 2.2.1. We apply Theorem 2.2.2 with the following parameters:
R = F[T ],
| | = | |er ,
U = {P F[T ] : deg(P) deg(S) m},
V = {P F[T ] : deg(P) m 1},
W = {P F[T ] : deg(P) deg(S)},
a = 1,
b = T m,
c = S,
=1
and then put P = a + x and Q = b + y.
To see that this works, let us verify explicitly that condition (c) is satisfied
in this setup (the other conditions are more obvious). If u U, v V are such
40
Newton polygons
that max{|a||v|, |b||u|} is achieved by some term of av then that term appears
unchanged in av + ub because ub is divisible by T m . Hence | f (u, v)| |av|
in that case. Otherwise we have |av| < |bu| and so | f (u, v)| = |bu|.
With this motivation in mind, we now prove Theorem 2.2.2.
Proof of Theorem 2.2.2. We define a norm on U V by setting
|(u, v)| = max{|a||v|, |b||u|},
so that (c) implies
|(u, v)| | f (u, v)| |(u, v)|.
In particular 1, so |ab c| < |ab| = |c|.
Since a, b are nonzero, (c) implies that f is injective. By (b), f is in fact a
bijective group homomorphism between U V and W . It follows that, for all
w W,
| f 1 (w)| 1 |w|.
By (d) we may choose (0, ) with |ab c| |c|. Define
B = {(u, v) U V : |(u, v)| |c|}.
For (u, v) B we have
|a||v| |(u, v)| |c| = |a||b|,
so |v| |b|. Similarly |u| |a|. As a result,
| f 1 (c ab uv)| 1 |c ab uv|
1 max{|c ab|, |uv|}
1 max{|c|, 2 |a||b|}
= |c|.
Consequently, the map g(u, v) = f 1 (c ab uv) carries B into itself.
We next show that g is contractive. For (u, v), (t, s) B ,
|g(u, v) g(t, s)| | f 1 (ts uv)|
1 |ts uv|
= 1 |t (s v) + (t u)v|
1 max{|a||s v|, |t u||b|}
= 1 |(u t, v s)|
= 1 |(u, v) (t, s)|,
which has the desired effect because 1 < 1.
41
Since g is contractive on B and U V is complete, by the Banach contraction mapping theorem (exercise) there is a unique (x, y) U V fixed by g.
That is,
ay + xb = f (x, y) = f (g(x, y)) = c ab x y
and so
c = (a + x)(b + y).
Moreover, there is a unique such (x, y) in the union of all the B , and that
element belongs to the intersection of all the B .
Remark 2.2.3. One can also use Theorem 2.2.2 to recover other instances
of Hensels lemma. For instance, if F is a complete nonarchimedean field,
P(x) o F [x], and the reduction of P(x) into F [x] factors as Q R with Q, R
coprime, then there exists a unique factorization P = Q R in o F [x] with Q, R
lifting Q, R (exercise).
42
Newton polygons
er -Gauss
d1 i=0
i
i=0 Q i d F[T ] with |Q i Pi | < for i = 0, . . . , d 1.
Now assume that < min{|P0 |, , /|P0 |}, so that |Q 0 | = |P0 |. By
Proposition 2.1.5 there exists z F with Q(z) = 0 and |z| |Q 0 |1/d =
|P0 |1/d . We now have
|P(z)| = |(P Q)(z)| max{1, |z|}d max{1, |P(0)|} < ,
as desired.
Proof of Theorem 1.4.11. We must check that the completion E of an algebraically closed nonarchimedean field F is itself algebraically closed. Let
P(T ) E[T ] be a monic polynomial of degree d. Define a sequence of polynomials P0 , P1 , . . . as follows. Put P0 = P. Given Pi , apply Lemma 2.3.2 to
construct z i with |z i | |Pi (0)|1/d and |Pi (z i )| < 2i ; then set Pi+1 (T ) =
Pi (T + z i ), so that Pi+1 (0) = Pi (z i ). If some Pi satisfies Pi (0) = 0 then
z 0 + + z i1 is a root of P. Otherwise, we get an infinite sequence z 0 , z 1 , . . .
such that z 0 + z 1 + converges to a root of P.
Notes
There is a good reason for Newtons name to be attached to the polygons considered in this chapter. He considered them in the context of finding a series
expansion at the origin for a function y = f (x) implicitly defined by a polynomial relation P(x, y) = 0 over C. This was later reinterpreted by Puiseux,
Exercises
43
in terms of computing roots of polynomials over the subfield of C((x)) consisting of series which represent meromorphic functions in a neighborhood
of x = 0.
Remark 2.1.7 is perhaps best viewed within the broader context of duality
for convex functions, which is a central theme in the study of linear optimization problems. One reference for the rigorous mathematical theory of convex
functions is [192].
Although the formulation of Theorem 2.2.2 is due to Christol, the basic
observation that Hensels lemma does not rely on commutativity is rather old.
One early instance is due to Zassenhaus [223].
The Banach contraction mapping theorem (also known as the Banach fixed
point theorem), used in the proof of Theorem 2.2.2, is probably familiar to most
readers except possibly in name. For instance, it appears in the most common
proofs of the implicit function theorem, the inverse function theorem, and the
fundamental theorem of ordinary differential equations.
The notion of a nonarchimedean metric space can be generalized to the case
where the metric takes values in an arbitrary partially ordered set. One can then
give a number of extensions of the Banach contraction mapping theorem, at
least in the case of spherically complete metric spaces. For instance, PriessCrampe has proved that if X is a spherically complete metric space with
distance function d, and f : X X is a function such that d( f (x), f (y)) <
d(x, y) for all x, y X with x = y, then f has a fixed point. See [197] for an
exposition of a number of results of this type.
Exercises
(1) With notation as in Example 2.1.6, rederive (2.1.6.1) as follows. First note
j
that for j coprime to p, 1 ph and 1 ph are multiples of each other in
j
44
Newton polygons
(5) Prove the Banach contraction mapping theorem (used in the proof of
Theorem 2.2.2). That is, suppose that X is a nonempty complete metric
space (not necessarily nonarchimedean) with distance function d, and let
g : X X be a map for which there exists (0, 1) such that
d(g(x), g(y)) d(x, y) for all x, y X . Prove that g has a unique fixed
point. (Hint: show that, for any x X , the sequence x, g(x), g(g(x)), . . .
is Cauchy and that its limit is the desired fixed point.)
3
Ramification theory
Recall (Theorem 1.4.9) that any finite extension of a complete nonarchimedean field carries a unique extension of the norm and is also complete. In
this chapter, we study the relationship between a complete nonarchimedean
field and its finite extensions; this relationship involves the residue fields, value
groups, and Galois groups of the fields in question. We distinguish some important types of extensions, the unramified and tamely ramified extensions. See
[187] (especially Chapter 6) for a more thorough treatment.
We also briefly discuss the special case of a discretely valued field with
a perfect residue field, in which one can say much more. We introduce the
standard ramification filtrations on the Galois groups of extensions of local
fields; these will not reappear again until Part IV, at which point they will relate
to the study of the convergence of solutions of p-adic differential equations
made in Part III. We make no attempt to be thorough in our treatment; instead
we refer the reader to the standard text [198] for more details.
Notation 3.0.1. For E/F a Galois extension of fields, write G E/F for
Gal(E/F). If E = F sep , the separable closure of E, write G F for the absolute
Galois group G F sep /F . Of course, if F has characteristic 0 (or is perfect, e.g.,
if F is a finite field) then F sep coincides with the algebraic closure F alg . (We
avoid the usual notation F for the latter because we prefer to reserve the overbar to denote reduction modulo the maximal ideal of a local ring, e.g., from
Z p to F p .)
Remark 3.0.2. Throughout this chapter, when a nonarchimedean field F is
assumed to be complete it will be sufficient to assume it is only henselian
instead. One of many equivalent formulations of this condition (for which see
[175, 43.2]) is as follows: for any monic polynomial P(x) o F [x] and any
45
46
Ramification theory
3.1 Defect
Let F be a complete nonarchimedean field, and let E be a finite extension
of F. Then the value group |E | contains |F |, while the residue field E may
naturally be viewed as an extension of F . The first fundamental fact about the
extension E/F is that both these containments are finite in appropriate senses
closely related to the degree of E over F.
Lemma 3.1.1. Let F be a complete nonarchimedean field, and let E be a finite
extension of F. Then
[E : F] [ E : F ]#(|E |/|F |),
(3.1.1.1)
[E : F]
[ E : F ]#(|E |/|F |)
47
[U : E] [U : E ],
[E : F] [ E : F ];
48
Ramification theory
= [U1 : U3 ]
= [U1 : U3 ]
= [U1 U2 : U2 ]
[U : U2 ]
(because U1 U2 U ).
49
50
Ramification theory
F(x1
1/m
, t1
1/m
, . . . , th
1/m
) = F(t1
1/m
, . . . , th
i=0
converges (since its coefficients are p-adically integral, given that m is not
divisible by p) to an mth root of y in F.
Our next argument may be viewed as a preview of the filtration construction
of the next section.
Proposition 3.3.7. The group W E/F is a p-group.
Proof. We proceed by induction on [E : F]; we may assume that E is totally
wildly ramified over F (so W E/F = G E/F ) and that E = F. Let v E denote
the valuation on E. Pick any x o E \ o F , and set
j = min{v E (1 g(x)/x) : g G E/F }.
Note that j < + because x
/ F, and j > 0 because E is totally wildly
ramified over F.
51
{y o
E : v E (y 1) j}
{y o
E : v E (y 1) > j}
Since j > 0, the group on the right is isomorphic to the additive group of E
and so is a p-torsion group. Hence G E/F surjects onto a nontrivial p-group;
let E be the fixed field of the kernel of this surjection. It is clear that E is again
totally wildly ramified over E , so that G E/E is also a p-group; hence G E/F
is an extension of two p-groups and is thus a p-group itself.
The following are easily verified using Proposition 3.3.6.
Lemma 3.3.8. Let E be a subextension of E over F. Then E is tamely ramified over F if and only if E is tamely ramified over E and E is tamely ramified
over F.
Lemma 3.3.9. Let F be a complete nonarchimedean field, let E be a finite
extension of F, and let T1 , T2 be tamely ramified subextensions of E over F.
Then T = T1 T2 is also tamely ramified over F.
Remark 3.3.10. Let T be the maximal tamely ramified subextension of E
over F, so that G E/T = W E/F is a p-group by Proposition 3.3.7. A fact from
elementary group theory (exercise) allows us to construct a tower E 0 = T
E 1 E m = E such that each E i is Galois over T , and each group
G Ei /Ei1 is isomorphic to Z/ pZ. This is particularly important if F is of characteristic p, because every Z/ pZ-extension of a field L of characteristic p > 0
is isomorphic to L[z]/(z p z x) for some x L. (Such an extension is called
an ArtinSchreier extension.)
Remark 3.3.11. Using the result of Kaplansky mentioned in the proof of
Theorem 1.5.3 (i.e., the fact that maximal completeness is equivalent to spherical completeness), it is possible to show that any finite extension E of a
spherically complete field F is defectless. It suffices to check this first for E
separable over F and then for E purely inseparable over F (since any E can
be obtained by first making a separable extension and then a purely inseparable extension on top of that). As we go along, keep in mind that any finite
extension of F is also spherically complete (Example 1.5.2).
The key initial observation is that if [E : F] = p then by Theorem 3.1.2
either E is defectless over F or [ E : F ] = [|E | : |F |] = 1. If F is
spherically complete, the latter case is ruled out by Kaplanskys theorem.
52
Ramification theory
E/F
relation G E/F
(i)
= G E/F,i , where
i
E/F (i) =
0
1
dt.
[G E/F,0 : G E/F,t ]
Notes
53
Note that the indices where the filtration jumps are now rational numbers but
not necessarily integers. In any case, Proposition 3.4.4 below implies that there
is a unique filtration G iF on G F which induces the upper numbering filtration
on each G E/F ; that is, G iE/F is the image of G iF under the surjection G F
G E/F . It is this filtration which plays the more important role in, e.g., class
field theory.
Proposition 3.4.4 (Herbrand). Let E be a Galois subextension of E over F
and put H = G E/E , so that H is normal in G E/F and G E/F /H = G E /F .
Then G iE /F = (G iE/F H )/H ; that is, the upper numbering filtration is
compatible with the formation of quotients of G E/F .
Proof. The proof is elementary but slightly involved, so we will not give it
here. See [198, IV.3].
Notes
Ramification theory originally emerged in the study of algebraic number fields;
the formalism we see nowadays is due largely to Hilbert. The upper and
lower numbering filtrations were originally introduced by Hilbert as part of
class field theory (the study of abelian extensions of number fields and their
Galois groups), but not in the form we see today; the modern definitions were
introduced slightly later by Herbrand. See [198] for more discussion.
Ramification theory for a complete discrete nonarchimedean field becomes
substantially more complicated when one drops the requirement of a perfect
residue field. However, the case of an imperfect residue field is of great interest
in the study of finite covers of schemes of dimension greater than 1. A satisfactory theory for abelian extensions was introduced by Kato [119]. A generalization to nonabelian extensions was later introduced by Abbes and Saito [1, 2].
However, a number of alternate approaches exist, the relationships among
which are not fully understood. These include Borgers theory of residual perfection [29], Kedlayas differential Swan conductor [133], and methods from
higher local class field theory [224, 225]. See also the notes for Chapter 19.
A henselian-valued field is called stable if every finite extension of it is
defectless. A deep theorem of Kuhlmann [152, Theorem 1] (generalizing earlier theorems of Grauert-Remmert and Gruson) states that if F is a stable
henselian field, and E is a henselian extension of F of finite transcendence
degree with transcendence defect equal to 0, then E is also stable. In this
statement, the transcendence defect of E/F is defined as
trdeg(E/F) dimQ (|E |Q /|F |Q ) trdeg( E / F ),
54
Ramification theory
Exercises
(1) Complete the proof of Lemma 3.1.1. (Hint: construct a sequence of elements, in the span of the purported basis, converging to a given element
of o E .)
(2) Let G be a group of order p n . Prove that there exists a chain of subgroups
G 0 = {e} G 1 G n = G such that each inclusion is proper
and each G i is normal in G. Deduce that, for any finite Galois extension
L/K of fields of characteristic p of pth degree, there exists a sequence
K = K 0 K 1 K n = L of subextensions such that each inclusion
is an extension of degree p and each K i is Galois over K . (A somewhat
easier exercise is to produce such a sequence in which each K i is only
Galois over K i1 .)
4
Matrix analysis
56
Matrix analysis
sup {|Av|/|v|};
vCn \{0}
that is, |A| is the operator norm of A, as will be defined in Definition 6.1.2.
As mentioned above, we are interested in two sets of numerical invariants of
A. One of these is the familiar set of eigenvalues.
Definition 4.1.2. Let 1 , . . . , n be the list of eigenvalues of A, arranged so
that |1 | |n |.
A second set of numerical invariants of A, which is better behaved from the
point of view of numerical analysis, is the set of singular values.
Definition 4.1.3. Let A denote the conjugate transpose (or Hermitian transpose) of A. The matrix A A is Hermitian and nonnegative definite and so has
nonnegative real eigenvalues. The (nonnegative) square roots of these eigenvalues comprise the singular values of A; we denote them 1 , . . . , n with
1 n . These are not invariant under conjugation, but they are invariant
under the multiplication of A on either side by a unitary matrix.
Theorem 4.1.4 (Singular value decomposition). There exist unitary n n
matrices U, V such that U AV = Diag(1 , . . . , n ).
Proof. This is equivalent to showing that there is an orthonormal basis of Cn
which remains orthogonal upon applying A. To construct it, start with a vector
v Cn maximizing | Av|/|v| and then show that, for any w Cn orthogonal
to v, Aw is also orthogonal to Av. For further details, see the references in the
notes.
57
(i = 1, . . . , n),
58
Matrix analysis
59
(i = 1, . . . , n),
|i | > |i+1 |,
1 i = |1 i |.
Then there exists a unitary matrix U such that U 1 AU is block diagonal, the
first block accounting for the first i singular values and eigenvalues and the
second block accounting for the others.
Proof. Let v1 , . . . , vn be a basis of Cn such that v1 , . . . , vi span the generalized eigenspaces having eigenvalues 1 , . . . , i and vi+1 , . . . , vn span the
generalized eigenspaces having eigenvalues i+1 , . . . , n . Apply the singular
value decomposition (Theorem 4.1.4) to construct an orthonormal basis
w1 , . . . , wn such that Aw1 , . . . , Awn are also orthogonal and | Awi | = i |wi |.
Since i > i+1 , the only nonzero vectors v i Cn for which | Av|/|v|
achieves its maximum value 1 i are the nonzero multiples of w1 wi .
60
Matrix analysis
61
(i = 1, . . . , n).
n1
Pi T i and
Theorem 4.2.2. Let B be an n n matrix. Let P(T ) = T n + i=0
n1
Q(T ) = T n + i=0 Q i T i be the characteristic polynomials of A and A + B.
Then
i1
i n
max{ j , |B|}
(i = 1, . . . , n).
|Pni Q ni | 2
|B|
i
j=1
The superfluous enclosure of the integer 2i ni in absolute value signs is quite
deliberate; it will become relevant in the nonarchimedean setting.
Proof. Note that Q ni is the sum of the ni principal i i minors of A + B.
(A minor is the determinant of the i i submatrix obtained by choosing a
set of i rows and i columns. A principal minor is a minor in which the rows
and columns correspond; for instance, if the first row is included, then the
first column must also be included.) By multilinearity of the determinant,
each principal minor can be written as a sum of 2i terms, each of which
is the product of a sign, a k k minor of A, and an (i k) (i k)
minor of B. The terms with k = i sum to Pni itself; the others all have
k < i and so the norm of each is bounded by 1 k |B|ik . This proves the
claim.
We also need to consider multiplicative perturbations. For a considerable
generalization of the following inequality, see Theorem 4.5.2.
Proposition 4.2.3. Let B GLn (C) satisfy |B| . Let 1 , . . . , n be the
singular values of B A. Then
i i
(i = 1, . . . , n).
62
Matrix analysis
i |v| |B Av| |B||Av| i |B||v|,
= |i |
(i = 1, . . . , n).
1/k
sup {|Av|/|v|}.
vF n \{0}
63
(i = 0, . . . , n).
64
Matrix analysis
(i = 1, . . . , n),
with equality for i = n. In other words, the Hodge and Newton polygons have
the same endpoints and the Newton polygon is everywhere on or above the
Hodge polygon.
Proof. Again, the case i = 1 is clear because 1 is the operator norm of A,
and the general case follows by considering exterior powers (using the obvious
analogue of Lemma 4.1.9).
Like its archimedean analogue, Theorem 4.3.8 also has a converse, but in
this case we can write the construction down quite explicitly.
n1
Pi T i a monic polynomial of degree n
Definition 4.3.9. For P = T n + i=0
over a ring R, the companion matrix of P is defined as the matrix
0 0
P0
1 0
P1
. .
..
..
..
.
0 1 Pn1
n1
Pi T i
and such that the polynomial P(T ) = (T 1 ) (T n ) = T n + i=0
has coefficients in F. Choose c1 , . . . , cn F with i = |ci | such that 1
n and
1 i |1 i |
(i = 1, . . . , n),
65
1
0
0 c11 cn1
P0
cn1 0 c1 c1 P1
1
n2
..
..
..
.
.
0
Pn1
c1
ni .
Thus we can perform column operations over o F to clear everything
1
P0 , which has norm
in the far right-hand column except c11 cn1
1
1
1 n1 |1 n | = n . By permuting the rows and columns we obtain
a diagonal matrix with entries that are the norms 1 , . . . , n . This proves the
claim.
Again equality has a structural meaning, but the proof requires a bit
more work than in the archimedean case since we no longer have access to
orthogonality. However, this extra work is rewarded by a slightly stronger
result.
Theorem 4.3.11 (HodgeNewton decomposition). Suppose that for some i
{1, . . . , n 1} we have
|i | > |i+1 |,
1 i = |1 i |.
(That is, the Newton polygon has a vertex with x-coordinate n + i and this
vertex also lies on the Hodge polygon.) Then there exists U GLn (o F ) such
that U 1 AU is block upper triangular, with the top left block accounting for
the first i singular values and eigenvalues and the bottom right block accounting for the others. Moreover, if i > i+1 then we can ensure that U 1 AU is
block diagonal.
Proof. We first note that, by Theorem 2.2.1 applied to the characteristic polynomial of A, P(T ) = (T 1 ) (T i ) and Q(T ) = (T i+1 ) (T n )
have coefficients in F. Since P and Q have no common roots, we can write
66
Matrix analysis
w j = i Ui j ei . Then
B C
1
U AU =
0 D
is block upper triangular. By Cramers rule, each entry of B 1 C is an i i
minor of A divided by the determinant of B. Since |det(B)| = 1 i , B 1 C
must thus have entries in o F . Writing
B 0
Ii B 1 C
1
U AU =
,
0 D
0
Ini
we see that the singular values of B and D together must comprise 1 , . . . , n .
The only way for this to happen, given the constraint that the product of the
singular values of B equals 1 i , is for B to account for 1 , . . . , i and for
D to account for i+1 , . . . , n .
This proves the first claim; we may thus assume now that i > i+1 . In that
case, conjugating by the matrix
Ii B 1 C
0
Ini
gives a new matrix,
B
0
C1
,
D
with C1 = B 1 C D. Since
|C1 | |B 1 ||C||D| = i1 |C|i+1 < |C|,
this process converges. More explicitly, we obtain a sequence of matrices Ui
GLn (o F ) converging to the identity, such that the convergent product U =
U1 U2 satisfies
B 0
,
U 1 AU =
0 D
as desired.
67
Note that the slopes of the Hodge polygon are forced to be in the additive
value group of F, whereas the slopes of the Newton polygon need only lie in
the divisible closure of the additive value group. Consequently, it is possible for
a matrix to have no conjugates over GLn (F) for which the Hodge and Newton
polygons coincide. However, the following is true; see also Corollary 4.4.8
below.
Lemma 4.3.12. Suppose that one of the following holds.
(a) The value group of |F | is dense in R>0 , and > 1.
(b) We have |i | |F | for i = 1, . . . , n (so in particular i = 0), and
1.
Then there exists U GLn (F) such that the ith singular value of U 1 AU is
at most |i |.
Proof. Case (a) will follow from Corollary 4.4.8 below. Case (b) is directly
analogous to Lemma 4.1.14.
One also has the following variant.
Lemma 4.3.13. Suppose that |F | is discrete. Then there exists U GLn (F)
such that, for each positive integer m, |U 1 Am U | is the least element of |F |
greater than or equal to |m
1 |.
Proof. We may normalize the valuation on F for convenience so that
log |F | = Z. As in the proof of Theorem 4.3.11, we may also reduce to
the case where all the eigenvalues of A have the same norm.
Let E be a finite extension of F containing an element with || = |1 |.
(For instance, we could take E = F(1 ) and = 1 , but any other choice
would also work.) By Lemma 4.3.12, there exists U0 GLn (E) such that
1 U01 AU0 GLn (o E ); in other words, there exists a supremum norm | |0
on E n such that |1 Av|0 = |v|0 for all v E n .
Put V = {v F n : |v|0 1}. Let v1 , . . . , vn be a basis of V over o F , and
let | |1 be the supremum norm on F n defined by v1 , . . . , vn . Given a nonzero
v F n , choose F with log |v|0 [0, 1); then v is an element of
V which is not divisible by m F , so |v|1 = 1. We conclude that
e1 |v|1 < |v|0 |v|1
(v F n ).
(4.3.13.1)
68
Matrix analysis
n1
Theorem 4.4.2. Let B be an n n matrix. Let P(T ) = T n + i=0
Pi T i and
n1
Q(T ) = T n + i=0 Q i T i be the characteristic polynomials of A and A + B.
Then
|Pni Q ni | |B|
i1
max{ j , |B|}
(i = 1, . . . , n).
j=1
Proof. The proof is as for Theorem 4.2.2, except that now the factor |2i
dominated by 1.
n
i
| is
(i = 1, . . . , n).
Proof. As for Proposition 4.2.3 but using the Smith normal form
(Theorem 4.3.4) instead of the singular value decomposition.
Corollary 4.4.5. Suppose that the Newton and Hodge slopes of A coincide
and that U GLn (F) satisfies |U | |U 1 | . Then each Newton slope of
U 1 AU differs by at most log from the corresponding Hodge slope.
Here is a weak converse to Corollary 4.4.5. (We leave the archimedean
analogue to the readers imagination.)
Proposition 4.4.6. Suppose that the Newton slopes of A are nonnegative and
that 1 1. Then there exists U GLn (F) such that
|U 1 AU | 1,
|U 1 | 1,
|U | 1n1 .
69
each i, if j = j (i) is the least integer such that ei , Aei , . . . , A j ei are lin
j1
early dependent then we have A j ei = h=0 ch Ah ei for some ch F. The
j1
polynomial T j h=0 ch T h has roots which are eigenvalues of A, so the nonnegativity of the Newton slopes forces |ch | 1. Hence M is finitely generated,
and thus free, over o F .
Let v1 , . . . , vn be a basis of M, and let U be the change-of-basis matrix
(4.4.6.1)
1
A = 0
0
c 0
1 c
0 1
with |c| > 1 shows that the bound of Proposition 4.4.6 is sharp; in particular,
the bound |U | 1n1 cannot be improved to |U | 1 , as one might initially have expected. However, one should be able to get a more precise bound
(which agrees with the bound given in this example) by accounting for the
other singular values; see the exercises.
Corollary 4.4.8. There exists a continuous function
f n (1 , . . . , n , 1 , . . . , n , ) : (0, +)2n [0, +) (0, +)
(independent of F) with the following properties.
(a) Suppose that, for each i = 1, . . . , n, either i = i or max{i , i }.
Then
f n (1 , . . . , n , 1 , . . . , n , ) = 1.
(b) If A has singular values 1 , . . . , n and eigenvalues 1 , . . . , n , if i =
|i | for i = 1, . . . , n, and if i |F | whenever i > then there
exists U GLn (F) such that
|U 1 | 1,
|U | f n (1 , . . . , n , 1 , . . . , n , ),
70
Matrix analysis
|U11 | 1,
Let i be the largest index such that i = 1 . Then the first i singular values of U11 AU1 are all at most 1 but at least i . Hence they are all equal,
and A1 = U11 AU1 satisfies the hypothesis of Theorem 4.3.11. We may thus
choose U2 GLn (o F ) such that A2 = U21 A1 U2 is block upper triangular, the
top left block accounting for the first i singular values and eigenvalues and the
bottom right block accounting for the others.
If i = n then we may take U = U1 U2 and be done. Otherwise, note that by
applying Proposition 4.4.4 we may bound the singular values of A2 by a continuous function of 1 , . . . , n , 1 , . . . , n , . We may then apply the induction
hypothesis to construct a block diagonal matrix U3 , where the top left block
of U3 is the identity, |U31 | 1, |U3 | is bounded by a continuous function
of 1 , . . . , n , 1 , . . . , n , , and the multiset of singular values of the bottom
, . . . , in its values greater
right block of A3 = U31 A2 U3 agrees with i+1
n
than .
We may bound the norm of the top right block of A3 by a continuous function of 1 , . . . , n , 1 , . . . , n , . We can then conjugate by a suitable block
diagonal matrix U4 , with scalar matrices in the diagonal blocks, to ensure that
the multiset of singular values of A4 = U41 A3 U4 agrees with 1 , . . . , n in
its values greater than . We then take U = U1 U4 .
For the purposes of this book, it is immaterial what the function f n is, as
long as it is continuous. However, for numerical applications it may be quite
helpful to identify a good function f n ; here is a conjectural best possible result
in the case = 0, phrased in a somewhat stronger form. (One can formulate
an archimedean analogue. It should also be possible to prove, using the Horn
inequalities, that this conjecture cannot be improved; see the next section.)
Conjecture 4.4.9. If A has singular values 1 , . . . , n and eigenvalues
1 , . . . , n , none of which is equal to 0, and if 1 , . . . , n |F | satisfy
1 i 1 i |1 i |
(i = 1, . . . , n),
71
= |i |
(i = 1, . . . , n).
r (r + 1)
i+
j=
k+
.
2
iI
jJ
kK
For (I, J, K ) Urn , write I = {i 1 < < ir } and similarly for J, K . For
r = 1, put T1n = U1n . For r > 1, put
p( p + 1)
if +
jg
kh +
.
2
f F
gG
hH
72
Matrix analysis
iI
jJ
Proof. See [95, Theorem 16]. Note that the first condition in (b) is omitted in
the statement given in [95], but this was a typographical error.
Theorem 4.5.3. Let F be a complete nonarchimedean field. For
{ A, B, C}, let ,1 , . . . , ,n be a nonincreasing sequence of elements of |F|.
Then the following are equivalent.
(a) There exist n n matrices A, B, C over F with AB = C such that, for
{A, B, C}, has singular values ,1 , . . . , ,n .
n
A,i nj=1 B, j = nk=1 C,k and, for all r < n and
(b) We have i=1
(I, J, K ) Trn ,
C,k
A,i
B, j .
kK
iI
jJ
J = {1},
K = {i}
to obtain the inequality C,i A,i B,1 ; this is precisely Proposition 4.2.3.
Remark 4.5.5. For additive perturbations, one has an analogous result in the
archimedean case; see [95, Theorem 15]. We are not aware of an additive result
in the nonarchimedean case. Also, in the archimedean case one has analogous
results (with slightly different statements) in which one restricts to Hermitian
matrices.
Notes
The subject of archimedean matrix inequalities is an old one, with many important applications. A good reference for this is [28]; for instance, see [28,
I.2] for the singular value decomposition, [28, Theorem II.3.6] for the Weyl
Notes
73
74
Matrix analysis
Exercises
(1) Prove that any elementary matrix of type (a) (swapping two rows) can
be factored as a product of elementary matrices of types (b) and (c); see
Definition 4.0.3.
(2) Let e1 , . . . , e4 be a basis of C4 . Prove that in 2 C4 the element e1 e2 +
e3 e4 is not decomposable.
(3) Check that the characteristic polynomial of the companion matrix of a
polynomial P (Definition 4.3.9) is equal to P.
(4) Prove Proposition 4.4.1. (Hint: use Corollary 4.3.6.)
(5) With notation as in Theorem 4.3.11, suppose that U, V GLn (o F ) are
congruent to the identity matrix modulo m F . Prove that the product of
the i largest eigenvalues of U AV again has norm |1 i |. (Hint: use
exterior powers to reduce to the case i = 1.) This yields as a corollary
[34, Lemma 5]: if D GLn (F) is diagonal and U, V GLn (o F ) are
congruent to the identity matrix modulo m F then the Newton polygons of
D and U DV coincide.
(6) State and prove an archimedean analogue of the previous problem.
(7) Prove the following improved version of Proposition 4.4.6. Suppose that
the Newton slopes of A are nonnegative. Then there exists U GLn (F)
such that
|U 1 AU | 1,
|U 1 | 1,
|U |
n1
max{1, i }.
i=1
Part II
Differential Algebra
5
Formalism of differential algebra
(a, b R).
78
The latter computes Yoneda extensions; see Lemma 5.3.3 below. Elements of
H 0 (M) are said to be horizontal (see the notes). Note that H 0 (R) = ker(d)
is a subring of R; if R is a field then ker(d) is a subfield. We call them the
constant subring and constant subfield of R.
We now make an observation about base changes of the constant subring.
For the definition of a more general base change, see Definition 5.3.2. (See
also Proposition 6.9.1.)
Lemma 5.1.4. Let R0 be the constant subring of the differential ring (R, d),
and let R0 be an R0 -algebra. Let (M, D) be a differential module over (R, d).
View R = R R0 R0 as a differential ring by defining the derivation d by
ai ri =
d(ai ) ri
(ai R, ri R0 ).
d
i
79
Proof
(a) Tensoring the structure morphism R0 R0 with M induces a map
M M . This in turn induces maps H i (M) H i (M ) of R0 modules; using the R -module structure on H i (M ), we also obtain
maps H i (M) R0 R0 H i (M ).
(b) Tensoring with R0 is always a right exact functor on R0 -modules.
Since
D
M M H 1 (M) 0
is an exact sequence in the category of R0 -modules,
D
M M H 1 (M) R0 R0 0
is also exact. Hence the induced map H 1 (M) R0 R0 H 1 (M ) is an
isomorphism.
(c) Recall that, by definition, R0 is flat over R0 if and only if tensoring with
R0 is an exact functor on R0 -modules. Since
D
0 H 0 (M) M M
is an exact sequence in the category of R0 -modules,
D
0 H 0 (M) R0 R0 M M
is also exact. Hence the induced map H 0 (M) R0 R0 H 0 (M ) is an
isomorphism.
Another frequently used observation is the following.
Lemma 5.1.5. Let (R, d) be a differential field with constant subfield R0 .
Then, for any differential module (M, D) over (R, d), the natural map
H 0 (M) R0 R M is injective. In particular, dim R0 H 0 (M) dim R M.
Proof. An equivalent statement is that if m 1 , . . . , m n H 0 (M) are linearly
dependent over R then they are also linearly dependent over R0 . Suppose on
the contrary that, for some positive integer n, there exist m 1 , . . . , m n H 0 (M)
which are linearly dependent over R but linearly independent over R0 . Choose
n as small as possible with this property; then there exist c1 , . . . , cn R all
nonzero such that c1 m 1 + +cn m n = 0. We may rescale the ci so that c1 = 1.
Since m 1 , . . . , m n H 0 (M), we also have
d(c1 )m 1 + + d(cn )m n = 0.
80
That is, d(c1 ), . . . , d(cn ) form another linear dependence relation between
m 1 , . . . , m n . But d(c1 ) = d(1) = 0, so to avoid contradicting the choice of
n we must have d(c2 ) = = d(cn ) = 0. That is, c1 , . . . , cn R0 , contradicting the hypothesis that m 1 , . . . , m n are linearly independent over R0 . This
contradiction proves the claim.
Definition 5.1.6. Let M be a differential module over a differential ring R
admitting a finite exhaustive filtration with irreducible successive quotients.
(For instance, R could be a differential field and M could be a finitely generated differential module over R.) Then the multiset of these quotients is
independent of the choice of the filtration; we call them the (JordanHlder)
constituents of M.
n
Ni j ei .
i=1
We then have
D(v) =
n
i=1
d(vi ) +
n
Ni j v j ei .
j=1
vn ] then
D(v) = N v + d(v).
Conversely, it is clear that, given the underlying finite free R-module, any
differential module structure is given by such an equation.
81
Definition 5.2.2. With notation as in Definition 5.2.1, let v1 , . . . , vn be a second basis of M. The change-of-basis matrix from e1 , . . . , en to v1 , . . . , vn is
the n n matrix U defined by
Ui j ei .
vj =
i
82
n
m 1 m i1 D(m i ) m i+1 m n .
i=1
(A similar fact is true for the symmetric power SymnR M, but we will have
no need of this.) The module of R-homomorphisms Hom R (M1 , M2 ) may be
viewed as a differential module via the formula
D( f )(m) = D( f (m)) f (D(m));
the homomorphisms from M1 to M2 as differential modules are precisely the
horizontal elements of Hom R (M1 , M2 ). (In this case, the subscript is quite
crucial: we have Hom(M1 , M2 ) = H 0 (Hom R (M1 , M2 )).)
We write M1 for Hom R (M1 , R) and call it the dual of M1 ; if M1 is finite
free then Hom R (M1 , M2 )
= M1 M2 and the natural map M1 (M1 )
is an isomorphism. In particular, M1 M1 contains a horizontal element corresponding to the identity map M1 M1 ; we call this the trace (element) of
M1 M1 , and we call the trivial submodule generated by the trace element the
trace component of M1 M1 . If R is a Q-algebra then M1 M1 splits as the
direct sum of the trace component with the set of elements of Hom R (M1 , M2 )
of trace zero; we call the latter the trace-zero component of M1 M1 . Even
if R is not a Q-algebra, we can still view the trace component as a quotient of
M1 M1 by duality, but the map given by embedding the trace component
into M1 M1 and then projecting onto the trace component need not be an
isomorphism.
Lemma 5.3.3. Let M, N be differential modules with M finite free. Then the
group H 1 (M N ) is canonically isomorphic to the Yoneda extension group
Ext(M, N ).
Proof. The group Ext(M, N ) consists of equivalence classes of exact sequences 0 N P M 0 under the relation that this sequence
is equivalent to a second sequence 0 N P M 0 if there
is an isomorphism P
= P that induces the identity maps on M and N .
Addition consists of taking two such sequences and returning the Baer sum
0 N Q/ M 0, where Q is the set of elements in P P whose
images in M coincide and = {(n, n) Q : n N }. The identity element
is the split sequence 0 N M N M 0. The inverse of a sequence
0 N P M 0 is the same sequence with the map N P negated.
(See [216, 3.4] for the proof that this indeed gives a group.)
83
84
always flat. If R is noetherian then a finitely generated R-module M is projective if and only if it is locally free, i.e., if there exists a finite subset f 1 , . . . , f m
of R, generating the unit ideal, such that M[ f i1 ] is free over R[ f i1 ] for each
i [84, Exercise 4.12].
i=0
(5.4.2.1)
85
s i i (m, z) = 0
(s Z)
(5.4.2.2)
i=0
for
i
i+ j
i (m, z) =
m D i+ j1 (m)D j (z)D i+ j+1 (m) D (m).
i
j=0
(m, z M);
(m M).
But this means that the dimension of the span of m, D(m), . . . is at most 1,
contradicting the definition of .
j
j
ai T i
bjT j =
ai d h (b j )T i+ jh .
h
i=0
j=0
i, j=0 h=0
(a R)
and then check that the result is a not necessarily commutative ring (see the
exercises). We define the degree of a twisted polynomial, in the usual way,
86
M2
= R{T }/R{T }P2 .
87
88
Remark 5.7.1. In ordinary linear algebra (or in other words, when considering differential modules for the trivial derivation), one can pass freely between
linear transformations on a vector space and square matrices if one is willing to
choose a basis. The merits of doing this depend on the situation, so it is valuable to have both the matricial and coordinate-free viewpoints well in hand.
One can then pass to the characteristic polynomial, but not all information is
retained (one loses information about nilpotency) and even information that
in principle is retained is sometimes not so conveniently accessed. In short,
no one would seriously argue that one can dispense with studying matrices
because of the existence of the characteristic polynomial.
The situation is not so different in the differential case. The difference
between a differential module and a differential system is merely the choice of
a basis, and again it is valuable to have both points of view in mind. However,
the cyclic vector theorem may seduce one into thinking that collapsing a differential system into a differential polynomial is an operation without drawbacks,
but this is far from the case. For instance, determining whether two differential
polynomials correspond to the same differential system is not straightforward.
More seriously for our purposes, the cyclic vector theorem only applies
over a differential field. Many differential modules are more naturally defined
over some ring which is not a field. For instance, differential modules arising from geometry (such as PicardFuchs modules) are usually defined over
a ring of functions on some geometric space. While there are forms of the
cyclic vector theorem available over nonfields (see for instance Theorem 5.7.3
below), these do not suffice for our purposes. We also find that working with
differential modules instead of differential polynomials has a tremendously
clarifying effect, partly because it improves the parallelism with difference
algebra, where there is no good analogue of the cyclic vector theorem even
over a field. (See Part IV.)
We find it unfortunate that much literature on complex ordinary differential equations, and nearly all the literature on p-adic ordinary differential
equations, is mired in the language of differential polynomials. By switching
instead between differential modules and differential polynomials, as appropriate, we will be able to demonstrate strategies that lead to a more systematic
development of p-adic theory.
As promised, we offer some results concerning cyclic vectors over rings,
due to Katz [122].
Theorem 5.7.2. Let R be a differential local Q-algebra. Suppose that the maximal ideal of R contains an element t such that d(t) = 1. Let M be a finite free
differential module over R, and let e1 , . . . , en be a basis of M. Then
v=
89
j
n1 j
t
j
(1)k
D k (e j+1k )
j!
k
j=0
k=0
is a cyclic vector of M.
Proof. For i, j 0, put
j
k j
c(i, j) =
(1)
D k (ei+ j+1k )
k
k=0
n1 j
t
j=0
j!
c(i, j) ei+1
(mod t).
n1 j
x
c(i, j).
j!
j=0
(x R).
90
i!
D i (m)
i=1
d(r )
ri
r i1
D i (m) +
D i+1 (m)
(i 1)!
i!
i=0
Exercises
91
Notes
The subject of differential algebra is rather well developed; a classic treatment,
though possibly too dry to be useful to the casual reader, is the book of Ritt
[188]. As in abstract algebra in general, the development of differential algebra
was partly driven by differential Galois theory, i.e., the study of the expression
of solutions of differential equations in terms of solutions to ostensibly simpler
differential equations. A relatively lively introduction to the latter is [202].
Calling an element of a differential module horizontal when it is killed by
the derivation makes sense if you consider connections in differential geometry. In that setting, the differential operator is measuring the extent to which a
section of a vector bundle deviates from some prescribed horizontal direction
identifying points on one fibre with points on nearby fibres.
The history of the cyclic vector theorem is rather complicated. It appears
to have been first proved by Loewy [162] in the case of meromorphic functions, and (independently) by Cope [60] in the case of rational functions. For a
detailed historical discussion, see [53].
Twisted polynomials were introduced by Ore [177]. They are actually somewhat more general than we have discussed; for instance, one can also twist
by an endomorphism : R R by imposing the relation T a = (a)T .
(This enters the realm of the analogue of differential algebra called difference
algebra, which we will treat in Part IV.) Moreover, one can twist by both an
endomorphism and a derivation if they are compatible in an appropriate way,
and one can even study differential or difference Galois theory in this setting. A unifying framework for doing so, which is also suitable for considering
multiple derivations and automorphisms, was given by Andr [4].
Differential algebra in positive characteristic has a rather different flavor
than in characteristic 0; for instance, the pth power of the derivation d/dt on
F p (t) is the zero map. A brief discussion of the characteristic- p situation is
given in [80, III.1].
Exercises
(1) Prove that if M is a locally free differential module over R of rank 1 then
M M is trivial (as a differential module).
(2) Check that the bijection Ext(R, N ) H 1 (N ) constructed in the proof of
Lemma 5.3.3 is indeed a homomorphism.
92
(3) Check that, in characteristic p > 0, the cyclic vector theorem holds for
modules of rank at most p but may fail for modules of rank p + 1.
(4) Give a counterexample to the cyclic vector theorem for a differential field
of characteristic 0 with trivial derivation.
(5) Verify that R{T } is indeed a not necessarily commutative ring; the content
in this exercise is to check the associativity of multiplication.
(6) Prove the division algorithm (Proposition 5.5.2).
6
Metric properties of differential modules
In this chapter, we study the metric properties of differential modules over nonarchimedean differential rings. The principal invariant that we will identify is
a familiar quantity from functional analysis known as the spectral radius of a
bounded endomorphism. When applied to the derivation acting on a differential module, we obtain a quantity which can be related to the least slope of the
Newton polygon of the corresponding twisted polynomial.
We can give meaning to the other slopes as well, by proving that over a
complete nonarchimedean differential field any differential module decomposes into components whose spectral radii are computed by the various
slopes of the Newton polygon. However, this theorem will provide somewhat incomplete results when we apply it to p-adic differential modules in
Part III; we will have to remedy the situation using Frobenius descendants and
antecedents.
This chapter provides important foundational material for much of what
follows, but on its own it may prove indigestably abstract at first. The reader
arriving at this opinion is advised to read Chapter 7 in conjunction with this
one, to see how the constructions of this chapter become explicit in a simple
but important class of examples.
94
of G. Recall that this means that there exists c 0 such that |T (g)| c|g| for
all g G.
Definition 6.1.2. The operator norm |T |G of T is defined to be the least c 0
for which |T (g)| c|g| for all g G, i.e.,
|T |G =
gG,g=0
|T |sp,G = lim |T s |G ;
s
the existence of the limit follows from the fact |T m+n |G |T m |G |T n |G and
from the following lemma.
Lemma 6.1.4 (Fekete). Let {an }
n=1 be a sequence of real numbers such that
am+n am + an for all m, n. Then the sequence {an /n}
n=1 either converges
to its supremum or diverges to +.
Proof. Exercise.
Proposition 6.1.5. The spectral radius of T depends on the norm | | only up
to metric equivalence.
Proof. Suppose that | | is a norm metrically equivalent to | |. We can then
choose c > 0 such that c1 |g| |g| c|g| for all g G. We then have
|T (g)|/|g| c2 |T (g)| /|g| for all g G \ {e}. Applying this with T replaced
by T s gives |T s |G c2 |T s |G , so
|T s |sp,G lim c2/s (|T s |sp,G )1/s .
s
95
96
nonarchimedean differential field, we can define the operator norm |d| F and
the spectral radius |d|sp,F ; by hypothesis the former is finite, so the latter is too.
Definition 6.2.2. Let F be a nonarchimedean differential field. By a normed
differential module over F we mean a vector space V over F equipped with a
compatible norm | |V and a derivation D with respect to d which is bounded
as a endomorphism of the additive group of V . For V nonzero, we may then
consider the operator norm |D|V and the spectral radius |D|sp,V .
Remark 6.2.3. If V is finite-dimensional over F and F is complete then the
spectral radius does not depend on the norm on V , since by Theorem 1.3.6 any
two norms on V compatible with the norm on F are metrically equivalent.
In general, one cannot have differential modules with arbitrarily small
spectral radius.
Lemma 6.2.4. Let F be a nonarchimedean differential field, and let V be a
nonzero normed differential module over F. Then
|D|sp,V |d|sp,F .
Proof. (This proof was suggested by Liang Xiao.) For a F and v V
nonzero, the Leibniz rule gives
si
si si j
D si (a D i (v)) = d si (a)D i (v)+
(a)D i+ j (v) (0 i s).
d
j
j=1
s
cs,i D si (a D i (v))
i=0
(6.2.4.1)
By Lemma 6.1.8, given > 0 we can choose c = c() such that, for all s 0,
|D s |V c(|D|sp,V + )s .
Using (6.2.4.1), we deduce that
|d s (a)v|V c2 (|D|sp,V + )s |a||v|V .
Dividing by |a||v|V and taking the supremum over a F, we obtain
|d s | F c2 (|D|sp,V + )s .
97
i (Ds )i j ei ). Then
|D|sp,V = max{|d|sp,F , lim sup |Ds |1/s }.
(6.2.5.1)
Proof. (Compare [49, Proposition 1.3].) Equip V with the supremum norm
defined by e1 , . . . , en ; then |D s |V maxi, j |(Ds )i, j |. This plus Lemma 6.2.4
imply that the left-hand side of (6.2.5.1) is greater than or equal to the the
right-hand side.
Conversely, for any x V , if we write x = x1 e1 + + xn en with
x1 , . . . , xn F then
s
n
s j
D s (x) =
d (xi )D s j (ei ),
j
i=1 j=0
so
1/s
1/s
|D s |V max |d j | F |Ds j |1/s .
0 js
(6.2.5.2)
Given > 0, apply Lemma 6.1.8 to choose c = c() such that, for all s 0,
s
|d s | F c |d|sp,F + ,
s
|Ds |V c lim sup |Ds |1/s + .
s
1/s
|D s |V c2/s max |d|sp,F + , lim sup |Ds |1/s + .
s
98
Using Lemma 6.2.5, we may infer the following base-change property for
the spectral radius. We will need to refine this result later; see Corollary 6.5.5
and Proposition 10.6.6 below.
Corollary 6.2.7. Let F F be an isometric embedding of complete nonarchimedean differential fields. (In particular, the differential on F must restrict
to the differential on F.) Then, for any nonzero finite differential module V
over F,
|D|sp,V = max{|d|sp,F , |D|sp,V }.
Consequently, if |d|sp,F = |d|sp,F then |D|sp,V = |D|sp,V by Lemma 6.2.4.
Proof. Choose a basis of V , and use Lemma 6.2.5 to compute both |D|sp,V
and |D|sp,V in terms of this basis. This yields the desired formula.
Here is how the spectral radius behaves with respect to basic operations on
the category of differential modules.
Lemma 6.2.8. Let F be a complete nonarchimedean differential field.
(a) For a short exact sequence 0 V1 V V2 0 of nonzero finite
differential modules over F,
|D|sp,V = max{|D|sp,V1 , |D|sp,V2 }.
(b) For V a nonzero finite differential module over F,
|D|sp,V = |D|sp,V .
(c) For V1 , V2 nonzero finite differential modules over F,
|D|sp,V1 V2 max{|D|sp,V1 , |D|sp,V2 },
with equality when |D|sp,V1 = |D|sp,V2 .
Proof. To prove (a), choose a splitting V = V1 V2 of the short exact sequence
in the category of vector spaces over F. The action of D on V is then given
by D(v1 , v2 ) = (D(v1 ) + f (v2 ), D(v2 )) for some F-linear map f : V2 V1
(as in the proof of Lemma 5.3.3). From this, the rest of the proof of (a) is an
exercise in the spirit of the proofs of Lemmas 6.2.4 and 6.2.5.
We next recall from Definition 5.3.2 that for any finite differential modules
V1 , V2 over F, the action of D on V = Hom F (V1 , V2 ) is given by
D( f )(v1 ) = D( f (v1 )) f (D(v1 )).
99
The Leibniz rule in this setting is that, for any nonnegative integer s,
!
s
"
s
i i
(1)
D i f (D si (v1 )) .
D ( f )(v1 ) =
s
i=0
100
1/
p
p > 0,
h0 |(F ) |
where p is the characteristic of F .
101
i
i Pi T F[T ] be the minimal polynomial of . Then the unique extension
of d to F is characterized by
0 = d()P () +
d(Pi ) i .
(6.2.15.1)
i
If
o
F
102
m1 i/m
with ci F, and the norm on F is given by
i=0 ci t
m1
ci t i/m = max{|ci ||t|i/m }.
i
i=0
It thus suffices to check that |d(ci t i/m )| |d| F |ci t i/m | for i = 0, . . . , m 1.
By the Leibniz rule, this reduces to checking |d(t i/m )| |d| F |t i/m |. But
|d(t i/m )| = |t i/m1 d(t)| |t i/m ||t|1 |d| F |t| = |d| F |t i/m |,
so the claim follows.
To treat the general case, we may choose an integer m not divisible by p and
annihilating I E/F and some t1 , . . . , th F such that |t1 |, . . . , |th | generate
1/m
1/m
|F |/|F |m . Put F = F(t1 , . . . , th ). By the two previous paragraphs,
1/m
we have |d| F = |d| F . (More precisely, when adjoining ti
we split the
reasoning into two cases: first, for e the largest divisor of m such that |ti |1/e is
1/e
already present, adjoining ti gives an unramified extension, as in the proof
of Proposition 3.3.6, so the operator norm of d does not change. After that,
1/m
proceeds as in the second paragraph.) By Proposition 3.3.6, F
adjoining ti
is contained in an unramified extension of F so the claim follows by applying
the first paragraph again.
(6.3.1.1)
103
(6.3.1.2)
Then equality holds in (6.3.1.1), and if |d|sp,F > 0 then the limit inferior on
the right is also a limit.
The condition (6.3.1.2) is satisfied in the situation of greatest interest to us;
see Proposition 9.10.2.
Proof. By taking T s F{T }(s) we obtain the inequality |D s |V |d s | F |Ds |.
Taking sth roots of both sides and then taking limits as s yields (6.3.1.1).
Conversely, suppose that (6.3.1.2) holds. Given > 0, apply Lemma 6.1.8
to choose c > 0 such that, for all s 0,
|D s |V c(|D|sp,V + )s .
s
Given a nonnegative integer s such that |Ds | > , choose P = i=0
Pi T i
(s)
F{T } nonzero and such that |P(D)|V |P(d)| F (|Ds | ). (This could
only fail to hold if we had |P(d)| F = 0 for all P F{T }(s) , but the presence
of constant polynomials eliminates this possibility.) Then
max{|Pi ||d|isp,F (|Ds | )} max{|Pi d i | F (|Ds | )}
is
is
= |P(d)| F (|Ds | )
|P(D)|V
max{|Pi D i |V }
is
max{|Pi |c(|D|sp,V + )i }.
is
This forces equality in (6.3.1.1) and forces the limit inferior therein to be a
limit provided that |d|sp,F > 0.
104
105
provided that d is nonzero. (If d = 0 then (x +i) log |d| F must be interpreted
to mean 0 if x + i = 0 and + if x + i > 0.)
106
i=0
i=0
then ||V = |d| F , D is F-linear, and |D |V = er1 . Then, for all positive
integers s,
|(D )s |V = er1 s ,
107
j1
dependent then we have D j (ei ) = h=0 ch D h (ei ) for some ch F. Since
|D|sp,V 1 and |d| F 1, Theorem 6.5.3 implies that |ch | 1 for each h.
Hence M is finitely generated, and thus free, over o F .
Let | |V be the supremum norm on V defined by a basis of M. Then
|x|V |x|V because e1 , . . . , en M. Conversely, for i = 1, . . . , n and
h = 0, . . . , j (i) 1, we have |D h (ei )|V |D|hV ch cn1 . Since the
D h (ei ) generate M, this implies that |x|V cn1 |x|V for all x V .
108
Vs
s>|d| F
109
= | A||N12 | .
Let X be the block upper triangular nilpotent matrix with X 12 = AN12 , put
U = I X , and put
N = U 1 N U + U 1 d(U ).
110
( j = 1, 2).
in which the first two terms have norm at most | A|1 and the third has norm
at most |d| F .
, we write it as (I + X N A)N . Because |X N A|
To analyze N11
12 21
11
12 21
|X 12 |(|N21 ||A|) < < 1 the first factor is invertible, and it and its inverse both
is invertible, |N | = |N |, and |(N )1 | = |A|.
have norm 1. Hence N11
11
11
11
Since max{, |d| F |A|} for some fixed < 1, iterating the construction
of N from N yields a convergent sequence of conjugations whose limit has
the desired property.
We need a version of the argument used in the proof of Theorem 6.5.3 that
is no longer restricted to cyclic vectors.
Lemma 6.7.2. Let F be a complete nonarchimedean differential field. Let V
be a nonzero finite differential module over F. Let e1 , . . . , en be a basis of
V , and let N be the matrix of the action of D on e1 , . . . , en . Suppose that
|N | = > |d| F and |N 1 | = 1 . Then the full spectrum of V consists
entirely of .
Proof. As in the proof of Theorem 6.5.3, we find that for the supremum norm
for e1 , . . . , en we have |D s (v)|V = s |v|V for all nonnegative integers s and
all v V . Consequently, for any nonzero differential submodule W of V ,
we have |D|sp,W = . By Theorem 6.6.1, it follows that every irreducible
subquotient of V also has spectral radius , as desired.
111
112
113
we have |D|sp,V W < || if and only if each eigenvalue has norm strictly less
than ||, which occurs if and only if |/ 1| < 1 for all , .
Theorem 6.8.2 (Refined visible decomposition theorem). Let F be a complete nonarchimedean differential field of characteristic 0, and let V be a finite
differential module over F such that no subquotient of Vs has norm less than
or equal to |d| F . Then, for some finite tamely ramified extension F of F, there
exists a (unique) decomposition
%
Vi
V F F =
i
114
115
Let R be the completed tensor product of R R0 R0 for the product norm
(which is a norm by Lemma 1.3.11). View R as a differential ring by equipping
it with the unique continuous extension of d with R0 in its kernel. Then, for any
differential module M over R, the natural map
H 0 (M) R0 R0 H 0 (M R R )
is an isomorphism of R0 -modules.
Proof. We first check injectivity. Since R0 is flat over R0 , the map H 0 (M) R0
R0 H 0 (M R0 R0 ) is bijective by Lemma 5.1.4. Since M is finite over
R, we may identify M R R with the completed tensor product of M R
(R R0 R0 ) = M R0 R0 , on which the product seminorm is a norm by
Lemma 1.3.11. In particular M R0 R0 injects into M R R , so the map
H 0 (M R0 R0 ) H 0 (M R R ) is also injective.
We next check surjectivity in the case where R0 is the completion of a
finitely generated field extension S0 of R0 . Then S0 is of countable dimension
over R0 , so the hypothesis of Lemma 1.3.8 is satisfied by F = R0 , V = R0 .
Let m 1 , m 2 , . . . be the sequence of elements of R0 given by Lemma 1.3.8. As
in the proof of Lemma 1.3.11 these define projection maps j,R : R R and
j,M : M R R M; these maps are both horizontal.
s
j,R (z k )yk
k=1
j,M (x) m j .
j=1
Hence x lies in the closure of H 0 (M) R0 R0 under the product norm. However,
since H 0 (M) is finite-dimensional over R0 by Lemma 5.1.5, H 0 (M) R0 R0
is complete under the product norm, hence closed. Thus x H 0 (M) R0 R0
as desired.
We next check surjectivity in general. Pick any x H 0 (M R R ). As noted
earlier, we may identify M R R with the completion of M R0 R0 under the
product norm. We can thus choose a convergent series
x=
k=1
yk z k
(yk M, z k R0 ).
116
Notes
Lemma 6.2.5 is tacitly assumed at various places in the literature (including
by the present author) but we were unable to locate even an explicit statement, let alone a proof. We thank Liang Xiao for contributing the proof given
here.
Proposition 6.3.1 is modeled on some ongoing work of Baldassarri and Di
Vizio (a promised sequel to [11]), which gives a development of much of the
material we are discussing from the point of view of Berkovich analytic spaces.
This point of view will probably be vital for the study of differential modules
on higher-dimensional spaces. However, there are some key differences from
our approach; see the notes for Chapter 9.
Newton polygons for differential operators were considered by Dwork and
Robba [81, 6.2.3]; the first systematic treatment seems to have been given by
Robba [189]. Our treatment using Theorem 2.2.2 follows [42].
The proof of Theorem 6.5.3 given here is close to the original proof of Christol and Dwork [49, Thorme 1.5], save that we avoid a small logical gap in the
latter. The gap is in the implication 1 2; there one made a finite extension
of the differential field without accounting for the possibility that this might
increase |d| F to the point where Corollary 6.2.7 fails to show that |D|V is preserved. (It would be obvious that this does not occur if the finite extension were
being made in the constant subfield, but that was not the case.) Compare also
[80, Lemma VI.2.1].
Proposition 6.5.6 answers a conjecture of Christol and Dwork [48, Introduction, Conjecture A]. This conjecture was posed in the context of giving
effective convergence bounds, and that is one use to which we will put it here;
see Theorem 18.2.1 and its proof.
The use in Lemma 6.7.3 of a well-chosen norm (meaning a norm in which at
least some singular values match eigenvalues of the matrix of action of d) is an
extension of the notion of the canonical lattice (or DeligneMalgrange lattice)
introduced by Malgrange in the study of irregular meromorphic connections.
See [165].
The refined visible decomposition theorem is due to Liang Xiao [221]. It
was motivated by applications to refined Swan conductors of tale sheaves
and overconvergent isocrystals, as suggested by the work of Kato [119] in the
rank 1 case.
Exercises
117
We thank Andrea Pulita for the suggestion that Proposition 6.9.1 should be
included.
Exercises
(1) Prove Feketes lemma (Lemma 6.1.4).
(2) (a) Let A, B be commuting bounded endomorphisms on an abelian group
G equipped with a norm. Prove that
|A + B|sp,G |A|sp,G + |B|sp,G .
(b) Prove that if the norm on G is nonarchimedean then the inequality in
(a) can be improved to
|A + B|sp,G max{|A|sp,G , |B|sp,G }
(3)
(4)
(5)
(6)
(7)
and that equality occurs when the maximum is achieved only once.
(c) Prove that both these assertions may fail in the case where A and B
do not commute. (Hint: write an identity matrix as a sum of nilpotent
matrices.)
Prove that, in Lemma 6.2.5, if |D|sp,V > |d|sp,F then |Ds |1/s converges to
a limit as s . (Hint: again reduce to Lemma 6.1.4.)
Fill in the missing arguments in the proofs of Lemma 6.2.8(a), (b), using
Lemma 6.1.8.
Prove the claim from the proof of Lemma 6.4.2 that, for R a nonarchimedean differential domain, > |d| R , and P, Q R{T }, we have
|P Q Q P| 1 |d| R |P| |Q| . (Hint: reduce to the case where P
and Q are monomials and then to the case where P = T and Q R.)
Exhibit an example to show that, in Definition 6.4.3, if we had not omitted
slopes greater than or equal to log |d| R then it would no longer be the
case that the multiplicity of a slope r in a product P Q is the sum of the
multiplicities of r as a slope of P and as a slope of Q. (Hint: this can
already be seen using the t-adic valuation on Q(t) with d = d/dt.)
Let (F, d) be a complete nonarchimedean differential field. Let F be the
completion of F(t) for the -Gauss norm. Prove that there is a unique
continuous extension of d to F with d(t) = 0 and that it satisfies
|d| F = |d| F ,
|d|sp,F = |d|sp,F .
7
Regular singularities
In the next part of the book, which begins with Chapter 8, we will use the
results from the previous chapters to make a detailed analysis of ordinary
differential equations over nonarchimedean fields of characteristic 0, the motivating case being that of positive residual characteristic. However, before doing
so it may be helpful to demonstrate how the results apply in a somewhat
simpler setting.
In this chapter, we reconstruct some of the traditional Fuchsian theory of
regular singular points of meromorphic differential equations. (The treatment
is modeled on [80, 3].) We first introduce a quantitative measure of the
irregularity of a singular point. We then recall how, in the case of a regular singularity (i.e., a singularity with irregularity equal to zero), one has an
algebraic interpretation, using the notion of exponents, of the eigenvalues of
the monodromy operator around the singular point. We then describe how to
compute formal solutions of meromorphic differential equations and go on
to sketch the proof of Fuchss theorem, that the formal solutions of a regular
meromorphic differential equation actually converge in some disc. We finally
establish the TurrittinLeveltHukuhara decomposition theorem, which gives
a decomposition of an arbitrary formal differential module that is analogous to
the eigenspace decomposition of a complex linear transformation. The search
for an appropriate p-adic analogue of this result will lead us in Part V to the
p-adic local monodromy theorem.
Although this chapter focuses on issues different from much of the rest of
the book, it should not be considered entirely optional. It is referenced in the
discussions of p-adic exponents in Chapter 13 and of effective convergence
bounds in Chapter 18.
Hypothesis 7.0.1. Throughout this chapter, we will view C((z)) as a complete
nonarchimedean differential field with valuation given by the z-adic valuation
118
7.1 Irregularity
119
7.1 Irregularity
Definition 7.1.1. Let V be a finite differential module over C((z)), and
decompose V according to Theorem 6.6.1. Define the irregularity of V as
( log s) dim(Vs ).
irr(V ) =
s>1
d1
integer. More explicitly, if P = T d + i=0
Pi T i then
irr(V ) = max 0, max{vz (Pi )} .
i
Proof. The second assertion follows from Corollary 6.5.4, since in this case
|d| F = |d|sp,F = 1. The first assertion follows from the second because, by
Theorem 5.4.2, V always admits a cyclic vector.
Theorem 7.1.2 gives rise to several criteria for regularity.
Corollary 7.1.3. Let F be any subfield of C((z)) containing z and stable
under d, and let V be a finite differential module over F. Then the following
conditions are equivalent.
(a) The module V is regular, i.e., irr(V ) = 0.
(b) For some isomorphism V
= F{T }/F{T }P with P monic, P has coefficients in o F .
(c) For any isomorphism V
= F{T }/F{T }P with P monic, P has coefficients in o F .
(d) There exists a basis of V on which D acts via a matrix over o F .
(e) For any basis B of V , the o F -span of the set {D i (v) : i {0, . . . ,
dim(V ) 1}, v B} is stable under D.
120
Regular singularities
Proof. By Theorem 7.1.2, (a) implies (c). It is obvious that (c) implies (b) and
that (b) implies (d). Given (d), let | |V be the supremum norm defined by the
chosen basis of V ; then |D|V 1, which implies (a).
This proves that (a), (b), (c), (d) are all equivalent. To add (e) to the circle of implications, on the one hand note that (e) implies (d). On the other
hand, given (a), pick any v B; then v, D(v), D 2 (v), . . . generate a differential submodule W of V for which v is a cyclic vector. Since V is regular,
its submodule W is also regular by Lemma 6.2.8; since (a) implies (c), the
o F -span of {D i (v) : i {0, . . . , dim(W ) 1}} is stable under D. This
implies (e).
Remark 7.1.4. One can also view C((z)) as a differential field with the derivation d/dz instead of zd/dz. The categories of differential modules for these
two choices of derivation are equivalent in an obvious fashion: given the action
zd/dz, we obtain the action d/dz by dividing by z. If V is a differential module
for zd/dz with spectral radius s then the spectral radius of V for d/dz is s|z|1
(exercise). The notion of irregularity translates naturally: for instance, if V is
a differential module for d/dz that is isomorphic to F{T }/F{T }P for some
n1
Pi T i then V is regular if and only if vz (Pi ) n + i for
P = T n + i=0
i = 1, . . . , n. For example, for a, b C, the differential system corresponding
to the hypergeometric differential equation
y +
c (a + b + 1)z
ab
y
y=0
z(1 z)
z(1 z)
is regular.
Example 7.1.5. For another example from the classical theory of ordinary
differential equations, consider the Bessel equation
y +
1 z2 n2
y=0
y +
z
z2
121
Theorem 7.2.1 (Cauchy). Fix > 0, and let R Cz be the ring of
power series convergent for |z| < . Let N be an n n matrix over R. Then
the differential system D(v) = N v + (d/dz)(v) has a basis of horizontal
sections.
Proof. This can be deduced from the fundamental theorem of ordinary differential equations; however, for future reference it will be useful to give a slightly
more detailed explanation. (Specifically, we are anticipating Definition 7.3.1.)
Note that there exists a unique n n matrix U over Cz such that U In
i
and U = i=0 Ui z and then rewriting the equation N U + (d/dz)(U ) = 0 as
the recurrence
(i + 1)Ui+1 =
i
N j Ui j
(i = 0, 1, . . . ).
(7.2.1.1)
j=0
Following an argument of Cauchy [80, Appendix III], we may deduce that the
entries of U converge on a disc of positive radius, as follows. (In this argument,
we use the L 2 operator norm on matrices over C.) Pick any (0, ). Since
N converges in the open disc of radius , we have |Ni |i 0 as i ; in
particular, we may choose c > 1 with |Ni |i+1 c for all i. We then have by
induction on i that
|Ui |i ci
(i 0):
|Ui+1 |i+1
j=0
122
Regular singularities
123
i
Definition 7.3.1. Let N =
i=0 Ni t be an n n matrix with entries in
K z. A fundamental solution matrix for N is an n n matrix U with U In
(mod z) such that U 1 N U + U 1 z(d/dz)(U ) = N0 .
Remark 7.3.2. Note that if U is a fundamental solution matrix for N then
d
T T T
T d
T
T T T
T T
T
U N U + U z (U ) = U N U U U
z (U ) U T
dz
dz
d
= U T N T U T z (U T ) U T
dz
= N0T .
That is, U T is a fundamental solution matrix for N T . Consequently, by
proving a general result about U we also obtain a corresponding result for
U T , and hence for U 1 .
To specify when a fundamental solution matrix exists, we need the following
definition.
Definition 7.3.3. We say that a square matrix N with entries in a field of
characteristic 0 has prepared eigenvalues if the eigenvalues 1 , . . . , n of N
satisfy the following conditions:
i Z i = 0,
i j Z i = j .
If only the second condition holds, we say that N has weakly prepared
eigenvalues.
We will also need the following lemma, which will come up again several
times later.
Definition 7.3.4. Let N be a nilpotent n n matrix over K . The nilpotency
index of N is the smallest positive integer e such that N e = 0.
Lemma 7.3.5. Let N1 and N2 be matrices of respective sizes m m and n n
over K . Let 1,1 , . . . , 1,m and 2,1 , . . . , 2,n be the eigenvalues of N1 and
N2 , respectively. Then the eigenvalues of the K -linear endomorphism X
N1 X + X N2 on the space of m n matrices over K are equal to 1,i + 2, j for
i = 1, . . . , m, j = 1, . . . , n. Moreover, if N1 and N2 are themselves nilpotent,
124
Regular singularities
matrices N1e1 1 , N2e2 1 must be nonzero, so we can make this product nonzero
by choosing a suitable matrix X .
Ui t i and write
equation as NU +z(d/dz)(U ) = U N0 , then expand U as i=0
the new defining equation as a recurrence:
iUi = Ui N0 N0 Ui
i
N j Ui j
(i > 0).
(7.3.6.1)
j=1
125
126
Regular singularities
127
128
Regular singularities
Theorem 7.5.3. Let V be a finite differential module over K ((z)) such that
V V is regular. Then there exists a differential module W of rank 1 such
that W V is regular.
Corollary 7.5.4. Suppose that K is algebraically closed. Let V be a finite
differential module over K ((z)) such that V V is regular. Then there is a
unique decomposition V = i Vi such that Vi V j has all exponents zero if
i = j and all exponents nonzero if i = j.
Proof. If V itself is regular, the corollary follows from Proposition 7.3.12 and
the fact that a linear transformation over an algebraically closed field decomposes as a direct sum of generalized eigenspaces. In general we may replace
V with W V for some V as specified in Theorem 7.5.3 and reduce to the
regular case.
The strongest form of the TurrittinLeveltHukuhara decomposition is
the following statement, which both eliminates the base extension in
Theorem 7.5.1 and incorporates the statement of Theorem 7.5.3.
Definition 7.5.5. Let h be a positive integer, and suppose that P =
{P1 , . . . , Ph } is a Galois orbit over K ((z)). Let F be a finite Galois extension of K ((z)) containing P1 , . . . , Ph . Then the differential module of rank h
over F with generators e1 , . . . , eh satisfying
d(ei ) = Pi ei
(i = 1, . . . , h)
Notes
129
Notes
The notion of a regular singularity was introduced by Fuchs in the nineteenth century as part of a classification of those differential equations
with everywhere-meromorphic singularities on the Riemann sphere which
have algebraic solutions. Regular singularities are sometimes referred to as
130
Regular singularities
Fuchsian singularities. Much of our modern understanding of the regularity condition, especially in higher dimensions, comes from the book of
Deligne [68].
As noted in the text, Corollary 7.3.11 is false for irregular modules. This was
originally noticed in numerous examples of particular differential equations
(e.g., the Bessel equation at infinity; see Example 20.2.1) and motivated the
definition of irregularity in the first place.
The relationship between the interpretations of irregularity using the Newton
polygon and using indices is due to Malgrange [164]. Our treatment, in which
the spectral radius plays a pivotal role, is based on [80, 3]; this point of view
is ultimately due to Robba.
A complex analytic interpretation of the Newton polygon, in the manner
of the relation between irregularity and index, was given by Ramis [186]. It
involves considering subrings of C{z} composed of functions having certain
extra convergence restrictions (Gevrey functions) and looking at the index of
z d/dz after tensoring the given differential module with one of these subrings.
Theorem 7.5.6 is a slight reformulation of classification results due to
Turrittin [212] (building on earlier work of Hukuhara in the rank 1 case) for the
existence aspect and to Levelt [158] for the uniqueness aspect. See [7, II.3.1]
for further discussion; we further recommend [7] for higher-dimensional
analogues of the results discussed in this chapter. See [144] for some additional higher-dimensional analogues, presented in much the same style as in
this chapter.
Exercises
(1) Let V be a differential module over K ((z)) with spectral radius s for
zd/dz. Prove that the spectral radius of V for d/dz is s|z|1 .
(2) Verify the claims of Example 7.1.5.
(3) Let N be an n n matrix over K z whose constant term has prepared
eigenvalues. Prove that any vector v K ((z))n with N v +z(d/dz)(v) = 0
is a K -linear combination of columns of the fundamental solution matrix
of N . (Hint: change basis first.)
i
(4) Prove Fuchss theorem (Theorem 7.3.8). (Hint: let U =
i=0 Ui z be
the fundamental solution matrix of N . Relate the norms of Ui and iUi
Ui N0 + N0 Ui by considering the singular values of the map X X
X N0 + N0 X . Then use (7.3.6.1) to obtain
i
Ui Ui N0 N0 Ui 1
|N j ||Ui j | max {|N j ||Ui j |}
i
1 ji
i
j=1
Exercises
131
(5) Prove Proposition 7.3.10. (Hint: show that, for any space V of constant
vectors stable under multiplication by N0 , the set of vectors with entries
in K z whose reductions modulo z belong to V are closed under the
operation v N v + d(v). Then take V to be a union of generalized
eigenspaces.)
(6) Prove Proposition 7.4.2. (Hint: change basis by preparing eigenvalues and
then applying Theorem 7.3.8.)
(7) Let Q K ((z))[U ] be a polynomial whose roots 1 , . . . , n K ((z))alg
satisfy |i / j 1| < 1 for all i, j. Prove that there exists K ((z)) such
that |i / 1| for all i. Then show that this fails without the hypothesis
that K has characteristic 0.
Part III
p-adic Differential Equations on Discs
and Annuli
8
Rings of functions on discs and annuli
136
x = '0,
1/2
(
f (x) = 1/2 1/(N + 1) + N /(N + 2) (1/N x) x 1/(N + 1), 1/N ,
N = 1, 2, . . . ;
this function is piecewise affine on (0, 1] and continuous on [0, 1], but there is
no one-sided neighborhood of 0 on which f is affine.
Lemma 8.0.4. Let I = [, ] be a (bounded) closed interval. Let S be a set
of closed subintervals of I which cover every one-sided neighborhood of every
point in I . (That is, for any [, ), there exist ( , ] and J S
such that [ , ] J and, similarly, for any (, ], there exist [, ]
and J S such that [, ] J .) Then there exists a finite subset of S with
union I .
Proof. Exercise.
( [, ]) .
That is, consider the set of formal bidirectional power series which converge
whenever one inserts a value for t with |t| [, ] or, in other words, when
/|t| and |t|/ are both at most 1; it suffices to check for = and =
. Although formal bidirectional power series do not form a ring, the subset
K /t, t/ does form a ring under the expected operations. (In this and all
similar notation, we will omit when it is equal to 1.)
Definition 8.1.2. If = 0, the only reasonable interpretation of the previous
definition is to require ci = 0 for i < 0. When there are no negative powers
137
i=0
One can also write K /t, for which the implied value of is . More
succinctly put, we identify K /t, t/ with K 1 /t 1 , t 1 / 1 . We do
not allow both and to equal 0, as this case is exceptional: no condition is
then imposed on the power series except that negative indices cannot occur.
This results in the ring K t.
Remark 8.1.3. Note that if then, within K /t, t/,
K /t, t/ K /t, t/ = K /t, t/.
This will serve as the basis for some gluing arguments later.
Definition 8.1.4. We will also occasionally use the intermediate ring
i
i
K t/0 =
ci t K t : sup{|ci | } < ;
i
i=0
these are the power series which converge and take bounded values on the open
disc |t| < . (The notation will make more sense once we have also defined
K t/ for > 0; see Definition 18.4.1.) Note that, for any (0, ),
K t/ K t/0 K t/.
We will most often use this construction with = 1, in which case we can
also write
K t0 = o K t oK K .
Definition 8.1.5. An analogue of the previous construction for an annulus is
i
i
i
K /t, t/0 =
ci t : ci K , lim |ci | = 0, sup{|ci | } < ;
iZ
these are the Laurent series which converge and take bounded values on the
half-open annulus |t| < . For any [, ), this ring satisfies
K /t, t/ K /t, t/0 K /t, t/.
One can also use the boundedness condition on both sides, defining
i
i
i
ci t : ci K , sup{|ci | } < , sup{|ci | } < .
K /t, t/0 =
iZ
138
Remark 8.1.6. The rings of bounded series behave well only in the case where
K is discretely valued; otherwise, they are not even noetherian (exercise). For
general K it is better to work with rings of analytic elements; see Section 8.5.
for any [, ]. For = = 0, this reduces to simply |c0 |. The fact that
this is a multiplicative norm follows as in Proposition 2.1.2. In the additive
version (Gauss valuation) one takes r [ log , log ] and puts
i
vr
ci t = min{v(ci ) + ri},
i
where v(c) = log |c|. There is also a -Gauss norm on K /t, t/0 ,
although it must be defined as a supremum that may fail to be achieved if
K is not discretely valued.
Definition 8.2.2. One may define the Newton polygon for an element
x = i xi t i K /t, t/ as the boundary of the lower convex hull of the set
{(i, v(xi )) : i Z, xi = 0},
retaining only those slopes within [ log , log ].
139
140
(1 f ) j
(8.2.6.1)
j=0
h R K t 1 ,
141
142
has the same effect as another sequence in which all steps of type (a) and (b)
happen before all steps of type (c).
By Lemma 8.3.3 we can perform a sequence of elementary row operations
on A that produces the identity matrix. By this, plus the previous paragraph,
we can perform another sequence of elementary row operations, of types
(a) and (b), on A that produces a diagonal matrix D. By appending additional
row operations of type (c) with c K (resp. c K t Z ), we can force
|D In | < 1.
Let R1 , . . . , Rm be elementary matrices as above with R1 Rm A = D;
note that those R j of type (a) and (c) are already elementary matrices over
K [t] (resp. over K [t, t 1 ]), the case of type (c) being handled by the previous
paragraph. Put
= max{1, | A|1 }
m
max{1, |R j |1 }.
j=1
R j
143
Proof. We will explain only the case > 0; the case = 0 is similar. Choose
bases v1,1 , . . . , v1,n and v2,1 , . . . , v2,n of M1 and M2 , respectively. Let A be
the change-of-basis matrix from the v1,i to the v2,i , viewing both as bases of
M1 K /t, t/
= M2 K /t, t/ via . By Proposition 8.3.5 we can
factor A as U V with U GLn (K t/) and V GLn (K /t).
We can then construct a finite free module M over K /t, t/ equipped
with a basis v1 , . . . , vn such that the change-of-basis matrices from this basis
to the v1,i and to the v2,i are U 1 and V , respectively. This is the desired
module.
Remark 8.3.7. If < , one can similarly glue together a finite
free module over K /t, t/ and a finite free module over K /t, t/0 ,
whose base extensions to K /t, t/ are isomorphic; the result is a finite
free module over K /t, t/0 . As we will not use this fact, we omit further
details.
i=0
( (0, )) ;
these are the power series convergent on the open disc |t| < , with no
boundedness restriction. Note that we can write
)
K t/;
K {t/} =
(0,)
iZ
i+
( (0, )) ;
these are the Laurent series convergent on the half-open annulus |t| < .
144
These rings behave worse than their closed counterparts (see the exercises),
so we will only make occasional use of them. More often, we will work
with the following definition, which is motivated by considerations from rigid
analytic geometry.
Definition 8.4.3. Consider the region |t| I , for any interval I [0, +);
this could be a closed or open disc or a closed, open, or half-open annulus.
By a coherent locally free module M on this region, we will mean a sequence
of finite free modules Mi over K i /t, t/i , with [1 , 1 ] [2 , 2 ]
an increasing sequence of closed intervals with union I , together with isomorphisms Mi+1 K i /t, t/i
= Mi . Using Lemma 8.3.6 we check that the
construction is canonically independent of the choice of sequence. However,
the resulting object is not finite in the sense of admitting a finite generating set
over the entire annulus, except when K is spherically complete. For a special
case, see Proposition 16.1.4; the notes give further discussion.
145
146
147
148
(8.6.2.1)
As in the proof of Lemma 1.3.5 it follows that, for any positive integer j and
any a1 , . . . , an R/t j R,
c1 max{|ai | R/t j R } |a1 m 1 + + an m n | j c2 max{|ai | R/t j R }. (8.6.2.2)
i
Notes
149
Notes
The Hadamard three-circles theorem (Proposition 8.2.3(c)) is a special case of
the fact that the Shilov boundary of the annulus |t| consists of the two
circles |t| = and |t| = . For a considerable amplification of this remark,
including a full-blown theory of harmonic functions on Berkovich analytic
curves, see [205]. For an alternative presentation, restricted to the Berkovich
projective line but otherwise more detailed, see [12].
The gluing lemma (Lemma 8.3.6) is a special case of the gluing property of coherent sheaves on affinoid rigid analytic spaces, as specified in the
theorems of Kiehl and Tate [31, Theorems 8.2.1/1 and 9.4.2/3]. The factorization argument in the proof, however, is older still; it is the nonarchimedean
version of what is called a Birkhoff factorization over an archimedean field.
Similarly, Definition 8.4.3 corresponds to the definition of a locally free coherent sheaf on the corresponding rigid or Berkovich analytic space. Such a
sheaf is only guaranteed to be freely generated by global sections in the
case where K is spherically complete [128, Theorem 3.14]; in fact, a previous result of Lazard [156] implies that this property, even when restricted
to modules of rank 1, is in fact equivalent to the spherical completeness
of K .
We again thank Liang Xiao for his help with the proof of Lemma 8.6.1.
150
Exercises
(1) Prove Lemma 8.0.4. (Hint: for each point of I find an open neighborhood covered by one or two elements of S. Then reduce to the usual
compactness property of a bounded closed interval.)
(2) Verify the assertions of Remark 8.2.4. (Hint: a typical example where
1/n t n .)
piecewise affinity fails is
n=1 p
(3) Prove Proposition 8.2.5. (Hint: it may be easiest to first construct the limit
using a single [, ] and then show that this can also be done for the
other .)
(4) Prove that if K is discretely valued then, for any f K /t, t/0 , there
exists a polynomial P K [t] and a unit g K /t, t/
0 such that
f = Pg. (Hint: the Newton polygon has finite width in this case, so one
may argue as in Proposition 8.3.2.)
(5) Prove that the ring K {t} is not noetherian. (Hint: pick a sequence of points
in the open unit disc converging to the boundary, and consider the ideal of
functions vanishing on all but finitely many of these points.)
(6) Prove that if K is not trivially or discretely valued then K t0 is not noetherian. (Hint: proceed as in the previous exercise but choose the points so
that the Newton polygon of a function vanishing on all the points has finite
height.)
(7) Prove that if K is discretely valued then o K t = o K t K t is
noetherian. Otherwise it is not, because then o K itself is not noetherian.
(8) Prove that each maximal ideal of o K t is generated by m K together with
some P o K [t] whose reduction modulo m K is irreducible in K [t].
(9) State and prove an analogue of the gluing lemma (Lemma 8.3.6) for gluing
together finite free modules over K 1/t, tan and K t0 , using an isomorphism over the completion of K t0 K [t] K (t) under | |1 , to obtain a
module over K tan .
9
Radius and generic radius of convergence
152
9.2 Antidifferentiation
153
Definition 9.1.5. Consider the region |t| I , for any interval I [0, +);
this region could be a closed or open disc or a closed, open, or half-open annulus. By a finite differential module M on this region we will mean a coherent
locally free module in the sense of Definition 8.4.3, in which each module Mi
carries the structure of a differential module over K i /t, t/i and each isomorphism Mi+1 K i /t, t/i
= Mi is horizontal. By Proposition 9.1.2 we
see that these again form an abelian category. (It would probably be better to
call these coherent differential modules since they need not be generated by a
finite set of sections.)
9.2 Antidifferentiation
One initially surprising fact about p-adic analysis is that the relationship
between differentiation and boundary convergence is reversed from the archimedean case: whereas d carries K /t, t/ into itself, antidifferentiation
does not. Instead, one has merely the following.
154
R(M),
R(M K t/ ) =
R(M) > R(M).
Example 9.3.2. In general, if p > 0 then it is possible to have R(M) < ;
that is, there is no p-adic analogue of the fundamental theorem of ordinary
differential equations (as was noted in Example 0.4.1). For instance, consider the module M = K t/v with D(v) = v; for > p 1/( p1) we
have R(M) = p 1/( p1) because that is the radius of convergence of the
exponential series. (This is essentially Example 0.4.1 again.)
However, the local form of the fundamental theorem of ordinary differential
equations has the following analogue.
Proposition 9.3.3 ( p-adic Cauchy theorem). Let M be a finite differential
module over K t/, K t/an , or K t/0 . Then R(M) > 0.
Proof. By shrinking we reduce to the case over K t/. One can give
a direct proof of this, but instead we will deduce it from Dworks transfer
theorem (Theorem 9.6.1 below). We will give a direct proof of a slightly
stronger result later (Proposition 18.1.1); see also the notes.
Here are some easy consequences of the definition of the radius of convergence; note the parallels with properties of the spectral radius (Lemma 6.2.8).
Lemma 9.3.4. Let M, M1 , M2 be finite differential modules over K t/,
K t/an , or K t/0 .
155
156
|d| F =
1/n 1
1
p 1/( p1) 1
p = 0,
p > 0.
p = 0,
p 1/( p1)
p > 0,
157
We can translate some basic properties of the spectral radius (Lemma 6.2.8)
into properties of generic radii of convergence, leading to the following analogue of Lemma 9.3.4. Alternatively, one can first check Proposition 9.7.5 and
then simply invoke Lemma 9.3.4 itself around a generic point.
Lemma 9.4.6. Let V, V1 , V2 be nonzero finite differential modules over F .
(a) For an exact sequence 0 V1 V V2 0,
R(V ) = min{R(V1 ), R(V2 )}.
(b) We have
R(V ) = R(V ).
(c) We also have
R(V1 V2 ) min{R(V1 ), R(V2 )},
with equality when R(V1 ) = R(V2 ).
Definition 9.4.7. In some situations it is more natural to consider the intrinsic
generic radius of convergence, or for short the intrinsic radius, defined as
I R(V ) = 1 R(V ) =
|d|sp,F
(0, 1].
|D|sp,V
(The upper bound of 1 comes from Lemma 6.2.4.) To emphasize the difference,
we may refer to the unadorned generic radius of convergence defined earlier
as the extrinsic generic radius of convergence. (See Proposition 9.7.6 and the
notes for some reasons why the intrinsic radius deserves its name.)
Remark 9.4.8. For I an interval, M a differential module on the annulus
|t| I , and I , it is unambiguous to refer to the generic radius of convergence R(M F ) of M at radius . This is defined by first making a base
change to K /t, t/ for some closed subinterval [, ] of I containing .
Since the resulting module M F does not depend on the choice of [, ],
neither does its generic radius.
158
i
Using the embedding F1
E, we may expand f =
iZ f i t with
maxi {| f i |} = 1. The previous calculation then forces | f i | p 1 unless
i 1 (mod p).
By considering the reduction of f modulo p n and arguing similarly, we
/ Z,
find that | f i | p 1 unless i (mod p n ) for all n. But, since
this means that the image of f in K ((t)) cannot have any terms at all, a
contradiction.
159
(t)i
i=0
i!
D i (x)
(9.6.1.1)
converges under || M . To see this, pick > 0 such that 1 (|D|sp,M + ) < 1.
By Lemma 6.1.8 there exists c > 0 such that |D i (x)| M c(|D|sp,M +
)i |x| M for all i. The ith term of the sum defining y thus has norm at most
i i c(|D|sp,M + )i |x| M , which tends to 0 as i .
By differentiating the series expression as in Remark 5.8.4 we find that
D(y) = 0, so y is a horizontal section of M K t/. If we run this construction over a basis of M, we obtain horizontal sections of M K t/ whose
reductions modulo t also form a basis; these sections are thus K -linearly independent and so form a basis of M K t/ by Proposition 9.1.2. This proves
the claim.
Here is a simple example of how one may apply the transfer theorem; it is
conditional on one result, Theorem 11.3.2.
Example 9.6.2. Recall the hypergeometric differential equation
y +
c (a + b + 1)z)
ab
y
y = 0,
z(1 z
z(1 z)
a(a + 1) (a + i)b(b + 1) (b + i)
i=0
c(c + 1) (c + i)i!
zi .
(A, B K ),
(9.6.2.1)
and it converges for |z| < 1 (see the exercises for Chapter 0).
We now pass to the associated differential module M of rank 2 which is
defined over the ring K /z, zan for any > 0. From (9.6.2.1) we see that,
for any (0, 1), M K /z, z/ has a filtration in which one quotient is a
trivial module and the other has the form D(w) = z 1 w for some Z p Q.
From Example 9.5.2 and Lemma 9.4.6 we deduce that R(M F ) = 1 for
160
= |c|
max{, |z i |}
= |c|
|t z i |
= |P| .
Definition 9.7.3. Let L be a complete extension of K . For any t L with
|t | = , the substitution t t + (t t ) induces an isometric map
K [t] L(t t )/. However, if (and only if) t is a generic point then the
composition of this map with the reduction modulo t t is again an isometry,
161
by Remark 9.7.2. Hence in this case the map K [t] L(t t )/ extends
to an isometry F L(t t )/an .
Remark 9.7.4. In Berkovichs theory of nonarchimedean analytic geometry,
the geometric interpretation of the above construction is that the analytic space
corresponding to F is obtained from the closed disc of radius by removing
the open disc of radius centered around each point of K alg . As a result, it still
contains any open disc of radius that does not meet K alg .
Proposition 9.7.5. Let V be a finite differential module over F , and put V =
V F L(t t )/an . Then the generic radius of convergence of V is equal
to the radius of convergence of V .
Proof. Let G be the completion of L(t t ) for the -Gauss norm; then the
map F G is an isometry for any < . By Corollary 6.2.7,
|D|sp,V G = max{|d|sp,G , |D|sp,V } = max{1 , |D|sp,V }.
On one hand, this implies R(V ) R(V ) by the application of Theorem 9.6.1
to V L(t t )/ for a sequence of values of converging to .
On the other hand, pick any < R(V ); then V G is a trivial differential
module, so the spectral radius of D on it is 1 . We thus have
|D|sp,V 1 ,
so R(V ) . This yields R(V ) R(V ).
Here is an example illustrating both the use of the geometric interpretation
and a good transformation property of the intrinsic normalization.
Proposition 9.7.6. Let m be a nonzero integer not divisible by p, and let
f m : F F 1/m be the substitution t t m . Then, for any finite differential
module V over F , I R(V ) = I R( f m (V )).
Proof. Let m be a primitive mth root of unity in K alg . Then the claim follows
from the geometric interpretation plus the fact that
|t t mi | < c for some i {0, . . . , m 1} |t m tm | < c m
(c (0, 1)),
(9.7.6.1)
162
We will encounter this issue again when we perform Frobenius pullback and
pushforward in Chapter 10. One may view it as an argument in favor of a
coordinate-free perspective (Proposition 6.3.1).
Remark 9.7.8. A similar construction can be made for E. Let L be the
completion of o K ((t1 )) oK K for the 1-Gauss norm. Then the substitution
t t1 + (t t1 ) induces an isometry o K ((t)) o L t t1 for the 1-Gauss
norm that extends to an isometric embedding of E into the completion of
o L t t1 oL L for the 1-Gauss norm.
163
(9.9.2.1)
n1, pn
in which (n) is the Mbius function. That is, (n) equals (1)e if n is the
product of e 0 distinct primes, and it equals 0 otherwise.
Example 9.9.3 (Matsuda). Let h be a nonnegative integer, and suppose that K
contains a primitive p h+1 th root of unity . Let Mh be the differential module
of rank 1 on the whole t-line, the action of D on a generator v being given by
D(v) =
h !
" i
i
p 1 t p 1 v;
i=0
is isomorphic to the
(9.9.5.1)
164
Proof. The left-hand side of (9.10.2.1) is less than or equal to the right-hand
side because, by hypothesis, |u i d i /i!| F |u|i |d|iF 1 for all i. Conversely,
let j {0, . . . , n} be the minimal index for which |c j | = maxi {|ci |}; since
ui i j
j
ci u j
ci d (u ) =
i
i!
and F is of characteristic 0, we have
i
= |c j u j |
u i j
ci d (u ) < |c j u j |
i!
= 0
(i = j),
(i < j),
(i > j).
n
ci u i d i (u j )/i!| = |c j u j |, so the left-hand side of (9.10.2.1) is
Hence | i=0
greater than or equal to the right-hand side.
Corollary 9.10.3. Set notation as in Definition 9.10.1. Then, for any nonzero
finite differential module V over F, we have
|D|sp,V = |d|sp,F lim |Ds |1/s ,
s
Notes
165
Proof. This follows from Proposition 6.3.1, since (6.3.1.2) holds by virtue of
Proposition 9.10.2.
Notes
As noted in [80, Appendix III], which can be consulted for more information, the p-adic Cauchy theorem (Proposition 9.3.3) was originally proved
by Lutz [163]. (Note that this publication long predates Dworks work.) See
Proposition 18.1.1 for a related result.
The idea of restricting a p-adic differential module to a generic disc originated in the work of Dwork [75], although in retrospect the base change
involved is quite natural in Berkovichs framework of nonarchimedean analytic geometry. For instance, the Berkovich space associated with a closed
unit disc contains generic points corresponding to each disc of each possible radius less than or equal to 1; the residue field of the generic point
of the disc |t| is precisely F . (This point of view was adopted
by Baldassarri and Di Vizio in [11] and by Baldassarri in [10].) Our
definition of the generic radius of convergence is taken from Christol and
Dwork [49].
The intrinsic generic radius of convergence (the original terminology) was
introduced in [140], where it is called the toric normalization in light of
Proposition 9.7.6.
Previously the subsidiary radii (the original terminology) have not been
studied much; the one reference we found is the work of Young [222]. We will
give Youngs interpretation of the subsidiary radii as the radii of convergence
of certain horizontal sections, in a refined form, as Theorem 11.9.2.
Our description of the ArtinHasse exponential follows Robert [191, 7.2].
Matsudas example, from [168], is an explicit instance of a general construction introduced by Robba [190]. In turn, Matsudas example can be greatly
generalized as shown by Pulita [185], building on work of Chinellato. One
obtains an analogous construction from any LubinTate group over Q p ; Matsudas example arises from the multiplicative group. (The introduction to [185]
provides a detailed historical discussion.)
A slightly different notion of the generic radius of convergence was introduced by Baldassarri and Di Vizio [11] and studied further by Baldassarri [10].
The difference between that definition and ours appears only in the case where
our definition gets truncated. For instance, for a differential module on a closed
disc of radius , our generic radius of convergence at radius (0, ) is only
allowed to take values up to . However, Baldassarri and Di Vizio allow values
166
Exercises
(1) Prove Lemma 9.2.1. Then exhibit an example showing that the cokernel of d/dt on K /t, t/ is not spanned over K by t 1 . That is,
antidifferentiation with respect to t is not well defined on K /t, t/.
(2) Prove Example 9.3.5. (Hint: use the exponential series to construct a
horizontal section.)
(3) Prove Example 9.5.2. (Hint: consider the cases Z p , o K Z p ,
and
/ o K separately.)
(4) In Example 9.5.2 give an explicit formula for I R(V ) in terms of and
the minimum distance from to an integer.
(5) Prove (9.7.6.1). (Hint: you may find it easier to start with the cases where
m > 0 and where m = 1 and then deduce the general result from them.)
(6) The following considerations illustrate the pitfalls of using td/dt instead
of d/dt in the p-adic setting.
(a) Verify that |td/dt|sp,F = |t| |d/dt|sp,F .
(b) Show that the inequality (6.3.1.1) of Proposition 6.3.1 can be strict
for F = F and d = td/dt. (Hint: use Example 9.5.2.)
(7) With notation as in Proposition 9.7.6, show that all the intrinsic subsidiary
radii of V match those of f m (V ), not just the generic radii.
(8) Here is an off-centered analogue of Proposition 9.7.6 suggested by
Liang Xiao (compare with Theorem 10.8.2 below). Let m be a nonzero
integer not divisible by p. Given (0, 1], let f m : F F be the
Exercises
167
h+1 /( p1)
from
10
Frobenius pullback and pushforward
In this chapter, we introduce Dworks technique of Frobenius descent to analyze the generic radius of convergence and subsidiary radii of a differential
module, in the range where Newton polygons do not apply. In one direction
we introduce a somewhat refined form of the Frobenius antecedents introduced
by Christol and Dwork. These fail to apply in an important boundary case; we
remedy this by introducing the new notion of Frobenius descendants, which
covers the boundary case.
Using these results, we are able to improve a number of results from
Chapter 6 in the special case of differential modules over F . For instance
we get a full decomposition by spectral radius, extending the visible decomposition theorem (Theorem 6.6.1) and the refined visible decomposition theorem
(Theorem 6.8.2). We will use these results again to study the variation of subsidiary radii, and decomposition by subsidiary radii, in the remainder of this
part.
Notation 10.0.1. Throughout this chapter we retain Hypothesis 9.0.1. We also
continue to use F to denote the completion of K (t) for the -Gauss norm
viewed as a differential field with respect to d = d/dt, unless otherwise
specified.
Notation 10.0.2. Throughout this chapter we also assume p > 0 unless
otherwise specified.
169
( (0, 1)).
}| | =
p
p | p |
p 1/( p1) ,
p 1 | p |
p 1/( p1) .
170
min{1/ p , p}
are strictly increasing functions from [0, 1] to [0, 1] that are inverse to each
other.
Proof. There is no harm in assuming that p K for both parts. For (a), factor
p1
t p p as (t ) m=1 (t pm ), and write
t pm = (t ) + (1 pm ).
If |t | p 1/( p1) || then t is the dominant term; otherwise (1 pm )
dominates. This gives the claimed bounds.
For (b), consider the Newton polygon of
p1
p i
p
p
t c =
(t ) pi c,
i
i=0
171
d(t p )
d = pt p1 d .
dt
m p
t v.
p
(The generator v is taken to behave as t m = (t p )m/ p .) From the Newton polygon associated to v, we may read off I R(Wm ) = p p/( p1) for m = 0 since
this is within the range of applicability of Theorem 6.5.3. Note that the inequality of Lemma 10.3.2 is strict in this case, since Wm is trivial and so has
intrinsic radius 1.
Definition 10.3.4. For V a differential module over F , define the Frobenius
descendant of V as the module V obtained from V by restriction along
F F . The module V is viewed as a differential module over F with
differential D = p 1 t 1 p D. Note that this operation commutes with duals,
but not with tensor products because of a rank mismatch: the rank of V is p
times that of V . See, however, Lemma 10.3.6(f) below.
172
Remark 10.3.5. The definition of the Frobenius descendant extends to differential modules over K /t, t/ for > 0 but not for = 0, since we must
divide by a power of t to express D in terms of D. The underlying problem
is that (in geometric terms) the map ramifies at the point t = 0. We will see
one way to deal with this problem in the discussion of off-centered Frobenius
descendants (Section 10.8).
The operation enjoys a number of properties which are useful and
reasonably easy to verify.
Lemma 10.3.6
(a) For V a differential module over F , there are canonical isomorphisms
m : ( V ) Wm
= V
(m = 0, . . . , p 1).
p1
%
(V Wm ).
m=0
(i = 0, 1).
173
( pm t t)i
i=0
i!
D i (x)
(x V, m Z/ pZ).
p1
1 i j i
p p (v)
p
( j = 0, . . . , p 1)
i=0
are F -linear projectors onto the generalized eigenspaces for the characters of
Z/ pZ. Note also that these eigenspaces are permuted by multiplication by t, so
they must all have the same dimension. We may conclude that the fixed space
V is an F -subspace of V and the natural map V V is an isomorphism.
(This calculation amounts to a simple instance of the HilbertNoether theorem,
i.e., of Galois descent.)
By applying the Z/ pZ-action to a basis of horizontal sections of V in the
generic disc |t t | for some ( p 1/( p1) , I R(V )) and then invoking
Corollary 10.2.3(b), we may construct horizontal sections of V in the generic
p
disc |t p t | p p . Hence I R(V ) I R(V ) p > p p/( p1) .
To check uniqueness, suppose that V
= V with I R(V ),
= V
p/(
p1)
. By Lemma 10.3.6(c) we have
I R(V ) > p
p1
p1
V
= m=0 (V Wm )
= m=0 (V Wm ).
174
For the last assertion, note that the proof of existence gives I R(V )
I R(V ) p , whereas Lemma 10.3.2 gives the reverse inequality.
Corollary 10.4.3. Let V be a differential module over F such that I R(V ) >
p p/( p1) . Then V is the Frobenius antecedent of V , so I R(V ) =
I R( V ) p .
Proof. By Lemma 10.3.2, I R( V ) I R(V )1/ p > p 1/( p1) , so V
has a unique Frobenius antecedent by Theorem 10.4.2. Since I R(V ) >
p p/( p1) , V is that antecedent.
The construction of Frobenius antecedents carries over to discs and annuli
as follows.
Theorem 10.4.4. Let M be a finite differential module over K /t, t/
(we may allow = 0) such that I R(M F ) > p 1/( p1) for
[, ] (or, equivalently, for = and = ). Then there exists a
unique differential module M over K p /t p , t p / p such that M = M
K /t, t/ and I R(M F ) > p p/( p1) for [, ]; this M also
satisfies I R(M F ) = I R(M F ) p for [, ]. (This theorem
also holds with K /t, t/, K p /t p , t p / p replaced with K /t, t/an ,
K p /t p , t p / p an , respectively, by a similar proof.)
Proof. Define the projectors P j as in the proof of Theorem 10.4.2, and let
M be the image of P0 . By arguing as in Theorem 10.4.2 we may show that
M K /t, t/[t 1 ]
= M K /t, t/[t 1 ]. However, the quotient of
M by the image of M K /t, t/ is a differential module and so cannot
have any t-torsion, by Proposition 9.1.2. Hence M K /t, t/
= M, and
we may continue as in Theorem 10.4.2.
{ p 1 si ( p times)}
si p 1/( p1)
175
(note that in the upper expression the second element occurs p 1 times altogether and in the lower expression the first element occurs p times altogether).
In particular,
I R( V ) = min{ p 1 I R(V ), p p/( p1) }.
Proof. It suffices to consider V as irreducible. First suppose that I R(V ) >
p 1/( p1) . Let V be the Frobenius antecedent of V (as per Theorem 10.4.2);
p1
note that V is also irreducible. By Lemma 10.3.6(c), V
= m=0 (V Wm ).
Since each Wm has rank 1, V Wm is also irreducible. Since I R(V ) =
I R(V ) p by Theorem 10.4.2 and I R(V Wm ) = p p/( p1) for m = 0 by
Lemma 9.4.6(c), we have the claim.
Next suppose that I R(V ) p 1/( p1) . We show first that
I R( V ) p 1 I R(V ) = max{I R(V ) p , p 1 I R(V )}.
For t a generic point of radius and (0, p 1/( p1) ), the module V
p
L(t p t )/( p 1 p ) splits as the direct sum of V L(t pm t )/()
over m = 0, . . . , p 1. If < I R(V ), by applying Corollary 10.2.3(c) we
obtain I R( V ) p 1 .
Next, let W be any irreducible subquotient of V ; then I R(W )
I R( V ), so Lemma 10.3.2 gives on the one hand
I R( W ) min{I R(W )1/ p , p I R(W )} min{I R( V )1/ p , p I R( V )} I R(V ).
(10.5.1.1)
176
(b) For i such that either i = n or si+1 p 1/( p1) , the product of the
i p + ( p 1)(n i) smallest intrinsic subsidiary radii of V is equal
p
p
to p in s1 si .
In particular, the product of the intrinsic subsidiary radii of V is
p
p
p np s1 sn .
Note that both conditions apply when si = p 1/( p1) ; this will be important
later (see Remark 11.6.2).
As promised, we will now obtain a refined version of Lemma 10.3.2.
Corollary 10.5.4. Let V be a differential module over F such that I R(V ) =
p p/( p1) . Then I R( V ) = min{I R(V )1/ p , p I R(V )}. (This fails when
I R(V ) = p p/( p1) , e.g., for V = Wm .)
Proof. In the case I R(V ) > p p/( p1) , this holds by Corollary 10.4.3. Othp1
erwise, by Lemma 10.3.6(c), V
= m=0 (V Wm ) and I R(V Wm ) =
I R(V ) since I R(V ) < I R(Wm ). Hence, by Theorem 10.5.1,
I R(V ) = I R( V ) = min{ p 1 I R( V ), p p/( p1) }.
We get a contradiction if the right-hand side equals p p/( p1) , so it must be
the case that I R(V ) = p 1 I R( V ) p p/( p1) , proving the claim.
177
Suppose that p > 0 and I R(V1 ) = p 1/( p1) . Let V2 be the Frobenius
antecedent of V2 ; it also is irreducible, and I R(V2 ) = I R(V2 ) p > p p/( p1)
by Theorem 10.4.2. By Theorem 10.5.1, each irreducible subquotient W of
V1 satisfies I R(W ) = p p/( p1) ; hence H 1 (W V2 ) = 0 by the previous
case, so H 1 ( V1 V2 ) = 0 by the snake lemma.
By Lemma 10.3.6(a), (c),
p1
V1 V2
= m=0 ( V1 Wm V2 )
= ( V1 V ) p .
2
178
179
180
is isomorphic to its own twist by any Wm F E . Hence on the one hand its
intrinsic subsidiary radii must comprise a multiset of p rank(X )2 elements in
which exactly rank(X )2 of the elements are greater than p p/( p1) . On the
other hand, since I R(X X ) I R(X ) = p 1/( p1) by Lemma 9.4.6(c),
Theorem 10.5.1 implies that the intrinsic subsidiary radii of (X X )
include at most rank(X )2 elements greater than p p/( p1) . In fact, equality
occurs if and only if all the intrinsic subsidiary radii of X X are greater
than p 1/( p1) . Hence X is refined.
Finally, we handle the case p 1/( p1) < I R(V ) < 1 by induction on the
h
smallest integer h such that I R(V ) p p /( p1) . The case h = 0 is handled by the arguments above. If h > 0 then, by Theorem 10.4.2, V has a
h+1 /( p1)
.
Frobenius antecedent W for which I R(W ) = I R(V )1/ p p p
By the induction hypothesis, W admits a decomposition into nonequivalent
refined submodules; pulling back by a Frobenius morphism then gives the
desired decomposition of V , by Corollary 10.5.4.
h /( p1)
s1m |K | p Z .
Proof. For h = 0 we can read this off from the case of a Newton polygon
(i.e., we can invoke Theorem 6.5.3). To reduce from h to h 1, if I R(V ) >
p 1/( p1) then replace V by its Frobenius antecedent (Theorem 10.4.2); if
I R(V ) = p 1/( p1) , apply and invoke Corollary 10.5.3.
Here is an example to show that the exponent p h in the conclusion s1m
h
|K | p Z of Theorem 10.7.1 is not spurious.
Example 10.7.2. Suppose that K satisfies | | = p 1/( p1) . Pick K
and 0 < such that, for [, ],
p 1/( p1) < || p < p p/( p1) .
181
Let M be the differential module over K /t, t/ that is generated by v satisfying D(v) = p t p1 v. Then M
= M , where M is the differential
module over K p /t p , t p / p with generator w and D (w) = (t p )2 w.
We deduce that
|D | M F = p 1/( p1) || 2 p > p .
Hence we have
I R(M F ) = ||1 p ,
I R(M F ) = ||1/ p ,
where the first equality follows by Theorem 6.5.3 and the second follows from
the first by Corollary 10.4.3.
Remark 10.7.3. Another way to understand Example 10.7.2 is by means of
the Dwork exponential series; see Definition 17.1.3.
Question 10.7.4. What is the correct extension of Theorem 10.7.1 for the
remaining subsidiary radii? (This should not be difficult.)
d
1
=
d.
du
du/dt
182
The main point here is that du/dt K t . Consequently, we can extend
both and not just to annuli but also discs. This is needed to establish the
monotonicity property for subsidiary radii (Theorem 11.3.2(d)).
We may apply Lemma 10.2.2 with replaced by + 1, keeping in mind
that | + 1| = 1 for || < 1. This has the net effect that everything that holds
for also holds for , except that the intrinsic generic radius of convergence
must be replaced by the extrinsic one. Rather than rederive everything, we
simply state the analogues of Theorems 10.4.2 and 10.5.1 and leave the proofs
as exercises.
Theorem 10.8.2. Let (V, D) be a finite differential module over F such that
R(V ) > p 1/( p1) . Then there exists a unique differential module (V , D )
over F equipped with an isomorphism V
= V such that R(V ) >
p/(
p1)
. For this V , one has in fact R(V ) = R(V ) p .
p
Theorem 10.8.3. Let V be a finite differential module over F with extrinsic subsidiary radii s1 , . . . , sn . Then the extrinsic subsidiary radii of V
comprise the multiset
p
n
$
{si , p p/( p1) ( p 1 times)} si > p 1/( p1) ,
i=1
{ p 1 si ( p times)}
si p 1/( p1) .
Remark 10.8.4. Note that one cannot expect Theorem 10.8.3 to hold for <
p 1/( p1) , as in that case p p/( p1) is too large to appear as a subsidiary radius
of V .
Notes
Lemma 10.2.2 is taken from [128, 5.3] with some typos corrected.
The Frobenius antecedent theorem of Christol and Dwork [49, Thorme 5.4]
is slightly weaker than the one given here: it only applies for I R(V ) > p 1/ p .
The discrepancy is due to the introduction of cyclic vectors, which create some
regular singularities which can only eliminated under the stronger hypothesis. Much closer to the statement of Theorem 10.4.2 is [128, Theorem 6.13];
however, uniqueness is only asserted there when I R(V ) I R(V ) p .
The concept of the Frobenius descendant, and the results deduced using
it, are original. This includes Theorem 10.5.1 and its off-center analogue
(Theorem 10.8.3), the strong decomposition theorem (Theorem 10.6.2), and
the refined strong decomposition theorem (Theorem 10.6.7). The latter was
suggested by Liang Xiao; see the notes for Chapter 6 for the motivation
for this.
Exercises
183
Exercises
(1) Prove Lemma 10.3.6.
(2) Prove that, for any finite differential module V over F with I R(V ) >
p p/( p1) , H 0 (V ) = H 0 ( V ). (The example V = Wm shows that the
bound on I R(V ) cannot be relaxed.)
(3) Derive Theorem 10.8.2 by imitating the proof of Theorem 10.4.2.
(4) Derive Theorem 10.8.3 by imitating the proof of Theorem 10.5.1.
11
Variation of generic and subsidiary radii
In this chapter, we apply the tools developed in the preceding chapters to study
the variation of the generic radius of convergence, and of the subsidiary radii,
associated with a differential module on a disc or annulus. We have already
seen some instances where this study is needed to deduce consequences about
the convergence of solutions of p-adic differential equations (Examples 9.6.2
and 9.9.3).
The statements we will formulate are modeled on statements governing the
variation of the Newton polygon of a polynomial over a ring of power series as
we vary the choice of Gauss norm on the power series ring. The guiding principle is that, in the visible spectrum, one should be able to relate the variation of
subsidiary radii to the variation of Newton polygons via matrices of action of
the derivation on suitable bases. This includes the relationship between subsidiary radii and Newton polygons for cyclic vectors (Theorem 6.5.3), but
trying to use that approach directly creates no end of difficulties because cyclic
vectors only exist in general for differential modules over fields. We will implement the guiding principle in a somewhat more robust manner than before,
using the discussion of matrix inequalities in Chapter 6.
To this principle we must add the techniques of descent along a Frobenius
morphism introduced in Chapter 10, including the off-centered variant. This
allows us to overcome the limitation to the visible spectrum.
As corollaries of this analysis, we deduce some facts about the true radius of
convergence of a differential module on a disc. We also establish a geometric
interpretation of subsidiary radii in terms of the convergence of local horizontal
sections around a generic point, extending a result of Young. We will build
further on this work when we discuss decomposition theorems in Chapter 12.
Throughout this chapter we retain Notation 10.0.1 but we do not assume that
p > 0. We will continue to use the convention
184
p = 0,
p 1/( p1)
p > 0.
185
t t + .
186
n
Pi T i with Pi K /t, t/an . By Proposition 8.5.2
Proof. Write P = i=0
the function vr (Pi ) is continuous and concave in r and piecewise affine with
slopes in Z. Moreover, even if = 0 there are only finitely many different
slopes.
For s R and r [ log , log ], put
vs,r (P) = min{vr (Pi ) + is};
i
187
that is, vs,r (P) is the y-intercept of the supporting line of NPr (P) of slope
s. Since vs,r (P) is the minimum of finitely many functions of the pair (r, s),
each of which is continuous, piecewise affine with only finitely many different
slopes, and concave, it also enjoys these properties. (Compare Remark 2.1.7.)
Note that Fi (P, r ) is the difference between the y-coordinates of the points
of NPr (P) having x-coordinates i n and n. That is,
Fi (P, r ) = sup{vs,r (P) (n i)s} vr (Pn ).
(11.2.1.1)
(11.2.1.2)
proving (b).
Assume that < 1 < . Then Proposition 11.1.2 implies that
s (Pi )
(i = 0, . . . , n).
s (Pi ) =
alg
If i = n or f i (P, 0) < f i+1 (P, 0) then the above equation imply plus
(11.2.1.2) that the desired inequality is in fact an equality. Otherwise, let j, k
be the least and greatest indices for which f j (P, 0) = f i (P, 0) = f k (P, 0);
then j i < k, and the convexity of the Newton polygon implies that
Fi (P, r )
k i
i j +1
F j1 (P, r ) +
Fk (P, r ),
k j +1
k j +1
(11.2.1.3)
with equality for r = 0. From this plus piecewise affinity we deduce that
k i
i j +1
s (Pn j+1 ) +
s (Pnk ),
k j +1
k j +1
k i
i j +1
s,i (P)
s (Pn j+1 ) +
s (Pnk )
k j +1
k j +1
s,i (P)
alg
( K ),
yielding (c).
Assume that = 0 and that P is monic. Then each vr (Pi ) is a nondecreasing
function of r , as is each vs,r (P). Since vr (Pn ) = 0, Fi (P, r ) is nondecreasing
by (11.2.1.1), proving (d).
188
To prove (e), one can reduce to working locally around r = 0 and then
deduce the claim from (c) and (d) (because the latter implies that s,i (P) 0
for = 0). However, one can also prove (e) directly as follows. Assume that
P is monic, so that Pn = 1 and (11.2.1.1) reduces to
Fi (P, r ) = sup{vs,r (P) (n i)s}.
s
It is not immediately clear from this that Fi (P, r ) is concave, since we are taking the supremum rather than the infimum of a collection of concave functions.
To get around this, pick r1 , r2 [ log , log ] and put r3 = ur1 +(1u)r2
for some u [0, 1]. For j {1, 2}, choose an s j achieving the supremum in
(11.2.1.1) for r = r j . Put s3 = us1 + (1 u)s2 ; then, using the concavity of
vs,r (P) in both s and r , we have
Fi (P, r3 ) vs3 ,r3 (P) (n i)s3
u(vs1 ,r1 (P) (n i)s1 ) + (1 u)(vs2 ,r2 (P) (n i)s2 )
= u Fi (P, r1 ) + (1 u)Fi (P, r2 ).
This yields concavity for Fi (P, r ), proving (e).
Remark 11.2.2. A more geometric interpretation of the previous proof can
j
be given by writing each Pi as
j Pi, j t and considering the lower convex
hull of the set of points {(i, j, v(Pi, j ))} in R3 . We leave elaboration of this
point to the reader.
Remark 11.2.3. If i = n or f i (P, r0 ) <
implies that
in other words, the height of the relevant point is determined by the supporting
lines of slope less than or equal to ar +b, rather than by all slopes. Note that, in
the notation of the proof of Theorem 11.2.1(e), the inequality s j ar j + b for
j = 1, 2 implies the same for j = 3, so the proof of concavity goes through.
For bounded elements, we obtain a similar but slightly weaker conclusion.
189
190
(c) (Subharmonicity) Suppose that < 1 < and that f i (M, 0) > 0. For
i = 1, . . . , n, let s,i (M) and s0,i (M) be the left-hand and right-hand
alg
slopes of Fi (M, r ) at r = 0. For ( K ) , let s,i (M) be the
191
Proof of Theorem 11.3.2(c), (d), (e) for i = 1. Choose a basis of M, and let
Ds be the matrix of action of D s on this basis. Then recall from Lemma 6.2.5
that
R1 (M, ) = min{, lim inf |Ds |1/s
}.
1/s
For each s, the function r log |Ds |er is convex in r by Proposition 8.2.3(b). This implies the convexity of
1/s
f 1 (M, r ) = max r, log + lim sup( log |Ds |er ) .
s
192
monic twisted polynomial P over F. We may then apply Corollary 6.5.4 and
Theorem 11.2.1 to deduce (a), (b), (c) of Theorem 11.3.2.
To deduce (d), we may work in a right-hand neighborhood of a single value
r0 of r . There is no harm in enlarging K (by Proposition 10.6.6), so we may
assume that v(K ) = R. Then we may reduce to the case r0 = 0 by replacing t
by t for some K .
Since v(K ) = R, we may pick c1 , . . . , cn K such that
log |c j | = min{ log f j (M, 0), 0}
( j = 1, . . . , n).
(i = 1, . . . , n).
( j = 0, . . . , n 1).
193
f i (M, r ) r +
1
p1
log p.
194
If f i (M, 0) > (1/( p 1)) log p, apply (11.6.1.1) and the induction hypothesis to write piecewise
Fi (M, r ) = (1/ p)(g1 ( pr ) + + g pi ( pr ) pi log p ( p 1)i pr )
= (1/ p)(m( pr ) + )
= mr + (1/ p)
for some m Z. (Note that is not guaranteed to be in pv(K ); this explains
Example 10.7.2.) Otherwise, we may apply (11.6.1.2) to write piecewise
Fi (M, r) = (1/ p)(g1 ( pr ) + + g pi+( p1)(ni) ( pr ) pn log p ( p 1)npr)
= (1/ p)(m( pr ) + )
= mr + (1/ p)
for some m Z.
Remark 11.6.2. In the proof of Lemma 11.6.1, note the importance of the
fact that the domains of applicability of (11.6.1.1) and (11.6.1.2) overlap: if
f i (M, 0) = (1/( p 1)) log p then (11.6.1.1) is valid for r = 0 but possibly
not for nearby r values.
The case f i (M, r0 ) = r0 remains inaccessible even using descent along a
Frobenius morphism, so we make an ad hoc argument.
Lemma 11.6.3. For any i {1, . . . , n} and any r0 such that f i (M, r0 ) = r0 ,
Theorem 11.3.2 holds in a neighborhood of r0 .
Proof. As in the proof of Lemma 11.5.1, it suffices to consider the case r0 =
0. We first check continuity. For this, note that the proofs of Lemma 11.5.1
and 11.6.1 show that, for any c > 0, the function max{ f i (M, r ), r + c} is
continuous at r = 0. Consequently, for any > 0, we can find 0 < < /2
such that
| max{ f i (M, r ), r + /4}| < /2
(|r | < ).
For such r , < < r fi (M, r ) < ; this yields continuity.
Next we check convexity. Using Remark 11.2.4, the proofs of Lemmas 11.5.1 and 11.6.1 show that, for any c > 0, the function
i
j=1 max{ f j (M, r ), r + c} is convex. (The key point is that the domain over
which this holds does not depend on c.) Since this function tends to Fi (M, r )
as c tends to 0, we may deduce (e).
We now check piecewise affinity by induction on i. Given that
f 1 (M, r ), . . . , f i1 (M, r ) are affine in a one-sided neighborhood of r = 0,
say [, 0], and given that f i (M, 0) = 0, it suffices to check the linearity
11.7 Monotonicity
195
11.7 Monotonicity
To complete the proof of Theorem 11.3.2 we must prove (d) for p > 0 without
the restriction f i (M, r0 ) > r0 log . The reason why we do not have (d) as
part of Lemma 11.6.1 is that passing from M to M introduces a singularity
196
( = 0, m).
s, p ( M) = 0,
s0, p ( M) = p 1,
s, p ( M) = 1
s, p ( M) = 0
( F
p ),
(
/ F p ),
197
and in turn
s,1 (M) = p + 1,
s0,1 (M) = 0,
s,1 (M) = 1
s,1 (M) = 0
( F
p ),
(
/ F p ).
198
Notes
199
(i = 1, . . . , m; r [ log , +)).
Notes
The harmonicity property of functions on annuli (Proposition 11.1.2) may
be best viewed as a theory of subharmonic functions on one-dimensional
Berkovich analytic spaces. Such a theory has been developed by Thuillier
[205] with a view towards applications in Arakelov theory. A related development is the work of Favre and Jonsson concerning potential theory on
200
the valuative tree, with applications to the theory of plurisubharmonic singularities on complex surfaces. See [87] and [88] and also a more recent
paper by Boucksom, Favre, and Jonsson [33] giving some higher-dimensional
generalizations.
For the function f 1 (M, r ) = F1 (M, r ) representing the generic radius of
convergence, Christol and Dwork established convexity [49, Proposition 2.4]
(using essentially the same short proof as that given here) and continuity
at endpoints [49, Thorme 2.5] (see also [80, Appendix I]) in the case of
a module over a full annulus. In the case of analytic elements, continuity
and piecewise affinity were conjectured by Dwork and proved by Pons [181,
Thorme 2.2]. The analogous results for the higher Fi (M, r ) are original.
When restricted to intrinsic subsidiary radii less than , Theorem 11.9.2 is
a result of Young [222, Theorem 3.1]. Youngs proof is an explicit calculation
using twisted polynomials and cyclic vectors.
As suggested earlier (see the notes for Chapter 9), Youngs definition of
the radii of optimal convergence suggests an analogue in the framework of
Baldassarri and Di Vizio. For instance, given a differential module on a closed
disc of radius , the radii of optimal convergence at a generic point of radius
(0, ) should be defined in terms of an optimal basis of local horizontal
sections which are allowed to extend all the way across the disc of radius
rather than just across the disc of radius . However, we do not know how to
prove the appropriate analogue of Theorem 11.3.2 for these quantities, since
working over F loses all information beyond radius . Accomplishing this is
an important open problem.1
Exercises
(1) Give an example to show that, in Theorem 11.2.1, f 2 need not be concave
(even though f 1 and f 1 + f 2 are concave).
(2) Verify Example 11.7.2.
(3) Give another proof of (9.9.5.1) as follows. First find one value 0 for
which Theorem 6.5.3 implies that I R(M F0 ) = 0b . Then use
Theorem 11.3.2 to show that (9.9.5.1) holds for [1, 0 ].
(4) State, then answer affirmatively (using Theorem 6.5.3), the analogue of
Question 11.8.3 for p = 0.
12
Decomposition by subsidiary radii
In the previous chapter we established a number of important variational properties of the subsidiary radii of a differential module over a disc or annulus.
In this chapter, we continue the analysis by showing that, under suitable conditions, one can separate a differential module into components of different
subsidiary radii. That is, we can globalize the decompositions by spectral
radius provided by the strong decomposition theorem, if a certain numerical
criterion is satisfied.
As in the previous chapter, our discussion begins with some observations about power series, in this case identifying criteria for invertibility.
We use these in order to set up a Hensel lifting argument to give the desired
decompositions; again we must start with the visible (see Definition 6.5.1) case
and then extend using Frobenius descendants. We end up with a number of distinct statements, covering open and closed discs and annuli as well as analytic
elements.
As a corollary of these results we recover an important theorem of Christol and Mebkhout. That result gives a decomposition by subsidiary radii on
an annulus in a neighborhood of a boundary radius at which the module is
solvable, that is, all the intrinsic subsidiary radii tend to 1. (It is not necessary to assume that the annulus is closed at this boundary.) One may view our
results as a collection of quantitative refinements of the ChristolMebkhout
theorem.
Note that nothing is this chapter is useful if the intrinsic subsidiary radii are
everywhere equal to 1. We will tackle this case in Chapter 13.
Throughout this chapter, besides Notation 10.0.1 we also retain Notation 11.3.1.
201
202
( K )
203
( K )
204
205
For annuli, we will use the related fact that, for any [, ], inside the
completion of L(t) for the -Gauss norm we have
F L/t, t/ = K /t, t/
(exercise).
We also need a lemma about polynomials over K t.
206
M1 M2 of modules (but not differential modules) with Mi = ker(Q i (N1 )).
We now invoke some of the approximation lemmas, as follows. Equip M with
the supremum norm compatible with | |1 defined by B1 , and then equip
M1 , M2 with the induced quotient norms; these are both supremum-equivalent
by Lemma 1.3.5. By Lemma 1.3.7, for any c > 1 we can approximate these
norms to within a factor of at most c by supremum norms defined by bases
of M1 K t F1 and M2 K t F1 . By Lemma 8.6.1 the latter norms are also
defined by bases B1,1 , B1,2 of M1 , M2 .
Let U be the change-of-basis matrix from B1 to B1,1 B1,2 , so that U 1 N1 U
is a block diagonal matrix and |U |1 , |U 1 |1 c. Then the matrix of action of
D on B1,1 B1,2 is U 1 N1 U + U 1 d(U ). If we write this in block form as
A B
C D
then, by taking c sufficiently close to 1, we may force the following conditions
to hold.
(a) The matrix A is invertible and |A1 |1 max{|d|1 , |B|1 , |C|1 , |D|1 } < 1.
(b) The Newton slopes of A under | |1 account for the first i subsidiary
radii of M K t F1 .
As in the proof of Lemma 6.7.3 we may now use Lemma 6.7.1 to produce a
submodule of M accounting for the last n i subsidiary radii of M K t F1 .
By repeating this argument for M , we obtain a submodule of M accounting
for the first i subsidiary radii of M K t F1 . By Remark 12.2.3 this suffices to
prove the desired result.
To prove Theorem 12.2.2 in general, we must use Frobenius descendants
again.
Proof of Theorem 12.2.2. The claim holds by Lemma 12.2.7 if p = 0, so we
may assume p > 0 throughout the proof. It suffices to prove that, for = 1,
Theorem 12.2.2 holds if f i (M, 0) > 1/( p j ( p 1)) log p for each nonnegative
integer j; we again proceed by induction on j, the base case j = 0 being
provided by Lemma 12.2.7.
Suppose that f i (M, 0) > 1/( p j ( p 1)) log p for some j > 0. Let M1 M2
be the decomposition of M separating the subsidiary radii less than or equal
to e p fi (M,0) into M1 (which exists by the induction hypothesis). This might
not be induced by a decomposition of M1 , because some factors of subsidiary
radius p p/( p1) that are needed in M2 are instead grouped into M1 . To correct
this, consider instead the decomposition
(0 (M1 ) p1 (M1 )) (0 (M2 ) + + p1 (M2 )),
207
where m is defined as in Lemma 10.3.6(a). By Lemma 10.3.6(b) this decomposition is induced by a decomposition of M. By Theorem 10.5.1, it has the
desired effect.
Remark 12.2.8. As in Lemma 11.5.1 one can prove Lemma 12.2.7 using
Lemma 8.6.2 in place of Lemma 8.6.1, at the expense of some extra complications. First, one must replace K by a spherical completion with value group R
and algebraically closed residue field (by invoking Theorem 1.5.3). One then
obtains the desired decomposition only over K t0 . By gluing with the original decomposition (defined over F1 ), we recover a decomposition over K tan .
To remove poles in the disc |t | < 1 we apply the same argument with M
replaced by T (M) (in the sense of Definition 11.1.1).
208
f i (M, 0) > log , as we may reduce the general case to this one as in the
proof of Theorem 12.2.2.
Again set the notation as in that part of the proof of Lemma 11.5.1 dealing with Theorem 11.3.2(d), but take = 1; as in the part dealing with
Theorem 11.3.2(e), we can find a basis B1 of M K /t,t K /t, t for some
[, 1) defining the same supremum norm as B0 . Let N1 be the matrix of
action of D on B1 . By conditions (a) and (b) of the lemma plus Lemma 12.1.4,
the coefficient of T ni in the characteristic polynomial of N1 is a unit in
K /t, t for some [, 1). We may thus continue as in the proof of
Lemma 12.2.7.
To prove Theorem 12.3.1 from Lemma 12.3.2, we proceed as in the proof
of Lemma 12.1.5. However, we must first give an alternate formulation of the
hypotheses of the theorem.
Remark 12.3.3. In the statement of Theorem 12.3.1, given condition (a) we
may reformulate conditions (b) and (c) together as the following condition.
(a) We have disci (M, r ) = 0 for log r log .
To see this, note that if < 1 < then condition (a) implies that equality
holds in Theorem 11.3.2(c), so we have s0,i (M) s,i (M) = disci (M, 0).
Consequently Fi (M, r ) is affine in a neighborhood of 0 if and only if
disci (M, 0) = 0. By rescaling, we obtain the desired equivalence. (Compare
the proof of Lemma 12.1.5.)
Proof of Theorem 12.3.1. The case = proceeds exactly as in
Theorem 12.2.2, so we will assume < hereafter. By Remark 12.3.3,
if M satisfies the given hypothesis then so does M K /t, t/ for each
closed subinterval [ , ] [, ]. For each (, ], Lemma 12.3.2 implies
that, for some [, ), M K /t, t/ admits a decomposition with
the desired property. Similarly, for each [, ) and for some (, ],
M K /t, t/ admits a decomposition with the desired property.
By the compactness of [, ] (Lemma 8.0.4), we can cover [, ] with
finitely many intervals [i , i ] for which M K i /t, t/i admits a decomposition with the desired property. Since the decomposition of M K i /t, t/i
is uniquely determined by the induced decomposition over F for any single
[i , i ], these decompositions agree on overlaps of the covering intervals. By the gluing lemma (Lemma 8.3.6), we obtain a decomposition of M
itself.
Remark 12.3.4. As in Remark 12.2.8, to prove Lemma 12.3.2 one may use
Lemma 8.6.2 in place of Lemma 8.6.1. Again, one enlarges K to be spherically complete with value group R and algebraically closed residue field. One
209
then notes that F = <1 K /t, tan is a field (Corollary 8.5.3), so one can
approximate the basis B0 of M K /t,t F1 with a basis B1 of M K /t,t F
defining the same supremum norm. One then obtains a decomposition of
M K /t,t K /t, tan for some , and one can remove the unwanted poles
as in Remark 12.2.8.
210
and 12.4.2. Similarly, one can obtain decomposition theorems on more exotic
subspaces of the affine line by gluing; the reader knowledgeable enough to be
interested in such statements should at this point have no trouble formulating
and deriving them. See also Remark 12.5.3 below.
211
Remark 12.5.4. If p = 0, it should be possible to get a decomposition theorem over K /t, t/0 even without assuming that f i (M, r ) >
f i+1 (M, r ) for r { log , log }; a statement along these lines (for K
discretely valued) appears in [144]. However, this fails completely if p > 0;
one can generate numerous counterexamples using the theory of isocrystals
(Chapter 23).
212
213
j+1 /( p1)
j /( p1)
cb =
n b1
;
m( 1)
then
I R((N Mb,cb ) F p j ) > p
jb
( [, ]).
214
215
Then, by Theorem 11.3.2(c), on the one hand we must have s, j1 (M) < 0
for some K .
On the other hand, we have f j (M, 0) > 0 if j > 0 (by Lemma 12.8.1) and
hence s, j (M) = 0 (by Theorem 11.3.2(c) again). Similarly, if h {0, . . . , n 2 }
denotes the smallest index for which f i (M M, r ) = r identically for i > h,
we have f h (M M, 0) > 0, if h > 0, and s,h (M M) = 0.
Choose a lift of in K , and set N = T (M) as a differential module over
K tan . Note that Fh (N N , r ) is constant for r in a right-hand neighborhood
of 0; f h (N N , 0) > 0 if h > 0; and f i (N N , r ) = r for i > h. By
Theorem 12.5.1 there exists a direct sum decomposition P0 P1 of N N
such that, for each (0, 1), P0 F accounts for the first h subsidiary
radii of (N N ) F . By Theorem 9.6.1, P1 restricts to a trivial differential
module over the open unit disc.
For any (0, 1), any direct sum decomposition of N F is defined
by projectors which are horizontal elements of (N N ) F . For sufficiently close to 1 the subsidiary radii of P0 F are all strictly less than
(by Lemma 12.8.1 again), so the projectors must belong to P1 F . Since P1
is trivial on the open unit disc, the projectors must also extend to horizontal
elements of N N over the open unit disc. That is, they define a direct sum
decomposition of N over the open unit disc.
It follows that, over the open unit disc, N admits a direct sum decomposition i Ni in which, for each i and each (0, 1) sufficiently close to 1,
Ni F has only a single subsidiary radius. Namely, given any decomposition not satisfying this condition, we can apply Theorem 10.6.2 and then the
previous paragraph to obtain a finer decomposition. (This can only be repeated
as many times as the rank of N .)
Let S be the set of indices i for which limr 0+ f 1 (Ni , r ) f j (M, 0). Since
f j (M, 0) > 0, in a neighborhood of r = 0 we have that f 1 (Ni , r ) is affine
and nonincreasing for each i S, by Theorem 11.3.2(a), (d), and F j (N , r )
is a positive linear combination of these functions. However, the right-hand
slope of F j (N , r ) at r = 0 is s, j (M) = 0, so f 1 (Ni , r ) must be constant in a
neighborhood of r = 0 for each i S. In a neighborhood of r = 0, F j1 (N , r )
is a nonnegative linear combination of the f 1 (Ni , r ) for i S, so it also has
right-hand slope 0 at r = 0. But this contradicts the fact that s, j1 (M) < 0.
This contradiction leads to the conclusion that j = 0, which implies (a).
Given (a), we may deduce (b) from the fact that f i (M, r ) and f i+1 (M, r ) are
affine functions on [ log , log ] satisfying f i (M, r ) f i+1 (M, r ) (or,
in the case i = n, the same argument with f i+1 (M, r ) replaced by r ). Given
(a) and (b), the hypothesis of (c) implies that f i (M, r ) > f i+1 (M, r ) for all
r [ log , log ], so the claim follows from Theorem 12.5.2.
216
Notes
Some results described here have also been obtained recently by Christol [46].
However, since we did not have access to Christols definitive manuscript at
the time of this writing, we are unable to provide detailed references.
Our results on modules solvable at a boundary are originally due to Christol and Mebkhout [51, 52]. In particular, Lemma 12.6.2 for the generic
radius is [51, Thorme 4.2.1] and the decomposition theorem (which implies
Lemma 12.6.2 in general) is [52, Corollaire 2.41]. However, the proof technique of Christol and Mebkhout is significantly different from ours: they
construct the desired decomposition by exhibiting convergent sequences for
a certain topology on the ring of differential operators. This does not appear to
give quantitative results; that is, the range over which the decomposition occurs
is not controlled, although we are not sure whether this is an intrinsic limitation of the method. (Keep in mind that the our approach here crucially uses our
method of Frobenius descendants, which was not available when [51, 52] were
written.)
Note also that Christol and Mebkhout work directly with a differential module on an open annulus as a ring-theoretic object; this requires a freeness result
of the following form. If K is spherically complete, any coherent locally free
module on the half-open annulus with closed inner radius and open outer
radius is induced by a finite free module over the ring [,) K /t, t/.
(That is, any coherent locally free sheaf on this annulus is freely generated by
global sections.) See the notes for Chapter 8 for further discussion.
A partial extension of the work of Christol and Mebkhout can be found in a
paper of Pons [181]. However, in the absence of a theory of Frobenius descendants, Pons was forced to impose somewhat awkward hypotheses in order to
avoid the exceptional value p 1/( p1) for the generic radius of convergence.
For the attribution of Theorem 12.7.2, see the notes for Chapter 17.
The notion of a clean differential module is original and was motivated
partly by discussions with Liang Xiao. It is a first attempt to model in p-adic
differential theory a higher-rank analogue of Katos notion of cleanness for
a rank-1 tale sheaf [119]. Similar considerations in residual characteristic 0
appear in [144].
Exercises
(1) Prove an analogue of Lemma 12.2.4 in which M is required only to be
locally free (in the sense of Remark 5.3.4).
(2) Let L be a complete extension of K . Prove that, for any > 0, within the
completion of L(t) for the -Gauss norm we have
Exercises
F Lt/ = K t/,
217
13
p-adic exponents
m=0,m=
218
tm
.
m
219
(m)
1
= lim inf
.
log p type() m+ m
(13.1.4.1)
(13.1.4.2)
(13.1.4.3)
220
p-adic exponents
m=0
xm
(x)m 1
= ex
.
(1 )(2 ) (m )
m! m
m=0
m
i=0 ci /(i ) for some ci Q. It is thus a rational function of of the
form P()/((1 ) (m )), where P has coefficients in Q and degree
at most m. To check that in fact P() = 1 identically, we need only check this
for = 0, . . . , m.
In other words, to check the original identity it suffices to check after multiplying both sides by i and evaluating at = i for each nonnegative integer
i. On the left-hand side we obtain
m=i
x m
.
(1)i1 i!(m i)!
(x)i
,
i!
221
m=0
xm
(1 )(2 ) (m )
i
N j Ui j
(i > 0).
j=1
222
p-adic exponents
n
|g h + i|1
g,h=1
for some c > 0 not depending on i. (For a somewhat more careful argument in
this vein, see Proposition 18.1.1.)
Thus, to conclude the theorem it suffices to verify that, for each g, h
{1, . . . , n}, the quantity = g h has the property that
m
max{1, | i|1 }
i=1
a(a + 1) (a + i)b(b + 1) (b + i)
i=0
c(c + 1) (c + i)i!
zi ,
223
to lie in Z p (e.g., if they are rational numbers with denominators prime to p),
one gets a strong necessary condition from the generic radius of convergence
function.
Definition 13.3.1. Let M be a finite differential module on the disc or annulus |t| I , for I an interval. We say that M satisfies the Robba condition if
I R(M F ) = 1 for all nonzero I .
Proposition 13.3.2. Let M be a finite differential module on the open disc
of radius for the derivation td/dt satisfying the Robba condition in some
annulus. Then the eigenvalues of the action of D on M/t M belong to Z p .
224
p-adic exponents
hope to identify them using purely p-adic considerations; in fact, we must use
archimedean considerations to identify them. Here are those considerations.
Definition 13.4.1. We will say that A, B Znp are equivalent (or sometimes
strongly equivalent) if there exists a permutation of {1, . . . , n} such that
Ai B (i) Z for i = 1, . . . , n. This is evidently an equivalence relation.
Definition 13.4.2. We say that A, B Znp are weakly equivalent if there exists
a constant c > 0, a sequence 1 , 2 , . . . of permutations of {1, . . . , n}, and
signs i,m {1} such that
(i,m (Ai Bm (i) ))(m) cm
(i = 1, . . . , n; m = 1, 2, . . . ).
225
Proposition 13.4.5. Suppose that A, B Znp are weakly equivalent and that
B has p-adic non-Liouville differences. Then A and B are equivalent.
Proof. There is no harm in replacing B by an equivalent tuple in which
Bi B j Z if and only if Bi = B j . For some c and m , we have, for all m,
((Ai Bm (i) ))(m) cm,
((Ai Bm+1 (i) ))(m+1) c(m + 1),
and so
((Bm (i) Bm+1 (i) ))(m) 2cm + c.
By hypothesis, the difference Bm (i) Bm+1 (i) is either zero or a p-adic
non-Liouville number which is not an integer; for m large, the previous
inequality is inconsistent with the second option, by Proposition 13.1.4, so
Bm (i) = Bm+1 (i) . That is, for m large we may take m = for some fixed
, so
((Ai B (i) ))(m) cm
(m = 1, 2, . . . ).
i=0
(2i)
p (2i+1) .
i=0
One may verify (exercise) that the pairs (, ) and ( , 0) are weakly
equivalent but not equivalent. In this case , , + , are all p-adic
Liouville numbers.
226
p-adic exponents
Our best hope at this point would be to prove that any differential module M
over K /t, t/ satisfying the Robba condition is isomorphic to such an N .
This turns out to be rather too much to ask for, owing to difficulties arising with
Liouville numbers (not to mention the closed boundary), but for the moment
let us postulate the existence of an isomorphism N K /t, t/
= M and
see what it tell us.
Assume (for clarity) that K contains the group p of all p-power roots of
unity (i.e., all roots of unity of orders that are powers of p) in K alg . Since M
and N satisfy the Robba condition, a Taylor series construction gives an action
of p ; recall that we have used the p-power action already in the construction of Frobenius antecedents (Theorem 10.4.2). The K -valued characters of
p may be naturally identified with Z p , by identifying a Z p with the map
carrying p to a ; we can use the structure of N to decompose M into
character spaces for this action. Namely, perform a shearing transformation if
needed (Proposition 7.3.10) to ensure that N0 has prepared eigenvalues. Then
choose the basis of N so that the matrix N0 splits into blocks corresponding
to individual eigenspaces V j for the action of D (so, in particular, the K -span
of the chosen basis of N equals j V j ). Let j Z p be the eigenvalue corresponding to V j . We may then identify M with ( j V j ) K K /t, t/; under
this identification t i V j is an eigenspace with character j + i.
The strategy for constructing p-adic exponents is then to turn this argument
on its head, by first constructing the eigenspaces for the actions of the finite
subgroups of p and then extracting elements of these which stabilize at a
basis of M. One might expect an obstruction to arise because the full action of
1 + m K by Taylor series need not be semisimple, but this does not occur; see
the exercises for a statement that may help to give the reason.
Following the strategy discussed in the previous remark, we now introduce
the definition of exponents.
Definition 13.5.2. Let M be a finite differential module of rank n over
K /t, t/ satisfying the Robba condition. For m > 0, let pm denote the
group of pth roots of unity in K alg , and put p = m>0 pm . For p ,
define the action of on M K /t,t/ K ( )/t, t/ via the Taylor
series
(x) =
( t t)i i
D (x);
i!
i=0
227
a sequence {Sm,A }
m=1 of n n matrices over K /t, t/ satisfying the
following conditions.
Then the resulting matrices Sm,A have the desired property (exercise).
For the general construction of exponents, we need the following lemma due
to Dwork and Robba. It will be easiest to postpone its proof until Chapter 18,
when we will derive it as a corollary of some explicit convergence bounds on
solutions of p-adic differential systems (Corollary 18.2.5).
228
p-adic exponents
log p i"
(i = 1, 2, . . . ).
( pm )
and use the formula for change of basis in a difference module. See
Remark 14.1.3 below.)
We will check that Sm,A satisfies (b) of Definition 13.5.2 using
m+1 /( p1)
by
Lemma 13.5.4. For each pm , we have | 1| p p
Example 2.1.6. If we write
E( ) =
( 1)i t i Ni
i=0
i p m+1
i p m
= (n1)m+(n1) log p (i p m )
.
p1
p( p 1)
229
p1
vm+1,A+ pm b, j
( j = 1, . . . , n).
b=0
(In words this second equation states that each eigenspace for the action of
pm is the direct sum of p eigenspaces for the action of pm+1 .) Hence, for
any A Znp and any m 0, we have
det(Sm,A ) =
det(Sm+1,A+ pm b ).
(13.5.5.1)
b{0,..., p1}n
i
we can choose b such that
Write
Sm,A = iZ Sm,A,i
t . By (13.5.5.1)
det(Sm+1,A+ pm b,0 ) det(Sm,A,0 ). Since S0,A is the identity matrix, this
det(Sm,A,0 )
satisfy
allows
us
to
choose
A
such
that
the
matrices
S
m,A
det(S0,A,0 ) = 1 for all m. Since det(Sm,A ) det(Sm,A,0 ) for all
For j = 1, . . . , n, put vm, j,A = i (Sm,A )i ei and vm, j,B = i (Sm,B )i ei . Let
1
be the change-of-basis matrix between the bases vm, j,A and
Tm = Sm,A Sm,B
1
vm, j,B of M. Since |S 1 | |Sm,A |n1 det(Sm,A ) (from a description of
m,A
( [, ]).
(13.5.6.1)
230
p-adic exponents
We will make progress by levering (in a metaphorical sense) the upper bound
(13.5.6.1) against various lower bounds involving Tm , using the log-convexity
of the Gauss norm as a function of the radius, i.e., the Hadamard three circles
theorem (Proposition 8.2.3(c)).
To begin with, we have
n
1
|det(Tm )| det(Sm,A ) Sm,A p nkm
( [, ]).
Put = . From the additive formula for the determinant of Tm , there must
be a permutation m of {1, . . . , n} such that
n
i=1
(13.5.6.2)
Write Ti,m (i) = jZ Ti, j t j with Ti, j K ; we can then choose j = j (i, m)
so that |Ti, j (i) t j | = |Ti,m (i) | . We repeat the leverage process to limit the
2
j
=
p (n +n)km ;
=
|Ti, j (i) t j |
in the case j 0, combine (13.5.6.2) with the case = of (13.5.6.1) to get
j =
! " j
|Ti, j (i) t j |
2
p (n +n)km .
j
|Ti, j (i) t |
231
232
p-adic exponents
( [ , ]).
Our first goal is to show that Tm ,m is close to its constant term. Write Tm ,m =
h
hZ Tm ,m,h t ; note that the (i, j)th entry of Tm ,m,h can only be nonzero if
h Ai A j (mod p m ). Since A has p-adic non-Liouville differences, for
any c > 0 and for m sufficiently large the congruence h Ai A j (mod p m )
forces either h = Ai A j = 0 or |h| cm. If h > 0 we then have
!
"m
|Tm ,m,h t h | |Tm ,m,h t h | |Tm ,m,h t h | cm p nk c
and, if h < 0,
!
"m
|Tm ,m,h t h | |Tm ,m,h t h | |Tm ,m,h t h | cm p nk c .
Pick any (0, p nk ). Since > 1, we may choose c in the above argument
such that p nk c ; we then deduce that, for some m 0 ,
|Tm ,m Tm ,m,0 | m
( [ , ], m m 0 ).
p nkm m < 1
( [ , ], m m 0 ),
( [ , ], m m 0 ).
233
( [ , ], m m 0 ).
m p nkm
We now write
1
Um ,m Um ,m = Tm ,m Tm1
,m,0 Tm ,m Tm ,m,0
1
= Tm ,m Tm1
,m,0 (Tm ,m,0 Tm ,m Tm ,m,0 )Tm ,m,0
1
= Tm ,m Tm1
,m,0 (Tm ,m,0 Tm ,m ,0 Tm ,m,0 )Tm ,m,0
+ Tm ,m (Tm ,m Tm ,m ,0 )Tm1
,m,0 .
Using this last expression, we obtain the bound
|Um ,m Um ,m | m p 4nkm
( [ , ], m m 0 ).
234
p-adic exponents
235
1
1
Z Z.
1
n
Since the slope for r > log is equal to 1, the slopes for r log
cannot exceed 1. Moreover, there cannot be a slope equal to 1 in this range
236
p-adic exponents
n1
( log ).
n
Notes
237
Notes
The definition of a p-adic Liouville number was introduced by Clark [54]; our
presentation follows [80, VI.1].
The cited theorem of Clark [54, Theorem 3] is actually somewhat stronger
than Theorem 13.2.3 as it allows differential operators of possibly infinite
order.
The example in Remark 13.2.4 was loosely inspired by an example of
Monsky (a counterexample against a slightly different assertion); see [81, 7]
or [80, IV.8] for discussion.
Proposition 13.3.2 was originally due to Christol; compare [80,
Proposition IV.7.7].
The theory of exponents for differential modules on a p-adic annulus
satisfying the Robba condition was originally developed by Christol and
Mebkhout [50, 45]; in particular, Theorem 13.6.1 appears therein as [50,
Thorme 6.24]. A somewhat more streamlined development was later given
by Dwork [79], in which Theorem 13.6.1 appears as [79, Theorem 7.1].
(Dwork notes that he did not verify the equivalence between the two constructions; we do not recommend losing any sleep over this.) Our treatment is
essentially that of Dwork with a few technical simplifications. Another treatment will appears in the upcoming book of Christol [46]. Besides all these
there is an expository article [161] that summarizes the proofs (using Dworks
approach) and provides some further context, including the formulation of the
p-adic index theorem.
It is claimed in [161, 3.2] that the converse of Theorem 13.5.6 also holds,
i.e., if A is an exponent of a differential module M and B is weakly equivalent
238
p-adic exponents
Exercises
(1) Prove that rational numbers are p-adic non-Liouville numbers.
(2) Give another proof of Lemma 13.1.6 (as in [80, Lemma VI.1.2]) by first
verifying that both sides of the desired equation have the same coefficients of x 0 and x 1 and are killed by the second-order differential operator
"
d !d
x .
dx dx
(3) Show that Theorem 13.2.2 can be deduced from Theorem 13.2.3. (Hint:
show that if H 0 (M) = 0 then 0 must occur as an eigenvalue of N0 .)
(4) Prove that there exists a Z p satisfying
type(a) = 1,
type(a) < 1.
Exercises
239
Part IV
Difference Algebra and Frobenius Modules
14
Formalism of difference algebra
We now step away from differential modules for a little while to study the
related subject of difference algebra. This is the theory of algebraic structures
enriched not with a derivation but with an endomorphism of rings. Our treatment of difference algebra will run largely in parallel with that for differential
algebra but in a somewhat abridged fashion; our goal is to be able to use difference algebra to say nontrivial things about p-adic differential equations. We
will begin to do that in Part V.
In this chapter we introduce the formalism of difference rings, fields, and
modules and the associated notion of twisted polynomials. Then we study
briefly the analogue of algebraic closure for a difference field. Finally, we
make a detailed study of difference modules over a complete nonarchimedean
field culminating in a classification of difference modules for the Frobenius
automorphism of a complete unramified p-adic field with algebraically closed
residue field (the DieudonnManin classification).
Hypothesis 14.0.1. Throughout this part of the book and the next, we retain
Notation 8.0.1 with p > 0 (so that K is a complete nonarchimedean field of
characteristic 0 and residual characteristic p > 0) but, unless otherwise specified, we will require K to be discretely valued. This is necessary to avoid a
number of technical complications, some of which we will point out as we go
along. (We will make almost no reference to K in this particular chapter; we
only include this hypothesis here in order to place it at the beginning of Part V.)
244
(r R, m M).
(m M, n N ).
Namely, if we use the basis to identify M with the space of column vectors of
length n over R then
(v) = A(v).
In parallel with the differential case, we call A the matrix of action of on the
basis e1 , . . . , en .
We now record the effect of a change of basis on a matrix of action.
Remark 14.1.3. Let M be a finite free difference module over R, and let
e1 , . . . , en and e1 , . . . , en be two bases of M. Recall that the change-of-basis
matrix U from the first basis to the second is the n n matrix U satisfying
245
( f M , m M).
In terms of a given basis, the matrix of action on the dual basis is given by the
inverse transpose AT . We call the resulting difference module the dual of M.
There is also a notion of an opposite difference ring but only under an extra
hypothesis.
Definition 14.1.5. We say that the difference ring R is inversive if is an automorphism. In this case, we can define the opposite difference ring R opp to be R
again, but now equipped with the endomorphism 1 . If R is inversive and M
is a finite free difference module over R, we define the opposite module M opp
of M to be M equipped with the inverse of ; that is, if A is the matrix of action
of on some basis then the matrix of action of 1 is given by 1 (A1 ).
Definition 14.1.6. For M a difference module, write
H 0 (M) = ker(id ),
H 1 (M1 M2 ) = Ext(M1 , M2 )
of groups (see the proof of Lemma 5.3.3 for the group structure on extensions).
246
247
248
To prove (b), note that F{T }/F{T }(T c1 ) must be trivial. Since each
element of this difference module is represented by some x F, we must have
x F such that T x x is divisible on the right by T c1 . By comparing
degrees, we see that T x x = y(T c1 ) for some y F. Then y = (x)
and yc1 = x, proving the claim.
1 c
To prove (c), form the -module V corresponding to the matrix
. By
0 1
construction, we have a short exact sequence 0 V1 V V2 0 with
V1 , V2 trivial; since V must also be trivial, this extension must split. In other
words we can find x F such that
1 x
1 c
1 (x)
1 0
=
,
0 1
0 1
0
1
0 1
that is, (x) x = c.
Conversely, suppose that (a)(c) hold. Every nonzero dualizable finite difference module over F admits an irreducible quotient. This quotient admits a
cyclic vector, by Lemma 14.2.3, and so admits a quotient of dimension 1 by
(a). That quotient in turn is trivial by (b). By induction, we deduce that every
dualizable finite difference module over F admits a filtration whose successive
quotients are trivial of dimension 1. This filtration splits, by (c).
The notion of difference closure is particularly simple for the absolute
Frobenius endomorphism (the p-power map) on a field of characteristic p > 0.
Proposition 14.3.4. Let F be a separably (resp. algebraically) closed field
of characteristic p > 0 equipped with a power of the absolute Frobenius
endomorphism. Then F is weakly (resp. strongly) difference-closed.
m
Pi T m F{T }, with m > 0, Pm = 1, and P0 = 0, the
Proof. For P = i=0
m
i
polynomial Q(x) = i=0 Pi x q has degree q m 2, and x = 0 occurs as a
root only with multiplicity 1. Moreover, the formal derivative of P is a constant
polynomial and so has no common roots with P; hence P is a separable polynomial. Since F is separably closed, there must exist a nonzero root x of Q;
this implies criteria (a) and (b) of Lemma 14.3.3. To deduce (c) note that, for
c F , the polynomial x q x c is again separable and so has a root in F.
249
F F .
We sometimes call F the perfection, or more precisely the -perfection, of F.
As in the differential case, we would like to classify finite difference modules over F by the spectral radius of ; this turns out to be somewhat easier in
the difference case because there is now no analogue of the distinction between
the visible and full spectra. A related fact is that the use of matrix inequalities
here is much closer to that in Chapter 4 than that in Chapter 6; the main difference is that we do not begin with a satisfactory theory of eigenvalues or
eigenvectors.
We first note the following analogue of Lemma 6.2.5.
Lemma 14.4.2. Let V be a nonzero finite difference module over F. Let
e1 , . . . , en be a basis of V . For each nonnegative integer s, let As be the matrix
of action of s on e1 , . . . , en . Then
||sp,V = lim |As |1/s .
s
250
|1 |V F = er .
Consequently
||sp,V = er ,
| |sp,V = er
then clearly
||V = er ,
|1 |V F = ||V = er .
We deduce that
||sp,V er ,
since
1 ||sp,V |1 |sp,V F er er ,
1 ||sp,V ||sp,V er er ,
||m
sp,V |F |.
Definition 14.4.6. Let V be a nonzero finite difference module over F. We say
that V is pure (and) of norm s if all the Jordan-Hlder constituents of V have
spectral radius s. Note that V is pure of norm 0 if and only if dim F V = 0. If
V is pure of norm 1, we also say that V is tale or unit-root.
Remark 14.4.7. It is more common to classify pure modules using additive
notation, i.e., in terms of the slope ( log s) instead of the norm s. (Correspondingly, pure modules are also called isoclinic modules.) For better or
251
worse, we have decided here to keep the notation consistent with the multiplicative terminology used in the differential case. We will switch to the
additive language only in the next section, in order to talk about Hodge and
Newton polygons.
Proposition 14.4.8. Let V be a nonzero finite difference module over F. Then
V is pure of norm s > 0 if and only if
||sp,V F = s,
|1 |sp,V F = s 1 ;
(14.4.8.1)
We thus have
1 ||sp,W F |1 |sp,W F ss 1 = 1,
which forces the equalities ||sp,W = ||sp,W F = s and |1 |sp,W F =
s 1 . In particular, W is pure of norm s. Moreover, by Lemma 14.2.3
there is an isomorphism W
= F{T }/F{T }P for some twisted polynomial
P; by Theorem 14.2.5, P has only one slope in its Newton polygon. By
Lemma 14.4.4 that slope must equal log s, and so W must be pure of norm
s 1 , as then is V .
Corollary 14.4.9. Let V1 , V2 be nonzero finite difference modules over F
which are pure and of respective norms s1 , s2 . Then V1 V2 is pure of
norm s1 s2 .
Note that this does not follow immediately from Lemma 14.4.3(b) because
the tensor product of two irreducibles need not be irreducible.
Proof. If s1 s2 = 0 then it is easy to check that V1 V2 is pure of norm 0.
Otherwise, Lemma 14.4.3(b) plus Proposition 14.4.8 yields
||sp,V1 V2 F = ||sp,V1 F ||sp,V2 F = s1 s2 ,
|1 |sp,V1 V2 F = |1 |sp,V1 F |1 |sp,V2 F = s11 s21 ;
thus Proposition 14.4.8 again implies that V1 V2 is pure of norm s1 s2 .
252
i (v)
i=0
i1 (v)
i=0
does likewise.
Corollary 14.4.12. If V1 , V2 are nonzero finite differential modules over F
which are pure of respective norms s1 , s2 and if either
(a) s1 < s2 or
(b) F is inversive and s1 > s2 ,
then any exact sequence 0 V1 V V2 0 splits.
Proof. If s2 > 0 then, by Proposition 14.4.8 and Corollary 14.4.9, V2 V1 is
pure of norm s1 /s2 , so Proposition 14.4.11 gives the desired splitting. Otherwise we must be in case (b), so we can pass to the opposite ring to make the
same conclusion.
If F is inversive, we again get a decomposition theorem.
Theorem 14.4.13. Suppose that F is inversive. Let V be a finite difference
module over F. Then there exists a unique direct sum decomposition
%
V =
Vs
s0
253
254
Remark 14.4.17. Note that, whenever V is pure and of positive norm, we can
apply Proposition 14.4.16 after replacing by some power of itself, thanks to
Corollary 14.4.5.
(i = 1, . . . , n),
(i = 1, . . . , n).
Proof. The first assertion follows from the corresponding fact in the linear
case (which is analogous to Lemma 4.1.9). The second assertion reduces to
the fact that if V is irreducible of dimension n and spectral radius s then i V
has spectral radius s i for i = 1, . . ., n; this follows from the fact that, in the
basis given by Lemma 14.4.4, i i V = s i .
Corollary 14.5.4 (Newton above Hodge). We have
s N ,1 + + s N ,i s H,1 + + s H,i
(i = 1, . . . , n)
255
s N ,1 + + s N ,i = s H,1 + + s H,i
then we can change basis, using a matrix in GLn (o F ), so that the matrix of
action of becomes block upper triangular; the top left block accounts for
the first i Hodge and Newton slopes of V . Moreover, if F is inversive and
s H,i > s H,i+1 , we can ensure that the matrix of action of becomes block
diagonal.
Proof. First we change basis by a matrix in GLn (o F ) such that the F-span
of the first i basis vectors equals the step of the filtration of Theorem 14.4.15
consisting of norms greater than or equal to es N ,i . We may then proceed as in
Theorem 4.3.11.
Remark 14.5.6. Be aware that the Newton polygon, unlike the Hodge polygon, cannot be read off directly from the matrix of action of ; see the
exercises for an example. However, this can be done if the matrix of is a
companion matrix. A restatement follows.
Proposition 14.5.7. If V
= F{T }/F{T }P then the Newton polygon of V
coincides with that of P.
Proof. This reduces to Lemma 14.4.4 using Theorem 14.2.5.
We also obtain the following analogue of Proposition 4.4.10.
Proposition 14.5.8. Let V be a finite difference module over F, equipped with
the supremum norm for some basis. For i = 1, 2, . . . , let s H,1,i , . . . , s H,n,i be
the Hodge slopes of V as a difference module over (F, i ). Then, as i ,
the quantities i 1 s H,1,i , . . . , i 1 s H,n,i converge to the Newton slopes of V .
Proof. The claim is independent of the choice of basis, so by Theorem 14.4.15
we may reduce to considering a pure module. In that case Lemma 14.4.2
implies that i 1 s H,1,i s N ,1 as i .
Note that s N ,1 = = s N ,n by purity and that i 1 s H,1,i i 1 s H, j,i for
j 1. Also, the Newton slopes of V as a difference module over (F, i ) are
is N ,1 , . . . , is N ,n ; by the equality case of Corollary 14.5.4 we obtain i 1 s H,1,i +
+ i 1 s H,n,i = s N ,1 + + s N ,n . All this plus the fact that i 1 s H,1,i s N ,1
as i yields the desired convergence.
Proposition 14.5.9. Suppose that F = K (i.e., F is discretely valued) and that
R = Ft0 carries the structure of an isometric complete nonarchimedean
difference ring for the norm | |1 . Let M be a finite free difference module over
256
R with least Newton slope c. Then there exists a basis of M with respect to
which, for each positive integer m, the least Hodge slope of m is the least
element of v(F ) greater than or equal to cm.
Proof. Recall that if K is discrete then E is the completed fraction field of R
(see Definition 9.4.3). Construct a suitable basis of M E by imitating the
proof of Lemma 4.3.13 and then apply Lemma 8.6.1.
...,
(ed1 ) = ed ,
(ed ) = e1 .
(i = 1, . . . , d).
Hence, by Proposition 14.4.16, V,d is pure of norm 1/d , as then is any submodule. But if the submodule were proper and nonzero then we would have a
violation of Corollary 14.4.5.
Next we show that, if F has a sufficiently large residue field, one can classify
all dualizable finite difference modules in terms of the standard modules V,d .
One can always enlarge F to reach this case; see the exercises.
Theorem 14.6.3. Let F be a complete discretely valued field, equipped with
an isometric endomorphism , such that F is strongly difference-closed. Then
every dualizable finite difference module over F can be split (non-uniquely) as
a direct sum of submodules, each of the form V,d for some , d. Moreover,
given any generator of m F , we can force each to be a power of .
Proof. We first check that if V is pure of norm 1 then V is trivial; for this
step, we only need F to be weakly difference-closed. We must show that, for
257
(mod m ).
(mod m+1 ).
(mod ),
(mod )
is possibly by criteria (b) and (c) from Lemma 14.3.3. (We must use (b) to
remove the leading coefficient in the first term before applying (c).)
At this point, we may apply Theorem 14.4.13 to reduce the desired result to
the case where V is pure of norm s > 0. Let d be the smallest positive integer
such that s d = | m | for some integer m. Then the first paragraph of the proof
implies that m d fixes some nonzero element of V ; this gives us a nonzero
map from V m ,d to V . By Lemma 14.6.2 this map must be injective. Repeating
this argument, we can write V as a successive extension of copies of V m ,d .
However, Vm ,d V m ,d is pure of norm 1 and so has trivial H 1 as above.
Thus V splits as a direct sum of copies of V m ,d , as desired.
By Proposition 14.3.4, Theorem 14.6.3 has the following immediate
corollary.
258
Notes
The parallels between difference algebra and differential algebra are close
enough that a survey of references for difference algebra strongly resembles
its differential counterpart. A well-established, perhaps rather dry, reference
is [56]; a somewhat more lively and more modern reference, which develops difference Galois theory under somewhat restrictive conditions, is [201].
We again mention [4] as a useful unifying framework for difference and
differential algebra.
The choice of the modifier difference in the phrase difference algebra is
motivated by the following basic example. Consider the automorphism on the
ring of entire functions f : C C given by ( f )(z) = f (z + q) for some q.
The difference operator
Notes
( f ) =
259
( f ) f
q
260
Exercises
(1) For M1 , M2 difference modules over a difference ring R, with M1 dualizable, give a canonical identification of H 1 (M1 M2 ) with the Yoneda
extension group Ext(M1 , M2 ). (Hint: as in Lemma 5.3.3 you may wish to
first reduce to the case M1 = R.)
(2) Prove Lemma 14.4.3. (Hint: Lemma 14.4.2 may be helpful, particularly
for (c) and (d).)
(3) Let F be a difference field (of arbitrary characteristic) containing an element x such that (x) = x for some fixed by which is not a
root of unity. Prove that every finite difference module for M admits a
cyclic vector. (Hint: imitate the proof of Theorem 5.4.2. At the key step,
instead of getting a polynomial in s you should get a polynomial in s ; the
root-of-unity condition forces s to take infinitely many values.)
(4) Let F be the completion of Q p (t) for the 1-Gauss norm, viewed as a difference field for equal to the substitution t t p . Let V be the difference
module corresponding to the matrix
1 t
A=
.
0 p
Prove that there is a nonsplit short exact sequence 0 V1 V V2
0 with V1 , V2 pure and of norms s1 , s2 with s1 < s2 .
(5) Here is a beautiful example from [120, 1.3] (attributed to B. Gross). Let
p be a prime congruent to 3 modulo 4, put F = Q p (i) with i 2 = 1,
and let be the automorphism i i of F over Q p . Define a difference
module V of rank 2 over F, using the matrix
Exercises
1 p
A=
( p + 1)i
261
( p + 1)i
.
p1
Compute the Newton polygons of A and V and verify that they do not
coincide. (Hint: find another basis of V , on which acts diagonally.)
(6) (a) Prove that every difference field can be embedded into a strongly
difference-closed field. (Hint: use your favorite equivalent of the
axiom of choice, e.g., Zorns lemma or transfinite induction.)
(b) Prove that every complete isometric discretely valued difference field
can be embedded into a complete isometric difference field with the
same value group but having a strongly difference-closed residue field.
(7) Prove that Theorem 14.6.3 continues to hold if the hypothesis that F is
discretely valued is replaced by one of the following hypotheses.
(a) There exists a constant > 0 with the following property: for c
{0, 1} and x o F there exists y o F with |(y) cy x| < |x|.
(Hint: use to replace the -adically convergent sequences in the
proof of Theorem 14.6.3.)
(b) The field F is spherically complete. (Hint: attempt to form convergent sequences as in (a) and then invoke spherical completeness if the
sequences fail to converge.)
(8) This exercise is related to the construction of Witt vectors. Let k, be
perfect fields of characteristic p > 0. Suppose that Ck , C are complete discrete valuation rings, having residue fields identified with k, ,
respectively, whose maximal ideals are generated by p. Prove that any
homomorphism k lifts uniquely to a homomorphism Ck C .
(Hint: read about Teichmller lifts in the notes.)
15
Frobenius modules
iZ
i=0
262
ci t : ci K , lim |ci | = 0, lim |ci | = 0 ,
i
i+
ci t i : ci K , sup{|ci |} < ,
i
K /t, t 0 =
K /t, t} =
ci t i : ci K , lim |ci | i = 0 ( (0, 1)) ,
i
i=0
ci t : ci K , lim |ci | = 0, sup{|ci |} < ,
i
iZ
iZ
263
i+
( (0, 1)) .
Definition 15.1.2. For later use, we give a special notation to certain rings
appearing in this framework. We have already defined E to be the completion
of o K ((t)) oK K = K t0 [t 1 ] for the 1-Gauss norm; that is, E consists of
i
ci t which have bounded coefficients
that is, E consists of formal sums
and converge in some range |t| < 1. This ring is sometimes called the
bounded Robba ring, since it consists of the bounded elements of the Robba
ring (see Definition 15.1.4 below). Note that it can also be written as
$
K /t, t0 .
E =
(0,1)
Lemma 15.1.3.
(a) The ring E is a field.
(b) Under the 1-Gauss norm | |1 , the valuation ring oE is a local ring with
maximal ideal m K oE .
(c) The field E , equipped with | |1 , is henselian (see Remark 3.0.2).
This last property implies that finite separable extensions of E = K ((t))
lift functorially to finite tale extensions of oE (and to unramified extensions
of E ). In particular, the maximal unramified extension E ,unr carries an action
of the absolute Galois group of K ((t)).
Proof. To check (a), note that the Newton polygon of any nonzero x E
has finite width (since K is discretely valued). We can thus choose such
that x K /t, t0 has no slope between 0 and log . In this case x is a
unit in K /t, t0 by Lemma 8.2.6(c), yielding (a). We may deduce (b) as an
immediate corollary of (a).
To prove (c) it suffices to check the criterion of Remark 3.0.2: for any monic
polynomial P(x) oE [x] and any simple root r E of P E [x], there
264
Frobenius modules
K /t, t};
(0,1)
265
where:
the map K : K K is an isometry;
the element u R satisfies |u t q |1 < 1. (If R = R, this forces
u E .)
(These are not the most inclusive conditions possible; see the notes.) The most
important case is when is absolute, i.e., when K provides the q-power
Frobenius lift on K .
Remark 15.2.2. Note that, unless is absolute, the property that it is a
Frobenius lift depends on the implicit choice of the series parameter t; that
is, is not invariant under isometric automorphisms of R.
Remark 15.2.3. Note that, in Definition 15.2.1, one cannot define a Frobenius
lift on an individual ring like K /t, t0 ; for instance, the simple substitution
t t q carries K /t, t0 to K 1/q /t, t0 . One can make this remark more
quantitative, as in the following lemma.
Lemma 15.2.4. Let be a Frobenius lift on E . Then there exists
(0, 1) such that, for , [, 1) with , carries K /t, t/ to
K 1/q /t, t/ 1/q and
| f | = |( f )| 1/q .
Proof. Since |(t)t q 1|1 < 1, by continuity we can choose (0, 1) and
(0, 1) so that |(t)t q 1| 1/q for [, 1]. This inequality implies
that |(t i )t qi 1| 1/q for all i Z. For such an , the claim is easily
1/q
1/q
266
Frobenius modules
always otherwise; see the exercises). We say that is zero-centered if its center
is equal to 0, i.e., if (t) 0 (mod t).
Lemma 15.2.6. Suppose that is a Frobenius lift on K t0 and that K :
K K is inversive. Then is centered and its center is unique.
Proof. Exercise.
267
268
Frobenius modules
A1 = U01 A0 (U0 ) =
B0 + C0 (D0 B01 )
E 0 (D0 B01 ) D0 B01 C0 (D0 B01 )
C0
.
E 0 D0 B01 C0
Since D0 B01 has entries in to K t, we have A1 A0 (mod t). We can thus
iterate the construction to obtain a sequence of block matrices
Bl Cl
Al =
(l = 0, 1, . . . )
Dl El
for which Al+1 = Ul1 Al (Ul ), with Ul GLn (o K t) equal to the block
lower triangular unipotent matrix with off-diagonal block Dl Bl1 . Note that
Bl+1 = Bl (I + Bl1 Cl (Dl Bl1 )), in which Bl1 Cl (Dl Bl1 ) has entries in
(t, m K ), so the Hodge slopes of Bl are preserved. In particular, the least Hodge
slope of Bl1 equals sg,i .
To complete the proof we must show that the Ul converge to the identity for
the (t, m K )-adic topology. Since the least Hodge slope of Bl1 equals sg,i , it is
enough to check that the Dl converge to zero for the same topology. Moreover,
it suffices to check that, for each positive integer m, Dl (mod t m ) converges
to zero for the m K -adic topology.
We do this by induction on m, the case m = 1 being clear because Dl 0
(mod t). Assume that the convergence is known modulo t m . Write
Dl+1 = El (Dl )(Bl1 ) Dl (Bl1 Cl )(Dl Bl1 ).
(mod t m+1 ),
(15.3.3.1)
where the ellipses represent terms depending on Dl,0 , . . . , Dl,m1 which are
already known to converge to 0 as l .
On the one hand the sum of the least Hodge slopes of the reductions of
El and (Bl1 ) modulo t equals sg,i+1 sg,i 0. Thus the reduction of
El (Dl,m t m )(Bl1 ) modulo t m+1 has p-adic valuation no less than that of the
reduction of (Dl,m t m ), which in turn has p-adic valuation strictly greater than
that of the reduction of Dl,m t m . On the other hand, Bl1 Cl has entries in o K t,
and (Dl Bl1 ) has entries in the ideal (t, m K ) since Dl is divisible by t. Thus
the reduction of Dl,m t m (Bl1 Cl )(Dl Bl1 ) modulo t m+1 has valuation strictly
greater than that of the reduction of Dl,m t m . We may conclude that, for any
c > 0, for l sufficiently large (so that the terms represented by the ellipses in
(15.3.3.1) all have norm less than c) we have |Dl+1,m | < max{|Dl,m |, c}. This
yields the desired convergence.
269
i
of on this basis. We wish to construct an n n matrix U =
i=0 Ui t
over o K t, with U0 = In , such that U 1 A(U ) = A0 or, equivalently,
1
U = A(U )A1
0 . Since the map U A(U )A0 is contractive for the
(t, m K )-adic topology on In + t Mnn (o K t), it has a unique fixed point, by
the contraction mapping theorem, which gives the desired isomorphism.
If r v(K ), we may apply the above argument after twisting by a scalar.
Otherwise we may replace by a power and then twist and apply the above
argument.
Since
is not complete, we cannot apply Theorem 14.4.15 to filter a finite
difference module over E with pure quotients of decreasing norms. It was
originally observed by de Jong that one can get a filtration with pure quotients
of increasing norms, but at the expense of replacing E with its -perfection.
Definition 15.4.1. Let be a Frobenius lift on E . Let E denote the
-perfection of E, that is, the completion of the direct limit R0 R1 ,
with Ri = E and the transition map Ri Ri+1 being . Choose as in
Lemma 15.2.4. For [, 1) we may define | | on E as the function
| | 1/q i on Ri ; this is consistent because Lemma 15.2.4 guarantees that the
transition maps are isometries. Let E be the subring of x E such that,
for some [, 1) depending on x, |x| < + for [, 1]. Note that,
within E ,
E E = E .
We obtain the following analogue of Lemma 15.1.3.
Lemma 15.4.2.
(a) The ring E is a field.
(b) Under the 1-Gauss norm | |1 , the valuation ring oE is a local ring
270
Frobenius modules
Proof. Exercise.
Example 15.4.3. Suppose that K is the identity and (t) = t q . Then E may
be identified with the set of formal sums iZ[1/ p] ci t i having the property
that, for each > 0, the set of i Z[1/ p] for which |ci | is both bounded
below and has bounded denominators. In this interpretation, E consists of
those sums i ci t i for which there exists (0, 1) for which |ci | i 0
as i . (Equivalently, there exists (0, 1) for which |ci | i remains
bounded as i .)
Theorem 15.4.4. Let V be a finite free dualizable difference module over E .
Then there exists a unique filtration
0 = V0 V1 Vl = V
of difference modules, such that each successive quotient Vi /Vi1 is pure and
of some norm si and s1 < < sl .
Proof. By Theorem 14.4.15 we obtain a decomposition of V = V E E
as a direct sum s Vs of difference modules in which each nonzero Vs is pure
of norm s. By replacing with a power, we can ensure that each s for which
Vs = 0 appears in |F |. It suffices to check that the summand of V of least
norm descends to a submodule of V , as we may then repeat the argument after
quotienting by this submodule.
By twisting and then approximating a suitable basis of V , we obtain a basis
of V on which the matrix of action of 1 is a block matrix
A B
,
C D
in which A is invertible over oE and B, C, D have entries in m K oE . Put
X = C A1 and then change basis using the block lower triangular unipotent
matrix with lower left block X , to obtain the new matrix of action
A + B 1 (X )
B
A B
=
.
C D
(D X B) 1 (X ) D X B
Repeating this operation yields a p-adically convergent sequence of changes
of basis; in the limit, we get a block upper triangular matrix in which the first
block corresponds to the summand of V of least norm. To prove that this
summand descends to a submodule of V , it suffices to check that the changeof-basis matrices are bounded in norm by 1 under | | for some (0, 1).
At the first stage, for sufficiently close to 1 we have
|A1 | max{|B| , |C| , |D| } < 1,
Notes
271
1
(xi ), with xi K ((t)) such that the
uniquely as the finite sum i=0
j
coefficient of t in xi vanishes whenever i > 0 and j is divisible by q.
This implies by an easy induction that each x E can be written uniquely
i
(xi ), with xi E
as a convergent (for the m K -adic topology) sum i=0
such that the coefficient of t j in xi vanishes whenever i > 0 and j is divisible
by q. In this presentation, for as in Lemma 15.2.4 we have | i (xi )| |x|
for [, 1) (exercise). We may thus take (x) = x0 .
In the general case, let K be the -perfection of K and put E = K 1/t, t0 .
(That is, E is the analogue of E with base field K instead of K .) Argue as above
Then choose any continuous linear map K K
to construct a map E E.
whose composition with the inclusion K K is the identity, and use it to
define a projection E E. The composition E E E has the desired
effect.
Corollary 15.4.6. The multiplication map : E E E E is injective.
Proof. Exercise.
Notes
Much existing literature makes the restriction that Frobenius lifts must be
absolute. The generalization that we have considered here is relevant for
some applications (e.g., to families of Galois representations in p-adic Hodge
theory), so it is prudent to allow it as much as possible.
However, we have not opted to consider cases where does not carry
K into K . (At that level of generality, one might consider that our definition
272
Frobenius modules
Exercises
(1) Verify the claims about Newton iteration from the proof of
Lemma 15.1.3(c).
(2) Suppose that K is perfect and that : E E is a continuous isometric
endomorphism inducing the absolute q-power Frobenius lift on K ((t)).
Prove that is an absolute Frobenius lift in the sense of Definition 15.2.1.
(Hint: use Witt vector functoriality to show that carries K into K . In
n
n
other words, for x K , show that (x p ) = (x) p is p-adically close to
an element of K .)
1
( (t)|t= ) on
(3) Prove Lemma 15.2.6. (Hint: show that the map K
m K is contractive; here (t)|t= denotes the substitution t .)
(4) Let K be the completion of Q p (x) for the 1-Gauss norm. Prove that the
Frobenius lift defined by (t) = t p + px does not have a center.
(5) Prove Lemma 15.4.2.
(6) Check the omitted details in the proof of Lemma 15.4.5. (Hint: you can
in fact show that |x| = supi {| 1 (xi )| }. To check this, it suffices to
m (E ) of E .)
consider x in the dense subset
m=0
(7) Prove Corollary 15.4.6. (Hint: if the claim fails, we can choose a nonzero
n
xi yi of the kernel with n as small as possible. Now
element z = i=1
use Lemma 15.4.5 to construct an element with an even shorter representation, as in the proof of Lemma 1.3.11. A related argument in Galois theory
is Artins proof of the linear independence of characters.)
16
Frobenius modules over the Robba ring
274
(16.1.1.2)
0
1
1
1
i 1 1
i
Ui+1
A(Ui+1 )A1
0 = (In + X i t ) Ui A(Ui )A0 (In + A0 (X i t )A0 ),
275
Remark 16.1.2. We will see a similar result later in the presence of a differential structure. This is introduced as Dworks trick (Corollary 17.2.2). The main
differences are that in the setting of Dworks trick there will be a canonical
isomorphism M
= (M/t M) K K {t}, and no restriction on K is needed.
Remark 16.1.3. Note that the function min{ 1/q , /| |} becomes
exactly the function min{1/ p , p} from Chapter 10 in the case where
q = p and K is absolutely unramified. See also Theorem 17.2.1.
One can apply Theorem 16.1.1 to coherent locally free modules on the open
unit disc, by virtue of the following observation. (Compare Definition 8.4.3,
remembering that in this part of the book K is assumed to be discretely valued.)
Proposition 16.1.4. Any coherent locally free module on the open unit disc
over K is freely generated by finitely many global sections.
Proof. Let M be such a module. Choose a sequence 0 < 1 < 2 < with
limit 1, and put Mi = M K t/i . Choose any basis B1 of M1 . Given a
basis Bi of Mi , there must exist X i GLn (K t/i ) such that changing basis
from Bi via X i produces a basis of Mi+1 . By Lemma 8.3.4 we can factor X i as
Vi Wi , with Vi GLn (K t/i ) such that |Vi In |i < 1 and Wi GLn (K [t]).
j
j
Write Vi =
j=0 Vi, j t , let Yi be the sum of Vi, j t over only those j for
which |Vi, j |1 1, and put Ui = Vi Yi1 . Changing basis from Bi via Ui also
produces a basis Bi+1 of Mi+1 .
Since K is discretely valued, for each positive integer j and for i sufficiently
large (depending on j), we have |Vi, j t j |1 1 if and only if |Vi, j t j |i
1. Consequently, the Ui converge t-adically to the identity matrix. Given any
(0, 1), if we choose (, 1) and combine the previous observation
with the fact that the Ui are bounded under | | , we may deduce that the
Ui converge under | | to the identity matrix. Hence the product U1 U2
is the change-of-basis matrix from B1 to a simultaneous basis of each Mi , as
desired.
276
(1)i1
i=1
ti
is not invertible (because its Newton polygon has infinitely many slopes). More
generally, we have the following easy fact.
Lemma 16.2.2. We have R = (E ) = E \ {0}.
Proof. On the one hand, a unit in K /t, t} must have empty Newton polygon,
so it must belong to E . On the other hand, E is a field by Lemma 15.1.3(a).
This proves the claim.
Definition 16.2.3. Because R consists of series with possibly unbounded
coefficients, it does not carry a natural p-adic norm or topology. The most useful topology on R is the LF topology, which is the direct limit of the Frchet
topology on each K /t, t} defined by the | | for [, 1). That is, a
sequence converges if on the one hand it is contained in some K /t, t} and
on the other hand it converges for the Frchet topology on that ring. Note
that it does not matter which is chosen since, for > , the inclusion
K /t, t} K /t, t} is strict (i.e., it is a homeomorphism to its image
equipped with the subspace topology).
In fact, the ring R is not even noetherian (by an argument similar to that
for K {t}; see the exercises for Chapter 8), but the following useful facts,
mentioned in the notes, are true.
Proposition 16.2.4. For an ideal I of R, the following are equivalent.
(a) The ideal I is closed in the LF topology.
(b) The ideal I is finitely generated.
(c) The ideal I is principal.
Remark 16.2.5. The equivalence of (b) and (c) in Proposition 16.2.4 implies
that R is a Bzout domain, i.e., an integral domain in which every finitely
generated ideal is principal. Such rings enjoy many properties analogous to
principal ideal domains; see the exercises.
We also have the following analogue of Proposition 16.1.4.
Proposition 16.2.6. Any coherent locally free module on the half-open annulus with closed inner radius and open outer radius 1 is represented by a
277
finite free module over K /t, t} and so corresponds to a finite free module
over R.
Proof. Let M be such a module. Then M K /t, t/ is finite free since
K /t, t/ is a principal ideal domain by Proposition 8.3.2. By choosing a
basis of M K /t, t/ and then invoking Lemma 8.3.6, we may extend M
to a coherent locally free module on the whole unit disc. We may then deduce
the claim from Proposition 16.1.4.
will see shortly (Corollary 16.3.7) that the module M is uniquely determined
by this requirement.
Lemma 16.3.2. Let A be an n n matrix over oE . Then the map v v
A(v) induces a bijection on (R/E )n .
Proof. Exercise, or see [136, Proposition 1.2.6].
Corollary 16.3.3. Let M be a finite free difference module over E such that
||sp,ME 1. Then the map H 1 (M) H 1 (M E R) is bijective.
Proof. This will follow from Lemma 16.3.2 once we produce a basis of M
on which the matrix of action of has entries in oE . Such a basis exists, as
explained in the proof of Proposition 14.5.9.
Corollary 16.3.4. Let M1 , M2 be two finite free dualizable difference modules
over E . Suppose that every Newton slope of M1 E E is less than or equal to
every Newton slope of M2 E E. Then
Hom(M1 , M2 ) = Hom(M1 E R, M2 E R).
Proof. Note that (M1 ) M2 has norm less than or equal to 1 (because its
Newton slopes are all nonnegative). It thus suffices to check that, for M of
norm s 1, H 0 (M ) = H 0 (M E R). As in the previous proof, we
can find a basis e1 , . . . , en of M such that the matrix A of action of has
278
279
280
281
Remark 16.4.8. Note that in Corollary 16.4.7 we need only the uniqueness of
the slope filtration in Theorem 16.4.1. The existence, which lies much deeper,
can be supplied using Theorem 16.1.1 because we start with a difference
module over K t0 .
(M) = ||sp,n M ;
this quantity is multiplicative in the sense that (M1 M2 ) = (M1 )(M2 ).
We say M is semistable if we have (M) (N ) for any nonzero difference
submodule N of M.
Lemma 16.5.2. Let M be a nonzero finite free dualizable difference module
over R that is pure of some norm. Then M is semistable.
Proof. Suppose on the contrary that there exists a difference submodule N of
M with (N ) > (M). Put e = rank(N ); then e N is of rank 1 and hence
pure. Consequently, the inclusion e N e M violates Corollary 16.3.5; this
contradiction yields the claim.
Lemma 16.5.3. Let M be a nonzero finite free dualizable difference module over R. There exists a unique filtration 0 = M0 Ml =
M by difference submodules, such that each quotient Mi /Mi1 is free and
semistable and
(M1 /M0 ) > > (Ml /Ml1 ).
Proof. First note that the claim holds for any module of rank 1 by
Lemma 16.5.2. From this point, the rest of the proof is formal; we leave it
as an exercise.
Definition 16.5.4. The filtration in Lemma 16.5.3 is called the Harder
Narasimhan or HN filtration of the module M. To prove Theorem 16.4.1 it
will suffice to check that semistable modules over R are pure.
Remark 16.5.5. The reader may wonder why it is necessary to prove
Theorem 16.4.1 if defining the HN filtration is a mere formality. The answer is
282
that one cannot prove much about the norms occurring in the HN filtration; for
instance, it has not been formally proved that the tensor product of semistable
modules is semistable but this is indeed a consequence of Theorem 16.4.1. A
similar situation arises in the theory of vector bundles, on which the usage
of the word semistable is modeled; see the discussion preceding [136,
Theorem 1.7.1]. Another loosely analogous situation is Delignes use of determinantal weights in his second proof of the Weil conjectures [69]; there one
has matrices over a p-adic field whose eigenvalues are supposed to be algebraic numbers with certain archimedean norms, but one has direct control only
over the determinants.
We need to introduce something like a residual difference closure of the
ring R. (This characterization is justified by Corollary 16.5.8.) This construction, based on Malcev-Neumann series (Example 1.5.8), is called the extended
Robba ring in [136].
i
be the ring of formal sums x =
Definition 16.5.6. Let R
iQ x i t with
xi K that satisfy the following properties. (Here is a number in (0, 1)
which may vary with x.)
(a) For (, 1) and we have limi |xi | i = 0.
(b) For (, 1) and for c > 0, the set of indices i for which |xi | i c
is a well-ordered subset of R (i.e., it contains no infinite decreasing
subsequence).
as follows. Recall that the field K ((t Q )) of MalcevWe may also construct R
For each (0, 1) take the Frchet completion for the norms | | , for
283
xi t i =
K (xi )t qi .
iQ
iQ
284
i.e., for some (0, 1) and for (, 1) we have |(x)| = |x| for all
x R (in particular, the two quantities are either both finite or both infinite).
This is clear if for instance (t) = t q , as we can then set (t) = t; it was
verified in general by [136, Proposition 2.2.6].
We then obtain the following results; the key tool used is descent for modules along a faithfully flat ring homomorphism. (We also make considerable
use of an analogue of the projection E E of Lemma 15.4.5.)
Lemma 16.5.10. Let M be a semistable dualizable finite difference module
L is semistable.
over R. Then M R R
Proof. See [136, Theorem 3.1.2].
Lemma 16.5.11. Let M be a dualizable finite difference module over R such
L is pure. Then M is pure.
that M R R
Proof. See [136, Theorem 3.1.3].
Putting everything together, we may prove Theorem 16.4.1 as follows.
Proof of Theorem 16.4.1. We wish to show that the HN filtration of M is pure.
By Lemma 16.5.10, if we start with the HN filtration of M and tensor with
L then the resulting filtration still has semistable successive quotients. ConR
L . By Lemma 16.5.7, each
sequently, it must be the HN filtration of M R R
successive quotient of the HN filtration of M becomes pure when tensored
L . By Lemma 16.5.11 each successive quotient is itself pure.
with R
Notes
The restriction that must be a Frobenius lift is probably stronger than necessary. In fact, as suggested in [136], if the action of on K ((t)) satisfies
v((x)) = mv(x) for some integer m > 1 this should suffice.
Our proof of Theorem 16.1.1 is essentially that of [130, Proposition 4.3],
except that in the latter only the absolute case is treated, and it is further
assumed that M descends to K t0 .
Proposition 16.2.4 is the essential content of a paper of Lazard [156].
Note that it depends on K being spherically complete and is false otherwise;
however, we have assumed in this part that K is discretely valued, so we
are safe.
The treatment of pure modules over R is an abridged version of [136, 1].
Corollary 16.3.4 has appeared in several guises previously. It figures
in the work of Cherbonnier and Colmez [38], which we will discuss in
Notes
285
286
theorem, due to Andr and Di Vizio [8]. A third application has been pursued by Hartl in the context of period morphisms for RapoportZink spaces.
These are moduli spaces for deformations of certain p-divisible groups into
mixed characteristic; the main conjecture of Rapoport and Zink is that a
period morphism between this space and a space of linear-algebraic objects
can be constructed, via which the Galois representations for to the p-divisible
groups correspond to some coherent data. In equal positive characteristic, Hartl
[101] established the appropriate analogue of this conjecture, using a form
of Theorem 16.4.1 in which K is of positive characteristic. (Hartl prove that
case of Theorem 16.4.1 directly, building on work of Hartl and Pink [104]
giving a form of Corollary 16.5.8 for K of characteristic p.) In the original
mixed-characteristic setting, Hartl has given some partial results [102, 103];
a complementary but related partial result has been established by Faltings
(though only the brief research announcement [86] was available at the time
of writing), and the combination of ideas may lead to a proof of the full
RapoportZink conjecture.
Exercises
(1) Show that if is a Frobenius lift such that (t) 0 (mod t 2 ) then the
conclusion of Theorem 16.1.1 holds without any restriction on K .
(2) Let R be a Bzout domain (an integral domain in which every finitely
generated ideal is principal).
(a) Prove that any x1 , . . . , xn R that generate the unit ideal appear as
the first row in some n n invertible matrix over R.
(b) Prove that every finitely generated torsion-free R-module is free.
(3) Prove Lemma 16.3.2. (Hint: reduce to the case where | A| 1 for
[, 1). For the injectivity, given that v A(v) (E )n show that
|v| is bounded for [, 1) by comparing |v| with |v| 1/q , using
Lemma 15.2.4. For the surjectivity, to find the class of w Rn in the
image of the map, separate off the positive terms w+ of w, replace w with
A(w+ ) + (w w+ ), and then repeat this procedure.)
(4) Prove that any A GLn (R) can be factored as U V with U GLn (K {t})
and V GLn (E ). (Hint: imitate the proof of Proposition 8.3.5.)
(5) Let M be a finite free difference module over K {t} such that M K {t} R
is pure of some norm s. Let M be the pure module over E satisfying
M K {t} R
= M E R. Prove that there exists a basis e1 , . . . , en of M
that is also a basis of M . (Hint: imitate the proof of Lemma 8.3.6 using
the previous exercise.)
Exercises
287
(6) Prove Lemma 16.5.3. (Hint: first check that (N ) (M) whenever N
is a difference submodule of M of full rank, by comparing the top exterior
powers of M and N . See also [136, Proposition 1.4.15].)
(7) Let M be a nonzero finite dualizable difference module over R. Let 0 =
M0 Ml = M be any filtration of M by difference submodules,
such that each quotient Mi /Mi1 is free and semistable. Form the convex
polygon of length n associated (as in Definition 4.3.2) with the multiset of
slopes given by log (Mi /Mi1 ), with multiplicity rank(Mi /Mi1 ), for
i = 1, . . . , l. Prove that this polygon lies on or below the corresponding
polygon for the HN filtration with the same endpoint. (Again, this is a
purely formal consequence of the definitions of semistability and the HN
filtration, so the proof is as in the theory of vector bundles.)
Part V
Frobenius Structures
17
Frobenius structures on differential modules
In this part of the book, we bring together the streams of differential algebra
(from Part III) and difference algebra (from Part IV), realizing Dworks fundamental insight that the study of differential modules on discs and annuli is
greatly enhanced by the introduction of Frobenius structures.
This chapter sets the foundations for this study. First, we introduce the
notion of a Frobenius structure on a differential module, with some examples. Then we consider the effect of Frobenius structures on the generic radius
of convergence and obtain the fact that a differential module on a disc has
a full basis of horizontal sections (Dworks trick). We also show that the
existence of a Frobenius structure does not depend on the particular choice of
Frobenius lift; this independence plays an important role in rigid cohomology
(Chapter 23).
Throughout Part V, Hypothesis 14.0.1 remains in force unless explicitly
contravened. In particular, K will by default be a discretely valued complete
nonarchimedean field.
d(t)
(D(m))
dt
(m M).
291
292
dA
d(t)
=
A(N ).
dt
dt
(17.1.1.1)
dA
t d(t)
=
A(N ).
dt
(t) dt
(17.1.2.1)
exp( f f p ) = E( f )E( f p ) E( f p
a1
),
293
unity, let M be the differential module of rank 1 over E with the action of D
on a generator v given by
h
i
i
D(v) =
( p 1)t p 1 v.
i=0
(Note that we have replaced t by t 1 in the formula from Example 9.9.3. ) Let
: E E be an absolute Frobenius lift fixing . On the disc |t 1 | < 1 we
have already constructed the horizontal section
h
1 pi
E p (t 1 )
i
v = exp
t p v,
pi
E p ( t 1 )
i=0
where
E p (t) = exp
pi
t
i=0
pi
E p (t 1 )E p ( t p )
v.
E p (t p )E p ( t 1 )
(17.1.5.1)
In fact, this gives a Frobenius structure defined over E , because the coefficient of v in (17.1.5.1), as a power series in t 1 , has radius of convergence
strictly greater than 1. We will not verify this here; it was shown for p > 2
by Matsuda [169] by an explicit calculation, and for all p by Pulita [185] as
part of a much broader result. We note here that the existence of this Frobenius
structure implies the following strengthening of Theorem 12.7.2.
Theorem 17.1.6. Let b be a positive integer. Let K be a complete nonarchimedean field (not necessarily discretely valued) containing the p h th roots of
unity for all h log p b and having a perfect residue field. Let M denote a
finite differential module of rank 1 on a half-open annulus with open outer
radius 1, which is solvable at 1 with differential slope b. Then there exist
c1 , . . . , cb {0} o
K and nonnegative integers j1 , . . . , jb such that
M M1,c1 Mb,cb
has differential slope 0, for Mi,ci defined as in Theorem 12.7.2.
Proof. Since K is perfect, K contains a subfield K 0 isomorphic to the fraction field of the Witt vectors of K . Let 0 be the Witt vector Frobenius
294
295
lim I R(M F ) = 1,
296
has entries in K t/. (Note that the nilpotency of the singularity is automatic
if the Frobenius lift is of the form described in Remark 17.1.2 and if the
Frobenius structure is defined on the entire disc.)
A nice application of Dworks trick is the following.
Proposition 17.2.5. Let M be a finite differential module over K t0 with
R(M) = 1. (For instance, this holds if M admits a Frobenius structure, by
Dworks trick.) Then H 0 (M) = H 0 (M E ).
Proof. By Theorem 9.6.1 there exists a horizontal basis e1 , . . . , en of M K t 0
n
K {t}. If v H 0 (M K t 0 E ) then when we write v = i=1
vi ei with vi R
we must have d(vi ) = 0 for i = 1, . . . , n. This forces vi K for i = 1, . . . , n,
and so
v (M K t 0 E ) (M K t 0 K {t}) = M K t 0 (E K {t}) = M.
(2 (t) 1 (t))i
i=0
i!
1 (D i (m)).
(17.3.1.1)
By Theorem 17.2.1 and the fact that |2 (t) 1 (t)|1 < 1, this series converges
under | | for (0, 1) sufficiently close to 1 (if this is well-defined for R),
and also under | |1 (if this is well-defined for R).
Corollary 17.3.2. Let 1 , 2 be two Frobenius lifts on R that agree on K .
Then there is a canonical equivalence between the categories of finite free
differential modules over R equipped with Frobenius structures with respect
to i for i = 1, 2; this equivalence is the identity functor on the underlying
difference modules.
297
298
Notes
299
Proposition 17.5.1. Let M be a finite differential module on the open unit disc
for the derivation td/dt with a nilpotent singularity at t = 0. Assume that
either:
(a) the Frobenius lift on K t0 is arbitrary, and M has no singularity at
t = 0; or
(b) the Frobenius lift on K t0 satisfies (t) = t q u for some u
o K t .
Then any Frobenius structure with respect to on the restriction of M to some
annulus with open outer radius 1 is induced by a Frobenius structure on M
itself.
Proof. It suffices to check this for a standard Frobenius lift, as then under either
(a) or (b) we may switch back and forth between the given Frobenius lift and
a standard Frobenius lift using Proposition 17.3.1. Thus we assume hereafter
that (t) = t q .
Let be a Frobenius structure with respect to on the restriction of M
to some annulus with open outer radius 1. By Corollary 17.2.4, M admits a
basis on which the matrix of action of D is a nilpotent matrix N over K . Let
A = iZ Ai t i be the matrix of action of on the same basis; then the commutation relation (17.1.2.1) between and D states that N A + td A/dt =
q A K (N ). Consequently, for each i Z we have N Ai + i Ai = q Ai K (N ).
By Lemma 7.3.5 the operator X N X + i X q X K (N ) on n n matrices
over K has all its eigenvalues equal to i, because N and K (N ) are both nilpotent. Hence Ai = 0 for i = 0, so A GLn (K ). In particular the Frobenius
structure can be defined on the entire open unit disc.
Notes
The statement of Theorem 17.1.6 (whose proof includes that of Theorem
12.7.2) is a slight weakening of [171, Corollaire 2.02], with a similar proof;
the technique goes back to Robba [190]. A complete classification of rank 1
solvable modules on an open annulus with outer radius 1 (and unspecified inner radius) has been given by Pulita [185] and can be formulated in
terms of any LubinTate group; using the formal multiplicative group returns
Theorem 17.1.6.
Theorem 17.2.1 admits the following partial converse: if M is a finite free
differential module over R of rank 1 that is solvable at 1 then there exists
Z p such that M V admits a Frobenius structure for some absolute
Frobenius lift. (Here V is defined as in Example 9.5.2.) For M defined over
F1 the field of analytic elements, this was shown by Chiarellotto and Christol
300
A direct proof of Corollary 17.2.4, which does not employ the transfer
theorem for a nilpotent regular singularity (Theorem 13.7.1), appeared in [134,
3.6].
The proof of Proposition 17.3.1 was taken from [209, Theorem 3.4.10].
Theorem 17.4.3, in the case of an absolute Frobenius lift, is the original form
of the slope filtration theorem suggested, though not explicitly conjectured,
by Tsuzuki in [209]. Its derivation from Theorem 16.4.1 is the same as the
derivation of [125, Theorem 6.12] from [125, Theorem 6.10]. See also [129,
7.1].
Exercises
(1) Verify the unproved assertion in Definition 17.1.3. (Hint: see [191, 7.2]).
(2) Verify the unproved assertions in Remark 17.1.7.
18
Effective convergence bounds
302
One can also give effective bounds for more general regular singularities (as
long as one takes into account the condition of p-adic non-Liouville differences), but the nilpotent case is sufficient for most applications in algebraic
geometry and the bounds are much easier to describe in this case. (The case of
no singularity reproduces the p-adic Cauchy theorem, Proposition 9.3.3.)
(i = 1, 2, . . . ).
(18.1.1.1)
Consequently, U has entries in K t/( p (2m1)/( p1) )0 , as does its inverse.
Proof. Recall that U is determined by the recursion (7.3.6.1):
N0 Ui Ui N0 + iUi =
i
N j Ui j
(i > 0).
j=1
2m2
(1) j i j1 f j (X ).
j=0
303
log p i"
max{1, |N |n1
}.
The first step in the proof of Theorem 18.2.1 is to change basis to reduce
|N | ; this comes at the expense of enlarging K slightly and decreasing
slightly.
Lemma 18.2.2. With notation as in Theorem 18.2.1, suppose that K has value
group R. Then, for any < 1, there exists an invertible n n matrix X over
K t/()0 such that
1
X N X + X 1 t d (X ) 1
dt
|X 1 | 1
|X | |N |n1
.
Proof. Let M be the differential module over K t/0 for the operator td/dt,
with a basis on which D acts via N , and let | | be the supremum norm defined
by this basis. Over the closed disc of radius , M becomes isomorphic to a
successive extension of trivial differential modules. Consequently the generic
radius of convergence of M F is equal to . In particular,
d
= .
p 1/( p1) = |t 1 D|sp,MF
dt F
By Proposition 6.5.6 plus Lemma 8.6.1, we obtain the desired matrix X .
Using Lemma 18.2.2, we wish to prove Theorem 18.2.1 by using Frobenius
antecedents to reduce the index from i to i/ p". One can improve upon this
argument if one has a Frobenius structure on the differential module; see
Lemma 18.3.2 below.
Lemma 18.2.3. With notation as in Theorem 18.2.1, suppose that |N | 1.
Then there exist n n matrices N , U over K t/ p , satisfying the hypotheses
of Theorem 18.2.1, such that
|N | p p,
max{|U j | j : 0 j i} max{|U j | pj : 0 j i/ p}.
304
i
Proof. Define the invertible n n matrix V =
i=0 Vi t over K t/
as follows. Start with V0 = In . Given V0 , . . . , Vi1 , if i 0 (mod p)
i1
j
1 N W +
then put Vi = 0. Otherwise, put W =
j=0 V j t and N W = W
1
W t (d/dt)(W ), and let Vi be the unique solution of the matrix equation
N0 Vi Vi N0 + i Vi = (N W )i .
By induction on i we have |Vi | i 1 for all i, so V is invertible over K t/0 .
Let : K t K t denote the substitution t t p . Put N = V 1 N V +
V 1 t (d/dt)(V ); then N has entries in K t p and | 1 (N )| p 1. Put
U = V 1 U ; then
(U )1 N U + (U )1 t
d
(U ) = N0 = N0 ,
dt
1
(log(1 + t))i
i!
(i = 0, . . . , n 1)
d
1
fi =
f i1
dt
1+t
(i = 1, . . . , n 1),
305
i!
(t t)i
Di .
i!
i=0
log p i"
j
j
j=0 Di, j (t t ) whose coefficients have entries in L, so that |Di, j |
|Di | for j 0 with equality for j = 0. (See Section 9.4 to recall where these
inequalities come from.) We then have
Ti =
i
(1) j
j=0
D j,i j
.
j!
log p j"
p (n1)
log p i"
log p i"
306
Ui t i , A = i=0
Ai t i be n n matrices over
Lemma 18.3.2. Let U = i=0
K t satisfying the following conditions.
(a) The matrix A has entries in K t0 .
(b) We have U0 = In , and the matrix A0 is invertible.
(c) We have U 1 A(U ) = A0 .
Then
max{|U j | : 0 j i} |A1
0 ||A|1 max{|U j | : 0 j i/q"}.
Consequently, for every positive integer i,
logq i
.
|Ui | (|A1
0 ||A|1 )
307
Proof. As noted in Remark 17.1.2, the commutation relation (d) implies that
N A = q A0 (N0 ), which forces N0 to be nilpotent. Put B = U 1 A(U ) =
0 0
i
i=0 Bi t . Then B0 = A0 and N0 B + t (d/dt)(B) = q B(N0 ). Hence
N0 Bi + i Bi = q Bi (N0 ) = Bi A1
0 N0 A0
or
1
1
N0 (Bi A1
0 ) + i(Bi A0 ) = (Bi A0 )N0 .
(18.3.3.1)
(| A1
0 ||A|1 )
logq i"
i
i=0 Ai t be an n n matrix over K t, and let K be chosen to satisfy
the following conditions.
(a) The matrix A has entries in K t0 .
(b) The matrix A0 is invertible.
(c) We have A(v) = v.
Then
max{|v j | : 0 j i} |1 ||A|1 max{|v j | : 0 j i/q"}.
308
f i t i for which
those f = i=0
| fi |
< +;
| f | = sup
(i + 1)
i
note that K t is complete under the norm | | . For = 0 we recover the ring
K t0 of bounded power series. However, K t is not a ring for > 0; rather,
we have
K t1 K t2 K t1 +2 .
Also, K t is stable under d/dt but antidifferentiation carries it into
K t+1 . Put
)
K t+ =
K t .
>
309
n1
i
i=0 m i (log t) , with m i M K t 0 K t, then Theorem 18.2.1 implies
that, for i = 0, . . . , n 1, m i has order of log-growth at most n 1. However,
we suspect that this can be improved to n 1 i.
In the presence of a Frobenius structure, one obtains a much sharper bound,
due to Chiarellotto and Tsuzuki [41, Theorem 6.17]. (One can also formulate
a refinement in the manner of Remark 18.4.4.)
Theorem 18.4.5. Assume that K is discretely valued. Let M be a differential
module of rank n over K t0 for the operator td/dt, equipped with a Frobenius
structure for a qth-power Frobenius lift as in Remark 17.1.2. Then any element
v H 0 (M K t 0 K t) satisfying (v) = v for some K has order
of log-growth at most ( log || s0 )/(log q), where s0 is the least generic
Newton slope of M.
Proof. We may assume that the Frobenius lift is standard, thanks to
Proposition 17.3.1. By then replacing the Frobenius lift by some power, we can
reduce to the case where s0 is a multiple of log p. We can then twist into the
case s0 = 0. By Proposition 14.5.9, we can choose a basis of M such that the
least generic Hodge slope of M is also 0. Then the claim follows immediately
from Theorem 18.3.6.
Remark 18.4.6. Refining a conjecture of Dwork, it was conjectured
by Chiarellotto and Tsuzuki [41] that if M is indecomposable then
Theorem 18.4.5 is optimal. That is, in the notation of Theorem 18.4.5, v should
have order of log-growth exactly ( log || s0 )/(log q); Chiarellotto and
Tsuzuki have proved this for rank(M) 2 [41, Theorem 7.2]. It should be
possible to extend their proof to all cases where log || is less than or equal
to the least Newton slope of M strictly greater than s0 , but it is less clear how
to extend to the general case.
Remark 18.4.7. By contrast, if M does not carry a Frobenius structure then
the order of log-growth of a horizontal section behaves much less predictably.
310
For instance, it need not be rational, and it could have the form + instead of
[41, 5.2].
Ni t i and U = i=0
Ui t i be n n matrices
Theorem 18.5.1. Let N = i=0
over K t satisfying the following conditions for some > 0.
(a) The matrix N has entries in K t/0 .
(b) We have U0 = In .
(c) We have U 1 N U + U 1 t (d/dt)(U ) = N0 .
(d) The matrix N0 has prepared eigenvalues in Z p .
(e) The matrices U and U 1 have entries in K {t/}.
Then, for every positive integer i,
|Ui | i p n+(n1)log p i max{1, |N |n1
}.
Proof. The statement of Lemma 18.2.2 carries over to this situation without change. However, we can carry out the construction in the proof of
Lemma 18.2.3 only if the exponents are divisible by p. Otherwise we must
first enforce this condition by performing up to p 1 shearing transformations,
yielding the weaker bound
max{|U j | j : 0 j i} max{|U j | pj : 0 j (i + p 1)/ p}.
In the case i p (and |N | = 1), we instead perform the shearing transformations and then calculate explicitly as in Proposition 18.1.1 to obtain the
bound
|Ui | i p 2n1 .
If i p h+1 for some h 0 then, after h iterations of the map i (i + p
1)/ p", we end up with a quantity that is at most p. We thus obtain the claimed
bound.
Notes
In the case of no singularities (N0 = 0), the effective bound of Theorem 18.2.1
is due to Dwork and Robba [82], but is slightly stronger: one may replace
p (n1) log p i" with the maximum of | j1 jn1 |1 over j1 , . . . , jn1 Z with
1 j1 < < jn1 i. See also [80, Theorem IV.3.1].
Exercises
311
The general case of Theorem 18.2.1 is due to Christol and Dwork [47],
except that their bound is significantly weaker: it is roughly p c(n1) log p i"
with c = 2 + 1/( p 1). The discrepancy comes from the fact that the role
of Proposition 6.5.6 is played in [47] by an effective version of the cyclic vector theorem, which does not give optimal bounds. As usual, the use of cyclic
vectors also introduces singularities, which must then be removed, leading to
some technical difficulties. See also [80, Theorem V.2.1].
Theorems 18.3.3 and 18.3.6 are original, but they owe a great debt to the
proof of [41, Theorem 7.2]. The main difference is that we prefer to argue in
terms of matrices rather than cyclic vectors.
In the case of no singularities, Proposition 18.4.3 was first proved by Dwork;
it appears in [75] and [76]. (See also [42].) The extension suggested by
Remark 18.4.4 is original; as noted above, the effective bounds in [47] are
not strong enough to imply this.
The archimedean motivation for the study of logarithmic growth comes from
Delignes study of regular singularities [68]. He showed that a differential
module over the ring of germs of meromorphic functions at a point has a regular singularity if and only if the horizontal sections have at worst logarithmic
growth at the singular point.
In the p-adic setting, the theory of logarithmic growth emerged from some
close analysis made by Dwork [75, 76] of the finer convergence behavior
of solutions of certain p-adic differential equations. The subject languished
until the recent work of Chiarellotto and Tsuzuki [41]; inspired by this, Andr
[6] proved a conjecture of Dwork [76, Conjecture 2] analogizing the semicontinuity theorem for Newton polygons (Theorem 15.3.2) to logarithmic
growth.
Theorem 18.5.1 is an improvement on [80, Theorem V.9.1], in which the
2
bound takes the form p (n +cn) log p i" for some constant c. Again, part (though
not all) of the improvement is due to the avoidance of cyclic vectors.
Exercises
(1) Prove that, for 0,
K t = { f K t : lim sup
1
| f |
< }.
( log )
sup{(i + 1) }
i
may be helpful.)
( log )
e
312
(2) Let M be a differential module of rank n over K t for the operator
td/dt, such that the action of D on M/t M is nilpotent. Show that if we
extend the action of td/dt to K t[log t], by setting (td/dt)(log t) = 1,
then M K t K t[log t] admits a basis of horizontal sections, and each
horizontal section has degree in log t bounded by n 1.
19
Galois representations and
differential modules
314
Remark 19.0.1. The reader encountering this material for the first time is
strongly encouraged to assume that K is perfect. For an explanation of how
things get complicated otherwise, see Section 20.5.
Notation 19.0.2. In this chapter we write d and instead of D and for
the actions on a differential or difference module. This is to avoid confusion with another standard usage of the letter D, which will first appear in
Definition 19.1.2.
315
316
EL
= E E E L ,
318
K ((t))
D (V ) = (V K (E sep
K ((t))
)unr )G K ((t))
and this will be a differential module over E of the correct dimension, again
admitting a unit-root Frobenius structure for any Frobenius lift.
319
A1
0
Ai t
i/m
i>0
The matrices U then converge to the identity under the (t 1/m , m K )-adic
topology on o K t 1/m .
Finally, we note that the compatibility N q A(N )A1 = t (d/dt)(A)A1
from (17.1.2.1) implies that having A in the desired form forces N to be
in the desired form as well. Namely, the compatibility first implies that N
320
modulo q involves only nonnegative powers of t 1/m . We then derive the same
conclusion modulo q 2 , q 3 , and so on.
The crux of the proof is the following lemma.
Lemma 19.3.4. Let M be a finite differential module over E admitting a
unit-root Frobenius structure. Suppose that M is c-constant for some c <
p 1/( p1) . Then M is trivial as a differential module.
Proof. We may assume that the Frobenius lift is standard. By Lemma 19.3.3
we may assume that there is a c-constant basis on which , t D act via matrices
A, N over E o K t 1 . Then A and N together represent a finite differential
module over (0,1) K /t equipped with a unit-root Frobenius structure.
Moreover, the commutation relation N A + t (d/dt)(A) = q A(N ) from
1
(17.1.2.1) and the fact that |A0 | = | A1
0 | = |A|1 = |A |1 = 1 together
force N0 = 0 and |N |1 c; consequently, there exists (0, 1) for which
|N | 1.
We now proceed as in the proof of Lemma 18.2.3. As in that proof, we
construct a matrix V, with entries in E K t 1 and with |V In |1 c and
|V In | 1, for which
A = V 1 A(V ),
N = V 1 N V + V 1 t
d
(V )
dt
have entries in E K t p . Let : K t 1 K t 1 be the K -linear substitution t 1 t p . Since is standard, commutes with ; consequently,
A = 1 (A ), N = p 1 1 (N ) again satisfy the commutation relation
N A + t (d/dt)(A ) = q A (N ). Since | A In |1 = |A In |1 c,
we deduce that |N |1 c; we also have |N | p = p|N | p. To choose
u [0, 1] such that p u c1u = 1 or, in other words, u log p + (1 u) log c = 0,
we take
u=
log c
.
log c log p
p log c
log .
log c log p
Since p log c < log c log p < 0, we have log (1 + )( log ) for
some fixed > 0.
Consequently, we can replace A, N by another pair for which the desired
result can be derived equivalently, with arbitrarily small. In particular we
can force < p 1/( p1) . Now let M be the differential module on the disc
321
322
decomposition of D (V ) R provided by Theorem 12.6.4. While the latter was inspired by analogues for meromorphic connections in the complex
analytic setting, the analogy with wild ramification was anticipated somewhat
before it was realized in Theorem 19.4.1.
Remark 19.4.4. Using the integrality properties of subsidiary radii
(Theorem 11.3.2(b)), we may deduce that, for (0, 1) sufficiently close to
1, the product of the subsidiary radii is an integral power of ; this amounts to
verifying the HasseArf theorem for V (the integrality of the Artin conductor).
An interesting corollary of Theorem 19.4.1 is the following.
Proposition 19.4.5. Let M be a finite free differential module, on the open
annulus with inner radius and outer radius , satisfying the Robba condition.
Suppose that for some closed interval [ , ] with < < < there
exists a finite tale extension R of K /t, t/ such that the differential module
M K /t,t/ R is unipotent (i.e., a successive extension of trivial modules).
Then there exists a positive integer m coprime to p such that the pullback of
M along the map t t m is unipotent.
Proof. Note that the conclusion may be checked by replacing K by a finite
Galois extension K ; this follows from the fact that
H 0 (M K K )G K /K = H 0 (M).
We may thus enlarge K and then rescale to ensure that 1 (, ). We may
also assume that R is Galois over K /t, t/ and (possibly after enlarging K
again) geometrically connected. That is, R is connected and remains so after
any further finite enlargement of K .
Put G = R K /t,t/ F1 . Our hypotheses so far ensure that G is a field
so, by Theorem 1.4.9, G carries a unique multiplicative norm extending | |1 .
Since that norm is computed using Newton polygons, we can replace K by a
finite extension to ensure that the multiplicative value group of G is the same
as that of K .
By Lemma 8.6.1 the norm on G can be defined as the supremum norm for
some basis of R K /t,t/ K t, t 1 . After moving and closer to 1, we
can define the same norm for a basis e1 , . . . , en of R itself.
Let A be the matrix of the trace pairing of R in terms of this basis, i.e., Ai j is
the trace of multiplication by ei e j as an endomorphism of R over K /t, t/.
On the one hand, because G has the same multiplicative value group as K , we
must have |det(A)|1 = 1. On the other hand, since R is tale over K /t, t/,
det(A) must be a unit in R. We conclude that det(A) = ct m for some c o
K
Notes
323
Notes
A more detailed survey of most material in this chapter (excluding the unit-root
p-adic local monodromy theorem) is given in the article [128].
Proposition 19.1.5 was originally formulated by Fontaine [89, A1.2.6], in a
slightly less general form and with no reference to differential modules. Our
presentation more closely follows [207, Theorem 4.1.3]; a related result in the
language of F-isocrystals is the theorem of Crew [61, Theorem 2.1].
For K perfect, the rank 1 case of Theorem 19.3.1 is due to Crew [61,
Theorem 4.12], while the general case is due to Tsuzuki [207, Theorem 5.1.1];
however, both arguments can be extended easily to the general case. An alternate exposition was given by Christol [44] in the case of a standard Frobenius
lift, although without discussion of the fact that standardness is not stable under
the replacement of K ((t)) by a finite separable extension. (In our argument
this is treated by the invariance of the c-constant property under a change in
Frobenius lift, as in Definition 19.3.2. The representation-theoretic approach
we used is that taken in [207].) Yet another exposition may be inferred from
[135, Theorem 4.5.2], where a stronger result is proved. (The stronger result
324
is used in the study of semistable reduction for isocrystals; see the notes for
Chapter 23.) All these proofs are similar in form to the proof given here; by
contrast, one may infer a rather different proof by specializing Theorem 20.1.4
below to the unit-root case (Remark 20.1.5).
The fact that one can give a direct quantitative relationship between wild
ramification and the spectral properties of differential modules fits nicely into
a well-developed analogy between the structures of irregular formal meromorphic connections and those of wildly ramified -adic tale sheaves. An early
suggestion along these lines was given by Katz [121] and pursued further by
Terasoma [204]; some further work in this direction is that of Beilinson, Bloch,
and Esnault [13].
Theorem 19.4.1 was originally stated in its present form by Matsuda [169,
Corollary 8.8]; a reformulation in the formalism of Tannakian categories
was given by Andr [5, Complement 7.1.2] as part of his formulation and
proof of the p-adic local monodromy theorem. However, thanks to the p-adic
global index theorem of Christol and Mebkhout [51, Theorem 8.41], [52,
Corollaire 5.012], this could have been deduced from the Grothendieck
OggShafarevich formula for unit-root overconvergent F-isocrystals in rigid
cohomology; such a formula was proved by Tsuzuki [208, Theorem 7.2.2]
(by Brauer induction, as is possible in the -adic case) and Crew [63,
Theorem 5.4] (using the KatzGabber theory of canonical extensions, as
also is possible in the -adic case). For a proof by direct computation and
Brauer induction (not using the ChristolMebkhout index theory), see [128,
Theorem 5.23].
In the case of an imperfect residue field, it was originally suggested by
Matsuda [170] that one should formulate an analogue of Theorem 19.4.1
relating the AbbesSaito conductor (discussed in the notes for Chapter 3) to
a suitable differential analogue. That differential analogue was described by
Kedlaya [133]; a comparison with the AbbesSaito conductor was established
by Chiarellotto and Pulita [40] for one-dimensional representations, and by
Xiao [219] in the general case. This has the side effect of establishing the integrality of the AbbesSaito conductor in the case of equal characteristic, which
is not evident from the original construction.
In mixed characteristic the appropriate analogue of the functor V D(V )
is provided by the theory of (, )-modules; see Chapter 24. The sort of analogue of Theorem 19.4.1 that should exist in mixed characteristic is distinctly
less clear. Even in the case of a perfect residue field, where one is asking for
a differential interpretation of the usual conductor, only a partial answer is
known, by a result of Marmora [167]: one has a differential interpretation of
the conductor once one passes to a suitably large cyclotomic extension. In
Exercises
325
general, one does at least have integrality of the AbbesSaito conductor when
the residue characteristic is odd; see Xiao [220].
Exercises
(1) Suppose that the fixed field K 0 of on K has residue field Fq and the same
value group as K . Prove that E =1 = K 0 . (Hint: reduce to a corresponding
equality of residue fields.)
(2) Prove that any finite free unit-root difference module over E admits a
unique compatible differential structure.
(3) Prove that the kernel of d/dt on E equals K . In particular, for any finite
separable extension L of K ((t)), prove that the kernel of d on E L is the
integral closure of K in E L .
(4) Write out explicitly the isomorphism D (V )
= M implied by Example 19.2.6. (Hint: this module is generated by (1 + p f )1/ p for any lift
f E of f .)
(5) Prove that, for an arbitrary Frobenius lift, the property whereby a finite
differential module over E or E equipped with a unit-root Frobenius structure is c-constant remains invariant under a change in Frobenius lift. (Hint:
in Lemma 17.3.4, if the matrix of action of D/i has norm at most then so
does the matrix of action of D i /i!. Apply this observation to (17.3.1.1).)
20
The p-adic local monodromy theorem
We are now ready to state the capstone theorem of this book, the p-adic local
monodromy theorem. This asserts that a finite differential module over an
annulus carrying a Frobenius structure has finite local monodromy, in the
sense that it becomes unipotent after making a suitable finite tale cover of the
annulus. In this chapter, we give the precise statement of the theorem, illustrate it with an example and a couple of basic applications, and discuss some
technical points that arise if the field K has imperfect residue field. We will
postpone discussion of the proof(s) of the theorem to the next chapter.
We will discuss two broad areas of application of the p-adic local monodromy theorem in later chapters. One of these is in the subject of rigid
cohomology, where the theorem plays a role analogous to the -adic local monodromy theorem of Grothendieck in the subject of tale cohomology (hence
the name); see Chapter 23. The other is in p-adic Hodge theory, where the
theorem clarifies the structure of certain p-adic Galois representations; see
Chapter 24.
Hypothesis 20.0.1. Throughout this chapter and the next, fix a homomorphism
: K ((t)) K ((t)) preserving (but not necessarily fixing) K and carrying
t to t q for some power q of p. We will assume that all the Frobenius lifts
considered are in fact lifts of this particular .
327
328
20.2 An example
It may be worth seeing how Theorem 20.1.4 applies in a particular example.
This example, corresponding to a Bessel equation, was originally considered
by Dwork [77]; the analysis given here is due to Tsuzuki [209, Example 6.2.6]
and was cited in the introduction to [125].
Example 20.2.1. Let p be an odd prime, put K = Q p ( ) with p1 = p,
and take to be the absolute p-power Frobenius morphism. Let M be the
differential module of rank 2 over R with the action of D on a basis e1 , e2
given by
0
t 1
N=
.
2 t 2 0
Then M admits a Frobenius structure; this was shown by explicit calculation in
[77] but can also be derived by consideration of a suitable PicardFuchs module. Define the tamely ramified extension L of K ((t)), and the corresponding
extension E L of E , by adjoining to K ((t)) an element u such that 4u 2 = t;
then put
u = 1 +
(1)n
n=1
(2n)!2
u n K {u}.
(32 )n n!3
u+
!
"
U = ud
(u + ) + 12 u 1 u +
du
u
!
"
ud
(u ) + 12 + u 1 u
du
329
and use it to change basis; then the action of d/du on the new basis e+ , e of
M R R L is via the matrix
1 1
0
2 u + u 2
.
0
12 u 1 u 2 .
That is, M R R L splits into two differential submodules of rank 1. To render these quasiconstant, we must adjoin to L a square root of u (to eliminate
the term 12 u 1 ) and a root of the polynomial z p z = u 1 (which, by
Example 19.2.6, eliminates the terms u 2 ).
By further analysis, carried out in [209, Example 6.2.6], one determines
that in this example the special Newton slopes are 12 log p, 12 log p. By contrast, the generic Newton slopes (obtained by writing M = M E R using
the chosen basis and then extending scalars to E) are 0, log p. This illustrates the conclusion of the semicontinuity theorem for Newton polygons
(Theorem 16.4.6).
rank 1 and Newton slope 0, and M/ 1 (E ) has all its Newton slopes less
than 0.
330
Proof. We may assume that is injective. Let M1 be the first step of the
filtration on M, given by Theorem 15.4.4. Then induces a nonzero element
of H 0 (M1 E E ), which can only exist if M1 is pure of norm 1. Moreover,
M E E E E ,
the last map being multiplication in E . The map is nonzero because
it restricts to on M. (Note that the multiplication map is injective by
Corollary 15.4.6, but we will use the proof technique of that result here rather
than its statement.)
By Lemma 20.3.2 the desired results will follow from the assertion that the
natural inclusions
1 (0) E E ( )1 (0),
1 (E ) E E ( )1 (E )
are surjective. To check this assertion, choose a basis m 1 , . . . , m n of M. Given
n
vi m i with v E .
v ( )1 (0) (resp. v ( )1 (E )), write v = i=1
We induct on the largest integer j for which v j = 0, the case where no such j
exists being a trivial base case.
Define the map : E E as in Lemma 15.4.5, and put M = id M :
M M, so that M = . Then the quantity
M (v/v j ) =
j
i=1
(vi /v j )m i
331
332
333
module over R. Then the spaces H 0 (M), H 1 (M) are finite-dimensional, and
there is a perfect pairing
H 0 (M) H 1 (M ) H 1 (M M ) H 1 (R)
=K
dt
.
t
334
Notes
335
such that ci , c j = 0, the ratio i/j is not a power of p. Now let j be the smallest
integer for which c j = 0. We are done if either j 0, in which case E is
unramified, or j < 0 is not divisible by p, in which case #(|E |/|F |) > 1
and so, by Lemma 3.1.1, E = F .
Suppose that, on the contrary, j is divisible by p. By the choice of P and the
fact that P (P) = y p y for some y F, we must have (c j t j ) = (c j t j )q .
1/q
That is, c j = 1 (c j ) K since K was assumed to be bijective. We can
thus replace P by P + (c j t j )1/ p c j t j to increase j. This process repeats only
finitely many times, after which we may deduce the claim.
Notes
The p-adic local monodromy theorem (Theorem 20.1.4) can be viewed
as an archimedean analogue of the following theorem of Borel [see 194,
Theorem 6.1]. Given a vector bundle with connection over the punctured unit
disc in the complex plane, equipped with a polarized variation of (rational)
Hodge structures, Borels theorem asserts that the monodromy transformation
is forced to be quasiunipotent (i.e., its eigenvalues must be roots of unity). It
appears that the Frobenius structure in the p-adic setting plays the role of the
variation of Hodge structures in the complex analytic realm.
The p-adic local monodromy theorem was originally formulated and proved
only in the absolute case. Then, it is often referred to in the literature as
Crews conjecture because it emerged from the work of Crew [62] on the
finite dimensionality of rigid cohomology with coefficients in an overconvergent F-isocrystal. Crews original conjecture was more limited still, as it
concerned only modules such that the differential and Frobenius structures
were both defined over E ; it was restated in a more geometric form by de
Jong [66]. A closer analysis of Crews conjecture was then given by Tsuzuki
[209], who explained (using Theorem 19.3.1, which he had proved in [207])
how Theorem 20.1.4 in the absolute case would follow from a slope filtration
theorem [209, Theorem 5.2.1].
The original proofs of Theorem 20.1.4 in the absolute case can be
briefly described as follows. The proof of Kedlaya uses slope filtrations
(Theorem 17.4.3) to reduce everything to the unit-root case (Theorem 19.3.1).
Andr and Mebkhout proceed by using properties of solvable differential
modules to reduce everything to the p-adic Fuchs theorem of Christol and
Mebkhout (Theorem 13.6.1). Andrs proof was phrased in the language of
Tannakian categories, whereas Mebkhouts proof was more explicit; the proof
in the relative case that we will give in the next chapter is closely modeled on
Mebkhouts proof.
336
Exercises
337
Exercises
(1) Prove that a finite free differential module M over R is quasiunipotent if
and only if M R R L is unipotent for some L. (Hint: produce a nonzero
quasiconstant submodule of M, e.g., by Galois descent.)
(2) Let 0 M1 M M2 0 be a nonsplit extension of differential
modules over R, where M1 and M2 are both trivial and of rank 1. Prove
that M cannot admit a Frobenius structure that induces unit-root Frobenius
structures on both M1 and M2 . (Hint: use the fact that H 1 (R) = K dt/t.)
(3) Prove Lemma 20.3.1. (Hint: reduce to the case where | A| 1 for some
(0, 1) for which |w| < . Then use |w| to bound the terms of v of
norm greater than some c > 0.)
(4) Complete the proof of Lemma 20.3.4 by proving the following assertion.
Let M be a finite unipotent differential module over R, equipped with a
Frobenius structure whose Newton slopes are all positive. Prove that any
exact sequence 0 M N R 0 in the category of differential
modules with Frobenius structures must split. (Hint: apply Lemma 9.2.3 to
N and then see how the resulting basis behaves under a standard Frobenius
lift.)
(5) In the notation of Theorem 20.3.5, suppose that (t) = t q . Prove that the
equality H 0 (M) = H 0 (M R E) also holds in the category of difference
modules. (Hint: if v H 0 (M) then t 1 D(v) H 0 (M) also.)
(6) Prove Proposition 20.4.2 in the case where M is unipotent. (Hint: use
Lemma 9.2.3.)
(7) Prove Corollary 20.4.3 using Proposition 20.4.2. (Hint: let P be an indecomposable finite free quasiunipotent differential module over R. First,
prove that P is a successive extension of copies of a single irreducible differential module M. Then construct an isomorphism P
= M N , with
N unipotent, by induction on the rank of P.)
21
The p-adic local monodromy theorem: proof
339
340
Lemma 21.2.1. Suppose that N has differential slope 0. Then we may choose
L such that N R F R L ,F is unipotent.
Before giving either proof, we insert a reduction common to both.
Remark 21.2.2. In Lemma 21.2.1 the existence of N forces M to have a nontrivial summand M0 of differential slope 0, such that N appears in M R R F .
It thus suffices to prove that if M is of differential slope 0 then we may choose
L such that M R R L is unipotent.
We first give the proof using p-adic exponents.
First proof of Lemma 21.2.1. As in Remark 21.2.2, we may assume that M
itself is of differential slope 0. By Corollary 17.3.2 we may change to a
standard Frobenius lift. We may then apply Corollary 13.6.2 to deduce that
M R R[t 1/m ] is unipotent for some positive integer m coprime to p.
We next give the proof using slope filtrations.
Second proof of Lemma 21.2.1. Again as in Remark 21.2.2, we may assume
that M itself is of differential slope 0. By Theorem 17.4.3 (and changing the
Frobenius lift, as in Corollary 17.3.2), we may reduce to the case where M is
pure of norm 1 as a difference module.
In other words, Lemma 21.2.1 is reduced to the following claim. Let M be
a finite differential module over E admitting a unit-root Frobenius structure
for a standard Frobenius lift. Suppose that I R(M F ) = 1 for (0, 1)
sufficiently close to 1. Then there exists a positive integer m coprime to p such
that M R R[t 1/m ] is unipotent.
To prove this claim, apply Lemma 19.3.3 to construct a basis of M R
E [t 1/m ] for some m, on which , t D act via matrices A, N such that A, A1 ,
and N have entries in E o K t 1/m . As in the proof of Lemma 19.3.4, this
forces N0 = 0.
For notational simplicity, assume hereafter that m = 1. Let f : K t 1
K t denote the substitution t 1 t. We may then view f (N ) as defining
a differential module M over K t/ for the operator d/dt for some > 1
such that I R(M K t/ F ) = 1. By Theorem 9.6.1 this module is trivial on
the open disc of radius ; this implies that the original module M is trivial, and
in particular unipotent.
Remark 21.2.3. It would be interesting to know whether one can prove
Lemma 21.2.1 without using either p-adic exponents or slope filtrations but
instead simply using the fact that the hypothesis forces M to extend across
341
the entire punctured open unit disc (by the pasting together of Frobenius
antecedents).
342
We may thus extend to the splitting field of P i in such a way that, for each
root a of P i , we have (a) = bq for some root b of P i .
This allows us to define a compositum of ArtinSchreier extensions L to
which extends; this includes all the choices of L 0 made above. For such
an L, N R F R L ,F has differential slope 0, so we can make it trivial by
Lemma 21.2.1. This gives the desired result.
Notes
343
Notes
The approach to Theorem 20.1.4 presented here is modeled on that given by
Mebkhout [171] in the absolute case, except that Mebkhout describes only
344
the first of our two proofs of Lemma 21.2.1. The approach of Andr [5] is
substantively similar but formally different, as it is phrased in the language of
Tannakian categories.
In the course of proving Lemma 21.3.1, we noticed that if is not an
absolute Frobenius lift, then the condition of admitting an extension of is
a highly nontrivial restriction on finite separable extensions of K ((t)). This
phenomenon was noted in the context of p-adic Hodge theory by Berger and
Colmez [18, Proposition 6.2.2].
Exercises
(1) Let G be a finite p-group and let : G GL(V ) be a complex linear
representation of G. Prove that if is nontrivial then either or
has a nontrivial one-dimensional subrepresentation. (Hint: consider the
possible dimensions of irreducible representations of G.)
(2) Let V be a finite-dimensional, not necessarily commutative, F-algebra.
Prove that there exists a finite extension F of F such that V F F is not a
division algebra. (Hint: we may assume that V itself is a division algebra.
Pick any nonzero x V , view x as a linear transformation from V to V
via left multiplication, and subtract an eigenvalue of this transformation.)
Part VI
Areas of Application
22
PicardFuchs modules
In this final part of the book, we touch briefly on some areas of application of
the theory of p-adic differential equations. These chapters are intended more to
inspire than to inform, with statements that are more illustrative than definitive.
In the present chapter we revisit the territory of Chapter 0, briefly discussing
how PicardFuchs modules give rise to differential equations with Frobenius
structures and what this has to do with zeta functions.
348
PicardFuchs modules
c (a + b + 1)z
ab
y
y = 0,
z(1 z)
z(1 z)
349
working in a unit disc not containing any points of S, we obtain a differential module with Frobenius structure over K t0 . In a unit disc containing one
or more points of S, we obtain a differential module with Frobenius structure
only over >0 K /t, t0 . If the disc contains exactly one point of S and the
exponents at that point are all 0, we can also obtain a differential module with
Frobenius structure over K t0 for the derivation td/dt, provided that fixes
that point.
Example 22.2.3. In Example 22.1.2, Theorem 22.2.1 applies directly except
when p = 2. In that case the reduction modulo p fails to be generically
smooth; one must change the defining equation to get a usable description
mod 2.
2 dim(
f )
i+1
i=0
d(t)
dA
=
A(N )
dt
dt
350
PicardFuchs modules
U 1 N U + U 1
dU
=0
dt
Notes
The notion of algebraic de Rham cohomology was introduced by Grothendieck
in [98], where he gave an algebraic construction of the topological de Rham
cohomology of a complex algebraic variety. The construction involves commingling the straightforward cohomology of the de Rham complex (constructed from a module of Khler differentials) with the cohomology of
coherent algebraic sheaves. The subject was further developed by Hartshorne
[105, 106]. A thorough treatment of Grothendiecks theorem can be found
in [97].
In the final footnote to [98], Grothendieck gave what we believe was the
first suggestion that one should consider in general what we call PicardFuchs
Notes
351
modules. His suggestion was based on Manins work on the Mordell conjecture over function fields; partly for this reason, PicardFuchs modules are also
commonly known as GaussManin connections. (This terminology is apparently due to Grothendieck. The name presumably refers to Gausss study of
hypergeometric differential equations, which incorporated the original Picard
Fuchs equation governing the periods of elliptic curves.) A good collection of
onward references can be found in [109].
The fact that a PicardFuchs module has regular singularities with rational
exponents can be proved in several ways, both geometric (Griffiths, Landman)
and arithmetic (Katz). One can also formulate a more abstract version, concerning polarized variations of Hodge structures; this was proved by Borel and
extended by Schmid (see the notes to Chapter 20 for further discussion of this
last point). Again, see [109] for references.
The construction of Frobenius structures on PicardFuchs modules is a
consequence of general results in the theory of p-adic cohomology, which
we discuss further in the next chapter. For the moment we point out that
Theorem 22.2.1 in its stated form can be found in [211, Theorem 3.3.1].
Lauders strategy for computing zeta functions (also called the deformation method) was introduced in [154]; it has been worked out in detail for
hyperelliptic curves by Hubrechts [116]. Hubrechts implemented the resulting
algorithm in version 2.14 of the computer algebra system Magma. A version
for hypersurfaces was described by Gerkmann [96].
23
Rigid cohomology
It has been suggested several times in this book that the study of p-adic
differential equations is deeply connected with the theory of p-adic cohomology for varieties over finite fields. In particular, the Frobenius structures arising
on PicardFuchs modules, discussed in the previous chapter, appear within this
theory.
In this chapter, we introduce a little of the theory of rigid p-adic cohomology, as developed by Berthelot and others. In particular, we illustrate the role
played by the p-adic local monodromy theorem in a fundamental finiteness
problem in the theory.
353
Remark 23.1.3. Crew [62] showed that in this construction H i computes the
rigid cohomology of M, Hci computes the rigid cohomology with compact
i computes some sort of local cohomology at .
supports, and Hloc
Crews main result in this setting was the following.
i (A1 , M)
Theorem 23.1.4 (Crew). The spaces H i (A1k , M), Hci (A1k , M), Hloc
K
are all finite-dimensional over K . Moreover, the Poincar pairings
loc
loc
are perfect.
The key ingredient is the fact that, by the p-adic local monodromy theorem
(Theorem 20.1.4), M A R is quasiunipotent which implies the finiteness
i (A1 , M). This further implies the finite dimensionalities, except for
of Hloc
k
1
Hc (A1k , M) and H 1 (A1k , M); however, these are related by a map with finitedimensional kernel and cokernel. Moreover, they carry incompatible topologies: the former is a Frchet space while the latter is dual to a Frchet space.
This incompatibility can be resolved only if both spaces are finite-dimensional.
354
Rigid cohomology
Motivated by the work of Dwork and also by related work of Monsky and
Washnitzer, Grothendieck proposed a method of constructing an analogue of
algebraic de Rham cohomology in positive characteristic. This was developed
by Berthelot and Ogus into the theory of crystalline cohomology. An important example of this is the fact that if X is a smooth proper scheme over Z p
then the algebraic de Rham cohomology of X carries a Frobenius action that
computes the zeta function of the special fibre. (This generalizes to explain
the results on Frobenius structures and zeta functions discussed in the previous
chapter.)
One defect of crystalline cohomology is that it gives a coherent cohomology theory only for schemes over a finite field that are smooth and proper.
By contrast, the work of Monsky and Washnitzer had given a good theory
for smooth affine schemes. To fuse these theories, Berthelot introduced the
theory of rigid cohomology as well as a theory of locally constant coefficient
objects, overconvergent F-isocrystals. The example in the previous paragraph
demonstrates the computation of the cohomology of coefficients on the affine
line; a theory of cohomology with compact supports, and a local cohomology
theory, are also demonstrated by Crews construction.
For trivial coefficients, it was shown by Berthelot that rigid cohomology has
all the desired properties of a Weil cohomology: finite dimensionality, Poincar
duality, Knneth formula, cycle class maps, the Lefschetz trace formula for
Frobenius morphisms, etc. These were extended to nonconstant coefficients
by Kedlaya, using a relative version of Theorem 23.1.4.
Berthelot also suggested a theory of constructible coefficients, based on
ideas from the theory of algebraic D-modules. (These are modules over a ring
of differential operators; they are the natural coefficient objects in algebraic de
Rham cohomology.) Recent work of Caro, Kedlaya, and Tsuzuki has shown
that they form a good theory of coefficients, enjoying the same formal properties as their -adic tale counterparts. For example, one can execute a proof of
the Weil conjectures entirely using p-adic cohomology [132].
Notes
355
Notes
Crews work, and subsequent work that builds on it (e.g., [131]), makes essential use of nonarchimedean functional analysis, as was evident in the discussion
of Theorem 23.1.4. We recommend Schneiders book [195] as a user-friendly
introduction to this topic.
For general surveys of the subject of p-adic cohomology, we recommend [117] (for crystalline cohomology only) and [141] (broader but more
advanced). Also useful at this level of generality is Berthelots original article
outlining the theory of rigid cohomology [22].
For the basics of crystalline cohomology, see [27] and references within
(largely to Berthelots thesis [21]). For the work of Monsky and Washnitzer
356
Rigid cohomology
which motivated it, in addition to the original papers [172, 173, 174] there is
also a useful survey by van der Put [213].
Until recently no comprehensive introductory text on rigid cohomology was
available. That state of affairs was remedied by the appearance of the book
of Le Stum [157]. Berthelots own attempt at a foundational treatise remains
incomplete, but what does exist [23] may also be helpful. Some of the function
of a foundational text has been served by Berthelots articles concerning finite
dimensionality [24] and Poincar duality [25].
For coefficients in an overconvergent F-isocrystal, the basic properties
(finite dimensionality, Poincar duality, the Knneth formula) are proved in
[131] using techniques similar to those of Crew in [62]. However, these
techniques do not appear to suffice for the construction of a full coefficient
theory. For this, one needs a higher-dimensional analogue of Theorem 20.1.4,
called the semistable reduction theorem for overconvergent F-isocrystals. This
asserts that an overconvergent F-isocrystal on a noncomplete variety can
always be extended to a logarithmic isocrystal on some compactification, after
pulling back along a generically finite cover; such a result allows reductions
to arguments about (logarithmic) crystalline cohomology. (Except for the part
of the theorem that concerns the cover, this is analogous to Delignes theory
of canonical extensions in [68].) See [134, 137, 140, 143] for the proof; the
original results from the present book feature prominently in [143].
Berthelot wrote a useful survey of his theory of arithmetic D-modules [26].
It is this theory that has been developed by Caro (in part jointly with Tsuzuki)
into a full theory of p-adic coefficients. This work is quite expansive and
requires a summary more detailed than we can provide here; see [35, 36] for a
representative sample.
On the subject of machine calculations, as a companion to our original paper
on hyperelliptic curves [124] we recommend Edixhovens course notes [83].
Some discussion is also included in [93, Chapter 7]. We gave a condensed
summary of the general approach in [127]. For more on the role of elliptic and
hyperelliptic curves in cryptography (including the relevance of the problem of
the machine computation of zeta functions), the standard first reference is [55]
even though it is some years behind the frontiers of this fast-moving subject.
(For instance, it predates the growing area of pairing-based cryptography.)
24
p-adic Hodge theory
For our last application we turn to the subject of p-adic Hodge theory. Recall
that in Chapter 19 we described a nonabelian ArtinSchreier construction,
giving an equivalence of categories between continuous representations of the
absolute Galois group of a positive characteristic local field on a p-adic vector space and certain differential modules with Frobenius structures. In this
chapter, we describe an analogous construction for the Galois group of a
mixed-characteristic local field. We also mention a couple of applications of
this construction.
Hypothesis 24.0.1. Throughout this chapter, let K be a finite extension of Q p ,
let V be a finite-dimensional Q p -vector space, and let : G K GL(V ) be a
continuous homomorphism for the p-adic topology on V .
358
of A. (See the notes for Chapter 14 for further discussion of Witt vectors.)
can be uniquely written as the sum
Definition 24.1.5. Each element of A
n
in which a sequence xm = n=0 p [xm,n ] converges to zero if, for each n,
v(xm,n ) + as m +. Let AQ p be the completion of Z p [([] 1) ]
for the weak topology; as a topological ring, it is isomorphic to the ring oE
in A
defined over the base field Q p with its own weak topology. It is also -stable
because ([]) = [] p .
Definition 24.1.6. Let A be the completion of the maximal unramified
Put
extension of AQ p , viewed as a subring of A.
A K = A HK ,
where the right-hand side makes sense because we have chosen all the rings so
far in a functorial fashion, so that they indeed carry a G K -action. Note that A K
can be written as a ring of the form oE , but with coefficients in F rather than
in Q p .
Definition 24.1.7. For any ring denoted with a boldface A so far, define the
corresponding ring with A replaced by B by tensoring over Z p with Q p . For
24.2 (, )-modules
359
24.2 (, )-modules
We are now ready to describe the mechanism, introduced by Fontaine, for converting Galois representations into modules over various rings equipped with
much simpler group actions.
Definition 24.2.1. Recall that V is a finite-dimensional vector space equipped
with a continuous G K -action. Put
D(V ) = (V Q p B) HK ;
by the HilbertNoether theorem, D(V ) is a finite-dimensional B K -vector
space, and the natural map D(V ) B K B V Q p B is an isomorphism.
Since we have taken only HK -invariants, D(V ) retains a semilinear action of
G K /HK = K ; it also inherits an action of from B. That is, D(V ) is a
(, )-module over B K , i.e., a finite free B K -module equipped with semilinear and K -actions that commute with each other. It is also tale, which is to
say the -action is tale (unit-root), because, as in Definition 19.2.4, one can
find a G K -invariant lattice in V .
Theorem 24.2.2 (Fontaine). The functor D, from the category of continuous
representations of G K on finite-dimensional Q p -vector spaces to the category
of tale (, )-modules over B K , is an equivalence of categories.
Proof. The inverse functor is
V = (D(V ) B K B)=1 .
The argument is similar to that in the proof of Proposition 19.1.5; see [89].
In much the same way that Proposition 19.1.5 was refined by
Theorem 19.3.1, Theorem 24.2.2 was refined by Cherbonnier and Colmez as
follows [38]. The big difference is that no additional restriction on the Galois
representations is imposed.
Definition 24.2.3. Let BQ p be the image of the bounded Robba ring E , under
the identification of E (having coefficients in Q p ) with BQ p , sending t to []
1. Let BK be the integral closure of BQ p in B K . Again, BK carries the actions
of and K .
n
Definition 24.2.4. Let A be the set of x =
n=0 p [x n ] A such that
lim inf{v(xn )/n} > .
n
Define
D (V ) = (V Q p B ) HK ;
it is an tale (, )-module over BK .
360
where R is the Robba ring over Q p ; the subscript rig indicates rigid in
the sense of rigid analytical geometry. Note that Brig,K admits continuous
extensions (for the LF-topology) of the actions of and K . Define
(V ) = D (V ) B Brig,K ;
Drig
K
Remark 24.2.9. The principal new content in Theorem 24.2.8 is that the base
extension functor from tale -modules over E to tale -modules over R is
fully faithful; this follows from Corollary 16.3.4. The essential surjectivity of
the functor is almost trivial, since the taleness of the -action is defined over
the Robba ring by base extension from E (Definition 16.3.1). One need only
check that the K -action also descends to any tale lattice, but this is easy: the
proof is similar to that of Lemma 17.4.2.
361
0 C ( K , D(V )) C ( K , D(V )) 0,
where C ( K , D(V )) is the usual cochain complex for computing Galois
cohomology (as in [199, I.2.2]). One might think of this as the monoid
cohomology of K Z0 acting on D(V ).
Theorem 24.3.3. The cohomology of the Herr complex computes the Galois
cohomology of V .
Proof. For B K , the desired result was established by Herr [111]. The argument
proceeds in two steps. One takes first the cohomology of the ArtinSchreier
sequence
1
0 Q p B B 0,
after tensoring with V . This reduces the claim to the fact that the inflation
homomorphisms
H i ( K , D(V )) H i (G K , V Q p B)
are bijections; this is proved by adapting a technique introduced by Sen.
For BK and Brig,K , the desired result was established by Liu [160]; his argument proceeds by comparison with the original Herr complex rather than by
imitation of the above argument, though one could probably do that also.
362
Remark 24.3.4. As in [111, 160], one can make Theorem 24.3.3 more precise.
For instance, the construction of Galois cohomology is functorial; there is also
an interpretation in the Herr complex of the cup product in cohomology.
Remark 24.3.5. One can also use the Herr complex to recover Tates fundamental theorems on Galois cohomology (finite dimensionality, the Euler
Poincar characteristic formula, local duality). This was done by Herr in [112].
( pm ) = pm .
For K sufficiently close to 1, we may compute
=
log
log ( )
as an endomorphism of D(V ), using the power series for log(1+ x). The result
does not depend on .
Remark 24.4.2. If one views K as a one-dimensional p-adic Lie group over
Z p then is the action of the corresponding Lie algebra.
Definition 24.4.3. Note that acts on Brig,K with respect to
f [] log[]
df
.
d[]
363
364
such objects using the Herr complex; it has been shown by Liu [160] that the
analogues of Tates theorems (see Remark 24.3.5) still hold. These functors
(V ) is a
Definition 24.5.1 (Colmez). We say that V is trianguline if Drig
Notes
Our presentation here was largely a summary of Bergers [15], which we highly
recommend. For detailed study, the recent notes by Brinon and Conrad for the
2009 Clay Mathematics Institute summer school [32] may be of considerable
value.
A variant of the theory of (, )-modules was introduced by Kisin [147],
n
using the Kummer tower K ( p 1/ p ) instead of the cyclotomic tower K ( pn ).
This leads to certain advantages, particularly when studying crystalline representations. For instance, Kisin was able to use his construction to establish
some classification theorems for finite flat group schemes and for p-divisible
groups, as conjectured by Breuil. Kisins work was based on an earlier paper
of Berger [17]; both of these use slope filtrations (as in Theorem 16.4.1)
to recover a theorem of Colmez and Fontaine classifying semistable Galois
representations in terms of certain linear algebraic data.
After [14] had appeared, Fontaine succeeded in giving a direct proof of
Corollary 24.4.5 that did not involve p-adic differential equations; see [91].
References
[1] A. Abbes and T. Saito, Ramification of local fields with imperfect residue fields, Amer. J.
Math. 124 (2002), 879920.
[2] A. Abbes and T. Saito, Ramification of local fields with imperfect residue fields, II, Doc.
Math. extra vol. (2003), 572.
[3] T. G. Abbott, K. S. Kedlaya, and D. Roe, Bounding Picard numbers of surfaces using
p-adic cohomology, in Arithmetic, Geometry and Coding Theory (AGCT 2005), Societ
Mathmatique de France, 2009, pp. 125159.
[4] Y. Andr, Diffrentielles non commutatives et thorie de Galois diffrentielle ou aux
diffrences, Ann. Sci. cole Norm. Sup. (4) 34 (2001), 685739.
[5] Y. Andr, Filtrations de type HasseArf et monodromie p-adique, Invent. Math. 148 (2002),
285317.
[6] Y. Andr, Dworks conjecture on the logarithmic growth of solutions of p-adic differential
equations, Compos. Math. 144 (2008), 484494.
[7] Y. Andr and F. Baldassarri, De Rham Cohomology of Differential Modules on Algebraic
Varieties, Progress in Math. 189, Birkhuser, 2001.
[8] Y. Andr and L. Di Vizio, q-difference equations and p-adic local monodromy, Astrisque
296 (2004), 55111.
[9] M. F. Atiyah and I. G. Macdonald, Introduction to Commutative Algebra, Addison-Wesley,
Reading, 1969.
[10] F. Baldassarri, Continuity of the radius of convergence of differential equations on p-adic
analytic curves, arXiv:0809.2479v3 (2009).
[11] F. Baldassarri and L. Di Vizio, Continuity of the radius of convergence of p-adic differential
equations on Berkovich analytic spaces, arXiv:0709.2008v3 (2008).
[12] M. Baker and R. Rumely, Potential Theory and Dynamics on the Berkovich Projective Line,
Math. Surveys and Monographs 159, Amer. Math. Soc., 2010.
[13] A. Beilinson, S. Bloch, and H. Esnault, -factors for GaussManin determinants, Moscow
J. Math. 2 (2002), 477532.
[14] L. Berger, Reprsentations p-adiques et quations diffrentielles, Invent. Math. 148 (2002),
219284.
[15] L. Berger, An introduction to the theory of p-adic representations, in Geometric Aspects of
Dwork Theory, Vols. I, II, de Gruyter, Berlin, 2004, 255292.
[16] L. Berger, Construction de (, )-modules: reprsentations p-adiques et B-paires, Algebra
and Num. Theory 2 (2008), 91120.
[17] L. Berger, quations diffrentielles p-adiques et (, N )-modules filtrs, Astrisque 319
(2008), 1338.
[18] L. Berger and P. Colmez, Familles de reprsentations de de Rham et monodromie p-adique,
Astrisque 319 (2008), 303337.
365
366
References
[19] V. G. Berkovich, Spectral Theory and Analytic Geometry over Non-Archimedean Fields,
Math. Surveys and Monographs 33, Amer. Math. Soc., Providence, 1990.
[20] V. G. Berkovich, tale cohomology for non-Archimedean analytic spaces, Publ. Math. IHS
78 (1993), 5161.
[21] P. Berthelot, Cohomologie Cristalline des Schmas de Caractristique p > 0, Lecture Notes
in Math. 407, Springer-Verlag, Berlin, 1974.
[22] P. Berthelot, Gomtrie rigide et cohomologie des varits algbriques de caractristique
p, in Introductions aux cohomologies p-adiques (Luminy, 1984), Mm. Soc. Math. France
(N.S.) 23 (1986), 732.
[23] P. Berthelot, Cohomologie rigide et cohomologie rigide support propre. Premire partie, Prpublication IRMAR 96-03, available at http://perso.university-rennes1.fr/pierre.
berthelot/.
[24] P. Berthelot, Finitude et puret cohomologique en cohomologie rigide (with an appendix in
English by A. J. de Jong), Invent. Math. 128 (1997), 329377.
[25] P. Berthelot, Dualit de Poincar et formule de Knneth en cohomologie rigide, C.R. Acad.
Sci. Paris 325 (1997), 493498.
[26] P. Berthelot, Introduction la thorie arithmtique des D-modules, cohomologies p-adiques
et applications arithmtiques, II, Astrisque 279 (2002), 180.
[27] P. Berthelot and A. Ogus, Notes on Crystalline Cohomology, Princeton University Press,
Princeton, 1978.
[28] R. Bhatia, Matrix Analysis, Graduate Texts in Math. 169, Springer-Verlag, New York,
1997.
[29] J. M. Borger, Conductors and the moduli of residual perfection, Math. Annalen 329
(2004), 130.
[30] S. Bosch, Lectures on formal and rigid geometry, preprint (2008) available at
http://wwwmath1.uni-muenster.de/sfb/about/publ/bosch.html.
[31] S. Bosch, U. Gntzer, and R. Remmert, Non-Archimedean Analysis, Grundlehren der Math.
Wiss. 261, Springer-Verlag, Berlin, 1984.
[32] O. Brinon and B. Conrad, p-adic Hodge theory, notes from the 2009 Clay Mathematics
Institute summer school, available online at http://math.stanford.edu/ conrad/.
[33] S. Boucksom, C. Favre, and M. Jonsson, Valuations and plurisubharmonic singularities,
Publ. Res. Inst. Math. Sci. 44 (2008), 449494.
[34] K. Buzzard and F. Calegari, Slopes of overconvergent 2-adic modular forms, Compos. Math.
141 (2005), 591604.
[35] D. Caro, F-isocristaux surconvergents et surcohrence diffrentielle, Invent. Math. 170
(2007), 507539.
[36] D. Caro, Dvissages des F-complexes de D-modules arithmtiques en F-isocristaux
surconvergents, Invent. Math. 166 (2006), 397456.
[37] N. L. Carothers, A Short Course on Banach Space Theory, Cambridge University Press,
2004.
[38] F. Cherbonnier and P. Colmez, Reprsentations p-adiques surconvergentes, Invent. Math.
133 (1998), 581611.
[39] B. Chiarellotto and G. Christol, On overconvergent isocrystals and F-isocrystals of rank
one. Compos. Math. 100 (1996), 7799.
[40] B. Chiarellotto and A. Pulita, Arithmetic and Differential Swan conductors of rank one
representations with finite local monodromy, arXiv:0711.0701v1 (2007); to appear in Amer.
J. Math.
[41] B. Chiarellotto and N. Tsuzuki, Logarithmic growth and Frobenius filtrations for solutions
of p-adic differential equations, J. Inst. Math. Jussieu 8 (2009), 465505.
[42] G. Christol, Modules Diffrentiels et Equations Diffrentielles p-adiques, Queens Papers
in Pure and Applied Math. 66, Queens University, Kingston, 1983.
References
367
[43] G. Christol, Un thorme de transfert pour les disques singuliers rguliers, in Cohomologie
p-adique, Astrisque 119120 (1984), 151168.
[44] G. Christol, About a Tsuzuki theorem, in p-adic Functional Analysis (Ioannina, 2000),
Lecture Notes in Pure and Appl. Math. 222, Dekker, New York, 2001, 6374.
[45] G. Christol, Thirty years later, in Geometric Aspects of Dwork Theory, Vol. I, de Gruyter,
Berlin, 2004, 419436.
[46] G. Christol, Le Thorme de Turritin p-adique, in preparation; draft available at http://
people.math.jussieu.fr/christol.
[47] G. Christol and B. Dwork, Effective p-adic bounds at regular singular points, Duke Math.
J. 62 (1991), 689720.
[48] G. Christol and B. Dwork, Differential modules of bounded spectral norm, in p-adic Methods in Number Theory and Algebraic Geometry, Contemp. Math. 133, Amer. Math. Soc.,
Providence, 1992.
[49] G. Christol and B. Dwork, Modules diffrentielles sur les couronnes, Ann. Inst. Fourier 44
(1994), 663701.
[50] G. Christol and Z. Mebkhout, Sur le thorme de lindice des quations diffrentielles, II,
Annals of Math. 146 (1997), 345410.
[51] G. Christol and Z. Mebkhout, Sur le thorme de lindice des quations diffrentielles, III,
Annals of Math. 151 (2000), 385457.
[52] G. Christol and Z. Mebkhout, Sur le thorme de lindice des quations diffrentielles, IV,
Invent. Math. 143 (2001), 629672.
[53] R. C. Churchill and J. J. Kovacic, Cyclic vectors, in Differential Algebra and Related Topics
(Newark, NJ, 2000), World Scientific, River Edge NJ, 2002, 191218.
[54] D. Clark, A note on the p-adic convergence of solutions of linear differential equations,
Proc. Amer. Math. Soc. 17 (1966), 262269.
[55] H. Cohen and G. Frey (eds.), Handbook of Elliptic and Hyperelliptic Curve Cryptography,
Discrete Math. and its Applications 34, Chapman & Hall/CRC, 2005.
[56] R. M. Cohn, Difference Algebra, John Wiley & Sons, New York, London, Sydney, 1965.
[57] R. F. Coleman, Dilogarithms, regulators and p-adic L-functions, Invent. Math. 69 (1982),
171208.
[58] P. Colmez, Reprsentations triangulines de dimension 2, Astrisque 319 (2008), 213258.
[59] B. Conrad, Several approaches to non-archimedean geometry, in M. Baker et al.
(eds.), p-adic Geometry: Lectures from the 2007 Arizona Winter School, University Lecture Series 45, Amer. Math. Soc., 2008.
[60] F. T. Cope, Formal solutions of irregular linear differential equations, II, Amer. J. Math. 58
(1936), 130140.
[61] R. Crew, F-isocrystals and p-adic representations, in Algebraic Geometry (Brunswick,
Maine, 1985), Proc. Symp. Pure Math. 46, Part 2, Amer. Math. Soc., Providence, 1987,
111138.
[62] R. Crew, Finiteness theorems for the cohomology of an overconvergent isocrystal on a curve,
Ann. Sci. c. Norm. Sup. 31 (1998), 717763.
[63] R. Crew, Canonical extensions, irregularities, and the Swan conductor, Math. Ann. 316
(2000), 1937.
[64] N. E. Csima, NewtonHodge filtration for self-dual F-crystals, arXiv:0706/2530v1 (2007).
[65] A. J. de Jong, Homomorphisms of BarsottiTate groups and crystals in positive characteristic, Invent. Math. 134 (1998), 301333; erratum, ibid. 138 (1999), 225.
[66] A. J. de Jong, BarsottiTate groups and crystals, in Proc. Int. Congress of Mathematicians,
Vol. II (Berlin, 1998), Doc. Math. Extra Vol. II (1998), 259265.
[67] A. J. de Jong and F. Oort, Purity of the stratification by Newton polygons, J. Amer. Math.
Soc. 13 (2000), 209241.
[68] P. Deligne, Equations Diffrentielles Points Singuliers Rguliers, Lecture Notes in Math.
163, Springer-Verlag, Berlin, 1970.
368
References
[69] P. Deligne, La conjecture de Weil, II, Publ. Math. IHS 52 (1980), 137252.
[70] P. Deligne and N. Katz (eds.), Groupes de Monodromie en Gomtrie Algbrique II, Proc.
Sminaire de Gomtrie Algbrique du Bois-Marie 19671969 (SGA 7 II), Lecture Notes in
Math. 340, Springer-Verlag, Berlin, New York, 1973.
[71] M. Demazure, Lectures on p-divisible Groups, Lecture Notes in Math. 302, SpringerVerlag, New York, 1972.
[72] J. Denef and F. Vercauteren, An extension of Kedlayas algorithm to hyperelliptic curves in
characteristic 2, J. Cryptology 19 (2006), 125.
[73] L. Di Vizio, Local analytic classification of q-difference equations with |q| = 1, J. Noncommutative Geom. 3 (2009), 125149.
[74] B. Dwork, p-adic cycles, Publ. Math. IHS 37 (1969), 27115.
[75] B. Dwork, On p-adic differential equations, II. The p-adic asymptotic behavior of solutions
of ordinary linear differential equations with rational function coefficients, Ann. Math. (2)
98 (1973), 366376.
[76] B. Dwork, On p-adic differential equations, III. On p-adically bounded solutions of ordinary linear differential equations with rational function coefficients, Invent. Math. 20 (1973),
3545.
[77] B. Dwork, Bessel functions as p-adic functions of the argument, Duke Math. J. 41 (1974),
711738.
[78] B. Dwork, Lectures on p-adic Differential Equations, Grundlehren der Math. Wissenschaften 253, Springer-Verlag, New York, 1982.
[79] B. M. Dwork, On exponents of p-adic differential modules, J. reine angew. Math. 484
(1997), 85126.
[80] B. Dwork, G. Gerotto, and F. Sullivan, An Introduction to G-Functions, Ann. Math. Studies
133, Princeton University Press, Princeton, 1994.
[81] B. Dwork and P. Robba, On ordinary linear p-adic differential equations, Trans. Amer. Math.
Soc. 231 (1977), 146.
[82] B. Dwork and P. Robba, Effective p-adic bounds for solutions of homogeneous linear
differential equations, Trans. Amer. Math. Soc. 259 (1980), 559577.
[83] B. Edixhoven, Point counting after Kedlaya, course notes (2006) available at http://
www.math.leidenuniv.nl/ edix/oww/mathofcrypt/carls_edixhoven/kedlaya.pdf.
[84] D. Eisenbud, Commutative Algebra with a View Toward Algebraic Geometry, Graduate
Texts in Math. 150, Springer-Verlag, New York, 1995.
[85] P. Enflo, A counterexample to the approximation problem in Banach spaces, Acta Math. 130
(1973), 309317.
[86] G. Faltings, Coverings of p-adic period domains, research announcement (2007) available
at http://www.mpim-bonn.mpg.de/preprints/.
[87] C. Favre and M. Jonsson, The Valuative Tree, Lecture Notes in Math. 1853, Springer-Verlag,
Berlin, 2004.
[88] C. Favre and M. Jonsson, Valuations and multiplier ideals, J. Amer. Math. Soc. 18 (2005),
655684.
[89] J.-M. Fontaine, Reprsentations p-adiques des corps locaux, I, in The Grothendieck
Festschrift, Vol. II, Progress in Math. 87, Birkhuser, Boston, 1990, 249309.
[90] J.-M. Fontaine, Reprsentations p-adiques semi-stables, in Priodes p-adiques (Bures-surYvette, 1988), Astrisque 23 (1994), 113184.
[91] J.-M. Fontaine, Reprsentations de de Rham et reprsentations semi-stables, Orsay preprint
number 2004-12, available online at http://www.math.u-psud.fr/ biblio/pub/2004/.
[92] J.-M. Fontaine and J.-P. Wintenberger, Le corps de normes de certaines extensions
algbriques de corps locaux, C.R. Acad. Sci. Paris Sr. A, B 288 (1979), A367A370.
[93] J. Fresnel and M. van der Put, Rigid Analytic Geometry and its Applications, Progress in
Mathematics 218, Birkhuser, Boston, 2004.
References
369
370
References
[121] N. M. Katz, On the calculation of some differential Galois groups, Invent. Math. 87 (1987),
1361.
[122] N. M. Katz, A simple algorithm for cyclic vectors, Amer. J. Math. 109 (1987), 6570.
[123] N. M. Katz and T. Oda, On the differentiation of de Rham cohomology classes with respect
to parameters, J. Math. Kyoto Univ. 8 (1968), 199213.
[124] K. S. Kedlaya, Counting points on hyperelliptic curves using MonskyWashnitzer cohomology, J. Ramanujan Math. Soc. 16 (2001), 323338; errata, ibid. 18 (2003), 417418.
[125] K. S. Kedlaya, A p-adic local monodromy theorem, Annals of Math. 160 (2004), 93184.
[126] K. S. Kedlaya, Full faithfulness for overconvergent F-isocrystals, in Geometric Aspects of
Dwork Theory (Vol. II), de Gruyter, Berlin, 2004, 819835.
[127] K. S. Kedlaya, Computing zeta functions via p-adic cohomology, in Algorithmic Number
Theory (ANTS VI), Lecture Notes in Comp. Sci. 3076, Springer-Verlag, 2004, 117.
[128] K. S. Kedlaya, Local monodromy of p-adic differential equations: an overview, Int. J.
Number Theory 1 (2005), 109154; errata at http://math.mit.edu/ kedlaya/papers/.
[129] K. S. Kedlaya, Slope filtrations revisited, Documenta Math. 10 (2005), 447525; errata, ibid.
12 (2007), 361362.
[130] K. S. Kedlaya, Frobenius modules and de Jongs theorem, Math. Res. Lett. 12 (2005),
303320.
[131] K. S. Kedlaya, Finiteness of rigid cohomology with coefficients, Duke Math. J. 134 (2006),
1597.
[132] K. S. Kedlaya, Fourier transforms and p-adic Weil II, Compos. Math. 142 (2006),
14261450.
[133] K. S. Kedlaya, Swan conductors for p-adic differential modules, I: A local construction,
Algebra and Number Theory 1 (2007), 269300.
[134] K. S. Kedlaya, Semistable reduction for overconvergent F-isocrystals, I: Unipotence and
logarithmic extensions, Compos. Math. 143 (2007), 11641212.
[135] K. S. Kedlaya, The p-adic local monodromy theorem for fake annuli, Rend. Sem. Math.
Padova 118 (2007), 101146.
[136] K. S. Kedlaya, Slope filtrations for relative Frobenius, Astrisque 319 (2008), 259301.
[137] K. S. Kedlaya, Semistable reduction for overconvergent F-isocrystals, II: A valuationtheoretic approach, Compos. Math. 144 (2008), 657672.
[138] K. S. Kedlaya, Swan conductors for p-adic differential modules, II: Global variation,
arXiv:0705.0031v3 (2008); to appear in J. Institut Math. Jussieu.
[139] K. S. Kedlaya, Effective p-adic cohomology for cyclic cubic threefolds, preprint (2008)
available at http://math.mit.edu/ kedlaya/papers/.
[140] K. S. Kedlaya, Semistable reduction for overconvergent F-isocrystals, III: Local semistable
reduction at monomial valuations, Compos. Math. 145 (2009), 143172.
[141] K. S. Kedlaya, p-adic cohomology, in Algebraic Geometry (Seattle 2005), Proc. Symp. Pure
Math. 80, Amer. Math. Soc., 2009, 667684.
[142] K. S. Kedlaya, Some new directions in p-adic Hodge theory, J. Thorie Nombres Bordeaux
21 (2009), 285300.
[143] K. S. Kedlaya, Semistable reduction for overconvergent F-isocrystals, IV: Local semistable
reduction at nonmonomial valuations, arXiv:0712.3400v3 (2009).
[144] K. S. Kedlaya, Good formal structures for flat meromorphic connections, I: Surfaces,
arXiv:0811.0904v4(2009); to appear in Duke Math. J.
[145] K. S. Kedlaya and Liang Xiao, Differential modules on p-adic polyannuli, J. Inst. Math.
Jussieu 9 (2010), 155201.
[146] M. Kisin, Overconvergent modular forms and the FontaineMazur conjecture, Invent. Math.
153 (2003), 373454.
[147] M. Kisin, Crystalline representations and F-crystals, in Algebraic Geometry and Number
Theory, Progress in Math. 253, Birkhuser, Boston, 2006, 459496.
References
371
[148] R. E. Kottwitz, Isocrystals with additional structure, Comp. Math. 56 (1985), 201220.
[149] R. E. Kottwitz, Isocrystals with additional structure. II, Comp. Math. 109 (1997), 255339.
[150] R. E. Kottwitz, On the HodgeNewton decomposition for split groups, Int. Math. Res.
Notices 26 (2003), 14331447.
[151] W. Krull, Allgemeine Bewertungstheorie, J. fr Math. 167 (1932), 160196.
[152] F.-V. Kuhlmann, Elimination of ramification I: The generalized stability theorem, preprint
available at http://math.usask.ca/ fvk/; to appear in Trans. Amer. Math. Soc.
[153] S. Lang, Algebraic groups over finite fields, Amer. J. Math. 78 (1956), 555563.
[154] A. G. B. Lauder, Deformation theory and the computation of zeta functions, Proc. London
Math. Soc. 88 (2004), 565602.
[155] G. Laumon, Cohomology of Drinfeld Modular Varieties I, Cambridge Studies in Advanced
Math. 41, Cambridge University Press, 1996.
[156] M. Lazard, Les zros dune fonction analytique dune variable sur un corps valu complet,
Publ. Math. IHS 14 (1962), 4775.
[157] B. Le Stum, Rigid Cohomology, Cambridge Tracts in Math. 172, Cambridge University
Press, 2007.
[158] A. H. M. Levelt, Jordan decomposition for a class of singular differential operators, Ark.
Mat. 13 (1975), 127.
[159] J. Lindenstrauss and L. Tzafriri, Classical Banach Spaces, Lecture Notes in Math. 338,
Springer, 1973.
[160] R. Liu, Cohomology and duality for (, )-modules over the Robba ring, Int. Math. Res.
Not. 2007, article ID rnm150 (32 pages).
[161] F. Loeser, Exposants p-adiques et thormes dindice pour les quations diffrentielles
p-adiques (daprs G. Christol et Z. Mebkhout), in Proc. Sminaire Bourbaki, Vol. 1996/97,
Astrisque 245 (1997), 5781.
[162] A. Loewy, ber einen Fundamentalsatz fr Matrizen oder lineare homogene Differentialsysteme, Sitzungsberichte der Heidelberger Akademie der Wissenschaften, 5. Abhandlung
(1918), 136.
[163] E. Lutz, Sur lquation y 2 = x 3 + Ax + B sur les corps p-adiques, J. reine angew Math.
177 (1937), 238247.
[164] B. Malgrange, Sur les points singuliers des quations diffrentielles, Enseign. Math. 20
(1974), 147176.
[165] B. Malgrange, Connexions mromorphes 2: le rseau canonique, Invent. Math. 124 (1996),
367387.
[166] Yu. I. Manin, The theory of commutative formal groups over fields of finite characteristic
(in Russian), Usp. Math. 18 (1963), 390; English translation, Russian Math. Surveys 18
(1963), 180.
[167] A. Marmora, Irrgularit et conducteur de Swan p-adiques, Doc. Math. 9 (2004), 413433.
[168] S. Matsuda, Local indices of p-adic differential operators corresponding to ArtinSchreier
Witt coverings, Duke Math. J. 77 (1995), 607625.
[169] S. Matsuda, Katz correspondence for quasi-unipotent overconvergent isocrystals, Comp.
Math. 134 (2002), 134.
[170] S. Matsuda, Conjecture on AbbesSaito filtration and ChristolMebkhout filtration, in
Geometric Aspects of Dwork Theory. Vols. I, II, de Gruyter, Berlin, 2004, 845856.
[171] Z. Mebkhout, Analogue p-adique du thorme de Turrittin et le thorme de la monodromie
p-adique, Invent. Math. 148 (2002), 319351.
[172] P. Monsky, Formal cohomology, II: the cohomology sequence of a pair, Ann. Math. 88
(1968), 218238.
[173] P. Monsky, Formal cohomology, III: Fixed point theorems, Ann. Math. 93 (1971),
315343.
[174] P. Monsky and G. Washnitzer, Formal cohomology, I, Ann. Math. 88 (1968), 181217.
372
References
[175] M. Nagata, Local Rings, Interscience Tracts in Pure and Applied Math. 13, John Wiley &
Sons, New York, 1962.
[176] F. Oort and T. Zink, Families of p-divisible groups with constant Newton polygon, Doc.
Math. 7 (2002), 183201.
[177] O. Ore, Theory of non-commutative polynomials, Ann. Math. 34 (1933), 480508.
[178] B. Osserman, The Weil conjectures, in W. T. Gowers (ed.), The Princeton Companion to
Mathematics, Princeton University Press, 2008.
[179] A. Ostrowski, Untersuchungen zur arithmetischen Theorie der Krper, Math. Z. 39 (1935),
269404.
[180] J. Pila, Frobenius maps of abelian varieties and finding roots of unity in finite fields, Math.
Comp. 55 (1990), 745763.
[181] E. Pons, Modules diffrentiels non solubles. Rayons de convergence et indices, Rend. Sem.
Mat. Univ. Padova 103 (2000), 2145.
[182] B. Poonen, Maximally complete fields, Enseign. Math. (2) 39 (1993), 87106.
[183] S. Priess-Crampe and P. Ribenboim, Differential equations over valued fields (and more),
J. reine angew. Math. 576 (2004), 123147.
[184] A. Pulita, Frobenius structure for rank one p-adic differential equations, in Ultrametric functional analysis, Contemp. Math. 384, Amer. Math. Soc., Providence, 2005,
247258.
[185] A. Pulita, Rank one solvable p-adic differential equations and finite Abelian characters via
LubinTate groups, Math. Ann. 337 (2007), 489555.
[186] J.-P. Ramis, Thormes dindices Gevrey pour les quations diffrentielles ordinaires, Mem.
Amer. Math. Soc. 48 (1984).
[187] P. Ribenboim, The Theory of Classical Valuations, Springer-Verlag, New York, 1999.
[188] J. F. Ritt, Differential Algebra, Colloq. Pub. XXXIII, Amer. Math. Soc., New York, 1950.
[189] P. Robba, Lemmes de Hensel pour les oprateurs diffrentiels. Application la rduction
formelle des quations diffrentielles, Enseign. Math. (2) 26 (1980), 279311.
[190] P. Robba, Indice dun oprateur diffrentiel p-adique. IV. Cas des systmes. Mesure de
lirrgularit dans un disque. Ann. Inst. Fourier (Grenoble), 35 (1985), 1355.
[191] A. M. Robert, A Course in p-adic Analysis, Graduate Texts in Math. 198, Springer-Verlag,
New York, 2000.
[192] R. T. Rockafellar, Convex Analysis, Princeton University Press, Princeton, 1970.
[193] W. H. Schikhof, Ultrametric Calculus: An Introduction to p-adic Analysis, Cambridge
Studies in Advanced Math. 4, Cambridge University Press, 1984.
[194] W. Schmid, Variation of Hodge structure: the singularities of the period mapping, Invent.
Math. 22 (1973), 211319.
[195] P. Schneider, Nonarchimedean Functional Analysis, Springer-Verlag, Berlin, 2002.
[196] R. Schoof, Elliptic curves over finite fields and the computation of square roots mod p,
Math. Comp. 44 (1985), 483494.
[197] E. Schrner, Ultrametric fixed point theorems and applications, in Valuation Theory and its
Applications, Vol. II (Saskatoon, SK, 1999), Fields Inst. Communications 33, Amer. Math.
Soc., 2003, 353359.
[198] J.-P. Serre, Local Fields, Graduate Texts in Math. 67, Springer-Verlag, New York, 1979.
[199] J.-P. Serre, Galois Cohomology, Corrected reprint of the 1997 English edition, Springer
Monographs in Math., Springer-Verlag, Berlin, 2002.
[200] J. H. Silverman, The Arithmetic of Elliptic Curves, second printing, Graduate Texts in Math.
106, Springer-Verlag, New York, 1991.
[201] M. F. Singer and M. van der Put, Galois Theory of Difference Equations, Lecture Notes in
Math. 1666, Springer-Verlag, Berlin, 1997.
[202] M. F. Singer and M. van der Put, Galois Theory of Linear Differential Equations,
Grundlehren der Math. Wiss. 328, Springer-Verlag, Berlin, 2003.
References
373
[203] J. Tate, p-divisible groups, in Proc. Conf. on Local Fields (Driebergen, 1966), SpringerVerlag, Berlin, 1967, 158183.
[204] T. Terasoma, Confluent hypergeometric functions and wild ramification, J. Algebra 185
(1996), 118.
[205] A. Thuillier, Thorie du potentiel sur les courbes en gomtrie analytique non archimdienne. Applications la theorie dArakelov, thesis, Universit de Rennes 1, 2005.
[206] N. Tsuzuki (as T. Nobuo), The overconvergence of morphisms of tale --spaces on a
local field, Compos. Math. 103 (1996), 227239.
[207] N. Tsuzuki, Finite local monodromy of overconvergent unit-root F-isocrystals on a curve,
Amer. J. Math. 120 (1998), 11651190.
[208] N. Tsuzuki, The local index and the Swan conductor, Comp. Math. 111 (1998), 245288.
[209] N. Tsuzuki, Slope filtration of quasi-unipotent overconvergent F-isocrystals, Ann. Inst.
Fourier (Grenoble) 48 (1998), 379412.
[210] N. Tsuzuki, Morphisms of F-isocrystals and the finite monodromy theorem for unit-root
F-isocrystals, Duke Math. J. 111 (2002), 385419.
[211] N. Tsuzuki, On base change theorem and coherence in rigid cohomology, Doc. Math. extra
volume in honour of Kazuya Katos fiftieth birthday (2003), 891918.
[212] H. L. Turrittin, Convergent solutions of ordinary linear homogeneous differential equations
in the neighborhood of an irregular singular point, Acta Math. 93 (1955), 2766.
[213] M. van der Put, The cohomology of Monsky and Washnitzer, in Introductions aux
cohomologies p-adiques (Luminy, 1984), Mm. Soc. Math. France 23 (1986), 3359.
[214] A. C. M. van Rooij, Non-Archimedean Functional Analysis, Monographs and Textbooks in
Pure and Applied Math. 51, Marcel Dekker, New York, 1978.
[215] M. Vaqui, Valuations, in Resolution of Singularities (Obergurgl, 1997), Progr. Math. 181,
Birkhuser, Basel, 2000, 539590.
[216] C. A. Weibel, An Introduction to Homological Algebra, Cambridge Studies in Advanced
Math. 38, Cambridge University Press, Cambridge, 1994.
[217] H. Weyl, Das asymptotische Verteilungsgesetz der Eigenwerte linearer partieller Differentialgleichungen (mit einer Anwendung auf die Theorie der Hohlraumstrahlung), Math. Ann.
71 (1912), 441479.
[218] H. Weyl, Inequalities between the two kinds of eigenvalues of a linear transformation, Proc.
Nat. Acad. Sci. USA 35 (1949), 408411.
[219] L. Xiao, On ramification filtrations and p-adic differential equations, I: equal characteristic
case, arXiv:0801.4962v2 (2008).
[220] L. Xiao, On ramification filtrations and p-adic differential equations, II: mixed characteristic
case, arXiv:0811.3792v1 (2008).
[221] L. Xiao, Non-archimedean differential modules and ramification theory, thesis, Massachusetts Institute of Technology, 2009.
[222] P. T. Young, Radii of convergence and index for p-adic differential operators, Trans. Amer.
Math. Soc. 333 (1992), 769785.
[223] H. Zassenhaus, ber eine Verallgemeinerung des Henselschen Lemmas, Arch. Math.
(Basel) 5 (1954), 317325.
[224] I. B. Zhukov, A approach to higher ramification theory, in Invitation to higher local
fields (Mnster, 1999), Geom. Topol. Monographs 3, Geom. Topol. Publ., Coventry, 2000,
143150.
[225] I. B. Zhukov, On ramification theory in the case of an imperfect residue field (in Russian),
Mat. Sb. 194 (2003), 330; translation, Sb. Math. 194 (2003), 17471774.
[226] T. Zink, On the slope filtration, Duke Math. J. 109 (2001), 7995.
Notation
| | (-Gauss (semi)norm), 26
|A| (norm of a matrix), 56, 63
C p (completed algebraic closure of
Q p ), 28
D(V )
equal characteristic, 314
mixed characteristic, 359
D (V )
equal characteristic, 317, 318
mixed characteristic, 359
(V ), 360
Drig
Diag(1 , . . . , n ) (diagonal
matrix), 55
disc( f, r ) (discrepancy), 202
E (completion of oK ((t)) o K K ), 156
E (overconvergent series), 263
E L (unramified extension of E), 314
E L (unramified extension of E ), 316
E (perfection of E), 269
E (perfection of E ), 269
E (completed maximal unramified extension
of E), 314
[E : F] (degree of a field extension), 27
Ext(, ) (Yoneda extension group), 82
F alg (algebraic closure), 45
F (completion of K (t) for | | ), 155
F (completion of K (t p ) for | | p ), 171
F (completion of K ((t 1) p 1) for
| | p ), 181
F sep (separable closure), 45
|F | (multiplicative value group), 16
374
Notation
R (Robba ring), 264
R L (extension of R), 327
R(M) (radius of convergence), 154
R{T } (ring of twisted polynomials)
over a difference ring, 246
over a differential ring, 85
(extended Robba ring), 282
R
R(V ) (generic radius of convergence), 156
R (group of units), 13
s ( f ) (slope of a function), 185
T (transpose), 55
(transpose, Hermitian), 56
T (inverse transpose), 55
|T | (operator norm), 94
T (translation), 185
375
Index
376
Index
difference
field, 244
difference-closed, 247
isometric, 249
Galois theory, 258
ideal, 244
module, 244
ring, 244
differential
Galois theory, 91
ideal, 78
module, 78
normed, 96
on an open annulus, 152
trivial, 78
unipotent, 78, 153
ring or field, 77
slopes, 211
dimensioned constants, 27
discrepancy
of a differential module, 203
of a function, 202
discretely valued field, 16
dual
of a difference module, 245
of a differential module, 82
duality for convex functions, 43
dualizable difference module, 245
Dwork
exponential series, 181, 292
trick, 273, 275, 280, 281, 295,
329, 336
eigenvalues
prepared, 123
weakly prepared, 123
elementary divisors, 63
elementary matrix, 55
elementary row or column
operation, 56
elliptic curve, 3
elliptic integral, 5
equivalence (of p-adic exponents), 224
tale cohomology, 5, 354
tale difference module, see also unit-root
difference module
tale fundamental group, 323
exponents
of a differential module, 122
of a p-adic differential module, 226
extended Robba ring, 282
exterior power, 57, 82
exterior product , 57
extrinsic generic radius of convergence, 157
Feketes lemma, 94
field of norms, 357
377
filtration
lower numbering, 52
upper numbering, 52, 321
finite local monodromy ( p-adic), 318
formal adjoint (of a twisted polynomial), 86,
246
Frchet completeness, 139
Frobenius
antecedent, 173, 181, 213, 225, 303, 341,
342
descendant, 171, 181, 193, 196, 206
lift, 265
absolute, 265
standard, 297
zero-centered, 266
off-center, 181, 196
structure (on a differential module), 160,
291, 327
Fuchs theorem, 125
p-adic, 221, 231, 302
Fuchsian singularities, 130
full spectrum, 100
fundamental solution matrix, 123, 221, 234,
295, 302, 308
Galois cohomology, 363
Galois descent, 129, 173
Gauss
(semi)norm, 26, 138, 282
for twisted polynomials, 104
(semi)valuation, 26
for twisted polynomials, 104
GaussJordan elimination, 63, 141
GaussManin connection, 351
generalized power series, 31, 282
generic Newton polygon, 266, 280
generic point, 160
generic radius of convergence, 156
extrinsic, 157
intrinsic, 157
Gevrey functions, 130
gluing lemma, 142, 208
Hadamard three-circle theorem, 139
HahnBanach theorem, 32
HasseArf theorem, 322
Hasses theorem, 3
henselian field, 46, 263
Hensels lemma, 41
Hermitian transpose, 56
Herr complex, 361
HilbertNoether theorem, 173, 314, 359
Hodge polygon
of a difference module, 254
of a matrix, 63
HodgeNewton decomposition, 65
for difference modules, 254
378
homomorphism
bounded, 14
isometric, 14
horizontal
element (of a differential module), 78
morphism (of differential modules), 81
Horn inequalities, 71
hypergeometric
differential equation, 5, 120, 159,
222, 348
series, 5, 159
Igusa polynomial, 6
index
of a differential module, 126
of nilpotency, 123
inertia group, 49, 323
intrinsic
generic radius of convergence, 157
radius, 157
subsidiary generic radii of
convergence, 162
invariant factors, 63
inversive difference ring, 245
irregularity, 119, 162
isoclinic difference module, 250
isometric
difference field, 249
homomorphism, 14
JordanHlder constituents, 80
KiehlTate theorems, 149
Kummer theory, 50
Lang torsor, 259
Lefschetz
principle, 119
trace formula, 5
limit (of a sequence), 14
Liouville number, 219
LittlewoodRichardson
numbers, 71
local horizontal section, 154
locally free module, 84
lower convex hull, 36, 105
lower numbering filtration, 52
MalcevNeumann series, 31, 282
master factorization theorem, 39, 104, 140,
142, 246
matrix of action, 80, 244
maximally complete field, 28
metrically equivalent seminorms, 14
minor (of a matrix), 61
mirror symmetry, 355
monodromy transformation, 122
Index
multiplicative
seminorm, 15
value group, 16
mysterious functor, 363
Newton above Hodge theorem, 64
for difference modules, 254
Newton polygon
generic, 266, 280
of a difference module, 254, 279
of a matrix, 64
of a polynomial, 36
of a power series, 138
of a twisted polynomial, 104, 246
semicontinuity theorem, 266, 280, 311, 329
special, 266, 280
Newtons method, 264
nilpotency index, 123
nonarchimedean
group, ring, field, 16
seminorm, 16
norm, 14
Gauss, 26
for twisted polynomials, 104
p-adic, 26
spectral, 95
supremum, 17
t-adic, 25
trivial, 25
normal (matrix), 60
normed differential module, 96
off-center Frobenius antecedents and
descendants, 181, 196
operator norm, 56, 94, 121
opposite
difference ring/module, 245
ring, 86
optimal basis, 198
order of log-growth, 308
Ostrowskis theorem, 17, 27
outer provinces, rebellious, 300
overconvergent F-isocrystal (on the affine
line), 352
p-adic
Cauchy theorem, 154, 302
Fuchs theorem, 221, 231, 302
Hodge theory, 357
integers, ring of, 26
Liouville number, 219
local monodromy theorem, 2, 114, 234,
279, 327
unit-root, 318
norm, 26
numbers, field of, 26
Turrittin theorem, 338
Index
p-divisible group, 285, 336, 364
p-power roots of unity, 226
perfection, 249, 339
(, )-module, 359
PicardFuchs
equation, 7
module, 88, 122, 294, 347
piecewise affine function, 136
Poincar pairing, 353
potentially semistable representation,
336, 363
prepared eigenvalues, 123
principal minor (of a matrix), 61
product formula, 26
product seminorm, 23
projective module, 84
pseudoconvergent sequence, 30
pseudolimit, 30
pure
difference module, 250, 277
differential module, 100
quasiconstant differential module, 327
quasiunipotent
differential module, 327
monodromy, 122
quotient seminorm, 16
radii of optimal convergence, 198
radius of convergence (of a differential
module), 154
ramification numbers, 211
RapoportZink spaces, 285
rational type, 164
real
semivaluation, 16
valuation, 16
rebellious outer provinces quelled by
Frobenius, 300
refined decomposition theorem
strong, 179, 214
visible, 113
refined differential module, 100, 342
regular differential module, 119,
234, 347
residue field (of a nonarchimedean field), 17
rigid analytic geometry, 8
rigid cohomology, 353
Robba condition, 223
Robba ring, 264, 360
bounded, 217, 263, 359
extended, 282
Schauder basis, 32
semiabsolute value, see also seminorm
semicontinuity theorem for Newton polygons,
266, 280, 311, 329
379
seminorm, 13
compatible, 17
Gauss, 26
metrically equivalent, 14
multiplicative, 15
product, 23
quotient, 16
submultiplicative, 15
supremum, 17
topologically equivalent, 14
semistable
difference module, 281
semivaluation, 16
Gauss, 26
separated abelian group, 14
shearing transformation, 125, 226, 227,
234, 310
Shilov boundary, 149
singular value, 95
archimedean, 56
decomposition, 56
nonarchimedean, 63
slope (of a polynomial), 36
slope filtration theorem, 279, 298,
340, 364
Smith normal form, 63
solvable differential module, 211, 294, 302
special Newton polygon, 266, 280
spectral norm, 95
spectral radius, 94
spectrum
full, 100
visible, 105
spherically complete metric space, 28
standard Frobenius lift, 297
strong decomposition theorem, 177
strong triangle inequality, 16
strongly difference-closed difference
field, 247
subharmonicity, 190
submultiplicative seminorm, 15
subsidiary (generic) radii (of convergence),
162
supersingular elliptic curve, 6
supremum (semi)norm, 17
supremum-equivalent seminorm, 18
t-adic valuation, 25
tame degree (of an extension), 49
tamely ramified extension, 49, 101, 113, 179,
323
Taylor series, 90, 159, 173, 226, 296
Teichmller lift, 7, 259, 358
tensor product
of difference modules, 244
of differential modules, 81
topologically equivalent seminorms, 14
380
totally ramified extension, 48
totally wildly ramified extension, 49
trace (element or component), 82
trace-zero component, 82
transfer theorem, 158
regular singular, 234, 295
triangle inequality, 14
strong, 16
trianguline representation, 364
trivial
difference module, 244
differential module, 78
TurrittinLeveltHukuhara decomposition
theorem, 114, 128, 338
twisted polynomial
over a difference ring, 246
over a differential ring, 85
type (of a field element), 218
ultrametric seminorm, 16
unipotent differential module,
78, 153
unit-root
difference module, 250, 298,
315, 318
p-adic local monodromy theorem, 318
unramified extension, 47
upper numbering filtration, 52, 321
Index
valuation, 16
Gauss, 26
for twisted polynomials, 104
t-adic, 25
valuation ring (of a nonarchimedean field), 17
value group, 16
variation of Hodge structures, 335
visible decomposition theorem, 107, 176
refined, 113
visible spectrum, 105
weak equivalence (of p-adic exponents), 224
weak topology, 358
weakly difference-closed difference field, 247
weakly prepared eigenvalues, 123, 221
wedge power, see also exterior power
Weierstrass preparation, 140
Weil conjectures, 4
well-ordered set, 31, 282
width (of a polynomial), 35
wild inertia subgroup, 49
wildly ramified extension, 49
Witt vectors, 258, 293, 358
Yoneda extension group, 82, 245
zero-centered Frobenius lift, 266
zeta function, 4, 349, 354