Matzat Greuel Hiss (Eds.) Algorithmic Algebra and Number Theory
Matzat Greuel Hiss (Eds.) Algorithmic Algebra and Number Theory
)
Algorithmic Algebra and Number Theory
Springer
Berlin
Heidelberg
New York
Barcelona
Budapest
Hong Kong
London
Milan
Paris
Singapore
Tokyo
B. Heinrich Matzat Gert -Martin Greuel
Gerhard Hiss (Eds.)
Algorithmic Algebra
and Number Theory
t Springer
B. Heinrich Matzat
Interdisziplinares Zentrum
fUr Wissenschaftliches Rechnen
der Universitat Heidelberg
1m Neuenheimer Feld 368
D-69120 Heidelberg, Germany
e-mail: [email protected]
Gert-Martin Greuel
Fachbereich Mathematik
Universitat Kaiserslautern
Postfach 3049
D-67653 Kaiserslautern Germany
e-mail: [email protected]
Gerhard Hiss
RWTHAachen
Lehrstuhl D fUr Mathematik
Templergraben 64
D-52062 Aachen, Germany
e-mail: [email protected]
This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned,
specifically the rights of translation, reprinting. reuse of illustrations, recitation, broadcasting, reproduction on
microfilm or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted
only under the provisions of the German Copyright Law of September 9,1965, in its current version, and permission for
use must always be obtained from Springer-Verlag. Violations are liable for prosecution under the German Copyright
Law.
© Springer-Verlag Berlin Heidelberg 1999
The use of general descriptive names, registered names, trademarks, etc. in this publication does not imply, even in
the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and
therefore free for general use.
Typesetting: By the authors using a Springer LaTeX Macro Package
Cover design: design & production GmbH, Heidelberg
SPIN: 10654649 46/3143 - 5 4 3 2 1 0 - Printed on acid-free paper
Preface
This book contains 22 lectures presented at the final conference of the Ger-
man research program (Schwerpunktprogramm) Algorithmic Number The-
ory and Algebra 1991-1997, sponsored by the Deutsche Forschungsgemein-
schaft.
The purpose of this research program and of the meeting was to bring
together developers of computer algebra software and researchers using com-
putational methods to gain insight into experimental problems and theoret-
ical questions in algebra and number theory.
The book gives an overview on algorithmic methods and on results ob-
tained during this period. This includes survey articles on the main research
projects within the program:
Some of the articles illustrate the current state of computer algebra sys-
tems and program packages developed with support by the research pro-
gram, such as KANT and LiDIA for algebraic number theory, SINGULAR,
RED LOG and INVAR for commutative algebra and invariant theory respec-
tively, and GAP, SYSYPHOS and CHEVIE for group theory and representation
theory.
According to the three main research directions, the book is divided
into three parts representing algorithmic aspects of algebraic number the-
ory, commutative algebra and algebraic geometry, and group theory and
representation theory, edited by B. H. Matzat, G.-M. Greuel and G. Hiss,
respectively.
The editors thank the contributors to this volume and the Deutsche
Forschungsgemeinschaft for its support of the research program and the
conference held in Heidelberg.
Algorithmic Algebraic
N umber Theory
Sieving Methods for Class Group Computation
Johannes Buchmann and Michael J. Jacobson, Jr. and Stefan Neis and Patrick
Theobald and Damian Weber
1 Introduction
Computing the class group and regulator of an algebraic number field K are two
major tasks of algorithmic algebraic number theory. The asymptotically fastest
method known has conjectured sub-exponential running time and was proposed
in [5]. In this paper we show how sieving methods developed for factoring algo-
rithms can be used to speed up this algorithm in practice. We present numerical
experiments which demonstrate the efficiency of our new strategy. For example,
we are able to compute the class group of an imaginary quadratic field with a dis-
criminant of 55 digits 20 times as fast as S. Dtillmann in an earlier record-setting
implementation ([1]) which did not use sieving techniques. We also present class
numbers of large cubic fields.
2 The Algorithm
We will consider the problem of computing the class group Cl(K) of an algebraic
number field K given by an irreducible monic polynomial of degree n = r + 2s,
where r is the number of real embeddings and s is the number of complex
embeddings of K into the field <C of complex numbers. We denote the maximal
order of K by OK. The norm of an algebraic number a will be denoted by N (a),
and the norm of an ideal a will be denoted by N(a). The class number of K will
be denoted by h and the regulator by R.
We briefly review the algorithm presented in [5]. Let FB be a set of prime
ideals over K and k = WBI. For e = (e1, ... ,ek) E 2Zk, we write
k
FB e = II p~i.
i=l
and
P: AFB ~ ZZk X IRr +s - I
where .1K denotes the discriminant of the field K and w denotes the number of
roots of unity in K. This formula enables us to compute a number h* with
h* < hR < 2h*.
If the determinant of the sublattice of P(AFB) is in this interval then the sub-
lattice is the full lattice. Otherwise, additional relations can be generated until
we find the full lattice.
3 Generating Relations
Several methods for finding relations among the factor base elements have been
suggested, for example, in [1], [14], and [6]. We show how to use sieving techniques
known from factoring algorithms to compute relations very efficiently.
(x + yp)O = (FE't
in the order 0 = LZ'[Xll(f), where FE' is a factor base consisting of prime ideals
of 0 and p is a zero of I. These factorizations can be lifted to factorizations in
OK by computing ordp(a) for the index divisors P 1 [OK: 0]. In this way, we
obtain factorizations
and vectors
(e, log lal (0)1,· .. ,log lar+s-l (0)1) E <l>(AFB).
However, this method still seems to be impractical for large discriminants,
since the lattice generated by the relations usually does not have full rank.
Therefore, we also try to carryover the idea described for quadratic fields. In
general, we will proceed as follows:
- Select a random v E {-10, ... , 0, ... ,10}k and compute a = F EV. Often v
will contain a certain prescribed non-zero entry at a position corresponding
to a linearly dependent row of the relation matrix. In most cases, this will
increase the rank of the relation matrix.
6 J. Buchmann, M.J. Jacobson Jr., S. Neis, P. Theobald and D. Weber
4 HNF Computation
After finding enough relations, we have to compute the Hermite normal form of
the lattice generated by the relations. Since this requires linear algebra over ZZ,
this problem is considerably harder than the linear algebra step involved in fac-
toring or discrete logarithm computation. However, since the matrices involved
are sparse and contain small entries, it is possible to apply special techniques to
do this computation as follows.
In the first stage, we apply an algorithm for computing the Hermite normal
form using unimodular operations over ZZ. This algorithm is basically described
in [7] and computes the HNF successively row by row. To conserve sparseness
and avoid entry explosion we use several heuristics in the pivot selection and
entry elimination step. These heuristics are based on results of [10], [9], [11], [12]
and a lot of practical experience. The heuristics used will be adapted, depending
on the number of entries in the remaining part of the matrix and the size of
these entries.
If the entries become too large or the matrix becomes "too dense" (for exam-
ple, if more than two thirds of the remaining entries are non-zero), we switch to
stage two. We compute a multiple of the lattice determinant of the relation ma-
trix. In this step, we try to obtain as small a multiple as possible to simplify the
following computation. Finally we apply the modular algorithm for computing
the Hermite normal form described in [8].
Sieving Methods for Class Group Computation 7
5 Practical Results
In 1991, S. Diillmann [1] needed 10 days on a distributed network of fourteen
Spare! and SparcSLC computers to compute the class number of the quadratic
field with discriminant -(4 x 1054 + 4) (55 digits) using an improved version of
Hafner and McCurley's algorithm with large prime variant. On a single Spar-
cUltra1, this should be approximately equivalent to 7 days and 17 hours. Now,
using sieving methods, it takes only 8 hours and 53 minutes to obtain this result
on a SparcUltral.
We present our results for the imaginary quadratic case. We were able to
compute the class group for the first four of the following discriminants. For the
last two, we have so far only been able to compute relation matrices. The number
in parenthesis after the discriminant is the number of decimal digits. The class
group is presented as [rno rn2 ... rns-d, where Cl(K) == Ef)o9<s 7Z /rni7Z.
.15 -46952046735522451306774137871578512166228058934334430430/
26971349460603 (70)
h ???
Cl ???
"# forms" is the number of forms which were used for sieving, tL is the is the
total CPU time in minutes required to generate the relation matrix, t is the total
time required to compute the class group, and told is an estimation of the time
required using the techniques in [2]. The computations were all carried out on a
SPARC-ultra computer.
For the real quadratic case, we present some statistics from computing rela-
tion matrices (using large prime variant) for discriminants .::1 = 10i + 1.
i IFBI M forms % lp tL
30 355 22628 90 0.00 0.23
35 650 43756 191 0.57 0.71
40 1150 81572 346 2.05 1.50
45 2500 192772 541 3.26 4.65
50 3000 235652 1075 3.61 10.55
55 4400 363212 3122 15.37 44.36
60 6100 522532 6103 45.91 149.99
65 7600 660332 15909 52.46 528.96
70 8800 764572 47955 55.48 1581.51
75 9600 859412 164547 58.07 5739.84
class group. The computations were all carried out on a Sparc4 computer with
32 MByte RAM.
m l(.1 K ) WBI h tL t
2431~-17 22 100 230666616 7 10
2431 3 -13 22 100 204575544 6 9
2431 3 -11 21 100 67575654 11 14
2431 3 -1 14 100 73143 11 13
2431 3 +1 19 100 11803968 10 13
2431 3 + 11 22 100 208464912 6 9
2431 3 +13 22 100 171651312 9 11
2431 3 +17 19 100 8682552 9 11
46189~ - 17 29 200 528112732248 36 62
46189 3 - 13 28 200 230653109628 43 61
46189 3 - 11 29 200 534799522773 39 58
46189 3 - 1 26 200 84034542636 46 62
46189 3 + 1 28 200 614506494573 41 57
46189 3 + 11 25 200 21657374964 41 59
46189 3 + 13 30 200 1201562002152 36 56
46189 3 + 17 29 200 439589101464 41 62
4869293~ - 17 38 400 7672357980222192 130 6 min
4869293 3 - 13 40 400 181790041779396270 136 7 min
4869293 3 - 11 42 400 732630532207386456 172 6.5 min
4869293 3 -1 41 400 1842592025672169858 133 6 min
4869293 3 + 1 41 400 931340490377083533 123 6 min
4869293 3 + 11 37 400 8165641344661584 143 6 min
4869293 3 + 13 42 400 1038577938978984723 121 6.5 min
4869293 3 + 17 42 400 1028253243988827468 118 6.5 min
m l(.1K) IFBI h tL t
840564439~ - 17 55 700 3842073100240397835871905 77 min 115 min
840564439 3 - 11 54 700 808279508202073001953848 74 min 107 min
840564439 3 - 1 55 700 4813779331073984156486937 43 min 76 min
840564439 3 + 1 54 700 1806681492048597457841100 76 min 115 min
840564439 3 + 13 55 700 2653130558498568497238285 43 min 83 min
840564439 3 + 17 53 700 165032036039112923443113 66 min 102 min
References
1. J. Buchmann and S. Diillmann. Distributed class group computation. In J. Buch-
mann, H. Ganzinger, and W.J. Paul, editors, Informatik - Festschrift aus Anlaft des
sechzigsten Geburtstages von Herm Prof. Dr. G. Hotz, volume 1 of Teubner-Texte
zur Informatik, pages 68-81. B. G. Teubner, 1991.
2. J. Buchmann and S. Diillmann. A probabilistic class group and regulator algo-
rithm and its implementation. In Computational number theory, Proc. Colloq.,
Debrecen/Hung. 1989, pages 53-72, 1991.
10 J. Buchmann, M.J. Jacobson Jr., S. Neis, P. Theobald and D. Weber
1 Introduction
Let us assume that C(K) contains a point Po. Using the fact that Jc is
isomorphic to pica (C), we get a morphism of C (or more generally of symmetric
products of C) to Jc with Po as base point.
This relation between C and Jc makes it obvious that information about
C(K) can be obtained from JoCK) and vice versa. It is important to note that
due to results of Grothendieck and Raynaud the arithmetic surface C is closely
related to the canonical extension of Jc to a group scheme over OK, its Neron
model. The L-series of C is equal to the L-series of Jc which can be interpreted
as an Euler product where the local factor at f is the characteristic polynomial of
F'robenius automorphisms acting on the first f-adic cohomology group of Jc, the
Tate module of Jc at f. Hence we get a relation between diophantine problems
and f-adic Galois representations of the Galois group G K of K which is of
greatest importance (cf. the proof of the theorem of Faltings).
Another important aspect of the L-series is the connection with rational
points on PicO(C). The local factors Lp(t) take the value #PicO(Cp)(OK/P) at
t = 1. Globally one knows by the Theorem of Mordell- Weil that JoCK) is a
finitely generated Z-module which has a natural euclidean structure given by
the Neron-Tate pairing. The Conjecture of Birch and Swinnerton-Dyer assumes
the truth of the conjecture of Hasse-Weil and predicts that the arithmetic data
of this euclidean module can be deduced from the analytical behavior of L( C, s)
at s = 1. Manin used this conjecture to construct a conditional algorithm to
determine generators of JoCK).
The purpose of this paper is to give a report on possibilities to solve a good
part of the algorithmic problems described above in special cases:
We take K = <Q and assume that Jc is isogenous to a factor of the Jacobian of
a modular curve X.
In fact we restrict ourselves to the simplest case and assume that X is one of
the modular curves Xo(N) parameterizing elliptic curves with cyclic isogenies of
degree N. We remark that nearly all algorithms could be applied to the curves
X = Xl (N) or intermediate curves, too.
The origin of our algorithms is the concrete knowledge of the holomorphic dif-
ferentials as cusp forms resp. their dual space, the singular homology, of Xo(N)
over Z together with the action of a big part of the endomorphism ring of the J a-
cobian given by the ring 1l'N of Hecke operators. Moreover the geometric L-series
of Xo(N) can be defined analytically and the conjecture of Hasse-Weil is true.
For the convenience of the reader we describe this theory in the next section.
In the third section we develop the basic algorithms which depend on results
of Manin, Merel and Zagier. They use the theory of Manin resp. modular symbols
with explicit action of 1l'N to decompose the space of cusp forms into eigenspaces
w.r.t. the action of the group ring 1l'N[GQ]. This determines the <Q-factors of
Jo(N), the Jacobian of Xo(N), up to isogeny. Moreover we can construct Z-
bases of these spaces by expanding modular forms into Fourier series. There are
other methods to compute the space of modular, respectively cusp forms, for
example the algorithm of Birch ([4]) which uses ternary quadratic forms. This is
a special case ofthe method of Brandt matrices developed by M. Eichler (see [11])
Arithmetic of Modular Curves and Applications 13
S = (0 -1) ,
1 0
and R = (0 -1) .
1 -1
For a positive integer N, denote by
ro(N) = {Q = (~~) E SL 2 (Z) Ie=: 0 mod N}
14 G. Frey, M. Miiller
the Hecke subgroup of SL2 (Z) of level N. SL 2 (lR) and hence ro(N) act via
fractional linear transformations given by
( a b) . Z r+ az±b
cd cz±d
lHl* := IHl u Q u { 00 }
obtained from IHl by adjoining the set of cusps Q U { oo} to IHl. We denote the
orbits of this action by
For any function f(z) on lHl, a positive integer k and matrices 0: = (~~) in
GL 2(lR) with positive determinant we define:
X:Z-+IC
For a precise definition of holomorphicity and vanishing at the cusps see [45]
Chap. (2.1). The pull-back of a holomorphic differential form W E f]l(Xo(N))
under the covering map 1f has locally the form f(z)dz with complex parameter
z. It is easy to see, but crucial, that fez) is a cusp form of weight 2 and level N.
Lemma 2.3. The map fCz) I-t 21fif(z)dz induces an isomorphism between the
space of cusp forms S2 (N) and the holomorphic differentials f]l (XoCN)) of
Xo(N). The dimension of S2(N) as a complex vector space is equal to the genus
of the modular curve Xo(N).
Since (fi t) belongs to Fo(N), property (2) of Definition (2.2) implies that a cusp
form is a periodic function on IHl with period 1. Therefore fCz) has a Fourier
expansion near infinity:
fez) =
n=l
Remark 2.4. It is a result due to Ogg [39] that for square-free level N > 37
Aut(Xo(N)) = {w n I nlN and gcd(n,N/n) = I}.
Next we look at the restriction map
for every positive integer n coprime to the level N. By combining the restriction
map an with the involutions Wn:
The AT are the (Hecke-) eigenvalues with respect to T. By EAT we denote the
AT-eigenspace, i.e.
ET~ := {J(z) E Mk(N, X) I T(f(z)) = AT . J(z)}
where T E 'lI'N is fixed.
We shall restrict our attention to the case of weight 2 cusp forms with trivial
nebentypus. We are interested in 'natural' subspaces of the vector space of cusp
forms.
z..
It is easy to see that, if g(z) E S2(M) and MIN, then g(dz) E S2(N) for
dl So we define the space of old forms of S2(N) as:
S~'d(N) := (g(dz) I g(z) E S2(M) with MIN; M "I N; dl z.). (4)
18 G. Frey, M. Muller
is denoted by S!jew(N), the space of new forms. fez) E S!jew(N) is called a new
form, if fez) = q + I:n;?:2 anqn and fez) is a Hecke eigenform.
The Heeke operator Tn with gcd(n,N) = 1 maps S!jew(N) (S~ld(N), resp.) to
itself and is selfadjoint with respect to the Petersson inner product. Hence S2(N)
decomposes into a direct sum of simultaneous eigenspaces and all eigenvalues are
real.
For elements in S!jew(N) we get more: Let E be a simultaneous eigenspace of
S!jew(N) under the Hecke algebra. Then dim (E) = 1 (Multiplicity one theorem)
[2].
Theorem 2.12. Let R be an commutative ring. The pairing
(See [8] or [31] for a proof.) We use this property to compute the Fourier expan-
sion of cusp forms (see Sec. 3).
Since 1l'N is a free Z-module of finite rank acting on S2 (N) we get
Lemma 2.15. Let f = q + I:~=2 anqn E S2(N) be a Hecke eigenform and
T E 1l' N. Then the eigenvalue AT is a totally real integral algebraic integer and
the field
(6)
is a finite extension of Q.
By definition 1l'N is a subring of End(Jo(N)).If N is square-free then
1l'N ® Q = End(Jo(N)) ® Q
(Ribet), and if N is a prime we even get
1l'N = End(Jo(N))
(cf. [28]).
Arithmetic of Modular Curves and Applications 19
Remark 2.16.
II 0 1)._
W r -- ( -N 0 .-
WN
"IN,,, prime
is the Fricke involution.
2. One can define the Hecke and Atkin-Lehner operators for cuspforms with
nebentype and higher weights (see [2]).
3. Without using the modular interpretation, but the theory of parabolic coho-
mology Shimura [45] proved the existence of a basis of modular (cusp) forms
with rational Fourier coefficients. For S2(N) this follows from the existence
of a non-degenerate pairing
of the integral homology with the cusp forms. This pairing is compatible
with the action of the Heeke operators, see [8].
4. A second way to define Heeke operators Tn is via the double coset
ro(N)\L1/ ro(N),
(9)
is a full lattice A v , i.e a discrete free Z-module of rank 2d in Cd. The complex
torus Av := (;d / A v has the structure of an abelian variety. For a proof see [45] or
[47]. Especially, given a new form f we construct the d-dimensional GQ-invariant
space
Vf := (fu, lUi Elf)
and the abelian variety Av! which we shall denote by AI. Shimura proved that
AI is the dual abelian variety of Af ® C
(See [45] or [8]).
To AI we associate - in the usual way - a geometric L-series:
For a prime number £ we define the £-adic Tate module as
The natural action of GQ on the £-adic Tate module induces an £-adic repre-
sentation:
(10)
hand side of (10) does not dependent on £ and is an element of the polynomial
ring Z[t] (see [43]). Let
2d
Lp(Af, t) = IT (1 - Ok,pt) (11)
k=l
be the factorization of the local Euler factor over the complex numbers. Then
the theorem of Hasse-Wei! states that IOk,pl ::; Vii and that we have equality if
Af has good reduction at p ( see [37]). Furthermore Lp(A" 1) is equal to the
number of points of the abelian variety Af ® lFp defined over the finite field lFp.
It follows that the L-series defined by
and define
L(f,s):= II Lp(f,p-S)-\ SEC.
p
Using properties of cusp forms one shows that: L(f' s) is a holomorphic function
on Re( s) > ~ and has an analytic continuation to the entire complex plane, see
[43].
A fundamental result is
Theorem 2.17. For a prime number p not dividing the level N, we have:
Corollary 2.18. L(A f , s) is a holomorphic function for Re(s) > ~ and can
be extended to an entire function to the whole complex plane which satisfies a
functional equation.
(13)
We summarize further properties of the abelian subvariety AI, see theorem 7.14
in [46).
Theorem 2.21. Let f = q + L:::'=2 anqn E S2(N) be a Hecke eigenform and let
KI = lQl(anln E N) be the field generated by the Fourier coefficients of f. Then
there exists an abelian subvariety AI of Jo(N) and an isomorphism 8 from KI
to End(Jo(N)) 01Ql with the properties:
The aim of this section is to describe the algorithm which computes S2(N) as
a 'fN[GQ]-module. The basic idea of the algorithm is to use Ht(Xo(N),Z), the
(+1)-eigenspace under the involution induced by i: Z f-+ -z on H 1 (Xo(N),Z),
instead of S2(N). This is possible since we have a good description of the ho-
mology space Ht(Xo(N),Z) by generators and relations and a non-degenerate
pairing
Remark 3.1. We restrict ourselves to the case of weight 2 cusp forms with
trivial nebentype. For more general cases see [32] and [50]. Merel makes use of the
Manin-Shokurov theory and Wang applied the Eichler-Shimura isomorphism
relating the space of cusp forms to the parabolic cohomology group, see [46].
(15)
One can show that the relative homology group splits into three direct summands
[32]
HI (Xo(N), cusp, Z) = Eis(N) EEl S2(N) EEl S2(N)
where Eis(N) corresponds to the Eisenstein series of ro(N) and S2(N) (S2(N))
to the holomorphic (anti-holomorphic) cusp forms.
Denote by J the right cosets r o(N)\SL 2 (Z). According to remark (3.2.3) the
map
if?: J - + HI(Xo(N),cusp,Z),j ....... {g(O),g(oo)} with 9 E j
is well defined.
Manin [28] proved:
Theorem 3.3. HI (Xo(N), cusp, Z) is a free Z-module of rank 2g + 1100 - 1 gen-
erated by the modular symbols {{g(O),g(oo)} I 9 E SL 2 (Z)}.
Recall: The projective line over Z/N is defined by
where
(c,d) rv (c',d') iff cd' == c'dmod N.
We denote the class of (c, d) by (c : d). It is a simple exercise to show that the
map
is a bijection between J and the projective line pl(Z/N). Extending this map
by linearity we get the Z-linear map
j+jS
j +jR+jR2
belong to the kernel of the map is.It can be shown that the kernel of the map is
is generated by these elements, see [28]. Therefore we get the isomorphism
pI (Z/N)
with the relations
(18)
(c: d) + (c : d)S
(c: d) + (c : d)R + (c : d)R2.
Arithmetic of Modular Curves and Applications 25
The next aim is to express this action on M. We follow Manin and Merel [32]
and collect their results in
Theorem 3.4. Let Mn := {a E M2x2 I det(a) = n} and
(c: d)M := { 0 rt
if (c : d)M ]P'l(Z/N)
(c: d)M if(c: d)M E ]P'l(Z/N).
such that
holds.
2. For any en satisfying condition (Cn ) and for gcd(n, N) = 1 the n-th Hecke
operator acts on Z[]P'l(Z/N)] by
- x >
- y = 0, and
-
Iy'l ::;
y' = 0, and Iyl ::; 2X.
tx'
Iyl > 0, x' > Iy'l > 0, and yy' >0
26 G. Frey, M. Miiller
e2 = [20]
01 + [21]
01 + [10]
12 + [10]
02
Remark 3.S.
3 0 0 0 0
-1 0 0 0-1
T2 = -1 0 0-1 0
-1 0-1 1 0
-1 -1 0 0 1
Next we compute the space Ht(Xo(N),Z). The map [) in the exact homology
sequence (15) is:
(19)
where a, b, c, d are integers and [%], [~] are cusps of ro(N). We divide out the
additional relation induced by the condition that the complex conjugation acts
1 This description of Hecke operators can also be found in the paper of Zagier [55J.
We thank the referee for this information.
Arithmetic of Modular Curves and Applications 27
trivial. (Of course, we can do this at the same time as we deal with the other
relations.) The quotient is denoted by M+. Via the isomorphism (17) we get the
map corresponding to 6 and we can compute the kernel of the boundary map 8.
Result: This kernel is dual to the space of cusp forms S2(N).
Remark 3.10.
1. There is a different method to identify the cusp form part in M+. It is
well known (see [46]) that there exists a basis {ft, ... ,fvoo-d of Hecke
eigenforms in the space of Eisenstein series which can be described explicitly.
Especially there are easy to compute formulas for the Fourier coefficients
of the J;. Therefore we know the operation of Tp on t:is(N) and so its
characteristic polynomial on this subspace of H 1 (Xo (N), cusp, Z). We use
this to find an element f in M which lies in the subspace S2(N), see [36].
2. With a similar trick as above we even find a vector f which lies in the 'new
part' of M or in the ±1 eigenspaces of the various Atkin-Lehner opera-
tors. This speeds up the computation of the minimal (characteristic, resp. )
polynomial which we have to determine.
3. One problem to calculate in the 'It-module M (see (18)) is to find a basis
for this space. To do this one has to use GauE elimination or sparse matrix
techniques to find a basis which can be rather painful since the matrices are
very large. But with a nice trick due to X. Wang (see [36]) we have a very
efficient implementation which avoids entirely such matrix operations.
4. We described the algorithm over C but we have a Z-structure on the symbols
and on the Hecke algebra '][' N. Therefore we can use the algorithm over Z
and over finite fields.
5. It turned out that for large n (about 109 ) and medium sized level N (~ 103 )
a rearrangement of the matrices can speed up the computation. For this
result and the implementation see [3].
Example 3.11.
The modular curve Xo(41) has genus 3 and the matrix for the Hecke operator
Tp with p = 109 + 7 is:
-15916-19368 -2773)
ATp = ( -52558 22820 35963 .
30417 -2773 -35284
Since the action of the Hecke operator is compatible with the pairing
(20)
28 G. Frey, M. Muller
N I dim(S2(N» I Splitting I
10009 833 393440
10037 836 398438
10039 836 384452
10061 838 383454
10067 839 399440
10069 838 406432
10079 840 1 353486
10091 841 1 382458
10093 840 410430
10099 841 1 1 396443
The description of the table is as follows: The first column is the level, the
second the dimension of the space of cusp forms. In the third column one finds
the dimensions of the 'll'N[GQ] invariant subspaces of S2(N). For example, for
N = 10099 the space S2(N) splits into two one dimensional subspaces, one 396
dimensional and one 443 dimensional subspace as 'll'N[GQ]-module. Since this
splitting induces a decomposition of Jo(N) in Q-simple abelian subvarieties (see
theorem (2.21)) we get:
In the last section we have determined the structure of S2(N) (or equivalently
of !71(Xo(N») as'll'N[GG]-module.
Now we want to compute explicitly a basis of S2(N)(Z) respectively a ba-
sis consisting of eigenforms. The elements f of these bases will be given by
the Fourier expansion of such forms. To identify them it is sufficient to com-
pute "enough" coefficients: If the level is N then f is determined by the first
If
L Dpl N(l + ~)J coefficients.
Later on we want to get finer information. For instance we want to compute
linear subspaces with special properties inside of the cusp forms and then we
shall need much more coefficients. So the performance of our algorithm will be
of great importance.
Arithmetic of Modular Curves and Applications 29
L
<Xl
By varying the random element e and the linear form if! we get a Z-basis of
8 2(N). (cf. [32]). More precisely: We use a sequence el := e, ei+1 = Tm(ei) with
gcd(m, N) = 1 and m as small as possible. This speeds up the computations
since we have to compute fewer matrices, see (3.4) and (3.5). We find with GauB
elimination or the Wiedemann algorithm (see [22]) an integer d such that the
set Bl := {ei I 1 ~ i ~ d} is linearely independent. If d = dim(S2(N)) we
have found a basis of S2(N) and we can use (21) to find the coefficients in the
q-expansion for each ei' In the case that d < dim(S2(N)) we choose another
random element e' or - alternatively - we take another Hecke operator Tn and
compute a second sequence e~. We add the new elements to the set Bl and test
linear independence again. We repeat this until we have a basis of S2(N) and
then we use (21) to find the Fourier coefficients of the corresponding cusp forms
and therefore a basis of 8 2 (N). In practice we have only to choose 2 or 3 random
elements e to find a basis of S2(N).
Example 3.12.
Take N = 45. The dimension of 8 2 (45) is equal to 3 and
{II =q- q4 _ qlO _ 2q13 _ q16 + 4q19 + 41 2 + ... ,
12 = q2 _ q5 _ 3q8 + 4qll _ 2q17 + q20 + ... ,
h =l - l - l - q12 + q15 + q18 + ... }
is a basis of 8 2 (45)(Z).
is a basis of S4(22)(Z).
4 Applications
4.1 Cusp Forms of Weight 1 and ~
In the previous section we described how one can compute cusp forms of weight
2: 2. The algorithm depends on the geometric interpretation of these forms. Such
an interpretation is not available for cusp forms of weight one and weight 3/2.
I; d:= 1 mod 4
Cd . - {
.- H;d:=3mod4
and take (cz + d)1/2 as square root of cz + d with non negative imaginary part.
Definition 4.1. Let N E 4N. For k E ~ + Z define Mk(N, X) (8dN, X), resp.)
as the set oj holomorphic Junctions on lHl bounded near the cusps (vanishing at
the cusps, resp.) such that Jor (~ ~) E ro(N) we have:
(cf. [47]).
In loco cit. one can find the definition of the" Shimura-Shintani" lift 'Pk which
maps these forms into forms of weight 2k - 1.
2 We want to thank V. Shoup and B. Raible for providing us with ntl resp. cin
32 G. Frey, M. Muller
In the frame of elliptic curves the case k = 3/2 is of special interest: Due
to the beautiful theorem of Waldspurger [49] one can compute the value of the
L-series at s = 1 of twists ED of a modular elliptic curve E with corresponding
modular form JE by the Fourier coefficients of a form FE of weight 3/2 with
<p;}.(FE) = fE. (For the precise formulation see [1].)
2
By multiplying with 8-functions one maps S ~ (N) into a product of spaces
S2(N') and then one describes the image as linear subspace ("Tate's trick"). For
example:
If 161N, then the map
where
L L
00 00
qm 2 and 8 3 (z) := 1 + 2 qm 2
m=l,m odd m=l
of the elliptic curve E. We want to calculate the degree of cpo The map cp induces
a homomorphism
rp: H1(E,Z) ---> Hl(Xo(N),Z).
Since dim( HI (E, Z)) = 2 it has two generators w, W as tl-module. A simple
calculation shows:
(rp(W),rp(W))Xo(N) = deg(cp)(w,w)E
El : y2 = x 3 + x 2 + X
E2 : y2 = x 3 - x 2 - 77x + 330
Hence deg(cp)2 describes the order of the largest group of torsion points which
cp*(E) and A<p have in common, and this group is GQ-invariant by definition. In
34 G. Frey, M. Miiller
other words:
There is an abelian variety A defined over Q which has no factor isogenous to E
but which contains a GQ-invariant subspace of order deg( rp? Galois isomorphic
to Edeg(<p), the kernel of the multiplication by deg(rp) on E.
Let us give a third interpretation of deg(rp). Since
II/W = deg(rp)Vol(E),
where II/W = (I, f) is the Petersson inner product and Vol(E) is the volume of
the fundamental discriminant of E we get a relation with the Faltings height,
see [14]:
Let hQ(E) be the Faltings height of E. Then
1 1
hQ(E) = "2log(deg(rp)) -log(lc<p1) - 27r log(11/11)
where c<p is the Manin constant (conjecturally IC<p1 = 1). The height conjecture
for elliptic curves predicts that
(24)
with constants C and d. It is well known that this conjecture implies the ABC-
conjecture. Estimates for IIIII (c.f. [12] and [35]) give: (24) holds if and only
if
log(deg(rp)) ~ O(log(N)).
Therefore it is of great interest to know more about the degree of the modular
parameterization.
One can regard analogous questions in the theory of cusp forms. This is the
subject of the very interesting theory of congruence primes which played a crucial
role in Wiles' proof of a large part of the Taniyama-Shimura conjecture. Let I be
a normalized cusp form oflevel N with integer coefficients. L := (I).l.. nS2(N)(Z)
where the orthogonal complement is taken with respect to the Petersson inner
product. Define r as the largest positive integer such that there exists a cusp
form gEL, with
g == I mod r.
Question 4.4. Let Ef be a strong modular curve, rp: Xo(N) ---t Ef be the
modular parameterization of Ef: Does the identity
hold?
It was proved by Ribet [42] (see also Zagier's paper [54]) that
rl deg(rp)lrN i
Arithmetic of Modular Curves and Applications 35
for some positive integer i. Furthermore if N is a prime number then the answer
of question (4.4) is yes, see [54].
At the end of this section we come back to the interpretation of deg( cp) as
order of cp*(E) n AlP and ask whether a part of this intersection can be realized
as an intersection between E and another elliptic curve E' defined over Q which
is not isogenous to E. In the language of cusp forms this means that
fE == fE' mod n
E[n] ~ E'[n]
as GQ-modules.
The question concerning the existence of such pairs of elliptic curves was
posed by Mazur, [30], page 133. It gave rise to various conjectures (cf. [14], (lb)).
The strongest formulation (due to Kani-Darmon) predicts that for primes l ~ 23
the set of such pairs should be finite and even empty if l is large enough (see also
Kani-Schanz [23] and Kraus-Oesterle [21] for this topic). By checking Fourier
coefficients of cusp forms corresponding to elliptic curves we find examples up
to primes l = 13 (for composite numbers see [18]). In the appendix table (5.3)
we give a list of these examples, see also [36].
Sp2d(Z) X IHId -----> IHId; « ~ 15) ,fl) ....... (Afl + B)(Cfl + D)-l
with A,B,C,D E Zdxd.
(25)
{WI, ... , W2d} is called a symplectic basis. We form the two complex d x d matrices
1 -1
[.? := -[.?2 [.?l
ed
the period matrix of the abelian variety A and the coset Sp2d(Z)[.? represents
the isomorphism class of A in Sp2d(Z)\lHld.
Let f be a new form of level N. The corresponding At is a Q-simple factor
of Jo(N) (see theorem (2.21)). In Wang [51] it is shown that the pairing
ImH(A, A) -+ Z
where Al arises from an old form coming of level 29, A3 is a 3-dimensional factor
and A2 is isogenous to the Jacobian of a hyperelliptic curve of genus 2 associated
to the cusp form of level 87 and weight 2, with Fourier coefficients:
0= ( .
i
0.63235869 ... + 0.89156287 ... -0.01791153 ... + 0.5141950 ...
-0.01791153 ... + 0.5141950 ... t -0.3085645 ... + 1.21905454 ... t
.
i) .
We find a model of the curve C over the complex numbers:
with
Xl = -0.200591196 ... - 0.027617620 ... i
X2 = 0.404999977 ... - 0.374771692 ... i
X3 = 1.314698500 ... - 0.312863721 ... i
and then we compute the absolute invariants as complex numbers
Since we know by theoretical reasons that these complex numbers are rational
numbers with denominators divisible at most by 2,3,29, we guess that
. 23 . 73 5 . 73 3 . 31 . 67 . 73 2 . 181
tl = 34 . 292 t2 = 2. 34 . 292 t3 = 23 . 34 . 292
To determine the curve equation we follow Mestre and have to intersect a cubic
with a quadric both defined over the rational numbers. In our case the quadric
is:
9417394 2 233886344942080 2 248292512862455513309184 2
48627125 Z1 + 9761465990744321 Z2 + 126163850622565659581977625 z3 = 0(27)
We have parameterize the quadric (27), and so we have to find an integral point
on it. One solution is:
Zl = -85739778180234548859481007438550060000
Z2 = -275094235504262977000201272298610147525
Z3 = 230658543012038610306820488436146680226
Now the intersection with the cubic is easily computed and gives a polynomial
f6(X) of degree 6 in Z[x].
(28)
Remark 4.7. We have to test the correctness of (28). We know the expected
conductor of C ( = N9 = 872 ) and the Euler factors of the L-series because
of the Eichler-Shimura relation (see Sec. (4.4)). Therefore we compute the con-
ductor of C and the local Euler factors for primes p by counting points on the
curve C modulo p. If these data match (for sufficiently many Euler factors) we
can be sure that the equation is the correct one (Theorem of Faltings).
With this method, Wang found hyperelliptic curves of genus 2 whose Jacobians
are factors of Jo(N). In the appendix 5 table (5.4) one finds a table computed
by him and partly published in [51].
E. Schaefer and V. Flynn used this table to verify the Birch-Swinnerton-Dyer
conjecture in these cases.
In the appendix table (5.5) there are examples of curves of higher genus
computed by H. -J. Weber.
Arithmetic of Modular Curves and Applications 39
We now come to the task of computing the Mordell-Weil group of the Jacobian
of a curve C in the global case. Of course we have to restrict ourself to a very
special situation. We assume that C is a hyperelliptic curve and that Jc is
isogenous to a factor of Jo(N).
We recall the theorem of Mordell-Weil: For a number field K and an abelian
variety A defined over K A(K) is a finitely generated Z-module and hence
where r K is the (geometric) rank ofthe abelian variety A and A( K}tor is the sub-
group of elements of finite order. Moreover A(K) @R has a Euclidean structure
induced by the Neron-Tate height h. We denote by
Since we can compute the Fourier expansions of the corresponding cusp forms
we can determine its value at s = 1 very precisely. So the most difficult part of
Manin's algorithm is to compute the height pairing. For elliptic curves there are
several implementations available which do this very efficiently (e.g. in SIMATH
or APECS.) For g( C) 2:: 2 and C hyperelliptic Ch. Hahne developped a method
to compute this height pairing by using Arakelov's intersection theory on the
arithmetic surface associated to C (see [16]). He implemented his algorithm for
curves of genus 2 and applied it to the curves given in the table table (5.4). He
succeeded to find a basis of Jc(IQ) in these cases. The results are:
which is a free module of rank 2 over lQ[x] with integral basis (1, Ll c ) (Lla is
the discriminant of F). In the table a point is given by its coordinates w.r.t. this
basis.
42 G. Frey, M. Miiller
5 Appendix
Table 5.1.
Splitting of SJ]ew(N)
11 1 1 1(1) 46 5 1 1(1)
12 0 0 47 4 4 4(1)
13 0 0 48 3 1 1(1)
14 1 1 1(1) 49 1 1 1(1)
15 1 1 1(1) 50 2 2 1(2)
16 0 0 51 5 3 1(1), 2(1)
17 1 1 1(1) 52 5 1 1(1)
18 0 0 53 4 4 1(1), 3(1)
19 1 1 1(1) 54 4 2 1(2)
20 1 1 1(1) 55 5 3 1(1), 2(1)
21 1 1 1(1) 56 5 2 1(2)
22 2 0 57 5 3 1(3)
23 2 2 2(1) 58 6 2 1(2)
24 1 1 1(1) 59 5 5 5(1)
25 0 0 60 7 0
26 2 2 1(2) 61 4 4 1(1), 3(1)
27 1 1 1(1) 62 7 3 1(1), 2(1)
28 2 0 63 5 3 1(1), 2(1)
29 2 2 2(1) 64 3 1 1(1)
30 3 1 1(1) 65 5 5 1(1), 2(2)
31 2 2 2(1) 66 9 3 1(3)
32 1 1 1(1) 67 5 5 1(1), 2(2)
33 3 1 1(1) 68 7 2 2(1)
34 3 1 1(1) 69 7 3 1(1), 2(1)
35 3 3 1(1), 2(1) 70 9 1 1(1)
36 1 1 1(1) 71 6 6 3(2)
37 2 2 1(2) 72 5 1 1(1)
38 4 2 1(2) 73 5 5 1(1), 2(2)
39 3 3 1(1), 2(1) 74 8 4 2(2)
40 3 1 1(1) 75 5 3 1(3)
41 3 3 3(1) 76 8 1 1(1)
42 5 1 1(1) 77 7 5 1(3), 2(1)
43 3 3 1(1), 2(1) 78 11 1 1(1)
44 4 1 1(1) 79 6 6 1(1), 5(1)
45 3 1 1(1) 80 7 2 1(2)
Explanation of the fourth column: The notation a(b) means that SJ]ew(N) has b
a-dimensional'll'N[GQ]-invariant subspaces.
;3 ;3
t?::f
tr tr
(D (D
~ ><
I» 01 01
~
~(D ~ ~
~ ~ II'" ., Large example of a Heeke operator
':'. ,;,; H ~ N = 95,p = 1000002667,9 = 9
~ ~ '" "' ... S'
'1
"' ... II ~ + ..
/I ,;,; H •• H ~'" ~ ~ 23806 -7956 -4126 -8456 20272 -16620 -8164 7070 474
!i", '" /I ~.
"'«:e..,'; /I 0
1IIIII00HDl .....
..... 0 -7752 18882 594 -13176 20272 -21340 -8164 7460 474
H ~ I-'" "" C"'f" oq
:!3"'R5H",~+ ~ =
'1
~ I ~ I 0 H -12482 -5280 6330 -7202 21232 -18724 -11522 16802 4694
(1)
= >
",,""00 g; "'13 ::l.
H~H~""+O (')
(1) 10000 636 -76 27152 -11522 15542 1922 -16328 -3944
1++IH""g. .,=
W<oooW+ H - tr
::i ..... ~.,.'-"+ (1) ATp = 32756 -2040 3000 12366 1050 17646 5280 ~26060 -8164
[,;-
I-"'OOW ~tV ....
.....
'" '-"
W 0
'" g,
"" ""
.,. W
~
(1)
-960 -20390 -12366 -3000 12482 5252 -5280 2850 8164
W
s:::
o
(1)
= -27464 -23466 -12658 -30758 26776 -33640
o
'" 10650 21944 19522
'-" :::: ~
"" -6"
.... ~
4330 -13380 9050 -9050 o -9050 0 22508 0 o
:f: :f: :f: i'i"
(')
II /I II 12484 -10204 -9108 258 -3734 844 586 -13202 25530
... ~ ... ~':::'~bl '1
=
Ol~ 01 .....
~/i~/iO/lg
=:i . . . =
.,~ Characteristic polynomial of the Heeke operator Tp
i
II ~ /I ~ II :;:; g.
8.
-.lO')-':'O')~O ~ .6'"
XATp (x) = (x 4 - 41852x 3 - 1705573680x 2 + 99715463270960x - 1145905723637314144) '0
""...... .....
"" .....
<0
(x 3 - 72244x 2 + 1423292512x - 4544597778992) (x - 13532)2
'" '" 0 ~
'-" ..... '"
'-" f-~-
""-:rg;""",oo~ ~
00 "" .,. .,. '" y"
o 0 ~ 1)
.,.
W
44 G. Frey, M. Millier
Table 5.4.
Tables of equations of hyperelliptic curves whose Jacobians are factors of Jo(N)
We remark that the models defined over Z[~] are calculated with a program of Liu
[26] and some corrections are made by E. Schaefer.
Arithmetic of Modular Curves and Applications 45
Table 5.5.
References
29. B. Mazur. Modular Curves and the Eisenstein ideal. Publ. Math. IHES,47:33-186,
1977.
30. B. Mazur. Rational Isogenes of Prime Degree. Invent. math. ,44:129-162, 1978.
31. L. Merel. Quelques aspectes arithmetiques et geometriques de la theorie des sym-
boles modulaires. PhD thesis, Universite Paris 6, 1993.
32. L. Merel. Universal Fourier expansions of modular forms. In G. Frey, editor, On
Arlin's Conjecture for Odd 2-dimensional Representations, volume 1585 of Lecture
Notes in Mathematics, pages 59-94, Berlin, Heidelberg, 1994. Springer-Verlag.
33. J. F. Mestre. La methode des graphes. Exemples et applications. In Proceedings of
the international conference on class numbers and fundamental units of algebraic
number fields, pages 217-242, Katata/ Japan, 1986.
34. J. F. Mestre. Construction de courbes de genre 2 a partir de leurs modules. In
T. Mora and C. Traverso, editors, Effective Methods in Algebraic Geometry, vol-
ume 94 of Progress in Mathematics, pages 313-334. Birkhauser Verlag, 1991.
35. L. Mai and M. R. Murty. The Phragmen-Lindelof theorem and modular elliptic
curves. Contemp. Math., 166:335-340, 1994.
36. M. MUller. Arithmetik von Modulkurven. PhD thesis, Universitat (GH) Essen,
1998. to appear.
37. David Mumford. Abelian Varieties. Oxford University Press, Oxford, 2 edition,
1974.
38. David Mumford. Curves and Their Jacobians. University of Michigan Press, Ann
Arbor, 1975.
39. D. Mumford. Tata lectures on Theta I-III. Birkhauser Verlag, Ziirich, 1983.
40. A. Ogg. On the Weierstrass Points of Xo(N). Illinois J. Math. ,22(1):31-35, 1978.
41. C. Poor. On the hyperelliptic locus. Duke Math. J. , 76(3):809-884, 1994.
42. K. Ribet. On mod p Heeke operators and congruences between modular forms.
Invent. math. , 71:193-205, 1983.
43. D. Rohrlich. Modular Curves, Hecke Correspondences and L-Functions. In G. Cor-
nell, J. H. Silverman, and G. Stevens, editors, Modular Forms and Fermat's Last
Theorem, pages 41-100, Berlin, Heidelberg, 1997. Springer-Verlag.
44. R. Schoof. Elliptic curves over finite fields and the computation of square roots
mod p. Math. Compo ,43:483-494, 1985.
45. J. P. Serre. Modular forms of weight one and Galois representations. In A. Frohlich,
editor, Algebraic Number Fields, pages 193-268, New York, 1977. Academic Press.
46. G. Shimura. Introduction to the Arithmetic Theory of Automorphic Jilunctions.
Princeton University Press, Princeton, New Jersey, 1971.
47. G. Shimura. On modular forms of half integral weight. Annals of Mathematics,
97:440-481, 1973.
48. G. Shimura. On the factors of the jacobian variety of a modular function field.
J. Math. Soc. Japan, 25(3):523-544, 1973.
49. J.-L. Waldspurger. Sur les coefficients de Fourier des formes modulaires de poids
demi-entiers. Journal de Mathematiques pures et appliquees, 60:375-484, 1981.
50. X. Wang. The Hecke operators on the cusp forms of ro(N) with nebentype. In
G. Frey, editor, On Arlin's Conjecture for Odd 2-dimensional Representations,
volume 1585 of Lecture Notes in Mathematics, pages 59-94, Berlin, Heidelberg,
1994. Springer-Verlag.
51. X. Wang. 2-dimensional simple factors of Jo(N). Manuscripta mathematica,
87:179-197, 1995.
52. H.-J. Weber. Hyperelliptic simple Factors of Jo(N) with dimension at least 3.
Experimental Math., 1997. to appear.
48 G. Frey, M. Miiller
53. A. Wei!. Zum Beweis des Torellischen Satzes. Nachr. Akad. Wiss. Gottingen,
2:33-53, 1957.
54. D. Zagier. Modular Parametrization of Elliptic Curves. Can. Math. Bull.
28(3):372-384, 1985.
55. D. Zagier. Hecke operators and periods of modular forms. In S. Gelbart, R. Howe,
and P. Sarnak, editors, Festschrift in honor of 1.1. Piatetski-Shapiro on the occasion
of his sixtieth birthday, volume 3 of Isr. Math. Conf. Proc., pages 321-336. The
Weizmann Science Press of Israel, 1990.
Local and Global Ramification Properties of
Elliptic Curves in Characteristics Two and Three
Ernst-Ulrich Gekeler
Introduction
As is well-known, the arithmetic of elliptic curves E in characteristics two and
three shows some special features peculiar to these characteristics.
Let for example E be defined over a local field K of residue characteristic p. The
conductor f(EI K) has the form ftame + fWild with ftame E {O, 1, 2} and fwild = 0
if p > 3. If char(K) = 0 then fwild is bounded by a constant c(K) depending
only on K, but this fails to hold in the case of equal characteristic.
The aim of the present article is twofold.
First, we study the possible conductors f(E I K), where K is a local field of
equal characteristic p = 2 or 3. It turns out that for p = 2, each non-negative
integer n may be realized as a conductor f(EIK) (Thm. 3.3, Cor. 3.4). This
contrasts with the case p = 3, where the set of conductors is {n E No I
n = 2 or n ~ 2 (mod 3)}, or equivalently, the set of wild conductors> 0 is
{n E N I n ~ 0 (mod 3)} (Thm. 3.10). Proofs of these facts are given by
specifying several series of ElK which give rise to the stated conductors, and
examining the possible output of Tate's algorithm. They do not give particular
insight why for p = 3 some conductors are missing.
Second, we consider curves ElK over the global field K = IF'q (T) of character-
istic two or three which are Tate curves at the infinite place and whose finite
ramification is concentrated in one rational place (without restriction, the place
corresponding to T = 0). (Without the condition at infinity, such a curve E
was twisted constant, cf. Prop. 4.2, 4.3.) Using the function field analogue of the
Shimura-Taniyama-Weil correspondence and some properties of Drinfeld modu-
lar curves, we obtain a complete classification of these curves (Thm. 5.4, Cor. 5.5
and 5.6). In some cases, dealt with in sect. 6, we are able to determine the torsion
of the Mordell-Weil group and the possible isogenies between such curves.
The results of this paper had their origin in calculations of Hecke operators in
spaces of automorphic forms over function fields. These were performed in the
framework of the "Schwerpunktprogramm Algorithmische Zahlentheorie und Al-
gebra" of Deutsche Forschungsgemeinschaft (DFG), whose support is gratefully
acknowledged.
In most cases, an elliptic curve E over a field K with algebraic closure K will
be given in WeierstraB form (WF)
(Ll)
b2 = ai, b4 = ala3, b6 = a~
bs = ara6 + ala3a4 + a2a5 + a~
(1.3)
Ll=~~+~~~+~~~+~~+~+~~
j = a~2/Ll.
Coordinate change with (u,r,s,t) yields
ua'l = al
u 2a'2 = a2 + sal + r + S2
u 3 a'3 = a3 + ral
u 4a'4 = a4 + sa3 + (t + rs)al + r2
u 6a'6 = a6 + ra4 + r 2a2 + r3 + ta3 + t 2 + rtal
(1.4) uSb's = bs + rb6 + r 2b4 + r3b2 + r 4
u 6b'6 = bs + r 2b2
u 4b'4 = b4 + rb2
u 2b'2 = b2
u l2 Ll' = Ll
j' =j.
If j 1= 0, we can find (u,r,s,t) that yields (upon replacing X', Y',a'i by X, Y,ai)
a short WeierstraB form (SWF)
(1.5)
s
Here Ll = as and j = a 1 . Note that among all coordinate changes, precisely
the s-transformations preserve (1.5).
(1.7) Let p be the map s t---t S2 + s, and for a E K, let K", / K be the separable
quadratic extension generated by a root s of pes) = a. If E is given in SWF
(1.5), its twist by K", (or simply by a, or by the corresponding character X'" of
Gal(K/K)) is
E", : y2 + XY = X 3 + (a2 + a)X2 + ae.
The set of isomorphism classes of E' / K that over K become isomorphic with
E corresponds bijectively to K/p(K) through a t---t E",. Note however that in
case j(E) = 0, (many) twists E' / K of E exist which are not of the form E",.
(1.8) If K is complete with respect to a discrete valuation v and v(j) < 0 then
(1.5) describes a Tate curve if and only if a2 E p(K) ([10] p. 33).
Let now K have characteristic three. Here the invariants are given by
~=~+~,~=~~-~,~=~+~
(1.9) bs = ara6 - ala3a4 + a2a6 + a2a~ - a~
Ll = -b~bs + b~, j = bVLl.
They transform under coordinate changes according to
ua'l = al - s
u 2a'2 = a2 - sal - S2
u 3a'3 = a3 + ral - t
u 4a'4 = a4 - sa3 - ra2 - (t + rs)al + st
uea'e = ae + ra4 + r 2a2 + r3 - ta3 - t 2 - rtal
(LlO) uSb's = bs + r 3b2
u 6b'6 = be - rb4 + r 2b2 + r3
u 4b'4 = b4 + rb2
u 2b'2 =b2
u 12 ,1' = ,1
j' =j.
(1.13) For a E K*, let K",/ K be the extension K", = K( fo). The corresponding
a-twist of the curve E/ K given by (1.11) is
E", : y2 = X 3 + (a2 . a)X2 + ae . a 3.
The set of isomorphism classes of E' / K that over K become isomorphic with E
corresponds to K* / K*2 through a t---t E",. Again, this doesn't remain true if
52 E.-U. Gekeler
j(E) = o.
(1.14) IT K is complete with respect to the discrete valuation v and v(j) < 0
then (1.11) describes a Tate curve if and only if a2 E K*2.
(2.3)
KOt/Kunramified ¢} f(KOt/K) =0 and
KOt/Kramified ¢} f(KOt/K) = 1, if p > 2.
Hence f(KOt/K) > 1 can occur for p = 2 only. The following is not difficult to
show (see e.g. [5]1.2, 1.3).
2.4 Proposition. Let char(K) = 2, i.e., K is isomorphic with a field of Lau-
rent series lFq «T»,
where q is a power of 2. The ramified separable quadratic
extensions K' of K are those K' = KOt where v(a) < 0 is maximal among all
v(a + s2 + s), s E K (or, what refers to the same, where v(a) < 0 is odd).
Writing v(a) = -k, the conductor is f(KOt / K) = k + 1.
The next result can easily be verified through a case by case consideration, using
(2.1) and (2.3), provided that p > 3. For the remaining cases of residue char-
acteristics p E {2,3}, the proof given in loco cit. Thm. 1.4 for K = lFq«T»of
characteristic two, applies verbatim.
Ramification of Elliptic Curves 53
2.5 Theorem. Let ElK be an elliptic curve with conductor f(EIK) E No, and
let Eo: be its twist by the separable quadratic extension Ko:I K with conductor
f(Ko:IK). Then
(3.2)
3.3 Theorem. Equation (3.2) is minimal in each of the following cases: (a)
°
m = 1 (b) m = 3 (c) m = 5 (d) m = 2, h3 i= (e) m = 4, 1 ~ 2, h5 i=
(f) m = 0, hI i= 0.
°
The conductors f = f (ElK) and K odaira types of E are given by:
(a) f = 61 + 1 = k + 2, type = n (b) f = 6l- 1 = k + 2, type = 10
(c) f = 61- 3 = k + 2, type = n" (d) f = 6l - 2 = k, type = 10
(e) f = 61 - 4 = k, type = n" (f) f = 61 = k, type = II.
3.4 Corollary. Let K be a local field of equal characteristic two. Each non-
negative integer occurs as the conductor of some elliptic curve E over K.
54 E.-U. Gekeler
Proof. The cases (a), ... ,(f) yield all the conductors I = 61 + 1,61 - 1,61 - 3,61-
2,61 - 4,61 with I ~ 3. Clearly, I = 0,1,2 are also attained, e.g. I = 2 by the
curve
y2 + 1I"XY + 1I"Y = X3. 0
3.5 Remarks. Cases (a),(b),(c) of (3.3), leading to odd conductors I, were al-
ready treated in [5J. The non-vanishing condition in (d2,(e),(f) is independent of
the choice of uniformizer 11". Its content is that f3 and f3 cannot be well approxi-
mated by squares.
Proolof Theorem 3.3. We apply the Tate algorithm ([14J, pp. 47-52), using the
notation given there. The quantities al, . .. , Ll of (3.2) are al = 11"1, a2 = a3 =
a4 = 0, a6 = i3, b2 = 11"21, b4 = b6 = 0, bs = 11"21 i3, Ll = 11"61 i3. We first deal with
the cases (a),(b),(c), where k is odd. Here the algorithm terminates without a
coordinate transformation.
Ifm = 1, the algorithm stops in step 3 with type II and 1= v(Ll) = 61+1 = k+2.
If m = 3, we pass step 5, where the polynomial (8.1) of loco cit. is peT) =
T3 + 11"- 3i3. Since peT) (mod 11") has three different roots in the algebraic clo-
sure of k(v), the algorithm stops in step 6, the type is IQ, and I = v(Ll) - 4 =
61-1 = k + 2.
If m = 5, peT) has a triple root 0 (mod 11"). The algorithm stops at step 10 with
type II" and I = v(Ll) - 8 = 61 - 3 = k + 2.
In the remaining cases, the algorithm requires some coordinate changes.
Let first m = 2, h3 -=1= O. We pass step 5 as before, having 1I"Ial and a2, 1I"21a3 and
a4. To obtain 1I"31a6, we perform the t-transformation with some t E OK such
that v(t2 + i3) ~ 3. This preserves the case conditions accumulated so far. Note
that vet) = 1. We now have
where in view of our condition h3 -=1= 0, the valuation v(a6) is 3. The polynomial
peT) is peT) = T3 + t1l"1-2T + 1I"-3(t2 + i3), which has three different roots
(mod 11"). Hence the algorithm stops in step 6 with type 10 and I = v(Ll) - 4 =
61- 2 = k.
Next, assume m = 4, I ~ 2, h5 -=1= O. After passing step 5, the polynomial peT) =
T3 + 11"- 3i3 has zero as a triple root (mod 11"), hence we are in the third branch.
Furthermore, the polynomial Q(Y) = y2 + 11"- 4i3 has a double zero (mod 11").
Applying a t-transformation with t E OK such that v(t 2 + i3) ~ 5 (which again
preserves the preceding case conditions) gives (*) with v(a4) ~ 4 and v(a6) = 5.
Thus the algorithm stops in step 10 with type II' and I = v(Ll) - 8 = 61- 4 = k.
Let finally m = 0 and hi -=1= O. Already in step 2, we have to use at-transformation
with t E OK such that v(t 2 + i3) ~ 1.
We get (*) with v(a6) = 1, which lets the algorithm stop in step 3 with Type II
and I = v(Ll) = 61 = k. 0
Next, we deal with the analogous case where char(K) = 3. We use the curve
E / K with equation
(3.6)
Ramification of Elliptic Curves 55
(3.9)
{J E ~ I f = 2 or f 1= 2 (mod 3)} C
{J E No If = f(EIK) for some elliptic curve ElK}.
3.10 Theorem. The two sets in (3.9) agree. That is, a non-negative integer f
is the conductor of some elliptic curve E over the local field K of characteristic
three if and only if f = 2 or f 1= 2 (mod 3).
Proof. It remains to show that f = 5, 8, 11, ... cannot be a conductor. This will be
done by examining the possible output of the Tate algorithm. The case j(E) =
is easily dealt with directly. Among the three remaining cases v(j(E)) > 0,
°
v(j(E)) = 0, v(j(E)) < 0, only the first one may produce conductors f strictly
larger than 2, as follows from (2.1), (2.3), and (2.5).
Thus let ElK be an elliptic curve with v(j(E)) > 0 and f(EI K) ~ 2, given in
WF (1.1). We may suppose a1 = a3 = 0 and a2,a4,a6 E OK. Then
follows, we allow ai = 0 and use the intuitive rules for inequalities involving
Vi = 00.)
I
Since v{b 2) = V2 > 0, we are in step 3. Suppose that Va = 11, in which case the
algorithm stops with type II, f = v{,1). Among the valuations 3V2 + 1, 2V2 + 2V4,
3V4 of the three summands -a~aa, a~a~, -al of ,1, it is impossible that 2v2+2v4
is least. If 3V2 + 1 = 2V2 + 2V4 then 3V4 < 3V2 + 1 = 2V2 + 2V4, hence v{,1) = 3V4.
If 2V2 + 2V4 = 3V4 then 3V2 + 1 < 2V2 + 2V4 = 3V4, hence v{,1) = 3V2 + l.
Therefore in any case, f = v{,1) ~ 2 (mod 3).
For further use, we label the present argument giving v{,1) by a (#).
I
Now assume Va ;?: 2. Ifl v{bs) = 21 (or equivalently, V4 = 11), the algorithm stops
with type III, f = v{,1) - 1 = 2.
Assume v{bs ) ;?: 3. Iflva = 21, we have type IV with f = v{,1) - 2. As in (#),
v{,1) = min{3v2 + 2, 3V4}, and thus f = v{,1) - 2 ~ 2 (mod 3).
Assume Va = v{b a) ;?: 3. Since V2 ;?: 1, V4 ;?: 2, Va ;?: 3, the polynomial P{T) =
T3+7r-la2T2+7r-2a4T+7r-3aa has coefficients in OK. Let P{T) be its reduction
I I
modulo 7r. We distinguish the following cases: (a) V2 = 11 (b) V2 > 1, V4 = 21
(c) V2 > 1, V4 > 2.
Case (a) is ruled out since then v(j) = v{aV,1) = 6 - v{,1) ::; 0 contradicts the
assumption v{j) > O.
In case (b), P{T) = T3 + cT + d with 0 t= c E k{v), which has three different
roots. Hence the type is 10 and f = v{,1) - 4 = 2.
In case (c), P{T) has a triple root, which we may assume T = 0, using a suitable
r-transformation if necessary. We then have V2 ;?: 2, V4 ;?: 3, Va ;?: 4.
If IVa = 41, the type is IV" and f = v{,1) - 6. Arguing as in (#), we get v{,1) =
min{3v2 + 4, 3V4} and f = v{,1) - 6 ~ 2 (mod 3).
Next, assume Va ;?: 5. If IV4 = 31, we have type III" and f = v{,1) - 7 = 2.
Let now finally V4 ;?: 4. If IVa = 51, the type is II' and f = v{,1) - 8. By means
of (#), we get v{,1) = min{3v2 + 5, 3V4} and f = v{,1) - 8 ~ 2 (mod 3).
If on the other hand Va ;?: 6, we may replace (a2' a4, aa) by (7r-2a2, 7r-4a4, 7r-aaa)
and start over. Since the algorithm stops after a finite number of loops, the only
f == 2 (mod 3) that may appear is f = 2. 0
For the remainder, we adapt the following notation. lFq is a finite field with q
elements (of characteristic two or three in most cases), and A is the polynomial
ring lFq [T] with field of fractions K = lFq (T) and degree function deg: K-+
Z U {-oo}. For a place V of K, K v , Ov, k{v) denotes the v-adic completion,
the completed v-adic integers and the residue class field, respectively. We simply
write "v = 0" and "v = 00" for the places corresponding to (T = 0) and
Ramification of Elliptic Curves 57
cond(E/K) = II (V)f(E/K v )
v place of K
4.2 Proposition. Let q be even, and let E / K have good reduction outside of
{oo}. Then j(E) E IF'q.
Depending on (a) 01= j(E) E IF'q (b) j(E) = 0, E is a quadratic twist (see (1.7))
of
E(j) : y2 + Xy = X 3 + rl (j = j(E))
or a twist (not necessarily quadratic) of
E(O) : y2 + Y =X 3•
Classifying the possible twists is easy in case (a) (lac. cit.) and difficult/still to
be carried out in case (b) (see [15]).
The same statement is true in characteristic three.
4.3 Proposition. Let q be a power of three and E / K an elliptic curve with good
reduction off {oo}. Then j(E) E IF'q.
Proof. Suppose that j(E) is non-constant. Then it has a pole at the unique place
00 of bad reduction, and E/ Koo is a twisted Tate curve. From (2.3) and Theorem
2.5 we see that f(E/Koo) ~ 2, Le., cond(E) = oof(E/K~} of total degree ~ 2,
which is impossible. 0
Again, depending on (a) 0 =f. j(E) E IF'q, (b) j(E) = 0, E is a quadratic twist of
E(j) : y2 = X 3 + X2 - r 1
E(O) : y2 = X 3 - X.
Since the requirement of good reduction outside of one rational place v leads to a
twisted constant curve (in characteristics two and three; in larger characteristics,
58 E.-U. Gekeler
such curves ElK cannot exist at all), we now admit two rational places v, w of
bad reduction of ElK. Again from (2.3), (2.5) and (4.1) (see also [5J 3.3), we
have the next two statements.
(4.4) There doesn't exist an elliptic curve ElK with multiplicative reduction at
the two rational places v,w of K and good reduction off {v,w}.
(4.5) Let ElK have multiplicative reduction at wand good reduction off {v, w},
where v i- w are rational places of K. Such curves ElK can only exist in char-
acteristics two or three, in which case v(j{E)) ~ o.
(4.6) ElK has good reduction outside of {O, oo} and split multiplicative reduc-
tion at 00.
These curves will be classified in the next section.
5.1 Theorem. For each elliptic curve ElK with the properties
EIKoo is a Tate curve, and cond(EIK) = (n)(oo)
((n) and (00) being regarded as divisors on K),
The preceding will be used as follows. We show that certain Drinfeld modular
curves Xo(n) are ordinary at certain places v of K. Together with (5.1), this
yields the ordinarity of elliptic curves ElK with cond(E) = (n)(oo) at v and,
using (1.6) and (1.12), (non-) divisibility properties of the j-invariants of such
ElK.
5.4 Theorem (cf. [5] 3.5). The Drinfeld modular curve Xo(Tk) is ordinary at
all the finite places v =f. 0 of K.
the cusps. In view of (iv) and Raynaud's criterion, the second ramification
group Qk,y,2 is trivial for each cusp y of Xk+l.
(vi) The Hurwitz formula ([12] VI sect. 4) together with (iii) implies that 7rk x K
and 7rk x k(v) have precisely the same ramification structure. That is, both
are unramified off cusps, and for each cusp y of Xk+l, redk+l identifies Qk,y,i
with Qk, red k+l(y),i.
(vii) Combining ~ (vi) with (5.3), we see that Xk+l x k(v) is ordinary if and
only if Xk x k(v) is.
(viii) As is well-known (e.g. [3] VII Thm. 5.13, or [6] Cor. 5.7), Xl has genus
zero and is therefore ordinary, as is its special fiber Xl x k(v).
(ix) By (vii) and (viii), X k x k(v) and therefore also its quotient Xo(Tk) x k(v)
is ordinary for all kEN. D
5.5 Corollary. Assume that K = IF'q(T) has characteristic two, and let E/K be
an elliptic curve as in (4.6).
Proof (cf. [5], Cor. 3.6). (i) Since E/ Koo is a Tate curve, its j-invariant j = j(E)
doesn't vanish, and E has a SWF (1.5). Now j = a 6l has a pole at 00 and,
by (1.6), (5.1) and (5.4), no other finite prime divisors than T, which yields the
shape of (3 = a6. Let a := a2. By (2.5), the extension Ka/K (see (1.7)) must
be unramified off {O} and split at 00 in order that E a,f3 = (Eo,f3)a satisfy the
conditions of (4.6). From (2.4) we see that a can be chosen as stated.
(ii) The arguments of (i) may be reversed.
(iii) Let ""," denote K-isogeny. We have Ea,f3 '" Ea2,f32 ~ Ea,f32 by the Frobenius
isogeny. Since the constant field IF'q is perfect, we can achieve k odd in the same
isogeny class. Replacing further a by a + pes) if necesary, we can annihilate the
even terms in a.
(iv) results from (2.4), (2.5) and (3.3). D
5.6 Corollary. Assume that K = IF'q (T) has characteristic three, and let E/ K
be as in (4.6).
Ramification of Elliptic Curves 61
EfJ : y2 = X 3 + X2 - /3,
where /3 = c' T- k , C E W;, kEN.
(ii) Each EfJ as in (i) has the properties (4.6)
(iii) E is K -isogeneous with some EfJ as above, where k is not divisible by three.
Such an EfJ is called a standard curve in characteristic three.
(iv) For E = EfJ as in (iii), we have cond(E/K) = (O)k+2(oo).
Proof. Analogous with the proof of (5.5), using (3.8) instead of (3.3). 0
Note that no quadratic twists of EfJ occur since there is no quadratic exten-
sion K",/ K unramified off {O} except for the quadratic constant field extension.
Standard curves in characteristics two and three are studied in more detail in
section 6.
(5.7) The proof scheme of Theorem 5.4 may be applied to more general sit-
uations. Suppose that X = Xo(n) is covered by some X k , where the series
XkH ~ Xk --t ... Xl of Drinfeld modular curves satisfies:
(i) All the Xk are defined over K and have good reduction at the place v of K;
(ii) For each k, 1Tk is galois with a p-group as its Galois group;
(iii) Xl is ordinary at v.
Then all the X k and therefore X are ordinary at v. (The condition that all
the Xk must be defined over K can be weakened, if we accept some technical
complications in the proof.) We illustrate that generalization with two examples.
More examples and a systematic study of the generalization alluded to will be
found in [11].
5.8 Example. Let K = lF2 (T). For each pair (k, l) of natural numbers and each
place v f; 0,1 of K, the Drinfeld modular curve XO(Tk(T + 1)1) is ordinary
at v. Consequently, each elliptic curve over K with conductor cond(E/ K) =
(O)k(I)I(oo) and Tate reduction at 00 has a j-invariant of the form j(E) =
TT(T + 1)8 with some r, s E Z, r + s > O. It may therefore be written in SWF
y2 + Xy = X 3 + aX 2 + T-T(T + 1)-',
where a E K is such that the quadratic extension K",/ K is unramified off {O, I}
and split at 00. (Such a are easily classified, using (2.4).)
G := == 1 (mod n) }.
{(acdb) E GL(2 ' A) I ca==== 0d(mod
k nk)
The series (Xk) satisfies the conditions of (5.7) except that the X k are defined
over the ray class field K(n) of K with conductor (n), and (i) and (iii) hold only
for all the extensions v' of v to K (n). (This weakening suffices for the conclusion.)
o
the +sign) and 3 (for the -sign) rational points over k(w) ~ IF'3. Hence E(3 and
E_(3 are non-isogeneous and {3 = {3', which shows (i).
(iii) From the above, a standard curve has at most cyclic 2- or 3-torsion and
no l-torsion for primes 1 > 3. Suppose that E has a non-trivial 2-torsion point.
It yields an isogeny cp: E --+ E' to some E' which itself is isogeneous with
some standard curve E" through a purely inseparable isogeny 7/J: E" --+ E'.
We see from (i) that E = E". The composite of 7/J with the transpose of cp is an
endomorphism of E of degree 2 x power of 3, which contradicts the fact End(Z) =
Z. The non-trivial 3-torsion points of ETk and E_Tk are (T- k/3, ±T-k/3) and
(_T- k/ 3, ±T-k/3), respectively, and are not K-rational for standard curves. 0
For a not necessarily prime constant field IF'q, we still can describe isogenies of
standard curves in some cases.
6.3 Theorem. Let q be a power of p E {2,3}, K = IF'q (T), and E a ,{3 (resp. E(3)
the standard curves in the two cases. There are no K -isogenies between standard
curves sharing the conductor (0)1 (00) at least in the following cases:
(i) p = 2, f = 3
(ii) p = 2, f = 4
(iii) p = 3, f = 3
Proof. First, recall some properties of the zeta functions of elliptic curves E / K
subject to (4.6).
For each place v i- 0,00 of K, let qv = #k(v), nv = #Ev(k(v)), the number of
k( v )-rational points of the reduction Ev, and Cv = qv + 1- nv. The zeta function
is
v
with Pv(s) = (1- cvq;S + q~-2s) if v is different from 0 and 00, Poo(s) = 1- q-S,
and Po(s) = 1. It is actually a polynomial in q-S of degree deg(cond(E/ K)) - 4
(which "explains" why cond(E/ K) has always degree at least 4. More details
about zeta functions can be found in [13], where however a different normaliza-
tion is used).
(i) The curves in question are the E O,{3, where {3 = b· T- 1 with some b E ~. The
multiplicative group ~ acts transitively on these through coordinate changes
"Iu : T = u . T' in K. Suppose that E = E o,{3 ,..., E O,{3' = E', where E O,{3' =
"I~(Eo,{3) with some u E ~ of order r. If v' = "I~(v) is the place v transformed by
"Iu, we have cv(E) = cv,(E') = cv,(E). Hence v f---t cv(E) is constant on orbits
of the subgroup (u) generated by u. In particular,
But 1 + E cv(E) is the coefficient of q-S in (E, s), hence vanishes since (E, s)
equals the constant 1. Therefore, r = 1 and {3 = {3'.
64 E.-U. Gekeler
(ii) Here we deal with curves Ea,{J, where a = aT-I, /3 = bT-I and a, bare
non-vanishing elements of lFq • Suppose two such are isogeneous, corresponding
to (a,/3) = =
(aT-I,bT- 1) and (0',/3' ) (aIT-1,b'T- 1), respectively. Applying
the ai-twist to Ea,{J and Ea',{J' yields Ea+a',{J""" EO,{J" which implies a+a' 0, =
hence a = a ' and EO,{J ,...., EO,{J' and finally /3 = /3' by (i).
(iii) The proof is identical to the one given in case (i). 0
6.4 Corollary. Let Jo(Tf) be the Jacobian of the Drinfeld modular curve Xo(Tf),
and Jeew(Tf) the corresponding new part. According to the cases of (6.3), we
have the following isogenies:
(p = 2)
(p = 2)
(p = 3),
where a = aT-I, /3 = bT- 1, and a, b run through F; .
Proof. The genera of Xo(Tf) equal 0, q - 1, q2 - 1 for f = 2,3,4, as results
from e.g. [1] Satz 3.4.18 or [6] Thm. 2.17. Therefore, JO(T3) =
J8 ew (T3) and
dim Je eW (T4) = (q _1)2. The assertion now follows from (5.1) and (6.3). 0
References
1. A. Brumer and K. Kramer: The conductor of an abelian variety. Compo Math. 92
(1994), 227-248.
2. E.-U. Gekeler: Drinfeld-Moduln und modulare Formen iiber rationalen Funktio-
nenkiirpern. Bonner Math. Schriften 119 (1980).
3. E.-U. Gekeler: Drinfeld modular curves. Lect. Notes Math. 1231, Springer 1986.
4. E.-U. Gekeler: Analytical construction of Weil curves over function fields. J. Th.
des Nombres de Bordeaux 7 (1995), 27-49.
5. E.-U. Gekeler: Highly ramified pencils of elliptic curves in characteristic two. Duke
Math. J. 89 (1997), 95-107.
6. E.-U. Gekeler and U. Nonnengardt: Fundamental domains of some arithmetic
groups over function fields. Int. J. Math. 6 (1995), 689-708.
7. E.-U. Gekeler and M. Reversat: Jacobians of Drinfeld modular curves. J. reine
angew. Math. 476 (1996), 27-93.
8. P. Lockhart, M. Rosen and J. Silverman: An upper bound for the conductor of an
abelian variety. J. Alg. Geom. 2 (1993), 569-601.
9. M. Raynaud: Mauvaise reduction des courbes et p-rang. C.R. Acad. Sci. Paris 319
(1994), 1279-1282.
10. P. Roquette: Analytic theory of elliptic functions over local fields. Vandenhoeck &
Ruprecht, Giittingen 1970.
11. A. Schweizer: In preparation.
12. J.-P. Serre: Corps locaux. Hermann, Paris 1968.
13. T. Shioda: Some remarks on elliptic curves over function fields. Asterisque 209
(1992), 99-114.
14. J. Tate: Algorithm for determining the type of a singular fiber in an elliptic pencil.
In Lect. Notes Math. 476, Springer 1975, 33-52.
15. A. Weil: Exercices dyadiques. Invent. Math. 27 (1974), 1-22.
Techniques for the Computation of Galois
Groups
Alexander Hulpke
Galois theory stands at the cradle of modern algebra and interacts with many
areas of mathematics. The problem of determining Galois groups therefore is of
interest not only from the point of view of number theory (for example see
the article [39] in this volume), but leads to many questions in other areas of
mathematics. An example is its application in computer algebra when simplifying
radical expressions [32].
Not surprisingly, this task has been considered in works from number theory,
group theory and algebraic geometry. In this note I shall give an overview of
methods currently used.
While the techniques used for the identification of Galois groups were known
already in the last century [26], the involved calculations made it almost imprac-
tical to do computations beyond trivial examples. Thus the problem was only
taken up again in the last 25 years with the advent of computers.
In this note we will restrict ourselves to the case of the base field <Q. Most
methods generalize to other fields like <Q(t), <Qp' lFp(t) or number fields.
The results presented here are the work of many mathematicians. I tried to
give credit by references wherever possible.
1 Introduction
2 Local Analysis
For a prime p we denote reduction modulo p by 1r. This reduction extends nat-
urally to the algebraic integers and to polynomial rings. If p does not divide the
discriminant of J, a theorem of DEDEKIND shows that the Galois group G of the
reduced polynomial J7r E IFp[x] embeds into G ([42, (II.7.12)], a proof in a more
computational context can be found in [49, §66]).
This Galois group G over IF p is cyclic, its orbits on the approximate roots
0i7r are simply given by the IFp-irreducible factors of J7r. We thus have
Lemma 1. If p does not the discriminant disc(J), the irreducible factors of J7r
correspond to the cycle structure of an element of G; if J7r = ITi fi, this element
is of the form (... ) . (... ) ..... C···) .
~~ ~
deg" deg h deg f~
(For ramified primes we can embed the Galois groups over <Qp into G. For
practical purposes however this usually does not yield new information.)
While this yields cycle structures of elements of G, the corresponding ar-
rangement of the 0i is only determined modulo p. Without further information
about the Galois group G there is no possibility to "connect" the arrangements
for different primes. Thus the cycle structures obtained this way can only be
used to rule out candidates for G which are too small to contain all cycle struc-
tures found. This however permits to identify symmetric and alternating groups
quickly [15], which is of practical importance as asymptotically all polynomials
have the symmetric group as Galois group [48]. As it requires only factorizations
over Fp this test is very cheap and should always be run as a first filter to restrict
the type of G.
Using analytic number theory, one can generalize Lemma 1 to TSCHEBOTA-
REFF'S theorem [47], by which the density of primes corresponding to a given
Techniques for the Computation of Galois Groups 67
cycle structure equals the frequency of this cycle structure among the elements
of G. Effective bounds for the probability that all shapes have been found when
considering only a limited number of primes are given in [31). This permits a
probabilistic approach to finding G by factoring I modulo different non-ramified
primes and checking for which transitive subgroup of Sn this approximates the
shape distribution best. Besides its probabilistic nature, this approach gets into
problems if the shape distribution does not identify groups uniquely. It happens
first in degree 8 with the groups TsNlO = [22)4 and TsNu = Qs : 2.
As the Galois group of lover a local field <Qp is usually a proper subgroup of
G, such local methods alone cannot determine G, but we have to look at global
properties:
3 Invariants
The approach to identifying the Galois group G will be to show that certain
relations between the roots of I are respected by G. This permits to identify G
from a list of transitive subgroups of Sn by finding enough relations that hold
(or do not hold) to determine one subgroup from this list uniquely.
The algorithms therefore usually rely on lists of transitive subgroups of Sn.
These subgroups are classified up to degree 31 [24) which covers the currently
interesting range for n. Explicit lists up to degree 15 can be found in [12).
The tool for the identification of G is the polynomial ring R = L2"[Xl, ... , xn).
The symmetric group Sn acts on R by permutation of indeterminants. We call
hER an invariant for U ::; Sn if h U = h for all u E U. The ring of all U-
invariants is traditionally denoted by R U . To abbreviate notation we shall write
~ for (Xl, ... , xn).
The specialization homomorphism cp: R -+ O(L), h t-+ h(al, ... , an) con-
nects the permutation action of G on R (by permuting the indeterminants)
with the Galois action of G on L; it is a homomorphism of G-modules. As
O(L) n <Q = L2", G-invariance of hER implies that cp(h) E L2". The con-
verse of this is not true in general: The polynomial I = x4 - 2 has Galois group
D( 4), generated by the permutations (1,2,3,4), (1,3) with respect to the root ar-
rangement {-v2, i-v2, --v2, -i-v2}. Then h = X1X2 -X3X4 is not D(4)-invariant,
though cp(h) = 0 E L2". This however is an "accidental" relation among the roots
which is not due to the Galois group. The following lemma shows under which
conditions this can be avoided
Lemma 2. II
cp(l) 1= cp(h) lor all l E h Sn \ {h} (1)
Proof. Assume that h is not G-invariant and cp(h) E L2". Then there is agE G
such that h 9 1= h, so we have by the assumption (1) that cp(h) 1= cp(h9), and
because cp respects the actions cp(h 9) = cp(h)9 = cp(h) holds, contradiction.
68 A. Hulpke
f by a Tschirnhaus transfor-
If the condition (1) is not fulfilled, one can change
mation to another polynomial j which defines the same field (and thus has the
same Galois group). It is shown in (21) that for a given h it is always possible
to find such a transform j such that condition (1) is fulfilled for 'Pi' Care how-
ever has to be taken to ensure that the coefficients of j do not become too big.
Therefore in practice only very simple Tschirnhaus transformations are used and
it may be well worth to try an alternative h instead.
As the roots 0i are only known by approximations ai, in practice however 'P
is only known by an approximation fP: h I-t h(al' ... ,an)' The test for integrality
therefore has to rely on the approximation being good enough.
4 Descending Approach
1 The name resolvent dates back to Lagrange. Polynomials of this type can be used to
construct subfields and thus were used for solving polynomial equations.
Techniques for the Computation of Galois Groups 69
Instead of continuing with this conjugate, the algorithm then re-sorts the roots
appropriately and continues with V.
A further advantage of forming the resolvent R is that R must be invariant
under the Galois action of G and thus has integer coefficients. When computing
with approximate roots (and using <p instead of <p) therefore the coefficients of
R can be rounded to the next integer. This permits to use exact methods to test
for integral roots.
Finally, as G-invariance is tested via c.p, a condition like (1) must be fulfilled.
As G ::; U the symmetric group Sn can be replaced in (1) by U; sufficiency holds
if R is square-free (a formal proof can be found in [46]).
Bounds for the needed accuracy of root approximation are given in [17, 2.1.3].
If the coefficients of f become big, however, these bounds can become infeasibly
large and thus some of the implementations work by standard with lower, non-
guaranteed bounds.
For the approximation of the roots ai essentially two methods have been
used; numerical approximation and p-adic approximation. The main advantage
of numerical approximation is that it is probably more likely available in a pro-
gramming language and that it allows to obtain quickly results which are not
guaranteed to be correct. Its main disadvantage is that approximation is not a
ring homomorphism and thus error propagation is difficult to keep under control.
To the authors knowledge there are neither theoretical results, nor implementa-
tions ("validated numerics") which analyze error propagation in these cases and
thus give proven results.
On the other hand p-adic approximation behaves much nicer from an alge-
braic viewpoint in that the approximation is a ring homomorphism and thus no
error propagation will happen. To compute bounds, estimates for the absolute
values lail are still needed. These can be obtained from general estimates as
found in [41, section 4.3].
The approach of p-adic approximation, combined with numerical approxi-
mation to obtain better bounds for the lail, has been used in [14] to verify
polynomials with Galois groups Mn and M 12 .
The problem of approximation can be avoided completely if deferring the
evaluation c.p as shown in [9]: If h is a U-relative invariant for V, the coefficients
of the unevaluated resolvent R = TIgE(hU)(t- g) E <Q(~)[t] are invariant under U
and therefore expressible in generators of the invariant ring <Q[~]u. On the other
hand, the elements {I, h, h2, ... , hlu:vI} form a <Q(~)U -Basis of <Q(~) v and [11]
gives an algorithm to express V-invariants in this basis explicitly. This permits
the following inductive approach: We start with U := Sn whose invariants are
generated by the elementary symmetric polynomials. At each step we assume
that we can express every U invariant polynomial in terms of the elementary
symmetric polynomials and known relative invariants of U and of subgroups
Sn ~ W ~ U. This certainly holds for U = Sn. If we descend to a subgroup V
70 A. Hulpke
there is a new invariant ring Q(~) v which is generated by Q(~)U and the invari-
ant h. The above mentioned algorithm now permits to express every V -invariant
in terms of Q(~)U and h. By the assumption on Q(~)U this yields an expres-
sion of every V-invariant in terms of the elementary symmetric polynomials and
known invariants of subgroups Sn 2: W 2: V. This however is the necessary
assumption for the induction once V becomes itself a new U in the next step.
By using the evaluations tp( ei) of the elementary symmetric polynomials, which
are (up to a sign) the coefficients of j, and the evaluations tp(h) of invariants of
larger subgroups, which are the integral roots obtained in earlier steps, we can
thereby express any tp-evaluated U-invariant. One thus obtains the evaluated
resolvents TIgE(hU)(t - tp(g)) needed for the algorithm.
If not tp( h) but the evaluation tp(g) of a conjugate is an integral root, of course
this conjugate 9 = h U (which is an invariant for the corresponding subgroup V U )
must be used.
In the process of specialization, denominators in expressions in Q(~) might
vanish. If this is the case, another invariant h(P(xd, ... ,p(xn)) for P E Q[t] is
chosen. It is shown in [9] that there always is a transform for which denominators
do not vanish when specializing.
There are various implementations of the descent method available [20, 18,
17,19] as standalones or in the systems PARI [4] and KANT [13].
4.2 Variations
To overcome the problem of a large index [U : V] when evaluating invariants, J.
McKAY suggested the following approach: Suppose we know an element e E G
by its explicit action on the approximate roots. This is the case, for example, for
the complex conjugation when using numerical approximation of the roots, or
for the FROBENIUS automorphism when using p-adic approximation. If hER-
is an invariant for V < U, an invariant image hg (g E G) is invariant under G
only if it is invariant under e. This permits to reduce the number of images of
hg that have to be evaluated via tp from [U : V] to lEI with
E = {g E Repres(V\U) I hg = (hgy = hge }
= {g E Repres(V\U) I Vg = Vge}.
A group theoretic argument shows that if the class of e in G is stable under
automorphisms, it is possible to choose E = Repres(Cv(e)\Cu(e)) and therefore
lEI = [Cu(e) : Cv(e)]. In the case of M12 < A 12 , for example this permits to
reduce 2520 potential images to 24.
For identification purposes, of course, E may not be chosen a priori for one
representative V, but must be selected from h and e. The reduction saves however
evaluations cpo Usually the images hg (g E E) do not contain full orbits of G,
therefore no resolvent is formed and the evaluated images cp(hg) must be tested
directly.
Another variant of the descending approach has been suggested in [51]: Here
the test for rationality of tp(h) is replaced by testing whether (h - tp(h)) is
Techniques for the Computation of Galois Groups 71
contained in ker <p <l R. This ideal is generated by the elementary symmetric
functions equating the coefficients of f. The ideal membership test is done using
p-adic approximation of idempotents in R/ ker<p.
4.3 Invariants
For the determination of invariant polynomials it is of course sufficient to obtain
one V-invariant polynomial which is not U-invariant, instead of generating the
full invariant ring for V. This invariant however should be chosen in a way to
keep the necessary approximation accuracy low for deducing that VJ(h) E LZ from
the value of ip(h). The needed accuracy grows with the absolute value IVJ(h)l. A
good heuristic to keep this value low is to select h to be of lowest possible degree.
Therefore the trivial possibility h = LVEV(XIX~.· .. ·x~)V is usually unsuitable.
An algorithm to compute a better h is given in [22].
If numerical approximation of the roots is used, the evaluation order in ip
can become crucial. In this situation it might be necessary to use the invariant
h in factorized form or to select other invariants to avoid exaggerated error
propagation.
A main problem of this approach is in the first steps down from the sym-
metric group. It is known [37] that almost all transitive maximal subgroups of
Sn or An have large index. Thus for almost all possible Galois groups G every
descendant chain from Sn down to G will contain a step with large index. A
large index however implies a large resolvent degree and in turn the need for a
high approximation accuracy.
5 Subfields
Subfields of <Q(o) form an important Galois invariant. Over the past years a
couple of algorithms for their computation have been suggested [33,16,34,5,23,
28,27], the last probably being the most effective at the moment. By the Galois
correspondence subfields correspond to subgroups of G which properly contain
the point stabilizer Staba (0). They therefore correspond to block systems of G as
permutation group. Suppose that <Q < <Q({3) < <Q(o) is a subfield with {3 = k(o)
and that m is the minimal polynomial of {3. Then by the embedding theorem for
imprimitive groups [29], G is a subgroup of the wreath product W := Sn I M,
where M is the Galois group of m. This embedding information can be used to
start the descent for the determination of G not with Sn but with W. To this end,
one has to determine the arrangement of the roots corresponding to this wreath
product: The arrangement of the blocks is determined by the arrangement of the
roots {3j of m corresponding to M, the root 0i is in the block corresponding to {3j
if k( Oi) = {3j. This arrangement ofthe roots to the blocks determines a conjugate
72 A. Hulpke
W' of W such that the descent process can be started with U = W. If several
block systems exist one can embed simultaneously in different wreath products
and thus start with U being the intersection of all these wreath products.
For imprimitive groups, this approach permits to avoid the large steps down
to maximal subgroups (which are in most cases wreath products and thus con-
tain W) of Sn, respectively An. It has been used successfully in the KANT
implementation [19].
As the computation of resolvents can be hard any information they convey should
be used. The descent method just checks for linear factors, that is orbits of G of
length one. As the roots of an irreducible polynomial form an orbit of the Galois
group, a complete factorization of a resolvent however exhibits also other orbits
of G and therefore may give further information about the Galois group. For
example [10] obtains an invariant for a subgroup of U containing G by complete
factorization of a resolvent for another subgroup V < U which not necessarily
contains G.
Let 1i c n be a set of polynomials which is invariant under G. Then
R = R(1i, f) = I1hEll(x - cp(h)) has rational coefficients. If condition (1) is
fulfilled (that is if R is square-free, again this can be ensured by a Tschirn-
haus transformation on 1) the irreducible factors of R correspond to the orbits
of G on 1i in the following way: Every orbit K C 1i corresponds to a factor
I1kEdx - cp(k)) and all factors of R arise this way. Furthermore, the image of
the operation of G on the orbit K is the Galois group of the corresponding poly-
nomial factors. So for example the factor discriminant is a square if and only if
the image group is a subgroup of the alternating group.
Similarly, by the Galois correspondence, orbits of a subgroup U < G corre-
spond to factors over a subfield <Q < <QCB) < L which is the field of elements
fixed by U. Such subfields of the splitting field can be obtained for example from
factors of (other) resolvents.
For a given degree and certain resolvents this information (mainly orbit
lengths) can be tabulated for all transitive subgroups of Sn a priori by sim-
ple group theoretic calculations. By considering sufficiently many resolvents to
distinguish all groups this permits to eliminate all but one conjugacy class of
transitive groups (all properties are conjugacy invariant as no roots are labelled),
which in turn has to contain the Galois group. This approach has been suggested
in [44,45]. In this form no arrangement of (approximate) roots will be obtained.
Because usually no subgroup U < Sn with corresponding root arrangement
is known a priori, the sets 1i are typically full orbits of Sn. We shall write
R(h,1) for R(h Sn , f). In [45] it is suggested to use linear polynomials for h.
For h = Xl + ... + Xm or h = Xl . . . . . Xm the set 1i correspond to the family
of m-subsets of {I, ... ,n}, similarly h = Xl + 2X2 + ... + mXm corresponds to
m-tuples. In [6] formulae for both types of set resolvents are given which require
only rational arithmetic. Resolvents arising from more general h are considered
Techniques for the Computation of Galois Groups 73
in [3J and [35J which use techniques from commutative algebra like resultants for
the computation.
Again shapes are used as a first filter. The existence of subfields, respectively
block systems, can be used as a further restriction.
A possible advantage of this method is that the resolvents considered do not
necessarily have to be "fitted" to subgroups containing the Galois group: The
resolvent R(h, 1) can be considered as an resolvent for the pair Stabsn (h) < Sn,
but if G ~ Stabsn (h) (up to conjugacy) this resolvent has no linear factor,
nevertheless the factor degrees will restrict the possibilities for G. This some-
times permits to use resolvents of smaller degree than used in the pure descent
approach. Also for groups G for which the chain G < Um < ... < U1 < Sn
contains many steps, not for every step a new resolvent has to be evaluated.
In this approach the approximation of roots takes place in the polynomial
factorizing algorithm and can therefore be considered to be under control. The
factorization of resolvents however can become a major obstacle: Because it is
an orbit length of Sn, the set sizes IhSn I - and thus the resolvent degrees - soon
become big. In addition, the resolvent polynomials are essentially of the worst
possible kind for the traditionally used Hensel-lifting based factorizing approach
[52J: By Lemma 1 they will split modulo each prime in factors whose degrees
are orders of elements of G while they factorize in characteristic zero in fac-
tors whose degrees are orbit lengths of G. Unless G is in regular representation
both measures can differ substantially and therefore many potential combina-
tions of the lifted factors have to be tried before the true factorization is found.
The most extreme are elementary abelian 2-groups, for which the resolvents are
Swinnerton-Dyer polynomials.
On the other hand the coefficients of the resolvents become that big, that the
break-even point for a polynomial time factoring algorithm [36] is yet beyond
the runtime for the classical approach.
The algorithm of [45] is implemented in Maple [7] up to degree 7. An ex-
tension to degree 8 for polynomials over <Q(t) is described in [38]. It is again
implemented in Maple. Tabulating further data for linear resolvents and using
also factorization over algebraic extensions to determine orbits of subgroups, this
approach has been implemented by the author in GAP 3 [43] up to degree 15, but
for some groups calculations in degrees 12 and beyond become infeasibly slow.
The factorizations of various resolvents not arising from the symmetric group is
tabulated for degree up to 11 in [2].
7 Relation Stabilizers
Evaluation of ij5(h) (h E 11.) and a test, of which resolvent factor the result is an
approximate root, permits to obtain the orbits of G on 11. from the factorized
resolvent. If approximation modulo p is used the additional computing time for
this is neglectable. This permits to compute the set-wise stabilizer Stab sn (11.)
of the orbit in the symmetric group. This stabilizer certainly must contain the
Galois group G. Thus subgroups containing G can be obtained without requiring
74 A. Hulpke
simple groups it is known that beyond triply transitivity the only offenders are
the Mathieu groups.
9 Final Remarks
Many of the implementations mentioned are available on the internet:
- The implementation of Y. Eichenlaub and M. Olivier can be found at
ftp://megrez.math.u-bordeaux.fr/pub/galois
- KANT (containing the routines by K. Geissler) is obtainable from
ftp://ftp.math.tu-berlin.de/pub/algebra/Kant/Kash/
- GAP (which contains the authors implementation) can be found under
http://www-gap.dcs.st-and.ac.uk/-gap/lnfo/distrib.html
This article benefitted implicitly from explanations by and discussions with
J. Kliiners, J. McKay, and L. Soicher, whom I would like to thank. The authors
implementation would have been impossible without many example polynomials
constructed by G. Malle, whom I would also like to thank for extensive tests of
the algorithms. Last, but not least, I would like to thank DFG, EPSRC and the
EU HCM program for their financial support.
References
1. Anai, H., Noro, M., Yokoyama, K.: Computation of the splitting fields and Ga-
lois groups of polynomials. In Algorithms in Algebraic Geometry and Applica-
tions ( L. GonzaJes-Vega, T. Recio, eds.), volume 145 of Progress in Mathematics.
Birkhiluser, Boston, 1996 22-50
2. Arnaudies, J.-M., Valibouze, A.: Groupes de Galois de polynomes en degre 10 ou 11.
Rapport interne 94.50, Laboratoire informatique theoretique et programmation,
Universite Paris VI, (1994)
3. Arnaudies, J.-M., Valibouze, A.: Lagrange resolvents. In Cohen and Roy [8], 1997
23-40
4. Batut, C., Bernardi, D., Cohen, H., Olivier, M.: User's guide to pari-gpo Technical
report, Universite Bordeaux I, (1993)
5. Casperson, D., Ford, D., McKay, J.: An ideal decomposition algorithm. J. Symb.
Comput. 21 (1996) 133-137
6. Casperson, D., McKay, J.: Symmetric functions, m-Sets and Galois groups. Math.
Compo 63 (1994) 208 749-757
7. Char, B. W., Geddes, K. 0., Gonnet, G. H., Monagan, M. B., Watt, S. M.: MAPLE
- Reference Manual, 5th edition. University of Waterloo, (1988)
8. Cohen, A. M., Roy, M.-F. (eds.): Proceedings MEGA'96, volume 117-118 of J.
Pure Appl. Algebra, (1997)
9. Colin, A.: Formal computation of Galois groups with relative resolvents. In Applied
algebra, Algebraic algorithms and Error-correcting Codes ( G. Cohen, M. Giustini,
T. Mora, eds.), volume 948 of Lecture Notes in Computer Science. Springer, Hei-
delberg, 1995 169-182
10. Colin, A.: Relative resolvents and partition tables in Galois group computations.
In Kiichlin [30], 1997169-182
76 A. Hulpke
11. Colin, A.: Theorie des invariants effective. Applications ala theorie de Galois et
a la resolution de systemes algebriques. Implantation en AXIOM. Ph.D. thesis,
Ecole poly technique, (1997)
12. Conway, J. H., Hulpke, A., McKay, J.: On transitive permutation groups. to appear
in LMS Journal of Computation and Mathematics 0
13. Daberkow, M., Fieker, C., Kliiners, J., Pohst, M., Roegner, K., Schornig, M.,
Wildanger, K: KANT V4. J. Symb. Comput. 24 (1997) 267-283
14. Darmon, H., Ford, D.: Computational verification of Mn and M12 as Galois groups
over Q. Comm. Algebra 17 (1989) 2941-2943
15. Davenport, J. H., Smith, G.: Fast recognition of symmetric and alternating Galois
groups, (to appear)
16. Dixon, J. D.: Computing subfields in algebraic number fields. J. Austral. Math.
Soc. Ser. A 49 (1990) 434-448
17. Eichenlaub, Y.: Problemes effectifs de theorie de Galois en degres 8 a 11. Ph.D.
thesis, Universite Bordeaux I, (1996)
18. Eichenlaub, Y., Olivier, M.: Computation of Galois groups for polynomials with
degree up to eleven. Preprint, Universite Bordeaux I, (1995)
19. Geissler, K: Zur Berechnung von Galoisgruppen. Diplomarbeit, Fachbereich Math-
ematik der TU Berlin, (1997)
20. Geyer, H.: Programme zur Berechnung der Galoisgruppen von Polynomen 8. und
9. Grades. Preprint 93-10, IWR Heidelberg, (1993)
21. Girstmair, K.: On the computation of resolvents and Galois groups. Manuscripta
Math. 43 (1983) 289-307
22. Girstmair, K: On invariant polynomials and their application in field theory. Math.
Compo 48 (1987) 781-797
23. Hulpke, A.: Block systems of a Galois group. Experimental Mathematics 4 (1995)
11-9
24. Hulpke, A.: Konstruktion transitiver Permutationsgruppen. Ph.D. thesis,
Rheinisch-Westflilische Technische Hochschule, Aachen, Germany, (1996)
25. Hulpke, A.: Galois groups through invariant relations. In Groups '97 Bath/St.
Andrews ( C. M. Campbell, E. F. Robertson, G. C. Smith, eds.). Cambridge Uni-
versity Press, to appear
26. Jordan, C.: Traite des substitutions et des equations algebriques. Gauthier-Villars,
(1870)
27. Kliiners, J.: Uber die Berechnung von Automorphismen und Teilkorpern algebrais-
cher Zahlkorper. Ph.D. thesis, Technische Universitat, Berlin, (1997)
28. Kliiners, J., Pohst, M. E.: On computing subfields. J. Symb. Comput. 24 (1997)
385-397
29. Krasner, M., Kaloujnine, L. A.: Produit complet des groupes de permutations et
probleme d'extension de groupes II. Acta Sci. Math. (Szeged) 14 (1951) 39-66
30. Kiichlin, W. (ed.): Proceedings of the 1997 International Symposium on Symbolic
and Algebraic Computation. The Association for Computing Machinery, ACM
Press, (1997)
31. Lagarias, J. C., Odlyzko, A. M.: Effective versions of the Chebotarev density theo-
rem. In Algebraic Number Fields (L-functions and Galois properties) ( A. Frohlich,
ed.). Academic Press, 1977409-464
32. Landau, S.: How to tangle with a nested radical. Math. Intelligencer 16 (1994) 2
49-55
33. Landau, S., Miller, G.: Solvability by radical is in polynomial time. J. Comput.
System Sci. 30 (1985) 179-208
Techniques for the Computation of Galois Groups 77
34. Lazard, D., Valibouze, A.: Computing subfields: Reverse of the primitive element
problem. In Computational Algebraic Geometry ( F. Eyssette, A. Galligo, eds.),
volume 109 of Progress in Mathematics. Birkhiiuser, Boston, 1993 163-176
35. Lehobey, F.: Resolvent computations by resultants without extraneous powers. In
Kiichlin [30], 199785-92
36. Lenstra, A., Lenstra, H., Lovasz, L.: Factoring polynomials with rational coeffi-
cients. Math. Ann. 261 (1982) 515-534
37. Liebeck, M. W., Praeger, C. E., Saxl, J.: A classification of the maximal subgroups
of the finite alternating and symmetric groups. J. Algebra 111 (1987) 365-383
38. Mattman, T. W., McKay, J.: Computation of Galois groups over function fields.
Math. Compo 66 (1997) 218 823-831
39. Matzat, B. H.: Fortschritte in der inversen Galoistheorie. This volume
40. McKay, J., Stauduhar, R.: Finding relations among the roots of an irreducible
polynomial. In Kiichlin [30], 1997 75-77
41. Mignotte, M.: Mathematics for Computer Algebra. Springer, Heidelberg, (1992)
42. Neukirch, J.: Algebraische Zahlentheorie. Springer, Heidelberg, (1992)
43. Schonert, M., et aI.: GAP 3.4, patchlevel4. Lehrstuhl D fiir Mathematik, Rheinisch-
Westfiilische Technische Hochschule, Aachen, (1997)
44. Soicher, L. H.: The computation of Galois groups. Master's thesis, Concordia
University, (1981)
45. Soicher, L. H., McKay, J.: Computing Galois groups over the rationals. J. Number
Theory 20 (1985) 273-281
46. Stauduhar, R. P.: The determination of Galois groups. Math. Compo 27 (1973)
981-996
47. Tschebotareff, N.: Die Bestimmung der Dichtigkeit einer Menge von Primzahlen,
welche zu einer gegebenen Substitutionsklasse gehoren. Math. Ann. 95 (1925)
191-228
48. van der Waerden, B. L.: Die Seltenheit der Gleichungen mit Affekt. Math. Ann.
109 (1934) 13-16
49. van der Waerden, B. L.: Algebra, erster Teil. Springer, Heidelberg, eighth edition,
(1971)
50. Wielandt, H.: Permutation groups through invariant relations and invariant func-
tions. Lecture notes, Department of Mathematics, The Ohio State University,
(1969)
51. Yokoyama, K.: A modular method for computing the Galois group of polynomials.
In Cohen and Roy [8], 1997 617-636
52. Zassenhaus, H.: On Hensel factorization I. J. Number Theory 1 (1969) 291-311
Fortschritte in der inversen Galoistheorie
Prof. H.-W. Leopoldt zurn 70. Geburtstag
B. Heinrich Matzat
Dieser Bericht setzt die mit Matzat (1988,1991b) begonnene Reihe von Uber-
sichtsartikeln uber konstruktive Galoistheorie fort und gibt einen Uberblick uber
seither erreichte neue Resultate. Da inzwischen mehrere einfUhrende Texte in
Buchform von Matzat (1987), Serre (1992), V61klein (1996) sowie Ishkhanov,
Lure und Faddeev (1997) vorhanden sind, kann ich mich bei der Zusammenstel-
lung der Grundlagen sehr kurz fassen. Eine ausfUhrliche Darstellung mit Bewei-
sen der meisten der hier zusammengestellten Resultate wird im Ergebnisbericht
Malle, Matzat (1998) enthalten sein.
Der Artikel gliedert sich in zwei Teile. Der erste, Galoisabstieg und Starrheit,
enthaIt Resultate uber Definitionski:irper von Galoiserweiterungen und starre Er-
zeugendensysteme endlicher Gruppen, der zweite, Geometrische Einbettungspro-
bleme, uber eigentliche Li:isbarkeit zerfallender Einbettungsprobleme mit Ein-
schluB von GAR-Realisierungen, Li:isungen spezieller Frattini-Einbettungspro-
bleme sowie uber explizite Konstruktionen von Galoiserweiterungen.
Ausgeklammert sind Resultate im Zusammenhang mit der Grothendieck-
Teichmuller-Gruppe und der Abhyankar-Vermutung, da hier umfassende Uber-
sichtsartikel von Ihara (1991) und Nakamura (1997), bzw. Harbater (1995) vor-
liegen.
Diese beiden Probleme sind eng miteinander verknUpft. Die Kenntnis von
G K (§) zieht selbstverstiindlich die Kenntnis von Q; K (§) als Menge aller endli-
chen Faktorgruppen von GK(§) nach sich. Umgekehrt kann aber zumindest im
Fall einer endlich erzeugten Fundamentalgruppe auch die Gruppe G K (§) ohne
Miihe aus Q;K(§) gewonnen werden (siehe Fried, Jarden (1986), Prop. 15.4).
Satz 1.1. (Hilbert (1892)) Es seien K ein iiber Q endlich erzeugter Korper
und t ein iiber K trans zen dentes Element. Dann gibt es zu jedem separablen
Polynom f(t, X) E K(t)[X) unendlich viele a E K mit
Gal(f(a, X)) ~ Gal(f(t, X)).
Ein Korper K heiBt nun ein Hilbertkorper, wenn der Satz 1.1 fUr K gilt. Zu
diesen ziihlen unter anderem auch die Uber IFp(t) (Franz (1931)) und dem ma-
ximal absolut abelschen Zahlkorper Q'b (Weissauer (1982)) endlich erzeugten
Korper. FUr diese gilt in der obigen Bezeichnungsweise:
Satz 1.3. Die Fundamentalgruppe GC(t) (§) = Gal(Ms/C(t)) ist eine freie
proendliche Gruppe von Rang s -1 mit s := I§I. Die eingebetteten Erzeugenden
Ii E 71"1 (X\Sj Po) sind Erzeugende von iiber '-Pi liegenden Triigheitsgruppen in
Ms/C(t).
Man uberlegt sich leicht, daB (a) fUr char(K) =p > 0 falsch ist, hingegen
bleibt (b) richtig, wie wir in §6 sehen werden.
(2.1) Aus der Folgerung 1.4 ergibt sich sofort, daB jede endliche Gruppe G
als Galoisgruppe einer Korpererweiterung N / K uber K := Q(t) vorkommt. Wir
wollen jetzt der Frage nachgehen, unter welchen Voraussetzungen eine soIche
Galoiserweiterung uber einem vorgegebenen Zwischenkorper K von Q(t)/Q(t)
definiert ist. Eine notwendige Voraussetzung hierfUr ist, daB die Fortsetzungen
der Automorphismen 8 E Gal(K/K) auf N den Korper N auf sich abbilden
und auf Gal(N / K) als innere Automorphismen operieren. Der Fixkorper K aller
8 E Gal(Q(t)/Q(t» mit dieser Eigenschaft heiSt der Modulkorper der Galoiser-
weiterung N / K. Dieser ist ein Dejinitionskorper von N / K, falls eine Galoiser-
weiterung N / K mit der Gruppe G existiert mit N K = N (in einer algebraisch
abgeschlossenen Hiille von K). Aus einem einfachen Resultat uber Gruppener-
weiterungen ergibt sich:
Satz 2.1. Die Obstruktion dafii.r, daft ein Modulkorper Keiner Galoiser-
weiterung N / K mit der Gruppe G ein Dejinitionskorper ist, liegt in der K oho-
mologiegruppe H2 (G K, Z (G» mit trivialer Operation von G K auf demo Zentrum
Z(G) von G.
Folgerung 2.2. Besitzt das Zentrum Z(G) von G = Gal(N / K) ein Kom-
plement in G, so ist der Moduikorper von N / K auch ein Dejinitionskorper.
Fur einen Beweis und weitergehende Resultate wird auf Matzat (1987), II
§2, Satz 2 bzw. Malle, Matzat (1998), Thm. 1.3.9 verwiesen.
N- C T.-
·-
M- Kern(.pa)
S , (2.1)
(2.2)
( [0" ],8
) t--t [0" ]0 mit []O
O"i = [C(0-1)]
0"(i)0-1 • (2.3)
(2.4) Die Operation von .1s auf Es(G)/Inn(G) ist durch (2.3) eindeutig
bestimmt, falls es fUr alle T E E.(G) mit [O"i] = h]. ein von i unabhangiges 0: E G
gibt mit O"r = ri. Dann heiBt 0" ein starres Erzeugendensystem von G; es heiBt
ein rational starres Erzeugendensystem, falls zusatzlich die Konjugiertenklassen
Fortschritte in der inversen Galoistheorie 83
[O"i) in G rational sind, d.h. wenn aile primitiven Potenzen von O"i in G zu O"i
konjugiert sind.
1m Falle rationaler Verzweigungspunkte ergibt sich damit der folgende Starr-
heitssatz:
Satz 2.4. Es sei G eine endliche Gruppe mit einem starren Erzeugendensy-
stem u = (0"1, ... ,0".), und es operiere Gal(Q(t)/Q(t)) trivial auf§ ~ IP(Q(t)/Q).
Dann wird der Modulkorper Ku von Nu/Q(t) fiber Q(t) durch die Charakterwer-
te von G auf den Klassen [O"i] fUr i = 1, ... , s erzeugt. Insbesondere gilt Kcr = Q(t)
im Faile eines rational starren Erzeugendensystems.
(2.5) Zusammen mit Folgerung 2.2 erhiHt man daraus den folgenden Exi-
stenzsatz fUr geometrische Galoiserweiterungen fiber q>b (t) und Q(t). Diese wer-
den im folgenden auch kurz als G-Realisierungen von G fiber q>b bzw. fiber Q
bezeichnet.
Satz 2.5. Es sei G eine endliche Gruppe, deren Zentrum ein Komple-
ment in G besitzt. Besitzt G ein starres Erzeugendensystem u = (0"1, ... ,0".), so
gibt es zu vorgegebenem § = {s.}Jt, ... , ~.} ~ IP(Q(t)/Q) mit Grad(~i) = 1 eine
geometrische Galoiserweiterung
Dabei wird Kcr fiber Q(t) durch die Charakterwerte auf den Klassen [u;) von G
erzeugt. 1st hierbei u sogar rational starr, so ist Kcr = Q(t).
Ffir einen Beweis wird wieder auf Matzat (1987), Kap. II, §4, Folg. 3, bzw.
Malle, Matzat (1998), Thm. 1.4.8 verwiesen. Weitergehende Aussagen kann man
unter Umstanden erhalten, wenn man eine nichttriviale Operation von
Gal(Q(t)/Q(t)) auf § zulafit (loc. cit. Thm. 1.4.11). Insbesondere fUhren auch
diejenigen starren Erzeugendensysteme zu einer G-Realisierung fiber Q, die bis
auf Konjugation invariant unter der Operation (2.3) sind. Diese nennen wir hier
kurz hinreichend rational.
(3.1) Nach den Resultaten des letzten Abschnitts stellt sich die Frage, welche
endlichen Gruppen (hinreichend rational) starre Erzeugendensysteme besitzen.
Zum Nachweis k6nnen verschiedene Methoden benutzt werden. Die nachstliegende
besteht darin, Starrheit direkt mittels der Definition nachzuweisen. Dies ffihrt
zum Beispiel bei der symmetrischen Gruppe Sn zum Ziel (siehe Shih (1974)). Da
bei der zugehOrigen G-Realisierung der Sn fiber Q der Fixk6rper der alternie-
renden Gruppe An ein rationaler Funktionenk6rper ist, erhaIt man damit auch
G-Realisierungen der An fiber Q fUr n ;::: 3.
84 B.H. Matzat
(3.2) Die zweite von Belyi (1979) eingefUhrte Methode verwendet Matri-
zendarsteIlungen. Dazu nennen wir eine Matrix U E GLn(k) eine Pseudospie-
gelung (oder PerspektiviUit), falls u eine Hyperebene punktweise festli:i13t bzw.
einen (n - 1)-dimensionalen Eigenraum zum Eigenwert 1 besitzt.
(3.1)
Bemerkung 3.2. Ein Erzeugendensystem u von G mit n(u) < 2 ist ein
starres Erzeugendensystem.
Davon sind zumindest ftir E7(2m) inzwischen G-Realisierungen tiber Q'b von
U. Schmidt (1998) angektindigt worden.
(3.5) Ftir den Nachweis von G-Realisierungen tiber IQ gentigt die Existenz
starrer Erzeugendensysteme noch nicht. Hinreichend ist nach Satz 2.5 die Exi-
Fortschritte in der inversen Galoistheorie 85
(3.6) Als ein sehr erfolgreiches Hilfsmittel fUr den Nachweis (semi-)ratio-
naler Belyi-Tripel hat sich eine auf Malle (1996), Reiter (1997) und V61klein
(1998a) zurtickgehende konstruktive Version des Belyi-Kriteriums erwiesen, die
hier in der eleganten Version von V61klein wiedergegeben wird (vergleiche auch
Malle, Matzat (1998), Thm. 11.2.6). Ein analoges Resultat im FaIle k = <C geht
bereits auf Levelt (1961) zurtick (zitiert in Beukers, Heckmann (1989), Thm. 3.5).
Diese Bemerkung liiBt aber zunachst noch offen, welche irreduzible Unter-
gruppe von GLn(q) durch u erzeugt wird. Hierzu konstruiert man zu geeig-
net gewahlten hinreichend rationalen Belyi-Tripeln zuerst invariante Bilinear-
formen, welche die moglichen Gruppen einschranken. AnschlieBend benutzt man
die Klassifikation der irreduziblen Pseudospiegelungsgruppen von Wagner (1978,
1980), Kantor (1979) und Zalesskii-Serezhkin (1980), urn schlieBlich die erzeug-
ten Gruppen zu identifizieren.
(3.7) Mit diesen Methoden konnten Malle (1996) und Reiter (1997) die
tiber ll"p definierten einfachen klassischen Gruppen weitgehend als Galoisgrup-
pen tiber Ql(t) und Ql realisieren. Zusammen mit iilteren Resultaten von Feit,
Fong (1984) und Malle (1988,1992) fUr exzeptionelle Gruppen vom Lie-Typ so-
wie eine Reihe weiterer Autoren fUr die sporadischen Gruppen (siehe Matzat
(1991b) fUr eine Bibliographie) ergibt sich:
§4 Zopfoperationen
mit dem Funktionenkorper C(X~) = C(t l , ... , t r ) bzw. von dem r-fachen unvoll-
standigen symmetrischen Produkt X; := X~/Sr mit C(X;) = C(tl, ... ,tr)' wobei
ti die elementarsymmetrischen Funktionen der ti bedeuten. Die bezuglich der
Projektion Po auf X~ von Po = (1, ... , r) E X; als Ausgangspunkt gebildete
Fundamentalgruppe
Satz 4.1. (a) Die Galoisgruppe der maximalen uber X~ unverzweigten al-
gebraischen Korpererweiterung Mr /C(tl , ... ,tr) ist die proendliche Hurwitzsche
Zopjgruppe fIr. Dabei gehort zu der Uberlagerung X; die proendliche reine Hur-
witzsche Zopjgruppe Hr als algebraische Fundamentalgruppe.
(b) Die A ussagen in (a) bleiben richtig fUr jeden algebraisch abgeschlossenen
K orper k der Charakteristik 0 anstatt von C.
Fortschritte in der inversen Galoistheorie 87
(4.2) Ab jetzt werde (3; E 1l"l(X;, Po) mit seinem kanonischen Bild in fIr
identifiziert, und es sei
(4.2)
Drum erzeugen 'Y1, .. ·,'Yr-1 in fir eine freie proendliche Gruppe G r - 1 vom Rang
r - 2 mit der definierenden Relation 'Y1 ... 'Yr-1 = 1 . Weiter operieren die (3i fUr
1 ::; i ::; r - 2 auf den 'Yi vermoge
('Y1, ... , 'Yr-1),8, = ('Y1, ... , 'Yi-b 'Yi'Yi+1 'Yi 1,'Yi, 'Yi+2, ... , 'Yr-1). (4.3)
Fur den Fixkorper von G r- 1 in der Galoiserweiterung Mr/Q(ii, ... ,ir) aus Satz
4.1 fUr Q statt <C gelten (siehe Matzat (1991a), §3.3 bzw. Malle, Matzat (1998),
III, §2.4):
Folgerung 4.2. (a) Fur r ~ 4 ist der Fixkorper von Mr unter dem ab-
geschlossenen Normalteiler G r - 1 von Hr gerade M r - 1(t r ) .Dabei ist M r -1 alge-
braisch abgeschlossen in Mr.
(b) Die Korpererweiterung M r /Mr - 1(tr) ist eine maximale auflerhalb der
Menge § = P-P1, ... , ~s} der Ziihlerdivisionen ~i von (ti - tr ) in M r - 1(t r )/ M r - 1
unverzweigte Galoiserweiterung.
(4.3) Damit ist man nun in einer der Folgerung 1.4 vollig entsprechenden
Ausgangssituation mit s = r - 1. Entsprechend gelten die Aussagen von §2 auch
fUr den Korperturm Mr/Mr-1(tr)/Q(t1 ... ,tr) anstatt von Ms/Q(t)/Q(t). Ins-
besondere gibt es zu jeder Galoiserweiterung Nu / Ms (t r) fUr s = r - 1 einen
Modulkorper Ku ~ Q(t) = Q(t1, ... ,tr).
Die Korpererweiterung Ku/iQ(t) laBt sich nun aufspalten in eine Konstan-
tenerweiterung ku(t)/iQ(t) gefolgt von einer geometrischen Korpererweiterung
Ku/ku(t), wobei der Grad letzterer mit der Lange der Hs-Zoptbahn von [0-] auf
E.(G)/Inn(G) libereinstimmt. Dabei ergibt sich die Zopfoperation von Hs auf
E.(G)/Inn(G) analog zu (2.3) aus (4.3) durch Substitution von 'Yi durch Ui und
Inversion:
Satz 4.3. (a) Es sei G eine endliche Gruppe mit einer starren H.-Zop/bahn
B C E.(G)/Inn(G) von Erzeugendensystemen. Dann ist fUr jedes
88 B.H. Matzat
[u] [(0"1, ... ,0"8)] E B der Modulkorper Ku von Nu lM8 regular uber dem
Korper ku der Charakterwerte von u. Insbesondere ist ku = Q falls [u] (hinrei-
chend) rational ist.
(b) Besitzt weiter Z(G) ein Komplement in G und Kulku(t) eine unverzweig-
te rationale Stelle, so gibt es eine geometrische Galoiserweiterung
Ffir den Beweis dieses Satzes und Verallgemeinerungen wird auf Matzat
(1991a), §6.1 und §8.1, Matzat (1993) bzw. Malle, Matzat (1998), Thm. IlL5.3,
Thm. IlL6.5 und Thm. IlL8.9 verwiesen. Insbesondere kann auch hier wie in
Satz 2.5 ku durch Q ersetzt werden, wenn (die Zopfbahn von) [u] im Sinne der
obigen Definition hinreichend rational ist.
1m Vergleich zu den Satzen 2.4 und 2.5 wird in Satz 4.3 die Starrheitsfor-
derung abgeschwacht zur Starrheit von Zopfbahnen. Daffir hat man aber eine
arithmetische Zusatzvoraussetzung in Form der Existenz eines rationalen Punk-
tes zu erfiillen. Letztere kann in Einzelfallen durch das Studium der Struktur
von Ku verifiziert werden. Damit wurde unter anderem die bisher einzige G-
Realisierung der M24 fiber Q (mit dem Klasssenvektor (2A, 2A, 2A, 12B) und
der Zopfbahnlange 144) gefunden. Weitere derartige Beispiele sind in Matzat
(1991a), §9, zusammengestellt.
(4.6) In einem weiteren Spezialfall ist die Existenz eines rationalen Punktes
in trivialer Weise erffillt, wenn namlich das Erzeugendensystem u selbst starr
und damit Ku = ku(t) ist. Solche Erzeugendensysteme mit s ~ 4 sind erst
kfirzlich von VOlklein (1998a) und Dettweiler, Reiter (1998) entdeckt worden
und stellen eine direkte Verallgemeinerung der Belyi-Tripel dar. Dazu heiBt ein
s-Tupel u = (0"1, ... ,0"8) von Elementen O"i E GLn(k) fiber einem Korper k mit
0"1" '0". E Z(GLn(k» linear starr, wenn es starr im Sinne von §2 ist (aber
nicht notwendig erzeugend). Es heiBe weiter irreduzibel, wenn es eine irreduzible
Untergruppe von GLn(k) erzeugt. Irreduzible linear starre s-Tupel lassen sich
folgendermaBen nachweisen (Volklein (1998b»:
Fortschritte in der inversen Galoistheorie 89
Bemerkung 4.5. Es sei k ein Korper und u = (0"1, ... ,0".) ein irreduzibles
s-Tupel in GLn(k). Dieses ist linear starr, falls gilt
•
l: dim(C(O"i)) = (s - 2)n 2 + 2. (4.5)
i=1
Hierbei bedeutet dim(C(O")) die Dimension des Zentralisators von 0" in GLn(k)
als algebraische Gruppe uber k. 1m Spezialfall algebraisch abgeschlossener Kor-
per kist (4.5) sogar eine hinreichende Bedingung fur die lineare Starrheit irre-
duzibler s-Thpel (siehe Katz (1996), Ch. 1.1 fUr k = C, bzw. Volklein (1998b)
fUr k allgemein).
1m Kapite16 von Katz (1996) wird eine Konstruktionsmethode fUr irreduzible
linear starre s- Thpel - diese beschreiben dort physikalisch starre lokale Systerne
- uber algebraisch abgeschlossenen Korpern der Charakteristik 0 entwickelt. In
der Regel kann man aus diesen durch Spezialisierung irreduzible linear starre s-
Thpel in linearen Gruppen uber gewissen endlichen Korpern k gewinnen. Diese
ergeben dann sogar starre Erzeugendensysteme der erzeugten Untergruppe G,
falls noch die Zusatzbedingung an N(G) in der Bemerkung 3.1. erfUllt ist (siehe
Katz (1996), Ch. 7, und Dettweiler, Reiter (1998), §4).
(4.7) Ais erste Erfolge konnten Dettweiler und Reiter (1998) mit dieser
Methode die ersten beiden Aussagen des folgenden Satzes gewinnen, der G-
Realisierungen uber IQ klassischer Gruppen mit Korperautomorphisrnen enthalt:
1m Gegensatz zu den Teilen (a) und (b) von Satz 4.6 beruht die Aussage (c)
von Thompson, VOlklein (1998a) nicht auf der Existenz starrer Erzeugendensy-
sterne, sondern grundet sich auf eine hinreichend rational starre Zopfbahn der
Lange 2n - 1 mit rationalern Definitionskorper KtT/lQ. Fur gerade q und n sind
zu (a) und (b) analoge Aussagen bereits von VOlklein (1993) bewiesen worden
(vergleiche auch Malle, Matzat (1998), Cor. I1I.9.12 und Cor. III.9.14).
(5.1) Urn eine vorgegebene Galoiserweiterung in eine solche mit groBerer Grup-
pe einbetten zu konnen, ist ein sogenanntes Einbettungsproblem zu lOsen. Die-
90 B.H. Matzat
ses laBt sich wie folgt prazisieren: Die Galoiserweiterung M / K mit der Grup-
pe H = Gal(M/K) fiber einem Korper K mit der Fundamentalgruppe G K
sei gegeben durch die Restriktion 0 : GK ~ H und die Gruppenerweiterung
G =F . H (pro-)endlicher Gruppen durch einen Epimorphismus (3 : G ~ H
mit Kern((3) = F. Dann heiBt die Frage nach der Existenz eines (stetigen) Ho-
momorphismus 'Y : GK ~ G mit 0 = (3 0 'Y ein Einbettungsproblem, abgekfirzt
durch £(0,(3). Der Homomorphismus 'Y heiBt Liisung von £(0,(3), im Falle ei-
nes Epimorphismus eine eigentliche Liisung. Weiter heiBt der Fixkorper N von
Kernb) Liisungskiirper von £(0, (3); fUr diesen gilt im Falle einer eigentlichen
Losung Gal(N/K) = G.
Y
GK
10
1-F-G-H-l (5.1)
(3
(5.2) 1st. bei einem Einbettungsproblem £(0,(3) der Grundkorper K ein
Funktionenkorper mit Konstantenkorper k und ist M/k regular, so heiBt £(0,(3)
ein geometrisches Einbettungsproblem, und eine Losung 'Y von £ (0, (3) heiBt eine
geometrische Liisung, falls auch der zugehOrige Losungskorper N fiber k regular
ist. Bei der Untersuchung geometrischer Einbettungsprobleme haben sich soge-
nannte Parameterlosungen bewahrt. Diese erhillt man folgendermaBen: Durch
Adjunktion einer Transzendenten t gewinnt man aus dem Einbettungsproblem
£(0,(3) mit O(GK) = Gal(M/K) ein neues Einbettungsproblem £(0*,(3*) mit
0* : GK{t) ~ H* := Gal(M(t)/K(t)) und (3* : G ~ H* ~ H. Eine Losung
'Y* dieses Einbettungsproblems heiBt Pammeterliisung von £ (0, (3), falls der zu-
gehorige Losungskorper N* regular fiber Mist. Offensichtlich ist bei diesem
Sprachgebrauch eine eigentliche Parameterlosung von £(0, (3) mit der Einsgrup-
pe H = O(GK) eine G-Realisierung von F = G.
Die folgenden Aussagen, die fast unmittelbar aus der Definition folgen, zeigen
die Universalitli.t von Parameterlosungen (vergleiche Matzat (1995) fUr Teil (b)).
Dabei heiBt ein Einbettungsproblem £(0, (3) endlich, falls Bild(o) und Kern((3)
endliche Gruppen sind.
(5.3) Mit dem folgenden Satz von Nobusawa (1961) (siehe auch Malle,
Matzat (1998), Thm. IV.5.2) lassen sich endliche Einbettungsprobleme in zwei
wesentlich verschiedene Klassen aufteilen:
Fortschritte in der inversen Galoistheorie 91
(5.4) Diese beiden Typen von Einbettungsproblemen zeigen ein vallig ver-
schiedenes Lasungsverhalten:
Dabei erhalt man (a) durch Komposition von a mit dem definitionsgemaB
existierenden homomorphen Schnitt t : H -+ G und (b) aus (F,,(GK)) = G.
Letzteres hat im Falle einer projektiven Fundamentalgruppe G K unmittelbar
zur Folge:
§6 Zerfallende Einbettungsprobleme
(6.1) Zerfallende Einbettungsprobleme lassen sich iiber eine Hauptreihe des
Kerns F = Fo t> Fl t> ••• t> Fr = 1 weiter zerlegen in ein zerfallendes Einbettungs-
problem mit charakteristisch einfachem Kern D ~ Fo / Fl (d.h. D ist das direkte
Produkt mehrerer Kopien einer einfachen Gruppe E) und ein wei teres Einbet-
tungsproblem mit dem Kern F 1 , so daB wir uns hier durch Induktion mit Satz
5.2 auf den Fall eines charakteristisch einfachen Kerns zuriickziehen kannen. 1st
hierbei E eine zyklische Gruppe, so wird E(a, fJ) ein zerfallendes Einbettungs-
problem mit abelschem Kern D. Diese wurden im Zahlkarperfall bereits von
Scholz (1929), im geometrischen Fall von Saltman (1982) und im parametri-
schen Fall schlieBlich in Matzat (1995) ge16st (siehe auch Malle, Matzat (1998),
Thm. IV.2.4).
Satz 6.2. Es sei E eine nichtabelsche ein/ache endliche Gruppe mit einer
GAR-Realisierung uber k. Dann besitzt jedes endliche Einbettungsproblem uber
k( t) mit Kern Er fUr r E N eine eigentliche ParameterlOsung.
(6.3) Damit stellt sich die Frage nach der Existenz von GAR-Realisierungen
nichtabelscher einfacher Gruppen. Dazu stellt man fest, daB von den in Satz
3.3 aufgezahlten G-Realisierungen die alternierenden Gruppen, die sporadischen
einfachen Gruppen und die Gruppen vom Lie-Typ mit nur trivialen Korperau-
tomorphismen (bis auf 3D4(p» konstruktionsgemaB zusatzlich die Automorphis-
menbedingung (A) erfUllen (siehe Malle, Matzat (1998), Thm. 11.10.2). Dasselbe
gilt auch ftir die linearen und unitaren Gruppen in Satz 4.6. Damit erhalt man
tiber Qfb das folgende Ergebnis:
(604) Da tiber 10 nieht jede Quadrik einen rationalen Punkt besitzt, stellt
hier die Rationalitatsbedingung (R) fUr die GAR-Realisierungen in den Listen
von Satz 3.5 noch eine echte Zusatzbedingung dar. Diese ist im eindimensionalen
Fall zum Beispiel erfUllt, wenn die auBeren Automorphismen von E eine zyklische
Gruppe oder eine Diedergruppe mit ungeradem zyklischen Normalteiler bilden,
im mehrdimensionalen Fall ist haufig das semilineare Rationalitatskriterium von
Speiser (1919) hilfreieh (siehe in diesem Zusammenhang auch Matzat (1992),
Satz 2 bzw. Malle, Matzat (1998), Prop. IVA.7). Dies fUhrt mit den Satzen 3.5
und 4.6 auf den folgenden Satz (vergl. Malle, Matzat (1998), Thm. IVA.3, fUr
eine prazise Formulierung):
(6.5) Nach wie vor bleibt aber die Frage offen, ob jedes zerfallende Einbet-
tungsproblem tiber 10 beziehungsweise tiber qtb eine eigentliche Losung besitzt.
Dieses andert sieh, wenn man den Grundkorper hinreiehend groB wahlt. Hierzu
heiBe ein Korper k ein weiter Korper, falls jede irreduzible algebraische Kurve
tiber k, die einen einfachen k-rationalen Punkt besitzt, sogar unendlich viele
k-rationale Punkte besitzt. Zu diesen zahlen neben den PAC-Korpern unter an-
derem auch die reell abgeschlossenen Korper und die Henselschen Korper. Ftir
diese konnte Pop (1996) mit Methoden der starren analytischen Geometrie all-
gemein beweisen (siehe auch Malle, Matzat (1998), Thm. VA.8, bzw. Haran,
Jarden (1998) ftir einen alternativen Beweis):
Satz 6.5. Uber einem weiten Korper k besitzt jedes endliche zerfallende
Einbettungsproblem eine eigentliche ParameterlOsung. Insbesondere besitzt jede
endliche Gruppe eine G-Realisierung uber k.
(6.6) 1m Falle eines Hilbertschen PAC-Korpers folgt hieraus mit der Fol-
gerung 504. die Losbarkeit aller endlichen Einbettungsprobleme im eigentlichen
Sinne. Dies fUhrt mit dem Freiheitssatz von Iwasawa (1953) zu
direkt aus der Folgerung 4.4. ableiten (siehe auch Malle, Matzat (1998), Thm.
IV.3.1O).
§7 Frattini-Einbettungsprobleme
(7.1) Wie im Fall zerfallender Einbettungsprobleme konnen wir uns auch bei
Frattini-Einbettungsproblemen auf Einbettungsprobleme mit charakteristisch ein-
fachem Kern zurfickziehen. Da die Frattinigruppe einer endlichen Gruppe auflosbar
ist, genfigen dann sogar Frattini-Einbettungsprobleme mit abelschem Kern. Da-
bei ist nach Bemerkung 5.3(b) jede Losung bereits eine eigentliche Losung. Es sei
also £(a, {3) ein endliches Frattini-Einbettungsproblem, dessen Kern eine endli-
che abelsche Gruppe A vom Exponenten mist. Dabei konnen wir annehmen,
daB m teilerfremd zur Charakteristik des Grundkorpers ist, da fiber einem Hil-
bertkorper K der Charakteristik p jedes endliche Einbettungsproblem mit einer
p-Gruppe als Kern eigentlich lOsbar ist. (Denn nach dem Satz von Witt (1936)
ist die maxim ale pro-p-Faktorgruppe von G K eine freie pro-p-Gruppe von un-
endlichem Rang.)
Durch Adjunktion einer primitiven m-ten Einheitswurzel (m zu M erhalten
wir zu M:= M((m) aus £(a,{3) ein verschobenes Einbettungsproblem £(&,/3),
in dem die Gruppe G = A· iI mit iI:= Gal(M/k) das Faserprodukt G XH iI
von G mit iI fiber H ist. Nach dem ersten Reduktionssatz von Kochendorffer
(1953) ist dieses genau dann losbar, wenn das ursprfingliche Einbettungspro-
blem lOsbar ist (vergl. Matzat (1995), Thm. 3 bzw. Malle, Matzat (1998), Thm.
IV.1.6). Insbesondere konnen wir weiter ohne Beschrankung der Allgemeinheit
annehmen, daB (m in M enthalten ist.
l - - A - - GXHGK ~ GK - - 1
l i! ;//la
l--A G H --1
(7.3) Damit gibt es hier fUr die Losbarkeit ein im allgemeinen nichttriviales
kohomologisches Hindernis h(a,{3) E H2(GK ,A). In dem Spezialfall, daB A G-
isomorph zu einer Untergruppe von M X ist, liegt h(a,{3) in der m-Torsion der
Brauergruppe Br(K) = H2(GK, [(X). Dementsprechend heiBt das zugehorige
Fortschritte in der inversen Galoistheorie 95
(7.5) Nun sei &(0:, (3) wieder ein beliebiges endliches Einbettungsproblem
mit abelschem Kern A. Wird dann durch ein Untergruppenpaar (U, B) mit B :S
A und A :S U :S G mit U :S Na(B) und UjA :S Gal(MjK) vermoge der
kanonischen Abbildungen durch
Satz 7.3. Ein endliches Einbettungsproblem mit abelschem Kern iiber ei-
nem globalen Korper ist genau dann konkordant, wenn aile zugehorigen lokalen
Einbettungsprobleme los bar sind.
Satz 7.4. Uber einem globalen Korper besitzt jedes endliche zerJaliende
(geometrische) Einbettungsproblem mit nilpotentem Kern eine (geometrische)
Losung.
Satz 7.5. Uber einem globalen Korper kommt jede endliche auflosbare
Gruppe als Galoisgruppe vor.
Folgerung 7.6. Uber einem endlichen Korper besitzt jede endliche auflos-
bare Gruppe eine G-Realisierung.
(8.1) Zum AbschluB dieser Ubersicht mochte ich noch zusammensteIlen, wie-
weit flir Permutationsgruppen kleinen Grades G-Realisierungen fiber Ql bekannt
und erzeugende Polynome berechnet sind (vgl. Matzat (1991b), Satz 8.3, flir eine
iiltere Ubersicht). Ein erster technischer Fortschritt ergab sich bei der Losung al-
gebraischer Gleichungssysteme. Hier hat Nauheim (1995,1998) einen modularen
Algorithmus (ffir algebraische Gleichungssysteme mit endlich vielen Losungen)
entwickelt, der ohne die Voraussetzung guter Reduktion auskommt. Dieser AIgo-
rithmus ist gerade bei der Konstruktion von Polynomen mit vorgegebener (fast
einfacher) Galoisgruppe besonders wertvoIl, da bei den zugehorigen Gleichungs-
systemen haufig flir aIle hinreichend kleinen Primzahlen schlechte Reduktion
vorliegt (als Teiler der Gruppenordnung). Damit konnte Nauheim (1995) unter
anderem das folgende Polynom vom Grad 28 mit der Gruppe U3 (3) berechnen:
Als zweites hierher gehoriges Resultat verdient die Konstruktion eines Po-
lynoms vom Grad 24 mit der Mathieu-Gruppe M24 durch Granboulan (1996)
hervorgehoben zu werden. Das M24-Polynom ist aber wegen seines Umfangs ffir
eine exemplarische Wiedergabe weniger geeignet.
und zum Beispiel fUr (tl' t 2 , t3, t4) 1-+ (675,114/27,00,0) eine uber Q(t) unver-
zweigte rationale Stelle besitzt. Bei diesen Verzweigungspunkten existiert also
nach Satz 4.3(b) eine Galoiserweiterung NulQ(t) mit t = t5 und der Gruppe
G = GL2(3).
Da die Permutationsdarstellung vom Grad 8 von G imprimitiv ist, ist es
zweckmii13ig, die Berechnung des Polynoms fur einen Stammkorper Lu von Nul
Q(t) in zwei Schritte zu zerlegen. Man berechnet zuerst ein Polynom fUr einen
Stammkorper Lit vom Grad 4 fur die Faktorgruppe G = G1Z (G) 3:! 8 4 zu 0-
mit cfi, cf2 E 3.4 (3-Zyklen) und cf3, cf4 E 2.4 (Transpositionen):
(8.4) Mit diesen neuen Resultaten von Dentzer (1995a), Malle (in Malle,
Matzat (1998), Appendices 1 und 2), Nauheim (1995) und unter Einbeziehung
von Ergebnissen von Smith (1998) fUr T7,3 und TS,25 sowie Eichenlaub (1998)
ftir T9 ,15 in der Numerierung von Butler, McKay (1983) ergibt sich nunmehr das
folgende Bild (vergl. Satz 8.3 in Matzat (1991b) fUr altere Resultate):
Satz 8.1. Fur die folgenden Permutationsgruppen G vom Grad n sind er-
zeugende Polynome fUr G-Realisierungen uber Q bekannt:
(a) die symmetrischen und alternierenden Gruppen 8 n und An fur n E N,
(b) aile transitiven Gruppen vom Grad n :::; 11 mit Ausnahme von T 9 ,14 ,
TlO ,17, T lO ,27, T ll ,2, T ll ,3, T ll ,4,
(c) aile primitiven, nicht auflosbaren Gruppen vom Grad 12 :::; n :::; 30 mit
Ausnahme der Gruppen mit 80ckel Z~, L2(16), L3(4), M 23 , Z~, L2(25),L 2(27).1
Es sollte vielleicht an dieser Stelle angemerkt werden, daB fUr zwei der in
(c) aufgefUhrten Ausnahmen bisher noch keine G-Realisierung uber Q gefunden
wurde, namlich die L2 (16) als kleinster einfacher Gruppe und die M23 als einzi-
ger sporadischer Gruppe.
Bis zum Grad 12 einschlieBlich ist auch diese Liste in Malle, Matzat (1998)
als Appendix 3 aufgenommen.
Literatur
Aschbacher, M. et al. (eds.) (1984): Proceedings of the Rutgers group theory year,
1983-1984, Cambridge University Press
Auslander, M., Brumer, A. (1968): Brauer groups of discrete valuation rings. Indag.
Math. 30, 286-296
Belyi, G.V. (1979): On Galois extensions of a maximal cyclotomic field. Izv. Akad.
Nauk SSSR Ser. Mat. 43, 267-276 (Russian) [English trans!.: Math. USSR Izv. 14
(1979), 247-256]
Belyi, G.V. (1983): On extensions of the maximal cyclotomic field having a given
classical Galois group. J. reine angew. Math. 341, 147-156
Beukers, F., Heckmann, G. (1989) Monodromy for the hypergeometric function nFn-l.
Invent. math. 95, 325-354
Butler, G., McKay, J. (1983): The transitive groups of degree up to eleven. Comm.
Algebra 11, 863-911
Delone, B.N., Faddeev, D.K. (1944): Investigations in the geometry of the Galois theory.
Math. Sb. 15(57), 243-284 (Russian with English summary)
Demushkin, S.P., Shafarevich, LR. (1959): The embedding problem for local fields. Izv.
Akad. Nauk. Sel. Mat. 23, 823-840 (Russian) [English trans!.: Amer. Math. Soc.
Trans!. II 27, 267-288 (1963)]
Dentzer, R. (1995a): Polynomials with cyclic Galois group. Comm. Algebra 23, 1593-
1603
Dentzer, R. (1995b): On geometric embedding problems and semiabelian groups. Ma-
nuscripta Math. 86, 199-216
Dettweiler, M., Reiter, S. (1998): On rigid tuples in linear groups of odd dimension.
Preprint, Gainesville
Eichenlaub, Y. (1998): Realisation explicite des produits semi-directs a noyau abelien
comme groupe de Galois. Preprint, Bordeaux
Faddeev, D.K. (1951): Simple algebras over a field of algebraic functions of one variable.
Trudy Mat. lnst. Steklov 38, 190-243 (Russian) [English trans!.: Amer. Math. Soc.
Trans!' II 3, 15-38 (1956)]
100 B.H. Matzat
Feit, W., Fong, P. (1984): Rational rigidity of G 2 (P) for any prime p > 5. Pp. 323-326
in: M. Aschbacher et al. (eds.) (1984)
Franz, W. (1931): Untersuchungen zum Hilbertschen Irreduzibilitatssatz. Math. Z. 33,
275-293
Fried, M.D., Jarden, M. (1986): Field arithmetic. Springer, Berlin
Fried, M.D., Viilklein, H. (1991): The inverse Galois problem and rational points on
moduli spaces. Math. Ann. 290, 771-800
Fried, M.D., Viilklein, H. (1992): The embedding problem over a Hilbertian PAC-field.
Ann. of Math. 135, 469-481
Fried, M.D. et al. (eds.) (1995): Recent developments in the inverse Galois problem.
Contemporary Math. vol. 186, AMS, Providence
Geyer, H. (1997): Rationale Verschiebungen von Galoiserweiterungen von Q(t}, Di-
plomarbeit, Heidelberg.
Granboulan, L. (1996): Construction d'une extension reguliere de IQI(T} de groupe de
Galois M24. Experimental Math. 5,3-14
Grothendieck, A. (1971): Revetements etales et groupe fondamental. Springer, Berlin
Haran, D., Jarden, M. (1988): Regular split embedding problems over function fields
of one variable over ample fields. Preprint, Tel Aviv
Harbater, D. (1995): Fundamental groups of curves in characteristic p. Pp. 656-666 in:
Proc. Int. Congress of Math., Zurich 1994 , Birkhauser
Hasse, H. (1948): Existenz und Mannigfaltigkeit abelscher Algebren mit vorgegebener
Galoisgruppe uber einem Teilkiirper des Grundkiirpers 1. Math. Nachr. 1, 40-61
Hasse, H., Brauer, R., Noether, E. (1932): Beweis eines Hauptsatzes in der Theorie der
Algebren. J. reine angew. Math. 167, 399-404
Hilbert, D. (1892): Uber die Irreducibilitat ganzer rationaler Functionen mit ganzzah-
ligen Coeffizienten. J. reine angew. Math. 110, 104-129
Hoechsmann, K. (1968): Zum Einbettungsproblem. J. reine angew. Math. 229, 81-106
Ihara, Y. (1991): Braids, Galois groups, and some arithmetic functions. Pp. 99-120 in:
Proc. Int. Congress of Math., Kyoto 1990, Springer
Ishkhanov, V.V., Lure, B.B., Faddeev, D.K. (1997): The embedding problem in Galois
theory. AMS, Providence
Iwasawa, K. (1953): On solvable extensions of algebraic number fields. Ann. of Math.
58, 548-572
Kantor W.M. (1979): Subgroups of classical groups generated by long root elements.
Trans. Amer. Math. Soc. 248, 347-379
Katz, N.M. (1996): Rigid local systems. Princeton University Press, Princeton
Kluners, J., Malle, G. (1998): Explicit Galois realization of transitive groups of degree
up to 15. In Vorbereitung
Kochendiirffer, R. (1953): Zwei Reduktionssatze zum Einbettungsproblem fUr abelsche
Algebren. Math. Nachr. 10, 75-84
Levelt, A.H.M. (1961): Hypergeometric functions. Dissertation, Amsterdam.
Lubeck, F., Malle, G. (1998): (2,3}-generation of exceptional groups. To appear in J.
London Math Soc.
Madan, M., Rzedowski-Calderon, M., Villa-Salvador, G. (1996): Galois extensions with
bounded ramification in characteristic p. Manuscripta Math. 90, 121-135
Malle, G. (1988): Exceptional groups of Lie type as Galois groups. J. reine angew.
Math. 392, 70-109
Malle, G. (1991): Genus zero translates of three point ramified Galois extensions. Ma-
nuscripta Math. 71, 97-111
Malle, G. (1992): Disconnected groups of Lie type as Galois groups. J. reine angew.
Math. 429, 161-182
Fortschritte in der inversen Galoistheorie 101
Shafarevich, I.R. (1954c): On the problem of imbedding fields. Izv. Akad. Nauk. SSSR
18, 389-418 (Russian) [English trans!.: Amer. Math. Soc. Trans!. II 4, 151-183
(1956))
Shafarevich, I.R. (1954d): Construction of fields of algebraic numbers with given sol-
vable Galois group. Izv. Akad. Nauk. SSSR 18,525-578 (Russian) [English trans!.:
Amer. Math. Soc. Trans!. II 4, 185-237 (1956))
Shafarevich, I.R. (1958): The embedding problem for split extensions. Dok!. Akad.
Nauk SSSR 120, 1217-1219 (Russian)
Shafarevich, I.R. (1989): Factors of descending central series. Math. Notes 45, 262-264
Shih, K.-y. (1974): On the construction of Galois extensions of function fields and
number fields. Math. Ann. 207, 99-120
Smith, G.W. (1991): Generic cyclic polynomials of odd degree. Comm. Algebra 19,
3367-3391
Smith, G.W. (1998): Some polynomials over Q(t) and their Galois groups. Math. Compo
(to appear)
Sonn, J. (1994a): Brauer groups, embedding problems, and nilpotent groups as Galois
groups. Israel J. Math. 85, 391-405
Sonn, J. (1994b): Rigidity and embedding problems over q>b(t). J. Number Theory
47,398-404
Speiser, A. (1919): Zahlentheoretische Satze aus der Gruppentheorie. Math. Z. 5, 1-6
Stoll, M. (1995): Construction of semiabelian Galois extensions. Glasgow Math. J. 37,
99-104
Tate, J. (1966): The cohomology groups of tori in finite Galois extensions of number
fields. Nagoya Math. J. 27, 709-719
Thompson, J.G. (1984): Some finite groups which appear as Gal(L/ K) where K ~
Q(Jl.n). J. Algebra 89, 437-499
Thompson, J.G., VOlklein (1998): Symplectic groups as Galois groups. J. Group Theory
1, 1-58
VOlklein, H. (1993): Braid group action via GLn(q) and Un(q) and Galois realizations.
Israel J. Math. 82, 405-427
VOlklein, H. (1994): Braid group action, embedding problems and the groups PGLn(q)
and PUn(l). Forum Math. 6, 513-535
Volklein, H. (1996): Groups as Galois groups. Cambridge University Press, Cambridge
Volklein, H. (1998a): Rigid generators of classical groups. To appear in Math. Ann.
Volklein, H. (1998b): On linearly rigid tuples. Preprint, Gainesville
Wagner, A. (1978): Collineation groups generated by homologies of order greater
than 2. Geom. Dedicata 7, 387-398
Wagner, A. (1980): Determination of the finite primitive reflection groups over an
arbitrary field of characteristic not 2, I-III. Geom. Dedicata 9, 239-253; 10, 183-
189; 10, 475-523
Weissauer, R. (1982): Der Hilbertsche Irreduzibilitatssatz. J. reine angew. Math. 334,
203-220
Witt, E. (1936): Konstruktion von galoisschen Korpern der Charakteristik p zu vorge-
gebener Gruppe der Ordnungpf. J. reine angew. Math. 174, 237-245
Yakovlev, A.V. (1964): The embedding problems for fields. Izv. Akad. Nauk. SSSR 28,
645-660 (Russian)
Yakovlev, A.V. (1967): The embedding problem for number fields. Izv. Akad. Nauk.
SSSR 31, 211-224 (Russian) [English trans!.: Math. USSR Izv. 1, 195-208 (1967))
Zalesskii, A.E., Serezhkin, V.N. (1980): Finite linear groups generated by reflections.
Izv. Akad. Nauk SSSR Ser. Math. 44, 1279-1307 (Russian) [English trans!.: Math.
USSR Izv. 17 (1981), 477-503)
From Class Groups to Class Fields
Michael E. Pohst
1 Introduction
Since the first printing of the book [35] by H. Zassenhaus and the author in 1989
algorithmic algebraic number theory has attracted rapidly increasing interest.
This is documented, for example, by a regular meeting ANTS (algebraic num-
ber theory symposium) every two years whose proceedings [1], [6] give a good
survey about ongoing research. Also there are several computer algebra packages
concentrating on number theoretical computations. At present the most promi-
nent ones, which are available for free, are Kant [12], Pari [2] and Simath [27].
Kant comes with a data base for algebraic number fields, already containing
more than a million fields of small degree. Kant is developed by the research
group of the author at Berlin and will be presented in some detail in section 5.
We note that almost all of Kant and Pari is also contained in the Magma system
[4].
In the sequel we shortly discuss the improvements which were obtained in
Berlin for the computation of the important invariants of algebraic number fields
during the last six years. During that time the interest gradually turned from
absolute extensions to relative extensions of number fields. If subfields exist the
information about the invariants of those subfields can be lifted and used in the
field under consideration. This relative point of view allows to do computations
in fields of much larger degrees and has important applications, for example to
class field computations. We were able to compute integral bases of fields of
degree beyond 1000 and to calculate Hilbert class fields of degree over 100 for
base fields of degree 3 and 4. Recently the latter algorithm was generalized to
the computation of ray class fields. This and other results from the past year are
presented in greater detail in sections 3 and 4.
extensions over the rationals. The algorithms of [35] for unit group and class
group computations were considerably improved. Major contributions were made
by J. von Schmettow in his thesis [36], by F. HeB in his diploma thesis [26] and
by the author [34]. J. von Schmettow developed two new algorithms, one for
principal ideal testing and the other one for enforcing relations between prime
ideals. Later F. HeB compared all existing methods for class group computations
and developed efficient heuristical strategies. In parallel, improved versions of
continued fraction expansions basing on lattice basis reduction speeded up the
computations of independent and fundamental units [5], [34]. It was observed
that the relations calculated for determining the class group also helped for unit
computations and both approaches were combined. Thus it became possible to
do such computations in number fields of degree beyond twenty. It also became
obvious that the treatment of number fields of larger degree was pushed to a
limit.
This and the fact that higher degree number fields of interest are rarely
primitive motivated us to begin the study of relative extensions. A first step into
this direction had already been done by W. Bosma and the author when they
developed methods for computing pseudo bases for ideals in relative extensions
in [3]. The next step consisted in generalizing ideas of Sommer [40] for the
calculation of generating elements of the integers of relative quadratic extensions
[10]. Then we concentrated our efforts to radical extensions of arbitrary degree.
By adjoining appropriate roots of unity Kummer theory could be applied to
solve the problem for relative extensions of prime degree, and hence of arbitrary
degree. The resulting algorithms in the thesis of M. Daberkow [9] then allowed
the calculation of relative integral bases (of generators of the ring of integers) for
number fields of total degree beyond 1000. The fast methods for unit and class
group computations combined with the newly gained experience in the use of
Kummer theory put us into a situation in which computing Hilbert class fields
became a natural goal. We developed an algorithm who could compute Hilbert
class fields of arbitrary fields of small degree (say, 3, 4 or 5) and class numbers
divisible by small primes [13]. The latter restriction is needed since for every
prime number p dividing the class number we must adjoin a primitive p-th root
of unity to our ground field and then compute the unit and class group of that
new field. This is the crucial step in the application of Kummer theory, even
though we can assume GRH in this case. Namely, the proved correctness of the
unit and class group is not required since once we know a generating equation for
the (likely) Hilbert class field, an isomorphism between the original class group
and that field can be quite easily determined by other methods.
A generalization of the methods for computing Hilbert class fields to the
calculation of arbitrary ray class fields was to follow. The crucial step in the
calculation of ray class groups, the calculation of suitable generators for the
multiplicative group of a residue class ring, was overcome by the use of Hasse's
one units in the diploma thesis of S. Pauli [33]. The remaining steps are the
result of ongoing research [17] and will be explained in greater detail in the next
section.
From Class Groups to Class Fields 105
We note that there are other methods for computing class fields over spe-
cial ground fields. For ray class fields over imaginary quadratic fields there is
the classical approach via complex multiplication. Using normalized functions
[38] this approach is indeed very fast and also allows the handling of large cyclic
factors. But this method has a severe disadvantage, it does not allow the compu-
tation of arbitrary class fields. In practice, this restricts the applicability to ray
class groups whose order is not much larger than 100. Otherwise the generated
polynomials become extremely large.
The use of analytical methods for other base fields via Stark units is under
investigation. Though this is based on an unproven conjecture, once results have
been obtained they can be verified unconditionally. At present, these methods
are studied in Berlin and in Bordeaux. Results were so far obtained only for
totally real fields of degree ~ 4 and discriminant < 1000.
Besides these results we obtained several other important ones on related
topics. The round 2 algorithm of Zassenhaus was generalized to relative exten-
sions in the diploma thesis by C. Friedrichs [18]. A paper of Dixon [14] gave us
the basic ideas for the computation of all subfields of a given number field. A
thorough study led to important improvements and in the sequel to the best
known algorithm for this task [31]. Then J. Kliiners had the idea to accumulate
knowledge on the subfield lattice of the field under consideration which speeded
up the method in cases, where many subfields exist. Thus, for example, Galois
fields could be handled much easier and faster than before. The relation between
subfields and subgroups of the Galois group guided him to develop powerful
methods for the calculation"of Galois groups of normal (absolute) extensions
and of abelian (relative) extensions in his thesis [30].
Two other theses were written on the solution of relative norm equations by
A. Jurk [29] and C. Fieker [16]. The second result is a natural generalization of
the absolute case and as efficient as one could hope for the general case in which
no exact analogue of LLL reduction is yet available. The results on integral bases
and fundamental unit computations were transfered to function fields over finite
fields by M. Schornig [39] who also developed the necessary reduction methods
from the geometry of numbers in that case. K. GeiBler extended earlier results
of Y. Eichenlaub and M. Olivier [15] for computing Galois groups to degree 12
in her diploma thesis [23] and is about to finish degrees up to 15 in the near
future.
Also, we developed new algorithms and improved known ones for solving
diophantine equations. Together with I. Galil and A. Petho methods for solving
index form equations in quartic fields were obtained in a series of papers. The
most complicated case of primitive quartic fields was eventually solved in [19],
where we reduced the task to the solution of Thue equations. Special fields of
degree larger than 4 were also studied recently [20], [21]. In his thesis [42] K.
Wildanger improved known methods for solving unit equations. His methods
can handle unit ranks up to ten (formerly only five). Consequently, he can solve
index form equations in cyclic number fields up to degree 23. Thus he proves a
conjecture of Bremner for all prime numbers p ~ 23, former results were only
106 M.E. Pohst
map p f-t [¥] can be multiplicatively extended to 111K/k yielding the Artin
map:
cPK/k : IllK/k --t Gal(K/k) : a = II pap II [-K/k]a .
f-t
p
p p p
Then K is the class field of k belonging to the ideal group H if and only if
Gal(K/ k) is isomorphic to 1m/ H, the isomorphism being induced by the Artin
map. By the famous existence theorem of class field theory there exists a class
field for every ideal group and every abelian extension is the class field belonging
to a suitable ideal group. The theoretical results of class field theory which will
be used in the sequel are adopted from [32] and [28].
Our goal is to develop an algorithm for the computation of the class field
belonging to a given ideal group H following the lines of the proof of the existence
theorem given in [32]. In a first step the task is transfered to an overfield E of k
which contains "sufficiently many" roots of unity. In E we can apply Kummer
theory and calculate class fields via the Artin map. Afterwards the class field
over k which we are actually interested in is obtained as a suitable subfield of a
classfield of E. For its calculation we again make use of the Artin map.
From Class Groups to Class Fields 107
In order to make the algorithm efficient the task for general ideal groups H
is reduced to smaller problems in which the corresponding groups fI are cyclic
of prime power order. Before we do this we show how images of ideals under the
Artin map can be explicitly calculated.
In the remainder of this section we assume that K / k is an abelian extension
of degree n. We list several useful properties of the Artin map:
- If K is an intermediate field of the abelian extension L/k, then the Artin
map satisfies: lPL/k IK = lPK/k.
- If L/k is finite, then lPLK/L = lPK/kNL/k for those ideals of L which are
coprime to 7JK/k.
- If m is divisible by all ramified primes of K/k, then the Artin map restricted
to fm is surjective.
- Any module m for which Pm is contained in the kernel of the Artin map
restricted to fm is called admissible. Admissible modules always exist.
It is important that we do have an efficient method for the actual computation
of the images of ideals under the Artin map. For this we assume that we know
all elements, say Ul, ... ,Un , of Gal(K/k). They can be computed with mehods
from [30], for example.
Since the Artin map is the multiplicative extension of the Frobenius map,
we start with the calculation of Frobenius automorphisms. Let p be a non zero
unramified prime ideal of Ok. The corresponding Frobenius automorphism up = U
satisfies
U(x) == xN(p) mod POK
for all x E OK. Of course, it suffices to choose the elements x from a full set
of representatives of OK/POK. Assuming the knowledge of Gal(K/k) we simply
compute xN(p) mod POK and compare this with u(x) mod POK for all U E
Gal(K/k). If we do this for sufficiently many elements, which generate generate
Kover k, for example, we can easily identify U E Gal(K/k).
In the case of Kummer extensions this procedure can be speeded up sub-
stantially. Let us assume that K := k(y) with y = \f1i and that k contains a
primitive n-th root of unity (. Without loss of generality we may also assume
that Ui(y) = (iy for 1 ~ i ~ n. If we set N(p) = qn+r with 0 ~ r < n we obtain
xN(p) = J.Lqyr, hence (i == J.Lqyr-l mod POK. Since pis unramified this implies
r = 1 and we find that for Kummer extensions the Frobenius automorphism can
be easily computed in k.
For the computation of lPK/k(a) of an arbitrary ideal a we have two options.
We can either make use of the definition of the Artin map, Le. we factor the
ideal into a product of prime ideals, calculate their Frobenius automorphisms
in Gal(K/k) and obtain lPK/k(a) as their product. The other approach is to
explicitly establish a surjection from the ray class group onto the Galois group.
The first method is certainly cumbersome for large ideals which seem to occur
frequently. For the second approach we need to compute the images of suitable
ideals which generate the ray class group. In our numerical experiments the
ray class groups usually had very few generators (in comparison with the group
108 M.E. Pohst
order). Hence, we suggest to try to generate the ray class group with small prime
ideals.
Considering the Artin map it follows immediately that the class field K of k
belonging to H is the composite of the class fields Ki belonging to Hi for i =
1, ... , t. The task of computing K can therefore be split into s smaller tasks,
namely to calculate all K i . In the sequel we therefore assume that IHI = pr
for a prime number p. We put E := k«(p.). Then F := KE is the class field
over E belonging to the ideal group HE := Pm,ENE/k(H) of I~. Because of
NE/k(Pm,E) ~ Pm the group HE can be defined with the module m, and we easily
compute HE = I~/HE = Pm,ENE/k(H) as a preimage of a homomorphism
between finite abelian groups. Also the approximation theorem tells us, that we
can use any multiple of min E for defining HE.
In the sequel we assume that IHEI = pr and that E contains the pr-th roots
of unity. We build a finite set 8 of valuations of E such that 8 contains all
valuations corresponding to divisors of m, of p, and to sufficiently many prime
ideals for generating the class group of E. We let s := 181 and denote by Us
the 8-units of E. Then (see [28], for example) the class field F which we are
interested in is a subfield of the abelian extension r := E( P::./US). Dirichlet's
unit theorem (for 8-units) shows that r = E( P\I€l, ... , P\I€.") for generators
€l, ..• ,€8-1 of the free part of Us and €8 a generator for the torsion units of 0E.
Hence, r is of degree (pr)s over E with Galois group isomorphic to C; •.
We illustrate the relation of these fields with the following diagram. For a
better readability we put n := pr and assume that the intersection of k and
Q«(n) is Q.
Since r is an abelian extension of E there is an admissible module msuch
that the mapping Cl m """"* Gal(r/ E) = C;. is surjective and that Pm is contained
in its kernel. Then m must be divisible by all ramified primes (and also by all
primes dividing m) and we need upper bounds for their exponents. We can get
them from the following lemma of [25].
m:= IT p IT pc(p)
p J-h pin
pl~G/E
with
c(p) := (vp(n) +p~ 1) eE/Q(p) + 1
is an admissible module for G / E.
From Class Groups to Class Fields 109
E( y'€l) E( ytt;)
We note that it is possible to compute the discriminant and the exact con-
ductor of Kummer extensions G/E. Although not difficult this is more time
consuming than to work with a (potentially) larger ray class group coming from
Hasse's estimate.
The properties of the Artin map show that the class field F in which we
are interested is that subfield of r which is fixed by all automorphisms of r j E
corresponding to ideal classes of elm which become trivial in HE. Therefore, we
begin by computing those classes and subsequently calculate the field fixed by
the corresponding automorphisms.
elm is given as a direct product of cyclic groups. The surjection t : elm -+ H
is represented by a matrix. Since we are only interested in subgroups of order
pT, the entries of the matrix are reduced modulo pr. Then the classes we are
looking for are in the null space of that matrix.
In the next step we compute the invariant field. Obviously, this can be done
by evaluating certain polynomials which are invariant under the automorphisms.
However, the degree of r/E being so large, this would be impractical. Instead,
we search directly for a Kummer sub-extension of degree dividing pT. A generator
of it can be chosen as apr -th root of an element of the form j.L := rr:=l
t~i with
o S ni < pT. The automorphisms in Gal(rjE) map each element P0; onto
an element (;' P0; for a suitable exponent mi. Therefore we can identify each
0" E Gal(r j E) with a vector m = (mj )15j58 E (ZjpTZ )8. Accordingly, we write
0" = O"lll. Then we obtain
and a necessary and sufficient condition for P\Iii being fixed by 17m is
•
Lnjmj == 0 mod pro
j=l
(i) We choose k:= ((l(Vl0), m:= 1080k and H:= Pm. Then the ray class group
is Clm ~ C 2 x C4 X C 9 . This example is explained in detail in [17]. For each of
the three cyclic factors of the ray class group we obtain the corresponding class
field by a generating polynomial over k.
x2 - (21 - 6v'iO) ,
(ii) Next let k be the maximal real subfield of Q«(27). It is generated over ((l by
~ := (27 + (27. As a congruence module we choose m := 81ok. The class group of
k is trivial and for the ray class group we obtain Clm ~ C27 . K/k is generated
by a root of the polynomial
+819x3 - 27x - 2 - 7e +
112 M.E. Pohst
(iii) We compute the Hilbert class field of k = Q(p) for a root p of x 3 + 28x + 175.
Clearly, k has one real and two complex conjugates. The discriminant of k is -
914683 (k has a power integral basis), the regulator of k is 4.328, and the class
group of k has the structure C6 x C12 • Hence, the class field r(k) of k is of total
degree 216 over Q. We compute r(k) = Q(Tl,T2,T3,T4), where the Ti are zeros
of the following polynomials Ji(x) (1 ~ i ~ 4):
Presently, there are about 600 internal functions installed, 550 additional prede-
fined user functions and comprehensive references are available. Because KASH
grows weekly, updates will be made more often than for the KANT-library.
Accessible from KASH is an SQL-database for number fields [41]. The database
is designed to give easy and fast access to more than one million number fields.
Currently the following invariants are stored (if known) and can be used as keys
in a selection:
Isomorphy can be tested with KASH. In a first step one can check some invariants
and if all tests are successful there is the possibility to choose between several
algorithms for proving the isomorphy.
The underlying SQL-database (PostgreSQL) is public domain and available for
every system supported by KASH.
As an example we will find all totally real cubic fields with discriminant less
than 10,000 and common inessential discriminant divisors. It is well known that
a sufficient and neccessary criterion for this is that exactly three different prime
ideals divide 2, so the following program is straightforward:
kash> DbOpen(); # open the database
info: last update: Tue Dec 17 12:45:01 MET 1996
true
kash> query := # we are interested only
> "degree=3 and disc < 10000 and [number of real zeroes]=3";;
kash> # in small totally real cubic
kash> # fields
kash> DbCountQuery(query);
382
kash> DbQuery(query);
true
kash> L:= [];;
kash> repeat
> 0:= DbNextOrder();
> if o<>false and
> Orderlndex(o)mod 2 = 0 and # 2 has to divide the index,
> # this is a fast criterion
> Length(Factor(2*0)=3 then # exactly 3 different primes
> Add(L, 0);
> fi;
> until o=false;
kash> Length(L); # we found 14 fields
14
114 M.E. Pohst
Computing subfields
The following example demonstrates the computation of subfields: We start by
creating the equation order Z[p] for p6 + 108 = o.
kash> o:=Drder(Z,6,-108);
Generating polynomial: x-6 + 108
The computation of proper subfields of the quotient field Q(p) of 0 yields the
following list of equation orders.
kash> L:=DrderSubfield(o);
[ Generating polynomial: x-2 + 108
Generating polynomial: x-3 - 108
, Generating polynomial: x-3 - 108
, Generating polynomial: x-3 - 108
]
There are 3 subfields of degree 3 which are isomorphic but not identical and one
subfield of degree 2. Let Pi, P2, P3 denote the roots of x 3 - 108. L[i] denotes the
i-th equation order in L.
kash> rl:=Elt(L[2],[0,1,0]);
[0, 1, 0]
kash> r2:=Elt(L[3],[0,1,0]);;
kash> r3:=Elt(L[4],[0,1,0]);;
The elements look identical, but they are indeed different which is detected upon
moving them into the order o.
kash> EltMove(rl,o) # This produces the element
# (6*rho-2-rho-5)/12
[0, 0, 6, 0, 0, -1] / 12
kash> EltMove(r2,o); # This produces the element -rho-2
[0, 0, -1, 0, 0, 0]
kash> EltMove(r3,o); # This produces the element
# (6*rho-2+rho-5)/12
[0, 0, 6, 0, 0, 1] / 12
Any element of L[i] can be lifted in an analogous way.
Solution of Thue-equations
Given an irreducible form f E Z[X, Y] of degree:::: 3 and an integer a, we
compute all (x,y) E Z2 subject to f(x,y) = a.
Let f(X, Y) := X3+X2Y -6Xy 2+2y3 and solve f(x, y) = 2. The corresponding
number field F is created by a root of the irreducible polynomial f(X, 1) E Z[X].
kash> t := Thue([1,1,-6,2]); # [1,1,-6,2] are the coeffs of f.
X-3 + X-2 Y - 6 X Y-2 + 2 Y-3
kash> Solve(t,2); # Compute all solutions [x,y].
[ [ -724, -411], [-4, -11 ], [ -3, 1 ], [ -1, -1 ],
From Class Groups to Class Fields 115
[ 0, 1 ], [ 2, 1 ] ]
Additionally, we can solve Thue-equations up to sign on the right hand side, for
example X 7 + X 6 y - 6X 5y2 - 5X 4 y3 + 8X 3y4 + 5X 2 y5 _ 2Xy6 _ y7 = ±l.
kash> t := Thue([l,l,-6,-5,8,5,-2,-1])j
X-7 + X-6 Y - 6 X-5 Y-2 - 5 X-4 Y-3 + 8 X-3 Y-4 + 5 X-2 Y-5\
- 2 X Y-6 - r7
kash> Solve(t,l," abs")j # Compute all solutions [x,y].
[ [ -2, -1 ], [ -1, -1 ], [ -1, 0 ], [ -1, 1 ], [0, -1 ],
[ 0, 1 ], [ 1, -1 ], [ 1, 0 ], [ 1, 1 ], [2, 1 ] ]
Time: 10 ms
kash> OrderClassGroup(o);
[8, [ 8 ] ]
Time: 230 ms
kash> ImQuadHilbert(o);
used real precision is 52 places
F[l]
/
/
El [1]
I
El[2]
/
/
Q
F [1] x-8 + 14*x-7 + 83*x-6 + 23*x-5 - 353*x-4 - 463*x-3 \
- 126*x-2 - 69*x - 1
E 1[ 1] Given by transformation matrix
E 1[ 2] x-2 + 111
Time: 680 ms
Next the Hilbert class field of a totally real field with class number 3 is computed.
kash> 0 := OrderMaximal(x-3 - 21*x + 28);;
kash> OrderClassGroup(o);;
kash> OrderUnitsFund(o);;
From Class Groups to Class Fields 117
kash> 0;
F[1]
I
F[2]
/
/
Q
F Given by transformation matrix
[1]
F x-3 - 21*x + 28
[2]
Discriminant: 3969
Regulator: 12.594188956918676531369477850688989344447689770084
Units:
[5, -6, 2] [17, 0, -2]
class number 3
class group structure C3
cyclic factors of the class group:
<2, [1, 0, 1]>
kash> ClassField(o);
[ x-3 - 3*x + 1 ]
Time: 4970 ms
Finally, we compute class fields for 2 non-trivial modules, the 2nd module con-
taining all infinite places.
kash> ClassField(25*o);
[ x-2 - 5, x-3 - 3*x + 1, x-5 - 10*x-3 + 5*x-2 + 10*x + 1 ]
Time: 1146 s
Here, we are interested only in the 2-part of the class field.
kash> RayClassField(4*0, [1,2,3],2);
[ x-2 + [231543, -261208, 85050], x-2 + [4051, -4570, 1488], \
x-2 + [5, -6, 2] ]
Time: 4320 ms
Availability
KASH is freely available via
ftp://ftp.math.tu-berlin.de!pub!algebra!Kant!Kash.
It has been ported to the following architectures:
- HP 7000 (HP-UX 9.01 and 10.20),
- IBM RS 6000 (AIX 3.2.5),
- Intel 486 (Linux Kernel 1.3.0),
- Intel 486 (MS DOS 5.0, OS/2, Windows 95),
- Silicon Graphics (IRIX 5.3 and 6.2),
- Sun SPARC (SunOS 4.1.3),
- Sun SPARC (SunOS 5.5).
118 M.E. Pohst
References
24. H. Hasse, Uber den Klassenkorper zum quadratischen Zahlkorper mit der Diskrim-
inante -47, Acta Arithmetica 9 (1964), 419 - 434.
25. H. Hasse, Vorlesungen uber Klassenkorpertheorie, Physika Verlag, Wiirzburg, 1967.
26. F. Hen, Zur Klassengruppenberechnung in algebraischen Zahlkorpern, Diplomar-
beit, TU Berlin, 1996.
27. C. Hollinger and P. Serf, SIMATH - a computer algebra system, in Computational
Number Theory, eds. A. Pethii, M. E. Pohst, H. C. Williams, H. G. Zimmer, de
Gruyter Verlag, Berlin, New York 1991, pp.331-342.
28. G. J. Janusz, Algebraic Number Fields, 2nd ed., AMS Graduate Studies in Math.,
Vol. 7, 1996.
29. A. Jurk, Uber die Berechnung von Losungen relativer Normgleichungen in alge-
braischen Zahlkorpern, Dissertation, Diisseldorf, 1993.
30. J. Kliiners, Uber die Berechnung von Automorphismen und Teilkorpern algebrais-
cher Zahlkorper, Thesis, TU Berlin 1997.
31. J. Kliiners and M. Pohst, On Computing subfields, J. Symb. Compo 24 (1997),
385-397.
32. S. Lang, Algebraic number theory, Grad. Texts in Math. 110, Springer Verlag, 1994.
33. S. Pauli, Zur Berechnung von Strahlklassengruppen, Diplomarbeit, TU Berlin, 1996.
34. M. Pohst, On computing fundamental units, J. Number Theory 47 (1994), 93-105.
35. M. Pohst and H. Zassenhaus, Algorithmic Algebraic Number Theory, Cambridge
University Press, Cambridge, 1989.
36. J. von Schmettow, Beitriige zur Klassengruppenberechnung, Dissertation,
Diisseldorf 1991.
37. Schiinert, M. et al.: GAP - Groups, Algorithms and Programming, version 3 release
3, RWTH Aachen, 1993.
38. R. Schertz, Zur expliziten Berechnung von Ganzheitsbasen in Strahlklassenkorpern
uber imaginiir-quadratischen Zahlkorpern, J. Number Theory 34 (1990), 41-53.
39. M. Schiirnig, Untersuchungen konstruktiver Probleme in global en F'v.nktio-
nenkorpern, Thesis, TU Berlin, 1996.
40. J. Sommer, Einfuhrung in Zahlentheorie, Teubner Verlag, Leipzig 1907.
41. A. Weber, M. Daberkow, A Database for Number Fields, J. Calmet and C. Limon-
gelIi (Eds.), Design and Implementation of Symbolic Computation Systems, pp.
320-330, Springer LNCS 1128, Berlin - Heidelberg - New York, 1996.
42. K. Wildanger, Uber das Losen von Einheiten- und Indexformgleichungen in
algebraischen Zahlkorpern mit einer Anwendung auf die Bestimmung aller ganzen
Punkte einer Mordellschen Kurve, Thesis, TU Berlin, 1997.
A Gross-Zagier Formula for Function Fields
1 Introduction
In their famous article [7], Gross and Zagier proved a formula relating heights
of Heegner points on modular curves and derivatives of L-series of cusp forms.
We prove the function field analog of this formula. The classical modular curves
parametrizing isogenies of elliptic curves are now replaced by Drinfeld modular
curves and isogenies of Drinfeld modules. Cusp forms on the classical upper half
plane are replaced by certain functions on a Bruhat-Tits tree.
Our results and proofs are deeply influenced by [7]. We will not present every
detail of the calculations in this paper, for this we refer to [13].
We consider the rational function field K = lFq (T) over a finite field lFq of odd
characteristic and an imaginary quadratic extension L = K(VD). Let N be an
element in the polynomial ring lFq [T] with the property that each of its prime
divisors is decomposed in L. In particular we have [~] = 1.
In the next section we consider Heegner points x on the Drinfeld modular curve
Xo(N) with complex multiplication by OL. We associate to the global height
pairings < (x) - (oo),T,x((X)<TA - (0)) > for varying Hecke operators T,x a cusp
form gAo In Theorem 3.2 and Theorem 3.4 we evaluate the local heights which
yield the Fourier coefficients of gAo
The evaluations of q;A and gA are carried out independently from each other. In
section 4 we connect these results and show that q;A and gA are very closely
related. Hence we can prove our main result Theorem 4.2, which says that
t.L(f,A,s) 1.=0 can be evaluated using the cusp form gA coming from the
height of Heegner points.
2 L-Series
Let lFq be the finite field with q = pO< elements (p i= 2), and let K = lFq (T)
be the rational function field over lFq. We distinguish the finite places given
by the irreducible elements in the polynomial ring IF'q [T] and the place 00 of
K. For 00 of K we consider the completion Koo with normalized valuation Voo
and valuation ring 0 00 . We fix the prime 7r00 = T- 1 , then Koo = lFq ((7r oo )). In
addition we define the following additive character 'lj;oo of Koo: Take (T : lFq -+ C*
with (T(a) = expe;i TrlFq/lFp(a)) and set 'lj;oo(L:ai7r~) = 17(-al).
The oriented edges of the Bruhat-Tits tree of GL 2 over Koo are parametrized
by the set GL2(Koo)/rooK~, where
f(AX) = f(X)
for all X E GL 2(Koo )/roo K;'" and A E ro(N),
iii) f has compact support modulo ro(N), i.e., there are only finitely many ele-
ments X in r o(N)\GL 2(Koo )/roo K'(", with f(X) :f. o.
Any function on GL 2(Koo )/roo K'(", which is invariant under (~FqrJ) has a
Fourier expansion
!
with
f*(7r:,>.)= f((7rf~))7j;oo(->'U)dU,
K~/F.[TJ
Since (~Fq rJ) c ro(N), this applies to automorphic cusp forms. In this
particular case the harmonicity conditions imply
Hence all the Fourier coefficients of an automorphic cusp form f of Drinfeld type
are uniquely determined by the coefficients f*(7r~g>'+2,..\) for..\ E lFq[T].
To an automorphic cusp form f one can attach an L-series LU, s) in the following
way (cf. [19], [20]): Let m be an effective divisor of K of degree n, then m =
(..\)0 + (n - deg ..\)00 with ..\ E lFq [T], deg..\ :::; n. We define
The C-vector space of automorphic cusp forms of Drinfeld type of level N is finite
dimensional and it is equipped with a non-degenerate pairing, the Petersson
product, given by
U, g) f-t f .g.
Important examples for newforms are given in the following way: Let E be an
elliptic curve over K with conductor N . 00, which has split multiplicative re-
duction at 00, then E belongs to a newform f of level N such that the L-series
of E satisfies ([1])
L(E, s + 1) = LU, s).
This newform is in addition an eigenform for all Hecke operators, but we do not
assume this property at the beginning.
We assume in this paper that D is irreducible and its degree is odd. In prin-
ciple all the arguments work in general, but the details are technically more
complicated.
Let A be an element of the class group Cl(OL) of OL. For an effective divisor
m = (A)O + (n - deg A)OO (as above) we define
(A(S) =L r A(m)N(m)-s.
m2>:O
eao,AO (( ab)(7r~U))_e
cd 0 1 - ao,Ao
((7r~u))[d]o
0 1 D cu+d q
-v~(cu+d) ,
if Voo(C7r~) > voo(cu + d). Here [15] is the Legendre symbol at D and 8 denotes
the local norm symbol at 00. This is the important starting point for all our
calculations.
Now we combine the L-series of a newform f and the partial zeta function of A
to the function
L(f,A,s) = L
J*(m)rA(m)N(m)-s.
m2':O
L(N)(2S+1)=q~1 L [~]q-(28H)degk.
kEIF.[T]
gcd(k,N)=1
This is motivated by the following fact: Let E be an elliptic curve with conductor
N· 00 and corresponding newform ! as above and let ED be its twist by D. Then
we get for the L-series the identity
Starting with the transformation rule of the theta series we get ([12], d. also
[13]):
Proposition 2.1.
with
e s (7r:,O) = qs(de g D-m+2)+de g D-m L [~] q-(2o+1)deg d
#0
and (J.L i- 0)
The fact that we replaced ro(N) by rJl)(N) is not important. We can change
this with the trace given by the inclusion rJl)(N) c ro(N).
A more serious point is the fact that <Po is not an automorphic cusp form, that in
particular the harmonicity condition fails. We introduce the following function
on GL2(Koo)/rooK~:
-
<ps(X) = <po(X (7r0001)
0 ),
A Gross-Zagier Formula for Function Fields 127
Proposition 2.2.
with
and (p, i= 0)
es (1r:,p,) = qs(-deg D-2deg N+m+1)+de g D/2-m+l/2
L(N)(2S+1)L(f,A,s)=4(q~1) ! f·(p.-p.)·
rci')(N)\GL2(K~)/r~K~
Proposition 2.3.
This proposition and the preceding formula yields the following functional equa-
tion:
128 H.-G. Riick, U. Tipp
then
Z(s) = - [~] Z(-s).
The first step is not difficult. We have to evaluate the derivative of iPs resp. of
its Fourier coefficients to get with Proposition 2.1
But we still have in mind to express this value as a Petersson product of f with
an automorphic cusp form.
where tf! is an automorphic cusp form of Drinfeld type of level N, whose Fourier
coefficients tf!* (1l"~g.H2, oX) (only these are important!) are given by the formula
Unfortunately the limit in this formula does not converge. We would be happy if
we could find an Eisenstein series E, which is perpendicular to f, hence does not
A Gross-Zagier Formula for Function Fields 129
affect the integral, and whose holomorphic projection formula has the same "be-
haviour of divergence" as the one of <P. The difference <P - E and the converging
holomorphic projection formula for <P - E would then solve our problem.
Fortunately this can be achieved. We will not give the details in this paper, we
refer to [13]. If one carries out this program one gets the following result:
where IJ!A is an automorphic cusp form of Drinfeld type of level N with Fourier
coefficients
_ q+l C1hdEqdega(deg>'-2dega))+(Eqdega)C2}
2(q - 1) al.\ al.\
They can also be described by the following data. The Drinfeld module </> is
isomorphic to an ideal a in OL. If nlN is an ideal of OL which contains exactly
one prime divisor of every conjugated pair over the primes dividing N, then n-1a
defines another Drinfeld module and a cyclic isogeny of degree N between them.
The Heegner point x is therefore given by the ideal class of a and the ideal n.
We get the following analytic realization of the Heegner point x, simply denoted
by the same letter. Let Coo be the completion of an algebraic closure of Koo and
f} = Coo - Koo the Drinfeld upper half plane. Then Xo(N) is analytically given
by the quotient ro(N) \ f} compactified by adjoining finitely many cusps. Let
xEf}
B+-1I5 2_
x = 2A ,NIA, B = D mod A.
Then the lattice < x, 1> is isomorphic to the ideal a = AIF'q[T] + (B+v'D)IF'q [T],
which defines together with the ideal n = NIF'q [T] + ((3 + -II5)IF'q [T] with (3 ==
B mod N a Heegner point.
Let x be a Heegner point on Xo(N). We denote by erA the element in the Galois
group of HL/ L which corresponds via class field theory to A E Cl(OL). Then XlTA
is again a Heegner point. We form the divisors (x) - (00) and (XVA - (0), where
00,0 are different cusps on Xo(N). In addition let T>. be the Hecke operator of
A E IF'q[T]. The global height pairing < (x) - (oo),T>.((X)lTA - (0)) > is in the
center of our interest.
A Gross-Zagier Formula for Function Fields 131
How this is related to cusp forms will be shown in the next proposition.
Let T),. be a Hecke operator and let 9 be an automorphic cusp form of Drinfeld
type of level N. If we associate to (T),.,g) the Fourier coefficient (T),.g) * (7r;', 1),
we get a bilinear map between the Heeke algebra and the space of cusp forms of
level N. If gcd(A, N) = 1, then
The Drinfeld reciprocity law ([2]) shows that this bilinear map is a non-degenerate
pairing.
We apply this to the linear form on the Hecke algebra, which maps T),. to
< (x) - (00), T),.«x)<T A - (0)) >, and we get the following
We want to compare 9A with the cusp form .pA of the previous section. Therefore
we have to evaluate this global height pairing.
We evaluate each local pairing < , >v individually. For that purpose we distin-
guish the cases that v is a divisor of the place 00 or not.
In each case the evaluation of < , >v is reduced to the evaluation of the local
intersection product ( . )v on a regular model of Xo(N). For different points x
and y the product (x· y)v is well known. By abuse of language we write x, y for
the points on the generic fibre as well as for their horizontal divisors. Only the
self intersection product (x· x)v has to be defined. We do this by a "global link" ;
we choose a uniformizing parameter t at x globally. Then we define (cf. [6])
From now on we assume that N is square free, because in contrast to the case
of the modular curves for elliptic curves (cf. [9], chapter 13) it is not clear how
the special fibres of Xo(N) over the primes of N, which occur with higher mul-
tiplicity, look like.
+
IJ.#O
deg(IJ.N):::;deg(>'D)
. ((t(tL,D) + I)(L
cllJ.
[~] degc) - (L [~]) degtL) }.
cllJ.
The crucial point is to find the correct Green function. This is done in [18], we
state the result in the next proposition.
G(x,y) = --1
1
( q+1
- I og Ix - ')'YI2)
q- ~Ero(N)
2(q - 1) q Ixlil,yli
1~-,yI2:$lzlihlll.
with
2(q- 1)3
C1 := [GL 2 (lFq[TJ) : ro(N)]"
Sketch of proof. For the proof of this proposition we use the fact that a
regular algebraic model over Leo can be described by rigid analytic reduction
with respect to a suitable pure admissible covering (cf. [5], chapter V). This
covering comes from a canonical covering of fl described in [4] which has the
property that the intersection graph of the reduction is the Bruhat-Tits tree.
The special fibre of the model of Xo(N) is then described through the quotient
graph of the Bruhat-Tits tree modulo the group ro(N). From this description
one sees that the horizontal extensions of two points x, y intersect in the special
fibre if and only if Ix - ')'y12 < !Xlil!yli for some, and the multiplicity of the
intersection is
log Ix - ,yl2 .
~Ero(N)
q Ixlil,yli
l Z -1'yI 2 :S:lzli 1")"!lli
The intersection pairing gives the local Neron-pairing only if one of the divisors
has trivial intersection with every fibre component. Whereas for the finite places
this is more or less automatically fulfilled, it is an essential part of the local
height pairing at 00. To achieve this condition one has to add a rational linear
combination of the fibre components to one of the divisors depending on the
other divisor. This is done by solving a system of linear equations given by the
intersection matrix. This matrix is totally described by the intersection graph
and it can be viewed as a special value of some kind of generalized Laplacian
operator for functions on graphs similar to those studied by H. Bass and others.
The solutions are described by generating functions associated to path lengths
in the graph. The limit s -+ 1 comes in because the generalized Laplacian is
invertible whereas the intersection matrix is not. The solutions therefore have
a pole at s = 1 and the residue can be calculated explicitly by reducing to the
case of the group GL2(lFq [TJ). These solutions, which are now functions on the
tree given by certain sums over,), E ro(N), can then again be expressed through
functions on fl, where the sum splits into the finite sum over, : Ix _ ,yl2 :::;
Ixlil!Yli summing up a constant and the infinite sum over, : Ix-,yl2 > !x!iI!Yli'
as stated in the proposition. 0
134 H.-G. Ruck, U. Tipp
by
q+l
2(q _ 1) + 2logq Ixli + 2logq 8x 18ul
with u according to the global definition of the self intersection (x,x)v. The value
I I
logq ~~ can be expressed by logq 1L1(x)l, where L1 is the discriminant function
of a rank two Drinfeld module (cf. [3]), and it can be computed by the zeta
function of the function field L.
If the points x or y are cusps of Xo(N), the Green function G(x,y) can be
computed by Eisenstein series which come from a limit process out of Proposition
3.3 [18, Prop. 8,9J.
With this Green function calculations as in [7J yield the following theorem (cf.
[13]):
- ~+\lim(
2 q - 1 .~o
"
L...J
rA((I1,N-AD)) (L[D])(t(J1o,D)+I)
c
deg(I'N»deg('>"D) ell'
_ q+l ClhL(Lqdega(degA-2dega))+(Lqdega)C2}
2(q - 1) a I'>" a I'>"
with
A Gross-Zagier Formula for Function Fields 135
and
4 Main Results
In this section we combine the results of the previous two parts. We recall the
assumptions: D E lFq [T] is irreducible of odd degree and N E lFq [T] is square
free, whose prime divisors are decomposed in L = K(VD).
We compare Theorem 3.2 and Theorem 3.4 with Theorem 2.5, where we have
to choose the constant C2 in an obvious way, and we get:
for all), E lFq[T] with gcd().,N) = 1. Hence q;A and (q -1)/2· q-(deg D+1)/2 gA
differ only by an old form.
We want to apply this theorem to the case of elliptic curves. Let X be a character
of the class group Cl(OL), we define
Proposition 4.2.
~L(f
as "X s) 1 _
._0
= q -1
2
q-(degD +I)/2 h- 1
Lx'
< c(f) c(f)
X
> f.J
/ f·j.
ro(N)\GL2(Koc )/rocK~
At the end we want to mention just one consequence of Theorem 4.4. We consider
the elliptic curve E over L. Its L-series is L(E, s)L(ED' s), which in our case has
a zero at s = 1 according to the functional equation of Corollary 2.4.
In the function field case it is known ([15), (14)) that the analytic rank of E/ Lis
not smaller than the Mordell-Wei! rank of E(L). Therefore Theorem 4.4 implies
A Gross-Zagier Formula for Function Fields 137
Corollary 4.1. If
8
8s (L(E,s)L(ED'S» 1.=1# 0,
then the Birch and Swinnerton-Dyer conjecture is true for E, z.e. the analytic
rank and the M ordell- Weil rank of E / L are both equal to 1.
References
1. P. Deligne, Les constantes des equations fonctionelles des fonctions L, in: Modular
functions of one variable II, LNM 349, Springer (1973), 501-597.
2. P. Deligne, D. Husemoller, Survey of Drinfeld modules, Contemp. Math. 67 (1987),
25-91.
3. E.-U. Gekeler, Drinfeld modular curves, LNM 1231, Springer, 1986.
4. E.-U. Gekeler, M. Reversat, Jacobians of Drinfeld modular curves, Journal reine
angew. Math. 476 (1996), 27-93.
5. L. Gerritzen, M. van der Put, Schottky groups and Mumford curves, LNM 817,
Springer.
6. B. Gross, Local heights on curves in "Arithmetic geometry", (ed. Cor-
nell/Silverman) Springer 1986.
7. B. Gross, D. Zagier, Heegner points and derivatives of L-series, Invent. Math. 84
(1986), 225-320.
8. D. R. Hayes, Explicit class field theory in global function fields, Studies in Alge-
bra and Number Theory, Advances in Mathematics supplementary studies vol.6,
Academic Press (1979), 173-217.
9. N. Katz, B. Mazur, Arithmetic moduli of elliptic curves, Princeton, Princeton
University Press, 1985.
10. H.-G. Riick, Theta Series of Imaginary Quadratic Function Fields, manuscript a
math. 88, 387-407, 1995.
11. H.-G. Riick, Poincare Series of Drinfeld Type, Arch. Math. 68, 190-201, 1997.
12. H.-G. Riick, L-series of automorphic cusp forms of Drinfeld type, Proceedings
of the Workshop on Drinfeld modules, modular schemes and applications, Alden-
Biesen, World Scientific, 1997, 311-329.
13. H.-G. Riick, U. Tipp, Heegner points and L-series of automorphic cusp forms of
Drinfeld type, (manuscript).
14. T. Shioda, Some remarks on elliptic curves over function fields, Asterisque 209
(1992), 99-114.
15. J. Tate, On the conjectures of Birch and Swinnerton-Dyer and a geometric ana-
log, Sem. Bourbaki 1965/66 in: Dix Exposes sur la Cohomologie des Schemas,
Amsterdam, North Holland, 1968.
16. U. Tipp, Hohenberechnungen von Heegnerpunkten auf Drinfeldschen Modulkur-
ven, Dissertation, Preprint No.10, IEM Essen, 1996.
17. U. Tipp, Local height pairings of Heegner points on Drinfeld modular curves, Pro-
ceedings of the Workshop on Drinfeld modules, modular schemes and applications,
Alden-Biesen, World Scientific, 1997, 344-356.
18. U.Tipp, Green functions for Drinfeld modular curves, (submitted).
19. A. Weil, Dirichlet Series and Automorphic Forms, LNM 189, Springer.
20. A. Weil, On the Analogue of the Modular group in Characteristic p, in "Functional
Analysis, etc", Proc. Conf. in honor of M. Stone, Springer 1970, 211-223.
Part B
Algorithmic
Commutative Algebra and
Algebraic Geometry
Extremal Lattices
o Introduction
This paper deals with discrete subgroups of euclidean vector spaces, equivalently
finitely generated free abelian groups (isomorphic to zn for some n E N) together
with a positive definite quadratic form. Such a structure will be called a lattice for
short, typically denoted by L,M, ... ,with values (v,w) E IR of the bilinear form
(where v,w E V J L, the enveloping IR-vector space). The general background
of this report is provided by the sphere packing problem (construction of lattices
with large minimum), by the theory of modular forms, and by the theory of finite
matrix groups. Almost all lattices of interest for one of the mentioned areas are
"algebraic" or even "rational", by which we mean that the form takes rational
values on them: (v, w) E Q for v, w E L. After rescaling, that is, multiplying the
form with some positive integral constant Q, a rational lattice becomes integral:
(v,w) E Z for all v,w E L. The rescaled lattice will be denoted by "L. By
definition, a lattice is integral if and only if it is contained in its dual lattice
H.-G. Quebbemann has observed that quite a few individual lattices which are
well known, or even famous, in one or several of the above mentioned areas (like
the Leech lattice A24 , the Barnes-Wall lattice BW16 , or the Coxeter-Todd lattice
K 12 , the Quebbemann lattice(s) Q32), share a common structure: after integral
normalization, they are similar to their respective dual lattice. That is, there
exists a bijective linear map u : V -t V, a similarity, and a positive integer f.,
the similarity factor, such that
We briefly recall a few standard definitions. A lattice (in the sense of number
theory) is a pair (L, b), where L is a free Z-module of finite rank, say rankL = n,
and b : L x L ~ Q is a positive definite symmetric bilinear form. An isometry
between two lattices (L 1,b1) and (L 2,b2) is a group isomorphism r.p: L1 -t L2
which is compatible with the forms: ~(r.px,r.py) = b1 (x,y) for all x,y ELI'
Sometimes L is considered to be embedded into a rational vector space, and
by defining V = L ®z Q we obtain an equivalent category if we regard lattices
as triples (V, bj L), where (V, b) is a "rational quadratic form", that is, a vector
space with a positive definite symmetric bilinear form, and L a "lattice on V",
that is a finitely generated subgroup which spans the vector space V. In this
form, the definition immediately extends to the case of a totally real number
field F instead of Q, replacing Z by the ring of integers 0 F, and requiring that
the form is totally positive definite: b(x, xy > 0 for all embeddings 0' : F ~ lR
and all x E V, x 1= o. At some places, we also deal with the case that F is a
totally complex number field endowed with an involution a H a whose fixed
field Fo is totally real, and with hermitian forms h : V x V -t F with respect
to the specified involution: h(ax,{3y) = aph(x,y). If necessary, we will speak
of quadratic lattices, respectively hermitian lattices. Notice that in the number
field case, lattices need not be free as modulesj the relevant structure theory of
such modules (finitely generated and torsion free) can be found for instance in
[34]. Lattices over OF are always free if the class number of F is one. In the
rational case, the form b will usually not occur in the notation, we just write
(x,y) := b(x,y). Observe that often the letter L refers to the whole structure
(V, bj L) or at least (L, b), and not just to the module L.
Isometry of lattices will be denoted by L ~ M.
If (L, b) is a lattice, and a E F is totally positive, then OIL denotes the
"scaled" lattice (L, ab)j similarly in the hermitian case with a E Fo. From the
introduction, we recall the definition of the dual lattice L#. A lattice is called
integral if L ~ L# and even if b(x, x) E 20F for all x E L.
Two lattices are in the same genus if they become isometric over all comple-
tions op of OF
The class number of a lattice, or rather of its genus g = gen L is the number of
isometry classes contained in g, often denoted by h(Y).
The local theory of lattices, that is, the theory of lattices over op, and the
theory of genera are well understood. Details will not playa role in this paper. We
only mention the fact that every lattice over op posseses a Jordan-decomposition
Extremal Lattices 143
.n
3=-8
(pi)"j.pnj,p with Cj,p = (detL j )
p
and nj,p = dimLj.
We do not describe a dyadic symbol in full generality here. Among other things,
the parity "even/odd" of the Jordan component belonging to q = 2t is recorded
by a subscribed ql respectively q].
As an example, consider the binary lattice B given by the Gram matrix
( ~ ~ ), of determinant 15. Over all completions Zp, p =f:. 2, it can be diagonalized
as B ® Zp ~ (2,30) ® Zp = (2,2·3· 5) ® Zp. So the 3-adic symbol is 1-13+1,
and the 5-adic symbol is 1-15+1. The 2-adic symbol is 112, where the +-sign
expresses the fact that detL = detLo,2 == ±1 mod 8. We shall usually suppress
the unimodular parts of the local symbols, since the dimensions no,p and the
signs cO,p are determined by the total dimension and determinant and the other
nj,p, Cj,p, j ~ 1. Furthermore, we shall indicate the parity of the unimodular
component at p = 2, which equals the parity of the total lattice, and the total
dimension n by writing the symbol as In(. .. ) respectively ][n( ... ) and omitting
also the component 2~/,;no.2. So our final symbol for the above example B will
be ][2(3+15+1). Similarly, the lattice (t l) can be diagonalized as (1,3·5) ®Zp
for p =f:. 2, and thus has the symbol ][2(3- 15- 1).
In the following, we use the notation mill if m and l are integers, m divides
l, and m and lim relatively prime. In this case we say that m is an exact divisor
oU.
144 R. Scharlau, R. Schulze-Pillot
We now come to the most important definition of this section, the significance
of which has first been pointed out by H.-G. Quebbemann.
We recall a few well known facts from the analytic theory of quadratic forms, or
lattices. If L is an even lattice of even dimension n = 2k and level £, we denote
by
8LCq) = L rLCm)qm, rLCm):= I{x ELI (x,x) = 2m}1
m~O
its theta series, where as usual q = e 2"iz and z is a variable in the upper half
plane. This is a modular form of weight k for the group ro(£) and a certain
quadratic character C : ro (£) -+ {±1}. Using standard notation for the action of
PSL 2(Ji) on modular forms of weight k, this means that
Proposition 1.4. For a prime number £ ::; 23, even weight k and X(Wi)
(_1)k/2, X trivial on ro(£), the dimension of Mk(£,X) is equal to
Extremal Lattices 147
with
0 lor R = 1,2,3,5,7,13
90(R) = { 1 lorR = 11,17,19
2/0rf = 23
The following dimension formulae were derived from [62]; see [48] for the levels
6,14,15.
Proposition 1.5. For even weight k and X trivial on Fo(R) one has:
dimSk(lO, X) = 41 . [3(k_l)+(_1)k/2
2 - 2 -llX(W2) + (_1)k/2 (X(Ws) + X(WlO))
]
{+1
with
1]= ~/k~2,4mod8
-1 ilk=0,6mod8.
o k := 4 mod 6
8k -1 4 2
tr(W1 ) = - 3 - + 3' { -1 k:= 2 mod 6 ,
+1 k:= omod 6
2 k:=4mod6
tr(W3)=(_1)k/2~+(_1)k/2~ { ~1 k:=2mod6,
+1 k:= 0 mod 6
tr(W7 ) = 0,
tr(W21 ) = 2(_1)k/2.
1.3 Extremality
In this subsection, we specify the general setup in which extremality of a lattice
can be defined in terms of its theta series.
Definition 1.6. a) Let M be a subspace of Mk(L). We say that extremality is
definable with respect to M if the projection M --* «Y to the first d = dim M
coefficients of the q-expansion
F= 1+ E amqm
m?:d
148 R. Scharlau, R. Schulze-Pillot
respectively
Example. Consider the genus 9 = .0"8(3- 45- 4) of dimension 8 and level 15,
represented for instance by B .1 B' .1 B' .1 B', with B and B' as above. This
genus does not contain "obvious", i.e. decomposable strongly modular lattices.
(The reason is that .0"4(3252) has only two classes B .1 B and B' .1 B' and thus
no strongly modular lattice). However, complete enumeration of g, with class
number h((i) = 68 shows the following:
Extremal Lattices 149
(a) 9 contains exactly 2 (classes of) strongly modular lattices; they have mini-
mum 4.
(b) 9 contains a unique lattice with minimum 6; this lattice is not strongly
modular.
(c) The strongly modular lattices with minimum 4 are strongly modular ex-
tremal.
Part (c) is verified as follows: for the appropriate character X given by
X(W3 ) = X(W5 ) = -1, the space M4(15,X) is two-dimensional, the space of
cusp forms 8 4 (15, X) is one-dimensional, and a non-zero cusp form En>lanqn
has a1 f O. Thus extremality is definable, and strongly modular lattices with
minimum 4 are extremal.
Notice that parts (a) and (b) of this example show that even without using
the notion of extremality, we can state the fact that the largest minimum in a
genus is not always attained by a strongly modular lattice
one has
dt:= dim Mk(£'X) = 1 + [k1~£)] .
150 R. Scharlau, R. Schulze-Pillot
ii) In the cases above the extremal modular form has integral Fourier coefficients
ak(n) (that are even for n > 0). Moreover, one has ak(dt) > 0 for all k and
ak (dl + 1) < 0 for k large enough (depending on £, X).
Proof. i) has been proven in [47,48], where it is also shown that Mk(£,X) has a
basis consisting of the functions e1vLl~, where N = N(£, X) denotes a strongly
modular lattice of minimal dimension ko = ko(£) with respect to £ and X,
2.
Llt(z) = II l1(mZ)a.<tY.
mil
This implies immediately that the extremal modular form has (even) integral
Fourier coefficients and allows to deduce the assertion about a(dl) and a(dl + 1)
in the same way as in [26]; some of the details have been carried out in [64]. We
sketch the main steps briefly: With
= L a(s)cp(q)8
00
eN(k/ko(l» (q)
8=0
in a sufficiently small disk Iql ~ r < 1, where eN(q) and cp(q) denote the
expansions with respect to q = exp (21riz) of these functions. For the coefficients
a(s) one has
and
a(dl + 1)
a(dl)
is asymptotic to
e~' (l)/ko(l» (e-21TYO)
LlI(e-21TYo)
where Yo is the (unique) positive zero of the derivative of (y f---t Ll I (e- 21TY »and
eN and Ll, are again viewed as functions of q. Moreover one has
a(dl) = -a(dl )
a(dl + 1) = -a(d, + 1) + a(dt)·
2k1(£)
( 0"0(£)
([_k
kl (£)
] +1) +(k1(£)
ko(£)
_ k-kl(£) [rdmJ))
ko(£) ,
where Cl = Cl (£) is the first Fourier coefficient of eN. The boundedness of the
quotient
Table 2.1
We calculated the extremal modular form in the weights below this bound
and found that the coefficient a(de + 1) becomes negative a little earlier than
this bound. The following tables give the values obtained for the jump weights
(Le., k with k1 (£)lk) and general weights.
Table 2.2
Table 2.3
Using the formulas above one can calculate from the extremal theta series
the coefficients a(de), a(d£+l) and finds that the quotient a:~(~~l) seems to tend
to its asymptotic value quite slowly, but from below, so that beyond the values
of k given above one should not expect any further extremal modular forms with
positive coefficient a( de + 1). In particular, there should be no extremal lattices of
the types considered and dimension bigger than the values of 2kmax (£, X) given
above (we will see that our list of known extremal lattices terminates much
earlier in all cases except £ = 23).
b) As in [26] one can also prove the existence of a bound kmax (£, x, v) for v E
N such that any modular form in Mk(£,X) with constant term 1 and vanishing
Fourier coefficients a(l), ... , a(de - v) of weight k > kmax (£, X, v) has at least
one negative Fourier coefficient (and hence cannot be a theta series).
c) If k = 2 and the character X on r;(£) is trivial the condition under which
extremality is definable just means that the modular curve H / r; (£) has no
WeierstraB point at the cusp 00. A more general notion of WeierstraB points
has been introduced by Petersson in [37] and investigated by Smart in [63]. A
detailed investigation of the WeierstraB points for the situations of interest here
and of the consequences for theta series of lattices will be the subject of future
work.
Remark 2. As in [48] our discussion of the extremal modular form in this
section is limited to those cases in which a direct generalization of the methods
from the case of level 1 is possible. In particular we do not discuss all strongly
152 R. Scharlau, R. Schulze-Pillot
modular genera oflevels 6, 14, 15 but only those in dimension 4r obtained by tak-
ing r-fold orthogonal sums of the 4-dimensional genera G 4(2-3+) == ff4(2i23-2)
(case6b), G 4(2+3-) == ff4(2j.3 2) (case 6a), G4(2+7-) == ff 4(2j.7 2),G4(3+5-) ==
ff 4 (3- 2 5- 2 ) with the G40-notation as in [48].
Remark 3. E. Rains informed us that he can prove the nonexistence of
strongly modular lattices in the genera r . G 4(2-7+) == ff4r(2i2r7-2r) (r odd)
and G 4(2+7-) .l r . G 4(2-7+) == ff 4r +4(2;-(2r+2)7-(2r+2)) (r odd) of dimension
4r resp. 4r + 4 and in certain other genera of levels 2N,4N with N == -1 mod 8.
We have received a preprint version ([49]) during the final revision of this article.
We conclude this subsection with a table of minima of extremal lattices for
the levels treated in Theorem 2.0.1, part i) up to minimum 12. An entry in
brackets means that the extremal modular form has a negative coefficient.
f1 2 3 5 6 7 111415 23
n
4- 2 2 2 2 2 4 4 4 6
6- - 2 4 4 (8)
82 2 2 4 4 4 6 6 6 (10)
10 - - 2 4 6 - - (12)
12 - 2 4 4 4 6 8 8 8
14 - - 4 6 8
162 4 4 6 6 6 (10) 10 10
18 - - 4 8 10 - -
20- 4 4 6 6 8 (12) 12 12
22 - - 4 8
244 4 6 8 810
26 - - 6 --10
28- 4 6 8 810
30 - - 6 - -12
324 6 610 10 12
34 - - 6 - -12
36- 6 810 10
404 6 81212
44- 6 81212
486 810
52- 810
566 810
60 - 812
64 61012
68 -10 12
72 810
76 -10
80 812
Table 2.4: Minima of (hypothetical) strongly modular extremal lattices
Extremal Lattices 153
The known (and rather trivial) classification of all these lattices shows that they
are all modular, and thus extremal. (The classification for £ = 2 and 3 can be
readily obtained from the classification in the higher-dimensional cases (16,2),
(12,3) in [53] by restricting to decomposable lattices.) The class numbers are
shown in the following small table.
n\£1237
4 - 1 11
6 --1
8 112
10 - - 3
12 - 3
16 2
Table 3.1: Some class numbers
£ 1 2 3 5 6 7 1114 15 23
n(£, 4) 24 16 12 886 4 4 4 2
h 24 24 10 5 8 3 3 3 3 2
ham 24 16 10 5 6 3 3 3 3 2
Table 3.2: Minimal dimensions for extremal lattices with minimum 4,
and class numbers
Other dimensions. The following table contains, for each level £, the dimen-
sions above the minimal dimension for which extremal lattices have minimum
4. The third row contains the class number of the respective genus, if this is
known. In the missing cases (for £ = 1,2,3), the class number is very large, and
a full classification at present is out of reach. The last row contains the number
of known isomorphism classes of extremal lattices. We see that extremal lattices
always exist. They are unique in the two cases (n, £) = (11,6), (7,8) of small
class number, and, more surprisingly, also for (3,14). In all other cases they are
most probably not unique.
Extremal Lattices 155
£ 1 2 3 5 6+ 6- 7 11
n 32 40 20 24 28 14 16 18 20 22 12 12 12 8 10 6
h 29163 48308284830 5
h ext ~ 22 ~ 3 ~ 3 ~ 1 ~ 1 1 6 ~ 1 ~ 1 ~ 1 4 6 4 1 4 1
Table 3.3: Extremal lattices of minimum 4 in non-minimal dimensions
self-dual Z[w, cs]-lattice J 3(5) used below in the case of minimum 8 and '- = 15.
A full classification of the genus without the use of a computer has been given
in [54].
1'- = 51 There exists a unique even unimodular lattice in dimension 4 over Z [¥].
Its orthogonal group is the reflection group of type H4; a Gram matrix is directly
read off from the Coxeter diagram
as follows: the diagonal entries are equal to 2, the ij-entry, for i # j, is -2 cos ;:.. ,
'J
20'1)
h~ ( a21 .
1 12
It also occurs in the work of Mimura [28], where hermitian lattices over imaginary
quadratic fields, generated by norm 2 vectors, are classified. (The significance of
the field Q( V-7) in this "direct" approach by the way is that a is the unique
imaginary quadratic number with trace 1 and norm 2.) The desired extremal
7-modular lattice is obtained from J 3 by ordinary transfer. It is isometric to the
Barnes lattice P6 and to the Craig lattice A~3).
Extremal Lattices 157
Ip. = 111 Due to the fact that N7r = 3, where 7r = 1+O, there is an obvious
binary unimodular Z[7r]-lattice with minimum 2, given by the Gram matrix
(! ;). From this a 4-dimensional extremal 11-modular lattice is obtained by
ordinary transfer.
Ip. = 141 An extremal 4-dimensional strongly modular lattice of level 14 is ob-
tained by ordinary transfer from the binary Z[a]-lattice (of determinant 2) given
by (! ~), where a = 1+0 as usual. The modularity with respect to the prime
2 is not directly given by a, but comes from the fact that (!~) is isometric
to its complex conjugate (cf. Proposition 1.3).
IP. = 151 The trace form tr~ (xy) on OF, where F = Q( -/5, A), clearly gives a
strongly modular lattice of level 15 and minimum 4.
Ip. = 11 One might suspect that many among, perhaps most of the 32-dimensional
unimodular lattices have minimum 4. However, despite the efforts of several
authors (notably Koch and Venkov, Koch and Nebe, and Blaschke-Requate),
only relatively few such lattice have been obtained by a structural algebraic
construction, and it is difficult to show that the lattices constructed are indeed
non-isomorphic. A few lattices have been constructed and distinguished using
particular automorphisms [46], [6]. A attempt to the systematic construction of
all extremal lattices regardless of their automorphisms is the notion of neighbor
defect J(L) of [22,23]. This number is defined as 32 minus the largest rank r of
the root system of a neighboring lattice M (the root system R(M) is necessarily
of the form rAt}. So the lattices of neighbor defect 0 are by definition the 2-
neighbors ofthe ordinary 'lifts' L(G), where G is a self-dual binary code oflength
n = 32 and minimal weight 8. The smallest values for 15 are 0,8, 12, 14, 15, and in
these cases the "defect lattice" is unique. The lattices with neighbor defect 0 and
8 are classified [22,23], [21], there are 5 + 10 of them. 7 pairwise non-isomorphic
lattices of neighbor defect 12 are known, by [7].
It is an easy consequence of the Minkowski-Siegel mass formula that there
are at least 8.45.10 51 classes of extremal even unimodular lattices in dimension
40; see Peters [38]. The first explicit example of such a lattice has been given
by McKay; see [12], Chapter 8.5. Later extremal unimodular lattices emerged
in a standard way from the investigation of self-dual binary codes G of length
40 and minimal weight 8; see above. Ozeki investigated in [35] these lattices for
three particular codes and showed that they have distinct Siegel theta series of
degree two. Using the above class number estimate, Peters showed in [39] that
the Siegel theta series of degree 2 are not a classifying invariant for the set of all
extremal even unimodular lattices in dimension 40. See also [8] and [12], Chapter
7.7, page 194, for further references.
158 R. Scharlau, R. Schulze-Pillot
If = 21 In dimension 20, two extremal lattices are known from [40), p. 49; see
also [6). A third one, with automorphism group of order 214 325 has been found
in the course of a projected complete enumeration of this genus, using a suitable
refinement of the computer program of [52). C. Bachoc and B. Venkov have
announced a proof of the fact that there are no further extremal lattices in this
genus.
In [1, 2], Theorem 6.7, it is shown, by a uniform construction using codes,
that, for all three dimensions in question, 20, 24 and 28, an extremal lattice
can be obtained from a suitable self-dual lattice over a maximal order of the
quaternion algebra ~,oo .
If = 31 For dimension 14, see [52). For dimensions 16 to 22, such lattices come
from self-dual hermitian lattices over the Eisenstein integers, by [15); see also
[2), Theorem 6.7.
If = 51 Complete enumeration of the 12-dimensional genus in [52) has shown
that there are 4 extremal lattices. A. Schiemann in [56) has shown that there are
no less than 29 classes of self-dual hermitian lattices over Z (A) giving rise to
such Z-lattices.
If = 61 For both genera in question, of dimension 12, we have obtained by com-
puter the full classification of all lattices, and the determination of the strongly-
modular ones among those with minimum 4. The 'plus-genus' ][12(2 6 36 ) is the
one coming from even self-dual hermitian lattices over Z (A). This genus has
been classified completely in [56). There are 161 classes, 41 of them have mini-
mum 4. It turns out that all 6 extremal lattices in ][12(2 6 36 ) have one or several
self-dual hermitian structures. Conversely, each hermitian lattice with minimum
4 and the correct number 84 of minimal vectors (there are 14 such lattices)
actually gives rise to a strongly modular Z-lattice.
If = 71 The classification of all lattices in dimensions 8 and 10 is again taken
from [52). The existence of a self-dual hermitian structure on such lattices is
mentioned in [2) and completely worked out in [56].
If = 111 See [52). No appropriate hermitian lattice exists, by [56).
e 1 2 3 5 6 711141523
n(e,6) 48 32 24 16 16 12 8 8 8 4
h 395318091 6
h ext ~ 3 ~ 3 ~ 1~ 1~ 5 0 1 1 2 1
Table 3.4: Extremal lattices with minimum 6 in minimal dimensions
Other dimensions. The following table collects the known information about
the existence of extremal lattices of minimum 6 above the minimal dimension.
The class numbers are very large and not known except for (n, e) = (10,11).
e 1 2 3 5 6 7 11
n 56 64 36 40 44 26 28 30 32 3420 2014 1610
hext ~ 1~ 1~ 3 ~ 1~ 1~ 1~ 1~ 1~ 1~ 1 ~ 1 ~ 1 2
it contains 4 lattices with minimum 6. Two of these are strongly modular, with
orthogonal groups of order 2432, respectively 25 32 ; the other two are modular,
but not strongly modular. (The latter lattices have larger groups, of orders 2433
and 213 32 ).
Description of the lattices in other dimensions.
I£ = 21 Already for n = 36, one may suspect that a large portion of this huge
genus of lattices consists of extremal ones. However, no attempt to prove a result
like this is known. A systematic construction of three such lattices B~:;:~, m =
3,4,5 is contained in [6] ("Quebbemann-Craig-Iattices"). They are self-dual over
Z[Al and admit an automorphism of order 19. Since the number 82080 of
minimal vectors in these lattices is sufficiently small and since one automorphism
of large order 19 is already known, the computer program of [41] allows to
calculate the orthogonal groups. The three groups are of different orders 24 .
32 . 5 . 19, 22 .32 • 19, 26 .3 3 ·5· 19 in particular, the lattices are pairwise non-
isometric. Another construction [2] which is partly systematic (a quaternionic
code construction) and partly ad-hoc (a 2-step neighboring) also gives a lattice
with the desired properties; it is very unlikely, but not completely impossible
that this lattice should be isometric to one of the B~:;'~.
In dimensions 40 and 44, an example again based on quaternions and codes
is given in [2]; nothing else is known.
1£ = 31 In dimensions 26, 28 and 32, such lattices have been obtained first in [2],
using codes. This method failed in dimensions 30 and 34. In these dimensions,
A. Schiemann has constructed extremal 3-modular lattices as a self-dual her-
mitian lattices, using a sophisticated computer search. (The same method was
succesfully applied also for the dimensions 26, 28 and 32.)
1£= 61 Using the same tools as in the case £ = 3, A. Schiemann has found a
lO-dimensional self-dual hermitian lattice over Zh,.C6). Its transfer to Z turned
out to be strongly modular.
1£ = 71 For n = 14, the existence is open. In [56], A. Schiemann shows that such
a lattice cannot have a self-dual hermitian structure over Z [Hp]. For n = 16,
it is readily checked that the well-known unique 8-dimensional self-dual Z[v'2]-
lattice with mm1mum 4 (see [16]) gives rise to a 7-elementary
lattice of minimum 6 by twisted transfer tr (1!Vif .-).(Notice that
162 R. Scharlau, R. Schulze-Pillot
tr (l!~(u + vV2)) = 2u+v and use some easy estimates.) A numerical verifi-
cation using Gram matrices proves strong modularity. Schiemann's work reveals
the surprising fact that also in this dimension, no appropriate Z [1+0]
-lattice
exists.
1£ = 111 These 10-dimensionallattices have been obtained by computer classifi-
cation of the whole genus in [52].
£ 1 2 3
n 72 80 88 48 52 56 60 36 38 40 42 44 46
? yes yes ? yes
£ 5 6 7 11 14 15 23
n 2428 2428 18 2022 12 14 12 12 6
yes yes ? yes no yes yes no
form has non-negative coefficients "as far as one can see" . It is however possible
to derive from the assumed knowledge of this theta series sufficiently many
restrictions on the second degree theta series of the lattice in question to succeed
in showing that the corresponding space of Siegel modular forms does not contain
such a series with non-negative coefficients.
Ie = 11 In dimension 80, it is not possible any more to verify by computer the
minimum of an arbitrary lattice (given by a gram matrix, say). Therefore, we
have to distinguish between the construction of an appropriate lattice and the
verification of its minimum. A plausible candidate of an extremal unimodular
lattice in dimension 80 has been given in [57], essentially going back to [42], using
an automorphism of order 79. Two candidates with automorphisms of order 41,
and a unimodular hermitian structure over Z(V-41) have been constructed in
(6). The first construction, using codes, where the minimum could actually be
verified was given by C. Bachoc; see [3]. The method is explained in (2). In this
case, it gives rise to two non-isometric lattices. A different construction relates
one of these to the well-known 20-dimensional extremal 7-modular lattice; see
below.
Ie = 21 A 48-dimensional 2-modular lattice with minimum 8 has been con-
structed by Bachoc in [2], Theorem 6.7. Quebbemann has observed that such
a lattice can be easily obtained by the so-called 'I}-construction from the Leech
lattice with its structure over the Hurwitz order. See the end of [2] for details.
Ie = 31 In [3] an extremal40-dimensional 3-modular lattice is constructed as the
tensor product over Z[a], a = (1 + ..;=7)/2 of the lO-dimensional Z[a]-lattice
mentioned below in the case e = 7, with a maximal order in the quaternion
algebra IQb,oo ramified at the places 3 and 00. A code construction for the same
lattice is also given.
Ie = 51 There is a well known, in fact unique, 12-dimensional even unimodular
lattice with minimum 4 over Z [¥]; see (13). By ordinary transfer, this gives
a 24-dimensional 5-modular lattice with minimum 8.
Ie= 61 Two lattices with minimum 8 in the appropriate 24-dimensional genus
are exhibited in (29) (numbers 16 and 17). In [30] it is shown that they are
strongly modular. Schiemann found a strongly modular lattice with minimum 8
in dimension 28 using hermitian lattices.
Ie = 71 A 10-dimensional self-dual hermitian Z[a)-lattice, a = (1 + ..;=7)/2,
giving rise to a 20-dimensional 7-modular Z-lattice with minimum 8 and auto-
morphism group 2.M22.2 has already been described in the Atlas [11].
Ie =141 A 12-dimensionallattice of determinant 26 76 and minimum 8, invariant
under the group L2(7) x Ds is exhibited in Sect. VII, p. 36 of [40]. Strong
modularity can be verified using the Gram matrix given on p. 64 of [40).
Ie = 151 From the theory of complex reflection groups (9), p. 408, the follow-
ing unimodular hermitian lattice over Z[w, c5], w = -1+/'=3, c5 = ¥with
164 R. Scharlau, R. Schulze-Pillot
minimum 2 is known:
J 3 (5) ~ (
2
-WC5
-WC5
2 -1
-1)
-1 -1 2
1£ = 61 After Sect. 3, only the genera ][2k (28k3-k) with k == 0(4) remain to be
considered. So the character values are X = (--), and the dimension formula
1.4 gives
dimSk(6,--) = ~-l.
4
Dimension 4. Both genera have class number 1, there are no non-zero cusp
forms. The respective unique lattice is necessarily strongly-modular and ex-
tremal.
Dimension 8. The genus ][s(2j5 4) has class number 24, the genus ][S(2;45- 4)
has class number 19, and 8, respectively 3 of the lattices are strongly modular.
In both cases, the space of cusp forms is one-dimensional, so strongly modular
lattices of minimum 4 are extremal. There are 3, respectively 1 such lattices.
Their theta series are [1,0,24,48,216,288,672, ... ], respectively
[1,0,10,120,110,320,520, ... J.
Dimension 12. Both genera are too large to be classified completely.
In the case of ][12(2~5-6), the space of cusp forms is two dimensional, ex-
tremality is definable, and therefore strongly modular lattices of minimum 6
would be extremal. No such lattice is known. Extensive computer search has
produced only two lattices with minimum 6. They are 10-modular and trans-
formed into each other by D2 (or D5). For ][12(28656), the space of cusp forms
is one-dimensional, extremality is definable, and extremal lattices (of minimum
4) exist. In fact, there is an abundance of minimum 4 lattices in this genus, with
various different theta series, therefore it was helpful for the search to calculate
first the extremal modular form
F 6 ,10,-+ = [1,0, 12, 136,444,1872, ... ].
Dimension 16. For the genus ][16 (2~5S), the space of cusp forms is 3-dimensional.
This means that the minimum 8 lattice in this genus given in [32] (see [30] for
strong modularity) is actually extremal. Two further such lattices (with smaller
groups of orders 210 • 3 and 29 • 32 ) have been found by computer search.
In the case of ][16(2;85- S), the space of cusp forms is two dimensional. The
extremal modular form is
F S,1O,-- = [1,0,0,100,770,3968, ... J ,
no lattice with this theta series has been found.
1f = 131 Even 13-elementary lattices of dimension n = 2k and determinant 13 k
exist only in dimensions divisible by 4, and the relevant modular forms belong
to the trivial character. The dimension formula Proposition 1.4 applies.
Dimension 4. We consider the genus ][4(13- 2 ), with character value 0 = -.
The dimension formula Proposition 1.4 gives dim M 2 (13, -) = 0, i.e. there are
no non-zero cusp forms. Thus extremality is definable, and extremal lattices have
minimum 2. The class number of the genus is 1, the unique lattice is of course
modular and has minimum 2.
Dimension 8. We have to consider modular forms of weight 4 and trivial
character X = Xo· The dimension of this space is known from Proposition 1.4:
dim M4 (13, +) = 3. The genus theta series (see [24]) reads
A basis of the space of cusp forms is given by the following two newforms:
C1 = [0,1,-10,12,38,-2,84, ... ]
C2 = [0,0, -2, 1, 10, -2, -20, ... ]
C3 = [0,0,0,-1,-3,0,6, ... ].
In the first case many lattices with minimum 8 are known, but no one with the
desired theta series. In the second case, the largest known minimum in the genus
is 8, and not 10.
Dimension 16. For the plus-genus, one has dx =
5, and there exists indeed a
strongly modular lattice of minimum 12 in this genus, by [32]. No other lattice of
minimum 12 has been found in this genus. For the minus-genus, the dimension
=
is d x 4, but no lattice of minimum 10 could be found.
References
1. C. Bachoc: Voisinage au sens de Kneser pour les reseaux quaternioniens. Comm.
Math. Helvet. 10 (1995), 350-374.
2. C. Bachoc: Applications of coding theory to the construction of modular lattices.
J. Comb. Th. Ser. A 18 (1997), 92-119.
3. C. Bachoc, G. Nebe: Extremal lattices of minimum 8 related to the Mathieu group
M22. J. reine angew. Math. 494 (1998)
4. C. Bachoc, G. Nebe: Classification of two genera of 32-dimensional lattices over
the Hurwitz order. Experimental Math. 6, No.2 (1997), 151-162.
5. E. S. Barnes, G. E. Wall: Some extreme forms defined in terms of Abelian groups.
J. Australian Math. Soc. 1 (1959), 47-63.
6. C. Batut, H.-G. Quebbemann, R. Scharlau: Computations of cyclotomic lattices.
Experimental Math. 4, No.3 (1995), 175-179 .
7. B. Blaschke-Requate: 32-dimensionale gerade unimodulare Gitter mit und ohne
Wurzeln, Dissertation, Bielefeld 1996
8. A. R. Calderbank, N.J.A. Sloane: Double Circulant Codes over Z4 and Even Uni-
modular Lattices. J. Algebraic Combinatorics 6, No.2 (1997), 119-131.
9. A. M. Cohen: Finite complex reflection groups. Ann. Sci. Ecole Norm. Sup. 9
(1976), 379-436.
10. J. H. Conway: A characterisation of Leech's lattice. Invent. Math. 1 (1969), 137-
143, also reprinted as Chap. 12 in CoS193.
11. J. H. Conway et al.: Atlas of finite groups. Oxford University Press, 1985.
12. J. H. Conway, N.J.A. Sloane: Sphere packings, lattices and groups. Springer-Verlag
New York, 1988.
13. P. J. Costello, J. S. Hsia: Even unimodular 12-dimensional quadratic forms over
Q( v'5). Adv. in Math. 64 (1987), 241-278.
14. H. S. M. Coxeter, J. A. Todd: An extreme duodenary form. Canad. J. Math. 5
(1953), 384-392.
15. W. Feit: Some lattices over Q( v' -3). J. Alg. 52 (1978), 248-263.
16. J. S. Hsia, D. C. Hung: Even unimodular 8-dimensional quadratic forms over
Q( v'2). Math. Ann. 283 (1989), 367-374.
17. D. C. Hung: Even positive definite unimodular quadratic forms over Q( v'3). Math.
Compo 51 (1991), 351-368.
18. Y. Kitaoka: Arithmetic of quadratic forms, Cambridge University Press, 1993
19. M. Kneser: Klassenzahlen definiter quadratischer Formen. Arch. Math. 8 (1957)
241 - 250.
20. M. Kneser: Quadratische Formen. Vorlesungs-Ausarbeitung, Gottingen 1973/74.
21. H. Koch, G. Nebe: Extremal even unimodular lattices of rank 32 and related codes,
Math. Nachr. 161 (1993), 309-319.
Extremal Lattices 169
22. H. Koch, B.B. Venkov: Uber ganzzahlige unimodulare euklidische Gitter. J. reine
angew. Math. 398 (1989).1 144-168.
23. H. Koch, B.B. Venkov: Uber gerade unimodulare Gitter der Dimension 32, III.
Math. Nachr. 152 (1991), 191-213.
24. A. Krieg: Modular forms on the Fricke group. Abh. Math. Sem. Univ. Hamburg
65 (1995), 293-299.
25. J. Leech: Some sphere packings in higher space. Can. J. Math. 16 (1964),657-682.
26. C. L. Mallows, A. M. Odlyzko, N.J.A. Sloane: Upper bounds for modular forms,
lattices and codes. J. Alg. 36 (1975), 68-76.
27. J. Milnor, D. Husemiiller: Symmetric bilinear forms, Springer Verlag, Berlin 1973.
28. Y. Mimura: On 2-lattices in an hermitian space. Math. Japonica 21 (1982), 213-
224.
29. G. Nebe: Endliche rationale Matrixgruppen vom Grad 24. Dissertation, Aachen
1995.
30. G. Nebe: The normalizer action and strongly modular lattices. L'Enseignement
Mathematique, 43 (1997), 67-76.
31. G. Nebe: Some cyclo-quaternionic lattices. J. Algebra (to appear).
32. G. Nebe, W. Plesken: Finite rational matrix groups of degree 16. Memoirs of the
A.M.S., Vol. 116 (556), (1995), 74-144.
33. G. Nebe, B.B. Venkov: Non-existence of extremal lattices in certain genera of
modular lattices. J. of Number Theory, 60, No.2 (1996), 310-317.
34. O.T. O'Meara: Introduction to quadratic forms, Springer-Verlag Berlin, 1971.
35. M. Ozeki: Examples of even unimodular extremal lattices of rank 40 and their
Siegel theta-series of degree 2 J. of Number Theory 28 (1988), 119-131.
36. M. Ozeki: Ternary code construction of even unimodular lattices. In: Theorie des
nombres - Number Theory, J.-M. De Koninck et. al. (eds.), Walter de Gruyter,
Berlin, New York 1989, 772-784.
37. H. Petersson: Uber Weierstrafipunkte und die expliziten Darstellungen der auto-
morphen Formen von reeller Dimension. Math. Z. 52 (1949), 32-59.
38. M. Peters: Definite unimodular 48-dimensional quadratic forms. Bull. London
Math. Soc. 15 (1983), 18-20.
39. M. Peters: Siegel theta series of degree 2 of extremal lattices. J. Number Theory
35 (1990), 58-61.
40. W. Plesken, G. Nebe: Finite rational matrix groups. Memoirs of the A.M.S., Vol.
116 (556), (1995), 1-73.
41. W. Plesken, B. Souvignier: Computing isometries of lattices. J. Symbolic Compo
24, no. 3/4 (1997), 327-334.
42. H.-G. Quebbemann: Zur Klassifikation unimodularer Gitter mit Isometrie von
Primzahlordnung. J. reine angew. Math. 326 (1981), 158-170.
43. H.-G. Quebbemann: An application of Siegel's formula over quaternion orders.
Mathematika, 31 (1984), 12-16.
44. H.-G. Quebbemann: A construction of integral lattices. Mathematika 31 (1984),
137-140.
45. H.-G. Quebbemann: Lattices with theta functions for G(v'2) and linear codes. J.
of Algebra 105, No.2 (1987), 443-450.
46. H.-G. Quebbemann: Unimodular lattices with isometries of large prime order II.
Math. Nachr. 156 (1992), 219-224.
47. H.-G. Quebbemann: Modular Lattices in Euclidean Spaces. J. of Number Theory
54 (1995), 190-202.
48. H.-G. Quebbemann: Atkin-Lehner eigenforrns and strongly modular lattices.
L'Enseignement Mathematique 43 (1997), 55-65.
170 R. Scharlau, R. Schulze-Pillot
49. E.M. Rains, N.J.A. Sloane: The shadow theory of modular and unimodular lattices.
Preprint 1998.
50. R. Scharlau: Unimodular lattices over real quadratic fields. Math. Zeitschrift 216
(1994), 437-452.
51. R. Scharlau, B. Blaschke: Reflective integral lattices. J. of Algebra 181 (1996),
934-961
52. R. Scharlau, B. Hemkemeier: Classification of integral lattices with large class
number. To appear in Math. of Comput.
53. R. Scharlau, B. Venkov: The genus of the Barnes-Wall lattice. Commentarii Math.
Helv. 69 (1994), 322-333.
54. R. Scharlau, B. Venkov: The genus of the Coxeter-Todd lattice. Preprint January
1995.
55. M. Schonert e.a.: Groups, Algorithms, and Programming. Lehrstuhl D fiir Mathe-
matik, RWTH Aachen 1997. Version 3, Release 4.4.
56. A. Schiemann: Classification of hermitian forms with the neighbour method.
Preprint, Saarbriicken 1997.
57. R. Schulze-Pillot: Quadratic residue codes and cyclotomic lattices. Archiv Math.
60 (1993), 40-45.
58. G. C. Shephard, J. A. Todd: Finite unitary reflection groups. Canad. J. Math. 6
(1954), 274-304.
59. C. L. Siegel: Berechnung von Zetafunktionen an ganzzahligen Stellen. Nachr. Akad.
Wiss. Gottingen 10 (1969), 87-102.
60. J.-P. Serre: Cours D'Arithmetique, Presses Universitaires de France, Paris 1970.
61. N.-P. Skoruppa: Appendix I: Modular Forms. in: F. Hirzebruch et al.: Manifolds
and Modular Forms, Vieweg Verlag, Braunschweig/Wiesbaden 1992.
62. N.-P. Skoruppa, Don Zagier: Jacobi forms and a certain space of modular forms.
Invent. math. 94 (1988), 113-146.
63. J. Smart: On Weierstrass points in the theory of elliptic modular forms. Math. Z.
94 (1966), 207-218.
64. F. Szewczyk: Modulare Gitter von Primzahlstufe und extremale Modulformen,
Diplomarbeit, Koln 1995.
65. B.B. Venkov: On the classification of integral even unimodular 24-dimensional
quadratic forms. Proc. Steklov Inst. Math. 4 (1980), 63-74, also reprinted as chap-
ter 18 in [12].
On the Real Nullstellensatz
1 Introduction
This paper contains a survey on topics around the Real Nullstellensatz, the real
algebraic geometry counterpart of the classical Nullstellensatz of Hilbert. Section
2 presents the Real Nullstellensatz in the broader setting of more general Real
Stellensiitze. In the sequel we concentrate on the Real Nullstellensatz where
the issues addressed here are much better understood. In section 3 we deal
with complexity issues; section 4 is meant as a survey of algorithms for the real
radical of an ideal with special emphasis on the class of binomial ideals. In the
last section we give an introduction to the problem of fields of definition for the
real radical.
2 Real Stellensatze
In 1927, Artin [1] solved Hilbert's 17th problem in the affirmative, i.e. he
proved that a positive semidefinite polynomial I E JR[X1 , ... , Xn] is always a
sum of squares in the field JR(X1 , •.. ,Xn ) of rational functions. Generalizations
of this result are Stengle's Positivstellensiitze (cr. [24]). In order to state the
following result properly we have to introduce some notation. If h,· .. , Ir E
R[X1 , ••• , Xn] where R is again an arbitrary real closed field we let
Theorem 4 (Wormann, 1996). Let i, h, . .. ,ir E JR[X1 , .•• , Xn) and assume
that the set S(h, ... , ir) is compact. If i is positive on S(h, ... , ir) then i E
T 2 k(h, ... , fr) for every odd natural number k. Here T 2 k(h, ... , ir) is the set
of all finite sums of elements of the iorm q . rr;=l iim, where ml,"" mr are
natural numbers and q is a sum of 2k-th powers of polynomials.
On the Real Nullstellensatz 175
3 Complexity Issues
Complexity issues concerning polynomial ideals were first considered by Her-
mann [11] in 1926. If f, ft, ... , fr E K[X 1 , .•• , Xn] is a system of polynomials
over an arbitrary field K such that f vanishes in all common zeroes of the
polynomials ft, . .. , fr then, by Hilbert's Nullstellensatz, there are t E IN and
ql,"" qr E K[X 1, ... ,Xn] such that
r
ft = Lqik
i=1
and
176 E. Becker, J. Schmid
Brownawell's bounds are quite sharp as the following example shows. Let
It is easy to see that (0, ... ,0) is the only zero of the ideal I. Hence X} vanishes
on V®(I) and thus
n-}
xf = L qi . (xt - Xi+}) + qn . X~
i=}
for certain t E IN,q}, ... ,qn E C[X}, ... ,Xn). Now the substitution Xi t-+
X di - 1 (1 ~ i ~ n) yields
It = Lqdi
i=}
and
(i) t~dn,
for certain t E IN, h}, ... , ht, qi, ... , qr E R[X} , ... , Xn) such that t and the
degrees of the polynomials hi, qi can be bounded by
On the Real Nullstellensatz 177
With the notion of the real radical the Real Nullstellensatz can be restated as
for every ideal I C R[X1, ... , Xn], IR denoting the vanishing ideal in the ring
R[X1, ... ,Xn].
In the recent years, several authors presented algorithms to compute the
usual radical of a polynomial ideal and certain algorithms are implemented. But
so far there is no working implementation of any algorithm for the computation
of the real radical of a general polynomial ideal.
The first algorithm for the computation of the real radical of a polynomial
ideal was presented by Becker and Neuhaus [3] in 1993. Their algorithm is of
double exponential complexity (cf. [17]). The basic idea of this algorithm is to
use the method of localizing, introduced by Gianni, Trager and Zacharias in [10].
This method reduces the general situation of an ideal I C R[X1"'" Xn] to the
univariate case over a field of rational functions R(X1, .. . , Xd), d being the di-
mension of I. This case can be handled by factorization and a positivity test. As
was shown in [17] factorization is really needed in an algorithm for the compu-
tation of the real radical. From the point of view of a practical implementation,
178 E. Becker, J. Schmid
rV'J = n rifF;.
m
i=1
of an ideal I C K[X 1 , ••• ,Xn ) can be computed provided that K allows polyno-
mial factorization and the existential theory Th3(K) is decidable (cf. [21)).
Further algorithms for the computation of the real radical of a polynomial
ideal were presented by Galligo/Vorobjov (1995)[9) and Roy /Vorobjov (1996)
[20). Both algorithms were based on the idea to compute an ideal J such that
V(J) equals the Zariski closure of VR(I) in V R(I) for the given ideal I. Then,
by Hilbert's Nullstellensatz and the real Nullstellensatz
and thus the computation of the real radical of I is reduced to the computation of
a usual radical. As the authors pointed out this reduction is of single exponential
complexity and thus the complexity for the computation of the real radical of
an ideal coincides with the complexity for the computation of the usual radical.
We now turn to a special class of ideals of great interset where much more can
be done. Polynomial ideals allowing a set of generators consisting of binomials
are called binomial ideals. They form a distinguished class of ideals, both from
a theoretical and algorithmical point of view. In the introduction of their paper
[7) Eisenbud and Sturmfels present a very interesting survey of the ubiquity
of binomial ideals. Their results suggested to study radicals of binomial ideals
in a quite general context. This was done by Becker, Grobe and Niermann in
On the Real Nullstellensatz 179
[2]. The authors designed algorithms for the computation of various radicals
of binomial ideals which benefit from the special structure of these ideals. The
essential idea the authors follow is the suggestion from [7] to decompose the
affine variety of the binomial ideal 1 C K[X 1 , ... , Xn] into cells and to study
the cells via Laurent polynomial rings K[Xll, ... ,X;1,X1, ... ,Xr],O::; r::; n.
Given a subset ..1 C {I, ... , n} the ..1-cell of V(I) is
u
Clearly
V(/) =
.:lC{l, ... ,n}
Let now 1 be a binomial ideal generated by a set of binomials {b 1 , ... , b.} and
fix some set ..1 C {I, ... , n}. Let 11"4 : K[Xb"" Xn] -t K[Xili E ..1) be the
K -algebra homomorphism defined by
Xi '""""* Xi if i E ..1
X i '""""* 0 if i 1. ..1'
11
Set 1.1 = 11".1(/) and = I.:lK[Xi,Xi-lli E ..1] the extended ideal in the Laurent
polynomial ring. Further write
A.:l := K[Xi,Xi-lli E ..1]//1.
Then, in a canonical way,
v.:l(/) ~ HomK(A.:l,K)
and hence V.:l(I) depends on the structure of the ideal 11
C K[Xi,X;-lli ELl].
This leads to the study of binomial ideals in Laurent polynomial rings. Such
a binomial ideal 1 C K[Xi,Xi- l ll ::; i ::; n] is fully described by a sublattice
L C LZ n and a character
p:L -+Kx.
This correspondence reduces ideal theoretical operations to matrix operations.
With these methods one can prove:
Theorem 7 ([2], [7]). The real radical of a binomial ideal 1 C 1R.[X1 , ... ,Xnl is
again binomial.
Furthermore, in [2J an algorithm is presented to compute the real radical of
a binomial ideal which computes the real radicals of the 2n ideals 11
and their
intersection. In a first attempt, this latter procedure either needs Grobner ba-
sis computations of non-binomial ideals, requiring the complicated arithmetic of
real algebraic numbers, or a noetherian induction. In [18J this basic algorithm
has been improved by reducing the number of intersections and removing this
induction. Furthermore the arithmetic of coefficients has been reduced to prod-
ucts of pure roots of the input data, and criteria for the a-priori-detection of
trivial or empty cells have greatly improved the practical performance.
180 E. Becker, J. Schmid
5 Fields of Definition
Proof. Let X := (Xl, ... ,Xn). H F is a field of definition for I L[X] then I L[X] =
JL[X] for some ideal J C F[X]. Therefore
set {rl u E Gal(K/K)} is finite and all its elements are contained in I. Denote
these elements by f1, . .. , f m' Then
m m
for certain go, ... , 9m-1 E I, 9m = 1. Furthermore gl, ... , gm are fixed under the
action of Gal(K / K) and thus gl,"" gm E In K[Xlo ... , Xn] = J. Therefore
m-1
fm = - L gd i E JK[X1"",Xn]'
i=O
Lemma 3. Let K be a perfect field, K its algebraic closure and assume that I
is a radical ideal of K[X1, ... , X n ]. Let further U := {u E Gal(K / K) I I" = I }
and denote the fixed field of U by F. Then
(i) U is a closed subgroup of finite index of Gal(K / K),
(ii) F is the smallest field of definition for I which contains K.
Proof. Obviously U C Gal(K / F). Now suppose that L is a field of definition for
I such that K C L c K. Then Lemma 1 yields I" = I for all u E Gal (K / L )
and thus Gal(K/L) cUe Gal(K/F). Thus F C L. We can choose a field L
of definition for I such that L / K is finite. Then Gal( K / L) and hence U are of
finite index in Gal(K / K). Consequently U is open w.r.t. the Krull topology on
Gal(K / K). Since U is of finite index it is also closed. Now infinite Galois theory
yields Gal(K / F) = U and thus F is a field of definition for I by Lemma 2. 0
Lemma 4. Let K be a subfield of IR and denote the relative algebraic closure
of Kin IR by R. Let furthermore I be an ideal of K[X 1, ... ,Xn]. Then:
(i) A polynomial f E IR[X 1, ... ,Xn vanishes on VIR(I) iff it vanishes on V R(I)
(ii) 'yfIIR[X1, ... ,Xn ] = re IR[X1, ... ,Xn ]IR[X 1, ... ,Xn ]
(iii) VlR(I) = V R(I) w.r.t. the IR-Zariski topology on IRn.
Prool. By the corollary of Chap. I, §5 in [12] we obtain that R is real closed. Let
f E IR[Xl,,,,,Xn]' If I vanishes on VIR(I) it also vanishes on VR(I) because
V R(I) C VIR(I). Now assume that I vanishes on V R(I). We choose an R-basis
(ai)iEA for IR. Then (ai)iEA is also an R[X1, ... ,Xn]-basis for IR[X 1, ... ,Xn ]
and therefore I admits a representation
for some finite set AI C A and certain J; E R[X1, ... , X n ]. Since f vanishes on
V R(I) all the polynomials J; must also vanish on V R(I). By model completeness
182 E. Becker, J. Schmid
of the theory of real closed fields the polynomials h also vanish on VIR.(I) and
thus also J vanishes on this set of zeroes. This proves (i) and (ii). Finally (iii) is
just a reformulation of (i). 0
(1) There exists a smallest field of definition for ry'IIR[Xl , ... , Xn]. This field
is a finite algebraic extension of K contained in R.
(2) An intermediate field L of R/ K is a field of definition for ry'IIR[X l , ... , Xn]
iffVR(I) is Gal(K/L)-invariant.
Proof. Let J:= ry'IR[Xl , ... ,Xn]. Then, by Lemma 1 and Lemma4ii) an inter-
mediate field L of the extension R/ K is a field of definition for ry' JIR[X l , ... , Xn]
iff it is a field of definition for JK[Xl'" "Xn' As JK[Xl , ... ,Xn] is a radical
ideal L is a field of definition for re IIR[Xl,,,,,Xn] iff Vf(J) is Gal(K/L)-
invariant by Lemma 2. Since J is a real ideal of R[Xl , . .. , Xn] we get VR(J) =
V f(J). Furthermore VR(I) = VR(J) by the Real Nullstellensatz. This proves
(2). In order to prove (1) let
and denote its fixed field by L. Obviously Gal(K / R) c U and thus L C R. Then,
by Lemma 3, L is the smallest field of definition for JK Xl"'" Xn], finite over
K and also the smallest field of definition for r. IIR[Xl. ... , Xn]. This proves
the theorem. 0
Proof. If J has only real zeroes, then ry' JIR[X] = JIR[X] and hence K is a
field of definition for ry' fIR[X]. Now assume that K is a field of definition for
ry' JIR[X] and let R be the relative algebraic closure of Kin IR. Then VIR.(f) =
V R(f) is Gal(K/K)-invariant. However, V R is a finite set and thus it is Zariski-
closed. Thus the set of real zeroes of J is Gal(K / K)-invariant. If V R(f) = 0 then
ry' JR[X] = (1) and ry' JIR[X] = (1) is defined over K. Assume VR(f) f:. 0. Since
any two zeroes of the irreducible polynomial J are conjugate over K we obtain
that all zeroes are real. 0
Finally we want to give an estimate how big a field of definition for a real
radical of a principal ideal can be. To do this, we first need a lemma.
Proof. Let g(X) = :E~=o aiXi E ~[X] be of degree n with exactly k real roots
but without any multiple root. Since n - k is even such a polynomial exists.
Let aI, ... ,an E <Q be the zeroes of g. To 9 we assign the following symmetric
(n x n)-matrix
Tg = (ti+i-2kj,
where tk = :E~=1 a~. The entries of Tg are symmetric functions in aI, ... , an and
hence they are polynomial expressions in the coefficients of g. Therefore they are
rational numbers and, moreover, they depend continuously on the coefficients of
g. By Pg E ~[X]let us denote the characteristic polynomial of Tg. Since Tg is
symmetric Pg has only real roots. By ([12], Chap. I, §7) Tg is regular, i.e. a = 0
is not a zero of Pg and sign(Tg) = k. Since
2n - 1
[K:~]>-I·
- n+
n) + (n)
(j-l j+l = (n-l)
j-2 + (n-l)
j-l + (n-l)
i + (n-l)
j+l
and hence
(n)k -> n+l
2n - 1
.
184 E. Becker, J. Schmid
for certain c E <Q', 01, ... ,On E L. We may assume that 01, ... ,Ok are the real
roots of J. Let
g(X) = I1~=1 (X - Oi),
[L : F] ::; (n - k)!,
References
1. Artin, E.: Uber die Zerlegung definiter Funktionen in Quadrate, Abh. Math. Sem.
Univ. Hamburg 5 (1927), 100-115
2. Becker, E., Grobe, R., Niermann, M.: Radicals of binomial ideals, Journal of Pure
and Applied Algebra 117 & 118 (1997), 41-75
3. Becker, E., Neuhaus, R.: Computation of real radicals of polynomial ideals, in:
Computational Algebraic Geometry, Progress in Math., Vol. 109, Birkhauser 1993,
1-20
4. Brownawell, D.: Bounds for the degrees in the Nullstellensatz, Ann. Math. 126
(1987), 577-592
5. Conti, P., Traverso, C.: Practical computation of the real radical, preprint 1998
On the Real Nullstellensatz 185
6. Dubois, D.W.: A nullstellensatz for ordered fields, Arkiv for Mat. 8 (1969), 111-
114
7. Eisenbud, D., Sturmfels, B.: Binomial ideals, Duke Math. J. 84 (1996), 1-45
8. Fried, M.D., Jarden, M.: Field Arithmetic, Ergebnisse der Mathematik, Vol. 11,
Springer 1986
9. Galligo, A., Vorobjov, N.: Complexity of finding irreducible components of a semi-
algebraic set, J. Complexity 11 (1995), 174-193
10. Gianni, P., Trager, B. Zacharias, G.: Grobner bases and primary decomposition
of polynomial ideals, in: Computational aspects of commutative algebra, Academic
press, 15-33
11. Hermann, G.: Die Frage der endlich vielen Schritte in der Theorie der Polynom-
ideale, Math. Ann. 95 (1926), 736-788
12. Knebusch, M., Scheiderer, C.: Einfiihrung in die reelle Algebra, Vieweg 1989
13. Kollar, J.: Sharp effective Nullstellensatz, Journal Amer. Math. Soc. 1 (1988),
963 - 975
14. Krivine, J.L.: Anneaux preordonnes, Journal d'analyse mathematique 12 (1964),
307-326
15. Lang, S.: Diophantine Geometry, Interscience, New York 1962
16. Lombardi, H.: Une borne sur les degres pour Ie theoreme des zeros reels effectif,
in: Real Algebraic Geometry, Lecture Notes in Math. 1524, Springer 1992, 323-345
17. Neuhaus, R.: Computation of real radicals of polynomial ideals, Part II, J. Pure
Appl. Algebra, to appear
18. Niermann, M.: Beitrage zur konstruktiven Idealtheorie, Thesis, Dortmund 1998
19. Risler, J.J.: Une caracterisation des varietes algebriques reelles, C.R.A.S. Paris
271 (1970), 1171-1173
20. Roy, M.F., Vorobjov, N.: Computing the complexification of a semialgebraic set,
Proc. of Int. Symposium on Symbolic an Algebraic Computation (1996), 26-34
21. Sander, T.: Effektive algebraische Geometrie iiber nicht algebraisch abgeschlosse-
nen Korpern, Thesis, Dortmund 1996
22. Schmid, J.: On the complexity of the real Nullstellensatz in the O-dimensional
case, to appear in: Journal of pure and applied Algebra
23. Schmiidgen, K.: The K-moment problem for compact semi-algebraic sets, Math.
Ann. 289 (1991), 203-206
24. Stengle, G.: A Nullstellensatz and a Positivstellensatz in Semialgebraic Geometry,
Math. Ann. 207 (1974), 87-97
25. Stengle,G.: Complexity Estimates for the Schmiidgen Positivstellensatz, J. Com-
plexity 12 (1996), 167-174
26. Wormann, T.: Short algebraic proofs of theorems of Schmiidgen and P6lya,
preprint 1996
27. Wiirmann, T.: Strikt positive Polynome in der semialgebraischen Geometrie,
Thesis, Dortmund 1998
Primary Decomposition: Algorithms and
Comparisons
1 Introduction
pIes using the computer algebra system SINGULAR [10]. Since primary decom-
position is a fairly complicated task, it is, therefore, best explained by dividing it
into several subtasks, in particular, while sometimes only one of these subtasks
is needed in practice. The paper is organized in such a way that we consider the
subtasks separately and present the different approaches of the above-mentioned
authors, with several tricks and improvements incorporated. Some of these im-
provements and the combination of certain steps from the different algorithms
are essential for improving the practical performance.
Section 2 contains the algorithms. After explaining some important splitting
tools, we explain two different approaches for computing the radical of I respec-
tively the radical of the equidimensional hull. In Subsection 2.2 we present two
algorithms for computing the equidimensional hull itself and a weak, that is, up
to radical, decomposition of the equidimensional hull.
The algorithms of [14] and [8] both reduce the general problem to pri-
mary decomposition of zero-dimensional ideals. We, therefore, consider the 0-
dimensional case, together with some theoretical background, in Subsection 2.3.
In Subsection 2.4 we describe the three algorithms of [14], [8] and [25] for
the general case, together with an algorithm to compute the minimal associ-
ated primes of I. The algorithm of [8] uses the normalization of K[Xl, . .. ,xn]1I
and we present a new algorithm ([15]), based on a criterion of Grauert and
Remmert, in Subsection 2.5. Another algorithm for computing the minimal as-
sociated primes is based on characteristic sets and this is presented, together
with some basic facts about characteristic sets, in Subsection 2.6.
Section 3 is devoted to the examples and comparisons of the different ap-
proaches. The examples were taken from a still larger list and they demonstrate
our present knowledge about the relative performance. Our table on the last
page shows that a general best strategy does not exist. Generally speaking, the
characteristic set method has problems if the examples require too many fac-
torizations over extension fields, while [14] has problems if the examples require
going to general position by a random coordinate change. So far, we can only
recommend a combination of the different subalgorithms, depending on the ex-
ample. In contrast to the opinion of some authors, our experience is that one
should use factorization as often as possible, since usually the Grobner bases
computations are the hardest part. This is, in particular, true for the algorithm
computing the minimal primes, where we use the factorizing Grobner, but also
there are exceptions. We are aware of the fact that comparison of algorithms
by examples is certainly affected by the choice of the examples and by tricky
implementation features. On the other hand, the present paper appears to be
the first systematic comparison of the four, so far, most important algorithms
under equal conditions.
All algorithms presented in this paper are, or are about to be, implemented in
SINGULAR with options for the user to combine his own favourite subalgorithms
and are available in the library primdec.lib and distributed with the programme
(d. [10]).
Primary Decomposition: Algorithms and Comparisons 189
2 The Algorithms
In this section, K is a field, R = K[X1' ... ,x n ], and I ~ R is an ideal.
Our aim is to explain how to compute several decompositions of I, its radical
VI, and the normalization of the factor ring R/ I. Our main tools are Gri:ibner
bases, but, for a complete primary decomposition, we also need multivariate
polynomial factorization. All algorithms presented in this note are, or are about
to be, implemented in SINGULAR.
If I = .n
<=1
Qi is a minimal primary decomposition (that is, r is minimal)
with associated primes Pi = ..fiJi, then we write Ev(I) := n Qi for
codim(Q;)=v
the equidimensional part of I of codimension v (if codim(Qi) =1= v for all i let
Ev(I) = R).
We are interested in solving the following problems:
Remark 1. Our experience shows that in all algorithms one should use Lemma
1 to split the ideal as often as possible.
Then for u E XI the extension IK(u)[x '-. u] is zero-dimensional, and the con-
traction IK(u)[x '-. u] n K[x] is equidimensional of dimension dim(l).
Instead of computing XI it is much easier to compute the (possibly proper)
subset X/ := XL(I), where < is a given admissible term-ordering on K[x], and
L(I) is the corresponding leading or initial ideal of I.
2.1 Radicals
Theorem 1. Let A be a local Artinian K -algebra with maximal ideal MA. Then
A is a complete intersection if and only if (0) : Jo(A) = MA.
- II := EQUIRADICAL(I);
- compute N such that I : It' = I : If+l;
- return lIn RADICAL(I : It').
The main drawback of Algorithms 2 and 3 is the computation of regular
sequences via random linear combinations. A second approach of Eisenbud,
Huneke, and Vasconcelos ([8]) avoids the computation of regular sequences. This
is based on
Primary Decomposition: Algorithms and Comparisons 193
Algorithm 4.
EQUIRADICAL(1)
Input: an ideal I in K[Xl' ... , xnJ
Output: the radical of the equidimensional hull of I
Algorithm 5.
EQUIDIMENSIONAL(I)
Input: an ideal I in K[Xl' ... ,xnl
Output: two ideals, the equidimensional hull Ec(I) of I (c = codimI), and an
ideal W of codimension > c such that 1= Ec(1) n W
- compute Xf j
- Result := (1), W := Ij
- for u E Xf do
• compute a Grabner basis S of W with respect to a block-ordering <
with u < < x , Uj
• choose T, a subset of S, which is a minimal Grabner basis of W K(u)[x,
u) and compute the least common multiple h E K[u) of the leading
coefficients of the elements in T as in Lemma 2j
• Result := Result n((T) : hOC) in K[x)j
• W:= (W,h)j
• if dim(W) < dim(1), then
return {Result, W}j
- Result = Result n EQUIDIMENSIONAL(W)[l)j
- return {Result, EQUIDIMENSIONAL(W)[2)}.
Ec(1) = 10 : (10 : 1) .
3. For v ~ c
JEv(I) = VEv(ann(Ext'R(R/I,R)))
Primary Decomposition: Algorithms and Comparisons 195
with fi E K[xn].
Remark 6. Notice that automatically in is irreducible and deg Ii < deg in,
i < n.
Every Q = (al, ... , an-I) Kn-l defines an automorphism <{is!. of
E
n-l
K[XI' ... ,xn] by «iQ.(Xi) = Xi if i < n, and «iQ.(xn) = Xn + L: aiXi.
i=l
Proposition 4. Let P C K[XI, ... , xn] be a maximal ideal. Then there exists
a dense open subset U C Kn-l such that every «iQ.(P), Q E U, is in general
position with respect to the lexicographical ordering induced from Xl > ... > Xn .
Algorithm 8.
ZEROPRIMDEC(I [, CHECK])
Input: a zero-dimensional ideal in K[Xl' ... ,Xn ]
Output: {Ql,Pl, ... ,Q.,p.}, Qi primary, -.fiJi = Pi, Pi f= P j for i f= j and
1= Ql n··· n Q.
# The ideal CHECK and all commands involving CHECK are optional;
# CHECK is needed later on for the higher dimensional decomposition
# in order to avoid redundant components.
- Result := 0;
- [if CHECK £;; I, then
return Result;]
- compute a Grobner basis G of I with respect to the lexicographical ordering
induced from Xl > ... > Xn;
- let G n K[xn] = {f};
- factorize f = J{' ..... ff' ;
- for k := 1 to s do
• [if CHECK ~ (I,ff'), then]
test whether (I, ff') is primary and in general position, that is, com-
pute a Grobner basis S of (I, ff') with respect to the lexicographical
ordering induced from Xl > .. , > X n , and check whether S contains
(k) (k)
hI , ... , hn such that
1 • h n(k) -- f kP'
2. hi(k) -_ (X.. - gi(k) ( Xn ))n~') mod (hi+l'
(k) (k»)..
... , hn ,z < n,
if (I, 1:') is primary and in general position, then
* rk:= (Xlk
D
- gl) , ... ,Xn-l - (k)
gn-ll . t ed prIme
k IS the assocla
f)' . to
Qk := (I, ff');
* Result := Result U{Qk, Pk};
else
* choose Q E Kn-l by random;
* Result:=ResultU<p~l (ZEROPRIMDEC(<Pa«(I, ff'))[,<Pa(CHECK)]))
- return Result.
to split the ideal as often as possible before starting Algorithm 8 (if in 2. the
condition (I, g) = (1) is not fulfilled, we still can apply 1. to a suitable power of
f). In order to use 1. and 2., we produce as many reducible elements as possible.
This leads to the following preprocessing algorithm.
Algorithm 9.
SPLIT(I)
Input: a zero-dimensional ideal I in K[XI" .. , xnJ
Output: two sets of ideals, Primary = {QI,Pl, ... ,Qs,Ps }, and Rest
{h, ... , Id, such that I = (nQi) n (nIi), Qi primary, and v'lJi = Pi
- Primary := 0, Rest := 0 ;
- for i := 1 to n do
• compute (Fi) := In K[XiJ;
• enlarge the system of generators of I by Fi;
- factorize all the generators of I and split the ideal and the resulting ideals
as often as possible;
- compute for all ideals obtained in this way a Grabner basis with respect to
the lexicographical ordering induced from Xl > ... > Xn;
- test whether the ideals are primary and in general position with respect to
the lexicographical ordering induced from Xl > ... > Xn; put the detected
primary ideals and their associated primes to Primary and the other ideals
to Rest;
- return Primary, Rest.
Remark 8. Each ideal in Rest comes with a set of generators (which in fact
is a Grabner basis with respect to the lexicographical ordering induced from
Xl > ... > xn) such that every generator is a power of an irreducible element.
= ..)(I,f)nJV:g) ,
")(I,/·g)
which holds without the assumption (I, g) = (1). In particular, we can use the
factorizing Grabner basis algorithm to split the ideal. Also the prime test for a
zero-dimensional ideal is simpler than the primary test:
I is prime if there is an irreducible gEl n K[XiJ for some i such that deg(g) =
dimK K[xl, ... , xnJI I.
Especially, we obtain:
I is prime and in general position with respect to the lexicographical ordering
induced from Xl > ... > Xn if and only if for a corresponding minimal Grabner
basis G, and {g} = GnK[xnJ, we have deg(g) = dimKK[xI, ... ,xnJII, and 9
is irreducible.
The following probabilistic algorithm, proposed by Eisenbud, Huneke, and
Vasconcelos ([8]), also goes to general position.
Primary Decomposition: Algorithms and Comparisons 199
Algorithm 10.
DECOMPEHV(I)
Input: a zero-dimensional radical ideal I in K[Xl' ... , xnl
Output: the associated prime ideals
Algorithm 11.
MINAssPRIMES(I)
Input: an ideal I in K[Xl, . .. ,xnl
Output: the minimal associated prime ideals of I
- Result := 0;
- choose any admissible term order < on K[Xl, ... , xnl;
200 W. Decker, G.-M. Greuel, G. Pfister
A third possibility, also starting not necessarily with a radical ideal, is based
on characteristic sets. We will treat this approach later.
Q = EQUIDIMENSIONAL(I + pm)
for some m. In this case, it is more difficult to estimate m (cf. [8]): let I[F] =
{b E R I I: b rt. Pl. Then Q is a P-primary ideal of a decomposition of I if and
only ifthe map (I[P] : POO)/I[p]) ----t R/Q is injective.
The Algorithm of Eisenbud, Huneke, and Vasconcelos
Algorithm 12.
PRIMARYDECEHV (1)
Input: an ideal I in R = K[Xl, ... , x n ]
Output: a set Result = {Ql,P1, ... ,Q.,p.} such that I = nQ" is a minimal
primary decomposition and ~ = P", v = 1 ... s.
- Result := 0j
- if CHECK is not defined, then
CHECK:=(I)j
- choose any admissible term-ordering < on K(Xl,"" xn]j
- if CHECK ~ I, then
return Resultj
- compute Xf j
- for u E Xf do
• m:= ZEROPRIMDEC(IK(u)(x" u], CHECK)j
• Result := Result U{Q n K(x],P n K(x] I (Q,P) E m}j
• compute h such that IK(u)(x" u] n K(x] = I : h = I : h 2 j
• 1:= (I,h)j
• for (Q, P) E m do
CHECK = CHECK n Qj
- Result = Result U PRIMARYDECGTZ(I, CHECK)j
- return Result.
1. Q=QnJ;
2. codimJ > codim(Q).
Algorithm 14.
PSEUDOPRIMARYDECOMP(!)
Input: an ideal I in K[XI' ... , xn)
Output: a set Result = {(QI,PI,/I), ... ,(Qr,Pr,ir),J} with Qi, Pi, /;, and J
as in Lemma 4
Algorithm 15.
EXTRACTION ( Q)
Input: a pseudo-primary ideal Q in K[Xl, ... , xnl, and P = .JQ
Output: (Q, J) as in Lemma 5
Lemma 6. Let I be an ideal in K[Xl, ... , xnj, and let T be a decomposition tree
of I. Then:
1. In Tred all component vertices have distinct associated primes.
204 W. Decker, G.-M. Greuel, G. Pfister
Algorithm 16.
PRIMARYDECSY(I)
Input: an ideal I in K[Xl' ... , xnl
Output: a set Result = {Q 1 , H, ... , Q., p.} such that I = nQ v is a minimal
primary decomposition, and -vr:Tv = Pv , v = 1 ... s.
- Result := 0, V := {I}j
- L:= {1}j
- w:=Oj
- while I ~ L do
• if {V E V I weighted tree depth of V = w} is empty, then
w:=w+lj
• choose a vertex V E V of weighted tree depth Wj
• V:= V\ {V}j
• W := {sons of V} (apply either PSEUDOPRIMARYDECOMP(V), or
EXTRACTION(V}}j
• if there is a component vertex Q E W, then
* if L rt. Q, then
Result: =Result U {(Q, P)}, where P is the radical of Q, which
is known from a pseudo-primary decomposition beforej
L:= LnQj
* W:= W\ {Q}j
• V:= V U {V E W I V satifies the seperating condition}j
- return Result.
Remark 12. Let Rand J be as in the proposition, and let i be a non-zero divisor
of J. Then
1. iJ: J = i· HomR(J,J),
and, consequently,
'D --+
r
..2-
• f
is surjective, and its kernel is the ideal generated by the elements of type
8 .. 8 8
T;Tj - E f.~JTk (with To = 1), and E 'fJkTk, where E 'fJkik = o.
k=O k=O k=O
Now we are prepared to give the normalization algorithm:
Algorithm 17.
NORMAL(I [, INFORM])
Input: a radical ideal I in K[X1' ... 'Xnl
Output: r polynomial rings R 1, ... ,Rr , r prime ideals It C R 1, ... ,Ir eRr,
and r maps 11"; : R ~ R;, such that the induced map 11" : K[X1, ... , xnll I ~
RtfIt x ... x Rrl Ir is the normalization of K[X1, . .. , xnll I
- Result := 0;
- compute the idempotents of K[X1' ... ,xn]II;
this is optional; the splitting
- for i := 1 to t do
• compute the singular locus J of h
• choose I E J ..... Ii and compute Ii : I to check whether I is a zero divisor
mod Ii;
• if Ii : I ~ Ii, then
Result := Result U NORMAL(Ii : (I; : f))U NORMAL(I; : f);
(notice that ..j (I;, I) = Ii : (Ii: f) in this situation;)
else
if Ii has an isolated singularity at 0 E Kn, then
J:= (Xl, ..• ,xn );
else
if J o is the radical of the singular locus of a normalization
loop before, given by the list INFORM, then
J := ..j(Ii' I + Jo);
else
J:= ..j(Ii,j);
• compute H := I J : J =: (I, II,·· . ,Is);
• if H = (I), then
Result := Result U{K[XI' . .. ,Xn],Ii, idK[xl, ... ,x n ] }
else
* compute, as described in Remark 33, an ideal L such that
1;
K[XI, ... , Xn , T I , ... , Ts]1 L ~ Hom(J, J), T i ."..
* S :=NORMAL(L);
* let t : K[XI' ... ,xn] -+ K[XI' ... ,Xn, TI , ... ,Ts] be the inclusion;
* replace S by S with all ring maps composed with t;
* Result := Result uS;
- return Result.
Algorithm 18.
IDEMPOTENTS(I)
Input: I S;; K[XI, ... ,xn] a (weighted) homogeneous radical ideal, deg(xt} =
... = deg(xk) = 0, deg(xi) > 0 for i > k, In K[XI' .. . , Xk] zero-dimensional.
Primary Decomposition: Algorithms and Comparisons 207
Output: ideals h, ... , It such that K[XI, ... , xnJI I = K[Xl, ... , xnJI II X ••• X
K[XI, ... ,xnJlIt, and such that InK[XI, ... ,XkJ = n(Iv nK[Xl, ... ,Xkj) is the
prime decomposition
- Result := 0;
- J:= In K[XI' ... ,Xk);
- compute the (zero-dimensional) prime decomposition J = PI n ... n Pt ;
- for i := 1 to t do
• choose 9i -# 0 in n Pv ;
v#i
• Result := Result u{I : gil;
- return Result.
The concept of characteristic sets goes back to Ritt ([20), [21]) and Wu (30). In
our context, when applying this concept, the basic strategy is the following.
Let X be a finite set of generators for the given ideal I C K[xI, ... , xn).
Compute a characteristic set of X. Successively extend this characteristic set via
pseudo-division. Split the radical of I with the help of the extended characteristic
set F. Distinguish two different types of splitting, depending on whether F is
irreducible (then F corresponds to a prime ideal) or not. When applying the
above idea recursively, the prime ideals corresponding to irreducible extended
characteristic sets provide a not necessarily minimal prime decomposition of VI.
Let us be more precise and recall the basic definitions and facts. We refer to
[5], [18), and [29J for details and proofs.
-< is well-founded, that is, every non-empty subset of K[Xl, ... ,xnl has a mini-
mal element. -< is, however, not a total ordering.
208 W. Decker, G.-M. Greuel, G. Pfister
Definition 6. Let I,g E K[Xl, ... ,XnJ. I is said to be 01 lower mnk than gil
I --( g. I and 9 are said to be 01 the same mnk, I ~ g, il neither I --( 9 nor
9 --( I, that is, class(l) = class(g) and cdeg(l) = cdeg(g).
1. If I f:. 0, 9 are polynomials in K[Xl, ... , xn], with class(J) = k, then pseudo-
division yields an expression
Algorithm 19.
CHARSET(I)
Input: a finite subset X of K[Xl' ... ,xnl '- {O}
Output: a characteristic set of X
- Result := 0, Rest := X j
- while Rest =1= 0 do
• choose f of lowest rank in Restj
• Result := Result U {J}j
• if class(1) = 0, then
Rest := 0j
else
Rest:= {g E Rest '- {J} I 9 is Ritt-Wu reduced with respect to J}j
- return Result.
Definition 9. Let X be any finite subset of K[Xl,.'" xnl \ {O}, and I = (X)
the ideal genemted by X. An ascending set :F = {It, ... , fr} c I is called an
extended chamcteristic set of X, if either
1. r = 1 and It is constant, or
2. class(ft} > 0 and prem(gl:F) = 0 for all 9 EX.
The existence of extended characteristic sets is clear from the following algo-
rithm (Ritt- Wu process) for the computation of an extended characteristic set.
Since -« is a well-founded ordering, the termination of this and the subsequent
algorithms is guaranteed by
Remark 16. Let X be any non-empty subset of K[Xl,"" xnl \ {O}, :F a char-
acteristic set of X, and 9 E K[Xl,"" xnl \ {O} Ritt-Wu reduced with respect to
:F. Then {} -« :F for every characteristic set {} of Xu {g}.
Algorithm 20.
EXTCHARSET(I)
Input: a finite subset X of K[Xl' ... , xnl \ {O}
Output: an extended characteristic set of X
- Int := Rest := Xj
- while Rest =1= 0 do
• Result := CHARSET(lnt)j
• if Result = {J} with f E K, then
Rest := 0j
else
Rest := {prem(gl Result) =1= 0 I 9 E Int \ Result }j
• Int := Int U Restj
- return Result.
Definition 10. Let F = {II, ... , fr} C K[X1, ... , xnl be an ascending set, and
let m = n-r. After renaming the variables we may assume that class(fi) = Xm+i,
i = 1, ... , r. With this assumption F is called irreducible, if each h is irreducible
in Ki[xm+il, where Ki is inductively defined by Kl := K(X1, . .. , xm), and Ki :=
Ki- 1 [xm+i-d/ (h-l).
Proposition 7. LetF= {1I, ... ,fr} C K[x1, ... ,xnl be an irreducible ascend-
ing set. Then
P:= {g E K[X1,'" ,xnll prem(gllI, ... , fr) = O}
is a prime ideal with F as a characteristic set.
It follows from the pseudo-remainder formula in Remark 38, 2. that P can
be computed via Grobner bases:
Lemma 7. Let F = {h, ... , fr} C K[X1,"" xnl be an irreducible ascending
set, J = (F) the ideal generated by F, and P the prime ideal with characteristic
set F as in Proposition 44. Then
P = (... ((J : In(ft}OO) : In(h)OO) : ... ) : In(fr)OO
Now we come to the two different types of splitting.
Lemma 8. Let X be any finite subset of K[X1,'" ,xnl \ {O}, I = (X) the ideal
generated by X and F = {h, ... , fr} an extended characteristic set of X. Sup-
pose that F is irreducible, and let P be the prime ideal with characteristic set F
as in Proposition 44. Then
Vi = P n J(X U {In(h)}) n··· n J(X U {InUr)})
The following remark allows the application of Lemma l.
Remark 17. Let F = {h, ... , fr} C K[X1, ... , xnl be a reducible ascending
set. Assume that the variables are ordered as in Definition 43 with m = n -
r.Choose i minimal with {h, ... , h} reducible. Let fi = iif' ..... ii~· be the
factorization of h into irreducible factors over K i • Then there is a relation of
type 9 = Gh - hf' ..... h~' in K[Xl,'" ,xnl, where hj is obtained from ii j by
clearing denominators, and where G E K[Xl, ... ,xml. Then g, considered as a
polynomial in Ki-l[x m+il, is zero. The irreducibility of {h, ... , h-I} implies
that prem(glh, ... ,h-l) = O. Hence there exist 81, •.. ,8i-l such that InUt}·, .
.... In(h_t}·'-'hf'·····h~· E (h,···,/;)· Definegj :=prem(hjlh, ... ,fi-d,
j = 1, ... , s. Then all gj are Ritt-Wu reduced with respect to F, class(gj) =
c1ass(fi), and In(fd'" ... ·In(h-l)·'-'gf'· .... g~. E (h,···,/;)·
Altogether we obtain the
Algorithm 21.
MINAssPRIMESCHARSETS(I)
Input: an ideal I in K[Xl, ... , xnl
Output: the minimal associated primes of I
Primary Decomposition: Algorithms and Comparisons 211
3 Examples
All algorithms described in Section 2 are, or are about to be, implemented in
SINGULAR.
In this Section we compare the implementations. In the table below we give
the timings (in seconds) for 34 examples computed on a HP 720. "*" means that
the computation was stopped after three hours. All computations are done over
the prime field K = IF32oo3. The ordering of the monomials is always the degree
reverse lexicographical ordering with the underlying ordering of the alphabet.
In the first column we give the timings for the computation of the minimal
associated primes via characteristic sets (Algorithm 22). In the second column,
we list the timings for the computation of the associated primes by first us-
ing Algorithm 7 (WEAKEQUIDlMENSIONAL), followed by a prime decomposition
of the equidimensional parts via Algorithm 11. The third column gives the
timings for the minimal associated primes by using Algorithm 11. The fourth
column contains the timings for a complete primary decomposition following
Gianni, Trager, and Zacharias (Algorithm 13). The fifth column gives the tim-
ings for the primary decomposition by using the Algorithm 16 of Shimoyama
and Yokoyama and computing the minimal associated primes via characteristic
sets. In this column "*, 46" means that the characteristic sets algorithm could
212 W. Decker, G.-M. Greuel, G. Pfister
not compute the minimal associated primes, and that we need 46 seconds with
Algorithm 16, if we first compute the minimal associated primes via Algorithm
11. In all other cases, the timings for Algorithm 16, based on Algorithm 11, can
be obtained by subtracting the first column from the fifth column and adding
the third column. The next two columns give the timings of the computation of
the equidimensional radical and the radical by using a combination of the Algo-
rithms 1, 2, and 3 (we use 2 and 3, if the first c generators of a c-codimensional
ideal already form a regular sequence, and if the number of variables is less or
equal to 5). Column 8 contains the information on the number and the dimension
of the components. 3, ... ,3,0, for instance, means 15 components of dimension 3
'----v---"
15
and one component of dimension O. Column 9 indicates, whether the given ideal
is already radical or not. Finally, column 10 contains the number of embedded
components.
The examples show that there is no unique strategy for the computation
of primary decompositions. Sometimes much more time is used for computing
the radical or the minimal associated primes than for the complete primary
decomposition a la Gianni, Trager, and Zacharias. The reason for this is the use
of the factorizing Buchberger algorithm, which is usually very efficient (in a few
cases, however, it can be quite time-consuming).
1. Chemistry (describes a chemical processes in glass melting)
y2 Z + 2xyt - 2x - z,
_x 3 Z + 4xy2 Z + 4x 2yt + 2y 3t + 4x 2 - 10y2 + 4xz - lOyt + 2,
2yzt + xt2 - x - 2z,
-xz 3 + 4yz 2t + 4xzt2 + 2yt 3 + 4xz + 4Z2 - 10yt - 10t2 + 2.
Primary Decomposition: Algorithms and Comparisons 215
_k 9 + 9k 8l- 36k7l 2 +84k6l 3 -126k 5l 4 + 126k4l 5 -84k 3l 6 + 36k 2z7 - 9kl 8 +l9,
-bk8 + 8bel + k 8l- 28bk6l 2 - 8el 2 + 56bk 5l 3 + 28k6 l 3 -70bk 4l 4 - 56k 5l 4 +
56bk 3l 5 + 70k 4l 5 - 28bk 2l 6 - 56k 3l 6 + 8bkl 7 + 28k 2l 7 - bl8 - 8kl 8 + 19,
ce - 7ck 6l - k7l + 21ck 5l 2 + 7k 6 l 2 - 35ck413 - 21k 5l 3 + 35ck 3l 4 + 35k4l 4 -
21ck 2l 5 - 35k3l 5 + 7ckl 6 + 21k 2l 6 - cl 7 - 7kl7 + l8,
-dk 6 + 6dk 5l + k 6l-15dk 4l 2 - 6k 5l 2 + 20dk 3l 3 + 15k4l 3 -15dk 2l4 - 20k 314 +
6dkl 5 + 15k2l 5 - dl 6 - 6kl 6 + l7,
ek 5 - 5ek41 - k 5l + lOek 3l 2 + 5k 412 - lOek 213 - 10k3l 3 + 5ekl 4 + lOk 2l 4 -
el 5 - 5kl 5 + l6,
- fk 4 + 4fk 3l + k41- 6fk 2l 2 - 4k 3l 2 + 4fkl 3 + 6k 2l 3 - fl4 - 4kl4 + l5,
gk 3 - 3gk2[ - k 3[ + 3gk[2 + 3k 2[2 - g[3 - 3kl 3 + [4,
-hk 2 + 2hkl + k 2l- hl 2 - 2kl2 + l3,
jk-jl-kl+l2.
29. Huneke
32. Siebert
w 2xy + w 2xz + w 2z 2, tx 2y + x 2yz + X 2Z 2,
twy2 + t y 2z + y2 z 2, t 2 wx + t 2 wz + t 2 z 2.
3 1
° ° 4 3
° °
15
2,1,1,1,1,0, ... ,
'"-v--'
° no 7
6
4 1 1 3 11 13 1 6,6,5,4 no 1
5
6
3
9
* 12 1210
2 1
20
20
°
9
9
9 3,3,3,2,2,2,2,0,0,0
9 3,3,3
no
no
2
no
*
7 15 25 2 5 16 1 1 3,3,3,3,3,3 yes no
8 5 2 4 1 22 4 5 0, ... ,0 yes no
'"-v--'
14
9 * 3 10 4 *,46 2 2 1, ... ,1 yes no
'"-v--'
12
10 6 5 2 2 12 1 2 1, ... ,1 yes no
'"-v--'
9
11 * 83 7 5 *,41 9 9 1, ... ,1 yes no
'"-v--'
6
12 44 89 70 111 452 35 35 1, ... ,1 no no
'"-v--'
25
13 * 52 402 35 *,548 396 396 0, ... ,0 no no
'"-v--'
20
14 2 1 3 3 1 1,1,0, ... ,0 no 6
15 103
° 14 12 123
°
2 2
'"-v--'
6
0, ... ,0 no no
*
'"-v--'
30
16 21 98 354 6 31 9 270 8,7,6,5 yes no
17 123 14 26 18 125 14 14 0,0 yes no
18 18 52 93 7 64 19 19 0, ... ,0 no no
'"-v--'
19
19 * 2 71 1 *,42 44 88 1,1 yes no
20 114 360 2 5 122 3 27 4,0 no 1
21 1 167 1 6 12 1 2 9,8,7,6,5,4,3,2,1 no 8
22 5 * 3 10 13 2 2 2,2,2,1,1,1,1,1,0 no 2
23 17 * 10 16 68 6 6 2, ... ,2,1, ... ,1 no 12
'"-v--' '"-v--'
10 17
24 3 21 1 11 11 1 8,6,6,4,4,4 no 1
25
26
35 * 2 325 43 °
5 342 5,4,3,3,1,1 no 2
* 101 14 13 *,224 12 13 0, ... ,0 no no
'"-v--'
40
27 7 2 1 1 9 1 1 4,4,4 yes no
28 25 4 1 2 91 4 4 7,7 yes no
29 1 * 11734 4266 11 * 2,1, ... ,1,0 no 11
'"-v--'
10
Primary Decomposition: Algorithms and Comparisons 219
References
1. Amrhein, B.; Gloor, 0.; Kiichlin, W.: Walking faster. DISCO 1996, LNCS 1996.
2. Backelin, J.; Frohberg, R.: How we prove that there are exactly 924 cyclic 7-roots.
Proc. ISSAC 91, 103-111.
3. Boge, R.; Gebauer, R.; Kredel, H.: Some examples for solving systems of algebraic
equations by calculating Grobner bases. J. Symb. Compo 2, 83-89 (1987).
4. Becker, E.; Wormann, T.: Radical computations of zero-dimensional ideals and
real root counting. Mathematics and Computers in Simulation 42, 561-569 (1996).
5. Chou, S.-C.: Mechanical geometry theorem proving. Mathematics and Its Appli-
cations, D. Reidel, 1988.
6. Caboara, M.; Conti, P.; Traverso, C.: Yet another ideal decomposition algorithm.
To appear in AAECC Proceedings.
7. Decker, W; Manolache, N.; Schreyer, F.-O.: Geometry of the Horrocks-bundle on
IPs. In: Complex Projective Geometry. London Math. Soc. Lecture Notes Series
179, 128-148 (1992)
8. Eisenbud, D.; Huneke, C.; Vasconcelos, W.: Direct methods for primary decompo-
sition. Invent. Math. 110, 207-235 (1992).
9. Eisenbud, D.; Sturmfels, B.: Binomial Ideals. Duke Mathematical Journal 84, 1-45
(1996)
10. Greuel, G.-M.; Pfister, G.; Schonemann, H.: SINGULAR Reference Manual, Re-
ports On Computer Algebra Number 12, May 1997, Centre for Computer Algebra,
University of Kaiserslautern from www.mathematik.uni-kl.de/zca/Singular.
11. Gianni, P.; Miller, V.; Trager, B.: Decomposition of algebras. ISSAC 88, Springer
LNC 358, 300-308.
12. Grauert, H.; Remmert, R.: Analytische Stellenalgebren. Springer 1971.
13. Gianni, P.; Trager, B.: Integral closure of noetherian rings. Preprint, to appear.
14. Gianni, P.; Trager, B.; Zacharias, G.: Grobner Bases and Primary Decomposition
of Polynomial Ideals. J. Symbolic Computation 6, 149-167 (1988).
15. de Jong, T.: An algorithm for computing the integral closure. J. Symbolic Com-
putation (to appear).
16. Krick, T.; Logar, A.: An algorithm for the computation of the radical of an ideal
in the ring of polynomials. AAECC9, Springer LNCS 539, 195-205.
17. Matsumura, H.: Commutative ring theory, Cambridge studies in advanced math.
8.
18. Mishra, B: Algorithmic Algebra, Texts and Monographs in Computer Science,
Springer, 1993.
220 W. Decker, G.-M. Greuel, G. Pfister
1 Introduction
The basic motivation of real quantifier elimination is to "eliminate" unwanted
variables from an algebraic description of some situation. The unwanted variables
may represent unknown real quantities, e.g. quantities that cannot be measured
or determined directly in the given model. Consider the following simple exam-
ple: We want to test the solvability of the equation ax 2 + b = 0 where a and bare
considered real parameters. Then we find that this equation has a real solution
if and only if ab < 0 or b = O.
Thus even if we start with a single equation and eliminate a single variable,
we cannot avoid the use of order inequalities in the elimination result. Moreover,
we have to allow logical connectives like "and" and "or." So it makes sense to
formulate the real elimination from the outset as the problem of eliminating
variables from a Boolean combination of polynomial equations and inequalities.
In fact, the proper formulation of the problem requires first-order logic.
Nevertheless, the heart of real quantifier elimination belongs to algebra, more
precisely to real algebra. The central problem is to count the number of real
solutions of a system of polynomial equations and inequalities with parametric
coefficients as a function of these coefficients. In other words, the coefficients may
take on arbitrary real values, and the number of solutions should be expressed
uniformly by conditions on these parametric coefficients.
The past history of real quantifier elimination dates back to the 17th century.
In 1637, Descartes established his rule of signs, cf. [17]:
Let f be a squarefree univariate real polynomial. Denote by s the number
of sign changes in the coefficients of f ignoring zeroes, and by n the
number of positive real roots of f. Then n ~ s, generally s - n is even,
and if all roots of f are real, then n = s.
Budan (1807), cf. [4], and Fourier (1831), cf. [26], showed that Descartes' rule
of signs is actually a special case of a more general theorem:
The first decisive exact result on real root counting is the famous theorem of
Sturm (1835), cf. [47]:
Sturm's crucial idea was to change Budan's and Fourier's sequence D of higher
non-zero derivatives to the Sturm sequence S described above in order to get the
exact number of roots.
In 1853 Sylvester observed that Sturm's theorem can easily be extended to
cover an additional inequality as a side condition, cf. [51]:
of the implemented methods. Not until a few years ago, some of these methods
have been able to solve problems of interesting size in science, engineering, and
also in economics, namely in operations research. Though the enormous increase
in computational power plays a certain role, it was mainly theoretical work that
contributed to this development.
On one hand CAD has gone through numerous improvements, cf. [38], [31,32],
resulting in partial CAD, cf. [12], implemented in Hong's QEPCAD program. On
the other hand it had been shown that real quantifier elimination is inherently
hard for some problem classes, cf. [18,53]. Thus the attention turned to special
procedures for restricted problem classes, where the elimination procedures can
be tuned to the structure of the problem. The focus was on considering formu-
las in which the occurrence of quantified variables is restricted to low degrees,
cf. [53,37,33,34,27,57,58]. This was initiated by the third author in 1988. In
his virtual substitution method the number of parameters plays a minor role for
the complexity. The worst-case complexity of the method is doubly exponential
only in the number of the quantifier blocks of the input formula. This makes the
method attractive for problems containing many parameters. It is implemented
in the REDUCE package REDLOG, cf. [20,21] by the first and the second author.
The version of the method currently implemented is incomplete in that it can
fail for input problems violating certain degree restrictions wrt. the quantified
variables. In principle the method can be extended to arbitrary degrees, cf. [58].
In 1993 the third author introduced a new complete elimination procedure
based on comprehensive Grabner bases in combination with multivariate real
root counting, cf. [55] and [54, 11,42]. The focus of the method is on problems
containing many equations. It is, however, complete, i.e., there is no restriction
on the possible input problems. This procedure has been implemented in the
package QERRC by the first author within the computer algebra system MAS,
cf. [19].
In recent years there have been impressive results on asymptotically fast real
elimination algorithms, cf. [43,8]. Implementation of these methods is still at a
very early stage. So the question to what extent these methods are of practical
relevance cannot be answered yet.
In this note, we will examine the applicability of automatic real quantifier
elimination using the three available packages QEPCAD, REDLOG, and QERRC.
We wish to emphasize that we do not consider this to be a competition between
the packages. In particular, the majority of the examples will be computed with
RED LOG since our current research focuses on this package and the methods
implemented there.
The plan of our paper is as follows: Section 2 summarizes the logical founda-
tions necessary for understanding the discussed elimination methods. Section 3
focusses on the mathematical background of the packages under consideration.
Section 4 discusses how to encode problems from science, engineering, and eco-
nomics in such a way that they can be solved by elimination methods. It also
includes some automatic elimination examples with timings and references to
further examples. In Section 5, we will summarize and evaluate our results.
224 A. Dolzmann, T. Sturm, V. Weispfenning
2 A Formal Framework
In order to give a formal framework for real quantifier elimination, we introduce
first-order logic on top of polynomial equations and inequalities.
We consider multivariate polynomials f (u, x) with rational coefficients, where
u = (Ul ... ,um ) and x = (Xl, ... ,xn ). We call u parameters and we call x main
variables. Equations will be expressions of the form f = 0, inequalities are of
the form f 2: 0, f > 0, or f =1= 0. Equations and inequalities are called atomic
formulas. Quantifier-free formulas are Boolean combinations of atomic formulas
by the logical operators "A," "V," and "...,." Existential formulas are of the form
3Xl ... 3x n'I/J(u, x), where 'I/J is a quantifier-free formula. Similarly, universal for-
mulas are of the form 't/Xl ... 't/xn'I/J(u, x). A general (prenex) first-order formula
has several alternating blocks of existential and universal quantifiers in front of
a quantifier-free formula.
The real quantifier elimination problem can be phrased as follows: Given a
formula <p, find a quantifier-free formula <p' such that both <p and <p' are equivalent
in the domain of the real numbers. A procedure computing such a <p' from <p is
called a real quantifier elimination procedure.
Quantifier elimination for an existential formula <p(u) == 3Xl ... 3xn'I/J(u, x)
has a straightforward geometric interpretation: Let
M = {(u,x) E Rm +n I 'I/J(u,x)},
and let M' = {u E Rm I <p(u) }. Then M' is the projection of M along the coor-
dinate axes of the existentially quantified variables x onto the parameter space.
Quantifier elimination yields a quantifier-free description of this projection.
Sets defined by first-order formulas are called definable sets. Sets defined
by quantifier-free formulas are called semi-algebraic sets. Obviously, every semi-
algebraic set is definable. The existence of a quantifier elimination procedure
implies that, vice versa, every definable set is already semi-algebraic.
We have already indicated in the Introduction that quantifier elimination for
existential formulas provides a test that determines the solvability of a paramet-
ric system of equations in dependence on the parameters. The procedure gives,
however, no information on possible solutions of the input system. This point
of view gives rise to the following generalization of quantifier elimination: Given
an existential formula <p == 3Xl" . xn'I/J(u, x), we wish to compute a set
ifl = {(<p~(U)'O!k(U)) IkE K}, K finite,
which has the following properties:
1. The <p~ are quantifier-free formulas. The O!k provide terms tl(U), ... , tn(u)
corresponding to the eliminated variables x!, ... , x n .
2. Define <p' as VkEK <Pk' Then <p' is equivalent to <p in the reals. In other words,
<p' is obtained from <p by quantifier elimination.
3. Fix real values for the parameters u: if <p and hence <p' holds, then there is
some <Pk which holds. The corresponding answer O!k is a sample point, which
when virtually substituted for x satisfies 'I/J.
Real Quantifier Elimination in Practice 225
For the notion of virtual substitution d. Section 3.2. An algorithm mapping <p
to (p' as described above is called an extended quantifier elimination procedure,
or a quantifier elimination with answer.
Cylindrical algebraic decomposition (CAD), cf. [14,1], is the oldest and most
elaborate implemented real quantifier elimination method. It was developed by
Collins and his students starting in 1974. During the last 10 years particularly
Hong made very significant theoretical contributions that improved the perfor-
mance of the method dramatically, cf. [31,32], resulting in partial CAD, d. [12].
Hong has implemented partial CAD in his program QEPCAD based on the com-
puter algebra C-library SACLIB. QEPCAD is not officially published but available
from Hong on request.
We sketch the basic ideas of CAD: Suppose we are given an input formula
Let F be the set of all polynomials occurring in 1/J as left hand sides of atomic
formulas. Call C ~ IRm +n sign invariant for F, if every polynomial in F has
constant sign on all points in C. Then 1/J(c) is either "true" or "false" for all
cE C.
Suppose we have a finite sequence III, ... , IIm+n with the following proper-
ties:
1. Each IIi is a finite partition of IRi into connected semi-algebraic cells. For
1 :::; j :::; n each IIm+j is labeled with Qj.
2. IIi-1 consists for 1 < i :::; m + n exactly of the projections of all cells in
IIi along the coordinate of the i-th variable in (UI, ... , U m , Xl, ... , xn). For
each cell C in IIi- 1 we can determine the preimage S(C) ~ IIi under the
projection.
3. For each cell C in lIm we know a quantifier-free formula c5c(UI, ... ,um)
describing this cell.
4. Each cell C in IIm+n is sign invariant for F. Moreover for each cell C in
IIm+n, we are given a test point to in such a form that we can determine
the sign of !(to) for each! E F and thus evaluate 1/J(tc).
A quantifier-free equivalent for <p is obtained as the disjunction of all c50 for
which C in lIm is valid in the following recursively defined sense:
1. For m :::; i < m + n, we have that IIi+1 is labeled:
(a) If IIi+1 is labeled "3," then C in IIi is valid if at least one C' E S(C) is
valid.
(b) If IIi+! is labeled "V," then C in IIi is valid if all C' E S(C) are valid.
2. A cell C in IIm+n is valid if 1/J(to) is "true."
226 A. Dolzmann, T. Sturm, V. Weispfenning
We now have to clarify how to obtain such a sequence III, ... , IIm+n' the
quantifier-free formulas oe, and the test points te. This happens in two phases,
the projection phase and the construction phase.
In the projection phase, one determines from F ~ IR[UI' ... ,U m , Xl, ... ,xnl
a new finite set
F' ~ IR[UI, ... ,U m ,XI"",x n -ll
such that the following condition is satisfied: Consider a, b E IRm +n - 1 such that
for all I' E F' the signs of both f'(a), f'(b) E IR are equal. Then for all 1 E F
the corresponding univariate polynomials l(a, xn), l(b, xn) E lR[x n l both have
the same number of different real and complex roots.
This has the following important consequence: Let C be a connected subset
of IRm +n - 1 that is sign invariant for F'. For each 1 E F consider the functions
r2k : C -t IR assigning to a E C the k-th real root of l(a, xn) E lR[xnl. Then all
these r2k are continuous. Moreover, the graphs of the various r2k do not intersect.
In other words, the order of the real roots does not change as they continuously
change their position on the real line. In the CAD framework they are called
delineated. The step from F to F' is called a projection.
Iterative application of such projections leads to a finite sequence F m +n ,
... , F I , where
-2
-4
-~~6~--~4--~-2O--*--~--'4--~6
u
C11 = C1 x ]-00,0[,
C 12 = C1 x {O},
C 13 = C1 X ]0, -u-F [,
C 14 = C1 {-U-F}'
X
C 15 = C1 X ] -u-F,
-utF [,
C 16 = C 1 X {-utF},
C17 -- C1 x] -utrr=4"
2 " oo[
C21 = C2 x ]-00,0[,
C22 = C2 X {O} = {(-2,0)},
C23 = C2 X ]0,1[,
C24 = C2 x {1} = {(-2, 1)},
C25 = C2 x ]1,00[.
T21 = {( -3, -1), (-3,0), (-3, t), (-3, 3-2/&), (-3,1), (-3, ¥), (-3,3)},
for the cylinder above C2 , we have
T23 = {(-1,-1),(-1,0),(-1,1)}.
Just as the decompositions, the subsets T 24 , ... , T27 of test points are similar
to those already computed. The test points tCij belonging to II2 are the union
T2 = T21 u··· UT27 ·
Since II2 already belongs to the quantified variable x, it is not necessary to
compute quantifier-free formulas describing the cells. Instead, II2 is labeled "3."
This concludes the construction phase.
We now return to II1 , which is the intermediate decomposition belonging to
the last-in our case only-free variable u. For each of the cells Ci E II1 , we
Real Quantifier Elimination in Practice 229
have to check whether Ci is valid. That is, we substitute the test points to,;
belonging to the cells in II2 origining from Ci into 1/J( U, x) evaluating it to either
"true" or "false." Since II2 is labeled ":3," C i is valid if and only if at least one
such test point yields "true."
For C1 the test points tON = (-3, 3-2v'5) and to'6 = (-3, ¥)
satisfy 1/J,
i.e., C1 is valid. So is C2 due to the test point to2• = (-2,1). The cells C3 ,
... , C7 are not valid. The result of the quantifier elimination is thus
The virtual substitution method dates back to a theoretical paper by the third
author in 1988, cf. [53]. During the last five years a lot of theoretical work has
been done to improve the method, cf. [37,57,23,22,58]. After promising experi-
mental implementations by Burhenne in 1990, cf. [10], and by the second author
in 1992, the method was efficiently reimplemented within the REDUCE package
REDLOG by the first and the second author. REDLOG is in fact a computer logic
system providing not only quantifier elimination but a sophisticated working en-
vironment for first-order logic over various languages and theories, cf. [21]. There
are also interfaces to QEPCAD and QERRC available such that these packages can
be called from RED LOG and the results are available to be further processed by
REDLOG. The REDLOG source code and documentation are freely available on
the WWW.l
The applicability of the method in the form described here is restricted to
formulas in which the quantified variables occur at most quadratically. Moreover
quantifiers are eliminated one by one, and the elimination of one quantifier can
increase the degree of other quantified variables. On the other hand there are
various heuristic methods built in for decreasing the degrees during elimination.
One obvious example for such methods is polynomial factorization.
For eliminating the quantifiers from an input formula
the elimination starts with the innermost quantifier regarding the other quanti-
fied variables within 1/J as extra parameters. Universal quantifiers are handled by
means of the equivalence Vx1/J +--+ ..,:3x..,1/J. We may thus restrict our attention
to a formula of the form
where the U rn +!, ... , Uk are actually Xi quantified from further outside.
1 http://vvv.fmi.uni-passau.de/Nredlog/
230 A. Doizmann, T. Sturm, V. Weispfenning
We fix real values a1, ... , ak for the parameters U1, ... , Uk. Then all poly-
nomials occurring in 'IjJ* become linear or quadratic univariate polynomials in x
with real coefficients. So the set
of all real values b of x satisfying 'IjJ* is a finite union of closed, open, and half-
open intervals on the real line. The endpoints of these intervals are among ±oo
together with the real zeroes of the linear and quadratic polynomials occurring
in 'IjJ*. Candidate terms 01, ... , Or for the zeroes can be computed uniformly in
U1, ..• , Uk by the solution formulas for linear and quadratic equations.
If all inequalities in 'IjJ* are weak, then all the intervals constituting M will,
into each direction, be either unbounded or closed. In the latter case, such an
interval will contain its real endpoint. Thus M is non-empty if and only if the
substitution of ±oo or of one of the candidate solutions OJ for x satisfies 'IjJ*.
The substitution of ±oo into a polynomial equation or inequality is evaluated
in the obvious sense. The substitution of expressions in U1, •.. , Uk of the form
(a + by'c)/d among the OJ can be rewritten in such a way that all denomi-
nators involving the Ui and all square-root expressions are removed from the
result, cf. [58]. By disjunctively substituting all candidates into 'IjJ* we obtain a
quantifier-free formula equivalent to 3x'IjJ*.
If 'IjJ* contains also strict inequalities, we need to add to our candidates for
points in M expressions of the form o±c, where 0 is candidate solution for some
left-hand side polynomial occurring in a strict inequality. The symbol c stands
for a positive infinitesimal number. Again the substitution of these expressions
into a polynomial equation or inequality can be rewritten in such a form that
there occur neither denominators involving any of the Ui, nor any square root
expressions, nor the symbol c: in the result, cf. [58]. Again this yields a quantifier-
free formula equivalent to 3x'IjJ* . For practical applications this method, of course,
has to be refined by a careful selection of a smaller number of candidate solutions
and by a combination with powerful simplification techniques for quantifier-free
formulas, cf. [22] for details.
Recall that the well-known solution formula for quadratic equations
is a sufficient set of candidate points. Virtual substitution for x of the terms has
to introduce the condition u 2 - 4 ~ 0 and to resolve the square roots and the
denominators. For the first term, we obtain
u 2 - 4 ~ 0 A 0 = 0 A -u + ..ju 2 - 4> 0 ==
u 2 - 4 ~ 0 A (-u > 0 V (_U)2 < u 2 - 4) == u 2 - 4 ~ 0 A u < o.
Similarly, we obtain for the second term
u 2 - 4 ~ 0 A 0 = 0 A -u - ..ju 2 - 4 > 0 ==
u 2 - 4 ~ 0 A -u > 0 A (-u? > u 2 - 4 == u 2 - 4 ~ 0 Au < o.
The final elimination result is the disjunction of both substitution results:
Let [ ~ JR[xI' ... ,xn ] be a zero-dimensional ideal. For 9 E IR[XI , ... ,xn ]
consider the symmetric quadratic form Qg = (trace(m gb,bj))I:5i,j::;d on
the linear IR-space S = IR[XI, ... ,xn]1[, where {b l , ... , bd} ~ S is a basis,
and the mh : S -+ S are linear maps defined by mh(f + 1) = (hI) + [.
Let s be the signature of Qg, and denote by n+ and n_ the number
of real roots of [ at which 9 is positive or negative, respectively. Then
n+ - n_ = s.
The use of a Grabner basis of the ideal [ allows to obtain a basis of S, and to
perform arithmetic there, cf. [9], thus obtaining the matrix Qg.
Similar to the Sturm-Sylvester theorem, this approach can be extended to
obtain the exact number of roots under a side condition, and can be moreover
extended to several side conditions:
Let F, {gl, ... , gk} ~ IR[XI , ... , xn] be finite, and assume that [ = Id(F)
is zero-dimensional. Denote by N the number of real roots a E IRn of F
for which gi > 0 for 1 :::; i :::; k. Define
For real quantifier elimination, this root counting has to be further extended
to multivariate polynomials with parametric coefficients in such a way that it
will remain correct for every real specialization of the parameters including spe-
cializations to zero. This task has been carried out by the third author using
comprehensive Grobner bases, cf. [55]. It has been implemented by the first au-
thor within the package QERRC of the computer algebra system MAS, cf. [19] and
also [46,36].
We explain how to eliminate the quantifiers from an arbitrary input formula
rp(Ut, ... ,Um) == Q1Xl ... QnXn.,p(Ul, ... ,Um,Xl, ... ,Xn), Qi E {3,'v'}.
such that whenever .,pi(a) holds for a E am, then Fi(a, x) is a Grobner basis of
{h(a, x), ... , h(a,x)} ~ R[x]. On the formula level this corresponds to passing
from rp* to the equivalent formula
r
rp**(u) == V (.,p;(u) /\ 3x1 ... 3xn( 1\ f(u,x) = 0/\ 1\ g(u,x) > 0)).
;=1 /EF, gEG
of rp**. Notice that the condition .,pi ensures that Fi is a Grobner basis of Id(F)
if rpi* holds, and that in this case we know the head terms of the polynomials
in F;, which are uniformly determined. We thus can uniformly determine the
dimension of Id(F;).
Let us for the moment assume that this dimension is zero. Then we can
determine for 'Y E r(G) the matrices Q'Y of the above theorem on root counting.
In contrast to the situation of the theorem, however, the matrix elements are
not real numbers but polynomials in Ul, ... , u m •
For a univariate polynomial p denote by r(p) the number of positive real
roots of p minus the number of negative real roots of p counting multiplicities.
234 A. Dolzmann, T. Sturm, V. Weispfenning
reducing the computation of the sum of signatures to that of r(I1 XQ~) for
a single univariate polynomial I1 XQ~ which has only real roots. We have to
construct a quantifier-free formula stating that r(I1 XQ~) :f. O. This can be done
by applying Descartes' rule of signs discussed in the introduction to all possible
combinations of signs of the coefficients of I1 XQ~ .
It remains to be clarified how to proceed in the case that Id(G;) has a di-
mension greater than zero: We then compute, uniformly in UI, ... , Urn, a max-
imally independent set Y ~ {Xl, ... , x n }. Since the elimination ideal ofId(G;)
wrt. {Xl, ... , Xn} \ Y is zero-dimensional, we can eliminate all quantifiers 3x; for
Xi i Y considering all Xj E Y as additional parameters.
In the procedure above, it can happen that already Y = {Xl, ... , x n }. That
is, there is no non-trivial equation under the condition 'l/Ji. It is, however, always
possible to replace <pi* by an equivalent formula
such that restarting the whole elimination procedure on <pi**, we can eliminate
at least one quantifier in the next step.
Introducing several sophisticated modifications, the first author has improved
the practical complexity of this method considerably. First, it is not necessary
to consider all permutations of signs of coefficients of I1 XQ~' Furthermore, in-
equalities 9 :f. 0 can be treated much more efficiently than by coding them as
9 > 0 V -g > 0 or g2 > 0, cf. [19).
Like CAD, this approach is a general real quantifier elimination procedure.
In practice a successful application requires that the system of equations in the
input is globally zero-dimensional i.e. zero-dimensional for all real parameter
values. Moreover there should be at most one order side condition. This class
of inputs is nevertheless mathematically quite interesting and comprises e.g.
most implicitization problems for parametric real varieties. Similar to the virtual
substitution method, the number of parameters plays a minor role compared to
that of the quantified variables.
Example 3. We once more eliminate the quantifier from <p(u) == 3X(X2 +ux+l =
01\ X > 0). This input formula is already a conjunctive existential formula such
as <po in the description of the procedure. The corresponding Grobner system is
Cancelling 0 = 0, the formula <p** of the description thus in turn equals the
original <p, and there is no disjunction to be split. Furthermore, x 2 + ux + 1 = 0
is a nontrivial equation, and thus Id(x2 + UX + 1) is zero-dimensional. From the
Real Quantifier Elimination in Practice 235
order inequality x > 0 we obtain r({x}) :::: {x,x 2 }. Using the basis {l,x} of
JR(u)[xlj Id(x 2 + ux + 1), we compute the matrices
Q _ (trace(m x )
x - trace(mx2)
traCe(m X 2))
trace(m x3) ::::
(-u u +
u2 - 2
2 -
_u 3
2 )
3u '
Q 2 _ (trace(m X2) trace(m",3)) (u2 - 2 _u 3 + 3u )
x - trace(m x3) trace(m x') :::: _u 3 + 3u u 4 - 4u 2 + 2 .
The product of the corresponding characteristic polynomials is
XQ • . XQ.2 :::: (X2 + (u 3 - 2u)X + (u 2 - 4)) . (X2 + (_u 4 + 3u2)X + (u 2 - 4))
:::: X4 + C3X3 + C2X2 + c1X + eo
with coefficients
C3 :::: _u 4+ u 3 + 3u2 - 2u,
C2 :::: -u 7 + 5u 5 - 6u 3 + 2u 2 - 8,
eo :::: u 4 - 8u 2 + 16.
The following quantifier-free formula states that the difference r(XQ • . XQ.,)
between the number of positive and negative roots-counting multiplicities-of
this product is different from 0:
..., ( (eo :::: 0 1\ Cl :::: 01\ (C3 :::: 0 V C2 < 0)) V (eo > 01\ (C2 < 0 V CIC3 < 0)) ).
Thus the result of the quantifier elimination is the following:
+ u 5 + 7u 4 - 6u 3 - 12u2 + 8u :::: 0 1\
..., ( (u 4 - 8u 2 + 16 :::: 01\ _u 6
(_u 4 + u 3 + 3u 2 - 2u :::: 0 V _u 7 + 5u 5 - 6u 3 + 2u 2 - 8 < 0)) V
(u 4 _ 8u 2 + 16 > 01\ (-u 7 + 5u 5 - 6u 3 + 2u 2 - 8 < 0 V
UlO _ 2u 9 - guS + 18u 7 + 25u 6 - 52u 5 - 16u 4 + 48u 3 - 16u 2 < 0)) ).
... ,
will yield "true" if and only if the system is feasible. Extended quantifier elimi-
nation will in addition yield a sample solution.
Moreover quantifier elimination can be straightforwardly applied to solve the
following generalizations of the problem:
- The constraints are arbitrary Boolean combinations instead of simply con-
junctions.
- The coefficients of the Xi are polynomials ai(Ul, .. . , urn) in the parameters.
- The constraints include strict inequalities such as ">" and ""#."
- The Xi occur with arbitrary degree. There are also arbitrary products of
different Xi and Xj. Here, one possibly has to obey restrictions imposed by
the quantifier elimination method used.
Notice that with strict inequalities the sample solutions provided by REDLOG
will in general involve non-standard terms containing "c." This can in principle
be overcome for the price of higher complexity, cf. [37]. With higher degrees,
solutions can in general not be determined as closed terms but in some algebraic
form.
Example 4. Determine necessary and sufficient conditions for the feasibility of
Applying QEPCAD to 3XaX2 (xi + X~ ~ UlI\ XI > U2) yields Ul - U2 > O!\ Ul ~ 0
after 1 s. REDLOG obtains the same result after 10 ms. QERRC could not compute
a result within 1 minute. Extended quantifier elimination in RED LOG yields also
within 10 ms in addition the sample point (Xl, X2) = (JUl, 0).
4.2 Optimization
Optimization differs from constraint solving in that a sample solution point is
required for which an objective function F(Xl' ... ,xn ) is minimal. The simplex
algorithm, cf. [16], is designed for this task with a parameter-free linear func-
tion. With the Fourier-Motzkin, cf. [40], method linear objective functions are
Real Quantifier Elimination in Practice 237
F=
Xl + 3X2 + 2
2Xl + X2 + 1
QEPCAD obtains after 1 s the elimination result 3z + 8 ~ 0 stating that -8/3 is
the minimal value of F. REDLOG obtains after 70 ms the less simplified though
equivalent result
3z 2 + 2z - 16 <.5. 0 V 2z + 3 ~ 0 V
(10z 2 + 13z + 4 ~ 0 A 2Z2 + 5z + 2 <.5. 0 A 2z2 + 3z + 1 ~ 0 A z + 3 =I- 0).
QERRC could not determine a minimum within 1 minute. Extended quantifier
elimination in RED LOG yields also within 70 ms in addition sample solution
points in the form of fractions involving z.
See [56, 59] for further information on optimization by quantifier elimination.
4.3 Scheduling
The classical scheduling problem is as follows: Tasks 1, ... , n have to be per-
formed on machines 1, ... , m. Each task i is assigned to a machine mi and
requires a processing time Pi. Moreover, there is a partial ordering "-<" defined
on the tasks. The problem is to determine starting times ti for each task i such
that none of the ordering constraints on the tasks is violated, no machine is
occupied by more than one task at the same time, and the last task is finished
at the earliest time possible.
This classical setting can be restated as a quantifier elimination problem as
follows:
n
:It 1 ... :ltn (1\ ti ~ 0 A
i=l
n
1\ ti + Pi <.5. tj A 1\ (ti + Pi <.5. tj V tj + Pj <.5. ti) A 1\ z ~ ti + Pi).
i-<i 1:5 i <;5" i=l
171i =Tnj
- For each job there is given an interval of time within which it has to be
performed.
- Certain machines are allowed to process several tasks in parallel.
- Certain jobs require more than one machine in parallel.
- The cost function to be minimized is an arbitrary piecewise rational function
of starting times of the tasks.
- The extreme freedom in the form of constraints allows to do hierarchical
scheduling: In two elimination steps, a second objective function is optimized
under the assumption of an optimal solution wrt. a first objective function.
This includes e.g. job shop problems on several machines, problems requiring
several units of different resources arising, e.g., in the construction of buildings,
and transportation problems involving penalties for earliness or tardiness of cer-
tain tasks. Here the constraint formula will usually involve disjunctions resulting
from resource restrictions.
job 1 2 3 4 5 6 7 8 9 10 11 1-<2-<3
~ 4020152020201015152015 4-<5-<6-<7
mj 4 2 1 1 2 4 3 2 1 3 4 8 -< 9 -< 10 -< 11
M1 4 II 9 3
M2 8 5 2
M3 10 [TI
M4 1 6 11 I
,
0 20 40 60 75 t
that the last task will have finished after 75 units of time. It also provides ap-
propriate starting times tl, ... , tu for the jobs. Figure 2 shows this result as a
Gantt chart.
Figure 3 shows a very simple electrical sample network, cf. [59J. It consists of
resistances rjk subject to wattage restrictions Pjk. For fixed resistances rjk and
input voltage U3 -Uo, all amperages ijk and voltages Uk are uniquely determined
by physical laws. A description of this circuit as a formula is obtained as follows:
By Ohm's law we have
w == 1\ (Uk - Uj) = rjkijk, N = {(O, I), (0, 2), (I, 2), (I, 3), (2, 3)}.
(j,k)EN
Geometry
Example 11. Let M be the center of the circumcircle of a triangle ABC. Then
LACB = LAMBj2 (see Figure 4). Choosing coordinates A = (-a,O), B =
(a,O), C = (xo, Yo), and M = (0, b) and encoding angles into tangents, an
algebraic translation of this problem reads as follows:
VXVtl VtzVtVb(c2 = a 2 + b2 A c2 = x~ + (Yo - b)Z A
Yotl = a + Xo AYot2 = a - Xo A (1- tlt2)t = tl + t2 --+ bt = a).
Both QEPCAD and QERRC fail on this input. REDLOG yields after 60 ms the
quantifier-free equivalent a/;O V Xo /; 0 V Yo /; 0 containing non-degeneracy
conditions for the triangles.
242 A. Dolzmann, T. Sturm, V. Weispfenning
5 Conclusions
We have given a survey of three implemented quantifier elimination methods
sketching their advantages and drawbacks for certain types of problems. Testing
their applicability to various problems in science, engineering, and economics, it
has turned out that none of them is definitely superior to the others. It can, in
contrast, be necessary to combine all three of them in order to solve a problem.
Exploring the rules governing the applicability of the various methods and de-
signing a scheduling procedure automating the generation of subproblems and
244 A. Dolzmann, T. Sturm, V. Weispfenning
References
1. Dennis S. Arnon, George E. Collins, and Scott McCallum. Cylindrical algebraic
decomposition I: The basic algorithm. SIAM Journal on Computing, 13(4):865-
877, November 1984.
2. Dennis S. Arnon, George E. Collins, and Scott McCallum. Cylindrical algebraic
decomposition II: An adjacency algorithm for the plane. SIAM Journal on Com-
puting, 13(4):878-889, November 1984.
3. Chaouki T. Abdallah, Peter Dorato, Wei Yang, Richard Liska, and Stanly Stein-
berg. Applications of quantifier elimination theory to control system design. In
Proceedings of the 4th IEEE Mediterranean Symposium on Control and Automa-
tion, pages 340-345. IEEE, 1996.
4. Ferdinand Franc;ois Desire Budan de Boislaurent. Nouvelle methode pour la
resolution des equations numeriques d'un degre queJconque, 1807. 2nd edition,
Paris 1822.
5. Eberhard Becker. Sums of squares and quadratic forms in real algebraic geometry.
In Cahiers du Seminaire d'Historie de Mathematiques, volume 1, pages 41-57.
Universite Pierre et Marie Curie, Laboratoire de Mathematiques Fondamentales,
Paris, 1991.
6. Mark de Berg, Marc van Kreveld, Mark Overmars, and Otfried Schwarzkopf. Com-
putational Geometry, Algorithms and Applications. Springer, Berlin, Heidelberg,
New York, 1997.
7. Michael Ben-Or, Dexter Kozen, and John Reif. The complexity of elementary
algebra and geometry. Journal of Computer and System Sciences, 32:251-264,
1986.
8. Saugata Basu, Richard Pollack, and Marie-Franc;oise Roy. On the combinatorial
and algebraic complexity of quantifier elimination. In Shafi Goldwasser, editor,
Proceedings of the 35th Annual Symposium on Foundations of Computer Science,
pages 632-641, Santa Fe, NM, November 1994. IEEE Computer Society Press, Los
Alamitos, CA.
9. Bruno Buchberger. Ein Algorithmus zum Auffinden der Basiselemente des Restk-
lassenringes nach einem nulldimensionalen Polynomideal. Doctoral dissertation,
Mathematical Institute, University of Innsbruck, Innsbruck, Austria, 1965.
10. Klaus-Dieter Burhenne. Implementierung eines Algorithmus zur Quantorenelimi-
nation £iir lineare reelle Probleme. Diploma thesis, Universitat Passau, D-94030
Passau, Germany, December 1990.
11. Eberhard Becker and Thorsten Wormann. On the trace formula for quadratic
forms. In William B. Jacob, Tsit-Yuen Lam, and Robert O. Robson, editors,
Recent Advances in Real Algebraic Geometry and Quadratic Forms, volume 155
of Contemporary Mathematics, pages 271-291. American Mathematical Society,
American Mathematical Society, Providence, Rhode Island, 1994. Proceedings of
the RAGS QUAD Year, Berkeley, 1990-1991.
Real Quantifier Elimination in Practice 245
12. George E. Collins and Hoon Hong. Partial cylindrical algebraic decomposition for
quantifier elimination. Journal of Symbolic Computation, 12(3):299-328, Septem-
ber 1991.
13. David Cox, John Little, and Donald O'Shea. Ideals, Varieties and Algorithms. Un-
dergraduate Texts in Mathematics. Springer-Verlag, New York, Berlin, Heidelberg,
1992.
14. George E. Collins. Quantifier elimination for the elementary theory of real closed
fields by cylindrical algebraic decomposition. In H. Brakhage, editor, Automata
Theory and Formal Languages. 2nd GI Conference, volume 33 of Lecture Notes
in Computer Science, pages 134-183. Gesellschaft fiir Informatik, Springer-Verlag,
Berlin, Heidelberg, New York, 1975.
15. Alain Colmerauer. Prolog III. Communications of the ACM, 33(7):70-90, July
1990.
16. George B. Dantzig. Maximization of a linear function of variables subject to lin-
ear inequalities. In T. C. Koopmans, editor, Activity Analysis of Production and
Allocation, pages 339-347. John Wiley & Sons, New York, 1951.
17. Rene Descartes. La geometrie-livre III. In Charles Adam and Paul Tannery,
editors, Discours de la Methode f3 Essais, volume 6 of (Euvres des Descartes, pages
442-485. Librairie Philosophique J. Vrin, Paris, new edition, 1973.
18. James H. Davenport and Joos Heintz. Real quantifier elimination is doubly expo-
nential. Journal of Symbolic Computation, 5(1-2):29-35, February-April 1988.
19. Andreas Dolzmann. Reelle Quantorenelimination durch parametrisches Zahlen
von Nullstellen. Diploma thesis, Universitat Passau, D-94030 Passau, Germany,
November 1994.
20. Andreas Dolzmann and Thomas Sturm. Redlog user manual. Technical Report
MIP-9616, FMI, Universitat Passau, D-94030 Passau, Germany, October 1996.
Edition 1.0 for Version 1.0.
21. Andreas Dolzmann and Thomas Sturm. Redlog: Computer algebra meets computer
logic. ACM SIGSAM Bulletin, 31(2):2-9, June 1997.
22. Andreas Dolzmann and Thomas Sturm. Simplification of quantifier-free formulae
over ordered fields. Journal of Symbolic Computation, 24(2):209-231, August 1997.
23. Andreas Dolzmann, Thomas Sturm, and Volker Weispfenning. A new approach for
automatic theorem proving in real geometry. Technical Report MIP-9611, FMI,
Universitat Passau, D-94030 Passau, Germany, May 1996. To appear in the Journal
of Automated Reasoning.
24. Peter Dorato, Wei Yang, and Chaouki Abdallah. Robust multi-objective feedback
design by quantifier elimination. Journal of Symbolic Computation, 24(2):153-159,
August 1997. Special issue on applications of quantifier elimination.
25. Horst A. Eiseit, Giorgio Pederzoli, and Carl-Louis Sandblom. Continuous Opti-
mization Models. De Gruyter, Berlin, 1987.
26. Jean Baptiste Joseph Fourier. Analyse des equations determinees. F. Didot, Paris,
1831.
27. Laureano Gonzalez-Vega. A combinatorial algorithm solving some quantifier elimi-
nation problems. Technical Report 11-1993, University of Cantabria, Departemento
de Mathematicas, Estadistica y Computacion, Facultad de Ciencias, Avenida de
Los Castros, 39071 Santander, Spain, November 1993.
28. Charles Hermite. Remarques sur Ie tbeoreme de M. Sturm. In Emile Picard,
editor, (Euvres des Charles Hermite, volume 1, pages 284-287. Gauthier-Villars,
Paris, 1905.
246 A. Dolzmann, T. Sturm, V. Weispfenning
29. Hoon Hong, Richard Liska, and Stanly Steinberg. Testing stability by quanti-
fier elimination. Journal of Symbolic Computation, 24(2}:161-187, August 1997.
Special issue on applications of quantifier elimination.
30. Christoph M. Hoffmann. Geometric and Solid Modeling. Computer Graphics and
Geometric Modeling. Morgan Kaufmann, San Mateo, California, 1989.
31. Hoon Hong. An improvement of the projection operator in cylindrical algebraic
decomposition. In Shunro Watanabe and Morio Nagata, editors, Proceedings of
the International Symposium on Symbolic and Algebraic Computation (ISSAC 90),
pages 261-264, Tokyo, Japan, August 1990. ACM, ACM Press.
32. Hoon Hong. Simple solution formula construction in cylindrical algebraic decom-
position based quantifier elimination. In Paul S. Wang, editor, Proceedings of the
International Symposium on Symbolic and Algebraic Computation (ISSAC 92),
pages 177-188, Berkeley, CA, July 1992. ACM, ACM Press.
33. Hoon Hong. Quantifier elimination for formulas constrained by quadratic equa-
tions. In Manuel Bronstein, editor, Proceedings of the International Symposium on
Symbolic and Algebraic Computation (ISSAC 99), pages 264-274, Kiev, Ukraine,
July 1993. ACM, ACM Press.
34. Hoon Hong. Quantifier elimination for formulas constrained by quadratic equations
via slope resultants. THE Computer Journal, 36(5}:440-449, 1993. Special issue
on computational quantifier elimination.
35. Mats Jirtsrand. Nonlinear control system design by quantifier elimination. Journal
of Symbolic Computation, 24(2}:137-152, August 1997. Special issue on applica-
tions of quantifier elimination.
36. Frank Lippold. Implementierung eines Verfahrens zum Ziihlen reeller Nullstellen
multivariater Polynome. Diploma thesis, Universitat Passau, D-94030 Passau, Ger-
many, September 1993.
37. Riidiger Loos and Volker Weispfenning. Applying linear quantifier elimination. The
Computer Journal, 36(5}:450-462, 1993. Special issue on computational quantifier
elimination.
38. Scott McCallum. An improved projection operation for cylindrical algebraic de-
composition of three-dimensional space. Journal of Symbolic Computation, 5(1-
2}:141-161, February-April 1988.
39. Scott McCallum. Partial solution to path finding problems using the
CAD method. Electronic Proceedings of the IMACS ACA 1995 on
http://math.unm.edu/ACA/1995.html, 1995.
40. Theodore S. Motzkin. Beitrage zur Theorie der linearen Ungleichungen. Doctoral
dissertation, Universitat Ziirich, 1936.
41. Paul Pedersen. Counting Real Zeros. Ph.D. dissertation, Courant Institute of
Mathematical Sciences, New York, 1991.
42. P. Pedersen, M.-F. Roy, and A. Szpirglas. Counting real zeroes in the multivariate
case. In F. Eysette and A. Galigo, editors, Computational Algebraic Geometry,
volume 109 of Progress in Mathematics, pages 203-224. Birkhauser, Boston, Basel;
Berlin, 1993. Proceedings of the MEGA 92.
43. James Renegar. On the computational complexity and geometry of the first-order
theory of the reals. Journal of Symbolic Computation, 13(3}:255-352, March 1992.
Part I-III.
44. Jaroslaw R. Rossignac. Blending and Offsetting Solid Models. Ph.D. thesis, De-
partment of Electrical Engineering, College of Engineering and Applied Science,
University of Rochester, Rochester, New York 14627, July 1985.
45. Jaroslaw R. Rossignac and Aristides A. G. Requicha. Offsetting operations in solid
modelling. Computer Aided Geometric Design, 3(2}:129-148, August 1986.
Real Quantifier Elimination in Practice 247
Gregor Kemper
Abstract. The Hilbert series and degree bounds play significant roles
in computational invariant theory. In the modular case, neither of these
tools is available in general. In this article three results are obtained,
which provide partial remedies for these shortcomings. First, it is shown
that the so-called extended Hilbert series, which can always be calcu-
lated by a Molien type formula, yields strong constraints on the degrees
of primary invariants. Then it is shown that for a trivial source module
the (ordinary) Hilbert series coincides with that of a lift to characteris-
tic 0 and can hence be calculated by Molien's formula. The last result is
a generalization of Gobel's degree bound to the case of monomial repre-
sentations.
Introduction
In order to study structural properties of an invariant ring k[V]G (see [13] in this
volume), it is usually necessary to calculate generating invariants for k[V]G. For
finite groups G, there exists quite an elaborate machinery for this purpose. Refer-
ences can be found in Sturmfels [22], Kemper [9,10]' and Kemper and Steel [14].
However, when the examples become larger, the difficulty of the calculations in-
creases quickly, and it becomes essential that any additional information on the
invariant ring be exploited. Let us consider the situation somewhat closer. By
Noether's normalization lemma, the problem is split into two parts, which are
of roughly equal difficulty. First one has to calculate a system of homogeneous
parameters (in the sequel abbreviated by hsop), which by definition is a system
h, . .. , In of homogeneous invariants, algebraically independent over the ground
field k, with the property that the invariant ring k[V]G is finitely generated as
a module over the subalgebra k[Il, ... ,In] generated by the hsop. The mem-
bers of an hsop are often called primary invariants. After an hsop has been
found, the second task is to calculate the (finitely many) generators of k[V]G
as a module over k[Il, ... ,In]. Such generators are usually called secondary
invariants. Together with the primary invariants, they form the desired system
of generators for the invariant ring (as a k-algebra).
Now the knowledge of the Hilbert series H(k[V]G, t) (for the definition, see
Section 1) can facilitate the computations considerably at various points:
- If II, ... ,in is an hsop of k[V)G with degrees di = deg(fi), then H(k[V)G, t)
can be written in the form
G f(t)
H(k[V) ,t) = (1 _ tdl) ... (1 _ tdn) (1)
with f(t) E Z[t) a polynomial with integer coefficients. This imposes a strong
constraint on the degree vectors (d 1 , • •. ,dn ) which can occur as the degrees
of an hsop. Since it is very important for the calculation of secondary invari-
ants that the d; be chosen as small as possible (see [10)), constraints of this
kind are crucial for a smooth operation of the algorithm from [10).
- If k[V)G is Cohen-Macaulay (see [13]), then the polynomial f(t) in Equa-
tion (1) is of the form
f(t) = te, + ... + te~,
where the ei are the degrees of the secondary invariants. This reduces the
task of finding secondary invariants to pure linear algebra (see [14]). Even if
k[V)G is not Cohen-Macaulay, a Hilbert-driven approach can speed up the
usual algorithm for the calculation of secondary invariants given in [9) or
[14).
- With the knowledge of H(k[V)G, t), one can get a very simple algorithm for
the calculation of k[V)G, which does not use primary and secondary invari-
ants. In fact, for a tentative set g1, ... ,gr of generators one can calculate
the algebraic relations between the gi by [22, Subroutine 2.5.3) and use the
algorithm given by Bayer and Stillman [1) to calculate the Hilbert series of
the sub algebra k[g1,' .. ,gr)' If this Hilbert series coincides with H(k[V)G, t),
then g1,' .. ,gr generate k[V)G. Now the tentative generators can be taken to
be k-bases for the vector spaces of all invariants of degree ~ d, for increasing
values of d.
If char(k) > IGI, the famous "Noether bound" says that k[V]G can be gen-
erated by invariants of degrees ~ IGI (Noether [16]). In the modular case, no
general bound on the generators is known and it has in fact been shown that
there exists no bound which only depends on the group order (see Richman [17)
or Kemper [12]). Degree bounds are very important for computational purposes,
since taking a k-basis for the vector space of invariants of degree smaller than
Hilbert Series and Degree Bounds in Invariant Theory 251
or equal to such a bound yields generators for k[V]G, which can then be mini-
malized by linear algebra. In the case that V is a permutation module, Gobel [6]
was able to prove that the invariant ring has generators of degrees bounded by
maxin, n(n - 1)/2}, where n = dimk(V). In Section 3, we give a generalization
to the case that V is a monomial representation. We also obtain a slight simpli-
fication of Gobel's arguments and of his rewriting algorithm.
I thank Jurgen Muller for suggesting to me that one could generalize Molien's
formula to the case of trivial source modules.
H(k(V]S, t) =L dimk(k(vl~)' t d •
d2:0
If the characteristic of k is zero, then the Hilbert series can easily be calculated
by Molien's celebrated formula
(2)
where Xs(a) = Trs(a) is the character associated to Sand detv is the deter-
minant of the action on V (see Benson [3, Theorem 2.5.3]; the slight difference
between our formula and Benson's arises from the fact that by Benson's conven-
tion k[V] = S(V*)). Molien's formula can easily be generalized to the case that
the characteristic of k is positive but not a divisor of IGI by taking Brauer lifts.
This case will be covered by the considerations below as well as by Section 2.
Then fora E Gp/, F ('Ij;(a) , 'Ij;(( 2 ), ... ,'Ij;(a n ); t) will be the Brauer lift of detv(1-
ta). The following theorem, which was already hinted at by Benson [3, p. 22]
and Smith [20, p. 218], gives the appropriate generalization of MoHen's formula
to the modular case.
Theorem 1. With the above notation, we have
Proof. If 8' is a simple kG-module with Brauer character 'PSI' then 1/IGI .
'PSI (a) . x(a- 1 ) is 1 if 8 ~ 8' and 0 otherwise by Goldschmidt [7,
2:<TEG p l
Theorem 6.10]. If W is any finite dimensional kG-module and 8 1 , ... ,8T the
composition factors (with repetitions) of W, then the Brauer character associ-
ated to W is 'Pw = 2:;=1 'Ps,' Hence the multiplicity of 8 as a composition factor
ofW is
Hilbert Series and Degree Bounds in Invariant Theory 253
-
H(k[VJ, S, t) = dimk(S)
IGI . "
L ( ,L , rpk[vld (u) . t d)-1
x(u ).
uEG p ' d2:0
Hence we must show that for u E G p ' the identity Ld2:0 rpk[vld (u) . t d
1/ F ('Ij;(u), 'Ij;(u 2 ), ••• , 'Ij;(un)j t) holds. Indeed, if 'Ij;(u) = A1 + ... + An with m-th
roots of unity Ai, we have
n
F ('Ij;(u),'Ij;(u 2 ), ••. ,'Ij;(un)jt) = II(1- tAi),
i=1
and
hence
n n n
L rpk[vld (u) . t d = L II (Ait)i< =II L(Ait)j =II (1 - tAi)-1.
d2:0 it,·.· ,jn?:o i=1 ;=1 j?:O ;=1
o
We now explain how the knowledge of the extended Hilbert series can be used
for computational purposes. The following theorem poses severe restrictions on
the degrees that can occur in an hsop of k[V]G. It is well known (and often used)
for the ordinary Hilbert series (see the Introduction).
Theorem 2. Suppose that il, ... , in E k[V]G is an hsop of k[V]G with d; =
deg(J;) (i = 1, ... , n). Then for every simple kG-module S we can write
H(k[V], S, t) in the form
- is(t)
H(k[V], S, t) = (1 _ t d1 ) ••• (1 _ t dn ) (3)
Proof. We write A = k[il, ... , in]. Then k[V] is a finitely generated graded
module over the group ring AG. We will prove the theorem for an arbitrary
finitely generated graded AG-module M. Since the graded components Md
of M are finitelY generated kG-modules, the subspace MS and the extended
Hilbert series H(M, S, t) are well defined. Now observe that MS is a graded
submodule of Mover AG. Hence the sum of all MS' for S' any simple kG-
module is also a graded submodule, which we denote by Soc(M). Now we can
recursively define M(O) = M and M(i+1) = M(i) / Soc(M(i)) for i > O. There
are canonical maps M -+ M( i), whose kernels we denote by Soci (M). Then
254 G. Kemper
With the right choices of degrees for the free generators of the Fi , all maps are
degree-preserving, and we obtain
r r
H(MS,t) = L(-l)iH(Fi ,t) = L(-l)iJ;(t)H(A,t)
i=O i=O
with J;(t) E No [t] encoding the degrees of the generators of Fi . But H(A, t) =
rr=1 (1- td;)-I (see, for example Sturmfels [22, Lemma 2.2.3]), which completes
the proof. 0
as the representation in the form (3) with the smallest possible degrees d i . It
turns out that the smallest degrees which are possible for an hsop are dl = 5
and d2 = 12 here, which corresponds to the representation
_ t I5 + t I3 + t ll + t lO + t 9 + t S + t7 + t6 + t 5 + t 4 + t 2 + 1
H(k[VJ, S, t) = (1 _ t5)(1 _ t 12 )
holds. In particular,
H(k[V]G, t) = H(K[V]G, t).
Proof. By assumption, there exist permutation modules P and F over kG and
RG, respectively, such that
where W and W are modules over kG and RG, respectively. For dEN, the d-th
symmetric power is
d
Sd(p) ~ EB (Si(V) Q9k Sd-i(W))
i=O
with the corresponding formula for Sd(F). Hence Sd(V) is a trivial source mod-
ule, and Sd(V) is a lifting. So by Benson [2, Corollary 3.11.4],
HOmRG(S,Sd(V))/pHomRG(S,Sd(V)) ~ HomkG(S,Sd(V)),
where p <J R is the maximal ideal. The dimension of the left hand side is equal
to
O-+V-+P-+k-+O
A comparison with Equation (4) illustrates the differences and similarities be-
tween the ordinary and the extended Hilbert series.
Corollary 1. Suppose that V is a trivial source module. Then the degree of
H(k[V]G, t) as a rational function in t is bounded from above by -n, where
n = dim(V).
Proof. This follows from Theorem 3, since Molien's formula holds for K[V]G,
and the degree of each summand in Molien's formula is -no 0
In general, invariant rings of trivial source modules do not have any nice
properties such as the Cohen-Macaulay property or the property that they obey
Noether's degree bound (see Kemper [11, Corollary 2.4] and [12]). However, we
do get some degree bound for the invariants of trivial source modules. We write
In the next section we will show that we can obtain a much better degree bound.
Sn = {7r' (0: 1
". 0) I 7r is an n x n permutation matrix, O:i E J-L} ::; GLn(R)
o O:n
and G ::; Sn a subgroup. G acts on the module V = Rn and also on the symmetric
algebra R[V], which is the polynomial ring R[Xl' ... ,xn ] in the standard basis
vectors Xi of V. Our goal is to establish the degree bound
G n(n + 1)
)3(R[V] )::; max{mn,m· 2 - n}
for the invariant ring. We introduce some notation. We write Si =
Si(xi", ... ,x~), where Si is the i-th elementary symmetric polynomial (i
1, ... ,n), and
M={x11···X~ I eiENo}, T={a·tIO#aER, tEM}
for the set of monomials and terms, respectively. For f E R[V] with f =
L:tEM at' t let T(f) = {at' t I at # O} CT. For t E T, set
Note that every element of the orbit enters only once into the sum.
The following lemma shows that the Si take the same role of a "universal
hsop" for subgroups of Sn as do the elementary symmetric polynomials for per-
mutation groups.
258 G. Kemper
Proof. We obviously have R[V]Sn = R[xj", ... , x~]Sn. Thus the lemma follows
from the corresponding property of the elementary symmetric polynomials. 0
where [z] denotes the greatest integer::; z. The set of all special monomials
in M is denoted by M. pec '
(b) Generally, obtain a special term t = a . Xfl ... x~n from t by applying the fol-
lowing step iteratively: Let k E ~ be minimal with k fJ. {[el/m] , ... , [en/m]},
and lower all ei with [edm] > k by m. Repeat this step until there are no
t
such ei. Then we write = Red(t).
n(n + 1)
deg(t) ::; (m - 1) + (2m - 1) + ... + (nm - 1) = m . 2 - n.
(a) Write II = 7r·diag( ai, ... , an) with 7r a permutation matrix and ai E 1-'. Since
the exponents of t and Red(t) are congruent modulo m, diag(a1, ... , an)
commutes with Red. The same is true for 7r by the definition of Red(t).
(b) By (a) we have II(Red(t)) = Red(lI(t)). Hence if lI(t) = t, then II(Red(t)) =
Red(t). If, on the other hand, II(Red(t)) = Red(t), then Red(lI(t)) = Red(t),
so lI(t) and t have the same coefficient a. As the application of Red does not
change the ordering of the exponents (Remark 1), we conclude lI(t) = t. 0
Hilbert Series and Degree Bounds in Invariant Theory 259
Definition 3. We define a relation >- on T as follows: For t = axr' ... x~n and
t' = a' x~~ ... x~~ choose permutations 7r and 7r' E Sn such that e"'(l) :::: e,..(2) ::::
... :::: e,..(n) and e~'(l) :::: ... :::: e~'(n)' Then we say that t >- t' if there exists a
k E {I, ... ,n} with
Remark 2. Clearly, the relations >- and :::; are transitive, and for t E T there
exists no infinite sequence 81, S2, .•. E T with
But :::; is not an ordering even when restricted to M, since t :::; t' and t' :::; t fail
to imply t = t'.
We can come to the central lemma now (compare Lemma 3.10 in Gobel [6]).
Proof.
(a) Obviously t lies in T(orbs n (u) . Red(t)). We have to show that t >- s for all
other s E T(orbs n (u) . Red(t)). So take such an 8 and assume that t :::; 8.
Clearly 8 = 7r(U) . Red(t) with 7r E Sn. If we can show that 7r(u) = u, then
8 = t and (a) is proved.
Let t = axr' ... x~n, Red(t) = axf' ... x~n and U = xt, ... x~n, where we may
assume by renaming the Xi thatel:::: e2 :::: ... :::: en. Then the construction of
Red(t) and Remark 1 imply el :::: ... :::: en and d l :::: ... :::: d n. The exponents
of 8 are el + d,..(l) , ... ,en +d,..(n)· Let k be maximal with dl = d2 = ... = d k •
Then t :::; 8 implies that 7r must permute the set {Xl,'" ,xd, since the
greatest exponents of t and 8 must coincide. Proceeding analogously for
the second-highest degrees dk+1, ... ,dz, we conclude that 7r also permutes
{Xk+l,'" ,xt}, and finally arrive at 7r(u) = u.
(b) For a E G we have by (a)
a(t) )
orbsn ( Red(a(t)) . Red(a(t)) = orbsn (a(u)) . a(Red(t))
= orbsn(u) . a(Red(t)),
260 G. Kemper
where the last equation follows since u acts on u by permuting the Xi because
the exponents of u are divisible by m. We obtain
Now by Lemma 2(b) the u E G fixing t are the same that fix Red(t), so
summation over coset representatives u of Stab(t) ~ G yields the result. D
We obtain the reduction algorithm listed on the current page.
Begin
for t E Mspec set Pt := 0;
while j =f. 0 do
choose t E T(J) that is maximal w.r.t. >-;
compute Red(t) =: a· t with t E Mspec;
set u:= tf Red(t);
calculate a polynomial P such that orbsn (u) = p( 81, ... , 8n );
set pt:= pt + a . P;
set j := f - orbs (u) ·orba(Red(t))
n
end while
end.
Algorithm!. Write an invariant in terms of special orbit sums and 8i
It is clear that this algorithm is correct if it terminates. For the proof of its
termination we need the following observation.
Lemma 4. If j E R[V)a is an invariant andt E T(J), then T(orba(t)) C T(J).
By Lemma 4, all terms in orba(t) are also terms in j, and by Lemma 3(b),
all terms in (orba(t) - orbsn (u) . orba(Red(t))) are strictly smaller than t. So
Hilbert Series and Degree Bounds in Invariant Theory 261
there are some maximal terms removed from f, and only strictly smaller terms
are added. Hence after a finite number of steps the maximum of T(J) w.r.t. >-
decreases strictly, and iterating further eventually yields f = 0 by Remark 2. 0
Theorem 6. The following degree bounds hold independently of the choice of
R:
(a) The invariant ring R[V)G is generated as a module over A = R[Sl, ... ,sn)
by homogeneous invariants gl, ... ,gr satisfying
n(n + 1)
deg(gi) ~ m . 2 - n.
n(n + 1)
max{mn,m· 2 -n}.
Proof. Part (a) follows from the algorithm and the degree estimate in Remark 1,
and (b) follows since deg(si) ~ mn. 0
The following remark shows that the bound from Theorem 6(a) is very nat-
ural and in many cases sharp.
Remark 3. Consider the special case that R is a field k whose characteristic does
not divide the group order IGI. Then k[V)G is Cohen-Macaulay. Let el, ... ,e r be
the degrees of the secondary invariants. Using Molien's formula for the Hilbert
series of k[V)G, Stanley obtained the equation [21, Proposition 3.8)
~ n(n+ 1)
max{el, ... ,e r } = L.,,(deg(si) -1) -I = m· 2 - n -I,
A
i=l
where I is the least degree of a homogeneous f E k[V) with a(J) = det(a)-l . f
for all a E G. This implies the bound from Theorem 6(a), and we see that in
our special case the bound is sharp if and only if G ~ SLn(k). Hence what we
have done is the generalization of a long-known result of non-modular invariant
theory to the case of arbitrary ground rings.
4 Some Conjectures
Since faithful monomial representations are induced from cyclic subgroups of
order not divisible by the characteristic k, it is natural to ask for a further
generalization to the case of a representation which is induced from any p'-
subgroup. More precisely, suppose that H is a finite group, k a field of char-
acteristic p not dividing IHI, and let W be a kH-module of dimension m.
Fix n E N, then the wreath product Sn = H I Sn has a natural action on
V = Efl7=1 Wi with Wi ~ W. Now we consider subgroups G ~ Sn. Let Xl, ... ,X m
262 G. Kemper
be a basis of Wand Xi,1, ... ,Xi,n the corresponding basis of Wi. Fix an hsop
!I(X1, ... ,xm),· .. ,im(Xl>'" ,Xm) of k[W)H and write J;,j = h(Xi,1,'" ,Xi,m)
E k[W;)H. Then, as in the previous section, we obtain a "universal hsop" for all
subgroups G ~ Sn by taking
where Si are the elementary symmetric functions. In the previous section, it was
essential that G acted by permutations on the set of terms. This feature can
be generalized to the situation considered here by assigning to Xi,j the multi-
degree (0, ... ,0,1,0, ... ,0) with 1 in the i-th position. This yields a direct sum
decomposition of k[V) into multi-homogeneous components with the property
that every element of G maps each component into another one. The author
has been unsuccessful in his efforts to find an appropriate definition of special
multi-degrees and to generalize the arguments from Section 3. The goal would
be to prove
Conjecture 1. With the above notation, let G ~ Sn and take the ii,j as primary
invariants for G. Then for the secondary invariants 9i ofk[V)G we have
n(n + 1)
deg(9i) ~ 2 . (deg(!I) + ... + deg(fn)) - nm.
There is a lot of computational evidence for this conjecture. For the case
where k[V]G is Cohen-Macaulay, it would be a consequence of yet another con-
jecture on the degree of the Hilbert series:
For the special case that V is a trivial source module, this is Corollary 1, and
if the Hilbert series is replaced by the extended Hilbert series, the conjecture
follows from Theorem 1. The degree of the Hilbert series is also called the a-
invariant and has some connection to local cohomology (see Bruns and Herzog [4,
p. 169]).
Hilbert Series and Degree Bounds in Invariant Theory 263
References
1. Dave Bayer, Mike Stillman, Computation of Hilbert Functions, J. Symbolic Com-
putation 14 (1992), 31-50.
2. David J. Benson, Representations and Cohomology I, Cambridge studies in ad-
vanced mathematics 30, Cambridge Univ. Press, Cambridge 1991.
3. David J. Benson, Polynomial Invariants of Finite Groups, Lond. Math. Soc. Lec-
ture Note Ser. 190, Cambridge Univ. Press, Cambridge 1993.
4. Winfried Bruns, Jiirgen Herzog, Cohen-Macaulay Rings, Cambridge University
Press, Cambridge 1993.
5. Charles W. Curtis, Irving Reiner, Methods of Representation Theory, vol. 1, J.
Wiley & Sons, New York 1981.
6. Manfred Gobel, Computing Bases for Rings of Permutation-invariant Polynomials,
J. Symbolic Computation 19 (1995), 285-291.
7. David M. Goldschmidt, Lectures on Character Theory, Publish or Perish, Inc.,
Berkeley 1980.
8. Christoph Jansen, Klaus Lux, Richard Parker, Robert Wilson, An Atlas of Brauer
Characters, Clarendon Press, Oxford 1995.
9. Gregor Kemper, Calculating Invariant Rings of Finite Groups over Arbitrary
Fields, J. Symbolic Computation 21 (1996),351-366.
10. Gregor Kemper, Calculating Optimal Homogeneous Systems of Parameters,
Preprint 97-08, IWR, Heidelberg, 1997, submitted.
11. Gregor Kemper, On the Cohen-Macaulay Property of Modular Invariant Rings,
Preprint 97-38, IWR, Heidelberg, 1997, submitted.
12. Gregor Kemper, Lower Degree Bounds for Modular Invariants and a Question of
1. Hughes, Transformation Groups (1998, to appear).
13. Gregor Kemper, Gunter Malle, Invariant rings and fields of finite groups, this
volume.
14. Gregor Kemper, Allan Steel, Some Algorithms in Invariant Theory of Finite
Groups, in: P. Drfuder, G.O. Michler, C. M. Ringel, eds., Proceedings of the Euro-
conference on Computational Methods for Representations of Groups and Algebras,
Progress in Mathematics, Birkhiiuser, Basel 1998 (to appear).
15. Peter Landrock, Finite Group Algebras and their Modules, Lond. Math. Soc. Lec-
ture Note Ser. 84, Cambridge Univ. Press, Cambridge 1983.
16. Emmy Noether, Der Endlichkeitssatz der Invarianten endlicher Gruppen, Math.
Ann. 77 (1916), 89-92.
17. David R. Richman, Invariants of Finite Groups over Fields of Characteristic p,
Adv. in Math. 124 (1996), 25-48.
18. Lorenzo Robbiano, Moss Sweedler, Subalgebra Bases, in: W. Bruns, A. Simis, eds.,
Commutative Algebra, Lecture Notes in Math. 1430, pp. 61-87, Springer-Verlag,
New York 1990.
19. Larry Smith, Polynomial Invariants of Finite Groups, A. K. Peters, Wellesley,
Mass. 1995.
20. Larry Smith, Polynomial Invariants of Finite Groups: A Survey of Recent Devel-
opments, Bull. Amer. Math. Soc. 34 (1997), 211-250.
21. Richard P. Stanley, Invariants of Finite Groups and their Applications to Comb i-
natorics, Bull. Amer. Math. Soc. 1(3) (1979), 475-511.
22. Bernd Sturmfels, Algorithms in Invariant Theory, Springer-Verlag, Wien, New
York 1993.
23. Jacques Thevenaz, G-Algebras and Modular Representation Theory, Clarendon
Press, Oxford 1995.
Invariant Rings and Fields of Finite Groups
1 Introduction
The study of rings and fields of invariants of linear groups has a long history. In
classical invariant theory, which had its golden age before and around the turn
of the century, the focus was on continuous groups such as SL 2 (C) and their
representations. Recently invariant theory has undergone a renaissance, which is
due to the emergence of better algorithms and machinery for the computation,
and a new interest in the modular case, which in some aspects parallels modular
representation theory.
In this paper we restrict our attention to invariants of finite groups, but the
ground fields will be arbitrary. We look at various structural properties which
rings or fields of invariants can have, and ask for connections between these
properties and properties of the group or the representation in question. For
example, it is a natural question whether the ring of invariants is isomorphic
to a polynomial ring, or whether the field of invariants is purely transcenden-
tal. Weakening the condition of polynomiality, we get a hierarchy of properties
which rings of invariants can have. In particular, we will address the complete
intersection property, the Gorenstein property and the Cohen-Macaulay prop-
erty. When writing the paper we were inspired by the beautiful survey article
of Stanley [29], who 20 years ago considered the same properties of rings of
invariants but focused on the non-modular case.
The first section of this paper is intended as an account of the present-day
knowledge on these question, with special emphasis on the peculiarities in the
modular case. In the course of the paper we will particularly stress the exam-
ple of groups generated by pseudo-reflections for which recently a number of
new results could be shown. These groups are interesting since in characteristic
zero they are exactly the groups whose rings of invariants are polynomial rings.
On the other hand, they provide good examples to illustrate the differences be-
tween the modular and the characteristic zero theory, since over fields of positive
characteristic there exist reflection groups whose invariants are not even Cohen-
Macaulay. As general reading on invariant theory we refer to the books [2,27,
30] of Benson, Smith and Sturmfels.
Pseudo-reflection groups make another appearance in the second section of
this paper, where we consider fields of invariants and ask whether they are purely
transcendental ("Noether's problem"). As it turns out, the fields of invariants of
irreducible pseudo-reflection groups are always purely transcendental, even in the
modular case. This illustrates how different the behavior of fields of invariants
and rings of invariants can be. Noether's problem is a natural and interesting
question on the structure of fields of invariants, but it also has applications in
the construction of so-called generic polynomials for certain Galois extensions.
We will explain this connection to Galois theory and give a few examples.
2 Rings of Invariants
Let k be a field and V a finite dimensional vector space over k. Let G :::; GL(V)
be a finite linear group on V. Then G also acts in a natural way on the symmetric
algebra k[V] of V. We are interested in the structure of the ring of invariants
A=E9 A
i=O
i with Ao = k.
L dim(A;) t; .
00
H(A, t) :=
;=0
with d; = deg(f;),
for some Laurent polynomial get) with integer coefficients. It is not surprising
that H(A, t) contains a lot of information about the structure of A. For example,
one can define the Gorenstein property by saying that A is Gorenstein if it is a
Cohen-Macaulay domain and its Hilbert-series as a rational function in t satisfies
(1)
(2)
for some m ~ 0, where n = dim(A). Let us explain why this property im-
plies the Gorenstein property. Indeed, take polynomials gm+1, ... ,gm+n E R :=
k[It, ... , fn+mJ whose images in A form a homogeneous system of parame-
ters of A. Then Rj (gb ... ,gm+n) is finitely generated as a k-vector space,
which by Nakayama's lemma (as stated in [27, Cor 5.2.5], for example) implies
that R is finitely generated as a module over k[g1, ... ,gm+nJ. In other words,
{g1, ... , gm+n} form a homogeneous system of parameters for R. But choosing
{it, ... , f n+m} as a homogeneous system of parameters shows that R is Cohen-
Macaulay, hence 91, ... , 9m+n is a regular sequence. Therefore, the images of
9m+!, ... ,9m+n are a regular sequence in A, hence A is Cohen-Macaulay. Now
we must verify Equation (1) for the Hilbert series of A. A slight generalization
of what we said about the Cohen-Macaulay property (see [27, Cor 6.2.8]) al-
lows us to conclude from the regularity of g1, ... ,gm that R is free as a module
over k[91,'" ,9mJ. If S is a minimal set of homogeneous generators for Rover
268 G. Kemper, G. Malle
k[gl, ... , gm] and H(S, t) is the formal power series whose i-th coefficient is the
number of generators in S of degree i, then we obtain the equation
where deg(fi) = di , deg(gi) = ei. On the other hand, the image of S under the
canonical map R -r A is a basis of A as a vector space over k by Nakayama's
lemma, hence H(A, t) = H(S, t). Together with the above equation we obtain
that the Hilbert series of a complete intersection A is of the form
Invariant rings of finite groups. Much of the theory of graded algebras was
prompted by the study of rings of invariants, and indeed it seems that rings
of invariants are not very much better behaved than graded algebras which
are domains in general. We will however mention a few special features here
before coming to the one-by-one investigation of the above properties for rings
of invariants.
Most importantly, the study of rings of invariants subdivides naturally into
two quite different cases. Either the characteristic of k divides the order of G
(the modular case), or it does not (the so-called coprime characteristic case). In
the latter case, much more is known about the structure of k[V)G. For example,
it is known that then k[V)G is always Cohen-Macaulay (see Th. 2.1O). This is
no longer true in the modular case. Furthermore, in the coprime characteristic
case the Hilbert series of the ring of invariants may be calculated by Molien's
formula: for any linear character X of G the Hilbert series for the module of
relative invariants
In the modular case, there is no such simple formula for H(k[Vf,t} (but see
the paper [13] of the first author in this volume).
Invariant Rings and Fields of Finite Groups 269
We first consider the question under which conditions the ring of invariants
k[V]G of a finite group is a polynomial ring.
This theorem was first proved by Shephard/Todd [26] for the field k = C of
complex numbers. For one direction of the proof they needed to classify all ir-
reducible finite groups generated by complex pseudo-reflections. (The reduction
to the irreducible case is trivial here, since all complex representations of finite
groups are completely reducible.) Later, Chevalley [5] gave a conceptual proof
avoiding the classification. The statement for arbitrary, but coprime character-
istic was given by Serre [25]. In that same paper, he also showed a necessary
condition in the case that the characteristic divides the order of G. Here the
centralizers
GG(W) = {CT E G I CTW = w for all wE W}
of subspaces W ~ Venter the picture:
Theorem 2.2. (Serre) Let G ~ GL(V) be a finite group such that k[V]G is poly-
nomial. Then G is generated by pseudo-reflections, and the point-wise stabilizer
of any subspace of V has polynomial ring of invariants {and thus is generated by
pseudo-reflections} .
Proposition 2.1. (Kemper) Let G :::; GL(V) be finite. Then k[V]G is polyno-
mial if and only if there exist n = dim(V) homogeneous invariants h, ... , In E
k[V]G such that the Jacobian determinant det«81;f8xj);'j) is nonzero and IGI =
07=1 deg(f;).
Irreducible groups. By the Theorem of Serre (Theorem 2.2) a finite group
with polynomial invariants is necessarily generated by reflections. In the case
of irreducible linear groups, the authors also found a sufficient criterion. Note
that an irreducible linear group generated by reflections is already absolutely
irreducible.
Theorem 2.3. (Kemper/Malle) Let V be a finite-dimensional vector space, G :::;
GL(V) a finite irreducible linear group generated by pseudo-reflections such that
the pointwise stabilizer in G of any nontrivial subspace of V has a polynomial
ring of invariants. Then k[V]G is a polynomial ring.
The proof of Theorem 2.3 in [14] is similar in spirit to the one of Shep-
hard/Todd in the complex case, in that it uses the classification of finite ir-
reducible linear groups generated by pseudo-reflections. This classification was
achieved by Kantor, Wagner and Zalesskii/Serezkin (see [14] for a precise state-
ment of this classification). As a by-product of the proof of Theorem 2.3 we
obtained a complete list of the irreducible reflection groups with polynomial
ring of invariants. This is reproduced in the form of Table 2.4.
As a corollary of the proof we can also verify the following assertion, which
is in partial confirmation of a conjecture of Kac [9]:
Corollary 2.1. Let G :::; GL(V) be a finite irreducible linear group containing
no transvections. Then k[V]G is polynomial if and only if the pointwise stabilizer
in G of any subspace of V is generated by pseudo-reflections.
The example of the transvection group Sp4(2) in its natural 4-dimensional
representation over lF2 shows that the assumption that G contains no transvec-
tions is necessary in Corollary 2.1 (see [14, Ex. 2.2]) .
The following example shows that even when a complex linear group and its
p-modular reduction are isomorphic and both have polynomial invariants, the
degrees of generating invariants and hence the corresponding Hilbert series need
not coincide:
Example 2.1. Let G :::; GL 3(iC) be the primitive complex reflection group of
order 648 denoted G25 by Shephard and Todd [26]. Its ring of invariants is
polynomial with algebraically independent generators in degrees 6,9,12. Let L
be a G-invariant Z2[R]-lattice in ca and consider its 2-modular reduction.
This yields a 3-dimensional reflection representation of G in characteristic 2.
According to [14, Table 6.2] this identifies G with the 3-dimensional unitary
group GU 3(2) :::; GL3(4). Here, the ring of invariants is also polynomial, but the
generators are in degrees 3,9,24.
Invariant Rings and Fields of Finite Groups 271
Theorem 2.5. (Nakajima) Let G ::; GLn(IF'p) be the direct product of a p-group
P with a pi -group H. Then k[v1G is polynomial if and only if
i) H is generated by pseudo-reflections, and
ii) there exists a basis {Xl, ... , xn} of V = JF; such that
n
II IP.xil = IFI
i=l
fit into the hierarchy stated in Section 2.1, but it has a strong connection to
pseudo-reflections, as the following theorem by Nakajima shows.
A proof can be found in [2, Cor. 3.9.3]. The theorem implies that rings of in-
variants of pseudo-reflection groups and of p-groups, where p is the characteristic
of k, are always unique factorization domains.
1 + t2
H(k[V]G, t) = (1 _ t 2)2 .
In view of the fact that Theorem 2.1 fails in the modular case, it seems
interesting to determine which modular reflection groups are at least CI-groups.
This question is still open at present, even the irreducible case has not been
settled. The following partial results have been obtained:
A proof can be found in [2, Thm. 8.3.11]. All exceptional irreducible pseudo-
reflection CI-groups are known (by what we said above, they must be among
the groups in the lower right-hand corner of Table 2.4):
G dim IWp(G)1 2 3 5 7
G24 3 336 - pol pol CI
G 28 4 1152 - CI pol pol
G 30 4 14400 - CI ROt(5) pol
G 3l 4 46080 - CI pol pol
G32 4 155520 - - nCI pol
G 33 5 51840 - R-Os(3) pol pol
G 34 6 19595520 CI - - -
G 36 7 2903040 - nCI pol pol
G37 8 696729600 - nCI nCI pol
The entries - indicate that either Wp(G) is strictly smaller than G, or it acts
reducibly. An entry 'nCI' signifies that the ring of invariants is not a complete
intersection. In two cases, the p-modular reduction W p ( G) in Table 2.8 is isomor-
phic to a classical group. For these, the question of when they are CI-groups is
not yet settled. The proof relies on computer calculations as well as on the sub-
sequent Theorem 2.9 and will be published elsewhere. See also Proposition 2.6
for a related statement.
Kac and Watanabe have proved a necessary condition for CI-groups very
similar to Serre's criterion (Theorem 2.2) for polynomiality. It uses the following
generalization of the notion of pseudo-reflection. An element a E GL(V) is called
m-special if it centralizes a subspace of codimension m, Le., if dim(ker(a -1)) ~
dim(V) - m. Thus pseudo-reflections are I-special. Elements which are 2-special
but not I-special are also called bi-reflections.
Theorem 2.9. (Kac/Watanabe) Let G ~ GL(V) be a finite CI-group. Then G
is generated by 2-special elements and the point-wise stabilizer of any subspace
of V is again a CI-group (and thus is generated by 2-special elements).
This is proved in [9] (see also [8] for fields of characteristic 0).
Theorem 2.9 focuses attention on groups generated by bi-reflections. In char-
acteristic 0 the irreducible ones among these groups were classified around 1980.
Up to imprimitive groups and groups of low dimension, these are just the alter-
nating and symmetric groups in their natural representation. Using this classi-
fication, Gordeev [8] and independently Nakajima ([22] and the references cited
there) determined the CI-groups G ~ GLn(lC). They show that the question for
reducible groups can be answered if the irreducible and the abelian CI-groups
are known. Their result consists of a list of those linear groups over IC which
are CI-groups, which we will not reproduce here. (The irreducible imprimitive
CI-groups of dimension 4 are not classified completely.) The problem of finding
a nice structural criterion for CI-groups is still open even in characteristic O.
274 G. Kemper, G. Malle
Not much is known about the Gorenstein property in the modular case. The
following proposition obtained in [3] shows that it is a useful property.
Invariant Rings and Fields of Finite Groups 275
It is easy to find linear groups whose rings of invariants are Cohen-Macaulay but
not Gorenstein:
Example 2.4. Let k be a field of characteristic different from 3 and G = Z3 the
cyclic group of order three acting by scalar matrices on V = k 2 • Then Molien's
formula gives
G 1 + 2t 3
H(k[V] , t) = (1 _ t 3)2 '
hence the ring of invariants is not Gorenstein. On the other hand it is clear that
k[V]G is generated by XiX2, X1X~ as a free module over k[xr, x~], hence k[V]G is
Cohen-Macaulay.
In the coprime characteristic case, all rings of invariants are Cohen-Macaulay
(see for example [2, Th. 4.3.6]):
Example 2.5. One of the simplest example of an invariant ring which is not
Cohen-Macaulay is given by the action of the cyclic group Z2 of order 2 on
k[V] = F2 [X1,X2,X3,Y1,Y2,Y3] by Xi ~ Yi ~ Xi. In fact, if k[V]G were Cohen-
Macaulay, then the sequence It = Xl + Y1, 12 = X2 + Y2, h = X3 + Y3 would have
to be regular, since it is the beginning of a homogeneous system of parameters.
But the relation
Ithh
o = It 12 h = u231t + u31h + u12h with Uij = XiYj + XjYi
Xl X2 X3
Proposition 2.5. (Kemper) Suppose that for an integerr > 0 the r-th cohomol-
ogy group Hr(G, k[V]) is nonzero, and that no element in G of order p =char(k)
is (r + I)-special (see page 273). Then k[V]G is not Cohen-Macaulay.
276 G. Kemper, G. Malle
Example 2.5 arises as an application of the theorem, since HI (Z2, IF'2) =f. o.
As a corollary, we obtain the following converse of Theorem 2.10.
Theorem 2.11. If G is a finite group and k a field such that for every finite
dimensional kG-module V the invariant ring k(v1G is Cohen-Macaulay, then the
characteristic of k is zero or coprime to the order of G.
For further examples, we take another look at the Table 2.4 of irreducible
reflection groups and ask if the rings of invariants are at least always Cohen-
Macaulay. As it turns out, this is not the case. The groups 6 n +2 appearing
in the table are the symmetric groups on n + 2 symbols with the following
representation: Let W = k n +2 be the natural permutation representation and
suppose that the characteristic p of k divides n + 2. Then the I-dimensional
invariant subspace WI = (Xl + ... + Xn+2) is contained in the n + I-dimensional
invariant subspace
3 Fields of Invariants
So far we have been looking at properties of polynomials f E k[V1 invariant
under a finite group G. Instead one could study invariant rational functions, Le.,
the field of invariants of G
which is equal to the field of fractions of k[V1G since G is finite. This will lead
us to Noether's problem and the notion of generic polynomials.
Invariant Rings and Fields of Finite Groups 277
worth noting that if we have a generic polynomial f for G over a Hilbertian field
k, then G occurs as a Galois group over k. An obvious example is the generic
polynomial f = xn + tn_lX n- l + ... + to for the symmetric group 6 n . Other
examples are polynomials for Artin-Schreier and Kummer extensions.
Now let G ::; GL(V) be a finite group such that the field ofinvariants k(V)G is
purely transcendental, generated by Sl, ... , Sn, say. Let B be a finite G-invariant
subset of V which generates V and define
f(X) := IT (X - v).
vEB
Then the coefficients of fare G-invariant, hence can be expressed in Sl, ... , Sn, so
f(X) = f(Sl, ... ,Sn,X) E k(Sl, ... ,Sn)[X]. Clearly, by construction, Gal(f) =
G. But we even have (see for example [10, Satz 1.7]):
Proposition 3.1. The polynomial f constructed above is generic for Gover k.
The first author has recently shown that the polynomial f has even the
following stronger property: if K :::: k is an infinite field and N / K a Galois
extension whose Galois group is a subgroup of G, then N is the splitting field of
f(al, ... , an, X) with suitable al, ... , an E K.
Example 3.1. Let G = GLn(q) in its natural representation on V = ~. Dickson
has shown that the ring of invariants is polynomial (see Table 2.4, generated by
the non-vanishing coefficients eo, ... , Cn-l of the polynomial
n
In particular, f as above is the generic polynomial for GLn(q) over IFq for the
subset B = V \ {a}. Note that all exponents of X in f are divisible by q - 1. The
polynomial f(Xl/(q-l) now has Galois group PGLn(q) (and is generic for those
PGLn(q)-extensions embeddable into a GLn(q)-extension). A simple argument
shows that the invariant eo is a (q - I)st power. Dickson shows that the ring of
invariants of SLn(q) is also polynomial, generated by u := ~/(q-l), Cl, ... , Cn-l.
Thus, considered as a polynomial over IFq [u, Cl, ... , Cn-l], f is generic for SLn (q)
over IFq and similarly as above we obtain a polynomial with group PSLn(q).
This example also shows why families of polynomials such as the ones found by
Abhyankar (see [1] and the papers cited there) have to exist.
Example 3.2. The previous example did not yield generic PSLn(q)-polynomials.
At least for n = 2 these can be obtained as follows. Let V = Sym2 (W) be
the symmetric square of the natural module W for SL2(q). Since the center of
SL 2(q) acts trivially on V, this defines a faithful representation of G = PSL2(q)
on V. It can be shown that G leaves invariant a quadratic form on V, and
is even isomorphic to the derived group 03(q) of the special orthogonal group
S03(V) with respect to this form. By a result of the first author [10, Satz 3.14]
the field of invariants k(V)03(q) is purely transcendental, and this yields generic
Invariant Rings and Fields of Finite Groups 279
polynomials for G = PSL2(q). In the cases q = 5,7 one obtains the following
two generic polynomials gq(tl' t2, X) E lFq (h, t2)[X] with two parameters:
9S(X) = X I2 - 2 u 2 X 8 + 2 (tl + 1)u4 X4 + 2t2 US X2 + (tl _ 2)2U 6
Notice that the classical groups GO~±)(q), GUn(q), SUn(q), and SPn(q) ap-
pear among the irreducible reflection groups (Table 2.4). Thus Theorem 3.1 to-
gether with Proposition 3.1 gives generic polynomials for these groups in defining
characteristic. For the proof, only those groups occurring in the right-hand col-
umn of Table 2.4 have to be considered. Most of them could be handled by the
following criterion (see for example [10]):
Proposition 3.2. (Kemper) Let /I, ... ,fn E k[V]G, n = dim(V), be homoge-
neous invariants such that I1?=1 deg(fi) < 21GI and the Jacobian determinant
det({)/;/{)Xj) is non-zero. Then k(V)G = k(/I, ... , fn) is purely transcendental.
No reduction principle of this sort exists for the polynomiality of the ring of
invariants. In fact, G does in general not act as a reflection group on V even if
it does on U :::; V. This illustrates how different the behavior of fields and rings
of invariants are: while the structure of rings of invariants very heavily depends
on the linear representation of the group, the field of invariants is in some sense
much less dependent on the representation.
It should be noted that one side effect of our case-by-case approach to the
proof of Theorem 3.1 is the fact that minimal bases for the fields of invariants
are explicitly known in each case (except for the symmetric groups 6 n +2 ), so
that generic polynomials can be computed from this at least in principle. (We
have the obvious generic polynomials for the symmetric groups.)
It would be interesting to know whether Theorem 3.1 continues to hold for
reducible reflection groups. In the extreme case of p-groups, where p is the char-
acteristic of k, we have the following result which is not restricted to reflection
groups:
See [18] for a proof. We see how different the behavior of rings and field s
of invariants can be by comparing to the examples of Nakajima [20] of p-groups
whose ring of invariants is not even Cohen-Macaulay.
References
1. A. S. ABHYANKAR, Projective polynomials, Proc. Amer. Math. Soc. 125 (1997),
1643-1650.
2. D.J. BENSON, Polynomial Invariants of Finite Groups, LMS Lecture Notes Series,
vol. 190, Cambridge University Press, Cambridge, 1993.
3. H. E. A. CAMPBELL, A. V. GERAMITA, 1. P. HUGHES, R. J. SHANK, D. L.
WEHLAU, Non-Cohen-Macaulay vector invariants and a Noether bound for a
Gorenstein ring of invariants, preprint, Kingston, Ontario 1996.
4. D. CARLISLE, P.H. KROPHOLLER, Rational invariants of certain orthogonal and
unitary groups, Bull. London Math. Soc. 24 (1992), 57-60.
5. C. CHEVALLEY, Invariants of finite groups generated by reflections, Amer. J. Math.
67 (1955), 778-782.
6. D. EISENBUD, Commutative Algebra with a View toward Algebraic Geometry,
Graduate Texts in Mathematics vol. 150, Springer, New York, 1995
7. E. FISCHER, Die Isomorphie der Invariantenkorper der endlichen Abelschen Grup-
pen linearer 'fransformationen, Nachr. Konigl. Ges. Wiss. Gottingen (1915),77-80.
8. N.L. GORDEEV, Finite linear groups whose algebras of invariants are complete
intersections, Math. USSR Izv. 28 (1987), 335-379.
9. V. KAC, K. WATANABE, Finite linear groups whose ring of invariants is a complete
intersection, Bull. Amer. Math. Soc. 6 (1982), 221-223.
10. G. KEMPER, Das Noethersche Problem und generische Polynome, Dissertation,
Universitiit Heidelberg, 1994.
11. G. KEMPER, Calculating invariant rings of finite groups over arbitrary fields, J.
Symbolic Computation 21 (1996), 351-366.
Invariant Rings and Fields of Finite Groups 281
Bernd Martin
1 Introduction
Consider the deformation functor of a geometric object. Under certain conditions
as given in, e.g., [6], it has a hull, i.e., a formal miniversal deformation. The
corresponding cohomology theory carries a graded Lie Algebra structure that
allows to recover by Massey products the complete local ring of the base space
of the formal miniversal deformation.
We are dealing with deformations of isolated singularities. They always have
an algebraic miniversal deformation, see [2]. Hence, if a degree by degree com-
putation of Massey products stops after a finite number of steps, the resulting
polynomial deformation represents a miniversal deformation. Although there is
no degree bound in general and the computation produces only a power series
representation, the procedure often finishes with a complete answer. For ex-
ample, in case of singularities with good (C* -action and only negative graded
infinitesimal deformations.
To find and to implement such an algorithm for computing the Massey prod-
ucts is one of the open problems in Computer Algebra listed by Eisenbud, see
[1] p. 367. We need a Computer Algebra System which offers standard basis
computations, resolutions in (complete) local rings and flexible programming
facilities. An implementation in SINGULAR [5] is explained in this paper. The
corresponding files can be found in deform.lib distributed with the SINGULAR
libraries. Because of a very high complexity of the procedures complete results
are not to expect by very complicated singularities. Nevertheless, a long list
of non-trivial examples is computable in acceptable time. This library contains
procedures that compute a versal family of a module over a local ring. For ex-
ample all versal deformations of maximal Cohen-Macaulay modules over the
E6-singularity are obtained.
For a detailed discussion of deformation theory and interesting applications,
cf. [9]. Moreover, MACAULAY-Scripts of J. Stevens for the cotangent cohomology
modules Tl(X O) and T2(XO) became an initial point for our implementation,
which includes zero characteristic and non-graded singularities. We shall not give
an abstract introduction of Massey products, cf. [3], but we shall explain how to
use and how to compute them. Here, a slightly different construction of Massey
products series is given, that allows to obtain in one step all Massey products of
a given degree. This implementation, is much more efficient than a one-to-one
translation of the Massey product calculus descriped in [4] and implemented in
the case of modules, cf. [8].
Acknowledgement: the author was supported by the Deutsche Forschungs-
gemeinschaft through the Schwerpunkt "Algorithmische Zahlentheorie und Al-
gebra".
f
o +-- Ox. +-- O n +-=-- Ok
n
r
+-- otn +--
r Oi
n
a a
ideal J E a'{T}, such that liftings F and R exist that fulfil F R == 0 mod J.
The main steps of the algorithm are:
1. Compute infinitesimal deformations F l , Rl and the obstruction space T2(XO)
(initial step),
2. compute the obstruction map and Massey products in degree d ~ 2,
3. determine the ideal Jd of obstructions up to degree d,
4. lift the equations and syzygies to the next degree,
5. test if we are done, otherwise go back to step 2 and replacing d by d + 1.
After the loop we obtain a versal deformation up to d-th order.
//----- T1 -------------------------------------------------------
def On = basering;
list resf = res(f,2);
matrix r = resf[2]; //columns are syzygies of f
matrix jacf jacob(f); //jacobi matrix of f
qring Ox std(f); //create local ring of x_a
matrix No transpose(fetch(On,r»;//ker(No) = normal bundle
matrix jacf fetch(On,jacf); //fetch to Ox-modules
list resN = res(No,3); //resolution of No
matrix Nx = resN[2];
matrix N1 resN[3];
matrix jac' lift(Nx,jacf); //lifting of jacobi matrix
matrix sbT1 std(N1,jac'); //SB representing T-1
int tau vdim(sbT1); //Tjurina number
matrix kbT1 = Nx*kbase(sbT1); //columns give a basis of
//infinitesimal deformations
4 Obstruction Calculus
(4.1) By the general theory of liftings, e.g. [3), it is known that T2(Xo) is the
obstruction space and the cup product u : Tl(XO) x T 1 (Xo) --t T2(XO) induces
an obstruction map. To any small extension of an Artin algebra A': 0 --t K --t
A --t A' --t 0, mAK = 0, an obstruction map OA,A' : Defxo (A') --t K 0T2(XO)
is defined, such that a deformation ~ E Defxo(A' ) is liftable to A if and only if
OA,AI(~) vanishes. Moreover, in case of A = <C[cl,c21/(ci,c~) and A' = A/clc2
we obtain OA,A'(~) = pri(~) Upr2(~), pri: A' -t A'/ci. We shall understand
by a small extension only those which do not increase the embedding dimension,
that is, K C m~.
(4.2) Recall a construction of T2 (Xo) and of the obstruction map.
Lemma 2. The matrix No, whose kernel induces the normal bundle N xo , defines
an OXo -module homomorphism into the dual of non-trivial syzygies S' .
Proof. The trivial syzygies are formed by the submodule of r generated by the
columns of the Koszul matrix of f:
11----- T2 -------------------------------------------------------
list resf = res(f,3)j
matrix r = resf[2]j
matrix kosf = koszul(f,2)j
matrix kos' lift(r,Kosf)j
matrix s = concat(kos',resf[3])j II represents non-trivial syz
qring Ox = std(f)j
matrix No = transpose(fetch(On,r))j
matrix So = transpose(fetch(On,s))j
list resS = res(So,3)j
matrix Sx = resS[2] j II Sx=ker(So) represents dual
matrix S1 = resS[3] j II of non-trivial syzygies
matrix sbT2 = std(Sl,lift(Sx,No))j II presents T~2
int tau' = vdim(sbT2)j II dimension of T~2
matrix kbT2 = resS[2]*kbase(sbT2)j II columns represent basis of
II T~2 in Ox~l-quotient
Note, both procedures Tl and T2 use the same resolution, hence they are put
together in one procedure T12 of the library sing. lib that gives back a list of such
objects used in further computations.
(4.4) Next we shall show that the obstruction for lifting a deformation by a
small extension lives in T2(XO)' A local Artin algebra (A, m) is called of order d,
if md =1= 0, but md +1 = O. We may restrict our considerations to small extensions
from order d-l to order d, so the kernel of an extension given by K = m d .
Lemma 3. Let A -t A' be a small extension from order d-l to order d,
FE Mat(l, k; A{X}) a represention over A of a deformation ~ E Def x.(A') and
REMat(k, l; A {X}) a lifting of syzygies, i. e. F R = 0 mod md , then we obtain
(il) F R mod j induces an element cup(F) := t flt P E m d ® O~o'
(i2) cup(F) belongs to md ® S', hence it defines a class 0(0 E md ® T2(Xo),
(iB) ~ has a lifting to A if and only if o(~) = O.
288 B. Martin
Note, that the map 0 : { f-t o({) is depending on the choice of the deformed
equations and of the syzygies over A', but not on their representatives with
coefficients from A.
A{XV ~A{X}k
(3)
A{X}k -4 A{X}
o
Computing Versal Deformations with SINGULAR 289
Note, the leading forms of m;d),j = 1 ... ,7', are uniquely determined and the
collection of its coefficients are called a d-order Massey product series.
Proof. The proposition is correct for d ::; 2. Let d 2:: 3 we obtain from the
assumtion the following equation
with a fixed set of representatives VI, ... , VT' of a basis of T2 (Xo). The obstruction
of lifting F(d-2) to (d-1)-st order is represented by t(F(d-2)R(d-2»). Consider
the maximal small extension of S(d-l):
t t
S(d) +-- B +--K
t t II
K +-- Wd +--K
Compute K as ideal generated by the coefficients of the obstruction with respect
to the fixed basis of T2(XO). We want to show that an ideal basis of K may be
represened by some liftings ofthe basis of Jd-l. Look at the unique repesentation
of the obstruction in Wd 0 T2(XO):
(5)
(6)
of type (4). Moreover, the relative (d-1)-jet with respect to T of (6) is just (4),
hence any representative hJd) is a lifting of hJd-l) automatically. The element
Fd and Rd from (6) are the correction terms of F(d-l) and R(d-l), respectively,
to obtain liftings to S(d) of the versal family and its syzygies. 0
Computing Versal Deformations with SINGULAR 291
(5.4) What is the connection between Massey product series and the original
algrithm from [3] and [7], where the Massey products of order d ~ 2 are sym-
metric multilinear and partially defined maps md : Tl(XO) x ... X Tl(XO) ... -+
T 2 (Xo) and m2 is the cup product?
Beginning with assumtion (5.3), determine a monomial basis Ed of Wd, called
a defining system for the d-th order Massey products. Ed is choosen with the
additional property that all monomials are multiples of monomials from the
previous step:
(7)
Because E2 = E~ is the set of all quadratic monomials, the elements of Ed
have all degree d. Any occuring monomial T OI E Ed corresponds to an element
(fOIl' ... , fOlJ E Tl(XO)$d and these elements span the subspace of definition of
md·
For any element TOI E Bd compute the obstruction 001 E T2(XO) for the small
si
extension <CTOI --+ d- l ) --+ S(d-l). Summing up all 001 • TOI and applying the
relations in Ed, we obain a representative of od(F(d-l), whose support belongs
to Ed. Its T-coefficients are used to define md. Representing od(F(d-l) with
respect to a given basis of T2(XO) we obtain Sed) as described in proposition 5.
Finally choose a monomial basis Ed C Ed of md S(d).
Note, that the support of this representation of od(F(d-l) and therefor the
supports of all obstruction equations h;d) fulfil property (7). Our Massey prod-
uct series constructed in (5.3) have this property, too, because the condition is
fulfilled automatically by using representaions of elements of quotient rings that
are in normal form with respect to a standard basis of the quotient ideal in a
fixed ordering, which must be local.
6 On the Implementation
(6.1) The first part of the procedure computes the infinitesimal versal family
F(l) and a representation of a basis of T2(XO) as explained in (3.4) and (4.3).
An infinitesimal lifting R(1) = r + Rl of the syzygy matrix r of f is obtained by
lifting -Flr by f. In the algorithm we need the ring extension ON = On@<C{T}
and its quotient rings Ox. @ <C {T} = 0/ f and OT @ On = ON / Jd. The last one
has to be replaced in every loop by the next approximation. The corresponding
objects in these rings are denoted by the same letters and have to be fetched
freely beteen the rings and after a modification.
(6.2) The main loop starts with d(~ 2) in the ring ON/f. From F(d-l) and
Re d - l ) we compute the obstruction 0d and its representation with respect to a
basis of T2(XO):
11---- compute obstruction --------------------------------------
obs = transpose(jet(Fd*Rd,d,rel_vec»;
II relative d-jet w.r.to T
obs = reduce(ideal(obs),std(maxT*Jd»;
II normal form w.r.t. max. small exten.
292 B. Martin
In the appendix of the library you will find a list of computable and non-
trivial examples including:
cones over generic points,
cones of rational normal curves,
cones over an elliptic curve,
- determinantal surface singularities,
- a cone over a del Pezzo surface, having a fat point as miniversal base (an
example of K. Altmann),
- several space curve singularities, which are always unobstructed,
- zero-dimensional sigularities.
References
1. Eisenbud, D.: Commutative Algebra with a View Toward Algebraic Geometry.
Springer Verlag (1996)
2. Grauert, H.: Uber die Deformationen isolierter analytischer Mengen. Inventiones
math. 15 (1972), pp. 171-198
3. Laudal, O.A.: Formal Moduli of Algebraic Structures. Lecture Notes in Mathematics
754, Springer Verlag (1979)
4. Laudal, O.A.: Matric Massey Products and Formal Moduli I. In Lecture Notes in
Mathematics 1183, Springer Verlag (1985), pp. 218-240
5. Greuel, G.-M., Pfister, G., Schonemann, H., et.al.: SINGULAR, a Computer Alge-
bra System for Singularity Theory, Algebraic Geometry and Commutative Algebra.
http://www.mathematik.uni-kl.de
6. Schlessinger, M.: Functors of Artin rings. Trans. Amer. Math. Soc. 130 (1968), pp.
208-222
7. Siqveland, A.: Matric Massey Products and Formal Moduli local and global. Thesis,
Univ. of Oslo (1995)
8. Siqveland, A.: Implementing the Algorithm for Computing Formal Moduli. Prep.
Series Math., Univ. of Oslo, No 1 (1997)
9. Stevens, J.:Deformations of Singularities. Habil. 1995, Univ. Hamburg
Part C
Thomas Siebert
1 Introduction
Given a set (!I, ... , Jp) of generators of a module M C Rn over a commu-
tative ring R, an important problem is to compute all relations (syzygies) of
(!I, ... , Jp ). Among others, syzygies provide a general description of any mod-
ule as factor of some free module. Moreover, iterations of syzygy computations
can be used to compute free resolutions of ideals and modules which is the base
for computations of homology and cohomology groups, of Betti numbers and of
various other invariants.
Thus, the computation of syzygies of ideals or modules is a basic tool for
computational approaches to algebraic geometry and singularity theory.
In recent years, several methods were developed to expedite computations of
syzygies. In this paper, we give an overview of the essential ideas behind these
methods, and of our practical experience with their realization.
Main contributions to that development have come from W. Trinks, G.
Zacharias, D.A. Spear, F.O. Schreyer, M. Stillman and D. Mumford, T. Mora
and H.M. Moeller. Recent improvements to compute minimal resolutions have
been contributed by C. Traverso, M. Caboara [11), R. La Scala [6), by myself
[10) and A. Capani, G. DeDominicis, G. Niesi, L. Robbiano [3).
The paper is organized as follows: in Section 2, the two basic algorithms
for computing syzygies are presented. Section 3 discusses various improvements
of these algorithms. Section 4 summarizes these improvements into a sugges-
tion of a hybrid algorithm for computing syzygies. Finally, Section 5 reports of
our implementation of the discussed methods in SINGULAR which includes an
evaluation of comparative timings.
Along the way (in subsection 2.2), we prove the previously unestablished fact
that the so-called STZS-algorithm applies not only to polynomial rings but also
to quotients of polynomial rings.
Acknowledgments: I should like to thank G. Pfister for the many fruitful
discussions about this work and O. Bachmann for proofreading the manuscript.
This work has in part been supported by the Deutsche Forschungsgemeinschaft
(DFG).
I
A I
A' I 0
I
I
1 0 0 ... 0 ===> - - -- (1)
o 1 0 '" 0 I
I
o0 1 B' I G'
0 I
o 0 ... 0 1 I
After a standard basis computation, we obtain a matrix like the right one: A'
corresponds to the standard basis of Aj B' is the transformation matrix, i.e.,
A' = A * B' j and G' is a basis of the syzygy module of A. In fact, every syzygy
constitutes a certain way to obtain 0 as linear combination of the given genera-
tors. In this way, we find a basis of all syzygies. Otherwise a syzygy which is not
included produces elements with new leading terms.
The algorithm can be implemented in a straight-forward way as illustrated
by the following pseudo-code:
Algorithms for Free Resolutions 297
W' := W
S' := S
WHILE searchUnit(W',i,j) DO
W' := Gauss(W' ,i,j)
S' := S'\ {sd
END DO
In the procedure inter Reduce we subtract multiples of elements from others
as long as any leading term divides another one. The inter-reduction of the
input should always precede the adding of new module components. Otherwise
the resulting syzygy module would have a much more complicated structure.
The procedure set8yzygyOrder sets a block ordering for the module com-
ponents to separate the standard basis part from the syzygy part, so that all
298 T. Siebert
S:-lj-l S:-lj+1
S:+1j-l S~+Ii+l
Notice that after each Gaussian elimination all the remaining columns are mul-
tiplied with the unit Ul. This implies that the length of all polynomials grows at
least by the number II = length(Ul) of monomials of Ul. Of course, if the chosen
monomial ordering is a well-ordering then units are just constants (with length
1). Hence, there is no growth. However, for local orderings units are polynomials
in the non-global variables. Hence, the lengths of the polynomials growth during
n Gaussian elimination is given by
where Ii = length( Ui) is the number of monomials in the ith unit counted at the
begin of the minimization. To decrease this growth, the Bareiss algorithm might
be used which, on the other hand, requires additional polynomial divisions.
Our experience agrees with this observation: in the non-global case it is often
the minimization which is the really hard computational part, whereas in the
global case it is usually the standard basis computation.
For gi,gj, having the leading term in the same component, that is L(gi) =
x""ek,L(gj) = x"'iek we consider spoly(gi,gj) := mjigi - mijgj with mji
c(gj) Icm(L(:~,' L(9;)) .
Because S is a standard basis we obtain ([5], Corollary 1.11)
Let ker(K[x]q -t K[xjT, E Wiei I-t E wigi) denote the module of syzygies,
syz(I), of {gl,'" ,gq}.
The STZS-algorithm is based on the following proposition which was proved
independently by F.O. Schreyer, W. Trinks, G. Zacharias and D.A. Spear be-
tween 1977 and 1980:
W:= { }
T := initSTZS(S)
L:= ini tPairs (T)
WHILE L <> { } DO
P := (a, b, s) the last element from L
L := L \ {(a, b, s)}
s := spoly(a,b)
h := NF(sIT)
W := W U {h}
END DO
W := cancelWeightMonomials(W).
T := initSTZS(S)
INPUT: S = {gl,'" ,gq}, a standard basis
OUTPUT: T = ([gl,lm(gl),O, ... ,0], ... , [gq,O, ... ,O,lm(gq)]}
300 T. Siebert
i := r; T := { }
S := reOrderDegRevLex(S)
WHILE S <> { } DO
s := the first element from S
i .= i+i
S := S \ {s}
s := s + lm(s)ei
T := T U {s}
END DO
setSyzygyOrder(r)
In the procedure initPairs we compute the leading term of the syzygy for
all admissible pairs of elements of T and apply to these initial terms the criteria
which follows from the divisibility check (statement (2) of proposition 1).
Since the monomial-weighted orderings are implemented by simply multi-
plying with the leading terms, we have to divide out these monomials from the
whole resolution at the end of the computation. This is done in the procedure
cancelW eightM onomials.
Notice that the regularity (see 3.1) can not be applied in the STZS-algorithm,
since a STZS-resolution is not a minimal one.
The main advantage of this algorithm is that the s-polynomials always reduce
to zero and hence the sets of elements and of pairs do not change during the
computation. It also allows us to consider only pairs built from one fixed element
at the same time which results in much smaller pair sets. The knowledge of the
leading terms of the syzygies is used to avoid useless pairs: syzygy having a
leading term divisible by the leading term of another syzygy can be canceled.
The run-time behaviour of the STZS-algorithm is dominated by monomial
comparisons. Using the definition of <1, each monomial comparison requires two
additional monomial multiplications to the base ordering <. However, if the base
ordering is such that the exponents dominate the module component then we
can avoid these additional multiplications:
Let F = {J} be a basis of a module, and let F be the transformation of F
obtained by multiplying all monomials of the ith component with the monomial
lm(gi). Then the following holds:
Lemma 1. F is a standard basis of a module w.r.t. the monomial-weighted or-
dering <1 if and only if F is a standard basis in the given ordering <.
Proof. Points to observe are that the map assigning i to the module compo-
nent of lm(gi) has to be monotone, i.e., it must not disturb the ordering of the
monomials and that for monomial-weighted orderings the comparison of expo-
nents precedes that of module components. Therefore, the Lemma applies only
to these orderings.
r(I) :$ r(in(I)),
9 =NF(J,F)
INPUT: F = {gl, ... , gn} generators of the given ideal, f an s-polynomial
OUTPUT: the normal form of f with respect to F
exchanged:=FALSE
while S:={f i E FI1t(f i ) Ilt(g)} <> { }
i f (3 fi E input: It(fi)=c*lt(g)) and not exchanged
h:=fi
fi:= g
input:=input\ {fi}
exchanged: =TRUE
else choose h E S w.r.t. the strategy
g:=spoly(g,h)
if exchanged
if g E syz(input)
g:=O
else
renew (g)
F:=F\ {filfi was reduced with h}
In the case of (quasi-)homogeneous input it is easy to organize the algorithm
in such a way that no conflicts result from the use of the generator in different
degrees: simply compute degree by degree.
A somewhat more sophisticated strategy is necessary for local computations.
We compute again degree by degree with respect to the leading terms. However,
if the degree changes, we use a lazy strategy. That is, we continue with the other
pairs of same degree before going to higher degreees. To our experience direct
reductions have to precede all other kinds of reductions, even with bad ecarts.
standard basis (and not the whole resolution). Of course, the algorithm applies
also to modules.
For an (quasi-)homogeneous input, the algorithm of LaScala produces all
information about the minimal resolution inside the full STZS-resolution:
H we simply count the minimal generators we obtain the graded Betti numbers
without any additional computation and it is possible to take out the minimal
resolution in a very efficient way.
We have implemented LaScala's algorithm for computing the standard basis
of the input and the resolution simultaneously as follows:
=
Let S be a basis of I S;; K[xr E;=l K[x]ei.
W. := StLaS(S)
INPUT: S, the basis of a module
OUTPUT: W., the modules of syzygies of S
W:= { }
L := initPairs(S)
WHILE LDext : = choseNextPairs (L) <> { } DO
WDev := { }
FOR (a, b, s) IN Lnext DO
(red,syz) : =reduce ( (a, b,s) ,W ,WDev )
WDev:=WDevU {red,syz}
END DO
L : = updatePairs (L, W, WDe.)
W. : =W.U WDev
END DO
W. := cancelWeightMonomials(W.)
W. := minimize(W.,L) //optional
Remark 1. To compute the exact Hilbert function ofthe ith module in the min-
imal resolution it is not necessary to know all generators. It suffices to know
the number and degree of a minimal set of generators. However, we have to be
careful while counting explicit generators. They may vanish during the Gaussian
elimination or by cancellations of components for syzygies which are deleted!
Theorem 1. By resolving the initial module (or ideal) of the input and by de-
tecting the direct reductions in SYZi-i (M) we obtain the Hilbert function Hi of
the module Syzi(M). The Hilbert functions Hi(Syzj) for j > i are upper bounds
for the Hj.
This number is called the dth discriminant of M. It counts the number of critical
pairs as the Hilbert function does in the ordinary standard basis computation.
From the left-hand side of the formula we see that it is determined by the
(d - I)-truncated computation.
This algorithm represents an elegant way to put all the different algorithms to
compute standard bases of ideals or modules and of their syzygies in a common
hull. For that purpose, we consider the algebra k[;r)[y] as an infinite module over
k[;r] and handle all computations within k[;r)[y] by interpreting module compo-
nents, membership in minimal systems of generators and other information as
new variables. Of course, this information does not admit algebraic structures
Algorithms for Free Resolutions 307
and, hence, can not be interpreted like a "real" variable. Thus, the normal form
procedure has to be rewritten to distinguish "variables" of different types. How-
ever, the main advantage we obtain is an "automatic" treatment of additional
information. For a detailed description see [11].
(F',SyzF)=computeFirstSyzygies(F)
S'=resolve(In(F'))
hilb.=computeHilbertFunctions(S')
L=buildPairs (SYZF)
i=1
WHILE ( L <> { } )
308 T. Siebert
- The naive algorithm is implemented for all types of SINGULAR rings (global,
local or quotient). It always uses the given ordering for its computations.
That is, even though SINGULAR-orderings dp and Is are usually far superior
for computations of resolutions with this algorithm, no ordering changes are
performed.
- The STZS-algorithm is implemented for all types of SINGULAR rings, as
well, using the monomial-weighted-orderings based on the given ones. The
optimal orderings are (c, dp) and (c, Is) .
Algorithms for Free Resolutions 309
The timings compare the standard algorithm res, the STZS-algorithm STZS,
the LaScala-algorithm with- (LaScala\_min) and without (LaScala) minimiza-
tion, and for a SINGULAR independent comparison the plain resolution com-
mand of MACAULAY-2 mac which is more or less algorithmically identical with
LaScala.
Comparisons with CoCoA are not included since no current and documented
version was available. As some of the examples are very extensive (random cubics
and quintics) we do not give a complete list of the examples. They can be
downloaded (as well as SINGULAR itself) via anonymous ftp from
ftp://www.mathematik.uni-kl.de:/pub/Math/Singular
There are files test...syz.mac and test...syz.sing containing a MACAULAY and a
SINGULAR version of the tested rings and ideals. All examples are global and
homogeneous.
The timings were obtained on a AMD K6 with 200Mhz and 64MB RAM
running under Linux, kernel version 2.0.32. Timings smaller than 0.5 seconds
are set to zero.
310 T. Siebert
References
1. O. Bachmann and H. Schonemann, Monomial representations for Grabner bases
computations, Preprint (1998), Center for Computer Algebra, University of Kaiser-
slautern.
2. D. Bayer and D. Mumford, What can be computed in algebraic geometry?, Pro-
ceedings of Computational algebraic geometry and commutative algebra Cortona
1991, Cambridge, (1993), 1-48.
3. A.Capani, G. DeDominicis, G. Niesi, L. Robbiano, Computing minimal finite free
resolution, Preprint (1996), Universita di Genova, 1-15.
4. P. Gianni, T. Mora, L. Robbiano and C. Traverso, Hilbert functions and Buch-
berger algorithm, Preprint, (1994).
5. H. Grassmann, G.M. Greuel, B. Martin, W. Neumann, G. Pfister, Th. Siebert, W.
Pohl and H. Schoenemann: Standard bases, syzygies and their implementation in
SINGULAR, Preprint 251 (1994), University Kaiserslautern.
6. R. LaScala and M. Stillman, Strategies for computing minimal free resolutions, to
appear in J. Symb. Compo
7. H. M. Moller and T. Mora, New constructive methods in classical ideal theory, J.
of Algebra 100, 138-178 (1986).
8. H. M. Moller and T. Mora, Computational aspects of reduction strategies to con-
struct resolution of monomial ideal, ???
9. F.O. Schreyer: Die Berechnung von Syzygien mit dem verallgemeinerten Weier-
strasschen Divisionssatz. Diplomarbeit, Hamburg (1980).
10. Th. Siebert, On the strategy and implementations for computations of free resolu-
tions, Preprint 8 (1996), Centre for Computeralgebra, University Kaiserslautern.
11. C. Traverso and M. Caboara, The powerful Buchberger algorithm for modules,
Preprint Pisa, (1996).
Computational Aspects of the Isomorphism
Problem
1 The Questions
R. Brauer posed in his famous lectures on modern mathematics 1963 more than
40 problems, questions and conjectures about the representation theory of finite
groups, cf. [10]. These questions have had a big influence on the development
of the theory since then. Most of the problems may be summarized under the
following central question.
Question. Which properties of a finite group G are determined by its rep-
resentations.
One basic task is to find information about a finite group G additional to its
character table in order to determine G up to isomorphism. Even R. Brauer indi-
cated that the integral group ring 7!..G is in this direction the canonical candidate.
This is in fact a topic which was considered for the first time in G. Higman's
thesis 1939 [57, p.100]. The question is the following.
(IP) Let G and H be groups. Assume that 7!..G and 7!..H are isomorphic as
rings. Does it follow that G and H are isomorphic as groups.
It is a result of G. Glauberman that finite groups G and H with isomorphic
integral group rings 7!..G and 7!..H have the same character table [22, (3.17)]. Thus
the isomorphism problem fits well into the context of Brauer's problems. On the
other hand it is a first question on the group of units U(7!..G) of a group ring.
Denote by V(7!..G) the subgroup of U(7!..G) consisting of units of augmentation
1. If G is finite then the finite subgroups of maximal order of V(7!..G have just
the order of G. Such subgroups have also the property that they from a 7!.. -
basis of 7!..G and thus they are called group basis of 7!..G. A positive solution to
the isomorphism problem shows that group bases of 7!..G are up to isomorphism
unique.
An important result on IP is the following recently announced counterexam-
ple of M. Hertweck.
Theorem 1. f19} There are non - isomorphic groups of order
97 28 x 221
with isomorphic integral group rings.
In a first moment one might think that the construction of such huge groups
certainly involves the use of computers. A second thought indicates however
that the order of the groups may be already too big in order to check their
isomorphism by a simple test with GAP. In fact Martin Hertweck's construction
is purely theoretical. It is along the lines developed for the isomorphism problem
within the last ten years and we discuss this in the sequel. Our main aim in this
article however will be to exhibit computational methods which may be used to
get results on (IP) and related questions.
An important remark which has to be made at this point is that the answer
to (IP) is not as negative as one might think. In fact it is justified to say that
(IP) almost has a positive answer and this may be summarized in the following
result.
Theorem 2. A finite group G is determined up to isomorphism by the category
of finitely generated ZX - modules, where X is the semi-direct product of G with
lFpG, p a prime not dividing IGI.
Theorem 2 is mainly a consequence of the following Theorem 3 which is
according to a remark in [46] a result due to L. Scott, see also [61], [45]. We like
to mention that the original proof as sketched in [45] had to be revised in the
case when p = 2, cf. [20]. The following holds with respect to a prime p.
Theorem 3. Let G be a finite group. Assume that F*(G) is a p - group. Then
group bases of ZG are conjugate within QG.
The content of Theorem 3 may be also given by saying that the second
Zassenhaus conjecture (ZC 2) is valid. The conjectures of Zassenhaus concerning
the structure of torsion subgroups of V(ZG) are as follows [64, p.497-98]
1. (ZC 1) Let u be a unit of finite order of V(ZG). Then u is conjugate within
QG to an element of G.
2. (ZC 2) Let H be a subgroup of V(ZG) with the same order as G. Then H
is conjugate to G by a unit of QG.
3. (ZC 3) Let U be a finite subgroup of V(ZG). Then U is conjugate within
QG to a subgroup of G.
While as the integral isomorphism problem may nowadays be regarded as
solved the so - called modular isomorphism problem is still wide open.
(MIP) Let G be a finite p - group. Assume that lFpG and IFpH are isomorphic
as algebras. Does it follow that G and H are isomorphic?
This is clearly a finite problem and may be attacked by computer programs.
In fact the problem may be reduced algorithmically to linear algebra over IFp,
which is of course always a desirable aim for computer applications.
In Section 3 we shall have a closer look at (MIP). Section 4 deals with simple
and almost simple groups and variations of the Zassenhaus conjectures. Finally in
Section 5 we discuss briefly related questions about Hecke algebras and Burnside
rings.
Our first section however will be devoted to the study of automorphisms of
groups in connection with group rings and vice versa. Several aspects of this
played a major role in the development of the whole theory.
Computational Aspects of the Isomorphism Problem 315
NV(SG)(G) = centre(V(SG)) x G.
Assume that u E V(SG) normalizes the group basis G. Then conjugation
by u induces a group automorphism of G and obviously the normalizer problem
may be rephrased in the following way.
TheoreUl 4. fB7} Let H be a group and let (3 E AutH. Denote by Coo the
infinite cyclic group generated by c. Let c act on H as (3 and form via this action
the semi-direct product H{3 := H >4{3Coo . Then
Denote by Z"'(G) the intersection of the local rings Zp,p E 7r(G). In [52J it
was shown that cpz~(G) is in general not injective. Moreover the semi-local ring
Z"'(G) may be replaced by a suitable ring of algebraic integers of an algebraic
number field. 1 The structure of the groups G is as follows. G has minimal
normal subgroups Ml and M2 of order p2, q2 resp. p and q are different odd
primes. H = G /(M1 • M2 ) has order 27 and is a pullback of the following form.
1 M. Hertweck however pointed out that for the examples given in [52]l/>z is injective,
cf. [19].
316 F.M. Bleher, W. Kimmerle, K.W. Roggenkamp, M. Wursthorn
II r r
1 - Cs - - H - Qs x (0") - 1
Here Cs denotes as usual a cyclic group of order 8. Let a be a generator of
Cs . Qs is the quaternion group of order 8. HI is the iterated semi-direct product
obtained by the action of Aut (Cs ) ~ C 2 x C 2 on Cs and the action of 0" on
Cs ~ Aut (Cs ). 0" is a conjugacy class preserving automorphism of Cs ~ Aut Cs
which is not inner. Note that 0" is unique up to inner automorphism and has
order 2.
Let R = Z,,(G). Using Clifford theory and Frohlich's exact sequence of Picard
groups the construction of an automorphism a of G which becomes inner on RG
may be carried out by the following computational process.
lying in the same equivalence class using the main algorithm implemented in
SISYPHOS.
The following theorem lists the group theoretical invariants used in the case
of groups of order 27 :
Theorem 7. For a p-group G the following objects are determined by FpG
1. G 1if>(G) {[15}}.
2. The center (G) of G and GIG' {[67}, [63}).
3. The sections Mi(G)IMi+1 (G), Mi(G)IMi+2(G) and Mi(G)IM2i+1 (G) for
all i ;::: 1 {[38}, U3}}.
4· Gh2(G)P'Y3(G) ([58}).
5. The numbers of conjugacy classes of maximal elementary abelian subgroups
of given rank {f42}}.
6. The number of conjugacy classes of G and the number of conjugacy classes
of all pn_th powers of elements of G ([71}).
7. Ei=l, ... ,t logp I
ICa(gi) / if>(Ca(gi)) (Roggenkamp, see [71}).
Here Mi(G) denotes the i-th term of the Brauer-Jennings-Zassenhaus series,
defined recursively by Ml(G) = G,Mn(G) = [Mn-1(G),Gj.Min/p)' where (nip)
is the smallest integer satisfying (nip)· p ;::: n. T = {gl, ... ,gt} is a set of
representatives of the conjugacy classes of G
It should be noted that invariant 3. usually does not distinguish any more groups
that share all of the other invariants. Omitting invariant 5. on the other hand
leads to approximately twice as many equivalent classes. Nevertheless this addi-
tional classes have also been checked on the computer.
On a modern PC (200MHz) the CPU time needed to test one pair of groups dif-
fers from a few seconds to several hours, the sizes of the automorphism groups
Aut (IFpG / Ia) can be as large as 2192 . With the old version of the program
and older hardware several pairs could not be checked at all. For these groups
a different test was used: Let Hi ::; Z(G i ), i = 1,2 be a subgroup such that
Gd HI ~ G 2/ H 2. Compute the number of epimorphisms from lFpG i to IFpGi/ Hi
and compare these numbers. SISYPHOS can carry out also this kind of computa-
tions.
The ample amount of data produced by the tests, especially the possibility to
compute Aut (IFpG) completely rendered by the new version of SISYPHOS leads
to several interesting questions, the answers to which would give new insight
into the structure of lFpG.
Question 1. 1. Let n be the smallest integer such that M n+1 (G) = 1. Is it true
that in this case
JF G/I(2n+1) ci!.JF H/I(2n+1)? (1)
p G r PH·
Does (1) hold under the additional assumptions that Mn(G) ~ Mn(H) is
cyclic and G/Mn(G) ~ H/Mn(H)?
2. For a E Aut (lFpG) let a be the induced automorphism on IFpG / lb. Does
there exist a group automorphism (J" E Aut (G), such that the induced auto-
morphism (J"' E Aut(IFpG) satisfies
(2)
Let IG be the augmentation ideal of IFpG, i.e. the kernel of c: IFpG -+ IF with
lFpG :3 E9EG a(g)g f-t E9EG a(g). Note that IG = rad(IFpG), IFpG has a filtra-
tion
Define
IFpG ~ IFpG/l m
J~f r/l m ,
320 F.M. Bleher, W. Kimmerle, K.W. Roggenkamp, M. Wursthorn
- 7r -
0 - - J-lFpG-lFpGIJ - 0
II 3e?!
- 7r -
e!
0 - - J-lFpG -lFpGIJ- 0
Here the subscript e indicates that the modules are considered as G-modules
via e.
Proposition 1. Let L{I be the set of all liftings of e : lFpH -4 lFpG 1 J modulo
conjugation with elements of 1 + J and let eo :
lFpH -4 lFpG be a fixed lifting,
then
1. Le = eo
+ H~(lFpH, J).
2. If J = 10110+1 then the set of allliftings of e modulo inner automorphisms
e
of lFpG is given by L~ = + H~ (lFpH, J) /Im 8•.
3. H~ (lFp G, J) corresponds to liftings of the identity lFp G 1 J --t lFp G 1 J
4. G~+1 = {e' I e' = a, 0: E Out(G), a == id mod J} is a subspace of
H:(lFpG, J)
5. The obstruction of e to be liftable is an element of H;(lFpH, J). e is liftable
if this element is actually in B;(lFpH, J).
This proposition can be used to construct a generating system for Aut (lFpG),
namely by constructing a stabilizer chain for Aut (lFpG) consisting of the sub-
groups En = {e E Aut (lFpG) I e == id mod la}· Then Enl En+1 is elementary
abelian and can be parameterized by a subspace of 101/0+1. Moreover, by the
proposition, G~+1 is a subgroup of Enl En+1 as is the set of inner automorphisms
(modulo 10+1) that are trivial modulo 10 . Both sets can be precomputed. This
means that the backtrack search to compute En can be restricted to a comple-
ment of the associated subspaces within 10110+1.
Computational Aspects of the Isomorphism Problem 321
4 Simple Groups
In this section we want to look at the conjecture (ZC 2) and some variations of it
with respect to simple and almost simple groups. By [49] the conjecture (ZC 2)
is not true in general, but there exist metabelian counterexamples. Nevertheless,
it is still an open question whether (ZC 2) holds for the chief factors of a finite
group. Note that this will be the case if (ZC 2) is valid for all simple groups. On
the other hand it is an obvious task to develop variations of (ZC 2) which hold
for wider classes of groups and which still provide a positive answer to (IP).
By Noether-Skolem, the conjecture (ZC 2) is equivalent to the statement:
There exists a group isomorphism T : X -t Y whose Z-linear extension preserves
all class sums of ZG.
If p is a rational prime then we call the following modification the p-variation
of (ZC 2).
(ZCp ) For any two group bases X, Y of ZG, there exists a group isomorphism
T : X -t Y whose Z-linear extension preserves all class sums of ZG corresponding
to p-elements.
Note that the p-variation still implies that (IP) has a positive solution. If
one wants to drop this implication the p-variation represents just a Sylow-like
statement and may be phrased as follows.
(p-ZC) If X is a finite subgroup of V(ZG) then the Sylow p-subgroups of
X are conjugate to a subgroup of G.
In case that X is a group basis and G is a finite soluble group, this vari-
ation has been verified in [28]. Further investigations are due to Juriaans and
Dokuchaev, see [16]. Note that so far no counterexample to (p-ZC) is known.
If (IP) has a positive solution then (ZC 2) is equivalent to the following
description of the group Aut n (ZG) of augmentation preserving ring automor-
phisms of ZG.
(AUT) Every U E Aut n (ZG) can be written as u = aT where a is the
Z-linear extension of a group automorphism of G and T E Aut n (ZG) preserves
all class sums of ZG.
For semi-simple groups and their automorphism groups, (IP) is valid by [27].
Thus, in particular for simple groups, the structure of ring automorphisms of
ZG can be used to study the group of units V(ZG).
(AUT) is first reformulated as a criterion which only uses the ordinary char-
acter table CT( G) and its character table automorphisms. A character table
automorphism of CT( G) is a permutation 7r which operates on the rows and on
the columns of CT(G) such that 7r(X)(C) = X(7r(C)) for all conjugacy classes C
and for all ordinary irreducible characters X. Since U E Aut n (ZG) permutes the
central primitive idempotents of m and since u induces a so-called class sum
correspondence, which means that u maps class sums to class sums, u induces
a character table automorphism of CT(G). Thus we get the following criterion:
Proposition 2. (AUT) is valid for G if and only if for all u E Aut" (ZG) there
exists a group automorphism a of G such that u and a induce the same character
table automorphism of eT(G).
322 F.M. Bleher, W. Kimmerle, K.W. Roggenkamp, M. Wursthorn
Studying the character tables of finite simple groups which are given in the
Atlas of finite groups shows that many character table automorphisms come from
Galois automorphisms. Such automorphisms can be avoided if one considers
instead of CG the rational group algebra IQ'G. This leads to the following Q-
variation of (ZC 2).
(ZCQ) For any two group bases X, Y of ZG, there exists a group isomor-
phism 7 : X ~ Y whose Z-linear extension fixes all blocks of the Wedderburn
decomposition of IQ'G.
(ZCQ) is usually easier to establish than (ZC 2). A typical example is the
case of the alternating groups. Here the proof of (ZCQ) [24] is analogous to the
proof of (ZC 2) for symmetric groups [41]. However, it seems to be difficult to
establish whether (ZC 2) is valid for alternating groups. So far (ZC 2) is known
to hold for degrees n :::; 10 and n = 12, see [4].
For the verification of (ZC 2), it is in most of the cases not enough to look
only at CT(G} to show that the character table automorphisms which are not
induced by Aut G are also not induced by Aut n (ZG). Thus we have to find
further obstructions to analyze this criterion. The main tool which is used here
is modular representation theory, since this often reflects better the connection
between the conjugacy classes than ordinary representation theory does.
Let now (K, R, k) be a p-modular system with K sufficiently large relative
to G. Then a E Aut n (ZG) induces an augmentation preserving K-algebra auto-
morphism of KG and also an augmentation preserving k-algebra automorphism
of kG, which we both denote again by a. Thus a permutes the blocks of kG, and
since a is augmentation preserving, it fixes the principal block Bo of kG. There-
fore a induces an autoequivalence of the category Bo-mod of finitely generated
Bo-modules. In case that Bo has cyclic defect groups, a fixes the isomorphism
class of every finitely generated Bo-module [8, Theorem 3.6]. For other blocks
with cyclic defect the same holds provided a fixes at least one simple module
in this block. For related results concerning conjugacy of cyclic defect groups in
the unit group of blocks see [62] and [34]. For blocks with non-cyclic defect, we
do not get such a strong result. However, a induces an operation on the decom-
position matrices and on the socle series of projective indecomposable modules.
These observations show that (ZC 2) is valid for 16 of the 26 sporadic simple
groups:
Theorem 8. The conjecture (ZC 2) is valid for the sporadic simple groups M l l ,
M 12 , M 22 , M 23 , M 24 , COl, CO 2 , C03 , J1 , h, J3 , H'S, H'N, Th, B, Ru.
With respect to the variations of (ZC 2) we get the following result:
Theorem 9. (i) The conjecture (ZCQ) is valid for all sporadic simple groups
and for all their automorphism groups.
(ii) The conjecture (ZCp ) is valid for all p for all sporadic simple groups and
their automorphism groups which are not isomorphic to McL, McL.2, Fi23,
J4 , Ly.
Theorem 8 is mainly proven in [4] and can also be found together with The-
orem 9 in [9]. The proofs use the character tables and decomposition matrices
Computational Aspects of the Isomorphism Problem 323
provided by the computer algebra system GAP [60]. For the Janko group J3
we also use the socle series of the projective indecomposable modules modulo 3
which were computed by K. Lux [35].
For finite simple groups of Lie type, different methods have to be developed
for the examination of (ZC 2). Here the description of the simple kG-modules
as tensor products [66], when char(k) is the defining characteristic, plays the
most important role. We have the following action of (J E Aut n (ZG) on tensor
products of characters:
Lemma 1. [5, Prop. 2.1] Let eand ( be two ordinary or two Brauer characters,
respectively. Then
The proof uses mainly the fact that (J also induces a character table auto-
morphism of the p-modular character table of G.
The strategy to prove the validity of (AUT) for some series of finite groups G of
Lie type is now the following: Let p be the defining characteristic of G.
Strategy:
(A) Show that for every (J E Aut n (ZG) there exists a E Aut G such that (J and a
induce the same character table automorphism on the p-modular character
table of G. Then (J operates as a on all ordinary irreducible characters which
are non-exceptional for p. Note that a character X is called exceptional for p,
if there exists another character X' such that X and X' have the same values
on the p-regular conjugacy classes of G.
(B) Look at the action of (J on the ordinary irreducible characters which are
exceptional for p.
For step (B), we use the generic ordinary character tables of CHEVIE [17]
to determine the exceptional characters.
The strategy leads to the following result:
Theorem 10. [5,6] The conjecture (ZC 2) is valid for the finite groups of Lie
type
SL(2, q), PSL(2, q), SL(3, q), PSL(3, q), SU(3, q2),
PSU(3, q2),Sp(4, q), PSp(4, q), G 2(q), 3 D4(q3),
2B2 (2 2m +1 ), 2G 2 (3 2m +l), 2F4(22m+l),
5 Related Topics
For integral group rings of finite Coxeter groups the following holds.
Theorem 12. [7, Theorem 1.6} (ZC 2) is valid for finite Coxeter groups W.
Note that the special case when W is the symmetric group of degree n has
been proved by G. Peterson and the case for Coxeter groups of type B is covered
by the results in [18].
Let (W,8) be a Coxeter system. Then the generic Iwahori - Hecke algebra
H can be viewed as a deformation of the integral group ring of W over the ring
A = Z[v, V-I] of Laurent polynomials in one indeterminate. The algebra H is a
free A-module with basis {Tw I w E W}, and the multiplication is given by the
rules
is a rational integer for each pair ([U], [W]) with [W] :::; [V.] for some [V] < [U].
Here 1L denotes the Mobius function of the subgroup lattice of G and U. and V.
are representatives of T[U] and T[V] respectively.
The answer also gives an algorithm for the calculation of Aut (il(G». As
observed by M. Wursthorn this may be easier done via the following algorithm.
Let gh(G) = I1[U]EV(O) Z[ul denote the ghost ring of il(G). Using fixed point
vectors il(G) can be viewed as a Z-suborder of gh(G), the embedding is just
326 F.M. Bleher, W. Kimmerle, K.W. Roggenkamp, M. Wursthorn
n(G) ~ n(G)
r
gh(G)
7r
-+
r
gh(G).
T·A=T·P, (3)
which has to been solved for a given permutation 7r E Sn. The properties (pI),
(p2) and (p3) allow to divide the elements of V(G) into equivalence classes which
must be stabilized by 7r. Making use of this fact one can compute a stabilizer
chain for Aut (n(G)) and thus find a generating set for this group.
References
1. C. Baginski, The isomorphism question for modular group algebras of met acyclic
p-groups, Proc. Amer. Math. Soc. 104 (1988), 39-42.
2. C. Baginski and A. Caranti, The modular group algebras of p-groups of maximal
class, Canad. J. Math. 40 (1988), 1422-1435.
3. F. M. Bleher, Zassenhaus Vermutung und einfache Gruppen, Diplomarbeit, Uni-
versitat Stuttgart, 1993.
4. F. M. Bleher, Automorphismen von Gruppenringen und Blocktheorie, Dissertation,
Universitat Stuttgart, 1995.
5. F. M. Bleher, Tensor Products and a Conjecture of Zassenhaus, Arch. Math.
(Basel) 64 (1995), 289-298.
6. F. M. Bleher, Finite groups of Lie type of small rank, Pacific J. Math., to appear.
7. F. M. Bleher, M. Geck and W. Kimmerle, Automorphisms of generic Iwahori-Hecke
algebras and integral group rings of finite Coxeter groups, J. Algebra 197 (1997),
615-655.
8. F. M. Bleher, G. Hiss and W. Kimmerle, Autoequivalences of Blocks and a Con-
jecture of Zassenhaus, J. Pure Appl. Algebra 103 (1995), 23-43.
9. F. M. Bleher and W. Kimmerle, On the structure of integral group rings of sporadic
groups, Preprint.
10. R. Brauer, Representations of Finite Groups, Lectures on Modern Mathematics,
Vol. I, 133-175, J. Wiley, 1963.
Reprinted in R. Brauer, Collected papers Vol. II, 183-225, MIT Press, Cambridge
MA,1980.
11. J. F. Carlson, Periodic modules over modular group algebras, J. London Math.
Soc. 15 (1977), 431-436.
12. J. H. Conway, R. T. Curtis, S. P. Norton, R. A. Parker and R. A. Wilson, Atlas
of finite groups, Oxford University Press, London/New York, 1985.
Computational Aspects of the Isomorphism Problem 327
13. E. C. Dade, Deux groupes finis distincts ayant la meme algebre de groupe sur tout
corps, Math. Z. 119 (1971), 345-348.
14. W. E. Deskins, Finite abelian groups with isomorphic group algebras, Duke Math.
J. 23 (1956), 34-40.
15. E. M. Dieckmann, Isomorphism of group algebras of p-groups, Ph.D. thesis, Wash-
ington University, St. Louis, Missouri, 1967.
16. M. A. Dokuchaev and S. O. Juriaans, Finite subgroups in integral group rings,
Canad. J. Math., to appear.
17. M. Geck, G. Hiss, F. Liibeck, G. Malle and G. Pfeiffer, CHEVIE-Generic Char-
acter Tables of Finite Groups of Lie Type, Hecke Algebras and Weyl Groups, IWR
Preprint 93-62, Heidelberg, 1993.
18. A. Giambruno, S. K. Sehgal and A. Valenti, Automorphisms of the integral group
rings of some wreath products, Comm. Algebra 19 (1991), 519-534.
19. M. Hertweck, A solution of the isomorphism problem, Preprint, 1997.
20. M. Hertweck and W. Kimmerle, On the F"-Theorem, to appear in: Groups Bath
- St. Andrews 1997.
21. G. Higman, The units of group rings, Proc. London Math. Soc. 46 (1940), 231-248.
22. 1. M. Isaacs, Character Theory of Finite Groups, Academic Press, New York, 1976.
23. C. Jansen, K. Lux, R. A. Parker, R. A. Wilson, An Atlas of Brauer Characters,
London Math. Soc. Monographs, New Series 11, Oxford University Press, Oxford,
1995.
24. W. Kimmerle, Beitrage zur ganzzahligen Darstellungstheorie endlicher Gruppen,
Bayreuther Mathematische Schriften 36 (1991), 1-139.
25. W. Kimmerle, Variations of the Zassenhaus conjecture, in: [51], DMV-Seminar 18,
Birkhauser 1992, 117-124.
26. W. Kimmerle, On Automorphisms of ZG and the Zassenhaus Conjectures, Pro-
ceedings of the ICRA VII, Cocoyoc, Mexico, CMS Conference Proceedings Vo!' 18
(1996), 383-397.
27. W. Kimmerle, R. Lyons, R. Sandling and D. Teague, Composition factors from the
group ring and Artin's Theorem on orders of simple groups, Proc. London Math.
Soc. 60 (1990), 89-122.
28. W. Kimmerle and K. W. Roggenkamp, Projective limits of group rings, J. Pure
App!. Algebra 88 (1993), 119-142.
29. W. Kimmerle and K. W. Roggenkamp, A Sylow-like Theorem for Integral Group
Rings of Finite Solvable Groups, Arch. Math. (Basel) 60 (1993), 1-6.
30. W. Kimmerle and K. W. Roggenkamp, Automorphisms of Burnside rings, Groups
Galway - St. Andrews 1993 Vo!.2, London Math. Soc. Lect. Note Series 212 (1995),
333-351.
31. W. Kimmerle and R. Sandling, The Determination of Abelian Hall Subgroups by
a Conjugacy Class Structure, Pub!. Mat., Barc. 36 (1992), 685-691.
32. W. Kimmerle and R. Sandling, Group Theoretic and Group Ring Theoretic Deter-
mination of Certain Sylow and Hall Subgroups and the Resolution of a Question
of R. Brauer, J. Algebra 171 (1995), 329-346.
33. L. Klingler, Construction of a counterexample to a conjecture of Zassenhaus,
Comm. Algebra 19 (1991), 2303-2330.
34. M. Linckelmann, The isomorphism problem for cyclic blocks and their source al-
gebras, Invent. Math. 125 (1996), 265-283.
35. K. Lux, Algorithmic Methods in Modular Representation Theory, Habilitationss-
chrift, Aachen, 1997.
36. A. Makasikis, Sur l'isomorphie d'algebres de groupes sur un champ modulaire,
Bull. Soc. Math. Belg. 28 (1976), 91-109.
328 F.M. Bleher, W. Kimmerle, KW. Roggenkamp, M. Wursthorn
37. M. Mazur, On the isomorphism problem for integral group rings of infinite groups,
Expo. Math. 13 No.5 (1995), 433-445.
38. I. B. S. Passi and S. K Sehgal, Isomorphism of modular group algebras, Math. Z.
129 (1972), 65-73.
39. D. S. Passman, Isomorphic Groups and Group Rings, Pacific J. Math. 35 (1965),
561-583.
40. D. S. Passman, The group algebras of groups of order p4 over a modular field,
Michigan Math. J. 12 (1965), 405-415.
41. G. L. Peterson, Automorphisms of the integral group ring of Sn, Proceedings AMS,
Vol. 59, Nr. 1 (1976), 14-18.
42. D. Quillen, The spectrum of an equivariant cohomology ring: I, Ann. of Math. (2)
94 (1971), 549-572.
43. J. Ritter and S. K Sehgal, Isomorphism of group rings, Arch. Math. (Basel) 40
(1983), 32-39.
44. K. W. Roggenkamp, Observations to a conjecture of H. Zassenhaus, Groups St.
Andrews 1989, Vol. 2, London Math. Soc. Lecture Note Series 160 (1991), 427-444.
45. K W. Roggenkamp, The Isomorphism Problem for Integral Group Rings of Finite
Groups, Proceedings of the Int. Congress of Mathematicians, Kyoto 1990, Springer
1991, 369-380.
46. K W. Roggenkamp, Problems on Group Rings, to appear in the Proc. of the Ring
Theory Conf. Miskolc 1996.
47. K W. Roggenkamp and L. L. Scott, Isomorphisms of p-adic group rings, Ann. of
Math. (2) 126 (1987), 593-647.
48. K W. Roggenkamp and L. L. Scott, A strong answer to the isomorphism problem
for finite p-solvable groups with a normal p-subgroup containing its centralizer,
manuscript, 1987.
49. K. W. Roggenkamp and L. L. Scott, A counterexample to a conjecture of H. Zassen-
haus, manuscript, 1987.
50. K W. Roggenkamp and L. L. Scott, Automorphisms and non abelian cohomology:
An algorithm, Linear Algebra Appl. 192 (1993), 355-382.
51. K. W. Roggenkamp and M. Taylor, Group rings and class groups, DMV-Seminar
18, Birkhiiuser, Basel Boston, Berlin 1992.
52. K. W. Roggenkamp and A. Zimmermann, Outer group automorphisms may be-
come inner in their integral group ring, J. Pure Appl. Algebra 103 (1995), 91-99.
53. K. W. Roggenkamp and A. Zimmermann, A counterexample for the isomorphism
problem of polycyclic groups, J. Pure Appl. Algebra 103 (1995), 101-103.
54. A. I. Saksonov, On the group ring of finite groups, I, Publ. Math. Debrecen 18
(1971), 187-209.
55. M. A. M. Salim and R. Sandling, The modular group algebra problem for groups
of order p5, J. Austral. Math. Soc. 61 (1996), 229-237.
56. M. A. M. Salim and R. Sandling, The modular group algebra problem for small
p-groups of maximal class, Canad. J. Math. 48 (1996), 1064-1078.
57. R. Sandling, Graham Higman's thesis "Units in group rings", in: Integral repre-
sentations and Applications, Lect. Notes in Math. 882, Springer 1981, 93-116.
58. R. Sandling, The modular group algebra of a central-elementary-by-abelian p-
group, Arch. Math. (Basel) 52 (1989), 22-27.
59. R. Sandling, The modular group algebra problem for metacyclic p-groups, Proc.
Amer. Math. Soc. 124 (1996), 1347-1350.
60. M. SchOnert et al., GAP - Groups, Algorithms, and Programming, Lehrstuhl D
fUr Mathematik, Rheinisch Westf31ische Technische Hochschule, Aachen, Germany,
first ed., 1992.
Computational Aspects of the Isomorphism Problem 329
61. L. L. Scott, Recent progress on the isomorphism problem, Proc. Symposia in Pure
Math. Vol. 47 (1987), 259-274.
62. L. L. Scott, Defect groups and the isomorphism problem, Representations lineaires
des groupes finis, Proc. Colloq. Luminy, France, 1988; Asterisque 181-182 (1990),
257-262.
63. S. K. Sehgal, On the isomorphism of group algebras, Math. Z. 95 (1967), 71-75.
64. S. K. Sehgal, Torsion Units in Group Rings; Proceedings Nato Institute on Methods
in Ring Theory, Antwerp, D. Reidel, Dordrecht, 1983, 497-504.
65. S. K. Sehgal, Units in Integral Group Rings, Longman, Harlow, 1993.
66. R. Steinberg, Representations of algebraic groups, Nagoya Math. J. 22 (1963),
33-56.
67. H. N. Ward, Some results on the group algebra of a group over a prime field, Sem-
inar on Finite Groups and Related Topics, Mimeographed notes, Harvard Univ.,
1960, pp. 13-19.
68. A. Weiss, p-adic rigidity of p-torsion, Ann. of Math. (2) 127 (1987), 317-332.
69. A. Weiss, Torsion units in integral group rings, J. reine angew. Math. 415 (1991),
175-187.
70. M. Wursthorn, Die modularen Gruppenringe der Gruppen der Ordnung 26 , Diplo-
marbeit, Universitat Stuttgart, 1990.
71. M. Wursthorn, Isomorphisms of modular group algebras: An algorithm and its
application to groups of order 26 , J. Symb. Comput. 15 (1993), 211-227.
Representations of Heeke Algebras and Finite
Groups of Lie Type
1 Mathematisches Institut B
Universitat Stuttgart
Postfach 801140
70550 Stuttgart
Germany
2 U.F.R. de Mathematiques de
l'Universite Paris 7
2 Place Jussieu
75251 Paris Cedex 05
France
3 Lehrstuhl D fiir Mathematik
RWTH Aachen
Templergraben 64
52062 Aachen
Germany
4 IWR der Universitat Heidelberg
1m Neuenheimer Feld 368
69120 Heidelberg
Germany
1 Introduction
The aim of this article is to lead the reader on a journey through the representa-
tion theory of finite groups of Lie type and Hecke algebras. We will present some
basic results obtained in recent years, explain the ideas behind them, and give
lots of examples; proofs are usually omitted but we provide explicit references
to an extensive bibliography.
Let G be a finite group of Lie type. Examples are G = GLn(q), where q
is a prime power, or any finite classical group. Hecke algebras arise naturally
as endomorphism algebras of certain induced modules for G. This shows their
importance for studying representations of G. By a result of Iwahori [82], they
can be obtained as deformations of group algebras of finite Coxeter groups. Here,
the standard example is the symmetric group 6 n .
The group G is defined over a finite field, whose characteristic we call the
defining characteristic of G. The case of representations over an algebraically
closed field of characteristic 0 is relatively well-understood. The irreducible rep-
resentations are classified by Lusztig [95], and Lusztig's subsequent work on
character sheaves has provided the framework in which the problem of comput-
ing the corresponding character values can be attacked; see Shoji [117].
2 Harish-Chandra Philosophy
2.1 Finite Groups with BN-Pairs
Many important results of Harish-Chandra theory can be formulated and proved
within the framework of groups with split BN-pairs. Thus let G be a finite group
with a split BN-pair of characteristic p, satisfying the commutator relations
(cf. [16, §§ 65,69]). Without further notice we shall use the letters B and N
to denote the Borel subgroup and the monomial subgroup of G, respectively.
We also fix an integral domain R and work in the category mod-RG of finitely
generated right RG-modules.
A proof of this theorem in the special case that R is a field of characteristic 0, was
given by Deligne (see Lusztig and Spaltenstein [97]). In the general case it was
proved independently by Dipper and Du [22] and by Howlett and Lehrer [81].
In the applications we have in mind, the assumption of Theorem 2.1 is always
satisfied. In these cases we may, and will, omit the subscript P from the notation
for Harish-Chandra induction and restriction.
Let us consider a simple example which shows that the assumption on the
characteristic of R cannot be omitted. Let G = SL 2 (P), where p is odd. For B
and N we take the upper triangular matrices and the monomial matrices in G,
respectively. Let R = k denote the prime field of characteristic p. Then the
group L of diagonal matrices in G is isomorphic to the multiplicative group of k.
Let M denote the I-dimensional kL-module affording this k-representation of L.
Then if B' denotes the subgroup of lower triangular matrices in G, the two kG-
modules Rt,B(M) = Ind~(Infif(M)) and Rt,B,(M) = Ind~,(Infif' (M)) are
not isomorphic. To see this, observe that B' = s-1 Bs where s is an element
in N \ L. Hence Ind~,(Infif' (M)) ~ Ind~(Infif(MS)). But Infif(MS) is not
isomorphic to Infif (M) and so, by Green correspondence, the induced modules
are not isomorphic either.
Definition 2.1. Suppose that p is invertible in R. An RG-module M is called
cuspidal, irR~(M) = {o} for all Levi subgroups L properly contained in G.
Define a partial order on K by (L, X) ::; (L', X') if and only if L ::; L' and X is
isomorphic to a composition factor of *Rf (X').
Then a pair (L, X) E K is minimal with respect to this partial ordering, if
and only if X is a cuspidal simple kL-module. In this case, the Harish-Chandra
series corresponding to (L, X) is the set of simple kG-modules M such that
(L,X)::; (G,M).
If M is a simple kG-module, there is a unique (up to conjugacy and isomor-
phism) minimal pair (L,X) with (L,X)::; (G,M).
(b) Let (L, X) be a minimal pair in K. Then the set of isomorphism classes of
simple kG-modules in the head of Rj;(X) equals the set of isomorphism classes
of simple kG-modules in the socle of Rj;(X). This set of simple kG-modules
equals the Harish-Chandra series corresponding to (L, X).
(c) Let (L, X) be a minimal pair in K and let M be a simple kG-module not
lying in the Harish-Chandra series corresponding to (L, X).
If M is a composition factor of Rj;(X) , the semisimple vertex of M' strictly
contains L. If P(M) is a direct summand of Rj;(P(X)), then the semisimple
vertex of M is strictly contained in L.
It may in particular happen that, in the notation of the above remark, some
composition factor of Rj;(X) is cuspidal, even is L is strictly contained in G.
Simple kG-modules which do not arise in this way are called supercuspidal.
W(L,X) = R(L,X)C(L,X)
Representations of Heeke Algebras and Finite Groups of Lie Type 337
with normal subgroup R(L, X) and complement C(L, X) such that R(L, X) is a
Coxeter group and the following hold:
(a) 1-l(L,X) is a C(L,X) graded k-algebra, i.e., there are subspaces 1-lx
of1-l(L,X), x E C(L,X), such that
1-l(L,X) = E9 1-lx,
XEC(L,X)
1-l1 is a subalgebra of 1-l(L, X) and 1-lx 'Hy = 'Hxy for all x, y E C(L, X).
(b) 'H 1 is an Iwahori-Hecke algebra associated to the Coxeter group R(L,X),
with multiplication possibly twisted by a 2-cocycle.
(c) 'H(L, X) is a symmetric k-algebra.
For the definition of an Iwahori-Hecke algebra associated to a Coxeter system,
we refer the reader to Section 3.2.
Special cases of Theorem 2.3 for char(k) = 0 were proved by Iwahori, Harish-
Chandra, Lusztig and Kilmoyer. The general case was settled by Howlett and
Lehrer in characteristic 0 [80] and, building on their work, by Geck, Hiss and
Malle [56, Section 3] in positive characteristic. The fact that 'H(L, X) is sym-
metric was proved in [53].
If Rt(X) has an indecomposable direct summand with multiplicity 1, or if X
can be extended to its inertia subgroup in (NG(L) n N)L, then the 2-cocycle
appearing in Theorem 2.3(c) is trivial (see [56, Corollary 3.13]). In this case we
call 'H(L, X) an extended Iwahori-Hecke algebra associated to W(L, X).
In Section 4.7 we shall give some examples for the types and parameters of
Iwahori-Hecke algebras occurring in groups of Lie type as endomorphism algebras
of Harish-Chandra induced cuspidal modules.
IGI(x) = xN IT Pd
d
between the character groups of LF and G F , adjoined to each other with respect
to the usual scalar product of characters. In the case that L is contained in an F-
Rr
stable parabolic subgroup of G this is just ordinary Harish-Chandra induction
and restriction. In the general case is defined as an alternating trace on
£-adic cohomology groups of a certain algebraic variety attached to L and G
(see for example [15]). (In fact, the definition also depends on the choice of a
Rr
satisfying 'ljJ~ 0 R~ =: o'ljJ~ for all q (see [9, Ths. 1.26 and 1.33]). In this sense,
the set of unipotent characters and the functors Rr and "R~ are generic for a
series of groups of Lie type.
Therefore, and because the unipotent characters play a prominent role in
Lusztig's classification of the irreducible characters of G F (see Section 2.3), we
will now restrict our attention to the subset Uch(GF) C Irr(G F ), respectively
to its generic version Uch(G). In this case we have the following characterization
of d-cuspidality (see [9, Prop. 2.9]):
Proposition 2.1. A unipotent character, E Uch(G) is d-cuspidal if and only
if Deg(f)d =: IGBB Id.
340 R. Dipper, M. Geck, G. Hiss, G. Malle
Here, we write fd for the Pd-part of f E Qfx]. Also, G•• denotes the semisim-
pie quotient of G, i.e., the generic finite reductive group attached to G /Rad( G).
A pair (M, /1-) consisting of a d-split Levi subgroup M of G and a unipotent
character /1- E Uch(M) is called d-split. It is called d-cuspidal if moreover /1- is
d-cuspidal. We introduce the following relation on the set of d-split pairs:
Definition 2.5. Let (Ml' /1-1), (M2' /1-2) be d-split. We say that (Ml, /1-d $d (M2' /1-2)
if Ml is a d-split Levi subgroup of M2 and /1-2 occurs in R~ (/1-1).
For a d-cuspidal pair (1L, A) ofG we write Uch(G, (1L, A)) for the set of-y E Uch(G)
such that (1L, A) $d (G, 'Y)' We call Uch(G, (1L, A)) the d-Harish Chandra series
above (1L, A) because of the following fundamental result, which shows that for
any d we obtain a generalized Harish-Chandra theory:
This was shown in special cases by Schewe [115] and Fong/Srinivasan [40].
RM
G
0
1M
(L,>.) = IG(L,>.) 0 I n dWG(IL,>.)
WM(IL,>.)·
Here M runs over the d-split Levi subgroups of G and (1L, >.) over the set of
d-cuspidal pairs of M.
This result was proved in [9, Theorem 3.2]. It seems to indicate that the
constituents of Rf(A) can be parameterized by the irreducible characters of
Representations of Heeke Algebras and Finite Groups of Lie Type 341
where the union now is over semisimple i-elements t in the centralizer CG* (s)F*
of s. The following result of Broue and Michel [10, Theoreme 2.2] shows a certain
compatibility between Lusztig series and i-blocks:
Theorem 2.8. (Broue/Michel) Each set &l(G F , s) is a union 0/ i-blocks o/G F .
The description of arbitrary i-blocks is analogous to that given for unipo-
the prime i have to be made, and the result is phrased in terms of Rr,
tent i-blocks in the previous section, although at present some restrictions on
whose
decomposition is not yet known in general. Let e be a positive integer. In exten-
sion of our above definitions we say that (L, A) is e-cuspidal if L is e-split and
A E Irr(LF) is e-cuspidal. We let «e be the transitive closure of the relation :::;e
on e-split pairs introduced above. As an example for the type of results which
can be obtained we cite:
Representations of Heeke Algebras and Finite Groups of Lie Type 343
3 Heeke Algebras
In Section 2.1 we described the endomorphism algebra of a Harish-Chandra
induced cuspidal representation of a finite group G of Lie type. These endomor-
phism rings carry a lot of information on representations of G. We shall make
this connection now more precise by defining certain functors which tie together
representations of G and those of the Hecke algebras in Theorem 2.3. This mo-
tivates to study the representations of Hecke algebras abstractly. In subsequent
sections we present some results on Iwahori-Hecke algebras associated with real
(and in view of Section 2.2 with complex) reflection groups.
The theorem is an easy consequence of Fittings lemma and was shown first
in [19, 4.7 and 4.10]. It is applied in the following situation. For a finite group
G = G F of Lie type we take a regular cuspidal representation CK of some Levi
subgroup L of G. The term regular means here, that CK occurs as constituent
of the Gelfand-Gmev representation rk of L. This is a special representation,
Representations of Heeke Algebras and Finite Groups of Lie Type 345
which is multiplicity free and is induced from the unipotent radical of any Borel
subgroup of L (see e.g. [15] for details). In particular r15 is a projective OL-
lattice, where the characteristic of K is now zero and that of k is a prime i
different from the describing characteristic p of G.
We recall the notion of Lusztig series introduced in Section 2.3. Thus cor-
responding to the character of CK we have a representative s of a semisimple
conjugacy class of the dual group L * of L. Since all irreducible direct summands
of rf( occur with multiplicity one, there is up to isomorphisms precisely one
OG-Iattice Co(s) affording this cuspidal representation such that Co(s) is an
epimorphic image of r15. Among other things this implies that r15contains a pro-
jective cover Q ofCo(s) as direct summand, the head of Ck(S) is simple occurring
with multiplicity one as composition factor of Ck(S), and EndkL(Ck(S)) = k. We
Harish-Chandra induce the epimorphism from Q onto Co(s) to get a surjective
map f30 : Po ~ Mo(s), where Po = Ry(Q) is a projective OG-Iattice and
Mo(s) = RY(Co(s)). As in Section 2.1, we set W(L) to be the relative Weyl
group of L in G. Note that W(L) acts on the conjugacy classes of semisimple
elements of L. For any tEL *, denote the stabilizer of the L * -conjugacy class
containing t in W(L*) by W(L,t). Note that, if R = K or if R = k and Ck(S)
is irreducible, W(L, s) is isomorphic to the group W(L, CR(S)) of Theorem 2.3.
In [24,4.10] it was shown that W(L, s) is a real reflection group provided that
the center of G is connected.
W(L,s) = W(L,s').
This theorem was proved in [24, 4.6] for connected groups with connected center
and [24, Part II, Theorem 1] in full generality. As a consequence of Theorem
3.1 we obtain that the decomposition matrix of 1l0(W(L, s)) is part of the
i-decomposition matrix of G.
The results of the previous section show that part of the decomposition numbers
of finite groups of Lie type can be obtained from those of Iwahori-Hecke algebras.
Therefore, we now briefly survey some features of the representation theory of
Iwahori-Hecke algebras associated with reflection groups.
Let (W, S) be a finite Coxeter system, i.e., W is a finite group and S is a
subset of W such that we have a presentation of W with .
generators: s E S,
relations: (st)m •• = 1 for s, t E S,
346 R. Dipper, M. Geck, G. Hiss, G. Malle
where (mst) is a matrix of integers with mss = 1 and mts = mst > 1 for 8 =I t.
The matrices (mst) such that the above presentation defines a finite group are
classified. It turns out that the irreducible Coxeter groups fall into several infinite
series and a number of exceptional types, which are conveniently described by
the following diagrams:
)w-
An
n;:::1
1
• • •
2 3
-.n
En 1 2 3 n
Dn
n;:::4 -.n
• • •
4
n;:::2 -.
1m 2 1 2 3 4
• • • •
4
12(m) F4 1 3 4 5 6
------- E6
• • • •
H3 1
• • •
52 3
H4 1 5
• • • •
2 3 4
12
1 3 4 5 6 7 1 3 4 5 6 7 8
E7
• • • • • Es • • • • • •
12 12
(where m ::::: 5 in type 12(m».
The vertices are in bijection with S, and two vertices 8 =I t are joined by an
edge if mst > 2 (Le., 8, t do not commute with each other). An edge is labelled
by the integer mst unless we have mst = 3 in which case we omit this label. For
example, the diagram of type A n - 1 defines the symmetric group on {I, ... , n},
generated by the basic transpositions (1,2), .. , , (n - 1, n).
An important tool in studying W is the length function l: W -+ No: given
w E W we can write w = 81 ... 8 n with 8i E S. If such an expression is of shortest
possible length, we call it a reduced expression and let lew) := n.
The Iwahori-Hecke algebra associated with (W, S) is defined as a deformation
of the group algebra of W over a commutative ring R:
generators: T w , wE W,
relations: T w = T. t •.. T' n if w = 81 ... Sn with Si E S is reduced,
T; = q.T1 + (qs - I)Ts for 8 E S.
Iwahori-Hecke algebra associated with (W, S); we shall denote it by 1-l. Given
any commutative ring R and elements {q. I S E S} ~ R as above, we have a
canonical isomorphism
for all 8 E S.
Secondly, it implies that the Gram matrix of the bilinear form 1-l x 1-l --+ Ao
defined by
(Tw, T w') f-t coefficient of T1 in T wT w',
is invertible over A o, and hence 1-l is a symmetric algebra.
The generic algebra and its character table. In view of our applications to
groups of Lie type, we will now only consider the case where W is a finite Weyl
group. Then it is known that Q is a splitting field for W. In order to obtain
a similar statement for 1-l, we consider the ring A = Z[U;1/2 I 8 E S] ;2 Ao
and let K be its field of fractions. It follows from results of Benson/Curtis and
Lusztig (see [4], [94] and also [49]) that the algebra 1-lK = K ®A o 1-l is split
semisimple. Hence, by Tits' Deformation Theorem (see [16, §68A]) , we have
in fact 1-lK ~ KW. In particular, there is a bijection between the irreducible
characters of 1-lK and those of W. Explicitly, such a bijection is given as follows:
since the structure constants of 1-lK with respect to the basis {Tw I w E W} lie
in A o, a general argument shows that if X is an irreducible character of 1-lK then
X(Tw) lies in the integral closure of Ao in K, i.e., we have
for all w E W.
Now consider the ring homomorphism 8: A --+ Q, U!/2 f-t 1 (8 E S). Then Tits'
Deformation Theorem states more precisely that if X is an irreducible character
of 1-lK then the map w f-t 8(X(Tw)) (w E W) defines an irreducible character of
W, and this gives the required bijection.
We now wish to define the character table of 1-lK as a square matrix of
character values X(Tw), where X runs over the irreducible characters of1-lK and
w E W runs over a set of representatives of the conjugacy classes of W. However,
simple examples show that we may have X(Tw) =1= X(Tw') even if w, WI E Ware
conjugate. The question of a good choice of representatives is solved as follows.
Theorem 3.3. (Geck/Pfeiffer [60]) Let CI(W) be the set of conjugacy classes
of W. Let C E Cl(W) and w, WI E C. Then, if w, WI have minimal length in C,
the basis elements T wand T w' are conjugate in 1-l.
348 R. Dipper, M. Geck, G. Hiss, G. Malle
Although Theorems 3.3 and 3.4 are formulated in general terms, their proofs
depend on a ease-by-case analysis. The first theorem depends on a strong form
of conjugation in the underlying Weyl group, while the second theorem follows
from a result about taking powers of suitable elements in the associated Artin-
Tits braid groups. These results in turn were proved by inductive methods for
the classical types An, En, Dn and were verified by explicit computations for the
exceptional types using CHEVIE.
Remark 3.1. Theorems 3.3 and 3.4 remain correct for arbitrary finite Coxeter
groups, i.e., also for non-Weyl groups, see [54,59]. The character tables of the
Iwahori-Hecke algebras of the non-crystallographic types 12(m), H 3 , H4 are con-
structed from explicit matrix representations as computed by Kilmoyer /Solomon
[16, §67C], Lusztig [94] and Alvis/Lusztig [1], respectively.
YJ = 2: (-I)I(w)ind(Tw)-lTw E 1-lR,
wEWJ
Note that YR e! R0Yo, since YJ1-lo is pure in 1-l0. Moreover StJ,K = YJR!i/ (IK)
is a K LJ-submodule of MK isomorphic to the Steinberg module of LJ, which
occurs with multiplicity one in the Gelfand-Graev representation rJ,K of LJ.
Consequently there is a unique OLrlattice StJ,O in StJ,K which is an epimorphic
image of rJ,O, which hence contains a projective cover of StJ,o. We set NJ,o =
RtJ(StJ,o) and
NR = R0 EB NJ,o,
J~S
be noted too that for J ~ S the OG-sublattice NJ,o of Mo contains yJMo and
is contained in the purification YJMKnMO of the latter, but is not equal to one
of these in general.
The R-algebra End llR (YR ) is called the q-Schur algebra associated with 1{R
(of parabolic type) and it is denoted by SR = SR(W) adding the parameters as
additional indices if needed.
In [29] Dipper and James introduced q-Schur algebras associated with Weyl
groups of type A and observed that their representations are closely related with
those of finite general linear groups, (compare Section 4.3). Subsequently it was
discovered by Dipper and Donkin that these come up as well as duals of the
sub-coalgebra of homogeneous parts in the quantum monoid of (n x n)-matrices,
for details see [21].
In [53] and [68] Geck, Gruber and Hiss defined q-Schur algebras (somewhat
smaller than the q-Schur algebras of parabolic type introduced above) for Weyl
groups of types Band D, and used them to relate the decomposition matrices
of classical groups with those of general linear groups (see Section 4.4).
Parshall and Wang showed in [108] that q-Schur algebras. of type A are quasi
hereditary. This is not always true for other types. In [37] Du, Parshall and
Scott found that those satisfy the weaker condition to be stratified algebras. For
further results see e.g. [64] and [114].
In [39] Du and Scott and independently in [32] Dipper, James, and Mathas
constructed quasi hereditary q-Schur algebras of type B, called q-Schu-? algebras
in [39] and (Q, q) -Schur algebras in [32] by using not only q-analogues of right
ideals YJ1{ coming from parabolic subgroups of W, but from arbitrary reflection
subgroups as well.
In [33] Dipper, James, and Mathas generalized this further by construct-
ing cyclotomic q-Schur algebras which are associated with Ariki-Koike algebras.
These are deformations of the group algebra of certain complex reflection groups,
which are wreath products of a cyclic group of order r by the symmetric group
6 n . For r = lone obtains the Iwahori-Hecke algebra of type An and for r = 2 of
type Bn. Thus the previous results appear as special cases of this construction.
In [65] R. Green used similar ideas to construct q-Schur algebras associated
with affine Hecke algebras of type A.
Using the quantum group approach Oehms defined a different generalized
q-Schur algebra of type B, called symplectic q-Schur algebra in his disserta-
tion [104]. The Brauer algebra acts on tensor space centralizing the action of
symplectic and orthogonal groups. A q-deformation of the Brauer algebra is
the Birman-Murakami-Wenzl algebra which acts on tensor space as well. Using
a generalization of the FRT-construction, which was introduced by Faddeev,
Reshetikhin und Takhtadjin as a tool to produce quantum groups, the symplec-
tic quantized monoid is defined as centralizer coalgebra of this action. This is a
bialgebra graded by subcoalgebras of finite rank, whose duals are defined to be
the symplectic q-Schur algebras.
352 R. Dipper, M. Geck, G. Hiss, G. Malle
1 ... )
· 1 ..
( · . 1. , respectively.
· .. 1
· . 11
If f> 3, then De,l is always the identity matrix of the appropriate size. These re-
sults are easily proved by explicit computations using the matrix representations
given in [16, §67C).
The above example seems to indicate that most adjustment matrices are in
fact identity matrices. A general argument (which is given for Iwahori-Hecke
algebras in [46, Prop. 5.5)) shows that this is indeed the case for all sufficiently
large primes f. The term "sufficiently large" is specified in the following conjec-
ture, which was first stated by James [86, §4] for Iwahori-Hecke algebras and
q-Schur algebras of type A n - 1 and then formulated for Iwahori-Hecke algebras
of any type in [46, (5.6)].
Conjecture 3.1. If 1£ is the generic Iwahori-Hecke algebra associated with a finite
Weyl group, then the adjustment matrix De,l is the identity matrix if f does not
divide the order of W.
Probably, in the case of q-Schur algebras and Iwahori-Hecke algebras of type
A n - 1 , this result will even be true under the strengthened condition that ee > n.
The conjecture is established in [46] for blocks of defect 1, where the decomposi-
tion matrix is given in terms of a Brauer tree, similarly as for blocks of defect 1
in finite groups.
For explicit results on Iwahori-Hecke algebras of classical type, see Sec-
tion 3.5. The known results for exceptional types are summarized as follows:
- For type G 2 with equal parameters, see Example 3.2. (The case of unequal
parameters is easily treated by similar methods.)
- The matrices De are completely determined for types F 4 , E6 and E7j see
[57], [47], [50], and [100]. Partial results on the matrices De for type Es
are obtained by J. Muller [100]. The techniques used include R. Parker's
MEATAXE [107] and S. Linton's VECTOR-ENUMERATOR [93]. In [101],
the analogous matrices for the non crystallographic types 12 (m), H 3 , H4 are
determined.
- The above conjecture is established by explicit computation of all possible
decomposition matrices for types F4 (with equal parameters) and E 6 , see [57]
and [47], respectively. K. Bremke [5] has considered those cases of unequal
parameters in type F4 which actually occur in groups of Lie type.
In the case of Iwahori-Hecke algebras of exceptional type or of type An, En, Dn
(n odd) the following result can be extracted from the above explicit results and
the results in [26,34,106].
354 R. Dipper, M. Geck, G. Hiss, G. Malle
Theorem 3.8. Let A be a partition of n. Then N>' and R>' are two sided ideals
of 11..
The Specht (or standard) module S>. is now the submodule of 11./ R>' gen-
erated by z>. = x>. + R>'/R>' E ll/R\ where x>. := LWEW, Tw. On S>. we
have an ll-invariant bilinear form whose radical is denoted by radS>'. We set
D>' = S>'/radS>'.
{D>'I A f- n}
Theorem 3.10. (Dipper/James) Let e be as above. Then D>' '" (0) precisely if
.A is e-regular.
The blocks of 1i are given bye-cores respectively p-cores in the group case
by [28, 4.13] and [87, 4.29], and the primitive central idempotents were exhibited
in [28, 5.4] to name a few further results.
In the special case of a field R of characteristic 0, where q is a primitive eth
root of unity, an algorithm how to compute the decomposition matrices explicitly
will be discussed in more detail in the next section.
We come now to other types of Iwahori-Hecke algebras and q-Schur algebras.
In [99] Mathas produced a cellular basis of Ariki-Koike algebras generalizing
results of Dipper, James and Murphy [34] on Iwahori-Hecke algebras of type
B. Here the labels of the basis elements are standard multi-tableaux whose
shapes are multi-partitions, which in turn label the standard and the irreducible
modules. The decomposition matrix is again lower unitriangular. The question
of actually classifying the irreducible modules is presently still open, although a
lot of evidence towards a conjecture concerning this question has been obtained,
(see [34, 8.13] for the case of Iwahori-Hecke algebras of type Band [99] for its
generalization to Ariki-Koike algebras).
A similar picture evolves for q-Schur algebras and their generalizations. The
case of type A was investigated first in [29] and [30] by Dipper and James.
They defined q- Weyl modules, labelled by partitions of n and determined a
semistandard basis of those, labelled by semistandard tableaux. The q- Weyl
modules have unique maximal submodules for fields R and the unique irreducible
quotients are precisely the irreducible modules for the q-Schur algebra. The
decomposition matrix describing the composition factors of the q- Weyl modules
is a square matrix and is unitriangular.
In [35] Du extended the Kazhdan-Lusztig basis for Iwahori-Hecke algebras to
q-Schur algebras and showed later (see e.g. [36]) that it is the same as Lusztig's
canonical basis arising from quantum groups. In [38] Du and Rui showed that
this basis is cellular.
In his thesis [63] Richard Green extended the Murphy basis of the Iwahori-
Heeke algebra of type A to q-Sehur algebras using bideterminants and showed
that it is cellular.
356 R. Dipper, M. Geck, G. Hiss, G. Malle
The cyclotomic q-Schur algebras (and hence the special cases for type A and
B of Section 3.3) are cellular ([33]) as well as the symplectic q-Schur algebras of
Oehms in [104]. Moreover, in both cases the bilinear form on standard modules
are non-zero, and hence the decomposition matrices are square matrices and the
algebras are quasi hereditary.
Under a special assumption there is a direct way to investigate Iwahori-
Hecke and q-Schur algebras of type B, yielding more complete results. Thus let
W = C 2 /6 n be the Weyl group of type B of rank n. The Iwahori-Hecke algebra
H involves two parameter q and Q. Let
n-l
fn(Q, q) = II (Q + qi).
i=-(n-l)
This theorem was shown in [68] for those choices of parameters occurring in
the representation theory of finite classical groups. The condition fn(Q, q) f= 0
holds in roughly a third of all possible cases for these groups. Gruber and Hiss
used their representations to establish Theorem 3.12 and to show even that using
only parabolic subgroups of type A produces another Morita equivalent q-Schur
algebra. However Theorem 3.12 holds in general as can be seen easily from [32].
Obviously Theorems 3.11 and 3.12 allow to carry results for the type A-case
to those for the type Bn-case if fn(Q,q) f= O. In particular using 3.10 there is
a complete description of the irreducible modules and decomposition matrix of
H in terms of Iwahori-Hecke algebras of type A and that of the corresponding
q-Schur algebra of type B in terms of q-Schur algebras of type A. In Section 4.4
this will be applied to representations of classical groups.
Iwahori-Hecke algebras of type D seem far less accessible. In particular there
is no cellular basis known, and only q-Schur algebras of parabolic type have been
considered so far. Pallikaros generalized Theorem 3.11 to Iwahori-Hecke algebras
of type D in [106]. However, his results are not as complete as in the type B
case. In particular, he does not get a complete classification of the irreducible
modules in the case of even n. Note, however, that the triangular shape for
the decomposition matrix is established in [51], using a completely different
Representations of Heeke Algebras and Finite Groups of Lie Type 357
approach. There are some more results in special cases, due to Gruber and Hiss
in [68], using the fact that Iwahori-Hecke algebras of type D are contained as
subalgebras in Iwahori-Hecke algebras oftype B for a special choice of the second
parameter.
There are many numerical results on decomposition matrices of Iwahori-
Hecke and q-Schur algebras, of which we shall mention only two, since they
are connected with Conjecture 3.1. Mathas wrote the share package SPECHT
in GAP [116] to compute decomposition numbers of Iwahori-Hecke algebras of
type A at roots of unity in characteristic zero. In his thesis which is under
preparation, Lipp used an implementation in GAP of combinatorial rules [86]
for the computation of decomposition numbers of q-Schur algebras of type A
to verify the James conjecture in many special cases (compare Section 3.4). In
general he shows that the conjecture holds for fields R of small characteristic
and based on results of Richards [113] proved a block wise version of James
conjecture for blocks of weight ::; 3.
Let ?in be the generic Iwahori-Hecke algebra associated with the symmetric
group 6 n . Consider a specialization 8: A -t C which takes the parameter u to a
primitive e-th root of unity (for some e 2: 1), and let De be the corresponding
decomposition matrix. Lascoux, Leclerc and Thibon [91] conjectured that the
problem of computing De can be translated to that of computing the canonical
basis (in the sense of Kashiwara and Lusztig) of a certain highest weight module
for an affine Kac-Moody algebra. This has now been proved by Ariki [3]. (In
October 1995, Grojnowski also announced a proof, following the ideas in [66].)
Here, we shall only describe the purely algorithmic part of the solution.
For any n 2: 0, let Fn be the Q(u)-vector space with a basis consisting of the
partitions A f- n; by convention, Fo has as basis the empty partition, 0. Let
We shall now define linear maps fr: F -t F (0 :::; r :::; e - 1) such that
fr(Fn) ~ F n + 1 for all n, r. Let A I- n and 0 :::; r :::; e - 1. Suppose that the i-th
row is an indent row of A whose e-residue is r. Then we let N(A, r, i) be the
number of indent rows strictly above i whose e-residue is r, minus the number
of removable boxes in rows above i whose e-residue is r. We define
fr(A) = LiuN(~,r,i) A[i],
where the sum is over all indent rows i of A with e-residue r, and A[i] denotes
the partition of n + 1 obtained by adding a new box to the i-th row. For any
k ~ 1, we let [k]u := (uk - u-k)/(u - u- 1 ), and define operators f;k) by
f$1) := fr and f;k) = f;k-l) I[k]u (k ~ 2).
Let Kn ~ Fn be the subspace generated by all f::(0) := fi~n) ... fi~')(0), where
£ = (i 1 , ... , in) and K, = (kl, ... , k n ) are any sequences of integers with 0 :::;
il, ... , in :::; e - 1 and k 1 , • •• ,kn ~ 1.
Lemma 3.1. For any e-regular partition v I- n, there exist £, K, as above such
that
Kv := f:(0) = v + Z[u±1]-linear combination of various AI- n, A <l v.
Here, <l denotes the usual dominance order. Note that this element depends on
the choice of £, K" but for any choice, the elements {Kv} form a basis of Kn.
The proof in [91, §6] also yields an algorithm for finding suitable sequences £, K,
for each v. Now, using an interpretation of F as a module for an affine Kac-
Moody algebra and applying the theory of canonical bases, it can be shown that
there exist unique elements {GI'} in Kn (for e-regular Ill- n) such that
Now let us finally come to D., the decomposition matrix that we are trying to
compute. By [26], we know already that De has rows labelled by all partitions
>. I- n and columns labelled by the e-regular partitions f.L I- n. The following
result, which is Conjecture 6.9(ii) in [91], shows that De is determined by Gp.:
Theorem 3.13. (Ariki [3, Theorem 4.4]) For any e-regular partition f.L I- n, let
us write Gp. = L:;,\I-n d,\p.(u) >. where d,\p.(u) E Z[u]. Then we have De = (d,\p.(I)).
We remark that Ariki proves in fact a more general result, which also applies to
Iwahori-Hecke algebras of type Bn and even to the cyclotomic Hecke algebras
associated with the wreath product of 6 n and a finite cyclic group. Moreover,
Leclerc and Thibon [92] have proposed a generalization to the q-Schur algebra.
4 Decomposition Matrices
The concept of the decomposition matrix provides a link between the "ordinary"
complex representations of a finite group and the "modular" representations over
a field of prime characteristic. In the case of finite groups of Lie type, the ordi-
nary character tables are relatively well understood, due to the work of Lusztig
and Shoji (see [95,117] and the references there). The situation for modular rep-
resentations in the defining characteristic is settled by Lusztig's conjecture, if
that characteristic is not too small (see [2] and the references there). In this sec-
tion we consider the case of non-defining characteristic, and give a survey of the
present state of knowledge. Most of the results were first proved for the groups
GLn(q) (see [18,29]). The precise formulation of possible generalizations to other
types of groups was in many cases only possible after extensive calculations in
exceptional groups of low rank (see [72]).
Definition 4.2. We say that D has a triangular shape with respect to the ordi-
nary basic set S, if it is possible to order the sets Sand 1Brl(B) in such a way
that the matrix Ds has a lower triangular shape with 1 along the diagonal.
There are examples where this is not possiblej e.g., for G = SL2 (8) and £ = 2.
But there are no counter-examples known for finite groups of Lie type in non-
defining characteristic! The advantage of having a triangular shape as above is
that we then have a canonical labelling of the irreducible Brauer characters in
B by the ordinary characters in S. Indeed, we have:
Lemma 4.1. Assume that D has a triangular shape with respect to S. Choose
labellings S = {Xl,"" Xm} and IBrl(B) = {Ol,"" Om} such that the corre-
sponding matrix D s is lower triangular with 1 along the diagonal. Then the
bijection
Xi ++ Oi (1 ~ i ~ m),
does not depend on the chosen labellings.
Now consider again a finite group of Lie type GF over lFq as in the previous
section, and let £ be a prime not dividing q.
362 R. Dipper, M. Geck, G. Hiss, G. Malle
Conjecture 4.1. (Brow~ [6, Section 2A]/Hiss [72, Section 4.4]) If the ordinary
irreducible characters in &(G F , 8) and in &(CGo (8)*F*, 1) are matched via the
Jordan decomposition of characters (see Section 2.3), then the decomposition
matrix of [(G F , 8) equals that of [(CGo (8)* F ,1).
O
In the case where CGo (8) is a Levi L* subgroup of G*, the Jordan decomposition
is given by the operator RT, and Broue has in fact conjectured that this should
define a Morita equivalence between the corresponding unions of blocks. We will
come back to these questions below in the discussion of general linear groups
and the linear prime case for classical groups.
By Theorem 4.1, the unipotent characters of G F form an ordinary basic set
for [t (G F , 1) provided that £ is a good prime for G and the center of G is con-
nected. Suppose we are in this situation. Then it is known that the corresponding
decomposition matrix has a triangular shape in the following cases:
- GF = GLn(q), any £; see Section 4.3 below. In this case, we also have a
Jordan decomposition of blocks and decomposition numbers.
- G F is a finite classical group, £ is a linear prime; see Section 4.4. There is
also a Jordan decomposition of blocks and decomposition numbers.
- GF = GUn(q), any £; see [44].
- G F is an exceptional group of low rank (possibly with further conditions on
q or e); see [72] for G 2 (q), [45] for 3 D 4(q), and [122] for F4(q). A discussion of
a special case in E6(q) is contained in [53, §7]. (Although, strictly speaking,
the Suzuki and Ree groups are not finite groups of Lie type as introduced in
Section 2.2, the questions discussed here make sense for these groups, too. It
is known that the decomposition matrix of the unipotent characters of the
Suzuki groups 2B2(q) and of the Ree groups 2G 2(q) has triangular shape.
These groups have cyclic defect groups in all odd (non-defining) character-
istics, so the only problem here is the case of 2G2 (q) in characteristic 2.
The decomposition numbers in this case were found by Landrock and Mich-
ler [89].)
Some partial results are known in the bad prime case. It is shown in [48, Part III]
that if G F is a finite classical group, q is large and £ = 2, then the decomposition
matrix of [2(G F , 1) also has a triangular shape but only with respect to ordinary
basic sets which do not necessarily consist of unipotent characters.
Among all irreducible characters, the cuspidal ones playa special role from
the viewpoint of Harish-Chandra theory. It has been shown by Dipper and James
[18] that, if G F = GLn(q), then the i-modular reduction of an ordinary cuspidal
irreducible character remains irreducible as a Brauer character, and all cuspidal
Representations of Heeke Algebras and Finite Groups of Lie Type 363
irreducible Brauer characters arise in this way. This need not be the case for
other types of groups: there are examples of cuspidal ordinary characters which
do not remain irreducible under f-modular reduction; and there are examples
of cuspidal Brauer characters which do not occur in the f-modular reduction of
any ordinary cuspidal irreducible character. However, the second named author
has formulated in his Ph.D. thesis (1990) the following:
If this were true, we would then have a connection between the theory of Harish-
Chandra series of ordinary characters and of f-modular Brauer characters. The
conjecture is known to be true in the following cases:
- GF = GUn(q), any f; see [55].
- GF is a finite classical group, q is large and f = 2; see [58].
- G F is an exceptional group of low rank (possibly with further conditions on
q or f); see [72] for G2(q), [45] for type 3 D 4(q), and [53, §7] for E6(q). (Also
for 2B2(q) and 2G'2(q) if f is odd, but it is not true for f = 2 in 2G'2(q) [89].
But notice that the latter is not a group of Lie type as defined in Section 2.2.)
The proofs of these results use the fact that the decomposition matrix is known
to have a triangular shape in these cases, and a kind of Harish-Chandra theory
for generalized Gelfand-Graev representations in order to produce "sufficiently
many zeroes" in the decomposition matrix (see [58]).
We concentrate now on the special case where the l-regular part s' of s is the
identity 1 of G. This is justified, since the union £((G, s) of alil-blocks of G with
semisimple part s (compare Theorem 2.8) has the same decomposition matrix as
the union £((GG(s), 1) of unipotent l-blocks of GG(s), i.e., Conjecture 4.1 holds
for G. There is, however, so far no independent proof of this fact, it follows a pos-
teriori by showing that the decomposition matrices of the same q-Schur algebras
determine these decomposition matrices with the same algorithms. Everything
what is said below can be generalized to the case of arbitrary SEC.
Thus s is an l-element of G. Then W(L., 1) in Theorem 3.2 is the normalizer
W(L.) of L. in the Weyl group W of G. We have:
Theorem 4.2. (Dipper) Gk(s) = k ® Go(s) is an irreducible cuspidal kL-
module. There exists an l-element t E C such that L = L t = L. and the sub-
group W(L, t) of W(L) equals W(L). Moreover Gk(t) ~ Gk(S) and W(L, s) is a
parabolic subgroup of W(L) = W(L, t).
This result can be used to reduce the computation of the decomposition num-
bers ofthe geometric conjugacy class £(G, s) completely to those of £(G, t). For
details and a proof we refer to [20, Section 4] and [29]. With other words it suffices
to calculate decomposition numbers of Harish-Chandra series £(G, s) satisfying
the hypothesis of Theorem 3.2. In particular we may assume that tiR(W(L, s» =
R®oti(W(L, s» for R E {K, 0, k}, and write tiReS) := tiR(W(L, s». Similarly,
we write SR(S) := SR(W(L,s» for the corresponding q-Schur algebras.
Let e = l if l divides q - 1 and e be the multiplicative order of q modulo l
otherwise. The standard and the irreducible tiR(s)-modules are now labelled by
multi-partitions respectively regular multi-partitions corresponding to the tensor
factors of tiReS), where R = K or k. Here regular means ei-regular in the ith
part were ei = e if di = 1 and ei = l otherwise. We now apply Theorems 3.1
and 3.5 (in their general form [25]) to obtain:
Theorem 4.3. (DipperfJames) The decomposition matrix..1. of the q-Schur
algebra So (s) is part of the decomposition matrix of £ (G, s). The columns corre-
sponding to regular multi-partitions are the columns of the decomposition matrix
oftio(s). If L. = GLd(q)m for natural numbers m,d with md = n, then..1. is
the complete l-modular decomposition matrix of £(G, s).
This was proved in [29] up to the last part, which is a result of James in
[85]. Consider now the special case s = 1. Then L can be taken as the Borel
subgroup of all lower triangular matrices in G and tiReS) = llR(6 n ; q). The
rows and columns of the decomposition matrix ..11 of the corresponding q-Schur
algebra are labelled by partitions of n. An easy counting argument (applied to
all semisimple l-regular elements sEC) shows that ..11 is the complete decom-
position matrix of £(G, 1), that is, we have a complete list of the irreducible
kG-modules in the union of the unipotent blocks. We have thus:
Theorem 4.4. (Dipper/James) There is a bijection between the irreducible kG-
modules and the irreducible So(s)-modules, where s runs through a set of rep-
resentatives of semisimple l-regular conjugacy classes of G. Moreover the l-
modular decomposition matrix of G is lower unitriangular.
Representations of Heeke Algebras and Finite Groups of Lie Type 365
The classical groups. Thus a classical group in this section is one of the groups
of the following list. Let lFq denote the field with q elements, where q is a power
of a prime p, and let n be a positive integer. We write Gn(q) for anyone of the
following groups.
(1) The unitary group GUn(q) of an n-dimensional vector space over lFq2 with
a non-degenerate unitary form.
(2) The special orthogonal group SOn(q) of an n-dimensional vector space over
lFq with a quadratic form of maximal Witt index, where n is odd.
(3) The conformal symplectic group eSPn(q) of an n-dimensional vector space
over lFq with a non-degenerate symplectic form (such that n is even).
(4) The conformal special orthogonal group eso;t(q) of an n-dimensional vector
space over lFq with a quadratic form of maximal Witt index, where n is even.
(5) The conformal special orthogonal group eso;; (q) of an n-dimensional vector
space over lFq with a quadratic form of Witt index n/2 - 1, where n is even.
Linear and very large primes. Given a non-zero integer a and a prime e not
dividing a we write d := d(a, e) for the multiplicative order of a modulo e, i.e., d
is the smallest positive integer such that e divides ad - 1.
Definition 4.3. Let G = Gn(q) be one of the groups in the list above, and let e
be a prime not dividing q. Put
for G, and s a non-isolated semisimple £I-element in the dual group G*, i.e.,
CG. (s) is contained in a proper Levi subgroup of G*. Then the decomposition
matrices of &(G, s) and of &(Ca. (s), 1) are the same, i.e., Conjecture 4.1 is true
in these cases.
Unfortunately, no a priori proof of this reduction is known. For very large
primes it follows from the fact that, for the two cases, corresponding arguments
can be given to establish the links on the Brauer trees. For linear primes it follows
from the fact that the two q-Schur algebras which determine the decomposition
numbers of &(G, s) and &(Ca. (s), 1) are isomorphic to each other.
We may thus, for the purpose of this survey, restrict attention to the decom-
position numbers for unipotent characters. It turns out that in case of linear
and very large primes, the distribution of the unipotent characters into Harish-
Chandra series (of ordinary characters) plays an important role.
Theorem 4.8. (Fong/Srinivasan [42]) Let G be a classical group from the list
above and let f be a very large prime for G. Consider a Harish-Chandra series S
of unipotent characters of G. Suppose that the elements of S fall into the f-blocks
B 1 , .•• , B. of G. Then, for 1 $ i $ s, the a-invariants of the unipotent characters
in S n Bi are all different. Moreover, arranging these characters according to
increasing a-invariants, the decomposition matrix of S n Bi equals J(b i ) (where
bi = ISnBil and J(b i ) denotes the lower unitriangular Jordan (b i x bi)-matrix).
With this ordering of the characters in S, the decomposition matrix .::1 of S is
the block diagonal matrix diag(J(b 1 ), J(b 2 ), ••• , J(b.)).
We finally turn to the case of linear primes. For this purpose we have to
describe, in some more detail, the parameterization of the characters inside a
Harish-Chandra series of unipotent characters. This is the same for all classical
groups and all Harish-Chandra series except for the principal series of the confor-
mal group CSO~(q). For reasons of simplicity we therefore omit the latter from
368 R. Dipper, M. Geck, G. Hiss, G. Malle
Ll=
o
Here, ® denotes the Kronecker product of the matrices.
To be more precise, the elements of S are labelled as X = X(1'1,1'2) by bi-
partitions (P,1, P,2) of m via ordinary Harish-Chandra theory. The Brauer char-
acters occurring as constituents of the elements in S also have a labeling as
c.p = c.p(Vl,V2) by bi-partitions (VI, V2) of m. The multiplicity of c.p(Vl,V2) in X(1'1,1'2)
is 0, if 1p,11 i= IV11, and it equals dl'lVl (q,j)dI'2v2(q,j), otherwise. Here, d AlA2 (l)
is the i-decomposition number of GLIAl I(l) which corresponds to the unipotent
character parameterized by the partition Al and the Brauer character parame-
terized by "\2'
Let us state one corollary of the results described in this section.
Corollary 4.1. Let G be a classical group from the list above and let i be a linear
or a very large prime for G. Then the ordinary irreducible characters of G can
be arranged in such a way that the decomposition matrix is lower unitriangular.
This follows from the theorems given in this section (except for the conformal
orthogonal groups CSO~(q), where it is also true) and from the corresponding
fact for the general linear groups, see Theorem 4.5.
We also have an important result on the size of the decomposition numbers.
Theorem 4.10. (Gruber/Hiss [68, Corollary 8.11)) Let G = Gn(q) be a clas-
sical group from the list above and let i be a linear prime for G. Then the
i-decomposition numbers of G are bounded above by the order of the Weyl group
ofG.
Note that this bound on the decomposition numbers is independent of q and i.
Representations of Heeke Algebras and Finite Groups of Lie Type 369
Brauer trees. If we consider primes i with the property that the Sylow i-
subgroups of G are cyclic, the situation is much better. In this case the de-
composition matrices are described by Brauer trees. These are known for all
exceptional groups of Lie type except for E7(q) and E8(q) (and some exceptions
for 2E6(q» [77,76].
(1)
(2)
and
11' = CPA,St := CPSt m-l ® CPSt mo ® CPSt m, ® ...
by Theorem 4.11. Thus the Harish-Chandra series of unipotent Brauer charac-
ters of G are in bijection with the partitions of n of the form (2). The question of
which unipotent Brauer characters lie in the Harish-Chandra series correspond-
ing to such a partition is answered in the following theorem. By the results
of Section 4.3 we have a natural labelling of the unipotent Brauer characters
of G by partitions of n. Let 11'/1 denote the unipotent Brauer character of G
corresponding to the partition I-' of n.
Theorem 4.12. (Dipper/Du [23,4.3.12)) Let I-' be a partition ofn with e-i-adic
expansion
I-' = 1-'(-1) + el-'(O) + eil-'(l) + ....
It follows in particular, that G has at most one cuspidal unipotent Brauer char-
acter. The Levi subgroups of G are of the form
Theorem 4.14. Let (ai; P,1, P,2) be the label of a unipotent Brauer character 'P
of G, and let p'~j) and p'~j), j = -1,0,1, ... , denote the parts of the e-P-adic
expansion of P,l and P,2, respectively. (Here, e is defined as in 4.3 with respect
to q0.) Then'P lies in the Harish-Chandra series corresponding to La,.>., where A
is the partition of (n - ai)/2 of the form (2) with mj = 1p,~j)1 + Ip,~\ j =
-1,0,1, ...
This theorem was conjectured for the unitary groups in [55, Conjecture 9.2],
and proved in the case that 'P lies in an P-block with a cyclic defect group [55,
Proposition 9.3]. In the general case it follows from the results of [68, Section 8].
The unitary groups. We now give a result which indicates the existence of
many cuspidal unipotent Brauer characters which are not liftable to ordinary
characters. Let G = GUn(q) and let P be an odd prime dividing q + 1. The
unipotent Brauer characters of G have a natural parameterization by partitions
of n (see Section 4.2). Let 'Pp. denote the unipotent Brauer character of G cor-
responding to the partition p, of n.
2-regular. Define m = IJLII and mj = IJL~j) I, where the JL~j) are the parts of the
l-adic expansion of JL2, j = 0, 1,2, ....
Then <PIL lies in the Harish-Chandra series of <PILI 0 <PSt mo 0 <PSt ml 0"',
where <PSt m; is the l-modular Steinberg character of GL t ; (q2)m;, j = 0,1,2, ...
These two results were conjectured in [55, Conjecture 9.6], and proved to be true
in [56, Theorem 4.12J for l > n, and in general by Gruber in [67J.
d L X R C 1i #irr.
3 1 1 E7 1 1i(E7; q) 32
A2 <PSt A5 2 1i(A5;q) : 2 14
A2 x A2 <PSt x <PSt G 2 x Al 1 1i(G2; q, 1) 01i(AI; 1) 8
D4 C C3 1 1i(C3 ;q,q) 4
E6 CI, C2 Al 1 1i(AI; 1) 2+2
E6 c3,···,Cij Al 1 1i(AI ;Pi), i = 1, ... ,4 8
+ 6 cuspidals
5 1 1 E7 1 1i(E7;q) 54
A4 <PSt A2 2 1i(A2; q) : 2 6
D4 C C3 1 1i(C3 ;q,q4) 10
E6 CI,C2 Al 1 1i(AI;q4) 2+2
+ 2 cuspidals
7 1 1 E7 1 1i(E7;q) 58
D4 C C3 1 1i(C3 ;q,q4) 10
A6 <PSt Al 1 1i(AI ;P5) 2
E6 CI,C2 Al 1 1i(AI;q2) 2+2
+ 2 cuspidals
9 1 1 E7 1 1i(E7;q) 58
D4 C C3 1 1i(C3 ;q,q4) 10
E6 Cr, C2 Al 1 1i(AI; 1) 2+2
E6 <PSt Al 1 1i(AI ;P6) 2
+ 2 cuspidals
Representations of Hecke Algebras and Finite Groups of Lie Type 373
Its first column contains the number d = d(q, f), the order of q modulo f.
The second column of the table gives the pairs (L, X), where L is a Levi
subgroup of G and X is a cuspidal simple kL-module. The Levi subgroups are
denoted by their types. The trivial kL-module (which is cuspidal only if L is the
maximally split torus), is denoted by 1, the Steinberg module, if it is cuspidal,
by 'PSt. Further cuspidal modules are denoted by cor Ci, i = 1, ... ,6.
The third and fourth columns then give the groups R = R(L, X) and C =
C(L, X), and the sixth column the endomorphism algebra H = H(L, X) =
EndkG(Rr(X» (see Theorem 2.3). The parameters of these all lie in the finite
field 1Ft ~ k, i.e., q also denotes its image in 1Ft. There are some parameters
Pi E 1Ft * \ { -1}, i = 1, ... ,6, which are not yet known explicitly.
The last column of the table gives the number of simple kG-modules in the
Harish-Chandra series corresponding to (L, X), which is equal to the number of
simple H-modules by Corollary 2.1.
References
1. D. ALVIS AND G. L USZTIG, The representations and generic degrees of the Hecke
algebra of type H4, J. reine angew. Math. 336 (1982), 201-212; Correction: ibid.
449 (1994), 217-218.
2. H.H. ANDERSON, J.C. JANTZEN, AND W. SOERGEL, Representations of quan-
tum groups at a p-th root of unity and of semisimple groups in characteristic p:
independence of p, Asterisque 220 (1884).
3. S. ARIKI, On the decomposition numbers of the Hecke algebra of G(m, 1, n), J. of
Math. Kyoto Univ. 36 (1996), 789-808.
4. C.T. BENSON AND C.W. CURTIS, On the degrees and rationality of certain char-
acters of finite Chevalley groups, Trans. Amer. Math. Soc. 165 (1972), 251-273;
Corrections and additions, ibid. 202 (1975), 405-406.
5. K. BREMKE, The decomposition numbers of Hecke algebras of type F4 with un-
equal parameters, Manuscripta Math. 83 (1994), 331-346.
6. M. BROUE, Isometries parfaites, types de blocs, categories derivees, in Repre-
sentations Lineaires de Groupes Finis, Asterisque 181-182 (1990),61-92.
7. M. BROUE AND G. MALLE, Theoremes de Sylow generiques pour les groupes
reductifs sur les corps finis, Math. Ann. 292 (1992), 241-262.
8. M. BROUE AND G. MALLE, Zyklotomische Heckealgebren, in: Representations
unipotentes generiques et blocs des groupes reductifs finis, Asterisque 212 (1993),
119-189.
9. M. BROUE, G. MALLE, AND J. MICHEL, Generic blocks of finite reductive groups,
in: Representations unipotentes generiques et blocs des groupes reductifs finis,
Asterisque 212 (1993), 7-92.
10. M. BROUE AND J. MICHEL, Blocs et series de Lusztig dans un groupe reductif
fini, J. reine angew. Math. 395 (1989), 56-67.
11. M. BROUE AND J. MICHEL, Blocs a. groupes de defaut abeliens des groupes
reductifs finis, in: Representations unipotentes generiques et blocs des groupes
reductifs finis, Asterisque 212 (1993), 93-118.
12. M. CABANES, A criterion for complete reducibility and some applications, in:
Representations lineaires de groupes finis, Luminy, 16-!H mai 1988, Asterisque
181-182 (1990),93-112.
374 R. Dipper, M. Geck, G. Hiss, G. Malle
13. M. CABANES AND M. ENGUEHARD, Local methods for blocks of reductive groups
over a finite field, in: Finite reductive groups: Related structures and representa-
tions (ed. M. Cabanes), pp. 141-163. Birkhiiuser, Basel, 1997.
14. R.W. CARTER, Representation theory of the O-Hecke algebra, J. Algebra 104
(1986),89-103.
15. R.W. CARTER, Finite groups of Lie type: Conjugacy classes and complex charac-
ters, Wiley, Chichester, 1985.
16. C. W. CURTIS AND I. REINER, Methods of representation theory, vols. 1,2, Wiley,
Chichester, 1981, 1987.
17. P. DELIGNE AND G. LUSZTIG, Duality for representations of a reductive group
over a finite field II, J. Algebra 81 (1983), 540-545.
18. R. DIPPER, On the decomposition numbers of the finite general linear groups,
7Tans. Amer. Math. Soc. 290 (1985), 315-343; II, 292 (1985), 123-133.
19. R. DIPPER, On quotients of Hom-functors and representations of finite general
linear groups I, J. Algebra 130 (1990), 235-259.
20. R. DIPPER, On quotients of Hom-functors and representations of finite general
linear groups II, Preprint, Stuttgart 1997.
21. R. DIPPER AND S. DONKIN, Quantum GL n , Proc. London Math. Soc. 63 (1991),
165-211.
22. R. DIPPER AND J. Du, Harish-Chandra vertices, J. reine angew. Math. 437
(1993), 101-130.
23. R. DIPPER AND J. Du, Harish-Chandra vertices and Steinberg's tensor product
theorem for general linear groups, Proc. London Math. Soc. 75 (1997), 559-599.
24. R. DIPPER AND P. FLEISCHMANN, Modular Harish-Chandra theory I, Math. Z.
211 (1992), 49-71; II, Arch. Math. 62 (1994), 26-32.
25. R. DIPPER AND J. GRUBER, Generalized q-Schur algebras and modular represen-
tation theory of finite groups with split (B, N)-pairs, Preprint, Stuttgart 1997.
26. R. DIPPER AND G. D. JAMES, Representations of Hecke algebras of general linear
groups, Proc. London Math. Soc. 52 (1986), 20-52.
27. R. DIPPER AND G. D. JAMES, Identification of the irreducible modular represen-
tations of GLn(q), J. Algebra 104 (1986), 266-288.
28. R. DIPPER AND G. D. JAMES, Blocks and idempotents of Hecke algebras of
general linear groups, Proc. London Math. Soc. 54 (1989), 57-82.
29. R. DIPPER AND G. D. JAMES, The q-Schur algebra, Proc. London Math. Soc. 59
(1989), 23-50.
30. R. DIPPER AND G. D. JAMES, q-tensor space and q-Weyl modules, Proc. Amer.
Math. Soc. 327 (1991), 251-282.
31. R. DIPPER AND G.D. JAMES, Representations of Hecke algebras of type B n ,
J. Algebra 146 (1992), 454-481.
32. R. DIPPER, G.D. JAMES, AND A. MATHAS, The (Q,q)-Schur algebra, Proc. Lon-
don Math. Soc., to appear.
33. R. DIPPER, G.D. JAMES, AND A. MATHAS, Cyclotomic q-Schur algebras,
Math. Z., to appear.
34. R. DIPPER, G.D. JAMES, AND G.E. MURPHY, Hecke algebras of type Bn at roots
of unity, Proc. London Math. Soc. 70 (1995), 505-528.
35. J. Du, Kazhdan-Lusztig bases and isomorphism theorems for q-Schur algebras,
Contemp. Math. 139 (1992), 121-140.
36. J. Du, A new proof for the canonical bases of type A, Preprint, 1996.
37. J. Du, B. PARSHALL, AND L. SCOTT, Stratifying endomorphism algebras associ-
ated to Hecke algebras, Preprint, Sydney 1996.
Representations of Hecke Algebras and Finite Groups of Lie Type 375
38. J. Du AND H. RU!, Based algebras and standard basis for quasi-hereditary alge-
bras, 7rans. Amer. Math. Soc., to appear.
39. J. Du AND L. SCOTT, The q-Schur 2 algebra, Preprint, Charlottesville 1997.
40. P. FONG AND B. SRINIVASAN, Generalized Harish-Chandra theory for unipotent
characters of finite classical groups, J. Algebra 104 (1986), 301-309.
41. P. FONG AND B. SRINIVASAN, The blocks of finite classical groups, J. reine angew.
Math. 396 (1989), 122-191.
42. P. FONG AND B. SRINIVASAN, Brauer trees in classical groups, J. Algebra 131
(1990), 179-225.
43. M. GECK, Irreducible Brauer characters of the 3-dimensional special unitary
groups in non-defining characteristic, Comm. Algebra 18 (1990), 563-584.
44. M. GECK, On the decomposition numbers of the finite unitary groups in non-
defining characteristic, Math. Z. 201 (1991), 83-89.
45. M. GECK, Generalized Gelfand-Graev characters for Steinberg's triality groups
and their applications, Comm. Algebra 19 (1991), 3249-3269.
46. M. GECK, Brauer trees of Hecke algebras, Comm. Algebra 20 (1992), 2937-2973.
47. M. GECK, The decomposition numbers of the Hecke algebra of type E6, Math.
Compo 61 (1993), 889-899.
48. M. GECK, Basic sets of Brauer characters of finite groups of Lie type, II, J. London
Math. Soc. 41 (1993), 255-268; III, Manuscripta Math. 85 (1994), 195-216.
49. M. GECK, On the character values of Iwahori-Hecke algebras of exceptional type,
Proc. London Math. Soc. 68 (1994), 51-76.
50. M. GECK, Beitriige zur Darstellungstheorie von Iwahori-Hecke-Algebren, Habilita-
tionsschrift, Aachen 1994, Aachener Beitriige zur Mathematik, Band 11, Aachen,
1995.
51. M. GECK, Kazhdan-Lusztig cells and decomposition numbers, Institut de Mathe-
matiques de Jussieu, Paris, Prepubl. 141, 1997.
52. M. GECK AND G. HISS, Basic sets of Brauer characters of finite groups of Lie
type, J. reine angew. Math. 418 (1991), 173-188.
53. M. GECK AND G. HISS, Modular representations of finite groups of Lie type
in non-defining characteristic, in Finite reductive groups: Related structures and
representations (ed. M. Cabanes), pp. 195-249. Birkhiiuser, Basel, 1997.
54. M. GECK, G. HISS, F. LUBECK, G. MALLE, AND G. PFEIFFER, CHEVIE - A
system for computing and processing generic character tables, AAECC 1 (1996),
175-210.
55. M. GECK, G. HISS, AND G. MALLE, Cuspidal unipotent Brauer characters, J.
Algebra 168 (1994), 182-220.
56. M. GECK, G. HISS, AND G. MALLE, Towards a classification of the irreducible
representations in non-defining characteristic of a finite group of Lie type, Math.
Z. 221 (1996), 353-386.
57. M. GECK AND K. Lux, The decomposition numbers of the Hecke algebra of type
F4, Manuscripta Math. 10 (1991), 285-306.
58. M. GECK AND G. MALLE, Cuspidal unipotent classes and cuspidal Brauer char-
acters, J. London Math. Soc. 53 (1996), 63-78.
59. M. GECK AND J. MICHEL, Good elements in finite Coxeter groups and repre-
sentations of Iwahori-Heeke algebras, Proc. London Math. Soc. (3) 14 (1997),
275-305.
60. M. GECK AND G. PFEIFFER, On the irreducible characters of Heeke algebras,
Adv. in Math. 102 (1993), 79-94.
376 R. Dipper, M. Geck, G. Hiss, G. Malle
61. M. GECK AND R. ROUQUJER, Centers and simple modules for Iwahori-Hecke
algebras, in Finite reductive groups: Related structures and representations (ed.
M. Cabanes), pp. 251-272. Birkhauser, Basel, 1997.
62. J.J. GRAHAM AND G.I. LEHRER, Cellular algebras, Invent. Math. 123 (1996),
1-34.
63. R.M. GREEN, q-Schur algebras and quantized enveloping algebras, Thesis, Uni-
versity of Warwick, 1995.
64. R.M. GREEN, Hyperoctahedral Schur algebras, J Algebra 192 (1997), 418-438.
65. R.M. GREEN, The affine q-Schur algebra, Preprint, Oxford, 1997.
66. I. GROJNOWSKI, Representations of affine Hecke algebras (and quantum GLn) at
roots of unity, Internat. Math. Research Notices 5 (1994), 215-217.
67. J. Gruber, Green vertex theory, Green correspondence and Harish-Chandra in-
duction, J. Algebra 186 (1997), 476-521.
68. J. GRUBER AND G. HISS, Decomposition numbers of finite classical groups for
linear primes, J. reine angew. Math. 485 (1997), 55-91.
69. T. HALVERSON AND A. RAM, Murnaghan-Nakayama rules for characters of
Iwahori-Hecke algebras of classical type, TI-ans. Amer. Math. Soc. 348 (1996),
3967-3995.
70. HARISH-CHANDRA, Eisenstein series over finite fields, in Functional analysis and
related fields, pp. 76-88. Springer, New York 1970.
71. G. HISS, On the decomposition numbers of G2(q), J. Algebra 120 (1989), 339-360.
72. G. HISS, Zerlegungszahlen endlicher Gruppen vom Lie-Typ in nicht-definierender
Charakteristik, Habilitationsschrift, RWTH Aachen, 1990.
73. G. HISS, The Brauer trees of the Ree groups, Comm. Algebra 19 (1991), 871-888.
74. G. HISS, Harish-Chandra series of Brauer characters in a finite group with a split
BN-pair, J. London Math. Soc. 48 (1993), 219-228.
75. G. HISS, Supercuspidal representations of finite reductive groups, J. Algebra 184
(1996), 839-85l.
76. G. HISS AND F. LUBECK, The Brauer trees of the exceptional Chevalley groups
of types F4 and 2E6, Arch. Math. 70 (1998), 16-21.
77. G. HISS, F. LUBECK, AND G. MALLE, The Brauer trees of the exceptional Cheval-
ley groups of type E 6, Manuscripta Math. 87 (1995), 131-144.
78. G. HISS AND J. SHAMASH, 3-Blocks and 3-decomposition numbers of G 2 (q), J.
Algebra 131 (1990), 371-387.
79. G. HISS AND J. SHAMASH, 2-Blocks and 2-decomposition numbers of the Cheval-
ley groups G2(q), Math. Compo 59 (1992), 645-672.
80. R. B. HOWLETT AND G. 1. LEHRER, Induced cuspidal representations and gen-
eralized Hecke rings, Invent. Math. 58 (1980), 37-64.
81. R. B. HOWLETT AND G. 1. LEHRER, On Harish-Chandra induction for modules
of Levi subgroups, J. Algebra 165 (1994), 172-183.
82. N. IWAHORI, On the structure of the Hecke ring of a Chevalley group over a finite
field, J. Fac. Sci. Univ. Tokyo Sect. IA Math. 10 (1964), 215-236.
83. G.D. JAMES, Unipotent representations of the finite general linear groups, J. Al-
gebra 74 (1982), 443-465.
84. G.D. JAMES, Representations of general linear groups, Cambridge Univ. Press,
Cambridge, 1984.
85. G.D. JAMES, The irreducible representations of the finite general linear groups,
Proc. London Math. Soc. 52 (1986), 236-268.
86. G.D. JAMES, The decomposition matrices of GLn(q) for n ~ 10, Proc. London
Math. Soc. 60 (1990), 225-265.
Representations of Hecke Algebras and Finite Groups of Lie Type 377
112. A. RAM, A Frobenius formula for the characters of the Hecke algebras, Invent.
Math. 106 (1991), 461-488.
113. M.J. RICHARDS, Some decomposition numbers for Heeke algebras of general linear
groups, Proc. Camb. Phil. Soc. 119 (1996).
114. H. RUI, On endomorphism algebras arising from multiparameter Hecke algebras,
J. Algebm 195 (1997), 308-319.
115. K. -D. SCHEWE, Blocke exzeptioneller Chevalley-Gruppen, Dissertation, Bonner
Mathematische Schriften, nr 165, Bonn, 1985.
116. M. SCHONERT ET AL., GAP - Groups, Algorithms, and Programming, Lehrstuhl
D fiir Mathematik, RWTH Aachen, Germany, fourth ed., 1994.
117. T. SHOJI, Character sheaves and almost characters ofreductive groups I, II, Adv.
in Math. 111 (1995), 244-313 and 314-354.
118. S. VEIGNEAU, ACE - An algebraic combinatorics environment for the computer
algebra system MAPLE, User's reference manual, Version 2.0, Institut Gaspard
Monge (1996), http://lleyl.univ-mlv.fr/~sv/ACE
119. M.-F. VIGNERAS, Sur la conjecture locale de Langlands pour GL(n,F) sur Pi,
C. R. A cad. Sci. Paris Ser. I Math. 318 (1994), 905-908.
120. K. WAKI, A note on the decomposition numbers of Sp(4, q), J. Algebm 185 (1996),
105-112.
121. D. WHITE, Decomposition numbers of Sp(4, q) for primes dividing q±l, J. Algebm
132 (1990), 488-500.
122. E. WINGS, Uber die unipotenten Chamktere der Chevalley-Gruppen vom Typ F4
in guter Chamkteristik, Dissertation, RWTH Aachen, 1995.
The Groups of Order 512
References
1. Hans Ulrich Besche and Bettina Eick: Construction of finite groups. J. Symbolic
Comput. (to appear) (1998).
2. Hans Ulrich Besche and Bettina Eick: The groups of order at most 1000 except
512 and 768. J. Symbolic Comput. (to appear) (1998).
3. Bettina Eick and E.A. O'Brien: Enumerating p-groups. In preparation (1998).
4. Graham Higman: Enumerating p-groups. I: Inequalities. Proc. London Math. Soc.
(3) 10, 24-30 (1960).
5. Rodney James: The Groups of Order p6 (p an Odd Prime). Math. Camp. 34,
613-637 (1980).
6. M.F. Newman: Determination of groups of prime-power order. Group Theory, Lec-
ture Notes in Math., 573, (Canberra, 1975), pp. 73-84. Springer-Verlag, Berlin,
Heidelberg, New York (1997).
7. E.A. O'Brien: The p-group generation algorithm. J. Symbolic Comput. 9, 677-698
(1990).
8. E.A. O'Brien: The Groups of Order 256. J. Algebra 143(1}, 219-235 (1991).
9. E.A. O'Brien: Computing automorphism groups of p-groups. Computational Alge-
bra and Number Theory (Sydney, 1992), pp. 83-90. Kluwer Academic Publishers,
Dordrecht (1995).
10. Martin Schonert et al.: GAP - Groups, Algorithms and Programming. Lehrstuhl
D fUr Mathematik, RWTH Aachen (1997).
11. Charles C. Sims: Enumerating p-groups. Proc. London Math. Soc. (3) 15, 151-166
(1965).
12. The GAP Team: GAP - Groups, Algorithms, and Programming, Version 4.
Lehrstuhl D fUr Mathematik, RWTH Aachen, and School of Mathematical and
Computational Sciences, University of St Andrews (1997).
Computational Aspects of Representation
Theory of Finite Groups II
which was done in [23] and which is explained in some detail in [41]). These
algorithms and implementations have a broad spectrum of applications in var-
IOus areas, e. g. in the recent verifications of the Dade conjectures in modular
representation theory (see for examples [13]) to mention just one.
Whereas the Dixon-Schneider algorithm works for any finite group in which
one can calculate explicitly with the conjugacy classes, there are also algorithms
due to Conlon (see [8]), and Baum (see [2], [3]) which work for p-groups, or
supersolvable groups. These algorithms use Clifford Theory and the fact that
irreducible representations of supersolvable groups are monomial and require
the group to be given by a power-commutator presentation. They also allow the
construction of representatives of the irreducible representations. These algo-
rithms have been implemented in GAP by H. U. Besche [4] and have proven to
be quite efficient. For most 2-groups of order 28 for example the implementation
of Baum's algorithm ("CharTablePGroup") in GAP takes about 2 seconds on a
Pentium 166 MH UNIX workstation, which is about three times as fast as the
implementation of the Dixon-Schneider Algorithm in these cases.
It is well known, that non-isomorphic groups may have the same character table,
the best known example being the quaternion and the dihedral groups of order
8. There are very many other examples of this sort, e.g among 2-groups, as the
following table shows:
such that
X(gi) = 'I/J(X)(</>(g)i)
for all 9 E G l and X E Irr(Gt} and j E :IN .
Computational Representation Theory 383
The first Brauer pairs were given by E. C. Dade in [11] . These were certain
p-groups of exponent p and order p7 defined for p ~ 5 . Observe that for such
groups of exponent p the powermaps don't give any further information which
is not contained in the matrix of the character values.
A comprehensive search for Brauer pairs among the 2-groups of order up to
28 was carried out in Aachen by E. Skrzipczyk in 1992 ([51]) using the library
of 2-groups established by E. O'Brien (see [43]), which is available e.g. in GAP.
Perhaps surprisingly (in view of the figures in the above table) no Brauer pairs
were found among the groups of order up to 27. But for the groups of order 256
the search was successful, giving the first examples of Brauer pairs among 2-
groups and the first examples of such pairs not consisting of groups of exponent
p.
Theorem 1. Among the 56092 groups of order 28 there are exactly 10 Brauer
pairs. These are
(G1734,G1735),(G1736,G1737), (G1739,G1740), (G1741,G1742), (G3378,G3380),
(G3379,G3381), (G3678,G3679), (G4154,G4157), (G4155,G4158 ),(G4156,G4159 ),
where G; denotes the group of order 28 with number i in O'Brien's data base.
Since it is impossible to compare 56092 character tables at the same time, it
was necessary to divide the groups into smaller families using invariants, which
are respected by the character tables. The rank and isomorphism type of the
commutator factor group, which are very easy to compute, were not sufficient
for these purposes. In addition the order of the center and the number of con-
jugacy classes were used for the preprocessing. These invariants brought down
the number of groups and tables to be considered at the same time to a man-
ageable size of a few hundreds (in many cases far less) except for a few cases
among the groups of rank 5. Since comparing large character tables (using e.g.
the GAP - command "TransformingPermutationsCharTable" , c.f. [50] ) is quite
time consuming a number of technical and quickly computable invariants had
been introduced, such as the values of characters of fixed degree on classes of
elements of given order and centralizer order. Of course also F'robenius Schur in-
dicators are taken into account and classes which consist of squares of elements
are distinguished from those which don't. It seems to be impossible to prove one
part of the Theorem without the help of a computer. On the other hand, it is
not impossible to verify by hand that the given pairs are indeed Brauer pairs;
indeed they also have the same irreducible 2-characters as defined by F'robenius
[15] (see also [31] and [30]) , where the 2-character corresponding to a character
X is defined to be the function X(2) : G x G -+ (: with
In fact this follows easily from looking at the presentations of the pairs (which we
are going to list now) and has also been observed by J. McKay and D. Sibley. We
mention in passing that on the other hand the irreducible 1-, 2- and 3-characters
characterize a finite group up to isomorphism, as was proved in [22].
We proceed to list power commutator presentations for the Brauer pairs.
All the groups in the Brauer pairs found have rank 2 and center isomorphic to
384 K. Lux, H. Pahlings
the Klein four-group 7Z2 x 7Z2 . The pairs fall into several families with very
similar structure We have changed some of the presentations given in O'Brien's
database slightly so that in all cases the center is < a7, as > and also so that for
each Brauer pair (G;,G j ) the map
From this one obtains power commutator presentations by adding the following
relations:
Finally G 367S and G 3679 are factor groups of H3 of order 512 with defining
relations
The given presentations reveal that for any of the above pairs (G i , G j) there
is a common supergroup Gi,j of order 512 in which G i and Gj intersect in a
group Di,j of order 128. In the first four pairs (G1734,GI735) to (G1741,GI742)
we have Di,j =< al,a3, ... ,as > whereas in all other cases Di,j =< a2, ... ,as >.
It turns out that some of the Brauer pairs consist of groups having the same
tables of marks, whereas others don't. We summarize some of the properties of
the pairs in the following table.
pair Gi,Gj GdG~ nclasses nsubs equal tables
G I734 , G l735 (2,2,4) 34 407 no
G 1736 , G l737 (2,2,4) 34 393 no
G1739,GI740 (2,2,4) 34 377 no
G 1741, G l742 (2,2,4) 34 283 yes
G337S,G33S0 (2,2,2) 28 381,373 no
G3379,G33S1 (2,2,2) 28 271,263 no
G 367S , G 3679 (2,2,2) 25 186 yes
G4154, G4157 (2,2,4) 46 555 no
G4155,G4158 (2,2,4) 46 481 no
G 4156 , G 4159 (2,2,4) 46 401 yes
The second column gives the type of the commutator factor group, (2 , 2 , 4)
means, that this factor is isomorphic to 7Z2 x 7Z2 X 7Z4 . In the next columns
the numbers nclasses of conjugacy classes of elements and nsubs of subgroups
are displayed. The last column indicates whether the groups G i and Gj have
the same table of marks (using appropriate orderings of the conjugacy classes of
subgroups).
386 K. Lux, H. Pahlings
2 Tables of Marks
As indicated in part I [35J tables of marks playa role in numerous applications.
Just to mention one in addition to those referred to in [35] we remark that
the first author has used such tables to find explicitly subgroups H of order
coprime to p such that a block B of a group algebra FG of G over a field
F of characteristic p is Morita equivalent to the "condensed block" eHBeH,
see page 392. Here eH is the idempotent rkr 2:hEH h corresponding to H in
FG. He has thereby determined the layers of the socle series of the projective
indecomposable B-modules for several sporadic simple groups, see page 393 (and
also [34]). The algorithm of G. Pfeiffer [44] sketched in [35] to construct tables
of marks inductively has been further developed by T. Merkwitz [38] and the
library of tables, which is now available in GAP, has been increased considerably.
One of the main improvements is that for any representative of a conjugacy
class of subgroups a set of generators is stored. For the whole group G , whenever
possible, standard generators as defined by R. Wilson [60] (see also [59]) are used.
The generators of subgroups are then expressed as words in these "standard
generators", although they are not stored in this manner but in a recursive
way, which is nowadays usually called a "straight line program". In this way
the amount of storage space necessary is kept in reasonable ranges whereas
the user has the possibility to perform concrete computations with elements of
representatives of subgroups, which has already proved to be invaluable for many
applications. In fact two frequently needed data for subgroups, their normalizers
in G and their commutator subgroups, are now already stored with the tables,
more precisely, for each conjugacy class of subgroups, which corresponds to a line
in the table of marks, the conjugacy class containing the normalizer (respectively
the commutator subgroup) of a representative of the class is given.
The GAP-library of tables of marks contains now the tables (with the addi-
tional information as described above) of the following groups:
Also most of the tables of marks of the maximal subgroups of the groups listed
above are contained in the library, in particular those of the larger groups in the
above list.
Since the publication of the first part of this paper (see [35], section 2) major
advances have been made in the area of computational modular representation
theory. The achievements can be split into the following topics.
In the following we give a survey on the topics a), b), and c).
The publication of the Atlas of Brauer Characters, see [27], surely represents
a major progress in topic a). This Atlas is the first published list of Brauer
character tables of finite simple groups. Moreover, it will be of great use to
all people who seriously want to study individual almost simple groups and
the properties of their irreducible Brauer characters. But a very competitive
approach to the printed version of Brauer character tables in the Atlas of Brauer
characters is given by the database of Brauer character tables implemented in the
GAP system. In contrast to the printed version of the Atlas of Brauer characters,
it allows the user to access the tables online and therefore use a table directly
in any further calculations, one wants to make.
Moreover one can easily derive new information from such a table, in the sense
that one can apply all elementary character theoretic manipulations within the
GAP system. As such, GAP nicely complements the MOe system, which as
described in [35] is used for determining Brauer character tables in the first
place.
The list of Brauer characters in GAP actually contains even all the new
results which have been obtained since the publication of the Atlas of Brauer
characters.
The following table lists a couple of new results obtained in the last few years
and also the authors of the corresponding tables. Note that the Brauer character
tables are complete for all groups listed in [9] up to page 135. Moreover, postscript
versions of all computed tables can be obtained on request from the first author.
388 K. Lux, H. Pahlings
3.2 Algorithms
The programs that were used for determining all the tables mentioned are part
of the two program systems MOC and M. Ringe's C-MeatAxe, see [45].
The MOC system has been in a stable state since the end of the 1980s. The
only major addition to its algorithmic part has been contributed by C. Jansen
in his diploma thesis, see [24].
In the thesis he deals with the problem of proving efficiently that a given
Brauer character or projective character is irreducible respectively indecompos-
able. The original version of MOC contained a very naive approach, see [35],
that was suffering from the known problem of combinatorial explosion.
Before we give a rough outline of the method that he uses let us recall some
facts from [35]. As described there, when studying the Brauer characters of a
finite group using MOC, one assumes that one is given several sets of Brauer
and projective characters:
i) A basic set of Brauer characters BS, i.e. a basis of the free abelian group of
Brauer characters consisting of genuine Brauer characters.
ii) An analogous basic set PS of projective characters.
Computational Representation Theory 389
iii) A set of Brauer characters B for which the decomposition into the basic
set BS contains negative coefficients. By abuse of notation we denote the
decompositions into BS by B, too.
iv) A set of projective characters P for which the decomposition into the basic
set P S contains negative coefficients. Again, by abuse of notation we denote
the decompositions into P S by P, too.
The aim of MOC is to derive from these sets the irreducible Brauer characters
IBr(G) and the indecomposable projective characters IPr(G). In addition to
the four sets above, we also consider for each of the basic sets the corresponding
dual basis called projective atoms P A and Brauer atoms BA. Denote the base
change from BS to BA by U, the base change from BS to IBr(G) by U1 and
the base change from PS to IPr(G) by U2 . Then U can be written as
This lemma indicates that in order to determine the evaluation of the cor-
responding functor on a module, we only have to know a generating set for H.
This is not true for idempotents different from eH in general. However, it is con-
ceivable and even useful to consider idempotents different from eH. Note also
that the algebra eHFGeH is isomorphic to the endomorphism ring of the per-
mutation module on the cosets of H, which sometimes is called a Hecke algebra,
see [46].
The example of the McLaughlin group in characteristic 5 given in [35] already
showed how condensation is used to find out which of a given set of possible
Brauer character tables is the correct one. Let us suppose that we are given
an FG-module M and the action of some elements of the algebra eHFGeH on
the condensed module Me. We can get lower bounds on the dimensions of the
simple modules involved in M by determining the composition factors of Me
as a module for the algebra generated by these elements. In the example given
in [35] this was sufficient to settle the problem of finding the correct Brauer
character table. However, in general, the lower bounds we get might not solve
the problem for the following reason: in general we do not know whether the
sub algebra generated by the elements selected really is the condensed algebra
eHFGeH. Even though it is quite likely that it is true, there is no guarantee in
general that we have found a generating set. The following example illustrates
that even given generators for G, the corresponding elements in the condensed
algebra eHFGeH do not necessarily generate eHFGeH.
Group Primes
Mll 2,3
M12 2,3
J1 2
M22 2,3
h 2,3,5
M 23 2,3
HS 3,5
h 3
M24 3
Remark 2. For finding a suitable subgroup H we made heavy use of the GAP
system in the sense of using the library of tables of marks created by G. Pfeiffer
and T. Merkwitz. Especially its extended version created by T. Merkwitz was
particularly useful, since it contains words in the standard generators that give
generators for all (!) subgroups of G up to conjugacy, as mentioned in section
2. This simplified the task of constructing the regular eHFGen-module consid-
erably. Moreover in some particular cases we had to use an enhanced version of
the above lemma which is due to M. Wiegelmann, see [36]. It basically says that
we can do a little bit better than studying the regular eHFGen-module.
Remark 3. As mentioned above the so de and the radical series of a module are
preserved under a Morita equivalence. This has been applied by the first author,
Computational Representation Theory 393
see [34], to determine the sode series of all the projective indecomposable mod-
ules for several of the sporadic simple groups. The following list gives an overview
of the characteristics for which the sode series of all blocks of noncydic defect
have now been computed. A question mark indicates that the corresponding so-
de series have not yet been determined.
Primes 2 3 5
Mu see [47] see [55]
M12 see [49] see [47]
J1 see [32]
M22 see [33] see [55]
h see [34] see [34] see [34]
M 23 see [36] see [55]
HS ? see [56] see [34]
J3 ? see [34]
M24 ? see [34]
The explicit Morita equivalences given for the groups listed above can also be
used to determine the complete structure of the basic algebra for blocks of FG.
S. Weiss, a former student in Aachen, has written several programs, see [57],
that given the projective indecomposable modules of an F-algebra A determine
the quiver with relations for the corresponding basic algebra. Given this data,
first the radical series of the projective indecomposable modules are determined.
This information is used to determine a basis for a complement to the vector
space of homomorphisms from each projective indecomposable module into the
radical of any other projective module to the subspace of homomorphisms into
the square of the radical is chosen. In the final step a program then determines
the relations amongst these chosen homomorphisms and thereby a presentation
for the quiver with relations is achieved. She applied her programs successfully
to J 1 in characteristic 2 and to HS in characteristic 3 using the explicit Morita
equivalences determined by the first author. Meanwhile the first author has also
succeeded along the same lines in determining the quiver with relations for the
principal block of h in characteristic 5.
Given the quiver with relations one can then start to study questions about
the cohomology of specific modules by using the program GRB written by C.
Feustel and E. Green, see [14], or by using a suggestion by J.F. Carlson. The pro-
gram GRB determines the syzygy modules of a given finitely presented module
of a finitely presented algebra, using Grabner basis techniques applied to non-
commutative algebras. J.F. Carlson's suggestion represents modules as matrix
representations but makes use of the well-known standard basis technique of the
MeatAxe. Both methods were successfully used for determining minimal projec-
tive resolutions for the basic algebra of the principal block of h in characteristic
5.
Originally the condensation method had only been implemented for the case
where FG-module M is actually a permutation module for G. However, in the
394 K. Lux, H. Pahlings
last couple of years the use of the condensation method has been extended
considerably.
In [46] A. Ryba describes his implementation of condensation which given
M determines the condensation of the n-th exterior power of M. His program
is however restricted to the case that the condensation subgroup H is acting
monomially on the FG-module M. As in the original case of a permutation
module, he proceeds in two steps: the first one in which the fixed space An(v)H
is determined and a second one where the action of eHgeH for a given 9 EGis
computed.
The general case of condensing the tensor product of two FG-modules V
and W was studied in [37]. As before, there are two steps to consider: determine
the fixed space in the tensor product of V and Wand then compute the action
of an element eHgeH on (V ® W)H. Both steps heavily rely on the fact that
the group algebra F H is semisimple. This implies that after a suitable base
change the matrix representations of H on V and W are block matrices whose
blocks are matrix representations on the irreducible FH-summands of V and
W, respectively W.
The technical details of the implementation of both steps can be found in
[37]. The resulting programs were written as part of M. Ringe's C-MeatAxe and
will be sent on request by the first author.
J. Muller and J. Rosenboom (see [40] ) were able to extend condensation from
the class of permutation modules to the much larger class of induced modules
by making use of the fundamental theorem of Mackey about homomorphisms
between induced modules. They have developed an algorithm, which derives from
an induced FG-module the corresponding condensed module without explicitly
constructing the induced module at all. Since it requires explicit information
about the group and certain subgroups, their implementation uses both the
GAP system and M. Ringe's C-MeatAxe.
Finally, based on a first version of a program written by R. A. Parker, G.
Cooperman and M. Tselman have extended the range of application of condensa-
tion for permutation modules quite considerably. R.A. Parker's original program
uses the idea that a permutation representation, which is constructed from the
action of a group G as a matrix group on a vector orbit, does not have to be
constructed explicitly in order to determine the matrices for elements of the
form eHgeH. The main ingredient in addition to this idea in the implementation
by Cooperman and Tselman is the idea of splitting the algorithm into subtasks
which are parallelized. The most impressive result, up to now, obtained in this
way is the determination of the Brauer tree in characteristic 19 of the Thomp-
son simple group, see [10], which required the study of a condensed module of a
permutation module of degree roughly a billion.
Summarizing, it seems as if computational modular representation theory is a
very active part of mathematical research at the moment. The last couple of years
have seen a tremendous amount of progress concerning results and methods.
Moreover, we can now start to study more sophisticated objects, such as for
Computational Representation Theory 395
example the complete module category of a group algebra FG, the cohomology
of FG and the quiver with relations for blocks of FG.
References
1. F. W. Anderson, K. R. Fuller, Rings and categories of modules, volume 13 of
Graduate Texts in Mathematics, Springer-Verlag, New York Heidelberg Berlin,
1974.
2. U. Baum, Existenz und effiziente Konstruktion schneller Fouriertransformationen
uberauflosbarer Gruppen. Dissertation Univ. Bonn 1991 33 pp.
3. U. Baum, M. Clausen, Computing irreducible representations of supersolvable
groups, Math. Compo 63 (1994), 351-359.
4. H. U. Besche, Die Berechnung von Charaktergraden und Charakteren endlicher
auflosbarer Gruppen im Computeralgebrasystem GAP, Diplomarbeit, Lehrstuhl
D fUr Mathematik, RWTH Aachen, 1992.
5. R. Brauer, Representations of finite groups, Lect. on Modern Math., Vol. 1, 133-
175, Wiley, New York 1963.
6. W. S. Burnside, Theory of groups of finite order, 2nd. ed. Cambridge 1911.
7. P. M. Cohn, Morita equivalence and duality, Mathematics Notes. Queen Mary
College, London, 1966.
8. S. B. Conlon, Calculating characters of p-groups, J. Symb. Comput. 9 (1990),
535-550.
9. J. H. Conway, R. T. Curtis, S. P. Norton, R. A. Parker, R. A. Wilson, Atlas of
Finite Groups. Oxford: Clarendon Press, 1987.
10. G. Cooperman, G. Hiss, K. Lux, J. Muller, The Brauer tree of the principal 19-
block of the sporadic simple Thompson group, Exp. Math. 6 (1997), 293-300.
11. E. C. Dade, Answer to a question of R. Brauer, J. Algebra 1 (1964), 1-4.
12. J. D. Dixon, High speed computation of group characters, Numer. Math. 10 (1976),
446-450.
13. G. Entz, H. Pahlings, The Dade conjectures for the McLaughlin group,
Proc. Groups-St. Andrews in Bath (1997), to appear .
14. C. D. Feustel, E. L. Green, E. Kirkman, J. Kuzmanovich, Constructing projective
resolutions, Comm. Algebra 21 (1993), 1869-1887.
15. G. Frobenius, Uber die Primfaktoren der Gruppendeterminante, I, II Sitzungs-
berichte Akad. Wiss. Berlin 1896, 1343-1382, 1903, 401-409.
16. A. Henke, G. Hiss, J. Muller, The 7-modular decomposition matrices of the spo-
radic O'Nan group, to appear in Journal of the London Math. Soc.
17. G. Hiss, The 3-modular characters of the Rudvalis sporadic simple group and its
covering group, Math. Compo 62 (1994), 851-863.
18. G. Hiss, C. Jansen, K. Lux, R. A. Parker, Computational Modular Character
Theory, manuscript, 1997.
19. G. Hiss, K. Lux, The 5-modular characters of the sporadic simple Fischer groups
Fb2 and Fb3, with an appendix by Thomas Breuer, Comm. Algebra 22 (1994),
3563-3590.
20. G. Hiss, J. Muller, The 5-modular characters ofthe sporadic simple Rudvalis group
and its covering group, Comm. Algebra 23 (1995), 4633-4667.
21. G. Hiss, D. White, The 5-modular characters of the covering group of the sporadic
simple Fischer group Fb2 and its automorphism group, Comm. Algebra 22 (1994),
3591-3611.
396 K. Lux, H. Pahlings
22. H. J. Hoehnke, K. W. Johnson, The 1-, 2-, and 3-characters determine a group,
Bull. Amer. Math. Soc. 112 (1992), 243-245.
23. A. Hulpke, Zur Berechnung von Charaktertafeln, Diplomarbeit, Lehrstuhl D fUr
Mathematik, RWTH Aachen, 1993.
24. C. Jansen, Irreduzibilitatskriterien in der modularen Darstellungstheorie, Diplo-
marbeit, Lehrstuhl D fUr Mathematik, RWTH Aachen, 1991,
25. C. Jansen, Ein Atlas 3-modularer Charaktertafeln, Dissertation RWTH Aachen,
1995.
26. C. Jansen, The minimal degrees offaithful, irreducible representations for sporadic
simple groups and their Schur covers, private communication, 1996.
27. C. Jansen, K. Lux, R. A. Parker, R. A. Wilson, An Atlas of Brauer Characters,
London Mathematical Society Monographs New Series. Oxford University Press,
1995.
28. C. Jansen, J. Miiller, The 3-modular decomposition numbers of the sporadic simple
Suzuki group, Comm. Algebra 25 (1997), 2437-2458.
29. C. Jansen, R. A. Wilson, The 2- and 3-modular characters of the O'Nan group, to
appear in Journal of the London Math. Soc.
30. K. W. Johnson, S. K. Sehgal, The 2-characters of a group and the group determi-
nant, European J. Combin. 16 (1995), 623-631.
31. W. Kimmerle, K. W. Roggenkamp, Non-isomorphic groups with isomorphic spec-
tral tables and Burnside matrices, Chinese Ann. Math. Ser. B 15 (1994) 273-282.
32. P. Landrock, G. O. Michler, Block structure of the smallest Janko group, Math.
Ann. 232 (1978), 205-238.
33. W. Lempken, R. Staszewski, The structure of the projective indecomposable mod-
ule of 3M;; in characteristic 2, Math. Compo 62 (1994), 841-850.
34. K. M. Lux, Algorithmic methods in modular representation theory, Habilitation-
sschrift RWTH Aachen 1997.
35. K. M. Lux, H. Pahlings, Computational aspects of representation theory, in:
Representation Theory of Finite Groups and Finite-Dimensional Algebras (ed.
G. O. Michler, C. M. Ringel) pp. 37-64, Basel: Birkhauser Verlag 1991.
36. K. Lux, M. Wiegelmann, An application of condensation, the P.I.M. structure of
the Mathieu group M23, in preparation, 1997.
37. K. Lux, M. Wiegelmann, Condensing tensor product modules, in: R. T. Curtis
and R. A. Wilson, editors, The Atlas, Ten Years On, Cambridge University Press,
to appear.
38. T. Merkwitz, Diplomarbeit, Lehrstuhl D fUr Mathematik, RWTH Aachen, in
preparation.
39. J. Miiller, The 5-modular characters of C03, Diplomarbeit, Lehrstuhl D fUr Math-
ematik, RWTH Aachen, 1991.
40. J. Miiller, J. Rosenboom, Condensation of induced representations and an appli-
cation: The 2-modular decomposition numbers of C02, to appear in Proceedings
of the Euroconference on representation theory, Institute of experimental mathe-
matics, Essen, Germany.
41. J. Neubiiser, An invitation to computational group theory, in: Groups Gal-
way/St. Andrews '93, vol. 2 (1995),457-475.
42. J. Neubiiser, H. Pahlings, W. Plesken, CAS; design and use of a system for the
handling of characters of finite groups, in: Computational Group Theory (ed.
M. D. Atkinson) pp. 195-247, London: Academic Press 1984.
43. E. A. O'Brien, The groups of order 256, J. Algebra 143 (1991), 219-235.
Computational Representation Theory 397
44. G. Pfeiffer, The subgroups of M24 or how to compute the table of marks of a finite
group, preprint.
45. M. Ringe, The C-MeatAxe, Release 2.2.0, Lehrstuhl D fiir Mathematik, RWTH
Aachen, 1994.
46. A. J. Ryba, Condensation programs and their application to the decomposition of
modular representations, J. Symb. Comput. 9 (1990), 591-600.
47. G. J. A. Schneider, The Mathieu group Mll, vertices, Green-correspondents and
soc1e series of the projective indecomposable modules, Master's thesis, St. Peter's
College, Oxford University, 1979.
48. G. J. A. Schneider, Dixon's character table algorithm revisited, J. Symb. Comput.
9 (1990), 601-606.
49. G. J. A. Schneider, The structure of the projective indecomposable modules of the
Mathieu group M12 in characteristic 2 and 3, Arch. Math. 60 (1993), 321-326.
50. M. Schiinert et. al. GAP - Groups, Algorithms, and Programming. Lehrstuhl D
fiir Mathematik, Rheinisch WestfaIische Technische Hochschule, Aachen, Germany,
fifth edition, 1995.
51. E. Skrzipczyk, Charaktertafeln von p-Gruppen. Diplomarbeit, Lehrstuhl D fiir
Mathematik, RWTH Aachen, 1992.
52. I. A. Suleiman, Modular Representations of Finite Simple Groups, PhD thesis,
University of Birmingham, 1990.
53. I. A. Suleiman, R. A. Wilson, The 3-modular characters of Conway's group C02,
Math. Proc. Cambridge Philos. Soc. 116 (1994), 275-283.
54. I. A. Suleiman, R A. Wilson, The 2-modular characters of Conway's third group
Co a , J. Symb. Comput. 24 (1997), 493-501.
55. K. Waki, The Loewy structure of the projective indecomposable modules for the
Mathieu groups in characteristic 3, Comm. Algebra 21 (1993), 457-1485.
56. K. Waki, The projective indecomposable modules for the Higman-Sims group in
characteristic 3, Comm. Algebra 21 (1993), 3475-3487.
57. S. Weiss, Die Berechnung von Ext-Kiichern mit Relationen fiir Bliicke von Grup-
penalgebren, Diplomarbeit, Lehrstuhl D fiir Mathematik, RWTH Aachen, 1995.
58. M. Wiegelmann, Fixpunktkondensation von Tensorproduktmoduln, Diplomarbeit,
Lehrstuhl D fiir Mathematik, RWTH Aachen, 1994.
59. R A. Wilson, An Atlas of group representations,
http://www.mat.bham.ac.uk./RA.Wilson/atlas.html. 1996.
60. R. A. Wilson, Standard generators for sporadic simple groups, J. Algebra 184
(1996), 505-515.
High Performance Computations In Group
Representation Theory
Gerhard O. Michler
Introduction
The classification of the finite simple groups is one of the important achievements
of the mathematicians in this century. According to p. 3 of the recent book [16]
by Gorenstein, Lyons and Solomon "The existing proof of the classification of the
finite simple groups runs to somewhere between 10 000 and 15 000 journal pages,
spread across some 500 separate articles by more than 100 mathematicians .... As
a result of these various factors, it is extremely difficult for even the most diligent
mathematician to obtain a comprehensive picture of the proof by examining the
existing literature." On p. 45 of [16] these authors write: "The most serious
problem concerns the sporadic groups, whose development at the time of the
completion of the classification theorem was far from satisfactory. The existence
and uniqueness of the sporadic groups and the development of their properties
form a very elaborate chapter of simple group theory, spread across a large
number of journal articles. Moreover, some of the results are unpublished (e.g.
Sims' computer calculations establishing the existence and uniqueness of the
Lyons group Ly). Furthermore, until very recently, the two principal sources for
properties of the sporadic groups were [5] and [15], Part 1, §5 consisting only of
statements of results without proofs."
Recently H. GoHan [11] has given a new computer construction of Lyons'
simple group Ly which is independent of Sims' unpublished work, see [30]. In
section 1 his methods are generalized in such a way that any sporadic simple
group G characterized by the centralizer H = Ga(u) of some involution u can
be constructed by the same procedure, provided sufficiently large computers are
available.
This approach to the revision project of the sporadic simple groups is mathe-
matically very easy. It requires the performance of many calculations with dense
matrices over small finite fields or large permutations. However, these computa-
tions are easily described and will be documented professionally.
In order to show that our approach of the revision project of the sporadic
simple groups is promising we describe in section 2 the recent existence proof
of the largest Janko group J 4 by Cooperman, Lempken, Michler and Weller [7],
see Theorem 2.5. This proof rests on Weller's construction [34] of a permutation
1I':X~Sm
finite simple group G. Let F = GF(q) be a finite field of odd characteristicp > O.
Let
<p: H -t GLn(q)
be a faithful semi-simple n-dimensional representation of Hover F such that
the eigenspace VI of u belonging to the eigenvalue 1 is a proper F H -submodule
of V = Fn.
Suppose that it has been proved that there is an isomorphism
H := <p(H) ~ CG(ut}.
Then there exists a simple subgroup a of GLn(q) with involution u = <p(u) and
centralizer Ca(u) = H which is isomorphic to G, if all the conditions of the
following steps can be verified:
(1) Construct a subgroup Min GLn(q) such that D = MnH is a proper subgroup
of H for which the restriction Vji) is a semi-simple FD-module. Then find
a matrix
t E CGLn(q)(H) \ CGLn(q) (D)/CGLn(q) (M)
such that V is a simple Fa-module for the subgroup a = (H, t- 1 Mt) of
GLn(q).
(2) Check that H ::; Ca(u).
(3) Applying the algorithm of Theorem 1.4 to the Fa-module V and its restric-
tion Vjii one obtains a permutation representation
1r : a -t 8 m with stabilizer H.
(f3) The restriction YIN of the simple FG-module V of step (1) of 1.5 contains a
proper FN-submodule W f; O.
Then by means of the following alternatives of the steps (3) and (4) of the
construction principle 1.5 one can save storage space and computing time.
(3') Applying the algorithm of Theorem 1.4 to the FG-module V and a minimal
F N-submodule W f; 0 of its restriction YIN to N one obtains a permutation
representation
7r : G --+ 8 m with stabilizer N.
ro = yx 21 y-1 U5 = U4(V4r2)2
r1 = x14yx21y-1x-14 = (ro)X28 U6 = [X 21 (X 21 )Yj2
r2 = y3 x 21 y7 Sl = y2 r1y -2 = (r1)y 8
r3 = x14y3x21y7 x- 14 = (r2)X 28 S2 = (X 21 )YX 28 = x14y-1x21yx-14
V1 = y6 x 21 y 4 d1 = [(rt}b, Sl], where b = y-2 x -6
V2 = y8 x 21y2 d 2 = (X 21 )Y
V3 = y4 x 21 y 6 a1 = d1X6d1X24d1
V4 = X 21 t1 = sl(r1)b s1
W1 = [x 6 ,y5] t2 = (X21 )y5
U1 = (V1 r 2)2 = [X-12(y5X6)2(X21)y-lX28j2 qo = r1 r3d1s1(x6 y2)4
U2 = (V3rO)2 a3 = Sl (qo)3 Sl (qO)4 SlS2
U3 = U1(V4 r0)2 a6 = t1 (X14y5)-2X14(x14y5)2t1
U4 = (V3 r 2)2 Z = (X14)t, where t = (X 14 y5)2
Observe that the elements a1, a3, a6, z E G have order 3, and qo E G has order
7. All other elements are involutions of G.
The following results are proved in my joint article [7] with Cooperman,
Lempken and Weller.
(e) M has six conjugacy classes of involutions with representatives: Ul, TO, Tlr2,
WI, U4rO and U4rlT2'
gl = 1,
g2 = x2c14 ycS yc2 and 1f32Gll = 15180,
93 = X2c 14 ycS yc7 and 1f33G21 = 4032,
94 = X2c 14 ycS yc3 and 1f34G31 = 256,
=
9s X2c 14 ycS yc6 and If3SG4/ = 4,
=
96 x2c 14 ycS yc4 and /f36Gsl = 4,
97 = X2c 14 ycS ycS and /f3TG 6 1 = 2
is a base for G with set stabilizers Gj = Staba; -1 ( {f31, f32, ... , f3j}), 1 :::; j :::;
7, where Go = G.
High Performance Computations in Group Representation Theory 407
(f) The following words Si, 1 ~ i ~ 11, form a set of strong generators for G
with respect to the base B:
where d = [(S4)2S5(S4)2]-1, has order 12 and stabilizes base point (31, and
e = S4(S5)2(S4)2[(S4)4(S5)2S4]-1 has order 4 and stabilizes base points (31,/32
and /33-
(g) The set stabilizers G i , 0 ~ i ~ 7, have the following strong generators:
Go = (S1,S2), G 1 = (82,S3),
G 2 = (S4,S5), G3 = (S6,S7,S8,S9,S12,S14),
G 4 = (S1O, SU, S12, S13, 814), G 5 = (S12, S13, S14),
G6 = (S14), G7 = L
Theorem 2.4 is the basic ingredient of the new existence proof for Janko's
sporadic simple group J4 [18] given by G_ Cooperman, W_ Lempken, M. Weller
and the author. Theorem 5.1 of their article [7] is restated as follows.
Theorem 2.4. Let G = (x, y) where x, y E GL 1333 (11) are matrices constructed
in f19} of orders o(x) = 42 and o(y) = 10. Then G is a simple group of order
(a) Q = 02(H) = (Ul, U2, U3, U4, us, VI, V2, ro, rl, r2, r3, d l , SI) is an extmspecial
normal subgroup of H with IQI = 2 13 .
(b) The element al = dlx6dly24dl of order 3 genemtes a Sylow 3-subgroup of
02,3(H), and GQ(al) = Z(Q) = (Ul).
(c) Uo = Gn(aI) = (U6,V3,V4,d2,s2,t2,al,a3,a6,xl 4, y S) ~ 6M22 , the sixfold
cover of the Mathieu group M22 with center Z(Uo) = (Ulal) .
(d) U = N n (al)) = Uo : (h) is a subgroup of H with U/Z(Uo) ~ Aut(M22)'
and center Z(U) = (Ul).
(e) H = QU, and Q n U = (Ul) = GQ(aI).
In particular, G is isomorphic to Janko's simple group J4.
In order to construct Janko's simple group h as an irreducible subgroup
of GL 112 (2) in [23) M. Weller and the author determine the structure of the
endomorphism ring End FG(P) of the 2-modular reduction P = Pz ®z F of the
integral permutation module Pz of degree 173067389 of G.
Proposition 2.2. Let Pz = 1M ®ZM ZG be the integml permutation module of
G = (x,y) with respect to the subgroup M = (x 3,y,z). Then by Theorem 2.4 (d)
G has seven double cosets MXiM of M, 1 ::; i ::; 7. By [l} to each double coset
MXiM corresponds an integml intersection matrix Dk = (pfj) of Pz . All the
entries pfj = I{Mx E MXkM 1 Mxxj E MXiM}I, 1::; i,j, k::; 7, are computed
explicitly in Theorem 2.1 of [23}.
Furthermore, the endomorphism ring EndzG(Pz) of Pz is commutative.
Using this result the precise ring structure of EndFG(P) is given in Theorem
2.2 of [23). Applying then Gollan's and the author's method [13] to construct a
45694-dimensional simple submodule of 9606125-dimensional permutation mod-
ule of Lyons' group Ly over F, Weller and the author prove in Theorem 3.1 of
[23) the following result.
Theorem 2.5. Let P be the permutation module of G = (x, y) with respect to
the subgroup M = (x 3, y, z) over the field F = GF(2). Let'P be the endomor-
phism of P corresponding to the 2-modular reduction
0000001
0000001
0000000
d7 := 0000000
0000000
0000000
0000000
~
J:= K-7r{G) = (K-7r{X),K-7r{Y)) ~ GL1l2 (2)
is commutative.
The explicit 112 x 112-matrices X = K-7r(x) and Y = K-7r(Y) are stated in [23].
By Theorems 2.4, 2.5 and 2.7 the subgroup J = (K-7r(x), K-7r(Y)) of GL 112 (2)
is isomorphic to Janko's simple group J4, see [18].
Since K-7r : G -+ J is an isomorphism, we denote the elements of J by the
same letter g as their preimages in G = (x, y) in order to simplify the notations.
In particular, now x,y E J ~ GL 1l2 (2), and all the elements rO,rl, ... ,a6,z of
Notation 2.1 are constructed in GL1l2(2) by the same formulas as given there.
Furthermore, all the statements of Theorem 2.5 characterizing G as the simple
Janko group J4 hold verbatim for J as for G. In particular, now all the conditions
of Theorem 2.5 can be verified by means of GAP [29] or MAGMA [3] on a
workstation.
By Theorem 2.4 (e) and (f) the group 7r(G) ~ 5173067389 has 14 strong gener-
ators 7r(Si), 1 ~ i ~ 14, with respect to the base B = [Mg l , Mg 2, ... , Mg7]. The
generators of the set stabilizers 7r(G)k = Stab,,(Gk_l){{Mgl ,Mg2, ... ,Mgk })
are given in Theorem 2.4 (g). Now let Jk = K-[7r{Gh] for 0 ~ k ~ 7. Let
M = (x 3 , y, z) ~ J = (x, y) ~ GL112(2). Using GAP it follows that the 3
matrices x 3 , y, z E GL 1l2 (2) have a unique common eigenvector 0 of. v E F1l2
with eigenvalue 1 E F. Let VI = V, and Vk = Vgk for 2 ~ k ~ 7. The set
B = [VI, V2, ... , V7] is called a generalized base of J.
For each step i E {I, 2, ... , 7} one computes the whole orbit ViJi - l in F112
under the multiplication of Vi with matrices of Ji-1. For each orbit element 0
one keeps track of the word w in the generators Sj of Ji-l which map Vi to
this orbit element o. This information can be stored efficiently in the so called
Schreier vector Si for 1 ~ i ~ 7. The implementation is described in section 4 of
Weller's article [34].
Theorem 2.6. Let B = [Vi \1 ~ i ~ 7] be the generalized base of the (Janko)
subgroup J = (x, y) of G Lll2 (2). Then the following assertions hold:
(a) 5 = {Sj E J \1 ~ j ~ 14} is a set of strong generators of J.
(b) For each vector Vi and subgroup J i - l defined above there is a 5chreier vector
Si describing the elements of the vector orbit set V;]i-l, where 1 ~ i ~ 7.
(c) Membership test: Let A be an arbitrary matrix of GLu2(2). Then by the
following algorithm it is decidable whether A belongs to the subgroup J of
GL1l2(2). If so, then this algorithm enables one to write A as a word in the
strong generators Sj E 5 of J.
410 G.O. Michler
The final part describes applications of the parallel linear algebra algorithms
to obtain efficient implementations of Berlekamp's and Niederreiter's algorithms
for the factorization of polynomials f(X) E IF[X] into irreducible factors. Using
the Gaussian elimination algorithms of the lecture notes [9], P. Roelse shows
by examples that one can factorize arbitrary polynomials f(X) of large degree
d provided IF' is small. In fact implementing his deterministic algorithms [27]
efficiently on the IBM SP2 supercomputer of the Theory Center of Cornell Uni-
versity he has been able to factorize a polynomial f(X) E GF(2)[X] of degree
d = 300000 chosen at random into its irreducible factors. This is a world record
according to the recent book [8) by Diaz, Emiris, Kaltofen and Pan. The running
times of his experiments on the supercomputer are given as follows.
Parallel CPU times for polynomial factorization in GF(2)[X]
Degree Degrees
d # processors Task CPU time of factors
75000 64 setup 0.27 1, 17, 35, 40, 81,
nullspace 2641 311, 622, 12225,
extract 26.9 21515,40153
total Oh44'28"
150000 128 setup 0.54 2,5,27,36,47,66,276,
nullspace 9623 284, 316, 948, 3690,
extract 106 5066,8536, 14534,
total 2h42'10" 27871,88296
300000 256 setup 1.25 2, 4, 165, 233, 275
nullspace 36540 10965, 18709,
extract 236 41770, 100948,
total lO h 12'57" 126929
Theorem 4.1. The 2-modular character table of Conway's group C03 is:
ACKNOWLEDGEMENTS
The author of this paper has been supported by the DFG research project "Algo-
rithmic Number Theory and Algebra" .
414 G.O. Michler
References
1. E. Bannai, T. Ito, Algebraic Combinatorics I: Association Schemes, Benjamin and
Cummings Publishing Company, Menlo Park, California (1984).
2. D. J. Benson, The Simple Group h, PhD. Thesis, Trinity College, Cambridge
(1980).
3. W. Bosma, J. Cannon, MAGMA Handbook, Sydney (1993).
4. G. Butler, Fundamental algorithms for permutation groups, Lecture Notes in Com-
puter Science, Springer Verlag, Heidelberg (1991).
5. J. H. Conway, R. T. Curtis, S. P. Norton, R. A. Parker, R. A. Wilson, Atlas of
finite groups, Clarendon Press, Oxford (1985).
6. G. D. Cooperman, L. Finkelstein, M. Tselman, B. York, Constructing permutation
representations for large matrix groups, J. Symb. Comput. 24 (1997), 471-488.
7. G. D. Cooperman, W. Lempken, G. O. Michler, M. Weller, A new existence proof
of Janko's simple group J4, Preprint, IEM Essen (1997).
8. A. Diaz, I. Z. Emiris, E. Kaltofen, V. Y. Pan, Algebraic algorithms, in: M. Attalah
(ed.), The Computer Science Handbook, CRC Press, Boca Raton, Florida (1998).
9. P. Fleischmann, G. O. Michler, P. Roelse, J. Rosenboom, R. Staszewski, C. Wagner,
M. Weller, Linear algebra over small finite fields on parallel machines, Vorlesungen
Fachbereich Mathematik Universitat GH Essen, Heft 23 (1995).
10. H. W. Gollan, The 5-modular representations of the Tits simple group, Math.
Compo 57 (1991), 369-386.
11. H. W. GoHan, A new existence proof for Ly, the sporadic group of R. Lyons,
Preprint, !EM Essen (1995).
12. H. W. Gollan, A contribution to the revision project of the sporadic group: Lyons'
simple group Ly, VOrlesungen Fachbereich Mathematik Universitat GH Essen, Heft
26 (1998).
13. H. W. GoHan, G. O. Michler, Construction of a 45694-dimensional simple module
of Lyons' sporadic group over GF(2), Linear Algebra Appl. 256 (1997), 185-197.
14. D. Gorenstein, Finite simple groups, Plenum Press, New York (1982).
15. D. Gorenstein, R. Lyons, The local structure of finite groups of characteristic 2
type, Mem. Amer. Math. Soc. 276 (1983).
16. D. Gorenstein, R. Lyons, R. Solomon, The classification of the finite simple groups,
Mathematical Surveys and Monographs 40, No.1, American Mathematical Society,
Providence, Rhode Island (1994).
High Performance Computations in Group Representation Theory 415
17. 1. M. Isaacs, Character theory of finite groups, Academic Press, New York (1976).
18. Z. Janko, A new finite simple group of order 86 . 775 . 571 . 046 . 077 . 562 . 880
which possesses M24 and the fu.!l couering group of M22 as subgroups, J. Algebra
42 (1976), 564-596.
19. W. Lempken, Constructing h in GL(1333, 11), Comm. Algebra 21 (1993), 4311-
4351.
20. K. Lux, J. Miiller, M. Ringe, Peakword condensation and submodule lattices: an
application of the MeatAxe, J. Symb. Comput. 17 (1994), 529-544.
21. G. Malle, J. Saxl, T. Weigel, Generation of classical groups, Geom. Dedicata 49
(1994),85-116.
22. G. O. Michler, 0. Solberg, Testing modules of groups of even order for simplicity,
J. Algebra, (to appear).
23. G. O. Michler, M. Weller, A new computer construction of the irreducible 112-
dimensional 2-modular representation of Janko's group J4, Preprint, IEM Essen
(1998).
24. J. Miiller, J. Rosenboom, Condensation of induced representations and an appli-
cation: The 2-modular decomposition numbers of C02, Preprint (1997).
25. S. Norton, The construction of h, Proceedings of Symposia in Mathematics AMS
37 (1980), 271-277.
26. R. A. Parker, R. A. Wilson, The computer construction of matrix representations
of finite groups over finite fields, J. Symb. Comput. 9 (1990), 583-590.
27. P. L. A. Roelse, Factoring high-degree polynomials over F2 with Niederreiter's al-
gorithm on the IBM SP2, Math. Comp., (to appear).
28. J. Rosenboom, Completing the 2-modular character table for Conway's third group
C03 using parallel computers, Exp. Math., (to appear).
29. M. Schiinert et al., GAP - Groups, Algorithms, and Programming, 3rd ed.,
Lehrstuhl D fiir Mathematik, RWTH Aachen (1993).
30. C. C. Sims, The existence and uniqueness of Lyons' group, in: T. Gagen, M. P.
Hale Jr., E. E. Shult (eds.), Finite groups '72 (Gainsville Conference) pp. 138-141,
North Holland, Amsterdam (1972).
31. 1. Suleiman, R. A. Wilson, The 2-modular characters of Conway's group Co2, Math.
Proc. Cambridge Philos. Soc. 116 (1994), 275-283.
32. 1. Suleiman, R. A. Wilson, The 2-modular characters of Conway's third group Co a ,
J. Symb. Comput. 24 (1997), 493-506.
33. J. G. Thompson, Finite-dimensional representations of free products with an amal-
gamated subgroup, J. Algebra 69 (1981), 146-149.
34. M. Weller, Construction of large permutation representations for matrix groups,
Preprint, IEM Essen (1997).
35. M. Wiegelmann, Fixpunktkondensation von Tensorproduktmoduln, Diplomarbeit,
Lehrstuhl D fiir Mathematik, RWTH Aachen, (1994).
The Structure of Maximal Finite Primitive
Matrix Groups
Gabriele Nebe
1 Introduction
The finite subgroups of GLn(Q) are classified up to dimension n = 31 by giving
a system of representatives for the conjugacy classes of the maximal finite ones
([12], [9], [5], [6], [7]) cf. [11] for a survey on this and interrelations between these
groups. Recently the classification has been extended to the one of absolutely
irreducible maximal finite subgroups G of GLnCD), where V is a totally definite
quaternion algebra and n . dimQ(V) ::; 40. As usual a subgroup G ::; GLn(V)
is called absolutely irreducible, if the enveloping Q-algebra QG := n:::9EG
agg I
a g E Q} ~ vnxn is the whole matrix ring vnxn (cf. [8]). The classification of
these groups yields a partial classification of the rational maximal finite matrix
groups in the new dimensions 32, 36, and 40 on one hand and on the other hand
it gives nice Hermitian structures for interesting lattices. For example one finds
eleven quaternionic structures of the Leech lattice as a Hermitian lattice of rank
n > lover a maximal order in a definite quaternion algebra V with absolutely
irreducible maximal finite automorphism group as displayed in Table l.
Rather than giving a survey of the classification results this note is devoted
to a general structure theorem (cf. Theorem 4 below).
1) n Auth(L) IAuth(L)1
Qoo,2 6 2.G2(4) 2" 6 .36 .5'.7.13
Qoo,5 6 8L2(25) 24 .3.5 2 .13
Qoo,5 6 2.h.2 29 .33 .5 2 .7
Qoo,l1 24 .3.5.11
6 8 L 2(11).2
Qoo,13 6 8L2(13).2 24 .3.7.13
Q.j3,002 3 (±U3(3)).225 .33 .7
Q,;5,002 3 2.h 28 .33 .5 2 .7
Q.,f'f,002 3 cd L2(7) 26 .3.7
Qv'f3,002 3 8L2(13) 23 .3.7.13
Qv'2T,002 3 ±cd L2(7) 25 .3 2 ·7
QW,",003 13 2 ±CI3 .C4 23 .13
Remark 1. Let G ::; GLn(V) be a finite primitive group and N:::! G be a normal
subgroup of G. Then the K-algebra KN spanned by the matrices in N over the
centre K = Z(V) is a simple algebra.
p-subgroups of a primitive matrix group G. Using this list and the classification
of finite simple groups and their characters (d. [2), [4], [15)), one gets the candi-
dates for the generalized fitting groups Fitgen (G), the product of the maximal
nilpotent normal subgroup with the quasi-semi-simple normal subgroups of G,
for finite primitive subgroups G of GLn(V).
3 Arithmetic Properties
To get further insight in the structure of the maximal finite primitive matrix
groups one has to use some arithmetic properties. Let ZK be the ring of integers
in K and M a maximal order in V!::'! Endvnxn(V n ).
If G :$ GLn(V) is a finite matrix group, then the Z-lattice M ®ZK ZKG
spanned by M and the matrices in G is closed under multiplication hence it is
an order in the algebra V ®K KG. Since orders are contained in maximal orders
and the latter are endomorphism rings of lattices (finitely generated projective
M-modules that span vn) one gets that the set of G-invariant M-Iattices
is a normal subgroup of G.
420 G. Nebe
Theorem 1. (ef. [8, Theorem 6.8)} If L = Z(A) is a totally real number field
then the quotient group S / B is of exponent 1 or 2.
Proof. Denote the commuting algebra Cvnxn (A) by C. Then Z(A) = Z(C).
Choose F E Fj;°(N). Then e f-+ eO := Fetr F- 1 is an involution on C and on
A. The other N-invariant Hermitian forms are of the form cF with c E C+,
the fixed space of o. Note that 0: C --+ C depends on the choice of F but
the corresponding involution on A does not. In particular the restriction of the
involution to the center L is independent of the choice of F. One easily sees
that the induced involution is the complex conjugation on L, hence it is trivial
because L is totally real.
Let s E S. Since s induces a L-algebra automorphism on the central simple L-
algebra A, the theorem of Skolem and Noether implies that there is an invertible
element a E A with as- 1 E C. The matrix aFatr is again N-invariant, hence
aFa tr = cF for some c E C+. Moreover c = aFatr F- 1 lies in the center L of
Maximal Finite Matrix Groups 421
C, because for x E C one has cxo = aPa,tr p-l (Px tr P-l) = aPa,tr xtr p-l =
aPxtra,tr p-l = axe pa,tr p-l = xOc.
Therefore c E A and a 2 c- 1 P(a 2 c 1)tr = P. Since the element s has finite
order, there is mEN such that (a 2 c- 1 )m E L commutes with the elements of N.
Then P = (a 2 c- 1)mp«a2 c l)m)tr = (a2c- 1)m(a 2c- 1)mp. Hence (a 2c- 1)2m =
1 and therefore a 2 c- 1 is a unit of finite order in A normalizing N. By Remark 2
one gets that b:= a 2 c- 1 lies in B. Moreover b- 1 S 2 commutes with every element
of N and therefore lies in Ca(N) ~ N. 0
If L = Z(A) is not totally real, then the theorem above may be no longer
true. I have no example of a primitive maximal finite group G where 5/ B is of
exponent > 2 but the following example shows that such groups are likely to
exist. Let N := C3 x C7 : C3 = (z,x,y I z3,x 7 ,y3,xY = x 2 ). Then N has an
automorphism s of order 3, with z' = z, x' = x, yS = yz. The non split extension
C2 x N.(s) is a maximal subgroup of the GU3 (5), in fact one could replace N
by the irreducible matrix group 3.u3(5):-:; GL144(L) where L:= Q[v'-3,H].
N has an irreducible faithful representation into GL3(L). The corresponding
character extends to 5 := ±N : (s) but the character value of xs involves further
irrationalities. So if N (or 3.u3 (5)) is a normal subgroup of a maximal finite
primitive group G, then 3 divides the order of 5/ B (in the notation above).
The reason for this phenomenon is that L = Z(QN) is a complex field. The
element a 2 c 1 in the above proof only satisfies
(a2c- 1)m(a 2c 1)m = 1. Let P be a prime ideal in Land (Ae)P be the completion
ofthe hereditary order Ae at the prime P. The (Ae)p-Iattices in the simple (Ae)P-
module form a chain Ch p . If P :f. P, then a 2 c- 1 acts on this chain Ch p say by
shifting k-steps down. But then a 2 c- 1 acts on the chain ChI> by shifting k-steps
up.
The group generated by all possible shifts is abelian, so one finds that the
image 5/ B of 5 is abelian. To get bounds on the rank and exponent of this
abelian group let U1(L) := {x E L* I xx = I} be the unitary group of Land
rp(U1(L)) be the image of U1(L) in I15hp =: 5h where the product runs over
the set of unordered pairs of prime ideal P :f. P of L. Then 5 2 / B is isomorphic
to a subgroup of 5h/(rp(U1(L)).
Let Cl be the group of all ideals I of L with II = (1) modulo the group
of principal ideals that are generated by an element of Ut{L). Denote the rank
of Cl by t and its exponent by g. Let r be the number of pairs of prime ideals
P :f. P of L that divide the discriminant of Ae (and therefore the order of N).
Let d be the degree of a L-irreducible constituent of the natural representation
of N.
Proposition 1. The quotient group 5/ B is abelian. The rank of 52 / B is bound-
ed by r + t and its exponent divides dg.
Proof. For a prime ideal P of L let 5hp ==' Z denote the group of all inclusion
preserving permutations of the (Ae)p-Iattices in the irreducible (Ae)p-module
that are induced by elements of the normalizer of (Ae)P in Ap. The multiplicative
group L* acts on the Ae-Iattices, hence one gets a homomorphism L* --+ 5hp.
422 G. Nebe
Hence the rank of S2 IBis bounded by the rank of Shl t.p(U1(L)) ::; r + t and
the exponent of S21B divides exp(Shlt.p(U1(L))) which divides dg. 0
References
1. H. Benz, H. Zassenhaus, Uber verschriinkte Produktordnungen, J. Number Theory
20 (1985), 282-298.
2. J. H. Conway, R. T. Curtis, S. P. Norton, R. A. Parker, R. A. Wilson, Atlas of
finite groups, Oxford University Press 1985.
3. B. Huppert, Endliche Gruppen I, Springer-Verlag 1967.
4. V. Landazuri, G. M. Seitz, On the minimal degrees of projective representations of
the finite Chevalley groups, J. Algebra 32 (1974), 418-443.
5. G. Nebe, Endliche rationale Matrixgruppen vom Grad 24, Dissertation, RWTH
Aachen 1995.
6. G. Nebe, Finite rational matrix groups of degree 24, Exp. Math. 5 (1996), 163-195.
7. G. Nebe, Finite subgroups ofGLn('Q) for 25::; n::; 31, Comm. Algebra 24 (1996),
2341-2397.
8. G. Nebe, Finite quaternionic matrix groups, Electronic Journal of Representation
Theory, to appear.
9. G. Nebe, W. Plesken, Finite rational matrix groups of degree 16, AMS-Memoirs,
No. 556, vol. 116 (1995).
10. W. Plesken, Some applications of representation theory, Progress in Mathematics,
vol. 95: Representation Theory of Finite Groups and Finite-Dimensional Algebras,
(ed. by G. O. Michler, C. M. Ringel), (1991), pp. 477-496.
11. W. Plesken, Finite rational matrix groups - a survey, Proceedings of the conference
"The Atlas Ten Years On" 10. - 13.7.95 in Birmingham. LMS Lecture Notes in
Mathematics (ed. by R. T. Curits, R. A. Wilson).
12. W. Plesken, G. Nebe, Finite rational matrix groups, AMS-Memoirs, No. 556, vol.
116 (1995).
13. I. Reiner, Maximal orders, Academic Press 1975.
14. I. Schur, Uber eine Klasse von endlichen Gruppen linearer Substitutionen, pp. 128-
142 in I. Schur: Gesammelte Abhandlungen I, Springer-Verlag 1973.
15. G. M. Seitz, A. E. Zalesskii, On the minimal degrees of projective representations
of the finite Chevalley groups, II, J. Algebra 158 (1993), 233-243.
Presentations and Representations of Groups
W. Plesken
RWTH Aachen, Lehrstuhl B fUr Mathematik
Templergraben 64, 52062 Aachen, Germany
[email protected]
1 Introduction.
Solving equations belongs to the oldest problems in mathematics. In a wider
sense, analyzing a group presentation
G = (Xl, .. ' ,Xn!rt(Xl,'" ,Xn) = 1, ... ,rk(xl,'" ,xn) = 1)
belongs to this class of problems, because one wants to know the most general
solutions of the defining relations. It is well known that there is no general pro-
cedure to solve the word problem by the famous Novikov-Boone Theorem, cf.
[30] Chapter 13 for an exposition. Even deciding the question whether G is finite
or infinite cannot be solved in general. Nevertheless, one can try to prove that
G is infinite, if one suspects this, by solving the equations given by the relators
in some group, where one can compute, for example in a matrix group. In case
of success this produces an epimorphic image of G which might be infinite. And
even if it is finite, one might use the representations of the finite quotient to pro-
duce bigger epimorphic images. Various techniques for carrying out these ideas
have been developed over the last years. They will be described in the next few
chapters.
There are many other methods available to study finitely presented groups
from the computational point of view. To prove finiteness the classical Todd-
Coxeter algorithm is still the basic tool. Knuth-Bendix type approaches have
been successful to prove infiniteness in many cases, cf. [34] for a general reference
and [8], [9] for recent progress. For instance, Derek Holt has managed to prove
the infiniteness of the Heineken group
(Xl,X2,X31[[Xl,X2],X2] = X3, [[X2,X3],X3] = Xl, [[X3,Xl],Xl] = X2),
which still has not been possible by any other methods. But on the other hand,
the methods in this survey have successfully produced infiniteness results, where
all other methods failed. The infinity proofs for finitely presented groups dis-
cussed in this paper always exhibit an infinite, residually finite factor group
explicitly.
Roughly speaking there are two methods to be described here. The first one
is not yet very much developed. It typically produces images whose Zariski clo-
sure in some general linear group is a nonsoluble algebraic group and at the
same time it reveals infinitely many finite simple factor groups of G. This will
be discussed in Chapter 4. The second method produces an image, which is free
abelian by finite. This is discussed in Chapter 3. Since this method requires ratio-
nal representations of finite groups, methods for constructing those are surveyed
in Chapter 2, which of course is also of independent interest. Finally Chapter 5
gives some short comments on the soluble quotient algorithm, where the basic
ideas underlying the methods of this paper have been automatized in [26) and
[1), and makes some comments on possible future development.
Note, splitting fields always contain the field Q[x) of character values of x.
Moreover Q[X) is already a splitting field if and only if the rational Schur index
of X is 1. Otherwise, a minimal splitting field need not be a splitting field of
minimal degree. The question arises where to look for splitting fields of minimal
degree. By Brauer's well known theorem one knows that the the e-th cyclotomic
number field is always a splitting field, where e is the exponent of G. B. Fein
has given examples, cf. [5), [6), of finite soluble groups with characters which do
not have an abelian splitting field of minimal degree at all. Nevertheless he gives
also conditions, cf. also [23), when one has a splitting field of minimal degree as
Presentations and Representations of Groups 425
subfield of the e-th cyclotomic number field. Even for soluble groups the degrees
of the characters seem to get rather big, when one does not have such a conve-
nient splitting field of minimal degree.
Already in the case of trivial Schur index, where the minimal splitting field
is unique, it is not always a triviality to realize the representation over this field.
Here is a corollary of Proposition 3.1 of [29), which at least shows how to retain
the minimal splitting field in the absence of a non trivial Schur index, when one
extends irreducible representations of subgroups.
n2
r(g) = L X(gbi)Ll(bi) ,
i=l
In [29] the use of this result for the actual construction of representations over
splitting fields of minimal or at least almost minimal degree has been shown.
For finite soluble groups the first two problems have been solved by Herbert
Bruckner in his thesis, cf. [1], more precisely, he gives an algorithm finding the
absolutely irreducible representations of a finite soluble group G over splitting
subfields of minimal degree of the e-th cyclotomic number field with e the expo-
nent of G. Often the third problem is clear, but in principle one would have to
verify that the splitting field he has found is not only of minimal degree among
the subfields of the cyclotomic field but of minimal degree among all splitting
fields. I come back to this later on. From a theoretical point of view his method
426 W. Plesken
leaves no gaps for the first two steps, however from a practical point of view, it
relies on other systems like KANT, cf. [19], to solve certain relative norm equa-
tions in abelian number fields. Here is a short outline of the method: Following
the classical approach by 1. Schur, one works one's way up a composition series of
G by extending absolutely irreducible representations whenever possible and in-
ducing them otherwise, starting with the trivial representation of the I-subgroup
ending with a set of representatives of the absolutely irreducible representations
over some cyclotomic number field. Using Galois descent one tries all possible
subfields starting with the maximal ones to find a splitting field of minimal de-
gree for each representation. The latter part can be abbreviated by remembering
at which stages in its construction history one had induced (because that are the
steps where the centers might get smaller and the Schur indices might increase).
It is also for this Galois descent where the relative norm equations come in.
Coming to the last problem of verification, one could keep track of all the local
Schur indices or at least of the global one as one constructs the representations
going up the composition series. Therefore one knows the global Schur index of
the final representations in the end and can decide whether the field was of min-
imal degree. This is not (yet) implemented, partially because it is not absolutely
necessary for the application for the soluble quotient program for which it was
mainly designed, cf. last chapter.
This is how far the approach via minimal splitting fields has been developed
so far. The advantage of the approach is that the degrees of the representations
remain as small as possible until the end, when one views the module over the
rational group algebra, i.e. inserts the regular representation of the splitting field
in the matrix representation.
What looks like a possibility of doing approximate calculations, cf. [3], turns
into an efficient way to do precise calculations, if V is of the form C 0z M for
some 'LG lattice M, cf. [27]. Because then the G-action is written in terms of
integral matrices and the denominator for the matrix of p divides the order IGI
of the group G. In fact, convergence is good enough in many cases, that one can
guess correctly the matrix of p after a couple of iterations even without knowing
Presentations and Representations of Groups 427
For constructing QG-modules from which to start one has the whole spec-
trum of possibilities of inducing, tensoring etc .. The crucial problem, which re-
mains, is the analysis of endomorphism rings, more precisely, deciding whether
they are division algebras and in case they are not, to produce a non-zero sin-
gular element. Tricks how to get around this in favorable cases and how to do
it in simple cases are discussed in [27]. In general one needs a p-adic analysis of
the situation, algorithmic details of which have not yet been developed.
c) Other approaches.
I briefly mention two other approaches, none of which has been developed
far enough yet to a systematic method for finding all irreducible rational rep-
resentations in general. There is the vector enumerator method by S. Linton,
cf. [15], [16], which has also been used to construct representations of other al-
gebras than group algebras. More recently there is the integral meataxe by R.
Parker, cf. [25], adjusting the meat axe methods which were extremely successful
in positive characteristic, cf. [24], [13]. Surprisingly, it turned out that similar to
finite characteristic words in the group algebra yielding images of small corank
did not seem to be as rare as expected.
Suppose a presentation
428 W. Plesken
The method to be described here was first used in [11] and refined in [29]. It
proves infiniteness, whenever the old method does so, but the use of representa-
tions of finite factor groups keeps the numerical difficulties under better control.
It supposes that one has already an epimorphism tp : G --t H onto a finite group
H. One now observes, cf. [11], if Hl(G, M) i= 0 for some (simple) QH-module,
which is viewed as QG-module via tp, then G is infinite. The method is based on
this observation and the fact that HI (G, M) can easily be computed from the
presentation by using derivations, cf. [35] or [10], where Fox derivatives are used,
for which meanwhile an efficient implementation [32] is available. Group theo-
retically the non-vanishing of the first cohomology group means that cP can be
lifted to an epimorphism of G onto an extension of a ZH-Iattice in M extended
by H. Technically speaking, this method only involves formal manipulations and
solving rational linear equations, once tp and the (irreducible) rational represen-
tations of the factor group H are known, which was discussed in the previous
section. Strictly speaking irreducibility is not necessary. One has the following
remark.
If one wants to compare this method with the conventional method described
above, one takes tp to be the permutation representation on the cosets of the low
Presentations and Representations of Groups 429
index subgroup, kertp to be the core of this subgroup and M the permutation
module on the cosets. This puts it into evidence that the new method is stronger
when one is able to come up with the epimorphism and the representations. The
previous chapter dealt with the last point. For the first point we have various
alternatives to the low index subgroup program. If G / kertp is soluble, the very
successful implementation by H. Bruckner in [1] of the soluble quotient algo-
rithm developed in [26] ought to be able to find H and tp. In the nonsoluble
case, QUOTPIC, cf. [14] provides good facilities to test for simple factor groups
up to a certain order. Another method will come out of the discussion of the
next chapter.
When the finite factor group H is already big, constructing all irreducible
rational representations might be very time consuming. Therefore it is desirable
to have some less laborious device to discard most of the candidates without
constructing them. This has been successfully carried out for factor groups H "'"
L 2 (P) in [29] by using the simple lFpL2(P)-modules and computing their first
cohomology group as G-modules. For instance the group
To conclude this chapter, I discuss two examples which were given to us for
investigation by Derek Holt. The investigations carried out by T. Schulz and
myself demonstrate some ideas how to find the right candidate for the factor
group H. The groups G are given by
430 W. Plesken
In the example G = G(6) one had an epimorphism onto the Mathieu group
M 12 , and no such detailed calculations were available. We just checked whether
the obvious guess would work: lift to the 2-fold cover of M12 and put the faithful
20-dimensional 2 . M 12 -lattice underneath, which had a symplectic invariant as
in the other case. It worked right away and proved that G(6) is also infinite.
can only generate a finite subgroup. Then one can use the representations of
this subgroups to reduce the number of indeterminates drastically as well as the
degree of the equations. One can then use both resultant techniques as well as
Grabner basis techniques. If one has many equations, it is advantageous to do
the actual computations over Z, so that the final resultants give a very short
list of primes, which are the possible characteristics of the fields, where one can
expect solutions. This method is especially useful to find all primes p for which
one has an epimorphism onto L 2 (P), cr. [11] for examples and [33) for other gen-
eral techniques for computations in PGL2(k). Beyond that these methods are
rather limited indeed, though we had some open examples, whose infiniteness we
could prove by this method in [28), because suitable solutions in characteristic
zero existed.
numbers can be computed and one gets a representation over some algebraic
number field. Taking things modulo prime ideals, in which none of the denom-
inators of the matrix entries lie, yields usually infinitely many finite images of G.
The reader is referred to [12], where the up to now biggest example of this
kind is treated: G := (2,3,7; 11), where
This G maps onto H := L 2 (43) and the first four layers of the 43-lower cen-
tral series of the kernel are of dimensions 11,14,14,14 over lF43 , suggesting that
the simple Lie algebra of type G 2 is involved. The power structure suggested a
completely ramified extension of degree 2 of Z43, for which one has the choice
of two. This resulted in an approximate representation of G of degree 7 over
Z43[.J-43)' which by careful choice of the module bases could be turned via
LLL into a degree 7 representation of G over a non-abelian number field of de-
gree 10 over Q. After this infinitely many factor groups of the form G 2 (q) for
certain prime powers q could be exhibited, none of which is in computational
reach by other methods.
References
1. H. Briickner, Algorithmen fur endliche auf/osbare Gruppen und Anwendungen, in
preparation.
2. J. H. Conway, R. T. Curtis, S. P. Norton, R. A. Parker, R. A. Wilson, Atlas of
finite groups, Oxford University Press 1985.
3. J. D. Dixon, Computing irreducible representations of groups, Math. Compo 24
(1970), 707-712.
4. J. D. Dixon, M. P. F. du Sautoy, A. Mann, D. Segal, Analytic pro-p groups, LMS
Lecture Note Series 157, 1991.
5. B. Fein, Minimal splitting fields for group representations, Pacific J. Math. 51
(1974), 427-431.
6. B. Fein, Minimal splitting fields for group representations II, Pacific J. Math. 11
(1978), 445-449.
7. W. Gaschiitz, M. F. Newman, On presentations of finite p-groups, J. reine angew.
Math. 245 (1970), 172-176.
8. D. F. Holt, The Warwick automatic groups software, in Geometrical and Compu-
tational Perspectives on Infinite Groups, DIMACS Series in Discrete Mathematics
and Theoretical Computer Science, vol. 25, ed. G. Baumslag et al. 1995, pp. 69-82.
9. D. F. Holt, D. F. Hurt, Computing automatic coset systems and subgroup presen-
tations, submitted to J. Symb. Comput.
10. D. F. Holt, W. Plesken, Perfect groups, Oxford University Press 1992.
434 W. Plesken
_ V,VH<""....
(omputation.1
MtlhodJ
in (ommutltl..
Algebr. Volume J
Ind Algob.. lc
G•• metry M. Bronstein
Symbolic Integration I
Transcendental Functions
pp. J figs.
1997. XIII, 299
Volume 2 Hardcover DM 78,-
ISBN 3-540-60511-5
W.Y. Vasconcelos
This book is bound to become the
Computational Methods standard reference for symbolic inte-
in Commutative Algebra gration. The author is the leading
and Algebraic Geometry expert on this topic and his book is
the first one to treat it comprehen-
1998. XI, 394 pp. 11 lig•. sively and in detail including new
Hardcover DM u8,·
results. Many algorithms are given
ISBN 3-540-60510-7
in pseudocode and, hence, can be
Deals with methods and techniques implemented. Also serves as a
to tackle problems that can be repre- textbook to be used for lecture
sented by data structures which are courses on symbolic integration.
essentially matrices with polynomial
entries, mediated by th,e disciplines of
commutative algebra and algebraic
geometry. The volume relates discov-
eries by a growing. interdisciplinary
group of researchers in the past
decade, highlightening the use of
advanced techniques to bring down
the cost of computation. Includes
concrete algorithms written
in MACAULAY.
• •••••••••
Please order from
Springer-Verlag Berlin
FIX: +491301827 87- 301
~mlU: [email protected]
or through your bookseller
, Springer
AtjonCohtn
!WI,Cu"""
H.Sttlk IEdl,1 ltulKobiu
Some Tapas
of :;pocts
Alg_b,.le
(omputer (ryplog,.phy
Algebra
Volume 4 Volume 3
A.M. Cohen, H. Cuypers, N. Koblitz
H. Sterk (Eds.) Algebraic Aspects of
Some Tapas of Cryptography
Computer Algebra 1998. IX. '06 pp. 7 fig •.
'998. Approx. 350 pp. Hardcover DM 98.-
Hardcover DM 79.- ISBN 3-540-63446-0
ISBN 3-540-63480-0 A comprehensive and systematic
A variety of topics in computer alge- account on the Carath~odory and
bra at an accessible (upper undergrad- Kobayashi distances, hyperbolic
uate/graduate) level with a view complex spaces and holomorphic
towards recent developments. For mappings with geometric methods.
those wanting to acquaint themselves A very complete list of references
somewhat further with the material, should be useful for prospective
the book also contains seven 'projects', researchers in this area.
which could serve as practical sessions
related to one or more chapters.
The contributions focus on topics
like Grabner bases, real algebraic
geometry, Lie algebras. factorisation
of polynomials, integer programming,
permutation groups, differential
equations, coding theory, automatic
theorem proving. and polyhedral
geometry. A must-read for everybody
• •••••••••
interested in computer algebra.
Springer