(Springer Undergraduate Mathematics) Franz Lemmermeyer - Quadratic Number Fields-Springer Nature Switzerland AG (2021)
(Springer Undergraduate Mathematics) Franz Lemmermeyer - Quadratic Number Fields-Springer Nature Switzerland AG (2021)
Franz Lemmermeyer
Quadratic
Number
Fields
Springer Undergraduate Mathematics Series
Adivsory Editors
Mark A. J. Chaplain, St. Andrews, UK
Angus Macintyre, Edinburgh, UK
Simon Scott, London, UK
Nicole Snashall, Leicester, UK
Endre Süli, Oxford, UK
Michael R. Tehranchi, Cambridge, UK
John F. Toland, Bath, UK
The Springer Undergraduate Mathematics Series (SUMS) is a series designed for
undergraduates in mathematics and the sciences worldwide. From core foundational
material to final year topics, SUMS books take a fresh and modern approach.
Textual explanations are supported by a wealth of examples, problems and fully-
worked solutions, with particular attention paid to universal areas of difficulty. These
practical and concise texts are designed for a one- or two-semester course but the
self-study approach makes them ideal for independent use.
Translation from the German language edition: Quadratische Zahlkörper by Franz Lemmermeyer,
© Springer-Verlag GmbH Deutschland 2017. Published by Springer Spektrum. All Rights Reserved.
© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland
AG 2021
This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher, whether
the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse
of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and
transmission or information storage and retrieval, electronic adaptation, computer software, or by similar
or dissimilar methodology now known or hereafter developed.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication
does not imply, even in the absence of a specific statement, that such names are exempt from the relevant
protective laws and regulations and therefore free for general use.
The publisher, the authors, and the editors are safe to assume that the advice and information in this book
are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or
the editors give a warranty, expressed or implied, with respect to the material contained herein or for any
errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional
claims in published maps and institutional affiliations.
This Springer imprint is published by the registered company Springer Nature Switzerland AG.
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland
Preface
This book evolved from a manuscript for an introductory lecture series at the
University of the Saarland in Saarbrücken in 1999. The goal was to present the
arithmetic of quadratic number fields and to explain how to apply the results to
problems in elementary number theory.
I expect the readers to be familiar with notions such as prime numbers and residue
class rings from elementary number theory, and with fundamental theorems such as
unique factorization, Fermat’s Little Theorem, and the quadratic reciprocity law.1
The theory of quadratic number fields deals with similar theorems in bigger rings of
integers, for example the ring of Gaussian integers, which consists of all numbers
of the form a + bi, where a and b are ordinary integers and where i 2 = −1. In this
ring, 5 is not a prime anymore because 5 = (1 + 2i)(1 − 2i). Whether an ordinary
prime number p remains prime in this ring depends on the Legendre symbol ( −1 p );
in general quadratic number rings, the behaviour of prime numbers also depends
on Legendre symbols. In this connection, we will learn that quadratic reciprocity,
which is perhaps perceived as a curiosity by someone who has never looked beyond
the horizon of elementary number theory, is a fundamental result that governs the
behaviour of prime numbers in quadratic number rings.
Finally, the theory of quadratic number fields has numerous applications to
elementary number theory. It puts several results such as the Two-Squares Theorem,
which asserts that primes of the form p = 4n + 1 can be written as the sum of two
squares, into a bigger perspective, and it allows you to solve Diophantine equations,
in particular certain Bachet–Mordell equations y 2 = x 3 + k, special cases of the
Catalan equation x p + y q = 1, or the Fermat equations x n + y n = zn for n ≤ 5.
Any book on quadratic number fields has to cover a set of standard topics such as
rings of integers, unique factorization into ideals, finiteness of the class group and
the solvability of the Pell equation. It is the topics outside the standard curriculum
that “define” this book, so I would like to say a few words about them here.
1 This result will be proved in several ways in this book, but I assume that the readers know how to
apply it.
v
vi Preface
1 Prehistory.. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 1
1.1 Pythagoras and Euclid . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 1
1.2 Diophantus.. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 5
1.3 Bachet .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 8
1.4 Fermat.. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 10
1.4.1 Integral Solutions of y 2 + 2 = x 3 . . . . .. . . . . . . . . . . . . . . . . . . . 11
1.4.2 The Fermat Equation x 4 + y 4 = z2 . . .. . . . . . . . . . . . . . . . . . . . 12
1.5 Euler . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 13
1.5.1 The Two-Squares Theorem . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 14
1.5.2 Euler’s Algebra . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 19
1.5.3 Bachet’s Equation y 2 + 2 = x 3 . . . . . . . .. . . . . . . . . . . . . . . . . . . . 19
1.5.4 The Cubic Fermat Equation .. . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 20
1.5.5 Euler and the Problem of Units . . . . . . . .. . . . . . . . . . . . . . . . . . . . 21
1.6 Gauss. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 22
1.7 Kummer and Dedekind . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 23
1.7.1 From Ideal Numbers to Ideals . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 26
1.8 Exercises .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 26
2 Quadratic Number Fields . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 31
2.1 Quadratic Number Fields . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 31
2.1.1 Quadratic Extensions as Vector Spaces.. . . . . . . . . . . . . . . . . . . 32
2.2 Rings of Integers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 33
2.3 The Unit Circle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 37
2.4 Platon’s Hyperbola . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 38
2.4.1 Platon’s Side and Diagonal Numbers ... . . . . . . . . . . . . . . . . . . . 40
2.5 Fibonacci’s Hyperbola . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 41
2.5.1 Generating Functions .. . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 42
2.5.2 Group Law . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 44
2.6 Vieta Jumping . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 45
2.6.1 The IMO Problem . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 45
vii
viii Contents
B Solutions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 261
Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 331
Contents xi
The idea of transferring the arithmetic of the ordinary integers to quadratic number
rings appears to be so natural to those who are familiar with some abstract algebra
that we tend to underestimate the achievement by Carl √ Friedrich Gauss, who paved
the way by studying integers of the form a + b −1 in the early nineteenth
century. Before I discuss the contributions of Gauss and his successors Ernst Eduard
Kummer and Richard Dedekind, I would like to show the immense difficulties that
Leonhard Euler had to cope with when he used algebraic numbers for solving
problems going back to Pierre Fermat, Claude Gaspard Bachet de Meriziac and
ultimately even to Diophantus. Those who would like to familiarize themselves
with the number theoretical work of Fermat are well advised to study André Weil’s
excellent book [132] (and, if they read German, [88]).
1 2 m4 + 2m2 + 1 1 2
m+ −4= − 4 = m − .
m m2 m
Clearing the denominators yields the solution
(t 2 − u2 )2 + (2tu)2 = (t 2 + u2 )2 . (1.2)
With hindsight we can see that the basis of our derivation of (1.1) is the fact that
the equation x 2 + y 2 = z2 can be written as x 2 = z2 − y 2 , and that a difference of
squares can be factored:
We can also find Pythagorean triples starting with the famous diagram of the “square
in the middle” in Fig. 1.1. The area of the large square (Fig. 1.1) is (a + b)2 ; since
it is composed of the small square in the middle and four rectangles, this must be
equal to (a − b)2 + 4ab. Thus (a − b)2 + 4ab = (a + b)2 , or, after dividing through
by 4,
a + b 2 a − b 2
ab = − , (1.3)
2 2
which again shows that the difference of two squares is a product.
In order to find Pythagorean Triples we make ab equal to a square, for example,
by setting a = m2 and b = n2 . Then we obtain
1 For learning more about the methods used in “Babylonian algebra” see [63].
1.1 Pythagoras and Euclid 3
If we draw four diagonals into the Babylonian square in Fig. 1.1, we get a proof
of the Pythagorean Theorem for free. In fact, the area of the shaded square is c2 ,
where c denotes the hypotenuse of the right triangle with legs a and b; on the other
hand, it is also equal to 4 · ab
2 + (a − b) = 2ab + a − 2ab + b = a + b .
2 2 2 2 2
The verification that the triples (t −u , 2tu, t +u ) are solutions of the equation
2 2 2 2
2 For Euclid, the product of a number is the representation of a number as a product, not the result.
When Euclid wants the result of a product, he uses a clumsy phrase such as “if two numbers
multiplied make a number.”
1.2 Diophantus 5
may therefore assume that a is odd and b = 2u is even and then obtain
c−a c+a
(2u)2 = b2 = c2 − a 2 = (c − a)(c + a), i.e., u2 = · .
2 2
Observe that the last equation is just (1.3). Since a and c are coprime, so are the two
factors on the right hand side (as a matter of fact, any number dividing both c−a 2
and c+a
2 divides their sum c and their difference a), and according to Theorem 1.4
both numbers must be squares. Thus c−a 2 = t and 2 = s , which immediately
2 c+a 2
1.2 Diophantus
During the European Middle Ages, sciences in Europe were almost non-existent.
Even Euclid’s Elements had been largely forgotten. Only the contact with the
Muslim occupants of Spain and Sicily in the eleventh and twelfth century AD
made the few European scientists (almost all of them were monks and bishops,
since the monastery schools—the only schools where Latin was taught—were
reserved for the future clergy) aware of the existence of classical works on medicine,
astronomy, and mathematics. The invention of the printing press by Gutenberg made
the Elements available to people outside of monasteries (in particular once they had
been translated into other languages) and without access to the large libraries.
The most important event for the development of algebra and number theory
was without doubt the discovery of the Arithmetica by Diophantus, who must
have lived between 200 BC and 300 AD in Alexandria, a city near the Nile Delta
founded by Alexander the Great in 331 BC shortly before his death. Alexander’s
3 Presentations of their work may be found in Vogel [126, 127] and in Chemla and Guo [19].
6 1 Prehistory
successor decided to build a library there, which attracted scientists from the
whole Mediterranean world4 and made it the scientific center of antiquity. Among
the most famous scientists who have worked there are Euclid, the author of the
Elements, Archimedes, one of the greatest mathematicians and, like Heron, also an
exceptionally gifted engineer, the astronomer Ptolemy and Diophantus. Six of his
thirteen books on arithmetic problems have survived in Greek, and another four
(discovered only in the 1970s) in Arabic translation.
In this chapter we will discuss the following two problems solved by Diophantus
in his Arithmetica: His construction of Pythagorean triples and Problem VI.19,
which was to play an important role in the development of algebraic number theory.
We remark in advance that Diophantus already used some algebraic notation: He
had symbols for one unknown and its powers up to the sixth.
Pythagorean Triples Diophantus treats the problem of finding Pythagorean triples
in the following form:
II.10 To decompose a given square number into two squares.
Diophantus shows how to decompose 16 into a sum of two squares; he sets the
first square equal to x 2 , and then the second square is 16 − x 2 . Now he writes:
We form the square of an arbitrary multiple of x reduced by the side of the given square,
say 2x − 4.
Now we see the idea behind the choice of 2x − 4: On both sides of the equation we
have the constant term 16, which can be canceled; adding x 2 we obtain 5x 2 = 16x,
hence x = 16 256 144
5 . Thus one square is 25 , the other 25 .
Many centuries later it was observed that the substitution y = 2x − 4 may
be interpreted as the equation of a line in the Euclidean plane. The equations of
Diophantus then may be visualized geometrically as intersecting this line with the
circle with radius 4 around the origin (see Fig. 1.2).
In order to decompose a 2 into a sum of two squares let us call the smaller square
x ; then a 2 − x 2 = y 2 must also be a square. The substitution y = mx − a yields
2
a 2 − x 2 = m2 x 2 − 2amx + a 2 ,
4 This included the Hellenistic world. Among the scientists believed to have studied in Alexandria
are Archimedes from Syracuse in Sicily and Eratosthenes from Cyrene in North Africa. It is also
conceivable that well-educated scribes from Mesopotamia preferred the boomtown Alexandria to
the declining cities in Mesopotamia.
1.2 Diophantus 7
2am 2 m2 − 1 2
+ a· 2 = a2,
m2 + 1 m +1
and after canceling a 2 and getting rid of the denominator we recover (1.1).
Problem VI.19
To find a right-angled triangle in which the area increased by the hypotenuse is a square,
and the perimeter is a cube.
Diophantus solves this problem5 as follows. He denotes the area by x and the
hypotenuse as a square minus x, say c = 16 − x. The product of the legs is 2x; if
one leg was equal to 2, the other would be x, and the perimeter 2 + x + 16 − x = 18,
which is not a cube. Thus, says Diophantus, we need a square which increased by 2
makes a cube.
If one side of the square is m + 1 and the side of the cube is m − 1, then we must
solve m3 − 3m2 + 3m − 1 = m2 + 2m + 3, which gives m = 4. Thus the side of
the square is 5 that of the cube 3.
If x denotes the area of the original triangle and 25 − x its hypotenuse, and if 2
and x are its legs, then the theorem of Pythagoras gives us x 2 − 50x + 625 = x 2 + 4,
i.e., x = 621
50 , and the problem is solved.
Diophantus was forced to choose the substitution c = 16 − x; his calculations,
however, may be transferred to the general substitution c = k 2 − x. At one point,
5 This is problem VI.17 in Heath [59]; some problems are enumerated in a different way in different
editions.
8 1 Prehistory
1.3 Bachet
We do not know whether or how much Diophantus was studied in antiquity. Hypatia
(355–415), the daughter of Theon of Alexandria (335–405), is often said to have
written a comment on Diophantus Arithmetica; this story seems to be based on a
misguided interpretation by Tannery. Diophantus was studied by many Muslim (and
a few Byzantine) scientists; in Western Europe, Diophantus remained unknown until
Johannes Regiomontanus from Königsberg (Lower Franconia, Bavaria) discovered
a copy of six of the 13 books in a library in Venice in 1463.
The first edition of the Arithmetica was prepared a century later by Wilhelm
Holtzmann (1532–1576) under the name of Guilielmus Xylander; based on this
work Claude Gaspard Bachet de Mériziac (1581–1638) published an improved
version in 1621—not only the text had to be translated from Greek into Latin, Bachet
(like Xylander before him) also had to correct many corrupted passages that had
crept into the manuscript over the centuries, and he tried to make the text accessible
to his readers by detailed comments.
In his edition of Diophantus’ Arithmetica Bachet asked whether the equation
y 2 + 2 = x 3 that showed up in Problem VI.19 possesses other rational solutions
except the one given by Diophantus, and he answered this question in the positive
by presenting a method that allowed him to find a new solution of such an equation
from a known one.
Bachet achieved his result using the Diophantine technique of clever substitu-
tions, which we may interpret geometrically (see Fig. 1.3): If we intersect the curve
y 2 = x 3 − 2 (such a curve is called an elliptic curve) and its tangent
y= 27
10 (x − 3) + 5
in P (3, 5), then we obtain a second point of intersection 129 383
100 , 1000 .
Of course Bachet did not think of tangents at all, and he did not determine the
tangent using analytic means: Differential calculus had not yet been discovered,
and neither did coordinate systems exist, which came into being under the hands of
Pierre Fermat and René Descartes. Bachet rather chose his Diophantine substitution
1.3 Bachet 9
Fig. 1.3 Left: Tangent method on the elliptic curve y 2 = x 3 − 2; construction of 129 383
100 , 1000 from
(3, 5). Right: The elliptic curve y 2 = x 3 + 1 with the five integral points (−1, 0), (0, ±1) and
(2, ±3), two lines through three points and the tangents in (2, ±3)
in such a way that a linear equation resulted, which then necessarily has a rational
solution.
Bachet knew, as did his readers, that this calculation is a “proof by example,” that
is, this solution is general in the sense that it may be applied without any problems to
any equation of the form y 2 + k = x 3 . In fact, if we set y1 = y − η and x1 = x − rη,
then the equation y12 + k = x13 yields
2y
The constant term vanishes if 3rx 2 − 2y = 0, that is, if r = 3x 2
. Plugging this into
(1.4) and solving for η we obtain
27 6 9 3
η=− x + x ,
8y 3 2y
hence
9x 4 − 8y 2 x x 4 + 8kx
x1 = x − rη = = , (1.5)
4y 2 4y 2
27 6 9 3
y1 = y − η = y + x − x
8y 3 2y
8y 4 + 27x 6 − 36y 2x 3 −x 6 + 20kx 3 + 8k 2
= = . (1.6)
8y 3 8y 3
1.4 Fermat
In Fermat’s time, the problems studied in number theory were about perfect numbers
(numbers that are equal to the sum of their proper divisors such as 2p−1 (2p − 1)
for prime numbers 2p − 1), amicable numbers (pairs of numbers such as 220 and
284, for which the sum of the proper divisors of one number is equal to the other)
and figurate numbers (patterns in the sequences of triangular numbers and square
numbers). It was the study of Bachet’s edition of Diophantus’ Arithmetica that made
Pierre Fermat (1607–1665) start his own investigations in number theory. On the
margin of a page in his copy he wrote his remark that the equation x n + y n = zn
1.4 Fermat 11
is not solvable in natural numbers for any exponent n ≥ 3, and even claimed to
have a wonderful proof which the margin of his book was too small to contain.
Since he never made this claim public (it was published posthumously by his son
Samuel) and since Fermat was not exactly suffering from modesty we may assume
that he eventually discovered that his idea for proving the case n = 4 could not be
transferred to other exponents n.
In his copy of Diophantus’ Arithmetica, Fermat also made the following remark
concerning Bachet’s equation y 2 + 2 = x 3 :
Is there another square in integers apart from 25 that, increased by 2, gives a cube? This
seems difficult to investigate; but I can show by a rigorous demonstration that 25 is the only
square that is smaller by 2 than a cube. In rational numbers, Bachet’s method yields many
such squares, but the theory of integers, which is very beautiful and very subtle, was so far
known to nobody, neither to Bachet, or to any other author whose works I have seen.
In his letter to Carcavi written in 1657, Fermat tried to make Carcavi believe that
he was able to prove the following assertions using infinite descent:
• There is no cube that can be decomposed into two cubes.
• There is only one square which is 2 less than a cube, namely 25.
• There exist only two squares that, when you add 4, give a cube, namely 4 and
121.
• All squared powers6 of 2, increased by 1, are prime numbers.
n
The last claim is Fermat’s conjecture that all numbers of the form 22 + 1 are prime,
which Euler disproved by observing that F5 = 232 + 1 = 641 · 6700417. It seems
to me that Fermat did not know how to prove any of these claims but was convinced
that the key to their proofs was infinite descent.
We now will have a closer look at Bachet’s method of constructing rational points
on elliptic curves y 2 = x 3 − k. To this end, let P ( M
m n
, N ) be such a rational point,
and assume that the fractions are written in reduced form and with M, N > 0. It
follows from ( Nn )2 = ( M ) −k that n2 M 3 = m3 N 2 −kM 3 N 2 . Since N 2 divides the
m 3
right side, N 2 must also divide n2 M 3 . But n and N are coprime, so we can conclude
that7 N 2 | M 3 . In a similar way we obtain M 3 | N 2 . Thus the natural numbers M 3
and N 2 divide themselves, hence we must have M 3 = N 2 . This is only possible if
M is a square and N a cube, and thus there exists a natural number e with M = e2
and N = e3 .
Proposition 1.6 Each rational point on the elliptic curve y 2 = x 3 − k has the form
( em3 , en2 ), where gcd(m, e) = gcd(n, e) = 1.
If we now plug x = m
e3
and y = n
e2
into Bachet’s duplication formula, we find
m4
e12
+ 8k em3 m4 + 8kme9
x1 = = ,
4 n2 4n2 e8
e4
6 3
− em18 + 20k me9 + 8k 2 −m6 + 20km3e9 + 8k 2 e18
y1 = = .
3
8 ne6 8n3 e12
Thus if ( em3 , en2 ) is a rational point on E, for which m and n are both odd and e is
even, then m1 = m4 + 8kme9 and n1 = −m6 + 20km3e9 + 8k 2 e18 are again odd,
and e1 = 2ne4 is not only even, but divisible by a much higher power of 2 than e.
This shows
Proposition 1.7 Bachet’s method applied to the point (3, 5) on the elliptic curve
E : y 2 = x 3 − 2 yields only points whose coordinates have even denominator
(when written in lowest terms) and thus does not produce any point with integral
coordinates.
The proof of this proposition may have been within Fermat’s reach despite the
very modest tools he had at his disposal. But it does not follow from this proposition
that there are no integral solutions of y 2 + 2 = x 3 except (3, ±5) since Bachet’s
method does not yield all rational solutions. Similarly, Bachet’s method applied to
the equation y 2 + 4 = x 3 and the integral point (2, 2) does not yield any integral
points beyond (5, 11) (see Exer. 1.8.).
Already Diophantus showed that there exist Pythagorean triples in which one leg or
the hypotenuse is a square number, i.e., that the Diophantine equations a 4 + b 2 = c2
and a 2 + b 2 = c4 have nontrivial solutions (see Exercise 1.2). Fermat asked why
Diophantus did not discuss the question of finding Pythagorean triples in which two
sides are square numbers, and he answered this question by observing that this is
due to the unsolvability of the problem:
of the proof, but it is not known just how big Fermat’s contribution to Frenicle’s
publication actually was.
The proof is based on an application of infinite descent: Starting with a
hypothetical solution (x, y, z) in natural numbers one constructs a new solution
(x1 , y1 , z1 ) that is “smaller” (in a suitable way) than the original solution. Since
natural numbers cannot decrease indefinitely, this will lead to a contradiction.
For the proof of Fermat’s claim we assume that there is a solution (x, y, z) of
x 4 + y 4 = z2 in natural numbers with xy = 0. If p is a common divisor of x and
z, then p | y, hence p4 | z2 and p2 | z; but then we may cancel p4 , and applying
this reasoning repeatedly we arrive at a solution (x, y, z) in which x, y, and z are
pairwise coprime.
Clearly x and y have different parity, and we may assume that x is even and y is
odd. According to Proposition 1.5 there exist natural numbers a, b with x 2 = 2ab,
y 2 = a 2 − b2 and z = a 2 + b2 . Since y is odd, a and b have different parity. If
a is even and b is odd, then we obtain 1 ≡ y 2 = a 2 − b2 ≡ 0 − 1 ≡ −1 mod 4:
Contradiction. Thus a is odd and b is even, and applying Proposition 1.5 to the
equation b2 + y 2 = a 2 we obtain the existence of integers c, d ∈ N with b = 2cd,
y = c2 − d 2 and a = c2 + d 2 . This gives us x 2 = 4cd(c2 + d 2 ), hence (x/2)2 =
cd(c2 + d 2 ). Now c, d and c2 + d 2 are pairwise coprime (a common factor would
divide a and b, hence x and y) and their product is a square. Applying the Square
Lemma 1.4 twice (first to the pair cd and c2 + d 2 , then to c and d) we find that these
factors must be squares, up to possible factors ±1. By choosing c and d positive we
obtain c = e2 , d = f 2 and c2 + d 2 = g 2 for e, f, g ∈ N.
But now we have e4 + f 4 = g 2 , hence we have found a new solution of the
equation x 4 + y 4 = z2 . Since
this solution is smaller than our original solution. In other words: To every solution
(x, y, z) ∈ N3 with xy = 0 there exists another solution (e, f, g) ∈ N3 with 0 <
g < z (if we had g = 0, it would follow that e = f = 0 and thus b = 0, hence
x = 0: Contradiction). Thus there cannot exist a solution (x, y, z) ∈ N3 with xy = 0
since after finitely many steps we would obtain an integral solution (e, f, g) with
0 < g < 1. This proves Fermat’s claim.
As impressive as this proof is, it only uses descent and a repeated application of
the Square Lemma.
1.5 Euler
n
conjecture that all numbers of the form 22 + 1 are prime. Euler was not impressed,
but Goldbach did not let go. He showed Euler how to prove that no Fermat number
4
> 22 + 1 is divisible by any prime number below 100, and when Euler eventually
n
discovered that each prime factor of 22 + 1 has the form p = 2n k + 1 and that
5
22 + 1 is divisible by 541 = 40 · 16 + 1, he was hooked. Euler was the only
prominent mathematician8 of his time who studied number theory until Lagrange
appeared on the mathematical stage.
For explaining the idea behind Euler’s attempt at proving Fermat’s conjecture
that x = 3 and y = 5 are the only solutions of y 2 = x 3 − 2 in natural numbers,
we will look into his Algebra [38]. Euler’s proof contained a gap, but it displays his
originality.
In order to understand Euler’s reasoning it is necessary to study Euler’s proof
of one of the most beautiful results of elementary number theory, the Two-Squares
Theorem, according to which each prime number of the form 4n + 1 can be written
as a sum of two squares.
The first statement that every prime number of the form p = 4n + 1 can be written
as a sum of two squares shows up in Albert Girard’s (1595–1632) edition of Simon
Stevin’s (1548–1620) Arithmetique published in 1625. This edition contains the first
four books of the Arithmetica Diophantus translated by Stevin, and the fifth and
sixth book translated by Girard. In connection with Problem V.12, Girard writes:
Determination of a number that can be divided into two squares of integers.
I. Each square number.
II. Each prime number that exceeds a multiple of 4 by a unit.
III. The products of such numbers.
IV. And the double of each of these.
The first proof that every prime number of the form p = 4n + 1 can be written
uniquely as a sum of two squares is due to Fermat, and the first published proof to
Euler [37]. Euler approaches his proof slowly and thoughtfully and he explains why,
apart from p = 2 = 12 + 12 , only primes of the form 4n + 1 can be sums of two
squares.
His first considerations are concerned with the representation of 2p as sums of
two squares: If p = a 2 + b 2 , then
8 There were numerous less known mathematicians interested in Diophantine problems or the
c + di (c + di)(1 − i) c + d + (d − c)i
= = .
1+i (1 + i)(1 − i) 2
Euler now plays the same game with two odd prime numbers: If p = a 2 + b 2
and q = c2 + d 2 , then
The existence of two expressions for pq as sums of two squares suggests that
products of two primes of the form 4n + 1 can always be written as sums of two
squares in two different ways. In the second part of his article Euler uses this idea
to develop an algorithm for finding the prime factors of sums of two squares.
The idea behind Euler’s proof of the Two-Squares Theorem is reversing the
process above. We have seen (see Eqs. (1.7) and (1.8)) that a prime number p is
a sum of two squares if and only if 2p is. It is therefore a natural idea to show that
some multiple mp of a prime p ≡ 1 mod 4 is a sum of two squares, and then to
reduce m via the product formula (1.9) until we end up with m = 1.
Euler follows this idea using induction (Fermat had chosen the equivalent method
of infinite descent):
(1) The prime numbers p = 4n + 1 are those for which −1 is a quadratic residue;
it follows from x 2 ≡ −1 mod p that x 2 + 1 = mp for some natural number m.
(2) For each prime number p = 4n + 1 there is a multiple mp that can be written
as a sum of two squares, where we may choose m < p.
(3) If m is even, then m2 p is also a sum of two squares.
(4) Each odd prime factor q of m is sum of two squares by induction hypothesis,
and then mq p is a sum of two squares.
is divisible by p. Since the expression in the last bracket is a sum of two squares
(a n )2 + (bn )2 , everything boils down to showing that a and b can be chosen in such
a way that p does not divide the expression a 2n − b 2n in the first bracket.
As Euler was to find out later, this can be seen quite easily: Just pick b = 1 and
show that there is at least one integer a not divisible by p for which a 2n − 1 is not
divisible by p. In fact, if all integers a = 1, 2, . . . , p − 1 had the property that p is
a divisor of a 2n − 1, then the polynomial x 2n − 1 would have more than 2n = p−1 2
roots modulo p, which is impossible.
It remains to prove the last claim, the induction step. We assume that each prime
q = 4n + 1 less than p is a sum of two squares and then show that p is also a sum
of two squares.
Thus let mp = x 2 + y 2 for an integer m < p. If d is a common divisor of x and
y, then m is divisible by d 2 , and division by d 2 yields m1 p = x12 + y12 , where x1
and y1 are coprime. Moreover we may assume that m1 is odd.
Now write mp = a 2 + b2, where m is odd and where a and b are coprime. From
a + b2 ≡ 0 mod m we obtain the congruence (a/b)2 ≡ −1 mod m. In particular,
2
c2 (a 2 + b 2 ) = a 2 c2 + b 2 c2 and a 2 (c2 + d 2 ) = a 2 c2 + a 2 d 2
1.5 Euler 17
Since q = c2 + d 2 is prime, q divides one of the two factors by Euclid’s Lemma 1.2.
Changing the sign of d if necessary we may assume that q divides bc − ad. Then
and here the left side as well as the square (bc − ad)2 are divisible by q. Thus q also
divides ac + bd, and canceling q 2 yields
m bc − ad 2 ac + bd 2
·p = + .
q q q
q = 1, then we are done; if not, consider a prime factor q1 of q and repeat the
If m m
last step. After finitely many steps we have found a presentation of p as a sum of
two squares.
As a corollary of Euler’s investigation we observe
Theorem 1.9 (Euler’s Decomposition Theorem) If m = x 2 + y 2 is a sum of two
coprime squares, and if m = p1 p2 · · · pt is the prime factorization of m, then we
can choose integers xj , yj ∈ Z for which pj = xj2 + yj2 , and the decomposition of
m into two squares can be obtained by a repeated application of the identity (1.9)
to the decompositions of the primes pj as sums of two squares.
of prime numbers, and if each prime factor pj has the form pj = xj2 +myj2 , then the
signs of the yj can be chosen in such a way that the decomposition N = x 2 + my 2
is obtained by a repeated application of the identity
from
The usual counterexamples, which we will discuss repeatedly in the next few
chapters, do not apply to this result, which can be proved using tools available
to Euler. For solving Diophantine equations one needs a stronger version of the
decomposition theorem, namely the analogue of Theorem 1.9 for numbers of the
form x 2 + 2y 2 . The strong version would follow from the weak one if we could
show that prime divisors of numbers of the form x 2 + my 2 with gcd(x, y) = 1 again
have this form. But as we have already seen, this is false already for m = 5.
Euler eventually must have realized that there is a serious problem with his
approach. In one of the many posthumous papers of Euler [39, Art. 44] we find
the following question:
The formula 1812 + 7 = 323 is worthy of our whole attention; although 32 = 52 + 7 it is
not true that
√ √
181 + −7 = (5 + −7 )3 ,
although we have
√
√ 1 − 3 −7 √
181 + −7 = (5 + −7 )3 .
8
We also remark that
√ √
1 + 3 −7 1 − 3 −7
· = 1.
8 8
which shows that the development into imaginary factors requires further investigations.
One√can often read that Euler used unique factorization 2for integers of the form
a + b −2 in his solution of the √ Bachet-Fermat equation y + 2 = z3 ; as a matter
of fact, primes of the form x + y −2 do not occur anywhere in Euler’s work, and
it would be more precise to say that Euler’s proof had a gap that we can fill with
the unique factorization theorem for such numbers. What is true is that Euler tried
to transfer the
√ Square Lemma (Theorem 1.4) and its cubic analog to numbers of the
form a + b c.
x 3 + y 3 = 2p(p2 + 3q 2 ),
and Fermat’s claim boils down to the question whether the product 2p(p2 + 3q 2 )
can be a cube or not. Elementary congruences show that p must be divisible by 4,
so that p4 (p2 + 3q 2 ) is a cube.
We can easily show that p4 and p2 + 3q 2 are either coprime or have greatest
common divisor 3. In the first case, p2 + 3q 2 must be a cube:
Now let us make pp + 3qq a cube, which may be achieved, as we have shown above, by
setting
√ √ √ √
p + q −3 = (t + u −3 )3 and p − q −3 = (t − u −3 )3 .
This makes pp + 3qq = (tt + 3uu)3 into a cube, and now we find p = t 3 − 9tuu =
t (tt − 9uu) and q = 3ttu − 3u3 = 3u(tt − uu).
must be a cube in order to find a contradiction using infinite descent. In fact, the
coprimality of the factors implies that 2t = e3 , t − 3u = f 3 and t + 3u = g 3 must
be cubes. This implies
e3 = 2t = (t − 3u) + (t + 3u) = f 3 + g 3 ,
1.5 Euler 21
and we have found a new solution (e, f, g) of the cubic Fermat equation, which
easily can be shown to be smaller than the solution we started with unless xyz = 0.
The problematic point in Euler’s proof is the following, as was pointed out to
Euler by the Berlin mathematician and calculator Abraham Wolff in a letter to Euler
written on August 9, 1770:
The difficulty lies in the fact that I lack the trick by which I can convince myself that if
pp + 3qq = (tt + 3uu)3 , that is
√ √ √ √
(p + q −3 ) · (p − q −3 ) = (t + u −3 )3 (t − u −3 )3 ,
√ √
the value of p + q −3 must necessarily be (t + u −3 )3 .
It is not known whether Euler answered this letter; but he certainly knew that
others felt there was a gap.
√ such as ax +cy
In § 188, Euler discusses the question of how to make expressions 2 2
√
into a cube. To this end he uses numbers of the form x a + y −c. Euler obtains
his solutions by setting
√ √ √ √
x a + y −c = (p a + q −c )3 .
This yields
ax 2 + cy 2 = z3 (1.10)
for z = ap2 + cq 2 . But what Euler uses (and needs) in his applications is the
converse, namely that each solution is given by these equations. Euler’s occasional
remarks concerning the coprimality of the coefficients show that he has seen that this
converse must be proved. Numerous examples that perhaps were known to Euler
show, however, that such a proof cannot be as simple as Euler may√ have √ thought.
Euler’s digression into the theory of numbers of the form x a +√ y c cannot
be avoided: For example, we have 72√− 10 · 22 = 9 = 32 , yet 7 + 2 10 is not a
square of a number of the form a + b 10. In the√present√case, the obstacle may be
overcome by considering numbers of the form a 2 + b 5, because now
√ √ √
7 + 2 10 = ( 2 + 5 )2 .
22 1 Prehistory
In Euler’s Algebra, this example does not occur. He came across these numbers
when trying to make expressions such as 2x 2 − 5y 2 into cubes. In this case, Euler
observes, it is not sufficient to set
√ √ √ √
x 2 + y 5 = (p 2 + q 5)3 ;
1.6 Gauss
The final step in the direction of algebraic number theory was taken by Carl
Friedrich Gauss (1777–1855). Gauss is one of the greatest mathematicians of all
time. He was only 18 years old when he solved a 2000-year-old problem by showing
that a regular polygon with 17 sides can be constructed using ruler and compass—
he obtained the proof by developing the (algebraic!) theory of cyclotomy, which
he included in his Disquisitiones Arithmeticae, one of the most famous textbooks
on number theory. The Disquisitiones also contained the first complete proofs of
the quadratic reciprocity law. Another fundamental discovery by Gauss was elliptic
functions (doubly periodic functions C −→ C, obtained by inverting elliptic
integrals that Euler and Legendre had studied extensively and which appear in the
computation of the circumference of ellipses—whence their name), about which he
published almost nothing at all.
In his Disquisitiones Arithmeticae [43] published in 1801, Gauss gave a quite
modern presentation of elementary number theory (in the sense that he covered
congruences, unique factorization, residue class groups and primitive roots), erected
on a safe foundation: After the proof of Euclid’s Lemma 1.2 in [43, Art. 14] he
states and proves the theorem of unique factorization. Before Gauss, the uniqueness
of prime factorization had been known, but it was not regarded as a fundamental
property: As in Euclid’s Elements, it was rather regarded as an auxiliary result, e.g.,
for finding all the divisors of numbers of the form 2p−1 (2p − 1) for primes 2p − 1 in
the construction of perfect numbers. The observation that all divisors of a number N
are obtained exactly once by multiplying the divisors of the prime powers dividing
N is essentially equivalent to unique factorization as far as the content is concerned.
For Gauss (and the number theorists after him), unique factorization was a principle
on which elementary number theory is founded. In particular, Gauss realized that
unique factorization may be used to prove results about integers. Only after this
1.7 Kummer and Dedekind 23
Gauss uses the numbers a + bi not only as servants for finding identities, as
Euler and Lagrange have done, but develops the arithmetic in this domain ab ovo:
He defines divisibility, units, prime numbers and shows, with the help of binary
quadratic forms, that the integers a + bi can be factored uniquely, up to unit factors
and ordering, into prime elements. Both Dirichlet (1805–1859) and Jacobi (1804–
1851) were surprised and highly impressed by the idea of allowing these numbers
a + bi as modules, which allowed Gauss to transfer his theory of congruences to
these numbers.
Dirichlet later even extended Gauss’s theory of binary quadratic forms to the
ring of “Gaussian integers” (parts of his lectures on the elementary arithmetic of
this ring have survived as lecture notes by Gustav Arendt [3]), and he realized that
unique prime factorization in such domains is a consequence of the existence of a
Euclidean algorithm.
For a few number fields, unique factorization could be proved in this way, but
it was not clear what to do with number rings in which unique factorization does
not hold. It was more or less taken for granted that a general theory would have to
be based on a generalization of the theory of binary quadratic forms to forms of
higher degree. Eisenstein (1823–1852) developed a theory of cyclic cubic fields in
the language of cubic forms, and Dirichlet worked out the analytic class number
formula for cyclotomic fields using the language of forms before Kummer (1810–
1893) succeeded in creating an arithmetic of cyclotomic number fields based on
his notion of ideal numbers. Dedekind (1831–1916) extended Kummer’s ideas to
general number fields using his theory of ideals.
I have noticed that, even if α cannot be decomposed into complex factors, it might not
have the true nature of a complex prime number, since in general it lacks the first and most
important property of primes, namely that the product of two primes is not divisible by any
prime number different from them.
Thus the two factorizations result from combining the factors in the “prime
decomposition”
√ 2 √ √
6 = − −2 (1 + −2 )(1 − −2 )
±1 are units since they obviously divide 1. The result of these considerations is that
√ √
6 = 2 · 3 = (1 + −5 )(1 − −5 ) (1.12)
Dedekind goes on to say that, on the other hand, it is easy to define the notion of
coprime elements without using the decomposition into irreducibles:
Two non-zero algebraic integers α and β are called coprime if each element divisible by α
and β is also divisible by αβ.
Today we are accustomed to definitions such as this one; for constructivists such
as Leopold Kronecker this was a bad definition: it does not allow you to decide a
priori whether two given integers are coprime since the definition requires checking
infinitely many conditions. In fact we have to verify that the infinitely many integers
divisible by α and β are also divisible by the product αβ. It is therefore clear that for
computing with such numbers, one has to find an algorithm that allows us to decide
in finitely many steps whether two given elements are coprime or not.
A little later (p. 250) Dedekind observes that in algebraic number rings there
often exist essentially different decompositions of elements into irreducible ele-
ments and then continues as follows:
This contradicts the notion of the character of primality that holds in the number theory of
the rational integers to such an extent that we shall not accept an irreducible element as a
prime; thus we need to look out for a stronger criterion than the inadequate irreducibility,
similar to what we did earlier for the notion of coprimality [. . . ], by not decomposing the
integer we are investigating but by studying how it behaves as a module:
An integer μ shall be called prime if it is not a unit, and if every product ηρ divisible by
μ has at least one factor η or ρ divisible by μ.
The fact that in some algebraic number fields there exist irreducible elements that
lack the defining properties of primes implies that the theorem of unique prime
factorization does not hold in such rings. In the case of quadratic number rings it was
possible to justify calculations needed for solving certain Diophantine equations by
invoking the language of binary quadratic forms; for example, Dirichlet proved the
unsolvability of the quintic Fermat equation x 5 + y 5 = z5 in positive integers by
using the theory of the quadratic forms x 2 − 5y 2.
In order to be able to say something about certain Diophantine equations even
if the corresponding number field does not have unique factorization, Kummer
invented the notion of an “ideal” prime number. His basic idea is, from today’s
point of view, a very modern one: Investigate an algebraic structure by studying
homomorphisms into simpler structures (see [81]). √
Let us once more consider the ring R = Z[ −5 ]. We have seen above that
the elements 2 and 3 are irreducible in R, but not prime. If there was an element
π of norm 2, then we could consider the residue class ring of R modulo π; this
quotient ring would have two elements, because it can be shown that the number
of residue classes modulo an element of R is equal to its norm. Reduction modulo
π thus would give us a ring homomorphism f : R −→ Z/2Z. Kummer realized
that such a ring homomorphism exists √ even when there is no element of norm 2. In
fact, all we have to do is set f (a + b −5 ) = a + b + 2Z. Thus although there
is no prime element π of norm 2, we can work modulo π by simply applying f .
Such ring homomorphisms (or, less anachronistically, such procedures for attaching
a residue class to each element) were called “ideal primes” by Kummer.
Heinrich Jung has shown in [69] how to develop the whole theory of quadratic
number fields based on this notion of ideal primes as ring homomorphisms. The
only obstacle in this approach is the fact that it is not at all obvious how to multiply
ideal numbers. Dedekind later replaced these ideal numbers by the kernels of the
associated ring homomorphisms and called them ideals. In his theory, the product
of two ideals is simply the ideal generated by the products of the elements from each
ideal.
1.8 Exercises
1.1. Already the Babylonians, about 4000 years ago, knew how to calculate the
space diagonal of a door, and composed problems from integral solutions of
equations such as x 2 +y 2 +z2 = w2 . To this end they looked for Pythagorean
triples such as (3, 4, 5) and (5, 12, 13), in which the hypotenuse of one
triangle is equal to the leg of the other one, and then obtained the solution
32 + 42 + 122 = 132 .
Show how to find infinitely many such solutions.
1.8 Exercises 27
1.2. Show that there are infinitely many Pythagorean triples (a, b, c) in which a
or c is a square
2number.
4
1.3. The vectors 2 and 4 with lengths 3 and 9, respectively, suggest that
1 7 a √
there exist infinitely many vectors a whose length a 2 + a 2 + b 2 is an
b
integer. Clearly this holds if and only if 2a 2 + b2 = c2 is the square of an
integer c.
Write this equation in the form 2a 2 = c2 − b2 = (c − b)(c + b) and
conclude that setting c − b = 4s 2 and c + b = 2r 2 yields solutions. Deduce
that a = 2rs, b = r 2 − 2s 2 and c = r 2 + 2s 2 .
Parametrize the ellipse x 2 + 2y 2 = 1 also using the lines through the point
(−1, 0).
1.4. Parametrize the unit sphere x 2 + y 2 + z2 = 1 using lines through the point
(−1, 0, 0).
1.5. If a and b are represented by the form x 2 − my 2 , then so is their product. The
content of this identity was already known to Brahmagupta, who used it for
solving the equation x 2 = my 2 + 1 in integers.
1.6. Derive Bachet’s duplication formula using analytic geometry. The slope of
the tangent in a point may be obtained by implicit differentiation: 2yy = 3x 2
2
implies y = 3x 2y .
1.7. Show that Bachet’s duplication formula applied twice to the point (3, 5) on
y 2 = x 3 − 2 yields the rational point
2340922881 113259286337292
, .
76602 76603
1.8. Apply Bachet’s duplication formula to the point (2, 2) on the elliptic curve
y 2 + 4 = x 3 and show that the only integral point resulting by repeated
duplication is (5, 11).
1.9. Show that there is no Pythagorean triple whose legs are prime numbers.
Show moreover that if (p, b, q) is a Pythagorean triple in which one leg p
and the hypotenuse q are primes, then b = q − 1.
1.10. Show that if (a, b, c) is a Pythagorean triple, then so is (t − a, t − b, t + c).
Similarly, consider the triple (t + a, t + b, 2t − c) and find more ways of
constructing a new Pythagorean triple from a known one.
Show moreover that this method also works for sums of three squares.
1.11. Find a counterexample to the following statement: If p is prime and kp =
a 2 +mb2, and if k can be written in the form k = c2 +md 2 , then p = e2 +mf 2 .
1.12. Show that there are infinitely many primes of the form p = 4n + 1 as well as
of the form q = 4n + 3.
Hint: As in Euclid’s proof, consider the integers N1 = (p1 · · · pt )2 + 1 and
N3 = 4q1 · · · qt − 1.
1.13. The following trick due to Ernst Trost [123, 124] is simple but often
remarkably useful. Given a Diophantine equation at 2 + bt + c = 0 with a
28 1 Prehistory
√
rational solution t, the solution formula t1,2 = −b±2a
Δ
for quadratic equations
tells us that the discriminant Δ = b − 4ac must be a square.
2
in the product
a b a b aa + bc ab + bd
· =
c d c d ca + dc cb + dd
In this chapter we provide the foundations for doing arithmetic in quadratic number
rings. We will explain what a quadratic number field is, and which elements we will
regard as “integers.” In addition, we will visualize certain aspects of the arithmetic
of quadratic number fields geometrically by introducing Pell conics.
This book deals with the arithmetic of quadratic number fields, and in this and the
next section we will present the main actors in our play. In Chap. 4 we will give a
precise definition of what we mean by notions such as divisibility, units, and prime
elements, and only then will we return to the question how to put Euler’s solution of
the Diophantine equation y 2 + 2 = x 3 in integers onto a solid foundation and apply
his reasoning to other examples.
Let m ∈ Z \ {0, 1} be a squarefree integer; then the set
√ √
k = Q( m ) = {a + b m : a, b ∈ Q}
√
of numbers of the form a + b m, where a and b are rational numbers, is called a
quadratic number field (the fact that k is actually a field is proved in Exercise 2.1).
We call k real or complex quadratic
√ according as m > 0 or m < 0.
The element α = a + b m ∈ k is a root of the quadratic √ polynomial Pα (x) =
x 2 − 2ax + a 2 − mb 2 ∈ Q[x]; its other root α = a − b m is called the conjugate
of α. Moreover we call
Nα = αα = a 2 − mb 2 the norm of α,
Tr α = α + α = 2a the trace of α, and
disc (α) = (α − α )2 = 4mb2 the discriminant of α.
√ √
The conjugate, the norm, the trace, and the discriminant of α = 3+ 5
2 ∈ Q( 5 ),
for example, are
√
3− 5 32 − 5
α = , Nα = = 1, Tr α = 3 and disc (α) = 5.
2 4
√
As we have seen, Euler first used numbers of the form a + b c for solving
Diophantine equations in ordinary integers. In order to get equations in integers from
relations in quadratic number rings we need maps R −→ Z. Since we will mainly
exploit multiplicative relations (decomposition into factors, divisibility, units), maps
respecting the multiplicative structure such as the norm are particularly important
(see, e.g., Exercise 2.7, and for the proof of the proposition below, Exercise 2.6).
Proposition 2.1 For all α, β ∈ k we have
1 In extensions with non-abelian Galois groups one has to distinguish carefully between these
where ζ is a primitive p-th root of unity (see Exercise √ 2.45), and used Gaussian
periods in√ their subfields. If p = 3, then Q(ζ ) = Q( −3 ) since we can choose
√
ζ = −1+2 −3 ; in this case we see that the ring Z[ζ ] is strictly larger than Z[ −3 ].
Dirichlet solved the quintic Fermat equation x 5 + y 5 = z√5 using elements in
√
the ring Z[ 5 ]; he did not consider elements of the form p+q2 5 although this was
more or less suggested by equations such as
√
√ (φ + ψ 5 )5
P +Q 5= ,
24
which would look a lot more symmetric if they were written in the form
√ √
P + Q 5 φ + ψ 5 5
= .
2 2
34 2 Quadratic Number Fields
Dirichlet also proved his unit theorem (a generalization of the solvability of the Pell
equation) in rings of the form Z[α], where α is a root of a monic polynomial
x n + an−1 x n−1 + . . . + a1 x + a0
with coefficients aj ∈ Z.
It is not clear whether the question how to define algebraic integers was perceived
as a problem before Dedekind gave the definition. The quote that
Talent hits a target no one else can hit;
Genius hits a target no one else can see
must have integral coefficients. We will call α ∈ K an algebraic integer if Pα (x) has
coefficients in Z. More generally, algebraic integers are roots of monic polynomials
x n + an−1 x n−1 + . . . + a1 x + a0
√ √ √
with coefficients in Z. The numbers 2, −3 and 1+2 5 , for example, are algebraic
integers because they are roots of the monic polynomials x 2 −2, x 2 +3 and x 2 −x−1,
√
respectively, all of which have integral coefficients. On the other hand, √1 and 1+2 3
2
are algebraic numbers, but not algebraic integers because they are roots of the monic
polynomials x 2 − 12 and x 2 − x − 12 , respectively. It can be shown that algebraic
numbers form a field, and that the algebraic integers form an integral domain (or
simply a domain from now on).
The set of all integral elements in a number field k is called the ring Ok of
(algebraic) integers in k. For quadratic number fields we will show that this set
is actually a ring after having characterized these integers.
√
Theorem 2.2 The integral elements in the quadratic number field k = Q( m ) are
given by
√
{a + b m : a, b ∈ Z} if m ≡ 2, 3 mod 4,
Ok = √
{ a+b2 m : a, b ∈ Z, a ≡ b mod 2} if m ≡ 1 mod 4.
√
Proof Assume that α = r + s m is an algebraic integer with r, s ∈ Q; then
Tr α = 2r and Nα = r 2 − ms 2 are ordinary integers. If we plug 2r ∈ Z into the
second equation, then we find that 4ms 2 must be an integer. Since m is squarefree,
4s 2 and thus finally 2s must be an integer. In fact, write 4s 2 = x 2 /y 2 for coprime
integers x, y ∈ Z; since 4ms 2 is an integer, we find y 2 | mx 2 ; since gcd(x, y) = 1
we find y 2 | m, and since m is squarefree this implies y = ±1.
Thus we may write 2r = a and 2s = b for integers a, b ∈ Z. Now we exploit
once more the fact that Nα = r 2 − ms 2 is an integer and find that a 2 − mb2 ≡
0 mod 4.
• If m ≡ 2 mod 4, then 2 |√a, 4 | a 2 and 2 | b, hence r, s ∈ Z: Each algebraic
integer has the form r + s m with r, s ∈ Z.
• If m ≡ 3 mod 4, then 0 ≡ a 2 − mb 2 ≡ a 2 + b2 mod 4; this is only possible if a
and b are even, and as above this implies that r and s must be integers.
• If m ≡ 1 mod 4, then we obtain the congruence 0 ≡ a 2 − mb 2 ≡ a 2 − b 2 mod 4,
which holds if and only √ if a ≡ b mod 2. Thus the algebraic integers in this case
have the form 12 (a + b m ), where a and b are either both even or both odd. It is
easily verified that these numbers are indeed algebraic integers.
This completes the proof.
√ √
The field k = Q( m ) consists of all Q-linear combinations of 1 and m. Does
something similar hold for the ring Ok of integers, that is, does there exist an ω ∈ Ok
such that every α ∈ Ok is a Z-linear combination of 1 and ω? In this case we write
Ok = Z ⊕ ωZ and call {1, ω} an integral basis. The answer to our question is in fact
positive:
Corollary 2.3 We have Ok = Z ⊕ ωZ for
√
m, if m ≡ 2, 3 mod 4;
ω= √
1+ m
2 , if m ≡ 1 mod 4.
In particular, Ok is a ring.
Proof Only in the second case√there is something to show. Assume therefore that
m ≡ 1 mod 4 and α = 12 (a + b m ) with a ≡ b mod 2; setting c = a−b 2 and d = b
we find α = c + dω with c, d ∈ Z; the proof of the converse is just as simple.
36 2 Quadratic Number Fields
The fact that Ok is a ring is now easily seen to be true by showing that the sum,
difference, and the product of two elements of the form a + bω with a, b ∈ Z again
have this form. To this end we have to show that the product of two elements has
this form, and this boils down to showing that ω2 = r + sω for integers
√
r and s. But
clearly ω2 = m = m + 0ω for m ≡ 2, 3 mod 4, and ω2 = 1+m+2 4
m
= 4 +ω
m−1
for m ≡ 1 mod 4.
1 ω 2
The number Δ = disc k := 1 ω = (ω − ω )2 is called the discriminant3 of
the quadratic number field k. We find
4m if m ≡ 2, 3 mod 4,
disc k =
m if m ≡ 1 mod 4.
√
It is easily seen that {1, Δ+2 Δ } is an integral basis for any quadratic number field.
Our next result justifies our choice of the ring of integers in quadratic number
fields:
Proposition 2.4 The rational numbers contained in Ok are the ordinary integers:
Ok ∩ Q = Z.
3√ 3√ 3√
Q( 2 ) = {a + b 2 +c 4 : a, b, c ∈ Q},
−1p
where ζ is a root of xx−1 = 1 + x + . . . + x p−1 and p ≥ 5 is prime, and which have
degree p−1. We will occasionally use these fields as examples that lie outside of the
scope of this book, and in the last chapter we will show for a deeper understanding
of quadratic number fields we cannot avoid studying cyclotomic fields.
3 The discriminant of a quadratic number field does not depend on the choice of the integral basis;
The elements of a quadratic number field with norm 1 form a group with respect
to multiplication, since if Nα = 1 and Nβ = 1, then clearly N(αβ) = 1 and
N(α/β) = 1. The elements x + yi with norm 1 in the field Q(i) are characterized
by N(x + yi) = x 2 + y 2 = 1, i.e., the corresponding points (x, y) lie on the unit
circle. Elements with norm 1 may be easily constructed by forming the quotient of
m2 +n2
m−ni has norm m2 +n2 = 1, and from
two elements with the same norm: Thus m+ni
m + ni (m + ni)2 m2 − n2 + 2mni
= =
m − ni (m − ni)(m + ni) m2 + n2
m2 − n2 2mn
x= , y=
m2 + n2 m2 + n2
of the rational points on the unit circle. The fact that we get all rational points on the
unit circle in this way, i.e., that all elements of norm 1 can be written as quotients
m+ni
m−ni , is the content of Hilbert’s Theorem 90, which will be important in Chap. 9.
It is a natural question whether the group structure of rational points on the unit
circle given by the multiplication of the corresponding elements in Q(i) can be
interpreted geometrically. This is indeed the case (see Fig. 2.1):
Theorem 2.5 The elements a + bi ∈ Q(i) with norm 1 correspond to the rational
points (x, y) on the unit circle x 2 + y 2 = 1. If P (a, b) and Q(c, d) are two rational
points, then we obtain the point R corresponding to the product (a + bi)(c + di) as
follows:
• If P and Q are distinct, R is the second point of intersection of the unit circle
and the parallel to P Q through the point N(1, 0).
• If P = Q, then R is the second point of intersection of the unit circle and the line
through N that is parallel to the tangent in P .
The point R corresponding to the product (a + bi)(c + di) = ac − bd + (ad + bc)i
has coordinates (ac − bd, ad + bc). We have to show that the lines NR and P Q are
parallel; to this end we first assume that the x-coordinates of P and Q are distinct.
We then have to show that the slopes are equal:
d −b ad + bc
= .
c−a ac − bd − 1
which is equivalent to
(a + bi)(a − bi) = a 2 + b 2 = 1.
If finally P = Q, then the tangent is orthogonal to the line connecting the origin
with P , and thus has slope m = − ab . On the other hand, (a + bi)2 = a 2 − b 2 + 2abi,
i.e., the line through N and R(a 2 − b 2 , 2ab) has slope a 2 −b
2ab
2 −1 . Since a = 1 − b
2 2
as desired.
Since the argument of a product of two complex numbers is the sum of their
arguments, the group law on the unit circle is based on the addition of the
corresponding angles: We have P ⊕ Q = R if and only if NOP + NOQ =
NOR. Similar remarks apply for the group law on the elements with norm 1 in
arbitrary complex quadratic number fields.
quadratic number fields there can only be finitely many integral points on the norm-
1 ellipses for simple geometric reasons (and in fact only the points (±1, 0) except
when Δ = −3 or Δ = −4), the situation is fundamentally different in real quadratic
number fields. √
As √
a simple example consider the elements of norm 1√ in Z[ 2 ], that is, numbers
x + y 2 with x 2 − 2y 2 = 1. It is easy to see that 3 + 2 2 is such an element, and
that (3, 2) is an integral point on the hyperbola H : x 2 − 2y 2 = 1. Since N(1, 0)
is another integral point, we can define a geometric group law on the set of integral
(or rational) points on H by calling a point R = P ⊕ Q the sum of the points P and
Q if R is the second point of intersection of the parallel to P Q through N with the
hyperbola H (see Fig. 2.2).
Just as in the case of the unit circle we find
√ √
Theorem 2.6 The numbers a + b 2 ∈ Q( 2 ) with norm 1 correspond bijectively
to the rational points P (a, b) on the hyperbola H : x 2 − 2y 2 = 1. If P (a, b)
and Q(c, d) are√two such points,
√ then we obtain the point R corresponding to the
product (a + b 2 )(c + d 2 ) as the second point of intersection of the parallel
to P Q through N(1, 0) with the hyperbola H if P and Q are distinct, and as the
second point of intersection of the tangent in P if P = Q.
The proof is similar to the one for the unit circle. But as we shall see in a moment,
the hyperbola H contains infinitely many integral points, whereas the unit circle
only contains four such points. These integral points on H arise from √ P (3, 2) by
repeated addition. The point n · P corresponds to the element (3 + 2 2 )n . We claim
that the only integral points on the right branch of the hyperbola
√ are given by the
integral multiples of P , which correspond to the powers (3 + 2 2 )n with n ∈ Z.
To this end let Q be an arbitrary integral point on the upper right branch of H,
and assume that Q does not have the form nP . Since the x-coordinates of nP are
not bounded, there must exist a natural number n such that Q lies properly between
nP and (n + 1)P . Subtracting nP shows that Q nP is an integral point lying
properly between N(1, 0) and P (3, 2); but such a point does not exist.
The integral points on the lower right branch are obtained by reflection at the x-
axis, which corresponds geometrically to conjugation, i.e., to multiplication of the
exponent by −1. Thus every integral point on the right branch of the hyperbola is
an integral multiple of P .
Since the integral points on the left branch of the hyperbola H are obtained by
a reflection at the y-axis, which corresponds algebraically to multiplication by −1,
we have shown:
√
Theorem 2.7 The units of norm 1 in the ring Z[ 2 ] are given by
√
ε = (−1)m (3 + 2 2 )n
with 0 ≤ m ≤ 1 and n ∈ Z.
√ From these elements
√ we obtain
√ √ −1 via multiplication by 1 +
all units with norm
2. Since 3 + 2 2 = (1 + 2 )2 , each unit in Z[ 2 ] has the form
√
ε = (−1)m (1 + 2 )n
with 0 ≤ m ≤ 1 and n ∈ Z.
√
The map ε → (n, m) induces an isomorphism between the unit group in Z[ 2 ]
and√the abstract group Z/2Z ⊕ Z. In Chap. 7 we will show that the unit group of
Z[ m ] for any nonsquare integer m ≥ 2 is isomorphic to Z/2Z ⊕ Z.
We have already mentioned that Euler was initiated to number theory by his friend
Christian Goldbach (1690–1764). In one of his letters to Euler (see [89]) Goldbach
claimed not only to have proved Fermat’s theorem that 1 is the only triangular
number that is a fourth power, but that actually 1 was the only square among them.
Triangular numbers are numbers of the form Tn = n(n+1) 2 ; the reason behind their
name is the fact that Tn pebbles may always be arranged in the form of a triangle
(see [85]). Euler replied immediately that there are infinitely many squares among
the triangular numbers. In fact, setting Tn = m2 and completing the square gives
(2n + 1)2 − 2(2m)2 = 1, hence x = 2n + 1 and y = 2m satisfy the equation
x 2 − 2y 2 = 1. The smallest solution in positive integers clearly is (x, y) = (3, 2),
2.5 Fibonacci’s Hyperbola 41
which leads to (m, n) = (1, 1). The next solution is (x, y) = (17, 12), which yields
the triangular number T8 = 36, which clearly is a square.
These pairs of numbers (x, y) are called Platon’s side and diagonal numbers.
Platon (427–347) remarked that the square with side s = √ 5 has a diagonal
√ that
differs not much from d = 7. In fact, this diagonal has length √2 · 52 = 50 by the
Theorem of Pythagoras, whereas 72 = 49. The approximation 2 ≈ 75 thus comes
from the equation 72 − 2 · 52 = −1. Theon of Smyrna (ca. 70–135 A.D.; Smyrna
is today called Izmir) explained that if xn and yn are numbers with xn2 − 2yn2 = ±1,
2
then xn+1 − 2yn+1
2 = ∓1, where we have set
In this section we will discuss a few connections between Fibonacci numbers and
certain quadratic irrationalities, and will derive Binet’s4 Formula. Fibonacci (1170–
1250), also named Leonardo of Pisa, was the son of a merchant from Pisa. During
his education in North African countries he became familiar with the Hindu-Arabic
numbers. In his famous book Liber Abaci he presented these numbers and methods
for computing with them.
The Fibonacci numbers Un named after him show up in this book and are defined
recursively by
4 Binet published his formula in 1843; it was already known to Daniel Bernoulli in 1728—see [11,
p. 90].
42 2 Quadratic Number Fields
The recursion formula Un+1 = Un + Un−1 then provides us with the relation
In fact we have
f (q) = q + q 2 + 2q 3 + 3q 4 + . . . + Un q n + . . .
qf (q) = q 2 + q 3 + 2q 4 + . . . + Un−1 q n + . . .
q f (q) =
2 q 3 + q 4 + . . . + Un−2 q n + . . . ,
At this point we recall the dictum of Erich Hecke, who wrote in [60, p. 201] that the
precise knowledge of the behaviour of an analytic function in the neighbourhood of its
singular points is a source of number-theoretic theorems.
A(q) aj
= (2.3)
B(q) q − bj
j
2.5 Fibonacci’s Hyperbola 43
since clearly
q − bk 1 if k = j,
lim =
q→bk q − bj 0 if k = j.
In order to evaluate the left side, we use L’Hospital’s rule and find
A(q) A(q) + (q − bk )A (q) A(bk )
lim (q − bk ) = lim = .
q→bk B(q) q→bk B (q) B (bk )
This shows
Proposition 2.8 (Euler’s Formulas) Let A(q) and B(q) be polynomials in C[q],
where B is assumed to have only simple roots. Then the coefficients ak in the partial
fraction decomposition (2.3) are determined by
A(bk )
ak = . (2.4)
B (bk )
q 1 1 1
f (q) = = √ − .
1 − q − q2 5 1 − ωq 1−ωq
1 2 3
f (q) = √ 1 + ωq + ω2 q 2 + ω3 q 3 + . . . − 1 − ω q − ω q 2 − ω q 3 − . . .
5
1 2 3
= √ (ω − ω )q + (ω2 − ω )q 2 + (ω3 − ω )q 3 + . . . .
5
Comparing the coefficients of q n here and in the definition of the generating function
yields
Theorem 2.9 (Binet’s Formula) The Fibonacci numbers Un admit the explicit
representation
ωn − ω n
Un = , (2.5)
ω−ω
It is hardly surprising that the Fibonacci numbers show up in connection with the
hyperbola F : x 2 − xy − y 2 = 1 since the denominator of the function f (q) is
Q(1, q), where Q(x, y) = x 2 − xy − y 2 is a quadratic form with discriminant 5.
Theorem 2.10 The group law on the hyperbola F : x 2 − xy − y 2 = 1 with neutral
element N(1, 0), in which the sum of two points P and Q is the second point of
intersection of the parallel to P Q through N with F , is given by the equation
(x1 , y1 ) ⊕ (x2 , y2 ) = (x3 , y3 ) with
x3 = x1 x2 + y1 y2 , y3 = x1 y2 + x2 y1 − y1 y2 .
We can also generate infinitely many integral points on the Fibonacci hyperbola
using a technique that has become known as “Vieta jumping” in recent years, and
we can then show that there are no others.
The fundamental observation is the following: If P = (x, y) is any integral point
on the Fibonacci hyperbola, then there is a second integral point P ∗ = (x, y ) with
the same x-coordinate. This is because for a fixed value of x, the quadratic equation
x 2 − xy − y 2 = 1 in y has two solutions y1 , y2 , and that if y1 is an integer, so is
y2 = −x − y1 . For the same reason there must be an integral point P∗ = (x , y)
with the same y-coordinate as P .
Vieta jumping on conics is connected with the group law; in our case, P ⊕ P ∗ =
(1, −1) and P ⊕ P∗ = (−1, 0), as is easily seen from the geometric interpretation
of the group law (see Fig. 2.4).
In order to show that all integral points on the Fibonacci hyperbola have the form
kP or kP ⊕ (−1, 0) we consider an arbitrary integral point Q(x, y). If x > y ≥ 1,
then Q∗ = (x , y) is an integral point with y < x; if y > x > 1, on the other hand,
then Q∗ = (x, y ) is an integral point with y < x . Repeating this descent eventually
leads to an integral point with x = ±1, thus one of the four points (±1, 0) or
(±1, ∓1). Conversely we have to show that all points arising by the two operations
P ∗ and P∗ from P (1, 0) have the form kP or kP ⊕(−1, 0). We will leave the details
once more to the reader (see Exercise 2.34).
The following problem due to Stephan Beck was posed at the International
Mathematical Olympiad in 1988.
Fig. 2.5 Vieta jumping on the Fibonacci hyperbola (left) and on C4 : x 2 − 4xy + y 2 = 4 (right)
a 2 +b2
Let a and b be positive integers such that ab + 1 divides a 2 + b2 . Prove that ab+1 is a
perfect square.
For the proof, assume that P (a, b) is an integral point on the conic Ck : x 2 −
kxy + y 2 = k, and that k is not a square. Since k is not a square, we must have
k ≥ 2 (and as a matter of fact k ≥ 3, since k = 2 implies 2 = (x − y)2 , which is
impossible in integers) (Fig. 2.5).
Next we claim that as long as a = b we can find an integral point (a , b ) on Ck
lying in the first quadrant with a + b < a + b. Applying this step sufficiently often
we obtain an integral point of the form (A, A); but then A2 = 2−k k
implies k = 1
contradicting our assumptions.
The construction of (a , b ) is easy: Assume that b > a; then P ∗ (a, b ) with
b = ka − b is an integral point on Ck , and ab = a 2 − k shows that b < a. If a > b,
then P∗ = (kb − a, b) has the desired properties. This proves our claims.
For k = 3, this equation has the obvious integral solution (1, 1, 1), and Vieta
jumping gives rise to a whole tree of integral solutions.
2.7 Exercises 47
For more on Markov’s equation, its history and unsolved problems connected
with it, see Aigner [1].
2.6.3 Summary
We have introduced the following notions, which will be fundamental for the
following chapters:
• quadratic number fields
• norms, traces, and discriminants
• Galois groups of quadratic extensions of Q
• rings of integers (maximal order)
• integral bases
For an introduction to the theory of group laws on conics see [86].
2.7 Exercises
√
2.1. Show that a quadratic number field k = Q( m ), where m is a squarefree
integer = 1, is a field. √
2.2. Show that elements α, β ∈ K = Q( m ) form a Q-basis of K if and only
α
if the 2 × 2-matrix M defined by β = M √1m is a matrix in the group
GL2 (Q), i.e., if and only if det M = 0.
2.3. Show√ that elements α, β ∈ OK , where OK is the ring of integers of K =
Q( m ), form basis of OK if and only if the 2 × 2-matrix M
an integral
defined by βα = M ω1 is a matrix in the group SL2 (Z), i.e., if and only if
the matrix has integral entries and determinant det M = ±1.
2.4. Verify the equation
n+1
Un Un+1 01
=
Un+1 Un+2 11
for Fibonacci numbers Un . Diagonalize T = 01 11 (i.e., find an invertible
matrix S ∈ M2 (C) with D = S −1 T S = α0 β0 ) and observe that T n =
(S −1 DS)n = S −1 D n S. Since it is very easy to take powers of diagonal
matrices, one now obtains a formula for the numbers Un .
2.5. Prove that p | Up±1 by expanding ωp using the binomial theorem. Also show
that for primes p ≡ ±1 mod 5 we have p | Up−1 , for primes p ≡ ±2 mod 5,
on the other hand, p | Up+1 . The last result is due to Lagrange.
Joseph Louis Lagrange (1736–1813) was a French mathematician with
Italian origins. In number theory, he is known for his proofs of the Four-
48 2 Quadratic Number Fields
Squares Theorem (each natural number is the sum of at most four square
numbers) and the solvability of the Pell equation, as well as for his theory of
reduction of binary quadratic forms.
Hint: Show that the congruence (a + b)p ≡ a p + b p mod p holds in
arbitrary rings. √ √
2.6. Prove Proposition 2.1. In particular if α = a + b m ∈ Q( m ), where m is
not a square, show the following:
1. Tr(α) = 0 if and only if a = 0.
2. disc α = 0 if and only if b = 0.
3. Nα = 0 if and only if a = b = 0.
2.7. Show that if α | β in Ok , then Nα | Nβ in Z.
2.8. Let x 2 +px +q = 0 be a quadratic equation with the solutions ω and ω . Show
that disc ω = (ω − ω )2 = p2 − 4q coincides with the discriminant of the
quadratic equation. What happens in case of the equation ax 2 + bx + c = 0?
2.9. Let m be a nonzero integer. Show that the following assertions are equiva-
lent:
√ √
1. Q( m ) = {a + b m, a, b ∈ Q} is a field.
2. x 2 − m is irreducible in Q[x].
√ m is not a square in Q.
3. The integer
4. N(a + b m ) = a 2 − mb 2 = 0 implies a = b = 0.
√
√ m be a squarefree integer and K = Q( m ). Show that the2square root
2.10. Let
b of an integer b is an element of K if and only if either b = r is a square
or b = s 2 m for some integer s.
2.11. Show that σ : k −→ k is a ring homomorphism, i.e., show that σ (α + β) =
σ (α) + σ (β) and σ (αβ) = σ (α)σ (β) for all α, β ∈ k. Show moreover that
α ∈ k is in Q if and only if α = σ (α).
2.12. Let K/Q be a quadratic extension. Verify√that K is a Q-vector space.
Show that multiplication by α = a+b √m ∈ K is a Q-linear map K → K;
Q-basis {1, m } of K, the√
show that, with respect to the map is described by
x → Ax, where x =
r
s describes the element r + s m and where A is
given by A = ab mb a .
Show that Nα = det A and Tr α = Tr A, and that norm and trace do not
depend on the choice of the basis.
2.13. Show that an element α of a quadratic number field is integral if and only if
α = σ (α) is integral.
2.14. Show that if {1, ω} is an integral basis of Ok , then so is {1, ω − a} for any
integer a ∈ Z.
Show more generally: If {ω1 , ω2 } is an integral basis and if a, b, c, d are
integers such that ad − bc = 1, then {aω1 + bω2 , cω1 + dω2 } is also an
integral basis. √
2.15. Determine all m < 0 for which the ring Ok of integers in k = Q( m )
contains an element of norm 2 or 3.
2.7 Exercises 49
√ i + √5 2 1 − √−5 2
2 + −5 = i =− √ .
1+i 2
Explain
√ the relation 312 − 26 · 62 = 52 by a similar decomposition of 31 +
6 26. √ √
2.21. The norm of 17√+ 4 15 √ is a square. Show that the square root of 17 + 4 15
has the form a 3 + b 5, and find more examples.
2.22. Let m√be a positive integer. Show that if a 2 − mb 2 = c2 and√1 ≤ a ≤ m, then
a +b m cannot be the square of a number of the form r +s m with r, s ∈ Z
and s = 0. Show moreover that such examples exist for every composite
positive integer m. √
2.23. Consider the quadratic number fields K = Q( −m ) with squarefree m =
u2 − 4 for an odd integer u √≥ 3. Show that 22 + m = u2 is a counterexample
to the Square Lemma in Z[ −m ].
2.24. An entry in Joseph Liouville’s notebook, probably written while the French
mathematicians struggled with Gabriel Lamé’s purported proof of Fermat’s
Last Theorem, contains the following equation:
√ √
169 = 13 · 13 = (4 + 3 −17 )(4 − 3 −17 ).
Show
√ that this is a counterexample to the Square Product Theorem in
Z[ −17 ].
√ equation 32 = 5 + 7 are due to the
2.25. Show that Euler’s problems with the 2
√
Z[ 1+ 2 −7 ]. Verify that
√ −1 − √−7 3
5+ −7
= ,
2 2
√
and factorize 181+2 −7 similarly.
2.26. Use the fact that addition of points on the unit circle corresponds to the
addition of angles to derive the addition formulas for trigonometric functions.
2.27. Project the points on the unit circle from the point Z(−1, 0) to the tangent t
in N, and associate the point Z with the “point at infinity” on t. Which group
law on t is induced by the group law on the unit circle under this projection?
2.28. The inverse of the duplication formula 2(x, y) = (x 2 − y 2 , 2xy) for rational
points on the unit circle corresponds to taking the square root of the complex
number x + yi corresponding to the point(x, y). Show that the two solutions
of 12 (x, y), where x, y > 0, are given by ε 1+x2 ,ε
1−x
2 for ε = ±1.
Convince yourself that a repeated application of halving points to cos π4 =
√
sin π4 = 12 2 yields the formulas
π 1 √ π 1 √
cos = 2 + 2, sin = 2 − 2,
8 2 8 2
π 1 √ π 1 √
cos = 2+ 2+ 2, sin = 2− 2+ 2,
16 2 16 2
etc.
2.29. Show that the group law on the hyperbola xy = 1 with neutral element
N(1, 1) is given by (x1 , y1 ) ⊕ (x2 , y2 ) = (x1 x2 , y1 y2 ).
2.30. Show that the group law on the parabola y = x 2 with neutral element N(0, 0)
is given by (x1 , y1 ) ⊕ (x2 , y2 ) = (x1 + x2 , y1 + y2 + 2x1 x2 ).
2.31. Show that the generating function f (q) of the Fibonacci numbers satisfies the
functional equation
1
f = f (−q).
q
2.34. Determine all integral points on the Fibonacci hyperbola using Vieta jumping.
2.35. Consider the Lucas–Lehmer hyperbola x 2 − 3y 2 = 1. Show that the group
law with neutral element N(1, 0) is given by
Show that the integral points on this hyperbola are the multiples of P (2, 1)
and their negatives. Show in addition that 2k P = (xk , yk ) with xk+1 = 2xk2 −
1.
2.36. Let n be an odd natural number. Show that n is prime if and only if there is
an integer a with a n−1 ≡ 1 mod n and a k ≡ 1 mod n for each proper divisor
k of n − 1.
Deduce that n = 2m + 1 is prime if and only if 3(n−1)/2 ≡ −1 mod n (this
is called Pépin’s test).
We can formulate this primality test in the language of conics. An odd
integer n is prime if and only if there is a point P on the hyperbola xy = 1
defined over Z/nZ for which (n − 1)P = (1, 1) and kP = (1, 1) for each
proper divisor k of n − 1.
For n = 17 and P = (3, 6) (the coordinates have to be read modulo
17), for example, we have 2P = (9, 2), 4P = (13, 4), 8P = (−1, −1) and
16P = (1, 1), and this proves that 17 is prime.
For more on primality tests using conics see Hambleton [50]. Factorization
algorithms based on the arithmetic of Pell conics are studied in Eelkema [33].
We also mention a proof of the quadratic reciprocity law based on Pell conics
due to Hambleton and Scharaschkin [52].
2.37. Let p be a prime number with ( p3 ) = −1. Show that the points modulo p on
the conic x 2 − 3y 2 = 1 form a cyclic group of order p + 1.
Show moreover that p = 2q − 1 is prime if and only if p+1 2 P = (−1, 0)
for P = (2, 1). Show also that this is equivalent to p+1 4 P = (0, b) for a
suitable b modulo p.
2.38. Find all integral points on the Beck conic x 2 − 4xy + y 2 = 4.
For Q = (a, b) let Q∗ = (a, b ) and Q∗ = (a , b) denote the points
derived from Q by Vieta jumping. With P = (2, 0) and T = (0, 2) show that
P ⊕ P = T∗ and P ⊕ T = P∗ .
2.39. Find all integral points on the conic x 2 + y 2 − 3xy + 1 = 0.
2.40. (Romanian Team Selection Test 1991) Let a and b be positive integers. Prove
that if the number a+1b + a is an integer, then it is equal to 3.
b
√
2.44. Show√that algebraic integers form a ring using the example α = 3 and
β = 3 2, i.e., find monic polynomials with integral coefficients whose roots
are α + β and αβ, respectively.
2.45. Let ζ be a primitive n-th root of unity, i.e., an algebraic number with the
n = 1.
property that n is the smallest positive exponent satisfying ζ
Show that the set Z[ζ ] consisting of all elements α = n−1 j
j =0 aj ζ with
aj ∈ Z forms a ring.
2.46. Let α be a root of an irreducible monic polynomial f of degree n and with
integral coefficients, and let K = Q(α) be the smallest field extension of Q
containing α. Show that K consists of all expressions ω = a0 + a1 α + a2 α 2 +
. . . + an−1 α n−1 with aj ∈ Q.
The conjugates of α are the roots α1 = α, α2 , . . . , αn of f , and the
conjugates of ω are ωj = a0 + a1 αj + a2 αj2 + . . . + an−1 αjn−1 . Define the
norm of ω to be the product of its conjugates: N(ω) = ω1 ω2 · · · ωn . Show
that N(ω) is an integer, and that ω is a unit if and only if N(ω) = ±1.
2.47. A natural number n is called powerful if p | n for some prime p implies that
p2 | n; in other words: if the exponent of each prime in the prime factorization
of n is at least 2.
Show that there are infinitely many consecutive powerful numbers; the
smallest example is (8, 9).
Chapter 3
The Modularity Theorem
In the last chapter we have investigated a few Pell conics such as x 2 − 2y 2 = 1 and
x 2 − xy − y 2 = 1. For finding all integral points on Pell conics Q(x, y) = 1, where
Q(x, y) = ax 2 + bxy + cy 2 is a binary quadratic form, it is natural to ask whether
this equation has solutions in rational numbers or in residue class rings.
The general philosophy behind this way of investigating a mathematical problem
in the integers is to study the object in question in simpler rings such as the field of
rational numbers or finite fields.1
1 Beginners in mathematics may find it hard to believe that mathematicians think of finite fields
(and even p-adic numbers) as being simpler objects than integers. One possible way of measuring
the simplicity of structures A and B is counting homomorphisms from A and B into structures
C. For example, there are many homomorphisms from Z to finite fields Fp , whereas the only
homomorphisms from Fp to Z or to finite fields are either the trivial homomorphism mapping
everything to 0 or (in the case of Fp −→ Fp ) an isomorphism.
(x + 1)(x − 1 − mt 2 (x + 1)) = 0.
1 + mt 2 2t
x= , y = t (x + 1) = .
1 − mt 2 1 − mt 2
Theorem 3.1 The rational points (x, y) = (−1, 0) on the Pell conic P : x 2 −
my 2 = 1, where m is a nonsquare integer, are given by
r 2 + ms 2 2rs
x= , y= . (3.1)
r 2 − ms 2 r 2 − ms 2
Finding the rational points on a Pell conic C is thus a rather easy task. It is much
more difficult to find the integral points on C.
Theorem 3.2 Let m be a nonzero integer and p an odd prime number not dividing
m. Every point P (x, y) = (−1, 0) on the Pell conic P : x 2 − my 2 = 1 with
x, y ∈ Fp is given by
1 + mt 2 2t
xt = , yt = ,
1 − mt 2 1 − mt 2
x0 = 1, y0 = 0,
x1 = 2, y1 = −2,
9
x2 = − ≡ −2, y2 = −2,
7
19
x3 = − ≡ −2, y3 = 2,
17
x4 = 2, y4 = 2.
This shows that there are, together with (−1, 0), exactly six F5 -rational points on
the Pell conic x 2 − 2y 2 = 1.
Counting the number of Fp -rational points on an arbitrary Pell conic x 2 − my 2 =
1 is not hard (we are still assuming that p 2m). The number of Fp -rational points
on P depends on whether the equation mt 2 = 1 has a solution in Fp . Such a solution
exists if and only if m is a square in F× ×
p ; in fact, if m = n in Fp , then mt = 1 for
2 2
Theorem 3.3 Let m be an integer and p an odd prime not dividing 2m. Then the
number of Fp -rational point on the Pell conic P : x 2 − my 2 = 1 is given by
p−1 ifm is a square in F×
p,
#P(Fp ) =
p+1 otherwise.
This property is a consequence of the fact that (Z/pZ)× is cyclic. Recall that
an integer g is called a primitive root modulo N if each coprime residue class a
modulo N has the form a ≡ g k mod N. It is known that there exist primitive roots
modulo every prime. A primitive root modulo an odd prime p is always a quadratic
p−1
nonresidue: If g ≡ h2 mod p, then g 2 ≡ hp−1 ≡ 1 mod p, which would imply
that the powers of g represent at most half of the coprime residue classes modulo p.
For the same reason, all odd powers g 2k+1 are quadratic nonresidues since if
g 2k+1 ≡ h2 mod p, then g ≡ (g −k h)2 mod p would be a quadratic residue. Thus
k
g is a quadratic residue modulo p if and only if k is even. But now multiplicativity
follows: If, e.g., a and b are quadratic nonresidues modulo p, then a ≡ g k and
b ≡ g h mod p for odd exponents k and h, hence ab ≡ g k+h mod p is a quadratic
residue.
The existence of primitive roots also implies Euler’s criterion:
Proposition 3.5 (Euler’s Criterion) For all integers a not divisible by the prime p
we have
a p−1
≡ a 2 mod p.
p
p−1
If a is a quadratic residue, then a ≡ g 2k mod p, hence a 2 ≡ g (p−1)k] ≡
p−1
1 mod p; if a is a quadratic nonresidue, then a ≡ g 2k+1 mod p, hence a 2 ≡
p−1 p−1 p−1
g (p−1)k g 2 ≡ g 2 ≡ −1 mod p. This follows from the fact that x ≡ g 2 mod
3.2 The Symbols of Legendre, Kronecker, and Jacobi 57
The Kronecker symbol is a slight modification of the Legendre symbol and will
turn out to be useful for describing the behavior of prime numbers in quadratic
number fields. The numerator of a Kronecker symbol is restricted to discriminants
Δ of quadratic number fields. For odd prime numbers p, the Kronecker symbol
(Δp ) coincides with the ordinary Legendre symbol. If Δ is odd, we set, in addition,
( Δ2 ) = +1 or −1 according as Δ ≡ 1 mod 8 or Δ ≡ 5 mod 8. In other words, we
define ( Δ2 ) = ( Δ2 ), where ( Δ2 ) is a Kronecker symbol and ( Δ2 ) a Legendre symbol.
a · aj ≡ εj aj
for εj = ±1 and some aj ∈ Ap . Taking the product over all m such congruences
yields
am aj = εj aj .
Since no aj occurs twice, we must have aj = aj , and since this product is
coprime to p, it follows that
am ≡ εj mod p.
For determining ( 27 ), for example, we take the half system {1, 2, 3} modulo 7 and
write
2 · 1 ≡ +2 mod 7,
2 · 2 ≡ −3 mod 7,
2 · 3 ≡ −1 mod 7,
−1 p−1
−1 p−1
≡ (−1) 2 mod p implies the equation = (−1) 2 .
p p
Since the power of −1 on the right side only depends on the residue class of p mod
4, we find
p−1
Proposition 3.7 We have ( −1
p ) = (−1)
2 . In particular, the Legendre symbol ( −1 )
p
for primes p ≥ 3 only depends on the residue class of p mod 4; in fact, for positive
prime numbers p we have
−1 +1 if p ≡ 1 mod 4,
=
p −1 if p ≡ 3 mod 4.
In order to become familiar with Gauss’s Lemma we now use it for giving a
second proof of this proposition. To this end we write p = 2n + 1 and multiply the
representatives of the half system {1, 2, . . . , n} by −1:
−1 · 1 ≡ −1 mod p,
−1 · 2 ≡ −2 mod p,
... ...
−1 · n ≡ −n mod p.
p−1
Gauss’s Lemma then tells us that ( −1
p ) = (−1) = (−1)
n 2 .
3.2 The Symbols of Legendre, Kronecker, and Jacobi 59
In a similar way we can now determine the Legendre symbol ( p2 ). We first assume
that p = 4m + 1 and write
2 · 1 ≡ 2 mod p,
2 · 2 ≡ 4 mod p,
... ...
2 · m ≡ 2m mod p,
2 · (m + 1) ≡ 2m + 2 ≡ −(2m − 1) mod p,
2 · (m + 2) ≡ 2m + 4 ≡ −(2m − 3) mod p,
... ...
2 · 2m ≡ 4m ≡ −1 mod p.
Gauss’s Lemma requires the choice of a half system, but the resulting quadratic
character of a modulo p does not depend on this choice. Zolotarev and Frobenius2
have found a modification of Gauss’s Lemma that does not require choosing a half
system. Let n be an odd integer and a an integer coprime to n. Then multiplication
by a induces a permutation πa of the residue classes modulo n. Each permutation
of finitely many objects can be written (in many different ways) as a product of
transpositions (permutations that switch two elements). The sign of a permutation
is −1 or +1 according as this number of transpositions is odd or even.
For describing permutations
we can use the matrix and the cycle notation. The
permutation π = 12 21 33 of the set {1, 2, 3} maps 1 to 2 and 2 to 1, thus switches
1 and 2, and leaves 3 fixed. We can write π also as the product of the cycles (1 2)
and (3), where the cycle (1 2) maps 1 to 2 and 2 to the beginning 1 of the cycle,
whereas (3) leaves 3 (and all the other elements) fixed. We can even omit (3) and
simply write π = (1 2) when we demand that elements that do not occur in a cycle
are fixed.
Multiplication by 2 on Z/7Z induces the permutation π2 = 00 12 24 36 41 53 65 . We
can also write π2 as a product of cycles: π2 = (124)(365). Decomposing these
cycles into transpositions (here we read from right to left; see Exercise 3.7) we find
π2 = (12)(24)(36)(65). Thus π2 has sign +1.
We now define the Zolotarev symbol [ an ] for odd integers n > 1 by
a
= sign πa .
n
2 See [136] and [41]; our presentation is a simplification of the one given in [62].
3.2 The Symbols of Legendre, Kronecker, and Jacobi 61
This follows from the isomorphism Z/mnZ Z/mZ × Z/nZ, i.e., the Chinese
Remainder Theorem. If α : A −→ A and β : B −→ B are permutations, then α ×β
denotes the induced permutation on A × B. Clearly (see Exercise 3.8)
on (Z/pZ)× whose sign is given by sign πg = (−1)p−2 = −1. This can be seen by
writing the permutation as a product of cycles:
1 2 3 4 567
14 13 12 11 10 9 8
2 4 6 8 10 12 14
13 11 9 7 5 3 1
The vertical pairs coincide with the original pairs except that some pairs are flipped.
This is because if (a, p − a) is such a pair and if 2a ≡ b mod p, then 2(p − a) ≡
p − b mod p.
Now we perform the permutation γ that interchanges the entries at the top and
at the bottom if the number on top is larger than the one at the bottom; observe that
the number of swaps is the number of sign changes in Gauss’s Lemma:
2 467 5 3 1
13 11 9 8 10 12 14
Finally we apply a permutation σ that puts the vertical pairs in the original order.
Since we are always changing the place of two residue classes at the same time,
sign σ = +1:
1 2 3 4 567
14 13 12 11 10 9 8
Using Gauss’s Lemma for composite values we now can determine the value of
( −1
m ) for all positive odd integers m:
The proof via Gauss’s Lemma for prime moduli (see Prop. 3.7) carries over word
for word. The following result follows painlessly from this proposition:
Proposition 3.13 Let Δ be a quadratic discriminant, and set N = |Δ|. Then
Δ +1 if Δ > 0,
= sgn(Δ) = (3.4)
N −1 −1 if Δ < 0.
64 3 The Modularity Theorem
As a corollary we observe that if Δ < 0, then there always exists a prime number
p < |Δ| such that ( Δ p ) = −1. This is also true for positive discriminants (see
Theorem 3.21), but in this case we seem to need more than just the modularity of
the Kronecker symbol ( −4 · ).
p 5 7 11 13 17 19 23 25 29 31
( p3 ) −1 −1 +1 +1 −1 −1 +1 +1 −1 −1
The pattern is obvious: The values have period 12. Numerical experiments with
other small integers a suggest the following conjecture due to Euler:4
Theorem 3.14 (Euler’s Modularity Conjecture) For each nonzero integer a there
a
exists a modulus N such that the Jacobi symbol ( m ) for natural numbers m only
depends on the residue class of m modulo N. In other words: For all natural
numbers m and n we have
a a
= if m ≡ n mod N. (3.5)
m n
In fact, we can always choose N = 4|a|. If a > 0, Eq. (3.5) also holds if m ≡
−n mod N.
Euler formulated this conjecture for prime numbers m and n, and of course without
using Legendre or Jacobi symbols.
The following result holds in many similar situations in which some notion of
modularity shows up:
a
Proposition 3.15 If Euler’s Modularity Conjecture for ( m ) for the moduli N1 and
N2 , then it also holds modulo N = gcd(N1 , N2 ).
Proof Assume that the Jacobi symbol ( a· ) is modular for the moduli N1 and N2 ,
and let N = gcd(N1 , N2 ). If m is a natural number coprime to 2a, then we have to
show that ( ma
) = ( m+N
a
).
To this end we write N = rN1 −sN2 , where we assume without loss of generality
that r, s > 0 (if not, we simply switch N1 and N2 ). Then
a a
= modularity moduloN1
m m + rN1
a
= modularity moduloN2
m + rN1 − sN2
a
= N = rN1 + sN2 .
m+N
This completes the proof.
This property allows us to define the conductor of the Kronecker symbol as the
smallest positive integer N for which ( Δ· ) is modular.
We have already seen that the Kronecker symbol ( −4 · ) is defined modulo 4; since
−4 −4
−1 = ( 3 ) = ( 5 ) = +1, the conductor cannot be a proper divisor of 4 and
thus is equal to 4. In a similar way we can see that the Kronecker symbols ( 8· ) and
( −8
· ) have conductor 8. We will prove below that the Kronecker symbol ( m ) has
Δ
conductor N = |Δ|.
Next we show that it is sufficient to prove Euler’s Modularity Conjecture for
a = −1 and prime values of a:
Proposition 3.16 Assume that the Jacobi symbols ( a· ) and ( b· ) are defined modulo
4|a| and 4|b|, respectively. Then the Jacobi symbol ( ab
· ) is defined modulo 4|ab|.
As a corollary we obtain
Corollary 3.17 If Euler’s Modularity Conjecture holds for a = −1 and for prime
values of a, then it holds in general.
This Modularity Theorem is equivalent to the quadratic reciprocity law and should
be seen as its essential content. Legendre’s formulation of the reciprocity law, which
determines the value of the product ( pq )( pq ), is an historical accident.
Kronecker was the first to emphasize that the heart of the quadratic reciprocity
law is not Legendre’s formula
p q p−1 q−1
= (−1) 2 · 2 . (3.6)
q p
In connection with higher reciprocity laws and the class fields of complex multipli-
cation [70] he pointed out that Euler’s formulation catches the essence of quadratic
reciprocity better than that of Legendre:
Very early on Euler had made the observation that the prime divisors of quadratic forms
with discriminant D are contained in certain linear forms mD + α, but only in 1783 he
formulated this observation, which was highly important for the development of number
theory, in the remarkable way which gave rise to the name reciprocity law.† The elegance
of the correlation, which was—rightly—emphasized, pushed the meaning and the aim of
Euler’s original observation to the background. When I recently found a specific new law
by applying the arithmetic theory of singular modules to the power residues of complex
numbers I was reminded of this first formulation with which Euler had published the
essential content of the quadratic reciprocity law; and since this law in the theory of power
residues is particularly important not only because of its analogy with the historical point of
departure but also because it suggests a new phase of the development of reciprocity laws,
I would like to present this observation briefly to the Academy today.
† Compare my remarks in the Monatsbericht from April 1875, p. 268. [Werke II, p. 3–4].
because m + 4a ≡ m mod 4.
If a = 2b is even we may assume that a is squarefree, so b is odd. Then
a 2 b 2 m + 8b
= (−1) 2 · 2
m−1 b−1
=
m + 4a m + 8b m + 8b m b
2 m 2 b a
= (−1) 2 · 2
m−1 b−1
= = .
m b m m m
Modularity Implies Reciprocity The modularity of the Kronecker symbol with
conductor 4 implies that ( −1
m ) only depends on the residue class of m mod 4. Since
( −1
3 ) = −1 and ( −1
5 ) = +1, we have ( −1
m ) = +1 or −1 according as m ≡ 1 mod 4
or m ≡ −1 mod 4. But this is the exact content of the first supplementary law.
Similarly, the second supplementary law follows from the fact that ( m2 ) only
depends on the residue class of m modulo 8; this implies that ( m2 ) = +1 when
m ≡ ±1 mod 8 and ( m2 ) = −1 otherwise, which is the second supplementary law.
For deriving (3.7) in the case m ≡ n mod 4 and m > n from the modularity
theorem, for example, we set a = m−n
4 and verify that
a 4a m − n −n a 4a m − n m
= = = , = = = .
m m m m n n n n
This implies ( −n m −1
m ) = ( n ). If m ≡ 1 mod 4, we have ( m ) = +1, hence ( n ) = ( n ),
m m
m n 4a − n 4a − m a a
= = = 1,
n m n m n m
where the last equality follows from the modularity conjecture since m ≡ −n mod
4a and a > 0.
68 3 The Modularity Theorem
Before we turn to the proof of Euler’s Modularity Conjecture we look at yet another
special case.
12.
The fact that the conductor is not a proper divisor of 12 follows from
12 12 12 12
−1 = = =1 and −1= = = 1.
5 11 7 11
The proof of the general case5 proceeds in the same way. We claim that if m is an
a
odd natural number coprime to a, then the Jacobi symbol ( m ) only depends on the
residue class of m modulo 4a; in particular we claim that ( m a
) = ( m+4a
a
). We may
assume that a is positive: If a is negative, the claim follows from the observation
(m a
) = ( −1 −a
m )( m ) since the symbols on the right only depend on m modulo 4 and
modulo 4|a|, hence modulo 4|a|.
Now consider the half system A = {1, 2, 3, . . . , n} modulo m = 2n + 1. The
number of sign changes is equal to the number of integers ak lying in the intervals
(2b−1)m
( m2 , m), ( 3m
2 , 2m), . . ., until ( 2 , bm), where b = a2 or b = a−12 according as
a is even or odd.
Dividing through by a we see that this number is the same as the number of
integers in the intervals
m m 3m 2m (2b − 1)m bm
, , , , ..., , . (3.8)
2a a 2a a 2a a
5 This proof is lifted from Davenports beautiful book [27]; its basic idea goes back to the proof
( 2a , 2 + 2a ) minus the point a , so this union contains exactly two integers. Similar
m m m
arguments show the same for the other intervals, and now the claim follows.
We first prove the claim for prime discriminants. We have already proved the
claim if Δ ∈ {−4, ±8}, so we may assume that Δ is an odd prime number. Since
( p· ), where p ≡ 1 mod 4 is prime, is defined modulo p, its conductor is either N =
p
p or N = 1. In the second case the symbols ( m ) would all have the same values,
p
and by multiplicativity we conclude that we must have ( m ) = 1 for all natural
numbers m coprime to p. If p ≡ 5 mod 8 we obtain the desired contradiction from
the observation ( p2 ) = −1. Thus it remains to prove, for each prime p ≡ 1 mod 8,
the existence of a natural number q with ( pq ) = −1.
The existence of such primes (satisfying a few additional conditions) played a
large role in the first proofs of the quadratic reciprocity law. Legendre’s proof was
incomplete since he did not succeed in proving the existence of such primes, and
Gauss gave a highly ingenious proof of the existence of such a prime q (with the
additional condition that q < p, which he needed for his induction proof to work)
in his first proof of the quadratic reciprocity law.
Let us formulate the existence of such primes in the following form:
Theorem 3.21 Let a be a nonzero integer. If ( pa ) = +1 for all prime numbers p a,
then a is a square number.
70 3 The Modularity Theorem
• a is even. Then we may assume that a = 2b for b odd. Choose an integer n with
n
( |b| ) = −1; adding multiples of b to n we can make sure that n ≡ 1 mod 8.
As above, the quadratic reciprocity law and the first supplementary law implies
( nb ) = −1, and since n ≡ 1 mod 8 we have ( n2 ) = +1, hence ( an ) = ( 2b
n ) = −1.
But then n must have a prime factor p such that ( pa ) = −1.
We call a discriminant Δ = disc k of a quadratic number field k a prime
discriminant if Δ is one of −4, ±8, p, or −q for primes p ≡ 1 mod 4 and
q ≡ 3 mod 4. It is easy to see that each discriminant can be factored into prime
discriminants:
Theorem 3.22 Each discriminant of a quadratic number field can be written
uniquely (up to order) as a product of prime discriminants.
The proof is not difficult. First observe that either Δ ≡ 1 mod 4 is odd, or Δ
is divisible exactly by 4, or exactly by 8. In the second case, Δ = 4m for some
m ≡ 3 mod 4, hence Δ = −4Δ1 for some Δ1 ≡ 1 mod 4. In the last case we can
always write Δ = ±8Δ1 for some Δ1 ≡ 1 mod 4.
Since −4 and ±8 are prime discriminants it is sufficient to prove that any
squarefree odd integer Δ1 ≡ 1 mod 4 is the product of prime discriminants. To this
end, write Δ1 = p1 · · · pr q1 · · · qs for primes pj ≡ 1 mod 4 and qj ≡ 3 mod 4.
Since Δ1 ≡ 1 mod 4, the number of primes factors qj ≡ 3 mod 4 is even. But now
Δ1 = p1 · · · pr (−q1 ) · · · (−qs )
This shows that κ1 is defined modulo N1 and modulo n1 N2 ; but then it is defined
modulo gcd(N1 , n1 N2 ) = n1 contradicting our assumptions.
The strong modularity theorem now follows by induction on the number of prime
discriminants dividing Δ.
p−1
f (t)
Np (C) = #C(Fp ) = p + .
p
t =0
72 3 The Modularity Theorem
This is a consequence of the fact that there are as many quadratic residues as there
are nonresidues modulo p. For a formal proof, let n be a quadratic nonresidue
modulo p, and set S = ( pt ). Then −S = ( pn )S = ( nt p ). But if t runs through
a system of coprime residue classes modulo p, so does nt, hence the last sum is S.
Now −S = S implies that S = 0.
For polynomials f with degree 1 we have
Proposition 3.26 Let f (t) = at + b with p a. Then
p at + b
= 0.
p
t =0
This is clear since at +b runs through a complete system of residue classes if t does.
Jacobsthal Sums for Quadratic Polynomials For quadratic polynomials f (x) =
ax 2 + bx + c we assume that p a and that p is odd; completing the square we
3.4 Fp -Rational Points on Curves 73
It is therefore sufficient to compute the character sum for polynomials of the form
f (t) = t 2 − D. We now set
p−1 2
t − D
ψ(D) = .
p
t =0
Clearly ψ(0) = p − 1.
Lemma 3.27 We have ψ(a 2 D) = ψ(D) for all integers a with p a.
This is easily seen to be true as
p−1 2
− a 2D
p−1 2 2
− a 2D
p−1 2
t a s s − D
ψ(a 2 D) = = = = ψ(D),
p p p
t =0 s=0 s=0
where we have used that s runs through a complete system of residues modulo p
when t = as does.
Next we show:
Lemma 3.28 We have ψ(1) = −1.
p−1 2
− 1
p−1
t t − 1 t + 1
ψ(1) = =
p p p
t =0 t =0
p−1
s s + 2
= s =t −1
p p
s=1
p−1
s −1 s + 2
p−1
1 + 2s −1
= =
p p p
s=1 s=1
p−1
1 + 2r
= rs ≡ 1 mod p
p
r=1
p−1
1 + 2r
= −1 + = −1
p
r=1
When studying a family of objects it is often a good idea to consider them all at
once; in the present case we can form the sum over all ψ(D) and find
p−1 2
− D t2 − D
p p p−1 p
t
ψ(D) = = =0 (3.9)
p p
D=1 D=0 t =0 t =0 D=0
by Proposition 3.26.
We will now compute this sum in a different way. We know that ψ(D) only
depends on ( D
p ); thus if n denotes an arbitrary quadratic nonresidue modulo p, then
we have
p
p−1
ψ(D) = ψ(0) + · ψ(1) + ψ(n) .
2
D=1
Since this sum is 0 and since ψ(0) = p − 1, we deduce that ψ(1) + ψ(n) = −2.
Thus ψ(1) = −1 implies ψ(n) = −1, and we have shown
Proposition 3.29 We have
p−1 2
t − D −1 if p D,
ψ(D) = =
t =0
p p−1 if p | D.
p−1
t t 2 − k
φp (k) = φ(k) = . (3.10)
p p
t =1
Now we claim
Theorem 3.31 Let p ≡ 1 mod 4 be a prime number, and write p = a 2 + 4b2 . Then
p−1
t t 2 − 1
φ(1) = = 2a, (3.11)
p p
t =1
p−1
φ(k)2 = 2(p − 1)(x 2 + y 2 ).
k=1
We now compute φ(k)2 directly. In our calculation we need the following
Lemma 3.32 We have
p−1
k k + b −1 if p b,
=
k=1
p p p−1 if p | b.
p−1
k k + b
p−1 2
+ bk
p−1
k 4k 2 + 4bk
= =
p p p p
k=1 k=1 k=1
p−1
(2k + b)2 − b 2
p−1 2
t − b2
= = .
p p
k=1 t =1
Now we have
p−1 p−1
s s 2 − k
p−1
t t 2 − k
p−1
φ(k)2 =
p p p p
k=1 k=1 s=1 t =1
p−1 st p−1 s 2 − k t 2 − k
= .
p p p
s,t =0 k=1
p−1 2
− k t 2 − k
p−1
s l l + t 2 − s 2 −1 if s ≡ ±t,
= =
k=1
p p
l=1
p p p−1 if s ≡ ±t.
76 3 The Modularity Theorem
Thus
p−1 t2 −t 2 st
φ(k) = (p − 1)
2
+ − = 2(p − 1)p.
p p p
k=1 s=t s=−t s=±t
p−1 2
In fact, t =1 ( tp ) = p − 1, so the sums in the brackets have value 2(p − 1). Since
st p−1
s
p−1
t
= =0
p p p
s,t s=1 t =1
we have
t2
= −2(p − 1).
p
s=±t
Thus the whole sum is 2(p − 1)2 + 2(p − 1) = 2(p − 1)p as claimed.
It remains to determine the sign of a. To this end we have to compute φ(1)
modulo 4. Let R denote the number of residue classes t with ( t p−t ) = +1 and
3
vanishes for the residue classes t ≡ ±1 mod p. The two residue classes t = ±i,
p−1
where i 2 ≡ −1 mod p, give rise to the value ( t p−t ) = (−1) 4 ( −2
3
p ) = +1. The
remaining p−5 residue classes can be divided into 4-tuples consisting of the residue
classes (r, −r, s, −s), where rs ≡ 1 mod p. The residue classes in each 4-tuple
clearly give rise to the same value ( t p−t ); thus R ≡ 2 mod 4 and N ≡ 0 mod 4.
3
1 2n
a≡ mod p.
2 n
6 See the beautiful article [24] by Cosgrave and Dilcher for an introduction to such congruences.
3.4 Fp -Rational Points on Curves 77
This is an almost incredible congruence, but we can easily verify it for some
small primes p:
2n 1 2n
p a n 2 n mod p
5 1 2 1
13 −3 20 −3
17 1 70 1
29 5 3432 5
The key to the proof is a useful congruence also going back to Gauss:
Lemma 3.34 For each odd prime number p we have
p−1
0 mod p if (p − 1) m,
x ≡
m
(3.12)
x=1 −1 mod p if (p − 1) | m.
Set S = x m and let g denote a primitive root modulo p. Then
p−1
gm S = (gx)m ≡ S mod p,
x=1
since gx runs through a coprime system of residue classes modulo p when x does.
Thus p divides (g m − 1)S. Now g m ≡ 1 mod p if and only if m is a multiple of
p − 1; thus if (p − 1) m, then p | S. If (p − 1) | m, on the other hand, then
x p−1 ≡ 1 mod m implies that
p−1 p−1
S= xm ≡ 1 = p − 1 ≡ −1 mod p
x=1 x=1
as claimed.
We now apply this to the character sum (3.11); using p = 4n + 1 we find
p−1 3
−t
p−1
t
φ(1) = ≡ (t 3 − t)2n
p
t =1 t =1
p−1 2n p−1 2n
2n 3k 2n 2n+2k
≡ t (−t)2n−k = (−1)k t
k k
t =1 k=0 t =1 k=0
2n p−1
2n
≡ (−1)k t 2n+2k mod p.
k
k=0 t =1
78 3 The Modularity Theorem
2n
φ(1) ≡ −(−1)n mod p.
n
for an odd integer n. Let Nn (p) denote the number of Fp -rational points on An , i.e.,
the number of solutions of the congruence
for an odd prime number p. Then N1 (p) = 2 since x12 ≡ 1 mod p has exactly two
solutions. Next N2 (p) = p − 1 since for solving the congruence
Proposition 3.35 The number Nn (p) of Fp -rational points on An has the following
properties:
1. It satisfies the recursion
Here we have used the fact that the sums ( tp1 ), ( t1pt2 ), . . . , ( t1 t2 ···t
p
n−1
), etc.
vanish. For a proof it is, after reordering the ti , sufficient to show that
t t
1 s
··· =0
p p
t1 +...+tn =1
since all the character sums in the brackets vanish. This finishes the proof of (3.15).
A more conceptual proof of the vanishing of these character sums is the
following: Let a denote a quadratic nonresidue modulo p and set u1 = at1 , u2 = t2 ,
. . . , us−1 = ts−1 and un via u1 +. . .+un = 1. If S denotes the character sum above,
then aS = S as in the proof of Lemma 3.25, and this implies S = 0.
Proof of the Quadratic Reciprocity Law We know by (3.14) that the number of
Fp -rational points on Aq , where q is an odd prime different from p, is
q−1
p
Nq = pq−1 + p 2 ≡1+ mod q (3.16)
q
−1 q−1 t t · · · t
2 1 2 n
Nq = pn−1 +
p p
t1 +...+tn =1
p−1
q−1 t q p−1 q−1
q
≡ 1 + (−1) 2 · 2 = 1 + (−1) 2 · 2 mod q,
p p
where we have used Fermat’s Little Theorem, the first supplementary law, the fact
that distinct shifts give rise to q identical terms that vanish modulo q, and finally
q
that ( tp ) = ( pt ) = ( pq ) since t = q1 .
Comparing this with (3.16) yields the congruence
p p−1 q−1
q
≡ (−1) 2 · 2 mod q,
q p
which implies the quadratic reciprocity law since for odd primes q, a congruence of
the form ±1 ≡ ±1 mod q implies equality.
The quadratic reciprocity law can often be used for proving that certain Diophantine
equations do not have solutions in integers. A prominent example is the Fermat
equation for even exponents:
Theorem 3.36 (Terjanian) Let p be an odd prime number, and assume that x 2p +
y 2p = z2p for integers x, y, and z. Then 2p | x or 2p | y.
As for the usual Fermat equation x p +y p = zp , the “second” case where p | xyz
is much more difficult than the “first case”
Clearly we may assume that x, y, and z are pairwise coprime. Since x and y
cannot both be odd (otherwise z2p ≡ 2 mod 4), we may assume that x is even and
y and z are odd. Now
z2p − y 2p
x 2p = z2p − y 2p = (z2 − y 2 ) · . (3.17)
z2 − y 2
82 3 The Modularity Theorem
mp −np
Set Qp (m, n) = m−n ; then x 2p = (z2 − y 2 )Qp (z2 , y 2 ). For coprime integers m
and n we have
35 − 1 53 − 33 74 − 1
= 112 , = 72 , = 202 , 83 − 73 = 132 .
3−1 5−3 7−1
The following theorem provides us with a large class of nonsquares of the form
Qp (m, n):
Theorem 3.37 Let p be an odd prime number, and let m and n be coprime integers
m ≡ n ≡ 1 mod 4. Then Qp (m, n) is not a square number.
Theorem 3.37 (and therefore Terjanian’s Theorem 3.36) follows immediately
from the following calculation of a Jacobi symbol:
Theorem 3.38 If q is an odd integer, and if m and n are coprime natural numbers
with m ≡ n ≡ 1 mod 4, then
Q (m, n) p
p
= (3.19)
Qq (m, n) q
mp −np
Lemma 3.39 Let Qp (m, n) = m−n .
(a) If p is odd and m = n, then Qp (m, n) is positive.
(b) If p = aq + r, then
(c) If p = aq − r, then
Qp (m, n) ≡ p mod 8.
Proof
(a) Since Qp (−m, −n) = Qp (m, n) for odd integers p we may assume that m ≥ 0.
Next Qp (0, n) = np−1 > 0 since n = 0 in this case. If m > n ≥ 0, then both
numerator and denominator of Qp (m, n) are positive, and if 0 < m < n, then
both numerator and denominator of Qp (m, n) are negative, hence Qp (m, n) >
0.
(b) Using p = aq + r we find
as claimed.
(c) If p = aq − r, then we have7
mp − np maq−r − naq−r
Qp (m, n) = =
m−n m−n
maq−r − mq−r naq−q mq−r naq−q − naq−r
= +
m−n m−n
maq−q − naq−q mq−r nr − nr nq−r
= mq−r + naq−q−r
m−n m−n
= mq−r Qa−1 (mq , nq ) + naq−q−r [−mq−r Qr (m, n) + Qq (m, n)]
= mq−r [Qa−1 (mq , nq ) + naq−q−r Qq (m, n)] − naq−q−r mq−r Qr (m, n).
Since
the claim now follows since the first factor is clearly an integer.
(d) Let > 1 be a common prime divisor of Qp (m, n) and Qq (m, n) with p+q ≥ 1
minimal. If p = q, then p = q = 1 since gcd(p, q) = 1, and then Q1 (m, n) =
1: Contradiction. Since we may assume that p > q we can write ep = aq + r.
Equation (3.20) and the fact that Qq (m, n) divides Qq (ma , na ) shows that d
divides nm−r Qr (m, n). If | n, then | Qp (m, n) implies | m, which is
impossible. Thus | Qr (m, n). Thus Qp (m, n) and Qr (m, n) have a common
divisor, and since m + r < m + n this contradicts the minimality of m + n.
(e) Observe that m2 ≡ 1 mod 8; now
in fact, the first step is an application of the quadratic reciprocity law together with
the congruences Qp (m, n) ≡ p and Qq (m, n) ≡ q mod 4 that we have proved
in Lemma 3.39.(f), the second step is our assumption (Qp (m, n)/Qq (m, n)) =
−(p/q), and the last step is another application of the quadratic reciprocity law.
This shows that the result also fails for the pair (q, p).
Thus we may assume that p > q. Write p = aq ± r with 0 ≤ r < q and r odd.
If p = aq + r, then by (3.20) we have
Q (m, n) nm−r Q (m, n) Q (m, n)
p r r
= =
Qq (m, n) Qq (m, n) Qq (m, n)
3.5 Terjanian’s Theorem 85
hence the result also fails for the pair (r, q). Since r + q < p + q, this contradicts
the minimality of m + n.
If p = aq − r for some odd natural number r, then by (3.21) we have
since p − q and q − r are even, both np−q and mq−r are squares, hence
Q (m, n) −Q (m, n) −Q (m, n)
p r r
= = .
Qq (m, n) Qq (m, n) Qq (m, n)
Therefore
−Q (m, n) Q (m, n) p aq − r −r
r p
= =− =− =− ,
Qq (m, n) Qq (m, n) q q q
so the pair (r, q) with r + q < p + q is another pair for which the theorem fails, and
this again contradicts the minimality of p + q.
For an alternative proof of Theorem 3.38 see Exercises 3.14–3.16.
3.5.1 Summary
Fig. 3.1 The cubic curves y 2 = x 3 with a cusp and y 2 = x 3 + x 2 with a double point at the origin
3.6 Exercises
3.4. Assume that a = bc2 for nonzero integers a, b, c. Show that ( pa ) = ( pb ) for
all primes p c.
2 3
3.5. Compute ( 15 ) and ( 35 ) using Gauss’s Lemma.
3.6. Prove that ( −1
m−1
m ) = (−1)
2 for positive odd integers using the corresponding
3.9. Show that Gauss’s Lemma does not do what it is supposed to do when we
restrict to coprime residue classes: Let N = mn be the product of two coprime
integers m, n > 1. Let A = {a1 , . . . , aφ(N) } denote a (coprime) half system
modulo N. If a is an integer coprime to N and a · aj ≡ (−1)sj aj for some
aj ∈ A, then (−1) sj = 1.
3.10. (Romanian Team Selection Test 2008) Let m, n ≥ 2 be integers with (2m −
1) | (3n − 1). Prove that n is even.
3.11. Show that φ(k) = 0 (see 3.10) for all primes p ≡ 3 mod 4.
3.12. Show that Jacobsthal sums φm are multiplicative: φm (1)φn (1) = φmn (1) for
coprime values of m and mn.
3.13. Let p be a prime number ≡ 1 mod 4. Show that the number of residue classes
t 3 −t
t mod p with 1 ≤ t ≤ p−1 2 for which ( p ) = −1 is a multiple of 4.
3.14. Prove that for odd coprime integers m ≡ n ≡ 1 mod 4 with mn ≡ 1 mod 8
we have
Q (m, n) m + n 2
2
= = . (3.22)
Qq (m, n) Qq (m, n) q
3.16. Prove Theorem 3.38 for odd natural numbers m, n with mn ≡ 1 mod 8 using
induction on q.
3.17. Let RR denote the number of pairs of consecutive quadratic residues modulo
an odd prime number p:
Deduce that
p−2 a a + 1
4RR = 1+ 1+ . (3.23)
p p
a=1
p−1 a
Expand the product and show, using a=1 p = 0, that
p−2
a −1 p−2
a + 1
=− and = −1.
p p p
a=1 a=1
3.18. Show that, for primes p ≡ 3 mod 4, the quadratic residues modulo p form a
half system. Show that Gauss’s Lemma holds trivially in this case.
3.19. Let p be an odd prime number. Show that the number Nn of Fp -rational points
on An for even integers n is given by
n−2
Nn = pn−1 − p 2
for all n ≥ 2.
3.6 Exercises 89
3.20. Show that here exist infinitely many positive odd integers m and n with m ≡
n mod 4 such that Q4 (m, n) is a square. Is it true that all such integers have a
common divisor > 1?
3.21. The following proof of the quadratic reciprocity law based on Gauss’s Lemma
is due to Christian Zeller [135]. We will give it for p = 5 and q = 23.
• Write down the absolutely smallest remainders of kp mod q and hq mod
p:
k 1 2 3 456 7 8 9 10 11
kp mod q 5 10 −8 −3 2 7 −11 −6 −1 4 9
h 12
hq mod p −2 1
In this chapter we will study the notion of divisibility in general domains. We will
restrict our attention to commutative domains R containing a unit1 1, i.e., an element
with the property 1r = r for all r ∈ R. Recall that a ring R is called a domain if it
does not contain any zero divisors, that is, if ab = 0 for elements a, b ∈ R implies
that a = 0 or b = 0. Subrings of fields are always domains, and every domain may
be interpreted as a subring of its field of quotients (see Exercise 4.3). Our goal is the
definition of units, primes, and irreducible elements and a first investigation of the
question in which quadratic number rings the theorem of unique factorization holds.
√ √
Here i = −1 denotes a primitive fourth and ρ = 12 (−1 + −3 ) a primitive cube
root of unity.
√
Proof Assume first that m ≡ 1, 2 mod 4, and let ε = a + b −m be a unit. Then
1 = Nε = a 2 + mb2 (the case Nε = −1 cannot occur since m > 0). For m > 1,
this implies a = ±1 and b = 0, and hence ε = ±1 (and of course ±1 are units). If
m = 1, there are four possibilities,
√ namely a = ±1, b = 0 and a = 0, b = ±1. All
these units are powers of i = −1. √
If m ≡ 3 mod 4, we set ε = 12 (a + b −m ) for integers a, b and find 4 =
a 2 + mb 2 as a necessary and sufficient condition for ε to be a unit. For m > 3, there
are again only the trivial solutions corresponding to ε = ±1; if m = 3, then we
obtain the units
√ √
−1 + −3 1 + −3
±1, ± , ± .
2 2
√
Setting ρ = −1+2 −3 (this is a cube root of unity since ρ 3 = 1), we find that Ek is
generated by −ρ (a primitive sixth root of unity).
The determination of the unit group of rings of integers in real quadratic number
fields boils down to solving the Pell equation t 2 − mu2 = ±4; we will prove in
Chap. 7 below that this equation has integral solutions√ whenever m ≥ 2 is not a
square.
√ At this point we only observe that ε = √1 + 2 is a unit with infinite order
in Z[ 2 ] (see Theorem 2.7): If we had√(1 + 2 )n = ±1 for some n ≥ 1, then
taking absolute values (after identifying
√ n 2 with the positive real square root
√ ofn 2),
we obtain
√ 1 = | ± 1| = |1 + 2 | > 1, and√similarly 1 = | ± 1| = |1 + 2| =
|1 − 2 |−n < 1 if n ≤ −1. In particular, Z[ 2 ] has infinitely many units.
John Pell (1611–1685) was an English mathematician. His name got attached to
the Pell equation through a mistake by Euler, who apparently confused him with
Lord William Brouncker. It was Brouncker who developed a method for solving
such equations in integers in connection with Fermat’s challenge in 1657 for the
English mathematicians. The proof that Brouncker’s method always leads to a
solution was given much later by Lagrange.
A method for solving the Pell equation similar to Brouncker’s had already been
developed by Indian mathematicians, in particular Brahmagupta (ca. 598–670) and
Bhaskara II (1114–1185); their contributions (see Plofker [104]) became known in
Europe only during the nineteenth century. We will present a method for solving the
Pell equation in Chap. 7.
Elements a, b ∈ R are called associated, if there is a unit e ∈ R × such that
a = be; we write a ∼ b and verify easily that this defines an equivalence relation
on R.
Irreducible and Prime Elements An element a ∈ R \R × is called irreducible if a
has only trivial divisors, that is, units and associates. More exactly: a is irreducible
in R if a = bc implies that b or c is a unit. An element p ∈ R \ R × is called prime if
94 4 Divisibility in Integral Domains
We have already seen that elements π ∈ Ok for which p = |Nπ| is a rational prime
are always irreducible. As a matter of fact, such elements are always prime. This
will follow easily from the theory of ideals that we will develop later; here we will
give a direct proof based on Proposition 4.7.
Proposition 4.8 If k is a quadratic number field with ring of integers Ok , then each
π ∈ Ok with prime norm is prime.
This is easy to see if Ok is a unique factorization domain (see the next section):
Elements with prime norm are irreducible, and in unique factorization domains,
irreducible elements are prime. In order to prove this for general rings Ok , we show
that the residue class ring Ok /πOk does not have zero divisors. In fact, we will
show that Ok /πOk Fp = Z/pZ is isomorphic to the field with p elements.
To this end, let {1, ω} be an integral basis of Ok ; then π = a + bω for integers
a, b ∈ Z. We claim that b is not divisible by π (and thus not divisible by p = |ππ |).
In fact, π | b implies π | a since a = π − bω, and taking norms, we find p | a 2 and
p | b2 . Since p is prime, this implies that p | a and p | b. But then π = a + bω
would be divisible by p, and hence π would be a unit: a contradiction.
Thus there exists an integer c ∈ Z with bc ≡ 1 mod p, and in particular, we
have bc ≡ 1 mod πOk . We find bω ≡ −a mod π, after multiplying through by c,
thus ω ≡ −ac mod πOk . If any γ = r + sω ∈ Ok is given, then we find γ ≡
r − sac mod πOk , and thus modulo π every element is congruent to an ordinary
integer. Reducing this number modulo p (and p is a multiple of π), we find that γ
is congruent to one of the numbers 0, 1, 2, . . . , p − 1 modulo π.
4.1 Units, Primes, and Irreducible Elements 95
Now it is easy to show that there are no zero divisors in the ring of residue
classes: If we had αβ ≡ 0 mod π and if A, B ∈ {0, 1, . . . , p − 1} are integers
with α ≡ A mod πOk and β ≡ B mod πOk , then π | AB; taking norms yields
p | A2 B 2 , and hence p | A or p | B. Thus A = 0 or B = 0, and therefore
α ≡ A = 0 mod π or β ≡ B = 0 mod π.
Proposition
√ 4.9 Let p be an odd prime number and Ok the ring of integers in k =
Q( m ). Then p is prime in Ok if and only if the congruence x 2 ≡ m mod p is not
solvable.
√ √ √
Proof If x 2 ≡ m mod p is solvable, then p | (x+ m )(x− m ), but p (x± m ).
Thus p is not prime.
Now we show that p remains prime in Ok if ( m p ) = −1. This case is not covered
by Proposition 4.8 since here N(p) = p2 is not prime. The idea for proving the
result is the same as in the proof of Proposition 4.8: We show that the residue classes
modulo p in Ok form a field. √
We will give the proof in the case where Ok = Z[ m ].√ Here the residue classes
modulo p in Ok are represented by the p2 elements a + b m with 0 ≤ a, b < p;
clearly every α ∈ Ok is congruent modulo p to one of these elements, and they are
pairwise distinct. These residue classes form a ring, and we want to show that they
form a√field. This will follow if we can write down an inverse for each residue class
a + b m mod p different from 0 mod p. Now
√
1 a−b m
√ = 2 ,
a+b m a − mb 2
GCD–2. Every common divisor of a and b divides d, i.e., if c | a and c | b for some
c ∈ R, then c | d.
Again we would like to emphasize the fact that this definition is well suited for
building a theory of greatest common divisors but cannot easily be used for finding
a greatest common divisor of two elements in some domain.
In unique factorization domains, the greatest common
divisor oftwo elements
can be written down explicitly. In fact, if a = u pαp and b = v pβp are the
prime factorizations
min(αp ,βof a and b (with units u, v ∈ R × ), then we can easily show
that d = p )
p is a greatest common divisor of a and b. One has to remark
that even in the case of the ordinary integers, finding the prime factorization of two
(large) integers can be very difficult.
Two elements a and b of some unique factorization domain R are called coprime
(or relatively prime) if their greatest common divisor is a unit. Observe that we
demand that R be a unique factorization domain. In fact, in domains without unique
factorization, a greatest common divisor need not exist, and if it does, it need not
have the properties we expect from a greatest common divisor, such as gcd(a, b)2 =
gcd(a 2 , b 2 ).
Proposition 4.12 If R is a unique factorization domain, if a, b ∈ R are coprime,
and if ab = ex n (n ≥ 2) for some unit e ∈ R × and some x ∈ R, then there exist
units e1 , e2 ∈ R × and elements c, d ∈ R such that a = e1 cn and b = e2 d n , where
cd = x and e1 e2 = e.
Proof We prove this by induction on the number of prime factors of a. If a is a unit,
then the claim follows with c = 1, d = x, e1 = a, and e2 = ea −1 .
Assume that the claim is true for all a ∈ R with at most t different prime factors,
and let p ∈ R be a prime with p | a. Assume that ph a (we write ph a if ph | a
and ph+1 a, i.e., if ph is the largest power of p that divides a). Since ph x n (here
we use the fact that a and b are coprime), we must have h = nk for some k ∈ N and
pk x. Thus a = ph a1 , x = pk x1 and a1 b = ex1n . By induction assumption, we
have a1 = e1 cn and b = e2 d n , and now the claim follows since a = e1 (cpk )n .
Corollary 4.13 If R is a unique factorization domain, if gcd(a, b) = p for elements
a, b, p ∈ R, where p is prime, and if ab = ex n (n ≥ 2) for some e ∈ R × and x ∈ R,
then there exist units e1 , e2 ∈ R × and c, d ∈ R with a = e1 pcn and b = e2 pn−1 d n
(after switching a and b, if necessary).
Proof Exercise 4.28.
Principal ideal domains will play a minor role in this chapter, mainly as a link in the
chain of inclusions
Euclidean Domains ⊂ Principal Ideal Domains ⊂ Unique Factorization Domains
98 4 Divisibility in Integral Domains
that we will use for constructing unique factorization domains. Both inclusions
are proper; for rings of integers in quadratic number fields (and in fact of general
number fields), the second inclusion is in fact an equality.
First we will have to explain the notion of a principal ideal domain. To this end,
consider a domain R; a subring I of R is called an ideal of R if I · R ⊆ I . Thus an
ideal is a subset of a domain that is closed with respect to addition (I + I ⊆ I ) as
well as with respect to multiplication by arbitrary elements of the domain R.
Observe that I is a subring of R if the weaker condition I · I ⊂ I is satisfied.
In the domain R = Z, it can be shown that each subring is an ideal. The following
example shows that this is not true for general domains: The set
√ √
M = Z + 2 m Z = {a + 2b m : a, b ∈ Z }
√
is a subring of Z[ m ], but not an ideal. This is because MR √ = R; in fact, 1 ∈ M
implies that each element of R is contained in MR. Since m ∈ R \ M, the subring
M is not an ideal.
It is very easy to write down examples of ideals. If we are given elements
a1 , . . . , an ∈ R, then the set of all R-linear combinations
I = (a1 , . . . , an ) := {a1 r1 + . . . + an rn : rj ∈ R}
factorization domain is a principal ideal domain; the best known example is the
domain C[x, y] of polynomials in two variables with complex coefficients; here,
(x, y) is not principal, as is easily seen.
Remark The fact that C[x, y] is a unique factorization domain follows from a well-
known theorem in algebra: If R is a UFD, then so is the polynomial ring R[y]. Since
R = C[x] is a UFD (this ring is even Euclidean—see Sect. 4.4), the claim follows.
Now we prove that principal ideal domains have unique factorization.
Theorem 4.15 Principal ideal domains are unique factorization domains.
Proof Assume that UFD–1 is not satisfied. Then there is an a1 ∈ R that cannot
be written as a product of irreducible elements (in particular, a1 is not irreducible).
Thus, a1 = a2 b2 (for non-units a2 , b2 ∈ R \R × ), where one of the factors, say a2 , is
not a product of irreducible elements. Thus, a2 = a3 b3 , etc., and we obtain a chain
of elements a1 , a2 , a3 . . . ∈ R with a2 | a1 , a3 | a2 , . . . , where ai and ai+1 are not
associated.
Now consider the ideal I = (a1 , a2 , . . .) generated by the ai . By assumption,
there is an element a ∈ R with I = (a), and thus there exist m ∈ N and ri ∈ R
such that a = r1 a1 + . . . + rm am . Since am | am−1 | · · · | a1 , we have am | a. Since
am+1 ∈ (a), there is an element r ∈ R such that am = ar, i.e., with a | am+1 . By
construction of the ai , we have am+1 | am , and hence am and am+1 are associated in
contradiction to the construction of the ai .
Now we show that irreducible elements are prime (UFD–2). To this end, let a ∈
R be irreducible, and let x, y ∈ R be given with a | xy and a x; then we have to
show that a | y. Now (a, x) = (d) for some d ∈ R; thus d | a and d | x. If we
had d ∼ a, it would follow that a | x in contradiction to our assumption. Since a
is irreducible, d must be a unit. Thus d −1 ∈ R, and therefore 1 = d −1 d ∈ (d) =
(a, x), i.e., there exist m, n ∈ R with 1 = ma + nx. Multiplication by y yields
y = may + nxy, and since a | xy, we find a | y. This is what we wanted to show.
An important property of principal ideal domains is the fact that they are Bézout
domains:2 A domain R is called a Bézout domain if for all a, b ∈ R there exists
a d ∼ gcd(a, b) such that d = ar + bs is an R-linear combination of a and b.
Principal ideal domains are always Bézout domains: Given a, b ∈ R, we form the
ideal I = (a, b); since R is a principal ideal domain, there is an element d ∈ R with
(a, b) = (d). We claim that d ∼ gcd(a, b). In fact, since a ∈ (d), there is a t ∈ R
with a = dt; this shows that d | a, and similarly we find that d | b, and hence d is
a common divisor of a and b. On the other hand, d ∈ (a, b) implies that there are
elements r, s ∈ R with d = ar + bs; if e is any common divisor of a and b, then e
2 Étienne Bézout (1730–1783) was a French mathematician, an author of textbooks. Bézout proved
the existence of Bézout elements for polynomial rings; in the case of integers, they already occurred
in the work of Bachet.
100 4 Divisibility in Integral Domains
In his Lectures on number theory [31, p. 20], Dirichlet (actually we do not know how
much of this is due to Dedekind) discusses the foundations of elementary number
theory and then writes the following:
It is now clear that the whole structure rests on a single foundation, namely the algorithm for
finding the greatest common divisor of two numbers. [. . . ] any analogous theory, for which
there is a similar algorithm for the greatest common divisor, must also have consequences
analogous to those in our theory.
Now the natural numbers f (rj ) cannot become arbitrarily small; thus there exists
an index n ∈ N with rn+1 = 0. We then claim that rn ∼ gcd(a, b). In fact, it follows
from the last row that rn | rn−1 , and then the next to last row gives rn | rn−2 , and in
this way we climb the ladder until we reach rn | r1 , rn | b and rn | a. Thus rn is a
common divisor of a and b.
Conversely, if d is any common divisor of a and b, then the first row tells us that
d | r1 , the second d | r2 , etc., and eventually we reach d | rn . In other words, rn is a
greatest common divisor.
It may be said that the definition of the greatest common divisor is chosen in such
a way that the proof of this fundamental result on the Euclidean algorithm becomes
essentially trivial.
We obtain the Bézout elements r, s ∈ R as follows: We start with rn = rn−2 −
rn−1 qn−1 and replace the rj with the maximal index by the linear combination in
the preceding row, in our case rn−1 by rn−1 = rn−3 − rn−2 qn−2 . Now we have
written rn as a linear combination of rn−2 and rn−3 . Next we replace rn−2 by rn−2 =
rn−4 − rn−3 qn−3 , etc., until we finally have written rn as an R-linear combination
of a and b.
4.4.1 Summary
4.5 Exercises
4.1. In the ring R = Z[x] of polynomials, show that x | f (x) for some f ∈ R if
and only if f (0) = 0. Show more generally that (x − a) | f (x) if and only if
f (a) = 0.
Show that these properties continue to hold in polynomial rings R = K[x]
over fields K. What about polynomial rings over domains or arbitrary rings?
4.2. Show
√ that (1.12) is also a counterexample to the Four Numbers Theorem in
Z[√−5 ], whereas (1.11) is compatible with the Four Numbers Theorem in
Z[ −2 ].
4.3. Let R be a domain. Consider the set S of pairs (p, q) and define an
equivalence relation on S by (p, q) ∼ (r, s) if and only if ps = qr. On
the set K of equivalence classes, define addition and multiplication via
• (p, q) + (r, s) = (ps + qr, qs);
• (p, q) · (r, s) = (pr, qs).
Show that this is well defined and that it makes K into a field with neutral
elements (0, 1) for addition and (1, 1) for multiplication.
Show that the map ι : R −→ K : r → (r, 1) is an injective ring
homomorphism. The field K is called the quotient field of R, and we may
regard R as a subring of K via the embedding ι.
4.4. Let R ⊆ S be domains, and let a, b, m ∈ R. Does a ≡ b mod m in R imply
the same congruence in S? Is the converse true?
4.5. Each fraction in Q can be reduced
√
to lowest terms in a unique way; in
√
Z[ −5 ], on the other hand, 1+ 2 −5 = 1−√ 3
−5
, and both fractions are reduced
to lowest terms. Find more such examples.
4.6. Let α, β ∈ Ok ; show that α | Nα. If moreover α | β, then Nα | Nβ (even in
Z).
4.5 Exercises 103
√ √
4.7. Show that if −2 | y in Z[ −2] √ for some y ∈ Z, then 2 | y.
Show more generally that m | y, where m is squarefree, always implies
that m | y.
Find a counterexample to the claim that α | y always implies Nα | y.
4.8. Show that a + bi ≡ a + b mod (1 + i) in Z[i].
4.9. Prove Proposition 4.1.
4.10. Prove Proposition 4.3.
4.11. Show that a | b in Z implies a | b in the ring of integers Ok in a quadratic
number field k.
4.12. Show that the set of units R × in some ring R is a group with respect to
multiplication.
4.13. Show that if R = K is a field, then K × = K \ {0}.
4.14. If R is a domain and R[X] the ring of polynomials in one variable X with
coefficients from R, then R[X]× = R × , that is, the units in this polynomial
ring are all constant.
Show, on the other hand, that the polynomial 2X + 1 in (Z/4Z)[X] is a
unit. √
4.15. Show that the unit groups of the domains R = Z[ m ] for m < −1 are given
by R × = {−1, +1}.
4.16. Let Ok be the ring of integers in a quadratic number field k, and let Ek = Ok×
be its unit group. Show that Ek is a Gal (k/Q)-module (see Exercise 2.16).
4.17. Show: If R is a domain containing Z, and if π is prime in R, then the smallest
natural number divisible by π in R is a prime number.
4.18. Show that Nα = 1 for α = 1+2i 1−2i ∈ Q(i), but that α is not a unit in Z[i].
Construct infinitely many such examples.
4.19. Show that Z is Euclidean with respect to the absolute value.
4.20. Show that the polynomial ring K[x], where K is a field, is Euclidean with
respect to f (a) = 2deg a , where deg a denotes the degree of a ∈ K[x], and
where we have set deg 0 = −∞√ −∞ = 0.
√to have 2√ = 2
in order deg 0
√ √
4.21. Discuss
√ the examples 2 · 3√= − −6 · √ −6 in Z[ −6 √ ], 2 · 3 = 6 · 6 in
Z[ 6 ], and 2 · 7 = (2 + −10 )(2 − −10 ) in √Z[ 10 ] as in (1.12).
4.22. Consider the quadratic number field k = Q( m ); which of the rational
prime numbers p ∈ {2, 3, 5} in Ok with m ∈ {−5, −3, −2, −1, 2, 3, 5} are
irreducible and which are not?
4.23. Show that elements π ∈ Ok are irreducible if Nπ is a rational prime.
4.24. Let R be a unique factorization domain. Show:
a.gcd(a 2 , b 2 ) = (gcd(a, b))2 for all a, b ∈ R.
b.If gcd(a, b) = 1, then gcd(a 2 , b) = 1.
c.gcd(a + b, b) = gcd(a, b).
d.gcd(ra, rb) = r gcd(a, b).
√ √
4.25. Show that the elements a = 1 + −5 and b = 1 − −5 do not have a
common divisor except ±1, but that 2 is a common divisor of a 2 and b2.
104 4 Divisibility in Integral Domains
√
4.26. Let S be the domain you obtain by adjoining the element ω = 12 (1 + −5 )
√
to R = Z[ −5 ]. Show that S = R[ 12 ] and S ∩ Q = Z[ 12 ].
Show moreover that the decomposition (1.12) is not an example for
nonunique factorization into irreducible elements because 3 = 12 (1 −
√ √
−5 )(1+ −5 ) is a factorization of 3 into the unit 12 and the two irreducible
√
(and even prime)
√ √ elements 1 ± −5. Explain the equation 3 · 3 = (2 −
−5 )(2 + −5 ) by giving a factorization into irreducible elements. √
4.27. Solve √the Diophantine equation x 2 + 5y 2 = z2 by setting x + y −5 =
(r + s −5 )2 as Euler did, and show that the resulting parametrization x =
r 2 − 5s 2 , y = 2rs does not√yield all integral solutions of the equation.
Use the domain S = Z[ −5, 12 ] from the preceding exercise for obtaining
a complete parametrization of the solutions.
4.28. Prove Corollary 4.13. Hint: Try to obtain a = pa1 and b = pn−1 b1 , and then
apply Proposition 4.12 to a1 and b1 .
4.29. Determine all integral points on the elliptic curve 4y 2 = x 3 + 1, i.e., all pairs
(x, y) ∈ Z × Z satisfying this equation. √
4.30. Find all ring homomorphisms κ from Z[ −5 ] to Z/2Z, Z/3Z and Z/5Z,
and determine their kernels.
4.31. Show that the even integers 2Z form an ideal in Z. More generally, the sets
mZ for arbitrary m ∈ Z are ideals in Z.
4.32. Let (a) and (b) be principal ideals in some domain R. Show that a | b if and
only if (a) ⊇ (b). Show moreover that this implies the equivalence of the
following assertions:
a. (a) = (b);
b. a | b and b | a;
c. a = be for some unit e ∈ R × .
4.33. Show that the set
ab
T = : a, b, d ∈ Z
0d
4.37. Let k be a quadratic number field. Show that Z is a subring of Ok , but not an
ideal in Ok . √ √
4.38. Show√ that the set 2Z + 2 Z is an ideal √ in Z[ 2 ] consisting√of the multiples
of 2. Show moreover that Z + 2 2 Z is a subring of Z[ 2 ], but not an
ideal.
4.39. An order O in some quadratic number field is a subring of Ok that properly
contains Z. Consider the set F = {f ∈ Z : f ω ∈ O for all ω ∈ Ok }. Show
that F is an ideal in Z; the generator f > 0 of this ideal F = (f ) is called
the conductor of the order O. Show that the maximal order Ok has conductor
1.
4.40. Show that gcd(2, x) = 1 in the unique factorization domain Z[x] and that
there do not exist associated Bézout elements, i.e., that there do not exist
polynomials p, q ∈ Z[x] with 2p(x) + xq(x) = 1.
Is (2, x) a principal
√ ideal√in Z[x] or in Q[x]?
√
4.41. Find ideals in Z[ −6 ], Z[ −10 ], and Z[ 10 ] that are not principal.
4.42. Let R be the domain of all algebraic integers. Show that 2 does not
possess
√ √ a factorization
√ into irreducible elements. Also show that the ideal
(2, 2, 4 2, 8 2, . . .) is not principal in R and that it is not even finitely
generated (this means that it is not generated by finitely many elements, i.e.,
it does not have the form (a1 , . . . , an ) for suitable elements aj ∈ R).
4.43. Let R be a domain containing Z (for example, R = Ok ). Show that if a, b ∈ Z
are coprime in Z, then they are also coprime in R. (Hint: Bézout).
4.44. Compute the Bézout elements for gcd(21, 15) in Z.
4.45. For n ≥ 3, compute the greatest common divisor of the polynomials x n +
x 2 − 2 and x 2 − 1 in Z[x] (the result will depend on n). How can the result
that x − 1 is always a common divisor be verified in advance?
4.46. Let α, β ∈ Ok and (Nα, Nβ) = 1 in Z. Then gcd(α, β) ∼ 1 in Ok even if Ok
is not a unique factorization domain.
4.47. Bézout elements can be used for inverting residue classes. Assume for
example that a and m are coprime integers; show how to find the inverse
of the residue class a mod m in (Z/mZ)× (i.e., the element b ∈ Z such that
ab ≡ 1 mod m). Compute 12 mod 21 and 15 mod 33.
4.48. Study the equation y 2 = x 3 + 9 in integers.
4.49. Use the factorization (y − k)(y + k) = x 3 to deduce results on the integral
solutions of the Diophantine equation y 2 = x 3 + k 2 for a fixed integer k. This
106 4 Divisibility in Integral Domains
√
Although already Euler had used numbers of the form a + b −2 for solving the
Diophantine equation y 2 +2 = x 3 in integers, it was Gauss who laid the foundations
for the arithmetic of quadratic number rings such as Z[i] by defining prime elements
and units and proving unique factorization for the first number ring strictly larger
than the ordinary integers. He did so in his second memoir on biquadratic residues
published in 1831.
In this chapter we will discuss a few √examples of quadratic number rings. For
the example of the ring of integers in Q( 5 ), we refer to the dissertation by Dodd
[32]; more examples may be found in Sommer’s book [118] and in the still excellent
introduction to number theory by Hardy and Wright [53].
We will start our journey through various quadratic number rings with the ring Z[i]
of Gaussian integers.
Consider the domain R = Z[i]; we want to show that the norm is a Euclidean
function in R. To this end, we have to find, for each α ∈ R and each β ∈ R \ {0}, an
element γ ∈ R such that
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2021 107
F. Lemmermeyer, Quadratic Number Fields, Springer Undergraduate
Mathematics Series, https://doi.org/10.1007/978-3-030-78652-6_5
108 5 Arithmetic in Some Quadratic Number Fields
Since we are dealing with infinitely many pairs (α, β), this looks difficult. But if
we divide (5.1) by N(β) using the multiplicativity of the norm, we see that it is
sufficient to find some γ ∈ R for each ξ = α/β ∈ k = Q(i) such that
If we can find such a γ ∈ R for some ξ , then we can solve this inequality for any
ζ ∈ k that differs from ξ by an integer η ∈ R since
1 1 1
N(ξ − γ ) = N(ξ ) = x 2 + y 2 ≤ + = < 1.
4 4 2
Thus Z[i] is Euclidean with respect to the norm, and in particular it is a unique
factorization domain.
In the plane of Gaussian numbers, the elements ξ ∈ Q(i) with N(ξ ) ≤ 12 lie
√
inside a circle with radius 1/ 2 around the origin. If we place a circle with this
radius around each lattice point, i.e., around each point a + bi with a, b ∈ Z (see
Fig. 5.1), then these circles cover the whole plane. This implies that for each ξ ∈
Q(i), there is a γ ∈ Z[i] satisfying (5.2), and in fact we may even demand that
N(ξ − γ ) ≤ 12 .
This proof that Z[i] is Euclidean is constructive: We can use it for finding the
greatest common divisor of two Gaussian integers. In order to compute, e.g., gcd(1+
12i, 7 + 4i), we find the nearest integer to
which is 1+i, and we obtain, as the first step in the Euclidean algorithm, the equation
7 + 4i (7 + 4i)(2 + i)
= = −2 − 3i.
−2 + i (−2 + i)(2 + i)
Proposition 5.1 Let Ok be the ring of integers in some quadratic number field k.
Then for each prime π ∈ Ok , there is a unique prime number p ∈ N with π | p. In
particular, Nπ = ±p or Nπ = ±p2 .
Proof Since π | Nπ, we see that π divides the prime factor p of Nπ ∈ Z; if we also
had π | q for a different prime q = p, then π would divide gcd(p, q) = 1, which
is impossible since π is not a unit. The second claim follows easily from π | p by
taking the norm. In fact, we find Nπ | p2 in Z, and since Nπ = ±1 (otherwise π
would be a unit), we are left with the possibilities Nπ = ±p and Nπ = ±p2 .
Thus there exist the following possibilities:
(1) p is prime in Ok ; then Np = p2 , and we say that p is inert in k.
(2) p is not prime in Ok , but irreducible; this can only happen if Ok is not a unique
factorization domain.
(3) p is reducible Ok .
Let us have a close look at the third case. Here p = αβ for non-units α, β ∈ R.
It follows from NαNβ = p2 that Nα = Nβ = ±p (if one factor had norm 1, it
110 5 Arithmetic in Some Quadratic Number Fields
Quadratic Residues We will now briefly look at Fermat’s Little Theorem in the
domain Z[i] of Gaussian integers. It is easily checked that, for odd prime numbers
p, we have
a + bi mod p, if p ≡ 1 mod 4,
(a + bi) ≡ p
(5.3)
a − bi mod p, if p ≡ 3 mod 4.
α Nπ−1 ≡ 1 mod π.
For the proof of the second claim, we observe that (a + bi)p ≡ a − bi mod π,
2
and hence (a + bi)p ≡ a + bi mod p. If α = a + bi is not divisible by p, we are
allowed to cancel α in this congruence.
We also have, in analogy with the ordinary integers, the following proposition.
Proposition 5.6 (Euler’s Criterion) If π ∈ Z[i] is prime with odd norm and if
α ∈ Z[i] not divisible by π, then the following assertions are equivalent:
1. α is a quadratic residue modulo π, i.e., the congruence α ≡ ξ 2 mod π is solvable
with ξ ∈ Z[i];
2. The congruence α (Nπ−1)/2 ≡ 1 mod π holds.
This result allows us to define the quadratic residue symbol [ πα ] with values in
{±1} by the congruence
α
α (Nπ−1)/2 ≡ mod π.
π
Dirichlet has shown how to derive the quadratic reciprocity law in Z[i] first
formulated by Gauss from the reciprocity law in ordinary integers by a few simple
calculations (see [77]).
Theorem 5.7 (Quadratic Reciprocity Law) If π and λ are non-associated primes
with odd norm in Z[i], and if π ≡ λ ≡ 1 mod 2, then
λ π
= ,
π λ
The generalization of the quadratic reciprocity law to general quadratic number
fields is quite technical but can be done in a similar way. The generalization to
arbitrary number fields, on the other hand, requires much deeper means and leads,
as Hilbert has shown, directly to class field theory; see for example the last chapter
in Hecke’s introduction to algebraic number theory [60].
112 5 Arithmetic in Some Quadratic Number Fields
Certain Fermat equations do have solutions in quadratic number fields; we can easily
verify, for example, that
1 + √−7 4 1 − √−7 4 √
+ = 14 in Z[ 1+ 2 −7 ],
2 2
9 + √−31 3 9 − √−31 3 √
−31
+ = (−3)3 in Z[ 1+ 2 ],
2 2
√ 3 √ √ √
(5 − 9 5 ) + (12 5 )3 = (5 + 9 5 )3 in Z[ 5 ].
we will prove these claims below. In this section we will solve (following Hilbert
[61]) the Fermat equation with exponent 4.
Theorem 5.8 The equation α 4 + β 4 = γ 2 has only trivial solutions in Z[i].
If a + bi ∈ Z[i] has odd norm, then a and b have different parity, and hence
a 2 −b2 ≡ ±1 mod 4 and 2ab ≡ 0 mod 4; this implies (a +bi)2 = a 2 −b2 +2abi ≡
±1 mod 4. If the elements α, β ∈ Z[i] have odd norm, then α 4 + β 4 ≡ 2 mod 4,
but 2 is not a square modulo 4 (the only squares modulo 4 are 0, 2i, and ±1). Thus
we may assume that β has even norm.
We will now show that if the equation α 4 + ελ4n β 4 = γ 2 is solvable, where
ε ∈ {±1, ±i} is a unit, λ = 1 + i, and where β is not divisible by 1 + i, then
(1) n ≥ 2;
(2) there exist α1 , β1 , γ1 ∈ Z[i] and a unit ε1 with
α 4 + ελ4 β 4 = γ 2
ελ4n β 4 = γ 2 − α 4 = (γ − α 2 )(γ + α 2 )
γ + β 2 = ηλ4n−2 ζ 4 , γ − β 2 = η λ2 ξ 4
2β 2 = ηλ4n−2 ζ 4 − η λ2 ξ 4 .
β 2 = ηiλ4(n−1) ζ 4 − η iξ 4 .
η1 λ4(n−1) ζ 4 + ξ 4 = β 2 .
√
1
2 ((2x − a) + (2y − b) −3 ); clearly, we can choose b ∈ Z in such a way that
|2y − b| ≤ 12 . Next we have to determine an integer a ∈ Z with a ≡ b mod 2 in
such a way that |2x −a| becomes small. By choosing a from the integers ≡ b mod 2,
we can make |2x − a| ≤ 1 (the nearest integer with given parity has at most distance
1 from 2x). But then N(ξ − γ ) ≤ 14 (1 + 34 ) = 16
7
< 1.
√
In√the diagram in Fig. 5.2, each number x + y −3 corresponds to the point
(x, y 3 ) in R2 ; the domain Z[ρ] then corresponds
√ to a 2-dimensional lattice,
and we have drawn a circle with radius 1/ 3 around each lattice point. These
7
circles cover the whole plane, and hence the constant 16 in the proof above may
be improved to 3 , and we can always find an integral γ such that N(ξ − γ ) ≤ 13 .
1
α = a + bρ −α = −a − bρ
αρ = −b + (a − b)ρ −αρ = b + (b − a)ρ .
αρ 2 = b − a − aρ −αρ 2 = a − b + aρ
√ √
Now −3 = ρ − ρ 2 is a prime element with 3 = −( −3 )2 , whereas λ2 = −3ρ
for the element λ = 1 − ρ.
When is an element α = a + bρ not divisible by λ? Since α = a + bρ =
a + b − b(1 − ρ) ≡ a + b mod λ, this is true if and only if a + b ≡ 0 mod 3. In
5.2 The Eisenstein Integers 115
this case one of the three numbers a, b, or a − b is divisible by 3, and the list above
shows that there is an associate of α whose coefficient of ρ is divisible by 3.
√
Proposition 5.9 If α ∈ Z[ρ] is not divisible by −3, then there is a t ∈ {0, 1, 2}
such that ρ t α = a + bρ with b ≡ 0 mod 3.
α Nπ−1 ≡ 1 mod π
for all α ∈ Z[ρ] not divisible by the prime π. In particular, we observe that if π = 2;
then
α 3 ≡ 1 mod 2
Euler has given a proof of Fermat’s claim that the Diophantine equation
x 3 + y 3 + z3 = 0 (5.4)
has only trivial solutions (those with xyz = 0) in integers. In this proof he has used
properties of numbers of the form c2 + 3d 2 for which he did not give complete
proofs.2 The first proof of Fermat’s Last Theorem for cubes using the arithmetic of
Z[ρ] was given by Gauss, who showed the stronger result that Eq. (5.4) does not
have nontrivial solutions in the larger ring Z[ρ]. In the following, we will give a
rigorous version of Euler’s proof using the methods of Gauss. The idea behind the
proof goes back to Fermat, who called his method infinite descent.
For proving that a Diophantine equation in many variables x, y, z, . . . does not
have a solution in integers, assume you do have such a solution (x, y, z, . . .), and
then show that for each such solution there is a smaller solution (u, v, w, . . .)
(smaller in the sense that e.g. |u| < |x|). Since natural numbers cannot decrease
indefinitely, this results in a contradiction. For a simple application of this technique,
see Exercise 5.7.
Theorem 5.12 The Diophantine equation x 3 +y 3 +z3 = 0 has only trivial solutions
in integers.
Instead of proving this theorem for x, y, z ∈ Z, we show (as Gauss) that the
cubic Fermat equation does not have a solution in the domain Z[ρ]. This follows
immediately by setting α = x 3 , β = y 3 , and γ = z3 in the following theorem,
whose smooth formulation I learned from Paul Monsky3.
2 The gap is the one that we have pointed out in Chap. 1, namely the missing proof for the
decomposition theorem for numbers of the form x 2 + 3y 2 : If c2 + 3d 2 = r 3 , then there exist
integers p and q with c = p(p2 − 9q 2 ) and d = 3q(p2 − q 2 ); see [10].
3 See https://mathoverflow.net/questions/39561.
5.2 The Eisenstein Integers 117
pairwise coprime. Any solution (x, y, z) ∈ Z3 \ {(0, 0, 0)} gives rise to infinitely
many non-primitive solutions (kx, ky, kz) for k ∈ Z.
Another result can be found in the book [14] on elliptic curves by J.W.S.Cassels.
There you can find a sketch of a proof that the equation x 3 + y 3 = q1 q2 z3 , where
q1 ≡ 2 mod 9 and q2 ≡ 5 mod 9 are prime, has only trivial solutions. It is natural
to ask how this is connected to elliptic curves. In fact, the cubic Fermat equation
x 3 + y 3 = z3 is an elliptic curve: Dividing through by z and setting r = x/z,
s = y/z, we obtain r 3 +s 3 = 1; with r = u+v and s = u−v, we get 2u3 +6uv 2 = 1,
and hence 2 + 6(v/u)2 = 1/u3 . Multiplying through by 63 and setting Y = 36v/u,
X = 6/u, we obtain the equation of an elliptic curve Y 2 = X3 − 432 in the well-
known Weierstrass form.
It is known since Euclid that there is no largest prime number; nevertheless, there
usually is a largest known prime number, mainly because there is no simple formula
for computing arbitrarily large primes. Fermat once believed to have found such a
n
formula: He conjectured that all numbers of the form Fn = 22 + 1 are prime (and
in fact he almost believed to have a proof). Euler later showed that F5 = 232 + 1 =
641 · 6700417 is composite, and in fact, no other Fermat prime beyond F4 has been
discovered until now. For quite a few years now, the largest known prime number
always has been a prime number of the form 2p −1, where p is prime; such numbers
are called Mersenne numbers.
Marin Mersenne (1588–1648) was a French priest. He corresponded with most
mathematicians and many scientists of his time, in particular with Fermat. He is
known for his conjecture that p = 2, 3, 5, 7, 13, 17, 19, 31, 67, 127, and 257 (these
are all numbers that differ at most by 3 from a power of 2) are the only prime
numbers ≤ 257 for which 2p − 1 is prime. Later it was shown that the Mersenne
numbers for p = 67 and p = 257 are composite and that p = 61, 89, and
107 give rise to primes. The smallest Mersenne number not completely factored
today is M1207; this number has the prime factors 131 071, 228 479, 48 544 121, and
212 885 833, and the remaining factor is a composite number with 337 digits.
It is easy to show that 2p − 1 is composite if p is. This follows from the fact that
2 − 1 always divides 2ab − 1 since
a
The reason why the largest known prime is usually a Mersenne prime is that
there is a very effective primality test for such numbers developed by Édouard Lucas
(1842–1891) and Derrick Lehmer (1905–1991). In fact, the number Mp = 2p − 1
(where p ≥ 3) is prime if and only if Sp−1 ≡ 0 mod Mp , where the sequence Sn is
5.3 The Lucas–Lehmer Test 119
defined recursively by S1 = 4 and Sn+1 = Sn2 − 2. Using this test, Lucas was able
to show that 2127 − 1 is prime.
Example Let p = 5; then M5 = 31, and we find
S1 = 4
S2 = 14
S3 = 194 ≡ 8 mod 31
S4 ≡ 62 ≡ 0 mod 31,
√
5.3.1 The Arithmetic in Z[ 3 ]
√
We begin
√ by showing √ that R = Z[ 3 ] is norm-Euclidean.
√ To this end, let ξ =
x + y 3 ∈ k = Q( 3 ) be given, and choose α = a + b 3 ∈ R in such a way that
|x − a| < 12 and |y − b| ≤ 12 . Then,
3
|N(ξ − α)| = |(x − a)2 − 3(y − b)2 | ≤ since
4
1
(x − a)2 − 3(y − b)2 ≤ (x − a)2 ≤ and
4
3
(x − a)2 − 3(y − b)2 ≥ −3(y − b)2 ≥ − .
4
In particular, R is norm-Euclidean.
120 5 Arithmetic in Some Quadratic Number Fields
Proposition 5.15 Let q be a rational prime number that is inert in R. Then R/qR
is a finite field with q 2 elements.
Proof Clearly, the residue class 2
√ ring modulo qR contains at most q elements since
each integral
√ element a + b 3 is congruent modulo qR to one of the elements of
{r + s 3 : 0 ≤ r, s ≤ q − 1}. Moreover, it is easily seen that the elements of
this set are pairwise incongruent modulo q, which implies that the residue class
ring does indeed have q 2 elements. Finally, R/qR does not have any zero divisors:
αβ ≡ 0 mod qR implies, since q is prime, that α ≡ 0 mod qR or β ≡ 0 mod qR.
It is therefore sufficient to show that finite rings without zero divisors are fields.
All we have to do is show the existence of inverses. Assume therefore that A is a
finite domain and that a = 0. Since A is finite, the sequence a, a 2 , . . . , a m must
contain two equal elements, say a i = a j for some i < j . Since A is a domain, we
may cancel i, which gives us a j −i = 1. But then a j −i−1 is an inverse of a.
√
Fig. 5.3 The domain Z[ 3 ] is norm-Euclidean
5.3 The Lucas–Lehmer Test 121
√ p m √
(2α)p ≡ a p + b p m ≡ a + b m mod p
p
√ p √
since a p ≡ a mod p and m = m(p−1)/2 m. The claim now follows from 2p ≡
2 mod p.
Assume conversely that Sp−1 ≡ 0 mod q; then ω(q+1)/2 ≡ −1 mod qR. Since
q+1
2 = 2p−1 is a power of 2, q+1
2 must be the smallest exponent n > 0 for which
ωn ≡ −1 mod qR. On the other hand, for each prime divisor | q, the same
congruence ω(q+1)/2 ≡ −1 mod R holds, and again the exponent q+1 2 is minimal.
Now either ω+1 ≡ 1 mod R or ω−1 ≡ 1 mod R by Proposition 5.17, i.e., we
have − 1 ≥ 2 q+1
2 = q + 1 or + 1 ≥ 2q + 1. The first case is impossible, the
second shows that ≥ q, and hence all divisors of q are ≥ q. But then q is prime.
Dirichlet’s first mathematical result was his proof [29] that the quintic Fermat
equation x 5 + y 5 = z5 has only the trivial solutions with xyz = 0 in integers.
Legendre completed Dirichlet’s proof before Dirichlet did but did not deal properly
with the issues of unique factorization and units. In the following, we will give a
streamlined version of Dirichlet’s proof using ideas due to J. Plemelj [103] and L.
Tschakaloff [125].
5.4 Fermat’s Last Theorem for the Exponent 5 123
√ √
First we observe that the ring of integers in Q( 5 ) is Z[ω] with ω = 1+ 5
2 .
Proposition 5.18 The ring Z[ω] is Euclidean with respect to the absolute value of
the norm. In particular, it is a unique factorization domain.
√
As before we need to show that for every ξ ∈ Q( 5 ), there is an element γ ∈
Z[ω] with |N(ξ √ − γ )| < 1. By symmetry, it is enough to prove the existence of γ
for ξ = x + y 5 with 0 ≤ x, y ≤ 12 . But for such ξ , the value γ = 1 always works
with the single exception ( 12 , 12 ), for which we can pick γ = 0 (see Fig. 5.4).
We also see that −1 and ω are units in Z[ω]; it will follow from the results in the
next chapter that every unit is, up to sign, a power of ω.
Assume now that x 5 + y 5 = z5 for nonzero integers. Classical proofs of Fermat’s
Last Theorem for exponents ≥ 5 are usually divided into the first and the second
case. The first case, where one of x, y, or z is divisible by p, is a lot easier to prove
than the second case.
Observe that x 5 ≡ ±1, ±7 mod 25 if 5 x. This shows that the only solutions of
x + y 5 ≡ z5 mod 25 in rational integers are those in which 5 | xyz.
5
n 1 2 3 4 5
ωn ω 1+ω 1 + 2ω 2 + 3ω 3 + 5ω
shows that λ | (x + y); therefore, the expression in the square brackets is divisible
exactly by λ2 , and hence x + y must be divisible by λ3 . Since any common divisor
divides 5x 2y 2 , we conclude that the greatest common divisor of both factors is λ2 =
5.
Next we multiply x 5 + y 5 = z5 with a suitable power of ω5 to make sure that
x ≡ 1 mod 5 and therefore y ≡ −1 mod 5. Next
x 5 + y 5 = (x + y)(x 4 − x 3 y + x 2 y 2 − xy 3 + y 4 )
= (x + y)[(x + y)4 − 5xy(x 2 + xy + y 2 )]
= (x + y)[(x + y)4 − 5xy(x + y)2 + 5x 2 y 2 ]
= (x + y)[λxy − ω(x + y)2 ][λxy + ω (x + y)2 ].
5.4 Fermat’s Last Theorem for the Exponent 5 125
x + y = ε1 λ3 γ 5 ,
λxy − ω(x + y)2 = ε2 λα 5 ,
λxy + ω (x + y)2 = ε3 λβ 5 .
xy − ω x+y
λ = ε2 α .
5
x + y = λ3 γ 5 , (5.5)
λxy − ω(x + y)2 = λα 5 , (5.6)
λxy + ω (x + y) = λβ .
2 5
(5.7)
with αβγ = z.
Thus we have obtained a new solution to the quintic Fermat equation in which γ
has fewer distinct prime factors than z except when α and β are units (observe that
they are coprime to λ). But this is impossible, as we will show now.
In fact, dividing (5.8) through by α 5 , we find an equation 1 ± ω5k = γ 5 for
some γ divisible by λ. Since ω5k ≡ ±1 mod 5, the integer k is even. Dividing our
equation by its conjugate and using
1 + ω5k 1 + ω5k
= ω5k = ω5k ,
1+ω 5k ω5k + (−1)5k
we obtain
γ 5
ω5k = ± ,
γ
126 5 Arithmetic in Some Quadratic Number Fields
and this shows that γ = ±ωk γ . Thus γ and γ have the same prime factorization,
and hence γ is a product of a power of the ramified prime element λ and a rational
integer. Since λ2 = 5, we have γ = a or γ = λa for a rational integer a.
Since 1 ± ω5k = a 5 immediately implies k = 0 and a = 0, we must have 1 ±
ω = λ5j a 5 for some odd integer j . Taking the trace of 1±(F5k−1 +F5k ω) = λ5j a 5
5k
m = 2, 3, 5, 6, 7, 11, 13, 17, 19, 21, 29, 33, 37, 41, 57, 73.
The full proof of this result is very technical; we now present a clever idea
for proving that several rings with small discriminant are norm-Euclidean due to
Oppenheim [101]. More geometric√ proofs can be found in [34].
Assume first that K = Q( m ) with m ≡ 2, 3 mod 4. The ring OK is norm-
√ y) ∈ Q × Q, there exists a pair of integers (a, b) such that
Euclidean√if for every (x,
|N(x + y m − (a + b m ))| < 1, i.e., with
We now assume that OK is not norm-Euclidean. We will show that this implies
m ≥ 8, and then it will follow that the rings with m ≤ 7 are norm-Euclidean.
4 This result is due to Theodore Motzkin (1908–1970) [98]. It can be proved quite easily and has
is true for all pairs of integers (a, b). We will consider the following set of
inequalities:
Now clearly P (0, 0) is false; therefore, N(0, 0) must be true. But then P (1, 0) is
false, and hence N(1, 0) must hold. Next P (−1, 0) and N(1, 0) imply (1 + x)2 ≥
2 + (1 − x)2 , hence 4x ≥ 2, x = 12 , and my 2 = 54 , and hence y is irrational. Thus
P (−1, 0) is false, and N(−1, 0) must hold. But now my 2 ≥ 1 + (1 + x)2 ≥ 2
implies m ≥ 8.
Therefore the Euclidean algorithm holds for all m < 8, i.e., for m = 2, 3, 5, 6, 7.
The very same proof works for fields with odd discriminant if we replace the
inequality (5.9) by
|(x − b
2 − a)2 − m
4 (y − b)2| < 1,
√
and the result is that Z[ 1+2 m ] is norm-Euclidean if m < 32, i.e., for m =
5, 13, 17, 21, 29.
For real quadratic number fields, there are fields that are Euclidean but not norm-
Euclidean, and in fact it is expected (and can be proved by assuming the truth of
the Generalized Riemann Hypothesis) that all number fields whose ring of integers
is a unique factorization domain are Euclidean for a suitable function (with the
exception of complex quadratic fields). The first number field √ that was known not
to be norm-Euclidean and was shown to be Euclidean is Q( 69 ); nowadays, many
examples are known; see, e.g., Harper [54, 55].
The fact that the domains listed in Theorem 5.22 are norm-Euclidean can
nowadays be done by computer (as we have seen, it is possible to do this by hand
for small values of the discriminant). The proof that the other fields are not norm-
Euclidean is much more technical. The heart of the proof is an article by Davenport,
which uses the language of quadratic forms; see [49].
128 5 Arithmetic in Some Quadratic Number Fields
Then Ok is a unique factorization domain if and only if for all prime numbers p <
Mk with (Δ/p) = −1, there exist elements π ∈ Ok with |Nπ| = p.
Using this result, we can quickly verify that there are nine complex quadratic
number fields whose ring of integers has unique factorization, namely those with
discriminants
5 The Dedekind–Hasse criterion was published by Helmut Hasse [57]. Emmy Noether later found
this criterion among Dedekind’s papers when she edited his collected works [28]; see also [90,
Anm. 1, S. 60].
6 Gauss formulated this conjecture for class numbers of binary quadratic forms with even middle
coefficients.
5.6 Quadratic Unique Factorization Domains 129
For most quadratic number fields, we can often decide that the ring of integers
does not have unique factorization just by looking at its discriminant. We will later
explain our partial results here by more advanced techniques such as the ambiguous
class number formula.
In this section we will prove that the ring of integers in a quadratic number field
cannot have unique factorization if its discriminant Δ has more than two distinct
prime factors. The proof we will give is due to Laszlo Rédei [106]; in Chap. 9 we
will derive this observation as a corollary of the much more general theorem on the
structure of the ideal class group.
√
Theorem 5.25 Let k = Q( m ) be a quadratic number field whose ring of integers
Ok is a unique factorization domain, and let Δ = disc k denote its discriminant,
If Δ < 0, then Δ = −4, −8 or Δ = −q for some positive prime number
q ≡ 3 mod 4.
If Δ > 0, then Δ is either a prime discriminant or the product of two negative
prime discriminants:
⎧
⎪
⎪ p ≡ 1, 3 mod 4 prime,
⎨p,
m = 2q, q ≡ 3 mod 4 prime,
⎪
⎪
⎩pq, p ≡ q ≡ 3 mod 4 prime.
Assume therefore that εp is not a square for all p | Δ. Since every positive unit
is a power of the fundamental unit ε, it follows that pε is a square for each prime
p | Δ. If p and q are such primes, then pqε2 and therefore also pq is a square in k,
√
and hence k = Q( pq ).
Thus it remains to show that we cannot have p ≡ q ≡ 1 mod 4 or p ≡ 1 mod 4
and q = 2. In both cases, pq = a 2 + b 2 would be a sum of two squares, and we may
√
assume that a is odd. We set ρ = b + pq and ω = gcd(a, ρ). Observe that either
b is odd and pq is even, or b is even and pq is odd; this implies that gcd(ρ, 2) = 1.
Next
observe that the gcd has the usual properties since Ok has unique factorization. Thus
ω2 /ρ is a unit with negative norm: a contradiction.
In his letter from September 28, 1743, Goldbach pointed out to Euler that the
quadratic polynomial f (x) = x 2 + 19x − 19 represents many prime numbers for
small values of x (the first composite number is f (19) = 19 · 37). Some time
later, Euler gave the polynomial n(x) = x 2 − x + 41, which represents prime
numbers for 0 ≤ x ≤ 40. The discriminant of Euler’s polynomial
√ is Δ = −163,
and the ring of integers of the quadratic number field Q( −163 ) is, as we will
show in the next chapter, a unique factorization domain. Georg Frobenius [42] and
Juri Rabinowitsch [105] have discovered that this is no coincidence, and today the
mathematical literature knows hundreds if not thousands of publications (see, e.g.,
[7, 120] and Paulo Ribenboim’s book [109], to mention but three) that deal with this
topic. We claim:
√
Theorem 5.27 The ring of integers in the quadratic number field Q( −p ), where
p ≡ 3 mod 8 is a prime number, is a unique factorization domain if and only if the
polynomial n(x) = x 2 + x + p+1 p−3
4 attains prime values for 0 ≤ x < 4 .
We start with the following remark:
Lemma 5.28 If k is a complex quadratic number field with discriminant Δ ≤ −11
for which the ring of integers is a unique factorization domain, then Δ = −p is
prime and we have p ≡ 3 mod 8.
Proof We already know that Δ = −p must be a prime discriminant, and thus we
have p ≡ 3 mod 4. If we had p ≡ 7 mod 8, then 2 in Ok cannot prime since 2 |
5.6 Quadratic Unique Factorization Domains 131
√ √
1+p 1− −p 1+ −p
4 = 2 · 2 divides a product without dividing one of the factors. Since
Ok is a unique factorization domain, 2 must be reducible,
√
and this is only possible
x+y −p x 2 +py 2
≥ x +11y
2 2
if Ok contains an element of norm 2. Since N( 2 ) = 4 4 > 2,
this is impossible.
If there is a composite number occurring among the values of the polynomial
n(x) for 0 ≤ x < p−3 4 , then this number is divisible by a prime number q. This
prime q is odd since n(x) = x 2 + x + p+1
4 ≡ x + x + 1 ≡ 1 mod 2 for all integers
2
5.6.2 Summary
In this chapter we have discussed a few minor applications of the theory of quadratic
number fields. In particular, we have shown that Z[i] and Z[ρ] are norm-Euclidean
domains and that the decomposition of primes p in these rings is connected with the
representations of p in the form x 2 + y 2 and x 2 + 3y 2 , respectively.
132 5 Arithmetic in Some Quadratic Number Fields
5.7 Exercises
5.1. Determine gcd(26−29i, 13+4i) using the Euclidean algorithm in Z[i]. Verify
the result using the prime factorization of these numbers. Also determine the
corresponding Bézout elements.
5.2. Let p = a 2 +b2 be an odd prime number. Show that a and b can be computed
from a solution of the congruence x 2 ≡ −1 mod p by applying the Euclidean
algorithm to the numbers p and x + i in Z[i].
This consequence of the fact that Z[i] is Euclidean can of course be
generalized. How would you prove that each positive prime number p ≡
1, 3 mod 8 can be written in the form p = c2 + 2d 2 ?
5.3. Show that both {0, ±1, ±i} and {0, 1, 2, 3, 4} are complete systems of
residues modulo 1 + 2i in Z[i].
5.4. Show that the associates of a +bi ∈ Z[i] are given by ±(a +bi), ±(−b +ai).
5.5. Show that for α ∈ Z[i], the following assertions are equivalent:
1. (1 + i) α.
2. Nα is odd.
3. Nα ≡ 1 mod 4.
4. α has an associate of the form a + bi with a − 1 ≡ b ≡ 0 mod 2.
5. α has an associate congruent to 1 mod (2 + 2i).
5.6. Solve the Pythagorean equation x 2 + y 2 = z2 by factoring the left side and
using the arithmetic of Z[i].
5.7. Use infinite descent to show that the equation x 3 + 3y 3 + 9z3 = 0 has only
the trivial solution (0, 0, 0) and generalize.
5.8. Compute the quadratic residue symbols [ 1+2i3+2i ], [ 3+2i ] and [ 1+4i ].
1+4i 1+2i
5.9. In the following, we prove the quadratic reciprocity law in Z[i] using an idea
of Dirichlet.
1. Show by comparing the definitions of the quadratic residue symbols in Z
and Z[i] that for primes π ∈ Z[i] with odd prime norm p and a ∈ Z we
always have [ πa ] = ( pa ).
Show next that [ qa ] = 1 for all a ∈ Z not divisible by q ≡ 3 mod 4.
2. If π = a + bi ≡ 1 mod 2 is prime, then [ a+bi a
] = 1.
3. Let π = a + bi and λ = c + di be primes ≡ 1 mod 2 with norms Nπ = p
and λ = q. Use the congruences ci ≡ d mod (c + di) and ( qc ) = 1 for
c+di ] = ( q ).
proving [ a+bi ac+bd
pq ) = 1.
4. Use the quadratic reciprocity law in Z for verifying that ( ac+bd
5. Prove the quadratic reciprocity law in Z[i].
5.10. Show using Theorem 5.24 that Ok is a unique factorization domain for Δ =
−19, −43, −67, −163.
5.7 Exercises 133
is Euclidean with respect to the norm√ Nu defined by taking the maximal odd
factor
√ of the usual norm N(x + y −5 ) = x 2 + 5y 2 ; for example, N (1 +
u
−5 ) = 3.
Show that the unit group of R is R × = −1, 2 and thus is isomorphic as
an abelian group to Z/2Z × Z.
The domains in this exercise are called rings of S-integers (in our case we
had S = {2}). These domains are occasionally used if one would like to apply
theorems that hold for principal ideal domains. The price one has to pay is
a larger unit group, which usually outweighs the advantage of having unique
factorization in almost all number theoretic problems.
5.13. Find the prime factorizations of 7, 13, and 19 in Z[ρ].
5.14. Show that for each α ∈ Z[ρ], we have α 3 ≡ 0, 1 mod 2.
5.15. The integral solutions of the equation y 2 = x 3 + 24 are (1, 5), (−2, 4),
(10, 32), and (8 158, 736 844). How close to this result√can you come by
factoring y 2 − 24 = x 3 in the quadratic number field Q( 6 )?
5.16. (See [87]) Consider Goldbach’s polynomial f (x) = x 2 + 19x − 19, and show
that it represents infinitely many composite integers by verifying the identity
Show more generally that f ((x +f (x)) is, for any polynomial f with integral
coefficients and degree n, the product of f (x) and another polynomial with
integral coefficients.
5.17. Show that the only integral solutions of y 2 + 1 = 2x 3 are (x, y) = (1, ±1).
5.18. In his proof of the first case of Fermat’s Last Theorem for the exponent 5,
Gauss considered the equation x 5 + y 5 + z5 = 0 and set y + z = a, z + x = b
and x + y = c. Show that this implies
(a + b + c)5 = 80abc(a 2 + b 2 + c2 ).
134 5 Arithmetic in Some Quadratic Number Fields
Verify this identity and derive the first case of Fermat’s Last Theorem for the
exponent 5.
5.19. (Werebrussow) Let φ(x, y) = x 2 + xy − y 2 . Verify the identity
x 5 + y 5 = (x + y)φ(x 2 − xy + y 2 , x 2 − 2xy + y 2 ).
ωn − ω n
Un = ,
ω−ω
√ √ √
where ω = 1+2 5 and ω = 1−2 5 ; in particular, ω − ω = 5.
Now prove the following congruences: For primes p with ( p5 ) = +1, we
have
5
Up−1 ≡ 0, Up ≡ 1, Up+1 ≡ 1 mod p if = +1,
p
5
Up−1 ≡ 1, Up ≡ −1, Up+1 ≡ 0 mod p if = −1.
p
Chapter 6
Ideals in Quadratic Number Fields
In this chapter we will show how to work with ideals in quadratic number rings and
how they can be applied to number theoretical problems.
6.1 Motivation
is an example
√ of nonunique factorization into irreducible elements in the domain
R = Z[ −5 ]. This factorization also provides a counterexample to other results
that hold in unique factorization domains. We know, for example, that in unique
factorization domains,
√ it follows from gcd(a, c) = 1 that gcd(a 2 , c) = 1. In our
case, 2 and 1 + −5 √ are both irreducible;
√since they are not associate,√ they must
be coprime. Yet (1 + −5 )2 = −4 + 2 −5 shows that gcd((1 + −5 )2 , 2) is
nontrivial
√ since 2 is a common divisor.
If Z[ −5] were a √ principal ideal domain, we could write down such √ a factor
immediately:
√ (2, 1 + −5 ) = (α) would imply α ∼ gcd(2,
√ 1 + −5 ). But
Z[ −5] is not a principal ideal domain, and the ideal
√ (2, 1 + −5 ) is not principal.
Dedekind’s idea was to regard√the ideal (2, 1 + −5 ) as the “correct” greatest
common divisor of 2 and (1 + −5 ). The introduction of such “ideal” factors then
allows us to replace the non-existent unique factorization into elements by a unique
factorization into prime ideals.
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2021 135
F. Lemmermeyer, Quadratic Number Fields, Springer Undergraduate
Mathematics Series, https://doi.org/10.1007/978-3-030-78652-6_6
136 6 Ideals in Quadratic Number Fields
Already in the first chapter, we have pointed out that Kummer’s ideal numbers
may be interpreted as ring homomorphisms Ok −→ Fq from the ring of integers
of a (quadratic) number field to finite fields and that Dedekind replaced these
homomorphisms by their kernels. In this chapter we will have a closer look at the
situation.
We may, for example, study the domain Z[i] by looking at ring homomorphisms
from this ring to finite fields Fp = Z/pZ. Recall that ring homomorphisms f :
R −→ S satisfy f (r1 + r2 ) = f (r1 ) + f (r2 ) and f (r1 r2 ) = f (r1 )f (r2 ) and must
have the property that the unit element of R is mapped to the unit element of S.
Each ring homomorphism κ2 : Z[i] −→ Z/2Z satisfies, by definition, the
equation κ2 (1) = 1 + 2Z. Since κ2 (i)2 = κ2 (i 2 ) = κ2 (−1) = −1 + 2Z = 1 + 2Z,
we have κ2 (i) = 1 + 2Z. Thus,
The ideal numbers that Kummer introduced in cyclotomic number fields are
essentially such ring homomorphisms; for Kummer, they were procedures for
attaching residue classes to elements. Each such ring homomorphism possesses a
kernel, and kernels of ring homomorphisms today are called ideals. If f : R −→ S
is a ring homomorphism, then its kernel I has the following properties:
6.1 Motivation 137
Proposition 6.1 Let k be a quadratic number field with ring of integers Ok . For
α, β ∈ Ok , the following assertions are equivalent:
1. α | β;
2. (α) ⊇ (β).
138 6 Ideals in Quadratic Number Fields
AB = {α1 β1 + . . . + αm βm : αj ∈ A, βj ∈ B},
For every ideal a in quadratic number rings, we can define the conjugate ideal
aσ = a (here σ is the nontrivial automorphism of k/Q), which consists of all
6.1 Motivation 139
The calculation of the products ab, ac, and c2 is done in Exercise 6.3.
Let us have another look at the equation
√ √
2 · 3 = (1 + −5 )(1 − −5 ).
22 + 5 · 12 = 32 ,
22 + 5 · 32 = 72 ,
222 + 5 · 32 = 232,
382 + 5 · 92 = 432,
22 + 5 · 212 = 472 .
So far we have only discussed properties of ideals that hold in general domains.
From now on we will exploit the fact that ideals in rings of integers of algebraic
number fields have additional properties. For deriving them, it is useful to introduce
the concept of Z-modules. In arbitrary domains R, a Z-module in R is just an
additive subgroup of R.
where {1, ω} is an integral basis of Ok . The Z-module coeff(M) does not depend on
the choice of ω: Replacing ω by ω1 = ω − r for an arbitrary integer r clearly leaves
coeff(M) invariant.
The following table displays the Z-modules M ∩ √ Z and coeff(M) for a few
choices of Z-modules M in a quadratic number ring Z[ m ]:
√
M Z Z[ m ] Ok 5Z ⊕ (1 + 2i )Z
M ∩Z Z 0 Z 5Z
coeff(M) 0 Z Z 2Z
is exact. This means little more than that there are maps ι : M ∩Z −→ M (inclusion)
and coeff : M −→ coeff(M) (projection) with im ι = ker coeff.
Those familiar with the concept of direct sums in group theory will know that for
writing M as a direct sum of two modules of rank 1, one needs a lift coeff(M) −→
M. Clearly, given an element m ∈ coeff(M), there is an element a + bω ∈ M; this
“lift” will occur in our proofs below, which do not use the exact sequence above or
any other fancy tools from commutative algebra.
We now prove the following “basis theorem” for Z-modules of Ok :
This proposition claims that each such Z-module possesses a Z-basis; thus Z-
modules in Ok behave as subspaces of a vector space. The number of elements of a
basis does not depend on the choice of the basis and is called the rank
√ of the module.
For example, the module M = (0) has rank 0, the modules Z and m · Z have rank
1, and Ok has rank 2. Clearly M = nZ ⊕ (a + mω)Z has rank 2 if and only if
mn = 0.
Proof of Proposition 6.5 Write M ∩ Z = nZ and coeff(M) = mZ for integers
m, n ∈ N0 . By construction there is an integer a ∈ Z with a + mω ∈ M; since we
may change a by a multiply of n, the integer a is only determined modulo n.
Next we verify that these integers have the desired properties. We have to show
that M = nZ ⊕ (a + mω)Z. The fact that M ⊇ nZ ⊕ (a + mω)Z is clear. Assume
therefore that r + sω ∈ M. Since s ∈ coeff(M), we conclude that s = um for some
u ∈ Z, and then r − ua = r + sω − u(a + mω) ∈ M ∩ Z, hence r − ua = vn. Now
we obtain r + sω = r − ua + u(a + mω) = vn + u(a + mω) ∈ nZ ⊕ (a + mω)Z.
Now let M be a Z-module in R = Ok . We consider the factor group R/M. This
group consists of all expressions of the form r+M with r ∈ R, where r+M = s+M
if and only if r − s ∈ M. This set becomes a group by setting (r + M) + (s + M) =
(r + s) + M. The idea behind computing with factor groups such as this one is doing
calculations in R and identifying elements that differ by an element in M.
The number of residue classes modulo M, i.e., the cardinality of the residue class
group R/M, is called the norm of the module M, and we write N(M) = (R : M).
The norm of a module M need not be finite, as the example R = Ok and M = Z
shows.
The importance of the numbers m and n in Proposition 6.5 is also emphasized by
our next result.
Proposition 6.6 Let M = nZ ⊕ (a + mω)Z be a Z-module of rank 2 in Ok . Then
S = {r + sω : r, s ∈ Z; 0 ≤ r < n, 0 ≤ s < m}
We have to show
(a) that each element of R is congruent modulo M to some element of S and
(b) that elements of S are congruent modulo M only if they are equal.
For proving the first claim, take an element x + yω ∈ R, and write y = mq + s
with 0 ≤ s < m and x − qa = np + r with 0 ≤ r < n. Then
(r + M) + (s + M) = r + s + M.
What could prevent us from defining the product of these residue classes by
(r + M) · (s + M) = rs + M?
All we have to do is verify that our product is well defined. To this end, we replace
s by s + m for some m ∈ M; then
(r + M) · (s + m + M) = rs + rm + M,
Proof We first show that nZ = a ∩ Z ⊆ coeff(a) = mZ, which then implies (to
contain is to divide) that m | n. To this end assume that c ∈ a ∩ Z; then cω ∈ a, and
by definition of the coefficient module, we conclude that c ∈ coeff(a).
For showing that m | a, we observe that ω2 = x + yω for integers x, y ∈ Z since
{1, ω} is an integral basis. Now a is an ideal; thus if it contains a + mω, it will also
contain (a + mω)ω = mx + (a + my)ω. Thus a + my ∈ coeff(a) is a multiple of
m, and this implies immediately that m | a, and hence a = mb for some b ∈ Z.
For verifying the last divisibility relation we set α = a + mω = m(b + ω). With
α ∈ a clearly α(b+ω ) is contained in the ideal a. Since m1 Nα = m(b+ω)(b+ω ) ∈
a ∩ Z we find that m1 Nα = m · N(b + ω) is a multiple of n.
Our next goal is the statement that the norm aa of an ideal a is generated by an
integer. For principal ideals this is clear since (α)(α) = (α)(α ) = (αα ) = (Nα).
144 6 Ideals in Quadratic Number Fields
A key step in proving unique factorization for ideals in general number fields is
showing that for each integral ideal a = (0) there is an ideal b = (0) such that
ab = (α) is principal.
Proposition 6.8 Let a = (0) be an ideal in Ok . Then there is an a ∈ N with
aa = (a).
Remark Here the notation (a) is slightly ambiguous since it is not clear whether we
are talking about the ideal aZ in Z or the ideal aOk generated by a in Ok . Since on
the left side there is an ideal in Ok , clearly (a) must be the ideal (a) = aOk .
For the proof of Proposition 6.8, we use the following lemma1 due to A. Hurwitz:
Lemma 6.9 (Hurwitz’s Lemma) Assume that α, β ∈ Ok and m ∈ N. If Nα, Nβ
and Tr αβ are divisible by m, then m | αβ and m | α β.
Proof Let γ = αβ /m; then γ = α β/m, and we know that γ + γ = (Tr αβ )/m
Nβ
and γ γ = Nαm m are integers. If the norm and the trace of an element of a quadratic
number field are integers, the element must be an algebraic integer, hence γ ∈ Ok ,
and the claim follows.
Proof of Proposition 6.8 We write a = (α, β) for α, β ∈ Ok (we can do so by
Proposition 6.5). Then a = (α , β ), and thus aa = (Nα, αβ , α β, Nβ). If we set
a = gcd(Nα, Nβ, Tr αβ ) (in Z), then by Hurwitz’s Lemma 6.9, the two numbers
αβ αβ Nα Nβ αβ α β
a and a are algebraic integers; thus we obtain aa = (a)( a , a , a , a ),
where the last ideal lies in Ok by Hurwitz’s Lemma. In order to show that aa = (a),
Nβ αβ α β
it is enough to show that 1 ∈ ( Nα a , a , a , a ). But 1 is a Z-linear combination
Nβ Tr αβ Nα Nβ αβ
a , a and a , hence a Ok -linear combination of
of Nα a , a and a + αaβ , and
the claim follows.
The natural number a in Proposition 6.8 is called the norm of the ideal a; we
thus have aa = (Na). Since (Nab) = (ab)(ab) = (aa )(bb ) = (Na)(Nb), the
ideal norm is multiplicative. Other important properties of the norm of ideals are the
following:
• Na = 1 ⇐⇒ a = (1): In fact, Na = 1 implies (1) = aa ⊆ a ⊆ Ok = (1),
and the converse is clear.
• Na = 0 ⇐⇒ a = (0): It follows from aa = (0) that Nα = αα = 0 for all
α ∈ a.
The following property shows that the norm of an ideal can easily be computed
from its Z-basis:
1 This lemma is related to Dedekind’s “Prague Theorem”; see [80]. At this point we are using the
fact that the ring Ok is integrally closed, i.e., is equal to the maximal order.
6.2 Unique Factorization into Prime Ideals 145
For proving this claim, we write a = nZ + (a + mω)Z as in Prop. 6.7 and set
α = m(b + ω). Then a = (n, α), a = (n, α ) and
Now we approach the theorem of unique factorization into prime ideals. The idea
behind the proof is the same as for numbers. In that case we could immediately
conclude from an equation αβ = αγ with α = 0 that β = γ (we just multiply by
the inverse of α); in the case of ideals, this is not yet possible since we do not have an
“inverse ideal” at our disposal. The fact that the “cancellation law” is nevertheless
correct is the content of the next proposition.
Proposition 6.11 (The Cancellation Law) If a, b, and c are nonzero ideals in Ok
with ab = ac, then b = c.
Proof Assume first that a = (α) is principal; then ab = αb, and hence b = α −1 ab
and c = α −1 ac = α −1 ab = b.
If a is an arbitrary ideal, then ab = ac immediately implies that (aa )b = (aa )c.
Since aa = (Na) is principal, the claim now follows from the first part of the
proof.
Thus the ideals in Ok form a monoid with cancellation. Such monoids can be
completed to a group in a formal way by imitating the construction of Q from Z,
namely by considering expressions of the form a/b, which are multiplied via a/b ·
c/d = ac/bd. It is possible to interpret an element ab−1 of this group as a set by
setting ab−1 = 1b ab , where b is the norm of b, and defining m1 a = { m
α
: α ∈ a}.
Such sets are called fractional ideals.
146 6 Ideals in Quadratic Number Fields
Now that we have defined products of ideals we can study divisibility questions. Of
course we say that an ideal b is divisible by an ideal a if there exists an ideal c such
that b = ac. Since c ⊆ Ok , it follows from a | b that b = ac ⊆ a(1) = a: To divide
is to contain. The converse also holds.
If p is a prime ideal in Ok , then there is a unique prime number p > 0 with p | (p).
In fact, p | pp = (Np); factoring Np in the integers and observing that p is prime,
we deduce that there is a prime number p such that p | p. The fact that p cannot
divide two distinct prime numbers should be clear: If p | (p) and p | (q), then
p, q ∈ p, hence 1 ∈ p, and this is a contradiction.
148 6 Ideals in Quadratic Number Fields
If p is the prime number that p divides, then we say that p lies above p. Since
the ideal (p) in Ok has norm p2 , it follows that each prime ideal above p has norm
p or p2 .
The determination of all prime ideals in Ok is not difficult (the case p = 2 is
taken care of in Exercise 6.21).
Theorem√6.14 Let p be an odd prime number, m a squarefree integer, and
k = Q( m ) a quadratic number field with discriminant Δ. Then we have the
following:
√
• If p | Δ, then (p) = (p, m )2 ; we say that p is ramified.
• If (Δ/p) = +1, then (p) = pp for prime ideals p = p ; we say that p splits.
• If (Δ/p) = −1, then the ideal (p) is prime; we say that p is inert.
√
Proof √Assume first that p√| Δ; since p is odd, we have p | m. √ Now (p, m )2 =
(p2 , p m, m) = (p)(p, m, m p ) = (p) since the ideal (p, m, p ) contains the
m
m
coprime integers p and p and thus is equal to the unit ideal (1).
Now assume that (Δ/p) = 1; then Δ is a quadratic residue modulo p, and since
Δ = m or Δ√= 4m, so is m. Thus there is an x ∈ Z with x 2 ≡ m mod p. We set
p = (p, x + m ) and find
√ √
pp = (p2 , p(x + m ), p(x − m ), x 2 − m)
√ √
= (p)(p, x + m, x − m, (x 2 − m)/p).
√ √ √ √
Clearly, 2 m = x + m − (x − m ), and thus 4m = (2 m)2 is contained in
the last ideal. Since p and 4m are coprime, this ideal is the unit ideal, and we have
pp = (p). If we had p = p , then it would follow as above that 4m ∈ p and p = (1):
a contradiction.
Finally, assume that (Δ/p) = −1. If there were an ideal p with norm p, then
by Proposition
√ 6.7 it would have the form p = (p, b + ω) with p | N(b + ω). If
ω = m, this means b2 − m ≡ 0 mod p, hence √ (Δ/p) = (4m/p) = (m/p) = +1
contradicting our assumption. If ω = 12 (1 + m ), then (2b + 1)2 ≡ m mod p, and
we get a contradiction as above.
We can combine the two cases p = 2 and p = 2 by using the Kronecker symbol
(Δ/p). Recall that this symbol coincides with the Legendre symbol for odd values
of p; for p = 2 and Δ ≡ 1 mod 4, it is defined by (Δ/2) = (−1)(Δ−1)/4, and for
Δ ≡ 1 mod 4, we set (Δ/2) = 0. Using the Kronecker symbol, a prime number p
p ) = +1, 0, or −1, respectively.
splits, ramifies, or is inert according as ( Δ
√
= ±1, whereas the ideal (2, 1 + −5 ) generated by them describes the “correct”
greatest common divisor.
There are also pairs of elements with common divisors but without a greatest
one. For example,
√ √ √ √ √
6 = 2 · 3 = (1 + −5 )(1 + −5 ) and − 4 + 2 −5 = 2(−2 + −5 ) = (1 + −5 )2
√
have common divisors 2 and 1 + −5, but there √ is no greatest common divisor.
On the other hand, there √ Z[ −5 ] that do have a greatest common
are elements in √
divisor, for example, 2 + 2 −5 and 3 + 3 −5. Here we have
√ √ √ √
gcd(2 + 2 −5, 3 + 3 −5 ) = (1 + −5 ) gcd(2, 3) ∼ 1 + −5.
How can we decide whether such a greatest common divisor of two elements α and
β exists? To answer this question, we consider the ideal (α, β) generated by them
and check whether it is principal. If (α, β) = (δ), then δ is an “honest” greatest
common divisor of α and β. One goal of this (and the next) chapter is providing a
method for testing whether an ideal is principal or not.
We have seen that the set of integral ideals = (0) in Ok forms a monoid with
the cancellation law. Such monoids can be made into a group Ik in a rather
formal way resembling the construction of the field Q of rational numbers from
the multiplicative monoid Z. Such quotients of ideals are called fractional ideals.
Formally, two such ideals a/b and c/d are multiplied in the same way as fractions
of numbers, and of course we may cancel common factors. Principal ideals of the
form (α) = αOk , where α ∈ k × is not necessarily an algebraic integer, are called
principal fractional ideals, and they form the subgroup Pk in Ik . The quotient group
Cl(k) = Ik /Pk is called the ideal class group of k.
Those who do not like such a formal approach may describe fractional ideals as
sets. In fact, write a fractional ideal ab−1 as ab (bb )−1 = 1b ab, where b = Nb
denotes the norm of b. Then we define α1 c := { γα : γ ∈ c}. On the set of fractional
ideals = (0), we define products as for integral ideals; then we show that they form
a group.
We will use a third approach that does not use any fractional ideals. In fact, the
definition of the ideal class group above implies that two ideals a and b belong to
the same class modulo the group Pk of principal ideals if a = ξ b for some ξ ∈ k × .
If we write ξ = β/α with α, β ∈ Ok , then this is equivalent with αa = βb.
Such equations define an equivalence relation on the set of nonzero integral
ideals: We will call ideals a and b equivalent and write a ∼ b if there exist elements
150 6 Ideals in Quadratic Number Fields
We will now show that each ideal class in the ring of integers of a quadratic number
field k contains an ideal whose norm is bounded by a constant depending only on k.
This immediately implies that the class number is finite. Let us call an ideal primitive
if it is not divisible by an ideal of the form (m) = (1) with m ∈ Z. Clearly, each
ideal class is represented by a primitive ideal since dividing an ideal by the principal
ideal (m) does not change its class.
By Proposition 6.7, each ideal a possesses a Z-basis of the form {n, m(b + ω)}
with m | n; in particular, a is primitive if and only if m = 1. In other words:
6.3 Ideal Class Groups 151
Proposition 6.16 If the ideal a is primitive, then there exist n ∈ N and b ∈ Z with
a = nZ ⊕ (b + ω)Z, and we have Na = n. In particular, we have a ∩ Z = (Na) in
this case.
Now we claim the following:
√
Theorem 6.17 Let m ∈ Z be squarefree and k = Q( m ) a quadratic number field
with ring of integers Ok = Z[ω] and discriminant Δ. Define the Gauss bound μk
by
√
Δ/5, if Δ > 0,
μk = √
−Δ/3, if Δ < 0.
Then each ideal class of k contains an integral ideal = (0) with norm ≤ μk ; in
particular, the number h of ideal classes is finite.
Before we prove this result, we will present a few applications. Clearly, the
bounds are best possible since for Δ = 5 and Δ = −3, they cannot be improved.
If μk < 2, then each ideal class contains an integral ideal = (0) with norm < 2,
hence with norm 1. The only such ideal is (1), and this implies that there is a single
ideal class, namely the class of principal ideals. Thus in this case, Ok is a unique
factorization domain. Theorem 6.17 tells us that this holds for all fields k with
discriminant −12 ≤ Δ ≤ 20, i.e., for m ∈ {−11, √ −7, −3, −2, −1, 2, 3, 5, 13, 17}.
Next√consider the ring of integers R = Z[ −5 ] in the quadratic number field
k = Q( −5 ) with Δ √ = −20; according to Theorem 6.17, each ideal class contains
an ideal with norm < 20/3 and so with norm ≤ 2. Since there are only √two such
ideals, namely the principal ideal (1) and the nonprincipal ideal (2, 1 + −5 ), the
field k has class number 2.
Proof of Theorem 6.17 Let c = [a] be an ideal class represented by an ideal a.
√assume that a is primitive. Thus a = (a, α) with
Without loss of generality, we may
a = Na and α = b + ω = s + 12 Δ for some s ∈ Q with 2s ∈ Z. If a ≤ μk , then
we are done; otherwise, we apply the Euclidean algorithm to the pair (s, a) and find
an integer q ∈ Z with s − qa = r and
a
|r| ≤ if Δ < 0,
2
a
≤ |r| ≤ a if Δ > 0.
2
√
Setting α1 = r + 1
2 Δ, we will show below that
(1) α1 ∈ a,
(2) |Nα1 | ≤ a −Δ
2
4 ≤ a 2, and
(3) a1 := a α1 a ∼ a is an integral ideal with [a1 ] = [a] and Na1 < Na.
1
152 6 Ideals in Quadratic Number Fields
We repeat this step until we have found an ideal with norm ≤ μk ; since the norm
decreases at each step by at least 1, this process must terminate after finitely many
steps.
Now clearly α1 = α − qa ∈ a, which proves (1). The proof of the inequality (2)
is easy: If Δ < 0, we have |Nα1 | = |r 2 − Δ4 | ≤ a +|Δ| < 1 since a 2 > μ2k = |Δ|
2
4 3 ,
and if Δ > 0, we clearly have −a 2 = a −5a
2 2
4 < r 2 − Δ4 < a 2.
It remains to show that the ideal a1 is integral; but since
1
α a ⊆ Ok ⇐⇒ α a ⊆ (a) = aa ⇐⇒ (α ) ⊆ a ,
a 1
this is clear.
The following observation, which generalizes our results on representations of
prime numbers in the form x 2 + y 2 or x 2 + 3y 2 , is an important consequence of
Theorem 6.17.
√
Corollary 6.18 Let m be a squarefree integer and k = Q( m ) a quadratic number
field with class number h, and assume that pOk = pp in Ok splits. Then there exist
integers x, y ∈ N with ±4ph = x 2 − my 2 .
√
Proof The h-th power of each ideal in Ok is principal. In particular, ph = ( x+y2 m
),
| x −my
2 2
and taking norms, we obtain ph = 4 |.
We now show how to compute class groups in a given quadratic number field.
√ √
k = Q( −21 ), Δ = −84 The Gauss bound is μk = 84/3, so we have to
√ with norm ≤ 5. Since 22 | Δ, the prime √ 2 is ramified: (2) = a for
consider ideals 2
√ √
fact, we have −21 ≡ 2 mod c , and hence 3 + −21 ≡ 3 + 2 = 5 ≡ 0 mod c .
Thus abc ∼ 1, and now c ∼ c−1 implies ab ∼ c.
Finally, c ∼ c−1 ∼ a−1 b−1 ∼ ab since a2 ∼ b2 ∼ 1. Thus there exist exactly 4
ideal classes: the class of principal ideals and three classes [a], [b], and [a][b] with
order 2 in the class group. Thus the class group is isomorphic to Z/2Z × Z/2Z,
which is called Klein’s four group.
√
k = Q( −17 ), Δ = −68 Here √ we have to consider all ideals with norm √ ≤ 4. We
have (2) = a2 with a = (2, 1 + −17 ) and (3) = bb with b = (3, 1 + −17 ).
The ideals with norm ≤ 4 thus are (1) a, b, b , and (2) = a2 .
Now b2 cannot be principal: The only elements with norm 9 are ±3, but
b =√(3) = bb by unique factorization into prime ideals. On the other hand,
2
Since [a] = [(1)], it follows from the first relation that [b] = [c ] = [c]−1 . The third
relation shows that [b] = [d ] = [d]−1 . Thus the ideal class group is generated by
the ideal b whose third power is principal. It remains to check whether b is principal.
As we will see in the next chapter in connection with the bounds (7.7), this is not
the case. Thus Cl(k) Z/3Z.
Here is a small table with nontrivial class numbers for practicing class number
calculations:
√
pOk = pp ). Then either p =√(a + b −5 ) is principal and thus p = a 2 + 5b2, or
p ∼ a, and then ap = (C +d −5 ) is principal. In this case we find 2p = C 2 +5d 2 ;
but since C and d are odd, we can write C = 2c + d for some c ∈ Z, and then we
find 2p = (2c + d)2 + 5d 2 = 4c2 + 4cd + 6d 2 , and hence p = 2c2 + 2cd + 3d 2 .
In other words, if (−5/p) = +1, then p can be written in the form p = a 2 + 5b 2
or p = 2c2 + 2cd + 3d 2 .
Polynomials Ax 2 + Bxy + Cy 2 ∈ Z[x, y] are called binary quadratic forms.
Their discriminant is defined by Δ = B 2 − 4AC. In particular, the binary quadratic
forms x 2 + 5y 2 and 2x 2 + 2xy + 3y 2 both have discriminant Δ = −20. This is not
a coincidence: Gauss defined an equivalence relation on the set of binary quadratic
forms with the same discriminant, and Dirichlet and Dedekind have shown that
these equivalence classes correspond, in the case of fundamental discriminants, to
ideal classes in quadratic number fields (with a technical complication in case of
positive discriminants). For Δ = −20, there exist two different classes, namely
those represented by x 2 + 5y 2 and 2x 2 + 2xy + 3y 2 .
By the Modularity Theorem, we have
−5
= +1 ⇐⇒ p ≡ 1, 3, 7, 9 mod 20.
p
If we investigate which prime numbers are represented by which of the forms above,
then we find
x 2 + 5y 2 if p ≡ 1, 9 mod 20,
p=
2x + 2xy + 3y
2 2 if p ≡ 3, 7 mod 20.
( −1
p ) = ( p ) = −1.
5
Let us now investigate what we can say about the solutions of the Bachet–Mordell
equation y 2 = x 3 − d for integers d > 0, where we will make suitable assumptions
on d in the course of our calculations.
We begin by factoring the right side and write
√ √
x 3 = y 2 + d = (y + −d )(y − −d ).
√
We would like the ideals a = (y + −d ) and a to be coprime. Clearly, √ each
common
√ prime ideal factor p (with p | p) also divides the difference 2 −d; since
p | −d (and p = 2) immediately yields p | d, p | y, p | x and finally p2 | d, we
may exclude this case by assuming that d is squarefree . Thus only the possibility
p | 2 remains; we now distinguish the following cases:
√ √
• d ≡ 2 mod 4: Then p | ( −d ) (since p = (2, −d )), hence p | y, p | y, and
finally x 3 = y 2 + d ≡ 2 mod 4: this is a contradiction since a cube cannot be
divisible exactly by 2. √ √
• d ≡ 1 mod 4: Then p = (2, 1 + −d ), and hence p | (y + −d ) if and only
if y is odd. This implies x 3 = y 2 + d ≡ 1 + 1 ≡ 2 mod 4, and this is again a
contradiction. √
• d ≡ 3 mod 4: Here y + −d is divisible by p (even by 2) if and only if y is odd. It
follows from d = x 3 −y 2 that x must be even, and hence d ≡ −y 2 ≡ −1 mod 8.
Thus if we assume that d ≡ 7 mod 8 , then p | 2 cannot be a common divisor of
a and a also in this case.
Thus a and a are in fact coprime. Since their product is a cube, there is an ideal
b such that a = b3 , which implies that we also have √ a 3 = b 3 . Now we need the
next assumption: If h denotes the class number of Q( −d ), then we demand that
3 h . Then b3 and bh are principal ideals, hence so is b3a+hb for all a, b ∈ Z, and
since 3 and
√
h are coprime we find, using Bézout, that b itself must be principal, say
b = ( r+s 2 −d ) for integers r, s with r ≡ s mod 2.
If we assume that d > 0, d = 1, 3 , then ±1 are the only units, and from the
equation of ideals above, we obtain the equation of elements
√
√ r + s −d 3
y+ −d = ,
2
where we have subsumed the sign into the cube. Comparing coefficients now yields
1 = 18 (3r 2 s − ds 3 ), and therefore 8 = 3r 2 s − ds 3 = s(3r 2 − ds 2 ). Clearly, we
must have s | 8, and hence s = ±1 or r ≡ s ≡ 0 mod 2. In the first case we find
±8 = 3r 2 − d, hence d = 3r 2 ∓ 8, and in the second case, we set r = 2t, s = 2u
and find 1 = u(3t 2 − du2 ), that is, u = ±1 and d = 3t 2 ∓ 1.
156 6 Ideals in Quadratic Number Fields
Thus we have shown: If d satisfies our assumptions and does not have the form
d = 3t 2 ± 1 or d = 3t 2 ± 8, then the Diophantine equation y 2 = x 3 − d does not
have an integral solution.
What happens if d has one of these forms? Assume for example that d = 3r 2 −8;
then comparing coefficients immediately yields (observe that s = 1)
y 2 + d = r 6 − 6r 4 + 12r 2 − 8 = (r 2 − 2)3 ,
d (x, y)
3t 2 −1 (4t 2 − 1, ±t (8t 2 − 3))
3t 2 + 1 (4t 2 + 1, ±t (8t 2 + 3))
3t 2 − 8 (t 2 − 2, ±t (3 − t 2 ))
3t 2 + 8 (t 2 + 2, ±t (3 + t 2 ))
2 Ina similar way it can be shown that the Fermat equation x p + y p = zp for prime exponents p
has only trivial solutions with xyz = 0 if p does not divide the class number of the field Q(ζp ) of
p-th roots of unity—this is essentially Kummer’s approach to Fermat’s Last Theorem.
6.4 The Diophantine Equation y 2 = x 3 − d 157
Observe that Theorem 6.20 contains several results on this equation that we have
obtained before: Since 2 = 3 · 12 − 1, the equation y 2 = x 3 − 2 has the only integral
solution (3, ±5).
If we look carefully at the case d = 26 = 3 · 32 − 1, we see that y 2 = x 3 − 26
has the solution (35, ±207) given by our theorem as well as the additional solutions
(3, ±1).√ This is not a contradiction: The theorem now implies that the class number
of Q( −26 ) must be divisible by 3. In fact, the class number is equal to 6. This
example can be generalized (see Exercise 6.27).
It is natural to ask whether the solutions found for d = 26 are the only ones. We
cannot answer this question here, but we would like to show how to begin such an
investigation. √ √
As above we find (y + −26 )(y − −26 ) = x√3 , and since the two factors on
the left hand side are coprime, we must have (y + −26 ) = a3 for some ideal a.
If a = (α) is principal, then the only solution is, as we have already seen, (x, y) =
(35, ±207). If a is not principal, then it lies in
√ some ideal class of order 3. One such
class is generated by√the ideal p = (3, 1 + −26 ) the other one by its conjugate;
in fact, p3 = (1 + −26 ), and clearly p is not principal. Thus√either pa = (α)
or p a√ = (α) is principal. In the first case, multiplying (y + −26 ) = a3 by
(1 + −26 ), we obtain the equation
√ √
(1 + −26 )(y + −26 ) = (pa)3 = (α)3 ,
and hence
√ √
y − 26 − (y + 1) −26 = (a − b −26 )3 .
Comparing the coefficients of the real and imaginary parts, we obtain the equations
y − 26 = a(a 2 − 78b2),
y + 1 = b(3a 2 − 26b 2).
27 = −a 3 + 3a 2b + 78ab2 − 26b 3.
A3 − 9Ab2 + 2b 3 = 1. (6.1)
158 6 Ideals in Quadratic Number Fields
At this point we invoke algebraic number theory. We consider the cubic number field
Q(ω), where ω is a root of the polynomial f (x) = x 3 − 9x + 2 = 0, and the domain
Z[ω]. The norm of an element α = A − bω can be determined, as in the quadratic
case, by computing the determinant of the linear map given by multiplication by α.
We find
αω = Aω − bω2 ,
αω2 = Aω2 − bω3 = Aω2 − b(9ω − 2)
Thus (6.1) boils down to the question whether there is a unit of the form A − bω in
Z[ω]. Since f has three real roots, Dirichlet’s unit theorem tells us that there exist
two independent units. Using pari, we find the units ε1 = 3ω2 + 9ω − 1 and
ε2 = 2ω2 + 4ω − 1. Thus the question (which is anything but easy to answer) is
whether there exist exponents m and n with ε1m ε2n = A − bω.
As we have seen, it is a highly nontrivial problem to determine which integers
are represented by a binary cubic form such as (6.1). Thue [122] showed in 1909
that an equation F (U, V ) = m, where F (U, V ) = AU 3 + BU 2 V + CU V 2 + DV 3
is an irreducible cubic form, has only finitely many solutions in integers.
Let me say a few words about a connection between class numbers and elliptic
curves of the form y 2 = x 3 − m. If we write this equation in the form y 2 + m = x 3 ,
we see √ that for each integral point (x, y) on this elliptic curve, the principal ideal
(y + −m√) is, except for factors coming from common divisors with its conjugate
ideal (y − −m ), is a cube of an ideal. Ideals whose third √ powers are principal are
sources for ideal classes of order 3 in the class group of Q( −m ). √
In general, the equation y 2 = x 3 − mz2 will lead to ideals (y + z −m ) that
often are cubes of ideals. For more on the connection between this equation and the
3-class group3 of quadratic number fields, see [51] and the literature cited there.
If we factor the equation y 2 = x 3 − m on the √ right side, then we have to
study
√ the 2-class
√ group of pure cubic number fields Q( 3
m ) or their normal closure
Q( −3 , m ). This will require familiarity with the basic concepts of algebraic
3
3 This is the 3-Sylow subgroup of the ideal class group, which consists of all ideal classes whose
order is a power of 3.
6.5 Exercises 159
6.4.1 Summary
6.5 Exercises
6.1. Show that the elements a + bi with a + b ≡ 0 mod 2 are exactly the multiples
of 1 + i.
6.2. Let R be a principal ideal domain and (d) = (a, b) for a, b, d ∈ R. Show that
d is a greatest common
√ divisor of a √
and b. √
6.3. Let a = (1√+ −5, 2), b = √ (1 + −5, 3), and c = √ (1 − −5, 3). Verify
ab = (1 + −5 ), ac = (1 − −5 ), and c2 = (2 − −5 ).
6.4. Show that the integer a in the basis of Proposition 6.5 is only determined
modulo n.
6.5. Show that the ideal I = (1 + 2i) properly contains the Z-module M =
5Z + (1 + 2i)Z by showing that −2 + i ∈ M. Determine the Z-basis of the
ideal I . √
6.6. Let R = Ok for k = Q( m √ ) and a squarefree integer m, and M = Z.
Show that the residue classes b m (b ∈ Z) in R/M are all distinct and that
N(M) = ∞. √
6.7. Show that (7, 1 + −5 ) = (1). Show more generally that (a, α) = (1) for
a ∈ Z and α ∈ Ok if gcd(a, Nα) = 1. √
6.8. Determine the prime ideal factorization of√ (4 + −5 ). √
6.9. Compute a greatest common
√ divisor of 8+ −14 and 4− −14 [107, S. 313].
6.10. The ideal (21, 10 + −5 ) has√norm 21 and is divisible
√ by prime ideals above
3 and 7 and hence by (3, 1 ± −5 ) and (7, 3 ± −5 ). Determine the exact
prime ideal factorization [107, S. 350].
6.11. Let m = a 2 √ + b2 be a sum√of two squares, and assume that a is odd. Show
that (a, b + m )2 = (b + m ).
√ 2
6.12. Explain 2 · 3 = − −6 by factoring the elements into prime √
ideals.
√ 1+ −23
6.13. Let k = Q( −23 ); show that (2) = aa for a = (2, ) and a3 =
√ 2
−23
( 3−2 ). Why is the ideal a2 not principal?
160 6 Ideals in Quadratic Number Fields
6.22. If Δ ≡ 5 mod 8, then (2) remains prime, and there are no ideals of norm 2
in Ok . Show that this implies that the fields with Δ = −19, 21, 29, 37 have
class number 1. Which fields do we obtain by demanding in addition that
Δ ≡ 2 mod 3? √
6.23. Show that the class number of Q( −m ) is even √ for each m ≡ 1 mod 4
with m √> 1. To this end, show that (2, 1 + −m )2 is principal but that
(2, 1 + −m ) is not.
6.24. Let k be a complex quadratic number field with discriminant Δ < 0. For
some small values of Δ, compute the sum
|Δ|/2
w Δ
h= r,
2|Δ| r
r=1
√ √
6.29. Solve the preceding exercise for the fields Q( −6 ) and Q( −10 ).
6.30. Verify the following assertions for small prime numbers p:
Let p be an odd prime number with ( −23 p ) = +1. Then the two prime ideals
√
above p in Q( −23 ) are principal if and only if the polynomial f (x) =
x 3 − x + 1 splits into three linear factors modulo p.
Observe that f has discriminant −23. This result is a consequence of class
field theory.
6.31. Show that (2, x) ⊇ (2) in R = Z[x], but that there does not exist an ideal I
in R with (2, x)I = (2).
6.32. Consider the set S of all sequences of rational numbers. This set is a ring with
respect to addition and multiplication defined as follows:
class.
6.38. Show that the Diophantine equation y 2 = x 3 − 31 does not have an integral
solution (Hall [47]).
Chapter 7
The Pell Equation
Complex quadratic number√rings have finitely many units; in the real quadratic case,
the rings of integers of Q( m) seem to contain a unit ε of infinite order:
m 2 3 5 6 7
√ √ √ √ √
ε 1+ 2 2+ 3 2 (1 +
1
5) 5+2 6 8+3 7
√
The existence of nontrivial units ε = x+y2 m in real quadratic number fields
√
Q( m) is equivalent to the solvability of the Pell equation x 2 − my 2 = ±4 in
nonzero integers for all squarefree values of m > 0. In this chapter we will prove
that the equation x 2 − my 2 = 1 has a nontrivial1 solution in integers whenever
m > 0 is not a square, and we will provide methods for computing units in real
quadratic number fields.
Before we prove the solvability of the Pell equation, we make a few remarks on
the connection between the equations x 2 −my 2 = ±4√ and the equation x −my = 1
2 2
and on how to compute the fundamental unit of Q( m) from the minimal nontrivial
solution of x 2 − my 2 = 1 and vice versa.
Consider for example the case m = 13. Here the fundamental unit of√the
√
ring of integers Ok of the quadratic number field k = Q( 13) is ε = 3+2 13 ,
√ solution (3, 1) of the Pell√equation x − 13y = −4.
which corresponds to the 2 2
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2021 165
F. Lemmermeyer, Quadratic Number Fields, Springer Undergraduate
Mathematics Series, https://doi.org/10.1007/978-3-030-78652-6_7
166 7 The Pell Equation
√
Proposition 7.1 If k = Q( m) for some squarefree √ integer m ≡ 1 mod 4, and if ε
is a unit in Ok , then ε3 is a unit in the order Z[ m ].
In other words, if t 2 − mu2 = ±4, then T 2 − mU 2 = ±1, where
√ t + u√m 3
T +U m= .
2
Proof If t 2 −my 2 = ±4 for odd integers t and u, then reducing this equation modulo
√
8 shows that m ≡ 5 mod 8. In this case, the prime ideal (2) is inert in Q( m);
hence, the group of coprime residue classes modulo 2 has order N(2) − 1 = 3, and
this in turn implies that ε3 ≡ 1 mod 2.
Clearly, if ε is a unit with norm −1, then ε2 is a unit with
√ norm +1. Thus if we
want to compute a fundamental unit from the unit x + y m corresponding to the
smallest positive solution of the Pell equation x 2 − my 2 = 1, √
then we have to check
whether ε is a square, a cube, or a sixth power of a unit in Q( m).
We will explain how to do this in the case at hand. Assume we have the minimal
√ solution (649, 180) of the Pell equation x − 13y = 1; then η = 649 +
positive 2 2
180 13 is√a unit with norm 1 (and the smallest positive unit with norm 1 in the
domain Z[ 13 ]). √
√
For checking whether η ∈ Q( 13), we use the real approximations
√
η = 649 + 180√13 ≈ 1297.9992295 . . .,
η = 649 − 180 13 ≈ 0.0007704 . . . .
√
Clearly, the trace η + η = 2 · 649 is an integer. If η is a square, then the trace of η
must also be an integer. We find
√
η ≈ 36.0277563773 . . .,
√
η ≈ 0.0277563773 . . ..
√ √ √
√ η+ η
This shows2 that 12 ( η − η ) ≈ 18 and √ ≈ 5 are very close to integers,
√ √
2 13
which in turn suggests that η = 18 + 10 13. Now we can readily verify that
√ √ √ √
(18 + 5 13)2 = η. Observe that 18 − 5 13 < 0, which is why 18 − 5 3 ≈ − η .
In a similar way we can check that η is a cube and in fact a sixth power:
√6 η ≈ 3.30277563773 . . .,
√
6
η ≈ 0.30277563773 . . .,
The history of the Pell equation in Europe3 begins with Fermat’s challenge in 1657.
In that year, Fermat posed the following problem (among others) and asked his
contemporaries, in particular the English mathematicians John Wallis and William
Brouncker, for a solution:
Given an arbitrary natural number, which is assumed to be not a square, there are infinitely
many square numbers with the property that after adding 1 to the product of one of these
square numbers with the given number, another square is produced [. . . ]. We ask e.g. for a
square that produces another square after adding 1 to the product with 149 or 109 or 433
etc.
3 Strictly speaking, the investigation of Platon’s side and diagonal numbers by Theon may be seen
as the only serious investigation of a Pell equation in ancient Greece. Equations of Pell type also
figure prominently in the Cattle Problem of Archimedes; it is not known, however, whether there
were any attempts at solving this problem before it was discovered by Lessing in 1773.
4 We have derived the rational parametrization of Pell conics in Theorem 3.1.
5 An excellent account of Indian mathematics was given by Kim Plofker [104]. For an investigation
1
|Aξ1 + Bξ2 | ≤ (|ξ1 | + |ξ2 |), |A| ≤ N, |B| ≤ N. (7.1)
N
Proof We assume that ξ1 and ξ2 are both positive (otherwise we just have to change
the signs of a and b in the proof below). The irrationality of ξ1 /ξ2 implies that the
function
6 It seems that this principle was given a name rather late (in the twentieth century?); a pigeonhole
is a drawer, so the last thing you would like to put there are pigeons.
7.1 The Solvability of the Pell Equation 169
into N 2 subintervals of equal length N1 (|ξ1 | + |ξ2 |), then since (N + 1)2 > N 2
there must exist, according to Dirichlet’s pigeonhole principle, at least two pairs
(a, b) = (a , b ) with |f (a, b) − f (a , b )| ≤ N1 (|ξ1 | + |ξ2 |). Now we set A = a − a
and B = b − b ; these integers have the desired properties.
Corollary 7.4 Assume that m ∈ N is not a square. Then there exists an integer c
such that the equation A2 − mB 2 = c has infinitely many solutions (A, B) ∈ Z × Z.
Proof By the preceding lemma, there exist numbers A, B ∈ Z, not both 0, that
satisfy the inequalities
√ 1 √
|A − B m | ≤ (1 + m ), |A| ≤ N, |B| ≤ N. (7.3)
N
The triangle inequality shows that
√ √ √
|A + B m | ≤ |A| + |B m | ≤ (1 + m ) · N, (7.4)
Now let N → ∞; then infinitely many distinct√pairs (A, B) must occur, since if
we had only finitely many, then the set {|A − B m | : A, B ∈ Z} would possess a
minimum, which is impossible because of (7.3).
Since |A2 − mB√ | is
2 bounded from above by (7.3), there must exist an integer c
with |c| ≤ (1 + m )2 for which A2 − mB 2 = c has infinitely many solutions in
integers.
Now we can prove Theorem 7.2. According to Corollary 7.4, there exists an
integer c = 0 such that there are infinitely many pairs (A, B) with A2 − mB 2 = c;
here we may clearly assume that A > 0. Among these infinitely many solutions, we
choose (c + 1)2 solutions and consider the residue classes of A and B modulo c;
by Dirichlet’s pigeonhole principle, there must exist pairs (A1 , B1 )√= (A2 , B2 ) with
A1 ≡ A2 mod c and B1 ≡ B2 mod c. The elements ηj = Aj +Bj m then have the
same norms Nη1 = Nη2 = c and satisfy the congruence η1 ≡ η2 mod c. It follows
from N(η1 /η2 ) = 1 that η1 /η2 is a unit if we can show that this is an algebraic
integer. To this end, observe that η1 /η2 = 1 + (η1 − η2 )/η2 = 1 + (η1 − η2 )η2 /c.
Since the difference η1 − η2 is divisible by c by construction, η1 /η2 is indeed an
algebraic integer and thus a unit.
It remains to show that η1 /η2 = ±1 is a nontrivial unit. But η1 /η2 = 1 follows
from η1 = η2 , and η1 /η2 = −1 follows from the fact that A1 and A2 are both
positive. This concludes the proof of Theorem 7.2.
We now know that there exist nontrivial units in each real quadratic number field.
In fact, it is possible to determine the abstract structure of the unit group: For real
quadratic number fields k, we have Ok× (Z/2Z) × Z. As we will show in a
170 7 The Pell Equation
moment, each unit η ∈ Ok× can be written in the form η = (−1)s εt for some
“fundamental unit” ε, and then the map λ : Ek −→ (Z/2Z) × Z defined by λ(η) =
(s, t) provides us with an isomorphism of abelian groups. This is the content of our
next theorem.
Theorem 7.5 If k is a real quadratic number field, then there is a unit ε ∈ Ok× with
the property that every unit η ∈ Ok× can be written uniquely in the form η = ±εt
for some t ∈ Z. In particular,
Ok× Z/2Z ⊕ Z.
We immediately see that if ε has the property in Theorem 7.5, then so do the
units ±ε±1 . Among these four units, there are two that are positive, and among
these exactly one is > 1. This unit ε > 1 will be called the fundamental unit of k.
√
Proof We identify the numbers a + b m with those real numbers that correspond
×
√ η ∈ Ok with |η| = 1 then are
to the positive square root of m. The only units
η = ±1, which follows from irrationality of m.
We claim that among the units with |η| > 1, there is one with minimal absolute
value. Otherwise there would exist a unit (in fact, infinitely many) with 1 < |η| < 54
(just pick two units that are sufficiently close to the infimum of the absolute values
and consider their quotient). Since |ηη | = 1, this implies 45 < |η | < 1. If we write
√
η = a + b m (where 2a, 2b ∈ Z), then 2|a| = |η + η | ≤ |η| + |η | < 94 , and
hence |a| ≤ 1. Since a = 0 is not possible, we must have a = ±1. Then it follows
immediately from 1 < |η| < 54 that b = 0, and hence η = 1 in contradiction to our
assumption.
Let ε be a unit with minimal absolute value > 1. We claim that ε has the
properties listed in Theorem 7.5. Otherwise there would exist a unit η with εn <
|η| < εn+1 for some n ∈ N (the proof is similar to that of Theorem 2.6). But
then ηε−n is a unit whose absolute value lies strictly between 1 and |ε|, and this
contradicts the choice of ε.
Uniqueness is clear: ±εt = ±εu implies |εt −u | = 1, which in turn implies t = u
since ε is irrational. But then the signs must also coincide.
Remark The proof of the solvability of the Pell equation t 2 − mu2 = 1 given here
does not provide us with a method of computing the fundamental√ unit, except for
very small
√ values of m. For example, ε = 48842 +5967 67 is the fundamental unit
of Q( 67), and this solution is hard to find by solving the Pell equation by brute
force (i.e., looking for an integer m = 1, 2, 3, . . . such that mu2 +1 = t 2 is a square).
A much better way of computing the fundamental unit of quadratic number fields
with modest discriminants is based on the theory of continued fractions. For number
fields of higher degree, the computation of the unit group becomes time consuming
with growing degree and discriminant even when using the best algorithms that are
known today.7
7 Good sources for the state of the art are [20, 91], and, in particular, [66].
7.1 The Solvability of the Pell Equation 171
The equation t 2 − mu2 = −1 is called the negative Pell equation or sometimes the
anti-Pell equation. In this section we will show how to derive solvability conditions
for the negative Pell equation from the solvability of the usual Pell equation.
We begin by considering the equation t 2 − pu2 = 1 for prime values of p. We
can write this equation in the form
The greatest common divisor of t + 1 and t − 1 divides their difference 2, and hence
one of the following four possibilities must occur:
t + 1 = a 2, t − 1 = pb2 ,
t + 1 = pb2 , t − 1 = a2,
t + 1 = 2a 2, t − 1 = 2pb2 ,
t + 1 = 2pb2 , t − 1 = 2a 2.
We choose the integers a and b positive. Subtracting the right equation from the left,
we find that at least one of the following four equations must be solvable in integers:
a 2 − pb 2 = 2; a 2 − pb 2 = −2; a 2 − pb 2 = 1; a 2 − pb 2 = −1.
If we assume that (t, u) is the smallest positive solution of the Pell equation, then
we can exclude the equation a 2 − pb2 = 1 since t + 1 = 2a 2 implies that a < t.
A necessary condition for the equation a 2 − pb2 = 2 to be solvable is that
p ≡ ±1 mod 8. Similar considerations yield the following table:
Proposition 7.6 The solvability of t 2 − pu2 = 1 for odd prime numbers p implies
the solvability of
a 2 − pb 2 = −1 for p ≡ 1 mod 4,
a 2 − pb 2 = −2 for p ≡ 3 mod 8,
a 2 − pb 2 = +2 for p ≡ 7 mod 8.
The only method we know so far for showing the unsolvability of the norm equation
x 2 − my 2 = c for given values of m ∈ N and c ∈ Z is reducing the equation modulo
n for some choice of n, where n in general is a divisor of m or c, and showing
that the congruence does not have a solution. For example, x 2 − 10y 2 = ±2 is not
solvable in integers since the congruence x 2 ≡ ±2 mod 5 does not have solutions.
This method does not work in the case of the equation x 2 − 79y 2 = ±3, the reason
being that x 2 − 79y 2 = −3 has the rational solution x = 25 , y = 15 ; in particular,
this equation is solvable module each modulus coprime to 5. Similarly, the solutions
x = 137 and y = 7 show that the congruence is solvable for each modulus coprime
2
|Nα| = c. The basic idea is to choose the exponent √ n in β = αε in such a way that
n
the coefficients of β with respect to the basis {1, m } become as small as possible.
It is clear from geometric considerations that there exists an exponent n ∈ Z such
that
1 ≤ |εn α| < ε.
√
If we set β = εn α and write β = a + b m (again a and b are allowed to be
half-integers), then
|ββ | c
|β | = = ,
|β| |β|
This immediately yields bounds for a and b, and now the problem can be solved in
finitely many steps by simply checking the possible values of a and b one by one.
Before we do this in our example, we will improve the bounds on a and b.
To this end, we set β = εn α and choose the exponent n ∈ Z in such a way that
√
c √
√ ≤ |β| < cε .
ε
√ √ √
In our case, we have r = cε and s = c; thus |β + β | ≤ c ε + √1
ε
.
Since √1 < 1, this bound improves the previous one by a factor of about 2. We
ε
have proved the following:
Theorem 7.8 Let k be a quadratic number field with a unit ε >√1; then for each
α ∈ Ok with norm |Nα| = c, there exists an associate β = a + b m (with integers
or half-integers a, b) such that
1 √ √ 1 1 √ √ 1
|a| ≤ c ε+ √ and |b| ≤ √ c ε+ √ . (7.7)
2 ε 2 m ε
√
√ is an element α ∈ Z[ 79 ] with norm √
If there ±3, then (set m = 79, ε =
80 + 9 79, and c = 3) there is an element a + b 79 with norm ±3 such that
|b| < 1.25. Thus it is sufficient to consider b = 1, but the equation a 2√−79·12 = ±3
is not solvable in integers since 79 ± 3 is not a square. Thus Z[ 79 ] does not
contain an
√ element √ with norm ±3, and hence 3 is irreducible, but not prime since
3 | (2 − 79 )(2 + 79).
Remark Theorem 7.8 goes back to Pafnuty Chebyshev [18]; the corresponding
result in general number fields but with weaker bounds had been obtained before
by Dirichlet [30]. Chebyshev is best known for his contributions to the proof of
the prime number theorem. This theorem states that the number π(x) of all prime
numbers ≤ x is asymptotically equal to π(x) ∼ logx x in the sense that the quotient
of these functions has limit 1 as x → ∞; here log x denotes the natural logarithm.
Chebyshev proved that if the limit of x/π(x)
log x for x → ∞ exists, then it must be equal
to 1. The existence of this limit and thus the prime number theorem was established
independently in 1896 by Jacques Hadamard and Charles-Jean de la Vallée-Poussin.
Using Theorem 7.8, it is easy to prove a result going back to Harold Davenport:
√
For a proof, set ξ = x + y m; we will show that if | Nξ |√= n is not a square,
√ n ≥ 2t. Assume therefore that n < 2t; since ε = √
then t + m > 1 is a unit in
Z[ m ], we can find a power η of ε for which ξ η = a + b m has coefficients
√ a and
b that satisfy the bounds from Theorem 7.8. Because of 2t < ε < 2 m, we find
√
n √ 1 1
|b| ≤ √ ε+ √ <1+ .
2 m ε t
Since the claim is trivial for t = 1, we may assume √ that t ≥ 2, and then the last
inequality gives |b| ≤ 1. If b = 0, then (x + y m )η = a is associated with √ a
rational integer, and |Nξ | = a 2 is a square. If b = ±1, then α =√ξ η = a ± m.
Now |Nξ | = |Nα| = |a 2 − m| is minimal for values of a close to m, and we find
2t if a = t ± 1;
n = |a − m| =
2
1 if a = t.
Thus either n = 1 (which we have excluded) or n ≥ 2t. This proves our claims.
Proposition 7.9 was used by Ankeny, Chowla, and Hasse [2] for constructing
quadratic number fields with nontrivial class groups.
√
Proposition 7.10 The quadratic number field k = Q( m) with m = t 2 + 1 and
t = 2lq, where q is prime and l > 1, has class number > 1.
Since m ≡ 1 mod q, the prime q splits in k, and we have (q) = qq . If q is
principal, then the equation x 2 − my 2 = ±4q has integral solutions. But since
4q < 2t = 4lq is not a square, this contradicts Proposition 7.9.
Examples In the following examples, m = t 2 + 1 is prime. The ambiguous class
number formula (see Chap. 9) will explain why the class number h is odd in this
case.
q l t2 + 1 h q l t2 + 1 h
3 4 577 7 5 2 401 5
6 1297 11 4 1601 7
9 2917 3 9 8101 13
11 4357 5 11 12101 5
14 7057 21 12 14401 43
The following result8 shows that even a simple result such as Proposition 7.9
allows us to deduce astonishingly simple lower bounds for class numbers of fields
of Richaud–Degert type.
8 This theorem is due to Halter-Koch [48] and the proof presented here to Mollin [95].
176 7 The Pell Equation
Theorem 7.11 Let t be an odd integer with prime factorization t = p1e1 · · · pses and
set m =√t 2 + 1, and assume that m = t 2 + 1 is squarefree. Then the class number
h of Z[ m ] satisfies h ≥ S = 2τ (n) − 2, where
relation p0 qe = (t + 1 + m), together with the fact that no ideal of the form 2qj
with 0 ≤ j < e is principal, implies that the ideal class of q has order 2e; thus we
obtain the lower bound 2e | h, where 2e = 2τ (t) − 2.
The following table compares the lower bound S in Theorem 7.11 with the class
number h for a few small values of m:
t m S h t m S h t m S h
3 10 2 2 21 442 6 8 37 1370 2 4
5 26 2 2 23 530 2 4 39 1522 6 12
9 82 4 4 25 626 4 4 45 2026 10 14
11 122 2 2 27 730 6 12 47 2210 2 8
13 170 2 4 29 842 2 6 49 2402 4 8
15 226 6 8 31 962 2 4 51 2602 6 10
17 290 2 4 33 1090 6 12 53 2810 2 8
19 362 2 2 35 1226 6 10 55 3026 6 16
7.3 Computing the Solution of the Pell Equation 177
32 − 11 = −2,
42 − 11 = +5.
√
For y = 2, we choose x ≈ 2 11, and we find
62 − 11 · 22 = −8,
72 − 11 · 22 = +5.
√ √
Thus we already have found elements 4 ± 11 and 7 ± 2 11 with the same norm 5.
Which of these generate the same ideal? One possibility of finding the right choice
of signs is simply computing the quotients:
√ √ √ √
7 + 2 11 (7 + 2 11)(4 − 11) 6 + 11
√ = √ √ = ,
4 + 11 (4 + 11)(4 − 11) 5
√ √
which is not an algebraic integer; thus 7 + 2 11 and 4 + 11 generate distinct
prime ideals above 5. On the other hand,
√ √ √ √
7 + 2 11 (7 + 2 11)(4 + 11) 50 + 15 11 √
√ = √ √ = = 10 + 3 11,
4 − 11 (4 + 11)(4 − 11) 5
√
and we have found the nontrivial unit ε = 10 + 3 11. √ √
Here is a more elegant way of verifying that 7 + 2 11 and 4 − 11 generate
the same ideal: We know that these elements have norm 5, and hence they generate
√
prime ideals above 5. There are only two such ideals, namely 51 = (5, 1 + 11)
178 7 The Pell Equation
√ √ √
and 52 = (5, 1 − 11). Thus 11 ≡ −1 mod 51 and 11 ≡ +1 mod 52 , hence
√ √
7 + 2 11 ≡ 0 mod 51 , 7 + 2 11 ≡ 4 mod 52 ,
√ √
4 + 11 ≡ 3 mod 51 , 4 + 11 ≡ 0 mod 52 ,
√ √
and this shows that (7 + 2 11 ) = (4 − 11) = 51 .
Another possibility of finding a √
nontrivial unit is based on the observation
√ that
(2) = 22 is ramified in K. Since 3+ 11 has norm −2, we must have 2 = (3+ 11),
√ √ √ √
and then (2) = 22 = (3 + 11 )2 = (20 +6 11) shows that 20+62 11 = 10 +3 11
is a unit.
Now let us see how this method works for larger values of m, say for m = 3431.
Again we begin by collecting elements with small norms:
α Nα α Nα
√ √
55 + m −2 · 7 · 29 60 + m 132
√ √
56 + m −5 · 59 61 + m 2 · 5 · 29
√ √
57 + m −2 · 7 · 13 62 + m 7 · 59
√ √
58 + m −67 63 + m 2 · 269
√ √
59 + m 2 · 52 64 + m 5 · 7 · 19
We remark in passing that 602 − m = 132 is a square; this implies that m = 602 −
132 = (60 − 13)(60 + 13) = 47 · 73. Fermat’s method of factorization is based on
this idea.
The fact that 3 does not occur among these prime factors √ is explained by the
observation that there is not even an ideal with norm 3 in Q( m) since ( m3 ) = −1.
For the same reason, the primes 11 and 17 do not show up as factors. Instead of
waiting until elements with the same norm occur, we will use an idea that was
already used by Fermat and his contemporaries in their search for numbers whose
sums of divisors are squares or cubes. We factor the elements with small norm into
primes. It is easy to write
√ down a list of prime
√ ideals with small
√ norms; in our√ case,
these are 2 = (2, 1√ + m), 51 = (5, 1 + m), 52 = (5, 1 − m), 7 = (7, 1 + m),
and 72 = (7, 1 − m). Now we factor all elements with small norm that are only
divisible by 2, 5, and 7:
α 2 51 52 71 72
√
1+ m 1 1 0 3 0
√
1− m 1 0 1 0 3
√
41 + m 1 3 0 0 1
√
41 − m 1 0 3 1 0
√
59 + m 1 0 2 0 0
√
59 − m 1 2 0 0 0
7.3 Computing the Solution of the Pell Equation 179
The first line in this table records the prime ideal decomposition
√
(1 + m ) = 21 · 511 · 731 .
is an algebraic integer with the factorization 25 . Since the ideal 2 is ramified, the
element ε = 25 /α 2 must be a unit, and we have
√
ε = 152009690466840 + 2595140740627 m.
Observe that this method gives us not only a nontrivial unit but also something
called a “compact representation” of this unit:
√ √
2(1 + m )2 (41 + m )6
ε= √ .
76 (59 − m )10
Also observe that the prime ideal factorization in quadratic number fields is an
essential component of this method of solving the completely elementary equation
x 2 − my 2 = 1.
After having found a nontrivial unit ε, the question remains how we can check
that this unit is fundamental. So far we only know that ε = ±ηn for some integer
n, where η is the fundamental unit. Since ε > 1, the positive sign must hold, and
we have n ≥ 1. Clearly, ε is not a square as we can read off from its compact
representation. Thus we only have to check whether ε is an n-th power for the values
n = 3, 5, 7, . . . , and the first problem is bounding this exponent.
180 7 The Pell Equation
ε ≈ 304 019 380 933 679.999 999 999 999 996 711
1/ε ≈ 0.000 000 000 000 003 289.
√ √
Now ε +1/ε is an integer; in fact, if we write ε = a +b m, then 1/ε = a −b m =
ε and thus ε + ε = 2a. If ε = η were a fifth power, then η + 1/η = η + η would
1 5
√
Proposition 7.13 If m√ = t 2 − 1 is squarefree for t ≥ 2, then εm = t + m is the
fundamental unit of Z[ m ].
√
also holds if m is not squarefree, but then Z[ m ] is not the ring of
This result √
integers of Q( m).
Since we have already shown that εm is a unit, it only remains to show that
εm is fundamental.
√ But since εm > 1, this unit can only be not fundamental √ if
εm = (r + s m )k for some exponent k ≥ 2, and in that case the coefficient √ of m
in ε would have√ to be strictly greater than 1; for example, we have (r + s m )2 =
r 2 + ms 2 + 2rs m.
The case m = t 2 + 1 is slightly more complicated.
In the examples above, the units are rather small. For finding fields with larger
fundamental units,9 we construct elements α and β with Nα = ±a n and Nβ = a;
using some additional conditions, we can make sure that the quotient ε = α/β n is
integral and therefore a unit. √
For finding fields K = Q( m) containing √ elements with norm n±a , we can
n
γn
ε=
α
√
is a nontrivial unit in Z[ m ].
hence a | (γ ). Moreover,
√
an = (a n , a n−1 α, . . . , aα n−1 , α n ) = (α)(r − m, a n−1 , . . . , α n−1 ) = (α)
9 The class number formula roughly implies that fields with large fundamental units tend to have
small class numbers; constructing families of fields with large fundamental units is therefore
important with respect to Gauss’s conjecture that there are infinitely many real quadratic number
fields with class number 1.
7.4 Parametrized Units 183
case, p is ramified. Since p | Nα = a n , this implies p | (a). But then p | (m) implies
p | (r) contradicting the assumption that r and a are coprime.
Now γ ∈ a and |Nγ | = Na implies a = (γ ). This shows that (α) = (γ )n , and
hence ε is a unit as claimed.
It remains to show that the√unit ε is nontrivial, i.e., that ε = ±1. But ε = ±1 is
equivalent to ±γ n = α = r + m, and this is impossible for n ≥ 2 √ as soon as a > 1.
√ √ √
Clearly, ±(t + u m )2 = r + m is impossible; similarly, ±( t +u2 m )3 = r + m
implies t = u = 1 and m = 5, which in turn is only possible if r = 2 and a = 1.
Now let m = r 2 + a 3 and Q = (−a, 2r, a 2). Setting Q(x, 1) = 1 and solving
for r, we obtain
1 − a 2 + ax 2
r= .
2x
hence
√
a 2 − 2a − 1 √ a2 + 1 − 2 m
γ = a(a − 1) − − m=
2 2
has norm −a.
An explicit calculation yields the unit
a 5 − a 4 + 3a 3 + a 2 + 2 √
ε= + (a 3 − a 2 + 2a) m.
2
The first few examples are given in the following table:
a r m γ ε
√ √
5 7 174 13 − m 1451 + 110√174
√
9 31 1690 41 − m 27379 + 666 √1690
√
13 71 7238 85 − m 174747 + 2054√ 7238
√
17 127 21042 145 − m 675683 + 4658 21042
There are many other choices of r, each of which yields a similar family of units.
Now let Δ = (2a + 1)2 + 4 · 2n for some integer a. Then
1−Δ
(2a + 1)2 − Δ = −4 · 2n , or a2 + a + = −2n .
4
that have an integral point. The simplest possible form is Q = (2, −2a −1, −2n−1),
and the simplest possible integral points are those with y = ±1. A necessary
condition for the existence of an integral solution of Q(x, ±1) = 1, that is, of
Setting this expression equal to (2a + 3)2 quickly yields a = 2n−1 . In this case, the
quadratic equation
2n + 1 ± (2n + 3) 1
x1,2 = , i.e., x1 = − , x2 = 2n−1 + 1.
4 2
Thus we now have Δ = (2n + 3)2 − 8 = (2n + 1)2 + 4 · 2n , and the conic
Q(x, y) = 1 with Q = (2, −2n − 1, −2n−1 ) has the integral point (2n−1 + 1, 1).
Since
γn
ε=− ,
α
which is a unit with norm −1. This family is due to Michael Nyberg [100] and
(independently) to Daniel Shanks [115].
7.5 Factorization Algorithms 185
n Δ ε
√
1 17 4+ Δ
√
2 41 32 + 5 Δ
√
3 113 776 + 73 Δ
√
4 353 71264 + 3793 Δ
√
5 1217 27628256 + 791969 Δ
√
6 4481 46496952832 + 694603585 Δ
The same idea that we have used for computing the fundamental unit of a real
quadratic number field can be applied directly for factoring large integers. As a
modest example,
√ we choose N = 4469 and begin by factoring the integers a 2 − N
for a ≈ 4469 ≈ 67. We keep only those factorizations that involve sufficiently
small prime numbers:
a −1 2 5
62 1 0 4
63 1 2 3
67 0 2 1
The first line in this table encodes the factorization 622 − N = −54 .
Already the Indian mathematician Narayana Pandit (ca. 1340–1400) and later
Pierre Fermat had used a similar method for factoring integers that do not have
small factors. They checked whether any of the numbers a 2 − N for N = 1, 2,
3, . . . is a square number: If a 2 − N = b2 , then we obtain the factorization N =
a 2 − b 2 = (a − b)(a + b).
The essential idea behind the modern factorization methods based on this idea
(see, e.g., [130]) is the observation that we do not need a solution of the equation
a 2 − N = b 2 but only a solution of the congruence a 2 ≡ b2 mod N. Once we have
186 7 The Pell Equation
found such a pair of integers a and b, the numbers gcd(a + b, N) and gcd(a − b, N)
are (possibly trivial) factors of N. Now observe that
Moreover we have 622 ≡ −54 mod N, and hence 632 · 672 ≡ 42 · 622 mod N, and
we find only the trivial factor gcd(63 · 67 − 4 · 62, N) = 1.
By enlarging our factor base, we obtain
a −1 2 5 11 13
62 1 0 4 0 0
63 1 2 3 0 0
67 0 2 1 0 0
71 0 2 0 1 1
72 0 0 1 1 1
83 0 2 1 2 0
Now we see 672 · 722 ≡ 712 · 52 mod N, but this solution gives us once again just
the trivial factorization. We are more lucky with 672 · 112 ≡ 832 mod N since now
gcd(67 · 11 − 83, N) = 109, and in fact we have N = 41 · 109.
Finding such relations is essentially linear algebra: We interpret the exponents in
the factorizations as elements of an F2 -vector space, and then finding squares boils
down to finding linear dependent vectors. The factorization method based on this
idea is called the quadratic sieve.
Factoring Integers with the Pell Equation The computation of the fundamental
unit is, for many values of m, about as difficult as factoring m. Indeed it follows
from x 2 − my 2 = 1 that my 2 = x 2 − 1 = (x − 1)(x + 1), and gcd(m, x − 1) is a
√ factor of m. For m = 91, for example, the fundamental unit is ε =
(possibly trivial)
1574 + 165 91, and we have gcd(91, 1573) = 13. The Bohemian mathematician
Franz von Schafgotsch [128] factored a = 909 191 by solving the Pell equation for
m = 5a = 4 545 955; he obtained
790482741705651738629349656268492900551186678587245833797608742 =
m · 370748861793367258280487230881607848045136342896607634986552 + 1,
7.6 Diophantine Equations 187
and used the Euclidean algorithm to find the greatest common divisor of
79048274170565173862934965626849290055118667858724583379760874 + 1
n 0 1 2 3 4 5 6 7
Vn 2 1 3 4 7 11 18 29
One consequence of the theorem we are about to prove is that the only squares in
this sequence are V1 = 1 and V3 = 4.
We will need the following observations:
Proposition 7.17 For all k, n ∈ Z, we have
Equation (7.9) immediately implies that the numbers V2n cannot be squares.
Finally,
Vn+2 = aVn+1 + Vn
by induction.
In the following, k will always denote an integer not divisible by 3. Thus
3 mod 8 if k is odd,
V2k = Vk2 − 2(−1) ≡
k
7 mod 8 if k is even.
√
a+b m
Theorem 7.18 Let m ≡ √ 5 mod 8 be squarefree, and let ε = 2 denote the
fundamental unit of Q( m), where we assume that a and b are odd. The number
Vn = εn + ε n is a square only in the following cases:
1. n = 1 and a is a square;
2. n = 3 and a(a 2 + 3) is a square.
Using sage, it is possible to show that the elliptic curve y 2 = a(a 2 + 3) has exactly
four integral points, namely (0, 0), (1, 0), (3, 6), and (12, 42). A proof by hand leads
to the Diophantine equation x 4 − 3y 4 = −2, which seems to be difficult to solve
with the methods presented here.
For the proof of Theorem 7.18, we will distinguish several cases.
1. a ≡ 5, 7 mod 8 and n ≡ 3 mod 4 We write n = 2 · 3r k − 1 for an even integer
k not divisible by 3. Then Vk ≡ 3 mod 4 and
by (7.10); hence,
V a
n
= since Vn ≡ a mod Vk ,
Vk Vk
−1 V
since ( Vak ) = ( −V
k
= k
a )
a a
−2
= sinceVk ≡ 2 mod a if k is even.
a
= −1 since a ≡ 5, 7 mod 8.
But this implies our claim that Vn is not a square in this case.
2. a ≡ 5, 7 mod 8 and n ≡ 1 mod 4 Here we write n = −3 + 2 · 3r k for some
even integer n not divisible by 3 and find Vn ≡ −V−3 = V3 mod Vk . Now V3 =
a(a 2 + 3) = a · 4b for some odd integer b; hence,
V a b
3
= since 4
Vk =1
Vk Vk Vk
a −V
k
= since Vk ≡ 3 mod 4
Vk b
a
= since Vk ≡ V2 = a 2 + 2 ≡ −1 mod b.
Vk
−2
= −1 since a
Vk = ( −V
a )=
k
a = −1 as above.
7.6.1 Summary
7.7 Exercises
7.1. Let k be a real quadratic number field; assume that η = α 2 and√Nα < 0 for
√
elements η, α ∈ k × . Show that, as real numbers, Tr α = η − η .
7.2. Show that if m = n2 is a square, then the equation x 2 − my 2 = 1 has only the
trivial solutions x = ±1 in integers.
7.3. Show using Dirichlet’s pigeonhole principle that for each real number x, there
exist infinitely many pairs (p, q) ∈ Z × Z such that |x − pq | < q12 .
Hint: Consider the remainders modulo 1 of the numbers 0, x, 2x, . . . , nx;
these n + 1 remainders lie in the n intervals [0, n1 ), [ n1 , n2 ), . . . , [ n−1
n , 1).
7.4. Find elements
√ with small nontrivial norm in the family of quadratic number
fields Q( m) with m = t 2 − 1 and m = t 2 ± 4.
Use this result for finding examples of real quadratic number fields with
large class number.
7.5. Prove the √
following lemma (Hasse [58]): If m > 0 is not a square and
t +u m √
ε = 2 the fundamental unit of Q( m), and if n is the smallest positive
nonsquare for which x 2 − my 2 = ±4n is solvable in nonzero integers, then
t
, if Nε = −1,
n≥ u2
t −2
u2
, if Nε = +1.
√ √
7.9. Show: If ε = t +u2 m is the fundamental unit of Q( m) for m ≡ 1 mod 8,
then t and u are even. √
7.10. Let m = n2 −√1 for some natural number n ≥ 2. Show that ε√= n + m
is a unit in Z[ m ] and that it is the fundamental unit of Q( m) if m is
squarefree.
More generally, find units for m = n2 ± 1 and m = n2 ± 4. √
7.11. Compute the class number and the fundamental unit of K = Q( 478).
Hint: Consider the prime ideal above (2) and the√prime ideals above 3 and
7. Determine the prime ideal factorizations of (a + 478) for a = 10, 17, 22,
24, and 25, and conclude that K has class number 1.
7.12. The solvability of the Pell equation x 2 − my 2 = 1 for positive nonsquares
m may be formulated as follows: The part of the Euclidean plane defined
by the hyperbolas x 2 − my 2 = 1 and x 2 − my 2 = −1 that contains their
asymptotes contains infinitely many lattice points. In this formulation, the
claim even holds when m is a square; in this case, all integral points lie on the
asymptotes.
Show that the region between the two hyperbolas 2x 2 − 5y 2 = 1 and
2x − 5y 2 = −1 does not contain any lattice point
2
√ except (0, 0).
7.13. Show that the continued fraction expansion of m for m = t 2 − 1 is given by
√
m = [t − 1; 1, 2t − 2, 1, 2t − 2, 1, 2t − 2, . . .] = [t − 1; 1, 2t − 2].
For example,
√ 1
3=1+ .
1
1+
1
2+
1
1+
1
2+
1
1+
2 +...
√
7.14. Show that the continued fraction expansion of m for m = t 2 + 2 is given by
√
m = [t; t, 2t].
where y = x2 .
192 7 The Pell Equation
Show moreover that the equation y 2 = x(x 2 + 3) has the only integral
points (0, 0), (1, ±2), (3, ±6), and (12, ±42) assuming that the equation r 3 −
3s 4 = −2 has the only integral solution r = s = s.
√
7.16. Let p ≡ 3 mod 4 be a prime number, and let ε = t + u p denote the
√
fundamental unit of Q( p). Show that t is even and that t ≡ 1 − ( p2 ) mod 4.
7.17. Show that the function
√ f defined in (7.2) is injective.
√
7.18. Let ε = 2 + 3 be the fundamental unit of Z[ 3 ]. Define the numbers
Vn = εn + ε n for n ≥ 0. Show that these numbers satisfy the recurrence
relation
Also show that V2n = Vn2 − 2 and that the subsequence V2n consists of the
numbers occurring in the Lucas–Lehmer test.
Chapter 8
Catalan’s Equation
In this chapter we will show how to apply the arithmetic of quadratic number fields
to special cases of Catalan’s conjecture.
In 1844, Catalan conjectured that the only powers of (positive) natural numbers
that differ by 1 are 23 = 8 and 32 = 9; in other words, the Diophantine equation
xp − yq = 1
has 32 − 23 = 1 as its only nontrivial solution. This conjecture was proved by Preda
Mihailescu [94] in 2004. His proof uses the work of many other mathematicians and
in particular results about the arithmetic of cyclotomic number fields that are beyond
the scope of the present book.
We will, however, be able to cover the equations x p − y q = 1 with p = 2 or
q = 2 because these cases can be attacked using the arithmetic of quadratic number
fields.
The following proof based on the arithmetic of the ring Z[i] of Gaussian integers
is essentially the one given by Lebesgue. Clearly, y cannot be odd since otherwise
y 2 + 1 ≡ 2 mod 4 cannot be a nontrivial power. Thus y is even.
The exponent m must be odd; in fact, if m is even, then y 2 and x m are consecutive
squares, which implies x = 1 and y = 0, a case we have excluded.
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2021 193
F. Lemmermeyer, Quadratic Number Fields, Springer Undergraduate
Mathematics Series, https://doi.org/10.1007/978-3-030-78652-6_8
194 8 Catalan’s Equation
x m = y 2 + 1 = (y + i)(y − i).
Since the two factors are coprime (recall that y is even), we deduce that there exists
a Gaussian integer a + bi and a unit i k with
Observe that 2+i has odd norm, and hence a and b must have different parity. These
equations imply
Since the expression in the bracket is divisible by b, we must have b = ±1, and
a must be even.
• k = 2r + 1 is odd; then we find
m m−2 2
1 = (−1)r a m − a b + . . . ± mab m−1 . (8.3)
2
m 2 m 4
1− c + c − . . . ± mcm−1 = ±1,
2 4
where c is an even integer. If the right side of this equation is −1, then c2 would
divide 2, which is nonsense. Thus we can subtract 1 from both sides and divide
through by −c2 ; in this way, we obtain
m m 2
− c + . . . ± mcm−3 = 0.
2 4
If m = 3, this equation says 32 = 0, which is nonsense. Thus m ≥ 5, and then m2
must be divisible by 4 since c is even. But m is odd, and hence m2 = m(m−1)
2 is
divisible by 4 if and only if m ≡ 1 mod 8.
8.2 Euler’s Theorem 195
We will now finish the proof by showing that m2 is divisible by a smaller power
of 2 than all the other terms. This will then imply that the sum cannot vanish.
Observe that the factor (m−2)(m−3)
3·4 c2 of
m 2 m (m − 2)(m − 3) 2
c = · c
4 2 3·4
general we have
Now c2r−2 is divisible by 22r−2; for r ≥ 3, the factor r is not divisible by 22r−2
since r < 22r−2 . This implies our claims.
z2 = p4 + 9p2 q 2 + 27q 4
does not have solutions in nonzero integers. The proof is quite involved but only uses
the arithmetic of the ordinary integers. Wakulicz [131] was apparently unaware of
the fact that his proof is essentially that of Euler.
We will now present Paul Monsky’s beautiful solution of the Diophantine equation
y 2 = x 3 +1. Euler first showed that the (2, 3) is the only solution in positive rational
numbers.
Observe that the equations α +β +γ = 0 and αβγ = 2μ3 are homogeneous, and
hence it will be sufficient to prove the result for algebraic integers α, β, γ ∈ Z[ρ].
For proving that Theorem 8.3 implies Theorem 8.2, assume that x 2 = y 3 + 1
for rational numbers x, y. Setting α = 1 − x, β = 1 + x, and γ = −2, we find
α + β + γ = 0 and αβγ = 2(x 2 − 1) = 2y 3 , and hence α = 0 (and x = 1), β = 0
(and x = −1), α = β (and x = 0), α = γ (and x = 3), or β = γ (and x = −3).
Thus we obtain Theorem 8.2, and in fact we have also proved that the only rational
points on the curve x 2 = y 3 + 1 are the integral points given in Theorem 8.2.
Proof Let (α, β, γ ) be a counterexample. Then α, β, and γ are pairwise coprime
in Z[ρ], and so (after a suitable permutation of the numbers) we find that there exist
A1 , B1 , C1 ∈ Z[ρ] with
Since B13 ≡ C13 ≡ 1 mod 2, we may assume according to Proposition 5.10 that
B1 ≡ C1 ≡ 1 mod 2.
Now we set α1 = B1 + C1 , β1 = ρB1 + ρ 2 C1 , and γ1 = ρ 2 B1 + ρC1 . Then
• α1 + β1 + γ1 = B1 (1 + ρ + ρ 2 ) + C1 (1 + ρ + ρ 2 ) = 0.
• α1 β1 γ1 = B13 + C13 = β + γ = −α = 2(−A1 )3 .
• β1 + γ1 = (B1 + C1 )(ρ + ρ 2 ) = −(B1 + C1 ) = 0 since β + γ = −α = 0.
• N(α1 β1 γ1 ) = Nα | N(αβγ ); if we had equality, it would follow that N(β) =
N(γ ) = 1 and thus β, γ = ±1. But this yields β = 1, γ = −1, and α = 0
contradicting our assumption.
Thus (α1 , β1 , γ1 ) is a solution with N(α1 β1 γ1 ) < N(αβγ ), which contradicts our
assumption on the minimality of Nα. This completes the proof.
8.3 The Theorems of Størmer and Ko Chao 197
n n−3 3
un = nt1n−1 u1 + t u1 m + . . . ,
3 1
and hence
k k−3
sn = ktpk−1 sp + t sp A + . . . .
3 p
198 8 Catalan’s Equation
Thus sn is divisible by sp , and hence each prime dividing sp will divide A. But since
p−1 p p−3
sp = pt1 + t A + ..., (8.5)
3 1
p−1
we see that each prime dividing sp divides pt1 . The equation t12 = A + e shows
that t1 and A are coprime, and hence the only prime dividing sp is p. Since sp > 1,
we can write sp = pr for some integer r ≥ 1. Plugging this into (8.5), we obtain
p−1 p p−3
pt1 + t A + . . . = pr . (8.6)
3 1
Since the factors are coprime, this is only possible if 2t1 − 1 = 1, which implies
t1 = 1, r = 1 and then A = 0, which is impossible.
p > 3 Here all the terms in (8.6) except possibly the first one are divisible by p2
p−1
since p | A. Since t1 and A are coprime, the first term pt1 is not divisible by p2 ,
and this implies r = 1. Dividing (8.6) by p yields
Theorem 8.6 (Nagell) If there are positive integers x, y with x 2 − y q = 1 for some
odd prime number q, then 2 | y and q | x.
For proving the first claim, we write the equation in the form y q = x 2 − 1 =
(x − 1)(x + 1). Then gcd(x − 1, x + 1) | 2; if y is odd, then x is even and the two
factors are coprime. But then x − 1 = a q and x + 1 = bq must be q-th powers,
hence bq − a q = 2, which is impossible. This contradiction shows that y must be
even as claimed.
For the second claim, we write the equation in the form
yq + 1
x2 = yq + 1 = (y + 1).
y+1
+1
q
Observe that yy+1 = Qq (y, −1) in the notation of Sect. 3.5. By (3.18), we have
gcd(Qq (y, −1), y + 1) | q. Thus if q y, the factors are coprime and must both be
squares, i.e., there exist natural numbers a and b with
yq + 1
y + 1 = a 2, = b2 , x = ab.
y+1
b 2q + (2a)q x − 1 2 x + 3 2
(b2 + 2a) = b 2q
+ (2a) q
= + 2(x + 1) = .
b2 + 2a 2 2
Solving these quadratic equations yields a contradiction as the solutions are not
integers.
If a − b = ±2, on the other hand, then we find
to the first: simply replace x by −x). In a similar way as above, we now find the
equation 1 = a 3 − 2b 3 .
We will show below that the only integral solutions of this equation are
given by (a, b) = (1, 0) and (−1, −1). These lead to the solutions (x, y) =
(±1, 0), (±3, 2) of the original equation.
We will attack the equation a 3 − 2b 3 = 1 directly by writing it in the form
3√ 3√ 3√
1 = (a − b 2 )(a 2 + 2 ab + 4 b2 )
3√ 3√
and observing that a − b 2 is a unit in the ring Z[ 2 ]. It can be shown that
3√ 3√
R × = −1, 1− 2 , and the claim then boils down to showing that ±(1− 2 )n =
3√ 3√ 3√
a−b 2 implies |n| ≤ 1 (in general, this power will have the form r +s 2 +t 4
for some t = 0). √
The calculations3 will be performed in the pure cubic number field Q( 3 2 ). Its
ring of integers is
√
3
√
3
√
3
Z[ 2 ] = {a + b 2 + c 4 : a, b, c ∈ Z}.
This ring is Euclidean with respect to the absolute value of the norm, and hence it
has unique factorization. The solution of the equation a 3 + 2b3 = 1 that we √ will
give does not use any of this:
√ All we need is the fact that the units of the ring Z[ 3
2]
are generated by −1 and 3 2 − 1, and we will prove this below.
√
−1+ −3
and where ρ = 2 . A straightforward calculation yields
Nα = r 3 + ab2s 3 + a 2 bt 3 − 3abrst.
As in quadratic number fields, units have norm ±1, and integral elements with
norm ±1 are units. In particular, if (x,
√ y) is an integral
√ solution of the Diophantine
equation x 3 + dy 3 = 1, then x + y 3 d is a unit in Z[ 3 d ]. According to a theorem
due to Dirichlet, this ring has a fundamental unit ε with the property that all units
can be written in the form η = ±εn for some n ∈ Z. The nontrivial part of this
theorem claims that the equation
r 3 + ab2s 3 + a 2 bt 3 − 3abrst = 1
has solutions, whereas the assertion that each unit can be written up to sign as a
power of the fundamental
√ unit √
follows as in the real quadratic case by studying
3 3
absolute values |r + s ab2 + t a 2 b |.
Let us determine the√fundamental √ unit in the case we are mainly interested in,
namely for the field Q( 3 2 ). Here 3 2 − 1 is a unit since
√
3
√
3
√
3
( 2 − 1)(1 + 2 + 4) = 1.
√ √ √
If we interpret 3
2 as a real number, then ε = 1 + 3
2 + 3 4 > 1. Now we claim the
following:
√ √ √
Lemma 8.8 Let ε = 1 + 3 2 + 3 4. Then each unit η > 1 in Z[ 3 2 ] has the form
η = εn for an integer n ≥ 1.
Proof The units εn all have value > 1 for n ≥ 1. If there is a unit η > 1 not of this
form, then η lies between two powers of ε:
1 = |η1 η1 η1 | = η1 |η1 |2 ,
this implies
1
√ < |η1 | < 1.
ε
√ √
If we write η1 = r + s 3 2 + t 3 4, then the triangle inequality shows that
and therefore |t| ≤ 1. Going through all possible values then yields the desired
contradiction.
If 0 < η < 1, then 1/η = εn for some n ≥ 1, and hence every positive unit has
the form η = εn for some n ∈ Z. Finally, if η < 0, then −η must be a power of ε.
We have shown the following:
√
Proposition 8.9 Each unit ε ∈ Z[ 3 2 ] can be written uniquely in the form
η = (−1)m εn
√ √
for√m ∈ {0, 1} and n ∈ Z, where ε = 1 + 3
2+ 3
4 is the fundamental unit of
Z[ 3 2 ].
√
This statement remains correct if we replace ε by ε−1 = 3
2 − 1.
Expanding the left hand side using the binomial theorem and comparing coeffi-
cients, we obtain
n n n
x =1−2 +4 −8 +...
3 6 9
n n n
−y = −4 + 42 ∓
1 4 7
n n n
0= −4 + 42 ∓...
2 5 8
n
The last equation implies that 2 must be divisible by 4, which happens if and only
if n ≡ 0, 1 mod 4.
204 8 Catalan’s Equation
nWe now assume n ≥ 2 (and thus n ≥ 4); dividing the last equation through by
2 , we find
n−2 2(−2)k
−1 = .
3k (3k + 1)(3k + 2)
k≥1
n−2 n−2
1+ + + . . . ≡ 0 mod 3.
3 6
m m m
+ + + . . . ≡ 0 mod 3.
0 3 6
Proof If we set
m m m
S0 = + + + ...,
0 3 6
m m m
S1 = + + + ...,
1 4 7
m m m
S2 = + + + ...,
2 5 8
then we find
S0 + S1 + S2 = 2m ≡ (−1)m mod 3
as well as
m m m m−3 m m
S1 = + +... ≡ ·m+ · m + . . . ≡ mS0 mod 3,
0 1 3 4 0 3
m m−1 m m−4
S2 = + +...
1 2 4 5
m m
≡ · (1 − m) + · (1 − m) + . . . ≡ −mS1 + S1 mod 3,
1 4
8.4 Euler’s Equation via Pure Cubic Number Fields 205
and hence
Proposition 8.11 The only integral solutions of the equation (8.7) are n = 0 and
n = 1.
√ √ √
Now if x 3 + 2y 3 =√1, then x + y √3
2 is a unit in Z[ 3 2 ] since N(x + y 3 2 ) =
x 3 + 2y 3. Thus x + y 3 2 = ±(1 − 3 2 )n according to Proposition 8.9, and now
Proposition 8.11 implies that n = 0 or n = 1. Therefore (x, y) = (1, 0) and
(x, y) = (−1, 1) are the only integral solutions of the equation x 3 + 2y 3 = 1.
4 Another often used transliteration is Delone; Delaunay is the French variant. In 1915, Delaunay
published his theorem in Russian, but it became known in the West only through a publication in
French in 1920.
206 8 Catalan’s Equation
Using the action of the Galois group on the class group of cyclotomic number
fields,5 in particular a result called Stickelberger’s Theorem, Mihailescu was able to
strengthen Cassels’ result:
Theorem 8.15 If x p − y q = ±1 for nonzero integers x, y and odd prime exponents
p, q, then p2 | y and q 2 | x.
These are strong conditions; there are only 7 Wieferich pairs known; the smallest
are (2, 1093) and (83, 4871).
The full proof (presented in the books [12] and [113]) uses a more detailed
analysis of Stickelberger’s method. The state of the art before Mihailescu’s proof
is presented in Ribenboim’s book [108], where the full proofs of Nagell’s results on
the equations x 3 − y q = 1 can be found.
8.5.1 Summary
In this chapter we have proved some special cases of Catalan’s conjecture that were
accessible with elementary methods.
8.6 Exercises
m m m
S0 = + + +...
0 3 6
satisfy
1 mod 3 if m ≡ 0, 1, 2 mod 6,
S0 ≡
2 mod 3 if m ≡ 3, 4, 5 mod 6.
m m m m m m
+ + +... = + + + . . . = 2m−1 .
0 2 4 1 3 5
It is quite difficult to determine class numbers, even in the simplest case of quadratic
number fields, for fields with large discriminant. It is, however, possible to make
several rather precise statements concerning the parity of class numbers of quadratic
number fields. The theory behind these statements is called genus theory and goes
back to Gauss, who worked with quadratic forms rather than quadratic number
fields. Genus theory may be generalized to cyclic extensions, and in fact the question
we will answer is how the Galois group of an extension acts on the ideal classes.
In this chapter we will only scratch the surface of genus theory by proving the
ambiguous class number formula.
The essential idea behind the proof is to reduce the action of the Galois group on
ideal classes to the action on ideals, then on principal ideals and finally on elements,
where everything can be done explicitly. Once more we will be studying a difficult
object, namely the class group, by studying homomorphisms into simpler structures.
Let A be a finite abelian group. Then A can be written as a direct sum of cyclic
groups, say A = A1 ⊕ · · · ⊕ An . If A is a finite 2-group, i.e., a group whose order
is a power of 2, then the 2-rank of A is the number n of cyclic components. Since it
is easy to see that A/A2 A1 /A21 ⊕ · · · ⊕ An /A2n , and since Aj /A2j Z/2Z for
cyclic groups Aj , the 2-rank of A is n if and only if #A/A2 = 2n .
The determination of the order of the quotient group Cl(k)/Cl(k)2 , i.e., of the
2-rank of the ideal class group, goes back to Gauss, who solved this problem in
the language of binary quadratic forms. It is almost impossible to miss the central
questions of this theory when studying the operation of the Galois group G = {1, σ }
of k/Q on the ideal class group.
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2021 209
F. Lemmermeyer, Quadratic Number Fields, Springer Undergraduate
Mathematics Series, https://doi.org/10.1007/978-3-030-78652-6_9
210 9 Ambiguous Ideal Classes and Quadratic Reciprocity
For an ideal class c = [a] we set cσ = [aσ ]; of course we have to show that this
action is well defined (see Exercise 9.1). Clearly an ideal class c and its conjugate
cσ always have the same order. Moreover, since c · cσ = [a][aσ ] = [(Na)] = (1) is
the principal class, cσ = c−1 is always the inverse class of c.
We call an ideal class c ∈ Cl(k) ambiguous if cσ = c. Similarly, an ideal a is
called ambiguous, if aσ = a.
√ √ √
Lemma 9.1 The nontrivial automorphism σ : m → − m of k = Q( m ) acts
as −1 on the class group Cl(k). In particular, an ideal class c is ambiguous if and
only if c2 = 1.
Proof We have already seen that a1+σ = aa = (Na) is principal, and that this
implies that cσ = c−1 .
If c is ambiguous, i.e., if c = cσ , then c2 = c1+σ = 1. Conversely, if c2 = 1,
then cσ = c−1 = c.
If k is a number field with class√ number 2, then the nontrivial ideal class c is
always ambiguous.√For k = Q( −5 ), the nontrivial ideal class is generated √ byσthe
prime ideal
√ (2, 1+ −5 );
√ since this ideal is ambiguous because of (2, 1+ −5
√) =
(2, 1− −5 ) =√ (2, 1+ −5 ). The√ideal class c is also √ generated by
√ (3, 1+ −5 ),
and here (3, 1 + −5 )σ = (3, 1 − −5 ) = (3, 1 + −5 ). In Q( −5 ), each ideal
class contains an ambiguous ideal (the principal class contains the ambiguous ideal
(1)), as well as many non-ambiguous ideals.
For ideal class groups of order 4, the number of ambiguous classes determines
the structure. If Cl(k) Z/2Z ⊕ Z/2Z is elementary abelian, then the number
of ambiguous ideal classes is 4 since in this case, every ideal class is ambiguous.
If Cl(k) Z/4Z, on the other hand, then the two classes with order 4 are not
ambiguous, whereas the class with order 2 and the principal class are ambiguous,
Thus there are only 2 ambiguous ideal classes in this case.
If an ideal a is ambiguous, then so is the ideal √ class c = [a] it generates; the
converse is not true in general: Since k = Q( 34 ) has class number 2, the ideal
class c of order 2 is ambiguous. This ideal class is not generated by an ambiguous
ideal for the simple reason that all ambiguous ideals in k are principal. As we will
see below, each ambiguous ideal is a product of ramified √ prime ideals √ and ideals
generated
√ by ordinary√ integers. But in k we have (2, 34 ) = (6 + 34 ) and
(17, 34 ) = (17 + 3 34 ).
The ambiguous ideal classes form a group Am(k), in which the ideal classes
generated by ambiguous ideals form a subgroup, namely the group Amst (k) of
strongly ambiguous ideal classes. Our goal is determining the structure of the group
Am(k).
This will allow us to deduce information about the elements of order 2 in the class
group. In fact, since cσ = c−1 we have c1−σ = c2 , and therefore the homomorphism
c → c1−σ maps the class group Cl(k) of a quadratic number field k to the group
Cl(k)1−σ = Cl(k)2 of ideal classes that are squares, and this homomorphism is
onto. Its kernel consists of the ideal classes c with c1−σ = 1, i.e., of the ambiguous
9.1 Ambiguous Ideal Classes 211
ideal classes. This implies that the order of the group Am(k) of ambiguous ideal
classes is equal to the order of Cl(k)/Cl(k)2 :
Proposition 9.2 Let k be a quadratic number field. Then
#Cl(k)/Cl(k)2 = # Am(k),
and, in particular, the class number of k is odd if and only if the number of
ambiguous ideal classes is 1.
Actually, since both groups are elementary abelian, equal cardinality implies
isomorphism. The last claim follows from the observation that squaring is an
isomorphism on a finite group if and only if it has odd order.
The calculations below are far easier to digest by using exact sequences. A short
sequence of abelian groups A, B, C consists of group homomorphisms α : A −→ B
and β : B −→ C, which are composed as follows:
α β (9.1)
1 −−−−→ A −−−−→ B −−−−→ C −−−−→ 1.
1−σ
1 −−−−→ Am(k) −−−−→ Cl(k) −−−−→ Cl(k)2 −−−−→ 1.
212 9 Ambiguous Ideal Classes and Quadratic Reciprocity
The definitions of principal ideals and the ideal class group Cl(k) of a number field
k provide us with two exact sequences, namely
where Ek is the unit group, Hk the group of (fractional) principal ideals = (0), and
Ik the group of all fractional ideals = (0).
The group Amst (k) of strongly ambiguous ideal classes is, by definition, equal to
Amst (k) = AH /H A/A ∩ H , where A denotes the group of nonzero ambiguous
ideals and H the group of nonzero principal ideals. Clearly A∩H = H G is the group
of ambiguous principal ideals, and so we have Amst (k) A/H G . This observation
gives us the exact sequence
ι
1 −−−−→ H G −−−−→ A −−−−→ Amst (k) −−−−→ 1.
The group P of all fractional ideals (a) with a ∈ Q× is a subgroup of both H G and
A; this allows us to modify the exact sequence slightly and turn it into
ι π
1 −−−−→ H G /P −−−−→ A/P −−−−→ Amst (k) −−−−→ 1. (9.2)
N
1 −−−−→ (k × )1−σ −−−−→ k × −−−−→ k × ,
1 For integral ideals, the statement is trivial since then Na = (1) is equivalent to a = (1).
214 9 Ambiguous Ideal Classes and Quadratic Reciprocity
Now we claim
Proposition 9.5 There is an exact sequence
λ
1 −−−−→ E 1−σ −−−−→ E[N] −−−−→ H G /P −−−−→ 1.
Proof The map E 1−σ −→ E[N] is the inclusion map: Each unit ε1−σ has norm
1 and thus is an element of E[N]. For constructing λ : E[N] −→ H G /P assume
that ε ∈ E[N], i.e., Nε = 1. By Hilbert’s Theorem 90 there is an α ∈ k × such that
ε = α 1−σ ; clearly (α) ∈ H G since (α)σ = (α σ ) = (εα) = (α). The map ε → (α)
is not well defined, however, since with α each element αa for any a ∈ Q× has
the property (αa)1−σ = ε. For this reason we set λ(ε) = (α)P , and this map now
is well defined. Clearly ε ∈ ker λ if and only if λ(ε) = P ; this is equivalent to
(α) = (a), i.e., to α = aη for some unit η. This implies ε = α 1−σ = η1−σ , which
shows that ker λ = E 1−σ .
The surjectivity of λ is clear: If (α) is ambiguous, then (α)σ = (α) and thus
εα = α for some unit ε, hence ε = α 1−σ .
σ
The content of this proposition may also be expressed by the isomorphism
E[N]/E 1−σ H G /P .
2 Those who are familiar with the first principles of cohomology get the sequence for free: The
trivial sequence
in which H denotes the group of nonzero fractional principal ideals, provides the long exact
sequence
1 −→ E G −→ (k × )G −→ H G −→ H 1 (G, E) −→ H 1 (G, k × ),
from which the claim follows using Hilbert’s Theorem 90 (H 1 (G, k × ) = 1), the periodicity
H 1 (G, A) H −1 (G, A) for cyclic groups G, as well as (k × )G = Q× , E G = {±1} and
Q× /E G P .
9.2 The Ambiguous Class Number Formula 215
E[N]/E 1−σ = E/E 2 Z/2Z. If Δ > 0, then let ε denote the fundamental unit. If
Nε = +1, then again E[N] = E and E 1−σ = E 2 , hence E[N]/E 1−σ = E/E 2 =
−1, ε/ε2 (Z/2Z)2 . If Nε = −1, on the other hand, then E[N] = −1, ε2
and E 1−σ = E 2 = ε2 , hence E[N]/E 1−σ = E/E 2 Z/2Z.
Lemma 9.6 Let k be a quadratic number field whose unit group E is generated by
the fundamental unit ε (and −1). Then
⎧
⎪
⎨Z/2Z,
⎪ if d < 0,
−1
H (G, E) = E[N]/E 1−σ
Z/2Z, if d > 0, Nε = −1,
⎪
⎪
⎩(Z/2Z)2 , if d > 0, Nε = +1.
It remains to determine the order of A/P . To this end we will use the following
lemma.
Lemma 9.7 An ideal a is ambiguous if and only if a is the product of ramified prime
ideals and an ideal (a) with a ∈ Q× . More exactly we have
A/P (Z/2Z)t ,
where t is the number of primes that ramify in k/Q, in other words, the number of
distinct prime factors of the discriminant of k.
Proof We may assume that a is an integral ideal (otherwise we multiply it by a
suitable rational integer). Among all decompositions a = (a)b with an integral
ideal b we pick one in which a ∈ N is maximal.
Let p denote a prime ideal with pσ = p; if p divides b, then we must have pσ | b.
In fact by applying σ to p | a we see that pσ | bσ = b. Thus (p) | b, where
(p) = ppσ , which contradicts the maximality of a. This shows that b is not divisible
by a split prime ideal.
For the same reason, b is not divisible by any inert prime ideal (p). Thus b is a
product of ramified prime ideals. If p is such a prime ideal, then p2 = (p), and the
maximality of a implies that we can write a uniquely in the form
e
a = (a) pj j ,
where pj runs through the ramified prime ideals and where ej ∈ {0, 1}. Now we set
e
φ : A/P −→ (Z/2Z)t : (a) pjj −→ (e1 , . . . , et )
Thus it remains only to determine the difference between the group of ambiguous
ideal classes Am(k) and that of strictly ambiguous ideal classes Amst (k):
Proposition 9.9 There is an exact sequence
μ
1 −→ Amst (k) −→ Am(k) −→ (EQ ∩ Nk × )/NEk −→ 1.
In particular, Am(k) = Amst (k) except when −1 is the norm of an element, but not
of a unit. In this case, # Am(k) = 2 · # Amst (k).
Proof Let c = [a] be ambiguous. Then aσ ∼ a, hence aσ = αa. Taking norms
yields (Nα) = (1), that is Nα = ±1 ∈ EQ ∩ Nk × . We set μ(c) = Nα · NEk
and claim that μ is well defined. In fact if we start from c = [b], then b = γ a, and
bσ = γ σ aσ = γ σ αa = γ σ −1 αb shows that N(γ σ −1 α) · NEk = Nα · NEk since
elements of the form γ σ −1 have norm 1. Thus μ is well defined.
If c ∈ ker μ, then Nα = Nη, d.h. N(αη) = 1. According to Hilbert’s Theorem
90, we have αη = β 1−σ , and now aσ = αa implies (βa)σ = (β)a. Thus b = βa
is an ambiguous ideal equivalent to a, and therefore c = [b] is strongly ambiguous.
Conversely, strongly ambiguous ideal classes are clearly contained in ker μ.
In order to prove the surjectivity of μ we have to show that −1NEk lies in the
image of μ if −1√ is the norm of an element from k. Assume therefore that Nα = −1
for α = x + y m. Then x 2 − my 2 = −1, hence −1 is a quadratic residue modulo
each odd prime divisor p of m. We know from elementary number theory (or from
the arithmetic of Gaussian integers) that this holds if and only if m = a 2 + b2
is a sum of √ two squares; here we may assume that a is odd. Now we verify that
a = (a, b + m ) generates an ambiguous ideal class c = [a], and that μ(c) = −1.
In fact we have
√ √
a2 = (a 2 , ab + a m, b2 + 2b m + m)
√ √
= (a 2 , ab + a m, 2b2 + 2b m )
√ √ √ √
= (a 2 , a(b + m ), 2b(b + m )) = (a 2 , b + m ) = (b + m )
9.2 The Ambiguous Class Number Formula 217
√ √
√ of gcd(a ,σ2b) = 1 and (b + m )(b − m ) = b2 − m = −a 2. Thus√a2 =
because 2
Examples
Criterion.
Similarly we can prove
Theorem 9.13 (Second Supplementary Law) For all odd prime numbers p, the
following assertions are equivalent:
(1) ( p2 ) = +1, i.e., the congruence x 2 ≡ 2 mod p is solvable.
(2) We have p = e2 − 2f 2 for integers e, f ∈ Z.
(3) We have p ≡ ±1 mod 8.
The equivalence of (1) and (3) can also be expressed by the equation
2 p2 −1
= (−1) 8 .
p
√
Proof (1) ⇒ (2): If ( p2 ) = +1, then p splits in Q( 2 ), and we have ±p =
√ √
x 2 − 2y 2 ; multiplying x + y 2, if necessary, by the unit 1 + 2 we can make sure
that p = e2 − 2y 2 .
(2) ⇒ (3): Reduction modulo 8 yields p ≡ ±1 mod 8 in all cases.
√
(3) ⇒ (1): Let h denote the class number of k = Q( p ), which is odd by
Corollary 9.11. If p ≡ ±1 mod 8, then 2 splits in k/Q, hence 2Ok = pp for prime
√
ideals p, p . Since ph = 12 (x + y p ) is a principal ideal, taking the norm yields
x 2 − py 2 = ±4 · 2h . Reduction modulo p shows that ±2h and thus ±2 is a quadratic
residue modulo p; the claim now follows from the first supplementary law.
9.4 Exercises 219
The quadratic reciprocity law for odd prime numbers is the content of the
following theorem.
Theorem 9.14 (Quadratic Reciprocity Law) If p and q are odd primes, then
p q p−1 q−1
= (−1) 2 2 .
q p
9.3.1 Summary
In this chapter we have proved the ambiguous class number formula for quadratic
number fields, and derived the quadratic reciprocity as a corollary.
9.4 Exercises
9.1. Show that the operation [a]σ = [aσ ] on the ideal class group of a quadratic
number field is well defined, i.e., that [a] √
= [b] implies [aσ ] = [bσ ]. √
√ the ideal class of√order 2 in Q( 10 ) contains the ideals (2, 10 ),
9.2. Show that
(3, 1 + 10 ) and (5, 1 + 10 ). Which of these ideals are ambiguous?
220 9 Ambiguous Ideal Classes and Quadratic Reciprocity
√
9.3. Let p ≡ 5 mod 8 be prime.√ Show that the ideal class Q( 2p ) generated by
the ambiguous ideal (2, 2p ) has order 2. √
9.4. Let p ≡ 1 mod 8 be √ prime. Show that the ambiguous ideal (2, 2p ) is
principal in k = Q( 2p ) if and only if the norm of the fundamental unit
in k is +1.
write 2p = a 2 + b2 with a > b > 0 and show that the ideal
In this case √
a = (a, b + m ) generates an ambiguous ideal class of order 2.
9.5. Show that if k is a quadratic number field with class number 2, then Am(k) =
Cl(k).
9.6. Show that if k is a quadratic number field with odd class number, then
Am(k) = 1.
9.7. Show: If A and B are subgroups of an abelian group, then AB/B A/A∩B.
Hint: Show that A ∩ B is the kernel of the natural map A −→ AB/B.
9.8. Show that the inclusion ι : H G /P −→ A/P in (9.2) is injective and that
the map π : A/P −→ Amst (k) defined by π(aP ) = [a] is well defined and
surjective. Also show that ker π = im ι.
9.9. (O. Taussky) Solve the Pythagorean equation x 2 + y 2 = z2 using Hilbert’s
Theorem 90. Hint: α = x+yi z ∈ Q(i) satisfies the equation Nα = 1. Write
α = m−ni and rationalize the denominator.
m+ni
√
9.16. Let k = Q( m ) be a quadratic number field with fundamental unit εm . Show:
If Nεm =√ +1, then there is an ambiguous principal ideal a = (α) with
a = (1), ( m ).
Hint: By Hilbert’s Theorem 90 we have ε = α /α.
√
9.17. Show that the norm of the fundamental unit ε of Q( p ) is negative if p ≡
1 mod 4 is prime.
Hint: Use the preceding exercise.
9.18. The idea behind Kummer’s ideal numbers was the construction of ring
homomorphisms Ok −→ Fq of the ring of integers of number fields into
finite fields. Restrict these homomorphisms to the multiplicative group, that
is, consider the group homomorphism ψ : Ok× −→ F× q . Find examples of
real quadratic number fields and primes q for which this homomorphism is
trivial, or where it is surjective.
9.19. Let p be a prime number such that ( 10 p ) = +1. Then there exist two
possibilities:
1. ( p2 ) = ( p5 ) = +1; in this case p = x 2 − 10y 2 .
2. ( p2 ) = ( p5 ) = −1; in this case ±2p = X2 + 10y 2 and, using X = 2x,
±p = 2x 2 − 5y 2.
√ √
Show that this implies that each element x + y 10 ∈ Z[ 10 ] can be written √
uniquely
√ as √ a product of a unit and irreducible elements of the form a + b 10
or c 2 + d 5. √
9.20. Let q ≡ 3 mod 8 be a prime number. Show that the class number of Q( 2q )
is odd and deduce that the equation 2x 2 − qy 2 = −1 is solvable. Deduce that
( q2 ) = −1.
Chapter 10
Quadratic Gauss Sums
At the heart of the notion of a Dirichlet character is the idea of studying algebraic
structures by constructing (and investigating) homomorphisms into simpler struc-
tures. Characters map groups to groups of complex numbers, and the multiplicative
group of complex numbers is simple in the sense that its finite subgroups are cyclic.
An example of a character is the Legendre symbol ( p· ), which is a group
homomorphism from the coprime residue class group (Z/pZ)× , where p is an odd
prime number, to the subgroup {−1, +1} of the complex numbers. More generally,
a Dirichlet character χ defined modulo m is a group homomorphism
χ : (Z/mZ)× −→ C×
assigning complex numbers to all the coprime residue classes modulo m. Since
(Z/mZ)× is a finite abelian group of order n = φ(m), the Theorem of Euler-Fermat
a n ≡ 1 mod m implies 1 = χ(1) = χ(a n ) = χ(a)n , from which we can read off
that the image of a Dirichlet character is an n-th root of unity. The Dirichlet character
χ is called a quadratic Dirichlet character if χ only attains the values +1 and −1.
Example There exist three nontrivial Dirichlet characters defined modulo 8. For
positive representatives a of the coprime residue classes modulo 8 these may be
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2021 223
F. Lemmermeyer, Quadratic Number Fields, Springer Undergraduate
Mathematics Series, https://doi.org/10.1007/978-3-030-78652-6_10
224 10 Quadratic Gauss Sums
a mod 8 1 3 5 7
χ−4 +1 −1 +1 −1
χ8 +1 −1 −1 +1
χ−8 +1 +1 −1 −1
This extension clearly has the property that χ(a + m) = χ(a) for all natural
numbers a.
Example The Dirichlet character χ−4 defined modulo 4 by χ−4 (1 + 4Z) = 1 and
χ−4 (3 + 4Z) = −1 may be extended to all natural numbers by setting χ−4 (2n) = 0
and χ−4 (2n + 1) = (−1)n . For odd integers a ∈ N we then have χ−4 (a) = ( −4
a )=
( −1
a ).
The extension to negative integers, when needed, must be done with care. For
the Dirichlet character χ−4 we have χ(−1 + 4Z) = χ(3 + 4Z) = −1, whereas
( −1
−1 ) = +1.
becomes a little bit more impressive when stated via the commutativity of the
diagram
ψ8
(Z/8Z)× −−−−→ {−1, +1}
⏐ ⏐
⏐
π
⏐
id
χ−4
(Z/4Z)× −−−−→ {−1, +1}.
The group X(A) is called the character group of A. Our goal is the determination of
the algebraic structure of the character group; in fact we will find that the character
group of a finite abelian group is isomorphic to the group itself, which means that
we can read off all algebraic properties of such groups from their character group.
Lemma 10.1 For finite abelian groups A and B we have
which is surjective by what we have already said, and whose kernel consists of all
characters χ of A ⊕ B for which we have χ(1, b) = χ(a, 1) = 1. But this implies
χ(a, b) = χ(a, 1) · χ(1, b) = 1 for all a ∈ A and b ∈ B, hence χ is the trivial
character, and λ is injective.
Next we show
Proposition 10.3 For each finite abelian group A we have X2 (A) A/A2 .
The proof is similar to the one above: It is sufficient to prove the claim for cyclic
groups. If A = g and if the order n of A (and g) is odd, then A/A2 = 1 and
X2 (A) = 1, since 1 = χ(1) = χ(g)n = χ(g)n and χ(g) = ±1 imply χ(g) = 1. If
n is even, then χ0 (g) = 1 and χ1 (g) = −1 define the only two possible quadratic
characters on A, and then A/A2 Z/2Z and X2 (A) Z/2Z.
If B is a subgroup of the finite abelian group A, then each character χ0 on A/B
defines a character χ on A via
χ(a) = χ0 (aB).
With A = (Z/2m Z)× we thus have A/A2 Z/2Z ⊕ Z/2Z, hence there are four
quadratic characters modulo 2m ≥ 8; apart from the trivial character these are the
characters χ−4 , χ8 and χ−8 defined above; here χ−4 is a primitive character modulo
4, the other two are primitive characters modulo 8.
Lemma 10.7 There exist exactly four quadratic characters defined modulo 2m for
m ≥ 3, namely the trivial character, as well as the characters χ−4 with conductor 4
and the characters χ8 and χ−8 with conductor 8.
Thus primitive quadratic Dirichlet characters exist only modulo 4, 8 and for
odd prime numbers p. Because of Lemma 10.1 there is exactly one primitive
quadratic Dirichlet character modulo N, where N is a product of such moduli. These
integers N are exactly those positive integers that are, up to sign, discriminants
of quadratic number fields: N = |Δ|. The decomposition of Δ = Δ1 · · · Δt into
prime discriminants corresponds to a decomposition χ = χ1 · · · χt of a primitive
quadratic character χ into primitive quadratic characters defined modulo Nj =
|Δj |. According to Lemma 10.4 χ is primitive if and only if the components χj
are primitive. Thus we have the following
Theorem 10.8 There exists a bijection between primitive quadratic Dirichlet char-
acters and discriminants of quadratic number fields.
The fact that there is a bijection between the primitive quadratic Dirichlet
characters and quadratic number fields suggests the question whether this bijection
may be extended from quadratic to arbitrary Dirichlet characters. The answer is yes,
and the primitive Dirichlet characters correspond to cyclotomic number fields.
Proposition 10.9 Let N = |Δ| be a natural number. If χ is a primitive quadratic
Dirichlet character defined modulo N, then
+1 for Δ > 0,
χ(−1) =
−1 for Δ < 0.
In particular, Δ = χ(−1) · N.
and since the claim is also true for the three primitive quadratic Dirichlet characters
defined modulo 4 and 8, the proposition is now completely proved.
10.2 Pell Forms 229
Quite often in mathematics there is a deep conceptual reason why bijections such
as the one in Theorem 10.8 exist. In our case, the existence of the bijection would
be explained by the fact that for quadratic number fields with discriminant Δ there
exists a Dirichlet character χ with conductor N = |Δ|. This is indeed true: The
Kronecker symbol ( Δ p ) introduced in Sect. 3.2, which describes the splitting of
primes p in the quadratic number field with discriminant Δ (see Thm. 6.14), defines
a “Kronecker character” κΔ (a) = ( Δ a ) for all natural numbers a ≥ 1, which assigns
the value +1 or −1 to all integers a coprime to Δ. It is, however, not at all obvious
that κ is a Dirichlet character, i.e., that there exists a modulus m with
Δ Δ
= for all k ≥ 0.
a a + km
It is the Modularity Theorem for Kronecker characters that guarantees the existence
of such a modulus m:
of a Dirichlet character χ defined modulo N. To this end we set χ(a) = 0 for all
integers a that are not coprime to N. Clearly the geometric series majorizes fχ (q),
hence this series converges absolutely for all complex numbers q with |q| < 1. Let
us now compute fχ (q) for the two discriminants Δ = −4 and Δ = 8:
• Δ = −4: For κ(p) = ( −4
p ) we obtain
q
fχ (q) = q − q 3 + q 5 − q 7 + . . . = q(1 − q 2 + q 4 − q 6 + . . .) = .
1 + q2
This is a rational function with poles at the primitive 4-th roots of unity. In
addition, we find
1 1
q
q
fχ = = = fχ (q),
q 1+ 1 q2 + 1
q2
fχ (q) = q − q 3 − q 5 + q 7 + q 9 − . . . = (q − q 3 − q 5 + q 7 )(1 + q 8 + q 16 + . . .)
q − q3 − q5 + q7 q − q3
= = 4
1−q 8 q +1
because of
Here we obtain
1 q − q3
1 1
q3 − q
fχ = = = −fχ (q).
q 1
+1 q4 + 1
q4
∞ ∞ N
fχ (q) = χ(n)q n = χ(n)q n q kN
n=1 k=0 n=1
N Fekχ (q)
= χ(n)q n (1 + q N + q 2N + . . .) = ,
1 − qN
n=1
10.2 Pell Forms 231
where
N−1
Fekχ (q) = χ(n)q n
n=1
denotes the Fekete polynomial for the Dirichlet character χ with conductor N (see
Table 10.1).
Fekχ (q)
fχ (q) =
1 − qN
that can be extended, except for possible poles at the N-th roots of unity, to the
whole complex plane.
Fekete polynomials first occurred explicitly in Dirichlet’s proof of the theorem
on primes in arithmetic progression, according to which there exist infinitely
many primes in each coprime residue class modulo some integer N. Implicitly,
Fekete polynomials already showed up in Gauss’s sixth proof [44] of the quadratic
reciprocity law; later Cauchy, Jacobi and Eisenstein published variants of this proof
in which they replaced x by a p-th root of unity.
Yet Fekete polynomials have remained mathematical wallflowers; one of the few
articles that underline the importance of Fekete polynomials for the arithmetic of
quadratic number fields is Ayoub [6].
The periodicity of χ allowed us to write the generating function fχ as a rational
function; but rational functions can be extended to meromorphic functions on the
whole complex plane, and the only possible poles are at the N-th roots of unity.
Our first task is the determination of the poles of Pell forms fχ , which we know
can only occur at the N-th roots of unity. A few calculations for Pell forms with
small conductor show that fχ does not have poles at each N-th root of unity. If we
232 10 Quadratic Gauss Sums
factor numerator and denominator of the rational function fχ and cancel as many
factors as possible, then we find, for small values of N:
q − q2 q(1 − q) q
f−3 (q) = = = ,
1 − q3 (1 − q)(1 + q + q 2 ) 1 + q + q2
q − q3 q(1 − q 2 ) q
f−4 (q) = = = ,
1 − q4 (1 − q 2 )(1 + q 2 ) 1 + q2
q − q2 − q3 + q4 q − q3
f5 (q) = = ,
1−q 5 1 + q + q2 + q3 + q4
Already these few examples suggest that the poles of the function fχ are exactly
at the primitive N-th roots of unity. Here an N-th root of unity ζ is called primitive
if the equation ζ m = 1 holds for m = N, but not for any smaller value 1 ≤ m < N.
For proving this claim we proceed as in our derivation of Binet’s formulas: We
determine the partial fraction decomposition of fχ . To this end we set
N−1
ak
fχ (q) = ;
ζk − q
k=0
then a simple application of Euler’s formulas (2.4) shows that the coefficients ak are
given by
Fekχ (ζ k ) ζ k Fekχ (ζ k )
ak = =− .
−Nζ k(N−1) N
The expression
N−1
Fekχ (ζ k ) = χ(n)ζ kn =: τk (χ)
n=1
It is clear that fχ (q) has a pole in q = ζ k if and only if τk (χ) = 0. The question of
the location of the poles of Pell forms thus boils down to determining the values of
k for which the quadratic Gauss sums τk (χ) vanish.
10.2 Pell Forms 233
It turns out that the quadratic Gauss sums τk are, up to a root of unity, equal to
τ = τ1 (χ). In fact we have:
Proposition 10.12 For primitive Dirichlet characters defined modulo N and all
natural numbers k we have
where we have used that b = ka runs through all coprime residue classes of
(Z/NZ)× when a does.
If gcd(k, N) = d > 1, on the other hand, then we write N = dn and k = ds for
coprime integers n and s. We first claim that there exists an integer b ≡ 1 mod n
with χ(b) = 1. Since χ is primitive, χ is not trivial on the kernel of the projection
map (Z/NZ)× −→ (Z/nZ)× , and this is exactly what we have claimed.
Next we have k ≡ bk mod N since bk − k = k(b − 1) ≡ 0 mod dn; in particular,
we have ζ k = ζ bk . Now we get
N−1 N−1
χ(b)τk (χ) = χ(ab)ζ ka = χ(ab)ζ kab = τk (χ),
a=1 a=1
N−1 N−1
τ χ(k) τ χ(k)
fχ (q) = −q · = . (10.4)
N q − ζk N 1 − qζ −k
k=1 k=1
Observe that this implies τ = 0 since we already know that fχ (q) is a nontrivial
rational function.
Before we continue, let us give two simple examples of Gauss sums.
• Consider the Dirichlet character χ(a) = ( a2 ) defined modulo 8. If ζ denotes a
primitive 8th root of unity, then
• Now let χ(a) = ( a5 ) denote the quadratic Dirichlet character defined modulo 5,
and let ζ denote a primitive 5th root of unity. Then
τ1 = ζ − ζ 2 − ζ 3 + ζ 4
and thus
τ12 = ζ 2 + ζ 4 + ζ 6 + ζ 8 − 2ζ 3 − 2ζ 4 + 2ζ 5 + 2ζ 5 − 2ζ 6 − 2ζ 7
= ζ + ζ 2 + ζ 3 + ζ 4 4 − 2ζ − 2ζ 2 − 2ζ 3 − 2ζ 4
= −1 + 4 + 2 = 5,
τ2 = Δ (10.5)
N−1
τ χ(k)
fχ (q) = . (10.6)
N 1 − ζ −k q
k=1
Expanding the left side into a power series we get n≥1 χ(n)q n , and on the right
side we obtain, when we develop the fractions into geometric series,
χ(k)
= χ(k)(1 + ζ −k q + ζ −2k q 2 + . . .) = χ(k) + χ(k)ζ −k q + . . . .
1 − ζ −k q
N N
τ τ τ
1= χ(k)ζ −k = χ(k)ζ k = · τ,
N N N
k=1 k=1
√
hence τ τ = N and |τ | = N.
10.3 Fekete Polynomials 235
For proving the second claim we observe that, in the case of quadratic characters,
we have χ = χ, hence
N N
τ= χ(k)ζ −k = χ(k)ζ −k = χ(−1)τ.
k=1 k=1
τ 2 = χ(−1)N = Δ.
Since τ by definition is an element of Q(ζN ), this implies that each quadratic number
field is a subfield of some cyclotomic number field Q(ζN ), and in fact that we can
choose N = |Δ|.
Gauss’s sixth proof of the quadratic reciprocity law is today usually presented in
the form given by Jacobi and Cauchy, who used the basic arithmetic of cyclotomic
number fields. These proofs have the advantage of being very slick and short. Here
we will present Gauss’s original sixth proof of the quadratic reciprocity law in such
a way that the role of the Fekete polynomials becomes clearly visible. The necessary
changes are mainly of a cosmetic nature. Apart from Fekete polynomials, Gauss also
uses the cyclotomic polynomial
xp − 1
Φp (x) = 1 + x + x 2 + . . . + x p−1 = .
x−1
This polynomial is known to be irreducible over the rationals, as can be seen most
easily using a method due1 to Schönemann and Eisenstein: one shows that Φp (x+1)
is an “Eisenstein polynomial,” i.e., that the it has the form
1 See [26].
236 10 Quadratic Gauss Sums
In fact we have
Φp (x n ) x np − 1 x − 1
= n · .
Φp (x) x − 1 xp − 1
Φp (x n ) x np − 1 x mn − 1 + x − x hp+1
= n ·
Φp (x) x −1 xp − 1
x np − 1 x mn − 1 x(x np − 1) x hp − 1
= · − · p ,
xp − 1 xn − 1 xn − 1 x −1
Φp (x n ) − p = 1 + x n + x 2n + . . . + x n(p−1) − (1 + 1 + . . . + 1)
= x n − 1 + x 2n − 1 + . . . + x n(p−1) − 1.
xn − 1 x mp − 1 x mp − 1 x p − 1
= = p ·
x−1 x−1 x −1 x−1
−1
p
we deduce that it is divisible by Φp (x) = xx−1 .
In the following, let Fekp (x) be the Fekete polynomial for the primitive quadratic
Dirichlet character with odd prime conductor p.
Lemma 10.16 For every natural number 1 ≤ q < p, the polynomial
q
Fekp (x q ) − Fekp (x)
p
implies that
q
τq (χ) = τ.
p
p−1 p−1
Fekp (x q ) = χp (a)x aq = χp (q) χp (aq)x aq .
a=1 a=1
Thus if a runs through a coprime system of residue classes modulo p, then so does
aq. Each exponent aq is thus congruent modulo p to exactly one number c with
1 ≤ c < p, i.e., we have aq = c + ka p for an integer ka depending on a. This
implies
hence
p−1 p−1
Fekp (x q ) = χp (q) χp (aq)x aq = χp (q) χp (c)x c mod (x p − 1)
a=1 c=1
as claimed.
Next Gauss turns his attention to the polynomial Fekp (x)2 . Clearly
p−1
k k
Fekp (x)2 = x Fekp (x).
p
k=1
p−1
h kh
p−1
h
p−1 p−1
Fekp (x)2 ≡ xk x = x kh+k
p p
k=1 h=1 h=1 k=1
p−1
h
p−1
h
= (Φp (x h+1 ) − 1) = Φp (x h+1 ) mod Φp (x),
p p
h=1 h=1
238 10 Quadratic Gauss Sums
where we have used that ( ph ) = 0. Using Lemma 10.15 we now obtain
p − 1 −1
Fekp (x)2 ≡ Φp (x p ) ≡ p mod Φp (x).
p p
We have proved
Proposition 10.17 Fekete polynomials satisfy the congruence
−1
Fekp (x)2 ≡ p mod Φp (x).
p
If we set x = ζ for a primitive p-th root of unity ζ , then the congruence above turns
into the equation
−1
τ2 = p τ,
that we have proved above. Instead of working with double congruences modulo
q and modulo Φp (X) we write the congruences as equations—another possibility
would be working modulo q in cyclotomic number fields. The congruences above
then become the following equations:
q−1 q−1
Fekp (x)q = Fekp (x)2 2 Fekp (x) = p∗ + Φp (x)A(x) 2 Fekp (x)
q−1
= (p∗ ) 2 Fekp (x) + Φp (x)A1 (x) Fekp (x)
p∗
= Fekp (x) + qh Fekp (x) + Φp (x)A1 (x) Fekp (x),
q
as well as
q
Fekp (x)q = Fekp (x q ) + qB(x) = Fekp (x) + Φp (x)C(x) + qB(x).
p
Thus
q p∗
Fekp (x) − Fekp (x) = qR(x) + Φp (x)S(x)
p q
for polynomials R, S ∈ Z[x]. Our goal is showing that the polynomial on the left
hand side is 0. To this end we first write
with ε = ( −1
p ) and some polynomial F (x) of degree ≤ p − 2.
Next F (0) = Fekp (0) − εΦp (0) = −ε. Thus we have
q p∗
− F (x) = qR(x) + Φp (x)T (x).
p q
The polynomial on the left side has degree ≤ p − 2 and is divisible by Φp (x). Since
Φp is irreducible, this is only possible if the polynomial vanishes:
q p∗
− F (x) − qr(x) = 0.
p q
240 10 Quadratic Gauss Sums
Plugging in x = 0 yields
q p∗ q p∗
− F (0) − qr(0) = −ε − − qr(0) = 0,
p q p q
In this last section we will sketch possible extensions of our investigations. We have
already seen that the factor 1 − q of the Fekete polynomial may be canceled with
the corresponding factor in 1 − q N . This fact allows us to determine the value fχ (1)
(see Table 10.2).
The fact that fχ (1) = 0 for Δ > 0 follows immediately from the functional
equation of fχ (see Exercise 10.9). The values for negative discriminants are
mysterious; if we extend the table far enough, then it turns out that, for negative
√
discriminants Δ < −3, the value fχ (1) is related to the class number of Q( Δ ) in
a very simple and striking way:
Theorem 10.18 We have fχ (1) = 0 if and only if the unit group of the quadratic
number field with discriminant Δ has rank 1, i.e., if and only if the Pell equation
T 2 − ΔU 2 = 4 has a nontrivial solution.
q − q3
5 5 +1 0
1 + q + q2 + q3 + q4
q + 2q 2 + q 3 + 2q 4 + q 5
−7 7 −1 1
1 + q + q2 + q3 + q4 + q5 + q6
q − q3
8 8 +1 0
1 + q4
q + q3
−8 8 −1 1
1 + q4
q − q3
12 12 +1 0
1 − q2 + q4
q − q 3 + 2q 4 − q 5 + q 7
−15 15 +1 2
1 − q + q3 − q4 + q5 − q7 + q8
10.4 The Analytic Class Number Formula 241
2h h (#Cl(K) : #Cl(Q))
fχ (1) = = = , (10.11)
w w/2 (#WK : #WQ )
where h √= #Cl(K) denotes the class number, w the number of roots of unity in
K = Q( Δ ), and WK the group of roots of unity in K.
Observe that Cl(Q) = 1 since Z has unique factorization, and that WQ = {±1},
hence #WQ = 2.
The expression on the right shows that the formula fχ (1) = 2h w is actually a
relative class number for the quadratic extension K/Q and beautifully explains the
occurrence of the factor 2 in the numerator.
The investigation of the generating functions of Kronecker and Dirichlet charac-
ters has led us into rather deep waters. Although the terms in (10.11) all are closely
related to the arithmetic of number fields, the natural proof of this equation uses
analytic methods.
In this proof, a central role is played by Dirichlet L-series, which Dirichlet had
also used for proving his theorem on primes in arithmetic progression. L-series
provide a second possibility of writing down a generating function for Dirichlet
characters χ, which is different from the Pell form, which is a power series. We set
∞
L(s, χ) = χ(n)n−s
n=1
and then show that this series converges absolutely for all s > 1.
By manipulating divergent series without fear and evaluating the L-series L(s, χ)
at places where it is not defined we find
1 1 1 π
L(1, χ) = 1 − + − +... = .
3 5 7 4
242 10 Quadratic Gauss Sums
There is a connection between this Leibniz series and Pell forms: Clearly
q3 q5 q7
Fχ (q) = q − + − +...
3 5 7
is a primitive of
1 fχ (q)
1 − q2 + q4 − q6 + . . . = = ,
1 + q2 q
and this can be done for arbitrary Dirichlet characters since we have
1 1 1
fχ (q) χ(n)
dq = χ(n)q n−1 dq = χ(n) q n−1 dq = = L(1, χ),
0 q 0 n≥1 0 n
n≥1 n≥1
where we once more point out that interchanging the order of taking limits requires
a proof.
For χ = χ8 , for example, we obtain
1 1 1 1 1 − q2
L(1, χ) = 1 − − + +... = dq.
3 5 7 0 1 + q4
These class number formulas, which underline again the central importance of
the Pell forms fχ and the Fekete polynomials for the arithmetic of quadratic number
fields, are due to Dirichlet, who proved them for quadratic forms rather than for
1 f (q)
quadratic number fields. The integral representation of L(1, χ) = 0 χq dq may
be transformed via the partial fraction decomposition of fχ into a finite sum, which
has a certain charm, but is not very well suited for the computation of class numbers
except for small discriminants.
Dirichlet’s main motivation for working out the class number formula was the
obvious corollary that L(1, χ) = 0 for all quadratic Dirichlet characters. Since
the corresponding claim for Dirichlet characters that attain nonreal values may be
proved rather easily, Dirichlet obtained that L(1, χ) = 0 for all Dirichlet characters
modulo N. This in turn quickly implies (by an idea going back to Euler) that for
any pair of coprime integers a and N there exist infinitely many prime numbers
p with p ≡ a mod N. This is Dirichlet’s famous theorem on primes in arithmetic
progression.
For a proof of these results we refer the reader to the wonderful books by
Scharlau and Opolka [111] and by Zagier [134].
10.5 Modularity
α 3 = ζ 3 + 3ζ + 3ζ −1 + ζ −3 = ζ 3 + ζ 6 + 3α = 3α − 1,
hence α is a root of f . Thus the roots of f generate a cubic subfield of Q(ζ ), and
this implies that f is a polynomial with an abelian Galois group.
2 In the theory of complex multiplication there exists something called “the modular polynomial.”
10.5 Modularity 245
σ (α) = σ (ζ + ζ −1 ) = ζ 2 + ζ −2 = α 2 − 2.
This map permutes the roots of f and makes the fact that f has an abelian Galois
group explicit.
The classification of modular polynomials is achieved by class field theory; the
result is
Theorem 10.20 A polynomial is modular if and only if its Galois group is abelian.
The Galois group of a polynomial is abelian if and only if its splitting field (up to
isomorphism, the smallest extension of Q containing all the roots of f ) is abelian.
The “finitely many exceptions” have to do with the choice of f . Clearly f (x) =
x 2 − x − 1 and g(x) = x 2 − 5 have the same splitting behavior for every odd prime
since 4f (x) = (2x − 1)2 − 5 = g(2x − 1). √ But f is irreducible modulo 2 (which
corresponds to the fact that 2 is inert in Q( 5 )), yet x 2 − 5 ≡ (x + 1)2 mod 5. Note
that disc f = 5 and disc g = 20.
Let us now call the Pell form fκ of a Kronecker symbol κ = ( Δ· ) modular if the
following conditions are satisfied:
• There exist polynomials A, B ∈ Z[q], with B monic, such that fκ (q) = ± A(q)
B(q) ;
• fκ satisfies a functional equation of the form fκ ( q1 ) = ±fκ (q) for some choice
of the sign.
We say that fκ is strongly modular if we can choose B(q) = q N − 1 for N = |Δ|.
The following theorem tells us that the modularity of the Kronecker symbol is a
consequence of analytic properties of the associated Pell forms:
Theorem 10.24 The modularity of fκ implies the modularity of the Kronecker
symbol κ(p) = ( Δ
p ).
A(q)
Assume that fκ (q) = B(q) is rational and satisfies the functional equation
= ±fκ (q). Since fκ converges absolutely inside the unit circle, fκ does not
fκ ( q1 )
have any poles there. By the functional equation, it cannot have any poles outside
the unit circle. Thus the rationality and the functional equation imply that fκ has all
its poles on the unit circle.
Since B(q) is monic, the poles of fκ must be algebraic integers. Thus if fκ has a
pole in q = ζ , then ζ and all of its conjugates lie on the unit circle. Now we invoke
the following result due to Kronecker:
Proposition 10.25 (Kronecker) If η is an algebraic integer with the property that
all of its conjugates lie on the unit circle, then η is a root of unity.
Let η be a root of a monic polynomial with degree n. By Dirichlet’s pigeonhole
principle, there exist natural numbers r < s such that |ηs − ηr | < 2−n . The
conjugates ηjk of ηk all lie on the unit circle, hence |ηjs − ηjr | ≤ 2. Since the norm
of an algebraic number is the product of its conjugates (see Exercise 2.46), we have
|N(ηs − ηr )| < 2−n 2n−1 = 12 . Since η is an algebraic integer, its norm is a rational
integer, and we conclude that its norm is 0. But then ηs = ηr , hence ηs−r = 1, and
this implies that η is a root of unity.
10.5 Modularity 247
where η1k , . . . , ηm
k are the conjugates of ηk . Clearly
Since the absolute values of the ηjk are = 1, this implies that
m m
|am−1 | ≤ m, |am−2 | ≤ , . . . , |ak | ≤ , . . . , |a1 | ≤ m, |a0 | = 1.
2 k
These bounds show that there are only finitely many such polynomials, hence there
must exist natural numbers r < s with ηr = ηs , and this implies as above that η is
a root of unity.
Now we can finish the proof of Theorem 10.24. Since we have just shown that
the poles of fκ are roots of unity, we can choose an integer N such that the poles
are roots of x N − 1. Then we can write
C(q)
fκ (q) = ,
1 − qN
C(q)
fκ (q) = = C(q) + C(q)q N + C(q)q 2N + . . .
1 − qN
with the definition of fκ (q) we find that κ(m) only depends on the value of m
modulo N, and this finally shows that κ is modular.
248 10 Quadratic Gauss Sums
Fig. 10.1 The elliptic curve y 2 = x 3 − x over the reals and the group law
10.6 Modularity of Elliptic Curves 249
E(F5 ) = {∞, (0, 0), (1, 0), (2, ±1), (3, ±2), (−1, 0)}.
Thus N5 = #E(F5) = 8.
Similar calculations for other odd primes yield the following table:
p 3 5 7 11 13 17 19 23 29 31 37 41
Np 4 8 8 12 8 16 20 24 40 32 40 32
p 5 13 17 29 37 41
ap −2 6 2 −10 −2 10
The pattern becomes visible if we write these primes p as sums of two squares: In
fact, if p = a 2 + b2 , where a ≡ 1 mod 4, then ap = −2a. We have already proved
this result in Theorem 3.31.
was proved by Helmut Hasse in the 1930s. Hasse’s theorem can be interpreted as a
Riemann conjecture for the zeta function attached to the elliptic curve over Fp . For
the history of this result, see [110].
250 10 Quadratic Gauss Sums
It is rather difficult to explain the modularity of elliptic curves from scratch.3 As for
Pell conics, the main content is the existence of a modulus N for each elliptic curve
E such that the values ap = p + 1 − Np , where Np = #E(Fp ) is the number of
Fp -rational points on the elliptic curve E, is determined by what is called a modular
form on Γ0 (N).
Consider the following example, which is essentially already contained in
Shimura’s work. Let E : y 2 − y = x 3 − x 2 be an elliptic curve; its discriminant is
Δ = −11. Consider the function
∞
∞
f (q) = q (1 − q n )2 (1 − q 11n )2 = an q n .
n=1 n=1
This is a cusp form of weight 2 living on Γ0 (11). What this means is that this
function satisfies a functional equation of the following form: Set q = e2πiz and
interpret f as a function of the complex variable z. Then
az + b
f = (cz + d)2 f (z)
cz + d
for all matrices ac db with a, b, c, d ∈ Z, determinant ad − bc = 1 and c ≡
0 mod 11. Since 10 11 ∈ Γ0 (N) for every N, we have in particular f (z+1) = f (z);
but this was clear from the fact that f is a function of q = e2πiz .
A simple calculation yields
f (q) = q − 2q 2 − q 3 + 2q 4 + q 5 + 2q 6 − 2q 7 − 2q 9 − 2q 10 + q 11
− 2q 12 + 4q 13 + 4q 14 − q 15 − 4q 16 − 2q 17 + 4q 18 + 2q 20 + · · ·
p 2 3 5 7 11 13 17 19
ap −2 −1 1 −2 −1 4 −2 0
The pattern now is clear: For all prime numbers p not dividing the modulus N = 11,
the coefficient ap in the power series expansion of f (q) coincides with the number
ap = p + 1 − Np that determines the number of Fp -rational points on E.
3 I highly recommend the books [4, 5] to everyone interested in learning more about the big picture.
10.6 Modularity of Elliptic Curves 251
The first few primes p for which p | Tp−1 are p = 19, 47, 53, 103 and 163. The
prime p = 19 is exceptional, and the others are represented by x 2 + 11y 2 :
The Tribonacci numbers can be expressed explicitly using the roots of the cubic
polynomial f (x) = x 3 −x 2 −x −1; the corresponding elliptic curve E2 : y 2 = x 3 −
x 2 − x − 1 has discriminant Δ2 = −26 · 11 and is a “quadratic twist” of the elliptic
curve E above. In fact, consider the elliptic curve E : y 2 −y = x 3 −x 2 . Multiplying
through by 4 and completing the square shows that (2y − 1)2 = 4x 3 − 4x 2 + 1,
hence
√
[ −2(2y − 1)]2 = −8x 3 + 8x 2 − 2 = (−2x)3 + 2(−2x)2 − 2.
√
Setting Y = −2(2y − 1) and ξ = −2x we obtain the equation Y 2 = ξ 3 + 2ξ 2 − 2.
Finally, setting ξ = X − 1 gives E2 : Y 2 = X3 − X2 − X − 1.
This implies that both curves have the same number Fp -rational points if ( −2
p )=
+1; if ( −2
p ) = −1, on the other hand, then the number of points is Np = p + 1 − ap
on one and Np = p +1+ap on the other curve. In all cases, ap (E) = ( −2p )·ap (E2 ).
This in turn implies that the condition ap ≡ 0 mod 2 in Theorem 10.26 is equivalent
to the character sum
p−1 3
x − x2 − x − 1
S=
p
x=0
being odd. I do not know whether there is a link between the value of S and the
representations of p in the form x 2 + 11y 2.
These examples are only the tip of a massive iceberg. Following the breakthrough
by Wiles it was shown that every elliptic curve defined over Q is modular. The
modularity theorems for quadratic number fields (and actually for all abelian
number fields) and for elliptic curves are pieces of a large area of conjectures due to
Robert Langlands.
10.7 Exercises
10.1. Let A and B be finite abelian groups. Show that the set A ⊕ B of all pairs
(a, b) with a ∈ A and b ∈ B becomes a group by setting (a1 , b1 )·(a2 , b2 ) =
(a1 a2 , b1 b2 ).
This is a purely formal exercise. If 1A and 1B denote the neutral elements of
A and B, then (1A , 1B ) is the neutral element of A ⊕B. The inverse element
of (a, b) is (a −1 , b −1 ), and associativity is directly inherited from A and B.
10.2. Let A and B be finite abelian groups. Show that each subgroup of A ⊕ B has
the form A1 ⊕ B1 , where A1 and B1 are subgroups of A and B, respectively.
10.3. Let χ1 and χ2 denote two Dirichlet characters defined modulo N1 and
modulo N2 , respectively, and assume that N1 and N2 are coprime. Then
10.10. Prove without using Euler’s formulas (2.4) that the partial fraction decompo-
sition of the Pell form for the primitive Dirichlet character χ defined modulo
4 is given by
q 1 1 1
fχ (q) = = − .
1 + q2 2i 1 − qi 1 + qi
10.11. Compute the partial fraction decomposition of the Pell form for the primitive
Dirichlet character modulo 8 defined by χ(n) = ( n2 ).
10.12. Since Fekχ (q) is divisible by q, we can determine the partial fraction
decomposition of
Fekχ (q)
.
q(1 − q N )
N−1
Fekχ (q) 1 Fekχ (ζ k )
=− .
q(1 − q )
N N q − ζk
k=1
p−1
Fekp (x) ≡ nm x n mod p. (10.12)
n=1
−1 mod p.
10.15. Show that the cyclotomic polynomial Φp (x) is irreducible for prime values
of p.
Hint: Consider Φp (x + 1).
10.16. Count the number Np of solutions of the congruence y 2 = x 3 + 1 mod p
for various prime numbers p. Observe that multiplying x by a cube root of
unity ρ does not change the equation y 2 = x 3 + 1, so this elliptic curve
has complex multiplication by ρ. Use this information for writing down a
conjecture for Np .
10.17. Let x be a p-th root of unity, i.e., assume that x p = 1. Show that
p−1
Fekp (x) =
n
Jn (a)x a ,
a=0
254 10 Quadratic Gauss Sums
where Jn (a) is the character sum we have studied at the end of Sect. 3.4
defined by
t t · · · t
1 2 n
Jn (a) = ,
p
A.1 Pari
Pari1 is very easy to use, and the basic version is installed within seconds.
(Observe that
Mod(3^(N-1),N)
is doing something completely different, even if the result is the same); the
answer
Mod(95591506202441271281, 147573952589676412927)
[761838257287 1].
1 pari was developed at the University of Bordeaux by Henri Cohen and his colleagues.
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2021 255
F. Lemmermeyer, Quadratic Number Fields, Springer Undergraduate
Mathematics Series, https://doi.org/10.1007/978-3-030-78652-6
256 A Computing with Pari and Sage
The command lift transforms the residue class Mod(2,127) into the integer
2; its only purpose is to produce a nicer output. Other methods for programming
loops may be found by typing ?11; the command ?while produces an explanation
of how to program a while-loop.
The result 0 in the above computation shows that 261 − 1 is prime. Putting a loop
around the commands we obtain a program that finds the small Mersenne primes:
{forstep(p=3,2000,2,if(isprime(p),
N=2^p-1; S=Mod(4,N);
for(n=1,p-2,S=S^2-2);
if(S,,print(p)))) }
p = 3, 5, 7, 13, 17, 19, 31, 61, 89, 107, 127, 521, 607, 1279.
You can obtain a generator of the ring of integers of the quadratic number field with
discriminant d = 12 by typing
w = quadgen(12)
√ √
produces the result ε = 2 + w = 2 + 1+2 21 = 5+ 21
2 . The norm and the trace of
the fundamental unit are computed via
eps = quadunit(21); print(norm(eps)," ",trace(eps))
Class numbers and the class groups of quadratic number fields are obtained
easily:
quadclassunit(-84)[1]
√
yields the class number 4 of Q( −21 ), and
quadclassunit(-84)[2]
gives the structure of the class group: [2, 2] denotes the abelian group Z/2Z⊕Z/2Z.
Residue classes modulo rational primes may be realized via
w = quadgen(21)*Mod(1,7)
using
(3-2*w)^7
√
then shows that (3 − 2 1+2 21 )7 ≡ 2 mod 7.
For other calculations, e.g., with ideals we have to define a number field. This is
accomplished by
nf = bnfinit(x^2-79);
This command computes the basic invariants of the quadratic number field
√
Q( 79 ), namely an integral basis, the discriminant, the fundamental unit, and the
class group. The semicolon at the end tells pari not to print the results of these
calculations. We have access to the individual results by commands such as
nf.zk
Here [1, x] denotes the integral basis {1, x}, where x is the root of the polynomial
x 2 − 79.
We get the ideal class group of this number field with
nf.clgp
the expression
[3, [3], [[3, 2; 0, 1]]]
gives the class number 3, the structure of the class group ([3] denotes the cyclic
group of order 3), and an ideal (here it is a prime ideal q above 3) that generates the
class group. Using
idealfactor(nf,5)
we choose the first prime ideal, which we will denote by p in the following. The
command
bnfisprincipal(nf,p)
then yields
[[2]~, [19/9, -2/9]~]
which means that the prime ideal p lies in the ideal class q2 , where q is the prime
ideal above (3) found above, and that the principal ideal pq−2 is generated by 19
9 −
√ √
2
79, i.e., that we have 9p = (19 − 2 79 )q 2 ist.
9 √
The prime ideal decomposition of 19 − 2 79 may be controlled by
idealfactor(nf,19-2*x)
A.2 Sage
In pari, only a few very basic functions for doing arithmetic with elliptic functions
are implemented. For computing on elliptic curves it is a good idea to familiarize
yourself with sage.2 As a matter of fact, sage is also more comfortable for doing
arithmetic in number fields, and you can access pari from within sage.
and
K.units()
2 This program was developed under William Stein and uses work done for other computer algebra
√
yields the fundamental unit 9 79 − 80. The command
K.integral_basis()
√
explains itself. The ideal I = [5, 2 + 79] ] is defined by
I = K.ideal([5,2+a])
For everything else we refer the readers to several introductions to sage that can
be found quickly on the world wide web.
In order to find the integral solutions of the equation y 2 = x 3 −26, we first define
the elliptic curve
E = EllipticCurve([0,-26])
The command
E.rank()
then shows that the group of rational points on this elliptic curve has rank 2; with
E.gens()
corresponding to the affine points (x, y) = (3, 1) and (35 : 207). Finally,
E.integral_points() ,
shows that these are the only integral solutions of the equation y 2 = x 3 − 26. By
copying
for a in [1..30]:
E = EllipticCurve([0,-a])
print(a, E.integral_points())
260 A Computing with Pari and Sage
into the sage window and pressing Enter you obtain the following table with all
integral solutions of the equations y 2 = x 3 − d for 1 ≤ d ≤ 30:
d y2 = x3 − d d y2 = x3 − d d y2 = x3 − d
1 (1, 0) 11 (3, 4), (15, 58) 21
2 (3, 5) 12 22
3 13 (17, 70) 23 (3, 2)
4 (2, 2), (5, 11) 14 24
5 15 (4, 7) 25 (5, 10)
6 16 26 (3, 1), (35, 207)
7 (2, 1), (32, 181) 17 27 (3, 0)
8 (2, 0) 18 (3, 3) 28 (4, 6), (8, 22), (37, 225)
9 19 (7, 18) 29
10 20 (6, 14) 30
In order to test the truth of Theorem 6.20 we run the following program in sage
for t in [2,4,..20]:
d = 3*t^2+1
E = EllipticCurve([0,-d])
K.<a> = QuadraticField(-d)
print(d, K.class_number(), E.integral_points())
and obtain
d h Solutions d h Solutions
13 2 (17, 70) 433 12 (13, 42), (577, 13860)
49 1 (65, 524) 589 16 (785, 21994)
109 6 (5, 4), (145, 1746) 769 20 (1025, 32816)
193 4 (257, 4120) 973 12 (1297, 46710)
301 8 (401, 8030) 1201 16 (1601, 64060)
The limits of sage become visible by extending the loop until t = 20: For
t = 14 and d = 2353, sage does not produce an answer. The reason for this
behavior is that either the generators of the group of rational points on E are huge,
or that the elliptic curve has a nontrivial Tate–Shafarevich group.
By looking at the table one is led to the conjecture that the class numbers of
√
Q( m ) for all m = 12t 2 + 1 are divisible by 3 whenever t is a multiple of 3. This is
indeed true, but quite likely this is very difficult to prove without class field theory.
Appendix B
Solutions
Chapter 1
1.1. Let (m2 −n2 , 2mn, m2 +n2 ) be the first triple. If we choose m even and n odd,
then m2 + n2 is odd, and we can find integers r and s such that m2 + n2 =
r 2 − s 2 ; for example, we can set r − s = 1 and r + s = m2 + n2 , that is
r = 12 (m2 + n2 + 1) and s = 12 (m2 + n2 − 1). With these values, we have
as desired.
1.2. Write a = m2 −n2 , b = 2mn and c = m2 +n2 . For making b = 2mn a square
it is only necessary to set m = a 2 and n = 2b2. For making c = m2 + n2 a
square, write m2 + n2 = p2 and set m = r 2 − s 2 , n = 2rs and p = r 2 + s 2 .
1.3. For solving 2a 2 = c2 − b2 = (c − b)(c + b) it is sufficient to set c − b =
4s 2 and c + b = 2r 2 . Then 2a 2 = 8r 2 s 2 shows that a = 2rs. Moreover,
2c = (c + b) + (c − b) = 2r 2 + 4s 2 , which gives us c = r 2 + 2s 2 . Similarly,
b = r 2 − 2s 2 .
For a geometric parametrization of the ellipse x 2 + 2y 2 = 1, consider the
lines through P (−1, 0). These have the equation y = m(x + 1); intersecting
these lines with the ellipse we get x 2 + 2m2 (x + 1)2 = 1, which is equivalent
to
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2021 261
F. Lemmermeyer, Quadratic Number Fields, Springer Undergraduate
Mathematics Series, https://doi.org/10.1007/978-3-030-78652-6
262 B Solutions
1 + 2m2 2m
x= and y = m(x + 1) = .
1 − 2m2 1 − 2m2
Writing m = s
r and simplifying the resulting expressions we get
r 2 + 2s 2 2rs
x= and y = ,
r 2 − 2s 2 r2 − 2s 2
Intersecting this line with the unit sphere results in a quadratic equation with
the solutions t1 = 0 and t2 = a 2 +b2a2 +c2 , which then provides us with the
points
a 2 − b2 − c2 2ab 2ac
x= , y= , z= .
a 2 + b2 + c2 a 2 + b2 + c2 a 2 + b2 + c2
2
1.6. Implicit differentiation yields 2yy = 3x 2 , hence y = 3x
2y . Thus the tangent
in the point (u, v) on Bachet’s curve y = x − k has the equation y =
2 3
3u2
2v (x − u) + v. Intersecting this line with Bachet’s curve gives rise to the
equation
3u2 2
x3 − (x − u) + v − k = 0.
2v
This equation has a double root x = u; if we denote the third root by x3 , then
the sum of the roots 2u + x3 is the negative coefficient of x 2 in this equation:
9u4
2u + x3 = − .
4v 2
B Solutions 263
9u4 u4 + 8ku
x3 = − 2u = .
4v 2 4v 2
2
Plugging this value of x = x3 into the line equation y = 3u 2v (x − u) + v we
also get (1.6).
1.7. Applying the duplication formulas to (3, 5) on the elliptic curve y 2 + 2 = x 3
we find
34 + 8 · 2 · 3 129 383
x1 = = 2 y1 = ,
4 · 52 10 103
2340922881 113259286337292
x2 = y2 =
76602 76603
1.8. We find
24 + 8 · 4 · 2 −26 + 20 · 4 · 23 + 8 · 42
x1 = = 5, y1 = = 11.
4 · 22 8 · 23
Two more applications of the duplication formula yield
785 5497
x2 = , y2 = − ,
222 223
3227836439105 5799120182710629023
x3 = , y3 = − .
2418682 2418683
leads to the same result. The same approach works for equations of the form
x12 + x22 + . . . + xn2 = y 2 .
1.11. We have 24 · 13 = 172 + 23 · 12 and 24 = 12 + 23 · 12 , yet 13 cannot be
represented by the form x 2 + 23y 2.
1.12. Let p1 , . . . , pt be distinct primes of the form 4n+1, and set N = 4p12 · · · pt2 +
1. Clearly N is not divisible by any pj . If N is prime, then we have found a
new prime of the form 4n + 1. If N is composite, any of its prime divisors q
divides N, which is a sum of coprime squares; but then q has the form 4N +1.
If q1 , . . . , qt are primes of the form 4N −1, set N3 = 4q1 · · · qt −1. Clearly N
is not divisible by any qj . If N is prime, then we are done; if not, then at least
one of the prime factors q of N has the form 4n + 3: in fact, if all of them had
the form 4n + 1, then the product N would also have this form. Since q = pj ,
this completes the proof.
1.13. The discriminant of the quadratic equation x 4 − 2ty 2 = t 2 in t is D = 4(x 4 +
y 4 ). If x 4 −2y 2 = 1 has a rational solution with y, then the quadratic equation
in t must have a rational solution, so its discriminant is a square; this leads
to y 4 + x 4 = w2 , which implies that x = 0 or y = 0. But then xy = 0 in
x 4 − 2y 2 = 1, which is only possible if y = 0. This proves the claim.
1.14. Assume that y 2 = x 3 − dx has a nontrivial rational solution. Then the
Diophantine equation dxt 2 + y 2 t − x 3 = 0 has a rational solution with
t = 1, hence its discriminant D = y 4 + 4dx 4 must be a square, hence
y 4 + 4dx 4 = w2 must have a rational solution.
Observe that the trivial solution x = 0 of the second equation corresponds to
the trivial solution x = 0 of y 2 = x 3 − dx.
1.15. Assume first that p = 8n + 1. Then
some nonzero integer k with |k| < p. Cancelling common divisors of c and
d from our equation we may assume that c and d are coprime. In particular,
c is odd, and this implies c2 − 2d 2 ≡ ±1 mod 8. Since p ≡ ±3 mod 8 we
B Solutions 265
c2 + 2d 2 k
= ·p
e + 2f
2 2 q
is an integer of the form c12 + 2d12 . In this way we can eliminate all prime
factors of k and end up with a representation of p in the form p = c2 + 2d 2 .
Note that
k
· p = c12 + 2d12
q
as desired.
266 B Solutions
√
1.17. From√21 = 12 + 5 · 42 = 42 + 5 · 12 we read off the elements 1 + 2 −5 and
4 + −5 with norm 21. Squaring these elements we obtain
√ √ √ √
(1 + 2 −5 )2 = −19 + 4 5 and (4 + −5 )2 = 11 + 8 −5.
y 2 + B 2 = x 3 − A3 = (x − A)(x 2 + Ax + A2 ).
(x, y) = (−2, 3), (−1, 4), (2, 5), (49), (8, 23), (43, 282), (52, 375), (5234, 378661).
1.26. We have
a b a b 2 2 2 2
= p2 + q 2 + r 2 + s 2 and =p +q +r +s .
c d c d
Thus the left hand side is the product of two sums of four squares.
Now
= (rp + sq + pr − qs )2 + (rq − sp − ps − qr )2 .
Thus the determinant on the right hand side is a sum of four squares.
1.27. Multiplying through by a shows that it is sufficient to consider equations of
the form x 2 +ay 2 = bz2 . Assume now that (ξ, η, ζ ) is a nontrivial solution of
this equation (such solutions exist by the Local–Global Principle if and only
if the conic has nontrivial points in every completion of Q). Then multiplying
bz2 = x 2 + ay 2 through by bζ 2 gives
Similarly,
or
Thus “Euler’s trick” provides us with three different factorizations of the form
AB = mC 2 , which we have collected in the following table:
A B C m
I bζ z + aηy + ξ x bζ z − aηy − ξ x ξy − ηx a
II bζ z + aηy + ξ x bζ z − aηy + ξ x ξ z + ζ x b
III bζ z + aηy + ξ x bζ z + aηy − ξ x ηz + ζy ab
Chapter 2
√
2.1. Clearly we can add, subtract, and multiply numbers of the form a + b m,
where a, b ∈ Q, in the obvious way. For example,
√ √ √
(a + b m )(c + d m ) = ac + mbd + (ad + bc) m.
B Solutions 269
The first claim is now proved by induction, the induction step being
For diagonalizing the matrix T we determine its eigenvalues. These are the
roots of the characteristic polynomial
det(T − λI ) = λ2 − λ − 1 = 0,
√ √
which gives λ1 = 1+2 5 = ω and λ2 = 1+2 5 = ω .
The corresponding eigenvectors are v1 = ω1 and v2 = ω1 : In fact
0 1 ω ω
ω = ω+1 = ω2 = ω ω .
1 1
11
Therefore the diagonalizing matrix S is given by S = ω1 ω1 . We now find
0 1 1 1
S −1 = ω−ω 1
· −ω 1 , hence D = S −1 T S = 1 · −ω 1
ω −1 ω −1 1 1 ω ω =
ω−ω
ω−ω ·
1 2+ω
0 −2−ω
0
= ω0 ω0 . Since T n = (S −1 DS)n = S −1 D n S we now
270 B Solutions
find
Un Un+1 ωn 0
Tn = = S −1 S,
Un+1 Un+2 0 ωn
ωn − ω n
Un = .
ω−ω
√
2.5. Let α = a+b2 m be an algebraic integer in some quadratic number field
√
Q( m ). Since the binomial coefficients pk are divisible by p for 1 ≤ k ≤
p − 1 (since p divides the numerator p!, but not the denominator k!(p − k)!),
we find
√ p
a p + bp m
αp ≡ mod p.
2p
By Fermat’s Little Theorem we have a p ≡ a, bp ≡ b and 2p ≡ 2 mod p.
√ p p−1 √ √
Moreover, m = m 2 m ≡ ( m p ) m mod p by Euler’s criterion. This
shows that
a+b√m
a + b √m p mod p p ) = +1,
if ( m
≡ 2√
a−b m
p ) = −1.
if ( m
2 mod p
2
√
In the special case α = ω = 1+ 5
2 we have
ωp ≡ ω mod p if ( p5 ) = +1,
ωp ≡ ω mod p if ( p5 ) = −1,
√
where ω = 1−2 5 . In the first congruence we may cancel ω, and we find
ωp−1 ≡ 1 mod p. If we multiply the second congruence by ω we obtain
ωp+1 ≡ ωω ≡ −1 mod p.
Applying these congruences to Binet’s formula we find, if ( p5 ) = +1,
ω −ω
Up ≡ ≡ −1 mod p and Up+1 ≡ 0 mod p.
ω−ω
B Solutions 271
m2 + 2mn − n2 m2 − 2mn − n2
x= , y= , z = m2 + n2 .
2 2
√ √
2.18. If m ≡ 1 mod 4, then clearly ω = 1+2 m and σ (ω) = 1−2 m form an integral
basis since 1 = ω + σ (ω). √
If√m ≡ 2, 3 mod 4, then {1, m} is an integral basis. Assume that ω = a +
b m generates a normal integral basis. Then there exist integers r, s ∈ Z
such that
√ √
1 = r(a + b m ) + s(a − b m ).
B Solutions 273
√
Comparing the coefficients of m we find that r = s, hence 1 = 2ar, which
is a contradiction. √
2.19. It is clear that K = Q( 3 2 ) is closed with respect to addition and
multiplication.
For proving that it is always possible to divide by nonzero elements we
observe that it is sufficient to show that α1 is an element of K. If we write
√
α = β, then αβ = 1. With α(x) = a + bx + cx we have α( 2) = α. The
1 2 3
√ 6 + √26 2 √ √
31 + 6 26 = √ = (3 2 + 13 )2 .
2
√ √ √
2.21. We
√ have√ 172+ 4 15 √ = (2 3 + 5 )2 . Finding more examples is trivial:
( 2 + √5 ) = 7 + 2√ 10.
2.22. If a + b m = (r + s m )2 , then a = r 2 + ms 2 . Since 1 ≤ a ≤ m we must
have s = 0, contradicting our assumptions.
√ √
2.23. We have u2 = 22 + m = (2 + −m √ )(2 − −m ); since the only units are
±1 and since the elements ±(2 ± −m ) are not squares (see the preceding
exercise),
√ this means that the Square Product Theorem does not hold in
Z[ −m ].
2.24. We have to show that the factors in the equation
√ √
169 = 13 · 13 = (4 + 3 −17 )(4 − 3 −17 )
√
are irreducible in Z[ −17 ]. Clearly 13 = a 2 + 17b 2 is not solvable in
integers,√which implies that 13 is irreducible.
If 4 + 3 −17 ) = αβ, then taking norms we find 132 = Nα · Nβ, and unless
α or β is a unit, this implies Nα = Nβ = 13, which is impossible.
2.25. The verification of
√ √
5 + −7 −1 − −7 3
=
2 2
274 B Solutions
is straightforward. Similarly,
√ 1 + √−7 3
181 + −7
=− .
2 2
2.26. Consider the points P (x1 , y1 ) and P2 (x2 , y2 ) on the unit circle. Then
P ⊕ Q = (x1 x2 − y1 y2 , x1 y2 + x2 y1 ).
Since
and
2.27. Consider the point P (r, s) on the unit circle, where r = cos α and s = sin α.
By similarity, the projection P (1|t) of P onto the line x = 1 satisfies 2t =
1+r , so t = 1+r .
s 2s
t1 + t1
t3 = .
1 − t1 t2
y3 y1 + y2 + 2x1 x2 (x1 + x2 )2
m= = = = x1 + x2 , and
x3 x1 + x2 x1 + x2
y2 − y1 x 2 − x12
m1 = = 2 = x1 + x2 .
x2 − x1 x1 + x2
q
2.31. We have f (q) = 1−q−q 2
, hence
1 1
q −q
q
f = = = = f (−q).
q 1− q − q2
1 1 q2 − q − 1 1 + q − q2
where α = ωω = − ω12 . Since |α| < 1, the fraction tends to 1, and this implies
the claim.
2.33. We have to show:
1. If x12 − x1 y1 − y12 = x22 − x2 y2 − y22 = 1, then x32 − x3 y3 − y32 = 1 for
x3 = x1 x2 + y1 y2 , y3 = x1 y2 + x2 y1 − y1 y2 .
276 B Solutions
(1, 0), (1, −1), (−2, −1), (−2, 3), (5, 3), (5, −8), . . .
(−1, 0), (−1, 1), (2, 1), (2, −3), (−5, −3), (−5, 8), . . . .
Observe that the coordinate with the largest absolute value is conserved. If,
for example, (x, y) is an integral point with x > y > 0, then the second
solution of the quadratic equation Y 2 + xY − x 2 + 1 = 0 is Y = −y − x, and
we obtain the second integral point (x, −x − y). In all examples, |x| + |y| is
increasing.
We claim that every integral point on the Fibonacci hyperbola belongs to one
of these two sequences. Assume therefore that (x, y) is an integral point on
the Fibonacci hyperbola with |y| > 0. Vieta jumping gives us a new point
(x1 , y1 ) with |y1 | ≤ |y| and |x1 | + |y1 | < |x| + |y| until we find one with
y-coordinate 0. But then x = ±1, and (x, y) belongs to one of the two
sequences above.
2.35. We will show more generally that the group law on the Pell conic x 2 − my 2 =
1 is given by P1 ⊕ P2 = P3 , where Pj = (xj , yj ) and
x3 = x1 x2 + my1 y2 , y3 = x1 y2 + x2 y1 .
We show that P3 is on the conic and that the slopes of the lines P1 P2 and NP3
coincide:
x32 − my32 = (x1 x2 + my1 y2 )2 − m(x1 y2 + x2 y1 )2 = (x22 − my22 )x12 − m(x22 − my22 )y12
= x12 − my12 = 1,
y2 − y1 y3
=
x2 − x1 x3 − 1
(x3 − 1)(y2 − y1 ) = y3 (x2 − x1 )
(x1 x2 + my1 y2 − 1)(y2 − y1 ) = (x1 y2 + x2 y1 )(x2 − x1 )
y2 (x12 − my12 ) − y2 = y1 (x22 − my22 ) − y1
y2 − y1 = y2 − y1 ,
Since its kernel has at most p + 1 elements, it must be onto. Thus the kernel
C(Z/pZ), which consists of the points on the conic x 2 − 3y 2 = 1 over Z/pZ,
has exactly p + 1 elements. Since the multiplicative groups of finite groups
are cyclic, and since subgroups of cyclic groups are cyclic, it follows that the
points modulo p on the conic x 2 − 3y 2 = 1 form a cyclic group of order
p + 1.
If P ∈ C(Z/pZ) is any point on the conic x 2 − 3y 2 = 1 modulo p, where
( p3 ) = −1, then (p + 1)P = N = (1, 0). Since the group C(Z/pZ) is cyclic,
p+1 p+1
2 P = N or 2 P = T , where T (−1, 0) is the unique point of order 2.
278 B Solutions
otherwise.
We now claim that P = (2, 1) does not have the form P = 2Q for some
Q ∈ C(Z/pZ). In fact, if Q = (a, b), then 2Q = (a 2 + 3b 2, 2ab); but
the system of congruences a 2 + 3b2 ≡ 2 mod p and 2ab ≡ 1 mod p is
equivalent to a 2 + 3( 2a 1 2
) ≡ 2 mod p, i.e., to 0 ≡ 4a 4 − 8a 2 + 3 = (2a 2 −
1)(2a −3) mod p. The congruence 2a 2 ≡ 3 mod p
2
√
is not solvable for primes
p = 2q − 1 ≡ 7 mod 8 with ( p3 ) = −1. Let a ≡ 22 mod p be a solution of
the congruence 2a 2 ≡ 1 mod p. Then b ≡ a mod p, but the point (a, b) is
not in C(Z/pZ).
Thus p+1 2 P = (−1, 0) for P = (2, 1) if p = 2 − 1 is prime. Assume
q
implies that P has order p+1. This in turn implies that C(Z/pZ) is cyclic, and
that p is prime: If p = qr is a product of coprime integers, then C(Z/pZ)
C(Z/qZ) × C(Z/rZ), which contradicts the fact that C(Z/pZ) is cyclic; if p
is divisible by the square q 2 of an odd prime q, then the order of C(Z/pZ) is
a multiple of q.
Finally let us compute the point Q with 2Q = p+1 2 P = (−1, 0). Setting
Q = (a, b) we obtain 2Q = a 2 + 3b 2 , 2ab). If b = 0, then a 2 ≡ −1 mod p,
which is impossible since p ≡ 3 mod 4. Thus a = 0 and 3b2 ≡ −1 mod p.
Since ( p3 ) = ( −1
p ) = −1, this congruence is solvable, and Q has the form
(0, ±b) as claimed.
2.38. Jumping upwards we construct the sequence of integral points (2, 0), (8, 2),
(30, 8), (112, 30), . . . ; the remaining integral points are obtained from these
by switching x and y and by replacing (x, y) with (−x, −y). Inverting the
process we easily see that there are no other integral points.
There √ integral points. Using ε =
√ are explicit formulas of Binet type for these
2 + 3 we define integers Un , Vn via Un + Vn 3 = εn ; then induction shows
that the integral points in the first quadrant are given by (2Vn+1 , 2Vn ).
2.39. The conic C : x 2 + y 2 − 3xy + 1 = 0 has the integral point (x, y) = (1, 1).
Applying Vieta jumping we obtain the sequence of integral points
(1, 1), (1, 2), (2, 5), (5, 13), (13, 34), . . .
and one in which x and y are interchanged. We now prove that there are no
other integral points on the conic lying in the first quadrant (we obtain similar
sequences in the third quadrant by switching the signs of x and y) (Fig. B.1).
In fact, assume that (x, y) is an integral point on C lying in the first quadrant.
If y > x, then (x, y ) with y = 3x − y is another integral point on C in the
first quadrant with y ≤ x. In fact, the equation yy = x 2 − 1 immediately
implies y = x y−1 < xy < x.
2 2
B Solutions 279
Continuing in this way we eventually must find an integral point (ξ, η) in the
first quadrant with ξ = η, and this implies (ξ, η) = (1, 1). Thus (x, y) must
arise by Vieta jumping from (1, 1), and this is what we wanted to prove.
2.40. We first study the conic x 2 + y 2 − 3xy + x = 0. It has the integral point
(x, y) = (1, 1). Applying Vieta jumping we obtain the sequence of integral
points
(1, 1), (1, 2), (4, 2), (4, 10), (10, 25), . . . .
to P∗ (−x − 2y, y) and P ∗ (x, −2x − y); starting from (1, 0) we obtain the
sequence of integral points
(1, 0), (1, 2), (−5, 2), (−5, −12), (29, −12), (29, 70), . . . ,
(1, 0), (3, 2), (−3, 2), (−17, −12), (17, −12), (99, 70), . . .
on Platon’s hyperbola H.
The standard argument shows that every integral point on H comes from the
sequence beginning with (1, 0) or the one with (−1, 0).
2.42. The substitution x = X − 2Y and y = Y transforms the hyperbola C :
x 2 − 3y 2 = 1 into C : X2 − 4XY + Y 2 = 1. Given a point P (x, y) on C (Z),
Vieta jumping gives rise to P∗ (4y − x, y) and P ∗ (x, 4x − y); the sequence
of integral points starting from (1, 0) is
on C (Fig. B.2).
and
(0, 1), (2n, 1), (2n, 4n2 − 1), (8n3 − 4n, 4n2 − 1), . . .
of integral points on C .
2.44. The field Q(α, β) as basis {1, β, β 2 α, αβ, αβ 2 }. We now multiply each basis
element with α + β and find
This procedure shows that if α and β are algebraic integers (if you cannot
√
see
where we are using this fact, go through the calculation with α = 1+2 3 ), then
so is α + β.
The same procedure works for products of algebraic integers; here we
multiply B with α · β.
282 B Solutions
Chapter 3
y Δs 2 + r 2 y 2rs
x+ = , = .
2 Δs 2 − r 2 2 Δs 2 − r 2
The number of Fp -rational points on C for odd primes p follows from the
corresponding result for Pell conics of the form X2 − dY 2 = 1: We have
singular point (0, 0). The same formulas hold for Fp -rational points, hence
#C(Fp ) = p.
For C : y 2 = x 3 +x 2 we find with the same pencil y = tx that t 2 x 2 = x 3 +x 2 ;
this is equivalent to x 2 (x + 1 − t 2 ) = 0. This yields the parametrization
(x, y) = (t 2 − 1, t 3 − t). The singular point (0, 0) is parametrized twice,
namely for t = ±1. The last fact is responsible for #C(Fp ) = p − 1.
3.3. The map λ is well defined since λ(s + Z) = λ(s + 1 + Z). Now we find
λ(s + Z) + λ(t + Z) = (cos 2πs, sin 2πs) + (cos 2πt, sin 2πt)
= (cos(2πs) cos(2πt) − sin(2πs) sin(2πt),
sin(2πs) cos(2πt) + sin(2πt) cos(2πs))
= (cos(2π(s + t)), sin(2π(s + t)) = λ(s + t + Z).
The kernel of λ consists of all cosets t + Z with cos 2πt = 1 and sin 2πt = 0;
these equations imply that t is an integer, hence t + Z = Z, and λ is injective.
Since λ is clearly surjective, it must be bijective.
Since λ is an isomorphism, the image of the cyclic subgroup of order n
generated by n1 + Z generates a cyclic subgroup of order n on the unit circle.
2π i
The claims now follow since e n = cos 2π n + i sin n is a primitive n-th root
2π
2·1 ≡ 2 2·4 ≡ −7
2·2 ≡ 4 2·5 ≡ −5
2·3 ≡ 6 2·6 ≡ −3
2·7 ≡ −1
2
which implies that ( 15 ) = (−1)4 = +1.
3
For computing ( 35 ), we choose the half system {1, 2, . . . , 17} and compute
the remainders of the products with 3; the only negative remainders occur for
3 · 1 ≡ −17 3 · 4 ≡ −8
3 · 2 ≡ −14 3 · 5 ≡ −5
3 · 3 ≡ −11 3 · 6 ≡ −2,
3
which implies that ( 35 ) = (−1)6 = +1.
284 B Solutions
A × B = {(1, 1), (1, 2), (1, 3), (2, 1), (2, 2), (2, 3)}.
The permutation πA that swaps 1 and 2 has signature −1, and the induced
permutation on A × B swaps #B = 3 elements.
3.9. If we take the product over all φ(mn)/2 congruences a · aj ≡ (−1)sj aj , then
we obtain, after cancelling the products of all integers in the half system, the
congruence
a φ(mn)/2 ≡ (−1) sj
mod mn.
p−1
−t (−t)2 − 1 −1
p−1
t t 2 − 1
S= = = −S,
p p p p p
t =0 t =0
hence S = 0.
B Solutions 285
3.12. Each residue class t mod mn can be written uniquely in the form t = rm+sn,
where r and s run through the residue classes modulo n and m, respectively.
Thus
pq−1
t t 2 − 1
n−1 m−1
rm + sn (rm + sn)2 − 1
φmn (1) = =
mn mn mn mn
t =0 r=0 s=0
n−1 m−1
sn rm (sn)2 − 1 (rm)2 − 1
=
m n m n
r=0 s=0
m−1
sn (sn)2 − 1
n−1
rm (rm)2 − 1
=
m m n n
s=0 r=0
= φm (1)φn (1).
mq − nq 2mq q−1 2
Qq (m, n) = ≡ ≡ mq−1 = m 2 mod k
m−n 2m
is a square modulo k.
3.15. We have
m2t − n2t m2 − n2
Qr (m, n) = = Qt (m2 , n2 ) · (m + n),
m2 − n2 m − n
hence
Q (m, n) Q (m2 , n2 ) Q (m2 , n2 )
r 2 t
= .
Qq (m, n) Qq (m, n) Qq (m, n)
The claim now follows from (3.22).
286 B Solutions
3.16. The claim is true for q = 1 and all odd integers p since Q1 (m, n) = 1.
Assume that the claim holds for all odd integers q < q and all p = q. If
p < q, we have, by the quadratic reciprocity law,
Q (m, n)
p−1 q−1 Qq (m, n)
p
p−1 q−1 q
= (−1) 2 · 2 = (−1) 2 · 2
p
= .
Qq (m, n) Qp (m, n) p q
If p > q write p ≡ r mod q with 0 < r < q and use the preceding exercise.
Applying this result inductively to r = 2j t for some odd integer t < q we
find
Q (m, n) 2 j Q (m2 , n2 ) 2 j t r
r t
= = = .
Qq (m, n) q Qq (m, n) q q q
This implies the claim if p > q, and thus finishes the proof.
3.17. 1. For p = 13, only the pairs (3, 4) and (9, 10) are consecutive quadratic
residues, hence RR = 2. Similarly, the pairs (5, 6), (6, 7), and (7, 8) are
consecutive nonresidues, hence NN = 3. Similarly we find RN = NR = 3.
2. The equation RR + RN + NR + NN = p − 2 follows from the fact that
there are exactly p − 2 pairs of consecutive nonzero residue classes modulo
p.
3. Assume that a(a + 1) is a quadratic residue modulo p, and write a(a +
1) = y 2 in Fp . completing the square gives (2a + 1)2 = 4y 2 + 1, hence
(2a + 1 + 2y)(2a +1 −2y) = 1. Setting 2a + 1 + 2y = t and 2a + 1 − 2y = 1t
for some t ∈ F× p we obtain a parametrization of the conic a + a = y . The
2 2
a 3 9
P (±4, ±2) (±3, ±6)
Observe that the group law on C with neutral element N(0, 1) is given by
Thus P = (5, 0) has order 4 since 2P = (0, −1) is the element of order 2.
Moreover, (x, y) ⊕ (0, −1) = (−x, −y) and (x, y) ⊕ (5, 0) = (5y, 5x).
3.18. Since ( −1p ) = −1, the quadratic nonresidues {n1 , . . . , nm } form a half system
modulo p = 2m + 1.
If ( pa ) = −1, then ani ≡ −nj mod p; similarly ani ≡ nj mod p if ( pa ) =
+1. In the first case, there are an odd number of sign changes (namely m), in
the second case there is none. Thus Gauss’s Lemma holds in both cases.
3.19. We have N2 = p−1 and N4 = p2 (p−1)+pN2 = p3 −p2 +p2 −p = p3 −p.
n−4
Assume that Nn−2 = pn−3 − p 2 ; then
n−4 n−2 n−2
Nn = pn−2 (p − 1) + p(pn−3 − p 2 ) = pn−1 − pn−2 + pn−2 − p 2 = pn−1 − p 2
as claimed.
3.20. It is easy to produce infinitely many integral solutions of Q4 (m, n) = x 4 : If
Q4 (m, n) = c, then Q4 (mc, nc) = c4 . If m ≡ n ≡ 1 mod 4, then
m4 − n4
Q4 (m, n) = = (m + n)(m2 + n2 ) = 4c
m−n
for some odd integer c, hence Q4 (mc, nc) = 4c4 and mc ≡ nc mod 4.
3.21. The first claim is Gauss’s Lemma.
Assume that hq ≡ r mod p for some 0 < r < p2 ; then hq − kp = r for some
integer k with 0 < k < q2 , and then kp ≡ −r mod q. Thus if r is positive
in the first row, then −r shows up in the second row. The other negative
remainders in the second row are the numbers −r with p2 < r < q2 , and they
come in pairs. In fact, if k < q−1
2 and
1 1
kp ≡ −r mod q and p < r < q,
2 2
Chapter 4
4.1. Euclidean division shows that f (x) = (x − a)q(x) + r for some constant r.
Plugging in x = a shows that f (a) = 0 if and only if r = 0, i.e., if and only
if x − a divides f (x). This argument holds in all polynomial rings over fields
because they are Euclidean.
These results are in fact valid over arbitrary polynomial rings: if f (x) =
an x n + . . . + a1 x + a0 , then a0 = 0 implies that f (x) = xg(x) for
g(x) = an x n−1 + . . . + a1 . Conversely, f (x) = xg(x) implies f (0) = 0.
The substitution X = x − a allows us to prove this for general a.
Observe, however, that in (Z/8Z)[x], the polynomial x 2 − 1 is divisible by
x − a for a = 1, 3, 5, 7 since x 2 − 1 = (x − 1)(x + 1) = (x − 3)(x + 3). In
√ f (x) =√3x + 2 does not have any root.
(Z/6Z)[x], the linear polynomial
4.2. The relation 2 · 3 = (1 + −5 √ )(1 − −5 ) is a counterexample to the
Four Numbers Theorem in Z[ −5 ] since all factors are irreducible. The
Four Numbers Theorem claims that all factorizations can be explained by
a common refinement. √ √
The equation 2 · 3 = (2√+ −2 )(2 − −2 ) is compatible with the Four
Numbers Theorem in Z[ −2 ] since
√ √ √ √
2 = − −2 · −2, 3 = (1 + −2 )(1 − −2 ),
√ √ √ √ √ √
2 + −2 = −2 (1 − −2 ), 2 − −2 = − −2 (1 + −2 ).
4.3. Clearly (p, q) ∼ (p, q) since pq = qp, and if (p, q) ∼ (r, s), then (r, s) ∼
(p, q) since ps = qr is equivalent to rq = ps.
B Solutions 289
For checking transitivity assume that (p, q) ∼ (r, s) and (r, s) ∼ (t, u). Then
ps = qr and ru = st. Multiplying these equations yields prsu = qrst;
since R is a domain, we can cancel rs and obtain pu = qt, which implies
(p, q) ∼ (t, u).
The cancellation rule in domains may be proved as follows: If ac = bc for
c = 0, then (a − b)c = 0. Since R is a domain, this implies a = b since c = 0
by assumption.
When verifying the next claims keep in mind that we think of (p, q) as the
“fraction” pq . Clearly (p, q) + (0, 1) = (p, q) and (p, q) · (1, 1) = (p, q),
so (0, 1) and (1, 1) are the neutral elements with respect to addition and
multiplication, respectively. The additive inverse of (p, q) is (−p, q), and
the multiplicative inverse of (p, q) is (q, p) if p = 0.
Finally, the map ι : R −→ K : r → (r, 1) is an injective ring homomorphism
(which allows us to interpret R as a subring of the field we just have
constructed). In fact, 1 ∈ R is mapped to (1, 1) ∈ K, and λ(rs) = (rs, 1) =
(r, 1)(s, 1) = λ(r)λ(s). Finally, r ∈ ker λ if and only if (r, 1) ∼ (1, 1), which
is equivalent to r · 1 = 1 · 1 = 1, i.e., to r = 1.
4.4. We have a ≡ b mod m in R if and only if a − b = mq for some q ∈ R. Since
R ⊆ S, this implies a ≡ b mod m in S.
The converse does not hold in general; in the ring S = Z[ 12 ], we have 1 ≡
0 mod 2 since 1 − 0 = 2 · 12 .
4.5. Another example is
√
2+ −5 3
= √ .
3 2 − −5
p 2 3 5
√
−5 −( −5 )2
√
−3 −( −3 )2
√ √ √
−2 −( −2 )2 (1 + −2 )(1 − −2 )
−1 i(1 − i)2 (2 + i)(2 − i)
√ 2
2 2
√ √ √ 2
3 (2 − 3 )(1 + 3 )2 3
√ 2
5 5
4.23. Assume that π = αβ. Taking norms shows that Nπ = NαNβ. Since Nπ is
a prime by assumption, it is irreducible, hence Nα = ±1 or Nβ = ±1; this
implies that α or β is a unit, hence π is irreducible.
4.24. Most of these claims can be proved using the prime factorization.
a b min(ai ,bi )
1. Here we write a = pi i and b = pi i . Then gcd(a, b) = pi
min(2ai ,2bi )
and gcd(a 2 , b 2 ) = pi = (gcd(a, b))2 as claimed.
2. Assume there is a prime p | gcd(a 2 , b). Then p | a 2 and p | b. Since
p is prime, p | a 2 = a · a implies that p | a, and then p | gcd(a, b):
Contradiction.
3. Let d = gcd(a, b); then d | a, d | b, hence d | (a + b) and therefore
d | gcd(a + b, b).
Conversely, if d = gcd(a + b, b), then d | (a + b) and d | b, hence d
divides (a + b) − b = a, and we have d | gcd(a, b). But then gcd(a, b)
and gcd(a + b, b) divide each other, hence differ at most by a unit.
a b r
4. Write a = pi i , b = pi i and r = pi i . Then
min(ai +ri ,bi +ri )
r +min(ai ,bi )
gcd(ra, rb) = pi = pi i = r gcd(a, b).
√ √
4.25. Any common divisor of a = 1 + −5 and b = 1 − −5 divides their sum
2; since 2 is irreducible,
√
the greatest common divisor is either 1 or 2. But it
√
cannot be 2 since 1+ 2 −5 is not an element of Z[ −5 ].
√ √
On the other hand, a 2 = −4 + 2 −5 and b2 = −4 − 2 −5 have common
divisor 2, which is easily seen to be their greatest common divisor.
4.26. Clearly ω ∈ R[ 12 ]; it is therefore sufficient to show that 12 ∈ S. But
√ √ √ √
( 1+ 2 −5 )2 = −2+2 −5
, so S contains −2+ −5
2 − 1+ 2 −5 + 2 = 12 . This proves
that S = R[ 12 ].
√
Every element of S has the form a+b2m −5 , and such an element is in Q if and
only if b = 0. Thus S ∩ Q consists of all elements of the form 2am , which is
Z[ 12 ].
292 B Solutions
√ √
The factorization 6 = 2 · 3 = (1 + −5 )(1 + −5 ) is not an example of
nonunique factorization in S. In fact, 2 is a unit, and
1 √ √
3= (1 − −5 )(1 + −5 )
2
is a factorization of 3 into the unit 12 and the two irreducible (and even prime)
√
elements 1 ± −5. √ √
The factorization 3 · 3 = (2 − −5 )(2 + −5 ) can be refined:
1 √ √
3·3= (1 − −5 )2 (1 + −5 )2 ,
4
and we also have
√ √
(2 + −5 ) = −2(1 − −5 )2 ,
where −2 is a√unit in S. √
4.27. Setting x + y −5 = (r + s −5 )2 gives x = r 2 − 5s 2 and y = 2rs. Clearly
the solution (x, y, z) = (2, 1, 3) of x 2 + 5y 2√= z2 does not have this√form.
Now let us work in the domain S = Z[ −5, 12 ]. Here x + y −5 =
√
±2n (r + s −5 )2 for some unit ±2n . This implies x = ±2n (r 2 − 5s 2 ) and
y = ±2n+1 rs. We are interested in coprime integral solutions, so we may
assume gcd(r, s) = 1. Now there are two cases:
• r and s have different parity; then n = 0 and x = ±(r 2 − 5s 2 ), y = ±2rs.
• r and s are both odd; then n = −1 and x = r −5s
2 2
2 , y = rs.
Choosing r = 1, s = −1 and the negative sign gives us the solution
(x, y, z) = (2, 1, 3).
4.28. Assume that ab = ex n for some unit e, and that gcd(a, b) = p. Then both a
and b must be divisible by p, but not both of them are divisible by p2 . Assume
therefore that p a, i.e., that p | a and p2 a. Since ab = ex n we must have
pn−1 | b. Thus we can write a = pa1 and b = pn−1 b1 ; Proposition 4.12
shows that there exist units e1 and e2 with a1 = e1 cn and b1 = e2 d n . This
implies the claim.
4.29. We have x 3 = 4y 2 − 1 = (2y − 1)(2y + 1). Since the factors on the right
are coprime, we must have 2y − 1 = a 3 and 2y + 1 = b3. This implies
b3 − a 3 = 2, and since there are no cubes that differ by 2, the equation does
not have any integral solutions. √
√ only ring homomorphism κ2 : Z[ −5 ] −→ Z/2Z. is given
4.30. The √ by κ2 (a +
b −5 ) = a +b +2Z. Its kernel consists of all elements
√ a +b −5 for which
a and b have the same parity; equivalently, a + b −5 has even norm.
B Solutions 293
There are two ring homomorphisms κ3 and κ3 to Z/3Z, and they are defined
√ √
by κ3 (a + b −5 ) = a + b + 3Z and κ3 (a + b −5 ) = a√ − b + 3Z. The
kernel of κ3 consists of all elements that
√ are congruent to 1 − −5 modulo
√ 3.
The only ring homomorphism κ5 : Z[ −5 ] −→ Z/5Z is κ5 (a √ + b −5 ) =
a + 5Z. Its kernel consists of all elements of the form 5c + b −5.
4.31. The set mZ of multiples of m is an ideal in Z: it is closed under addition
and subtraction since am ± bm = (a ± b)m is also a multiple of m; it is
also closed with respect to multiplication by arbitrary elements r ∈ Z since
r · (am) = ra · m is also a multiple of m.
4.32. If a | b, then b = ar for some r ∈ R, which implies ∈ (a) and therefore
(b) ⊆ (a). The converse is also clear.
As for the remaining claims, we immediately deduce that (1) implies (2).
Now assume that a and b divide each other. Then b = ad and a = be, hence
a = ade and thus de = 1. Thus d and e are units, and we have a = be as
claimed.
Finally, if a = be for some unit e, then (a) = (b) since clearly a ∈ (b) and
b ∈ (a).
4.33. The sum and the product of upper triangular matrices is upper triangular,
which shows that T is a subring of R with unit 1 0 . But T is not an ideal in
01
R since the product 10 01 · 01 00 = 01 00 is not upper triangular.
4.34. Clearly I ∩ R is closed with respect to addition and subtraction. Moreover it
is closed with respect to multiplication by elements r ∈ R since if a ∈ I ∩ R,
then r · a ∈ I since I is an ideal in S and r · a ∈ R since r, a ∈ R and R is a
domain. Thus r · a ∈ I ∩ R, and I ∩ R is an ideal in R as claimed.
4.35. Since I is a nonzero ideal in OK , it contains an element α = 0. Since I is
closed with respect to multiplication by elements of Ok , the element α · α =
N(α) is also in I . Thus I contains a nonzero integer.
The ideal (X) in Z[X] or Q[X], on the other hand, consists of multiples of X,
and so the only constant polynomial in (X) is the zero polynomial 0.
4.36. Reducing a polynomial f ∈ Z[x] modulo a prime number p yields a
polynomial in Fp [x]; clearly this map is a ring homomorphism. Reducing
f modulo x is the same as evaluating f at x = 0 and therefore also is a ring
homomorphism. Since both maps commute our claims follow.
4.37. Clearly Z is a subring of Ok . It is not an ideal since Z√is not closed with
respect to multiplication by ring√elements. For example, m · 1 is not √in Z.
4.38. For showing that√I = (2a + 2 b : a, b ∈√Z} is an ideal in Z[ 2 ] we
show √ that I = ( 2 ). For proving that I ⊆ √ ( 2 ) take
√ an arbitrary
√ element
2a + 2 b ∈ I ; the claim follows√ from 2a + 2 b
√ = 2 · (a
√ 2 + b). On√the
other hand, every element in ( 2 ) has the form 2(b + a 2 ) = 2a + b 2,
and these are elements in√I . √
√ O = Z + 2 2√
The order Z is clearly√a subring √ of Z[ 2 ]; But although
1 + 2 2 ∈ O, the element 2 · (1 + 2 2 ) = 4 + 2 is not in O.
4.39. If f1 ω ∈ Ok and f2 ω ∈ O, then clearly (f1 ± f2 )ω ∈ O. Thus F is an
additive group. For showing that it is an ideal observe that if f ∈ F , then
294 B Solutions
21 = 15 + 6,
15 = 2 · 6 + 3,
6 = 2 · 3,
invoke
√ cubic fields right from the start and factor the equation y 2 = x 3 + 9 in
Q( 9 ).
3
y + ki = (a + bi)3, y − ki = (a − bi)3.
a 2 4 6 8 12
k 11 47 107 191 407
x 5 17 37 65 145
y 2 52 198 488 1692
(1 + 8i)i + (5 + 4i)(1 − i) = 1,
Chapter 5
13 + 4i − i(5 − 7i) = 6 − i.
Finally
5 − 7i = (1 − i)(6 − i).
6 − i = 13 + 4i − i(5 − 7i)
= 13 + 4i − i(26 − 29i − (1 − 2i)(13 + 4i))
= (3 + i)(13 + 4i) − i(26 − 29i).
5.2. Write x 2 + 1 = kp; then both x + i and p are divisible by one of the primes
above p, hence gcd(x + i, p) = (a + bi), where a 2 +√b2 = p. √
If p ≡ 1, 3 mod 8 and x 2 ≡ −2 mod p, then gcd(x − −2, p) = c +d −2,
where c2 + 2d 2 = p.
5.3. We begin by observing that the elements of the second system are pairwise
incongruent modulo π = 1 + 2i. Now −1 ≡ 4, i ≡ 2 and −i ≡ 3 mod π.
5.4. This is a trivial exercise: Just multiply a + bi by the units ±1 and ±i.
5.5. Clearly N(a + bi) = a 2 + b2 is odd if and only if a ≡ b mod 2. In this
(a+bi)(1−i)
1+i =
case, a+bi = a+b
2 − 2 i. Conversely, (1 + i)(c + di) =
a−b
2
c − d + (c + d)i has even norm since c − d ≡ c + d mod 2.
If N(a + bi) = a 2 + b2 is odd, then a and b have different parity. If a is
odd and b is even, then q 2 equiv1 and b2 ≡ 0 mod 4, hence N(a + bi) =
a 2 + b 2 ≡ 1 mod 4.
If a + bi has odd norm and a is even, then (a + bi)i = −b + ai has an odd
real part. Thus every Gaussian integer with odd norm is associated with an
element a + bi ≡ 1 mod 2. Observe that this congruent is equivalent to a ≡ 1
and b ≡ 0 mod 2.
Finally observe that a complete system of coprime residue classes modulo
2 + 2i is {±1, ±i, ±1 + 2i, 2 ± i}. If a + bi ≡ 1 mod 2, then a + bi ≡ 1
298 B Solutions
quadratic reciprocity.
3. Multiplying the trivial congruence c + di ≡ 0 mod (c + di) through by i
we find ci ≡ d mod (c + di). Thus
a + bi a + bi c ac + bci ac + bd ac + bd
= = = = .
c + di c + di c + di c + di c + di q
Thus
ac + bd pq (ac + db)2 + (ad − bc)2 ad − bc 2
= = = = 1.
pq ac + bd ac + bd ac + bd
5. Now
ac + bd a + bi c + di
= .
pq c + di a + bi
cases:
√
• a ≡ b mod 2: Then Nu (a + b −5 ) ≤ a +5b
2 2
2 , hence Nu (ξ ) ≤ 12 · 1+5
4 =
3
4 < 1.
• a ≡ 1, b ≡ 0 mod 2: Replace a by a ± 1 such that | ab | ≤ 1; then Nu (ξ ) ≤
1 a 2 +5b2
4 c2 ≤ 14 (1 + 54 ) < 1.
300 B Solutions
which is impossible
√ since the right hand side is divisible by 3.
If ε = 5 + 2 6 we obtain
a b x y
−5 −2 −2 −4
−−7 3 10 32
−211 90 8.158 736.844
1 = 6b2 ± 6b + 1
in the brackets;
√ but this is impossible since squares are congruent to 0 or
±1 mod 5.
5.19. This is a simple calculation:
φ(x 2 − xy + y 2 ,x 2 − 2xy + y 2 ) = (x 2 − xy + y 2 )2
+ (x 2 − xy + y 2 )(x 2 − 2xy + y 2 ) − (x 2 − 2xy + y 2 )2
= x 4 − x 3 y + x 2 y 2 − xy 3 + y 4 ,
p
ωp − ω ω−ω
Up = ≡ ≡ 1 mod p and
ω−ω ω−ω
ωp+1 − ω p+1
ω2 − ω 2
Up+1 = ≡ = ω + ω ≡ 1 mod p.
ω−ω ω−ω
p
ωp − ω ω −ω
Up = ≡ ≡ −1 mod p
ω−ω ω−ω
and
p+1
ωp+1 − ω ωω − ω ω
Up+1 = ≡ ≡ 0 mod p.
ω−ω ω−ω
The residue class of Up−1 mod p now follows from Up−1 = Up+1 − Up .
B Solutions 303
Chapter 6
6.1. Write a + b = 2c; then a + bi = a + (2c − a)i = a(1 − i) + 2ci. Since 1-i
= -i(1+i) 2 = (1 + i)(1 + i), these elements are multiples of 1 + i.
Conversely, (a +bi)(1+i) = a −b +(a +b)i, and then (a −b)+(a +b) = 2a
is even.
6.2. If (a, b) = d, then there exist elements r, s ∈ R with d = ra + sb. This
implies that gcd(r, s) divides d.
Conversely, if d divides both a and b, then d divides gcd(a, b).
6.3. We find
√ √ √
ab = ((1 + −5 )2 , 2(1 +
−5 ), 3(1 + −5 ), 6)
√ √ √ √ √
= ((1 + −5 )2 , 2(1 + −5 ), 3(1 + −5 ), (1 + −5 )(1 − −5 ))
√ √ √ √
= (1 + −5 )(1 + −5, 2, 3, (1 − −5 ) = (1 + −5 )
√
√ the other hand, the ideals (2) and (2, 1 + −3 ) are distinct since 1 +
On
−3 ∈ (2). √
6.16. With I = (2, 1 + m ) we have I 2 = (2)I , yet clearly I = (2).
6.17. With p = (2, 1 + 3i) we easily check I 2 = (2). The prime ideals (q) are
inert in Z[i] and thus also in Z[i]. If p ≡ 1 mod 4, write p = a 2 + b 2 ; then
p1 = (p, 3a + 3bi) and p2 = (p, 3a − 3bi) satisfy p1 p2 = (p).
We have 1−3i = 2(3+6i)−5−15i ∈ (5, 3+6i) and 3+6i = 5−2(1−3i) ∈
(5, 1 − 3i).
Clearly (3) ⊃ (3 + 6i). If there was an ideal A in Z[3i] with (3)A = (3 + 6i),
then 3 + 6i = 3a for some a ∈ A; but a = 1 + 2i is not even an element of
Z[3i].
Finally we have
Since m2 is odd, the second ideal contains 1 and therefore is the unit ideal.
√
Thus (2) = (2, m )2 in this case.
If m ≡ 3 mod 4, then
√ 2 √ √ √
(2, 1 + m ) = (4, 2 + 2 m, m + 1 + 2 m ) = (4, 2 + 2 m, m − 1
√
= (2)(2, 1 + m, m−1
2 )
306 B Solutions
√ √
since m + 1 + 2 m − (2 + 2 m ) = n − 1. The last√ideal contains 2 and the
2 , hence is equal to (1). Thus (2, 1 + m ) = (2) in this case.
odd integer m−1 2
2
h= 1 + 1 + 1 + 1 − 1 + 1 − 1 + 1 + 1 − 1 − 1) = 3.
23
√
6.25. In each case, the primes below the Gauss bound |Δ|/3 are inert.
6.26. If the prime ideals above (2) are principal, then there must be elements with
norm 2, i.e., the equation x 2 + my 2 = 8 must have integral solutions. For
m ≡ 7 mod 8, this implies m = 7.
6.27. Consider the equation y 2 = x 3 − d for d = 3t 2 − 1 with t = 3c3 , that is,
y 2 = x 3 − 27c6 + 1. Clearly this equation has the solutions (3t 2 , ±1) not
√ if d = 27c − 1 is squarefree and
listed in Theorem 6.20. This implies that 6
c d h(d) c d h(d)
1 26 6 5 421, 874 900
2 1, 727 36 6 125, 9711 1608
3 19, 682 108 7 3, 176, 522 1512
4 110, 591 444 8 7, 077, 887 2088
The field Q( −10 ) also has class number√2, and the nonprincipal ideal class
is generated by the prime ideal a = (2, −10 ) above 2. The primes p ≡
1, 7, 9, 11, 13, 19, 23, 37 mod 40 split in K, and either p = x 2 + 10y 2 (if the
prime ideals p and p above p are principal) or p = 2x 2 + 5y 2 (if p and p lie
in the same class as a). Since x 2 + 10y 2 ≡ ±1 mod 8, this form represents
the primes p ≡ 1, 7, 9, 23 mod 40, and the form 2x 2 + 5y 2 represents the
primes p ≡ 11, 13, 19, 37 mod 40.
308 B Solutions
6.30. The smallest primes p 23 for which f (x) = x 3 − x + 1 splits into three
linear factors modulo p are the following:
√ √
Thus x and y are odd. We find x 3 = y 2 +4f = (y+2 −f )(y−2 −f √ ). The
gcd of the factors on the right must be an ideal with odd norm dividing −f .
Assume that p is a prime ideal dividing both factors; since p is ramified, we
have p2 = p for some prime p | f ; but p | y and p | x then imply p2 | 4f ,
which contradicts our assumption that f is squarefree. √
By unique factorization into prime ideals we conclude√that (y + 2 −f ) =
a3 . Since we have assumed that the class number of Q( −f ) is not divisible
by 3, the ideal a must
√
be principal, say a = (α)
r+s −f
Now write α = 2 ; then, up to sign,
√
3 r 3 − 3f rs 2 + s(3r 2 − f s 2 ) −f
y + 2 −f = α = ,
8
√
and comparing coefficients of −f we obtain
16 = s(3r 2 − f s 2 ).
f h f integral points
11 1 3 · 32 − 16 (5, 9)
19 1 3 · 12 + 16 (5, 7), (101, 1015)
43 1 3 · 32 + 16 (13, 45)
59 3 3 · 52 − 16 (21, 95)
91 2 3 · 52 + 16 (29, 155), (485, 10681)
131 5 3 · 72 − 16 (45, 301)
163 1 3 · 72 − 16 (53, 385)
The equation y 2 = x 3 − 339 has two solutions (13, 291) and (61,√ 475) not
predicted by this result. We conclude that the class number of Q( −339 )
must be divisible by 3. In√fact, the class number is h = 6.
√
6.34. We claim that a1 = ( 11+2 85 ). Since a21 = (2 + 85 ) has norm 81, the ideal
√ √ √ √
a has norm 9. Moreover, 9 = 11+2 85 · 11−2 85 and 2 + 85 = 2 · 11+2 85 − 9
√
are both contained in ( 11+2 85 ), hence a divides this ideal, and since they have
the same norm, they must be equal. √
If the second ideal is principal, then there exists an element β = x+y2 85 with
norm ±7. But the equation x 2 − 85y 2 = ±4 · 7 is not solvable modulo 5.
We have m = 92 + 32√
6.35. √ 2 = 232 + 242 = 312 + 122 = 332 + 42 . Let a = (32 +
√ 1√
m, 9), a2 = (24 + m, 23), a3 = (12 + m, 31) and a4 = (4 + m, 33).
Since K has class group (2, 2), each square of an √ ideal is principal.√In
31+ m 33+ m
particular, a must be principal. The elements α = 2 √ and β = 2
√
have norms Nα = −36 and Nβ = −4, hence γ = − 31+ m
√ = 133 + 4 m
33− m
has norm Nγ √ = 9.
N(1795 + 54 m ) = −5 · 31.
6.36. We first show that x and y must be odd. If both are even, set x = 2X and
y = 2Y ; then 8X3 + 4 = 4pY 2 , i.e., 2X3 + 1 = pY 2 . Then Y must be odd,
hence 2X3 + 1 ≡ p ≡ 5 mod 8; but this implies 2X3 ≡ 4 mod 8, which is
impossible.
Thus x and y are odd. Then
√ √
(−a)3 = 4 − py 2 = (2 − y p )(2 + y p ).
√
Since the factors are coprime, we must have (2 − y p ) = a3 . Since 3 does
√
not divide the class number h of K = Q( p ), a must be principal, hence
√
2 − y p = ηα 3
√
±1± p
fundamental unit ε ≡ 2 mod 2, and this implies that η = 1. Thus
√ c + d √p 3
2+y p = ,
2
and this implies
d(3c2 + pd 2 )
16 = c(c2 + 3pd 2 ) and b = .
8
If c = 1, then 3pd 2 = 15, hence p = 5 and d = 1. If c is even, we get a
contradiction.
Primes of the form p = x 3 +4 have an obvious integral point (x, 1), hence the
√
class number of Q( p ) must be divisible by 3 for these primes. Examples
are p = 93 + 4 = 733 and p = 253 + 4 = 15629.
6.37. If y is odd, then x = 2x1 is even, and we have
y + √k y − √k
2x13 =
2 2
in OK . If q is a prime number dividing both factors on the right, then q divides
their sum y and their difference k. But then q | x and q 2 | (x 3 − y 2 ) = k,
which contradicts the assumption that k is squarefree.
Thus the factors on the right are coprime; we choose the sign of y in such a
way that y ≡ 1 mod 4. This implies
y + √k y − √k
3
= pa and = qb3 ,
2 2
where pq = (2) and ab = (x1 ). But then [p] lies in the cube of an ideal class,
which contradicts our assumption.
√ √
6.38. The prime ideal p = (2, 1+ 2−31 ) generates the class group of Q( −31 ),
which has order 3. By the preceding exercise, the equation y 2 = x 3 − 31 does
not have an integral solution with y odd. By Exercise 1.23 there is no solution
with y even since 31 = 33 + 22 .
Chapter 7
√ √
7.1. Observe that α = η > 0. Since√Nα = αα < 0 we have η = −α and
√
therefore Tr α = α + α = η − η .
7.2. If m = n2 , then
1 = x 2 − my 2 = m2 − n2 y 2 = (x − ny)(x + ny),
312 B Solutions
{ax} − {bx} = ax − bx + "bx# − "ax# = "ax − bx# + {ax − bx} + "bx# − "ax#
√ √
Z[ m ], we can find a power η of ε for which ξ η = a + b m has coefficients
√
a and b that satisfy the bounds from Theorem 7.8. Because of t < ε < m
we find
√
n √ 1
|b| ≤ √ ε+ √ < 1.
m ε
t − 1 = 4a 2 , t + 1 = 2pb 2 ,
t − 1 = 2a 2 , t + 1 = 4pb 2 ,
t − 1 = 4pa 2 , t + 1 = 2b 2 ,
t − 1 = 2pa 2 , t + 1 = 4b 2 .
314 B Solutions
The first and the last equation are impossible modulo p, the third contradicts
the minimality of u. Thus the second equation a 2 − 2pb2 = −1 must be
solvable in integers.
7.7. Assume that t 2 − 2pu2 = 1 for minimal u ≥ 1. Then (t − 1)(t + 1) = 2pu2 ,
and since the factors on the left have greatest common divisor 2 we have one
of the following equations:
t − 1 = 4a 2 , t + 1 = 2pb 2 ,
t − 1 = 2a 2 , t + 1 = 4pb 2 ,
t − 1 = 4pa 2 , t + 1 = 2b 2 ,
t − 1 = 2pa 2 , t + 1 = 4b 2 .
1 2 √
ε= α = 170 + 39 19
2
√
√ the5 coefficient of 19 for squares is even, ε cannot be a square.3
is a unit. Since
Since (1 + 19 ) > ε, the unit ε is either fundamental or a cube. But ε = α
is easily shown to be impossible.
√ √ √
m = 43: We set a = (2, 1 + 43 ), p = (3, 1 + 43 ) and q = (7, 1 + 43 ).
B Solutions 315
√ √
a N(a + 43 ) (a + 43 )
5 −2 · 32 ap 2
6 −7 q
7 2·3 ap
8 3·7 pq
√ √
m = 199: Here 14+ 199 generates a prime ideal p of norm 3 and 19+ 199
has norm 2 · 34 . Thus
√
19 + 199 √
α= √ = 127539 + 9041 199
(14 − 199 )4
1 2 √
ε= α = 16266196520 + 1153080099 199
2
is a unit, which we can show to be fundamental by checking that ε is not a
k-th power for k = 2, 3, 5, and 7.
7.9. From ±4 = t 2 − mu2 ≡ t 2 − u2 mod 8 we immediately deduce that t ≡ u ≡
0 mod 2. √ √
7.10. Clearly Nε = n2 −m = 1, so ε is a unit. Since (1+ m )2 = m+1+2 m >
ε, the unit must be fundamental. √
If m = n2 +1, the element ε = n+ √ m is a unit with norm −1. If m = n2 ±4
is squarefree, then n is odd, and n+2 m is a unit.
√ √
7.11. We have (2) = a2 for a = (2,√ 478 )2 , (3) = pp for p = (3, 1 + 478 ),
and (7) = qq for q = (7, 3 + 478 ). We find
√ √
a N(a + 478 ) (a + 478 )
10 2 · 33 · 7 ap3 q
17 33 · 7 p 3q
22 2·3 ap
24 2 · 72 aq2
25 3 · 72 pq 2
√ √ √
Next (10 + 478 )(17 + 478 ) = (27)aq, hence aq2 = (24 + 478 ), but
we already knew that. But
√
2(10 + 478 ) √
√ = −4635 + 212 478
((22 + 478 ))3
1 2 √
ε= α = 1617319577991743 + 73974475657896 478
2
is a unit, and in fact the fundamental unit. The last claim requires showing
that ε is no p-th power for all primes p ≤ 11.
Since a is principal, so is p, and this implies that q is principal. For the class
number to be 1 we need to show that the prime ideals with norm less than the
Minkowski bound (thus with norm ≤ 19) are principal. This √ is now a √ matter
of a few
√ simple calculations involving the elements 18 + 478, 19 + 478,
23 + 478: It follows that the prime ideals above 11, 13, and 17 are also
principal.
7.12. The equation 2x 2 −5y 2 = ±1 is impossible in integers since it is not solvable
modulo 5. Thus |2x 2 −5y
√ | ≤ 1 implies 2x −5y = 0, which is only possible
2 2 2
1
x= .
1
2+
1
1+
2 +...
1
Then x = yields, after simplifying the fraction, the quadratic
1
1+
1+x √
equation 2x 2 + 2x − 1 = 0, whose unique positive solution is x = 3 − 1.
The partial convergents in this case are
5 7 19 26
, , , ,....
3 4 11 15
7.14. Euclidean division shows
√ √
m=t + m−t
√ √
1 m+t m−t
√ = =t+
m−t 2 2
2 √ √
√ = m + t = 2t + m − t.
m−t
318 B Solutions
√ 1
m=t+ ,
1
t+ √
2t + m − t
In fact we can show that t is divisible by 8 in the latter case. Since gcd(t −
1, t + 1) = 1 we have the two cases
• t − 1 = a 2 , t + 1 = pb2 ; then a 2 − pb 2 = −2, which contradicts the fact
that p ≡ 7 mod 8.
• t + 1 = a 2 , t − 1 = pb 2 . Since a is odd, the first equation implies t ≡
0 mod 8.
7.17. It follows immediately from f (a, b) = f (a , b ) that (a −a )ξ1 +(b−b )ξ2 =
0. Thus ξ1 /ξ2 = a−a
b−b
∈ Q if a − a = 0. This contradiction shows that
a − a = 0 and therefore b − b = 0, hence (a, b) = (a , b ) and finally the
injectivity of f .
7.18. Since ε + ε = 4 and εε = 1 we have
n n
Vn+1 = (εn + ε )(ε + ε ) − εε − ε εn = 4Vn − Vn−1 .
Similarly,
2n n
V2n = ε2n + ε = (εn + ε )2 − 2 = Vn2 − 2.
Chapter 8
+1 q
8.1. (See Mignotte [93]) We already know that in this case y +1 = qa 2 and yy+1 =
qb2, where ab = x.
Assume that q = 8k + r for r = 5, 7. Then y = qa 2 − 1 ≡ a − 1 mod 8.
Moreover,
implies
(−1)m
S0 ≡ mod 3.
1 + 2m − m2
The residue class modulo 3 of the numerator depends on m mod 2, that of the
denominator on m mod 3. Thus the residue class of S0 mod 3 only depends on
m mod 6, and the claim follows by verifying it for all integers m with 1 ≤ m ≤
6.
8.3. We have
m m m m
(1 + 1)m = + + + + ...,
0 1 2 3
m m m m
(1 − 1)m = − + − + ..., hence
0 1 2 3
m m m
2m = 2 +2 +2 +...
0 2 4
m m m
=2 +2 +2 + ....
1 3 5
8.4. Let ρ denote a primitive cube root of unity. Then for k = 0, 1, 2 we have
m m m m m
(1 + ρ k )m = + ρk + ρ 2k + + ρk + ....
0 1 2 3 4
• If m ≡ 0 mod 3, then
m m m 2m + 2(−1)m
+ + + ... = .
0 3 6 3
m m m m m
(1 + i k )m = + ik + (−1)k + i 3k + +...
0 1 2 3 4
hence
⎧
⎪ m−2 + (−1) m m−2
if m ≡ 0 mod 4,
⎪2
⎪
4 2 2
⎪
⎨2m−2 + (−1) m−1 m−3
m m m 4 2 2 if m ≡ 1 mod 4,
+ + +... =
0 4 8 ⎪
⎪ 2m−2 if m ≡ 2 mod 4,
⎪
⎪
⎩ m−2 m+1 m−3
2 + (−1) 4 2 2 if m ≡ 3 mod 4
as claimed.
8.6. If y and y + 1 are S-smooth, then 2 ∈ S since either y or y + 1 is even. Thus
4y(y + 1) = (2y + 1)2 − 1 is S-smooth. We claim that there are at most 3n
integers a > 0 for which a 2 − 1 is S-smooth.
Assume that a 2 − 1 = p1e1 · · · pnen . Write ej = 2fj + gj with gj ∈ {0, 1, 2},
gj
where gj = 0 if ej = 0 and gj = 2 otherwise, and set d = pj and
fj
b = pj ; then a 2 − 1 = db 2 , or a 2 − db2 = 1, and each prime dividing b
also divides d. Størmer’s Theorem 8.4 then implies that, for a fixed value of d,
the equation a 2 − 1 = db2 has at most one positive solution with the property
g
that primes dividing b also divide d. Since d = pj j and 0 ≤ gj ≤ 2 there
are at most 3n choices for d, and this proves our claim.
For improvements of this procedure see [75].
8.7. Let x and y be natural numbers satisfying x 2 +x+1 = 3y 2 . Then x ≡ 1 mod 3,
hence we can write x = 3z+1 for some√natural number z. √Then 9z +3 = 12y
2 2
Chapter 9
9.1. The equality [a] = [b] of ideal classes is by definition equivalent to the
existence of an element α ∈ k × with a = (α)b. Applying σ to this equation
shows that aσ = (α σ )bσ , which in turn implies that [aσ ] = [bσ ]. Since
σ 2 = id, applying σ to the last equation now √ proves the √converse.
9.2. The norm of the fundamental unit ε = 3 + 10 of Q( 10 ) has norm −1,
hence an ideal above a prime number p is principal if and only if the equation
x 2 − 10y 2 = p has integral solutions. Clearly x 2 − 10y 2 = 2 and x 2 −
10y 2√= 5 are impossible
√ modulo
√ 5 and modulo 8,√respectively. Moreover,
(2 − 10 ) = (2, 10 )(3, 1 + 10 ), hence (3, 1 + 10 ) cannot be principal
either. √ √
The ideals (2, 10 ) and (5, 1+ 10 ) are √ generated by ramified prime ideals,
hence are ambiguous, whereas (3, 1 + 10 ) is not ambiguous.
9.3. We know that the fundamental unit ε has norm −1, so ideals above primes q
are principal if and only if q = x 2 −2py 2 has integral solutions. The equation
2 = x 2 −2py 2 is not solvable modulo p since 2 is a nonsquare modulo primes
p ≡ 5 mod 8. √
9.4. Assume that (2, 2p ) = (α) is principal. Then ε = 12 α 2 is a unit, and ε
cannot be a square since 2 is no square in K. But Nε = 14 (Nα)2 = +1,
hence the fundamental unit must have norm +1.
Conversely, assume that Nε = +1. By Hilbert’s Theorem 90 there is an
element α ∈ OK with α 1−σ = ε. Then (α) is ambiguous, √ and getting√rid of
rational
√ prime we find that (α) is one of the ideals (1), (2, 2p ), (p, 2p )
or ( 2p).
If (α) = (1), then α = η for some unit η, but then ε = η11−σ cannot be
fundamental. √ √
Similarly, if (α) = ( 2p), then α = η 2p for some unit η, and again
α 1−σ = −η1−σ contradicts the fact that √ ε is fundamental.
√
Thus√ (α) must be one of the ideals (2, 2p ) or(p, 2p ); since their product
is ( 2p ), they must both be principal.
B Solutions 323
√
Finally assume that 2p = a 2 + b2 with a > b > 0 and set a = (a, b + m ).
Clearly
a2 = (a 2 , a(b + 2p ), (b + 2p )2 ) = (2p − b2 , a(b + 2p ), (b + 2p )2 )
= (b + 2p )(b − 2p, a, b + 2p ) = (b + 2p ).
m + ni (m + ni)2 m2 − n2 2mn
α= = = 2 + 2 · i.
m − ni (m − ni)(m + ni) m + n2 m + n2
x = m2 − n2 , y = 2mn and z = m2 + n2
as desired.
324 B Solutions
is a solution of x 2 − my 2 = z2 .
9.10. An element α ∈ k × belongs to H G if and only if (α) = (α σ ). This is
equivalent to the existence of a unit ε with α = εα σ , i.e., to ε = α 1−σ .
Clearly ε ∈ E[N] since Nε = 1, and the kernel of the homomorphism
λ((α)) = εE 1−σ consists of all principal ideals (α) with α 1−σ = η1−σ for
some unit η. But then β = α/η has the property that (α) = (β) and β σ = β,
which shows that β√∈ Q× . Thus ker λ = P .
9.11. √ ideal (3, 1 + 10 ) has norm 3 and is not principal, whereas (1, 3 +
The
10 ) = (1) lies in the principal class.
9.12. If p = (π), then (p) = p2 = (π 2 ). Thus π 2 = εp for some unit ε. If ±ε is a
square, then so is ±p, and we must have m = ±p, which we have excluded.
This works more generally √ for products n of disjoint
√ ramified primes. If m =
30, the elements α = 6 + 30 and β = 5 + 30 generate ramified ideals,
2 2 √
and we have α6 = β5 = 11 + 2 30.
∗ √ ∗
9.13. If pq = +1, then qp = +1 for q ∗ = ( −1 q )q. Thus p splits in k = Q( q ),
and we have ±4ph = x 2 + q ∗ y2 , where h is the odd class number of k.
Reduction modulo q then implies pq = +1.
9.14. If p ≡ 3 mod 4, then the equation u2 −pu2 = −1 is impossible modulo p (as
well as modulo 4). Assume therefore that p ≡ 1 mod 4 is prime, and let (t, u)
be the smallest positive solution of the Pell equation t 2 − pu2 = 1. Then t
must be odd, and in pu2 = (t − 1)(t + 1) we have gcd(t + 1, t − 1) = 2. Thus
either t − 1 = 2a 2 and t + 1 = 2pb2 or t − 1 = 2pa 2 and t + 1 = 2b2 . In the
second case we obtain b2 − pa 2 = 1 for a smaller pair (a, b) contradicting
our assumptions. Thus pb2 − a 2 = 1 and therefore a 2 − pb2 = −1, hence
√
the fundamental unit of Q( p ) has negative norm if p ≡ 1 mod 4 is prime.
9.15. Since x 2 − qy 2 is even, x and y must have the same parity. If x and y are
odd, then x 2 − qy 2 ≡ x 2 + y 2 ≡ 2 mod 4, which contradicts the fact that
x 2 − qy 2 is divisible by 4. Thus x = 2A and y = 2Y are even, and we find
±ph ≡ X2 − qY 2 .
Again we have X2 − qY 2 ≡ X2 + Y 2 mod 4, and we find that X and Y must
be odd and that ±ph ≡ 1 mod 4. Since p ≡ 1 mod 4, the plus sign must
hold.
B Solutions 325
9.16. If Nεm = +1, then εm = α σ −1 by Hilbert’s Theorem 90. Set a = (α); then
aσ = (α σ ) = (αεm ) = (α), so a is an ambiguous √ principal ideal. If a = (1)
then εm = 1, which is nonsense; similarly, a = ( m ) leads to εm = −1.
9.17. Assume that p ≡ 1 mod 4 is a prime number, and let ε denote the fundamen-
√
tal unit of k = Q( p ). If Nε = +1, then there is an ambiguous principal
√
ideal (α) = (1), ( p ). But ambiguous principal ideals are generated by
√
ramified primes, and the only ramified prime in Q( p ) is p (here we have
used p ≡ 1 mod 4, so the discriminant √ of k is Δ =√p).
9.18. The fundamental unit ε = 170 + 39 19 of Z[ 19 ] is ≡ 1 mod 13, so
reduction modulo the prime ideals above 13 only yields the trivial residue
classes ±1. √
The fundamental unit ε = 1+2 5 , on the other hand, is a primitive root
modulo the prime ideals above 11, so in this case the image of the reduction
homomorphism
√ is the whole coprime residue class group modulo π =
4 + 5.
9.19. It suffices to prove the result for coprime
√ values of x and y since rational
primes p have√the form p = p + 0 10. Now write the norm n = x 2 − 10y 2
of α = x + y 10 as a product of primes p satisfying ( p2 ) = ( p5 ) = +1 and
primes q satisfying ( p2 ) = ( p5 ) = −1. If p | n, then p = ππ , and either
π | α or π | α. Thus it remains to prove the result for elements α whose
norm is a product of primes q.
Let (α) = q1 · qt denote its prime ideal factorization; observe that t = 2s
must be√ even since each ideal
√ qj has order √ 2 in the class group. Now√write
qj (2, 10 ) = (2aj + bj 10 ). Then (2, 10 )t (α) = (2aj + bj 10 ),
√ √
where (2, √ 10 )t = √2s . Dividing each factor on the right by 2 we obtain
(α) = (aj 2 + bj 5 ), and this implies our claim. Analogous results hold
for other fields with class number 2.
9.20. By the ambiguous class number formula, the class number is odd since there
are two ramified primes (2 and q) and since the fundamental unit has norm
+1 since the equation t 2 − 2pu2 = −1 does not have a solution modulo p by
the first supplementary law.
√
Since the prime ideal a = (2, pq ) satisfies a2 = (2), it must be principal
(there is no class of even order since the class number is odd); thus there must
be an element with norm ±2: X2 − 2py 2 = ±2. Clearly X = 2x must be
even, and we deduce 2x 2 − py 2 = ±1. The equation 2x 2 − qy 2 = 1 is
impossible modulo 8; thus 2x 2 − qy 2 = −1 must be solvable. Reducing this
equation modulo q implies 2x 2 ≡ −1 mod q, hence ( q2 ) = ( −1 q ) = −1.
326 B Solutions
Chapter 10
10.1. This is a purely formal exercise. If 1A and 1B denote the neutral elements of
A and B, then (1A , 1B ) is the neutral element of A ⊕ B. The inverse element
of (a, b) is (a −1 , b −1 ), and associativity is directly inherited from A and B.
10.2. For a subgroup U of A ⊕ B define the subgroup A1 of A as the set of all
a ∈ A for which there exists an element (a, b) ∈ U , and define B1 similarly.
Clearly A1 ⊕ B1 is a subgroup of U . Conversely, given an element (a, b) ∈
U , we have a ∈ A1 and b ∈ B1 by definition, hence (a, b) ∈ A1 ⊕ B1 .
10.3. This is easy: Changing a by a multiple of N = n1 N2 does not change the
residue classes a + N1 Z and a + N2 Z.
10.4. Let d = gcd(N1 , N2 ) and write d = mN1 + nN2 . Then
q + q2 − q3 + q4 − q5 − q6 q + 2q 2 + q 3 + 2q 4 + q 5
Δ = −7 : f (q) = =
1−q 7 1 + q + q2 + q3 + q4 + q5 + q6
q + q3 − q5 − q7 q + q3
Δ = −8 : f (q) = =−
1−q 8 1 + q4
q + q 3 + 2q 4 + 3q 5 + 2q 6 + q 7 + q 9
Δ = −11 : f (q) =
1 + q + q 2 + . . . + q 10
q − q 5 − q 7 + q 11 q − q3
Δ = 12 : f (q) = = .
1−q 12 1 − q2 + q4
B Solutions 327
10.9. Since
Fekχ (q)
fχ (q) =
1 − qN
we have
we obtain
q − q3 1 1 1 1 1
fχ (q) = = √ − − + .
1 + q4 2 2 1 − ζq 1 − ζ 3q 1 − ζ 5q 1 − ζ 7q
Fekχ (ζ k )
ζk Fekχ (ζ k )
ak = =− .
−Nζ k(N−1) N
p−1
n n
p−1
Fekp (x) = x ≡ nm x n mod p.
p
n=1 n=1
p−1
Therefore Fekp (1) ≡ n=1 nm ≡ 0 mod p by Gauss’s congruence (3.12).
10.14. We form the derivatives of the polynomials in the preceding exercise:
··· ···
p ≡
Fek(k) nm · n(n − 1) · · · (n − k + 1)x n−k mod p.
(x + 1)p − 1 p p−2 p
Φp (x + 1) = = x p−1 + x +...+ .
x 1 p−1
This polynomial is Eisenstein since p | pk for 1 ≤ k ≤ p − 1 and since
p
p2 p−1 = p. Therefore Φp (x + 1) is irreducible.
10.16. We find the following values for Np :
p 5 7 11 13 17 19 23 29 31 37
Np 5 3 11 15 17 27 23 29 27 27
10.17. We have
t t t
x t1 +t2 +...+tn .
1 2 n
Fekp (x)n = ···
t1 ,...,tn
p p p
p−1
Fekp (x)n = Jn (a)x a
a=0
as claimed.
Bibliography
1. M. Aigner, Markov’s Theorem and 100 Years of the Uniqueness Conjecture (Springer, Cham,
2013)
2. N.C. Ankeny, S. Chowla, H. Hasse, On the class-number of the maximal real subfield of a
cyclotomic field. J. Reine Angew. Math. 217, 217–220 (1965) √
3. G. Arendt, Éléments de la théorie des nombres complexes de la forme a + b −1. Programme
Collège Royal Français, September 1863
4. A. Ash, R. Gross, Fearless Symmetry. Exposing the Hidden Patterns of Numbers (Princeton
University Press, Princeton, 2006)
5. A. Ash, R. Gross, Elliptic Tales: Curves, Counting, and Number Theory (Princeton University
Press, Princeton, 2012)
6. R. Ayoub, On L-functions. Monatsh. Math. 71, 193–202 (1967)
7. R. Ayoub, S. Chowla, On Euler’s polynomial. J. Numb. Theory 13, 443–445 (1981)
8. E.J. Barbeau, Pell’s Equation (Springer, New York, 2003)
9. E. Benjamin, C. Snyder, Elements of order four in the narrow class group of real quadratic
fields. J. Aust. Math. Soc. 100, 21–32 (2016)
10. C. Bergmann, Über Eulers Beweis des großen Fermatschen Satzes für den Exponenten 3.
Math. Ann. 164, 159–175 (1966)
11. D. Bernoulli, Observationes de seriebus quae formantur ex additione vel subtractione
quacuncque terminorum se mutuo consequentium. Commentarii Acad. Sci. Imp. Petropol.
III (1728), 85–100
12. Y.F. Bilu, Y. Bugeaud, M. Mignotte, The Problem of Catalan (Springer, New York, 2014)
13. J.H. Bruinier, G. van der Geer, G. Harder, D. Zagier, The 1-2-3 of Modular Forms (Springer,
New York, 2008)
14. J.W.S. Cassels, Lectures on Elliptic Curves (Cambridge University Press, Cambridge, 1991)
15. J.W.S. Cassels, Local Fields (Cambridge University Press, Cambridge, 1986)
16. W. Castryck, A shortened classical proof of the quadratic reciprocity law. Am. Math. Monthly
115, 550–551 (2008)
17. H.H. Chan, L. Long, Y. Yang, A cubic analogue of the Jacobsthal identity. Am. Math. Monthly
118, 316–326 (2011)
18. P. Chebyshev, Sur les formes quadratiques. J. Math. Pures Appl. 16, 257–282 (1851)
19. K. Chemla, S. Guo, Les neuf chapitres. Le Classique mathématique de la Chine ancienne et
ses commentaires (Dunod, Paris, 2004)
20. H. Cohen, A Course in Computational Algebraic Number Theory (Springer, Berlin, Heidel-
berg, 1993)
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2021 331
F. Lemmermeyer, Quadratic Number Fields, Springer Undergraduate
Mathematics Series, https://doi.org/10.1007/978-3-030-78652-6
332 Bibliography
21. J.H.E. Cohn, Eight Diophantine equations, Proc. Lond. Math. Soc. (3) 16, 153–166 (1966);
Corr. ibid. (3) 17, 381 (1967)
22. H. Cohn, Advanced Number Theory (Dover Publications, New York, 1980); Original: A
Second Course in Number Theory (Wiley, New York, 1962)
23. H. Cohn, A Classical Invitation to Algebraic Numbers and Class Fields, 2nd edn. Universitext
(Springer, New York, 1988)
24. J.B. Cosgrave, K. Dilcher, An Introduction to Gauss Factorials, Am. Math. Monthly 118,
812–829 (2011)
25. D. Cox, Primes of the Form x 2 + ny 2 (Wiley, New York, 1989)
26. D. Cox, Why Eisenstein proved the Eisenstein criterion and why Schönemann discovered it
first. Am. Math. Monthly 118, 3–21 (2011)
27. H. Davenport, The Higher Arithmetic, 8th edn. (Cambridge University Press, Cambridge,
2008)
28. R. Dedekind, Gesammelte mathematische Werke (Friedrich Vieweg & Sohn, Braunschweig,
1932)
29. P.G.L. Dirichlet, Mémoire sur l’impossibilité de quelques équations indéterminées du cin-
quième degré. Acad. Sci. Royale France 1825; Werke I, 1–46
30. P.G.L. Dirichlet, Einige Resultate von Untersuchungen über eine Classe homogener Functio-
nen des dritten und der höheren Grade, Ber. Verh. Königl. Preuß. Akad. Wiss. 1841, 280–285
(1841); Werke I, 625–632
31. P.G.L. Dirichlet, Vorlesungen über Zahlentheorie, 2nd edn., ed. by R. Dedekind (Brunswick
1871); English translation Lectures on Number Theory (American Mathematical Society and
London Mathematical Society, London , 1999) √
32. F.W. Dodd, Number theory in the integral domain Z[ 12 + 12 5 ], Dissertation Univ. Northern
Colorado, 1981; published as Number Theory in the Quadratic Field with Golden Section
Unit (Polygonal Publishing House, Passaic, NJ, 1983)
33. D.S.L. Eelkema, Integer factorisation using conics, Bachelor thesis, Groningen (2020)
34. R.B. Eggleton, C.B. Lacampagne, J.L. Selfridge, Euclidean quadratic fields. Am. Math.
Monthly 99, 829–837 (1992)
35. G. Eisenstein, Neuer und elementarer Beweis des Legendre’schen Reciprocitäts-Gesetzes. J.
Reine Angew. Math. 27, 322–329 (1844); Math. Werke I, 100–107
36. L. Euler, Theoremata circa divisores numerorum in hac forma paa ± qbb contentorum.
Commun. Acad. Sci. Petropol. 14, 151–158 (1751); Opera Omnia I - 2, 194–222
37. L. Euler, De numeris, qui sunt aggregata duorum quadratorum. Nova Comm. Acad. Sci.
Petropol. 4(1752/3), 1758, 3–40; Opera Omnia I - 2, 295–327
38. L. Euler, Vollständige Anleitung zur Algebra (Birkhäuser, Basel, 1770); Leipzig (1883)
39. L. Euler, Opera Postuma. Fragmenta arithmetica ex adversariis mathematicis deprompta,
vol. 1, (1862), pp. 231–232; S. 157
40. T. Evink, A. Helminck, Tribonacci numbers and primes of the form p = x 2 + 11y 2 . Math.
Slovaca 69, 521–532 (2019)
41. F.G. Frobenius, Über das quadratische Reziprozitätsgesetz I, Sitzungsberichte Berliner Akad.
335–349 (1914). Ges. Abhandl. 628–642
42. F.G. Frobenius (unter Benutzung einer Mitteilung des Herrn Dr. R. Remak), Über quadratis-
che Formen, die viele Primzahlen darstellen. Sitz. Kön. Preuß. Akad. Wiss. Berlin (1912),
966–980; Ges. Abh. III, 573–587
43. C.F. Gauß, Disquisitiones Arithmeticae, 1801; deutsche Übersetzung Maser 1889; Neuauflage
(K. Reich, Hrsg.), Georg Olms Verlag 2015
44. C.F. Gauß, Theorematis fundamentalis in doctrina de residuis quadraticis demonstrationes et
amplicationes novae, 1818; Werke II, 47–64
45. C.F. Gauß, Theorie der biquadratischen Reste. Zweite Abhandlung (Göttingen, 1832);
deutsche Übersetzung Maser 1889
46. K. Girstmair, Kroneckers Lösung der Pellschen Gleichung auf dem Computer. Math.
Semesterber. 53, 45–64 (2006)
Bibliography 333
47. M. Hall, Some equations y 2 = x 3 − k without integer solutions. J. Lond. Math. Soc. 28,
379–383 (1953)
48. F. Halter-Koch, Quadratische Ordnungen mit großer Klassenzahl. J. Numb. Theory 34, 82–94
(1990)
49. K. Halupczok, Euklidische Zahlkörper. Diplomarbeit (Hartung-Gorre Verlag, Konstanz,
1997)
50. S. Hambleton, Generalized Lucas-Lehmer tests using Pell conics. Proc. Am. Math. Soc. 140,
2653–2661 (2012)
51. S. Hambleton, F. Lemmermeyer, Arithmetic of Pell surfaces. Acta Arith. 146, 1–12 (2011)
52. S. Hambleton, V. Scharaschkin, Pell conics and quadratic reciprocity. Rocky Mt. J. Math. 42,
91–96 (2012)
53. G.H. Hardy, E.M. Wright, Einführung in die Zahlentheorie (R. Oldenbourg Verlag, München,
1958) √
54. M. Harper, A proof
√ that Z[ 14 ] is Euclidean, Ph.D. thesis, McGill University, 2000
55. M. Harper, Z[ 14 ] is Euclidean. Can. J. Math. 56, 55–70 (2004)
√
56. K. Hashimoto, L. Long, Y. Yang, Jacobsthal identity forQ( −2 ), Forum Math. 24, 1225–
1238 (2012)
57. H. Hasse, Über eindeutige Zerlegung in Primelemente oder in Primhauptideale in Integritäts-
bereichen. J. Reine Angew. Math. 159, 3–12 (1928)
58. H. Hasse, Über mehrklassige, aber eingeschlechtige reellquadratische Zahlkörper. Elem.
Math. 20, 49–59 (1965)
59. T.L. Heath, Diophantus of Alexandria. A Study in the History of Greek Algebra (Cambridge
University Press, Cambridge, 1910)
60. E. Hecke, Lectures on the Theory of Algebraic Numbers (Springer, Berlin, 1981)
61. D. Hilbert, Die Theorie der Algebraischen Zahlkörper, Jahresber. DMV 4, 175–546 (1897);
Engl. Transl. I. Adamson, The Theory of Algebraic Number Fields (Springer, New York,
1998)
62. F. Hirzebruch, D. Zagier, The Atiyah-Singer Theorem and Elementary Number Theory
(Publish or Perish, Boston, 1974)
63. J. Høyrup, Algebra in Cuneiform (Max-Planck-Gesellschaft zur Förderung der Wis-
senschaften, Berlin, 2017)
64. A. Hurwitz, Über eine Aufgabe der unbestimmten Analysis. Archiv. Math. Phys. 3, 185–196
(1907); Mathematische Werke 2, 410–421
65. K. Ireland, K. Rosen, A Classical Introduction to Modern Number Theory (Springer, New
York, 1990)
66. M. J. Jacobson, H. C. Williams, Solving the Pell Equation (CMS, New York, 2009)
67. E. Jacobsthal, Anwendungen einer Formel aus der Theorie der quadratischen Reste, Diss.,
Berlin, 1906
68. E. Jacobsthal, Über die Darstellung der Primzahlen der Form 4n + 1 als Summe zweier
Quadrate. J. Reine Angew. Math. 132, 238–246 (1907)
69. H.W.E. Jung, Einführung in die Theorie der quadratischen Zahlkörper (Jänicke, Leipzig,
1936)
70. L. Kronecker, Ueber die Potenzreste gewisser complexer Zahlen (Monatsber, Berlin, 1880),
pp. 404–407; Werke II, 95–101
71. E.E. Kummer, Zur Theorie der complexen Zahlen. J. Reine Angew. Math. 35, 319–326
(1847)
72. R.C. Laubenbacher, D. Pengelley, Eisenstein’s misunderstood geometric proof of the
quadratic reciprocity theorem. College Math. J. 25, 29–34 (1994)
73. V.A. Lebesgue, Recherches sur les nombres. J. Math. Pures Appl. 3, 113–144 (1838)
74. V.A. Lebesgue, Sur l’impossibilité, en nombres entiers, de l’équation x m = y 2 + 1. Nouv.
Ann. Math. (1) 9, 178–181 (1850)
75. D.H. Lehmer, On a problem of Störmer. Illinois J. Math. 8, 57–79 (1964)
76. F. Lemmermeyer, The Euclidean algorithm in algebraic number fields. Expositiones Math.
13, 385–416 (1995)
334 Bibliography
107. L.W. Reid, The Elements of the Theory of Algebraic Numbers (The Macmillan Co., New York,
1910)
108. P. Ribenboim, Catalan’s Conjecture (are 8 and 9 the only Consecutive Primes? (Academic,
Boston, 1994)
109. P. Ribenboim, Meine Zahlen, meine Freunde. Glanzlichter der Zahlentheorie (Springer,
Berlin, 2009)
110. P. Roquette, The Riemann Hypothesis in Characteristic p in Historical Perspective (Springer,
Cham, 2018)
111. W. Scharlau, H. Opolka, Von Fermat bis Minkowski. Eine Vorlesung über Zahlentheorie und
ihre Entwicklung (Springer, Berlin, 1980)
112. A. Scholz, Einführung in die Zahlentheorie (de Gruyter, Berlin, 1939)
113. R. Schoof, Catalan’s Conjecture (Springer, London, 2008)
114. C.-O. Selenius, Rationale of the Chakravala Process of Jayadeva and Bhaskara II. Hist. Math.
2, 167–184 (1975)
115. D. Shanks, On Gauss’s class number problems. Math. Comp. 23, 151–163 (1969)
116. H. Siebeck, Die recurrenten Reihen, vom Standpuncte der Zahlentheorie aus betrachtet. J.
Reine Angew. Math. 33, 71–77 (1846)
117. J. Silverman, J. Tate, Rational Points on Elliptic Curves (Springer, New York, 1992)
118. J. Sommer, Vorlesungen über Zahlentheorie. Einführung in die Theorie der algebraischen
Zahlkörper (Teubner, Leipzig, 1907)
119. C. Størmer, Solution d’un problème curieux qu’on rencontre dans la théorie élémentaire des
logarithmes, Nyt tidsskrift for matematik 19, 1–7 (1908)
120. G. Szekeres, On the number of divisors of x 2 + x + A, J. Number Theory 6, 434–442 (1974)
121. G. Terjanian, Sur l’équation x 2p + y 2p = z2p . C. R. Acad. Sci. Paris 285, 973–975 (1977)
122. A. Thue, Über Annäherungswerte algebraischer Zahlen. J. Reine Angew. Math. 135, 284–305
(1909)
123. E. Trost, Eine Bemerkung zur diophantischen Analysis. Elem. Math. 26, 60–61 (1971)
124. E. Trost, Solution of Problem E 2332. Am. Math. Monthly 87, p. 77 (1972)
125. L. Tschakaloff,
√ Unmöglichkeitsbeweis der Gleichung α 5 + β 5 = ηγ 5 im quadratischen
Körper K( 5 ). Tôhoku Math. J. 27, 189–194 (1926)
126. K. Vogel, Vorgriechische Mathematik II. Die Mathematik der Babylonier (Schroedel, Han-
nover, 1959)
127. K. Vogel (Hrsg.), Neun Bücher arithmetischer Technik (Ostwalds Klassiker der exakten
Naturwissenschaften, Braunschweig, 1968)
128. F. von Schafgotsch, Abhandlung über einige Eigenschaften der Prim- und zusammengeses-
tzten Zahlen, Abhandlung der Böhmischen Gesellschaft der Wissenschaften in Prag (1786),
pp. 123–159
129. L. von Schrutka, Ein Beweis für die Zerlegbarkeit der Primzahlen von der Form 6n + 1 in ein
einfaches und ein dreifaches Quadrat. J. Reine Angew. Math. 140, 252–265 (1911)
130. S. Wagstaff, The Joy of Factoring (AMS, Providence, 2013)
131. A. Wakulicz, On the equation x 3 + y 3 = 2z3 . Colloq. Math. 5, 11–15 (1957)
132. A. Weil, Number Theory: An Approach Through History from Hammurapi to Legendre
(Birkhäuser, New York, 1984)
133. A. Widmer, Über die Anzahl der Lösungen gewisser Kongruenzen nach einem
Primzahlmodul, Diss. ETH Zurich, 1919
134. D. Zagier, Zetafunktionen und quadratische Körper (Springer, Berlin, Heidelberg, 1981)
135. Ch. Zeller, Beweis des Reciprocitätsgesetzes für die quadratischen Reste (Monatsber, Berlin,
1872), pp. 846–847
136. G. Zolotareff, Nouvelle démonstration de la loi de réciprocité de Legendre. Nouv. Ann. Math
(2) 11, 354–362 (1872)
Name Index
Girard, A., 14 M
Goldbach, Ch., 13 Mersenne, M., 118
Guo, S., 5 Mignotte, M., 319
Mihailescu, P., 193
H Mollin, R., 175
Hadamard, J., 174 Monsky, P., 116, 195
Halter-Koch, F., 175 Monzingo, M.G., 72
Hambleton, S., 51 Mordell, L.J., 205
Hardy, G.H., 107 Motzkin, Th., 126
Harper, M., 127
Hashimoto, K., 72 N
Hasse, H., 128, 175, 249 Nagell, T., 117, 201, 205
Hecke, E., 42 Narayana, 185
Heegner, K., 128, 161 Noether, E., 25, 128
Helminck, A., 251 Nyberg, M., 184
Hermite, 29
Heron, 6 O
Hilbert, D., 111, 112, 212, 246 Opolka, H., 243
Hurwitz, A., 144 Oppenheim, A., 126
Hypatia, 8
P
Pépin, Th., 51, 117
J
Platon, 41
Jacobi, C.G.J., 23, 231
Plemelj, J., 122
Jacobsthal, E., 72
Plofker, K., 93, 167
Jung, H., 26
Ptolemy, 6
K
R
Kronecker, L., 25, 243, 246, 248
Rabinowitsch, J., 130
Kummer, E.E., 1, 23, 26, 33, 221
Regiomontanus, J., 8
Ribenboim, P., 130, 207
L
Lagrange, J.-L., 14, 47, 93, 167
S
Lamé, G., 49
Schafgotsch, F., 186
Langlands, R., 251
Scharaschkin, V., 51
Lebesgue, V.A., 78, 193
Scharlau, W., 243
Legendre, A.-M., 66, 117
Schönemann, Th., 113, 235
Lehmer, D.H., 118, 321
Scholz, A., 68
Leibniz, G.W., 241
von Schrutka, L., 72
Leveque, W.J., 205
Shanks, D., 184
Liouville, J., 49
Shimura, G., 248
Long, L., 72
Siebeck, H., 134
Lucas, É., 118, 187
Name Index 339
A Diophantine equation
Algebraic integer, 34 x 2 + y 2 = 2z2 , 49
Associated, 93 x 2 + y 2 = z2 , 1–3, 220
Automorphism, 32 x 2 − 2y 2 = 1, 39
x 2 − 3y 2 = 1, 51
B x 2 − 4xy + y 2 = 1, 46
Bézout domain, 99 x 2 − my 2 = 1, 167
Bézout property, 100 x 2 − xy − y 2 = 1, 44
Binet’s formula, 41, 223, x 3 + y 3 = z3 , 116
229, 232, 270 x 3 − y q = 1, 206
x 4 + y 4 = z2 , 12
C x 4 − 2x 2 = 1, 28
Cancellation law, 145 x 5 + y 5 = z5 , 122
Catalan’s conjecture, 193 x p − y 3 = 1, 206
Character, 223 x p − y q = 1, 193
group, 225 x 2p + y 2p = z2p , 81
primitive, 225, 226 x12 + x22 + x32 = 3x1 x2 x3 , 46
sum, 71 y 2 + 1 = x m , 193
Class number, 150 y 2 = 2x 3 − 1, 133
Common divisor, 96 y 2 = x 3 + 1, 195
Conductor, 65, 105, 225, 245 y 2 = x 3 + 17, 28
Conjugate, 31 y 2 = x 3 + 3x, 192
Coprime, 97 y 2 = x 3 + 4, 133
y 2 = x 3 + 7, 28
D y 2 = x 3 − 2, 8, 11, 19
Dedekind domain, 160 y 2 = x 3 − d, 155, 161, 162
Dedekind-Hasse criterion, 128 y 2 = x 3 − dx, 28
Degree, 33 Dirichlet character, 223
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2021 341
F. Lemmermeyer, Quadratic Number Fields, Springer Undergraduate
Mathematics Series, https://doi.org/10.1007/978-3-030-78652-6
342 Subject Index
P T
Pell equation, 93, 165 Theorem
Pell form, 229, 246 Kronecker-Weber, 246
functional equation, 230, 240, unique factorization, 3, 20, 22, 26
246, 252 Trace, 31
Pigeonhole principle, 168, 169, Tribonacci numbers, 251
190 Two-Squares Theorem, 14
Plane numbers
similar, 4 U
Plimpton 322, 1 Unique factorization domain, 96
Polynomial Unit, 92
cyclotomic, 253 Unit group, 92
modular, 244
Prime, 93 V
Prime discriminant, 70 Vieta jumping, 45, 51, 278
Prime ideal factorization
unique, 140 W
Prime number Wieferich pair, 206
inert, 109, 148
ramified, 148 Z
split, 148 Zolotarev symbol, 60