0% found this document useful (0 votes)

64 views

Learning and Experiencing Cryptography With Cryptool and Sagemath by Bernhard Esslinger

Uploaded by

gokusen67

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

64 views

Learning and Experiencing Cryptography With Cryptool and Sagemath by Bernhard Esslinger

Uploaded by

gokusen67

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 665

COMPUTER SECURITY

Esslinger
Learning and Experiencing Cryptography with CrypTool and SageMath presents
a broad overview of classic and modern cryptography and encourages you
to actively try out cryptography experiments and simulations using your
own data with modern open-source cryptography tools. This learn-by-doing
approach goes beyond simple teaching, showing you how to directly access
and use CrypTool (all versions), as well as the computer-algebra system (CAS)
SageMath, to derive direct feedback and results from your input.
LEARNING AND EXPERIENCING
CRYPTOGRAPHY WITH

LEARNING AND EXPERIENCING CRYPTOGRAPHY

All codes written with these open-source tools are available, and detailed
instructions for using each of them are provided. Chapters can be explored

CRYPTOOL AND SAGEMATH

independently and are enriched with references, web links, and abundant
footnotes, providing a comprehensive learning experience.

WITH CRYPTOOL AND SAGEMATH

The book covers a wide range of cryptography and cryptanalysis topics, with
a strong emphasis on the Rivest-Shamir-Adleman (RSA) encryption algorithm
and public-key infrastructure (PKI), aligning its teachings with the latest
recommendations from the U.S. National Institute of Standards and Technology
(NIST) and the German Federal Office for Information Security (BSI). Bernhard Esslinger
With its unique hands-on approach, this valuable resource has something
for everyone interested in cryptography, from students and self-learners
entering the field, to experienced developers and users seeking ideas and
understanding for practical implementations.

Bernhard Esslinger worked for SAP in various positions in Germany and

the United States until 1998. He headed the development department for all
security components of the SAP R/3 system and then SAP’s main product line.
He was also SAP ‘s global chief information security officer (CISO). From 1998
to 2013 he worked for Deutsche Bank as global head of information security
in the corporate center there and later as head of the competence center for
cryptography. Since 2008 he has been an honorary professor for business
informatics, and researches and teaches at Faculty III for economics, business
informatics, and business law at the University of Siegen. CrypTool was
developed under his leadership since and has been continuously expanded for
more than twenty years.

ISBN: 978-1-68569-017-5

ARTECH HOUSE
BOSTON I LONDON
www.artechhouse.com

605d32 - C: 56, M: 47, Y: 89, K: 33

92864b - C: 42, M: 38, Y: 82, K: 11
i i

“Esslinger” — 2023/11/30 — 19:47 — page i — #1

“Esslinger” — 2023/11/30 — 19:47 — page iv — #4

Contents

Preface xv

Acknowledgments xix

Introduction xxi

CHAPTER 1
Ciphers and Attacks Against Them 1
1.1 Importance of Cryptology 2
1.2 Symmetric Encryption 2
1.2.1 AES 4
1.2.2 Current Status of Brute-Force Attacks on Symmetric Algorithms 4
1.3 Asymmetric Encryption 5
1.4 Hybrid Procedures 7
1.5 Kerckhoffs’ Principle 7
1.6 Key Spaces: A Theoretical and Practical View 8
1.6.1 Key Spaces of Historic Cipher Devices 8
1.6.2 Which Key Space Assumptions Should Be Used 11
1.6.3 Conclusion of Key Spaces of Historic Cipher Devices 13
1.7 Best Known Attacks on Given Ciphers 14
1.7.1 Best Known Attacks Against Classical Ciphers 15
1.7.2 Best Known Attacks Against Modern Ciphers 15
1.8 Attack Types and Security Definitions 16
1.8.1 Attack Parameters 16
1.8.2 Indistinguishability Security Definitions 20
1.8.3 Security Definitions 21
1.9 Algorithm Types and Self-Made Ciphers 24
1.9.1 Types of Algorithms 24
1.9.2 New Algorithms 24
1.10 Further References and Recommended Resources 24
1.11 AES Visualizations/Implementations 25
1.11.1 AES Animation in CTO 26
1.11.2 AES in CT2 26
1.11.3 AES with OpenSSL at the Command Line of the Operating
System 28
1.11.4 AES with OpenSSL within CTO 29
1.12 Educational Examples for Symmetric Ciphers Using SageMath 29

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:47 — page vi — #6

i i

vi Contents

1.12.1 Mini-AES 29
1.12.2 Symmetric Ciphers for Educational Purposes 32
References 32

“Esslinger” — 2023/11/30 — 19:47 — page xiv — #14

i i

xiv Contents

12.7.1 Suggestions for Choice of Scheme 576

12.7.2 Year 2023: Conclusion Remarks 577
References 577

CHAPTER 13
Future Use of Cryptography 581
13.1 Widely Used Schemes 581
13.2 Preparing for Tomorrow 583
13.3 New Mathematical Problems 584
13.4 New Signatures 585
13.5 Quantum Cryptography: A Way Out of the Dead End? 585
13.6 Post-Quantum Cryptography 585
13.7 Conclusion 586
References 587

APPENDIX A
Software 589
A.1 CrypTool 1 Menus 589
A.2 CrypTool 2 Templates and the WorkspaceManager 590
A.3 JCrypTool Functions 592
A.4 CrypTool-Online Functions 594

APPENDIX B
Miscellaneous 601
B.1 Movies and Fictional Literature with Relation to Cryptography 601
B.1.1 For Grownups and Teenagers 601
B.1.2 For Kids and Teenagers 612
B.1.3 Code for the Light Fiction Books 614
B.2 Recommended Spelling within the CrypTool Book 615
References 616

About the Author 617

Index 621

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:47 — page xv — #15

i i

CHAPTER 0
CHAPTER 0

Preface

The rapid spread of the internet has led to intensified research in the technologies
involved, especially within the area of cryptography where a good deal of new
knowledge has arisen.
This book provides a thorough overview of classical and modern cryptography.
In particular, it also guides you very specifically to try it out. The CrypTool (CT)
programs are used for this purpose, as well as sample code written for the computer-
algebra system SageMath. Both CrypTool and SageMath are open-source and free.
Another special feature is the selection of topics and the clear statements for
users. On the one hand, the theory is presented, but it also emphasizes which
procedures are really reliable and which official recommendations exist where.
To our knowledge, this book contains the first concrete summary in book form
of what concrete tasks the science of historical cryptology consists of.
This book is written for both laymen and beginners, as well as for students and
practitioners who would like to delve deeper into this field. Anyone who enjoys
prime numbers or wants to know what modern lattice methods are will find very
understandable information here. For a large number of ciphers, you can find in
tabular form what are currently the best attacks on them.
The first chapter of this book explains the principles of symmetric and
asymmetric encryption and discusses definitions for their resistibility.
Because of didactic reasons the second chapter gives an exhaustive overview
of paper-and-pencil encryption methods and explains a typical example of a
precomputer machine cipher that later became embroiled in scandal.
Chapter 3 gives a comprehensive overview about historical cryptology, a new
research area, which deals with the practical problems of cryptanalyzing and
contextualizing encrypted historical documents.
A major part of this book is dedicated to the fascinating topic of prime numbers
(Chapter 4).
Then, Chapter 5 introduces modular arithmetic and elementary number theory
using numerous examples. Here, the features of the RSA procedure are a key aspect.
Chapter 6 provides insights into the mathematical ideas and concepts behind
modern asymmetric cryptography including a new geometric illustration of the
processes involved in RSA encryption.
Chapter 7 gives a very brief overview about the status of attacks against modern
hash algorithms and is then briefly devoted to digital signatures and public-key
infrastructures, which are an essential component of e-business applications.
Chapter 8 describes elliptic curves, which are an alternative cryptosystem to
RSA and are particularly well suited for use on smart cards.

i i

CUDA programs on Linux and Windows.
As with the e-learning program CrypTool, the quality of the book grows with
the suggestions and proposals for improvement from you, the reader. We look
gratefully forward to your feedback.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:47 — page xviii — #18

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:47 — page xix — #19

i i

CHAPTER 0
CHAPTER 0
Acknowledgments

There are many people involved in the creation and publication of a comprehensive
book about cryptography.
At this point I’d like to thank explicitly the following people who in particular
contributed to the CrypTool project. They applied their very special talents and
showed really great engagement:
• Mr. Henrik Koy
• Mr. Jörg-Cornelius Schneider
• Mr. Florian Marchal
• Dr. Peer Wichmann
• Mr. Dominik Schadow
• Mr. Simon Leischnig
• Dr. Nils Kopal
• Staff of Prof. Johannes Buchmann, Prof. Claudia Eckert, Prof. Alexander
May, Prof. Torben Weis, and especially Prof. Arno Wacker
The students must be mentioned who contributed through their far over 100
bachelor’s and master’s theses. Also, I want to thank the many people not mentioned
here for their focused work (mostly carried out in their spare time).
Thanks to Bart Wessel and George Lasry for information about the differences
between the C-52/CX-52 models. Thanks to Georg Illies for pointing me to Pari-
GP. Thanks to Lars Fischer for his help with fast Pari-GP code for primitive roots.
Thanks to Volker Simon for writing the SageMath Example 5.36. Thanks to Minh
Van Nguyen from Australia for his always quick, professional, and exhaustive help
with the first SageMath code samples. It’s a pity that he is no longer reachable
… Many thanks to Klaus Pommerening, who handed over the script of his lecture
about symmetric cryptography to the CrypTool project and who shared our love
for SageMath. We then first extended his script together. Subsequently, the editor
took over Chapter 9 on his own. R.I.P. Prof. Pommerening—we lost an admirable
person.
The contributors to this book would like to take this opportunity to thank
their colleagues in the particular companies and at the universities of Bochum,
Darmstadt, Frankfurt, Gießen, Karlsruhe, Lausanne, Munich, Paris, and Siegen.
A special thank you to Dr. Doris Behrendt, who took over the laborious task
to bring two books of 500+ pages to KOMA-Script, to clean up and modernize the
TeX sources written by different authors over years, and in addition critically read
the content.

xix

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:47 — page xx — #20

i i

xx Acknowledgments

Thanks also to the readers who sent us feedback, and especially to Olaf
Ostwald, Helmut Witten, and Prof. Ralph-Hardo Schulz for constructively proof-
reading many parts of this book. And to Herbert Voss, who helped us when things
got difficult in LaTeX. And finally, many thanks to the publisher’s staff and their
external reviewer, all of whom have been very helpful in keeping everything focused.
I hope that many readers have fun with this book and that they get out of
it more interest and a greater understanding of this modern but also very ancient
topic.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:47 — page xxi — #21

i i

CHAPTER 0
CHAPTER 0

Introduction

This section shows how the book and the programs work together.

The CrypTool Book

The chapters of this book are largely self-contained and can be read and understood
independently of the CrypTool programs. In the following, we often abbreviate
“CrypTool” by CT.
Chapters 6 (“Modern Asymmetric Cryptography”), 8 (“Elliptic Curves”), 9
(“Modern Asymmetric Cryptography”), 10 (“Homomorphic Ciphers”), and 12
(“Results for Solving Discrete Logarithms and for Factoring”) require a deeper
knowledge of mathematics, while the other chapters should be understandable with
a high school diploma.
The authors have tried to describe cryptography for a broad audience—without
being mathematically incorrect, but with various links to get practical experience.
We believe that this didactic approach is the best way to promote awareness of IT
security and the willingness to use standardized modern cryptography.
This book provides a thorough overview of classical and modern cryptography
and also guides you to try it out using the following free programs.

The Programs CrypTool 1, CrypTool 2, and JCrypTool

CrypTool 1 (CT1) and its successor versions CrypTool 2 (CT2) and JCrypTool
(JCT) are used worldwide for training in companies and teaching in schools and
universities. CrypTool 1 is an educational program for Windows that allows you to
use and analyze cryptographic procedures within a unified graphical user interface.
The comprehensive online help in CrypTool 1 contains both instructions on how to
use the program and explanations of the methods themselves (both not as detailed
and in a different structure as in this book).
CT2 also runs on Windows and now has a significantly larger range of crypt-
analytic functions than CT1. JCT runs on Windows, Linux, and macOS, and now
includes many things not included in CT1.
The setups of these standalone desktop programs are downloaded more than
10,000 times a month.

The Programs on CrypTool-Online (CTO)

The CrypTool-Online website (http://www.cryptool-online.org or https://www

.cryptool.org/en/cto/), where you can try out and use cryptographic methods in
a browser on a PC, tablet, or smartphone, is another part of the CT project.

xxi

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:47 — page xxii — #22

i i

xxii Introduction

The scope of CTO is not yet as broad as that of the standalone CT1, CT2, and
JCT programs. However, as CTO is what people are using more and more as a first
contact, a lot of effort is going into the CTO development. So we redesigned the
backbone and frontend system using modern web technology to provide a fast, con-
sistent, and responsive look and feel. CTO also includes plugins using WebAssembly
(wasm) such as a Python IDE, Msieve, or OpenSSL. Using WebAssembly makes this
functionality run in a browser almost as fast as native applications. Another mod-
ern technology offered is models for cryptanalysis trained with machine learning
algorithms (deep learning, neural networks). See Section A.4.
Besides the classic ciphers, the most popular plugins in CTO are “RSA step-
by-step,” “RSA visual and more,” “AES step-by-step,” “AES animation,” and the
“Password meter.”

MysteryTwister

MTW is the abbreviation for MysteryTwister (https://www.mysterytwister.org), an

international cryptography contest (“cipher contest by cryptool”), which is also
based on the CT project. Here you can find cryptographic puzzles in four categories,
a high-score list, and a moderated forum.
As of July 2023 more than 10,000 users are participating, and more than 360
challenges are offered (301 of them are solved by at least one participant).

The SageMath Computer-Algebra System

SageMath is a comprehensive open-source CAS package that can be used to easily

program the mathematical methods explained in this book. A special feature of this
CAS is that the scripting language is Python (version 3 since SageMath 9). Thus, in
a Sage script, you have at your disposal not only the mathematical commands of
SageMath, but also all the functions of the Python language. SageMath is increas-
ingly becoming the standard CAS system at universities. Since SageMath 8, there is
also a version for Windows that runs in a Bash shell.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 1 — #1

i i

CHAPTER 1
CHAPTER 1
Ciphers and Attacks Against Them

For centuries, plaintext messages were encrypted by the military, by diplomats, and
by alchemists, and much less frequently by businesses and the general population.
The goal of cryptography was to protect the privacy between sender and receiver.
Since the 1970s, further goals have been added to achieve integrity, authenticity,
and non-repudiation, and also to compute on encrypted data in the cloud or to
achieve quantum-computer resistance.
The science that deals with encryption is called cryptology—divided into the
branches of cryptography (designing secure encryption procedures) and cryptanal-
ysis (breaking encryption procedures). In reality, however, these branches are
closely interrelated and the terms cryptography and cryptology are often used
interchangeably. Therefore, cryptology is currently subdivided into fields like sym-
metric cryptography, public-key cryptography, hardware and embedded systems
cryptography, theoretical cryptology, and real-world crypto [1].
The importance of cryptology continues to grow as our society becomes more
and more dependent on information technology. Although cryptology and infor-
mation security are interdisciplinary fields of research, mathematics now plays the
largest role in cryptology. Finally, learning about cryptology can also be fun and
entertaining.
The special thing about this book is that you can always try out the procedures
right away—by using the links (in the footnotes) to the programs from the CrypTool
project, from OpenSSL, or from SageMath. All these programs are open-source.
In this book, the basics are covered in great detail, then from the very extensive
field of cryptology certain (current) topics are selected (like RSA, ECC, or lattices).
This makes this book accessible to a wide audience, not just only for those interested
in the natural sciences.
This chapter introduces the topic in a more descriptive way without using math-
ematics. To do so, it uses modern methods (RSA, AES) as examples. Then we dive
deepen, for example, the property, how many possible keys (key space) different
methods have (Section 1.6) and what are the best attacks against known methods
(Section 1.7). Recommended books are presented in Section 1.10. In Section 1.11
you will find screenshots of how to use AES in various programs. Classic methods
are presented in Chapters 2 and 3.
The purpose of encryption is to change data (plaintext messages) in such a way
that only an authorized recipient is able to reconstruct the plaintext. This allows
us to transmit encrypted data without worrying about it getting into unauthorized
hands. Authorized recipients possess a secret information—called the key—which
allows them to decrypt the data while it remains hidden from everyone else. An
attacker cannot only try to break a cipher: She still can disturb the connection

i i

6 Ciphers and Attacks Against Them

Figure 1.3 Asymmetric or public-key encryption.

the letter through it. After that, I can no longer read or change the message myself,
because only the legitimate recipient has the key for the letter box.
The advantage of asymmetric procedures is the easier key management. Let’s
look again at a network with n subscribers. In order to ensure that each participant
can establish an encrypted connection to each participant, each participant must
possess a pair of keys. We therefore need 2n keys or n pairs of keys. Furthermore,
no secure channel is needed before messages are transmitted, because all the infor-
mation required in order to communicate confidentially can be sent openly. In this
case, you simply have to pay attention to the accuracy (integrity and authenticity)
of the public key. Nevertheless, the requirements for the key generation are not triv-
ial. What could go wrong is explained, for example, in Section 5.12.5.4. Besides
that, nowadays also (public-key) infrastructures themselves are targets of cyberat-
tacks. A disadvantage of pure asymmetric procedures is that they take a lot longer
to perform than symmetric ones (see Section 1.4).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 7 — #7

i i

1.4 Hybrid Procedures 7

The most well-known asymmetric procedure is the RSA algorithm,7 named

after its developers Ronald Rivest, Adi Shamir, and Leonard Adleman. The RSA
algorithm was published in 1978. The concept of asymmetric encryption was first
introduced by Whitfield Diffie and Martin Hellman in 1976. It is worth noting
that the concept was known at the secret services Government Communications
Headquarters (GCHQ) and National Security Agency (NSA) several years prior to
its independent rediscovery by Diffie and Hellman. Today, the ElGamal procedures
also play a decisive role, particularly the Schnorr variant in the Digital Signature
Algorithm.
The German Federal Office for Information Security (BSI) requires a security
level of 120 bit for processes used beyond 2022. Applied to RSA, the corresponding
technical guideline recommends a key length of 3,000 bit (see [7], page 18, comment
on Table 1.2).

1.4 Hybrid Procedures8

In order to benefit from the advantages of symmetric and asymmetric techniques

together, hybrid procedures are usually used (for encryption) in practice.
In this case the bulk data is encrypted using symmetric procedures. The key used
for this is a secret session key generated by the sender randomly that is only used
for this message. This session key is then encrypted using the asymmetric procedure
and transmitted to the recipient together with the message. Recipients can determine
the session key using their private keys and then use the session key to decrypt the
message.
In this way, we can benefit from the feasible key management of asymmet-
ric procedures (using public/private keys) and we benefit from the efficiency of
symmetric procedures to encrypt large quantities of data (using secret keys).

1.5 Kerckhoffs’ Principle

In 1883, the Dutch cryptographer Auguste Kerckhoffs formulated six principles for
the construction of secure military encryption procedures. The second one, Kerck-
hoffs’ principle or Kerckhoffs’ maxim, is now regarded as the principle of modern
cryptography. It states that an encryption scheme should be secure even if every-
thing about the scheme is known except the key used. Kerckhoffs’ principle is often
contrasted with “security through obscurity,” in which the encryption algorithm
must also be kept secret.

7. The RSA algorithm is extensively described within this book in Section 5.10. The topical research results
concerning RSA are described in Section 5.12. In Section 6.5 the RSA algorithm is more deeply reasoned
from number theory: The RSA plane is a model to illustrate the processes in this algorithm using pictures
of rectangles.
8. - Using CT1 Encrypt/Decrypt F Hybrid you can follow the single steps and its dependencies with concrete
numbers. The variant with RSA as the asymmetric algorithm is graphically visualized; the variant with ECC
uses the standard dialogs. In both hybrid cases AES is used as the symmetric algorithm.
- Using JCT Algorithm Perspective F Hybrid Ciphers also offers hybrid methods like ECIES.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 8 — #8

i i

8 Ciphers and Attacks Against Them

Kerckhoffs’ principle was reinterpreted several times. For example, Claude

Shannon formulated that one should design encryption systems under the assump-
tion that an enemy knows the system exactly from the very beginning (Shannon’s
maxim).

1.6 Key Spaces: A Theoretical and Practical View

For good encryption procedures used today, the time needed to break an encryption
is so long that it is almost impossible to do so. Such procedures are considered (prac-
tically) secure—from an algorithm’s point of view. After the knowledge gathered
by Edward Snowden, there were many discussions debating whether encryption is
secure. In [9] is the result of an evaluation, which cryptographic algorithms can
be relied on—but only according to current knowledge. The article investigates:
Which cryptosystems can—despite the reveal of the NSA/GCHQ attacks—still be
considered as secure? Where have systems been intentionally weakened? How can
we create a secure cryptographic future? What is the difference between math and
implementation?
The key space of a cipher is an important indicator for the security of a cipher. In
a monoalphabetic substitution (MASC; also called simple substitution) for instance,
using an alphabet of length of k, the key space is k !. For AES-128 it is 2128 .
A (sufficiently) large key space (approx. 2100 ) is a necessary prerequisite for
a secure cipher, but not a sufficient condition: The MASC has a large key space
(with an alphabet of 26 characters approx. 288.4 that corresponds to the number of
possible ciphertext alphabets), but it has been cracked with frequency analysis for
centuries.
The key space is used to calculate the effort required for a brute-force (BF)
attack (i.e., for the systematic testing of all possible keys). If the key space is so small
that an attacker can carry out a complete BF attack, the procedure is broken—not
only theoretically but also practically.
In the case of a BF attack, the attacker decrypts the ciphertext (or parts of it)
with every possible key (see Section 1.2.2). Then the found plaintext is evaluated.
How surprisingly well fitness algorithms can recognize correct natural texts can be
seen in Figures 1.49 and 1.5.10 CT1 uses similar fitness functions as the solvers and
analyzers in CT2.
Whether an attacker really has to try the maximal, theoretical key space is
questionable, at least with the older ciphers. For this reason, the practical key space
introduced by Ralph Simpson for historic cipher devices and the work factor, which
is also known as attack time, are considered.

1.6.1 Key Spaces of Historic Cipher Devices

Key spaces of historic cipher devices are often reported in the popular press as a
gargantuan number designed to impress the reader about the incredible strength
of the encryption. This is often a lead-in to the story of the amazing ingenuity of

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 13 — #13

i i

1.6 Key Spaces: A Theoretical and Practical View 13

4. A “message setting” will be sent with each message, separate from the fixed
machine setting.

1.6.2.3 Explanation of the NSA Key Space Assumptions

These assumptions detailed above seem reasonable and straightforward, except for
possibly the last assumption of both the maximum and practical key spaces: A
“message setting” will be sent with the message, separate from the fixed machine
setting. The meaning and effect of this assumption requires further explanation.
For the Enigma, all possible wirings of the rotors are included in the max-
imum key space. Also, Miller includes the rotational starting positions of the
rotors. Including the rotor starting position in the key space—besides it is already
accounted for in all possible wirings—can be considered as redundant.
For instance, if a rotor is in position “A” and a particular wiring scheme is
determined to be correct, that same wiring scheme could be advanced one position
and now this new wiring scheme works when the rotor is moved to position “B.” So
all wiring schemes should yield 26 correct solutions as you rotate the rotor through
the 26 positions. It seems you should just ignore the rotor starting position for the
three rotors, which accounts for a contribution to the key space of 263 . For this
reason, many others have reported the Enigma key space without this factor.
We don’t go deeper here into Enigma. There are many books and articles about
this rotor machine and its history. A good summary of its design (flaws) and another
approach calculating its relevant key space can be found in [11].
By including the rotor setting in the key space, Miller was allowing for a slightly
larger key space that would break all daily messages after cryptanalysis of the first
message. All subsequent messages using the same machine setting could then be
decrypted in real time, just as the enemy would decrypt their own message.
Miller’s rationale of the rotor position applies to all the rotor-based historic
cipher devices, including the mechanical devices, like the Hagelin M-209. For this
machine, all possible pin settings on each rotor are analyzed and included in the
key space. So knowledge of the rotor rotational position is not necessary to break a
message. The pin settings are part of the machine setting and fixed for the day, and
the rotor setting is part of the message setting, which changes with every message.
Again, just like in the case of the Enigma, the rotor positions must be known to
break all daily messages in real time.

1.6.3 Conclusion of Key Spaces of Historic Cipher Devices

Having a clearly defined set of assumptions for key spaces, the key spaces could be
calculated accordingly.
Table 1.1 lists 34 historic and 4 modern cipher systems, showing the maximum
and practical key spaces for each one, using that same set of assumptions. This
table was first presented to the International Conference on Cryptographic History
(ICCH) group [12] by Ralph Simpson in Decmber 2022. The key spaces for some
of these devices have not been previously reported, such as the Hebern, Japanese
Purple machine, NEMA, KL-7, Transvertex HC-9, Russian VIC, and Hagelin CD-
57. Most of the other historic cipher devices required new calculations to match
the maximum and practical assumptions listed above.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 14 — #14

i i

14 Ciphers and Attacks Against Them

Table 1.1 Key Space Sizes for 34 Historic and 4 Modern Cipher Systems
Year Cipher Maximum Key Space Practical Key Space
600 BCE Monoalphabetic substitution 4.03 · 1026 288 4.03 · 1026 288
50 BCE Caesar 2.50 · 101 25 2.50 · 101 25
1466 Vigenère (repeating keyword – 15 char.) 1.68 · 1021 271 1.68 · 1021 271
1586 Vigenère (autokey – 314 char. message) 2.00 · 10444
21476 2.00 · 10444
21476
1854 Playfair 6.20 · 1023 279 6.20 · 1023 279
1860s Wheatstone Cryptograph 4.03 · 1026 288 4.03 · 1026 288
1912 Lugagne Transpositeur 1.30 · 10532
21768 1.32 · 1013
244
1912 M-94 cylinder cipher 3.45 · 10666
22214 3.88 · 1026
288
1916 M-138A strip cipher 3.69 · 10799 22656 1.95 · 1059 2197
1918 ADFGX 4.19 · 1047 2158 4.19 · 1047 2158
1918 ADFGVX 1.01 · 1064
2213 1.01 · 1064
2213
1922 Hebern 5-rotor 1.27 · 10140
2466 4.56 · 1010
235
1924 Kryha 2.02 · 1053 2177 1.78 · 1029 297
1926 Enigma Swiss K 1.60 · 10101 2336 1.85 · 109 231
1930 Lugagne Le Sphinx 1.30 · 10532
21768 2.43 · 1024
281
1931 Abwehr Enigma G 7.17 · 10121
2405 4.82 · 1010
235
1932 Enigma I 3.28 · 10114 2380 4.31 · 1022 275
1937 SIGABA 1.82 · 10285 2941 5.95 · 1028 296
1939 Japanese Purple 3.81 · 1059
2198 1.45 · 1031
2104
1939 Japanese JN-25 codebook (100 words) 1.00 · 1012
240 8.25 · 1010
236
1941 Lorenz SZ40/SZ42 1.05 · 10170 2565 1.05 · 10170 2565
1941 SG-41 “Hitler Mill” 4.24 · 1051 2171 4.24 · 1051 2171
1942 M-209 pin & lug 6.16 · 1060
2202 6.02 · 1058
2195
1942 Enigma M4 2.33 · 10145
2483 3.13 · 1025
285
1942 T-52d Geheimschreiber 7.23 · 10213 2710 8.11 · 1023 279
1943 Typex Mark 22 1.82 · 10195 2649 5.51 · 1054 2182
1947 NEMA 5.99 · 10164
2551 1.83 · 1019
264
1952 Hagelin C-52 1.68 · 10117
2389 7.17 · 1057
2192
1952 Hagelin CX-52 1.17 · 10123 2409 1.10 · 10104 2346
1952 KL-7 5.87 · 10431
21434 1.70 · 1034
2114
1950s Transvertex HC-9 2.96 · 1071
2237 4.39 · 1069
2231
1953 VIC paper & pencil 9.09 · 1040 2136 1.00 · 1027 290
1956 Fialka 2.82 · 10458 21523 6.24 · 1077 2258
1957 Hagelin CD-57 1.52 · 10103
2343 1.49 · 1060
2200
1976 DES (56 bit) 7.21 · 1016 256 7.21 · 1016 256
1977 RSA-4096 2.22 · 101225 24071 2.22 · 101225 24071
1992 AT&T TSD 3600-E Clipper chip 1.21 · 1024 280 1.21 · 1024 280
2001 AES-256 1.16 · 1077 2256 1.16 · 1077 2256
Courtesy of Ralph Simpson.

It is important to remember that these key spaces are still not a good sole indi-
cator of the cryptographic strength of the encryption method—examples for these
criticisms are monoalphabetic substitution (288 ), Enigma I (275 ), and Playfair (279 ).
But using a common set of assumptions will at least add a level of consistency
among all these disparate devices.

1.7 Best Known Attacks on Given Ciphers

Tables 1.2 and 1.3 contain the best attacks known today for well-known classical
and modern ciphers. For modern procedures, the effort (number of steps or attack

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 15 — #15

i i

1.7 Best Known Attacks on Given Ciphers 15

time) is also given in Table 1.3. To our knowledge, this is the first time such a
complete table is created.
For symmetric ciphers, the key space derived from the key length is an impor-
tant indicator (see Section 1.6). It is used to calculate the effort required for a BF
attack, the maximum effort that an attacker can have.
The following applies to AES-128 (see Table 1.3): The key length is 128 bits.
The key space is 2128 and so is the theoretical attack time. The best known attack
(biclique attack) reduces this maximum effort to 2126.1 steps. This difference of
around 2 in the exponent means that the attack is about 4 times faster than a BF
attack on average. This shows that AES is vulnerable in principle, but this attack is
not at all relevant to practical security.

1.7.1 Best Known Attacks Against Classical Ciphers

The historical ciphers shown in Table 1.2 represent different periods in the history
of cryptography, ranging from simple Caesar ciphers to more complex machine-
assisted systems like Enigma. These selections are based on their historical signif-
icance. The attack types and methods shown in the table are the currently best
known computerized methods for attacking these ciphers. All of the hand ciphers
are vulnerable to simulated annealing and hill climbing. Composed ciphers, in
our example here ADFGVX, need more sophisticated methods. With ADFGVX,
a divide-and-conquer attack can be used to break substitution and transposi-
tion independently. Also noteworthy is SIGABA, since it can be attacked with
a meet-in-the-middle attack. Additionally, all shown hand ciphers (substitution,
transposition, and composed ciphers) can today be attacked in a pure ciphertext-
only scenario. An exception are nomenclature ciphers, since the nomenclature
elements (code words) can often only be decrypted when having either the original
key or enough context to deduce them. Also, the chances of successfully attack-
ing cipher machines, such as the Enigma and Typex, are enhanced when a crib (a
partially known plaintext) is available. Only attacks on SIGABA still require the
complete plaintext to be successful.

1.7.2 Best Known Attacks Against Modern Ciphers

Table 1.3 presents a selection of modern ciphers and the best attacks against them.
The table includes historically significant ciphers such as DES and FEAL, ISO stan-
dards like AES, Camellia, and SNOW 2, national standards like GOST and SM4,
and ciphers that were actively used in industrial solutions such as KeeLoq and A5.1.
Cipher names typically encompass a family of encryption methods rather than refer-
ring to a single algorithm. These algorithms usually differ in the size of the key used
and, in the case of block ciphers, the size of the data block. It is important to note
that the best attacks against various versions of a cipher may differ. For the sake of
brevity, we provide a single example from each cipher family and present the most
successful attack against it.
In the right-most column of Table 1.3, the term “attack time” is used. “Time” is
an established term used in modern cryptography. In order to understand what the
attack time—as a measure for the resistability of a cipher—means, see Section 1.8
which introduces attack costs and different attack types.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 16 — #16

i i

16 Ciphers and Attacks Against Them

Table 1.2 Best Known Attacks Against 17 Historical Ciphers

Cipher Attack (Best) Cryptanalysis Methods References
Requirements
Substitution ciphers
Caesar PCO Brute force, frequency analysis [13]
Monoalphabetic substitution PCO Hill climbing, frequency analysis [13]
Homophonic substitution PCO Hill climbing / simulated [14]
annealing
Nomenclatures PCO Manual (deduced by context; or [15, 16]
nomenclature available)
Polyalphabetic substitution PCO Hill climbing / simulated [13]
annealing / (Friedman + Kasiski)
Playfair PCO; crib Simulated annealing [17, 18]
Code books PCO; crib/KP Manual (deduced by context; [19]
availability of similar code book)
Chaocipher PCO Hill climbing / simulated [20]
annealing
Transposition ciphers
Scytale PCO Brute force [13]
Columnar transposition PCO Brute force (short keys) / hill [21]
climbing / simulated annealing
Double columnar transposition PCO Hill climbing / simulated [22]
annealing; IDP attack
Composed
ADFGVX PCO DAC + hill climbing / simulated [23]
annealing
Machines
Enigma PCO, crib DAC; hill climbing / simulated [24, 25, 26]
annealing; Turing Bombe
Typex PCO, crib DAC; hill climbing / simulated [24, 25, 26]
annealing; Turing Bombe
SZ42 PCO, crib Testery methods and hill climbing [27]
M209 PCO, crib Simulated annealing / hill [28, 29]
climbing
SIGABA KP Meet in the middle; hill climbing / [30, 31]
simulated annealing
PCO = pure ciphertext-only, KP = known-plaintext, DAC = divide and conquer.

1.8 Attack Types and Security Definitions

Attack time. The attack time is generally expressed in the number of a partic-
ular cipher’s encryptions. This is done in order to demonstrate by which factor

divided into the following four main categories based on the type of access to the
ciphertext and plaintext (assuming the key is always unknown):

• Ciphertext-only attacks (COA) assume access only to ciphertexts without

knowledge of corresponding plaintexts;
• Known-plaintext attacks (KPA) involve pairs of known plaintext and their
corresponding ciphertext, aiming to recover the secret key;
• Chosen-plaintext attacks (CPA) allow the attacker to choose arbitrary plain-
texts and obtain their ciphertexts, providing flexibility in analyzing the
encryption algorithm;
• Chosen-ciphertext attacks (CCA) enable the attacker to choose arbitrary
ciphertexts and obtain their plaintexts, possessing the power to manipulate
ciphertexts during decryption.

Additionally, attacks differ based on specific mathematical methods, such as

differential cryptanalysis (analyzing how differences between inputs of the ciphers
affect resultant differences between outputs), linear cryptanalysis (exploiting lin-
ear relationships in the encryption process), meet-in-the-middle, biclique, integral,
boomerang, cube, and other attacks. All these methods are unique, so we refer to
the provided references for a comprehensive explanation.

1.8.2 Indistinguishability Security Definitions

The attack types CPA and CCA have a direct relationship with the cryptographic
security definitions IND-CPA, IND-CCA1, and IND-CCA2. These definitions play
a crucial role in the provable security branch of cryptography. This field focuses on
proving mathematically the security of the cryptographic schemes. This is achieved
by demonstrating that breaking a certain scheme would require solving a problem
that is widely known to be difficult, such as factoring large numbers or computing
discrete logarithms.

Indistinguishability under chosen-plaintext attack (IND-CPA). In this model, an

attacker is allowed to choose arbitrary plaintexts and obtain the corresponding
ciphertexts from the encryption oracle as many times as he needs. Then the adver-
sary chooses two distinct challenge messages and sends them to the encryption
oracle, which returns a ciphertext of just one of them called challenge ciphertext.
After that, the attacker is allowed to perform any number of additional computa-
tions and encryptions. An encryption scheme is considered secure if the attacker
can’t guess to which plaintext the challenge refers to with the probability higher

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 21 — #21

i i

1.8 Attack Types and Security Definitions 21

than |1/2 + η| where η is negligible. Clearly, the attacker cannot choose the same
messages for the challenge for which he gets the ciphertexts from the oracle. This
security definition can be applied to both symmetric and asymmetric encryption
schemes, although formally they are described differently [101]. However, in case
of deterministic asymmetric encryption schemes, an attacker has access to the pub-
lic key, which means that he can easily distinguish which ciphertext was produced
by which message by encrypting the messages by himself. Therefore, the definition
is only applied to probabilistic public-key encryption schemes where randomness is
used in the encryption process. This implies that the same message encrypted several
times under the probabilistic encryption scheme results in different ciphertexts.

Indistinguishability under chosen-ciphertext attack, also known as nonadaptive or

lunchtime attack (IND-CCA1). This security definition imposes a higher level of
security than IND-CPA. In this model, an attacker can choose both the plaintexts
and obtain their corresponding ciphertexts from the oracle, and also decrypt arbi-
trary ciphertexts and get the corresponding plaintexts. The further procedure is
similar to the IND-CPA case. However, in the case of IND-CCA1 after the adversary
gets the challenge the decryption oracle becomes unavailable.

Indistinguishability under adaptive chosen-ciphertext attack (IND-CCA2). This is

the strongest definition providing the highest level of security. It allows the attacker
to continue to interact with the decryption oracle even after the challenge ciphertext
is received.
When considering modern cryptographic encryption primitives, selecting the
best attack is not a straightforward task. In Table 1.3, we have kept the informa-
tion concise and prioritized key-recovery attacks requiring minimal computation
and being faster than brute-force, which is a universal attack method against any
encryption algorithm. By this prioritizing, we have left out other complexities such
as data and memory costs (e.g., number of required plaintext-ciphertext pairs).
Single-key scenarios are typically targeted, except for two exceptions in our
table: the related-key attack against Kasumi cipher and the variable-key attack
against RC4. If the full cipher is not compromised, we aim to select attacks that
break as many rounds as possible. We only refer to distinguishing attacks against
MUGI and Enocoro as we are not aware of any published key-recovery attacks.

1.8.3 Security Definitions

Modern cryptography is heavily based on mathematical theory and computer
science practice. Cryptographic algorithms are designed around computational
hardness assumptions, making such algorithms hard to break in practice by any
adversary.
There are different approaches (categories) to define the security of crypto-
systems.
Most commonly, two fundamental approaches are used for formally defining
the security of an encryption scheme [102]:

• The first one is semantic security, which implies that it is infeasible for an
attacker to learn any information about the plaintext from the ciphertext;

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 22 — #22

i i

22 Ciphers and Attacks Against Them

• The second definition determines security as the infeasibility of distinguish-

ing between encryptions of two given messages.
In both definitions of security, the term “infeasible” rather than “impossible”
is used. This is because generic attacks exist against almost every known encryption
scheme (with the exception of the one-time-pad). One such universal attack, namely
a brute force, was discussed in Section 1.2.2. Brute-force attacks can be extended
to time-memory trade-off (TMTO) attacks, a broader class of attacks, which in
certain cases allow to reduce the key-recovery time by increasing the memory cost.
See Table 1.3 for an in-depth discussion of different attack types.
Another main category in literature defines security depending on the adver-
sary’s capabilities (e.g., Cryptography 101 [103, Chap. 1.2.2]):

Computational, conditional, or practical security. A cipher is computationally

secure if it is theoretically possible to break such a system, but it is infeasible to
do so by any known practical means. Theoretical advances (e.g., improvements
in integer factorization algorithms) and faster computing technology require these
solutions to be continually adapted.
Even using the best known algorithm for breaking it will require so many
resources (e.g., 1,000,000 years) that essentially the cryptosystem is secure.
So this concept is based on assumptions of the adversary’s limited computing
power and the current state of science.
A typical example of a pragmatically secure procedure is AES: No practicable
attack is known on it. Even so, AES is theoretically broken, which just means it can
be broken with less effort than a brute-force attack. This effort is still unrealistically
high. See Section 1.7.

Information-theoretical or unconditional security. A cipher is considered uncondi-

tionally secure if its security is guaranteed no matter how many resources (time,
space) the attacker has. Even if the adversary has unlimited resources he is unable
to gain any meaningful data from a ciphertext.
The only information-theoretically secure schemes that provably cannot be bro-
ken even with unlimited computing power are the one-time pad (OTP) or variants
of it.
Figure 1.6 shows that it may be impossible to determine the correct plaintext
from a OTP (if the OTP method has been applied correctly and if all keys have
the same likelihood). The example in this figure uses an 8-character long given
ciphertext: 11 1B 1E 18 00 04 0A 15. The hex values correspond to the ASCII
values of the letters: For example, the letter C has the numerical value 67 (decimal),
which is 43 in hex representation.
There are many meaningful words with eight letters and for each there is a
correct key. So an attacker cannot determine alone from the ciphertext which is the
correct key and which is the correct plaintext word. In other words, with different
keys the same ciphertext can lead to different meaningful plaintexts and so, in this
case, it cannot be distinguished which plaintext is the correct one.12
12. The OTP procedure is discussed in more detail in Section 2.2.4 in item “One-time pad.”
Also see Figure 9.12, where a corresponding example with text strings is built with SageMath, and the
XOR method is explained.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 23 — #23

i i

1.9 Algorithm Types and Self-Made Ciphers 23

Figure 1.6 Illustration of the information-theoretically secure OTP scheme.13

As the OTP is information-theoretically secure it derives its security solely

from information theory and is secure even with unlimited computing power at
the adversary’s disposal. However, OTP has several practical disadvantages (the
key must be used only once, must be randomly selected, and must be at least
as long as the message being protected), which means that it is hardly used
except in closed environments such as for the hot wire between Moscow and
Washington.
Two more security concepts are sometimes used:

• Provable security. This means that breaking such a cryptographic system is

as difficult as solving some supposedly difficult problem, such as discrete log-
arithm computation, discrete square root computation, or very large integer
factorization.
Example: Currently we know that RSA is at most as difficult as factor-
ization, but we cannot prove that it’s exactly as difficult as factorization.
So RSA has no proven minimum security. Or in other words, we cannot
prove that if RSA (the cryptosystem) is broken, then factorization (the hard
mathematical problem) can be solved.
The Rabin cryptosystem was the first cryptosystem that could be proven
to be computationally equivalent to a hard problem (integer factorization).
• Ad-hoc security. A cryptographic system has this security feature if it is not
worth trying to break the system because the effort to do so is more expensive
than the value of the data that would be obtained by doing so. Or an attack
can’t be done in sufficiently short time (see [104]).
Example: This may apply if a message relevant to the stock market will
be published tomorrow, and you would need a year to break it.

13. Source of the four photos: https://pixabay.com/.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 24 — #24

i i

24 Ciphers and Attacks Against Them

1.9 Algorithm Types and Self-Made Ciphers

Here, two aspects of crypto procedures are mentioned briefly, which are often not
discussed early enough: types of algorithms and the thinking up of new algorithms.

1.9.1 Types of Algorithms

Algorithms can be categorized as follows:
• Random-based. Algorithms can be divided up into deterministic and heuris-
tic methods. Often students only become aware of deterministic methods,
where the output is uniquely determined by the input. On the other hand,
heuristic methods make decisions using random values and the results are
only correct with a certain probability. One can differentiate even more
precisely between randomized algorithms, and probabilistic and heuris-
tic methods, but these subtleties are not important for understanding the
contrast to deterministic methods.
Random looms large in cryptographic methods. Keys have to be selected
randomly, which means that at least for the key generation “random”
is necessary. In addition, some methods, especially from cryptanalysis, are
heuristic.
• Constant-based. Many modern methods (especially hash methods and sym-
metric encryption) use numeric constants. Their values should be plausible,
and they shouldn’t contain back doors. Numbers fulfilling this requirement
are called nothing-up-my-sleeve numbers.

1.9.2 New Algorithms

It happens again and again that someone without deeper knowledge of adequate
design concepts comes up with a “new” encryption procedure. However, reality
shows that this is not a good idea. That’s why people usually learn early not to design
their own cryptosystem if they hope that the fact that it is not known will protect
them. There are many reasons for this, including that it only takes one disgruntled
employee or any other malicious actor to reveal the secrets that make the scheme
secure. Designing secure cryptographic schemes is extremely difficult. It is incredibly
easy to create something that looks secure, but actually leaks information.
Offering prize money and just single ciphertexts is unprofessional—serious
researchers have little time and will not spend any effort on it (perhaps they give it
to students as an exercise for didactic reasons). Modern best practice is that if you
want to create a new encryption scheme, first publish it with a detailed explanation
of how it works, its advantages, and any evidence of its security. Then you can see
if anyone can find any weaknesses. This is not a quick process—you should expect
it to take years.

1.10 Further References and Recommended Resources

Here are some good cryptography books that can serve as useful background on var-
ious topics in order from beginners (history) to intermediate (applied) to advanced

14. https://www.cryptool.org/en/cto/aes-animation.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 27 — #27

i i

1.11 AES Visualizations/Implementations 27

Figure 1.8 AES visualization from CTO (part 2).

1.11.4 AES with OpenSSL within CTO18

As CTO has integrated a WebAssembly-based version of OpenSSL, this also can
be done locally in your browser without the need to install OpenSSL. While Linux
systems mostly have OpenSSL on board, Windows systems or smart phones don’t.
For such systems this plugin is helpful.
For the example in Figure 1.11 we store the message AESTEST1USINGCT2
in a file called “klartext-1.hex.” Then we upload this file from the file system of
the operating system into a virtual file system in the browser: This upload is done
in the tab “Files” of the OpenSSL plugin. Then in the OpenSSL plugin the same
openssl command is executed as before in the terminal (see Section 1.11.3). And
if you download the resulting file klartext-1.hex.enc and compare it with the result
from the terminal, you see both are identical.

1.12 Educational Examples for Symmetric Ciphers Using

SageMath

Section 1.12.1 shows the SageMath implementation of a cipher (called Mini-

AES) stripped for didactic purposes. Further publications with ciphers reduced for
didactic reasons are listed in Section 1.12.2.

1.12.1 Mini-AES
The SageMath module crypto/block_cipher/miniaes.py supports Mini-AES to
allow students to explore the inner working of a modern block cipher.
Mini-AES, originally described in [111], is a simplified variant of AES to be
used for cryptography education.
Here is a short list about how Mini-AES was simplified compared to AES:

17. xxd creates a hex dump of a given file or of standard input. With the option “-r” it converts hex dump
back to its original binary form.
18. https://www.cryptool.org/en/cto/openssl.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 30 — #30

i i

30 Ciphers and Attacks Against Them

Figure 1.11 AES encryption using OpenSSL in the browser.

• The AES has a block size of 128 bits, and supports key sizes of 128, 192,
and 256 bits. The number of rounds is 10, 12, or 14 for the three different
key sizes, respectively.
Mini-AES has a 16-bit block size, a 16-bit key size, and 2 rounds.
• The 128-bit block of the AES is expressed as a matrix of 4 × 4 bytes, in
contrast to Mini-AES expressing its 16-bit block as a matrix of 2 × 2 nibbles
(half-bytes).
• The AES key schedule takes the 128-bit secret key and expresses it as a group
of four 32-bit words.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 31 — #31

i i

1.12 Educational Examples for Symmetric Ciphers Using SageMath 31

The Mini-AES key schedule takes the 16-bit secret key and expresses it as a
group of four nibbles (4-bit words).

How to use Mini-AES is exhaustively described at this SageMath ref-

erence page: https://doc.sagemath.org/html/en/reference/cryptography/sage/cryp-
to/block_cipher/miniaes.html.
SageMath Example 1.1 was originally taken from the release tour of SageMath
19
4.1 and calls the implementation of the Mini-AES.

SageMath Example 1.1: Encryption and Decryption with Mini-AES

print ("\n# CHAP01 -- Sage -Script -SAMPLE 010: =========")

# (1) Encrypting a plaintext using Mini -AES

from sage.crypto.block_cipher.miniaes import MiniAES
maes = MiniAES ()
K = FiniteField(16, "x")
MS = MatrixSpace(K, 2, 2)

P = MS([K("x^3 + x"), K("x^2 + 1"), K("x^2 + x"), K("x^3 + x^2")]); �

� print ("(1) P:\n",P, sep ="")

key = MS([K("x^3 + x^2"), K("x^3 + x"), K("x^3 + x^2 + x"), K("x^2 + x �

� + 1")]); print ("key:\n",key , sep ="")

C = maes.encrypt(P, key); print ("C:\n",C, sep ="")

# decryption process
plaintxt = maes.decrypt(C, key)
print(plaintxt == P)

# (2) Working directly with binary strings

maes = MiniAES ()
bin = BinaryStrings ()
key = bin.encoding ("KE"); print ("\n(2) key:\n",key , sep ="")

P = bin.encoding (" Encrypt this secret message !"); print ("P:\n",P,sep �

� ="")
C = maes(P, key , algorithm =" encrypt "); print ("C:\n",C,sep ="")
plaintxt = maes(C, key , algorithm =" decrypt ")
print(plaintxt == P)

# 3) Or working with integers n such that 0 <= n <= 15:

maes = MiniAES ()
P = [n for n in range(16)]; print ("\n(3) P:\n",P, sep ="")
key = [2, 3, 11, 0]; print ("key:\n",key , sep ="")

P = maes.integer_to_binary(P)
key = maes.integer_to_binary(key)
C = maes(P, key , algorithm =" encrypt "); print ("C:\n",C, sep ="")
plaintxt = maes(C, key , algorithm =" decrypt ")
print(plaintxt == P)

19. See https://mvngu.wordpress.com/2009/07/12/sage-4-1-released/.

Further example code for Mini-AES can be found in [112].

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 32 — #32

i i

32 Ciphers and Attacks Against Them

Further details concerning cryptosystems within SageMath (e.g., about the Sim-
plified Data Encryption Standard (SDES)) can be found in the thesis of Minh Van
Nguyen [113].

1.12.2 Symmetric Ciphers for Educational Purposes

Compared to public-key ciphers based on mathematics, the structure of AES and
most other modern symmetric ciphers (like DES, IDEA, or Present), is very complex
and cannot be explained as easily as RSA.
So, simplified variants of modern symmetric ciphers were developed for educa-
tional purposes in order to allow beginners to perform encryption and decryption
by hand and gain a better understanding of how the algorithms work in detail.
These simplified variants also help to understand and apply the corresponding
cryptanalysis methods.20
The most well-known of these variants are SDES21 and Simplified-AES (S-
AES)22 by Ed Schaefer and his students [115], and Mini-AES (see Section 1.12.1):
• Edward F. Schaefer: A Simplified Data Encryption Standard Algorithm
[116].
• Raphael Chung-Wei Phan: Mini Advanced Encryption Standard (Mini-
AES): A Testbed for Cryptanalysis Students [111].
• Raphael Chung-Wei Phan: Impossible Differential Cryptanalysis of Mini-
AES [117].
• Mohammad A. Musa, Edward F. Schaefer, Stephen Wedig: A Simplified AES
Algorithm and Its Linear and Differential Cryptanalyses [118].
• Nick Hoffman: A Simplified Idea Algorithm [119].
• S. Davod. Mansoori, H. Khaleghei Bizaki: On the Vulnerability of Simplified
AES Algorithm Against Linear Cryptanalysis [120].

References

[1] International Association for Cryptologic Research (IACR), IACR, https://www.iacr.org/.

[2] War of the Letters, BBC documentary film (in German, Krieg der Buchstaben), 1994,
https://www.youtube.com/watch?v=yfw1JLI8aWk.
[3] Nichols, R. K., Classical Cryptography Course, Volumes 1 and 2, Tech. Rep., 12 lessons,
Aegean ParkPress 1996, https://www.cryptogram.org/resource-area/crypto-lessons
-tutorials-lanaki/.

The term paper-and-pencil methods (P&P methods, also called hand ciphers or
pen’n’paper ciphers) embraces all techniques that people can apply manually to
encrypt and decrypt (encipher or decipher) a message. This includes all classic
encryption methods (as opposed to those that require machines or computers), but
also some newer ones that were deliberately developed with the aim of achieving
a very high level of security by hand (which was not always successful), such as
ElsieFour, Solitaire, Hutton, or Handycipher.
P&P methods were also popular with secret services, as a writing pad and a
pencil are unsuspicious.
This chapter provides a broad overview (encyclopedic approach) of many of
these P&P methods, each with an example and references to deeper information.1
Section 2.5 in this chapter presents Hagelin rotor machines as an example of
electromechanical cipher machines that were in use until the 1970s.
At the end of this chapter (Section 2.8) you find sample code (for ciphers like
Caesar, Atbash, monoalphabetic substitution, Vigenè, Hill, columnar transposi-
tion) written for the computer-algebra system SageMath.
While this chapter describes the procedures from the point of view of modern
cryptography (and so that the ciphertext is already available correctly), Chapter 3
takes the view of a historian who first has to transcribe a found document before
he can cryptographically examine it.
The first paper-and-pencil methods already arose about 3,500 years ago in
Mesopotamia. All paper-and-pencil methods are a matter of symmetric methods.
Even the earliest encryption algorithms use the basic principles such as trans-
position, substitution, block construction, and their combinations. Hence, it is
worthwhile to closely consider these ancient methods especially under didactic
aspects.
Ciphers to be successful and widespread had to fulfill the same characteristics
required for modern algorithms:

1. The footnotes of this chapter describe how the cryptographic methods can be performed using the offline
programs CrypTool 1 (CT1), CrypTool 2 (CT2), and JCrypTool (JCT). See Sections A.2 and A.3. Many of
the methods can also be performed within a browser (e.g., on the website CrypTool-Online (CTO)). See
Section A.4.
While the CrypTool websites and programs offer both classic and modern ciphers, there are several
sites related to the American Cryptogram Association (ACA) [1] that provide very detailed focus only on
classic ciphers: for instance the sites of Bion [2] and Pilcrow [3].

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 40 — #2

i i

40 Paper-and-Pencil and Precomputer Ciphers

• Exhaustive description, almost standardization (including special cases,

padding, etc.);
• Good balance between security and usability (because methods being too
complicated were error-prone or unacceptably slow).

Convention: If the alphabet uses only 26 letters, we write from now onward the
plaintext in lowercase letters and the ciphertext in capital letters.
The letters of the ciphertext are—as used historically—grouped within blocks
of five letters. It does not matter if a different (constant) block length is used for the
output or if there is no separation by blanks.

2.1 Transposition Ciphers

Encrypting a message by means of transposition does not change the original char-
acters of this message, only their order in the plaintext is modified (transposition =
exchange). There is no ciphertext alphabet.
Sometimes, the name permutation is used to describe how characters, groups
of characters, numbers, or columns of the plaintext are exchanged; for example,
(1, 2, 3, 4, 5) ⇔ (3, 4, 2, 1, 5).

2.1.1 Introductory Samples of Different Transposition Ciphers

• Rail fence cipher2 [4]: The characters of a message are alternately written
in two (or more) lines, creating a zigzag pattern. The resulting ciphertext is
read out line by line. This is more a children’s method.
For an example, see Table 2.1.
Plaintext: an example of transposition

Table 2.1 Rail Fence Cipher

n x m l o t a s o i i n
a e a p e f r n p s t o

Ciphertext: NXMLO TASOI INAEA PEFRN PSTO

2. - Using CTO, this cipher can be seen in the browser in the plugin “Rail fence”: https://www.cryptool
.org/en/cto/railfence. Here, not only the result but also the graphical zigzag display is output.
- This method can also be found using CT1 Encrypt/Decrypt F Symmetric (classic) F Scytale /
Rail Fence.
- A rail fence with 2 lines and offset 1 can be simulated as simple columnar transposition; that is, using
CT1 Encrypt/Decrypt F Symmetric (classic) F Permutation, using as key “(B,A)” and accepting the
default settings (only one permutation, where your input is done line-by-line and the output is taken column-
by-column). This was done in Table 2.1.
Using the key “(A,B)” would start the zigzag pattern in Table 2.1 in such a way that the first letter is
written into the first line instead of the second line. Normally, the rail fence cipher cannot be simulated via
a simple columnar transposition.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 41 — #3

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 44 — #6

i i

44 Paper-and-Pencil and Precomputer Ciphers

Table 2.5 Nihilist Transposition

W O R D S D O R S W
W a n e x a D s p o i s
O m p l e o O e p l o m
R f t r a n R a t r n f
D s p o s i S n i o - t
S t i o n - W x n e a a

Cadenus cipher [1]: Cadenus is a form of a columnar transposition that uses two
keywords. Cadenus combines a complete columnar transposition with a keyed
column rotation. The procedure—both for encryption and decryption—is rather
error-prone.
Let’s say the first keyword (KEY) has a length of n = 3. The second keyword
has a maximal length of the alphabet. It can be shorter, but it must contain the letters
of the first keyword. The second keyword is a permutation of the used elements.
We have a table with a first column (stub column), and three blocks each con-
sisting of n columns. The first keyword is written in the header row of each block
and used to swap the columns of the second and third block later.
The second keyword is used to build the stub column and so define the initial
letter of each column of the third block.
The plaintext is filled line by line in the first block. Then each column of the first
block is moved to the second block, transposed in the order of the first keyword.
The columns of the third block are created by copying each column from block 2 to
block 3 and moving the elements within a column wrap-around such that it begins
with the letter, which is in the same line as the corresponding key letter of the first
keyword within the second keyword. The ciphertext is then read out line by line
from the third block.
See Table 2.6: The letters of the first keyword are marked bold in the stub
column. Within the second block, those letters are printed blue, which will be at

Table 2.6 Cadenus Cipher

K E Y E K Y E K Y
A c a d a c d s a a
D e n u n e u s r p
X s i s i s s i f i
K a f o f a o u l o
C r m o m r o n n s
W f c o c f o k t g
N l u m u l m w n e
S n a r a n r d o o
Y t r a r t a a t x
E n s p s n p n n d
D o s i s o i i i u
T t i o i t o f a s
U n u s u n s m y o
B i n g n i g c r o
R a k e k a e u c o
G y w o w y o a e m
H r d x d r x r s r

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 45 — #7

i i

2.2 Substitution Ciphers 45

the top of the columns in block 3 (after wrapping around the selected columns). The
sample here is a slightly enhanced version of Cadenus, as the length of the second
keyword must not be exactly 25. But still the plaintext length (here 51 with the filler
“x”) has to be a multiple of the length of the second keyword (here 17).

Plaintext: cadenus is a form of columnar transposition using a keywordx.

First keyword: KEY

Second keyword: ADXKCWNSYEDTUBRGH

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 47 — #9

i i

2.2 Substitution Ciphers 47

See Table 2.7.

Plaintext: an example of substitution

Table 2.7 Nihilist Substitution

K E Y
(35) (31) (54)
a n e
(58) (53) (85)
x a m
(88) (54) (96)
p l e
(78) (72) (85)
o f s
(56) (63) (65)
u b s
(47) (44) (65)
1 2 3 4 5 t i t
1 S U B T I (49) (46) (68)
2 O N A C D u t i
3 E F G H K (47) (55) (69)
4 L M P Q R o n
5 V W X Y Z (56) (53)
(a) Matrix (b) Table

Ciphertext: 58 53 85 88 54 96 78 72 85 56 63 65 47 44 65 49 46 68 47
55 69 56 53

Codes [4]: In the course of time, codebooks were used again and again. A codebook
assigns a codeword, a symbol, or a number to every possible word of a message.
Only if both parties hold identical codebooks and if the assignment of codewords
to plaintext words is not revealed, a successful and secret communication can take
place.

Nomenclator [4]: A nomenclator refers to techniques that combine the use of a

cipher algorithm with a codebook. Often the encryption system is based upon a
ciphertext alphabet. This alphabet is used to encrypt (via substitution) the bigger
part of the message. Particularly frequent or top-secret words are replaced by a
limited number of codewords existing besides the ciphertext alphabet.

Map cipher [7]: This method constitutes a combination of substitution and

steganography.15 Plaintext characters are replaced by symbols that are arranged
in a map following certain rules.

15. Pure steganography tries to conceal the existence of the message instead of encrypting it.
Different steganographic techniques can be found in CT2: For example, image steganography (with
BPCS and with LSB), text steganography [with capital letters (alone or in binary mode), with letter
marking, with zero width spaces], or watermarking (invisible, robust, or visible). See CT2 Templates F
Steganography F Image Steganography with BPCS.

i i

Figure 2.1 CT2 component for the straddling checkerboard.

traversed column by column from the grille and transferred row by row into the
matrix.
See Table 2.9.
Plaintext: an example of substitution

Table 2.9 Variant of the Straddling Checkerboard

K E Y W O R D
A B C F G H I
Grille
J L M N P Q S
T U V X Z . /
0 1 2 3 4 5 6 7 8 9
A T E - N O R - I S
Matrix
3 K J B L U Y C M V W
7 F X G P Z H Q . D /

Ciphertext: 04271 03773 33257 09343 29181 34185 4

Ché Guevara cipher18 :

With a high probability, Ché Guevara used a special case of the checkerboard
variant above (this case has an additional substitution step and a slightly changed
checkerboard):

• The seven most frequent letters in Spanish are distributed in the first row;
• Four instead of three rows are used;
• So one could encrypt 10 · 4 − 3 = 37 different characters.

Tridigital cipher [1]: A keyword with 10 letters is used to create a numeric key by
numbering its letters corresponding to their alphabetical order. This key is written
above the columns of the 3 × 10-matrix. This matrix is filled line by line with the
alphabet as follows: The different letters of a keyword are inserted first, followed by
the remaining letters. The last column is left out. Plaintext characters are substituted
with numbers, and the number of the last column is used to separate words.

18. CT2 Templates F Cryptography F Classical F Ché Guevara Cipher

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 50 — #12

i i

50 Paper-and-Pencil and Precomputer Ciphers

Baconian cipher19 [1]: This is actually a code and not a cipher (there is no key).
Assign a five-digit binary code to every letter and to six numbers or special charac-
ters (00000 = A, 00001 = B, etc.), and replace the plaintext characters with this
binary code. Now use a second, unsuspicious message to hide the ciphertext inside
of it. This may happen by upper and lower case or italicized letters: For example,
all letters of the unsuspicious message below a binary “1” are capitalized. Overall
this is obtrusive.
See Table 2.10.
Table 2.10 Baconian Cipher
Plaintext / message F I G H T
Intermediate ciphertext 00101 01000 00110 00111 10011
Unsuspicious message itisw arman thesu nissh ining
Ciphertext itIsW aRman thESu niSSH IniNG

2.2.2 Homophonic Substitution

Homophonic methods constitute a special form of monoalphabetic substitution.
Several or all characters of the plaintext alphabet are assigned to more than one
ciphertext character. This methodology, in many variations, was widespread in his-
torical reality, for example, in the Copiale cipher, an encrypted manuscript from
the 18th century (see Figure 2.2).
Homophonic monoalphabetic substitution20 [4]: Each language has a typical fre-
quency distribution of the letters of the alphabet. To conceal this distribution,
each plaintext letter is assigned to several ciphertext characters. The number
of ciphertext characters assigned depends on the frequency of the letter to be
encrypted.
Book cipher: The words of a message are substituted by triples “page-line-
position.” This method requires a detailed agreement of which book to use,
especially regarding the edition (layout, error correction, etc.).

Figure 2.2 Copiale cipher, scaled page 16/17. (Source: [8].)

19. CT2 Templates F Cryptography F Classical F Baconian Cipher

20. - CT1 Encrypt/Decrypt F Symmetric (classic) F Homophone
- CT2 Templates F Cryptography F Classical F Homophonic Substitution Cipher and Nomen-
clature. In addition, CT2 contains powerful analyzers for this cipher type.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 51 — #13

i i

2.2 Substitution Ciphers 51

Beale cipher [4]: The Beale cipher is a book cipher that numbers the words of a key
text. These numbers replace the plaintext letters by the words’ initial letters.
Grandpré cipher [6]: A square grille with 10 columns (other layouts are possible,
too) is filled with 10 words. The initial letters should result in an eleventh word. As
columns and rows are numbered from 0 to 9, letters can be replaced by two-digit
numbers. It is obvious that with the table having 100 fields, most letters can be
represented by more than one pair of numbers. You should keep in mind that those
10 words have to contain all letters of the plaintext alphabet.
Spanish strip cipher (SSC) [9]: This homophonic substitution cipher was the official
method of ciphering by the Spanish ministries in the late 19th century, and the
method mostly used by both sides, the Republicans and the Nationalists, in the
Spanish Civil War (1936–1939).21

2.2.3 Polygraphic Substitution

Polygraphic techniques do not work solely by replacing single characters; they
(also) replace whole groups of characters. In most cases, these groups are digrams,
trigrams, or syllables. For these letter sequences of defined length, different alter-
native designations are known that are also used in the frequency analysis: 2-gram,
digram, bigram, and digraph all mean the same thing. Because of the better read-
ability and the practical comparability of different lengths, the notation n-gram
slowly wins through.

Figure 2.3 CT2 template for the Spanish strip cipher.

21. - CT2 Templates F Cryptography F Classical F Spanish Strip Cipher. See Figure 2.3. CT2 can
break it with the Homophonic Analyzer.
- MTC3: For example, https://mysterytwister.org/challenges/level-2/spanish-strip-cipher-
part-1.
- The Kopal video explains in detail how the keys are generated and how to break this cipher with CT2:
https://www.youtube.com/watch?v=-C-hgnrUMKo.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 52 — #14

Two-square cipher [6]: The two-square cipher resembles the four-square cipher,
but the matrix is reduced to two quadrants. Both letters of the digraph part of the
same row, they are just exchanged. Otherwise, the plaintext letters are considered
as opposite corners of a rectangle and substituted by the other vertices. Quadrants
can be arranged horizontal and vertical.
Tri-square cipher [1]: Three quadrants are filled with the same alphabet. The first
plaintext letter is looked up in the first quadrant and can be encrypted with every
letter of that column. The second plaintext letter is looked up in the second quadrant
(diagonally across) and can be encrypted with every letter of that row. Between these
two ciphertext characters, the letter at the intersection point is set.
Dockyard cipher [6]: Used by the German Navy during World War II.

2.2.4 Polyalphabetic Substitution

Concerning polyalphabetic substitution, the assignment of ciphertext characters
to plaintext characters is not static, but changes during the process of encryption
(depending on the key). Polyalphabetic ciphers are historically far more important
than polygraphic ciphers.
If the key is generated like a key stream from existing plaintext or ciphertext
parts, this is called an autokey method. An incorrect letter at any point renders

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 54 — #16

i i

54 Paper-and-Pencil and Precomputer Ciphers

the entire text that follows unusable. Such errors can occur at the encryptor, in
transmission, in reception, or occur at the decryptor (i.e., also at the parties who
know the correct key).

• Vigenère23 [4]: Each plaintext character is encrypted with a different cipher-

text alphabet that is determined by the characters of a keyword (the so-called
Vigenère tableau serves auxiliary means). If the plaintext is longer than the
key, the key is (cyclically) repeated.
See Table 2.13.

Table 2.13 Vigenère Tableau

Plaintext the alphabet is changing
Key KEY KEYKEYKE YK EYKEYKEY
Ciphertext DLC KPNREZOX GC GFKRESRE
- A B C D E F G H I J K LM N O P Q R S T U VWX Y Z
A A B C D E F G H I J K LM N O P Q R S T U VWX Y Z
B B C D E F G H I J K LM N O P Q R S T U VW X Y Z A
C CD E F G H I J K LM N O P Q R S T U VW X Y Z A B
D D E F G H I J K LM N O P Q R S T U VW X Y Z A B C
E E F G H I J K LM N O P Q R S T U VW X Y Z A B CD
F FG H I J K LM N O P Q R S T U VW X Y Z A B CD E
G GH I J K LM N O P Q R S T U VW X Y Z A B CD E F
H H I J K LM N O P Q R S T U VW X Y Z A B C D E FG
I I J K LM N O P Q R S T U VW X Y Z A B C D E FGH
J J K LM N O P Q R S T U VW X Y Z A B C D E FGH I
K K LM N O P Q R S T U VW X Y Z A B C D E F GH I J
... ... ...

Vigenère variants:
• Interrupted key: The key is not repeated continuously, but starts again with
every new word of the message.
• Autokey24 [6]: After using the agreed key, use the message itself as a key.
See Table 2.14.

Table 2.14 Autokey Variant of Vigenère

Plaintext the alphabet is changing
Key KEY THEALPHA BE TISCHANG
Ciphertext DLC TSTHLQLT JW VPSPNIAM

23. - CTO in the plugins “Vigenère,” “Autokey,” “Beaufort,” “Porta,” and “Trithemius.” In addition, CTO
contains an interactive analyzer to determine the key length and the smallest value of the autocorrelation:
https://www.cryptool.org/en/cto/vigenerebreak. For an automated analysis the autocorrelation is
used: https://www.cryptool.org/en/cto/autocorrelation.
- CT1 Encrypt/Decrypt F Symmetric (classic) F Vigenère
Visualized in CT1 Indiv. Procedures F Visualization of Algorithms F Vigenère...
- CT2 Templates F Cryptography F Classical F Vigenère. In addition, CT2 contains powerful ana-
lyzers for this cipher.
- JCT Default Perspective F Algorithms F Classic F Vigenère
24. Autokey can be found in the CT variants mainly the same way as Vigenère.

i i

Table 2.15 Ragbaby Cipher

Plaintext th e a l pha b e t i s cha n g i n g
Numbering 1 2 3 2 3 4 5 6 7 8 9 3 4 4 5 6 7 8 9 10 11
Ciphertext ULO CPVP IMCO NX IP IZTX Y X

2.3 Combining Substitution and Transposition

In the history of cryptography one often comes across combinations of the previ-
ously mentioned methods. These have become—on average—proven to be safer
than procedures that are based only on one of the principles of transposition or

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 57 — #19

i i

2.3 Combining Substitution and Transposition 57

substitution. However, cascades29 do not have to be safer (e.g., the sequential exe-
cution of two Caesar ciphers with different shift values is simpler than the unique
executing of Caesar with a third shift value), so that’s not at all a security gain. On
the other hand, executing two column transpositions one after the other can lead to
a significant gain in security; it is equivalent to a column transposition in which the
key is as long as the product of the two individual key lengths, provided the keys are
constructed correctly. See https://mysterytwister.org/challenges/level-
3/double-column-transposition-reloaded-part-1.
ADFG(V)X30 [4]: ADFG(V)X-encryption was developed in Germany during World
War I. The alphabet is filled into a 5 × 5 or 6 × 6 matrix, and columns and rows
are marked with the letters ADFGX and V, depending on the size of the grille. Each
plaintext character is substituted by the corresponding pair of letters. Finally, a
(row-) transposition cipher is performed on the resulting text.
Fractionation [6]: Generic term for all kinds of methods that encrypt one plaintext
character by several ciphertext characters and then apply a transposition cipher to
this intermediate ciphertext so that ciphertext characters originally belonging to
each other are separated.

• Bifid/Polybius square/checkerboard [5]: Bifid encryption is the basic form of

fractionation. A 5 × 5 matrix is filled with the plaintext alphabet (see Playfair
encryption), rows and columns are numbered so that each plaintext charac-
ter can be substituted by a pair of digits. Mostly the plaintext is divided into
blocks of equal length. The length of blocks (here 5) is another configura-
tion parameter of this cipher. Block-by-block all line numbers are read out
first, followed by all numbers naming the columns. To obtain the cipher-
text, the digits are pairwise transformed into letters again. The numbers can
be any permutation of (1,2,3,4,5): in our case (1,4,2,3,5) for the rows and
(2,4,5,1,3) for the columns. This is one key or configuration parameter of this
cipher. Instead of numbering rows and columns, a keyword can also be used.
See Table 2.16.
Table 2.16 Bifid Cipher
2 4 5 1 3
1 K E Y W O
4 R D A B C
2 F G H I L
3 M N P Q S
5 T U V X Z

Note that for a complete description, it is necessary to explain how sender and
receiver handle texts that fill in the last block only 1–5 characters. The possibil-
ities range from ignoring a last and incomplete block to padding it with random
characters or with characters predefined in advance.
See Table 2.18.
Table 2.18 Digrafid Cipher
1 2 3 4 5 6 7 8 9
A B C D E F G H I 1 2 3
J K L M N O P Q R 4 5 6
S T U V W X Y Z . 7 8 9
A J S 1
B K T 2
C L U 3
D M V 4
E N W 5
F O X 6
G P Y 7
H Q Z 8
I R . 9

co mb in in gs ub st it ut io na nd tr an sp os it io
3 4 9 9 7 3 1 9 3 9 5 5 2 1 1 6 9 9
2 4 2 2 3 7 9 3 9 2 4 4 8 2 8 6 3 2
6 2 5 5 1 2 2 2 2 6 1 4 9 5 7 1 2 6
LI KB FN .C BY EB SU I. BK RN KD FD BA HQ RP X. FT AO

Nicodemus cipher [1]: First, a simple columnar transposition is carried out. Before
reading out the columns, the message is encrypted additionally by Vigenère (all
letters of a column are enciphered with the corresponding keyword letter). The
ciphertext is read out in vertical blocks.
See Table 2.19.
Table 2.19 Nicodemus Cipher
K E Y E K Y E K Y
c o m o c m S M K
b i n i b n M L L
i n g n i g R S E
s u b u s b Y C Z
s t i t s i X C G
t u t u z t Y J R
i o n o i n S S L
a n d n a d R K B
t r a r t a V D Y
n s p s n p W X N
o s i s o i W Y G
t i o i t o M D N

Plaintext: combining substitution and transposition

Ciphertext: SMRYX MLSCC KLEZG YSRVW JSKDX RLBYN WMYDG N
Double column transposition (DCT) / Granit E160 [10]: Granit is a two-step cipher.
The second step is the double column transposition. In the first step, the plaintext

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 60 — #22

i i

60 Paper-and-Pencil and Precomputer Ciphers

is substituted by a sequence of digits using a codebook and a matrix (a variant of

the Polybius square).
The Granit cipher was used for instance by the spy Guenter Guillaume for his
communication with the Ministry of State Security of the former German Demo-
cratic Republic (GDR) until about 1960.31 If the passwords are chosen correctly,
this is a very secure procedure.

2.4 Further P&P Methods

The P&P procedures include both classic procedures and newer ones—which
should always be executable by hand and operate on a few printable characters.
The new methods include, for example, LC4 (2017) and Handycipher (2014).
Pinprick encryption [4]: For centuries, this simple encryption method has been
put into practice for different reasons (actually steganography). During the Victo-
rian Age, for example, small holes underneath letters in newspaper articles marked
the characters of a plaintext, as sending a newspaper was much cheaper than the
postage on a letter.
Stencil: Stencils (cardboard with holes) are also known as a Cardinal-Richelieu-Key.
Sender and receiver have to agree upon this stencil. The stencil is laid above a text
and the letters that remain visible make up the ciphertext. This is not to be confused
with the Grille of Section 2.1.1, because here not only the scrambled plaintext letters
are transmitted, but the entire text, most of whose letters are meaningless (called
nulls).
Card games [6]: The key is created by means of a pack of cards and rules that are
agreed upon in advance. All methods mentioned in this paragraph are designed as
paper-and-pencil methods (i.e., they are applicable without electronic aid). A pack
of cards is unsuspicious to outsiders, shuffling the deck provides a certain amount
of coincidence, and cards can be transformed into numbers easily to be used in a
substitution cipher without any further aid.
Solitaire cipher (Bruce Schneier)32 [11]: Sender and receiver have to own a deck of
cards shuffled in the same manner. A key stream is generated that has to consist of
as many characters as the message to be encrypted.
The algorithm to generate the key is based on a shuffled deck of 54 cards (ace,
2–10, jack, queen, king in four suits, and two jokers). The pack of cards is held
face up:
1. Swap the first joker with the card beneath it.
2. Move the second joker two cards down.

31. MTC3 offers a number of related challenges: If you enter in your browser at https://mysterytwister
.org/challenges/level-2/ the search item “Granit,” you’ll find six challenges about it.
A detailed 20-page description about the Granit cipher can be found at:
https://mysterytwister.org/media/challenges/pdf/mtc3-drobick-01-doppelwuerfel-01-en.pdf
32. - CT1 Encrypt/Decrypt F Symmetric (classic) F Solitaire (detailed GUI window);
- CT2 Templates F Cryptography F Classical F Solitaire. In addition, CT2 contains a brute-force
analyzer for this cipher.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 61 — #23

i i

2.4 Further P&P Methods 61

3. Now swap the cards above the first joker with those below the second one.
4. Look at the bottom card and convert it into a number from 1 to 53 (bridge
order of suits: clubs, diamonds, hearts, spades; joker = 53). Write down this
number and count down as many cards starting with the top card. These
cards are swapped with the remaining cards; only the bottom card remains
untouched.
5. Look at the top card and convert it into a number as well. Count down as
many cards starting with the top card.
6. Write down the number of the following card. This card is converted into
your first keystream character. As we need numbers from 1 to 26 to match
the letters of our alphabet, clubs and hearts correspond to the numbers 1 to
13, diamonds and spades to 14 to 26. If your output card is a joker, start
again.

For each keystream character you would like to generate, these six steps have
to be carried out. This procedure is—manually—very lengthy (4 hours for 300
characters, dependent on your exercise) and requires high concentration.
Encryption takes place by addition modulo 26. Encryption is relatively fast
compared to the key stream generation.
This P&P cipher creates a key stream that is so good, even today it is hard to
crack the cipher if you don’t know the originally sorted card deck (ciphertext-only
attack).
Mirdek cipher (Paul Crowley) [12]: Even though this method is quite complicated,
the author provides a good example to illustrate the procedure.
Playing card cipher (John Savard) [6]: This algorithm uses a shuffled deck of 52
cards (no joker). Separate rules describe how to shuffle the deck. A keystream is
created via the following steps:

1. The pack of cards lies in front of the user, top down. Cards are turned up
and dealt out in a row until the total of the cards is eight or more.
2. If the last card dealt out is a jack, queen, or king, write down its value;
otherwise write down the sum of the cards dealt out (a number between 8
and 17). In a second row, deal out that number of cards.
3. The remaining cards are dealt out in rows under the second row. The first
one ends under the lowest card of the top row, the second one under the
next lowest card, and so on. If there are two identical cards, red is lower
than black.
4. The cards dealt out under step 3 are collected column by column, starting
with the column under the lowest card. The first card that is picked up
becomes the bottom card (face up).
5. The cards dealt out in step 1 and 2 are picked up, beginning with the last
card.
6. The deck is turned over, the top card is now the bottom card (face down).
Afterward, steps 1 to 6 are repeated twice.

i i

Figure 2.7 Front view of the Hagelin M-209 [19].

of time starting shortly before World War II and until after the Vietnam War [21].
About 140,000 units of this model were produced. This model and its implementa-
tion in CrypTool 2 are nicely presented on the YouTube channel Cryptography for
Everybody [22].
The BC-38 electromechanical cipher machine, as the name suggests, was an
improved model of the mechanical C-38. “B” was added as a prefix to the model
number to indicate the presence of a keyboard.
The latest Hagelin machines we discuss are the ones that refer to the model
range C-52/CX-52, which were developed in 1952. Information about the machines
that appeared later can be found in [19] and [21]. The C-52 had great success on the
market. This machine had six removable cipher wheels—the pinwheels—that could
be configured outside of the machine and installed in any order. A more advanced
and secure version was the CX-52 where the rotation of the pinwheels became
irregular. It is considered to be one of the most successful Hagelin mechanical cipher
machines, which made the encryption much more secure.
Before the CX-machines were invented, the United States and United Kingdom
were able to break the high-level cipher systems of most of the countries. The devel-
opment of the CX-52 model changed the situation, which resulted in the Rubicon
operation (see Section 2.5.2.4).
In the next section, the architecture of these latter machines is explained.

2.5.2 Hagelin C-52/CX-52 Models

The C-52 and CX-52 are all-mechanical pin-and-lug type cipher machines that do
not require any power source. All the models have a relatively similar architecture.
However, before the architecture of the machine is provided, to enable an easier
understanding of their construction, we first explain the general encryption method
they use.

2.5.2.1 Encryption Principle

In fact, these machines (as well as most other rotor cipher machines) are mechanical
implementations of stream ciphers (see Chapter 9). This means that the heart of

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 66 — #28

i i

66 Paper-and-Pencil and Precomputer Ciphers

this machine is a pseudorandom keystream generator. In other words, the Hagelin

machine produces a sequence of pseudorandom numbers that are used to encrypt
the plaintext. The encryption is applied to each plaintext character one by one using
a Beaufort encoding, which is in principle a Caesar cipher with an inverted alphabet.
That is, each plaintext character is substituted by a different one from the reversed
alphabet.
Let’s use the following notations:
• P = ( p1 , p2 , ..., p L ) is the plaintext, which is the original message that has
to be encrypted. It is a sequence of Latin characters pi , 1 ≤ i ≤ L, where L
is the length of the message.
• C = (c1 , c2 , ..., c L ) is the ciphertext received after encrypting the plaintext
with a Hagelin machine also represented as a sequence of Latin characters.
• Ind(a ) is the index of the character a in the alphabet, for example,
Ind('A') = 0; Ind('E') = 4.
• Char(n ) is the character located at the nth position in the alphabet, for
example, Char(0) = 'A', Char(4) = 'E'.
• D = {d1 , d2 , ..., d L } is the sequence of pseudorandom integer numbers gen-
erated by the Hagelin machine during the encryption process. This sequence
is also called displacement sequence.
The encryption takes place in accordance with (2.1).

ci = Char(Ind('Z') − Ind( pi ) + di mod 26) (2.1)

In this equation Ind('Z') is the constant value that is the index of the character
'Z' in the alphabet and is equal to 25.
Let’s consider the following example. Suppose that we want to encrypt plain-
text P = ('K', 'C') and the 2 displacement values generated by a given Hagelin
machine are D = {d1 , d2 } = {4, 1}. Then the ciphertext is computed as follows:

c1 = Char(Ind('Z') − Ind('K') + 4) mod 26) = Char(25 − 10 + 4) = Char(19) = 'T'

c2 = Char(Ind('Z') − Ind('C') + 1) mod 26) = Char(25 − 2 + 1) = Char(24) = 'Y'

Hence, the received ciphertext is C = ('T', 'Y').

To understand the key idea of the Beaufort encoding take a look at the posi-
tions of the plaintext characters in the original alphabet and at the positions of the
ciphertext characters in its reverse representation. It’s easy to see that the distance
between the plaintext and the corresponding ciphertext characters is equal to the
displacement values d1 = 4 ('K' to 'T') and d2 = 1 ('C' to 'Y'):
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
Z Y X W V U T S R Q P O N M L K J I H G F E D C B A
This encryption principle is the same for all the Hagelin models discussed
here. However, the way these pseudorandom sequences of displacement values are
computed is different for each of the machines and defines how secure the given
machine is.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 67 — #29

i i

2.5 Hagelin Machines as Models for Precomputer Ciphers 67

2.5.2.2 Architecture of the Machines

To keep the presentation in this section compact, we focus only on the most essential
parts of the machines that are important from a cryptographic point of view (for a
description of the other parts and their purpose, see [23]).
Pseudorandom displacement generator. The pseudorandom displacement generator
of these machines compromises two main parts (Figure 2.8), which are:

• The set of 6 pinwheels (also known as keywheels, pin disks, rotors, cipher
wheels or just wheels);
• Rotating cage (also known as drum) with 32 bars.

The pinwheels for each of the machine versions can be selected from a set of
12 with 25, 26, 29, 31, 34, 37, 38, 41, 42, 43, 46, and 47 pins. The wheels are
labeled with both letters and numbers. Next to each label there is a pin that can be
set to an active or nonactive state.
A printer located at the left of the machine (Figure 2.8) acts as the input/output
device. It has a double print head, where one letter ring contains the alphabet in the
regular order, and the rightmost ring—the alphabet in reversed order (Figure 2.9).
We call the former the input ring and the latter the output ring.
To encrypt each character pi the operator sets the input ring of the printwheel
to the position of this character. Then he turns the handle, and the drum makes a
complete revolution. Each of the 32 bars located in the drum can be affixed with the
small lugs at one or more of the six positions against each of the wheels [21]. During
the revolution of the drum all the lugs that are affixed to the bars will contact the
active guide arm of the wheel if the pin currently located against the drum is active.
All bars whose lugs had this contact will slide to the left. Each bar that is slid to
the left becomes an additional tooth on the drum, which will displace the output
ring by one extra position. That is, the total displacement value di is equal to the

Figure 2.8 Hagelin C-52 [19].

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 68 — #30

i i

68 Paper-and-Pencil and Precomputer Ciphers

Figure 2.9 Double print head with two letter rings; in regular-alphabet and in reversed-alphabet
order [19].

number of bars that were slid to the left during the revolution of the drum. After
the displacement value di is computed, the encryption happens following (2.1).
To make sure that decryption is possible, both sender and receiver must set
their machines in the same way. The wheel pins and the lugs that form an internal
secret key were normally changed on a daily basis, according to the key lists that
were distributed periodically [24]. To keep the communication secure, the initial
position of the six wheels, which is called an external key, has to be changed for
each new message.
Advancement of the wheels. In the case of the C-52 model, all the pin wheels regu-
larly step by one position after each character encryption. However, in some of the
CX-52 versions, this advancement of the wheels happens irregularly, which makes
the encryption considerably more secure. This is realized by the special drum bars
which have cams that can advance the pinwheels, so that the next cycle starts with
a different pinwheel setting and thus another set of six active and inactive pins [23].
Those bars, which can influence the stepping of the wheels, are called advance bars.
If a bar is forced to the left by an active guide arm that contacts a lug, the wheel
that is commanded by that bar will move.
Key space size. Assuming that the pin wheels and bars are known and are not part
of the key, the whole key space of the machine is composed of the wheel settings key
space and the lug settings key space. The wheel settings key space depends on the
set of the selected pin wheels. Suppose that the selected machine pin wheels have
the sizes that form a set: L = {l1 , l2 , ..., l6 }. Then the number of all possible initial
positions of the wheels is given by:

Ki = l 1 · l 2 · l 3 · l 4 · l 5 · l 6

Moreover, each of the wheel pins can be active or not active. In total there are
l1 + l2 + l3 + l4 + l5 + l6 pins. Hence, the total number of possible pin settings is:

Kp = 2l1 +l2 +l3 +l4 +l5 +l6

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 69 — #31

i i

2.5 Hagelin Machines as Models for Precomputer Ciphers 69

In addition, the machine has 32 bars, where each bar has six positions where lugs
can be placed (against each of the pin wheels). Some of the bars cannot be lugged.
If such bars are used in the machine, they should not be included in the key space
size computation.
The total number of lugs is 32 · 6 = 192. Therefore the maximal number of all
possible lug settings is:
Kl = 2192

This gives the total key space size of:

K = Ki · Kp · Kl = l1 · l2 · l3 · l4 · l5 · l6 · 2l1 +l2 +l3 +l4 +l5 +l6 · 2192

For example, if the selected wheel sizes are L = {l1 , l2 , ..., l6 } = {29, 31, 37, 41, 43,
47} and all the bars can be lugged then the total key space of such machine is:

K = 29 · 31 · 37 · 41 · 43 · 47 · 229+31+37+41+43+47 · 2192 ≈ 231.3 · 2228 · 2192 ≈ 2451.3

We note, however, that the cryptographic-relevant key space is usually considerably

smaller for the concrete machines due to the existence of equivalent keys. That
is, different settings may lead to the same sequence of displacements D, which
means that for an attacker it is sufficient to know only one of such equivalent keys.
Moreover, settings may result in only slightly different displacement sequences that
could also be sufficient for an attack. This can reduce the security of the concrete
machine even further.
Printer offset and the fixed-variable feature (F-V feature). Some of the models allow
applying an offset on the printer between the input ring and the output ring. If this
is done, when encrypting a given character pi with the offset o, the machine will
encipher the letter Char(Ind( pi ) − o) mod 26. For example, if the offset o = 2 and
the plaintext character pi = 'M', the machine will encipher character 'K'.
When a given model is equipped with a fixed-variable feature, this offset can
be increased after encrypting every character. When this feature is installed and
activated, the relative position of the input ring and output ring grows by the
displacement value di in each encrypting cycle i.

2.5.2.3 Differences Between C-52/CX-52 Models

The differences between each of the C-52/CX-52 models known to us are presented
next.
The main differences between the machine versions are the supported pin wheel
types and the way the pin wheels advance during the encryption process. The lat-
ter is achieved by various types of the bars used in the machines. We provide an
overview of the main principles, and for concrete specifications of the wheels and
bars used in C-52/CX-52 models refer to [23]. To better understand the machine
mechanism and how the models differ, we also encourage the readers to use the
CrypTool 2 component “Hagelin Machine,” which is discussed in Section 2.5.3.
C-52. In Hagelin C-52, all pin wheels always step by one. This is similar to the pre-
vious Hagelin models, however, the important improvement is that the pin wheels

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 70 — #32

i i

70 Paper-and-Pencil and Precomputer Ciphers

i i

72 Paper-and-Pencil and Precomputer Ciphers

• Wheels state—indexes (letters or numbers) of the initial positions of the six

wheels (see input box “Wheel start positions” in the template in Figure 2.10).
• Wheel pins—indexes (letters or numbers) of the pins that are activated for
each wheel (see input box “Active pins on wheels 1 to 6” in the template).
• Bar lugs—numbers of the wheels for which lugs on each of the bars are active
(see input box “Lug positions on bars 1 to 32” in the template).
and the following outputs:
• Output text—the result of encryption or decryption;
• Keystream—the array of the generated displacement values D;
• Report—report explaining intermediate steps.
The template contains two instances of the Hagelin Machine component. They
share the same settings except for the operation mode. That is, the first instance is
set to encryption mode and the second one to decryption mode. After the encrypted
text is processed by the second Hagelin Machine component in the template, the
received output text is equal to the original input text (see output box “Decrypted
text” in the template in Figure 2.10).
Figure 2.10 also shows that the presentation tab of the Hagelin component has
two main parts. The graphics demonstrate the current state of the six wheels after
the last input character was processed. The table below the graphics presents the
essential values of the internal state of the machine.

2.5.3.2 Hagelin Machine Settings

Using the necessary settings, the component can be adjusted in such a way that it
implements the machine with the needed functionality.
As the very first step the user has to select the model of the machine
(Figure 2.11). By default, the model CX-52a is selected. The models define the
sets of the available pin wheels and which bars are used in the machine. The con-
figuration of the bars defines how pin wheels are advanced during the encryption
process.
Besides the predefined models, the user has an option to select the “Custom”
model, which allows for the most flexible settings.

2.5.4 Recap on C(X)-52: Evolution and Influence

The C(X)-52 is a purely mechanical rotor cipher machine developed by the Swedish
inventor and entrepreneur Boris Hagelin in 1952 and manufactured by his company
Crypto AG. The machine was very flexible and easily customized according to the
requirements of the customers and the desired level of security. Within a few years,
different variations of the machine were exported to more than 60 countries [27],
which made C(X)-52 the most used rotor machine of its time.
The successors of C(X)-52 are the HX-63 rotor machine developed in 1963 and
the fully electronic H-460 cipher machine. Despite the emergence of new models,
many countries, including France, continued to order the CX-52 by the end of
the 1960s. Due to its purely mechanical architecture, CX-52 had the advantage
over into successors of being immune to the so-called electromagnetic pulse (EMP)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 73 — #35

i i

2.6 Ciphers Defined by the American Cryptogram Association 73

sage.crypto.classical.TranspositionCryptosystem

To construct and work with a transposition cipher in SageMath, we first need

to determine the alphabet that contains the symbols used to build our plaintext
and ciphertext. Typically, this alphabet will be the uppercase letters of the English
alphabet, which can be accessed via the function

sage.monoids.string_monoid.AlphabeticStrings

We then need to decide on the block length of a permutation, which is the length
of the row vector to be used in the simple column transposition. This row vector is
our key, and it specifies a permutation of a plaintext.
The first example of the transposition ciphers (SageMath Example 2.1) has
block length 14, and the key is built in a way that every letter in the plaintext is
shifted to the right by two characters, with wraparound at the end of the block.
That is the encryption process. The decryption process is shifting each letter of the
ciphertext to the left by 14 − 2 = 12.

SageMath Example 2.1: Simple Transposition by Shifting the Letters in the

Message (Key is Constructed with “Range”)
print ("\n# CHAP02 -- Sage -Script -SAMPLE 010: =========")

# transposition cipher using a block length of 14 and upper -case �

� alphabet
blocklen = keylen = 14; shift = 2
T = TranspositionCryptosystem (AlphabeticStrings (), blocklen)

# create encryption key (shifts each letter in the block by 2 positions �

� to the right)
# key = [3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 1, 2]
key = [(i+shift).mod(keylen) + 1 for i in range(keylen)]
print ("key: ", key , " keylen: ", len(key), sep ="")

# given plaintext
P = "a b c d e f g h i j k l m n"; print ("P: ", P)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 77 — #39

i i

2.8 Examples Using SageMath 77

SageMath Example 2.1 (continued)

# encode plaintext (get rid of non -alphabet chars , convert lower -case �
� into upper -case)
msg = T.encoding(P); print ("msg: ", msg)

# encrypt plaintext message

C = T.enciphering(key , msg); print ("C: ", C)

# decrypt using built -in deciphering method. Requires to change type of �

� key
DC = T.deciphering(T(key).key(), C); print ("DC: ", DC)

# Test correctness of decryption

print ("msg == DC:", msg == DC)

#------------------------------------
# CHAP02 -- Sage -Script -SAMPLE 010: =========
# key: [3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 1, 2] keylen: 14
# P: a b c d e f g h i j k l m n
# msg: ABCDEFGHIJKLMN
# C: CDEFGHIJKLMNAB
# DC: ABCDEFGHIJKLMN
# msg == DC: True

The second example of transposition ciphers (SageMath Example 2.2) is also a

simple shifting column transposition. But now the generation of the plaintext is a
bit automated: The sample plaintext is generated from the alphabet. Contrary to the
first sample, the inverse key is generated explicitly—once from the shift parameter
and once by using the transposition function from SageMath.

SageMath Example 2.2: Simple Transposition by Shifting (Key and Inverse

Key are Constructed)
print ("\n# CHAP02 -- Sage -Script -SAMPLE 020: =========")

# transposition cipher using a block length of 14, code more flexible

blocklen = keylen = 14; shift = 2
A = AlphabeticStrings ()
T = TranspositionCryptosystem (A, blocklen)

# encryption key
# key = [3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 1, 2]
key = [(i+shift).mod(keylen) + 1 for i in range(keylen)]
print ("key: ", key , " keylen: ", len(key), sep ="")

# construct the plaintext from the first 14 letters of the alphabet �

� plus blanks
# P = "A B C D E F G H I J K L M N"
print ("A.gens ():\n", A.gens (), sep ="")
P=''
for i in range(keylen): P = P + " " + str(A.gen(i))
print ("P: ", P)

# encode plaintext (get rid of non -alphabet chars)

msg = T.encoding(P); print ("msg: ", msg)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 78 — #40

i i

78 Paper-and-Pencil and Precomputer Ciphers

SageMath Example 2.2 (continued)

# encrypt plaintext by shifting to the left by 2 letters (do it in one �

� step)
C = T.enciphering(key , msg); print ("C: ", C)

# decryption key [shifting to the left by 12 letters (= 2 to the right) �

�]
# keyInv = [13, 14, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12]
shiftInv=keylen -shift;
keyInv = [(i+shiftInv).mod(keylen) + 1 for i in range(keylen)]; print (" �
� keyInv: ", keyInv , sep ="")

# decrypt by using keyInv and enciphering

DC = T.enciphering(keyInv , C); print ("1-DC:", DC)

# decrypt by using the "deciphering method with key" (without the need �
� to calculate keyInv)
# Remark: Strangely , using the deciphering method requires to change �
� the type of the variable key
# The following does not work: DC = T.deciphering(key , C); �
� print ("2-DC:", DC)
DC = T.deciphering(T(key).key(), C); print ("2-DC:", DC)

# Remarks about different representations of key and its inverse key

print ("\n---Remark 1: TranspositionCryptosystem describing itself and �
� the used cyclic groups ")
print ("T:", T)
print ("T(key).key(): ", T(key).key())
print ("T.inverse_key ():", T.inverse_key(T(key).key()))

The third example of transposition ciphers (SageMath Example 2.3) is only a

small modification of the second example. Here the order of the characters within
a block are just inverted.

SageMath Example 2.3: Simple Column Transposition Just Reverting Each

Plaintext Block
print ("\n# CHAP02 -- Sage -Script -SAMPLE 030: =========")

blocklen = keylen = 14
T = TranspositionCryptosystem (AlphabeticStrings (), keylen);

key = [ keylen -i for i in range(keylen) ]; print ("key: ", key , " �

� keylen: ", keylen , sep ="")

P = "THECATINTHEHAT"
msg = T.encoding(P); print ("msg:", msg , " msglen:", len(msg))
C = T.enciphering(key , msg); print ("C: ", C)
DC = T.deciphering(T(key).key(), C); print ("DC: ", DC)

#------------------------------------
# CHAP02 -- Sage -Script -SAMPLE 030: =========
# key: [14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1] keylen: 14
# msg: THECATINTHEHAT msglen: 14
# C: TAHEHTNITACEHT
# DC: THECATINTHEHAT

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 79 — #41

i i

2.8 Examples Using SageMath 79

In the fourth example of transposition ciphers (SageMath Example 2.4) we use

an arbitrary permutation as key in the encryption and decryption processes in order
to scramble the characters within each block (permutation block length = number
of columns in a simple column transposition). In addition, it shows the key space
of the simple column transposition.
If the block length is n, then the key must be a permutation on n symbols. This
example uses the method random_key() of the class TranspositionCryptosystem
to create a random key. Each call to random_key() most likely produces a differ-
ent key. Note that therefore your results (key and ciphertext) most likely will be
different from the results shown in the following example.

SageMath Example 2.4: Simple Column Transposition with Randomly

Generated (Permutation) Key and Showing the Size of the Key Space
print ("\n# CHAP02 -- Sage -Script -SAMPLE 040: =========")

blocklen = keylen = 14
A = AlphabeticStrings ()
T = TranspositionCryptosystem (A, blocklen);

# key_space () delivers generic info; order () of keyspace delivers the �

� number of keys
print ("T.key_space ():", T.key_space ())
print ("T.key_space ().order ():", T.key_space ().order ())
print(' {0:.2e}'.format( T.key_space ().order () ))

P = "a b c d e f g h i j k l m n o p q r s t u v w x y z a b"
msg = T.encoding(P); print ("msg:", msg)
key = T.random_key (); print ("key: ", key)
Pkey = Permutation(key); print (" Pkey:", Pkey , "Pkeylen :", len(Pkey))

# enciphering in one and in two steps

C = T.enciphering(key , msg); print ("1-C: ", C)

E = T(key)
C = E(msg); print ("2-C: ", C)

# deciphering
DC = T.deciphering(key , C); print ("DC: ", DC)

# Just another way of decryption: Using "enciphering" with the inverse �

� key
keyInv = T.inverse_key(key); print (" keyInv: ", keyInv)
PkeyInv = Permutation(keyInv); print (" PkeyInv :", PkeyInv , "PkeyInvlen �
� :", len(PkeyInv))
DDC = T.enciphering(keyInv , C)

# Test correctness of both ways of decryption

print ("msg == DC == DDC:", msg == DC == DDC) # Expect True

The fifth example of transposition ciphers (SageMath Example 2.5) demon-

strates the built-in encoding and ensures the correct message length (after encoding
it must be a multiple of the block length). Here we don’t apply padding.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 80 — #42

i i

80 Paper-and-Pencil and Precomputer Ciphers

SageMath Example 2.5: Simple Column Transposition (Demonstrates

Encoding and Ensures Correct Message Length)
print ("\n# CHAP02 -- Sage -Script -SAMPLE 050: =========")

keylen = 14
T = TranspositionCryptosystem (AlphabeticStrings (), keylen)
key = T.random_key (); print ("key: ", key , " keylen: ", keylen , sep ="")

# P = "The cat in the hat.--The cat in the hat.XYZ" or P = " �

� ABCDEFGHIJUUUU"
P = "The cat in the hat.--The cat in the hat." # plaintext
print ("P: ", P, " Plen:", len(P))

# Encoding takes only letters from the alphabet (and potentially �

� converts it to uppercase).
msg = T.encoding(P); print ("msg:", msg , " msglen :", len(msg))

# Ensure that the length of msg is a multiple of keylen

# if (len(msg)%keylen !=0): print (" Error: msg length isn 't a multiple of �
� blocklength. len(msg) =",len(msg))
rest = len(msg) % keylen
if (rest != 0):
print (" Error: msg length isn 't a multiple of blocklength. len(msg) �
� =", len(msg))
chunk_size = len(msg) - rest
msg = msg[0:chunk_size]
print (" Truncated message =", msg)
assert len(msg) % keylen == 0

# Enciphering only works , if the number of symbols in msg is a multiple �

� of keylen
C = T.enciphering(key , msg); print ("C: ", C)

# Decryption using method deciphering

DC = T.deciphering(key , C); print ("DC: ", DC)

print ("msg == DC:", msg == DC) # Expect True

2.8.2 Substitution Ciphers

Substitution cryptosystems are implemented in SageMath in the class
sage.crypto.classical.SubstitutionCryptosystem
SageMath Example 2.6 constructs a substitution cipher with a random
key. A random key can be generated using the method random_key() of the
class SubstitutionCryptosystem. Different keys determine different substitution
ciphers: With each call to random_key() a different result is expected to be returned.

SageMath Example 2.6: Monoalphabetic Substitution with Randomly

Generated Key
print ("\n# CHAP02 -- Sage -Script -SAMPLE 060: =========")

S = SubstitutionCryptosystem (AlphabeticStrings ())

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 81 — #43

i i

2.8 Examples Using SageMath 81

SageMath Example 2.6 (continued)

key = S.random_key (); print ("key: ", key , " keylen: ", len(key), sep �
� ="")

P = "MASC: Substitute this with something else using a random key"

print ("P: ", P, " Plen:", len(P))

msg = S.encoding(P); print ("msg:", msg , " msglen :", len(msg))

C = S.enciphering(key , msg); print ("C: ", C)
DC = S.deciphering(key , C); print ("DC: ", DC)

print ("msg == DC:", msg == DC) # Expect True

#------------------------------------
# CHAP02 -- Sage -Script -SAMPLE 060: =========
# key: ZMUPXCHBVGTIYLKDQOSENFWARJ keylen: 26
# P: MASC: Substitute this with something else using a random key �
� Plen: 60
# msg: MASCSUBSTITUTETHISWITHSOMETHINGELSEUSINGARANDOMKEY msglen �
� : 50
# C: YZSUSNMSEVENEXEBVSWVEBSKYXEBVLHXISXNSVLHZOZLPKYTXR
# DC: MASCSUBSTITUTETHISWITHSOMETHINGELSEUSINGARANDOMKEY
# msg == DC: True

2.8.2.1 Atbash Cipher

SageMath Example 2.7 constructs an Atbash cipher. Strictly speaking, Atbash like
Rot13 is a “code” not a “cipher,” because the key is fixed. See Section 3.1.

SageMath Example 2.7: Atbash (Substitution by Reverting the Alphabet)

print ("\n# CHAP02 -- Sage -Script -SAMPLE 065: =========")

A = AlphabeticStrings (); print(A)

# key = A([25-i for i in range(26)]); print ("key: ", key , sep ="")
lenA = len(A.alphabet ()) # lenA = 26
key = A([lenA -1-i for i in range(lenA)]); print ("key: ", key , sep ="")

# Atbash: using the Substitution cipher and reverted alphabet as key

S = SubstitutionCryptosystem (A); print(S)

msg = S.encoding (" Substitute this with something else .")

print ("msg:", msg , " msglen :", len(msg))

# shift the plaintext

C = S.enciphering(key , msg); print ("C: ", C)

# decrypt the ciphertext and ensure that it is the original plaintext

DC = S.deciphering(key , C); print ("DC: ", DC)

print ("msg == DC:", msg == DC) # Expect True

#------------------------------------
# CHAP02 -- Sage -Script -SAMPLE 065: =========
# Free alphabetic string monoid on A-Z
# key: ZYXWVUTSRQPONMLKJIHGFEDCBA
# Substitution cryptosystem on Free alphabetic string monoid on A-Z
# msg: SUBSTITUTETHISWITHSOMETHINGELSE msglen: 31
# C: HFYHGRGFGVGSRHDRGSHLNVGSRMTVOHV
# DC: SUBSTITUTETHISWITHSOMETHINGELSE
# msg == DC: True

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 82 — #44

i i

82 Paper-and-Pencil and Precomputer Ciphers

2.8.2.2 Caesar Cipher

SageMath Example 2.8 constructs a Caesar cipher. The key is a number describing
how far the alphabet is shifted.

SageMath Example 2.8: Caesar (Substitution by Shifting the Alphabet; Key

Explicitly Given, Step-by-Step Approach)
print ("\n# CHAP02 -- Sage -Script -SAMPLE 070: =========")

A = AlphabeticStrings () # plaintext/ciphertext alphabet

keylen = len(A.alphabet ()); # Alternative: keylen = len(A.gens ())
shift = 3; shiftInv=keylen -shift;
print (" keylen: ", keylen , " shift: ", shift , " shiftInv: ", shiftInv , sep ="")

S = SubstitutionCryptosystem (A)

# construct key for Caesar cipher

numkey = [(i+shift).mod(keylen) for i in range(keylen)] ### Now without +1 (after mod)
print (" numkey", numkey)
key = A(numkey); print ("key: ", key , " keylen: ", len(key), sep ="")

P = "Shift the alphabet three positions to the right ."

print ("P: ", P, " Plen:", len(P))

# encrypt message (method encoding can be called from A or from S)

msg = S.encoding(P); print ("msg:", msg , " msglen:", len(msg))
C = S.enciphering(key , msg); print ("C: ", C)

# decrypt message (using inverse key and enciphering instead of deciphering)

numkeyInv = [(i+shiftInv).mod(keylen) for i in range(keylen)]; print (" numkeyInv", numkeyInv)
keyInv = A(numkeyInv); print (" keyInv: ", keyInv , sep ="")

DC = S.enciphering(keyInv , C); print ("DC: ", DC)

print ("msg == DC:", msg == DC) # Expect True

2.8.2.3 Shift Cipher

The shift cipher can also be thought of as a generalization of the Caesar cipher.
While the Caesar cipher restricts us to shift exactly three positions along an
alphabet, the shift cipher allows us to shift any number of positions along the
alphabet.
In the above samples we applied the SubstitutionCryptosystem and built
Caesar as a special kind of substitution. In contrast, here Caesar can be built as a
special kind of shift cipher.
The shift cipher is implemented directly in the SageMath class

sage.crypto.classical.ShiftCryptosystem

In SageMath Example 2.9, we construct a shift cipher over the capital letters
of the English alphabet. We then encrypt a plaintext P by shifting it key positions
along the alphabet. Finally, we decrypt the ciphertext C and check whether the
result (DC) is indeed the original plaintext.
Shifting is a special way of substitution. The original Caesar cipher is just simply
a shift cipher whose shifting key is 3.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 83 — #45

i i

2.8 Examples Using SageMath 83

SageMath Example 2.9: Constructing the Caesar Cipher Using the Shift
Cipher
print ("\n# CHAP02 -- Sage -Script -SAMPLE 080: =========")

key = 3 # shift can be any integer number. Ancient Caesar used 3.

print (" shift key: ", key , sep ="")

# Caesar cipher using the Shift cipher instead of the more general Substitution cipher
S = ShiftCryptosystem(AlphabeticStrings ()); print(S)
msg = S.encoding (" Shift me any number of positions ."); print ("msg:", msg , " msglen :", len �
� (msg))

# shift the plaintext

C = S.enciphering(key , msg); print ("C: ", C)

# decrypt the ciphertext and ensure that it is the original plaintext

DC = S.deciphering(key , C); print ("DC: ", DC)

print ("msg == DC:", msg == DC) # Expect True

2.8.2.4 Affine Cipher

The affine cipher is implemented in the SageMath class
sage.crypto.classical.AffineCryptosystem
In SageMath Example 2.11, we construct an affine cipher with the key (a, b) =
(3, 13). A given plaintext P (strictly speaking the encoded message msg) is then
encrypted which results in the ciphertext C. The ciphertext is then decrypted and
the result DC is compared to the original plaintext.
Mathematically, an affine cipher can be described as follows: Let n be the num-
ber of letters in the alphabet (in the example n = 26, as the alphabet consists only
of the 26 capital letters). The individual letters of the plaintext P = ( p1 , p2 , ..., pk )
are encrypted as follows: ci = a · pi + b (mod n ).
The following applies:
- To the factor a: 0 < a < n and a relatively prime to n (coprime with 26).
- To the shift value b: 0 ≤ b ≤ n − 1.
As a is coprime to n, here a is one of the 12 numbers calculated in the SageMath
Example 2.10.

SageMath Example 2.10: n.coprime_integers()

sage: n=26
sage: n. coprime_integers (n)
[1, 3, 5, 7, 9, 11 , 15 , 17 , 19 , 21 , 23 , 25]

If a is not relatively prime to n, that is, gcd (a, n ) > 1, then the mapping cannot
be inverted, and the ciphertext cannot be uniquely deciphered. That would be the
case here with an even a and with a = 13.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 84 — #46

i i

84 Paper-and-Pencil and Precomputer Ciphers

In SageMath Example 2.11 we catch the case where a is not coprime to

n. Not doing so, the command AffineCryptosystem(AlphabeticStrings())! catches
this case and shows: ValueError: (a,b) = (4,13) is outside the range of
acceptable values for a key of this affine cryptosystem.
The key space is 12 · 26 = 312 (all possible combinations (a, b) result from the
12 values for a and the 26 different values for b).

SageMath Example 2.11: An Affine Cipher with Key (3, 13)

print ("\n# CHAP02 -- Sage -Script -SAMPLE 090: =========")

A = AlphabeticStrings ()
# int conversion needed: A.ngens () creates type <class 'int '>, but we need <class 'sage.rings. �
� integer.Integer '>
# via ngens () no hard coded "26" is needed.
n=Integer(A.ngens ()) # n = 26 = number of free alphabetic string monoid on A-Z.

# a and b must be < n (this is checked by AffineCryptosystem(AlphabeticStrings ()) too)

key = a, b = (3%n, 13%n); print (" affine key: ", key , sep ="")

cop_list=n.coprime_integers(n) # Here it 's necessary that n is of type Sage integer , not of �

� Python int
print (" coprimes of n=%d:" % n, cop_list)
if a not in cop_list: # a must be coprime to n
print ("Exit , because a is no coprime to 26.")
sys.exit (); # exit sage script

# create an affine cipher

AS = AffineCryptosystem(A); print(AS)
msg = AS.encoding ("The affine cryptosystem ."); print ("msg:", msg , " msglen :", len(msg))

# encrypt the plaintext using the affine key

C = AS.enciphering(a, b, msg); print ("C: ", C)

# decrypt the ciphertext and make sure that it is equivalent to the original plaintext
DC = AS.deciphering(a, b, C); print ("DC: ", DC)

print ("msg == DC:", msg == DC) # Expect True

We can also construct a shift cipher using the affine cipher. To do so, we need
to restrict keys of the affine cipher to be of the form (a = 1, b) where b is any
nonnegative integer.
To create the Caesar cipher using the affine cipher, the encryption/decryption
key must be (1, 3). SageMath Example 2.9 works analogously with the affine cipher
in SageMath Example 2.12.

SageMath Example 2.12: Constructing the Caesar Cipher Using the Affine
Cipher
print ("\n# CHAP02 -- Sage -Script -SAMPLE 100: =========")

key = a, b = (1, 3); print (" affine key: ", key , sep ="")

# construct a shift cipher using an affine cipher with a=1

AS = AffineCryptosystem(AlphabeticStrings ())
msg = AS.encoding (" Shift the alphabet by three positions to the right .")
print ("msg:", msg , " msglen :", len(msg))

# shift the plaintext to get the ciphertext

C = AS.enciphering(a, b, msg); print ("C: ", C)

# decrypt the ciphertext and ensure that it is the original plaintext

DC = AS.deciphering(a, b, C); print ("DC: ", DC)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 85 — #47

i i

2.8 Examples Using SageMath 85

SageMath Example 2.12 (continued)

print ("msg == DC:", msg == DC) # Expect True

#------------------------------------
# CHAP02 -- Sage -Script -SAMPLE 100: =========
# affine key: (1, 3)
# msg: SHIFTTHEALPHABETBYTHREEPOSITIONSTOTHERIGHT msglen: 42
# C: VKLIWWKHDOSKDEHWEBWKUHHSRVLWLRQVWRWKHULJKW
# DC: SHIFTTHEALPHABETBYTHREEPOSITIONSTOTHERIGHT
# msg == DC: True

2.8.2.5 Vigenère Cipher

The Vigenère cipher is implemented in the SageMath class
sage.crypto.classical.VigenereCryptosystem
For our ciphertext/plaintext space, we can work with the upper-case let-
ters of the English alphabet, the binary number system, the octal number sys-
tem, or the hexadecimal number system. SageMath Example 2.13 uses the class
AlphabeticStrings, which implements the English capital letters.

SageMath Example 2.13: Vigenère Cipher

print ("\n# CHAP02 -- Sage -Script -SAMPLE 110: =========")

# construct Vigenere cipher

keylen = 14
A = AlphabeticStrings ()
V = VigenereCryptosystem (A, keylen)

# Here , a random key of length keylen is generated.

# Alternatively , a key could be given explicitly like key = A('ABCDEFGHIJKLMN ')
key = V.random_key (); print ("key: ", key , " keylen: ", len(key), sep ="")

P = "The Vigenere cipher is polyalphabetic ."; print ("P: ", P, " Plen:", len(P))
msg = V.encoding(P); print ("msg:", msg , " msglen:", len(msg))

C = V.enciphering(key , msg); print ("C: ", C, " Clen: ", len(C))

DC = V.deciphering(key , C); print ("DC: ", DC)

print ("msg == DC:", msg == DC) # Expect True

#------------------------------------
# CHAP02 -- Sage -Script -SAMPLE 110: =========
# key: OHZZMJTRCFOWKN keylen: 14
# P: The Vigenere cipher is polyalphabetic. Plen: 38
# msg: THEVIGENERECIPHERISPOLYALPHABETIC msglen: 33
# C: HODUUPXEGWSYSCVLQHEYHCAFZLRNPLSHO Clen: 33
# DC: THEVIGENERECIPHERISPOLYALPHABETIC
# msg == DC: True

2.8.2.6 Hill Cipher

The Hill [40, 41] or matrix cipher43 is based on linear algebra and was invented by
Lester S. Hill in 1929. It was the first polygraphic cipher in which it was practical to

43. - CT1 Encrypt/Decrypt F Symmetric (classic) F Hill.

- CT2 Templates F Cryptography F Classical and CT2 Templates F Cryptanalysis F Classical.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 86 — #48

i i

86 Paper-and-Pencil and Precomputer Ciphers

Figure 2.13 Hill dialog in CT1 showing the operations and options available.

operate on more than three symbols at once. The Hill cipher is not important from a
security point of view, but because it was the first cipher trying to apply mathematics
to cryptography. The encryption key of this cipher is an invertible square matrix
(here called key) whose determinant is relatively prime to 26. Originally, plaintext
and ciphertext are vectors (P and C). The encryption and decryption processes use
matrix operations modulo 26: C = P · key (mod 26).
The Hill cipher is implemented in the SageMath class
sage.crypto.classical.HillCryptosystem
In SageMath Example 2.14, our plaintext/ciphertext space is the capital letters
of the English alphabet. The Hill cipher assigns each letter of this alphabet a unique
integer modulo 26. The size of the key matrix (also called its dimension) is not
restricted by the cipher.
Comparing the Hill implementation in CrypTool v1.4.42 and in SageMath
version 9.3:
• SageMath offers fast command-line operations; CT1 offers its functionality
within a GUI.
• SageMath offers for the key matrix any dimension; CT1 is restricted to a
matrix size between 1 and 10.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 87 — #49

i i

2.8 Examples Using SageMath 87

• SageMath allows negative numbers in the key matrix, and converts them
automatically into appropriate nonnegative numbers; CT1 doesn’t allow
negative numbers in the key matrix.
• SageMath always sets the first alphabet character to 0, only allows the 26
capital letters as an alphabet, and it uses only the multiplication variant
plaintext row vector · key matrix: C = P · key.
• CT1 offers to choose also 1 as value for the first alphabet character, you can
customize your alphabet within the text options dialog, and it also offers to
use a reverse multiplication variant: C = key ·P.
SageMath only provides the function for encryption and decryption for the
classic ciphers and usually with a very restrictive alphabet. You have to implement
methods for cryptanalysis yourself. A KPA against the Hill cipher is introduced
in Section 2.8.3.2.
While SageMath Example 2.14 calculates for the entered characters always
with their ASCII numerical values, SageMath Example 2.19 cannot only carry out
a KPA analysis, but also put the key matrix in front of the plaintext (order of the
multiplication) and the characters in the alphabet start from 0 or 1.44
Reference [43] is a very good article developing the formulas for how many
invertible Hill matrices there are for a given dimension (compared to the total
number of all matrices and to the number of involutory matrices).

SageMath Example 2.14: Hill Cipher with Given Key Matrix

print ("\n# CHAP02 -- Sage -Script -SAMPLE 120: =========")

keylen = 3 # Alternative key length: keylen=13 --- ensure msg length is a multiple of keylen
A = AlphabeticStrings ()
H = HillCryptosystem(A, keylen)

# Alternative 1: Non -random key creation (needs HKS; even H.key_space () is not enough)
HKS = H.key_space ()
key = HKS ([[1,0,1],[0,1,1],[2,2,3]]); print ("key: \n", key , sep ="")

# Alternative 2: Random key creation

# key = H.random_key (); print ("key: \n", key , sep ="")

# the key object has no method len(), but block_length ()

print (" block_length (): ", H.block_length (), " key.det(): ", key.det(), sep ="")

# encoding (Length of msg is a multiple of matrix dimension (block_length))

P = "HHill or matrix cipher uses matrix operations ."
print ("P: ", P, " Plen:", len(P))
msg = H.encoding(P); print ("msg:", msg , " msglen:", len(msg))

# encryption
C = H.enciphering(key , msg); print ("C: ", C, " Clen: ", len(C))

# decryption
DC = H.deciphering(key , C); print ("DC: ", DC)
print ("msg == DC:", msg == DC) # Expect True

# alternative way to decrypt using inverse matrix

keyInv = key.inverse (); keyInv
keyInv = H.inverse_key(key); print ("\ nkeyInv: \n", keyInv , sep ="")
DC = H.enciphering(keyInv , C); print ("DC: ", DC)
print ("msg == DC:", msg == DC) # Expect True

44. $ sage chap02_hill_enc_dec_kpa.sage -enc -dim 3 -pt ”ACTCAT” -A [[6,24,1],[13,16,10],[20,17,15]]”

encrypts the example “ACTCAT” to “POHFIN” [42].

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 88 — #50

i i

88 Paper-and-Pencil and Precomputer Ciphers

SageMath Example 2.14 (continued)

print ("\n---Remark: Output C as a sequence of ASCII characters and their according numbers ")
# print (" type(C):", type(C)) # 'sage.monoids.string_monoid_element.StringMonoidElement '
# 'StringMonoidElement ' object has no attribute to directly convert to integer sequence
from sage.crypto.util import ascii_to_bin , ascii_integer
# print (" a_to_b: ", ascii_to_bin(str(C)))
print ("C[i]:", [C[i] for i in range(len(C))])
print (" binary C[i]:", [ascii_to_bin(str(C[i])) for i in range(len(C))])
print (" integer C[i]:", [ascii_integer(ascii_to_bin(str(C[i]))) for i in range(len(C))])

2.8.2.7 Substitution with Symbols Using Not Only Capital Letters

Up to now, we used the capital letters for the classical ciphers. This set {A, B, ..., Z }
is the standard alphabet used for the classical ciphers in SageMath.
A substitution cipher can be considered as a stream cipher that acts on the plain-
text by making a substitution of the characters with elements of a new ciphertext
alphabet or by a permutation of the characters in the plaintext alphabet. Besides
the capital letters, the predefined functions for classical ciphers in SageMath only
offer the hexadecimal and binary systems as alphabets.
These alphabets can be called via the three functions that implement the free
string monoids (i.e., sets whose elements can be concatenated to any finite length):
S = AlphabeticStrings()
H = HexadecimalStrings()
B = BinaryStrings()

The following samples demonstrate that one can vary the alphabet. The first
two samples use the hexadecimal and the binary system. The last sample shows how
to define your own alphabet. This currently requires you to also write your own
cipher algorithm. We do this by defining an own MASC with a longer alphabet.

MASC with a hexadecimal alphabet. In SageMath Example 2.15, the hexadecimal

number system is used as a substitution alphabet for plaintext/ciphertext.

SageMath Example 2.15: Monoalphabetic Substitution with a Hexadecimal

Alphabet (and Decoding in Both SageMath and Python)
print ("\n# CHAP02 -- Sage -Script -SAMPLE 130: =========")

A = HexadecimalStrings ()
S = SubstitutionCryptosystem (A)

key = S.random_key (); print ("key: ", key , " keylen: ", len(key), sep ="")
print (" Number of possible keys: ", len(key), "! = ", factorial(len(key)), sep ="")

P = "Working with a larger alphabet ."; print ("P: ", P, " Plen:", len(P))
msg = A.encoding(P); print ("msg:", msg , " msglen:", len(msg))

C = S.enciphering(key , msg); print ("C: ", C, " Clen: ", len(C))

DC = S.deciphering(key , C); print ("DC: ", DC)

print ("msg == DC:", msg == DC) # Expect True

# Conversion hex in DC back to ASCII:

DDC = DC.decoding (); # print ("DDC:", DDC)
print ("P == DDC:", P == DDC) # Expect True

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 89 — #51

i i

2.8 Examples Using SageMath 89

SageMath Example 2.15 (continued)

## Remark: Other ways for the decoding transformation using sage.crypto.util

# - AlphabeticStrings () and HexadecimalStrings () don 't have an according method.
# - http :// doc.sagemath.org/html/en/reference/cryptography/sage/crypto/util.html
# from sage.crypto.util import ascii_integer
# print (" ascii_integer: ", ascii_integer ("01000100"))
# from sage.crypto.util import bin_to_ascii
# print (" bin_to_ascii :", bin_to_ascii ("01000100"))
#
## Remark: Alternative conversion hex back to ASCII , using native Python
# import binascii
# DDC = binascii.a2 b_hex(repr(DC));

#------------------------------------
# CHAP02 -- Sage -Script -SAMPLE 130: =========
# key: c7834de0f1a65b29 keylen: 16
# Number of possible keys: 16! = 20922789888000
# P: Working with a larger alphabet. Plen: 31
# msg: 576f726b696e6720776974682061206c617267657220616c7068616265742e msglen: 62
# C: d0e908e6e1e2e08c00e104ef8ce78ce5e708e0ed088ce7e50 cefe7e8ed0482 Clen: 62
# DC: 576f726b696e6720776974682061206c617267657220616c7068616265742e
# msg == DC: True
# P == DDC: True

MASC with a binary alphabet. In SageMath Example 2.16 the three cipher types,
monoalphabetic substitution, shift, and Vigenère are used. The alphabet symbols
here are in each case from the binary number system.
Except for the Vigenère cipher, which can be enhanced as a one-time pad, these
variants are very unsecure. Because the plaintext/ciphertext alphabet has only the
two elements 0 and 1, there are—for example, with monoalphabetic substitution—
then only two possible keys: (0 1) and (1 0). The key of a monoalphabetic
substitution cipher must contain all symbols of the alphabet exactly once.

SageMath Example 2.16: Different Substitution Ciphers on the Binary

Alphabet
print ("\n# CHAP02 -- Sage -Script -SAMPLE 140: =========")

# the plaintext/ciphertext alphabet is a binary sequence

B = BinaryStrings (); # print ("B", B); print ("B.alphabet ()", B.alphabet ())

# substitution cipher over the alphabet B; no keylen argument possible

S = SubstitutionCryptosystem (B)
print ("1. Substitution: alphabet_size :", S.alphabet_size ())

# Alternative: To get a substitute for each symbol , key has always the length of the alphabet
# key = S.random_key ()
key = B("10") # "10" inverts all bits , "01" leaves them unchanged; "0", "1", "00" and "11" �
� cause exception!
print ("key: ", key , " keylen: ", len(key))
# print ("## type key: ", type(key))

P = "MA -Substitution on binary alphabet is very unsecure (flip bit or not)."; print ("P: ", P, �
�" Plen:", len(P))
msg = B.encoding(P); print ("msg:", msg , " msglen:", len(msg))

C = S.enciphering(key , msg); print ("C: ", C, " Clen: ", len(C))

DC = S.deciphering(key , C); # print ("DC: ", DC)

print ("msg == DC:", msg == DC) # Expect True

S = ShiftCryptosystem(B) # Shift in the binary alphabet B which has only two elements.
print ("\n2. Shift: alphabet_size :", S.alphabet_size ())

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 90 — #52

i i

90 Paper-and-Pencil and Precomputer Ciphers

SageMath Example 2.16 (continued)

# Alternative: key = S.random_key (); print (" randK:", key)
# print ("## type key: ", type(key))
key = 1; print ("key:", key) # 1 inverts each bit; 0 leaves each bit unchanged.

P = "Shift on binary alphabet offers only 2 possibilities: very unsecure ."; print ("P: ", P, " �
� Plen:", len(P))
msg = B.encoding(P); print ("msg:", msg , " msglen:", len(msg))

C = S.enciphering(key , msg); print ("C: ", C, " Clen: ", len(C))

DC = S.deciphering(key , C); # print ("DC: ", DC)

print ("msg == DC:", msg == DC) # Expect True

keylen = 14;
# Alternative settings for alphabet and using a given key
# B = AlphabeticStrings (); key = B('ABCDEFGHIJKLMN '); print ("key:", key , " keylen: ", len(key) �
�)
# B = BinaryStrings (); key = B('11111111111111 '); print ("key:", key , " keylen: ", len(key))
V = VigenereCryptosystem(B, keylen)
print ("\n3. Vigenere: alphabet_size :", V.alphabet_size ())
key = V.random_key (); print (" randkey :", key , " keylen: ", len(key))

msg = V.encoding (" Vigenere on binary alphabet with long key is close to OTP ."); print ("msg:", �
� msg , " msglen :", len(msg))
C = V.enciphering(key , msg); print ("C: ", C, " Clen: ", len(C))
DC = V.deciphering(key , C); # print ("DC: ", DC)

print ("msg == DC:", msg == DC) # Expect True

MASC with a self-defined alphabet. SageMath Example 2.17 uses an augmented

alphabet—one that contains the alphabetic letters (upper and lowercase) plus
numbers and/or some extra symbols.

SageMath Example 2.17: MASC Over Own Alphabet

print ("\n# CHAP02 -- Sage -Script -SAMPLE 150: =========")

# Using own definitions for a classical cipher instead of the r2r Sage commands (r2r=ready -to - �
� run)
# An arbitrary alphabet made from A..Z, a..z, 0..9, and some symbols which are arranged in a �
� wished order.

import string
import random

alph1 Lower_string = string.ascii_lowercase

# print(alph1 Lower_string)
alph2 Upper_string = string.ascii_uppercase
# print(alph2 Upper_string)
alph3 Digits_string = string.digits
# print(alph3 Digits_string)
alph4 Punctuation_string = string.punctuation
# print(alph4 Punctuation_string)

# Determine the order of the elements of the alphabet by ordering the 4 parts
alphabet = alph1 Lower_string + alph2 Upper_string + alph4 Punctuation_string + alph3 Digits_string
print(alphabet)

print('Length alphabet:', len(alphabet))

print('Check: Value of letter B in this alphabet:', alphabet.index('B'))

# Shuffle rearranges the given object. As strings and tuples are immutable ,
# we have to use random.sample () instead of random.shuffle ().
random.seed(int(15)) # argument not necessary. Initialized the PRNG just to have always the �
� same to compare with.
# argument 15 without casting throws TypeError: The only supported seed types are: None , �
� int , float , str , bytes , and bytearray.
r_alphabet = ''.join(random.sample(alphabet , len(alphabet)))

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 91 — #53

i i

2.8 Examples Using SageMath 91

SageMath Example 2.17 (continued)

print('1. shuffled alphabet:', r_alphabet)
r_alphabet = ''.join(random.sample(alphabet , len(alphabet)))
print('2. shuffled alphabet:', r_alphabet)

# Use these two alphabets to build a MASC

PA = alph1 Lower_string + alph2 Upper_string + alph4 Punctuation_string + alph3 Digits_string
print('\ nPlaintext alphabet PA :', PA , ' Length of PA ', len(PA))

random.seed(int(0)) # Initialized the seed to generate a fixed permutation

CA = ''.join(random.sample(PA , len(PA)))
print('Ciphertext alphabet CA:', CA , ' Length of CA ', len(CA))

codetableC2P = str.maketrans(CA ,PA) # requires the 2 strings CA , PA to have the same len
codetableP2C = str.maketrans(PA ,CA) # requires the 2 strings CA , PA to have the same len

P1 = '''ATESTZtestTEST1234 '''
P2 = '''DWHVWCCNBCWHVWNOPQ '''

C1 = P1.translate(codetableP2C);
C2 = P2.translate(codetableP2C);

P1 _revealed = C1.translate(codetableC2P);
P2 _revealed = C2.translate(codetableC2P);

2.8.3 Cryptanalysis of Classical Ciphers with SageMath

Of course, you can also do the cryptanalysis of classical methods with Python
or SageMath. Good examples can be found in Kohel’s book [44] on page
19 ff and page 110 ff. You can also find cryptanalysis examples for affine
ciphers and shift ciphers at https://doc.sagemath.org /pdf/en/reference/
cryptography/cryptography.pdf. Very sophisticated cryptanalysis methods for
classical ciphers are part of CT2.
Two simple analyses are presented here as examples. For a shift cipher, the
brute-force method built-in SageMath is used to match a ciphertext to the correct
plaintext and key. A self-written program is presented for the Hill cipher: The attack
determines the correct key from a given plaintext/ciphertext pair.

2.8.3.1 Cryptanalysis with SageMath: Ciphertext-Only Attack Against Shift Cipher

The Caesar method built into SageMath has only 26 possible keys for an alpha-
bet consisting of 26 capital letters. So a brute-force approach needs to try very
few possibilities. This is why brute force is already integrated as a method in the
cryptographic procedure ShiftCryptosystem.45
It is clear that one of the 26 possible keys again generates the plaintext. There-
fore, some authors subtract this key from the key space. But this is a matter of
definition and the majority of the authors like [45] count it to the key space. In
[46, page 27], these keys are referred to as “trivial.” They form together with the
“non-trivial” keys the key space. So Caesar has 25 nontrivial keys and one trivial
key, the key set has 26 elements, and the key space has the value 26.

45. In SageMath 9.7, only the two classic cryptosystems ShiftCryptosystem and AffineCryptosystem
have this method built in. For example, SubstitutionCryptosystem does not have it because the
search space (26!) is too large. See https://doc.sagemath.org/html/en/reference/cryptography/
sage/crypto/classical.html.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 92 — #54

i i

92 Paper-and-Pencil and Precomputer Ciphers

In SageMath Example 2.18, first the built-in method brute_force is executed

and then a statistical test (chi-square or squared differences) is used, which finds
with a high probability the right plaintext (in human language)—depending on the
length of the given ciphertext.
The plaintext found does not contain any spaces, since these do not belong to
the alphabet. Thus, the words are not clearly separated from each other.46

SageMath Example 2.18: Ciphertext-Only Attack Against Shift Cipher

(Caesar)
print ("\n# CHAP02 -- Sage -Script -SAMPLE 160: =========")

# Find the most likely plaintext of a ciphertext encrypted via a shift cipher
# - Instead of explicitely looping over all revealed plaintexts ,
# use the built -in brute -force method and apply a statistical measure
# - automated ciphertext -only attack against Caesar

# pt = "Hello this is a test. Please enter your text here ."

# ct = "Pmttw BpqA qA i BmAB. XtmiAm mvBmz GwCz BmFB pmzm ." # via k=8
ct = " PmttwBpqAqAiBmABXtmiAmmvBmzGwCzBmFBpmzm "
print(f"Given ciphertext ct:\n{ct}")

S = AlphabeticStrings ()
E = ShiftCryptosystem(S)

# both , ct and ctstr have the value PMTTWBPQAQAIBMABXTMIAMMVBMZGWCZBMFBPMZM

ct = S.encoding(ct) # type: sage.monoids. string_monoid_element.StringMonoidElement '
ctstr = str(ct) # type: str

print ("\n---------- brute -force , No ranking ")

dict = E.brute_force(ct) # type(dict)=dict
print( *sorted(dict.items ())[:26], sep ="\n" ) # output one element per line

print ("\n---------- chi_square ranking ")

L = E.brute_force(ct , ranking =" chisquare ") # type(L))=list
print (*L[:5], sep ="\n") # display only the top 5 candidate keys and plaintexts

print ("\n---------- squared_differences ranking ")

L = E.brute_force(ct , ranking =" squared_differences ") # type(L))=list
print (*L[:5], sep ="\n") # display only the top 5 candidate keys and plaintexts

print ("\n---------- Probably correct values ")

print (" Probable correct key: ", L[0][0])
print (" Probable correct pt: ", L[0][1])

2.8.3.2 Cryptanalysis with SageMath: KPA Against Hill Cipher

The Hill cipher is very difficult to break if only ciphertext is given, but it is vulnera-
ble to KPAs. Given the corresponding plaintext and ciphertext, it is very likely that
the key (matrix) can be determined.
Now the SageMath program hill_enc_dec_kpa.sage will be presented, which
can execute a known-plaintext attack against the Hill cipher. While SageMath
46. Within CT2 Startcenter F Templates F Tools you can find the template “Split a Text” which recog-
nizes words of different languages and restores the separators in the revealed plaintext almost automatically
– With only little manual reworking (see Figure 2.14):
HELLOTHISISATESTPLEASEENTERYOURTEXTHERE.
→ HELLO THIS IS ATE S T PLEASE ENTER YOUR TEXT HERE.
→ HELLO THIS IS A TEST PLEASE ENTER YOUR TEXT HERE.
Instead of using CT2, you can also do the splitting very well with AI tools like ChatGPT [47] or
YouChat [48].

i i

94 Paper-and-Pencil and Precomputer Ciphers

SageMath Example 2.19: KPA Against the Hill Cipher

# Apply Hill cipher from SageMath in 3 modes: encrypt , decrypt , or do �
� known -plaintext analysis.
# - these modes can be set via command line option. Internally they �
� call the
# functions PerformEncDev ("Enc"), PerformEncDev ("Dec"), or PerformKPA �
� ()
# - the data used can be set
# - either via -s SelectedExampleNo (to select one of the predefined �
� examples stored in JSON syntax)
# - or by entering all arguments on the command line [they overwrite �
� values from a stored sample , if both are given]
#
# Usage: $ sage chap02 _hill_enc_dec_kpa.sage [-h] [-V] (-enc | -dec | �
� -kpa) [-v] [-dim DIM]
# [-pt STRING] [-ct STRING] [-A STRING] [-i �
� INDEX] [-kl] [-s SAMPLE]

References

[1] ACA, Length and Standards for All ACA Ciphers, 2021, https://www.cryptogram.org/re-
sourc e-area/cipher-types/.
[2] Bion, Recreational Cryptography Programs, https://williammason.github.io/rec-crypt/.
[3] Pilcrow, P., CryptoPrograms, http://www.cryptoprograms.com.
[4] Singh, S., The Code Book: The Science of Secrecy from Ancient Egypt to Quantum
Cryptography, New York: Anchor Books, 1999.
[5] Goebel, G., Codes, Ciphers and Codebreaking, Version 2.3.2, 2014, http://web.archive
.org/web/20151027000247/http://www.vectorsite.net/ttcode.html.
[6] Savard, J. J. G., A Cryptographic Compendium, 1999, http://www.quadibloc.com/cryp-
to/jscrypt.htm.
[7] ThinkQuest Team 27158, Data Encryption, 1999.
[8] Knight, K., B. Megyesi, and C. Schaefer, Copiale Cipher; Scaled Page 16/17, Wikimedia
Commons, 2011, https://commons.wikimedia.org/wiki/File:Copial e-cipher09s.png.
[9] Sanguino, L. A. B., et al., “Analyzing the Spanish Strip Cipher by Combining Combinato-
rial and Statistical Methods,” in Cryptologia, Vol. 40, No. 3, 2016, pp. 261–284, https://
www.semanticscholar.org/paper/Analyzing-the-Spanish-strip-cipher-by-combining-and-
Sanguino-Leander/b4278e62c804ec0bf349a1e5c74a1b35bb276d83.
[10] Drobick, J., Abriss DDR-Chiffriergeschichte: SAS- und Chiffrierdienst, 2015, http://scz
.bpla ced.net/m.html#dwa.
[11] Schneier, B., The Solitaire Encryption Algorithm, v. 1.2, 1999, https://www.schneier
.com/ac ademic/solitaire/.
[12] Crowley, P., Mirdek: A Card Cipher Inspired by “Solitaire,” 2000, http://www.cipher-
goth.org/crypto/mirdek/.
[13] Géraud-Stewart, R., and D. Naccache, “A French Cipher from the Late 19th Century,”
Cryptologia, 2020, pp. 1–29, https://doi.org/10.1080/01611194.2020.1753265.
[14] Kallick, B., Handycipher: A Low-Tech, Randomized, Symmetric-key Cryptosystem,
Cryptology ePrint Archive, Report 2014/257, 2014, https://eprint.iacr.org/2014/257.
[15] Kaminsky, A., ElsieFour: A Low-Tech Authenticated Encryption Algorithm for Human-
to-Human Communication, Cryptology ePrint Archive, Report 2017/339, 2017,
https://eprint.iacr.org/2017/339.

i i

Historical cryptology studies (original) encrypted manuscripts, often handwritten

sources, produced in our history. These historical sources can be found in archives,
often hidden without any indexing and therefore hard to locate. Once found they
need to be digitized and turned into a machine-readable text format before they
can be deciphered with computational methods. The focus of historical cryptology
is not primarily the development of sophisticated algorithms for decipherment, but
rather the entire process of analysis of the encrypted source from collection and
digitization to transcription and decryption. The process also includes the interpre-
tation and contextualization of the message set in its historical context. There are
many challenges on the way, such as mistakes made by the scribe, errors made by
the transcriber, damaged pages, handwriting styles that are difficult to interpret,
historical languages from various time periods, and hidden underlying language of
the message. Ciphertexts vary greatly in terms of their code system and symbol sets
used with more or less distinguishable symbols. Ciphertexts can be embedded in
clearly written text, or shorter or longer sequences of cleartext can be embedded in
the ciphertext. The ciphers used mostly in historical times are substitutions (simple,
homophonic, or polyphonic), with or without nomenclatures, encoded as digits or
symbol sequences, with or without spaces.
So the circumstances are different from those in modern cryptography which
focuses on methods (algorithms) and their strengths and assumes that the algo-
rithm is applied correctly. For both historical and modern cryptology, attack vectors
outside the algorithm are applied like implementation flaws and side-channel
attacks.
In this chapter, we give an introduction to the field of historical cryptology and
present an overview of how researchers today process historical encrypted sources.

3.1 Introduction

Historical cryptology deals with the encryption and decryption of historical, man-
ually constructed ciphers. An encrypted source usually counts as historical if it has
been produced no later than the mid-20th century. There is no exact break-even
point; however, the development of telegraphy (from the 1830s) led to more sophis-
ticated and complex mathematical methods applied to encryption requiring more
advanced cryptanalysis.
Historical cryptology involves the field of cryptography (the art and science of
code making and the encryption of messages), and the field of cryptanalysis (the
art and science of code breaking [1], i.e., the decipherment of messages without the

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 98 — #2

i i

98 Historical Cryptology

key). In everyday language, the terms “cryptography” and “cryptology” are often
used interchangeably.
As in all scientific fields, historical cryptology has its own terminology. We start
the chapter by introducing the most important terms and give a brief overview of
historical ciphers and keys before we move on to the components needed to process
and decipher historical ciphers. Given that historical cryptology as a scientific field
of study is rather new, the terminology standards and the usage of the terms are
still to be established and under discussion in the historical cryptology community
(see for example [2] and [3]). We summarize the important terms in Figure 3.6 and
Table 3.1, as well as illustrate the crypto process in Figure 3.7.
A cipher (sometimes also “cypher,” which is simply the old spelling) refers to an
algorithm that describes the procedure of encryption or decryption. The encrypted
source itself is called ciphertext, though the terms “code” and “cipher” are often
not distinguished from ciphertext in everyday language. A ciphertext consists of a
sequence of symbols from a ciphertext alphabet. The ciphertext alphabet can be
the same as the plaintext alphabet (e.g., the Latin letters), but often it consists of
different symbol systems and alphabets, such as Greek letters, digits, graphic signs
(e.g., alchemical or zodiac signs), or Chinese hieroglyphs. Figure 3.1 illustrates the
variation of the symbol systems from ciphertext alphabets in three ciphertexts. The
ciphertexts are extracts taken from the Borg cipher [4], a digit-based cipher from
the National Archives of Sweden [5], and the Copiale cipher [6].
In ciphertexts, we can find regular usage of space marking word boundaries
as in the Borg cipher (see Figure 3.1) even though most of the ciphertexts from
the past use continuous script (scriptio continua) without any spaces, as shown in
the examples from the Swedish National Archives and the Copiale cipher. Word
boundaries were often removed in historical ciphers to make codebreaking more
difficult. A ciphertext might also contain additional information such as accents
and other diacritics, or punctuation marks appearing more or less systematically in
connection to symbols, as in the example of the Copiale cipher. We can also find
overwritings for corrections, underlined sequences, and unintentional ink spots in
the manuscripts.

Figure 3.1 Three examples of ciphertexts. (From: [4–6].)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 99 — #3

i i

3.1 Introduction 99

Apart from the ciphertext in an encrypted message, nonencrypted sequences of

texts that we call cleartext are also common. For example, the first line of the Borg
cipher in Figure 3.1 contains a cleartext “Contra dissenteriam” in Latin. Cleartext
passages bear important information about the possible underlying language(s) and
the topic of the encrypted source.
The encrypted source might also include decrypted plaintext, written on the
same page, often found above the ciphertext lines, as illustrated in Figure 3.2.
A cipher key (Figure 3.3) using a given cipher defines how to encrypt a plain-
text and how to decrypt a ciphertext. Historical cipher keys usually contain a list of
plaintext elements (letters, syllables, words, names, phrases) and the corresponding
symbol or combination of symbols taken from the ciphertext alphabet, henceforth
the code elements. Two examples of cipher keys are shown in Figures 3.3 and 3.4.

Figure 3.2 Ciphertext (underlined), cleartext (in red), and plaintext (in blue) in an encrypted
manuscript.

Figure 3.3 Cipher key: simple substitution. (Flanders, 1596 [7].)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 100 — #4

i i

100 Historical Cryptology

In both keys, the letters of the plaintext alphabet (A–Z) are listed horizontally in
the first line of the key tables. Moreover, underneath each plaintext letter, we can
find either one (Figure 3.3) or several ciphertext symbols (Figure 3.4), henceforth
alphabet-code elements, assigned to each plaintext letter. In Figure 3.3 these single
ciphertext letters are taken from the plaintext alphabet but in a different position. In
Figure 3.4, on the other hand, the lengths of the alphabet-code elements vary; two-
digit code elements to encode the plaintext alphabet and three-digit code elements
to encode the words. Note that the most frequently occurring plaintext alphabet
letters have four alphabet-code elements, whereas the least frequent ones received
three code elements. Adding several code elements to the frequently occurring plain-
text elements leads to an increased difficulty of decipherment and renders a cipher
homophonic.
In the columns of both keys we find a shorter or longer list of plaintext elements
(names, content, and function words) with code elements assigned to each. Such
a list as part of the key is called nomenclature, sometimes also spelled nomencla-
tor. Sometimes the entire key that contains a nomenclature (i.e., a list of plaintext
elements) is called a nomenclator. Here, we make a distinction between the vari-
ous parts of the key. The nomenclature shown in Figure 3.3 consists of roughly 100
items in which we can see code elements using a single ciphertext symbol, for exam-
ple “A” for “Royne d’Angleterre” and others with multiple ciphertext symbols,
such as “12” for “Siuille.” Here, the various types of nomenclature elements receive

Figure 3.4 Cipher key: simple and homophonic substitution. (Hungary, 1703–1711 [8].)

i i

archives, libraries, museums, or private collections. They might be difficult to find
as they are hardly indexed as ciphers in archive or library catalogs. Only a small
but increasing percentage of the historical encrypted sources are digitized and made
available online, and even fewer are turned into a computer-readable text format.
Finding, analyzing, and deciphering encrypted manuscripts are challenging and
need various kinds of expertise. In this section, we give a bird’s-eye view on the
different steps and components involved in processing encrypted manuscripts from
collection through transcription to decipherment, as illustrated in Figure 3.9. Then
we describe each step of the process in detail in the subsequent sections.
Collecting encrypted sources requires knowledge about the whereabouts of the
documents. Once found, the documents need to be digitized, turned into images,
and described with a set of metadata according to some standard. Information can
include the sender and receiver of the documents, the time and place when the
encrypted source was produced or sent, and a description of its content. Describ-
ing historical sources in terms of metadata is as important as the content of the
document itself.
Before we can cryptanalyze a ciphertext, we usually need to transcribe it (i.e.,
turning the ciphertext image(s) into a computer-readable text format). By doing so,
we look closely at the symbol set and group the similar ciphertext symbols into
types, which helps us in the identification of the entire ciphertext alphabet. A tran-
scription is a text representing the ciphertext symbols from the image(s) symbol by
symbol and line by line. This requires interpreting the handwriting style and motion
educated guesses about the intentions of the scribe; in other words to interpret the
handwriting. The transcription needs to be thorough; all symbols, diacritics, punc-
tuation marks, and spaces must be transcribed to avoid error propagation during
decipherment.
Given a (couple of lines of) transcription we can go on with the cryptanalysis.
First, we need to segment the ciphertext into code elements and analyze the fre-
quencies and co-occurrences of the various symbol types and code elements. We
need to make educated guesses about the cipher type and about the underlying
language. Dictionaries and language models for various time periods might be of
help on the way when guessing the plaintext underneath. Once we have a decrypted
text, we interpret the plaintext, correct wrongly transcribed symbols, and adjust the
assumed key to get an appropriate and reasonable plaintext output. We might then
translate the text to one or several languages, and set the plaintext in a historical
context; what was written, by whom, to whom, and why.
Deciphering a ciphertext—albeit lots of fun—is often challenging. In the past,
many historians and people worked individually in an uncoordinated fashion on
the identification and deciphering of secret writings. Without access to automatic
methods that can accelerate the decipherment, it’s a time-consuming process. At the
same time, cryptanalysts, computer scientists, and computational linguists develop
automatic cryptanalysis algorithms to identify cipher types and to break various
ciphers without having access to real historical ciphertexts.
To coordinate the efforts of various expertise and build research infrastructure
in terms of resources and tools for historical cryptology, an international research
program was created in 2018: the DECRYPT project [12]. The aim of the project

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 105 — #9

i i

3.2 Analyzing Historical Ciphers: From Collection to Interpretation 105

was to establish a new cross-disciplinary scientific field of historical cryptology by

bringing the expertise of the different disciplines together to digitize encrypted his-
torical documents, build a database of historical ciphers, and develop software tools
for transcription and cryptanalysis. We are not aware of any other cross-disciplinary
project in the field that takes a holistic approach from collection through transcrip-
tion to decipherment by developing open-source resources and tools for historical
cryptology in large scale. Therefore, we base this chapter largely on the experiences
and findings of the results of the cross-disciplinary cooperation in the DECRYPT
project. However, there are many relevant high-quality studies on various aspects
of historical cryptology and we will refer to the most prominent ones in the relevant
parts of the subsequent sections.
To be able to study the characteristics of historical ciphers with the ultimate
goal to decipher all cipher types from historical times, we need a large set of his-
torical sources to be collected and stored from various places and time periods.
The DECODE database [13] was created to store images of ciphertexts, encryption
keys, and information about their provenance, transcriptions, and possible decryp-
tions. The process of (semi-) automatic decryption involves, as mentioned before,
transcription by applying image recognition to automatically convert the images to
machine-readable format and a mapping of symbols to a transcription scheme. The
detection of the underlying plaintext language of the ciphertext on the basis of his-
torical text sources, the automatic identification of the cipher type, the cryptanalysis
of the ciphertext, and finally its decryption are taken care of in the cryptanaly-
sis step. The methods developed are based on a wide range of algorithms: from
classical cryptanalysis to advanced deep-learning architectures taken from artificial
intelligence. Various (neural network) models for transcription are released in the
TranscriptTool [14] (see Figure 3.8), while the algorithms for cryptanalysis have
been implemented in CrypTool 2 (CT2) [15]. CT2 is called in a command line
version on the webserver of the DECRYPT pipeline. Both tools (TranscriptTool

Figure 3.8 TranscriptTool for creating transcriptions of scanned historical manuscripts, offered as part of the
DECRYPT pipeline.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 106 — #10

i i

106 Historical Cryptology

i i
i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 114 — #18

i i

114 Historical Cryptology

Table 3.2 Transcription of the

116 Historical Cryptology

Figure 3.13 Ciphertext cryptanalyzed with the program CrypTool Transcriber and Solver.

analysis addresses the problem of automatically recognizing document content

being it printed text, handwritten text, or graphic elements. Traditionally, optical
character recognition (OCR) programs recognize clusters of pixels as letters and, at
a higher level, validate joint interpretations to end up transforming a digital image
into an editable text file. Despite advances during the last decades, reading sys-
tems still have limitations, and document analysis research must advance to offer
large-scale solutions. In the case of historical handwritten documents, the different
handwriting styles, the paper degradation, or the use of ancient languages makes
the recognition difficult. Moreover, the use of unknown alphabets, which is com-
monly the case in such encrypted sources, makes its automatic transcription even
more challenging. For this reason, recognition methods must be guided by human
experts, and, once the transcription is provided, it must be validated to correct any
transcription errors.
Typically, the stages when recognizing text include preprocessing, layout seg-
mentation, and transcription. Given that labeled data (transcribed data) is often not
available, the recognition methods are divided into learning-free and learning-based
techniques. Next, the main stages of automatic transcription are described.

3.4.3.1 Document Preprocessing

The processing of the image includes those techniques that are usually applied
after the digitization of the document. These techniques are essentially applied for
improving the quality of the images to make the document more readable, both for
people and also for automatic reading systems. In the case of very old and poorly
preserved documents, it is necessary to apply document enhancement techniques
for minimizing show-through or bleed-through effects, paper discoloration, or loss
of ink intensity. Although many document enhancement methods can be directly
applied to any input document image, recent deep learning-based methods, such as

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 117 — #21

i i

3.4 Transcription 117

generative adversarial networks and transformer networks [26] have demonstrated

a superior performance. They need similar (labeled) data to train such systems for a
good performance. An example of a document enhancement, which includes bina-
rization, is shown in Figure 3.14. Binarization here means converting a color or
grayscale image into a binary image with only black and white pixels.

3.4.3.2 Layout Segmentation

Once the document has been preprocessed and enhanced, the central area of the page
must be identified within the image. Layout analysis methods aim to identify the
structure and nature of the regions within the document. Many historical documents
contain heterogeneous contents, such as text, drawings, or music scores. In the case
of ciphertexts, this stage is usually focused on detecting the blocks of text and sep-
arating them into lines, words, and ideally, into characters/symbols [27]. However,
in many manuscripts, symbols are touching or even overlapping, which makes the
segmentation at symbol level difficult, as shown in Figure 3.15 (see the bounding
boxes in red color). In such cases, it is preferable to opt for learning-based models.

3.4.3.3 Text/Cipher Recognition

Once the structure of the document has been analyzed and the text regions,
lines, and/or symbols have been extracted, these are processed to obtain the final

Figure 3.14 Example of a document enhancement method (binarization).

Figure 3.15 Example of symbol segmentation and transcription; segmentation shown in red, transcription
in blue.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 118 — #22

i i

3.5.2.1 Hill Climbing

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 127 — #31

i i

3.5 Cryptanalysis 127

physical properties of the material. While in physical annealing, the real temper-
ature is slowly decreased; with simulated annealing a virtual temperature value is
used. The basic simulated-annealing algorithm consists of six steps:

1. Select a randomly chosen start key k

2. Set the temperature to a start value t := tstart
3. Decrypt the ciphertext ct to get pt := decrypt(ct, k )
4. Compute the cost value of pt with f := cost( pt )
5. Loop while t > 0

a. Generate a new key k 0 , which is a slightly modified k

b. Decrypt the ciphertext ct to get pt 0 := decrypt(ct, k 0 )
c. Compute the cost value of pt 0 with f 0 := cost( pt 0 )
d. If f 0 ≥ f then assign f := f 0 and assign k := k 0 else

• Compute a degradation value d := − abs( f − f 0 )

d
• Compute an acceptance probability p = e t
• Choose a random value r in the interval ] 0 ; 1 [
• If p > pmin and r < p then assign f := f 0 and assign k := k 0

e. Decrease temperature, for example, by using a defined step size ss to get

t := t − ss

6. Output the key k (which most likely is the correct key kc )

In step (2) a start temperature is set. The start temperature, among other new
values needed for simulated annealing, has to be tweaked for each type of cipher
and often also for each individual ciphertext, which you want to cryptanalyze. The
termination criterion in step (5) now checks if the temperature t is still higher than
0. Inside the main loop of the algorithm, when a key k 0 is not accepted in step (5d),
a probability p based on the degradation value is computed and a random value r is
chosen. If r is smaller than the computed probability and the computed probability
is greater than a minimum probability pmin , the bad key is kept. In practice, we
set the minimum probability to pmin = 0.85%, which gave us good results. This
allows the simulated-annealing algorithm to jump away from local maxima. While
the algorithm is being executed, the temperature value t is reduced by a step size
ss. The value of s is predefined and can be determined, for example, by dividing
the start temperature by the number of wanted steps s, and then the algorithm
should perform. So ss := tstart s . Other temperature reduction strategies are also
possible. For example, instead of reducing the temperature by the same value ss all
the time, it could also be reduced by a percentage value of t with t := t − 0.01 · t.
The different strategies have to be evaluated to find the best one for the specific
case. Figure 3.20 shows a simulation of the key acceptance probability of simulated
annealing over time with a fixed temperature step size and Figure 3.21 shows a
simulation of simulated annealing with a percentage-based temperature step size.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 128 — #32

i i

128 Historical Cryptology

Figure 3.20 Key acceptance probability of simulated annealing with linear decreased temperature
over time.

Figure 3.21 Key acceptance probability of simulated annealing with percentage decreased tem-
perature over time.

Improving simulated annealing for homophonic substitution ciphers. During the

cryptanalysis of the homophonic substitution cipher, plaintext letters from the
plaintext alphabet are assigned to all homophones and the ciphertext is decrypted
for testing. During a single iteration of the simulated-annealing algorithm, we swap
the assignments of two plaintext letters. As with the simple substitution, we test all
possible two-letter swaps of all homophones.
In the following, we present some adaptions and strategies to be applied to
the simulated-annealing algorithm to improve its performance, especially for the
cryptanalysis of homophonic substitution ciphers.

1. Good start keys. With the homophonic substitution cipher, it is helpful

when the start keys for the cryptanalysis algorithm are already chosen in a

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 129 — #33

i i

3.5 Cryptanalysis 129

way that reflects the distribution of letter frequencies of the language. For
example, it is better to assign more homophones to more frequent plaintext
letters (e.g., the “E” with English) than to less frequent letters (e.g., the “X”
with English). Therefore, the Homophonic Substitution Analyzer of CT2
allows distributing the letters among the homophones based on probabilities
that are based on the original text frequencies of the language.
2. Homophone locking (manual). When analyzing homophonic substitution
ciphers, it may improve the cryptanalysis if already correctly assigned let-
ters can be fixed by the user. The Homophonic Substitution Analyzer of CT2
allows this in the semiautomatic mode. Here, the user may pause the analysis
and lock homophones, meaning the corresponding assignment of plaintext
letters to the homophones cannot be changed anymore by the cryptanaly-
sis algorithm during the further iterations. Also, the user may change and
correct the already made assignments.
3. Homophone locking (automatic with a dictionary). Besides manually lock-
ing homophones as described above, it is possible to automatically lock
homophones based on words found in a dictionary. Therefore, the Homo-
phonic Substitution Analyzer of CT2 provides a dictionary to the cryptanal-
ysis algorithm. Every time a new global best value (best key) is found, the
analyzer searches for words with a minimum and maximum length. If it finds
more words than a specified threshold value, it automatically locks all corre-
sponding homophones to their corresponding plaintext letters. This can also
be combined with the manual method for homophone locking described in
the second adaption.

3.5.3 Cost Functions

While optimizing a key k with hill climbing or simulated annealing, the algorithm
needs a way to decide if a modified key k 0 is better or worse than the original key
k. To rate a key, we use cost or fitness functions on the text previously decrypted
with the key k 0 .
The basic idea of a cost function cost (t ) is that it calculates a number that
reflects how natural a given text t is. The closer the text is to a real text, the higher
the cost value should be. The more random (not natural) a text is, the lower the
cost value should be. In the best case, the cost function returns the highest value
when we enter the original plaintext. Between the lowest and the highest value,
there should be a smooth curve that the cryptanalysis algorithm can follow during
the optimization of the key.
A common practice is to use a language model built from a large text corpus.
For historical ciphers, it also turned out that the cryptanalysis algorithm can benefit
from using a language model based on a historical text corpus [36]. A language
model returns the probability of a given text being a text of the language it was
built for.
The language models used in our cost functions are n-gram models. Such an
n-gram model provides a value (probability) for a given n-gram. Clearly, frequent
n-grams of the language, such as “ING” in English, return a higher n-gram value
than less frequent n-grams, such as “XYZ.” An overview of English and German

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 130 — #34

i i

130 Historical Cryptology

language frequencies can be found in CrypTool-Online4 and a set including different

other language n-grams can be found on Practical Cryptography.5
We created different language models by using large corpora of text. To create
a model, we first count the number of occurrences of all individual n-grams (e.g.,
from “AAA” to “ZZZ” for a 3-gram model) of the set. Also, we count the total
number of all n-grams of the corpus. Then, for each individual n-gram, we divide
its number by the number of all n-grams to obtain its probability. To compute
the cost value of a given text, we could multiply all the computed values of all n-
grams of that particular text to obtain a probability (the cost value) of the text.
Here, we have two problems: (1) the probability values of each n-gram are very
small numbers, which will result in many precision errors when multiplying these
numbers on a computer, and (2) multiplications can be costly, so the performance
of the computation may be poor. On modern PCs, problem 2 is negligible, but
problem 1 is a huge problem. A common way to get rid of both problems is the
usage of logarithmic values. Instead of multiplying all small values of the language
model, we add the logarithms of each value. This is possible due to the logarithm
law logb (x · y ) = logb (x ) + logb ( y ). In the end, to obtain the final value, we could
raise the used base b to the power of the sum c, meaning bc . But this is not needed
since the optimization algorithm can also run on the logarithmic values. In CT2, the
cost values are normalized to double precision floating point values in the interval
of [ 0 : 10000000 ]. By doing so, the CT2 language models are comparable to each
other.
A final note on the data format of language models: During cryptanalysis, the
letters are mapped into an integer number space based on the used alphabet. For
example, with the 26-letter Latin alphabet, the letter “A” is represented by 0, the
letter “B” by 1, ..., and the letter “Z” is represented by 25. A language model is an
n-dimensional array. To look up, for example, the 3-gram “ABC,” which is encoded
as integers 0, 1, 2, we can just look up the language model array using the integers
as indices. Doing the encoding of letters this way is easy and fast.
The CT2 language model files have a specific binary file format:
Header:
"CTLM" 4 ASCII characters (magic number)
LanguageCode 0-terminated UTF-8 string (language code)
GramLength 4 byte integer (length of n-grams)
Alphabet 0-terminated UTF-8 string (alphabet)
Data:
(Alphabet.Length ^ GramLength) * 4 bytes (model data)

A language model file starts with the four ASCII characters “CTLM” (CrypTool
Language Model) to identify the file type. The “LanguageCode” string identifies the
language model. The “GramLength” defines the size of the n-gram model. The “Alpha-
bet” defines the used alphabet. In the data section, the actual language model data is
stored as 4-byte float values containing the logarithmic values computed using a text
corpus. The sizes of the n-gram models increase quickly with n, so the models are com-
pressed using the gzip algorithm. For the English language with 26 characters the file

4. See https://www.cryptool.org/en/cto/frequency-analysis.
5. See http://practicalcryptography.com/cryptanalysis/letter-frequencies-various-languages/.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 131 — #35

i i

3.6 Contextualization and Interpretation: Historical and Philological Analysis 131

sizes on disc are (rounded): 1-gram: 1 kB, 2-gram: 3 kB, 3-gram: 50 kB, 4-gram:
800 kB, 5-gram: 8500 kB. Decompressed in RAM (rounded on 1 kB): 1-gram: 1 kB,
2-gram: 3 kB, 3-gram: 71 kB, 4-gram: 1828 kB, 5-gram: 47526 kB. One observation
here is that the more data (texts) are used to create these language models, the smaller
the amount of file size reduction achieved by compressing the models. The reason for
this is that the increase in entropy (aka amount of information) of the data used leads
to lower compressibility.

3.6 Contextualization and Interpretation: Historical and

Philological Analysis

Once we have managed to reveal (parts of) the plaintext, we aim to set the manuscript
in a historical context to recover what was written, by whom, to whom, and why. Such
a contextualization concerns historical and philological interpretation, which will be
the topic of this section. These approaches involve a broader type of analysis than
cryptanalysis described above, because they do not primarily restitute the message,
but rather investigate the linguistic and historical context in which the message was
written, encrypted, and sent. Linguistic analysis involves the contextualization of the
given ciphertext into the contemporary language usage, which presupposes that we have
sufficient knowledge about how languages were used in the given time period and geo-
graphical area. Historical analyses do not only involves the identification of the sender
and receiver (and perhaps the code-breaker) of the ciphertext, and the political context,
but also the transfer of knowledge in the field of cryptology, as well as the social history
of those who applied this technology of secrecy.

3.6.1 Analysis of Historical Languages (Linguistic Analysis)

Historical languages pose some specific challenges to the cryptanalyst. One important
aspect is that most languages show a great deal of variation before they were standard-
ized sometime in the eighteenth century. This means, for instance, that one and the
same word could be written in many different ways (i.e., orthography was not normal-
ized and even the same scribe could use various spellings for the same word in one text
[37]). Moreover, in languages such as English, German, or Italian, we find a lot of dif-
ferent dialectal forms in the same language. Languages also change over time, certain
words or word forms disappear, new ones emerge. The pilot study [36] on the decipher-
ment of German and English historical homophonic substitution ciphertexts showed
that using 4-gram models derived from century-specific texts leads to significantly bet-
ter performance than language models built on more modern, contemporary texts for
ciphertexts produced in the 17th century or earlier. A corpus of historical texts such as
a digital library of online texts like the Project Gutenberg or the collection of histori-
cal texts with diplomatic transcriptions for 16 European languages available within the
HistCorp collection [38] can serve well as a basis for the creation of language models.
Another general aspect to bear in mind in the use of algorithms for cryptanalysis is
that in the plaintext alphabet a historical cipher is based on might differ from modern
alphabets in specific languages: In many cases, only one letter is used for both u and v,
for instance, and usually, letters with diacritics (such as ä, ö, ü in German; or á, é, í, ó´,
etc. in Hungarian) do not form part of plaintext alphabets. At the same time, plaintext
alphabets also might merge commonly co-occurring alphabet letters and treat these as
one plaintext element, such as ss or sch in the Copiale [6] cipher with German as its
plaintext language.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 132 — #36

i i

132 Historical Cryptology

References

[1] Friedman, W. F. D., and L. Callimahos, “Military Cryptanalytics, Part I,” National Security
Agency, United States Government, Washington, DC, 1959 (available through Aegean Park
Press, Laguna Hills, CA).
[2] Schmeh, K., Revisited: A Terminology for Codes and Nomenclators, 2018, https://
scienceblogs.de/klausis-krypto-kolumne/2018/10/07/revisited-a-terminology-for-codes-and
-nomenclators/.
[3] Mikhalev, V., et al., “What is the Code for the Code? Historical Cryptology Terminol-
ogy,” in Proceedings of the 6th International Conference on Historical Cryptology, 2023,
pp. 130–138, https://ecp.ep.liu.se/index.php/histocrypt/article/view/702.
[4] Aldarrab, N., Kevin Knight, and Beáta Megyesi, The Borg Cipher, https://cl.lingfil.uu.se
/∼bea/borg.
[5] Cipher ID-3816,reproduced image from the Swedish National Archive Riksarkivet 1637,
https://de-crypt.org/decrypt-web/RecordsView/189.
[6] Knight, K., B. Megyesi, and C. Schaefer, “The Copiale Cipher,” invited talk at ACL Work-
shop on Building and Using Comparable Corpora (BUCC), Association for Computational
Linguistics, 2011.
[7] Key ID-345, Reproduced image from the National Archives in Kew, State Papers.
TNA_SP106/2_ElizabethI_f58(0069). 1596. url: https://de-crypt.org/decrypt-web/
RecordsView/345.
[8] Key ID-633, Reproduced image from the National Archives in Hungary, G15
Caps. C. Fasc. 44. 01, DECODE ID 633, 1703–1711, https://de-crypt.org/decrypt-
web/RecordsView/633.
[9] Megyesi, B., et al. “Keys with Nomenclatures in the Early Modern Europe,” Cryptologia,
2022, doi: 10.1080/01611194.2022.2113185.
[10] Lasry, G., et al., “Deciphering ADFGVX Messages from the Eastern Front of World War I,”
Cryptologia, Vol. 41, No. 2, 2017, pp. 101–136.
[11] Lasry, G., N. Biermann, and S. Tomokiyo, “Deciphering Mary Stuart’s Lost Letters from
1578–1584,” Cryptologia, 2023, doi: 10.1080/01611194.2022.2160677.
[12] Megyesi, B., et al., “Decryption of Historical Manuscripts: The DECRYPT Project,”
Cryptologia, Vol. 44, No. 6, 2020, pp. 545–559, https://doi.org/10.1080/01611194.2020
.1716410.
[13] Megyesi, B., N. Blomqvist, and E. Pettersson, “The DECODE Database: Collection of
Ciphers and Keys,” in Proceedings of the 2nd International Conference on Historical
Cryptology, 2019.
[14] Szigeti, F., and M. Héder, “The TRANSCRIPT Tool for Historical Ciphers by
the DECRYPT Project,” in Proceedings of the 5th International Conference on
Historical Cryptology, 2022, pp. 208–211, https://ecp.ep.liu.se/index.php/histocrypt/
article/view/409/367.
[15] Kopal, N., and B. Esslinger, “New Ciphers and Cryptanalysis Components in CrypTool
2,” in Proceedings of the 5th International Conference on Historical Cryptology, 2022,
pp. 127–136.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 136 — #40

i i

136 Historical Cryptology

[16] Zandbergen, R., The Voynich Manuscript, http://www.voynich.nu/.

[17] Pelling, N., The Cipher Mysteries Blog, www.ciphermysteries.com.
[18] Schmeh, K., Cipherbrain, https://scienceblogs.de/klausis-krypto-kolumne/ (updates on this
website stopped end of 2022).
[19] Láng, B., Real Life Cryptology: Ciphers and Secrets in Early Modern Hungary, Amster-
dam: Atlantis Press, Amsterdam University Press, 2018.
[20] Tomokiyo, S., Cryptiana: Articles on Historical Cryptography, http://cryptiana.web
.fc2.com/code/crypto.htm.
[21] Antal, E., and P. Zajac, “HCPortal Overview,” in Proceedings of the 3rd International
Conference on Historical Cryptology, 2020, pp. 18–20, doi: 10.3384/ecp2020171003,
https://hcportal.eu.
[22] Megyesi, B., “Transcription of Historical Ciphers and Keys,” in Proceedings of the 3rd
International Conference on Historical Cryptology, 2020.
[23] Megyesi, B., and C. Tudor, Transcription of Historical Ciphers and Keys: Guidelines,
version 2.0, https://cl.lingfil.uu.se/∼bea/publ/transcription-guidelines-v2.pdf.
[24] Unicode, The Unicode® Standard Version 12.0–Core Specification, 2019, https://unicode
.org/standard/standard.html.
[25] Lasry, G., “Armand de Bourbon’s Poly-Homophonic Cipher–1649,” in Proceedings
of the 6th International Conference on Historical Cryptology, 2023, pp. 105–112,
https://ecp.ep.liu.se/index.php/histocrypt/article/view/699.
[26] Souibgui, M. A., et al. “DocEnTr: An End-to-End document Image Enhancement Trans-
former,” in 26th International Conference on Pattern Recognition (ICPR), 2022.
[27] Axler, G., and L. Wolf, “Toward a Dataset-Agnostic Word Segmentation Method,” in 25th
IEEE International Conference on Image Processing (ICIP), IEEE, 2018, pp. 2635–2639.
[28] Frinken, V., and H. Bunke, “Continuous Handwritten Script Recognition,” in Handbook
of Document Image Processing and Recognition (D. Doermann and K. Tombre, eds.),
Springer, 2014, pp. 391–425.
[29] Kang, L., et al., “Pay Attention to What You Read: Non-Recurrent Handwritten Text-Line
Recognition,” Pattern Recognition, Vol. 129, 2022, p. 108766.
[30] Bogacz, B., N. Howe, and H. Mara, “Segmentation Free Spotting of Cuneiform Using
Part Structured Models,” in 15th International Conference on Frontiers in Handwriting
Recognition (ICFHR), IEEE, 2016, pp. 301–306.
[31] Baró, A., et al., “Towards a Generic Unsupervised Method for Transcription of Encoded
Manuscripts,” in Proceedings of the 3rd International Conference on Digital Access to
Textual Cultural Heritage, 2019, pp. 73–78.
[32] Souibgui, M. A., et al., “Few Shots Are All You Need: A Progressive Learning Approach
for Low Resource Handwritten Text Recognition,” Pattern Recognition Letters, Vol. 160,
2022, pp. 43–49, https://doi.org/10.1016/j.patrec.2022.06.003.
[33] Souibgui, M. A., et al., “A User Perspective on HTR Methods for the Automatic Transcrip-
tion of Rare Scripts: The Case of Codex Runicus,” Journal on Computing and Cultural
Heritage, 2022.
[34] Kopal, N., “Solving Classical Ciphers with CrypTool 2,” in Proceedings of the 1st
International Conference on Historical Cryptology, 2018, pp. 29–38.
[35] Kopal, N., “Cryptanalysis of Homophonic Substitution Ciphers Using Simulated Anneal-
ing with Fixed Temperature,” in Proceedings of the 2nd International Conference on
Historical Cryptology, 2019, pp. 107–116.
[36] Megyesi, B., et al., “Historical Language Models in Cryptanalysis: Case Studies on English
and German,” in Proceedings of the 6th International Conference on Historical Cryptol-
ogy, 2023, pp. 120–129. url: https://ecp.ep.liu.se/index.php/histocrypt/article/view/701.
[37] Waldispühl, M., “Variation and Change,” in The Cambridge Handbook of Historical
Orthography (M. Condorelli and H. Rutkowska, eds.), Cambridge University Press, 2023,
pp. 245–264.

i i

Prime Numbers

This chapter introduces prime numbers (primes) and corresponding results from
number theory by asking and answering questions. It provides lots of examples
and is not as rigorous as mathematical textbooks usually are. At the end, you will
have a good understanding what primes are, what their distribution is like, and
why they are very useful in cryptography.

4.1 What Are Prime Numbers?

Prime numbers are integers greater than or equal to 2 that can only be divided by 1
and themselves. All other natural numbers that are not primes and greater or equal
to 4 are composite, they can be created by multiplying prime numbers.
The natural numbers N = {1, 2, 3, 4, . . .} thus comprise

• The number 1 (the unit value);

• The primes;
• The composite numbers.

Prime numbers are particularly important for three reasons:

• In number theory, they are considered to be the basic components of natural

numbers, upon which numerous brilliant mathematical ideas are based.
• They are of extreme practical importance in modern cryptography (public-
key cryptography). The most common public-key procedure, invented at
the end of the 1970s, is the RSA encryption. Using large prime numbers is
required for particular parameters to guarantee that the RSA procedure is
secure, and also further modern procedures (e.g., elliptic curves).
• The search for the largest known prime numbers does not have any practical
usage known to date, but it is an excellent benchmark (e.g., for the possibility
of determining the performance of computers) [1].

Many people have been fascinated by prime numbers over the past two millen-
nia. Ambition to make new discoveries about prime numbers has often resulted in
brilliant ideas and conclusions. The following section provides an easily compre-
hensible introduction to the basics of prime numbers. We will also explain what
is known about the distribution of prime numbers (i.e., density, number of prime
numbers; intervals in particular intervals), and how prime number tests work.

139

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 140 — #2

i i

140 Prime Numbers

4.2 Prime Numbers in Mathematics

Each integer number has a factor. Only the number 1 has one factor, itself, whereas
the number 12 has the six factors 1, 2, 3, 4, 6, 12. Many numbers can only be divided
by themselves and by 1. With respect to multiplication, these are the atoms in the
area of numbers. Such numbers are called prime numbers.
In mathematics, a slightly different (but equivalent) definition is used.

Definition 4.1 An integer p ∈ N is called prime if p > 1 and p only possesses the
trivial factors ±1 and ± p.

By definition, the number 1 is not a prime number. In the following sections, p

will always denote a prime number (notation).
The sequence of prime numbers starts with:

2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, 53, 59, 61, 67, 71,
73, 79, 83, 89, 97, . . .

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 142 — #4

i i

142 Prime Numbers

Figure 4.3 Primes within the first 40,000 integers in a (200 · 200) rectangle as Ulam spiral.

Theorem 4.2 Each element n of the natural numbers greater than 1 can be written
as the product n = p1 · p2 . . . pm of prime numbers ( p1 , p2 , . . . , pm are called the
prime factors of n). If two such factorizations

n = p1 · p2 · · · pm = p10 · p20 · · · pm
0
0

are given, then they can be reordered such that m = m 0 and for all i: pi = pi0 .
In other words, each natural number other than 1 can be written as a product
of prime numbers in precisely one way, if we ignore the order of the factors. The
factors are therefore unique (or you can say: the expression as a product of factors
is unique). For example,

60 = 2 · 2 · 3 · 5 = 22 · 31 · 51 .

And this—other than changing the order of the factors—is the only way in which
the number 60 can be factorized. If you allow numbers other than primes as factors,
there are several ways of factorizing integers and the uniqueness is lost:

60 = 1 · 60 = 2 · 30 = 4 · 15 = 5 · 12 = 6 · 10 = 2 · 3 · 10 = 2 · 5 · 6 = 3 · 4 · 5 = · · · .

This paragraph is for those familiar with mathematical logic: The first funda-
mental theorem may appear to be obvious, but we can construct numerous other
sets of numbers (i.e., other than positive integers greater than 1), for which num-
bers in the set cannot be expressed uniquely as a product of the prime numbers of
the set: In the set M = {1, 5, 10, 15, 20, . . .} there is no equivalent to the fundamen-
tal theorem under multiplication. The first five prime numbers of this sequence are

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 143 — #5

i i

4.3 How Many Prime Numbers Are There? 143

5, 10, 15, 20, 30 (note: 10 is prime, because 5 is not a factor of 10 in this set—the
result is not an element of the given basic set M). As the following applies in M:

100 = 5 · 20 = 10 · 10

and 5, 10, 20 are all prime numbers in this set, the expression as a product of prime
factors is not unique here.

4.3 How Many Prime Numbers Are There?

For the natural numbers, the primes can be compared to elements in chemistry
or the elementary particles in physics (see [2, p. 22]) as their building blocks.
Although there are only 92 natural chemical elements, the number of prime numbers
is unlimited.
Even the Greeks knew this in the third century B.C.; the theorem of the infinite-
ness of the primes had already been distinguished and proven in Euclid’s Elements
(Book IX, theorem 20). Euclid was a Greek mathematician in fourth and third
century B.C. who worked at the Egyptian academy of Alexandria and wrote The
Elements, the most well known systematic textbook of the Greek mathematics.
The following theorem of Euclid does not denote Euclid as the inventor of
the theorem; rather the true inventor is not known. The phraseology in the Greek
original is remarkable due to the fact that the word infinite is not used. The text
reads as follows:

O ί π%ω̃τ oι ὰ%ιϑµoὶ πλε ίoυς εὶσ ὶ π αντ òς τ oυ̃ π %oτ εϑ έντ oς π λήϑ oυς
π%ώτ ων ὰ%ιϑµω̃ν

The English translation is: The prime numbers are more than any previously existing
amount of prime numbers.
Or in a less literal translation:
Theorem 4.3 (Euclid). The sequence of prime numbers does not discontinue.
Therefore, the quantity of prime numbers is infinite.
His proof that there is an infinite number of primes is still considered to be
a brilliant mathematical consideration and conclusion today (proof by contradic-
tion). He assumed that there is only a finite number of primes and therefore there
exists a largest prime number. Based on this assumption, he drew logical conclu-
sions until he obtained an obvious contradiction. This meant that something must
be wrong. As there were no mistakes in the chain of conclusions, it could only be the
assumption that was wrong. Therefore, there must be an infinite number of primes!

Table 4.1 The 20+ Largest Known Primes and Their Particular Number Types∗
Definition Decimal Digits Year Description
1 282589933 − 1 24862048 2018 Mersenne, 51st known
2 277232917 − 1 23249425 2018 Mersenne, 50th known
3 274207281 − 1 22338618 2016 Mersenne, 49th known
4 257885161 − 1 17425170 2013 Mersenne, M-48
5 243112609 − 1 12978189 2008 Mersenne, M-47
6 242643801 − 1 12837064 2009 Mersenne, M-46
7 237156667 − 1 11185272 2008 Mersenne, M-45
8 232582657 − 1 9808358 2006 Mersenne, M-44
9 10223 · 231172165 + 1 9383761 2016 Generalized Mersenne
10 230402457 − 1 9152052 2005 Mersenne, M-43
11 225964951 − 1 7816230 2005 Mersenne, M-42
12 224036583 − 1 7235733 2004 Mersenne, M-41
13 202705 · 221320516 + 1 6418121 2021 Generalized Mersenne
14 220996011 − 1 6320430 2003 Mersenne, M-40
15 10590941048576 + 1 6317602 2018 Generalized Fermat1
16 9194441048576 + 1 6253210 2017 Generalized Fermat
17 168451 · 219375200 + 1 5832522 2017 Generalized Mersenne
18 3 · 218924988 + 1 5696990 2022 Generalized Mersenne
19 69 · 218831865 − 1 5668959 2021 Generalized Mersenne
20 7 · 218233956 + 1 5488969 2020 Generalized Mersenne2
…
42 213466917 − 1 4053946 2001 Mersenne, M-39
…
46 19249 · 213018586 + 1 3918990 2007 Generalized Mersenne
…
152 26972593 − 1 2098960 1999 Mersenne, M-38
…
2115 1372930131072 + 1 804474 2003 Generalized Fermat3
…
2152 342673 · 22639439 − 1 794556 2007 Generalized Mersenne
∗ As of April 2022.
1 Generalized Fermat number: 10590941048576 + 1 = 1059094(220 ) + 1.
2 Generalized Mersenne number: As 18233956 is no power of 2 it is no Fermat number.
3 Generalized Fermat number: 1372930131072 + 1 = 1372930(217 ) + 1.

Written out in binary form, a Mersenne number consists only of 1s. M (2) = 3,
or in binary digits 11. M (3) = 7, or 111. M (4) = 15, or 1111. M (5) = 31, or
11111.
Almost all of the biggest known prime numbers are special candidates of the
form 2 p − 1, where the exponent p is a prime. Not all Mersenne numbers of this
form are prime:

M (2) : 22 − 1 = 3 ⇒ prime
M (3) : 23 − 1 = 7 ⇒ prime
M (5) : 25 − 1 = 31 ⇒ prime
M (7) : 27 − 1 = 127 ⇒ prime
M (11) : 211 − 1 = 2047 = 23 · 89 ⇒ NOT prime !

Even Mersenne knew already that not all Mersenne numbers are prime (see
exponent p = 11). A prime Mersenne number is called Mersenne prime number.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 146 — #8

i i

146 Prime Numbers

Figure 4.4 Number of digits of largest known prime by year (as of April 2022) (own plot, thanks
to data from Chris Caldwell [3]).

However, it is with thanks to Mersenne for the interesting conclusion that a

number of the form 2n − 1 cannot be a prime number if n is a composite number:
Theorem 4.4 (Mersenne) If 2n − 1 is a prime number, then n is also a prime number
(or to put it another way: 2n − 1 is prime, only if n is prime).
Proof
The theorem of Mersenne can be proved by contradiction. We therefore assume
that there exists a composite natural number n (with real factorization) n = n 1 · n 2 ,
with the property that 2n − 1 is a prime number.
Abbreviated, the theorem is:

[ M (n ) is prime ⇒ n is prime ]

Consequently, our assumption for the proof by contradiction then is:

[ n is composite and M (n ) is a prime number ]

From

(x r − 1)((x r )s−1 + (x r )s−2 + · · · + x r + 1) = ((x r )s + (x r )s−1 + (x r )s−2 + · · · + x r )

− ((x r )s−1 + (x r )s−2 + · · · + x r + 1)
= (x r )s − 1 = x r s − 1,

we conclude

2n 1 n 2 − 1 = (2n 1 − 1)((2n 1 )n 2 −1 + (2n 1 )n 2 −2 + · · · + 2n 1 + 1).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 147 — #9

i i

4.4 The Search for Extremely Large Primes 147

Because 2n − 1 is a prime number, one of the above two factors on the right-
hand side must be equal to 1. This is the case if and only if n 1 = 1 or n 2 = 1.
But this contradicts our assumption. Therefore, the assumption is false. This means
that there is no composite number n, such that 2n − 1 is a prime.
Notes:
The following two statements are equivalent because from A ⇒ B follows ¬B ⇒
¬A:

[ M (n ) is a prime number ⇒ n is prime] ≡ [n is composite

⇒ M (n ) is not a prime number]

Unfortunately, Theorem 4.4 only applies in one direction (the inverse statement
does not apply, no equivalence): That means that there are prime exponents for
which the Mersenne number is not prime (see the above example 211 − 1, where 11
is prime, but 211 − 1 is not).
Mersenne claimed that 267 − 1 is a prime number. There is also a mathematical
history behind this claim: It took over 200 years before Edouard Lucas (1842–1891)
proved that this number is composite. However, he argued indirectly and did not
name any of the factors. In 1903, Frank Nelson Cole showed which factors make
up this composite number:

267 − 1 = 147573952589676412927 = 193707721 · 761838257287

Cole admitted to having worked for 20 years on the factorization (dissection as a

product of prime factors)2 of this 21-digit decimal number.
Due to the fact that the exponents of the Mersenne numbers do not use all
natural numbers, but only the primes, the experimental space is considerably lim-
ited. The exponents of the currently known 51 Mersenne prime numbers are listed
in Table 4.2.
The 19th number with the exponent 4253 was the first with at least 1000 digits
in decimal system (the mathematician Samual Yates coined the expression titanic
prime for this; it was discovered by Hurwitz in 1961); the 27th number with the
exponent 44497 was the first with at least 10000 digits in the decimal system—
Yates coined the expression gigantic prime for this. Today, these expressions are
long outdated.
For the first 48 Mersenne prime numbers we know that this list is complete.
The exponents up to the 51st Mersenne prime number have not yet been checked
completely [4]. See Section 4.5 for hints on checking the primality of a number.
2. Factorization algorithms can be found in CTO, CT2, and CT1.
- In CTO in the plugin “Msieve Factorizer”: https://www.cryptool.org/en/cto/msieve.
- In CT2 Startcenter F Templates F Mathematics F Factorization with Quadratic Sieve (QS).
- Using CT1 Indiv. Procedures F RSA Cryptosystem F Factorization of a Number you can fac-
torize numbers. With the quadratic sieve (QS), CT1 factorizes numbers up to 250 bit in a reasonable time
(on a single PC).
- The current factorization records are listed in Section 5.12.4. Single factoring algorithms like Bill
Hart’s quadratic sieve and Paul Zimmermann’s GMP-ECM are also available in SageMath: see https://
doc.sagemath.org / html /en/thematic_tutorials/explicit_methods_in_number_theory/integer_
factorization.html.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 148 — #10

i i

148 Prime Numbers

Table 4.2 Exponents of Currently Known Mersenne Prime Numbers

1: 2 14: 607 27: 44497 40: 20996011
2: 3 15: 1279 28: 86243 41: 24036583
3: 5 16: 2203 29: 110503 42: 25964951
4: 7 17: 2281 30: 132049 43: 30402457
5: 13 18: 3217 31: 216091 44: 32582657
6: 17 19: 4253 32: 756839 45: 37156667
7: 19 20: 4423 33: 859433 46: 42643801
8: 31 21: 9689 34: 1257787 47: 43112609
9: 61 22: 9941 35: 1398269 48: 57885161
10: 89 23: 11213 36: 2976221 49: 74207281
11: 107 24: 19937 37: 3021377 50: 77232917
12: 127 25: 21701 38: 6972593 51: 82589933
13: 521 26: 23207 39: 13466917

As of May 1, 2023 all prime exponents smaller than 63,589,987 have been
tested and double-checked. So we can be certain that M-48 is really the 48th
Mersenne prime number, and that there are no smaller undiscovered Mersenne
primes (it is common not to use the notation M-nn until it is proven that the nn-th
“known” Mersenne prime is really the nn-th Mersenne prime).
Here are some examples in more detail:

M-37 – January 1998

The 37th Mersenne prime, called M-37,

23021377 − 1

has 909,526 digits in the decimal system, which are equivalent to 33 pages of a
newspaper.

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 150 — #12

i i

150 Prime Numbers

4.4.3 Challenge of the Electronic Frontier Foundation

This search is also spurred on by a competition started by the nonprofit organi-
zation EFF using the means of an unknown donor. The participants are rewarded
with a total of $500,000 USD if they find the longest prime number. In promot-
ing this project, the unknown donor is not looking for the quickest computer, but
rather wants to draw people’s attention to the opportunities offered by cooperative
networking: https://www.eff.org/awards/coop.
The discoverer of M-38 received $50,000 USD from the EFF for discovering
the first prime with more than 1 million decimal digits.
The next prize of $100,000 USD for a proven prime with more than 10 million
decimal digits was awarded to Edson Smith, who found the number 243112609 − 1
within the GIMPS project.
According to the EFF rules for their prizes, in the next stage $150,000 USD is
being offered for a proven prime with more than 100 million decimal digits.
Edouard Lucas (1842–1891) held the record for the longest prime number for
over 70 years by proving that 2127 − 1 is prime. No new record is likely to last that
long.

4.5 Prime Number Tests3

In order to implement secure encryption procedures we need extremely large prime

numbers (but still much smaller than the prime records). These numbers in the
region of 22048 have more than 600 digits in the decimal system.
In order to do this, random numbers are considered and then examined whether
they are prime or not. If even the smallest prime factor is huge, it would take far too
long to factor the candidates. Factorizing numbers using systematic division (brute-
force) or the Eratosthenes’ sieve can be used with today’s computers for numbers
with up to about 20 digits in the decimal system. The largest number that has so
far been factorized into its two approximately equal prime factors in a multimonth
undertaking with sophisticated methods has 250 decimal digits (see Section 5.12.4).
Instead of factoring, there are very fast, heuristic methods that can be used
to test the primality of a number. Such fast algorithms that can very reliably state
whether a number is prime or not are the Fermat primality test, Lucas test, Solovay-
Strassen test, Miller-Rabin test, and Baillie-PSW test. Because these algorithms are
imperfect (and probabilistic), tests are also passed by a few numbers that are not
prime, called pseudoprimes.

3. - Using CT1 Indiv. Procedures F RSA Cryptosystem F Prime Number Test the following tests can be
performed: Miller-Rabin, Fermat, Solovay-Strassen und AKS. The first three are probabilistic tests.
- With the educational tool for number theory NT you can apply the tests of Fermat and of Miller-Rabin: See
in there the NT learning units 3.2 and 3.3, pages 3-11/11. NT can be called via CT1 Indiv. Procedures
F Number Theory Interactive F Learning Tool for Number Theory.
- Using CT2 Templates F Mathematics F Primes Test a brute-force test with small factors and the
Miller-Rabin test is performed.
- In CT2 Crypto Tutorials F World of Primes F Primality test the following methods are visu-
alized and their single steps can be reproduced: Sieve of Eratosthenes, Miller-Rabin test and Sieve of
Atkin.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 151 — #13

i i

4.5 Prime Number Tests 151

We will not go into the various test methods here, but show the most important
property of prime numbers and which numbers are false witnesses in this context
(i.e., they fulfill the primality property even though they are composite). Primality
tests only answer the primality property of a number, but cannot determine the
prime factors of composite numbers. Small implementations for this can also be
found in the supplied Python and SageMath programs.
Obviously, determining primality cannot be harder than factoring: If we know
how to factor, we have a test for primality. But it is surprising and fundamental for
some asymmetric algorithms that primality testing is so much easier than factoring.

4.5.1 Special Properties of Primes for Tests

This section is a good example of applying mathematical logic.
Fermat put forward in 1640 an important theorem: Many rapid prime number
tests are based on the (little) Fermat theorem (see also Section 5.8.3).
Theorem 4.5 (“little” Fermat). Let p be a prime number and a be any integer, then
for all a
a p ≡ a mod p

This could also be formulated as follows:

Let p be a prime number and a be any integer that is not a multiple of p (so a 6≡
0 mod p or gcd (a, p) 6= p), then a p−1 ≡ 1 mod p.
Since numbers that satisfy this congruence are rare, much rarer than
prime numbers, and because modular exponentiation (i.e., the computation of
a n−1 mod n), is efficient even for relatively large numbers, Fermat’s little theorem
is in principle suitable as a primality test. This congruence (satisfying an equation
modulo a number) can be checked much faster than doing factorization.
Unfortunately, the converse to Fermat’s theorem does not hold—otherwise we
would have a simple proof of the prime number property (or to put it in other
words, we would have a simple prime number criterion).
If you are not used to calculating with remainders (modulo), please simply
accept the theorem or first read Chapter 5 “Introduction to Elementary Number
Theory with Examples.” It is important here to realize this theorem implies that if
this equation is not met for any integer a, then p is not a prime. The tests (e.g., for
the first formulation) can easily be performed using the test basis a = 2.
This gives us a criterion for nonprime numbers; that is, a negative test (criterion
for exclusion), but no proof that the number p is prime:

[a p 6≡ a mod p ⇒ p not prime]

or
[(a not divisible by p ∧ a p−1 6≡ 1 mod p) ⇒ p not prime]

If an a exists where the congruence from Theorem 4.5 is not met, we say a
is a “Fermat witness” to the composite nature of p. So witnesses can very quickly
inform us that a number p is not prime.
One can also say that Fermat’s prime property is a necessary but not a sufficient
condition for the number p to be prime.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 152 — #14

i i

152 Prime Numbers

4.5.2 Pseudoprime Numbers

A pseudoprime is a probable prime (PRP). So a PRP is an integer that shares a
property common to all prime numbers, but is actually not prime. Pseudoprimes
are classified according to which property of primes they satisfy. They are called
pseudoprime in reference to this property.
The following three types of numbers successfully pass such negative tests even
though they are not prime numbers:

1. Fermat pseudoprime numbers. Numbers n that have the property

2n ≡ 2 mod n

but are not prime are called pseudoprime numbers for the basis 2 (i.e., the exponent
and the modulus n are not prime). The first pseudoprime number for the basis 2 is

341 = 11 · 31

In SageMath4 you can easily verify that 341 satisfies the negative test 2341 ≡ 2
mod 341, which is derived from Theorem 4.5:

SageMath Example 4.1: Find on the Sage Command Line Nonprimes That
Pass the Fermat Test
# Calculate pseudoprimes for base a = 2 (which pass the Fermat �
� primality test)
# Note: 2.powermod(n,n) is circa 3 times faster than power_mod(2,n,n) �
� cause usage of gmp
sage: a=2; count=0
....: for n in range(1,4000):
....: if n not in Primes () and a.powermod(n,n) == 2:
....: print ("%
....: print ("How many numbers found with this property :", count)
....:
2^n mod n == n for n = 341
2^n mod n == n for n = 561
2^n mod n == n for n = 645
2^n mod n == n for n = 1105
2^n mod n == n for n = 1387
2^n mod n == n for n = 1729
2^n mod n == n for n = 1905
2^n mod n == n for n = 2047
2^n mod n == n for n = 2465
2^n mod n == n for n = 2701
2^n mod n == n for n = 2821
2^n mod n == n for n = 3277
How many numbers found with this property: 12

There are infinitely many Fermat pseudoprimes for each basis.

4. SageMath is a free computer-algebra system (CAS). See the introduction at https://www.cryptool

.org/en/documentation/ctbook/.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 153 — #15

i i

4.5 Prime Number Tests 153

2. Carmichael Numbers. There are pseudoprime numbers n that pass the Fermat
test
a n−1 ≡ 1 mod n

with all bases a that are relatively prime to n [gcd (a, n ) = 1], even though the
numbers n tested are not prime. These numbers are called Carmichael numbers.
So the set of powers to be tested is restricted to those where a and n are relatively
prime. For an n now it is not enough to test just an arbitrarily chosen a, but all a < n
that are coprime to n. The first of these is

561 = 3 · 11 · 17

Example: The number to be tested is n = 561.

Because 561 = 3 · 11 · 17, the test condition is a 560 mod 561 = 1. This congru-
ence
- Is satisfied for a = 2, 4, 5, 7, 8, 10, · · · ,
- But not for a = 3, 6, 9, 11, 12, 15, 17, 18, 21, 22, · · · .
The test condition does “not” have to be fulfilled either for multiples of the
prime factors 3, 11, or 17: For instance, the test applied for
- a = 3 results in: 3560 mod 561 = 375,
- a = 5 results in: 5560 mod 561 = 1.

SageMath Example 4.2 can find all Carmichael numbers up to a specified

limit. There are very fast methods to construct single large Carmichael numbers,
but the methods we know that generate the list completely, are relatively slow. Sage-
Math Example 4.3 (chap04_sample080.sage) is significantly faster than SageMath
Example 4.2 (chap04_sample070.sage), but still very slow with larger numbers.

SageMath Example 4.2: Find Carmichael Numbers up to n = 10000 (Plus

Some More Information)
print ("\n# CHAP04 -- Sage -Script -SAMPLE 070: =========")

# Calculate the Carmichael numbers n (composite numbers with a prime property) up to upper=1000 �
�0
# Like primes they fulfill the Fermat congruence a^n = a mod n,
# plus: for them the congruence is valid for all a with gcd(a,n)=1
# Remarks:
# - With "for a in range(2, 4000):" a's type is <class 'int '>. Using then
# "a.powermod(n,n)" causes AttributeError: 'int ' object has no attribute 'powermod '
# Therefore , the range over Sage integers (class Integer) is used.
# - It 's good enough to test "a" up to the given "n" (must not always go to "upper ")
# - This is a very straightforward implementation. It could be made quicker :-)

verbose = False # if True the coprime bases up to n are listed too

count1=0; count2=0;
upper=10000 # 20,4000,341,561 (up to 4000 are 3449 nonprimes; up to 10000: 8770)
Carm_list =[]
for n in range(3, upper+1, 2): # a Carmichael number is an odd composite number.
if n not in Primes ():
count2 += 1 # counts how many non -primes n will be considered
count_gcd = 0; count_pmod = 0
a_list =[]
for a in [2..n]: # for a in [2.. upper ]: # for a in [2..7]:
if gcd(a,n) == 1:
count_gcd += 1
if a.powermod(n,n) == a:
count_pmod += 1

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 154 — #16

i i

154 Prime Numbers

SageMath Example 4.2 (continued)

a_list.append(a)
if count_gcd >0 and len(a_list)== count_gcd and count_gcd == count_pmod:
if verbose:
print ("a^n mod n == n for n = %d and all %d a (%s) with gcd(a,n)==1" % (n, count_gcd , �
� a_list));
else:
print ("a^n mod n == n for n = %d and all %d bases a with gcd(a,n)==1" % (n, count_gcd)) �
�;
count1 += 1
Carm_list.append(n)
print ("How many numbers found with this property :", count1, "[how many nonprimes n considered: �
� %d]" % count2)
print (" List of Carmichael numbers found up to %d: %s" % (upper , Carm_list))

SageMath Example 4.3: Find All Carmichael Numbers up to n = 100,000

print ("\n# CHAP04 -- Sage -Script -SAMPLE 080: =========")

# Calculate the Carmichael numbers n (composite numbers with a prime property)

# Remarks:
# - This script (sample 08) is much faster , but less explicit than sample 07.
# - Slightly modified [in Primes ()" instead of "is_prime(n)"] the pure Python 3 script from
# https :// stackoverflow.com/questions/58944035/printing -carmichael -numbers -in -a-given -limit

from math import gcd

upper = 100_000 # 30000

def is_carmichael(n): # now expects only odd numbers >= 3 !

if n in Primes ():
return False
for a in range(3, n, 2): # Why not start with a=2 and why handle only odd bases?
if gcd(a, n) == 1:
if pow(a, n-1, n) != 1:
return False
return True

def print_carmichael(maximum):
for n in range(3, maximum+1, 2): # consider only odd numbers >= 3
if is_carmichael(n):
print(n)

print_carmichael(upper)

For some composite numbers it’s hard to find a witness with the Fermat test.
With Carmichael numbers the Fermat test actually always fails. They are “liars”
for all coprime bases.
The first 16 Carmichael numbers are 561, 1105, 1729, 2465, 2821, 6601,
8911, 10585, 15841, 29341, 41041, 46657, 52633, 62745, 63973, and 75361.
They start to become increasingly rare after that.
There are 2,0138,200 Carmichael numbers between 1 and 1021 . This is
approximately one in 50 trillion (50 · 1012 ) numbers [5].
The largest known Carmichael number has almost 300 · 109 digits [6]. This
number is much greater than the largest known prime (see Table 4.1).
In 1992, Carl Pomerance proved that there are infinitely many Carmichael
numbers. Carmichael numbers have at least three prime factors, none of which are
duplicates. So they are square free.
The Carmichael numbers are sequence A002997 at OEIS. This list contains
all Carmichael numbers up to 1, 713, 045, 574, 801 ≈ 1.7 · 1012 (these are the first
10,000 ones).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 155 — #17

i i

4.6 Special Types of Numbers and the Search for a Formula for Primes 155

Feitsma/Galway prepared a very good website (http://www.cecm.sfu.ca/Pseudo

primes/index-2-to-64.html) that contains gzipped lists of the Carmichael numbers
and their factors up to 264 = 1.84 · 1019 .

3. Strong pseudoprime numbers. A stronger test is provided by Miller/Rabin [7]:

It is only passed by prime numbers and strong pseudoprime numbers.
Let n ∈ N be of the form n = 1 + 2s · m for an odd natural number m and
s ∈ N. Then n is called a strong pseudoprime with base b if n itself is not a prime
number and

either bm ≡ 1 mod n
i
or ∃ i ∈ {0, 1, . . . , s − 1} : b2 m ≡ −1 mod n

holds. A number n is called a strong pseudoprime if a base b ≥ 2 exists, so that n is

a strong pseudoprime with base b.
Again, these strong pseudoprime numbers are not primes, but they are rare as
compared to simple pseudoprime numbers or to Carmichael numbers. The smallest
strong pseudoprime number base 2 is

15841 = 7 · 31 · 73

If you test all four bases, 2, 3, 5, and 7, you will find only one strong pseudoprime
number up to 25 · 109 (i.e., a number that passes the test and yet is not a prime
number).
More extensive mathematics behind the Rabin test delivers the probability that
the number examined is nonprime (such probabilities are currently around 10−60 ).
Detailed descriptions of tests for finding out whether a number is prime can be
found at [8, 9].

4.6 Special Types of Numbers and the Search for a Formula

for Primes
There are currently no useful, open, nonrecursive polynomial-like formulae known
that only deliver prime numbers (recursive means that in order to calculate the
function the same function is used with a smaller variable). Mathematicians would
be happy if they could find a formula that leaves gaps (i.e., does not deliver all
prime numbers) but does not deliver any composite (nonprime) numbers.
Ideally, we would like for the number n to immediately be able to obtain the
nth prime number; that is, for f (8) = 19 or for f (52) = 239.
Ideas for that are seriously discussed in [10].
Table 4.12 in Section 4.11 contains the precise values for the nth prime numbers
for selected n.
For prime number formulae usually very special types of numbers are used. The
following enumeration contains the most common ideas for prime number formulae
and what our current knowledge is about very big elements of the number series:
Is their primality proven? If they are compound numbers, could their prime factors
be determined?

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 156 — #18

i i

156 Prime Numbers

4.6.1 Mersenne Numbers f (n) = 2n − 1 for n Prime

As seen in Section 4.4.2, this formula yields a relatively large number of large prime
numbers, but—like for n = 11 with f (11) = 211 − 1 = 2047—the result is not
always prime even for prime exponents. Currently (as of May 2023) we know all
the Mersenne prime numbers up to 17,000,000 decimal digits (M-48).

4.6.2 Generalized Mersenne Numbers f (k, n) = k · 2n ± 1 for n Prime and k

Small Prime/Proth Numbers
This first generalization of the Mersenne numbers creates the Proth numbers. There
are (for small k) extremely quick prime number tests (see [11]). These can be
performed in practice using software such as Proth 20 [12].

4.6.3 Generalized Mersenne Numbers f (b, n) = bn ± 1 / The Cunningham

Project
This is another possible generalization of the Mersenne numbers. The Cunningham
Project determines the factors of all composite numbers that are formed as follows:

f (b, n ) = bn ± 1 for b = 2, 3, 5, 6, 7, 10, 11, 12

(b is not equal to powers of bases already used, such as 4, 8, 9).

Details of this can be found at [13].

n
4.6.4 Fermat Numbers Fn = f (n) = 22 + 1
In the seventeenth century, Pierre de Fermat wrote to Mersenne that he presumed
that all numbers of the form
n
f (n ) = 22 + 1
are prime for all integers n ≥ 0.
The first 5 numbers F0 = 3, F1 = 5, F2 = 17, F3 = 257, F4 = 65537 are
all prime (see Table 4.4). As mentioned, Fermat wrote to Mersenne regarding his
assumption that all numbers of this type are primes. This assumption was already
disproved by Euler in 1732. The prime number 641 divides F5 = f (5).
And as early as in the 19th century, it was discovered that the 39-digit number
7
f (7) = 22 + 1 (27 = 128)

is not prime. However, it was not until 1970 that Morrison/Billhart managed to
factorize it.

f (7) = 340282366920938463463374607431768211457
= 59649589127497217 · 5704689200685129054721

The project Distributed Search for Fermat Number Dividers [14] finds both new
compound Fermat numbers and new monster primes.
Example: On February 22, 2003, John Cosgrave discovered:

• The largest composite Fermat number up to that time;

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 157 — #19

i i

4.6 Special Types of Numbers and the Search for a Formula for Primes 157

Table 4.4 List of the First Eight Fermat Numbers and Their Factorization
f(n) Value Prime?
0
f (0) = 22 + 1 = 21 + 1 =3 Prime
1
f (1) = 22 + 1 = 22 + 1 =5 Prime
2
f (2) = 22 + 1 = 24 + 1 = 17 Prime
3
f (3) = 22 + 1 = 28 + 1 = 257 Prime
4
f (4) = 22 + 1 = 216 + 1 = 65537 Prime
5
f (5) = 22 + 1 = 232 + 1 = 4294967297 = 641 · 6700417 Not prime !
6
f (6) = 22 + 1 = 264 + 1 = 18446744073709551617
= 274177 · 67280421310721 Not prime !
7
f (7) = 22 + 1 = 2128 + 1 = (see Section 4.6.4) Not prime !

• The largest prime nonsimple Mersenne number so far with 645,817 decimal
digits.

This Fermat number

2145351 )
f (2145351) = 2(2 +1

is divisible by the prime

p = 3 · 22145353 + 1

At that time this prime p was the largest known prime generalized Mersenne
number and the fifth largest known prime number of all.
f (18233954) is the biggest Fermat number of which a factor is known (as of
July 2023).
It is assumed that f (4) = 65, 537 is the last (and thus also the largest) Fermat
prime.

n
4.6.5 Generalized Fermat Numbers f (b, n) = b2 + 1
With generalized Fermat numbers, the base of the power is no longer restricted to
2. Generalized Fermat numbers are more numerous than Mersenne numbers of an
equal size and many of them are waiting to be discovered to fill the big gaps between
the Mersenne primes already found or still undiscovered. Progress in number theory
made it possible that numbers, where the representation is not limited to the base
2, can be tested at almost the same speed as Mersenne numbers.
The program Proth.exe was widely used to investigate generalized Fermat num-
bers. Proth.exe was created by Yves Gallot in 1998 as a single-threaded CPU
program that found many prime number records more than 20 years ago. The
successor genefer is a highly optimized GPU application, created in 2022 [12].
Using the original program, on February 16, 2003, Michael Angel discovered
the largest of them with 628,808 digits, which at that time became the fifth largest
known prime number:
17
b2 + 1 = 62722131072 + 1

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 158 — #20

i i

158 Prime Numbers

4.6.6 Idea Based on Euclid’s Proof: p1 · p2 · . . . · pn + 1

This idea is based on Euclid’s proof (see Section 4.3) that there are infinite prime
numbers:
2·3 + 1 =7 7→ Prime
2·3·5 + 1 = 31 7→ Prime
2·3·5·7 + 1 = 211 7→ Prime
2·3· . . . ·11 + 1 = 2311 7→ Prime
2 · 3 · . . . · 13 + 1 = 59 · 509 7→ Not prime!
2 · 3 · . . . · 17 + 1 = 19 · 97 · 277 7→ Not prime!

4.6.7 As Above but −1 except +1: p1 · p2 · . . . · pn − 1

2·3−1 =5 7→ Prime
2·3·5−1 = 29 7 → Prime
2 · 3 · ... · 7 − 1 = 11 · 19 7 → Not prime!
2 · 3 · . . . · 11 − 1 = 2309 7 → Prime
2 · 3 · . . . · 13 − 1 = 30029 7 → Prime
2 · 3 · . . . · 17 − 1 = 61 · 8369 7 → Not prime!

4.6.8 Euclid Numbers en = e0 · e1 · . . . · en−1 + 1 with n ≥ 1 and e0 := 1

The number en−1 is not the (n − 1)th prime number, but the number previously
found here. Unfortunately this formula is not open but recursive. The sequence
starts with

e1 =1+1 =2 7→ Prime
e2 = e1 + 1 =3 7 → Prime
e3 = e1 · e2 + 1 =7 7 → Prime
e4 = e1 · e2 · e3 + 1 = 43 7 → Prime
e5 = e1 · . . . · e4 + 1 = 13 · 139 7 → Not prime!
e6 = e1 · . . . · e5 + 1 = 3263443 7 → Prime
e7 = e1 · . . . · e6 + 1 = 547 · 607 · 1033 · 31051 7 → Not prime!
e8 = e1 · . . . · e7 + 1 = 29881 · 67003 · 9119521 · 6212157481 7 → Not prime!

Also, e9 , . . . , e17 are composite, which means that this formula is not particu-
larly useful.
Comment:
However, it is very particular that no pair of any of these numbers does have a
common factor other than 1. Therefore, they are relatively prime.
SageMath Example 4.4 calculates the Euclid numbers with +1 and −1.

SageMath Example 4.4: Euclid Numbers with −1

print ("\n# CHAP04 -- Sage -Script -SAMPLE 015: =========")
print ("# Euclid numbers with +1 (and a variant with -1)")

def euclidnumbers(beg , end , variant , startProd):

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 159 — #21

i i

4.6 Special Types of Numbers and the Search for a Formula for Primes 159

SageMath Example 4.4 (continued)

EProdn = startProd
for n in (beg..end):
En = EProdn + variant
EProdn = EProdn*En
B = is_prime(En)
print(n,En ,B);

# Initialization
e1 = 2; En=e1; print(1, En , is_prime(En))
e2 = 3; En=e2; print(2, En , is_prime(En))
Prod = e1 * e2

print("--Variant with +1:")

euclidnumbers(3,9, +1, Prod) # default calculation of next Euclid number
print("--Variant with -1:")
euclidnumbers(3,9, -1, Prod) # modified calculation with -1

#------------------------------------
# CHAP04 -- Sage -Script -SAMPLE 015: =========
# Euclid numbers with +1 (and a variant with -1
# 1 2 True
# 2 3 True
# --Variant with +1:
# 3 7 True
# 4 43 True
# 5 1807 False
# 6 3263443 True
# 7 10650056950807 False
# 8 113423713055421844361000443 False
# 9 12864938683278671740537145998360961546653259485195807 False
# --Variant with -1:
# 3 5 True
# 4 29 True
# 5 869 False
# 6 756029 False
# 7 571580604869 False
# 8 326704387862983487112029 False
# 9 106735757048926752040856495274871386126283608869 False

4.6.9 f (n) = n 2 + n + 41
This sequence starts off promisingly, but that is by no means proof that things will
continue like this:

f (0) = 41 7→ Prime
f (1) = 43 7→ Prime
f (2) = 47 7→ Prime
f (3) = 53 7→ Prime
f (4) = 61 7→ Prime
f (5) = 71 7→ Prime
f (6) = 83 7→ Prime
f (7) = 97 7→ Prime
..
.
f (33) = 1163 7→ Prime
f (34) = 1231 7→ Prime

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 160 — #22

i i

160 Prime Numbers

f (35) = 1301 7→ Prime

f (36) = 1373 7→ Prime
f (37) = 1447 7→ Prime
f (38) = 1523 7→ Prime
f (39) = 1601 7→ Prime
f (40) = 1681 = 41 · 41 7→ Not prime!
f (41) = 1763 = 41 · 43 7→ Not prime!

The first 40 sequence values are different prime numbers (which have the obvi-
ous regularity that their difference starts with 2 and increases by 2 each time). Of
the 240 possible prime numbers p with 41 ≤ p ≤ 1601, there are 40 that occur in
the sequence.5
But the 41st and 42nd values are not prime numbers. It is easy to recognize
that f (41) cannot be a prime number:

f (41) = 412 + 41 + 41 = 41(41 + 1 + 1) = 41 · 43

The Euler polynomial f (n ) = n 2 + n +41 can be made visible in the Ulam spiral
(antidiagonal) by setting 41 as the starting value in the center. See Figure 4.5.6

4.6.10 f (n) = n 2 − 79n + 1601 and Heegner Numbers

This function delivers prime numbers for all function values from n = 0 to n =
79. See Table 4.5. The source to compute this table can be found in SageMath
Example 4.12.
Unfortunately, f (80) = 1681 = 41 · 41 is not a prime number. To this date,
no function has been found that delivers more prime numbers in a row. On the
other hand, each prime occurs twice (first in the decreasing then in the increasing
sequence), which means that the algorithm delivers a total of 40 different prime
values. These are the same ones as delivered by the function in Section 4.6.9.
But there are polynomials that deliver more than 40 distinct prime numbers in
a row. In this context, the Heegner numbers play a role. Longer polynomials have
been found since 2000, for example, in Al Zimmermann’s Programming Contests
[15]. For example, f (n ) = n 4 − 97 · n 3 + 3294 · n 2 − 45458 · n + 213589 returns 49
different, consecutive prime numbers—but only if you count the 9 negative ones as
prime.
The polynomial A121887 from OEIS [16] (n 5 − 133 ·n 4 +6729 ·n 3 − 158379 ·n 2 +
1720294 · n − 6823316)/4 even yields 57 different primes in a row (14 of which are
negative), but the coefficients of the polynomial are not integers. You can simulate
this with SageMath Example 4.14.

5. How many prime numbers are really in a range can be easily determined with SageMath. Here, for example,
with len(list(primes(41,1602)))) or with pari(1601).primepi() - pari(40).primepi() = 252-
12 = 240.
6. Graphics from CT2 Crypto Tutorials F World of Primes F Distribution of primes F Ulam's
spiral.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 161 — #23

i i

4.6 Special Types of Numbers and the Search for a Formula for Primes 161

Figure 4.5 Ulam’s prime spiral for the Euler polynomial f (n ) = n 2 + n + 41, starting with 41 in the
center.

4.6.11 Polynomial Functions f (x) = an x n + an−1 x n−1 + · · · + a1 x 1 + a0

(ai ∈ Z, n ≥ 1)
There is no polynomial that delivers prime values only for all x in Z. For a proof
of this, please refer to [17, p. 83 f.], where you will also find further details about
prime number formulae.
This means there is no hope in looking for further formulae (functions) similar
to that in Section 4.6.9 or Section 4.6.10, if one expects that these produce only
primes for all n.

4.6.12 Catalan’s Mersenne Conjecture

Eugene Charles Catalan conjectured that C4 and any further term in this sequence
is a prime:

C 0 = 2,
C1 = 2C0 − 1,
C2 = 2C1 − 1,
C3 = 2C2 − 1,
C4 = 2C3 − 1, . . .

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 162 — #24

i i

162 Prime Numbers

Table 4.5 Values of the Prime Number Function f (n ) = n 2 − 79 · n + 1601

f (0) = 1601 7→ prime f (26) = 223 7→ prime
f (1) = 1523 7 → prime f (27) = 197 7 → prime
f (2) = 1447 7 → prime f (28) = 173 7 → prime
f (3) = 1373 7 → prime f (29) = 151 7 → prime
f (4) = 1301 7 → prime f (30) = 131 7 → prime
f (5) = 1231, 7 → prime f (31) = 113 7 → prime
f (6) = 1163 7 → prime f (32) = 97 7 → prime
f (7) = 1097 7 → prime f (33) = 83 7 → prime
f (8) = 1033 7 → prime f (34) = 71 7 → prime
f (9) = 971 7 → prime f (35) = 61 7 → prime
f (10) = 911 7 → prime f (36) = 53 7 → prime
f (11) = 853 7 → prime f (37) = 47 7 → prime
f (12) = 797 7 → prime f (38) = 43 7 → prime
f (13) = 743 7 → prime f (39) = 41 7 → prime
f (14) = 691 7 → prime f (40) = 41 7 → prime
f (15) = 641 7 → prime f (41) = 43 7 → prime
f (16) = 593 7 → prime f (42) = 47 7 → prime
f (17) = 547 7 → prime f (43) = 53 7 → prime
f (18) = 503 7 → prime ···
f (19) = 461 7 → prime f (77) = 1447 7→ prime
f (20) = 421 7 → prime f (78) = 1523 7 → prime
f (21) = 383 7 → prime f (79) = 1601 7 → prime
f (22) = 347 7 → prime f (80) = 41 · 41 7 → NOT prime!
f (23) = 313 7 → prime f (81) = 41 · 43 7 → NOT prime!
f (24) = 281 7 → prime f (82) = 1847 7 → prime
f (25) = 251 7 → prime f (83) = 1933 7 → prime
f (84) = 43 · 47 7 → NOT prime!

This sequence is defined recursively and increases extremely fast (much quicker
than the Mersenne prime numbers). Does this sequence consist only of primes?

C0 =2 7→ Prime
C1 = 22 − 1 =3 7 → Prime
C2 = 23 − 1 =7 7 → Prime
C3 = 27 − 1 = 127 7 → Prime
C4 = 2127 − 1 = 170141183460469231731687303715884105727 7 → Prime

It is not (yet) known whether C5 = 2C4 − 1 and all higher elements are prime.
In any case, it has not been proved that this formula delivers only primes.
It seems very unlikely that C5 (or many of the larger terms) would be prime. So
this could be another example of Guy’s law of small numbers.

4.6.13 Double Mersenne Primes

From C2 onwards, the above Catalan-Mersenne numbers are a subset of the double
Mersenne primes. A double Mersenne prime is a Mersenne prime of the form
p −1
M M p = 22 −1

where p is a Mersenne prime exponent and M p is a prime Mersenne number.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 163 — #25

i i

4.7 Density and Distribution of the Primes 163

The first values of p for which M p is prime are p = 2, 3, 5, 7, 13, 17, 19, 31,
61, 89, 107, 127, 521, ... (see above).
M M p is known to be prime for p = 2, 3, 5, 7, and has the appropriate values:

7, 127, 2147483647, 170141183460469231731687303715884105727.

SageMath Example 4.5 calculates these values.

SageMath Example 4.5: Double Mersenne Primes

print ("\n# CHAP04 -- Sage -Script -SAMPLE 010: =========")
print ("# Double Mersenne primes ")

for p in (2,3,5,7):
Mp=2^p-1
MMp=2^Mp -1
B=is_prime(MMp)
print(p,Mp ,MMp ,B);

#------------------------------------
# CHAP04 -- Sage -Script -SAMPLE 010: =========
# 2 3 7 True
# 3 7 127 True
# 5 31 2147483647 True
# 7 127 170141183460469231731687303715884105727 True

For p = 11, 13, 17, 19, and 31, the corresponding double Mersenne numbers
are not prime. The next candidate for the next double Mersenne prime is M M61 =
22305843009213693951 − 1.
Being approximately 1695 · 10694127911065419641 this number—like C5 (see
Section 4.6.12)—is far too large for any currently known primality test to be
successfully applied.

4.7 Density and Distribution of the Primes

As Euclid proved, there is an infinite number of primes. However, some infinite sets
are denser than others.
Within the set of natural numbers, there is an infinite number of even, uneven,
and square numbers. How to compare the density of two infinite sets is shown with
the sets of even and square numbers.
Whereas in colloquial language you can often hear that “there are more” even
numbers than square ones, mathematicians say that from both there are infinitely
many that their sets are equivalent to N (so both are infinite and countable; i.e., one
can assign an integer to each even number and to each square number). However,
the set of even numbers is denser than the set of square numbers. The following
proves that the even numbers are distributed more densely than square ones:

• The size of the nth element:

The nth element of the even numbers is 2n; the nth element of the square

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 164 — #26

i i

164 Prime Numbers

numbers is n 2 . Because for all n > 2: 2n < n 2 , the nth even number occurs
much earlier than the nth square number.
• The numbers of values that are less than or equal to a certain maximum
value x in R are:
√
There are bx /2c such even numbers and b xc square numbers. Because
for all x > 6 the value bx /2c is greater than the largest integer smaller or
equal to the square root of x, the even numbers are distributed more densely.

The Value of the nth Prime P (n )

Theorem 4.6 For large n: The value of the nth prime P (n ) is asymptotic to n ·ln(n );
that is, the limit of the relation P (n )/(n · ln n ) is equal to 1 if n tends to infinity.
For n > 5, P (n ) lies between 2n and n 2 . This means that prime number are less
dense than natural numbers, but denser than square numbers. See Section 4.10.

The number of prime numbers P I (x ). The definition for the number P I (x )—also
conventionally written as Π(x ), is similar: It is the number of all primes that does
not exceed the maximum value x.
Theorem 4.7 P I (x ) is asymptotic to x / ln(x ).
This is the famous prime number theorem. It was put forward by Adrien-Marie
Legendre and Carl Friedrich Gauss but not proved until over 100 years later.
Alternative ways of expressing this are:

P I → x / ln(x )
ln(x )

lim P I · =1
x→∞ x
x
lim ln(x ) =
x→∞ PI

The distribution is graphically presented in Figure 4.10 in Section 4.14.

The formulae for the prime number theorem only apply when n tends to infinity.
The formula of Gauss can be replaced by more precise formulae. For x ≥ 67:

ln(x ) − 1, 5 < x / P I (x ) < ln(x ) − 0, 5

Given that we know P I (x ) = x / ln x only for very large x (x tending towards

infinity), we can create the following overview:

x ln(x ) x / ln(x ) P I (x )(counted) P I (x )/(x / ln(x ))

103 6.908 144 168 1.160
106 13.816 72386 78498 1.085
109 20.723 48254942 50847534 1.054

For a binary number (these consist only of the digits 0 and 1) x of length of 250
bits (2250 is approximately 1.81 · 1075 ) and because the quotient P I (x )/(x / ln(x ))

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 165 — #27

i i

4.8 Outlook 165

is moving closer to 1, PI(x) can be estimated very well this way:

P I (x ) = x / ln x = 2250 /(250 · ln 2) ≈ 2250 /173.28677 ≈ 1.05 · 1073

We can therefore expect that the set of numbers with a bit length of less than 250
contains approximately 1073 primes—a reassuring result!
We can also express this as follows: Let us consider a random natural number
n. Then the probability that this number is prime is around 1/ ln(n ). For example,
let us take numbers in the range of 1016 . Then we must consider 16 · ln 10 = 36.8
numbers (on average) until we find a prime. A precise count shows that there are
10 prime numbers between 1016 − 370 and 1016 − 1.
Another way to express this is: The average gap between two consecutive
primes near the number n is close to the natural logarithm of n. For example, for
a number n close to 100, ln(n ) ≈ 4.6, so roughly every fifth number in this range
should be prime. Further details about prime gaps can be found on [18] and in
Section 4.9.5.
Under the heading How Many Primes Are There in [19], you can find numerous
other details. Using the website in [20] you can easily determine P (n ) and P I (x ).
The distribution of primes displays several irregularities for which no system
has been found yet. On the one hand, many occur closely together, like 2 and 3, 11
and 13, 809 and 811, on the other hand large gaps containing no primes also occur.
For example, no primes lie between 113 and 127, 293 and 307, 317 and 331, 523
and 541, 773 and 787, 839 and 853 as well as between 887 and 907. Discovering
the secrets of these irregularities is precisely part of what motivates mathematicians.
Some visualizations (plots) of the quantity of primes in different number dimensions
can be found in Section 4.14.

Sieve of Eratosthenes. An easy way of calculating all P I (x ) primes less than or

equal to x is to use the sieve of Eratosthenes. In the third century B.C., he found an
extremely easy, automatic way of discovering this. To begin with, you write down
all numbers from 2 to x, circle 2, then cross out all multiples of 2. Next, you circle
the lowest number that hasn’t been circled or crossed out (now 3) and again cross
out all multiples of this number, and so on.
You have to keep crossing out only until you reach the largest number whose
square is less than or equal to x (here up to 10, as 112 is already >120).7
Prime numbers never end in the digit 0. Apart from 2, prime numbers are never
even. Apart from 2 and 5, prime numbers never end in 2 or 5. So you only need to
consider numbers ending in 1, 3, 7, or 9 anyway (there are infinite primes ending
in each of these digits; see [22, Vol. 1, p. 137]).

4.8 Outlook

Currently, you can find large databases that contain either many primes or the fac-
torization of numerous composite numbers. The fastest factorizers on the internet
are FactorDB by Markus Tervooren [23] and Alpertron, the integer factorization

7. CT2 Crypto Tutorials F World of Primes also contains a visualization of this method.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 166 — #28

i i

166 Prime Numbers

Figure 4.6 The sieve of Eratosthenes, applied to the first 120 numbers [21].

i i

168 Prime Numbers

The longest known arithmetic prime number sequence contains 27 elements

(as of January 2023). Table 4.6 lists the longest currently known arithmetic prime
number sequences with minimal difference. In Table 4.6, n is the number of elements
of the sequence and Digits is the number of digits of the difference d of the sequence
elements. How to read table Table 4.6 (Table 4.7 shows the values of k#)?
For n = 3: d = 2 = 2# (so k = 2). The sequence is: 3, 5, 7.
For n = 4: d = 6 = 3# (so k = 3). The sequence is: 5, 11, 17, 23.
For n = 5: d = 6 = 4# = 3# (so k = 3). The sequence is: 5, 11, 17, 23, 29.
For n = 6: d = 30 = 5# (so k = 5). The sequence is: 7, 37, 67, 97, 127, 157.
For n = 7: d = 150 = 5 ∗ 5# (so multiple of k# with k = 5). The sequence is:
7, 157, 307, 457, 607, 757, 907.
Table 4.6 was built using [25] and lists the sequences that have the smallest
known difference for a given length. In contrast, the “largest known AP-k” listed
in [26] contain as the last sequence element a prime as large as possible.
As a team, the two mathematicians Ben Green and Terence Tao were able in
2004 to prove Hardy’s conjecture, which had puzzled mathematicians for over 80
years. It states that for any arbitrary length there exists an arithmetic prime number
sequence (PAP). Additionally, they managed to prove that for any given length there
are infinitely many different sequences.

Table 4.6 Arithmetic Prime Number Sequences with Minimal Distance∗,∗∗

n First Element Distance d Year Discovered By
Digits
3 3 2 -
= 2# 1
4 5 6 -
= 3# 1
5 5 6 -
= 3# 1
6 7 30 1909 G. Lenaire
= 5# 2
7 7 150 1909 G. Lenaire
= 5 · 5# 3
...
...
21 28112131522731197609 9699690 2008 Jaroslaw Wroblewski
= 19# 7
22 166537312120867 96599212710 2006 Markus Frind
= 9959 · 19# 11
23 403185216600637 2124513401010 2006 Markus Frind
= 9523 · 23# 13
24 158209144596158501 14517322329510 2014 Bryan Little
= 65073 · 23# 14
25 6171054912832631 81737658082080 2008 Raanan Chermoni,
= 366384 · 23# 14 Jaroslaw Wroblewski
26 3486107472997423 371891575525470 2012 James Fry
= 1666981 · 23# 15
27 224584605939537911 18135696597948930 2019 Rob Gahan
= 81292139 · 23# 18
∗ Smallest known AP-n.
∗∗ As of April 2022.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 169 — #31

i i

4.9 Notes about Primes 169

Table 4.7 Products of the First Primes

<= k, Called k Primorial or k#
i k k#
1 2 2
2 3 6
3 5 30
4 7 210
5 11 2310
6 13 30030
7 17 510510
8 19 9699690
9 23 223092870
10 29 6469693230
11 31 200560490130

Green and Tao intended to prove that there are infinitely many arithmetic
sequences of length four. For this purpose they considered sets of numbers con-
sisting of primes and near primes. These are numbers with a small set of divisors
like numbers that are the product of exactly two primes—these numbers are called
half primes. Thus, they managed to considerably simplify their work because there
were already a lot of useful theorems about near primes. Finally, they discovered
that the results of their theorem were far more reaching than they had assumed.
Therefore, they were able to prove Hardy’s conjecture.
Anyone who believes that it is easy to use Green’s and Tao’s 49-page proof to
compute arithmetic prime number sequences of arbitrary length will soon become
disappointed, because the proof is nonconstructive. It is a proof of existence. This
means that these mathematicians have shown “only” that these sequences exist, but
not how to find them in practice.
This indicates that in the set of the natural numbers there is, for example,
a sequence of one billion primes that all have the same distance; and there are
infinitely many of them. However, these sequences lie extremely far beyond the
numbers we usually use (far outside).
The length of a sequence determines the minimal distance between the single
primes of the sequence. Given a sequence with n = 6 elements the distance between
them has to be 30 or a multiple of 30. More precisely: If a PAP-k does not begin
with the prime k, then the common difference is a multiple of the primorial k#.
The number 30 results from the product of all primes smaller than the length of
the sequence (here 6): 6# = 5# = 2 · 3 · 5 = 30. Another example: 10# = 7# =
2 · 3 · 5 · 7 = 210. If you look for a sequence with 15 elements, then the common
distance is at least 15# = 13# = 2 · 3 · 5 · 7 · 11 · 13 = 30030. The primorial notation
starts with k = 2: 2# = 2, 3# = 2 · 3 = 6, 5# = 2 · 3 · 5 = 30.
This signals that the length of an arithmetic prime sequence can be arbitrary
big, but the distance between the elements cannot be any arbitrary number. For
example, there is no arithmetic prime sequence with the distance 100 because 100
cannot be divided by 3.
The minimal distances for sequences of length n are multiples of k# with k =
n − 1. The k-primorials (prime factorials) are shown in Table 4.7—the values were
calculated by SageMath Example 4.6.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 170 — #32

i i

170 Prime Numbers

4.9.2.1 Further Restriction for the Arithmetic Progression

If you look at arithmetic prime sequences that fulfill the additional requirement that
all primes are consecutive, also called consecutive prime arithmetic progressions
(or CPAP-k or CAP-k) then the longest such sequence (as of May 2022), has a
length of 10. The first CPAP-10 was found in 1998; it has the common distance
10# = 7# = 210 and starts with the 93-digit prime: 100 9969724697 1424763778
6655587969 8403295093 2468919004 1803603417 7589043417 0334888215
9067229719.

SageMath Example 4.6: Calculate the k Primorials (k#) for Table 4.7
print ("\n# CHAP04 -- Sage -Script -SAMPLE 060: =========")

# calculate the k primorials (k#) for tables in chapter 4

# k# = product of first k primes. Note , function defines k#=1 for k=1 (besides k=1 is no prime)
# k# = defined in Sage as a Sloane sequence
# https :// doc.sagemath.org/html/en/reference/combinat/sage/combinat/sloane_functions.html

print('\nGet k# = primorial numbers from OEIS Integer Sequence for Table 4.8:')
n=33; print(' Upper border for prime k (k<n): n =', n)

primlist = [i for i in primes(1,n)]

lenprimlist = len(primlist)
print(' lenprimlist =', lenprimlist , '; pList =', primlist)
# P=Primes (); print(' Alternative to get prime k using unrank: ', [P.unrank(i-1) for i in range �
� (1,lenprimlist+1)])

a = sloane.A002110 # Begins with 1 instead of 2

aa = a.list(lenprimlist+1)[1:] # get lenprimlist elements and discard 1st element
print(' lenaa =', len(aa), '; aaList =', aa)

L = [(i,primlist[i-1],aa[i-1]) for i in range(1, lenprimlist+1)]

print(' List of tuples (i, k, k#):')
print(' ', L)

print('\nSome distances for Table 4.7:')

print(' 19# =', a[8])
print(' 9959·19# =', 9959*a[8])
print(' 9523·23# =', 9523*a[9])
print(' 65 073·23# =', 65073*a[9])
print(' 366 384·23# =', 366384*a[9])
print(' 1 666 981·23# =', 1666981*a[9])
print(' 81 292 139·23# =', 81292139*a[9])

Twin primes are prime numbers whose difference is exactly 2. Examples include 5
and 7, or 101 and 103, or 1693965 · 266443 ± 1, or 318032361 · 2107001 ± 1. Cousin
primes are prime numbers that differ, for example, by four, like 13 and 17.
The conjecture that there are infinite many twin primes is not obvious. It’s known
that for large numbers in average the expected gap between primes is constantly
growing at around 2.3 times the number of decimal digits. For example, among 100-
digit decimal numbers the expected gap between primes is in average 230. But this
statement is true just on average—often the gap is much bigger, or much smaller.
Note: There is only one triplet prime: 3, 5, 7. For all other sets of three
consecutive odd numbers, one of them is always divisible by 3 and thus not a prime.
The biggest known twin pairs (as of April 2023) are:

2, 996, 863, 034, 895 · 21290000 ± 1 with 388342 decimal digits

3, 756, 801, 695, 685 · 2666669 ± 1 with 200700 decimal digits

They were found in 2011 and 2016.

Open questions about twin primes are:
• What is the number of twin primes: Are there infinitely many or only a
limited number?
• Does a formula exist for calculating the number of twin primes per interval?

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 174 — #36

i i

174 Prime Numbers

In order to approach such questions, one can take different paths and ask
different leading questions.
One was asked by the Norwegian mathematician Viggo Brun (1885–1978),
who looked at the sum of the reciprocals of successive twin primes. In 1919, he
proved that this sum converges to a specific numerical value (≈ 1.90216), which
is now called Brun’s constant. The fact that this sum converges shows that twin
primes are relatively rare, even though there might be infinitely many of them. In
contrast, the sum of the reciprocals of all primes diverges.
It’s interesting that the Pentium FDIV bug was found in 1994 by Thomas Nicely
when he used massive computing power to calculate Brun’s constant. The flaw in the
Pentium microprocessor caused only certain types of arithmetic errors. However,
his discovery forced the chip’s manufacturer to replace about one million faulty
processors (costing Intel about half a billion dollars).
In the following subsections, two major milestones are explained that may
allow us to come closer to the riddle of the number of twin primes.

4.9.5.1 GPY 2003, Proof Correction 2005

A big step toward the solution of this problem was made by Dan Goldston, János
Pintz, and Cem Yildirim in 2003. The three mathematicians were investigating the
distribution of prime numbers. They could prove that
pn +1 − pn
lim inf = 0,
n→∞ log pn

where pn denotes the nth prime number.

p +1 − pn
This means that the smallest limit point (lim inf) of the sequence nlog pn
equals
zero. A point is called limit point of a sequence, if—in any arbitrary small neigh-
borhood of that point—there lie infinitely many elements of the sequence. log pn is
about the average distance between the prime pn and the next prime pn +1 . Hence,
the term above implies that there are infinitely many consecutive primes with a gap
between them, which is arbitrarily small compared to the expected average gap.
Moreover, it was proved that

pn +1 − pn < (log pn )8/9

holds true for infinitely many primes.

4.9.5.2 Zhang 2013

In May 2013, the results of Yitang Zhang became known. Zhang proved that there
are infinitely many cousin primes, or more explicitely, that there is some number
H smaller than 70 million such that there are infinitely many pairs of primes that
differ by H . Whereas the gap between the primes of a twin prime is exactly 2, cousin
primes do denote two primes that have a gap between them, which has a value of
a bigger, even, but finite number H . In the meantime this minimal gap H of 70
millions was improved in further work. The corresponding progress is documented
in the Polymath8 project Bounded Gaps between Primes. The best known value of
H was 4680 (as of August 2013) and is until now (as of April 2022) 2460—this

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 175 — #37

i i

4.9 Notes about Primes 175

is good progress compared to 70 million, but far away from 2. H then has been
reduced to 6 (but only on the assumption of the Elliott-Halberstam conjecture).
Those results could be the basis for a final proof that infinitely many twin
primes exist.

4.9.6 Prime Gaps

A prime gap is the difference between two consecutive primes: gm = pm +1 − pm . The
smallest prime number gap is the first one: g1 = 3 − 2 = 1. All other prime number
gaps are even, since 2 is the first and only even prime number and thus the difference
of all others is formed between two odd numbers. The second prime number gap
is g2 = 5 − 3 = 2. Some authors use prime number gap differentiating this as the
number of composite numbers between two prime numbers, which is one less than
according to the definition used here. So their g2 would be (5 − 3) − 1 = 1. Gaps
between consecutive prime numbers were already discussed briefly in Section 4.7.
Whether there are infinitely many twin primes (i.e., gaps of length 2) is one of
the great unsolved problems in mathematics. See Section 4.9.5.
Here we mention a fact that initially is somewhat surprising: In the sequence
of all prime numbers p1 , p2 , p3 , . . ., there are gaps between prime numbers of an
arbitrary long length n. In other words, there exists a sequence of n − 1 consecutive
composite integers ≥ 0 for any given value of n. That is, for any positive integer n,
there is an integer m with gm ≥ n, where m is the index of the gap.
It is easy to argue that such gaps of length n exist. Let N be a natural num-
ber that is not coprime to any of the numbers 2, 3, 4, . . . , n. Then the numbers
N + 2, N + 3, N + 4, . . . , N + n are also not coprime to N , and consequently they
are not prime numbers. So the largest prime number before this sequence is at most
N + 1; the smallest afterward is at least N + n + 1, so that the length of this prime
number gap is at least n.
Such an N can be constructed in at least three different ways:

1. Factorial N = n !
This is technically the easiest way to prove it. Then the considered N + k in
the sequence N + 2, N + 3, N + 4, . . . , N + n are each divisible by the k.
2. N = lcm (2, . . . , n )
You can also choose the least common multiple of the numbers from 2 to n.
3. Primorial N = n#
The smallest possible candidate for N is found through the primorial. If pm
is the smallest prime number greater than n, then n# = ( pm−1 )# applies.
This method of constructing such n-gaps (prime number gaps of length n)
uses primorials: n# = product of all primes ≤ n. See the introduction to
primorials in Section 4.9.2.

Although N was chosen as small as possible in the last case, it is still not guar-
anteed that the gap found is always the first gap of the required length n. In this
respect, all of these methods provide solid evidence and a specific gap. However,
they are only of limited use when searching for the first occurrence of large gaps.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 176 — #38

i i

176 Prime Numbers

The first prime number gap of length n usually occurs well before n !+2, respectively,
N + 2.

Examples:
1. For example, to find a gap of at least length 4 (4-gap); that is, having at least
a triple of composite numbers in the gap, you set n = 4 and get the sequence
4! + 2, 4! + 3, 4! + 4, where 4! = 24. So a triple of composite numbers inside
the gap is (26, 27, 28). The first occurrence of such a triple is already before
4! at (8, 9, 10).
2. With n = 6 one finds a prime number gap of at least length 6 between the
following prime number candidates:
• Via factorial: N = 6! = 720 ⇒ N + 2 = 722; N + 6 = 726.
Since 721 is not prime, the gap is even larger. It is framed by the prime
numbers [719 and 727] and thus has the length 8.
• Via lcm: N = lcm (1, . . . , 6) = 60 ⇒ N + 2 = 62; N + 6 = 66.
Since both borders in [61,67] happen to be prime, the length is also
exactly 6.
• Via primorial: N = 6# = 2 · 3 · 5 = 30 ⇒ N + 2 = 32; N + 6 = 36.
The gap found [31, 37] has exactly the length 6 since both are prime
numbers.
• First gap of length 6: [23, 29] with g9 = 6. The index 9 means that it is
the gap between the 9th and the 10th prime number.
3. The factorial is the fastest growing function among the three functions
considered.
• For n = 6 this was: n ! = 720, lcm (2, . . . , 6) = 60, and n# = 30.
The first gap of exactly length 6 is [23,29] with g9 = 6.
You can calculate this with SageMath:
sage: n=6; factorial(n); lcm(2..n);
primorial=sloane.A002110; primorial(int(pari(n).primepi()))
• For n = 10 the following applies: n ! = 3628800, lcm (2, . . . , 10) =
2520, and n# = 210.
The first gap of exactly length 10 is [139,149] with g34 = 10.
• For n = 14 the following applies:
n ! = 87178291200, lcm (2, . . . , 14) = 360360, and n# = 30030.
The first gap of exactly length 14 is [113,127] with g30 = 14.
Gaps of lengths 10 and 12 follow after the first occurrence of the gap
of length 14.
• For n = 20 the following applies:
n ! = 2432902008176640000, lcm (2, . . . , 20) = 232792560, and n# =
9699690.
The first gap of exactly length 20 is [887,907] with g154 = 20.
You can get the gap index via SageMath: pari(887).primepi() -->
154.

i i

Table 4.8 Gaps Between the First Primes: Gap Length gi = n

i p pnext gi newgmax id x gmax gmax n! lcm n#
1 2 3 1 True 1 1 1 1 1
2 3 5 2 True 2 2 2 2 2
3 5 7 2 False – – – – –
4 7 11 4 True 3 4 24 12 6
5 11 13 2 False – – – – –
6 13 17 4 False – – – – –
7 17 19 2 False – – – – –
8 19 23 4 False – – – – –
9 23 29 6 True 4 6 720 60 30
10 29 31 2 False – – – – –
...
23 83 89 6 False – – – – –
24 89 97 8 True 5 8 40320 840 210
25 97 101 4 False – – – – –
26 101 103 2 False – – – – –
27 103 107 4 False – – – – –
28 107 109 2 False – – – – –
29 109 113 4 False – – – – –
30 113 127 14 True 6 14 87178291200 360360 30030
31 127 131 4 False – – – – –
32 131 137 6 False – – – – –
33 137 139 2 False – – – – –
34 139 149 10 False – – 3628800 2520 210
35 149 151 2 False – – – – –

Table 4.8 was completely created with the SageMath script chap04_sample100
.sage (this script is not printed here, but can be downloaded from the CT website).
Table 4.9 shows the maximal prime number gaps for the first prime numbers.
The first six maximum gaps occur after one of the first 30 prime numbers. Column
gi again contains the length n of the gap. The last three columns again show the
values N of the functions factorial, lcm, and primorial mentioned in Section 4.9.6.
After these candidates, at the latest, a gap of this length is to be expected or can be
constructed in this way. The specific gap, its actual length, and its merit value are
output for each candidate.
Table 4.9 has the same content as the website of Jens Kruse Andersen [33].
Table 4.9 was completely created with the SageMath Example 4.7.
Since the entire SageMath Example 4.7 is over 100 lines long, only the file
header is listed here. The entire file is available on the CT server: see https://www
.cryptool.org/en/documentation/ctbook/sagemath.

SageMath Example 4.7: List of First Maximal Prime Gaps with Merits
print ("\n# CHAP04 -- Sage -Script -SAMPLE 110: =========")

# Calculate table with details for the maximal prime gaps (starting with p_1 = 2)
# - plus some candidate gaps starting with N+2 according to three formulas
# - plus the latex code for the table used in the CTB (currently commented out)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 179 — #41

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 181 — #43

i i

4.11 Indexing Prime Numbers: nth Prime Number 181

Table 4.10 How Many Primes Exist within the First Intervals of Tens, Hundreds,
and Thousands?
Ten-Sized Intervals Hundred-Sized Intervals Thousand-Sized Intervals
Interval Number Interval Number Interval Number
1-10 4 1-100 25 1-1000 168
11-20 4 101-200 21 1001-2000 135
21-30 2 201-300 16 2001-3000 127
31-40 2 301-400 16 3001-4000 120
41-50 3 401-500 17 4001-5000 119
51-60 2 501-600 14 5001-6000 114
61-70 2 601-700 16 6001-7000 117
71-80 3 701-800 14 7001-8000 107
81-90 2 801-900 15 8001-9000 110
91-100 1 901-1000 14 9001-10000 112

Table 4.11 How Many Primes Exist within the First Intervals of Dimensions?
Dimension Interval Number Average Number per 1000
4 1 - 10000 1229 122.90
5 1 - 100000 9592 95.92
6 1 - 1000000 78498 78.50
7 1 - 10000000 664579 66.46
8 1 - 100000000 5761455 57.62
9 1 - 1000000000 50847534 50.85
10 1 - 10000000000 455052512 45.51

4.11 Indexing Prime Numbers: nth Prime Number

Table 4.12 shows the index for a few selected prime numbers. The index in the first
column starts with 1.
It is very easy to calculate the nth prime if the given n is not too big. For exam-
ple, SageMath responds almost instantaneously (30 µsec) to get the billionth prime
with the unrank function. As this function starts indexing from 0 (so the index of
the first prime 2 is 0), we have to reduce the index in SageMath Example 4.8 by 1.
However, to find the trillionth prime number, SageMath did not come back even
after 2 days.

SageMath Example 4.8: Get the nth Prime Number with SageMath
sage: P=Primes (); P.unrank(10^9-1)
22801763489

Does the opposite work too, given a prime p get its index or position? Above,
with Primes().unrank(n) we got a prime p. For example, Primes().unrank(
999) delivers 7919. Now we want something like Primes().ununrank(7919) to
get n = 999, but Primes() doesn’t have such a method. However, there is the
prime counting function prime_pi to determine the number of primes up to a given
number (and this upper number may also be a prime): This n is then the position
of a prime p; that is, to find the n when p is the nth prime. So prime_ pi (7919)
delivers 1000. See SageMath Example 4.9.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 182 — #44

Table 4.14 Corresponding Special Values

of the Binary and Decimal Systems
Binary System Decimal System
210 1024
240 1.09951 · 1012
256 7.20576 · 1016
264 1.84467 · 1019
280 1.20893 · 1024
290 1.23794 · 1027
2112 5.19230 · 1033
2128 3.40282 · 1038
2150 1.42725 · 1045
2160 1.46150 · 1048
2192 6.27710 · 1057
2250 1.80925 · 1075
2256 1.15792 · 1077
2320 2.13599 · 1096
2512 1.34078 · 10154
2768 1.55252 · 10231
21024 1.79769 · 10308
22048 3.23170 · 10616
24096 1.04439 · 101233

keys and the search effort. This can be done provided that, for example, one million
keys can be tested within one second.
Such tables can easily be generated using computer algebra systems (CAS) as
here with SageMath Example 4.10.

SageMath Example 4.10: Special Values of the Binary and Decimal Systems
print ("\n# CHAP04 -- Sage -Script -SAMPLE 020: =========")

E = [10,40,56,64,80,90,112,128,150,160,192,256,1024,2048,4096]
for e in E:
print( '2^%4d --- ' % e, RR(2^e).n(24) )

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 184 — #46

i i

184 Prime Numbers

4.14 Visualization of the Quantity of Primes in Higher Ranges

4.14.1 The Distribution of Primes
There are four primes between 1 and 10. There are already 1061 primes between
103 and 104 . In the interval [109 , 1010 ] lie 404204977 ≈ 4 · 108 primes, and in the
interval from 1019 to 1020 there are

1986761935284574233 ≈ 1.9 · 1018

primes.

4.14.1.1 The Prime Number Theorem

The number P I (x ) of primes up to a given number x can be approximately deter-
mined by a formula, derived from the prime number theorem (see Section 4.7).
P I (x ) denotes the number of primes that are smaller or equal to x:

x
P I (x ) ∼
ln x

Note that this formula only gives an approximation of the number of primes
smaller or equal to x. It becomes more exact as the number x increases. In the
following we are using the prime number theorem to examine the distribution of
primes.
In order to understand why the number of primes is growing so rapidly,
although the boundaries of the intervals only differ by the exponent 1. Let’s have a
closer look at both components of the right side of the formula: x and ln x.

4.14.1.2 The Functions x and 10x

The function x is a straight line. It is shown in Figure 4.7(a).
In the next step the function of the boundaries of the intervals are drawn in
Figure 4.7(b). To get an idea of how the functions look, like the domain of definition
was chosen to be from 0 to 1010 and from 0 to 10, respectively. You can see that
with increasing exponent x the numbers grow stronger.

4.14.1.3 The Function ln x

In comparison to functions x and 10x , we now consider the function ln x.
Figure 4.8(a) shows the graph with the domain of definition from 1 to 100.
Figure 4.8(b) the domain of definition was chosen between 1 and 1010 .
One can see that the values of the function ln x grow slowly compared to the
growth of the function x. This is visualized by the graph of both functions in one
picture shown in Figure 4.9. In addition to that, the graph of the function lnx x was
drawn in the same figure.

4.14.1.4 The Function P I (x) = x

ln x
The function lnx x consists of the function x as the numerator and the function ln x
in the denominator which, in comparison to x, increases very slowly. Compared to

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 185 — #47

i i

4.14 Visualization of the Quantity of Primes in Higher Ranges 185

Figure 4.7 Graph of the functions (a) x and (b) 10x .

the number x itself, the number of primes less or equal to x is small. But still, x
ln x
is an increasing function as you can see in Figure 4.9.

4.14.1.5 The Number of Primes in the Different Intervals

Figure 4.10 visualizes how the number of primes behaves in the intervals [1, 10x ]
and [10x−1 , 10x ]. The result of the approximation function is used to calculate it
faster (not the exact numbers like in Tables 4.10 and 4.11).
x x x−1
Here for each base 10 exponent two bars are drawn: ln1010x and ln1010x − ln1010x−1 :
The top chart in Figure 4.10 shows the values for the exponents x from 1 to 5, and
the right one for x from 1 to 10, where x is the base 10 exponent.
The blue bars represent the overall number of primes up to 10x . The red bars
show how many primes accrue in the interval [10x−1 , 10x ], respectively. This makes

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 186 — #48

i i

186 Prime Numbers

Figure 4.8 Graph of the function ln x (a) up to 100 and (b) up to 1010 .

clear that the number of primes in intervals of higher exponents keeps growing quite
fast.
A table containing the number of primes in some dedicated intervals can be
found in Section 4.10. For example, within the interval [1, 104 ] there are 1229
primes; thereof are in the interval [103 , 104 ] 1229 - 168 = 1061 primes.
More theory about the prime number theorem and the function PI(x) can be
found in Section 4.7.
SageMath Example 4.11 creates the graphs for the three functions x, log(x),
and x/log(x), shown in Figures 4.7 to 4.9. It also calculates values for the bars in
Figure 4.10.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 187 — #49

i i

4.14 Visualization of the Quantity of Primes in Higher Ranges 187

Figure 4.9 The functions x (blue), ln x (red), and x

ln x
(green).

Figure 4.10 Numbers of primes in the interval [1, 10x ] (blue) and in the interval [10x−1 , 10x ] (red)
for different exponents x.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 188 — #50

i i

188 Prime Numbers

SageMath Example 4.11: Generation of the Graphs of the Three Functions x,

log(x), and x/log(x)
print ("\n# CHAP04 -- Sage -Script -SAMPLE 030: =========")

def Display(F,fname):
# parameter 'fname ' should contain no blanks

### The following commands work in Sage CLI for all OS , but not when called from script
# F.show () # Alternative , also working in Sage CLI: F.plot ()
# CLI outputs: 'Launched png viewer for Graphics object consisting of 1 graphics �
� primitive '
# This automatically comes with annotated axes / Achsen sind automatisch beschriftet

### This works from a Sage script under Ubuntu (adapt path and viewer name for other OS)
pngfile ='/tmp/'+fname+'.png '; # print (" pngfile =", pngfile)
F.save(pngfile ,axes=True)
imv = 'feh ' # 'okular ' # 'gwenview ' # 'eog ' ## image viewer to start from the �
� terminal under Linux
# imv = 'open -a preview ' ## image viewer to start from the terminal under macOS
oscommand = imv + pngfile + ' &'; # print (" oscommand =", oscommand)
os.system(oscommand) # With Ubuntu: eog = Eye of GNOME file viewer
## os.system('display /tmp/F.png &') # Alternative: The display command needs 'imagemagick �
� ' to be installed.
return

# Definition of function f(x)=x and plots for the domains 0 to 100 and 0 to 10^10
def f(x):return x
F1=plot(f,(0,100)); Display(F1, "F1") # it doesn 't matter whether range starts from 0 or 1
F2=plot(f,(0,10^10)); Display(F2, "F2")

# Definition of function g(x)=10^x and plots for the domain 0 to 10

def g(x): return 10^x
G=plot(g,(0,10)); Display(G, "G")

# Definition of function h(x)=log(x) and plots for the domains 1 to 100 and 1 to 10^10
def h(x): return log(x)
H1=plot(h,(1,100),color ="red"); Display(H1, "H1")
H2=plot(h,(1,10^10),color ="red"); Display(H2, "H2")

# Definition of function k(x)=x/log(x) and plots for the domain 2 to 100

def k(x): return x/log(x)
K1=plot(k,(1,100),color =" green "); Display(K1, "K1")
# K=plot(k,(2,100),color =" green "); Display(K, "K")

# Plots of the functions f, k and h for the domain up to 100

Display(F1+K1+H1, "F1+K1+H1")

# Generation of the data for the bar charts ..........................

# Determination of the number of primes in the interval [1,10]
print ("#p in %13s:" % "[1,10]", pari(10).primepi () - pari(1).primepi () )

# Determination of the number of primes in the interval [1,100]

print ("#p in %13s:" % "[1,100]", pari(100).primepi () - pari(1).primepi () )

# Determination of the number of primes in the interval [10^3,10^4]

print ("#p in %13s:" % "[10^3,10^4]", pari(10**4).primepi ()-pari(10**3).primepi () )

# Determination of the number of primes in the interval [10^8,10^9]

print ("#p in %13s:" % "[10^8,10^9]", pari(10**9).primepi ()-pari(10**8).primepi () )

# Determination of the number of primes in the interval [10^9,10^10]

print ("#p in %13s:" % "[10^9,10^10]", pari(10**10).primepi ()-pari(10**9).primepi () )

# Determination of the number of primes in the interval [10^10,10^11]

print ("#p in %13s:" % "[10^10,10^11]", pari(10**11).primepi ()-pari(10**10).primepi () )

# Determination of the number of primes in the interval [1,10^11]

print ("#p in %13s:" % "[1,10^11]", pari(10**11).primepi ()-pari(1).primepi () )

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 189 — #51

i i

4.15 Examples Using SageMath 189

4.15 Examples Using SageMath

Below is SageMath source code related to the contents of this chapter.

4.15.1 Some Basic Functions about Primes Using SageMath

SageMath Example 4.12 shows some calls to answer very simple questions about
primes.

SageMath Example 4.12: Some Basic Functions about Primes

# Methods of the class of the set of prime numbers
sage: P=Primes (); P
Set of all prime numbers: 2, 3, 5, 7, ...

sage: P.next(5)
7

# unrank(n): Return the n-th prime number

sage: P.unrank(0); P.unrank(5)
2
13

sage: P[5]
13

# Function to return the next prime number

sage: next_prime(5)
7

# Returns how many primes <=x are there

sage: pari(10).primepi ()
4

# Returns the first x primes

sage: primes_first_n(6)
[2, 3, 5, 7, 11, 13]

# Returns the primes in an interval

sage: list(primes(4)); list(primes(2,10))
[2, 3]
[2, 3, 5, 7]

4.15.2 Check Primality of Integers Generated by Quadratic Functions

SageMath Example 4.13 verifies the primality of integers generated by the
function f (n ) = n 2 − 9n + 61. The code defines a function called
quadratic_prime_formula() that takes three arguments:

• start — An integer that is the lower bound for integers in the sequence
start, start + 1, start + 2, . . . , end − 1, end.
• end — An integer that is the upper bound for the integers in this sequence.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 190 — #52

i i

190 Prime Numbers

• verbose — (default: True) a flag to signify whether to print a message indi-

cating the primality of an integer generated by f (n ). If False, only the
statistics are printed.

Since the output of SageMath Example 4.13 has many lines, only the beginning
and the end of the output are listed here.

SageMath Example 4.13: Verify the Primality of Integers Generated by a

Quadratic Function
1 print ("\n# CHAP04 -- Sage -Script -SAMPLE 040: =========")
2
3 def quadratic_prime_formula(start , end , verbose=True):
4 print ("N -- N^2 - 9*N + 61")
5 P = 0 # the number of primes between start and end
6 for n in range(start , end + 1):
7 X = n^2 - 9*n + 61
8 if is_prime(X):
9 P += 1
10 if verbose:
11 print(str(n) + " -- " + str(X) + " is prime ")
12 else:
13 if verbose:
14 print(str(n) + " -- " + str(X) + " is NOT prime ")
15 print (" Number of primes: " + str(P) + " in range (%d, %d)" % (start ,end))
16 print (" Percentage of primes: " + str(float ((P * 100) / (end - start + 1))))
17
18 # Compute the values of f(n) = n^2 - 9n + 61 for n = 0, ..., 50
19 # and verify the primality of the generated integers
20 quadratic_prime_formula (0, 50)
21
22 #------------------------------------
23 # CHAP04 -- Sage -Script -SAMPLE 040: =========
24 # N -- N^2 - 9*N + 61
25 # 0 -- 61 is prime
26 # 1 -- 53 is prime
27 # 2 -- 47 is prime
28 # 3 -- 43 is prime
29 # 4 -- 41 is prime
30 # 5 -- 41 is prime
31 # 6 -- 43 is prime

71 # 46 -- 1763 is NOT prime

72 # 47 -- 1847 is prime
73 # 48 -- 1933 is prime
74 # 49 -- 2021 is NOT prime
75 # 50 -- 2111 is prime
76 # Number of primes: 48 in range (0, 50)
77 # Percentage of primes: 94.11764705882354

With the function call at the end of the code sample we compute the values of
f (n ) = n 2 − 9n +61 for n = 0, 1, 2, . . . , 50 and verify the primality of the generated
integers.
The last two lines of the output contain a small statistics. You can see that f (n )
generates 48 primes when 0 ≤ n ≤ 50, which is approximately 94% of the values
generated by f (n ).
A modification of this code is in SageMath Example 4.14: There, the function
values of other functions f (n ) are checked for their primality.
For larger sequences, it is impractical to print all single messages indicating the
primality of integers. Therefore, SageMath Example 4.14 sets the verbose param-
eter of the function to False. So only the statistics at the end are printed: the

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 191 — #53

i i

4.15 Examples Using SageMath 191

overall number of primes and the percentage of primes, generated by f (n ) where

0 ≤ n ≤ 1000.
Here the function quadratic_prime_formula() is slightly expanded: It can
also handle negative prime numbers and noninteger coefficients: The 4th argument
countNegativePrimeToo of the function has the default False. This is a flag that
determines whether a function value f (n ) that is a negative prime should also be
considered prime.

SageMath Example 4.14: Primality of Results of a Quadratic Function

(Variable n Runs Until 1000)
print ("\n# CHAP04 -- Sage -Script -SAMPLE 050: =========")

def quadratic_prime_formula(start , end , verbose=True , countNegativePrimeToo =False):

a_list = []
print ("N -- N^2 - 79*N + 1601 .........") # all Y = f(n) are positive; 80 consecutive primes; �
� 40 unique ones.
#a# print ("N -- N^2 - 9*N + 61 .........") # all Y = f(n) are positive; 45 consecutive primes �
� ; 40 unique ones.
#b# print ("n -- 36*n^2 - 810*n + 2753 .........") # 14 Y = f(n) are negative; 45 consecutive �
� primes; 45 unique ones.
#c# print ("n -- n^4 - 97*n^3 + 3294*n^2 - 45458*n + 213589 .........") # 9 neg / 50 con / 49 �
� uni.
#d# print ("n -- (n^5 - 133*n^4 + 6729*n^3 - 158379*n^2 + 1720294*n - 6823316) / 4 .........") �
� # 14 neg / 57 con / 57 uni.
P = 0 # the number of calculated primes Y between start and end
Pneg = 0 # how many negative primes found als function value of the polynomial
for n in range(start , end + 1):
Y = n^2 - 79*n + 1601 # [0,79] from 41 to 1601 and back
#a# Y = n^2 - 9*n + 61 # [0,44] from 61 to 1601
#b# Y = 36*n^2 - 810*n + 2753 # [0,44] from 2753 to 36809
#c# Y = n^4 - 97*n^3 + 3294*n^2 - 45458*n + 213589 # [0,49] from 213589 to 247889
#d# Y = (n^5 - 133*n^4 + 6729*n^3 - 158379*n^2 + 1720294*n - 6823316) / 4 # [0,56] from - �
� 1705829 to 4325119
if is_prime(int(Y)): # Y within int() is necessary if non -integer coeff in f(n)
P += 1
a_list.append(Y)
if verbose:
print(str(n) + " -- " + str(Y) + " is prime ")
elif is_prime(abs(int(Y))): # Y within int() is necessary if non -integer coeff
Pneg += 1
if verbose:
print(str(n) + " -- " + str(Y) + " is NOT prime , but -n is prime ")
if countNegativePrimeToo:
P += 1
a_list.append(-Y)
else:
if verbose:
print(str(n) + " -- " + str(Y) + " is NOT prime ")

Punique = len(set(a_list))
a_range = end - start + 1
print (" Number of primes in f(n): %d in n-range (%d, %d) [n takes %d diff. values as both �
� borders are included ]" % (P, start ,end , end -start+1))
print (" Number of unique primes: %d in the list of found primes (%d)" % (Punique , len(a_list)) �
�)
print (" Percentage of primes: %.2f" % float ((P * 100) / a_range))
print (" Percentage of unique primes: %.2f" % float (( Punique * 100) / a_range))
if Pneg > 0:
if countNegativePrimeToo:
print (" Number of negative primes found: %d. These are counted as primes ." % Pneg)
else:
print (" Number of negative primes found: %d. These are not counted as primes ." % Pneg)

# quadratic_prime_formula (0, 50) # Get same output as in chap04 _sample040.sage , if #a# is �

� activated
# quadratic_prime_formula (0, 79, countNegativePrimeToo =True) # (0,60)
quadratic_prime_formula (0, 1000, verbose=False) # If verbose ==False , only the statistics are �
� printed.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 192 — #54

i i

192 Prime Numbers

SageMath Example 4.14 (continued)

#------------------------------------
# CHAP04 -- Sage -Script -SAMPLE 050: =========
# N -- N^2 - 79*N + 1601 .........
# Number of primes in f(n): 602 in n-range (0, 1000) [n takes 1001 diff. values as both borders �
� are included]
# Number of unique primes: 562 in the list of found primes (602)
# Percentage of primes: 60.14
# Percentage of unique primes: 56.14

References

[1] Great Internet Mersenne Prime Search (GIMPS), GIMPS, https://www.mersenne.org/

A large proportion of modern, asymmetric cryptography is based on mathemati-

cal knowledge—on the properties of integers, which are investigated in elementary
number theory. Here, the word “elementary” means that questions raised in
number theory are essentially rooted in the set of natural and whole numbers
(integers).
Further mathematical disciplines currently used in cryptography include (see
[1, p. 2; 2, p. 3]):

• Group theory;
• Combination theory;
• Complexity theory;
• Stochastic (ergodic theory);
• Information theory.

Number theory or arithmetic (the emphasis here is more on the aspect of per-
forming calculations with numbers) was established by Carl Friedrich Gauss as a
special mathematical discipline. Its elementary features include the greatest com-
mon divisor (gcd), congruences (remainder classes), factorization, the Euler-Fermat
theorem, and primitive roots. However, the most important aspect is prime numbers
and their multiplicative operation.
For a long time, number theory was considered to be the epitome of pure
research, the ideal example of research in the ivory tower. It delved into the

195

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 196 — #2

i i

196 Introduction to Elementary Number Theory with Examples

mysterious laws of the realm of numbers, giving rise to philosophical considera-

tions whether it described elements that exist everywhere in nature or whether it
artificially constructed elements (numbers, operators, and properties).
With the number-theoretical applications of modern cryptography, it became
clear that a discipline that had been regarded as purely theoretical for centuries was
now being applied in practice. Today, experts in the field are sought after on the
job market.
Applications in (computer) security now use cryptography because this math-
ematical discipline is simply better and easier to prove than all other creative
substitution procedures that have been developed over the course of time and bet-
ter than all sophisticated physical methods such as those used to print banknotes
[3, p. 4].
This chapter explains the basics of elementary number theory in a way that you
can easily understand. It provides numerous examples and very rarely goes into any
proofs (these can be found in mathematical textbooks).
The goal is not to exhaustively explain the number theory findings, but to show
the essential procedures. The scope of the material is oriented towards being able
and apply the RSA method in more detail.
For this purpose we will use both theory and examples to explain how to per-
form calculations in finite sets and describe how these techniques are applied in
cryptography. Particular attention will be paid to the traditional Diffie-Hellman
(DH) and RSA public-key procedures.
It was important to me to make verifiable statements about the security of
the RSA algorithm, and to add runnable Python or SageMath code for as many
examples as possible. SageMath is an open-source Python-based computer-algebra
system (CAS); see [4].

5.2 Introduction to Number Theory

Number theory studies positive integers 1, 2, 3, 4, · · · , also referred to as the set

of natural numbers N. These are the first mathematical constructs used by human
civilization. According to Leopold Kronecker, they are a creation of God. In Julius
Dedekind’s opinion, they are a creation of the human intellect. Dependent upon
one’s ideology, this is an unsolvable contradiction or one and the same thing.
In ancient times, no distinction was made between number theory and numerol-
ogy, which attributed a mystical significance to specific numbers. In the same
way as astronomy and chemistry gradually detached themselves from astrology
and alchemy during the Renaissance (from the 14th century), number theory also
separated itself from numerology.
Number theory has always been a source of fascination for both amateurs and
professional mathematicians. In contrast to other areas of mathematics, many of
the problems and theorems in number theory can be understood by nonexperts. On
the other hand, the solutions to these problems or the proof to the theorems often
resisted to the mathematicians for a very long time. It is therefore one thing to pose
good questions but quite another matter to find the answer. One example of this is
what is known as Fermat’s last theorem. One of the things we learn in mathematics

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 197 — #3

i i

5.2 Introduction to Number Theory 197

at school is Pythagoras’ theorem, which states for a right-angle triangle: a 2 + b2 =

c2 , where a and b are the real-valued lengths of the sides next to the right angle and
c is the length of the hypotenuse. Fermat famously proposed that a n + bn 6= cn for
a, b, c ∈ N and integer exponents n > 2. Unfortunately, the border of his book from
Diophant where he made the claim did not have enough space for him to prove it.
The theorem was not proven until over 300 years later [5, pp. 433–551]. The name
“last” got attached to it because it has been the last conjecture Fermat had made
that remained open. See Section 8.2.
Up until the mid-20th century, number theory was considered to be the purest
area of mathematics, an area that had no practical use in the real world. This
changed with the development of computers and digital communication, as num-
ber theory was able to provide several unexpected solutions to real-life tasks. At
the same time, advances in information technology allowed specialists in number
theory to make huge progress in factorizing large numbers, finding new prime num-
bers, testing (old) conjectures, and solving numerical problems that were previously
impossible to solve. Modern number theory is made up of areas such as:

• Elementary number theory

• Algebraic number theory
• Analytic number theory
• Geometric number theory
• Combinatorial number theory
• Numeric number theory
• Probability theory

All the different areas are concerned with questions regarding integers (both
positive and negative whole numbers plus zero). However, they each have different
methods to deal with them.
This chapter mainly deals with the area of elementary number theory.

5.2.1 Convention and Notation

Unless stated otherwise:

• The letters a, b, c, d, e, k, n, m, q are used to represent integers (whole num-

bers).
• The letters i and j represent natural numbers.
• The letter p always represents a prime number.
• The sets N = {1, 2, 3, · · · } and Z = {· · · , −3, −2, −1, 0, 1, 2, 3, · · · } are the
natural numbers and integers, respectively.
• Zn = {0, 1, 2, · · · , n − 2, n − 1} and Z∗p (= Z p \ {0}, where p is prime) are
finite sets with n or p − 1 elements.

Often instead of Zn the notation Z/nZ is used. However, we use the first nota-
tion here, as it is easier to write and as there is no danger of confusion with the

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 198 — #4

i i

198 Introduction to Elementary Number Theory with Examples

so-called p-adic numbers Z p for n = p. The * representation also exists for integers
(that is, compound n and not only prime p), see Definition 5.9.
The following list shows the SageMath commands to call the three impor-
tant structures Z, Zn , and Z∗n – these commands are applied in the SageMath
Example 5.1:
• Z: IntegerRing() or ZZ
• Zn : Integers(n) or Zmod(n) or IntegerModRing(n)
• Z∗n : [a for a in Integers(n) if gcd(a,n) == 1] or
Zmod(n).list_of_elements_of_multiplicative_group() or Zmod(n).
unit_group()

SageMath Example 5.1: Z, Zn , and Z∗n in SageMath

print ("\n# CHAP05 -- Sage -Script -SAMPLE 017: =========")

print (" Different ways in Sage to define Z, Z/nZ , and (Z/nZ)∗ :")

n = 10 # n = 17
print("- n: ", n, " type(n): ", type(n))

print ("1### Z ==> Sage: IntegerRing () = ZZ")

# Ring of all integers
#
R1 = IntegerRing (); e = R1(5)
print("- R1 = IntegerRing (): ", R1, type(R1))
print (" e: ", e, " ", type(e))
print (" R1.range(n): ", R1.range(n))

print("- xx IntegerRing ()==ZZ: ", IntegerRing ()==ZZ)

# print("- ZZ.range(0,50,5): ", ZZ.range(0,50,5)) # [0, 5, 10, 15, 20, �

� 25, 30, 35, 40, 45]
# a = ZZ('1234 ') # Alternative arguments for ZZ: a = ZZ(1234) or ZZ �
� ('0x4D2 ') because of
# i=1234; l=i.digits(base=16); l; j=ZZ(l,base=16); j; i �
� ==j
# [2, 13, 4] // 1234 // True
# print("- a:", a, " // type(a): ", type(a)) # <class 'sage.rings. �
� integer.Integer '>
# b = 1234
# print (" xx a==b:", a==b)

print ("2### quotient ring Z/nZ ==> Sage: Integers(n) = Zmod(n) = �

� IntegerModRing(n)")
# Ring of integers from 0 to n-1 or ring of integers modulo n = �
� additive group
#
R2 = Integers(n)
print("- R2 = Integers(n): ", R2)
print (" list(R2): ", list(R2))
print (" R2.order (): ", R2.order ())

R3 = Zmod(n)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 199 — #5

i i

5.3 Prime Numbers and the First Fundamental Theorem of Elementary Number Theory 199

SageMath Example 5.1 (continued)

print("- R3 = Zmod(n): ", R3)

print (" list(R3): ", list(R3)) # Alternative: L = [a for a in R3]; �
� print (" L: ", L)
print (" R3.order (): ", R3.order ())

R4 = IntegerModRing(n)
print("- R4 = IntegerModRing(n): ", R4)
print (" list(R4): ", list(R4))
print (" R4.order (): ", R4.order ())

print("- xx Integers(n)== Zmod(n): ", Integers(n)== Zmod(n), " // ",

"Zmod(n)== IntegerModRing(n): ", Zmod(n)== IntegerModRing(n))

# a = R4(5) # No alternative is: R4(5**(10^62)

# print("- a**(10^62): ", a**(10^62)) # calculating in the finite �
� ring is very quick!

print ("3### Multiplicative group (Z/nZ)∗ ==> Sage: further handle �

� Zmod(n) or Integers(n)")
#
# Return a list of all invertible elements (type of each is Sage int)
L1 = [a for a in R2 if gcd(a,n) == 1]
print("- L1 (via Integers(n)): ", L1); print (" type(L1[1]): ", type(L �
� 1[1]))
# print (" R2.multiplicative_table (): ", R2.multiplication_table ()) # �
� this works well
# print (" R2.multiplicative_order (): ", R2.multiplicative_order ()) # �
� no such attribute
# --> L1 has no attribute 'order '; L1, R2, R3, R3 have no attribute ' �
� multiplicative_order '
m=7 # m=4; multiplicative order of m is only defined if m is a unit �
� modulo n !
print (" R4(%d):" % m, R4(m), " R4(m).multiplicative_order (): ", R4(m) �
� .multiplicative_order ())

G3 = R3.unit_group ()
print("- G3 (via Zmod.unit_group): ", G3)
print (" list(G3): ", list(G3))
print (" G3.order (): ", G3.order ())

# Return a list of all invertible elements (type of each is Python int)

L2 = R3. list_of_elements_of_multiplicative_group ()
print("- L2 (via Zmod): ", L2); print (" type(L2[1]): ", type(L2[1]))

5.3 Prime Numbers and the First Fundamental Theorem of

Elementary Number Theory

Many of the problems in elementary number theory are concerned with prime
numbers (see Chapter 4).
Every integer has divisors or factors. The number 1 has just one—itself, whereas
the number 12 has the six factors 1, 2, 3, 4, 6, and 12.
The SageMath method divisors() gives a list of all divisors of a number n.
For instance of n = 12 in the SageMath Example 5.2.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 200 — #6

i i

200 Introduction to Elementary Number Theory with Examples

SageMath Example 5.2: Edit all Divisors of an Integer a and the Number of
Divisors τ (a )
sage: a=12; a.divisors (); number_of_divisors(a)
[1, 2, 3, 4, 6, 12]
6

Many numbers are only divisible by themselves and by 1. When it comes to

multiplication, these can be regarded as the atoms in the realm of numbers.
Definition 5.1 Prime numbers are natural numbers greater than 1 that can only be
divided by 1 and themselves.
By definition, 1 is not a prime number. Every integer is either prime, composite,
or 1.
If we write down the prime numbers in ascending order (prime number
sequence), then we get:

2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, 53, 59, 61, 67,
71, 73, 79, 83, 89, 97, · · ·

The first 100 numbers include precisely 25 prime numbers. After this, the
percentage of primes decreases, but never reaches zero. See Tables 4.10 and 4.11.
We come across integers that are prime fairly often. In the last decade of the
20th century, only three years were prime: 1993, 1997, and 1999. If they were rare,
cryptography would not be able to work with them to the extent it does.
Prime numbers can be factorized in a unique (trivial) way:

5=1·5
17 = 1 · 17
1013 = 1 · 1013
1296409 = 1 · 1296409

Definition 5.2 Natural numbers greater than 1 that are not prime are called
composite numbers. These have at least two factors other than 1.
The dissection of a number into its prime factors is called (complete) factor-
ization. Examples of the unique decomposition of composite numbers into prime
factors:
4=2·2
6=2·3
91 = 7 · 13
161 = 7 · 23
767 = 13 · 59
1029 = 3 · 73
5324 = 22 · 113

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 201 — #7

i i

5.4 Divisibility, Modulus and Remainder Classes 201

Theorem 5.1 Each composite number a has a lowest factor greater than 1. This
factor is a prime number p and is less than or equal to the square root of a.
All integers greater than 1 can be expressed as a product of prime numbers—in
a unique way.
This is the claim of the first fundamental theorem of number theory (= funda-
mental theorem of arithmetic = fundamental building block of all positive integers).
It was formulated precisely for the first time by Carl Friedrich Gauss in his
Disquisitiones Arithmeticae (1801).
Theorem 5.2 Gauss 1801 Every even natural number greater than 1 can be written
as the product of prime numbers. Given two such decompositions a = p1 · p2 · . . . ·
pn = q1 · q2 · . . . · qm , these can be resorted such that n = m and for all i, pi = qi .
In other words, each natural number other than 1 can be written as a product
of prime numbers in precisely one way (if we ignore the order of the factors). The
factors are therefore unique (the expression as a product of factors is unique).
For example, 60 = 2 · 2 · 3 · 5 = 22 · 3 · 5. And this—other than changing the
order of the factors—is the only way in which the number 60 can be factorized.
If you allow numbers other than primes as factors, there are several ways of
factorizing integers and the uniqueness is lost:

60 = 1 · 60 = 2 · 30 = 4 · 15 = 5 · 12 = 6 · 10 = 2 · 3 · 10 = 2 · 5 · 6 = 3 · 4 · 5 = · · ·

In mathematics one also studies sets of numbers where the factorization into
primes (or objects that have prime properties inside those sets) is not unique. An
example for this (see Theorem 4.2) and further details on prime numbers (e.g., how
Fermat’s little theorem can be used to test extremely large numbers to determine
whether they are prime) can be found in Chapter 4 of this book.

5.4 Divisibility, Modulus and Remainder Classes

There is a close connection between divisibility and congruences, which we will

explain here using several examples.

5.4.1 Divisibility
If integers are added, subtracted, or multiplied, the result is always another integer.
However, the division of two integers does not always result in an integer. For
example, if we divide 158 by 10 the result is the decimal number 15.8, which is not
an integer.
If, however, we divide 158 by 2 the result 79 is an integer. In number theory we
express this by saying that 158 is divisible by 2 but not by 10. In general, we say:
Definition 5.3 An integer n is divisible by another integer d if the quotient n /d is
an integer c such that n = c · d.
In other words: An integer d divides an integer n, if n = c · d for some c ∈ Z.
n is called a multiple of d, whereas d is called a divisor or factor of n.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 202 — #8

i i

202 Introduction to Elementary Number Theory with Examples

The mathematical notation for this is d|n (read “d divides n”). The notation
d6 |n means that d does not divide the number n. It is:

[ d | n ] ⇔ [ n is a multiple of d ]

In our example therefore: 106 |158 but 2|158.

We have several possibilities to check with SageMath if the integer d divides
the integer n. The most directly one is the method is_integer(). See SageMath
Example 5.3.

SageMath Example 5.3: Check to Find Out If a Variable or a Term Is Integer

sage: n=158; d1=10; d2=2
sage: n.is_integer (); n in ZZ
True
True
sage: d1.divides(n); (n % d1) == 0; (n / d1).is_integer (); (n / d1) in �
� ZZ
False
False
False
False
sage: d2.divides(n); (n % d2) == 0; (n / d2).is_integer (); (n / d2) in �
� ZZ
True
True
True
True

Two further important definitions are those of the greatest common divisor
(gcd) and the least common multiple (lcm) of two integers.
Definition 5.4 The gcd (a, b) is the largest integer dividing both a and b.
Definition 5.5 The lcm (a, b) is the smallest positive integer divisible by both a
and b.
Two numbers are called relatively prime or coprime, if their greatest common
divisor equals 1. For example, 9 = 3 · 3 and 28 = 2 · 2 · 7 are coprime. The following
equivalence holds:

[ gcd(a, b) = 1 ] ⇔ [ a and b are coprime.]

For coprimes a, b the following is true: lcm (a, b) = a · b.

The functions gcd and lcm are available in SageMath—see SageMath Exam-
ple 5.4.
For describing divisor relations for a set {a1 , . . . , an } of more than two elements,
one has to be careful:
• a1 , a2 , . . . , an are relatively prime, if gcd(a1 , . . . , an ) = 1. Here the
gcd(a1 , . . . , an ) has to be computed stepwise by, for example, computing
gcd(a1 , a2 ) = g1 first and then gcd(g1 , a3 ) = g2 , and so forth, and finally
gcd(gn−2 , an ).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 203 — #9

i i

5.4 Divisibility, Modulus and Remainder Classes 203

• An even stronger request for more than two numbers is:

a1 , . . . , an are pairwise relatively prime, if for all i = 1, . . . , n and j =
1, . . . , n with i 6= j: gcd(ai , a j ) = 1.

Example:
2, 3, 6 are relatively prime, because gcd(2, 3, 6) = 1. They are not pairwise relatively
prime because gcd(2, 6) = 2 > 1.

SageMath Example 5.4: Calculate gcd and lcm

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 205 — #11

i i

5.4 Divisibility, Modulus and Remainder Classes 205

Therefore: If m divides the difference, there exists an integer q such that:

a = b + q · m. As an alternative to the congruence notation, we can also use the
divisibility notation: m|(a − b).
Remark:
This equivalence does apply only to the difference (a−b), but not to the sum (a + b).
Example: 11 ≡ 2 (mod 3), therefore 11 − 2 = 9 ≡ 0 (mod 3); but 11 + 2 = 13 is not
divisible by 3. The statement in Theorem 5.3 does not even apply to sums in one
direction. It is correct for sums only if the remainder is 0 and only in the following
direction: If a divisor divides both summands with no remainder, it also divides the
sum with no remainder.
Example of equivalent statements:
35 ≡ 11 (mod 3) ⇐⇒ 35 − 11 ≡ 0 (mod 3)), where 35 − 11 = 24 is divisible by 3
without remainder while 35/3 and 11/3 leave the remainder 2.

SageMath Example 5.6: Division With and Without Remainder

sage: 10/4
5/2
sage: 10//4 # for integer arguments , "//" returns the integer �
� quotient
2
sage: 10 % 3 # for integer arguments , "%" means mod , i.e., remainder
1
#
sage: 11//3
3
sage: 11 % 3
2
#
sage: int(11/3)
3
sage: type(int(11/3))
<class 'int '>
#
sage: type(11/3)
<class 'sage.rings.rational.Rational '>
sage: type(11//3)
<class 'sage.rings.integer.Integer '>
sage: type(11%3)
<class 'sage.rings.integer.Integer '>

We can apply the equivalence in Theorem 5.3 if we need a quick and easy
method for determining whether large numbers are divisible by a certain number.
Example: Is 69993 divisible by 7?
The number can be written in the form of a difference in which it is clear that each
operand is divisible by 7: 69993 = 70000 − 7. Therefore, the difference is also
divisible by 7.
Although these considerations and definitions may seem to be rather theoret-
ical, we are so familiar with them in everyday life that we no longer think about

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 206 — #12

i i

206 Introduction to Elementary Number Theory with Examples

the formal procedure. For example, the 24 hours on a clock are represented by
the numbers 1, 2, · · · , 12. We obtain the hours after 12 noon as the remainder of a
division by 12 and know immediately that 2 o’clock in the afternoon is the same as
14:00.
The modular arithmetic (based on division remainders) forms the basis of asym-
metric encryption procedures. Cryptographic calculations are therefore not based
on real numbers like the calculations mostly performed at school, but rather on
number sets with a limited length (finite sets), like on positive integers that cannot
exceed a certain value.
So we choose a large number m and calculate modulo m. That is, we ignore
integer multiples of m and, rather than working with a number, we only work with
the remainder when this number is divided by m. The result is that all results are in
the range 0 to m − 1.
Since m is really large in practice, the set is also significantly larger than in our
examples and cannot be completely stored in the computer’s memory. But it has the
advantages and properties of modular computing.

5.5 Calculations with Finite Sets

Here we consider congruences, which are modulo relations between integers.

Congruences form a special equivalence relation (i.e., the relation is reflexive,
symmetric, and transitive). From algebra, it follows that essential parts of the con-
ventional calculation rules are kept to when we proceed to modular calculations
over a basic set Z. For example, addition remains commutative. The same goes for
multiplication modulo m.

5.5.1 Laws of Modular Calculations

The known laws apply:

1. Associative law
((a + b) + c) (mod m ) ≡ (a + (b + c)) (mod m )
((a · b) · c) (mod m ) ≡ (a · (b · c)) (mod m )
2. Commutative law
(a + b) (mod m ) ≡ (b + a ) (mod m )
(a · b) (mod m ) ≡ (b · a ) (mod m )

The associative law and the commutative law apply to both addition and
multiplication.

3. Distributive law
(a · (b + c)) (mod m ) ≡ (a · b + a · c) (mod m )
4. Reducibility
(a + b) (mod m ) ≡ (a (mod m ) + b (mod m )) (mod m )
(a · b) (mod m ) ≡ (a (mod m ) · b (mod m )) (mod m )
When adding or multiplying the order in which the modulo operation is
performed does not matter.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 207 — #13

i i

5.6 Examples of Modular Calculations 207

5. Existence of an identity (neutral element)

(a + 0) (mod m ) ≡ (0 + a ) (mod m ) ≡ a (mod m )
(a · 1) (mod m ) ≡ (1 · a ) (mod m ) ≡ a (mod m )
6. Existence of an inverse element
• Additive inverse
For all integers a and m there exists another integer −a such that:
(a + (−a )) (mod m ) ≡ 0 (mod m )
• Multiplicative inverse modulo a prime p
For each integer a (with a 6≡ 0 (mod p )) and p prime) there exists an
integer a −1 such that: (a · a −1 ) (mod p ) ≡ 1 (mod p )
• Multiplicative inverse modulo a compound number m
For all integers a and m (with a 6≡ 0 (mod m )) and gcd (a, m ) = 1)
there exists an integer a −1 such that: (a · a −1 ) (mod m ) ≡ 1 (mod m )
7. Closure
a, b ∈ G =⇒ (a + b) ∈ G
a, b ∈ G =⇒ (a · b) ∈ G
More on the topic of closure can be found in Section 5.7.
8. Transitivity
[a ≡ b (mod m ) ∧ b ≡ c (mod m )] =⇒ [a ≡ c (mod m )]
9. Modular division
For k, x in {1, . . . , m} with gcd(k, m ) = 1 and arbitrary x the division of
x by k is just the multiplication of x with the (existing) multiplicative inverse
of k:
x : k (mod m ) = x · k −1 (mod m )

If k −1 does not exist, the term x : k is not defined. See also Table 5.3.

5.5.2 Patterns and Structures (Part 1)

In general, mathematicians investigate structures. They ask, for example, at a·x ≡ b
mod m, which values x can take for given values of a, b, m.
In particular the case is investigated where the result b of this operation is the
neutral element. Then x is the (multiplicative) inverse of a regarding this operation.

5.6 Examples of Modular Calculations

As we have already seen:

• For two natural numbers a and m, a mod m denotes the remainder obtained
when we divide a by m. This means that a (mod m ) is always a number
between 0 and m − 1.
• For example, 1 ≡ 6 ≡ 41 (mod 5) because the remainder is always 1.
Another example is: 2000 ≡ 0 (mod 4) because 4 divides 2000 with no
remainder.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 208 — #14

i i

208 Introduction to Elementary Number Theory with Examples

• Modular arithmetic only works on a limited quantity of nonnegative num-

bers. The number of these is specified by the modulus m. If, for example, the
modulus is m = 5, then only the 5 numbers in the set {0, 1, 2, 3, 4} are used.
• A calculation result larger than 4 is then reduced modulo 5. In other words,
it is the remainder when the result is divided by 5. For example, 2 · 4 ≡ 8 ≡ 3
(mod 5) because 3 is the remainder when we divide 8 by 5.

5.6.1 Addition and Multiplication

The following shows two tables:
• The addition table for mod 5 (Table 5.1);
• The multiplication tables for mod 5 (Table 5.2) and mod 6 (Table 5.3).
Those tables were generated with SageMath; see SageMath Example 5.14 for
the source code.

Example of an Addition Table.

The result when we add 3 and 4 (mod 5) is determined as follows: Calculate 3+4 =
7 and keep subtracting 5 from the result until the result is less than the modulo:
7 − 5 = 2. Therefore: 3 + 4 ≡ 2 (mod 5).

Example of a Multiplication Table.

The result of the multiplication 4 · 4 (mod 5) is determined as follows: Calculate
4 · 4 = 16 and subtract 5 until the result is less than the modulus.

16 − 5 = 11; 11 − 5 = 6; 6 − 5 = 1

Table 5.2 directly shows that 4 · 4 ≡ 1 (mod 5) because 16/5 = 3 remainder 1.

Note that the multiplication is defined on the set Z excluding 0 (as 0 ·x is always
0, and 0 has no inverse).

5.6.2 Additive and Multiplicative Inverses

You can use the tables to read the inverses for each number in relation to addition
and multiplication.
The inverse of a number is the number that gives the result 0 when the two
numbers are added, and 1 when they are multiplied (i.e., as a result the neutral
element for the respective operation). Thus, the inverse of 4 for addition mod 5 is
1, and the inverse of 4 for multiplication mod 5 is 4 itself, because

4 + 1 = 5 ≡ 0 (mod 5);
4 · 4 = 16 ≡ 1 (mod 5).

Table 5.1 Addition Table Modulo 5

+ 0 1 2 3 4
0 0 1 2 3 4
1 1 2 3 4 0
2 2 3 4 0 1
3 3 4 0 1 2
4 4 0 1 2 3

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 209 — #15

i i

5.6 Examples of Modular Calculations 209

Table 5.2 Multiplication Table Modulo 5:

Operations Table of (Z∗5 , ·)
· 1 2 3 4
1 1 2 3 4
2 2 4 1 3
3 3 1 4 2
4 4 3 2 1

Table 5.3 Multiplication Table Modulo 6: Operations Table

of (Z6 \ {0}, ·)
* 1 2 3 4 5
1 1 2 3 4 5
2 2 4 0 2 4
3 3 0 3 0 3
4 4 2 0 4 2
5 5 4 3 2 1

The inverse of 1 for multiplication mod 5 is 1, while the inverse modulo 5 of 2

is 3 and, since multiplication is commutative, the inverse of 3 is again 2.
If we take a random number (here 2) and add or multiply another number (here
4) and then add or multiply the corresponding inverse of the other number (1 or 4)
to the interim result (1 or 3), then the end result is the same as the initial value.

Example:

2 + 4 ≡ 6 ≡ 1 (mod 5); 1 + 1 ≡ 2 ≡ 2 (mod 5)

2 · 4 ≡ 8 ≡ 3 (mod 5); 3 · 4 ≡ 12 ≡ 2 (mod 5)

In the set Z5 = {0, 1, 2, 3, 4} for the addition, and in the set Z∗5 = Z5 \ {0} for
the multiplication, all numbers have a unique inverse modulo 5.
In the case of modular addition, this is true for every integer used as modulus
(not just for 5).
However, this is not the case for modular multiplication (important theorem):

In modular arithmetic, raising to the power is defined as repeated multiplication—
which is standard. With small exceptions we can even apply the usual rules, such as:

a b+c = a b · a c
(a b )c = a b·c = a c·b = (a c )b

Modular powers work in the same way as modular addition and modular
multiplication:
32 = 9 ≡ 4 (mod 5)

Even consecutive powers work in the same way:

Example 1:
(43 )2 = 642 ≡ 4096 ≡ 1 (mod 5)

1. We can speed up the calculation by reducing the interim results modulo 5,

but we need to take care because not everything will then work in the same
way as in standard arithmetic.

(43 )2 ≡ (43 (mod 5))2 (mod 5)

≡ (64 (mod 5))2 (mod 5)

Table 5.6 Multiplication Table Modulo 12 for a = 5 and a = 6

i⇒ 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18
5·i 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90
remainder 5 10 3 8 1 6 11 4 9 2 7 0 5 10 3 8 1 6
6·i 6 12 18 24 30 36 42 48 54 60 66 72 78 84 90 96 102 108
remainder 6 0 6 0 6 0 6 0 6 0 6 0 6 0 6 0 6 0

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 212 — #18

i i

212 Introduction to Elementary Number Theory with Examples

≡ 42 (mod 5)
≡ 16 ≡ 1 (mod 5)

Remark: The time required to calculate the multiplication of two numbers

normally depends on the length of the numbers. We can observe this if we use
the school method to calculate, for instance, 474 · 228. The time required
increases in a quadratic square manner because we need to multiply 3 · 3
numbers. The numbers become considerably smaller if we reduce the interim
result.
2. In standard arithmetic, consecutive powers can be reduced to a single power
by multiplying the exponents:

(43 )2 = 43·2 = 46 = 4096.

In modular arithmetic let’s try what happens if we substitute 3 · 2 (mod 5)

for the product of the exponents 3 · 2:
(mod 5) (mod 5)
(43 )2 ≡ 43·2 ≡ 46 ≡ 41 ≡ 4 (mod 5)

But as we saw above, the correct result is 1.

3. Therefore, the rule is slightly different for consecutive powers in modular
arithmetic: We do not multiply the exponents in (mod m) but rather in (mod
φ (m )).
Using φ (5) = 4 gives:
(mod φ (5)) (mod 4)
(43 )2 ≡ 43 · 2 ≡ 46 ≡ 42 ≡ 16 ≡ 1 (mod 5)

This delivers the correct result for m = 5, but there are cases where it can’t
be done like that. For example if m = 12 we have φ (m ) = 4. The element 2
divides m, and if we calculate the 9th power, we get 29 (mod 12) = 8, but
(mod φ (12)) (mod 4)
29 = 29 = 21 = 2 6= 8.

Theorem 5.6 Reduction in the exponent mod φ (m )

Let gcd(a, m ) = 1. Then
(mod φ (m ))
(a b )c ≡ a b·c (mod m ).

This is a consequence of the theorem of Euler and Fermat (see Theorem 5.13).
Assume bc = r + kφ (m ) with r < m and r, k ∈ N0 , then

a bc = a r · a k·φ (m ) = a r · ( a| φ{z
(m ) k
} ) ≡a
r
(mod m ).
≡1(mod m )
Example 2:
(mod 10)
328 = 34 · 7 ≡ 34 · 7 ≡ 38 ≡ 6561 ≡ 5 (mod 11)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 213 — #19

i i

5.6 Examples of Modular Calculations 213

5.6.4 Fast Calculation of High Powers (Square and Multiply)

RSA encryption and decryption are famous examples where calculating high powers
modulo m is needed (see Sections 5.10 and 5.14). For example, the calculation 1005
(mod 3) exceeds the 32-bit long integer number range if we calculate a n by actually
multiplying a with itself n times.
Remark: A 32-bit computer architecture refers to computer systems where
all major system components like processor and memory can operate on data
in 32-bit (4 byte) units—within registers. A 32-bit register can store 232 differ-
ent values. If an integer is represented as unsigned binary number, the range is
0 through 4294967295 = 232 − 1. Modern operating systems support 64 bits.
Such a register can hold any of 264 (over 18 quintillion = 1.8 · 1019 ) different val-
ues. Representing an integer as unsigned binary number, the range is 0 through
18446744073709551615 = 264 − 1.
In case of extremely large numbers, even a fast computer would take longer
than the age of the universe to calculate a single exponentiation. Luckily, there is an
extremely effective shortcut for calculating exponentiations (but not for calculating
logarithms).
If the expression is divided differently using the rules of modular arithmetic,
then the calculation does not even exceed the 16-bit short integer number range:

(a 5 ) ≡ (((a 2 (mod m ))2 (mod m )) · a ) (mod m ) cause 510 = 1012

We can generalize this by representing the exponent as a binary number. For exam-
ple, the naive method would require 36 multiplications in order to calculate a n for
n = 37. However, if we write n in the binary representation as 100101 = 1 · 25 + 1 ·
5 2 0 5 2
22 + 1 · 20 , then we can rewrite the expression as: a 37 = a 2 +2 +2 = a 2 · a 2 · a 1 .

Example 3: 8743 (mod 103)

Since 43 = 32 + 8 + 2 + 1 , 103 is prime, 43 < φ (103), and the squares (mod 103)
can be calculated beforehand

872 ≡ 50 (mod 103),

874 ≡ 502 ≡ 28 (mod 103),
878 ≡ 282 ≡ 63 (mod 103),
8716 ≡ 632 ≡ 55 (mod 103),
8732 ≡ 552 ≡ 38 (mod 103),
we have:
8743 ≡ 8732+8+2+1 (mod 103),
≡ 8732 · 878 · 872 · 87 (mod 103),
≡ 38 · 63 · 50 · 87 ≡ 85 (mod 103).

The powers (a 2 )k can be determined easily by means of repeated squaring.

As long as a does not change, a computer can calculate them beforehand and—if
enough memory is available—save them. In order to then find a n in each individual

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 214 — #20

i i

214 Introduction to Elementary Number Theory with Examples

case, it now only needs to multiply those (a 2 )k for which there is a one in the kth
position of the binary representation of n. The typical effort is then reduced from
2600 to 2 · 600 multiplications! This frequently used algorithm is called square and
multiply.
SageMath Example 5.8 contains source code implementing the square-and-
multiply method in SageMath manually. It outputs the intermediate results, so you
can reproduce the calculations above. See also Section 5.17.2 for a sample using
the function power_mod built in SageMath.

5.6.5 Roots and Logarithms

Instead of computing the value of a power for a given basis and a given exponent
modulo m, we can try to find a fitting exponent for a given value and a given basis
(logarithm) or try to find a fitting basis for a given value and a given exponent (nth
root).
The roots and logarithms are again integers. Yet in contrast to the usual sit-
uation, they are not only difficult to calculate but, in the case of large numbers,
cannot be calculated at all within a reasonable amount of time.
Let us take the equation: a ≡ bc (mod m ).
a. Existence and Uniqueness:
If we restrict the numbers a, b, c to be elements of the set {0, 1, . . . , m − 1}
and m > 2 a natural number, then
– x ≡ bc (mod m ) for b, c, m is always well-defined and has a unique
solution for x (trivial),
– a ≡ x c (mod m ) for a, c, m is not always solvable, and if it is solvable,
it is not always uniquely solvable, for example, x 2 ≡ 2 (mod 15) has no
solution while x 2 ≡ 4 (mod 15) has four different solutions 2, 7, 8, 13.
– a ≡ b x (mod m ) for a, b, m is not always solvable, and if it is solvable,
it is not always uniquely solvable, for example, 2x ≡ 5 (mod 15) has no
solution while 2x ≡ 1 (mod 15) has solutions x = 4, 8, 12, . . . .
b. Taking the Logarithm (Determining c); Discrete Logarithm Problem:
If we know a and b of the three numbers a, b, and c that meet this equation,
then every known method of finding c (if it exists) is approximately just as
time-consuming as trying out all m possible values one after the other. For
a typical m of the order of magnitude of 10180 (600-digit binary number),
this is a hopeless task. Further details about the discrete logarithm problem
can be found in Section 6.4. More on the complexity of this problem can
be found in Sections 5.12.1 and 12.1.
c. Calculating the Root; Determining b:
The situation is similar if b is the unknown variable, and we know the values
of a and c. Here we use the Euler function (see Section 5.8.2). If we know
the value of the Euler function φ (m ) and gcd(φ (m ), c) = 1 is true, then
there exists a unique root b: For a given c we can easily calculate d with
c · d ≡ 1 (mod φ (m )) and use Theorem 5.6 to obtain
(mod φ (m ))
a d ≡ (bc )d ≡ bc·d ≡ bc·d ≡ b1 ≡ b (mod m )

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 215 — #21

i i

5.7 Groups and Modular Arithmetic in Zn and Z∗n 215

where b is the c-th root of a.

If φ (m ) cannot be determined in a reasonable amount of time it is diffi-
cult to calculate the cth root. According to the first fundamental theorem of
number theory and Theorem 5.11, we can determine φ (m ) if we know the
prime factors of m, but in real world settings those prime factors cannot be
found quickly. This forms the basis for the security assumption used by the
RSA encryption system (see Sections 5.10, 5.12.1, and 6.3.1).
Both logarithm and nth root—if well-defined and unique—can be seen as
inverse operations of exponentiation. Compared with inverting exponentiation, the
time required for inverting addition and multiplication is simply proportional to
log m or (log m )2 . Power functions (x 7→ x k with k fixed) and exponential func-
tions (x 7→ k x with k fixed) are therefore typical one-way functions (compare
Sections 5.13.1 and 6.1).

5.7 Groups and Modular Arithmetic in Zn and Z∗n

Mathematical groups play a decisive role in number theory and cryptography. We

only talk of groups if, for a defined set and a defined relation (an operation such as
addition or multiplication), the following properties are fulfilled:
• The set is closed;
• A neutral element exists;
• An inverse element exists for each element;
• The associative law applies.
The abbreviated mathematical notation is (G, +) or (G, ∗).
After this somewhat more formal definition, we work again in Zn (compare
Section 5.5).
Definition 5.8 Zn :

Zn comprises all numbers from 0 to n − 1 : Zn = {0, 1, 2, . . . , n − 2, n − 1}.

Zn is an often used finite group of the natural numbers. It is sometimes also

called the remainder set R modulo n. The remainder set and the reduced remainder
set (resulting from multiplication) must not be confused with the remainder class
(see Definition 5.6).
For example, today’s 64-bit computers directly work only with integers in a
finite set, that is the value range 0, 1, 2, . . . , 264 − 1. Since 2003, 64-bit processors
have been introduced. A 64-bit register can represent 264 ≈ 1.8 · 1019 different
integer values.
This value range is equivalent to the set Z264 .

5.7.1 Addition in a Group

If we define the operation mod+ on such a set where
a mod + b := (a + b) (mod n ),

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 216 — #22

i i

216 Introduction to Elementary Number Theory with Examples

then the set Zn together with the relation mod+ is a group because the following
properties of a group are valid for all elements in Zn :

• a mod + b is an element of Zn . The set is closed.

• There is a neutral element (the 0).
• Each element a ∈ Zn has an inverse for this operation, namely n − a
[because a mod + (n − a ) ≡ a + (n − a )(mod n ) ≡ n ≡ 0(mod n )].
• (a mod + b) mod + c ≡ a mod + (b mod + c).
mod+ is associative.

Since additionally, the operation is commutative; that is, (a mod + b) =

(b mod + a ), this structure is actually a commutative group.

5.7.2 Multiplication in a Group

If we define the operation mod* on the set Zn where

a mod* b := (a · b) (mod n ),

then Zn together with this operation is usually not a group because not all properties
are fulfilled for each n.
Example:

a. In Z15 , for example, the element 5 does not have an inverse. That is to say,
there is no a with 5 · a ≡ 1 (mod 15). Each modulo product with 5 on this
set gives 5, 10, or 0.
b. In Z55 \ {0}, for example, the elements 5 and 11 do not have multiplicative
inverses. That is to say, there is no a ∈ Z55 such that 5 · a ≡ 1 (mod 55) and
no a such that 11·a ≡ 1 (mod 55). This is because 5 and 11 are not relatively
prime to 55. Each modulo product with 5 on this set gives 5, 10, 15, . . . , 50
or 0. Each modulo product with 11 on this set gives 11, 22, 33, 44, or 0.

On the other hand, there are subsets of Zn that form a group with the operation
mod*. If we choose all elements in Zn that are relatively prime to n, then this set
forms a group with the operation mod*. We call this set Z∗n .

Definition 5.9 Z∗n :

Z∗n = {a ∈ Zn gcd(a, n ) = 1}.

Z∗n is sometimes also called the reduced remainder set R 0 modulo n.

Example: For n = 10 = 2 · 5, the following applies:

– Full remainder set R = Zn = {0, 1, 2, 3, 4, 5, 6, 7, 8, 9};

– Reduced remainder set R 0 = Z∗n = {1, 3, 7, 9} −→ φ (n ) = 4.

SageMath Example 5.7 calculates the residue set R 0 and the Euler φ function
of n = 10.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 217 — #23

i i

5.8 Euler Function, Fermat’s Little Theorem, and Euler-Fermat 217

SageMath Example 5.7: Calculate Residue System R 0 of 10 with φ (10)

sage: n=10; R1=n.coprime_integers(n); [i for i in R1]
....: eu1=euler_phi(n); eu1
[1, 3, 7, 9]
4

Comment:
R 0 or Z∗n is always a genuine subset of R or Zn because 0 is always an element of R
but never an element of R 0 . Since 1 and n − 1 are always relatively prime to n, they
are always elements of both sets.
If we select a random element in Z∗n and multiply it by every other element in
Zn , then the products are all in Z∗n . This is due to the fact that Z∗n is closed with
∗

respect to the multiplication and due to the gcd property:

[a, b ∈ Z∗n ] ⇒ [((a · b) (mod n ))) ∈ Z∗n ], more precisely:
[a, b ∈ Z∗n ] ⇒ [gcd(a, n ) = 1, gcd(b, n ) = 1] ⇒ [gcd(a · b, n ) = 1] ⇒ [((a · b)
(mod n )) ∈ Z∗n ].
Those products also induce a unique permutation on the elements in Z∗n . Since
1 is always an element of Z∗n , there is a unique partner in this set such that the
product is 1. In other words:
Theorem 5.7 Each element in Z∗n has a multiplicative inverse.
Example:
For a = 3 modulo n with n = 10 and Z∗n = {1, 3, 7, 9}, we have that a −1 = 7 and
multiplying a = 3 by any other number in Z∗n gives a permutation of the values
in Z∗n :

3 ≡ 3 · 1 (mod 10)
9 ≡ 3 · 3 (mod 10)
1 ≡ 3 · 7 (mod 10)
7 ≡ 3 · 9 (mod 10)

The unique invertibility is an essential condition for cryptography (see

Section 5.10).

5.8 Euler Function, Fermat’s Little Theorem, and Euler-Fermat

Euler’s phi function is an important function in number theory. Likewise, the Euler-
Fermat theorem is of great importance for RSA.

5.8.1 Patterns and Structures (Part 2)

As mathematicians investigate the structure a · x ≡ b (mod m ) (see Section 5.5.2),
they also look at the structure x a ≡ b (mod m ).
Again here they are interested in the cases, if b = 1 (then x is the multi-
plicative inverse of a) and if b = x (then the function f (x ) = x a (mod m ) has

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 218 — #24

i i

218 Introduction to Elementary Number Theory with Examples

a fixpoint). Compare the multiplicative order in Section 5.9 and the RSA fixed
points in Section 5.17.7.

5.8.2 The Euler Phi Function

Given n, then the number of numbers from the set {1, . . . , n − 1} that are relatively
prime to n is equal to the value of the Euler function φ (n ).
Definition 5.10 The Euler phi function φ (n ) specifies the number of elements in
Z∗n : φ (n ) = |Z∗n |
Compare this definition with Definition 5.9 and eventually read the explanations
about the Euler function φ (n ) in Section 6.3.1.
The Euler phi function sometimes is also written as Φ(n ) or phi(n ).
The number of these elements in the group is also called its cardinality or the
order of the group.
φ (n ) can be calculated very easily if we know the prime factors of n.
Theorem 5.8 For each prime number p holds: φ ( p ) = p − 1.
Theorem 5.9 For the product of two distinct primes p and q, the following is true:

φ ( p · q ) = ( p − 1) · (q − 1) or φ ( p · q ) = φ ( p ) · φ (q ).

This case is important for the RSA procedure.

Theorem 5.10 If n = p1 · p2 ·. . .· pk where p1 to pk are distinct prime numbers (i.e.,
no factor occurs more than once), then the following is true (as a generalization of
Theorem 5.9):
φ (n ) = ( p1 − 1) · ( p2 − 1) · . . . · ( pk − 1).

Theorem 5.11 In general, the following is true for every prime number p and every
n in N:
1. φ ( p n ) = p n−1 · ( p − 1).
2. If n = p1e1 · p2e2 · . . . · pkek , where p1 to pk are distinct prime numbers, then:

φ (n ) = [( p1e1 −1 )·( p1 −1)]·. . .·[( pkek −1 )·( pk −1)] = n·([( p1 −1)/ p1 ]·. . .·[( pk −1)/ pk ]).

Example:
• n = 70 = 2 · 5 · 7 ⇒ using Theorem 5.10: φ (n ) = 1 · 4 · 6 = 24.
• n = 9 = 32 ⇒ using Theorem 5.11: φ (n ) = 31 · 2 = 6, because Z∗9 =
{1, 2, 4, 5, 7, 8}.
• n = 2701125 = 32 · 53 · 74 ⇒ using Theorem 5.11:

φ (n ) = [31 · 2] · [52 · 4] · [73 · 6] = 1234800.

Remark: Number-Theoretic Functions in CT2

The Euler phi function is just one of several number-theoretic functions or statis-
tics used. In CT2 you can get an overview and a quick comparison for different

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 219 — #25

i i

5.8 Euler Function, Fermat’s Little Theorem, and Euler-Fermat 219

numbers. In Figure 5.1 the phi function is highlighted for the number 24.
Navigate to there in CT2 from CT2 Crypto Tutorials F World of Primes F
Distribution of primes F Number line.

5.8.3 The Theorem of Euler-Fermat

In order to prove the RSA procedure, we need Fermat’s little theorem and its
generalization (Euler-Fermat theorem).

Theorem 5.12 Fermat’s Little Theorem

Let p be a prime number and a be a random integer, then:

ap ≡ a (mod p)

An alternative formulation of Fermat’s little theorem is as follows: Let p be a

prime number and a be a random integer that is relatively prime to p, then:

a p−1 ≡ 1 (mod p )

Figure 5.1 Number-theoretic functions in CT2.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 220 — #26

i i

220 Introduction to Elementary Number Theory with Examples

Because if a and p are relatively prime (or coprime), an inverse a −1 (mod p ) always
exists. Multiply the first congruence with a −1 (mod p ) from the left as well as from
the right and the second congruence follows.
See also Section 4.5. Theorem 4.5 corresponds to Theorem 5.12 here.
Theorem 5.13 Euler-Fermat theorem (generalization of fermat’s little theorem) For
all elements a in the group Z∗n (i.e., a and n are natural numbers that are coprime):

a φ (n ) ≡ 1 (mod n )

This theorem states that if we raise a group element (here a) to the power
of the order of the group (here φ (n )), we always obtain the neutral element for
multiplication (the number 1).
See for example [8, S. 94 ff] in the literature for a proof of this theorem.
The second formulation of Fermat’s little theorem is derived directly from
Euler’s theorem if n is a prime number.
If n is not a prime number, then in most cases there do not exist primitive roots
modulo n and the exponent φ (n ) in Theorem 5.13 is not sharp; that is, can be
replaced by a proper divisor of φ (n ). The following formulation of the theorem is
taken from an unpublished handout of Professor Geyer; see [9]. Alternatively this
can be found in the famous classic of Hardy and Wright [10], on page 63 ff. There
you can also find the proofs.
Theorem 5.14 Sharper Euler-Fermat theorem Let n not be divisible by 8 and not
of the form 2u with u ≡ 1 mod 2.
1. If n = pr is a prime power, then there does exist a primitive root modulo n
with order φ (n ) = pr −1 ( p − 1) and the exponent φ (n ) in 5.13 can not be
replaced by a smaller one.
2. If n is not a prime power (and not a prime), then there exists no primitive
root modulo n of order φ (n ). If

n = p1α1 · p2α2 · · · · · prαr (r > 1)

is the prime factorization of n, then the multiplicative order of a residue

modulo n is always a divisor of the least common multiple

(n ) = lcm(φ ( p1α1 ), φ ( p2α2 ), . . . , φ ( prαr ))

and (n ) occurs as the order of some residue. Therefore, the Euler-Fermat

theorem can be improved to

a (n ) ≡ 1 mod n

for a, n coprime.
Remarks:
1. For numbers divisible by 8, (n ) must be divided by 2 for getting the best
exponent because a 2 ≡ 1 mod 8 for all odd a.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 221 — #27

i i

5.8 Euler Function, Fermat’s Little Theorem, and Euler-Fermat 221

2. The condition that the number n is not of the form 2u is needed because for
∼ Z∗ .
n = 2u with odd u we have Z∗2u = u
3. The SageMath Example 5.28 shows how such an element of maximal order
can be constructed.

If n is the product of two prime numbers, we can—in certain cases—use

Euler’s theorem to calculate the result of a modular power very quickly. We have:
a ( p−1)·(q−1) ≡ 1 (mod pq ).

Examples for calculating a modular power:

• What is 52 (mod 6)?

With 2 = 1·2 and 6 = 2·3 where 2 and 3 are both prime; φ (6) = 2 because
only 1 and 5 are relatively prime to 6, we obtain the equation 52 ≡ 5φ (6) ≡ 1
(mod 6), without having to calculate the power.
• What is 31792 (mod 851)?
With 792 = 22 · 36 and 23 · 37 = 851 where 23 and 37 are both prime, it
follows for 31 ∈ Z∗851 that 31792 ≡ 31φ (23·37) ≡ 31φ (851) ≡ 1 (mod 851).

5.8.4 Calculation of the Multiplicative Inverse

Another interesting application is a special case of determining the multiplica-
tive inverses using the Euler-Fermat theorem (multiplicative inverses are otherwise
determined using the extended Euclidean algorithm).

Example:
Find the multiplicative inverse of 1579 modulo 7351.
According to Euler-Fermat: a φ (n ) = 1 (mod n ) for all a in Z∗n . If we divide both
sides by a, we get: a φ (n )−1 ≡ a −1 (mod n ). For the special case that the modulus is
prime, we have φ (n ) = p − 1. Therefore, the modular inverse is

a −1 = a φ (n )−1 ≡ a ( p−1)−1 ≡ a p−2 (mod p).

For our example, this means:

Since the modulus 7351 is prime, p − 2 = 7349.

1579−1 ≡ 15797349 (mod p).

By cleverly breaking down the exponent, we can calculate this power relatively easy
(see Section 5.6.4):

7349 = 4096 + 2048 + 1024 + 128 + 32 + 16 + 4 + 1

1579−1 ≡ 4716 (mod 7351)

SageMath Example 5.8 contains source code implementing the square-and-

multiply method in SageMath manually. It outputs the intermediate results, so you
can reproduce the calculations above.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 222 — #28

i i

222 Introduction to Elementary Number Theory with Examples

SageMath Example 5.8: Square and Multiply Done Manually in SageMath

print ("\n# CHAP05 -- Sage -Script -SAMPLE 013: =========")

print (" Square & multiply , applied to 2 samples :")

print("- Sample from 5.6.4 just quickly calculates the power 87^43 mod 103.")
print("- Sample from 5.8.4 calculates multiplicative inverse quickly in a special case , where �
� exponent e = p-2.")
print (" a=1579; p = 7351; a^(-1) = a^(p-2) mod p")
print (" So a^(-1) = a^7349 mod 7351; and 7349=4096+2048+1024+128+32+16+4+1")

### Choose one of the two following examples:

### Sample from 5.6.4 ###
# p=103 ; R=Integers(p); a=R(87); e=43
# print ("\ nSample from 5.6.4: p, a, e:", p, a, e)

### Sample from 5.8.4 ###

p=7351; R=Integers(p); a=R(1579); e = p-2
print ("\ nSample from 5.8.4: p, a, e:", p, a, e)

print ("\ nFirst , calculate all squares :")

b = ZZ(e).bits (); blen = len(b); b = [1] * blen
print ("b:", b, " len(b):", len(b))

expo=0; sumall=0; prodall=1;

for pos , bit in enumerate(b):
expo=2^pos; sumall += expo
if expo == 1:
z = a # keep a
else:
z = power_mod(z, 2, p); prodall *=z
print ("Pos: %2d" % pos , " Expo :%5d" % expo , " Sum:%5d" % sumall , " z:%5d" % z, " Prod :%5d �
� " % prodall)

print ("\ nSecond , calculate power a^e manually via square & multiply :")
b = ZZ(e).bits (); print ("b:", b, " len(b):", len(b), " e:", e); # b=b[::-1]; print(b);

expo=0; sumall=0; prodall=1; sum=0; prod=1

for pos , bit in enumerate(b):
expo=2^pos; sumall += expo
if expo == 1:
z = a # keep a
else:
z = power_mod(z, 2, p); prodall *=z
if bool(bit):
sum+= expo; prod *=z
print ("Pos: %2d" % pos , " Expo :%5d" % expo , " Sum:%5d" % sum , " z:%5d" % z, " Prod :%5d" �
� % prod)

# Using the build in Sage function

print ("\ nValidation :")
print (" power_mod(87,43,103) = ", power_mod(87,43,103)) # 85
# print (" power_mod(87,e,p) = ", power_mod(87,e,p))
# print (" power_mod(1579,e,p) = ", power_mod(1579,e,p))
print (" power_mod(1579,7349,7351) = ", power_mod(1579,7349,7351), "\n") # 4716

5.8.5 How Many Private RSA Keys d Are There Modulo 26

This chapter addresses in detail some of the questions posed by thoughtful students,
even if such no-gos are otherwise rather skipped in the literature.
According to Theorem 5.6, the arithmetic operations of modular expressions
are performed in the exponents modulo φ (n ) rather than modulo n. Note that here
we adopt the usual practice for the RSA procedure to use “n” rather than “m” to
denote the modulus.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 223 — #29

i i

5.8 Euler Function, Fermat’s Little Theorem, and Euler-Fermat 223

In a e·d ≡ a 1 (mod n ), if we wish to determine the inverses for the factor e in

the exponent, we need to calculate modulo φ (n ).
Example: (with reference to the RSA algorithm)
If we calculate modulo 26, which set can e and d come from?
Solution: We have e · d ≡ 1 (mod φ (26)).
The reduced remainder set (reduced residue system) R 0 are the elements in
Z26 , which have a multiplicative inverse; that is, which are relatively prime to
26: R 0 = Z∗26 = {1, 3, 5, 7, 9, 11, 15, 17, 19, 21, 23, 25} (see Definition 5.9).
R 0 has φ (26) = 12 elements.
The reduced remainder set R 00 contains all numbers from 1 to φ (n ) that are
relatively prime to φ (n ) = 12 : R 00 = {1, 5, 7, 11}. R 00 has φ (φ (26)) = 4
elements. Note that in the general case not necessarily all elements of R 00 have
to also be contained in R 0 . For n = 26 however, this is the case: R 00 ⊆ R 0 .
For every e in R 00 , there exists a unique d in R 00 such that a ≡ (a e )d (mod n ).
So there are four values possible for key d mod(26).
For every e in R 00 , there exists precisely one element d such that e · d ≡ 1
(mod φ (26)). This element d is not necessarily different from e: for example,
5 · 5 ≡ 1 (mod 12).
SageMath Example 5.9 calculates the two residue sets and the two φ functions.

SageMath Example 5.9: Calculate Residue System R 0 of 26 and Reduced

Residue System R 00 with φ (26)
sage: n=26; R1=n.coprime_integers(n); [i for i in R1]
[1, 3, 5, 7, 9, 11, 15, 17, 19, 21, 23, 25]
sage: f=euler_phi(n); R2=f.coprime_integers(f); R2; f; euler_phi(f); �
� len(R2)
[1, 5, 7, 11]
12
4
4

SageMath Example 5.9 considered the case for n = 26. With SageMath
Example 5.34 you can consider the general case, where n can be any integer. See
Section 5.17.6. The SageMath program delivers the number of all values d.
For all e that are coprime φ (n ) we can calculate d as follows using the Euler-
Fermat theorem:

d ≡ e−1 (mod φ (n ))
≡ eφ (φ (n ))−1 (mod φ (n )),
because a φ (n ) ≡ 1 (mod n ) matches a φ (n )−1 ≡ a −1 (mod n )

The problems of factorizing n = pq with q 6= p and of finding φ (n ) have a

similar degree of difficulty, and if we find a solution for one of the two problems,
we also have a solution for the other one:

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 224 — #30

i i

224 Introduction to Elementary Number Theory with Examples

If we know the factors of n = p · q with p 6= q, then φ (n ) = ( p − 1) · (q − 1) =

n − ( p + q ) + 1. Additionally, the values p and q are solutions of the quadratic
equation x 2 − ( p + q )x + pq = 0.
If only n and φ (n ) are known, then mind that pq = n and p + q = n − φ (n ) + 1.
So you get p and q by solving the equation x 2 + (φ (n ) − n − 1)x + n = 0.
See also condition 3 in Section 5.10.1.

5.9 Multiplicative Order and Primitive Roots

The multiplicative order (see Definition 5.10) and the primitive root are two useful
constructs (concepts) in elementary number theory.
Mathematicians often ask under which conditions the repeated application of
an operation results in the neutral element (compare Section 5.8.1).
For the i-times successive modular multiplication of a number a by itself there
is an i from {1, . . . , m − 1} where the power a i (mod m ) is the neutral element of
the multiplication if and only if a and m are relatively prime.

Definition 5.11 The multiplicative order or dm (a ) of an integer a (mod m ) (where

a and m are coprime) is the smallest integer i for which a i ≡ 1 (mod m ).

Example 1:
Table 5.7 shows the values a i mod 11 for the exponents i = 1, 2, . . . , 10, and for
the bases a = 1, 2, . . . , 10 as well as the resulting value or d11 (a ) for each a.
Table 5.7 also shows, for example, that the order of 3 modulo 11 has the value
5. SageMath Example 5.17 contains the source code to generate the entries. See
Section 5.17.3.
In a multiplicative group (here Z∗11 ) not all numbers necessarily have the same
order. The different orders in this case are 1, 2, 5, and 10, and we notice that:

1. The orders are all factors of 10.

2. The numbers a = 2, 6, 7, and 8 have the order 10, so we say that these
numbers have the maximum order in Z∗11 .

Definition 5.12 If a and m are coprime and if or dm (a ) = φ (m ) (i.e., a has maximum

order), then we say that a is a primitive root of m.

Table 5.7 Values of a i (mod 11), 1 ≤ a, i < 11 and Corresponding Order of a (mod 11)
i=1 i=2 i=3 i=4 i=5 i=6 i=7 i=8 i=9 i=10 or d11 (a )
a=1 1 1 1 1 1 1 1 1 1 1 1
a=2 2 4 8 5 10 9 7 3 6 1 10
a=3 3 9 5 4 1 3 9 5 4 1 5
a=4 4 5 9 3 1 4 5 9 3 1 5
a=5 5 3 4 9 1 5 3 4 9 1 5
a=6 6 3 7 9 10 5 8 4 2 1 10
a=7 7 5 2 3 10 4 6 9 8 1 10
a=8 8 9 6 4 10 3 2 5 7 1 10
a=9 9 4 3 5 1 9 4 3 5 1 5
a=10 10 1 10 1 10 1 10 1 10 1 2

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 225 — #31

i i

5.9 Multiplicative Order and Primitive Roots 225

In Section 5.17.4, “Primitive Roots” there are SageMath examples to calculate

primitive roots.
What is special with numbers a that are primitive roots, is that the powers a i
(mod 11) for running i with 1 ≤ i < 11 take on all values in Z∗n (cf. Theorem 5.15).
As already mentioned in Theorem 5.14, not for every modulus m does not have
a number a that is a primitive root. For example, m = 45 has no primitive roots a
at all.
In Table 5.7, only a = 2, 6, 7, and 8 are a primitive root with respect to mod
m = 11 (or dm (a ) = φ (11) = 10). SageMath outputs only the first primitive root
(2) via primitive_root(11). The SageMath Example 5.22 can output all primitive
roots a for a given m.
A few more comments based on Table 5.7:
• For your own experience it is good to be able to calculate values quickly.
The values of a column of this table can be calculated with SageMath like
this: [power_mod(a,7,11) for a in [1..10]]. This results in the seventh
column: [1, 7, 9, 5, 3, 8, 6, 2, 4, 10]
• How would the table go on? What would the 11th column be? With
[power_mod(a,11,11) for a in [1..10]] you can see that you get the
first column again. The columns repeat with a cycle length of φ (m ) = 10.
So a 11 = a 1 (mod 11) for all a = 1, 2, . . . , 10.
• It holds: a i = a i +k·φ (m ) (mod m ), since the exponent is calculated modulo
φ (m ).
Using the primitive roots, we can clearly establish the conditions for which
powers modulo m there is a unique inverse, and where the calculations in the
exponents is manageable.
The two Tables 5.8 and 5.9 show the multiplicative orders and primitive roots
modulo 45 and modulo 46.
Example 2:
Table 5.8 shows the values a i mod 45 for the exponents i = 1, 2, · · · , 12 and for
the bases a = 1, 2, . . . , 12 as well as the resulting value or d45 (a ) for each a.
SageMath Example 5.18 contains the source code to generate Table 5.8. See
Section 5.17.3.
φ (45) is calculated using Theorem 5.11: φ (45) = φ (32 · 5) = 31 · 2 · 4 = 24.
Since 45 is not a prime, there is no multiplicative order for all values of a (for
all numbers that are not relatively prime to 45 : 3, 5, 6, 9, 10, 12, · · · , because 45 =
32 · 5).
Example 3:
Is 7 a primitive root modulo 45?
The necessary—but not sufficient—requirement/condition gcd(7, 45) = 1 is
fulfilled. Table 5.8 shows that the number a = 7 is not a primitive root of 45,
because or d45 (7) = 12 6= 24 = φ (45).
Example 4:
Table 5.9 answers the question as to whether the number a = 7 is a primitive root
of 46.

The set {a i (mod m )| i = 1, . . . , φ (m )} equals the multiplicative group Z m

∗ if and

only if or dm (a ) = φ (m ).

Even for prime moduli p and 0 < a < p, not all a are of order φ ( p ) = p −
1. Compare Table 5.7 as an example. But if or dm (a ) = φ (m ), a i (mod p) goes
through all the values 1, . . . , p − 1. Exhausting all possible values of the set is an
important cryptographic proposition (compare Theorem 5.5). This determines a
permutation π ( p − 1).
Table 5.9 demonstrates that also for composite moduli m not all a are
of maximal order φ (m ). In this example only 5, 7, 11, 15, 17, 19, and 21 are of
order 22.
The left-hand side of Theorem 5.15 holds exactly if a is a primitive root modulo
m (see Definition 5.12).
The multiplicative group Z m ∗ (see Definition 5.9) contains all values from 1 to

m − 1 if and only if m is prime.

Example 5: Length of Cycles
Tables 5.10 and 5.11 serve as samples to introduce cycle lengths. This is a topic
that goes beyond the multiplicative order.
Cycle here means a sequence of numbers a i mod n with 1 ≤ i < n for a given a,
and a repeating sequence. According to the generation method as modular power,
here each number is unique within a cycle. The cycles here don’t have to contain
the 1 unless this cycle belongs to a multiplicative order: Then they have the 1 at the
end of the cycle and at the position a n−1 mod n.
With l we now mean the cycle length. The maximum cycle length lmax is φ (n ).
For elements that do not belong to Z∗n this can be explained with the Chinese
remainder theorem (see e.g., [11, p. 167]):
Let π be the mapping that maps every a ∈ Zn to the tuple of its remainders
(a1 , . . . , ar ) with ai ≡ a mod m i for i = 1, . . . , r . If a 6= 0 is not invertible in
Zn and therefore has no group order, at least one of those ai —but not all—are
equal to 0.
Now imagine substituting 1’s for every 0 component in this tuple. Then this
element is invertible and has a well-defined order in Z∗n which is by the theorem of
Lagrange (see also [11]) a divisor of φ (n ) and cannot be larger than φ (n ) of course.
See also Theorem 5.14.
In Tables 5.10 and 5.11, a runs through different values. For instance, for φ (a )
we have (according to Theorem 5.11):

• φ (14) = φ (2 · 7) = 1 · 6 = 6
• φ (22) = φ (2 · 11) = 1 · 10 = 10

The values in the tables can be explained this way:

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 229 — #35

i i

5.10 Proof of the RSA Procedure with Euler-Fermat 229

for every single component of that tuple find a primitive root modulo the
corresponding m i , the modulus of that component. In this manner we
get a tuple of primitive roots—with respect to the moduli m i —for which
by the Chinese remainder theorem there corresponds a unique number
a ∈ {1, . . . , n − 1}. This number then generates a cycle of maximal length
lcm(φ (m 1 ), φ (m 2 ), . . . , φ (m r )) as already mentioned in Theorem 5.14. The
listing 5.29 shows a SageMath example for computing such an element.
b. In some cases the maximum cycle length can be achieved for some values
of a despite no multiplicative order exists for them (then in Tables 5.10
and 5.11 under the column header or dm (a ) there is a “0” instead of a dash
like in Table 5.8).
Samples:
– In Table 5.10: lmax = φ (14) = 6 for a = 10, 12 (cell highlighted in red)
– In Table 5.11: lmax = φ (22) = 10 for a = 2, 6, 8, 18
Both cases are special cases of Theorem 5.14 because 14 and 22 are of the
form 2u with u not only odd but also prime.
SageMath Example 5.20 contains the source code to generate
Tables 5.10 and 5.11—see Section 5.17.3.

The topic of cycles and their lengths is also treated in detail in context with the
RSA plane, where the notions orbit and path is used. See Section 6.5, especially
Sections 6.5.8, 6.5.8.2, and 6.5.8.3.

5.10 Proof of the RSA Procedure with Euler-Fermat

Using the Euler-Fermat theorem (see Theorem 5.13) we can prove the RSA
procedure in the group Z∗n .
The RSA procedure is the most common asymmetric cryptography procedure.
Developed in 1978 by Ronald Rivest, Adi Shamir, and Leonard Adleman, it can be
used both for signatures and for encryption.

5.10.1 Basic Idea of Public-Key Cryptography and Requirements for

Encryption Systems
The basic idea behind public-key cryptography is that all participants possess a
different pair of keys (P and S) and the public keys for all recipients are published.
You can retrieve the public key P for a recipient from a directory just as you would
look up someone’s phone number in the phone book. Furthermore, each recipient
has a secret key S that is needed in order to decrypt the message and that is not
known to anyone else. If the sender wishes to send a message M, he encrypts it
before sending using the public key P of the recipient.
The ciphertext C is determined as C = E ( P, M ), where E (encryption) is the
encryption rule. The recipient uses his private key S to decrypt the message with
the decryption rule M = D ( S, C ).
In order to ensure that this system works for every message M, the following
four requirements must be met:

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 230 — #36

i i

230 Introduction to Elementary Number Theory with Examples

1. D ( S, E ( P, M )) = M for every M (invertibility).

2. All ( S, P ) pairs are different for all participants.
3. The time required to derive S from P is at least as high as the time required
to decrypt M with no knowledge of S.
4. Both C and M can be calculated relatively easily if the corresponding key is
known.

The first requirement is a general condition for all cryptographic encryption

algorithms.
The prerequisite of the second requirement can easily be met because there
is a very large number of prime numbers. According to the prime number theo-
rem (Theorem 4.7) of Legendre and Gauss, there are approximately n / ln(n ) prime
numbers up to the number n. This means, for example, that there are 6.5 · 1074
prime numbers under n = 2256 (= 1.1 · 1077 ) and 3.2 · 1074 prime numbers under
n = 2255 . Between 2255 and 2256 there are therefore 3.3 · 1074 prime numbers with
precisely 256 bits. Because of this large number of primes we cannot simply store
them all—just because of physics (see the number of atoms in the universe in the
overview under Section 4.12).
In addition, the second requirement can be ensured by a central office that
issues certificates (see Section 5.12.5.4).
It is the last requirement that makes the procedure actually usable. This is
because it is possible to calculate the powers in a linear amount of time (because
there is a restriction on the length of the numbers).
Although Whitfield Diffie and Martin Hellman formulated the general method
as early as 1976, the actual procedure that met all four requirements was publicly
discovered later by Rivest, Shamir, and Adleman in 1978.

5.10.2 How the RSA Procedure Works

The RSA procedure including its prerequisites and secondary conditions is described
here in detail. The seven individual steps for performing the RSA procedure can be
clustered as follows. Steps 1 to 3 constitute key generation, steps 4 and 5 are the
encryption, and steps 6 and 7 are the decryption:

1. Select two distinct random prime numbers p and q and calculate n = p · q.

The value n is called the RSA modulus.
In CT1 and often in the literature, the RSA modulus is denoted with a
capital “N .”
2. Select an arbitrary e ∈ {3, · · · , n− 1} such that e is relatively prime to φ (n ) =
( p − 1) · (q − 1).
We can then throw away p and q.
3. Calculate d ∈ {1, · · · , n − 1} with e · d ≡ 1 mod φ (n )); that is, d is the
multiplicative inverse of e modulo φ (n ). We can then throw away φ (n ).

→ (n, e) is the public key P.

→ (n, d ) is the private key S (only d must be kept secret).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 231 — #37

i i

5.10 Proof of the RSA Procedure with Euler-Fermat 231

4. For encryption, the message represented as a (binary) number is divided

into parts such that each part of the number is less than n.
5. Encryption of the plaintext (or the parts of it) M ∈ {1, · · · , n − 1}:

C = E ((n, e), M ) = M e (mod n )

6. For decryption, the ciphertext represented as a binary number is divided

into parts such that each part of the number is less than n.
7. Decryption of the ciphertext (or the parts of it) C ∈ {1, · · · , n − 1}:

M = D ((n, d ), C ) = C d (mod n )

Remarks:
1. The numbers p, q, n chosen in step 1 are extremely large in practice (e.g.,
p and q have 1000 bit each, n 2000 bit length).
2. Further security aspects of the implementation and the algorithm itself are
discussed in Sections 5.11 and 5.12.
3. In Section 6.5 the RSA algorithm is more deeply reasoned from number
theory: The RSA plane is a model to illustrate the processes in this algorithm
using pictures of rectangles.
4. Compaq introduced the multiprime method with high marketing effort in
2000. n was not the product of two primes, but of three: of two big ones
and one relative small prime: n = o · p · q. With Theorem 5.10 we get:
φ (n ) = (o − 1) · ( p − 1) · (q − 1). This method did not assert itself.
One reason probably was that Compaq claimed a patent on it. Gene-
rally there is less understanding in Europe and within the open-source
community that one can claim patents on algorithms. But there is really
no understanding outside the United States, that one can get a patent for
a special case (3 instead of 2 factors) of an algorithm (RSA), although the
patent for the general case was almost expired.1
5. If the two primes p and q are equal then (m e )d ≡ m mod n is not true for
all m < n (although e · d ≡ 1 mod φ (n ) is fulfilled).
Example: If n = 52 then according to Theorem 5.11, it is φ (n ) = 5 · 4 =
20, e = 3, d = 7, e · d = 21 ≡ 1 mod φ (n ).
But it is (53 )7 ≡ 0 mod 25. Therefore, p and q must be different.
6. The BSI (German Information Security Agency) recommends to choose the
prime factors p and q almost the same, but not too close:

0.5 < | log2 ( p) − log2 (q )| < 30

They recommend generating the primes independently and check that the
restriction is fulfilled (see [12]).

1. The multiprime RSA method is contained in JCT Default Perspective F Visuals as well as in the JCT
Algorithm Perspective.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 232 — #38

i i

232 Introduction to Elementary Number Theory with Examples

7. For reasons of security, the selected e should not be too small. Since φ (n ) =
( p − 1) · (q − 1) is even and e has to be relatively prime to φ (n ), e cannot be
2. So the smallest value for e is 3 [13, Chap. 7.2.7].
The BSI reference [12] recommends 216 + 1 ≤ 2256 − 1. The procedure
also allows us to select d freely and then calculate e. However, this has
practical disadvantages. We usually want to be able to encrypt messages
quickly, which is why we choose a public exponent e such that it has a short
bit length compared to the modulus n and as few binary ones as possible
(e.g., 216 + 1). So a fast exponentiation is possible when encrypting. The
prime numbers 3, 17, and 65537 have proved to be particularly practical for
this purpose. The most often used number is 65537 = 216 + 1, or in binary:
10 · · · 00 · · · 01 (this number is prime and therefore relatively prime to many
other numbers).

5.10.3 Proof that RSA Fulfills Requirement 1 (Invertibility)

Four requirements were set out in Section 5.10.1 that every practical asymmetric
encryption method must meet. Requirement 1 was that the procedure must be able
to be reversed unambiguously, such as a bijective mapping (this is fulfilled by RSA,
but not by the default Rabin cryptosystem).
For pairs of keys (n, e) and (n, d ) that possess the properties defined in steps 1
to 3 of the RSA procedure, the following must be true for all M < n:

M ≡ ( M e )d (mod n ) with ( M e )d = M e·d

This means that the deciphering algorithm above works correctly.

We therefore need to show that:

M e·d ≡ M (mod n )

We will show this in three steps using Theorem 5.12 (Fermat’s little theorem)
(according to [3, p. 131ff]).
Step 1:
In the first step we show that: M e·d ≡ M (mod p).
Since n = p · q and φ ( p · q ) = ( p − 1) · (q − 1) and since e and d are selected
in such a way that e · d ≡ 1 (mod φ (n )), there is an integer k such that: e · d =
1 + k · ( p − 1) · (q − 1).

M e·d ≡ M 1+k·φ (n ) ≡ M · M k·φ (n ) ≡ M · M k·( p−1)·(q−1) (mod p)

≡ M · ( M p−1 )k·(q−1) (mod p) based on little Fermat: M p−1 ≡ 1 (mod p)
≡ M · (1)k·(q−1) (mod p)
≡M (mod p)

The requirement for using the simplified Euler-Fermat (Theorem 5.12) was that M
and p are relatively prime.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 233 — #39

i i

5.10 Proof of the RSA Procedure with Euler-Fermat 233

Since this is not true in general, we need to consider the case when M and p
are not relatively prime. Since p is a prime number, this implies that p is a factor of
M. But this means:
M ≡ 0 (mod p ).

If p is a factor of M, then p is also a factor of M e·d . Therefore:

M e·d ≡ 0 (mod p ).

Since p is a factor of both M and M e·d , it is also a factor of their difference:

( M e·d − M ) ≡ 0 (mod p).

And therefore our conjecture is also true in this special case.

Step 2:
In exactly the same way we prove that: M e·d ≡ M (mod q).
Step 3:
We now combine the conjectures from step 1 and 2 for n = p · q to show that:

M e·d ≡ M (mod n) for all M < n.

From step 1 and 2 we have ( M e·d − M ) ≡ 0 (mod p ) and ( M e·d − M ) ≡ 0 (mod q ).

Therefore, p and q are both factors of the same number z = ( M e·d − M ). Since p
and q are distinct prime numbers, their product must also be a factor of this number
z. Thus:

( M e·d − M ) ≡ 0 (mod p · q ) or M e·d ≡ M (mod p · q ) or

M e·d
≡M (mod n ).

Comment 1:
We can also condense the three steps if we use the Theorem 5.13 (Euler-Fermat),
that is, not the simplified theorem where n = p and which corresponds to Fermat’s
little theorem:

( M e )d ≡ M e·d ≡ M ( p−1)(q−1)·k +1 ≡ ( |M ( p−{z

1)(q−1) k k
} ) ·M ≡ 1 ·M ≡ M (mod n ).
≡M φ (n ) ≡1 (mod n )

Comment 2:
When it comes to signing messages, we perform the same operations but first use
the secret key d (for signing), followed by the public key e (for validation). The RSA
procedure can also be used to create digital signatures because:

M ≡ ( M d )e (mod n ).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 234 — #40

i i

234 Introduction to Elementary Number Theory with Examples

• In 1994 a 129-digit RSA modulus (428 bits), published in 1977, was factor-
ized by a distributed implementation of the quadratic sieve algorithm (QS),
developed 1982 by Pomerance. This effort took 8 months. Please refer to
[16].
• In 1999 a 155-digit modulus (512 bits) was factorized with an implementa-
tion of the GNFS developed by Buhler, Lenstra, and Pomerance. The GNFS
is more efficient than QS if n is longer than about 116 decimal digits. This
effort took 5 months. Please refer to [17].
• Ten years later, at the end of 2009, a 232-digit modulus (768 bits) was
factorized by Kleinjung after 2 ½ years. See [18].
This clearly demonstrates that a modulus length of 768 bits no longer provides
sufficient protection against attackers.
For details about factorization progress since 1999, see Section 5.12.4.
A good website for online factorization is Dario Alpern’s Integer Factorization
Calculator; see [19].
RSA cannot only be attacked by factorization but by several—well known—
poorly chosen settings. A tool that implemented almost all these attacks (mostly in
Python, some in SageMath) is RsaCtfTool. See [20].

Lattice Base Reduction Algorithms

The modulus length l is not the only parameter relevant for security. Beneath
requirements from implementation and engineering the sizes and the proportions
of the parameters e, d, and n are relevant.
Corresponding attacks based on lattice reductions are a real threat for (too)
simple implementations of RSA. These attacks can be structured into the following
four categories:
• Attacks against very small public keys e (e.g., e = 3);
• Attacks against relatively small private exponents d (e.g., d < n 0.5 );
• Factorization of the modulus n, if one of the factors p or q is partly known;
• Attacks requiring that a part of the private key d is known (the motivation
for these partial key exposure attacks mainly arises from the study of side-
channel attacks on RSA).
Sections 11.8.2 and 11.9 go into more detail on lattice-based attacks. A very
good overview can be found on the website. LatticeHacks which is a joint work
by Daniel J. Bernstein, Nadia Heninger, and Tanja Lange; see [21]. On this website
you will find both a lecture at the CCC 2017 and the SageMath sources for this.
The four categories mentioned above are implemented in CTT; see Matthias
Schneider’s diploma thesis [22].
You can also find out which lattice-based methods and attacks are offered in
CrypTool in Section 11.12.

5.12.3 Forecasts about Factorization of Large Integers

Since 1980 a lot of progress has been made regarding factorization of large integers.
Estimations about the future development of the ability to factor RSA moduli vary

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 238 — #44

C158
On January 18, 2002, researchers at the University of Bonn factorized a 158-digit
decimal number into its both prime factors (these are built with 73 and 86 decimal
digits) using the GNFS method.
This record got much less attention within the press than the solution of RSA-
155. The task of the researchers was not initiated by a challenge, but they wanted
to find the last prime factors of the integer 2953 − 1 (see “Wanted List” in the
Cunningham Project [29]). The six smaller prime factors, already found before
have been:

3, 1907, 425796183929,
1624700279478894385598779655842584377,
3802306738549441324432139091271828121 and
128064886830166671444802576129115872060027

The first three factors can be easily computed.4 The next three prime factors were
found by P. Zimmermann, T. Grandlund, and R. Harley during 1999 and 2000
using the elliptic curve factorization method.

4. For example, using CT1 Indiv. Procedures F RSA Cryptosystem F Factorization of a Number.
CTO’s Msieve shows errors. Alpertron’s Calculator also finds the first three factors immediately. CT1 can
factorize in a reasonable time numbers only not longer than 250 bits.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 241 — #47

i i

5.12 Regarding the Security of the RSA Algorithm 241

The last remaining factor, called C158, was known to be composite by then,
but its factors were not known (the following three lines contain one number):

39505874583265144526419767800614481996020776460304936
45413937605157935562652945068360972784246821953509354
4305870490251995655335710209799226484977949442955603

The factorization of C158 resulted in the following two 73- and 86-digit prime
factors:
3388495837466721394368393204672181522
815830368604993048084925840555281177

and
1165882340667125990314837655838327081813101
2258146392600439520994131344334162924536139.

So now all eight prime factors of 2953 − 1 have been found.

RSA-160
On January 18, 2002, researchers at the University of Bonn factorized a 160-digit
number into its both prime factors (each with 80 decimal digits) using the GNFS
method. The computations for the factorization of RSA-160 also took place at the
German Information Security Agency (BSI) in Bonn.
The 160-digit decimal number origins from the old challenge list of RSA Secu-
rity. This number was retracted after RSA-155 had been factorized successfully. The
prime factors of RSA-160 were still unknown. So this record of the team of Franke
provides the solution of the old challenge, for which no prize is awarded anymore.
The composite number called RSA-160 is (the following three lines contain one
number):

215274110271888970189601520131282542925777358884567598017049
767677813314521885913567301105977349105960249790711158521430
2079314665202840140619946994927570407753

The factorization of RSA-160 resulted in the following two prime factors:

p = 45427892858481394071686190649738831
656137145778469793250959984709250004157335359

and
q = 47388090603832016196633832303788951
973268922921040957944741354648812028493909367

The calculations took place between December 2002 and April 2003.

RSA-200
On May 9, 2005, the research group of Jens Franke at the University of Bonn

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 242 — #48

i i

242 Introduction to Elementary Number Theory with Examples

announced that they factorized a 200-digit number into its both prime factors (each
with 100 decimal digits) using the GNFS method.
The composite number called RSA-200 is (the following three lines contain one
number):

2799783391122132787082946763872260162107044678695542853756000992932
6128400107609345671052955360856061822351910951365788637105954482006
576775098580557613579098734950144178863178946295187237869221823983

The factorization of RSA-200 resulted in the following two prime factors:

p = 35324619344027701212726049781984643686711974001976
25023649303468776121253679423200058547956528088349

and

q = 79258699544783330333470858414800596877379758573642
19960734330341455767872818152135381409304740185467

The calculations took place between December 2003 and May 2005. The
research group included Bahr, Böhm, Franke, Kleinjung, Montgomery, and te Riele.
The operating expense of the calculations was about 120,000 MIPS-years. A MIPS-
year (MY) is the quantity of operations a machine can perform in one year if the
machine constantly achieves one million integer operations per second (MIPS). For
context, an Intel Pentium processor then had about 10 MIPS.
To factorize a 2048-bit modulus it is estimated to need about 8.5 · 1040 MY.
A current processor (such as AMD Ryzen 5900) achieved around 105 MIPS at the
end of 2021.

C307/M1039
In May 2007, Franke, Kleinjung (University of Bonn), the Japanese telecommu-
nication company NTT, and Arjen Lenstra (Polytechnical University of Lausanne)
announced that they managed to factorize a 307-digit decimal number into its both
prime factors with the SNFS method (special number field sieve) within 11 months
(the two factors have 80 and 227 decimal digits).
The task of the researchers was not initiated by a challenge, but they wanted
to find the last prime factors of the Mersenne number 21039 + 1 from the “Wanted
List” of the Cunningham Project [29].
The numbers in the Cunningham table have the following notation: “(2,n)-”
means 2n − 1; “(2,n)+” means 2n + 1.
To describe the magnitude one writes “p<n>” or “c<n>”: “n” is the number of
decimal digits and “p” and “c” tell whether the number is prime or composite: So
21039 − 1 = p7 · c307 = p7 · p80 · p227.
It is explained more precisely in [34]. “2,651+” means 2651 + 1 and the size
(c209 means 209 decimal digits) of the number that was factored. Then come the
new factor(s), the discoverer, and the method used. Recently, only the multiple poly-
nomial quadratic sieve (ppmpqs), the elliptic curve method (ecm), and the number
field sieve (nfs) have been used. “hmpqs” stands for hypercube multiple polynomial

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 243 — #49

i i

5.12 Regarding the Security of the RSA Algorithm 243

quadratic sieve. Under “new factors,” “p90” means a 90-digit prime and “c201”
is a 201-digit composite number.
The number 21039 − 1 consists of three prime factors. The smallest one, p 7 =
5080711, was already known.5
To complete this, the second factor (codivider) “C307” had to be factorized.
Until then it was only known that the last remaining factor was composite, but it
was unknown how many prime factors it had and what were the prime factors. The
following five lines contain one number:

C 307 = 1159420574072573064369807148876894640753899791702017724986
868353538822483859966756608000609540800517947205399326123020487
440286043530286191410144093453512334712739679888502263075752809
379166028555105500425810771176177610094137970787973806187008437
777186828680889844712822002935201806074755451541370711023817

The factorization of C307 resulted in the following two 80- and 227-digit prime
factors:

p 80 = 558536666199362912607492046583159449686465270184
88637648010052346319853288374753

and

p 227 = 207581819464423827645704813703594695162939708007395209881208
387037927290903246793823431438841448348825340533447691122230
281583276965253760914101891052419938993341097116243589620659
72167481161749004803659735573409253205425523689.

So now the number 21039 − 1 is completely factorized in its three prime factors.

RSA-768
On December 12, 2009, the research group of Thorsten Kleinjung announced that
they factorized a 232-digit number into its both prime factors (both factors have 116
decimal digits). They used the GNFS method in a way where they did oversieving
on several hundred computers before starting the matrix step.
The composite number called “RSA-768” is (the following three lines contain
one number):

123018668453011775513049495838496272077285356959533479219732245215
172640050726365751874520219978646938995647494277406384592519255732
630345373154826850791702612214291346167042921431160222124047927473
7794080665351419597459856902143413

5. This one can also be found using CT1 Indiv. Procedures F RSA Cryptosystem F Factorization of
a Number with the algorithms of Brent, Williams, or Lenstra, which are good to separate relatively small
factors.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 244 — #50

i i

244 Introduction to Elementary Number Theory with Examples

The factorization of RSA-768 resulted in the following two prime factors (each with
384 bits):

p = 3347807169895689878604416984821269081770479498371376856891
2431388982883793878002287614711652531743087737814467999489

and

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 246 — #52

i i

246 Introduction to Elementary Number Theory with Examples

of 1024-bit integers the matrix step can, under an optimistic assumption

about the matrix size, be completed within a day by a device that costs a
few thousand dollars. We conclude that from a practical standpoint, the
security of RSA relies exclusively on the hardness of the relation collection
step of the number field sieve.

RSA Security concludes in its analysis of the Bernstein paper [37] from April 8,
2002, also—as expected—that RSA is still not compromised.

5.12.5.2 The TWIRL Device

In January 2003, Adi Shamir and Eran Tromer from the Weizmann Institute of
Science published a preliminary draft called “Factoring Large Numbers with the
TWIRL Device,” raising concerns about the security of key sizes up to 1024
bits [38].
Their abstract summarizes their results very well:

The security of the RSA cryptosystem depends on the difficulty of factor-

ing large integers. The best current factoring algorithm is the number field
sieve (NFS), and its most difficult part is the sieving step. In 1999 a large
distributed computation involving thousands of workstations working for
many months managed to factor a 512-bit RSA key, but 1024-bit keys were
believed to be safe for the next 15–20 years. In this paper we describe a new
hardware implementation of the NFS sieving step …which is 3–4 orders of
magnitude more cost-effective than the best previously published designs
…Based on a detailed analysis of all the critical components (but without
an actual implementation), we believe that the NFS sieving step for 1024-
bit RSA keys can be completed in less than a year by a $10M device, and
that the NFS sieving step for 512-bit RSA keys can be completed in less
than ten minutes by a $10K device. Coupled with recent results about the
difficulty of the NFS matrix step … this raises some concerns about the
security of those key sizes.

A detailed explanation from these two authors also can be found in the RSA
Laboratories CryptoBytes [39].
The three-page article in the DuD issue of June 2003 [40] contains a good
explanation of how the attack using the GNFS works and what progress is made to
factorize numbers. With GNFS we can distinguish two general steps: The sieve step
(relation collecting) and the matrix reduction. Besides that the sieve step is highly
parallelizable, it also dominates the overall calculation burden. Shamir and Tromer
haven’t built a TWIRL device yet, but the estimated costs of 10 to 50 million Euro
(in order to factorize a 1024-bit number) is not prohibitive for secret agencies or
big criminal organizations, as the costs for a single espionage satellite is estimated,
for example, to be several billion USD. The authors therefore recommend getting
rid of sensible RSA keys with a key length below 2048 bit as soon as possible.
This fits with recommendations like the BSI’s annual technical guideline [41]
to switch to longer RSA key lengths.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 247 — #53

i i

5.12 Regarding the Security of the RSA Algorithm 247

5.12.5.3 Primes in P: Primality Testing is Polynomial

In August 2002, the three Indian researchers M. Agrawal, N. Kayal, and N. Sax-
ena published the paper “Primes in P” about a new primality testing algorithm
called AKS [42]. They discovered a polynomial time deterministic algorithm for
determining if a number is prime or not.
The importance of this discovery is that it provides number theorists with new
insights and opportunities for further research. Lots of people over the centuries
have been looking for a polynomial time test for primality, and this result is a major
theoretic breakthrough. It shows that new results can be generated from already
known facts.
But even its authors note that other known algorithms may be faster (for exam-
ple ECPP). The new algorithm works on any integer. For example the GIMPS
project uses the Lucas-Lehmer primality test which takes advantage of the special
properties of Mersenne numbers. This makes the Lucas-Lehmer test much faster,
allowing to test numbers with millions of digits, while general-purpose algorithms
are limited to numbers with a few thousand digits.

5.12.5.4 Shared Primes: Moduli with Common Prime Factors

The RSA algorithm is based on the presumed difficulty of factorizing large biprime
integers (moduli), the factorizing problem. However, as pointed out in Lenstra et al.
[43] it is possible, given a set of moduli, to factorize quickly those that share prime
factors. In this case, the factorization problem is bypassed using the—relatively
easy—greatest common divisor (gcd) operation. On the other hand, it is no trivial
task to extract common shared primes and to factorize the corresponding moduli
efficiently for a very big number of given moduli (several millions).
Shared primes only occur if the RSA keys were not generated randomly. Taking
into consideration the significance of strong cryptographic keys, it is important to
verify that all keys were generated following the principle of true randomness [44].
When Lenstra et al. published their paper [43] in February 2012, they did not
publish the source code. However, soon afterward the source code of a similar
program was published at the CrypTool website [45] in Python and C++, and—
again a bit later—at the page used by [46]. The fastest code known to me comes
from [46].
These applications find all shared factors that may exist, given a finite set of
moduli—even if this set includes millions of moduli. Such an application enables
system administrators to test their own RSA keys.
The quite naive way to find all shared factors would be to compare each mod-
ulus with all other moduli, which has a complexity growing quadratically with the
number of moduli.
A very efficient method using trees for finding shared prime factors is based on
a publication of Dan Bernstein in 2005 [47]. Bernstein uses a precalculation that
leads to the product of all moduli.
This is another example showing how helpful precalculations can be to break
cryptographic systems. Another famous example are rainbow tables used to find
the origin of a hash value [48].

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 248 — #54

i i

248 Introduction to Elementary Number Theory with Examples

Comparing the Execution Time to Calculate gcd and to Factorize

SageMath Example 5.10 shows the very different run times when calculating a
gcd and a factorization. The following operations are very fast: multiplication of
factors, dividing a modulus by a known factor, or calculating the gcd. However,
factorizing moduli steeply increases with longer moduli. Even the relatively small
moduli used in this example show this: For the smaller modulus m 1 (69 decimal
digits, 228 bits) 35 sec were needed; the larger m 2 (72 decimal digits, 239 bits)
took 99 sec. Furthermore, the operations (multiplication, division, and gcd) show
big differences in execution time when the used operands are very different in size.

SageMath Example 5.10: Comparing the Execution Time for Calculating a

gcd and Performing a Factorization
print ("\n# CHAP05 -- Sage -Script -SAMPLE 010: =========")

import time # in scripts: measure time like in Python and calculate �

� execution time

print (" Multiplication: -----------------------")

cf=3593875704495823757388199894268773153439
Start_Time = time.time ()
m1=cf * 84115747449047881488635567801
print (" Time = %f sec \n m1:" % (time.time ()-Start_Time), m1)

Start_Time = time.time ()
m2=cf * 162259276829213363391578010288127
print (" Time = %f sec \n m2:" % (time.time ()-Start_Time), m2)

print (" Division: -----------------------")

Start_Time = time.time ()
r=302301541122639745170382530168903859625492057067780948293331060817639 �
� / \
3593875704495823757388199894268773153439
print (" Time = %f sec \n m1/cf:" % (time.time ()-Start_Time), r)

Start_Time = time.time ()
r=583139672825572068433667900695808357466165186436234672858047078770918 �
� 753 / \
3593875704495823757388199894268773153439
print (" Time = %f sec \n m2/cf:" % (time.time ()-Start_Time), r)

print ("gcd: -----------------------")

Start_Time = time.time ()
r=gcd(58313967282557206843366790069580835746616518643623467285804707877 �
� 0918753, \
3023015411226397451703825301689038596254920570677809482933310 �
� 60817639)
print (" Time = %f sec \n gcd(m2,m1):" % (time.time ()-Start_Time), r)

print (" Factorization: -----------------------")

Start_Time = time.time ()
r=factor(58313967282557206843366790069580835746616518643623467285804707 �
� 8770918753)
print (" Time = %f sec \n m2 =" % (time.time ()-Start_Time), r)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 249 — #55

i i

5.12 Regarding the Security of the RSA Algorithm 249

SageMath Example 5.10 (continued)

Start_Time = time.time ()
r=factor(30230154112263974517038253016890385962549205706778094829333106 �
� 0817639)
print (" Time = %f sec \n m1 =" % (time.time ()-Start_Time), r)

Differences Between Sage Script: Sage Command Line

The calls from the script in the SageMath Example 5.11 can also be executed on the
SageMath CLI (command line interface); see Section 5.12.5.4. For short sequences
of commands the CLI is easier: Operations output their results immediately (with-
out print()); and the execution time is obtained syntactically easier, namely simply
by putting %time in front of it. The usual way to achieve this in a script is to
get current time and calculate execution time as the difference for each statement:
ExecutionTime = time.time() - StartTime.
Hint 1: In both cases, a terminating backslash is used to span a single line of code
over multiple lines.
Hint 2: To stop and exit your script somewhere in between use the corresponding
Python call: sys.exit(1).
Remark: The factor() command can be used with the PARI defaults or you can spec-
ify the dedicated algorithm (here Bill Hart’s quadratic sieve and Paul Zimmermann’s
GMP-ECM).

SageMath Example 5.11: Printing the Execution Time on the Sage Command
Line / Factorize
sage: # Calculate gcd
sage: %
....: 3023015411226397451703825301689038596254920570677809 �
� 48293331060817639)
CPU times: user 15 µs , sys: 0 ns , total: 15 µs
Wall time: 16.2 µs
3593875704495823757388199894268773153439

sage: # Factorize (using PARI at the time of writing)

sage: %
CPU times: user 1min 33s, sys: 129 ms , total: 1min 33s
Wall time: 1min 33s
162259276829213363391578010288127 * 35938757044958237573881998942687731 �
� 53439

sage: # Factorize (select dedicated algorithms)

sage: n=583139672825572068433667900695808357466165186436234672858047078 �
� 770918753

sage: %
/usr/lib/python3/dist -packages/IPython/core/interactiveshell.py:2364: �
� RuntimeWarning:
the factorization returned by qsieve may be incomplete (the factors may �
� not be prime)
or even wrong; see qsieve? for details result = fn(*args , ** kwargs)
CPU times: user 1.92 ms , sys: 10 µs , total: 1.93 ms

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 250 — #56

i i

250 Introduction to Elementary Number Theory with Examples

SageMath Example 5.11 (continued)

Wall time: 48.8 s

162259276829213363391578010288127 * 35938757044958237573881998942687731 �
� 53439

sage: %
CPU times: user 74 ms , sys: 28 ms , total: 102 ms
Wall time: 3min 48s
162259276829213363391578010288127 * 35938757044958237573881998942687731 �
� 53439

sage: # Knowing the size of the number and its factors

sage: n=162259276829213363391578010288127; len(n.digits ())
33
sage: n=3593875704495823757388199894268773153439; len(n.digits ())
40
sage: n=583139672825572068433667900695808357466165186436234672858047078 �
� 770918753; len(n.digits ())
72

Efficient Computing of All Shared Primes

The paper “Mining Your Ps and Qs: Detection of Widespread Weak Keys in
Network Devices” [46] explains how the algorithm efficiently calculates the gcds
(greatest common divisors) of every pair of RSA moduli (taken from a given but
huge set of RSA moduli). More precisely, the gcds of all pairs of RSA moduli are
not computed but the gcds with other arguments, which is faster and delivers the
same result: shared primes.
This section explains the essential part of the method used in this paper: Using
two trees greatly accelerates the calculation of the gcds.
First the product P of all moduli m i for i = 1, . . . , k is calculated by using a
product tree (see Figure 5.4):
Yk
P= mi
i =1

Then, using a remainder tree, for each i the remainder z i of the division of P by m i2
is computed:
z i ≡ P mod m i2 , z i ∈ {0, 1, . . . , m i2 − 1}

Now this remainder modulo the square of m i is divided by m i , yielding an integer

value because m i2 as well as P are divisible by m i and therefore also z i is divisible
by m i :
zi
ri = ; ri ≤ m i
mi

To finish, we only have to compute the gcd:

gcd (ri , m i )

This is visualized in Figure 5.4, which is taken from [46] with some minor changes.

“Esslinger” — 2023/11/30 — 19:44 — page 253 — #59

i i

5.13 Applications of Asymmetric Cryptography Using Numerical Examples 253

example, the sender of a message could add a number to be kept secret (the key
S) to the plaintext number M and thereby obtain the ciphertext number C:

C =M+S

By reversing this operation, that is, by subtracting S, the receiver can reconstruct
the plaintext:
M =C−S
Adding S reliably obfuscates the plaintext. Nevertheless, this “encryption” is very
weak: If an eavesdropper gets her hands on just one pair of plaintext and ciphertext
numbers, she can calculate the key

S=C−M

i i

256 Introduction to Elementary Number Theory with Examples

SageMath Example 5.12 (continued)

print ("a) via 'normal ' integer numbers ")

print (" a:", discrete_log(mod(49,347),mod(11,347)))

print ("b) via the ring of integers (better)")

R=Integers(347)
g=R(11)
A=R(49)
print (" a:", discrete_log(A,g))

print ("Get the secret key of Bob: (with g=11, p=347, B=285, b=39) ---")
B=R(285)
print (" b:", discrete_log(B,g))

#------------------------------------
# CHAP05 -- Sage -Script -SAMPLE 020: =========
# Get the secret key of Alice (with g=11, p=347, A=49, a=240 or 67) ---
# a) via 'normal ' integer numbers
# a: 67
# b) via the ring of integers (better)
# a: 67
# Get the secret key of Bob: (with g=11, p=347, B=285, b=39) ---
# b: 39

As the SageMath function discrete_log expects as arguments only elements of

a ring (integers between 0 and an upper limit), we can enforce this type by entering
the numbers directly with the corresponding modulo operator:
discrete_log( mod(49, 347), mod(11, 347) )
A much better alternative is to let SageMath know from the very beginning
that they are elements of a ring (as in SageMath Example 5.12). After this extra
“burden” for the initialization, you can write the formulas as you are used to:
discrete_log(A, g)
Such number theoretic tasks can also be solved using other tools like PariGP,
BC, or Mathematica. Here is the corresponding syntax to get the discrete log for
Alice (all function calls deliver the result 67):
• Pari-GP: znlog(Mod(49,347),Mod(11,347)).
• Mathematica: MultiplicativeOrder[11, 347, 49]
The general “Solve” function provides the “em tdep message”: The
equations appear to involve the variables to be solved for in an essentially
nonalgebraic way.
Why did the functions deliver the value 67 for the discrete logarithm of Alice
rather than 240, which Alice selected as exponent a?
The discrete logarithm is the smallest natural exponent that solves the equation
11 ≡ 49 (mod 347). Both x = 67 and x = 240 (the number selected in the exam-
x

ple) satisfy the equation and can therefore be used to calculate the session key:
285240 ≡ 28567 ≡ 268 (mod 347). If Alice and Bob had selected a primitive root
modulo p as base g, then for every remainder from the set {1, 2, . . . , p − 1} there is
exactly one exponent from the set {0, 1, . . . , p − 2}.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 257 — #63

5.14 The RSA Procedure with Specific Numbers 259

Key generation (steps 1 to 3):

1. p = 503, q = 509 (n = 256027; φ (n ) = ( p − 1)(q − 1) = 255016 =
23 · 127 · 251)
2. e = 65537 (e must be relatively prime to 255016)
3. d = 231953 (since e ≡ d −1 (mod φ (n )) : 65537 · 231953 ≡
15201503761 ≡ 1 (mod 255016)). Other possible combinations of (e, d )
include: (3, 170011), (5, 204013), (7, 36431).

Variant 1: All ASCII characters are en-/decrypted separately (no blocks are formed)
See Section 5.17.5 for the source code for RSA operations like modular exponenti-
ation or the Euler function using SageMath.

4. Encryption:
Text: R S A w o r k s !
Number: 82 83 65 32 119 111 114 107 115 33

The letters are not combined here. For secure procedures we need large numbers
that accept—as far as possible—all values up to n − 1. If the possible value set for
the numbers in the message is too small, even large prime numbers cannot make
the procedure secure. An ASCII character is represented by 8 bits. If we want larger
values we must combine several numbers. Two characters need 16 bits, whereby
the maximum value that can be represented is 65536. The modulus n must then
be greater than 216 = 65536. This is applied in variant 2. When the numbers are
combined, the leading zeros are kept in binary notation (just as if we were to write
all numbers with three digits in decimal notation above and were then to obtain the
sequence 082 083, 065 032, 119 111, 114 107, 115 033).
Each character is encrypted using: C = M 65537 (mod 256027):

212984 025546 104529 031692 248407

100412 054196 100184 058179 227433

9. - You can handle this exercise using CT1 Indiv. Procedures F RSA Cryptosystem F RSA
Demonstration.
- Using CT2 Templates F Mathematics F RSA with big numbers for single numbers.
- Using JCT Default Perspective F Visuals F RSA Cryptosystem you can handle this task too.
- In CTO in the plugin “RSA visual and more” both variants (with and without block formation)
and both sequences of coding (concatenation of the binary representation of the individual characters,
or their decimal representation is concatenated first and then converted to binary) can also be used:
https://www.cryptool.org/en/cto/rsa-visual. See Figures 5.8 and 5.9 for variant 2.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 262 — #68

i i

262 Introduction to Elementary Number Theory with Examples

Each block is encrypted using: C ≡ M 65537 (mod 256027):

158721 137346 37358 240130 112898

5. Decryption:
Ciphertext:

158721 137346 37358 240130 112898

Each block is decrypted using: M ≡ C 231953 (mod 256027):

21075 16672 30575 29291 29473

4’. Encryption:
Blocks are formed (each block contains two ASCII characters, and the ASCII
characters are written as two 3-digit decimal numbers one after the other):

82083 65032 119111 114107 115033

RSA encryption works correctly with the modulus n = 256027 because each ASCII
block of two characters will be encoded into a number that is smaller or equal to
the number 255,255.

11. You can solve this using CT1 Indiv. Procedures F RSA Cryptosystem F RSA Demonstration with the
following options: all 256 ASCII characters, b-adic, block length 2, and decimal representation.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 265 — #71

i i

5.14 The RSA Procedure with Specific Numbers 265

Each block is encrypted using: C ≡ M 65537 (mod 256027):

198967 051405 254571 115318 014251

5’. Decryption:
Ciphertext:

198967 051405 254571 115318 014251

Each block is decrypted using: M ≡ C 2473 (mod 67519):

82083 65032 119111 114107 115033

5.14.4 A Small RSA Cipher Challenge, Part 1

The following task is taken from [11, Exercise 4.6] and the pure solution has been
published by Douglas Stinson. However, it is not the result that is important here
but rather the individual steps of the solution; that is, the explanation of the crypt-
analysis. The method of solving the problem is outlined in the scenario of the online
help to CT1 and in the CT1 presentation on the CT website.
Here is the task in its original text:
Two samples of RSA ciphertext are presented in Tables 5.15 and 5.16. Your
task is to decrypt them. The public parameters of the system are

n = 18923 and e = 1261 (for Table 5.15);

n = 31313 and e = 4913 (for Table 5.16).

The cryptanalysis can be accomplished as follows. First, factor n (which is easy

because it is so small). Then compute the exponent d from φ (n ), and, finally, decrypt
the ciphertext. Use the square-and-multiply algorithm to exponentiate modulo n.
In order to translate the plaintext back into ordinary English text, you need to
know how alphabetic characters are encoded as elements in Zn . Each element of Zn
represents three alphabetic characters as in the following examples (with A = 0):

DOG 7→ 3 · 262 + 14 · 26 + 6 = 2398

CAT 7 → 2 · 262 + 0 · 26 + 19 = 1371
ZZZ 7 → 25 · 262 + 25 · 26 + 25 = 17575

You will have to invert this process as the final step in your program.
The first plaintext was taken from The Diary of Samuel Marchbanks by Robert-
son Davies, 1947, and the second was taken from Lake Wobegon Days by Garrison
Keillor, 1985.

5.14.5 A Small RSA Cipher Challenge, Part 2

The following task is a corrected version from the book written by Song Yan [51,
Example 3.3.7, p. 318]. Like in Section 5.14.4, it is not the result that is impor-
tant here but understanding the individual steps of the solution. The method of
solving the problem is outlined in the scenario of the online help to CT1 and in

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 266 — #72

i i

266 Introduction to Elementary Number Theory with Examples

Table 5.15 RSA Ciphertext A

12423 11524 7243 7459 14303 6127 10964 16399
9792 13629 14407 18817 18830 13556 3159 16647
5300 13951 81 8986 8007 13167 10022 17213
2264 961 17459 4101 2999 14569 17183 15827
12693 9553 18194 3830 2664 13998 12501 18873
12161 13071 16900 7233 8270 17086 9792 14266
13236 5300 13951 8850 12129 6091 18110 3332
15061 12347 7817 7946 11675 13924 13892 18031
2620 6276 8500 201 8850 11178 16477 10161
3533 13842 7537 12259 18110 44 2364 15570
3460 9886 8687 4481 11231 7547 11383 17910
12867 13203 5102 4742 5053 15407 2976 9330
12192 56 2471 15334 841 13995 17592 13297
2430 9741 11675 424 6686 738 13874 8168
7913 6246 14301 1144 9056 15967 7328 13203
796 195 9872 16979 15404 14130 9105 2001
9792 14251 1498 11296 1105 4502 16979 1105
56 4118 11302 5988 3363 15827 6928 4191
4277 10617 874 13211 11821 3090 18110 44
2364 15570 3460 9886 9988 3798 1158 9872
16979 15404 6127 9872 3652 14838 7437 2540
1367 2512 14407 5053 1521 297 10935 17137
2186 9433 13293 7555 13618 13000 6490 5310
18676 4782 11374 446 4165 11634 3846 14611
2364 6789 11634 4493 4063 4576 17955 7965
11748 14616 11453 17666 925 56 4118 18031
9522 14838 7437 3880 11476 8305 5102 2999
18628 14326 9175 9061 650 18110 8720 15404
2951 722 15334 841 15610 2443 11056 2186

Table 5.16 RSA Ciphertext B

6340 8309 14010 8936 27358 25023 16481 25809
23614 7135 24996 30590 27570 26486 30388 9395
27584 14999 4517 12146 29421 26439 1606 17881
25774 7647 23901 7372 25774 18436 12056 13547
7908 8635 2149 1908 22076 7372 8686 1304
4082 11803 5314 107 7359 22470 7372 22827
15698 30317 4685 14696 30388 8671 29956 15705
1417 26905 25809 28347 26277 7897 20240 21519
12437 1108 27106 18743 24144 10685 25234 30155
23005 8267 9917 7994 9694 2149 10042 27705
15930 29748 8635 23645 11738 24591 20240 27212
27486 9741 2149 29329 2149 5501 14015 30155
18154 22319 27705 20321 23254 13624 3249 5443
2149 16975 16087 14600 27705 19386 7325 26277
19554 23614 7553 4734 8091 23973 14015 107
3183 17347 25234 4595 21498 6360 19837 8463
6000 31280 29413 2066 369 23204 8425 7792
25973 4477 30989

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 267 — #73

i i

5.15 Didactic Comments on Modulo Subtraction 267

the CrypTool 1 presentation (see https://www.cryptool.org/assets/ct1/presentation-

s/CrypTool1-Presentation-en.pdf, pp. 52–57).
There are three tasks with completely different degrees of difficulty here. In
each case we know the ciphertext and the public key (e, n ):

a. Known-plaintext attack: find the secret key d using the additionally known
original message.
b. Ciphertext-only attack: find d and the plaintext.
c. Calculate the RSA modulus; in other words, factorization (with no knowl-
edge of the message).

n = 63978486879527143858831415041, e = 17579
Message:

1401202118011200,
1421130205181900,
0118050013010405,
0002250007150400

Cipher:

45411667895024938209259253423,
16597091621432020076311552201,
46468979279750354732637631044,
32870167545903741339819671379

Comment:
The original message consisted of a sentence containing 31 characters (coded with
the capital letters’ alphabet from Section 5.14.2). Each group of 16 decimal numbers
is then combined to form one number (the last number is filled with zeros). These
numbers are raised to the power of e.
When you decrypt the message you must fill the calculated numbers with lead-
ing zeros in order to obtain plaintext. This needs to be stressed because the type
of padding is extremely important during implementation and standardization for
interoperable algorithms.

5.15 Didactic Comments on Modulo Subtraction

Comment on subtraction modulo 5: 2 − 4 = −2 ≡ 3 mod 5.

It is therefore not true that −2 ≡ 2 mod 5.
People often make the mistake of equating this. It is easy to see why this is not
the same if you place the permutation (0, 1, 2, 3, 4) in Z5 , for example from −11
to +11, over the range of numbers in Z, like in Figure 5.10. Moving then on the
number line of integers from 3 to the left by 5, one ends up with the next element
belonging mod 5 to the same residue class as 3, which is −2.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 268 — #74

i i

268 Introduction to Elementary Number Theory with Examples

Figure 5.10 Number line of integers compared to modulo 5 numbers.

Table 5.17 Schematic Representation of Which Integers

Belong to the Same Residue Class Modulo 26
−26 −25 ... −2 −1
0 1 ... 24 25
26 27 ... 50 51

From time to time, some students ask how to deal with negative results (e.g.,
−1). For example, for affine ciphers with 26 characters, you calculate modulo 26.
In Z26 = {0, 1, 2, · · · , 25} it is 0 − 1 = −1 = 25 (mod 26).
The modulo calculation and which numbers belong to the same residue class
are illustrated in Table 5.17. You can see that numbers in the same column belong to
the same residue class. Such didactic representations help certain types of learners
more than formulas.
So here you can see that the following numbers are congruent: 51 ≡ 25 ≡
−1 (mod 26). Being congruent means belonging to the same residue class. Two
numbers are congruent if their difference can be divided by the modulus. So, (25 −
(−1)) = (51 − 25) = 26 are all divisible by 26.

5.16 Base Representation and Base Transformation of

Numbers and Estimation of Length of Digits

For a given number z one may ask how to represent√ such a number. In general
we use representations like z = 2374 or z = 2. The second number consists of
an infinite number of digits and therefore it can never be described precisely by
the first representation. You can get around this problem by writing the number
symbolically. But if you have to write it in digits, the number must be rounded.
We represent numbers usually in the decimal system (base 10). Computers are
working with the binary representation of numbers—only for the display numbers
are represented in decimal or sometimes hexadecimal (base 16) form.
This section describes how to generate arbitrary base representations of any
positive integer and how to determine the number of required digits via the
logarithm function.

5.16.1 b-adic Sum Representation of Positive Integers

Given base b, each positive integer z can be represented as a b-adic sum

z = an bn + an−1 bn−1 + · · · + a1 b + a0 ,

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 269 — #75

i i

5.16 Base Representation and Base Transformation of Numbers and Estimation of Length 269

where ai ∈ {0, 1, . . . , b − 1}, i = 0, 1, . . . , n are called digits.

For this sum, we have:

1. For arbitrary digits a0 , a1 , . . . , an it holds: bn +1 > an bn + an−1 bn−1 + · · · +

a1 b + a0 .
2. There exist digits a0 , a1 , . . . , an (namely ai = b − 1 for i = 0, . . . , n), with
bn +1 − 1 ≤ an bn + an−1 bn−1 + · · · + a1 b + a0 .

(Using these inequalities it can be shown that each positive integer can be repre-
sented by a b-adic sum).
By writing the digits an an−1 · · · a1 a0 in a row directly after each other (without
the bi ) the usual writing for numbers becomes available.

Example:
base b = 10: 10278 = 1 · 104 + 0 · 103 + 2 · 102 + 7 · 101 + 8.
base b = 16: FE70A = 15 · 164 + 14 · 163 + 7 · 162 + 0 · 161 + 10.

5.16.2 Number of Digits to Represent a Positive Integer

For a positive integer z the length of the b-adic representation can be determined
via the following steps. Starting from the inequality bn +1 > z ≥ bn we have—after
applying the logarithm function on basis b—n + 1 > logb z ≥ n. Therefore, we have
n = blogb zc (the notion bxc for a positive real number indicates to round down
to the next positive integer or do nothing if x itself is an integer). We call lb (z ) the
number of required digits to represent the number z on the base b. We have

lb (z ) := blogb zc + 1

Applying the logarithm formula on base b and b0 we have logb z = logb0 z / logb0 b.
It is therefore easy using, for example, logarithm tables for the base b0 = 10 to com-
pute the logarithm of base b = 2. With SageMath it is even easier: The command
log(n,b) returns the logarithm ob n to the base b, usually in an algebraic form:
log(101,10) returns log(101)/log(10) where log(n) is the natural logarithm
with base e. For numerical values use log(101,10).n(), then you get 2.0043...
and for rounding down use floor(log(101,10)) then you get 2.

Example 1 (decimal→hex)
We compute for the decimal number z = 234 (EA in hex) the hexadecimal
representation (number base b = 16)

l16 (z ) = blog16 (z )c + 1 = bln(z )/ ln(16)c + 1 = b1.96 . . . c + 1 = 1 + 1 = 2.

Example 2 (decimal→binary)
We compute for the decimal number z = 234 (11101010 in binary) the binary
representation (number base b = 2)

l2 (z ) = blog2 (z )c + 1 = bln(z )/ ln(2)c + 1 = b7.87 . . . c + 1 = 7 + 1 = 8.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 270 — #76

i i

270 Introduction to Elementary Number Theory with Examples

Example 3 (binary→decimal)
We compute for the binary number z = 11101010 (234 decimal) the decimal
representation (number base b = 10)

l10 (z ) = blog10 (z )c + 1 = bln(z )/ ln(10)c + 1 = b2, 36 . . . c + 1 = 2 + 1 = 3.

5.16.3 Algorithm to Compute the Base Representation

Given the number z one can compute the base b representation of z using the
following algorithm:

input: z, b
n := 0, z 0 := z
while z 0 > 0 do
an := z 0 (mod b)
z 0 := bz 0 /bc
n := n + 1
end do
output: an an−1 · · · a1 a0 in base b representation.

Example 4 (decimal→hex)
The integer z = 234 on the number base 10 will be transformed into the hex rep-
resentation via a0 = 234 (mod 16) = 10 = A; 234/16 = 14 = E, a1 = 14
(mod 16) = E, and therefore we have E A.

Example 5 (binary→decimal):
The binary number z = 1000100101110101 is transformed into the decimal repre-
sentation via the following steps:

1000100101110101 = 1001 (mod 1010) =⇒ a0 = 9, 1000100101110101/1010 =

110110111110
110110111110 = 1000 (mod 1010) =⇒ a1 = 8, 110110111110/1010 = 101011111
101011111 = 1 (mod 1010) =⇒ a2 = 1, 10101111/1010 = 100011
100011 = 101 (mod 1010) =⇒ a3 = 5, 100011/1010 = 1
11 = 11 (mod 1010) =⇒ a4 = 3
therefore z = 35189.

SageMath Example 5.13 contains code for the examples about digit length and
converting the representation between different bases. Sage integers can be read
as a decimal, octal, hexadecimal, or binary number: Integer() or ZZ() interpret
strings that begin with “0o” as octal numbers, strings that begin with “0x” as
hexadecimal numbers, and strings that begin with “0b” as binary numbers. We can
omit Integer() and ZZ() when entering a number on the Sage command line, as
the interpretation as a Sage integer is the default.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 271 — #77

i i

5.16 Base Representation and Base Transformation of Numbers and Estimation of Length 271

SageMath Example 5.13: Number of Digits Representing a Positive Integer

and Transformation Between Different Bases
# Length of decimal number
z=234; z.ndigits ()
3

# Sample 1 -- Length of hex: decimal --> hex

sage: z.ndigits(16)
2

# Sample 2 -- Length of binary: decimal --> binary

sage: z.ndigits(2) # Alternative: 234.nbits ()
8

# Sample 3 -- Length of decimal: binary --> decimal

sage: z=Integer('0b11101010 '); z; z.ndigits ()
234
3

# Enter a number of a given base as Sage integer

sage: Integer('0x12 ')
18
sage: Integer('-0o12 ')
-10
sage: Integer ('+0b101010 ')
42

# Output a Sage integer as number of a required base

sage: Integer(2^10).str(2)
'10000000000'
sage: print(Integer(800).oct())
1440

# Sample 4 -- show representation of hex: decimal --> hex

sage: print(Integer(234).hex())
ea

# Sample 5 -- show representation of decimal: binary --> decimal

# a) via a conversion using Python int() and str()
sage: z=1000100101110101; z
1000100101110101
sage: type(z)
<class 'sage.rings.integer.Integer '>
sage: int(str(z), base=2)
35189
# b) more directly using Sage integer
# sage: z=ZZ('0b1000100101110101 '); z
sage: z=0b1000100101110101; z
35189

sage: s = '12d'; zx=ZZ('0x'+s); zx; print(Integer(zx).hex())

301
12d

sage: a = ZZ(189866136719308462018271159242437168532); a.binary (); �

� print(a.binary ())
# '100011101101011011100011010001 ... 10100000000110110010100'
# 100011101101011011100011010001 ... 10100000000110110010100

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 272 — #78

i i

272 Introduction to Elementary Number Theory with Examples

5.17 Examples Using SageMath

Below you can find SageMath source code related to contents of this Chapter 5.
We also recommend the short article by Nguyen, which is didactically very clear
and treats basic number theory and SageMath usage [52].

5.17.1 Addition and Multiplication Tables Modulo m

SageMath Example 5.14 calculates the addition and multiplication Tables 5.1, 5.2,
and 5.3. The calculation is done once with self-written code, and once ready-made
functions from SageMath are used.

SageMath Example 5.14: Creating Small Addition and Multiplication Tables

with SageMath
print ("\n# CHAP05 -- Sage -Script -SAMPLE 025: =========")

# Create tables with own code .....................................

m = 5; print (" Addition table mod %d" % m)
for i in range(0,m):
print( [mod(i+j, m) for j in range(0,m)] )

m = 5; print (" Multiplication table mod %d" % m)

for i in range(1,m):
print( [mod(i*j, m) for j in range(1,m)] )

m = 6; print (" Multiplication table mod %d" % m)

for i in range(1,m):
print( [mod(i*j, m) for j in range(1,m)] )

# Create tables with predefined Sage functions for magmas .....................................

# https :// doc.sagemath.org/html/en/reference/categories/sage/categories/magmas.html#sage. �
� categories.magmas.Magmas.ParentMethods.multiplication_table
m = 5; R=Zmod(m)
T = R.addition_table(names='digits '); print ("\n", T, sep='') # same result with (names='digits �
� ')

m = 5; R=Zmod(m)
T = R.multiplication_table(names='digits '); print(T)
# print( latex(T) ) # get the code for LaTeX to print the according table

m = 5; R=Zmod(m); elem = [str(i) for i in range(1,m)] #; print (" elem: ", elem)
T = R.multiplication_table(names='elements ', elements=elem); print(T) # =elem instead of �
� elements =('1 ','2 ','3 ','4 ')
# print(T.column_keys (), "\n") # (1, 2, 3, 4)

m = 6; R=Zmod(m) # m = 15
T = R.multiplication_table(names='digits '); print(T)

SageMath Example 5.15 calculates the multiplication Table 5.4. for a · i

(mod m ), with m = 17, a = 5 and a = 6, and i from 0 to 16, respectively.

SageMath Example 5.15: Multiplication Tables for a · i (mod m ) with

m = 17, a = 5, and a = 6
print ("\n# CHAP05 -- Sage -Script -SAMPLE 030: =========")

m = 17; a = 5
print( [mod(a * i, m) for i in range(m)] )
a = 6
print( [mod(a * i, m) for i in range(m)] )

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 273 — #79

i i

5.17 Examples Using SageMath 273

SageMath Example 5.15 (continued)

#------------------------------------
# CHAP05 -- Sage -Script -SAMPLE 030: =========
# [0, 5, 10, 15, 3, 8, 13, 1, 6, 11, 16, 4, 9, 14, 2, 7, 12]
# [0, 6, 12, 1, 7, 13, 2, 8, 14, 3, 9, 15, 4, 10, 16, 5, 11]

The function mod() returns an object that represents integers modulo m (in our
case m = 17).
The other multiplication table examples modulo 13 (Table 5.5) and modulo 12
(Table 5.6) can be computed similarly by replacing m = 17 with m = 13 and m =
12, respectively.

5.17.2 Fast Exponentiation

The fast exponentiation modulo m can be computed using the SageMath function
power_mod(). The result of this function is an integer.
With the SageMath Example 5.16 you can reproduce the idea of the square-
and-multiply method, as shown in the example in Section 5.6.4.

SageMath Example 5.16: Fast Exponentiation of a e mod m = 103

print ("\n# CHAP05 -- Sage -Script -SAMPLE 040: =========")

a = 87; m = 103
exp = [2, 4, 8, 16, 32, 43]
z = [power_mod(a, e, m) for e in exp]
print( type(z), "\n", z )

#------------------------------------
# CHAP05 -- Sage -Script -SAMPLE 040: =========
# <class 'list '>
# [50, 28, 63, 55, 38, 85]

5.17.3 Multiplicative Order

The order ordm (a ) of a number a in the multiplicative group Z∗m is the smallest
number i ≥ 1 such that a i ≡ 1 (mod m ) (see Section 5.9).
To create Table 5.7 we can print all exponentiation a i (mod 11) like in
SageMath Example 5.17.

SageMath Example 5.17: Table with All Powers a i (mod m ) for m = 11,
a, i = 1, ..., 10
print ("\n# CHAP05 -- Sage -Script -SAMPLE 050: =========")

m = 11
for a in range(1, m):
print( [power_mod(a, i, m) for i in range(1, m)] )

# E: adding a last column with the order of each 'a' mod (11)
# D: die letzte Spalte um die Ordnung des jeweiligen 'a' mod (11) ergänzen
print () # add an empty line between the two tables

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 274 — #80

i i

274 Introduction to Elementary Number Theory with Examples

SageMath Example 5.17 (continued)

for a in range(1, m):

lst = [power_mod(a, i, m) for i in range(1, m)]
lst.append( multiplicative_order(mod(a,m)))

for k in range(0,m-1): # some beautifier formatting

print ("{:>4}". format(lst[k]), end = '') # print first m-1 cells of current row
print ("{:>6}". format(lst[m-1])) # print last cell of current row

Table 5.8 gives examples for ord45 (a ) and the Euler number φ (45). SageMath
Example 5.18 constructs a table similar to Table 5.8. In addition, this sample uses
the method table in order to print the layout of the table.

SageMath Example 5.18: Table with All Powers a i (mod 45) for a, i =
1, . . . , 12 Plus the Order of a
print ("\n# CHAP05 -- Sage -Script -SAMPLE 060: =========")

tbl = []
m = 45
noCols=m; noRows=m # so whole table isprinted
noCols=13; noRows=13 # so smaller , more clear table isprinted

for a in range(1, noRows):

lst = [power_mod(a, i, m) for i in range(1, noCols)]
try:
lst.append( multiplicative_order (mod(a, m)))
except:
lst.append ("No mult. order ")
lst.append(euler_phi(m))
# print(lst)
tbl.append(lst) # build up a table from the single rows (lst)

print(table(tbl , align='center ')) # print whole table

The number ordm (a ) only exists if a is relatively prime to m, which can be

checked with gcd(a,m). For our example with m = 45 = 32 · 5, the values a =
3, 5, 6, 9, 10, 12, ... are not relatively prime to m and so have no multiplicative order.

Programming hint: In the code example 5.18, we put the calculation of the mul-
tiplicative order within a try-except block. So you can catch any exceptions or
errors raised by the function multiplicative_order(). If an exception or error is
raised in the try block, then we know that ordm (a ) does not exist for that particu-
lar value of a. Hence in the except block we append the string "No mult. order"
to the row represented by the object lst.
Table 5.9 displays exponentiation a i (mod 46) as well as the order ord46 (a ).
SageMath Example 5.19 creates such a table.

SageMath Example 5.19: Table with All Powers a i (mod 46) for a, i =
1, . . . , 23 Plus the Order of a
print ("\n# CHAP05 -- Sage -Script -SAMPLE 070: =========")

m = 46
print( euler_phi(m) ); print ()

for a in range(1, 24):

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 275 — #81

i i

5.17 Examples Using SageMath 275

SageMath Example 5.19 (continued)

lst = [power_mod(a, i, m) for i in range(1, 24)]
try:
lst.append( multiplicative_order(mod(a, m)))
except:
lst.append (" None ") # No multiplicative order exists for this 'a'
print(lst)

SageMath Example 5.20 generates Tables 5.10 and 5.11. It also delivers the
result in a way that can be easily processed in LaTeX. The prerequisite is that all
content is assigned to one SageMath object (here the matrix r ).
A note about SageMath Example 5.20, especially about the SageMath
indices:
• for x in range(2, 5) delivers 2,3,4.
• m = matrix(ZZ, 2, 5) has 2 rows and 5 columns. The cells are named
m(0,0) to m(1,4).
• All elements of the matrix have to be numerical, so “0” is used instead of
None as in the tables before.
• The output of matrices can be controlled in SageMath with:
sage: from sage.matrix.matrix import set_max_cols, set_max_rows
sage: set_max_cols(100)
sage: set_max_rows(100)

• The length of the cycle in the last column of the Tables 5.10 and 5.11 was
added manually.

SageMath Example 5.20: Code for Tables with All Powers a i (mod m ) for
Variables a and i Plus Order of a and Eulerphi of m
print ("\n# CHAP05 -- Sage -Script -SAMPLE 080: =========")

def power_mod_order_matrix(m, max_a , max_i):

r = matrix(ZZ , max_a+1, max_i+3)
for a in range(0, max_a+1):
r[a, 0] = a
for i in range(1, max_i+1):
if a==0:
r[a,i] = i
else:
r[a, i] = power_mod(a, i, m)
try:
r[a, max_i+1] = multiplicative_order(mod(a, m))
except:
r[a, max_i+1] = 0
r[a, max_i+2] = euler_phi(m)
return r

print ("\n#1: m=45; max_i=13; max_a=13"); m=45; max_i=13; max_a=13

r = power_mod_order_matrix (m, max_a , max_i)
print(r); print( latex(r) )

print ("\n#2: m=46; max_i=25; max_a=25"); m=46; max_i=25; max_a=25

r = power_mod_order_matrix (m, max_a , max_i)
print( r.str() ); print( latex(r) )

print ("\n#3: m=14; max_i=13; max_a=16"); m=14; max_i=13; max_a=16

r = power_mod_order_matrix (m, max_a , max_i)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 276 — #82

i i

276 Introduction to Elementary Number Theory with Examples

SageMath Example 5.20 (continued)

print(r); print( latex(r) )

print ("\n#4: m=22; max_i=21; max_a=25"); m=22; max_i=21; max_a=25

r = power_mod_order_matrix (m, max_a , max_i)
print( r.str() ); print( latex(r) )

5.17.4 Primitive Roots

Section 5.9 explained what primitive roots are and why they are useful. Computing
a primitive root in SageMath is very straightforward. If n is an integer, the com-
mand primitive_root(n) delivers one primitive root of the multiplicative group
Z∗n , if such a primitive root exists. If n is prime then this is the same as calculat-
ing a primitive root of Zn . If the number under consideration is a prime number,
primitive_root(n) returns the smallest primitive root. For nonprimes this is not
always the case as you can see with m = 10, when comparing SageMath Example
5.21 and SageMath Example 5.22: The function returns 7 instead of 3.

a. SageMath Example 5.21.

The example 5.21 calculates primitive roots of a few integers first from some
nonprimes (see the special cases of Theorem 5.14 in the remarks there), then
from the first 15 primes.

SageMath Example 5.21: Calculating One Primitive Root for a Given

Number
print ("\n# CHAP05 -- Sage -Script -SAMPLE 090: =========")

Chapter 9), here Python is used to import just this function from another Sage
script.
To do so, the importing script uses the function my_import which gets as
argument the function name to be imported. my_import then uses Python
mechanisms to load and execute the function from the argument from
another Sage script. Note that the function is loaded from the Python ver-
sion of the Sage script. This works because SageMath always transforms
the Sage script into a Python script, which then is performed (sample:
When calling $sage myfile.sage a Python file myfile.sage.py is created
from myfile.sage). Currently, this my_import function is written in such
a way that it requires a typical Unix shell (e.g., Bash), which is available
by default on Linux and macOS. However, this also works on Windows,
where SageMath for Windows is delivered together with the Bash shell (see
https://wiki.sagemath.org/SageWindows).

The call within the own Sage script looks like this:
my_import("chap05_sample100", "enum_PrimitiveRoots_of_an_Integer").

SageMath Example 5.23: Function count_PrimitiveRoots_of_an

_IntegerRange to Calculate the Number of All Primitive Roots for a
Given Range of Integers
print ("\n# CHAP05 -- Sage -Script -SAMPLE 110: =========")

# function to include 'enum_PrimitiveRoots_of_an_Integer ' from another sage script

def my_import(module_name , func_name ='*'):
import os
command = 'sage --preparse ' + module_name + '.sage '
os.system(command) # this creates the *. sage.py file
command = 'mv ' + module_name + '.sage.py ' + module_name + '.py '
os.system(command) # this renames *. sage.py to *.py
from sage.misc.python import Python; python = Python ()
command = 'from ' + module_name + ' import ' + func_name
python.eval(command , globals ())

def count_PrimitiveRoots_of_an_IntegerRange (start , end , bPrimesOnly=True , �

� bShowPrimRoots=False):
"""
Compute all primitive roots of all numbers between start and end ,
inclusive , and count them.
- If the flag bPrimesOnly is True (default), it performs primality tests and the �
� calculation is
done only for primes (printing the count of found primes from start to end , �
� inclusive).
- If the flag bPrimesOnly is False , it not only considers the primes; and it
additionally prints the count of (even) numbers which have NO primitive root.
- If the flag bShowPrimRoots is True (default is False), then the primitive roots �
� are listed too.
"""
nCheckedNumb = 0
nCheckedNumb_WithoutPrimitivRoots = 0
nPrimitiveRoots = 0
for n in range(start , end+1):
if bPrimesOnly:
if is_prime(n):
nCheckedNumb += 1
L = enum_PrimitiveRoots_of_an_Integer (n)
nPrimitiveRoots += len(L)
if bShowPrimRoots == True: print ("n=%d, lenL =%d, L=%s" % (n, len(L), L))
else:
nCheckedNumb += 1

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 279 — #85

i i

5.17 Examples Using SageMath 279

SageMath Example 5.23 (continued)

L = enum_PrimitiveRoots_of_an_Integer (n)
if L== None:
nCheckedNumb_WithoutPrimitivRoots += 1
else:
nPrimitiveRoots += len(L)
if bShowPrimRoots == True: print ("n=%d, lenL =%d, L=%s" % (n, len(L), L))

if bPrimesOnly:
print (" Found all %s" % nPrimitiveRoots + \
" primitive roots of %s prime(s)." % nCheckedNumb)
else:
if nCheckedNumb_WithoutPrimitivRoots == 0:
print (" Found all %s " % nPrimitiveRoots + \
"primitive roots of %s number(s)." % nCheckedNumb)
else:
print (" Found all %s " % nPrimitiveRoots + \
"primitive roots of %s number(s)." % \
(nCheckedNumb - nCheckedNumb_WithoutPrimitivRoots ))
print ("( Total of numbers checked: %s " % nCheckedNumb + \
"Amount of numbers without primitive roots: %s)" % \
nCheckedNumb_WithoutPrimitivRoots )

import time # in sage scripts: measure time like in Python and calculate execution time

my_import (" chap05 _sample100",

" enum_PrimitiveRoots_of_an_Integer ") # import from 'chap05 _sample100.sage '

print ("\ nBB_Start -- Testcases for count_PrimitiveRoots_of_an_IntegerRange (start , end , �

� bPrimesOnly , bShowPrimRoots)")

print ("\n-----------Testcase 1: (5, 6)"); StartTime = time.time () # default: only prime �

� moduli in range
count_PrimitiveRoots_of_an_IntegerRange (5, 6)
print (" Time = %f sec" % (time.time ()-StartTime))
...

According to the test cases with the function count_PrimitiveRoots_of

_an_IntegerRange, it becomes clear that the elapsed time for the calcula-
tion raises exponentially with the size of the numbers. In 2021, on a modern
PC and with SageMath 9.3, the execution of the script in 5.23 lasted more
than 5 minutes. Most of that time was spent on the last test case.
d. SageMath Example 5.24
The example 5.24 counts how many primitive roots are there in a
given primes range and enumerates all these primitive roots. Function
enum_PrimitiveRoots_of_an_Integer is used, which lists all primitive
roots of a single prime number p. This function again is included from a
Sage script already introduced before.
From this list of primitive roots, we can determine the smallest and largest
primitive root for Z p , as well as count the number of primitive roots of Z p .

SageMath Example 5.24: Function count_PrimitiveRoots_of_a

_PrimesRange to Calculate the Number of Primitive Roots for a Given
Range of Primes
print ("\n# CHAP05 -- Script -SAMPLE 120: =========")
...
def count_PrimitiveRoots_of_a_PrimesRange (start , end):
"""

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 280 — #86

i i

280 Introduction to Elementary Number Theory with Examples

SageMath Example 5.24 (continued)

Compute all primitive roots of all primes between start and end , inclusive.
"""
nPrimes = 0
nPrimitiveRoots = 0
for p in primes(start , end+1):
L = enum_PrimitiveRoots_of_an_Integer (p)
print(p, len(L))
nPrimes += 1
nPrimitiveRoots += len(L)
print (" Found all %s" % nPrimitiveRoots + " primitive roots of %s primes ." % nPrimes �
�)

import time
my_import (" chap05 _sample100", " enum_PrimitiveRoots_of_an_Integer ")

print ("\ nCC_Start -- Testcases for count_PrimitiveRoots_of_a_PrimesRange (start , end)")

print("-----------Testcase: (1, 1500)"); StartTime = time.time ()
count_PrimitiveRoots_of_a_PrimesRange (1, 1500)
print (" Time = %f sec" % (time.time ()-StartTime))

e. SageMath Example 5.25

A variant of the function count_PrimitiveRoots_of_a_PrimesRange from
SageMath Example 5.25 (slightly modified by Minh Van Nguyen for the
example 5.25) was used to generate a database of all primitive roots of all
primes between 1 and 100,000.

SageMath Example 5.25: Code to Generate the Database with All

Primitive Roots for All Primes Between 1 and 100000
print ("\n# CHAP05 -- Sage -Script -SAMPLE 130: =========")
...
my_import (" chap05 _sample100", " enum_PrimitiveRoots_of_an_Integer ")
start = 1
end = 100 # 10^5 needs ca. an hour. For testing choose end=100
fileName = "./ primroots.dat"
print ("... Creating file %s with end=%d" % (fileName , end))
file = open(fileName , "w")
for p in primes(start , end+1):
L = enum_PrimitiveRoots_of_an_Integer (p)
# print(p, len(L)) # just temporarily active
# Output to a file. The format is:
# (1) the prime number p under consideration
# (2) the number of primitive roots of Z/pZ
# (3) all the primitive roots of Z/pZ
file.write(str(p) + " " + str(len(L)) + " " + str(L) + "\n")
file.flush ()
file.close ()

This code and the function enum_PrimitiveRoots_of_an_Integer were

also executed in a Sage script noninteractively. In 2016, it took about 6 hours
on a then-modern PC with SageMath 7.2. In 2021, it took less than 1 hour
with SageMath 9.3.
Between 1 and 100,000 there are 9,592 primes. For them, more than 169
million primitive roots have been calculated. For each prime p > 3 it holds
that between 20% and almost 50% of all integers between 1 and p are a cor-
responding primitive root. So, prime numbers have relatively many primitive
roots.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 281 — #87

i i

5.17 Examples Using SageMath 281

The resulting file primroots.dat is a database of all primitive roots of all

primes between 1 and 100,000 inclusive. It is a large file (about 1.1 GB
uncompressed, and 153 MB compressed with 7Zip). You can find the com-
pressed file at https://www.cryptool.org/download/ctb/primroots_1-
100000.7z. Its content looks like this (prime number, count, and set of
primitive roots):
2 1 {1}
3 1 {2}
5 2 {2, 3}
7 2 {3, 5}
11 4 {8, 2, 6, 7}
...
89 40 {3, 6, 7, 13, 14, 15, 19, ..., 66, 70, 74, 75, 76, 82, �
� 83, 86}
97 32 {5, 7, 10, 13, 14, 15, 17, ..., 76, 80, 82, 83, 84, 87, �
� 90, 92}
...
99989 42840 {2, 3, 8, 10, 11, 13, 14, ..., 99978, 99979, 99981, 9998 �
� 6, 99987}
99991 24000 {65539, 6, 65546, 11, 12, ..., 65518, 65520, 87379, 6552 �
� 6, 65528}

f. SageMath Example 5.26

The example 5.26 calculates all primitive roots for all primes up to one mil-
lion, and outputs for each prime number four values: the prime number, the
number of different primitive roots, and its smallest and its biggest primitive
root.
SageMath Example 5.26: Code to Generate the Database with the
Smallest Primitive Root for All Primes Between 1 and 1000000
print ("\n# CHAP05 -- Sage -Script -SAMPLE 140: =========")
...
my_import (" chap05 _sample100", " enum_PrimitiveRoots_of_an_Integer ")

import time
StartTime = time.time ()
start = 1 # normal value: 1 // test value: 10^5+3
end = start+200 # normal value: 10^6 // start+200 test value
fileName = "./ primroot -smallest_up -to -one -million.dat"
print ("... Creating file %s with start =%d, end=%d" % (fileName , start , end))
file = open(fileName , "w")
file.write (" Used parameters: start = " + str(start) + ", end = " + str(end) + "\n")
file.write("---- StartTime: " + str(StartTime) + " sec ----\n")
file.flush ()
for p in primes(start , end+1):
L = enum_PrimitiveRoots_of_an_Integer (p)
# - To commandline , output only p and number of prim roots of Z_p
# print(p, len(L)) # just temporarily active to see where you are while testing
# - To file , output much more in the following format:
# (1) the prime number p
# (2) the number of primitive roots of Z_p
# (3) the smallest primitive root of Z_p
# (4) the largest primitive root of Z_p
LL = sorted(L) # sort necessary as the smallest primroot is
# not always found first (see L of p=43)
file.write(str(p) + " " + str(len(L)) + " " + str(LL[0]) + " " + str(LL[-1]) + "\n �
� ")
file.flush ()

EndTime = time.time (); EllapsedTime = EndTime -StartTime

file.write("---- EndTime: " + str(EndTime) + " sec ----\n")

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 282 — #88

i i

282 Introduction to Elementary Number Theory with Examples

SageMath Example 5.26 (continued)

file.write("---- EllapsedTime: " + str(EllapsedTime) + " sec ----\n")
file.flush ()
file.close ()

SageMath Example 5.26 was stopped after several weeks (running on

a modern PC with SageMath 7.2) after investigating all primes up to
half a million. The result was stored in the file primroot_number-
of-and-smallest_up-to-prime-500107.dat, which is 617 kB uncom-
pressed, and 178 kB compressed with 7Zip. You can find the compressed
file at https://www.cryptool.org/download/ctb/primroot_number-of-
and-smallest_up-to-prime-500107.7z.
This file contains all primes p between 1 and 500,107 together with the cor-
responding number of primitive roots and the corresponding smallest prime
root mod p. It holds that the number of primitive roots (for p > 3) is always
an odd number. The number of primitive roots modulo a prime p is always
equal to φ (φ ( p )) = φ ( p − 1) because the set of in Z p invertible elements
forms the cyclic group Z∗p and this group has p − 1 elements and φ ( p − 1)
generators, the latter being exactly the primitive roots modulo p. A cyclic
group of order n has always φ (n ) generators. A proof for this can be found
in [53, p. 36].
So this file may be interesting to some number theorists. Its content looks like
this:
2 1 1
3 1 2
5 2 2
7 2 3
11 4 2
13 4 2
17 8 3
...
99989 42840 2
99991 24000 6
100003 28560 2
...
500069 250032 2
500083 151520 2
500107 156864 2

If you are looking only for the smallest primitive root, then this script could
be accelerated dramatically by applying mathematical theory and searching
more directly for possible candidates (instead of first generating all primitive
roots with enum_PrimitiveRoots_of_an_Integer).
g. SageMath Example 5.27
The database file primroots_1-100000.dat from the SageMath Example
5.27 then was used as input to create three graphics using the example 5.27).
For a change, here the Sage code is from the CLI instead of from a Sage script.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 283 — #89

i i

5.17 Examples Using SageMath 283

SageMath Example 5.27: Code to Generate the Graphics about the

Primitive Roots (for Big Datasets)
sage: # open a database file on primitive roots for p between 1 �
� and 100,000
sage: file = open ("./ primroots.dat", "r")
sage: plist = [] # list of all primes between 1 and 100,000
sage: nlist = [] # number of primitive roots modulo prime p
sage: minlist = [] # smallest primitive root modulo prime p
sage: maxlist = [] # largest primitive root modulo prime p
sage: for line in file:
....: # get a line from the database file and tokenize it for �
� processing
....: line = line.strip ().split (" ", 2)
....: # extract the prime number p in question
....: plist.append(Integer(line[0]))
....: # extract the number of primitive roots modulo p
....: nlist.append(Integer(line[1]))
....: # extract the list of all primitive roots modulo p
....: line = line[-1]
....: line = line.replace ("{", "")
....: line = line.replace ("}", "")
....: line = line.split(", ")
....: # sort the list in non -decreasing order
....: line = [Integer(s) for s in line]
....: line.sort ()
....: # get the smallest primitive root modulo p
....: minlist.append(line[0])
....: # get the largest primitive root modulo p
....: maxlist.append(line[-1])
....:
sage: file.close () # close the database file
sage: # plot of number of primitive roots modulo p
sage: nplot = point2d(zip(plist , nlist), pointsize=1)
sage: nplot.axes_labels (["x", "y"])
sage: nplot
sage: # plot of smallest primitive root modulo prime p
sage: minplot = point2d(zip(plist , minlist), pointsize=1)
sage: minplot.axes_labels (["x", "y"])
sage: minplot
sage: # plot of largest primitive root modulo prime p
sage: maxplot = point2d(zip(plist , maxlist), pointsize=1)
sage: maxplot.axes_labels (["x", "y"])
sage: maxplot

Figure 5.11 graphs the number of primitive roots for each prime between 1
and 100,000. The x-axis represents primes between 1 and 100,000, while the
y-axis counts the number of primitive roots for each prime.
Figure 5.12 graphs the smallest primitive roots of all primes between 1 and
100,000. The x-axis again represents primes between 1 and 100,000. The
y-axis represents the smallest primitive root of each prime.
Figure 5.13 shows a corresponding graph for the largest primitive root of
each prime within the same interval between 1 and 100,000.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 284 — #90

i i

284 Introduction to Elementary Number Theory with Examples

Figure 5.11 The number of primitive roots of all primes between 1 and 100,000.

Figure 5.12 The smallest primitive roots of all primes between 1 and 100,000.

h. SageMath Example 5.28

In order to do some experiments, another much smaller dataset was used,
which considered only the primitive roots for primes in the range between 1
and 100. The database file primroots.dat from SageMath Example 5.24
then was used as input to create three graphics using the SageMath
Example 5.28 (Figure 5.14).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 285 — #91

i i

5.17 Examples Using SageMath 285

Figure 5.13 The largest primitive roots of all primes between 1 and 100,000.

Figure 5.14 Number and smallest and biggest primitive root for all primes up to 100.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 286 — #92

i i

286 Introduction to Elementary Number Theory with Examples

SageMath Example 5.28: Code to Generate the Graphics about the

Primitive Roots
print ("\n# CHAP05 -- Sage -Script -SAMPLE 150: =========")

def Display(F,fname):
...

# Open a database file on primitive roots (the file "primroots.dat" used

# here , was created with the script chap05 _sample130.sage where end=100)
file = open ("./ primroots.dat", "r")
plist = [] # list of all primes between 1 and 100
nlist = [] # number of primitive roots modulo prime p
minlist = [] # smallest primitive root modulo prime p
maxlist = [] # largest primitive root modulo prime p
for line in file:
# get a line from the database file and tokenize it for processing
line = line.strip ().split (" ", 2)
# extract the prime number p in question
plist.append(Integer(line[0]))
# extract the number of primitive roots modulo p
nlist.append(Integer(line[1]))
# extract the list of all primitive roots modulo p
line = line[-1]
line = line.replace ("{", "")
line = line.replace ("}", "")
line = line.split(", ")
# sort the list in non -decreasing order
line = [Integer(s) for s in line]
line.sort ()
# get the smallest primitive root modulo p
minlist.append(line[0])
# get the largest primitive root modulo p
maxlist.append(line[-1])

file.close () # close the database file

print (" length :", len(nlist)) # print just for test purposes
print (" plist :", plist) # print just for test purposes
print (" nlist :", nlist) # print just for test purposes
print (" minlist :", minlist) # print just for test purposes
print (" maxlist :", maxlist) # print just for test purposes
print (" zipped :", list(zip(plist , nlist))) # just for test purposes

# Generate 3 graphics:
# 1) Plot of number of primitive roots modulo p
# a) Either plot with the 2D plotting class "point2d()" built in sage
# Remark 1: If you have a smaller primes range , use bigger
# pointsize values or get rid of this parameter
# For huge sets , "pointsize=1" is fine.
# Remark 2: point2d() has no option "plotjoined=True"
# nplot = point2d(zip(plist , nlist))
# nplot = point2d(zip(plist , nlist), pointsize=1)
# b) or plot with "list_plot ()"
# Remark 3: "list_plot(list(zip(plist , nlist))" needs cast with list ()
# and has same result as "point2d(zip(plist , nlist))"
# Remark 4: list_plot () has option "plotjoined=True" to connect the
# points. This gives a better impression for smaller sets.
nplot = list_plot(list(zip(plist , nlist)), plotjoined=True)
nplot.axes_labels (["p", "number of primitive roots "])
Display(nplot , "Plot_number -of -all -primitive -roots ")

# 2) Plot of smallest primitive root modulo prime p

## minplot = point2d(zip(plist , minlist), pointsize=1)
minplot = list_plot(list(zip(plist , minlist)), plotjoined=True)
minplot.axes_labels (["p", "smallest primitive root "])
Display(minplot , "Plot_smallest -prim -root ")

# 3) Plot of largest primitive root modulo prime p

## maxplot = point2d(zip(plist , maxlist), pointsize=1)
maxplot = list_plot(list(zip(plist , maxlist)), plotjoined=True)
maxplot.axes_labels (["p", "biggest primitive root "])
Display(maxplot , "Plot_biggest -prim -roots ")

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 287 — #93

i i

5.17 Examples Using SageMath 287

Figure 5.14 contains three graphs. The x-axis always represents the primes
between 1 and 100. In the top-left graph the y values are the number of
primitive roots for each prime. In the graph in the top-right, the y value
represents the smallest primitive root for the corresponding prime number.
In the bottom graph, the y value represents the biggest primitive root for the
corresponding prime number.
Compared to SageMath Example 5.27, here list_plot() is used which
offers the option plotjoined. This option connects the single points with
lines, which only makes sense for small sets of points.
i. SageMath Example 5.29

SageMath Example 5.29: Code to Generate an in Zn Invertible Element

of Maximal Order
sage: n=45 #change n as desired
sage: l=list(factor(n))
sage: ms=[l[i][0]^l[i][1] for i in range(len(l))]
sage: dim=len(ms)
sage: m=[ primitive_root(ms[i]) for i in range(dim)]
sage: maxelt=crt(m,ms)
sage: maxelt
2

5.17.5 RSA Examples with SageMath

Below is SageMath source code for the simple RSA examples in Section 5.14.

Example in Section 5.14.2:

SageMath Example 5.30 executes the RSA exponentiation M 37 (mod 3713) on
message M = 120.

SageMath Example 5.30: RSA Exponentiation

sage: power_mod(120, 37, 3713)
1404

Example in Section 5.14.3:

SageMath Example 5.31 executes the factorization of φ (256027) = 255016 = 23 ·
127 · 251.

SageMath Example 5.31: Factoring a Number

sage: factor(255016)
2^3 * 127 * 251

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 288 — #94

i i

288 Introduction to Elementary Number Theory with Examples

Example in Section 5.14.3:

SageMath Example 5.32 encrypts the integer values of a sequence of ASCII
characters. SageMath can do RSA encryption as follows:

SageMath Example 5.32: RSA Encryption by Modular Exponentiation of the

Number Values of the Characters of a Message
sage: A = [82, 83, 65, 32, 119, 111, 114, 107, 115, 33]
sage: e = 65537; m = 256027
sage: [power_mod(a, e, m) for a in A]
[212984, 25546, 104529, 31692, 248407, 100412, 54196, 100184, 58179, 22 �
� 7433]

Example in Section 5.14.3:

SageMath Example 5.33: RSA Encryption Using SageMath

sage: A = [21075, 16672, 30575, 29291, 29473]
sage: e = 65537; m = 256027
sage: [power_mod(a, e, m) for a in A]
[158721, 137346, 37358, 240130, 112898]

Example in Section 5.14.3:

SageMath Example 5.34: RSA Encryption Using SageMath

sage: A = [82083, 65032, 119111, 114107, 115033]
sage: e = 65537; m = 256027
sage: [power_mod(a, e, m) for a in A]
[198967, 51405, 254571, 115318, 14251]

5.17.6 How Many Private RSA Keys d Exist within a Given Modulo Range?
The RSA encryption procedure was described in Section 5.10.2. Steps 1 to 3
constitute key generation, steps 4 and 5 are the encryption:
1. Select two distinct random prime numbers p and q and calculate n = p · q.
The value n is called the RSA modulus.
2. Select an arbitrary e ∈ {2, · · · , n − 1} such that:
e is relatively prime to φ (n ) = ( p − 1) · (q − 1).
We can then throw away p and q.
3. Select d ∈ {1, · · · , n − 1} with e · d ≡ 1 (mod φ (n )).
That is, d is the multiplicative inverse of e modulo φ (n ). We can then throw
away φ (n ).
→ (n, e) is the public key P.
→ (n, d ) is the private key S (only d must be kept secret).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 289 — #95

i i

5.17 Examples Using SageMath 289

4. For encryption, the message represented as a (binary) number is divided into

parts such that each part of the number represents a number less than n.
5. Encryption of the plaintext (or the parts of it) M ∈ {1, · · · , n − 1}:
C = E ((n, e), M ) = M e (mod n ).
To crack a given RSA ciphertext C, the default way would be to use the public
key of the recipient and to try to factorize n. Then you can go through the steps 2
and 3 and generate the private key e, which is normally used to decrypt a ciphertext.
According to the prime number theorem (see Theorem 4.7), the number of
prime numbers P I (x ) is asymptotic to x /ln (x ). Between 1 and a given n there are
about n /ln (n ) different primes.
If you don’t want to use factorization you may ask a question like in classic
encryption: Does an exhaustive search over all keys make sense? Therefore, you
may want to find out how many possible private keys (n, d ) are there for a given n
or for a given range n ∈ [a, b]? Section 5.8.5 deals with the special case n = 26.
The general question is answered by the function count_Number_of_RSA_Keys
(if the modulus is not too big). This function is defined in SageMath Example 5.35.
Some remarks to the code of sample 5.35:
a. Calling sage: count_Number_of_RSA_Keys(100,1000) means to consider
the interval [100, 1000] for n. We define n by the two primes p and q as:
n = p · q. So here one prime can have the maximal value 500 because 2 ·
500 = 1000 (while then the other prime will have the smallest possible prime
value 2).
• The number of primes in the given range is 143 (prime_pi(1000) -
prime_pi(100) = 168 - 25).
• The number of possible combinations of primes is comb = 258.
• The number of private keys is 34816.
b. Calling sage: count_Number_of_RSA_Keys(100, 100, True) has the fol-
lowing output:
• Number of private keys for modulus in a given range: 0.
• Number of primes in a given range: 0.
The reason for that is with this call only n = 100 is considered, and the
function investigates only semiprime n: 100 is not semiprime, which means
100 is not the product of only two primes.
c. The output of count_Number_of_RSA_Keys with either the parameters
(26,26,True) or (713,713,True) shows, that if there is only one possible
combination of primes (as only one semiprime n is given), then the number
of elements in {R 2} \ {1} is the number of possible keys: With n = 26 = 2 · 13
there are three possible private keys (there are three elements > 1 in R 2, so
d can be 5, 7, or 11); with n = 713 = 23 · 31 there are 159 possible private
RSA keys.
The program also calculates the reduced residue systems: The identifiers R 0 and
R 00 defined in Section 5.8.5 are referred to as R 1 and R 2 in SageMath Example 5.35.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 290 — #96

i i

290 Introduction to Elementary Number Theory with Examples

SageMath Example 5.35: How Many Private RSA Keys d Are There If You
Know a Range for the Public Key n?
print ("\n# CHAP05 -- Sage -Script -SAMPLE 160: =========")

def count_Number_of_RSA_Keys(start , end , Verbose=False):

"""
How many private RSA keys (n,d) exist , if only modulus n is given , and start <= n <= end?
- prime_range(u,o) delivers all primes >=u und < o
- verbose = True adds p,q,n, phi , residue systems ,
verbose = 2 in addition prints the primelist and the given ranges.
"""
a = start; b = end
s = 0; comb = 0

for p in prime_range(1, b/2+1):

for q in prime_range(p+1, b/2+1):
n=p*q
if Verbose >=3:
print (" Considered primes: p=%d, q=%d, n=%d" % (p,q,n))
if a <= n and n <= b:
comb = comb+1
eu1 = euler_phi(n)
eu2 = euler_phi(eu1)-1 # Not only determine the count , but also the values of d.
s = s + eu2
if Verbose:
print ("p=%s, q=%s, n=%s, eu1=%s, eu2=%s, s=%s" % (p, q, n, eu1, eu2, s))
R1=n.coprime_integers(n)
print ("R1: ", [i for i in R1])
R2=[i for i in R1 if gcd(i,eu1)==1 and i <= eu1]
print ("R2: ", [i for i in R2])

print (" Number of private keys d for moduli in given range: %s (# comb =%s), " % (s, comb))

# Just for comparison: How many primes are in the given moduli range [a,b+1]?
if Verbose:
s = 0
primeslist =[]
for p in prime_range(a, b+1):
if Verbose >=2:
primeslist.append(p)
s = s + 1
if Verbose >=2:
print (" a=%s, " % a + "b=%s, " % b + "\n primeslist =%s" % primeslist)
print (" Number of primes in given range: %s" % s)

import time

print ("\ nDD_Start -- Testcases for count_Number_of_RSA_Keys (start , end)")

print ("\n-----------Testcase: Modulus n=26, verbose=True (just 1 semi prime number)")

count_Number_of_RSA_Keys (26, 26, True)

print ("\n-----------Testcase: Modulus n=713, verbose=True (just 1 semi prime number)")

count_Number_of_RSA_Keys (713, 713, True)

print ("\n-----------Testcase: Moduli range = (100, 107, True)[this range has 3 primes ]")
StartTime = time.time ()
count_Number_of_RSA_Keys (100, 107, True)
print (" Time = %f sec" % (time.time ()-StartTime))
...

As there are so many private keys (n, d ) within a bigger range of values for
n, even brute-force factoring would be more efficient than brute-force trying all
possible private keys.

5.17.7 RSA Fixed Points m ∈ {1, ..., n − 1} with m e = m mod n

Encryption methods can also have fixed plaintext messages where the correspond-
ing ciphertext matches the original. In mathematics, variables mapped by the

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 291 — #97

i i

5.17 Examples Using SageMath 291

algorithm (function) onto themselves are called fixed points. In cryptography the
corresponding messages are called unconcealed messages.
Generally speaking, fixed points are undesirable and there should be only very
few compared to the overall range of possible messages. The more fixed points an
encryption algorithm contains, the easier it is to break it. To deal with this and
with rainbow attacks, in practice random padding like in OAEP is added (instead
of using textbook RSA).
With the RSA procedure, n = pq is the product of two different prime numbers,
and there exists e where gcd (e, ( p − 1)(q − 1)) = 1. The encryption is then c = m e
mod n. A fixed point in the RSA procedure is a message m, where: m = m e mod n.
The result of the encryption is the given message.
When the size of n is sufficiently big, the probability of the occurrence of fixed
points in RSA is very small, as illustrated in Figure 5.16. Experimentally, we found
on average not more than 40 fixed points for a single given RSA parameter set.
Students often presume the occurrence of fixed points is high, because they
encounter a relatively large number of fixed point examples when experimenting
with small prime numbers, as m = 0, 1, n − 1 are always fixed points with RSA.
In practice, where large prime numbers are chosen, fixed points have no signif-
icance for the security of RSA. Therefore, here we refer more to the mathematical
questions.

5.17.7.1 The Number of RSA Fixed Points

In this section we show how many RSA fixed points m ∈ {1, . . . , n − 1} exist with
m e = m mod n for a fixed key e. Theorem 5.17 gives an answer to this question.
For the proof of Theorem 5.17 we first need Theorem 5.16.
This theorem characterizes roots of unity, these are numbers x that equal 1
when raised to some integer power n. A priori they can be complex numbers but in
our context they are integers mod n: An nth root of unity x is primitive if there is
no kth root of unity for all integers k smaller than n:

x n = 1 and x k 6= 1 (k = 1, 2, 3, ..., n − 1)

If F is a finite field and n is a positive integer, then an n-th root of unity in F is a

solution of the equation
x n − 1 = 0 in F

Keep in mind that Zn is a field if n is prime.

Theorem 5.16 from [54, p. 69] characterizes all (e − 1)th roots of unity in Z∗p :
Theorem 5.16 g α is a (e − 1)th root of unity in Z∗p if and only if (e − 1)α =
0 mod p − 1. There are gcd ( p − 1, e − 1) of these.
Proof
The first theorem results directly from the little theorem from Fermat (Theorem
5.12):
g α (e−1) = 1 mod p ⇒ α (e − 1) = 0 mod p − 1
α (e−1)
1
Let δ = gcd ( p− 1, e− 1). Then α (e− 1) = 0 mod p− 1 implies δ = 0 mod p− δ .
p− 1
Since e−δ 1 and δ are coprime (each was reduced by the gcd of their corresponding

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 292 — #98

i i

292 Introduction to Elementary Number Theory with Examples

p−1
numerator), α must be a multiple of δ .

p−1
α=k· for some k ∈ Z
δ
For k = 1, . . . , δ we get δ different possible values for α. For larger k write k =
r · δ + k 0 with k 0 < δ, then we have
p−1 p−1 0 p−1 0 p−1 0 p−1
g α = g k·
0
δ = g (r ·δ+k )· δ = gr ·( p−1)+k · δ = (gr ) p−1 · g k · δ = gk · δ

and therefore k was already counted by k 0 , so we can assume k ≤ δ. The negative

case k can be omitted. Indeed for the power g α of the generator g we did not require
α ≥ 0, but we could, because for every natural number β > 0 we have g −β =
(g −1 )β , this is a positive power of the multiplikative inverse of g. The span of a
group element g and its inverse g −1 is always the same. So if α = −β for some
β > 0, it was already counted.
So then these δ different powers (k = 1, . . . , δ) correspond to the (e − 1)th roots
p−1
of unity g k· δ mod p in Z∗p .
Analog for q: For m e−1 = 1 mod q we then have gcd (q − 1, e − 1) many of
(e − 1)th roots of unity.
Theorem 5.17 The number of RSA fixed points m ∈ {1, . . . , n − 1} for an e with
gcd(e, ϕ (n )) = 1 is
gcd( p − 1, e − 1) · gcd(q − 1, e − 1).

Proof
Given m e = m mod n. According to the Chinese remainder theorem (CRT), the
following statements are equivalent:

[m e = m mod n ] ⇔ [m e = m mod p and m e = m mod q ]

Furthermore, the decomposition on the right side is equivalent to:

m e−1 = 1 mod p and m e−1 = 1 mod q.

We consider m e−1 = 1 mod p and search all (e − 1)th roots of unity in Z∗p .
We know that Z∗p for a prime p is cyclic. ⇒ A generator g exists which
produces Z∗p : Z∗p =< g >.
Theorem 5.16 delivers the remaining part.
The number of combinations of the (e − 1)th root of unity in Z∗p and Zq∗ gives
the total quantity of RSA fixed points: m e = m mod n with m ∈ {1, ..., n − 1}:

gcd( p − 1, e − 1) · gcd(q − 1, e − 1)

Adding m = 0 to the above, results in the Theorem 5.18:

Theorem 5.18 If m ∈ {0, ..., n − 1}, then the quantity of the RSA fixed points is:

(gcd ( p − 1, e − 1) + 1) · (gcd (q − 1, e − 1) + 1)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 293 — #99

i i

5.17 Examples Using SageMath 293

5.17.7.2 Lower Bound for the Quantity of RSA Fixed Points

In the following section, we show that there is a lower bound for the quantity of
RSA fixed points. This lower bound 6 exists when the two different RSA prime
numbers are the smallest possible values (2 and 3).

Theorem 5.19 Given: p = 2, q = 3

The quantity of RSA fixed points is (gcd ( p − 1, e − 1) +1) · (gcd (q − 1, e − 1) +1)
| {z } | {z }
=1 =2
=2·3=6

Theorem 5.20 Given: p 6= q ; p > 2, q > 2

The quantity of RSA fixed points is ≥ 9.

Proof
Each value ( p − 1) and (q − 1) is even for primes p, q > 2.
The RSA algorithm requires choosing e so that 1 < e < φ (n ) = ( p − 1)(q − 1) and
gcd (e, ( p − 1)(q − 1)) = 1.
Since ( p − 1) and (q − 1) are even, e is odd ⇒ e − 1 is even.
Since ( p − 1) and (e − 1) are even, then: gcd ( p − 1, e − 1) ≥ 2.
⇒ (gcd ( p − 1, e − 1) + 1) ≥ 3 and (gcd (q − 1, e − 1) + 1) ≥ 3.
⇒ (gcd ( p − 1, e − 1) + 1) · (gcd (q − 1, e − 1) + 1) ≥ 9.
Samples with m ∈ {0, ..., n − 1}:

• For (e, n ) = (17, 6) with 6 = 2 · 3, all six possible messages {0, 1, 2, 3, 4, 5}

are fixed points (for n = 6, it is independent of the value of e).
• For (e, n ) = (17, 10) with 10 = 2 · 5, all 10 possible messages are fixed
points.
• For (e, n ) = (19, 10), only 6 of the 10 possible messages are fixed points.
• For (e, n ) = (7, 55) with 66 = 5 ∗ 11, 9 of the 55 messages
({0, 1, 10, 11, 21, 34, 44, 45, 54}) are fixed points. Figure 5.15 visualizes this.

Figure 5.15 was taken from “RSA Visual and More,” https://www.cryptool
.org/en/cto/rsa-visual. In this plugin you can either see with graphics how
RSA assigns its input values when encrypting, or you can test textbook RSA with
big numbers (and subject keys in a key store to simulate certificate exchange in
a pupils’ lab), or use RSA with OAEP, padding, and certificates as it is used in
practice.

5.17.7.3 Unfortunate Choice of e (Weak or Unsuitable e)

5.17.7.4 An Empirical Estimate of the Quantity of Fixed Points for Growing Moduli
In this section, we make an empirical estimate of the quantity of fixed points for
growing moduli (here we did not use weak e as in Section 5.17.7.3).
For this, we randomly choose p and q from the six following ranges each char-
acterized by its lower and upper bound: (22 , 210 ), (210 , 220 ), (220 , 240 ), (240 , 280 ),
(280 , 2160 ), (2160 , 2320 ).
For each range, 10 attempts were made. For the exponent e, the standard value
e = 216 + 1 was always chosen. The quantity of fixed points for all 60 attempts
was computed with SageMath Example 5.36.
Figure 5.16 shows that the average number of fixed points was not greater than
40 in any of the six size ranges.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 296 — #102

fp=numpy.array ([0])
fq=numpy.array ([0])

[27] Tervooren. M., factordb.com, http://factordb.com/index.php?query=2%5C%5E1193-1.

[28] MysteryTwister (MTC3), https://www.mysterytwister.org.
[29] Wagstaff, S., The Cunningham Project, https://homes.cerias.purdue.edu/_ssw/cun/.
[30] Brillhart, J., et al., Factorizations of bn ±1, b = 2, 3, 5, 6, 7, 10, 11, 12 Up to High
Powers, AmericanMathematical Society, 2002, http://citeseerx.ist.psu.edu/viewdoc/down-
load?doi=10.1.1.121.4467&rep=rep1&type=pdf.
[31] Schulz. R.-H., and H. Witten, “Zeitexperimente zur Faktorisierung. Ein Beitrag zur
Didaktik der Kryptographie,” LOG IN, Vols. 166/167, 2010, pp. 113–120, https://in-
formatik.schule.de/krypto/.
[32] Ziegler, G. M., The Great Prime Number Record Races,Notices of the AMS,
Vol. 51, 2004, pp. 414–416, http://www.ams.org/notices/200404/comm-ziegler.pdf,
and https://www.mi.fu-berlin.de/math/groups/discgeom/ziegler/Preprintfiles/095e-
PREPRINT.pdf.
[33] Wagstaff, S. S. Jr., “The Cunningham Project,” Fields Institute Communications, 2000,
pp. 1–12, https://homes.cerias.purdue.edu/∼ssw/cun1.pdf.
[34] Wagstaff, S., The Cunningham Project, https://homes.cerias.purdue.edu/∼ssw/cun/no-
tat.txt.
[35] Bernstein, D. J., Circuits for Integer Factorization: A Proposal, 2001, https://cr.yp.to/pa-
pers/nfscircuit.ps url2: https://cr.yp.to/djb.html.
[36] Lenstra, A. J., et al., Analysis of Bernstein’s Factorization Circuit, 2002, https://tau.ac.il/
∼tromer/papers/meshc.pdf.
[37] RSA Security, Has the RSA Algorithm Been Compromised as a Result of Bernstein’s Paper?
Tech. Rep., RSASecurity, April 2002, http://www.networkdls.com/Articles/bernstein.pdf.
[38] Shamir, A., and E. Tromer, Factoring Large Numbers with the TWIRL Device, 2003,
https://cs.uwec.edu/∼tan/priv/www-docs/cs376/Readings/twirl2003.pdf.
[39] Shamir, A., and E. Tromer, “On the Cost of Factoring RSA-1024,” RSA Laboratories Cryp-
toBytes, Vol. 6, No. 2, 2003, pp. 11–20, https://www.tau.ac.il/∼tromer/papers/cbtwirl.pdf.
[40] Weis, R., S. Lucks, and A. Bogk, “Sicherheit von 1024 bit RSA-Schlüsseln gefährdet,”
Datenschutz und Datensicherheit (DuD), Vol. 27, No. 6, 2003, pp. 360–362.
[41] BSI, Technische Richtlinie TR-02102-1, Kryptographische Verfahren: Empfehlungen und
Schlüssellängen (Version 2022-01), Tech. Rep., 2022, https://www.bsi.bund.de/SharedDo
cs/Downloads/DE/BSI/Publikationen/TechnischeRichtlinien/TR02102/BSI-TR-02102.pdf.
[42] Agrawal, M., N. Kayal, and N. Saxena, PRIMES in P, corrected version, August
2002, https://www.cse.iitk.ac.in/users/manindra/algebra/primality_v6.pdf/, and http://fat-
phil.org/maths /AKS/.
[43] Lenstra, A. K., et al., “Ron Was Wrong, Whit Is Right, A Sanity Check of Public Keys
Collected on the Web,” Cryptology ePrint Archive, February 2012, https://eprint.iacr.org/
2012/064.pdf.
[44] Esslinger, B., J. Schneider, and V. Simon, “RSA–Sicherheit in der Praxis,” in KES Zeitschrift
für Informationssicherheit, 2012.2, April 2012, pp. 22–27, https://www.cryptool.org/as-
sets/ctp /documents/kes_2012_RSA_Sicherheit.pdf.
[45] CrypTool Contributors, Testing RSA Moduli for Shared Prime Factors, https://www.cryp-
tool.org/en/posts/2012-05-24/rsa-sanity-check.
[46] Heninger, N., et al., “Mining Your Ps and Qs: Detection of Widespread Weak Keys in
Network Devices,” in Proceedings of the 21st USENIX Security Symposium, August 2012,
https://factorable.net/paper.html.
[47] Bernstein, D. J., “Factoring into Coprimes in Essentially Linear Time,” Journal of
Algorithms, Vol. 54, 2005, https://cr.yp.to/lineartime/dcba-20040404.pdf.
[48] Oechslin, P., Making a Faster Cryptanalytic Time-Memory Trade-Off, Tech. Rep., Crypto
2003, https://lasecwww.epfl.ch/pub/lasec/doc/Oech03.pdf.
[49] Schneier, B., Applied Cryptography, Protocols, Algorithms, and Source Code in C, Second
Edition, Indianapolis, IN: John Wiley & Sons, 1996.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 300 — #106

i i

300 Introduction to Elementary Number Theory with Examples

[50] Eckert, C.,IT-Sicherheit: Konzepte-Verfahren-Protokolle, Ninth Edition, De Gruyter Old-

enbourg, 2014.
[51] Yan, S. Y., Number Theory for Computing, Berlin: Springer, 2000.
[52] Nguyen, M. V., Number Theory and the RSA Public Key Cryptosystem–An Introduc-
tory Tutorial on Using SageMath to Study Elementary Number Theory and Public Key
Cryptography, 2009, https://faculty.washington.edu/moishe/hanoiex/Number%20The-
ory%20Applications/numtheory-crypto.pdf.
[53] Hungerford, T. W., Algebra, New York: Springer, 1974.
[54] Katzenbeisser, S., Recent Advances in RSA Cryptography, New York: Springer, 2001.
[55] Giry, D., BlueKrypt: Cryptographic Key Length Recommendation, Version 32.3, May
2020, https://www.keylength.com/.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 301 — #1

i i

CHAPTER 6
CHAPTER 6
The Mathematical Ideas Behind
Modern Asymmetric Cryptography

This chapter provides an introduction to the algorithms of modern cryptography

(mostly asymmetric cryptography as it appeared before the usage of post-quantum
cryptography and of multiparty communication). Individual parts of this intro-
duction are deepened or extended in further chapters, like elliptic curves in
Chapter 8.
You can find corresponding functions within the programs CT1, CT2, JCT,
and CTO: see the list of the included functions within Sections A.1, A.2, A.3,
and A.4, and at the website https://www.cryptool.org/en/documentation/
functionvolume.
Section 6.5 introduces the RSA plane. In principle, this is a two-dimensional
graphic interpretation of the Chinese remainder theorem in the case of RSA. Many
statements about RSA can be understood intuitively in the RSA plane without hav-
ing to have previous knowledge of algebra, for example the Pollard p − 1 algorithm
for factoring natural numbers. Numerous graphics and examples lead step-by-step
to an understanding of the RSA encryption process and associated mathematics,
especially group theory. And although the meaning of the Chinese remainder theo-
rem for RSA really is nothing new it seems that an explicit graphical model for this
doesn’t exist in the literature to date.

6.1 One-Way Functions with Trapdoor and Complexity

Classes
A one-way function is a function that can be calculated efficiently, but whose inverse
is extremely complicated and practically impossible to calculate.
To put it more precisely: A one-way function is a mapping f from a set X to a
set Y, such that f (x ) can be calculated easily for each element x of X , whereas for
(almost) every y from Y it is practically impossible to find an inverse image x (i.e.,
an x where f (x ) = y).
An everyday example of a one-way function is a telephone book: the function
to be performed is to assign a name to the corresponding telephone number. This
can be done easily due to the fact that the names are sorted alphabetically. However,
the inverse function—assigning a name to a given number—is obviously difficult if
you only have a telephone book available.
One-way functions play a decisive role in cryptography. Almost all crypto-
graphic terms can be rephrased using the term one-way function. Let’s take for
example public-key encryption (asymmetric cryptography):

301

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 302 — #2

i i

302 Mathematical Ideas

Each subscriber T to the system is assigned a key pair: a private key dT and a
public key eT . These keys must have the following property (public-key property):

For an opponent who knows the public key eT , it is practically impossible to

determine the private key dT .

In order to construct useful public-key procedures, therefore, we look for a

one-way function that is easy to calculate in one direction but is difficult (prac-
tically impossible) to calculate in the other direction, provided that a particular
piece of additional information (trapdoor) is not available. This additional piece
of information allows the inverse to be found efficiently. Such functions are called
trapdoor one-way functions. In the above case, the one-way function is the encryp-
tion via exponentiation with the public key eT as exponent. The private key dT is
the trapdoor information.
In this process, we describe a problem as “easy” if it can be solved in polynomial
time as a function of the length of the input. If the length of the input is n bit, then
the time for calculating the function is proportional to n a , where a is a constant.
We say that the complexity of such problems is O (n a ) (Landau or big-O notation).
If you compare two functions 2n and n a , where a is a constant, then there
always exists a value for n, from which for all further n it applies: n a < 2n . The
function n a has a lower complexity. So for a = 5 the following applies: From
the length n = 23, 2n is greater than n 5 ; for further n, 2n clearly increases more
quickly [(222 = 4194304, 225 = 5153632), (223 = 8388608, 235 = 6436343),
(224 = 16777216, 245 = 7962624)].
The term “practically impossible” is slightly less precise. In general, we can say
that a problem cannot be solved efficiently if the time required to solve it increases
more quickly than the polynomial time as a function of the size of the input. If, for
example, the length of the input is n bits and the time required for calculating the
function is proportional to 2n , then the following currently applies: The function
cannot practically be calculated for n > 80.
In order to develop a public-key procedure that can be implemented in practice,
it is therefore necessary to discover a suitable trapdoor one-way function.
In order to tidy things up among this confusing multitude of possible problems
and their complexities, we group problems with similar complexities into classes.
The most important complexity classes are the classes P and NP:

• The class P: This class contains those problems that can be solved in a
polynomial amount of time.
• The class NP: The definition of this class doesn’t look at the time required
to solve a problem, but rather at the time required to verify a given solu-
tion. The class NP consists of those problems for which a given solution
can be verified in a polynomial amount of time. Hereby, the term NP “non-
deterministic” means polynomial and is based on a calculation model (i.e.,
on a computer that only exists in theory and can guess correct solutions
nondeterministically then verify them in polynomial time).

The class P is contained in the class NP. A well-known unsolved problem is

whether or not P 6= NP is true (i.e., whether or not P is a true subset). An important

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 303 — #3

i i

6.2 Knapsack Problem as a Basis for Public-Key Procedures 303

property of the class NP is that it also contains what are known as NP-complete
problems. These are problems that represent the class NP as follows: If a “good”
algorithm for such a problem exists, then “good” algorithms exist for all prob-
lems from NP. In particular, if P only contained one complete problem (i.e., if a
polynomial solution algorithm existed for this problem), then P would be equal
to NP. In this sense, the NP-complete problems are the most difficult problems
in NP.
Many cryptographic protocols are formed in such a way that the “good” sub-
scribers only have to solve problems from P, whereas a perpetrator is faced with
problems from NP.
Unfortunately, we do not yet know whether one-way functions actually exist.
However, we can prove that one-way functions exist if and only if P 6= NP [1,
p. 63].
Some mathematicians have again and again claimed to have proven this
equivalence, but so far the claims have always turned out to be false [2].
A number of algorithms have been suggested for public-key procedures. In
many cases—although they at first appeared promising—it was discovered that
they could be solved in polynomial time. The most famous failed applicant is the
knapsack with trapdoor suggested by Ralph Merkle [3]. Also see Section 11.8.1.

6.2 Knapsack Problem as a Basis for Public-Key Procedures

An exhaustive treatment of knapsacks can be found in Knapsack Problems by

Kellerer, Pferschy, and Pisinger [4]. In Section 11.8.1 you can find further infor-
mation and also a challenge with knapsacks.

6.2.1 Knapsack Problem

You are given n objects G 1 , . . . , G n with the weights g1 , . . . gn and the values
w1 , · · · , wn . The aim is to carry away as much as possible in terms of value while
restricted to an upper weight limit g. You therefore need to find a subset of
{G 1 , · · · , G n }, that is {G i1 , . . . , G ik }, so that wi1 + · · · + wik is maximized under
the condition gi1 + · · · + gik ≤ g.
Such questions belong to the NP-complete problems (not deterministically
polynomial) that are difficult to calculate.
A special case of the knapsack problem is:
Pn the natural numbers a1 , . . . , an and g. Find x1 , . . . , xn ∈ {0, 1} where
Given
g = i =1 xi ai (i.e., where gi = ai = wi is selected). This problem is also called a
0-1 knapsack problem and is identified with K (a1 , . . . , an ; g ).
Two 0-1 knapsack problems K (a1 , . . . , an ; g ) and K (a10 , . . . , an0 ; g 0 ) are called
congruent if two coprime numbers w and m exist in such a way that

(1) m > max{ in=1 ai , in=1 ai0 };

P P

(2) g ≡ wg 0 mod m ;
(3) ai ≡ wai0 mod m for all i = 1, . . . , n.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 304 — #4

i i

304 Mathematical Ideas

Comment:
Congruent 0-1 knapsack problems have the same solutions. No quick algorithm is
known for clarifying whether two 0-1 knapsack problems are congruent.

A 0-1 knapsack problem can be solved by testing the 2n possibilities for

x1 , . . . , xn . The best method requires O (2n /2 ) operations, which for n = 100 with
2100 ≈ 1.27 · 1030 and 2n /2 ≈ 1.13 · 1015 represents an insurmountable hurdle
for computers. However, for special a1 , . . . , an the solution is quite easy to find;
for example, for ai = 2i−1 . The binary representation of g immediately delivers
x1 , . . . , xn . In general, the 0-1 knapsack problem can be solved easily if a permuta-
P j−1
tion π of 1, . . . , n exists with aπ ( j ) > i =1 aπ (i ) with j = 1, . . . , n. If, in addition,
π is the identity; that is, π (i ) = i for i = 1, 2, . . . , n, then the sequence a1 , . . . , an
is said to be super-increasing. Crypto Procedure 6.1 solves the knapsack problem
with a super-increasing sequence in the time of O (n ).

Crypto Procedure 6.1: Solving Knapsack Problems with Super-Increasing

Weights
for i = n to 1 do
if T ≥ ai then
T := T − si
xi := 1
else
xi := 0
if T = 0 then
X := (x1 , . . . , xn ) is the solution.
else
No solution exists.

6.2.2 Merkle-Hellman Knapsack Encryption

In 1978, Merkle and Hellman [3] specified a public-key encryption procedure that
is based on defamiliarizing the easy 0-1 knapsack problem with a super-increasing
sequence into a congruent one with a super-increasing sequence. It is a block cipher-
ing that ciphers an n-bit plaintext each time it runs; see Crypto Procedure 6.2 for
the details.

Crypto Procedure 6.2: Merkle-Hellman (Based On Knapsack Problems)

Let (a1 , . . . ,P
an ) be super-increasing. Let m and w be two coprime numbers
n
with m > i =1 ai and 1 ≤ w ≤ m − 1. Select w̄ with w w̄ ≡ 1 mod m
the modular inverse of w and set bi := wai mod m, 0 ≤ bi < m for i =
1, . . . , n, and verify whether the sequence b1 , . . . bn is not super-increasing. A
permutation bπ (1) , . . . , bπ (n ) of b1 , . . . , bn is then published and the inverse
permutation µ to π is defined secretly. A sender writes his/her message in

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 305 — #5

i i

6.3 Decomposition into Prime Factors as a Basis for Public-Key Procedures 305

Crypto Procedure 6.2 (continued)

( j) ( j)
blocks (x1 , . . . , xn ) of binary numbers n in length, calculates

n
( j)
X
( j)
g := xi bπ (i )
i =1

and sends g ( j ) , ( j = 1, 2, . . . ).
The owner of the key calculates

G ( j ) := w̄g ( j ) mod m, 0 ≤ G( j) < m

( j) ( j)
and obtains the xµ(i ) ∈ {0, 1} (and thus also the xi ) from
n n
( j) ( j)
X X
G( j) ≡ w̄g ( j ) = xi bπ (i ) w̄ ≡ xi aπ (i ) mod m
i =1 i =1
n n
( j) ( j)
X X
= xµ(i ) aπ (µ(i )) = xµ(i ) ai mod m
i =1 i =1

by solving the easier 0-1 knapsack problems K (a1 , . . . , an ; G ( j ) ) with super-

increasing sequence a1 , . . . , an .

In 1982, Shamir [5] specified an algorithm for breaking the system in polyno-
mial time without solving the general knapsack problem. Len Adleman [6] and
Jeff Lagarias [7] specified an algorithm for breaking the twice iterated Merkle-
Hellman knapsack encryption procedure in polynomial time. Ernst Brickell [8] then
specified an algorithm for breaking multiply iterated Merkle-Hellman knapsack
encryption procedures in polynomial time. This made this procedure unsuitable as
an encryption procedure. It therefore delivers a one-way function whose trapdoor
information (defamiliarization of the 0-1 knapsack problem) could be discovered
by an eavesdropper.

6.3 Decomposition into Prime Factors as a Basis for

Public-Key Procedures

Primes form the basis for numerous algorithms for public-key procedures.

6.3.1 The RSA Procedure

As early as 1978, R. Rivest, A. Shamir, and L. Adleman [9] introduced the most
important asymmetric cryptography procedure to date (see algorithm in Crypto
Procedure 6.3).
RSA is also exposed in this book, for example, Section 5.10 ff. or RSA2D in
Section 6.5.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 306 — #6

i i

306 Mathematical Ideas

You can gain practical experience with the RSA procedure in CT under:
• CTO has two broad plugins for RSA:
◦ Via “RSA (Step-by-step)” https://www.cryptool.org/en/cto/rsa-
step-by-step.
◦ Via “RSA visual and more” kb=https://www.cryptool.org/en/cto/
rsa-visual you can see with graphics how RSA assigns its input values
when encrypting, you can test textbook RSA with big numbers, and also
use RSA with OAEP, padding, and certificates as it is used in practice.
• Using CT1 Indiv. Procedures F RSA Cryptosystem F RSA Demonstra-
tion.
• Using CT2 Templates F Mathematics F RSA with big numbers and fur-
ther RSA templates in CT2.
• Using JCT Default Perspective F Visuals and JCT Algorithm Per-
spective.

Crypto Procedure 6.3: RSA (Based On the Factorization Problem)

Key generation:
Let p and q be two different prime numbers and N = pq. Let e be any num-
ber relative prime to φ ( N ); that is, gcd(e, φ ( N )) = 1. Using the Euclidean
algorithm, we calculate the positive integer d < φ ( N ) such that

ed ≡ 1 mod φ ( N )

whereby φ is the Euler phi function.

The output text is divided into blocks and encrypted, whereby each block
has a binary value x ( j ) ≤ N .

Public key: N, e

Private key: N, d

Encryption: y = enc(x ) = x e mod N

Decryption: x = dec( y ) = y d mod N

Comment: Euler Phi Function

The Euler phi function is defined as:
φ (n ) is the number of integers 1 ≤ x < n
that do not have a common factor with n.
“No common factor” is defined as: Two integers a and b are coprime if
gcd(a, b) = 1.
The first values of the Euler phi function are:

φ (1) = 1, φ (2) = 1, φ (3) = 2, φ (4) = 2, φ (6) = 2, φ (10) = 4, φ (15) = 8.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 307 — #7

i i

6.3 Decomposition into Prime Factors as a Basis for Public-Key Procedures 307

For example, φ (24) = 8, because

|{x < 24 : gcd(x, 24) = 1}| = |{1, 5, 7, 11, 13, 17, 19, 23}|.

SageMath Example 6.1 shows how to get these values via SageMath and how to
create the graphics for the Euler phi function (Figure 6.1).

SageMath Example 6.1: Phi and the List of Coprime Numbers via SageMath
sage: n=24; philist =[i for i in range(n) if gcd(n,i) == 1];
print('n =', n, '; Len =', euler_phi(n), '; List =', philist)
n = 24 ; Len = 8 ; List = [1, 5, 7, 11, 13, 17, 19, 23]

sage: n=25; philist =[i for i in range(n) if gcd(n,i) == 1];

print('n =', n, '; Len =', euler_phi(n), '; List =', philist)
n = 25 ; Len = 20 ;
List = [1, 2, 3, 4, 6, 7, 8, 9, 11, 12, 13, 14, 16, 17, 18, 19, 21, 22, �
� 23, 24]

sage: n=26; philist =[i for i in range(n) if gcd(n,i) == 1];

print('n =', n, '; Len =', euler_phi(n), '; List =', philist)
n = 26 ; Len = 12 ; List = [1, 3, 5, 7, 9, 11, 15, 17, 19, 21, 23, 25]

P = plot(euler_phi , -3, n); P

Table 6.1 shows values of φ (n ) up to n = 25.

If p is a prime number, then φ ( p ) = p − 1.
In the case of N = pq:

φ ( N ) = pq (1 − 1/ p )(1 − 1/q ) = p (1 − 1/ p )q (1 − 1/q ) = ( p − 1)(q − 1).

Figure 6.1 Values of the phi function up to n = 26.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 308 — #8

i i

308 Mathematical Ideas

Table 6.1 Euler Phi Function

n φ (n ) The Natural Numbers That Are Coprime to n and Less Than n
1 1 1
2 1 1
3 2 1, 2
4 2 1, 3
5 4 1, 2, 3, 4
6 2 1, 5
7 6 1, 2, 3, 4, 5, 6
8 4 1, 3, 5, 7
9 6 1, 2, 4, 5, 7, 8
10 4 1, 3, 7, 9
15 8 1, 2, 4, 7, 8, 11, 13, 14
20 8 1, 3, 7, 9, 11, 13, 17, 19
25 20 1, 2, 3, 4, 6, 7, 8, 9, 11, 12, 13, 14, 16, 17, 18, 19, 21, 22, 23, 24

Table 6.2 L ( N ) Value Table∗

N 1050 10100 10150 10200 10250 10300
L(N ) 1.42 · 1010 2.34 · 1015 3.26 · 1019 1.20 · 1023 1.86 · 1026 1.53 · 1029
∗ Factorization effort related to the length of the modulus.

If we know the various prime factors p1 , . . . , pk of n, then

1 1
φ (n ) = n · 1 − · ... · 1 − .
p1 pk

Further formulas for the Euler phi function are in Section 5.8.2.
The function enc is a one-way function whose trapdoor information is the
decomposition of N into primes.
At the moment, no algorithm is known that can factorize two prime numbers
sufficiently quickly for extremely large values (e.g., for several hundred decimal
places). The quickest algorithms known todayp [10] factorize a compound integer
N in a time period proportional to L ( N ) = e ln( N ) ln(ln( N )) . Some example values
can be found in Table 6.2.
To this date, it has not been proved that the problem of breaking RSA is equiv-
alent to the factorization problem. Nevertheless, it is clear that the RSA procedure
will no longer be safe if the factorization problem is solved.

6.3.2 Rabin Public-Key Procedure 1979

“Rabin’s scheme [. . .] gets its security from the difficulty of finding square roots
modulo a composit number. This problem is equivalent to factoring” [11]. Unfor-
tunately, this procedure is susceptible to chosen-ciphertext attacks. See [11] for
more details.
This algorithm can be performed step-by-step in JCT Default Perspective
F Visuals F Rabin Cryptosystem.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 309 — #9

i i

6.4 The Discrete Logarithm as a Basis for Public-Key Procedures 309

Crypto Procedure 6.4: Rabin (Based On the Factorization Problem)

Let p and q be two different prime numbers with p, q ≡ 3 mod 4 and n = pq.
Let 0 ≤ B ≤ n − 1.
Public key: (n, B )

Private key: ( p, q )

Encryption: y = enc(x ) = x (x + B ) mod n

Decryption:
q
dec( y ) = y + B 2 /4 − B /2 mod n

Note that the encryption function is not injective: If doing the decryption, for
each ciphertext value you get exactly four different plaintext values that have eT (x )
as inverse image: x, −x − B, ω(x + B /2) − B /2, −ω(x + B /2) − B /2 where ω is
one of the four roots of unity.
Backdoor information is the decomposition into prime numbers of n = pq.

6.4 The Discrete Logarithm as a Basis for Public-Key

Procedures
Discrete logarithms form the basis for many algorithms for public-key procedures.

6.4.1 The Discrete Logarithm in Z p

Let p be a prime number and let g ∈ Z∗p = {0, 1, . . . , p − 1}. Then the discrete
exponential function with base g is defined as

eg : k −→ y := g k mod p, 1 ≤ k ≤ p − 1.

The inverse function is called a discrete logarithm function logg ; the following
holds:
logg (g k ) = k.

The problem of the discrete logarithm (in Z∗p ) is understood to be as follows:

Given p, g and y, determine k such that y = g k mod p

It is much more difficult to calculate the discrete logarithm than to evaluate the
discrete exponential function (see Section 5.9). Table 6.3 lists several procedures
for calculating the discrete logarithm and their complexity [10].
From the CrypTool Variants, you can try CT2 Templates F Mathematics F
Discrete Logarithm or JCT Default Perspective F Visuals F Shanks
Babystep-Giantstep. The baby-step giant-step algorithm computes the discrete
logarithm or order of an element in a finite abelian group. The algorithm is based
on a space-time trade-off. Although the algorithm is superior to nonely trying out all
possibilities concerning the runtime, it is still not practicable for very large groups.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 310 — #10

i i

310 Mathematical Ideas

Table 6.3 Procedures for Calculating the Discrete

Logarithm over Z∗p
Name Complexity
√
Baby-step-giant-step O ( p)
Silver-Pohlig-Hellman Polynomial in q, the greatest
prime
factor of p − 1
ln( p ) ln(ln( p ))
p
Index-Calculus O e(1+o(1))

In February 2007, the group Kleinjung, Franke, and Bahr at the University
of Bonn set the then-record for calculating discrete logarithms. They calculated the
discrete logarithm modulo a 160-digit (530-bit) prime number p and a generator g:

p = b10159 π c + 119849
= 314159265358979323846264338327950288419716939937510582097494
459230781640628620899862803482534211706798214808651328230664
7093844609550582231725359408128481237299
g=2

Note that this is indeed a prime number:

sage: is_prime(floor(10^159*pi)+119849)
True

More precisely, the discrete logarithm k of the following integer y—the first
159 digits of the Euler number e—was determined:

y = b10159 ec
= 271828182845904523536028747135266249775724709369995957496696
762772407663035354759457138217852516642742746639193200305992
1817413596629043572900334295260595630738
k = logg ( y ) mod p
= 829897164650348970518646802640757844024961469323126472198531
845186895984026448342666252850466126881437617381653942624307
537679319636711561053526082423513665596

The search was performed with GNFS method (index-calculus) and took about 17
CPU years on 3.2 GHz Xeon machines.
The current record (as state of November 2021) considering integers modulo p
as the finite cyclic group G was on December 2, 2019 by Fabrice Boudot, Pierrick
Gaudry, Aurore Guillevic, Nadia Heninger, Emmanuel Thomé, and Paul Zimmer-
mann [12]. They computed the discrete logarithm modulo the 240-digit (795-bit)
prime RSA-240 + 49204 (the first safe prime above RSA-240), using the number
field sieve algorithm and the open-source software CADO-NFS.

6.4.2 Diffie-Hellman Key Agreement

The mechanisms and algorithms of classical cryptography only take effect when
the subscribers have already exchanged the secret key. In classical cryptography

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 311 — #11

i i

6.4 The Discrete Logarithm as a Basis for Public-Key Procedures 311

you cannot avoid exchanging secrets without encrypting them. Transmission safety
here must be achieved using noncryptographic methods. We say that we need a
secret channel for exchanging secrets. This channel can be realized either physically
or organizationally.
What is revolutionary about modern cryptography is, among other things, that
you no longer need secret channels: You can agree on secret keys using nonsecret
(i.e., public) channels.
One protocol that solves this problem is that of Diffie and Hellman (Crypto
Procedure 6.5).

Crypto Procedure 6.5: Diffie-Hellman Key Agreement

Two subscribers A and B want to agree on a joint secret key.
Let p be a prime number and g a natural number. These two numbers do not
need to be secret.
The two subscribers then select a secret number a and b from which they
calculate the values α = g a mod p and β = g b mod p. They then exchange
the numbers α and β. To end with, the two subscribers calculate the received
value to the power of their secret value to get β a mod p and α b mod p.
Thus
β a ≡ (g b )a ≡ g ba ≡ g ab ≡ (g a )b ≡ α b mod p

This exchange protocol has been visualized, and you can execute the single
steps with specific numbers in:
• CT1 Indiv. Procedures F Protocols F Diffie-Hellman Demonstration
• JCT Default Perspective F Visuals F Diffie-Hellman Key Exchange (EC)

The safety of the Diffie-Hellman protocol is closely connected to calculating

the discrete logarithm mod p. It is even thought that these problems are equivalent.

6.4.3 ElGamal Public-Key Encryption Procedure in Z∗p

By varying the Diffie-Hellman key agreement protocol slightly, you can obtain
an asymmetric encryption algorithm, Crypto Procedure 6.6. This observation was
made by Taher ElGamal.

Crypto Procedure 6.6: ElGamal (Based On the Discrete Logarithm Problem)

Let p be a prime number such that the discrete logarithm in Z p is difficult
to compute. Let α ∈ Z∗p be a primitive element. Let a ∈ N and β = α a mod p.

Public key: p, α, β

Private key: a

Let k ∈ Z p−1 be a random number and x ∈ Z∗p the plaintext.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 312 — #12

i i

312 Mathematical Ideas

Crypto Procedure 6.6 (continued)

Encryption: enc(x, k ) = ( y1 , y2 )

where y1 = α k mod p

and y2 = xβ k mod p

Decryption: dec( y1 , y2 ) = y2 ( y1a )−1 mod p

In Java CrypTool this can be explored with JCT Default Perspective F

Visuals F ElGamal.

6.4.4 Generalized ElGamal Public-Key Encryption Procedure

The discrete logarithm can be generalized in any number of finite groups (G, ◦).
The following provides several properties of G that make the discrete logarithm
problem difficult. Instead of g ◦ h we often write only gh.

Calculating the Discrete Exponential Function

Let G be a group with the operation ◦ and g ∈ G. The (discrete) exponential
function base g is defined as

eg : k 7−→ g k , for all k ∈ N.

with
g k := g ◦ . . . ◦ g .
| {z }
k times

The exponential function is easy to calculate.

Lemma
The power g k can be calculated in at most 2 log2 k group operations.

Proof
Let k = 2n + kn−1 2n−1 + · · · + k1 2 + k0 be the binary representation of k. Then
n ≤ log2 (k ), because 2n ≤ k < 2n +1 . k can be written in the form k = 2k 0 + k0 with
k 0 = 2n−1 + kn−1 2n−2 + · · · + k1 . Thus,
0 0
g k = g 2k +k0 = (g k )2 g k0 .
0
We therefore obtain g k from g k by squaring and then multiplying by g. The claim
is thus proved by induction to n.

Problem of the Discrete Logarithm

Let G by a finite group with the operation ◦. Let α ∈ G and β ∈ H =
{α i : i ≥ 0}.
We need to find a unique a ∈ N with 0 ≤ a ≤ |H | − 1 and β = α a .
We define a as logα (β ).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 313 — #13

i i

6.4 The Discrete Logarithm as a Basis for Public-Key Procedures 313

Calculating the Discrete Logarithm

A simple procedure for calculating the discrete logarithm of a group element, which
is considerably more efficient than simply trying all possible values for k, is the
baby-step-giant-step algorithm.

Theorem 6.1 (baby-step-giant-step algorithm). Let G be a group and g ∈ G. Let

n be the smallest natural number with |G| ≤ n 2 . Then the discrete logarithm of an
element h ∈ G can be calculated base g by generating the following two lists each
containing n elements and comparing these lists:
Giant-step list: {1, g n , g 2n , . . . , g n·n },
Baby-step list: {hg −1 , hg −2 , . . . , hg −n }.
After detecting a common element the calculation can be stopped. In order to
calculate these lists, we need 2n group operations.

Proof
If g jn = hg −i , that is h = g i + jn , then the problem is solved. If the lists are disjoint,
then h cannot be represented as g i + jn , i, j ≤ n. As all powers of g are thus recorded,
the logarithm problem does not have a solution.

You can use the baby-step-giant-step algorithm to demonstrate that it is much

more difficult to calculate the discrete logarithm than to calculate the discrete expo-
nential function. If the numbers that occur have approximately 1,000 bits in length,
then you only need around 2,000 multiplications (see Theorem 6.1) to calculate all
g k , but around 2500 ≈ 10150 operations to calculate the discrete logarithm using
the baby-step-giant-step algorithm.
In addition to the baby-step-giant-step algorithm, there are also numerous
other procedures for calculating the discrete logarithm [10].

The Theorem from Silver-Pohlig-Hellman

In finite abelian groups, the discrete logarithm problem can be reduced to groups
of a lower order.

Theorem 6.2 (Silver-Pohlig-Hellman) Let G be a finite abelian group with |G| =

p1a1 p2a2 ·. . .· psas . The discrete logarithm in G can then be reduced to solving logarithm
problems in groups of the order p1 , . . . , ps .

If |G| contains a dominant prime factor p, then the complexity of the logarithm
problem is approximately
√
O ( p ).

Therefore, if the logarithm problem is to be made difficult, the order of the

group used G should have a large prime factor. In particular, if the discrete expo-
nential function in the group Z∗p is to be a one-way function, then p − 1 must be a
large prime factor. In this case a generalized ElGamal procedure can be defined (see
Crypto Procedure 6.7).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 314 — #14

i i

314 Mathematical Ideas

Crypto Procedure 6.7: Generalized ElGamal (Based On the Factorization

Problem)
Let G be a finite group with operation ◦, and let α ∈ G, so that the
discrete logarithm in H = {α i : i ≥ 0} is difficult to calculate. Let a with
0 ≤ a ≤ |H | − 1 and let β = α a .

Public key: α, β

Private key: a

Let k ∈ Z|H | be a random number, k 6= 0, and x ∈ G be a plaintext.

Encryption: enc(x, k ) = ( y1 , y2 )

where y1 = α k

and y2 = x ◦ β k

Decryption: dec( y1 , y2 ) = y2 ◦ ( y1a )−1

316 Mathematical Ideas

Figure 6.3 Rectangular, two-dimensional arrangement of the numbers from 0 to 76 ( p = 11, q = 7,

n = 77).

We define the finite plane as the set of all points (x, y ) whose coordinates are
elements of a finite field K . As usual, we associate a two-dimensional vector space
with this plane. That means that we use vectors for describing, for example,the
way from the origin (0, 0) to the point (2, 3). In vector notation this would be 2
3
using a column style, whereas points are written in row style. In geometry, points
and vectors are not the same objects, which is why we use the term associated.
Vectors are used because one cannot do computations with points; for example, it
isn’t possible to add them (that’s actually not always true: It is indeed possible to
define an addition on points of an elliptic curve, see Section 8.5).
Vectors, on the contrary, can be added in the way it is known from school
mathematics. The main difference of addition in finite vector spaces, compared
with infinite ones, is that when adding the same vector repeatedly one has to end
up with the zero vector at some point because of the finite field from which the
coordinates of the vector are taken from. It is known from algebra that finite fields
are either equal to Z p for a prime p or that they contain Z p as a subset.

Remark: To be exact, instead of using the term “equal” to Z p we should use the
term “isomorphic” because the elements of a finite field are not always denoted by
the “symbols” {0, 1, 2, . . . }. Sometimes variables are used; for example, e instead
of 1 or for a multiplicative generator. The same is true for the subset: To be more
exact, we should say: They contain a subset that is isomorphic to Z p .

i i

Figure 6.5 The line y = 6x + 2 in the finite plane over the field Z11 .

Since we are in Z p and p is prime, the k−fold multiple of every nonzero vector
for k < p is different from the zero vector. Therefore, we can conclude that every
line of a finite plane has exactly p points.

6.5.4 Lines in the RSA Plane

One can try to carry over the concept of a line in a finite plane to the RSA plane. To
do this, we begin with the analogon of the line through the origin that we plotted
earlier in Figure 6.4. The set

{(x, y ) ∈ Z p × Zq : y = x, x ∈ Z p }

is a subset of the RSA plane with p elements; see Figure 6.6. We call the structure
that is formed by this set an RSA line or just line. Formally we should write the
RSA line more precisely like this:

{(x, y ) ∈ Z p × Zq : y ≡ x (mod q ), x ∈ Z p }

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 320 — #20

i i

320 Mathematical Ideas

Figure 6.6 RSA line y = x in the RSA plane for p = 11 and q = 7.

Our first interesting observation is that the linear equation y = x has two
different solutions in the (11, 7)-plane for some of the possible y values, which is
not possible in a finite plane over a field. When choosing examples with p > q and
in the case p = 11 and q = 7 additionally p < 2q, then the equation y = x has
two solutions for p − q = 4 of the possible 7 y−values. In the case of p > 2q there
would be several solutions for every y, so just think of the pattern in Figure 6.6
extended to the right.
The vertical lines are a special case: They have exactly q points whereas all
oblique lines consist of p points.
In Figure 6.7 one can see the analogon to Figure 6.5, the RSA line y = 6x + 2.
This line has slope −1 since the number 6 is congruent to −1 modulo 7, resulting
in the pattern “one to the right and one down.”
There’s another interesting difference between the situation in Z p × Z p (finite
plane) and the situation in Z p × Zq (RSA plane; as before p 6= q) that we want
to look at: The length of the series of points that we pass through if we repeatedly
move one to the right and one up. In the case of the square one comes back to the
starting point after p steps like in Figure 6.6. In the case of the rectangle one passes
all n = p · q points before getting back to the starting point. Figure 6.3 shows
this behavior, beginning in (0, 0), then passing through all points until arriving at
number 76 or (10, 6). Compared to the finite plane, in the RSA plane case one
passes n = p · q points and not only p before closing the loop.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 321 — #21

i i

6.5 The RSA Plane 321

Figure 6.7 The RSA line y = 6x + 2 in the RSA plane with p = 11 and q = 7.

6.5.5 Alternative Choice of Representatives

Instead of {0, . . . , n −n1} one can also choose
o the symmetric (or central with respect
(n−1)
to 0) representatives − 2 , . . . , 2 for Zn . Figure 6.8 shows the usual repre-
n−1

sentatives in blue brackets and the alternative representatives in black brackets, the
latter being positioned symmetrically to the left and right of 0.
Choosing
n the representatives
o n of Z p and Zq in othis alternative way, one has the
( p−1) p−1 (q−1) q−1
two sets − 2 ,..., 2 and − 2 ,..., 2 .
In the two-dimensional version this gives us Figure 6.9. Here all the points are
symmetrical to the p- and q-axis. In addition to that the last point when following
the line starting in the origin and moving in the direction to the upper left is not the
point (10, 6), respectively, 76 like in Figure 6.3, but (−1, −1) resp. −1.

Figure 6.8 Translation of representatives mod n.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 322 — #22

i i

322 Mathematical Ideas

Figure 6.9 Rectangular two-dimensional pattern of the numbers from −38 to 38 for p = 11, q = 7, n = 77
with the origin in the center instead of in the bottom left corner.

Definition: We use the term pq-plane or RSA plane for both rectangular repre-
sentations of Z p × Zq regardless of whether the origin is in the center or bottom
left. When using specific numbers instead of the variables p and q we also write
( p, q )-plane; for example, the (11, 7)-plane for Figure 6.6.

6.5.6 Points on the Axes and Inner Points

If one looks at Figure 6.9 it is immediately apparent that the points on the p-axis
correspond to the numbers {0, ±7, ±14, ±21, ±28, ±35}, which are all multiples of
q = 7. On the q-axis on the other hand there are the 2D images of multiples of
p = 11 (i.e., {0, ±11, ±22, ±33}). In the other representation (Figure 6.3) these are
the sets {0, 7, 14, . . . , 70} and {0, 11, 22, . . . , 66}.
The CRT ensures that a number that has a common divisor with n has one
coordinate = 0 if mapped to the corresponding 2-tuple in Z p ×Zq . Therefore points
(n−2)
on the 2D axes correspond exactly to the numbers from 0 to n−1 (resp. from − 2
to n−2 ) that have common divisors with n. These points are called axis points or
1

do not occur as orbit lengths. These are exactly those divisors of ( p − 1)(q − 1)
with a prime power divisor of the form 2i with i ≥ 6 or 3 j with j ≥ 3. Those divide
( p − 1)(q − 1) but neither p − 1 nor q − 1. Figure 6.16 shows the orbits of 2, 5,
and 811 with length 144, 288, and 3, respectively.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 329 — #29

i i

6.5 The RSA Plane 329

Figure 6.15 Orbits in the (83, 59)-plane: (a) Orbit of 2, (b) orbit of 60, (c) orbit of 60 multicolor, and
(d) orbit of 117.

Finally, we summarize: The orbits of the second case where there are only small
prime divisors of φ (n ) cannot be as long as in the first case where there are large
prime divisors of φ (n ).
This fact is, for example, used in the Pollard p − 1 algorithm for factoring large
numbers with only small prime divisors. We’ll come back to this in Section 6.5.11.

6.5.8.3 RSA Orbits

We already mentioned in Section 6.5.7 that f k : z 7→ z k for gcd(k, φ (n )) = 1 is an
automorphism of Z∗n .
First of all, this means that f k is a 1:1 mapping. What cannot happen with
an automorphism is similar to squaring, where two different points are mapped
onto the same image point, as illustrated in Figure 6.11 (in fields of characteristic
2 squaring is bijective, for p 6= 2 it is not, but the case p = 2 is irrelevant for RSA).
If gcd(k, φ (n )) = 1 then also gcd(k, p − 1) = 1 and gcd(k, q − 1) = 1 because
of φ (n ) = ( p − 1)(q − 1) for RSA. Therefore, the map f k is also a 1:1 map on the
axes. To make this clearer, have a look at Section 6.5.8.2 and associate the axes
with groups of order p − 1 or q − 1.

i i

The composition fl ◦ f k for two numbers k, l ∈ {1, . . . , φ (n )} relatively prime

to φ (n ) yields:

(∗) (∗)
( fl ◦ f k )(z ) = fl ( f k (z )) = (z k )l = z k·l = z k·l mod φ (n ) = f k·l mod φ (n ) (z )

In the case of k and l not having common divisors with φ (n ) the product kl also is
relatively prime to φ (n ). If kl is larger than φ (n ), the remainder r := kl mod φ (n )
with 0 < r < φ (n ) is also relatively prime to φ (n ). Therefore, the composition
fl ◦ f k is also a one-to-one map on the inner points.
We mention that the equalities marked with (∗) in the computation above hold
only in the finite group Z∗n . Because if z k or z k·l would be natural numbers larger
than n we would have to write · · · ≡ . . . mod n instead.
For the inverse of f k , the deciphering map, one uses the extended Euclidean
algorithm: For every k ∈ {1, . . . , φ (n )} there can be found a unique k inv ∈
{1, . . . , φ (n )} such that k · k inv ≡ 1 mod φ (n ). Then f k inv ◦ f k is the identity map
and ( f k )−1 = f k inv is the inverse map of f k .
The set of all automorphisms of Z∗n is usually denoted with Aut(Z∗n ). Note that
not every automorphism of Aut(Z∗n ) is of the form z 7→ z k .
The Chinese remainder theorem gives us

∼ Aut(Z∗ × Z∗ ),
Aut(Z∗n ) = p q

but because of p − 1 and q − 1 having common divisors, there is no isomor-

phism from the automorphism group of the product Z∗p × Zq∗ to the product of
the automorphism groups of Z∗p and Zq∗ :

6∼ Aut(Z∗p ) × Aut(Zq∗ )
Aut(Z∗p × Zq∗ ) =

The structure of the full automorphism group of finite abelian groups is described
in the literature in, for example, https://arxiv.org/pdf/math/0605185.pdf.
For RSA we only need automorphisms that are power maps. Those form a sub-
group of Aut(Z∗n ) that will be written as A or sometimes, to avoid confusion, as An
or also AutRSA (Z∗n ). The corresponding automorphisms we call power automor-
phisms or RSA automorphisms. How many and which prime factors the number
φ (n ) has and also with which exponents they occur in p − 1 and q − 1 deter-
mines the number of possible power automorphisms of Z∗n . The following holds:
|A| = φ (lcm( p − 1, q − 1))
Now finally let’s define the notion of an RSA orbit.

Definition: Let n = pq be the product of two different primes and A ⊆ Aut(Z∗n )

the subgroup of power automorphisms of Z∗n . Further, let z be an arbitrary element
of Z∗n . Then we call the set z A := {z a : a ∈ A} the RSA orbit or the RSA path of z.
Every RSA orbit of an element z is contained in the full orbit of z, but RSA
orbits are not groups themselves, because z A cannot contain the unit 1 except, of
course, if z = 1. If the orbit of z has length d for some divisor d of φ (n ), then
the RSA orbit of z has exactly φ (d ) elements, whereas the normal orbit consists

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 332 — #32

The groups C3 and C5 have prime order, so there are no nontrivial (trivial subgroups
of any group G are always the subgroup consisting of the unit element alone and

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 334 — #34

i i

334 Mathematical Ideas

the whole group G) subgroups and exactly 2 = 3 − 1 and 4 = 5 − 1 generators.

From this we get AutRSA (C3 ) = ∼ C2 as well as AutRSA (C5 ) =
∼ C4 , and then also
∼
A77 = C2 × C4 , and therefore finally |A77 | = 8.
The automorphism σ := f 7 : z 7→ z 7 , operating repeatedly on 2 ∈ Z77 ,
generates the following series:

σ σ σ σ
2 −→ 27 −→ 249 = 219 −→ 219·7 = 213 −→ 213·7 = 2

Using the automorphism τ := f 11 we have 2 → 211 → (211 )11 = 2121 = 2.

For ρ := f 17 there are four steps again:

ρ ρ ρ ρ
2 −→ 217 −→ 2289 = 219 −→ 219·17 = 223 −→ 213·17 = 2

The last of the maps in the previous list (∗) we call λ := f 29 . The map λ then is the
multiplicative inverse 2−1 = 229 of the element 2 in Z77 . From this it follows that
λ2 = f 1 = id.
We showed that {σ, σ 2 = ρ 2 , σ 3 , τ, ρ, ρ 3 , λ, id} form exactly the power
automorphism group Aut R S A (Z∗77 ), which is isomorphic to the additive group
Z2 × Z4 .
This means the enciphering of the “plaintext” 2 via RSA with the modulus 77
can only be done in seven possible ways.
Second example: n = 3097, p = 163, q = 19, p − 1 = 162 = 2 · 34 , q − 1 = 18 =
2 · 32 , φ (n ) = 2916 = 22 · 36 , lcm(162, 18) = 162.
The elements of Z3097 can have a path of the following length:

{1, 2, 3, 6, 9, 18, 27, 54, 81, 162}

Here are the first elements:

Element Length of Path in Z3097

0 1
1 1
2 162 = 2 · 34
3 162
4 81 = 34
5 54 = 2 · 32
6 27
7 162
8 54
9 81
10 162
11 162
... ...
85 9
... ...

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 335 — #35

i i

6.5 The RSA Plane 335

The lengths of the corresponding RSA orbits are as follows:

Element Length of RSA Orbit in Z3097

0 1
1 1
2 54
3 54
4 54
5 18
6 18
7 54
8 18
9 54
10 54
11 54
... ...
85 6
... ...

This second example 2 was chosen similar to the preceding one (example 1) where
also φ (n ) does have only small prime divisors, here 2 and 3. We investigate the RSA
automorphism group A3097 by decomposing like in the previous example:

A3097 ∼ Aut RS A (Z∗ × Z∗ )

= 163 19
∼
= Aut RS A (C2·34 × C2·32 )
∼ Aut RS A (C2 × C2 × C 2 × C 4 )
= 3 3
∼
= Aut RS A (C 4 )
3
∼ C 4
= φ (3 )
∼
= C54
∼ C2 × C 3
= 3
∼ (Z2 , +) × (Z 3 , +)
= 3

So for n = 3097 = 163 · 19 an RSA path has maximal length 54 and therefore every
plaintext m ∈ Zn cannot have more than 54 ciphertexts.
In Figure 6.18 we show as an example all paths of length 9 and in Figure 6.19
the corresponding RSA paths of the generating elements of those paths. The cor-
responding Table 6.4 consists of one cyclic group per row, denoted by ca , . . . cl .
The elements z i are taken from the set of representatives {0, 1, . . . , 3096} with the
generator z in the third column. For the pictures we chose the alternative set of
representatives from −(n − 1)/2 to (n − 1)/2, see Section 6.5.5. SageMath Example
6.2 produces the LaTeX code for Table 6.4.

i i

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 338 — #38

i i

338 Mathematical Ideas

Table 6.4 Orbits of Inner Points of Length 9 in Z3097

9
z z z2 z3 z4 z5 z6 z7 z8
ca 1 1996 1274 267 248 2585 58 1179 2661
cb 1 327 1631 653 2935 2772 2120 2609 1468
cc 1 2322 2904 919 85 2259 2177 690 1031
cd 1 1600 1878 710 2498 1670 2386 2096 2646
ce 1 593 1688 653 104 2829 2120 2875 1525
cf 1 574 1194 919 1016 948 2177 1507 955
cg 1 2911 529 710 1111 853 2386 2172 1715
ch 1 384 1897 653 2992 3038 2120 2666 1734
ci 1 1505 1118 919 1833 2335 2177 2856 2741
cj 1 2987 2809 710 2422 3019 2386 785 366
ck 1 2648 296 267 900 1607 58 1831 1683
cl 1 1926 2367 58 216 1018 267 140 201

SageMath Example 6.2: Generation of Table 6.4 with SageMath

print ("\n# CHAP06 -- Sage -Script -SAMPLE 010: =========")

n=3097; # n: RSA modulus

R=Zmod(n); U=R.unit_group (); # R: Ring of integers mod n, U: elements �
� of R that do have
# an multiplicative inverse mod n
oi=9 #order of interest

# define function that returns the multiplicative orbit of z in Zmod(n) �

� as a list
def orbit(z,n):
if gcd(z,n)!=1:
return(print('multiplicative order does not exist!'))
o=R(z).multiplicative_order ()
orbit =[]
x=1
for i in range(o):
orbit.append(x)
x=x*z%n
return(orbit)

# define function returning a list of lists

# with all inner elements with order i, partitioned by cyclic group
def parti(i,n):
l=[] # list of all orbits of order i as list , so list of lists
s=Set() # set of all elements of order i
for x in U:
if x.multiplicative_order ()==i:
if R(x) not in s:
l=l+[ orbit(int(R(x)),n)]
s=s+Set(Set(l[-1]))
return(l)

# define function that returns the LaTeX source code for all the non - �
� header -rows of the table
def rows ():
orbs=parti(oi ,n)
s=''
col0=[r'$c_a$ ',r'$c_b$ ',r'$c_c$ ',r'$c_d$ ',r'$c_e$ ',r'$c_f$ ', r'$c_g$ �
� ',r'$c_h$ ',r'$c_i$ ',r'$c_j$ ',r'$c_k$ ',r'$c_l$ ']

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 339 — #39

i i

6.5 The RSA Plane 339

SageMath Example 6.2 (continued)

for i in range(len(orbs)):
zeile=LatexExpr(col0[i])+' & '+''.join(str(orbs[i][j])+' & ' for j �
� in range(oi -1))+str(orbs[i][oi -1])+LatexExpr(r'\\')+'\n '
s=s+zeile
return(s)

# define string containing LaTeX source code for defining the structure �
� and layout of the table
head=LatexExpr(r'\ begin{tabular }{@{} r')+f'*{oi}'+ LatexExpr(r'{>{$}r<{$ �
� }} @{}}\ toprule ')+'\n'
# define string containing LaTeX source code for the header line of the �
� table
row0=LatexExpr(r'&z^9&\bm{z}&\bm{z^2}&z^3&\bm{z^4}&\bm{z^5}&z^6&\bm{z^7 �
� }&\bm{z^8 }\\\ midrule ')+'\n'

# concatenate all strings for LaTeX table and append LaTeX code for �
� ending the table
table=head+row0+rows ()+LatexExpr(r'\ bottomrule ')+'\n'+ LatexExpr(r'\end{ �
� tabular }')

# ------ main: write string >>table << to file 'orbits -order9.txt ' �
� ------
filename = 'orbits -order ' + str(int(oi)) + '.txt '
with open(filename , 'w') as file:
try:
file.write(table)
print (" File '" + filename + "' created .")
except:
print (" File '" + filename + "' could not be opened .")
# after the "with" block is done , the file is guaranteed to be closed.

The smallest positive integer representing an element of order 9 in Z∗3097 is 85.

It is contained in the third cyclic group cc , but is not its generator, but its fourth
power: (23224 − 85) : 3097 = 9386585843.
Table 6.4 was generated with SageMath. The loop that produces the rows fol-
lows the order in which SageMath orders the unit group (U=R.unit_group() and
for x in U, see SageMath Example 6.2). We won’t go into details here on how
SageMath orders these units (it follows from the internal representation of abelian
groups in SageMath). But this is the reason why on first sight the order of the rows
of the table as well as the choice of the generating element z seem to be random.
Instead of z, the generator of a cyclic group of order 9 can also be z 2 , z 4 , z 5 , z 7 , or z 8 .
These six elements then form an RSA path. For the RSA automorphism f k : z 7→ z k
every k ∈ {1, 2, 4, 5, 7, 8} is possible. But because of 2 dividing ( p−1)(q−1) = 2916,
there remain only k = 1, 5, and 7 at first thought. But as the following is valid:

52 = 25 ≡ 7 mod 9 =⇒ f 5 ◦ f 5 = f 7 reduced to hzi for some z of order 9

53 ≡ 35 ≡ 8 mod 9 =⇒ f 5 ◦ f 5 ◦ f 5 = f 8 ...
54 ≡ 40 ≡ 4 mod 9 ...
55 ≡ 20 ≡ 2 mod 9 ...
56 ≡ 10 ≡ 1 mod 9, ...

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 340 — #40

i i

340 Mathematical Ideas

this means that—restricted to the cyclic groups of order 9 under consideration—also

even exponents do occur.
The 12 different paths of length 9 can be further divided into four times three,
the following triples having a subgroup of order 3 as intersection:

ca ∩ ck ∩ cl = {1, 267, 58}

cb ∩ ce ∩ ch = {1, 653, 2120}
cc ∩ c f ∩ ci = {1, 919, 2177}
cd ∩ cg ∩ c j = {1, 710, 2386}

If we strip away those three elements from such a cyclic group of order 9, the
remaining set of elements forms an RSA orbit consisting of all the generators of
this group of order 9.
This is illustrated in Figure 6.19. The upper part shows the union of all paths of
length 9, and the lower part shows the four subgroups of order 3 (in the upper image
(a) in black), these are exactly eight elements if the 1 (in the bottom figure (b) in
black) is excluded. Points in the upper figure with the same color—not black—form
exactly one RSA orbit of length 6.

6.5.9 Projections
Just as in the real plane {(x, y ) : x, y ∈ R}, one can also investigate projections
onto the axes in finite planes. From the point of view of linear algebra, a two-
dimensional vector space—a plane—is mapped onto a one-dimensional subspace
with a line through the origin, so this map is a linear map of rank 1. The image
of this map is the line that is projected onto, and the kernel of the map is identical
with the line through the origin that is parallel to the direction of the projection,
which must not be orthogonal to the image.
If the reader is not familiar with the terms subspace, rank, or kernel, please
consult any introduction to linear algebra, such as [14].
Because in our model only orthogonal projections are considered, we use the
short term projection instead of orthogonal projection.
The RSA plane is not a vector space in two dimensions; we already discussed
this in Section 6.5.2. As a mapping of sets the projection map can be well-defined
on the RSA plane.
The maps πx : (x, y ) 7→ (x, 0) and π y : (x, y ) 7→ (0, y ) map points onto
their shadows on the horizontal and vertical axes, so these are the two orthogonal
projections that everyone knows from elementary geometry.
Following an inner point while stepping through its path, the shadows also
follow those steps and move with the path. Vice versa, it is possible to reconstruct
from the combined paths of the horizontal and vertical shadows the path of the
point having those shadows.
We illustrate this with the example p = 19, q = 7, n = 133, z = 12.
First we show the path of 12 in Figure 6.20.
Figure 6.21 also shows the paths of the shadows.
For the path of 12 in the (19, 7)-plane, the length of the path of this inner point
is for both axes equal to the length of the projected path. But this doesn’t have to be

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 341 — #41

i i

6.5 The RSA Plane 341

Figure 6.20 Orbit of 12 in the (19, 7)-plane.

is usually additionally required to respect the geometric structure; that is, to map
straight lines back to straight lines and not to take a point out of a straight line and
replace it by one not on the line. If one did not specify this condition, in principle
every permutation on the given n points or numbers with order 2 would be an

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 346 — #46

i i

346 Mathematical Ideas

involution. Such permutations are then either transpositions or a product of disjoint

transpositions.
A transposition on n elements swaps exactly two of these elements and fixes
all others. Disjoint transpositions swap different elements; that is, element 1 with
element 2 (noted by (1, 2)) as well as element 3 with element 4, written (3, 4). The
two transpositions (1, 2) and (2, 3) are not disjoint. If you compose these two, you
get cycle (1, 3, 2), which has not the order 2 but 3. More about permutations and
transpositions can be found in Hungerford ([15]) on pages 46 ff.
We choose the following notations:

• σ : Reflection about the horizontal axis

• τ : Reflection about the vertical axis

• ρ: Rotation of 180◦ σ ◦ τ = τ ◦ σ = ρ

• κ: For the sake of simplicity, we sometimes use the letter κ for any involution
from the set K ∗ := {σ, τ, ρ}. When we speak of a reflection κ about an axis,
either σ or τ is meant.

Although the question of whether there are other involutions operating on the
RSA plane apart from σ, τ , and ρ is not uninteresting, we will not pursue it here.
It should be noted that the three reflections are not automorphisms of Z∗n :
The neutral element of the multiplication, in the two-dimensional setting the point
(1, 1), leaves its place under each of the three mappings σ, τ , and ρ, which can-
not be the case for an automorphism. However, they are bijective and preserve the
partition of Zn into interior points and axis points; moreover, the origin is a fixed
point. A fixed point is called an invariant.
As with all geometric mappings, one asks about the invariants. One distin-
guishes:

Figure 6.25 Possible images of the point (1, 1) under the involutions σ , τ , or ρ.

Next, realize that a symmetric orbit can only contain exactly one of the points
{(−1, 1), (−1, −1), (1, −1)}, because orbits are cyclic subgroups of Z∗p × Zq∗ and
these can have at most one element of order two.
Let us first consider the first component x of any orbit h(x, y )i. It generates a
cyclic subgroup of Z p whose order o(x ) is a divisor of p − 1. If o(x ) is odd, the −1
is not in the path of x. Then none of the points (−1, 1) and (−1, −1) are contained
in the orbit h(x, y )i and thus for odd o(x ) the reflections σ and ρ are already ruled
out. Of course, the same reasoning applies to the second component y or o( y ).
Since o((x, y )) = lcm(o(x ), o( y )), an orbit of odd length can have neither axis nor
point symmetry.

Theorem 6.3 A necessary condition for the symmetry of an orbit is that the order
of the element which generates it is even.

We now decompose the numbers p − 1 and q − 1 into an even and an odd part:

p − 1 = 2a · c and q − 1 = 2b · d,

where a and b are positive integers and c and d are odd. Since p and q are odd,
p − 1 and q − 1 are even and therefore a and b positive.
Moreover, let u be a generating element of Z∗p and v a generating element of Zq∗ .
Thus, for the pair (x, y ) there are exponents r and s (these exponents are unique if
one requires that they are divisors of p − 1 or q − 1 respectively) such that:

x = ur , r p−1
y=v , s
s q −1
p−1 2a · c
o( x ) = = := 2α · γ , γ odd, α ≥ 0
r r

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 349 — #49

i i

6.5 The RSA Plane 349

q −1 2b · d
o( y ) = = := 2β · δ, δ odd, β ≥ 0
s s
o((x, y )) = lcm(o(x ), o( y )) = lcm(2α γ , 2β δ )

Four cases arise:

Case 1: α =β =0
Case 2: α =β >0
Case 3: α >β ≥0
Case 4: β >α ≥0
Case 1: If α and β are both equal to zero, the element orders of x and y are odd,
so the element order of (x, y ) is also odd, and thus the associated path is neither
symmetric to any axis nor point-symmetric.
Case 2: For α = β ≥ 1, the orbit h(x, y )i is point-symmetric. This can be easily seen
by representing (x, y ) using the generating elements:

(x, y ) = (u r , v s )

Considering the case α = β > 0, we set α = β = m > 0 and write lcm(o(x ), o( y )) =

2m · w with odd w = lcm(γ , δ ). We also write γ = t · γ 0 and δ = t · δ 0 , where t is the
greatest common divisor of γ and δ, obtain w = tγ 0 δ 0 , and then take the power as
follows:
m−1
(x, y )2 ·w

This element is different from (1, 1) and is of order 2 in Z∗p × Zq∗ . The same power
applied to the individual components also yields an element of order 2, but in Z∗p
and Zq∗ , respectively: Z∗p or Zq∗ :

m−1 ·tγ 0 δ 0
0 2m−1 ·γ
= xδ
m−1 ·w
x2 = x2

m−1 ·tγ 0 δ 0
0 2m−1 ·δ
= yγ
m−1 ·w
y2 = y2

Since o(x ) and δ 0 are coprime, x and x δ have the same order in Z∗p . If (x δ )2 ·γ
0 0 m−1

was already equal to 1, the element x δ and hence x would have an order equal to
0

or a divisor of 2m−1 · γ = o(x ), a contradiction.

Arguing analogously for y, we find: (−1, −1) is contained in the orbit of (x, y )
if the maximal powers of two dividing o(x ) and o( y ) are identical.
Now we are nearly done: A trajectory containing (−1, −1) is point-symmetric,
because for each element ( X, Y ) from the orbit of (x, y ), the point (−1, −1) ·
( X, Y ) = (−X, −Y ) = ρ (( X, Y )) is also a point on this orbit, since the orbit
forms a cyclic group and therefore is multiplicatively closed. It follows ρ (h(x, y )i) ⊂
h(x, y )i.
It remains to explain why also h(x, y )i ⊂ ρ (h(x, y )i) holds. So consider any ele-
ment of h(x, y )i, this has the form (x, y )k for a k ∈ Z. If (x, y )k were not contained

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 350 — #50

i i

350 Mathematical Ideas

in ρ (h(x, y )i), then (x, y )k 6= (−1, −1) · (x, y )l for all l ∈ Z. However, since an orbit
is a group and equations are uniquely solvable in groups, there exists some l ∈ Z
that solves the equation (x, y )k = (−1, −1) · (x, y )l . Thus, we have shown:

Theorem 6.4 Let n ∈ N be a product of different odd primes p and q and let (x, y )
be a point of the RSA plane associated with an element z ∈ Z∗n via the Chinese
remainder theorem. Further, let 2a and 2b be the respective maximal powers of two
occurring as divisors of p − 1 and q − 1, respectively; that is, p − 1 = 2a · c and
q − 1 = 2b · d with c and d odd. Then the following is true:

1. The orbit of an element (x, y ) ∈ Z∗p × Zq∗ is point-symmetric if and only if

the order o(x ) in Z∗p and the order o( y ) in Zq∗ have the same maximal power
of two as divisors.
2. Every RSA plane contains at least one point-symmetric orbit, namely the one
generated by the point (−1, −1) and containing only the points (−1, −1)
and (1, 1).
3. If an element x ∈ Z∗p has order o(x ) = 2α · k with α ≥ 1, 2α · k p − 1 and
odd l, then the orbit of (x, y ) is point-symmetric. Therefore, for any power
of two 2α with α ≥ 1 on the one hand and α ≤ min(a, b) on the other hand,
one can construct point-symmetric orbits of length 2α · lcm(k, l ) by choosing
x and y such that 2α divides their order but 2α +1 does not. (Such elements
exist because, in general, in cyclic groups C of order N , for every divisor t
N
of N there exists an element of that order t. For if ε is a generator of C, ε t
has exactly the order t.)

Let us briefly consider the example of the (11, 7)-plane, which we took up
before. Because of 11 − 1 = 10 = 2 · 5 and 7 − 1 = 6 = 2 · 3, α from point 3
of Theorem 6.4 can only be equal to 1. Then for point-symmetric orbits there are
exactly the following combinations of possible element orders of the components x
or y:
o( x ) o( y ) o((x, y ))
2 2 2
2 6 6
10 2 10
10 6 30 = lcm(10, 6)

Figure 6.26 shows the corresponding picture for the element of order 30, namely
the 17.
The 17 corresponds to the point (6, 3) = (−5, 3) in the (11, 7)-plane. The order
of 6 in Z11 is in fact 10 and the order of 3 in Z7 is 6:

sage: for i in range(11): 3 : 7

....: print(i,":",6^i% 4 : 9
....: 5 : 10
0 : 1 6 : 5
1 : 6 7 : 8
2 : 3 8 : 4

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 351 — #51

i i

6.5 The RSA Plane 351

Figure 6.26 Orbit of 17 in the (11, 7)-plane.

9 : 2 1 : 3
10 : 1 2 : 2
3 : 6
sage: for i in range(7):
4 : 4
....: print(i,":",3^i%
5 : 5
....:
6 : 1
0 : 1

For (−1, 3), which corresponds to element 10 in Z77 , Figure 6.27 shows on
the top the associated path of length 6. The point (−5, −1), which corresponds
to element 6 in Z77 , has a path of length 10, as can be seen in Figure 6.27 on the
bottom. The point (−1, −1) has the trivial, always existing path of length 2, which
consists of (−1, −1) and (1, 1) (not shown in the figure, because it is trivial).
Case 3: o(x ) = 2α · γ , o( y ) = 2β · δ, α > β ≥ 0
In this case, the orbit of (x, y ) contains the element (−1, 1) and is symmetric with
respect to the vertical axis. Similar to case 2, for each pair (2α , 2β ) with α > β ≥
0 and α ≤ a, β ≤ b, one can choose elements x, y of order o(x ) = 2α · γ and
o( y ) = 2β ·δ (γ and δ odd), respectively, which thus produce a symmetric orbit with
respect to one of the two axes. This then has length lcm(o(x ), o( y )) = 2α ·lcm(γ , δ ).
Elaborated proofs of this can be found in [16].
Case 4: o(x ) = 2α · γ , o( y ) = 2β · δ, 0 ≤ α < β
Here you have symmetry about the horizontal axis, you argue analogous to case 3.

Axis Points
If we consider the horizontal axis, it is pointwise fixed by τ ; that is, τ reduced to
this axis, is the identical mapping. Moreover, ρ and σ have the same action when
restricted to the horizontal axis, so it is sufficient to speak of symmetry, since there is
no difference between point and axis symmetry in this case. The path of an element
(x, 0) on the horizontal axis is symmetric if and only if the order of x is even. So,
again, for every even divisor t of p − 1, one can choose an element of order t and

If we come back to our example with p = 11 and q = 7 we have to check that

there are no symmetric RSA orbits at all, since the length of an orbit in this RSA
plane is a divisor of lcm(11 − 1, 7 − 1) = 30 and 4 is not a divisor of 30.
This leads to the partition of the RSA plane into disjoint RSA orbits shown in
Table 6.5.
These 24 RSA orbits are shown in Figure 6.28 and, as can be seen, they are
all nonsymmetric. The dark rectangle corresponds to the generator chosen for the
respective RSA orbit. It always comes first in the list of 24 orbits.
An easy example where 4 divides p − 1 as well as q − 1 is p = 13 and q = 5.
In this case we can observe point symmetry as well as symmetry with respect to the
axes; see Figure 6.29. There you can see some—not all—symmetric orbits of the
(13, 5)-plane.

Table 6.5 Partition of the (11,7)-Plane into Disjoint Nonsymmetric RSA Orbits
1 orbit: {1} 13 orbit: {17, 19, 24, −37, −25, −16, −9, −4}
2 orbit: {2, 18, 30, −38, −31, −26, −5, −3} 14 orbit: {20, 27, −29, −8}
3 orbit: {3, 5, 26, 31, 38, −30, −18, −2} 15 orbit: {21}
4 orbit: {4, 9, 16, 25, 37, −24, −19, −17} 16 orbit: {22}
5 orbit: {6, 13, −36, −15} 17 orbit: {23, −10}
6 orbit: {7, 28, 35, −14} 18 orbit: {32, −12}
7 orbit: {8, 29, −27, −20} 19 orbit: {33, −11}
8 orbit: {10, −23} 20 orbit: {34}
9 orbit: {11, −33} 21 orbit: {−34}
10 orbit: {12, −32} 22 orbit: {−22}
11 orbit: {14, −35, −28, −7} 23 orbit: {−21}
12 orbit: {15, 36, −13, −6} 24 orbit: {−1}
These correspond to the following points, respectively:
1 orbit: {(1, 1)}
2 orbit: {(2, 2), (−4, −3), (−3, 2), (−5, −3), (2, −3), (−4, 2), (−5, 2), (−3, −3)}
3 orbit: {(3, 3), (5, −2), (4, −2), (−2, 3), (5, 3), (3, −2), (4, 3), (−2, −2)}
4 orbit: {(4, −3), (−2, 2), (5, 2), (3, −3), (4, 2), (−2, −3), (3, 2), (5, −3)}
5 orbit: {(−5, −1), (2, −1), (−3, −1), (−4, −1)}
6 orbit: {(−4, 0), (−5, 0), (2, 0), (−3, 0)}
7 orbit: {(−3, 1), (−4, 1), (−5, 1), (2, 1)}
8 orbit: {(−1, 3), (−1, −2)}
9 orbit: {(0, −3), (0, 2)}
10 orbit: {(1, −2), (1, 3)}
11 orbit: {(3, 0), (−2, 0), (5, 0), (4, 0)}
12 orbit: {(4, 1), (3, 1), (−2, 1), (5, 1)}
13 orbit: {(−5, 3), (−3, −2), (2, 3), (−4, −2), (−3, 3), (−5, −2), (2, −2), (−4, 3)}
14 orbit: {(−2, −1), (5, −1), (4, −1), (3, −1)}
15 orbit: {(−1, 0)}
16 orbit: {(0, 1)}
17 orbit: {(1, 2), (1, −3)}
18 orbit: {(−1, −3), (−1, 2)}
19 orbit: {(0, −2), (0, 3)}
20 orbit: {(1, −1)}
21 orbit: {(−1, 1)}
22 orbit: {(0, −1)}
23 orbit: {(1, 0)}
24 orbit: {(−1, −1)}

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 354 — #54

i i

354 Mathematical Ideas

Figure 6.28 Partition of the (11, 7)-plane (except the origin) into RSA orbits. None of them have any kind
of symmetry; the dark square is always the first element of the corresponding RSA orbit listed in Table 6.5.

Figure 6.29 Some symmetric RSA orbits in the (13, 5)-plane.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 355 — #55

i i

6.5 The RSA Plane 355

After these geometric examinations of the inner workings of RSA with a pure
mathematical focus, which are described nowhere else in the literature as far as
we know, we now return to something widely known among cryptographers: The
algorithm of Pollard.

6.5.11 The Pollard p − 1 Algorithm for RSA in the 2D Model

If for some RSA modulus n, only small prime divisors of φ (n ) exist, one can use
the Pollard p − 1 algorithm for factoring n = pq in a “short” time. In [17] one can
study what exactly small and short means in this context. We only want to illustrate
the idea behind the algorithm in our geometric model.
Essentially, what is being done is this: One chooses an arbitrary point z ∈ Z∗n
and starts to compute not its path, but a part of its path, that being:

z, z2, z 2·3 , z 2·3·4 , z 2·3·4·5 , ... , zk!

Or more precisely:

z, z 2 mod n, z 2·3 mod n, z 2·3·4 mod n, z 2·3·4·5 mod n, ... , z k ! mod n

Now consider the p- and q-coordinate of such a z-power z k ! . If z k ! ≡ x mod p

and z k ! ≡ y mod q we find that if we subtract 1, the number z k ! − 1 corresponds
to the point that is located one step to the left and one step lower than the point
representing z k ! ; that is (x − 1, y − 1). This should be clear if one recalls how the
lines of Figures 6.3 or 6.9 were generated: Addition of 1 in the linear representation
corresponds to “1 to the right and 1 up” in the two-dimensional model, except if
a point lies on the upper or right border, then one has to jump to the bottom and
then 1 right or to the left and then 1 up.
Every point lying one position lower than a point

Figure 6.30 (97, 73)-plane with 3, 32! , 33! , and 34! .

p − 1 algorithm one misses the first candidate π (312 ) = (75, 1) and the first hit is
π (323 ) = (96, 1).

6.5.12 Final Remarks about the RSA Plane

There are other aspects of RSA that can be illustrated with the RSA plane. The
fixed points have already been mentioned as special cases of orbits. However, we
also see potential to illustrate other attacks, for example, Fermat’s factorization
method (see, e.g., [18]), which is applicable for the case where the difference of p
and q is small.
If one interprets the RSA plane embedded in a two-dimensional lattice (on
the topic of lattices see Chapter 11), one can also shed light on some interest-
ing aspects of lattice cryptography, in particular ideas of Coppersmith [19]. For
this purpose, however, it is necessary to consider the neighborhoods of the inte-
ger points in the Euclidean plane. The notion of “neighborhood” plays a major
role in the mathematical subfield of topology. For example, if we represent the
point (1, 1) as a square, the square can also be taken to be the neighborhood
U = {(x, y ) ∈ Q : 12 < x, y < 32 } of (1, 1) in the plane over the rational numbers.
If one takes U as a subset of Q2 , and not as a subset of R2 , one can build on this

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 358 — #58

i i

358 Mathematical Ideas

to create a different, non-Euclidean geometry, the ultrametric plane. It allows cer-

tain structure-preserving mappings onto finite planes. See, for example, [20] or [21]
for more background on this topic. Ultrametric planes are an example of compact
planes. Since these mappings are defined not only on the integer lattice, but also on
the neighborhood of the lattice points, we chose the representation with squares: A
square symbolizes the lattice point together with its neighborhood, which contains
infinitely many points. In comparison to points, this square model is extensible by
topological aspects and therefore a more flexible model.

6.6 Outlook

Several other modern asymmetric methods exist. Some are presented in Chapters
Elliptic Curves (Chapter 8), Homomorphic Ciphers (Chapter 10), Lattices
(Chapter 11), Solving Discrete Logarithms and Factoring (Chapter 12), and Long-
Term Cryptographic Perspectives (Chapter 13). Elliptic curves for instance provide
useful groups for public-key encryption procedures, which offer shorter key lengths.

References

[1] Balcazar, J. L., J. Daaz, and J. Gabarr, Structural Complexity I, Berlin: Springer Verlag,
1998.
[2] Hesselink, W. H., The Borderline between P and NP, February 2001, https://www
.cs.rug.nl/∼wim/pub/whh237.pdf.
[3] Merkle, R., and M. Hellman, “Hiding Information and Signatures in Trapdoor Knap-
sacks,” IEEE Transactions on Information Theory, Vol. 24, No. 5, 1978.
[4] Kellerer, H., U. Pferschy, and D. Pisinger, Knapsack Problems, Berlin: Springer, 2004.
[5] Shamir, A., “A Polynomial Time Algorithm for Breaking the Basic Merkle-Hellman
Cryptosystem,” in Symposium on Foundations of Computer Science, 1982, pp. 145–152.
[6] Adleman, L., “On Breaking the Iterated Merkle-Hellman Public-Key Cryptosystem,” in
Advances in Cryptology, Proceedings of Crypto 82, Plenum Press, 1983, pp. 303–308.
[7] Lagarias, J.C., “Knapsack Public Key Cryptosystems and Diophantine Approximation,”
Advances in Cryptology Proceedings of Crypto 83, Plenum Press, 1983.
[8] Brickell, E. F., “Breaking Iterated Knapsacks,” in Advances in Cryptology: Proceedings of
CRYPTO’84, Vol. 196, Berlin: Springer, 1985, pp. 342–358.
[9] Rivest, R. L., A. Shamir, and L. Adleman, “A Method for Obtaining Digital Signatures and
Public-Key Cryptosystems,” Communications of the ACM, Vol. 21, No. 2, April 1978,
pp. 120–126.
[10] Stinson, D. R., Cryptography–Theory and Practice, Third Edition, Boca Raton, FL:
Chapman & Hall/CRC, 2006.
[11] Schneier, B., Applied Cryptography, Protocols, Algorithms, and Source Code in C, 20th
Anniversary Edition, Second Edition, New York: John Wiley, & Sons, 2015.
[12] NMBRTHRY Archives, https://listserv.nodak.edu/cgi-bin/wa.exe?A2=NMBRTHRY;fd
743373.1912.
[13] Lang, S., Algebra, Third Edition, Reading MA: Addison-Wesley, 1993.
[14] Lang, S., Introduction to Linear Algebra,Second Edition, New York: Springer, 1986.
[15] Hungerford, T. W., Algebra, Springer Verlag, 1974.
[16] Quade, L., “RSA Studies in a Two-Dimensional Group-Theoretic Model,” BA thesis,
Universität Siegen, 2023.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 359 — #59

This chapter mainly gives a high-level overview about hash functions and their
application in digital signatures and certificates.

7.1 Hash Functions

A hash function maps a message of any length to a bit sequence of constant size
(like 128 bit). The function value is called hash value or message digest. Outside of
cryptography, hash functions are used to quickly retrieve stored data.
We consider here only cryptographic hash functions1 fulfilling the requirements
laid out in Section 7.1.1.
They have a multitude of uses such as in message authentication codes (MACs),
pseudorandom number generators (PRNGs), digital signatures, blockchain, and
cryptocurrencies. To store password verification values or derive keys from pass-
words, special hash functions or key derivation functions (KDFs) are used, which
were created to slow brute-force searches and other guessing attacks. A current
KDF recommendation is Argon2.

7.1.1 Requirements for Hash Functions

Cryptographically secure hash functions fulfill the following three requirements (the
order is in a way that the requirements increase):

1. Cryptographic hash functions are implemented in CT at several places:

- The menus CT1 Indiv. Procedures F Hash and CT1 Analysis F Hash enable
- To apply one of six hash functions to the content of the current window;
- To calculate the hash value of a file;
- To test how changes to a text change the corresponding hash value (in the “Hash Demonstration”);
- To calculate a key from a password according to the PKCS#5 standard;
- To calculate HMACs from a text and a secret key;
- To perform a simulation, how digital signatures could be attacked by a targeted search for hash
value collisions.
- Using CT2 Templates F Hash Functions you can find BLAKE, Grostl, Keccak, MD5, and attacks on
them. Plus password-based key-derivation functions (PBKDF) like PKCS#5 and PBKDF2. Modern KDFs
are PBKDF2, scrypt, or Argon2.
- JCT Default Perspective F Visuals F Hash Sensitivity demonstrates how sensitive hash values
react to the smallest changes in the input.
- See the functions’ lists within Sections A.2 and A.3.

Figure 7.1 Avalanche effect with hash functions in CT2.2

The output length of the SHA algorithms was enhanced for the SHA-2 family
because of the possibility of birthday attacks: these make n-bit AES and a 2n-bit
hash roughly equivalent: 128-bit AES – SHA-256, 192-bit AES – SHA-384, 256-
bit AES – SHA-512.3
At Crypto 2004 (August 2004) this safety-feeling was disputed: Chinese
researchers published collision attacks against MD4, SHA-0, and parts of SHA-1.
This globally caused new motivation to engage in new hash attack methods.
The initially published result reduced the expected complexity for one SHA-1
collision search from 280 (brute-force) to 269 [3]. Further announcements claimed
to reduce the required effort to 263 [4] and 252 [5]. This would bring collision
attacks into the practical realm, as similar efforts have been mastered in the past
(see Section 1.2.2).
Already before Crypto 2004, the U.S. National Institute of Standards and Tech-
nology (NIST) decided to discontinue SHA-1 and to standardize new methods with
longer hash values.
So in 2008 NIST opened a competition to develop a new cryptographic hash
algorithm beyond the SHA-2 family [6]. In October 2012, Keccak was announced
as “SHA-3.”

2. CT2 Templates F Cryptoanalysis F Modern F Avalanche (Hash Functions).

3. - Using CT1 Analysis F Hash F Attack on the Hash Value of the Digital Signature you can
comprehend the birthday attack on digital signatures.
- CT2 Templates F Hash Functions F MD5 Collision Finder.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 364 — #4

i i

364 Hash Functions, Digital Signatures, and Public-Key Infrastructures

SHA-3 (Secure Hash Algorithm 3) was released by NIST in August 2015. SHA-
3 is a subset of the cryptographic family Keccak. This current NIST standard is
described in FIPS Publication 202.
The Keccak algorithm is based on a relatively new approach called sponge con-
struction. Given a state vector (sponge) of b bit, and a message in blocks of size
r < b. The sponge then absorbs the message blockwise into a subset of the state,
which is then transformed as a whole using a random permutation function f (this
is the inputting part). Then the result is squeezed out (outputting). This construc-
tion leads to great flexibility. For SHA-3, four instances are defined: SHA3-224,
SHA3-256, SHA3-384, and SHA3-512.
In CT2, there are 3 templates showing the Keccak component—as a stream
cipher, for SHA-3, and as a PRNG. The individual steps within the Keccak algo-
rithm are animated in detail in the Keccak component. Figure 7.2 shows the
visualization of Keccak in CT2.

7.1.4 Attacks on Password Hashes

An attack has the objective to recover the password from a hash value. Thus, the
resistance against first preimage is the password hashes property that determines
its security.

Figure 7.2 SHA-3: Keccak hash function: Theta part of the permutation function f in CT2.4

7.3 RSA Signatures 367

Other digital signature schemes where developed exclusively for this purpose,
as the DSA (digital signature algorithm), and are not directly connected with a
corresponding encryption scheme.
The two signature methods that are probably still most frequently used, RSA
and DSA, are discussed in more detail in the following Sections 7.3 and 7.4. After
that we explain shortly how digital signatures can be used to create the digital
equivalent of ID cards. This is called public-key certification.

7.3 RSA Signatures

As mentioned at the end of Section 5.10.3, it is also possible to perform the RSA
private and public-key operation in reverse order: If H = hash (message), then
raising H first to the power of d (mod N ) and then to the power of e (mod N )
yields H again. Therefore, RSA can be used as a signature scheme.
The RSA signature S for a message hash H is created by performing the private
key operation:
S ≡ H d (mod N )

In order to verify H is correct, the corresponding public-key operation is performed

on the signature S and the result is compared with the message hash H :

S e ≡ ( H d )e ≡ ( H e )d ≡ H (mod N )

“Esslinger” — 2023/11/30 — 19:44 — page 369 — #9

i i

7.5 Public-Key Certification 369

Crypto Procedure 7.1 (continued)

Private key
x < q (a 160-bit number)

Signing
m the message to be signed
k choose at random, less than q
r = (g k mod p ) mod q
s = (k −1 (SHA-1(m ) + xr )) mod q
Remark:
• (s, r ) is the signature.
• The security of the signature depends not only on the mathematical
properties, but also on using a good random source for k.
• SHA-1 is a 160-bit hash function.

Verifying
w = s −1 mod q
u 1 = (SHA-1(m )w) mod q
u 2 = (r w) mod q
v = (g u 1 y u 2 ) mod p ) mod q
Remark: If v = r , then the signature is verified.

7.5 Public-Key Certification

Public-key infrastructure (PKI) is the term used to describe a system that issues,
distributes, and verifies digital certificates. The certificates issued within a PKI can
be used to enable computers and people in entire organizations to authenticate each
other and to secure their communication.
The aim of public-key certification is to guarantee the connection between a
public key and a user and to make it traceable for external parties. This trust anchor
is confirmed by the so-called certificate. In cases in which it is impossible to ensure
that a public key really belongs to a particular person, many protocols are no longer
secure, even if the individual cryptographic components cannot be broken.
Another area using PKIs are internet-connected devices (IoT). The cybersecurity
of IoT affects not only smart home and consumer electronics products, but also
vehicles and industrial plants. PKI is particularly suitable for the provisioning of
digital identities during production and operation.

7.5.1 Impersonation Attacks

Assume Charlie has two key pairs (P K 1 , S K 1 ) and (P K 2 , S K 2 ), where S K denotes
the secret private key and P K the public key. Further, assume that he manages to
palm off P K 1 on Alice as Bob’s public key, and P K 2 on Bob as Alice’s public key
(by falsifying a public key directory).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 370 — #10

i i

370 Hash Functions, Digital Signatures, and Public-Key Infrastructures

Then he can attack as follows:

• Alice wants to send a message to Bob. She encrypts it using P K 1 because
she thinks that this is Bob’s public key. She then signs the message using her
secret key and sends it.
• Charlie intercepts the message, removes the signature and decrypts the mes-
sage using S K 1 . If he wants to, he can then change the message in any way
he likes. He then encrypts the message again, but this time using Bob’s gen-
uine public key, which he has taken from a public-key directory, signs the
message using S K 2 , and forwards it to Bob.
• Bob verifies the signature using P K 2 and will reach the conclusion that the
signature is correct. He then decrypts the message using his secret key.
In this way, Charlie can listen in on communication between Alice and Bob and
change the exchanged messages without them noticing. The attack will also work
if Charlie only has one pair of keys.
The impersonation attack is a kind of “man-in-the-middle attack.” Users
are promised protection against this type of attack by public-key certification,
which is intended to guarantee the authenticity of public keys. The most common
certification method is the X.509 standard.

7.5.2 X.509 Certificate

X.509 [9] is an International Telecommunication Union (ITU) standard defining
the format of public-key certificates.
X.509 certificates play a major role in TLS (for security between a browser
and web server when accessing web pages via HTTPS) and in S/MIME (for signing
and encrypting e-mails),7 but also play a role in offline applications such as the
electronic signature of documents. For a long time, most web server certificates were
created using OpenSSL and the Microsoft CA. This share has dropped drastically
since almost all domain-validated TLS certificates are created by Let’s Encrypt. As
of July 2023, the TLS certificates of 350 million websites come from Let’s Encrypt
[10]. You can get these in an automated process and free of charge.
Each participant who wants to have an X.509 certificate verifying that his pub-
lic key belongs him/her as a real person consults what is known as a certification
authority (CA). If the certificates are not only offered to a closed user group, the
CA is called trust center or more generally trusted third party.
The participant proves his/her identity to this CA (for example by showing
his/her ID) often via a registration authority (RA). This process is called registration.
The CA then issues him/her an electronic document (certificate) that essentially
contains the name of the certificate-holder and the name of the CA, the certificate-
holder’s public key, and the validity period of the certificate. The CA then signs the
certificate using its private key.
Figure 7.4 shows the demonstration PKI in JCT. You can create key pairs and
certificate signing requests for public keys. Afterwards, you can take a look at each
7. In Germany, for example, all university members can receive free email S/MIME certificates. These are
issued by the DFN (https://www.pki.dfn.de/ueberblick-dfn-pki/) after an identification by the local
university.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:44 — page 371 — #11

i i

7.5 Public-Key Certification 371

Figure 7.4 Demonstrations-PKI in JCT.8

375

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 376 — #2

i i

376 Elliptic-Curve Cryptography

Figure 8.1 Prognosis of the key lengths regarded to be safe for RSA and for elliptic curves.

i i

378 Elliptic-Curve Cryptography

therefore make it obvious that elliptic curves are generally based on highly complex
mathematics.
So much for the role of elliptic curves in pure mathematics. In 1985 Neal
Koblitz and Victor Miller independently suggested using elliptic curves in cryptog-
raphy. Elliptic curves have thus also found a concrete practical application. Another
interesting area of application for elliptic curves is for factorizing integers: The RSA
cryptographic system is based on the difficulty/complexity of finding prime factors
of an extremely large number (see Section 5.12). In this area, procedures based on
elliptic curves have been investigated and used since 1987 (see Section 8.8). There
are also prime number tests based on elliptic curves.
So, elliptic curves are used differently in the various areas: Encryption proce-
dures based on elliptic curves are based on the difficulty of the problem known as
elliptic curve discrete logarithm. The factorization of natural composite numbers n
uses the fact that numerous elliptic curves can be generated for n.

8.3 Elliptic Curves: Mathematical Basics

This section provides information about groups and fields. (See Section 5.7.) Didac-
tically very well-prepared introductions into elliptic curves can be found in [5] and
via the CTO plugin “Elliptic Curves” shown in Figure 8.5.

8.3.1 Groups
Because the term group is used differently in everyday language than in mathe-
matics, we will, for reasons of completeness, begin by introducing the essential
statement of the formal definition of a group:

• A group is a nonempty set G on which an operation “·”. The set G is closed

under this operation, which means that for any two elements a, b taken from
G, performing the operation on them gives an element in G (i.e., ab = a · b
lies in G).
• For all elements a, b, and c in G: (ab)c = a (bc) (associative law).
• There exists an element e in G that behaves neutrally with respect to the
operation ·. That means that for all a in the set G : ae = ea = a.
• For each element a in G there exists a unique inverse element a −1 in G such
that: aa −1 = a −1 a = e. The inverse is uniquely determined because if x, y ∈
G are both inverse to a (i.e., ax = xa = e) and ay = ya = e, then x = xe =
x (ay ) = (xa ) y = ey = y.

If, in addition, it applies ab = ba (commutative law) for all a, b in G, then we

call the group an abelian group.
Since we may define different operations on the same set, we distinguish them
by giving them different names (e.g., + addition or · multiplication).
The simplest example of an (abelian) group is the group of integers under
the standard operation of addition. The set of integers denoted as Z =
{· · · , −4, −3, −2, −1, 0, 1, 2, 3, 4, · · · } has an infinite number of elements. For

Let K be a field and 1 be the neutral element of K with respect to the multiplicative
operation “·”. Then the characteristic of K is said to be the order of 1 with respect
to the additive operation. This means that the characteristic of K is the smallest
positive integer n such that
1| + 1 +{z· · · + 1} = 0.
n times

If there is no such n (i.e., if 1 + 1 + · · · + 1 6= 0 no matter how many 1s we add)

then we call K a field with characteristic 0.
Thus, fields with characteristic 0 are infinite since they contain the (pairwise
distinct) elements 1, 1 + 1, 1 + 1 + 1, …. On the other hand, fields with finite
characteristic may be either finite or infinite.
If the characteristic is finite, it has to be prime. This fact can easily be proved:
Assume n = pq, p, q < n, is the characteristic of a field K . By definition of n, the
elements p̄ = 1 | +1+ {z· · · + 1}, q̄ = 1
| +1+ {z· · · + 1} of K are not equal to 0. Thus,
p times q times
there exist inverse elements p̄ −1 , q̄ −1 with respect to multiplication. It follows that
( p̄q̄ )( p̄−1 q̄ −1 ) = 1, which contradicts the fact that p̄q̄ = n̄ = 1| + 1 +
{z· · · + 1} = 0
n times
and, hence, ( p̄q̄ )( p̄ −1 q̄ −1 ) = 0.
| {z }
=0
Comment
The field of real numbers has the characteristic 0; the field Z p has the characteristic
p. If p is not prime, Z p is not a field at all.
The most simple field is Z2 = {0, 1}. It contains only two elements, the neutral
elements with respect to addition and multiplication. In particular, we have 0 + 0 =
0, 0 + 1 = 1 + 0 = 1, 1 + 1 = 0, 1 · 1 = 1, 0 · 0 = 0 · 1 = 1 · 0 = 0.

8.3.2.2 Finite Fields

As mentioned previously, each finite field has a characteristic p 6= 0, where p is
a prime. On the other hand, given a prime p there is a field which has exactly p
elements; that is Z p .
However, the number of elements of a field need not be prime in general. For
example, it is not hard to construct a field with 4 elements: The set K = {0, 1, a, b}
fitted with the operations defined in the following table is a field:

+ 0 1 a b · 0 1 a b
0 0 1 a b 0 0 0 0 0
1 1 0 b a and 1 0 1 a b
a a b 0 1 a 0 a b 1
b b a 1 0 b 0 b 1 a

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 381 — #7

i i

8.4 Elliptic Curves in Cryptography 381

One can show that the order of any field is a prime power (i.e., the power of
a prime number). We can construct a field with p n elements for any given prime
p and positive integer n and denote it by G F ( p n ) or by Fnp . Here G F stands for
Galois Field to commemorate the French mathematician Galois.
Two fields that have the same number of elements cannot be distinguished in
the following sense: If K , K 0 are fields with k = p n elements, then there is a one-
to-one map ϕ : K → K 0 , that respects the arithmetic of the field. Such a map
is called an isomorphism. Isomorphic fields mathematically behave in the same
way so that it makes no sense to distinguish between them. For example, Z2 and
K 0 = {Z E R O, O N E} with zero-element Z E R O and one-element O N E are iso-
morphic. We note that mathematical objects are only defined by their mathematical
properties.
The fields G F ( p ) of prime order play a prominent role. They are called prime
fields and are often denoted by Z p . For prime fields, both additive and multiplica-
tive group are cyclic. Furthermore, each field G F ( p n ) contains a subfield that is
isomorphic to the prime field Z p .

8.4 Elliptic Curves in Cryptography

In general, expressions of the form P = i1 ,...,in ai1 ...in x1i1 . . . xnin with i 1 , . . . , i n ∈ N
P
with coefficients ai1 ...in ∈ K are called polynomials in n variables x1 , . . . , xn with
underlying field K , if deg P := max{i 1 + · · · + i n : ai1 ...in 6= 0} is finite [i.e., the sum
has only finitely many nonzero terms (monomials)]. The sum of the exponents of
the variables of each term of the sum is at most 3, and at least one term of the sum
has a single variable with 3 as value of the corresponding exponent.
We now consider a curve given by the zeros of a polynomial F of degree 3 in 3
variables.
In cryptography, elliptic curves are a useful tool. Such curves are described as
solutions of an equation of the form

i i

8.5 Operating on the Elliptic Curve 383

A coordinate transformation is combination of a rotation and a dilatation of

the coordinate system without changing the elliptic curve itself. A coordinate trans-
formation can result in a simpler version of the Weierstrass equation. Depending
on whether p > 3, different transformations are used, and we obtain
• In case of G F ( p ), p > 3, the elliptic curve equation of the form

y 2 = x 3 + ax + b (8.3)

with 4a 3 + 27b2 6= 0.
• In case of G F (2n ) the elliptic curve equation of the form

y 2 + x y = x 3 + ax 2 + b

with b 6= 0.
These conditions on the parameters a, b ensure that the elliptic equation can be used
in the context of cryptography. Formally we call such curves nonsingular.
The form (8.3) is called the standard form of the Weierstrass equation. If the
characteristic of the field is 2 or 3, we obtain 4 = 0, respectively, 27 = 0, which
means that the condition on parameters a, b collapse. Loosely speaking, this is the
reason why the transformation to the standard form does not work in these cases.
Let |E| denote the number of elements of an elliptic curve E given an underlying
field G F (k ) (for practical reasons either k = √p with p prim or k = 2n ). Then
Hasse’s theorem
√ [6] yields | |E| −√k − 1 | ≤ 2 · k. This inequality is equivalent to
k + 1 − 2 k < |E| < k + 1 + 2 k. In particular, this means that the number of
elements of an elliptic curve is approximately k (for large k).

8.5 Operating on the Elliptic Curve

In order to work with elliptic curves in practice, we define an operation (often

written in an additive way +) on the set of points on the curve. If we have a curve
over the field G F ( p ), we define the commutative operation + by
1. P + O = O + P = P for all P ∈ E;
2. For P = (x, y ) and Q = (x, −y ) we set P + Q = O;
3. For P1 = (x1 , x2 ), P2 = (x2 , y2 ) ∈ E with P1 , P2 6= O and (x2 , y2 ) 6=
(x1 , −y1 ) we set P3 := P1 + P2 , P3 = (x3 , y3 ) defined by

x3 := −x1 − x2 + λ2 , y3 := −y1 + λ(x1 − x3 )

with the auxiliary quotient

y −y
1 2


 x1 − x2 if P1 6= P2 ,
λ :=
3x 2 + a
 1 if P1 = P2 .


2 y1

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 384 — #10

i i

384 Elliptic-Curve Cryptography

In particular, we obtain −P = (x, −y ) for P = (x, y ) ∈ E.

If we deal with a curve over the field G F (2n ), we define the operation + in an
analogous way by
1. P + O = O + P = P for all P ∈ E;
2. For P = (x, y ) and Q = (x, x + y ) we set P + Q = O;
3. For P1 = (x1 , x2 ), P2 = (x2 , y2 ) ∈ E with P1 , P2 6= O and (x2 , y2 ) 6=
(x1 , x1 + y1 ) we set P3 := P1 + P2 , P3 = (x3 , y3 ) defined by

x3 := −x1 + x2 + λ + λ2 + a , y3 := y1 + x3 + λ(x1 + x3 )

with auxiliary quotient

y1 + y2


 if P1 6= P2 ,
x1 + x2

λ :=
y1
x1 + if P1 = P2 .


x1

In particular, we obtain −P = (x, −y ) for P = (x, y ) ∈ E.

Note that −(−P ) = (x, x +(x + y )) = (x, 2x + y ) = (x, y ), since the underlying
field has characteristic 2.
One can verify that + defines a group operation on the set E ∪{O}. In particular
this means that the sum of two points is again a point on the elliptic curve. How
his operation works is geometrically visualized in the following subsection.

How to Add Points on an Elliptic Curve

Figure 8.4 shows how points on an elliptic curve over the field of real numbers are
summed up using affine coordinates: (a) doubles a point P, (b) sums up two different
points P and Q. We note that the point infinity O cannot be shown in the affine
plane.

8.5.1 Web Programs with Animations to Add Points on an Elliptic Curve

The best interactive animations about operations on elliptic curves can be found on
the internet in the following:

the curve are taken from this underlying field. If E is an elliptic curve over such a
field and P is a point on the curve E, then we can derive for all positive integers m

{z· · · + P} .
m P := |P + P +
m times
Looking at this operation from the cryptographic point of view, it turns out to be
very interesting because of the following reason: On the one hand, one needs only
log m operations to calculate m P—one simply has to calculate P, 2 P, 22 P, 23 P,
…, write m in a binary form, and finally add all these multiples 2k P of P with
respect to the binary representation of m. On the other hand, it seems to be very
hard to find m given P and Q = m P on E. Of course, we may simply calculate
P, 2 P, 3 P, 4 P, 5 P, . . . and compare each of them with Q. But this will take as much
as m operations.
Yet there is no algorithm known that efficiently derives m given P and G. The
√
best algorithms known so far need about q operations where q is the (largest)
prime factor of p − 1, in case the underlying field is G F ( p ); here m should be
between 1 and q so that √
one needs at most log q operations to calculate m P.
q
However, the quotient log q tends to +∞ very fast for large q.
If we choose sufficiently large parameters (for example, let p be prime and at
least 160 bits long), a computer will easily be able to calculate m P (in less than a
second). The inverse problem, however, to derive m from m P and P, can (still) not
be solved in reasonable time.
This problem is known as the elliptic curve discrete logarithm problem
(ECDLP).
In elliptic-curve cryptography we formally look at points on the elliptic curve as
elements of a group with point addition + as operation. Furthermore, we use only
elliptic curves that have a sufficiently large number of points. However, in special
cases curves may be weak and not useful due to other reasons. For such special
cases the ECDLP can be much easier to solve than in the general case. This means
that one has to look carefully at the parameters when choosing an elliptic curve for
cryptographic applications.
Not useful for cryptography are a-normal (curves over Z p for which the set
E consists of exactly p elements) and supersingular curves (curves for which the
ECDLP can be reduced to the “normal” discrete logarithms in another, smaller
finite field). This means that there are cryptographically useful and nonuseful ellip-
tic curves. Given the parameters a and b, it is possible to determine whether a curve
is useful or not. In many publications one can find parameters that turned out to
be useful for cryptography. The open (scientific) discussion guarantees that these
results take into account latest research.
Given a secure curve, the time that is needed to solve the ECDLP is strongly
correlated with parameter p in case G F ( p ), respectively, n in case of G F (2n ).
The larger these parameters become, the more time an attacker needs to solve the
ECDLP—at least with the best algorithms known so far. Experts recommend bit-
lengths of 200 for p for secure curves. A comparison with the RSA modulus length
shows why elliptic curves are so interesting for applications. We note that the com-
putation effort for signing and encryption is closely related to the bit-length of the

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 387 — #13

i i

8.7 Encryption and Signing with Elliptic Curves 387

parameters. In addition, the initiation process (i.e., the generation of the private-
public-key-pair) becomes more complicated the larger p is. Thus, one looks for the
smallest parameters that still come along with the security required. It is remarkable
that a length of 200 bits for p is sufficient to construct a good elliptic curve that is as
secure as RSA with a 1024-bit RSA modulus (as far as we know today). In short, the
reason for this advantage of ECC lies in the fact that the best algorithms known for
solving the ECDLP need exponential time while the best algorithms for factorizing
are subexponential (number field sieve quadratic sieve or factorizing with elliptic
curves). Hence, the parameters for a cryptosystem that is based on the problem of
factorizing large integers have to be larger than the parameters for a system based
on ECDLP.

8.7 Encryption and Signing with Elliptic Curves

The elliptic curve discrete logarithm problem is the basis for elliptic-curve cryptog-
raphy. Based on this problem, there are different signature schemes. In order to
apply one of these, we need:

• An elliptic curve E with an underlying field G F ( p n ).

• A prime q 6= p and a point G on the elliptic curve E with order q. This
means that qG = O and r G 6= O for all r ∈ {1, 2, . . . , q − 1}. Thus, q is a
factor of the group order (i.e., the number of elements) #E of E. Since q is
prime, G generates a cyclic subgroup of E of order q.

The parameters mentioned are often called domain parameters. They describe
the elliptic curve E and the cyclic subgroup of E on which the signature scheme is
based.

8.7.1 Encryption
Using elliptic curves, one can construct a key exchange protocol based on the
Diffie-Hellman protocol (see Section 6.4.2). The key exchanged can be used for
a subsequent symmetric encryption. We note that in contrast to RSA there is no
pair of private and public key that can be used for encryption and decryption!
In the notation of elliptic curves, the Diffie-Hellman protocol reads as follows:
First, both partners (A and B) agree on a group G and an integer q. Then they choose
r A , r B ∈ {1, 2, . . . , q − 1} at random, derive the points R A = r A G, R B = r B G
on the elliptic curve, and exchange them (using an insecure channel). After that,
A easily obtains R = r A R B ; B gets the same point (R = r A r B G) by calculating
r B R A = r B r A G = r A r B G = R. We note that R A , R B are easy to derive as long as r A
respectively r B are known G. However, the inverse operation, to get R A respectively
R B from r A respectively r B is hard.
Using the best algorithms known so far, it is impossible for any attacker to
obtain R without knowing either r A or r B —otherwise he would have to solve the
ECDLP.
In order to prohibit a man-in-the-middle attack, one may sign the values
G, q, R A , R B as described in Section 7.5.1.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 388 — #14

i i

388 Elliptic-Curve Cryptography

8.7.2 Signing
Using the DSA signature scheme, one can proceed as follows: The signing party
chooses a (nontrivial) number s ∈ Zq , which will be the private key, and publishes
q, G and R = sG. We note that s cannot be obtained from G and R sufficiently—a
fact on which the security of the signature scheme is based.
Given the message m, which should be signed, one first constructs a digital fin-
gerprint using a hash-algorithm h such that h (m ) has its values in {0, 1, 2, . . . , q − 1}.
Thus, h (m ) can be considered as an element of Zq . Then the signing party chooses a
random number r ∈ Zq and derives R = (r1 , r2 ) = r G. We note that the first com-
ponent r1 of R is an element of G F ( p n ). This component will then be projected
onto Zq (i.e., in the case of n = 1 it is interpreted as the remainder of an element of
{0, 1, . . . , p − 1} divided by q. This projection of r1 onto Zq is denoted by r̄1 . Then
one determines x ∈ Zq such that

r x − s r̄1 − h (m ) = 0.

The triple (m, r1 , x ) is then published as the digital signature of message m.

8.7.3 Signature Verification

In order to verify a signature, one has to build u 1 = h (m )/x, u 2 = r̄1 /x in Zq and
derive
V = u 1 G + u 2 Q.

Since we have Q = sG, the point V = (v1 , v2 ) satisfies v1 = u 1 + u 2 s. We note that

this operation (addition) takes place in the field G F ( p n ). The projection of G F ( p n )
on Zq mentioned previously should be chosen in such a way that v̄1 = u 1 + u 2 s is
an element of Zq . Then it follows that

v̄1 = u 1 + u 2 s = h (m )/x + r̄1 s /x = (h (m ) + r̄1 s )/x = r x /x = r.

Since R = r G, we obtain v̄1 = r̄1 (i.e., R and V coincide modulo the projection
onto Zq ).

8.8 Factorization Using Elliptic Curves

There are factorization algorithms based on elliptic curves. The biggest factor found
by factoring compound numbers with elliptic curves (GMP-ECM) has 83 decimal
digits (state July 2023); see the ECMNET project [12]. More precisely, these pro-
cedures exploit the fact that elliptic curves can be defined over Zn (n composite
number). Elliptic curves over Zn do not form a group, because not every point on
such an elliptic curve has an inverse point. This is connected with the fact that, if n
is a composite number, there exist elements in Zn that do not have an inverse with
respect to multiplication mod n. In order to add two points on an elliptic curve over
Zn , we can calculate in the same way as on elliptic curves over Z p .
Addition of two points (on an elliptic curve over Zn ), however, fails if and only
if a factor of n has been found. The reason for this is that the procedure for adding

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 389 — #15

i i

8.9 Implementing Elliptic Curves for Educational Purposes 389

points on elliptic curves gives elements in Zn and calculates the inverse elements
for these (with respect to multiplication mod n) in Zn . The extended Euclidean
algorithm is used here. If the addition of two points (that lie on an elliptic curve
over Zn ) gives an element in Zn that does not have an inverse element in Zn , then
the extended Euclidean algorithm delivers a genuine factor of n.
Factorization using elliptic curves thus principally works as follows: Random
curves over Zn are selected, as well as random points (that lie on this curve),
and added; you thus obtain points that also lie on the curve or find a factor
of n. Factorization algorithms based on elliptic curves therefore work proba-
bilistically. The opportunity of defining a large number of elliptic curves over
Zn allows you to increase the probability of finding two points that you can
add to obtain a factor of n. These procedures are therefore highly suitable for
parallelization.

8.9 Implementing Elliptic Curves for Educational Purposes

Besides the web programs listed in Section 8.5.1, there are not many free didac-
tic programs offering especially ECC under a graphical user interface. Sections
8.9.1 and 8.9.2 explain shortly which corresponding functionality is available in
CrypTool and in SageMath.

Remark: Another interesting educational program was ECvisual, which visualizes

elliptic curves and finite fields, but has not been maintained since 2012. Currently,
neither the static nor the dynamic version can be started on current Ubuntu versions
[13]. This underlines once again how important it is that learning programs are
maintained permanently and reliably so that students and teachers can build on
them.

8.9.1 CrypTool
CT1 offers elliptic curves for the digital signature function4 and for ECC-AES
hybrid encryption.5 Also implemented are the basic algorithms for group opera-
tions, for generating elliptic curves, and for importing and exporting parameters
for elliptic curves over finite fields with p elements ( p prime). The implemen-
tation in CT1 complies with draft number 8 of the IEEE P1363 work group
Standard Specifications for Public-Key Cryptography [14]. The point addition on
elliptic curves is visualized for two types of elliptic curves in CT1 and for 3 types
in JCT.6
Figure 8.6 shows the visualization of the point addition in JCT (the imple-
mentation in JCT is much more mature than that in CT1). Figure 8.5 shows a
visualization of elliptic curves in CrypTool-Online (CTO).

4. The dialog box in CT1 Digital Signatures/PKI F Sign Message offers the EC methods ECSP-DSA and
ECSP-NR (these Nyberg-Rueppel and DSA signatures are based on elliptic curves).
5. See CT1 Encrypt/Decrypt F Hybrid.
6. - CT1 Indiv. Procedures F Number Theory -- Interactive F Point Addition on Elliptic Curves.
- JCT Default Perspective F Visuals F Elliptic Curve Calculations.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 390 — #16

i i

390 Elliptic-Curve Cryptography

Figure 8.6 Addition on an F p type of elliptic curve in JCT.

8.9.2 SageMath
There are many functions around elliptic curves in SageMath. See [15–20].
By Johannes Bauer there is a very nice tutorial (as of 2015) with SageMath and
OpenSSL code [15].

8.10 Patent Aspects

If the field G F (2n ) is used instead of the prime field G F ( p ), one has to make sub-
stantial changes in the implementation. The advantage of G F (2n ) lies in the fact that
calculations in G F (2n ) can be implemented very efficiently using the binary repre-
sentation. In particular, divisions are much easier to process compared to G F ( p )
(this is particularly important in the signature scheme mentioned previously where
a division is needed for processing a signature as well as for the verification).
In order to achieve maximal gain in efficiency, one may choose a field that
allows a special basis like a polynomial basis (useful for software implementations)
or a normal basis (best for hardware implementations). For special n (like, for

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 391 — #17

i i

8.11 Elliptic Curves in Use 391

example, n = 163179181) one may even combine both advantages. However, they
are still nonstandard.
Sometimes only the first component and one additional bit is used as represen-
tation of a point on the elliptic curve instead of the full two components. Since the
first component together with the additional bit is sufficient to derive the full point,
this representation minimizes the memory capacity needed. In particular, for a nor-
mal basis, this point compression can be implemented efficiently. In addition, the
cryptographic protocols themselves become more effective. A disadvantage is, how-
ever, that point compression can be used for about half of all elliptic curves only and
is protected under U.S. patent (US Patent 6141420, Certicon), causing additional
costs. In the general case G F ( p n ) (and also in case n = 1) often so-called affine or
projective coordinates are used. Depending on the application, these coordinates
may result in a gain in efficiency as well.
A comprehensive description of all implementations and their advantages and
disadvantages would go far beyond the scope of this book. We only want to state
that there is a variety of possible implementations for elliptic-curve cryptography,
much more than for RSA. Therefore, there are serious efforts to reduce this large
number of implementation to a few standard implementations. Some standardiza-
tion committees even try to reduce the complexity by focusing on some (prescribed)
curves (ASC approach).
It is still not clear whether these standardization initiatives will be successful
or not. However, without agreed standards, ECC is not likely to become a real
alternative for RSA.

8.11 Elliptic Curves in Use

Today elliptic-curve cryptography is already broadly in use. Besides S/MIME or

TLS, a prominent example is the information network Bonn-Berlin, used for the
exchange of strictly confidential documents between different German federal gov-
ernmental institutions in Berlin and Bonn. The main requirement was security, but
not interoperability.
In Austria ECC has been massively launched for the bank card with digital-
signature function.
Both examples show typical applications for elliptic-curve cryptography: For
high security solutions and for implementations on smart cards in which the key
length is crucial (because of lack of physical memory available).

References

[1] Lenstra, A. K., and E. R. Verheul, “Selecting Cryptographic Key Sizes (1999 + 2001),”
Journal of Cryptology, Vol. 14, 2001, pp. 255–293, https://www.cs.ru.nl/E.Verheul/
papers/Joc2001/joc2001.pdf.
[2] Merkle, J., Elliptic-Curve Cryptography Workshop. 2001.
[3] BSI,“Cryptographic Mechanisms: Recommendations and Key Lengths (Version 2022-
01),” Tech. rep., Technical Guideline TR-02102-1, 2022, https://www.bsi.bund.de/Shared
Docs/Downloads/EN/BSI/Publications/TechGuidelines/TG02102/BSI-TR-02102-1.pdf.
[4] Nagell, T., Introduction to Number Theory, Wiley, 1951.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 392 — #18

i i

392 Elliptic-Curve Cryptography

[5] Bauer, J., “ECC-Tutorial (with Python and SagemathCode),” 2015, https://www.johannes-
bauer.com/compsci/ecc/.
[6] Silverman, J.,The Arithmetic of Elliptic Curves, 2nd ed., Graduate Texts in Mathematics,
Springer, Vol. 106, 2009.
[7] Certicom, “ECC Tutorial,” https://www.certicom.com/content/certicom/en/ecc-tutorial
.html.
[8] Laubrock, T.,“Tutorial mit Java Applets – Krypto-Verfahren basierend auf elliptischen Kur-
ven,” 1999, http://www.warendorf-freckenhorst.de/elliptische-kurven/frame.html (visited
on07/26/2023).
[9] Kebekus, S., “Elliptic Curve Plotter,” https://cplx.vm.uni-freiburg.de/storage/software/
ellipticcurve/wasm/ellipticcurve.
[10] Corbellini, A., “Elliptic Curve Point Addition and Multiplication in R and Fp ,” https://an-
drea.corbellini.name/ecc/interactive/reals-add.html.
[11] Driscoll, M.,“The Animated Elliptic Curve,” https://curves.xargs.org/ (visited on
07/26/2023).
[12] Zimmermann, P., “The ECMNET Project,” 2023, https://members.loria.fr/PZimmer
mann/records/ecmnet.html.
[13] Shene, C.-K., “ECvisual,” 2012, https://pages.mtu.edu/∼shene/NSF-4/.
[14] Working Group P1363, The IEEE P1363 Home Page: Standard Specifications for Public-
Key Cryptography, https://web.archive.org/web/20150405005140/; http://grouper.ieee
.org/groups/1363/.
[15] The Sage Development Team,“Constructions: Elliptic Curves,” https://doc.sagemath.org/
html/en/constructions/elliptic_curves.html.
[16] Stein, W., “An Elliptic Curve Cryptography (ECC) Tutorial,” 2006, http://www
.williamstein.org/simuw06/notes/notes/node12.html.
[17] The Sage Development Team,“Thematic Tutorials: Elliptic Curves,” https://doc.sagemath
.org / html /en / thematic_tutorials/explicit_methods_in_number%20_theory/elliptic_curves
.html.
[18] Stein, W., “Elliptic Curves in Sage—Presentation at Microsoft Research,” 2011. url: https://
www.youtube.com/watch?v=9j_Lj071vek.
[19] De Feo, L.,“Using Elliptic Curves and Isogenies in Sage,” in Jupyter notebook, 2017,
https://defeo.lu/jupyter/notebooks/example.html.
[20] Mastermath, “Elliptic Curves,” worksheet with exercises, 2011, https://cocalc.com/share/
public_paths/9ba3432cea2b7afaa75e400a2bf7815f8c6e64ee/3701-3801%2F3755-Elliptic
%20curves.sagews.
[21] Giry, D., “BlueKrypt: Cryptographic Key Length Recommendation,” Version 32.3, May
2020, https://www.keylength.com/.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 393 — #1

i i

CHAPTER 9
CHAPTER 9
Foundations of Modern Symmetric
Encryption

While number theoretic methods prevail for the construction and analysis of asym-
metric encryption algorithms, modern symmetric encryption algorithms almost
always rely on Boolean algebra, that is on the manipulation of bits. This involves
a quite different kind of mathematics and might be unfamiliar to beginners. There-
fore, in this chapter we attempt a smooth introduction into this mathematical
subject. As previous knowledge, we assume elementary mathematical notions such
as variable and function, and a small knowledge of elementary algebra and number
theory.
Nevertheless, this chapter quickly becomes very mathematical—and is more
mathematical than most of the other chapters in this book. We try to compensate
for this by elaborating on the ideas and making them comprehensible by means of
examples, so that readers can skip the mathematics and still understand the connec-
tions anyway. Extensive use is made of SageMath for this purpose. On the website
(see Section 9.4) there is quite a bit more material for deepening.
Let us start with the description on how to interpret and process bits, and how
to apply functions to them. Such functions are called Boolean functions, named
after George Boole who formalized logic by introducing the elementary logical
operations, and thereby made logic a part of mathematics (logical calculus). Most
modern symmetric ciphers, as well as hash functions, can be expressed as systems
of Boolean functions.
The focus of this chapter is on introducing the mathematical foundations of
ciphers that operate on bits. We won’t define single ciphers in detail but instead
recommend the books by Menezes/Orschot/Vanstone [1], Oppliger [2, 3], Paar and
Pelzl [4], Schmeh [5, 6], and Stamp [7].
A word on nomenclature: In the existing literature these ciphers usually are
called block ciphers or stream ciphers without the prefix “bit.” Sometimes this
usage might cause a misunderstanding since, in particular for stream ciphers,
ciphers could operate on other character sets (alphabets, letters) as their basic
units. For clarity, in case of doubt it’s better to make the “bits” explicit parts of
the notations.
The explanations in this chapter are supported by almost 20 code exam-
ples in SageMath. These are available in separate SageMath scripts of the form
chap09_sample<nnn>.sage that can be executed directly. Within them, functions
and classes written by ourselves are used, which come from the small library
bitciphers.sage which is included in the SageMath examples if necessary. See
Section 9.4.

393

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 394 — #2

i i

394 Foundations of Modern Symmetric Encryption

This being said, we could express the subject of this chapter—(bit)block

ciphers as well as (bit)stream ciphers—in other words as symmetric encryption of
information given by bits.
The mathematical foundations and methods belong to the domains of Boolean
algebra and finite fields.

9.1 Boolean Functions

In Boolean functions, the independent variables xi can only take the values 0 and 1
and the result is 1 bit. n different independent variables can be combined in at most
n
22 different ways. If the result is a bit vector instead of one bit, one speaks of a
vector Boolean function (like an S-box in cryptography).

9.1.1 Bits and Their Composition

On the lowest level, computers operate on bits, or small groups of bits, for example
bytes, or words consisting of 32 or 64 bits depending on the processor architecture.
This text assumes some familiarity with the bits 0 and 1 and with elementary logical
operations such as AND, OR, NOT, and “exclusive or” (XOR). Nevertheless, we
give a short description to make the terminology clear.
Bits have several distinct interpretations: logically as truth values “True” (T)
and “False” (F), algebraically as objects 0 (corresponding to F) and 1 (corresponding
to T). Mathematically they are the elements of the two-element set {0, 1} that in this
chapter is denoted by F2 . Here is why:
Consider the residue class ring of Z modulo 2. This ring has two elements and
is a field since 2 is a prime number. Addition in this field exactly corresponds to
the logical composition XOR, and multiplication corresponds to the logical com-
position AND, as is seen in Table 9.1. Table 9.2 lists the transformation formulas
between the elementary logical and algebraic operations.
Because this algebraic structure as a field plays a predominant role in cryptog-
raphy, we use the common notation Fq for finite fields from algebra (often also
noted as GF(q ) for “Galois field” where q is the number of elements). In this con-
text it also makes sense to use the algebraic symbols + (for XOR) and · (for AND),
and, as is common in mathematics, we often omit the multiplication dot. Cryptog-
raphers instead tend to use the symbols ⊕ and ⊗, that in mathematics are loaded

Table 9.1 The Most Important Compositions of Bits∗

Logical Algebraic
Bits Composition Bits Composition
x y OR AND XOR x y + ·
F F F F F 0 0 0 0
F T T F T 0 1 1 0
T F T F T 1 0 1 0
T T T T F 1 1 0 1
∗ The logical XOR is identical with the algebraic +, the logical AND with
the algebraic · (multiplication).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 395 — #3

i i

9.1 Boolean Functions 395

Table 9.2 Transformation of Algebraic Opera-

tions to Logical Ones and Vice Versa
Algebraic to Logic
x+y = (x ∨ y ) ∧ (¬x ∨ ¬y )
x·y = x∧y

Logic to Algebraic
x∨y = x+y+x·y
x∧y = x·y
¬x = 1+x

with quite different meanings like direct sum and tensor product of vector spaces.
Therefore, in this chapter we avoid them except in diagrams.
For clarification, we explicitly hint at some special aspects of algebraic calcu-
lations in the binary case (or in characteristic 2):
• Two equal summands in a sum cancel out, that is, together give 0. As a
general rule: x + x = 0, or 2x = 0.
• More generally, an even number of equal summands always gives 0 and an
odd number of equal summands gives exactly this summand. As a general
rule: (
0 for even m
m x := |x + ·{z
· · + x} =
x for odd m.
m

• For algebraic manipulations, a subtraction means exactly the same operation

as an addition—plus and minus signs are arbitrarily interchangeable. As a
general rule: x + y = x − y.
• All three binomial formulas, for (x + y )2 , (x − y )2 , (x + y )(x − y ), collapse
to a single one:
( x + y )2 = x 2 + y 2 .

Since mixed term occurs twice, that results in a 0.

9.1.2 Description of Boolean Functions

Let us first define Boolean functions quite naively: A Boolean function is a rule (or
an algorithm) that takes a certain number of bits and produces a new bit from them.
Before rephrasing this naive definition more precisely in mathematical language (see
Definition 9.1) we make it a little more vivid.
As a first simple example, consider AND as a Boolean function: It takes two
bits and produces one new bit by the well-known rules shown in Table 9.1.
For a slightly more complex example take the function f 0 that produces the
value
f 0 (x1 , x2 , x3 ) = x1 AND (x2 OR x3 ) (9.1)

from three bits x1 , x2 , x3 . The mechanism inside this black box can be described
from several different points of view:

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 396 — #4

i i

396 Foundations of Modern Symmetric Encryption

• Mathematically by a formula;
• Informatically by an algorithm;
• Technically by a circuit (or plugging diagram);
• Pragmatically by a truth table (that is the complete lookup table of its
values).

Our sample function f 0 is mathematically defined by (9.1). The corresponding algo-

rithm is also adequately specified by this formula since it has no branching points
or conditional statements. As a circuit we visualize f 0 in Figure 9.1. The truth table
is in Table 9.3.
The term truth table is motivated by the interpretation of the bits in logical cal-
culus: 0 (= F) means false, 1 (= T) means true. The value f (x1 , . . . , xn ) of a Boolean
function f indicates whether the complete expression is true or false whenever the
single input bits x1 , . . . , xn have the respective truth values.
The connection with electrical engineering—that is the connection between
logical calculus and electric circuits—was essentially developed by Claude Elwood
Shannon.

9.1.3 The Number of Boolean Functions

The truth table of f 0 , Table 9.3, suggests an easy way of enumerating all Boolean
functions: Three variables combine to 8 = 23 different input triples, since each input
bit may assume the values 0 or 1 independently of the other ones. Furthermore, a
Boolean function f may assume the values 0 or 1 at each triple independently of
the seven other triples. This makes 8 independent choices of 0 or 1, a total of 28 .
Therefore, the number of Boolean functions of three variables is 256 = 28 .

Figure 9.1 Example of a circuit.

Table 9.3 Example of a Truth Table

x1 x2 x3 f 0 (x1 , x2 , x3 )
0 0 0 0
0 0 1 0
0 1 0 0
0 1 1 0
1 0 0 0
1 0 1 1
1 1 0 1
1 1 1 1

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 397 — #5

i i

9.1 Boolean Functions 397

In the general case we have N = 2n different allocations of the n input variables,

and for each of these N input tuples the function may assume the values 0 or 1. This
makes a total of 2 N different choices. Thus, the general formula is:
n
Theorem 9.1 The number of different Boolean functions of n variables is 22 .
For four variables we have 216 = 65536 different functions. By the for-
mula, the number grows superexponentially with n, and even the exponent grows
exponentially.
All the 16 Boolean functions of two variables are listed in Table 9.4.

9.1.4 Bitblocks and Boolean Functions

Collections of bits are denoted by different names depending on the context: vec-
tors, lists, (n-)tuples, … For certain sizes we often use special denotations such as
bytes or octets (for 8 bits), or words (for 32 or 64 bytes depending on the processor
architecture). In this chapter we usually use the denomination bitblocks which is
common in cryptography. Thus, a bitblock of length n is a list (x1 , . . . , xn ) of bits
where the order matters. There are eight different bitblocks of length 3:

(0, 0, 0), (0, 0, 1), (0, 1, 0), (0, 1, 1), (1, 0, 0), (1, 0, 1), (1, 1, 0), (1, 1, 1).

If the danger of misunderstanding is negligible, we write them as bitstrings without

parentheses or commas:

000, 001, 010, 011, 100, 101, 110, 111. (9.2)

We often use the abbreviation x for (x1 , . . . , xn ). This short form highlights the
fact that we consider bitblocks as objects of their own.
The 2n different bitblocks of length n are the elements of the Cartesian product
F2 = F2 × · · · × F2 . This Cartesian product has a natural structure as a vector space
n

over the field F2 ; bitblocks x and y ∈ Fn2 may be added or multiplied by scalars
a ∈ F2 :

(x1 , . . . , xn ) + ( y1 , . . . , yn ) = (x1 + y1 , . . . , xn + yn ),
a · (x1 , . . . , xn ) = (a · x1 , . . . , a · xn ).

Now we can write down the mathematically exact definition:

Definition 9.1 A Boolean function of n variables is a map

f : Fn2 −→ F2 .

It takes a bitblock of length n as argument, and produces a single bit.

In this text we sometimes denote the set of all Boolean functions on Fn2 by Fn .
n
By Theorem 9.1 its cardinality is #Fn = 22 .

Convention: If we describe a Boolean function by its truth table, we usually order

the truth table lexicographically with respect to x ∈ Fn2 , as in the previous example.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 398 — #6

i i

398 Foundations of Modern Symmetric Encryption

This order corresponds to the natural order of the integers a = 0, . . . , 2n − 1, if

these are expanded in base 2

a = x1 · 2n−1 + · · · + xn−1 · 2 + xn

and assigned to the corresponding bitblocks (x1 , . . . , xn ) ∈ Fn2 .

The lexicographic order orders strings (like a dictionary) by the value of their
first characters—here 0 or 1 with 0 < 1. If the first characters are equal, the order
looks at the second character, and so on. The sequence 011, 100, 101 is in lexico-
graphic order. A counterexample is the sequence 100, 101, 011. Here the third string
begins with 0, which is smaller than the first character of the string preceding it.
The sequence of the eight bitblocks of length 3 in (9.2) is in lexicographic order.

9.1.5 Logical Expressions and Conjunctive Normal Form

For describing Boolean functions in mathematical terms, that is by formulas, there
are two approaches (beyond truth tables):
• In the logical approach, Boolean functions are expressed by disjunctions
(the operation OR, also written as ∨), conjunctions (the operation AND,
also written as ∧), and negations (the operation NOT, also written as ¬).
Compositions of these operations are called logical expressions.
• In the algebraic approach, Boolean functions are expressed by additions +
and multiplications · in the field F2 . Compositions of these operations are
called (binary) polynomial expressions.
In Theorem 9.2 and Theorem 9.3 we see that both approaches describe all Boolean
functions, and that we even can require a certain structure as so-called normal
forms. Of course there are algorithms to switch between the three representa-
tions (truth tables, logical expressions, and binary polynomial expressions); for
all Boolean functions. But we cannot hope that these algorithms are efficient for
large numbers n of variables—even writing down the truth table involves 2n bits.
It seems that the algebraic approach allows a smoother handling of Boolean
functions for cryptologic purposes due to its (yet to explore) more rigid structure.
In contrast, the logical approach more easily leads to a realization in hardware
by circuits since the elementary Boolean operations have direct analogs as circuit
elements (gates).
Since in this subsection the logical approach plays a minor role we state the
result on normal forms without further reasoning. The possibility of a logical
representation (without normalization) will follow as a corollary in Theorem 9.5.
Theorem 9.2 Each Boolean function of n variables x1 , . . . , xn has a representation
of the form (conjunction)

f (x ) = s1 (x ) ∧ . . . ∧ sr (x )

with some index r where the s j (x ) for j = 1, . . . , r each have the form (disjunctions)

s j (x ) = t j 1 (x ) ∨ . . . ∨ t jn j (x )

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 399 — #7

i i

9.1 Boolean Functions 399

with a certain number n j of terms t jk (x ) ( j = 1, . . . , r and k = 1, . . . , n j ), each of

which in turn has the form xi (an input bit) or ¬xi (a negated input bit) for some
index i.
In particular, n j ≤ n for j = 1, . . . , r . Each individual input bit xi occurs in
each of the t jk (x ) either directly, negated, or not at all.
In other words, we can build each Boolean function by first forming a handful
of expressions (the s j (x )) as OR of some of the input bits or their negations, and
then join these expressions by AND (“conjunction of disjunctions”). This normal
form cleanly separates AND- and OR-compositions into two layers—there is no
further intermixture. The example function f 0 from Section 9.1.2 was defined by
the formula
f 0 (x1 , x2 , x3 ) = x1 ∧ (x2 ∨ x3 )
|{z} | {z }
s1 ( x ) s2 ( x )

that already has the conjunctive form from Theorem 9.2 with

n 1 = 1, s1 (x ) = t11 (x ) = x1 , n 2 = 2, t21 (x ) = x2 , t22 (x ) = x3 .

This is no longer true if we expand it:

f 0 (x ) = (x1 ∧ x2 ) ∨ (x1 ∧ x3 )

This example doesn’t display negated input bits. However, in Table 9.4 we see some
of them.
The form of a Boolean function according to Theorem 9.2 is called conjunctive
normal form (CNF). It is not unique.1 Without further explanation we remark that
there is a further simplification as a canonical CNF that guarantees a certain unique-
ness. There is also an analogous disjunctive normal form (DNF) (a “disjunction of
conjunctions”).

9.1.6 Polynomial Expressions and Algebraic Normal Form

We consider (binary) polynomial expressions in the variables x1 , . . . , xn , such as
x12 x2 + x2 x3 + x32 . Since we work over the field F2 only the constants 0 and 1 occur
as coefficients and these don’t show up explicitly.
As 02 = 0 and 12 = 1, or more general a 2 = a for all elements a ∈ F2 , and even
a = a for all exponents e ≥ 1, leads to another simplification of the expressions. As
e

a consequence for binary polynomial expressions we need to consider the variables

x1 , . . . , xn with exponents 0 and 1 only. Therefore, our sample expression may be
written as x1 x2 + x2 x3 + x3 . Another example: x13 x2 + x1 x22 = x1 x2 + x1 x2 = 0.
In general, a monomial expression (or simply monomial) has the form
Y
x I := xi with a subset I ⊆ {1, . . . , n};
i∈I

1. The SageMath class sage.logic.boolformula.BooleanFormula provides transformations of a logical

expression to the CNF by the function convert_cnf(), and to the corresponding truth table by the function
truthtable().

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 400 — #8

i i

400 Foundations of Modern Symmetric Encryption

in other words it is a product of some of the variables where the subset I specifies
the choice of “some.” Here is an illustrative example with n = 3:

I = {2, 3} =⇒ x I = x2 x3 .

The total number of such monomial expressions is exactly 2n , corresponding

to the number of choices of products of n potential factors. Here the empty set
corresponds to an empty product of 0 factors whose usual interpretation is 1. Note
that empty sums are usually are interpreted as 0. Thus,

I = ∅ =⇒ x I = 1

A monomial expression has an immediate interpretation as a Boolean function. At

first sight we don’t know whether all of these functions are distinct, but we’ll see
this in a few moments.
A polynomial expression is a sum of monomial expressions—remember that
we are in the binary case where coefficients take the values 0 or 1 only. Thus, the
most general (binary) polynomial expression has the form
X
aI x I ,
I ⊆{1,...,n}

where all coefficients a I are 0 or 1. In other words we add a subset of the 2n potential
n
monomial expressions, and for this we have 22 choices. All these expressions give
different Boolean functions, but we have yet to prove this. First, we must prove that
each Boolean function has a polynomial expression.
Theorem 9.3 (ANF) For each Boolean function f : Fn2 −→ F2 there are coefficients
a I ∈ F2 (that is = 0 or 1), where I runs through all subsets of {1, . . . , n}, such that
f may be written as a polynomial expression in n variables of the form:
X
f (x1 , . . . , xn ) = aI x I . (9.3)
I ⊆{1,...,n}

Proof
(Induction on n) Start with n = 1. The four Boolean functions of one variable x
are the constants 0 and 1 and the functions given by x and 1 + x (= the negation of
x). They all have the claimed form.
1
Now let n ≥ 1. For x = (x1 , . . . , xn ) ∈ Fn2 we abbreviate (x2 , . . . , xn ) ∈ Fn−
2 as
x . Then we can also write x = (x1 , x ) instead of x = (x1 , . . . , xn ).
0 0

Now take a function f ∈ Fn . For each fixed value b of the first variable x1 , the
choices being b = 0 or b = 1, we consider the function x 0 7→ f (b, x 0 ) of the n − 1
variables that x 0 consists of. By induction (for b = 0 as well as for b = 1) we know
1
f (b, x 0 ) = pb (x 0 ) for all x 0 ∈ Fn−
2

where p0 , p1 are polynomial expressions in x 0 of the desired form:

X X
p0 ( x 0 ) = bJ x J , p1 ( x 0 ) = cJ x J .
J ⊆{2,...,n} J ⊆{2,...,n}

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 401 — #9

i i

9.1 Boolean Functions 401

Therefore,
(
p0 (x 0 ), if x1 = 0,
f (x1 , x ) =
0
for all x = (x1 , x 0 ) ∈ Fn2
p1 (x 0 ), if x1 = 1,

since x1 assumes the values 0 or 1 only. We combine this conditional formula into

f (x1 , x 0 ) = (1 + x1 ) p0 (x 0 ) + x1 p1 (x 0 ) for all x ∈ Fn2 , (9.4)

To check, substitute x1 = 0 or x1 = 1 in (9.4). By expanding the right-hand side

and eliminating repeated monomials we get a polynomial expression in x of the
claimed form:

f (x1 , x 0 ) = p0 (x 0 ) + x1 ( p0 (x 0 ) + p1 (x 0 ))
X X
= bJ x J + (b J + c J ) x 1 x J .
J ⊆{2,...,n} J ⊆{2,...,n}
| {z } | {z }
all monomials without x1 all monomials withx1

The wording of this theorem is mathematically compact. As an illustration
look at the second column of Table 9.4, where the variables are x and y instead of
x1 and x2 , and the coefficients are a, b, c, d instead of a∅ , a{1} , a{2} , a{1,2} . Each
row of the table describes a Boolean function of two variables. The corresponding
polynomial expression is the sum of the terms 1, x, y, x y that have a coefficient
1 in the representation by (9.3), whereas terms with coefficients 0 don’t show up
explicitly.
Theorem 9.3 provides a representation of a Boolean function as a polynomial
expression. This expression is called the algebraic normal form (ANF).2 The ANF
n
is unique: Since the total number of polynomial expressions is 22 , and since they
n
represent all 22 different Boolean functions, all these polynomial expressions must
differ as functions, and furthermore this representation of a Boolean function as a
polynomial expression must be unique. We have shown:
Theorem 9.4 The representation of a Boolean function in algebraic normal form
is unique.
Definition 9.2 The (algebraic) degree of a Boolean function f ∈ Fn is the degree
of its polynomial expression in algebraic normal form,

deg f = max{#I | a I 6= 0}.

It is always ≤ n.

2. The transformation of ANF to truth table and vice versa is provided by the (internal) function
__convert() of the class BoolF(), which comes from the file bitciphers.sage. SageMath’s own module
sage.crypto.boolean_function also provides initialization by a truth table or by a Boolean polynomial,
and functions algebraic_normal_form() and truth_table() for the transformations.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 402 — #10

i i

402 Foundations of Modern Symmetric Encryption

Table 9.4 The 16 Operations on Two Bits (= Boolean Functions of 2 Variables),

Using Table 9.2∗
a b c d ANF Logical Operation CNF
0 0 0 0 0 False constant x ∧ ¬x
1 0 0 0 1 True constant x ∨ ¬x
0 1 0 0 x x projection x
1 1 0 0 1+x ¬x negation ¬x
0 0 1 0 y y projection y
1 0 1 0 1+y ¬y negation ¬y
0 1 1 0 x+y x XOR y XOR (x ∨ y ) ∧ (¬x ∨ ¬y )
1 1 1 0 1+x +y x ⇐⇒ y equivalence (¬x ∨ y ) ∧ (x ∨ ¬y )
0 0 0 1 xy x∧y AND x∧y
1 0 0 1 1 + xy ¬(x ∧ y ) NAND (¬x ) ∨ (¬y )
0 1 0 1 x + xy x ∧ (¬y ) x ∧ (¬y )
1 1 0 1 1 + x + xy x =⇒ y implication (¬x ) ∨ y
0 0 1 1 y + xy (¬x ) ∧ y (¬x ) ∧ y
1 0 1 1 1 + y + xy x ⇐= y implication x ∨ (¬y )
0 1 1 1 x + y + xy x∨y OR x∨y
1 1 1 1 1 + x + y + xy ¬(x ∨ y ) NOR (¬x ) ∧ (¬y )
∗ The order of the first column is lexicographic if a, b, c, d are considered in reverse order.

The degree indicates how many different variables maximally occur in a monomial
of the ANF.

Example Independently of the number of variables there are exactly two Boolean
functions of degree 0: the two Boolean constants 0 and 1.
Functions of degree ≤ 1 are called affine functions. They are a sum of a constant and
a Boolean linear form; see Section 9.1.9. If the degree is > 1 the function is called
nonlinear, even though the denomination “nonaffine” would be more accurate.

Example The Boolean function given by x 7→ x1 x2 + x2 x3 + x3 has degree 2.

Remark Boolean functions have a high degree not by high powers of some variables
but only by large products of different variables. Each single variable occurs with
exponent at most 1 in each monomial of the ANF. Another way to express this fact
is to say that all partial degrees (the degrees in the single variables xi without regard
for the other variables) are ≤ 1.

9.1.7 Boolean Functions of Two Variables

All the 24 = 16 Boolean functions of two variables x and y are enumerated in
Table 9.4, as polynomial expressions in algebraic normal form a +bx +cy +d x y, and
as logical expressions. The parameters a I from Theorem 9.3 translate as follows:
a = a∅ , b = a{1} , c = a{2} , d = a{1,2} , the input variables as x = x1 , y = x2 .
We have already seen that each Boolean function admits a polynomial expres-
sion. To show that each Boolean function also admits a logical expression we only
have to make sure that the algebraic operations + and · have expressions by the log-
ical operations ∨, ∧, and ¬. To see this, look at the corresponding rows of Table 9.4.
Thus, we have shown (as a weak form of the here unproven Theorem 9.2):
Theorem 9.5 Each Boolean function admits a logical expression, that is a repre-
sentation by a composition of the logical operations ∨, ∧, and ¬.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 403 — #11

i i

9.1 Boolean Functions 403

Hint In the algebraic interpretation the logical negation ¬ corresponds to the

addition of 1.

Remark The analogous form3 of the ANF for a Boolean function of three variables
x, y, z is

(x, y, z ) 7→ a + bx + cy + dz + ex y + f x z + gyz + hx yz.

Here we see eight coefficients a, . . . , h. This fits the observations that:

• A Boolean function of three variables has up to 8 = 23 monomials;
3
• The number of such functions is 22 = 28 = 256.

Example What is the ANF of the function f 0 from Section 9.1.2, written as
f 0 (x, y, z ) = x ∧ ( y ∨ z ) and using the variables x, y, z? By Table 9.4 we have
( y ∨ z ) = y + z + yz, whereas ∧ simply is the multiplication in the field F2 . Hence,

f 0 (x, y, z ) = x · ( y + z + yz ) = x y + x z + x yz,

and by the way we see that, the degree of f 0 is 3.

Remark From Table 9.4 we might directly read off a naive algorithm for translating
logical expressions into (binary) polynomial expressions, and vice versa.

9.1.8 Boolean Maps

Cryptographic algorithms usually produce several bits at once, not only single bits.
An abstract model for this is a Boolean map, that is a map4

q
f : Fn2 −→ F2

with natural numbers n and q.

The instances of f are bitblocks of length q. Decomposing them into their
components,
q
f (x ) = ( f 1 (x ), . . . , f q (x )) ∈ F2 ,

3. In this formula, the letter f (in contrast with the common use in this text) denotes a coefficient, not a func-
tion. Mathematicians almost always use letters as symbols relative to the context, and only in exceptional
cases with an absolute meaning. Such exceptions are the numbers e, i, and π. But even i often denotes (in
contexts without complex numbers) some other object, for example an index in a sum. Or sometimes e is
used as exponent, or coefficient.
4. The distinction between the concepts of function and map is somewhat arbitrary. Mathematicians often use
them to indicate whether the values belong to a one-dimensional or multi-dimensional domain. Boolean
maps (as systems of Boolean functions) are often denoted as vector valued Boolean functions or vectorial
Boolean functions (VBF).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 404 — #12

i i

404 Foundations of Modern Symmetric Encryption

q
we see that we may interpret a Boolean map to F2 as a q-tuple (or system) of Boolean
functions
f 1 , . . . , f q : Fn2 −→ F2 .
q
Definition 9.3 The (algebraic) degree of a Boolean map f : Fn2 −→ F2 is the
maximum of the algebraic degrees of its components,

deg f = max{deg f i | i = 1, . . . , q}.

q
Theorem 9.6 Each Boolean map f : Fn2 −→ F2 has a unique representation as
X
f (x1 , . . . , xn ) = x I aI
I ⊆{1,...,n}

q
with a I ∈ F2 , and monomials x I as in Theorem 9.3.
This representation of a Boolean map is also called algebraic normal form.
It results from combining the algebraic normal forms of its component functions
f 1 , . . . , f q . Compared with Theorem 9.3 the x I and a I occur in reversed order. This
follows the convention that usually scalars (here the x I ∈ F2 ) precede vectors (here
q
the a I ∈ F2 ). The a I are the q-tuples of the respective coefficients of the component
functions.

Example
Define a Boolean map g: F32 −→ F22 by a pair of logical expressions in three variables
x, y, z: !
x ∧ ( y ∨ z)
g (x, y, z ) :=
x∧z

where the components are written below each other, in column form, for clarity.
We recognize the function f 0 as the first component. The second component is the
product x · z. Hence, the ANF of g is
! ! ! !
x y + x z + x yz 1 1 1
g (x, y, z ) = = xy · + xz · + x yz · .
xz 0 1 0

The algebraic degree is 3, and the value table is in Table 9.5. Here the values
g (x, y, z ) ∈ F22 of g are written as bitstrings of length 2.

9.1.9 Linear Forms and Linear Maps

A Boolean function f : Fn2 −→ F2 is called a linear form if it has degree 1 and
absolute term 0. This means that its algebraic normal form has linear terms only:

n
X
f (x ) = si xi for all x = (x1 , . . . , xn ) ∈ Fn2
i =1

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 405 — #13

i i

9.1 Boolean Functions 405

Table 9.5 The Value Table

of a Sample Boolean Map
x y z g (x, y, z )
0 0 0 00
0 0 1 00
0 1 0 00
0 1 1 00
1 0 0 00
1 0 1 11
1 1 0 10
1 1 1 11

with si ∈ F2 for i = 1, . . . , n. Because the si are 0 or 1 a linear form is a partial sum

X
f (x ) = α I (x ) = xi for all x = (x1 , . . . , xn ) ∈ Fn2
i∈I

over a subset I ⊆ {1, . . . , n} of all indices, namely

I = {i | si = 1}.

In particular there are exactly 2n Boolean linear forms in n variables, and they
correspond in a natural way to the power set P({1, . . . , n}).
Other common notations are (for I = {i 1 , . . . , ir }):

f (x ) = α I (x ) = x [ I ] = x [i 1 , . . . , ir ] = xi1 + · · · + xir .

Theorem 9.7 relates the definition with the notion of linear forms from linear
algebra:
Theorem 9.7 A Boolean function f : Fn2 −→ F2 is a linear form if and only if the
following two conditions hold:
(i) f (x + y ) = f (x ) + f ( y ) for all x, y ∈ Fn2 .
(ii) f (ax ) = a f (x ) for all a ∈ F2 and all x ∈ Fn2 .
Proof
The representation by partial sums shows that each linear form meets the two
conditions mentioned.
For the reverse direction let f be a Boolean function with (i) and (ii). Let e1 =
(1, 0, . . . , 0), …, en = (0, . . . , 1) be the canonical unit vectors. Then each x =
(x1 , . . . , xn ) ∈ Fn2 is a sum

x = x1 e1 + · · · + xn en .

Hence,

f (x ) = f (x1 e1 ) + · · · + f (xn en ) = x1 f (e1 ) + · · · + xn f (en )

is the partial sum of the xi over the index set consisting of the i for which the
constant value f (ei ) is 1. Therefore, f is a linear form in the sense of the previous
definition.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 406 — #14

i i

406 Foundations of Modern Symmetric Encryption

q
A Boolean map f : Fn2 −→ F2 is called linear if all of its component functions
f 1 , . . . , f q are linear forms. As in the case q = 1 we can show Theorems 9.8 and 9.9.
q
Theorem 9.8 A Boolean map f : Fn2 −→ F2 is linear if and only if the following
two conditions hold:
(i) f (x + y ) = f (x ) + f ( y ) for all x, y ∈ Fn2 .
(ii) f (ax ) = a f (x ) for all a ∈ F2 and all x ∈ Fn2 .
q
Theorem 9.9 A Boolean map f : Fn2 −→ F2 is linear if and only if it has the form
n
X
f (x ) = xi si
i =1

q
with si ∈ F2 .
(Here again the xi and si are written in reverse order.)
Affine (Boolean) maps are maps of algebraic degree ≤ 1. They result from
adding linear maps and constants.
In the case q = 1, that is for functions, the only possible constants are 0 and
1. Adding the constant 1 effects a logical negation, that is a flipping of all bits.
Therefore we can say the affine Boolean functions are the linear forms and their
negations.

9.1.10 Systems of Boolean Linear Equations

Linear algebra over the field F2 is quite simple; many complications known from
other mathematical areas boil down to trivialities. Such is the case for the solution
of systems of linear equations, explicitly written as

a11 x1 + ··· + a1n xn = b1

.. .. ..
. . .
am 1 x 1 + · · · + amn xn = bm

with given ai j and bi ∈ F2 , and unknown x j for which we search solutions. In

matrix terms this system has an elegant expression as

Ax = b

where A is an m × n matrix, and x and b are column vectors, that is n × 1 or m × 1

matrices.

9.1.10.1 Systems of Linear Equations in SageMath

To clarify the relation with common linear algebra we consider an example of a
system of linear equations over the rational numbers:

x1 + 2x2 + 3x3 = 0
3x1 + 2x2 + x3 = −4
x1 + x2 + x3 = −1

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 407 — #15

i i

9.1 Boolean Functions 407

and study how to handle this in SageMath. The complete solution is in SageMath
Example 9.1. Here are the single steps:
 
1 2 3
1. Define the coefficient matrix A = 3 2 1.
1 1 1
2. Define the image vector b = (0, −4, 1).
3. Let SageMath calculate a solution vector x. Since we wrote the left-hand
side of the system as matrix product Ax we have to use the method
solve_right().
4. Our system of linear equations could admit several solutions. We find them
all by solving the corresponding homogeneous system Az = 0 replacing the
right-hand side b by 0. If z is a solution of the homogeneous system, then
A · (x + z ) = Ax + Az = b + 0 = b, so x + z is a solution of the original
(inhomogeneous) system. In this way we get all solutions. This is because if
Ax = b and Ay = b, then A· ( y −x ) = 0, hence the difference y −x solves the
homogeneous system. For the solution of the homogeneous system we use
the SageMath method right_kernel(). In SageMath, the default kernel of
a matrix A is the left kernel, (i.e., the space of vectors z, such that z A = 0).
Therefore, the right kernel is needed here.
5. The output appears somewhat cryptic. It says that all solutions of the
homogeneous system are multiples of the vector z = (1, −2, 1). Since all
coefficients were integers SageMath worked over Z (= Integer Ring).
6. We verify the solution y = x − 4z by checking that Ay = b.

SageMath Example 9.1: Solution of a System of Linear Equations Over Q

print ("\n# CHAP09 -- Sage -Script -SAMPLE 010: =========")

A = Matrix ([[1,2,3],[3,2,1],[1,1,1]])
b = vector ([0 ,-4 ,-1])
x = A.solve_right(b)
print ("x = ", x)

LK = A.kernel () # left kernel: just for information

print ("LK = ", LK)

RK = A.right_kernel () # right kernel: this is needed here

print ("RK = ", RK)

## LEBM = A.kernel ().matrix ()

## print (" lvvv", vector(LEBM))
## y = x - 4*vector(LEBM)
REBM = A.right_kernel ().matrix () # calculate REBM dynamically , �
� instead
print (" vec_REBM =", vector(REBM)) # of setting the known solution like
y = x - 4*vector(REBM) # y = x - 4*vector ([1 ,-2,1])

print ("y = ", y)

print ("A*y =", A*y)
print ("A*y==b :", A*y == b )

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 408 — #16

i i

408 Foundations of Modern Symmetric Encryption

SageMath Example 9.1 (continued)

#------------------------------------
# CHAP09 -- Sage -Script -SAMPLE 010: =========
# x = (-2, 1, 0)
# LK = Free module of degree 3 and rank 1 over Integer Ring
# Echelon basis matrix:
# [ 1 1 -4]
# RK = Free module of degree 3 and rank 1 over Integer Ring
# Echelon basis matrix:
# [ 1 -2 1]
# vec_REBM = (1, -2, 1)
# y = (-6, 9, -4)
# A*y = (0, -4, -1)
# A*y==b : True

9.1.10.2 Systems of Linear Equations in the Boolean Case

In the general case (over an arbitrary field) the underlying algorithm for solving a
system of linear equations is Gaussian elimination. This algorithm of course also
hides in the SageMath method solve\_right().
In the Boolean case (over the field F2 ), the solution of a system of linear
equations by Gaussian elimination is extremely simple since all coefficients are 0 or
1, and multiplication and division are completely trivial. We don’t need to deal with
complicated coefficients (such as fractions over Q), or inexact coefficients (such as
floating point numbers over R). So simple is the method that even for six unknowns,
calculating by paper and pencil almost outperforms the feeding of the correspond-
ing small SageMath program with the correct input values. The following example
will illustrate this effect.
The idea of elimination is: reduce a system of m equations with n unknowns to
a system with only n − 1 unknowns, or “eliminate” one unknown.
Case 1 xn only occurs with coefficients ain = 0 for i = 1, . . . , m. In other words,
xn doesn’t occur at all. Then the system is already reduced.
Case 2 xn has coefficient 1 in one of the equations. Then solve this equation for
xn , and substitute the resulting expression for xn ,
xn = ai 1 x1 + · · · ai,n−1 xn−1 + bi ,
in the other m − 1 equations. Then, the remaining equations contain only
the unknowns x1 , . . . , xn−1 .
Continue recursively until there remains only one unknown or one equation. Now
for the example that illustrates this simple procedure.

Example with m = 5 and n = 6

x1 +x3 +x6 = 1
x1 +x2 +x4 +x6 = 0
x2 +x3 +x5 +x6 = 0
x1 +x4 +x5 = 1
x2 +x4 +x5 = 1

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 409 — #17

i i

9.1 Boolean Functions 409

From the first equation we get x6 = x1 + x3 + 1 (using the rule that plus
and minus are the same). Elimination results in a reduced system consisting of the
equations 2 to 5 (note x1 + x1 = 0, etc.):

x2 +x3 +x4 = 1
x1 +x2 +x5 = 1
x1 +x4 +x5 = 1
x2 +x4 +x5 = 1

Solving the second equation of the reduced system for x5 and substituting x5 =
x1 + x2 + 1 in the other ones gives

x2 +x3 +x4 = 1
x2 +x4 = 0
x1 +x4 = 0

Now the last two equations yield x4 = x2 = x1 , and then the first one yields
x3 = 1. Thus, the complete solution is

x1 = x2 = x4 = x6 = a with a ∈ F2 arbitrary, x 3 = 1, x 5 = 1.

Since a may assume the values 0 and 1 our result consists of exactly two
solutions: (0, 0, 1, 0, 1, 0) and (1, 1, 1, 1, 1, 1).

The Example in SageMath

SageMath Example 9.2 shows the solution in SageMath code. The SageMath
method solve_right() gives the solution (0, 0, 1, 0, 1, 0) only. To get all solutions
we have to solve the homogeneous system. Its solutions are the multiples of the
vector v = (1, 1, 0, 1, 0, 1), that is, the two vectors (0, 0, 0, 0, 0, 0) = 0 · v and
(1, 1, 0, 1, 0, 1) = 1 · v. Thus, the second solution of the inhomogeneous system is
(0, 0, 1, 0, 1, 0) + (1, 1, 0, 1, 0, 1) = (1, 1, 1, 1, 1, 1).

SageMath Example 9.2: Solution of a System of Boolean Linear Equations

print ("\n# CHAP09 -- Sage -Script -SAMPLE 020: =========")

M = MatrixSpace(GF(2), 5, 6) # GF(2) = field with two elements

A = M([[1,0,1,0,0,1],[1,1,0,1,0,1],[0,1,1,0,1,1],[1,0,0,1,1,0],\
[0,1,0,1,1,0]])
print ("A :\n", A, sep ="") # sep ="" prevents print adding a space �
� between arguments
b = vector(GF(2) ,[1,0,0,1,1])
x1= A.solve_right(b)
print ("x1 :", x1)
K = A.right_kernel ()
print ("K :", K)
REBM = A.right_kernel ().matrix ()
x2 = x1 + vector(REBM)
print ("x2 :", x2)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 410 — #18

i i

410 Foundations of Modern Symmetric Encryption

SageMath Example 9.2 (continued)

#------------------------------------
# CHAP09 -- Sage -Script -SAMPLE 020: =========
# A :
# [1 0 1 0 0 1]
# [1 1 0 1 0 1]
# [0 1 1 0 1 1]
# [1 0 0 1 1 0]
# [0 1 0 1 1 0]
# x1 : (0, 0, 1, 0, 1, 0)
# K : Vector space of degree 6 and dimension 1 over Finite Field of �
� size 2
# Basis matrix:
# [1 1 0 1 0 1]
# x2 : (1, 1, 1, 1, 1, 1)

9.1.10.3 Estimate of the Costs

What about the costs of solving a system of Boolean linear equations in general?
Consider m equations with n unknowns. Then the matrix A of coefficients has size
m · n. The expanded matrix ( A, b) has size m · (n + 1).
We only aim at a coarse estimate and neglect possible optimizations of the
procedure. For simplicity, we assume m = n. In the case m > n we would ignore
additional equations (of course, we must then check if the solutions we found also
satisfy the additional equations). In the case that m < n we would append null
equations (of the kind 0 · x1 + · · · + 0 · xn = 0).
The elimination step, that is the reduction of the problem size from n to n − 1,
amounts to exactly one pass through all n rows of the expanded matrix:
• At first we search the first entry 1 in column n, consisting of the coefficients
of xn . This costs at most n single bit comparisons.
• Then we add the chosen row (containing the first entry 1 in column n) to
all those rows below it that also contain a 1 in column n. This amounts (per
row) to a single bit comparison and up to n bit additions—we ignore the nth
entry since we know already that it becomes 0.
All in all this makes n bit comparisons and at most n · (n − 1) bit additions, a total of
at most n 2 bit operations. Let N (n ) be the number of bit operations for the complete
solution of the system. Then we have the following inequality:
N (n ) ≤ n 2 + N (n − 1) for all n ≥ 2.
Now N (1) = 1: We only have to check the one coefficient of the one unknown
whether it is 0 or 1. From this we decide whether the equation has a unique solution
(for coefficient 1), or whether it is never true (coefficient 0, right-hand side b = 1),
or whether it is true for arbitrary values of the unknown (coefficient 0, right-hand
side b = 0).
Then we conclude N (2) ≤ 22 + 1, N (3) ≤ 32 + 22 + 1, and so on. By induction,
we immediately get
Xn
N (n ) ≤ i 2.
i =1

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 411 — #19

i i

9.1 Boolean Functions 411

The explicit value of this sum is well-known, and we have shown:

Theorem 9.10 The number N (n ) of needed bit comparisons and bit additions
for solving a system of n Boolean linear equations with n unknowns is upper
bounded by
1
N (n ) ≤ · n · (n + 1) · (2n + 1).
6

A somewhat more sloppy wording of this result expresses the cost as O(n 3 ). In
any case it is polynomial of small degree in terms of the problem size n.

Remark The notation by “O” obscures the difference with the cost over arbitrary
fields that is generally bounded by O(n 3 ). The “felt” much better performance in
the Boolean case is partly founded by the exact estimate in Theorem 9.10 that even
in the worst case is about 13 · n 3 . Moreover, in the Boolean case we count simple bit
operations only, and not arithmetic operations or floating point instructions that
are significantly more expensive.

9.1.11 The Representation of Boolean Functions and Maps

We will discuss here what exactly is meant by the different representation forms of
Boolean functions.

9.1.11.1 Various Interpretations of Bitblocks

We used the term bitblock for a variety of slightly different objects. A bitblock
b = (b1 , . . . , bn ) ∈ Fn2 describes:

• A vector b ∈ Fn2 written as a row or a column. This is the primary meaning

of the term bitblock.
• An argument of a Boolean function or map of n variables, also used as row
index of a value table (or truth table).
• A bitstring of length n.
• A subset I ⊆ {1, . . . , n} defined by b as indicator: i ∈ I ⇔ bi = 1.
• A linear form α on Fn2 expressed as sum of the variables xi with bi = 1. The
evaluation of α comes down to the scalar product of vectors: α (x ) = b · x.
• A monomial in n variables x1 , . . . , xn with all partial degrees ≤ 1. In this
interpretation bi specifies the exponent 0 or 1 of the variable xi .
• An integer between 0 and 2n − 1 in binary representation (that is in the base-2
system). The sequence of binary digits (bits) is identical with the correspon-
ding bitstring.5 Conversely, the integer is the index (beginning with 0) of the
bitstring when the bitstrings are lexicographically ordered in a list.

Of course there are further interpretations—after all, each piece of information

has a binary coding. The bitblocks for n = 3 are listed in Table 9.6.

5. The SageMath method binary() transforms an integer to a bitstring, suppressing leading zeros. Example:
10.binary() yields '1010'.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 412 — #20

i i

412 Foundations of Modern Symmetric Encryption

Table 9.6 Interpretations of Bitblocks of Length 3

Integer Bitstring Subset Linear Form Monomial
0 000 ∅ 0 1
1 001 {3} x3 x3
2 010 {2} x2 x2
3 011 {2, 3} x2 + x3 x2 x3
4 100 {1} x1 x1
5 101 {1, 3} x1 + x3 x1 x3
6 110 {1, 2} x1 + x2 x1 x2
7 111 {1, 2, 3} x1 + x2 + x3 x1 x2 x3

9.1.11.2 Representation of the Truth Table of a Boolean Function

Section 9.1.11.1 described (and Table 9.6 illustrated) how to interpret the bitblocks
x = (x1 , . . . , xn ) of length n as integers i (x ) = 0, 1, . . . , 2n − 1 in base-2 representa-
tion. The example in Table 9.7 suggests how to describe the truth table of a Boolean
function f : Fn2 −→ F2 in a parsimonious way by a bitblock b = (b0 , . . . , b2n −1 ) of
length 2n : simply take the last column in the order given by the indices i (x ). The
general procedure for arbitrary n runs as follows:

bi (x ) = f (x ) where i (x ) = x1 · 2n−1 + · · · + xn−1 · 2 + xn

for x = (x1 , . . . , xn ) ∈ Fn2 .

This might look entangled, but it simply means: Interpret x as the base-2 represen-
tation of an integer i (x ), and set f (x ) as the bit at position i (x ) from the bitblock
b. An additional column i (x ) in the truth table of the function f 0 ( f 0 was defined
in Formula 9.1) illustrates this procedure (see Table 9.7). The last column of this
table, written in row form, is the bitblock b.
In this way, the bitblock (0, 0, 0, 0, 0, 1, 1, 1) or, even more parsimoniously, the
bitstring
00000111

of length 23 = 8 completely specifies the truth table of f 0 .

9.1.11.3 Representation of the Algebraic Normal Form

The algebraic normal form is a sum of monomials. Each monomial is a product
of a subset of {x1 , . . . , xn }, and hence has a representation as an integer between

Table 9.7 An Extended Truth Table [for f 0 (x1 , x2 , x3 ) =

x1 ∧ (x2 ∨ x3 )] with n = 3 and 2n = 8
x1 x2 x3 i (x ) f 0 (x1 , x2 , x3 )
0 0 0 0 0
0 0 1 1 0
0 1 0 2 0
0 1 1 3 0
1 0 0 4 0
1 0 1 5 1
1 1 0 6 1
1 1 1 7 1

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 413 — #21

i i

9.1 Boolean Functions 413

0 and 2n − 1. So the ANF has a characterization by 2n bits: the coefficients of the

2n different monomials (see Theorem 9.3). Therefore, we may view a bitblock a =
(a0 , . . . , a2n −1 ) as representation of the ANF of a Boolean function f : Fn2 −→ F2
in the following way:
n −1
2X
e (i ) e (i )
f (x ) = ai x11 · · · x nn where i = e1 (i ) · 2n−1 + · · · + en (i )
i =0

with e1 (i ), . . . , en (i ) = 0 or 1.

This formula means: Interpret the n-tuple e of exponents of a monomial as the base-
2 representation of an integer i. The ith element of the bitblock a indicates whether
this monomial occurs in the ANF of f or not.
For the sample function f 0 (see Table 9.7) we already saw (or can easily check,
because f (1, 1, 1) = 1 + 1 + 1 = 1 since we add mod 2) that the ANF is

f 0 (x ) = x1 x3 + x1 x2 + x1 x2 x3 .

It involves the monomials with exponent triples 101, 110, 111 that correspond to
the integers 5, 6, 7. Therefore, we set the bits at the positions 5, 6, 7 to 1, and
the remaining bits to 0, and get the parsimonious representation of the ANF by
a bitstring:
00000111.

Warning This is the same bitstring as for the truth table by pure chance—a spe-
cial property of the function f 0 ! The function f (x1 , x2 ) = x1 has truth table
0011 (it takes the value 1 if and only if x1 = 1, or if the argument has the form
x = (1, any bit)) and ANF 0010 (since it contains the single monomial x1 ).

The SageMath class BoolF() has a method for calculating the ANF6 . SageMath
Example 9.3 demonstrates its application to f 0 .

SageMath Example 9.3: A Boolean Function with Truth Table and ANF
print ("\n# CHAP09 -- Sage -Script -SAMPLE 030: =========")

load ("./ bitciphers.sage ")

bits = "00000111"
x = str2bbl(bits)
print ("x :", x)
f = BoolF(x)
y = f.getTT ()
print ("y :", y)
z = f.getANF ()
print ("z :", z)

6. This transformation that converts a bitstring of length 2n (the truth table) into another bitstring of length
2n (the coefficient list of the ANF) is sometimes called Reed-Muller transformation or binary Moebius
transformation.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 414 — #22

i i

414 Foundations of Modern Symmetric Encryption

SageMath Example 9.3 (continued)

#------------------------------------
# CHAP09 -- Sage -Script -SAMPLE 030: =========
# x : [0, 0, 0, 0, 0, 1, 1, 1]
# y : [0, 0, 0, 0, 0, 1, 1, 1]
# z : [0, 0, 0, 0, 0, 1, 1, 1]

Remark Evaluating a Boolean function f at all arguments x ∈ Fn2 the naive way
costs 2n evaluations f (x ), each with at most 2n summands, each of which need-
ing at most n − 1 multiplications. Thus, the costs have an order of magnitude of
about n · 2n · 2n . If we relate the costs to the input size N = 2n they are essentially
quadratic: N 2 · log2 ( N ). A common method, binary recursion, or “divide-and-
conquer,” divides a problem into two subproblems of half the input size, and leads
to a significantly more efficient algorithm. Starting from (9.4), finally we achieve a
reduction to almost linear costs 3 N · log2 N . This algorithm, also denoted as fast
binary Moebius transformation, an analogue of the fast Fourier transformation
(FFT), is implemented in the class BoolF().

9.2 Block Ciphers

In classical cryptography the weakness of simple monoalphabetic substitutions is

remedied in two different ways: first by polygraphic substitutions that encrypt
groups of letters at once, second by polyalphabetic substitutions that change the
substitution alphabet depending on the position in the plaintext.
If we consider bits instead of letters, the two principles of hardening monoal-
phabetic substitutions can be transferred to two classes of useful encryption
methods for binary encoded information:

• Block ciphers split bitstrings into blocks of a fixed length and encrypt one
complete block per step.
• Stream ciphers encrypt bit by bit, each one by another substitution (so each
single bit is either unchanged or flipped by a position-dependent rule).

No mathematically complete proof exists for the security of any block or stream
cipher. Thus, the situation is even worse than for asymmetric ciphers where the
proof of security often reduces to a well-studied, if not solved, mathematical prob-
lem. The best we can do is to consider a symmetric cipher as secure if none of the
known attacks is significantly faster than a complete exhaustion of the key space
(also known as “brute-force attack”).

9.2.1 General Description

Block ciphers transform bitblocks of a fixed length n to bitblocks of the same length
controlled by a key that itself is a bitblock of a certain length l. The extension of
this cipher to input bitstrings of arbitrary lengths is the subject of Section 9.2.4. For
the moment we neglect this aspect.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 415 — #23

i i

9.2 Block Ciphers 415

An adequate model of a block cipher is a Boolean map

F : Fn2 × Fl2 −→ Fn2

often interpreted as a family ( Fk )k∈K of Boolean maps

Fk : Fn2 −→ Fn2 for all k ∈ K = Fl2

where Fk (a ) = F (a, k ).

9.2.1.1 Choosing the Key Length

For the key length l we have an obvious criterion: l must be large enough to prevent
an exhaustion of the key space, a “brute force attack.” The key space is the set Fl2 ,
so it contains 2l different keys. We assume that the probabilities for all keys are the
same, that is 1/2l . In other words, we assume that keys are chosen uniformly at
random.
With these assumptions we have a lower bound of about 80 bits for a secure
key length according to the state of the art [8]. Popular ciphers use keys of lengths
128 or more, so have a sufficient security margin. The outdated standard cipher
DES used 56 bit keys. The technology of today breaks it relative quickly.

9.2.1.2 Choosing the Block Length

The block length n should be large enough to prevent analyses of patterns or fre-
quencies. Even more it should prevent leaking any information about the plaintext
into the ciphertext, for example the presence of repetitions.
If the attacker observes about 2n /2 ciphertexts corresponding to random plain-
texts encrypted with the same key, the probability of a collision (by the birthday
paradox) is about 12 . Therefore, this number 2n /2 should exceed the number of
available memory cells. And the key should change frequently—long before this
number of blocks is reached.
From this point of view the frequently used block length 64 is risky. Only fre-
quent key change could justify it, and only if the plaintext contains few repetitions.
A better cipher, as the current standard AES, uses blocks of 128 bits.
These considerations about key or block lengths are typical for the discussion
of security in modern cryptography: We use large security margins and avoid any
weaknesses, no matter how small they might be, even if there is no known practical
attack that uses them. But since we have a broad choice of good and fast ciphers
that provide large security margins there is no need to rely on a weaker cipher, even
if this precaution seems paranoid.

9.2.2 Algebraic Cryptanalysis

Cryptanalysis deals with the analysis of ciphers in order to discover hidden secrets.
Algebraic cryptanalysis is a method of cryptanalysis that mainly uses algebraic
techniques such as equation-solving algorithms. Thus, it involves expressing the
operations in the cipher as a system of equations and replacing some of the variables
with known data to find the key [9].

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 416 — #24

i i

416 Foundations of Modern Symmetric Encryption

An algebraic attack is similar to a brute-force attack in that it can theoretically

break any symmetric cipher, but in practice it is completely useless against any
reasonable cipher.

9.2.2.1 Attacks with Known Plaintext

An attack with known plaintext assumes that the attacker knows or guesses a small
piece of plaintext, and then tries to deduce the key or some more plaintext that is
unknown to her. For the present section we assume that the known plaintext is a
complete bitblock.
Let a block cipher be given by a Boolean map

F : Fn2 × Fl2 −→ Fn2 .

By Theorem 9.6, F is an n-tuple F = ( F1 , . . . Fn ) of polynomial expressions in n + l

variables all of whose partial degrees are ≤ 1.
A known plaintext block a ∈ Fn2 with corresponding ciphertext block c ∈ Fn2
yields a system
F (a, x ) = c

of n polynomial equations for the unknown key x ∈ Fl2 .

Systems of equations of this type (over arbitrary fields) are subjects of algebraic
geometry. The general theory is quite deep, in particular if we search for concrete
solution procedures. However—couldn’t the fact that our polynomials have all their
partial degrees ≤ 1 simplify the problem?

Example 1 Let n = l = 2,

F (a1 , a2 , x1 , x2 ) = (a1 + a2 x1 , a2 + a1 x2 + x1 x2 ),

a = (0, 1), c = (1, 1) ∈ F22 . The equations for the key (x1 , x2 ) ∈ F22 are
! !
1 0 + x1
= .
1 1 + 0 + x1 x2

The immediate solution is x1 = 1, x2 = 0.

Example 2 Linear maps: If F is a linear map, then the system of equations is acces-
sible by the efficient solution algorithms of linear algebra; see Section 9.1.10. We
have n linear equations for l unknowns. If l < n the attacker needs some additional
blocks of known plaintext, or she executes an exhaustion of the remaining n − l
key bits. For this method to work F needs to be linear only in x.
Example 3 Substitution: Often polynomial equations look complex at first sight but
aren’t so. Here is an example (over F2 )

x 1 x 2 x 3 + x 1 x 2 + x 1 x 3 + x 2 x 3 + x 2 + x 3 = 0.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 417 — #25

i i

9.2 Block Ciphers 417

By the substitutions xi = z i + 1 it is transformed to

z1 z2 z3 + z1 = 0

(for an easy proof look in the reverse direction). This has the solutions

z 1 = 0, z 2 , z 3 arbitrary or z 1 = z 2 = z 3 = 1.

Therefore the complete solution of the original equation is

x1 = 1, x2 , x3 arbitrary or x1 = x2 = x3 = 0.

There are two powerful general approaches for solving systems of (polynomial)
equations over F2 :

• SAT solvers [10]: SAT denotes the satisfiability problem of propositional

logic. Consider a logical expression in Boolean variables x1 , . . . , xn and ask
if there exist values of the variables that make the expression True. In other
words, consider a Boolean function f and ask if it assumes the value 1. A
SAT solver is an algorithm that takes a logical expression in CNF and decides
the satisfiability by finding a solution x, or showing there’s no solution.
The naive algorithm uses the truth table and exhausts the 2n possible argu-
ments. However, there are much faster algorithms, the most popular being
the DPLL algorithm (after Davis, Putnam, Logemann, and Loveland) and
BDD based algorithms (Binary Decision Diagram). The SageMath modules
sage.sat.solvers and sage.sat.boolean_polynomials contain some of
these algorithms.
• Elimination using Groebner bases See the dissertation [11], the textbooks
[9, 12, 13], the script [14], or the paper [15].

Both methods work well for a few unknowns. With a growing number of
unknowns their complexity becomes unmanageable (in fact, SAT was the first prob-
lem in history shown to be NP-complete). Of course, we always find a solution
by searching through the complete value table. But this naive method is ineffi-
cient (exponential in the number of unknowns, and so hopeless for 80 or more
unknowns). But also the costs of SAT solvers and Groebner-basis methods grow
exponentially with the number of unknowns. Not even the fact that all partial
degrees are ≤ 1 is of vital help.

9.2.2.2 The Complexity of the Algebraic Attack

The theoretical analysis of the cost for finding a solution leads to one of the central
notions of complexity theory, NP-completeness.

Theorem 9.11 (Garey/Johnson): The problem of finding a solution for a system of

polynomial equations over F2 is NP complete.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 418 — #26

i i

418 Foundations of Modern Symmetric Encryption

We won’t explain the notion NP-complete but only mention that the (up to now
unproven) P 6= NP conjecture implies that an NP-complete problem admits no effi-
cient algorithmic solution, or that there is no solution algorithm whose execution
time grows at most polynomially with the number of input variables.
A common interpretation of this theorem is: For an appropriately chosen block
cipher F : Fn2 × Fl2 −→ Fn2 the attack with known plaintext (against the key k ∈ Fl2 )
is not efficient. However, from a strict mathematical point of view the theorem
doesn’t prove anything of practical relevance:

1. It relates to an algorithm for arbitrary polynomial equations (over F2 ). It

doesn’t contain any assertion for special classes of polynomials, or for a
concrete system of equations.
2. Likewise, it gives a pure proof of (non)existence, and provides no hint as
how to construct a concrete example of a difficult system of equations. Note
that we know that some concrete systems admit easy solutions.
3. Even if we could find concrete examples of difficult systems the theorem
would not make any assertion, whether only some rare instances (the worst
cases) are difficult, or almost all (the generic cases)—and this is what the
cryptologist wants to know. Maybe there is an algorithm that solves poly-
nomial systems for almost all tuples of unknowns efficiently, and only fails
for a few exceptional tuples.

Despite these critical comments the theorem raises hope that there are secure
block ciphers, and the designers of block ciphers follow the:

Rule of thumb Systems of linear equations for bits admit very efficient solutions.
Systems of nonlinear equations for bits in almost all cases admit no efficient
solution.

9.2.3 The Structure of Block Ciphers

In an ideal world we would know how to reliably measure the security of a block
cipher
F : Fn2 × Fl2 −→ Fn2

for realistic values of the block length n and the key length l, say of an order of
magnitude of 128 bit or more.
In fact, we know explicit measures of security, for example the linear poten-
tial, or the differential potential, that quantify the deviation from linearity, or the
algebraic immunity, or others. Unfortunately all of these only give necessary, not
sufficient, conditions for security; and moreover the efficient computability of these
measures is limited to small block lengths n, about 8 or slightly larger.
Lacking a general efficient approach to security, the design of block ciphers usu-
ally relies on a structure that, although not obligatory, in practice seems to provide
plausible security (according to verifiable criteria). Most of the generally approved
standard ciphers, such as DES and AES, follow this approach.
This common design scheme starts by constructing Boolean maps of small
dimensions and then extending them to the desired block length in several steps:

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 419 — #27

i i

9.2 Block Ciphers 419

1. Define one or more Boolean maps of small dimension q (block length of the
definition domain), say q = 4, 6, or 8, that are good for several security
criteria. These maps are called S-boxes, and are the elementary building
blocks of the cipher. (“S” stands for substitution.)
2. Mix the round input with some of the key bits and then apply m S-boxes in
parallel (or apply the one S-box m times in parallel) to get a map with the
desired input width n = mq.
3. Then permute the complete resulting bitblock over its total width.
4. These steps together are a round of the complete scheme. Assess the weak-
nesses of the round map, that mainly result from using S-boxes of small
dimension. Then reduce these weaknesses in a reasonably controlled way
by iterating the scheme over several rounds of the same structure but with a
changing choice of key bits.
5. Don’t stop as soon as the security measures give satisfying values but add
some surplus rounds to get a wide security margin.

Figure 9.2 outlines the scheme for a single round.

The complete scheme is a special case of a somewhat more general proposal
that goes back to Shannon who required two basic features of block ciphers:

Diffusion The bits of the plaintext block “smear” over all parts of the block. This
is achieved by permutations.

Confusion (complex dependencies): The interrelation between plaintext block and

key on the one hand, as well as ciphertext block on the other hand should be as
complex as possible (in particular as nonlinear as possible). Basic building blocks
for this are substitutions.

Figure 9.2 A single round of a block cipher (with S-boxes S, permutation P, and key k).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 420 — #28

i i

420 Foundations of Modern Symmetric Encryption

The overall effect of both requirements, taken together, should result in an

unforeseeable change of ciphertext bits for a slight change of the key.

The attacker should have no means to recognize whether a guessed key is

nearly correct.

For the construction of strong block ciphers Shannon proposed an alternating

sequence of substitutions and transpositions (permutations), so-called SP networks:

S1 (•,k ) P1 (•,k )
Fn2 −→ Fn2 −→ Fn2 −→ . . .
Sr (•,k ) Pr (•,k )
. . . −→ Fn2 −→ Fn2 −→ Fn2

depending on a key k ∈ Fl2 . In this scheme

Si = ith substitution
Pi = ith permutation
Pi ◦ Si = ith round

Altogether the encryption function consists of r rounds.

Note that the permutations are special linear maps P : Fn2 −→ Fn2 . Some recent
block ciphers, the most prominent being AES, replace permutations by more general
linear maps that provide an even better diffusion. However, the proper term LP
network is not yet in use.

9.2.4 Modes of Operation

In this section the key plays no role: Therefore, we omit k in the notation. Consider
a block cipher function f : Fn2 −→ Fn2 . If we want to apply it to longer or shorter
bit sequences we must:
1. Split a bit sequence a into n-bit blocks a1 , …, ar ,
2. Fill (“pad”) the last block ar , if necessary, up to length n with
• Zeros;
• Random values;
• Context information.
Then the most obvious encryption algorithm is encipher the blocks one by one.
This is called ECB mode (for “Electronic Code Book”). In Figure 9.3 this is shown
schematically.
ECB mode simply realizes a monoalphabetic substitution where the blocks in
Fn2 are interpreted as letters. For a sufficiently large n this is secure from a ciphertext-
only attack. However, the cipher leaks information on repeated blocks. For some
plaintexts this is a real danger.
• For example, MS-Word files contain long sequences consisting of the bytes
00000000 and 00000001.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 421 — #29

i i

9.2 Block Ciphers 421

Figure 9.3 ECB mode.

• An even more alarming case is provided by image files with large single-color
areas. They contain many identical blocks such that structures of the image
may appear in the ciphertext file.7

In view of this weakness, generating some additional diffusion between the

plaintext blocks seems a good idea. A simple but effective approach is CBC (cipher
block chaining). Choose a random start value c0 (also called IV for initialization
vector). Then the procedure looks like it does in Figure 9.4.
The formula for encryption in CBC mode is

ci := f (ai + ci−1 ) for i = 1, . . . , r

= f (ai + f (ai−1 + · · · f (a1 + c0 ) . . .)).

Each ciphertext block depends on all previous plaintext blocks (diffusion), and
identical plaintext blocks in general encrypt to different ciphertext blocks.
The formula for decryption is

ai = f −1 (ci ) + ci−1 for i = 1, . . . , r.

Figure 9.4 CBC mode.

7. - For a convincing example see the Wikipedia entry “Block cipher mode of operation.”
- The template in CT2 Startcenter F Templates F Cryptography F Modern F Symmetric F Block
Modes of Symmetric Ciphers allows you to try this with your own pictures.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 422 — #30

i i

422 Foundations of Modern Symmetric Encryption

Question Does it make sense to keep the initialization vector c0 secret and use it as
an additional key component? (Then, for the example of DES, we had 56 proper
key bit plus a 64-bit initialization vector, making a total of 120 key bit.)

Answer No!

Reason In the decryption process only a1 depends on c0 . This means that keeping
c0 secret conceals known plaintext only for the first block. If the attacker knows the
second or any later plaintext block, then she may attack the key as in ECB mode
(by an attack with known plaintext).

There are several other modes of operation. Worth mentioning is that the modes
OFB (output feedback) and CTR (counter) convert a block cipher into a stream
cipher.

9.2.5 Statistical Analyses

For cryptanalyzing block ciphers we know some basic approaches:

1. Exhaustion = brute-force searching the complete key space;

2. Algebraic attack (see Section 9.2.2);
3. Statistical attacks against hidden linearity:

(a) Linear cryptanalysis (LCA) (Matsui/Yamagishi 1992);

(b) Differential cryptanalysis (DCA) (Murphy, Shamir, Biham 1990):
Differential cryptanalysis was known at IBM and NSA as early as
in 1974. Despite linear cryptanalysis is conceptually simpler, it was
unknown to the designers of DES. Accordingly, the resistance of DES
against linear cryptanalysis is suboptimal.
(c) Generalizations and mixtures of (a) and (b).

All these statistical attacks hardly break a cipher in the sense of classical crypt-
analysis. They usually assume lots of known plaintexts, much more than an attacker
could gather in a realistic scenario. Therefore, a more adequate term is analysis
instead of attack. The analyses make sense for finding measures for some partial
aspects of security of block ciphers. They measure security for example by the num-
ber of known-plaintext blocks needed for the attack. If a cipher resists an attacker
even with exaggerated assumptions on her capabilities, then we feel safe to trust it
in real life.
Given an SP network, the analysis starts with the nonlinear components of
the single rounds, in particular with the S-boxes. The next step is extending the
potential attack over several rounds. This shows how the cost of the attack grows
with the number of rounds. In this way we find criteria for the number of rounds
for which the cipher is secure—at least from this special attack.
SageMath scripts applying linear cryptanalysis methods to Lucifer’s S-
box, to Mini-Lucifer, and to two other toy block ciphers can be found at
https://www.cryptool.org/en/documentation/ctbook/sagemath.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 423 — #31

i i

9.2 Block Ciphers 423

In CT2, differential cryptanalysis can be reproduced on three toy ciphers in

detail (each in an automatic and a tutorial mode).8 The CT2 template “Differential
Cryptanalysis Tutorial 3 (Automatic Mode)” shown in Figure 9.5 performs a fully-
automatic attack on a 96-bit toy cipher:

• The DCA Oracle and ToyCipher components generate plaintext-ciphertext

pairs needed by the DCA KeyRecovery component to recover the key using
differential cryptanalysis.
• The DCA PathFinder component find characterstics and paths through the
toy cipher needed by the DCA KeyRecovery component to perform the
actual differential cryptanalysis.
• After five rounds and a total of 160,008 messages (see Figure 9.6), the used
key could be recovered. This key is shown in the TextOutput component
“Output of the key” of Figure 9.5. You can see that the correct key was found
when comparing it to the key in the TextInput component “Input of the key,”
showing the key used to generate more than 160,000 plaintext-ciphertext
pairs.

9.2.6 Security Criteria for Block Ciphers

To escape attacks, block ciphers, their round maps, or their S-boxes, should fulfill
some requirements.

• Balance: All preimages have the same number of elements, or in other

words, the values of the map are uniformly distributed. Irregularities of the
distribution would provide hooks for statistical cryptanalysis.

Figure 9.5 CT2 template “Differential Cryptanalysis Tutorial 3” performing DCA of a toy cipher.

8. See under CT2 Templates F Cryptanalysis F Modern.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 424 — #32

i i

424 Foundations of Modern Symmetric Encryption

Figure 9.6 Presentation view of the DCA KeyRecovery component showing the final state after a successful
differential cryptanalysis of a toy cipher (in CT2).

• Diffusion/avalanche effect: If a single plaintext bit changes, about 50% of

the ciphertext bits change. This effect conceals similarity of plaintexts.
• Algebraic complexity: The determination of preimages or parts thereof
should lead to equations whose solutions are as difficult as possible. This
requirement is related to the algebraic degree of the map, but only in an
indirect way. A suitable measure is algebraic immunity.
• Nonlinearity: We know several criteria that measure linearity, also hidden
linearity, and are relatively easy to describe and to handle. For example,
they quantify how susceptible Boolean maps are for linear or differential
cryptanalysis [16].

◦ The linear potential should be as low as possible, and the linear profile
as balanced as possible.
◦ The differential potential should be as low as possible, and the differential
profile as balanced as possible.

Some of these criteria are compatible with each other; some criteria contradict
other ones. Therefore, the design of a block cipher requires a balance between
different criteria. Instead of optimizing a map for a single criterion the designer
should aim at a uniformly high level for all criteria.

9.2.7 AES
The current standard for modern symmetric ciphers is AES (see Section 1.2.1).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 425 — #33

i i

9.2 Block Ciphers 425

Figures 9.7 and 9.8 show the design of AES9 and the realization of the design
principles derived in Section 9.2.3.

• The block length is n = 128, the key length, l = 128, 192, or 256, the
number of rounds, r = 10, 12, or 14.
• At the beginning of each round and after the last round a partial key is added
to the current bitblock. The complete algorithm involves r + 1 partial keys.
• The 128-bit “partial keys” k (i ) are not partial keys in the proper sense but
extracted from the master key k by a somewhat involved algorithm (key
schedule). They are not independent.
• Each round starts by splitting the current 128-bit block into 16 parts
each consisting of 8 bits. Each of these parts is fed into the same S-box
S : F82 −→ F82 . This S-box has a mathematically quite elegant description that
however assumes some advanced knowledge of abstract algebra, hence it is
omitted here. The linear potential of the S-box is 64
1
.
• The diffusion step consists of a permutation followed by a linear map. This
step is slightly more complex than for a pure SP network as in Figure 9.2.

Figure 9.7 Structure of AES in the large.

9. - Using CTO (cryptool-online.org) in the browser, AES can be seen in 2 plugins: as animation and via
“AES (step-by-step).”
- Using CT1 Indiv. Procedures F Visualization of Algorithms F AES you can find 3 visualizations
for this cipher.
- Using the search string AES in CT2 Startcenter F Templates you can find a plugin performing AES
step-by-step.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 426 — #34

i i

426 Foundations of Modern Symmetric Encryption

Figure 9.8 The round function f of AES.

9.2.8 Outlook on Block Ciphers

As we saw, linear cryptanalysis provides some evidence for the security of a cipher,
in particular for choosing the number of rounds. But only some parts of the theory
have a mathematically satisfying basis. Most existing publications only give ad-
hoc analyses of concrete ciphers. For example, Matsui showed how for DES 243
known plaintexts reveal 14 key bits with high certainty, reducing the exhaustion
to the remaining 42 = 56 − 14 key bits, a feasible task (at least if the analyst gets
that many plaintexts). The treatment of linear cryptanalysis serves as an example
of similar analyses.
Differential cryptanalysis as well as generalized and mixed variants follow sim-
ilar lines of thought. For more information see [17] which also explicitly specifies
the most important block ciphers DES and AES.
With the work of Gohr [18] an interesting research approach for cryptanalysis
was brought to life in 2019: differential-neural cryptanalysis as the machine-
learning assisted differential cryptanalysis of block ciphers. The idea is to distin-
guish ciphertext pairs that belong to a fixed plaintext difference from random ones.
For this, neuronal networks were employed, leading to an 11-round key recov-
ery attack on Speck32/64 which is at least competitive with classical differential
cryptanalysis.
A lot of follow-up works have been published since. In [19], Gohr et al. exam-
ined different differential-neural distinguishers for a wide variety of ciphers and
additionally provided general insights.
A very comprehensive report [20] with 200 pages, written by Leander et al. was
published by the BSI in 2022. In the report, the applicability of machine-learning
techniques was examined to the symmetric ciphers Simon and Speck, and the most

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 427 — #35

i i

9.3 Stream Ciphers 427

promising ones were applied to the ciphers Present, Katan, ChaCha, and Skinny.
For this purpose, round-reduced versions of these ciphers as well as versions with
smaller block length were used.

9.3 Stream Ciphers

A stream cipher sequentially encrypts each single bit of a bitstring by an individual

rule. The two possibilities are leave the bit unchanged or negate it. Leaving the bit
unchanged is equivalent with (binary) adding 0; negating the bit is equivalent with
adding 1. Thus, every stream cipher may be interpreted as an XOR encryption. We
distinguish between

• Synchronous stream ciphers: where the key stream is generated indepen-

dently of the plaintext;
• Asynchronous stream ciphers: where the key stream depends on the
plaintext or other context parameters.

In this chapter we only treat synchronous stream ciphers.

9.3.1 XOR Encryption

The basic method of stream encryption is simply denoted by XOR. It interprets
plaintexts10 as sequences of bits. Also, the key is a bit sequence, called key stream.
The encryption algorithm adds the current bit of the plaintext and the current bit
of the key stream by XOR. Figure 9.9 illustrates the algorithm,11 and Figure 9.10
shows an example.
In the 1920s, XOR ciphers were invented to encrypt teleprinter messages.
These messages were written on five-hole punched tapes as in Figure 9.11. Another
punched tape provided the key stream. Gilbert Vernam filed his U.S. patent in
1918. He used a key tape whose ends were glued together, resulting in a periodic
key stream. Joseph Mauborgne immediately recognized that a nonperiodic key is
obligatory for security.

Figure 9.9 The principle of XOR encryption.

10. The default SageMath method ascii_to_bin() from the module sage.crypto.util converts ordinary
texts to bitstrings. The inverse method is bin_to_ascii(). However, these bitstrings belong to the class
StringMonoidElement, and are cumbersome to process further. Therefore, corresponding own functions
are defined in the file bitciphers.sage.
11. In CT2 Templates F Cryptography F Classical F XOR Cipher you can try this directly.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 428 — #36

i i

428 Foundations of Modern Symmetric Encryption

Figure 9.10 Example of XOR encryption.

Figure 9.11 Punched tape—each column represents a five-bit character.

In its strongest form, the one-time pad (OTP), XOR encryption is an example
for perfect security in the sense of Shannon. As algorithm A5 or E0 , XOR is used
to secure mobile phones or the Bluetooth protocol for wireless data transmission.
As RC4 it was part of the SSL protocol that encrypts client-server communication
between browser and web server, and of the PKZIP compression software. There
are many other current applications, not all of them fulfilling the expected security
requirements.
The scope of XOR encryption ranges from simple ciphers that are trivially
broken to unbreakable ciphers.

Advantages of XOR Ciphers

• Encryption and decryption are done by the same algorithm, since ci = ai + ki

also ai = ci + ki . Thus, decryption also consists of adding key stream and
ciphertext (elementwise binary).
• The method is extremely simple to understand and to implement.
• It is very fast—provided that the key stream is available. For high transfer
rates one may precompute the key stream at both ends of the line.
• If the key stream is properly chosen, the security is high.

Disadvantages of XOR Ciphers

• XOR ciphers are vulnerable for known-plaintext attacks, each correctly

guessed plaintext bit reveals a key bit.
• If the attacker knows a piece of plaintext she also knows the corresponding
piece of the key stream, and then is able to exchange this plaintext at will.
For example, she might replace “I love you” with “I hate you,” or replace an
amount of $1,000 with $9,999. In other words, the integrity of the message
is poorly protected, so the sender has to implement additional procedures.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 429 — #37

i i

9.3 Stream Ciphers 429

• XOR ciphers provide no diffusion in the sense of Shannon’s criteria since

each plaintext bit affects the one corresponding plaintext bit only. In the
opposite, block ciphers were designed for using diffusion.
• Each reuse of a part of the key sequence (also in form of a periodic repetition)
opens the door for an attack. The historical successes in breaking stream
ciphers almost always used this effect, for example the attacks on encrypted
teleprinters in World War II, or the project Venona during the Cold War.

A remark on the first disadvantage, the vulnerability for attacks with known
plaintext: The common ISO character set for texts has a systematic weakness. The
8-bit codes of the lower-case letters a…z all start with 011, of the upper-case letters
A…Z, with 010. A supposed sequence of six lower-case letters (no matter which)
reveals 6 · 3 = 18 key bits. The occurrence of many zeros in the leading bits of the
bytes is an important recognition feature for natural texts in European languages.
In other words we cannot prevent the attacker from getting or guessing a good
portion of the plaintext. Thus, the security against an attack with known plaintext
is a fundamental requirement for an XOR cipher, even more than for any other
cryptographic procedure.

9.3.2 Generating the Key Stream

The main naive methods for generating the key stream are:

• Periodic bit sequence,

• Running-text,
• True random sequence.

A better method uses a

• Pseudorandom sequence

and leads to really useful procedures. The essential criterion is the quality of the
pseudorandom generator.

9.3.2.1 Periodic Bit Sequences

Example: We generate a key stream of period 8 by repeating k = 10010110. Each letter
is represented by bytes in the ISO character set. The plaintext here is in German.

D | u | | b | i | s |
a: 01000100|01110101|00100000|01100010|01101001|01110011|
k: 10010110|10010110|10010110|10010110|10010110|10010110|
-------- -------- -------- -------- -------- --------
c: 11010010|11100011|10110110|11110100|11111111|11100101|

t | | d | o | o | f
01110100|00100000|01100100|01101111|01101111|01100110
10010110|10010110|10010110|10010110|10010110|10010110
-------- -------- -------- -------- -------- --------
11100010|10110110|11110010|11111001|11111001|11110000

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 430 — #38

i i

430 Foundations of Modern Symmetric Encryption

This encryption is easily done by hand, or by SageMath Example 9.4.

Reencoding the ciphertext bytes in the ISO-8859-1 character set the cryptogram
looks like this:

Òã¶ôœåâ¶òùùð

This might bedazzle laypersons. An expert immediately notes that all characters
are from the upper half of the possible 256 bytes. This observation suggests that
the plaintext is in natural language, encrypted with a key whose leading bit is 1. If
the attacker guesses that the conspicuous character ¶ = 10110110 corresponds to
the space character 00100000, she derives the key as the difference 10010110. This
breaks the cryptogram.

Known or probable plaintext easily breaks periodic XOR encryption.

SageMath Example 9.4: XOR Encryption in Python/SageMath

print ("\n# CHAP09 -- Sage -Script -SAMPLE 130: =========")

load ("./ bitciphers.sage ") # for txt2bbl() + bbl2str()

plaintext = "Du bist doof"

bintext = txt2bbl(plaintext) # array
binstr = bbl2str(bintext)
print(binstr)
testkey = [1,0,0,1,0,1,1,0]
keystr = bbl2str(testkey)
print ("key:", keystr)
ciphertext = xor(bintext ,testkey)
ciphstr = bbl2str(ciphertext)
print(ciphstr)

#------------------------------------
# CHAP09 -- Sage -Script -SAMPLE 130: =========
# 010001000111010100100000011000100110100101110011
# 011101000010000001100100011011110110111101100110
# key: 10010110
# 110100101110001110110110111101001111111111100101
# 111000101011011011110010111110011111100111110000

9.3.2.2 MS Word and Periodic XOR

The following table (generated ad hoc by simple character counts) shows the
frequencies of the most frequent bytes in MS Word files.

Byte (hexadecimal) Bits Frequency

00 00000000 7–70%
01 00000001 0.8–17%
20 (space) 00100000 0.8–12%
65 (e) 01100101 1–10%
FF 11111111 1–10%

Note that these frequencies relate to the binary files, heavily depend on the type of
the document, and may change with every software version. The variation is large,

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 431 — #39

i i

9.3 Stream Ciphers 431

we often find unexpected peaks, and all bytes 00–FF occur. But all this doesn’t
matter here since we observe long chains of 00 bytes.
For a Microsoft Word file that is XOR encrypted with a periodically repeated
key, the ubiquity of zeros suggests an efficient attack. First, determine the length of
the key. If the length of the period is unknown, you can determine it by the methods
for periodic polyalphabetic substitutions from classical cryptanalysis named after
Kasiski, Friedman, or Sinkov. Or simply try all possible lengths. Then split the
stream of ciphertext bits into blocks corresponding to the length of the period and
add the blocks pairwise. If one of the plaintext blocks essentially consists of zeros,
then the sum is readable plaintext. Why? Consider the situation

... Block 1 ... Block 2 ...

Plaintext: ... a1 . . . as ... 0 ... 0 ...
Key: ... k1 . . . ks ... k1 . . . ks ...
Ciphertext: ... c1 . . . cs ... c10 . . . cs0 ...

where ci = ai + ki and ci0 = 0 + ki = ki for i = 1, . . . , s. Thus, the key reveals itself

in block 2, however the attacker doesn’t recognize this yet. But tentatively, pairwise
adding all blocks she gets (among other things)

ci + ci0 = ai + ki + ki = ai for i = 1, . . . , s,

that is, a plaintext block. If she realizes this (for example recognizing typical
structures), then she sees the key k1 , . . . , ks .
Should it happen that the sum of two ciphertext blocks is zero then the cipher-
text blocks are equal, and so are the corresponding plaintext blocks. The probability
that both of them are zero is high. Thus, the key could immediately show through.
To summarize: XOR encryption with a periodic key stream is quite easily broken
for messages with a known structure.
This is true also for a large period, say 512 bytes = 4096 bits, in spite of the
hyperastronomically huge key space of 24096 different possible keys.

9.3.2.3 Running-Text Encryption

A classical approach to generating an aperiodic key is taking a data stream, file,
or text, that has at least the length of the plaintext. In classical cryptography this
method was called running-text encryption or book ciphers, and the keys were
taken from books beginning at a certain position. The main method of breaking the
cipher was finding or guessing the book. The same weakness affects the electronic
analog that uses a file, a CD, or a DVD. As soon as the attacker knows the source of
the key bits, the key space is much too small—exhausting a file of several gigabytes
is easily done, the costs are linear in the size of the file.
But even when the attacker is unable to guess the source of the key bits
ciphertext-only, cryptanalysis is possible: Plaintexts as well as keys contain struc-
tures that are not completely concealed by binary addition. We won’t discuss this

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 432 — #40

i i

432 Foundations of Modern Symmetric Encryption

here12 but summarize: XOR encryption with running-text keys is fairly easily
broken.

9.3.2.4 True Random Sequence

The extreme choice for a key is a true random sequence of bits as key stream. Then
the cipher is called (binary) one-time pad. In particular no part of the key stream
must be repeated at any time. The notation pad comes from the idea of a tear-off
calendar—each sheet is destroyed after use. This cipher is unbreakable, or perfectly
secure. Shannon gave a formal proof of this in [17].
Without mathematical formalism the argument is as follows: The ciphertext
divulges no information about the plaintext (except the length). It could result from
any plaintext of the same length: simply take the (binary) difference of ciphertext
and alleged plaintext as key. Consider the ciphertext c = a + k with plaintext a
and key k, all represented by bitstreams and added bit by bit as in Figure 9.9. For
an arbitrary different plaintext b the formula c = b + k 0 likewise shows a valid
encryption using k 0 = b + c as key.
This property of the OTP could be used in a scenario of forced decryption
(rubber hose cryptanalysis) to produce an innocuous plaintext, as exemplified in
Figure 9.12 or visualized in Figure 1.6.
If the one-time pad is perfect—why not use it in any case?

• The key management is unwieldy: Key agreement becomes a severe problem

since the key is as long as the plaintext and awkward to memorize. Thus,
the communication partners have to agree on the key stream (prior to trans-
mitting the message) and store it. Agreeing on a key only just in time needs
a secure communication channel—but if there was one why not use it to
transmit the plaintext in clear?
• The key management is inappropriate for mass application or multiparty
communication because of its complexity that grows with each additional
participant.
• The problem of message integrity requires an extended solution for OTP like
for any XOR cipher.

There is another, practical, problem when encrypting on a computer: how to

get random sequences. True random bits arise from physical events like radioac-
tive decay or thermal noise on an optical sensor. The apparently deterministic
machine computer can also generate true random bits, for instance by special chips
that produce usable noise. Moreover, many events are unpredictable, such as the
exact mouse movements of the user, or arriving network packets that, although not
completely random, contain random ingredients that may be extracted. On Unix
systems these are provided by /dev/random.
However these random bits, no matter how true, are not that useful for encryp-
tion by OTP. The problem is on the side of the receiver that cannot reproduce the
key. Thus, the key stream must be transmitted independently.

12. In JCT Default Perspective F Analysis F Viterbi Analysis an automatic recognition of the two
plaintexts is offered, where only the running-key ciphertext is needed.

i i

9.3.3 Pseudorandom Generators

Pseudorandom generators mimic true random processes by deterministic algo-
rithms. Usually such an algorithm is called random generator, omitting the prefix
“pseudo” if there is no danger of confusion. The main difference between a
pseudorandom sequence and a true random sequence is its reproducibility.
Cipher designers hope to approximate the ideal properties of the one-time pad
by using a pseudorandom bit sequence as key stream, treating the short start string
as “effective key.” Even for random generators of modest quality the resulting
ciphertexts are immune against statistical analyses. The all-dominant problem is
the security against known-plaintext attacks.
Thus, the critical question for a pseudorandom sequence and for a random
generator is:

Given a known chunk (maybe fragmented) of the sequence, is there a way

to determine some more bits of the sequence, be it forwards or backwards?

For classical random generators that are popular in statistical applications and
simulations the answer is yes (see Section 9.3.4). But we’ll learn about random gen-
erators that (presumably) are cryptographically secure. The cipher designer faces
the problem of finding a good trade-off between efficiency and security.
The two main serious methods of generating pseudorandom bit sequences, or
key streams, are:

• (Feedback) shift registers (FSR) and combinations thereof, with theoretical

foundations in Boolean algebra;
• Perfect random generators with theoretical foundations in number theory.

Figure 9.13 shows the schematic functionality of a random generator. It hides

an inner state that changes with each step by a given algorithm. This algorithm is
controlled by parameters, some of which are public, but some of which are secret

Boolean Sample Function f
a ∈ F42 f (a ) ∈ F2
0000 1
0001 0
0010 0
0011 0
0100 0
(3) 0101 1
0110 0
(1) 0111 1
1000 1
1001 1
(4) 1010 0
(2) 1011 0
1100 1
1101 0
1110 0
1111 1

Table 9.9 The Stepping of a Sample FSR

Feedback State Output
start 0111 −→ 1
f (0111) = 1 −→ 1011 −→ 1
f (1011) = 0 −→ 0101 −→ 1
f (0101) = 1 −→ 1010 −→ 0
f (1010) = 0 −→ 0101 from here on periodic

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 437 — #45

i i

9.3 Stream Ciphers 437

In this respect the realization in hardware differs from a software implemen-

tation: Hardware allows using an adjustable feedback function only by complex
additional circuits. Thus, in this case we usually assume an unchangeable feedback
function, and (at least in the long run) we cannot prevent the attacker from figuring
it out. In contrast, a software implementation allows a comfortable change of the
feedback function at any time such that it may serve as part of the key.

SageMath Example 9.5: A Feedback Shift Register in Python/SageMath

def fsr(self ,u,n):
""" Generate a feedback shift register sequence
using the actual Boolean function (repres. by self).
Parameters: start vector u, number n of output bits.
Caution: The vector u is modified by this function ."""
outlist = []
for i in range (0,n):
b = self.valueAt(u)
c = u.pop()
u.insert(0,b)
outlist.append(c)
return outlist

SageMath Example 9.6: A (Very Poor) Pseudorandom Sequence in

Python/SageMath
print ("\n# CHAP09 -- Sage -Script -SAMPLE 140: =========")

load ("./ bitciphers.sage ") # for txt2bbl() and classes BoolF + LFSR

print ("\n---------- Using f.fsr(): ----------")

bits = "1000010111001001" # truth table as bitstring
x = str2bbl(bits) # truth table as list (= bitblock)
print (" Input x (same as truth table of Bool function f):"); print(x)
f = BoolF(x)
# print (" Truth table of Bool function f:")
# print(f.getTT ()) # return truth table as bitlist
start = [0,1,1,1] # initial state of the register
print (" Input start :"); print(start)
bitlist = f.fsr(start , 32) # output sequence
print ("f.fsr() output :")
# print(bitlist) # array
print(bbl2str(bitlist))
print ("New state of the register :", start)

print ("\n---------- Using LFSR (): ----------")

# Sage Example 1.4 from https :// www.staff.uni -mainz.de/pommeren/Cryptology/Bitstream/Bitstream. �
� pdf
coeff = [0,1,1,0,1,0,0,0,0,0,0,0,0,0,0,1]
print (" Input coeff :"); print(coeff)
reg = LFSR(coeff)
## start = [0,1,1,1] # AssertionError: LFSR_Error: Bitblock has wrong length (sl=4; __length=1 �
� 6).
start = [0,1,1,0,1,0,1,1,0,0,0,1,0,0,1,1] # original
print (" Input start :"); print(start)
reg.setState(start)
bitlist = reg.nextBits(20)
print (" LFSR () output :")
print(bbl2str(bitlist))

print ("\n-- Using f.fsr() with coeff from LFSR () as input x: --")
x = coeff
print (" Input x:"); print(x)
f = BoolF(x)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 438 — #46

i i

438 Foundations of Modern Symmetric Encryption

SageMath Example 9.6 (continued)

start = [0,1,1,1]
## start = [0,1,1,0,1,0,1,1,0,0,0,1,0,0,1,1] # AssertionError: boolF_Error: Block has false �
� length (ll=16; __dim=4).
print (" Input start :"); print(start)
bitlist = f.fsr(start , 32) # output sequence
print ("f.fsr() output :")
print(bbl2str(bitlist))
print ("New state of the register :", start)

9.3.3.2 The Period of a Finite-State Machine

In computer science a feedback shift register is a special case of a finite-state
machine. Therefore, the sequence of its states is periodic. Here is why.
Let M be a finite set of m = #M elements. Imagine M as the collection of all
possible states of a machine. Consider a map (transition)

g : M −→ M.

For each element (initial state) x0 ∈ M define a sequence (xi )i≥0 in M by the recur-
sive formula xi = g (xi−1 ) for i ≥ 1. After a previous period (preperiod) of length
µ the sequence runs into a period of length ν (see Figure 9.15).
Since the set M is finite the states must eventually repeat. Thus, there are small-
est integers µ ≥ 0 and ν ≥ 1 such that xµ+ν = xµ . To see this, simply take µ as the
first index such that the element xµ reappears in the sequence at another position,
and µ + ν as the first index where this repetition occurs. Then also (by induction)

xi +ν = g (xi +ν−1 ) = g (xi−1 ) = xi for i > µ.

Here 0 ≤ µ ≤ m − 1, 1 ≤ ν ≤ m, µ + ν ≤ m. The values x0 , . . . , xµ+ν−1 are all

different, and the values x0 , . . . , xµ−1 never reappear in the sequence.

Definition: µ is called the (length of the) preperiod, ν, the (length of the) period.

A pseudorandom generator in the sense of Figure 9.13 inevitably generates

periodic sequences. The best we can hope for is a period so huge that the
practical application never exhausts its size.

9.3.3.3 Linear Shift Registers

The simplest and best understood instances of FSRs are the linear feedback shift
registers (LFSR). Their feedback functions f are linear. From Section 9.1.9 we know

Figure 9.15 Period and preperiod.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 439 — #47

i i

9.3 Stream Ciphers 439

that a linear function is simply a partial sum from an l-bit block:

l
X
f (u n−1 , . . . , u n−l ) = s j u n− j , (9.6)
j =1

where the coefficients s j are 0 or 1. If I is the subset of indices j with s j = 1, then

the iteration formula (9.5) takes the form
X
un = u n− j . (9.7)
j∈I

A simple graphical representation of an LFSR is shown in Figure 9.16. Here, the

subset I defines the contacts (taps) that feed the respective cells into the feedback
sum.
For a good choice of the parameters (that we won’t discuss further) the sequence
has a period of about 2l , the number of possible different states of the register, and
statistical tests are hardly able to distinguish it from a uniformly distributed true
random sequence [21]. It is remarkable that such a simple approach generates pseu-
dorandom sequences of fairly high quality. Of course the initial state u = (0, . . . , 0)
is inappropriate. For an initial state 6= 0 the maximum possible period is 2l − 1.
Obtaining this period is easy. The necessary and sufficient condition is that the
feedback polynomial 1 + s1 x + s2 x 2 + · · · + sl x l is primitive as polynomial over
the field F2 . Caution: Don’t confuse the feedback polynomial and the feedback
function, the former being a (formal) polynomial in a single variable, the latter a
Boolean linear form in l variables.
Using an LFSR for stream encryption the secret inner parameters, the coeffi-
cients s1 , . . . , sl , as well as the initial state u 0 , . . . , u l−1 , together constitute the key.
In contrast, the length l of the register is assumed as known to the attacker.
SageMath Example 9.7 implements an LFSR as function lfsr()1314 ; the
output is a pseudorandom bitstream.
The class LFSR in the file bitciphers.sage also implements a linear feedback
shift register. A sample call of this function for an LFSR of length 16, generating
1024 bits, is in SageMath Example 9.8. First the register reg is instantiated as an

Figure 9.16 Simple graphical representation of an LFSR.

13. Within, it uses the function binScPr() from the file bitciphers.sage, which defines the “scalar product”
of two binary vectors.
14. For a more systematic approach, a class LFSR is defined in the file bitciphers.sage. Alternatively, we can
reproduce the result of lfsr() by using fsr(). To this end we represent the linear feedback function by its
truth table (or by its ANF), and then instantiate a Boolean function of the class BoolF. Instead of n bits (that
define the taps) we need 2n bits for the truth table. This is quite uneconomical. Thus, the separate definition
of lfsr() makes sense. Another alternative is the default function sage.crypto.lfsr.lfsr_sequence of
SageMath.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 440 — #48

i i

440 Foundations of Modern Symmetric Encryption

object of the class LFSR, the taps being defined by the bitblock coeff. Then we
set the initial state of the register as the bitblock start, and generate 1024 bits of
output. Figure 9.17 shows this output (printed without parantheses or delimiters).
We could apply a series of statistical tests to this bitstream, for example tests of
uniform distribution, and would always see good results. Instead, we visualize the
sequence in Figure 9.18 for optical inspection—of course an even more insufficient
proof. However, the superficial impression shows a quite random sequence. The
function visualize in SageMath Example 9.8 generated this picture.
Don’t take offense at the sequence of nine (black) ones in the third to last row;
the probability of nine ones in nine random bits is (1/2)9 = 1/512. Therefore, in a
random bitstream of length 1024 a run of this kind occurs with high probability.

Neither the usual statistical tests nor the visual impression are valid
testimonials of the quality of a pseudorandom sequence.

As we’ll see, the random properties of LFSR sequences are poor. Cryptanalysis
detects deficiencies that evade standard statistical tests.

SageMath Example 9.7: Defining an LFSR in Python/SageMath

def lfsr(s,x,n):
""" Generate a linear feedback shift register sequence.
Parameters: Coefficient vector s, start vector x, number n of output �
� bits ."""

Figure 9.17 A pseudorandom bit sequence from an LFSR.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 441 — #49

i i

9.3 Stream Ciphers 441

Figure 9.18 Visualization of the pseudorandom bit sequence from Figure 9.17, generated by Sage-
Math Example 9.8 (1 = black, 0 = white).

SageMath Example 9.7 (continued)

l = len(s)
assert l == len(x), "lfsr_Error: Bad length of start vector ."
u = x # in Python use u = x.copy ()
outlist = []
for i in range (0,n):
b = binScPr(s, u)
c = u.pop()
u.insert(0,b)
outlist.append(c)
return outlist

SageMath Example 9.8: A Pseudorandom Bit Sequence in Python/SageMath

print ("\n# CHAP09 -- Sage -Script -SAMPLE 150: =========")

load ("./ bitciphers.sage ") # for bbl2str() and class LFSR

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 442 — #50

i i

442 Foundations of Modern Symmetric Encryption

SageMath Example 9.8 (continued)

print ("\n---------- Using LFSR (): ----------")
coeff = [0,1,1,0,1,0,0,0,0,0,0,0,0,0,0,1] # define the taps of the LFSR
print (" Input coeff :"); print(coeff)
reg = LFSR(coeff) # initialize the register
start = [0,1,1,0,1,0,1,1,0,0,0,1,0,0,1,1] # seed the register
print (" Input start :"); print(start)
reg.setState(start)
bitlist = reg.nextBits(1024) # ... and make it work
print (" LFSR () output :")
print(bbl2str(bitlist))

print ("\n---------- Visualize bit sequence: ----------")

def visualize(lst ,w,h):
""" Arrange the list lst as a rectangle of width w and height h, row
by row , as far as lst has entries , and discarding excess entries.
Then plot a 'checkerboard ' showing a white square for zero entries ,
and a black square otherwise. If lst has less than w*h entries ,
the remaining squares are left gray ."""

l = len(lst)
p = polygon ([(0,0), (w,0), (w,h), (0,h)], color = "grey ")
for j in range(h+1):
p += line ([(0,j) ,(w,j)], color =" black ")
for i in range(w+1):
p += line ([(i,0) ,(i,h)], color =" black ")
for j in range(h):
for i in range(w):
if (j*w + i < l):
if (lst[j*w + i] == 0):
p += polygon ([(i,h-1-j) ,(i+1,h-1-j) ,(i+1,h-j) ,(i,h-j)], color = "white ")
else:
p += polygon ([(i,h-1-j) ,(i+1,h-1-j) ,(i+1,h-j) ,(i,h-j)], color = "black ")
p.axes(False)
return p

p2 = visualize(bitlist ,32,32)
# show(p2) # works in Jupyter notebook
# Display(p2, "LFSRsquare ") # works in script called from CLI if Display () is defined
p2.save (" LFSRseq.png")
print (" Graphic saved to LFSRseq.png")

Figure 9.19 shows how LFSR is implemented in CT2. On the right of the
screenshot you can see the settings for the LFSR component.
LFSRs have been implemented many times. A very nice implementation is avail-
able in the Python package “pylfsr” [22]. Section 9.3.3.3 shows an example with
pylfsr. This code also contains the conversions between the input at CT2 and at
pylfsr—because there is no generally accepted standard in which order the bits
of the seed have to be entered and how to enter the coefficients of the feedback
polynomial.

Python Example 9.1: LFSR with the pylfsr Package in Python

print ("\n# CHAP09 -- Python -Script -SAMPLE 153: =========")
print("---------- Apply LFSR using Python package pyLFSR ----------")
# Output sequence by pyLFSR using state = [0,0,1,1,1,0,0,1] and fpoly = �
� [8,6,4,3]
# - script creates the same result as CT2, which needs fpoly and state �
� just in reversed order.
# - see https :// lfsr.readthedocs.io/en/latest/dispViz.html

from pylfsr import LFSR

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 443 — #51

i i

9.3 Stream Ciphers 443

Figure 9.19 Sample LFSR x 8 + x 6 + x 4 + x 3 + 1 in CT2.

Python Example 9.1 (continued)

# Give bitsequence for seed as in CT2; reverse it and offer it as list �

� for pyLFSR .......
seed = '10011100' # seed here is a string. This bit sequence is given �
� in CT2 as state.
width = len(seed) # length of register
num = int(seed , 2) # you also could enter it directly as 0b10011100 or �
� 15 or 0x15 or 0 xAAAA

output = [int(x) for x in '{:0{size}b}'. format(num , size=width)] # �

� width for potentially leeding bits
print('1a: given seed =', output)
output.reverse () # reverse , as pyLFSR needs state in reverse order as �
� CT2
print('1b: reversed seed =', output)
state = output # state = seed = initial value
statecheck = [0,0,1,1,1,0,0,1]; assert state == statecheck

# Give bitsequence for tap sequence as in CT2; create list of �

� polynomial orders for pyLFSR .......
fpoly = '00110101' # tap sequence
num = int(fpoly , 2)
output = [int(x) for x in '{:0{size}b}'. format(num , size=width)]
print('2a: fpoly = ', output)
fpoly = [i+1 for i, val in enumerate(output) if val]
fpoly.reverse ()
print ("2b: fpoly = ",fpoly)
fpolycheck = [8,6,4,3]; assert fpoly == fpolycheck

# Calculate the binary LFSR output sequence

L = LFSR(initstate=state ,fpoly=fpoly ,counter_start_zero=True)
print('-'*50)
print('count \tstate \t\t\t\toutbit \tseq ')
print('-'*50)
for _ in range(15):

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 444 — #52

i i

444 Foundations of Modern Symmetric Encryption

Python Example 9.1 (continued)

print(L.count ,L.state ,'',L.outbit ,L.seq ,sep='\t')

L.next ()
print('-'*50)
print('Output: ',L.seq)

9.3.4 Algebraic Attack on LFSRs

Even simple random generators such as LFSRs produce bit sequences that are vir-
tually indistinguishable from true random sequences by statistical methods, and
so provide no hooks for statistical methods of cryptanalysis. This is not true for
attacks with known plaintext. The resulting equations for the key bits are accessi-
ble for algebraic cryptanalysis. If the key stream originates from a known source,
trying to solve these equations promises success. In particular this holds for LFSRs.
Consider a key bitstream u 0 , u 1 , . . . generated by an LFSR by formulas (9.6)
or (9.7). Assume a plaintext a is XOR encrypted using this key stream, resulting in
the ciphertext c, where ci = ai + u i for i = 0, 1, . . . What are the prospects of an
attacker who knows a chunk of the plaintext?
Assume she knows the first l + 1 bits of the plaintext. She immediately derives
the corresponding bits u 0 , . . . , u l of the key stream, in particular the initial state of
the LFSR. For the yet unknown coefficients si she knows a linear relation:

s1 u l−1 + · · · + sl u 0 = u l .

Each additional known plaintext bit yields one more relation, and having l
relations, from 2l bits of known plaintext, the easy linear algebra over the field F2
(in nondegenerate cases) finds a unique solution. The l · l coefficient matrix of this
linear equation system is essentially the matrix U of Section 9.3.4.1.
Theorem 9.12 An LFSR of length l is completely predictable from the first 2l bits
for the cost of about 13 · l 3 bit operations.

9.3.4.1 Prediction of LFSRs

Assume we know the first 2l bits u 0 , . . . , u 2l−1 from an LFSR of length l. For an
elegant formulation of the linear algebra methods we introduce the state vectors

u (i ) = (u i , . . . , u i +l−1 ) for i = 0, 1, . . .

The vector u (i ) is the register content for step i (in reversed order compared with
Figure 9.14). Thus, the analysis focuses on the states, not directly on the output.
The recursion formula (9.6) in matrix form (for n ≥ l) is
    
u n−l +1 0 1 ... 0 u n−l
 .  . .. . 
 ..   .. ..
. ..   .. 

 = .  . 
 u n−1   0 0 . . . 1  u n−2 
    
un sl sl−1 . . . s1 u n−1

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 445 — #53

i i

9.3 Stream Ciphers 445

or more parsimoniously (the indices being substituted by m = n − l + 1)

u (m ) = S · u (m−1) for m ≥ 1

where S is the coefficient matrix. As a further step we collect l consecutive state

vectors u (i ) , . . . , u (i +l−1) in a state matrix
 
ui u i +1 . . . u i +l−1
 u i +1 u i +2 . . . u i +l 
 
U(i ) =  . .. .. .. 
.

 . . . . 

u i +l−1 u i +l . . . u 2l−2

and set U = U(0) , V = U(1) . This gives the formula

V = S·U

that expresses the unknown coefficients s1 , . . . , sl by the known plaintext bits

u 0 , . . . , u 2l−1 . Most notably it allows us to write down the solution immediately,
provided that the matrix U is invertible:

S = V · U −1 .

The matrix S explicitly displays the coefficients s1 , . . . , sl . We’ll discuss the invert-
ibility later on.

Example
Assume we are given a ciphertext:
10011100 10100100 01010110 10100110 01011101 10101110
01100101 10000000 00111011 10000010 11011001 11010111
00110010 11111110 01010011 10000010 10101100 00010010
11000110 01010101 00001011 11010011 01111011 10110000
10011111 00100100 00001111 01010011 11111101
We suspect that the cipher is XOR with a key stream from an LFSR of length l = 16.
The context suggests that the text is in German and begins with the word “Treff-
punkt” (meeting point). To solve the cryptogram we need 32 bits of plaintext, that
is the first four letters only, presupposed that the theory applies. This gives 32 bits
of the key stream:
01010100 01110010 01100101 01100110 = T r e f
10011100 10100100 01010110 10100110 cipher bits
-------- -------- -------- --------
11001000 11010110 00110011 11000000 key bits
SageMath Example 9.9 determines the coefficient matrix. Its last row tells us that
all si = 0 except s16 = s5 = s3 = s2 = 1.
Now we know the LFSR and the initial state, and can reconstruct the complete
key stream—yes, it is the same as in Figure 9.17—and write down the plaintext
(that by the way begins a bit differently from our guess).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 446 — #54

i i

446 Foundations of Modern Symmetric Encryption

SageMath Example 9.9: Determining a Coefficient Matrix

print ("\n# CHAP09 -- Sage -Script -SAMPLE 160: =========")

l = 16
kbits = [1,1,0,0,1,0,0,0,1,1,0,1,0,1,1,0,0,0,1,1,0,0,1,1,1,1,0,0,0,0,0, �
� 0]
ulist = []
for i in range(0,l):
state = kbits[i:(l+i)]
ulist.append(state)
U = matrix(GF(2),ulist)
print ("det(U) =", det(U))
W = U.inverse ()
vlist = []
for i in range(1,l+1):
state = kbits[i:(l+i)]
vlist.append(state)
V = matrix(GF(2),vlist)
S = V*W
print(S)

#------------------------------------
# CHAP09 -- Sage -Script -SAMPLE 160: =========
# det(U) = 1
# [0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0]
# ...
# [1 0 0 0 0 0 0 0 0 0 0 1 0 1 1 0]

9.3.4.2 Proof of Theorem 9.12: How Many Bits Must be Known to Predict an LFSR
We showed that the coefficients are uniquely determined assuming the state matrix
U = U(0) is invertible. As a consequence in this case the LFSR is completely known,
and all output bits are predictable. We have yet to discuss the case where the matrix
U is singular.
If one of the first l state vectors (= rows of the matrix U ) is zero, then all
following state vectors are zero too, and prediction is trivial.
Thus, we may assume that none of these vectors are zero, but that they are
linearly dependent. Then there is a smallest index k ≥ 1 such that u (k ) is contained
in the subspace spanned by u (0) , . . . , u (k−1) , and we find coefficients t1 , . . . , tk ∈ F2
such that
u (k ) = t1 u (k−1) + · · · + tk u (0) .

Then also u (k +1) = S · u (k ) = t1 S · u (k−1) + · · · + tk S · u (0) = t1 u (k ) + · · · + tk u (1) ,

and by induction we get

u (n ) = t1 u (n−1) + · · · + tk u (n−k ) for all n ≥ k.

This formula predicts all the following bits.

The statement on the cost follows from Theorem 9.10.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 447 — #55

i i

9.3 Stream Ciphers 447

Discussion

• For a singular state matrix this consideration yields a shorter LFSR (of length
k < l) that generates exactly the same sequence. Then our method doesn’t
determine the coefficients of the original register but nevertheless correctly
predicts the sequence.
• If the bits the attacker knows aren’t just the first ones but 2l contiguous
ones at a later position, then the theorem yields only the prediction of the
following bits. In the main case of an invertible state matrix U the LFSR is
completely known and may be run backwards to get the previous bits. For
a singular state matrix we achieve the same effect using the shorter LFSR
constructed previously.
• The situation where 2l bits of the key stream are known but at noncontigu-
ous positions is slightly more involved. We get linear relations that contain
additional (unknown) intermediate bits. If m is the number of these then we
get l + m linear equations for l + m unknown bits.
• What if the length l of the LFSR is unknown? Exhaustively trying all values
l = 1, 2, 3, . . . is nasty but feasible. A better approach is provided by the
Berlekamp-Massey15 algorithm that is efficient also without knowledge of
l. We won’t treat it in this chapter.

Summary
Given a random generator as in Figure 9.13 cryptanalytic, targets are:

• Secret parameters
• Initial state
• Additional parts of the output (prediction problem)

given some parts of the output. As we saw for LFSRs the prediction problem has a
solution even when the internal parameters remain unknown. Thus:

Cryptanalysis of a random generator first of all means solving the prediction

problem. A random generator is cryptographically secure if its prediction
problem admits no efficient solution.

Linear feedback shift registers are not cryptographically secure.

9.3.5 Approaches to Nonlinearity for Feedback Shift Registers

LFSRs are popular (in particular among electrical engineers and military) for several
reasons:

• Very easy implementation;

• Extreme efficiency in hardware;

15. Berlekamp-Massey is contained in SageMath as sage.crypto.lfsr.berlekamp_massey. With CT2

Templates F Cryptanalysis F Generic F Berlekamp-Massey you can try this algorithm directly.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 448 — #56

i i

448 Foundations of Modern Symmetric Encryption

• Good qualification as random generators for statistical applications and

simulations;
• Unproblematic operation in parallel even in large quantities.

But unfortunately from a cryptological view they are completely insecure if used
naively. To capitalize their positive properties while escaping their cryptological
weakness there are several approaches.

Approach 1: Nonlinear Feedback

Nonlinear feedback follows the scheme from Figure 9.14 with a nonlinear Boolean
function f . We won’t pursue this approach here. We saw a very simple toy exam-
ple in SageMath Example 9.6. There is a general proof that in realistic use cases
NLFSRs (abbreviation for nonlinear feedback shift register) are cryptographically
useless if used in the direct naive way [23].
NLFSRs have often been used within stream ciphers, but free implementations
of the principle are rare. Figure 9.20 shows how a simple NLFSR is implemented
in CT2.16 On the right of the screenshot you can see the settings for the NLFSR
component.

Approach 2: Nonlinear Output Filter

The nonlinear output filter (nonlinear feedforward) realizes the scheme from
Figure 9.21. The shift register itself is linear, the Boolean function f , nonlinear.
The nonlinear output filter is a special case of a nonlinear combiner.

Approach 3: Nonlinear Combiner

The nonlinear combiner uses a battery of n LFSRs—preferably of different lengths—
operated in parallel. The output sequences of the LFSRs serve as input of a Boolean
function f : Fn2 −→ F2 , see Figure 9.22.

Figure 9.20 Simple NLFSR x 5 ∗ x 2 + 1 in CT2.

16. With CT2 Templates F Mathematics F LFSR or NLFSR you can try this directly.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 449 — #57

i i

9.3 Stream Ciphers 449

Figure 9.21 Nonlinear output filter for an LFSR.

Figure 9.22 Nonlinear combiner.

Approach 4: Output Selection/Decimation/Clocking

There are different ways of controlling a battery of n parallel LFSRs by another
LFSR:

• Output selection takes the current output bit of exactly one of the LFSRs
from the battery, depending on the state of the auxiliary register, and outputs
it as the next pseudorandom bit. More generally we could choose r from n.
• For decimation one usually takes n = 1 and outputs the current bit of the one
battery register only if the auxiliary register is in a certain state, for example
its own current output is 1. Of course this kind of decimation applies to
arbitrary bit sequences in an analogous way.
• For clocking we look at the state of the auxiliary register and, depending on
it, decide which of the battery registers to step in the current cycle (and by
how many positions), leaving the other registers in their current states. This
is reminiscent of the control logic of rotor machines in classical cryptography.

These methods turn out to be special cases of nonlinear combiners if properly

rewritten. Thus approach 3 represents the most important method of making the
best of LFSRs.
The encryption standard A5/1 for mobile communications uses three LFSRs of
lengths 19, 22, and 23, each with maximum possible period, and slightly differently

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 450 — #58

i i

450 Foundations of Modern Symmetric Encryption

clocked. It linearly (by simple binary addition) combines the three output streams.
The (even weaker) algorithm A5/2 controls the clocking by an auxiliary register.
Both variants can be broken on a standard PC in real-time.
The Bluetooth encryption standard E0 uses four LFSRs and combines them in
a nonlinear way. This method is somewhat stronger than A5, but also too weak for
real security [5].

Example: The Geffe Generator

The Geffe generator provides a simple example of output selection. Its description
is in Figure 9.23. The output is x, if z = 0, and y, if z = 1. Expressed by a formula:
(
x, if z = 0,
u =
y, if z = 1
= (1 − z )x + zy = x + zx + zy.
This formula shows how to interpret the Geffe generator as a nonlinear combiner
with a Boolean function f : F32 −→ F2 of degree 2. For later use we implement f in
SageMath Example 9.10.

SageMath Example 9.10: The Geffe Function

print ("\n# CHAP09 -- Sage -Script -SAMPLE 170: =========")

load ("./ bitciphers.sage ")

geff = BoolF(str2bbl("00011100"),method ="ANF")
geff.printTT ()

#------------------------------------
# CHAP09 -- Sage -Script -SAMPLE 170: =========
# Value at 000 is 0
# Value at 001 is 0
# Value at 010 is 0
# Value at 011 is 1
# Value at 100 is 1

Figure 9.23 Geffe generator.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 451 — #59

i i

9.3 Stream Ciphers 451

SageMath Example 9.10 (continued)

# Value at 101 is 0
# Value at 110 is 1
# Value at 111 is 1

9.3.6 Implementation of a Nonlinear Combiner with the Class LFSR

A nonlinear combiner uses several LFSRs, operated in parallel. This suggests an
implementation of LFSRs as objects of a class LFSR.

Class LFSR:
Attributes:
• length: The length of the register;
• taplist (constant): The list of coefficients (or taps) that define the bits for
feedback;
• state (variable): The state of the register.
Methods:
• setLength: Define the length (used only implicitly for initialization);
• setTaps: Define the list of taps (used only implicitly for initialization);
• setState: Set the state of the register;
• getLength: Output the length;
• nextBits: Generate a given number of output bits, and set the next state.

9.3.6.1 Example: Complete Geffe Generator

First we choose three LFSRs of lengths 15, 16, 17, whose periods are 215 − 1 =
32767, 216 − 1 = 65535, and 217 − 1 = 131071. These are pairwise coprime
(see SageMath Example 9.11). Combining their outputs (in each step) as bit-
blocks of length 3 yields a sequence with a period that has an impressive length
of 281459944554495, about 300 · 1012 (300 trillions for Americans are 300 billions
for Europeans).
SageMath Example 9.12 defines the three LFSRs. The recursive formula for
the third one, the control register reg17, is u n = u n−3 + u n−17 , since exactly the
taps 3 and 17 are active. We let each of the LFSRs generate a sequence of length
100 (see SageMath Example 9.13). The Geffe function combines them in SageMath
Example 9.14. This is the complete sample.

SageMath Example 9.11: Calculating a Period

print ("\n# CHAP09 -- Sage -Script -SAMPLE 180: =========")

n15 = 2**15 - 1; print ("n15 =", n15, "=", n15.factor ())

n16 = 2**16 - 1; print ("n16 =", n16, "=", n16.factor ())
n17 = 2**17 - 1; print ("n17 =", n17, "=", n17.factor ())

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 452 — #60

i i

452 Foundations of Modern Symmetric Encryption

SageMath Example 9.11 (continued)

print ("lcm =", lcm([n15,n16,n17]))

period = n15 * n16 * n17; print (" period =", period)

#------------------------------------
# CHAP09 -- Sage -Script -SAMPLE 180: =========
# n15 = 32767 = 7 * 31 * 151
# n16 = 65535 = 3 * 5 * 17 * 257
# n17 = 131071 = 131071
# lcm = 281459944554495
# period = 281459944554495

SageMath Example 9.12: Three LFSRs

print ("\n# CHAP09 -- Sage -Script -SAMPLE 190: =========")

load ("./ bitciphers.sage ")

reg15 = LFSR ([1,0,0,0,0,0,0,0,0,0,0,0,0,0,1])

reg15.setState ([0,1,1,0,1,0,1,1,0,0,0,1,0,0,1])
print(reg15)
reg16 = LFSR ([0,1,1,0,1,0,0,0,0,0,0,0,0,0,0,1])
reg16.setState ([0,1,1,0,1,0,1,1,0,0,0,1,0,0,1,1])
print(reg16)
reg17 = LFSR ([0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1])
reg17.setState ([0,1,1,0,1,0,1,1,0,0,0,1,0,0,1,1,1])
print(reg17)

SageMath Example 9.13: Three LFSR Sequences

print ("\n# CHAP09 -- Sage -Script -SAMPLE 200: =========")

load ("./ bitciphers.sage ")

reg15 = LFSR ([1,0,0,0,0,0,0,0,0,0,0,0,0,0,1])

reg15.setState ([0,1,1,0,1,0,1,1,0,0,0,1,0,0,1])

reg16 = LFSR ([0,1,1,0,1,0,0,0,0,0,0,0,0,0,0,1])

reg16.setState ([0,1,1,0,1,0,1,1,0,0,0,1,0,0,1,1])

reg17 = LFSR ([0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1])

reg17.setState ([0,1,1,0,1,0,1,1,0,0,0,1,0,0,1,1,1])

nofBits = 100
outlist15 = reg15.nextBits(nofBits)
print (" outlist15:\n", outlist15)
outlist16 = reg16.nextBits(nofBits)
print (" outlist16:\n", outlist16)
outlist17 = reg17.nextBits(nofBits)
print (" outlist17:\n", outlist17)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 453 — #61

i i

9.3 Stream Ciphers 453

SageMath Example 9.14: The Combined Sequence

print ("\n# CHAP09 -- Sage -Script -SAMPLE 210: =========")

load ("./ bitciphers.sage ")

reg15 = LFSR ([1,0,0,0,0,0,0,0,0,0,0,0,0,0,1])

reg15.setState ([0,1,1,0,1,0,1,1,0,0,0,1,0,0,1])

reg16 = LFSR ([0,1,1,0,1,0,0,0,0,0,0,0,0,0,0,1])

reg16.setState ([0,1,1,0,1,0,1,1,0,0,0,1,0,0,1,1])

reg17 = LFSR ([0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1])

reg17.setState ([0,1,1,0,1,0,1,1,0,0,0,1,0,0,1,1,1])

nofBits = 100
outlist15 = reg15.nextBits(nofBits)
outlist16 = reg16.nextBits(nofBits)
outlist17 = reg17.nextBits(nofBits)

geff = BoolF(str2bbl("00011100"),method ="ANF")

outlist = []
for i in range(0,nofBits):
x = [outlist15[i], outlist16[i], outlist17[i]]
outlist.append(geff.valueAt(x))
print(outlist)

#------------------------------------
# CHAP09 -- Sage -Script -SAMPLE 210: =========
# [1, 1, 0, 1, 0, 0, 0, 1, 1, 1, 0, 0, 0, 1, 1, 0, 1, 1, 0, 1,
# 0, 0, 1, 0, 0, 1, 0, 0, 1, 1, 0, 0, 0, 0, 1, 1, 0, 0, 1, 1,
# 1, 0, 1, 0, 1, 1, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 1, 1, 0,
# 1, 1, 1, 1, 0, 1, 0, 0, 1, 0, 0, 0, 1, 1, 0, 1, 0, 1, 0, 1,
# 0, 0, 1, 1, 0, 1, 0, 0, 1, 1, 0, 1, 1, 0, 0, 0, 1, 0, 0, 1]

9.3.7 Design Criteria for Nonlinear Combiners

From the forgoing discussion we derive design criteria for nonlinear combiners:

• The battery registers should be as long as possible.

• The combining function f should have a low linear potential.

How long should the battery registers be? There are some algorithms for fast cor-
relation attacks using the Walsh transformation, in particular against sparse linear
feedback functions (that use only a few taps) [24]. These don’t reduce the com-
plexity class of the attack (exponential in the length of the shortest register) but
reduce the cost by a significant factor. So they are able to attack registers whose
feedback functions have up to 100 monomials with coefficients in their ANF. As a
consequence

• The single LFSRs should have a length of at least 200 bits, and use about
100 taps each.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 454 — #62

i i

454 Foundations of Modern Symmetric Encryption

To assess the number n of LFSRs we bear in mind that the combining function
should be correlation immune, in particular have a low linear potential. A well-
chosen Boolean function of 16 variables should suffice. To us, there are no known
recommendations in the literature.
Rainer Rueppel found an elegant way out to make the correlation attack break
down: Use a time-dependent combining function, that is a family ( f t )t∈N . The bit u t
of the key stream is calculated by the function f t . We won’t analyze this approach
here.
Observing that the correlation attack needs knowledge of the taps, the security
could be somewhat better if the taps are secret. Then the attacker has to perform
additional exhaustions that multiply the complexity by factors such as 2l1 for the
first LFSR alone. This scenario allows choosing LFSRs of somewhat smaller lengths.
But bear in mind that for a hardware implementation the taps are parts of the
algorithm, not of the key, so they are public parameters in the sense of Figure 9.13.

Efficiency
LFSRs and nonlinear combiners allow efficient realizations by special hardware that
produces one bit per clock cycle. This rate can be enlarged by parallelization. From this
point of view, estimating the cost of execution on a usual PC processor is somewhat
inadequate. Splitting each of the ≥ 200 bit registers into 4 parts of about 64 bits
shifting a single register requires at least 4 clock cycles, summing up to 64 clock
cycles for 16 registers. Add some clock cycles for the combining function. Thus, one
single bit would take about 100 clock cycles. A 2-GHz processor, even with optimized
implementation, would produce at most 2 · 109 /100 = 20 million bits per second.
As a summary we note: Using LFSRs and nonlinear combining functions, we
can build useful and fast random generators, especially in hardware.
Unfortunately there is no satisfying theory for the cryptologic security of this
type of random generators, even less a mathematical proof. Security is assessed
by plausible criteria that—as for block ciphers—are related to the nonlinearity of
Boolean functions.

9.3.8 Perfect (Pseudo)Random Generators

As we saw, the essential cryptologic criterion for random generators is unpre-
dictability. In the 1980s cryptographers, guided by an analogy with asymmetric
cryptography, found a way of modeling this property in terms of complexity the-
ory: Prediction should boil down to a known hard algorithmic problem such as
factoring or discrete logarithm. This idea established a new quality standard for
random generators, much stronger than statistical tests, but eventually building on
unproven mathematical hypotheses. Thus, the situation with respect to the security
of random generators is comparable to asymmetric encryption.
As an interesting twist it soon turned out that in a certain sense unpredictability is a
universal property. For an unpredictable sequence there is no efficient algorithm at all
that distinguishes it from a true random sequence, a seemingly much stronger require-
ment [see Theorem 9.13 (Yao’s theorem)]. This universality justifies the denomination
perfect for the corresponding random generators. In particular there is no efficient
statistical test that is able to distinguish the output of a perfect random generator from

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 455 — #63

i i

9.3 Stream Ciphers 455

a true random sequence. Thus, on the theoretical side, we have a very appropriate
model for random generators that are absolutely strong from a statistical viewpoint,
and invulnerable from a cryptological viewpoint. In other words:

Perfect random generators are cryptographically secure and statistically

indistinguishable from true random sources.

Presumably perfect random generators exist, but there is no complete

mathematical proof of their existence.

The first concrete approaches to the construction of perfect random generators,

the best known being the BBS generator (for Blum, Blum, Shub), yielded algo-
rithms that were too slow for most practical uses (given the then current CPUs).
But modified approaches soon provided random generators that are passably fast
and nevertheless (presumably) cryptographically secure.

9.3.9 The BBS Generator

As with the RSA cipher we consider an integer modulus m that is a product of two
large prime numbers. For the BBS generator we choose (for technical reasons not
to be discussed here) Blum primes; these are primes ≡ 3 mod 4. A product of two
Blum primes is called a Blum integer.
The BBS generator works in the following way: As a first step, we choose two
large random Blum primes p and q, and form their product m = pq. As a second
step, we choose a random integer seed s with 1 ≤ s ≤ m − 1, and coprime with m.
Remark 1: If we catch an s not coprime with m, we have factorized m by hazard.
This might happen, but is extremely unlikely, and can easily be captured at initial-
√
ization time. Remark 2: If xi < m, then xi2 mod m = xi2 , the integer square, so
xi2+1 has the same parity as xi . In order to avoid a constant segment at the begin-
√ √
ning of the output, often the boundary area s < m, as well as s > m − m, is
excluded. However, if we really choose s as a true random value, the probability
for s falling into these boundary areas is extremely low. But to be on the safe side
√ √
we may require m ≤ s ≤ m − m.
Now we proceed with generating a pseudorandom sequence: Take x0 = s 2 mod
m as initial state (so x0 is a quadratic residue), and form the sequence of inner
states of the random generator: xi = xi− 2 mod m for i = 1, 2, 3, . . . In each step,
1
output that last significant bit of the binary representation; that is u i = xi mod 2
for i = 0, 1, 2, . . ., or in other words, the parity of xi .

Example
Of course an example with small numbers is practically irrelevant, but it illustrates
the algorithm. Take p = 7, q = 11, m = 77, s = 53. Then s 2 = 2809, hence
x0 = 37, and u 0 = 1 since x0 is odd. The naive SageMath Example 9.15 shows the
beginning of the sequence of states:
i 0 1 2 3 ...
xi 37 60 58 53 ...
ui 1 0 0 1 ...

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 456 — #64

i i

456 Foundations of Modern Symmetric Encryption

SageMath Example 9.15: A (Much Too) Simple Example for BBS

print ("\n# CHAP09 -- Sage -Script -SAMPLE 260: =========")

p = 7; q = 11; m = p*q; s = 53
print ("m =", m, ", s =", s)
x0 = (s^2) % m; print ("x0 =", x0)
x1 = (x0^2) % m; print ("x1 =", x1)
x2 = (x1^2) % m; print ("x2 =", x2)
x3 = (x2^2) % m; print ("x3 =", x3)

Treating the Blum primes p and q as secret is essential for the security of the BBS
generator. They serve for forming m only; afterward they may even be destroyed.
In contrast with RSA there is no further use for them. Likewise, all the nonoutput
bits of the inner states xi must be secret.
The standard distribution of SageMath contains the BBS generator. It consists
of the procedures:
• random_blum_prime() in the module sage.crypto.util. To generate a
random Blum prime p with a given number k of bits (= digits of the binary
representation) call it as p = random_blum_prime(2**(k-1), 2**k). The
correctness of this algorithm is only empirically founded: In fact there is
always a prime between 2k−1 and 2k , but this doesn’t need to be a Blum
prime. This is a special case of Bertrand’s postulate, proved by Chebyshev
in 1850: There is a prime between n and 2n (for all n ≥ 2). Nevertheless,
empiricism tells us that there are lots of Blum primes in this interval, namely
about 2k /(k log(2)). Thus, an attack by exhaustion will fail.
• blum_blum_shub() from sage.crypto.stream. To generate a sequence
of r pseudorandom bits first generate two random Blum primes p and
q and an initial value x0 = s 2 mod pq, and then call the procedure as
blum_blum_shub(r,x_0,p,q).
SageMath Example 9.16 demonstrates the procedure. The intermediate results
p, q, and x0 are shown in Tables 9.10, 9.11, and 9.12, the result in Table 9.13. By
convention s as well as the factors p and q must be kept secret. Moreover, there
is no reason to reveal the product m = pq. However, considering the progress of
factorization algorithms we should better use Blum integers of at least 2048 bit (see
Section 9.3.10). And in any case s must be a true random value! We neglected this
duty by choosing s as a pure power.

SageMath Example 9.16: Generating a Sequence of BBS Pseudorandom Bits

print ("\n# CHAP09 -- Sage -Script -SAMPLE 270: =========")

from sage.crypto.util import random_blum_prime

from sage.crypto.stream import blum_blum_shub

sl = 1000 # Number of bits to generate in the pseudo random bit �

� sequence
print (" Using random start values (1000 bit length):")
p = random_blum_prime(2^511, 2^512)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 457 — #65

i i

9.3 Stream Ciphers 457

SageMath Example 9.16 (continued)

q = random_blum_prime(2^511, 2^512)
x0 = 11^248 % (p*q) # s = 11^124 % (p*q)
print(blum_blum_shub(sl ,x0,p,q))

print ("\ nUsing the fixed values from the given table as start values :")
no = "8 445 834 617 855 090 512 176 000 413 196 767 417 799 332\
626 936 992 170 472 089 385 128 414 279 550 732 184 808 226\
736 683 775 727 426 619 339 706 269 080 823 255 441 520 165\
438 397 334 657 231 839 251"
p = ZZ(no.replace ("\n", "").replace ("\r", "").replace (" ", ""))

no = "12 580 605 326 957 495 732 854 671 722 855 802 182 952 894\
232 088 903 111 155 705 856 898 413 602 721 771 810 991 595\
365 229 641 230 483 180 760 744 910 366 324 916 344 823 400\
588 340 927 883 444 616 787"
q = ZZ(no.replace ("\n", "").replace ("\r", "").replace (" ", ""))

no = "1 842 408 460 334 540 507 430 929 434 383 083 145 786 026\
412 146 359 363 362 017 837 922 966 741 162 861 257 645 571\
680 482 798 249 771 263 305 761 292 545 408 040 659 753 561\
970 871 645 393 254 757 072 936 076 922 069 587 163 804 708\
256 246 366 137 431 776 175 309 050 064 068 198 002 904 756\
218 898 942 856 431 647 438 473 529 312 261 281"
x0 = ZZ(no.replace ("\n", "").replace ("\r", "").replace (" ", ""))

s= str(blum_blum_shub(sl ,x0,p,q))
bl=4 # blocklength
split = [s[i:i+bl] for i in range (0, len(s), bl)]
print (" Number of blocks = len(split) =", len(split))

for i in range(len(split)):
print ("%s " % (split[i]), end = '' )
print ()

Table 9.10 A Blum Prime p with 512 Bits (154 Decimal Places)
8 445 834 617 855 090 512 176 000 413 196 767 417 799 332
626 936 992 170 472 089 385 128 414 279 550 732 184 808 226
736 683 775 727 426 619 339 706 269 080 823 255 441 520 165
438 397 334 657 231 839 251

Table 9.11 A Blum Prime q with 512 Bits (155 Decimal Places)
12 580 605 326 957 495 732 854 671 722 855 802 182 952 894
232 088 903 111 155 705 856 898 413 602 721 771 810 991 595
365 229 641 230 483 180 760 744 910 366 324 916 344 823 400
588 340 927 883 444 616 787

Table 9.12 An Initial Value x0

1 842 408 460 334 540 507 430 929 434 383 083 145 786 026
412 146 359 363 362 017 837 922 966 741 162 861 257 645 571
680 482 798 249 771 263 305 761 292 545 408 040 659 753 561
970 871 645 393 254 757 072 936 076 922 069 587 163 804 708
256 246 366 137 431 776 175 309 050 064 068 198 002 904 756
218 898 942 856 431 647 438 473 529 312 261 281

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 458 — #66

i i

458 Foundations of Modern Symmetric Encryption

Table 9.13 1000 BBS Pseudorandom Bits

1010 0110 0011 0100 0000 0111 1111 0100 1111 0111 0010 1001
0000 0100 1111 0000 0010 1010 1011 1111 1000 0101 1110 0011
1110 1000 1001 1100 1000 1000 0110 0111 0011 0011 1010 0011
1100 1111 0011 1000 1011 0110 1011 1110 0110 1110 0111 1000
1101 0011 1101 0010 1000 1101 0000 1100 0100 1011 1110 0011
0110 0010 1011 0000 1010 1001 0110 0000 0011 1010 0011 1111
1010 0110 0101 1000 1011 0100 0100 1111 1010 1011 0001 1100
0000 0011 1101 1001 0001 0000 1111 1010 1001 0111 0111 0111
0000 1010 0101 0111 0111 0001 0110 1001 0011 1011 0000 0011
1000 0000 0111 0110 0110 1010 0110 0011 0111 1100 0010 0110
0011 1001 1010 1111 0001 0010 1111 0010 1100 1111 0110 0100
0001 1000 0101 0011 0000 0101 1111 1100 0101 0000 0100 0100
0100 0101 0010 1110 1010 1011 1011 0110 0101 1011 1111 1110
1100 1001 1011 0110 1001 0111 0111 1110 0101 0111 0011 0100
1101 1110 0011 1111 1101 0100 1111 1011 1010 0010 0111 1111
1010 1000 1100 1001 1010 1001 1010 0111 0100 0100 1010 0110
0011 0010 1110 0111 0101 0111 1101 0000 0110 0000 1110 1100
0101 1010 0111 1000 0101 1111 0010 1101 0110 0100 0010 1101
0000 1101 0111 1011 0010 1010 1000 0110 0100 0111 1100 0000
1101 0000 1011 1111 0101 1011 0011 1110 0010 1110 1101 0001
1110 1111 1000 0111 1010 0000 1100 0101 0110 0001

9.3.10 Perfectness and the Factorization Conjecture

Informally, we define a pseudorandom generator (shortly: a random generator) as
an efficient algorithm that takes a short bitstring s ∈ Fn2 and converts it into a long
bitstring s ∈ Fr2 .
The terminology of complexity theory allows us to give a mathematically
exact (but not completely satisfying from a practical point of view) definition
r (n )
by considering parameter-dependent families of Boolean maps G n : Fn2 −→ F2 ,
and analyzing their behavior when the parameter n grows to infinity. Such an
algorithm—represented by the family (G n ) of Boolean maps—can be efficient only
if the expanding function r : N −→ N grows at most polynomially with the
parameter n, otherwise even writing down the output sequence efficiently way is
impossible. Then we measure the cost somehow in a meaningful way; for example,
count the number of needed bit operations that likewise must be at most polynomial
with respect to the asymptotic behavior.
On the attacker’s side we consider algorithms that predict further bits, or aim
at detecting some other weaknesses of our random generator. We analyze the costs
of these algorithms also as functions of n. In case the cost grows faster than any
polynomial, say exponentially, we rate the attack as inefficient.
Pursuing this approach would require a lot of additional formalism including a
model of probabilistic algorithms that are essential tools for the cryptanalyst. This
would take us too far apart for the moment being. However, we bear in mind that
there is a mathematically correct theory formalizing the intuitive idea of efficiency.
Relying on this knowledge we don’t hesitate to reasoning the naive way, and draft
the following definition that in the given form is mathematically incorrect but might
be made correct.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 459 — #67

i i

9.3 Stream Ciphers 459

Definition 9.4 Consider a pseudorandom generator. A next bit predictor is an

algorithm that takes a piece u 0 , . . . , u r −1 from the beginning of the pseudorandom
sequence and calculates the next bit u r , without using the internal parameters of
the pseudorandom generator.
The pseudorandom generator passes the prediction test if there is no efficient
next bit predictor.

For example, LFSRs don’t pass the prediction test: We constructed an efficient
next bit predictor in Theorem 9.12.

Definition 9.5 Consider a pseudorandom generator. A distinguisher is an algorithm

that decides whether a given sequence is purely random or is generated by the pseu-
dorandom generator, without using the internal parameters of the pseudorandom
generator.
The pseudorandom generator is perfect if there is no efficient distinguisher
for it.

In particular no efficient statistical test is able to distinguish a perfect pseudorandom

generator from a true random source. It is a bit of a surprise that the seemingly much
weaker property of passing the prediction test already implies perfectness. In other
words the prediction test is universal:

Theorem 9.13 (Yao’s criterion) A pseudorandom generator is perfect if and only if

it passes the prediction test.

Here stated without proof. Unfortunately, this approach only gives qualitative
results, and so it is somewhat dissatisfying. However, as often in complexity theory,
this is the best we can achieve.

9.3.10.1 The (Conjectured) Perfectness of the BBS Generator

The factorization hypothesis states that there is no efficient algorithm that decom-
poses large natural numbers into their prime factors. This hypothesis is the base of
the security of RSA, as well of the perfectness of the BBS generator:

Theorem 9.14 (Blum/Blum/Shub/Vazirani/Vazirani) Assume the factorization

hypothesis holds. Then the BBS generator is perfect.

We omit the proof (that is quite involved). Sloppily expressed, the theorem
says:

Whoever is able to predict a single bit of a BBS sequence given a partial

sequence is also able to factor the modulus.

This statement assumes that the attacker knows the modulus m of the BBS generator.
However, the modulus might also be secret, that is, considered as a part of the key.
Assuming this, the cryptographic security of BBS should even be better—but no
proof of this stronger statement seems to be known, not even an informal one.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 460 — #68

i i

460 Foundations of Modern Symmetric Encryption

9.3.11 Examples and Practical Considerations

We saw that the BBS generator is perfect under a plausible but unproven assump-
tion, the factorization hypothesis. However, we don’t know relevant concrete
details, for example what parameters might be inappropriate. We know that cer-
tain initial states generate output sequences with short periods. Some examples
of this effect are known, but we are far from a complete answer. However, the
security proof (depending on the factorization hypothesis) doesn’t require addi-
tional assumptions. Therefore, we may confidently use the BBS generator with a
pragmatic attitude: randomly choosing the parameters (primes and initial state) the
probability of hitting bad values is extremely low, much lower than finding a needle
in a haystack, or even in the universe.
Nevertheless some questions are crucial for getting good pseudorandom
sequences from the BBS generator efficiently:

• How large should we choose the parameter m?

• How many bits can we use for a fixed modulus and initial state without
compromising the security?
The provable results, relative to the factorization hypothesis, are qualitative
only, not quantitative. The recommendation to choose a modulus that escapes
the known factorization methods also rests on heuristic considerations only, and
doesn’t seem absolutely mandatory for a modulus that itself is kept secret. The real
quality of the pseudorandom bit sequence, be it for statistical or for cryptographic
applications, can only be assessed by empirical criteria for the time being. We are
confident that the danger of generating a bad pseudorandom sequence is extremely
small, in any case negligible, for moduli that escape the presently known factoriza-
tion algorithms, say at least of a length of 2048 bit, and for a true random choice
of the modulus and the initial state. Remark: Émile Borel proposed an informal
ranking of negligibility of extremely small probabilities: ≤ 10−6 from a human
view; ≤ 10−15 from a terrestrial view; ≤ 10−45 from a cosmic view. By choosing
a sufficiently large modulus m for RSA or BBS we easily undercut Borel’s bounds
by far.
For the length of the usable output sequence we only know the qualitative crite-
rion “at most polynomially many” that is useless in a concrete application. But even
if we only use “quadratically many” bits we wouldn’t hesitate to take 4 millions
bits from the generator with a ≥ 2000 bit modulus. If we need substantially more
bits we would restart the generator with new parameters after every few millions
of bits.
An additional question suggests itself: Are we allowed to output more than a
single bit of the inner state in each iteration step to enhance the practical benefit of
the generator? At least 2 bits?
Vazirani and Vazirani, and independently Alexi, Chor, Goldreich, and Schnorr
gave a partial answer to this question, unfortunately also a qualitative one only:
at least O(log2 log2 m ) of the least significant bits are safe. Depending on the con-
stants that hide in the O, we need to choose a sufficiently large modulus, and trust
empirical experience. A common recommendation is using blog2 log2 mc bits per
step. Then for a modulus m of 2048 bits, or roughly 600 decimal places, we can

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 461 — #69

i i

9.3 Stream Ciphers 461

use 11 bits per step. Calculating x 2 mod m for an n bit number m takes ( 64 ) mul-
n 2

tiplications of 64-bit integers and subsequently the same number of divisions of the
type 128 bits by 64 bits.
For n = 2048 this makes a total of 2 · (25 )2 = 2048 multiplicative opera-
tions to generate 11 bits, or about 200 operations per bit. A well-established rule
of thumb says that a modern CPU executes one multiplicative operation per clock
cycle. Remark: Special processors that use pipelines and parallelism are significantly
faster. Thus, on a 2-GHz CPU with 64-bit architecture we may expect roughly
2 · 109 /200 ≈ 10 million bits per second, provided the algorithm is implemented in
an optimized way. This consideration shows that the BBS generator is almost com-
petitive with a software implementation of a sufficiently secure nonlinear combiner
of LFSRs, and is fast enough for many purposes if executed on a present day CPU.
The cryptographic literature offers several pseudorandom generators that
follow similar principles as BBS:

The RSA Generator (Shamir) Choose a random modulus m of n bits as a product

of two large primes p, q, and an exponent d that is coprime with ( p − 1)(q − 1),
furthermore a random initial state x = x0 . The state transition is x 7→ x d mod
m. Thus, we calculate xi = xi− d
1 mod m, and output the least significant bit, or
the blog2 log2 mc least significant bits. If the RSA generator is not perfect, then
there exists an efficient algorithm that breaks the RSA cipher. Since calculating dth
powers is more expensive by a factor n than squaring, the cost is even higher for
BBS: for a random d the algorithm needs O(n 3 ) cycles per bit.

The Index Generator (Blum/Micali) As modulus choose a random large prime p

of n bits, and find a primitive root a for p. A primitive root for p is an integer
whose powers run through all residue classes 6= 0 mod p, or in algebraic terms, a
generating element of the multiplicative group mod p. Furthermore, choose a ran-
dom initial state x = x0 , coprime with p − 1. Then calculate xi = a xi−1 mod p, and
output the most significant bit of xi , or the blog2 log2 pc most significant bits. The
perfectness of the index generator relies on the hypothesis that calculating discrete
logarithms mod p is hard. The cost per bit also is O(n 3 ).

The Elliptic Index Generator (Kaliski) It works like the index generator, but replaces
the group of invertible elements of the field F p by an elliptic curve over F p (such a
curve is a finite group in a canonical way).

9.3.12 The Micali-Schnorr Generator

Micali and Schnorr proposed a pseudorandom generator that is a descendant of the
RSA generator. Fix an odd number d ≥ 3. The parameter set is the set of all products
m of two primes p and q whose bit length differs by at most 1, and such that d is
coprime with ( p − 1)(q − 1). For an n-bit number m let h (n ) be an integer ≈ 2dn .
Then the dth power of an h (n )-bit number is (approximately) a 2n-bit number.
In the ith step calculate z i = xi− d
1 mod m. Take the first h (n ) bits as the new
state xi , that is xi = bz i /2 n−h ( n ) c, and output the remaining bits, that is yi =
z i mod 2n−h (n ) . Thus, the bits of the result z i are partitioned into two disjoint parts:
the new state xi , and the output yi . Figure 9.24 illustrates this scheme.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 462 — #70

i i

462 Foundations of Modern Symmetric Encryption

Figure 9.24 Micali-Schnorr generator.

But why may we hope that this random generator is perfect? This depends on
the hypothesis: There is no efficient test that distinguishes the uniform distribution
on {1, . . . , m − 1} from the distribution of x d mod m for uniformly distributed x ∈
{1, . . . , 2h (n ) }. If this hypothesis is true, then the Micali-Schnorr generator is perfect.
This argument seems tautological, but heuristic considerations show a relation with
the security of RSA and with factorization. Anyway we have to concede that this
proof of security seems considerably more airy than that for BBS.
How fast do the pseudorandom bits tumble out of the machine? As elemen-
tary operations we again count the multiplication of two 64-bit numbers, and the
division of a 128-bit number by a 64-bit number with 64-bit quotient. We multi-
ply and divide by the classical algorithms. Remark: Multiplication by fast Fourier
transformation (FFT) has an advantage only for much larger numbers. Thus, the
product of s (64-bit) words and t words costs s · t elementary operations. The cost
of division is the same as the cost of the product of divisor and quotient.
The specific recommendation by the inventors is: d = 7, n = 512. Today we
would choose a larger n. The output of each step consists of 384 bits, withholding
128 bits as the new state. The binary power algorithm for a 128-bit number x with
exponent 7 costs several elementary operations:

• x has 128 bits, hence 2 words.

• x 2 has 256 bits, hence 4 words, and costs 2 · 2 = 4 elementary operations.
• x 3 has 384 bits, hence 6 words, and costs 2 · 4 = 8 elementary operations.
• x 4 has 512 bits, hence 8 words, and costs 4 · 4 = 16 elementary operations.
• x 7 has 896 bits, hence 14 words, and costs 6 · 8 = 48 elementary operations.
• x 7 mod m has ≤ 512 bits, and likewise costs 6·8 = 48 elementary operations.

This makes a total of 124 elementary operations; among them only one reduction
mod m (for x 7 ). Our reward consists of 384 pseudorandom bits. Thus, we get about
3 bits per elementary operation, or, by the assumptions in Section 9.3.11, about 6

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 463 — #71

i i

9.4 Table of SageMath Examples in This Chapter 463

American billions (European milliards) bits per second. Compared with the BBS
generator this amounts to a factor of about 1000.
Parallelization increases the speed virtually without limit: The Micali-Schnorr
generator allows complete parallelization. Thus distributing the work among k
CPUs brings a profit by the factor k since the CPUs can work independently of
each other without need of communication.

9.3.13 Summary and Outlook on Stream Ciphers

Stream ciphers need cryptographically secure random generators. These prob-
ably exist, however their security (like the security of almost all ciphers) is
mathematically not completely proven.
But implementing a useful stream cipher takes more than just a good random
generator:

• Message integrity requires additional means such as a combination with a

cryptographic hash function.
• The operational conditions must prevent the reuse of (parts of) the key
stream reliably. This means that the key management requires utmost pru-
dence. A possible approach is using a longtime general key that consists
of certain inner parameters of the random generator, and use the remaining
parameters including the initial state as one-time message key. So you should
treat the start value as a part of the key (i.e., keep it secret and use it only
once). This was a usual approach with the cipher machines of World War II.

In contrast with block ciphers where we have the accepted standard AES (and
the outdated standard DES) for stream ciphers there is no established standard.
Closest to standardization is the eSTREAM portfolio developed in a European
project from 2004 until 2008. It recommends a bunch of several ciphers [5].
Unfortunately, several proprietary ciphers, mostly stream ciphers developed in
back rooms, found their way into security critical applications. Despite the fact that
they relied on security through obscurity, they could (easily) be analyzed by reverse
engineering, and teared to shreds by cryptologists. Therefore, we finish this chapter
with the advice that in an analogous form applies to all parts of cryptography:

Never trust a random generator whose algorithm is kept secret, or for which
no analysis results are publicly available. Statistical analyses are insufficient
as security proofs, just as little as gargantuan periods, or a gigantic choice
of initial states.

9.4 Table of SageMath Examples in This Chapter

Table 9.14 lists all SageMath scripts used in this chapter.

The files from the third column (including the small libraries bitciphers.sage
and FSR.sage) can be downloaded from: https://www.cryptool.org/en/
documentation/ctbook/sagemath.
Further samples not listed in this chapter can be found on the same website.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 464 — #72

i i

464 Foundations of Modern Symmetric Encryption

Table 9.14 SageMath Examples in This Chapter

# Title File name
9.1 Solution of a System of Linear Equations Over Q chap09_sample010.sage
9.2 Solution of a System of Boolean Linear Equations chap09_sample020.sage
9.3 A Boolean Function with Truth Table and ANF chap09_sample030.sage
9.4 XOR Encryption in Python/SageMath chap09_sample130.sage
9.5 A Feedback Shift Register (FSR) in Python/SageMath bitciphers.sage
9.6 A (Very Poor) Pseudorandom Sequence in Python/SageMath chap09_sample140.sage
9.7 Defining an LFSR in Python/SageMath FSR.sage
9.8 A Pseudorandom Bit Sequence in Python/SageMath chap09_sample150.sage
9.9 Determining a Coefficient Matrix chap09_sample160.sage
9.10 The Geffe Function chap09_sample170.sage
9.11 Calculating a Period chap09_sample180.sage
9.12 Three LFSRs chap09_sample190.sage
9.13 Three LFSR Sequences chap09_sample200.sage
9.14 The Combined Sequence chap09_sample210.sage
9.15 A (Much Too) Simple Example for BBS chap09_sample260.sage
9.16 Generating a Sequence of BBS Pseudorandom Bits chap09_sample270.sage

References

We first clarify the meaning and the origin of the term homomorphic. This term
in cryptography is derived from its counterpart in mathematics: In mathematics, a
homomorphism is a structure-preserving map between two algebraic structures. In
the common sense, this means that a homomorphism f : X → Y maps the structure
of X to the structure of Y . Using an example, this can be easily illustrated: Let ( X, +)
and (Y, ∗) two algebraic groups with group operations + and ∗, respectively. A
homomorphism f : X → Y maps any given x ∈ X to a value y ∈ Y , in a way that
it holds (additive homomorphic system):

f (x1 + x2 ) = f (x1 ) ∗ f (x2 ) (10.1)

for any two x1 , x2 in X . This means that for any two values x1 , x2 it does not matter
whether we first compute their sum (group operation of X ) and then apply f (this
is the left side of (10.1)) or, whether we first apply f to the values x1 , x2 , and then
compute their product in Y , thus applying the group operation of Y . Please note
that the operations + and ∗ were chosen here only as an example; they always
depend on the algebraic group they belong to. Naturally, the same relation holds
for homomorphisms between groups with the same group operation.
Example: Let X = Z be the set of integer values. The set Z together with the addi-
tion operation forms an algebraic group G 1 = (Z, +). Similarly, the real values
R without the value zero together with the multiplication operation form a group
G 2 = (R\{0}, ∗). The function f : Z→R\{0}, z→e z is a homomorphism, since for

467

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 468 — #2

i i

468 Homomorphic Ciphers

all z 1 , z 2 ∈ Z it holds: f (z 1 + z 2 ) = e(z 1 +z 2 ) = f (z 1 ) ∗ f (z 2 ). On the contrary,

f : Z → R\{0}, z → z 2 is an example of a function that is not a homomorphism.

10.2 Decryption Function Is a Homomorphism

In the remainder of this chapter we will consider public-key cryptosystems with a

special property, namely that its decryption function is a homomorphism. A public-
key cryptosystem with this property will be called homomorphic.
Let us for now assume, the described homomorphism f (see (10.1)) is the
decryption function of a known cryptosystem. This means that we can perform
certain algebraic operations in the ciphertext space, knowing which effects this will
have on their plaintexts. Following the previous given example we have: Y cor-
responds to the set of cipher texts, X is the set of plaintexts. For two plaintexts
x1 , x2 with corresponding ciphertexts y1 , y2 it holds (multiplicative homomorphic
system):
f ( y1 ∗ y2 ) = f ( y1 ) + f ( y2 ) = x1 + x2 (10.2)

The equation (10.2) can be interpreted as follows: If we multiply two cipher-

texts y1 , y2 with each other and subsequently decrypt their product, then we will
obtain the sum of the originally encrypted values x1 and x2 . Everybody can—
without knowledge of the plaintexts, without having to decrypt, and even without
knowing the private decryption key—compute a product of the two ciphertexts and
know that upon decryption the owner of the private key will obtain the sum of the
two originally encrypted plaintexts.

10.3 Classification of Homomorphic Methods

Homomorphic encryption can be performed by different encryption schemes. The

schemes are classified as:
1. Partially homomorphic;
2. Somewhat homomorphic;
3. Leveled fully homomorphic;
4. Fully homomorphic encryption.
Fully homomorphic systems possess both additive and multiplicative homomorphic
properties.
The first fully homomorphic method was presented by Craig Gentry in 2009
in his dissertation [1]. It was a lattice-based method (see Chapter 11).
The first cryptosystems that allowed fully-homomorphic encryption required
far too much effort to be used in practice. According to their practicability, these
FHE methods are divided into (so far four) generations, with only the first three
representing a real breakthrough:
1. First-generation FHE (e.g., Gentry-Halevi from 2010). The idea of these
systems is to first construct a somewhat homomorphic cryptosystem and
then convert it to a fully homomorphic cryptosystem using bootstrapping.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 469 — #3

i i

10.4 Examples of Homomorphic Pre-FHE Ciphers 469

2. Second-generation FHE (e.g., Brakerski-Gentry-Vaikuntanathan (BGV)

from 2011). The security of most of these second-generation schemes is
based on the hardness of the (ring) learning with errors (RLWE) problem.
3. Third-generation FHE (e.g., Gentry-Sahai-Waters (GSW) from 2013 [2]),
FHEW from 2014 [3], and TFHE from 2016 [4]. They drastically reduced
the bootstrapping time.
4. Fourth-generation FHE (e.g., Cheon-Kim-Kim-Song (CKKS) ab 2016 [5]).

Since 2017, the Homomorphic Encryption Standardization consortium

(https://homomorphicencryption.org/) tries to standardize homomorphic
encryption. Curated lists of open-source FHE libraries can be found in [6–8].
A very good overview including classification, not only about fully homo-
morphic encryption but also about the current state of techniques such as secure
multiparty computation (MPC), is offered by the 100-page report “Encrypted Com-
puting Compass,” prepared by KIT and CISPA at the end of 2022 and published
by the German cyber agency in 2023 [9].
Note: The cryptosystems based on lattice problems, which are used for post-
quantum cryptography, are usually fully homomorphic.

10.4 Examples of Homomorphic Pre-FHE Ciphers

Here, the homomorphic properties of the three pre-FHE cryptosystems Paillier,

RSA, and ElGamal are described as an introduction.

10.4.1 Paillier Cryptosystem

One of the oldest cryptosystems with homomorphic properties is the one by Paillier
[10]. First we will see how the Paillier key generation process works. After that, we
will show that the Paillier cryptosystem indeed has homomorphic properties.

10.4.1.1 Key Generation

First, we generate two random prime numbers p, q in a way that their product n =
pq forms a valid RSA modulus. Here n should have a bit length that corresponds
to the current security standard for RSA modules (as of January 2022, 2048 bits).
Using the prime values p and q, we can compute the value λ = lcm( p − 1, q − 1).
lcm here denotes the least common multiple. The RSA modulus n will now be the
public key, while the private key is the value λ.

10.4.1.2 Encryption
Let m be the message which will be encrypted, where m is taken from the plain-
text space Zn . For each encryption, we first choose a random element r from the
plaintext space Zn . Subsequently, using the public key, we compute the ciphertext
n as:
c = E (m, r ) = (n + 1)m ∗ r n mod n 2

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 470 — #4

i i

470 Homomorphic Ciphers

10.4.1.3 Decryption
Given the private key λ and a ciphertext c ∈ Z∗n 2 , we first compute S = cλ mod n 2
and subsequently T = φ (n )(−1) mod n 2 , where φ denotes the Euler function.
Finally, we compute the plaintext m = D (c) = ( S − 1)/n · T mod n.

10.4.1.4 Homomorphic Property

To prove the homomorphic property of the Paillier cryptosystem, we use E to
denote the encryption and D to denote the decryption function. For simplicity, we
set in the following g := n + 1. From any two plaintexts m 1 , m 2 and random values
r1 , r2 we obtain ciphertexts c1 , c2 as

c1 = g m 1 · r1 n mod n 2 and c2 = g m 2 · r2 n mod n 2 ,

respectively. Now it is easy to see that for the product c3 = c1 · c2 it holds

c3 = (g m 1 · r1 n mod n 2 ) · (g m 2 · r2 n mod n 2 ) = g m 1 +m 2 · (r1 · r2 )n mod n 2

= E (m 1 + m 2 , r 1 · r 2 )

Thus, the product of two given ciphertexts is in fact a valid ciphertext, namely the
encryption of the sum of the originally encrypted messages. Now it is straightfor-
ward to see that the decryption function is a homomorphism. Given two plaintexts
m 1 , m 2 it holds

D ( E (m 1 , r1 ) · E (m 2 , r2 )) = D ( E (m 1 + m 2 , r1r2 )) = m 1 + m 2
= D ( E (m 1 , r1 )) + D ( E (m 2 , r2 ))

10.4.2 Other Cryptosystems

Older public-key cryptosystems can also have homomorphic properties. Both the
ElGamal cryptosystem and RSA constitute famous examples. We will show their
homomorphic properties by means of some easy examples.

10.4.2.1 RSA
Let (e, n ) be the public RSA key (e the public encryption exponent, n the RSA
modulus). For any two messages m 1 , m 2 we obtain the ciphertexts c1 = m 1 e mod n
and c2 = m 2 e mod n. For the product of these two ciphertexts, it holds: c1 · c2 =
m 1 e · m 2 e mod n = (m 1 · m 2 )e mod n. Thus, we obtain an encryption of the product
of the two messages m 1 and m 2 . As it is straightforward to see, this property holds
for any two plaintexts m 1 , m 2 and similar as for Paillier, the decryption function is
a homomorphism. As we have seen here, RSA is an example for a homomorphism,
where both groups have the same group operation.

10.4.2.2 ElGamal
Similar to RSA, we can also show the homomorphic properties of the ElGamal
cryptosystem. Let ( p, g, K ) be the public key while the private key is k (thus, it holds

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 471 — #5

i i

10.5 Applications 471

g k mod p = K ). For any two messages m 1 , m 2 and random values r, s we obtain

ciphertexts ( R, c1 ) = ( K r mod p, m 1 · gr mod p ) and ( S, c2 ) = ( K s mod p, m 2 ·
g s mod p ). As for RSA, we verify that their product ( R · S, c1 · c2 ) is an encryption
of m 1 · m 2 . Again, it is straightforward to see that the decryption function is a
homomorphism.

10.5 Applications

The homomorphic property can be used to add two encrypted values or to multiply
any value under encryption with a known constant (note that the multiplication
corresponds to the repeated application of the addition operation). This makes
homomorphic ciphers important and easy to use base primitives in cryptographic
applications.

1. One of these applications is electronic voting. Electronic voting allows many

voters to submit their ballots in an encrypted form. This is important in
situations where the voters cannot come together to the same location. This
happens, for example, if the voters can only communicate over the internet
via email. If the voting behavior of the single parties should remain secret,
then the use of homomorphic ciphers is a good solution to this problem.
The main principle of electronic voting using homomorphic ciphers is as
follows.
• All voters (on the left in Figure 10.1) encrypt the value 1 if they opt
positive and the value 0, if opposed to the decision.
• Using the homomorphic property, one can compute the sum of all
encrypted ballots. Since this happens on encrypted values, the voting
behavior of all participants remains secret.

Figure 10.1 Voting example for Paillier.

Figure 10.3 Visualization of homomorphic properties in JCrypTool (JCT).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 474 — #8

i i

474 Homomorphic Ciphers

Figure 10.4 Poll demo in CTO using homomorphic encryption.

In 2008, Damgard et al. suggested a homomorphic encryption scheme (DGK)

[14] that can be used as the basis of efficient and general secure MPC. They used it
in a protocol for secure comparison of integers in order to improve the security of
online auctions with this protocol.2

10.6.2 JCrypTool with RSA, Paillier, and Gentry/Halevi

In JCrypTool there is an implementation (see Figure 10.3), that visualizes the
homomorphic properties of various cryptosystem:
• For RSA and Paillier it shows that, for each of them, either one opera-
tion of one type (multiplications for RSA and additions for Paillier) can be
performed on encrypted values.
• For the fully-homomorphic cryptosystem by Gentry-Halevi it is possible to
perform both multiplications and additions on encrypted values.3

10.6.3 Poll Demo in CTO Using Homomorphic Encryption

The poll demo in CTO visualizes, how a Doodle-like application can be built with
a homomorphic encryption algorithm such that security by design is fulfilled. This
implementation makes use of the fully homomorphic BFV scheme with the help of
the library node-seal. When conducting a poll online this way, the inputs, calcula-
tions, and results are hidden from the aggregation server. Figure 10.44 shows what
participant 2 can see after participant 1 entered his data.

References

“Esslinger” — 2023/11/30 — 19:45 — page 476 — #10

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 477 — #1

i i

C H A P T E R 11
C H A P T E R 11
Lightweight Introduction to Lattices

In this chapter, our goal is to cover the basic theory behind lattices in a lightweight
fashion. The theory covered is accompanied by many practical examples, SageMath
code, and cryptographic challenges.
Sections 11.1 through 11.7 introduce the notation and methods needed to
work with and understand lattices (this makes up about a third of this chapter).
Sections 11.8 and 11.9 cover lattices in more detail and their application to attack
RSA. Section 11.10 is intended as a deeper look, providing some algorithms for
lattice basis reduction and their use to break cryptosystems. Section 11.12 contains
screenshots of where lattice algorithms can be found in the CrypTool programs.

11.1 Preliminaries

You are not required to have an advanced background in any mathematical

domain or programming language. Nevertheless, expanding your knowledge
and learning new mathematical concepts and programming techniques will give
you a great boost towards your goal as a future expert in cryptology. This
chapter is self-contained—you are not required to read the previous chapters in
the book. The examples and the practical scenarios are implemented in Sage-
Math (a computer-algebra system that uses Python as scripting language; see
https://www.cryptool.org/en/documentation/ctbook/). We believe that the
code in this chapter is very intuitive and even if you don’t have any experience with
Python syntax you will understand the idea behind it.

11.2 Equations

Equations help us to mathematically describe a relationship between some objects.

https://www.cryptool.org/en/documentation/ctbook/sagemath

x04 − 150x03 + 4389 x02 − 43000 x0 + 131100 = 0

x110 − 177 x19 + 9143 x18 − 228909 x17 + 3264597 x16 − 28298835 x15 +
+152170893 x14 − 502513551 x13 + 974729862 x12 − 995312448 x1 + 396179424 = 0

x210 − 196 x29 + 12537 x28 − 397764 x27 + 7189071 x26 − 77789724 x25 +
+506733203 x24 − 1941451916 x23 + 4165661988 x22 − 4501832400 x2 + 1841875200 = 0

x35 − 153 x34 + 5317 x33 − 77199 x32 + 510274 x3 − 1269840 = 0

x48 − 194 x47 + 11791 x46 − 352754 x45 + 6011644 x44 −

−61295576 x43 + 370272864 x42 − 1222050816 x4 + 1696757760 = 0

x56 − 169 x55 + 7702 x54 − 153082 x53 + 1477573 x52 − 6672349 x5 + 11042724 = 0

x68 − 202 x67 + 12936 x66 − 406082 x65 + 7170059 x64 −

−74124708 x63 + 439747164 x62 − 1365683328 x6 + 1701311040 = 0

x79 − 206 x78 + 13919 x77 − 467924 x76 + 8975099 x75 − 102829454 x74 +
+699732361 x73 − 2673468816 x72 + 4956440220 x7 − 2888395200 = 0

11.3 Systems of Linear Equations

We have already introduced the concepts of variables, equations, and the domain
of an equation. We have shown how to declare variables in SageMath and how to
automatically find solutions to single-variable equations using solve(). What if we
have two different variables in our equation? Let’s take the following equation as
an example: x + y = 10. Again we will try to solve this equation using SageMath.
This time we need the tuple (x, y ) as the solution of solve().
sage: x = var('x', domain =ZZ)
sage: y = var('y', domain =ZZ)
sage: solve(x+y==10, (x,y))
(t_0, -t_0 + 10)
We get as a solution x = t0 and y = −t0 +10 and indeed x + y = t0 +(−t0 +10) =
t0 − t0 + 10 = 10. The notation with t0 is used by SageMath to show us that there
are infinitely many integer solutions to the given equation. In correct mathematical
notation, the solution is the following so-called parameterized set. The parameter t0
is used by SageMath to indicate that there is only one degree of freedom, counting
the first degree of freedom as number zero as usual. Outside SageMath we don’t
use this index and write just t:

L = (x, y ) ∈ Z2 x = t, y = −t + 10, t ∈ Z or L = (t, −t + 10) ∈ Z2 t ∈ Z

What if we have other constraints on x and y? For example, what if we know

that they are equal? Equality defines another equation that is related to the first

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 481 — #5

i i

11.3 Systems of Linear Equations 481

one, so we can form a system of equations (in a system of equations, the individual
equations are not independent of each other):
(
x + y = 10
x=y

Let’s solve this system of equations. We can easily organize all the equations from
this two-equation system using a list array.
sage: x,y = var('x y', domain =ZZ)
sage: solve ([x+y==10 ,x==y], (x,y))
[[x == 5, y == 5]]
As we can see, we have only the one solution (x, y ) = (5, 5).
A rich collection of mathematical problems can be solved by using systems of
linear equations. For example, let’s take the simple puzzle in Figure 11.1 and solve
it using SageMath.
As usual, each row consists of three items and their total price. Usually, the goal
in such puzzles is to find the price of each individual item. We have three different
items. Let’s define the price of each pencil as x, the price of each computer display
as y, and the price of each bundle of servers as z. From the previous declarations
we can write down the following system of linear equations:



2x + y = 15
x + y + z = 20


3z = 30

We can also solve this puzzle using only pen and paper. The last equation gives the
value of z = 10. If we eliminate the variable z by replacing its value in the previous
equations, we reduce the system to system of two unknown variables:


 2x + y = 15

x + y = 10

(z = 10 not unknown anymore)



Figure 11.1 Visual puzzle.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 482 — #6

i i

482 Lightweight Introduction to Lattices

We can now subtract the second equation from the first to get:

2x + y − (x + y ) = 15 − 10 = 5
⇒ x =5

Substituting this into the second equation x + y = 10, we get y = 5. So we have

the following solution to the puzzle:

 x =5

y=5


z = 10

Now let’s try to solve the same puzzle using SageMath.

sage: x,y,z = var('x y z', domain =ZZ)
sage: solve ([x + x + y == 15 , x+y+z == 20 , z+z+z == 30], ( �
� x,y,z))
[[x == 5, y == 5, z == 10 ]]

Challenge 11.2: System of Linear Equations as a Picture

Can you find the hidden message in the picture puzzle in Figure 11.2? Each
symbol represents a distinct decimal digit. There is a balance so that each left
side is equal to the corresponding right side. Automate the process using Sage-
Math. Hint: ASCII (American Standard Code for Information Interchange) is
involved.

In the next sections, we will introduce the definition of matrices, which will
help us describe a given system of linear equations in a much more compact way.

Figure 11.2 Puzzle challenge (picture created by the author).

We now have a tool (algorithm) for solving a system of linear equations. How to
do this with SageMath is described in Section 11.6.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 486 — #10

i i

486 Lightweight Introduction to Lattices

Challenge 11.3: System of Linear Equations

We found a solution to the system of linear equations by just following an
algorithm and using only three main operations. As an exercise, and using
the newly discovered method of solving systems of linear equations, can you
solve this challenge?


 115 b + 111 h + 108 f = 2209

118 b + 101 h + 115 f = 2214


111 b + 114 h + 116 f = 2286


 97 q + 100 m + 100 a = 1582

111 q + 110 m + 101 a = 1748


116 q + 111 m + 101 a = 1786

 97 r + 99 n + 104 t = 910

108 r + 101 n + 116 t = 1005


116 r + 101 n + 114 t = 1019

Definition 11.1 Some, but not all, quadratic matrices have inverses; that is, for
A = (ai, j ) there exits a matrix A−1 such that
 1 0 ... 0 
0 1 ... 0
A · A−1 =  .. .. . . .. 
. . ..
0 0 ... 1

Then A−1 is called the inverse of A. If A has an inverse matrix, A is said to be

invertible.
If A has such an inverse, the inverse is unique and the result of the product A· A−1 =
A−1 · A doesn’t depend on the order of the factors. Note that matrix multiplication
in general is not commutative.
Of course, we can easily calculate the inverse of a matrix in SageMath, if it
exists:
sage: A= matrix ([[0,2,0 ,0],[3 ,0 ,0 ,0],[0 ,0 ,5 ,0],[0 ,0 ,0 ,7 ]]); �
� A
[0 2 0 0]
[3 0 0 0]
[0 0 5 0]
[0 0 0 7]
sage: A. inverse ()
[ 0 1/3 0 0]
[1/2 0 0 0]
[ 0 0 1/5 0]
[ 0 0 0 1/7]

i i

i i
i i

−→ −→
Let’s express the vectors P Q and R Q with the three points P (0, 1), Q (2, 2),
and R (1.5, 1.5) as shown in Figure 11.4.
We can easily do this by following the definition:

−→
P Q = (2 − 0, 2 − 1) = (2, 1)
−→
R Q = (2 − 1.5, 2 − 1.5) = (0.5, 0.5)

Furthermore, if we define the origin as O (0, 0) and some random point Z (x, y )
−→
we can easily define the vector O Z = (x − 0, y − 0) = (x, y ). Using this observation
we can easily calculate the desired vectors using SageMath.
sage: vOP = vector ([0,1 ])
sage: vOQ = vector ([2,2 ])
sage: vOR = vector ([1.5 ,1.5 ])
sage: vPQ = vOQ - vOP
sage: vRQ = vOQ - vOR
sage: print(vPQ , vRQ)
(2,1) (0.5, 0.5)
−→
Intuitively, we can easily check the results. P Q = (2, 1) means that if we start
at point P and move two times to the right and one time up, we will reach point
−→
Q. Following the same interpretation, R Q = (0.5, 0.5) means that if we start from
point R and move 0.5 to the right and 0.5 up, we will reach point Q.

Challenge 11.4: Hidden ASCII

As an exercise, find a famous English quote hidden in Figure 11.5. Hint:
0xASCII, vectors.

x = c1 v1 + c2 v2 + · · · + ck vk

is called a linear combination of vectors v1 , v2 , . . . , vk .

Definition 11.8 Given a vector x = (x1 , x2 , . . . , xn ) in R n ,
q
kxk = x12 + x22 + · · · + xn2

is called the norm (or length) of x.

You can easily calculate the norm of the vector by using SageMath:
sage: v = vector ([3,6,2 ])
sage: v.norm () # square root of (9 + 36 + 4)
7

Definition 11.9 For vectors x = (x1 , x2 , . . . , xn ), y = ( y1 , y2 , . . . , yn ) in R n ,

x1 y1 + x2 y2 + · · · + xn yn

is called the dot product, or scalar product, or inner product of x and y and is
denoted by x · y.
We will not use the Definitions 11.10 to 11.12 in the following sections, but
they are useful consequences of the previous definitions.
Definition 11.10 For nontrivial vectors x = (x1 , x2 , . . . , xn ), y = ( y1 , y2 , . . . , yn )
x·y
in R n there exists θ with 0 ≤ θ ≤ π or 0◦ ≤ θ ≤ 180◦ and ||x||·||y|| = cos θ. Then θ
is called the angle between x and y.
Definition 11.11 If x · y = 0, then x is orthogonal to y. If x is a scalar multiple of
y, then x is parallel to y.
We can easily calculate the inner product of two vectors using SageMath.
sage: x = vector ([5,4,1 ,3 ])
sage: y = vector ([6,1,2 ,3 ])
sage: x*y
45
sage: x. inner_product (y)
45
We can either use the multiply operator or be more strict and use the second
syntax. In fact, x · y = 5 · 6 + 4 · 1 + 1 · 2 + 3 · 3 = 45.
Now it’s time to define the building blocks of a vector space.
Definition 11.12 For any arbitrary, nonzero vector v ∈ R n , u = 1
kvk · v is a unit
vector. In R n , unit vectors of the form:

e1 = (1, 0, 0, . . . , 0), e2 = (0, 1, 0, . . . , 0), . . . , en = (0, 0, 0, . . . , 1)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 491 — #15

i i

11.6 Equations Revisited 491

are called standard unit vectors or coordinate vectors.

Definition 11.12 allows us to express any arbitrary vector x = (x1 , x2 , . . . , xn )

of R n as linear combination of the standard unit vectors:

x = x1 e1 + x2 e2 + · · · + xn en

11.6 Equations Revisited

We introduced the concept of matrices, more specifically the coefficient matrix and
the augmented matrix (see Section 11.4). We studied the Gaussian elimination and
how to use it to solve a system of linear equations.
So, let’s solve the following system of linear equations using SageMath:

96x1 + 11x2 + 101x3 = 634




97x1 + 15x2 + 99x3 = 637


88x1 + 22x2 + 100x3 = 654

First, we define the coefficient matrix A.

sage: A = matrix ([[96 , 11 , 101],[97 , 15 , 99],[88 , 22 , 100 �
� ]])
sage: A
[ 96 11 101]
[ 97 15 99]
[ 88 22 100]
Then, we define the right-hand sides of the equations as a vector b and construct
the augmented matrix.
sage: b = vector ([634 , 637 , 654 ])
sage: b
(634, 637, 654)
sage: B = A. augment (b)
sage: B
[ 96 11 101 634]
[ 97 15 99 637]
[ 88 22 100 654]
Now all we have to do is compute the reduced row echelon form using the
command rref.
sage: B.rref ()
[1 0 0 1]
[0 1 0 3]
[0 0 1 5]
As a final solution, we have x1 = 1, x2 = 3, x3 = 5.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 492 — #16

i i

492 Lightweight Introduction to Lattices

We have already defined operations for dealing with vectors (see Defini-
tion 11.4 and Definition 11.9). Let’s use the same operations when dealing with
matrices.
Definition 11.13 (Addition) Given two matrices A = [ai j ]m×n and B = [bi j ]m×n ,
the sum of A + B is defined by

A + B = [ai j + bi j ]m×n

Definition 11.14 Given a matrix A = [ai j ]m×n and a real number k, the scalar
multiple k A is defined by
k A = [kai j ]m×n

Let’s try some examples with SageMath:

sage: A = matrix ([[9,7 ,0],[0 ,5 ,6],[1 ,3 ,3 ]])
sage: B = matrix ([[8,5 ,2],[8 ,2 ,2],[0 ,0 ,1 ]])
sage: A
[9 7 0]
[0 5 6]
[1 3 3]
sage: B
[8 5 2]
[8 2 2]
[0 0 1]
sage: A+B
[17 12 2]
[ 8 7 8]
[ 1 3 4]
sage: A-B
[ 1 2 -2]
[-8 3 4]
[ 1 3 2]
sage: 2*A
[18 14 0]
[ 0 10 12]
[ 2 6 6]
sage: A.row(0) # get first row of a matrix
(9, 7, 0)
sage: A. column (0) # get first col of a matrix
(9, 0, 1)

Definition 11.15 Given two matrices A = [ai j ]m× p and B = [bi j ] p×n , we define the
product AB of A and B, so that AB = [ci j ]m×n = C, where

ci j = ai 1 b1 j + ai 2 b2 j + ai 3 b3 j + · · · + ai p b pj

Note that the number of columns of the first factor must be equal to the number of
rows in the second factor.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 493 — #17

i i

11.6 Equations Revisited 493

sage: A*B
[128 59 32]
[ 40 10 16]
[ 32 11 11]
From the definition of a product of two matrices, we can make the following
observations: The inner product (see Definition 11.9) of the ith row vector of A
and the jth column vector of B is the (i, j ) entry of C = AB. To demonstrate this
observation we first must introduce another simple definition:
Definition 11.16 The transpose of a matrix is a new matrix whose rows are the
columns of the original; that is, if A = [ai j ]m×n , then A T , the transpose of A, is
A T = [a ji ]n×m .

sage: B. transpose () # alternative command doing the same: �

� B.T
[8 8 0]
[5 2 0]
[2 2 1]
Using this handy SageMath method, we can easily calculate the product of two
matrices:
sage: for a in A:
....: for b in B. transpose ():
....: print(a*b)
....: print ()
....:
128
59
32

40
10
16

32
11
11
Note the preference of SageMath operators:
A*B.transpose() = A*(B.transpose()) and not (A*B).transpose()
In order to introduce the concept of a lattice, we need some more definitions.
Let’s look at the next system of equations and try to express each of the variables
x, y, and z as an expression of the coefficients a, b, c, d, e, f, g, h, i, r1 , r2 , r3 .

ax + by + cz = r1
d x + ey + f z = r2 (11.1)
gx + hy + i z = r3

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 494 — #18

i i

494 Lightweight Introduction to Lattices

Let’s multiply the first equation in (11.1) by ei, the second equation by hc, and the
last equation by b f .

aei x + bei y + cei z = eir1

dhcx + ehcy + f hcz = hcr2 (11.2)
gb f x + hb f y + ib f z = b f r3

Again, using (11.1), we multiply the first line by f h, the second line by bi, and the
last line by ce.

a f hx + b f hy + c f hz = f hr1
dbi x + ebi y + f bi z = bir2 (11.3)
gcex + hcey + icez = cer3

Now we derive a new equation by subtracting from the sum of all (11.2) and the
sum of all (11.3):

(aei + dhc + gb f − a f h − dbi − gce) · x +

+ (bei + ehc + hb f − b f h − ebi − hce) · y +
+ (cei + f hc + ib f − c f h − f bi − ice) · z =
= (ei − f h ) · r1 + (hc − bi ) · r2 + (b f − ce) · r3

Simplify the equation by removing the equal expressions:

(aei + dhc + gb f − a f h − dbi − gce) · x

= (ei − f h ) · r1 + (hc − bi ) · r2 + (b f − ce) · r3

We are now ready to express x:

(ei − f h ) · r1 + (hc − bi ) · r2 + (b f − ce) · r3

x= (11.4)
aei + dhc + gb f − a f h − dbi − gce

Following the same procedure, and carefully choosing the coefficients with
which to multiply the equations, we can also express y and z. But what if we have
a system of 100 equations with 100 variables? Moreover, how does one use a more
elegant way to recover the variables? It’s time to introduce the definitions of minors
and determinants.

Definition 11.17 A minor Mi j of a square matrix A of size n is the (n − 1) × (n − 1)

matrix formed by the rows and columns of A excluding the ith row and the jth
column.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 495 — #19

i i

11.6 Equations Revisited 495

Example
 
a b c
Let’s have a matrix A = d e f . By the definition of minors we have M11 =
g h i

d f a c
e f
, M12 = g i or M22 = i .

h i g
Let’s take the general case of a matrix A with size n.
 
a1,1 a1,2 a1,3 . . . a 1, j . . . a1,n
a2,1 a2,2 a2,3 . . . a 2, j . . . a2,n 
 
. . . a 3, j . . . a3,n 
 
a3,1 a3,2 a3,3
 ... ... ... ... ... ... ... 
 
A= 
a
 i,1 ai,2 ai,3 . . . ai, j . . . ai,n 

 ... ... ... ... ... ... ... 
 
an,1 an,2 an,3 . . . an, j . . . an,n
Then the minor Mi j of A is equal to

. . . a1, j−1 . . . a1,n

 
a1,1 a1,2 a1,3 a1, j +1
 a
 2,1 a2,2 a2,3 . . . a2, j−1 a2, j +1 . . . a2,n  
. . . a3, j−1 . . . a3,n 
 
 a3,1 a3,2 a3,3 a3, j +1
 ... . . . ... ... ... ... ...
 
.

Mi j = 
a
 i−1,1 ai−1,2 ai−1,3 . . . ai−1, j−1 ai−1, j +1 . . . ai−1,n 
ai +1,1 ai +1,2 ai +1,3 . . . ai +1, j−1 ai +1, j +1 . . . ai +1,n 
 
 ... ... ... ... ... ... ...
 

an,1 an,2 an,3 . . . an, j−1 an, j +1 . . . an,n
Now that we have the definitions of minors, we can finally define the determinant
of a matrix.
Definition 11.18 Let A be a square matrix with real elements and some fixed inte-
gers r ∈ {1, . . . , n} and c ∈ {1, . . . , n}. Then its determinant, det ( A) = A is a real
number that can be calculated either by column c or by row r :
n
X
A = (−1)i +c ai,c Mic (expansion along column c)
i =1
n
X
A = (−1)r +i ar,i Mri (expansion along row r )
i =1

Let’s calculate the determinants of a 2 × 2 matrix B and a 3 × 3 matrix C by using

minors on row 1 and column 2, respectively.
Example
Expansion along row 1:

a b
det ( B ) = = (−1)1+1 · a · d + (−1)1+2 · b · c = ad − bc
c d

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 496 — #20

i i

496 Lightweight Introduction to Lattices

Note that in the above calculation, |d| and |c| do not denote the absolute values of
the numbers d and c, but the determinants of the 1 × 1 matrices consisting of d or
c, the minors d = M11 and c = M12 of the matrix B. Note also that det(x ) = x for
every 1 × 1 matrix with entry x, so |d| = d and |c| = c in the last step.

Example
Expansion along column 2:

a b c
det (C ) = d e f
g h i

d f a c a c
= (−1)1+2 · b · + (−1)2+2 · e · + (−1)3+2 · h ·
g i g i d f
= −b(di − f g ) + e(ai − cg ) − h (a f − cd ) =
= aei + dhc + gb f − a f h − dbi − gce

The determinant of this example is exactly equal to the denominator of the right
side of (11.4). What about the numerator? We can easily verify that the numerator
r1 b c
is equal to the determinant of the matrix B1 = r2 e f . If we define the matrices
r3 h i
a r1 c a b r1
B2 = d r2 f and B3 = d e r2 , we can easily calculate the solutions x, y
g r3 i g h r3
and z of the given system of linear equations:

det ( B1 ) det ( B2 ) det ( B3 )

x= , y= , z= (11.5)
det (C ) det (C ) det (C )

So we can use determinants to solve systems of linear equations. SageMath provides

an easy way to compute a determinant, as well as submatrices of our choice.
sage: M = matrix ([[1,2 ,3], [4 ,5 ,6], [7 ,8 ,9 ]])
sage: M
[1 2 3]
[4 5 6]
[7 8 9]
sage: M. determinant ()
0
sage: det(M)
0
sage: M.det ()
0

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 497 — #21

i i

11.6 Equations Revisited 497

# consider row 1 and 3 , and col 1 and 3 (row 1 has index 0 �

�]
sage: M. matrix_from_rows_and_columns ([0 ,2],[0 ,2 ])
[1 3]
[7 9]
!
1 3
M22 =
7 9
The constructor matrix_from_rows_and_columns takes two lists as argu-
ments. The first list defines which rows of a matrix A should be taken to construct
the new matrix, while the second list defines the columns. For example, to construct
the minor (see Definition 11.17) Mi j of the matrix A with size n, we call:
A. matrix_from_rows_and_columns ([0 ,\dots ,i-1 ,i+1 ,\dots
,n-1],[0 ,\dots ,j-1,j+1 ,\dots ,n-1 ])
Now, with all this information we can automate the process of solving a large
system of equations.
Let’s look at the following system of equations:


 x + 9 y + 3z = 61

2x + 4 y + 8z = 94


5x + 7 y + 6z = 128

We can convert the same system of equations into a matrix equation:

     
1 9 3 x 61
2 4 8 ·  y  =  94 
     
5 7 6 z 128

We solve this equation using SageMath:

sage: M = matrix ([[1 ,9 ,3], [2 ,4 ,8], [5 ,7 ,6 ]])
sage: M
 
1 9 3
 2 4 8 
 
5 7 6

sage: r = matrix ([[61],[94],[128 ]])

sage: r
 
61
 94
 

128

sage: M. solve_right (r)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 498 — #22

i i

498 Lightweight Introduction to Lattices

 
13
 3 
 
7
This gives the final solutions x = 13, y = 3, and z = 7.

Challenge 11.5: Leetspeak

Alice and Bob have come up with an interesting (but insecure) encryption
scheme. Alice creates n equations with n variables and sends them to Bob
over an insecure channel using two packets. The first packet consists of all
the coefficients used in the equations in the form of a matrix without any
changes. The second packet, however, consists of all the right-hand sides of
the equations in a scrambled order. Their shared secret key consists of the
original indeces of the scrambled right-hand sides of the equations.
With the secret key, Bob can unscramble the right-hand sides of the equations
and recover the unknown variables. Then, he multiplies all the recovered vari-
ables and the final number is the decrypted message. They used the leet
language to create or read the final number. For example, the word sage in
leet language is 5463. Eve captured the following two packets P1 and P2 :
 
33 79 29 41 47
79 27 39 79 44
 
 
P1 = 
90 83 58 1 90; P2 = [ 73300, 167887, 243754, 254984, 458756 ]
38 32 13 15 96
 

72 82 88 83 23

Can you reconstruct the original message?

11.7 Vector Spaces

We need to introduce another important building block of linear algebra, vector

spaces. First, let’s define what a vector space is. This definition can be found in any
undergraduate book on linear algebra:
Definition 11.19 A vector space over the real numbers R consists of a set V and
two operations ⊕ and , subject to the following conditions/properties/axioms:
1. Closure under addition: ∀E
x ∀Ey ∈ V : xE ⊕ yE ∈ V
2. Commutativity of addition: ∀E
x ∀Ey ∈ V : xE ⊕ yE = yE ⊕ xE
3. Associativity of addition: ∀E
x ∀Ey ∀Ez ∈ V : (xE ⊕ yE) ⊕ Ez = xE ⊕ ( yE ⊕ Ez )
4. Neutral element under addition: ∃ oE ∈ V : (∀E
x ∈ V : xE ⊕ oE = xE)
We call oE the zero vector.
5. Additive inverse: ∀Ex ∈ V ∃Ey ∈ V : xE ⊕ yE = oE
We call yE the additive inverse or inverse element of addition of xE, and vice
versa.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 499 — #23

i i

11.7 Vector Spaces 499

6. Closure in scalar multiplication: ∀r ∈ R, ∀E

x ∈V :r xE ∈ V
7. Distributivity 1: ∀r, s ∈ R, ∀E
x ∈ V : (r + s ) xE = r xE ⊕ s xE
x , yE ∈ V : r
8. Distributivity 2: ∀r ∈ R, ∀E (xE ⊕ yE) = r xE ⊕ r yE
9. Associativity: ∀r, s ∈ R, ∀E
x ∈ V : (r s ) xE = r (s xE)
10. Neutral action of scalar multiplication with 1: ∀E
x ∈V :1 xE = xE
Usually, we do not use the symbol , but just the multiplication point or no
dot at all. But sometimes, as in this definition, we want to point out that the scalar
multiplication is a mapping from R × V to V , while the “regular” multiplication is
something else (i.e., a mapping from R×R to R). Also note that there is some danger
of confusion if we use the same multiplication symbol (or no symbol at all) for both
the scalar multiplication and the scalar product (vector product, inner product).
This can cause some problems when looking at more complicated formulas that
contain both types of products.
We can also define vector spaces over larger sets, such as the set of complex
numbers. It is also possible to define structures like the one above over smaller
sets, for example the set of integers. In this case—when the set of scalars is a so-
called ring and not a so-called field—these algebraic structures are not called vector
spaces, but modules. Our vectors, as well as our choice of the two operators ⊕ and
play an important role in determining whether our space is a vector space or not.

Example Let’s define the set M of all 2 × 2 matrices with entries of real numbers.
Furthermore, we choose the operator ⊕ as a regular additive operator on matrices;
that is, ! ! !
a1 a2 b1 b2 a1 + b1 a2 + b2
⊕ = .
a3 a4 b3 b4 a3 + b3 a4 + b4

We choose the operation to be the already known scalar multiplication of

matrices; that is, ! !
a1 a2 ra1 ra2
r = .
a3 a4 ra3 ra4

We can easily check that all the conditions hold and that this is indeed a vector
space, in which the zero vector is ( 00 00 ).

Example The set P of polynomials with real coefficients is a vector space with
the operator ⊕ defined as the
P regular P
additive operator on polynomials and the
operator defined via r ( ai x i ) := rai x i . For example, if ai , bi , r ∈ R, then:

(a0 + a1 x + · · · + an x n ) ⊕ (b0 + b1 x + · · · + bn x n ) =
= (a0 + b0 ) + (a1 + b1 )x + · · · + (an + bn )x n
r (a0 + a1 x + · · · + an x n ) = (ra0 ) + (ra1 )x + · · · + (ran )x n

Definition 11.20 For any vector space V with operations ⊕ and , a subspace U
is a subset of V which is itself a vector space over the same field under the inherited

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 500 — #24

i i

500 Lightweight Introduction to Lattices

operations ⊕ and . This means that U is closed under addition ⊕ and scalar
multiplication .
Example A trivial subspace of R2 is the 1-element set of the zero vector {(0, 0)}.

Example Let’s define the vector space of cubic polynomials

Cu = {a + bx + cx 2 + d x 3 | a, b, c, d ∈ R}

and the vector space of linear polynomials L i = {e + f x | e, f ∈ R}. Then, L i is a

subspace of Cu .
Definition 11.21 The span (or linear closure) of a non-empty subset S of a vector
space V is the set of all linear combinations of vectors from S.
In short, we can write down the span of a subset S of a vector space V as
follows:
span ( S ) = {c1 vE1 ⊕ · · · ⊕ cn vEn | ci ∈ R, vEi ∈ S}

Note that S itself need not be a subspace, but span( S ) is always a subspace of V . If
span( S ) = U , we say that S generates (or spans) U .

Example For any nonzero vector xE ∈ R3 , the span of xE is a line through the origin
(0, 0, 0), or, more precisely, a one-dimensional subspace containing the zero vector.
Note that a point (e.g., the origin) is not the same as a vector (e.g., the zero vector).

Example Let’s define S = ( 22 ), −22 . We will show that span ( S ) = R2 .

If this is the case, then every vector ( xy ) ∈ R2 can be represented as a linear

combination of vectors in S. So r1 , r2 ∈ R exist, such that
! ! !
2 2 x
r1 · + r2 · =
2 −2 y

So we can express every possible vector ( xy ) ∈ R2 by choosing r1 and r2 such that

r1 = x + y x−y
4 and r2 = 4 . For example, if x = 9 and y = 1, we have r1 = 2 and
5

r 2 = 2. ! ! ! ! !
5 2 2 5 4 9
· +2· = + =
2 2 −2 5 −4 1

The previous example is one way of spanning R2 . Can the set R2 be spanned
by three or more vectors? Sure, we can just duplicate one
of the
elements in the
previous example, namely we can take the set ( 22 ), −22 , −11 . But can R2 be

[2, 1, 3]
sage: per. inverse ()
[2, 3, 1]
sage: per. complement ()
[1, 3, 2]
sage: matrix (per)
[0 1 0]
[0 0 1]
[1 0 0]
sage: grelt= PermutationGroupElement ([1 ,3 ,2 ])
sage: grelt. matrix ()
[1 0 0]
[0 0 1]
[0 1 0]
An interesting thing about these permutation matrices is that their inverse is
identical to their transposed matrix:
sage: matrix (per). inverse () == matrix (per).T
True

Challenge 11.6: Vector Challenge

Inspired by the concept of a basis of a vector space, Alice and Bob invent yet
another cryptosystem. This time they use an encoding encode, which first
encodes each letter into a number. This number is equal to the index of the
corresponding letter in the English alphabet. For example, the word Bob is
encoded in an array [ 2, 15, 2 ].
Alice and Bob have carefully chosen a set of private keys K that they share
as common prior knowledge. Depending on the length of the message to be
sent, a different key is used. Let’s define the key km ∈ K as the key used to
encryt messages of length m.
Each key km is an m × m matrix generated by the following rules:
• Each element of the matrix is either 0 or a prime number between 100
and 999.
• The row vectors of the matrix are linearly independent.
• There are exactly m numbers other than 0.
Now, the encryption E of a message M of length m is a straightforward
procedure:
1. Alice encodes M, so she has the encoded message encode( M ).
2. Alice encrypts encode( M ) with the corresponding key km ; that is,

E (encode( M ), km ) = km · encode( M ).

3. The message E (encode( M ), km ) is sent to Bob over the insecure

channel.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 503 — #27

i i

11.8 Lattices 503

Challenge 11.6 (continued)

Then, the decryption D of a message E (encode( M ), km ) is performed by Bob

following these simple steps:

1. The length of the encrypted message uniquely defines the key km that
Bob should use.
2. Bob constructs the decryption key dm by replacing every element
greater than zero in the secret key km with its reciprocal value.
3. Bob then performs the decryption; that is, D ( E (encode( M ), km )) =
E (encode( M ), km ) · dm .
4. Bob decodes the decrypted message to recover the original message.
Can you verify the correctness of this encryption scheme? Why does this
decryption work or why does it not? Can you recover the following encrypted
ciphertext (one word):
(6852, 3475, 17540, 3076, 12217, 6383, 745, 1347, 661, 6088, 15354, 2384,
2097, 11415, 3143)

Note the third step from Bob in the Challenge 11.6: The matrix dm is not the inverse
of km . The matrix km can be written as a product km = P · D with a diagonal
matrix D and a permutation matrix P. Then km −1 = ( P · D )−1 = D −1 · P −1 =

D · P . If we now construct the matrix dm by replacing each nonzero element

− 1 T

in km by its reciprocal, we get dm = P · D −1 . If we look at the transposed matrix

dmT , we have dmT = ( P · D −1 )T = ( D −1 )T · P T = D −1 · P −1 = ( P · D )−1 = km
−1 .

Instead of using the matrix dm on a column vector from the left, we can let the
T

matrix dm operate on a row vector from the right, since in general for matrices A
and column vectors v and b there is an equivalence Av = b ⇔ ( Av )T = b T ⇔
v T A T = b T . This corresponds to the notation from above, where in the encryption
process, E (encode( M ), km ) = km · encode( M ) means km · encode( M ) as well as
encode( M ) is treated as a column vector while in the decryption process, dm is
written on the right of E (encode( M ), km ), so in this case E (encode( M ), km ) as
well as E (encode( M ), km ) · dm is treated as a row vector.

11.8 Lattices

Now we have all the building blocks to introduce the concept of lattices.
Definition 11.24 Let v1 , . . . , vn ∈ Zm , m ≥ n be linearly independent vectors. An
integer lattice L spanned by {v1 , . . . , vn } is the set of all integer linear combinations
of v1 , . . . , vn , such that:
n
( )
X
L = v ∈ Zm | v = ai vi , ai ∈ Z (11.6)
i =1
Note: Linear combination means that all ai are integers. Integer lattice means that
all vi j (components of vectors vi ) are integers, and thus all points in the infinite
graph have integer coordinates.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 504 — #28

i i

504 Lightweight Introduction to Lattices

SageMath uses the notion of an integral lattice, which is a slightly more com-
plicated concept. Roughly speaking, the components of the vectors v1 , . . . , vn are
not restricted to Z, but can be arbitrary real numbers, while the allowed linear com-
binations
Pn still have to be integer linear combinations; that is, the coefficients ai in
v
i =1 i i have to be integers. Every integer lattice is an integral lattice.
a
The set of vectors B = {v1 , . . . , vn } is called a basis of the lattice L. We also say
that L = L ( B ) is spanned by the vectors of the basis B.
We define the dimension of L as dim( L ) := n.
In the case where n = m, we can canonically construct a quadratic matrix
from the vectors of a lattice basis by writing them down row by row (or column
by column). If we denote this matrix by M, we can compute the product M · M T ,
which is sometimes called the Gram matrix of the lattice. If this Gram matrix has
only integer entries, the lattice is integral. Note that to go into detail here we would
have to introduce some more mathematics, in particular the theory of quadratic
forms, symmetric bilinear forms, and so on, which are a kind of generalization of
the vector product introduced earlier.
The example in Figure 11.6(a) shows a two-dimensional lattice with

Ba = {v1 , v2 } = (1, 2), (−1, 1) ,

while the example in Figure 11.6(b) shows a two-dimensional lattice with

Bb = {v1 , v2 } = (−1, −2), (−1, −1) .

Let’s see how we can construct a lattice using SageMath:

sage: M = matrix(ZZ , [[1,2], [-1,1]])
sage: M
!
1 2
−1 1

Figure 11.6 Example of 2D lattices with different basics: (a) ba and (b) Bb .

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 505 — #29

i i

11.8 Lattices 505

Now we can easily check if a given point (z 1 , z 2 ) belongs to the lattice. If

(z 1 , z 2 ) ∈ L, then it belongs to the span of M.
# The span of a matrix in SageMath is by default the span �
� of the row
# vectors of the matrix .
sage: vector ([1,1]) in span(M)
False

sage: vector ([1,2]) in span(M) # true because [1 ,2 ]=1*x+0 �

� *y
True

sage: vector ([-1,2]) in span(M)

False

sage: vector ([-101,5 ]) in span(M)

True
What if we want to see the exact linear combination that produces the point
z = (z 1 , z 2 )? That is, we want to know how this point z is uniquely constructed by
the given basis vectors x=[1,2] and y=[-1,1].
sage: M. solve_left ( vector ([-101 ,5 ]))
(-32, 69)
And indeed:
sage: -32*M[0] + 69*M[1]
(-101, 5)
We can define the same lattice using different bases. For example, let’s introduce
a lattice over the matrix M 2 with the following basis:
sage: M2 = matrix (ZZ , [[1 ,2], [0 ,3 ]])
sage: M2
!
1 2
0 3
Using SageMath’s span function, we can easily compare the identity of two
objects defined by different bases.
sage: span(M) == span(M2)
True
Screenshots from CT2 showing different lattices can be found in Section 11.12.2.

11.8.1 Merkle-Hellman Knapsack Cryptosystem

Now let’s create another definition before we take a look at the Merkle-Hellman
knapsack cryptosystem. See [3], pages 472 and on.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 506 — #30

i i

506 Lightweight Introduction to Lattices

Definition 11.25 Any set of distinct nonzero natural numbers is called a knapsack.
Furthermore, if this set can be arranged in an increasing list in such a way that
each number is greater than the sum of all previous numbers, we call this list a
superincreasing knapsack.

Challenge 11.7: Superincreasing Knapsacks

Inspired by the definition of the superincreasing knapsack, Alice and Bob
constructed another insecure cryptosystem of their own. Can you find the
hidden word in this intercepted message, shown in the following sequence
of numbers? Hint: Is the knapsack superincreasing? Why or why not?
Each number contains a secret bit. (A copy-and-paste version of these num-
bers is also available at https://www.cryptool.org/en/documentation/
ctbook/sagemath.)

0, 0, 1, 2, 3, 6, 12, 25, 49, 98, 197, 394, 787, 1574, 3148, 6296,
12593, 25185, 50371, 100742, 201484, 402967, 805935, 1611870,
3223740, 6447479, 12894959, 25789918, 51579835, 103159670,
206319340, 412638681, 825277361, 1650554722, 3301109445,
6602218890, 13204437779, 26408875558, 52817751117, 105635502233,
211271004467, 422542008933, 845084017867, 1690168035734,
3380336071467, 6760672142934, 13521344285869, 27042688571737,
54085377143475

The Merkle-Hellman knapsack cryptosystem is another asymmetric cryptosys-

tem that is theoretically interesting because it basically allows sensitive information
to be sent over an insecure channel.
It consists of two knapsack keys:

• Public key, used only for encryption. It’s called a hard knapsack.
• Private key, used only for decryption. It consists of a superincreasing knap-
sack, a multiplier, and a modulus. The multiplier and modulus can be used
to transform the superincreasing knapsack into the hard knapsack.

The key generation algorithm performs the following steps:

1. First, we create a superincreasing knapsack W = [w1 , w2 , · · · , wn ].

2. We choose an integer q that is greater than the sum of all elements in W ;
that is,
Xn
q> wi
i =1

We define q as the modulus of our cryptosystem.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 507 — #31

i i

11.8 Lattices 507

3. We choose an integer r ; that is, r ∈ [1, q ) and (r, q ) = 1, where (r, q ) is

the notation for the greatest common divisor (gcd) of r and q and [1, q ) =
{1, 2, . . . , q − 1}. We define r as the multiplier of our cryptosystem.
4. The private key of the cryptosystem consists of the tuple (W, r, q ).
5. We generate the sequence H = [h 1 , h 2 , · · · , h n ], s.t. h j = w j ∗ r mod q, for
1 ≤ j ≤ n. We define H as the public key of the cryptosystem.
If we want to encrypt a message m, we first take its bit representation Bm =
m 1 m 2 . . . m n , where m i denotes the ith bit; that is, m i ∈ {0, 1}. To ensure the
correctness of the algorithm, our superincreasing knapsack K should have at
least n elements. Let’s define it as W = [w1 , w2 , . . . , wn , . . . ]. After the key gen-
eration procedure, we generate its corresponding public key H ; that is, H =
[h 1 , h 2 , . . . , h n , . . . ], with some appropriate q and r . Then, the encryption c of m is
the sum
X n
c= mi hi .
i =1

If we want to decrypt the message c, we first compute c0 = c ·r −1 mod q, where r −1

is the modular inverse of r mod q. Then, we start a decomposition procedure of c0
by selecting the largest elements in W that are less than or equal to the remaining
value being decomposed. Finally, we recover m = m 1 m 2 . . . m n by replacing m j with
1 if the element w j was selected in the previous step. Otherwise, m j is 0.
We intentionally describe the pure algorithm only as a cryptographic scheme
(because of its weakness against lattice attacks), and do not discuss practical imple-
mentation issues such as ensuring that the length of Bm is less than or equal to the
length of H , or how and when to apply padding.

Example Let’s assume that Alice wants to encrypt and send the message crypto to
Bob using the Merkle-Hellman knapsack cryptosystem. Throughout this example,
each letter is treated independently. Thus, n is always 8, because each letter has an
8-bit binary representation.
First, Bob must generate his private and public keys. Bob initiates the process
of generating the private key by first generating a superincreasing knapsack W :

W = [11, 28, 97, 274, 865, 2567, 7776, 23253]

Then, Bob generates the corresponding modulus q and multiplier r :

n
X
q = 48433 > wi = 34871
i =1

r = 2333 < q
(2333, 48433) = 1

So, Bob composes the private key Pr = (W, r, q ):

Pr = ([11, 28, 97, 274, 865, 2567, 7776, 23253], 2333, 48433)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 508 — #32

i i

508 Lightweight Introduction to Lattices

The final step for Bob is to generate the hard knapsack H and the public key Pu =
( H ) and deliver it to Alice:

H = [25663, 16891, 32569, 9613, 32292, 31552, 27466, 4289]

Pu = ([25663, 16891, 32569, 9613, 32292, 31552, 27466, 4289])

Before encrypting the message M = crypto, Alice divides the message into
individual letters and replaces each letter with its own bit representation; that is:

c = 01100011
r = 01110010
y = 01111001
p = 01110000
t = 01110100
o = 01101111

Now, Alice computes the corresponding encrypted number for the bit representa-
tion of each letter using the public key H . Thus, the algorithm must be applied six
times. Finally, the list of encrypted numbers C of the word crypto is:

C = [81215, 86539, 95654, 59073, 90625, 145059]

When Bob receives C, he first calculates C 0 using r and q from Pr .

C 0 = [31154, 8175, 24517, 399, 2966, 34586]

Then, using W from Pr , he represents each element in C 0 as a sum of elements in

W , following the above algorithm. For example, let’s decompose 31154. The sign
3 will denote the elements in W that are part of the decomposition of 31154, and
the sign 7 will denote those that aren’t. The sign * will denote the unknowns.

[ 11, 28, 97, 274, 865, 2567, 7776, 23253 ]

[ * * * * * * * * ] , 31154

The largest number in W less than 31154 is 23253. We mark it as an element

used in the decomposition of 31154, and we continue with the decomposition of
the remaining value 7901 = 31154 − 23253:

[ 11, 28, 97, 274, 865, 2567, 7776, 23253 ]

[ * * * * * * * 3 ] , 7901

The largest element less than 7901 is 7776. We continue with this algorithm
until we reach 0.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 509 — #33

i i

11.8 Lattices 509

[ 11, 28, 97, 274, 865, 2567, 7776, 23253 ]

[ * * * * * * 3 3 ] , 125
[ * * * * * 7 3 3 ] , 125
[ * * * * 7 7 3 3 ] , 125
[ * * * 7 7 7 3 3 ] , 125
[ * * 3 7 7 7 3 3 ] , 28
[ * 3 3 7 7 7 3 3 ], 0
Thus, at the end, 31154 is decomposed into 01100011, which is the bit rep-
resentation of the letter c. By applying the same decryption algorithm to all the
elements of C, Bob finally recovers the encrypted message crypto.

Challenge 11.8: Encryption with Knapsacks

in time polynomial in n and in the bit size of the entries of the basis matrix B.

In other words, the L 3 algorithm will produce another basis of the lattice con-
sisting of vectors with restrained norms given by the inequality in Theorem 11.1.
The L 3 algorithm is already built into SageMath.
Example Let’s say Eve intercepts a message between Alice and Bob that is encrypted
using the Merkle-Hellman knapsack cryptosystem. Since everyone has access to the
public key of the cryptosystem, Eve also has it. The intercepted message C is:

C = [318668, 317632, 226697, 388930, 357448, 297811,

344670, 219717, 388930, 307414, 220516, 281175]

The corresponding public key hard knapsack H is the vector:

H = [106507, 31482, 107518, 60659, 80717, 81516, 117973, 87697]

To recover the message, Eve must decrypt each element c in C. For example, let’s
start with c = 318668.
sage: H = [106507, 31482 , 107518 , 60659 ,
80717, 81516 , 117973 , 87697]
sage: c = 318668
Then we start to construct the lattice by first building the identity matrix:
sage: I = identity_matrix (8)
sage: I
 
1 0 0 0 0 0 0 0
 0 1 0 0 0 0 0 0 
 
 0 0 1 0 0 0 0 0 
 
 
 0 0 0 1 0 0 0 0 
 
 0 0 0 0 1 0 0 0 
 
 0 0 0 0 0 1 0 0 
 

 0 0 0 0 0 0 1 0 


0 0 0 0 0 0 0 1
We add another row full of zeros:
sage: I = I. insert_row (8 , [0 for x in range (8)])
sage: I

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 512 — #36

i i

512 Lightweight Introduction to Lattices

 
1 0 0 0 0 0 0 0

 0 1 0 0 0 0 0 0  
0 0 1 0 0 0 0 0 
 

 

 0 0 0 1 0 0 0 0  

 0 0 0 0 1 0 0 0  

 0 0 0 0 0 1 0 0  
0 0 0 0 0 0 1 0 
 

 
 0 0 0 0 0 0 0 1 
0 0 0 0 0 0 0 0
Finally, we add the last column with H transposed and c. However, we will
flip the sign of c; so the first vector of the reduced basis should have a last element
equal to 0 and all other elements equal to 1 (instead of −1).
sage: L_helper = [[x] for x in H] # vector of vectors
sage: L_helper . append ([-c])
sage: L = I. augment ( matrix ( L_helper ))
sage: L
 
1 0 0 0 0 0 0 0 106507
 0 1 0 0 0 0 0 0 31482 
 
 0 0 1 0 0 0 0 0 107518 
 
 
 0 0 0 1 0 0 0 0 60659 
 
 0 0 0 0 1 0 0 0 80717 
 
 0 0 0 0 0 1 0 0 81516 
 
 0 0 0 0 0 0 1 0 117973 
 
 
 0 0 0 0 0 0 0 1 87697 
0 0 0 0 0 0 0 0 −318668
To reduce the basis, we will now apply the L 3 algorithm by simply calling the
SageMath LLL() function.
sage: L.LLL ()
 
0 1 0 0 0 1 1 1 0
 −1 1 0 1 −1 −2 −2 2 1 
 
 3 1 2 −1 1 1 −1 1 1 
 
 
 1 −1 −2 −1 −3 −1 1 1 1 
 
 2 −2 −1 1 0 2 −3 1 1 
 
 0 0 3 −4 −2 1 0 0 0 
 
 −1 3 −1 3 0 0 −1 −3 2 
 
 
 0 −1 1 4 0 0 0 0 4 
−2 −1 −2 −3 1 −1 2 1 3
The first candidate (the shortest vector in the reduced basis) is the one we were
looking for:
sage: L.LLL ()[0][:-1]. dot_product ( vector (H))
318668

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 513 — #37

i i

11.9 Lattices and RSA 513

So the binary representation of the encrypted character is 01000111. Using

SageMath again, we can check the corresponding letter:
sage: bs = ''.join ([ str(x) for x in L.LLL ()[0 ][: -1 ]])
sage: bs
'01000111'

sage: chr(int(bs ,2))

'G'

Challenge 11.9: LLL

In the example in Section 11.8.2 above, G is the first letter of the recovered
text. Using SageMath, lattices, and the LLL algorithm, can you recover the
rest of the text?

The CT2 screenshots in Figures 11.12 and 11.13 show a mouse-driven

visualization of reducing a two-dim basis with Gauss and a ready-to-run LLL
implementation for reducing the basis of higher-dimensional bases.

11.9 Lattices and RSA

RSA is one of the first asymmetric cryptosystems. The inner workings of RSA
have been thoroughly explained and demonstrated in Chapter 5 of this book. This
section assumes that you are already familiar with how the RSA cryptosystem
works. However, we will briefly review the basics of key generation for the RSA
algorithm using SageMath. Then we show how RSA can be attacked using lattices.

11.9.1 Textbook RSA

The raw RSA method (without padding) is called textbook RSA and is not suitable
for practical use as either an encryption or signature method [6]. It consists of the
following steps (see Section 5.10.2 for more details).
• Two large distinct primes p and q are generated.
• Their product n = pq is called the modulus.
• Then, we pick a number e, such as e is relatively prime to φ (n ), Euler’s totient
function. We define e as the public-key exponent.
• We compute d as the modular multiplicative inverse of e modulo φ (n ). We
define d as the private-key exponent.
• The pair (n, e) is the public key.
• The pair (n, d ) is the private key.
To avoid some known attacks on RSA, we need to choose our parameters
wisely. Some of the requirements and recommendations can be found in [7].
Now let’s encrypt the word asymmetric using SageMath and the RSA cryp-
tosystem. First, we need to think about the encoding strategy (i.e., the translation of

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 514 — #38

i i

514 Lightweight Introduction to Lattices

strings into numbers). Throughout this section, we will use the following encoding
procedure:

• Let’s denote the string to be encoded as S = s1 s2 · · · sn .

• We replace each symbol si in the string with its decimal ASCII code
representation. For example, the symbol “g” is replaced by “103.”
• Then each decimal ASCII code is replaced by its binary representation. For
reversibility purposes, as long as the length of the binary representation is less
than 8, we append at the beginning as many 0s as necessary. For example,
the binary representation of 103 is 1100111. However, the length of the
binary representation is seven, so we add another zero at the beginning to
get 01100111.
• We form an 8n-bit string by concatenating the n strings of 8 bit each, starting
with the 8-bit representation of the first letter of S from the left.
• Finally, we convert S to an integer in decimal representation.

For example, let’s encode the word asymmetric. First, we replace each symbol
of S with its corresponding decimal ASCII value:
sage: S = " asymmetric "
sage: S_ascii = [ord(x) for x in S]
sage: S_ascii
[97, 115, 121, 109, 109 , 101 , 116 , 114 , 105 , 99]
Then we replace each element in S_ascii with its binary equivalent. To get rid
of the leading 0b of the binary strings, we use [2:].
sage: S_bin = [bin(x)[2 :]. zfill (8) for x in S_ascii ]
sage: S_bin
['01100001 ', '01110011 ', '01111001 ', '01101101 ', '01101101 �
� ',
'01100101 ', '01110100 ', '01110010 ', '01101001 ', '01100011 �
� ']
Finally, we concatenate all the elements in S_bin and convert this concatenation
to a decimal number:
sage: SS = Integer (''. join( S_bin ),2)
sage: SS
460199674176765747685731
To check the reversibility of the encoding procedure, let’s decode the result
back:
sage: SS_bin = bin(SS)[2 :]
sage: SS_bin
'110000101110011011110010110110101101101011001010111010001 �
� 1100100110100101100011 '
sage: len( SS_bin )
79

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 515 — #39

i i

11.9 Lattices and RSA 515

sage: while len( SS_bin ) %

....: SS_bin = '0 ' + SS_bin
sage: SS_ascii = [chr(int( SS_bin [x*8:8 *(x+1)],2))
....: for x in range (len( SS_bin )/8)]
sage: ''.join( SS_ascii )
'asymmetric '
When we are done with the encoding procedure, we initialize the RSA
parameter generation step and generate p, q, and n:
sage: b = 512
sage: p = random_prime (2 **b-1 , lbound =2 **(b-1)+2 **(b-2))

b−In the previous example, we generated a random prime number in the interval
1 + 2 , 2 − 1 . Let’s say we have two primes in this interval; that is,
b−2 b

2

p = 2b−1 + 2b−2 + ρ1
q = 2b−1 + 2b−2 + ρ2

for some ρ1 and ρ2 . When we multiply the primes, we have:

p · q = (2b−1 + 2b−2 + ρ1 )(2b−1 + 2b−2 + ρ2 ) =

= 22b−2 + 22b−3 + 2b−1 ρ2 + 22b−3 + 22b−4 +
+ 2b−2 ρ2 + ρ1 2b−1 + ρ1 2b−2 + ρ1 ρ2 =
= 22b−2 + 2 · 22b−3 + Ω =
= 22b−2 + 22b−2 + Ω =
= 2 · 22b−2 + Ω = 22b−1 + Ω > 22b−1

This guarantees that the bit length of their product is 2b. The method nbits()
returns the bit length of a number.
sage: p.nbits ()
512
sage: q = random_prime (2 **b-1 , lbound =2 **(b-1)+2 **(b-2)); �
� q.nbits ()
512
sage: N = p*q; N.nbits ()
1024
It’s time to choose the public exponent e. A common choice of value for e is
216 + 1.
sage: e = 2**16 + 1; e
65537
SageMath has a built-in function euler_phi(). However, if we directly type
euler_phi(N), SageMath will try to factor N = pq. It’s better to manually calculate
the number of positive integers that are relatively prime to N and not greater than N
(this is easy as we know the factors of N and can use the formula—see Section 5.8.2):

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 516 — #40

i i

516 Lightweight Introduction to Lattices

sage: phi_N = (p-1)*(q-1)

Now that we have φ (n ), we can calculate d using the built-in function
inverse_mod():
sage: d = inverse_mod (e, phi_N )
Let’s make sure that ed ≡ 1 mod φ (n ):
sage: assert d < phi_N
sage: e*d %
1
We are ready to encrypt the encoding SS of the message “asymmetric.” The
encryption can be calculated directly using SS**e%N. However, we will use the built-
in function power_mod(), which is much faster than the direct calculation. (On a
2020 Mac Mini, the direct calculation took 62 ms of CPU time, and power_mod
took less than 1 ns. “Much faster” is indeed true, though not in our human
experience of time.)
sage: encrypted = power_mod (SS ,e,N)
To decrypt the message:
sage: decrypted = power_mod (encrypted ,d,N)
sage: decrypted
460199674176765747685731

Challenge 11.10: RSA

Alice and Bob again decided to use their own encoding scheme and RSA imple-
mentation to secure their communication. The encoding scheme used is quite
simple: Alice translates each letter from the plaintext to its decimal ASCII rep-
resentation. In this way, Alice sends as many encrypted messages as the length
of the original unencrypted message.
This scheme has one major drawback: When large plaintexts are
encrypted, the collection of intercepted encrypted messages is vulnerable to
frequency analysis attacks. To work around this, Alice and Bob renew their
RSA keys when they reach a threshold number of messages sent.
However, they made another mistake. The RSA public key is:
N = 68421763258426820318471259682647346897299270457991365227523187215179279937768
782117901469556159380911527267431206861529333842025857168541446464704428050808114
500301719380630918908935780489117272692352098164110413822642670298657847312225801
755784399864594975116547815856011474793429956418177277806187836101061
e = 127
Eve intercepted the following 11 messages (sent in this order):
c1 = 20842273115788965044434568911351496365025941709963878891654635864614247250595
415337877670412884297380645809556224513056164981861313077151294657843754553657687
729573741274326907928991221246371764225295669345864765449163254397423969283552347
12078183
c2 = 20893454506753506195646780042379588087816651548061669824679147298111722465210
531962697364936758882446841738989887223022907938140873068062126260148654403891017
812919490588535501594235621529922408698085740859622639421733168633622772463221300
97359903249313

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 517 — #41

i i

11.9 Lattices and RSA 517

Challenge 11.10 (continued)

c3 = 15351988337964905099701960376347076532439043589617467841763029200910162258670
083394997893897566293618070669709960876401857858929595090590610409163098252906678
940270348578645367087079347216129233790173543224953780408040213693490500724979084
761978285646999197843456
c4 = 27904208858505919506689042950498726643957813119400643293322771534900822430923
585848781848455603612081226575100801226570484212026594858252089217840328837906708
276016306114842897236574701434742246311142664328247890170520592851161647470983489
359620795002699
c5 = 14562438053393942865563565506076319964337086564418653275680839364685346358348
263872708128968423412681687735816462730409112745256517215618953897227627256898533
454858045297931958376394955610471867756244498725191655684274134657700794939801031
701760045360349184
c6 = 37370666535363066168961624931547694539547602092327751979114535130065929115448
532953082477972777170290304404725670126936586698604529648793581659263060970546938
259944838952911170478265448614822495177677220252704340545251785434955476627944717
241828329
c7 = 57018830572739461491984788995673407709179639540390781558806685226845173001582
252740946299992152591496992831944316269907785235915676185879264232465783672876342
034636885982343764812696958235155060812119686263202672834115657789006658553081283
546825372990992701071
c8 = 45667206025566800148122417818312587397117202643844088800399634507595677539812
531804389800633373563635203026530295808267186537869501854999997585813165610459945
099323041449890076258008953903360989998622098817497527261455918497690247104725594
122565082035057621175773
c9 = 16862273135393186478865050591393354467469966242203319758781467127096457948108
889467619633506282224651511348513130613164713603622844197532314784054159853644772
397257957431077887146712893548225102037664810557100757780577122589408625865295995
70943303841410139029504
c10 = 3418491651580535268325058631927829312241628227597886128328917486594860261067
379103830760216028449225930969223686237530104472510254235823993961419544594682867
68770464472831982875580635659918156592765109749350304246573018358473129678989
c11 = 5112410579669852183534608382871115572785704026088086397778432135886054190982
581109427995967742224987294310956529550255351980372648223511404048486808382051821
395722995189698471430744248012819713379428438493366462166096818135752055667353488
388471305370330810546875
Can you decrypt each of these messages and reconstruct the original message?

Screenshots from CT1 of a ready-to-run implementation of an attack against

textbook RSA can be found in Section 11.12.1.

11.9.2 Lattices versus RSA

In [8] and [9] a whole new family of attacks on RSA is published, attacks that use
lattices and lattice reduction algorithms.
As we showed earlier, it is easy (using SageMath) to find the roots of a polyno-
mial in a single variable over the integers. However, finding the roots of a modular
polynomial is hard; that is,
f (x ) ≡ 0 mod N

Let N be a large composite integer of unknown factorization, and let us have

a univariate integer polynomial (a polynomial in a single variable x) f of degree r ;
that is,
f (x ) = x r + ar −1 x r −1 + ar −2 x r −2 + · · · + a1 x + a0

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 518 — #42

i i

518 Lightweight Introduction to Lattices

520 Lightweight Introduction to Lattices

as T1 ; that is, T1 ("xy") = T1 ("x")||T1 ("y"), where the symbol || denotes the
concatenation of strings.
With this in mind, we can write:

S 0 = T1 ( S ) = T1 ("The password for AES usage is: ")||T1 ("C1 C2 · · · C7 ")

Challenge 11.11: RSA Attack for Small Exponents

Eve has inspected Bob’s public key and intercepted the encrypted message c:
N = 87105263120665488502276714807618005091167547295414522403403
858260445937978202584195976927011541286969726503590767189236676
74207764635845821959411262760499
e=3
c = 53324798259879463395628746557109686362316082380119849133012
624471422613225752245493713055662721650611249304697332495775034
7628241445331227809291995164455
As an exercise, can you recover the original message using the above lattice
attack?

After this first case, which we could solve with m = 1 (see (11.10) after
Theorem 11.2), we will now look at another example where we need a larger m.

Challenge 11.12: Harder RSA Attack for Small Exponents

Soon after Eve’s successful lattice-reduction attack, Alice and Bob were aware
of the insecure scheme they were using to encrypt their correspondence. How-
ever, they thought that this was possible because they were using passwords
that were too short. They increased the length of their passwords from 7 to
13 characters.
Bob’s public key was left intact. The newly intercepted message c is:

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 524 — #48

i i

524 Lightweight Introduction to Lattices

Challenge 11.12 (continued)

c = 74875898628081924295230958863232737993265641657662214847638
483603488006886812559351998510022388544254768721487918947883514
84389959862897996873514056282948
Can you recover the newly exchanged password? Here are a few things to
consider:

• First, let’s try with m = 1. Why does the attack fail?

• Now let’s try with m = 2. The dimension of the new lattice will be
em + 1.
• The polynomial f is still the same. However, we need more helper
polynomials gi, j to construct our larger lattice. Equation (11.10) gives
6 polynomials gi j and the coefficients of the expanded polynomial
( f (x ))2 in the last row. Finally, you should construct the following
lattice:
N2
 
0 0 0 0 0 0
 0 N 2X 0 0 0 0 0 
 
 0 0 N 2X2 0 0 0 0 
 
 N (a 3 − c) 3N a 2 X 3N a X 2 N X3 0 0 0
 


 0 N (a 3 − c)X 3N a 2 X 2 3N a X 3 N X 4 0 0


0 N (a 3 − c)X 2 3N a 2 X 3 3N a X 4 N X 5
 
 0 0 
f0 f1 f2 f 3 15a 2 X 4 6a X 5 X 6

f 0 = (a 3 − c)2

f 1 = 6a 2 (a 3 − c)X

f 2 = a(15a 3 − 6c)X 2

f 3 = (20a 3 − 2c)X 3

Challenge 11.13: Even Harder RSA Attack for Small Exponents

For the next stereotype challenge, you have the following parameters:
N = 11225354852531229312705821542018938144842129865964887302659
527454109100726811386634830746189351282654513875609737248472970
850378942751600939858273386551545517779039415955461309475780898
540832830799402322878253010276386956878356093590307746836948987
2109334310118979950207071108280219620362737467760308227448837
e=7
c = 10670654096244930306696108877164811975817784621106090830133
614424028968837154232320341636292740214826278191136787096724376
919541317293439292857379222722071531141744387571381425401895924
313275286061958740212489324845146783892027379475831082755439284
1573679450441556883666302722319029010463140829183505391092171
This time the degree of the polynomial f is 7 and the password consists of
14 ASCII characters. You will need to try different values of m to construct a
large enough, but still compact lattice.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 525 — #49

Input: Lattice basis {a, b} = {(1, 7), (−1, 1)}

a·b
{a, b} = {b − · a, a}
a·a
(1, 7) · (−1, 1)

= {(−1, 1) − · (1, 7), (1, 7)}
(1, 7) · (1, 7)

6
= {(−1, 1) − · (1, 7), (1, 7)}
50
= {(−1, 1), (1, 7)}
√ √
Since ||b|| = 50 > 40 = ||a − b|| we need to run another iteration:

a·b
{a, b} = {b − · a, a}
a·a
(−1, 1) · (1, 7)

= {(1, 7) − · (−1, 1), (−1, 1)}
(−1, 1) · (−1, 1)

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 527 — #51

i i

11.10 Lattice Basis Reduction 527

6
= {(1, 7) − · (−1, 1), (−1, 1)}
2
= {(1, 7) − 3 · (−1, 1), (−1, 1)}
= {(4, 4), (−1, 1)}

√ √
Now ||a|| = 32 > 2 = ||b|| and we need another iteration:

a·b
{a, b} = {b − · a, a}
a·a
(4, 4) · (−1, 1)

= {(−1, 1) − · (4, 4), (4, 4)}
(4, 4) · (4, 4)

0
= {(−1, 1) − · (−1, 1), (4, 4)}
32
= {(−1, 1), (4, 4)}

√ √ √
Since ||a|| = 2< 32 = ||b|| < 34 = ||a − b||, the algorithm ends.
Output: {a, b} = {(−1, 1), (4, 4)}
The vector (−1, 1) is a shortest nonzero vector in L and (4, 4) is a shortest vector
of L that is linear independent of (−1, 1).

In higher dimensions the calculation of an HKZ-reduced basis (see Definition

11.28) is very inefficient (no polynomial time algorithm is known). We define a
hierarchy of notions that approximate HKZ-reduced bases in reasonable time.

Definition 11.29 An ordered lattice basis b1 , · · · , bm ∈ Rn is called size-reduced

if |µi, j | ≤ 12 for 1 ≤ j < i ≤ m. An individual basis vector bi is size-reduced if
|µi, j | ≤ 12 for 1 ≤ j < i.

A size-reduced lattice basis consists of vectors that are almost orthogonal to

each other.
In Section 11.8.2 we already used the LLL algorithm in SageMath in order to
reduce a basis and solve a knapsack example. We now give a formal definition of
LLL-reduction as well as the details of the algorithm as this is used as a subroutine
in further algorithms described later in this chapter.

Definition 11.30 Let δ be a constant, 0 < δ ≤ 1. We call a basis b1 , . . . , bm ∈ Rn

LLL-reduced with δ if it is size-reduced, such that

δ||b̂k−1 ||2 ≤ ||b̂k + µk,k−1 b̂k−1 ||2 for k = 2, · · · , m

The two algorithms for calculating respectively the ordered lattice basis and
LLL-reduced δ basis, are summarized in the Crypto Procedures 11.2 and 11.3.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 528 — #52

i i

528 Lightweight Introduction to Lattices

Crypto Procedure 11.2: Algorithm for Size Reduction of Basis Vector bk

input b1 , . . . , bm ∈ Rn (lattice basis)
µi, j for 1 ≤ j < i ≤ m (its Gram-Schmidt coefficients)
for j = k − 1, . . . , 1 do
if |µk, j | > 1
2 then
bk ← bk − dµk, j cb j
for i = 1, . . . , k − 1 do
µk,i ← µk,i − dµk, j c · µ j,i
output b1 , . . . , bm ∈ Rn (lattice basis where bk is size-reduced)
µi, j for 1 ≤ j < i ≤ m (its Gram-Schmidt coefficients)

Crypto Procedure 11.3: Algorithm for LLL Reduction

input b1 , . . . , bm ∈ Rn (lattice basis), δ with 0 ≤ δ ≤ 1
Step 1
k ← 2 (k is the stage. When entering stage k, the basis b1 , . . . , bk−1 is
already L 3 -reduced with δ, the Gram-Schmidt coefficients µi, j are calcu-
lated for 1 ≤ j < i < k as well as the normsquares ci = ||b̂i ||22 for
i = 1, . . . , k − 1)
Step 2
while k ≤ m do
for j = 1, . . . , k − 1 do
P j−1
bk ·b j − µ j,i µk,i ci
µk, j ← i =1
cj
Pk−1
ck ← bk · bk − j =1 µk, j c j
Step 3 (size-reduce bk )
for j = k − 1, . . . , 1 do
µ ← dµk, j c
for i = 1, . . . , j − 1 do
µk,i ← µk,i − µµ j,i
µk, j ← µk, j − µ
bk ← bk − µb j
Step 4
if δck−1 > ck + µ2k,k−1 ck−1 then
exchange bk and bk−1
k ← max(k − 1, 2)
k ←k+1

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 529 — #53

i i

11.10 Lattice Basis Reduction 529

Crypto Procedure 11.3 (continued)

output basis b1 , . . . , bm which is LLL-reduced with δ

Gram-Schmidt coefficients µi, j for 1 ≤ j < i ≤ m
normsquares ci = ||b̂i ||22 for i = 1, . . . , m

In practice, replacement of step 4 with the deep insertion rule proposed by

Schnorr and Euchner [15] proved to be more efficient (by still being polynomial in
n, m and input length) for any fixed value t:

Crypto Procedure 11.4: t Deep Insertions

Step 4, alternative
c ← ||bk ||22 , T ←min(t, k − 1), i ← 1
while i < T do
if δci > c then
(b1 , . . . , bk ) ← (b1 , . . . , bi−1 , bk , bi , . . . , bk−1 )
k ← max(i − 1, 2)
goto Step 2
c ← c − µ2k,i ci
i ←i +1
if δck−1 > ck + µ2k,k−1 ck−1 then
exchange bk and bk−1
k ← max(k − 1, 2)
k ←k+1

Furthermore, Schnorr and Euchner [15] invented the notion of blockwise

Korine-Zolotarev-reduced bases with block size β. For a lattice basis b1 , . . . , bm
and β = 2 the notion is equivalent to a LLL-reduced basis and it is equivalent to
the notion of HKZ-reduced bases for β = m.

Definition 11.31 Let β ≥ 2 be an integer and δ ∈ (0, 1] be real. A basis b1 , . . . , bm

of a lattice L ⊆ Rn is called (β, δ )−block reduced, if the following holds for i =
1, . . . , m:
1. |µi, j | ≤ 1
2 for all j < i,
2. δ||b̂i ||2 ≤ λ21 ( L (πi (bi ), . . . , πi (bmin(i +β−1,m ))).

Remark: In the literature, the notion of BKZ-reduced bases is also used for block-
reduced bases.
Although there is no proven polynomial bound for the number of operations
of any algorithm to √ calculate a (β, δ )–block-reduced basis for β > 2 (except for
β = 3 and δ ∈ [ 12 , 12 3); see [16]), the following algorithm proved to be efficient

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 530 — #54

i i

530 Lightweight Introduction to Lattices

in practice for small bock sizes (β ≤ 30). Its core component is the enumeration
algorithm verb!enum(j,k)! which finds an integer, nonzero minimum (u j , . . . , u k )
of the following term:
k
X
c j (ũ j , . . . , ũ k ) := ||π j ( ũ i bi )||22 , (ũ j , . . . , ũ k ) ∈ Zk− j +1
i= j

Before going into the details of enum(j,k) let’s have a look at the block-reduction
algorithm in the Crypto Procedure 11.5. It cyclically iterates over all positions j,
ensures that the basis is size-reduced, and that it is enforced for all j:
δ||b̂ j ||2 ≤ λ21 ( L (π j (b j ), . . . , π j (bmin ( j +β−1,m )))

Crypto Procedure 11.5: Algorithm for (β, δ) Block Reduction

input basis b1 , . . . , bm ∈ Rn of L , β ∈ N, 2 ≤ β ≤ m, δ ∈ R, 0 ≤ δ ≤ 1
Step 1
LLL-reduce b1 , . . . , bβ , j ← m, z ← 0
Step 2
while z < m − 1 do
j ← j +1
if j = m then
j ←1
k ← min( j + β − 1, m )
enum( j, k ) outputs integer coefficients (u j , . . . , u k ) of a lattice vector
bnew = ik= j u i bi and c j := ||π j (bnew
j )|| = λ1 ( L (π j (b j ), . . . , π j (bk )))
2 2
P
j
h ← min(k + 1, m )
if c j < δc j then
extend b1 , . . . , b j−1 , bnew
j to a basis
b1 , . . . , b j−1 , bnew new
j , . . . , bk , bk +1 , . . . , bm of L,
LLL-reduce b1 , . . . , bhnew
z←0
LLL-reduce b1 , . . . , bh
z ← z+1
output (β, δ ) block reduced basis b1 , . . . , bm

The extension of b1 , . . . , b j−1 , bnew

j to a basis b1 , . . . , b j−1 , bnew new
j , . . . , bk ,
bk +1 , . . . , bm of L is done with the algorithm in Crypto Procedure 11.6:

Crypto Procedure 11.6: Algorithm BASIS

input basis b1 , . . . , bm , (u j , . . . , u k )
Step 1
bnew
Pk
j = i= j u i bi

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 531 — #55

i i

11.10 Lattice Basis Reduction 531

Crypto Procedure 11.6 (continued)

Step 2
g ← max{t : j ≤ t ≤ k, u t 6= 0}
Step 3
while |u g | > 1 do
i ← max{t : j ≤ t < g : u t 6= 0}

q ← u g /u i
ui ← u g − q · ui
u g ← u iold
bg ← q · bg + bi
bi ← bgold
Step 4
for i = g, . . . , j + 1 do
bi ← bi−1
Step 5
b j ← bnew
j
output b1 , . . . , bm

By introducing the naming conventions c̃t := ||πt ( ik=t ũ i bi ||2 and ct :=

||b̂t ||2 = ||πt (bt )||2 , we get c̃t = c̃t +1 + (ũ t + ik=t +1 ũ i µi,t )2 ct . For fixed
P
(ũ t +1 , . . . , ũ k ) we can easily enumerate all integers ũ t , lsuch that correspond-
k
ing values of c̃t are nondecreasing, starting with ũ t = − ik=t +1 ũ i µi,t . The
P

(basic) variant of algorithm enum in Crypto Procedure 11.7 traverses the result-
ing search tree in depth-first search order. Other variants (e.g., traversing the tree
in breadth-first search order or incomplete—pruned—traversals) are given in [16].

Crypto Procedure 11.7: Algorithm enum(j,k)

input j, k, ci for i = j, . . . , k and µi,t for j ≤ t < i ≤ k
Step 1
s ← t ← j, c j ← c j , ũ j ← u j ← 1,
v j ← y j ← ∆ j ← 0, δ j ← 1,
for i = j + 1, . . . , k + 1 do
c̃i ← u i ← ũ i ← vi ← yi ← ∆i ← 0, δi ← 1
Step 2
while t ≤ k do
c̃t ← c̃t +1 + ( yt + ũ t )2 ct
if c̃t < c j then
if t > j then
Ps
t ← t − 1, yt ← i =t +1 ũ i µi,t ,

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 532 — #56

i i

532 Lightweight Introduction to Lattices

Crypto Procedure 11.7 (continued)

ũ t ← vt ← d−yt c , ∆t ← 0
if ũ t > −yt then
δt ← −1
δt ← 1
c j ← c̃ j , u i ← ũ i for i = j, . . . , k
t ←t +1
s ←max(s, t )
if t < s then
∆t ← − ∆t
if ∆t δt ≥ 0 then
∆t ← −∆t + δt
ũ t ← vt + ∆t
output (u j , . . . , u k ), c j

Crypto Procedure 11.9: Deyption by Orton

input public and secret key, encrypted message ( y1 , y2 )
1: recombine y (r ) ≡ y j mod q j (j=1,2) using Chinese remainder theorem:
y (r ) ← q2 (( y1 − y2 )q2−1 mod q1 ) + y2
2: y (k−1) ← y (k ) (w (k ) )−1 mod p (k ) for k = r, . . . , 1
(0)
solve in=1 xi ai = y (0) with xi ∈ [0, 2s ) this can easily be done since
P
3:
(0) P 1 (0)
ai > (2s − 1) i− j =1 a j )
output decrypted message (x1 , . . . , xn )

In the following, by using lattice algorithms we show how to reconstruct a

message encrypted by the Orton cryptosystem. We first construct a lattice basis
b1 , . . . , bm +2 ∈ Zm +r +2 s.t. the original message can easily be recovered from any
lattice vector with l∞ -norm 1. The l∞ -norm ||v||∞ of a vector v = (v1 , . . . , vn ) is
defined as the maximal absolute value of its coefficients vi .

||v||∞ = max(|v1 |, . . . , |vn |), v ∈ Rn

We then show how such a lattice vector can be found efficiently.

The decryption problem is stated as follows:
Given the public parameters (r, n, s ), the public key (q1 , prec(k ), ai, j , f ik ), and
the encrypted message ( y1 , y2 ), find the plaintext message (x1 , . . . , xn ); that is,
find integers x1 , . . . , xn ∈ [0, 2s ), xn +k ∈ [0, 2s +k +log2 n +1 ) satisfying the following
equations:

n+
Xr −1
xi ai,1 = y1 mod q1 (11.11)
i =1
n+
Xr −1
xi ai,2 = y2 (11.12)
i =1
$n +k−1 %
(k )
X
x n +k = xi f i or k = 1, . . . , r − 1 (11.13)
i =1

Let’s transform these equations into a set of r + 1integer linear equations with
m 0-1-unknowns, where m := ns + (r − 1)(r /2 + s + log2 n − 1) + rk−
P 1
=1 prec (k ).
(k ) prec(k )
Since f i 2 ∈ [0, 2prec(k ) ) is integral we can write (11.13) as

n+ k−1
xn +k 2prec(k ) =
(k ) prec(k )
X
xi f i 2 − xn +r +k−1 for k = 1, . . . , r − 1, (11.14)
i =1

where the additional variables xn +r +k−1 are integers in [0, 2prec(k ) ).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 535 — #59

i i

11.10 Lattice Basis Reduction 535

With

(k )


 f i 2prec(k ) for i = 1, . . . , n + k − 1
prec(k )

for i = n + k

 −2



ai,k +2 := 0 for i = n + k + 1, . . . , n + r + k − 2


−1 for i = n + r + k − 1





for i = n + r + k, . . . , n + 2r − 2

0


equations in (11.14) simplify to:

n +2
X r −2
xi ai,k +2 = 0 for k = 1, . . . , r − 1 (11.15)
i =1

The unique solution of (11.11), (11.12), (11.15) directly transforms into the
unique solution of (11.11) - (11.13). To get 0 − 1-variables we use the binary
representation of the integer variables.
We set


 s for 1 ≤ i ≤ n

s + i + log2 n − n − 1 for n + 1 ≤ i ≤ n + r − 1

di :=

prec(i − (n + r − 1)) for n + r ≤ i ≤ n + 2r − 2



P 1
and Di := i− j =1 d j .
Let t Di +1 , . . . , t Di +di ∈ {0, 1} be the binary representation of xi ; that is,

i −1
dX
xi = t Di +l +1 2l ,
l =0

and set

A Di +l +1, j := ai, j 2l for i = 1, . . . , n + 2r − 2, j = 1, . . . , r + 1, l = 0, . . . , di − 1,

where ai,1 := ai,2 := 0 for i > n + r − 1.

With y3 := · · · := yr +1 := 0 equations (11.11), (11.12), (11.15) simplify to

m
X
ti Ai,1 = y1 + zq1
i =1
m
X (11.16)
ti Ai, j = y j for j = 2, . . . , r + 1,
i =1

ti ∈ {0, 1}, z ∈ Z

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 536 — #60

i i

536 Lightweight Introduction to Lattices

The row vectors b1 , . . . , bm +2 ∈ Zm +r +2 of the following matrix form the basis

of lattice L:
 
0 2 0 · · · 0 N A1,1 N A1,2 · · · N A1,r +1
 0 0 2 ... 0 N A
 
2,1 N A2,2 · · · N A2,r +1 


 . . .
 .. .. . . . . . ... .. .. .. 
. . . 
(11.17)
 
 .. 

 0 0 . 0 2 N A 2,1 N A 2,2 · · · N A

2,r +1 
..
 
 0 0
 . 0 0 N q1 0 ··· 0


1 1 · · · 1 1 N y1 N y2 ··· N yr +1

For every integer N ≥ 2 we can obtainP the unique solution t1 , . . . , tm of (11.16)

+2
from each vector v = (v0 , . . . , vm +r +1 ) = im=1 ci bi with l∞ -norm 1:
The vector v has the form {±1} m +1 × {0}r +1 , where cm +2 ∈ {±1}, cm +1 ∈ Z
and c1 , . . . , cm ∈ {0, −cm +2 }. The zeros in the last r + 1 coefficients imply

m
X
ci Ai,1 + cm +2 y1 = 0 mod q1
i =1
m
X
ci Ai, j + cm +2 y j = 0 for j = 1, . . . , r + 1.
i =1

With ti := |ci | = (|vi − v0 |)/2 for i = 1, . . . , m we obtain the unique solution of

(11.16) and we directly get the original message from v:

s−1
X
xi := |vs (i−1)+ j +1 − v0 |2 j−1 for i = 1, . . . , n.
j =0

To find a vector with l∞ -norm 1 we modify algorithm enum in order to search for
short vectors in l∞ norm instead of the Euclidean norm ||.||2 . To do that we make
use of Hölder’s inequality [22, p. 347]:

|x · y| ≤ ||x||∞ ||y||1 for all x, y ∈ Rn .

The
Pn expression n||y||1 is defined to be the l1 -norm of y, given by ||y||1 :=
i =1 |yi |, y ∈ R .
For t = m, . . . , 1 we define the following functions wt , ct with integer
arguments ũ t , . . . , ũ m (using the notions of Definition 11.26):

m m
! !
X X
wt := wt (ũ t , . . . , ũ m ) := πt ũ i bi = wt +1 + ũ i µi,t b̂t
i =t i =t
m
!2
X
c̃t := c̃t (ũ t , . . . , ũ m ) := ||wt ||22 = c̃t +1 + ũ i µi,t ||b̂t ||22
i =t

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 537 — #61

i i

11.10 Lattice Basis Reduction 537

Let’s have a look into the algorithm enum described previously. It enumerates
in depth-first search order all nonzero integer vectors (ũ t , . . . , ũ m ) for t = m, . . . , 1
satisfying c̃t (ũ t , . . . , ũ m ) < c1 , where c1 is the current minimum for the function
c̃1 (ũ 1 , . . . , ũ m ). In order to find a shortest lattice vector with respect to the l∞ -norm
we modify this and recursively enumerate all nonzero integer vectors (ũ t , . . . , ũ m )
2
satisfying c̃t (ũ t , . . . , ũ m ) < n · B , where B is the current minimal l∞ -norm of all
lattice vectors w1 enumerated so far. The resulting enumeration area is illustrated
√
in Figure 11.7. We enumerate all vectors wt inside the sphere B with radius n ·
B centered at the origin. We can then stop the enumeration using the following
observations:
Since, for fixed ũ t , . . . , ũ m we can only reach lattice vectors in the hyperplane H
orthogonal to wt , we can prune the enumeration as soon as this hyperplane doesn’t
intersect with the set M of all points with l∞ -norm less or equal B. Using Hölder’s
inequality we get c̃t > B||wt ||1 whenever the intersection is empty. The inequality
can be tested in linear time and restricts the enumeration to the shaded area U ; that
√
is, the union of all balls with radius 12 n B centered in {±B /2}n .
The number of vectors wt to be enumerated and therefore the running time
of the enumeration can roughly be approximated by the volume of the area that
needs to be traversed. As a consequence the running time of the pruned enu-
meration algorithm enum∞ in the Crypto Procedure 11.11 is faster by the factor
volume(U )/ volume( B ). For dimension 2 this factor is exactly π2+2 π and in dimen-
sion n it is approximately ( π2+2 π )n−1 . This means that enum
∞ is faster by a factor
exponential in the dimension of the lattice. For more details see [16].
We are now able to formulate the attack algorithm:

Crypto Procedure 11.10: Algorithm ATTACK-Orton

input public key, encrypted message y1 , y2
1: Build the basis b1 , . . . , bm +2 with N := n 2 according to matrix (11.17)
from before
2: LLL-reduce b1 , . . . , bm +2 with δ = 0.99

Figure 11.7 Pruning based on Hölder’s inequality.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 538 — #62

i i

538 Lightweight Introduction to Lattices

Crypto Procedure 11.10 (continued)

3: Call enum∞ ; we get a vector v with ||v||∞ = 1

P 1
4: xi ← ls−=0 |vs (i−1)+l +1 − v0 |2
l−1 for i = 1, . . . , n

output original message x1 , . . . , xn

Crypto Procedure 11.11: Algorithm enum∞

input b̂i , ci ← ||b̂i ||22 , µi,t for 1 ≤ t ≤ i ≤ m
Step 1
s←t ←1
ũ 1 ← u 1 ← 1
b ← b1
c ← n||b1 ||2∞
B ← ||b1 ||∞
vj ← yj ← ∆j ← 0
δj ← 1
for i = 1, . . . , m + 1 do
c̃i ← u i ← ũ i ← vi ← yi ← ∆i ← 0
ηi ← δi ← 1
wi ← (0, . . . , 0)
Step 2
while t ≤ m do
c̃t ← c̃t +1 + ( yt + ũ t )2 ct
if c̃t < c then
wt ← wt +1 + ( yt + ũ t )b̂t
if t > 1 then
if c̃t ≥ B||wt ||1 then
if ηt = 1 then
increase_t()
ηt ← 1, ∆t ← −∆t
if ∆t δt ≥ 0 then
∆t ← ∆t + δt
ũ t ← vt + ∆t
t ← t − 1, ηt ← ∆t ← 0, yt ← is=t +1 ũ i µi,t , ũ t ← vt ←
P

d−yt c
if ũ t > −yt then
δt ← −1
δt ← 1

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 539 — #63

i i

11.10 Lattice Basis Reduction 539

Crypto Procedure 11.11 (continued)

if ||w1 ||∞ < B then

b ← w1 , c ← n||b||2∞ ,
u i ← ũ i for i = 1, . . . , m
increase_t()
output (u j , . . . , u k ), b

Crypto Procedure 11.12: Subroutine increase_(t)

t ←t +1
s ← max(t, s )
if ηt = 0 then
∆t ← − ∆t
if ∆t δt ≥ 0 then
∆t ← ∆t + δt
∆t ← ∆t + δt
ũ t ← vt + ∆t

With the following modifications of enum∞ we can further improve the running
time of the attack:
Since ||v||22 = m + 1 and ||v||∞ = 1, we initialize c := m + 1.0001, B := 1.0001
and stop the algorithm as soon as we have found v. We also cut the enumeration for
ũ t as soon as there is an index j ∈ [0, m ] with bi, j = 0 for i = 1, . . . , t − 1 and bt, j 6=
0, |wt, j | 6= 1. We don’t miss the solution since w1, j = wt, j 6= ±1 for all choices of
ũ 1 , . . . , ũ t−1 . As the original basis vectors b1 , . . . , bm +1 only depend on the public
key, we can precompute the LLL-reduced basis b10 , . . . , bm 0
+1 of b1 , . . . , bm +1 once
for every public key we want to attack. For all messages which are encrypted with
the same public key we use the precomputed vectors b10 , . . . , bm 0
+1 together with
bm +2 instead of the original basis. More details on the attack including practical
results may be found in [23] and [16].

11.10.2 Factoring
Many public-key cryptosystems are based on the assumption that factoring large
natural numbers is hard. In 1993, C. P. Schnorr [24] proposed to use lattice basis
reduction to factorize natural numbers:
Crypto Procedure 11.13: Factoring
input N (a natural number with at least two prime factors),
α, c ∈ Q with α, c > 1
Step 1
calculate the list p1 , . . . , pt of the first t primes, pt = (ln N )α

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 540 — #64

i i

540 Lightweight Introduction to Lattices

Crypto Procedure 11.13 (continued)

Step 2
Use lattice basis reduction in order to find m ≥ t + 2 pairs (u i , vi ) ∈ N2
with
Qt a ,j
ui = j =1 p ji with ai, j ∈ N
and
|u i − vi N | can be factorized over prime factors p1 , . . . , pt
Step 3
Factorize u i − vi N over primes p1 , . . . , pt and p0 = −1.
b
Let u i − vi N = tj =0 p j i, j , bi = (bi,0 , . . . , bi,t ) and ai = (ai,0 , . . . , ai,t ) with
Q

ai,0 = 0
Step 4

Find a 0-1-solution (c1 , . . . , cm ) 6= (0, . . . , 0) of equation

m
X
ci (ai + bi ) = 0 (mod 2)
i =1
Step 5
t Pm
Y c (a +b )/2
x← p j i =1 i i, j i, j (mod N )
j =0
t Pm t Pm
i =1 ci bi, j i =1 ci ai, j
Y Y
y← pj (mod N ) = pj (mod N )
j =0 j =0

(this construction implies x 2 = y 2 (mod N ))

Step 6
If x 6= ±y (mod N ), then output gcd(x + y, N ) and stop, else goto step 4
and find another solution (c1 , . . . , cm )

In [25], enumeration of short lattice vectors in l1 -norm (similar to ENUM∞ ) is

used to find the solutions more efficiently. However, those algorithms are still far
away from being efficient for large numbers.

11.10.3 Usage of Lattice Algorithms in Post-Quantum Cryptography and

New Developments (Eurocrypt 2019)
As it is hard to find the shortest vector in a high-dimensional lattice (in cases
when no special structures exist, like those found in the Chor-Rivest and Orton
cryptosystem), several cryptosystems based on the shortest vector problem are
proposed.
The basic idea for constructing lattice-based public-key encryption schemes is
to use a well-formed high-dimensional lattice basis B as secret key and a scrambled
version P of B as public key. For encryption, the sender of a message m maps the

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 541 — #65

i i

11.11 PQC Standardization 541

message to a point m in the lattice, by using the public basis P, and then adds a
random error to m; such that the resulting point c is still closer to m than to any
other point in the lattice. Then, c is sent to the receiver who can use the well-formed
basis B in order to find m efficiently and obtain the original message.
The security of the scheme is based on the assumption that an attacker who
is not in the possession of the well-formed basis B needs to spend an infeasi-
ble amount of computational time in order to decipher the message, even with
an aid of quantum computers. However, the security of lattice-based schemes
against quantum-computer attacks is not yet well-understood. For example at Euro-
crypt 2019, several aspects of post-quantum cryptography based on lattices were
discussed:

• A. Pellet-Mary, G. Hanrot, and D. Stehlé [26] describe an algorithm to solve

the approximate shortest vector problem for lattices corresponding to ide-
als of integers of an arbitrary number field K . The running time is still
exponential in the input size, but improved compared to previous results.
• C. Ba�etu, F. B. Durak, L. Huguenin-Dumittan, A. Talayhan, and S. Vau-
denay [27] describe misuse attacks on several post-quantum cryptosystems
submitted to the National Institute of Standards and Technology (NIST),
including several lattice-based schemes.
• M.R. Albrecht, L. Ducas, G. Herold, E. Kirshanova, E.W. Postlethwaite,
and M. Stevens [28] propose a sieve method in order to find a shortest lat-
tice vector, or a lattice vector nearest to a given (nonlattice) vector as an
alternative to the enumeration algorithms described in this chapter. It would
be interesting to check the performance of the enumeration algorithms on
modern computers, rather than the implementations on machines as of the
late nineties; see [16].

11.11 PQC Standardization

In 2016, NIST launched a competition to identify and eventually standardize

suitable alternative methods for the current generation of crypto methods (like
RSA or ECDSA). This next generation of cryptographic algorithms is called post-
quantum cryptography (PQC). Overall, 82 proposals were submitted. In July
2022, as a result of the third round, NIST announced which methods it wants
to standardize [29]:

• For public-key encryption and key-exchange: CRYSTALS-Kyber;

• For digital signatures: CRYSTALS-Dilithium, Falcon, SPHINCS+.

There, NIST recommends using CRYSTALS-Kyber as the encryption algo-

rithm and CRYSTALS-Dilithium for signatures for most use cases. Both methods
belong to the group of lattice-based algorithms. Falcon has shorter signatures than
CRYSTALS-Dilithium; SPHINCS+ is hash-based.
A very good overview about the post-quantum cryptography standardization
organized by NIST can be found in [30, 31].

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 542 — #66

i i

542 Lightweight Introduction to Lattices

11.12 Screenshots and Related Plugins in the CrypTool

Programs

Sections 11.12.1 to 11.12.3 contain screenshots from CrypTool 1 (CT1), CrypTool

2 (CT2), and JavaCrypTool (JCT). These show both plugins dealing with lattices
in a didactical manner and plugins with attacks (like the attacks implemented in
CT1 in Section 5.12.2). All of these CrypTool programs continue to be main-
tained; the vast majority of the further software development takes place in CT2
and CTO.
All functions in all CrypTool programs are listed at https://www.cryptool
.org/en/documentation/functionvolume. Specifying a category or a filter string
or unboxing one of the four programs allows one to search for a special function.
Figure 11.8 shows the result of the search, when the selection was restricted to
the two programs CT1 and CT2, and the filter string lattice was set.

Figure 11.11 CT1 dialog: Factoring N when the private exponent/key is too small (Bloemer/May, 2001).

– Find closest vector (nice visualization in two-dim).

• Lattice-based attacks against:
– Merkle-Hellman knapsack;
– RSA (Coppersmith attack).
• Lattice-based cryptography:
– GGH (Goldreich-Goldwasser-Halevi);
– LWE (learning with errors).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 546 — #70

i i

546 Lightweight Introduction to Lattices

Figure 11.12 CT2 tutorial Lattice-based cryptography: SVP via Gauss.

Figure 11.13 CT2 tutorial Lattice-based cryptography: SVP via LLL algorithm.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:45 — page 547 — #71

i i

11.12 Screenshots and Related Plugins in the CrypTool Programs 547

Figure 11.14 CT2 tutorial Lattice-based cryptography: CVP, Find closest vector.

11.12.3 Plugin in JCrypTool (JCT)

JCT contains a visualization of the Merkle-Hellman knapsack cryptosystem.3 This
plugin is just a didactical visualization showing all the necessary steps for private
keys with maximum 20 elements. The Merkle-Hellman knapsack cryptosystem is
vulnerable to Shamir’s lattice reduction attack [32].

“Esslinger” — 2023/11/30 — 19:45 — page 554 — #78

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 555 — #1

i i

C H A P T E R 12
C H A P T E R 12
Solving Discrete Logarithms and
Factoring

Recent algorithmic developments for solving discrete logarithms in finite fields of

small characteristics led to some uncertainty among cryptographic users fueled by
the media about the impact for the security of currently deployed cryptographic
schemes (see for instance the discussion in [1] using the catchword “cryptoca-
lypse”).
This chapter provides a broader picture about the currently best algorithms
for computing discrete logarithms in various groups and about the status of the
factorization problem.
The subject requires a very mathematical presentation—more mathematical
than most of the other chapters in this book. We try to compensate this by working
out the ideas as well.
Our goal is to clarify what currently can be done algorithmically and what
cannot be done without further major breakthroughs. In particular, we currently
do not see a way how to extend the current algorithmic progress for finite fields
of small characteristic to either the case of large characteristic finite fields or to the
integer factorization problem. This means that there is no danger for the methods
currently used (RSA, DHKE, DS, ECC) if the parameters are correctly selected and
there are no breakthroughs either in algorithms or in quantum computers.
The recommendations also consider the possibility of embedding trapdoors in
cryptographic schemes by governmental organizations.

12.1 Generic Algorithms for the Discrete Logarithm Problem

in Any Group

The hardiness of the discrete logarithm problem depends on the group over which
it is defined. In this chapter we review cryptanalytical algorithms that work for any
group. From a cryptographic point of view it is desirable to identify groups for
which one is unable to find better algorithms. One candidate for these groups are
elliptic curve groups.
In this chapter, we describe general cryptanalytical algorithms that apply for
any finite abelian group. That means, any group used in cryptography (e.g., multi-
plicative groups of finite fields or of elliptic curves) are susceptible to this kind of
algorithm. We will see that we can always compute a discrete logarithm in a group
√
of order n in O( n ) steps by Pollard’s rho method. This in turn means that for
achieving a security level of 2k one has to choose a group of order at least 22k . For
example, for achieving a security level of 80 bit, one has to choose a group of order

555

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 556 — #2

i i

556 Solving Discrete Logarithms and Factoring

at least 160 bit. This explains why in practice we usually take elliptic curve groups
with at least 160 bit order.
Moreover, let G be a group of order n and let n = p1e1 · . . . · p`e` be the prime
factorization of n. Then we will see that discrete logarithms in G can be computed
√ √
in time O(e1 p1 + . . . + e` p` ). Notice that this bound is equal to Pollard’s bound
√
O ( n ) if and only if n is a prime. Otherwise, the complexity of computing the
discrete logarithm is mainly determined by the size of the largest prime divisor of
its group order. This explains why, for example, Schnorr/DSA signatures are imple-
mented in groups which contain by construction a prime factor of size at least 160
bit. This also explains why usually elliptic curve groups have prime order or order
containing only a very small smooth cofactor.

12.1.1 Pollard Rho Method

Let G be a finite abelian group. Let g be a generator of some large subgroup G 0 =
{g, g 2 , . . . , g n } ⊆ G (e.g., g could generate G itself). Let y = g x . Then the discrete
logarithm problem is to find on input g and y the output x mod n. We write x =
dlogg ( y ).
Pollard’s rho method tries to generate elements g ai y bi ∈ G 0 with ai , bi ∈ N
in a pseudorandom but deterministic fashion. Let us assume for simplicity that we
generate random elements from the n elements in G 0 . Then by the birthday paradox,
√
we expect to find after only O( n ) steps two elements which are identical. In our
case, this means that
g ai y bi = g a j y b j .
ai −a j

This can be rewritten as g b j −bi = y. This in turn implies that we can recover our
a −a
discrete logarithm as x ≡ bij −bji mod n.
Hence, with Pollard’s rho method one can compute discrete logarithms in any
√
finite abelian group of order n in O( n ) steps. By using so-called cycle-finding
techniques, one can also show that Pollard’s rho method can be implemented within
constant space.
Moreover, it is also possible to improve the efficiency of square root algorithms
when multiple discrete logarithms in the same group are desired: When computing
√
√
L distinct logarithms, one can reduce the global cost from O( L n ) to O( Ln ) [2].

12.1.2 Silver-Pohlig-Hellman Algorithm

As before let y = g x for a generator g of order n. We have to compute the discrete
logarithm x mod n. Moreover, let n = p1e1 · . . . · p`e` be the prime factorization of n.
Then by the Chinese remainder theorem x mod n is uniquely defined by the system
of congruences
x ≡ x1 mod p1e1
.. (12.1)
.
x ≡ x` mod p`e` .

The algorithm of Silver-Pohlig-Hellman computes all discrete logarithms

√
xi mod pi in the subgroups of order pi in O( pi ) steps by using Pollard’s rho

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 557 — #3

i i

12.1 Generic Algorithms for the Discrete Logarithm Problem in Any Group 557

method. Then it is quite easy to find a logarithm modulo the prime power
xi mod piei by a Hensel lifting process that performs ei calls to the discrete log-
arithm procedure modulo pi . In a Hensel lifting process, we start by a solution
xi mod pi , and then consecutively compute xi mod pi2 , xi mod pi3 , and so on until
xi mod piei (see [3] for Hensel’s formula).
Finally, one computes the desired discrete logarithm x mod n from the system
of (12.1) by Chinese remaindering. In total, the running time is mainly determined
by computing xi mod pi for the largest prime factor pi . That is, the running time
√
is roughly O(maxi { pi }).

12.1.3 How to Measure Running Times

Throughout this chapter, we want to measure the running time of analysis algo-
rithms for discrete logarithms as a function of the bit-size of n. Note that any integer
n can be written with (roughly) log n bit, where log is to base 2. Thus, the bit-size
of n is log n.
For expressing our running times we use the notation L n [b, c] =
exp (ln n ) (ln ln n )
b 1−b
c· for constants b ∈ [0, 1] and c > 0. Notice that L n [1, c] =
ec·ln n = n c is a function that is for constant c a polynomial in n. Therefore, we
say that L n [1, c] is polynomial in n. Also notice that L n [1, c] = n c = (2c )log2 n is
a function that is exponential in log n. Therefore, we say that L n [1, c] is exponen-
tial in the bit-size log n of n. So our Pollard’s rho algorithm achieves exponential
running time L [1, 12 ].
On the other end, L n [0, c] = ec·ln ln n = (ln n )c is polynomial in the bit-size of
n. Notice that the first parameter b is more important for the running time than
the second parameter c, since b interpolates between polynomial and exponential
running time. We shortly denote L n [b] if we do not want to specify the constant c.
Some of the most important algorithms that we discuss in the subsequent
sections achieve a running time of L n [ 12 + o(1)] or L n [ 13 + o(1)] (where the o(1)-part
vanishes for n → ∞), which is a function that grows faster than any polynomial but
slower than exponential. For cryptographic schemes, such attacks are completely
acceptable, since the desired security level can be easily achieved by a moderate
adjustment of the key sizes.
However, the recent algorithm of Joux et al. for computing discrete logarithms
in finite fields of small characteristic achieves a running time of L n [o(1)], where
o(1) converges to 0 for n → ∞. This means that these algorithms are quasi poly-
nomial time, and the underlying fields are no longer acceptable for cryptographic
applications. A finite field F pn has small characteristic if p is small, that is, the base
field F p is small and its extension degree n is usually large. In the recent algorithms
we need a small p, since the algorithms enumerate over all p elements in the base
field F p .

12.1.4 Insecurity in the Presence of Quantum Computers

In 1995, Shor published an algorithm for computing discrete logarithms and fac-
torizations on a quantum computer. He showed that computing discrete logarithms
in any group of order n can be done in polynomial time which is almost O(log n 2 ).

i i

When we write this equality to the base g, we obtain a relation

k
X
xi ≡ ei j dlogg ( f j ) mod n,
j =1

where n is the order of g. A relation is a linear equation in the k unknowns

dlogg ( f 1 ), . . . , dlogg ( f k ).

Once we have k linear independent relations of this type, we can compute these
unknowns by linear algebra. This means we actually first compute all discrete
logarithms of the factor base elements before we compute our desired individual
logarithm of y.
Qk e
Dlog computation: Express ygr = g x +r = j =1 f j j in the factor base for some
integer r . This gives us another relation

k
X
x +r ≡ e j dlogg ( f j ) mod n,
j =1

which can be easily solved in the only unknown x = dlogg y.

Let us provide an easy example for an index calculus algorithm that computes
x = dlog2 (5) in F∗11 . Since 2 generates the multiplicative group F∗11 , the order of 2
is 10.

Factor base: Define F = {−1, 2}.

Relation finding: 21 = (−1)0 21 gives us a first trivial relation

1 ≡ 0 · dlog2 (−1) + 1 · dlog2 (2) mod 10.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 560 — #6

i i

560 Solving Discrete Logarithms and Factoring

If we compute 26 = 64 ≡ −2 mod 11 we obtain a second relation

6 ≡ 1 · dlog2 (−1) + 1 · dlog2 (2) mod 10.

Therefore, we can solve the system of linear equations

! ! !
0 1 dlog2 (−1) 1
· ≡ mod 10.
1 1 dlog2 (2) 6

We obtain as the unique solution dlog2 (−1) ≡ 5 and dlog2 (2) ≡ 1.

Dlog computation: Since 5 · 21 = 10 ≡ −1 mod 11 we obtain that

x + 1 ≡ 1 · dlog(−1) + 0 · dlog(2) mod 10.

This leads to the solution x ≡ 4 mod 10.

Runtime: Choosing a large factor base makes it easier to find relations, since it
increases the likelihood that a certain number splits in the factor base. On the other
hand, for a large factor base we have to find more relations in order to compute
the dlogs of all factor base elements. An optimization of this tradeoff leads to a
running time of L p [ 12 ] for the relation finding step and also L p [ 12 ] for performing
the individual discrete logarithm computation in step 3.
Let us briefly discuss the advantages and disadvantages of the simple index
calculus algorithm from a cryptanalyst’s point of view.

Advantages:
e
For g xi = kj =1 f j i j it is trivial to compute the discrete logarithm on the
Q
•

left-hand size.
Disadvantages:
• We need to factor relatively large numbers g xi over the integers. One can
show that this intrinsically leads to a running time of L p [ 12 ], and there is no
hope to get below the constant 12 .
• We need to compute all discrete logarithms of the factor base elements. This
is inherent to all index calculus algorithms.
We will eliminate the first disadvantage by allowing factorizations over number
fields. The second disadvantage is eliminated by choosing a factor base with very
efficient discrete logarithm computations of its elements.

12.2.2 The Number Field Sieve for Calculating the Dlog

A number field Q[α ] is a k-dimensional vector space over Q and can be obtained
by adjoining a root α of some irreducible degree-k polynomial f to Q. This means
we can write every element of Q[α ] as a0 + a1 α + . . . ak−1 α k−1 with ai ∈ Q. If we
restrict the ai to integers we are in the ring Z[α ].

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 561 — #7

i i

12.2 Best Algorithms for Prime Fields F p 561

Remark: When calculating the dlog there is only the term number field sieve and
no distinction between general versus special. This is in the opposite to the number
field sieve for factorization in Section 12.4.1.
The number field sieve is also an index calculus algorithm. Compared to the
previous approach it has the advantage to involve smaller numbers. This is done by
choosing a specific representation of the prime field F p , which is implicitly defined
as a finite field where two polynomials of small degree with small coefficients pos-
sess a common root. There are several methods that allow one to construct such
polynomials with a common root modulo p. In particular, for primes of a spe-
cial form (i.e., with a sparse representation), it is possible to construct polynomials
which are much better than in the general case. One typicalPconstruction that works
well is to choose a number m andP write p in basis m as it =0 ai m i . We then find
that f 1 ( X ) = X − m and f 2 ( X ) = it =0 ai m i have m as a common root modulo p.
Equipped with two polynomials f 1 and f 2 of this form, with m as their common
root modulo p, we obtain the commutative diagram in Figure 12.1:
Let r1 , r2 be roots of f 1 , f 2 , respectively. Then we are working with the number
fields Q[r1 ] ' Q[ X ]/( f 1 ( X )) and Q[r2 ] ' Q[ X ]/( f 2 ( X )).

Factor base: Consists of small-norm prime elements in both number fields.

Relation finding: The basic principle of the number field sieve consists of sending
elements of the form a + bX to both sides of the diagram and to write a relation
when both sides factor into the factor base. Technically, this is quite challenging,
because we need to introduce several tools to account for the fact that the left and
right sides are not necessarily unique factorization domains. As a consequence, we
need to factor elements into ideals and take care of the obstructions that arise from
the class groups and unit groups. This procedure gives us the discrete logarithms of
the factor base elements.

Discrete log computation: Express the desired logarithm as a linear combination of

the factor base elements.

Runtime: The number field sieve is the most efficient currently known algorithm
for the large characteristic discrete logarithm problem. In the general case, which
means that p is not of a special form (e.g., close to a prime power), its complexity
1/3
is L p [ 13 , 64
9 ].

References and further reading: For an introduction to index calculus and

the involved mathematical tools see May’s lecture notes on number theory [3]

Figure 12.1 NFS in F p with two polynomials and common roots, shown as a commutative diagram.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 562 — #8

i i

562 Solving Discrete Logarithms and Factoring

(in German) and the number theory book by Müller-Stach, Piontkowski [15]. For
gaining a deep understanding of the number field sieve, one has to study the book of
Lenstra and Lenstra [16] that contains all original works that led to the development
of the number field sieve algorithm in the late 1980s and early 1990s.

As a good start for understanding the number field sieve, we recommend to

first study its predecessors that are described in the original works of Adleman [17],
Coppersmith [18], and Pomerance [19, 20].

12.3 Best Known Algorithms for Extension Fields F pn and

Recent Advances
The groups over extension fields are attacked by the new algorithms of Joux et al.
Before the invention of these attacks, the security of extension field groups appeared
to be similar to the prime order groups from the last chapter. The new attacks
render these groups completely insecure. However, the new attacks do not affect
the security of prime order groups.
First, we will discuss the former best algorithm from 2006 (due to Joux and
Lercier) that achieves a running time of L n [ 13 ]. We will then describe the recent
developments that led to the dramatic improvement in the running time down to
L n [o(1)], which is quasi-polynomial time.

12.3.1 The Joux-Lercier Function Field Sieve

Any finite field F pn can be represented by a polynomial ring F p [x ]/ f (x ), where
f (x ) is an irreducible polynomial over F p with degree n. Thus, any element in F pn
can be represented by a univariate polynomial with coefficients in F p of degree less
than n. An addition of two elements is the usual addition of polynomials, where
the coefficients are reduced modulo p. Multiplication of two elements is the usual
multiplication of polynomials, where the result is reduced modulo f (x ) in order to
again achieve a polynomial of degree less than n.
It is important to notice that the description length of an element is n O(log p ).
Thus, a polynomial time algorithm achieves a running time which is polynomial
in n and log p. We will also consider fields of small characteristic p, where p is
constant. Then polynomial running time means polynomial in n.
It is known that for any p there are always polynomials f (x ) of degree n that
are irreducible over F p . Usually, there are many of these polynomials, which in turn
means that we obtain different representations of a finite field when choosing dif-
ferent polynomials f (x ). However, it is also known that all of these representations
are isomorphic, and the isomorphisms are efficiently computable.
This fact is used in the algorithm of Joux and Lercier, who exploit different
representations F p [x ]/ f (x ) and F p [ y ]/g ( y ) of the same field. This is illustrated in
the commutative diagram in Figure 12.2.

Factor base: We choose all degree-1 polynomials x − a and y − b from F p [x ] ∪ F p [ y ].

Thus, the factor base has size 2 p.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 563 — #9

i i

12.3 Best Known Algorithms for Extension Fields F pn and Recent Advances 563

Figure 12.2 Isomorphic representations in F pn as a commutative diagram.

Relation finding: On both sides, that is for polynomials h from F p [x ]/ f (x ) and

from F p [ y ]/g ( y ), we try to factor into the linear factors from the factor base. This
can be done by an easy gcd computation gcd(h, x p − x ) in time O( p ) for each
polynomial. It can be shown that the number of polynomials that have to be tested
is bounded by L pn [ 13 ].

Discrete log computation: This step is done by writing a polynomial as a linear

combination of polynomials of smaller degree and by repeating recursively, until
degree-1 is found. This recursion is called a (degree) decent and requires running
time L pn [ 13 ], just like the relation finding step.

12.3.2 Recent Improvements for the Function Field Sieve

The first recent improvement upon the Joux-Lercier FFS was presented at Euro-
crypt 2013 by Joux, who showed that it is possible to drastically lower the
complexity of finding relations by replacing the classical sieving approach with a
new technique based on a linear change of variables called pinpointing.
At the Crypto Conference 2013, Göloglu, Granger, McGuire, and Zumbrägel
presented another approach, related to pinpointing that works very efficiently
within a characteristic-2 subfield. Their paper was considered so important by the
cryptographic community that they received the best paper award.
The new results hold for finite fields Fq n of characteristic two; that is, q = 2` .
Notice that we use the standard convention that denotes primes by p and prime
powers by q = p ` . For these fields Fq n the relation finding step in the Joux-Lercier
algorithm simplifies, since one can construct polynomials that split with a higher
probability than generic polynomials of the same degree.
Let us give a high-level description of the ideas of their improvement.

Factor base: All degree-1 polynomials as in the Joux-Lercier algorithm.

Relation finding: Göloglu, Granger, McGuire, and Zumbrägel show that one can
construct a special type of polynomials over Fq [x ] (the so-called Bluher polynomi-
als) that by construction split over Fq [x ]. So similar to our simple version of index
calculus for integers in Section 12.2.1, we obtain one side of the equation for free.
The cost for splitting the polynomials in Fq [ y ] is roughly O(q ) and the cost for find-
ing the discrete logarithms of the factor base elements is roughly O(n · q 2 ). We will
explain why this gives us the discrete logarithms of the factor base in polynomial
time for properly chosen parameters.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 564 — #10

i i

564 Solving Discrete Logarithms and Factoring

Discrete log computation: The individual discrete logarithm computation is similar

to the Joux-Lercier algorithm.

Runtime: We are computing in a field Fq n , where q = 2` . Hence, a polynomial time

algorithm would require running time polynomial in the parameters n and log q.
However, the relation finding above takes time O(n · q 2 ), which is polynomial in
n but exponential in log q. So actually the algorithm performs very poorly with
respect to the size of the base field Fq = F2` .

The trick to work around this is to decrease the size of the base q to q 0 while
slightly increasing the extension degree n to n 0 . Our goal is that the new base field
size q 0 roughly equals the new extension degree n 0 ; that is q 0 ≈ n 0 . In this case, we
again obtain a running time which is polynomial in n 0 and q 0 , but now q 0 is also
polynomially bounded by n 0 . So, in total, for step 2 our running time is polynomially
bounded by n 0 .
Let us give a simple example of how this can be done for concrete parameters.
Assume that we wish to compute a discrete logarithm in F(2100 )100 . Then we would
lower the base field to q 0 = 210 and at the same time increase the extension degree
to n 0 = 1000; that is, compute in F(210 )1000 . Notice that this can always be done
by using the efficiently computable isomorphisms between finite fields of the same
cardinality.
Warning: One might be tempted to bypass the above with the selection of
exponents that do not split appropriately; that is, by choosing F2 p with prime
p. However, we can always embed our finite field in some larger field—as well
as the respective discrete logarithms. Hence, finite fields with small characteristic
have to be considered insecure, independently of the special form of the extension
degree n.
While the relation finding in step 2 of Göloglu, Granger, McGuire, and Zum-
brägel can be done in polynomial time, the individual log computation is still
time-consuming. If one does it naively, step 3 is even more time-consuming than
in Joux-Lercier because of the increased extension degree n 0 . If one balances out
the running times of step 2 and 3, one ends up with an improved overall running
1
time of L q n [ 13 , ( 49 ) 3 ].

12.3.3 Quasi-Polynomial Dlog Computation of Joux et al.

In the previous section, it was shown that the discrete logarithms of all elements
of a factor base can be computed in polynomial time. However, it remained a hard
problem to use that fact for computing individual logarithms.
This problem has been recently solved by Joux [21] and Barbulesu, Gaudry,
Joux, and Thomé [22]. In the paper of Joux, it was shown that the individual loga-
rithm step can be performed in L [ 14 ]. Shortly after, this was improved by Barbulescu,
Gaudry, Joux, and Thomé to L [o(1)], which is a function that grows slower than
L [ ] for any > 0. So they achieve quasi-polynomial time.
Let us briefly describe the modifications of these two papers to the Function
Field Sieve (FFS) algorithm.

Factor base: Consists of degree-1 polynomials as before.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 565 — #11

i i

12.3 Best Known Algorithms for Extension Fields F pn and Recent Advances 565

Relation finding: One starts with the trivial initial polynomial

Y
h (x ) = x q − x = (x − α )
α∈Fq

that obviously factors in the factor base. Now, one applies linear and rational trans-
formations (called homographies) to h (x ), which preserve its property to split over
the factor base. One can show that there are sufficiently many independent homo-
graphies in order to construct sufficiently many relations. So out of one trivial
polynomial h (x ), we obtain for free all O(q ) relations. This enables us to compute
the discrete logarithms of the factor base elements in time O(q ).

Discrete log computation: Barbulescu et al. present an efficient degree decent algo-
rithm that on input of a polynomial p (x ) of degree n outputs a linear relation
between the discrete log of p (x ) and O(nq 2 ) polynomials of degree n2 in time poly-
nomial in q and D. This implies that we get a tree of polynomials, where the degree
drops in every level by a factor of two, which in turns implies a tree depth of log n.
This results in a running time of O(q O(log n ) ).

Runtime: As in Section 12.3.2 let us assume that the size q of the base field is of
the same size as the extension degree n; that is, q = O(n ). Then step 2 runs in
time O(q ) = O(n ), which is polynomial in n. Step 3 runs in time O(q O(log n ) ) =
2
O(n O(log n ) ) = L q n [o(1)]. Notice that n log n = 2log n grows faster than any poly-
c
nomial function in n but slower than any subexponential function 2n for some
c > 0.

12.3.4 Conclusions for Finite Fields of Small Characteristic

To give some examples what the theoretical quasi-polynomial run time of the pre-
vious results implies in practice, we illustrate in Table 12.1 what can currently be
achieved in computing discrete logarithms.
Recommendation: The use of small characteristic fields for discrete log-based is
completely insecure, no matter which key sizes are used. Fortunately, we are
not aware of such a usage in actual applications in wide-spread/standardized
cryptographic schemes.

Table 12.1 Small Characteristic Records

Date Field Bitsize Cost (CPU hours) Algorithm
2012/06/17 36·97 923 895 000 [23]
2012/12/24 p 47 1175 32 000 [24]
2013/01/06 p 57 1425 32 000 [24]
2013/02/11 21778 1778 220 [21]
2013/02/19 21778 1991 2200 [25]
2013/03/22 24080 4080 14 100 [21]
2013/04/11 26120 6120 750 [21]
2013/05/21 26168 6168 550 [21]

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 566 — #12

i i

566 Solving Discrete Logarithms and Factoring

568 Solving Discrete Logarithms and Factoring

m modulo N . Usually this is done by simply defining g ( X ) = X − m mod n and

constructing some low degree polynomial f ( X ) with f (m ) ≡ 0 mod n (e.g., by
expanding n in base m as in Section 12.2.2).
Since f and g are different, they define different rings Z[ X ]/ f ( X ) and
Z[ X ]/g ( X ). But since f and g share the same root m modulo n, both rings are iso-
morphic to Z/nZ; and this isomorphism can be explicitly computed by the mapping
X 7→ m. This is illustrated in the commutative diagram in Figure 12.3.

Factor base: Consists of small-norm prime elements in both number fields.

Relation finding: We look for arguments x̃ such that simultaneously π f := f (x̃ )

splits in Q[ X ]/( f ( X )) and πg := g (x̃ ) splits in Q[ X ]/(g ( X )) into the factor base.
Such elements are called relations.

Linear Algebra: By linear algebra, we search for a product of the elements π f which
is a square and whose corresponding product of the πg is also a square. If we
send these elements via our homomorphism X 7→ m to Z/nZ, we obtain elements
x 2 , y 2 ∈ Z/nZ such that x 2 ≡ y 2 mod n. If we first compute the square roots of π f
and πg in their respective number fields before applying the homomorphism, we
obtain x, y ∈ Z/nZ with x 2 ≡ y 2 mod N , as desired. The independence of x, y here
stems from the different representations in both number fields.

Runtime: The above algorithm is up to some details (e.g., the square root compu-
tation in the number field) identical to the algorithm of Section 12.2.2 and shares
1/3
the same running time L [ 13 , 64
9 ].

12.4.2 Relation to the Index Calculus Algorithm for Dlogs in F p

Firstly, we know that computing discrete logarithms in composite order groups
Z/nZ is at least as hard as factoring n = pq. This in turn means that any algorithm
that computes discrete logarithms in Z/nZ computes the factorization of n:

Dlogs in Z/nZ ⇒ Factoring n.

Let us briefly give the idea of this relation. We compute the order k = ord(a ) for an
arbitrary a ∈ Z/nZ by our dlog algorithm; that is, we compute the smallest positive
k
integer k such that a k ≡ 1 mod n. If k is even, then a 2 6≡ 1 is a square root of 1. We
k
have a 2 6≡ −1 with probability at least 12 , since 1 has 4 square roots modulo n. Set

Figure 12.3 Isomorphic to Z/nZ as a commutative diagram.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 569 — #15

570 Solving Discrete Logarithms and Factoring

factor a number n 1 , then we extrapolate the running time for some n 2 > n 1 by the
formula
1/3
L n 1 [ 13 , 64 ]
T· 9
1/3
.
L n 2 [ 13 , 64

9 ]

So, we use the L-formula to estimate the relative factor that we have to spend
in addition. Notice that this (slightly) overestimates the security, since the L-
formula is asymptotic and thus becomes more accurate in the numerator than in the
denominator—the denominator should include a larger error term. So, in practice,
one obtains (only slightly) less security than predicted by this formula.
We computed the formula for several choices of the bit-size of an RSA number
n, respectively a dlog prime p, in Table 12.2. Recall from Section 12.4.1 that the
running time of the number field sieve algorithm for factoring is indeed a function
of n and not of the prime factors of n.
We start with RSA-768 that has been successfully factored in 2009 [27]. In
order to count the number of instructions for factoring RSA-768, one has to define
what an instruction unit is. It is good practice in cryptography to define as a unit
measure the time to evaluate DES in order to obtain comparability of security levels
between secret and public key primitives. Then by definition of this unit measure,
DES offers 56-bit security against brute-force key attacks.
In terms of this unit measure, the factorization of RSA-768 required T = 267
instructions. From this starting point, we extrapolated the security level for larger
bit-sizes in Table 12.2.
We successively increase the bit-size by 128 up to 2048 bit. We see that in the
beginning, this leads to roughly an increase of security of 5 bit per 128-bit step,
whereas in the end we only have an increase of roughly 3 bit per 128-bit step.
By Moore’s law the speed of computers doubles every 1.5 years. Hence after
5 · 1.5 = 7.5 years we have an increase of 25 , which means that currently we
should roughly increase our bit-size by 128 bit every 7.5 years; and when we come
closer to 2000 bit our increase of 128-bit steps should be in intervals of no later
than 4.5 years. For more conservative choices that also anticipate some algorithmic
progress rather than just an increase in computers’ speed see the recommendations
in Section 12.7.

Table 12.2 Bitsize of n,

p Versus Security Level
Bitsize Security
768 67.0
896 72.4
1024 77.3
1152 81.8
1280 86.1
1408 90.1
1536 93.9
1664 97.5
1792 100.9
1920 104.2
2048 107.4

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 571 — #17

i i

12.5 Best Known Algorithms for Elliptic Curves E 571

References and further reading: An introduction to several factorization algorithms

including the quadratic sieve (the predecessor of the number field sieve) can be
found in May’s lecture notes on number theory [3]. We recommend Blömer’s lecture
notes on algorithmic number theory [28] as an introduction to the number field
sieve. The development of the number field sieve is described in the textbook of
Lenstra and Lenstra [16] that includes all original papers. The relation of discrete
logarithms and factoring has been discussed by Bach [29]. Details of the current
factorization record for RSA-768 can be found in [27].

12.5 Best Known Algorithms for Elliptic Curves E

Elliptic curves are the second standard group for the discrete logarithm problem.
The new attacks do not affect these groups; their security remains unchanged.
We would like to discuss elliptic curves E [ p n ] over finite extension fields F pn
and elliptic curves E [ p ] over prime fields F p . The latter are usually used for cryp-
tographic purposes. The reason to discuss the former too is to illustrate (similar to
the previous sections) the vulnerabilities of extension fields F pn as opposed to prime
field F p . However, we would like to point out that we assume in the following (in
contrast to the previous section) that n is fixed. This is because as opposed to the
algorithm of Joux et al., the algorithms for E [ p n ] have complexities that depend
exponentially on n.
We present two different approaches for elliptic curves over extension fields:
cover (or Weil descent) attacks introduced by Gaudry, Hess, and Smart (GHS), and
decomposition attacks proposed by Semaev and Gaudry. In some cases, it is possible
to combine the two approaches into an even more efficient algorithm as shown by
Joux and Vitse [30].

12.5.1 The GHS Approach for Elliptic Curves E[ p n ]

This approach introduced by Gaudry, Hess, and Smart aims at transporting the
discrete logarithm problem from an elliptic curve E defined over an extension field
F pn to a higher genus curve defined over a smaller field, for example F p . This can
be done by finding a curve H over F p together with a surjective morphism from H
to E. In this context, we say that the curve H is a cover of E. Once such a curve
H is obtained, it is possible using the so called conorm technique to pull back a
discrete logarithm problem on E to a discrete logarithm problem on the Jacobian
of H . If the genus g of the target curve is not too large, this can lead to an efficient
discrete logarithm algorithm. This uses the fact that there exists an index calculus
algorithm on high genus curve of genus g over F p with complexity max(g ! p, p 2 ).
This was introduced by Enge, Gaudry, and Thomé [31].
Ideally, one would like the genus g to be equal to n. However, this is not possible
in general. Classifying the possible covers for elliptic curve seems to be a difficult task.

12.5.2 The Gaudry-Semaev Algorithm for Elliptic Curves E[ p n ]

Let Q = α P be a discrete logarithm on an elliptic curve E [ p n ]. So the goal is to find
the integer α ∈ N such that k times the point P ∈ E [ p n ] added to itself is equal to
the point Q ∈ E [ p n ].

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 572 — #18

i i

572 Solving Discrete Logarithms and Factoring

Gaudry’s discrete logarithm algorithm is of index calculus type. We briefly

outline the basic steps.

Factor base: Consists of all points (x, y ) on the elliptic curve E [ p n ] such that x ∈ F p .
That is x lies in the ground field F p rather than in the extension.

Relation finding: Given a random point R = a P, with a ∈ N, we try to write R as a

sum of exactly n points from the factor base, where n is the extension degree. This is
achieved by using the nth Semaev polynomial f n +1 . This polynomial is a symmetric
polynomial of degree 2n−2 in n + 1 unknowns x1 , …, xn +1 which encodes the fact
that there exists points with respective abscissae x1 , …, xn +1 that sum to zero. Of
course, the coefficients of f depend on the curve E. Replacing xn +1 by the abscissa
of R, we can find a decomposition of R as a sum of points from the factor base by
searching for a solution (x1 , · · · , xn ) in the base field F p . In order to do this, one first
rewrites f as a multivariate system of n equations by decomposing the constants
that appear in the polynomial over some basis of F pn over F p . This system of n
equations in n unknowns can be solved using a Groebner basis computation.

Individual discrete log computation: To compute the discrete logarithm of Q, it suf-

fices to find one additional relation that expresses a random multiple of Q, namely
R = a Q in terms of the points in the factor base. This is done in the exact same
way as the generation of relations in the previous step.

Runtime: The factor base can be computed in time O( p ). Every R can be written
as a sum of n factor base elements; that is, yields a relation, with probability expo-
nentially small in n (but independent of p). If it yields a solution, the running time
of a Groebner basis computation is also exponential in n (but polynomial in log p).
In total, we need roughly p relations which can be computed in time linearly in p
and exponentially in n. Since we assumed n to be fixed, we do not care about the
bad behavior in n. The linear algebra step on a ( p × p )-matrix can then be per-
formed in O( p 2 ), since the matrix is sparse—every row contains exactly n nonzero
2
entries. With additional tricks one achieves a running time of O( p 2− n ) for Gaudry’s
algorithm.
n
This should be compared to the generic bound of O( p 2 ) that we achieve when
using Pollard’s rho algorithm from Section 12.1. Similar to Section 12.3, almost the
whole complexity of the problem seems to be concentrated in the size of the base
field p, and not in the extension degree n. Notice that as in Section 12.3, Gaudry’s
algorithm is exponential in log p.

12.5.3 Best Known Algorithms for Elliptic Curves E[ p] Over Prime Fields
Generic discrete log solving: In general, the best algorithm that we know for arbi-
√
trary elliptic curves E [ p ] is Pollard’s rho method with a running time of O( p ).
For the moment, it seems that nobody knows how to exploit the structure of an
elliptic curve group or its elements in order to improve over the generic bound.
We would also like to point out that random elliptic curves; that is, where the
elliptic curve parameters a, b in the defining Weierstrass equation y 2 ≡ x 3 + ax +

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 573 — #19

i i

12.5 Best Known Algorithms for Elliptic Curves E 573

b mod p are chosen in a uniformly random manner, are among the hard instances.
To further harden elliptic curves, one chooses for standardization only those curves
that have (almost) prime order. This means that the cofactor of the largest prime
in the group order is usually 1, which abandons the use of Silver-Pohlig-Hellman’s
algorithm.

Embedding E [ p ] into F pk : It is known that in general elliptic curves E [ p ] can be

embedded into a finite field F pk , where k is the so-called embedding degree. In F pk
we could use the number field sieve for discrete logarithm computations. Hence
√
such an embedding would be attractive if L pk [ 13 ] is smaller than p, which is the
case only if the embedding degree k happens to be very small. However, for almost
all elliptic curves the embedding degree is known to be huge, namely comparable
to p itself.
Some constructions in cryptography (e.g., those that make use of bilinear pair-
ings), exploit the advantages of a small embedding degree. Thus, in these schemes
elliptic curves are explicitly chosen with a small embedding degree (e.g., k = 6),
balances out the hardness of the discrete logarithm problem on E [ p ] and in Fkp .

The xedni calculus algorithm: In 2000, Silverman published his xedni calculus algo-
rithm (read xedni backwards) that uses the group structure of E [ p ] for discrete
logarithm computations, and thus is the only known non-generic algorithm that
works directly on E [ p ]. However, it was soon after his publication discovered that
the so-called lifting process in Silverman’s algorithm has a negligible probability of
succeeding in computing a discrete logarithm.

12.5.4 Relation of Key Size versus Security for Elliptic Curves E[ p]

Similar to the discussion in Section 12.4.4 about key sizes for dlog in F p and for
factoring, we want to evaluate how key sizes have to be adapted for elliptic curves
E [ p ] in order to guard against an increase in computer speed. For elliptic curves,
such an analysis is comparably simple. The best algorithm that we know for the
dlog in E [ p ] is Pollard’s rho method with running time

1 √ log p
L p [1, ] = p = 2 2 .
2

This means that for achieving a security level of k bit, we have to choose a prime
p with 2k bit. In other words, increasing the bit-size of our group by 2 bit leads
to increase of 1 bit in security. By Moore’s law we loose 1 bit of security every 1.5
years just from an increase of a computer’s speed. In order to guard against this
loss over 10 years, it thus suffices to increase the group-size by just 7 · 2 = 14 bit.
Notice that as opposed to the case of dlog in F p and factoring in Section 12.4.4
this increase is linear and independent of the starting point. That means to guard
against technological speedups over 20 years, an increase of 28 bit is sufficient.
Of course, this analysis only holds if we do not have to face any major break-
through in computer technology or algorithms. For a more conservative choice see
the advice in Section 12.7.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 574 — #20

i i

574 Solving Discrete Logarithms and Factoring

12.5.5 How to Securely Choose Elliptic Curve Parameters

A comprehensive description on how to choose elliptic curve domain parameters
over finite fields can be found in RFC 5639 “ECC Brainpool Standard Curves and
Curve Generation” by Manfred Lochter and Johannes Merkle [32, 33]. This RFC
defines a publicly verifiable way of choosing pseudorandom parameters for elliptic
curve parameters, and thus it excludes the main source for embedding a trapdoor
in the definition of a group. The authors discuss all known properties of a curve
E [ p ] that might potentially weaken its security:

• A small embedding degree for the embedding into a finite field: This would
allow for the use of more efficient finite field algorithms. Especially, the
requirement excludes supersingular curves of order p + 1.
• Trace one curves that have order |E [ p ]| = p: These curves are known to be
weak by the discrete logarithm algorithms of Satoh-Araki [34], Semaev [35],
and Smart [36].
• Large class number: This excludes that E [ p ] can be efficiently lifted to a
curve defined over some algebraic number field. This requirement is quite
conservative, since even for small class numbers there is currently no efficient
attack known.

Moreover, the authors insist on the following useful properties:

• Prime order: This simply rules out subgroup attacks.

• Verifiable pseudorandom number generation: The seeds for a pseudorandom
number generator are chosen in a systematic way by Lochter and Merkle,
who use in their construction the first seven substrings of length 160 bit of
the fundamental constant π = 3.141 . . ..

In addition, Lochter and Merkle specify a variety of curves for p’s of bit-lengths in
the range 160 to 512. For TLS/SSL there is also a new set of proposed Brainpool
curves available [37].
The work of Bos, Costello, Longa, and Naehrig [38] gives a valuable introduc-
tion for practitioners on how to choose elliptic curve parameters that are secure and
also allow for efficient implementation in various coordinate settings (Weierstrass,
Edwards, Montgomery). Additionally, Bos et al. focus on side-channel resistance
against timing attacks by proposing constant-time scalar multiplications.
We highly recommend the SafeCurve project by Daniel Bernstein and Tanja
Lange [39] that provides an excellent overview for several selection methods, their
benefits and drawbacks. The goal of Bernstein and Lange is to provide security
of elliptic curve cryptography, rather than just strength of elliptic curves against
discrete logarithm attacks. Therefore, they take into account various types of side-
channels that may leak secrets in an implementation.
References and further reading: For an introduction to the mathematics of
elliptic curves and their cryptographic applications we refer to the textbooks of
Washington [40], Galbraith [6], and Silverman [41].
This section described the results of the original works of Gaudry, Hess,
Smart [42], Gaudry [43], Semaev [44], and the xedni algorithm of Silverman [41].

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 575 — #21

i i

12.6 Possibility of Embedded Backdoors in Cryptographic Keys 575

12.6 Possibility of Embedded Backdoors in

Cryptographic Keys

All cryptography seems to offer the possibility of embedding backdoors. Dlog

schemes offer some advantage over factoring-based schemes in the sense that
carefully chosen system-wide parameters protect all users.
The possibility of embedding trapdoors in cryptographic schemes to bypass
cryptography and thus to decrypt/sign/authenticate without the use of a secret key
is a long recognized problem that has been intensively discussed in the crypto-
graphic community (e.g., at the panel discussion at Eurocrypt 1990). However, the
wide-spread use of NSA’s backdoors as described by Edward Snowden has recently
renewed the interest in this topic.
It appears that by construction some schemes are way more vulnerable than
others. For example, for discrete-log based schemes the definition of the group
parameters is a system-wide parameter that is used by any user in the scheme. Thus,
a party that is able to manipulate the definition of a group in such a way that enables
this party to compute discrete logarithms in this group efficiently, can decrypt all
communication. On the other hand, a carefully specified secure group also offers
security for all users.
Currently, there is some speculation whether the NSA influenced NIST, the U.S.
standardization agency, to standardize certain elliptic curves. But the definition of
a group is not the only way to embed backdoors. All cryptographic schemes rely
inherently on a good source of (pseudo)random bits. It is well known that so-called
semantic security of encryption schemes cannot be achieved without randomness,
and every cryptographic secret key is assumed to be randomly chosen. Thus, a weak
pseudorandom generator opens the door for bypassing cryptography. Such a weak
pseudorandom generator was standardized by NIST as Special Publication 800-90,
although there have been warnings by the cryptographic community.
For factoring-based schemes the situation is slightly different from discrete
log-based schemes. As opposed to discrete log schemes, there are no system-wide
parameters that define a group. Nevertheless, there are known ways to embed,
for example, information about the factorization of the RSA modulus N in the
RSA public exponent e. Moreover, recent attacks on RSA public key infrastruc-
tures [45, 46] show that it appears to be a difficult problem to generate RSA public
keys with different primes in the public, mainly due to bad initialization of pseudo-
random generators. This of course does only affect badly chosen keys of individuals
as opposed to all users of cryptographic scheme.
Recommendation: Dlog-based schemes seem to be easier to control from a crypto-
designers perspective, since here all users have to take the same system-wide
parameters.
We do not discuss the possibility of malware here—which may render obsolete
any cryptographic protection method—or how to protect against it. But we would
like to stress the following (somewhat trivial) warning that addresses a crucial point
in practice.
Warning: Cryptography can only protect data if it is properly implemented and
does not leak its (imminent) secret. So in addition to the mathematical hardness

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 576 — #22

i i

576 Solving Discrete Logarithms and Factoring

Table 12.3 Security Level 100 Bit

System Key Size in Bits
Dlog in F p 2000 until 2019, then 3000
Factoring 2000 until 2019, then 3000
Dlog in E [ p ] 224 until 2015, then 250
Source: BSI [49], ANSSI [50].

of the underlying problems, we also have to trust in the implementor of a cryp-

tographic scheme. This trust does not only include that the cryptographic scheme
is implemented in the way it was originally designed (without embedding of any
backdoors), but also that the implementor does not reveal the generated secret keys
to a third party.
It seems that in the NSA affair, some companies were forced to reveal secret
keys. Thus, one has to keep in mind that one has to buy cryptographic schemes
from a completely reliable company that has not been compromised.
References and further reading: For a nice discussion of how to embed undetectable
backdoors in various cryptographic schemes, see the original works of Young and
Yung [47, 48]. See [45] for a current attack on a significant portion of RSA keys in
practice due to bad pseudorandom number generation.

12.7 Conclusion: Advice for Cryptographic Infrastructure

Despite recent discrete logarithm attacks, discrete logarithm-based schemes over

prime order groups and elliptic curve groups remain secure. The same holds for
factoring-based schemes. All discrete logarithm-based groups with small characteri-
stics are completely insecure. Our suggestion is to choose elliptic curve groups.

12.7.1 Suggestions for Choice of Scheme

As we saw in the previous sections of this chapter, discrete log-based schemes in F p
and over E [ p ] remain secure, as well as factoring-based schemes. In this subsection,
we suggest key sizes for these schemes that provide a sufficient security level for the
next two decades under the assumption that no major algorithmic breakthrough
occurs.
Our preference is to use elliptic curve groups E [ p ] since they offer the following
advantages:

• Algorithms for discrete logarithms in F p and factoring are closely linked. So

any progress in one of these two might imply some progress for the other.
But such progress is unlikely to affect the security of elliptic curve groups.
• The best algorithms for E [ p ] are those of generic type from Section 12.1,
which are inferior to the best algorithms for prime order discrete logarithm
and factoring with L [ 13 ] running time. This in turn means that the key growth
that compensates technological progress of faster computers is much smaller
for E [ p ]—roughly 2 bit every 1.5 years according to Moore’s law.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 577 — #23

i i

12.7 Conclusion: Advice for Cryptographic Infrastructure 577

• Getting algorithmic progress by using the group structure of E [ p ] seems to

be harder than for F p since, as opposed to F p , we do not even have an initial
starting group-structure index calculus algorithm that we could improve.
• If an elliptic curve E [ p ] is properly chosen (i.e., the group is computationally
hard and backdoor-free, then all users profit from the hardness of the discrete
logarithm problem. Notice that this choice is crucial: If the group is not
secure, then all users also suffer from its insecurity.

Warning: One should keep in mind that the suggestions above only hold in a world
without large quantum computers. It seems crucial to keep track of current progress
in this area, and to have some alternative quantum-resistant cryptosystems ready
to enroll within the next 15 years.
References and further reading: For a good and conservative choice of key sizes we
highly recommend following the suggestions of the Bundesamt für Sicherheit in der
Informationstechnik (BSI) [49] and the Agence nationale de la sécurité des systèmes
d’information [50]. Both sources also provide various valuable recommendations
how to correctly implement and combine different cryptographic primitives.

12.7.2 Year 2023: Conclusion Remarks

Since the first advice in April 2014, quite a lot of things have changed (there have
been new records in dlog finite fields and some marginal improvements of the L(1/3)
algorithms in some contexts). However, this does not affect the overall conclusion
that (only) small characteristic finite fields are no longer secure.
The recommendations of this chapter are still valid. See:

• BSI: “TR-02102-1: Cryptographic Mechanisms: Recommendations and Key

Lengths” in 2022 [49];
• NIST: “SP 800-186 (Final) Recommendations for Discrete Logarithm-Based
Cryptography: Elliptic Curve Domain Parameters” in February 2023 [51];
• NIST: “SP 800-56B Recommendation for Pair-Wise Key-Establishment
Schemes Using Integer Factorization Cryptography” in 2019 [52].

In April 2022, Fabrice Boudot et al. published the very good article “The State
of the Art in Integer Factoring and Breaking Public-Key Cryptography” in IEEE
Security & Privacy. There they review the three number-theoretic problems of inte-
ger factorization, discrete logarithms in finite fields, and discrete logarithms over
elliptic curves, and come to very similar results [53].

References

[1] Ptacek, T., et al., “The Factoring Dead—Preparing for the Cryptopocalypse,” in Black Hat
Conference (2013).
[2] Fouque, P.-A., A. Joux, and C. Mavromati, “Multi-User Collisions: Applicationsto Discrete
Logarithm, Even-Mansour and Prince,” in Cryptology ePrint Archive, 2014, https://eprint
.iacr.org/2013/761.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 578 — #24

i i

578 Solving Discrete Logarithms and Factoring

[3] May, A., Vorlesungsskript Zahlentheorie, 2013, https://www.cits.ruhr-uni-bochum.de/

imperia/md/content/may/13/ss13/zahlenss13/zahlentheorie.pdf.
[4] Menezes, A. J., P. C. van Oorschot, and S. A. Vanstone, Handbook of Applied Cryptog-
raphy, 5th ed., Series on Discrete Mathematics and Its Application, CRC Press, 2001,
https://cacr.uwaterloo.ca/hac/.
[5] Joux, A., Algorithmic Cryptanalysis, CRC Cryptography and Network Security Series,
Chapman & Hall, 2009.
[6] Galbraith, S. D., Mathematics of Public Key Cryptography, Cambridge University Press,
2012.
[7] May, A., Vorlesungsskript Kryptanalyse 1, 2008, https://www.cits.ruhr-uni-bochum.de/
imperia/md/content/may/pkk08/skript.pdf.
[8] May, A., Vorlesungsskript Kryptanalyse 2, 2012, https://www.cits.ruhr-uni-bochum.de/
imperia/md/content/may/12/ws1213/kryptanal12/kryptanalyse_2013.pdf.
[9] Homeister, M., Quantum Computer Science: An Introduction, Vieweg + Teubner Verlag,
2007.
[10] Mermin, D. N., Quantum Computing Verstehen, Cambridge University Press, 2008.
[11] Pollard, J. M., “A Monte Carlo Method for Factorization,” in BIT Numerical Mathematics
15, Vol. 3, 1975, pp. 331–334.
[12] Pollard, J. M., “Kangaroos, Monopoly and Discrete Logarithms,” in J. Cryptology,
Vol. 13, No. 4, 2000, pp. 437–447.
[13] Shor, P. W., “Algorithms for Quantum Computation: Discrete Logarithms and Factoring,”
in FOCS, 1994, pp. 124–134.
[14] Shoup, V., “Lower Bounds for Discrete Logarithms and Related Problems,” in EURO-
CRYPT, 1997, pp. 256–266.
[15] Müller-Stach, and Piontkowski, Elementare und Algebraische Zahlentheorie, Vieweg
Studium, 2011.
[16] Lenstra, A. K., and H. W. Lenstra Jr., The Development of the Number Field Sieve, Lecture
Notes in Mathematics, Springer, Verlag, 1993.
[17] Adleman, L. M., “A Subexponential Algorithm for the Discrete Logarithm Problem with
Applicationsto Cryptography (Abstract),” in FOCS, 1979, pp. 55–60.
[18] Coppersmith, D., A. M. Odlyzko, and R. Schroeppel, “Discrete Logarithms in GF(p),” in
Algorithmica, Vol. 1, No. 1, 1986, pp. 1–15, http://dx.doi.org/10.1007/BF01840433.
[19] Pomerance, C., “The Quadratic Sieve Factoring Algorithm,” in Proceedings of Crypto ‘84,
LNCS 196, G.R. Blakley and D. Chaum (eds.), Springer, 1984, pp. 169–182.
[20] Pomerance, C., “A Tale of Two Sieves,” in Notices Amer. Math. Soc, Vol. 43, 1996,
pp. 1473–1485.
[21] Joux, A., “A New Index Calculus Algorithm with Complexity L(1/4+o(1)) in Very Small
Characteristic,” in IACR Cryptology ePrint Archive 2013, 2013, p. 95.
[22] Barbulescu, R., et al., “A Quasi-Polynomial Algorithm for Discrete Logarithm in Finite
Fields of Small Characteristic,” in CoRR, 2013, abs/1306.4244.
[23] Joux, A., and R. Lercier, “The Function Field Sieve in the Medium Prime Case,” in
EUROCRYPT, 2006, pp. 254–270.
[24] Joux, A., “Faster Index Calculus for the Medium Prime Case Application to 1175-bit and
1425-bit Finite Fields,” in EUROCRYPT, 2013, pp. 177–193.
[25] Göloglu, F., et al., “On the Function Field Sieve and the Impact of Higher Splitting
Probabilities—Application to Discrete Logarithms,” in CRYPTO (2), 2013, pp. 109–128.
[26] Coppersmith, D., “Evaluating Logarithms in GF(2n),” in STOC, 1984, pp. 201–207,
https://dl.acm.org/doi/10.1145/800057.808682.
[27] Kleinjung, T., et al., “Factorization of a 768-Bit RSA Modulus,” in CRYPTO, 2010, pp.
333–350, http://dx.doi.org/10.1007/978-3-642-14623-7_18.
[28] Blömer, J., Vorlesungsskript Algorithmische Zahlentheorie, 1999.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 579 — #25

i i

12.7 Conclusion: Advice for Cryptographic Infrastructure 579

[29] Bach, E., Discrete Logarithms and Factoring, UCB/CSD-84-186, June 1984, https://www2
.eecs.berkeley.edu/Pubs/TechRpts/1984/5973.html; https://www2.eecs.berkeley.edu/Pubs/
TechRpts/1984/CSD-84-186.pdf.
[30] Joux, A., and V. Vitse, “Cover and Decomposition Index Calculus on Elliptic Curves Made
Practical. Application to a Seemingly Secure Curve Over Fp6,” in IACR Cryptology ePrint
Archive, 2011, p. 20.
[31] Enge, A., P. Gaudry, and E. Thomé, “An L(1/3) Discrete Logarithm Algorithm for Low
Degree Curves,” in J. Cryptology, Vol. 24, No. 1, 2011, pp. 24–41.
[32] Lochter, M., and J. Merkle, Elliptic Curve Cryptography (ECC) Brainpool Stan-
dard Curves and Curve Generation, RFC 5639, 2010, https://datatracker.ietf.org/
doc/html/rfc5639.
[33] Lochter, M., and J. Merkle, ECCBrainpool Standard Curves and Curve Generation
v. 1.0, 2005, https://www.teletrust.de/fileadmin/files/oid/oid_ECC-Brainpool-Standard-
curves-V1.pdf.
[34] Satoh, T., and K. Araki,“Fermat Quotients and the Polynomial Time Discrete Log Algo-
rithm for Anomalous Elliptic Curves,” in Commentarii Mathematici Universitatis Sancti
Pauli 47, 1998.
[35] Semaev, I., “Evaluation of Discrete Logarithms on Some Elliptic Curves,” in Mathematics
of Computation 67, 1998.
[36] Smart, N., “The Discrete Logarithm Problem on Elliptic Curves of Trace One,” in Journal
of Cryptology 12, 1999.
[37] Lochter, M., and J. Merkle, Elliptic Curve Cryptography (ECC) Brainpool Curves for
Transport Layer Security (TLS), RFC 7027, 2013, https://datatracker.ietf.org/doc/html/
rfc7027.
[38] Bos, J. W., et al., Selecting Elliptic Curves for Cryptography: An Efficiency and Security
Analysis, 2014, https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/
selecting.pdf.
[39] Bernstein, D., and T. Lange, SafeCurves: Choosing Safe Curves for Elliptic-Curve Cryp-
tography, 2014, https://safecurves.cr.yp.to.
[40] Washington, L. C., Elliptic Curves: Number Theory and Cryptography, Discrete Mathe-
matics and its Applications, Chapman and Hall/CRC, 2008.
[41] Silverman, J. H., “The Xedni Calculus and The Elliptic Curve Discrete Logarithm
Problem,” in Designs, Codes and Cryptography, Vol. 20, 1999, pp. 5–40.
[42] Gaudry, P., F. Hess, and N. P. Smart, “Constructive and Destructive Facets of Weil
Descenton Elliptic Curves,” in J. Cryptology, Vol. 15, No. 1, 2002, pp. 19–46.
[43] Gaudry, P., “Index Calculus for Abelian Varieties of Small Dimension and the Ellip-
tic Curve Discrete Logarithm Problem,” in J. Symb. Comput., Vol. 44, No. 12, 2009,
pp. 1690–1702.
[44] Semaev, I., “Summation Polynomials and the Discrete Logarithm Problem on Elliptic
Curves,” in IACR Cryptology ePrint Archive, 2004, p. 31.
[45] Lenstra, A. K., et al., “Public Keys,” in CRYPTO, 2012, pp. 626–642, http://dx.doi.org/
10.1007/978-3-642-32009-5_37.
[46] Heninger, N., et al., “Mining Your Ps and Qs: Detection of Widespread Weak Keys in
NetworkDevices,” in Proceedings of the 21st USENIX Security Symposium, August 2012,
https://factorable.net/paper.html.
[47] Young, A. L., and M. Yung, “The Dark Side of Black-Box Cryptography, or: Should We
TrustCapstone?” in CRYPTO, 1996, pp. 89–103.
[48] Young, A. L., and M. Yung. “Kleptography: Using Cryptography Against Cryptography,”
in EUROCRYPT. 1997, pp. 62–74.
[49] BSI,Technical Guideline TR-02102-1, Cryptographic Mechanisms: Recommendations and
Key Lengths (Version 2022-01), Tech. rep., 2022, https://www.bsi.bund.de/SharedDocs/
Downloads/EN/BSI/Publications/TechGuidelines/TG02102/BSI-TR-02102-1.pdf.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 580 — #26

i i

580 Solving Discrete Logarithms and Factoring

[50] Agencenationale de la sécurité des systèmes d’information, Référentiel général de sécu-

rité Version 2.02, 2013, https://www.ssi.gouv.fr/administration/reglementation/confiance-
numerique/le-referentiel-general-de-securite-rgs/.
[51] Chen, L., et al., Recommendations for Discrete Logarithm-based Cryptography: Elliptic
Curve DomainParameters, Special Publication (NIST SP), National Institute of Standards
and Technology, 2023, https://csrc.nist.gov/publications/detail/sp/800-186/final.
[52] Barker, E., et al., Recommendation for Pair-Wise Key-Establishment Schemes Using Integer
Factorization Cryptography, Special Publication (NIST SP), National Institute of Standards
and Technology, 2019, https://nvlpubs.nist.gov/nistpubs/SpecialPublications/NIST.SP.800-
56Br2.pdf.
[53] Boudot, F., et al., “The State of the Art in Integer Factoring and Breaking Public-Key Cryp-
tography,” in IEEE Security & Privacy, Vol. 20, No. 2, 2022, pp. 80–86, https://ieeexplore
.ieee.org/document/9740707.

i i

Figure 13.2 Secure key sizes: result in BlueKrypt for the year 2022.

one would have to double the bit length of an RSA key to achieve the same level of
security. Despite intensive research in this area, it is still too early to say whether we
will ever be able to build quantum computers of sufficient capacity to apply Shor’s
algorithm to numbers of relevant size. See Section 5.12.3.
Early announcements by D-Wave about the performance of their quantum com-
puter were met with a lot of skepticism, even ridicule. As large companies have
invested heavily in QC, the skepticism has turned into hype.
The development of attacks on another widely used scheme called Digital Sig-
nature Algorithm and the elliptic curve cryptography class of schemes is analogous
to those on RSA. The security of these schemes depends on the difficulty of comput-
ing discrete logarithms. Even today, significant algorithmic progress is being made.
Quantum computers would render these schemes insecure.
And what’s the status of symmetric (so called secret-key) encryption schemes?
In 1977, DES was introduced as Data Encryption Standard [5]. Twenty-one years
later, the Electronic Frontier Foundation built Deep Crack, a specialized machine
that took only 56 hours to break a DES key. The problem with DES was that it
used keys that were too short. It seems that the inventors of DES did not anticipate
the speed of hardware development. The Advanced Encryption Standard [6], the
successor to DES, is currently considered secure, although there are interesting,
though still inefficient, methods to attack AES using algebraic methods.
AES is the gold standard for all symmetric ciphers—and because of more pow-
erful and cheaper chips, it is now even used in low-power, resource-constrained
devices such as sensors.

13.4 New Signatures

In 1979, Ralph Merkle proposed a remarkable framework for new signature

schemes in his PhD thesis [10]. Unlike all other signature schemes, its security is
not based on the difficulty of a number-theoretic, algebraic, or geometric problem.
It requires only what other signature schemes require anyway: a cryptographically
secure hash function and a secure pseudorandom number generator. Each new hash
function leads to a new signature algorithm. As a result, the Merkle scheme has the
potential to solve the problem of long-term availability of digital signature schemes.
Merkle uses so-called one-time signatures in his construction: Each new signa-
ture requires a new signing key and a new verification key. Merkle’s idea was to
use a hash tree to reduce the validity of many verification keys to the validity of a
unique public hash value. When generating keys for the Merkle scheme, one must
determine in advance the number of signatures that can be made with them. For
a long time, this seemed to be a significant drawback. In [11], however, a variant
the Merkle scheme was proposed that allows 240 signatures to be computed with a
single key pair.2
Another new signature scheme uses multivariate cryptography. This asymmet-
ric scheme uses multivariate polynomials over a finite field.3

13.5 Quantum Cryptography: A Way Out of the Dead End?

From the point of view of the current state of cryptography, the problem of
long-term confidentiality remains unsolved: There is no practical way to keep an
encrypted message secret for a very long time.
Quantum cryptography can provide a way out here: These quantum technolo-
gies establish a secure channel to enable the exchange of keys (e.g., very long keys for
one-time pads). Their security is guaranteed by the laws of quantum mechanics; see
[13]. However, the known methods of quantum cryptography are currently rather
inefficient and allow only symmetric methods. Governments, for example, can use
them to exchange top-secret information. For many applications such as signatures,
symmetric cryptography alone is not sufficient. Note that quantum cryptography
should not be confused with post-quantum cryptography.

13.6 Post-Quantum Cryptography

Today’s cryptography provides tools to ensure short- and medium-term security.

Software developers can use these tools in their applications with a clear conscience

2. Under JCT Default Perspective F Visuals you can find several components and variants of this: the
one-time signature WOTS+, the normal Merkle signature (MSS), the extended Merkle signature scheme
(XMSS), and the multitree Merkle signature scheme (XMSS_MT).
In addition, the SPHINCS+ signature is extensively visualized. SPHINCS+ was one of the second track
candidates in the NIST post-quantum computing contest in round three (2020).
Many variants are offered in the JCT Algorithm Perspective, delivered by the BouncyCastle library.
3. In JCT Default Perspective F Visuals F Multivariate Cryptography, the rainbow signature vari-
ant by Jintai Ding and Dieter Schmidt [12] is used, which utilizes several layers of multivariate linear
equation systems.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 586 — #6

i i

586 Future Use of Cryptography

as long as they ensure that components can be quickly exchanged when they become
insecure.
To ensure IT security in the future, we need to build a portfolio of secure cryp-
tographic schemes. This portfolio must include schemes that are suitable for the
world of ubiquitous computing with many less powerful computers. It also needs to
include schemes that will remain secure in the event that powerful quantum comput-
ers are built. Several promising candidates have been discussed in this chapter. The
question of how to ensure long-term confidentiality still remains an open research
problem.
In 2016, NIST launched a competition to identify suitable alternatives to the
current generation of cryptographic methods (such as RSA or ECDSA). This next
generation of cryptographic algorithms is called “post-quantum cryptography.”
In July 2022, as a result of the third round, NIST announced which methods it
wants to standardize [14]:

• For public-key encryption and key exchange: CRYSTALS-Kyber (see

Section 11.11);
• For digital signatures: CRYSTALS-Dilithium, Falcon, SPHINCS+.

13.7 Conclusion

Cryptography is important and a lot of work, but cryptography is also intellectually

challenging and fun.
For the users (both private and business) cryptography is mostly an invisible
part of IT security and of corporate risk management as outlined in Figure 13.3.4
We are seeing more and more end-to-end encryption and sophisticated proto-
cols in products. Messengers are a good example: Signal5 was the first widely used
protocol for postcompromise security, and its successor Messaging Layer Security
(MLS) will make even chat groups secure. MLS is an emerging standard that sup-
ports end-to-end encryption in messaging applications, and was published as RFC
9420 in July 2023 [15].
IT security is now less at risk from bad cryptographic algorithms than from:

• Attackers who just need to find one weak link in the chain. For example, one
server with weak password hashing, one computer on a network without
updates, one misconfigured router, one outdated component or library, and
so on;
• Users who mainly want speed and good usability, but don’t care about
security (awareness, backups, and common sense are needed even on the
computer);
• Monocultures and digital dependencies: This includes hardware with subsys-
tems such as “management engine” or “secure technology” and operating

4. See “CrypTool for Awareness” https://www.cryptool.org/en/education/awareness.

5. In JCT Default Perspective F Visuals F Signal Encryption, the double ratchet scheme of the
Signal protocol is visualized step by step.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 587 — #7

i i

13.7 Conclusion 587

Figure 13.3 Embedding cryptology between corporate risk management and science.

systems or antivirus software that are always online and may send back
sensitive data or have backdoors;
• Last but not least, law enforcement and surveillance: Despite all the legit-
imate reasons, government agencies have never been able to keep the data
they collect to themselves. According to an unnamed intelligence source, all
the zero-day exploits they collected end up in the hands of organized crime
after an average of two years.

References

[1] Rivest, R. L., A. Shamir, and L. Adleman, “A Method for Obtaining Digital Signatures and
Public-Key Cryptosystems,” in Communications of the ACM, Vol. 21, No. 2, April 1978,
pp. 120–126.
[2] Lenstra, A. K., and E. R. Verheul, Selecting Cryptographic Key Sizes (1999 + 2001), in
Journal of Cryptology, Vol. 14, 2001, pp. 255–293, https://www.cs.ru.nl/E.Verheul/paper-
s/Joc2001/joc2001.pdf.
[3] Giry, D., BlueKrypt: Cryptographic Key Length Recommendation, Version 32.3, May
2020, https://www.keylength.com/.
[4] Shor, P. W., “Polynomial Time Algorithms for Prime Factorization and Discrete Loga-
rithms on a Quantum Computer,” in SIAM Journal on Computing, Vol. 26, No. 5, 1997,
pp. 1484–1509.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:46 — page 588 — #8

i i

588 Future Use of Cryptography

[5] Data Encryption Standard (DES), Federal Information Processing Standards (FIPS) 46.
National Bureau of Standards, National Technical Information Service, Springfield, Vir-
ginia: U.S. Department of Commerce, 1977.
[6] Dworkin, M. J., et al., Advanced Encyption Standard (AES), Federal Information Process-
ing Standards(FIPS) 197, National Institute of Standards and Technology (NIST). Gaithers-
burg: U.S. Departmentof Commerce, November 26, 2001, https://nvlpubs.nist.gov/nist-
pubs/FIPS/NIST.FIPS.197.pdf.
[7] Bernstein, D., and T. Lange, Post-Quantum Cryptography—Dealing with the Fall-
out of Physics Success, in Nature, 2017, http://www.readcube.com/articles/10.1038/na-
ture23461; https://eprint.iacr.org/2017/314.pdf.
[8] Beullens, W., et al., Post-Quantum Cryptography: Current State and Quantum Mitigation,
Tech. rep., 2021, https://www.enisa.europa.eu/publications/post-quantum-cryptography-
current-state-and-quantum-mitigation/@@download/fullReport.
[9] McEliece, R. J., “A Public Key Cryptosystem Based on Algebraic Coding Theory,” in DSN
Progress Report 42–44, 1978, pp. 114–116.
[10] Merkle, R. C., “Secrecy, Authentication, and Public Key Systems,” PhD thesis, Department
of Electrical Engineering, Stanford University, 1979.
[11] Buchmann, J., et al., “CMSS—An Improved Merkle Signature Scheme,” in 7th Interna-
tional Conference on Cryptology in India—Indocrypt’06, R. Barua and T. Lange (eds.),
lecture notes incomputer science 4392, Springer-Verlag, 2006, pp. 349–363.
[12] Ding, J., and D. Schmidt, “Rainbow, a New Multivariable Polynomial Signature Scheme,”
in Applied Cryptography and Network Security, J. Ioannidis, A. Keromytis, and M. Yung
(eds.), Springer, 2005, pp. 164–175.
[13] C. H. Bennett and G. Brassard. “An Update on Quantum Cryptography,” in Advances in
Cryptology—CRYPTO ’84, G. R. Blakley, and D. Chaum (eds.), Vol. 196, lecture notes in
computer science, Springer-Verlag, 1985, pp. 475–480.
[14] Report on the Third Round of the NIST Post-Quantum Cryptography Standardization
Process, National Institute of Standards and Technology (NIST), July 2022 (updated Sept.
2022), https://nvlpubs.nist.gov/nistpubs/ir/2022/NIST.IR.8413.pdf.
[15] IETF, The Messaging Layer Security (MLS) Protocol RFC 9420, https://data-
tracker.ietf.org/doc/rfc9420/ (visited on 08/02/2023).

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 589 — #1

i i

APPENDIX A
APPENDIX A

Software

Sections A.1 to A.4 briefly describe the four CT variants CT1, CT2, JCT, and CTO.1
For each, the functions offered (via menus, templates, or plugins) are shown.

A.1 CrypTool 1 Menus

On the internet, a list with all functions offered by CrypTool 1 (CT1) can be created
with: https://www.cryptool.org/en/documentation/functionvolume?ctver
sion=ct1.
The main menu of CT1 contains both generic service functions in the six main
menu items

• File; • Options;
• Edit; • Window;
• View; • Help.

and the actual crypto functions in the following four main menus:

• Encrypt / decrypt; • Individual procedures;

• Digital signature / PKI; • Analysis.

Within Individual Procedures you find visualizations of single algorithms

and of protocols. Some procedures are implemented both for a fast perfor-
mance (mostly under the main menu Encrypt/Decrypt) and for a step-by-step
visualization.
Which of the menu items in CrypTool 1 are active (that means not grayed)
depends on the type of currently active document window: The brute-force analysis
for DES, for example, is only available if the active window is opened in the hex-
adecimal view. On the other hand, the menu item “Generate Random Numbers…”
is always available (even if no document is opened).
Screenshots from CT1 can be found at https://www.cryptool.org/en/ct1/
screenshots/screenshots.
1. From 2011, changes for CrypTool 1 were limited to bugfixes and pure maintenance. However, regularly new
developments went into the two CT1 successors CrypTool 2 and JCrypTool (JCT). In the meantime, their
functional range is bigger than the one of CT1. From 2023, JCT comes to pure maintenance. CT2 is still
actively developed. The web version CrypTool-Online (CTO) was and further will be expanded considerably.
- CT1: https://www.cryptool.org/en/ct1/documentation/features;
- JCT: https://www.cryptool.org/en/jct/documentation/resources;
- CT2: https://www.cryptool.org/en/ct2/resources;
- CTO: https://www.cryptool.org/en/cto/.

589

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 590 — #2

i i

590 Software

A.2 CrypTool 2 Templates and the WorkspaceManager

When you start CT2 it first shows the Startcenter (see Figure A.1).2
Figure A.2 shows the beginning of a list with all functions offered by Cryp-
Tool 2. This list was created with https://www.cryptool.org/en/documentation
/functionvolume?ctversion=ct2.
Besides the information on how you can get in the web a list of all functions in
CT2, this appendix contains information about the templates (graphical programs
included in CT2) and about the graphical editor (“WorkspaceManager”) of CT2.
When CT2 is started, the Startcenter opens first.
In the Startcenter, you have the choice to open CT2 templates in two different
ways:

• Via the Wizard (second icon with magic wand, below “Main functions”),
which guides you to the provided templates.
• Via the template tree (window in the center of the Startcenter), from which
you can select ready-made cryptographic workflows.

Figure A.1 Startcenter in CT2 (Nightly Build, October 2023).

2. The current CT2 release is CT 2.1 (release 2023.1 from June 2023). Each day a new “nightly build” is
generated.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 591 — #3

• Default perspective;
• Algorithm perspective.

All functions of the default perspective can be found both in the menus and in
the navigation bar called “Crypto Explorer” (at the right side). The default perspec-
tive contains all important methods like classic transposition or modern AES, and
many visualizations (e.g., Diffie-Hellman key exchange or calculations on elliptic
curves).

3. The current JCT release is JCT 1.0.9 (July 2023). Occasionally a new weekly build is generated. You can
find further information about JCT at: https://www.cryptool.org/en/jct/volunteer.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 593 — #5

i i

A.3 JCrypTool Functions 593

Figure A.3 Extract of the expanded template tree in CT2.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 594 — #6

i i

i i

596 Software

Figure A.6 Starting page of CTO (February 2022).

Miscellaneous

B.1 Movies and Fictional Literature with Relation to

Cryptography

Cryptographic applications—classical as well as modern ones—have been used in

literature and movies. In some media they are only mentioned and are a pure
admixture; in others they play a primary role and are explained in detail; and
sometimes the purpose of the story, which forms the framework, is primarily to
transport this knowledge and achieve better motivation. Here is the beginning of an
overview.

B.1.1 For Grownups and Teenagers

The Gold Bug, Edgar Allan Poe, 1843.
In this short story Poe tells as first-person narrator about his acquaintanceship with
the curious Mr. Legrand. They detect the fabulous treasure of captain Kidd via a
gold bug and a vellum found at the coast of New England. The cipher consists
of 203 cryptic symbols, and it proves to be a general monoalphabetic substitution
cipher (see Section 2.2.1). The story tells how they solve the riddle step by step using
a combination of semantic and syntax analysis (frequency analysis of single letters
in English texts).
In this novel the code breaker Legrand says the famous statement: “Yet it may
be roundly asserted that human ingenuity cannot concoct a cipher which human
ingenuity cannot resolve—given the according dedication.” Poe not only was a well-
known writer, but also a talented cryptographer. His story is also told in the book
Code Breaking [1].

Mathias Sandorf, Jules Verne, 1885.

This is one of the most famous novels of the French author Jules Verne (1828–1905),
who was called “Father of Science Fiction.” In Mathias Sandorf he tells the story
of the freedom fighter Earl Sandorf, who is betrayed to the police, but finally he
can escape.
The whistleblowing worked, because his enemies captured and decrypted a
secret message sent to him. For decryption, they needed a special grille, which they
stole from him. This turning grille was a quadratic piece of jig with 6 × 6 squares,
of which one-quarter (nine) were holes (see the turning grille in Section 2.1.1).

In this movie Ruth tries to solve the secret that made her daughter fall silent. Here
two young people suffering from autism communicate via 5- and 6-digit primes (see
Chapter 4). After more than 1 hour the movie contains the following encrypted two
series of primes:
21383, 176081, 18199, 113933, 150377, 304523, 113933;
193877, 737683, 117881, 193877
Compare the story The Dialogue of the Sisters.

Sneakers, Directed by Phil Alden Robinson, Universal Pictures Film, 1992.

In this movie the “sneakers,” computer experts under their boss Martin Bishop, try
to get back the deciphering box SETEC from the “bad guys.” SETEC, invented by
a genius mathematician before he was killed, allows decrypting all codes from any
nation. In the movie the code is not described in any way.
Leonard Adleman (the “A” within RSA) worked as mathematical consultant
for “sneakers.” He describes the funny story about his contribution at his home-
page https://theworld.com/ reinhold/math/sneakers.adleman.html. It is
assumed that the cipher used everywhere is RSA. So on the chip a fast, unknown
factorization method is implemented.

Total Control, David Baldacci, Mass Market Paperback, 1997.

Jason Archer, executive with a technology company suddenly disappears. Sidney
Archer tries to find out about her husband’s surprising death. She gets a clue how
the global financial system is abused and that the real control belongs to those with
the most money. Here even good passwords don’t help.

Cube, Directed by Vincenzo Natali, Mehra Meh Film, 1997.

In this Canadian low-budget-movie, seven complete strangers of widely varying
personality characteristics are involuntarily placed in a Kafkaesque maze of cubical
rooms containing deadly traps.
To get out, the persons have to move through these rooms. To find out which
rooms are dangerous, mathematics is crucial: Each cubic room has at its entrance
a numerical marking consisting of three sets of three digits. First they deduce that
all rooms marked at their entrance with at least one prime number are trapped.
Later it comes out that a trapped room can also be marked by a number which is a
power of a prime (so traps are p n , e.g., 128 = 27 or 101 = 1011 = prime, but not
517 = 11 ∗ 47).

Mercury Rising, Directed by Harold Becker, Universal Pictures Film, 1998.

The NSA developed a new cipher, which is pretended to be uncrackable by humans
and computers. To test its reliability some programmers hide a message encrypted
with this cipher in a puzzle magazine.
Simon, a nine-year-old autistic boy, cracks the code. Instead of fixing the code,
a government agent sends a killer. FBI agent Art Jeffries (Bruce Willis) protects the
boy and sets a snare for the killers.
The code is not described in any way.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 604 — #4

i i

604 Miscellaneous

Digital Fortress, Dan Brown, E-Book, 1998.

Dan Brown’s first novel was published in 1998 as an e-book, but it was largely
unsuccessful then.
The National Security Agency uses a huge computer, that enables them to
decrypt all messages (needless to say only of criminals and terrorists) within minutes
even if they use the most modern encryption methods.
An apostate employee invents an unbreakable code and his computer pro-
gram Diabolus forces the super computer to do self-destructing operations. The
plot, where also the beautiful computer expert Susan Fletcher has a role, is rather
predictable.
The idea that the NSA or another secret service is able to decrypt any code
is currently a popular topic. In Digital Fortress the super computer has 3 million
processors—nevertheless from today’s view this is by no means sufficient to hack
modern ciphers.
The Dialogue of the Sisters, C. Elsner, c’t, Heise, 1999.
In this short story, the sisters confidentially communicate using a variant of RSA (see
Section 5.10). They are residents of a madhouse being under permanent surveillance.
The PDF file is displayed in CT1 if you there enter, for example, the search
term “sisters” in the online help.
Cryptonomicon, Neal Stephenson, Harper, 1999.
This very thick novel deals with cryptography both in WW2 and today. The two
heroes from the 1940s are the excellent mathematician and cryptanalyst Lawrence
Waterhouse, and the overeager and morphine-addicted U.S. marine Bobby Shaftoe.
They both are members of the special allied unit 2702, which tries to hack the
enemy’s communication codes and at the same time to hide its own existence.
This secretiveness also happens in the present plot, where the grandchildren of
the war heroes—the dedicated programmer Randy Waterhouse and the beautiful
Amy Shaftoe—team up.
Cryptonomicon is notably heavy for nontechnical readers in parts. Several pages
are spent explaining in detail some of the concepts behind cryptography. Stephenson
added a detailed description of the solitaire cipher (see Section 2.4), a paper-and-
pencil encryption algorithm developed by Bruce Schneier which is called “Pontifex”
in the book. Another, modern algorithm called “Arethusa” is not explained in detail.
The Chinese Labyrinth, C. Elsner, c’t, Heise, 2001, Updated 2020.
In this short story, which is included in the CrypTool package as a PDF file, Marco
Polo has to solve problems from number theory within a competition to become a
major consultant of the Great Khan. All solutions are included and explained.
The new version (with lots of SageMath code) can be found at https://www
.cryptool.org/assets/ctp/documents/cttc/chinlab-en.pdf.
Artemis Fowl, Eoin Colfer, Viking, 2001.
In this book for young people the 12-year-old Artemis, a genius thief, gets a copy
of the top secret Book of the Elfs. After he decrypted it with his computer, he finds
out things that men never should have known.
The used code is not described in detail or revealed.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 605 — #5

i i

B.1 Movies and Fictional Literature with Relation to Cryptography 605

A Beautiful Mind, Directed by Ron Howard, 2001.

This is the film version of Sylvia Nasar’s biography of the game theorist John Nash.
After the brilliant but asocial mathematician accepts secret work in cryptography,
his life takes a turn to the nightmarish. His irresistible urge to solve problems
becomes a danger for himself and his family. Nash is—within his belief—a most
important hacker working for the government. Details of his way of analyzing code
are not described in any way.

Enigma, Directed by Michael Apted, 2001.

This is the film version of Robert Harris’ historical fiction Enigma (Hutchinson,
London, 1995) about the World War II code-breaking work at Bletchley Park in
early 1943, when the actual inventor of the analysis Alan Turing (after Polish pre-
work) already was in the United States. So the fictional mathematician Tom Jericho
is the lead character in this spy-thriller. Details of his way of analyzing the code are
not described.

The Museum of the Stolen Memories (original title: Das Museum der gestohlenen
Erinnerungen), Ralf Isau, Thienemann-Verlag, 1997/2003.
In this exciting novel the last part of the oracle can only be solved with the joined
help of the computer community. The book got several awards and exists in eight
different languages, but not in English yet.

The Da Vinci Code, Dan Brown, Doubleday, 2003.

The director of the Louvre is found murdered in his museum in front of a picture
of Leonardo da Vinci, and the symbol researcher Robert Langdon is involved in a
conspiracy.
The plot mentions different classic codes (substitution like Caesar or Vigenère,
as well as transposition and number codes). Also, there are hints about Schneier
and the sunflower. The second part of the book contains a lot of theological
considerations.
This book has become one of the most widely read books of all time.

Final Solution, Scott McBain, manuscript not published by Harper Collins, 2004
(German version was published in 2005).
In a near future, politicians, chiefs of military, and secret services of many dif-
ferent countries take over all the power. With a giant computer network called
“Mother” and complete surveillance, they want to cement their power and com-
mercialization of life forever. Humans are only assessed according to their credit
rating, and globally acting companies elude of any democratic control. Within the
thriller, the obvious injustice, but also the realistic likelihood of this development,
are considered again and again.
With the help of a cryptographer, a code to destroy was built into the super
computer “Mother”: In a race several people try to start the deactivation (Lars
Pedersen, Oswald Plevy, the female American president, the British prime minister,
and an unknown Finnish person named Pia, who wants to take revenge for the
death of her brother). On the opposite side a killing group acts under the special
guidance of the British foreign minister and the boss of the CIA.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 606 — #6

i i

606 Miscellaneous

The Cryptographer, Tobias Hill, Faber & Faber, 2003.

London 2021: The company SoftMark developed and established an electronic cur-
rency that guarantees the highest security standards by an unbreakable code. The
inventor and company founder, called the cryptographer because of his mathemat-
ical talent, has become the richest man in the world. But the code was hacked, and
in a worldwide economic crisis his company goes bankrupt. Additionally, the tax
investigator Anna Moore is set on him.

Tyrannosaur Canyon, Douglas Preston, Forge Books, 2005.

A very exciting thriller that also struggles with the question of why the dinosaurs
died off.
Archeologist Stem Weathers is shot in a canyon. Before his murderer appears he
gives his notebook to Tom Broadbent, a local animal doctor, coming by accidentally.
The notebook contains on 60 pages only digits. Therefore Tom takes it to
Wyman Ford an ex-CIA cryptanalyst, who now lives in a nearby abbey, after his
wife was killed in action. Wyman first declines and says that self-invented codes are
“idiot ciphers,” devised by an idiot and easily crackable by each idiot. The note-
book then proves to be not that easy. After intensive analysis he finds out that the
digits are no code but the output of an Earth radar device showing the picture of a
well-preserved T. rex.
After around 250 pages of endless chases, a surprising turn comes up: Masago,
head of a so-called black-detachment unit of the CIA. He explains, new weapons
invented once have always been used. Mankind will kill herself, but it’s his task
to postpone that as far as possible. As head of the LS480 department he will
prevent by any means possible that terrorists get any new dangerous biological
weapon.
When scanning the dead body of Weathers, the murderer only found some rock
cuttings he took. These rocks are investigated by a young researcher named Melody
Crookshank, although she doesn’t know where the rock cuttings come from. She
finds within them a very special kind of virus apparently coming from outer-space.

Heidelberg Lies (original title: Heidelberger Lügen), Wolfgang Burger, Piper, 2006.
This detective story playing in the Rhein-Neckar area in Germany has several inde-
pendent strands and local stories, but mainly it is about police officer Gerlach from
Heidelberg. On page 207, the cryptographic reference for one strand is shortly
explained: The soldier Hörrle had copied circuit diagrams of a new digital NATO
decryption device and the murdered man had tried to sell his perceptions to China.

The Black Sun, James Twinig, HarperCollins, 2006.

A history-based thriller with some artificially constructed elements, dealing also
with a treasure hunt to get the hidden uranium of the Nazis, and naturally the
future of the world depends on today’s bad guys being stopped in time.
Heros are Tom Kirk, a London-based ex-CIA agent and former professional
art thief, and Dominique de Lecourt, who loves challenges including riddles and
codes.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 607 — #7

i i

B.1 Movies and Fictional Literature with Relation to Cryptography 607

The only cryptographic parts are a “Sprungcode” (the criminals use this method
to communicate via newsletter adverts), steganography (used to hide the Enigma
key), and an Enigma message (containing the encrypted coordinates of the treasure).
At the beginning of the plot an Enigma device is stolen with high efforts which
is necessary to let the story play in the constructed way. But in the reality today such
a theft is completely needless, as there are great software emulators for the Enigma.

Kryptum, Agustin Sanchez Vidal, Dtv, 2006.

The first novel of the Spanish professor of art history has some similarities with
Dan Brown’s The Da Vinci Code from 2003, but allegedly Vidal started his writing
of the novel already in 1996. Vidal’s novel is a mixture between historic adventure
and mystery thriller. It was a huge success in Spain and Germany. There is currently
no English version available.
In the year 1582, Raimundo Randa is waiting to be condemned to death—
he was trying to solve a mystery all his life. This mystery is about a parchment
with cryptic characters, where a unique power is behind. Around 400 years later
the American scientist Sara Toledano is fascinated by this power until she vanishes
in Antigua. Her colleague, the cryptographer David Calderon, and her daughter
Rachel are searching for her, and simultaneously they try to solve the code. But
also secret organizations like the NSA chase after the secret of the last key. They
don’t hesitate to kill for it.

Perdition (original title: Flickan som lekte med elden), Stieg Larsson, 2006.
The author was posthumously awarded in 2006 with the Scandinavian thriller
award. The superhero Lisbeth Salander uses PGP and occupies herself with
mathematical riddles like the Fermat theorem.

The Judas Documents (original title: Die Judas-Papiere), Rainer M. Schröder,

Arena, 2008.
In the year 1899 Lord Pembroke has three men and one woman in his grip. So they
have to follow his order to try to decipher the encrypted messages in the notebook
of his dead brother Mortimer and to find the missing gospel according to Judas,
which could shock the whole of Christendom. The four people therefore have to
solve riddles at many places in the world. The story explains some classic ciphers
like Polybius and Freemason.

A King for Germany (original title: Ein König für Deutschland), Andreas
Eschbach, Lübbe, 2009.
The novel deals with manipulations of electronic voting machines.
Vincent Merrit, a young American programmer, is blackmailed to write such
a program. Besides commercially oriented blackmailers, massively multiplayer
online role-playing games (MMORPGs) and live action role playing (LARP), have
a role. Because Merrit assumed that his program will be misused, he installed a
trapdoor: If a party with the name VWM participates at the election, it auto-
matically gets 95% of the votes. The fictional story line is based on many veri-
fiable and well researched facts, which are referenced in footnotes. While the

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 608 — #8

i i

608 Miscellaneous

cryptographic protocols themselves could be made secure, their implementation

and their organizational management stays susceptible against misuse.
Currently there is no English translation of the book.

Tetraktys, Ari Juels, Emerald Bay Books, 2009.

The plot exposes the vulnerability of modern computer based identity, authentic-
ity, and security interweaving modern cryptography with classical art and literature.
Cryptographer and classicist Ambrose Jerusalem is a University of California Berke-
ley graduate with a beautiful girlfriend and a comfortable future, until the NSA
recruits him to track a strange pattern of computer break-ins. Many small pieces
provide disturbing evidence that someone has broken RSA encryption. Even more
bizarre, a secret cult of latter-day followers of Pythagoras, the great Greek mathe-
matician and philosopher who believed reality could be understood only through a
mystical system of numbers, appears to be behind the attacks.

Daemon, Daniel Suarez, Penguin Books, 2009

This is considered as one of the most exciting books during the last few years—
it’s a near-science-fiction thriller combining developments in the real world and
possibilities coming from current research like from the Google X Lab (augmented
reality head-mounted displays (HMD) like Google glass, self-driving cars, and 3-D
printers) to a plausible story.
After the computer genius and game developer Matthew Sobol died, a daemon
starts acting on the internet, which seemingly ruthlessly manipulates and trains
more and more humans and companies.
By ruling the data everybody seems to be a helpless victim. All the communi-
cation of his mercenary soldiers is affected by high-tech and encryption—also the
communication between the distributed instances of his incarnation. Core is an
MMORPG game which reminds many of WoW. Here also encryption is used, for
example, to advertise the best players: m0wFG3PRCoJVTs7JcgBwsOXb3U7yPxBB.
The plot is without redundancy, complex, manifold, very fascinating, and with
its critique of the plutocrats it also contains concrete social elements. The end is
open. And the ideas seem to be realizable in the very next future.

Freedom (TM), Daniel Suarez, Penguin Books, 2010

“The propulsive, shockingly plausible sequel to the bestseller Daemon.” Freedom
(TM) (Daemon #2) patches a number of holes the writer left in the first book. The
prose is tighter, the descriptions more direct, the characters are fleshed out, espe-
cially Loki. Having laid the groundwork in Daemon, Suarez uses this foundation
in order to explore a new concept of social organization based on empowering
information technology and the reasoning why and how the battle runs between
the old potentates and the daemon society, which also evolves further during the
story. Cryptography is a natural part of modern technology and modern warfare as
described in this book. The new society emerging in Freedom (TM) is based on the
darknet, an alternative to the internet using fast wireless meshes in order to increase
the durability and availability of the network. Despite the story being shocking in
some parts, it appears to be realistic and not far away from the parallel usage of

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 609 — #9

i i

B.1 Movies and Fictional Literature with Relation to Cryptography 609

modern technology integrated into our modern lives as a virtual world overlaying
our real world.

Rafael 2.0, Karl Olsberg, Thienemann Verlag, 2011

Michael and Rafael Ogilvy are talented twins who get along very well. Before the
terminally ill Rafael dies, his father developed a virtual computer effigy of him, an
artificial intelligence (AI). This is a good kept secret until Michael one day finds out
what his father is hiding before him. However, his first horror soon turns into joy.
So he still has something that reminds him of his brother.
But this computer system is also interesting for the military. One day Michael’s
father is kidnapped and the company, and thus also the computer program Rafael
2.0, falls into the wrong hands. Michael is banished by his uncle in a boarding
school, from which he can flee. Henceforth, Michael and his friends try their best
to find his father, whom they assume was abducted by a competing company. From
there the story gets really exciting. Michael learns that there is another artificial
intelligence, Metraton, which is not so well-disposed to the people. Nothing is
too engrossed; young teenagers are the target audience. Nevertheless, depth and
substance are created when, for instance, the machinations in acquisitions are
discussed.
From a crypto perspective, the section about factoring is thrilling: With a
variant Michael can detect whether the computer is cheating.

The Fifth Murderer (original title: Der fünfte Mörder), Wolfgang Burger, Piper,
2011.
Location and time of the story: Germany / Heidelberg, 1990 to 2009. Episode
7 of the Alexander-Gerlach series. Inspector Alexander Gerlach almost became a
victim of a bomb blast when the sport utility vehicle (SUV) of a Bulgarian panderer
exploded. Gerlach starts investigating because he wants to prevent a gang warfare,
but then his bosses call him off. When the journalist Machatschek supports Gerlach,
he communicates with him only via Skype using an add-on encryption program
which he believes is the most secure in the world.

Master of the Universe: Master of all Staff (original title: Herr aller Dinge),
Andreas Eschbach, Lübbe, 2011.
This novel deserved a much broader audience: The idea in it of the “most terrific of
all crimes,” which is the origin of the whole story, is new and almost revolutionary,
but also infinitely sad. Along the failing partnership of Hiroshi (inventor genius)
and Charlotte, important topics like justice, human wealth, and power are dealt
with.
From a crypto perspective, Hiroshi uses distributed calculations and deve-
loped an encryption and backup system which misleads the government which
bugged him.

Blackout – Tomorrow is too Late (original title: Blackout – Morgen ist es zu spät),
Marc Elsberg, Blanvalet, 2012.
During a cold day in winter, all power supply networks in Europe break down.
Agencies, energy suppliers, and security companies are in the dark and unable to

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 610 — #10

i i

610 Miscellaneous

solve the problem. The Italian computer scientist Piero Manzano believes that this is
caused by terrorists using hackers: All customers use smart meters, electricity meters
controlled by software that was manipulated. Despite the integrated security and
encryption components, they have been hacked, and are out of order by wrong
control sequences. The terrifying consequences happening at various locations are
described realistically and excitingly, as are the reactions of the human beings.

The Eigths Revelation (original title: Die achte Offenbarung), Karl Olsberg, Auf-
bau Taschenbuch, 2013.
Can a message from the past change our future? An ancient, encrypted manuscript
fell into the hands of historian Paul Brenner. The more he decodes the text, the more
puzzling the content is: Because the book tells with remarkable precision events
years ahead of the time of its presumed creation. While highly dangerous genetic
material disappears from a U.S. laboratory, someone tries to prevent, at any price,
Paul from deciphering the last (the eighth) revelation. A gripping thriller about a
shockingly realistic apocalypse with many human aspects.
As a reader, you can participate in the deciphering of the manuscript.
The experiments of Paul to make the right persons aware of his discovery and
to correct it later, are described very excitingly—even chief editors have a dilemma
with conspiracy.
The cipher on the last book page is offered as a challenge in the crypto
competition MTC3: https://mysterytwister.org/challenges/level-1/the-
last-note.

ZERO – They Know What You Are Doing (original title: ZERO – Sie wissen, was
du tust), Marc Elsberg, Blanvalet Verlag, 2014.
London. In a pursuit a boy is shot. His death takes the journalist Cynthia Bonsant
to the acclaimed internet platform Freemee. Freemee collects and analyzes data, and
thus promises its millions of users (rightly) a better life and more success. There is
only one who warns about Freemee and about the power that the online newcomer
could give just a few: ZERO, the most searched online activist in the world. As
Cynthia begins precisely to research, she’s becoming the quarry. And in a world of
cameras, headsets, and smartphones there is no escape.
Highly topical and menacing: the transparent person under control. The novel
takes place in the near future (fiction) and contains many contemporary references
such as PRISM, predictive analytics, and gamification. By the way, references to
well-known science fiction media like The Running Man, Monkey Wrench Gang,
V as Vendetta (V wears a Guy Fawkes mask, now the hallmark of Anonymous),
Network, and Body Snatchers are processed.
Technologically / cryptologically the protagonists move on the highest level,
which is not further explained: Alice Kinkaid communicates with a Raspberry Pi.
Cynthia’s daughter Vi uses mesh networks.

Genocide of One, Kazuaki Takano, 2014. (Orginal in Japanese: Jenosaido, 2011;

as paperback in English again under the title Extinction, 2016)
The cover text of the English version (Mulholland Books, 2014) says: “He is a new
kind of human. He may mean the end for the rest of us... One bright morning in
Washington D.C., the U.S. President learns of a terrifying new threat to national

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 611 — #11

i i

B.1 Movies and Fictional Literature with Relation to Cryptography 611

security. Soon afterward, American mercenary Jonathan Yeager is asked to lead a

team into the Congo to eliminate a mysterious enemy—a job which will help him
pay for treatment for his dying son. But when they reach Africa, the threat turns
out to be a three-year-old child named Akili: the next step in human evolution. The
soldiers are under orders to kill the boy before his full potential can be realized. Yet
Akili’s advanced knowledge might be the only hope Yeager has to save his son’s life...
With time running out to choose a side, Yeager must decide whether to follow his
orders or to save a creature who may not be as harmless or innocent as he appears.
Because Akili is already the smartest being on the planet, with the power to either
save humanity—or destroy it.”
This is a very exciting book. After having overcome the first 100–200 pages
you’ll be awarded with surprising insights. According to the recensions, it’s very
well researched, but not for superficial readers.
From a crypto perspective, RSA and OTP are direct drivers of the story and
are explained correctly. Breaking RSA by factorization is so important that the CIA
wouldn’t accept that this knowledge isn’t in their ownership.

The Girl in the Spider’s Web, David Lagercrantz, Quercus, 2015.

This is the fourth novel in the Millennium series, and the first not written by Stieg
Larsson. While Mikael Blomkvist’s print medium is struggling to survive, the reader
gets more and more insight in the inner structures and the combinations of pub-
lishers, secret services, public agencies, organized crime, and industrial espionage.
Here, no care is taken for single humans, and normal humans would have no chance
against this mix of interests. However, the special skills of Lisbeth Salander make a
difference, and so the NSA is informed that parts of it are led and misused by orga-
nized crime. The characters of the Millenium trilogy have been developed further
in a credible way. Very exiting.
From a crypto perspective, Lisbeth and August deal with elliptic curves to
crack RSA.

Remark 1:
A long list of (partly commented) samples of cryptology in fictional literature can be
found on the following German web page: https://www.staff.uni-mainz.de/
pommeren/Kryptologie/Klassisch/0_Unterhaltung/. For some older authors
(e.g., Jules Verne, Karl May, Arthur Conan Doyle, and Edgar Allen Poe) there are
even links to the original and relevant text pieces.

Remark 2:
You can find title pages of some of these books on the website of Tobias Schrödel,
who collects classic books about cryptography: https://cryptobooks.org/.

Remark 3:
If you know of further books and movies, where cryptography has a major role then
we would be very glad if you could send us the exact title and a short explanation
about the movie/book’s content. We will insinuate your possible enthusiasm for a
title. Thanks a lot.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 612 — #12

i i

612 Miscellaneous

B.1.2 For Kids and Teenagers

The following list contains movies and children’s books. The children books contain
both stories, and collections of simple ciphers, prepared in a didactic and exciting
manner (please send us similar English children books and children movies, because
at the moment our list contains mostly German children books).

Top Secret – The Book for Detectives and Spies (original title: Streng geheim – Das
Buch für Detektive und Agenten), author unknown, Edition moses, year unknown.
This is a thin book for small kids with Inspector Fox and Dr. Chicken.

The Three Investigators: The Secret Key (Original German title: Die 3 ???: Der
geheime Schlüssel nach Alfred Hitchcock (volume 119), Robert Arthur, Kosmos-
Verlag (from 1960).
The three detectives Justus, Peter, and Bob have to decrypt covered and encrypted
messages within this story to find out what is behind the toys of the Copperfield
company.

Ciphers (original title: Geheimschriften), Karl-Heinz Paraquin, Ravensburger

Taschenbuch Verlag, 1988 (1st edition 1977).
On 125 pages filled with a small font this mini format book explains many methods
that children can apply directly to encrypt or hide their messages. A little glossary
and a short overview about the usage of encryption methods in history complete
this little book.
Right at page 6 it summarizes for beginners in an old fashion style “The
Important Things First” about paper-and-pencil encryption (compare Chapter 2):
• “It must be possible to encrypt your messages at any place and at any
location with the easiest measures and a small effort in a short time.”
• “Your cipher must be easy to remember and easy to read for your partners.
But strangers should not be able to decrypt them. Remember: Fastness before
finesse, security before carelessness.”
• “Your message must always be as short and precise as a telegram. Shortness
outranks grammar and spelling. Get rid of all needlessness like salutations
or punctuation marks. Preferably use only small or only capital letters.”

The Manual for Detectives. Everything You Need to Know About Ciphers, Codes,
Reading Tracks and the Biggest Detectives of the World (original title: Das Hand-
buch für Detektive. Alles über Geheimsprachen, Codes, Spurenlesen und die
großen Detektive dieser Welt), Matthias Müller-Michaelis, Südwest, 2002.
A small collection on 62 pages.

Top Secret! – How to Encrypt Messages and to Hack Codes (original title: Streng
geheim! – Wie man Botschaften verschlüsselt und Zahlencodes knackt), Rudolf
Kippenhahn, rororo, 2002.
In this novel, a grandpa, an expert for secret writings, teaches his four grandchildren
and their friends how to encrypt messages that nobody should read. Because there

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 613 — #13

i i

B.1 Movies and Fictional Literature with Relation to Cryptography 613

is someone who hacks their secrets, the grandpa has to teach them more and more
complicated methods.
Within the framework of this story, the most important classic encryption
methods and their analyses are explained in a manner exciting and appropriate
for children.

Top Secret. The Big Book for Detectives (original title: Streng geheim. Das große
Buch der Detektive), Corinna Harder and Jens Schumacher, Moses, 2003.
A collection on 118 pages.

Your Mission in the Weird Villa. Riddle Thriller (original title: Dein Auftrag in
der unheimlichen Villa. Kennwort Rätselkrimi), Helga Talke and Milena Baisch,
Loewe, 2003.
From 4th form. http://www.antolin.de.
Young detectives solve simple ciphers and codes during their missions.

The Three Investigators: Manual for Secret Messages (original title: Die 3 ???:
Handbuch Geheimbotschaften), Bernd Flessner, Kosmos, 2004.
On 127 pages you learn in an easy and exciting manner, structured by the method
types, which secret languages (like the one of the Navajo Indians or dialects)
and which secret writings (real encryption or hiding via technical or linguistic
steganography) existed and how simple methods can be decrypted.
The author tells where in history the methods were used and in which novels
authors used encryption methods [like in Edgar Allan Poe’s The Gold Bug, like with
Jules Verne’s hero Mathias Sandorf, or like with Astrid Lindgren’s master detective
Blomquist who used the ROR language (similar inserting ciphers are the spoon or
the B language)].
This is a didactically excellent introduction for younger teens.

The Treasure of the White Hawks (original title: Der Schatz der weißen Falken),
directed by Christian Zübert, 2005.
This exciting adventure movie for kids ties in with the tradition of classics like Mark
Twain’s The Adventures of Tom Sawyer and Huckleberry Finn or Enid Blytons “The
Famous Five.” The plot happens in summer 1981. In an old half tumbledown villa
three young kids find the treasure map of the “White Hawks,” which they decrypt
with the help of a computer. Traced by another gang they aim to go to an old castle.

The Three Investigators: Secret Messages (German version: Die 3 ???:

Geheimnisvolle Botschaften) (volume 160), Christoph Dittert, Kosmos, 2011.
In the house of Professor Mathewson an old hand-made book was stolen. The
three detectives Justus, Peter, and Bob are getting attacked by a ruthless opponent,
who seems to be always a step ahead. A major part in this story is played by a
palimpsest, an ancient manuscript page, written upon newly. Using X-rays they
can make visible again the old text below. It’s not only the story that is exciting, but
also the way, how the instruction for a treasure hunt is encrypted. Despite using the
simple railfence cipher it’s not easy to solve it, as the message is distributed onto
two slips and the printed symbols don’t mean single letters.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 614 — #14

i i

614 Miscellaneous

Remark 1:
You can find title pages of many of these kid books on the website of Tobias
Schrödel, who collects classic books about cryptography: https://cryptobooks
.org/.
Remark 2:
If you know of further books that address cryptography in a didactic and, for chil-
dren, adequate way, then we would be very glad if you could send us the exact book
title and a short explanation about the book’s content. Thanks a lot.

B.1.3 Code for the Light Fiction Books

Section B.1.1 lists The Gold Bug by E.A. Poe as the first book.
Using Python Example B.1 [4] you can decrypt the ciphertext of Captain Kidd
(see the original text of The Gold Bug in http://pinkmonkey.com/dl/library1
/gold.pdf, page 21).
The code already contains the ASCII characters of the ciphertext and the cor-
related alphabets for the plaintext and the ciphertext of this monoalphabetic cipher
(MASC). Alternatively, you could use SageMath Example 2.1 which also uses a
self-defined alphabet.
The easiest way to perform the decryption is using the SageMathCell server
(https://sagecell.sagemath.org/) in a browser: There you can switch between
the programming languages Sage and Python. The code can be executed by inserting
it with “copy-and-paste” and then pressing “Evaluate.”

Python Example B.1: Decryption of the Gold-Bug Ciphertext from the Novel
of E.A. Poe (with Python)
print ("\n# Appendix_B --SAMPLE 010: =========")

# Chap. B.1.3 Code for the light fiction books

# Decryption of the Gold -Bug ciphertext from the novel of E.A. Poe
# Usage on terminal: python appB1 _sample01.py (needs Python 3)

PA = 'ETHSONAIRDGLBVPFYMUC '
print('Plaintext alphabet PA: ', PA , ' Length of PA ', len(PA))
CA = "8;4)+*56(!302 '.1:9?-"
print('Ciphertext alphabet CA:', CA , ' Length of CA ', len(CA))

codetableC2P = str.maketrans(CA ,PA) # the strings CA and PA must have �

� the same length

C = '''53++!305))6*;4826)4+.)4+);806*;48!8'60))85;1 +(;:+*8!83(88)5*!;46
(;88*96*?;8)*+(;485);5*!2 :*+(;4956*2(5*-4)8'8*;4069285);)6!8)4++;1(+9;4
8081;8:8+1;48!85;4)485!528806*81(+9;48;(88;4(+?34;48)4+;161;:188;+?;'''
P = C.translate(codetableC2P);
print('\nKidd decrypted :')
print(P)

# if str contains symbols not in the translation , they are left �

� unchanged
intab = "aeiou"

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:42 — page 615 — #15

Bit: Upper/Lower Case and Abbreviation

The “bit” is not defined in the International System of Units (SI). However, the Inter-
national Electrotechnical Commission issued standard IEC 60027, which specifies
that the symbol for binary digit should be “bit,” and this should be used in all mul-
tiples, such as “kbit,” for kilobit. However, the lower-case letter “b” is widely used
as well and was recommended by the IEEE 1541 Standard (2002). In contrast, the
upper-case letter “B” is the standard and customary symbol for byte.
So as a unit in formulas, we write “bit” in lower case and without the plural “s.”
Remark: The unit for quantum information is qubit.

Byte: Upper/Lower Case and Abbreviation

The unit symbol for the byte was designated as the upper-case letter “B” by the IEC
and by the Institute of Electrical and Electronics Engineers (IEEE).

1000B = 1 kB = 1 kilobyte
1024B = 1 KiB = 1 kibibyte = 1 KB [sometimes wrongly as 1 kilobyte]

Hyphens

Public-key cryptography: Hyphen if the two words are used like one adjective.
Brute-force attack: Hyphen if the two words are used like one adjective.
https://www.scribendi.com/academy/articles/hyphenation.en.html.
https://dictionary.cambridge.org/grammar/british-grammar/hyphens.

References

[1] Kippenhahn, R., VerschlüsselteBotschaften: Geheimschrift, Enigma und Chipkarte, 1st ed,
Rowohlt, 1997.
[2] Slade, R., REVIEW: “Kim,” Rudyard Kipling, 2006, http://catless.ncl.ac.uk/Risks/24.49
.html%5C#subj12.
[3] Kipling, R., Kim, https://kipling.thefreelibrary.com/Kim.
[4] Witten, H., I. Letzner, and R.-H. Schulz. “RSA & Co. in der Schule: ModerneKryptologie,
alteMathematik, raffinierteProtokolle, Teil 1: Sprache und Statistik”. In: LOG IN 3/4,
1998, pp. 57–65, https://informatik.schule.de/krypto/.
[5] Wright, J., Siunitx—A comprehensive (SI) Units Package, 2023, https://ctan.org/pkg/
siunitx.

i i

i i
i i

“Esslinger” — 2023/11/30 — 19:43 — page 617 — #1

i i

CHAPTER 0
CHAPTER 0

About the Author

Bernhard Esslinger
Initiator of the CrypTool project, editor, and main author of this book. Professor
for IT security and cryptography at the University of Siegen. He is the former CISO
of SAP AG, and former head IT security at Deutsche Bank.
Email: [email protected].

Contributors

Doris Behrendt
Author of Section 6.5 (“The RSA Plane”). Mathematician, member of CT Team
since 2018. Took over the project lead of the CT project in 2023 at Bundeswehr
University, Munich. Email: [email protected].

Matthias Büger
Contributor to Chapter 8 (“Elliptic-Curve Cryptography”). Research analyst at
Deutsche Bank.

Miroslav Dimitrov
First author of Chapter 11 (“Lightweight Introduction to Lattices”). Bulgarian
Academy of Sciences. Email: [email protected].

Bartol Filipovic
Original author of the CT1 elliptic curve implementation and of Chapter 8
(“Elliptic-Curve Cryptography”).

Martin Franz
Original author of Chapter 10 (“Homomorphic Ciphers”). Works and carries out
research in the area of applied cryptography.

Erik Dahmen
Coauthor of Chapter 13 (“Future Use of Cryptography”). Researcher at the
Chair for Theoretical Computer Science (Cryptography and Computer Algebra),
Department of Computer Science, Technische Universität Darmstadt, Germany.

Ulrich Vollmer
Coauthor of Chapter 13 (“Future Use of Cryptography”). Researcher at the
Chair for Theoretical Computer Science (Cryptography and Computer Algebra),
Department of Computer Science, Technische Universität Darmstadt, Germany.

Antoine Joux
Coauthor of Chapter 12 (“Solving Discrete Logarithms and Factoring”). Antoine
Joux is the holder of the Cryptology chair of the Foundation of the University
Pierre et Marie Curie (Paris 6) and a senior security expert at CryptoExperts, Paris.
He worked in various fields of cryptanalysis, and he is a key player in the recent
advances in computing discrete logarithms in fields of small characteristic.

Arjen Lenstra
Coauthor of Chapter 12 (“Solving Discrete Logarithms and Factoring”). Arjen
Lenstra is a full professor at École Polytechnique Fédérale de Lausanne (EPFL) and
head of the laboratory for cryptological algorithms. He is one of the inventors of
the best algorithm currently available for factoring integers (the number field sieve).
He was involved in many practical factoring records.

Beáta Megyesi
Coauthor of Chapter 3 (“Historical Cryptology”). Professor of computational
linguistics, Uppsala University, Sweden. PI of the DECRYPT project. Email:
[email protected].

Alicia Fornés
Coauthor of Chapter 3 (“Historical Cryptology”). Computer Vision Center, Uni-
versitat Autònoma de Barcelona, Spain.

Benedek Láng
Coauthor of Chapter 3 (“Historical Cryptology”). Historian of science, Eötvös
Loránd University, Budapest, Hungary.

Michelle Waldispühl
Coauthor of Chapter 3 (“Historical Cryptology”). Associate professor of German
linguistics and language acquisition at the University of Gothenburg, Sweden.

collision, 362 Best known attacks
costs, 18 about, 14–15
defined, 18 classical ciphers, 15
dictionary, 365 historical ciphers list, 16
distinguishing, 19 modern ciphers, 15
impersonation, 369–70 modern ciphers list, 17–18
key-recovery, 19 See also Attacks
with known plaintext, 416–17 Binary and decimal systems, special values,
known-plaintext (KPA), 20 182–83
related key, 20 Binary translation function, 519
single-key, 19 Bitblocks, 397–98, 411–12
success, 19 Bits
time, 18–19 about, 394
time-memory trade-off (TMTO), 22 composition, 394–95
types of, 16–23 denotation, 397
variable-key, 19 See also Boolean functions
Augmented matrix, 483, 485 Blockchain, 361
Autokey, 54 Block ciphers
Automatic transcription about, 414
about, 115–16 AES, 424–26
document preprocessing, 116–17 algebraic cryptanalysis and, 415–18
future of, 119 block length, 415
layout segmentation, 117 Boolean maps and, 416
symbol segmentation and transcription CBC mode, 421–22
example, 117 CTR mode, 422
text/cipher recognition, 117–19 ECB mode, 420–21
See also Transcription general description, 414–15
Automorphism groups, 331, 335 key length, 415
Automorphisms, 315, 330, 331, 334, 346 modes of operation, 420–22
Avalanche effect, 363 OFB mode, 422
Axis points, 322, 323, 351–52 outlook on, 426–27
security criteria, 423–24
B single round of, 419
Baconian cipher, 50 statistical analysis, 422–23
Base representation, 270–71 strong, 420
Bazeries cipher, 58 structure of, 418–20
BBS generator Book cipher, 50
about, 455 Boolean functions
example, 455–56 about, 394
perfectness of, 459 algebraic normal form (ANF), 412–14
sequence generation, pseudorandom bits, bitblocks and, 397–98
456–57 bits and, 394–95
Beale cipher, 51 conjunctive normal form (CNF), 399
Beaufort, 55 description of, 395–96
Benford’s law, 171 interpretations of bitblocks, 411–12
linear form, 404

Esslinger index.indd 622 11/30/2023 1:29:39 PM

Index 623

linear maps, 404–6 hidden ASCII, 488

logical expressions, 398 leetspeak, 498
maps, 403–4 RSA, 516–17
monomial expression, 399–400 RSA attack for small exponents, 523
number of, 396–97 superincreasing knapsacks, 506
polynomial expressions, 398, 399–402 system of linear equations, 486
representation of, 411–14 system of linear equations as a picture, 482
systems of linear equations, 406–11 vector, 502–3
of three variables, 403 Ché Guevara cipher, 49
truth tables, 396, 397, 412 Chinese remainder theorem (CRT), 166,
of two variables, 402–3 227–29, 292, 301, 314, 331, 556, 567
Boolean maps Chor-Rivest cryptosystem, 532–33
about, 403 Chosen-ciphertext attacks (CCA), 20
affine, 406 Chosen-plaintext attacks (CPA), 20
block ciphers and, 416 Cipher keys
example, 404 about, 99
interpretations of bitblocks, 411–12 difficulties, 108
linear, 404–6 finding, 106–7
representation of, 404, 411–14 handwritten, recognizing, 106
value table, 405 illustrated examples, 99–100
Borg cipher, 98 large-scale statistical analyses, 133
Brun’s constant, 174 nomenclature elements, 100–101
Brute-force attacks, 4–5 plaintext elements, 100
BSA (German Information Security Agency), representation of layout, 112
231 as tables, 101
transcribing, 112
C Ciphers
C158, 240–41 about, 98
C307/M1039, 242–43 best known attacks on, 14–16, 17–18
Cadenus cipher, 44–45, 46, 82 block, 414–27
Card games, 60 combining substitution and transposition,
Carmichael numbers, 153–55 56–60
Catlan, Eugene Charles, 161 common notation when using, 2
CBC mode, 421–22 defined, 113
Certification authority (CA) defined by ACA, 73–74
about, 370 historical, analyzing, 103–6
proving identity to, 370 homomorphic, 467–74
public key, using, 371 open-access publications on cracking, 74
signature validation, 372 paper-and-pencil (P&P), 39–63
Challenges precomputer, 63–64
encryption with knapsacks, 509 stream, 427–63
equations, 479–80 substitution, 45–56
even harder RSA attack for small exponents, symmetric, educational examples, 30–32
524 transposition, 40–45
harder RSA attack for small exponents, XOR, 427–29
523–24 Ciphertext-only attacks (COA), 20

Esslinger index.indd 623 11/30/2023 1:29:39 PM

624 Index

Ciphertexts cost functions, 129–31

about, 97–103 defined, 113
deciphering, 104 differential (DCA), 422, 423, 424
defined, 98, 113 differential-neural, 426
examples of, 98 heuristic algorithms for, 121–29
as handwritten or printed, 104 historical cryptology, 120–31
RSA, cracking, 289 lattice-based, 510–13
transcription of, 104, 111 linear (LCA), 422
word boundaries, 98 of random generator, 447
Cleartext, 113 tokenization, 120–21
Clocking, 449, 450 Cryptocurrencies, 361
Closest vector problem (CVP), 544, 547 Cryptographic infrastructure, 576–77
Closure, 207 Cryptography
Code elements, 113 about, 1
Codes, 47 asymmetric, 301–58
Code separator/token separator, 113 asymmetric, applications, 252–57
Collision attacks, 362 elliptic-curve (ECC), 375–91
Column transposition, 42–43 future use of, 581–87
Complexity classes, 302–3 inverses and, 210
Composite numbers, 200 mathematics and, 195–96
Computational complexity, 18 movies and fictional literature and, 601–15
Computational security, 22 multivariate, 585
Computer-algebra system (CAS), 196 number sets and, 206
Congruences postquantum, 540–41, 585–86
about, 206 public-key, 229
Chinese remainder theorem and, 556 quantum, 585
divisibility and, 201 Cryptology
equivalence relation, 206 defined, 1
linear, solving systems of, 314 embedding between risk management and
working with, 203–6 science, 587
Conjunctive normal form (CNF), 399 historical, 97–135
Consecutive prime arithmetic progressions, importance of, 1, 2
169–70 modern, 97
Constant-based algorithms, 24 references and resources, 24–25
Convolutional neural networks (CNN), 118 CrypTool
Cooperative networking, 150 about, xv
Coordinate vectors, 491 lattices and, 542–52
Copiale cipher, 98 overview of functions, 542
Cost functions recommended spelling and, 615–16
about, 129 CrypTool 1 (CT1)
cryptanalysis, 129–31 about, xvi, xxi
hill-climbing algorithm, 123–24 attack on stereotyped message dialog, 544
Cryptanalysis brute-force analysis of AES in, 9
about, 1, 120, 415 dialogs, 543–44
algebraic, 415–18 elliptic curves, 389
cipher type and alphabet and, 120 factoring dialog, 543, 545
cipher types and, 106 menus, 589

Esslinger index.indd 624 11/30/2023 1:29:39 PM

Index 625

RSA signature generation, 368 CTTS (CryptTool Transcriber and Solver)

CrypTool 2 (CT2) about, 106, 114
about, xvi, xxi ciphertext cryptanalyzed with, 116
AES in, 26–28 ciphertext transcribed with, 115
brute-force analysis of AES in, 10 steps for manual transcription, 114–15
differential cryptanalysis, 423 Cunningham Project, 240, 242
lattice-based tutorial, 544–46, 547, 548, C(X)-52 (Hagelin)
549, 550, 551 about, 65
LFSR in, 443 architecture, 67–69
number-theoretic functions in, 218–19 in CT2, 71–72
with Paillier and DGK, 472–74 encryption principle, 65–66
templates, 590, 593 evolution and influence, 72–73
Workspace Manager, 592 key space size, 68–69
CrypTool-Online (CTO) machine differences, 69–70
about, xvi, xxi–xxii Operation Rubicon and, 70–71
AES animation in, 26 printer offset, 69
CRYSTALS-Kyber in, 552 pseudorandom displacement generator,
encryption with OpenSSL, 235 67–68
first functions display, 598 settings, 72, 73
fixed points, 294 wheels advancement, 68
functions, 594– 99 See also Hagelin machines
operations on elliptic curves, 385 Cycles
poll demo in, 474 about, 227
RSA in, 259, 260, 262, 263 length of, 227–29
signature validation and validity models, LFSR efficiency and, 454
373 Cyclic groups, 379
starting page, 596
CrypTool Variants, xvi, 25, 309, 472 D
CrypTool website, xvi Databases of ciphers, 108–9
Crypto Procedures Data Encryption Standard (DES)
algorithm ATTACK-Orton, 537–38 about, 4–5, 583
algorithm BASIS, 530–31 SDES, 32
algorithm enum(j,k), 531–32 Triple-DES, 4
algorithm enumx, 538–39 Decimation, 449
algorithm for block reduction, 530 DECODE database, 105–6, 134
algorithm for LLL reduction, 528 Decomposition into prime factors
algorithm for size reduction of basis vector, for public-key procedures, 305–9
528 Rabin public-key procedure, 308–9
decryption by Orton, 534 RSA procedure, 305–8
deep insertions, 529 Decryption
encryption by Orton, 533 CBC mode, 421
factoring, 539–40 defined, 113
Gauss reduction algorithm, 526 by Orton, 533
subroutine increase(t), 539 Paillier cryptosystem, 470
Crypto process, 103 RSA procedure, 259, 262, 264, 265
CRYSTALS-Kyber algorithm, 541, 552 DECRYPT project, 105, 109, 111–12
CTR (counter) mode, 422

Esslinger index.indd 625 11/30/2023 1:29:39 PM

626 Index

Deep insertions, 529 Distributivity, 499

Deep learning-based architectures, 118 Divisibility, 201–3, 205, 352
Diagonal matrix, 487 Division with remainder, 203–4
Dictionary attacks, 365 Double column transposition (DCT), 42, 59–60
Differential cryptanalysis (DCA), 422, 423, 424 Double Mersenne primes, 162–63
Differential-neural cryptanalysis, 426 DSA signatures, 367–69
Diffie-Hellman key agreement, 310–11
Diffie-Hellman key-exchange protocol E
about, 253–54 ECB mode, 420–21
example using small numbers, 255–57 ECMNET project, 388
procedure, 254–55 Electronic Frontier Foundation (EFF), 150
process illustration, 254 Electronic voting, 471–72
Digital signatures ElGamal public-key encryption procedure,
goal of, 365–66 311–12, 470–71
hash functions and, 361, 366–67 Elliptic-curve cryptography (ECC)
message integrity, 366 about, 375
RSA, 367, 368 efficiency, 375
RSA procedure and, 233 patent aspects, 390–91
signature procedure, 366 security of, 385–87
signing hash value, 366–67 standardization and, 376–77
user authenticity, 365 use of, 391
validation and validity models, 372–73 Elliptic curve discrete logarithm problem
Digrafid cipher, 58–59 (ECDLP), 385–87
Digraphs, substituting by symbols, 53 Elliptic curve method (ECM), 166, 242
Dimensions, 182, 183 Elliptic curves
Discrete exponential function, 312 adding points to, 384
Discrete logarithms algorithms for, 571–74
about, 309–10 in cryptography, 381–83
algorithms for extension fields, 562–67 CrypTool and, 389–90
algorithms for factoring integers, 567–71 for educational purposes, 389–90
algorithms for prime fields, 558–62 encryption, 387
as basis for public-key procedures, 309–14 factorization with, 388–89
calculating, 313 fields and, 379–81
Diffie-Hellman key agreement and, 310–11 Gaudry-Semaev algorithm for, 571–72
ElGamal public-key encryption procedure GHS approach for, 571
and, 311–12 groups and, 378–79
generalized ElGamal encryption procedure history of, 377–78
and, 312–14 key lengths and, 376
generic algorithms for, 555–58 key size versus security for, 573
Pollard Rho method, 556 mathematical basis, 378–81
problem of, 312 operating on, 383
running times, measuring, 557 over prime fields, algorithms for, 572–73
Silver-Pohlig-Hellman algorithm, 556–57 parameters, securely choosing, 574
Disjoint transpositions, 346 programs to add points on, 384–85
Displacement sequence, 66 in pure mathematics, 377–78
Distinguisher, 459 real numbers example, 382
Distinguishing attacks, 19

Esslinger index.indd 626 11/30/2023 1:29:39 PM

Index 627

SageMath and, 390 of small characteristic, 565

signature verification, 388
signing, 388 F
signing and verification time and, 376 Factoring
use of, 378 CT1 dialog, 543, 545
Elliptic index generator (Kaliski), 461 integers, algorithms for, 567–71
ElsieFour cipher, 62 key size versus security for dlog in, 569–71
Embedded backdoors, in cryptographic keys, large integers, 387
575–76 lattice basis reduction, 539–40
Encryption numbers, 287
asymmetric, 5–7 RSA challenge, 239
based on lattice problems, 584 Factorization
CBC mode, 421 about, 147
defined, 113 algorithms, 236–37, 239
elliptic curves, 387 of big numbers, 235
Hagelin C(X)-52, 65–66 breakthroughs in, 235
hybrid, 7, 389 complete, 200
with knapsacks, 509 with elliptic curves, 388–89
Merkle-Hellman knapsack, 304–5 with Eratosthenes’ sieve, 150
method types, 102 gcd calculation execution time comparison,
by Orton, 533 248–49
Paillier cryptosystem, 469 integer, in practice, 569
RSA, 288 of large integers, 237–38
RSA procedure, 258–59, 261, 264–65 Mersenne numbers, 239
running-text, 431–32 number field sieve for, 567–68
XOR, 427–29, 430–31 record size comparison, 244
See also Symmetric encryption; specific research results, 244–52
types of encryption of specific large numbers, 238–44
Enigma, 11–13 with systematic division, 150
ENISA report, 584 Factorizers, 165–66
Equations, 477–80, 491–98 Fast exponentiation, 273
Euclid numbers, 158–59 Fast Fourier transform (FFT), 462
Euler-Fermat theorem Feasibility of distinguishing, 18
about, 220–21 Feedback shift registers (FSRs)
proof of RSA procedure with, 229–33 about, 434, 435
requirement for using, 232 nonlinearity approaches, 447–51
Euler phi function, 218–19, 306–7, 515 stepping, 436
Euler polynomial, 160, 161 See also Linear shift registers (LFSRs)
Exchange protocol, 311 Fermat numbers, 156–57
Experimental space, 147 Fermat pseudoprime numbers, 152
Extension fields Fermat’s last theorem, 196
about, 562 Fermat’s little theorem, 151, 219–20
algorithms for factoring integers, 562–67 Few-shot learning, 118–19
Joux-Lercier FFS and, 562–63 Fields
Joux-Lercier FFS improvements, 563–64 about, 379
quasi-polynomial dlog computation and, characteristic of, 380
564–65

Esslinger index.indd 627 11/30/2023 1:29:39 PM

628 Index

Fields (continued) Gaudry-Semaev algorithm, 571–72

extension, 562–65 Gauss, Carl Friedrich, 164
finite, 379–81 Gauss reduction algorithm, 526
Galois, 317, 381 Geffe generator, 450–52
infinite, 379 Generalized ElGamal public-key encryption
prime, 558–62, 572–73 procedure, 312–14
Finite fields, 379–81 Generalized Fermat numbers, 157
Finite planes Generalized Mersenne numbers, 156
about, 315–16 General number field sieve (GNFS), 236, 567
characteristic, 317 Geometric figures, 43
linear, one-dimensional arrangement, 315 GGH (Goldreich-Goldwasser-Halevi) crypto-
line illustration, 318 system, 545, 550
lines in, 317–19 Gigantic primes, 147
vectors, 316, 317–18 GIMPS (Great Internet Mersenne Prime Search)
See also RSA planes project, 149, 247
Fixed points, RSA GMP-ECM, 388
about, 290–91, 347 Goldbach, Christian, 170
average number of, 295 Goldbach conjectures
CTO, 294 about, 171
example, 296–97 interconnection between, 173
fixed counterclockwise, 347 strong, 172
fixed setwise, 347 weak, 172
number of, 291–92 Gram matrix, 504
property, 295 Gram-Schmidt orthogonalization, 525
quantity for growing moduli, 295–96 Grand Chiffre, 52
roots of unity and, 291 Grandpré cipher, 51
RSA, 290–97 Granit cipher, 59–60
as undesirable, 291 Grille cipher, 41
weak/unsuitable e and, 293–95 Gronsfeld, 55
Four-square cipher, 53 Groups
Fractionation, 57–58 about, 215, 378
Full orbits, 325, 333, 347–52 addition in, 215–16
Fully homomorphic encryption (FHE) methods, cyclic, 379
468–69 finite, 379
Function field sieve (FFS), 562–64 modular arithmetic and, 215–17
Functions multiplication in, 216–17
Boolean, 394–414 order of, 379
CrypTool-Online (CTO), 594– 99
discrete exponential, 312 H
discrete logarithm, 309 Hagelin machines
Euler phi, 253, 301–3, 306–7, 515 about, 63
hash, 361–65 B-21, 64
JCrypTool (JCT), 592–94 BC-38, 65
one-way, 253, 301–3 C-35, 64
C-36, 64
G C-38, 64–65
Galois fields, 317, 381 C-52/CX-52, 65–73

Esslinger index.indd 628 11/30/2023 1:29:39 PM

Index 629

C-362, 64 areas of, 97–98

M-209, 64–65 cipher analysis, 103–6
as models for precomputer ciphers, 63–73 cipher keys, 99–102, 106–8
overview, 63–65 ciphertexts, 97–98, 104
Handwritten text recognition (HTR) methods, conclusion, 134–35
118 contextualization and interpretation, 131–33
Handycipher, 62 cryptanalysis, 120–31
Hardy’s conjecture, 168–69 encrypted sources collection and, 104
Hash functions introduction to, 97–103
about, 361 linguistic analysis, 131–32
attacks, as standardization driver, 362 manuscript collection and, 106–8
attacks on password hashes, 364–65 metadata creation and, 108–9
avalanche effect with, 363 social history, 133
collision resistance, 362 terminology, 98
digital signatures and, 361, 366–67 transcription, 109–19
generic collision attacks and, 362 See also Cryptology
Keccak, 364 HKZ-reduced basis, 527
requirements for, 361–62 Hölder’s inequality, 536, 537
resistance against preimage attacks, 362 Homographies, 565
SHA-1, 363 Homomorphic ciphers
SHA-2, 363 about, 467
SHA-3, 364 applications, 471–72
SHA-256, 365 classification of methods, 468
uses, 361 CrypTool and, 472–74
Heegner numbers, 160–61 decryption function, 468
Heuristic algorithms electronic voting application, 471–72
for cryptanalysis, 121–29 FHE methods, 468–69
hill climbing, 122–26 origin of term, 467–68
simulated annealing, 126–29 Paillier cryptosystem, 469–70
HICRYPT, 135 pre-FHE, 469–71
Hill cipher, 85–88, 92–94 secure multiparty computation (SMC) ap-
Hill-climbing algorithm plication, 472
cost function, 123–24 Homomorphic property, 470, 471, 473
decrypt function and key representation, Homomorphism, 343, 468
123 Homophone locking, 129
goal of, 122 Homophonic substitution, 45, 50–51
key modification, 124 Howgrave-Graham theorem, 518
start key, 123 Hutton cipher, 62–63
steps of, 122 Hybrid encryption, 7, 389
strategies to counter getting stuck, 125
termination criteria, 124–25 I
visualization of, 122–23 Impersonation attacks, 369–70
See also Heuristic algorithms IND-CCA, 234
Historical cryptology IND-CPA, 234
about, 97 Index calculus algorithms, 568–69
analysis and different research approaches, Index generator (Blum/Micali), 461
132–33

Esslinger index.indd 629 11/30/2023 1:29:39 PM

630 Index

Indexing prime numbers, 181–82 elliptic curves and, 389–90

Indistinguishable under adaptive chosen-cipher- functions, 592–94
text attack (IND-CCA2), 21 homomorphic properties, 473
Indistinguishable under chosen-ciphertext at- PKI in, 370–71
tack (IND-CCA1), 21 plugin, Merkle-Hellman knapsack crypto-
Indistinguishable under chosen-plaintext attack system, 547, 552
(IND-CPA), 20–21 with RSA, Paillier, and Gentry/Halevi, 474
Infinite fields, 379 signature validation and validity models,
Information-theoretical security, 22–23 372
Inner points Josse’s cipher, 62
invariant full orbits, 347–51 Joux-Lercier function field sieve, 562–64
orbits of, 338–39
path, 340 K
path, projection, 343 Keccak algorithm, 364
RSA plane and, 322, 323 Kerckhoff’s principle, 7–8, 18
Integer factorization problem (IFP), 236 Key derivation functions (KDFs), 361
Integer lattice, 504 Keys
Integers defined, 113
checking primality of, 189–92 embedded backdoors, possibility of, 575–76
divisibility, 201–3 knapsack, 506
factoring, algorithms for, 567–71 private, 7, 288–90
large, factoring, 387 public, 5, 229, 237, 370, 507, 575
large, forecasts and factorization, 237–38 RSA, 222–24, 232, 247
number of digits representation, 269–70 secret, 5, 7, 20, 311, 576
sum representation, 268–69 See also Cipher keys
Integral lattice, 504 Key spaces
Interrupted key, 54 about, 8
Invariant full orbits assumptions, 11–13
about, 347 conclusions, 13–14
axis points, 351–52 cryptoanalysis methods and, 11
inner points, 347–51 of historic cipher devices, 8–11
Invariant RSA orbits maximum versus practical, 11–12
nonsymmetric, 353 periodic bit sequences, 429–30
symmetric, 352, 354 problems with, 9–11
theorem, 352–53 sizes, 14, 19
Invariants, 346, 347 use of, 8
Inverse matrix, 486 Key stream generation
Inverses methods, 429
about, 208 pseudorandom generators, 434–44
additive, 208–11 running-text encryption, 431–32
cryptography and, 210 true random sequence, 432–34
multiplicative, 208–11, 221–22 See also Stream ciphers
Isomorphism, 315 Klein four group, 328, 332–33, 343
K-means clustering, 118
J Knapsack cryptosystems
JCrypTool (JCT) breaking, 532–39
about, xvi Chor-Rivest, 532–33

Esslinger index.indd 630 11/30/2023 1:29:39 PM

Index 631

Merkle-Hellman, 505–9 and, 505–9

Orton, 533–39 PQC standardization, 541
Knapsack problem, 303–4 reasons for using, 518
Known-plaintext attacks (KPA), 20, 92–94 RSA versus, 517–25
systems of linear equations and, 480–82
L vectors and, 487–91
Label propagation, 118 vector spaces and, 498–503
Large-scale statistical analyses, 133 Laws of modular calculations, 206–7
Largest known prime gaps, 177 Learning-free methods, 118
Lattice-based cryptanalysis, 510–13 Legendre, Adrien-Marie, 164
Lattice-based cryptography CT2 tutorial Length of orbits, 325, 326–29
about, 544–45 Lexicographic order, 398
attack against Merkle-Hellman knapsack Limit point, 174
cryptosystem, 548 Linear cryptanalysis (LCA), 422
attack against RSA, 549 Linear maps, 404–6, 416
CVP, closest vector, 547 Linear shift registers (LFSRs)
GGH cryptosystem, 550 about, 438–39
LWE cryptosystem, 551 algebraic attack on, 444–47
SVP via Gauss, 546 bits needed to predict, 446–47
SVP via LLL algorithm, 546 in CT2, 443
Lattice basis reduction defining in Python/SageMath, 440–41
about, 525 graphical representation, 439
algorithm for LLL reduction, 528–29 nonlinear combiner and, 451–53
algorithm for size reduction, 528 prediction of, 444–46
algorithms, 237 with pylfsr package in Python, 442–44
factoring, 539–40 random properties of sequences, 440
goal of, 525 See also Feedback shift registers (FSRs);
Gram-Schmidt orthogonalization and, 525 Pseudorandom generators
knapsack cryptosystems and, 532 Linguistic analysis, 131–32
lattice algorithm use, 540–41 LLL algorithm, 518, 527, 546
ordered lattice basis and, 527 LLL-reduced basis, 529, 539
size-reduced lattice basis and, 527 LLL-reduction, 527–29
LatticeHacks, 237 Logarithms
Lattice reduction algorithm, 522 about, 214–15
Lattices calculating, 253
about, 584 formula, 269
cryptanalysis and, 510–13 See also Discrete logarithms
CrypTool and, 542–52 Logical expressions, 398–99
with different basis, 504 Long short-term memory recurrent neural net-
encryption based on problems, 584 works (LMRNN), 118
equations and, 477–80, 491–98 Lucas-Lehmer primality test, 246
Gram matrix of, 504 LWE (learning with errors) cryptosystem, 545,
integer, 503 551
integral, 504
matrices and, 483–91 M
Merkle-Hellman knapsack cryptosystem Magnitude, orders of, 182

Esslinger index.indd 631 11/30/2023 1:29:39 PM

632 Index

Manual transcription Mersenne numbers, 144–45, 156, 239

about, 109–10 Mersenne prime numbers
basic principle of, 111 currently known, 148
challenges of, 110–11 defined, 145
CTTS and, 114–15 double, 162–63
damaged documents and, 112–13 examples of, 148–49
example, 114 first 48, 147
goal of, 109 theorem, 146–47
handwriting styles and, 110 See also Prime numbers (primes)
margin notes, 112 Message authentication codes (MACs), 361
See also Transcription Messaging Layer Security (MLS), 586
Manuscript collection, 106–8 Metadata creation, 108–9
Map cipher, 47 Microhistory approach, 133
MASC Mini-AES, 30–32
about, 8 Mirdek cipher, 61
with binary alphabet, 89–90 Modular arithmetic
with hexadecimal alphabet, 88–89 addition and multiplication, 208
with self-defined alphabet, 90–91 additive and multiplicative inverses, 208–11
simple, 45 examples of, 207–15
Mathematics fast calculation of high powers (square and
cryptography and, 195–96 multiply), 213–14
prime numbers and, 140 groups and, 215–17
Matrices laws of, 206–7
about, 483 raising to the power, 211–12
augmented, 483, 485 roots and logarithms, 214–15
definition of product of, 493 Modular division, 207
diagonal, 487 Modulo operation, 203–6, 222–24
Gram, 504 Modulo subtraction, 267–68
inverse, 486 Monoalphabetic substitution
operations for, 492 about, 45
permutation, 501–2 Affine cipher, 46, 83–85
square, 494–95 Atbash cipher, 46, 81
system of linear equations and, 483–84 Baconian cipher, 50
transpose of, 493 Caesar cipher, 46, 82
Maximal prime gaps, 177–79 Ché Guevara cipher, 49
Maximum key space, 11, 12–13 codes, 47
Merkle-Hellman knapsack cryptosystem general, 45
about, 505–6 map cipher, 47
attack against, 548 Nihilist substitution, 46
decomposition procedure, 507 nomenclator, 47
encryption with knapsacks, 304–5, 509 shift cipher, 46, 81–82, 91–92
example, 507–9 straddling checkerboard, 48–49
JCT plugin, 547, 552 with symbols, 46
key generation algorithm, 506 Tri-digital cipher, 49
knapsack keys, 506 Movies/fictional literature, cryptography,
modulus, 506–7 601–15
Mersenne conjecture, 161–62 Msieve library, 166

Esslinger index.indd 632 11/30/2023 1:29:39 PM

Index 633

MS Word files, 429–30 Nulls/nullities, 113

Multiparty computation (MPC), 469 Number field sieve, 560–62
Multiplication Number of digits, 269–70, 271
in groups, 216–17 Numbers
scalar, closure in, 499 Carmichael, 153–55
scalar, neural action of, 499 composite, 200
tables, 208, 209, 210, 211, 272–73 Euclid, 158–59
Multiplicative inverses, 208–11, 221–22 factoring, 287
Multiplicative order, 224, 273–76 Fermat, 156–57
Multivariate cryptography, 585 Heegner, 160–61
MysteryTwister (MTC3), xxii inverse of, 208–11
Mersenne, 145–46, 156
N natural, 196, 207
National Institute of Standards and Technology polynomials versus, 566
(NIST), 238, 362–64, 367, 525, 541, pseudoprime, 152–55
577 special types of, 155–63
National Security Agency (NSA), 4, 7, 575–76 See also Prime numbers (primes)
Natural numbers, 196, 207 Number sets, 206
Negative tests, 151 Number theory
Next bit predictor, 459 about, 195–96
Nicodemus cipher, 59 areas of, 197
Nihilist substitution, 46 convention and notation, 197–99
Nihilist Transposition, 43–44 divisibility, modulus, and remainder classes,
Nomenclator, 47 201–6
Nomenclature, 113 Euler-Fermat theorem, 220–21
Nomenclature-code elements, 113 Euler function and, 218–19
Nomenclature elements Fermat’s little theorem, 219–20
about, 15 finite sets, 206–7
cipher keys, 100–101 fundamental theorem of, 201
cryptanalysis and, 120 groups and modular arithmetic, 215–17
defined, 113 introduction to, 196–99
Nonlinear combiners modular arithmetic, 207–15
about, 448–49 multiplicative order and primitive roots,
design criteria, 453–54 224–29
efficiency, 454 prime numbers and, 199–201
implementation of, 451–53
Nonlinear feedback, 448 O
Nonlinearity for FSRs OFB (output feedback) mode, 422
about, 447–48 One-time pad (OTP), 22–23, 55, 428
nonlinear combiner, 448–49 One-time signatures, 585
nonlinear feedback, 448 One-way functions
nonlinear output filter, 448 about, 253, 301–2
output selection/decimation/clocking, defined, 301
449–51 trapdoor, 302
Nonlinear output filter, 448 Open-access publications on cracking ciphers,
NOT, 394, 398–99 74

Esslinger index.indd 633 11/30/2023 1:29:39 PM

634 Index

OpenSSL, 1, 28–30 Pentium FDIV bug, 174

Operational code elements Period, 438, 451–52
about, 112 Periodic XOR encryption, 430–31
cryptanalysis and, 101 Permutation matrices, 501–2
defined, 113 Phillips cipher, 56
Operation Rubicon, 70–71 Pinpointing, 563
Optical character recognition (OCR) programs, Pinprick encryption, 60
116, 118 Plaintext
Optimal Asymmetric Encryption Padding about, 1–2
(OAEP), 234 alphabet, 113
OR, 394, 398–99 defined, 113
Orbits elements, 100, 113
about, 325 Playfair cipher, 52
defined, 325 Playing card cipher, 61–62
examples of, 325–26 Political history, 133
full, 325, 333, 347–52 Pollard algorithm, 355–57
generator, 325 Pollard Rho method, 556
illustrated, 325, 326 Polyalphabetic substitution
length of, 325, 326–29 about, 45, 53–54
orbit of 2, 330, 344 one-time pad (OTP), 55
orbit of 5, 330 Phillips cipher, 56
orbit of 12, 341, 342 Ragbaby cipher, 56
orbit of 17, 351 Vigenère cipher, 54–55, 85
orbit of 25, 327, 329 Polygraphic substitution, 45, 51–53
orbit of 30, 327 Polynomial expressions, 398, 400, 401–2
orbit of 60, 329 Polynomial functions, 161
orbit of 117, 329 Polynomials, 381, 519, 521–23, 557, 566
orbit of 811, 330 Polyphonic substitution ciphers, 102
RSA, 329–40, 352–55 Porta, 55
See also RSA planes Postquantum cryptography, 540–41, 585–86
Organization, this book, xv–xix Power(s)
Orthogonal projections, 340 high, fast calculation of, 213–14
Orton cryptosystem, breaking, 533–39 modular, calculating, 221
Output selection, 449 raising to, 213–14
Practical key space, 11
P Precomputer ciphers, Hagelin machines as
Paillier cryptosystem, 469–70 models, 63–73
Paper-and-pencil (P&P) ciphers Preperiod, 438
about, 39 Primality testing, 247
combining substitution and transposition, Prime fields
56–60 about, 558
further methods, 60–63 best algorithms for, 558–62
SageMath examples, 74–94 elliptic curves over, best known algorithms
substitution ciphers, 45–56 for, 572–73
transposition ciphers, 40–45 index calculus algorithms, 559–60
Password hashes, 364–65 number field sieve and, 560–62

Esslinger index.indd 634 11/30/2023 1:29:39 PM

Index 635

Prime gaps 20+ largest known, 144, 145

about, 175 twin, open questions, 173–75
examples, 176 unproven statements, conjectures and open
largest known, 177 questions, 170–71
length, 175 visualization of, 180–81
maximal, 177–79 Prime number sequence (PAP), 168, 170
SageMath example, 178 Prime number tests
table, 178 about, 150–51
Prime numbers (primes) negative, 151
about, 139 special properties, 151
arithmetic sequences, 167–70 Prime number theorem, 184
Contact movie (1997) and, 180 Primerecords, 239
defined, 140 Primitive roots
density and distribution of, 163–65 about, 224
distinct, 233 calculating all, 277, 278–79
distribution of, 184–88 calculating for a given range of primes,
EFF challenge and, 150 279–80
elements and elementary particles and, 143 generating database of, 280–81
Euclid and, 143–44 generating database of smallest, 281–82
extremely large, search for, 144–50 generating graphics about, 282, 284–87
within first 390 integers, 141 largest, in all primes, 285
within first 999 integers, 141 number and smallest and biggest, for all
within first 40,000 integers, 142 primes, 285
further topics, 166 number of, of all primes, 284
gigantic, 147 SageMath examples, 276–87
GIMPS, 149 SageMath output, 225
Google recruitment (2004) and, 179 tables, 226, 228
in higher ranges, visualization of quantity of, Private keys, 7, 288–90
184–88 Progressive key, 55
importance of, 139 Projections
indexing, 181–82 about, 340
listening to, 180 inner point path, 343
in mathematics, 140 orbit of 2, 344
Mersenne, 145–49, 162–63 orbit of 12, 342
notes about, 166–80 orthogonal, 340
nth, value of, 164 as part of commutative diagram, 345
number of, 143–44, 164 See also RSA planes
number of, in different intervals, 185–86 Provable security, 23
number of, in various intervals, 180–81 Pseudoprime numbers
number theory and, 199–201 about, 152
peculiar and interesting things, 179–80 Carmichael, 153–55
proven statements and theorems, 166–67 Fermat, 152
pseudoprime, 152–55 strong, 155
SageMath examples, 189–92 See also Prime numbers
search for formula for, 155–63 Pseudorandom displacement generator, 67–68
shared, 247, 250–52 Pseudorandom generators
theorems, 140–42 about, 434

Esslinger index.indd 635 11/30/2023 1:29:39 PM

636 Index

Pseudorandom generators (continued) R

BBS generator, 455–58 Rabin public-key procedure, 308–9
bit sequence visualization, 441 Ragbaby cipher, 56
distinguisher, 459 Rail fence cipher, 40
elliptic index generator (Kaliski), 461 Rainbow tables, 247
feedback shift registers (FSR), 434, 435–38 Raising to the power, 211–12
illustrated, 435 Random-based algorithms, 24
index generator (Blum/Micali), 461 Random generators, 434
linear shift registers (LFSR), 438–44 Raw RSA, 513–17
Micali-Schnorr generator, 461–63 RC5, 4–5
next bit predictor, 459 Reflections
perfect, 454–55 about, 343–46
perfectness and factorization conjecture, about horizontal axis, 346
458–59 about vertical axis, 346
period of finite-state machine, 438 automorphisms and, 346
RSA generator (Shamir), 461 involutions, 343–46
Pseudorandom number generators (PRNGs), See also RSA planes
361 Registration authority (RA), 370
Public-key certification Related key attacks, 20
about, 369 Remainder classes, 204
impersonation attacks, 369–70 Remainder set, 215
signature validation and validity models, Representation
372–73 about, 268
X.509 certificate, 370–71 b-adic sum, 268–69
Public-key cryptography, 229 base, algorithm to compute, 270–71
Public-key infrastructure (PKI), 369 number of digits, 269–70
Public-key procedures Riemannput, Bernhard, 171
decomposition into prime factors as basis, Rijndael algorithm, 4
305–9 Roots
development of, 302 logarithms and, 214–15
Diffie-Hellman key agreement, 311 modular, 166, 214–15
discrete logarithm as basis, 309–14 primitive, 195, 224–29, 276–87
DSA signature, 368 of unity, 291
ElGamal, 311–12 Row transposition, 43
generalized ElGamal, 314 RSA
knapsack problem as basis, 303–5 ciphertext, cracking, 289
Merkle-Hellman knapsack, 304–5 in CTO, 259, 260, 262, 263
Rabin, 308–9 encryption by modular exponentiation, 288
RSA, 306 encryption/decryption, high powers and,
Public keys, 5, 229, 237, 370, 507, 575 213
Puzzle challenges, 482, 489 encryption using SageMath, 288
examples with SageMath, 287–88
Q exponentiation, 287
Quadratic functions, 189–92 fixed points, 290–97
Quantum computers, 557–58 homomorphic properties, 470
Quantum cryptography, 585 implementation security, 234
Quasi-polynomial dlog computation, 564–65

Esslinger index.indd 636 11/30/2023 1:29:39 PM

Index 637

lattices versus, 517–25 action of the map, 322–24

small cipher challenge, 265–67 alternative choice of representatives, 321–22
textbook (raw), 513–17 axis points, 322, 323
use of, 581–82 defined, 314–15
RSA-155, 240, 241 final remarks, 357–58
RSA-160, 241 finite planes and, 315–17
RSA-200, 238, 241–42 inner points, 322, 323
RSA-576, 239 line illustration, 320, 321
RSA-640, 239 lines in, 319–21
RSA-768, 238, 243–44, 570 model, 317
RSA-2048, 581 orbits, 325–40
RSA algorithm points of, 330
about, 7 Pollard algorithm, 355–57
Bernstein’s paper and, 245–46 projections, 340–43
complexity, 236 rectangular two-dimensional arrangement,
factorization algorithms and, 236–37 316
factorization/prime number tests research, rectangular two-dimensional pattern, 322
244–52 reflections, 343–55
factorization status of large numbers, ultrametric, 358
238–44 vertical lines, 320
forecasts and factorization of large integers RSA procedure
and, 237–38 decomposition into prime factors, 305–8
lattice base reduction algorithms and, 237 digital signatures creation and, 233
modulus bit length, 238 functioning of, 230–32
primality testing and, 247 with larger primes, 260–65
private key calculation and, 236 pairs of keys and, 232
security, 234–52 proof with Euler-Fermat, 229–33
security parameters because of new algo- with slightly larger primes, 258–60
rithms, 236–37 with small prime numbers, 257–58
shared primes and, 247, 250–52 with specific numbers, 257–67
TWIRL device and, 246 steps, 230–31
RSA Factoring Challenge, 239 RSA Security, 581
RSA generator (Shamir), 461 RSA signatures, 367, 368
RSA keys Running-key cipher, 55
in modulo 26, 222–24 Running-text encryption, 431–32
shared primes and, 247
testing, 247 S
RSA orbits SafeCurve project, 574
for axis elements, 332 SageMath
of inner points, 338 about, xvi, xxii
invariant, 352–55 BBS generator, 456
length of, 335 Cryptanalysis with, 91–94
nonsymmetric, 353 elliptic curves and, 390
symmetric, 352, 354 RSA examples in, 287–88
See also Orbits sequence generation (BBS pseudorandom
RSA planes bits), 456–57
about, 301, 314 symmetric ciphers using, 29–32

Esslinger index.indd 637 11/30/2023 1:29:39 PM

638 Index

SageMath examples RSA exponentiation, 287

addition tables, 272 RSA fixed points, 296–97
Affine cipher, 83–85 Sage command line execution time, 249–50
Atbash cipher, 81 shift cipher, 82–83, 91–92
basic functions about primes, 189 special values of binary and decimal systems,
Boolean function with truth table and ANF, 183
413–14 square and multiply, 222
Caesar cipher, 82 structure and naming conventions, 75–76
Carmichael numbers, 153–54 substitution ciphers, 80–91
checking primality of integers, 189–92 substitution with symbols not only capital
ciphertext-only attacks (COA), 92 letters, 88–91
combined sequence, 453 symmetric encryption, 463–64
coprimes of an integer, 203 system of Boolean linear equations, 409–10
cryptanalysis, KPA against Hill cipher, system of linear equations, 407–8
92–94 three LFSRs, 452
Diffie-Hellman key-exchange protocol, three LFSR sequences, 452
255–56 transposition ciphers, 76–80
factoring a number, 287 Vigenère cipher, 85
fast exponentiation, 273 XOR encryption, 431–32
feedback shift registers (FSR), 437 Scytale, 41
gcd calculation and factorization, 248–49 SDES, 32
Geffe function, 450–51 Secret keys, 5, 7, 20, 311, 576
graph generation of functions, 188 Secure key length, 582
Hill cipher, 85–88, 92–94 Secure key sizes, 583
KPA against Hill cipher, 94 Secure multiparty computation (SMC), 472
LFSR, defining, 440–41 Security
LFSR with pylfsr package in Python, 442–44 ad-hoc, 23
multiplication tables, 272–73 block ciphers, 418, 423–24
multiplicative order, 273–76 computational, 22
number of digits, 271 definitions, 21–23
number of private RSA keys, 290 of elliptic-curve cryptography, 385–87
number theory, 198–99 elliptic curves and, 573–74
orbits of inner points, 338–39 indistinguishable definitions, 20–23
paper-and-pencil (P&P) ciphers, 74–94 information-theoretical, 22–23
period calculation, 451–52 provable, 23
Phi and list of coprimes, 307 RSA algorithm, 234–52
prime gaps, 178 RSA implementations, 234
prime numbers, 189–92 Security parameters, 17
primitive roots, 276–87 Self-initializing quadratic sieve (SIQS), 166
pseudorandom bit sequence, 441–42 Self-supervised learning, 118, 119
pseudorandom sequence (very poor), Semisupervised learning, 118, 119
437–38 Sequence-to-sequence models (S2S), 118
residue system, 217 SHA-1, 63
residue value, 223 SHA-2, 363
RSA encryption, 288 SHA-3, 364
RSA encryption by modular exponentiation, SHA-256, 365
288 Shared primes, 247, 250–52

Esslinger index.indd 638 11/30/2023 1:29:39 PM

Index 639

Shift cipher, 46, 81–82, 91–92 polygraphic substitution, 51–53

Shortest vector problem (SVP), 544, 546 polyphonic substitution, 102
Sieve of Eratosthenes, 165, 166 SageMath examples, 80–91
Signature procedure, 366 See also Ciphers
Signatures Subtraction, modulo, 267–68
digital, 233, 361, 365–69, 372–73 Superposition, 55
multivariate cryptography and, 585 Symmetric encryption
one-time, 585 about, 2–3, 393–94
validation and validity models, 372–73 advantage of, 3
Signing, elliptic curves, 388 block ciphers and, 414–27
Silver-Pohlig-Hellman algorithm, 313–14, Boolean functions and, 394–414
556–57 brute force attacks on, 4–5
Simple columnar transposition, 42 illustrated, 3
Simplified-AES (S-AES), 32 SageMath examples, 463–64
Simulated annealing, 126–29 stream ciphers and, 427–63
Single-key attacks, 19 See also Advanced Encryption Standard
Slidefair, 55 (AES)
Smooth cofactor, 556 Systems of linear equations
Solitaire cipher, 60–61 about, 406, 480–81
Spanish strip cipher, 51 Boolean, 408–10
Special values, binary and decimal systems, estimate of costs, 410–11
182–83 Gaussian elimination, 484
Square and multiply, 214, 222 matrices and, 483–84
Square matrix, 494–95 as a picture, 482
Stencils, 60 in SageMath, 406–7
Straddling checkerboard, 48–49 solving, 481–82
Stream ciphers See also Boolean functions
about, 427
algebraic attack on LFSRs, 444–47 T
BBS generator, 455–58 Terms and definitions, this book, 113
key stream generation, 429 Textbook RSA
nonlinear combiner, 451–53 about, 513
nonlinear combiners design criteria, 453–54 challenge, 516–17
perfectness and factorization conjecture, encoding procedure, 514
458–59 parameter generation, 515
practical considerations, 460–61 reversibility of encoding procedure, 514–15
pseudorandom generators, 434, 454–55 steps, 513
summary and outlook, 463 Time-memory trade-off (TMTO) attacks, 22
XOR encryption and, 427–29 Tokenization, 120–21
Strong pseudoprime numbers, 155 Tokenizers, developing, 121
Stuart, Mary, 102 Transcription
Substitution ciphers about, 109
about, 45 automatic, 115–19
combining with transposition, 56–60 historical cryptology, 109–19
homophonic substitution, 50–51 incremental, 119
monoalphabetic substitution, 45–50 manual, 109–14
polyalphabetic substitution, 53–56

Esslinger index.indd 639 11/30/2023 1:29:39 PM

640 Index

TranscriptTool, 105–6 adding, 317

Transformer networks (TN), 118 coordinate, 491
Transitivity, 207 directional, 318
Transkribus.ai, 106 finding, 488
Transposition ciphers illustrated example, 487
about, 40, 102 puzzle challenge, 489
Cadenus cipher, 44–45 shortest, 518, 522
column and row, 42–43 Vector spaces
combining with substitution ciphers, 56–60 about, 498
geometric figures, 43 basis of, 501
grille cipher, 41 conditions/properties/axioms, 498–99
introductory samples, 40–41 of cubic polynomials, 500
Nihilist Transposition, 43–44 examples of, 499–502
rail fence cipher, 40 over real numbers, 498
SageMath examples, 76–80 subset of, 500
Scytale, 41 VIC cipher, 62
turning grille, 41 Vigenère cipher, 54–55, 85, 102
Union Route Cipher, 43 Vigenère disk, 11
See also Ciphers
W
T Weierstrass equation, 382–83
Transpositions, 42, 57, 346, 420 Work factor, 8, 11–12
Trapdoor one-way functions, 302 Workspace Manager (CT2), 592
Tri-digital cipher, 49
Trigraphic Playfair, 52–53 X
Triple-DES (TDES, 3DES), 4 X.509 certificate, 370–72
Tri-square cipher, 53 Xedni calculus algorithm, 573
True random sequence, 432–34 XOR, 394
Truth tables, 396, 397, 412 XOR ciphers, 427–29
Twin primes, 173–75 XOR encryption
TWIRL device, 246 about, 427
Two-square cipher, 53 algorithmically generated bit sequences and,
434
U OTP, 428
Ulam’s prime spiral, 161 periodic, 430–31
Ultrametric planes, 358 principle, 427
Union Route Cipher, 43 with pseudorandom key stream, 434

V Z
Variable-key attacks, 19 Zhang, Yitang, 174–75
Vectors
about, 316, 487

Esslinger index.indd 640 11/30/2023 1:29:39 PM

Recent Titles in the Artech House
Computer Security Series
Rolf Oppliger, Series Editor

Bluetooth Security, Christian Gehrmann, Joakim Persson, and Ben Smeets

Computer Forensics and Privacy, Michael A. Caloyannides
Computer and Intrusion Forensics, George Mohay, et al.
Contemporary Cryptography, Second Edition, Rolf Oppliger
Cryptography 101: From Theory to Practice, Rolf Oppliger
Cryptography for Security and Privacy in Cloud Computing, Stefan Rass and
Daniel Slamanig
Defense and Detection Strategies Against Internet Worms, Jose Nazario
Demystifying the IPsec Puzzle, Sheila Frankel
Developing Secure Distributed Systems with CORBA, Ulrich Lang and
Rudolf Schreiner
Electric Payment Systems for E-Commerce, Second Edition, Donal O'Mahony,
Michael Peirce, and Hitesh Tewari
Engineering Safe and Secure Software Systems, C. Warren Axelrod
Evaluating Agile Software Development: Methods for Your Organization,
Alan S. Koch
Implementing Electronic Card Payment Systems, Cristian Radu
Implementing the ISO/IEC 27001 Information Security Management System
Standard, Edward Humphreys
Implementing Security for ATM Networks, Thomas Tarman and Edward Witzke
Information Hiding, Stefan Katzenbeisser and Fabien Petitcolas, editors
Internet and Intranet Security, Second Edition, Rolf Oppliger
Introduction to Identity-Based Encryption, Luther Martin
Java Card for E-Payment Applications, Vesna Hassler, Martin Manninger,
Mikail Gordeev, and Christoph Müller
Learning and Experiencing Cryptography with CrypTool and SageMath,
Bernhard Esslinger
Lifecycle IoT Security for Engineers, Kaustubh Dhondge
Modern Vulnerability Management: Predictive Cybersecurity, Michael Roytman and
Ed Bellis
Multicast and Group Security, Thomas Hardjono and Lakshminath R. Dondeti
Non-repudiation in Electronic Commerce, Jianying Zhou
Outsourcing Information Security, C. Warren Axelrod
The Penetration Tester’s Guide to Web Applications, Serge Borso
Privacy Protection and Computer Forensics, Second Edition,
Michael A. Caloyannides
Role-Based Access Control, Second Edition, David F. Ferraiolo, D. Richard Kuhn, and
Ramaswamy Chandramouli
Secure Messaging with PGP and S/MIME, Rolf Oppliger
Securing Information and Communications Systems: Principles, Technologies and
Applications, Javier Lopez, Steven Furnell, Sokratis Katsikas, and Ahmed Patel
Security Fundamentals for E-Commerce, Vesna Hassler
Security Technologies for the World Wide Web, Second Edition,
Rolf Oppliger
Techniques and Applications of Digital Watermarking and Content Protection,
Michael Arnold, Martin Schmucker, and Stephen D. Wolthusen
User’s Guide to Cryptography and Standards, Alexander W. Dent and
Chris J. Mitchell

For further information on these and other Artech House titles, including previously considered out-of-print
books now available through our In-Print-Forever® (IPF®) program, contact:

Artech House Artech House

685 Canton Street 16 Sussex Street
Norwood, MA 02062 London SW1V HRW UK
Phone: 781-769-9750 Phone: +44 (0)20 7596-8750
Fax: 781-769-6334 Fax: +44 (0)20 7630-0166
e-mail: [email protected] e-mail: [email protected]
Find us on the World Wide Web at: www.artechhouse.com

Book of Intergrals
No ratings yet
Book of Intergrals
227 pages
PDF Art of Problem Solving Volume 2 and Beyond 7th the Wei Zhi download
No ratings yet
PDF Art of Problem Solving Volume 2 and Beyond 7th the Wei Zhi download
16 pages
College Algebra by Blitzer
No ratings yet
College Algebra by Blitzer
1 page
Brenner Four Pillars of Math Edu PDF
No ratings yet
Brenner Four Pillars of Math Edu PDF
165 pages
MA20218 Analysis 2A: Lecture Notes
No ratings yet
MA20218 Analysis 2A: Lecture Notes
62 pages
Ch05 FiniteFields
100% (1)
Ch05 FiniteFields
34 pages
Combinatorics Introduction
No ratings yet
Combinatorics Introduction
351 pages
Contemporary Abstract Algebra 9th Edition Gallian Solutions Manual - All Chapter Instant Download
100% (4)
Contemporary Abstract Algebra 9th Edition Gallian Solutions Manual - All Chapter Instant Download
7 pages
All of Math in Three Pages: Individual Test Topics
No ratings yet
All of Math in Three Pages: Individual Test Topics
3 pages
Rsa - TCR PDF
No ratings yet
Rsa - TCR PDF
89 pages
Proofsss
No ratings yet
Proofsss
30 pages
Instant Access To (Ebook PDF) Technical Calculus With Analytic Geometry 5th Edition by Peter Kuhfittig Ebook Full Chapters
100% (9)
Instant Access To (Ebook PDF) Technical Calculus With Analytic Geometry 5th Edition by Peter Kuhfittig Ebook Full Chapters
41 pages
Fisher I A Brief Introduction To The Infinitesimal Calculus 1921
No ratings yet
Fisher I A Brief Introduction To The Infinitesimal Calculus 1921
106 pages
Algebra I Honors
No ratings yet
Algebra I Honors
47 pages
Jupyter Notebook Viewer
No ratings yet
Jupyter Notebook Viewer
32 pages
The Art of Problem Solving: David Patrick
No ratings yet
The Art of Problem Solving: David Patrick
6 pages
Mathematica Tutorial (Differential Equations)
No ratings yet
Mathematica Tutorial (Differential Equations)
8 pages
Galois Theory 2nd Edition David A. Cox 2024 Scribd Download
100% (6)
Galois Theory 2nd Edition David A. Cox 2024 Scribd Download
81 pages
Introduction to Number Theory Art of Problem Solving 2nd Edition Mathew Crawford All Chapters Instant Download
100% (2)
Introduction to Number Theory Art of Problem Solving 2nd Edition Mathew Crawford All Chapters Instant Download
81 pages
Exploring Quantitative Skills-1
No ratings yet
Exploring Quantitative Skills-1
207 pages
(Kristopher Tapp) Symmetry A Mathematical Explora
No ratings yet
(Kristopher Tapp) Symmetry A Mathematical Explora
230 pages
Calculus of Inductive Constructions
No ratings yet
Calculus of Inductive Constructions
18 pages
Instant Access to Real Analysis and Foundations 2nd Edition Steven G. Krantz ebook Full Chapters
100% (1)
Instant Access to Real Analysis and Foundations 2nd Edition Steven G. Krantz ebook Full Chapters
76 pages
The Art of Problem Solving Volume 1 The Basics Solutions Manual by Richard Rusczyk, Sandor Lehoczky
No ratings yet
The Art of Problem Solving Volume 1 The Basics Solutions Manual by Richard Rusczyk, Sandor Lehoczky
143 pages
Solution Manual For Precalculus Enhanced With Graphing Utilities 4th Edition
No ratings yet
Solution Manual For Precalculus Enhanced With Graphing Utilities 4th Edition
230 pages
Circular Functions and Its Graphs Week 5
No ratings yet
Circular Functions and Its Graphs Week 5
42 pages
Download (Ebook) A Concise Introduction to Pure Mathematics, Fourth Edition by Liebeck, Martin W ISBN 9781498722933, 1498722938 ebook All Chapters PDF
100% (10)
Download (Ebook) A Concise Introduction to Pure Mathematics, Fourth Edition by Liebeck, Martin W ISBN 9781498722933, 1498722938 ebook All Chapters PDF
65 pages
Solution Manual for Introduction to Applied Linear Algebra 1st by Boydinstant download
100% (5)
Solution Manual for Introduction to Applied Linear Algebra 1st by Boydinstant download
54 pages
Principles and Techniques of Combinatorics PDF
100% (1)
Principles and Techniques of Combinatorics PDF
308 pages
Aops Community 2023 Aime
No ratings yet
Aops Community 2023 Aime
6 pages
10.2 Arithmetic Sequence and Series
100% (2)
10.2 Arithmetic Sequence and Series
46 pages
MIR Vilenkin N Combinatorial Mathematics For Recreation 1972 PDF
No ratings yet
MIR Vilenkin N Combinatorial Mathematics For Recreation 1972 PDF
205 pages
Combinatorial Algorithms - Edward M Reingold
0% (1)
Combinatorial Algorithms - Edward M Reingold
12 pages
8th Grade Algebra Curriculum Map
No ratings yet
8th Grade Algebra Curriculum Map
9 pages
(Undergraduate Texts in Mathematics) Charles Chapman Pugh - Real Mathematical Analysis-Springer (2002)
No ratings yet
(Undergraduate Texts in Mathematics) Charles Chapman Pugh - Real Mathematical Analysis-Springer (2002)
453 pages
APEXPreCalculus PDF
No ratings yet
APEXPreCalculus PDF
164 pages
Mathematical Plums - Hosberger
No ratings yet
Mathematical Plums - Hosberger
190 pages
Multivariate Calculus and Geometry 3rd Ed 2014 Ebook PDF
100% (1)
Multivariate Calculus and Geometry 3rd Ed 2014 Ebook PDF
4 pages
Section 1.1
100% (1)
Section 1.1
519 pages
Notes
No ratings yet
Notes
422 pages
Symbolic Mathematics in Python - Scipy Lecture Notes
No ratings yet
Symbolic Mathematics in Python - Scipy Lecture Notes
8 pages
Infinite Series-Fichtenholz
No ratings yet
Infinite Series-Fichtenholz
139 pages
Student Guide 06
No ratings yet
Student Guide 06
152 pages
Integrated Math 1 Cumulative Review
No ratings yet
Integrated Math 1 Cumulative Review
12 pages
CSE331 - 211 Midterm Question Day PDF
No ratings yet
CSE331 - 211 Midterm Question Day PDF
2 pages
00 Number Theory Step by Step
100% (2)
00 Number Theory Step by Step
398 pages
Project Name: P-Adic Integers
No ratings yet
Project Name: P-Adic Integers
51 pages
Goldreich O. Introduction To Complexity Theory
100% (1)
Goldreich O. Introduction To Complexity Theory
375 pages
Alg1 Unit2 Student
No ratings yet
Alg1 Unit2 Student
146 pages
Mathmatical Methods Final
No ratings yet
Mathmatical Methods Final
371 pages
A Complex Math Calculation Tool
No ratings yet
A Complex Math Calculation Tool
7 pages
Dolciani HW 2
No ratings yet
Dolciani HW 2
2 pages
Esslinger-017_Chap03-Historical-Cryptology
No ratings yet
Esslinger-017_Chap03-Historical-Cryptology
58 pages
Cryptography and Mathematics
No ratings yet
Cryptography and Mathematics
223 pages
Cryptography
No ratings yet
Cryptography
209 pages
Cryptography
No ratings yet
Cryptography
201 pages
Basic Lectures On Cryptography: (With Sage Programs)
No ratings yet
Basic Lectures On Cryptography: (With Sage Programs)
91 pages
Elementary Number Theory PDF
No ratings yet
Elementary Number Theory PDF
172 pages
Rosen - Elementary Number Theory and Its Applications (001-100)
No ratings yet
Rosen - Elementary Number Theory and Its Applications (001-100)
100 pages
Rosen - Elementary Number Theory and Its Applications - 1st
No ratings yet
Rosen - Elementary Number Theory and Its Applications - 1st
462 pages
Botteri G. The Astronomical System of Aristotle. An Interpretation 2023
No ratings yet
Botteri G. The Astronomical System of Aristotle. An Interpretation 2023
346 pages
SMCF_Text_R FINALPrint2014 - Michael Eardley
No ratings yet
SMCF_Text_R FINALPrint2014 - Michael Eardley
22 pages
Wang F. Foundation of Probability Theory 2024
100% (2)
Wang F. Foundation of Probability Theory 2024
208 pages
Ang A., Tang W. Probability Concepts in Engineering 2ed 2007
100% (1)
Ang A., Tang W. Probability Concepts in Engineering 2ed 2007
419 pages
Carmichael 1913 PDF
No ratings yet
Carmichael 1913 PDF
20 pages
Karanataka CET 1984 Mathematics
0% (1)
Karanataka CET 1984 Mathematics
15 pages
Cryptography and Network Security 2010
No ratings yet
Cryptography and Network Security 2010
4 pages
Finite Fields: Theory and Computation
No ratings yet
Finite Fields: Theory and Computation
531 pages
Generalized Wilson - Yimin Ge - X
No ratings yet
Generalized Wilson - Yimin Ge - X
8 pages
4-1 2 3-Handout
No ratings yet
4-1 2 3-Handout
63 pages
C Programming Notes
No ratings yet
C Programming Notes
118 pages
Diophantine Equations 12.03.16
No ratings yet
Diophantine Equations 12.03.16
55 pages
Lecture 14 & 15 - Single Row Functions
No ratings yet
Lecture 14 & 15 - Single Row Functions
54 pages
Proofs, Generalizations and Analogs of Menon's Identity
No ratings yet
Proofs, Generalizations and Analogs of Menon's Identity
46 pages
Problem Set 2
No ratings yet
Problem Set 2
5 pages
CCPC Finals 2021
No ratings yet
CCPC Finals 2021
17 pages
Reciprocidade Problemas Resolvidos PDF
100% (2)
Reciprocidade Problemas Resolvidos PDF
13 pages
VTAMPS 20 SS Set 3 (Corrected Solution Manual)
No ratings yet
VTAMPS 20 SS Set 3 (Corrected Solution Manual)
14 pages
Mathematics: Self-Learning Module 4 17
100% (2)
Mathematics: Self-Learning Module 4 17
12 pages
2023 Amc 9-10
No ratings yet
2023 Amc 9-10
10 pages
11RU-CBCS Course Structure and Syllabus-Sem-V - VI For BSC
No ratings yet
11RU-CBCS Course Structure and Syllabus-Sem-V - VI For BSC
227 pages
Simple Divisibility Rules For The First 1000 Prime Numbers
No ratings yet
Simple Divisibility Rules For The First 1000 Prime Numbers
14 pages
Primes and Greatest Common Divisors: Section 4.3
No ratings yet
Primes and Greatest Common Divisors: Section 4.3
48 pages
MSC Maths
No ratings yet
MSC Maths
71 pages
3 540 39799 X
No ratings yet
3 540 39799 X
543 pages
CS8792-Cryptography and Network Security QB 1
No ratings yet
CS8792-Cryptography and Network Security QB 1
10 pages
Two Irrational Numbers That Give The Last Non-Zero Digits of N! and N
No ratings yet
Two Irrational Numbers That Give The Last Non-Zero Digits of N! and N
6 pages
A. Equation: Input Output
No ratings yet
A. Equation: Input Output
2 pages
Multiplicative Inverse
No ratings yet
Multiplicative Inverse
15 pages
DAA - Unit 1
No ratings yet
DAA - Unit 1
24 pages
Vbscript Quiz
No ratings yet
Vbscript Quiz
3 pages
Syllabus of DisMath (Online - Offline Format) 2021-2022
No ratings yet
Syllabus of DisMath (Online - Offline Format) 2021-2022
8 pages
Divisibility Test Modular Design Presentation
No ratings yet
Divisibility Test Modular Design Presentation
38 pages
Quadratic Residues
No ratings yet
Quadratic Residues
5 pages