Hayter Hyperbolic Report
Hayter Hyperbolic Report
Ross Hayter
This project begins by considering the failure of Euclid’s fifth postulate and the need
for an alternative. It outlines the basic structure of Lorentz 3-space which allows the
first model of the hyperbolic plane to be derived. The project focuses on four
models; the hyperboloid model, the Beltrami-Klein model, the Poincaré disc model
and the upper half plane model. The main objective is the derivation and
transformation of each model as well as their respective characteristics. The
advantages and disadvantages of each model are discussed by using examples.
Finally the project addresses the consistency of hyperbolic geometry and the
implications of non-Euclidean geometry for mathematics as a whole.
Contents
1 No More Euclid 5
1.1 The Elements and The Fifth Postulate . . . . . . . . . . . . . . . . . . 5
1.2 The Mathematical Toolkit . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.2.1 Möbius Transformations . . . . . . . . . . . . . . . . . . . . . . 7
1.2.2 Transitive Property of Möb+ . . . . . . . . . . . . . . . . . . . 7
1.2.3 Differentiation of Möbius Transformation . . . . . . . . . . . . 8
1.2.4 The Riemann Sphere . . . . . . . . . . . . . . . . . . . . . . . . 9
1.3 Paths . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
1.3.1 Length and Elements of Arc-length . . . . . . . . . . . . . . . . 11
1.3.2 The Path Metric Space . . . . . . . . . . . . . . . . . . . . . . 14
2 Hyperbolic Space 16
2.1 Lorentz Space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
2.1.1 Lorentzian Inner Product . . . . . . . . . . . . . . . . . . . . . 16
2.2 Lorentz Transformations . . . . . . . . . . . . . . . . . . . . . . . . . . 17
2.3 Lorentzian Cross Product . . . . . . . . . . . . . . . . . . . . . . . . . 20
2.3.1 Properties of Lorentzian Cross Product . . . . . . . . . . . . . 21
2.4 Curvature of Hyperbolic Space . . . . . . . . . . . . . . . . . . . . . . 22
2.4.1 Curvature . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
2.4.2 Hyperbolic Triangles . . . . . . . . . . . . . . . . . . . . . . . . 27
2.5 The Hyperboloid Model . . . . . . . . . . . . . . . . . . . . . . . . . . 28
2.5.1 Derivation of the Model . . . . . . . . . . . . . . . . . . . . . . 28
2.5.2 H-lines and Distance . . . . . . . . . . . . . . . . . . . . . . . . 29
2.6 The Hyperbolic Metric . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
2.7 Lobachevski’s Formula . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
1
3.2 Poincaré Disc Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
3.2.1 Derivation of the Poincaré Disc . . . . . . . . . . . . . . . . . . 41
3.2.2 Description of the Poincaré disc . . . . . . . . . . . . . . . . . . 43
3.2.3 Distance in the Poincaré Disc . . . . . . . . . . . . . . . . . . . 43
3.2.4 A Conformal Model . . . . . . . . . . . . . . . . . . . . . . . . 44
3.3 M.C. Escher and the Poincaré Disc Model . . . . . . . . . . . . . . . . 48
3.4 Poincaré Disc Model to Klein Model . . . . . . . . . . . . . . . . . . . 49
Bibliography 63
2
Figure 1: J.P Petit, The Adventures of Archibald Higgins: Euclid Rules OK? (1982)
3
Figure 2: J.P Petit, The Adventures of Archibald Higgins: Euclid Rules OK? (1982)
4
Chapter 1
No More Euclid
That, if a straight line falling on two straight lines makes the interior angles of the
same side less than two right angles, the two straight lines, if produced indefinitely,
meet on that side on which are the angles less than the two right angles.
This statement was not obviously true and the search for its proof using the previ-
ous four postulates became popular for many mathematicians. Throughout the years
mathematicians such as Clairaut and Legendre published many attempts to prove
the postulate [7]. Each attempt was unsuccessful since they relied on an unprovable
assumption [22].
In the first half of the nineteenth century a revolution took place as certain math-
ematicans1 began to investigate the possibilty of geometry that used the negation of
the parallel postulate. This concept had a huge impact on the field of mathematics
1
Schweikart, Taurinus, Gauss, Lobachevski and Bolyai [3]
5
and its implications were not taken lightly.
This revolution was as spectacular to mathematics as the discovery of special rela-
tivity to physics or the realisation of the helix structure of DNA to biochemistry. The
consequences of this negation was the creation of non-Euclidean geometry and it is
hyperbolic geometry that we shall concern ourselves with here.
The foundations of hyperbolic geometry are based on one axiom that replaces
Euclid’s fifth postulate, known as the hyperbolic axiom.
Hyperbolic Axiom 1 Let there be a line l and a point P such that P does not lie on
l. Then there exists at least two unique lines parallel to l that pass through the point
P. [7]
6
1.2 The Mathematical Toolkit
Before we begin our study of hyperbolic geometry we must first consider some concepts
from complex analysis which will allow us to fully explore the non-Euclidean world.
Möbius Transformations are by far one of the most elegant and fundamental map-
pings in geometry. They work by sending one point in the complex plane to a cor-
responding point, while preserving angles. Their simplicity creates many interesting
results and I refer the interested reader to Needham [12] for a visual interpretation of
such results.
We can break φ(z) into a sequence of different mappings [14]. Möbius functions
are well defined as z tends to infinity.
az + b a + zb a
φ(∞) = lim = lim d
=
z→∞ cz + d z→∞ c + c
z
Given that φ(∞) = ac then φ(∞) = ∞ if and only if c = 0. For ease, let the set of
all Möbius transformations be denoted by Möb+ .
7
From the definition of Möbius transformations it appears that it would be necessary
to have all four complex coefficents a, b, c and d to specify a unique transformation
φ(z). If this were true, to construct any Möbius transformation we would need to
calculate the image of four distinct points in the complex plane.
Note that if we multiply φ(z) by some non-zero complex constant κ we will see
that the same mapping is produced, [12]
κaz + κb az + b
κφ(z) = = = φ(z).
κcz + κd cz + d
Therefore, it is the ratios of the coefficients that truly determine the mapping, of which
there are only three ab , cb and dc . We can generalise this useful property to create the
following theorem.
This theorem says that given the ordered complex triples (z1 , z2 , z3 ) and (w1 , w2 , w3 ) ∈
C there exists some unique φ belonging to Möb+ such that
φ(z1 ) = w1 , φ(z2 ) = w2 and φ(z3 ) = w3 .
This emphasises that we only need three distinct images to define a Möbius transfor-
mation. The proof of this can be found in Anderson [1].
Example Find a Möbius transformation φ that maps the points Z = (0, −i, −1) to
W = (i, 1, 0) respectively.
We can substitute these values into three different Möbius functions to give,
a(0) + b a(−i) + b a(−1) + b
i= ,1 = , and 0 =
c(0) + d c(−i) + d c(−1) + d
8
If we take the same limit of φ we then can show that the derivative of a Möbius
transformation is
1
φ0 (z) =
(cz + d)2
∂U ∂U ∂V ∂V
Ux = , Uy = , Vx = and Vy = ,
∂x ∂y ∂x ∂y
exist on some disk centre (x0 , y0 ). Let these partial derivatives be continuous at
z0 = x0 + iy0 and if the Cauchy-Riemann equations hold at z0 then f (z) is complex
differentiable at z0 .
The Cauchy-Riemann equations are
Ux = Vy and Uy = −Vx .
Definition 4 The extended complex plane, denoted C̄, is the union of the set of points
in the complex plane and the point of infinity {∞} that is not on C.
C̄ = C ∪ {∞}
9
By placing a sphere Π on the complex plane such that its centre is at the origin
and it has the same radius as the unit circle S we are able to comprehend this point
of infinity [12]. Instead of the complex numbers being points on a plane, we can let
them be points on Π, where the ‘North Pole’ (in cartesian co-ordinates (0, 0, 1)) is the
point of infinity {∞}.
Now we are faced with another problem, how do we represent the complex numbers
on the plane? The solution to this is to use stereographic projecton.
We can define the stereographic projection in cartesian co-ordinates as the mapping
σ : Π → C [16],
x2 + y 2 + z 2 − 1
2x 2y
σ(x, y, z) = , , .
1 + x2 + y 2 + z 2 1 + x2 + y 2 + z 2 x2 + y 2 + z 2 + 1
Consider the point p and the line l that passes through p. If we construct a line
between N (the ‘North Pole’ of the shere Π) and p, it intersects Π at the point p̄;
where p̄ is known as the stereographic projection of p.
Now imagine that p moves along the line l. Note that as p moves along l, p̄ moves
closer to N , i.e. p → ∞ ⇔ p̄ → N our single point of infinity {∞}.
Notice in the figure that the stereographic image of the line l is a Euclidean circle
on Π passing through {∞}.
Unlike our description above which uses the unit sphere Π, we can prove this theorem
using the unit circle S in the complex plane3 .
Proof Construct the unit circle S in C and remove i from S. Define Lz to be the
Euclidean line through the pair of points i and z ∈ {C − {i}}.
Let there exist a function δ : S − {i} → R where δ(z) = Lz ∩ R. Therefore δ
maps the points where Lz intersects the real line.
Let v, w ∈ {S − {i}} such that δ(v = δ(w). Then Lv and Lw pass through {i}
and v, w respectively. This implies that v = w and so δ is a bijection.
Note that the image (stereographic image) of δ(z) is the real line R.
As δ is a one-to-one function we can reverse this process and create a Euclidean
circle by just adding {i} to line Lz . We can denote the absent point {i} as the
point of infinity {∞}.
From this the stereogrpahic image of a line in C is a cirlce on Π passing through {∞}
situated at the ‘North Pole’ on Π.
10
Notation It is usually denoted L̄ = L ∪ {∞} to be the circle in C̄ containing the line
L.
1.3 Paths
For many readers this section will be familar but it is important to recall paths and
elements of arc-length. This section follows the introductory work in Anderson [1].
We now concern ourselves with this integration in the complex plane. We can
redefine the function f as a complex function
where x(t) and y(t) are real functions. Now if we differentiate this function 4 we get
11
Definition 7 Let there be a differentiable path f : [a, b] → C. The length of f is the
path integral
Z Z b
lengthγ (f ) = γ(z)|dz| = γ(f (t))|f 0 (t)|dt.
f a
We now shall consider two problems stated in Anderson [1] that explores the pre-
vious definiton well.
Example Consider the function δ on S = {z ∈ C : |z| < 1}, defined by setting δ(z)
to be the reciprical of the Euclidean distance from z to ∂S (the boundry of the
unit circle). What is the explicit formula for δz in terms of z?
Firstly one must realise that the distance from z to ∂S is 1 − |z| and as the
function δ is the reciprical of this distance then
1
δ(z) = .
1 − |z|
Example For each 0 < r < 1, let Cr be the Euclidean circle in S with Euclidean
centre 0 and Euclidean radius r. What is the length of Cr with respect to the
arc length of δ(z)|dz|?
We firstly need to parametrize Cr by the path f : [0, 2π] → S where f is the
contour f (t) = reit . From this we know that |f (t)| = r. We can then show that
|f 0 (t)| = |ireit | = r.
The following concept is very useful and we will rely upon it later when we discuss
the element of arc-length in the hyperbolic plane. Lets say we have the function
f : [a, b] → C, and a function h such that [a, b] = h[α, β]. How do we create a path for
the composition function g = f ?
Recall that,
Z b
lengthγ (f ) = γ(f (t))|f 0 (t)|dt
a
12
is the path integral for the length of f with the element of arc-length γ(z)|dz|. So
obviously the length of g is simply g substituted into the above equation instead of f
with its limits changed from a, b to α, β.
Z β
lengthγ (g) = γ(g(t))|g 0 (t)|dt.
α
We can now expand the function g into the functions f and h since g = f .
Z β Z β
0
γ(g(t))|g (t)|dt = γ((f ◦ h)(t))|(f ◦ h)0 (t)|dt
α α
Z β
= γ(f (h(t)))|f 0 (h(t))||h0 (t)|dt.
α
The above manipulation was just a simple substitution and some very basic calcu-
lus. However, an important result that we need to understand is that the lengthγ (f )
and the lengthγ (g) are connected by
lengthγ (f ) = lengthγ (f ◦ h).
To prove this is quite simple and only involves integration using a change of variable.
There are two cases we need to consider. When h0 (t) ≥ 0, this implies that h(α) = a
and that h(β) = b. Clearly as h0 (t) is non-negative we have that h0 (t) = |h0 (t)|. Now
say that h(t) = s and dt = h01(t) ds we can make the change of variable so that
Z β
lengthγ (g) = γ(f (h(t)))|f 0 (h(t))||h0 (t)|dt
α
Z b
1
= γ(f (s))|f 0 (s)|h0 (t) 0 ds
a h (t)
Z b
= γ(f (s))|f 0 (s)|ds.
a
Then by definition
Z β
γ(f (h(t)))|f 0 (h(t))||h0 (t)|dt = lengthγ (f )
α
The second case to consider is when h0 (t) ≤ 0. Obviously we have that |h0 (t)| = −h0 (t)
by the definition of absolute value and the limits of integration would then become
h(α) = b and h(β) = a. Once again we can use the substitution h(t) = s to show that
lengthγ (f ) = lengthγ (g).
Z β
lengthγ (g) = γ(f (h(t)))|f 0 (h(t))||h0 (t)|dt
α
Z a
= − γ(f (s))|f 0 (s)|ds
b
Z b
= γ(f (s))|f 0 (s)|ds
a
= lengthγ (f )
13
From this we have shown that lengthγ (f ) = lengthγ (f ◦ g).
d:X ×X →R
n : C × C → R where d(z, w) = |z − w|
d(z, w) = |z − w| ≥ 0.
Using the standard triangle inequality we can show that the third part of the
definition for the metric holds, where v, w, z ∈ C.
d(z, v) + d(v, w) = |z − v| + |v − w|
≥ |(z − v) + (v − w)|
≥ |z − w| = d(z, w)
We can regard two metrics to be equivalent if there exists a bijection τ such that
τ : X → X. Let there exist two metrics d1 (x, y) and d2 (x, y) where x, y ∈ X. Then
d1 and d2 are said to be isometric if
5
This section follows the literature on metric spaces given in [1] and [9].
14
Example Here are some metrics in R3 that are equivalent:
p
• d(x, y) = x − y) · (x − y),
1
3 q
X
p
• d(x, y) = |xj − yj | , where q ≥ 1
j=1
Let X is the set of elements that measure lengths of paths and let x, y ∈ X. Define
Γ[x, y] to be a non-empty set of paths f : a, b→ X such that f (a) = x and f (b) = y.
Now every path f that belongs to Γ[a, b] can be assigned with a non-negative real
number length(f ). From this we can now define a path metric space.
Definition 9 (X, d) is a path metric space if for all pairs of points x and y ∈ X we
have
d(x, y) = inf{length(f ) : f ∈ Γ[x, y]}
and there exists a path in f ∈ Γ[x, y] that satisfies
d(x, y) = length(f ).
To make some sense of this function consider all pairs of x and y in the complex
plane. Then let Γ[x, y] be the set of differeniable paths f : [a, b] → C and let length(f )
be the Euclidean length of f and so d(x, y) is the shortest Euclidean distance between
two points on a Euclidean line.
The metric d(x, y) does not always exist. The following example from Anderson
[1] highlights this well.
15
Chapter 2
Hyperbolic Space
The first step of our hyperbolic journey is to define the abstract concept of hyperbolic
space or perhaps, more conveniently, a space in which hyperbolic geometry can exist.
We achieve this by defining a new space which we call Lorentz 3-space. This chapter
begins by creating a feel for this non-Euclidean space by highlighting some of its key
features and crucial properties. Once the reader becomes more familiar with this new
environment we then intoduce the first model of the hyperbolic plane. This study of
the hyerbolic plane takes an approach that is similar to the study of the globe and its
relation to a flat map. [17]
Definition 10 Let u and v be vectors in Rn with n > 1 The Lorentzian inner product
of u and v is defined as [16]
This definition is for the general space Rn but we shall only consider when n = 3
and denote this as L2,1 or Lorentz 3-space L3 .
When studying mathematics the search for some physical context is often an useful
one. In this case the inner product1 has a useful application when n = 4 in the theory
of special relativity [3]. The components of the vector can be thought of as space
variables except for u4 v4 which is a time variable. These are most commonly known
1
Often physicists call this space Minowski 4-space.
16
as spacetime co-ordinates and in our notation we denote the inner product space as
L3,1 .
Upon first inspection the the inner product of L2,1 is not disimilar to that of three
dimensional Euclidean space execpt for the subtaction of the final components in the
inner product. In fact it is this subtlety that make a major difference.
We use the double modulus sign to emphasise the concept of length. Unlike the
standard Euclidean inner product the norm of lorentz space can actually be imaginary
as well as positive or zero. Trivally this imaginary number will always be a positive
imaginary number. The concept of imaginary distance is remarkable and here is a
simlpe example to demonstrate this concept.
√
kxk = x◦x
p
= (1 × 1) + (0 × 0) − (3 × 3)
√
= 2 2i
√
So the norm of x is the positive imaginary number 2 2i.
The next logical concept in defining Ln is to define the distance between two
vectors. In fact, this follows a similar definition to that of distance in Euclidean space.
dL (u, v) = ku − vk.
17
(featuring in many elementary physics courses2 ) and one can use them to show a that
transformaton of an object for one observer is different to another. There exists much
material Lorentz transformations of spacetime co-ordinates [6], however we shall stay
clear of this idea and approach the subject by concentrating on a more general per-
spective. We begin by defining a Lorentz transformation and Lorentz orthonormality.
θ(u) ◦ θ(v) = u ◦ v
These transformations are often known as orthogonal and form a group which in
lorentz three space we denote as O(Ln ). We denote transforms that map elements to
itself as G(Ln ) where G is a subgroup of O.
The proof of this theorem is fairly simple and acts an useful exercise4 .
We begin by considering the orthonormal basis {θ(e1 ), ..., θ(en )}. Let there exist
the mapping θ such that θ : Ln → Ln . By the definition of the Lorentz transform we
have that
1
if i = j
ei ◦ ej = θ(ej ) ◦ θ(ej ) = 0 if i 6= j
−1 if i = j = n.
2
such as Mathematical Physics II or [18] at Durham University
3
Where δij is the Kronecker delta: δij = 1 iff i = j and δij = 0 iff i 6= j.
4
The proof here differs to the proof given in Ratcliffe [16] but in essence they mount to the same
result.
18
We saw earlier that {e1 , ..., en } is an orthonormal basis in Ln and so {θ(e1 ), ..., θ(en )}
also forms a basis in Ln .
Let u and v be vectors belonging to the set U where U is a subset of Rn . Now
recall for a mapping φ : Rn → Rn to be linear it must satisfy:
1. φ(λv) = λφ(v), ∀ λ ∈ F, ∀ u ∈ U.
θ(u) ◦ θ(v) = u ◦ v.
n
! n
X X
= vi θ(ei ) ◦ uj θ(ej )
i=1 j=1
19
We can then use the property of summation to show that
n X
X n
θ(v) ◦ θ(u) = (vi θ(ei ) ◦ uj θ(ej )).
i=1 j=1
20
x ◦ (x ⊗ y) = x ◦ J(x × y)
= x ◦ (J(y) × J(x))
x1 e1 e2 e3
= x2 ◦ y1 y2 −y3
x3 x1 x2 −x3
x1 x 2 y3 − x 3 y2
= x 2 x 3 y1 − x 1 y3
x3 x 1 y2 − x 2 y1
= x1 x2 y3 − x1 x2 y3 + x2 x3 y1 − x2 x3 y1 + x3 x1 x2 − x3 x1 x2
= 0
The same argument applies for y so a similar arrangement to above gives that
y ◦ (x ⊗ y) = 0 and therefore y is orthogonal to x ⊗ y [16].
21
2.
(x ⊗ y) ◦ z = J(x × y) ◦ z
= (x × y) · z
z1 z2 z3
= x1 x2 x3
y1 y2 y3
3.
x ⊗ (y ⊗ z) = J(x × (y ⊗ z))
= J(y ⊗ z) × J(x)
= JJ(y × z) × J(x)
= (y × z) × J(x)
= (y · J(x))z − (z · J(x))y
= (y ◦ x)z − (z ◦ x)y.
4.
(x ⊗ y) ◦ (z ⊗ w) = J(x × y) ◦ J(z × w)
= JJ(x × y) ◦ (z × w)
= (x × y) ◦ (z × w)
= J(x × y) · (z × w)
= (J(x) × J(y)) · (w × z)
e1 e2 e3 e1 e2 e3
= y1 y2 −y3 · z1 z2 z3
x1 x2 −x3 w1 w2 w3
z · J(y) z · J(x)
=
w · J(y) w · J(x)
z◦y z◦x
= .
w◦y w◦x
22
geometry. In this section we only touch the surface, so to speak, of this interesting
field of mathematics.
We then look at the effect of the hyperbolic axiom on triangles and from there we
determine a crucial concept of the hyperbolic plane.
2.4.1 Curvature
Consider a straight Euclidean line. We can clearly say that it is not curved. If we
were to make some arbitrary measure, it would be simple to say its curvature was
equivalent to zero. On the other hand a circle is clearly curved. One could claim that
a circle has more curvature than a straight line and so with our measure of curvature
we may say the curvature of a circle is greater than zero.
Lets consider a smooth curve ϕ with parameter t over some interval a, b ∈ R. such
that
Note that k(s) does not necessarily have to be constant and can actually vary
depending where on the point on ϕ.
Earlier we discussed the concept of curvature of a circle in qualitive terms. From
our definition above, we are able to calculate the precise curvature of a circle. One
can decribe the path of a circle with radius r and centre at the origin with parameter
t as
α(t) = (r cos t, r sin t), where t ∈ [0, 2π].
The unit vector is given by
23
Since ds
dt = kα0 (t)k, we have that
q
kα0 (t)k =
p
(−r sin t)2 + (r cos t)2 = r2 (sin2 t + cos2 t) = r.
Then s = rt + c, but s = 0 at t = 0 which implies that c = 0. We are now able to
reparameterise in terms of arc-length.
s s
α(s) = r cos , r sin
r r
0
Given V (s) = α (s), we have
1 s s
V (s) = −r sin , r cos ,
r r r
and differentiate again to give
1 s s
V 0 (s) = −r cos , −r sin .
r2 r r
Since kV 0 (s)k = k(s) we then have the curvature for a circle
r
1 s 2 s 2 1
k(s) = 2 −r cos + −r sin = .
r r r r
Therefore the curvature of the circle is 1r . The calculation of the curvature of a
circle is simple and we can use this to calculate the curvature of any curve.
Consider an arbitrary point p on a smooth curve γ on the standard Euclidean
plane. Fix the point p on γ and choose a point q on γ and let there exsist a line l
between p and q. If we then let q approach p then the line l will approach the tangent
to γ at p. If we let two points p1 and p2 be either side of p one will notice that we can
construct a unique circle δ through these three points.
If we again fix p and let p1 and p2 approach p along γ a circle that ‘best fits’ γ
forms and we note that the tangent to a point on a circle is the line perpendicular to
the line of the radius to that point. The circle δ is called the osculating circle and so
we can define the curvature of the point p as the curvature of the circle δ. [7]
The osculating circle will vary in size as it moves along the curve as the curvature
of the curve varies. An important observation is the the oculating circle will either be
above or below the curve and so we define curvature as being positive the osculating
circle is below the curve and negative if the osculating circle is above the curve. [7]
We now shall consider curvature of surfaces [4]. Let there exsist some surface Γ
and an arbitrary point P such that P ∈ Γ. We can then define v as a unit vector
tangent to Γ at the point P and u be the unit vector normal to Γ. From u and v we
can then intersect a plane through P that forms a curve v at intersection with the
surface. We often call αv the normal section of Γ at P in the direction v. For example,
the normal section of a sphere at a point P is a great circle through P .
If we take a small segment of αv around the point P we can say it approximates
the point the osculating circle at P with radius rv (P ). Therefore we can define the
curvature of this point P as k(P ) given by:
24
1
k(P ) = ± .
rv (P )
Note: if the radius of the osculating circle is zero we say it is undefined and
equivalent to being infinite and so the curvature of a flat surface is zero.
The curvature of most surfaces is not constant so k(P ) will vary, this is equivalent
to saying that different normal sections at P have different curvatures. There are in
fact two orthogonal directions in which k(P ) has maximum and minimum values, kmax
and kmin respectively, known as principal directions. [4]
Definition 17 The product of the two principal curvatures is called the Gauss curva-
ture k
K(P ) = kmin kmax .
If we consider a sphere with radius r, its normal sections are circles of radius r. A
sphere has constant curvature, therefore the minimum and maximum curvatures are
same. Therefore the gaussian curvature of a sphere is given by
11 1
K= = 2.
rr r
25
Figure 2.1: Plane: A surface of zero curvature
26
2.4.2 Hyperbolic Triangles
Since primary school we have all been aware that the sum of the interior angles of a
triangle is π and this is a direct consequence of Euclid’s parallel postulate. Since we
are considering the compliment of this postulate the interior angles of a triangle do
not necessarily add to π. The following description proves that the sum of the interior
angles of a triangle in hyperbolic space is less than π.6
Firstly we must use the hyperbolic axiom, so we let there exsist a line l and a point
P1 that does not lie on l. We also let there exsist two lines that pass through P1 that
are parallel to l. Let one parallel l1 have a perpendicular between P1 and P2 and we
denote that as P1 P2 . Denote line l2 as the other parallel line of l through P1 .
Now take a point Q on l1 and Q0 on l2 . Let the segment P1 Q0 on l2 be between
the segments P1 P2 and P1 Q. There is a point L on l on the same side as P1 P2 , Q and
Q0 such that ∠P2 LP1 < ∠QP Q0 .
So the line P1 L is in the inside of the angle ∠P2 P1 L and therefore P1 Q0 would
meet P1 which contradicts the hyperbolic axiom that l2 does not meet l. This then
implies that ∠LP1 P2 < Q0 P1 P2 and so
π
∠P1 LP2 + ∠LP1 P2 < .
2
Therefore the right-angled triangle P1 P2 L has interior angles summing to less than π
ad so brings us to our next definition.
Given that the angles of a triangle sum to less than π this implies that hyperboloic
geometry has an angular excess E < 0.
It is important to proceed by denoting the area of a triangle as A(∆). In hyperbolic
geometry the angle magnitude depends on the size of the triangle itself and we can say
that the angular excess of a triangle is proportional to the triangle’s area i.e. E ∝ A.
Given that the angular excess in hyperbolic geometry is negative, we can say that
some negative constant k exists such that [12]
E(∆) = kA(∆).
Most geometry one has encountered has been that of flat surfaces where straight
lines are the shortest distance between two points. This is quite clearly not true for
curved surfaces and so we need to replace this with a new notion of a geodesic [16].
We define a geodesic as connecting two points A and B on a curved surface as ‘taking
the shortest path’ within the surface from A to B.
6
This proof is taken from Bonola [2] which has a translation of Nicholas Lobachewsky’s paper
titled ‘The Theory of Parallels’. This paper was one of the very first pieces of literature published
on hyperbolic geometry and so this proof is an interesting part of the development of non-Euclidean
geometry.
27
Trivally on a plane a geodesic is a Euclidean line. Consider a Geodesic on a sphere,
if we stretch a piece of elastic between points A and B the elastic will travel along the
shortest path and so form a geodesic, that is of a great circle. From this definition
of geodesic given any three points on any curved surface we can form a triangle by
joining the points with the geodesics.
We shall assume that it is the curvature of a surface that alters the angular excess
from E(∆) = 0 in Euclidean geometry, to that of E(∆) < 0 in hyperbolic geometry
and the constant k is infact the constant of curvature.
Consider drawing a triangle ∆ on a piece of paper and then rolling the paper into
a cylinder. The object is no longer flat but the geodesics are still straight Euclidean
lines and the sum of the interior angles of ∆ is still π. This implies even though the
paper appears curved it has that E(∆) = 0 and so is actually a flat Euclidean surface
this is because the intrinsic geometry of the surface does not change. More precisely
arc-lengths and angles of all curves are invariant to the bending of a surface. [4]
Gauss showed that the curvature of a surface is intrinsic. This property has a
useful application that is if you are on the surface then you will be able to measure
its curvature known as theorema egregrium7 . If we let δ∆ be an infinitesimal small
triangle on some surface located at p, then from earlier we have that
E(δ∆) = k(p)dA
We can then integrate over this area by adding up the Gaussian curvature of the
triangle’s interior yielding Z Z
E(∆) = k(p)dA.
∆
We only consider surfaces of constant curvature and we can treat k as a constant
and so Z Z
E(∆) = k dA = kA.
∆
This demonstrates the intrinsic property of curvature and shows that an inhabi-
tant of hyperbolic space would be able to determine if the surface they were on was
negatively curved by drawing a triangle and measuring its angles.
28
Figure 2.4: The hyperboloid model.
curvature r12 and we were shown that imaginary distances exist in L3 . Using these two
properties we are able to construct a model of the hyperbolic plane.
Let there
√ exist a sphere with radius r and let r = iκ where i is the imaginary
number −1. We can show the Guassian curvature of a sphere with imaginary radius
is
1 1 1
k= 2 = = − 2.
r (iκ)2 κ
In standard Euclidean space this proposition would be impossible but in Lorentzian
3-space such a quantity can exist. A sphere of imaginary radius in Euclidean space is
a two sheeted hyperboloid given by the cartesian equation
x2 + y 2 − z 2 = 1
This has two symmetric sheets, the sheet where z > 0 is known as the positive sheet
and the other sheet where z < 0 is called the negative sheet. We define the hyperboloid
model of the hyperbolic space as the positive sheet. [16]
We can more acurately define the hyperboloid model as
H = {v ∈ L3 : kvk = i}.
Definition 19 A hyperbolic line in the hyperboloid model is defined as the line formed
at the intersection of a Euclidean plane through the origin and the hperboloid, denoted
as H-lines. [16]
8
This section follows [17].
29
Lets consider this intersection of the plane and hyperboloid. Let ax + by + cz = 0
be the Euclidean plane through the origin with some constants a, b and c. The line of
intersection is then given as
a 2 b 2ab
1− x + 1− y 2 − 2 xy = 1
c c c
which is the equation for a hyperbola. Therefore we have that H-lines are hyperbolas.
Now we shall consider distance of two points A and B on H which we shall denote
d(A, B). We can define the arc-length of an object by partitioning it such that A =
p0 , p1 , ..., pn = B and the define the distance as [17]
n
X
d(A, B) = lim kpi − pi−1 k.
n→∞
i=1
Perhaps a better aproach is to parametrise the path from A to B [17]. Let σ be a piece-
wise differentiable function that maps the interval a ≤ t ≤ b to the line AB, where
σ(a) = A and σ(b) = B. Then the partition can be given as pi = σ(ti ) respectively.
Taking the limit of these partitions gives [17]
Z b
d(A, B) = kσ 0 (t)kdt.
a
We can only use ths formula to compute distances along a cross section of H, i.e. in
a two-dimensional space.
Let A and B be in a plane and we describe there location as ax e1 + ay e2 and
bx e1 + by e2 respectively, where {e1 , e2 } are the standard basis. Then we can define σ
as p
σ(t) = 1 + t2 e1 + te2 where ay ≤ t ≤ by .
Then using our equation for distance we have that [17]
Z by s
t2 h p
2+1
iby
d(A, B) = 1− dt = ln t + t .
ay 1 + t2 ay
√
Notice that ln t + t2 + 1 = arcsinht and therefore simplfies the distance to
From here we are able to generate the hyperbolic sine and cosine functions as
ψ = arcsinht
is the inverse of
eψ − e−ψ
t= ,
2
30
√
and parametrising over 1 + t2 gives the inverse of
eψ + e−ψ
cosh ψ = .
2
Therefore we can reparametrise a H-line as
σ(ψ) = (cosh ψ)e1 + (sinh ψ)e2 .
This model is distance preserving under Lorentz transformations. Let θ be a
Lorentz transform we then have by the definition of lorentz transformation that
θ(A) ◦ θ(B) = A ◦ B.
Therefore using our definition of arc-length, under the transform θ, given the definition
of Lorentz transformations and their linearity property we have that
Xn
d(θ(A), θ(B)) = lim kθ(pi ) − θ(pi−1 )k
n→∞
i=1
n
X
= lim kθ(pi − pi−1 )k
n→∞
i=1
Xn
= lim kpi − pi−1 k
n→∞
i=1
= d(A, B).
31
dH = ku − vk > 0
so it is non-negative. Finally we can easily show that it is symmetric by the statement
dH = ku − vk = kv − uk.
ku ⊗ vk2 = (u ⊗ v) ◦ (u ⊗ v).
We have that
u◦v u◦u
(u ⊗ v) ◦ (z ⊗ w) = det
v◦v v◦u
= (u ◦ v)2 − (u ◦ u)(v ◦ v)
= (u ◦ v)2 − kuk2 kvk2
and so
ku ⊗ vk2 = (u ◦ v)2 − kuk2 kvk2 .
Recall our definition earlier
ku ⊗ vk2 = kuk2 kvk2 (cosh2 (η(u, v)) − 1) = kuk2 kvk2 sinh2 (η(u, v)).
32
Therefore we have
The last fact from the lorentz cross product that we will need is that
cosh(η(u, v) + η(v, w)) = cosh(η(u, v)) cosh(η(v, w)) + sinh(η(u, v)) sinh(η(v, w))
= (u ◦ v)(v ◦ w) + ku ⊗ vkkv ⊗ wk
≥ (u ◦ v)(v ◦ w) + (u ⊗ v) ◦ (v ⊗ w)
= (u ◦ v)(v ◦ w) + (u ◦ w)(v ◦ v) − (u ◦ v)(v ◦ w)
= (u ◦ w)(v ◦ v)
= −u ◦ w
= coshη(u, w)
Recall from section 1.3.2 we saw that metrics are isometric if there exist a bijection
between them. We will consider three other models of the hyperbolic space and the
metrics wil be isometric to dH since bijections exist between each model.
33
2.7 Lobachevski’s Formula
In this section we consider one of the most amazing formulae in the whole of mathe-
matics. Lobachevski published a paper, ‘The Theory of Parallels’ [10] which formed
the basis for hyperbolic geometry as a whole, in which he referrred to the concept of
this formulae. Often this formulae is referred to as the Angle of Parallelism.
Let there exist a line from A to be B in the Euclidean plane which we shall denote
as AB. Let there also exist a point P not on AB. Construct a perpendicular line from
P to AB such that the foot of this line is at the point Q which lies on AB, call this
line P Q. Denote the length of P Q as d. Construct a parallel line m to the line AB
which passes through P .
Now construct a line that passes through P and intersect AB to the ‘right’ of Q
at the point R. Let θ denote the angle ∠QP R. Fix points Q and P . Let the point
R not cross Q but move towards infinity. The line n approaches the parallel m. The
angle θ at which the line n tends to m is called the angle of parallelism. Now there is
a relation between θ and d given by the Lobachevsky’s formula.
Now consider d → −∞, notice that the exponential e−d will approach infinity. We
know that
π
tan−1 (α) → as α → ∞
2
and so we have the limit
lim Π(d) = π.
d→−∞
Therefore Π(d) is a decreasing function.
The proof of this theorem is interesting as there are many ways to approach it. The
‘classical’ proof involves horocycles which we shall consider later. One such proof is
by Ruoff [19] which is slightly easier to grasp than the ‘classical’ proofs. Ruoff’s proof
uses area theory and polygons to demonstrate the limit of θ as R moves to infinity.
For those readers who find the concept of Lobachevski’s formula interesting but are
new to hyperbolic geometry I recommend that they cast an eye over Ruoff’s approach
to the theorem.
34
Chapter 3
35
the hyperboloid from below. Even though the sheet tends to infinity it would appear
as a disc and we can use this to our advantage.
Take a projection point at the origin (0, 0, 0) and a point (x, y, z) in hyperbolic
space. We can construct a projection line from the origin to this point with parameter
t as
t(x, y, z) + (1 − t)(0, 0, 0) = (tx, ty, tz).
We can then find for which t this line intersects the hyperboloid as follows:
Now if we set the third component equal to one, we can then project our points on
the hyperboloid onto the points on the plane at z = 1. So we have that
z x2 + y 2
p = 1 ⇒ z2 =
x2 + y 2 − z 2 2
x y x y
which gives the points , , 1 . Notice that , , 1 lies on the line from the
z z z z
point of the hyperboloid to the origin and that z ≥ 1. We then have that
x 2 y 2 1
+ = 1 − 2.
z z z
Form this we can show that the projection on the plane is in fact the unit disc.
We too can map H-lines to this disc as we know that H-lines are the intersection
of the plane through the origin and the hyperboloid. Therefore our planes get mapped
to straight lines in the disc
x y
ax + by + cz = 0 → +b + c = 0.
z z
This model is called the Klein model.
1
This description follows the introduction to the Klein model given in Greenberg [7]
36
Figure 3.1: The Klein model formed by stereographic projection of the hyperboloid
37
Figure 3.2: Lines in the Klein disc.
Fix the unit circle SK in the Euclidean plane centred at the origin O. The points
in the hyperbolic space are the points in the interior of the circle SK . A chord is a
segment joining AB on SK . In this model we represent hyperbolic lines as open chords
and denote them as K-lines.
Consider the diagram, the open chords r and s that pass through the point P are
‘parallel ’ to the open chord l. In this model we define a pair of parallel hyperbolic
lines as two open chords that share no common points in SK .
The possibility that these chords may meet if they were extended is simply not a
possibility as any space outside of SK is not on the hyperbolic plane. In other words
we only concern ourselves with the interior of SK . The following axiom allows the
Klein Model to be a successful model for the hyperbolic plane.
Axiom For any two distinct points A and B in SK , there exists a unique open chord
that both points A and B lie on. [7]
Proof Let A and B lie on the straight Euclidean line l. If we extend the Euclidean
line l it will intersect the boundry of SK at two distinct points, say C and D.
Therefore the points A and B lie on the only chord connecting points C and D.
Definition 23 Let A and B be two points inside the disc SK and let P and Q be the
points at the end of the chord through A and B. The Klein distance between the points
38
A and B is given as
1 AP · BQ
dK (A, B) = ln .
2 AQ · BP
·BQ
The term AP AQ·BP is often referred to as the cross ratio and can be written as
(AB, P Q), so the formula for distance becomes
| ln(AB, P Q)|
dK (A, B) = .
2
39
Figure 3.3: The different parallel lines in the Klein disc
Definition 24 Let l and m be two hyperbolic lines in SK . If at least one of the lines is
the diameter of SK then l can be considered to be perpendicular to m in the Euclidean
sense. [7]
We now turn our attention to angles in the Klein model. Angles are only conformal
at the origin in SK and the remaining interior is non conformal.
We know that the sum of the interior angles of a triangle in hyperbolic space is
less than π. In the Klein model a triangle angle appears to have a sum of angles equal
to pi as we use chords to construct it. Therefore we must interpret angles in the Klein
model differently to the Euclidean measure. Here we shall show an example that the
Klein model is not angle-preseving by contradiction using a simple example4 .
4
There are many examples and it is a useful exercise to use the definitions of perpendicularity of
the Klein model to prove the distortion of angles
40
Example Firstly lets assume that the Klein model is conformal. We say an angle in
SK is distorted if it does not equal the Euclidean angle in the Euclidean plane.
Let there exist two lines l1 and l2 that we define by the cartesian equations
y = √12 and x = √12 respecively, such that they intersect at the ideal point
1 1
i.e. they intersect at infinity on the model at the co-ordinates √ , √ . In
2 2
standard Euclidean geometry they are perpendicular.
Now recall the definitions of perpendicularity for the Klein model. Firstly neither
l1 of l2 pass through origin of √
SK and so neither are the diameter of SK . Secondly,
the pole of l2 is at the point ( 2, 0) and no straight Euclidean line passes through
the interior of SK and the pole of l1 therefore l1 and l2 are not perpendicular
and the distortion of angles exist.
The mapping T maps the hyperboloid to the open disc. We can show this simply
by substituting the components of T into the equation of the hyperboloid.
2 2
x y
+ + (0)2 = 1.
1+z 1+z
This mapping takes lines in the hyperboloid and maps them to arcs of circles in
the open unit disc. This model is called the Poincaré Disc Model.
41
Figure 3.4: Stereographic projection of the hyperboloid to the Poincaré disc.
42
Figure 3.5: P-lines in the Poincaré disc model are arcs of circles.
There exist two different type of lines in the Poincaré disc. Firstly, lines than travel
through the origin of SP are straight Euclidean lines. Lines that do not pass through
the origin belong to a circle σ that intersects SP orthogonally. [21]
In other words, for two points in the Poincaré disc there exist a open arc of a circle
that intersects the boundary at right angles, as shown in the diagram below. We call
all lines in this model P-lines
Two lines are said to be parallel if and only if they share no common points.
Similarly to the Klein model we can define lines that intesect at the boundry as ideal
parallel lines and lines that will never intersect as ultra-ideal parallel lines.
43
plane as Poincaré ‘bugs’ [5]. He described the ‘bugs’ as having a spine that followed a
small segment of a P-line with three legs either side of it. For the Euclidean God-like
viewer the ‘bug’s’ legs closer to the boundry of SP would appear to be smaller than
those closer to the centre of SP . It would therefore appear that if the bug attempted
to walk in a straight line it would in fact follow the line of an arc.
More importantly, the Euclidean God-like viewer would notice that the ‘bug’ would
shrink as it approached the boundry of SP . For the ‘bug’ living in the plane, it would
not feel this decrease insize, for the ‘bug’ would feel no difference as it pointlessly
walked towards infinity.
Definition 27 Let A and B be two points inside the disc SP and let P and Q be the
points at the end of the chord through A and B. The Poincaré distance between the
points A and B is given as
Lets consider one example which we shall use its result in the next section. Let
the Poincaré disc SP lie in the complex plane.
Example Let the line P Q be the line that connects P = −1 and Q = 1. There P Q
is the diameter of SP passing through the origin. Let A be a point that lies in
the line segement OQ. The distance OA is therefore given by
Notice that the distances OP and OQ are equal. One should also take note that
the distance
AQ = 1 + OA and that AP = 1 − OA.
Therefore the distance dp (OA) is given by5
OP AQ 1 + OA
dp (OA) = ln · = ln
OQ AP 1 − OA
44
Figure 3.6: Angle of parallelism in the Poincaré disc
The proof of this theorem is fairly simple but not trivial.6 In section 2.7 we laboured
the importance of this formulae not only for hyperbolic geometry but for mathematics
as a whole.
Proof We begin by defining the line l in the unit disc SP , with origin O, of the
complex plane as
l = {z ∈ C : z ∈ (−1, 1), Im(z) = 0}
and let the points P and Q be the end points of l respectively. Now let A lie on
the perpendicular line to l given by the line m such that
Now let the angle of parallelism be θ and with respects to the distance dp (OA),
which we shall shorten to dP for ease. Therefore we can define θ as
θ = Π(dp ).
6
A sketch proof of this theorem is given in Greenberg [7] which omitts some algebraic and trigono-
metric manipulations. The omitted parts of the proof are detailed in the proof stated here.
45
This angle refers to the limiting parallel ray to l through A. In the Poincaré
disc, the limiting ray that passes through A is the arc of the circle σ, that has
the tangent P Q at Q which is orthogonal to SP .
Now we are able to construct a tangent to σ at A that meets l at the point R
(similar to section 2.7). Therefore by definition R is a pole of the chord AQ to
the circle σ. We know from circle geometry that ∠RAQ = ∠RQA. Lets denote
the magnitude in radians of these angles as ϕ also note that θ = ∠RAO. We
then have that ∠ORQ = π − 2ϕ and so
π θ
We can substitue ϕ = 4 − 2 into the trigonometric identity above to give
θ
1 − tan
π θ 2
tan − = θ
.
a 4 1 + tan 2
We can then substitute this equation into edp to give the following,
2 tan 2θ 1 + tan 2θ
dp θ
e = · = tan .
1 + tan (θ2) 2 2
θ Π(dp )
Thus edp = tan = tan , which is the required formula.
2 2
46
In section 2.4.2 we showed that if a surface has negative curvature then an inhabi-
tant on that surface would be able to determine its curvature by summing the interior
angles of a triangle. Hyperbolic triangles interior angles tend to zero as their verticies
approach infinity. There are many applets on the web that demonstrate this7 .
This property of hyperbolic triangles is a true mathematical phenomenon and here
is my example to illustrate this property. In this example we use an equilateral triangle
but it is an interesting exercise to prove this for any triangle in the hyperbolic plane.
Example Let there exist a equilateral triangle ∆T in the Poincaré disc. Recall that
we can submerge the disc SP in the complex plane with origin at O. We can
therefore describe its vertices A, B and C at the points
2 4
A = re0πi , B = re 3 πi , C = re 3 πi where r ∈ [0, 1].
We are then able to redefine the statement that the sum of the interior angles
of a triangle tend to zero as the vertices tend to infinity. Instead we can prove
the angle defect E(∆T ) approaches π as r tends to one.
Since all angles of ∆T are equal we can study one vertice and by symmetry it
will be true for the other two. We denote in radians each angle having a measure
of ϕ, and so the angle defect is given by
E(∆T ) = π − 3ϕ.
where the endpoints of L are P and Q at {−1} and {1} respectively. Notice that
A lies on L. If we extend the points B and B to the boundry of γ along the
P-lines connecting B and C to A, we can say they intersect at the ideal points
I1 and I2 respectively. Interestingly I1 and I2 are symmetric in the line l.
We then connect I1 and I2 by the P-line m which is orthogonal to l since I1 and
I2 are symmetric. We can call the point of intersection of the P-lines l and m
the point λ. Now we have the inequality
Now using the formula for distance in the Poincaré disc in section 3.2.3 we have
that
OP · AQ 1−r
dP (O, a) = ln = ln .
OQ, AP 1+r
7
[Link] joel/NonEuclid/[Link]
47
Now dp (λ, A) → ∞ as r → 1, and notice that dp (O, A) → ∞ as r → 1. We
know that Π(dp ) is a decreasing function so dp (O, A) approaches the boundry, ϕ
becomes the angle of parallelism such that
ϕ = 2Π(λ, a) → 0 as r → 1.
48
Figure 3.7: M.C. Escher, Circle Limit III (1959)[23]
Figure 3.8: Stereographic projection of the Poincaré disc to the Klein disc.
49
Take the unit disc S (which can be regarded as either the Klein Model or the
Poincaré disc model and consider them as the points τ and υ respectively) and let
S be the equator of a unit sphere Π. This we then allow us to construct a mapping
S : υ → τ ,that is an isomorphism for the Poincaré disc to the Klein model.
In Cartesian space we can obviously define Π as x2 + y 2 + z 2 = 1. Similar to
the Riemman sphere we can set our projection point as (0, 0, 1), the north pole
of the sphere denoted as N . We can then take a point p = (x, y, 0) belonging to
υ and project it onto the point on the southern hemisphere that intersects the line
t(x, y, 0) + (1 − t)(0, 0, 1) for 0 ≤ t ≤ 1 with the respective co-ordinates (tx, ty, 1 − t).
Substituting this projection line into the equation for Π, t can be found:
and so
2
t= .
x2+ y2 + 1
Our projected point on the sphere can be given by substituting the expression for t
into (tx, ty, 1 − t) which is
x2 + y 2 − 1
2x 2y
, , .
x2 + y 2 + 1 x2 + y 2 + 1 x2 + y 2 + 1
The equator of the sphere is the unit disc with the equation x2 +y 2 = 1. The numerator
of the third component above has the term x2 + y 2 − 1 which at the equator is zero.
Therefore we can project onto the disc using the mapping S given by [7]
2x 2y
S(x, y, 0) = , ,0 .
x2 + y 2 + 1 x2 + y 2 + 1
Note that clearly the z component of the co-ordinate is trivial and adds no addi-
tional information in this mapping. Therefore we can eliminate this component and
define the mapping in terms of a complex function. If we let w = x + iy we can show
that
2x 2y 2w
S(w) = 2 2
+i 2 2
= .
x +y +1 x +y +1 1 + |w|2
We now need to show that the mapping S actually maps the arc of a circle (P-line)
to an open chord (K-line). Let P and Q be the end points of any open chord in the
unit disc S. Then define a circle ζ as a circle orthogonal to S that intersects at P and
Q with origin C at co-ordinates (cx , cy ).
With this set up we are able to show that if there is a point R on the arc of the
circle ζ between P and Q inside S then S(R) is the point of intersection between
the line through the origin and R, and the open chord betweeen P and Q. This is
equivalent to showing the S maps the Poincaré disc to the Klein disc.
From basic geometry we know that the circle σ with diameter CO intersect S at
the points P and Q [7]. The equation for this circle is
50
cx 2 cy 2 cx 2 cy 2
x− + y− = + .
2 2 2 2
Expanding and simplifying this gives the equation of σ as
x2 − cx x + y 2 − cy y = 0.
To find the open chord connecting P and Q we simply subtract the equation for σ
from the equation for S.
S − σ ⇔ x2 + y 2 − (x2 − cx x + y 2 − cy y) = 1 − 0
⇔ cx x + cy y = 1.(†)
By construction ζ is orthogonal to S and so the Euclidean triangle 4OCQ is a right
angled triangle, with the right angle given at ∠OQC. We can then calculate the radius
of ζ by using the Euclidean Pythagoras’ Theorem.
q 2 −2
2 −2
2
−2
= c2x + c2y − 1
|QC| = |OC| − |QO| = 2 2
cx + cy − 1 .
Knowing this radius and from our definition of the origin of ζ being C the equation
for ζ is as follows:
(x − cx )2 + (y − cy )2 = c2x + c2y − 1.
Simplifying this equation gives an important relationship
51
Chapter 4
The final model we shall look at is the Poincaré upper half plane model. This chapter
has three main themes, the first discussing the characteristics of the model. The other
themes consider length in the upper half plane and we use the connection between the
Poincaré disc model and the upper half plane model to show that they are isometric
and derive the arc-length of the Poincaré disc model.
52
Figure 4.2: Hyperbolic lines in the upper half plane model
Definition 28 The first type of hyperbolic line is the intersection of H with a Eu-
clidean line in C perpendicular to the real axis R in C.
The second type of hyperbolic lines is the intersection of H with a Euclidean circle
centred on the real axis R. We call these line H-lines.
We can define parallel lines in a similar way to the other models of the hyperbolic
plane.
Definition 29 Two hyperbolic lines are parallel if and only if they are disjoint (do
not share any common points) We can distinguish between parallel lines in the upper
half plane model.
Definition 30 If parallel lines intersect at infinity they are known as parallel lines, if
the lines do not intersect at infinity then they are known ultra parallel hyperbolic lines.
This is a similar concept to that of ordinary, ideal and ultraideal lines in the Klein
model.
53
Theorem 7 For every positive constant c, the element of arc-length
c
|dz|
Im(z)
on H invariant under the action of Möb(H).
Firstly we must recall from earlier work,
Z Z b
lengthρ (f ) = ρ(z)|dz| = ρ(f (t))|f 0 (t)|dt.
f a
Also recall that lengthρ (f ) = lengthρ (f ◦ h). For our own condition that length is
invariant under the action of Möb(H) the following too must hold.
lengthρ (f ) = lengthρ (γ ◦ f )
where ρ(z)|dz| is the element of arc-length on H, f is a function such that f : [a, b]H
and finally γ belongs to the group of transformations on H Möb(H). Using previous
statements concerning lengths and the definition just given we can concude that
Z b
lengthρ (f ) = lengthρ (γ ◦ f ) = ρ((γ ◦ f )(t))|(γ ◦ f )0 (t))|dt
a
From the chain rule we know that
so substitute this in and then we can expand the expression for lengthρ (γ ◦ f ) to be
Z b
ρ((γ ◦ f )(t))|γ 0 (f (t))||f 0 (t)|dt.
a
So now we can bring the integrals to one side and therefore have
Z b Z b
ρ(f (t))|f 0 (t)|dt − ρ((γ ◦ f )(t))|γ 0 (f (t))||f 0 (t)|dt = 0
a a
In Anderson [1] a substitution µγ is made to simplify this integral and therefore make
future working less congested. We shall make this alteration and so we define µγ as
54
An important note made in Anderson [1] which we should mention here is that since
ρ(z) is contiuous and that γ is differentiable (which we have discussed in section 1.2.3),
we then have that µγ (z) is continuous for every element γ belonging to Möb(H).
A property of µγ we shall need to look at is the behaviour of µγ under composition
of two elements of the Möbius transformations group. Firstly let there be exist the
elements θ and φ belonging to Möb(H).
So µγ = µθ◦φ becomes
Now we shall be slightly devious and add nothing to this expression. Actually we shall
introduce
ρ(φ(z))|φ0 (z)| − ρ(φ(z))|φ0 (z)|,
which is equivalent to adding zero to the equation above and then µθ◦φ becomes
It should be clear that a generator function is itself an element the the transfor-
mation group Möb(H). From this we can consider the function m(z) and express it
as
az + b
m(z) =
cz + d
55
which by definition belongs to Möb(H) and for this to be true ad − bc = 1. There are
only two situations to consider for this to hold.
Firstly if c = 0. This would then imply the following using the expression we stated
above
az + b
m(z) =
d
and then ad = 1 which would allow the function m(z) = z + b to exist.
Secondly and perhaps more complicatedly if c 6= 0. For this we define two functions
f (z) and g(z) as
a
f (z) = z + and g(z) = c2 z + cd,
c
Knowing that K(z) = − z1 we have that
1 1 a
K(g(z)) − 2
z + cd and f (K(g(z)) = 2 + = m(z).
c c + cd c
To complete this proof we turn our attention to a new element n(z) defined as
az̄ + b
n(z) = .
cz + d
Only when a, b, c and d are imaginary and ab−dc = 1 can this be an element belonging
to Möb(H).
Obviously B ◦ n = m which by definition belongs to Möb(H). Therefore we have
shown that m(z), K(z) and B(z) generate the elements in Möb(H) and so are generator
functions.
for every piecewise differentiable path f : [a, b] → D, then it can be said that[1]µ ≡ 0.
Our aim now is to show that µ ≡ 0 for all elements γ ∈ Möb(H) as this will then allow
us to find ρ(z)|dz| i.e. the element of arc-length of H. One such method to show this
is to demonstrate that if µγ ≡ 0 for γ in the generating set of Möb(H), this would in
turn imply that µγ ≡ 0 for all γ ∈ Möb(H). We shall use the generating functions
discussed earlier as our generating set.
Lets consider the function in the generating set m(z) = az + b where a and b
are real numbers. For the following we use the method described in the literature in
Anderson [1].
We will break m(z) into two parts, focusing on γ(z) = z + b and then γ(z) = az.
Consider the following function γ(z) = z + b, and so γ 0 (z) = 1. By the lemma we can
construct the following
56
µγ = ρ(z) − ρ(γ(z))|γ 0 (z)| = ρ(z) − ρ(z + b).
Recalling that µγ ≡ 0, we therefore can have that ρ(z)−ρ(z+b) = 0 and then obviously
Lets take a closer look at this statement for a moment. Let z be split into x, y such
that z = x + iy and lets also suppose that there exists two points, say z1 and z2 that
satisfy z1 = x1 + iy and z2 = x2 + iy. It is important to notice here that both these
points fall on the same imaginary line.
By subtracting z2 from z1 we get
z1 − z2 = x 1 − x 2
Once again this expression is equivalent to zero as from the Lemma we know that
µ ≡ 0. Similarly we have that ρ(z) = aρ(az). Then substituting this into our real
function r that we discussed above we then have r(y) = ar(ay)
Note that we know that r(y) = ρ(iy) this is equivalent to saying that ρ(z) =
r(Im(z)). The quantities a and y are interchangeable as r(y) = ar(ay) can be stated
as
yr(ay) = r(a).
Lets now divide through by y and so
1
yr(ay) = r(a) ⇒ r(ay) = r(a)
y
Realising that a is simply a constant we can take a out of the function r(ay) and create
a function r(c) where c is some new constant. We then have that
1
r(y) = r(a).
y
Using the fact that r(y) = r(Im(z)) we can rewrite the expression as
1
r(Im(z)) = × c.
Im(z)
57
We noted earlier that ρ(z) = r(Im(z)) and so we have constructed that
c
ρ(z) =
Im(z)
c
We have shown it possible that Im(z)|dz| is a viable element of arc-length for H. How-
ever we need also to show that the element of arc-length is invariant with regards to
the other generating sets K(z) and B(z)1 . To show this we could derive the element
of arc-length again for both K(z) and B(z). However, it is just sufficient to show that
ρ(z) = Imc(z) satisfies ρ(z)|dz| under K(z) and B(z). This makes the proof much easier
and more straight forward.
Lets consider K(z) = − z1 where K 0 (z) = z12 . Then substituting this in µK we
know that
0 1 1
µK = ρ(z) − ρ(K(z))|K (z)| = ρ(z) − ρ −
z |z|2
We know that µK ≡ 0 gives us that
1 1
ρ(z) = ρ −
z |z|2
and so ρ(− z1 ) = ρ(− zz̄z̄ ) = ρ(− |z|z̄ 2 ). Using the fact that
c
|dz| is a possible element
Im(z)
c
of arc-length we can state that ρ(z) = Im(z) . We now have the following equations:
z̄ c
ρ − 2 =
|z| Im |z|−z̄
2
c Im(−z̄)
=
1 |z|2
|z|2 c
=
Im(−z̄)
|z|2 c
= .
Im(z̄)
So
1 1
µK = ρ(z) = ρ − .
z |z|2
This implies that
|z|2
c 1
= .
Im(z) |z|2 Im(z)
Cancelling |z|2 from the right-hand-side of the equation gives
c
= ρ(z).
Im(z)
1
This is posed in Anderson [1] as an exercise, the following description is my solution to the exercise
but a similar set of solutions can be found in [1].
58
This shows that the element of arc-length is invariant under the generating set K(z).
Finally we need to consider B(z) = −z̄. Unfortunatley we cannot approach this
function in the same way we have approached m(z) and K(z). In order to calculate
µB (z) we would need to find the derivative B 0 (z), which we are in fact unable to do.
Split z into its x and y components so that we have z̄ = x − iy. Using the Cauchy-
Riemann equations to test for complex differentiability in section 1.2.3 we find that
B ◦ Z = −U (t) + iV (t),
We can then use integration by change of variable fully given that |dz| = |(B)0 (t)|dt.
This then yields
Z
c
lengthρ (B ◦ Z) = |(B ◦ Z)0 (t)|dt|.
B◦Z Im(B ◦ Z)(t)
By the chain rule we have that |(B ◦ Z)(t)| = (1)|Z 0 (t)| and by inspection we know
that
Im((B ◦ Z)(t)) = v(t) = Im(z).
Putting these two statements into the previous expression we can show
Z
c
lengthρ (B ◦ Z) = |Z 0 |dt.
Im(Z)
Given that lengthρ (B ◦ Z) = lengthρ (z) we have then shown that B(z) is invariant
under the element of arc-length and thus proved the theorem.
59
4.4 Half Plane Model to Poincaré Disc Model
We can derive the arc-length for each model of the hyperbolic plane. However this
would be a long and fairly tiresome task. Instead we can transform the element of
arc-length from the upper half plane model to the other models. In this section we
shall show that Möbius transformations can map the upper half plane to the Poincaré
disc2 .
Firstly we must define a subset of C; or the moment let X ⊆ C. X must be a subset
of C such that there is a mapping δ : X → H that is one-to-one and let there exist
an inverse δ −1 of the function δ. The function δ must also be complex differentiable.
Obviously we take the unit disc as our subset of the complex plane so we have
X = SP ⊆ C.
Earlier we showed that a Mobius Transfomaton mapping the unit disc to the upper
half plane was defined as
iz + 1
φ(z) =
−(z + i)
As φ is a Mobius transformation it has an inverse and is complex differentiable, so
we can take φ(z) = δ(z) which yields the mapping δ : SP → H.
If we wish to, which we do not, we could use brute force and use our function δ to
transfer all the hyperbolic geometry in H to SP and so defining a hyperbolic line in
SP to simply be the image of the hyperbolic line in H using δ −1 . This makes difficult
looking mathematics and is a very long winded approach for making any calculations
in SP . Instead we transfer the hyperbolic element of arc-length from H to SP .
This is the most difficult step of the proccess, as once we have established the
new hyperbolic element of arc-length for SP we can actually use it to calculate the
geometry in the hyperbolic plane.
The following is from Anderson [1] and details a general method for constructing
this element of arc length.
Define the hyperbolic element of arc-length dsX on X by declaring that
Z Z
1
lengthX (f ) = dsX = |dz| = lengthH (δ ◦ f )
f δ◦f Im(z)
lengthγ (f ) = lengthγ (f ◦ h)
In other words we now need to derive the hyperbolic element of arc-length for SP using
the equations above.
2
For this section we shall follow Anderson [1].
60
Let there be the piecewise differentiable path f : [a, b] → SP with lengthSP (f ). We
also know from earlier work that
Z
1
length(f ) = |dz|.
δ◦f Im(z)
Z Z b Z
1 1 1
|dz| = |(δ ◦ f )0 (t)|dt = |δ 0 (z)||dz|.
δ◦f Im(z) a Im((δ ◦ f )(t)) f Im(δ(z))
Now to continue with this calculation we need to find the imaginary part of δ.
Firstly manipulate the expression δ(z) such that
1 − |z|2
−iz − 1
So from this Im(δ(z)) = Im = . Finally it can be shown that
z+i | − z − i|2
2
|δ 0 (z)| = .
1 − |z|2
By combining these to we can then show that the hyperbolic element of arc-length is
2
|dz|.
1 − |z|2
So any Mobius transformation ρ can be used and will yield the same hyperbolic element
of arc length as
2
|dz|.
1 − |z|2
61
Chapter 5
There is no natural conclusion to this project. The ideas discussed here would be of
concern to a reader who has strongly followed Euclidean geometry throughout their
mathematical education. They can however take some comfort in the knowledge that
they are not alone.
The birth of non-Euclidean geometry was like an earthquake for mathematics,
destroying its very foundations. Mathematicians began to ask themselves profound
questions: What is a point? What is a line? What is geometry? It is worth noting
that the word ‘geometry’ is derived from the ancient Greek meaning ‘to measure the
Earth’. When one considers these questions they realise that geometry is concerned
with idealised lines. Diagrams are only there to help us gain a better insight into the
mathematics as opposed to dictating the laws of geometry.
Consider an Euclidean right angled triangle with sides√ of unit length. By Pythago-
ras’ theorem we know the hypotenuse of that triangle is 2, an irrational number. We
are able to draw such a triangle without any concern for its irrational hypotenuse.
From this simple illustration it is easier to understand the essence of geometry as
being of a purely mathematical form.
To what degree can one believe the hyperbolic axiom? Is it conceivable? In
Trudeau [22] a section entitled The Psychological Impossibility of Non-Euclidean Ge-
ometry describes a conversation between a professor and a student concerning the
hyperbolic axiom. By the end of the conversation the reader begins to realise that the
negation of Euclid’s postulate is perfectly plausible and not absurd.
Is hyperbolic geometry consistent? We can turn this question on its head and ask:
Is Euclidean geometry consistent? We can reach the logical conjecture that Euclidean
geometry is consistent if and only if hyperbolic geometry is consistent1 . For some this
may feel as if we are just trying to avoid proving the parallel postulate.
If we consider this in more detail, an interesting result unfolds. Lets assume that
there is a proof that verifies the parallel postulate. Consequently the hyperbolic ax-
iom would be false and so hyperbolic geometry would be inconsistent. Under this
assumption our conjecture states that Euclidean geometry would also be inconsistent.
1
This is Metamathematical Theorem 1 in Greenberg [7]
62
Therefore the disproof of the parallel will never be found. This unfortunately im-
plies that the attempts to prove the postulate, which are still worthy, were inevitably
unsuccessful.
Another approach is to consider which geometry is more convenient2 . For instance,
engineers need Euclidean geometry to build structures however physicists have found
hyperbolic geometry useful when investigating special relativity.
This leads to my penultimate point. Having discovered non-Euclidean geometry,
we can look at our universe through different eyes and ask ourselves: Is the geometry
of the universe Euclidean? In 1830 Professor Ostrogradsky of Petrograd University
walked out of a lecture on Non-euclidean geometry saying that there was no point to it
since the universe was Euclidean [13]. Unlike Ostrogradsky, Gauss was not convinced
that curvature of the universe was zero. He allegedly used the peaks of three mountains
as the vertices of a giant triangle in an attempt to measure the curvature. Sadly his
results were inconclusive due to experimental error. [7]
We have considered much ground breaking mathematics and our adventure of
hyperbolic geometry has come to an end. However, there is much more literature on
this subject and this project could easily have continued for another hundred pages
and so in truth your hyperbolic journey has only just begun. We could investigate
more geometry of the hyperbolic plane such as trigonometry and tesselations or we
could stride further into hyperbolic space and consider higher dimensions. I hope this
material has left you with a sense of awe for mathematics and its infinite horizons
(even if they are bounded).
Finally, it is worth noting that non-Euclidean geometry does have a turbulent his-
tory. There was much resistance to non-Euclidean geometry as its implications were
enormous. Farkas Bolyai, one of the founding fathers of this geometry, faced much
pressure not to publish his work. Bolyai was warned by his father not to publish his
findings so not to risk his life and happiness. Boylai remained defiant and replied to
his father, [7]
“But I have discovered such wonderful things that I was amazed, and it
would be an everlasting piece of bad fortune if they were lost. When you,
my dear father, see them, you will understand; at present I can say nothing
except this: that out of nothing I have created a strange new universe.”
2
Poincaré famously pointed out that the geometry we used only depends on the needs we have for
it.[5]
63
Bibliography
[4] R. L. Faber Differential Geometry and Relativity Theory, Marcel Dekker INC,
1983.
[8] T. L. Heath, Euclid, The Thirteen Books of the Elements, Dover Publications,
Vol I, 1956.
[9] C. J. Isham, Modern Differential Geometry for Physicists, World Scientific Pub-
lishing, 1999.
[11] J. L. Locher, editor, M. C. Escher, his life and complete graphic work, Abradale
Press, New York, 1992.
[13] J. P Petit, The Adventures of Archibald Higgins: Euclid Rules OK?, John Murray
Ltd, 1982.
64
[16] J.G. Ratcliffe, Foundations of Hyperbolic Manifolds, Springer-Verlag, 1994.
Websites
[23] [Link]
Computer Programmes
Maple 11, Version for Windows
[Link], Linux
LATEX, TEXnic Center for Windows
65