The Kurzweil-Henstock Integral For Undergraduates: Alessandro Fonda
The Kurzweil-Henstock Integral For Undergraduates: Alessandro Fonda
Alessandro Fonda
The Kurzweil-
Henstock
Integral for
Undergraduates
A Promenade Along the Marvelous
Theory of Integration
Compact Textbooks in Mathematics
Compact Textbooks in Mathematics
The
Kurzweil-Henstock
Integral for
Undergraduates
A Promenade Along the Marvelous
Theory of Integration
Alessandro Fonda
Dipartimento di Matematica e Geoscienze
Università degli Studi di Trieste
Trieste, Italy
Mathematics Subject Classification (2010): 26A39, 26A42, 26A63, 26A66, 26B15, 26B20, 28A75
Introduction
This book is the outcome of the beginners’ courses held over the past
few years for my undergraduate students. The aim was to provide them
with a general and sufficiently easy to grasp theory of the integral. The
integral in question is indeed more general than Lebesgue’s in RN , but its
construction is rather simple, since it makes use of Riemann sums which,
being geometrically viewable, are easily understandable.
This approach to the theory of the integral was developed independently
by Jaroslav Kurzweil and Ralph Henstock since 1957 (cf. [5, 8]). A number
of books are now available [1, 4, 6, 7, 9–13, 15–19, 21]. However, I feel that
most of these monographs are addressed to an expert reader, rather than to
a beginner student. This is why I wanted to maintain here the exposition
at a very didactical level, trying to avoid as much as possible unnecessary
technicalities.
The book is divided into three main chapters and five appendices, which
I now briefly describe, mainly as a guide for the lecturer.
The first chapter outlines the theory for functions of one real variable. I
have done my best to keep the explanation as simple as can be, following
as far as possible the lines of the theory of the Riemann integral. However,
there are some interesting peculiarities.
▬ The Fundamental Theorem of differential and integral calculus is very
general and natural: one only has to assume the given function to be
primitivable, i.e., to be the derivative of a differentiable function. The
proof is simple and clearly shows the link between differentiability and
integrability.
▬ The generalized integral, on a bounded but not compact interval, is
indeed a standard integral: in fact, Hake’s theorem shows that a function
having a generalized integral on such an interval can be extended to a
function which is integrable in the standard sense on the closure of its
domain.
▬ Integrable functions according to Lebesgue are those functions which are
integrable and whose absolute value is integrable, too.
Contents
Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 219
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 221
1 1
Alessandro Fonda
Along this chapter, we denote by I a compact interval of the real line R, i.e., an interval
of the type [a, b].
Definition 1.1
A P-partition of the interval I = [a, b] is a set
whose elements appear as couples (xj , [aj −1 , aj ]), where [aj −1 , aj ] is a subset of
I and xj is a point in it. Precisely, we have
xj ∈ [aj −1 , aj ] .
2 Chapter 1 • Functions of One Real Variable
1
Example Consider the interval [0, 1]. As examples of P-partitions of I we have the
following sets:
1
= , [0, 1]
6
1 1 1
= 0, 0, , , ,1
3 2 3
1 1 1 1 2 2 2
= , 0, , , , , , ,1
3 3 3 3 3 3 3
1 1 3 1 1 5 1 3 7 3
= , 0, , , , , , , , , ,1 .
8 4 8 4 2 8 2 4 8 4
We consider now a function f , defined on the interval I , having real values. To each
P-partition of the interval I we can associate a real number, in the following way.
Definition 1.2
Let f : I → R be a function and
m
S(I, f, ) = f (xj )(aj − aj −1 ) .
j =1
In order to better understand this definition, assume for simplicity the function f to
be positive on I. Then, to each P-partition of I we associate the sum of the areas of the
rectangles having base [aj −1 , aj ] and height [0, f (xj )].
f (x )
a x 1 a 1 x2 a2 x3 a 3 x4 b x
1.2 · The Notion of δ-Fineness
3 1
If f is not positive on I, the areas will be considered with positive or negative sign
depending on whether f (xj ) be positive or negative, respectively. If f (xj ) = 0, the j -th
term of the sum will obviously be zero.
Then,
15 1 1 33 1 9
S(I, f, ) = − · +0· + · = .
16 4 2 16 4 32
Now we ask whether, taking the P-partitions finer and finer, the Riemann sums
associated to them will converge to some value. When this happens for a positive
function f, such a value can be visualized as the area of the region in the Cartesian
plane which is confined between the graph of f and the horizontal axis. To be able to
analyze this question, we need to specify what we mean for a P-partition to be “fine”.
Let us introduce the notion of “fineness” for the P-partition previously defined. For
brevity, we call gauge on I every function δ : I → R such that δ(x) > 0 for every
x ∈ I. Such a function will be useful for having a control on the amplitude of the
various intervals determined by the points of the P-partition.
Definition 1.3
Given a gauge δ on I, we say that the P-partition introduced above is δ-fine if,
for every j = 1, . . . , m,
or else
xj − δ(xj ) ≤ aj −1 ≤ xj ≤ aj ≤ xj + δ(xj ) .
We will show now that it is always possible to find a δ-fine P-partition of the interval
I, whatever the gauge δ. In the following theorem, due to P. Cousin, the compactness of
the interval I plays an essential role.
4 Chapter 1 • Functions of One Real Variable
1
Theorem 1.4
Given a compact interval I, for every gauge δ on I there is a δ-fine P-partition of I.
Proof
We proceed by contradiction. Assume there exists a gauge δ on I for which it is impossible to
find any δ-fine P-partition of I. Let us divide the interval I in two equal sub-intervals, having
the mid point of I as common extremum. Then, at least one of the two sub-intervals does not
have any δ-fine P-partition. Let us choose it, and divide it again in two equal sub-intervals.
Continuing this way, we construct a sequence (In )n of bottled sub-intervals, whose lengths
tend to zero, each of which does not have any δ-fine P-partitions. By the Cantor Theorem
there is a point c ∈ I belonging to all of these intervals. Moreover, it is clear that, from a
certain n thereof, every In will be contained in [c − δ(c), c + δ(c)]. Choose one of these,
e.g. In̄ . Then the set = {(c, In̄ )}, whose only element is the couple (c, In̄ ), is a δ-fine
P-partition of In̄ , in contradiction with the above.
Examples Let us see, as examples, some δ-fine P-partitions of the interval [0, 1]. We start
with a constant gauge: δ(x) = 15 . Since the previous theorem does not give any information
on how to find a δ-fine P-partition, we will proceed by guessing. As a first guess, we choose
the aj equally distant and the xj as the middle points of the intervals [aj −1 , aj ]. Hence:
j 2j − 1
aj = , xj = (j = 1, . . . , m) .
m 2m
For the corresponding P-partition to be δ-fine, it has to be
1 1 1 1
xj − aj −1 = ≤ , aj − xj = ≤ .
2m 5 2m 5
If, instead of taking the points xj in the middle of the respective intervals, we would like
−1
to choose them, for example, at the left extremum, i.e. xj = j m , in order to have a δ-fine
P-partition we should ask that
1 1 1
xj − aj −1 = 0 ≤ , aj − xj = ≤ .
5 m 5
These inequalities are verified if m ≥ 5. For instance, if m = 5, we have the δ-fine P-partition
1 1 1 2 2 2 3 3 3 4 4 4
0, 0, , , , , , , , , , , , ,1 .
5 5 5 5 5 5 5 5 5 5 5 5
1.2 · The Notion of δ-Fineness
5 1
Notice that, with such a choice of the aj , if m ≥ 5, the points xj can actually be taken
arbitrarily in the respective intervals [aj −1 , aj ], still obtaining δ-fine P-partitions.
The previous example shows how it is possible to construct δ-fine P-partitions in the case
of a gauge δ which is constant with value 15 . It is clear that a similar procedure can be used for
a constant gauge with arbitrary value. Consider now the case when δ is a continuous function.
Then, the Weierstrass Theorem says that there is for δ(x) a minimum positive value: let it be
δ̄. Consider then the constant gauge with value δ̄, and construct a δ̄-fine P-partition with the
procedure we have seen above. Clearly, such a P-partition has to be δ-fine, as well. We have
thus seen how the case of a continuous gauge can be reduced to that of a constant gauge.
Consider now the following non-continuous gauge:
⎧
⎪ 1
⎨ if x = 0 ,
δ(x) = 2x
⎪
⎩ if x ∈ ]0, 1] .
2
As before, we proceed by guessing. Let us try, as above, taking the aj equally distant and the
xj as the middle points of the intervals [aj −1 , aj ]. This time, however, we are going to fail;
indeed, we should have
x1
x1 = x1 − a0 ≤ δ(x1 ) = ,
2
which is clearly impossible if x1 > 0. The only way to solve this problem is to choose
x1 = 0. We decide then, for instance, to take the xj to coincide with aj −1 , as was also done
above. We thus find a δ-fine P-partition:
1 1 1 3 3 3
0, 0, , , , , , ,1 .
2 2 2 4 4 4
Consider a function f , defined on the interval I = [a, b]. We are now in the position to
define what we mean by convergence of the Riemann sum when the P-partitions become
“finer and finer”.1
Definition 1.5
A function f : I → R is said to be integrable if there is a real number A with the
following property: given ε > 0, it is possible to find a gauge δ on I such that, for
every δ-fine P-partition of I, it is
|S(I, f, ) − A| ≤ ε .
Let us prove that there is at most one A ∈ R which verifies the conditions of the
definition. If there were a second one, say A , we would have that, for every ε > 0 there
would be two gauges δ and δ on I associated respectively to A and A by the definition.
Define the gauge
Once a δ -fine P-partition of I is chosen, we have that is both δ-fine and δ -fine,
hence
1 The following definition is due to J. Kurzweil and R. Henstock: see Appendix E for a historical overview.
1.3 · Integrable Functions on a Compact Interval
7 1
The presence of the letter x in the notation introduced here has no independent
importance. It could be replaced by any other letter t, u, α, . . . , or by any other symbol,
unless already used with another meaning. For reasons to be explained later on, we set,
moreover,
a
b
a
f =− f, and f = 0.
b a a
m
S(I, f, ) = xj (aj − aj −1 ) .
j =1
In order to find a candidate for the integral, let us consider a particular P-partition where the
xj are the middle points of the intervals [aj −1 , aj ]. In this particular case, we have
m
m
aj −1 + aj 1 2
m
1
xj (aj − aj −1 ) = (aj − aj −1 ) = (aj − aj2−1 ) = (b2 − a 2 ).
2 2 2
j =1 j =1 j =1
We want to prove now that the function f (x) = x is integrable on [a, b] and that its integral
is really 12 (b2 − a 2 ). Fix ε > 0. Taken any P-partition , we have:
m
1 m
a −1 + a
S(I, f, ) − (b2 − a 2 ) = xj (aj − aj −1 ) −
j j
(aj − aj −1 )
2 j =1 j =1
2
m
aj −1 + aj
≤
xj − 2 (aj − aj −1 )
j =1
m
aj − aj −1
≤ (aj − aj −1 ) .
2
j =1
ε
If we choose the gauge δ to be constant with value b−a , then, for every δ-fine P-partition
we have:
m
aj − aj −1 ε
m
S(I, f, ) − 1 (b2 − a 2 ) ≤ 2δ = (aj − aj −1 ) = ε .
2 2 b−a
j =1 j =1
8 Chapter 1 • Functions of One Real Variable
1
The condition of the definition is thus verified with this choice of the gauge, and we have
proved that
b 1 2
x dx = (b − a 2 ) .
a 2
j −1 + aj −1 aj + aj
m a2 2
1 3
m
1
(aj − aj −1 ) = (aj − aj3−1 ) = (b3 − a 3 ) .
3 3 3
j =1 j =1
At this point, it is possible to proceed like in the case n = 1 to prove that the function
f (x) = x 2 is integrable on [a, b] and that its integral is 13 (b3 − a 3 ) : once fixed ε > 0,
choose the constant gauge δ = 2(b2ε−a 2 ) so that, for any δ-fine P-partition,
m
1 aj2−1 + aj −1 aj + aj2
S(I, f, ) − (b − a ) ≤
3 3
xj −
2
(aj − aj −1 )
3 3
j =1
m
≤ (aj2 − aj2−1 )2δ
j =1
ε
m
= (aj2 − aj2−1 ) = ε .
b2 −a 2
j =1
Let f, g be two real functions defined on I = [a, b], and α ∈ R be a constant. It is easy
to verify that, for every P-partition of I,
These linearity properties are inherited by the integral, as will be proved in the following
two propositions.
Proposition 1.6
If f and g are integrable on I, then f + g is integrable on I and one has
(f + g) = f+ g.
I I I
Proof
Set A1 = I f and A2 = I g. Being ε > 0 fixed, there are two gauges δ1 and δ2 on I such
that, for every P-partition of I, if is δ1 -fine, then
ε
|S(I, f, ) − A1 | ≤ ,
2
Let us define the gauge δ on I as δ(x) = min{δ1 (x), δ2 (x)}. Let be a δ-fine P-partition of
I. It is thus both δ1 -fine and δ2 -fine, and we have:
Proposition 1.7
If f is integrable on I and α ∈ R, then αf is integrable on I and one has
(αf ) = α f.
I I
10 Chapter 1 • Functions of One Real Variable
1
Proof
If α = 0, the result is obvious. If α
= 0, set A = I f and fix ε > 0. There is a gauge δ on I
such that
ε
|S(I, f, ) − A| ≤ ,
|α|
for every δ-fine P-partition of I. Then, for every δ-fine P-partition of I, we have
ε
|S(I, αf, ) − αA| = |αS(I, f, ) − αA| ≤ |α| = ε,
|α|
We have just proved that the set of integrable functions on [a, b] is a vector space
and that the integral is a linear function on it.
Example Every polynomial function is integrable on an interval [a, b]. If for instance
f (x) = 2x 2 − 3x + 7, we have
b
b
b
b 2 3 3
f =2 x 2 dx − 3 x dx + 7 1 dx = (b − a 3 ) − (b2 − a 2 ) + 7(b − a) .
a a a a 3 2
We now study the behavior of the integral with respect to the order relation in R.
Proposition 1.8
If f is integrable on I and f (x) ≥ 0 for every x ∈ I, then
f ≥ 0.
I
Proof
Fix ε > 0. There is a gauge δ on I such that
S(I, f, ) − f ≤ ε ,
I
Corollary 1.9
If f and g are integrable on I and f (x) ≤ g(x) for every x ∈ I, then
f ≤ g.
I I
Proof
It is sufficient to apply the preceding proposition to the function g − f.
Corollary 1.10
If f and |f | are integrable on I, then
f ≤ |f | .
I I
Proof
Applying the preceding corollary to the inequalities
−|f | ≤ f ≤ |f | ,
we have
− |f | ≤ f ≤ |f | ,
I I I
The following theorem constitutes a link between the differential and the integral
calculus. It is called the Fundamental Theorem of differential an integral calculus.
More briefly, we will call it the Fundamental Theorem.
Theorem 1.11
Let F : [a, b] → R be a differentiable function, and let f be its derivative: F (x) =
f (x) for every x ∈ [a, b]. Then, f is integrable on [a, b], and
b
f = F (b) − F (a) .
a
12 Chapter 1 • Functions of One Real Variable
1
Proof
Fix ε > 0. For any x ∈ I = [a, b], since F (x) = f (x), there is a δ(x) > 0 such that, for
every u ∈ I ∩ [x − δ(x), x + δ(x)], one has
ε
|F (u) − F (x) − f (x)(u − x)| ≤ |u − x| .
b−a
We thus have a gauge δ on I. Consider now a δ-fine P-partition of I ,
xj − δ(xj ) ≤ aj −1 ≤ xj ≤ aj ≤ xj + δ(xj ) ,
one has
We deduce that
m
F (b) − F (a) − f (xj )(aj − aj −1 ) =
j =1
m
m
= [F (aj ) − F (aj −1 )] − f (xj )(aj − aj −1 )
j =1 j =1
m
= [F (aj ) − F (aj −1 ) − f (xj )(aj − aj −1 )]
j =1
m
≤ F (aj ) − F (aj −1 ) − f (xj )(aj − aj −1 )
j =1
m
ε
≤ (aj − aj −1 ) = ε ,
b−a
j =1
Definition 1.12
A function f : I → R is said to be primitivable if there is a differentiable function
F : I → R such that F (x) = f (x) for every x ∈ I. Such a function F is called a
primitive of f .
Theorem 1.13
Let f : [a, b] → R be primitivable and let F be a primitive. Then f is integrable on
[a, b], and
b
f = F (b) − F (a) .
a
It is sometimes useful to denote the difference F (b) − F (a) with the symbols
[F ]ba , [F (x)]x=b
x=a ,
The fact that the difference F (b) − F (a) does not depend from the chosen primitive
is explained by the following proposition.
Proposition 1.14
Let f : I → R be a primitivable function, and let F be one of its primitives. Then, a
function G : I → R is a primitive of f if and only if F − G is a constant function on
I.
14 Chapter 1 • Functions of One Real Variable
1
Proof
If F − G is constant, then
which we call the indefinite integral of f . We denote this function by one of the
following symbols:
·
·
f, f (t) dt
a a
(notice that in this last notation it is convenient to use a different letter from x for the
variable of f ; for instance, we have chosen here the letter t). The Fundamental Theorem
tells us that, if F is a primitive of f, then, for every x ∈ [a, b],
x
f = F (x) − F (a) .
a
·
This fact yields, taking into account Proposition 1.14, that the function a f is itself a
primitive of f. We thus have the following
Corollary 1.15
·
Let f : [a, b] → R be a primitivable function. Then, the indefinite integral a f is
one of its primitives: it is a function defined on [a, b], differentiable and, for every
x ∈ [a, b], we have
·
f (x) = f (x) .
a
·
Notice that the choice of the point a in the definition of a f is by no way necessary.
·
One could take any point ω ∈ I and consider the function ω f. The conventions made
1.6 · Primitivable Functions
15 1
on the integral with exchanged extrema are such that the above stated theorem still holds.
Indeed, if F is a primitive of f, even if x < ω we have
x
ω
f =− f = −(F (ω) − F (x)) = F (x) − F (ω) ,
ω x
·
so that ω f is still a primitive of f.
We will denote the set of all primitives of f with one of the following symbols:
f, f (x) dx .
Concerning the use of x, an observation analogous to the one made for the integral can
be made here, as well: it can be replaced by any other letter or symbol, with the due
precautions. When applying the theory to practical problems, however, if F denotes a
primitive of f, instead of correctly writing
f = {F + c : c ∈ R} ,
where c ∈ R stands for an arbitrary constant; we will adapt to this habit, too. Let us
make a list of primitives of some elementary functions:
ex dx = ex + c ,
sin x dx = − cos x + c ,
cos x dx = sin x + c ,
x α+1
x α dx = +c (α
= −1) ,
α+1
1
dx = ln |x| + c ,
x
1
dx = arctan x + c ,
1 + x2
1
√ dx = arcsin x + c .
1 − x2
16 Chapter 1 • Functions of One Real Variable
1
Notice that the definition of primitivable function makes sense even in some cases where
f is not necessarily defined on a compact interval, and indeed the formulas above are
valid on the natural domains of the considered functions.
Notice that the presence of the arbitrary constant c can sometimes lead to apparently
different results. For example, it is readily verified that one also has
1
√ dx = − arccos x + c .
1 − x2
This is explained by the fact that arcsin x = π2 − arccos x for every x ∈ [−1, 1], and
one should not think that here c refers to the same constant as the one appearing in the
last formula of the above list.
One should be careful with the notation introduced for the primitives, which looks
similar to that for the integral, even if the two concepts are completely different. Their
relation comes from the Fundamental Theorem: we have
·
f ∈ f,
ω
actually the left term is a real number, while the right term is something we have not even
b
defined (it could be the set whose only element is a f ). In the applications, however,
one often abuses of these notations.
From the known properties of derivatives, one can easily prove the following
proposition.
Proposition 1.16
Let f and g be two functions, primitivable on the interval I, and α ∈ R be arbitrary.
Let F and G be two primitives of f and g, respectively. Then
(Continued )
1.6 · Primitivable Functions
17 1
E = {x ∈ [a, b] : f (x)
= 0}
Let now = {(x1 , [a0 , a1 ]), . . . , (xm , [am−1 , am ])} be a δ-fine P-partition of [a, b]. By
the way f is defined, the associated Riemann sum becomes
S([a, b], f, ) = f (xj )(aj − aj −1 ) .
{1≤j ≤m : xj ∈E}
2 Hereand in the following we use in an intuitive way the algebraic operations involving sets. To be precise, the
sum of two sets A and B is defined as
A + B = {a + b : a ∈ A, b ∈ B} .
18 Chapter 1 • Functions of One Real Variable
1
Now, [aj −1 , aj ] ⊆ [xj − δ(xj ), xj + δ(xj )], so that aj − aj −1 ≤ 2δ(xj ), and if xj is in
E it is xj = en , for some n ∈ N. To any such en can however correspond one or two
points xj , so that we will have
∞
∞
ε
f (xj )(aj − aj −1 ) ≤ 2 |f (en )|2δ(en ) = 4 = ε.
2 n+3
{1≤j ≤m : xj ∈E} n=0 n=0
b
This shows that f is integrable on [a, b] and that a f = 0.
Let us see now that, if E isnon-empty, then f is not primitivable on [a, b]. Indeed, if
·
it were, its indefinite integral a f should be one of its primitives. Proceeding as above,
x
one sees that, for every x ∈ [a, b], one has a f = 0. Then, being the derivative of a
constant function, f should be identically zero, which is false.
We introduce here two methods frequently used for determining the primitives of certain
functions. The first is known as the method of primitivation by parts.
Proposition 1.17
Let F, G : I → R be two differentiable functions, and let f, g be the respective
derivatives. One has that f G is primitivable on I if and only if such is F g, in which
case a primitive of f G is obtained subtracting from F G a primitive of F g; we will
briefly write:
f G = FG − Fg .
Proof
Being F and G differentiable, such is F G, as well, and we have
(F G) = f G + F g .
Example We would like to find a primitive of the function h(x) = xex . Define the following
functions: f (x) = ex , G(x) = x, and consequently F (x) = ex , g(x) = 1. Applying the
formula given by the above proposition, we have:
ex x dx = ex x − ex dx = xex − ex + c ,
b
b
f G = F (b)G(b) − F (a)G(a) − Fg .
a a
Examples Applying the formula directly to the function h(x) = xex of the previous
example, we obtain
1
1
ex x dx = e1 · 1 − e0 · 0 − ex dx = e − [ex ]10 = e − (e1 − e0 ) = 1 .
0 0
Notice that we could attain the same result using the Fundamental Theorem, having already
found that a primitive of h is given by H (x) = xex − ex :
1
ex x dx = H (1) − H (0) = (e − e) − (0 − 1) = 1 .
0
Let us see some more examples. Let h(x) = sin2 x. With the obvious choice of the
functions f and G, we find
sin2 x dx = − cos x sin x + cos2 x dx
= − cos x sin x + (1 − sin2 x) dx
= x − cos x sin x − sin2 x dx ,
Consider now the case of the function h(x) = ln x, with x > 0. In order to apply the
formula of primitivation by parts, we choose the functions f (x) = 1, G(x) = ln x. In this
way, we find
1
ln x dx = x ln x − x dx = x ln x − 1 dx = x ln x − x + c .
x
Proposition 1.18
Let ϕ : I → R be a differentiable function and f : ϕ(I ) → R be a primitivable
function on the interval ϕ(I ), with primitive F. Then, the function (f ◦ ϕ)ϕ is
primitivable on I, and one of its primitives is given by F ◦ ϕ. We will briefly write:
(f ◦ ϕ)ϕ = f ◦ϕ.
Proof
The composite function F ◦ ϕ is differentiable on I and
(F ◦ ϕ) = (F ◦ ϕ)ϕ = (f ◦ ϕ)ϕ .
2
Example We look for a primitive of the function h(x) = xex . Defining ϕ(x) = x 2 , f (t) =
1 t
2 e (it is advisable to use different letters to indicate the variables of ϕ and f ), we have that
h = (f ◦ ϕ)ϕ . Since a primitive of f is seen to be F (t) = 12 et , a primitive of h is F ◦ ϕ, i.e.
2 1 x2
xex dx = F (ϕ(x)) + c = e +c.
2
and thus find 12 (ln x)2 + c (in this case, writing t = ln x, one has that the symbol dt replaces
1
x dx).
b
ϕ(b)
f (ϕ(x))ϕ (x) dx = f (t) dt .
a ϕ(a)
2
Example Taking the function h(x) = xex considered above, and defining ϕ(x) = x 2 ,
f (t) = 12 et , we have
2 2
4 1 t 1 e4 − 1
xex dx = e dt = [et ]40 = .
0 0 2 2 2
Clearly, the same result is obtainable directly by the Fundamental Theorem, once we know
2
that a primitive of h is given by H (x) = 12 ex . Indeed, we have
2
2 1 4 1 0 e4 − 1
xex dx = H (2) − H (0) = e − e = .
0 2 2 2
√
Example Looking for a primitive of f (t) = 1 − t 2 , with t ∈ ] − 1, 1[ , we may try to
consider the function ϕ : ]0, π[→] − 1, 1[ defined as ϕ(x) = cos x, so that
f (ϕ(x))ϕ (x) = 1 − cos2 x (− sin x) = − sin2 x .
22 Chapter 1 • Functions of One Real Variable
1
As we have already proved, this last function is primitivable, so we can write
1 − t 2 dt = − sin2 x dx
x=arccos t
1
=− (x − sin x cos x) +c
2 x=arccos t
1
= − arccos t − t 1 − t 2 + c .
2
We are now in the position to compute primitives and integrals for a large class of
functions. Some of these are proposed in the exercises below.
Exercises
1. Making use of the known rules for the computation of the primitives, recover the
following formulas:
1 1
dx = − +c,
(2 + 3x)7 18(2 + 3x)6
√ 2
x + 7 dx = (x + 7)3 + c ,
3
x 2 + 3x − 2 2 5 √
√ dx = x + 2 x3 − 4 x + c ,
x 5
1 2
√ √ dx = − (x + 1)3 + x 3 + c ,
x − x+1 3
1
dx = ln |x − 3| − ln |x − 2| + c ,
x2 − 5x + 6
1
dx = arctan(x + 2) + c ,
x 2 + 4x + 5
1 1
dx = tan x − +c,
sin2 x cos2 x tan x
1
dx = 2 arctan(ex ) + c ,
cosh x
ln x 1
dx = (ln x)2 + c .
x 2
1.8 · The Cauchy Criterion
23 1
2. Primitivation by parts gives the following:
x sin x dx = sin x − x cos x + c ,
√ √
1 − x2 1 − x2
dx = − + arcsin x + c ,
x2 x
(ln x)2 dx = x (ln x)2 − 2 ln x + 2 + c ,
arcsin x dx = x arcsin x + 1 − x2 + c .
Theorem 1.19
A function f : I → R is integrable if and only if for every ε > 0 there is a gauge δ on
of I , one has
I such that, taking two δ-fine P-partitions ,
≤ ε.
|S(I, f, ) − S(I, f, )|
Proof
Let us see first the necessary condition. Let f be integrable on I with integral A, and fix
ε > 0. Then, there is a gauge δ on I such that, for every δ-fine P-partition of I, it is
ε
|S(I, f, ) − A| ≤ .
2
24 Chapter 1 • Functions of One Real Variable
1
are two δ-fine P-partitions, we have:
If and
ε ε
)| ≤ |S(I, f, ) − A| + |A − S(I, f,
|S(I, f, ) − S(I, f, )| ≤ + = ε.
2 2
Let us see now the sufficient condition. Once assumed the stated condition, let us choose
ε = 1 so that we find a gauge δ1 on I such that
≤ 1,
|S(I, f, ) − S(I, f, )|
whenever and are δ1 -fine P-partitions of I. Taking ε = 1/2, we can find a gauge δ2 on
I, that we can choose so that δ2 (x) ≤ δ1 (x) for every x ∈ I, such that
)| ≤ 1 ,
|S(I, f, ) − S(I, f,
2
whenever and are δ2 -fine P-partitions of I. We can continue this way, choosing ε = 1/k,
with k a positive integer, and find a sequence (δk )k of gauges on I such that, for every x ∈ I,
)| ≤ 1 ,
|S(I, f, ) − S(I, f,
k
1 1
|S(I, f, k1 ) − S(I, f, k2 )| ≤ ≤ ≤ ε̄ .
k1 m
This proves that (S(I, f, k ))k is a Cauchy sequence. Hence, it has a finite limit, which we
denote by A.
Now we show that A is just the integral of f on I. Fix ε > 0, let n be a positive integer
such that nε ≥ 1, and consider the gauge δ = δn . For every δ-fine P-partition of I and for
every k ≥ n, it is
1
|S(I, f, ) − S(I, f, k )| ≤ ≤ ε.
n
|S(I, f, ) − A| ≤ ε .
In this section we will see that if a function is integrable on an interval I = [a, b],
it is also integrable on any of its sub-intervals. In particular, it is possible to consider
its indefinite integral. Moreover, we will see that, if a function is integrable on two
contiguous intervals, it is also integrable on their union. More precisely, we have the
following property of additivity on sub-intervals.
Theorem 1.20
Given three points a < c < b, let f : [a, b] → R be a function. Then, f is integrable
on [a, b] if and only if it is integrable both on [a, c] and on [c, b]. In this case,
b
c
b
f = f+ f.
a a c
f(x)
a c b x
Proof
First assume f to be integrable on [a, b]. Let us choose for example the first sub-interval,
[a, c], and prove that f is integrable on it, using the Cauchy criterion. Fix ε > 0 ; being f
integrable on [a, b], it verifies the Cauchy condition on [a, b], and hence there is a gauge δ
on [a, b] such that
≤ ε,
|S([a, b], f, ) − S([a, b], f, )|
for every two δ-fine P-partitions , of [a, b]. The restrictions of δ to [a, c] and [c, b] are
two gauges δ1 and δ2 on the respective sub-intervals. Let now 1 and 1 be two δ1 -fine
P-partitions of [a, c]. Let us fix a δ2 -fine P-partition 2 of [c, b] and consider the P-partition
of [a, b] made by 1 ∪ 2 , and the P-partition of [a, b] made by 1 ∪ 2 . It is clear
that both and are δ-fine. Moreover, we have
Let now
be a δ-fine P-partition of [a, b]. Notice that, because of the particular choice of the gauge δ,
there must be a certain j¯ for which xj¯ = c. Hence, we have
Let us set
1 = {(x1 , [a0 , a1 ]), (x2 , [a1 , a2 ]), . . . , (xj¯−1 , [aj¯−2 , aj¯−1 ]), (c, [aj¯−1 , c])}
and
2 = {(c, [c, aj¯ ]), (xj¯+1 , [aj¯ , aj¯+1 ]), . . . , (xm , [am−1 , am ])}
(but in case aj¯−1 or aj¯ coincide with c we will have to take away an element). Then 1 is a
δ1 -fine P-partition of [a, c] and 2 is a δ2 -fine P-partition of [c, b], and we have
1
Being f constant on [0, 1], it is integrable there, and 0 f = 2. Moreover, on the interval
[1, 2] the function f differs from a constant only in one point: we have that f (x) − 3 is
zero except for x = 1. For what have been proved somewhat before, the function f − 3 is
integrable on [1, 2] with zero integral and so, being f = (f − 3) + 3, even f is integrable
2
and 1 f = 3. In conclusion,
2
1
2
f (x) dx = f (x) dx + f (x) dx = 2 + 3 = 5 .
0 0 1
It is easy to see from the theorem just proved above that if a function is integrable
on an interval I, it still is on any sub-interval of I. Moreover, we have the following
Corollary 1.21
If f : I → R is integrable, for any three arbitrarily chosen points u, v, w in I one has
w
v
w
f = f+ f.
u u v
Proof
The case u < v < w follows immediately from the previous theorem. The other possible
cases are easily obtained using the conventions on the integrals with exchanged or equal
extrema.
28 Chapter 1 • Functions of One Real Variable
1
1.10 R-Integrable Functions and Continuous Functions
Definition 1.22
We say that an integrable function f : I → R is R-integrable (or integrable
according to Riemann), if among all possible gauges δ : I → R which verify the
definition of integrability it is always possible to choose one which is constant on I.
It is immediate to see, repeating the proofs, that the set of R-integrable functions is
a vector subspace of the space of integrable functions. Moreover, the following Cauchy
criterion holds for R-integrable functions, whenever one considers only constant gauges.
Theorem 1.23
A function f : I → R is R-integrable if and only if for every ε > 0 there is a constant
of I , one has
δ > 0 such that, taken two δ-fine P-partitions ,
≤ ε.
|S(I, f, ) − S(I, f, )|
μ([a, b]) = b − a .
It will be useful, moreover, to make the convention that the length of the empty set is
equal to zero.
Theorem 1.24
Every continuous function f : I → R is R-integrable.
Proof
Fix ε > 0. Being f continuous on a compact interval, it is uniformly continuous there, so
that there is a δ > 0 such that, for x and x in I,
ε
|x − x | ≤ 2δ ⇒ |f (x) − f (x )| ≤ .
b−a
1.10 · R-Integrable Functions and Continuous Functions
29 1
We will verify the Cauchy condition for the R-integrability by considering the constant
gauge δ. Let
and
be two δ-fine P-partitions of I. Let us define the intervals (perhaps empty or reduced to a
single point)
Then, we have
m
m
aj − aj −1 = μ(Ij,k ) , ãk − ãk−1 = μ(Ij,k ) ,
k=1 j =1
m
m
≤ |f (xj ) − f (x̃k )| μ(Ij,k )
j =1 k=1
m
m
ε
≤ μ(Ij,k ) = ε .
b−a
j =1 k=1
Therefore, the Cauchy condition holds true, and the proof is completed.
Theorem 1.25
Every continuous function f : [a, b] → R is primitivable.
30 Chapter 1 • Functions of One Real Variable
1
Proof
Being continuous, f is integrable on every sub-interval of [a, b], so that we can consider the
· ·
function a f, indefinite integral of f. Let us prove that a f is a primitive of f, i.e., that
·
taken a point x0 in [a, b], the derivative of a f in x0 coincides with f (x0 ). We first consider
the case when x0 ∈ ]a, b[ . We want to prove that
x0 +h
x0
1
lim f− f = f (x0 ) .
h→0 h a a
f (x )
a x0 x0 + h b x
Equivalently, since
x0 +h
x0
x0 +h
1 1
f− f − f (x0 ) = (f (x) − f (x0 )) dx ,
h a a h x0
Fix ε > 0. Being f continuous in x0 , there is a δ > 0 such that, for every x ∈ [a, b] satisfying
|x − x0 | ≤ δ, one has |f (x) − f (x0 )| ≤ ε. Taking h such that 0 < |h| ≤ δ, we distinguish
two cases. If 0 < h ≤ δ, then
x0 +h
1 1 x0 +h 1 x0 +h
(f (x) − f (x0 )) dx ≤ |f (x) − f (x0 )| dx ≤ ε dx = ε ;
h h x0 h x0
x0
m
=
S([a, b], f, ) − S([a, b], f, ) [f (xj ) − f (x̃j )](aj − aj −1 )
j =1
m
= (aj − aj −1 ) = b − a .
j =1
Since δ > 0 has been taken arbitrarily, the Cauchy condition for R-integrability does not
hold, so that f cannot be R-integrable on [a, b].
Exercises
1. Recalling that
2 tan θ 1 − tan2 θ
sin(2θ) = , cos(2θ) = ,
1 + tan2 θ 1 + tan2 θ
Alternatively,
1 1 1 + sin x
dx = ln +c.
cos x 2 1 − sin x
3 By “elementary formula” I mean here an analytic formula where only polynomials, exponentials, logarithms
and trigonometric functions appear.
32 Chapter 1 • Functions of One Real Variable
1
2. Show that, if f : [0, 1] → R is defined as
1 if x ∈
/ Q,
f (x) =
3 if x ∈ Q ,
1
then 0 f (x) dx = 1.
c
3. Let f : R → R be continuous and odd. Then, −c f = 0, for every c ∈ R.
4. Prove that, if f : [a, b] → R is monotone, then it is R-integrable.
5. Show that every R-integrable function f : [a, b] → R is necessarily bounded.
m
m
aj
S(I, f, ) = f (xj )(aj − aj −1 ) , f = f,
j =1 I j =1 aj −1
we have
m
a j
f (xj )(aj − aj −1 ) − f ≤ ε .
j =1 aj −1
a
This fact tells us that the sum of all “errors” (f (xj )(aj − aj −1 ) − ajj−1 f ) is arbitrarily
small, provided that the P-partition be sufficiently fine. Notice that those “errors” may
be either positive or negative, so that in the sum they could compensate with one another.
The following Saks–Henstock’s theorem tells us that even the sum of all absolute
values of those “errors” can be made arbitrarily small.
1.11 · The Saks–Henstock Theorem
33 1
Theorem 1.26
Let f : I → R be an integrable function and let δ be a gauge on I such that, for
every δ-fine P-partition of I, it happens that |S(I, f, ) − I f | ≤ ε. Then, for such
P-partitions = {(x1 , [a0 , a1 ]), . . . , (xm , [am−1 , am ])} we also have
m
aj
f (xj )(aj − aj −1 ) − f ≤ 4ε .
aj −1
j =1
Proof
We consider separately in the sum the positive and the negative terms. Let us prove that the
sum of the positive terms is less than or equal to 2ε. In an analogous way one can proceed for
the negative terms. Rearranging the terms in the sum, we can assume that the positive ones
a
be the first q terms (f (xj )(aj − aj −1 ) − ajj−1 f ), with j = 1, . . . , q, i.e.,
a1
aq
f (x1 )(a1 − a0 ) − f , . . . , f (xq )(aq − aq−1 ) − f.
a0 aq−1
Being f integrable on these intervals, there exist some gauges δk on [ak−1 , ak ], respectively,
which we can choose such that δk (x) ≤ δ(x) for every x ∈ [ak−1 , ak ], for which
ak ε
S([ak−1 , ak ], f, k ) − f ≤ ,
m − q
ak−1
for every δk -fine P-partition k of [ak−1 , ak ]. Consequently, the family made by the
couples (x1 , [a0 , a1 ]), . . . , (xq , [aq−1 , aq ]) and by the elements of the families k , with k
varying from q + 1 to m, is a δ-fine P-partition of I such that
q
m
=
S(I, f, ) f (xj )(aj − aj −1 ) + S([ak−1 , ak ], f, k ) .
j =1 k=q+1
Then, we have:
q
aj
q q
aj
f (xj )(aj − aj −1 ) − f = f (xj )(aj − aj −1 ) − f
j =1 aj −1 j =1 j =1 aj −1
m
m
ak
−
= S(I, f, ) S([ak−1 , ak ], f, k ) − f− f
k=q+1 I k=q+1 ak−1
34 Chapter 1 • Functions of One Real Variable
1
m
ak
≤ S(I, f,
) − f +
S([ak−1 , ak ], f, k ) −
f
I k=q+1 ak−1
ε
≤ ε + (m − q) = 2ε .
m−q
The following corollary will be useful in the next section to study the integrability
of the absolute value of an integrable function.
Corollary 1.27
Let f : I → R be an integrable function and let δ be a gauge on I such that, for
every δ-fine P-partition of I, it happens that |S(I, f, ) − I f | ≤ ε. Then, for such
P-partitions = {(x1 , [a0 , a1 ]), . . . , (xm , [am−1 , am ])} we also have
m
aj
S(I, |f |, ) − f ≤ 4ε .
aj −1
j =1
Proof
Using the well known inequalities for the absolute value, we have:
m
m
aj aj
S(I, |f |, ) − f = |f (xj )|(aj − aj −1 ) − f
aj −1 aj −1
j =1 j =1
m
aj
≤ |f (xj )(aj − aj −1 )| − f
aj −1
j =1
m
aj
≤ f (xj )(aj − aj −1 ) − f ≤ 4ε .
aj −1
j =1
In the sequel it will be useful to consider even the so-called sub-P-partitions of the
interval I. A sub-P-partition is a set of the type
= {(ξj , [αj , βj ]) : j = 1, . . . , m},
where the intervals [αj , βj ] are non-overlapping, but not necessarily contiguous, and
ξj ∈ [αj , βj ] for every j = 1, . . . , m. Using the Cousin’s theorem, it is easy to see that
every sub-P-partition can be extended to a P-partition of I.
1.12 · L-Integrable Functions
35 1
For a sub-P-partition
, it is still meaningful to consider the associated Riemann
sum:
m
S(I, f,
) = f (ξj )(βj − αj ) .
j =1
Moreover, given a gauge δ on I, the sub-P-partition is δ-fine if, for every j, one has
ξj − αj ≤ δ(ξj ) e βj − ξj ≤ δ(ξj ) .
Theorem 1.28
Let f : I → R be an integrable function and let δ be a gauge on I such that, for every
δ-fine P-partition of I, it happens that |S(I, f, ) − I f | ≤ ε. Then, for every
δ-fine sub-P-partition
= {(ξj , [αj , βj ]) : j = 1, . . . , m} of I, we have
m
βj
f (ξj )(βj − αj ) − f ≤ 4ε .
αj
j =1
Definition 1.29
We say that an integrable function f : I → R is L-integrable (or integrable
according to Lebesgue), if even |f | happens to be integrable on I.
Proposition 1.30
Let f : I → R be an integrable function, and consider the set S of all real numbers
q
ci
f ,
i=1 ci−1
Proof
Assume first f to be L-integrable on I . If a = c0 < c1 < · · · < cq = b, then f and |f | are
integrable on every sub-interval [ci−1 , ci ], and we have
q
q
ci ci
f ≤ |f | = |f | .
i=1 ci−1 i=1 ci−1 I
Consequently, the set S is bounded above: sup S ≤ I |f |.
On the other hand, assume now S to be bounded above and let us prove that in that case
|f | is integrable on I and I |f | = sup S . Fix ε > 0. Let δ1 be a gauge such that, for every
δ1 -fine P-partition of I, one has
ε
S(I, f, ) − f ≤ .
I 8
Letting A = sup S , by the properties of the sup there surely are a = c0 < c1 < · · · < cq = b
such that
q
ε ci
A− ≤ f ≤ A.
2 ci−1
i=1
We construct the gauge δ2 in such a way that, for every x ∈ I, it has to be that [x −δ2 (x), x +
δ2 (x)] meets only those intervals [ci−1 , ci ] to which x belongs. In this way,
▬ if x belongs to the interior of one of the intervals [ci−1 , ci ], we have that [x − δ2 (x), x +
δ2 (x)] is contained in ]ci−1 , ci [ ;
▬ if x coincides with one of the ci in the interior of [a, b], then [x − δ2 (x), x + δ2 (x)] is
contained in ]ci−1 , ci+1 [ ;
▬ if x = a, then [x, x + δ2 (x)] is contained in [a, c1 [ ;
▬ if x = b, then [x − δ2 (x), x] is contained in ]cq−1 , b].
1.12 · L-Integrable Functions
37 1
Set, for every x ∈ I, δ(x) = min{δ1 (x), δ2 (x)}. Once taken a δ-fine P-partition =
{(x1 , [a0 , a1 ]), . . . , (xm , [am−1 , am ])} of I, consider the intervals (possibly empty or reduced
to a point)
The choice of the gauge δ2 yields that, if Ij,i has a positive length, then xj ∈ Ij,i . Indeed, if
xj
∈ [ci−1 , ci ], then
m
m
q
S(I, |f |, ) = |f (xj )|(aj − aj −1 ) = ) .
|f (xj )|μ(Ij,i ) = S(I, |f |,
j =1 j =1 i=1
Moreover,
q
q m
m
ε ci
q
A− ≤ f = f ≤ f ≤ A,
Ij,i
2
i=1 ci−1
i=1 j =1 I j,i i=1 j =1
Consequently, we have
− A|
|S(I, |f |, ) − A| = |S(I, |f |, )
q m
q m
≤ S(I, |f |, )
− f
+ f − A
i=1 j =1 Ij,i
i=1 j =1 Ij,i
ε ε
≤ + = ε,
2 2
Corollary 1.31
Let f, g : I → R be two integrable functions such that, for every x ∈ I,
|f (x)| ≤ g(x) ;
then f is L-integrable on I.
Proof
Take c0 , c1 , . . . , cq in I so that a = c0 < c1 < · · · < cq = b. Being −g(x) ≤ f (x) ≤ g(x)
for every x ∈ I, we have that
ci
ci
ci
− g≤ f ≤ g,
ci−1 ci−1 ci−1
i.e.
ci ci
f ≤ g,
ci−1 ci−1
q
q
ci ci
f ≤ g= g.
i=1 ci−1 i=1 ci−1 I
Then, the set S is bounded above by I g, so that f is L-integrable on I .
Corollary 1.32
Let f, g : I → R be two L-integrable functions and α ∈ R be a constant. Then f + g
and αf are L-integrable on I.
Proof
By the assumption, f, |f | and g, |g| are integrable on I . Then, such are also f + g, |f | + |g|,
αf, and |α||f |. On the other hand, for every x ∈ I, it is
Corollary 1.33
Let f1 , f2 : I → R be two L-integrable functions. Then min{f1 , f2 } and max{f1 , f2 }
are L-integrable on I .
Proof
It follows immediately from the formulas
1
min{f1 , f2 } = (f1 + f2 − |f1 − f2 |) ,
2
1
max{f1 , f2 } = (f1 + f2 + |f1 − f2 |)
2
Corollary 1.34
A function f : I → R is L-integrable if and only if both its positive part f + =
max{f, 0} and its negative part f − = max{−f, 0} are integrable on I . In that case,
+ −
I f = I f − I f .
Proof
It follows immediately from Corollary 1.33 and the formulas f = f + −f − , |f | = f + +f − .
x2 1
G(x) = cos 2 ,
2 x
if x
= 0, and G(0) = 0. Moreover, h is continuous on [0, 1], hence it is primitivable
there, too. Hence, even the function f = g + h is primitivable on [0, 1]. By the
Fundamental Theorem, f is then integrable on [0, 1]. We will show now that |f | is
not integrable on [0, 1]. Consider the intervals [((k + 1)π)−1/2 , (kπ)−1/2 ], with k ≥ 1.
The function |f | is continuous on these intervals, hence it is primitivable there. By the
substitution y = 1/x 2 , we obtain
(kπ )−1/2
(k+1)π
1 1 1
sin 2 dx = | sin y| dy .
((k+1)π )−1/2 x x kπ 2y
n
(kπ )−1/2
≥ |f |
k=1 ((k+1)π )−1/2
n
1
≥ ,
k=1
(k + 1)π
∞ 1
which is impossible, since the series k=1 (k+1)π diverges. Hence, f is not L-integrable
on [0, 1].
In this section and in the next one, we will consider the situation where a sequence of
integrable functions (fk )k converges pointwise to a function f : for every x ∈ I,
Example Let us first show that in some cases the answer could be in the negative. Consider
the functions fk : [0, π] → R, with k = 1, 2, . . . , defined by
k sin(kx) if x ∈ [0, πk ] ,
fk (x) =
0 otherwise.
We will see now that the formula holds true if the sequence of functions is monotone,
or bounded in some way. Let us start with the following result, known as the Monotone
Convergence Theorem, due to B. Levi.
Theorem 1.35
We are given a function f : I → R and a sequence of functions fk : I → R, with
k ∈ N, verifying the following conditions:
1. the sequence (fk )k converges pointwise to f ;
2. the sequence (fk )k is monotone;
3. each function fk is integrable on I ;
4. the real sequence ( I fk )k has a finite limit.
Proof
We assume for definiteness the sequence (fk )k to be increasing on I ; therefore, we have
We will prove that f is integrable on I and that A is its integral. Fix ε > 0. Being every fk
integrable on I, there are some gauges δk∗ on I such that, if k is a δk∗ -fine P-partition of I,
ε
S(I, fk , k ) − fk ≤
3 · 2k+3 .
I
and since the sequence (fk )k converges pointwise on I to f, for every x ∈ I there is a natural
number n(x) ≥ k̄, such that, for every k ≥ n(x), one has
ε
|fk (x) − f (x)| ≤ .
3(b − a)
∗
δ(x) = δn(x) (x) .
Let now = {(x1 , [a0 , a1 ]), . . . , (xm , [am−1 , am ])} be a δ-fine P-partition of I. We have:
m
|S(I, f, ) −A| = f (xj )(aj − aj −1 ) − A
j =1
m
≤ [f (xj ) − fn(xj ) (xj )](aj − aj −1 ) +
j =1
m
aj
m aj
+ fn(xj ) (xj )(aj − aj −1 ) −
fn(xj ) + fn(xj ) − A .
j =1 aj −1 j =1 aj −1
m
ε ε
≤ (aj − aj −1 ) = .
3(b − a) 3
j =1
1.13 · The Monotone Convergence Theorem
43 1
In order to estimate the second term, set
and notice that, putting together the terms whose indices n(xj ) coincide with a same value k,
by the second statement of Saks–Henstock’s theorem (Theorem 1.28) we obtain
aj
m
fn(xj ) (xj )(aj − aj −1 ) − fn(xj ) =
j =1 aj −1
⎧ ⎫
aj
s ⎨ ⎬
= fk (xj )(aj − aj −1 ) − fk
k=r ⎩{1≤j ≤m : n(xj )=k} aj −1 ⎭
aj
s
≤ fk (xj )(aj − aj −1 ) − fk
aj −1
k=r {1≤j ≤m : n(xj )=k}
s
ε ε
≤ 4 ≤ .
3 · 2k+3 3
k=r
Concerning the third term, since r ≥ k̄, using the monotonicity of the sequence (fk )k we
have
m
aj
0≤ A− fs = A − fs ≤
I j =1 aj −1
m
aj
≤A− fn(xj ) ≤
j =1 aj −1
m
aj
ε
≤A− fr = A − fr ≤ ,
3
j =1 aj −1 I
from which
m
a
j ε
f − A ≤ .
n(xj ) 3
j =1 aj −1
Hence,
ε ε ε
|S(I, f, ) − A| ≤ + + = ε,
3 3 3
Corollary 1.36
We are given a function f : I → R and a sequence of functions fk : I → R, with
k ∈ N, verifying the following conditions:
1. the series k fk converges pointwise to f ;
2. for every k ∈ N and every x ∈ I, it is fk (x) ≥ 0;
3. each function fk is integrable on I ;
4. the series k ( I fk ) converges.
5. Then, f is integrable on I , and
∞
f = fk .
I k=0 I
2
Example Consider the Taylor series associated to the function f (x) = ex ,
∞
2 x 2k
ex = .
k!
k=0
2k
The functions fk (x) = xk! satisfy the assumptions 1 and 2 of the corollary. Moreover, they
are integrable on I = [a, b] and
b
b x 2k+1 b2k+1 − a 2k+1
fk (x) dx = = ,
a (2k + 1)k! a (2k + 1)k!
so that it can be seen that the series k ( I fk ) converges. It is then possible to apply the
corollary, thus obtaining
∞ 2k+1
b 2 b − a 2k+1
ex dx = .
a (2k + 1)k!
k=0
·
In particular, considering the indefinite integral 0 f, we find an expression for the primitives
2
of ex , i.e.,
∞
2 x 2k+1
ex dx = +c.
(2k + 1)k!
k=0
1.14 · The Dominated Convergence Theorem
45 1
1.14 The Dominated Convergence Theorem
Lemma 1.37
Let f1 , f2 , . . . , fn : I → R be integrable functions. If there exists an integrable
function g : I → R such that, for every x ∈ I and 1 ≤ k ≤ n it happens that
g(x) ≤ fk (x) ,
Proof
Consider the case n = 2. The functions f1 −g and f2 −g, being integrable and non-negative,
are L-integrable. Hence, min{f1 − g, f2 − g} and max{f1 − g, f2 − g} are L-integrable, by
Corollary 1.33. The conclusion then follows from the fact that
min{f1 , f2 } = min{f1 − g, f2 − g} + g ,
max{f1 , f2 } = max{f1 − g, f2 − g} + g .
We are now ready to state and prove the following important result due to H.
Lebesgue, known as the Dominated Convergence Theorem.
Theorem 1.38
We are given a function f : I → R and a sequence of functions fk : I → R, with
k ∈ N, verifying the following conditions:
1. the sequence (fk )k converges pointwise to f ;
2. each function fk is integrable on I ;
3. there are two integrable functions g, h : I → R for which
By the above proved lemma, all φk, and k, are integrable on I . Moreover, for any fixed
k, the sequence (φk, ) is decreasing and bounded below by g, and the sequence (k, ) is
increasing and bounded above by h. hence, these sequences converge to two functions φk
and k , respectively:
3
k2 π π π
lim arctan kx − dx = − + 2 = .
k→∞ 0 k+1 2 2 2
Exercises
1. By the use of the Dominated Convergence Theorem, prove that
1
√1 x
sin(ekx )
lim √ dx = 0 , lim k arctan dx = 0 .
k→∞ 0 k k→∞ −1 k
Prove that
1
lim fk (x) dx = 0 ,
k→∞ 0
Explain why, in this case, neither the Monotone nor the Dominated Convergence
Theorems can be applied.
3. Compute the following limit:
π x x
lim k 3 sin − dx .
k→∞ 0 k k
exists and is finite. In that case, the above limit is called the integral of f on [a, b[
b b
and it is denoted by a f , or a f (x) dx.
+∞ +∞
In particular, if b = +∞, we will write: a f , or a f (x) dx.
Examples Let a > 0; it is readily seen that the function f : [a, +∞[ → R, defined by
f (x) = x −α , is integrable if and only if α > 1, in which case we have
+∞ dx a 1−α
= .
a x α α−1
Consider now the case a < b < +∞. It can be verified that the function f : [a, b[ → R,
defined by f (x) = (b − x)−β , is integrable if and only if β < 1, in which case we have
b dx (b − a)1−β
= .
a (b − x) β 1−β
a a b
One often speaks of improper integral when considering functions which are
defined on non-compact intervals. One also says that the integral c converges if the
function f is integrable on [a, b[ , i.e., when the limit limc→b− a f exists and is finite.
If the limit does not exist, it is said that the integral is undetermined. If it exists and
equals +∞ or −∞, it is said that the integral diverges to +∞ or to −∞, respectively.
It is clear that the convergence of the integral depends solely on the behavior of the
function near the point b. In other words, modifying the function outside a neighborhood
of b, the convergence of the integral is by no means compromised.
Let us now state the Cauchy convergence criterion.
1.15 · Integration on Non-Compact Intervals
49 1
Theorem 1.40
Let f : [a, b[ → R be a function, which is integrable on [a, c], for every c ∈ ]a, b[ .
A necessary and sufficient condition for f to be integrable on [a, b[ is that for every
ε > 0 there is a c̄ ∈ ]a, b[ such that, taking as c and c any two numbers in [c̄, b[ ,
it is
c
f ≤ ε .
c
Proof
It is a direct consequence of the Cauchy criterion for the limit, when applied to the function
c
F : [a, b[ → R defined as F (c) = a f .
Theorem 1.41
Let f : [a, b[ → R be a function, which is integrable on [a, c], for every c ∈ ]a, b[ . If
there is an integrable function g : [a, b[ → R such that, for every x ∈ [a, b[ ,
|f (x)| ≤ g(x) ,
Proof
Once fixed ε > 0, there is a c̄ ∈ ]a, b[ such that, taking arbitrarily c , c in [c̄, b[ , it is
c
| c g| ≤ ε. If for example c ≤ c , being −g ≤ f ≤ g, one has
c
c
c
− g≤ f ≤ g,
c c c
and therefore
c c
f ≤ g ≤ ε.
c c
Corollary 1.42
Let f : [a, b[ → R be a function, which is integrable on [a, c], for every c ∈ ]a, b[ . If
|f | is integrable on [a, b[ , then also f is such, and
b b
f ≤ |f | .
a a
In the case when both f and |f | are integrable on [a, b[ , we say that f is L-
integrable, or absolutely integrable, on [a, b[ .
Example Consider the function f : [π, +∞[ → R defined by f (x) = sinx x . We will see that
it is integrable on [π, +∞[ , but not absolutely integrable there. To see that it is integrable,
take c > π and integrate by parts:
c
c sin x − cos x c cos x
dx = − dx .
π x x π π x2
We find that
c
c
1 cos x
lim f =− − lim dx ,
c→+∞ π π c→+∞ π x2
n−1
(k+1)π
1
≥ | sin x| dx
(k + 1)π kπ
k=1
2 1
n−1
= .
π k+1
k=1
∞ 1
but this is impossible, since the series k=1 k+1 diverges.
Let us now state a corollary of the comparison criterion which is often used in
practice.
1.15 · Integration on Non-Compact Intervals
51 1
Corollary 1.43
Let f, g : [a, b[ → R be two functions with positive values, which are integrable on
[a, c] for every c ∈ ]a, b[ . Assume that the following limit exists:
f (x)
L = lim .
x→b− g(x)
Then, the following conclusions hold:
a) if L = 0 and g is integrable on [a, b[ , then such is f, as well;
b) if 0 < L < +∞, then f is integrable on [a, b[ if and only if such is g;
c) if L = +∞ and g is not integrable on [a, b[ , then neither f is such.
x 2 e1/(x +1) − 1
2
f (x)
= lim x 2 (e1/(x +1) − 1) = lim 2
2
lim = 1.
x→+∞ g(x) x→+∞ x→+∞ x + 1 1/(x 2 + 1)
Definition 1.44
We say that a function f : ]a, b] → R is integrable if f is integrable on [c, b] for
every c ∈ ]a, b[ , and the limit
b
lim f
c→a + c
(Continued )
52 Chapter 1 • Functions of One Real Variable
1
Definition 1.44 (continued)
exists and is finite. In that case, the above limit is called the integral of f on ]a, b]
b b
and it is denoted by a f , or a f (x) dx.
Definition 1.45
We say that f : ]a, b[ → R is integrable if, once we fix a point p ∈ ]a, b[ , the
function f is integrable on [p, b[ and on ]a, p]. In that case, the integral of f on
]a, b[ is defined by
b
p
b
f = f+ f.
a a p
It is easy to verify that the given definition does not depend on the choice of p ∈
]a, b[ .
is integrable on ]a, b[ if and only if β < 1. In this case, it is possible to choose, for instance,
p = (a + b)/2. Another case arises when a = −∞ and b = +∞. For example, one easily
verifies that the function f (x) = (x 2 +1)−1 is integrable on ]−∞, +∞[ . Taking for instance
p = 0, we have:
+∞
0
+∞
1 1 1
dx = dx + dx = π .
−∞ x2 + 1 −∞ x2 + 1 0 x2 + 1
A further case one could face in the applications is when a function happens not to
be defined in an interior point of an interval.
1.16 · The Hake Theorem
53 1
Definition 1.46
Given a < q < b, we say that a function f : [a, b] \ {q} → R is integrable if f is
both integrable on [a, q[ and on ]q, b]. In that case, we set
b
q
b
f = f+ f.
a a q
√
For example, if a < 0 < b, the function f (x) = |x|/x is integrable on [a, b]\{0},
and
√
b
|x| 0
−1 b
1 √ √
dx = √ dx + √ dx = 2 b − 2 −a .
a x a −x 0 x
On the other hand, the function f (x) = 1/x is not integrable on [a, b] \ {0}, even if
the fact that f is odd could lead someone to define the integral on symmetric intervals
with respect to the origin as being equal to zero. However, by doing so, some important
properties of the integral would be lost, as for example the additivity on sub-intervals.
Different situations can be faced combining together those treated above. I prefer
not to go deeper into these details; in each single case, the choice of the appropriate
method will be made by the right guess.
exists and is finite. We want to prove the following result due to H. Hake.
Theorem 1.47
Let b < +∞, and assume f : [a, b[ → R to be a function which is integrable on
[a, c], for every c ∈ ]a, b[ . Then, the function f is integrable on [a, b[ if and only if it
is the restriction of an integrable function f¯ : [a, b] → R. In that case,
b
b
f¯ = f.
a a
54 Chapter 1 • Functions of One Real Variable
1
Proof
Assume first that f be the restriction of an integrable function f¯ : [a, b] → R. Fix ε > 0;
we want to find a γ > 0 such that, if c ∈ ]a, b[ and b − c ≤ γ , then
c b
f− f¯ ≤ ε .
a a
b
Let δ be a gauge such that, for every δ-fine P-partition of [a, b], it is |S(I, f¯, )− a f¯| ≤ ε8 .
We choose a positive constant γ ≤ δ(b) such that γ |f¯(b)| ≤ ε2 . If c ∈ ]a, b[ and b − c ≤ γ ,
by the Saks–Henstock theorem, taking the δ-fine sub-P-partition = {(b, [c, b])}, we have
b ε ε
f¯(b)(b − c) − f¯ ≤ 4 = ,
8 2
c
and hence
c b b ε ε ε ε
f− f¯ = f¯ ≤ + |f¯(b)(b − c)| ≤ + |f¯(b)|γ ≤ + = ε .
2 2 2 2
a a c
Let us prove now the other implication. Assume f to be integrable on [a, b[ , and let A
be its integral. We extend f to a function f¯ defined on the whole interval [a, b], by setting,
for instance, f¯(b) = 0. In order to prove that f¯ is integrable on [a, b] with integral A, fix
ε > 0. There is a γ > 0 such that, if c ∈ ]a, b[ and b − c ≤ γ , then
c ε
f − A ≤ .
2
a
b−a
ci = b − .
i +1
To better clarify what we just said, assume for example that q = 2; then there must be a j¯1
for which xj¯1 = c1 , and a j¯2 for which xj¯2 = c2 . Then,
S([a, b], f¯, ) = [f (x1 )(a1 − a) +. . .+ f (xj¯1 −1 )(aj¯1 −1 − aj¯1 −2 ) + f (c1 )(c1 − aj¯1 −1 )]
q
S([a, b], f¯, ) = S([ci−1 , ci ], f, i ) + S([cq , am−1 ], f, q+1 ) +
i=1
am−1 q
ci
am−1
f = f+ f,
a i=1 ci−1 cq
The above theorem suggests that even for a function of the type f : [a, +∞[ → R
the definition of the improper integral could be reduced to that of a usual integral.
Indeed, fixing arbitrarily b > a, we could define a continuously differentiable strictly
increasing function ϕ : [a, b[ → R such that ϕ(a) = a and limu→b− ϕ(u) = +∞; for
example, take ϕ(u) = a + ln b−a
b−u
. A formal change of variables then gives
+∞
b
f (x) dx = f (ϕ(u))ϕ (u) du ,
a a
xj − δ(xj ) ≤ aj −1 ≤ xj ≤ aj ≤ xj + δ(xj ) ,
then
m−1
f (xj )(aj − aj −1 ) − A ≤ ε .
j =1
We refer to the book of Bartle [1] for a complete treatment of this case.
Needless to say, similar considerations can be made in the case when the function f
is defined on an interval of the type ]a, b], with −∞ ≤ a.
We now prove a theorem which shows the close connection between the theory of
numerical series and that of the improper integral.
1.17 · Integrals and Series
57 1
Theorem 1.48
Let f : [1, +∞[ → R be a function which is positive, decreasing and integrable on
[1, c], for every c > 1. Then f is integrable on [1, +∞[ if and only if the series
∞
k=1 f (k) converges. Moreover, we have
∞
+∞ ∞
f (k) ≤ f ≤ f (k) .
k=2 1 k=1
Proof
For x ∈ [k, k + 1], it has to be f (k + 1) ≤ f (x) ≤ f (k). Hence,
k+1
f (k + 1) ≤ f ≤ f (k) .
k
n
n+1
n
f (k + 1) ≤ f ≤ f (k) .
k=1 1 k=1
c
Being f positive, the sequence ( nk=1 f (k))n and the function c → 1 f are both increasing
and therefore have a limit. The conclusion now follows from the comparison theorem for
limits.
1 2 3 4 5
Observations It is clear that the choice a = 1 in the theorem just proved is by no way
necessary. Notice moreover that this theorem is often used to determine the convergence
of a series, giving the estimate
+∞ ∞
+∞
f ≤ f (k) ≤ f (1) + f.
1 k=1 1
58 Chapter 1 • Functions of One Real Variable
1
∞
Example Consider the series k=1 k −3 ; in this case:
+∞ 1∞
+∞
1 1
3
dx ≤ ≤ 1+ dx ,
1 x k3 1 x3
k=1
and then
∞
1 1 3
≤ 3
≤ .
2 k 2
k=1
A greater accuracy is easily attained by computing the sum of a few first terms and then using
the estimate given by the integral. For example, separating the first two terms, we have:
∞
∞
1 1 1
3
=1+ + ,
k 8 k3
k=1 k=3
with
+∞ 1∞
+∞
1 1 1
3
dx ≤ ≤ + dx .
3 x k3 27 3 x3
k=3
∞
255 1 263
≤ ≤ .
216 k3 216
k=1
Exercises
1. Establish whether the following improper integrals converge:
+∞ #
x
dx
0 x3 + 1
+∞ 1
√ √ dx
0 x (1 + x + x)
converge?
1.17 · Integrals and Series
59 1
3. The following improper integrals converge?
2
1
ln x 1
√ dx , dx
0 x cos x 0 x ln x
∞
∞
1 1
,
k ln k k ln k ln(ln k)
k=1 k=1
61 2
Alessandro Fonda
In this chapter we extend the theory developed in the previous one to functions of several
variables defined on subsets of RN, with values in R. In order to simplify the exposition,
we will often concentrate on the case N = 2. It will not be so difficult for the reader to
extend the various results to the case of a generic dimension N.
We say that two rectangles are non-overlapping if their interiors are disjoint.
A P-partition of the rectangle I is a set
= {(x 1 , I1 ), (x 2 , I2 ), . . . , (x m , Im )} ,
where the Ij are non-overlapping rectangles whose union is I and, for every j =
1, . . . , m, the point x j = (xj , yj ) belongs to Ij .
62 Chapter 2 • Functions of Several Real Variables
((5, 4), [3, 10] × [4, 6]), ((10, 0), [7, 10] × [0, 4]), ((5, 3), [3, 7] × [2, 4])} .
x2 I3
x3
I2
I5 x5
I4
x1 I1
x4
Let us now consider a function f defined on the rectangle I, with values in R, and let
= {(x j , Ij ) : j = 1, . . . , m} be a P-partition of I. We call Riemann sum associated
to I, f and the real number S(I, f, ) defined by
m
S(I, f, ) = f (x j )μ(Ij ) .
j =1
Whenever f happens to be positive, this number is the sum of the volumes of the
parallelepipeds having as base Ij and height [0, f (x j )].
x3
x2
x1
We now introduce the notion of fineness for the P-partition defined above. We
call gauge on I every function δ : I → R such that δ(x) > 0 for every x ∈ I. Given
2.1 · Integrability on Rectangles
63 2
a gauge δ on I, we say that the P-partition introduced above is δ-fine if, for every
j = 1, . . . , m,
In the following, given x = (x, y) ∈ I and r > 0, in order to shorten the notations we
will write
B[x, r] = [x − r, x + r] × [y − r, y + r] ;
Ij ⊆ B[x j , δ(x j )] .
We want to find a δ-fine P-partition of I. Similarly as was seen in the case N = 1 we have
in this case that one of the points x j necessarily has to coincide with (0, 0). We can then
choose, for example,
1 1 1 1 1
= (0, 0), 0, × 0, , , 1 , 0, × ,1 ,
2 2 2 2 2
1 1 1 1 1
1, , , 1 × 0, , (1, 1), , 1 × ,1 .
2 2 2 2 2
As in the one-dimensional case, one can prove that for every gauge δ on I there
exists a δ-fine P-partition of I (Cousin’s Theorem). The following definition is identical
to the one seen in Chapter 1.
64 Chapter 2 • Functions of Several Real Variables
Definition 2.1
2 A function f : I → R is said to be integrable (on the rectangle I ) if there is a real
number A with the following property: given ε > 0, it is possible to find a gauge δ
on I such that, for every δ-fine P-partition of I, it is
|S(I, f, ) − A| ≤ ε .
We briefly overview all the properties which can be obtained from the given
definition in the same way as was done in the case of a function of a single variable.
There is at most one A ∈ R which verifies the conditions of the definition. Such a
number is called the integral of f on I and is denoted by one of the following symbols:
f, f (x) dx , f (x, y) dx dy .
I I I
The set of integrable functions is a vector space, and the integral is a linear function on
it:
(f + g) = f + g , (αf ) = α f
I I I I I
Theorem 2.2
Let f : I → R be a function and K1 , K2 , . . . , Kl be non-overlapping sub-rectangles
of I whose union is I. Then, f is integrable on I if and only if it is integrable on each
of the Ki . In that case, we have
l
f = f.
I i=1 Ki
Exercises
1. Let f : [a, b] × [c, d] → R be defined as f (x, y) = x. Prove that f is integrable and
compute
f (x, y) dx dy .
[a,b]×[c,d]
3. Prove that, if f : I → R is equal to 0 for every (x, y) ∈
/ Q × Q, then I f = 0.
4. By the use of the dDominated Convergence Theorem, prove that
cos(ek(x +y ) )
2 2
lim √ dx dy = 0 ,
k→∞ [0,1]×[0,1] k
√ x +y
lim k sinh dx dy = 0 .
k→∞ [−1,1]×[−1,1] k
66 Chapter 2 • Functions of Several Real Variables
Proposition 2.3
Let I1 and I2 be two rectangles containing the set E. Then, fE is integrable on I1 if
and only if it is integrable on I2 . In that case, we have I1 fE = I2 fE .
Proof
We consider for simplicity the case N = 2. Assume that fE be integrable on I1 . Let K be
a rectangle containing both I1 and I2 . We can construct some non-overlapping rectangles
K1 , . . . , Kr , also non-overlapping with I1 , such that I1 ∪ K1 ∪ · · · ∪ Kr = K. We
now prove that fE is integrable on each of the subrectangles K1 , . . . , Kr , and that the
integrals K1 fE , . . . , Kr fE are all equal to zero. Notice that fE restricted to each of these
subrectangles is zero everywhere except perhaps on one of their edges. We are thus led to
prove the following lemma, which will permit us to conclude the proof.
Lemma 2.4
Let K be a rectangle and g : K → R be a function which is zero everywhere except
perhaps on one edge of K. Then g is integrable on K and K g = 0.
Proof
We first assume that the function g be bounded on K, i.e., that there is a constant C > 0 for
which
|g(x, y)| ≤ C ,
for every (x, y) ∈ K. Fix ε > 0. Let L be the edge of the rectangle K on which g can be
nonzero, and denote by its length. Define the constant gauge δ = C
ε
. Then, for every δ-fine
2.2 · Integrability on a Bounded Set
67 2
P-partition = {(x 1 , I1 ), (x 2 , I2 ), . . . , (x m , Im )} of K, we have:
m
|S(K, g, )| ≤ |g(x j )|μ(Ij )
j =1
= |g(x j )|μ(Ij )
{j : x j ∈L}
≤C μ(Ij )
{j : x j ∈L}
≤ Cδ = ε .
This proves that g is integrable on K and K g = 0 in the case when g is bounded on K. If
g is not such, assume that it has non-negative values. Define the following sequence (gk )k of
functions:
gk (x) = min{g(x), k} .
Being the functions gk bounded, for what have been seen above we have K gk = 0, for every
k. It is easily seen that the sequence thus defined satisfies the conditions of the Monotone
Convergence Theorem and converges pointwise to g. It then follows that g is integrable on
K and
g = lim gk = 0 .
K k K
If g does not have only non-negative values, it is always possible to consider g + and g − .
From the above, K g + = K g − = 0, and then K g = 0, which is what was to be
proved.
E = {x ∈ E : f (x) ≥ 0} ,
Definition 2.6
A bounded set E is said to be measurable if the constant function 1 is integrable on
E. In that case, the number E 1 is said to be the measure of E and is denoted by
μ(E).
The measure of a measurable set is thus a non-negative number. The empty set is
assumed to be measurable, and its measure is equal to 0. In the case of a subset of R2 ,
its measure is also called the area of the set. If E = [a1 , b1 ] × [a2 , b2 ] is a rectangle, it
is easily seen that
μ(E) = 1 = (b1 − a1 )(b2 − a2 ) ,
E
2.3 · The Measure of a Bounded Set
69 2
so that the notation is in accordance with the one already introduced for rectangles. For
a subset of R3, the measure is also called the volume of the set.
Not every set is measurable. It is shown in Appendix C that, when dealing with
non-measurable sets, some paradoxical situations can arise. In the following, we will be
careful to always consider measurable sets.
Let us analyze some properties of the measure. It is useful to introduce the
characteristic function of a set E, defined by
1 if x ∈ E ,
χE (x) =
0 if x
∈ E .
Proposition 2.7
Let A and B be two bounded and measurable sets. The following properties hold:
(a) if A ⊆ B, then B \ A is measurable, and
Proof
Let I be a rectangle containing A ∪ B. If A ⊆ B, then χB\A = χB − χA , and property (a)
follows by integrating on I .
Being χA∪B = max{χA , χB } and χA∩B = min{χA , χB }, we have that χA∪B and χA∩B
are integrable on I . Moreover,
χA∪B + χA∩B = χA + χB ,
The following proposition states the property of complete additivity of the measure.
70 Chapter 2 • Functions of Several Real Variables
2 Proposition 2.8
If (Ak )k≥1 is a sequence of bounded and measurable sets, whose union A = ∪k≥1 Ak
is bounded, then A is measurable, and
∞
μ(A) ≤ μ(Ak ) .
k=1
Proof
Assume first that the sets Ak be pairwise disjoint. Let I be a rectangle containing their union
A. Then, for every x ∈ I,
∞
χA (x) = χAk (x) .
k=1
∞
the series ∞k=1 I χAk = k=1 μ(Ak ) converges. By the corollary following the Monotone
Convergence Theorem, we have that A is measurable and
∞ ∞
∞
μ(A) = χA = χAk = χAk = μ(Ak ) .
I I k=1 k=1 I k=1
When the sets Ak are not pairwise disjoint, consider the sets B1 = A1 , B2 = A2 \ A1 and,
in general, Bk = Ak \ (A1 ∪ · · · ∪ Ak−1 ). The sets Bk are measurable, pairwise disjoint, and
∪k≥1 Bk = ∪k≥1 Ak . The conclusion then follows from what has been proved above.
Proposition 2.9
If (Ak )k≥1 is a sequence of bounded and measurable sets, their intersection A =
∩k≥1 Ak is a measurable set.
2.3 · The Measure of a Bounded Set
71 2
Proof
Let I be a rectangle containing the set A. Then,
⎛ ⎞
% $
Ak = I \ ⎝ (I \ (Ak ∩ I ))⎠ ,
k≥1 k≥1
The following two propositions will provide us with a large class of measurable sets.
Proposition 2.10
Every open and bounded set is measurable.
Proof
Consider for simplicity the case N = 2. Let A be an open set contained in a rectangle I.
We divide the rectangle I in four rectangles of equal areas using the axes of its edges. Then
we proceed analogously with each of these four rectangles, thus obtaining sixteen smaller
rectangles, and so on. Being A open, for every x ∈ A there is a small rectangle among those
just constructed which contains x and is contained in A. In this way, it is seen that the set
A is covered by a countable family of rectangles; being the union of a countable family of
measurable sets, it is therefore measurable.
Proposition 2.11
Every compact set is measurable.
Proof ◦ ◦
Let
◦
B be a compact set, and let I be a rectangle whose
◦
interior
◦
I contains B. Being I and
I \ B open and hence measurable, we have that B = I \ (I \ B) is measurable.
E = {(x, y) ∈ R2 : 1 < x 2 + y 2 ≤ 4}
is measurable, being the difference of the closed disks with radius 2 and 1 centered at the
origin:
E = {(x, y) ∈ R2 : x 2 + y 2 ≤ 4} \ {(x, y) ∈ R2 : x 2 + y 2 ≤ 1} .
72 Chapter 2 • Functions of Several Real Variables
Er = {x ∈ E : f (x) > r}
is measurable, and
1
μ(Er ) ≤ f.
r E
Proof
Let I be a rectangle containing E. Once fixed r > 0, we define the following functions on
I:
0 ≤ fk (x) ≤ 1 ,
for every k and every x ∈ I. The Monotone (or Dominated, as well) Convergence Theorem
guarantees that χEr is integrable on I , i.e., that Er is measurable. Since, for every x ∈ E, it
is rχEr (x) ≤ f (x), integrating we obtain the inequality we are looking for.
Corollary 2.13
Let E be a bounded and measurable set, and f : E → R an integrable function with
non-negative values. Then, taken two real numbers r, s such that 0 ≤ r < s, the set
is measurable.
2.5 · Negligible Sets
73 2
Proof
Let Er = {x ∈ E : f (x) ≥ r}. With the notations of the previous theorem, if r > 0, we have
%
Er = Er− 1 ,
k
k> 1r
Definition 2.14
We say that a bounded set is negligible if it is measurable and its measure is equal
to zero.
Every set made of a single point is negligible. Consequently, all finite or countable
bounded sets are negligible. The edge of a rectangle in R2 is a negligible set, as shown
by Lemma 2.4. Another interesting example of a negligible and not countable set is
given by the Cantor set (see, e.g., [1, Theorem 4.16]).
By the complete additivity of the measure, the union of any sequence of negligible
sets, if it is bounded, is always a negligible set.
Theorem 2.15
If E is a bounded set and f : E → R is equal to zero except for a negligible set, then
f is integrable on E and E f = 0.
Proof
Let T be the negligible set on which f in non-zero. Assume first that the function f be
bounded, i.e., that there is a constant C > 0 such that
|f (x)| ≤ C ,
for every x ∈ E. We consider a rectangle I containing E and prove that I fE = 0. Fix
ε > 0. Since T has zero measure, there is a gauge δ such that, for every δ-fine P-partition
= {(x j , Ij ), j = 1, . . . , m} of I,
ε
S(I, χT , ) = μ(Ij ) ≤ ,
C
{j : x j ∈T }
74 Chapter 2 • Functions of Several Real Variables
so that
2
|S(I, fE , )| ≤ |f (x j )|μ(Ij ) ≤ C μ(Ij ) ≤ ε .
{j : x j ∈T } {j : x j ∈T }
Hence, in case f is bounded, it is integrable on E and E f = 0. If f is not bounded, assume
first that it has non-negative values. Define on E a sequence of functions (fk )k :
Being the functions fk bounded and equal to zero except on T , for what has just been seen
they are integrable on E with E fk = 0, for every k. It is easily seen that the defined
sequence satisfies the conditions of the Monotone Convergence Theorem and converges
pointwise to f. Hence, f is integrable on E, and
f = lim fk = 0 .
E k E
If f does not have non-negative values, it is sufficient to consider f + and f − , and apply to
them what has been said above.
Theorem 2.16
If f : E → R is an integrable function on a bounded set E, having non negative
values, with E f = 0, then f is equal to zero except on a negligible set.
Proof
Using the Chebyshev inequality, we have that, for every positive integer k,
μ(E 1 ) ≤ k f = 0.
k E
Hence, every E 1 is negligible, and since their union is just the set where f is non-zero, we
k
have the conclusion.
Definition 2.17
Let E be a bounded set. We say that a proposition is true almost everywhere on E
(or for almost every point of E) if the set of points for which it is false is negligible.
Corollary 2.18
If two functions f and g, defined on the bounded set E, are equal almost everywhere
on E, then f is integrable on E if and only if such is g. In that case, E f = E g.
This last corollary permits us to consider some functions which are defined almost
everywhere, and to define their integral.
Definition 2.19
A function f, defined almost everywhere on E, with real values, is said to be
integrable on E if it can be extended to an integrable function g : E → R. In this
case, we set E f = E g.
It can be seen that all the properties and theorems seen before remain true for
such functions. The reader is invited to verify this.
Lemma 2.20
Let E be a set contained in a rectangle I, and let δ be a gauge on E. Then, there is
a finite or countable family of non-overlapping rectangles Jk , contained in I, whose
union covers the set E, with the following property: in each of the sets Jk there is a
point x k belonging to E such that Jk ⊆ B[x k , δ(x k )].
Proof
We consider for simplicity the case N = 2. Let us divide the rectangle I in four rectangles,
having the same areas, by the axes of its edges. We proceed analogously with each of these
four rectangles, obtaining sixteen smaller rectangles, and so on. We thus obtain a countable
family of smaller and smaller rectangles. For every point x of E we can choose one of these
rectangles which contains x and is itself contained in B[x, δ(x)]. These rectangles would
satisfy the properties of the statement, if they were non-overlapping.
In order that the sets Jk be non-overlapping, it is necessary to make a careful choice
of them, and here is how to do it. We first choose those from the beginning four, if there
are any, which contain a point x k belonging to E such that Jk ⊆ B[x k , δ(x k )]; once
this choice has been made, we eliminate all the smaller rectangles contained in them. We
consider then the sixteen smaller ones and, among the ones which remained after the first
elimination procedure, we choose those, if there are any, which contain a point x k belonging
76 Chapter 2 • Functions of Several Real Variables
to E such that Jk ⊆ B[x k , δ(x k )]; once this choice has been made, we eliminate all the
2 smaller rectangles contained in them; and so on.
Remark Notice that if, in the assumptions of the covering lemma, it happens that E is
contained in an open set which is itself contained in I, then all the rectangles Jk can be
chosen so to be all contained in that open set.
We can now prove the following characterization of the bounded measurable sets.1
Proposition 2.21
Let E be a bounded set, contained in a rectangle I. The three following propositions
are equivalent:
(i) the set E is measurable;
(ii) for every ε > 0 there are two finite or countable families (Jk ) and (Jk ), each
made of [non-overlapping] rectangles contained in I, such that
$ $ $ $
E⊆ Jk , I \ E ⊆ Jk and μ Jk ∩ Jk ≤ ε;
k k k k
(iii) there are two sequences (En )n≥1 and (En )n≥1 of bounded and measurable
subsets such that
Proof
Let us first prove that (i) implies (ii). Assume that E be measurable, and fix ε > 0. By the
Saks–Henstock’s theorem, there is a gauge δ on I such that, for every δ-fine sub-P-partition
= {(x j , Kj ) : j = 1, . . . , m} of I,
m
ε
χE (x j )μ(Kj ) − χE ≤ .
Kj 2
j =1
N
ε
μ(Jk ) − χ ≤ ,
E 2
k=1 Jk
whence
N N
ε ε ε
μ(Jk ) ≤ χE + ≤ χE + = μ(E) + .
2 2 2
k=1 k=1 Jk I
Since this holds for every positive integer N, we have thus constructed a family (Jk ) of
non-overlapping rectangles, such that
$ ε
E⊆ Jk , μ(Jk ) ≤ μ(E) + .
2
k k
Consider now I \E, which is measurable, as well. We can repeat the same procedure as above
replacing E by I \ E, thus finding a family (Jk ) of non-overlapping rectangles, contained in
I , such that
$ ε
I \E ⊆ Jk , μ(Jk ) ≤ μ(I \ E) + .
2
k k
Consequently,
$ $
I\ Jk ⊆E⊆ Jk ,
k k
and hence
$ $ $ $
μ Jk ∩ Jk =μ Jk \ I \ Jk
k k k k
$ $
=μ Jk −μ I \ Jk
k k
$ $
=μ Jk − μ(I ) + μ Jk
k k
ε ε
≤ μ(E) + − μ(I ) + μ(I \ E) +
2 2
= ε,
Taking ε = n1 , it is easy to see that (ii) implies (iii). Let us prove now that (iii) implies
2 (i). Consider the measurable sets
$ %
=
E En , =
E En ,
n≥1 n≥1
⊆ E ⊆ E
E , ) = μ(E)
μ(E .
Equivalently, we have
χE ≤ χE ≤ χE , (χE − χE ) = 0 ,
I
) =
so that χE = χE = χE almost everywhere. Then, E is measurable and μ(E) = μ(E
Moreover,
μ(E).
hence μ(E) = limn μ(En ). Analogously it is seen that μ(E) = limn μ(En ), and the proof
is thus completed.
Proposition 2.22
Let E be a bounded set. Then, E is negligible if and only if for every ε > 0 there is a
finite or countable family (Jk ) of [non-overlapping] rectangles such that
$
E⊆ Jk , μ(Jk ) ≤ ε .
k k
Proof
The necessary condition is proved in the first part of the proof of the previous proposition.
Let us prove the sufficiency. Once fixed ε > 0, assume there exists a family (Jk ) with
the given properties. Let I be a rectangle containing the set E. On the other hand, consider a
family (Jk ) whose elements all coincide with I. The condition (ii) of the previous proposition
is then satisfied, so that E is indeed measurable. Then,
$
μ(E) ≤ μ Jk ≤ μ(Jk ) ≤ ε ;
k k
Corollary 2.23
If IN −1 is a rectangle in RN −1 and T is a negligible subset of R, then IN −1 × T is
negligible in RN .
Proof
Fix ε > 0 and, according to Proposition 2.22, let (Jk ) be a finite or countable family of
intervals in R such that
$ ε
T ⊆ Jk , μ(Jk ) ≤ .
μ(IN −1 )
k k
We begin this section by showing that the continuous functions are L-integrable on
compact sets.
Theorem 2.24
Let E ⊆ RN be a compact set and f : E → R be a continuous function. Then, f is
L-integrable on E.
Proof
We consider for simplicity the case N = 2. Being f continuous on a compact set, there is a
constant C > 0 such that
|f (x)| ≤ C ,
80 Chapter 2 • Functions of Several Real Variables
for every x ∈ E. Let I be a rectangle containing E. First we divide I into four rectangles, by
2 tracing the segments joining the mid points of its edges; we denote by U1,1 , U1,2 , U1,3 , U1,4
these subrectangles. We now divide again each of these rectangles in the same way,
thus obtaining sixteen smaller subrectangles, which we denote by U2,1 , U2,2 , . . . , U2,16 .
Proceeding in this way, for every k we will have a subdivision of the rectangle I in 22k
small rectangles Uk,j , with j = 1, . . . , 22k . Whenever E has non-empty intersection with
◦ ◦
Uk,j , we choose and fix a point x k,j ∈ E ∩ Uk,j . Define now the function fk in the following
way: ◦ ◦
▬ if E ∩ Uk,j is non-empty, fk is constant on Uk,j with value f (x k,j );
◦ ◦
▬ if E ∩ Uk,j is empty, fk is constant on Uk,j with value 0.
The functions fk are thus defined almost everywhere on I, not being defined only on the
points of the grid made up by the above constructed segments, which form a countable family
of negligible sets. The functions fk are integrable on each subrectangle Uk,j , being constant
in its interior. By the property of additivity on subrectangles, these functions are therefore
integrable on I. Moreover,
|fk (x)| ≤ C ,
for almost every x ∈ I and every k ≥ 1. Let us see now that fk converges pointwise almost
everywhere to fE . Indeed, taking
◦
a point x ∈ I not belonging to the grid, for every k there is
a j = j (k) for which x ∈ Uk,j (k) ; we have two possibilities:
◦
a) x
∈ E; in this case, being E closed, we have that, for k sufficiently large, Uk,j (k) (whose
dimensions tend to zero as k → ∞) will have empty intersection with E, and then
fk (x) = 0 = fE (x).
b) x ∈ E; in this case, if k → +∞, we have that x k,j (k) → x (again using the fact that
◦
Uk,j (k) has dimensions tending to zero). By the continuity of f, we have that
Theorem 2.25
Let f : E → R be a L-integrable function on a bounded set E. Then, f is L-integrable
on every measurable subset of E.
Proof
Assume first f to have non-negative values. Let S be a measurable subset of E, and define
on E the functions fk = min{f, kχS }. They form an increasing sequence of L-integrable
2.7 · Continuous Functions and L-Integrable Functions
81 2
functions, since such are both f and kχS , which converges pointwise to fS . Moreover, it is
fk ≤ f,
E E
for every k. The Monotone Convergence Theorem then guarantees that f is integrable on S
in this case. In the general case, f being L-integrable, both f + and f − are L-integrable on
E. Hence, by the above, they are both L-integrable on S, and then such is f , too.
Let us prove now the property of complete additivity of the integral for L-
integrable functions. We will say that two bounded measurable subsets are non-
overlapping if their intersection is a negligible set.
Theorem 2.26
Let (Ek ) be a finite or countable family of measurable non-overlapping sets whose
union is a bounded set E. Then f is L-integrable on E if and only if the two following
conditions hold:
(a) f is L-integrable on each Ek ;
(b) k Ek |f (x)| dx < +∞.
Proof
Observe that
f (x) = fEk (x) , |f (x)| = |fEk (x)| ,
k k
for almost every x ∈ E. If f is L-integrable on E, from the preceding theorem, (a) follows.
Moreover, it is obvious that (b) holds whenever the sets Ek are in a finite number. If instead
they are infinite, for any fixed n, we have
n
n
|f (x)| dx = |fEk (x)| dx ≤ |f (x)| dx ,
k=1 Ek k=1 E E
Convergence Theorem, when applied to the series k fEk , yields the conclusion. In the
2 general case, it is sufficient to consider, as usual, the positive and the negative parts of f .
Exercises
1. Compute the area of the set
{(x, y) ∈ R2 : 0 ≤ x ≤ 1 , 0 ≤ y ≤ x} .
Let X be a metric space, Y a bounded subset of RN, and consider a function f : X×Y →
R. (For simplicity, we may think of X and Y as subsets of R.) The first question we want
to face is the following: when does the formula
lim f (x, y) dy = lim f (x, y) dy
x→x0 Y Y x→x0
Theorem 2.27
Let x0 be an accumulation point of X, and assume that:
(i) for every x ∈ X \ {x0 }, the function f (x, ·) is integrable on Y, so that we can
define the function
F (x) = f (x, y) dy ;
Y
(ii) for almost every y ∈ Y the limit limx→x0 f (x, y) exists and is finite, so that we
can define almost everywhere the function
(Continued )
2.8 · Limits and Derivatives Under the Integration Sign
83 2
Proof
Let us take a sequence (xk )k in X \ {x0 } which tends to x0 . Define, for every k, the functions
fk : Y → R such that fk (y) = f (xk , y). By the assumptions (i), (ii) and (iii) we can apply
the Dominated Convergence Theorem, so that
lim F (xk ) = lim fk (y) dy = lim fk (y) dy = η(y) dy .
k k Y Y k Y
The conclusion then follows from the characterization of the limit by the use of sequences.
Corollary 2.28
If X is a subset of RM, Y ⊆ RN is compact, and f : X × Y → R is continuous, then
the function F : X → R, defined by
F (x) = f (x, y) dy ,
Y
is continuous.
Proof
The function F (x) is well defined, being f (x, ·) continuous on the compact set Y. Let us fix
x0 ∈ X and prove that F is continuous at x0 . By the continuity of f,
Moreover, once taken a compact neighborhood U of x0 , there is a constant C > 0 such that
2 |f (x, y)| ≤ C, for every (x, y) ∈ U × Y. Theorem 2.27 can then be applied, and we have:
lim F (x) = f (x0 , y) dy = F (x0 ) ,
x→x0 Y
Let now X be a subset of R. The second question we want to face is the following:
when does the formula
d ∂f
f (x, y) dy = (x, y) dy
dx Y Y ∂x
Theorem 2.29
Let X be an interval in R containing x0 , Y ⊆ RN be bounded, and assume that:
(i) for every x ∈ X, the function f (x, ·) is integrable on Y, so that we can define the
function
F (x) = f (x, y) dy ;
Y
∂f
(ii) for every x ∈ X and almost every y ∈ Y, the partial derivative ∂x (x, y) exists;
(iii) there are two integrable functions g, h : Y → R such that
∂f
g(y) ≤ (x, y) ≤ h(y) ,
∂x
Proof
We define, for x ∈ X different from x0 , the function
f (x, y) − f (x0 , y)
ψ(x, y) = .
x − x0
2.8 · Limits and Derivatives Under the Integration Sign
85 2
For every x ∈ X \ {x0 }, the function ψ(x, ·) is integrable on Y. Moreover, for almost every
y ∈ Y, it is
∂f
lim ψ(x, y) = (x0 , y) .
x→x0 ∂x
By the Lagrange Mean Value Theorem, for (x, y) as above there is a ξ ∈ X between x0 and
x such that
∂f
ψ(x, y) = (ξ, y) .
∂x
for every x ∈ X \ {x0 } and almost every y ∈ Y. We are then in the hypotheses of the previous
theorem, and we can conclude that the function ∂f∂x (x0 , ·), defined almost everywhere on Y,
is integrable there, and
∂f
lim ψ(x, y) dy = (x0 , y) dy .
x→x0 Y Y ∂x
F (x) − F (x0 )
= lim ,
x→x0 x − x0
Corollary 2.30
If X is an interval in R, Y is a compact subset of RN, and the function f : X × Y → R
is continuous and has a continuous partial derivative ∂f ∂x : X × Y → R, then the
function F : X → R, defined by
F (x) = f (x, y) dy ,
Y
Proof
2 The function F (x) is well defined, being f (x, ·) continuous on the compact set Y. Taking a
point x0 ∈ X and a nontrivial compact interval U ⊆ X containing it, there is a constant C > 0
such that | ∂f
∂x (x, y)| ≤ C, for every (x, y) ∈ U × Y. By Theorem 2.27, F is differentiable at
x0 . The same argument holds replacing x0 by any x ∈ X, and
∂f
F (x) = (x, y) dy .
Y ∂x
e−x (y +1)
2 2
f (x, y) = .
y2 + 1
1
We want to see if the corresponding function F (x) = 0 f (x, y) dy is differentiable and, in
this case, to find its derivative. We have that
∂f
(x, y) = −2xe−x (y +1) ,
2 2
∂x
which, for y ∈ [0, 1] and x ≥ 0, is such that
#
2
≤ −2xe−x ≤ −2xe−x (y +1) ≤ 0 .
2 2 2
−
e
Now that we have found the derivative of F (x), let us make a digression, so to present a
nice formula. By the change of variable t = xy, one has
1
x
x 2
d
e−x
2 (y 2 +1)
dy = −2e−x e−t dt = − e−t dt
2 2 2
−2x .
0 0 dx 0
We would like now to pass to the limit for x → +∞. Since, for x ≥ 0, one has
e−x (y +1)
2 2
0≤ ≤ 1,
y2 + 1
2.9 · The Reduction Formula
87 2
we can pass to the limit under the sign of integration, thus obtaining
e−x (y +1) e−x (y +1)
1 2 2 1 2 2
lim dy = lim dy = 0 .
x→+∞ 0 y2 + 1 0 x→+∞ y 2 + 1
Hence,
+∞ 2
π
e−t dt
2
=
0 4
and, by symmetry,
+∞ √
e−t dt =
2
π,
−∞
Before stating the main theorem, it is useful to first prove a preliminary result.
Proposition 2.31
Let f : I → R be an integrable function on the rectangle I = [a1 , b1 ] × [a2 , b2 ].
Then, for almost every x ∈ [a1 , b1 ], the function f (x, ·) is integrable on [a2 , b2 ].
Proof
Let T ⊆ [a1 , b1 ] be the set of those x ∈ [a1 , b1 ] for which f (x, ·) is not integrable on
[a2 , b2 ]. Let us prove that T is a negligible set. For each x ∈ T , the Cauchy condition does
not hold. Hence, if we define the sets
⎧ ⎫
⎪
⎨ for every gauge δ2 on [a2 , b2 ] there are two ⎪
⎬
Tn = x : δ2 -fine P-partitions 2 and 2 of [a2 , b2 ] such that ,
⎪
⎩ ⎪
⎭
2 ) >
S([a2 , b2 ], f (x, ·), 2 ) − S([a2 , b2 ], f (x, ·), 1
n
we have that each x ∈ T belongs to Tn , if n is sufficiently large. So, T is the union of all Tn ,
and if we prove that any Tn is negligible, by the properties of the measure we will also have
that T is such. In order to do so, let us consider a certain Tn and fix ε > 0. Being f integrable
on I, there is a gauge δ on I such that, taken two δ-fine P-partitions and of I, it is
ε
≤
|S(I, f, ) − S(I, f, )| .
n
88 Chapter 2 • Functions of Several Real Variables
The gauge δ on I determines, for every x ∈ [a1 , b1 ], a gauge δ(x, ·) on [a2 , b2 ]. We now
2 x of [a2 , b2 ] in the
associate to each x ∈ [a1 , b1 ] two δ(x, ·)-fine P-partitions x2 and 2
following way:
– if x ∈ Tn , we can choose x2 and x such that
2
1
x2 ) >
S([a2 , b2 ], f (x, ·), x2 ) − S([a2 , b2 ], f (x, ·), ;
n
x thus determined:
Let us write the two P-partitions x2 and 2
To this aim, define the following two P-partitions of I which make use of the elements of
1 :
xi ) : i = 1, . . . , k , j = 1, . . . , m
= {((xi , ỹ xi ), Ji × K xi } .
j j
ε
≤
|S(I, f, ) − S(I, f, )| .
n
)| =
|S(I, f, ) − S(I, f,
mxi m̃xi
k
k
= xi )
x x x
f (xi , yj i )μ(Ji × Kj i ) − f (xi , ỹj i )μ(Ji × K j
i=1 j =1 i=1 j =1
⎡ x ⎤
m̃xi
k m i
= μ(Ji ) ⎣ xi )⎦
x x x
f (xi , yj i )μ(Kj i ) − f (xi , ỹj i )μ(K j
i=1 j =1 j =1
2.9 · The Reduction Formula
89 2
k
xi i )]
x
= μ(Ji )[S([a2 , b2 ], f (xi , ·), 2 ) − S([a2 , b2 ], f (xi , ·),
2
i=1
= xi )] .
μ(Ji )[S([a2 , b2 ], f (xi , ·), x2i ) − S([a2 , b2 ], f (xi , ·), 2
{i : xi ∈Tn }
Recalling that, if xi ∈ Tn ,
1
xi ) >
S([a2 , b2 ], f (xi , ·), x2i ) − S([a2 , b2 ], f (xi , ·), ,
2 n
we conclude that
ε 1
)| >
≥ |S(I, f, ) − S(I, f, μ(Ji ) ,
n n
{i : xi ∈Tn }
and hence S([a1 , b1 ], χTn , 1 ) ≤ ε, which is what we wanted to prove. All this shows that
the sets Tn are negligible, and therefore T is negligible, too.
Theorem 2.32
Let f : I → R be an integrable function on the rectangle I = [a1 , b1 ] × [a2 , b2 ].
Then:
(i) for almost every x ∈ [a1 , b1 ], the function f (x, ·) is integrable on [a2 , b2 ];
b
(ii) the function a22 f (·, y) dy, defined almost everywhere on [a1 , b1 ], is integrable
there;
(iii) we have
b1
b2
f = f (x, y) dy dx .
I a1 a2
Proof
We have already proved (i) in the preliminary proposition. Let us now prove (ii) and (iii).
Let T be the negligible subset of [a1 , b1 ] such that, for x ∈ T , the function f (x, ·) is not
integrable on [a2 , b2 ]. Being T × [a2 , b2 ] negligible in I, we can modify on that set the
function f without changing the integrability properties. (We can choose, for example, f = 0
on that set.) In this way, we can assume without loss of generality that T be empty. Let us
define
b2
F (x) = f (x, y) dy .
a2
90 Chapter 2 • Functions of Several Real Variables
Let ε > 0 be fixed. Because of the integrability of f on I, there is a gauge δ on I such that,
for every δ-fine P-partition of I,
S(I, f, ) − f ≤ ε .
2
I
We now associate to each x ∈ [a1 , b1 ] a δ(x, ·)-fine P-partition x2 of [a2 , b2 ] such that
ε
|S([a2 , b2 ], f (x, ·), x2 ) − F (x)| ≤
2(b1 − a1 )
(this is possible since f (x, ·) is integrable on [a2 , b2 ] with integral F (x)). Let us write the
P-partitions thus determined:
1 = {(xi , Ji ) : i = 1, . . . , n} ,
and construct the following δ-fine P-partition of I , making use of the elements of 1 :
n
ε ε
≤ μ(Ji ) + = ε .
2(b1 − a1 ) 2
i=1
Example Consider the function f (x, y) = x 2 sin y on the rectangle I = [−1, 1] × [0, π].
Being f continuous on a compact set, it is integrable there, so that:
1
π
f = x 2 sin y dy dx
I −1 0
1
1 1
x3 4
= x 2 [− cos y]π0 dx = 2 x 2 dx = 2 = .
−1 −1 3 −1 3
Clearly, the following version of the Reduction Theorem holds, which is symmetric
with respect to the preceding one.
Theorem 2.33
Let f : I → R be an integrable function on the rectangle I = [a1 , b1 ] × [a2 , b2 ].
Then:
(i) for almost every y ∈ [a2 , b2 ], the function f (·, y) is integrable on [a1 , b1 ];
b
(ii) the function a11 f (x, ·) dx, defined almost everywhere on [a2 , b2 ], is integrable
there;
(iii) we have:
b2
b1
f = f (x, y) dx dy .
I a2 a1
Therefore, if the above equality does not hold, then the function f is not integrable on I.
92 Chapter 2 • Functions of Several Real Variables
so that
1
1 x2 − y2
1
1 π
dy dx = dx = [arctan x]10 = .
0 0 (x 2 + y 2 )2 0 x +1
2 4
y=1
1 xy −x
dy = = 0,
−1 (x 2 + y 2 )2 2(x 2 + y 2 ) y=−1
so that
1
1
xy
dy dx = 0 .
−1 −1 (x 2 + y 2 )2
1
1
xy
dx dy = 0 .
−1 −1 (x 2 + y 2 )2
Nevertheless, we are not allowed to conclude that f is integrable on I. Truly, it is not at all.
Indeed, if f were integrable, it should be such on every subrectangle, and in particular on
2.9 · The Reduction Formula
93 2
[0, 1] × [0, 1]. But, if x
= 0, we have
y=1
1 xy −x 1
dy = = ,
0 (x 2 + y 2 )2 2(x 2 + y 2 ) y=0 2x(x 2 + 1)
P1 E = {x ∈ [a1 , b1 ] : Ex
= Ø} , P2 E = {y ∈ [a2 , b2 ] : Ey
= Ø} .
E
Ex
x P1 E
Theorem 2.34
Let f : E → R be an integrable function on the bounded set E. Then:
(i) for almost every x ∈ P1 E, the function fE (x, ·) is integrable on the set Ex ;
(ii) the function x → Ex f (x, y) dy, defined almost everywhere on P1 E, is integrable
there;
(iii) we have:
f = f (x, y) dy dx .
E P1 E Ex
(Continued )
94 Chapter 2 • Functions of Several Real Variables
f = f (x, y) dx dy .
E P2 E Ey
E = {(x, y) ∈ R2 : 0 ≤ x ≤ 1, −x 2 ≤ y ≤ x 2 } .
Being f continuous and E compact, the theorem applies; we have P1 E = [0, 1] and, for
every x ∈ P1 E, Ex = [−x 2 , x 2 ]. Hence:
1 x2
f = |xy| dy dx
E 0 −x 2
1 x 2
1 1
y|y| x6 1
= |x| dx = x dx =
5
= .
0 2 −x 2 0 6 0 6
Corollary 2.35
If E ⊆ R2 is a bounded and measurable set, then:
(i) for almost every x ∈ P1 E, the set Ex is measurable;
(ii) the function x → μ(Ex ), defined almost everywhere on P1 E, is integrable
there;
(iii) we have:
μ(E) = μ(Ex ) dx .
P1 E
In the case of functions of more than two variables, analogous results to the preceding
ones hold true, with the same proofs. One simply needs to separate the variables in two
different groups, calling x the first group and y the second one, and the same formulas
hold true.
Example We want to compute the volume of the three-dimensional ball with radius R > 0 :
let E = {(x, y, z) ∈ R3 : x 2 + y 2 + z2 ≤ R 2 }. Let us group together the variables (y, z)
√ the projection on the x-axis: P1 E = [−R, R]. The sections Ex then are disks
and consider
of radius R 2 − x 2 , and we have:
R R
x3 4
μ(E) = π(R 2 − x 2 ) dx = π(2R 3 ) − π = πR 3 .
−R 3 −R 3
Another way to compute the same volume is to group the variables (x, y) and consider
P1 E = {(x, y) : x 2 + y 2 = R 2 }. For every (x, y) ∈ P1 E, it is
. .
E(x,y) = − R −x −y , R −x −y ,
2 2 2 2 2 2
so that
⎛ √ ⎞
.
R
R2 −x 2 .
μ(E) = 2 R 2 − x 2 − y 2 dx dy = ⎝ √ 2 R 2 − x 2 − y 2 dy ⎠ dx
P1 E −R − R 2 −x 2
R
π/2
R 4
= 2(R 2 − x 2 ) cos2 t dt dx = π(R 2 − x 2 ) dx = πR 3 ,
−R −π/2 −R 3
√
by the change of variable t = arcsin y/ R 2 − x 2 .
formulas like
2
b1 b2 bN
f = ... f (x1 , x2 , . . . , xN ) dxN . . . dx2 dx1 .
I a1 a2 aN
Exercises
1. Compute the following integrals:
x 2 y dx dy ,
[0,1]×[2,3]
2
y exy dx dy ,
[−1,1]×[1,2]
sin(x 3 ) arctan(x 2 y) dx dy .
[−1,1]×[−2,2]
E = {(x, y) ∈ R2 : 1 ≤ x ≤ α, x 2 − y 2 ≥ 1} .
Lemma 2.36
Let A ⊆ RN be an open set, and ϕ : A → RN be a C 1 -function; if S is a subset of A
of the type
Proof
For simplicity, let us concentrate on the case of a subset of R2 of the type
S = [0, 1] × {0} .
with k = 1, . . . , n. For n large enough, they are contained in a rectangle R which is itself
contained in A. Being R a compact set, there is a constant C > 0 such that ϕ (q) ≤ C,
for every q ∈ R. Then, ϕ is Lipschitz
√ continuous on R with Lipschitz constant C. Since the
sets Jk,n have as diameter n1 2, the√sets ϕ(Jk,n ) are surely contained in some squares J˜k,n
whose sides’ lengths are equal to Cn 2. We then have that ϕ(S) is covered by the rectangles
J˜k,n , and
C√ 2
n
2C 2
μ(J˜k,n ) ≤ n 2 = .
n n
k=1
Since this quantity can be made arbitrarily small, the conclusion follows from Corollary 2.22.
98 Chapter 2 • Functions of Several Real Variables
As a consequence of the above lemma, it is easy to see that the image of the boundary
2 of a rectangle through a diffeomorphism ϕ is a negligible set. In particular, given two
non-overlapping rectangles, their images are non-overlapping sets.
We are now ready to prove a first version of the Theorem on the Change of
Variables in the integral, which will be generalized in a later section.
Theorem 2.37
Let ϕ be a diffeomorphism between two open and bounded sets A and B = ϕ(A), and
f : B → R be a continuous function. Then, for every closed subset D of A, we have:
f (x) dx = f (ϕ(u)) | det ϕ (u)| du .
ϕ(D) D
Proof
Notice first of all that the integrals appearing in the formula are both meaningful, being the
sets D and ϕ(D) compact and the considered functions continuous. We will proceed by
induction on the dimension N. Let us first consider the case N = 1.
First of all, using the method of integration by substitution, one verifies that the formula is
true when D is a compact interval [a, b] : it is sufficient to consider the two possible cases in
which ϕ is increasing or decreasing, and recall that every continuous function is primitivable.
For instance, if ϕ is decreasing, we have ϕ([a, b]) = [ϕ(b), ϕ(a)], so that:
ϕ(a)
f (x) dx = f (x) dx
ϕ([a,b]) ϕ(b)
a
= f (ϕ(u))ϕ (u) du
b
b
= f (ϕ(u))|ϕ (u)| du
a
= f (ϕ(u))|ϕ (u)| du .
[a,b]
◦
Let now R be a closed subset of A whose interior R contains D. Being both f and
(f ◦ ϕ)|ϕ | continuous,
◦ ◦
they are integrable on the compact sets ϕ(R) and R, respectively.
The open sets R and R \ D can each be split into a countable union of non-overlapping
compact intervals, whose images through ϕ also are non-overlapping
◦ ◦
close intervals. By the
complete additivity of the integral, the formula holds true for R and R \ D :
◦
f (x) dx = ◦
f (ϕ(u))|ϕ (u)| du ,
ϕ(R ) R
◦
f (x) dx = ◦
f (ϕ(u))|ϕ (u)| du .
ϕ(R \D) R \D
2.10 · Change of Variables in the Integral
99 2
Hence,
f (x) dx = ◦ ◦
f (x) dx
ϕ(D) ϕ(R \(R \D))
= ◦
f (x) dx − ◦
f (x) dx
ϕ(R ) ϕ(R \D)
= ◦
f (ϕ(u))|ϕ (u)| du − ◦
f (ϕ(u))|ϕ (u)| du
R R \D
= f (ϕ(u))|ϕ (u)| du ,
D
where, for j = 1, . . . , N, it is
Vt = {(v1 , . . . , vN ) : (v1 , . . . , vN , t) ∈ V } ,
PN +1 V = {t : Vt
= Ø} .
which happens to be a diffeomorphism, defined on the open set Vt , whose image is the open
2 set
Wt = {(x1 , . . . , xN ) : (x1 , . . . , xN , t) ∈ W } .
Moreover, det βt (v1 , . . . , vN ) = det β (v1 , . . . , vN , t). Consider also the sections
t = {(v1 , . . . , vN ) : (v1 , . . . , vN , t) ∈ D}
D ,
= {t : D
PN +1 D t
= Ø} .
t and PN +1 β(D).
Analogously, we consider β(D) By the definition of β, it is
t = βt (D
β(D) t ) , = PN +1 D
PN +1 β(D) .
In an analogous way we define α(D)u1 ,...,uN and P1,...,N α(D). They are all closed sets and,
by the definition of α, we have
t → ϕN +1 (u1 , . . . , uN , t)
is a diffeomorphism of one variable between the open sets Uu1 ,...,uN and Vu1 ,...,uN , sections
of U and V , respectively. Using the Reduction Theorem and the one-dimensional formula of
change of variables proved above, we have that
f˜ = f˜(v1 , . . . , vN +1 ) dvN +1 dv1 . . . dvN
α(D) P1,...,N α(D) α(D)u1 ,...,uN
= f˜(v1 , . . . , vN +1 ) dvN +1 dv1 . . . dvN
P1,...,N D ϕN+1 (u1 ,...,uN ,Du1 ,...,uN )
= f˜(u1 , . . . , uN , ϕN +1 (u1 , . . . , uN +1 ))·
P1,...,N D Du1 ,...,uN
∂ϕN +1
·
(u1 , . . . , uN +1 ) duN +1 du1 . . . duN
∂uN +1
= f˜(α(u)) | det α (u)| du .
D
We have then proved that, for every u ∈ A, there is a δ(u) > 0 such that the thesis holds true
when D is contained in B[u, δ(u)]. A gauge δ is thus defined on A. By Lemma 2.20, we can
now cover A with a countable family (Jk ) of non-overlapping rectangles, each contained in
a rectangle of the type B[u, δ(u)], so that the formula holds for the closed sets contained in
any of these rectangles.
At this point let us consider an arbitrary closed subset D of A. Then, the formula holds
for each D ∩ Jk and, by the complete additivity of the integral and the fact that the sets
102 Chapter 2 • Functions of Several Real Variables
= f (ϕ(u)) | det ϕ (u)| du
k D∩Jk
= f (ϕ(u)) | det ϕ (u)| du .
D
Remark The formula on the change of variables is often written, setting ϕ(D) = E, in the
equivalent form
f (x) dx = f (ϕ(u)) | det ϕ (u)| du .
E ϕ −1 (E)
E = {(x, y) ∈ R2 : −1 ≤ x ≤ 1, x 2 ≤ y ≤ x 2 + 1} ,
In this section we study how the measure is changed by the action of a diffeomorphism.
Theorem 2.38
Let ϕ be a diffeomorphism between two open and bounded sets A and B. If D is a
measurable subset of A, then ϕ(D) is measurable, | det ϕ | is integrable on D, and
μ(ϕ(D)) = | det ϕ (u)| du .
D
Proof
By the preceding theorem, the formula holds true whenever D is closed. Since every open
set can be written as the union of a countable family of non-overlapping (closed) rectangles,
2.11 · Change of Measure by Diffeomorphisms
103 2
by the complete additivity and the fact that A is bounded, the formula holds true even if D is
an open set.
Assume now that D is a measurable◦
set whose closure D is contained in A. Let R be a
closed subset of A whose interior R contains D. Then, there is a constant C > 0 such that
| det ϕ (u)| ≤ C for every u ∈ R. By Proposition 2.21, for every ε > 0 there are two finite or
◦
countable families (Jk ) and (Jk ), each made of non-overlapping rectangles contained in R,
such that
◦ $ $ $ $
R \ Jk ⊆ D ⊆ Jk , μ Jk ∩ Jk ≤ ε.
k k k k
Since the formula to be proved holds both on the open sets and on the closed sets, it certainly
holds on each rectangle Jk and Jk ; then, it holds on ∪k Jk , on ∪k Jk , and since it holds even
◦ ◦ ◦
on R, it has to be true on R \ (∪k Jk ), as well. We have thus that ϕ(∪k Jk ) and ϕ(R \ (∪k Jk ))
are measurable,
◦ $ $
ϕ R \ Jk ⊆ ϕ(D) ⊆ ϕ Jk ,
k k
and
$ ◦ $
μ ϕ Jk − μ ϕ R\ Jk =
k k
= | det ϕ (u)| du − ◦
| det ϕ (u)| du
∪k Jk R \(∪k Jk )
= | det ϕ (u)| du
(∪k Jk )∩(∪k Jk )
$ $
≤ Cμ Jk ∩ Jk
k k
≤ Cε .
◦
Taking ε = n1 , we find in this way two sequences Dn = ∪k Jk,n and Dn = R \ (∪k Jk,n
) with
the above properties. By Proposition 2.21, we have that ϕ(D) is measurable and μ(ϕ(D)) =
limn μ(ϕ(Dn )) = limn μ(ϕ(Dn )). Moreover, since χDn converges almost everywhere to χD ,
by the Dominated Convergence Theorem we have:
2 = | det ϕ (u)|χD (u) du
R
= | det ϕ (u)| du .
D
We can now consider the case of an arbitrary measurable set D in A. Being A open we
can consider a sequence of non-overlapping rectangles (Kn ) contained in A whose union is
A. The formula holds for each of the sets D ∩ Kn , by the above. The complete additivity of
the integral and the fact that A is bounded then permit us to conclude.
we have a diffeomorphism between the set D = ]1, 2[ × ]3, 4[ and E = ϕ(D). Moreover,
1/v −u/v 2 u2
det ϕ (u, v) = det = .
2u/v −u2 /v 2 v3
Applying the formula on the change of measure and the Reduction Theorem, we have:
2
4 u2
2 7 2 49
μ(E) = dv du = u du = .
1 3 v3 1 288 864
We are now interested in generalizing the Theorem on the Change of Variables assuming
f not necessarily continuous, but only L-integrable on a measurable set. In order to do
this, it will be useful to prove the following important relation between the integral of a
function having non-negative values and the measure of its epigraph.
Proposition 2.39
Let E be a bounded and measurable set and f : E → R be a bounded function with
non-negative values. Let Gf be the set thus defined:
Gf = {(x, t) ∈ E × R : 0 ≤ t ≤ f (x)} .
(Continued )
2.12 · The General Theorem on the Change of Variables
105 2
Proof
Assume first Gf to be measurable. By the Reduction Theorem, since P1 Gf = E, the
f (x)
sections being (Gf )x = [0, f (x)], we have that the function x → 0 1 = f (x) is
integrable on E, and
f (x)
μ(Gf ) = 1= 1 dt dx = f (x) dx .
Gf E 0 E
Assume now f to be integrable on E. Let C > 0 be a constant such that 0 ≤ f (x) < C, for
every x ∈ E. Taken a positive integer n, we divide the interval [0, C] in n equal parts and
consider, for j = 1, . . . , n, the sets
j j −1 j
En = x ∈ E : C ≤ f (x) < C ;
n n
by Corollary 2.13 they are measurable, non-overlapping and their union is E. We can then
define on E the function ψn in the following way:
n
j
ψn = Cχ j ,
n En
j =1
and so
n
$
j j
Gψn = En × 0, C .
n
j =1
j
By Proposition
2.21, it is easy to see that, being the sets En measurable, such are the sets
j
En × 0, jn C , too. Consequently, the sets Gψn are measurable. Moreover, since
%
Gf = Gψn ,
n≥1
We are now in the position to prove the second version of the Theorem on the
Change of Variables in the integral.
106 Chapter 2 • Functions of Several Real Variables
2 Theorem 2.40
Let ϕ be a diffeomorphism between two bounded and open sets A and B = ϕ(A)
of RN , D a measurable subset of A and f : ϕ(D) → R a function. Then, f is L-
integrable on ϕ(D) if and only if (f ◦ ϕ) | det ϕ | is L-integrable on D, in which case
we have:
f (x) dx = f (ϕ(u)) | det ϕ (u)| du .
ϕ(D) D
Proof
Assume that f be L-integrable on E = ϕ(D). We first consider the case when f is bounded
with non-negative values.
Let C > 0 be such that 0 ≤ f (x) < C, for every x ∈ E. We define the open sets
à = A× ] − C, C[ , B̃ = B× ] − C, C[ ,
This function is a diffeomorphism and det ϕ̃ (u, t) = det ϕ (u), for every (u, t) ∈ Ã. Let Gf
be the epigraph of f :
Gf = {(x, t) ∈ E × R : 0 ≤ t ≤ f (x)} .
Using the formula on the change of measure and the Reduction Theorem, we have
μ(Gf ) = | det ϕ̃ (u, t)| du dt
ϕ̃ −1 (Gf )
= | det ϕ (u)| du dt
ϕ̃ −1 (Gf )
f (ϕ(u))
= | det ϕ (u)| dt du
D 0
= f (ϕ(u)) | det ϕ (u)| du .
D
2.13 · Some Useful Transformations in R2
107 2
On the other hand, by Proposition 2.39, we have that μ(Gf ) = ϕ(D) f, and this proves that
the formula holds in case f is bounded with non-negative values.
In the case when f is not bounded but still has non-negative values, we consider the
functions
For each of them, the formula holds true, and using the Monotone Convergence Theorem one
proves that the formula holds for f even in this case.
When f does not have non-negative values, it is sufficient to consider its positive and
negative parts, apply for them the formula and then subtract.
In order to obtain the opposite implication, it is sufficient to consider (f ◦ ϕ) | det ϕ |
instead of f and ϕ −1 instead of ϕ, and to apply what has been just proved.
There are some transformations which do not change the measure of any measurable
set. We consider here some of those which are most frequently used in applications.
Translations We call translation by a given vector a = (a1 , a2 ) ∈ R2 , the transforma-
tion defined by
ϕ(u, v) = (u + a1 , v + a2 ) .
Here det ϕ = −1, so that, taking for example the first case, we have:
f (x, y) dx dy = f (−u, v) du dv .
ϕ(D) D
It is a diffeomorphism, with
cos α − sin α
det ϕ (u, v) = det = (cos α)2 + (sin α)2 = 1 .
sin α cos α
3 Let us mention here that reference [2] contains an ingenious example of a integrable function in R2 whose
rotation by α = π/4 is not integrable. This is why we have restricted our attention only to L-integrable
functions.
2.13 · Some Useful Transformations in R2
109 2
Another useful transformation is the function ψ : [0, +∞[ ×[0, 2π[ → R2 given by
y
ρ
θ
x
E = {(x, y) ∈ R2 : x ≥ 0, y ≥ 0, x 2 + y 2 < 9} .
π/2
3
81
π/2 81
f = ρ 3 cos θ sin θ dρ dθ = cos θ sin θ dθ = .
E 0 0 4 0 8
110 Chapter 2 • Functions of Several Real Variables
y
θ ρ
x
Example Let us compute the integral E f, where f (x, y, z) = x 2 + y 2 and
√
E = {(x, y, z) ∈ R3 : x 2 + y 2 ≤ 1, 0 ≤ z ≤ x + y + 2} .
Consider now the function σ : [0, +∞[ ×[0, 2π[ ×[0, π] → R3 defined by
φ
ρ y
θ
x
We have:
μ(E) = 1 dx dy dz
E
= ρ 2 sin φ dρ dθ dφ
σ −1 (E)
1
π/4
2π
= 2
ρ sin φ dθ dφ dρ
0 0 0
1
π/4
= 2π ρ 2 sin φ dφ dρ
0 0
√
1
2
= 2π 1 − ρ 2 dρ
2 0
√
2 2π
= 1− .
2 3
Exercises
1. Let E be a planar set contained in [0, +∞[ ×R, and define the set
.
E rot = (x, y, z) ∈ R3 : (x, y 2 + z2 ) ∈ E
(i.e., the set obtained rotating E around the x-axis). Prove that
μ(E rot ) = 2π x dx dy .
E
2. Use the above formula to compute the volume of a sphere: V = 43 πR 3 . Moreover, prove
that the volume of the torus with minor radius r and major radius R is equal to 2π 2 Rr 2 .
3. By some modified cylindrical coordinates, prove that the volume of the cone
# /
x 2 y 2 z
(x, y, z) ∈ R : 3
+ ≤ ≤1 ,
a b h
x2 y2 z2
(x, y, z) ∈ R3 : + + ≤ 1 .
a2 b2 c2
2.15 · The Integral on Unbounded Sets
113 2
5. Compute the integral
.
x 2 + y 2 + z2 dx dy dz ,
B1
where B1 is the three-dimensional open ball centered at the origin, with radius 1.
While for L-integrable functions the extension of the theory to unbounded sets does
not encounter great difficulties, it seems not to exist a satisfactory general definition of
integrability for functions of several variables. This is the reason why, in the following,
we will concentrate only on the theory for L-integrable functions. We will use the
notation
B[0, r] = (x1 , . . . , xN ) ∈ RN : max{|x1 |, . . . , |xN |} ≤ r .
Definition 2.41
Given a set E ⊆ RN, not necessarily bounded, a function f : E → R is said to be
L-integrable (on E) if it is L-integrable on each of the bounded sets E ∩ B[0, r],
with r > 0, and the two following limits exist and are finite:
lim f, lim |f | .
r→+∞ E∩B[0,r] r→+∞ E∩B[0,r]
In this case, the first of these limits is said to be the integral of f on E and is
denoted by the symbol E f.
Equivalently, f is L-integrable on E if the two following limits exist and are finite:
f + = lim f+, f − = lim f−;
E r→+∞ E∩B[0,r] E r→+∞ E∩B[0,r]
in that case, we have E f = E f + − E f − .
It is not difficult to prove that the set of L-integrable functions is a vector space, and
the integral is a linear function on it which preserves the order. Moreover, one easily
verifies that a function f is L-integrable on a set E if and only if the function fE is
L-integrable on RN .
114 Chapter 2 • Functions of Several Real Variables
Definition 2.42
2 A set E ⊆ RN is said to be measurable if E ∩ B[0, r] is measurable, for every
r > 0. In that case, we set
Notice that μ(E), in some cases, can be +∞. It is finite if and only if the constant
function 1 is L-integrable on E, i.e., the characteristic function of E is L-integrable
on RN . The properties of bounded measurable sets extend easily to unbounded sets. In
particular, all open sets and all closed sets are measurable.
The Monotone Convergence Theorem of B. Levi attains the following general
form.
Theorem 2.43
We are given a function f and a sequence of functions fk , with k ∈ N, defined almost
everywhere on a subset E of RN, with real values, verifying the following conditions:
1. the sequence (fk )k converges pointwise to f , almost everywhere on E;
2. the sequence (fk )k is monotone;
3. each function fk is L-integrable on E;
4. the real sequence ( E fk )k has a finite limit.
Proof
Assume, for definiteness, that the sequence (fk )k is increasing. By considering the sequence
(fk − f0 )k instead of (fk )k , we can assume without loss of generality that all the functions
have almost everywhere non-negative values. Let A = limk ( E fk ); for every r > 0, we
can apply the Monotone Convergence Theorem on the bounded set E ∩ B[0, r], so that f is
integrable on E ∩ B[0, r] and
f = lim fk ≤ lim fk = A .
E∩B[0,r] k→∞ E∩B[0,r] k→∞ E
Let us prove that the limit of E∩B[0,r] f exists, as r → +∞, and that it is equal to A. Fix
ε > 0; there is a k̄ ∈ N such that, for k ≥ k̄,
ε
A− ≤ fk ≤ A ;
2 E
2.15 · The Integral on Unbounded Sets
115 2
being moreover
fk̄ = lim f ,
E r→∞ E∩B[0,r] k̄
Then, since the sequence (fk )k is increasing, we have that, for every k ≥ k̄ and every r ≥ r̄,
it is
A−ε ≤ fk ≤ A .
E∩B[0,r]
Corollary 2.44
We are given a function f and a sequence of functions fk , with k ∈ N, defined almost
everywhere on a subset E of RN, with real values, verifying the following conditions:
1. the series k fk converges pointwise to f , almost everywhere on E;
2. for every k ∈ N and almost every x ∈ E, it is fk (x) ≥ 0;
3. each function fk is L-integrable on E;
4. the series k ( E fk ) converges.
∞
f = fk .
E k=0 E
2 Theorem 2.45
We are given a function f and a sequence of functions fk , with k ∈ N, defined almost
everywhere on a subset E of RN, with real values, verifying the following conditions:
1. the sequence (fk )k converges pointwise to f , almost everywhere on E;
2. each function fk is L-integrable on E;
3. there are two functions g, h, defined almost everywhere and L-integrable on E,
such that
Theorem 2.46
Let (Ek ) be a finite or countable family of pairwise non-overlapping measurable
subsets of RN, whose union is a set E. Then, f is L-integrable on E if and only if
the two following conditions hold:
(a) f is L-integrable on each set Ek ;
(b) k Ek |f (x)| dx < +∞.
As another consequence, we have the Leibniz rule for not necessarily bounded
subsets Y of RN , which is stated as follows.
Theorem 2.47
Let f : X × Y → R be a function, where X is an interval of R containing x0 , and Y
is a subset of RN , such that:
(Continued )
2.15 · The Integral on Unbounded Sets
117 2
∂f
(ii) for every x ∈ X and almost every y ∈ Y, the partial derivative ∂x (x, y) exists;
(iii) there are two L-integrable functions g, h : Y → R such that
∂f
g(y) ≤ (x, y) ≤ h(y) ,
∂x
P1 E = {x ∈ RN1 : Ex
= Ø} , P2 E = {y ∈ RN2 : Ey
= Ø} .
Theorem 2.48
Let f : E → R be a L-integrable function. Then:
(i) for almost every x ∈ P1 E, the function f (x, ·) is L-integrable on the set Ex ;
(ii) the function x → Ex f (x, y) dy, defined almost everywhere on P1 E, is L-
integrable there;
(iii) we have:
f = f (x, y) dy dx .
E P1 E Ex
(Continued )
118 Chapter 2 • Functions of Several Real Variables
f = f (x, y) dx dy .
E P2 E Ey
Proof
Consider for simplicity the case N1 = N2 = 1, the general case being perfectly analogous.
Assume first that f has non-negative values. By the Reduction Theorem for bounded sets,
once fixed r > 0, we have that, for almost every x ∈ P1 E ∩ [−r, r], the function f (x, ·)
is L-integrable on Ex ∩ [−r, r]; the function gr (x) = Ex ∩[−r,r] f (x, y) dy, defined almost
everywhere on P1 E ∩ [−r, r], is L-integrable there, and
f = gr (x) dx .
E∩B[0,r] P1 E∩[−r,r]
In particular,
gr (x) dx ≤ f,
P1 E∩[−r,r] E
Keeping s fixed, we let r tend to +∞. Since f has non-negative values, gr (x) will be
increasing with respect to r. Consequently, for almost every x ∈ P1 E ∩ [−s, s], the limit
limr→+∞ gr (x) exists (possibly infinite), and we set
g(x) = lim gr (x) = lim f (x, y) dy .
r→+∞ r→+∞ E ∩[−r,r]
x
By Theorem 2.12 these are measurable sets and the Chebyshev inequality yields
1 1
μ(Enr ) ≤ gr (x) dx ≤ f.
n P1 E∩[−s,s] n E
2.15 · The Integral on Unbounded Sets
119 2
Hence, since the sets Enr increase with r, also the sets Fn = ∪r Enr are measurable, and
we have that μ(Fn ) ≤ n1 E f . Being T ⊆ ∩n Fn , we deduce that T is measurable, with
μ(T ) = 0.
Hence, for almost every x ∈ P1 E ∩ [−s, s], the function f (x, ·) is L-integrable on the
set Ex and, by definition,
f (x, y) dy = g(x) .
Ex
Moreover, if we take r in the set of natural numbers and apply the Monotone Convergence
Theorem to the functions gr , it follows that g is L-integrable on P1 E ∩ [−s, s], and
g = lim gr ,
P1 E∩[−s,s] r→∞ P E∩[−s,s]
1
so that
f (x, y) dy dx ≤ f.
P1 E∩[−s,s] Ex E
exists and in finite; therefore, the function x → Ex f (x, y) dy, defined almost everywhere
on P1 E, is L-integrable there, and its integral is the preceding limit. Moreover, from the
above proved inequality, passing to the limit, we have that
f (x, y) dy dx ≤ f.
P1 E Ex E
In conclusion, equality must hold, and the proof is thus completed in the case when f
2 has non-negative values. In the general case, just consider f + and f − , and subtract the
corresponding formulas.
Corollary 2.49
Let E be a measurable set, with a finite measure. Then,
(i) for almost every x ∈ P1 E, the set Ex is measurable and has a finite measure;
(ii) the function x → μ(Ex ), defined almost everywhere on P1 E, is L-integrable
there;
(iii) we have:
μ(E) = μ(Ex ) dx .
P1 E
The Theorem on the Change of Variables also extends to unbounded sets, with the
same statement.
Theorem 2.50
Let ϕ be a diffeomorphism between two open sets A and B = ϕ(A) of RN, D be a
measurable subset of A, and f : ϕ(D) → R be a function. Then, f is L-integrable on
ϕ(D) if and only if (f ◦ ϕ) | det ϕ | is L-integrable on D, in which case we have:
f (x) dx = f (ϕ(u)) | det ϕ (u)| du .
ϕ(D) D
Proof
Assume first that f be L-integrable on E = ϕ(D) with non-negative values. Then, for every
r > 0,
f (ϕ(u)) | det ϕ (u)| du = f (x) dx
D∩B[0,r] ϕ(D∩B[0,r])
≤ f (x) dx ,
ϕ(D)
2.15 · The Integral on Unbounded Sets
121 2
so that the limit
lim f (ϕ(u)) | det ϕ (u)| du
r→+∞ D∩B[0,r]
The formula is thus proved when f has non-negative values. In general, just proceed as usual,
considering f + and f − .
To obtain the opposite implication, it is sufficient to consider (f ◦ ϕ) | det ϕ | instead of
f and ϕ −1 instead of ϕ, and to apply the above.
It is thus seen that f is integrable on E if and only if α > 1, in which case the integral is
π
α−1 .
Exercises
1. Prove that the sets Q2 , Q × R and R × Q are negligible in R2 .
2. Compute the integral
1
dx dy .
R2 (1 + x2 + y2 ) x2 + y2
Eγ = {(x, y) ∈ R2 : x ≥ 1, 0 ≤ y ≤ x γ } ,
2.15 · The Integral on Unbounded Sets
123 2
for some γ ∈ R. When is the function f (x, y) = xy integrable on Eγ ? For those values
of γ , compute
xy dx dy .
Eγ
Differential Forms
Alessandro Fonda
Consider, for every positive integer M, the sets M (RN ) made by the M-linear
antisymmetric functions on RN, with real values. It is well known that these are vector
spaces on R. We also adopt the convention that 0 (RN ) = R.
If we choose the indices i1 , . . . , iM in the set {1, . . . , N}, we can define the M-linear
antisymmetric function dxi1 ,...,iM : it is the function which associates to the vectors
⎛ ⎞ ⎛ ⎞
v1(1) v1(M)
⎜ . ⎟ ⎜ . ⎟
v (1) =⎜ ⎟
⎝ .. ⎠ , . . . , v
(M)
=⎜
⎝ .. ⎠ ,
⎟
vN(1) vN(M)
det ⎜
⎝ .. · · · .. ⎠ .
⎟
(1) (M)
viM . . . viM
Notice that, whenever two indices coincide, we have the zero function. If two indices are
exchanged, the function changes sign. Let us recall the following result from elementary
algebra.
126 Chapter 3 • Differential Forms
Proposition 3.1
2N 3
If 1 ≤ M ≤ N, the space M (RN ) has dimension M . A basis is given by
3 (dxi1 ,...,iM )1≤i1 <···<iM ≤N . If M > N, then M (RN ) = {0}.
We are mostly interested in the case N = 3. Let us give a closer look to the spaces
1 (R3 ), 2 (R3 ) and 3 (R3 ).
Consider 1 (R3 ), the space of linear functions defined on R3, with values in R. We
denote by dx1 , dx2 , dx3 the following linear functions:
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
v1 v1 v1
⎜ ⎟ ⎜ ⎟ ⎜ ⎟
dx1 : ⎝ v2 ⎠ → v1 , dx2 : ⎝ v2 ⎠ →
v2 , dx3 : ⎝ v2 ⎠ →
v3 .
v3 v3 v3
the space 1 (R3 ) has dimension 3 and (dx1, dx2 , dx3 ) is one of its bases.
Consider 2 (R3 ), the space of bilinear antisymmetric functions defined on R3 × R3,
with values in R. It has dimension 3, and a basis is given by (dx1,2 , dx1,3 , dx2,3 ), where
⎛⎛ ⎞ ⎛ ⎞⎞
v1 v
⎜⎜ ⎟ ⎜ 1 ⎟⎟ v1 v1
dx1,2 : ⎝⎝ v2 ⎠ , ⎝ v2 ⎠⎠ → det = v1 v2 − v2 v1 ,
v2 v2
v3 v3
⎛⎛ ⎞ ⎛ ⎞⎞
v1 v
⎜⎜ ⎟ ⎜ 1 ⎟⎟ v1 v1
dx1,3 : ⎝⎝ v2 ⎠ , ⎝ v2 ⎠⎠ → det = v1 v3 − v3 v1 ,
v3 v3
v3 v3
⎛⎛ ⎞ ⎛ ⎞⎞
v1 v
⎜⎜ ⎟ ⎜ 1 ⎟⎟ v2 v2
dx2,3 : ⎝⎝ v2 ⎠ , ⎝ v2 ⎠⎠ → det = v2 v3 − v3 v2 .
v3 v3
v3 v3
⎛⎛ ⎞ ⎛ ⎞ ⎛ ⎞⎞ ⎛ ⎞
v1 v1 v1 v1 v1 v1
⎜⎜ ⎟ ⎜ ⎟ ⎜ ⎟⎟ ⎜ ⎟
dx1,2,3 : ⎝⎝ v2 ⎠ , ⎝ v2 ⎠ , ⎝ v2 ⎠⎠ → det ⎝ v2 v2 v2 ⎠ .
v3 v3 v3 v3 v3 v3
3.2 · Differential Forms in RN
127 3
Every element of the vector space 3 (R3 ) is a scalar multiple of dx1,2,3 : the space
3 (R3 ) has dimension 1. Recall that
Definition 3.2
Given an open subset U of RN, we call differential form of degree M (or
M-differential form) a function
ω : U → M (RN ) .
If M ≥ 1, once we consider the basis (dxi1 ,...,iM )1≤i1 <···<iM ≤N , the components of
the M-differential form ω will be denoted by fi1 ,...,iM : U → R. We will then write
ω(x) = fi1 ,...,iM (x) dxi1 ,...,iM .
1≤i1 <···<iM ≤N
2N 3
Hence, the M-linear antisymmetric function ω(x) is determined by the M
-dimensional
vector
2 3
F (x) = fi1 ,...,iM (x) 1≤i1 <···<iM ≤N .
With these definitions, it can be checked that the set of differential forms of degree M
becomes a vector space.
Let us give a closer look to the case N = 3. Denoting by ωM a M-differential form,
3 with M = 1, 2, 3, we can write
Notice that ω1 (x) and ω2 (x) are determined by the three-dimensional vectors
F (x) = (f1 (x), f2 (x), f3 (x)) , and G(x) = (f12 (x), f13 (x), f23 (x)) ,
respectively.
and
ω(x) = gj1 ,...,jM̃ (x) dxj1 ,...,jM̃ ,
1≤j1 <···<jM̃ ≤N
we set
(ω ∧
ω)(x) = fi1 ,...,iM (x)gj1 ,...,jM̃ (x) dxi1 ,...,iM ,j1 ,...,jM̃ .
1≤i1 <···<iM ≤N
1≤j1 <···<jM̃ ≤N
Usually the symbol ∧ is omitted when one of the two is a 0-differential form, since
the external product is, in this case, similar to the product with a scalar. Notice that, in
the above sum, all elements with a repeating index will be zero. Let us see now some
properties of the external product.
3.3 · External Product
129 3
Proposition 3.3
ω,
If ω, ˜ respectively, then
ω are three differential forms of degrees M, M̃, M̃,
ω ∧ ω = (−1)M M̃ ω ∧
ω,
ω) ∧
(ω ∧ ω∧
ω = ω ∧ ( ω) ;
if c ∈ R, then
(c ω) ∧
ω = ω ∧ (c
ω) = c(ω ∧
ω) ;
ω) ∧
(ω + ω = (ω ∧ ω ∧
ω) + ( ω) ,
ω) = (
ω ∧ (ω + ω ∧ ω) + (
ω∧
ω) .
Proof
Assume that ω and
ω are written as above, and let
ω(x) =
hk1 ,...,k ˜ (x) dxk1 ,...,k ˜ .
M̃ M̃
1≤k1 <···<k ˜ ≤N
M̃
The first identity is obtained observing that, in order to arrive from the sequence of indices
i1 , . . . , iM , j1 , . . . , jM̃ to the one j1 , . . . , jM̃ , i1 , . . . , iM , one has first to move j1 towards
the left making M exchanges, then the same has to be done for j2 , if there is one, and so
on, till jM̃ is reached. In the total, it is then necessary to operate M M̃ exchanges of indices.
Taking into account the fact that the differential form changes sign each time there is an
exchange, we have the formula we wanted to prove.
The proof of the second identity (associative property) shows no great difficulties, as well
as for the identities where the constant c appears.
Concerning the distributive property, when M = M̃ we have
ω) ∧
((ω +
ω)(x) =
= (fi1 ,...,iM (x) + gi1 ,...,iM (x))hk1 ,...,k ˜ (x) dxi1 ,...,iM ,k1 ,...,k ˜
1≤i1 <···<iM ≤N M̃ M̃
1≤k1 <···<k ˜ ≤N
M̃
= (fi1 ,...,iM (x)hk1 ,...,k ˜ (x)+
1≤i1 <···<iM ≤N M̃
1≤k1 <···<k ˜ ≤N
M̃
= ((ω ∧
ω∧
ω) + (
ω))(x) .
130 Chapter 3 • Differential Forms
The last identity in the statement is proved either in an analogous way, or using the first and
the fourth identities.
ω(x) = dx1 ,
ω(x) = dx2 , for every x ∈ U,
More generally, in view of the associative property of the external product, we can write
In the following, we will use indifferently the one or the other notation.
Given a M-differential form ω if class C 1 , we want to define the differential form dex ω,
of degree M + 1, which is said to be the external differential of ω.
If ω is a 0-differential form, ω = f : U → R, its external differential dex ω(x) is
just the differential df (x), which is a linear function defined on RN, with values in R.
Being, for every v = (v1, . . . , vN ),
∂f ∂f
df (x)v = (x) v1 + · · · + (x) vN ,
∂x1 ∂xN
we have
∂f ∂f ∂f N
df (x) = (x) dx1 + · · · + (x) dxN = (x) dxm .
∂x1 ∂xN m=1
∂xm
we set
dex ω(x) = dfi1 ,...,iM (x) ∧ dxi1 ∧ · · · ∧ dxiM ,
1≤i1 <···<iM ≤N
3.4 · External Differential
131 3
or, equivalently,
N
∂fi1 ,...,iM
dex ω(x) = (x) dxm ∧ dxi1 ∧ · · · ∧ dxiM .
1≤i1 <···<iM ≤N m=1
∂xm
In the following, in order to simplify the notations, we will always write dω instead of
dex ω. Let us see some properties of the external differential.
Proposition 3.4
If ω and
ω are two differential forms of class C 1 , of degrees M and M̃, respectively,
then
d(ω ∧
ω) = dω ∧
ω + (−1)M ω ∧ d
ω;
if M = M̃ and c ∈ R, it is
d(ω +
ω) = dω + d
ω,
d(c ω) = c dω ;
if ω is of class C 2 , then
d(dω) = 0 .
Proof
Concerning the first identity, if ω and
ω are as above, we have:
N
∂
d(ω ∧
ω)(x) = (fi ,...,i gj ,...,jM̃ )(x) dxm,i1 ,...,iM ,j1 ,...,jM̃
1≤i1 <···<iM ≤N ∂xm 1 M 1
1≤j1 <···<jM̃ ≤N m=1
N
∂fi 1 ,...,iM
= gj1 ,...,jM̃ +
1≤i1 <···<iM ≤N ∂xm
1≤j1 <···<jM̃ ≤N m=1
∂gj1 ,...,jM̃
+fi1 ,...,iM (x) dxm,i1 ,...,iM ,j1 ,...,jM̃
∂xm
N
∂fi 1 ,...,iM
= gj1 ,...,jM̃ (x) dxm,i1 ,...,iM ,j1 ,...,jM̃ +
1≤i1 <···<iM ≤N ∂xm
1≤j1 <···<jM̃ ≤N m=1
N
∂gj1 ,...,jM̃
+ (−1)M fi1 ,...,iM (x) dxi1 ,...,iM ,m,j1 ,...,jM̃
1≤i1 <···<iM ≤N ∂xm
1≤j1 <···<jM̃ ≤N m=1
= (dω ∧
ω)(x) + (−1)M (ω ∧ d
ω)(x) .
132 Chapter 3 • Differential Forms
The second and third identities follow easily from the linearity of the derivative.
Concerning the last identity, we can see that
3 d(dω)(x) =
N
N
∂ ∂fi1 ,...,iM
(x) dxk,m,i1 ,...,iM .
∂xk ∂xm
1≤i1 <···<iM ≤N k=1 m=1
Since
∂ ∂fi1 ,...,iM ∂ ∂fi1 ,...,iM
= ,
∂xk ∂xm ∂xm ∂xk
taking into account the fact that dxk ∧ dxm = −dxm ∧ dxk , it is seen that all the terms in the
sums pairwise eliminate one another, so that d(dω)(x) = 0.
When N = 3, if ω1 and
ω1 are two 1-differential forms, e.g.,
ω1 ∧
ω1 = (f1 g2 − f2 g1 ) dx1,2 + (f1 g3 − f3 g1 ) dx1,3 + (f2 g3 − f3 g2 ) dx2,3 .
it is
∂f ∂f ∂f
dω0 (x) = (x) dx1 + (x) dx2 + (x) dx3 .
∂x1 ∂x2 ∂x3
ω2 (x) = g1,2 (x) dx1,2 + g1,3 (x) dx1,3 + g2,3 (x) dx2,3 ,
then
∂g2,3 ∂g1,3 ∂g1,2
dω2 (x) = (x) − (x) + (x) dx1,2,3 .
∂x1 ∂x2 ∂x3
At this point, it is time to observe that, in view of future applications, a wiser choice
for the basis of the vector space 2 (R3 ) could be the following:
2 3
dx2,3 , dx3,1 , dx1,2 .
we have that ω1 ∧ :
ω1 corresponds to the vector product of F and F
F ×F 3 − F3 F
= (F2 F 2 , F3 F
1 − F1 F
3 , F1 F
2 − F2 F
1 ) ;
if instead of
ω1 we take the associated 2-differential form
1 dx2,3 + F
ω2 = F
2 dx3,1 + F
3 dx1,2 ,
we have that ω1 ∧ :
ω2 corresponds to the scalar product of F and F
F , F 1 + F2 F
= F1 F 2 + F3 F
3 .
The properties of the external product and those of the external differential lead
to formulas involving the gradient, the curl and the divergence. Taking f : U → R,
f˜ : U → R, F : U → R3 and F : U → R3, we have, for example, the following:
curl(grad f ) = 0 ,
div(curl F ) = 0 ,
grad(f f˜) = f˜(grad f ) + f (grad f˜) ,
curl(f F ) = (grad f ) × F + f (curl F ) ,
) = grad f , F̃ + f (div F
div(f F ) ,
) = curl F , F
div(F × F − F , curl F
.
The proofs are left to the reader, who might also enjoy the following exercises.
Exercises
1. Let ω,
ω : R2 → 1 (R2 ) be defined as
ω(x, y) = y 2 dx − x 2 dy ,
ω(x, y) = xy dx + (x 2 + y 2 ) dy .
Compute ω ∧
ω, dω, d ω, d(ω ∧ ω) and dω ∧ d
ω.
2. Let ω : R3 → 2 (R3 ) be defined as
ω(x, y, z) = x 2 yz dy ∧ dz + xy 2 z dz ∧ dx + xyz2 dx ∧ dy .
Compute dω : R3 → 3 (R3 ).
3. Let f : R3 → R be the function defined as f (x, y, z) = xy 2 z3 . Compute
f := div(gradf ) .
3.6 · M-Surfaces
135 3
F (x, y, z) = (x − y + z2 , x 2 + yz , x + y 2 − 2z) .
Compute
3.6 M-Surfaces
Definition 3.5
We call M-surface in RN a function1 σ : I → RN of class C 1 . If M = 1, σ is also
said to be a curve; if M = 2, we will simply say surface. The set σ (I ) is called the
support of the M-surface σ. We will say that the M-surface σ is regular if, for
every u ∈ ˚I , the Jacobian matrix σ (u) has rank M.
σ (t)
τσ (t) = .
σ (t)
σ(b )
τσ(t )
σ(t ) + τσ(t )
σ(a )
σ(t )
1 The partial derivatives of σ must be continuous on the whole I , and in the points of the boundary they are
interpreted, if necessary, as right or left derivatives. Equivalently, σ could be extended to a C 1 -function defined
on an open set containing I . In this perspective, the domain of σ could be a more general set than a rectangle,
as e.g. the closure of any bounded and open set, so that the differential be well defined at the boundary points,
as well. Analogous considerations can be made on the domains of the considered differential forms.
136 Chapter 3 • Differential Forms
{(x, y, z) : x 2 + y 2 = R 2 , z = 0}
(which is covered twice). Being σ (t) = (−2R sin(2t), 2R cos(2t), 0), it is a regular curve,
and
σ(u ,v ) + νσ(u ,v )
νσ(u ,v )
σ(u ,v)
∂σ
(u, v) × ∂σ
(u, v)
νσ (u, v) = ∂u ∂v
.
∂u (u, v) ×
∂σ ∂σ
∂v
(u, v)
Examples
{(x, y, z) : x 2 + y 2 + z2 = R 2 , y ≥ 0} .
3.6 · M-Surfaces
137 3
Being
∂σ
(φ, θ) = (R cos φ cos θ, R cos φ sin θ, −R sin φ) ,
∂φ
∂σ
(φ, θ) = (−R sin φ sin θ, R sin φ cos θ, 0) ,
∂θ
we compute
∂σ ∂σ
(φ, θ) × (φ, θ) = (R 2 sin2 φ cos θ, R 2 sin2 φ sin θ, R 2 sin φ cos φ) .
∂φ ∂θ
.
{(x, y, z) : ( x 2 + y 2 − R)2 + z2 = r 2 } .
138 Chapter 3 • Differential Forms
{(x, y, z) : x 2 + y 2 + z2 ≤ R 2 } .
Definition 3.6
Two M-surfaces σ : I → RN and σ̃ : J → RN are said to be equivalent if they
have the same support and there are two open sets A ⊆ I, B ⊆ J, and a
diffeomorphism ϕ : A → B with the following properties: the sets I \ A and J \ B
are negligible and σ (u) = σ̃ (ϕ(u)), for every u ∈ A. We say that σ and σ̃ have the
same orientation if det ϕ (u) > 0, for every u ∈ A; they have opposite
orientation if det ϕ (u) < 0, for every u ∈ A.
Examples Given a curve σ : [a, b] → RN, an equivalent curve with opposite orientation is,
for example, σ̃ : [a, b] → RN defined by
σ̃ (t) = σ (a + b − t) .
3.6 · M-Surfaces
139 3
If σ is regular, an interesting example of an equivalent curve with the same orientation is
obtained by considering the function
t
ϕ(t) = σ (τ ) dτ .
a
Since ϕ (t) = σ (t) > 0, for every t ∈ ]a, b[ , setting ι1 = ϕ(b), we have that ϕ :
[a, b] → [0, ι1 ] is bijective and the curve σ1 : [0, ι1 ] → RN , defined as σ1 (s) = σ (ϕ −1 (s))
is equivalent to σ. Notice that, for every s ∈ ]0, ι1 [ , it is
σ̃ (u, v) = σ (u, a2 + b2 − v) ,
or by
σ̃ (u, v) = σ (a1 + b1 − u, v) .
As will be seen in the sequel, two M-surfaces with the same support are not
necessarily equivalent. Let us introduce a particular class of M-surfaces for which this
inconvenience does not happen.
Definition 3.7
A M-surface σ : I → RN is a M-parametrization of a set M if it is regular,
injective on ˚
I , and σ (I ) = M. We say that a subset of RN is M-parametrizable if
there is a M-parametrization of it.
The following theorem is crucial for the treatment of the measure of M-parametrizable
M-surfaces.
140 Chapter 3 • Differential Forms
Theorem 3.8
Two M-parametrizations of the same set are always equivalent.
3
Proof
Let M be the subset of RN taken in consideration, and let σ : I → RN and σ̃ : J → RN be
two of its M-parametrizations. We define the sets
◦
A=˚
I ∩ σ −1 (M \ (σ (∂I ) ∪ σ̃ (∂J ))) , B = J ∩ σ̃ −1 (M \ (σ (∂I ) ∪ σ̃ (∂J ))) .
Then, for
◦
every u ∈ A, since σ (u) ∈ M \ (σ (∂I ) ∪ σ̃ (∂J )) and σ̃ (J ) = M, there exists
a v ∈ J such that σ̃ (v) = σ (u). Clearly,
◦
σ̃ (v) ∈ M \ (σ (∂I
◦
) ∪ σ̃ (∂J )), so that v ∈ B.
Moreover, since σ̃ is injective on J , there is a unique v in J with such a property. We can
thus define ϕ : A → B by setting ϕ(u) = v. Hence, for u ∈ A and v ∈ B,
Definition 3.9
We say that the M-differential form ω : U → M (RN ) is integrable on the
M-surface σ : I → U if, for every choice of the indices i1 , . . . , iM in the set
3 {1, . . . , N}, the function (fi1 ,...,iM ◦ σ ) det σ(i 1 ,...,iM ) is integrable on I. In that case,
we set
ω= fi1 ,...,iM (σ (u)) det σ(i 1 ,...,iM ) (u) du .
σ 1≤i1 <···<iM ≤N I
For example, ω is surely integrable on σ when all its components are continuous
functions. Notice that
⎛ ∂σi1 ∂σi1 ⎞
∂u1
(u) . . . ∂uM
(u)
∂(σi1 , . . . , σiM ) ⎜ ⎟
σ(i1 ,...,iM ) (u) = ⎜
(u) = ⎝ .
.. .. ⎟.
∂(u1 , . . . , uM ) ··· . ⎠
∂σiM ∂σiM
∂u1
(u) ... ∂uM
(u)
2N 3
If we define, for every x ∈ U and every u ∈ I , the M
-dimensional vectors
2 3
F (x) = fi1 ,...,iM (x) 1≤i1 <···<iM ≤N ,
2 3
(u) = det σ(i 1 ,...,iM ) (u) 1≤i <···<i ≤N ,
1 M
we have that
ω = F (σ (u)) , (u) du ,
σ I
N
where · , · denotes here the Euclidean scalar product in R(M ) .
It is important to analyze how the integral of a differential form ω changes on two
equivalent M-surfaces having the same orientations, or opposite orientations.
Theorem 3.10
Let σ : I → RN and σ̃ : J → RN be two equivalent M-surfaces. If they have the
same orientations, then
ω= ω;
σ σ̃
= fi1 ,...,iM (σ̃ (ϕ(u))) det σ̃(i 1 ,...,iM ) (ϕ(u)) det ϕ (u) du
1≤i1 <···<iM ≤N A
=± fi1 ,...,iM (σ̃ (v)) det σ̃(i 1 ,...,iM ) (v) dv
1≤i1 <···<iM ≤N B
=± ω,
σ̃
Remark In general, if σ and σ̃ are equivalent, we do not necessarily have the equality
| σ ω| = | σ̃ ω|. It is not guaranteed, indeed that they have the same or opposite orientations.
For example, if we consider the two surfaces σ, σ̃ : [1, 2] × [0, 2π] → R3 , defined by
3 3 v 3 3 v 3 v
σ (u, v) = + u− cos cos v, + u− cos sin v, u − sin ,
2 2 2 2 2 2 2 2
π
σ̃ (u, v) = σ u, v + ,
2
it is possible to see that they are both parametrizations of the same set (a Möbius strip), and
therefore they are equivalent (the reader is invited to explicitly find a diffeomorphism ϕ :
A → B with the properties required by the definition). On the other hand, if we consider the
2-differential form ω(x1 , x2 , x3 ) = dx12 , determined by the constant vector field (0, 0, 1),
computation yields
√
ω=0, ω = −3 2 .
σ σ̃
144 Chapter 3 • Differential Forms
3 Theorem 3.11
Let M = N; if σ is regular and injective on ˚
I with det σ > 0, and ω is of the type
then
ω= f.
σ σ (I )
Proof
Using the theorem of local inversion it is seen that σ induces a diffeomorphism between ˚
I and
σ (˚
I ). Being both the boundary of I and its image through σ negligible (see Lemma 2.36),
by the Theorem on the Change of Variables in the integral, we have
ω= f (σ (u)) det(σ (u)) du
σ I
= f (σ (u)) det(σ (u)) du
˚
I
= f = f.
σ (˚
I) σ (I )
Hence,
b
ω= [F1 (σ (t))σ1 (t) + F2 (σ (t))σ2 (t) + F3 (σ (t))σ3 (t)] dt
σ a
b
= F (σ (t)) , σ (t) dt .
a
3.7 · The Integral of a Differential Form
145 3
This quantity will be called line integral2 of the vector field F = (F1 , F2 , F3 ) along the
curve σ, and will be denoted by
F, d .
σ
Example Let us compute the line integral of the vector field F (x, y, z) = (−y, x, z2 ) along
the curve σ : [0, 2π] → R3, defined by σ (t) = (cos t, sin t, t) :
2π 8π 3
F, d = [(sin t)2 + (cos t)2 + t 2 ] dt = 2π + .
σ 0 3
ω(x) = F1 (x) dx2 ∧ dx3 + F2 (x) dx3 ∧ dx1 + F3 (x) dx1 ∧ dx2 .
Hence,
⎡ ⎛ ⎞
b2
b1
∂σ2
(u, v) ∂σ 2
(u, v)
⎢ ⎜ ∂u ∂v
⎟
ω= ⎣F1 (σ (u, v)) det ⎝ ⎠+
σ a2 a1 ∂σ3
∂u
(u, v) ∂σ
∂v
3
(u, v)
⎛ ⎞
∂σ3
(u, v) ∂σ 3
(u, v)
⎜ ∂u ∂v
⎟
+F2 (σ (u, v)) det ⎝ ⎠+
∂σ1
∂u
(u, v) ∂σ
∂v
1
(u, v)
⎛ ⎞⎤
∂σ1
(u, v) ∂σ 1
(u, v)
⎜ ∂u ∂v
⎟⎥
+F3 (σ (u, v)) det ⎝ ⎠⎦ du dv
∂σ2
∂u
(u, v) ∂σ
∂v
2
(u, v)
b2
b1 7 8
∂σ ∂σ
= F (σ (u, v)) , (u, v) × (u, v) du dv .
a2 a1 ∂u ∂v
This quantity is called the surface integral or flux3 of the vector field F = (F1 , F2 , F3 )
through the surface σ, and will be denoted by
F, d S .
σ
2 In mechanics this concept is used, for example, to define the work done by a field of forces on a particle
moving along a curve.
3 In fluidodynamics this concept is used, for instance, to define the amount of fluid crossing a given surface in
Example Let us compute the flux of the vector field F (x, y, z) = (−y, x, z2 ) through the
surface σ : [0, 1] × [0, 1] → R3 , defined by σ (u, v) = (u2 , v, u + v) :
1
1
3 F, d S = [(−v)(−1) + u2 (−2u) + (u + v)2 (2u)] du dv =
3
.
σ 0 0 2
where
2 3
F (x) = fi1 ,...,iM (x) 1≤i1 <···<iM ≤N ,
2 3
(u) = det σ(i 1 ,...,iM ) (u) 1≤i <···<i ≤N .
1 M
Definition 3.12
The function f : U → R is integrable on the M-surface σ : I → RN if
(f ◦ σ ) is integrable on I. In that case, we set
f = f (σ (u)) (u) du
σ I
2 12
= f (σ (u)) det σ(i 1 ,...,iM ) (u) du .
I 1≤i1 <···<iM ≤N
In this context, the integral does not differ for equivalent M-surfaces.
Theorem 3.13
If σ and σ̃ are two equivalent M-surfaces, then
f = f.
σ σ̃
3.8 · Scalar Functions and M-Superficial Measure
147 3
Proof
With the notations introduced previously, since σ = σ̃ ◦ ϕ, with ϕ : A → B, we have
(u) = det σ(i 1 ,...,iM ) (u)
1≤i1 <···<iM ≤N
= det σ̃(i 1 ,...,iM ) (ϕ(u))ϕ (u)
1≤i1 <···<iM ≤N
= det σ̃(i 1 ,...,iM ) (ϕ(u)) det ϕ (u)
1≤i1 <···<iM ≤N
= (ϕ(u)) det ϕ (u) .
Therefore, by the Theorem on the Change of Variables in the integral, being I \ A and J \ B
negligible, we have that
f = f (σ (u)) (u) du
σ A
=
f (σ̃ (ϕ(u))) (ϕ(u)) | det ϕ (u)| du
A
= (v) dv
f (σ̃ (v))
B
= f,
σ̃
Consider the interesting case when f is constantly equal to 1 : having in mind the
physical situation of a particle in motion along the curve described by σ , in this case the
line integral is called the length4 (or curvilinear measure) of the curve σ, and we write
b
ι1 (σ ) = σ (t) dt .
a
4 Thisdefinition could also be justified by geometrical considerations, which we omit here for the sake of
briefness.
148 Chapter 3 • Differential Forms
Example Let σ : [0, b] → R3 be defined by σ (t) = (t, t 2 , 0). Its support is an arc of
parabola, and its length is given by
3 ι1 (σ ) =
b
1 + (2t)2 dt
0
sinh−1 (2b) 1
= (cosh u)2 du
sinh−1 (0) 2
−1
1 u + sinh u cosh u sinh (2b)
=
2 2 0
1
= sinh−1 (2b) + 2b 1 + 4b2
4
1 b
= ln 2b + 1 + 4b2 + 1 + 4b2 .
4 2
Again it is interesting to consider the case when f is constantly equal to 1 : in this case
we call area (or surface measure) of the surface σ the following integral:
b2
b1 4 4
4 ∂σ 4
ι2 (σ ) = 4 (u, v) × ∂σ (u, v)4 du dv .
4 ∂u ∂v 4
a2 a1
In the case when, for example, the surface happens to be a 2-parametrization of a set
in R3 , this integral is the flux of a vector field which at every point coincides with the
normal versor to the surface itself.5
= 4πR 2 .
5 Alsothe definition of the area of a surface can be justified by geometrical considerations, even if the procedure
is much more delicate than in the case of a curve.
3.8 · Scalar Functions and M-Superficial Measure
149 3
In general, the case when f is constantly equal to 1 gives
1 = (u) du ,
σ I
Definition 3.14
We call M-superficial measure of a M-surface σ : I → RN the following integral:
2 12
ιM (σ ) = det σ(i 1 ,...,iM ) (u) du .
I 1≤i1 <···<iM ≤N
Corollary 3.15
Two equivalent M-surfaces always have the same M-superficial measure. In particu-
lar, this is true for any two M-parametrizations of a given set.
Notice that, even if they have the same support, these curves are not equivalent. Indeed, as is
easily seen, ι1 (σ ) = 2π while ι1 (σ̃ ) = 4π.
Definition 3.16
We call M-dimensional measure of a M-parametrizable set M ⊆ RN the
M-superficial measure of any of its M-parametrizations.
6 We emphasize here the fact that the area of a sphere of radius R, which we found to be equal to 4πR 2 by
taking a particular parametrization, will always be the same when computed with any parametrization. This
fact is not always proved in other textbooks.
150 Chapter 3 • Differential Forms
Exercises
1. Find the length of the helicoidal curve γ : [0, 2π] → R3 defined as
M = {(x, y, z) ∈ R3 : x 2 + 4y 2 + 9z2 = 1} ,
where I ⊆ R2 is some rectangle. Then, compute the integral σ f , where f : R3 → R
is the function defined by the formula
f (x, y, z) = xyz .
M = {(x, y, z) ∈ R3 : |z| ≤ 4x 2 + 9y 2 ≤ 1} ,
Definition 3.17
We call glueing◦ of the M-surfaces
◦
σ1 , . . . , σn any function σ : I → RN whose
restrictions to J1, . . . , Jn coincide with σ̃1 , . . . , σ̃n , respectively; it is almost
everywhere differentiable, and we can define σ ω by the same formula we have
used for the M-surfaces of class C 1 . Hence,
ω= ω + ··· + ω.
σ σ1 σn
where the meaning of the symbol 9 is to suppress the underlying variable. Consider
moreover some M-surfaces αk− , βk− : Ik → RM+1 , equivalent to αk+ , βk+ , respectively,
with opposite orientation.
7 We are thus considering here the situation when N = M + 1. This setting will be maintained also in the next
section.
152 Chapter 3 • Differential Forms
Definition 3.18
We call oriented boundary of the rectangle I a function ∂I which is a glueing of
the following M-surfaces:
3 (a) αk− and βk+ if k is odd;
(b) αk+ and βk− if k is even.
Let M = 1, and consider the rectangle [a1 , b1 ] × [a2 , b2 ]. Then, for example,
We can visualize geometrically ∂I as the glueing of the sides of the rectangle I oriented
in such a way that the perimeter be described in counter-clockwise direction.
β2 −
b2
β1 +
α1 −
a2
α2 +
a1 b1
In this case, we can visualize ∂I as the glueing of the six faces of the parallelepiped
I, each oriented in such a way that the normal versor be always directed towards the
exterior.
α1 −
β3 +
α2 + β2 −
β1 +
α3 −
In this section, I will be a rectangle in RN, with N ≥ 2. In the following theorem, the
elegant Gauss formula is obtained.8
Theorem 3.19
If ω is a (N − 1)-differential form of class C 1 defined on an open set containing the
rectangle I in RN, then
dω = ω.
I ∂I
Proof
We can write ω as
N
ω(x) = :j ∧ · · · ∧ dxN .
Fj (x) dx1 ∧ · · · ∧ dx
j =1
Then,
N
N
∂Fj
dω(x) = :j ∧ · · · ∧ dxN
(x) dxm ∧ dx1 ∧ · · · ∧ dx
3 ∂xm
j =1 m=1
N
∂Fj
= (−1)j −1 (x) dx1 ∧ · · · ∧ dxN .
∂xj
j =1
Being the partial derivatives of each Fj continuous, they are integrable on the rectangle I,
and we can use the Reduction Theorem:
N
∂Fj
dω = (−1)j −1 (x) dx1 . . . dxN
I j =1 I ∂xj
N bj ∂Fj
= (−1)j −1 (x1 , . . . , xN ) dxj dx1 . . . d:
xj . . . dxN
Ij aj ∂xj
j =1
N
= (−1)j −1 [Fj (x1 , . . . , xj −1 , bj , xj +1 , . . . , xN ) −
j =1 Ij
:j . . . dxN ,
−Fj (x1 , . . . , xj −1 , aj , xj +1 , . . . , xN )] dx1 . . . dx
N
ω= :j ∧ · · · ∧ dxN
Fj dx1 ∧ · · · ∧ dx
αk+ +
j =1 αk
N
= (Fj ◦ αk+ ) det(αk+ ) (1,...,j,...,N :j . . . dxN
dx1 . . . dx
ˆ )
j =1 Ik
= :k . . . dxN ,
Fk (x1 , . . . , xk−1 , ak , xk+1 , . . . , xN ) dx1 . . . dx
Ik
being
0 if j =
k,
det(αk+ ) ˆ =
(1,...,j,...,N ) 1 if j = k .
Similarly,
ω= :k . . . dxN ,
Fk (x1 , . . . , xk−1 , bk , xk+1 , . . . , xN ) dx1 . . . dx
β+ Ik
3.11 · Oriented Boundary of a M-Surface
155 3
so that
N
ω= (−1)k−1 ω− ω
∂I k=1 βk+ αk+
N
= (−1)k−1 [Fk (x1 , . . . , xk−1 , bk , xk+1 , . . . , xN ) −
k=1 Ik
:k . . . dxN ,
−Fk (x1 , . . . , xk−1 , ak , xk+1 , . . . , xN )] dx1 . . . dx
Definition 3.20
For 1 ≤ M ≤ N − 1, we call oriented boundary of σ a function ∂σ = σ ◦ ∂I,
which is a glueing of the following M-surfaces:
(a) σ ◦ αk− and σ ◦ βk+ if k is odd;
(b) σ ◦ αk+ and σ ◦ βk− if k is even.
Given a M-differential form ω whose domain contains the support of ∂σ, we will
then have
M+1
M+1
ω = (−1)k ω+ (−1)k−1 ω
∂σ k=1 σ ◦αk+ k=1 σ ◦βk+
M+1
= (−1) k−1
ω− ω .
k=1 σ ◦βk+ σ ◦αk+
Remark It is useful to extend the meaning of ∂σ ω to the case when σ : [a, b] → RN is a
curve, with N ≥ 1, and ω = f : U → R is a 0-differential form; in this case, we set
ω = f (σ (b)) − f (σ (a)) .
∂σ
156 Chapter 3 • Differential Forms
The first curve has as support a circle with radius r, which degenerates into the origin in the
case when r = 0. The second has as support a circle with radius R. Notice however that
these two circles are described by the two curves in opposite directions. The last two curves
are equivalent with opposite orientations.
Consider, for example, the vector field F (x, y, z) = (−y, x, xyez ). Then,
F, d = F, d + F, d
∂σ σ ◦α1− σ ◦β1+
2π
2π
= [−r 2 sin2 v − r 2 cos2 v] dv + [R 2 sin2 v + R 2 cos2 v] dv
0 0
= 2π(R 2 − r 2 ) .
3.11 · Oriented Boundary of a M-Surface
157 3
2. Consider the surface σ : [r, R] × [0, 2π] → R3, with 0 < r < R, defined by
v
r+R r +R
σ (u, v) = + u− cos cos v,
2 2 2
v
r +R r +R
+ u− cos sin v,
2 2 2
r +R v
u− sin ,
2 2
whose support is a Möbius strip. In this case, the oriented boundary is given by a glueing of
v
r +R R−r
σ ◦ α1− (v) = + cos cos v,
2 2 2
v
r +R R−r
− + cos sin v,
2 2 2
R−r v
− sin ,
2 2
v
r +R R−r
σ ◦ β1+ (v) = + cos cos v,
2 2 2
v
r +R R−r
+ cos sin v,
2 2 2
R−r v
sin ,
2 2
σ ◦ α2+ (u) = (u, 0, 0) ,
σ ◦ β2− (u) = (u, 0, 0) .
Notice that in this case the two last curves are the same.
158 Chapter 3 • Differential Forms
Notice that the first two curves are degenerated in one point, while the two others are
equivalent with opposite orientations. Hence, for any choice of a vector field F, it will be
∂σ F, d = 0.
whose support is the closed ball, centered at the origin, with radius R > 0. The oriented
boundary ∂σ is a glueing of the following six surfaces:
Notice that the first surface is degenerated in a point (the origin), the second has as support
the entire sphere, the third and the fourth are degenerated in two lines, while the remaining
two are equivalent with opposite orientations. Hence, given a vector field F, we will have
F, d S = F, d S .
∂σ σ ◦β1+
Let us state the following generalization of the Gauss theorem, where the important
Stokes–Cartan formula is obtained.
3.12 · The Stokes–Cartan Formula
159 3
Theorem 3.21
Let 0 ≤ M ≤ N − 1. If ω : U → M (RN ) is a M-differential form of class C 1 and
σ : I → RN is a (M + 1)-surface whose support is contained in U, then
dω = ω.
σ ∂σ
Theorem 3.22
Let f : U → R be a scalar function of class C 1 and σ : [a, b] → R3 a curve whose
support is contained in U. Then,
grad f, d = f (σ (b)) − f (σ (a)) .
σ
Proof
Consider the function G : [a, b] → R defined by G(t) = f (σ (t)). It is of class C 1 , and by
the Fundamental Theorem we have
b
G (t) dt = G(b) − G(a) .
a
Remark The line integral of the gradient of a function f does not depend on the chosen
curve itself, but only on the values of the function at the two extrema σ (b) and σ (a).
Example Given
x y z
F (x, y, z) = − , ,
[x 2 + y 2 + z2 ]3/2 [x 2 + y 2 + z2 ]3/2 [x 2 + y 2 + z2 ]3/2
160 Chapter 3 • Differential Forms
and the curve σ : [0, 4π] → R3 defined by σ (t) = (cos t, sin t, t), we want to compute the
line integral σ F, d. Observe that F = grad f, with
3 f (x, y, z) =
1
.
x 2 + y 2 + z2
Hence,
1
F, d = f (σ (4π)) − f (σ (0)) = √ − 1.
σ 1 + 16π 2
Theorem 3.23
Let F : U → R3 be a C 1 -vector field and σ : [a1 , b1 ] × [a2 , b2 ] → R3 be a surface
whose support is contained in U. Then,
curl F, d S = F, d .
σ ∂σ
Verbally The flux of the curl of the vector field F through the surface σ is equal to the
line integral of F along the oriented boundary of σ.
Proof
Let I = [a1 , b1 ] × [a2 , b2 ], and define the following 1-differential form
ω : I → 1 (R2 ) :
7 8 7 8
∂σ ∂σ
ω(u, v) = F (σ (u, v)) , (u, v) du + F (σ (u, v)) , (u, v) dv .
∂u ∂v
The integrals on β1+ , α2+ and β2− are then treated analogously, so that
ω=
F, d .
∂I ∂σ
3.12 · The Stokes–Cartan Formula
161 3
so that
ω=
d curl F, d S .
I σ
The Gauss formula applied to ω thus yields the conclusion in this case.
The assumption that σ be of class C 2 can eventually be eliminated by an approximation
procedure: it is possible to construct a sequence (σn )n of surfaces of class C 2 which converge
to σ together with all their partial derivatives of the first order. The Stokes–Ampère formula
holds then for those surfaces, and passing to the limit, by the Dominated Convergence
Theorem, we have the conclusion.
Example Let F (x, y, z) = (−y, x, 0) and γ : [0, 2π] → R3 be the curve defined by γ (t) =
(R cos t, R sin t, 0); we want to compute the line integral γ F, d. We have already seen
how to compute this integral by the direct use of the definition. We now proceed in a different
way: consider the surface σ : [0, R]×[0, 2π] → R3 given by σ (ρ, θ) = (ρ cos θ, ρ sin θ, 0).
Observe that γ = σ ◦ β1+ , so
F, d = F, d = F, d = curl F, d S .
γ σ ◦β1+ ∂σ σ
∂σ ∂σ
(ρ, θ) × (ρ, θ) = (0, 0, ρ) ,
∂ρ ∂θ
we then have
R
2π
F, d = (0, 0, 2) , (0, 0, ρ) dθ dρ = 2πR 2 .
γ 0 0
ω(x) = F1 (x) dx2 ∧ dx3 + F2 (x) dx3 ∧ dx1 + F3 (x) dx1 ∧ dx2 ,
Theorem 3.24
Let F : U → R3 be a C 1 -vector field and σ : I = [a1 , b1 ] × [a2 , b2 ] × [a3 , b3 ] → R3
3 be a volume whose support is contained in U . Then,
div F dx1 ∧ dx2 ∧ dx3 = F, d S .
σ ∂σ
In Intuitive Terms The integral of the divergence of the vector field F on the set
V = σ (I ) is equal to the flux of F which exits from V .
Proof
Let
ω : I → 2 (R3 ) be the 2-differential form defined by
7 8
∂σ ∂σ
ω(u) = F (σ (u)) , (u) × (u) du2 ∧ du3 +
∂u2 ∂u3
7 8
∂σ ∂σ
+ F (σ (u)) , (u) × (u) du3 ∧ du1 +
∂u3 ∂u1
7 8
∂σ ∂σ
+ F (σ (u)) , (u) × (u) du1 ∧ du2 .
∂u1 ∂u2
With the analogous computations on the remaining five surfaces which determine ∂I we can
say that
ω=
F, d S .
∂I ∂σ
3.12 · The Stokes–Cartan Formula
163 3
Hence,
ω=
d div F (σ (u)) det σ (u) du1 ∧ du2 ∧ du3 = div F dx1 ∧ dx2 ∧ dx3 .
I I σ
The Gauss formula applied to ω thus yields the first conclusion. On the other hand, if σ is
regular and injective on ˚
I , with det σ > 0, Theorem 3.11 yields the second formula, and the
proof is completed. The assumption that σ be of class C 2 can eventually be eliminated, as in
the previous theorem.
Recall that η = σ ◦ β1+ , where σ : I = [0, R] × [0, π] × [0, 2π] → R3 is the volume
given by
Hence,
F, d S = F, d S = div F .
η ∂σ σ (I )
We have the following result, analogous to the one obtained in the previous section for
the case N = 3.
Theorem 3.25
Let f : U → R be a scalar function of class C 1 and σ : [a, b] → R2 be a curve with
support contained in U. Then,
7 8
∂f ∂f
, , d = f (σ (b)) − f (σ (a)) .
σ ∂x1 ∂x2
Theorem 3.26
Let F = (F1 , F2 ) : U → R2 be a C 1 -vector field and σ : I = [a1 , b1 ]×[a2 , b2 ] → R2
be a surface whose support is contained in U . Then,
∂F2 ∂F1
− dx1 ∧ dx2 = F, d .
σ ∂x1 ∂x2 ∂σ
Proof
As for the Stokes–Ampère theorem, we consider the auxiliary differential form
ω : I →
1 (R2 ), defined by
7 8 7 8
∂σ ∂σ
ω(u, v) = F (σ (u, v)) , (u, v) du + F (σ (u, v)) , (u, v) dv ,
∂u ∂v
3.13 · Analogous Results in R2
165 3
and see that
ω=
F, d .
∂I ∂σ
If σ is of class C 2 , then
ω is of class C 1 and computation shows that
7 8 7 8
∂ ∂σ ∂ ∂σ
ω(u, v) =
d F (σ (u, v)) , (u, v) − F (σ (u, v)) , (u, v) du ∧ dv
∂u ∂v ∂v ∂u
∂F2 ∂F1
= (σ (u, v)) − (σ (u, v)) det σ (u, v) du ∧ dv .
∂x1 ∂x2
Hence,
∂F2 ∂F1
ω=
d (σ (u, v)) − (σ (u, v)) det σ (u, v) du ∧ dv
I I ∂x1 ∂x2
∂F2 ∂F1
= − dx1 ∧ dx2 .
σ ∂x1 ∂x2
The Gauss formula applied to ω gives us the first conclusion. If σ is regular and injective on ˚
I,
with det σ > 0, then Theorem 3.11 yields the conclusion. As mentioned in the case N = 3,
the assumption that σ is of class C 2 can be eliminated by an approximation procedure.
∂F2 ∂F1
(x, y) − (x, y) = 2 ,
∂x ∂y
Exercises
√
1. Let γ : [0, 4] → R2 be the curve defined as γ (t) = (t, t ). Compute the integral
3 (xy 2 , yx 2 ), d ,
γ
σ (u, v) = (u2 , v 2 ) .
Compute ∂σ F, d with F : R2 → R2 given by
F (x, y) = (x 2 − y 2 , 2xy) .
F (x, y, z) = (x − y, y − z, z − x) .
σ (u, v, w) = (u2 + v 2 , u2 + v 2 , w) ,
F (x, y, z) = (x 2 , y 3 , z4 ) .
Compute σ divF dx ∧ dy ∧ dz, both directly and by the use of the Gauss–Ostrogradski
formula.
5. Let σ : [0, 1] × [0, π] × [0, 2π] → R3 be the parametrization of the unit ball in spherical
coordinates, given by
F (x, y, z) = (x − y + z , y − z + x , z − x + y) .
3.14 · Exact Differential Forms
167 3
We are now interested in the following problem. Given a differential form ω, when is
it possible to write it as the external differential of another differential form
ω, to be
determined? In the following, we assume M ≥ 1.
Definition 3.27
A M-differential form ω is said to be closed if dω = 0; it is said to be exact if there
is a (M − 1)-differential form ω = ω.
ω such that d
−y x
ω(x, y) = dx + 2 dy
x2 +y 2 x + y2
−y x
F1 (x, y) = , F2 (x, y) = ,
x2 + y2 x2 + y2
∂F2 ∂F1
(x, y) = (x, y) .
∂x ∂y
Let us compute the line integral of its vector field F = (F1 , F2 ) on the curve σ : [0, 2π] →
R2 , defined by σ (t) = (cos t, sin t) :
2π
F, d = F (σ (t) , σ (t) dt
σ 0
2π
= (− sin t, cos t) , (− sin t, cos t) dt
0
= 2π .
The Poincaré theorem says that the situation of the preceding example can never
happen if, for example, the open set U on which the differential form is defined is star-
shaped.
3
Definition 3.28
A set U is star-shaped with respect to a point x̄ if, with each of its points x, the set
U contains the whole segment joining x to x̄, i.e.,
For example, every convex set is star-shaped (with respect to any of its points). In
particular, a ball, or a rectangle, or even the whole space RN are star-shaped. Clearly,
the set R2 \ {(0, 0)} considered above is not star-shaped.
Theorem 3.29
Let U be an open subset of RN , star-shaped with respect to a point x̄. For 1 ≤ M ≤ N,
a M-differential form ω : U → M (RN ) of class C 1 is exact if and only if it is closed.
In that case, if ω is of the type
ω(x) = fi1 ,...,iM (x) dxi1 ∧ · · · ∧ dxiM ,
1≤i1 <···<iM ≤N
M
ω(x) = (−1)s+1 (xis − x̄is ) ·
1≤i1 <···<iM ≤N s=1
1
· :is ∧ · · · ∧ dxiM .
t M−1 fi1 ,...,iM (x̄ + t (x − x̄)) dt dxi1 ∧ · · · ∧ dx
0
The proof of this theorem will be given in Appendix B. As for the Stokes–Cartan
theorem, we consider here some corollaries which hold true for the case N = 3, giving
for each of them a direct proof. In order to simplify the notations, we will assume
without loss of generality that x̄ = (0, 0, 0).
The Case M = 1 A C 1 -vector field F = (F1 , F2 , F3 ), defined on an open subset U of
R3, determines a 1-differential form
This is closed if and only if curl F = 0. In this case, the vector field is said to be
irrotational. On the other hand, the vector field is said to be F conservative if there is
3.14 · Exact Differential Forms
169 3
a function f : U → R such that F = grad f. In that case, f is a scalar potential of the
vector field F .9
Theorem 3.30
If U ⊆ R3 is star-shaped with respect to the origin, then the vector field F : U → R3
is conservative if and only if it is irrotational, and in that case a function f : U → R
such that F = grad f is given by
1
f (x) = F (tx), x dt .
0
Proof
Set
ω = f : U → R. Let us verify that d
ω = ω. Using the fact that curl F = 0 and the
Leibniz rule, we have
1
∂
ω ∂
(x) = F (tx), x dt
∂xj 0 ∂xj
1 3
∂Fi
= (tx)txi + Fj (tx) dt
0 ∂xj
i=1
3
1 ∂Fj
= (tx)txi + Fj (tx) dt .
0 ∂xi
i=1
3
∂Fj
φ (t) = (tx)txi + Fj (tx) ,
∂xi
i=1
∂
ω t=1
(x) = tFj (tx) = Fj (x) .
∂xj t=0
This proves that F = grad f. The second part of the theorem follows directly from the fact
that, if grad f = grad f˜, then f − f˜ has to be constant on U.
9 Beware that in Mechanics it is often the function −f which is called “the potential”.
170 Chapter 3 • Differential Forms
Example Consider the vector field F (x, y, z) = (2xz + y, x, x 2 ) which, as easily verified,
is irrotational. A scalar potential is then given by
3 f (x, y, z) =
1
((2t 2 x 2 z + txy) + txy + t 2 x 2 z) dt = xy + x 2 z .
0
ω(x) = F1 (x) dx2 ∧ dx3 + F2 (x) dx3 ∧ dx1 + F3 (x) dx1 ∧ dx2 .
This is closed if and only if div F = 0. In this case, the vector field is said to be
solenoidal. One says that F has a vector potential if there is a vector field V =
(V1 , V2 , V3 ) such that F = curl V .
Theorem 3.31
If U ⊆ R3 is star-shaped with respect to the origin, then the vector field F : U → R3
has a vector potential if and only if it is solenoidal, and in that case a vector field
V : U → R3 for which F = curl V is given by
1
V (x) = t (F2 (tx)x3 − F3 (tx)x2 ) dt ,
0
1
t (F3 (tx)x1 − F1 (tx)x3 ) dt ,
0
1
t (F1 (tx)x2 − F2 (tx)x1 ) dt ,
0
Proof
Consider the 1-differential form determined by the vector field V ,
1
ω(x) = t (F2 (tx)x3 − F3 (tx)x2 ) dt dx1 +
0
1
+ t (F3 (tx)x1 − F1 (tx)x3 ) dt dx2 +
0
1
+ t (F1 (tx)x2 − F2 (tx)x1 ) dt dx3 .
0
3.14 · Exact Differential Forms
171 3
ω = ω. By the Leibniz rule, taking into account the fact that ω is
We have to prove that d
closed, we find
1
1
∂ ∂
t (F1 (tx)x2 − F2 (tx)x1 ) dt − t (F3 (tx)x1 − F1 (tx)x3 ) dt =
∂x2 0 ∂x3 0
1
∂F1 ∂F1 ∂F1
= t2 (tx)x1 + (tx)x2 + (tx)x3 + 2tF1 (tx) dt
0 ∂x1 ∂x2 ∂x3
= F1 (x) ,
by applying the Fundamental Theorem to the function φ(t) = t 2 F1 (tx). Analogously one
obtains the remaining two identities, thus proving the formula. The second part of the
, then, by the previous theorem, V − V
theorem follows from the fact that, if curl V = curl V
is a conservative vector field.
Example Consider the solenoidal vector field F (x, y, z) = (y, z, x). A vector potential is
then given by
1 1 2
V (x, y, z) = t (ty, tz, tx) × (x, y, z) dt = (z − xy, x 2 − yz, y 2 − xz) .
0 3
Theorem 3.32
If U ⊆ R3 is star-shaped with respect to the origin, the function f : U → R is always
of the type f = div W, where W : U → R3 is the vector field defined by
1
W (x) = t 2 f (tx) dt x .
0
Proof
Using the Leibniz rule, we have
3 ∂ 1
t 2 f (tx)x1 dt +
∂ 1
t 2 f (tx)x2 dt +
∂ 1
t 2 f (tx)x3 dt
∂x1 0 ∂x2 0 ∂x3 0
1
∂f ∂f ∂f
= t 3
(tx) + (tx) + (tx) + 3t f (tx) dt
2
0 ∂x1 ∂x2 ∂x3
= f (x) ,
by applying the Fundamental Theorem to the function φ(t) = t 3 f (tx). The second part of
the theorem follows from the fact that, if div W = div W̃ , then, by the preceding theorem,
W − W̃ has a vector potential.
Example Consider the scalar function f (x, y, z) = xyz. Then, a vector field whose
divergence is f is given by
1
1 2
W (x, y, z) = t 5 xyz dt (x, y, z) = (x yz, xy 2 z, xyz2 ) .
0 6
Exercises
1. Let F : R3 → R3 be the vector field defined by the formula
and let ω be the associated 1-differential form. Prove that ω is exact, and find a 0-
differential form ω = ω. Compute then the integral
ω such that d
ω,
γ
γ (t) = (t, t 2 , t 3 ) .
2xy 2 2x 2 y −2x 2 y 2
F (x, y, z) = , , .
z2 z2 z3
defined as
1 1 1
ω(x, y, z) = dx + 2 dy + dz,
x 2 yz xy z xyz2
Appendix A
Differential Calculus in RN
Alessandro Fonda
Definition A.1
We say that f is differentiable at x 0 if there exists a linear function : RN → RM
for which one can write
r(x)
lim = 0.
x→x 0 x − x 0
df (x 0 ) .
Following the tradition for linear functions, taking h ∈ RN we will often write
df (x 0 )h instead of df (x 0 )(h).
We will now review the main results needed in this book concerning the differential
calculus.
176 Appendix A • Differential Calculus in RN
Assume first, for simplicity, that M = 1. We start by fixing a direction i.e., a vector
v ∈ RN with v = 1, also called a versor. Whenever it exists, we call directional
derivative of f at x 0 in the direction v the limit
f (x 0 + tv) − f (x 0 )
lim ,
t→0 t
∂f
(x 0 ) .
∂v
∂f
(x 0 ) .
∂xk
∂f f (x 0 + tek ) − f (x 0 )
(x 0 ) = lim
∂xk t→0 t
f (x10 , x20 , . . . , xk0 + t, . . . , xN0 ) − f (x10, x20 , . . . , xk0 , . . . , xN0 )
= lim ,
t→0 t
so that it is frequent to call it “partial derivative with respect to the k-th variable”.
Theorem A.2
If f is differentiable at x 0 , then f is continuous at x 0 . Moreover, all the directional
derivatives of f at x 0 exist: for every direction v ∈ RN one has
∂f
(x 0 ) = df (x 0 )v .
∂v
Proof
We know that the function = df (x 0 ), being linear, is continuous, and (0) = 0. Then,
∂f
(x 0 ) = df (x 0 )ek .
∂xk
i.e.,
N
∂f
df (x 0 )h = (x 0 )hk .
k=1
∂x k
Theorem A.3
If f has partial derivatives defined in a neighborhood of x 0 , and they are continuous
at x 0 , then f is differentiable at x 0 .
178 Appendix A • Differential Calculus in RN
Proof
In order to simplify the notations, we will assume that N = 2. We define the function :
R2 → R associating to every vector h = (h1 , h2 ) the real number
∂f ∂f
(h) = (x 0 )h1 + (x 0 )h2 .
∂x1 ∂x2
We will prove that is indeed the differential of f at x 0 . First of all, it is readily verified that
it is linear. Moreover, writing x 0 = (x10 , x20 ) and x = (x1 , x2 ), by the Lagrange Mean Value
Theorem one has
Letting x tend to x 0 , we have that (ξ1 , x2 ) → (x10 , x20 ) and (x10 , ξ2 ) → (x10 , x20 ) so that, being
∂f ∂f
∂x1 and ∂x2 continuous at x 0 = (x1 , x2 ), it has to be
0 0
|r(x)|
lim = 0,
x→x 0 x − x 0
∂ 2f ∂ ∂f ∂2f ∂ ∂f
(x 0 ) = (x 0 ) , (x 0 ) = (x 0 ) ,
∂x12 ∂x1 ∂x1 ∂x2 ∂x1 ∂x2 ∂x1
∂ 2f ∂ ∂f ∂2f ∂ ∂f
(x 0 ) = (x 0 ) , 2
(x 0 ) = (x 0 ) .
∂x1 ∂x2 ∂x1 ∂x2 ∂x2 ∂x2 ∂x2
Theorem A.4
∂ 2f ∂2f
If the second order partial derivatives ∂x2 ∂x1 , ∂x1 ∂x2 exist in a neighborhood of x 0 and
they are continuous at x 0 , then
∂2f ∂2f
(x 0 ) = (x 0 ) .
∂x2 ∂x1 ∂x1 ∂x2
Proof
Let ρ > 0 be such that B(x 0 , ρ) ⊆ .1 We write x 0 = (x10 , x20 ) and we take an x =
(x1 , x2 ) ∈ B(x 0 , ρ) such that x1
= x10 and x2
= x20 . It is then possible to define
∂f ∂f
∂x1 (ξ1 , x2 ) − ∂x1 (ξ1 , x2 )
0
g(x1 , x2 ) − g(x10 , x2 ) ∂g
= (ξ1 , x2 ) = ,
x1 − x10 ∂x1 x2 − x20
∂f ∂f
∂x2 (x1 , ξ2 ) − ∂x2 (x1 , ξ2 )
0
h(x1 , x2 ) − h(x1 , x20 ) ∂h
= (x1 , ξ2 ) = .
x2 − x20 ∂x2 x1 − x10
1 Letus recall the notation B(x 0 , ρ) for the open ball centered at x 0 with radius ρ > 0, and B(x 0 , ρ) for the
closed ball.
180 Appendix A • Differential Calculus in RN
Again by the Lagrange Mean Value Theorem, there is a η2 ∈ ]x20 , x2 [ such that
∂f ∂f
∂x1 (ξ1 , x2 ) − ∂x1 (ξ1 , x2 )
0
∂2f
= (ξ1 , η2 ) ,
x2 − x20 ∂x2 ∂x1
∂f ∂f
∂x2 (x1 , ξ2 ) − ∂x2 (x1 , ξ2 )
0
∂2f
= (η1 , ξ2 ) .
x1 − x10 ∂x1 ∂x2
Hence,
∂2f ∂2f
(ξ1 , η2 ) = (η1 , ξ2 ) .
∂x2 ∂x1 ∂x1 ∂x2
Taking the limit, as x = (x1 , x2 ) tends to x 0 = (x10 , x20 ), we have that both (ξ1 , η2 ) and
(η1 , ξ2 ) converge to x 0 , and the continuity of the second order partial derivatives leads to the
conclusion.
Theorem A.5
The function f is differentiable at x 0 if and only if such are all its components. In this
case, for any vector h ∈ RN it is
Proof
Considering the components in the equation
we can write
fj (x) = fj (x 0 ) + j (x − x 0 ) + rj (x) ,
r(x) rj (x)
lim =0 ⇐⇒ lim =0 for every j = 1, 2, . . . , M ,
x→x 0 x − x 0 x→x 0 x − x 0
where e1 , e2 , . . . eN are the vectors of the canonical basis of RN . This matrix is called
Jacobian matrix associated to the function f at x 0 , and is denoted by Jf (x 0 ). Recalling
that
∂fj
(x 0 ) = dfj (x 0 )ek ,
∂xk
⎜ ⎟
⎜ ⎟
Jf (x 0 ) = ⎜
∂f2 ∂f2
(x 0 ) ∂x (x 0 )··· ∂f2
(x 0 ) ⎟ .
⎜ ∂x1 2 ∂xN ⎟
⎜ .. .. .. ⎟
⎝ . . . ⎠
0) · · ·
∂fM ∂fM ∂fM
∂x1
(x 0) ∂x2
(x ∂xN
(x 0 )
182 Appendix A • Differential Calculus in RN
Theorem A.6
If f, g : → Y are differentiable at x0 and α, β are two real numbers, then
Proof
Writing
f (x) = f (x0 ) + df (x0 )(x − x0 ) + r1 (x) , g(x) = g(x0 ) + dg(x0 )(x − x0 ) + r2 (x) ,
we have that
Theorem A.7
If f : → RM is differentiable at x 0 , while ⊆ RM is an open set containing f ()
and g : → RL is differentiable at f (x 0 ), then g ◦ f is differentiable at x 0 , and
Proof
Setting y 0 = f (x 0 ), we have
with
r1 (x) r2 (y)
lim = 0, lim = 0.
x→x 0 x − x 0 y→y 0 y − y 0
where
Hence,
4 4
r3 (x) 4 r1 (x) 4
≤4
4 dg(f (x )) 4+
x − x 0 4
0
x − x 0
4 4
4 x − x0 4
+ 4 df (x ) 4 + r1 (x) R2 (f (x)) .
4 0
x − x 0 4 x − x 0
184 Appendix A • Differential Calculus in RN
r3 (x)
lim = 0.
x→x 0 x − x 0
It is well-known that the matrix associated to the composite of two linear functions
is the product of the two respective matrices. From the above theorem we then have the
following formula for the Jacobian matrices:
J (g ◦ f )(x 0 ) = J g(f (x 0 )) · Jf (x 0 ) ,
We can thus derive the formula for the partial derivatives, usually called chain rule:
∂(g ◦ f )i
(x 0 ) =
∂xk
∂gi ∂f1 ∂gi ∂f2 ∂gi ∂fM
= (f (x 0 )) (x 0 ) + (f (x 0 )) (x 0 ) + · · · + (f (x 0 )) (x 0 )
∂y1 ∂xk ∂y2 ∂xk ∂yM ∂xk
M
∂gi ∂fj
= (f (x 0 )) (x 0 ) ,
∂yj ∂xk
j =1
where i = 1, 2, . . . , L and k = 1, 2, . . . , N.
Theorem A.8
Let ⊆ RM × RN be open, g : → RN a C 1 -function, and (x 0 , y 0 ) a point in for
which
∂g
g(x 0 , y 0 ) = 0 , and det (x 0 , y 0 )
= 0 .
∂y
g(x, y) = 0 ⇐⇒ y = η(x) .
Moreover, the function η is of class C 1 , and the following formula holds true:
−1
∂g ∂g
J η(x) = − (x, η(x)) (x, η(x)) .
∂y ∂x
Proof
It will be carried out by induction. We first prove the case N = 1.2
Assume for instance that ∂g∂y (x 0 , y0 ) > 0. By the continuity of
∂g
∂y , there is a δ > 0 such
∂g
that, if x − x 0 ≤ δ and |y − y0 | ≤ δ, then ∂y (x, y) > 0. Hence, for every x ∈ B(x 0 , δ),
the function g(x, ·) is strictly increasing on [y0 − δ, y0 + δ]. Being g(x 0 , y0 ) = 0, we have
that
2 The proof reported here is most probably due to Giuseppe Peano, and can be found in the Italian book “Calcolo
differenziale e principii di calcolo integrale”, published in 1884. This important work was written by Peano
himself, who at that time was only 25 years old, but the official author is Angelo Genocchi, the professor who
was in care of Peano as an assistant at the University of Torino. Genocchi got indeed very angry when he was
told that the book had been published, and publicly declared that he was not aware of what had been written in
the volume, recalling however the fact that Peano had followed his lessons in infinitesimal analysis.
186 Appendix A • Differential Calculus in RN
g(x, y) = 0 ⇐⇒ y = η(x) .
In order to verify the continuity of η, let us fix a x̄ ∈ U and prove that η is continuous at x̄.
Taken x ∈ U and considered the function γ : [0, 1] → U × V , defined as
the Lagrange Mean Value Theorem applied to g ◦ γ tells us that there is a ξ ∈ ]0, 1[ for which
∂g ∂g
g(x, η(x)) − g(x̄, η(x̄)) = (γ (ξ ))(x − x̄) + (γ (ξ ))(η(x) − η(x̄)) .
∂x ∂y
∂g
Since the partial derivatives of g are continuous and ∂y in not zero on the compact set U ×V ,
we have that there is a constant c > 0 for which
4 4
1 4 ∂g 4
4 (γ (ξ ))(x − x̄)4 ≤ cx − x̄ .
∂g 4 ∂x 4
| ∂y (γ (ξ ))|
∂g
η(x̄1 + h, x̄2 , . . . , x̄M ) − η(x̄1 , x̄2 , . . . , x̄M ) ∂x (γ (ξh ))
= − ∂g1 ,
h (γ (ξh ))
∂y
with γ (ξh ) belonging to the segment joining (x̄, η(x̄)) to (x, η(x)). If h tends to 0, we have
that γ (ξh ) tends to (x̄, η(x̄)), and hence
∂g
∂η η(x̄1 + h, x̄2 , . . . , x̄M ) − η(x̄1 , x̄2 , . . . , x̄M ) ∂x (x̄, η(x̄))
(x̄) = lim = − ∂g1 .
∂x1 h→0 h (x̄, η(x̄))
∂y
Appendix A • Differential Calculus in RN
187 A
The partial derivatives with respect to x2 , . . . , xM are computed similarly, thus yielding that
η is of class C 1 , and
1 ∂g
J η(x) = − ∂g (x, η(x)) .
∂x
∂y (x, η(x))
We now assume that the statement holds till N − 1, for some N ≥ 2 (and any M ≥ 1),
and prove that it then also holds for N. We will use the notation
y 1 = (y1 , . . . , yN −1 ) ,
at least one of the elements in the last column is different from zero. We can assume without
loss of generality, possibly changing the rows, that ∂g
∂yN (x 0 , y 0 )
= 0. Writing y 0 = (y 1 , yN ),
N 0 0
with y 01 = (y10 , . . . , yN
0
−1 ), we then have
∂gN
gN (x 0 , y 01 , yN
0
) = 0, and (x 0 , y 01 , yN
0
)
= 0 .
∂yN
Then, by the already proved one-dimensional case, there are an open neighborhood U1 of
(x 0 , y 01 ), an open neighborhood VN of yN
0 , and a C 1 -function η : U → V such that
1 1 N
U1 × VN ⊆ , with the following properties: if (x, y 1 ) ∈ U1 and yN ∈ VN ,
gN (x, y 1 , yN ) = 0 ⇐⇒ yN = η1 (x, y 1 ) ,
and
1 ∂gN
J η1 (x, y 1 ) = − ∂g (x, y 1 , η1 (x, y 1 )) .
N
∂yN (x, y 1 , η1 (x, y 1 ))) ∂(x, y 1 )
so that
and
∂φ ∂g(1,...,N −1) ∂g(1,...,N −1) ∂η1
(x 0 , y 01 ) = (x 0 , y 0 ) + (x 0 , y 0 ) (x 0 , y 01 ) . (*)
∂y 1 ∂y 1 ∂yN ∂y 1
∂g ∂g ∂η1
= det (x 0 , y 0 ) + (x 0 , y 0 ) (x 0 , y 01 ) 0 .
∂y ∂yN ∂y 1
Appendix A • Differential Calculus in RN
189 A
We now recall that, adding a scalar multiple of one column to another column of a matrix
does not change the value of its determinant. Hence, being
∂g ∂η1
(x 0 , y 0 ) (x 0 , y 01 ) 0 =
∂yN ∂y 1
∂g
each of its columns is a scalar multiple of ∂yN (x 0 , y 0 ), hence
∂g ∂g ∂η1 ∂g
det (x 0 , y 0 ) + (x 0 , y 0 ) (x 0 , y 01 ) 0 = det (x 0 , y 0 ) .
∂y ∂yN ∂y 1 ∂y
∂φ
φ(x 0 , y 01 ) = 0 , and det (x 0 , y 01 )
= 0 .
∂y 1
φ(x, y 1 ) = 0 ⇐⇒ y 1 = η2 (x) .
This function is of class C 1 , since both η1 and η2 are such. Since g(x, η(x)) = 0 for every
x ∈ U, one easily deduces that
∂g ∂g
(x, η(x)) + (x, η(x))J η(x) = 0 ,
∂x ∂y
Clearly, the following analogous statement holds true, where the roles of x and y
are interchanged.
Theorem A.9
Let ⊆ RM × RN be open, g : → RN a C 1 -function, and (x 0 , y 0 ) a point in for
which
∂g
g(x 0 , y 0 ) = 0 , and det (x 0 , y 0 )
= 0 .
∂x
g(x, y) = 0 ⇐⇒ x = η(y) .
Moreover, the function η is of class C 1 , and the following formula holds true:
−1
∂g ∂g
J η(y) = − (y, η(y)) (y, η(y)) .
∂x ∂y
Definition A.10
Given A and B, two open subsets of RN , a function ϕ : A → B is said to be a
diffeomorphism if it is of class C 1 , it is a bijection, and its inverse ϕ −1 : B → A is
also of class C 1 .
Appendix A • Differential Calculus in RN
191 A
Let us state the important Local Diffeomorphism Theorem.
Theorem A.11
Let A and B be open subsets of RN , and ϕ : A → B be a C 1 -function. If, for some
x 0 ∈ A, one has that det J ϕ(x 0 )
= 0, then there exist an open neighborhood U of x 0
contained in A, and an open neighborhood V of ϕ(x 0 ) contained in B, such that the
restricted function ϕ|U : U → V is a diffeomorfism.
Proof
We consider the function g : A × B → RN defined as
g(x, y) = y − ϕ(x) .
∂g
g(x 0 , y 0 ) = 0 , and det (x 0 , y 0 ) = det J ϕ(x 0 )
= 0 .
∂x
−1
Hence, η = ϕ|U , and the proof is thus completed.
193 B
Appendix B
Stokes–Cartan and Poincaré Theorems
Alessandro Fonda
In this appendix we will give a complete proof of the Stokes–Cartan and Poincaré
theorems, of which only particular cases have been proved in Chap. 3.
Let U be an open set in RN, V an open set in1 RP and φ : V → U a function of
class C 1 :
Notice that
P
∂φi1 ∂φiM
= (y) · · · (y) dyj1 ∧ · · · ∧ dyjM
j1 ,...,jM =1
∂yj1 ∂yjM
1 Whenever the sets would not be open, see the footnote in Sect. 3.6.
2 This is usually called pull-back and denoted by φ ∗ ω.
194 Appendix B • Stokes–Cartan and Poincaré Theorems
(being aware that here the indices j1 , . . . , jM are not in an increasing order). It is readily
verified that, taken a c ∈ R, it is
Tφ (cω) = c Tφ ω ;
if
ω is a M̃-differential form defined on U, then
Tφ (ω ∧
ω ) = Tφ ω ∧ Tφ
ω,
Tφ (ω +
ω ) = Tφ ω + Tφ
ω.
Proposition B.1
If ψ : W → V and φ : V → U, then
Tψ (Tφ ω) = Tφ◦ψ ω .
Proof
By the linearity properties seen above, it will be sufficient to consider the case of a differential
form of the type
Then,
⎡ ⎤
P
∂φi1 ∂φiM
Tψ (Tφ ω) = ⎣(fi1 ,...,iM ◦ φ) ··· ⎦ ◦ ψ dψj1 ∧ · · · ∧ dψjM .
∂yj1 ∂yjM
j1 ,...,jM =1
and being
P
∂φi
d(φ ◦ ψ)ik = d(φik ◦ ψ) = k
◦ψ dψj ,
∂yj
j =1
Proposition B.2
Assume that φ be of class C 2 . If ω is of class C 1 , then also Tφ ω is such, and
d(Tφ ω) = Tφ (dω) .
Proof
Here, too, it is sufficient to consider the case ω = fi1 ,...,iM dxi1 ∧ · · · ∧ dxiM . We have
N
∂fi1 ,...,iM
dω(x) = (x) dxm ∧ dxi1 ∧ · · · ∧ dxiM ,
∂xm
m=1
hence
N
∂fi 1 ,...,iM
Tφ (dω) = ◦φ dφm ∧ dφi1 ∧ · · · ∧ dφiM ,
∂xm
m=1
Proposition B.3
If σ : I → RN is a M-surface whose support is contained in U, then
ω= Tσ ω .
σ I
196 Appendix B • Stokes–Cartan and Poincaré Theorems
Proof
As above, we just consider the case ω = fi1 ,...,iM dxi1 ∧ · · · ∧ dxiM . We have
M
∂σi1 ∂σiM
Tσ ω = fi1 ,...,iM (σ (u)) (u) . . . (u) duj1 ∧ · · · ∧ dujM
I I ∂uj1 ∂ujM
j1 ,...,jM =1
= fi1 ,...,iM (σ (u)) det σ(i 1 ,...,iM ) (u) du
I
= ω.
σ
We are now ready to give the proof of the Stokes–Cartan theorem, whose
statement, we recall, is the following.
Theorem B.4
Let 0 ≤ M ≤ N − 1. If ω : U → M (RN ) is a M-differential form of class C 1 and
σ : I → RN is a (M + 1)-surface whose support is contained in U, then
dω = ω.
σ ∂σ
Proof
The case M = 0 follows from the Fundamental Theorem applied to the function ω ◦ σ :
[a, b] → R. Assume now 1 ≤ M ≤ N − 1. Being
ω= Tσ ◦α + ω = Tα + (Tσ ω) = Tσ ω ,
σ ◦αk+ Ik k
Ik k
αk+
M+1
M+1
ω= (−1)k ω+ (−1)k−1 ω
∂σ k=1 σ ◦αk+ k=1 σ ◦βk+
M+1
M+1
= (−1) k
Tσ ω + (−1) k−1
Tσ ω
k=1 αk+ k=1 βk+
= Tσ ω .
∂I
Appendix B • Stokes–Cartan and Poincaré Theorems
197 B
But
d(Tσ ω) = Tσ (dω) = dω .
I I σ
Consider now the set [0, 1] × U, whose elements will be denoted by (t, x) =
(t, x1 , . . . , xN ). Let us define the linear operator K which transforms a generic M-
differential form α : [0, 1] × U → M (R N +1 ) in a (M − 1)-differential form
K(α) : U → M−1 (R N ) in the following way:
a) if α(t, x) = f (t, x) dt ∧ dxi1 ∧ · · · ∧ dxiM−1 (notice that here the term dt appears),
then
1
K(α)(x) = f (t, x) dt dxi1 ∧ · · · ∧ dxiM−1 ;
0
b) if α(t, x) = f (t, x) dxi1 ∧ · · · ∧ dxiM (here the term dt does not appear), then
K(α) = 0;
c) in all the other cases, K is defined by linearity (for each component of a generic M-
differential form α, the term dt may appear or not, and the previous two definitions
apply).
Lemma B.5
If α : [0, 1] × U → M (R N +1 ) is a M-differential form of class C 1 , then
d(K(α)) + K(dα) = Tξ α − Tψ α .
Proof
Because of the linearity, it will be sufficient to consider the two cases when the differential
form α is of one of the two kinds considered in a) and b).
a) If α(t, x) = f (t, x) dt ∧ dxi1 ∧ · · · ∧ dxiM−1 , by the Leibniz rule we have
N
1
∂f
d(K(α))(x) = (t, x) dt dxm ∧ dxi1 ∧ · · · ∧ dxiM−1 ;
0 ∂xm
m=1
∂f
dα(t, x) = (t, x) dt ∧ dt ∧ dxi1 ∧ · · · ∧ dxiM−1 +
∂t
N
∂f
+ (t, x) dxm ∧ dt ∧ dxi1 ∧ · · · ∧ dxiM−1
∂xm
m=1
N
∂f
=− (t, x) dt ∧ dxm ∧ dxi1 ∧ · · · ∧ dxiM−1 ,
∂xm
m=1
and hence
N
1
∂f
K(dα)(x) = − (t, x) dt dxm ∧ dxi1 ∧ · · · ∧ dxiM−1
0 ∂xm
m=1
= −d(K(α))(x) .
∂f
dα(t, x) = (t, x) dt ∧ dxi1 ∧ · · · ∧ dxiM +
∂t
N
∂f
+ (t, x) dxm ∧ dxi1 ∧ · · · ∧ dxiM ,
∂xm
m=1
Appendix B • Stokes–Cartan and Poincaré Theorems
199 B
and hence
1
∂f
K(dα)(x) = (t, x) dt dxi1 ∧ · · · ∧ dxiM
0 ∂t
= (f (1, x) − f (0, x)) dxi1 ∧ · · · ∧ dxiM .
Moreover,
We can now give the proof of the Poincaré theorem, whose statement is recalled
below.
Theorem B.6
Let U be an open subset of RN , star-shaped with respect to a point x̄. For 1 ≤ M ≤ N,
a M-differential form ω : U → M (RN ) of class C 1 is exact if and only if it is closed.
In that case, if ω is of the type
ω(x) = fi1 ,...,iM (x) dxi1 ∧ · · · ∧ dxiM ,
1≤i1 <···<iM ≤N
M
ω(x) = (−1)s+1 (xis − x̄is )·
1≤i1 <···<iM ≤N s=1
1
· :is ∧ · · · ∧ dxiM .
t M−1 fi1 ,...,iM (x̄ + t (x − x̄)) dt dxi1 ∧ · · · ∧ dx
0
Proof
To simplify the notations, we can assume without loss of generality that x̄ = (0, 0, . . . , 0);
let φ : [0, 1] × U → U be defined by
Observe that
K(Tφ ω)(x) =
1
M
= fi1 ,...,iM (tx)t M−1 :is ∧ · · · ∧ dxiM ,
(−1)s+1 xis dt dxi1 ∧ · · · ∧ dx
0 s=1
so that
ω = K(Tφ ω). We want to prove that d
ω = ω. Being ω closed, we have
ω = d(K(Tφ ω))
d
= Tξ (Tφ ω) − Tψ (Tφ ω) − K(d(Tφ ω))
= Tξ (Tφ ω) − Tψ (Tφ ω)
= Tφ◦ξ ω − Tφ◦ψ ω .
Being φ ◦ ξ the identity function and φ ◦ ψ the null function, we have that Tφ◦ξ ω = ω and
Tφ◦ψ ω = 0, which concludes the proof.
We have thus concluded the proof of the two main theorems of the theory, and maybe
something should be said about the need for such a theory. Of course, its mathematical
beauty would alone justify its existence and development. Nevertheless, such a nice
theory also finds a lot of applications in the physical world. The reason of this probably
lies in the fact that it was motivated by the classical theorems in Electromagnetism and
Fluidodynamics, hence the abstract construction lies on some very concrete bases.
So we find an example here of how Physics and Mathematics interact to help and
motivate each other, leading to wonderful successful theories which may have a lot of
practical implications in our lifes.
201 C
Appendix C
On Differentiable Manifolds
Alessandro Fonda
We would like to show here how the theory on differential forms developed in
Chap. 3, and in particular the Stokes–Cartan theorem, can be adapted to the context of
differentiable manifolds. However, contrary to our habit, we will not give the complete
proofs of all the results of this section; the interested reader will find useful to refer, e.g.,
to [20]. We consider a subset M of RN .
Definition C.1
The set M is a M-dimensional differentiable manifold, with 1 ≤ M ≤ N (or
briefly a M-manifold) if, taken a point x in M, there are an open neighborhood A
of x, an open neighborhood B of 0 in RN and a diffeomorphism ϕ : A → B such
that ϕ(x) = 0 and, either
(a) ϕ(A ∩ M) = {y = (y1 , . . . , yN ) ∈ B : yM+1 = · · · = yN = 0} ,
or
(b) ϕ(A ∩ M) = {y = (y1 , . . . , yN ) ∈ B : yM+1 = · · · = yN = 0 and yM ≥ 0} .
It can be seen that (a) and (b) cannot hold at the same time. The points x for which
(b) is verified make up the boundary of M, which we denote by ∂ M. If ∂ M is empty,
we are speaking of a M-manifold without boundary; otherwise, M is sometimes said
to be a M-manifold with boundary.
First of all we notice that the boundary of a differentiable manifold is itself a
differentiable manifold, with a lower dimension.
202 Appendix C • On Differentiable Manifolds
Theorem C.2
The set ∂ M is a (M − 1)-manifold without boundary:
∂(∂ M) = Ø .
Proof
Taken a point x in ∂ M, there are an open neighborhood A of x, an open neighborhood B of
0 in RN and a diffeomorphism ϕ : A → B such that ϕ(x) = 0 and
Reasoning on the fact that the conditions (a) an (b) of the definition can not hold
simultaneously for any point of M, it is possible to prove that
Let us now see that, given a M-manifold M, correspondingly to each of its point x
it is possible to find a local M-parametrization.
Theorem C.3
For every x ∈ M, there is a neighborhood A of x such that A ∩ M can be M-
parametrized with a function σ : I → RN, where I is a rectangle of RM of the type
[−α, α]M if x ∈
∂M ,
I=
[−α, α]M−1 × [0, α] if x ∈ ∂ M ,
and σ (0) = x.
Proof
Consider the diffeomorphism ϕ : A → B given by the above definition and take an α > 0
such that the rectangle B = [−α, α]N be contained in B. Setting A = ϕ −1 (B ), we have
that A is a neighborhood of x (indeed, the set B =] − α, α[N is open and hence also A =
ϕ −1 (B ) is open, and x ∈ A ⊆ A ). We can then take the rectangle I as in the statement
and define σ (u) = ϕ −1 (u, 0). It is readily seen that σ is injective and σ (I ) = A ∩ M.
Moreover, ϕ(1,...,M) (σ (u)) = u for every u ∈ I ; hence, ϕ(1,...,M) (σ (u)) · σ (u) is the identity
matrix, so that σ (u) has rank M, for every u ∈ I.
Appendix C • On Differentiable Manifolds
203 C
Remark In the proof we have seen that M can be covered by a family of open sets of the
type A , so that for each of them there is a local M-parametrization σ , defined on an open
set containing I and injective there, such that A ∩ M ⊆ σ (I ). Restricting if necessary the
sets A , this property still holds if instead of A we take an open ball B(x, ρx ). Moreover, if
x is a point of the boundary ∂ M, the M-parametrization σ is such that the interior points of
a single face of the rectangle I are sent on ∂ M.
We want to define an orientation for M, which will automatically induce one also
for ∂ M. Given x ∈ M, let σ : I → RN be a local M-parametrization, with σ (0) = x.
Since σ (u) has rank M, for every u ∈ I, we have that the vectors
∂σ ∂σ
(u) , . . . , (u)
∂u1 ∂uM
form a basis for a vector space of dimension M which will be called the tangent space
to M at the point σ (u) and will be denoted by Tσ (u) M (in particular, if u = 0, we have
the tangent space Tx M).
Now, once u ∈ I is considered, the point σ (u) will belong also to the images of
other M-parametrizations. There can be a σ̃ : J → RN such that σ (u) = σ̃ (v), for
some v ∈ J. We know from Chap. 3 how it is possible to change the orientation
to such a σ̃ with a simple change of variable. Hence, we can choose these local M-
parametrizations so that the bases of the tangent space Tσ (u) M = Tσ̃ (v) M associated to
them all be coherently oriented; this means that the matrix which permits to pass from
one basis to the other has a positive determinant. We will call coherent such a choice.
A coherent choice of the local M-parametrizations is therefore always possible,
remaining in a neighborhood of x. But we are interested in the possibility of making
a global coherent choice, i.e., for all possible local M-parametrizations of M. This is
not always possible: for example, it can be seen that this can not be done for a Möbius
strip, which is a 2-manifold.
Whenever all the local M-parametrizations of M can be chosen coherently, we say
that M is orientable. From now on we will always assume that M is orientable and
that a coherent choice of all the local M-parametrizations has been made. We then say
that M has been oriented.
Once we have oriented M, let us see how it is possible to define, from that, an
orientation on ∂ M. Given x ∈ ∂ M, let σ : I → RN be a local M-parametrization
with σ (0) = x; recall that in this case I is the rectangle [−α, α]M−1 × [0, α].
Being ∂ M a (M − 1)-manifold, the tangent vector space Tx ∂ M has dimension
M − 1 and is a subspace of Tx M, which has dimension M. Hence, there are two
versors in Tx M which are orthogonal to Tx ∂ M. We denote by ν(x) the one which
is obtained as a directional derivative ∂σ ∂v
(0) = dσ (0)v, for some v = (v1 , . . . , vM )
with vM < 0. At this point, we choose a basis [v (1) (x), . . . , v (M−1) (x)] in Tx ∂ M such
that [ν(x), v (1) (x), . . . , v (M−1) (x)] be a basis of Tx M oriented coherently with the one
already chosen in this space. Proceeding in this way for every x, it can be seen that ∂ M
is thus oriented: we have assigned to it the induced orientation from that of M.
204 Appendix C • On Differentiable Manifolds
In general, we have seen that M can be covered by some open sets A of RN, which
we can assume to be open balls, for each of which there is a local M-parametrization
σ : I → RN with A ∩ M ⊆ σ (I ). Being M compact, there is a finite sub-covering:
let it be given by A 1 , . . . , A n . The open set V = A 1 ∪ · · · ∪ A n contains then M. We
now need the following result.
Theorem C.4
There exist some functions φ1 , . . . , φn : V → R, of class C ∞ , such that, for every x
and every k ∈ {1, . . . , n}, the following properties hold:
(i) 0 ≤ φk (x) ≤ 1 ,
(ii) x
∈ A k ⇒ φk (x) = 0 ,
and, for x ∈ M,
n
(iii) k=1 φk (x) = 1.
Proof
Let A k = B(xk , ρk ), with k = 1, . . . , n. Consider the function f : R → R defined by
exp 1
u2 −1
if |u| < 1 ,
f (u) =
0 if |u| ≥ 1 ,
and set
x − x k
ψk (x) = f .
ρk
Then, for every x ∈ V , it is ψ1 (x) + · · · + ψn (x) > 0, and we can define the functions
ψk (x)
φk (x) = .
ψ1 (x) + · · · + ψn (x)
It is possible to prove that such a definition does not depend neither on the (coherent)
choice of the local M-parametrizations, nor on the particular partition of unity defined
above.
We can now state the analogue of the Stokes–Cartan theorem.
Theorem C.5
If ω : U → M (RN ) is a M-differential form of class C 1 and M is a compact,
oriented (M + 1)-manifold contained in U, then
dω = ω
M ∂M
Proof
Let us first assume that there is a local M-parametrization σ : I → RN such that
σ (I ) ∩ ∂ M = Ø ,
and ω|M is equal to zero outside σ (I ). By the injectivity of σ and the continuity of ω, we
have that ω has to be zero on all points of the support of ∂σ, so that
dω = dω = ω = 0.
M σ ∂σ
and since ω is zero outside the support of ∂σ except for the points coming from Ij , which
belong to ∂ M, we have
ω= ω.
∂σ ∂M
we then have
n
dω = φk · dω
M k=1 M
n
n
= dφk ∧ ω + φk · dω
k=1 M k=1 M
n
= d(φk · ω)
k=1 M
n
= φk · ω
k=1 ∂ M
= ω,
∂M
Appendix D
The Banach–Tarski Paradox
Alessandro Fonda
Definition D.1
Two subsets A, B of R3 are said to be equi-decomposable if there are some sets
A1 , . . . , An , which are pairwise disjoint, and B1 , . . . , Bn , also pairwise disjoint,
such that
A = A1 ∪ · · · ∪ An , B = B1 ∪ · · · ∪ Bn ,
B = {x ∈ R3 : x ≤ 1} ; S = {x ∈ R3 : x = 1} .
Moreover, once we have fixed a vector v ∈ R3 such that v > 2, for every subset E of
R3 we denote by ET the translation of E by the vector v :
ET = {x ∈ R3 : x − v ∈ E} .
Theorem D.2
It truly happens that B ∼ B ∪ BT .
Lemma D.3
There exist two independent rotations of R3 .
Proof
Let ρ be the rotation with angle arccos 13 in counter-clockwise direction around the x-axis and
φ be the analogous rotation around the z-axis. We have that ρ, ρ −1 , φ, φ −1 are represented
by the following matrices:
⎛ ⎞ ⎛ √ ⎞
1 0 0√ 1
∓232 0
⎜ ⎟ ⎜ 23√2 ⎟
ρ ±1 = ⎝ 0 1
3√ ∓232 ⎠ , φ ±1 = ⎝± 3 1
0⎠ .
3
0 ±232 1
3 0 0 1
We will show that any simplified composition f of elements in {ρ, ρ −1 , φ, φ −1 } can not be
the identity, proving by induction the following proposition, for n ≥ 1 :
(Pn ) For every f having n components, one has that f (1, 0, 0) is of the form
1 √
(a, b 2, c) ,
3n
φ ±1 ρ ±1 f , ρ ±1 φ ±1 f , φ ±1 φ ±1 f , ρ ±1 ρ ±1 f ,
σ (x) = {f (x) : f ∈ F } .
It is easily seen that σ (x) is contained in S \ D and that two orbits either coincide or are
disjoint. Hence, the set of all orbits makes up a partition of S \ D. By the axiom of choice,
we can construct a set M taking a single point from each of these orbits.
Let us prove now that, varying f ∈ F , the sets f (M) generate a partition of S \ D. First
observe that, taken x ∈ S \ D, there is a u ∈ σ (x) ∩ M; it will be u = g(x), for some g ∈ F .
Then, setting f = g −1 , we have that f ∈ F and x = f (u) ∈ f (M). This proves that the
sets f (M) cover S \ D. Secondly, if there is a x ∈ f1 (M) ∩ f2 (M), with f1 , f2 ∈ F , then
both f1−1 (x) and f2−1 (x) belong to σ (x) ∩ M, and therefore coincide, by the way M has
been defined. Then, x = f2 (f1−1 (x)), which means that x is a fixed point for f2 f1−1 . Since
x
∈ D, it has to be f1 = f2 . this proves that the sets f (M) are pairwise disjoint.
210 Appendix D • The Banach–Tarski Paradox
They form a partition of (S \ D) \ M, since the identity is excluded from those f starting on
the left side with ρ, ρ −1 , φ, φ −1 . Let us prove that
A1 ∪ ρ(A2 ) = S \ D , A1 ∩ ρ(A2 ) = Ø ,
A3 ∪ φ(A4 ) = S \ D , A3 ∩ φ(A4 ) = Ø .
If x ∈ S \ D, it has to belong to one and only one of the sets f (M), with f ∈ F . If f begins
with ρ, then x ∈ A1 ; otherwise, since we are only considering simplified compositions,
ρ −1 f begins with ρ −1 , so that ρ −1 (x) ∈ A2 , that is x ∈ ρ(A2 ). The two first equalities then
follow. Analogously one proves the two second ones.
If we define B1 = A1 , B2 = ρ(A2 ) , B3 = (A3 )T , B4 = (φ(A4 ))T , we thus have seen
that
(S \ D) \ M ∼ (S \ D) ∪ (S \ D)T .
β : (S \ D) \ M → (S \ D) ∪ (S \ D)T ,
∞
$
C= α n ((S \ D)T ) = (S \ D)T ∪ α((S \ D)T ) ∪ α 2 ((S \ D)T ) ∪ . . . ,
n=0
Step 2 We will now prove that S ∼ S ∪ ST . Let be a line, passing through the origin 0 of
R3, which does not intersect the countable set D. Let us see that there is a rotation ψ, having
as axis, such that
∀n ≥ 1 ψ n (D) ∩ D = Ø .
Consider first the set 1 made of those angles which are rational multiples of 2π. These
angles determine all the possible rotations f around the axis for which a point of D comes
back to itself by applying an iterate f n . Consider now two points x 1 , x 2 ∈ D which lie on
the same orthogonal plane to : the point x 1 will be moved onto the point x 2 by a rotation
around the axis by a certain angle θ12 ∈ [0, 2π[ . Define the set 2 made of the angles of
the type (θ12 + 2πm)/n; these angles determine all the possible rotations f around the axis
for which the point x 1 is moved on x 2 by applying an iterate f n . Since D is countable, the
sets n can be ordered in a sequence. Moreover, each of the sets n is countable and hence
their union is countable, too. It is therefore sufficient to take an angle which does not depend
on any n to find a rotation ψ with the required property.
Consider now the set
∞
$
=
D ψ n (D) = D ∪ ψ(D) ∪ ψ 2 (D) ∪ . . . .
n=0
Then,
∪ (S \ D)
S=D ∼ ψ(D)
∪ (S \ D)
= S \D.
Step 3 Associating to each x ∈ S the radius without its origin {λx : λ ∈ ]0, 1]}, we can
proceed exactly as above to prove that B \{0} ∼ (B \{0})∪(B \{0})T . Let now η be a rotation
by an angle which is an irrational multiple of 2π around the line {(x, 0, 1/2) : x ∈ R}.
Consider the set Õ = {ηn (0) : n ≥ 0}. Then,
In an analogous way one sees that BT ∼ (B \ {0})T ; by the symmetry and the transitivity of
the relation ∼, we conclude that B ∼ B ∪ BT , and the theorem is thus proved.
As we said above, the Banach–Tarski theorem should make anyone aware of the
strange consequences one can face when dealing with non-measurable sets. Those sets
are constructed by the use of the axiom of choice. Nevertheless, this axiom is still usually
accepted for its remarkable usefulness in modern mathematics.
213 E
Appendix E
A Brief Historical Note
Alessandro Fonda
In this appendix, I will take a brief look at the historical evolution of the concept of
integral, without laying any claims to completeness. In particular, I would like to stress
the role played by the Riemann sums, in different stages.
The primary motive for the integral calculus stems from geometrical problems such
as the computation of the length of a curve, the area of a surface, planar or not, and the
volume of a solid. Some of these were already computed since Ancient Greek times,
in particular by Eudoxus (4th century B.C.) and Archimedes (3rd century B.C.). The
method used at that time was based on two steps: first, a candidate for the integral was
found by the use of approximations which could resemble the Riemann sums; then,
the rigorous proof was given by the so-called “exhaustion method”. Obviously, the
notations were completely different from ours, and the procedure followed was mainly
geometrical rather than analytical.
The main significant change in the setting of the problem was made in the seven-
teenth century, when Descartes discovered Analytic Geometry. In particular, differential
calculus started to be developed as the method of determining the tangents to a given
curve. A fundamental step was then made by Leibniz (1682) and Newton (1687). They
independently understood the link between differential and integral calculus, finding
that, if F is a function whose derivative coincides with f, then
b
f (x) dx = F (b) − F (a) .
a
interest at that time. In this framework, the above formula was sometimes used as the
very definition of integral, and the Riemann sums approach took on a secondary role.
However, a fundamental research by Fourier (1811) put in evidence that such a
theory was too restrictive: it was time to develop a theory which could deal with some
discontinuous, not primitivable functions. Cauchy (1823) was the first to provide a
rigorous basis for the theory, but it was Riemann (1854) who introduced the definition
of integrable function, the one we have called R-integrable in this book, which we recall
here.
Definition E.1
A function f : [a, b] → R is R-integrable and its integral is some real number A if
the following property holds: for every ε > 0 there is a real number δ > 0 for which
one has
m
f (x )(a − a ) − A ≤ ε,
j j j −1
j =1
with
aj − aj −1 ≤ δ , and aj −1 ≤ xj ≤ aj .
Nevertheless, the Cauchy-Riemann theory was not the final solution to the problem
of integration: Volterra (1881) gave an example of a primitivable and bounded function
which is not R-integrable on [a, b]. The main problem was that this procedure did not
take into account the particular properties of the functions involved. According to Borel,
it was like “a ready-made outfit which doesn’t suit to each and everyone”. The problem
of finding a theory where both R-integrable and primitivable functions were integrable
persisted.
Lebesgue (1902) faced this problem introducing the following integration procedure.
Given f : [a, b] → R, bounded with non-negative values, and C > 0 such that
0 ≤ f (x) < C for every x ∈ [a, b], let usconsider the subdivision of the interval [0, C[
(and not of the interval [a, b]!) in n parts j −1
n
C, j
n
C , and consider the sets
j −1 j
Enj = x ∈ [a, b] : C ≤ f (x) < C .
n n
The problem is how to “measure” these sets. In general, given a set E ⊆ [a, b],
Lebesgue calls “outer measure” of E the infimum of the set of all sums k (dk − ck ),
finite or countable, obtained by considering a covering of E with the intervals [ck , dk ].
Appendix E • A Brief Historical Note
215 E
Denoting by μ∗ (E) this exterior measure, the “interior measure” of E is given by
μ∗ (E) = (b − a) − μ∗ ([a, b] \ E). The set E is measurable if μ∗ (E) = μ∗ (E), in
which case this number is called the “measure of E” and is denoted by μ(E). At this
point, Lebesgue defines the integrable functions, which in this book have been called
L-integrable.
Definition E.2
A function f : [a, b] → R, such that 0 ≤ f (x) < C for every x ∈ [a, b], is
L-integrable and its integral is a real number A if the following property holds: for
j
every ε > 0 there is a natural number n̄ such that, taken n ≥ n̄, the sets En are
measurable and
n
j
Cμ(E
j
) − A ≤ ε.
n
n
j =1
The definition is then extended to functions which are not bounded and with arbitrary
real values in the following way.
Definition E.3
Assume that the function f : [a, b] → R has non-negative values but is not
bounded above. In this case, f is said to be L-integrable on [a, b] if, for every
positive integer k, the function fk (x) = min{f (x), k} is L-integrable on [a, b] and
b
the sequence of the integrals ( a fk )k has a finite limit as k tends to +∞. Such a
limit is said to be the “integral of f on [a, b]”. Whenever f not only has
non-negative values, it is said to be L-integrable if both its positive part f + and its
negative part f − are L-integrable; the integral of f is then the difference of the
respective integrals of f + and f − .
Again according to Borel, this procedure “is custom made, and perfectly adapts
to the properties of each function”. Lebesgue’s theory was completely satisfactory for
several aspects. It was shown that every R-integrable function is also L-integrable and
the two integrals have the same value, and if a function is primitivable and bounded,
then it is L-integrable.
However, the problem of the integrability of not bounded primitivable functions
remained unsolved at the time. It finally found a solution some years later by Denjoy
(1912) and Perron (1914), who extended Lebesgue’s theory with two different but
equivalent approaches. While Denjoy used a transfinite induction method starting from
Lebesgue definition, Perron’s method is more in line with the formula for the integral of
primitivable functions. Let us take a simplified and brief look at Perron’s method.
We call lower-primitive of f a function F− such that F− (a) = 0 and, for every
x ∈ [a, b], it is F− (x) ≤ f (x); similarly, we define an upper-primitive F+ of f, by
changing the inequality sign. We say that f is almost primitivable if there are a lower-
216 Appendix E • A Brief Historical Note
Definition E.4
A function f : [a, b] → R is P-integrable if it is almost primitivable and
While it is clear that every primitivable function is P-integrable, to prove that every
L-integrable function is P-integrable (with the same value for the integral) is rather
complicated. It comes out that a function f is L-integrable if and only if both f and
|f | are P-integrable.
The approach to the definition of integral had been set aside for a long time
when Kurzweil (1957) and Henstock (1961) proposed the following modification to
Riemann’s definition: after observing that the two conditions aj − aj −1 ≤ δ and
aj −1 ≤ xj ≤ aj can be replaced by
xj − δ ≤ aj −1 ≤ xj ≤ aj ≤ xj + δ ,
without modifying the definition, in order to adapt the procedure to each function they
decided to allow greater freedom of choice for the δ. With a simple but far-reaching
insight, the formerly constant δ could now vary in the interval [a, b], according to the
needs of the function. This is how they reached the definition that we have adopted in
this book:
Definition E.5
A function f : [a, b] → R is integrable (according to Kurzweil and Henstock) and
its integral is a real number A if the following property holds: for every ε > 0 there
is a function δ : [a, b] → R, with positive values, for which one has
m
f (x )(a − a ) − A ≤ ε,
j j j −1
j =1
and
xj − δ(xj ) ≤ aj −1 ≤ xj ≤ aj ≤ xj + δ(xj ) .
Appendix E • A Brief Historical Note
217 E
Surprisingly, a function is integrable (according to Kurzweil and Henstock) if and
only if it is P-integrable, and in that case the value of the integral is the same.
Some years later Mac Shane (1969) proved that, by modifying in the above definition
the condition aj −1 ≤ xj ≤ aj with the less restrictive one xj ∈ [a, b], an alternative
definition of L-integrable functions is obtained.
In conclusion, it can be said that the Riemann sums have played and still play a
fundamental role in the theory of integration, even if with alternate fortunes. Intuitively
used by the ancient Greeks, they were placed on the back burner once the theory of
Leibniz and Newton was introduced, only to be back in the limelight thanks to Cauchy
and Riemann. Overshadowed once again by the theories of Lebesgue, Denjoy and
Perron, they proved to be important again in the work of Kurzweil and Henstock, who
re-introduced them to unify these theories in an easy and intuitive way. And they are the
subject of interesting research developments even today.
219
Bibliography
1. R.G. Bartle, A Modern Theory of Integration, American Mathematical Society, Providence, 2001.
2. Z. Buczolich, The g-integral is not rotation invariant, Real Analysis Exchange 18 (1992/93),
437–447.
3. A. Fonda, Lezioni sulla Teoria dell’Integrale, Ed. Goliardiche, Roma, 2001.
4. R.A. Gordon, The Integrals of Lebesgue, Denjoy, Perron, and Henstock, American Mathematical
Society, Providence, 1994.
5. R. Henstock, Definitions of Riemann type of the variational integrals, Proceedings of the London
Mathematical Society 11 (1961), 402–418.
6. R. Henstock, Theory of Integration, Butterworths, London, 1963.
7. R. Henstock, The Generalized Theory of Integration, Clarendon Press, Oxford, 1991.
8. J. Kurzweil, Generalized ordinary differential equations and continuous dependence on a parameter,
Czechoslovak Mathematical Journal 7 (1957), 418–449.
9. J. Kurzweil, Nichtabsolut Konvergente Integrale, Teubner, Leipzig, 1980.
10. J. Kurzweil, Henstock–Kurzweil Integration: Its Relation to Topological Vector Spaces, World
Scientific, Singapore, 2000.
11. S. Leader, The Kurzweil–Henstock Integral and its Differentials, Marcel Dekker, New York, 2001.
12. P.Y. Lee, Lanzhou Lectures on Henstock Integration, World Scientific, Singapore, 1989.
13. P.Y. Lee and R. Vyborny, The Integral. An Easy Approach after Kurzweil and Henstock,
Cambridge University Press, Cambridge, 2000.
14. T.Y. Lee, Henstock–Kurzweil integration on Euclidean spaces, World Scientific, Singapore, 2011.
15. J. Mawhin, Analyse: Fondements, Techniques, Evolution, De Boeck, Bruxelles, 1979–1992.
16. R.M. McLeod, The Generalized Riemann Integral, Mathematical Association of America,
Washington, 1980.
17. E.J. McShane, Unified Integration, Academic Press, New York, 1983.
18. W.F. Pfeffer, The Riemann Approach to Integration, Cambridge University Press, 1993.
19. W.F. Pfeffer, Derivation and Integration, Cambridge University Press, Cambridge, 2001.
20. M. Spivak, Calculus on Manifolds, Benjamin, Amsterdam, 1965.
21. Ch. Swartz, Introduction to Gauge Integrals, World Scientific, Singapore, 2001.
Index
negligible, 73 negligible, 73
non-overlapping, 61, 81 non-overlapping, 81
normal versor, 136 star-shaped, 168
solenoidal, 170
orientation spherical coordinates, 111
induced, 203 star-shaped set, 168
oriented boundary support of a M-surface, 135
of a M-surface, 155 surface, 135
of a rectangle, 152 surface integral, 145