Introduction To Mathematical Analysis I - Second Edition
Introduction To Mathematical Analysis I - Second Edition
Second Edition
Beatriz Lafferriere
MTH 311
Gerardo Lafferriere
Mau Nam Nguyen
BeatrizLafferriere
GerardoLafferriere
NguyenMauNam
PDXOpenTextbooks
2016
2016PortlandStateUniversity
ISBN:9781365605529
Thisworkislicensedundera
CreativeCommonsAttributionNonCommercial4.0InternationalLicense
Youarefreeto:
Sharecopyandredistributethematerialinanymediumorformat
Adaptremix,transform,andbuilduponthematerial
Thelicensorcannotrevokethesefreedomsaslongasyoufollowthelicenseterms.
Underthefollowingterms:
AttributionYoumustgiveappropriatecredit,providealinktothelicense,andindicateif
changesweremade.Youmaydosoinanyreasonablemanner,butnotinanywaythatsuggests
thelicensorendorsesyouoryouruse.
NonCommercialYoumaynotusethematerialforcommercialpurposes.
ThispublicationwasmadepossiblebyPDXOpenpublishinginitiative
PublishedbyPortlandStateUniversityLibrary
Portland,OR972071151
CoverdesignbyMattEich
Contents
2 SEQUENCES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
2.1 CONVERGENCE 31
2.2 LIMIT THEOREMS 37
2.3 MONOTONE SEQUENCES 41
2.4 THE BOLZANO-WEIERSTRASS THEOREM 47
2.5 LIMIT SUPERIOR AND LIMIT INFERIOR 50
2.6 OPEN SETS, CLOSED SETS, COMPACT SETS, AND LIMIT POINTS 55
4 DIFFERENTIATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
4.1 DEFINITION AND BASIC PROPERTIES OF THE DERIVATIVE 101
4.2 THE MEAN VALUE THEOREM 107
4.3 SOME APPLICATIONS OF THE MEAN VALUE THEOREM 114
4.4 LHOSPITALS RULE 117
4.5 TAYLORS THEOREM 123
4.6 CONVEX FUNCTIONS AND DERIVATIVES 126
4.7 NONDIFFERENTIABLE CONVEX FUNCTIONS AND SUBDIFFERENTIALS 132
Our goal in this set of lecture notes is to provide students with a strong foundation in mathematical
analysis. Such a foundation is crucial for future study of deeper topics of analysis. Students should be
familiar with most of the concepts presented here after completing the calculus sequence. However,
these concepts will be reinforced through rigorous proofs.
The lecture notes contain topics of real analysis usually covered in a 10-week course: the
completeness axiom, sequences and convergence, continuity, and differentiation. The lecture notes
also contain many well-selected exercises of various levels. Although these topics are written in a
more abstract way compared with those available in some textbooks, teachers can choose to simplify
them depending on the background of the students. For instance, rather than introducing the topology
of the real line to students, related topological concepts can be replaced by more familiar concepts
such as open and closed intervals. Some other topics such as lower and upper semicontinuity,
differentiation of convex functions, and generalized differentiation of non-differentiable convex
functions can be used as optional mathematical projects. In this way, the lecture notes are suitable
for teaching students of different backgrounds.
Hints and solutions to selected exercises are collected in Chapter 5. For each section, there is at
least one exercise fully solved. For those exercises, in addition to the solutions, there are explanations
about the process itself and examples of more general problems where the same technique may be
used. Exercises with solutions are indicated by a I and those with hints are indicated by a B.
Finally, to make it easier for students to navigate the text, the electronic version of these notes
contains many hyperlinks that students can click on to go to a definition, theorem, example, or
exercise at a different place in the notes. These hyperlinks can be easily recognized because the text
or number is on a different color and the mouse pointer changes shape when going over them.
worked out in detail, and numerous new exercises. In all we added over 50 examples in the main text
and 100 exercises (counting parts).
We included more prominently the notion of compact set. We defined compactness as what is
more commonly termed sequential compactness. Students find this definition easier to absorb than
the general one in terms of open covers. Moreover, as the emphasis of the whole text is on sequences,
this definition is easier to apply and reinforce.
The following are the more significant changes.
Chapter 1 Added the proofs of several properties of the real numbers as an ordered field.
Chapter 2 We added the proof that compactness is equivalent to closed and bounded in the main
text.
Chapter 3 We added the theorem on extension of uniformly continuous functions and moved the
discussion of Lipschiptz and Hlder continuous functions to the section on uniform continuity.
We created a separate section for limit superior/inferior of functions.
Chapter 4 We clarified the statement and the proof of the second version of LHospitals rule.
We have used these notes several times to teach the one-quarter course Introduction to Mathe-
matical Analysis I at Portland State University. As we are now preparing a companion text for the
second term (Introduction to Mathematical Analysis II) we now added the roman numeral I to the
title.
BASIC CONCEPTS OF SET THEORY
FUNCTIONS
THE NATURAL NUMBERS AND MATHEMATICAL INDUC-
TION
ORDERED FIELD AXIOMS
THE COMPLETENESS AXIOM FOR THE REAL NUMBERS
APPLICATIONS OF THE COMPLETENESS AXIOM
This chapter discusses various mathematical concepts and constructions which are central to the
study of the many fundamental results in analysis. Generalities are kept to a minimum in order to
move quickly to the heart of analysis: the structure of the real number system and the notion of limit.
The reader should consult the bibliographical references for more details.
A = {a, b, c, d}.
We say that a set A is a subset of a set B if every element of A is also an element of B, and write
A B or B A.
Two sets are equal if they contain the same elements. If A and B are equal, we write A = B. The
following result is straightforward and very convenient for proving equality between sets.
Theorem 1.1.1 Two sets A and B are equal if and only if A B and B A.
If A B and A does not equal B, we say that A is a proper subset of B, and write
A ( B.
8 1.1 BASIC CONCEPTS OF SET THEORY
The set 0/ = {x : x 6= x} is called the empty set. This set clearly has no elements. Using
Theorem 1.1.1, it is easy to show that all sets with no elements are equal. Thus, we refer to the empty
set.
Throughout this book, we will discuss several sets of numbers which should be familiar to the
reader:
Z = {0, 1, 1, 2, 2, . . .}, the set of integers (that is, the natural numbers together with zero
and the negative of each natural number).
[a, b] = {x R : a x b},
(a, b] = {x R : a < x b},
[a, ) = {x R : a x},
(a, ) = {x R : a < x},
and similar definitions for (a, b), [a, b), (, b], and (, b). We will say more about the
symbols and in Section 1.5.
Since the real numbers are central to the study of analysis, we will discuss them in great detail in
Sections 1.4, 1.5, and 1.6.
For two sets A and B, the union, intersection, difference, and symmetric difference of A and B are
given respectively by
A B = {x : x A or x B},
A B = {x : x A and x B},
A \ B = {x : x A and x
/ B}, and
A B = (A \ B) (B \ A).
If A B = 0,
/ we say that A and B are disjoint.
The difference of A and B is also called the complement of B in A. If X is a universal set, that is,
a set containing all the objects under consideration, then the complement of A in X is denoted simply
by Ac .
Theorem 1.1.2 Let A, B, and C be subsets of a universal set X. Then the following hold:
(a) A Ac = X;
(b) A Ac = 0;
/
(c) c c
(A ) = A;
(d) (Distributive law) A (B C) = (A B) (A C);
9
Proof: We prove some of the results and leave the rest for the exercises.
(a) Clearly, A Ac X since both A and Ac are subsets of X. Now let x X. Then either x is an
element of A or it is not an element of A. In the first case, x A and, so, x A Ac . In the second
case, x Ac and, so, x A Ac . Thus, X A Ac .
(b) No element of x can be simultaneously in A and not in A. Thus, A Ac = 0. /
(d) Let x A (B C). Then x A and x B C. Therefore, x B or x C. In the first
case, since x is also in A we get x A B and, hence, x (A B) (A C). In the second case,
x A C and, hence, x (A B) (A C). Thus, in all cases, x (A B) (A C). This shows
A (B C) (A B) (A C).
Now we prove the other inclusion. Let x (A B) (A C). Then x A B or x A C.
In either case, x A. In the first case, x B and, hence, x B C. It follows in this case that
x A (B C). In the second case, x C and, hence, x B C. Again, we conclude x A (B C).
Therefore, (A B) (A C) A (B C) as desired.
A set whose elements are sets is often called a collection/family of sets and is often denoted by
script letters such as A or B.
Let I be a nonempty set such that to each i I corresponds a set Ai . Then the family of all sets
Ai as i ranges over I is denoted by
{Ai : i I}.
Such a family of sets is called an indexed family and the set I is called the index set. Consider the
indexed family of sets {Ai : i I}. The union and intersection of this family as i ranges over I is
defined respectively by
[
Ai = {x : x Ai for some i I}
iI
and
\
Ai = {x : x Ai for every i I}.
iI
(a) Let the index set be I = N and for each n N we have An = [n, n]. Then
[ \
An = R An = [1, 1].
nN nN
(b) Here we let the index set be J = (0, 1] and for each J we have A = (, ). Then
[ \
A = (1, 1) A = {0}.
J J
10 1.1 BASIC CONCEPTS OF SET THEORY
The proofs of the following properties are similar to those in Theorem 1.1.2. We include the
proof of part (a) and leave the rest as an exercise.
Theorem 1.1.3 Let {Ai : i I} be an indexed family of subsets of a universal set X and let B be a
subset of X. Then the following hold:
T
(a) B iI Ai = iI B Ai ;
T
S
(b) B iI Ai = iI B Ai ;
S
S
(c) B \ iI Ai = iI B \ Ai ;
T
T
(d) B \ iI Ai = iI B \ Ai ;
S
c S
Ac ;
T
(e) iI Ai =
S c TiI c
(f) iI Ai = iI A .
Proof of (a): Let x B iI Ai . Then x B or x iI Ai . If x B, then x B Ai for all i I
T T
then we must have that x Ai for all i I. Therefore, x iI Ai and, hence, x B iI Ai . This
T T
A B = {(1, 2), (1, 0), (1, 1), (2, 2), (2, 0), (2, 1)}.
Example 1.1.3 If A and B are the intervals [1, 2] and [0, 7] respectively, then A B is the
rectangle
We will make use of cartesian products in the next section when we discuss functions.
Exercises
1.1.1 Prove the remaining items in Theorem 1.1.2.
(X \Y ) Z = Z \ (Y Z).
1 For a precise definition of ordered pair in terms of sets see [Lay13]
11
1.1.5 Let A X and B Y . Determine if the following equalities are true and justify your answer:
(a) (X Y ) \ (A B) = (X \ A) (Y \ B).
(b) (X Y ) \ (A B) = [(X \ A) Y ] [X (Y \ B)].
1.2 FUNCTIONS
Definition 1.2.1 Let X and Y be sets. A function from X into Y is a subset f X Y with the
following properties
(a) For all x X there is y Y such that (x, y) f .
(b) If (x, y) f and (x, z) f , then y = z.
The set X is called the domain of f , the set Y is called the codomain of f , and we write f : X Y .
The range of f is the subset of Y defined by {y Y : there is x X such that (x, y) f }.
It follows from the definition that, for each x X, there is exactly one element y Y such that
(x, y) f . We will write y = f (x). If x X, the element f (x) is called the value of f at x or the
image of x under f .
Note that, in this definition, a function is a collection of ordered pairs and, thus, corresponds
to the geometric interpretation of the graph of a function given in calculus. In fact, we will refer
indistinctly to the function f or to the graph of f . Both refer to the set {(x, f (x)) : x X}.
Let f : X Y and g : X Y be two functions. Then the two functions are equal if they are
equal as subsets of X Y . It is easy to see that f equals g if and only if
f (x) = g(x) for all x X.
It follows from the definition that two equal functions must have the same domain.
Let f : X Y be a function and let A be a subset of X. The restriction of f on A, denoted by f|A ,
is a new function from A into Y given by
f|A (a) = f (a) for all a A.
Definition 1.2.2 A function f : X Y is called surjective (or is said to map X onto Y ) if for every
element y Y , there exists an element x X such that f (x) = y.
The function f is called injective (or one-to-one) if for each pair of distinct elements of X, their
images under f are also distinct. Thus, f is one-to-one if and only if for all x and x0 in X, the
following implication holds:
[ f (x) = f (x0 )] [x = x0 ].
If f is both surjective and injective, it is called bijective or a one-to-one correspondence. In this case,
for any y Y , there exists a unique element x X such that f (x) = y. This element x is then denoted
by f 1 (y). In this way, we already built a function from Y to X called the inverse of f .
12 1.2 FUNCTIONS
Theorem 1.2.1 Let f : X Y . If there are two functions g : Y X and h : Y X such that
g( f (x)) = x for every x X and f (h(y)) = y for every y Y , then f is bijective and g = h = f 1 .
Proof: First we prove that f is surjective. Let y Y and set x = h(y). Then, from the assumption on
h, we have f (x) = f (h(y)) = y. This shows that f is surjective.
Next we prove that f is injective. Let x, x0 X be such that f (x) = f (x0 ). Then x = g( f (x)) =
g( f (x0 )) = x0 . Thus, f is injective.
We have shown that for each y Y , there is a unique x X, which we denote f 1 (y) such that
f (x) = y. Since for such a y, g(y) = g( f (x)) = x, we obtain g(y) = f 1 (y). Since f (h(y)) = y, we
also conclude that h(y) = x = f 1 (y).
Example 1.2.1 Consider the function f : (1, 2] [3, 4) given by f (x) = 4 (x 1)2 . We show
that f is bijective. First let x, y (1, 2] be such that f (x) = f (y). That is, 4 (x 1)2 = 4 (y 1)2 .
Then (x 1)2 = (y 1)2 . Since both x > 1 and y > 1, we conclude that x 1 = y 1 and, so, x = y.
This proves f is injective.
Next let y [3, 4). We want x (1, 2] such that f (x) = y. Let us set up 4 (x 1)2 = y and
solve for x. We get, x = 4 y + 1. Note that since y < 4, y 4 has a square root. Also note that
since 3 y < 4, we have 1 4 y > 0 and, hence, 2 4 y + 1 > 1. Therefore, x (1, 2]. This
proves f is surjective.
Definition 1.2.3 Let f : X Y be a function and let A be a subset of X. Then the image of A under
f is given by
Remark 1.2.2 Note that, despite the notation, the definition of preimage does not require the
function to have an inverse. It does not even require the function to be injective. The examples below
illustrate these concepts.
Example 1.2.2 Let f : R R be given by f (x) = 3x 1. Let A = [0, 2) and B = {1, 4, 5}.
Then f (A) = [1, 5) and f 1 (B) = { 23 , 1, 2}.
Example 1.2.3 Let f : R R be given by f (x) = x + 7. Let A = [0, 2) and B = (, 3]. Then
f (A) = (5, 7] and f 1 (B) = [4, ).
Example 1.2.4 Let f : R R be given by f (x) = x2 . Let A = (1, 2) and B = [1, 4). Then
f (A) = [0, 4) and f 1 (B) = (2, 1] [1, 2).
Theorem 1.2.3 Let f : X Y be a function, let A be a subset of X, and let B be a subset of Y . The
following hold:
(a) A f 1 ( f (A)).
13
(b) f ( f 1 (B)) B.
Theorem 1.2.4 Let f : X Y be a function, let A, B X, and let C, D Y . The following hold:
(c) f 1 ( J B ) = J f 1 (B );
S S
(d) f 1 ( f 1 (B ).
T T
J B ) = J
y = f (x). From the definition of union of a family of sets, there is 0 I such that x A0 . Therefore,
y = f (x) f (A0 ) and, so, y I f (A ). .
S
Theorem 1.2.6 Let f : X Y and g : Y Z be two functions and let B Z. The following hold:
Definition 1.2.4 A sequence of elements of a set A is a function with domain N and codomain A.
We discuss sequences in detail in Chapter 2.
Definition 1.2.5 We say that set A is finite if it is empty or if there exists a natural number n and a
one-to-one correspondence f : A {1, 2, . . . , n}. A set is infinite if it is not finite.
We leave it as an exercise to prove that the union of two finite sets is finite. It is also easy to show,
by contradiction, that N is infinite. The following result will be useful when studying sequences and
accumulation points.
Theorem 1.2.7 Suppose A is an infinite set. Then there exists a one-to-one function f : N A.
Proof: Let A be an infinite set. We define f as follows. Choose any element a1 A and set f (1) = a1 .
Now the set A \ {a1 } is again infinite (otherwise A = {a} (A \ {a1 }) would be the union of two
finite sets). So we may choose a2 A with a2 6= a1 and we define f (2) = a2 2 . Having defined
f (1), . . . , f (k), we choose ak+1 A such that ak+1 A \ {a1 , . . . , ak } and define f (k + 1) = ak+1
(such an ak+1 exists because A \ {a1 , . . . , ak } is infinite and, so, nonempty). The function f so defined
clearly has the desired properties.
To paraphrase, the previous theorem says that in every infinite set we can find a sequence made
up of all distinct points.
Exercises
1.2.1 I Let f : X Y be a function. Prove that:
1.2.2 Let f : R R be given by f (x) = x2 3 and let A = [2, 1) and B = (1, 6). Find f (A) and
f 1 (B).
1.2.8 Prove parts (1), (2), (3), and (5) of Theorem 1.2.6.
1.2.9 Prove that the union of two finite sets is finite. Hint: it is easier to show when the sets are
disjoint.
Theorem 1.3.1 Principle of Mathematical Induction. For each natural number n N, suppose
that P(n) denotes a proposition which is either true or false. Let A = {n N : P(n) is true}. Suppose
the following conditions hold:
(a) 1 A.
(b) For each k N, if k A, then k + 1 A.
Then A = N.
Proof: Suppose conditions (a) and (b) hold. Assume, by way of contradiction, that A 6= N. Set
B = N \A, that is, B = {n N : P(n) is false}. Then B is a nonempty subset of N. By the Well-
Ordering Property of the natural numbers, there exists a smallest element ` B. By condition (a),
1 6 B. Hence, ` 2. It follows that k = ` 1 is a natural number. Since k < `, k 6 B and, hence, we
have that P(k) is true. By condition (b), we obtain that P(k + 1) is true. But k + 1 = `, and P(`) is
false, since ` B. This is a contradiction, so the conclusion follows.
To paraphrase, the principle says that, given a list of propositions P(n), one for each n N, if
P(1) is true and, moreover, P(k + 1) is true whenever P(k) is true, then all propositions are true.
We will refer to this principle as mathematical induction or simply induction. Condition (a) above
is called the base case and condition (b) the inductive step. When proving (b), the statement P(k) is
called the inductive hypothesis.
Example 1.3.1 Prove using induction that for all n N
n(n + 1)
1+2++n = .
2
1(1+1)
The statement P(n) is the equality 1+2+ +n = n(n+1)
2 . Now the base case says that 1 = 2 ,
which is clearly true.
16 1.3 THE NATURAL NUMBERS AND MATHEMATICAL INDUCTION
k(k+1)
Suppose P(k) is true for some k N. That is, suppose that 1 + 2 + + k = 2 (this is the
inductive hypothesis). Now we have
k(k + 1) k(k + 1) + 2(k + 1) (k + 1)(k + 2)
1 + 2 + + k + (k + 1) = + (k + 1) = = .
2 2 2
This shows that P(k + 1) is true. We have now proved conditions (a) and (b) of Theorem 1.3.1.
Therefore, by the principle of mathematical induction we conclude that
n(n + 1)
1+2++n = for all n N .
2
Example 1.3.2 Prove using induction that for all n N, 7n 2n is divisible by 5.
For n = 1, we have 7 2 = 5, which is clearly a multiple of 5.
Suppose that 7k 2k is a multiple of 5 for some k N. That is, there is an integer j such that
7k 2k = 5 j. Let us write 7k = 2k + 5 j. Now, substituting this expression below, we have
7k+1 2k+1 = 77k 22k = 7(2k +5 j)22k = 72k 22k +75 j = 2k (72)+57 j = 5(2k +7 j).
It follows that 7k+1 2k+1 is a multiple of 5. This proves the inductive step.
We conclude by induction that 7n 2n is divisible by 5 for all n N.
Example 1.3.3 Prove using induction that for all n N
n + 1 2n
(k + 1) + 1 2k + 1 2k + 2k = 2 2k = 2k+1 .
Theorem 1.3.2 Generalized Principle of Mathematical Induction. Let n0 N and for each
natural n n0 , suppose that P(n) denotes a proposition which is either true or false. Let A = {n
N : P(n) is true}. Suppose the following two conditions hold:
(a) n0 A.
(b) For each k N, k n0 , if k A, then k + 1 A.
Then {k N : k n0 } A.
Proof: Suppose conditions (a) and (b) hold. Assume, by way of contradiction, that A 6 {k N :
k n0 }. Set B = {n N : n n0 , P(n) is false}. Then B is a nonempty subset of N. By the Well-
Ordering Property of the natural numbers, there exists a smallest element ` B. By condition (a),
n0 6 B. Hence, ` n0 + 1. It follows that k = ` 1 n0 . Since k < `, k 6 B and, so, we have that
P(k) is true. By condition (b), we obtain that P(k + 1) is true. But k + 1 = `, and P(`) is false, since
` B. This is a contradiction, so the conclusion follows.
17
where the second inequality follows since k 4 and, so, 2k 16 > 3. This shows that P(k + 1) is
true. Thus, by the generalized principle of induction, the inequality holds for all n 4.
Next we present another variant of the induction principle which makes it easier to prove the
inductive step. Despite its name, this principle is equivalent to the standard one.
Theorem 1.3.3 Principle of Strong Induction. For each natural n N, suppose that P(n) denotes
a proposition which is either true or false. Let A = {n N : P(n) is true}. Suppose the following
two conditions hold:
(a) 1 A.
(b) For each k N, if 1, 2, . . . , k A, then k + 1 A.
Then A = N.
Remark 1.3.4 Note that the inductive step above says that, in order to prove P(k + 1) is true, we
may assume not only that P(k) is true, but also that P(1), P(2),. . . ,P(k 1) are true.
There is also a generalized version of this theorem where the base case is for some integer n0 > 1.
Example 1.3.5 Prove by induction that every positive integer greater than 1 is either a prime
number or a product of prime numbers.
Clearly, the statement is true for n = 2. Suppose the statement holds for any positive integer
m {2, . . . , k}, where k N, k 2. If k + 1 is prime, the statement holds for k + 1. Otherwise,
there are positive integers p, q > 1 such that k + 1 = pq. Since p, q k, by the inductive assumption
applied to both p and q we can find prime numbers r1 ,. . . ,r` and s1 ,. . . ,sm such that p = r1 r` and
q = s1 sm (note that ` and m may both equal 1). But then
k + 1 = r1 r` s1 sm . (1.2)
Thus, the statement holds true for k + 1. The conclusion now follows by the Principle of Strong
Induction.
Exercises
1.3.1 Prove the following using Mathematical Induction.
n(n+1)(2n+1)
(a) 12 + 22 + + n2 = 6 for all n N.
n2 (n+1)2
(b) 13 + 23 + + n3 = 4 for all n N.
(c) 1 + 3 + + (2n 1) = n2 for all n N.
1 an+1
1 + a + a2 + + an = for all n N .
1a
1.3.6 I The Fibonacci sequence is defined by
Prove that
1 h 1 + 5 n 1 5 n i
an = .
5 2 2
1.3.7 Let a 1. Prove by induction that
1.3.8 B Let a, b R and n N. Use Mathematical Induction to prove the binomial theorem
n
n n k nk
(a + b) = ab ,
k=0 k
n n!
where = .
k k!(n k)!
(1d) For each x R, there exists a unique element x R such that x + (x) = 0.
(3a) For all x, y R, exactly one of the three relations holds: x = y, y < x, or x < y.
(3b) For all x, y, z R, if x < y and y < z, then x < z.
(3c) For all x, y, z R, if x < y, then x + z < y + z.
(3d) For all x, y, z R, if x < y and 0 < z, then xz < yz.
We will use the notation x y to mean x < y or x = y. We may also use the notation x > y to
represent y < x and the notation x y to mean x > y or x = y.
A set F together with two operations + and and a relation < satifying the 13 axioms above is
called an ordered field. Thus, the real numbers are an example of an ordered field. Another example
of an ordered field is the set of rational numbers Q with the familiar operations and order. The
integers Z do not form a field since for an integer m other than 1 or 1, its reciprocal 1/m is not an
integer and, thus, axiom 2(d) above does not hold. In particular, the set of positive integers N does
not form a field either. As mentioned above the real numbers R will be defined as the ordered field
which satisfies one additional property described in the next section: the completeness axiom.
From these axioms, many familiar properties of R can be derived. Some examples are given
in the next proposition. The proof illustrates how the given axioms are used at each step of the
derivation.
(a) If x + y = x + z, then y = z;
(b) (x) = x;
(c) If x 6= 0 and xy = xz, then y = z;
(d) If x 6= 0, then 1/(1/x) = x;
(e) 0x = 0 = x0;
(f) x = (1)x;
(g) x(z) = (x)z = (xz).
(h) If x > 0, then x < 0; if x < 0, then x > 0;
(i) If x < y and z < 0, then xz > yz;
(j) 0 < 1.
20 1.4 ORDERED FIELD AXIOMS
Proof: (a) Suppose x + y = x + z. Adding x (which exists by axiom (2d)) to both sides, we have
(x) + (x + y) = (x) + (x + z).
Then axiom (1a) gives
[(x) + x] + y = [(x) + x] + z.
Thus, again by axiom (2d), 0 + y = 0 + z and, by axiom (1c), y = z.
(e) Using axiom (2e) we have 0x = (0 + 0)x = 0x + 0x. Adding (0x) to both sides (axiom (2d))
and using axioms (1a) and (1c), we get
0 = (0x) + 0x = (0x) + (0x + 0x) = ((0x) + 0x) + 0x = 0 + 0x = 0x.
That 0x = x0 follows from axiom (2b).
(f) Using axioms (2c) and (2e) we get x + (1)x = 1x + (1)x = (1 + (1))x. From axiom (2d)
we get 1 + (1) = 0 and from part (e) we get x + (1)x = 0x = 0. From the uniqueness in axiom (2d)
we get (1)x = x as desired.
(g) Using axioms (2e) and (1c) we have xz + x(z) = x(z + (z)) = x0 = 0. Thus, using
axiom (2d) we get that x(z) = (xz). The other equality follows similarly.
(h) From x > 0, using axioms (3c) and (1c) we have x + (x) > 0 + (x) = x. Thus, using
axiom (2d), we get 0 > x. The other case follows in a similar way.
(i) Since z < 0, by part (h), z > 0. Then, by axiom (3d), x(z) < y(z). Combining this with
part (g) we get xz < yz. Adding xz + yz to both sides and using axioms (1a), (3c), (1b), and (1c)
we get
xy = (xz + xz) + xy = xz + (xz + xy) < xy + (xz + xy) = xy + (xy + xz) = (xy + xy) + xz = xz.
(j) Axiom (2c) gives that 1 6= 0. Suppose, by way of contradiction, that 1 < 0. Then by part (i),
1 1 > 0 1. Since 1 1 = 1, by axiom (2c) and 0 1 = 0 by part (e), we get 1 > 0 which is a
contradiction. It follows that 1 > 0.
Note that we can assume that the set of all natural numbers is a subset of R (and of any ordered
field, in fact) by identifying the 1 in N with the 1 in axiom (2c) above, the number 2 with 1 + 1, 3
with 1 + 1 + 1, etc. Furthermore, since 0 < 1 (from part (j) of the previous proposition), axiom (3c)
gives, 1 < 2 < 3, etc (in particular all these numbers are distinct). In a similar way, can include Z
and Q as subsets.
We say that a real number x is irrational if x R \ Q, that is, if it is not rational.
Definition 1.4.1 Given x R, define the absolute value of x by
(
x, if x 0;
|x| =
x, if x < 0.
21
The following properties of absolute value follow directly from the definition.
Proposition 1.4.2 Let x, y, M R and suppose M > 0. The following properties hold:
(a) |x| 0;
(b) | x| = |x|;
(c) |xy| = |x||y|;
(d) |x| < M if and only if M < x < M. (The same holds if < is replaced with .)
Proof: We prove (d) and leave the other parts as an exercise.
(d) Suppose |x| < M. In particular, this implies M > 0. We consider the two cases separately:
x 0 and x < 0. Suppose first x 0. Then |x| = x and, hence, M < 0 x = |x| < M. Now suppose
x < 0. Then |x| = x. Therefore, x < M and, so x > M. It follows that M < x < 0 < M.
For the converse, suppose M < x < M. Again, we consider different cases. If x 0, then
|x| = x < M as desired. Next suppose x < 0. Now, M < x implies M > x. Then |x| = x < M.
Note that as a consequence of part (d) above, since |x| |x| we get |x| x |x|.
The next theorem will play an important role in the study of limits.
|x + y| |x| + |y|.
Proof: From the observation above, we have
|x| x |x|
|y| y |y|.
Adding up the inequalities gives
|x| |y| x + y |x| + |y|.
22 1.5 THE COMPLETENESS AXIOM FOR THE REAL NUMBERS
Since |x| |y| = (|x| + |y|), the conclusion follows from Proposition 1.4.2 (d).
Remark 1.4.5 The absolute value has a geometric interpretation when considering the numbers in
an ordered field as points on a line. The number |a| denotes the distance from the number a to 0.
More generally, the number d(a, b) = |a b| is the distance between the points a and b. It follows
easily from Proposition 1.4.2 that d(x, y) 0, and d(x, y) = 0 if and only if x = y. Moreover, the
triangle inequality implies that
Exercises
1.4.1 Prove that n is an even integer if and only if n2 is an even integer. (Hint: prove the if part
by contraposition, that is, prove that if n is odd, then n2 is odd.)
1.4.3 Let a, b, c, d R. Suppose 0 < a < b and 0 < c < d. Prove that ac < bd.
x + y + |x y| x + y |x y|
max{x, y} = and min{x, y} = .
2 2
1.4.7 Let x, y, M R. Prove the following
(a) |x|2 = x2 .
(b) |x| < M if and only if x < M and x < M.
(c) |x + y| = |x| + |y| if and only if xy 0.
x M for all x A.
L x for all x A,
and A is said to be bounded below if it has a lower bound. We also say that A is bounded if it is both
bounded above and bounded below.
It follows that a set A is bounded if and only if there exist M R such that |x| M for all x A
(see Exercise 1.5.1).
Definition 1.5.2 Let A be a nonempty set that is bounded above. We call a number a least upper
bound or supremum of A, if
(1) x for all x A (that is, is an upper bound of A);
(2) If M is an upper bound of A, then M (this means is smallest among all upper bounds).
It is easy to see that if A has a supremum, then it has only one (see Exercise 1.5.2). In this case,
we denote such a number by sup A.
Example 1.5.1
an upper bound of A but M < 4. Choose a number y R such that M < y < 4 and 0 < y. Set
x = y. Then 2 < x < 0 < 1 and, so, y = x2 A. However, y > M which contradicts the
fact that M is an upper bound. Thus 4 M. This proves that 4 = sup A.
Proposition 1.5.1 Let A be a nonempty subset of R that is bounded above. Then = sup A if and
only if
Proof: Suppose first that = sup A. Then clearly (1) holds (since this is identical to condition (1)
in the definition of supremum). Now let > 0. Since < , condition (2) in the definition of
supremum implies that is not an upper bound of A. Therefore, there must exist an element a
of A such that < a as desired.
Conversely, suppose conditions (1) and (2) hold. Then all we need to show is that condition (2)
in the definition of supremum holds. Let M be an upper bound of A and assume, by way of con-
tradiction, that M < . Set = M. By condition (2) in the statement, there is a A such that
a > = M. This contradicts the fact that M is an upper bound. The conclusion now follows.
The following is an axiom of the real numbers and is called the completeness axiom.
The Completeness Axiom. Every nonempty subset A of R that is bounded above has a least upper
bound. That is, sup A exists and is a real number.
This axiom distinguishes the real numbers from all other ordered fields and it is crucial in the
proofs of the central theorems of analysis.
There is a corresponding definition for the infimum of a set.
Definition 1.5.3 Let A be a nonempty subset of R that is bounded below. We call a number a
greatest lower bound or infimum of A, denoted by = inf A, if
(1) x for all x A (that is, is a lower bound of A);
(2) If N is a lower bound of A, then N (this means is largest among all lower bounds).
Using the completeness axiom, we can prove that if a nonempty set is bounded below, then its
infimum exists (see Exercise 1.5.5).
Example 1.5.2
The following proposition is useful when dealing with infima and its proof is completely
analogous to that of Proposition 1.5.1.
Proposition 1.5.2 Let A be a nonempty subset of R that is bounded below. Then = inf A if and
only if
(1) x for all x A;
(2) For any > 0, there exists a A such that a < + .
The following is a basic property of suprema. Additional ones are described in the exercises.
Theorem 1.5.3 Let A and B be nonempty sets and A B. Suppose B is bounded above. Then
sup A sup B.
Proof: Let M be an upper bound for B, then for x B, x M. In particular, it is also true that x M
for x A. Thus, A is also bounded above. Now, since sup B is an upper bound for B, it is also an
upper bound for A. Then, by the second condition in the definition of supremum, sup A sup B as
desired.
It will be convenient for the study of limits of sequences and functions to introduce two additional
symbols.
Definition 1.5.4 The extended real number system consists of the real field R and the two symbols
and . We preserve the original order in R and define
< x <
for every x R
The extended real number system does not form an ordered field, but it is customary to make the
following conventions:
(a) If x is a real number, then
x + = , x + () = .
(b) If x > 0, then x = , x () = .
(c) If x < 0, then x = , x () = .
(d) + = , + () = , = () () = , and () = () = .
We denote the extended real number set by R. The expressions 0 , + (), and () +
are left undefined.
The set R with the above conventions will be convenient to describe results about limits in later
chapters.
Definition 1.5.5 If A 6= 0/ is not bounded above in R, we will write sup A = . If A is not bounded
below in R, we will write inf A = .
With this definition, every nonempty subset of R has a supremum and an infimum in R. To
complete the picture we adopt the following conventions for the empty set: sup 0/ = and
inf 0/ = .
Exercises
1.5.1 Prove that a subset A of R is bounded if and only if there is M R such that |x| M for all
x A.
26 1.6 APPLICATIONS OF THE COMPLETENESS AXIOM
1.5.2 Let A be a nonempty set and suppose and satisfy conditions (1) and (2) in Definition 1.5.2
(that is, both are suprema of A). Prove that = .
1.5.3 For each subset of R below, determine if it is bounded above, bounded below, or both. If it is
bounded above (below) find the supremum (infimum). Justify all your conclusions.
1.5.4 I Suppose A and B are nonempty subsets of R that are bounded above. Define
A + B = {a + b : a A and b B}.
1.5.6 Let A be a nonempty subset of R and R. Define A = {a : a A}. Prove the following
statements:
(a) If > 0 and A is bounded above, then A is bounded above and sup A = sup A.
(b) If < 0 and A is bounded above, then A is bounded below and inf A = sup A.
1.5.7 Suppose A and B are nonempty subsets of R that are bounded below. Prove that A + B is
bounded below and
1.5.8 Let A, B be nonempty subsets of R that are bounded below. Prove that if A B, then
inf A inf B.
Theorem 1.6.1 The Archimedean Property. The set of natural numbers is unbounded above.
Proof: Let us assume by contradiction that N is bounded above. Since N is nonempty,
= sup N
exists and is a real number. By Proposition 1.5.1 (with = 1), there exists n N such that
1 < n .
Proof: (a) Fix any x R. Since N is not bounded above, x cannot be an upper bound of N. Thus,
there exists n N such that x < n.
(b) Fix any > 0. Then 1/ is a real number. By (1), there exists n N such that
1/ < n.
A = {n N : x < n}.
From part (a), A is nonempty. Since A is a subset of N, by the Well-Ordering Property of the
natural numbers, A has a smallest element `. In particular, x < ` and ` 1 is not in A. Since ` N,
either ` 1 N or ` 1 = 0. If ` 1 N, since ` 1 6 A we get ` 1 x. If ` 1 = 0, we have
` 1 = 0 < x. Therefore, in both cases we have ` 1 x < ` and the conclusion follows with m = `.
In the case x 0, by part (1), there exists N N such that
|x| < N.
In this case, N < x < N, so x + N > 0. Then, by the result just obtained for positive numbers, there
exists a natural number k such that k 1 x + N < k. This implies
k N 1 x < k N.
1
1 < 1 .
n
This proves condition (2) with a = 1 1n and completes the proof.
Theorem 1.6.3 The Density Property of Q. If x and y are two real numbers such that x < y, then
there exists a rational number r such that
x < r < y.
Proof: We are going to prove that there exist an integer m and a positive integer n such that
or, equivalently,
Since y x > 0, by Theorem 1.6.2 (3), there exists n N such that 1 < n(y x). Then
ny = nx + n(y x) > nx + 1.
m 1 nx < m.
r2
2= ,
s2
29
and, hence,
2s2 = r2 . (1.3)
It follows that r2 is an even integer. Therefore, r is an even integer (see Exercise 1.4.1). We can then
write r = 2 j for some integer j. Hence r2 = 4 j2 . Substituting in (1.3), we get s2 = 2 j2 . Therefore,
s2 is even. We conclude as before that s is even. Thus, both r and s have a common factor, which is a
contradiction.
The next theorem shows that irrational numbers are as ubiquitous as rational numbers.
Theorem 1.6.5 Let x and y be two real numbers such that x < y. Then there exists an irrational
number t such that
x < t < y.
This implies
x < r + 2 < y.
Since r is rational, the number t = r + 2 is irrational (see Exercise 1.6.4) and x < t < y.
Exercises
1.6.1 For each sets below determine if it is bounded above, bounded below, or both. If it is bounded
above (below) find the supremum (infimum). Justify all your conclusions.
3n
(a) :nN
n+4
n 1
(b) (1) + : n N
n
n
n (1)
(c) (1) :nN
n
1.6.2 I Let r be a rational number such that 0 < r < 1. Prove that there is n N such that
1 1
<r .
n+1 n
1
1.6.3 Let x R. Prove that for every n N, there is r Q such that |x r| < .
n
30 1.6 APPLICATIONS OF THE COMPLETENESS AXIOM
1.6.4 Prove that if x is a rational number and y is an irrational number, then x + y is irrational. What
can you say about xy?
1.6.5 Prove that in between two real numbers a and b with a < b, there are infinitely many rational
numbers.
1.6.6 Prove that in between two real numbers a and b with a < b, there are infinitely many irrational
numbers.
CONVERGENCE
LIMIT THEOREMS
MONOTONE SEQUENCES
THE BOLZANO-WEIERSTRASS THEOREM
LIMIT SUPERIOR AND LIMIT INFERIOR
OPEN SETS, CLOSED SETS, COMPACT SETS, AND LIMIT
POINTS
2. SEQUENCES
We introduce the notion of limit first through sequences. As mentioned in Chapter 1, a sequence
is just a function with domain N. More precisely, a sequence of elements of a set A is a function
f : N A. We will denote the image of n under the function with subscripted variables, for example,
an = f (n). We will also denote sequences by {an } n=1 , {an }n , or even {an }. Each value an is called
a term of the sequence, more precisely, the n-th term of the sequence.
Example 2.0.1 Consider the sequence an = 1n for n N. This is a sequence of rational numbers.
On occasion, when the pattern is clear, we may list the terms explicitly as in
1 1 1 1
1, , , , , . . .
2 3 4 5
n
Example 2.0.2 Let an = (1) for n N. This is a sequence of integers, namely,
1, 1, 1, 1, 1, 1, . . .
Note that the sequence takes on only two values. This should not be confused with the two-element
set {1, 1}.
2.1 CONVERGENCE
Definition 2.1.1 Let {an } be a sequence of real numbers. We say that the sequence {an } converges
to a R if, for any > 0, there exists a positive integer N such that for any n N with n N, one
has
|an a| < (or equivalently, a < an < a + ).
In this case, we call a the limit of the sequence (see Theorem 2.1.3 below) and write limn an = a.
If the sequence {an } does not converge, we call the sequence divergent.
Remark 2.1.1 It follows directly from the definition, using the Archimedean property, that a
sequence {an } converges to a if and only if for any > 0, there exists a real number N such that for
any n N with n > N, one has
|an a| < .
32 2.1 CONVERGENCE
Example 2.1.1 Let an = 1n for n N. We claim that limn an = 0. We verify it using the
definition. Let > 0. Choose an integer N > 1/. (Note that such an integer N exists due to the
Archimidean Property.) Then, if n N, we get
1 1 1 1
|an 0| = = < = .
n n N 1/
Example 2.1.2 We now generalize the previous example as follows. Let > 0 and consider the
sequence given by
1
an = for n N .
n
Let > 0. Choose an integer N such that N > ( 1 )1/ . For every n N, one has n > ( 1 )1/ and,
hence, n > 1 . This implies
1
0 = 1 < 1 = .
n n 1/
We conclude that limn an = 0.
Example 2.1.3 Consider the sequence {an } where
3n2 + 4
an = .
2n2 + n + 5
We will prove directly from the definition that this sequence converges to a = 32 .
Let > 0. We first search for a suitable N. To that end, we simplify and estimate the expression
|an a|. Notice that
2 2 2
an 3 = 3n + 4 3 = 2(3n + 4) 3(2n + n + 5) = 7 3n
2 2n2 + n + 5 2 2(2n2 + n + 5) 2(2n2 + n + 5)
3n + 7 10n 10
= 2
< 2 = .
2(2n + n + 5) 4n 4n
10
To guarantee that the last expression is less than , it will suffice to choose N > . Indeed, if n N,
4
we get
10 10 10
|an a| < 10 = .
4n 4N 4 4
4n2 1
an = .
3n2 n
We claim limn] an = 43 .
Let > 0. We search for a suitable N. First notice that
2
4n 1 4 12n2 3 12n2 + 4n 4n 3
3n2 n 3 = = 3(3n2 n) .
3(3n2 n)
33
4
Thus, if N > 6 , we have, for n N
2
4n 1 4 4 4
3n2 n 3 6n 6N < .
n2 + 5
an = .
4n2 + n
We prove directly from the definition that {an } converges to 14 .
Let > 0. Now,
2
1 4n2 + 20 4n2 n
n +5 |20 n|
4n2 + n 4 = = 4(4n2 + n) .
4(4n2 + n)
If n 20, then |20 n| = n 20. Therefore, for such n we have
2
n +5 1 n 20 n 1
4n2 + n 4 = 4(4n2 + n) 16n2 = 16n .
1
Choose N > max{ 16 , 20}. Then, for n N we get
2
n +5 1 1 1
4n2 + n 4 16n 16N < .
The following result is quite useful in proving certain inequalities between numbers.
Lemma 2.1.4 Given real numbers a, b, then a b if and only if a < b + for all > 0.
Proof: Suppose a < b + for all > 0. And suppose, by way of contradiction, that a > b. then set
0 = a b. Then 0 > 0. By assumption, we should have a < b + 0 = b + a b = a, which is a
contradiction. It follows that a b.
The other direction follows immediately from the order axioms.
The following comparison theorem shows that (non-strict) inequalities are preserved in the
limit.
Theorem 2.1.5 Comparison Theorem. Suppose {an } and {bn } converge to a and b, respec-
tively, and an bn for all n N. Then a b.
Proof: For any > 0, there exist N1 , N2 N such that
a < an < a + , for n N1 ,
2 2
b < bn < b + , for n N2 .
2 2
Choose N > max{N1 , N2 }. Then
a < aN bN < b + .
2 2
Thus, a < b + for any > 0. Using Lemma 2.1.4 we conclude a b.
Theorem 2.1.6 The Squeeze Theorem. Suppose the sequences {an }, {bn }, and {cn } satisfy
an bn cn for all n N,
` < an < ` +
` < cn < ` +
` < an bn cn < ` + ,
Definition 2.1.2 A sequence {an } is bounded above if the set {an : n N} is bounded above.
Similarly, the sequence {an } is bounded below if the set {an : n N} is bounded below. We say that
the sequence {an } is bounded if the set {an : n N} is bounded, that is, if it is both bounded above
and bounded below.
It follows from the observation after Definition 1.5.1 that the sequence {an } is bounded if and
only if there is M R such that |an | M for all n N.
35
Since |an | |a| ||an | |a|| |an a|, this implies |an | < 1 + |a| for all n N. Set
subsequence of {an }
n=1 if there exists a sequence of increasing positive integers
Theorem 2.1.9 If a sequence {an } converges to a, then any subsequence {ank } of {an } also con-
verges to a.
Proof: Suppose {an } converges to a and let > 0 be given. Then there exists N such that
|ank a| < .
Example 2.1.7 Let an = (1)n for n N. Then the sequence {an } is divergent. Indeed, suppose
by contradiction that
lim an = `.
n
36 2.1 CONVERGENCE
Then every subsequence of {an } converges to a number ` R. From the previous theorem, it follows,
in particular, that
Remark 2.1.10 Given a positive integer k0 , it will be convenient to also talk about the sequence
{an }nk0 , that is, a function defined only for the integers greater than or equal to k0 . For simplicity
of notation, we may also denote this sequence by {an } whenever the integer k0 is clear from the
context. For instance, we talk of the sequence {an } given by
n+1
an = .
(n 1)(n 2)
although a1 and a2 are not defined. In all cases, the sequence must be defined from some integer
onwards.
Exercises
2.1.1 Prove the following directly from the definition of limit.
2n2 + 2
(a) limn = 0.
3n3 + 1
n2 + 1
(b) limn 2 = 1.
5n + n + 1 5
2n3 + 1 1
(c) limn 3 = .
4n n 2
3n2 + 5 1
(d) limn 2 = .
6n + n 2
4n2 1
(e) limn 2 = 4.
n n
2.1.2 Prove that if {an } is a convergent sequence, then {|an |} is a convergent sequence. Is the
converse true?
2.1.3 Let {an } be a sequence. Prove that if the sequence {|an |} converges to 0, then {an } also
converges to 0.
sin n
2.1.4 Prove that limn = 0.
n
2.1.5 Let {xn } be a bounded sequence and let {yn } be a sequence that converges to 0. Prove that
the sequence {xn yn } converges to 0.
2.1.6 Prove that the following limits are 0. (Hint: use Theorem 2.1.6.)
n + cos(n2 3)
(a) lim .
n 2n2 + 1
37
3n
(b) lim .
n n!
n!
(c) lim n .
n n
n2
(d) lim n . (Hint: see Exercise 1.3.4(c)).
n 3
2.1.7 Prove that if limn an = ` > 0, then there exists N N such that an > 0 for all n N.
a
2.1.8 I Prove that if limn an = ` 6= 0, then limn an+1
n
= 1. Is the conclusion still true if ` = 0?
2.1.9 Let {an } be a sequence of real numbers such that limn an = 3. Use Definition 2.1.1 to
prove the following
(a) Prove that limn an = ` if and only if limk a2k = ` and limk a2k+1 = `.
(b) Prove that limn an = ` if and only if limk a3k = `, limk a3k+1 = `, and
limk a3k+2 = `.
a1 + a2 + . . . + an
bn = .
n
(a) Prove that if limn an = `, then limn bn = `.
(b) Find a counterexample to show that the converse does not hold in general.
Theorem 2.2.1 Let {an } and {bn } be sequences of real numbers and let k be a real number. Suppose
{an } converges to a and {bn } converges to b. Then the sequences {an + bn }, {kan }, and {an bn }
converge and
Proof: (a) Fix any > 0. Since {an } converges to a, there exists N1 N such that
|an a| < for all n N1 .
2
Similarly, there exists N2 N such that
|bn b| < for all n N2 .
2
Let N = max{N1 , N2 }. For any n N, one has
|(an + bn ) (a + b)| |an a| + |bn b| < + = .
2 2
Therefore, limn (an + bn ) = a + b. This proves (a).
(b) If k = 0, then ka = 0 and kan = 0 for all n. The conclusion follows immediately. Suppose
next that k 6= 0. Given > 0, let N N be such that |an a| < for n N. Then for n N,
|k|
|kan ka| = |k||an a| < . It follows that limn (kan ) = ka as desired. This proves (b).
(c) Since {an } is convergent, it follows from Theorem 2.1.7 that it is bounded. Thus, there exists
M > 0 such that
1 1
lim = .
n bn b
39
3n2 2n + 5
an = . (2.3)
1 4n + 7n2
Dividing numerator and denominator by n2 , we can write
3 2/n + 5/n2
an = (2.4)
1/n2 4/n + 7
Therefore, by the limit theorems above,
This implies
b1
0 < an 1 .
n
40 2.2 LIMIT THEOREMS
b1
For each > 0, choose N > . It follows that for n N,
b1 b1
|an 1| = an 1 < < .
n N
Thus, limn an = 1.
In the case where b = 1, it is obvious that an = 1 for all n and, hence, limn an = 1.
1
If 0 < b < 1, let c = and define
b
n
1
xn = c= .
an
Since c > 1, it has been shown that limn xn = 1. This implies
1
lim an = lim = 1.
n n xn
Exercises
2.2.1 Find the following limits:
3n2 6n + 7
(a) lim ,
n 4n2 3
1 + 3n n3
(b) lim 3 .
n 3n 2n2 + 1
(a) If {an } and {bn } are convergent sequences, then {an + bn } is a convergent sequence.
(b) If {an } and {bn } are divergent sequences, then {an + bn } is divergent sequence.
(c) If {an } and {bn } are convergent sequences, then {an bn } is a convergent sequence.
(d) If {an } and {bn } are divergent sequences, then {an bn } is a divergent sequence.
(e) If {an } and {an + bn } are convergent sequences, then {bn } is a convergent sequence.
(f) If {an } and {an + bn } are divergent sequences, then {bn } is a divergent sequence.
It is called decreasing if
an an+1 for all n N.
If {an } is increasing or decreasing, then it is called a monotone sequence.
The sequence is called strictly increasing (resp. strictly decreasing) if an < an+1 for all n N
(resp. an > an+1 for all n N).
It is easy to show by induction that if {an } is an increasing sequence, then an am whenever
n m.
Theorem 2.3.1 Monotone Convergence Theorem. Let {an } be a sequence of real numbers.
The following hold:
Proof: (a) Let {an } be an increasing sequence that is bounded above. Define
A = {an : n N}.
Then A is a subset of R that is nonempty and bounded above and, hence, sup A exists. Let ` = sup A
and let > 0. By Proposition 1.5.1, there exists N N such that
` < aN `.
On the other hand, since ` is an upper bound for A, we have an ` for all n. Thus,
Therefore, limn an = `.
(b) Let {an } be a decreasing sequence that is bounded below. Define
bn = an .
42 2.3 MONOTONE SEQUENCES
Then {bn } is increasing and bounded above (if M is a lower bound for {an }, then M is an upper
bound for {bn }). Let
` = lim bn = lim (an ).
n n
Remark 2.3.2 It follows from the proof of Theorem 2.3.1 that if {an } is increasing and bounded
above, then
lim an = sup{an : n N}.
n
lim an = 0.
n
This is clear if r = 0. Let us first consider the case where 0 < r < 1. Then 0 an+1 = ran an for
all n. Therefore, {an } is decreasing and bounded below. By Theorem 2.3.1, the sequence converges.
Let
` = lim an .
n
Since an+1 = ran for all n, taking limits on both sides gives ` = r`. Thus, (1 r)` = 0 and, hence,
` = 0. In the general case, we only need to consider the sequence defined by bn = |an | for n N; see
Exercise 2.1.3.
Example 2.3.2 Consider the sequence {an } defined as follows:
a1 = 2 (2.6)
an + 5
an+1 = for n 1. (2.7)
3
First we will show that the sequence is increasing. We prove by induction that for all n N,
an < an+1 . Since a2 = a13+5 = 73 > 2 = a1 , the statement is true for n = 1. Next, suppose ak < ak+1
for some k N. Then ak + 5 < ak+1 + 5 and (ak + 5)/3 < (ak+1 + 5)/3. Therefore,
ak + 5 ak+1 + 5
ak+1 = < = ak+2 .
3 3
It follows by induction that the sequence is increasing.
Next we prove that the sequence is bounded by 3. Again, we proceed by induction. The statement
is clearly true for n = 1. Suppose that ak 3 for some k N. Then
ak + 5 3 + 5 8
ak+1 = = 3.
3 3 3
It follows that an 3 for all n N.
43
From the Monotone Convergence Theorem, we deduce that there is ` R such that limn an = `.
Since the subsequence {ak+1 }
k=1 also converges to `, taking limits on both sides of the equation
in (2.7), we obtain
`+5
`= .
3
Therefore, 3` = ` + 5 and, hence, ` = 5/2.
Example 2.3.3 The number e. Consider the sequence {an } given by
1 n
an = 1 + , n N.
n
The corresponding expression for an+1 has one more term and each factor (1 nk ) is replaced by the
k
larger factor (1 n+1 ). It is then clear that an < an+1 for all n N. Thus, the sequence is increasing.
Moreover,
1 1 1
an 1 + 1 + + ++
2! 3! n!
1 1 1
< 2+ + ++
1.2 2.3 (n 1) n
n1
1 1 1
= 2+ = 3 < 3.
k=1 k k+1 n
(a) n=1 In 6= 0.
T
/
(b) If, in addition, the lengths of the intervals In converge to zero, then
T
n=1 In consists of a single
point.
Proof: Let {In } be as in the statement with In = [an , bn ]. In particular, an bn for all n N. Given
that In+1 In , we have an an+1 and bn+1 bn for all n N. This shows that {an } is an increasing
sequence bounded above by b1 and {bn } is a decreasing sequence bounded below by a1 . By the
44 2.3 MONOTONE SEQUENCES
Monotone Convergence Theorem (Theorem 2.3.1), there exist a, b R such that limn an = a
and limn bn = b. Since an bn for all n, by Theorem 2.1.5, we get a b. Now, we also have
an a and b bn for all n N (since {an } is increasing and {bn } is decreasing). This shows that if
a x b, then x In for all n N. Thus, [a, b]
T
n=1 In . It follows that n=1 In 6= 0.
T
/ This proves
part (a).
Now note also that
T
n=1 In [a, b]. Indeed, if x n=1 In , then x In for all n. Therefore,
T
an x bn for all n. Using Theorem 2.1.5, we conclude a x b. Thus, x [a, b]. This proves the
desired inclusion and, hence,
T
n=1 In = [a, b].
We now prove part (b). Suppose the lengths of the intervals In converge to zero. This means
bn an 0 as n . Then b = limn bn = limn [(bn an ) + an ] = a. It follows that
T
n=1 In =
{a} as desired.
When a monotone sequence is not bounded, it does not converge. However, the behavior follows
a clear pattern. To make this precise we provide the following definition.
Definition 2.3.2 A sequence {an } is said to diverge to if for every M R, there exists N N such
that
an > M for all n N.
In this case, we write limn an = . Similarly, we say that {an } diverges to and write
limn an = if for every M R, there exists N N such that
an < M for all n N.
Remark 2.3.4 We should not confuse a sequence that diverges to (that is, one that satisfies the
previous definition), with a divergent sequence (that is, one that does not converge).
+1 2
Example 2.3.4 Consider the sequence an = n 5n . We will show, using Definition 2.3.2, that
limn an = .
Let M R. Note that
n2 + 1 n 1 n
= + .
5n 5 5n 5
Choose N > 5M. Then, if n N, we have
n N
an > M.
5 5
Th following result completes the description of the behavior of monotone sequences.
Theorem 2.3.5 If a sequence {an } is increasing and not bounded above, then
lim an = .
n
Proof: Fix any real number M. Since {an } is not bounded above, there exists N N such that
aN M. Then
an aN M for all n N
because {an } is increasing. Therefore, limn an = . The proof for the second case is similar.
45
Theorem 2.3.6 Let {an }, {bn }, and {cn } be sequences of real numbers and let k be a constant.
Suppose
Then
M
an for all n N1 .
2
Similarly, there exists N2 N such that
M
bn for all n N1 .
2
Let N = max{N1 , N2 }. Then it is clear that
an + bn M for all n N.
Exercises
2.3.1 I Let a1 = 2. Define
p
an+1 = an + 2 for n 1.
2.3.2 B Prove that each of the following sequences is convergent and find its limit.
an + 3
(a) a1 = 1 and an+1 = for n 1.
2
(b) a1 = 6 and an+1 = an + 6 for n 1.
1 1
(c) an+1 = 2an + 2 , n 1, a1 > 0.
3 an
1 b
(d) an+1 = an + , b > 0.
2 an
2.3.3 B Prove that each of the following sequences is convergent and find its limit.
p
q p
(a) 2; 2 2; 2 2 2;
1 1
(b) 1/2; ; ;
2 + 1/2 1
2+
2 + 1/2
1 1 1
an = 1 + + ++ , n N.
2! 3! n!
2.3.5 B Let a and b be two positive real numbers with a < b. Define a1 = a, b1 = b, and
p an + bn
an+1 = an bn and bn+1 = for n 1.
2
Show that {an } and {bn } are convergent to the same limit.
2n2 + n + 1
(a) lim = .
n n2
1 3n2
(b) lim = .
n n + 2
Theorem 2.4.1 Bolzano-Weierstrass. Every bounded sequence {an } of real numbers has a
convergent subsequence.
Proof: Suppose {an } is a bounded sequence. Define A = {an : n N} (the set of values of the
sequence {an }). If A is finite, then at least one of the elements of A, say x, must be equal to an for
infinitely many choices of n. More precisely, Bx = {n N : an = x} is infinite. We can then define
a convergent subsequence as follows. Pick n1 such that an1 = x. Now, since Bx is infinite, we can
choose n2 > n1 such that an2 = x. Continuing in this way, we can define a subsequence {ank } which
is constant, equal to x and, thus, converges to x.
Suppose now that A is infinite. First observe there exist c, d R such that c an d for all
n N, that is, A [c, d].
We define a sequence of nonempty nested closed bounded intervals as follows. Set I1 = [c, d].
Next consider the two subintervals [c, c+d c+d c+d
2 ] and [ 2 , d]. Since A is infinite, at least one of A [c, 2 ]
c+d c+d c+d c+d
or A [ 2 , d] is infinite. Let I2 = [c, 2 ] if A [c, 2 ] is infinite and I2 = [ 2 , d] otherwise.
Continuing in this way, we construct a nested sequence of nonempty closed bounded intervals {In }
such that In A is infinite and the length of In tends to 0 as n .
We now construct the desired subsequence of {an } as follows. Let n1 = 1. Choose n2 > n1 such
that an2 I2 . This is possible since I2 A is infinite. Next choose n3 > n2 such that an3 I3 . In this
way, we obtain a subsequence {ank } such that ank Ik for all k N.
Set In = [cn , dn ]. Then limn (dn cn ) = 0. We also know from the proof of the Monotone
Convergence Theorem (Theorem 2.3.1), that {cn } converges. Say ` = limn cn . Thus, limn dn =
limn [(dn cn ) + cn ] = ` as well. Since ck ank dk for all k N, it follows from Theorem 2.1.5
that limk ank = `. This completes the proof.
Definition 2.4.1 (Cauchy sequence). A sequence {an } of real numbers is called a Cauchy sequence
if for any > 0, there exists a positive integer N such that for any m, n N, one has
|am an | < .
lim an = a.
n
Then for any > 0, there exists a positive integer N such that
In particular,
Let M = max{|a1 |, . . . , |aN1 |, 1 + |aN |}. Then, for n = 1, . . . , N 1, we clearly have |an | M.
Moreover, for n N,
Let {ank } be a subsequence of {an } that converges to some point a. For the above , there exists a
positive number K such that
Remark 2.4.6 It follows from Definition 2.4.1 that {an } is a Cauchy sequence if and only if for
every > 0, there exists N N such that
Definition 2.4.2 A sequence {an } is called contractive if there exists k [0, 1) such that
Thus,
for all n, p N. Since kn1 0 as n (independently of p), this implies {an } is a Cauchy
sequence and, hence, it is convergent.
Example 2.4.1 The condition k < 1 in the previous theorem is crucial. Consider the following
example. Let an = ln n for all n N. Since 1 < n+2 n+1
n+1 < n for all n N and the natural logarithm is
an increasing function, we have
n + 2 n+2
|an+2 an+1 | = | ln(n + 2) ln(n + 1)| = ln
= ln
n+1 n+1
n+1
< ln = | ln(n + 1) ln n| = |an+1 an |.
n
Therefore, the inequality in Definition 2.4.2 is satisfied with k = 1, yet the sequence {ln n} does
not converge.
Exercises
2.4.1 I Determine which of the following are Cauchy sequences.
(a) an = (1)n .
(b) an = (1)n /n.
(c) an = n/(n + 1).
(d) an = (cos n)/n.
n cos(3n2 + 2n + 1)
an =
n+1
has a convergent subsequence.
50 2.5 LIMIT SUPERIOR AND LIMIT INFERIOR
2.4.3 Let f : [0, ) R be such that f (x) > 0 for all x. Define
f (n)
an = .
f (n) + 1
Prove that the sequence an has a convergent subsequence.
2.4.4 Define
1 + 2n
an = for n N .
2n
Prove that the sequence an is contractive.
2.4.5 Let r R be such that |r| < 1. Define an = rn for n N. Prove that the sequence {an } is
contractive.
1
2.4.6 Prove that the sequence is not contractive.
n n=1
sn = sup{ak : k n} (2.8)
and
Definition 2.5.1 Let {an } be a sequence. Then the limit superior of {an }, denoted by lim supn an ,
is defined by
lim sup an = lim sup{ak : k n}.
n n
lim sn = ,
n
lim tn = ,
n
Remark 2.5.3 By Theorem 2.5.2, we see that if {an } is not bounded above, then
lim sup an = .
n
lim inf an = .
n
Theorem 2.5.4 Let {an } be a sequence and ` R. The following are equivalent:
lim ank = `.
k
Proof: Suppose lim supn an = `. Then limn sn = `, where sn is defined as in (2.8). For any
> 0, there exists N N such that
1 1
` < ank < ` + .
k k
Therefore, limk ank = `.
We now prove the converse. Given any > 0, there exists N N such that
sm = sup{ak : k m} ` + .
Theorem 2.5.5 Let {an } be a sequence and ` R. The following are equivalent:
lim ank = `.
k
The following corollary follows directly from Theorems 2.5.4 and 2.5.5.
Corollary 2.5.6 Let {an } be a sequence. Then
lim ank = `0 .
k
Then `0 `.
(b) Suppose lim infn an = ` and {ank } is a subsequence of {an } with
lim ank = `0 .
k
Then `0 `.
Proof: We prove only (a) because the proof of (b) is similar. By Theorem 2.5.4 and the definition
of limits, for any > 0, there exists N N such that
Each element of the set A called a subsequential limit of the sequence {an }. It follows from Theorem
2.5.4, Theorem 2.5.5, and Corollary 2.5.7 that A 6= 0/ and
Theorem 2.5.9 Suppose {an } is a sequence such that an > 0 for every n N and
an+1
lim sup = ` < 1.
n an
Then limn an = 0.
Proof: Choose > 0 such that ` + < 1. Then there exists N N such that
an+1
< ` + for all n N.
an
Theorem 2.5.10 Suppose {an } is a sequence such that an > 0 for every n N and
an+1
lim inf = ` > 1.
nan
Then limn an = .
Example 2.5.1 Given a real number , define
n
an = , n N.
n!
When = 0, it is obvious that limn an = 0. Suppose > 0. Then
an+1
lim sup = lim = 0 < 1.
n an n n+1
Thus, limn an = 0. In the general case, we can also show that limn an = 0 by considering
limn |an | and using Exercise 2.1.3.
Exercises
All sequences in this set of exercises are assumed to be in R.
2.5.1 Find lim supn an and lim infn an for each sequence.
(a) an = (1)n .
n
(b) an = sin .
2
1 + (1)n
(c) an = .
n
n
(d) an = n sin .
2
2.5.2 For a sequence {an }, prove that:
(a) lim infn an = if and only if limn an = .
(b) lim supn an = if and only if limn an = .
2.5.5 Let {an } be a convergent sequence and let {bn } be an arbitrary sequence. Prove that
(a) lim supn (an + bn ) = lim supn an + lim supn bn = limn an + lim supn bn .
(b) lim infn (an + bn ) = lim infn an + lim infn bn = limn an + lim infn bn .
55
2.6 OPEN SETS, CLOSED SETS, COMPACT SETS, AND LIMIT POINTS
The open ball in R with center a R and radius > 0 is the set
B(a; ) = (a , a + ).
Definition 2.6.1 A subset A of R is said to be open if for each a A, there exists > 0 such that
B(a; ) A.
Example 2.6.1 (1) Any open interval A = (c, d) is open. Indeed, for each a A, one has c < a < d.
Let
= min{a c, d a}.
Then
B(a; ) = (a , a + ) A.
Therefore, A is open.
(2) The sets A = (, c) and B = (c, ) are open, but the set C = [c, ) is not open. The reader can
easily verify that A and B are open. Let us show that C is not open. Assume by contradiction that C
is open. Then, for the element c C, there exists > 0 such that
B(c; ) = (c , c + ) C.
a G0 .
B(a; ) G0 .
This implies
B(a; ) G
because G0 G. Thus, G is open.
56 2.6 OPEN SETS, CLOSED SETS, COMPACT SETS, AND LIMIT POINTS
(c) Suppose Gi , i = 1, . . . , n, are open subsets of R. Let us show that the set
n
\
G= Gi
i=1
is also open. Take any a G. Then a Gi for i = 1, . . . , n. Since each Gi is open, there exists i > 0
such that
B(a; i ) Gi .
Let = min{i : i = 1, . . . , n}. Then > 0 and
B(a; ) G.
Thus, G is open.
Proof: The proofs for these are simple using the De Morgans law. Let us prove, for instance, (b).
Let {S : I} be a collection of closed sets. We will prove that the set
\
S= S
I
Thus, Sc is open because it is a union of opens sets in R (Theorem 2.6.1(b)). Therefore, S is closed.
Example 2.6.3 It follows from part (c) and Example 2.6.2 that any finite set is closed.
Theorem 2.6.3 A subset A of R is closed if and only if for any sequence {an } in A that converges
to a point a R, it follows that a A.
Proof: Suppose A is a closed subset of R and {an } is a sequence in A that converges to a. Suppose by
contradiction that a 6 A. Since A is closed, there exists > 0 such that B(a; ) = (a , a + ) Ac .
Since {an } converges to a, there exists N N such that
a < aN < a + .
Let us now prove the converse. Suppose by contradiction that A is not closed. Then Ac is not
open. Since Ac is not open, there exists a Ac such that for any > 0, one has B(a; ) A 6= 0. / In
particular, for such an a and for each n N, there exists an B(a; 1n ) A. It is clear that the sequence
{an } is in A and it is convergent to a (because |an a| < n1 , for all n N). This is a contradiction
since a / A. Therefore, A is closed.
Theorem 2.6.4 If A is a nonempty subset of R that is closed and bounded above, then max A exists.
Similarly, if A is a nonempty subset of R that is closed and bounded below, then min A exists
Proof: Let A be a nonempty closed set that is bounded above. Then sup A exists. Let m = sup A. To
complete the proof, we will show that m A. Assume by contradiction that m / A. Then m Ac ,
which is an open set. So there exists > 0 such that
(m , m + ) Ac .
m < a m.
This contradicts the fact that m is the least upper bound of A (see Proposition 1.5.1). Therefore,
max A exists.
Definition 2.6.3 A subset A of R is called compact if for every sequence {an } in A, there exists a
subsequence {ank } that converges to a point a A.1
Example 2.6.4 Let a, b R, a b. We show that the set A = [a, b] is compact. Let {an } be a
sequence in A. Since a an b for all n, then the sequence is bounded. By the Bolzano-Weierstrass
theorem (Theorem 2.4.1), we can obtain a convergent subsequence {ank }. Say, limk ank = s. We
now must show that s A. Since a ank b for all k, it follows from Theorem 2.1.5, that a s b
and, hence, s A as desired. We conclude that A is compact.
|an | n.
Since A is compact, there exists a subsequence {ank } that converges to some a A. Then
Therefore, limk |ank | = . This is a contradiction because {|ank |} converges to |a|. Thus A is
bounded.
Let us now show that A is closed. Let {an } be a sequence in A that converges to a point a R.
By the definition of compactness, {an } has a subsequence {ank } that converges to b A. Then
a = b A and, hence, A is closed by Theorem 2.6.3.
For the converse, suppose A is closed and bounded and let {an } be a sequence in A. Since A
is bounded, the sequence is bounded and, by the Bolzano-Weierstrass theorem (Theorem 2.4.1), it
1 This definition of compactness is more commonly referred to as sequential compactness.
58 2.6 OPEN SETS, CLOSED SETS, COMPACT SETS, AND LIMIT POINTS
has a convergent subsequence, {ank }. Say, limk ank = a. It now follows from Theorem 2.6.3 that
a A. This shows that A is compact as desired.
Theorem 2.6.6 Any infinite bounded subset of R has at least one limit point.
Proof: Let A be an infinite subset of R and let {an } be a sequence of A such that
am 6= an for m 6= n
(see Theorem 1.2.7). Since {an } is bounded, by the Bolzano-Weierstrass theorem (Theorem 2.4.1),
it has a convergent subsequence {ank }. Set b = limk ank . Given > 0, there exists K N such
that ank B(b; ) for k K. Since the set {ank : k K} is infinite, it follows that b is a limit point of
A.
The following definitions and results provide the framework for discussing convergence within
subsets of R.
Definition 2.6.5 Let D be a subset of R. We say that a subset V of D is open in D if for every a V ,
there exists > 0 such that
B(a; ) D V.
Theorem 2.6.7 Let D be a subset of R. A subset V of D is open in D if and only if there exists an
open subset G of R such that
V = D G.
Proof: Suppose V is open in D. By definition, for every a V , there exists a > 0 such that
B(a; a ) D V.
Define
G = aV B(a; a )
59
V G D = aV [B(a; a ) D] V.
Therefore, V = G D.
Let us now prove the converse. Suppose V = G D, where G is an open set. For any a V , we
have a G, so there exists > 0 such that
B(a; ) G.
It follows that
B(a; ) D G D = V.
Example 2.6.7 Let D = [0, 1) and V = [0, 12 ). We can write V = D (1, 21 ). Since (1, 12 ) is
open in R, we conclude from Theorem 2.6.7 that V is open in D. Notice that V itself is not an open
subset of R.
The following theorem is now a direct consequence of Theorems 2.6.7 and 2.6.1.
Theorem 2.6.9 Let D be a subset of R. A subset K of D is closed in D if and only if there exists a
closed subset F of R such that
K = D F.
Proof: Suppose K is a closed set in D. Then D \ K is open in D. By Theorem 2.6.7, there exists an
open set G such that
D \ K = D G.
It follows that
K = D \ (D \ K) = D \ (D G) = D \ G = D Gc .
D \ K = D \ (D F) = D \ F = D F c .
Since F c is an open subset of R, applying Theorem 2.6.7 again, one has that D \ K is open in D.
Therefore, K is closed in D by definition.
60 2.6 OPEN SETS, CLOSED SETS, COMPACT SETS, AND LIMIT POINTS
Example 2.6.8 Let D = [0, 1) and K = [ 12 , 1). We can write K = D [ 12 , 2]. Since [ 12 , 2] is closed
in R, we conclude from Theorem 2.6.9 that K is closed in D. Notice that K itself is not a closed
subset of R.
Corollary 2.6.10 Let D be a subset of R. A subset K of D is closed in D if and only if for every
sequence {xk } in K that converges to a point x D it follows that x K.
Proof: Let D be a subset of R. Suppose K is closed in D. By Theorem 2.6.9, there exists a closed
subset F of R such that
K = D F.
Let {xk } be a sequence in K that converges to a point x D. Since {xk } is also a sequence in F and
F is a closed subset of R, x F. Thus, x D F = K.
Let us prove the converse. Suppose by contradiction that K is not closed in D or D \ K is not
open in D. Then there exists x D \ K such that for every > 0, one has
B(x; ) D * D \ K.
Example 2.6.9 Consider the set D = [0, 1) and the subset A = [ 12 , 1). Clearly, A is bounded. We
showed in Example 2.6.8 that A is closed in D. However, A is not compact. We show this by finding
a sequence {an } in A for which no subsequence converges to a point in A.
1
Indeed, consider the sequence an = 1 2n for n N. Then an A for all n. Moreover, {an }
converges to 1 and, hence, every subsequence also converges to 1. Since 1 6 A, it follows that A is
not compact.
Exercises
2.6.1 Prove that a subset A of R is open if and only if for any x A, there exists n N such that
(x 1/n, x + 1/n) A.
2.6.2 Prove that the interval [0, 1) is neither open nor closed.
2.6.3 I Prove that if A and B are compact subsets of R, then A B is a compact set.
61
2.6.4 Prove that the intersection of any collection of compact subsets of R is compact.
2.6.5 Find all limit points and all isolated points of each of the following sets:
2.6.6 Let D = [0, ). Classify each subset of D below as open in D, closed in D, neither or both.
Justify your answers.
In this chapter, we extend our analysis of limit processes to functions and give the precise
definition of continuous function. We derive rigorously two fundamental theorems about continuous
functions: the extreme value theorem and the intermediate value theorem.
Remark 3.1.1 Note that the limit point x in the definition of limit may or may not be an element of
the domain D. In any case, the inequality | f (x) `| < need only be satisfied by elements of D.
Example 3.1.1 Let f : R R be given by f (x) = 5x 7. We prove that limx2 f (x) = 3. Let
> 0. First note that | f (x) 2| = |5x 7 3| = |5x 10| = 5|x 2|. This suggests the choice
= /5. Then, if |x 2| < we have
Example 3.1.3 Let f : R R be given by f (x) = x2 . We show that limx2 f (x) = 4. First
note that | f (x) 4| = |x2 4| = |(x 2)(x + 2)| = |x 2||x + 2|. Since the domain is all of R the
expression |x + 2| is not bounded and we cannot proceed as in Example 3.1.2. However, we are
interested only in values of x close to 2 and, thus, we impose the condition 1. If |x 2| < 1, then
1 < x 2 < 1 and, so, 1 < x < 3. It follows, for such x, that |x| < 3 and, hence |x| + 2 < 5.
Now, given > 0 we choose = min{1, 5 }. Then, whenver |x 2| < we get
Proceeding as in the previous example, if |x 1| < 1 we get 1 < x 1 < 1 and, so, 0 < x < 2. Thus
|x| < 2 and |x + 7| |x| + 2 < 9.
Now, given > 0, we choose = min{1, 32 }. It follows that if |x 1| < we get
f (x) ( ) |x + 7| |x 1| < 9 .
1
2 6 6
The following theorem will let us apply our earlier results on limits of sequences to obtain new
results on limits of functions.
Theorem 3.1.2 Sequential Characterization of Limits. Let f : D R and let x be a limit point
of D. Then
if and only if
for every sequence {xn } in D such that xn 6= x for every n and {xn } converges to x.
Proof: Suppose (3.1) holds. Let {xn } be a sequence in D with xn 6= x for every n and such that {xn }
converges to x. Given any > 0, there exists > 0 such that | f (x) `| < whenever x D and
0 < |x x| < . Then there exists N N with 0 < |xn x| < for all n N. For such n, we have
| f (xn ) `| < .
Corollary 3.1.3 Let f : D R and let x be a limit point of D. If f has a limit at x, then this limit is
unique.
Proof: Suppose by contradiction that f has two different limits `1 and `2 . Let {xn } be a sequence
in D \ {x} that converges to x. By Theorem 3.1.2, the sequence { f (xn )} converges to two different
limits `1 and `2 . This is a contradiction to Theorem 2.1.3.
The following corollary follows directly from Theorem 3.1.2.
Corollary 3.1.4 Let f : D R and let x be a limit point of D. Then f does not have a limit at x if
and only if there exists a sequence {xn } in D such that xn 6= x for every n, {xn } converges to x, and
{ f (xn )} does not converge.
Example 3.1.5 Consider the Dirichlet function f : R R given by
(
1, if x Q;
f (x) =
0, if x Qc .
Then limxx f (x) does not exist for any x R. Indeed, fix x R and choose two sequences {rn },
{sn } converging to x such that rn Q and sn 6 Q for all n N. Define a new sequence {xn } by
(
rk , if n = 2k;
xn =
sk , if n = 2k 1.
It is clear that {xn } converges to x. Moreover, since { f (rn )} converges to 1 and { f (sn )} converges
to 0, Theorem 2.1.9 implies that the sequence { f (xn )} does not converge. It follows from the
sequential characterization of limits that limxx f (x) does not exist.
Then `1 `2 .
Proof: Let {xn } be a sequence in B(x; ) D = (x , x + ) D that converges to x and xn 6= x for
all n. By Theorem 3.1.2,
Theorem 3.1.7 Let f , g, h : D R and let x be a limit point of D. Suppose there exists > 0
such that f (x) g(x) h(x) for all x B(x; ) D, x 6= x. If limxx f (x) = limxx h(x) = `, then
limxx g(x) = `.
Proof: The proof is straightforward using Theorem 2.1.6 and Theorem 3.1.2.
Remark 3.1.8 We will adopt the following convention. When we write limxx f (x) without speci-
fying the domain D of f we will assume that D is the largest subset of R such that if x D, then
f (x) results in a real number. For example, in
1
lim
x2 x + 3
we assume D = [0, ).
Exercises
3.1.1 Use the definition of limit to prove that
(a) limx2 3x 7 = 1.
(b) limx3 (x2 + 1) = 10.
x+3
(c) limx1 = 2.
x+1
(d) limx0 x = 0.
(e) limx2 x3 = 8.
3.1.3 Let f : D R and let x be a limit point of D. Prove that if limxx f (x) = `, then
3.1.4 Let f : D R and let x be a limit point of D. Suppose f (x) 0 for all x D. Prove that if
limxx f (x) = `, then
p
lim f (x) = `.
xx
Determine which of the following limits exist. For those that exist find their values.
for x D.
e
Then
Proof: Let us first prove (a). Let {xn } be a sequence in D that converges to x and xn 6= x for every n.
By Theorem 3.1.2,
lim f (xn ) = ` and lim g(xn ) = m.
n n
Applying Theorem 3.1.2 again, we get limxx ( f + g)(x) = ` + m. The proofs of (b) and (c) are
similar.
Let us now show that if m 6= 0, then x is a limit point of D.
e Since x is a limit point of D, there is
a sequence {uk } in D converging to x such that uk 6= x for every k. Since m 6= 0, it follows from an
easy application of Theorem 3.1.6 that there exists > 0 with
g(x) 6= 0 whenever 0 < |x x| < , x D.
This implies
xD
e whenever 0 < |x x| < , x D.
Then uk D
e for all k sufficiently large, and hence x is a limit point of D.
e The rest of the proof of (d)
can be completed easily following the proof of (a).
x2 + 2x 3
Example 3.2.1 Consider f : R \ {7} R given by f (x) = . Then, combining all
x+7
parts of Theorem 3.2.1, we get
limx2 (x2 + 2x 3) limx2 x2 + limx2 2x limx2 3
lim f (x) = =
x2 limx2 (x + 7) limx2 x + limx2 7
(limx2 x)2 + 2 limx2 x limx2 3 (2)2 + 2(2) 3 3
= = = .
limx2 x + limx2 7 2 + 7 5
Example 3.2.2 We proceed in the same way to compute the following limit.
Theorem 3.2.2 (Cauchys criterion) Let f : D R and let x be a limit point of D. Then f has a
limit at x if and only if for any > 0, there exists > 0 such that
| f (r) f (s)| < whenever r, s D and 0 < |r x| < , 0 < |s x| < . (3.3)
Proof: Suppose limxx f (x) = `. Given > 0, there exists > 0 such that
| f (x) `| < whenever x D and 0 < |x x| < .
2
Thus, for r, s D with 0 < |r x| < and 0 < |s x| < , we have
Let us prove the converse. Fix a sequence {un } in D such with limn un = x and un 6= x for every n.
Given > 0, there exists > 0 such that
This implies
Thus, { f (un )} is a Cauchy sequence, and hence there exists ` R such that
lim f (un ) = `.
n
We now prove that f has limit ` at x using Theorem 3.1.2. Let {xn } be a sequence in D such that
limn xn = x and xn 6= x for every n. By the previous argument, there exists `0 R such that
lim f (xn ) = `0 .
n
Fix any > 0 and let > 0 satisfy (3.3). There exists K N such that
for all n K. Then | f (un ) f (xn )| < for such n. Letting n , we have |` `0 | . Thus, ` = `0
since is arbitrary. It now follows from Theorem 3.1.2 that limxx f (x) = `.
The rest of this section discussed some special limits and their properties.
Definition 3.2.2 Let a R and > 0. Define
Given a subset A of R, we say that a is a left limit point of A if for any > 0, B (a; ) contains
an infinite number of elements of A. Similarly, a is called a right limit point of A if for any > 0,
B+ (a; ) contains an infinite number of elements of A.
70 3.2 LIMIT THEOREMS
It follows from the definition that a is a limit point of A if and only if it is a left limit point of A
or it is a right limit point of A.
Definition 3.2.3 (One-sided limits) Let f : D R and let x be a left limit point of D. We write
lim f (x) = `
xx
We say that ` is the left-hand limit of f at x. The right-hand limit of f at x can be defined in a similar
way and is denoted limxx+ f (x).
Example 3.2.4 Consider the function f : R \ {0} R given by
|x|
f (x) = .
x
Let x = 0. Note first that 0 is a limit point of the set D = R \ {0} R. Since, for x > 0, we have
f (x) = x/x = 1, we have
We have
and
The following theorem follows directly from the definition of one-sided limits.
Theorem 3.2.3 Let f : D R and let x be both a left limit point of D and a right limit point of D.
Then
lim f (x) = `
xx
if and only if
Theorem 3.2.4 Suppose f : (a, b) R is increasing on (a, b) and x (a, b). Then limxx f (x)
and limxx+ f (x) exist. Moreover,
sup f (x) = lim f (x) f (x) lim+ f (x) = inf f (x).
a<x<x xx xx x<x<b
Proof: Since f (x) f (x) for all x (a, x), the set
{ f (x) : x (a, x)}
is nonempty and bounded above. Thus,
` = sup f (x)
a<x<x
is a real number. We will show that limxx f (x) = `. For any > 0, by the definition of the least
upper bound, there exists a < x1 < x such that
` < f (x1 ).
Let = x x1 > 0. Using the increasing monotonicity, we get
` < f (x1 ) f (x) ` < ` + for all x (x1 , x) = B (x; ).
Therefore, limxx f (x) = `. The rest of the proof of the theorem is similar.
Let
B0 (x; ) = B (x; ) B+ (x; ) = (x , x + ) \ {x}.
Definition 3.2.5 (infinite limits) Let f : D R and let x be a limit point of D. We write
lim f (x) =
xx
Infinite limits of functions have similar properties to those of sequences from Chapter 2 (see
Definition 2.3.2 and Theorem 2.3.6).
1
Example 3.2.7 We show that limx1 (x1)2 = directly from Definition 3.2.5.
1
Let M R. We want to find > 0 that will guarantee (x1)2
> M whenever 0 < |x 1| < .
1
As in the case of finite limits, we work backwards from (x1)2 > M to an inequality for |x 1|.
1
To simplify calculations, note that |M| + 1 > M. Next note that (x1)2 > |M| + 1, is equivalent to
q
1
|M|+1 > |x 1|.
q
1
Now, choose such that 0 < < |M|+1 . Then, if 0 < |x 1| < we have
1 1 1
2
> 2> 1
= |M| + 1 > M,
(x 1) |M|+1
as desired.
Definition 3.2.6 (limits at infinity) Let f : D R, where D is not bounded above. We write
lim f (x) = `
x
lim f (x) = `
x
in a similar way.
3x2 + x 3
lim = .
x 2x2 + 1 2
The approach is similar to that for sequences, with the difference that x need not be an integer.
Let > 0. We want to identify c so that
2
3x + x 3
2x2 + 1 2 < , (3.5)
1 2(2x2 + 1)
< . (3.6)
|2x 3|
2 +1) 2
We first restrict x to be less than 0, so |2x 3| > 3. Then, since 4x3 < 2(2x
|2x3| , 3.6 will be guaranteed
p p
if 1/ < 4x2 /3 or, equivalently 3/(4) < |x|. We set c < min{0, 3/(4)}. Then, if x < c, we
2 +1)
have 3/(4) < x = |x|. Thus, 1/ < 2(2x
p
|2x3| and, hence,
2
3x + x 3 |2x 3|
2x2 + 1 2 = 2(2x2 + 1) < .
Exercises
3.2.1 Find the following limits:
3x2 2x + 5
(a) limx2 ,
x3
x2 + 4x + 3
(b) limx3
x2 9
3.2.2 Let f : D R and let x is a limit point of D. Prove that if limxx f (x) exists, then
3.2.6 Determine the one-sided limits limx3+ [x] and limx3 [x], where [x] denotes the greatest
integer that is less than or equal to x.
74 3.3 CONTINUITY
x+1
(a) limx1+ .
x 1
(b) limx0+ x3 sin(1/x).
(c) limx1 (x [x]).
3.2.9 Determine all values of x such that the limit limxx (1 + x [x]) exists.
(a) If f is bounded above, then limxb f (x) exists and is a real number.
(b) If f is not bounded above, then limxb f (x) = .
State and prove analogous results in case f is bounded below and in case that the domain of f is one
of (, b), (a, ), or (, ).
3.3 CONTINUITY
Definition 3.3.1 Let D be a nonempty subset of R and let f : D R be a function. The function f
is said to be continuous at x0 D if for any real number > 0, there exists > 0 such that if x D
and |x x0 | < , then
Example 3.3.1 Let f : R R be given by f (x) = 3x + 7. Let x0 R and let > 0. Choose
= /3. Then if |x x0 | < , we have
| f (x) f (x0 )| = |3x + 7 (3x0 + 7)| = |3(x x0 )| = 3|x x0 | < 3 = .
This shows that f is continuous at x0 .
Remark 3.3.1 Note that the above definition of continuity does not mention limits. This allows
to include in the definition, points x0 D which are not limit points of D. If x0 is an isolated
point of D, then there is > 0 such that B(x0 ; ) D = {x0 }. It follows that for x B(x0 ; ) D,
| f (x) f (x0 )| = 0 < for any epsilon. Therefore, every function is continuous at an isolated point
of its domain.
To study continuity at limit points of D, we have the following theorem which follows directly
from the definitions of continuity and limit.
Example 3.3.2 Let f : R R be given by f (x) = 3x2 2x + 1. Fix x0 R. Since, from the
results of the previous theorem, we have
lim f (x) = lim (3x2 2x + 1) = 3x02 2x0 + 1 = f (x0 ),
xx0 xx0
Theorem 3.3.3 Let f : D R and let x0 D. Then f is continuous at x0 if and only if for any
sequence {xk } in D that converges to x0 , the sequence { f (xk )} converges to f (x0 ).
The proofs of the next two theorems are straightforward using Theorem 3.3.3.
Theorem 3.3.4 Let f , g : D R and let x0 D. Suppose f and g are continuous at x0 . Then
(a) f + g and f g are continuous at x0 .
(b) c f is continuous at x0 for any constant c.
f
(c) If g(x0 ) 6= 0, then (defined on D e = {x D : g(x) 6= 0}) is continuous at x0 .
g
Proof: We prove (a) and leave the other parts as an exercise. We will use Theorem 3.3.3. Let {xk }
be a sequence in D that converges to x0 . Since f and g are continuous at x0 , by Theorem 3.3.3 we
obtain that { f (xk )} converges to f (x0 ) and {g(xk )} converges to g(x0 ). By Theorem 2.2.1 (a),we
get that { f (xk ) + g(xk )} converges to f (x0 ) + g(x0 ). Therefore,
lim ( f + g)(xk ) = lim f (xk ) + g(xk ) = f (x0 ) + g(x0 ) = ( f + g)(x0 ).
k k
Since {xk } was arbitrary, using Theorem 3.3.3 again we conclude f + g is continuous at x0 .
Exercises
3.3.1 Prove, using definition 3.3.1, that each of the following functions is continuous on the given
domain:
(a) f (x) = ax + b, a, b R, on R.
(b) f (x) = x2 3 on R.
(c) f (x) = x on [0, ).
1
(d) f (x) = on R \ {0}.
x
3.3.2 Determine the values of x at which each function is continuous. The domain of all the
functions is R.
sin x , if x 6= 0;
(a) f (x) = x
1, if x = 0.
sin x , if x 6= 0;
3.3.5 Prove Theorem 3.3.3. (Hint: treat separately the cases when x0 is a limit point of D and when
it is not.)
Prove that if g(a) = h(a), for some a [0, 1], then f is continuous at a.
(b) Let f : [0, 1] R be the function given by
(
x, if x Q [0, 1];
f (x) =
1 x, if x Qc [0, 1].
f (x) = q q
0, if x is irrational.
3.3.11 Suppose that f , g are continuous functions on R and f (x) = g(x) for all x Q. Prove that
f (x) = g(x) for all x R.
m an < m + 1/n.
For each n, since an A = f (D), there exists xn D such that an = f (xn ) and, hence,
By the compactness of D, there exists an element x D and a subsequence {xnk } that converges to
x D as k . Because
1
m f (xnk ) < m + for every k,
nk
by the squeeze theorem (Theorem 2.1.6) we conclude limk f (xnk ) = m. On the other hand, by
continuity we have limk f (xnk ) = f (x). We conclude that f (x) = m f (x) for every x D. Thus,
f has an absolute minimum at x. The proof is similar for the case of absolute maximum.
Remark 3.4.3 The proof of Theorem 3.4.2 can be shortened by applying Theorem 2.6.4. However,
we have provided a direct proof instead.
Corollary 3.4.4 If f : [a, b] R is continuous, then it has an absolute minimum and an absolute
maximum on [a, b].
79
Corollary 3.4.4 is sometimes referred to as the Extreme Value Theorem. It follows immediately
from Theorem 3.4.2, and the fact that the interval [a, b] is compact (see Example 2.6.4).
The following result is a basic property of continuous functions that is used in a variety of
situations.
Lemma 3.4.5 Let f : D R be continuous at c D. Suppose f (c) > 0. Then there exists > 0
such that
Proof: Let = f (c) > 0. By the continuity of f at c, there exists > 0 such that if x D and
|x c| < , then
| f (x) f (c)| < .
This implies, in particular, that f (x) > f (c) = 0 for every x B(c; ) D. The proof is now
complete.
Theorem 3.4.7 Let f : [a, b] R be a continuous function. Suppose f (a) f (b) < 0 (this means
either f (a) < 0 < f (b) or f (a) > 0 > f (b)). Then there exists c (a, b) such that f (c) = 0.
Proof: We prove only the case f (a) < 0 < f (b) (the case f (a) > 0 > f (b) is completely analogous).
Define
A = {x [a, b] : f (x) 0}.
This set is nonempty since a A. This set is also bounded since A [a, b]. Therefore, c = sup A
exists and a c b. We are going to prove that f (c) = 0 by showing that f (c) < 0 and f (c) > 0
lead to contradictions.
Suppose f (c) < 0. Then there exists > 0 such that
Because c < b (since f (b) > 0), we can find s (c, b) such that f (s) < 0 (indeed s = min{c +
/2, (c + b)/2} will do). This is a contradiction because s A and s > c.
Suppose f (c) > 0. Then there exists > 0 such that
Since a < c (because f (a) < 0), there exists t (a, c) such that f (x) > 0 for all x (t, c) (in fact,
t = max{c /2, (a + c)/2} will do). On the other hand, since t < c = sup A, there exists t 0 A
with t < t 0 c. But then t < t 0 and f (t 0 ) 0. This is a contradiction. We conclude that f (c) = 0.
Proof: Define
(x) = f (x) , x [a, b].
Then is continuous on [a, b]. Moreover,
By Theorem 3.4.7, there exists c (a, b) such that (c) = 0. This is equivalent to f (c) = . The
proof is now complete.
Corollary 3.4.9 Let f : [a, b] R be a continuous function. Let
Then for every [m, M], there exists c [a, b] such that f (c) = .
Example 3.4.1 We will use the Intermediate Value Theorem to prove that the equation ex = x
has at least one real solution. We will assume known that the exponential function is continuous on
R and that ex < 1 for x < 0.
First define the function f : R R by f (x) = ex + x. Notice that the given equation has a
solution x if and only if f (x) = 0. Now, the function f is continuous (as the sum of continuous
functions). Moreover, note that f (1) = e1 + (1) < 1 1 = 0 and f (0) = 1 > 0. We can now
apply the Intermediate Value Theorem to the function f on the interval [1, 0] with = 0 to conclude
that there is c [1, 0] such that f (c) = 0. The point c is the desired solution to the original equation.
Example 3.4.2 We show now that, given n N, every positive real number has a positive n-th root.
Let n N and let a R with a > 0. First observe that (1 + a)n 1 + na > a (see Exercise 1.3.7).
Now consider the function f : [0, ) R given by f (x) = xn . Since f (0) = 0 and f (1 + a) > a, it
follows from the Intermediate Value Theorem that there is x (0, 1 + a) such that f (x) = a. That is,
xn = a, as desired. (We show later in Example 4.3.1 that such an x is unique.)
We present below a second proof of Theorem 3.4.8 that does not depend on Theorem 3.4.7, but,
instead, relies on the Nested Intervals Theorem (Theorem 2.3.3).
Second Proof of Theorem 3.4.8: We construct a sequence of nested intervals as follows. Set
a1 = a, b1 = b, and let I1 = [a, b]. Let c1 = (a + b)/2. If f (c1 ) = , we are done. Otherwise, either
f (c1 ) > or
f (c1 ) < .
81
In the first case, set a2 = a1 and b1 = c1 . In the second case, set a2 = c1 and b2 = b1 . Now set
I2 = [a2 , b2 ]. Note that in either case,
f (c2 ) > or
f (c2 ) < .
In the first case, set a3 = a2 and b3 = c2 . In the second case, set a3 = c2 and b3 = b2 . Now set
I3 = [a3 , b3 ]. Note that in either case,
Proceeding in this way, either we find some cn0 such that f (cn0 ) = and, hence, the proof is complete,
or we construct a sequence of closed bounded intervals {In } with In = [an , bn ] such that for all n,
(i) In In+1 ,
(ii) bn an = (b a)/2n1 , and
(iii) f (an ) < < f (bn ).
In this case, we proceed as follows. Condition (ii) implies that limn (bn an ) = 0. By the Nested
Intervals Theorem (Theorem 2.3.3, part (b)), there exists c [a, b] such that n=1 In = {c}. Moreover,
T
Since f (an ) < < f (bn ) for all n, condition (iii) above and Theorem 2.1.5 give
f (c) and
f (c) .
It follows that f (c) = . Note that, since f (a) < < f (b), then c (a, b). The proof is now
complete.
Now we are going to discuss the continuity of the inverse function. For a function f : D E,
where E is a subset of R, we can define the new function f : D R by the same function notation.
The function f : D E is said to be continuous at a point x D if the corresponding function
f : D R is continuous at x.
Theorem 3.4.10 Let f : [a, b] R be strictly increasing and continuous on [a, b]. Let c = f (a) and
d = f (b). Then f is one-to-one, f ([a, b]) = [c, d], and the inverse function f 1 defined on [c, d] by
Proof: The first two assertions follow from the monotonicity of f and the Intermediate Value
Theorem (see also Corollary 3.4.9). We will prove that f 1 is continuous on [c, d]. Fix any y [c, d]
and fix any sequence {yk } in [c, d] that converges to y. Let x [a, b] and xk [a, b] be such that
Then f 1 (y) = x and f 1 (yk ) = xk for every k. Suppose by contradiction that {xk } does not converge
to x. Then there exist 0 > 0 and a subsequence {xk` } of {xk } such that
Since the sequence {xk` } is bounded, it has a further subsequence that converges to x0 [a, b]. To
simplify the notation, we will again call the new subsequence {xk` }. Taking limits in (3.7), we get
On the other hand, by the continuity of f , { f (xk` )} converges to f (x0 ). Since f (xk` ) = yk` y as
` , it follows that f (x0 ) = y = f (x). This implies x0 = x, which contradicts (3.8).
Remark 3.4.11 A similar result holds if the domain of f is the open interval (a, b) with some
additional considerations. If f : (a, b) R is increasing and bounded, following the argument
in Theorem 3.2.4 we can show that both limxa+ f (x) = c and limxb f (x) = d exist in R (see
Exercise 3.2.10). Using the Intermediate Value Theorem we obtain that f ((a, b)) = (c, d). We can
now proceed as in the previous theorem to show that f has a continuous inverse from (c, d) to (a, b).
If : (a, b) R is increasing, continuous, bounded below, but not bounded above, then limxa+ f (x) =
c R, but limxb f (x) = (again see Exercise 3.2.10). In this case we can show using the Inter-
mediate Value Theorem that f ((a, b)) = (c, ) and we can proceed as above to prove that f has a
continuous inverse from (c, ) to (a, b).
The other possibilities lead to similar results.
A similar theorem can be proved for strictly decreasing functions.
Exercises
3.4.1 Let f : D R be continuous at c D and let R. Suppose f (c) > . Prove that there
exists > 0 such that
3.4.2 Let f , g be continuous functions on [a, b]. Suppose f (a) < g(a) and f (b) > g(b). Prove that
there exists x0 (a, b) such that f (x0 ) = g(x0 ).
3.4.3 Prove that the equation cos x = x has at least on solution in R. (Assume known that the
function cos x is continuous.)
3.4.4 Prove that the equation x2 2 = cos(x + 1) has at least two real solutions. (Assume known
that the function cos x is continuous.)
(a) Prove that the equation f (x) = x has a solution on [a, b].
(b) Suppose further that
Prove that the equation f (x) = x has a unique solution on [a, b].
3.4.6 B Let f be a continuous function on [a, b] and x1 , x2 , . . . , xn [a, b]. Prove that there exists
c [a, b] with
Suppose further that f is monotone. Prove that there exists x0 [0, 1] such that
f (x0 ) = g(x0 ) = x0 .
Example 3.5.1 Any constant function f : D R, is uniformly continuous on its domain. Indeed,
given > 0, | f (u) f (v)| = 0 < for all u, v D regardless of the choice of .
The following result is straightforward from the definition.
Example 3.5.3 Let f : [3, 2] R be given by f (x) = x2 . This function is uniformly continuous
on [3, 2].
Let > 0. First observe that for u, v [3, 2] we have |u + v| |u| + |v| 6. Now set = /6.
Then, for u, v [3, 2] satisfying |u v| < , we have
The number is called the Hlder exponent of the function. If = 1, then the function f is called
Lipschitz continuous.
If ` = 0, then f is constant and, thus, uniformly continuous. Suppose next that ` > 0. For any
1/
> 0, let = ` . Then, whenever u, v D, with |u v| < we have
(2) Let D = [0, ). Then f is not Lipschitz continuous on D, but it is Hlder continuous on D and,
hence, f is also uniformly continuous on this set.
Indeed, suppose by contradiction that f is Lipschitz continuous on D. Then there exists a
constant ` > 0 such that
| f (u) f (v)| = | u v| `|u v| for every u, v D.
This implies
n ` or n `2 for every n N .
This is a contradiction. Therefore, f is not Lipschitz continuous on D.
Let us show that f is Hlder continuous on D. We are going to prove that
The inequality in (3.9) holds obviously for u = v = 0. For u > 0 or v > 0, we have
| f (u) f (v)| = | u v|
uv
=
u+ v
p
p |u v|
= |u v|
u+ v
p
|u| + |v| p
|u v|
u+ v
p
= |u v|.
While every uniformly continuous function on a set D is also continuous at each point of D, the
converse is not true in general. The following example illustrates this point.
Example 3.5.6 Let f : (0, 1) R be given by
1
f (x) = .
x
We already know that this function is continuous at every x (0, 1). We will show that f is not
uniformly continuous on (0, 1). Let = 2 and > 0. Set 0 = min{ /2, 1/4}, x = 0 , and y = 20 .
Then x, y (0, 1) and |x y| = 0 < , but
1 1 y x 0 1
| f (x) f (y)| = =
= = 2 = .
x y xy 202 20
u, v D and |u v| < . Let N N be such that |un vn | < for n N. For such n, we have
| f (un ) f (vn )| < . This shows limn ( f (un ) f (vn )) = 0.
To prove the converse, assume condition (C) holds and suppose, by way of contradiction, that f
is not uniformly continuous. Then there exists 0 > 0 such that for any > 0, there exist u, v D
with
It follows that for such sequences, limn (un vn ) = 0, but { f (un ) f (vn )} does not converge to
zero, which contradicts the assumption.
Example 3.5.7 Using this theorem, we can give an easier proof that the function in Example 3.5.6
is not uniformly continuous. Consider the two sequences un = 1/(n + 1) and vn = 1/n for all n 2.
Then clearly, limn (un vn ) = 0, but
1 1
lim ( f (un ) f (vn )) = lim = lim (n + 1 n) = 1 6= 0.
n n 1/(n + 1) 1/n n
The following theorem shows one important case in which continuity implies uniform continuity.
Since D is compact, there exist u0 D and a subsequence {unk } of {un } such that
unk u0 as k .
Then
1
|unk vnk | ,
nk
for all k and, hence, we also have
vnk u0 as k .
By the continuity of f ,
f (unk ) f (u0 ) and f (vnk ) f (u0 ).
Therefore, { f (unk ) f (vnk )} converges to zero, which is a contradiction. The proof is now com-
plete.
88 3.5 UNIFORM CONTINUITY
We now prove a result that characterizes uniform continuity on open bounded intervals. We
first make the observation that if f : D R is uniformly continuous on D and A D, then f is
uniformly continuous on A. More precisely, the restriction f|A : A R is uniformly continuous on A
(see Section 1.2 for the notation). This follows by noting that if | f (u) f (v)| < whenever u, v D
with |u v| < , then we also have | f (u) f (v)| < when we restrict u, v to be in A.
Theorem 3.5.5 Let a, b R and a < b. A function f : (a, b) R is uniformly continuous if and
only if f can be extended to a continuous function f : [a, b] R (that is, there is a continuous
function f : [a, b] R such that f = f|(a,b) ).
Proof: Suppose first that there exists a continuous function f : [a, b] R such that f = f|(a,b) . By
Theorem 3.5.4, the function f is uniformly continuous on [a, b]. Therefore, it follows from our early
observation that f is uniformly continuous on (a, b).
For the converse, suppose f : (a, b) R is uniformly continuous. We will show first that
limxa+ f (x) exists. Note that the one sided limit corresponds to the limit in Theorem 3.2.2. We will
check that the - condition of Theorem 3.2.2 holds.
Let > 0. Choose 0 > 0 so that | f (u) f (v)| < whenever u, v (a, b) and |u v| < 0 . Set
= 0 /2. Then, if u, v (a, b), |u a| < , and |v a| < we have
|u v| |u a| + |a v| < + = 0
and, hence, | f (u) f (v)| < . We can now invoke Theorem 3.2.2 to conclude limxa+ f (x) exists.
In a similar way we can show that limxb f (x) exists. Now define, f : [a, b] R by
f (x),
if x (a, b);
f (x) = limxa+ f (x), if x = a;
limxb f (x), if x = b.
By its definition f|(a,b) = f and, so, f is continuous at every x (a, b). Moreover, limxa+ f(x) =
limxa+ f (x) = f(a) and limxb f(x) = limxb f (x) = f(b), so f is also continuous at a and b by
Theorem 3.3.2. Thus f is the desired continuous extension of f .
Exercises
3.5.1 Prove that each of the following functions is uniformly continuous on the given domain:
(a) f (x) = ax + b, a, b R, on R.
(b) f (x) = 1/x on [a, ), where a > 0.
3.5.2 I Prove that each of the following functions is not uniformly continuous on the given domain:
(a) f (x) = x2 on R.
1
(b) f (x) = sin on (0, 1).
x
(c) f (x) = ln(x) on (0, ).
3.5.3 Determine which of the following functions are uniformly continuous on the given domains.
89
3.5.6 Let D be a nonempty subset of R and let f : D R. Suppose that f is uniformly continuous
on D. Prove that if {xn } is a Cauchy sequence with xn D for every n N, then { f (xn )} is also a
Cauchy sequence.
lim f (x) = c.
x
We say that the function f is locally bounded above around x if there exists > 0 and M > 0 such
that
Clearly, if f is locally bounded above around x, then lim supxx f (x) is a real number, while
lim supxx f (x) = in the other case. Similar discussion applies for the limit inferior.
Theorem 3.6.1 Let f : D R and let x be a limit point of D. Then ` = lim supxx f (x) if and only
if the following two conditions hold:
(2) For every > 0 and for every > 0, there exists x B0 (x; ) D such that
` < f (x ).
` = inf g( ),
>0
where g is defined in (3.10). For any > 0, there exists > 0 such that
Thus,
which proves condition (1). Next note that for any > 0 and > 0, we have
` < f (x ).
Let us now prove the converse. Suppose (1) and (2) are satisfied. Fix any > 0 and let > 0
satisfy (1). Then
g( ) = sup f (x) ` + .
xB0 (x; )D
This implies
This implies
Corollary 3.6.2 Suppose ` = lim supxx f (x). Then there exists a sequence {xk } in D such that
{xk } converges to x, xk 6= x for every k, and
lim f (xk ) = `.
k
Moreover, if {yk } is a sequence in D that converges to x, yk 6= x for every k, and limk f (yk ) = `0 ,
then `0 `.
1
Proof: For each k N, take k = . By (1) of Theorem 3.6.1, there exists k > 0 such that
k
f (x) < ` + k for all x B0 (x; k ) D. (3.11)
Let k0 = min{k , 1k }. Then k0 k and limk k0 = 0. From (2) of Theorem 3.6.1, there exists
xk B0 (x; k0 ) D such that
` k < f (xk ).
Moreover, f (xk ) < ` + k by (3.11). Therefore, {xk } is a sequence that satisfies the conclusion of the
corollary.
Now let {yk } be a sequence in D that converges to x, yk 6= x for every k, and limk f (yk ) = `0 .
For any > 0, let > 0 be as in (1) of Theorem 3.6.1. Since yk B0 (x; ) D when k is sufficiently
large, we have
f (yk ) < ` +
Remark 3.6.3 Let f : D R and let x be a limit point of D. Suppose lim supxx f (x) is a real
number. Define
Then the previous corollary shows that A 6= 0/ and lim supxx f (x) = max A.
if and only if there exists a sequence {xk } in D such that {xk } converges to x, xk 6= x for every k, and
limk f (xk ) = .
Proof: Suppose lim supxx f (x) = . Then
inf g( ) = ,
>0
where g is the extended real-valued function defined in (3.10). Thus, g( ) = for every > 0.
1
Given k N, for k = , since
k
g(k ) = sup f (x) = ,
xB0 (x;k )D
there exists xk B0 (x; k ) D such that f (xk ) > k. Therefore, limk f (xk ) = .
Let us prove the converse. Since limk f (xk ) = , for every M R, there exists K N such
that
xk B0 (x; ) D
g( ) = sup f (x) M.
xB0 (x; )D
if and only if for any sequence {xk } in D such that {xk } converges to x, xk 6= x for every k, it follows
that limk f (xk ) = . The latter is equivalent to limxx f (x) = .
Following the same arguments, we can prove similar results for inferior limits of functions.
93
Theorem 3.6.6 Let f : D R and let x be a limit point of D. Then ` = lim infxx f (x) if and only
if the following two conditions hold:
(2) For every > 0 and for every > 0, there exists x B0 (x; ) D such that
f (x) < ` + .
Corollary 3.6.7 Suppose ` = lim infxx f (x). Then there exists a sequence {xk } in D such that xk
converges to x, xk 6= x for every k, and
lim f (xk ) = `.
k
Moreover, if {yk } is a sequence in D that converges to x, yk 6= x for every k, and limk f (yk ) = `0 ,
then `0 `.
Remark 3.6.8 Let f : D R and let x be a limit point of D. Suppose lim infxx f (x) is a real
number. Define
if and only if there exists a sequence {xk } in D such that {xk } converges to x, xk 6= x for every k, and
limk f (xk ) = .
if and only if for any sequence {xk } in D such that {xk } converges to x, xk 6= x for every k, it follows
that limk f (xk ) = . The latter is equivalent to limxx f (x) = .
lim f (x) = `
xx
if and only if
Proof: Suppose
lim f (x) = `.
xx
Then for every > 0, there exists > 0 such that
` < g( 0 ) ` + .
It follows that
` inf
0
g( 0 ) ` + .
>0
Therefore, lim supxx f (x) = ` since is arbitrary. The proof for the limit inferior is similar. The
converse follows directly from (1) of Theorem 3.6.1 and Theorem 3.6.6.
Exercises
3.6.1 Let D R, f : D R, and x be a limit point of D. Prove that lim infxx f (x) lim supxx f (x).
Similarly, we say that f is upper semicontinuous (u.s.c.) at x if for every > 0, there exists > 0
such that
It is clear that f is continuous at x if and only if f is lower semicontinuous and upper semicontin-
uous at this point.
95
Theorem 3.7.1 Let f : D R and let x D be a limit point of D. Then f is lower semicontinuous
at x if and only if
Proof: Suppose f is lower semicontinuous at x. Let > 0. Then there exists 0 > 0 such that
This implies
f (x) h(0 ),
where
h( ) = inf f (x).
xB0 (x; )D
Thus,
Theorem 3.7.2 Let f : D R and let x D. Then f is l.s.c. at x if and only if for every sequence
{xk } in D that converges to x,
Similarly, f is u.s.c. at x if and only if for every sequence {xk } in D that converges to x,
Proof: Suppose f is l.s.c. at x. Then for any > 0, there exists > 0 such that (3.12) holds. Since
{xk } converges to x, we have xk B(x; ) when k is sufficiently large. Thus,
for such k. It follows that f (x) lim infk f (xk ). Since is arbitrary, it follows that f (x)
lim infk f (xk ).
We now prove the converse. Suppose lim infk f (xk ) f (x) and assume, by way of contra-
diction, that f is not l.s.c. at x. Then there exists > 0 such that for every > 0, there exists
x B(x; ) D with
f (x) f (x ).
1
Applying this for k = , we obtain a sequence {xk } in D that converges to x with
k
f (x) f (xk ) for every k.
This implies
This is a contradiction.
97
Definition 3.7.2 Let f : D R. We say that f is lower semicontinuous on D (or lower semicontinu-
ous if no confusion occurs) if it is lower semicontinuous at every point of D.
Proof: We first prove that f is bounded below. Suppose by contradiction that for every k N, there
exists xk D such that
f (xk ) < k.
Since D is compact, there exists a subsequence {xk` } of {xk } that converges to x0 D. Since f is
l.s.c., by Theorem 3.7.2
This is a contraction because lim inf` f (xk` ) = . This shows f is bounded below. Define
La ( f ) = {x D : f (x) a}
and
Ua ( f ) = {x D : f (x) a}.
Proof: Suppose f is lower semicontinuous. Using Corollary 2.6.10, we will prove that for every
sequence {xk } in La ( f ) that converges to a point x D, we get x La ( f ). For every k, since
xk La ( f ), f (xk ) a.
Since f is lower semicontinuous at x,
B(x; ) D G.
It follows that
Therefore, f is lower semicontinuous. The proof for the upper semicontinuous case is similar.
For every a R, we also define
La ( f ) = {x D : f (x) < a}
and
Theorem 3.7.7 Let f : D R. Then f is continuous if and only if for every a, b R with a < b,
the set
is an open set in D.
Proof: Suppose f is continuous. Then f is lower semicontinuous and upper semicontinuos. Fix
a, b R with a < b. Then
Oa,b = Lb Ua .
By Theorem 3.7.6, the set Oa,b is open since it is the intersection of two open sets La and Ub .
Let us prove the converse. We will only show that f is lower semicontinuous since the proof of
upper semicontinuity is similar. For every a R, we have
Exercises
3.7.1 Let f be the function given by
(
x2 , if x 6= 0;
f (x) =
1, if x = 0.
3.7.3 Let f , g : D R be lower semicontinuous functions and let k > 0 be a constant. Prove that
f + g and k f are lower semicontinous functions on D.
4. DIFFERENTIATION
In this chapter, we discuss basic properties of the derivative of a function and several major
theorems, including the Mean Value Theorem and lHpitals Rule.
f (x) f (a)
a (x) =
xa
is defined on G \ {a}. Since G is an open set, a is a limit point of G \ {a} (see Example 2.6.6). Thus,
it is possible to discuss the limit
f (x) f (a)
lim a (x) = lim .
xa xa xa
Definition 4.1.1 Let G be an open subset of R and let a G. We say that the function f defined on
G is differentiable at a if the limit
f (x) f (a)
lim
xa xa
exists (as a real number). In this case, the limit is called the derivative of f at a denoted by f 0 (a),
and f is said to be differentiable at a. Thus, if f is differentiable at a, then
f (x) f (a)
f 0 (a) = lim .
xa xa
f (x) f (a) xa
lim = lim = lim 1 = 1.
x a xa x a x a x a
and
f (x) f (0) |x| x
lim = lim = lim = 1.
x 0 x0 x 0 x x 0 x
f (x) f (0)
Therefore, limx 0 x0 does not exist and, hence, f is not differentiable at 0.
Remark 4.1.2 The converse of Theorem 4.1.1 is not true. For instance, the absolute value function
f (x) = |x| is continuous at 0, but it is not differentiable at this point (as shown in the example above).
Theorem 4.1.3 Let G be an open subset of R and let f , g : G R. Suppose both f and g are
differentiable at a G. Then the following hold.
(c f )0 (a) = c f 0 (a).
103
f
(d) Suppose additionally that g(a) 6= 0. Then the function is differentiable at a and
g
0
f f 0 (a)g(a) f (a)g0 (a)
(a) = .
g (g(a))2
Proof: The proofs of (a) and (b) are straightforward and we leave them as exercises. Let us prove (c).
For every x G \ {a}, we can write
Thus,
( f g)(x) ( f g)(a)
lim = f 0 (a)g(a) + f (a)g0 (a).
xa xa
This implies (c).
Next we show (d). Since g(a) 6= 0, by (4.1), there exists an open interval I containing a such that
f
g(x) 6= 0 for all x I. Let h = . Then h is defined on I. Moreover,
g
f (x) f (a) f (a) f (a)
h(x) h(a) g(x) g(x) + g(x) g(a)
=
xa xa
1 f (a)
g(x) ( f (x) f (a)) + g(x)g(a) (g(a) g(x))
=
xa
1 f (x) f (a) g(x) g(a)
= g(a) f (a) .
g(x)g(a) xa xa
Taking the limit as x a, we obtain (d). The proof is now complete.
Example 4.1.2 Let f : R R be given by f (x) = x2 and let a R. Using Example 4.1.1(a)
and Theorem 4.1.3(c) we can provide an alternative derivation of a formula for f 0 (a). Indeed, let
g : R R be given by g(x) = x. Then f = g g so
The following lemma is very convenient for studying the differentiability of the composition of
functions.
u(x) = xa
0, x = a.
Since f is differentiable at a, we have
f (x) f (a)
lim u(x) = lim f 0 (a) = f 0 (a) f 0 (a) = 0.
xa xa xa
Therefore, the function u satisfies the conditions of the lemma.
Theorem 4.1.5 Chain rule. Let f : G1 R and let g : G2 R, where G1 and G2 are two open
subsets of R with f (G1 ) G2 . Suppose f is differentiable at a and g is differentiable at f (a). Then
the function g f is differentiable at a and
(g f )0 (a) = g0 ( f (a)) f 0 (a).
Proof: Since f is differentiable at a, by Lemma 4.1.4, there exists a function u defined on G1 with
f (x) f (a) = [ f 0 (a) + u(x)](x a) for all x G1 ,
and limxa u(x) = 0.
Similarly, since g is differentiable at f (a), there exists a function v defined on G2 with
g(t) g( f (a)) = [g0 ( f (a)) + v(t)][t f (a)] for all t G2 , (4.2)
and limt f (a) v(t) = 0.
Applying (4.2) for t = f (x), we have
g( f (x)) g( f (a)) = [g0 ( f (a)) + v( f (x))][ f (x) f (a)].
Thus,
g( f (x)) g( f (a)) = [g0 ( f (a)) + v( f (x))][ f 0 (a) + u(x)](x a) for all x G1 .
This implies
g( f (x)) g( f (a))
= [g0 ( f (a)) + v( f (x))][ f 0 (a) + u(x)] for all x G1 \ {a}.
xa
By the continuity of f at a and the property of v, we have limxa v( f (x)) = 0 and, hence,
g( f (x)) g( f (a))
lim = g0 ( f (a)) f 0 (a).
xa xa
The proof is now complete.
105
Example 4.1.3 Consider the function h : R R given by h(x) = (3x4 + x + 7)15 . Since h(x) is a
polynomial we could in principle compute h0 (x) by expanding the power and using Example 4.1.2.
However, Theorem 4.1.5 provides a shorter way. Define f , g : R R by f (x) = 3x4 + x + 7 and
g(x) = x15 . Then h = g f . Given a R, it follows from Theorem 4.1.5 that
Example 4.1.4 By iterating the Chain Rule, we can extended the result to the composition of
more than two functions in a straightforward way. For example, given functions f : G1 R,
g : G2 R, and h : G3 R such that f (G1 ) G2 , g(G2 ) G3 , f is differentiable at a, g is
differentiable at f (a), and h is differentiable at g( f (a)), we obtain that h g f is differentiable at a
and (h g f )0 (a) = h0 (g( f (a)))g0 ( f (a)) f 0 (a).
Definition 4.1.2 Let G be an open set and let f : G R be a differentiable function. If the function
f 0 : G R is also differentiable, we say that f is twice differentiable (on G). The second derivative
of f is denoted by f 00 or f (2) . Thus, f 00 = ( f 0 )0 . Similarly, we say that f is three times differentiable
if f (2) is differentiable, and ( f (2) )0 is called the third derivative of f and is denoted by f 000 or f (3) .
We can define in this way n times differentiability and the nth derivative of f for any positive integer
n. As a convention, f (0) = f .
Definition 4.1.3 Let I be an open interval in R and let f : I R. The function f is said to be
continuously differentiable if f is differentiable on I and f 0 is continuous on I. We denote by C1 (I)
the set of all continuously differentiable functions on I. If f is n times differentiable on I and the nth
derivative is continuous, then f is called n times continuously differentiable. We denote by Cn (I) the
set of all n times continuously differentiable functions on I.
Exercises
4.1.1 Prove parts (a) and (b) of Theorem 4.1.3.
4.1.2 Compute the following derivatives directly from the definition. That is, do not use Theo-
rem 4.1.3, but rather compute the appropriate limit directly (see Example 4.1.1).
(a) f (x) = mx + b where m, b R.
1
(b) f (x) = (here assume x 6= 0).
x
(c) f (x) = x (here assume x > 0)
4.1.4 Let
(
x , if x > 0;
f (x) =
0, if x 0.
106 4.1 DEFINITION AND BASIC PROPERTIES OF THE DERIVATIVE
4.1.5 Use Theorems 4.1.3 and 4.1.5 to compute the derivatives of the following functions at the
indicated points (see also Example 4.1.4). (Assume known that the function sin x is differentiable at
all points and that its derivative is cos x.)
3x4 + 7x
(a) f (x) = at a = 1.
2x2 + 3
(b) f (x) = sin5 (3x2 + 2 x) at a =
8
4.1.7 Determine if each of the following functions is differentiable at 0. Justify your answer.
(
x2 , if x Q;
(a) f (x) = 3
x , if x
/ Q.
x f (a) a f (x)
(a) limxa .
xa
f (x)g(a) f (a)g(x)
(b) limxa .
xa
4.1.9 Let G be an open subset of R and a G. Prove that if f : G R is Lipschitz continuous,
then g(x) = ( f (x) f (a))2 is differentiable at a.
4.1.10 B Let f be differentiable at a and f (a) > 0. Find the following limit:
!n
f (a + n1 )
lim .
n f (a)
4.1.14 Prove that f : R R, given by f (x) = |x|3 , is in C2 (R) but not in C3 (R) (refer to Defini-
tion 4.1.3). (Hint: the key issue is differentiability at 0.)
Similarly, we say that f has a local (or relative) maximum at a D if there exists > 0 such that
In January 1638, Pierre de Fermat described his method for finding maxima and minima in a
letter written to Marin Mersenne (15881648) who was considered as the center of the world of
science and mathematics during the first half of the 1600s. His method presented in the theorem
below is now known as Fermats Rule.
Theorem 4.2.1 Fermats Rule. Let I be an open interval and f : I R. If f has a local minimum
or maximum at a I and f is differentiable at a, then f 0 (a) = 0.
108 4.2 THE MEAN VALUE THEOREM
Proof: Suppose f has a local minimum at a. Then there exists > 0 sufficiently small such that
Theorem 4.2.2 Rolles Theorem. Let a, b R with a < b and f : [a, b] R. Suppose f is
continuous on [a, b] and differentiable on (a, b) with f (a) = f (b). Then there exists c (a, b) such
that
f 0 (c) = 0. (4.3)
Proof: Since f is continuous on the compact set [a, b], by the extreme value theorem (Theorem 3.4.2)
there exist x1 [a, b] and x2 [a, b] such that
f (x1 ) = min{ f (x) : x [a, b]} and f (x2 ) = max{ f (x) : x [a, b]}.
Then
If x1 (a, b) or x2 (a, b), then f has a local minimum at x1 or f has a local maximum at x2 . By
Theorem 4.2.1, f 0 (x1 ) = 0 or f 0 (x2 ) = 0, and (4.3) holds with c = x1 or c = x2 .
If both x1 and x2 are the endpoints of [a, b], then f (x1 ) = f (x2 ) because f (a) = f (b). By (4.4),
f is a constant function, so f 0 (c) = 0 for any c (a, b).
We are now ready to use Rolles Theorem to prove the Mean Value Theorem presented below.
Theorem 4.2.3 Mean Value Theorem. Let a, b R with a < b and f : [a, b] R. Suppose f
is continuous on [a, b] and differentiable on (a, b). Then there exists c (a, b) such that
f (b) f (a)
f 0 (c) = . (4.5)
ba
Proof: The linear function whose graph goes through (a, f (a)) and (b, f (b)) is
f (b) f (a)
g(x) = (x a) + f (a).
ba
110 4.2 THE MEAN VALUE THEOREM
Define
f (b) f (a)
h(x) = f (x) g(x) = f (x) (x a) + f (a) for x [a, b].
ba
Then h(a) = h(b), and h satisfies the assumptions of Theorem 4.2.2. Thus, there exists c (a, b)
such that h0 (c) = 0. Since
f (b) f (a)
h0 (x) = f 0 (x) ,
ba
it follows that
f (b) f (a)
f 0 (c) = 0.
ba
Thus, (4.5) holds.
sin x sin 0
= cos c.
x0
| sin x|
Therefore, = | cos c|. Since | cos c| 1 we conclude | sin x| |x| for all x > 0. Next suppose
|x|
x < 0. Another application of the Mean Value Theorem shows there exists c (x, 0) such that
sin 0 sin x
= cos c.
0x
| sin x|
Then, again, = | cos c| 1. It follows that | sin x| |x| for x < 0. Since equality holds for
|x|
x = 0, we conclude that | sin x| |x| for all x R.
Example 4.2.2 We show that
1 + 4x < (5 + 2x)/3 for all x > 2.
Let f (x) = 1 + 4x for all x 2. Then
4 2
f 0 (x) = = .
2 1 + 4x 1 + 4x
Now, fix x R such that x > 2. We apply the Mean Value Theorem to f on the interval [2, x]. Then,
since f (2) = 3, there exists c (2, x) such that
1 + 4x 3 = f 0 (c)(x 2).
Since f 0 (2) = 2/3 and f 0 (c) < f 0 (2) for c > 2 we conclude that
2
1 + 4x 3 < (x 2).
3
Rearranging terms provides the desired inequality.
111
A more general result which follows directly from the Mean Value Theorem is known as Cauchys
Theorem.
Theorem 4.2.4 Cauchys Theorem. Let a, b R with a < b. Suppose f and g are continuous
on [a, b] and differentiable on (a, b). Then there exists c (a, b) such that
Proof: Define
Then h(a) = f (b)g(a) f (a)g(b) = h(b), and h satisfies the assumptions of Theorem 4.2.2. Thus,
there exists c (a, b) such that h0 (c) = 0. Since
f (x) f (a)
lim
xa+ xa
exists, we say that f has a right derivative at a and write
f (x) f (a)
f+0 (a) = lim+ .
xa xa
If the limit
f (x) f (b)
lim
xb xb
exists, we say that f has a left derivative at b and write
f (x) f (b)
f0 (b) = lim .
xb xb
We will say that f is differentiable on [a, b] if f 0 (x) exists for each x (a, b) and, in addition, both
f+0 (a) and f0 (b) exist.
Theorem 4.2.5 Intermediate Value Theorem for Derivatives. Let a, b R with a < b. Sup-
pose f is differentiable on [a, b] and
f 0 (c) = .
112 4.2 THE MEAN VALUE THEOREM
g(x) = f (x) x.
Thus,
g(x) g(a)
lim+ < 0.
xa xa
It follows that there exists 1 > 0 such that
Since g is continuous on [a, b], it attains its minimum at a point c [a, b]. From the observations
above, it follows that c (a, b). This implies g0 (c) = 0 or, equivalently, that f 0 (c) = .
Remark 4.2.6 The same conclusion follows if f+0 (a) > > f0 (b).
113
Exercises
4.2.1 B Let f and g be differentiable at x0 . Suppose f (x0 ) = g(x0 ) and
4.2.2 Prove the following inequalities using the Mean Value Theorem.
(a) 1 + x < 1 + 12 x for x > 0.
(b) ex > 1 + x, for x > 0. (Assume known that the derivative of ex is itself.)
x1
(c) < ln x < x 1, for x > 1. (Assume known that the derivative of ln x is 1/x.)
x
4.2.3 I Prove that | sin(x) sin(y)| |x y| for all x, y R.
4.2.4 B Let n be a positive integer and let ak , bk R for k = 1, . . . , n. Prove that the equation
n
x + (ak sin kx + bk cos kx) = 0
k=1
has a solution on (, ).
4.2.5 B Let f and g be differentiable functions on [a, b]. Suppose g(x) 6= 0 and g0 (x) 6= 0 for all
x [a, b]. Prove that there exists c (a, b) such that
1 f (a) f (b)
= 1 f (c) g(c)
,
g (c) f 0 (c) g0 (c)
g(b) g(a) g(a) g(b)
0
a1 + a2 x + a3 x2 + + an xn1 = 0
4.2.7 Let f : [0, ) R be a differentiable function. Prove that if both limx f (x) and limx f 0 (x)
exist, then limx f 0 (x) = 0
f (x)
(a) Show that if limx f 0 (x) = a, then limx = a.
x
f (x)
(b) Show that if limx f 0 (x) = , then limx = .
x
(c) Are the converses in part (a) and part (b) true?
Proposition 4.3.1 Let f be continuous on [a, b] and differentiable on (a, b). If f 0 (x) = 0 for all
x (a, b), then f is constant on [a, b].
Proof: Suppose by contradiction that f is not constant on [a, b]. Then there exist a1 and b1 such
that a a1 < b1 b and f (a1 ) 6= f (b1 ). By Theorem 4.2.3, there exists c (a1 , b1 ) such that
f (b1 ) f (a1 )
f 0 (c) = 6= 0,
b1 a1
which is a contradiction.
The next application of the Mean Value Theorem concerns developing simple criteria for
monotonicity of real-valued functions based on the derivative.
Example 4.3.1 Let n N and f : [0, ) R be given by f (x) = xn . Then f 0 (x) = nxn1 . There-
fore, f 0 (x) > 0 for all x > 0 and, so, f is strictly increasing. In particular, this shows that every
positive real number has exactly one n-th root (refer to Example 3.4.2).
Example 4.3.2 Let n N and consider the function f : (0, ) R given by f (x) = xn . Then f
is differentiable and f 0 (x) = nxn1 6= 0 for all x (0, ). It is also clear that f ((0, )) = (0, ). It
follows from the Inverse Function Theorem that f 1 : (0, ) (0, ) is differentiable and given
y (0, )
1 1
( f 1 )0 (y) = = .
f 0 ( f 1 (y)) n( f 1 (y))n1
Given y > 0, the value f 1 (y) is the unique positive real number whose n-th power is y. We call
f 1 (y) the (positive) n-th root of y and denote it by n y. We also obtain the formula
1
( f 1 )0 (y) = .
n( y)n1
n
Exercises
4.3.1 (a) Let f : R R be differentiable. Prove that if f 0 (x) is bounded, then f is Lipschitz
continuous and, in particular, uniformly continuous.
(b) Give an example of a function f : (0, ) R which is differentiable and uniformly continuous
but such that f 0 (x) is not bounded.
4.3.3 B Let f and g be differentiable functions on R such that f (x0 ) = g(x0 ) and
Prove that
4.3.5 B Let f be twice differentiable on an open interval I. Suppose that there exist a, b, c I
with a < b < c such that f (a) < f (b) and f (b) > f (c). Prove that there exists d (a, c) such that
f 00 (d) < 0.
4.3.6 B Prove that the function f defined in Exercise 4.1.11 is not monotone on any open interval
containing 0.
Theorem 4.4.1 Suppose f and g are continuous on [a, b] and differentiable on (a, b). Suppose
f (x) = g(x) = 0, where x [a, b]. Suppose further that there exists > 0 such that g0 (x) 6= 0 for all
x B(x; ) [a, b], x 6= x.
If
f 0 (x)
lim = `,
xx g0 (x)
then
f (x)
lim = `. (4.9)
xx g(x)
Proof: Let {xk } be a sequence in [a, b] that converges to x and such that xk 6= x for every k. By
Theorem 4.2.4, for each k, there exists a sequence {ck }, with ck between xk and x, such that
f (xk ) f 0 (ck )
= 0 .
g(xk ) g (ck )
Under the assumptions that g0 (x) 6= 0 for x near x and g(x) = 0, we also have g(xk ) 6= 0 for sufficiently
large k. By the squeeze theorem (Theorem 2.1.6), {ck } converges to x. Thus,
2x + sin x
lim = 1.
x0 x2 + 3x
118 4.4 LHOSPITALS RULE
First we observe that the conditions of Theorem 4.4.1 hold. Here f (x) = 2x + sin x, g(x) = x2 + 3x,
and x = 0. We may take [a, b] = [1, 1], for example, so that f and g are continuous on [a, b]
f (x)
and differentiable on (a, b) and, furthermore, g(x) is well defined on [a, b] \ {x}. Moreover, taking
0
= 7/3, we get g (x) = 2x + 3 6= 0 for x B(x; ) [a, b]. Finally we calculate the limit of the
quotient of derivatives using Theorem 3.2.1 to get
3x3 2x2 + 4x 5
lim .
x1 4x4 2x 2
Here f (x) = 3x3 2x2 + 4x 5 and g(x) = 4x4 2x 2. Thus f (1) = g(1) = 0. Moreover, f 0 (x) =
9x2 4x + 4 and g0 (x) = 16x3 2. Since g0 (1) = 14 6= 0 and g0 is continuous we have g0 (x) 6= 0 for
x near 1. Now,
9x2 4x + 4 9
lim = .
x1 16x3 2 14
9
Thus, the desired limit is 14 as well.
Example 4.4.3 If the derivatives of the functions f and g themselves satisfy the assumptions of
Theorem 4.4.1 we may apply LHospitals rule to determine first the limit of f 0 (x)/g0 (x) and then
apply the rule again to determine the original limit.
Consider the limit
x2
lim .
x0 1 cos x
Here f (x) = x2 and g(x) = 1 cos x so both functions and all its derivatives are continuous. Now
g0 (x) = sin x and, so, g0 (x) 6= 0 for x near zero, x 6= 0. Also, f 0 (0) = 0 = g0 (0) and g00 (x) = cos x 6= 0
for x near 0. Moreover,
f 00 (x) 2
lim 00
= lim = 2.
x0 g (x) x0 cos x
f 0 (x) f 00 (x) 2
lim = lim = lim = 2.
x0 g0 (x) x0 g00 (x) x0 cos x
x2 f (x) f 0 (x)
lim = lim = lim 0 = 2.
x0 1 cos x x0 g(x) x0 g (x)
119
Using the derivative rules at x 6= 0 and the definition of derivative at x = 0 we can see that f is
differentiable and
(
2x sin 1x cos 1x , if x 6= 0;
f 0 (x) =
0, if x = 0,
However, f 0 is not continuous at 0 (since limx0 f 0 (x) does not exist) and, hence, LHospitals rule
cannot be applied in this case.
x2 sin 1x
On the other hand limx0 x+3x2
does exist as we can see from
Theorem 4.4.2 Let a, b R, a < b, and x (a, b). Suppose f , g : (a, b) \ {x} R are differentiable
on (a, b) \ {x} and assume limxx f (x) = limxx g(x) = . Suppose further that there exists > 0
such that g0 (x) 6= 0 for all x B(x; ) (a, b), x 6= x.
If ` R and
f 0 (x)
lim = `, (4.10)
xx g0 (x)
then
f (x)
lim = `. (4.11)
xx g(x)
Proof: Since limxx f (x) = limxx g(x) = , choosing a smaller positive if necessary, we can
assume that f (x) 6= 0 and g(x) 6= 0 for all x B(x; ) (a, b).
f (x) f (x)
We will show that limxx+ = `. The proof that limxx = ` is completely analogous.
g(x) g(x)
Fix any > 0. We need to find 0 > 0 such that | f (x)/g(x) `| < whenever x B+ (x; 0 )
(a, b).
From (4.10), one can choose K > 0 and a positive 1 < such that
0 0
f (x)
K and f (x) ` <
g0 (x) g0 (x) 2 (4.12)
such x, since g0 (z) 6= 0 if x < z < , Rolles theorem (Theorem 4.2.2) guarantees that g(x) 6= g().
Therefore, for all x B+ (x; 2 ) we can write,
g()
1
f (x) f (x) f () g(x)
= .
g(x) g(x) g() f ()
1
f (x)
Now, define
g()
1
g(x)
H (x) = for x B+ (x; 2 ).
f ()
1
f (x)
Since limxx f (x) = limxx g(x) = , we have that limxx+ H (x) = 1. Thus, there exists a positive
< 2 such that
|H (x) 1| < whenever x B+ (x; ).
2K
For any x B+ (x; ), applying Theorem 4.2.4 on the interval [x, ], we can write [ f (x) f ()]g0 (c) =
[g(x) g()] f 0 (c) for some c (x, ) (note that, in particular, c B(x : 1 ) (a, b)). For such c we
get
f (x) f 0 (c)
= 0 H (x).
g(x) g (c)
Since c B(x : 1 ) (a, b), applying (4.12) we get that, for x B+ (x; ) = B+ (x; ) (a, b),
0
f (x) f (c)
g(x) ` =
g0 (c) H (x) `
0 0 (c)
f (c) f
= 0 (H (x) 1) + 0 `
g (c) g (c)
0 0
f (c) f (c)
0 |H (x) 1| + 0
`
g (c) g (c)
<K + = .
2K 2
Setting 0 = completes the proof.
Here f (x) = ln x2 , g(x) = 1 + 31 2 , x = 0, and we may take as (a, b) any open inteval containing
x
0. Clearly f and g satisfy the differentiability assumptions and g0 (x) 6= 0 for all x 6= 0. Moreover,
limxx f (x) = limxx g(x) = . We analyze the quotient of the derivatives. We have
2/x
3 5
x 3
lim 2 1 = lim 3 = lim 3 x2 = 0.
x0
3 3 5
x0 x x0
x
121
Remark 4.4.3 The proofs of Theorem 4.4.1 and Theorem 4.4.2 show that the results in these
theorems can be applied for left-hand and right-hand limits. Moreover, the results can also be
modified to include the case when x is an endpoint of the domain of the functions f and g.
The following theorem can be proved following the method in the proof of Theorem 4.4.1.
Theorem 4.4.4 Let f and g be differentiable on (a, ). Suppose g0 (x) 6= 0 for all x (a, ) and
If ` R and
f 0 (x)
lim = `,
x g0 (x)
then
f (x)
lim = `.
x g(x)
1
lim .
x x( arctan x)
2
1/x
Writing the quotient in the form we can apply Theorem 4.4.4. We now compute the limit
2 arctan x
of the quotient of the derivatives
1/x2 x2 + 1
lim = lim = 1.
x 21 x x2
x +1
Theorem 4.4.5 Let f and g be differentiable on (a, ). Suppose g0 (x) 6= 0 for all x (a, ) and
If ` R and
f 0 (x)
lim = `,
x g0 (x)
then
f (x)
lim = `.
x g(x)
122 4.4 LHOSPITALS RULE
ln x
lim .
x x
Clearly the functions f (x) = ln x and g(x) = x satisfy the conditions of Theorem 4.4.5. We have
f 0 (x) 1/x
lim = lim =0
x g0 (x) x 1
Exercises
4.4.1 Use LHospitals Rule to find the following limits (you may assume known all the relevant
derivatives from calculus):
x3 4x
(a) lim .
x2 3x2 + 5x 2
ex ex
(b) lim .
x0 sin x cos x
x1
(c) lim .
x1 x + 1 2
ex ex
(d) lim .
x0 ln(1 + x)
ln x
(e) lim .
x1 sin(x)
4.4.2 For the problems below use LHospitals rule as many times as appropriate to determine the
limits.
1 cos 2x
(a) lim .
x0 x sin x
(x 2 )2
(b) lim .
x0 1 sin x
x arctan x
(c) lim .
x0 x3
x sin x
(d) lim .
x0 x tan x
4.4.3 Use the relevant version of LHospitals rule to compute each of the following limits.
3x2 + 2x + 7
(a) lim .
x 4x2 6x + 1
ln x
(b) lim+ .
x0 cot x
123
2 arctan x
(c) lim .
ln(1 + 1x )
x
(d) lim xex . (Hint: first rewrite as a quotient.)
x
4.4.4 Prove that the following functions are differentiable at 1 and -1.
x2 ex2 , if |x| 1;
(a) f (x) = 1
, if |x| > 1.
e
arctan x, if |x| 1;
(b) f (x) = x1
sign x + , if |x| > 1.
4 2
4.4.5 B Let P(x) be a polynomial. Prove that
lim P(x)ex = 0.
x
f (n+1) (c)
f (x) = Pn (x) + (x x)n+1 ,
(n + 1)!
where
n
f (k) (x)
Pn (x) = (x x)k .
k=0 k!
Proof: Let x be as in the statement and let us fix x 6= x. Since x x 6= 0, there exists a number R
such that
f (x) = Pn (x) + (x x)n+1 .
(n + 1)!
We will now show that
= f (n+1) (c),
124 4.5 TAYLORS THEOREM
Then
n
f (k) (x)
g(x) = f (x) (x x)k (x x)n+1 = f (x) Pn (x) (x x)n+1 = 0.
k=0 k! (n + 1)! (n + 1)!
and
n
f (k) (x)
g(x) = f (x) (x x)k (x x)n+1 = f (x) f (x) = 0.
k=0 k! (n + 1)!
By Rolles theorem, there exists c in between x and x such that g0 (c) = 0. Taking the derivative of
g (keeping in mind that x is fixed and the independent variable is t) and using the product rule for
derivatives, we have
!
n
0 0 f (k+1) (c) k f (k) (c) k1
g (c) = f (c) + (x c) + (x c) + (x c)n
k=1 k! (k 1)! n!
1
= (x c)n f (n+1) (c)(x c)n
n! n!
= 0.
Remark 4.5.2 The conclusion of Taylors theorem still holds true if x = x. In this case, c = x = x.
Example 4.5.1 We will use Taylors theorem to estimate the error in approximating the function
f (x) = sin x with it 3rd Taylor polynomial at x = 0 on the interval [/2, /2]. Since f 0 (x) = cos x,
f 00 (x) = sin x and f 000 (x) = cos x, a direct calculation shows that
x3
P3 (x) = x .
3!
More over, for any c R we have | f (4) (c)| = | sin c| 1. Therefore, for x [/2, /2] we get (for
some c between x and 0),
| f (4) (c)| /2
| sin x P3 (x)| = |x| 0.066.
4! 4!
Theorem 4.5.3 Let n be an even positive integer. Suppose f (n) exists and continuous on (a, b). Let
x (a, b) satisfy
Proof: We will prove (a). Suppose f (n) (x) > 0. Since f (n) (x) > 0 and f (n) is continuous at x, there
exists > 0 such that
Fix any x B(x; ). By Taylors theorem and the given assumption, there exists c in between x and x
such that
f (n) (c)
f (x) = f (x) + (x x)n .
n!
Since n is even and c B(x; ), we have f (x) f (x). Thus, f has a local minimum at x.
Now, for the converse, suppose that f has a local minimum at x. Then there exists > 0 such
that
Fix a sequence {xk } in (a, b) that converges to x with xk 6= x for every k. By Taylors theorem, there
exists a sequence {ck }, with ck between xk and x for each k, such that
f (n) (ck )
f (xk ) = f (x) + (xk x)n .
n!
f (xk ) f (x)
f (n) (ck )
f (xk ) f (x) = (xk x)n 0.
n!
This implies f (n) (ck ) 0 for such k. Since {ck } converges to x, f (n) (x) = limk f (n) (ck ) 0.
The proof of (b) is similar.
Example 4.5.2 Consider the function f (x) = x2 cos x defined on R. Then f 0 (x) = 2x cos x x2 sin x
and f 00 (x) = 2 cos x 4x sin x x2 cos x. Then f (0) = f 0 (0) = 0 and f 00 (0) = 2 > 0. It follows from
the previous theorem that f has a local minimum at 0. Notice, by the way, that since f (0) = 0 and
f () < 0, 0 is not a global minimum.
Example 4.5.3 Consider the function f (x) = x6 + 2x5 + x4 4x3 + x2 + 2x 3 defined on R. A
direct calculations shows f 0 (1) = f 00 (1) = f 000 (1) = f (4) (1) = 0 and f (5) (1) < 0. It follows from the
previous theorem that f has a local maximum at 1.
126 4.6 CONVEX FUNCTIONS AND DERIVATIVES
Exercises
4.5.1 B Use Taylors theorem to prove that
m
xk
ex > k!
k=0
4.5.2 Find the 5th Taylor polynomial, P5 (x), at x = 0 for cos x. Determine an upper bound for the
error |P5 (x) cos x| for x [/2, /2].
4.5.3 Use Theorem 4.5.3 to determine if the following functions have a local minimum or a local
maximum at the indicated points.
4.5.4 Suppose f is twice differentiable on (a, b). Show that for every x (a, b),
f (x + h) + f (x h) 2 f (x)
lim = f 00 (x).
h0 h2
4.5.5 I (a) Suppose f is three times differentiable on (a, b) and x (a, b). Prove that
2
f (x + h) f (x) f 0 (x) 1!h f 00 (x) h2! f 000 (x)
lim = .
h0 h3 3!
(b) State and prove a more general result for the case where f is n times differentiable on (a, b).
n
hn
Pn (h) = f (n) (x) n! for h R
k=0
Prove that
f (x + h) Pn (h)
lim = 0.
h0 hn
(Thus, we have
f (x + h) = Pn (h) + g(h),
f ( x + (1 )y) = ( x + (1 )y)2
= 2 x2 + 2 (1 )xy + (1 )2 y2
2 x2 + (1 )(x2 + y2 ) + (1 )2 y2
= ( x2 + (1 )x2 ) + (1 )( y2 + (1 )y2 )
= x2 + (1 )y2
= f (x) + (1 ) f (y).
(c) f : R R, f (x) = |x|. This follows from the triangle inequality and other basic properties of
absolute value.
Theorem 4.6.1 Let I be an interval of R. A function f : I R is convex if and only if for every
i 0, i = 1, . . . , n, with ni=1 i = 1 (n 2) and for every xi I, i = 1, . . . , n,
!
n n
f i xi i f (xi ). (4.13)
i=1 i=1
Proof: Since the converse holds trivially, we only need to prove the implication by induction. The
conclusion holds for n = 2 by the definition of convexity. Let k be such that the conclusion holds for
any n with 2 n k. We will show that it also holds for n = k + 1. Fix i 0, i = 1, . . . , k + 1, with
k+1
i=1 i = 1 and fix every xi I, i = 1, . . . , k + 1. Then
k
i = 1 k+1 .
i=1
128 4.6 CONVEX FUNCTIONS AND DERIVATIVES
If k+1 = 1, then i = 0 for all i = 1, . . . , k, and (4.13) holds. Suppose 0 k+1 < 1. Then, for each
i = 1, . . . , k, i /(1 k+1 ) 0 and
k
i
1 k+1 = 1.
i=1
It follows that
!
k+1
ki=1 i xi
f i xi = f (1 k+1 ) + k+1 xk+1
i=1 1 k+1
k
i=1 i xi
(1 k+1 ) f + k+1 f (xk+1 )
1 k+1
!
k
i
= (1 k+1 ) f xi + k+1 f (xk+1 )
i=1 1 k+1
k
i
(1 k+1 ) f (xi ) + k+1 f (xk+1 )
i=1 1 k+1
k+1
= i f (xi ),
i=1
where the first inequality follows from the definition of convexity (or is trivial if k+1 = 0) and the
last inequality follows from the inductive assumption. The proof is now complete.
Theorem 4.6.2 Let I be an interval and let f : I R be a convex function. Then f has a local
minimum at x if and only if f has an absolute minimum at x.
Proof: Clearly if f has a global minimum at x, then it also has a local minimum at x.
Conversely, suppose that f has a local minimum at x. Then there exists > 0 such that
1 1
f (x) f (x)
n n
and, hence, f (x) f (x). Since x was arbitrary, this shows f has an absolute minimum at x.
Theorem 4.6.3 Let I be an open interval and let f : I R be a convex function. Suppose f is
differentiable at x. Then
Since f is differentiable at x,
This implies
Lemma 4.6.5 Let I be an open interval and suppose f : I R is a convex function. Fix a, b, x I
with a < x < b. Then
f (x) f (a) f (b) f (a) f (b) f (x)
.
xa ba bx
Proof: Let
xa
t= .
ba
Then t (0, 1) and
xa
f (x) = f (a + (x a)) = f a + (b a) = f (a + t(b a)) = f (tb + (1 t)a).
ba
By convexity of f , we obtain
Thus,
xa
f (x) f (a) t f (b) + (1 t) f (a) f (a) = t[ f (b) f (a)] = ( f (b) f (a)).
ba
130 4.6 CONVEX FUNCTIONS AND DERIVATIVES
Equivalently,
f (x) f (a) f (b) f (a)
.
xa ba
Similarly,
xb
f (x) f (b) t f (b) + (1 t) f (a) f (b) = (1 t)[ f (a) f (b)] = [ f (b) f (a)].
ba
It follows that
f (b) f (a) f (b) f (x)
.
ba bx
The proof is now complete.
Theorem 4.6.6 Let I be an open interval and let f : I R be a differentiable function. Then f is
convex if and only if f 0 is increasing on I.
Proof: Suppose f is convex. Fix a < b with a, b I. By Lemma 4.6.5, for any x (a, b), we have
f (x) f (a) f (b) f (a)
.
xa ba
This implies, taking limits, that
f (b) f (a)
f 0 (a) .
ba
Similarly,
f (b) f (a)
f 0 (b).
ba
Therefore, f 0 (a) f 0 (b), and f 0 is an increasing function.
Let us prove the converse. Suppose f 0 is increasing. Fix x1 < x2 and t (0, 1). Then
x1 < xt < x2 ,
where xt = tx1 + (1 t)x2 . By the Mean Value Theorem (Theorem 4.2.3), there exist c1 and c2 such
that
with
This implies
t f (xt ) t f (x1 ) = f 0 (c1 )t(1 t)(x2 x1 ) f 0 (c2 )t(1 t)(x2 x1 ) = (1 t) f (x2 ) (1 t) f (xt ).
Corollary 4.6.7 Let I be an open interval and let f : I R be a function. Suppose f is twice
differentiable on I. Then f is convex if and only if f 00 (x) 0 for all x I.
Proof: It follows from Proposition 4.3.2 that f 00 (x) 0 for all x I if and only if the derivative
function f 0 is increasing on I. The conclusion then follows directly from Theorem 4.6.6.
Example 4.6.2 Consider the function f : R R given by f (x) =
x2 + 1. Now, f 0 (x) =
x/ x2 + 1 and f 00 (x) = 1/(x2 + 1)3/2 . Since f 00 (x) 0 for all x, it follows from the corollary
that f is convex.
Theorem 4.6.8 Let I be an open interval and let f : I R be a convex function. Then it is locally
Lipschitz continuous in the sense that for any x I, there exist ` 0 and > 0 such that
In particular, f is continuous.
Proof: Fix any x I. Choose four numbers a, b, c, d satisfying
Choose > 0 such that B(x; ) (b, c). Let u, v B(x; ) with v < u. Then by Lemma 4.6.5, we
see that
f (b) f (a) f (u) f (a) f (u) f (v) f (d) f (v) f (d) f (c)
.
ba ua uv d v d c
Exercises
4.6.1 (a) Let I be an interval and let f , g : I R be convex functions. Prove that c f , f + g, and
max{ f , g} are convex functions on I, where c 0 is a constant.
(b) Find two convex functions f and g on an interval I such that f g is not convex.
4.6.2 Let f : R R be a convex function. Given a, b R, prove that the function defined by
4.6.3 I Let I be an interval and let f : I R be a convex function. Suppose that is a convex,
increasing function on an interval J that contains f (I). Prove that f is convex on I.
4.6.4 B Prove that each of the following functions is convex on the given domain:
The set of all subderivatives of f at x is called the subdifferential of f at x and is denoted by f (x).
133
Lemma 4.7.1 Let f : R R be a convex function. Fix a R. Define the slope function a by
f (x) f (a)
a (x) = (4.17)
xa
for x (, a) (a, ). Then, for x1 , x2 (, a) (a, ) with x1 < x2 , we have
a (x1 ) a (x2 ).
Theorem 4.7.2 Let f : R R be a convex function and let x R. Then f has left derivative and
right derivative at x. Moreover,
Proof: By Lemma 4.7.1, the slope function x defined by (4.17) is increasing on the interval (x, )
and bounded below by x (x 1). By Theorem 3.2.4, the limit
f (x) f (x)
lim x (x) = lim+
xx+ xx x x
exists and is finite. Moreover,
This implies
f (x) f (x)
u for all x > x.
x x
135
Thus,
f (x) f (x)
u lim+ = f+0 (x).
xx x x
Similarly, we have
Thus,
f (x) f (x)
u for all x < x.
x x
This implies u f0 (x). So
To prove the opposite inclusion, take u [ f0 (x), f+0 (x)]. By Theorem 4.7.2
f (x) f (x)
u x (x) = for all x > x.
x x
It follows that
f (x) = { f 0 (x)}.
By Theorem 4.7.3,
Example 4.7.2 Let f (x) = a|x b| + c, where a > 0. Then f is a convex function and
Thus,
Definition 4.7.2 Let A and B be two nonempty subsets of R and let R. Define
A + B = {a + b : a A, b B} and A = {a : a A}.
Theorem 4.7.5 Let f , g : R R be convex functions and let > 0. Then f + g and f are convex
functions and
By Theorem 4.7.3,
Example 4.7.3 Let a1 < a2 < < an and let i > 0 for i = 1, . . . , n. Define
n
f (x) = i |x ai |.
i=1
where
0 0
m = min fi (x) and M = max fi+ (x).
iI(x) iI(x)
This implies
Thus, f is a convex function. Similarly we verify that f+0 (x) = M and f0 (x) = m. By Theorem 4.7.3,
Remark 4.7.7 The product of two convex functions is not a convex function in general. For instance,
f (x) = x and g(x) = x2 are convex functions, but h(x) = x3 is not a convex function.
The following result may be considered as a version of the first derivative test for extrema in the
case of non differentiable functions.
Theorem 4.7.8 Let f : R R be a convex function. Then f has an absolute minimum at x if and
only if
This implies
0 (x x) = 0 f (x) f (x) for all x R .
It follows from (4.16) that 0 f (x).
Conversely, if 0 f (x), again, by (4.16),
0 (x x) = 0 f (x) f (x) for all x R .
Thus, f has an absolute minimum at x.
Example 4.7.4 Let k be a positive integer and a1 < a2 < < a2k1 . Define
2k1
f (x) = |x ai |,
i=1
for x R. It follows from the subdifferential formula in Example 4.7.3 that 0 f (x) if and only if
x = ak . Thus, f has a unique absolute minimum at ak .
Then 0 g(x) if and only if x [ak , ak+1 ]. Thus, g has an absolute minimum at any point of
[ak , ak+1 ].
139
The following theorem is a version of the Mean Value Theorem (Theorem 4.2.3) for nondifferen-
tiable functions.
Theorem 4.7.9 Let f : R R be a convex function and let a < b. Then there exists c (a, b) such
that
f (b) f (a)
f (c). (4.19)
ba
Proof: Define
f (b) f (a)
g(x) = f (x) (x a) + f (a) .
ba
Then g is a convex function and g(a) = g(b). Thus, g has a local minimum at some c (a, b) and,
hence, g also has an absolute minimum at c. Observe that the function
f (b) f (a)
h(x) = (x a) + f (a)
ba
f (x + h) f (x) `|h|
f+0 (x) = lim+ lim+ = `.
h0 h h0 h
f (v) f (u)
f (c) [`, `],
vu
for some c in between u and v. This implies
Exercises
4.7.1 B Find subdifferentials of the following functions:
4.7.3 I Let f (x) = nk=1 |x k|. Find all absolute minimizers of the function.
141
4.7.5 B Let f : R R be a convex function. Suppose that f (x) [0, ) for all x R. Prove that
f is monotone increasing on R.
5. Solutions and Hints for Selected Exercises
SECTION 1.1
Exercise 1.1.2. Applying basic rules of operations on sets yields
(X \Y ) Z = Y c Z = Z \Y.
and
Z \ (Y Z) = (Z \Y ) (Z \ Z) = (Z \Y ) 0/ = Z \Y.
Therefore, (X \Y ) Z = Z \ (Y Z).
SECTION 1.2
Exercise 1.2.1. (a) For any a A, we have f (a) f (A) and, so, a f 1 ( f (A)). This implies A
f 1 ( f (A)). Note that this inclusion does not require the injectivity of f . Now fix any a f 1 ( f (A)).
Then f (a) f (A), so there exists a0 A such that f (a) = f (a0 ). Since f is one-to-one, a = a0 A.
Therefore, f 1 ( f (A)) A and the equality holds.
(b) Fix any b f ( f 1 (B)). Then b = f (x) for some x f 1 (B). Thus, b = f (x) B and, hence,
f ( f 1 (B)) B. This inclusion does not require the surjectivity of f . Now fix b B. Since f is onto,
there exists x X such that f (x) = b B. Thus, x f 1 (B) and, hence, b f ( f 1 (B)). We have
shown that B f ( f 1 (B)) and the equality holds.
Without the injectivity of f , the equality in part (a) is no longer valid. Consider f (x) = x2 , x R,
and A = [1, 2]. Then f (A) = [0, 4] and, hence, f 1 ( f (A)) = [2, 2], which strictly contains A. It is
also not hard to find an example of a function f and a set B for which the equality in part (b) does
not hold true.
144 Solutions and Hints for Selected Exercises
SECTION 1.3
Exercise 1.3.6. For n = 1,
1 h1+ 5 1 5i 1 2 5
= = 1.
5 2 2 5 2
Thus, the conclusion holds for n = 1. It is also easy to verify that the conclusion holds for n = 2.
Suppose that
1 h 1 + 5 k 1 5 k i
ak =
5 2 2
for all k n, where n 2. Let us show that
1 h 1 + 5 n+1 1 5 n+1 i
an+1 = . (5.1)
5 2 2
By the definition of the sequence and the induction hypothesis,
an+1 = an + an1
1 h 1 + 5 n 1 5 n i 1 h 1 + 5 n1 1 5 n1 i
= +
5 2 2 5 2 2
1 h 1 + 5 n1 1 + 5
1 5 n1 1 5
i
= +1 1 .
5 2 2 2 2
Observe that
1+ 5 3 + 5 1 + 5 2 1 5 3 5 1 5 2
+1 = = and +1 = = .
2 2 2 2 2 2
Therefore, (5.1) follows easily.
1+ 5 1 5
In this exercise, observe that the two numbers and are the roots of the quadratic
2 2
equation
x2 = x + 1.
A more general result can be formulated as follows. Consider the sequence {an } defined by
a1 = a;
a2 = b;
an+2 = an+1 + an for n N .
Suppose that the equation x2 = x + has two solutions x1 and x2 . Let c1 and c2 be two constants
such that
c1 x1 + c2 x2 = a;
c1 (x1 )2 + c2 (x2 )2 = b.
This is a very useful method to find a general formula for a sequence defined recursively as above.
For example, consider the sequence
a1 = 1;
a2 = 1;
an+2 = an+1 + 2an for n N .
xn = c1 2n + c2 (1)n ,
c1 (2) + c2 (1) = 1;
c1 (2)2 + c2 (1)2 = 1.
1 1
an = 2n (1)n for all n N .
3 3
Exercise 1.3.8. Hint: Prove first that, for k = 1, 2, . . . , n, we have
n n n+1
+ = .
k k1 k
SECTION 1.4
Exercise 1.4.5. In general, to prove that |a| m, where m 0, we only need to show that a m and
a m.
For any x, y R,
|x| = |x y + y| |x y| + |y|,
This implies
Similarly,
|y| = |y x + x| |x y| + |x|,
This implies
Therefore,
SECTION 1.5
Exercise 1.5.4.
Let us first show that A + B is bounded above. Since A and B are nonempty and bounded above,
by the completeness axiom, sup A and sup B exist and are real numbers. In particular, a sup A for
all a A and b sup B for all b B.
For any x A+B, there exist a A and b B such that x = a+b. Thus, x = a+b sup A+sup B,
which shows that A + B is bounded above.
We will now show that sup A + sup B is the supremum of the set A + B by showing that sup A +
sup B satisfies conditions (1) and (2) of Proposition 1.5.1.
We have just shown that sup A + sup B is an upper bound of A + B and, hence, sup A + sup B
satisfies condition (1).
Now let > 0. Using 2 in part (2) of Proposition 1.5.1 applied to the sets A and B, there exits
a A and b B such that
sup A < a and sup B < b.
2 2
It follows that
This proves condition (2). It follows from Proposition 1.5.1 applied to the set A + B that sup A +
sup B = sup(A + B) as desired.
SECTION 1.6
1
Exercise 1.6.2. Let x = . By Theorem 1.6.2(d), there exists m Z such that
r
1
m1 < m.
r
Since 1/r > 1, we get m > 1 and, so, m 2. It follows that m 1 N. Set n = m 1 and then we
get
1 1
<r .
n+1 n
SECTION 2.1
Exercise 2.1.12. (a) Suppose that limn an = `. Then by Theorem 2.1.9,
Now suppose that (5.2) is satisfied. Fix any > 0. Choose N1 N such that
Therefore, limn an = `.
This problem is sometimes very helpful to show that a limit exists. For example, consider the
sequence defined by
x1 = 1/2,
1
xn+1 = for n N .
2 + xn
that {x2n+1 } and {x2n } both converge to
We will see later 2 1, so we can conclude that {xn }
converges to 2 1.
(b) Use a similar method to the solution of part (a).
Exercise 2.1.8. Consider the case where ` > 0. By the definition of limit, we can find n1 N such
that
|an | > `/2 for all n n1 .
Given any > 0, we can find n2 N such that
`
|an `| < for all n n2 .
4
Choose n0 = max{n1 , n2 }. For any n n0 , one has
`
+ `4
an+1 |an an+1 | |an `| + |an+1 `| 4
an 1 = < = .
|an | |an | `
2
SECTION 2.2
Exercise 2.2.3. (a) The limit is calculated as follows:
p n2 + n n n2 + n + n
lim n2 + n n = lim
n n n2 + n + n
n
= lim
n n2 + n + n
n
= lim p
n n2 (1 + 1/n) + n
1
= lim p = 1/2.
n 1 + 1/n + 1
148 Solutions and Hints for Selected Exercises
3n2
= lim p p
n 3 n6 (1 + 3/n)2 + n 3 n3 (1 + 3/n) + n2
3n2
= lim p
n 2 3
p
n (1 + 3/n)2 + 3 (1 + 3/n) + 1
3
= lim p = 1.
n 3
p
2 3
(1 + 3/n) + (1 + 3/n) + 1
(c) We use the result in par (a) and part (b) to obtain
p p p p
3 3
lim ( n3 + 3n2 n2 + 1) = lim n3 + 3n2 n + n n2 + 1
n n
p p
3
= lim n3 + 3n2 n + lim n n2 + 1 = 1 1/2 = 1/2.
n n
SECTION 2.3
Exercise 2.3.1. (a) Clearly, a1 < 2. Suppose that ak < 2 for k N. Then
p
ak+1 = 2 + ak < 2 + 2 = 2.
ak + 2 < ak+1 + 2,
which implies p p
ak + 2 < ak+1 + 2.
Thus, ak+1 < ak+2 . By induction, an < an+1 for all n N. Therefore, {an } is an increasing sequence.
By the monotone convergence theorem, limn an exists. Let ` = limn an . Since an+1 =
(c)
2 + an and limn an+1 = `, we have
` = 2 + ` or `2 = 2 + `.
149
a1 = c > 0,
an+1 = c + an for n N .
1 + 1 + 4c
We can prove that {an } is monotone increasing and bounded above by . In fact, {an }
2
1 + 1 + 4c
converges to this limit. The number is obtained by solving the equation ` = c + `,
2
where ` > 0.
Exercise 2.3.3. (a) Let {an } be the given sequence. Observe that an+1 = 2an . Then show that
{an } is monotone increasing and bounded above. The limit is 2.
(b) Let {an } be the given sequence. Then
1
an+1 = .
2 + an
Show that {a2n+1 } is monotone decreasing and bounded below; {a2n } is monotone increasing and
bounded above. Thus, {an } converges by Exercise 2.1.12. The limit is 2 1.
SECTION 2.4
Exercise 2.4.1. Here we use the fact that in R a sequence is a Cauchy sequence if and only if it is
convergent.
(a) Not a Cauchy sequence. See Example 2.1.7.
(b) A Cauchy sequence. This sequence converges to 0.
(c) A Cauchy sequence. This sequence converges to 1.
(d) A Cauchy sequence. This sequence converges to 0 (see Exercise 2.1.5).
SECTION 2.5
Exercise 2.5.4. (a) Define
By the definition,
By Exercise 2.5.3,
n n + n for all n N .
This implies
lim n lim n + lim n for all n N .
n n n
Therefore,
lim sup(an + bn ) lim sup an + lim sup bn .
n n n
This conclusion remains valid for unbounded sequences provided that the right-hand side is well-
defined. Note that the right-hand side is not well-defined, for example, when lim supn an = and
lim supn bn = .
(b) Define
n = inf (an + bn ), n = inf ak , n = inf bk .
kn kn kn
SECTION 2.6
Exercise 2.6.3. Suppose A and B are compact subsets of R. Then, by Theorem 2.6.5, A and
B are closed and bounded. From Theorem 2.6.2(c) we get that A B is closed. Moreover, let
MA , mA , MB , mB be upper and lower bounds for A and B, respectively. Then M = max{MA , MB }
and m = min{mA , mB } are upper and lower bounds for A B. In particular, A B is bounded. We
have shown that A B is both closed and bounded. It now follows from Theorem 2.6.5 that A B is
compact.
SECTION 3.1
Exercise 3.1.6. (a) Observe that when x is near 1/2, f (x) is near 1/2 no matter whether x is rational
or irrational. We have
(
|x 1/2|, if x Q;
| f (x) 1/2| =
|1 x 1/2|, if x 6 Q.
lim f (x) = `,
x0
where ` is a real number. Choose a sequence {rn } of rational numbers that converges to 0, and choose
also a sequence {sn } of irrational numbers that converges to 0. Then f (rn ) = rn and f (sn ) = 1 sn
and, hence,
` = lim f (rn ) = 0
n
and
` = lim f (sn ) = lim (1 sn ) = 1.
n n
This is a contradiction.
(c) By a similar method to part (b), we can show that limx1 f (x) does not exists.
Solving this problem suggests a more general problem as follows. Given two polynomials P and
Q, define the function
(
P(x), if x Q;
f (x) =
Q(x), if x 6 Q.
If a is a solution of the equation P(x) = Q(x), i.e., P(a) = Q(a), then the limit limxa f (x) exists
and the limit is this common value. For all other points the limit does not exist.
152 Solutions and Hints for Selected Exercises
Similar problems:
1. Determine all a R at which limxa f (x) exists, where
(
x2 , if x Q;
f (x) =
x + 2, if x x 6 Q.
SECTION 3.2
Exercise 3.2.5. The given condition implies that if both x1 and x2 are close to x, then they are close
to each other and, hence, f (x1 ) and f (x2 ) are close to each other. This suggests the use of the Cauchy
criterion for limit to solve the problem. Given any > 0, choose = . If x1 , x2 D \ {x}
2(k + 1)
with |x1 x| < and |x2 x| < , then
| f (x1 ) f (x2 )| k|x1 x2 | k(|x1 x| + |x2 x|) < k( + ) = 2k < .
2(k + 1)
Therefore, limxx f (x) exists.
SECTION 3.3
Exercise 3.3.8. (a) Observe that f (a) = g(a) = h(a) and, hence,
(
|g(x) g(a)|, if x Q [0, 1];
| f (x) f (a)| =
|h(x) h(a)|, if x Qc [0, 1].
It follows that
| f (x) f (a)| |g(x) g(a)| + |h(x) h(a)| for all x [0, 1].
Exercise 3.3.9. At any irrational number a (0, 1], we have f (a) = 0. If x is near a and x is irrational,
it is obvious that f (x) = 0 is near f (a). In the case when x is near a and x is rational, f (x) = 1/q
where p, q N. We will see in part (a) that for any > 0, there is only a finite number of x (0, 1]
such that f (x) . So f (x) is close to f (a) for all x (0, 1] except for a finite number of x Q.
Since a is irrational, we can choose a sufficiently small neighborhood of a to void such x.
(a) For any > 0,
p 1 p 1
A = {x (0, 1] : f (x) } = x = Q : f (x) = = x = Q : q .
q q q
153
1 p
Clearly, the number of q N such that q is finite. Since 0 < q 1, we have p q. Therefore,
A is finite.
(b) Fix any irrational number a (0, 1]. Then f (a) = 0. Given any > 0, by part (a), the set A is
finite, so we can write
for some n N, where xi Q for all i = 1, . . . , n. Since a is irrational, we can choose > 0 such that
xi
/ (a , a + ) for all i = 1, . . . , n (more precisely, we can choose = min{|a xi | : i = 1, . . . , n}).
Then
| f (x) f (a)| = f (x) < whenever |x a| < .
Therefore, f is continuous at a.
Now fix any rational number b = qp (0, 1]. Then f (b) = 1q . Choose a sequence of irrational
numbers {sn } that converges to b. Since f (sn ) = 0 for all n N, the sequence { f (sn )} does not
converge to f (b). Therefore, f is not continuous at b.
In this problem, we consider the domain of f to be the interval (0, 1], but the conclusion remain valid
for other intervals. In particular, we can show that the function defined on R by
1 p
q , if x = q , p, q N, where p and q have no common factors;
f (x) = 1, if x = 0;
0, if x is irrational,
SECTION 3.4
Exercise 3.4.6. Let = min { f (x) : x [a, b]} and = max { f (x) : x [a, b]}. Then
Exercise 3.4.8. First consider the case where f is monotone decreasing on [0, 1]. By Exercise 3.4.5,
f has a fixed point in [0, 1], which means that there exists x0 [0, 1] such that
f (x0 ) = x0 .
Since f is monotone decreasing, f has a unique fixed point. Indeed, suppose that there exists
x1 [0, 1] such that f (x1 ) = x1 . If x1 < x0 , then x1 = f (x1 ) f (x0 ) = x0 , which yields a contradiction.
It is similar for the case where x1 > x0 . Therefore, x0 is the unique point in [0, 1] such that f (x0 ) = x0 .
Since f (g(x)) = g( f (x)) for all x [0, 1], we have
f (g(x0 )) = g( f (x0 )) = g(x0 ).
Thus, g(x0 ) is also a fixed point of f and, hence, g(x0 ) = x0 = f (x0 ). The proof is complete in this
case.
Consider the case where f is monotone increasing. In this case, f could have several fixed points on
[0, 1], so the previous argument does not work. However, by Exercise 3.4.5, there exists c [0, 1]
such that g(c) = c. Define the sequence {xn } as follows:
x1 = c,
xn+1 = f (xn ) for all n 1.
Since f is monotone increasing, {xn } is a monotone sequence. In fact, if x1 x2 , then {xn } is
monotone increasing; if x1 x2 , then {xn } is monotone decreasing. Since f is bounded, by the
monotone convergence theorem (Theorem 2.3.1), there exists x0 [0, 1] such that
lim xn = x0 .
n
Since f is continuous and xn+1 = f (xn ) for all n N, taking limits we have f (x0 ) = x0 .
We can prove by induction that g(xn ) = xn for all n N. Then
g(x0 ) = lim g(xn ) = lim xn = x0 .
n
SECTION 3.5
Exercise 3.5.2. (a) Let f : D R. From Theorem 3.5.3 we see that if there exist two sequences {xn }
and {yn } in D such that |xn yn | 0 as n , but {| f (xn ) f (yn )|} does not converge to 0, then
f is not uniformly continuous on D. Roughly speaking, in order for f to be uniformly continuous on
D, if x and y are close to each other, then f (x) and f (y) must be close to each other. The behavior
of the graph of the squaring function suggests the argument below to show that f (x) = x2 is not
uniformly continuous on R.
1
Define two sequences {xn } and {yn } as follows: xn = n and yn = n + for n N. Then
n
1
|xn yn | = 0 as n . However,
n
1 2
1
| f (xn ) f (yn )| = n + n2 = 2 + 2 2 for all n N .
n n
155
Exercise 3.5.7. Hint: For part (a) use Theorem 3.5.5. For part (b) prove that the function can be
extended to a continuous function on [a, b] and then use Theorem 3.5.5.
Exercise 3.5.8. (a) Applying the definition of limit, we find b > a such that
Since f is continuous on [a, b], it is bounded on this interval. Therefore, f is bounded on [a, ).
(b) Fix any > 0, by the definition of limit, we find b > a such that
| f (x) c| < whenever x > b.
2
Since f is continuous on [a, b + 1], it is uniformly continuous on this interval. Thus, there exists
0 < < 1 such that
| f (u) f (v)| < whenever |u v| < , u, v [a, c + 1].
2
Then we can show that | f (u) f (v)| < whenever |u v| < , u, v [a, ).
(c) Since limx f (x) = c > f (a), there exists b > a such that
Thus,
inf{ f (x) : x [a, )} = inf{ f (x) : x [a, b]}.
The conclusion follows from the Extreme Value Theorem for the function f on [a, b].
SECTION 3.6
Exercise 3.7.4. Since limx f (x) = limx f (x) = , there exists a > 0 such that
Since f is lower semicontinuous, by Theorem 3.7.3, it has an absolute minimum on [a, a] at some
point x [a, a]. Obviously,
f (x) f (x) for all x [a, a].
In particular, f (0) f (x). If |x| > a, then
SECTION 4.1
Exercise 4.1.10. Use the identity
!n
f (a + 1n ) 1
lim = lim exp(n[ln( f (a + )) ln( f (a)]).
n f (a) n n
Exercise 4.1.11. (a) Using the differentiability of sin x and Theorem 4.1.3, we conclude the function
is differentiable at any a 6= 0. So, we only need to show the differentiability of the function at a = 0.
By the definition of the derivative, consider the limit
which implies
Since limx0 (|x|) = limx0 |x| = 0, applying the squeeze theorem yields
lim x sin(1/x) = 0.
x0
Using Theorem 4.1.3 and the fact that cos x is the derivative of sin x, the derivative of f can be written
explicitly as
2x sin 1 cos(1/x) + c, if x 6= 0;
0
f (x) = x
c, if x = 0.
157
From the solution, it is important to see that the conclusion remains valid if we replace the
function f by
xn sin 1 , if x 6= 0;
g(x) = x
0, if x = 0,
where n 2, n N. Note that the function h(x) = cx does not play any role in the differentiability
of f .
We can generalize this problem as follows. Let be a bounded function on R, i.e., there is
M > 0 such that
|(x)| M for all x R .
Define the function
(
xn (1/x), if x 6= 0;
f (x) =
0, if x = 0,
and
( 2
x2 e1/x , if x 6= 0;
f (x) =
0, if x = 0.
Show that if n 2, the function is differentiable on R and find its derivative. Show that if n = 1 and
limx (x) does not exists, then f is not differentiable at 0.
(b) Hint: Observe that
0 1 0 1
f = 1 + c < 0 and f = 1 + c > 0.
2n (2n + 1)
SECTION 4.2
Exercise 4.2.1. Define the function
Exercise 4.2.3. The inequality holds obviously if a = b. In the case where a 6= b, the equality can be
rewritten as
sin(b) sin(a)
1.
ba
sin(b) sin(a)
The quotient is the slope of the line connecting (a, f (a)) and (b, f (b)). We need
ba
to show that the absolute value of the slope is always bounded by 1, which can also be seen from
the figure. The quotient also reminds us of applying the Mean Value Theorem for the function
f (x) = sin(x).
Consider the case where a < b and define the function f : [a, b] R by f (x) = sin(x). Clearly,
the function satisfies all assumptions of the Mean Value Theorem on this interval with f 0 (x) = cos(x)
for all x (a, b).
By the Mean Value Theorem, there exists c (a, b) such that
f (b) f (a)
= f 0 (c) = cos(c),
ba
which implies
f (b) f (a)
b a = | cos(c)| 1.
It follows that | f (a) f (b)| |a b|. The solution is similar for the case where a > b.
It is essential to realize that the most important property required in solving this problem is the
boundedness of the derivative of the function. Thus, it is possible to solve the following problems
159
Exercise 4.2.8. (a) Given > 0, first find x0 large enough so that a /2 < f 0 (x) < a + /2 for
x > x0 . Then use the identity
f (x) f (x0 ) f (x0 )
f (x) f (x) f (x0 ) + f (x0 ) xx0 + xx 0
= = x0 ,
x x x0 + x0 1 + xx 0
SECTION 4.3
Exercise 4.3.2. (a) We can prove that f is uniformly continuous on R by definition. Given any > 0,
1
choose = and get
`+1
| f (u) f (v)| `|u v| < ` = ` <
`+1
whenever |u v| < . Note that we use ` + 1 here instead of ` to avoid the case where ` = 0.
(b) We will prove that f is a constant function by showing that it is differentiable on R and f 0 (a) = 0
for all a R. Fix any a R. Then, for x 6= a,
f (x) f (a) `|x a| 1
x a |x a| = `|x a| .
f (x) f (a)
lim = 0.
xa xa
Then h0 (x) = g0 (x) f 0 (x) 0 for all x [x0 , ). Thus, h is monotone increasing on this interval. It
follows that
h(x) h(x0 ) = g(x0 ) f (x0 ) = 0 for all x x0 .
Therefore, g(x) f (x) for all x x0 .
SECTION 4.4
Exercise 4.4.5. Suppose that
P(x) = a0 + a1 x + + an xn .
Then apply LHospitals rule repeatedly.
161
Exercise 4.4.6. We first consider the case where n = 1 to get ideas for solving this problem in the
general case. From the standard derivative theorems we get that the function is differentiable at any
x 6= 0 with
1 2 1
f 0 (x) = 2x3 e x2 = 3 e x2 .
x
Consider the limit 1
f (x) f (0) e x 2
lim = lim .
x0 x0 x0 x
Letting t = 1/x and applying LHospital rule yields
1
e x 2 t 1
lim+ = lim t 2 = lim = 0.
x0 x t e t 2tet 2
Similarly,
1
e x2
lim = 0.
x0 x
It follows that f is differentiable on R with
2 e x12 , if x 6= 0;
0
f (x) = x3
0, if x = 0.
Based on these calculations, we predict that f is n times differentiable for every n N with
P 1 e x12 , if x 6= 0;
(n) x
f (x) =
0, if x = 0,
where P is a polynomial. Now we proceed to prove this conclusion by induction. The conclusion is
true for n = 1 as shown above. Given that the conclusion is true for some n N, for x 6= 0 we have
(n+1) 2 0 1 2 1 12 1 12
f (x) = x P + 3P e x =Q e x ,
x x x x
where Q is also a polynomial. It is an easy exercise to write the explicit formula of Q based on P.
Moreover, successive applications of lHpitals rule give
Therefore, f (n+1) (0) = 0. We have proved that for every n N, f is n times differentiable and, so,
f Cn (R). Here we do not need to prove the continuity of f (n) because the differentiability of f (n)
implies its continuity.
In a similar way, we can also show that the function
( 1
e x , if x > 0;
f (x) =
0, if x 0
SECTION 4.5
Exercise 4.5.1. Let f (x) = ex . By Taylors theorem, for any x > 0, there exists c (0, x) such that
m
f (k) (0) k f (m+1) (c) m+1
f (x) = ex = x + c
k=0 k! (m + 1)!
m k m
x ec xk
= k! + (m + 1)! cm+1 > k! .
k=0 k=0
Exercise 4.5.5. (a) Observe that a simpler version of this problem can be stated as follows: If f is
differentiable on (a, b) and x (a, b), then
f (x + h) f (x) f 0 (x)
lim = .
h0 h 1!
This conclusion follows directly from the definition of derivative.
Similarly, if f is twice differentiable on (a, b) and x (a, b), then
SECTION 4.6
Exercise 4.6.3. We apply the definition to solve this problem. Given any u, v I and (0, 1), we
have
f ( u + (1 )v) f (u) + (1 ) f (v)
by the convexity of f .
Since f (u), f (v) J and J is an interval, f (u) + (1 ) f (v) J. By the nondecreasing
property and the convexity of ,
( f ( u + (1 )v)) ( f (u) + (1 ) f (v)) ( f (u)) + (1 ) ( f (v)).
Therefore, f is convex on I.
The result from this problem allows us to generate convex functions. For example, consider
f (x) = |x| and (x) = x p , p > 1. We have seen that f is convex on R. The function is convex
and increasing on [0, ) which contains the range of the function f . Therefore, the composition
2
g(x) = |x| p , p > 1, is convex on R. Similarly, h(x) = ex is also a convex function on R.
Observe that in this problem, we require the nondecreasing property of . A natural question is
whether the composition of two convex functions is convex. The answer is negative. Observe that
f (x) = x2 and (x) = |x 1| are convex, but ( f )(x) = |x2 1| is nonconvex.
SECTION 4.7
Exercise 4.7.1. (a) By Theorem 4.7.5,
{a}, if x < 0;
f (x) = [a, a], if x = 0;
{a}, if x > 0.
164 Solutions and Hints for Selected Exercises
Exercise 4.7.3. To better understand the problem, we consider some special cases. If n = 1, then
f (x) = |x 1|. Obviously, f has an absolute minimum at x = 1. If n = 2, then f (x) = |x 1| + |x 2|.
The graphing of the function suggests that f has an absolute minimum at any x [1, 2]. In the case
where n = 3, we can see that f has an absolute minimum at x = 2. We then conjecture that if n is
odd with n = 2m 1, then f has an absolute minimum at x = m. If n is even with n = 2m, then f
has an absolute minimum at any point x [m, m + 1].
Let us prove the first conclusion. In this case,
2m1 2m1
f (x) = |x i| = fi (x),
i=1 i=1
The subdifferential sum rule yields f (x) = [1, 1] which contains 0. Thus, f has an absolute
minimum at x. If x > m, we can see that f (x) (0, ), which does not contain 0. Similarly, if
x < m, then f (x) (, 0). Therefore, f has an absolute minimum at the only point x = m.
The case where n is even can be treated similarly.
Exercise 4.7.5. Fix a, b R with a < b. By Theorem 4.7.9, there exists c (a, b) such that
f (b) f (a)
f (c) [0, ).
ba
This implies f (b) f (a) 0 and, hence, f (b) f (a). Therefore, f is monotone increasing on R,
Bibliography
[Die69] J. Dieudonn. Foundation of Modern Analysis. Second Edition. Academic Press, Inc.,
1969.
[Fit09] P. M. Fitzpatrick. Advanced Calculus. Second Edition. American Mathematical Society,
2009.
[FJ96] J. Franchini and J.-C. Jacquens. Analyse I. Ellipses, Paris, 1996.
[HUL01] J.-B. Hiriart-Urruty and C. Lemarchal. Fundamentals of Convex Analysis. Springer,
2001.
[Lay13] S. R. Lay. Analysis with an Introduction to Proof. Fifth Edition. Pearson, 2013.
[Mun00] J. Munkres. Topology. Prentice Hall, 2000.
[Ros68] M. Rosenlicht. Introduction to Analysis. Dover Publications, 1968.
[Rud76] W. Rudin. Principles of Mathematical Analysis. Third Edition. McGraw-Hill, 1976.
[Sto00] M. Stoll. Introduction to Real Analysis. Second Edition. Pearson, 2000.
[Wad09] W. R. Wade. An Introduction to Analysis. Fourth Edition. Prentice Hall, 2009.