MATH1131-1141-Calculus-Notes-2020T1 (2020 - 10 - 19 10 - 35 - 29 UTC)
MATH1131-1141-Calculus-Notes-2020T1 (2020 - 10 - 19 10 - 35 - 29 UTC)
and
CALCULUS NOTES
Preface
Note.
This version of the Calculus Notes has been prepared by Robert Taggart and Peter Brown. They
build on notes first developed by Tony Dooley and subsequently edited by several members of
the School of Mathematics and Statistics. The main editors include Mike Banner, Ian Doust and
V. Jeyakumar.
c
Copyright is vested in The University of New South Wales,
2020.
iv
c
2020 School of Mathematics and Statistics, UNSW Sydney
v
Contents
Preface iii
Calculus Syllabus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
2 Limits 27
2.1 Limits of functions at infinity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
2.1.1 Basic rules for limits . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
2.1.2 The pinching theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
2.1.3 Limits of the form f (x)/g(x) . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
p p
2.1.4 Limits of the form f (x) − g(x) . . . . . . . . . . . . . . . . . . . . . . . . 30
2.1.5 Indeterminate forms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
2.2 The definition of lim f (x) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
x→∞
2.3 Proving that lim f (x) = L using the limit definition . . . . . . . . . . . . . . . . . . 34
x→∞
2.4 Proofs of basic limit results (MATH1141 only) . . . . . . . . . . . . . . . . . . . . . 37
2.5 Limits of functions at a point . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
2.5.1 Left-hand, right-hand and two-sided limits . . . . . . . . . . . . . . . . . . . . 38
2.5.2 Limits and continuous functions . . . . . . . . . . . . . . . . . . . . . . . . . 40
2.5.3 Rules for limits at a point . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
2.6 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
c
2020 School of Mathematics and Statistics, UNSW Sydney
vi
4 Differentiable functions 63
4.1 Gradients of tangents and derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
4.2 Rules for differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
4.3 Proofs of results in Section 4.2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
4.4 Implicit differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
4.5 Differentiation, continuity and split functions . . . . . . . . . . . . . . . . . . . . . . 74
4.6 Derivatives and function approximation . . . . . . . . . . . . . . . . . . . . . . . . . 77
4.7 Derivatives and rates of change . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
4.8 Local maximum, local minimum and stationary points . . . . . . . . . . . . . . . . . 79
4.9 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
c
2020 School of Mathematics and Statistics, UNSW Sydney
vii
8 Integration 157
8.1 Area and the Riemann Integral . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157
8.1.1 Area of regions with curved boundaries . . . . . . . . . . . . . . . . . . . . . 157
8.1.2 Approximations of area using Riemann sums . . . . . . . . . . . . . . . . . . 160
8.1.3 The definition of area under the graph of a function and the Riemann integral163
8.2 Integration using Riemann sums . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 165
8.3 The Riemann integral and signed area . . . . . . . . . . . . . . . . . . . . . . . . . . 168
8.4 Basic properties of the Riemann integral . . . . . . . . . . . . . . . . . . . . . . . . . 169
8.5 The first fundamental theorem of calculus . . . . . . . . . . . . . . . . . . . . . . . . 171
8.6 The second fundamental theorem of calculus . . . . . . . . . . . . . . . . . . . . . . 175
8.7 Indefinite integrals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 178
8.8 Integration by substitution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180
8.9 Integration by parts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185
8.10 Improper integrals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 186
8.11 Comparison tests for improper integrals . . . . . . . . . . . . . . . . . . . . . . . . . 189
8.12 Functions defined by an integral . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 196
8.13 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 199
c
2020 School of Mathematics and Statistics, UNSW Sydney
viii
Index 252
c
2020 School of Mathematics and Statistics, UNSW Sydney
ix
CALCULUS SYLLABUS
The calculus course for both MATH1131 and MATH1141 is based on these MATH1131/MATH1141
Calculus Notes that are included in the Course Pack. A detailed syllabus and lecture schedule will
be posted on Moodle.
The computer package Maple will be used in the calculus course. An introduction to Maple is
included in the booklet titled First Year Maple Notes.
The Calculus problems are located at the end of each chapter of the Calculus Notes booklet. They
are also available from the course module on the UNSW Moodle server. Some of the problems
are very easy, some are less easy but still routine and some are quite hard. To help you decide
which problems to try first, each problem is marked with an [R], an [H] or an [X]. The problems
marked [R] form a basic set of problems which you should try first. Problems marked [H] are
harder and can be left until you have done the problems marked [R]. Problems marked [V] have a
video solution available on Moodle.
You do need to make an attempt at the [H] problems because problems of this type will occur
on tests and in the exam. If you have difficulty with the [H] problems, ask for help in your tutorial.
The problems marked [X] are intended for students in MATH1141 – they relate to topics which
are only covered in MATH1141. Extra problem sheets for MATH1141 may be issued in lectures.
Remember that working through a wide range of problems is the key to success in mathematics.
However, solving problems and writing mathematics clearly are two separate skills that need to be
developed through practice. We recommend that you keep a workbook to practice writing solutions
to mathematical problems.
c
2020 School of Mathematics and Statistics, UNSW Sydney
x
c
2020 School of Mathematics and Statistics, UNSW Sydney
1
Chapter 1
In the days of the Roman empire, the word ‘calculus’ denoted a pebble that was used for counting
and gambling. As time progressed, the word ‘calculare’ came to mean ‘to compute.’ In the second
half of the seventeenth century, two mathematicians, the Englishman Isaac Newton and the German
Gottfried Leibniz, independently invented methods for
• calculating gradients of tangents to curves,
c
2020 School of Mathematics and Statistics, UNSW Sydney
2 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
A set is a collection of distinct objects. The objects in a set are called the elements or members of
the set. Some commonly used sets of numbers are listed below.
N = {0, 1, 2, 3, 4, . . .}.
• The set Q of rational numbers is the collection of all numbers of the form pq , where p and q
are integers and q 6= 0.
• The set R of real numbers may be represented as the collection of points lying on the number
line.
| | | | | | |
−3 −2 −1 0 1 2 3
If A is a set of numbers and the number x is a member of the set A, then we write
x ∈ A.
−3 −2 −1 0 1 2 3
c
2020 School of Mathematics and Statistics, UNSW Sydney
1.1. SETS OF NUMBERS 3
Sets that are represented by intervals of the real line occur so frequently that we introduce
a special notation for them. Suppose that a and b are real numbers and that a < b. Then the
intervals (a, b), [a, b], (a, b] and [a, b) are given by
a b
• [a, b] = {x ∈ R : a ≤ x ≤ b}
a b
• [a, b) = {x ∈ R : a ≤ x < b}
a b
a b
In each case, the numbers a and b are called the endpoints of the interval. An interval [a, b] that
includes its endpoints is called a closed interval, while an interval (a, b) that excludes its endpoints
is called an open interval. The intervals [a, b) and (a, b] are neither open nor closed.
Interval notation can be extended to describe rays of the real line by using the symbol ∞ for
infinity. Thus we have
• [a, ∞) = {x ∈ R : a ≤ x}
|
a b
• (−∞, b) = {x ∈ R : x < b}
|
a b
• (−∞, ∞) = R.
Definition 1.1.3. Suppose that A and B are two sets. We say that A is a subset
of B if x ∈ A implies that x ∈ B. If A is a subset of B then we also say that B
contains the set A.
c
2020 School of Mathematics and Statistics, UNSW Sydney
4 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
• N is a subset of Z,
• R is a subset of R, and
x2 + x − 6 > 0.
The values of x for which x2 + x − 6 is positive are the same as those for which the graph of
y = x2 + x − 6
y = (x − 2)(x + 3).
Hence y = 0 when x = 2 and when x = −3. We can now easily sketch its graph.
y
y = x2 + x − 6
−3 2 x
c
2020 School of Mathematics and Statistics, UNSW Sydney
1.2. SOLVING INEQUALITIES 5
Since the graph lies above the x-axis when x < −3 and when x > 2, the solution to the inequality
is
x < −3 or x > 2.
1
(x − 1)2 < x − 1.
3
Multiplication by 3 and rearrangement gives
So proceding in a manner similar to the previous example, we seek to identify the values of x for
which the graph of
y = (x − 1)(x − 4)
lies below the x-axis.
y
y = (x − 1)(x − 4)
1 4 x
1 < x < 4.
The next example is slightly more difficult, but uses techniques illustrated in the previous two
examples.
c
2020 School of Mathematics and Statistics, UNSW Sydney
6 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
y = (x − 1)(x − 2)(x + 3)
−3 1 2 x
Remark 1.2.4. A variety of methods (some good and others bad) exist for solving inequalities
where an unknown lies in the denominator. It is strongly recommended that students use the
method shown in this section, or that shown by their first year calculus lecturer.
c
2020 School of Mathematics and Statistics, UNSW Sydney
1.3. ABSOLUTE VALUES 7
(F2) If a and x are real numbers then |x − a| is equal to the distance between x and a on the real
number line.
and
|y| > a if and only if y < −a or y > a.
These facts give different methods for solving inequalities involving absolute values.
Example 1.3.2. Solve the following inequalities:
(a) |x − 3| ≤ 5
|x − 3| ≤ 5
−5 ≤ x − 3 ≤ 5
−2 ≤ x ≤ 8.
c
2020 School of Mathematics and Statistics, UNSW Sydney
8 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
The points x on the number line that have this property are shown below.
5 5
|
−2 3 8
Hence −2 ≤ x ≤ 8.
(b) An algebraic solution. Using (F3) the inequality is equivalent to
2x + 5 < −8 or 2x + 5 > 8.
Hence
2x < −13 or 2x > 3,
|x − 3| < |x − 1|.
|x − 3| < |x − 1|, x 6= 1.
In other words, x is closer to 3 than to 1. Points that have this property are shown on the numberline
below.
| |
1 2 3
Hence x > 2.
c
2020 School of Mathematics and Statistics, UNSW Sydney
1.4. FUNCTIONS 9
1.4 Functions
(Ref: SH10 §1.5)
A function f : A → B is a rule which assigns to every element x belonging to a set A exactly one
element f (x) belonging to a set B. The set A is called the domain of the function f and the set B
is called the codomain of f . In this course, A and B are always sets of real numbers.
for all x in [0, ∞). Informally, this means that f takes a number x from [0, ∞) as input and gives
√ √
the number x as output. More formally, we say that f maps an element x of [0, ∞) to x. We
offer a few comments on terminology employed to describe f .
• The expression f : [0, ∞) → R says that [0, ∞) is the domain of f and that R is the codomain
of f . We write
Dom(f ) = [0, ∞) and Codom(f ) = R.
• The codomain of f is a set that contains all the output values of the function. Since the
output values are all real numbers, we say that f is a real-valued function. While all the
outputs must lie in Codom(f ), not every number in Codom(f ) need be an output value.
• The expression f (x) (read as ‘f of x’) is the value of f at the point x. That is, f (x) is the
unique number in R that corresponds to the input x. To emphasise the point: f (x) is a real
number, not a function.
• The statement
√
‘f (x) = x for all x in [0, ∞)’
can be abbreviated as
√
f (x) = x ∀x ∈ [0, ∞).
In high school, functions are often described by specifying a rule without specifying a domain.
For example, the following sentence was taken from an HSC study guide:
There are two problems with this sentence. First, neither f (x) nor f (x) = x2 is a function. As
mentioned above, f (x) is a real number, and f (x) = x2 is an equation. (This may seem picky,
but being able to distinguish between a function f and its value f (x) at the point x is crucial to
understanding key concepts in both algebra and calculus.) Second, the quoted sentence does not
specify the domain of the function. The next example gives two different functions that obey the
same squaring rule.
c
2020 School of Mathematics and Statistics, UNSW Sydney
10 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
x x
If, for whatever reason, the domain of a function is not specified, but the function rule is, then
the default domain, known as the maximal or natural domain, is the largest possible domain for
which the rule makes sense. We give two examples.
• If √
f (x) = x−1
then the rule makes sense only if x − 1 ≥ 0, that is, if x ≥ 1. Therefore the maximal domain
of the function f is [1, ∞).
• If
2
f (x) =
(x − 2)(x − 8)
then the rule only makes sense if x 6= 2 or x 6= 8. So the maximal domain is the set
{x ∈ R : x 6= 2, x 6= 8}.
In other words, the range of f is the set of all output values. Equivalently, if the rule of a
function is described by the equation y = f (x), then the range of f is the set of all corresponding
y-values. The range is always a subset of the codomain, but the range need not equal the codomain.
The difference between range and codomain is illustrated by the following examples.
Example 1.4.4. Suppose that f : R → R is given by the rule
f (x) = sin x ∀x ∈ R.
Then Dom(f ) = R and Codom(f ) = R (this is obtained from the expression f : R → R). To
determine the range, consider the graph of f is sketched below.
c
2020 School of Mathematics and Statistics, UNSW Sydney
1.4. FUNCTIONS 11
f (x)
−1
The set of all output values (equivalently, the set of all corresponding y-values) is [−1, 1]. Hence
Range(f ) = [−1, 1].
Then Dom(f ) = [−1, 1] and Codom(f ) = R (again, we are simply reading this from f : [−1, 1] →
R). To determine the range, we observe that the graph of f is a semicircle.
f (x)
−1 1 x
The set of all outputs (or y-values) is [0, 1]. Hence Range(f ) = [0, 1].
If f and g are two functions with the same domain, then one can combine f and g to form new
functions.
c
2020 School of Mathematics and Statistics, UNSW Sydney
12 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
The final example illustrates the importance of bearing in mind domain and range when com-
posing two functions.
Example 1.4.9. Suppose that f : [0, ∞) → R and g : R → R are given by
√
f (x) = x ∀x ∈ [0, ∞) and g(x) = sin x − 2 ∀x ∈ R.
Find, if they exist, (f ◦ g)(x) and (g ◦ f )(x).
Solution. Consider first f ◦ g. For this function to exist, we require that Range(g) is a subset of
Dom(f ). Now
−1 ≤ sin x ≤ 1 ∀x ∈ R
so
−3 ≤ sin x − 2 ≤ −1 ∀x ∈ R;
that is, Range(g) = [−3, −1]. Since Dom(f ) = [0, ∞), Range(g) is not a subset of Dom(f ) and
hence f ◦ g does not exist. (The point here is that, if
√
(f ◦ g)(x) = f (g(x)) = sin x − 2,
then the expression under the square root sign is always negative.)
Now consider g ◦ f . Since f is a real-valued function and Dom(g) = R, it is clear that Range(f )
is a subset of Dom(g). (The fact that Range(f ) = [0, ∞) is not needed because Dom(g) is so large.)
Hence √
(g ◦ f )(x) = g(f (x)) = sin( x) − 2 ∀x ∈ Dom(f ) = [0, ∞).
c
2020 School of Mathematics and Statistics, UNSW Sydney
1.5. POLYNOMIALS AND RATIONAL FUNCTIONS 13
• If n = 1 then f (x) = a1 x + a0 . The graph of f is a straight line with gradient a1 and y-axis
intercept a0 .
The general shape of the graph of a polynomial function of degree greater than 1 is indicated
by the table below.
Odd degree
Even degree
The number of turning points in the graph is always strictly less than the degree of the polynomial.
Exactly how many turning points the graph has depends on the coefficients of the polynomial, and
can be determined using calculus (see, for example, Chapters 4 and 5).
By dividing two polynomials, one obtains a new type of function.
c
2020 School of Mathematics and Statistics, UNSW Sydney
14 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
x x3 − 4x2 + 1
f (x) = 2
and f (x) =
x +2 x−2
both give rise to rational functions, as does
3
f (x) = x − 1 +
x2 + 3
(to see why, rewrite right-hand side with a common denominator of x2 + 3).
The graphs of two simple rational functions are shown below.
y y
1 1
y= x y= x2
x x
c
2020 School of Mathematics and Statistics, UNSW Sydney
1.6. THE TRIGONOMETRIC FUNCTIONS 15
Consider an angle θ and the corresponding point P (x, y) that lies on the unit circle centred at
the origin, as shown below.
P (x, y)
y
θ 1
x
Other trig functions are defined in terms of the sine and cosine functions:
sin θ
tan θ = , provided that cos θ 6= 0
cos θ
1
sec θ = , provided that cos θ 6= 0
cos θ
1
cosec θ = , provided that sin θ 6= 0
sin θ
cos θ
cot θ = , provided that sin θ 6= 0.
sin θ
The graphs of sin, cos and tan are shown below.
sin θ
1
|
| | | | |
−2π π π 3π 2π θ
2 2
−1
|
cos θ
1
|
| | | | |
−2π π π 3π 2π θ
2 2
−1
|
c
2020 School of Mathematics and Statistics, UNSW Sydney
16 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
tan θ
|
| | | | |
−2π π π 3π 2π θ
−1 2 2
|
The sine and cosine functions are 2π-periodic, while the tangent function is π-periodic, which
means that
whenever θ is in the domain of the respective function. Sine and cosine both have an amplitude of
1.
It is not hard to obtain functions whose graphs have the same general shape as the sine (or
cosine) curve with a period other than 2π and amplitude other than 1. For example, if A and ω
are positive real numbers then the function f : R → R, given by
f (x) = A sin(ωx) ∀x ∈ R,
2π
has an amplitude of A and a period of ω . Its graph is shown below.
f (x)
A
|
| | | | |
− 2π 2π x
ω ω
−A
|
The six trigonometric functions are related to one another by various identities and formulae:
• complementary identities
π
sin − x = cos x
2
π
cos − x = sin x
2
• Pythagorean identities
cos2 x + sin2 x = 1
1 + tan2 x = sec2 x
cot2 x + 1 = cosec2 x
c
2020 School of Mathematics and Statistics, UNSW Sydney
1.7. THE ELEMENTARY FUNCTIONS 17
ln x
(e, 1)
b
b
(1, e) 1
x
1
x
√ √
x 3
x
b b
(1, 1) (1, 1)
x (−1, −1) x
b
c
2020 School of Mathematics and Statistics, UNSW Sydney
18 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
The elementary functions are all those functions that can be constructed by combining a finite
number of polynomials, exponentials, logarithms, roots and trigonometric functions (including the
inverse trigonometric functions) via function composition, addition, subtraction, multiplication and
division. Hence the following expressions give rise to elementary functions:
f (x) = esin x + x2
ln x − tan x
g(x) = √
x
p3
h(x) = x4 − 2x2 + 5.
abs : R → R
abs(x) = |x| ∀x ∈ R,
The graph of |f | is obtained by reflecting in the x-axis any part of the graph of f that lies below
the x-axis. This is illustrated below.
f (x) |f (x)|
x x
Most, if not all functions studied in high school were elementary functions. In this course we
shall meet some useful functions that are not elementary functions. (These are defined by integrals;
see Section 8.12.)
The set of points (x, y) satisfying this equation are shown on the graph below.
c
2020 School of Mathematics and Statistics, UNSW Sydney
1.8. IMPLICITLY DEFINED FUNCTIONS 19
y
Note that the curve cannot be described by one function since some x-values have two corresponding
y-values. However, the curve can be described by combining the graphs of two functions f :
[−a, a] → R and g : (−a, a) → R, which are defined by the rules
y = f (x)
(x2 + y 2 − 1)3 − x2 y 3 = 0
y≥b
and
y = g(x)
(x2 + y 2 − 1)3 − x2 y 3 = 0
y<b
y y
b y = f (x) b y = g(x)
|
| | | |
−a a x −a a x
Example 1.8.1. The set of points (x, y) that satisfy the equation
x2 + y 2 = 1
describes a circle of radius 1 centred at the origin. The above equation could be used to implicitly
define many different functions. We give two examples.
c
2020 School of Mathematics and Statistics, UNSW Sydney
20 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
1 y = f (x)
1 x
−1 1
x
y = g(x)
−1
Many heavenly bodies (such as planets and comets) trace out paths that are roughly elliptic,
parabolic or hyperbolic. (The fact that planets trace out elliptic paths around the sun was discov-
ered by Johannes Kepler at the beginning of the sixteenth century, and was later used by Newton, in
conjunction with calculus, to establish his universal law of gravitation.) Circles, ellipses, parabolas
and hyperbolas are known as conic sections because they can be obtained by intersecting a cone
with a plane. Equations that describe conic sections, accompanied by corresponding diagrams, are
shown below.
y y
r b
−r r x −a a x
−r −b
Circle: x2 + y 2 = r 2 x2 y 2
Ellipse: + 2 =1
a2 b
c
2020 School of Mathematics and Statistics, UNSW Sydney
1.9. CONTINUOUS FUNCTIONS 21
y y = ab x y
b y = ab x
−a a
x x
−b
x2 y 2 y 2 x2
Hyperbola: − 2 =1 Hyperbola: − 2 =1
a2 b b2 a
tan x
| | | |
−π − π2 π π x
2
The break in the graph when, say, x = −π/2 or x = π/2, is a consequence of a break in the domain
of the function rather than a discontinuity of the function. Similarly, the trigonometric functions
sec, cosec and cot are continuous, as are the rational functions. The notion of continuity shall be
discussed in more precise terms in Chapters 2 and 3.
Intuitively, a function f has a discontinuity at a point a if a ∈ Dom(f ) and there is a break in
the graph of f at a. It is important to realise that there are different types of discontinuities. We
compare three examples.
c
2020 School of Mathematics and Statistics, UNSW Sydney
22 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
f (x) g(x) h(x)
b
bc bc
b b
| | |
a x a x a x
• For f , the discontinuity can be easily removed by redefining the value of f (a) to ‘plug up the
hole’ at a. This is an example of a removable discontinuity.
• For g, the discontinuity is a result of a finite jump in the graph of g at the point a. The
discontinuity for g, which is an example of a jump discontinuity, is worse than that for f but
not nearly as bad as that for h.
• For h, the discontinuity is a result of rapid oscillation on the right-hand side of a and an
infinite jump on the left-hand side of a. This is an example of an essential discontinuity.
While students are not expected to learn or use the terms removable, jump and essential disconti-
nuity, it is expected that they will appreciate that some discontinuities are worse than others.
Many chapters in these notes conclude with a summary of relevant Maple commands. Other sources
of information are the First Year Computing Notes and the online Maple help command. While
Maple can relieve you of the tedium of calculations, it is not a substitute for understanding the
mathematics behind the calculations, which is what you need in order to interpret Maple output
intelligently.
The following Maple command is relevant to the material of this chapter:
plot(f(x),x=a..b); draws a (two-dimensional) plot of the graph of y = f (x) for
a ≤ x ≤ b. To find out about the many options available, use ?plot. Implicitly defined
functions can be plotted using the smartplot command. For example,
> f:=x^5-2*x^4+3*x^3-6*x^2-4*x+8:
> g:=4*x^4-8*x^3-12*x^2+16*x+16:
c
2020 School of Mathematics and Statistics, UNSW Sydney
1.10. MAPLE NOTES 23
> # use the Maple plot command implicitplot --- need more plotting tools
> with(plots):
> implicitplot(y^5+x^2*y^3+16,x=-10..10,y=-2..0);
c
2020 School of Mathematics and Statistics, UNSW Sydney
24 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
Questions marked with [R] are routine, with [H] are harder and with [X] are for MATH1141 only.
Questions marked with [V] have a video solution available from Moodle. You should make sure
that you can do the easier questions before you tackle the more difficult questions.
a) {x ∈ Z : −π < x < π}
b) {x ∈ R : x2 − x − 1 < 0}
c) {x ∈ Q : x2 = 2}
3. [R] Sketch the set of points (x, y) which satisfy the following relations.
6. [R] [V]
a) By expanding (x − y)2 , prove that x2 + y 2 ≥ 2xy for all real numbers x and y.
a+b √
b) Deduce that ≥ ab for all non-negative real numbers a and b. When does
2
equality hold?
1
c) Use the result above to find the minimum value of y = x2 + 2 .
x
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 1 25
1 1 4
8. [H] Prove that (x + y)2 ≥ 4xy and hence deduce that + ≥ 2 .
x2 y 2 x + y2
9. [H] [V]
10. [R] Determine the (maximal) domain and corresponding range for each function f de-
scribed below.
√ √
a) f (x) = 5 − x2 b) f (x) = x2 − 5
√
c) f (x) = (x − 8)−1/3 d) f (x) = x − 1
1 √
e) f (x) = √ f) f (x) = sin x
x−1
√
g) f (x) = 1 − 2 sin x h) f (x) = 1 + tan2 x
cos
√
x if x < 0
i) f (x) = 1 − x if 0 ≤ x ≤ 1
|x| if x > 1
1
12. [R] If f (x) = x − 1 and g(x) = √ , then give the explicit forms of
x−1
f
a) (f + g)(x) b) (f g)(x) c) (x) d) (f ◦ g)(x).
g
13. [R] Draw neat sketches (preferably without using calculus) of the graphs given by the
following equations.
c
2020 School of Mathematics and Statistics, UNSW Sydney
26 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
4
a) y = x2 − 5x + 6 b) y = 2x3 − 16 c) y =
x−3
1
d) y = 2ex−1 e) y = f) y = 3 sin 2x
x2 +4
√
g) y = x − 1
14. [R]
√
a) Sketch the graph of y = x + 1 and use your graph to sketch (on the same diagram)
1
y=√ .
x+1
b) Repeat for y = x2 − 4x + 3.
15. [R] Sketch the graph of y = x2 − 7x − 8 and hence sketch the graph of y = |x2 − 7x − 8|.
18. [R]
c
2020 School of Mathematics and Statistics, UNSW Sydney
27
Chapter 2
Limits
In many situations, it is desirable to know what the long term state of a system will be. For
example, suppose that an initially unpolluted lake containing 109 litres of water has a river flowing
through it at a rate of 106 litres per day. A factory is built next to the lake and discharges 104
litres of pollution per day. By making some simple assumptions, it can be shown that the amount
P (t) of pollution in the lake after t days of the factory’s operation is given by
109 5
P (t) = 1 − e−101t/10 . (2.1)
101
Environmental authorities want to know what the long term level of the pollution will be. To see
whether the pollution in the lake eventually stabilises, and what the level of pollution will be, one
studies the behaviour of P (t) as t → ∞.
Understanding the limiting behaviour of functions at infinity is important for practical and
mathematical reasons. That is what we study in the next four sections.
lim f (x) = L.
x→∞
• If f (x) gets arbitrarily large (that is, approaches ∞) as x tends to ∞ then we write
f (x) → ∞ as x → ∞.
We do not write lim f (x) = ∞ since ∞ is not a real number. (The lim notation is only
x→∞
used when the limit exists and is a real number.)
c
2020 School of Mathematics and Statistics, UNSW Sydney
28 CHAPTER 2. LIMITS
• If f (x) → ∞ as x → ∞ then
1
lim = 0.
x→∞ f (x)
This is intuitively obvious and gives limits such as
1 5
lim = 0, lim = 0, lim e−x = 0
x→∞ x x→∞ 4x6 x→∞
1
(for the last example, recall that e−x = ex ).
The following proposition shows that the arithmetic of limits behaves nicely in many situations.
Proposition 2.1.1. Suppose that lim f (x) and lim g(x) exist and are finite real numbers. Then
x→∞ x→∞
Part of this proposition will be proved in Section 2.4. For now, we demonstrate how the
proposition is used to calculate limits.
if the limit exists. Justify each step of your working with reference to Proposition 2.1.1.
Solution.
In practice, you may use the rules of Proposition 2.1.1 without referring to them, and leave out
some of the steps shown above.
c
2020 School of Mathematics and Statistics, UNSW Sydney
2.1. LIMITS OF FUNCTIONS AT INFINITY 29
L
b
|
x
y = f (x)
Theorem 2.1.3 (The pinching theorem). Suppose that f , g and h are all defined on the interval
(b, ∞), where b ∈ R. If
f (x) ≤ g(x) ≤ h(x) ∀x ∈ (b, ∞)
and
lim f (x) = lim h(x) = L
x→∞ x→∞
then
lim g(x) = L.
x→∞
One can modify the theorem to cover the case when x → −∞.
sin x
Example 2.1.4. Use the pinching theorem to find lim .
x→∞ x
−1 ≤ sin x ≤ 1,
which is valid for every real number x. If we restrict x to the interval (0, ∞) then we have
−1 sin x 1
≤ ≤ .
x x x
Now
1 1
lim − = lim = 0,
x→∞ x x→∞ x
and so
sin x
lim =0
x→∞ x
by the pinching theorem.
c
2020 School of Mathematics and Statistics, UNSW Sydney
30 CHAPTER 2. LIMITS
where both f (x) and g(x) tend to infinity as x → ∞. In this situation, we cannot apply the rules of
Proposition 2.1.1 directly because the numerator and denominator don’t have finite limits. Instead,
we find an equivalent form of fg(x)
(x)
for which Proposition 2.1.1 can be applied. The key idea is to
divide both f and g by the fastest growing term appearing in the denominator g.
4x2 − 5
Example 2.1.5. Evaluate lim , if it exists.
x→∞ 2x2 + 3x
Solution. There are two terms appearing in the denominator: 2x2 and 3x. As x → ∞, the term
which grows fastest is the one involving x2 . So we divide the numerator and denominator by x2 to
evaluate the limit:
4x2 − 5 4 − 5/x2
=
2x2 + 3x 2 + 3/x
4−0
→
2+0
as x → ∞. Therefore
4x2 − 5
lim = 2.
x→∞ 2x2 + 3x
In general, we divide the numerator and denominator by the highest power of x in the denom-
inator.
5x3 + 6x2 − 4 sin x
Example 2.1.6. Find lim .
x→∞ cos 3x + 5x − 2x3
x2 + 3x
Example 2.1.7. Find lim √ .
x→∞ 2x4 + 3 − 4x
p p
2.1.4 Limits of the form f (x) − g(x)
√ √ √ √
Consider the limit of x + 5 − x + 2 as x → ∞. Since both x + 5 and x + 2 tend to infinity
(rather than a finite number) as x → ∞, one cannot apply Proposition 2.1.1. As the next example
shows, a simple algebraic trick is used to overcome this difficulty.
√ √
Example 2.1.8. Evaluate lim x + 5 − x + 2 , if it exists.
x→∞
√
Solution. The algebraic trick is to multiply both the numerator and denominator by x + 5 +
√
x + 2 and expand the numerator as a difference of squares:
√ √ √ √
√ √ ( x + 5 − x + 2)( x + 5 + x + 2)
x+5− x+2= √ √
x+5+ x+2
(x + 5) − (x + 2)
=√ √
x+5+ x+2
3
=√ √
x+5+ x+2
→0
c
2020 School of Mathematics and Statistics, UNSW Sydney
2.2. THE DEFINITION OF LIM F (X) 31
X→∞
as x → ∞.
One should not be fooled into thinking, on the basis of the above example, that all limits of
this type are zero.
p 1
Example 2.1.9. Show that lim x2 + x − x = .
x→∞ 2
f (x)
lim ,
x→∞ g(x)
when f (x) → ∞ and g(x) → ∞ as x → ∞. We say that such a limit is of the form ∞ ∞ . We cannot
∞
say in advance whether or not a limit of the form ∞ exists, and if it does exist, what its value is.
For example, while the following limits have the form ∞ ∞ , each displays very different limiting
behaviour as x → ∞:
x2
• x → ∞ as x → ∞
x
• x2
→ 0 as x → ∞
2x2
• x2
→ 2 as x → ∞
x2 1
• 2x2
→ 2 as x → ∞.
Because we cannot determine in advance what kind of limiting behaviour something of the form
∞ ∞
∞ has, we say that ∞ is an indeterminate form.
Other types of indeterminate forms (for example, limits of the form 00 and ∞−∞) arise naturally
in many applications. Techniques for evaluating such limits will be developed at various stages of
the course.
Up to this point, we have treated limits in a rather intuitive way. It is now time to establish a
rigorous basis for our treatment of limits. That is, we seek a definition for lim f (x) = L that is
x→∞
mathemetically precise and agrees with our intuitive notion of limits. How do we do this? A good
strategy is to think deeply about a few simple examples before considering the general case.
Consider the real-valued functions f , g and h defined by
1
f (x) = , g(x) = sin x and h(x) = 0.05
x
whenever x > 0. Their graphs are sketched below.
c
2020 School of Mathematics and Statistics, UNSW Sydney
32 f (x) g(x) h(x) CHAPTER 2. LIMITS
2π 4π
0 0 0
x x x
Intuitively, we see that f (x) tends to 0 as x tends to infinity, while neither g nor h have this
property. What is it about f that distinguishes its limiting behaviour from that of g or h?
The above graph shows that the distance between f (x) and its limit 0 is small whenever x is
large. However, to get to the core difference between f and the other two functions, we must be
more precise.
• The distance between h(x) and 0 is also small, but its limit is not 0. The reason why f (x)
tends to 0 as x → ∞ is that the distance between f (x) and 0 can be made as small as we
like whenever x is large enough.
• The distance between g(x) and 0 can also be made as small as we like, by taking x
sufficiently close to an integer multiple of π. However the limit of g(x) is not 0. The
difference between f and g is that the distance between f (x) and 0 remains small for all
sufficiently large values of x. The same is not true for g.
In summary, f (x) tends to 0 as x → ∞ because the distance between f (x) and 0 can be made as
small as we like for all values of x that are sufficiently large.
The next step in crafting a good definition is to express phrases like ‘can be made as small as we
like’ and ‘sufficiently large’ in concrete mathematical language. The following list is one attempt
to do this. (As a reminder, f (x) = 1/x.)
• The distance between f (x) and 0 is less than 1 whenever x > 1.
y
y = f (x)
(1, 1)
0
1 x
−1
• The distance between f (x) and 0 is less than 0.2 whenever x > 5.
y
y = f (x)
(5, 0.2)
0.2
0
−0.2 5 x
c
2020 School of Mathematics and Statistics, UNSW Sydney
2.2. THE DEFINITION OF LIM F (X) 33
X→∞
• The distance between f (x) and 0 is less than 0.1 whenever x > 10.
• The distance between f (x) and 0 is less than 0.01 whenever x > 100.
• The distance between f (x) and 0 is less than 0.0001 whenever x > 10000.
• etc.
Of course, to express the fact that ‘the distance between f (x) and 0 can be made as small as
we like’ in this way would require an infinite list. Instead, we say that, for every small positive
number ǫ, there is real number M such that
• the distance between f (x) and 0 is less than ǫ whenever x > M .
y
y = f (x)
( 1ǫ , ǫ)
ǫ
0
−ǫ M x
By looking at the graph we can see that M = 1ǫ . (The symbol ǫ is the Greek letter ‘epsilon’ and is
traditionally used by mathematicians in this context.)
So far we have agreed that f (x) → 0 as x → ∞ precisely because
for every positive real number ǫ, there is a real number M such that the distance
between f (x) and 0 is less than ǫ whenever x > M .
This statement forms a basis for defining what is meant by lim f (x) = 0. A general definition for
x→∞
what is meant by lim f (x) = L is now obtained by
x→∞
• replacing ‘the distance between f (x) and L’ with |f (x) − L| (see Section 1.3).
We are now ready to give a general definition for the limit of a function at infinity.
Remark 2.2.2. Suppose that lim f (x) = L. Definition 2.2.1 can be interpreted geometrically in
x→∞
the following way. For every small number ǫ, you can find a marker M on the x-axis such that the
distance between f (x) and the limit L is less than ǫ whenever x lies to the right of M .
c
2020 School of Mathematics and Statistics, UNSW Sydney
34 CHAPTER 2. LIMITS
y
y = f (x)
L+ǫ
L
L−ǫ
|
M x
Remark 2.2.3. In Definition 2.2.1, the number M depends on ǫ. In general, the smaller the value
of ǫ, the larger the value of M . We note, however, that for any particular value of ǫ the choice of
M is not unique. For example, the M chosen for the above diagram is different from the M chosen
for the diagram below, but both choices guarantee that
y
y = f (x)
L+ǫ
L
L−ǫ
|
M x
Remark 2.2.4. Students who find Definition 2.2.1 difficult to understand can take comfort in the
fact that a rigorous formulation of limits evaded mathematicians for over 2000 years. The problems
and paradoxes involving limiting processes tabled around 450 BC by the ancient Greek philosopher
Zeno of Elea were only satisfactorily resolved in the nineteenth century, when a rigorous definition
of the limit was given by Bolzano (in 1817) and Weierstrass (in the 1850s).
In the previous section we gave a definition for lim f (x) = L. In this section this definition is used
x→∞
to prove limits that were previously taken for granted.
4
Example 2.3.1. Prove from Definition 2.2.1 that lim = 0.
x→∞ x2
Proof. Suppose that f (x) = 4/x2 , L = 0 and ǫ > 0. We begin by calculating the distance between
the function and its limit:
4
|f (x) − L| = 2 − 0
x
4
= 2. (2.2)
x
c
2020 School of Mathematics and Statistics, UNSW Sydney
2.3. PROVING THAT LIM F (X) = L USING THE LIMIT DEFINITION 35
X→∞
We need to find a condition on x such that |f (x) − L| < ǫ. By (2.2), we require that
4
< ǫ.
x2
By rearrangement, we see that this is satisfied whenever
2
x> √ .
ǫ
2
M=√ . (2.3)
ǫ
The above proof can be streamlined by omitting some explanatory commentary. In the next
example, explanatory comments (which are not part of the proof) are inserted inside square brack-
ets.
Proof. Suppose that ǫ > 0. For convenience, suppose also that x > 0. [This is allowed since we are
considering the behaviour of f (x) for large positive values of x.] Now
5x
|f (x) − L| = − 5
x+3
5x − 5(x + 3)
=
x+3
−15
=
x + 3
15
= [since x > 0]
x+3
15
< [to make algebra simpler later on].
x
In summary,
15
|f (x) − L| < .
x
[This inequality gives an upper bound for the distance between f (x) and L.] Hence |f (x) − L| < ǫ
whenever
15
< ǫ.
x
This condition is equivalent to
15
x> .
ǫ
c
2020 School of Mathematics and Statistics, UNSW Sydney
36 CHAPTER 2. LIMITS
15
Hence if M = ǫ then
|f (x) − L| < ǫ whenever x > M,
as required.
Before attempting one more proof, we reflect on the general strategy employed. Given ǫ, we
need to find a number M such that
Remark 2.3.4. As pointed out in Remark 2.2.3, the value of M is not unique. For instance, the
upper bound for |f (x) − L| in our solution to Example 2.3.3 was 1/(9x2 ) and the corresponding
√
value for M was 1/(3 ǫ). If instead we use the upper bound 1/e3x then the corresponding value
for M is 13 ln 1ǫ .
c
2020 School of Mathematics and Statistics, UNSW Sydney
2.4. PROOFS OF BASIC LIMIT RESULTS (MATH1141 ONLY) 37
ǫ
|f (x) − L1 | < whenever x > M1 .
2
Similarly, there is a positive M2 such that
ǫ
|g(x) − L2 | < whenever x > M2 .
2
Hence, by the triangle inequality,
(f (x) + g(x)) − (L1 + L2 ) ≤ |f (x) − L1 | + |g(x) − L2 |
ǫ ǫ
< +
2 2
=ǫ
The other rules in Proposition 2.1.1 can be proved similarly, except that sometimes the desired
inequalities are harder to obtain. We move on to prove the pinching theorem.
and that
lim f (x) = lim h(x) = L.
x→∞ x→∞
c
2020 School of Mathematics and Statistics, UNSW Sydney
38 CHAPTER 2. LIMITS
and
|h(x) − L| < ǫ whenever x > M2 .
In other words,
L − ǫ < f (x) < L + ǫ whenever x > M1
and
L − ǫ < h(x) < L + ǫ whenever x > M2 .
Hence
L − ǫ < f (x) ≤ g(x) ≤ h(x) < L + ǫ
whenever x > max{M1 , M2 }. This shows that
f (x)
4 b
|
3 bc
|
2 bc
|
1 b b
|
| | | |
2 4 6 8 x
With reference to this graph, we will discuss the behaviour of f (x) when x is near the points 2, 4,
6 and 8.
• As x approaches 2 from the left-hand side, f (x) approaches 3. In this situation, we write
lim f (x) = 3
x→2−
c
2020 School of Mathematics and Statistics, UNSW Sydney
2.5. LIMITS OF FUNCTIONS AT A POINT 39
and say that ‘the limit of f (x) as x approaches 2 from the left is 3.’ (Note that f (2) = 1 6= 3.
One should not confuse a limit with a function value.) As x approaches 8 from the left-hand
side, f (x) approaches 1 and hence we write
lim f (x) = 1.
x→8−
• As x approaches 2 from the right-hand side, f (x) approaches 1. In this situation, we write
lim f (x) = 1
x→2+
and say that ‘the limit of f (x) as x approaches 2 from the right is 1.’ As x approaches 6
from the right-hand side, f (x) approaches 2 and hence we write
lim f (x) = 2.
x→6+
• As x approaches 4 from the left-hand side, f (x) is positive and grows large without bound.
In this case, lim f (x) does not exist because f (x) does not approach a real number.
x→4−
However, we can write
f (x) → ∞ as x → 4− .
• As x approaches 4 from the right-hand side, f (x) is negative and grows large without
bound. In this case, lim f (x) does not exist but we can we write
x→4+
f (x) → −∞ as x → 4+ .
Once an understanding of left- and right-hand limits is grasped, we can talk about two-sided
limits.
Definition 2.5.1. If the left-hand limit lim f (x) and the right-hand limit
x→a−
lim f (x) both exist and equal the same real number L, then we say that the limit
x→a+
of f (x) as x → a exists and is equal to L, and we write
lim f (x) = L.
x→a
If any one of these conditions fails then we say that lim f (x) does not exist.
x→a
Recall the graph of the function f introduced at the beginning of this section.
• The two-sided limit lim f (x) does not exist because
x→2
• The two-sided limit lim f (x) does not exist because lim f (x) does not exist.
x→4 x→4−
c
2020 School of Mathematics and Statistics, UNSW Sydney
40 CHAPTER 2. LIMITS
lim f (x) = 2.
x→6
That is, the two-sided limit of a function at a point need not equal the value of the function
at that point.
• Since the left-hand and right-hand limits at 8 both exist and equal 1, the two-sided limit
lim f (x) exists and
x→8
lim f (x) = 1.
x→8
In this case, the two-sided limit of f at 8 equals the value of the function at 8. That is,
The last two points emphasise an important feature of limits and functions. If lim f (x) = L
x→a
then f (x) approaches L as x tends to a. The function f may not be defined at a, or if it is defined
at a, we can have the situation where f (a) = L or where f (a) 6= L. In other words, the value of
f (a) does not determine the value of lim f (x). As far as the limit is concerned, all that matters is
x→a
how f behaves when it is very close to the point a.
Remark 2.5.2. In this section we have not given a proper definition for limits of the form lim f (x)
x→a−
or lim f (x). At this stage it is enough to know that such a rigorous definition exists and is similar
x→a+
to the definition for limits of the form lim f (x) (see Definition 2.2.1). This definition can be used
x→∞
to prove the basic limit results presented in the remainder of this chapter.
Definition 2.5.3. Suppose that f is defined on some open interval containing the
point a. If lim f (x) = f (a), then we say that f is continuous at a; otherwise, we
x→a
say that f is discontinuous at a.
c
2020 School of Mathematics and Statistics, UNSW Sydney
2.5. LIMITS OF FUNCTIONS AT A POINT 41
f (x)
bc
bc
| | |
2 4 6 x
The function f is continuous at 2, discontinuous at 4 and undefined at 6. (We do not say that f is
discontinuous at 6 since 6 ∈
/ Dom(f ).)
We now extend our formal notion of continuity at a point to continuity on the real line.
If a function f : R → R is continuous everywhere then it has the property that its graph can be
drawn on the Cartesian plane as an unbroken curve. This coincides with intuitive notions about
continuity gained in high school.
As mentioned in Section 1.9, polynomials, rational functions, the trigonometric functions, ex-
ponentials and logarithms are continuous at every point in the respective domains. Moreover, if
f is continuous at a point a then |f | is also continuous at a (see Section 1.7 for the definition of
|f |). The proof of these facts requires a rigorous definition of the limit at a point, and, as already
mentioned, such a definition is not presented in this course. However, we will discuss aspects of
the proof in Chapter 3.
A consequence of the above definition is that both one-sided and two-sided limits of f at a point
a are easy to evaluate provided that (a) a ∈ Dom(f ) and (b) we know that f is a continuous at a.
Hence, in view the facts mentioned above,
Remark 2.5.5. Continuity is a deep property for a function to have. Contrary to the impression
that many students form from their study of functions at school, most functions are not continuous.
Three interesting functions with discontinuities are mentioned below.
c
2020 School of Mathematics and Statistics, UNSW Sydney
42 CHAPTER 2. LIMITS
If the functions f and g are continuous everywhere, then Proposition 2.5.7 implies that
lim g(f (x)) = g lim f (x)
x→a x→a
c
2020 School of Mathematics and Statistics, UNSW Sydney
2.5. LIMITS OF FUNCTIONS AT A POINT 43
Solution. Since the cosine function and polynomials are continuous everywhere,
!
π π
2 2
√ cos x + 6 = cos
lim lim
√ x +6 (by Proposition 2.5.7)
x→ π/3 x→ π/3
π π
= cos + (since polynomials are continuous)
3 6
π
= cos
2
= 0.
A version of the pinching theorem for two-sided limits is given below. Similar versions exist for
left- and right-hand limits.
y = h(x)
L
y = g(x)
a
|
x
y = f (x)
Theorem 2.5.9 (The pinching theorem). Let I be an open interval containing the point a. Suppose
that f , g and h are all defined on I except possibly at the point a. If
and
lim f (x) = lim h(x) = L
x→a x→a
then
lim g(x) = L.
x→a
Remark 2.5.10. The pinching theorem can be used to prove the well-known limit
sin θ
lim = 1.
θ→0 θ
We close this chapter with four examples involving limits of the form lim f (x), where f is not
x→a
necessarily continuous at the point a. Determining whether or not the limit exists, and if so what
its value is, requires special analysis of the function f .
c
2020 School of Mathematics and Statistics, UNSW Sydney
44 CHAPTER 2. LIMITS
lim x sin(1/x)
x→0
−1 ≤ sin θ ≤ 1
−1 ≤ sin(1/x) ≤ 1 whenever x 6= 0.
−x ≤ x sin(1/x) ≤ x.
Since lim −x = lim x = 0, it follows from the pinching theorem that lim x sin(1/x) = 0.
x→0+ x→0+ x→0+
Case 2: Suppose that x < 0. Then
−x ≥ x sin(1/x) ≥ x.
Since lim −x = lim x = 0, it follows from the pinching theorem that lim x sin(1/x) = 0.
x→0− x→0− x→0−
Conclusion: Since the left- and right-hand limits exist and are both equal to 0, the two-sided
limit exists and
lim x sin(1/x) = 0.
x→0
This limit is illustrated graphically below.
y
y = −x y=x
y = x sin(1/x)
c
2020 School of Mathematics and Statistics, UNSW Sydney
2.5. LIMITS OF FUNCTIONS AT A POINT 45
Solution. Since we are discussing limiting behaviour, the value of f at 2 is irrelevant. All that
matters is how f (x) behaves near 2. So suppose that x 6= 2. Then
x2 − 4
f (x) =
x−2
(x − 2)(x + 2)
=
x−2
= x + 2.
f (x)
4 bc
2 x
Since
lim f (x) = lim f (x) = 4,
x→2− x→2+
the two-sided limit exists and we have lim f (x) = 4. (Note that f is not continuous at 2 since
x→2
lim f (x) 6= f (2).)
x→2
Solution. Since f is an odd function, we initially restrict our attention to positive values of x. Now
sin θ = 0 whenever θ ∈ {π, 2π, 3π, . . .}. If θ = π/x then we have sin(π/x) = 0 whenever
1 1 1
x = , , ,....
1 2 3
Similarly, sin θ = 1 whenever θ ∈ {π/2, 5π/2, 9π/2, 13π/2, . . .} so sin(π/x) = 1 whenever
2 2 2 2
x= , , , ....
1 5 9 13
c
2020 School of Mathematics and Statistics, UNSW Sydney
46 CHAPTER 2. LIMITS
Consequently, no matter how close x is to 0 on the right, we can always find two closer points x1
and x2 such that f (x1 ) = 0 and f (x2 ) = 1. We conclude that f has no right-hand limit at 0. Since
f is odd, it follows that f has no left-hand limit at 0. Hence lim f (x) does not exist.
x→0+
The graph of f near the origin is shown below.
f (x)
1
−1
The f (x) oscillates wildly between 1 and −1 as x tends to 0, and lim f (x) does not exist.
x→0+
1/2
> # Maple produces results that are not in strict agreement with our definitions
as x goes to plus or minus infinity
> limit(x^2, x=-infinity);
∞
> limit(sin(x), x=-infinity);
−1 . . . 1
> limit(x^2*sin(x), x=-infinity);
undefined
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 2 47
2x2 + 5x − 1 x5 + 5x + 1
c) lim d) lim
x→∞ x3 + x x→∞ x4 + 3
5x2 − 3x + cos 7x
e) lim f) lim sin x
x→∞ 4 + sin 2x + x2 x→∞
3. [R] [V]
√ √
a) Prove that lim ( x + 1 − x) = 0.
x→∞
p 1
b) Show that lim ( x2 + x − x) = .
x→∞ 2
4. [R]
lim f (x) = L.
x→∞
1
b) Evaluate lim .
x→∞2x2
c) Verify from the formal definition that your answer in (b) is correct.
5. [R]
x2 + 1
a) Evaluate lim .
x→∞ x2
x2 + 1
b) Find a real number M such that the distance between and its limit is less
x2
than 0.01 whenever x > M .
c) Suppose that ǫ > 0. Find a real number M (expressed in terms of ǫ) such that the
x2 + 1
distance between and its limit is less than ǫ whenever x > M .
x2
c
2020 School of Mathematics and Statistics, UNSW Sydney
48 CHAPTER 2. LIMITS
6. [R] [V] For each of the following, find the limit of f (x) as x tends to infinity and prove
from the definition that your answer is correct.
4x x−3
a) f (x) = b) f (x) = 2 c) f (x) = e−2x
x+7 x +3
sin x sin 3x
d) f (x) = e) f (x) = 2
x x +4
7. [X]
a) With ǫ in (0, 1), Sarah solves the inequality |f (x) − 4| < ǫ and finds that the required
x values satisfy
1
x∈ ,∞ .
ǫ
Does lim f (x) exist? Give reasons for your answer.
x→∞
b) With ǫ in (0, 1), Lyndal solves the inequality |g(x)−5| < ǫ and finds that the inequality
holds for all x satisfying
1
x∈ ,∞ .
ǫ
Does lim g(x) exist? Give reasons for your answer.
x→∞
8. [R] [V] A parcel is dropped from an aeroplane. A simple model, taking into account
gravity and air resistance, suggests that the parcel’s velocity v(t) (in metres per second)
is given by v(t) = 50(1 − e−t/5 ), where t is the number of seconds since leaving the plane.
a) Calculate the terminal velocity of the parcel (that is, find lim v(t)).
t→∞
b) The parcel never attains its terminal velocity. How long does it take to come within
1 metre per second of its terminal velocity?
9. [X] For each question below, give reasons for your answer. [In some cases a single example
will be sufficient while in other cases a general proof will be required. As a reminder, if
f (x) → ∞ as x → ∞ then lim f (x) does not exist.]
x→∞
a) If lim f (x) and lim g(x) do not exist, can lim [f (x) + g(x)] or lim f (x)g(x) exist?
x→∞ x→∞ x→∞ x→∞
b) If lim f (x) exists and lim [f (x) + g(x)] exists, must lim g(x) exist?
x→∞ x→∞ x→∞
c) If lim f (x) exists and lim g(x) does not exist, can lim [f (x) + g(x)] exist?
x→∞ x→∞ x→∞
d) If lim f (x) exists and lim f (x)g(x) exists, does it follow that lim g(x) exists?
x→∞ x→∞ x→∞
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 2 49
12. [R] By finding the left- and right-hand limits first, decide whether or not each of the
following limits exist and if so find their values.
x |x2 − 4| x−4 4
a) lim b) lim c) lim d) lim
x→0 |x| x→2 x − 2 x→4 |x − 4| x→0 x
14. [R] [V] Suppose that θ is a (positive) angle measured in radians and consider the diagram
below.
D
C
θ
O A B
The curve segment CB is the arc of a circle of radius 1 centre O.
a) Write down, in terms of θ, the length of arc CB and the lengths of the line segments
CA and DB.
b) By considering areas, deduce that sin θ cos θ ≤ θ ≤ tan θ whenever 0 < θ < π2 .
θ
c) Use the pinching theorem to show that lim = 1.
θ→0+ sin θ
sin θ
d) Deduce that lim = 1.
θ→0 θ
c
2020 School of Mathematics and Statistics, UNSW Sydney
50 CHAPTER 2. LIMITS
c
2020 School of Mathematics and Statistics, UNSW Sydney
51
Chapter 3
In the world of functions, there is a small civilised country inhabited by the race of continuous
functions. Citizens of this country, such as the polynomials and exponentials, are studied (and
perhaps even loved) by students in high schools throughout the human world. However, outside
this small civilised country is a vast untamed universe of all sorts of discontinuous functions. From
this perspective, the continuous functions are a very rare breed.
However, continuous functions are far from endangered. They are among the most useful
functions for modelling real-life phenomena, particularly when the quantity that is being modelled
occurs in nature and changes smoothly with time. For example,
Proposition 3.1.1. Suppose that the functions f and g are continuous at a point a. Then f + g,
f − g and f g are continuous at a. If g(a) 6= 0 then f /g is also continuous at a.
c
2020 School of Mathematics and Statistics, UNSW Sydney
52 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS
Proof. Suppose that f and g are continuous at a. Then, by the definition of continuity at a point
(see Definition 2.5.3), lim f (x) = f (a) and lim g(x) = g(a). Therefore
x→a x→a
Hence f + g is continuous at a.
The proofs that the functions f − g, f g and f /g are continuous at a are similar.
Proposition 3.1.2. Suppose that f is continuous at a and that g is continuous at f (a). Then g ◦ f
is continuous at a.
In Chapters 1 and 2 it was stated (without proof) that the polynomials, rational functions
and trigonometric functions are continuous at every point in their respective domains. The next
example shows that the continuity of these functions can be deduced from the continuity of
Example 3.1.3. Suppose that a is a point in R. By assuming that the constant functions, the sine
function and the function f : R → R, given by f (x) = x, are continuous everywhere, show that
(c) the cosine and tangent functions are continuous at a (provided that a ∈ Dom(tan)).
Solution. (a) Every polynomial can be constructed from constant functions and the function f by
a finite number of function multiplications and additions. (For example,
x3 − 4x2 + 5 = (x × x × x) + ((−4) × x × x) + 5
for all x in R.) Since the constant functions and f are continuous at a, it follows from Proposition
3.1.1 that every polynomial is continuous at a.
(b) Every rational function r can be written in the form p/q, where p and q are polynomials.
By the result of part (a), p and q are continuous a. It follows from Proposition 3.1.1 that p/q is
also continuous at a, provided that q(a) 6= 0.
(c) Recall that
cos x = sin(π/2 − x) ∀x ∈ R.
That is, cos x = h(g(x)) where h(x) = sin x and g(x) = π/2 − x. Since the polynomial g is
continuous at a and the sine function h is continuous everywhere, the cosine function must also be
continuous at a by Proposition 3.1.2.
c
2020 School of Mathematics and Statistics, UNSW Sydney
3.2. CONTINUITY ON INTERVALS 53
Finally, since
sin x
tan x = ,
cos x
the tangent function is also continuous at a (provided that a ∈ Dom(tan)) by Proposition 3.1.1.
The next example justifies an assertion made in Section 2.5.2. We continue working under
the assumption that the constant functions and the function f : R → R, given by f (x) = x, are
continuous everywhere.
Example 3.1.4. Suppose that g : R → R is continuous at the point a and let |g| denote the
function given by
|g|(x) = |g(x)| ∀x ∈ R.
Show that |g| is continuous at the point a.
Remark 3.1.5. Throughout this section we have taken for granted that the constant functions,
the sine function and the function f : R → R, given by f (x) = x, are continuous everywhere. To
prove this, one needs a rigorous definition of the limit of a function at a point. Such a definition is
similar in flavour to the definition for the limit of a function at infinity, but will not be presented
in these notes.
Example 3.1.6. Suppose that a and b are real numbers and consider the function f : R → R given
by (
eax+b if x ≥ 0
f (x) =
cos x if x < 0.
For what values of a and b will the function f be continuous at 0?
Now
lim f (x) = lim cos x = 1, f (0) = eb and lim f (x) = lim eax+b = eb .
x→0− x→0− x→0+ x→0+
c
2020 School of Mathematics and Statistics, UNSW Sydney
54 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS
To illustrate these definitions, consider the functions f , g and h, whose graphs are shown below.
b bc b b
bc b b
| | | | | |
a b x a b x a b x
All three functions are defined on the interval [a, b]. We see that
• f is continuous on the open interval (a, b) and at the endpoint b;
• g is continuous at the endpoints a and b but not continuous on the open interval (a, b);
and
• h is continuous on the closed interval [a, b] (and, by implication, on the open interval (a, b)
and at both endpoints a and b).
z
|
f (a) b
|
| | |
a c b x
c
2020 School of Mathematics and Statistics, UNSW Sydney
3.3. THE INTERMEDIATE VALUE THEOREM 55
It is clear from the graph that one can find a real number c in [a, b] such that f (c) = z. The
intermediate value theorem, which is stated below, says that this can always be done for any
function that is continuous on [a, b].
Theorem 3.3.1 (The intermediate value theorem). Suppose that f is continuous on the closed
interval [a, b]. If z lies between f (a) and f (b) then there is at least one real number c in [a, b] such
that f (c) = z.
The intermediate value theorem is proven using the least upper bound property of the real
numbers. (It is this property that distinguishes the real numbers from the rational numbers.) We
shall not examine the proof here.
We note three important points about the theorem. First, there may be more than one number
c in [a, b] that satisfies the conclusion of the intermediate value theorem. The diagram below
illustrates this possibility.
f (x)
f (a) b
|
z
|
f (b) b
|
| |
a c1 c2 c3 b x
Second, the condition in Theorem 3.3.1 that f is continuous cannot be relaxed. For example,
consider the function f : [a, b] → R whose graph is shown below.
f (x)
f (b) b
|
b
z
|
bc
f (a) b
|
| |
a b x
Because of the discontinuity, z is not in the range of f , even though z lies between f (a) and f (b).
That is, there is no real number c in [a, b] such that f (c) = z.
Third, the intermediate value theorem is a theorem about continuous functions defined on the
real numbers. There is no analogous theorem for continuous functions defined on the rational
numbers. For example, consider the functions f : R → R and g : Q → Q, given by
c
2020 School of Mathematics and Statistics, UNSW Sydney
56 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS
However, while there√is a number c in [0, 2] such that f (c) = 0, there is no number c in [0, 2] such
that g(c) = 0 (since 2 ∈/ Dom(g)).
One of the chief applications of the intermediate value theorem is determining whether an
equation has a solution. Moreover, if a solution exists, then the theorem helps us determine an
approximate location of this solution. This is particularly helpful when the equation cannot be
solved by simple algebra.
Now
f (0) = e0 − sin 0 − 4 = −3
while
f (π/2) = eπ + 1 − 4 ≈ 20.14.
Since f is continuous on the closed interval [0, π/2] and 0 lies between f (0) and f (π/2), the
intermediate value theorem implies that there is a real number c in [0, π/2] such that f (c) = 0.
That is,
e2c = sin c + 4
for some c in [0, π/2]. Moreover, one can check that c 6= 0. Hence the equation has at least one
positive solution c.
Note that in equation (3.1), one cannot isolate x to find an explicit solution. However, now
that it is known a solution exists in the interval [0, π/2], one could use Newton’s method to find a
good approximation to this solution.
c
2020 School of Mathematics and Statistics, UNSW Sydney
3.4. THE MAXIMUM-MINIMUM THEOREM 57
(a) We say that a point c in [a, b] is an absolute minimum point for f on [a, b]
if f (c) ≤ f (x) for all x in [a, b]. The corresponding value f (c) is called the
absolute minimum value of f on [a, b]. If f has an absolute minimum point
on [a, b] then we say that f attains its minimum on [a, b].
(b) We say that a point d in [a, b] is an absolute maximum point for f on [a, b]
if f (x) ≤ f (d) for all x in [a, b]. The corresponding value f (d) is called the
absolute maximum value of f on [a, b]. If f has an absolute maximum point
on [a, b] then we say that f attains its maximum on [a, b].
An absolute maximum point and an absolute minimum point are sometimes referred to as a
global maximum point and a global minimum point.
Example 3.4.2. Consider the functions g and h, whose graphs are illustrated below.
g(x) h(x)
4 4
|
3 3
|
2 2
|
1 1 b
|
| | | | | | | | | |
1 2 3 4 5 x 1 2 3 4 5 x
The absolute minimim and maximum points of g and h on [1, 5] are recorded in the following table.
g h
Absolute minimum points none 3
Absolute minimum value n.a. 1
Absolute maximum points 2, 4 none
Absolute maximum value 4 n.a.
The above example shows that a function f : [a, b] → R need not have an absolute maximum
point (or an absolute minimum point) on a closed interval [a, b]. However, if f is continuous on
[a, b] then such points always exist.
As with the intermediate value theorem, the least upper bound property of the real numbers is
used to prove the maximum-minimum theorem. We omit the proof.
c
2020 School of Mathematics and Statistics, UNSW Sydney
58 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS
Find the absolute maximum and absolute minimum values of f on the interval [1, 10].
Solution. Since f (x) = (x − 2)(x − 8), the function f has zeros at 2 and 8.
f (x)
|
2 8 |
1 5 10 x
By symmetry, we know that the axis of the parabola is the line x = 5. Hence 5 is an absolute
minimum point for f on [1, 10]. Since f (5) = −9, the absolute minimum value of f on [1, 10] is −9.
It is clear from the sketch of f that at least one of the endpoints of the interval [1, 10] must be
an absolute maximum point for f on [1, 10]. Now f (1) = 7 while f (10) = 16. So 10 is an absolute
maximum point for f on [1, 10] and the corresponding maximum value of f is 16.
Remark 3.4.5. While continuity on a closed interval guarantees the existence of at least one
absolute maximum point and absolute minimum point, locating such points may not be easy. A
systematic approach to finding maximum and minimum points will be developed in Chapters 4 and
5.
Remark 3.4.6. All the hypotheses stated in the maximum-minimum theorem must be satisfied for
the conclusion of the theorem to be valid. If the interval is closed but the function is not continuous,
then an absolute maximum (or minimum) point may not exist (see Example 3.4.2). The next
example shows that if the function is continuous on the interval but the interval is not closed then
an absolute maximum (or minimum) point may not exist. It is expected that students will know
the precise hypotheses of the maximum-minimum theorem (and other important theorems, such as
the intermediate value theorem) and know how to apply the theorem correctly.
f (x)
2
|
|
0 1 x
As x → 1− , f (x) → 2 but there is no real number d in (0, 1) such that f (d) = 2. By a similar
observation, f has no absolute minimum point on (0, 1).
c
2020 School of Mathematics and Statistics, UNSW Sydney
3.4. THE MAXIMUM-MINIMUM THEOREM 59
The final result of this chapter is a corollary of maximum-minimum theorem. We begin with a
definition.
In other words, f is bounded if the y-values of its graph lie between −M and M for some
positive number M . The sine and cosine functions are obvious examples of functions that are
bounded on the whole of the real line.
Proof. Suppose that f is continuous on [a, b]. Then the function |f | is also continuous on [a, b]
(see Example 3.1.4). By the maximum-minimum theorem, |f | attains its maximum on [a, b]. If M
denotes the absolute maximum value of |f | on [a, b] then
Hence f is bounded.
c
2020 School of Mathematics and Statistics, UNSW Sydney
60 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS
where k is a real number. For which values of k (if any) will f be continuous everywhere?
4. [H] Use the pinching theorem for limits to show that if f , g and h are three functions
defined on an open interval I, such that
5. [R] Show that the function f , given by f (x) = x3 − 5x + 3, has a zero in each of the
intervals [−3, −2], [0, 1] and [1, 2].
6. [R] [V] Use the intermediate value theorem to show that the equation ex = 2 cos x has at
least one positive real solution.
7. [H] Suppose that f is continuous on [0, 1] and that Range(f ) is a subset of [0, 1]. By using
g(x) = f (x) − x, prove that there is a real number c in [0, 1] such that f (c) = c.
8. [X] Suppose that f is a continuous function such that f (0) = 1 and lim f (x) = −1. Show
x→∞
that f has a zero somewhere in (0, ∞).
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 3 61
9. [R] In each case, determine whether or not f attains a maximum on the given interval.
Give reasons for your answer.
2
x ln x
a) f (x) = x − 4 on [−3, 5] b) f (x) = sin(e ) + 2
on [2, 4]
x − 1
c) f (x) = x2 − 4 on (−3, 5) d) f (x) = −(x2 − 4) on (−3, 5)
10. [H] [V] Suppose that f is a continuous function on R and that lim f (x) = lim f (x) = 0.
x→∞ x→−∞
a) Give an example of such a function which has both a maximum value and a minimum
value.
b) Give an example of such a function which has a minimum value but no maximum
value.
c) [X] Show that if there is a real number ξ such that f (ξ) > 0 then f attains a
maximum value on R. [Note that the maximum-minimum theorem only applies to
finite closed intervals [a, b].]
c
2020 School of Mathematics and Statistics, UNSW Sydney
62 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS
c
2020 School of Mathematics and Statistics, UNSW Sydney
63
Chapter 4
Differentiable functions
A cyclist accelerates from rest in such a way that her displacement s (in metres) from her starting
position after t seconds is given by
s = t2 ∀t ∈ [0, 3].
The corresponding displacement–time graph for the first three seconds is shown below.
displacement (m)
4 b
|
1 b
|
| |
0 1 2 3
time (s)
∆s
v= ,
∆t
where ∆s denotes the change in displacement corresponding to a change ∆t in time. For example,
the average velocity over the time interval [1, 1.5] is 2.5 metres per second, as calculated below:
∆s (1.5)2 − (1)2
v [1,1.5] = = = 2.5. (4.1)
∆t 1.5 − 1
Average velocities over small time intervals are recorded in the following table.
c
2020 School of Mathematics and Statistics, UNSW Sydney
64 CHAPTER 4. DIFFERENTIABLE FUNCTIONS
The table suggests that the average velocity approaches 2 meters per second as ∆t → 0. Hence it
appears that instantaneous velocity of the cyclist when t = 1 is 2 meters per second.
One of the goals of this chapter is put these kinds of ideas on a rigorous footing in a general
context (where they can be applied to understanding rates of change of any quantity, not just
displacement). Hence we reframe these ideas in terms of functions and analytic geometry. More
precisely, we shall now show that the problem of determining instantaneous velocity is equivalent
to the problem of finding the gradient of a tangent line to the graph of a function.
To begin, the problem of calculating average velocity is equivalent to the problem of calculating
the gradient of a secant to the graph of a function. (A secant is a straight line that intersects a
curve at least twice.) For example, compare the calculation of average velocity in (4.1) with the
calculation of the gradient of the secant shown in the following diagram.
s = t2
secant
2.25 b
|
s
∆s (1.5)2 −(1)2
gradient of secant = ∆t = 1.5−1 = 2.5
1 b
|
| |
1 1.5
t
In the same way, the above table of average velocities can be reinterpreted geometrically as a table
of gradients corresponding to secants of the curve s = t2 .
c
2020 School of Mathematics and Statistics, UNSW Sydney
4.1. GRADIENTS OF TANGENTS AND DERIVATIVES 65
The secants used in the table are illustrated over the page in Figure 4.1. The figure suggests that
as ∆t → 0, the gradient of the corresponding secant approaches the gradient of the tangent line to
the curve. Thus the problem of determining instantaneous velocity is equivalent to the problem of
determining the gradient of a tangent to the graph of a function. It is to this general problem that
we now turn.
Example 4.1.1. Suppose that f : R → R is given by f (x) = x3 . Find the gradient of the tangent
to the graph of f when x = 2.
Solution. The tangent to the graph of f passes through the point P (2, 23 ). We will approximate
the gradient of the tangent with the gradient of a secant passing through the points P (2, 23 ) and
Q(2 + h, (2 + h)3 ), where h is a very small real number. (See the illustration below for the case
when h > 0.)
y
y = f (x)
secant
23 P (2, 23 ) b
|
| |
2 2+h x
Using the ‘rise over run’ formula, the gradient of the secant is
(2 + h)3 − 23
.
h
Now, as h approaches 0, the point Q moves along the curve towards P , and hence the gradient of
c
2020 School of Mathematics and Statistics, UNSW Sydney
66 CHAPTER 4. DIFFERENTIABLE FUNCTIONS
s = t2
secants
tangent
2.25 b
|
1.96 b
|
s
1.69 b
|
1.44 b
|
1.21 b
|
1 b
|
| | | | | |
c
2020 School of Mathematics and Statistics, UNSW Sydney
4.1. GRADIENTS OF TANGENTS AND DERIVATIVES 67
There is nothing special about the point 2 in the previous example; we could calculate the
gradient of the tangent to f at any point x. The calculation would go something like this:
(x + h)3 − x3
(gradient of tangent at x) = lim
h→0 h
x + 3x2 h + 3xh2 + h3 − x3
3
= lim
h→0 h
2
3x h + 3xh 2
= lim
h→0 h
2
= lim 3x + 3xh
h→0
2
= 3x .
So the gradient of the tangent to f when x = 5 is 75 (since 3 × 52 = 75), while the gradient of the
tangent when x = −4 is 48. The gradient function of f , denoted by f ′ and given by
f ′ (x) = 3x2 ∀x ∈ R,
is called the derivative of f .
We can repeat the same procedure for functions other than f (x) = x3 . The gradient function
(or derivative) may be calculated using the limit given in the next definition.
Definition 4.1.2. Suppose that f is defined on some open interval containing the
point x. We say that f is differentiable at x if
f (x + h) − f (x)
lim
h→0 h
exists. If the limit exists, we denote it by f ′ (x). We call f ′ (x) the derivative of f at
x.
df d
Other notation for f ′ (x) includes dx (x) and dx f (x). If y = f (x) then the derivative is often
dy
denoted by y ′ or dx . The ratio
f (x + h) − f (x)
h
is called the difference quotient for f at the point x.
c
2020 School of Mathematics and Statistics, UNSW Sydney
68 CHAPTER 4. DIFFERENTIABLE FUNCTIONS
dy ∆y
Remark 4.1.3. The notation dx originates from the gradient formula ∆x , where ∆y represents a
change in y corresponding to a small change ∆x in x. (The symbol ∆ represents the Greek letter
‘delta’, which is equivalent to our capital ‘D’.) Some calculus texts prefer the use of delta notation
and write the difference quotient for a function f as
f (x + ∆x) − f (x)
.
∆x
√
Example 4.1.4. Suppose that f : [0, ∞) → R is given by f (x) = x. Find the equation of the
tangent to f when x = 16.
Solution. First we shall calculate the derivative of f (if it exists) at 16. Note that, in the working
below, the limit as h approaches 0 is difficult to evaluate immediately, so we use a standard trick
(see, for example, Subsection 2.1.4):
√ √
f (16 + h) − f (16) 16 + h − 16
=
h √ h √ √ √
16 + h − 16 16 + h + 16
= ×√ √
h 16 + h + 16
16 + h − 16
= √ √
h( 16 + h + 16)
h
= √ √
h( 16 + h + 16)
1
=√ √
16 + h + 16
1
→
8
as h → 0. So the gradient of the tangent is 1/8. Using the point-gradient formula, the equation of
the tangent is
y − y1 = m(x − x1 )
1
y − 4 = (x − 16)
8
8y − x − 16 = 0.
Example 4.1.5. Suppose that f : R → R is given by f (x) = |x|. Determine whether or not f is
differentiable at the point 0.
Solution. We calculate the difference quotient of f at 0:
f (0 + h) − f (0) |h|
=
h (h
h
h if h > 0
= −h
h if h < 0.
(
1 if h > 0
=
−1 if h < 0.
c
2020 School of Mathematics and Statistics, UNSW Sydney
4.2. RULES FOR DIFFERENTIATION 69
Because of the ‘split formula,’ we must consider the left- and right-hand limits as h approaches 0
separately. Now
f (0 + h) − f (0)
lim = −1
h→0− h
while
f (0 + h) − f (0)
lim = 1.
h→0+ h
Since the left- and right-hand limits are different, the two-sided limit doesn’t exist and hence f is
not differentiable at 0.
Note that this conclusion makes sense geometrically, since the graph of f has a ‘vertex’ at 0
and there is no unique tangent that touches the graph at this point.
f (x) f ′ (x)
C, where C is a constant 0
xn , where n is a positive integer nxn−1
sin x cos x
ex ex
c
2020 School of Mathematics and Statistics, UNSW Sydney
70 CHAPTER 4. DIFFERENTIABLE FUNCTIONS
With these basic functions, it is possible to construct many other functions via function addition,
subtraction, multiplication and division. The next theorem describes what happens to the resulting
derivatives.
Theorem 4.2.1 (Rules for differentiation). Suppose that f and g are differentiable at x. Then
f + g, f − g and f g are differentiable at x. If g(x) 6= 0 then fg is also differentiable at x. Moreover,
Theorem 4.2.2 (The chain rule). Suppose that g is differentiable at the point x and f is differen-
tiable at the point g(x). Then f ◦ g is differentiable at x and
You are probably more familiar with the chain rule expressed in the following way: if y = f (u)
and u = g(x) then
dy dy du
= . (4.3)
dx du dx
While formula (4.3) looks simpler than (4.2), it does not specify the relationship between the
variables or the points at which each derivative is to be evaluated. It should therefore be used with
care.
The next example illustrates how the chain rule formula (4.2) is applied.
Example 4.2.3. Suppose that y = (sin x + 1)7 . Find the derivative of y at the point x.
Solution. Suppose that f (x) = x7 and g(x) = sin x + 1. Then f ′ (x) = 7x6 and g′ (x) = cos x. Then
y = (f ◦ g)(x). By the chain rule, the derivative of y at x is given by
c
2020 School of Mathematics and Statistics, UNSW Sydney
4.3. PROOFS OF RESULTS IN SECTION 4.2 71
As mentioned at the beginning of this section, derivatives of many functions can be obtained
from those of a few basic functions by the application of simple rules. The next example illustrates
this point.
Example 4.2.4. Show that
d m
(a) x = mxm−1 , where m is an integer;
dx
d
(b) cos x = − sin x; and
dx
d
(a) tan x = sec2 x.
dx
c
2020 School of Mathematics and Statistics, UNSW Sydney
72 CHAPTER 4. DIFFERENTIABLE FUNCTIONS
The derivative of ex will be dealt with in Chapter 9. The derivative of constant functions can
be easily computed from Definition 4.1.2.
Suppose that f is given by f (x) = xn , where n is a positive integer. The derivative of f can be
computed using the same method as Example 4.1.1, with the assistance of the binomial theorem.
Alternatively, one can use mathematical induction and the product rule for differentiation.
Proof of the product rule for differentiation. Suppose that f and g are differentiable at the point
x. The difference quotient of f g at x gives
(f g)(x + h) − (f g)(x)
h
f (x + h)g(x + h) − f (x)g(x)
=
h
f (x + h)g(x + h) − f (x)g(x + h) + f (x)g(x + h) − f (x)g(x)
=
h
f (x + h) − f (x) g(x + h) − g(x)
= g(x + h) + f (x) . (4.6)
h h
Now g is differentiable at x, so g is also continuous at x (see Theorem 4.5.1). Hence g(x+h) → g(x)
as h → 0. Therefore (4.6) implies that
(f g)(x + h) − (f g)(x)
→ g(x)f ′ (x) + f (x)g ′ (x)
h
as h → 0.
Proofs of other differentiation rules are found in most undergraduate calculus textbooks.
Example 4.4.1. Suppose that y is a function of x, and that y and x are related by the formula
x2 + y 2 = 1.
dy
Calculate dx .
d 2 d 2 d
(x ) + (y ) = (1). (4.7)
dx dx dx
c
2020 School of Mathematics and Statistics, UNSW Sydney
4.4. IMPLICIT DIFFERENTIATION 73
d d
Now clearly, dx (x2 ) = 2x and dx (1) = 0. To simplify the central term of (4.7), we regard y as a
function of x and suppose that g(y) = y 2 . The chain rule gives
d 2 dg
(y ) =
dx dx
dg dy
=
dy dx
dy
= 2y .
dx
So continuing from (4.7) we have
dy
2x + 2y = 0.
dx
dy
By isolating dx we obtain
dy −2x
=
dx 2y
x
=−
y
as required.
(We note in passing that this answer coincides with the derivative obtained by first expressing
y as a function of x and then differentiating. In particular, if
y = (1 − x2 )1/2
then
dy 1 x x
= (1 − x2 )−1/2 (−2x) = − =−
dx 2 (1 − x2 )1/2 y
as above.)
d dy
g(y) = g′ (y) (4.8)
dx dx
by the chain rule. The use of this formula is illustrated below.
y 4 + x3 − x2 e3y = 4. (4.9)
Find the equation of the tangent to the corresponding curve at the point where (x, y) = (2, 0).
d n
In Section 4.2 we saw that dx x = nxn−1 whenever n is an integer. A similar formula holds
when n is any rational number. This can be easily proved using implicit differentiation.
d q
x = qxq−1 .
dx
c
2020 School of Mathematics and Statistics, UNSW Sydney
74 CHAPTER 4. DIFFERENTIABLE FUNCTIONS
m
Proof. Since q is a rational number, there are integers m and n such that q = n and n 6= 0. Now
if y = xq then y = xm/n and taking the nth power of both sides gives
y n = xm .
Differentiating both sides with respect to x yields
dy
ny n−1 = mxm−1 .
dx
Hence
dy mxm−1
=
dx ny n−1
m xm−1
=
n xq(n−1)
= qx(m−1)−qn+q
= qxq−1
as required.
or equivalently, that
lim f (a + h) = f (a).
h→0
Now
f (a + h) − f (a)
lim f (a + h) − f (a) = lim ×h
h→0 h→0 h
f (a + h) − f (a)
= lim × lim h (4.10)
h→0 h h→0
′
= f (a) × 0
= 0.
(Note that we can separate the limits in (4.10) because we know that the limit of the difference
quotient exists.) Hence
lim f (a + h) = f (a),
h→0
proving the theorem.
c
2020 School of Mathematics and Statistics, UNSW Sydney
4.5. DIFFERENTIATION, CONTINUITY AND SPLIT FUNCTIONS 75
The next function studied is an example of a ‘split function’. Whether or not the function is
differentiable at the ‘split point’ can be determined by calculating left- and right-hand limits of the
difference quotient.
Example 4.5.4. Suppose that f : R → R is defined by
(
sin x if x ≥ 0
f (x) = 2
x + bx + c if x < 0,
where b and c are real numbers. Find all possible values of b and c such that f is (a) continuous at
0 and (b) differentiable at 0.
Solution. (a) For f to be continuous at 0, we require that
lim f (x) = f (0) = lim f (x).
x→0− x→0+
c
2020 School of Mathematics and Statistics, UNSW Sydney
76 CHAPTER 4. DIFFERENTIABLE FUNCTIONS
f (x)
| | |
−1 2π x
Example 4.5.5.
(
(x − 1)2 − 1 if − 2 ≤ x < 0
f (x) =
1 − (x − 1)2 if 0 ≤ x ≤ 2.
In this example, each of the functions p(x) = (x − 1)2 − 1 and q(x) = 1 − (x − 1)2 have had
their domains restricted and then ‘glued’ together at the point 0.
If the constituent functions are continuous and differentiable in some interval containing the
point a, then the following theorem tells us when the corresponding split function is differentiable.
Theorem 4.5.6. Suppose a is a fixed real number and the function f is defined by
(
p(x) if x ≥ a
f (x) =
q(x) if x < a,
where p(x) and q(x) are continuous and differentiable in some open interval containing a. Then if
f is continuous at a and p′ (a) = q ′ (a), then f is differentiable at x = a.
This theorem in particular can be applied to splines since the constituent functions are polyno-
mials.
Example 4.5.7.
(
sin x if x ≥ 0
f (x) =
x2 + x if x < 0.
The functions sin x and x2 + x are continuous and differentiable everywhere. We saw in Example
4.5.4 that f is continuous at x = 0. Also, with p(x) = sin x and q(x) = x2 + x, we have p′ (0) =
cos 0 = 1 and q ′ (0) = 1 so f is differentiable at x = 0.
c
2020 School of Mathematics and Statistics, UNSW Sydney
4.6. DERIVATIVES AND FUNCTION APPROXIMATION 77
f (a) b
|
a x
The equation of the tangent to f at a may be calculated using the point-gradient formula:
y − f (a) = f ′ (a)(x − a)
y = f ′ (a)(x − a) + f (a).
Observe in the above diagram that the tangent line is close to the graph of f for all points that are
close to a. In other words,
f (x) ≈ f ′ (a)(x − a) + f (a) (4.11)
when x is close to a.
Formula (4.11) reflects the fact that every differentiable function can be locally approximated
by a linear function. We use this principle in the following example.
√
Example 4.6.1. Estimate 9.001 without using a calculator.
√
Solution. Suppose that f (x) = x. Since 9.001 is close to 9 and f is a continuous function, we
could use the approximation √
9.001 = f (9.001) ≈ f (9) = 3. (4.12)
However, we can do better than this. We shall approximate f with a linear function at the
point 9 using (4.11). Since f ′ (9) = 1/6 we have
f (x) ≈ (x − 9)/6 + 3
whenever x is close to 9. When x = 9.001 this gives
√
9.001 = f (9.001) ≈ (9.001 − 9)/6 + 3 = 3 + 1/6000. (4.13)
The table below gives the error involved in each approximation.
Approximation Error
Approximation (4.12) 1.667−4
Approximation (4.13) 4.637−9
Clearly (4.13) is the superior approximation.
c
2020 School of Mathematics and Statistics, UNSW Sydney
78 CHAPTER 4. DIFFERENTIABLE FUNCTIONS
Example 4.7.1. A spherical balloon is being inflated and its radius is increasing at a constant
rate of 6 mm/sec. At what rate is its volume increasing when the radius of the balloon is 20 mm?
Solution. Suppose that V (t) is the volume of the balloon and r(t) is its radius at time t. We are
told that dr dV
dt = 6 and we need to find dt when r = 20. By the chain rule,
dV dV dr
= .
dt dr dt
Now V = 43 πr 3 so dV
dr = 4πr 2 . Hence
dV
= 4πr 2 × 6.
dt
When r = 20,
dV
= 24π(20)2 = 9600π.
dt
So the volume is increasing at a rate of 9600π mm3 /sec when the radius is 20 mm.
2. Write down what is known in terms of these variables and their derivatives.
3. Write down what you need to find in terms of the these variables and their derivatives.
4. Write down anything else you know that relates the variables (for example, a volume or
area formula).
5. Use the chain rule (or implicit differentiation) to find the relevant derivative.
Example 4.7.2. A point P moves to the right along the positive x-axis at a constant rate of 5
cm per second, and a point Q moves up the positive y-axis at a constant rate of 10 cm per second.
How fast is the distance between them changing when OP = 30 cm and OQ = 40 cm?
c
2020 School of Mathematics and Statistics, UNSW Sydney
4.8. LOCAL MAXIMUM, LOCAL MINIMUM AND STATIONARY POINTS 79
| | | x
c−h c c+h
Definition 4.8.1. Suppose that f is defined on some interval I. We say that a point
c in I is a local mininum point if there is a positive number h such that f (c) ≤ f (x)
whenever x ∈ (c − h, c + h) and x ∈ I. We say that a point d in (a, b) is a local
maximum point if there is a positive number h such that f (x) ≤ f (d) whenever
x ∈ (d − h, d + h) and x ∈ I.
Theorem 4.8.2. Suppose that f is defined on (a, b) and has a local maximum or minimum point
at c for some c in (a, b). If f is differentiable at c then f ′ (c) = 0.
Proof. Suppose that f is a local maximum at c. To show that f ′ (c) = 0 we consider the difference
quotient
f (c + h) − f (c)
.
h
Since f is a maximum at c, f (c + h) ≤ f (c) for h sufficiently close to 0. So if h is sufficiently small
and positive,
f (c + h) − f (c)
≤ 0,
h
while if h is sufficiently small and negative,
f (c + h) − f (c)
≥ 0.
h
c
2020 School of Mathematics and Statistics, UNSW Sydney
80 CHAPTER 4. DIFFERENTIABLE FUNCTIONS
That is,
f (c + h) − f (c)
lim ≤0
h→0+ h
while
f (c + h) − f (c)
lim ≥ 0.
h→0− h
Hence
f (c + h) − f (c)
lim =0
h→0 h
which is equivalent to saying that f ′ (c) = 0.
The proof when c is a local minimum is similar and will be omitted.
The set of points where the derivative of a function is zero is important. In particular, tangents
to the graph of the function at these points are horizontal lines. This motivates the following
definition.
Example 4.8.4. Suppose that f : R → R is given by f (x) = 4x5 − 5x4 − 40x3 − 2. Find all the
stationary points of f .
Hence the solutions to f ′ (x) = 0 are x = −2, 0, 3. So the points −2, 0 and 3 are the stationary
points of f .
f (x)
200
|
| |
−2 3 x
−200
|
c
2020 School of Mathematics and Statistics, UNSW Sydney
4.9. MAPLE NOTES 81
It is clear that −2 is a local maximum point of f and that 3 is a local minimum point of f . However,
0 is neither a local minimum point nor local maximum point, and none of these points are global
maximum points or global minimum points. So while the local maxima and minima of a function
occur where f ′ is zero, it does not follow that any point c satisfying f ′ (c) = 0 is a local maximum
point or local minimum point of f .
Tools for identifying whether a stationary point is a local maximum, local minimum or neither
are developed in the next chapter using an important result called the mean value theorem.
3 x2 e5 x + 5 x3 e5 x
> diff(%,x);
6 xe5 x + 30 x2 e5 x + 25 x3 e5 x
> f2ndDeriv:=diff(x^3*exp(5*x),x$2);
f2ndDeriv := 6 xe5 x + 30 x2 e5 x + 25 x3 e5 x
c
2020 School of Mathematics and Statistics, UNSW Sydney
82 CHAPTER 4. DIFFERENTIABLE FUNCTIONS
f (0 + h) − f (0)
a) If it exists, evaluate lim .
h→0+ h
f (0 + h) − f (0)
b) If it exists, evaluate lim .
h→0 − h
c) State the value of f ′ (0) or explain why f is not differentiable at 0.
4. [R] [V] Determine at which points each function f is (i) differentiable; (ii) continuous.
(
sin x if x ≤ 0 x3 − 6x + 4
a) f (x) = |x| b) f (x) = c) f (x) = 2
x if x > 0 x + 4x + 4
5. [R] Sketch the graph of f , where f (x) = x1/3 . Is f differentiable at 0? Give reasons.
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 4 83
8. [R] (An exercise on notation.) Suppose that f (x) = x + cos 2x. Write down
a) f (x + 17π) b) f ′ (x + nπ) c) f (2 − x2 )
d
d) f ′ (2 − x2 ) e) dx f (2 − x2 ).
dy
10. [R] Find for the curve x4 + y 4 = 16. Sketch the graph of the curve.
dx
11. [R] [V] Find the equation of the line tangent to the curve x3 + y 3 = 3(x + y) at the point
(1, 2).
12. [R] Suppose that a and b are real numbers. Find all values of a and b (if any) such that
the functions f and g, given by
( (
ax + b if x < 0 ax + b if x < 0
a) f (x) = and b) g(x) = ,
sin x if x ≥ 0 e2x if x ≥ 0
are (i) continuous at 0 and (ii) differentiable at 0.
where a and b are real numbers. Find all values of a and b (if any) such that f is differ-
entiable at 0.
c
2020 School of Mathematics and Statistics, UNSW Sydney
84 CHAPTER 4. DIFFERENTIABLE FUNCTIONS
15. [R] At a certain instant the side length of an equilateral triangle is a cm and this length
is increasing at r cm/s. How fast, in cm2 /s, is the area increasing?
16. [R] [V] A 5 m ladder is leaning against a vertical wall. Suppose that the bottom of the
ladder is being pulled away from the wall at a rate of 1 m/s. How fast is the area of the
triangle underneath the ladder changing at the instant that the top of the ladder is 4 m
from the floor?
17. [R] A spherical balloon is to be filled with water so that there is a constant increase in
the rate of its surface area of 3 cm2 /s.
(The surface area A and volume V of a sphere of radius r is given by A = 4πr 2 and
V = 43 πr 3 .)
a) Find the rate of increase in the radius when the radius is 3 cm.
b) Find the volume when the volume is increasing at a rate of 10 cm3 /s.
18. [R]
θ = tan−1 ( 12 )
θ
Water is poured in at the rate of 10 mm3 /s. Find the rate at which the depth, h mm,
is increasing when the depth of water in the cone is 50 mm.
b) [H] The cone is filled to a depth of 100 mm and pouring is then stopped. A hole is
then opened
√ at the vertex of the cone and water flows out of the hole at the rate
of 50π h mm3 per second, where h is the depth at time t. Show that it takes 200
seconds to empty the cone.
c
2020 School of Mathematics and Statistics, UNSW Sydney
85
Chapter 5
The mean value theorem is one of the most important results for establishing the theoretical
structure of calculus. Several results that you are familiar with from high school calculus are based
on the mean value theorem. Applications of the mean value theorem include
• identifying where a function is increasing or decreasing,
• identifying different types of stationary points,
• determining how many zeros a polynomial has,
∞
• evaluating limits which are indeterminate forms of type ∞ and 00 ,
• proving useful inequalities and
• estimating errors in approximations.
We begin the chapter by introducing the theorem.
c
2020 School of Mathematics and Statistics, UNSW Sydney
86 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
f (x)
C
f (b) B
|
f (a) A
|
| | |
a c b x
Theorem 5.1.1 (The mean value theorem). Suppose that f is continuous on [a, b] and differentiable
on (a, b). Then there is at least one real number c in (a, b) such that
f (b) − f (a)
= f ′ (c).
b−a
The proof of the theorem will be presented in the next section. For now we illustrate the
theorem with a simple example.
√
Example 5.1.2. Suppose that f : [1, 8] → R is given by f (x) = 3 x. Find a number c in (1, 8)
that satisfies the conclusions of the mean value theorem for f on [1, 8].
Solution. Note that f is continuous on [1, 8] and differentiable on (1, 8). By the mean value theorem
there is a real number c in (1, 8) such that
f (8) − f (1)
= f ′ (c).
8−1
2−1 1
= √ .
7 3 2
3 c
Rearranging to find c gives
√
3 7
c2 =
3
r
343
c= .
27
Since c ≈ 3.56 it is clear that c ∈ (1, 8). This is illustrated graphically below.
c
2020 School of Mathematics and Statistics, UNSW Sydney
5.2. PROOF OF THE MEAN VALUE THEOREM 87
√
3
x
2 b
|
b
1 b
|
| | |
1 c 8 x
Theorem 5.2.1. Suppose that g is continuous on [a, b] and differentiable on (a, b). Suppose also
that g(a) = g(b) = 0. Then there is a real number c in (a, b) such that g ′ (c) = 0.
c
2020 School of Mathematics and Statistics, UNSW Sydney
88 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
g(x)
A B
a c b x
From here it is not hard to see how to prove the mean value theorem. Consider the graph of
the function f in Figure 5.1 and ‘subtract’ the chord AB. The result will look something like the
graph of g in Figure 5.2, to which we can apply Rolle’s theorem and obtain a horizontal tangent
at C. We then ‘add’ the chord back again to obtain the tangent to f of Figure 5.1. This rough
geometric argument is made rigorous below.
Proof of the mean value theorem. Suppose that f is continuous on [a, b] and differentiable on (a, b).
We consider the function g given by
f (b) − f (a)
g(x) = f (x) − (x − a) + f (a) .
b−a
(The part in square brackets is taken from the equation of the chord AB in Figure 5.1.) One can
check that g is continuous on [a, b], differentiable on (a, b) and that g(a) = g(b) = 0. By Rolle’s
theorem there is a c in (a, b) such that g ′ (c) = 0; that is, such that
′ f (b) − f (a)
f (c) − = 0.
b−a
If we rearrange this equation then we obtain
f (b) − f (a)
f ′ (c) =
b−a
as required.
c
2020 School of Mathematics and Statistics, UNSW Sydney
5.3. PROVING INEQUALITIES USING THE MEAN VALUE THEOREM 89
c
2020 School of Mathematics and Statistics, UNSW Sydney
90 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
Example 5.4.1.
√ Use√the mean value theorem to find an upper bound for the error involved if we
approximate 27 by 25.
Example 5.4.2. Use the mean value theorem to find an upper bound for the error involved if we
approximate sin 5π 2π
7 by sin 3 .
c
2020 School of Mathematics and Statistics, UNSW Sydney
5.5. THE SIGN OF A DERIVATIVE 91
If we apply the mean value theorem to the sine function on the interval [2π/3, 5π/7] then
sin 5π 2π
7 − sin 3
5π 2π = cos c
7 − 3
for some c in (2π/3, 5π/7). Taking absolute values of both sides and rearranging gives
sin 5π − sin 2π = 5π − 2π | cos c |
7 3 7 3
where 2π/3 < c < 5π/7. Now | cos c | < | cos 5π 7 | (because the absolute value of the cosine is
increasing over the interval (π/2, π)) but we don’t have an exact value for cos 5π 7 . A simple way
out of this situation is to just use the fact that | cos c | < 1 for all c in (2π/3, 5π/7). Hence we have
shown that
15π 2π 5π 2π
absolute error = sin − sin < − × 1.
21 3 7 3
This proves that
π
absolute error < < 0.15.
21
If we aren’t so lazy with our estimates we can quickly spot that
cos 5π = cos 15π < cos 15π = √1 .
7 21 20 2
This allows us to prove the more accurate statement that
π
absolute error < √ < 0.11.
21 2
c
2020 School of Mathematics and Statistics, UNSW Sydney
92 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
Example 5.5.2. The function f : R → R given by f (x) = x2 is increasing on [0, ∞) and decreasing
on (−∞, 0].
f (x)
decreasing increasing
x
The following theorem should be familiar to you. It is proved using the mean value theorem.
Theorem 5.5.3. Suppose that f is continuous on [a, b] and differentiable on (a, b).
(i) If f ′ (x) > 0 for all x in (a, b) then f is increasing on [a, b].
(ii) If f ′ (x) < 0 for all x in (a, b) then f is decreasing on [a, b].
Theorem 5.5.3 has several important applications. The first of these is classifying different
types of stationary points. The basic idea is as follows. Suppose that f is differentiable on the
open interval (1, 3) and that for some small positive number h we have the following:
• f ′ (x) > 0 for all x in (2 − h, 2),
• f ′ (2) = 0, and
c
2020 School of Mathematics and Statistics, UNSW Sydney
5.6. THE SECOND DERIVATIVE AND APPLICATIONS 93
−5− −5 −5+ 1− 1 1+ 4− 4 4+
x−4 − − − − − − − 0 +
x−1 − − − − 0 + + + +
(x + 5)2 + 0 + + + + + + +
f ′ (x) + 0 + + 0 − − 0 +
Gradient − − −
For example, the notation 4− is shorthand for those points x lying in a small interval to the
immediate left of 4. For such a point x, the factor x − 4 is negative, while the factors x − 1 and
(x + 5)2 are positive. Hence f ′ (x) is negative when x is slightly to the left of 4. Other columns of
the table are filled by a similar process.
In conclusion, 4 is a local minimum point, 1 is a local maximum point and −5 is neither a
maximum nor minimum point. (In fact, −5 is called a horizontal point of inflexion since the
tangent to f at −5 is horizontal and the concavity of the function changes about this point. See
any standard undergraduate calculus text or any advanced high school calculus text for further
details.)
Example 5.6.1. Find the second derivative of the function f : R → R, given by the rule
dy
Remark 5.6.2. If y = f (x) then the first derivative of f is often written as dx or y ′ while the
d2 y
second derivative of f is often written as dx2 or y ′′ .
Remark 5.6.3. Even if a function is differentiable at a point a, the function may not have a second
derivative at a. (Consider, for example, the function f : R → R given by
f (x) = x4/3 ,
at the point 0.) A function which has a second derivative at a is called twice differentiable at a.
If f is twice differentiable, the sign of f ′′ can often be used to determine whether a stationary
point is a local maximum point or a local minimum point.
c
2020 School of Mathematics and Statistics, UNSW Sydney
94 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
Theorem 5.6.4 (The second derivative test). Suppose that a function f is twice differentiable on
(a, b) and that c ∈ (a, b).
(i) If g ′ (c) > 0 then g(c − h) < g(c) < g(c + h) for all positive h sufficiently small.
(ii) If g ′ (c) < 0 then g(c + h) < g(c) < g(c − h) for all positive h sufficiently small.
Sketch proof. We give an intuitive argument for statement (i) and leave (ii) as an exercise. Suppose
that g ′ (c) > 0. By the definition of the derivative,
g(c + h) − g(c)
lim = g′ (c). (5.3)
h→0 h
That is,
g(c + h) − g(c)
≈ g′ (c)
h
when h is close to 0. Now, if h > 0 then
Hence
g(c + h) − g(c) > 0 (5.4)
g(c − h) − g(c)
lim = g′ (c)
h→0 −h
(to see this, simply replace h with −h in (5.3)). Now, if h > 0 and h is close to 0 then
Hence
g(c) − g(c − h) > 0 (5.5)
c
2020 School of Mathematics and Statistics, UNSW Sydney
5.6. THE SECOND DERIVATIVE AND APPLICATIONS 95
Proof of Theorem 5.6.4. We prove statement (i) and leave the proof of (ii) as an exercise. Suppose
that f ′ (c) = 0, f ′′ (c) > 0 and that g = f ′ . Then g ′ (c) > 0 and
for all positive h sufficiently small, by Lemma 5.6.5. Since f ′ (c) = g(c) = 0 we conclude that
f ′ (c − h) < 0 < f ′ (c + h)
for all positive h sufficiently small. So f ′ is positive on a small interval immediately to the right
of c, zero at x and negative on a small interval immediately to the left of c. By Theorem 5.5.3 we
now conclude that f has a local minimum at the point c.
Example 5.6.6. Find and classify the stationary points of the function f : R → R given by
f (x) = x3 − 6x2 + 9x − 5.
Hence f has stationary points at 1 and 3. Now f ′′ (1) = −6 < 0, so f has a local maximum point
at 1. For second stationary point, f ′′ (3) = 6 > 0, so f has a local minimum stationary point at 3.
(To illustrate these conclusions, we have included the graph of f , shown below.)
f (x)
1 3
| |
x
−5
Remark 5.6.7. Suppose that f ′ (c) = 0 and that f ′′ (c) = 0. Then we cannot apply the second
derivative test. In fact, the point c may be a local maximum, a local minimum or neither of these.
The examples below illustrate various possibilities.
x4 −x4 x3
x x x
c
2020 School of Mathematics and Statistics, UNSW Sydney
96 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
• If f (x) = x3 then f ′ (0) = f ′′ (0) = 0 and there is neither a a local maximum and minimum
at 0 (in fact, we have a horizontal point of inflexion at 0).
Hence if f ′ (c) = f ′′ (c) = 0 then it is best to classify the stationary point c by examining the sign
of the derivative on either side of c, as illustrated in Section 5.5.
• f is not differentiable at c.
The set of points satisfying these properties is useful so we give it a special name.
Definition 5.7.1. Suppose that f is defined on [a, b]. We say that a point c in [a, b]
is a critical point for f on [a, b] if c satisfies one of the following properties:
Theorem 5.7.2. Suppose that f is continuous on [a, b]. Then f has a global maximum and global
minimum on [a, b]. Moreover, the global maximum point and the global minimum point are both
critical points for f on [a, b].
The proof of the theorem follows the same reasoning given at the beginning of this section. The
theorem provides a systematic way of finding global minima and maxima of functions.
c
2020 School of Mathematics and Statistics, UNSW Sydney
5.7. CRITICAL POINTS, MAXIMA AND MINIMA 97
f (x) f (x)
b
3 b b b
4 b
b | | b |
1 2 x 3 4 5 x
2
(a) (b)
Clearly f is not differentiable at 1. Morover, its derivative is never zero. So the only critical
points of f on [0, 2] are 1 and the endpoints 0 and 2. By Theorem 5.7.2, the global maximum and
minimum points must be among the points 0, 1 and 2. But
Hence the global minimum value of f on [0, 2] is 0 while the global maximum value of f on [0, 2] is
3. (See Figure 5.3 (a) for a sketch of f that shows its critical points.)
Find the absolute maximum and absolute minimum values of f on the interval [0, 5].
{(x, y) ∈ R2 : y = x2 − x − 1, −2 ≤ x ≤ 3}
(see Figure 5.4 (a)). Find where the arc is (i) closest to and (ii) furthest from the origin.
Solution. If (x, y) is a point on the arc then its distance from the origin is given by
p
x2 + y 2 .
c
2020 School of Mathematics and Statistics, UNSW Sydney
98 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
y
g(x)
34 b
|
b
| |
−2 3 x b
b
| |
−2 3 x
(a) (b)
p
To find where the arc is closest to the origin, we want to minimise x2 + y 2 subject to the con-
straints y = x2 − x − 1 and −2 ≤ x ≤ 3. In other words, we want to find a global minimum for the
function p
f (x) = x2 + (x2 − x − 1)2
on the interval [−2, 3]. Because calculations involving square roots can get messy, we instead
consider the function g, given by
on [−2, 3]. (Note that g(x) may be interpreted as the square of the distance between the origin
and a point on the arc with coordinate (x, x2 − x − 1). The global minimum of both f and g will
occur at the same point x.)
We seek the critical points of g, so we compute its derivative:
Hence the critical points of g on [−2, 3] are −2, −1/2, 1 and 3. Now
(see Figure 5.4 (b)). Therefore the arc is closest to the origin when x = −1/2 and furthest from
the origin when x = 3.
c
2020 School of Mathematics and Statistics, UNSW Sydney
5.8. COUNTING ZEROS 99
Example 5.8.1. Determine how many real numbers satisfy the equation
Solution. Suppose that f (x) = x3 − 6x2 − 15x + 8 for all real x. Ideally, we would look for one
root of f , factor this out and solve a quadratic equation to determine the remaining roots (if any).
However, finding one root by inspection is not straightforward, so we need another approach.
Note that f is differentiable (and hence continuous) everywhere. We begin by identifying
intervals where f is monotonically decreasing and intervals where f is monotonically increasing.
The derivative of f is given by
So the stationary points of f are 5 and −1. Morover, the table below shows that f is increasing on
the intervals (−∞, −1] and [5, ∞), while it is decreasing on [−1, 5].
f (x)
(−1, 16)
b
(5, −92)
We now argue that f has exactly one root on each of the intervals mentioned above.
• On the interval [−1, 5], the function f is decreasing. Hence f cannot have more than one
root on [−1, 5]. On the other hand, since f (−1) > 0 and f (5) < 0 there is at least one c
in [−1, 5] such that f (c) = 0 by the intermediate value theorem. Hence f has exactly one
root on [−1, 5].
• On the interval (−∞, −1], the function f is increasing. Hence f cannot have more than
one root on (−∞, −1]. On the other hand, since f (−1) > 0 and f (−5) < 0 there is at
least one c in [−5, −1] such that f (c) = 0 by the intermediate value theorem. Hence f has
exactly one root on (−∞, −1].
c
2020 School of Mathematics and Statistics, UNSW Sydney
100 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
• On the interval [5, ∞), the function f is increasing. Hence f cannot have more than one
root on [5, ∞). On the other hand, since f (5) < 0 and f (10) > 0 there is at least one c
in [5, 10] such that f (c) = 0 by the intermediate value theorem. Hence f has exactly one
root on [5, ∞).
In conclusion, equation (5.6) has exactly three real solutions, one in each of the intervals [−5, −1],
[−1, 5] and [5, 10].
Guided by our solution to Example 5.8.1, we now outline a general approach for solving problems
of this type.
Suppose that f is continuous and differentiable everywhere and that its derivative is also con-
tinuous. To determine how many real solutions the equation f (x) = 0 has, one may follow the
procedure below.
2. Determine the intervals where the derivative is positive and the intervals where it is neg-
ative.
3. By step 2 we know the intervals where the function f is monotonically increasing, and the
intervals where f is monotonically decreasing.
4. Evaluate the function f at the endpoints of each interval. If f changes sign on the interval,
there is exactly one root on that interval. If it does not change sign, there are no roots
on that interval. (This step must be modified slightly for intervals of the form (−∞, a] or
[b, ∞).)
Example 5.8.2. Determine how many real numbers satisfy the equation
By differentiating f we obtain
The stationary points of f are 1 and 2. By studying the sign of f ′ , it is clear that f is increasing
on the intervals (−∞, 1) and (2, ∞) while it is decreasing on (1, 2). Now
c
2020 School of Mathematics and Statistics, UNSW Sydney
5.9. ANTIDERIVATIVES 101
f (x)
(1, 4)
b
b
(2, 3)
x
So it is clear that f has no zeros on [1, 2] or on [2, ∞). On (−∞, 1), the function f has no more
than one zero (since it is increasing on this interval). Moreover, f changes sign on the interval [0, 1]
(since f (0) = −1 < 0 and f (1) > 0). Hence f has exactly one root on [0, 1] by the intermediate
value theorem.
In conclusion, equation (5.7) has exactly one real solution and this solution lies somewhere in
the interval [0, 1].
5.9 Antiderivatives
We open with an example.
Example 5.9.1. While filming a movie, a stunt man jumps out of a stationary helicopter 900
meters above the ground. The scriptwriter wants the man to make a 20 second mobile phone call
before opening the parachute. Physical considerations (gravity and air resistance) suggest that the
man’s velocity f (t) (in metres per second) without an open parachute is given by
f (t) = 50(1 − e−t/5 ),
where t is the number of seconds after jumping from the helicopter. How far will the man fall 20
seconds?
Solution. Write F (t) for the distance fallen (in metres) after t seconds. We note that F (0) = 0.
Since velocity is the rate of change of displacement, F satisfies the equation
F ′ (t) = f (t).
(Any equation like this involving differentiation is called a differential equation). The differential
equation may be rewritten as
dF
= 50 − 50e−t/5 . (5.8)
dt
One can check by differentiation that if
F (t) = 50t + 250e−t/5 + C, (5.9)
where C is a real constant, then F satisfies (5.8). Let’s assume (for the moment) that all possible
solutions to the differential equation (5.8) are of the form (5.9). Imposing the condition that
F (0) = 0 allows us to evaluate the unknown constant C:
0 = F (0) = 50 × 0 + 250e0 + C,
c
2020 School of Mathematics and Statistics, UNSW Sydney
102 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
whence C = −250. Therefore F (t) = 50t + 250e−t/5 − 250. To complete the solution,
Hence the stunt man will fall about 755 metres in the first 20 seconds.
To be certain of this solution, we need to show that every function F satisfying the differential
equation (5.8) is of the form (5.9). This can be done using the mean value theorem. We begin by
introducing the notion of an antiderivative.
f (t) = 50 − 50e−t/5 ∀t ∈ R
xn+1 xn+1
F (x) = and G(x) = +5
n+1 n+1
then F and G are both antiderivatives of f on the interval (−∞, ∞). So f has more than
one antiderivative on (−∞, ∞). In fact, if C is any real number and
then H is an antiderivative of f on (−∞, ∞). Thus the function f has infinitely many
antiderivatives on (−∞, ∞).
Example 5.9.3 (b) illustrates the general principle that if F is an antiderivative of f on I then
F +C
is also an antiderivative of f on I, for any real constant C. The next theorem says that all
antiderivatives are of this form.
Theorem 5.9.4. Suppose that f is a continuous function on an open interval I and that F and G
are two antiderivatives of f on I. Then there is a real constant C such that
G(x) = F (x) + C
for all x in I.
c
2020 School of Mathematics and Statistics, UNSW Sydney
5.10. L’HÔPITAL’S RULE 103
Proof. Suppose that F and G are two antiderivatives of f on I and let H denote the function given
by
H(x) = G(x) − F (x)
for all x in I. Then H is differentiable on I and
for all x in I. Hence there is a constant C such that H(x) = C for all x in I (by Theorem 5.5.3
(c)). Therefore
G(x) = F (x) + H(x) = F (x) + C
for all x in I.
Theorem 5.9.4 justifies the assumption made in the solution to Example 5.9.1. It also enables
us to write down all possible antiderivatives of some well-known functions. In the table below, C
is any real constant.
Function Antiderivative
1
xr , where r is rational and r 6= −1 r+1 x
r+1 +C
sin x − cos x + C
cos x sin x + C
1 ax
eax ae +C
f ′ (x)
ln |f (x)| + C
f (x)
Since both the numerator and denominator approach 0 as x → 0+ , the limit is an example of an
indeterminate form of type 00 . None of the limit rules of Chapter 2 can be easily applied to evaluate
the limit. However, the mean value theorem gives another rule which helps in this situation.
Suppose that f (x) = ex − 1 and g(x) = 5x + x2 . Instead of considering the quotient f (x)/g(x)
(whose limit is difficult to calculate), we consider the quotient f ′ (x)/g ′ (x) of derivatives (whose
limit is easier to calculate):
f ′ (x) ex 1
lim ′ = lim = .
x→0 g (x) x→0 5 + 2x 5
c
2020 School of Mathematics and Statistics, UNSW Sydney
104 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
Since this second limit exists, l’Hôpital’s rule (which is stated below) implies that
f (x) f ′ (x)
lim = lim ′ .
x→0 g(x) x→0 g (x)
Hence
ex − 1 1
lim = .
x→0 5x + x2 5
Theorem 5.10.1 (l’Hôpital’s rule). Suppose that f and g are both differentiable functions and a
is a real number. Suppose also that either one of the two following conditions hold:
• f (x) → 0 and g(x) → 0 as x → a;
exists then
f (x) f ′ (x)
lim = lim ′ .
x→a g(x) x→a g (x)
• limits as x → ∞ or x → −∞;
Solution. If f (x) = ln x and g(x) = 1 − x then both f (x) and g(x) approach 0 as x → 1. Moreover,
f and g are both differentiable. So we look at the quotient of the derivatives:
f ′ (x) 1/x 1
′
= = − → −1
g (x) −1 x
as x → 1. We conclude by l’Hôpital’s rule that
f (x)
lim = −1.
x→1 g(x)
ln x 1/x
lim = lim (5.10)
x→1 1 − x x→1 −1
1
= lim −
x→1 x
= −1
provided that we check that our application of l’Hôpital’s rule in (5.10) is valid.)
c
2020 School of Mathematics and Statistics, UNSW Sydney
5.10. L’HÔPITAL’S RULE 105
Sometimes l’Hôpital’s rule must be applied more than once to calculate a limit. The next
example illustrates this.
x2
Example 5.10.4. Determine the limiting behaviour of as x → ∞.
e2x
Solution. Observe that both the numerator and denominator approach ∞ as x → ∞. Differenti-
ating the numerator and denominator gives
2x
.
2e2x
∞
Again, as x → ∞ we have an indeterminate form of type ∞ . If we differentiate the numerator and
denominator once more we have
2
4e2x
and this limit can be evaluated. Piecing this together, we have
x2 2x 2
lim 2x
= lim 2x
= lim =0
x→∞ e x→∞ 2e x→∞ 4e2x
by l’Hôpital’s rule.
As the next example shows, l’Hôpital’s rule can be used to solve limits that are indefinite forms
of type 0 × ∞.
Example 5.10.5. Determine the limiting behaviour of x ln x as x → 0+ .
The next example warns against the improper application of l’Hôpital’s rule.
2x − cos x
Example 5.10.6. Determine the limiting behaviour of as x → ∞.
3x + cos x
We shall now sketch the proof of l’Hôpital’s rule for the case when x → a+ . Other cases are done
similarly. (A complete proof uses a generalisation of the mean value theorem known as Cauchy’s
mean value theorem.)
Sketch proof of Theorem 5.10.1. Suppose that f and g are differentiable (and hence continuous)
everywhere, that f (x) → 0 and g(x) → 0 as x → a+ , and that
f ′ (x)
lim
x→a+ g ′ (x)
exists. We need to show that
f (x) f ′ (x)
lim = lim ′ . (5.11)
x→a+ g(x) x→a+ g (x)
We begin by applying the mean value theorem to both functions f and g on the interval [a, x].
So there are real numbers c and d in (a, x) such that
f (x) − f (a) g(x) − g(a)
= f ′ (c) and = g ′ (d). (5.12)
x−a x−a
Since f is continuous and f (x) → 0 as x → a+ , we have f (a) = 0. Similarly g(a) = 0. So by (5.12)
we have
f (x) = (x − a)f ′ (c) and g(x) = (x − a)g′ (d)
c
2020 School of Mathematics and Statistics, UNSW Sydney
106 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 5 107
2. [R] Suppose that f (x) = 1/x. Show that there is no real number c in [−1, 2] such that
f (2) − f (−1)
f ′ (c) = .
2 − (−1)
Why does this not contradict the mean value theorem?
3. [R] Consider the function f given by f (x) = (x − 2)4 cos(x2 − 4x + 4). Use the mean value
theorem to show that f ′ has a zero on the interval [1, 3].
5. [R]
a) Use the mean value theorem to show that sin t ≤ t whenever t > 0.
b) Hence show that sin t < t whenever t > 0.
1
c) Using the pinching theorem and part (a), evaluate the limit lim sin .
x→∞ x
7. [R] [V] Use the mean value theorem to find an upper bound for the error involved if we
approximate
√ √
a) 17 by 16 = 4;
1998 2
b) by 22 = 4;
1000
1 1
c) by .
1002 1000
c
2020 School of Mathematics and Statistics, UNSW Sydney
108 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
8. [R] The derivative of a function f : R → R is given by f ′ (x) = 3(x + 1)(x − 1)2 (x − 4)3 .
Locate all stationary points of f and identify any local maximum or minimum points of
f.
10. [R] Find the maximum and minimum values for each function f over the given interval.
a) f (x) = 3 − x3 over [−2, 4] b) f (x) = 3 − x4 over [−2, 4]
c) f (x) = x3 − x4 over [−5, 5] d) f (x) = 2x(x + 4)3 over [−2, 1]
e) f (x) = |x2 − 3x + 2| over [0, 3]
11. [R] Find the point on the straight line 2x + 3y = 6 which is closest to the origin.
x2 x3
12. a) i) [R] Show that the polynomial p3 , where p3 (x) = 1 + x + + , has at least
2! 3!
one real root.
x2
ii) [H] Show that the polynomial p2 , where p2 (x) = 1 + x + , has no real roots
2!
and deduce that p3 has exactly one real root.
x2 x3 x4
iii) [X] Deduce that p4 (x) = 1 + x + + + > 0 for all real numbers x.
2! 3! 4!
n
X xk
b) [X] Suppose that pn (x) = whenever n = 1, 2, 3, . . . . Use induction to prove
k!
k=0
that
i) if n is even then pn (x) > 0 for all real numbers x, and
ii) if n is odd then pn (x) has exactly one real root and this root is negative.
13. [R] A wire of length 100 cm is cut into two pieces of length x cm and y cm. The piece of
length x cm is bent into the shape of a square and the piece of length y cm into the shape
of a circle. Find x and y so that the sum of the areas enclosed by the shapes will be
a) a minimum b) a maximum.
14. [X] Suppose that a ≥ 0. Find the greatest and least distances from the point (a, 0) to
the ellipse
x2 y 2
+ = 1.
4 1
(Have a precise answer before comparing with the given answer.)
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 5 109
15. [X] Find all the values of a and x, both in [0, 2π], where
17. [R] [V] Suppose that p(x) = x3 − 12x2 + 45x − 51 whenever x ∈ R. How many real zeros
does p have?
18. [R]
b) Are there any other functions with these properties? Explain your answer.
19. [R] [V] A particle moving along the x-axis has velocity 2t − t2 units per second after t
seconds. Find
c
2020 School of Mathematics and Statistics, UNSW Sydney
110 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
1 1
22. [H] Find the value of lim + .
t→0 ln(1 + t) ln(1 − t)
ax − 1 + ebx
23. [H] Find (a, b) such that lim = 1.
x→0 x2
4x + sin x
24. [R] Explain why l’Hôpital’s Rule cannot be used to find lim . Use another
x→∞ 2x − sin x
method to find this limit.
is differentiable at 0.
26. [R]
√
cos
h−1
a) Evaluate lim .
h→0+ h
b) A function f is defined by
( √
cos x if x ≥ 0
f (x) =
ax + b if x < 0,
where a and b are real numbers. By using the limit calculated in (a), find all possible
values of a and b such that f is differentiable at 0.
where a and b are real numbers. Find all possible values of a and b such that f is
differentiable at 0.
c
2020 School of Mathematics and Statistics, UNSW Sydney
111
Chapter 6
Inverse functions
We return again to the pollution example (see the introductions to Chapter 2). According to our
model, the volume of pollution P (t) in the lake after t days of the factory’s operation is given by
109 5
P (t) = 1 − e−101t/10 .
101
Environmental authorities, after hearing that the eventual amount of pollution in the lake will be
109 /101 litres, determine that this level would be environmentally devastating. Recommendations
are made that the amount of pollution should not exceed 8 × 106 litres. If factory operations do
not change, in how many days will pollution levels in the lake exceed this amount?
Note that the function P expresses pollution as a function of time. One approach to solving
this problem is to find a function T that expresses time as a function of pollution. Such a function
is called an inverse function of P . This chapter is devoted to a study of inverse functions.
So f takes an initial number and squares it, while g takes the square root and recovers the original
number. A way of writing down the fact that g undoes (or reverses) what f does to any positive
number x is:
g(f (x)) = x ∀x ∈ Dom(f ). (6.1)
c
2020 School of Mathematics and Statistics, UNSW Sydney
112 CHAPTER 6. INVERSE FUNCTIONS
output output
In fact, f also undoes what g does to any positive number. This is easily verified by checking that
Because f and g have this special relationship, they are called inverses of each other.
The relationship between any function f and its inverse function g can also be viewed in the
following way. If y = f (x) and we want to express x as a function of y, then we find that x = g(y).
For the functions of Example 6.1.1, this is easily verified by the following calculation:
√
y = f (x) ⇐⇒ y = x2 ⇐⇒ y=x ⇐⇒ x = g(y).
The next example shows that changing the domain of f gives a different inverse function g.
Example 6.1.2. Jack changes the game slightly. He says, ‘A negative number, when squared, is
equal to 36. What was the original number?’ Jill correctly answers that the number was −6.
Again, we model the game with two functions. Jack’s function f is given by
Note that g undoes what f does to any negative number, and vice-versa. That is, f and g satisfy
equations (6.1) and (6.2) and are therefore inverse functions of each other.
Each function f in Example 6.1.1 and Example 6.1.2 had the property that for every output
there is a unique corresponding input (See Figure 6.1 (a) and (b)). However, the function f graphed
in Figure 6.1 (c) does not have a one-to-one correspondence between its outputs and inputs. As
we see in the next example, this function f has no inverse function.
Example 6.1.3. Jack changes the game one more time. This time he says, ‘A real number, when
squared, is equal to 64. What was the original number?’ This time Jill cannot give a definite
answer to his question. The answer is either 8 or −8, but she has no way of telling which it is.
The model for this situation is the following. Jack’s function f is now given by
c
2020 School of Mathematics and Statistics, UNSW Sydney
6.2. ONE-TO-ONE FUNCTIONS 113
but there is no function g that undoes what f does to a real number x. That is, there is no function
g that satisfies the equation (6.1). Therefore f has no inverse function. The reason why this is the
case is clear: the domain of f now includes the positive and negative real numbers, and therefore
some (in fact most) outputs of f have two possible corresponding inputs. That is, there is no
one-to-one correspondence between inputs and outputs of f .
• If f has an inverse function g, then g undoes (or reverses) what f does to elements of
Dom(f ).
• Suppose f and g are inverse functions. Then y = f (x) if and only if x = g(y).
• If a function f does not have an inverse function, then we can sometimes modify the
domain of f so that it does have an inverse function. (For example, if f (x) = x2 then
f : R → [0, ∞) does not have an inverse function while the function f : (0, ∞) → (0, ∞),
with domain restricted to the positive real numbers, does have an inverse function.)
In other words, a function is one-to-one if every output has a unique input. Equivalently, f is
one-to-one if
f (x1 ) 6= f (x2 ) whenever x1 6= x2 ,
provided that both x1 and x2 belong to Dom(f ). One-to-one functions are sometimes called
injective functions.
c
2020 School of Mathematics and Statistics, UNSW Sydney
114 CHAPTER 6. INVERSE FUNCTIONS
2x1 − 3 = 2x2 − 3.
x1 = x2 .
Hence f is one-to-one.
Example 6.2.3. Show that the function f : R → R, given by f (x) = x2 , is not one-to-one.
There are other ways of identifying one-to-one functions. The following method is a simple
geometric test.
Proposition 6.2.4 (The horizontal line test). Suppose that f is a real-valued function defined on
some subset of R. Then the following statements are equivalent:
(a) f is one-to-one;
(b) every horizontal line in the Cartesian plane intersects the graph of f at most once.
A
x x x
• f is not one-to-one because the horizontal line passing through the point A cuts the graph
of f more than once;
• g is one-to-one (in fact, since g is increasing, every horizontal line can cut the graph of g
graph no more than once);
Although not every one-to-one function is increasing (or decreasing), it is true that every in-
creasing function is one-to-one.
c
2020 School of Mathematics and Statistics, UNSW Sydney
6.3. INVERSE FUNCTIONS 115
is one-to-one.
Solution. The derivative of f is given by f ′ (x) = 10x4 + 3x2 + 1. Since f ′ (x) ≥ 1 for all real x, the
function f is increasing and hence one-to-one.
Remark 6.2.8. Not every function whose derivative is only positive (or only negative) on its
domain is one-to-one. For example, on Dom(tan)
d
tan x = sec2 x ≥ 1,
dx
but tan is not one-to-one, nor is it increasing on its domain. The problem here is that the domain
of tan has gaps.
and
f (g(x)) = x ∀x ∈ Range(f ). (6.5)
Moreover,
Range(g) = Dom(f )
and g is one-to-one.
Proof. Suppose that f is one-to-one and let D and R denote the domain and range of f respectively.
Since f is one-to-one, for every y in R there is a unique x in D such that y = f (x).
Now define a function g : R → D by the following rule: if y ∈ R then g(y) is the unique x in D
such that f (x) = y.
It is left as an exercise to show that g is one-to-one and that equations (6.4) and (6.5) hold.
Definition 6.3.2. Suppose that f is a one-to-one function. Then the inverse func-
tion of f is the unique function g given by Theorem 6.3.1. The inverse function for
f is often written as f −1 .
c
2020 School of Mathematics and Statistics, UNSW Sydney
116 CHAPTER 6. INVERSE FUNCTIONS
Remark 6.3.3. If f −1 denotes the inverse function of a one-to-one function f , then equations (6.4)
and (6.5) can be expressed as
f −1 (f (x)) = x ∀x ∈ Dom(f )
and
f (f −1 (x)) = x ∀x ∈ Range(f ).
(Warning: This notation is potentially confusing if not understood correctly. The function f −1
is not the reciprocal function of f (that is, 1/f (x)). To denote the reciprocal of f (x) using index
notation we write [f (x)]−1 rather than f −1 (x).)
Remark 6.3.4. If f is a bijection and g is the inverse of a function f , then f is the inverse of g.
This is due to the symmetry between f and g in the statements of Theorem 6.3.1.
If f is a one-to-one function then equation (6.5) can often be used to find an explicit formula
for its inverse g.
(d) Use the inverse function g to solve the equation f (x) = 13.
Solution. (a) Since f ′ (x) = −x2 ≤ 0 for all x in R (with f ′ (x) = 0 only when x = 0), the function
f is decreasing and is therefore one-to-one. Hence f has an inverse function.
(b) Let g denote the inverse function of f . Then Dom(g) = Range(f ) = R and Range(g) =
Dom(f ) = R. To find an explicit formula for g(x), suppose that
y = g(x).
f (y) = f (g(x)) = x.
1
4 − y 3 = x,
3
whereupon rearrangement gives √
3
y= 12 − 3x.
Therefore g : R → R is given by √
3
g(x) = 12 − 3x. (6.6)
(c) The graphs of f and g are shown below. Note that each graph is the reflection the other in
the line y = x.
c
2020 School of Mathematics and Statistics, UNSW Sydney
6.3. INVERSE FUNCTIONS 117
y
y=x
y = g(x)
y = f (x)
(d) To solve the equation f (x) = 13, apply g to both sides. This gives
Remark 6.3.6. As was illustrated in Example 6.3.5 (c), the graph of a one-to-one function f and
its inverse g are reflections of each other in the line y = x. We give a proof of this fact below:
If a function f is not one-to-one, then we can sometimes restrict the domain of f so that it
becomes one-to-one. The function f with restricted domain then has an inverse function.
f (x) = 4 − x2 .
Find a restriction of f such that f becomes one-to-one. Find the inverse function g of this restriction.
Solution. A quick sketch of the graph of f (see Figure 6.2 (a)) shows that f is not one-to-one. If we
restrict the domain of f to [0, ∞), then the restricted function f : [0, ∞) → R passes the horizontal
line test and is therefore one-to-one. Let g denote the inverse of f : [0, ∞) → R. Then
g : (−∞, 4] → [0, ∞)
c
2020 School of Mathematics and Statistics, UNSW Sydney
118 CHAPTER 6. INVERSE FUNCTIONS
y y y
4 y = g(x) 4 y = f (x) 4
−2 2 4 4
x x x
y = g(x)
y = f (x) y = f (x)
Figure 6.2: Two different restrictions of the function f , and their corresponding inverses (see
Example 6.3.7).
since Dom(g) = Range(f ) = (−∞, 4] and Range(g) = Dom(f ) = [0, ∞). We find an explicit
formula for g:
y = g(x)
f (y) = x (by equation 6.5)
4 − y2 = x
√
y = 4−x (taking the positive root since Range(g) = [0, ∞)).
√
Hence g(x) = 4 − x. The graphs of f and g are sketched in Figure 6.2 (b).
Remark 6.3.8. In the solution to Example 6.3.7, one could have instead chosen the restriction
f : (−∞, 0] → R. The corresponding inverse g is given by
p
g : (−∞, 4] → (−∞, 0], g(x) = − 4 − x2 .
Find all intervals I, each as large as possible, such that the function f : I → R (with domain
restricted to I) has an inverse function.
Remark 6.3.10. If you study MATH1081 you will see that an abstract function f : A → B is said
to be invertible if it is a bijection from the set A onto the set B. That is, it is both one-to-one and
its range is all of its codomain B and so the problem of solving f (x) = b has a unique solution for
all b ∈ B. In the algebra strand of this course you will see that an m × n matrix A is invertible
if the matrix equation Ax = b has a unique solution for all b ∈ Rm , and we’ll see the idea again
with linear transformations in Mathematics 1B.
In calculus we often initially define a function without precisely knowing its range, and so we
usually say,
“Define f : (a, b) → R by ...”. Rx 2
For example, a very important function is f : R → R, f (x) = 0 e−t dt. It is easy to check that this
√ √
function f is one-to-one, but it is hard to see that the range of f is (− π/2, π/2). In calculus,
when we say that this function f is invertible, we are not saying that f is a bijection from its
domain onto it’s codomain R, but rather that f is a bijection from it domain onto its range.
c
2020 School of Mathematics and Statistics, UNSW Sydney
6.4. THE INVERSE FUNCTION THEOREM 119
Since
1
g′ (x) = √
3
,
x2
g is not differentiable at 0. So while f is differentiable everywhere, its inverse g is not.
The problem with the above example occurs where the derivative of f is zero. When a horizontal
tangent to f is reflected in the line y = x, it becomes a vertical tangent to g and hence g is not
differentiable at this point.
y
However, if f is a differentiable function whose derivative is never zero, its inverse g will also be
differentiable.
Theorem 6.4.1 (The inverse function theorem). Suppose that I is an open interval, f : I → R is
differentiable and f ′ (x) 6= 0 for all x in I. Then
1
g′ (x) =
f ′ (g(x))
Some parts of the inverse function theorem are easy to prove while others are hard. Statement
(i) is true since f ′ (x) 6= 0 for x in I implies that f is either increasing on I or decreasing on I (by
c
2020 School of Mathematics and Statistics, UNSW Sydney
120 CHAPTER 6. INVERSE FUNCTIONS
the mean value theorem). Hence f is one-to-one. Statement (ii) is difficult to prove and involves a
delicate limiting argument with the difference quotients of f and g.
It is important that students understand the proof of statement (iii). Suppose (by statement
(ii)) that g is differentiable. Beginning with the equation
f (g(x)) = x
f ′ (g(x)).g ′ (x) = 1
by the chain rule. Since f ′ is never zero on I, we can divide by f ′ (g(x))) to obtain
1
g′ (x) =
f ′ (g(x))
as required.
In the next section the inverse function theorem will be applied to the trigonometric functions.
c
b
θ
a
However, if we know the ratio and instead want to find the corresponding angle, we use the inverse
sine function. Thus relationship between the sine and inverse sine functions is given by
b b
sin θ = , sin−1 = θ. (6.7)
c c
This relationship works in elementary trigonometry where the angle θ is acute. However, if θ is
allowed to be any real number then the relationship between sin and sin−1 is not so straightforward.
This is because the function sin : R → R is not one-to-one and therefore has no inverse (see Figure
6.3).
In the next example, we define exactly what we mean by sin−1 and compute its derivative.
Example 6.5.1 (The inverse sine function). Since the usual sine function is not a one-to-one
function, we consider instead a restricted sine function
sin : [− π2 , π2 ] → [−1, 1]
which is one-to-one (see Figure 6.3 (a)). This restricted function has an inverse
sin−1 : [−1, 1] → [− π2 , π2 ],
c
2020 School of Mathematics and Statistics, UNSW Sydney
6.5. APPLICATIONS TO THE TRIGONOMETRIC FUNCTIONS 121
sin x sin−1 x
(1, π2 )
( π2 , 1) b
b
x x
(− π2 , −1) b
(−1, − π2 )
(a) (b)
We now examine the differentiability of sin−1 . If I = (− π2 , π2 ) then the sine function is differen-
tiable on I and
d
(sin x) = cos x 6= 0
dx
for all x in I. So by the inverse function theorem (Theorem 6.4.1), sin−1 is differentiable on (−1, 1).
While the derivative of sin−1 could be computed using the formula of Theorem 6.4.1 (iii), we prefer
to compute it directly by implicit differentiation.
If y = sin−1 x then
We seek a simple expression for cos y in terms of x. By the trigonometric identity sin2 y +cos2 y = 1,
and the fact that y = sin−1 x, we have
q q p
cos y = 1 − sin2 y = 1 − (sin(sin−1 x))2 = 1 − x2 , (6.9)
where the positive square root was taken since cos y > 0. Substituting (6.9) into (6.8) gives
dy 1
=√
dx 1 − x2
and hence
d 1
(sin−1 ) = √
dx 1 − x2
whenever −1 < x < 1.
As a corollary, note that the derivative of sin−1 is positive, so sin−1 is an increasing function.
The following table summarises the domain, range and derivatives of the restricted (and hence
one-to-one) trigonometric functions and their inverses. The derivatives exist on the largest open
c
2020 School of Mathematics and Statistics, UNSW Sydney
122 CHAPTER 6. INVERSE FUNCTIONS
(−1, π) cos−1 x tan−1 x
b
π
2
b
(0, π2 ) b
(1, 0) x x
b
− π2
intervals contained in the domain of each function. Proofs of the derivatives for the inverse trigono-
metric functions are done in much the same way as illustrated in Example 6.5.1. It is expected that
students will be able to derive these derivatives, using the method of Example 6.5.1, if required.
The graphs of cos−1 and tan−1 are shown in Figure 6.4.
We note also that sin−1 and tan−1 are both odd functions, but that
The next example illustrates that one must be careful with the notation sin and sin−1 . It is
not always true that sin−1 (sin x) = x. This is because sin−1 is not the inverse of sin : R → R. It is
only the inverse of sin : [− π2 , π2 ] → R. In particular,
sin−1 (sin x) = x
(b) sin(cos−1 53 ).
c
2020 School of Mathematics and Statistics, UNSW Sydney
6.5. APPLICATIONS TO THE TRIGONOMETRIC FUNCTIONS 123
(b) Let θ denote cos−1 35 . Then we seek sin θ. By the trigonometric identity
sin2 θ + cos2 θ = 1,
we have p
sin θ = ± 1 − cos2 θ,
which simplifies to
s 2
3 4
sin θ = ± 1− =± .
5 5
3
Since θ = cos−1 5 > 0 we have sin θ > 0 and hence
4
sin θ = + ,
5
5
4
θ
A 3 C
The final example uses theory from all chapters covered in the course so far.
Example 6.5.3. A man and a woman sit on opposite edges of a canyon which is 50 metres wide
and 100 metres deep. The man drops a small stone from rest into the canyon. The woman watches
the stone fall to the bottom of the canyon through a telescope. At what time during the stone’s
fall will the angle that the telescope makes with the horizontal change the fastest? (That is, when
will the angular velocity of the telescope be greatest?) Assume that the air resistance experienced
by the stone is negligible.
Solution. Let x(t) denote the distance (in metres) that the stone has travelled exactly t seconds
after the stone was dropped. Let θ(t) denote the angle that the telescope makes with the horizontal
at t seconds. Let g denote acceleration due to gravity (g ≈ 9.8 ms−2 ).
c
2020 School of Mathematics and Statistics, UNSW Sydney
124 CHAPTER 6. INVERSE FUNCTIONS
man woman
θ(t)
x(t)
100 m
r
stone
50 m
dx
= v, x(0) = 0, (6.10)
dt
dv
= g, v(0) = 0. (6.11)
dt
From (6.11) we obtain v(t) = gt + C1 . By imposing the initial condition that v(0) = 0 we see that
C1 = 0. Hence v(t) = gt and substituting this into (6.10) gives
dx
= gt, x(0) = 0.
dt
It follows that x(t) = 12 gt2 + C2 and one easily shows that C2 = 0.
To find out when the stone hits the canyon floor, we solve the equation
1
100 = gt2
2
p
for t. Hence t = 200/g ≈ 4.5. So it takes about 4.5 seconds for the stone to hit the canyon floor.
By examining the triangle in the above diagram, we see that
x(t) gt2
tan θ = = .
50 100
dθ
To solve the problem, we need to determine the value of t when dt is a maximum. Now
−1 gt2
θ = tan
100
and so
dθ 1 gt
= 2 ×
dt gt2 50
1+ 100
c
2020 School of Mathematics and Statistics, UNSW Sydney
6.6. MAPLE NOTES 125
dω d2 θ
= 2
dt dt
(104 + g2 t4 )200g − 200gt(4g2 t3 )
=
(104 + g 2 t4 )2
(10 )200g − 600g3 t4
4
= .
(104 + g2 t4 )2
104
t4 = .
3g2
p p
Since t ≥ 0, we obtain t = 10/ 4 3g2 ≈ 2.42. So the critical points of ω on [0, 200/g] are
p p
0, 10/ 4 3g2 and 200/g.
p p
ω(0) = 0, ω(10/ 4 3g 2 ) ≈ 1.10, ω( 200/g) ≈ 0.26.
p
Clearly ω(t) attains its maximum when t = 10/ 4 3g2 .
p
In conclusion, the angular velocity of the telescope is greatest 10/ 4 3g2 (or about 2.26) seconds
after the stone is dropped. At this time, the angular velocity is about 1.10 radians per second (that
is, about 62.9 degrees per second).
Maple knows about the inverse trigonometric functions. The functions sin−1 , cos−1 , tan−1 , cosec−1 ,
sec−1 , cot−1 are written respectively as: arcsin, arccos, arctan, arccsc, arcsec and arccot.
For example,
c
2020 School of Mathematics and Statistics, UNSW Sydney
126 CHAPTER 6. INVERSE FUNCTIONS
> # Note the gap around zero in the graph of arccsc(x) --- please explain!
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 6 127
2. [R]
3. [R] Show that the function f : R → R, given by f (x) = x3 +3x+1, has an inverse function
whose domain is R.
6. [H]
7. [H] For each function f : R → R given below, find all possible intervals I of R, each as
large as possible, such that the restricted function f : I → R is one-to-one. State the
range of each restricted function f : I → R. What can you say about existence, domain
of definition, continuity and differentiability of the corresponding inverse functions?
c
2020 School of Mathematics and Statistics, UNSW Sydney
128 CHAPTER 6. INVERSE FUNCTIONS
12. [R] Prove that sin−1 x + cos−1 x is constant. For what values of x is this valid and what
is the constant?
13. [H] [V] Suppose that f (x) = tan−1 x + tan−1 (1/x) whenever x 6= 0.
a) Show that f ′ (x) = 0 whenever x 6= 0.
b) Hence evaluate f on the intervals (0, ∞) and (−∞, 0).
c) How do you account for this result geometrically?
14. [H]
a) Draw the graph of cosec x.
b) Show that cosec restricted to the interval (0, π2 ] has an inverse function. Sketch the
graph of the inverse and calculate its derivative.
15. [X]
a) Show that 2 tan−1 2 = π − cos−1 (3/5).
b) Show that cos−1 (1 − 2x2 ) = 2 sin−1 x whenever 0 ≤ x ≤ 1.
c) Suppose that q(x) = cos−1 (1 − 2x2 ). Is q differentiable at 0?
c
2020 School of Mathematics and Statistics, UNSW Sydney
6.6. MAPLE NOTES 129
where a and b are real numbers. Find all values of a and b such that f is differentiable
at 0.
17. [H] A lighthouse containing a revolving beacon is located 3 km from P , the nearest point
on a straight shoreline. The beacon revolves with a constant rotation rate of 4 revolutions
per minute and throws a spot of light onto the shoreline. How fast is the spot of light
moving when it is (a) at P and (b) at a point on the shoreline 2 km from P ?
18. [H] [V] A picture 2 metres high is hung on a wall with its bottom edge 6 metres above
the eye of the viewer. How far from the wall should the viewer stand for the picture to
subtend the largest possible vertical angle with her eye?
c
2020 School of Mathematics and Statistics, UNSW Sydney
130 CHAPTER 6. INVERSE FUNCTIONS
c
2020 School of Mathematics and Statistics, UNSW Sydney
131
Chapter 7
Curve sketching
There are a variety of ways to describe curves that lie in the plane. In this chapter we study curves
which are described by using
√
• a Cartesian equation (for example, y = x2 or y = 1 − x2 ),
• polar coordinates.
Each section of this chapter is devoted to one of these methods. The use of Cartesian equations
will already be familiar. However, many curves cannot be easily described by Cartesian equations.
The following diagrams illustrate two curves that are better described by a parameter or with polar
coordinates.
y y
c
2020 School of Mathematics and Statistics, UNSW Sydney
132 CHAPTER 7. CURVE SKETCHING
In this section we survey techniques for sketching curves that are described by a Cartesian equation
of the form y = f (x).
y = f (x)
x
It appears that the graph of f approaches the line y = ax + b as x → ∞. If this is indeed the case,
we say that f is asymptotic to the line. Since this line is neither vertical, nor horizontal, we call it
an oblique asymptote to the function f .
Definition 7.1.1. Suppose that a and b are real numbers and that a 6= 0. We say
that a straight line, given by the equation
y = ax + b,
is an oblique asymptote for a function f if
lim f (x) − (ax + b) = 0.
x→∞
c
2020 School of Mathematics and Statistics, UNSW Sydney
7.1. CURVES DEFINED BY A CARTESIAN EQUATION 133
y =x+1
1
x
x=2
(Note that we have obtained sufficient information to make a useful sketch of the graph without
resorting to calculus. However, calculus would help to refine this by locating the turning points
and confirming where the graph is increasing or decreasing.)
If f is a rational function of the form p/q, where p and q are polynomials and deg(p) > deg(q),
then the asymptotic behaviour of f may be determined by polynomial division.
Example 7.1.3. Suppose that f is defined by the equation
x2 − 4
f (x) =
x+1
whenever x 6= −1. Identify all asymptotes to f and hence sketch its graph.
c
2020 School of Mathematics and Statistics, UNSW Sydney
134 CHAPTER 7. CURVE SKETCHING
Solution. As in the previous example, clearly there is a vertical asymptote when x = −1. It is also
clear that f (x) → ∞ as x → ∞, but the precise asymptotic behaviour of f (x) as x → ∞ is still
not clear. However, polynomial division gives
x−1
x+ 1 ) x2 −4
x2 + x
−x − 4
−x − 1
−3.
Hence
3
f (x) = x − 1 − . (7.1)
x+1
(See Remark 7.1.4 for an alternative method of establishing (7.1).)
3
Since x+1 → 0 from above as x → ∞, we conclude that f (x) approaches x − 1 from below as
x → ∞. A similar observation shows that f (x) approaches x − 1 from above as x → −∞. So the
line y = x − 1 is an oblique asymptote to the graph of f at both ∞ and −∞.
Finally, we calculate the axes intercepts:
f (0) = −4
and
f (x) = 0 ⇒ x2 − 4 = 0 ⇒ x = ±2.
y =x−1
−2 2 x
y = f (x) −4
x = −1
(Again, we have obtained sufficient information for a useful sketch of the graph without the use of
calculus.)
c
2020 School of Mathematics and Statistics, UNSW Sydney
7.1. CURVES DEFINED BY A CARTESIAN EQUATION 135
Remark 7.1.4. We give a method for establishing equation (7.1) that uses an algebraic ‘trick’
rather than polynomial ‘long division’:
x2 − 4
f (x) =
x+1
x2 − 1 − 3
=
x+1
(x − 1)(x + 1) − 3
=
x+1
3
= (x − 1) − .
x+1
7.1.3 Examples
We complete this section with three examples.
Example 7.1.5. Sketch the graph of f , where f is defined by
2
f (x) = xe−x .
Solution. It is clear that Dom(f ) = R. Moreover,
2
f (−x) = −xe−x = −f (x) ∀x ∈ R,
so f is an odd function. Therefore, it suffices to consider x ≥ 0; the rest of the graph can be
obtained by symmetry.
2
Since e−x > 0, it is easy to see that the only axis intercept occurs at (0, 0). We look now at
the behaviour of f (x) as x → ∞. By l’Hôpital’s rule,
x 1
lim f (x) = lim x 2 = lim 2 = 0.
x→∞ x→∞ e x→∞ 2xex
So y = 0 is an horizontal asymptote.
Finally, we use calculus to determine the shape of the curve. By the product rule,
2 2
f ′ (x) = x − 2xe−x + e−x (1)
2
= (1 − 2x2 )e−x .
2
Since e−x > 0 for all x in R, we conclude that
1
f ′ (x) = 0 when x = √ ,
2
while
1 1
f ′ (x) > 0 when 0 ≤ x < √ and f ′ (x) < 0 when x > √ .
2 2
So f is increasing on the interval (0, √12 ), stationary at √1 and decreasing on ( √1 , ∞). Now
2 2
√ 1 1
f (1/ 2) = √ e−1/2 = √ ≈ 0.429.
2 2e
Moreover, f ′ (0) = 1, implying that the tangent line to the graph at the origin has a gradient of 1.
This information can be used to sketch the graph when x ≥ 0, and the fact that f is odd allows
us to deduce the shape of the graph when x < 0.
c
2020 School of Mathematics and Statistics, UNSW Sydney
136 CHAPTER 7. CURVE SKETCHING
f (x)
√1
|
2e
−2 − √12
| | | |
√1 2 x
2
− √12e
|
Example 7.1.6. Sketch the graph of f , where f is defined by
3
f (x) = x2 − 4 + .
x2
The next function is important since it is later used to define the Si function (see Example
8.12.1). The Si function is used by electrical engineers for digital signal processing and by surveyors
using GPS (Global Positioning System).
Example 7.2.1. Suppose that a stone is thrown horizontally from the deck of Sydney Harbour
Bridge at 20 metres per second, and that the air resistance experienced by the stone is negligible.
Let x(t) denote the horizontal distance (in metres) travelled by the stone t seconds after being
thrown and let y(t) denote its height (in metres) above the water. The deck is 50 metres above sea
level.
deck
b
stone
x(t)
50m
y(t)
water
c
2020 School of Mathematics and Statistics, UNSW Sydney
7.2. PARAMETRICALLY DEFINED CURVES 137
y = f (x)
√ √
−43 1 3
| √
2 3−4 x
y = x2 − 4
−4
c
2020 School of Mathematics and Statistics, UNSW Sydney
138 CHAPTER 7. CURVE SKETCHING
If we take the acceleration due to gravity to be 10 m/s2 , then Newton’s laws of motion show that
x(t)
t= and y(t) = 50 − 5t2 .
20
If we substitute the equation on the left into the equation on the right then
x(t) 2
y(t) = 50 − 5
20
which becomes
x2
y = 50 − .
80
√
So the trajectory is a parabola. It is easy to see that y = 50 when x = 0 and that x = 20 10 when
y = 0. We therefore obtain the following sketch.
y
50
√ x
20 10
x2
y = 50 − .
80
The variable t is called a parameter.
• They arise naturally with moving bodies, as illustrated in Example 7.2.1. In this con-
text they also contain more information than a corresponding Cartesian equation (since a
Cartesian equation only records the trajectory of the body, not the position of the body
at any given time).
c
2020 School of Mathematics and Statistics, UNSW Sydney
7.2. PARAMETRICALLY DEFINED CURVES 139
• Many curves that cannot be described by an equation of the form y = f (x) can be described
parametrically.
• Even if a curve is the graph of a function, it may be much easier to describe the curve
parametrically than to write down an equation describing the function. (This is the case
with the cycloid, which will be studied in Example 7.2.7.)
• Curves in three (or higher) dimensional space are most easily described using parameters.
is an ellipse. Sketch the curve, showing how the point (x(t), y(t)) moves as t varies from 0 to 2π.
Solution. Suppose that t ∈ [0, 2π]. From (7.3) we see that
x(t)2 y(t)2
= cos2 t, = sin2 t.
a2 b2
Since cos2 t + sin2 t = 1,
x(t)2 y(t)2
+ 2 = cos2 t + sin2 t = 1.
a2 b
Hence the point (x(t), y(t)) lies on the ellipse
x2 y 2
+ 2 = 1.
a2 b
(Conversely, any point (x, y) that satisfies this equation can be expressed as x = a cos t, y = b sin t
for some t in R.)
In fact, it is easy to see that, as t moves from 0 to 2π, (x(t), y(t)) traces out the entire ellipse.
This is illustrated in the diagram below.
y(t)
π
t= 2
π b
π
2 <t<π b 0<t< 2
t=π t = 0, 2π
b b
−a a
x(t)
π<t< 3π −b b
3π
< t < 2π
2 2
3π
t= 2
c
2020 School of Mathematics and Statistics, UNSW Sydney
140 CHAPTER 7. CURVE SKETCHING
The table below lists some commonly used parametrisations of conic sections.
y(t) = at2
y(t) = a sin t
x2 y2
Ellipse + =1 x(t) = a cos t
a2 b2
y(t) = b sin t
x2 y2
Hyperbola − =1 x(t) = a sec t
a2 b2
y(t) = b tan t
There are also other parametrisations for each of these curves. (We give a second parametrisation
of the hyperbola in Chapter 10.)
In Examples 7.2.1 and 7.2.3, we sketched a parametrically defined curve by first eliminating the
parameter t. It is not always possible to do this, as the next example illustrates. Such curves can
be sketched with the aid of a table of values.
Solution. We begin by noting that the above parametric equation is similar to the parametrisation
x(t), y(t) = (a cos t, a sin t) (7.5)
of the circle. The only difference is that the points in (7.5) have a fixed distance a from the origin,
whereas the points in (7.4) have a variable distance t2 from the origin. So intuitively, as t increases,
the point (x(t), y(t)) in (7.4) rotates about origin, while at the same time getting further away from
the origin. Therefore it is likely that the curve is a spiral.
The following table of values helps verify this reasoning.
π 3π 5π
t 0 2 π 2 2π 2 3π
x 0 0 −π 2 0 4π 2 0 −9π 2
π2 9π 2 25π 2
y 0 4 0 4 0 4 0
We plot these points on the Cartesian plane and interpolate between them to produce the following
sketch.
c
2020 School of Mathematics and Statistics, UNSW Sydney
7.2. PARAMETRICALLY DEFINED CURVES 141
y(t)
b
25π 2
4
π2
4
−π 2 b
b b b b
x(t)
−9π 2 4π 2
b
2
− 9π4
Proposition 7.2.5. Suppose that x and y are both differentiable functions of t and that y is a
function of x. If x′ (t) =
6 0 then
dy
dy dt y ′ (t)
= dx = ′ . (7.6)
dx dt
x (t)
y(t) = f (x(t)).
df
y ′ (t) = (x(t)) x′ (t)
dx
by the chain rule. Since x′ (t) 6= 0,
df y ′ (t)
(x(t)) = ′
dx x (t)
which is equivalent to (7.6).
Find the equation of the tangent to the curve at the point (0, 3).
c
2020 School of Mathematics and Statistics, UNSW Sydney
142 CHAPTER 7. CURVE SKETCHING
A b
b
B
c
2020 School of Mathematics and Statistics, UNSW Sydney
7.2. PARAMETRICALLY DEFINED CURVES 143
Solution. The solution will be built up over three steps, of which the first two correspond to the
diagrams below.
y y
θ rt
t x t x
P (x, y) P (x, y)
rt rt y = −r
Step 1. Step 2.
Step 1. First we consider a circle of radius r centred at the origin. If the circle rotates about
its centre by t radians in a clockwise direction (as shown) then
x(t) = r cos θ = r cos( 3π
2 − t) = r cos 3π
2 cos t + sin 3π
2 sin t = −r sin t,
3π 3π 3π
y(t) = r sin θ = r sin( 2 − t) = r sin 2 cos t − cos 2 sin t = −r cos t
y(t) = −r cos t + r.
where t ≥ 0.
Remark 7.2.8. The curve of fastest descent from A to B is the unique arc of an (inverted) cycloid
whose tangent at A is vertical.
c
2020 School of Mathematics and Statistics, UNSW Sydney
144 CHAPTER 7. CURVE SKETCHING
• (x, y), where x is the horizontal signed distance of P from the origin, and y is the vertical
distance, or
• (r, θ), where r is the distance of P from the origin and θ is the angle (taken in the
anticlockwise direction) between OP and the positive horizontal axis.
x b
P b
P
r
y
θ
O O
The pair (x, y) is called the Cartesian coordinates of P and the pair (r, θ) are polar coordinates of
P . (If P is the origin then r = 0 and θ is not defined.)
The polar coordinates (r, θ) of P are related to the Cartesian coordinates (x, y) of P by the
formulae p
x = r cos θ r = x2 + y 2
y (7.8)
y = r sin θ tan θ = , provided x 6= 0.
x
This is easily seen using trigonometry, Pythagoras’ theorem and the diagram below.
P (x, y)
b
r
r sin θ
θ
r cos θ
When converting between the polar coordinates and Cartesian coordinates, it is best to draw a
diagram.
Example 7.3.1. Suppose that the polar coordinates of P and Q are given by (2, π4 ) and (3, 7π
6 )
respectively. Find the Cartesian coordinates for P and Q.
c
2020 School of Mathematics and Statistics, UNSW Sydney
7.3. CURVES DEFINED BY POLAR COORDINATES 145
y
P (2, π4 )
2
13π
6 π
4
π x
6
3
7π
Q(3, 6 )
√
Example 7.3.2. Suppose that the Cartesian coordinate of R is given by (−2, 2 3). Find polar
coordinates for R.
√ r
2 3
θ
α
2 x
√
2 3
√
In the diagram, tan α = 2 = 3, so α = π3 . Hence
2π
θ =π−α= .
3
By Pythagoras’ theorem, √
r 2 = 22 + (2 3)2 = 16.
Hence in polar coordinates, R can be written as (4, 2π
3 ).
Remark 7.3.3. Note that the point R does not have unique polar coordinates. For example
(4, 2π
3 + 2π) and (4, 2π
3 − 2π)
are also polar coordinates for R. In practice, we often choose the value for θ such that −π < θ ≤ π.
c
2020 School of Mathematics and Statistics, UNSW Sydney
146 CHAPTER 7. CURVE SKETCHING
(a) r = 3
3π
(b) θ = 4 .
Solution. (a) Since r = 3, we seek all points in the plane whose distance from the origin is 3. This
is a circle of radius three centred at (0, 0).
(b) This is the collection of all points P such that the angle between OP and the positive x-axis
is 3π
4 . Hence the curve is the ray sketched below with the origin excluded.
y y
3π
3 4
3 x x
r=3 θ= 3π
4
To sketch a curve, described by the polar equation r = f (θ), in the xy-plane, it may be helpful
to first sketch r = f (θ) in the rθ-plane.
r = 2 − cos θ.
√1
b b
2+
|
2 d f r = 2 − cos θ
2 b b
|
c g
2− √1
2 b b
|
b h
1 a b
i
b
| | | | | | | |
π π 3π π 5π 3π 7π 2π θ
4 2 4 4 2 4
The r-value of each point a, b, c, ..., i in the rθ-plane corresponds to the distance from the origin to
each point a′ , b′ , ..., i′ in the xy-plane.
c
2020 School of Mathematics and Statistics, UNSW Sydney
7.3. CURVES DEFINED BY POLAR COORDINATES 147
y
θ = π2
θ = 3π
4 θ= π
4
b c′ r = 2
b
d′
b′ b
r=3 a′ r = 1 θ=0
θ=π b b
e ′ i′ x
b
h′
b
f′ g′ r = 2
b
5π 7π
θ= 4 θ= 4
3π
θ= 2
Our task now is to interpolate between the points a′ , b′ , ..., i′ to obtain a sketch of the curve.
Consider what happens to r as θ moves from 0 to 2π.
y
3π π
θ= 4 θ= 4
b
2
b
−3 b b
1 x
b
b
b
5π −2 7π
θ= 4 θ= 4
r = 2 − cos θ
We list a few more helpful tips for sketching a curve described by the polar equation r = f (θ).
• If f is an even function (that is, f (−θ) = f (θ)) then the polar curve is symmetric about
the x-axis.
c
2020 School of Mathematics and Statistics, UNSW Sydney
148 CHAPTER 7. CURVE SKETCHING
r = 2 sin θ, 0 ≤ θ ≤ π.
Solution. Graphing r against θ, followed by y against x leads to the following two diagrams.
r y
r=2
2 b
θ= 43π
θ = π4
b
|
√2 b b
|
2
b b
√ √
r= 2 r= 2
b | | | |b b
π π 3π π θ r=0 x
4 2 4
r = 2 sin θ r = 2 sin θ
The curve looks as though it is a circle, an observation that can be confirmed by rewriting r = 2 sin θ
in terms of x and y:
r = 2 sin θ
r 2 = 2r sin θ (multiply by r)
2 2
x + y = 2y (by (7.8))
x2 + y 2 − 2y + 1 = 1 (completing the square)
2 2
x + (y − 1) = 1.
Therefore the polar curve is a circle with Cartesian centre (0, 1) and radius 1.
Example 7.3.7. Sketch the curve that is described by the polar equation
r = 4| sin(3θ)|.
Hence,
dy
dy dθ y ′ (θ)
= dx
=
dx dθ
x′ (θ)
f (θ) cos θ + f ′ (θ) sin θ
=
−f (θ) sin θ + f ′ (θ) cos θ
c
2020 School of Mathematics and Statistics, UNSW Sydney
7.3. CURVES DEFINED BY POLAR COORDINATES 149
(Rather than memorising this formula, we recommend that students understand how it is obtained.)
Knowing the value of derivative of the curve at various points allows for greater accuracy in
sketching. We illustrate with one example.
Example 7.3.8. Sketch the curve described by the polar equation r = 1 − sin θ.
Solution. First we graph r against θ and then give a preliminary sketch of y against x.
r θ= 3π y θ= π
4 4
2 b
|
√1
b b
1+
|
2
−1 b b
1
1 b b b b b b
|
x
1− √1
2 b b
|
b b
| |b | | | | | |
π π 3π π 5π 3π 7π 2π θ
4 2 4 4 2 4 5π b 7π
θ= 4 −2 θ= 4
r = 1 − sin θ r = 1 − sin θ
dy
To obtain a clearer idea of the behaviour of the curve, we calculate dx . We have
and
y = r sin θ = (1 − sin θ) sin θ.
Hence
and
Thus
dy y ′ (θ) cos θ(1 − 2 sin θ)
= ′ = .
dx x (θ) 2 sin2 θ − sin θ − 1
To locate horizontal and vertical tangents, we examine where
c
2020 School of Mathematics and Statistics, UNSW Sydney
150 CHAPTER 7. CURVE SKETCHING
x′ ( π2 ) = y ′ ( π2 ) = 0.
5π
y π
θ= 6
θ= 6
HT HT
−1 1
x
gradient of tangent is 1 gradient of tangent is −1
VT VT
7π 11π
θ= 6 θ= 6
−2 HT
r = 1 − sin θ
c
2020 School of Mathematics and Statistics, UNSW Sydney
7.4. MAPLE NOTES 151
Remark 7.3.9. (MATH1141 only) In the above example, the curve has a ‘vertical cusp’ at the
dy
origin. This is seen by analysing dx when θ is close to π2 . By l’Hôpital’s rule we see that
Hence
dy y ′ (θ) π−
= ′ →∞ as θ→ .
dx x (θ) 2
By symmetry,
dy y ′ (θ) π+
= ′ → −∞ as θ→ .
dx x (θ) 2
π
Therefore the derivative has an infinite discontinuity when θ = 2. Such a point is called a ‘cusp
point.’
Remark 7.3.10. In most cases, a reasonable sketch of a polar curve can be made without appealing
to calculus. As seen in the example above, while employing calculus gives a more accurate sketch,
it also dramatically increases the amount of work and time required.
c
2020 School of Mathematics and Statistics, UNSW Sydney
152 CHAPTER 7. CURVE SKETCHING
> # What is the reason for the straight lines through the origin in graph above?
> plot([sin(5*theta), theta, theta=-Pi..Pi], coords=polar);
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 7 153
2. [R] Write down the period of each of the following functions f (where possible). Determine
which are odd or even. Sketch the graph of each function.
a) f (x) = sin 3x b) f (x) = 1 + sin(2x/3) c) f (x) = x sin x
d) f (x) = tan 3x e) f (x) = cos2 x f) f (x) = sin x + cos x
3. [R] Suppose that f is an odd function (not everywhere zero). Determine whether each
function g below is odd, even or neither.
a) g(x) = x2 f (x) b) g(x) = x3 f (x) c) g(x) = x2 + f (x)
d) g(x) = x3 + f (x) e) g(x) = sin(f (x)) f) g(x) = f (cos x)
4. [R] For each function f , identify any vertical and oblique asymptotes and hence sketch
the graph. (Do not use calculus.)
1 x2 − 2 x3 − 7x + 8
a) f (x) = x + 2 + b) f (x) = c) [H] f (x) =
x−3 x+1 x2 + x − 6
3x2 − 10x + 3
6. [H] (Longer rather than difficult) Suppose that y = .
3x2 + 10x + 3
a) Find the values of x for which y ≥ 0. b) Find the asymptotes.
c) Find the turning points. d) Find the domain and range.
e) Sketch the graph.
7. [R] Sketch the curves given by the following parametric equations. Also find, where
possible, a Cartesian equation for the curve.
a) x = 4 cos t, y = 5 sin t
b) x = 3 sec t, y = 2 tan t
.
c) x = t3 , y = t2
d) x = et cos t, y = et sin t
c
2020 School of Mathematics and Statistics, UNSW Sydney
154 CHAPTER 7. CURVE SKETCHING
x=1−t x = 3t + 2 x = cos t
a) b) c)
y =1+t y = t4 − 1 y = sin t,
9. [R] [V]
t t
a) Find the equation of the normal to the curve x = , y = at the point P
t+1 t−1
when t = 2.
b) Eliminate t from the above equations and find the gradient of the normal at P using
the Cartesian form.
b) Show that the equation of the chord joining the points with parameters t1 , t2 is
d) Suppose that P is a point with coordinates (a, b) and that P does not lie on on the
curve or on the y-axis.
i) Show that either one or three tangents may be drawn from P to the given curve.
Illustrate on a sketch the region in which P must lie so that there are three
tangents from P to the curve.
ii) Assume that P lies in this region and let Q1 , Q2 , Q3 denote the points of contact
of the tangents from P to the curve. Show that the centroid of the triangle
Q1 Q2 Q3 is the point (−2a, 2b).
11. [H] Consider a fixed circle of radius 1 centred at the origin and a smaller circle of radius 14
initially centred at ( 43 , 0). The smaller circle rolls (without slipping) around the inside rim
of the larger circle such that the centre Q of the smaller circle moves in an anticlockwise
direction. A point P , fixed on the rim of the smaller circle and initially with coordinates
(1, 0), traces out a curve as the smaller circle moves inside the larger circle.
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 7 155
y y
x θ P x
O Q P O
12. [R] [V] In the 1960s two French car engineers, Paul de Casteljau and Pierre Bezier,
independently discovered a remarkable new approach to parameterising curves. Let’s in-
troduce their approach by finding a quadratic curve determined by the control points
A(1, 1), B(2, 2) and C(3, 1) with respecitve coordinate vectors a, b and c.
c
2020 School of Mathematics and Statistics, UNSW Sydney
156 CHAPTER 7. CURVE SKETCHING
c) The quadratic de Casteljau Bezier curve r(t) determined A, B and C, is the linear
interpolation of the vectors p(t) and q(t):
r(t) = (1 − t)p(t) + tq(t).
Find an explicit expression for r(t) and find the polynomials p0 , p1 and p2 such that
r(t) = p0 (t)a + p1 (t)b + p2 (t)c.
13. [R] The following points are given in polar coordinate form. Plot them on a diagram and
find their Cartesian coordinates.
a) (3, 0) b) (6, 7π/6) c) (2, 7π/4)
14. [R] Convert these Cartesian coordinates into polar forms with r ≥ 0 and −π < θ ≤ π.
√
a) (−3, 0) b) (−1, −1) c) (−2, 2 3)
√ √
d) (0, 1) e) (−2 3, 2) f) (−2 3, −2)
16. [R]
a) Express r = 6 sin θ, where 0 ≤ θ ≤ π, in Cartesian form and hence draw its graph.
b) Repeat this for r = 2 cos θ, where −π/2 ≤ θ ≤ π/2.
18. [H] [V] The hyperbolic spiral is described by the equation rθ = a whenever θ > 0, where
sin θ
a is a positive constant. Using the fact that lim = 1, show that the line y = a is a
θ→0 θ
horizontal asymptote to the spiral. Sketch the spiral.
5
19. [H] Show that r = is the polar equation of an ellipse by finding the Cartesian
3 − 2 cos θ
equation of the curve (and completing the square).
20. [X]
a) For what values of θ is r 2 = 25 cos 2θ defined?
b) Sketch the graph of this curve. What difference would it make if you allowed negative
values of r?
dy
21. [H] Find dx for the curves in Q16a,b,c.
c
2020 School of Mathematics and Statistics, UNSW Sydney
157
Chapter 8
Integration
The problem of understanding and calculating the area of regions in the plane has a long history.
Some of the main contributors to this field are Archimedes (the great Greek mathematician of
antiquity), Isaac Barrow (Isaac Newton’s mentor), Isaac Newton (one of the greatest mathemati-
cians and physicists of all time), Gottfried Leibniz (Newton’s contemporary), Bernhard Riemann
(a German mathematician of the nineteenth century) and Henri Lebesgue (a French mathemati-
cian of the early twentieth century). In this chapter we see a method for calculating the area of
regions with curved boundaries known as Riemann integration. We then give a startling result,
known as the fundamental theorem of calculus, which connects the problem of calculating areas
with antidifferentiation. This theorem was known to Barrow and its implications were developed
by Newton, Leibniz and their disciples. A consequence of this theorem and other related results,
is that the problem of calculating area is much the same as that of calculating mass, volume, work
and probability. The unifying feature is a body of theory known as the integral calculus.
• the area of a whole region is the sum of the areas of its parts.
When precisely formulated, these points could be taken together as a definition of area. With this
definition, one can derive a formula for the area of a right-angled triangle (see Figure 8.1 (b)), then
c
2020 School of Mathematics and Statistics, UNSW Sydney
158 CHAPTER 8. INTEGRATION
h h
b b1 b2
b
(a) Partition of a polygon (b) A = 12 bh (c) A = 12 bh
for an arbitrary triangle (see Figure 8.1 (c)) and then calculate the area of an arbitrary polygon
(as in Figure 8.1 (a)).
When it comes to calculating the area of a region with curved boundaries (see, for example,
Figures 8.2 and 8.3), a more sophisticated definition of area is needed. Before asking what shape
this definition might take, it is worthwhile asking the question, What is area?
First, we recognise that the area of a region is a non-negative real number. Also, our intuition
of area informs us that it should satisfy the following properties (see Figure 8.2 for a diagram
illustrating each property). We assume throughout that the regions considered are bounded.
(A1) If Ω is a region of the plane then
area(Ω) ≥ 0.
(A2) If one region Ω1 is contained in another region Ω2 , then
area(Ω1 ) ≤ area(Ω2 ).
(A3) If the area of a region Ω is partitioned into two smaller disjoint regions Ω1 and Ω2 , then
area(Ω) = area(Ω1 ) + area(Ω2 ).
These properties are axioms that any definition of area should satisfy.
Rather than give a definition for the area of any bounded region of the plane, we shall only
give a definition for ‘the area under the graph of a function.’ The following example gives some
justification for this decision.
Example 8.1.1. Consider the region Ω of Figure 8.3. By partitioning Ω with straight lines as
shown, one can see by axiom (A3) that
area(Ω) = area(Ω1 ) + area(Ω2 ) + area(Ω3 ).
By rotating and translating each subregion, we see by axiom (A4) that each of area(Ω1 ), area(Ω2 )
and area(Ω3 ) is equal to the area under the graph of a function.
This procedure can be done for any region in the plane with a ‘reasonable’ boundary. Hence
the problem of defining the area of a region with a curved boundary can be reduced to defining
what is meant by ‘the area under the graph of a function.’
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.1. AREA AND THE RIEMANN INTEGRAL 159
Ω Ω2 Ω
Ω1 Ω1 Ω2 Ω2
Ω1
(a) (A1) (b) (A2) (c) (A3) (d) (A4)
a
(e) (A5)
Ω1
Ω Ω3
Ω2
(a) (b)
y y y
f1
f2 f3
x x x
(c)
c
2020 School of Mathematics and Statistics, UNSW Sydney
160 CHAPTER 8. INTEGRATION
f (x) f (x) f (x)
Ω Ω1 Ω2
1 x 1 x 1 x
(a) (b) (c)
Example 8.1.2. Find different upper and lower bounds for the area of the shaded region Ω of
Figure 8.4 (a).
(This may be proved using mathematical induction. If students need to use such a formula in an
exam, they will be given it).
Solution to Example 8.1.2. Let A denote the (as yet undefined) area of the region Ω. Rather than
calculating an upper bound for A by using 10 rectangles (as in Figure 8.4 (c)), we use n rectangles,
where n is a positive integer (see Figure 8.5). The bases of the rectangles are constructed by
subdividing the interval [0, 1] into n subintervals
[0, n1 ], [ n1 , n2 ], [ n2 , n3 ], . . . , [ n−1
n , 1].
that divides the interval [0, 1] into these subintervals is called a partition of [0, 1].
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.1. AREA AND THE RIEMANN INTEGRAL 161
y y = f (x)
Rk
··· ···
Let Rk denote the area of the kth rectangle in Figure 8.5. Then
Rk = width × height
1
= × f ( nk )
n
2
1 k
= ×
n n
1
= 3 × k2 .
n
If S Pn (f ) denotes the total area of the shaded region in Figure 8.5, then
n
X
S Pn (f ) = Rk
k=1
n
X 1
= × k2
n3
k=1
n
1 X 2
= 3 k
n
k=1
1 2
= 3
1 + 22 + 32 + · · · + (n − 1)2 + n2
n
1 1
= × n(n + 1)(2n + 1) (by formula (8.1))
n3 6
1 1 1
= 1+ 2+
6 n n
1 1 1
= + + .
3 2n 6n2
c
2020 School of Mathematics and Statistics, UNSW Sydney
162 CHAPTER 8. INTEGRATION
The quantity S Pn (f ) is called the upper Riemann sum of f with respect to the partition Pn . Axiom
(A2) imples that
1 1 1
A ≤ S Pn (f ) = + + 2,
3 2n 6n
and by taking different values of n we obtain different upper bounds for A.
To compute a lower bound corresponding to n rectangles, consider Figure 8.6. The area Rk of
the kth rectangle is given by
1 1
Rk = × f ( k−1
n ) = 3 (k − 1)2 .
n n
The sum of all the areas of the rectangles is called the lower Riemann sum for the function f over
the partition Pn and is denoted by S Pn (f ). We see that
n
X
S Pn (f ) = Rk
k=1
n
X 1
= × (k − 1)2
n3
k=1
n
1 X
= (k − 1)2
n3
k=1
1 2
= 3
1 + 22 + 32 + · · · + (n − 1)2
n
1 1
= 3
× (n − 1)n(2n − 1) (by formula (8.1))
n 6
1 1 1
= 1− 2−
6 n n
1 1 1
= − + 2.
3 2n 6n
By axiom (A2),
1 1 1
A ≥ S Pn (f ) =− + 2.
3 2n 6n
Hence for each positive integer n, the inequality
1 1 1 1 1 1
− + 2 ≤A≤ + + 2 (8.2)
3 2n 6n 3 2n 6n
gives an upper and lower bound for A.
From inequality (8.2) we see that, however the area A of Ω is eventually defined, it must be
squeezed between
1 1 1 1 1 1
− + 2 and + + 2
3 2n 6n 3 2n 6n
for every value of n. But as n → ∞,
1 1 1 1 1 1 1 1
− + 2 → and + + 2 → . (8.3)
3 2n 6n 3 3 2n 6n 3
So there is only one number A that satisfies (8.2) for every positive integer n, namely A = 31 . Our
definition of the area under the graph of a function must agree with this calculation.
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.1. AREA AND THE RIEMANN INTEGRAL 163
y y = f (x)
Rk
··· ···
Remark 8.1.3. The process of calculating upper and lower Riemann sums and taking a limit as
in (8.3) is called integration.
Guided by the ideas introduced in this subsection, we are now ready to give a definition for the
area under the graph of a function.
8.1.3 The definition of area under the graph of a function and the Riemann
integral
Suppose that f is a bounded function on [a, b] and that f (x) ≥ 0 for all x in [a, b]. In this subsection
we define what is meant by ‘the area under the graph of f from a to b’. This is done by constructing
upper and lower Riemann sums with respect to partitions of [a, b].
Suppose that P is a partition of [a, b]. In a manner similar to Example 8.1.2, P is used
to construct rectangles that approximate the region under the graph of f (see Figure 8.7). As
illustrated in the diagram, the points of P need not be evenly spaced.
The area of the kth rectangle in Figure 8.7 is
c
2020 School of Mathematics and Statistics, UNSW Sydney
164 f (x) CHAPTER 8. INTEGRATION
··· ···
where f k is the maximum value of the function f on the subinterval [ak−1 , ak ]. The upper Riemann
sum S P (f ) for f with respect to the partition P is defined by the formula
n
X
S P (f ) = (ak − ak−1 )f k . (8.4)
k=1
Hence the upper Riemann sum corresponds to the total area of the rectangles in Figure 8.4. Like-
wise, the lower Riemann sum S P (f ) for f with respect to the partition P is defined by the formula
n
X
S P (f ) = (ak − ak−1 )f k , (8.5)
k=1
Definition 8.1.5 (Definition of the area under the graph of a function). Suppose
that a function f is bounded on [a, b] and that f (x) ≥ 0 for all x in [a, b]. If there
exists a unique real number A such that
S P (f ) ≤ A ≤ S P (f ) for every partition P of [a, b], (8.6)
then we say that A is the area under the graph of f from a to b.
It may not seem obvious now but, as we shall see in Section 8.2, not every bounded function f
has an area under its graph in the sense of Definition 8.1.5. The following definition gives a special
name to those functions that do have a well-defined area under their graph. At this point, we lift
the restriction that f (x) ≥ 0 for all x in [a, b].
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.2. INTEGRATION USING RIEMANN SUMS 165
Rb
Remark 8.1.7. The notation a f (x) dx is due to Leibniz. It evolved from a slightly different way
of writing down lower and upper Riemann sums. For example, S P (f ) may be written as
n
X
S P (f ) = f (xk )∆xk ,
k=1
where ∆xk = ak − ak−1 and f attains its maximum value on [ak−1 , ak ]Pat the point xk . When
taking a limit, as in (8.3), ∆xk was replaced with dx and the symbol was replaced with an
elongated ‘S’ (‘S’ stands for ‘sum’).
Rb
Remark 8.1.8. If f is Riemann integrable on [a, b], then the real number a f (x) dx is the area
under the graph of f from a to b, provided that f (x) ≥ 0 for all x in [a, b]. A geometric interpretation
Rb
of the real number a f (x) dx when f takes negative values will be discussed in Section 8.3.
Remark 8.1.9. (MATH1141 only.) If f is bounded but not continuous, then f may not attain
a maximum or minimum value on the closed interval [ak , ak−1 ]. This technical difficulty can be
overcome by defining the upper and lower Riemann sums in terms of suprema and infima rather than
maxima and minima. The concept of a supremum and infimum will be introduced in MATH1241.
Remark 8.1.10. One should think about why Definition 8.1.5 is a good definition for the area
under the graph of a function. The following questions need to be answered:
• If the area under the graph of f is well-defined by Definition 8.1.5, would it agree with
any other reasonable definition for the area under the graph of f ?
• Is Definition 8.1.5 consistent with the axioms of area listed in Subsection 8.1.1?
The answer to each of these questions is ‘yes,’ but we leave it to the reader to ponder why this
might be so. Some of the results given over the next few sections do confirm that our definition of
area under a graph is consistent with the axioms of area.
c
2020 School of Mathematics and Statistics, UNSW Sydney
166 CHAPTER 8. INTEGRATION
f (x) f (x) f (x) f (x)
2
C
1
|
a bx 1 x 1 2 x 1 x
(a) (b) (c) (d)
Example 8.2.1. Suppose that f (x) = C where C is a positive constant. Show that f is Riemann
integrable on [a, b] and calculate the area under the graph of f from a to b (see Figure 8.8 (a)).
Solution. Suppose that P is a partition of [a, b] and that P = {a0 , a1 , . . . , an }. Since f is constant,
f k = f k = C for every k between 1 and n. Therefore
n
X
S P (f ) = f k (ak − ak−1 )
k=1
n
X
= C(ak − ak−1 )
k=1
= C(a1 − a0 + a2 − a1 + a3 − a2 + · · · + an−1 − an−2 + an − an−1 )
= C(ak − a0 )
= C(b − a).
The same calculation gives S P (f ) = C(b − a). Since P is an arbitrary partition of [a, b], we have
Hence there is a unique number A satisfying (8.7), namely A = C(b − a). Therefore f is Riemann
integrable,
Z b
f (x) dx = C(b − a)
a
and the area under the graph of f is C(b − a).
Remark 8.2.2. The area under the graph of f is consistent with axiom (A5).
There are many other kinds of functions which are Riemann integrable. One of the largest
classes consists of those functions that are bounded and piecewise continuous.
Figure 8.9 illustrates the graphs of two functions f and g that are piecewise continuous on [a, b].
The function f is bounded on [a, b] while the function g is not. According to the next theorem, the
area of the shaded region under the graph of f is well-defined.
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.2. INTEGRATION USING RIEMANN SUMS 167
f (x) g(x)
b
b b
bc
b
| | | |
a b x a b x
Theorem 8.2.4. If f is bounded and piecewise continuous on [a, b] then f is integrable on [a, b].
lim S Pn (f ) = lim S Pn (f )
n→∞ n→∞
for some sequence of partitions Pn of [a, b]. If I denotes this common limit, then
Z b
f (x) dx = I.
a
Example 8.2.5. If f (x) = x2 , find the area under the graph of f from 0 to 1 (see Figure 8.8 (b)).
Solution. We first note that f is integrable on [0, 1] since it is bounded and continuous on [0, 1].
Suppose that
Pn = {0, n1 , n2 , n3 , . . . , n−1 n
n , n .}
Then
1 1 1 1 1 1
S Pn (f ) = + + 2 and S Pn (f ) = − + 2
3 2n 6n 3 2n 6n
(see Example 8.1.2 for these calculations). Since
1
lim S Pn (f ) = = lim S (f ),
n→∞ 3 n→∞ Pn
we conclude that Z 1
1
f (x) dx = .
0 3
So the area underneath the graph of f from 0 to 1 is 31 .
R2
Example 8.2.6. Suppose that f (x) = 3 − x. Calculate 1 f (x) dx (see Figure 8.8 (c)).
Our final example shows that some functions are not Riemann integrable. For such a function
f , the area under the graph of f is not defined according to Riemann integration.
c
2020 School of Mathematics and Statistics, UNSW Sydney
168 CHAPTER 8. INTEGRATION
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.4. BASIC PROPERTIES OF THE RIEMANN INTEGRAL 169
f (x) f (x)
Ω1
c c
a b x a b x
Ω2
(a) (b)
Each of these properties can be interpreted geometrically (see Figure 8.11). For example, under
the assumption that f (x) ≥ 0, property (iv) says that if the graph of f lies beneath the graph of g
then the area under the graph of f will be less than the area under the graph of g. Except for (i),
each of these properties is easy to prove.
c
2020 School of Mathematics and Statistics, UNSW Sydney
170 CHAPTER 8. INTEGRATION
f (x) y g y
f
f M
a c b x a b x a b x
(a) Prop 8.4.1(ii) (b) Prop 8.4.1 (iv) (c) Prop 8.4.1 (v)
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.5. THE FIRST FUNDAMENTAL THEOREM OF CALCULUS 171
Definition 8.4.2. Suppose that b < a and that f in integrable on [b, a]. Then we
define Z b
f (x) dx
a
to be the real number Z a
− f dx
b
and we define Z a
f (x) dx
a
to be 0.
This definition gives a more general version of Proposition 8.4.1 (ii). We omit the proof.
Proposition 8.4.3. Suppose that a, b and c are real numbers and that f is integrable over some
interval containing a, b and c. Then
Z b Z c Z b
f (x) dx = f (x) dx + f (x) dx.
a a c
For x in [a, b], the number F (x) represents the (signed) area captured underneath the graph of x
on the interval [a, x] (see Figure 8.12 (a)). Thus F is an ‘area function’.
(Note that the limit x in (8.8) is a different symbol from the ‘dummy variable’ t. It would be
incorrect and confusing to write Z x
F (x) = f (x) dx
a
for formula (8.8).)
We investigate some of the properties of F . First, it is easy to see that if f (t) ≥ 0 for all t in
[a, b] then the shaded area under f increases as x increases. That is, if f is a non-negative function
then F is an increasing function.
Second, we ask whether F continuous on [a, b] and differentiable on (a, b). Since differentiability
on (a, b) implies continuity on (a, b), we address the question of differentiability first. Consider the
difference quotient
F (x + h) − F (x)
h
c
2020 School of Mathematics and Statistics, UNSW Sydney
172 CHAPTER 8. INTEGRATION
y y = f (t) y y = f (t)
F (x)
| | | | | | |
a x b t a x x+h b t
(a) (b)
when x ∈ (a, b) and h is small. Now, if h is a small positive number then Figure 8.12 (b) shows
that
since f (t) ≈ f (x) for all t in [x, x + h] by the continuity of f . Dividing both sides by h gives
F (x + h) − F (x)
≈ f (x).
h
As h gets smaller, this approximation gets better and so we expect that
F (x + h) − F (x)
lim = f (x).
h→0+ h
A similar argument may be repeated when h is a small negative number to obtain the corresponding
left-hand limit. Therefore it seems true that
F (x + h) − F (x)
F ′ (x) = lim = f (x).
h→0 h
This intuitive argument can be made rigorous (see the proof at the end of this section). Hence we
conclude that F is continuous and differentable on (a, b) and that F ′ (x) = f (x) for all x in (a, b).
The fact that a continuous function f and its ‘area function’ F are related by the differential
equation
F ′ (x) = f (x)
has far-reaching consequences. This result is so important that it has come to be known as the
fundamental theorem of calculus.
Theorem 8.5.1 (The first fundamental theorem of calculus). If f is continuous function defined
on [a, b] then the function F : [a, b] → R, defined by
Z x
F (x) = f (t) dt, (8.9)
a
F ′ (x) = f (x)
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.5. THE FIRST FUNDAMENTAL THEOREM OF CALCULUS 173
The first fundamental theorem of calculus is a deep mathematical result which relates differen-
tiation to integration. From the statement of the theorem we note the following points.
• The fact that F satisfies the equation F ′ = f on (a, b) says that F is an antiderivative of
f on (a, b). Hence every continuous function f has an antiderivative F , given by formula
(8.9).
• Since any two antiderivatives of f differ by a constant (see Theorem 5.9.4), every an-
tiderivative of f is of the form
F + constant,
where F is given by the integral formula (8.9). Hence the process of integration and of
antidifferentiation are essentially the same. (This is surprising from a geometric point of
view, where integration is used to calculate areas while differentiation is used to calculate
gradients of tangents.)
• The above point suggests that integration and differentiation are inverse processes. If one
takes a function f , integrates it and differentiates the result, then one obtains f again.
That is, differentiation undoes what integration does to f . This is precisely expressed by
the formula Z x
d
f (x) = f (t) dt .
dx a
Whether or not the converse statement is true (that is, that integration undoes what
differentiation does to a function f ) will be discussed in the next section.
Thus the first fundamental theorem of calculus suggests that there may be a different method for
calculating the area under the graph of f ; rather than integrating via limits of Riemann sums, try
integrating via antidifferentiation. This approach is investigated in the next section.
We end this section by presenting a proof of the first fundamental theorem of calculus. The
proof of the differentiability of F is based on the intuitive argument given immediately prior to the
statement of Theorem 8.5.1.
Proof of Theorem 8.5.1. We begin by showing that F is differentiable on (a, b). Suppose that
x ∈ (a, b) and consider a small positive number h. By Proposition 8.4.1 (ii) one can show that
Z x+h
F (x + h) − F (x) = f (t) dt. (8.10)
x
Since f is continuous on [a, b], it attains a minimum value mh and a maximum value Mh on [x, x+h];
that is,
mh ≤ f (t) ≤ Mh ∀t ∈ [x, x + h].
By Proposition 8.4.1 (v) (see also Figure 8.13),
Z x+h
mh h ≤ f (t) dt ≤ Mh h
x
and hence
mh h ≤ F (x + h) − F (x) ≤ Mh h.
by (8.10). Now h > 0 so
F (x + h) − F (x)
mh ≤ ≤ Mh .
h
c
2020 School of Mathematics and Statistics, UNSW Sydney
174 CHAPTER 8. INTEGRATION
y
Mh y = f (x)
mh
x x+h x
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.6. THE SECOND FUNDAMENTAL THEOREM OF CALCULUS 175
Theorem 8.6.1 (The second fundamental theorem of calculus). Suppose that f is a continuous
function on [a, b]. If F is an antiderivative of f on [a, b] then
Z b
f (t) dt = F (b) − F (a). (8.11)
a
The second fundamental theorem of calculus is an astonishing result. To calculate the signed
area under the graph of a continuous function f , all one needs to know is the value of any an-
tiderivative of f at the endpoints a and b. To illustrate the power of this result, compare the
lengthy calculation of Riemann sums in Example 8.1.2 with the example below.
Example 8.6.2. Suppose that f (x) = x2 for all real numbers x. Calculate the area of the region
bounded by the x-axis, the graph of f and lines x = 0 and x = 1.
t3
where f (t) = t2 (see Figure 8.4 (a)). An antiderivative F of f is given by F (t) = 3. Hence, by the
second fundamental theorem of calculus,
Z 1
13 03 1
f (t) dt = F (1) − F (0) = − = .
0 3 3 3
Remark 8.6.3. The expression F (b) − F (a) appearing in (8.11) is used so frequently that it is
often abbreviated to b h ib
F (x) or F (x) .
a a
When calculating areas, it is important to bear in mind that the integral in (8.11) represents
the signed area.
Example 8.6.4. Calculate the area of the region bounded by the x-axis, the lines x = 0 and x = 4
and the function f given by
f (x) = (x − 2)(x − 4).
c
2020 School of Mathematics and Statistics, UNSW Sydney
176 CHAPTER 8. INTEGRATION
y
y = f (x)
2 4 x
Solution. The above sketch shows that part of the region lies beneath the x-axis. So the area we
seek corresponds to the integral
Z 4 Z 2 Z 4
|f (x)| dx = f (x) dx + (−f (x)) dx
0 0 2
Z 2 Z 4
= x2 − 6x + 8 dx − x2 − 6x + 8 dx
0 2
4 2
x3 x32 2
= − 3x + 8x − − 3x + 8x
3 3
0 2
8
= − 12 + 16 − 0
3
64 8
− − 48 + 32 − − 12 + 16
3 3
= 8.
So the area is 8 square units.
Z 4
Note that this is a different problem to: Evaluate f (x) dx, whose solution is:
0
Z 4 Z 4
f (x) dx = x2 − 6x + 8 dx
0 0
4
x3
= − 3x2 + 8x
3 0
16
= .
3
We now present a proof of the second fundamental theorem of calculus. The tools used in the
proof are the first fundamental theorem of calculus and a corollary of the mean value theorem.
Proof of Theorem 8.6.1. Suppose that f is continuous on [a, b] and that F is an antiderivative of
f on [a, b]. Define a function G : [a, b] → R by the formula
Z x
G(x) = f (t) dt ∀x ∈ [a, b].
a
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.6. THE SECOND FUNDAMENTAL THEOREM OF CALCULUS 177
By the first fundamental theorem of calculus (see Theorem 8.5.1), G is continuous on [a, b] and
G′ (x) = f (x) for all x in (a, b). Hence G is also an antiderivative of f on [a, b]. By Theorem 5.9.4,
there is a constant C such that
G(x) = F (x) + C (8.12)
for all x in [a, b]. Note that Z a
G(a) = f (t) dt = 0.
a
0 = F (a) + C.
as required.
We return to the question of whether integration and differentiation are inverse processes.
Suppose that f is continuous on [a, b]. The first fundamental theorem of calculus states that if one
integrates a function f and differentiates the result, then one obtains f again. This is expressed by
the formula Z x
d
f (x) = f (t) dt ∀x ∈ (a, b).
dx a
To be able to say that integration and differentiation are inverses operations of each other, we must
ask whether the converse is true. That is, if one differentiates a function f and integrates the result,
then does one obtain f again? In precise terms, we would like the formula
Z x
f (x) = f ′ (t) dt ∀x ∈ (a, b) (8.13)
a
to hold. Unfortunately, this converse is false. The following corollary to the second fundamental
theorem of calculus says that one obtains f again only when f (a) = 0. Otherwise formula (8.13) is
‘out by a constant.’
Corollary 8.6.5. Suppose that f is continuous on [a, b] and has a continuous derivative on (a, b).
Then Z x
f ′ (t) dt = f (x) − f (a)
a
Proof. Apply the second fundamental theorem of calculus to the function f ′ on the interval [a, x].
The second fundamental theorem of calculus allows a fruitful interaction between Riemann
sums and antidifferentiation.
c
2020 School of Mathematics and Statistics, UNSW Sydney
178 CHAPTER 8. INTEGRATION
1
Example 8.6.6. Suppose that f (x) = . By considering the lower Riemann sum of f with
1 + x2
respect to the partition
{0, n1 , n2 , . . . , n−1 n
n , n}
Solution. Let Pn denote the above partition of [0, 1]. Since f is a decreasing function on [0, 1], the
lower Riemann sum is given by
n
X 1 1
S Pn (f ) =
n 1 + (k/n)2
k=1
n
X n
= .
n2 + k2
k=1
Hence
n
π X n
= lim
4 n→∞ n + k2
2
k=1
as desired.
Remark 8.6.7. The solution to Example 8.6.6 uses the following result. Suppose that f is Riemann
integrable on [a, b] and that {Pn }∞
n=1 is a sequence of partitions of [a, b]. Let sn denote the maximum
size of the subintervals generated by Pn . If lim sn = 0 then
n→0
Z b
lim S Pn (f ) = f (x) dx = lim S Pn (f ).
n→∞ a n→∞
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.7. INDEFINITE INTEGRALS 179
for some suitable constant C. If we have no particular interest in the interval [a, x] and merely
want to indicate that F is an antiderivative of f , then we write
Z
f (t) dt = F (t) + C. (8.14)
An integral expressed in this way, without limits, is called an indefinite integral. The constant C
in (8.14) is called the constant of integration.
The table of antiderivatives given in Section 5.9 is equivalent to the following table of indefinite
integrals.
Indefinite integrals
Z
1
xr dx = xr+1 + C, where r is a rational number and r 6= −1
r+1
Z
sin x dx = − cos x + C
Z
cos x dx = sin x + C
Z
1
eax dx = eax + C
a
Z ′
f (x)
dx = ln |f (x)| + C
f (x)
Since differentiation and indefinite integration (or antidifferentiation) are inverse processes, they
have many analogous properties. For example, differentiation behaves linearly, which means that
and
(αf )′ (x) = α × f ′ (x)
for every real number α and every differentiable function f and g. The next proposition says that
indefinite integration is also behaves linearly. (We have already shown in Proposition 8.4.1 that
definite integration is linear.)
Proposition 8.7.1. Suppose that f and g are integrable functions and that α is a real constant.
Then Z Z Z
(f + g)(x) dx = f (x) dx + g(x) dx (8.15)
and Z Z
(αf )(x) dx = α f (x) dx. (8.16)
Proof. We prove (8.15) only and leave the proof of 8.16 as an exercise. Suppose that F , G and H
are antiderivatives of f , g and f + g respectively. Then
c
2020 School of Mathematics and Statistics, UNSW Sydney
180 CHAPTER 8. INTEGRATION
Remark 8.7.3. It is standard practice to combine of all constants of integration and simply write
‘+C’. Thus the calculation in Example 8.7.2 may be written as
Z Z Z Z
(10x + 8 sin x − 9e ) dx = 10 x dx + 8 sin x dx − 9 ex dx
4 x 4
1
= 10 × x5 + 8(− cos x) − 9ex + C
5
= 2x − 8 cos x − 9ex + C.
5
as can be easily verified by differentiating the right-hand side via the chain rule. On the other
hand, Z
f (u) du = F (u) + C. (8.18)
If we make the substitution u = g(x) then the right-hand sides of (8.17) and (8.18) are the same.
This suggests the substitution du = g′ (x) dx, so that the left-hand side of (8.17) is transformed
into the left-hand side of (8.18). These mechanical substitutions can be used to transform the
complicated integral into a simpler one.
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.8. INTEGRATION BY SUBSTITUTION 181
u = x2 − 3x + 4.
Since
du
= 2x − 3,
dx
we also make the substitution
du = (2x − 3) dx.
Then
Z Z
2
(2x − 3) cos(x − 3x + 4) dx = cos(u) du
= sin u + C
= sin(x2 − 3x + 4) + C.
We can verify our answer by differentiating sin(x2 − 3x + 4) via the chain rule.
du
Solution. If u = 1 + x3 then dx = 3x2 . Hence we make the following substitutions:
u = 1 + x3
du = 3x2 dx
u = 1 (when x = 0)
u = 9 (when x = 2).
Thus
Z 2 p Z 9√
3x2 1 + x3 dx = u du
0 1
9
2 3/2
= u
3 1
2
= (93/2 − 13/2 )
3
52
= .
3
In general, the reversal of the chain rule via integration gives rise to the change of variables
formula
Z b Z g(b)
′
f (g(x))g (x) dx = f (u) du, (8.19)
a g(a)
c
2020 School of Mathematics and Statistics, UNSW Sydney
182 CHAPTER 8. INTEGRATION
The substitutions
u = g(x), du = g ′ (x) dx,
are only to be understood within this context. Precise conditions under which formula (8.19) holds
is given by the following theorem.
Theorem 8.8.3 (Change of variables formula). Suppose that g is a differentiable function such
that g′ is continuous on [a, b]. If f is continuous on any interval I containing g(a) and g(b) then
the change of variables formula (8.19) holds.
Proof. Since f is continuous on I it has an antiderivative F : I → R by the first fundamental
theorem of calculus. By the chain rule (Theorem 4.2.2),
(F ◦ g)′ (x) = F ′ (g(x))g ′ (x)
= f (g(x))g ′ (x).
So by two applications of the second fundamental theorem of calculus,
Z b
f (g(x))g ′ (x) dx = (F ◦ g)(b) − (F ◦ g)(a)
a
= F (g(b)) − F (g(a))
Z g(b)
= f (u) du,
g(a)
Finding a fruitful substitution is not always easy (and sometimes not even possible). In principle,
one looks for a function g in the integrand whose derivative g ′ is also in the integrand. Once such
a function g is identified, try the substitution u = g(x). As illustrated below, it may be necessary
to manipulate the integrand to implement this strategy.
Example 8.8.4. Evaluate √
Z
e x
√ dx.
x
Solution. Note that
d √ 1
x = √
dx 2 x
√
and so if g(x) = x then g ′ (x) appears in the integrand (up to the constant factor 1/2). So the
substitution we use is
√ 1
u = x, du = √ dx.
2 x
Hence
Z √x Z √x
e e
√ dx = 2 √ dx
x 2 x
Z
= 2 eu du
= 2eu + C
√
x
= 2e + C.
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.8. INTEGRATION BY SUBSTITUTION 183
There are occasions when we make a substitution of the form x = g(u) rather than u = g(x).
Example 8.8.5. Evaluate the integral I, where
Z 9
dx
I= √ .
4 2+ x
Solution. If the integral were Z 9
dx
,
4 2+x
then it would be easy to evaluate. The difficulty with I clearly lies with the square root appearing
in the integrand. To remove the square root, we use the substitution x = u2 . This gives
x = u2
dx = 2u du
u = 2 when x = 4
u = 3 when x = 9.
(Note that we are implicitly assuming that u > 0. It would be incorrect to take the limits in the
variable u to be −2 and 3, for example.) Hence
Z 3
2u du
I=
2 2+u
Z 3
u
=2 du
2 2 + u
Z 3
2+u−2
=2 du
2 2+u
Z 3
2
=2 1− du
2 2+u
h i3
= 2 u − 2 ln |2 + u|
2
4
= 2 + 4 ln .
5
If an integrand possesses symmetry, then the corresponding definite integral may be easy to
evaluate. For example, if f is an odd function and we integrate f over a balanced interval [−a, a],
then the ‘negative area’ cancels out the ‘positive area’ and consequently the integral is zero (see
Figure 8.14). This result is proved using integration by substitution.
Proposition 8.8.6. Suppose that f is a continuous function and a is a real number.
(i) If f is even then Z Z
a a
f (x) dx = 2 f (x) dx.
−a 0
c
2020 School of Mathematics and Statistics, UNSW Sydney
184 CHAPTER 8. INTEGRATION
f (x)
−a
| |
a x
Proof. We prove (ii) only and leave the proofs of (i) and (iii) as an exercise. Suppose that f is an
odd function. By breaking up the integral and using a substitution we have
Z a Z 0 Z a
f (x) dx = f (x) dx + f (x) dx
−a −a 0
Z 0 Z a
=− f (−u) du + f (x) dx (substituting u = −x)
Z aa Z a0
= f (−u) du + f (x) dx
0
Z a Z0 a
=− f (u) du + f (x) dx (since f (−u) = f (u))
0 0
= 0,
Solution. If f (x) = sin5 x then f is both odd and periodic with period 2π. Hence
Z π Z
17
+2π 2π
5
sin x dx = sin5 x dx (by Proposition 8.8.6 (iii) when a = π
17 )
π
17
0
Z π
= sin5 x dx (by Proposition 8.8.6 (iii) when a = −π)
−π
=0 (by Proposition 8.8.6 (ii)).
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.9. INTEGRATION BY PARTS 185
Solution. We set
u=x v = 12 e2x
u′ = 1 v ′ = e2x .
= 12 xe2x − 14 e2x + C,
c
2020 School of Mathematics and Statistics, UNSW Sydney
186 CHAPTER 8. INTEGRATION
and the integral on the right-hand side is harder to evaluate than the one we started with.)
Sometimes integration by parts must be applied several times to evaluate the integral.
Example 8.9.3. Find the integral I, where
Z π
I= x2 sin x dx.
0
| |
a R x
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.10. IMPROPER INTEGRALS 187
Definition 8.10.1. (a) Suppose that there is a real number L such that
Z R
f (x) dx → L
a
f (x) dx = L.
a
Rb
One can write down a similar definition for improper integrals of the form −∞ f (x) dx.
Example 8.10.2. Evaluate the following improper integrals or show that they diverge:
Z ∞
1
(a) 2
dx;
0 x +1
Z ∞
1
(b) √ dx;
1 x
Z 0
(c) e2x dx.
−∞
c
2020 School of Mathematics and Statistics, UNSW Sydney
188 CHAPTER 8. INTEGRATION
We now consider improper integrals whose interval of integration is the entire real line.
Z ∞
The following example shows that the improper integral f (x) dx may diverge even if
−∞
Z R
lim f (x) dx exists.
R→∞ −R
However Z R h
1 2
iR R2
x dx = 2 x = →∞
0 0 2
as R → ∞. Hence Z ∞
x dx
0
diverges, and by Definition 8.10.3 Z ∞
x dx
−∞
diverges also.
The following proposition determines the convergence or divergence of improper integrals whose
integrand is a power of x. Such integrals will be used frequently in Section 8.11
Proposition 8.10.5 (Convergence and divergence of p-integrals). The improper integral
Z ∞
1
dx
1 xp
is convergent if p > 1 and divergent if p ≤ 1.
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.11. COMPARISON TESTS FOR IMPROPER INTEGRALS 189
Proof. If p 6= 1 then
Z R R
x1−p
x−p dx =
1 1−p 1
R1−p − 1
=
1−p
(
1
when 1 − p < 0
→ p−1
∞ when 1 − p > 0
as R → ∞. Hence the integral converges when 1 − p < 0 (that is, when p > 1) and diverges when
1 − p > 0 (that is, when p < 1).
In the case when p = 1, we have
Z R h iR
1
dx = ln x = ln R − ln 1 → ∞ as R → ∞.
1 x 1
Remark 8.10.6. While you may be familiar with the fact that the ln function is unbounded, the
proof that ln R → ∞ as R → ∞ requires the use of Riemann sums and is given later in §9.2.
have integrands without any antiderivative in the elementary functions. In these cases, determining
convergence via a computation like
Z ∞ Z R
f (x) dx = lim f (x) dx = lim F (R) − F (a),
a R→∞ a R→∞
(where F is an antiderivative
R∞ of f ) is not a viable strategy. Instead, we compare the integral of f
with another integral a g(x) dx whose behaviour is known. The basic idea is as follows.
Suppose that 0 ≤ f (x) ≤ g(x) whenever x > a, as illustrated below.
y
g
a x
c
2020 School of Mathematics and Statistics, UNSW Sydney
190 CHAPTER 8. INTEGRATION
Obviously the area under the graph of g is greater than that under the graph of f . Hence if the
area under g is finite then the area under f is also finite. If the area under f is infinite then so too
is the area under g. The next theorem expresses this conclusion in terms of improper integrals.
Theorem 8.11.1 (The comparison test). Suppose that f and g are integrable functions and that
0 ≤ f (x) ≤ g(x) whenever x > a.
Z ∞ Z ∞
(i) If g(x) dx converges then f (x) dx converges.
a a
Z ∞ Z ∞
(ii) If f (x) dx diverges then g(x) dx diverges.
a a
Proof. We begin by proving R(i). Suppose that f and g are integrable functions, that 0 ≤ f (x) ≤ g(x)
∞
for all x in [a, ∞) and that 0 g(x) dx converges. Then
Z R Z R
0≤ f (x) dx ≤ g(x) dx
a a
When applying the comparison test, we often compare an improper integral I with
Z ∞
1
dx,
1 xp
since the behaviour of this second integral is known (see Proposition 8.10.5). The value of p is chosen
by analysing the dominant terms appearing in the integrand of I. The next example illustrates
this procedure.
Example 8.11.2. Determine whether or not the following improper integrals converge.
Z ∞
1
(a) 3
dx
1 x +1
Z ∞
x+2
(b) 3/2
dx.
2 x −1
Solution. (a) When x is large, the dominant term in the denominator of
1
x3 + 1
is x3 . So intuitively,
1 1
≈ 3
x3 + 1 x
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.11. COMPARISON TESTS FOR IMPROPER INTEGRALS 191
To make this comparison rigorous, we seek an appropriate inequality. Note that x3 + 1 > x3
for all x > 1 and so
1 1
< 3
x3 + 1 x
for all x > 1. Since Z ∞
1
dx
1 x3
converges (see Proposition 8.10.5), Z ∞
1
dx
1 x3 +1
also converges by the comparison test.
There are many examples where the ‘dominant term analysis’ is straightforward but it is difficult
to obtain an appropriate inequality for successful use of the comparison test. This difficulty can
sometimes be overcome by multiplying one of the integrands by a ‘fudge factor’. The name ‘fudge
factor’ might sounds like we’re about to ‘cheat’ and do something that is not mathematically
rigourous, but despite the name we really will be finding an inequality that will rigourously prove
convergence or divergence of our improper integral.
Now the integral to the right converges (by Proposition 8.10.5) but it is clear that
1 1
> 3/2
x3/2 −1 x
when x > 2, so we cannot immediately apply the comparison test.
However, by multiplying the integrand on the right with the ‘fudge factor’ 2, we see that
1 2
< 3/2 (8.22)
x3/2 − 1 x
c
2020 School of Mathematics and Statistics, UNSW Sydney
192 CHAPTER 8. INTEGRATION
Finding a fudge factor can be a lot of work in itself. Later we give a method for determining
the convergence of this integral that does not rely on the use of fudge factors.
Dominant term analysis is not always straightforward, and at times may not even be useful.
One must develop an intuition of how functions decay at infinity and of what comparisons to use.
Example 8.11.4. Determine whether or not the following improper integrals converge.
Z ∞ √
(a) e− x dx
1
Z ∞
sin x + 2
(b) √ dx.
1 x+1
Solution. (a) Dominant term analysis does not apply since there is only
√ one term appearing in the
integrand. Instead, we recognise that the decay of the exponential e− x is very rapid; it is certainly
much faster than the decay of x12 . This can be verified by the following limit calculation:
√
e− x x2
lim 1 = lim
x→∞ x→∞ ex1/2
x2
2x
= lim (by l’Hôpital’s rule)
x→∞ 1 x−1/2 ex1/2
2
4x3/2
= lim 1/2
x→∞ ex
4. 23 x1/2
= lim (by l’Hôpital’s rule)
x→∞ 1 x−1/2 ex1/2
2
12x
= lim
x→∞ ex1/2
..
. (two more applications of l’Hôpital’s rule)
24
= lim
x→∞ ex1/2
= 0.
1
√ .
x
Since Z ∞
1
√ dx
1 x
diverges, we seek an inequality of the form
sin x + 2
f (x) ≤ √
x+1
√
where f (x) is 1/ x with an appropriate fudge factor.
Now, −1 ≤ sin x, and so
sin x + 2 1
√ ≥√
x+1 x+1
1
≥ √
2 x
The next theorem provides an alternative approach for constructing a comparison of integrals.
It is particularly useful if dominant term analysis is straightforward but one wants to avoid the use
of fudge factors.
Theorem 8.11.5 (The limit form of the comparison test). Suppose that f and g are non-negative
and bounded on [a, ∞). If
f (x)
lim =L
x→∞ g(x)
c
2020 School of Mathematics and Statistics, UNSW Sydney
194 CHAPTER 8. INTEGRATION
The hypothesis
f (x)
lim =L
x→∞ g(x)
indicates that f and g exhibit similar behaviour at infinity. The proof of the theorem will be given
at the end of this section.
Example 8.11.6. Determine whether or not the following improper integrals converge.
Z ∞
1
(a) √ dx
1 1 + x + x2
Z ∞ 2
x + 3x
(b) dx.
2 5x4 − 2
1
Solution. (a) Suppose that f (x) = √ . We need to find a suitable function g to compare
1 + x + x2
f with. As x gets large, the dominant term under the square root is x2 . Therefore it seems that
1 1 1
√ ≈√ =
1 + x + x2 x2 x
when x is large. So define g by g(x) = x1 . The following calculation verifies that our choice of g is
suitable:
r s
f (x) x x2 1
lim = lim √ = lim = lim 1 = 1.
x→∞ g(x) x→∞ 1+x+x 2 x→∞ 1+x+x 2 x→∞
x2
+ x1 + 1
Remark 8.11.7. As a general rule, students will find the limit form of the comparison test easier
to use than the inequality form of the comparison test when the integrand is a rational function,
or is a ratio of roots of polynomials.
Proof of Theorem 8.11.1 (MATH1141 only). Suppose that f and g are non-negative and bounded
on [a, ∞). (It follows, for later use, that if a < M < R and R is increasing then the integrals
Z R Z R
f (x) dx and g(x) dx,
M M
f (x)
lim =L
x→∞ g(x)
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.11. COMPARISON TESTS FOR IMPROPER INTEGRALS 195
where 0 < L < ∞. Choose a positive number ǫ such that 0 < ǫ < L. By Definition 2.2.1, there is
a positive real number M such that
f (x)
g(x) − L < ǫ
R∞ RR
• if M f (x) dx is divergent then M g(x) dx → ∞ as R → ∞; and
Z ∞ RR
• if g(x) dx is divergent then M f (x) dx → ∞ as R → ∞.
M
Hence either Z Z
∞ ∞
f (x) dx and g(x) dx both converge
M M
or Z Z
∞ ∞
f (x) dx and g(x) dx both diverge.
M M
The proof of the theorem now easily follows by first observing that
Z R Z M Z R
f (x) dx = f (x) dx + f (x) dx
a a M
and
Z R Z M Z R
g(x) dx = g(x) dx + g(x) dx
a a M
c
2020 School of Mathematics and Statistics, UNSW Sydney
196 CHAPTER 8. INTEGRATION
Si(x)
π
|
2
| |
−10π 10π x
− π2
|
Figure 8.15: The Si function.
• Symmetries of f can help in understanding the behaviour of F (see, for example, Propo-
sition 8.8.6).
We give one example. Others may be found in the problem set for Chapter 8 and at the beginning
of Chapter 9.
Consider the function f : R → R given by
(
sin t
t if t 6= 0
f (t) =
1 if t = 0.
sin t
Since lim = 1, the function f is continuous everywhere. We can now define the function
t→0 t
Si : R → R by Z x
Si(x) = f (t) dt ∀x ∈ R.
0
The name Si comes from ‘sine integral.’ This function is used in electrical engineering for signal
processing and in surveying for the Global Positioning System (GPS). Its graph is plotted in Figure
8.15.
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.12. FUNCTIONS DEFINED BY AN INTEGRAL 197
Calculate G′ (x).
Z 0
Solution. (a) Si(0) = f (t) dt = 0.
0
(b) We need to show that Si(−x) = − Si(x). Now
Z −x
sin t
Si(−x) = dt
0 t
Z x
sin(−u)
= (−du) (using the substitution t = −u)
0 −u
Z x
− sin u
=− du
−u
Z0 x
sin u
=− du
0 u
= − Si(x).
d 2x sin(x2 ) 2 sin(x2 )
G′ (x) = Si(x2 ) = 2x Si′ (x2 ) = 2x × f (x2 ) = = .
dx x2 x
(If this calculation is too dense, then consider
y = G(x), u = x2
so that Z u
dy
y= f (t) dt and = f (u)
0 du
c
2020 School of Mathematics and Statistics, UNSW Sydney
198 CHAPTER 8. INTEGRATION
dy dy du 2x sin(x2 ) 2 sin(x2 )
= = f (u) × 2x = 2xf (x2 ) = = .
dx du dx x2 x
2 sin(x2 )
Hence G′ (x) = x .)
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.13. MAPLE NOTES 199
− (ln (x))−1
> int(1/(x*log(x)^2),x=2..infinity);
(ln (2))−1
> int( exp(-x^2)*ln(x), x=0..1 );
R 1 −x2
0e ln (x) dx
> evalf(%);
−0.9059404763
c
2020 School of Mathematics and Statistics, UNSW Sydney
200 CHAPTER 8. INTEGRATION
1
2. [R] An electrical signal S(t) has its amplitude |S(t)| tested (sampled) every 10 of a second.
It is desired to estimate the energy over a period of half a second, given exactly by
Z 1
!1
2
2
|S(t)|2 dt .
0
t .1 .2 .3 .4 .5
|S(t)| 60 50 50 45 55
e(t) 5 3 7 4 10
a) Using the above data for S(t), set up an appropriate Riemann sum and compute an
approximate value for the energy.
b) It is known that the signal varies by an amount of at most ±e(t), as shown above, in
1
each 10 second period. Calculate upper and lower bounds for the energy.
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 8 201
4. [R] Find the area of the region bounded by the line y = x and the parabola y = x2 − 2.
5. [R] Find
Z 9 Z 2
x3 − x
a) dx b) |x| dx.
4 x3/2 −4
1 1
Z 1
1
7. [R] Explain why dx = − = −1 − 1 = −2 is not valid.
−1 x2 x −1
8. [H]
a) Suppose that f is a continuous increasing (and hence invertible) function on [a, b]. If
c = f (a), d = f (b) and a, b, c, d ≥ 0, then explain why
Z d Z b
f −1 (t) dt = bd − ac − f (x) dx.
c a
Z 1
b) Use this to find sin−1 x dx.
1/2
c
2020 School of Mathematics and Statistics, UNSW Sydney
202 CHAPTER 8. INTEGRATION
11. [H] Suppose that f (t) = sin(t2 ). Sketch the graph of f on Zthe interval [0, 3]. Use this to
x
sketch the graph of F on the interval [0, 3], where F (x) = f (t) dt. Indicate where F
0
has local maxima and minima.
12. [R] [V] Find F ′ (x) for each function F : R → R given below.
Z x Z x3
2
a) F (x) = sin(t ) dt b) F (x) = sin(t2 ) dt
0 0
Z 1 Z x3
2
c) F (x) = sin(t ) dt d) F (x) = sin(t2 ) dt
x3 x
Z 4
d
13. [R] Find (5 − 4t)5 dt.
dx x
14. [R]
1
a) Suppose that f (x) = . By considering the lower Riemann sum for f with respect
x
to the partition
n n+1 n+2 2n
, , ,...,
n n n n
of [1, 2], show that
1 1 1
ln 2 = lim + + ··· + .
n→∞ n+1 n+2 2n
1
b) Suppose that f (x) = √ .
1 − x2
i) Show that f is increasing on the interval [0, 12 ].
ii) Find the upper Riemann sum for f with respect to the partition
0 1 2 3 n
, , , ,...,
2n 2n 2n 2n 2n
of [0, 21 ].
iii) Hence evaluate
1 1 1 1
lim √ +√ +√ + ··· + √ .
n→∞ 4n2 − 12 4n2 − 22 4n2 − 32 4n2 − n2
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 8 203
19. [R] [V] Evaluate the following improper integrals or show that they diverge.
Z ∞ Z 1 Z ∞
dx
a) e−5x dx b) e−0.01x dx c)
0 −∞ 0 4 + x2
Z ∞ Z ∞ Z ∞
4 dx dx
d) x3 e−x dx e) 3/2
f)
−∞ 2 (x − 1) e x ln x
Z ∞
20. [H] Prove that xn e−x dx = n! whenever n = 0, 1, 2, . . . .
0
21. [H]
c
2020 School of Mathematics and Statistics, UNSW Sydney
204 CHAPTER 8. INTEGRATION
Z R
x
a) Find lim dx.
R→∞ −R 1 + x2
Z 2R
x
b) Find lim dx.
R→∞ −R 1 + x2
Z ∞
x
c) Does dx converge? Explain.
−∞ 1 + x2
22. [R] Use the inequality form of the comparison test to determine whether or not the
following improper integrals converge.
Z ∞ Z ∞ Z ∞
1 1 1
a) [V] √ dx b) √ dx c) dx
1 1+x 4
2
3 2
x −x 2 ln x
23. [R] Use the limit form of the comparison test to determine whether or not the following
improper integrals converge.
Z ∞ Z ∞ Z ∞
x 2x − 1 1
a) 3−1
dx b) 2+2
dx c) √ dx
2 2x 1 x 2
6
x −1
24. [R] Use a comparison test to determine whether or not the following improper integrals
converge.
Z ∞ Z ∞ 3 Z ∞
3x + sin x + 2 4x − x + 5 ln t
a) 3
dx b) 4 2
dx c) [H] dt
1 2x − x + 8 4 x −x +1 2 t3/2
25. [H] Find all real numbers s such that the improper integral
Z ∞
xs
dx
1 1+x
is convergent.
Z ∞
1
26. [H] Find all real numbers p such that dx converges.
2 x(ln x)p
Z ∞
xb
27. [H] For which pairs of numbers (a, b) does the improper integral dx con-
1 (1 + x2 )a
verge?
28. [R] [V] Given a positive real number x, let π(x) denote the number of primes less than
or equal to x. The function Li with domain (1, ∞) is given by
Z x
1
Li(x) = dt
2 ln t
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.13. MAPLE NOTES 205
and is known as the ‘logarithmic integral function’. It has the property that
Li(x)
≈1
π(x)
The function erf is an error function and can be used to calculate the probability that a
measurement has an error in a given range of values.
c
2020 School of Mathematics and Statistics, UNSW Sydney
206 CHAPTER 8. INTEGRATION
c
2020 School of Mathematics and Statistics, UNSW Sydney
207
Chapter 9
The volume of a colony of bacteria on an agar plate is 1 cubic millimetre and doubles every day.
Let V (t) denote the volume of bacteria after t days. It is clear that
if t is a nonnegative integer. The corresponding points are plotted in Figure 9.1 (a).
If one wanted to determine the volume of the colony part-way through a day, it would be natural
to use the formula
V (t) = 2t ,
where t is a nonnegative real number. This formula makes sense when t is a rational number. If
t = p/q where p and q are integers and q > 0, then 2t is defined to be the unique positive qth root
of 2p√. However, if t is an irrational number, this definition does not apply. So what do we mean
by 2 3 or 2π ? We have not yet seen a definition for such numbers.
That this shortcoming in our current definition of 2t is a serious problem can be illustrated
graphically. The graph corresponding to V (t) = 2t , where t is confined to the nonnegative rational
numbers, has an infinite number of ‘gaps’ (see Figure 9.1 (b)). Clearly this is not a satisfactory
state of affairs. One of the aims of this chapter is to rectify this deficiency and show that, when
t is an irrational number, 2t can be defined in such a way that the graph of V : [0, ∞) → R is
continuous (see in Figure 9.1 (c)).
Our plan of attack is the following.
1. In Section 9.2, we define the function ln as an integral (by the fundamental theorem of
calculus).
2. In Section 9.3, we define the function exp as the inverse of ln (by the inverse function
theorem).
3. In Section 9.4, we define the number bt , where b > 0 and t is an irrational number, by a
formula involving both ln and exp.
c
2020 School of Mathematics and Statistics, UNSW Sydney
208 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS
V (t) V (t) V (t)
16 b
16 16
|
|
b
b
b
| | |
We shall see that when the definition of bt (for rational t) is combined with the definition of bt (for
irrational t), the resulting function f : R → R, given by
f (t) = bt ∀t ∈ R,
is continuous.
Definition 9.1.1. Suppose that b is a positive real number. If p and q are integers
and q > 0 then bp/q is defined to be the unique positive qth root of bp .
Definition 9.1.2. Suppose that b is a positive real number not equal to one, c is a
rational number and a = bc . Then logb a is defined by the formula
c = logb a.
Note that, according to Definition 9.1.2, one cannot write down logb a for any positive number
a (even if a is rational); logb a is only defined if a = bc for some rational number c. (This short-
coming in the domain of the function logb will be rectified later in the chapter when Definition
9.1.2 is superseded by a superior definition.)
c
2020 School of Mathematics and Statistics, UNSW Sydney
9.2. THE NATURAL LOGARITHM FUNCTION 209
The elementary properties of logarithms are summarised below: if x > 0, y > 0 and b > 0 with
b 6= 1 then
logb 1 = 0, logb b = 1,
logb (xy) = logb x + logb y, logb xy = logb x − logb y,
logb (br ) = r, logb (xr ) = r logb x. (9.1)
Remark 9.1.3. In the properties listed above, it is implicitly assumed that r is a rational number,
and that x and y are of the form bc for some rational number c. One of the aims of this chapter is
to remove these awkward assumptions.
1 x t
One can also immediately see, by the first fundamental theorem of calculus, that
d 1
ln x = .
dx x
This and other properties of ln are stated in the proposition below.
c
2020 School of Mathematics and Statistics, UNSW Sydney
210 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS
(iii) ln x → −∞ as x → 0+ and ln x → ∞ as x → ∞;
(iv) ln(xy) = ln(x) + ln(y) for all positive real numbers x and y;
(v) ln xy = ln(x) − ln(y) for all positive real numbers x and y; and
(vi) ln(xr ) = r ln(x) whenever r is a rational number and x is a positive real number.
Proof. (i) Apply the first fundamental theorem of calculus (Theorem 8.5.1) to the definition of ln.
1
R ∞ 1t > 0 when t > 0.
(ii) This follows from Definition 8.4.2 and the fact that
(iii) We need to show that the improper integral 1 t dt diverges to infinity. The diagram
below shows that
Z 2 Z 4 Z 8
dt 1 dt 1 dt 1
≥1× , ≥2× , ≥4×
1 t 2 2 t 4 4 t 8
and so on.
y 1
y=
t
1 2 4 8 t
Hence
Z 2n
dt 1 1 1
≥ + + ··· +
1 t |2 2 {z 2}
n terms
n
=
2
→∞
as n → ∞. Hence ln x → −∞ as x → 0+ .
(iv) Suppose that y is some fixed positive real number and that x ∈ (0, ∞). Then, by the chain
rule and part (i),
d 1 1 d
ln(xy) = y ln′ (xy) = y × = = ln(x) .
dx xy x dx
c
2020 School of Mathematics and Statistics, UNSW Sydney
9.2. THE NATURAL LOGARITHM FUNCTION 211
Hence
ln(xy) = ln(x) + C
ln(y) = ln(1) + C = 0 + C = C
as required.
(v) The proof uses the same technique as the proof of (vi) and is left to the reader.
(vi) This proof also uses the same technique as the proof of (iv).
The fact that ln(1) = 0 and that ln satisfies properties (iv), (v) and (vi) suggests that ln coincides
with one of the logarithm functions logb for some base b (with the advantage that Dom(ln) =
(0, ∞)). We now aim to identify the base.
The function ln is increasing and continuous with Range(ln) = R (see Proposition 9.2.2 (i) and
(ii)). Hence, by the intermediate value theorem, there is exactly one real number x that satisfies
the equation
ln(x) = 1.
We call this solution e (after Leonard Euler, who was probably the greatest mathematician of the
eighteenth century).
Definition 9.2.3. The real number e is defined to be the unique number x that
satisfies the equation Z x
1
dt = 1.
1 t
Thus ln(e) = 1. One can show that e is irrational and that e ≈ 2.71828. From Proposition
9.2.2, we see now that
ln(er ) = r
for every rational number r. This equation shows that ln coincides with the logarithm to the base
e. Since the number e arises naturally in many contexts, ln is usually called the natural logarithm.
By using the properties of ln given by Propoistion 9.2.2, one can draw the graph of ln.
c
2020 School of Mathematics and Statistics, UNSW Sydney
212 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS
ln x
1 e x
d
We note that, although ln(x) → ∞ as x → ∞, we have dx ln(x) = x1 → 0 as x → ∞. This means
the graph of ln becomes flatter towards infinity and hence approaches infinity very slowly.
By reflecting the graph of ln in the line y = x, one obtains the graph of exp.
y
(1, e)
b
1 b
y = exp x (e, 1)
1 x
y = ln x
Proposition 9.3.2. The function exp : R → (0, ∞) has the following properties:
c
2020 School of Mathematics and Statistics, UNSW Sydney
9.3. THE EXPONENTIAL FUNCTION 213
Definition 9.3.3. Suppose that x is an irrational number. Then we define the real
number ex by the formula
ex = exp x.
Thus Definition 9.3.3 extends Definition 9.1.1, in the case when b = e, to include irrational
powers. Moreover, since exp x = ex and exp is a continuous function, the function f : R → R, given
by f (x) = ex , is also continuous. We have thus made significant progress towards the problem
outlined at the beginning of the chapter. Finally, since exp x = ex , we call exp the exponential
function.
c
2020 School of Mathematics and Statistics, UNSW Sydney
214 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS
by the definitions and properties established in the last two sections. This inspires the following
definition.
bx = ex ln b ∀x ∈ R. (9.4)
f (x) = bx ∀x ∈ R,
is also continuous (see Proposition 3.1.2). This solves the problem articulated at the beginning of
the chapter. Thus
√ √
3 3 ln 2
2 =e ≃ 1.492106 . . . .
Moreover, we can discard our old definition of logb (see Definition 9.1.2), together with its
technical difficulties (see Remark 9.1.3), and replace it with a simpler and more powerful definition.
Definition 9.4.2. Suppose that b is a positive real number, b 6= 1. Then the function
logb : (0, ∞) → R is defined to be the inverse of the function f : R → (0, ∞), where
f (x) = bx
for all x in R.
We leave it as an exercise to show that the function f of Definition 9.4.2 is one-to-one and
hence invertible. The graphs of various exponential functions and their corresponding logarithmic
functions are shown below.
c
2020 School of Mathematics and Statistics, UNSW Sydney
9.5. INTEGRATION AND THE LN FUNCTION 215
y= 8x y= 3x
y y = 2x
y = log2 x
y = log3 x
1
y = log8 x
1 x
The following proposition gives a formula for logb in terms of ln. Its proof is left as an exercise.
Proposition 9.4.3. Suppose that b is a positive real number, b 6= 1. Then
ln x
logb x = ∀x ∈ (0, ∞). (9.5)
ln b
Using equations (9.4) and (9.5), one can easily establish familiar properties for the exponentials
and logarithms to the base b, with the advantage that we no longer need the technical restrictions
of Remark 9.1.3.
c
2020 School of Mathematics and Statistics, UNSW Sydney
216 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS
Thus Z
1
dx = ln |x| + C
x
provided x is restricted to an interval not containing zero. By generalising this calculation we
obtain the formula
Z ′
f (x)
dx = ln |f (x)| + C,
f (x)
provided that f is never zero over the interval of integration.
sin x
Solution. (a) By using the fact that tan x = cos x we have
Z Z
− sin x
tan x dx = − dx
cos x
= − ln | cos x| + C.
(a) y = 10x
3/5
(3x2 + 4)(x + 2)
(b) y =
x3 + 5x
(c) y = (sin x)cos x .
c
2020 School of Mathematics and Statistics, UNSW Sydney
9.7. INDETERMINATE FORMS WITH POWERS 217
ln y = ln 10x
= x ln 10.
1 dy
= ln 10
y dx
and hence
dy
= (ln 10) × y
dx
= (ln 10)10x .
d x d x ln 10 )
Alternatively: dx (10 ) = dx (e = ln 10ex ln 10 = (ln 10)10x .
Remark 9.6.2. Recall that one can only take the logarithm of a positive number. Hence our
solution to Example 9.6.1 (b) and (c) is only valid when y > 0.
The first limit is of the form 00 while the second is of the form ∞0 . Both of these forms are
examples of indeterminate forms and hence we cannot say what the corresponding limits are in
each case without further calculation. Since each limit involves a power, it is natural to first take
the logarithm of the limit and then bring l’Hôpital’s rule into play.
Solution. The limit is an indeterminate form of the type 00 . By taking the natural logarithm, we
∞
can transform the limit into an indeterminate form of the type ∞ :
lim x2x = lim exp ln x2x (since ln and exp are inverses)
x→0+ x→0+
= lim exp 2x ln x
x→0+
= exp lim 2x ln x (since exp is continuous)
x→0+
ln x
= exp lim .
x→0+ 1/(2x)
c
2020 School of Mathematics and Statistics, UNSW Sydney
218 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS
We can now apply l’Hôpital’s rule to the problem. By differentiating the numerator and denomi-
nator and then simplifying we obtain
1/x
lim x2x = exp lim
x→0+ x→0+ −1/(2x2 )
= exp lim −2x
x→0+
= exp(0)
= 1.
Each of the limits in (9.6) can be evaluated by using the ideas in this example.
Remark 9.7.2. The limit of Example 9.7.1 is of the form 00 and has the value 1. The limit
1 x
lim e− x
x→0+
The fact that limits of the type 00 can have different values explains why 00 is an indeterminate
form.
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 9 219
2. [R]
a) Prove, using upper and lower Riemann sums and the definition of ln x, that ln 2 <
1 < ln 4, and hence that 2 < e < 4.
5
b) [H] Use Maple and the method of part (a) to prove that 2 < e < 3. How many
partition points do you need?
4. [R] Find
Z Z Z
e2x e1/x
a) dx b) dx c) 3x dx
1 + e2x x2
Z √x Z Z
e ln x
d) √ dx e) dx f) cot x dx.
8 x x
c
2020 School of Mathematics and Statistics, UNSW Sydney
220 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS
6. [R]
ln x
a) Sketch the curve y = , noting any turning points and asymptotes.
x
b) By using (a) or otherwise, prove that π e < eπ .
7. [R] [V]
y
1
y= x
1
t
1+t
1 1+ 1 x
t
1 1 1
a) From the graph, explain why ≤ ln 1 + ≤ whenever t ≥ 0.
1+t t t
t
1 1 t
b) Deduce that lim ln 1 + = 1 and hence find the value of lim 1 + .
t→∞ t t→∞ t
10. [H] Prove that the functions f : (−1, ∞) → R and g : (−1, ∞) → R, given by
x2 x2 x3
f (x) = ln(1 + x) − x − and g(x) = x − + − ln(1 + x),
2 2 3
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 9 221
x2 x2 x3
x− < ln(1 + x) < x − +
2 2 3
whenever x > 0.
c
2020 School of Mathematics and Statistics, UNSW Sydney
222 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS
c
2020 School of Mathematics and Statistics, UNSW Sydney
223
Chapter 10
a x/a
y= e + e−x/a (10.1)
2
(see Figure 10.1 (c)). In this chapter we study the function described by equation (10.1) and
other functions related to it. Such functions are important to mathematics, engineering, physics
(especially the theory of relativity) and architecture.
We note from the outset that a large number of new formulae will be introduced in this chapter.
However, students do not need to memorise all of them. Those which should be memorised are
listed in Section 10.7.
a
x
(a) (b) (c)
c
2020 School of Mathematics and Statistics, UNSW Sydney
224 CHAPTER 10. THE HYPERBOLIC FUNCTIONS
The reasoning behind the name ‘hyperbolic cosine’ will gradually become apparent. Since cosh
is a linear combination of exponential functions, it is differentiable and
d 1 d x 1
cosh x = e + e−x = (ex − e−x ).
dx 2 dx 2
The derivative of cosh is important in its own right and is given a special name.
Some people pronounce sinh as ‘shine.’ One can easily show that the derivative of sinh is cosh.
Thus we have
d d
(sinh x) = cosh x and (cosh x) = sinh x. (10.2)
dx dx
We now consider the graphs of these functions.
• Since
sinh(−x) = 21 (e−x − ex ) = − 12 (ex − e−x ) = − sinh x,
we see that sinh is an odd function. As a corollary, sinh(0) = 0.
• Since
d
(sinh x) = cosh x = 12 (ex + e−x ) > 0,
dx
sinh is an increasing function.
c
2020 School of Mathematics and Statistics, UNSW Sydney
10.1. HYPERBOLIC SINE AND COSINE FUNCTIONS 225
y y y = cosh x
y = sinh x 1 y = 21 ex
y = 21 ex
x x
(a) (b)
From these observations we obtain a sketch of the graph of sinh (see Figure 10.2).
• cosh(0) = 1;
The reader should verify each of these properties by appealing to Definition 10.1.1. From these
observations, we obtain a sketch of the graph of cosh (see Figure 10.2).
The functions cosh and sinh are related by the following important identity.
cosh2 x − sinh2 x = 1.
Proof. If x ∈ R then
2 2
cosh2 x − sinh2 x = 1 x
2 (e + e−x ) − 21 (ex − e−x )
1
= × 2ex × 2e−x difference of two squares
4
= 1,
c
2020 School of Mathematics and Statistics, UNSW Sydney
226 CHAPTER 10. THE HYPERBOLIC FUNCTIONS
We are now in a position to appreciate why these functions are called ‘hyperbolic cosine’ and
‘hyperbolic sine.’ First, we examine their similarity to the trigonometric cosine and sine functions.
The sine and cosine functions have the following properties:
d d
(cos x) = − sin x (sin x) = cos x
dx dx
cos is an even function sin is an odd function
cos2 x+ sin2 x = 1.
The functions sinh and cosh have analogous properties (with the occasional adjustment of a negative
sign):
d d
(cosh x) = sinh x (sinh x) = cosh x
dx dx
cosh is an even function sinh is an odd function
cosh2 x− sinh2 x = 1.
The next example explains the origin of the name hyperbolic in ‘hyperbolic cosine.’
where t ∈ R.
Solution. The parameter t in equation (10.3) can be eliminated by Proposition 10.1.5 to obtain
1 = cosh2 t − sinh2 t
2 2
= x(t) − y(t) ,
or simply
x2 − y 2 = 1.
This equation describes an hyperbola. Since cosh t > 0 for every real number t, it follows that
x > 0 and hence the curve is the branch of the hyperbola that lies in the right-half plane (see the
black curve in Figure 10.3 (b)). Its asymptotes are the lines y = x and y = −x.
(The other branch of the hyperbola is parameterised by
(
x(t) = − cosh t
y(t) = sinh t
Hence we use trigonometric sine and cosine to paramaterise a circle or ellipse, and hyperbolic
sine and cosine to parameterise an hyperbola.
c
2020 School of Mathematics and Statistics, UNSW Sydney
10.2. OTHER HYPERBOLIC FUNCTIONS 227
1
x
c
2020 School of Mathematics and Statistics, UNSW Sydney
228 CHAPTER 10. THE HYPERBOLIC FUNCTIONS
tanh x
1
−1
sinh x
Solution. Since tanh x = cosh x and cosh x > 0 for all x in R, we see that Dom(tanh) = R. Some
properties of tanh are listed below.
• By using the fact that cosh is even and sinh is odd, we see that
sinh(−x) − sinh x
tanh(−x) = = = − tanh(x).
cosh(−x) cosh x
• Since d
dx (tanh x) = sech2 x > 0 for all x in R, the function tanh is increasing everywhere.
sinh x
tanh x =
cosh x
ex − e−x
= x
e + e−x
1 − e−2x
=
1 + e−x
→1
as x → ∞.
d
= sech2 0 = 1.
(tanh x)
dx x=0
c
2020 School of Mathematics and Statistics, UNSW Sydney
10.3. HYPERBOLIC IDENTITIES 229
those for cos and sin. This analogy can also be seen in the following larger list of identities. There
are the ‘difference of squares’ identities
cosh2 x − sinh2 x = 1,
1 − tanh2 x = sech2 x,
coth2 x − 1 = cosech2 x,
and
We emphasise that Remark 10.3.1 provides a way to remember each formula, not to prove it.
Proofs are constructed using techniques that are illustrated in the following example.
Example 10.3.2. (a) Prove the identity
c
2020 School of Mathematics and Statistics, UNSW Sydney
230 CHAPTER 10. THE HYPERBOLIC FUNCTIONS
(c) By using the result of (a), prove the double-angle formula for cosh.
(d) By assuming that
sinh(x + y) = sinh x cosh y + cosh x sinh y, (10.4)
use the result of (a) to show that
tanh x + tanh y
tanh(x + y) = .
1 + tanh x tanh y
Solution. (a) By the definition of cosh,
LHS = 1
2 ex+y + e−(x+y)
while
1 x
−x 1 y −y
1 x
−x 1 y −y
RHS = 2 e +e 2 e +e +2 e − e 2 e −e
1 x+y
= 4 e + ex−y + e−x+y + e−x−y + 14 ex+y − ex−y − e−x+y + e−x−y
1 x+y
= 2e + 12 e−x−y
= LHS.
This proves the identity.
Remark 10.3.3. Note that in Example 10.3.2, each proof started by simplifying either the left-
hand side of the equation, or the right-hand side of the equation. It would be wrong to begin the
proof of (a) by
cosh(x + y) = cosh x cosh y + sinh x sinh y
x+y
1
2 e + e−(x+y) = 21 ex + e−x 12 ey + e−y + 21 ex − e−x 12 ey − e−y
··· = ···
Such a method is flawed and can be used to ‘prove’ erroneous statements such as ‘1 = 0’.
and so on.
c
2020 School of Mathematics and Statistics, UNSW Sydney
10.5. THE INVERSE HYPERBOLIC FUNCTIONS 231
f (x) = ln(sech(2x)).
−2 sech(2x) tanh(2x)
f ′ (x) = = −2 tanh(2x).
sech(2x)
Z √
sech2 ( x)
(c) √ dx
x
Z
(d) ex sinh x dx.
c
2020 School of Mathematics and Statistics, UNSW Sydney
232 CHAPTER 10. THE HYPERBOLIC FUNCTIONS
y y y
1 −1
x 1 x 1 x
(a) sinh and sinh−1 (b) cosh and cosh−1 (c) tanh and tanh−1
Figure 10.5: Graphs of the hyperbolic functions (in gray) and their inverses (in black).
Graphs of the functions sinh−1 , cosh−1 and tanh−1 are illustrated in Figure 10.5.
Expressions involving the hyperbolic functions and their inverses can be simplified using the
identities of Section 10.3. It is important to note that
cosh−1 (cosh x) = x
only when x ≥ 0.
We need to decide whether to take the positive or negative square root. Since cosh−1 is a nonneg-
ative function, we have cosh−1 43 > 0 and hence
sinh(cosh−1 34 ) > 0.
Therefore √
sinh cosh−1 4
3 =+ 7
3 .
c
2020 School of Mathematics and Statistics, UNSW Sydney
10.5. THE INVERSE HYPERBOLIC FUNCTIONS 233
(b) We cannot write cosh−1 (cosh(−7)) = −7 because cosh−1 is only the inverse function of cosh
with domain restricted to [0, ∞). However, since cosh is an even function,
cosh−1 (cosh(−7)) = cosh−1 (cosh(7)) = 7.
(c) We use the double-angle formula
cosh(2t) = 1 + 2 sinh2 t
for cosh:
2
cosh(2 sinh−1 3) = 1 + 2 sinh(sinh−1 3)
= 1 + 2 × 32
= 19.
Each of the functions sinh, cosh and tanh can be expressed in terms of the exponential function.
It is not surprising, then, that their inverses can be expressed in terms of the natural logarithm.
Proposition 10.5.3. The following identities hold:
p
sinh−1 x = ln(x + x2 + 1) ∀x ∈ R,
p
cosh−1 x = ln(x + x2 − 1) ∀x ∈ [1, ∞),
−1 1 1+x
tanh x = ln ∀x ∈ (−1, 1).
2 1−x
Proof. We prove the formula for sinh−1 only; the others can be proved similarly.
Suppose that y = sinh−1 x. Then
sinh y = x,
which means that
1 y −y
2 (e − e ) = x.
If we multiply this equation through by 2ey then we obtain
e2y − 1 = 2xey ,
which is a quadratic equation in ey . Further rearrangement gives
(ey )2 − 2x(ey ) = 1,
to which we complete the square to obtain
(ey − x)2 = 1 + x2 .
Since ey > 0, we take the positive square root. Thus
p
ey = x + x2 + 1.
(Note that this may also be obtained by using the quadratic formula rather than completing the
square). Taking the natural logarithm of both sides of this equation gives
p
y = ln(x + x2 + 1)
as required.
c
2020 School of Mathematics and Statistics, UNSW Sydney
234 CHAPTER 10. THE HYPERBOLIC FUNCTIONS
are differentiable on their respective domains. A formula for each derivative is given in the next
proposition.
Proposition 10.5.4. The derivatives of sinh−1 , cosh−1 and tanh−1 are given by
d 1
sinh−1 x = √ ,
dx 2
x +1
d 1
cosh−1 x = √ ,
dx 2
x −1
d 1
tanh−1 x = .
dx 1 − x2
d
Proof. We prove the formula for dx (sinh−1 x) only; the others can be proved similarly.
−1
Suppose that y = sinh x, Then
sinh y = x
and differentiating with respect to x gives
dy
cosh y = 1.
dx
Hence
dy 1 1
= = . (10.5)
dx cosh y cosh(sinh−1 x)
The identity cosh2 t − sinh2 t = 1 gives
p
cosh t = 1 + sinh2 t
(where we have taken the positive square root since cosh t > 0 for all t). Hence
p
cosh(sinh−1 x) = 1 + x2 .
d 1
sinh−1 x = √
dx 1 + x2
as required.
Remark 10.5.5. Students should be familiar with the proofs of Propositions 10.5.3 and 10.5.4.
c
2020 School of Mathematics and Statistics, UNSW Sydney
10.6. INTEGRATION LEADING TO THE INVERSE HYPERBOLIC FUNCTIONS 235
d x 1 1 1
sinh−1 = q =√ .
dx a a
x 2 x 2 + a2
a +1
Z
dx x
√ = cosh−1 + C
2
x −a 2 a
p
= ln(x + x2 − a2 ) + (C − ln a), x ≥ a > 0,
Z (
dx tanh−1 xa + C, |x| < a
1
a
=
a − x2
2
coth−1 xa + C, |x| > a > 0
1
a
1 a + x
= ln + C, x2 6= a2 .
2a a − x
These formulae are included in the table of standard integrals that is issued at the final examination.
1 1
√ = p .
1 + 9x 2 3 (1/3)2 + x2
Hence
Z Z
dx 1 dx
√ = p
1 + 9x 2 3 (1/3)2 + x2
1 x
= sinh−1 +C
3 1/3
1
= sinh−1 (3x) + C.
3
Alternatively, one could approach this problem by using the substitution u = 3x.
(b) The technique we apply to this example is known as ‘completing the square’:
x2 − 2x + 10 = x2 − 2x + 1 + 9 = (x − 1)2 + 32 = u2 + 32 ,
c
2020 School of Mathematics and Statistics, UNSW Sydney
236 CHAPTER 10. THE HYPERBOLIC FUNCTIONS
where u = x − 1. Hence
Z Z
dx du
√ = √
2
x − 2x + 10 + 32u2
u
= sinh−1 + C
3
−1 x − 1
= sinh + C.
3
• Identities which express the inverse hyperbolic functions in terms of the natural loga-
rithm (see Proposition 10.5.3) can be read from the table of standard integrals.
• The definition of coth, sech and cosech are in exact analogy with the definition of cot,
sec and cosec.
• The hyperbolic identities listed in Section 10.3 are easily remembered from their trigono-
metric counterparts (see Remark 10.3.1).
c
2020 School of Mathematics and Statistics, UNSW Sydney
10.8. (APPENDIX):THE CATENARY 237
y
suspended cable
T
T sin θ
P b
θ
T cos θ
A
H b
w ℓ(x)
• It is not expected that students will memorise the derivatives of coth, sech and cosech.
However, they can be easily derived from first principles provided that the derivatives
of sinh and cosh are known.
A table of standard integrals will be issued at the final examination.
To prove the theorem, it suffices to show that if the catenary is described by y = f (x), where
f is a differentiable function, then
H wx
f (x) = cosh
w H
for a particular choice of y0 .
We begin with a general formula that relates a function f to its arc length function ℓ by a
differential equation.
Lemma 10.8.2. Suppose that f is a differentiable function. Then the arc length function ℓ is also
differentiable and q
′
2
ℓ (x) = 1 + f ′ (x) (10.6)
Sketch proof. Suppose that h is a small nonzero real number and consider the diagram below.
f (t)
b
x + h, f (x + h)
ℓ(x + h) − ℓ(x)
f (x + h) − f (x)
x, f (x) b
By Pythagoras’ theorem,
q 2
ℓ(x + h) − ℓ(x) ≈ h2 + f (x + h) − f (x) .
If we divide both sides by h then
s 2
ℓ(x + h) − ℓ(x) f (x + h) − f (x)
≈ 1+ .
h h
As h → 0 the approximation gets better and we obtain
q 2
ℓ′ (x) = 1 + f ′ (x)
as required.
Proof. The first two conditions are a result of our choice of coordinate system and the assumption
that f is differentiable. It remains to establish the third condition.
Since the cable is stationary, the forces w ℓ(x) and T sin θ that act vertically cancel each other
out. That is,
T sin θ = w ℓ(x).
Similarly, the horizontal forces H and T cos θ also cancel each other out and so
T cos θ = H.
Using these equations, and the fact that f ′ (x) = tan θ at the point P (x, y), gives
T sin θ w ℓ(x)
f ′ (x) = tan θ = = .
T cos θ H
So far we have
w ℓ(x)
f ′ (x) = .
H
Since w and H are constants, differentiating this equation with respect to x gives
w ′
f ′′ (x) = ℓ (x).
H
Therefore q
′′ w 2
f (x) = 1 + f ′ (x) .
H
by equation (10.6).
Proof of Theorem 10.8.1. It remains to find all functions f that satisfy the conditions of Lemma
10.8.3. If we let u denote f ′ (x) and b denote w/H, then equation (10.9) becomes
du p
= b 1 + u2 .
dx
Hence
dx 1
= √ .
du b 1 + u2
Every solution to this differential equation is of the form
1
x = sinh−1 u + C1 .
b
Hence we have
f ′ (x) = u = sinh(bx − bC1 ).
Enforcing condition (10.8) shows that either b = 0 or C1 = 0. Now if b = 0 then f ′ (x) = 0 and f
is simply a constant function (that is, the cable is pulled tight and exhibits no sag). On the other
hand, if b 6= 0 then C1 = 0 and hence
f ′ (x) = sinh(bx).
Therefore
cosh(bx)
f (x) = + C2
b
and enforcing the condition (10.7) yields C2 = 1b − y0 . If we choose y0 to be 1/b then C2 = 0 and
the function f that describes the catenary is given by
cosh(bx) H wx
f (x) = = cosh ,
b w H
proving the theorem.
c
2020 School of Mathematics and Statistics, UNSW Sydney
240 CHAPTER 10. THE HYPERBOLIC FUNCTIONS
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 10 241
2. [R] By expressing the following hyperbolic functions in terms of sinh x and cosh x, find
the derivative of each function f given below.
a) f (x) = tanh x b) f (x) = sech x c) f (x) = coth x
4. [R]
a) Given the formula sinh(A + B) = sinh A cosh B + cosh A sinh B, find a formula for
sinh 2x. By differentiation or otherwise, find a formula for cosh 2x.
b) [H] Using the results of part
Z (a), express sinh 3x as a cubic polynomial in sinh x.
Hence, or otherwise, find sinh3 x dx.
5. [R] Show that cosh x+sinh x = ex . Deduce that (cosh x+sinh x)n = cosh nx+sinh nx.
(cosh t, sinh t)
A(t)
0 1 x
a) Using the definitions of cosh and sinh, prove that, for every real number t, the
point (cosh t, sinh t) lies on the hyperbola.
c
2020 School of Mathematics and Statistics, UNSW Sydney
242 CHAPTER 10. THE HYPERBOLIC FUNCTIONS
b) When t > 0, let A(t) denote the shaded region in the diagram. Explain why
Z cosh t p
1
A(t) = cosh t sinh t − x2 − 1 dx.
2 1
t
c) By first calculating A′ (t), prove that A(t) = .
2
dy
11. [R] Find if
dx
a) y = sinh−1 (2x)
b) y = tanh−1 (1/x)
c) y = cosh−1 (sec x) whenever 0 < x < π/2.
13. [X] Sketch the function sech−1 . What is its maximal domain? For y = sech−1 x, show
that
√ !
dy −1 1 + 1 − x2
a) = √ b) y = ln .
dx x 1 − x2 x
c
2020 School of Mathematics and Statistics, UNSW Sydney
243
Chapter 1
3. Answer for both: the interior and boundary of the triangle with vertices at (0, 0), (2, 0) and
(2, 4).
1
6. c) From (a) we have x2 + ≥ 2 with equality if and only if x = ±1.
x2
7. a) F b) F c) T d) T e) F
11. a) 22 b) x2 + 10x + 22 c) 6 d) x2 + 2
√ √ √
12. a) x − 1 + 1/ x − 1 b) x−1 c) (x − 1)3/2 d) (1/ x − 1) − 1
17. x = 1, 7
c
2020 School of Mathematics and Statistics, UNSW Sydney
244 CHAPTER 3
Chapter 2
1. a) 1 b) 2 c) 0
d) Doesn’t exist (→ ∞). e) 5 f) Doesn’t exist.
2. a) 0 b) 0
4. b) 0
√
5. a) 1 b) M = 10 (best possible) c) M = 1/ ǫ will do.
6. a) 4 b) 0 c) 0 d) 0 e) 0
7. a) Not necessarily, as the information given indicates only that the inequality holds for a
subset of (ǫ−1 , ∞).
b) Yes. In fact one can prove that lim g(x) = 5 from the definition of the limit by taking
x→∞
M to be 1ǫ .
10. a) 10 b) 4 c) 3 d) −1/9
11. a) −1 b) 1 c) No
13. a) 0 b) 0
15. Neither the left-hand nor right-hand limits exist due to wild oscillatory behaviour.
c
2020 School of Mathematics and Statistics, UNSW Sydney
ANSWERS 245
Chapter 3
1. b) Yes
3. k = 8
Chapter 4
3. a) 0 b) 0 c) f ′ (0) = 0
7. 2pf ′ (a)
dy 3x2 − y dy √ √
9. a) = b) = (y − 4x xy)/(4y xy − x)
dx x − 3y 2 dx
11. y = 2
13. a = 1, b = 0
16. 7/8 m2 /s
c
2020 School of Mathematics and Statistics, UNSW Sydney
246 CHAPTER 5
1
17. a) 8π cm/s
32000π 3
b) 81 cm
dh 2
18. a) dt = 125π mm/s when h = 50 mm.
Chapter 5
q
7 1
1. 3 b) 2
5. c) 0
√ √ 1
7. a) By the Mean Value Theorem, for some c with 16 < c < 17, 17 − 16 = √ < √1 =
2 c 2 16
0.125.
b) 0.008
c) 2 × 10−6 .
8. −1, 1 and 4 are stationary points; 4 is a local minimum point;−1 is a local maximum point.
9. No
12. p′n (x) = pn−1 (x), and if pn−1 (x) = 0 then pn (x) = xn /n!. These hints are all you need!
15. a = π2 , x = 3π 7π 3π π 5π
4 , 4 ; a = 2 , x = 4 , 4 . The Maple commands
with(plots):
animate(plot,[cos(a) + 2*cos(2*x) + cos(4*x-a), x=0..2*Pi],a=0..2*Pi);
should confirm your answers.
19. a) 0 b) 8/3
1 m
20. a) 3 b) n c) −1 d) − 21 e) 1
4 f) 1
3
21. a) → 0 b) → ∞ c) → 0
d) → 1 e) → 1 f) → 32
c
2020 School of Mathematics and Statistics, UNSW Sydney
ANSWERS 247
22. Combine the two fractions and apply l’Hôpital twice only. You will need to simplify the quotient
obtained after the first application of l’Hôpital. Maple can confirm your answer.
√ √ √ √
23. (a, b) = (− 2, 2) or ( 2, − 2)
26. a) −1/2
b) a = −1/2, b = 1
27. c) a = b = 0
Chapter 6
1
2. a) f −1 (x) = (x − 1)
3
√
b) g −1 (x) = − x − 1, Dom(g −1 ) = [1, ∞),
−1
Range(g ) = (−∞, 0], (g −1 )′ (x) = 2√−1
x−1
4. b) 1/3
5. b) The restriction of f to (−∞, −1] has an inverse with domain (−∞, 3],
the restriction of f to [−1, 1] has an inverse with domain [−1, 3], and
the restriction of f to [1, ∞) has an inverse with domain [−1, ∞).
6. a) No b) Yes
7. a) The graph is symmetric about x = − 21 , which surely gives a local maximum of f (x).
There will be four (maximal) intervals where f will have an inverse. Try this exercise on
Maple. The commands plot, diff and solve should suffice.
b) f is one-to-one; f −1 (x) = x1/17 − 1 is not differentiable when x = 0.
c) I can be one of four intervals.
13. b) f (x) = π/2 when x > 0 and f (x) = −π/2 when x < 0.
√
14. b) The derivative of the inverse is −1/x x2 − 1 when x > 1.
16. a = π/2, b = 0
c
2020 School of Mathematics and Statistics, UNSW Sydney
248 CHAPTER 7
Chapter 7
a) x = 3, y = x + 2 b) x = −1, y = x − 1 c) x = −3, x = 2, y = x − 1.
x2 y2 x2 y2
7. a) + = 1, ellipse b) − = 1, hyperbola
16 25 9 4
2/3 d) spiral
c) y = x
1
9. a) 3x − 27y + 52 = 0 b) 9
2
10. a) y = 3x 3 .
11. b) Hint: the length of one particular arc of the larger circle equals the length of one arc on
the smaller circle.
d) x2/3 + y 2/3 = 1
12. a) p(t) = a + t(b − a), p(0) = a, p(1) = b b) y = 4 − x, q(1/2) is the coodinate vector for
the midpoint of B and C c) p0 (t) = (1 − t)2 , p1 (t) = 2t(1 − t), p2 (t) = t2
√ √ √
13. a) (3, 0) b) (−3 3, −3) c) ( 2, − 2)
√
14. a) (3, π) b) ( 2, −3π/4) c) (4, 2π/3)
d) (1, π/2) e) (4, 5π/6) f) (4, −5π/6)
c
2020 School of Mathematics and Statistics, UNSW Sydney
ANSWERS 249
x x
x
a) b) c)
y
y y
x x x
d) e) f)
(x − 2)2 y2
19. + =1
9 5
Chapter 8
1. a) i) S Pn (f ) = S Pn (f ) = 1
ii) S Pn (f ) = 21 1 − n1 , S Pn (f ) = 12 1 + n1
iii) S Pn (f ) = 61 1 − n1 2 − n1 , S Pn (f ) = 16 1 + n1 2 + n1
v) S Pn (f ) = 1, S Pn (f ) = 0
1 1 1
b) i) 1 ii) 2 iii) 3 iv) 4 (v) Not Riemann integrable
√
2. a) 1365 = 36.95
√ √
b) 1690.9 = 41.12 and the lower bound is 1078.9 = 32.85
4. 4.5
5. a) 82.4 b) 10
1
6. f (x) =
x2 +x+1
1
7. x is not differentiable on all of [−1, 1] so the FTC doesn’t apply.
√
8. a) Draw a picture! b) 5π/12 − 3/2
2 √
15. a) 12 ex + C b) −2 cos x + C c) 15/4
√ √
d) 4 2 a9/2 9 e) 1/4 f) (2 2 − 1)/3
√ √ 1 1 21 1
16. a) 2 x − 2 ln(1 + x) + C b) 25 21 (5x − 1) + 20 (5x − 1)20 + C
c) x/(x + 1)2 + C d) 4 − 10 ln(7/5)
c
2020 School of Mathematics and Statistics, UNSW Sydney
250 CHAPTER 9
1 t2 − 1
17. √ tan−1 √ for t 6= 0
2 2t
5
18. a) 4e25+1 b) x2 sin x + 2x cos x − 2 sin x + C
√
π
c) x(ln(x) − 1) + C d) 12 + 23 − 1
7e8 +1 π
f) 2
e)
ex
64 √
g) (cos x + sin x) h) x tan−1 x − ln 1 + x2 + C
√2 √
2 1
i) 2 + 2 ln(1 + 2)
21. a) 0 b) ln 2 c) No
25. s < 0
26. p > 1
28. a) 4, 8, 2
c) Li′ (x) = 1
ln x > 0 so Li is an increasing function; Li(2) = 0.
6
10 −2
d) Li(106 ) ≥ 6 ln 10 .
π(106 )
e) x ' 0.07238.
2
29. a) √2 e−x
π
d) (i) 0.749 < erf(1) < 0.928 (iii) 1/e (iv) 1.344
Chapter 9
2
(ln x) f) ln | sin x|
e) 2
7. b) e
c
2020 School of Mathematics and Statistics, UNSW Sydney
ANSWERS 251
1/5
x3 − 3 3x2 2x
8. a) 3x ln 3 b) −
x2 + 1 5(x
3 − 3) 5(1 + x2 )
sin x
c) (sin x)sin x cos x (1 + ln(sin x)) d) cos(xsin x ) xsin x cos x ln x +
x
9. a) 0 b) 0 c) 1 d) e2 e) 1
f) 1 g) ea h) 0 i) 0
Chapter 10
− sinh(1/x) 1 1
3. a) 6x cosh(3x2 ) b) x2 c) 2 + 2x2
sinh 4x 1 √
7. a) 4 b) 12 c) (2x + sinh 2x)/4 d) 2 cosh x
8. 5/4, 3, 5/12
√ 1
11. a) 2/ 1 + 4x2 b) 1−x2 for |x| > 1 c) sec x
12. a) 1
2 sinh−1 2x b) tanh−1 1
2 = 1
2 ln 3 c) sinh−1 x+2
3
c
2020 School of Mathematics and Statistics, UNSW Sydney
252
c
2020 School of Mathematics and Statistics, UNSW Sydney
INDEX 253
Index
c
2020 School of Mathematics and Statistics, UNSW Sydney
254 INDEX
e, 211 hyperbolic sine, 224
elementary functions, 18 sum and difference formulae, 229
elements (of set), 2
ellipse, 20, 140 improper integral
endpoints p-integrals, 188
critical points, 96 convergence of, 187
endpoints of intervals, 3 indeterminate form(s), 31, 103, 217, 218
equations inequalities, 4
solutions, 98 and absolute value, 7
erf, 205 triangle inequality, 7
error bounds, 90 inequality, 4
error function, 205 infinity, symbol for, 3
even function, 132 injective, 113
exp, 212 integers, 2
exponential function, 214 integral calculus, 1, 157
integral(s), 165
fastest growing term, 30 definite
function(s), 1, 9 limits of, 165
composition of, 12, 70 improper
decreasing, 91 p-integrals, 186
error function, 205 comparison test, 190
even, 132 convergence of, 186
exponential, 213 limit comparison test, 193
implicitly defined, 19 indefinite, 179
increasing, 91 over (a, ∞), 187
inverse function, 115 Riemann sum, 162, 164
limit at infinity, 33 table of, 237
local approximation, 77 integrand, 165
natural logarithm, 209 integration, 163
odd, 132 and differentiation, 177
quadratic, 13 by parts, 185
range of, 10 by substitution, 180
rational, 14 constant of, 179
real-valued, 9 dummy variable, 171
sine integral, 196 intermediate value theorem, 55
value of, 9 number of solutions, 100
fundamental theorem of calculus, 1, 196 intervals, 3
first fundamental theorem, 172 closed, 3
second fundamental theorem, 175 open, 3
inverse function(s), 115
Galileo, 142, 223 hyperbolic functions, 231
inverses of, 231
horizontal line test, 114 restriction, 118
horizontal point of inflexion, 93 trigonometric functions, 120, 121
Huygens, Christiaan, 223 irrational numbers, 2
hyperbola, 20, 140, 226
hyperbolic functions, 227 l’Hôpital’s rule, 104, 217
double-angle formulae, 229 l’Hôpital, Guillaume, 142
hyperbolic cosine, 224 least upper bound property, 55, 57
c
2020 School of Mathematics and Statistics, UNSW Sydney
INDEX 255
Lebesgue, Henri, 157, 168 polar coordinates, 144
left-hand limits, 39 polar curves, 146
Leibniz, Gottfried, 1, 142, 157, 223 polynomial, 13
Leonard Euler, 211 coefficients of, 13
limit(s), 1, 27 degree of, 13
at infinity, 33 leading coefficient, 13
composition of function, 42 polynomial division, 134
left-hand, 39 primitive, 102
of continuous functions, 40 product rule, 72
of the form lim f (x), 27
x→∞ quadratic function, 13
pinching theorem, 37, 43
quartic, 13
right-hand, 39
quintic, 13
two-sided, 39
ln, 209 radians, 14
logarithmic functions, 214 range of a function, 10
logarithms, 209 rates of change, 78
rational function, 14
maximum rational numbers, 2, 55
global, 96 real numbers, 2, 55
local, 79, 94 Riemann integrable, 165
second derivative test, 94 Riemann integral, 163
maximum-minimum theorem, 56, 57 area under the graph, 163
mean value theorem, 85, 86 basic properties of, 169
members (of set), 2 Riemann sum
minimum upper, 162, 164
global, 96 Riemann sums, 177, 196
local, 94 lower, 162, 164
second derivative test, 94 Riemann, Bernhard, 157
right-hand limits, 39
natural domain, 10 Rolle’s theorem, 87
natural logarithm, 211, 233
natural numbers, 2 secant, 64
Newton, Isaac, 1, 142, 157 second derivative, 93, 95
set(s), 2
oblique asymptote, 132 sine integral, 196
odd function, 132 stationary point, 80
one-to-one correspondence, 112, 113 classification of, 92
one-to-one function, 113 subset, 3
open interval, 3 sum and difference formulae, 229
c
2020 School of Mathematics and Statistics, UNSW Sydney