0% found this document useful (0 votes)

617 views265 pages

MATH1131-1141-Calculus-Notes-2020T1 (2020 - 10 - 19 10 - 35 - 29 UTC)

Uploaded by

shirley

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

617 views265 pages

MATH1131-1141-Calculus-Notes-2020T1 (2020 - 10 - 19 10 - 35 - 29 UTC)

Uploaded by

shirley

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 265

MATH1131 Mathematics 1A

and

MATH1141 Higher Mathematics 1A

CALCULUS NOTES

CRICOS Provider No: 00098G c

2020 School of Mathematics and Statistics, UNSW
Sydney
iii

Preface

Please read carefully.

These Notes form the basis for the calculus strand of MATH1131 and MATH1141. However, not
all of the material in these Notes is included in the MATH1131 or MATH1141 calculus syllabuses.
A detailed syllabus will be uploaded to Moodle.
In using these Notes, you should remember the following points:
1. Most courses at university present new material at a faster pace than you will have been
accustomed to in high school, so it is essential that you start working right from the beginning
of the session and continue to work steadily throughout the session. Make every effort to keep
up with the lectures and to do problems relevant to the current lectures.
2. These Notes are not intended to be a substitute for attending lectures or tutorials. The
lectures will expand on the material in the notes and help you to understand it.
3. These Notes may seem to contain a lot of material but not all of this material is equally
important. One aim of the lectures will be to give you a clearer idea of the relative importance
of the topics covered in the Notes.
4. Use the tutorials for the purpose for which they are intended, that is, to ask questions about
both the theory and the problems being covered in the current lectures.
5. Some of the material in these Notes is more difficult than the rest. This extra material
is marked with the symbol [H]. Material marked with an [X] is intended for students in
MATH1141.
6. Some of the problems are marked [V]. These have a video solution available from Moodle.
7. It is essential for you to do problems which are given at the end of each chapter. If you
find that you do not have time to attempt all of the problems, you should at least attempt a
representative selection of them. You will find advice about this on Moodle. You should also
work through the Online Tutorals that you will find on Moodle.
8. You will be expected to use the computer algebra package Maple in tests and understand
Maple syntax and output for the end of term examination.

Note.
This version of the Calculus Notes has been prepared by Robert Taggart and Peter Brown. They
build on notes first developed by Tony Dooley and subsequently edited by several members of
the School of Mathematics and Statistics. The main editors include Mike Banner, Ian Doust and
V. Jeyakumar.
c
Copyright is vested in The University of New South Wales, 2020.
iv

c
2020 School of Mathematics and Statistics, UNSW Sydney
v

Contents

Preface iii
Calculus Syllabus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix

1 Sets, inequalities and functions 1

1.1 Sets of numbers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Solving inequalities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.3 Absolute values . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.4 Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.5 Polynomials and rational functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
1.6 The trigonometric functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.7 The elementary functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
1.8 Implicitly defined functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.9 Continuous functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
1.10 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22

Problems for Chapter 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

2 Limits 27
2.1 Limits of functions at infinity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
2.1.1 Basic rules for limits . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
2.1.2 The pinching theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
2.1.3 Limits of the form f (x)/g(x) . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
p p
2.1.4 Limits of the form f (x) − g(x) . . . . . . . . . . . . . . . . . . . . . . . . 30
2.1.5 Indeterminate forms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
2.2 The definition of lim f (x) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
x→∞
2.3 Proving that lim f (x) = L using the limit definition . . . . . . . . . . . . . . . . . . 34
x→∞
2.4 Proofs of basic limit results (MATH1141 only) . . . . . . . . . . . . . . . . . . . . . 37
2.5 Limits of functions at a point . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
2.5.1 Left-hand, right-hand and two-sided limits . . . . . . . . . . . . . . . . . . . . 38
2.5.2 Limits and continuous functions . . . . . . . . . . . . . . . . . . . . . . . . . 40
2.5.3 Rules for limits at a point . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
2.6 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46

Problems for Chapter 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47

c
2020 School of Mathematics and Statistics, UNSW Sydney
vi

3 Properties of continuous functions 51

3.1 Combining continuous functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
3.2 Continuity on intervals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
3.3 The intermediate value theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
3.4 The maximum-minimum theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56

Problems for Chapter 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60

4 Differentiable functions 63
4.1 Gradients of tangents and derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
4.2 Rules for differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
4.3 Proofs of results in Section 4.2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
4.4 Implicit differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
4.5 Differentiation, continuity and split functions . . . . . . . . . . . . . . . . . . . . . . 74
4.6 Derivatives and function approximation . . . . . . . . . . . . . . . . . . . . . . . . . 77
4.7 Derivatives and rates of change . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
4.8 Local maximum, local minimum and stationary points . . . . . . . . . . . . . . . . . 79
4.9 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81

Problems for Chapter 4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82

5 The mean value theorem and applications 85

5.1 The mean value theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
5.2 Proof of the mean value theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
5.3 Proving inequalities using the mean value theorem . . . . . . . . . . . . . . . . . . . 88
5.4 Error bounds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90
5.5 The sign of a derivative . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
5.6 The second derivative and applications . . . . . . . . . . . . . . . . . . . . . . . . . . 93
5.7 Critical points, maxima and minima . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
5.8 Counting zeros . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
5.9 Antiderivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
5.10 L’Hôpital’s rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103

Problems for Chapter 5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107

6 Inverse functions 111

6.1 Some preliminary examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
6.2 One-to-one functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
6.3 Inverse functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115
6.4 The inverse function theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
6.5 Applications to the trigonometric functions . . . . . . . . . . . . . . . . . . . . . . . 120
6.6 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125

Problems for Chapter 6 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127

7 Curve sketching 131

7.1 Curves defined by a Cartesian equation . . . . . . . . . . . . . . . . . . . . . . . . . 131
7.1.1 A checklist for sketching curves . . . . . . . . . . . . . . . . . . . . . . . . . . 132
7.1.2 Oblique asymptotes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132

c
2020 School of Mathematics and Statistics, UNSW Sydney
vii

7.1.3 Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135

7.2 Parametrically defined curves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 136
7.2.1 Parametrisation of conic sections . . . . . . . . . . . . . . . . . . . . . . . . . 139
7.2.2 Calculus and parametric curves . . . . . . . . . . . . . . . . . . . . . . . . . . 141
7.2.3 The cycloid and curve of fastest descent . . . . . . . . . . . . . . . . . . . . . 142
7.3 Curves defined by polar coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . 144
7.3.1 Polar coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144
7.3.2 Basic sketches of polar curves . . . . . . . . . . . . . . . . . . . . . . . . . . . 146
7.3.3 Sketching polar curves using calculus . . . . . . . . . . . . . . . . . . . . . . . 148
7.4 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 151

Problems for Chapter 7 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153

8 Integration 157
8.1 Area and the Riemann Integral . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157
8.1.1 Area of regions with curved boundaries . . . . . . . . . . . . . . . . . . . . . 157
8.1.2 Approximations of area using Riemann sums . . . . . . . . . . . . . . . . . . 160
8.1.3 The definition of area under the graph of a function and the Riemann integral163
8.2 Integration using Riemann sums . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 165
8.3 The Riemann integral and signed area . . . . . . . . . . . . . . . . . . . . . . . . . . 168
8.4 Basic properties of the Riemann integral . . . . . . . . . . . . . . . . . . . . . . . . . 169
8.5 The first fundamental theorem of calculus . . . . . . . . . . . . . . . . . . . . . . . . 171
8.6 The second fundamental theorem of calculus . . . . . . . . . . . . . . . . . . . . . . 175
8.7 Indefinite integrals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 178
8.8 Integration by substitution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180
8.9 Integration by parts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185
8.10 Improper integrals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 186
8.11 Comparison tests for improper integrals . . . . . . . . . . . . . . . . . . . . . . . . . 189
8.12 Functions defined by an integral . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 196
8.13 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 199

Problems for Chapter 8 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 200

9 The logarithmic and exponential functions 207

9.1 Powers and logarithms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 208
9.2 The natural logarithm function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 209
9.3 The exponential function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 212
9.4 Exponentials and logarithms with other bases . . . . . . . . . . . . . . . . . . . . . . 214
9.5 Integration and the ln function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 215
9.6 Logarithmic differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 216
9.7 Indeterminate forms with powers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 217

Problems for Chapter 9 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 219

10 The hyperbolic functions 223

10.1 Hyperbolic sine and cosine functions . . . . . . . . . . . . . . . . . . . . . . . . . . . 224
10.2 Other hyperbolic functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 227
10.3 Hyperbolic identities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 228

c
2020 School of Mathematics and Statistics, UNSW Sydney
viii

10.4 Hyperbolic derivatives and integrals . . . . . . . . . . . . . . . . . . . . . . . . . . . 230

10.5 The inverse hyperbolic functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231
10.6 Integration leading to the inverse hyperbolic functions . . . . . . . . . . . . . . . . . 234
10.7 A summary of important hyperbolic formulae . . . . . . . . . . . . . . . . . . . . . . 236
10.8 (Appendix):The catenary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 237
10.9 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 240

Problems for Chapter 10 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 241

Answers to selected problems 243

Chapter 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 243
Chapter 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 244
Chapter 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 245
Chapter 4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 245
Chapter 5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 246
Chapter 6 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 247
Chapter 7 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 248
Chapter 8 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 249
Chapter 9 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 250
Chapter 10 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 251

Index 252

c
2020 School of Mathematics and Statistics, UNSW Sydney
ix

CALCULUS SYLLABUS

The calculus course for both MATH1131 and MATH1141 is based on these MATH1131/MATH1141
Calculus Notes that are included in the Course Pack. A detailed syllabus and lecture schedule will
be posted on Moodle.
The computer package Maple will be used in the calculus course. An introduction to Maple is
included in the booklet titled First Year Maple Notes.

CALCULUS PROBLEM SETS

The Calculus problems are located at the end of each chapter of the Calculus Notes booklet. They
are also available from the course module on the UNSW Moodle server. Some of the problems
are very easy, some are less easy but still routine and some are quite hard. To help you decide
which problems to try first, each problem is marked with an [R], an [H] or an [X]. The problems
marked [R] form a basic set of problems which you should try first. Problems marked [H] are
harder and can be left until you have done the problems marked [R]. Problems marked [V] have a
video solution available on Moodle.
You do need to make an attempt at the [H] problems because problems of this type will occur
on tests and in the exam. If you have difficulty with the [H] problems, ask for help in your tutorial.
The problems marked [X] are intended for students in MATH1141 – they relate to topics which
are only covered in MATH1141. Extra problem sheets for MATH1141 may be issued in lectures.
Remember that working through a wide range of problems is the key to success in mathematics.
However, solving problems and writing mathematics clearly are two separate skills that need to be
developed through practice. We recommend that you keep a workbook to practice writing solutions
to mathematical problems.

c
2020 School of Mathematics and Statistics, UNSW Sydney
x

c
2020 School of Mathematics and Statistics, UNSW Sydney
1

Chapter 1

Sets, inequalities and functions

In the days of the Roman empire, the word ‘calculus’ denoted a pebble that was used for counting
and gambling. As time progressed, the word ‘calculare’ came to mean ‘to compute.’ In the second
half of the seventeenth century, two mathematicians, the Englishman Isaac Newton and the German
Gottfried Leibniz, independently invented methods for
• calculating gradients of tangents to curves,

• calculating instantaneous acceleration and velocity,

• calculating the area of regions with a curved boundary,

• calculating the volume of solids with curved boundary,

• calculating the length of a curve,

• calculating the work done by a force, and

• calculating the centre of mass of a general solid.

These methods were developed by combining algebra, geometry and trigonometry with the limiting
process and became known as the calculus.
Calculus (as it is known today) has many applications to engineering, physics, chemistry, bi-
ology, geology, surveying, sociology, economics and statistics. It includes two major branches: the
differential calculus (introduced in Chapter 4) and the integral calculus (introduced in Chapter 8).
These two branches are related by the fundamental theorem of calculus (see Theorem 8.5.1). The
underlying tool used to develop these branches is the concept of the limit (introduced in Chapter
2), which is applied to functions (introduced in Chapter 1).
The main goal of this chapter is to introduce functions. Although functions will be familiar to
students from high school, important points that deserve students’ attention are emphasised here.
Since the functions we study in this course take real numbers as inputs and give real numbers of
outputs, we begin the chapter with a review of different sets that consist of real numbers. We also
devote a little time to revising inequalities and absolute values, in preparation for our discussion
of limits in Chapter 2.

1.1 Sets of numbers

(Ref: SH10 §1.2)

c
2020 School of Mathematics and Statistics, UNSW Sydney
2 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS

A set is a collection of distinct objects. The objects in a set are called the elements or members of
the set. Some commonly used sets of numbers are listed below.

• The set N of natural numbers is given by

N = {0, 1, 2, 3, 4, . . .}.

• The set Z of integers is given by

Z = {. . . , −3, −2, −1, 0, 1, 2, 3, . . .}.

• The set Q of rational numbers is the collection of all numbers of the form pq , where p and q
are integers and q 6= 0.

• The set R of real numbers may be represented as the collection of points lying on the number
line.

| | | | | | |

−3 −2 −1 0 1 2 3

If A is a set of numbers and the number x is a member of the set A, then we write

x ∈ A.

If x is not a member of A then we write

x∈
/ A.
The next example illustrates this notation.

Example 1.1.1. The following statements are true:

3 3 √ √
2 ∈ N, −2 ∈
/ N, ∈
/ Z, ∈ Q, 2 ∈ R, 2∈
/ Q.
4 4
Many other sets of numbers can be described using N, Z, Q, R and inequalities.

Example 1.1.2. (a) The set

{n ∈ Z : n > 0}
is the set of all numbers n belonging to the integers such that n is greater than 0. In other words,
it is the set of positive integers. (Note that the colon ‘:’ above is read as ‘such that’.)
(b) The set
{x ∈ R : −2 ≤ x < 3}
is represented by the following interval on the real line.
| | | | | | |

−3 −2 −1 0 1 2 3

(c) The set

{x ∈ R : x ∈
/ Q}
is the set of irrational numbers.

c
2020 School of Mathematics and Statistics, UNSW Sydney
1.1. SETS OF NUMBERS 3

Sets that are represented by intervals of the real line occur so frequently that we introduce
a special notation for them. Suppose that a and b are real numbers and that a < b. Then the
intervals (a, b), [a, b], (a, b] and [a, b) are given by

• (a, b) = {x ∈ R : a < x < b}

a b

• [a, b] = {x ∈ R : a ≤ x ≤ b}

a b

• [a, b) = {x ∈ R : a ≤ x < b}

a b

• (a, b] = {x ∈ R : a < x ≤ b}.

a b

In each case, the numbers a and b are called the endpoints of the interval. An interval [a, b] that
includes its endpoints is called a closed interval, while an interval (a, b) that excludes its endpoints
is called an open interval. The intervals [a, b) and (a, b] are neither open nor closed.
Interval notation can be extended to describe rays of the real line by using the symbol ∞ for
infinity. Thus we have

• [a, ∞) = {x ∈ R : a ≤ x}

|
a b

• (−∞, b) = {x ∈ R : x < b}

|
a b

• (−∞, ∞) = R.

We end by introducing an important set relation.

Definition 1.1.3. Suppose that A and B are two sets. We say that A is a subset
of B if x ∈ A implies that x ∈ B. If A is a subset of B then we also say that B
contains the set A.

The above definition is illustrated by the following examples:

c
2020 School of Mathematics and Statistics, UNSW Sydney
4 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS

• {0, 1, 2, 3} is a subset of {0, 1, 2, 3, 4},

• N is a subset of Z,

• [−1, 2] is a subset of (−3, ∞),

• R is a subset of R, and

• [1, 4) is not a subset of (1, 4].

1.2 Solving inequalities

(Ref: SH10 §1.3)
When solving an inequality, one must remember that if the inequality is multiplied by a negative
number, the sign of the inequality reverses.
Two types of inequalities deserve special attention: quadratic inequalities and inequalities with
an unknown quantity in the denominator.
Example 1.2.1. Solve the inequality
x2 > 6 − x.
Solution. Rearranging the inequality gives

x2 + x − 6 > 0.

The values of x for which x2 + x − 6 is positive are the same as those for which the graph of

y = x2 + x − 6

lies above the x-axis.

To graph
y = x2 + x − 6,
we factorise the right-hand side to obtain

y = (x − 2)(x + 3).

Hence y = 0 when x = 2 and when x = −3. We can now easily sketch its graph.
y

y = x2 + x − 6

−3 2 x

c
2020 School of Mathematics and Statistics, UNSW Sydney
1.2. SOLVING INEQUALITIES 5

Since the graph lies above the x-axis when x < −3 and when x > 2, the solution to the inequality
is
x < −3 or x > 2.

Example 1.2.2. Solve the inequality

1 1
< .
3 x−1
Solution. We begin by noting that x 6= 1.
To solve the inequality, we cannot simply multiply the inequality through by x − 1, since we
do not know whether x − 1 is positive or negative. (Taking the reciprocal of both sides is similarly
problematic.) Instead, we multiply through by the positive number (x − 1)2 to obtain

1
(x − 1)2 < x − 1.
3
Multiplication by 3 and rearrangement gives

(x − 1)2 − 3(x − 1) < 0

and by factorisation we obtain

(x − 1)[(x − 1) − 3] = (x − 1)(x − 4) < 0.

So proceding in a manner similar to the previous example, we seek to identify the values of x for
which the graph of
y = (x − 1)(x − 4)
lies below the x-axis.
y

y = (x − 1)(x − 4)

1 4 x

Hence the solution to the inequality is given by

1 < x < 4.

The next example is slightly more difficult, but uses techniques illustrated in the previous two
examples.

c
2020 School of Mathematics and Statistics, UNSW Sydney
6 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS

Example 1.2.3. Solve the inequality

4
x+2≥
x−1
Solution. First observe that x 6= 1.
Multiplying the inequality through by (x − 1)2 gives
(x − 1)2 (x + 2) ≥ 4(x − 1).
Hence
(x − 1)2 (x + 2) − 4(x − 1) ≥ 0.
Now
(x − 1)2 (x + 2) − 4(x − 1) = (x − 1)[(x − 1)(x + 2) − 4]
= (x − 1)[x2 + x − 6]
= (x − 1)(x − 2)(x + 3).
So the original inequality is equivalent to
(x − 1)(x − 2)(x + 3) ≥ 0, x 6= 1.
We now sketch the graph of
y = (x − 1)(x − 2)(x + 3)
and see for which values of x the graph lies on or above the x-axis.
y

y = (x − 1)(x − 2)(x + 3)

−3 1 2 x

Hence the solution to the inequality is

−3 ≤ x < 1 or x ≥ 2.

Remark 1.2.4. A variety of methods (some good and others bad) exist for solving inequalities
where an unknown lies in the denominator. It is strongly recommended that students use the
method shown in this section, or that shown by their first year calculus lecturer.

c
2020 School of Mathematics and Statistics, UNSW Sydney
1.3. ABSOLUTE VALUES 7

1.3 Absolute values

(Ref: SH10 §§1.2, 1.3)
The magnitude, or absolute value, of a real number x is denoted by |x| and defined by
(
x if x ≥ 0
|x| = (1.1)
−x if x < 0.

Absolute values of real numbers interact in the following way.

Proposition 1.3.1. Suppose that x and y are real numbers. Then
(i) | − x| = |x|,

(ii) |xy| = |x||y|,

x |x|
(iii) = , provided that y 6= 0, and
y |y|
(iv) |x + y| ≤ |x| + |y|.
Property (iv) is known as the triangle inequality.
Three other useful facts about absolute values are noted below.
(F1) For every real number x, √
|x| = x2 and |x|2 = x2 . (1.2)

(F2) If a and x are real numbers then |x − a| is equal to the distance between x and a on the real
number line.

(F3) For any positive real number a,

|y| < a if and only if −a<y <a

and
|y| > a if and only if y < −a or y > a.

These facts give different methods for solving inequalities involving absolute values.
Example 1.3.2. Solve the following inequalities:
(a) |x − 3| ≤ 5

(b) |2x + 5| > 8

x − 3
(c) < 1.
x − 1
Solution. (a) An algebraic solution. Using (F3) we have

|x − 3| ≤ 5
−5 ≤ x − 3 ≤ 5
−2 ≤ x ≤ 8.

(a) A geometric solution. By (F2) the inequality |x − 3| ≤ 5 is equivalent to saying that

c
2020 School of Mathematics and Statistics, UNSW Sydney
8 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS

‘the distance between x and 3 is less than or equal to 5.’

The points x on the number line that have this property are shown below.

5 5
|

−2 3 8

Hence −2 ≤ x ≤ 8.
(b) An algebraic solution. Using (F3) the inequality is equivalent to

2x + 5 < −8 or 2x + 5 > 8.

Hence
2x < −13 or 2x > 3,

which gives the solution

13 3
x<− or x> .
2 2
(c) An algebraic solution. By Prop.1.3.1 (iii), we can write, for x 6= 1,

x − 3 |x − 3|
x − 1 = |x − 1|

and since the denominator is positive, the inequality can be written as

|x − 3| < |x − 1|.

Squaring both (positive) sides),and using (F1), we have

(x − 3)2 < (x − 1)2 .

Expanding and solving, we have x > 2.

|x − 3| < |x − 1|, x 6= 1.

By (F2) the inequality is interpreted geometrically as

‘the distance from x to 3 is less than the distance from x to 1.’

In other words, x is closer to 3 than to 1. Points that have this property are shown on the numberline
below.

| |

1 2 3

Hence x > 2.

c
2020 School of Mathematics and Statistics, UNSW Sydney
1.4. FUNCTIONS 9

1.4 Functions
(Ref: SH10 §1.5)
A function f : A → B is a rule which assigns to every element x belonging to a set A exactly one
element f (x) belonging to a set B. The set A is called the domain of the function f and the set B
is called the codomain of f . In this course, A and B are always sets of real numbers.

Example 1.4.1. A function f : [0, ∞) → R is given by

√
f (x) = x

for all x in [0, ∞). Informally, this means that f takes a number x from [0, ∞) as input and gives
√ √
the number x as output. More formally, we say that f maps an element x of [0, ∞) to x. We
offer a few comments on terminology employed to describe f .

• The expression f : [0, ∞) → R says that [0, ∞) is the domain of f and that R is the codomain
of f . We write
Dom(f ) = [0, ∞) and Codom(f ) = R.

• The domain of f is the set of all inputs for the function.

• The codomain of f is a set that contains all the output values of the function. Since the
output values are all real numbers, we say that f is a real-valued function. While all the
outputs must lie in Codom(f ), not every number in Codom(f ) need be an output value.

• The expression f (x) (read as ‘f of x’) is the value of f at the point x. That is, f (x) is the
unique number in R that corresponds to the input x. To emphasise the point: f (x) is a real
number, not a function.

• The statement
√
‘f (x) = x for all x in [0, ∞)’

can be abbreviated as
√
f (x) = x ∀x ∈ [0, ∞).

The symbol ∀ means ‘for all.’

In high school, functions are often described by specifying a rule without specifying a domain.
For example, the following sentence was taken from an HSC study guide:

‘Consider the function f (x) = x2 .’

There are two problems with this sentence. First, neither f (x) nor f (x) = x2 is a function. As
mentioned above, f (x) is a real number, and f (x) = x2 is an equation. (This may seem picky,
but being able to distinguish between a function f and its value f (x) at the point x is crucial to
understanding key concepts in both algebra and calculus.) Second, the quoted sentence does not
specify the domain of the function. The next example gives two different functions that obey the
same squaring rule.

c
2020 School of Mathematics and Statistics, UNSW Sydney
10 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS

Example 1.4.2. Consider the functions f and g, defined by

f :R→R g : [0, ∞) → R
2
and
f (x) = x ∀x ∈ Dom(f ) g(x) = x2 ∀x ∈ Dom(g).
Even though they are both squaring functions, f and g are not the same function. For example, f
can square negative numbers while g does not. On the other hand, g is invertible (this important
and desirable property is discussed in Chapter 6) while f is not. Their graphs, shown below, are
also noticeably different.
f (x) g(x)

x x
If, for whatever reason, the domain of a function is not specified, but the function rule is, then
the default domain, known as the maximal or natural domain, is the largest possible domain for
which the rule makes sense. We give two examples.
• If √
f (x) = x−1
then the rule makes sense only if x − 1 ≥ 0, that is, if x ≥ 1. Therefore the maximal domain
of the function f is [1, ∞).
• If
2
f (x) =
(x − 2)(x − 8)
then the rule only makes sense if x 6= 2 or x 6= 8. So the maximal domain is the set
{x ∈ R : x 6= 2, x 6= 8}.

Another set associated to a function is its range.

Definition 1.4.3. Suppose that f : A → B is a function. The range of f , denoted

by Range(f ), is defined by
Range(f ) = {f (x) ∈ B : x ∈ A}.

In other words, the range of f is the set of all output values. Equivalently, if the rule of a
function is described by the equation y = f (x), then the range of f is the set of all corresponding
y-values. The range is always a subset of the codomain, but the range need not equal the codomain.
The difference between range and codomain is illustrated by the following examples.
Example 1.4.4. Suppose that f : R → R is given by the rule
f (x) = sin x ∀x ∈ R.
Then Dom(f ) = R and Codom(f ) = R (this is obtained from the expression f : R → R). To
determine the range, consider the graph of f is sketched below.

c
2020 School of Mathematics and Statistics, UNSW Sydney
1.4. FUNCTIONS 11
f (x)

−1

The set of all output values (equivalently, the set of all corresponding y-values) is [−1, 1]. Hence
Range(f ) = [−1, 1].

Example 1.4.5. Suppose that f : [−1, 1] → R is given by the rule

p
f (x) = 1 − x2 ∀x ∈ [−1, 1].

Then Dom(f ) = [−1, 1] and Codom(f ) = R (again, we are simply reading this from f : [−1, 1] →
R). To determine the range, we observe that the graph of f is a semicircle.

f (x)

−1 1 x

The set of all outputs (or y-values) is [0, 1]. Hence Range(f ) = [0, 1].

If f and g are two functions with the same domain, then one can combine f and g to form new
functions.

Definition 1.4.6. Suppose that f : A → B and g : A → B are real-valued functions.

Then the functions f + g, f − g, f.g and f /g are defined by the rules
(f + g)(x) = f (x) + g(x) ∀x ∈ A
(f − g)(x) = f (x) − g(x) ∀x ∈ A
(f.g)(x) = f (x)g(x) ∀x ∈ A
f (x)
(f /g)(x) = ∀x ∈ A, provided that g(x) 6= 0.
g(x)

Another way of constructing new functions is given below.

c
2020 School of Mathematics and Statistics, UNSW Sydney
12 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS

Definition 1.4.7. Suppose that f : C → D and g : A → B are functions such that

Range(g) is a subset of Dom(f ). Then the composition f ◦ g : A → D is defined by
the rule
(f ◦ g)(x) = f (g(x)) ∀x ∈ A.

Example 1.4.8. Suppose that f : R → R and g : R → R are given by the rules

f (x) = x2 + 1 ∀x ∈ R and g(x) = x − 2 ∀x ∈ R.
Find (f + g)(5) and (f ◦ g)(x). Is 2 in the domain of the function f /g? Give reasons.
Solution. First,
(f + g)(5) = f (5) + g(5)
= (52 + 1) + (5 − 2)
= 29.
Second, if x in R then
(f ◦ g)(x) = f (g(x))
= (x − 2)2 + 1
= x2 − 4x + 5.
Finally, g(2) = 0 so 2 is not in the domain of f /g.

The final example illustrates the importance of bearing in mind domain and range when com-
posing two functions.
Example 1.4.9. Suppose that f : [0, ∞) → R and g : R → R are given by
√
f (x) = x ∀x ∈ [0, ∞) and g(x) = sin x − 2 ∀x ∈ R.
Find, if they exist, (f ◦ g)(x) and (g ◦ f )(x).
Solution. Consider first f ◦ g. For this function to exist, we require that Range(g) is a subset of
Dom(f ). Now
−1 ≤ sin x ≤ 1 ∀x ∈ R
so
−3 ≤ sin x − 2 ≤ −1 ∀x ∈ R;
that is, Range(g) = [−3, −1]. Since Dom(f ) = [0, ∞), Range(g) is not a subset of Dom(f ) and
hence f ◦ g does not exist. (The point here is that, if
√
(f ◦ g)(x) = f (g(x)) = sin x − 2,
then the expression under the square root sign is always negative.)
Now consider g ◦ f . Since f is a real-valued function and Dom(g) = R, it is clear that Range(f )
is a subset of Dom(g). (The fact that Range(f ) = [0, ∞) is not needed because Dom(g) is so large.)
Hence √
(g ◦ f )(x) = g(f (x)) = sin( x) − 2 ∀x ∈ Dom(f ) = [0, ∞).

c
2020 School of Mathematics and Statistics, UNSW Sydney
1.5. POLYNOMIALS AND RATIONAL FUNCTIONS 13

1.5 Polynomials and rational functions

(Ref: SH10 §1.6)
In the next three sections we discuss some important classes of functions. Since these were intro-
duced in high school, we give a brief survey only.

Definition 1.5.1. A function f : R → R is called a polynomial if

f (x) = an xn + an−1 xn−1 + · · · + a2 x2 + a1 x + a0 ∀x ∈ R,
where n is a natural number, the constants a0 , a1 , . . . , an are real numbers and
an 6= 0. We call n the degree of f , and the constants a0 , a1 , . . . , an are called the
coefficients of f . The constant an is called the leading coefficient of f .

Polynomials of small degree have specials names and properties.

• If n = 0 then f (x) = a0 . Hence f is a constant function and the graph of f is a horizontal

straight line.

• If n = 1 then f (x) = a1 x + a0 . The graph of f is a straight line with gradient a1 and y-axis
intercept a0 .

• If n = 2 then f is called a quadratic function and its graph is a parabola.

• If n is 3, 4 or 5 then f is called a cubic, quartic or quintic, polynomial respectively.

The general shape of the graph of a polynomial function of degree greater than 1 is indicated
by the table below.

Positive leading coefficient Negative leading coefficient

Odd degree

Even degree

The number of turning points in the graph is always strictly less than the degree of the polynomial.
Exactly how many turning points the graph has depends on the coefficients of the polynomial, and
can be determined using calculus (see, for example, Chapters 4 and 5).
By dividing two polynomials, one obtains a new type of function.

c
2020 School of Mathematics and Statistics, UNSW Sydney
14 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS

Definition 1.5.2. Suppose that p and q are polynomials. A function f is called a

rational function if
Dom(f ) = {x ∈ R : q(x) 6= 0}
and
p(x)
f (x) = , ∀x ∈ Dom(f ).
q(x)

Hence the expressions

x x3 − 4x2 + 1
f (x) = 2
and f (x) =
x +2 x−2
both give rise to rational functions, as does
3
f (x) = x − 1 +
x2 + 3
(to see why, rewrite right-hand side with a common denominator of x2 + 3).
The graphs of two simple rational functions are shown below.
y y

1 1
y= x y= x2

x x

The graphs of more sophisticated rational functions are studied in Chapter 7.

1.6 The trigonometric functions

(Ref: SH10 §1.6)
The trigonometric functions will already be familiar to you. This section gives a brief summary of
their definitions and some of their properties.
Remark 1.6.1. In this course, angles are always measured in radians rather than degrees. (Radian
measure is a more natural way of measuring angles; moreover, important formulae used in calculus,
such as
d
(sin θ) = cos θ,
dθ
only make sense geometrically when θ is interpreted as an angle measured in radians rather than
degrees.) We remind readers that

2π radians = 360 degrees.

c
2020 School of Mathematics and Statistics, UNSW Sydney
1.6. THE TRIGONOMETRIC FUNCTIONS 15

Consider an angle θ and the corresponding point P (x, y) that lies on the unit circle centred at
the origin, as shown below.

P (x, y)
y

θ 1
x

The functions cos : R → R and sin : R → R are defined by

cos θ = x and sin θ = y.

Other trig functions are defined in terms of the sine and cosine functions:

sin θ
tan θ = , provided that cos θ 6= 0
cos θ
1
sec θ = , provided that cos θ 6= 0
cos θ
1
cosec θ = , provided that sin θ 6= 0
sin θ
cos θ
cot θ = , provided that sin θ 6= 0.
sin θ
The graphs of sin, cos and tan are shown below.

sin θ
1
|

| | | | |

−2π π π 3π 2π θ
2 2
−1
|

cos θ
1
|

| | | | |

−2π π π 3π 2π θ
2 2
−1
|

c
2020 School of Mathematics and Statistics, UNSW Sydney
16 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
tan θ

|
| | | | |

−2π π π 3π 2π θ
−1 2 2

|
The sine and cosine functions are 2π-periodic, while the tangent function is π-periodic, which
means that

sin(θ + 2π) = sin θ, cos(θ + 2π) = cos θ and tan(θ + π) = tan θ

whenever θ is in the domain of the respective function. Sine and cosine both have an amplitude of
1.
It is not hard to obtain functions whose graphs have the same general shape as the sine (or
cosine) curve with a period other than 2π and amplitude other than 1. For example, if A and ω
are positive real numbers then the function f : R → R, given by

f (x) = A sin(ωx) ∀x ∈ R,
2π
has an amplitude of A and a period of ω . Its graph is shown below.

f (x)

A
|

| | | | |

− 2π 2π x
ω ω

−A
|

The six trigonometric functions are related to one another by various identities and formulae:

• complementary identities
π
sin − x = cos x
2
π
cos − x = sin x
2

• Pythagorean identities

cos2 x + sin2 x = 1
1 + tan2 x = sec2 x
cot2 x + 1 = cosec2 x

c
2020 School of Mathematics and Statistics, UNSW Sydney
1.7. THE ELEMENTARY FUNCTIONS 17

• the sum and difference formulae

sin(x ± y) = sin x cos y ± cos x sin y
cos(x ± y) = cos x cos y ∓ sin x sin y
tan x ± tan y
tan(x ± y) =
1 ∓ tan x tan y
• double-angle formulae
sin(2x) = 2 sin x cos x
cos(2x) = cos2 x − sin2 x
2 tan x
tan(2x) = .
1 − tan2 x
The above formulae, especially the double-angle formulae for sin(2x) and cos(2x), should be learnt
carefully.
The inverse trigonometric functions will be discussed in Chapter 6.

1.7 The elementary functions

(Ref: SH10 §1.6)
Other important functions that have not yet been mentioned include the exponential and logarithm
functions. Their graphs are shown below.
ex

ln x
(e, 1)
b
b
(1, e) 1
x
1
x

These functions will be studied in greater depth in Chapter 9.

Functions arising from the study of roots of polynomials are also important to mathematics,
√
especially those functions f of the form f (x) = n x, where n is a positive integer. It is important
that students are familiar with the graphs of the square root and cube root functions, shown below.

√ √
x 3
x
b b

(1, 1) (1, 1)
x (−1, −1) x
b

c
2020 School of Mathematics and Statistics, UNSW Sydney
18 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS

The elementary functions are all those functions that can be constructed by combining a finite
number of polynomials, exponentials, logarithms, roots and trigonometric functions (including the
inverse trigonometric functions) via function composition, addition, subtraction, multiplication and
division. Hence the following expressions give rise to elementary functions:

f (x) = esin x + x2
ln x − tan x
g(x) = √
x
p3
h(x) = x4 − 2x2 + 5.

It also follows that every rational function is an elementary function.

The absolute values function, given by

abs : R → R
abs(x) = |x| ∀x ∈ R,

is an elementary function by equation (1.2). It follows that if f is an elementary function then

abs ◦ f is also an elementary function. It is standard to denote function abs ◦ f by |f |. Thus

|f |(x) = (abs ◦ f )(x) = |f (x)| ∀x ∈ Dom(f ).

The graph of |f | is obtained by reflecting in the x-axis any part of the graph of f that lies below
the x-axis. This is illustrated below.
f (x) |f (x)|

x x

Most, if not all functions studied in high school were elementary functions. In this course we
shall meet some useful functions that are not elementary functions. (These are defined by integrals;
see Section 8.12.)

1.8 Implicitly defined functions

Many curves on the plane can be described as all those points (x, y) on the plane that satisfy some
equation involving x and y. For example, consider the equation

(x2 + y 2 − 1)3 − x2 y 3 = 0. (1.3)

The set of points (x, y) satisfying this equation are shown on the graph below.

c
2020 School of Mathematics and Statistics, UNSW Sydney
1.8. IMPLICITLY DEFINED FUNCTIONS 19
y

Note that the curve cannot be described by one function since some x-values have two corresponding
y-values. However, the curve can be described by combining the graphs of two functions f :
[−a, a] → R and g : (−a, a) → R, which are defined by the rules

 y = f (x)
(x2 + y 2 − 1)3 − x2 y 3 = 0

y≥b

and 
 y = g(x)
(x2 + y 2 − 1)3 − x2 y 3 = 0

y<b

(where a ≈ 1.139 and b ≈ 0.545), as illustrated below.

y y

b y = f (x) b y = g(x)
|

| | | |
−a a x −a a x

We say that f and g are implicitly defined by equation (1.3).

Example 1.8.1. The set of points (x, y) that satisfy the equation

x2 + y 2 = 1

describes a circle of radius 1 centred at the origin. The above equation could be used to implicitly
define many different functions. We give two examples.

c
2020 School of Mathematics and Statistics, UNSW Sydney
20 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS

• The function f : [0, 1] → R is defined by the rule


 y = f (x)
x2 + y 2 = 1

y ≥ 0.
Its graph is shown below.
y

1 y = f (x)

1 x

• The function g : (−1, 1) → R is defined by the rule


 y = f (x)
x2 + y 2 = 1

y ≤ 0.
Its graph is shown below.
y

−1 1
x

y = g(x)
−1

Many heavenly bodies (such as planets and comets) trace out paths that are roughly elliptic,
parabolic or hyperbolic. (The fact that planets trace out elliptic paths around the sun was discov-
ered by Johannes Kepler at the beginning of the sixteenth century, and was later used by Newton, in
conjunction with calculus, to establish his universal law of gravitation.) Circles, ellipses, parabolas
and hyperbolas are known as conic sections because they can be obtained by intersecting a cone
with a plane. Equations that describe conic sections, accompanied by corresponding diagrams, are
shown below.
y y

r b

−r r x −a a x

−r −b
Circle: x2 + y 2 = r 2 x2 y 2
Ellipse: + 2 =1
a2 b

c
2020 School of Mathematics and Statistics, UNSW Sydney
1.9. CONTINUOUS FUNCTIONS 21

y y = ab x y

b y = ab x
−a a
x x
−b

x2 y 2 y 2 x2
Hyperbola: − 2 =1 Hyperbola: − 2 =1
a2 b b2 a

Students should be able to recognise such curves.

1.9 Continuous functions

The sine, cosine, exponential, logarithmic and polynomial functions are examples of continuous
functions. Intuitively, a function f is continuous on its domain if small changes in the input variable
x produce small changes in f (x) (assuming, of course, that x always remains within Dom(f )).
The intuitive notion that a function is continuous if ‘its graph can be drawn without lifting the
pencil off the page’ works well for some functions, but not for others. For example, the tangent
function is a continuous function, even though its graph (part of which is shown below) cannot be
drawn without lifting the pencil off the page.

tan x

| | | |
−π − π2 π π x
2

The break in the graph when, say, x = −π/2 or x = π/2, is a consequence of a break in the domain
of the function rather than a discontinuity of the function. Similarly, the trigonometric functions
sec, cosec and cot are continuous, as are the rational functions. The notion of continuity shall be
discussed in more precise terms in Chapters 2 and 3.
Intuitively, a function f has a discontinuity at a point a if a ∈ Dom(f ) and there is a break in
the graph of f at a. It is important to realise that there are different types of discontinuities. We
compare three examples.

c
2020 School of Mathematics and Statistics, UNSW Sydney
22 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
f (x) g(x) h(x)

b
bc bc

b b

| | |
a x a x a x

The functions f , g and h each have a discontinuity at a.

• For f , the discontinuity can be easily removed by redefining the value of f (a) to ‘plug up the
hole’ at a. This is an example of a removable discontinuity.

• For g, the discontinuity is a result of a finite jump in the graph of g at the point a. The
discontinuity for g, which is an example of a jump discontinuity, is worse than that for f but
not nearly as bad as that for h.

• For h, the discontinuity is a result of rapid oscillation on the right-hand side of a and an
infinite jump on the left-hand side of a. This is an example of an essential discontinuity.

While students are not expected to learn or use the terms removable, jump and essential disconti-
nuity, it is expected that they will appreciate that some discontinuities are worse than others.

1.10 Maple notes

Many chapters in these notes conclude with a summary of relevant Maple commands. Other sources
of information are the First Year Computing Notes and the online Maple help command. While
Maple can relieve you of the tedium of calculations, it is not a substitute for understanding the
mathematics behind the calculations, which is what you need in order to interpret Maple output
intelligently.
The following Maple command is relevant to the material of this chapter:
plot(f(x),x=a..b); draws a (two-dimensional) plot of the graph of y = f (x) for
a ≤ x ≤ b. To find out about the many options available, use ?plot. Implicitly defined
functions can be plotted using the smartplot command. For example,

> # draw a rational function

> f:=x^5-2*x^4+3*x^3-6*x^2-4*x+8:

> g:=4*x^4-8*x^3-12*x^2+16*x+16:

> plot(f/g, x=-10..10,y=-10..10,numpoints=1000,discont=true);

c
2020 School of Mathematics and Statistics, UNSW Sydney
1.10. MAPLE NOTES 23

> # draw a hyperbola, via implicit definition

> smartplot(x^2/9-y^2/4=1);

> # use the Maple plot command implicitplot --- need more plotting tools
> with(plots):
> implicitplot(y^5+x^2*y^3+16,x=-10..10,y=-2..0);

c
2020 School of Mathematics and Statistics, UNSW Sydney
24 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS

Problems for Chapter 1

Questions marked with [R] are routine, with [H] are harder and with [X] are for MATH1141 only.
Questions marked with [V] have a video solution available from Moodle. You should make sure
that you can do the easier questions before you tackle the more difficult questions.

Problems 1.1 : Sets of numbers

1. [R] Express the following sets in words. Graph the sets on the number line (if possible).

a) {x ∈ Z : −π < x < π}
b) {x ∈ R : x2 − x − 1 < 0}
c) {x ∈ Q : x2 = 2}

2. [R] Graph on the number line the following sets.

a) [3, ∞), (−∞, 3), (−∞, ∞), (−3, 3]

b) {x : |x − 2| < 5}
c) {x : x2 + 4x − 5 > 0}

3. [R] Sketch the set of points (x, y) which satisfy the following relations.

a) 0 ≤ y ≤ 2x and 0≤x≤2 b) y/2 ≤ x ≤ 2 and 0≤y≤4

Problems 1.2 : Solving inequalities and

1.3 : Absolute values

4. [R] Solve the following inequalities.

1 1
a) x(x − 1) > 0 b) (x − 1)(x − 2) < 0 c) >−
x 2
1 1 6
d) > e) x ≥
1−x 2 x−1

5. [R] Solve the following inequalities.

x − 1
a) x + 1< 3 b) x + 2> 3 c) 3x + 2< 1 d)
<1
x + 1

6. [R] [V]

a) By expanding (x − y)2 , prove that x2 + y 2 ≥ 2xy for all real numbers x and y.
a+b √
b) Deduce that ≥ ab for all non-negative real numbers a and b. When does
2
equality hold?
1
c) Use the result above to find the minimum value of y = x2 + 2 .
x
c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 1 25

7. [R] True or false:

1 1
a) If a > b then < . b) If a < b then a2 < b2 .
a b
c) If 0 < a < b then a2 < b2 . d) If a2 + b2 = 0 then a = b = 0.
e) If −1 < a < b then a2 < b2 .

1 1 4
8. [H] Prove that (x + y)2 ≥ 4xy and hence deduce that + ≥ 2 .
x2 y 2 x + y2

9. [H] [V]

a) Prove that f (x) = 1 + x + x2 is positive for all real numbers x.

b) By considering cases (or otherwise) prove that 1 + x + x2 + x3 + x4 is always positive.
c) Generalise the above results.

Problems 1.4 : Functions

10. [R] Determine the (maximal) domain and corresponding range for each function f de-
scribed below.
√ √
a) f (x) = 5 − x2 b) f (x) = x2 − 5
√
c) f (x) = (x − 8)−1/3 d) f (x) = x − 1
1 √
e) f (x) = √ f) f (x) = sin x
x−1
√
g) f (x) = 1 − 2 sin x h) f (x) = 1 + tan2 x

cos

√
x if x < 0
i) f (x) = 1 − x if 0 ≤ x ≤ 1


|x| if x > 1

11. [R] If f (x) = x + 5 and g(x) = x2 − 3 then find

a) (g ◦ f )(0) b) (g ◦ f )(x) c) (f ◦ g)(2) d) (f ◦ g)(x).

1
12. [R] If f (x) = x − 1 and g(x) = √ , then give the explicit forms of
x−1

f
a) (f + g)(x) b) (f g)(x) c) (x) d) (f ◦ g)(x).
g

Problems 1.5 : Polynomials and rational functions,

1.6 : The trigonometric functions and
1.7 : The elementary functions

13. [R] Draw neat sketches (preferably without using calculus) of the graphs given by the
following equations.

c
2020 School of Mathematics and Statistics, UNSW Sydney
26 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS
4
a) y = x2 − 5x + 6 b) y = 2x3 − 16 c) y =
x−3
1
d) y = 2ex−1 e) y = f) y = 3 sin 2x
x2 +4
√
g) y = x − 1

14. [R]
√
a) Sketch the graph of y = x + 1 and use your graph to sketch (on the same diagram)
1
y=√ .
x+1
b) Repeat for y = x2 − 4x + 3.

15. [R] Sketch the graph of y = x2 − 7x − 8 and hence sketch the graph of y = |x2 − 7x − 8|.

16. [R] What range of values will x2 + 4 take if −2 ≤ x ≤ 3?

17. [R] Use a graphical approach to solve |2x − 5| = x + 2.

18. [R]

a) Show that if p and q are polynomials then p ◦ q is again a polynomial.

b) [H] Is the same true for rational functions?

Problems 1.8 : Implicitly defined functions

19. [R] Sketch the graphs given by the following equations.

x2 y 2 x2 y 2
a) + =1 b) − =1
9 4 9 4
y 2 x2
c) 4x2 + 9y 2 = 36 d) − =1
9 4

c
2020 School of Mathematics and Statistics, UNSW Sydney
27

Chapter 2

Limits

In many situations, it is desirable to know what the long term state of a system will be. For
example, suppose that an initially unpolluted lake containing 109 litres of water has a river flowing
through it at a rate of 106 litres per day. A factory is built next to the lake and discharges 104
litres of pollution per day. By making some simple assumptions, it can be shown that the amount
P (t) of pollution in the lake after t days of the factory’s operation is given by

109 5
P (t) = 1 − e−101t/10 . (2.1)
101
Environmental authorities want to know what the long term level of the pollution will be. To see
whether the pollution in the lake eventually stabilises, and what the level of pollution will be, one
studies the behaviour of P (t) as t → ∞.
Understanding the limiting behaviour of functions at infinity is important for practical and
mathematical reasons. That is what we study in the next four sections.

2.1 Limits of functions at infinity

(Ref: SH10 §4.7)
In this section, we examine some techniques for calculating limits of the form lim f (x).
x→∞

2.1.1 Basic rules for limits

There are a few basic things to bear in mind when working with limits.
• If f (x) gets arbitrarily close to some real number L as x tends to infinity, then we write

lim f (x) = L.
x→∞

We can also write f (x) → L as x → ∞.

• If f (x) gets arbitrarily large (that is, approaches ∞) as x tends to ∞ then we write

f (x) → ∞ as x → ∞.

We do not write lim f (x) = ∞ since ∞ is not a real number. (The lim notation is only
x→∞
used when the limit exists and is a real number.)

c
2020 School of Mathematics and Statistics, UNSW Sydney
28 CHAPTER 2. LIMITS

• If f (x) → ∞ as x → ∞ then
1
lim = 0.
x→∞ f (x)
This is intuitively obvious and gives limits such as

1 5
lim = 0, lim = 0, lim e−x = 0
x→∞ x x→∞ 4x6 x→∞

1
(for the last example, recall that e−x = ex ).

The following proposition shows that the arithmetic of limits behaves nicely in many situations.

Proposition 2.1.1. Suppose that lim f (x) and lim g(x) exist and are finite real numbers. Then
x→∞ x→∞

(i) lim [f (x) + g(x)] = lim f (x) + lim g(x)

x→∞ x→∞ x→∞

(ii) lim [f (x) − g(x)] = lim f (x) − lim g(x)

x→∞ x→∞ x→∞

(iii) lim [f (x)g(x)] = [ lim f (x)][ lim g(x)]

x→∞ x→∞ x→∞

f (x) lim f (x)

(iv) lim = x→∞ , provided that lim g(x) 6= 0.
x→∞ g(x) lim g(x) x→∞
x→∞

Part of this proposition will be proved in Section 2.4. For now, we demonstrate how the
proposition is used to calculate limits.

Example 2.1.2. Calculate

3 − 1/x2
lim
x→∞ 2 + e−x

if the limit exists. Justify each step of your working with reference to Proposition 2.1.1.

Solution.

3 − 1/x2 lim (3 − 1/x2 )

x→∞
lim = (rule (iv))
x→∞ 2 + e−x lim (2 + e−x )
x→∞
lim 3 − lim 1/x2
x→∞ x→∞
= (rules (i) and (ii))
lim 2 + lim e−x
x→∞ x→∞
3−0
=
2+0
3
= .
2

In practice, you may use the rules of Proposition 2.1.1 without referring to them, and leave out
some of the steps shown above.

c
2020 School of Mathematics and Statistics, UNSW Sydney
2.1. LIMITS OF FUNCTIONS AT INFINITY 29

2.1.2 The pinching theorem

The idea behind the following theorem is intuitively obvious. Suppose, as illustrated in the diagram
below, that the graph of a function h always lies above the graph of a function f . Suppose also
that f and h have the same limit as x → ∞. If the graph of a function g always lies between the
graphs of h and f (so that g is ‘pinched’ between f and h) then g has the same limit also.
y
y = h(x)
y = g(x)

L
b
|
x

y = f (x)

This idea is stated precisely below.

Theorem 2.1.3 (The pinching theorem). Suppose that f , g and h are all defined on the interval
(b, ∞), where b ∈ R. If
f (x) ≤ g(x) ≤ h(x) ∀x ∈ (b, ∞)
and
lim f (x) = lim h(x) = L
x→∞ x→∞

then
lim g(x) = L.
x→∞

One can modify the theorem to cover the case when x → −∞.
sin x
Example 2.1.4. Use the pinching theorem to find lim .
x→∞ x

Solution. We begin with the basic inequality

−1 ≤ sin x ≤ 1,

which is valid for every real number x. If we restrict x to the interval (0, ∞) then we have

−1 sin x 1
≤ ≤ .
x x x
Now
1 1
lim − = lim = 0,
x→∞ x x→∞ x
and so
sin x
lim =0
x→∞ x
by the pinching theorem.

c
2020 School of Mathematics and Statistics, UNSW Sydney
30 CHAPTER 2. LIMITS

2.1.3 Limits of the form f (x)/g(x)

Suppose that we want to calculate a limit of the form
f (x)
lim
x→∞ g(x)

where both f (x) and g(x) tend to infinity as x → ∞. In this situation, we cannot apply the rules of
Proposition 2.1.1 directly because the numerator and denominator don’t have finite limits. Instead,
we find an equivalent form of fg(x)
(x)
for which Proposition 2.1.1 can be applied. The key idea is to
divide both f and g by the fastest growing term appearing in the denominator g.
4x2 − 5
Example 2.1.5. Evaluate lim , if it exists.
x→∞ 2x2 + 3x

Solution. There are two terms appearing in the denominator: 2x2 and 3x. As x → ∞, the term
which grows fastest is the one involving x2 . So we divide the numerator and denominator by x2 to
evaluate the limit:
4x2 − 5 4 − 5/x2
=
2x2 + 3x 2 + 3/x
4−0
→
2+0
as x → ∞. Therefore
4x2 − 5
lim = 2.
x→∞ 2x2 + 3x

In general, we divide the numerator and denominator by the highest power of x in the denom-
inator.
5x3 + 6x2 − 4 sin x
Example 2.1.6. Find lim .
x→∞ cos 3x + 5x − 2x3

x2 + 3x
Example 2.1.7. Find lim √ .
x→∞ 2x4 + 3 − 4x
p p
2.1.4 Limits of the form f (x) − g(x)
√ √ √ √
Consider the limit of x + 5 − x + 2 as x → ∞. Since both x + 5 and x + 2 tend to infinity
(rather than a finite number) as x → ∞, one cannot apply Proposition 2.1.1. As the next example
shows, a simple algebraic trick is used to overcome this difficulty.
√ √
Example 2.1.8. Evaluate lim x + 5 − x + 2 , if it exists.
x→∞
√
Solution. The algebraic trick is to multiply both the numerator and denominator by x + 5 +
√
x + 2 and expand the numerator as a difference of squares:
√ √ √ √
√ √ ( x + 5 − x + 2)( x + 5 + x + 2)
x+5− x+2= √ √
x+5+ x+2
(x + 5) − (x + 2)
=√ √
x+5+ x+2
3
=√ √
x+5+ x+2
→0

c
2020 School of Mathematics and Statistics, UNSW Sydney
2.2. THE DEFINITION OF LIM F (X) 31
X→∞

as x → ∞.

One should not be fooled into thinking, on the basis of the above example, that all limits of
this type are zero.
p 1
Example 2.1.9. Show that lim x2 + x − x = .
x→∞ 2

2.1.5 Indeterminate forms

We have already mentioned that the rules of Proposition 2.1.1 only apply when the limits involved
are finite. In Section 2.1.3, we studied limits of the type

f (x)
lim ,
x→∞ g(x)

when f (x) → ∞ and g(x) → ∞ as x → ∞. We say that such a limit is of the form ∞ ∞ . We cannot
∞
say in advance whether or not a limit of the form ∞ exists, and if it does exist, what its value is.
For example, while the following limits have the form ∞ ∞ , each displays very different limiting
behaviour as x → ∞:
x2
• x → ∞ as x → ∞
x
• x2
→ 0 as x → ∞

2x2
• x2
→ 2 as x → ∞

x2 1
• 2x2
→ 2 as x → ∞.

Because we cannot determine in advance what kind of limiting behaviour something of the form
∞ ∞
∞ has, we say that ∞ is an indeterminate form.
Other types of indeterminate forms (for example, limits of the form 00 and ∞−∞) arise naturally
in many applications. Techniques for evaluating such limits will be developed at various stages of
the course.

2.2 The definition of lim f (x)

x→∞

Up to this point, we have treated limits in a rather intuitive way. It is now time to establish a
rigorous basis for our treatment of limits. That is, we seek a definition for lim f (x) = L that is
x→∞
mathemetically precise and agrees with our intuitive notion of limits. How do we do this? A good
strategy is to think deeply about a few simple examples before considering the general case.
Consider the real-valued functions f , g and h defined by

1
f (x) = , g(x) = sin x and h(x) = 0.05
x
whenever x > 0. Their graphs are sketched below.

c
2020 School of Mathematics and Statistics, UNSW Sydney
32 f (x) g(x) h(x) CHAPTER 2. LIMITS

2π 4π
0 0 0
x x x

Intuitively, we see that f (x) tends to 0 as x tends to infinity, while neither g nor h have this
property. What is it about f that distinguishes its limiting behaviour from that of g or h?
The above graph shows that the distance between f (x) and its limit 0 is small whenever x is
large. However, to get to the core difference between f and the other two functions, we must be
more precise.
• The distance between h(x) and 0 is also small, but its limit is not 0. The reason why f (x)
tends to 0 as x → ∞ is that the distance between f (x) and 0 can be made as small as we
like whenever x is large enough.
• The distance between g(x) and 0 can also be made as small as we like, by taking x
sufficiently close to an integer multiple of π. However the limit of g(x) is not 0. The
difference between f and g is that the distance between f (x) and 0 remains small for all
sufficiently large values of x. The same is not true for g.
In summary, f (x) tends to 0 as x → ∞ because the distance between f (x) and 0 can be made as
small as we like for all values of x that are sufficiently large.
The next step in crafting a good definition is to express phrases like ‘can be made as small as we
like’ and ‘sufficiently large’ in concrete mathematical language. The following list is one attempt
to do this. (As a reminder, f (x) = 1/x.)
• The distance between f (x) and 0 is less than 1 whenever x > 1.
y
y = f (x)
(1, 1)

0
1 x
−1

• The distance between f (x) and 0 is less than 0.2 whenever x > 5.
y
y = f (x)

(5, 0.2)

0.2
0
−0.2 5 x

c
2020 School of Mathematics and Statistics, UNSW Sydney
2.2. THE DEFINITION OF LIM F (X) 33
X→∞

• The distance between f (x) and 0 is less than 0.1 whenever x > 10.

• The distance between f (x) and 0 is less than 0.01 whenever x > 100.

• The distance between f (x) and 0 is less than 0.0001 whenever x > 10000.

• etc.
Of course, to express the fact that ‘the distance between f (x) and 0 can be made as small as
we like’ in this way would require an infinite list. Instead, we say that, for every small positive
number ǫ, there is real number M such that
• the distance between f (x) and 0 is less than ǫ whenever x > M .
y

y = f (x)
( 1ǫ , ǫ)

ǫ
0
−ǫ M x

By looking at the graph we can see that M = 1ǫ . (The symbol ǫ is the Greek letter ‘epsilon’ and is
traditionally used by mathematicians in this context.)
So far we have agreed that f (x) → 0 as x → ∞ precisely because
for every positive real number ǫ, there is a real number M such that the distance
between f (x) and 0 is less than ǫ whenever x > M .
This statement forms a basis for defining what is meant by lim f (x) = 0. A general definition for
x→∞
what is meant by lim f (x) = L is now obtained by
x→∞

• replacing the limit 0 with the limit L, and then

• replacing ‘the distance between f (x) and L’ with |f (x) − L| (see Section 1.3).
We are now ready to give a general definition for the limit of a function at infinity.

Definition 2.2.1. Suppose that L is a real number and f is a real-valued function

defined on some interval (b, ∞). We say that lim f (x) = L if
x→∞

for every positive real number ǫ, there is a real number M such

that if x > M then |f (x) − L| < ǫ.

Remark 2.2.2. Suppose that lim f (x) = L. Definition 2.2.1 can be interpreted geometrically in
x→∞
the following way. For every small number ǫ, you can find a marker M on the x-axis such that the
distance between f (x) and the limit L is less than ǫ whenever x lies to the right of M .

c
2020 School of Mathematics and Statistics, UNSW Sydney
34 CHAPTER 2. LIMITS
y
y = f (x)

L+ǫ
L
L−ǫ
|

M x

Remark 2.2.3. In Definition 2.2.1, the number M depends on ǫ. In general, the smaller the value
of ǫ, the larger the value of M . We note, however, that for any particular value of ǫ the choice of
M is not unique. For example, the M chosen for the above diagram is different from the M chosen
for the diagram below, but both choices guarantee that

|f (x) − L| < ǫ whenever x > M.

y
y = f (x)

L+ǫ
L
L−ǫ
|

M x

Remark 2.2.4. Students who find Definition 2.2.1 difficult to understand can take comfort in the
fact that a rigorous formulation of limits evaded mathematicians for over 2000 years. The problems
and paradoxes involving limiting processes tabled around 450 BC by the ancient Greek philosopher
Zeno of Elea were only satisfactorily resolved in the nineteenth century, when a rigorous definition
of the limit was given by Bolzano (in 1817) and Weierstrass (in the 1850s).

2.3 Proving that lim f (x) = L using the limit definition

x→∞

In the previous section we gave a definition for lim f (x) = L. In this section this definition is used
x→∞
to prove limits that were previously taken for granted.
4
Example 2.3.1. Prove from Definition 2.2.1 that lim = 0.
x→∞ x2

Proof. Suppose that f (x) = 4/x2 , L = 0 and ǫ > 0. We begin by calculating the distance between
the function and its limit:

4
|f (x) − L| = 2 − 0
x
4
= 2. (2.2)
x

c
2020 School of Mathematics and Statistics, UNSW Sydney
2.3. PROVING THAT LIM F (X) = L USING THE LIMIT DEFINITION 35
X→∞

We need to find a condition on x such that |f (x) − L| < ǫ. By (2.2), we require that
4
< ǫ.
x2
By rearrangement, we see that this is satisfied whenever
2
x> √ .
ǫ

So far we have shown that |f (x) − L| < ǫ whenever x > √2 . Hence if

2
M=√ . (2.3)
ǫ

then |f (x) − L| < ǫ whenever x > M . This completes the proof.

The above proof can be streamlined by omitting some explanatory commentary. In the next
example, explanatory comments (which are not part of the proof) are inserted inside square brack-
ets.

Example 2.3.2. Prove that

5x
lim =5
x→∞ x+3
by using Definition 2.2.1.

Proof. Suppose that ǫ > 0. For convenience, suppose also that x > 0. [This is allowed since we are
considering the behaviour of f (x) for large positive values of x.] Now

5x
|f (x) − L| = − 5
x+3

5x − 5(x + 3)
=
x+3

−15
=
x + 3
15
= [since x > 0]
x+3
15
< [to make algebra simpler later on].
x

In summary,
15
|f (x) − L| < .
x
[This inequality gives an upper bound for the distance between f (x) and L.] Hence |f (x) − L| < ǫ
whenever
15
< ǫ.
x
This condition is equivalent to
15
x> .
ǫ
c
2020 School of Mathematics and Statistics, UNSW Sydney
36 CHAPTER 2. LIMITS
15
Hence if M = ǫ then
|f (x) − L| < ǫ whenever x > M,
as required.

Before attempting one more proof, we reflect on the general strategy employed. Given ǫ, we
need to find a number M such that

|f (x) − L| < ǫ whenever x > M.

The number M can be found by following the procedure below.

1. Find a good upper bound for |f (x) − L|.
2. Find a simple condition on x such that this upper bound is less than ǫ.
3. Use this condition to state an appropriate value for M (in terms of ǫ).
Example 2.3.3. Prove that
sin x
lim =0
x→∞ 9x2 + e3x
by using Definition 2.2.1.

Proof. Suppose that ǫ > 0. Now

sin x
|f (x) − L| = 2
− 0
9x + e3x
| sin x|
=
|9x2 + e3x |
1
≤ 2 [using the fact that −1 ≤ sin x ≤ 1]
9x + e3x
1
< 2 [decreasing the denominator increases the RHS].
9x
In summary,
1
|f (x) − L| < .
9x2
[This is our upper bound for |f (x) − L|.] Now
1
<ǫ
9x2
whenever
1
x> √ .
3 ǫ
Set
1
M= √ .
3 ǫ
Then |f (x) − L| < ǫ whenever x > M , as required.

Remark 2.3.4. As pointed out in Remark 2.2.3, the value of M is not unique. For instance, the
upper bound for |f (x) − L| in our solution to Example 2.3.3 was 1/(9x2 ) and the corresponding
√
value for M was 1/(3 ǫ). If instead we use the upper bound 1/e3x then the corresponding value
for M is 13 ln 1ǫ .

c
2020 School of Mathematics and Statistics, UNSW Sydney
2.4. PROOFS OF BASIC LIMIT RESULTS (MATH1141 ONLY) 37

2.4 Proofs of basic limit results (MATH1141 only)

By using Definition 2.2.1, one can now prove the limit rules stated in Section 2.1. The selection of
proofs given in this section is designed to give MATH1141 students a taste of how such results are
proved.

Proof of Proposition 2.1.1 (i). Suppose that

lim f (x) = L1 and lim g(x) = L2 .

x→∞ x→∞

We need to show, using Definition 2.2.1, that

lim f (x) + g(x) = L1 + L2 .
x→∞

We begin by fixing a positive real number ǫ.

Since lim f (x) = L, there is a positive M1 such that
x→∞

ǫ
|f (x) − L1 | < whenever x > M1 .
2
Similarly, there is a positive M2 such that
ǫ
|g(x) − L2 | < whenever x > M2 .
2
Hence, by the triangle inequality,

(f (x) + g(x)) − (L1 + L2 ) ≤ |f (x) − L1 | + |g(x) − L2 |
ǫ ǫ
< +
2 2
=ǫ

whenever both x > M1 and x > M2 . Now set M = max{M1 , M2 } so that

(f (x) + g(x)) − (L1 + L2 ) < ǫ whenever x > M.

This proves that lim f (x) + g(x) = L1 + L2 .
x→∞

The other rules in Proposition 2.1.1 can be proved similarly, except that sometimes the desired
inequalities are harder to obtain. We move on to prove the pinching theorem.

Proof of Theorem 2.1.3. Suppose that

f (x) ≤ g(x) ≤ h(x) ∀x ∈ (b, ∞)

and that
lim f (x) = lim h(x) = L.
x→∞ x→∞

Fix a positive real number ǫ.

Since lim f (x) = lim h(x) = L, there are real numbers M1 and M2 , both greater than b, such
x→∞ x→∞
that
|f (x) − L| < ǫ whenever x > M1

c
2020 School of Mathematics and Statistics, UNSW Sydney
38 CHAPTER 2. LIMITS

and
|h(x) − L| < ǫ whenever x > M2 .
In other words,
L − ǫ < f (x) < L + ǫ whenever x > M1
and
L − ǫ < h(x) < L + ǫ whenever x > M2 .
Hence
L − ǫ < f (x) ≤ g(x) ≤ h(x) < L + ǫ
whenever x > max{M1 , M2 }. This shows that

|g(x) − L| < ǫ whenever x > max{M1 , M2 },

completing the proof.

2.5 Limits of functions at a point

(Ref: SH10 §§2.1–2.3, 2.5)
So far we have only discussed limits of the form lim f (x). In this section we study limits of the
x→∞
form lim f (x), where a is a finite real number.
x→a

2.5.1 Left-hand, right-hand and two-sided limits

Consider the function f whose graph is shown below.

f (x)

4 b
|

3 bc
|

2 bc
|

1 b b
|

| | | |

2 4 6 8 x

With reference to this graph, we will discuss the behaviour of f (x) when x is near the points 2, 4,
6 and 8.

• As x approaches 2 from the left-hand side, f (x) approaches 3. In this situation, we write

lim f (x) = 3
x→2−

c
2020 School of Mathematics and Statistics, UNSW Sydney
2.5. LIMITS OF FUNCTIONS AT A POINT 39

and say that ‘the limit of f (x) as x approaches 2 from the left is 3.’ (Note that f (2) = 1 6= 3.
One should not confuse a limit with a function value.) As x approaches 8 from the left-hand
side, f (x) approaches 1 and hence we write

lim f (x) = 1.
x→8−

Both of these limits are called left-hand limits.

• As x approaches 2 from the right-hand side, f (x) approaches 1. In this situation, we write

lim f (x) = 1
x→2+

and say that ‘the limit of f (x) as x approaches 2 from the right is 1.’ As x approaches 6
from the right-hand side, f (x) approaches 2 and hence we write

lim f (x) = 2.
x→6+

Both of these limits are called right-hand limits.

• As x approaches 4 from the left-hand side, f (x) is positive and grows large without bound.
In this case, lim f (x) does not exist because f (x) does not approach a real number.
x→4−
However, we can write
f (x) → ∞ as x → 4− .

• As x approaches 4 from the right-hand side, f (x) is negative and grows large without
bound. In this case, lim f (x) does not exist but we can we write
x→4+

f (x) → −∞ as x → 4+ .

Once an understanding of left- and right-hand limits is grasped, we can talk about two-sided
limits.

Definition 2.5.1. If the left-hand limit lim f (x) and the right-hand limit
x→a−
lim f (x) both exist and equal the same real number L, then we say that the limit
x→a+
of f (x) as x → a exists and is equal to L, and we write
lim f (x) = L.
x→a
If any one of these conditions fails then we say that lim f (x) does not exist.
x→a

Recall the graph of the function f introduced at the beginning of this section.
• The two-sided limit lim f (x) does not exist because
x→2

lim f (x) 6= lim f (x).

x→2− x→2+

• The two-sided limit lim f (x) does not exist because lim f (x) does not exist.
x→4 x→4−

c
2020 School of Mathematics and Statistics, UNSW Sydney
40 CHAPTER 2. LIMITS

• Since the left-hand and right-hand limits both exist at 6 and

lim f (x) = lim f (x) = 2,

x→6− x→6+

the two-sided limit lim f (x) exists and

x→6

lim f (x) = 2.
x→6

Note that f (6) = 4 and so

lim f (x) 6= f (6).
x→6

That is, the two-sided limit of a function at a point need not equal the value of the function
at that point.

• Since the left-hand and right-hand limits at 8 both exist and equal 1, the two-sided limit
lim f (x) exists and
x→8
lim f (x) = 1.
x→8

In this case, the two-sided limit of f at 8 equals the value of the function at 8. That is,

lim f (x) = f (8).

x→8

The last two points emphasise an important feature of limits and functions. If lim f (x) = L
x→a
then f (x) approaches L as x tends to a. The function f may not be defined at a, or if it is defined
at a, we can have the situation where f (a) = L or where f (a) 6= L. In other words, the value of
f (a) does not determine the value of lim f (x). As far as the limit is concerned, all that matters is
x→a
how f behaves when it is very close to the point a.

Remark 2.5.2. In this section we have not given a proper definition for limits of the form lim f (x)
x→a−
or lim f (x). At this stage it is enough to know that such a rigorous definition exists and is similar
x→a+
to the definition for limits of the form lim f (x) (see Definition 2.2.1). This definition can be used
x→∞
to prove the basic limit results presented in the remainder of this chapter.

2.5.2 Limits and continuous functions

Using limits we can now give a definition for continuity.

Definition 2.5.3. Suppose that f is defined on some open interval containing the
point a. If lim f (x) = f (a), then we say that f is continuous at a; otherwise, we
x→a
say that f is discontinuous at a.

To illustrate, consider the function f whose graph is sketch below.

c
2020 School of Mathematics and Statistics, UNSW Sydney
2.5. LIMITS OF FUNCTIONS AT A POINT 41
f (x)

| | |

2 4 6 x

The function f is continuous at 2, discontinuous at 4 and undefined at 6. (We do not say that f is
discontinuous at 6 since 6 ∈
/ Dom(f ).)
We now extend our formal notion of continuity at a point to continuity on the real line.

Definition 2.5.4. If f : R → R is continuous at every point a in R then we say

that f is continuous everywhere.

If a function f : R → R is continuous everywhere then it has the property that its graph can be
drawn on the Cartesian plane as an unbroken curve. This coincides with intuitive notions about
continuity gained in high school.
As mentioned in Section 1.9, polynomials, rational functions, the trigonometric functions, ex-
ponentials and logarithms are continuous at every point in the respective domains. Moreover, if
f is continuous at a point a then |f | is also continuous at a (see Section 1.7 for the definition of
|f |). The proof of these facts requires a rigorous definition of the limit at a point, and, as already
mentioned, such a definition is not presented in this course. However, we will discuss aspects of
the proof in Chapter 3.
A consequence of the above definition is that both one-sided and two-sided limits of f at a point
a are easy to evaluate provided that (a) a ∈ Dom(f ) and (b) we know that f is a continuous at a.
Hence, in view the facts mentioned above,

• lim sin(x) = sin(2)

x→2

• lim (x2 − x + 1) = lim (x2 − x + 1) = 32 − 3 + 1 = 7

x→3+ x→3

• lim |x − 10| = |4 − 10| = 6.

x→4

Remark 2.5.5. Continuity is a deep property for a function to have. Contrary to the impression
that many students form from their study of functions at school, most functions are not continuous.
Three interesting functions with discontinuities are mentioned below.

• The function f : R → R, defined by

(
1 if x ∈ Q
f (x) =
0 if x ∈
/ Q,

is discontinuous at every point in its domain.

c
2020 School of Mathematics and Statistics, UNSW Sydney
42 CHAPTER 2. LIMITS

• The function f : R → R, defined by

(
x if x ∈ Q
f (x) =
−x if x ∈
/ Q,

is continuous at 0 but discontinuous everywhere else.

• It is not too difficult to construct a function f : R → R which is continuous at every

irrational point but discontinuous at every rational point.
Hence, one should not assume that the graph of a general function f : R → R is continuous at
even a majority of points in its domain. Continuous functions are quite special and deserve much
attention. We will study them in greater depth in Chapter 3.

2.5.3 Rules for limits at a point

The following proposition, which is analogous to Proposition 2.1.1, shows that limits at a point
have nice arithmetic properties.
Proposition 2.5.6. Suppose that a ∈ R and that lim f (x) and lim g(x) exist and are finite real
x→a x→a
numbers. Then
(i) lim (f (x) + g(x)) = lim f (x) + lim g(x)
x→a x→a x→a

(ii) lim (f (x) − g(x)) = lim f (x) − lim g(x)

x→a x→a x→a

(iii) lim f (x)g(x) = lim f (x) lim g(x)

x→a x→a x→a

f (x) lim f (x)

(iv) lim = x→a , provided that lim g(x) 6= 0.
x→a g(x) lim g(x) x→a
x→a

A similar set of rules hold for left- and right-hand limits.

Proposition 2.5.6 says that limits interact nicely with function addition, subtraction, multiplica-
tion and division. The next proposition shows that limits interact nicely with function composition.
Proposition 2.5.7. If lim f (x) = L and g is continuous at L then
x→a

lim g(f (x)) = g lim f (x) .
x→a x→a

If the functions f and g are continuous everywhere, then Proposition 2.5.7 implies that

lim g(f (x)) = g lim f (x)
x→a x→a

for any point a in R.

Example 2.5.8. Evaluate the limit

lim
√ cos x2 + π
6 ,
x→ π/3

justifying each step of your calculation.

c
2020 School of Mathematics and Statistics, UNSW Sydney
2.5. LIMITS OF FUNCTIONS AT A POINT 43

Solution. Since the cosine function and polynomials are continuous everywhere,
!
π π
2 2
√ cos x + 6 = cos
lim lim
√ x +6 (by Proposition 2.5.7)
x→ π/3 x→ π/3
π π
= cos + (since polynomials are continuous)
3 6
π
= cos
2
= 0.

A version of the pinching theorem for two-sided limits is given below. Similar versions exist for
left- and right-hand limits.

y = h(x)

L
y = g(x)
a
|
x

y = f (x)

Theorem 2.5.9 (The pinching theorem). Let I be an open interval containing the point a. Suppose
that f , g and h are all defined on I except possibly at the point a. If

f (x) ≤ g(x) ≤ h(x) ∀x ∈ I

and
lim f (x) = lim h(x) = L
x→a x→a

then
lim g(x) = L.
x→a

Remark 2.5.10. The pinching theorem can be used to prove the well-known limit

sin θ
lim = 1.
θ→0 θ

See the problem set of Chapter 2 for details.

We close this chapter with four examples involving limits of the form lim f (x), where f is not
x→a
necessarily continuous at the point a. Determining whether or not the limit exists, and if so what
its value is, requires special analysis of the function f .

c
2020 School of Mathematics and Statistics, UNSW Sydney
44 CHAPTER 2. LIMITS

Example 2.5.11. Determine whether or not the limit

lim x sin(1/x)
x→0

exists. If it exists, find its value.

Solution. We begin with the inequality

−1 ≤ sin θ ≤ 1

which is valid for all θ in R. In particular, if θ = 1/x then

−1 ≤ sin(1/x) ≤ 1 whenever x 6= 0.

Case 1: Suppose that x > 0. Then we have

−x ≤ x sin(1/x) ≤ x.

Since lim −x = lim x = 0, it follows from the pinching theorem that lim x sin(1/x) = 0.
x→0+ x→0+ x→0+
Case 2: Suppose that x < 0. Then

−x ≥ x sin(1/x) ≥ x.

Since lim −x = lim x = 0, it follows from the pinching theorem that lim x sin(1/x) = 0.
x→0− x→0− x→0−
Conclusion: Since the left- and right-hand limits exist and are both equal to 0, the two-sided
limit exists and
lim x sin(1/x) = 0.
x→0
This limit is illustrated graphically below.
y
y = −x y=x

y = x sin(1/x)

Example 2.5.12. Suppose that f : R → R is defined by the formula

 2
x − 4
if x 6= 2
f (x) = x − 2

0 if x = 2.

Discuss the limiting behaviour of f (x) as x approaches 2.

c
2020 School of Mathematics and Statistics, UNSW Sydney
2.5. LIMITS OF FUNCTIONS AT A POINT 45

Solution. Since we are discussing limiting behaviour, the value of f at 2 is irrelevant. All that
matters is how f (x) behaves near 2. So suppose that x 6= 2. Then

x2 − 4
f (x) =
x−2
(x − 2)(x + 2)
=
x−2
= x + 2.

The graph of f is now easy to sketch.

f (x)

4 bc

2 x

Since
lim f (x) = lim f (x) = 4,
x→2− x→2+

the two-sided limit exists and we have lim f (x) = 4. (Note that f is not continuous at 2 since
x→2
lim f (x) 6= f (2).)
x→2

Example 2.5.13. Suppose that f : R → R is defined by the formula

 2
 |x − 9|
if x 6= 3
f (x) = x−3

2 if x = 3.

Discuss the limiting behaviour of f (x) as x approaches 3.

Example 2.5.14. Suppose that

π
f (x) = sin , x 6= 0.
x
Discuss the limiting behaviour of f (x) as x approaches 0.

Solution. Since f is an odd function, we initially restrict our attention to positive values of x. Now
sin θ = 0 whenever θ ∈ {π, 2π, 3π, . . .}. If θ = π/x then we have sin(π/x) = 0 whenever

1 1 1
x = , , ,....
1 2 3
Similarly, sin θ = 1 whenever θ ∈ {π/2, 5π/2, 9π/2, 13π/2, . . .} so sin(π/x) = 1 whenever

2 2 2 2
x= , , , ....
1 5 9 13
c
2020 School of Mathematics and Statistics, UNSW Sydney
46 CHAPTER 2. LIMITS

Consequently, no matter how close x is to 0 on the right, we can always find two closer points x1
and x2 such that f (x1 ) = 0 and f (x2 ) = 1. We conclude that f has no right-hand limit at 0. Since
f is odd, it follows that f has no left-hand limit at 0. Hence lim f (x) does not exist.
x→0+
The graph of f near the origin is shown below.
f (x)
1

−1

The f (x) oscillates wildly between 1 and −1 as x tends to 0, and lim f (x) does not exist.
x→0+

2.6 Maple notes

The following Maple command is relevant to the material of this chapter:
limit(f(x),x=a); finds the (two-sided) limit of f (x) as x → a. One-sided limits can be
specified using limit(f(x),x=a,right); or limit(f(x),x=a,left);. For example,
> limit(sin(x)/x, x=0);
1
> limit(sin(x)/x, x=infinity);
0
> # the next limit needs methods from chapter 5 for hand calculations
> limit((1-cos(x))/x^2, x=0, left);

1/2
> # Maple produces results that are not in strict agreement with our definitions
as x goes to plus or minus infinity
> limit(x^2, x=-infinity);
∞
> limit(sin(x), x=-infinity);
−1 . . . 1
> limit(x^2*sin(x), x=-infinity);

undefined

c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 2 47

Problems for Chapter 2

Problems 2.1 : Limits of functions at infinity

1. [R] [V] Evaluate the following limits if they exist.
x2 − 1 2x2 + x − 1
a) lim b) lim
x→∞ x2 + 1 x→∞ x2 + 4x − 3

2x2 + 5x − 1 x5 + 5x + 1
c) lim d) lim
x→∞ x3 + x x→∞ x4 + 3
5x2 − 3x + cos 7x
e) lim f) lim sin x
x→∞ 4 + sin 2x + x2 x→∞

2. [R] Use the pinching theorem to find the following limits.

sin x cos x
a) lim b) lim
x→∞ x x→∞ x2

3. [R] [V]
√ √
a) Prove that lim ( x + 1 − x) = 0.
x→∞
p 1
b) Show that lim ( x2 + x − x) = .
x→∞ 2

Problems 2.2 : The definition of lim f (x) and

x→∞
2.3 : Proving that lim f (x) = L using the limit definition
x→∞

4. [R]

a) Write down the formal definition for the statement

lim f (x) = L.
x→∞

1
b) Evaluate lim .
x→∞2x2
c) Verify from the formal definition that your answer in (b) is correct.

5. [R]

x2 + 1
a) Evaluate lim .
x→∞ x2
x2 + 1
b) Find a real number M such that the distance between and its limit is less
x2
than 0.01 whenever x > M .
c) Suppose that ǫ > 0. Find a real number M (expressed in terms of ǫ) such that the
x2 + 1
distance between and its limit is less than ǫ whenever x > M .
x2

c
2020 School of Mathematics and Statistics, UNSW Sydney
48 CHAPTER 2. LIMITS

6. [R] [V] For each of the following, find the limit of f (x) as x tends to infinity and prove
from the definition that your answer is correct.
4x x−3
a) f (x) = b) f (x) = 2 c) f (x) = e−2x
x+7 x +3
sin x sin 3x
d) f (x) = e) f (x) = 2
x x +4

7. [X]

a) With ǫ in (0, 1), Sarah solves the inequality |f (x) − 4| < ǫ and finds that the required
x values satisfy
1
x∈ ,∞ .
ǫ
Does lim f (x) exist? Give reasons for your answer.
x→∞
b) With ǫ in (0, 1), Lyndal solves the inequality |g(x)−5| < ǫ and finds that the inequality
holds for all x satisfying
1
x∈ ,∞ .
ǫ
Does lim g(x) exist? Give reasons for your answer.
x→∞

8. [R] [V] A parcel is dropped from an aeroplane. A simple model, taking into account
gravity and air resistance, suggests that the parcel’s velocity v(t) (in metres per second)
is given by v(t) = 50(1 − e−t/5 ), where t is the number of seconds since leaving the plane.

a) Calculate the terminal velocity of the parcel (that is, find lim v(t)).
t→∞
b) The parcel never attains its terminal velocity. How long does it take to come within
1 metre per second of its terminal velocity?

Problems 2.4 : Proofs of basic limit results (MATH1141 only)

9. [X] For each question below, give reasons for your answer. [In some cases a single example
will be sufficient while in other cases a general proof will be required. As a reminder, if
f (x) → ∞ as x → ∞ then lim f (x) does not exist.]
x→∞

a) If lim f (x) and lim g(x) do not exist, can lim [f (x) + g(x)] or lim f (x)g(x) exist?
x→∞ x→∞ x→∞ x→∞
b) If lim f (x) exists and lim [f (x) + g(x)] exists, must lim g(x) exist?
x→∞ x→∞ x→∞
c) If lim f (x) exists and lim g(x) does not exist, can lim [f (x) + g(x)] exist?
x→∞ x→∞ x→∞
d) If lim f (x) exists and lim f (x)g(x) exists, does it follow that lim g(x) exists?
x→∞ x→∞ x→∞

Problems 2.5 : Limits of functions at a point

10. [R] Evaluate the following limits.

1
x2 − 4 x3 − 1 x− 13
a) lim 2x + 4 b) lim c) lim d) lim
x→3 x→2 x − 2 x→1 x − 1 x→3 x − 3

c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 2 49

11. [R] [V]

|x − 2|
a) Find the left-hand limit lim .
x−2
x→2−
|x − 2|
b) Find the right-hand limit lim .
x→2+ x − 2
|x − 2|
c) Does lim exist?
x→2 x − 2

12. [R] By finding the left- and right-hand limits first, decide whether or not each of the
following limits exist and if so find their values.
x |x2 − 4| x−4 4
a) lim b) lim c) lim d) lim
x→0 |x| x→2 x − 2 x→4 |x − 4| x→0 x

13. [R] [V]

a) Use the pinching theorem to find lim x sin x1 .

x→0
1
b) Repeat for lim x2 sin .
x→0 2x

14. [R] [V] Suppose that θ is a (positive) angle measured in radians and consider the diagram
below.
D
C

θ
O A B
The curve segment CB is the arc of a circle of radius 1 centre O.

a) Write down, in terms of θ, the length of arc CB and the lengths of the line segments
CA and DB.
b) By considering areas, deduce that sin θ cos θ ≤ θ ≤ tan θ whenever 0 < θ < π2 .
θ
c) Use the pinching theorem to show that lim = 1.
θ→0+ sin θ
sin θ
d) Deduce that lim = 1.
θ→0 θ

15. [H] Discuss the limiting behaviour of cos x1 as x → 0.

c
2020 School of Mathematics and Statistics, UNSW Sydney
50 CHAPTER 2. LIMITS

c
2020 School of Mathematics and Statistics, UNSW Sydney
51

Chapter 3

Properties of continuous functions

In the world of functions, there is a small civilised country inhabited by the race of continuous
functions. Citizens of this country, such as the polynomials and exponentials, are studied (and
perhaps even loved) by students in high schools throughout the human world. However, outside
this small civilised country is a vast untamed universe of all sorts of discontinuous functions. From
this perspective, the continuous functions are a very rare breed.
However, continuous functions are far from endangered. They are among the most useful
functions for modelling real-life phenomena, particularly when the quantity that is being modelled
occurs in nature and changes smoothly with time. For example,

• the distance from the earth to the sun at time t,

• the height of Mount Everest at time t,

• the volume of water in a dam at time t,

• the mass of an earthworm at time t, and

• the temperature at the back of a lecture theatre at time t

can all be modelled by continuous functions of the variable t.

In addition to their usefulness for modelling naturally occurring quantities, continuous functions
have important mathematical characteristics. The combination of the usefulness of continuous
functions and their rarity among the world of functions means that continuity is a deep property.
In this chapter we begin to explore the ramifications of continuity.

3.1 Combining continuous functions

(Ref: SH10 §2.4)
If two continuous functions are combined via function addition, subtraction, multiplication, division
or composition, when is the resulting function also continuous? The two next propositions answer
this question.

Proposition 3.1.1. Suppose that the functions f and g are continuous at a point a. Then f + g,
f − g and f g are continuous at a. If g(a) 6= 0 then f /g is also continuous at a.

c
2020 School of Mathematics and Statistics, UNSW Sydney
52 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS

Proof. Suppose that f and g are continuous at a. Then, by the definition of continuity at a point
(see Definition 2.5.3), lim f (x) = f (a) and lim g(x) = g(a). Therefore
x→a x→a

lim (f + g)(x) = lim (f (x) + g(x)) (by the definition of f + g)

x→a x→a
= lim f (x) + lim g(x) (by Proposition 2.5.6)
x→a x→a
= f (a) + g(a) (since f and g are continuous)
= (f + g)(a) (by the definition of f + g).

Hence f + g is continuous at a.
The proofs that the functions f − g, f g and f /g are continuous at a are similar.

Proposition 3.1.2. Suppose that f is continuous at a and that g is continuous at f (a). Then g ◦ f
is continuous at a.

Proof. Combine Definition 2.5.3 with Proposition 2.5.7.

In Chapters 1 and 2 it was stated (without proof) that the polynomials, rational functions
and trigonometric functions are continuous at every point in their respective domains. The next
example shows that the continuity of these functions can be deduced from the continuity of

• the constant functions

• the sine function, and

• the function f : R → R given by f (x) = x.

Example 3.1.3. Suppose that a is a point in R. By assuming that the constant functions, the sine
function and the function f : R → R, given by f (x) = x, are continuous everywhere, show that

(a) every polynomial is continuous at a,

(b) every rational function r is continuous at a (provided that a ∈ Dom(r)), and

Solution. (a) Every polynomial can be constructed from constant functions and the function f by
a finite number of function multiplications and additions. (For example,

x3 − 4x2 + 5 = (x × x × x) + ((−4) × x × x) + 5

for all x in R.) Since the constant functions and f are continuous at a, it follows from Proposition
3.1.1 that every polynomial is continuous at a.
(b) Every rational function r can be written in the form p/q, where p and q are polynomials.
By the result of part (a), p and q are continuous a. It follows from Proposition 3.1.1 that p/q is
also continuous at a, provided that q(a) 6= 0.
(c) Recall that
cos x = sin(π/2 − x) ∀x ∈ R.
That is, cos x = h(g(x)) where h(x) = sin x and g(x) = π/2 − x. Since the polynomial g is
continuous at a and the sine function h is continuous everywhere, the cosine function must also be
continuous at a by Proposition 3.1.2.

c
2020 School of Mathematics and Statistics, UNSW Sydney
3.2. CONTINUITY ON INTERVALS 53

Finally, since
sin x
tan x = ,
cos x
the tangent function is also continuous at a (provided that a ∈ Dom(tan)) by Proposition 3.1.1.

The next example justifies an assertion made in Section 2.5.2. We continue working under
the assumption that the constant functions and the function f : R → R, given by f (x) = x, are
continuous everywhere.

Example 3.1.4. Suppose that g : R → R is continuous at the point a and let |g| denote the
function given by
|g|(x) = |g(x)| ∀x ∈ R.
Show that |g| is continuous at the point a.

Remark 3.1.5. Throughout this section we have taken for granted that the constant functions,
the sine function and the function f : R → R, given by f (x) = x, are continuous everywhere. To
prove this, one needs a rigorous definition of the limit of a function at a point. Such a definition is
similar in flavour to the definition for the limit of a function at infinity, but will not be presented
in these notes.

Example 3.1.6. Suppose that a and b are real numbers and consider the function f : R → R given
by (
eax+b if x ≥ 0
f (x) =
cos x if x < 0.
For what values of a and b will the function f be continuous at 0?

Solution. We require that

lim f (x) = f (0) = lim f (x).
x→0− x→0+

Now

lim f (x) = lim cos x = 1, f (0) = eb and lim f (x) = lim eax+b = eb .
x→0− x→0− x→0+ x→0+

So we require that eb = 1, which implies that b = 0. There is no restriction on a.

3.2 Continuity on intervals

(Ref: SH10 §2.4)
In section 2.5.2, a definition for continuity at a point was given. We now extend this definition to
define what is meant by continuity on an interval.

Definition 3.2.1. Suppose that f is a real-valued function defined on an open

interval (a, b). We say that f is a continuous on (a, b) if f is continuous at every
point in the interval (a, b).

c
2020 School of Mathematics and Statistics, UNSW Sydney
54 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS

Definition 3.2.2. Suppose that f is a real-valued function defined on a closed

interval [a, b]. We say that

(a) f is continuous at the endpoint a if lim f (x) = f (a),

x→a+

(b) f is continuous at the endpoint b if lim f (x) = f (b), and

x→b−

(c) f is continuous on the closed interval [a, b] if f is continuous on the open

interval (a, b) and at each of the endpoints a and b.

To illustrate these definitions, consider the functions f , g and h, whose graphs are shown below.

f (x) g(x) h(x)

b bc b b

bc b b

| | | | | |
a b x a b x a b x
All three functions are defined on the interval [a, b]. We see that
• f is continuous on the open interval (a, b) and at the endpoint b;
• g is continuous at the endpoints a and b but not continuous on the open interval (a, b);
and
• h is continuous on the closed interval [a, b] (and, by implication, on the open interval (a, b)
and at both endpoints a and b).

3.3 The intermediate value theorem

(Ref: SH10 §§2.6, B.1)
The idea of this section is very simple. Suppose that a function f is continuous on some closed
interval [a, b] and that z is any real number that lies between f (a) and f (b), as illustrated below.
f (x)
f (b) b
|

z
|

f (a) b
|

| | |
a c b x

c
2020 School of Mathematics and Statistics, UNSW Sydney
3.3. THE INTERMEDIATE VALUE THEOREM 55

It is clear from the graph that one can find a real number c in [a, b] such that f (c) = z. The
intermediate value theorem, which is stated below, says that this can always be done for any
function that is continuous on [a, b].
Theorem 3.3.1 (The intermediate value theorem). Suppose that f is continuous on the closed
interval [a, b]. If z lies between f (a) and f (b) then there is at least one real number c in [a, b] such
that f (c) = z.
The intermediate value theorem is proven using the least upper bound property of the real
numbers. (It is this property that distinguishes the real numbers from the rational numbers.) We
shall not examine the proof here.
We note three important points about the theorem. First, there may be more than one number
c in [a, b] that satisfies the conclusion of the intermediate value theorem. The diagram below
illustrates this possibility.
f (x)

f (a) b
|

z
|

f (b) b
|

| |
a c1 c2 c3 b x
Second, the condition in Theorem 3.3.1 that f is continuous cannot be relaxed. For example,
consider the function f : [a, b] → R whose graph is shown below.

f (x)
f (b) b
|

b
z
|

f (a) b
|

| |
a b x

Because of the discontinuity, z is not in the range of f , even though z lies between f (a) and f (b).
That is, there is no real number c in [a, b] such that f (c) = z.
Third, the intermediate value theorem is a theorem about continuous functions defined on the
real numbers. There is no analogous theorem for continuous functions defined on the rational
numbers. For example, consider the functions f : R → R and g : Q → Q, given by

f (x) = x2 − 2 and g(x) = x2 − 2.

Both functions are continuous on their domains. Also,

f (0) = g(0) = −2 and f (2) = g(2) = 2.

c
2020 School of Mathematics and Statistics, UNSW Sydney
56 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS

However, while there√is a number c in [0, 2] such that f (c) = 0, there is no number c in [0, 2] such
that g(c) = 0 (since 2 ∈/ Dom(g)).
One of the chief applications of the intermediate value theorem is determining whether an
equation has a solution. Moreover, if a solution exists, then the theorem helps us determine an
approximate location of this solution. This is particularly helpful when the equation cannot be
solved by simple algebra.

Example 3.3.2. Show that the equation

e2x = sin x + 4 (3.1)

has at least one positive solution.

Solution. Consider the function f given by

f (x) = e2x − sin x − 4.

Now
f (0) = e0 − sin 0 − 4 = −3

while
f (π/2) = eπ + 1 − 4 ≈ 20.14.

Since f is continuous on the closed interval [0, π/2] and 0 lies between f (0) and f (π/2), the
intermediate value theorem implies that there is a real number c in [0, π/2] such that f (c) = 0.
That is,
e2c = sin c + 4

for some c in [0, π/2]. Moreover, one can check that c 6= 0. Hence the equation has at least one
positive solution c.

Note that in equation (3.1), one cannot isolate x to find an explicit solution. However, now
that it is known a solution exists in the interval [0, π/2], one could use Newton’s method to find a
good approximation to this solution.

3.4 The maximum-minimum theorem

(Ref: SH10 §§2.6, B.2)
In many situations, it is desirable to know the maximum (or minimum) value of a quantity that
varies over an interval of time. Examples include the maximum temperature in a 24 hour period
or the minimum stock price over a seven day period. Depending on the quantity and the time
interval, a maximum or minimum value may not be attained. The maximum-minimum theorem
gives conditions under which a maximum and minimum is attained. To clarify discussion, we begin
with a definition.

c
2020 School of Mathematics and Statistics, UNSW Sydney
3.4. THE MAXIMUM-MINIMUM THEOREM 57

Definition 3.4.1. Suppose that f is defined on a closed interval [a, b].

(a) We say that a point c in [a, b] is an absolute minimum point for f on [a, b]
if f (c) ≤ f (x) for all x in [a, b]. The corresponding value f (c) is called the
absolute minimum value of f on [a, b]. If f has an absolute minimum point
on [a, b] then we say that f attains its minimum on [a, b].

(b) We say that a point d in [a, b] is an absolute maximum point for f on [a, b]
if f (x) ≤ f (d) for all x in [a, b]. The corresponding value f (d) is called the
absolute maximum value of f on [a, b]. If f has an absolute maximum point
on [a, b] then we say that f attains its maximum on [a, b].

An absolute maximum point and an absolute minimum point are sometimes referred to as a
global maximum point and a global minimum point.

Example 3.4.2. Consider the functions g and h, whose graphs are illustrated below.

g(x) h(x)
4 4
|

3 3
|

2 2
|

1 1 b
|

| | | | | | | | | |

1 2 3 4 5 x 1 2 3 4 5 x

The absolute minimim and maximum points of g and h on [1, 5] are recorded in the following table.

g h
Absolute minimum points none 3
Absolute minimum value n.a. 1
Absolute maximum points 2, 4 none
Absolute maximum value 4 n.a.

The above example shows that a function f : [a, b] → R need not have an absolute maximum
point (or an absolute minimum point) on a closed interval [a, b]. However, if f is continuous on
[a, b] then such points always exist.

Theorem 3.4.3 (The maximum-minimum theorem). If f is continuous on a closed interval [a, b]

then f attains its minimum and maximum on [a, b]. That is, there exist points c and d in [a, b]
such that
f (c) ≤ f (x) ≤ f (d)
for all x in [a, b].

As with the intermediate value theorem, the least upper bound property of the real numbers is
used to prove the maximum-minimum theorem. We omit the proof.

c
2020 School of Mathematics and Statistics, UNSW Sydney
58 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS

Example 3.4.4. Suppose that the function f : R → R is given by the rule

f (x) = x2 − 10x + 16.

Find the absolute maximum and absolute minimum values of f on the interval [1, 10].

Solution. Since f (x) = (x − 2)(x − 8), the function f has zeros at 2 and 8.

f (x)

|
2 8 |

1 5 10 x

By symmetry, we know that the axis of the parabola is the line x = 5. Hence 5 is an absolute
minimum point for f on [1, 10]. Since f (5) = −9, the absolute minimum value of f on [1, 10] is −9.
It is clear from the sketch of f that at least one of the endpoints of the interval [1, 10] must be
an absolute maximum point for f on [1, 10]. Now f (1) = 7 while f (10) = 16. So 10 is an absolute
maximum point for f on [1, 10] and the corresponding maximum value of f is 16.

Remark 3.4.5. While continuity on a closed interval guarantees the existence of at least one
absolute maximum point and absolute minimum point, locating such points may not be easy. A
systematic approach to finding maximum and minimum points will be developed in Chapters 4 and
5.

Remark 3.4.6. All the hypotheses stated in the maximum-minimum theorem must be satisfied for
the conclusion of the theorem to be valid. If the interval is closed but the function is not continuous,
then an absolute maximum (or minimum) point may not exist (see Example 3.4.2). The next
example shows that if the function is continuous on the interval but the interval is not closed then
an absolute maximum (or minimum) point may not exist. It is expected that students will know
the precise hypotheses of the maximum-minimum theorem (and other important theorems, such as
the intermediate value theorem) and know how to apply the theorem correctly.

Example 3.4.7. Consider the function f : R → R given by f (x) = x + 1. Since f is a polynomial,

it is continuous on the open interval (0, 1). However, f does not have an absolute maximum point
on (0, 1). To see why, consider the graph of f on (0, 1).

f (x)

2
|

|
0 1 x

As x → 1− , f (x) → 2 but there is no real number d in (0, 1) such that f (d) = 2. By a similar
observation, f has no absolute minimum point on (0, 1).

c
2020 School of Mathematics and Statistics, UNSW Sydney
3.4. THE MAXIMUM-MINIMUM THEOREM 59

The final result of this chapter is a corollary of maximum-minimum theorem. We begin with a
definition.

Definition 3.4.8. A function f is said to be bounded on an interval I if there is

some positive number M such that |f (x)| ≤ M for all x in I.

In other words, f is bounded if the y-values of its graph lie between −M and M for some
positive number M . The sine and cosine functions are obvious examples of functions that are
bounded on the whole of the real line.

Corollary 3.4.9. If f : [a, b] → R is continuous on [a, b] then f is bounded on [a, b].

Proof. Suppose that f is continuous on [a, b]. Then the function |f | is also continuous on [a, b]
(see Example 3.1.4). By the maximum-minimum theorem, |f | attains its maximum on [a, b]. If M
denotes the absolute maximum value of |f | on [a, b] then

|f (x)| ≤ M ∀x ∈ [a, b].

Hence f is bounded.

c
2020 School of Mathematics and Statistics, UNSW Sydney
60 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS

Problems for Chapter 3

Problems 3.1 : Combining continuous functions and

3.2 : Continuity on intervals
1. [R] Suppose that f : R → R is defined by f (x) = |x|.

a) Show that f is continuous at 0.

b) Is f continuous everywhere? Give brief reasons for your answer.

2. [R] Determine at which points each function f : R → R is continuous. Give reasons.


−2x
e if x < 0
( 
e2x if x < 0
a) f (x) = b) f (x) = sin x + 1 if 0 ≤ x ≤ π/2
cos x if x ≥ 0 

2x − π if x > π/2

3. [R] Suppose that

(
x2 −16
x−4 if x 6= 4
f (x) =
k if x = 4,

where k is a real number. For which values of k (if any) will f be continuous everywhere?

4. [H] Use the pinching theorem for limits to show that if f , g and h are three functions
defined on an open interval I, such that

• f (x) ≤ g(x) ≤ h(x) for all x ∈ I,

• f (a) = g(a) = h(a) for some a ∈ I, and
• f and h are continuous at a,

then g is also continuous at a.

Problems 3.3 : The intermediate value theorem

5. [R] Show that the function f , given by f (x) = x3 − 5x + 3, has a zero in each of the
intervals [−3, −2], [0, 1] and [1, 2].

6. [R] [V] Use the intermediate value theorem to show that the equation ex = 2 cos x has at
least one positive real solution.

7. [H] Suppose that f is continuous on [0, 1] and that Range(f ) is a subset of [0, 1]. By using
g(x) = f (x) − x, prove that there is a real number c in [0, 1] such that f (c) = c.

8. [X] Suppose that f is a continuous function such that f (0) = 1 and lim f (x) = −1. Show
x→∞
that f has a zero somewhere in (0, ∞).

c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 3 61

Problems 3.4 : The maximum-minimum theorem

9. [R] In each case, determine whether or not f attains a maximum on the given interval.
Give reasons for your answer.

2
x ln x
a) f (x) = x − 4 on [−3, 5] b) f (x) = sin(e ) + 2
on [2, 4]
x − 1
c) f (x) = x2 − 4 on (−3, 5) d) f (x) = −(x2 − 4) on (−3, 5)

10. [H] [V] Suppose that f is a continuous function on R and that lim f (x) = lim f (x) = 0.
x→∞ x→−∞

a) Give an example of such a function which has both a maximum value and a minimum
value.
b) Give an example of such a function which has a minimum value but no maximum
value.
c) [X] Show that if there is a real number ξ such that f (ξ) > 0 then f attains a
maximum value on R. [Note that the maximum-minimum theorem only applies to
finite closed intervals [a, b].]

c
2020 School of Mathematics and Statistics, UNSW Sydney
62 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS

c
2020 School of Mathematics and Statistics, UNSW Sydney
63

Chapter 4

Differentiable functions

A cyclist accelerates from rest in such a way that her displacement s (in metres) from her starting
position after t seconds is given by

s = t2 ∀t ∈ [0, 3].

The corresponding displacement–time graph for the first three seconds is shown below.

displacement (m)
4 b
|

1 b
|

| |
0 1 2 3
time (s)

What is the cyclist’s instantaneous velocity when t = 1?

At present, we do not have a formula for calculating instantaneous velocity. However, we can
calculate the average velocity v of the cyclist over any time interval by using the formula

∆s
v= ,
∆t

where ∆s denotes the change in displacement corresponding to a change ∆t in time. For example,
the average velocity over the time interval [1, 1.5] is 2.5 metres per second, as calculated below:

∆s (1.5)2 − (1)2
v [1,1.5] = = = 2.5. (4.1)
∆t 1.5 − 1

Average velocities over small time intervals are recorded in the following table.

c
2020 School of Mathematics and Statistics, UNSW Sydney
64 CHAPTER 4. DIFFERENTIABLE FUNCTIONS

Time interval ∆t Average velocity

[1, 1.5] 0.5 2.5
[1, 1.4] 0.4 2.4
[1, 1.3] 0.3 2.3
[1, 1.2] 0.2 2.2
[1, 1.1] 0.1 2.1
[1, 1.01] 0.01 2.01
[1, 1.001] 0.001 2.001

The table suggests that the average velocity approaches 2 meters per second as ∆t → 0. Hence it
appears that instantaneous velocity of the cyclist when t = 1 is 2 meters per second.
One of the goals of this chapter is put these kinds of ideas on a rigorous footing in a general
context (where they can be applied to understanding rates of change of any quantity, not just
displacement). Hence we reframe these ideas in terms of functions and analytic geometry. More
precisely, we shall now show that the problem of determining instantaneous velocity is equivalent
to the problem of finding the gradient of a tangent line to the graph of a function.
To begin, the problem of calculating average velocity is equivalent to the problem of calculating
the gradient of a secant to the graph of a function. (A secant is a straight line that intersects a
curve at least twice.) For example, compare the calculation of average velocity in (4.1) with the
calculation of the gradient of the secant shown in the following diagram.

s = t2
secant

2.25 b
|

s
∆s (1.5)2 −(1)2
gradient of secant = ∆t = 1.5−1 = 2.5

1 b
|

| |

1 1.5

t
In the same way, the above table of average velocities can be reinterpreted geometrically as a table
of gradients corresponding to secants of the curve s = t2 .

Intersection points ∆t Gradient of secant

t = 1, t = 1.5 0.5 2.5
t = 1, t = 1.4 0.4 2.4
t = 1, t = 1.3 0.3 2.3
t = 1, t = 1.2 0.2 2.2
t = 1, t = 1.1 0.1 2.1

c
2020 School of Mathematics and Statistics, UNSW Sydney
4.1. GRADIENTS OF TANGENTS AND DERIVATIVES 65

The secants used in the table are illustrated over the page in Figure 4.1. The figure suggests that
as ∆t → 0, the gradient of the corresponding secant approaches the gradient of the tangent line to
the curve. Thus the problem of determining instantaneous velocity is equivalent to the problem of
determining the gradient of a tangent to the graph of a function. It is to this general problem that
we now turn.

4.1 Gradients of tangents and derivatives

(Ref: SH10 §3.1)
Suppose that a function f has a ‘smooth’ graph. In this section, we give a general approach to
calculating the gradient of tangents to f . This leads the concept of a gradient function, also known
as a derivative (since it is derived from f ), whose values give the gradient of the tangent to f at
any point. We begin with an example.

Example 4.1.1. Suppose that f : R → R is given by f (x) = x3 . Find the gradient of the tangent
to the graph of f when x = 2.

Solution. The tangent to the graph of f passes through the point P (2, 23 ). We will approximate
the gradient of the tangent with the gradient of a secant passing through the points P (2, 23 ) and
Q(2 + h, (2 + h)3 ), where h is a very small real number. (See the illustration below for the case
when h > 0.)

y
y = f (x)
secant

Q(2 + h, (2 + h)3 ) b tangent

23 P (2, 23 ) b
|

| |

2 2+h x

Using the ‘rise over run’ formula, the gradient of the secant is

(2 + h)3 − 23
.
h

Now, as h approaches 0, the point Q moves along the curve towards P , and hence the gradient of

c
2020 School of Mathematics and Statistics, UNSW Sydney
66 CHAPTER 4. DIFFERENTIABLE FUNCTIONS

s = t2

secants

tangent

2.25 b
|

1.96 b
|

s
1.69 b
|

1.44 b
|

1.21 b
|

1 b
|

| | | | | |

1 1.1 1.2 1.3 1.4 1.5

t
Figure 4.1: Using secants to determine the gradient of a tangent.

c
2020 School of Mathematics and Statistics, UNSW Sydney
4.1. GRADIENTS OF TANGENTS AND DERIVATIVES 67

the secant approaches the gradient of the tangent. In other words,

(2 + h)3 − 23
(gradient of tangent) = lim
h→0 h
8 + 12h + 6h2 + h3 − 8
= lim
h→0 h
12h + 6h2 + h3
= lim
h→0 h
= lim 12 + 6h + h2
h→0
= 12.
Thus the gradient of the tangent to f at the point 2 is 12. This gives a quantitative description of
the rate of change of f at the point 2.

There is nothing special about the point 2 in the previous example; we could calculate the
gradient of the tangent to f at any point x. The calculation would go something like this:
(x + h)3 − x3
(gradient of tangent at x) = lim
h→0 h
x + 3x2 h + 3xh2 + h3 − x3
3
= lim
h→0 h
2
3x h + 3xh 2
= lim
h→0 h
2
= lim 3x + 3xh
h→0
2
= 3x .
So the gradient of the tangent to f when x = 5 is 75 (since 3 × 52 = 75), while the gradient of the
tangent when x = −4 is 48. The gradient function of f , denoted by f ′ and given by
f ′ (x) = 3x2 ∀x ∈ R,
is called the derivative of f .
We can repeat the same procedure for functions other than f (x) = x3 . The gradient function
(or derivative) may be calculated using the limit given in the next definition.

Definition 4.1.2. Suppose that f is defined on some open interval containing the
point x. We say that f is differentiable at x if
f (x + h) − f (x)
lim
h→0 h
exists. If the limit exists, we denote it by f ′ (x). We call f ′ (x) the derivative of f at
x.

df d
Other notation for f ′ (x) includes dx (x) and dx f (x). If y = f (x) then the derivative is often
dy
denoted by y ′ or dx . The ratio
f (x + h) − f (x)
h
is called the difference quotient for f at the point x.

c
2020 School of Mathematics and Statistics, UNSW Sydney
68 CHAPTER 4. DIFFERENTIABLE FUNCTIONS
dy ∆y
Remark 4.1.3. The notation dx originates from the gradient formula ∆x , where ∆y represents a
change in y corresponding to a small change ∆x in x. (The symbol ∆ represents the Greek letter
‘delta’, which is equivalent to our capital ‘D’.) Some calculus texts prefer the use of delta notation
and write the difference quotient for a function f as
f (x + ∆x) − f (x)
.
∆x
√
Example 4.1.4. Suppose that f : [0, ∞) → R is given by f (x) = x. Find the equation of the
tangent to f when x = 16.
Solution. First we shall calculate the derivative of f (if it exists) at 16. Note that, in the working
below, the limit as h approaches 0 is difficult to evaluate immediately, so we use a standard trick
(see, for example, Subsection 2.1.4):
√ √
f (16 + h) − f (16) 16 + h − 16
=
h √ h √ √ √
16 + h − 16 16 + h + 16
= ×√ √
h 16 + h + 16
16 + h − 16
= √ √
h( 16 + h + 16)
h
= √ √
h( 16 + h + 16)
1
=√ √
16 + h + 16
1
→
8
as h → 0. So the gradient of the tangent is 1/8. Using the point-gradient formula, the equation of
the tangent is

y − y1 = m(x − x1 )
1
y − 4 = (x − 16)
8
8y − x − 16 = 0.

Example 4.1.5. Suppose that f : R → R is given by f (x) = |x|. Determine whether or not f is
differentiable at the point 0.
Solution. We calculate the difference quotient of f at 0:
f (0 + h) − f (0) |h|
=
h (h
h
h if h > 0
= −h
h if h < 0.
(
1 if h > 0
=
−1 if h < 0.

c
2020 School of Mathematics and Statistics, UNSW Sydney
4.2. RULES FOR DIFFERENTIATION 69

Because of the ‘split formula,’ we must consider the left- and right-hand limits as h approaches 0
separately. Now
f (0 + h) − f (0)
lim = −1
h→0− h
while
f (0 + h) − f (0)
lim = 1.
h→0+ h
Since the left- and right-hand limits are different, the two-sided limit doesn’t exist and hence f is
not differentiable at 0.
Note that this conclusion makes sense geometrically, since the graph of f has a ‘vertex’ at 0
and there is no unique tangent that touches the graph at this point.

Example 4.1.6. Suppose that f : R → R is defined by

(
x sin x1 if x 6= 0
f (x) =
0 if x = 0.

Determine whether or not f is differentiable at 0.

4.2 Rules for differentiation

(Ref: SH10 §§3.2, 3.5, 3.6)
So far we have calculated derivatives working directly from the difference quotient. In this section we
see that there are fast ways of calculating derivatives without referring to the difference quotient
and limits at all. This fast method is part of what is known as ‘the calculus.’ We begin with
the derivatives of some standard functions, and then show how derivatives of other functions are
obtained by applying some simple rules.
The following table gives the derivatives for some standard functions.

f (x) f ′ (x)
C, where C is a constant 0
xn , where n is a positive integer nxn−1
sin x cos x
ex ex

c
2020 School of Mathematics and Statistics, UNSW Sydney
70 CHAPTER 4. DIFFERENTIABLE FUNCTIONS

With these basic functions, it is possible to construct many other functions via function addition,
subtraction, multiplication and division. The next theorem describes what happens to the resulting
derivatives.

Theorem 4.2.1 (Rules for differentiation). Suppose that f and g are differentiable at x. Then
f + g, f − g and f g are differentiable at x. If g(x) 6= 0 then fg is also differentiable at x. Moreover,

(i) (f + g)′ (x) = f ′ (x) + g ′ (x),

(ii) (C.f )′ (x) = C.f ′ (x), where C is a constant,

(iii) (f g)′ (x) = f ′ (x)g(x) + f (x)g ′ (x), and

′
f f ′ (x)g(x) − f (x)g ′ (x)
(iv) (x) = , provided that g(x) 6= 0.
g g(x)2
Rules (iii) and (iv) are usually called the product and quotient rules. A familiar way of express-
ing these rules is
d(uv) du dv
=v +u
dx dx dx
and
d u v du − u dv
= dx 2 dx ,
dx v v
where u and v are both functions of x.
Given two functions f and g, a new function f ◦ g may be constructed by function composition.
The next theorem describes the derivative of this new function.

Theorem 4.2.2 (The chain rule). Suppose that g is differentiable at the point x and f is differen-
tiable at the point g(x). Then f ◦ g is differentiable at x and

(f ◦ g)′ (x) = f ′ (g(x))g ′ (x). (4.2)

You are probably more familiar with the chain rule expressed in the following way: if y = f (u)
and u = g(x) then
dy dy du
= . (4.3)
dx du dx
While formula (4.3) looks simpler than (4.2), it does not specify the relationship between the
variables or the points at which each derivative is to be evaluated. It should therefore be used with
care.
The next example illustrates how the chain rule formula (4.2) is applied.

Example 4.2.3. Suppose that y = (sin x + 1)7 . Find the derivative of y at the point x.

Solution. Suppose that f (x) = x7 and g(x) = sin x + 1. Then f ′ (x) = 7x6 and g′ (x) = cos x. Then
y = (f ◦ g)(x). By the chain rule, the derivative of y at x is given by

(f ◦ g)′ (x) = f ′ (g(x))g ′ (x)

= 7(g(x))6 × cos x
= 7(sin x + 1)6 cos x.

c
2020 School of Mathematics and Statistics, UNSW Sydney
4.3. PROOFS OF RESULTS IN SECTION 4.2 71

As mentioned at the beginning of this section, derivatives of many functions can be obtained
from those of a few basic functions by the application of simple rules. The next example illustrates
this point.
Example 4.2.4. Show that
d m
(a) x = mxm−1 , where m is an integer;
dx
d
(b) cos x = − sin x; and
dx
d
(a) tan x = sec2 x.
dx

4.3 Proofs of results in Section 4.2

In Section 4.2 we stated two theorems and gave a table of derivatives. In this section we prove
some of these results. We begin with the derivative of the sine function, using the standard result
that
sin θ
lim = 1. (4.4)
θ→0 θ
For a geometric argument for this limit, see Question 14 from the problems for Chapter 2.
Proposition 4.3.1. If f (x) = sin x then f ′ (x) = cos x for all real x.
Proof. The difference quotient for the sine function and the angle sum formula for sine gives
sin(x + h) − sin x sin x cos h + cos x sin h − sin x
=
h h
cos h − 1 sin h
= sin x + cos x . (4.5)
h h
Now cos2 h + sin2 h = 1 so
cos h − 1 cos2 h − 1
=
h h(cos h + 1)
− sin2 h
=
h(cos h + 1)
sin h sin h
=− ,
h cos h + 1
and by (4.4) we have
sin h sin h
− → −1 × 0 as h → 0.
h cos h + 1
If we combine this with (4.5) then
sin(x + h) − sin x cos h − 1 sin h
lim = sin x lim + cos x lim
h→0 h h→0 h h→0 h
= sin x × 0 + cos x × 1
= cos x.

Thus f ′ (x) = cos x.

c
2020 School of Mathematics and Statistics, UNSW Sydney
72 CHAPTER 4. DIFFERENTIABLE FUNCTIONS

The derivative of ex will be dealt with in Chapter 9. The derivative of constant functions can
be easily computed from Definition 4.1.2.
Suppose that f is given by f (x) = xn , where n is a positive integer. The derivative of f can be
computed using the same method as Example 4.1.1, with the assistance of the binomial theorem.
Alternatively, one can use mathematical induction and the product rule for differentiation.

Proof of the product rule for differentiation. Suppose that f and g are differentiable at the point
x. The difference quotient of f g at x gives

(f g)(x + h) − (f g)(x)
h
f (x + h)g(x + h) − f (x)g(x)
=
h
f (x + h)g(x + h) − f (x)g(x + h) + f (x)g(x + h) − f (x)g(x)
=
h
f (x + h) − f (x) g(x + h) − g(x)
= g(x + h) + f (x) . (4.6)
h h
Now g is differentiable at x, so g is also continuous at x (see Theorem 4.5.1). Hence g(x+h) → g(x)
as h → 0. Therefore (4.6) implies that

(f g)(x + h) − (f g)(x)
→ g(x)f ′ (x) + f (x)g ′ (x)
h
as h → 0.

Proofs of other differentiation rules are found in most undergraduate calculus textbooks.

4.4 Implicit differentiation

(Ref: SH10 §3.7)
In Section 1.8 we introduced functions that are defined by an equation relating the variables x and
y. Using the chain rule, one can calculate the derivative of such functions without first expressing
y explicitly as a function of x. We begin with a simple example to illustrate the principle. The
point to keep in mind is that y is a function of x.

Example 4.4.1. Suppose that y is a function of x, and that y and x are related by the formula

x2 + y 2 = 1.
dy
Calculate dx .

Solution. We begin with the equation

x2 + y 2 = 1

and differentiate both sides with respect to x to obtain

d 2 d 2 d
(x ) + (y ) = (1). (4.7)
dx dx dx
c
2020 School of Mathematics and Statistics, UNSW Sydney
4.4. IMPLICIT DIFFERENTIATION 73
d d
Now clearly, dx (x2 ) = 2x and dx (1) = 0. To simplify the central term of (4.7), we regard y as a
function of x and suppose that g(y) = y 2 . The chain rule gives

d 2 dg
(y ) =
dx dx
dg dy
=
dy dx
dy
= 2y .
dx
So continuing from (4.7) we have
dy
2x + 2y = 0.
dx
dy
By isolating dx we obtain

dy −2x
=
dx 2y
x
=−
y

as required.
(We note in passing that this answer coincides with the derivative obtained by first expressing
y as a function of x and then differentiating. In particular, if

y = (1 − x2 )1/2

then
dy 1 x x
= (1 − x2 )−1/2 (−2x) = − =−
dx 2 (1 − x2 )1/2 y
as above.)

In general, if g is a function of y and y is a function of x then

d dy
g(y) = g′ (y) (4.8)
dx dx
by the chain rule. The use of this formula is illustrated below.

Example 4.4.2. Suppose that y is a function of x, implicitly related by the equation

y 4 + x3 − x2 e3y = 4. (4.9)

Find the equation of the tangent to the corresponding curve at the point where (x, y) = (2, 0).
d n
In Section 4.2 we saw that dx x = nxn−1 whenever n is an integer. A similar formula holds
when n is any rational number. This can be easily proved using implicit differentiation.

Proposition 4.4.3. Suppose that q is a rational number. Then

d q
x = qxq−1 .
dx
c
2020 School of Mathematics and Statistics, UNSW Sydney
74 CHAPTER 4. DIFFERENTIABLE FUNCTIONS
m
Proof. Since q is a rational number, there are integers m and n such that q = n and n 6= 0. Now
if y = xq then y = xm/n and taking the nth power of both sides gives
y n = xm .
Differentiating both sides with respect to x yields
dy
ny n−1 = mxm−1 .
dx
Hence
dy mxm−1
=
dx ny n−1
m xm−1
=
n xq(n−1)
= qx(m−1)−qn+q
= qxq−1
as required.

4.5 Differentiation, continuity and split functions

(Ref: SH10 §3.1)
Not every function that is continuous at a point a is differentiable at a. (Consider, for example,
the function f , given by f (x) = |x|, at the point 0.) However, we do have the following result.
Theorem 4.5.1. If f is differentiable at a then f is continuous at a.
Proof. Suppose that f is differentiable at the point a. Then
f (a + h) − f (a)
lim
h→0 h
exists. To show that f is continuous at a, we need to show that
lim f (x) = f (a),
x→a

or equivalently, that
lim f (a + h) = f (a).
h→0
Now

f (a + h) − f (a)
lim f (a + h) − f (a) = lim ×h
h→0 h→0 h

f (a + h) − f (a)
= lim × lim h (4.10)
h→0 h h→0
′
= f (a) × 0
= 0.
(Note that we can separate the limits in (4.10) because we know that the limit of the difference
quotient exists.) Hence
lim f (a + h) = f (a),
h→0
proving the theorem.

c
2020 School of Mathematics and Statistics, UNSW Sydney
4.5. DIFFERENTIATION, CONTINUITY AND SPLIT FUNCTIONS 75

The contrapositive form of the theorem is stated below.

Corollary 4.5.2. If f is not continuous at a then it is not differentiable at a.
Remark 4.5.3. Differentiability is a much stronger property than continuity. While every differen-
tiable function is continuous, there exist functions that are continuous everywhere but differentiable
nowhere. In fact, functions that are differentiable everywhere are a very rare breed, even among the
continuous functions. An example of a function that is continuous everywhere but differentiable
nowhere is the Weierstrass function whose graph is shown below.
y

The next function studied is an example of a ‘split function’. Whether or not the function is
differentiable at the ‘split point’ can be determined by calculating left- and right-hand limits of the
difference quotient.
Example 4.5.4. Suppose that f : R → R is defined by
(
sin x if x ≥ 0
f (x) = 2
x + bx + c if x < 0,
where b and c are real numbers. Find all possible values of b and c such that f is (a) continuous at
0 and (b) differentiable at 0.
Solution. (a) For f to be continuous at 0, we require that
lim f (x) = f (0) = lim f (x).
x→0− x→0+

Now f (0) = sin 0 = 0 while

lim f (x) = c and lim f (x) = 0.
x→0− x→0+

Hence f is continuous if and only if c = 0 and b ∈ R.

(b) If f is differentiable at 0 then it is continuous at 0 and hence c = 0 by part (a). Also, for
differentiability we require that
f (0 + h) − f (0) f (0 + h) − f (0)
lim = lim .
h→0− h h→0+ h
Now,
f (0 + h) − f (0) sin h − sin 0 sin h
lim = lim = lim =1
h→0+ h h→0 + h h→0 + h
while
f (0 + h) − f (0) h2 + bh − 0
lim = lim = lim h + b = b.
h→0− h h→0− h h→0−
Hence we require that b = 1. So f is differentiable at 0 if and only if b = 1 and c = 0. The graph
of the corresponding function is sketched below.

c
2020 School of Mathematics and Statistics, UNSW Sydney
76 CHAPTER 4. DIFFERENTIABLE FUNCTIONS
f (x)

| | |
−1 2π x

An important application of split functions is the theory of splines. A spline is a function

defined piecewise by polynomials. They are frequently used in computer graphics and elsewhere
because of their simplicity and their capacity to approximate complex curves. The following is an
example of a spline.

Example 4.5.5.
(
(x − 1)2 − 1 if − 2 ≤ x < 0
f (x) =
1 − (x − 1)2 if 0 ≤ x ≤ 2.

In this example, each of the functions p(x) = (x − 1)2 − 1 and q(x) = 1 − (x − 1)2 have had
their domains restricted and then ‘glued’ together at the point 0.
If the constituent functions are continuous and differentiable in some interval containing the
point a, then the following theorem tells us when the corresponding split function is differentiable.

Theorem 4.5.6. Suppose a is a fixed real number and the function f is defined by
(
p(x) if x ≥ a
f (x) =
q(x) if x < a,

where p(x) and q(x) are continuous and differentiable in some open interval containing a. Then if
f is continuous at a and p′ (a) = q ′ (a), then f is differentiable at x = a.

This theorem in particular can be applied to splines since the constituent functions are polyno-
mials.

Example 4.5.7.
(
sin x if x ≥ 0
f (x) =
x2 + x if x < 0.

The functions sin x and x2 + x are continuous and differentiable everywhere. We saw in Example
4.5.4 that f is continuous at x = 0. Also, with p(x) = sin x and q(x) = x2 + x, we have p′ (0) =
cos 0 = 1 and q ′ (0) = 1 so f is differentiable at x = 0.

c
2020 School of Mathematics and Statistics, UNSW Sydney
4.6. DERIVATIVES AND FUNCTION APPROXIMATION 77

4.6 Derivatives and function approximation

(Ref: SH10 §4.11)
The last three sections of this chapter are devoted to some applications of the derivative.
Suppose that a function f is differentiable at a. Consider its graph and the tangent to f at a,
shown below.
y

f (a) b

|
a x

The equation of the tangent to f at a may be calculated using the point-gradient formula:
y − f (a) = f ′ (a)(x − a)
y = f ′ (a)(x − a) + f (a).
Observe in the above diagram that the tangent line is close to the graph of f for all points that are
close to a. In other words,
f (x) ≈ f ′ (a)(x − a) + f (a) (4.11)
when x is close to a.
Formula (4.11) reflects the fact that every differentiable function can be locally approximated
by a linear function. We use this principle in the following example.
√
Example 4.6.1. Estimate 9.001 without using a calculator.
√
Solution. Suppose that f (x) = x. Since 9.001 is close to 9 and f is a continuous function, we
could use the approximation √
9.001 = f (9.001) ≈ f (9) = 3. (4.12)
However, we can do better than this. We shall approximate f with a linear function at the
point 9 using (4.11). Since f ′ (9) = 1/6 we have
f (x) ≈ (x − 9)/6 + 3
whenever x is close to 9. When x = 9.001 this gives
√
9.001 = f (9.001) ≈ (9.001 − 9)/6 + 3 = 3 + 1/6000. (4.13)
The table below gives the error involved in each approximation.
Approximation Error
Approximation (4.12) 1.667−4
Approximation (4.13) 4.637−9
Clearly (4.13) is the superior approximation.

c
2020 School of Mathematics and Statistics, UNSW Sydney
78 CHAPTER 4. DIFFERENTIABLE FUNCTIONS

4.7 Derivatives and rates of change

(Ref: SH10 §3.4)
Many physical processes involve quantities (such as temperature, volume, concentration, velocity)
that change with time. If Q is a quantity that varies with time, then the derivative dQdt gives the
rate of change of that quantity with respect to time. The chain rule is a major tool for answering
questions about rates of quantities which are related.

Example 4.7.1. A spherical balloon is being inflated and its radius is increasing at a constant
rate of 6 mm/sec. At what rate is its volume increasing when the radius of the balloon is 20 mm?

Solution. Suppose that V (t) is the volume of the balloon and r(t) is its radius at time t. We are
told that dr dV
dt = 6 and we need to find dt when r = 20. By the chain rule,

dV dV dr
= .
dt dr dt

Now V = 43 πr 3 so dV
dr = 4πr 2 . Hence

dV
= 4πr 2 × 6.
dt

When r = 20,
dV
= 24π(20)2 = 9600π.
dt
So the volume is increasing at a rate of 9600π mm3 /sec when the radius is 20 mm.

The above example illustrates an approach to solving such problems.

1. Define variables for the quantities involved.

2. Write down what is known in terms of these variables and their derivatives.

3. Write down what you need to find in terms of the these variables and their derivatives.

4. Write down anything else you know that relates the variables (for example, a volume or
area formula).

5. Use the chain rule (or implicit differentiation) to find the relevant derivative.

Sometimes drawing a diagram helps in the first few steps.

The next example follows this procedure.

Example 4.7.2. A point P moves to the right along the positive x-axis at a constant rate of 5
cm per second, and a point Q moves up the positive y-axis at a constant rate of 10 cm per second.
How fast is the distance between them changing when OP = 30 cm and OQ = 40 cm?

c
2020 School of Mathematics and Statistics, UNSW Sydney
4.8. LOCAL MAXIMUM, LOCAL MINIMUM AND STATIONARY POINTS 79

4.8 Local maximum, local minimum and stationary points

(Ref: SH10 §4.3)
In Example 3.4.4, we determined the absolute maximum and minimum points of a continuous
function over a closed interval. While the existence of these extreme points is given by the max-
min theorem, the theorem provides no systematic way of locating these points. In this section we
begin to develop a systematic approach to locating maxima and minima. A complete approach will
be presented in the next chapter.
We begin by defining what is meant by a local minimum point c. This is a point such that for
all x sufficiently close to c, f (c) ≤ f (x). A more precise statement is below.

| | | x
c−h c c+h

Local minimum point

Definition 4.8.1. Suppose that f is defined on some interval I. We say that a point
c in I is a local mininum point if there is a positive number h such that f (c) ≤ f (x)
whenever x ∈ (c − h, c + h) and x ∈ I. We say that a point d in (a, b) is a local
maximum point if there is a positive number h such that f (x) ≤ f (d) whenever
x ∈ (d − h, d + h) and x ∈ I.

The following theorem will be familiar from high school.

Theorem 4.8.2. Suppose that f is defined on (a, b) and has a local maximum or minimum point
at c for some c in (a, b). If f is differentiable at c then f ′ (c) = 0.

Proof. Suppose that f is a local maximum at c. To show that f ′ (c) = 0 we consider the difference
quotient
f (c + h) − f (c)
.
h
Since f is a maximum at c, f (c + h) ≤ f (c) for h sufficiently close to 0. So if h is sufficiently small
and positive,
f (c + h) − f (c)
≤ 0,
h
while if h is sufficiently small and negative,
f (c + h) − f (c)
≥ 0.
h
c
2020 School of Mathematics and Statistics, UNSW Sydney
80 CHAPTER 4. DIFFERENTIABLE FUNCTIONS

That is,
f (c + h) − f (c)
lim ≤0
h→0+ h
while
f (c + h) − f (c)
lim ≥ 0.
h→0− h
Hence
f (c + h) − f (c)
lim =0
h→0 h
which is equivalent to saying that f ′ (c) = 0.
The proof when c is a local minimum is similar and will be omitted.

The set of points where the derivative of a function is zero is important. In particular, tangents
to the graph of the function at these points are horizontal lines. This motivates the following
definition.

Definition 4.8.3. If a function f is differentiable at a point c and f ′ (c) = 0 then c

is called a stationary point of f .

Example 4.8.4. Suppose that f : R → R is given by f (x) = 4x5 − 5x4 − 40x3 − 2. Find all the
stationary points of f .

Solution. We begin by differentiating f and factorising the derivative:

f ′ (x) = 20x4 − 20x3 − 120x2

= 20x2 (x2 − x − 6)
= 20x2 (x + 2)(x − 3).

Hence the solutions to f ′ (x) = 0 are x = −2, 0, 3. So the points −2, 0 and 3 are the stationary
points of f .

The graph of the function f appearing in Example 4.8.4 is shown below.

f (x)
200
|

| |

−2 3 x
−200
|

c
2020 School of Mathematics and Statistics, UNSW Sydney
4.9. MAPLE NOTES 81

It is clear that −2 is a local maximum point of f and that 3 is a local minimum point of f . However,
0 is neither a local minimum point nor local maximum point, and none of these points are global
maximum points or global minimum points. So while the local maxima and minima of a function
occur where f ′ is zero, it does not follow that any point c satisfying f ′ (c) = 0 is a local maximum
point or local minimum point of f .
Tools for identifying whether a stationary point is a local maximum, local minimum or neither
are developed in the next chapter using an important result called the mean value theorem.

4.9 Maple notes

df
The Maple diff command is used to compute derivatives: diff(f(x),x); calculates and
n
dx
d f
diff(f(x),x$n); calculates . For example,
dxn
> # Note Maple use of csc to denote function we usually call cosec
> diff(sqrt(csc(x^12-53*x^5-1)),x);
p
− 21 csc (x12 − 53 x5 − 1) cot x12 − 53 x5 − 1 12 x11 − 265 x4
> diff(x^3*exp(5*x),x);

3 x2 e5 x + 5 x3 e5 x
> diff(%,x);
6 xe5 x + 30 x2 e5 x + 25 x3 e5 x
> f2ndDeriv:=diff(x^3*exp(5*x),x$2);
f2ndDeriv := 6 xe5 x + 30 x2 e5 x + 25 x3 e5 x

c
2020 School of Mathematics and Statistics, UNSW Sydney
82 CHAPTER 4. DIFFERENTIABLE FUNCTIONS

Problems for Chapter 4

Problems 4.1 : Gradients of tangents and derivatives

1. [R] Using the definition of the derivative, show that:

a) if f (x) = x2 then f ′ (x) = 2x;

b) if f (x) = x3 then f ′ (x) = 3x2 ;
1
c) if f (x) = then f ′ (x) = −1
x x2
;
√ 1
d) [H] if f (x) = x then f ′ (x) = √
2 x
.

Problems 4.2 : Rules for differentiation

2. [R] Find the derivative in each case.

a) f (x) = 5(x4 + 3x7 ) b) g(x) = (x4 − 2x)(4x2 + 2x + 5)
y2
c) h(y) = d) f (x) = x(x2 − 4)1/2
y3 + 8
√
e) f (t) = t/ t2 − 4 f) g(y) = sin 3y − 3 cos2 2y
√
g) g(x) = x4 e−x h) f (x) = (x2 + 1) ln x3 + 1
i) f (x) = ln(etan x ) j) f (x) = ln(cos x)

3. [H] Suppose that f : R → R is defined by f (x) = x|x| for all x in R.

f (0 + h) − f (0)
a) If it exists, evaluate lim .
h→0+ h
f (0 + h) − f (0)
b) If it exists, evaluate lim .
h→0 − h
c) State the value of f ′ (0) or explain why f is not differentiable at 0.

4. [R] [V] Determine at which points each function f is (i) differentiable; (ii) continuous.
(
sin x if x ≤ 0 x3 − 6x + 4
a) f (x) = |x| b) f (x) = c) f (x) = 2
x if x > 0 x + 4x + 4

5. [R] Sketch the graph of f , where f (x) = x1/3 . Is f differentiable at 0? Give reasons.

6. [X] Prove that the function f : R → R, given by

(
x2 sin x1 if x 6= 0
f (x) =
0 if x = 0,

is continuous and differentiable everywhere, but that f ′ is not continuous at 0.

c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 4 83

7. [X] The function f is differentiable at a. Find

f (a + ph) − f (a − ph)
lim .
h→0 h

8. [R] (An exercise on notation.) Suppose that f (x) = x + cos 2x. Write down
a) f (x + 17π) b) f ′ (x + nπ) c) f (2 − x2 )
d
d) f ′ (2 − x2 ) e) dx f (2 − x2 ).

Problems 4.4 : Implicit differentiation

dy
9. [R] Find in terms of x and y if
dx
√
a) x3 + y 3 = xy b) x2 − xy + y 2 = 6.

dy
10. [R] Find for the curve x4 + y 4 = 16. Sketch the graph of the curve.
dx

11. [R] [V] Find the equation of the line tangent to the curve x3 + y 3 = 3(x + y) at the point
(1, 2).

Problems 4.5 : Differentiation, continuity and split functions

12. [R] Suppose that a and b are real numbers. Find all values of a and b (if any) such that
the functions f and g, given by
( (
ax + b if x < 0 ax + b if x < 0
a) f (x) = and b) g(x) = ,
sin x if x ≥ 0 e2x if x ≥ 0
are (i) continuous at 0 and (ii) differentiable at 0.

13. [H] The function f : R → R is is defined by

(√ √
x sin x if x ≥ 0
f (x) =
ax + b if x < 0,

where a and b are real numbers. Find all values of a and b (if any) such that f is differ-
entiable at 0.

Problems 4.6 : Derivatives and function approximation

√
14. [R] [V] Suppose that f (x) = 3
x.
a) Without using a calculator, give a rough estimate of f (8.01).
b) i) Find the equation of the tangent to f at the point (8, 2).
ii) Use your answer to part (i) to find a different approximation for f (8.01).
c) Using a calculator, determine the error for the approximation in (a) and in (b). Which
approximation is better?

c
2020 School of Mathematics and Statistics, UNSW Sydney
84 CHAPTER 4. DIFFERENTIABLE FUNCTIONS

Problems 4.7 : Derivatives and rates of change

15. [R] At a certain instant the side length of an equilateral triangle is a cm and this length
is increasing at r cm/s. How fast, in cm2 /s, is the area increasing?

16. [R] [V] A 5 m ladder is leaning against a vertical wall. Suppose that the bottom of the
ladder is being pulled away from the wall at a rate of 1 m/s. How fast is the area of the
triangle underneath the ladder changing at the instant that the top of the ladder is 4 m
from the floor?

17. [R] A spherical balloon is to be filled with water so that there is a constant increase in
the rate of its surface area of 3 cm2 /s.
(The surface area A and volume V of a sphere of radius r is given by A = 4πr 2 and
V = 43 πr 3 .)

a) Find the rate of increase in the radius when the radius is 3 cm.
b) Find the volume when the volume is increasing at a rate of 10 cm3 /s.

18. [R]

a) A container in the shape of a right circular cone, of semi-vertical angle tan−1 ( 12 ), is

placed vertex downwards with its axis vertical.

θ = tan−1 ( 12 )
θ

Water is poured in at the rate of 10 mm3 /s. Find the rate at which the depth, h mm,
is increasing when the depth of water in the cone is 50 mm.
b) [H] The cone is filled to a depth of 100 mm and pouring is then stopped. A hole is
then opened
√ at the vertex of the cone and water flows out of the hole at the rate
of 50π h mm3 per second, where h is the depth at time t. Show that it takes 200
seconds to empty the cone.

c
2020 School of Mathematics and Statistics, UNSW Sydney
85

Chapter 5

The mean value theorem and

applications

The mean value theorem is one of the most important results for establishing the theoretical
structure of calculus. Several results that you are familiar with from high school calculus are based
on the mean value theorem. Applications of the mean value theorem include
• identifying where a function is increasing or decreasing,
• identifying different types of stationary points,
• determining how many zeros a polynomial has,
∞
• evaluating limits which are indeterminate forms of type ∞ and 00 ,
• proving useful inequalities and
• estimating errors in approximations.
We begin the chapter by introducing the theorem.

5.1 The mean value theorem

(Ref: SH10 §4.1)
The basic idea behind the mean value theorem is straightforward. Consider a differentiable function
f and choose an interval [a, b]. We construct a chord AB joining the points (a, f (a)) and (b, f (b))
as shown in Figure 5.1. Because the function is continuous and differentiable, there is a point c in
(a, b) such that tangent to the graph of f at c lies parallel to the chord AB.
Since the tangent and the chord are parallel, their gradients are equal. The gradient of the
tangent at c is f ′ (c) while the gradient of the chord is given by
f (b) − f (a)
b−a
(using the ‘rise over run’ formula). Hence
f (b) − f (a)
= f ′ (c).
b−a
A precise statement of the theorem is given below.

c
2020 School of Mathematics and Statistics, UNSW Sydney
86 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
f (x)
C

f (b) B
|

f (a) A
|

| | |
a c b x

Figure 5.1: The mean value theorem – AB is parallel to the tangent at C.

Theorem 5.1.1 (The mean value theorem). Suppose that f is continuous on [a, b] and differentiable
on (a, b). Then there is at least one real number c in (a, b) such that

f (b) − f (a)
= f ′ (c).
b−a
The proof of the theorem will be presented in the next section. For now we illustrate the
theorem with a simple example.
√
Example 5.1.2. Suppose that f : [1, 8] → R is given by f (x) = 3 x. Find a number c in (1, 8)
that satisfies the conclusions of the mean value theorem for f on [1, 8].

Solution. Note that f is continuous on [1, 8] and differentiable on (1, 8). By the mean value theorem
there is a real number c in (1, 8) such that

f (8) − f (1)
= f ′ (c).
8−1

Now f ′ (x) = 31 x−2/3 so the above equation becomes

2−1 1
= √ .
7 3 2
3 c
Rearranging to find c gives
√
3 7
c2 =
3
r
343
c= .
27

Since c ≈ 3.56 it is clear that c ∈ (1, 8). This is illustrated graphically below.

c
2020 School of Mathematics and Statistics, UNSW Sydney
5.2. PROOF OF THE MEAN VALUE THEOREM 87
√
3
x

2 b

|
b

1 b
|

| | |

1 c 8 x

5.2 Proof of the mean value theorem

(Ref: SH10 §4.1)
In this section we prove the mean value theorem. The proof is part of MATH1141 only, so
MATH1131 students may want to jump straight to Section 5.3.
To prove the mean value theorem, we begin by considering the special case when f (a) = f (b).
This case is known as Rolle’s theorem and is illustrated in Figure 5.2.

Theorem 5.2.1. Suppose that g is continuous on [a, b] and differentiable on (a, b). Suppose also
that g(a) = g(b) = 0. Then there is a real number c in (a, b) such that g ′ (c) = 0.

Proof. The proof considers three cases.

Case 1: Suppose that g is the constant function given by g(x) = 0. Then g ′ (c) = 0 for every c
in (a, b) (see Section 4.2).
Case 2: Suppose that there is a point d in (a, b) such that g(d) > 0. By the max-min theorem
(Theorem 3.4.3), g attains a maximum value at some point c in [a, b]. Moreover, c cannot be a or
b since g(d) > g(a) = g(b) = 0. Hence c lies in (a, b), and since g is differentiable on (a, b), we have
g ′ (c) = 0 (see Theorem 4.8.2).
Case 3: Suppose that g(x) ≤ 0 for all x in [a, b] and that g is not constant on [a, b]. Then g
attains a minimum at a point c in (a, b), and similarly to Case 2 one can show that g ′ (c) = 0.

c
2020 School of Mathematics and Statistics, UNSW Sydney
88 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
g(x)

A B
a c b x

Figure 5.2: Rolle’s theorem – the tangent at C is horizontal.

From here it is not hard to see how to prove the mean value theorem. Consider the graph of
the function f in Figure 5.1 and ‘subtract’ the chord AB. The result will look something like the
graph of g in Figure 5.2, to which we can apply Rolle’s theorem and obtain a horizontal tangent
at C. We then ‘add’ the chord back again to obtain the tangent to f of Figure 5.1. This rough
geometric argument is made rigorous below.

Proof of the mean value theorem. Suppose that f is continuous on [a, b] and differentiable on (a, b).
We consider the function g given by

f (b) − f (a)
g(x) = f (x) − (x − a) + f (a) .
b−a
(The part in square brackets is taken from the equation of the chord AB in Figure 5.1.) One can
check that g is continuous on [a, b], differentiable on (a, b) and that g(a) = g(b) = 0. By Rolle’s
theorem there is a c in (a, b) such that g ′ (c) = 0; that is, such that

′ f (b) − f (a)
f (c) − = 0.
b−a
If we rearrange this equation then we obtain
f (b) − f (a)
f ′ (c) =
b−a
as required.

5.3 Proving inequalities using the mean value theorem

Our first application of the mean value theorem will be to prove some useful inequalities. We begin
with an example.
Example 5.3.1. By using the mean value theorem, show that ln x < x − 1 whenever x > 1.
Solution. Suppose that x > 1 and consider the closed interval [1, x]. We define a function f :
[1, x] → R by f (t) = ln t. Now f is continuous on [1, x] and differentiable on (1, x). So we may
apply the mean value theorem to f on the interval [1, x]. Now f ′ (t) = 1/t so
f (x) − f (1) 1
=
x−1 c

c
2020 School of Mathematics and Statistics, UNSW Sydney
5.3. PROVING INEQUALITIES USING THE MEAN VALUE THEOREM 89

for some c in (1, x) by the mean value theorem. That is,

ln x 1
=
x−1 c
for some c between 1 and x. Since c > 1 we have 1/c < 1 and hence
ln x
< 1.
x−1
Rearranging the inequality gives ln x < x − 1 as desired.
The technique used in the above example is as follows. Consider a function f that is continuous
on the interval [a, x] and differentable on the interval (a, x). The mean value theorem gives
f (x) − f (a)
= f ′ (c) (5.1)
x−a
for some c between a and x. If f ′ (c) < M then
f (x) − f (a)
<M
x−a
and hence
f (x) < M (x − a) + f (a). (5.2)
whenever x > a.
Sometimes it is desirable for the inequality to go the other way. In this case we find m such
that f ′ (c) > m. Then, instead of (5.2), we obtain
f (x) > m(x − a) + f (a).
Often, the tricky part of the process is determining which function f will give the desired
inequality.
Example 5.3.2. Use the mean value theorem to show that
√ x
x+4−2 <
4
whenever x > 0.
Solution. Fix positive x. The first step is to identify a good choice of f . If we rewrite the inequality
as √ x
x+4< +2
4
then we have a similar form
√ to inequality (5.2). This suggests that we should define f : [0, x] → R
by the formula f (t) = t + 4. Then f is continuous on [0, x] and differentiable on (0, x). An
application of the mean value theorem gives
f (x) − f (0) 1
= √
x−0 2 c+4
1
for some c in (0, x). Now the right-hand side is bounded above by √
2 4
, so we have
√ √
x+4− 0+4 1
< √ .
x−0 2 4
Simplifying and rearranging gives
√ x
x+4−2 <
4
as required.

c
2020 School of Mathematics and Statistics, UNSW Sydney
90 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS

5.4 Error bounds

A second application √ of the mean√value theorem√is in calculating error bounds. Suppose that a
calculation involves 27. Clearly 27√is close to 25, but the latter is easier to work with because
it is equal to 5. If we use 5 instead of 27 then calculations are made easier with the tradeoff that
we introduce an error. How bad is this error?

Example 5.4.1.
√ Use√the mean value theorem to find an upper bound for the error involved if we
approximate 27 by 25.

Solution. The precise error in the approximation is given by

√ √
27 − 25.
√
We apply the mean value theorem to the function f , given by f (x) = x, on the interval [25, 27].
This gives √ √
27 − 25 1
= √
27 − 25 2 c
for some c in (25, 27). Rearranging this gives us
√ √ 1
27 − 25 = √
c

where 25 < c < 27. Hence

√ √
error = 27 − 25
1
=√
c
1
<√
25
1
= ,
5
since c > 25. Hence an upper bound for the error is 1/5.

We illustrate the process with one more example.

Example 5.4.2. Use the mean value theorem to find an upper bound for the error involved if we
approximate sin 5π 2π
7 by sin 3 .

Solution. Note that since

2π 14π 15π 5π
= ≈ =
3 21 21 7
and the sine function is continuous,
2π 5π
sin ≈ sin .
3 7
The (absolute) error of the approximation is exactly

sin 5π − sin 2π .

7 3

c
2020 School of Mathematics and Statistics, UNSW Sydney
5.5. THE SIGN OF A DERIVATIVE 91

If we apply the mean value theorem to the sine function on the interval [2π/3, 5π/7] then

sin 5π 2π
7 − sin 3
5π 2π = cos c
7 − 3

for some c in (2π/3, 5π/7). Taking absolute values of both sides and rearranging gives

sin 5π − sin 2π = 5π − 2π | cos c |

7 3 7 3

where 2π/3 < c < 5π/7. Now | cos c | < | cos 5π 7 | (because the absolute value of the cosine is
increasing over the interval (π/2, π)) but we don’t have an exact value for cos 5π 7 . A simple way
out of this situation is to just use the fact that | cos c | < 1 for all c in (2π/3, 5π/7). Hence we have
shown that
15π 2π 5π 2π
absolute error = sin − sin < − × 1.
21 3 7 3
This proves that
π
absolute error < < 0.15.
21
If we aren’t so lazy with our estimates we can quickly spot that

cos 5π = cos 15π < cos 15π = √1 .

7 21 20 2
This allows us to prove the more accurate statement that
π
absolute error < √ < 0.11.
21 2

5.5 The sign of a derivative

(Ref: SH10 §§4.2, 4.3)
Suppose that a function f is differentiable at 4 and that f ′ (4) = −2. In high school, it was taught
that, since the derivative of f at 4 is negative, the function f is decreasing at the point 4. This
makes intuitive sense since the gradient of the tangent to f at the point 4 is −2 and so the tangent
slopes downward. In this section, these ideas are given a rigorous foundation by using the mean
value theorem.

Definition 5.5.1. Suppose that a function f is defined on an interval I. We say

that

a) f is increasing on I if for every two points x1 and x2 in I,

x1 < x2 implies that f (x1 ) < f (x2 );

b) f is decreasing on I if for every two points x1 and x2 in I,

x1 < x2 implies that f (x1 ) > f (x2 ).

c
2020 School of Mathematics and Statistics, UNSW Sydney
92 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS

Example 5.5.2. The function f : R → R given by f (x) = x2 is increasing on [0, ∞) and decreasing
on (−∞, 0].

f (x)

decreasing increasing
x

The following theorem should be familiar to you. It is proved using the mean value theorem.
Theorem 5.5.3. Suppose that f is continuous on [a, b] and differentiable on (a, b).
(i) If f ′ (x) > 0 for all x in (a, b) then f is increasing on [a, b].

(ii) If f ′ (x) < 0 for all x in (a, b) then f is decreasing on [a, b].

(iii) If f ′ (x) = 0 for all x in (a, b) then f is constant on [a, b].

Proof. We only prove statement (i) since the proofs of statements (ii) and (iii) are similar. Suppose
that f ′ (x) > 0 for all x in (a, b) and choose two points x1 and x2 in [a, b] such that x1 < x2 . By
Definition 5.5.1, it suffices to show that f (x1 ) < f (x2 ).
Since f is differentiable on I, it is continuous on [x1 , x2 ] (see Theorem 4.5.1) and differentiable
on (x1 , x2 ). Hence, by the mean value theorem, there is a c in (x1 , x2 ) such that
f (x2 ) − f (x1 )
= f ′ (c).
x2 − x1
But f ′ (c) > 0 and x2 − x1 > 0. This means that f (x2 ) − f (x1 ) > 0 and hence f (x1 ) < f (x2 ) as
required.

Theorem 5.5.3 has several important applications. The first of these is classifying different
types of stationary points. The basic idea is as follows. Suppose that f is differentiable on the
open interval (1, 3) and that for some small positive number h we have the following:
• f ′ (x) > 0 for all x in (2 − h, 2),

• f ′ (2) = 0, and

• f ′ (x) < 0 for all x in (2, 2 + h).

Now f is differentiable on (2 − h, 2 + h) so it is also continuous on this interval. Moreover, f
increases on the interval (2 − h, 2), is stationary at 2 and then decreases on the interval (2, 2 + h).
From these facts we conclude that the stationary point 2 is a local maximum point for f .
Example 5.5.4. Find and classify all stationary points of the function f : R → R whose derivative
is given by
f ′ (x) = (x − 4)(x − 1)(x + 5)2 .
Solution. The stationary points, which are found by solving the equation f ′ (x) = 0, are 4, 1 and
−5. To classify each stationary point, we examine the sign of f ′ on small intervals either side of
each stationary point. This process is concisely documented in the following table.

c
2020 School of Mathematics and Statistics, UNSW Sydney
5.6. THE SECOND DERIVATIVE AND APPLICATIONS 93

−5− −5 −5+ 1− 1 1+ 4− 4 4+
x−4 − − − − − − − 0 +
x−1 − − − − 0 + + + +
(x + 5)2 + 0 + + + + + + +
f ′ (x) + 0 + + 0 − − 0 +
Gradient − − −

For example, the notation 4− is shorthand for those points x lying in a small interval to the
immediate left of 4. For such a point x, the factor x − 4 is negative, while the factors x − 1 and
(x + 5)2 are positive. Hence f ′ (x) is negative when x is slightly to the left of 4. Other columns of
the table are filled by a similar process.
In conclusion, 4 is a local minimum point, 1 is a local maximum point and −5 is neither a
maximum nor minimum point. (In fact, −5 is called a horizontal point of inflexion since the
tangent to f at −5 is horizontal and the concavity of the function changes about this point. See
any standard undergraduate calculus text or any advanced high school calculus text for further
details.)

5.6 The second derivative and applications

(Ref: SH10 §4.3)
In this section we give another method for classifying the stationary points of a function f . To do
so, we use the second derivative of f , which is denoted by f ′′ .

Example 5.6.1. Find the second derivative of the function f : R → R, given by the rule

f (x) = 3x4 + 2x − sin x.

Solution. The first derivative of f is given by

f ′ (x) = 12x3 + 2 − cos x.

To find the second derivative f ′′ of f , we simply differentiate f ′ . Hence

f ′′ (x) = 36x2 + sin x.

dy
Remark 5.6.2. If y = f (x) then the first derivative of f is often written as dx or y ′ while the
d2 y
second derivative of f is often written as dx2 or y ′′ .

Remark 5.6.3. Even if a function is differentiable at a point a, the function may not have a second
derivative at a. (Consider, for example, the function f : R → R given by

f (x) = x4/3 ,

at the point 0.) A function which has a second derivative at a is called twice differentiable at a.

If f is twice differentiable, the sign of f ′′ can often be used to determine whether a stationary
point is a local maximum point or a local minimum point.

c
2020 School of Mathematics and Statistics, UNSW Sydney
94 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS

Theorem 5.6.4 (The second derivative test). Suppose that a function f is twice differentiable on
(a, b) and that c ∈ (a, b).

(i) If f ′ (c) = 0 and f ′′ (c) > 0 then c is a local minimum point of f ;

(ii) If f ′ (c) = 0 and f ′′ (c) < 0 then c is a local maximum point of f .

The proof of the theorem uses the following result.

Lemma 5.6.5. Suppose that g is differentiable at a point c.

(i) If g ′ (c) > 0 then g(c − h) < g(c) < g(c + h) for all positive h sufficiently small.

(ii) If g ′ (c) < 0 then g(c + h) < g(c) < g(c − h) for all positive h sufficiently small.

Sketch proof. We give an intuitive argument for statement (i) and leave (ii) as an exercise. Suppose
that g ′ (c) > 0. By the definition of the derivative,

g(c + h) − g(c)
lim = g′ (c). (5.3)
h→0 h

That is,
g(c + h) − g(c)
≈ g′ (c)
h
when h is close to 0. Now, if h > 0 then

g(c + h) − g(c) ≈ hg ′ (c) > 0

Hence
g(c + h) − g(c) > 0 (5.4)

provided that h > 0 and that h is sufficiently small.

On the other hand, the definition of the derivative also implies that

g(c − h) − g(c)
lim = g′ (c)
h→0 −h

(to see this, simply replace h with −h in (5.3)). Now, if h > 0 and h is close to 0 then

g(c) − g(c − h) ≈ hg ′ (c) > 0.

Hence
g(c) − g(c − h) > 0 (5.5)

provided that h > 0 and that h is sufficiently small.

If we combine (5.4) and (5.5) then we have

g(c − h) < g(c) < g(c + h)

for all positive h sufficiently small.

c
2020 School of Mathematics and Statistics, UNSW Sydney
5.6. THE SECOND DERIVATIVE AND APPLICATIONS 95

Proof of Theorem 5.6.4. We prove statement (i) and leave the proof of (ii) as an exercise. Suppose
that f ′ (c) = 0, f ′′ (c) > 0 and that g = f ′ . Then g ′ (c) > 0 and

g(c − h) < g(c) < g(c + h)

for all positive h sufficiently small, by Lemma 5.6.5. Since f ′ (c) = g(c) = 0 we conclude that

f ′ (c − h) < 0 < f ′ (c + h)

for all positive h sufficiently small. So f ′ is positive on a small interval immediately to the right
of c, zero at x and negative on a small interval immediately to the left of c. By Theorem 5.5.3 we
now conclude that f has a local minimum at the point c.

Example 5.6.6. Find and classify the stationary points of the function f : R → R given by

f (x) = x3 − 6x2 + 9x − 5.

Solution. First we calculate the first and second derivatives of f :

f ′ (x) = 3x2 − 12x + 9 = 3(x − 1)(x − 3)

f ′′ (x) = 6x − 12.

Hence f has stationary points at 1 and 3. Now f ′′ (1) = −6 < 0, so f has a local maximum point
at 1. For second stationary point, f ′′ (3) = 6 > 0, so f has a local minimum stationary point at 3.
(To illustrate these conclusions, we have included the graph of f , shown below.)
f (x)

1 3
| |
x
−5

Remark 5.6.7. Suppose that f ′ (c) = 0 and that f ′′ (c) = 0. Then we cannot apply the second
derivative test. In fact, the point c may be a local maximum, a local minimum or neither of these.
The examples below illustrate various possibilities.

x4 −x4 x3

x x x

c
2020 School of Mathematics and Statistics, UNSW Sydney
96 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS

• If f (x) = x4 then f ′ (0) = f ′′ (0) = 0 and there is a local minimum at 0.

• If f (x) = −x4 then f ′ (0) = f ′′ (0) = 0 and there is a local maximum at 0.

• If f (x) = x3 then f ′ (0) = f ′′ (0) = 0 and there is neither a a local maximum and minimum
at 0 (in fact, we have a horizontal point of inflexion at 0).

Hence if f ′ (c) = f ′′ (c) = 0 then it is best to classify the stationary point c by examining the sign
of the derivative on either side of c, as illustrated in Section 5.5.

5.7 Critical points, maxima and minima

(Ref: SH10 §§4.3–4.5)
By combining results from Chapters 3, 4 and 5, we now give a comprehensive presentation of how
to locate global maxima and minima for any real-valued continuous function defined on a closed
interval.
Suppose that a function f : [a, b] → R is continuous on [a, b]. By the max-min theorem
(Theorem 3.4.3), f attains a global maximum and a global minimum on [a, b]. Suppose that the
global maximum occurs at the point c. Now either c is one of the endpoints a or b, or c lies in the
open interval (a, b). In the latter case, f is either differentiable at c or it is not. If it is differentiable
at c, then f ′ (c) = 0 by Theorem 4.8.2.
Thus we have identified that a global maximum point c must satisfy one of the three following
properties:

• c is one of the endpoints a or b of the closed interval [a, b],

• f is differentiable at c and f ′ (c) = 0, or

• f is not differentiable at c.

The set of points satisfying these properties is useful so we give it a special name.

Definition 5.7.1. Suppose that f is defined on [a, b]. We say that a point c in [a, b]
is a critical point for f on [a, b] if c satisfies one of the following properties:

(a) c is an endpoint a or b of the interval [a, b],

(b) f is not differentiable at c, or

(c) f is differentiable at c and f ′ (c) = 0.

Theorem 5.7.2. Suppose that f is continuous on [a, b]. Then f has a global maximum and global
minimum on [a, b]. Moreover, the global maximum point and the global minimum point are both
critical points for f on [a, b].

The proof of the theorem follows the same reasoning given at the beginning of this section. The
theorem provides a systematic way of finding global minima and maxima of functions.

c
2020 School of Mathematics and Statistics, UNSW Sydney
5.7. CRITICAL POINTS, MAXIMA AND MINIMA 97
f (x) f (x)
b

3 b b b

4 b

b | | b |

1 2 x 3 4 5 x
2

(a) (b)

Figure 5.3: Graphs of functions and their critical points.

Example 5.7.3. Suppose that f : R → R is given by

f (x) = 3(x − 1)2/3 .

Find the absolute maximum and minimum values of f on [0, 2].

Solution. The derivative of f is given by

2
f ′ (x) = 2(x − 1)−1/3 = √
3
.
x−1

Clearly f is not differentiable at 1. Morover, its derivative is never zero. So the only critical
points of f on [0, 2] are 1 and the endpoints 0 and 2. By Theorem 5.7.2, the global maximum and
minimum points must be among the points 0, 1 and 2. But

f (0) = 3, f (1) = 0 and f (2) = 3.

Hence the global minimum value of f on [0, 2] is 0 while the global maximum value of f on [0, 2] is
3. (See Figure 5.3 (a) for a sketch of f that shows its critical points.)

Example 5.7.4. Suppose that the function f : R → R is given by the rule

f (x) = |x2 − 3x − 4|.

Find the absolute maximum and absolute minimum values of f on the interval [0, 5].

Example 5.7.5. Consider the parabolic arc

{(x, y) ∈ R2 : y = x2 − x − 1, −2 ≤ x ≤ 3}

(see Figure 5.4 (a)). Find where the arc is (i) closest to and (ii) furthest from the origin.

Solution. If (x, y) is a point on the arc then its distance from the origin is given by
p
x2 + y 2 .

c
2020 School of Mathematics and Statistics, UNSW Sydney
98 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS
y
g(x)
34 b

|
b

| |

−2 3 x b
b

| |

−2 3 x
(a) (b)

Figure 5.4: Graphs for Example 5.7.5.

p
To find where the arc is closest to the origin, we want to minimise x2 + y 2 subject to the con-
straints y = x2 − x − 1 and −2 ≤ x ≤ 3. In other words, we want to find a global minimum for the
function p
f (x) = x2 + (x2 − x − 1)2
on the interval [−2, 3]. Because calculations involving square roots can get messy, we instead
consider the function g, given by

g(x) = x2 + (x2 − x − 1)2

on [−2, 3]. (Note that g(x) may be interpreted as the square of the distance between the origin
and a point on the arc with coordinate (x, x2 − x − 1). The global minimum of both f and g will
occur at the same point x.)
We seek the critical points of g, so we compute its derivative:

g′ (x) = 2x + 2(x2 − x − 1)(2x − 1)

= 2(x + 2x3 − 2x2 − 2x − x2 + x + 1)
= 2(2x3 − 3x2 − x2 + 1)
= 2(x − 1)(2x2 − x − 1)
= 2(x − 1)(x − 1)(2x + 1).

Hence the critical points of g on [−2, 3] are −2, −1/2, 1 and 3. Now

g(−2) = 29, g(−1/2) = 5/16, g(1) = 2, g(3) = 34

(see Figure 5.4 (b)). Therefore the arc is closest to the origin when x = −1/2 and furthest from
the origin when x = 3.

5.8 Counting zeros

In this section we combine the intermediate value theorem with results about the sign of derivatives
to determine how many solutions there are to various equations. The main ideas are illustrated in
the next example.

c
2020 School of Mathematics and Statistics, UNSW Sydney
5.8. COUNTING ZEROS 99

Example 5.8.1. Determine how many real numbers satisfy the equation

x3 − 6x2 − 15x + 8 = 0. (5.6)

Solution. Suppose that f (x) = x3 − 6x2 − 15x + 8 for all real x. Ideally, we would look for one
root of f , factor this out and solve a quadratic equation to determine the remaining roots (if any).
However, finding one root by inspection is not straightforward, so we need another approach.
Note that f is differentiable (and hence continuous) everywhere. We begin by identifying
intervals where f is monotonically decreasing and intervals where f is monotonically increasing.
The derivative of f is given by

f ′ (x) = 3x2 − 12x − 15 = 3(x − 5)(x + 1).

So the stationary points of f are 5 and −1. Morover, the table below shows that f is increasing on
the intervals (−∞, −1] and [5, ∞), while it is decreasing on [−1, 5].

Interval (−∞, −1) (−1, 5) (5, ∞)

(x − 5) − − +
(x + 1) − + +
f ′ (x) + − +

We evaluate f at the (finite) endpoints of these intervals and see that

f (−1) = 16 and f (5) = −92.

Since f is a cubic, we can now give a rough sketch of the graph of f .

f (x)
(−1, 16)
b

(5, −92)

We now argue that f has exactly one root on each of the intervals mentioned above.

• On the interval [−1, 5], the function f is decreasing. Hence f cannot have more than one
root on [−1, 5]. On the other hand, since f (−1) > 0 and f (5) < 0 there is at least one c
in [−1, 5] such that f (c) = 0 by the intermediate value theorem. Hence f has exactly one
root on [−1, 5].

• On the interval (−∞, −1], the function f is increasing. Hence f cannot have more than
one root on (−∞, −1]. On the other hand, since f (−1) > 0 and f (−5) < 0 there is at
least one c in [−5, −1] such that f (c) = 0 by the intermediate value theorem. Hence f has
exactly one root on (−∞, −1].

c
2020 School of Mathematics and Statistics, UNSW Sydney
100 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS

• On the interval [5, ∞), the function f is increasing. Hence f cannot have more than one
root on [5, ∞). On the other hand, since f (5) < 0 and f (10) > 0 there is at least one c
in [5, 10] such that f (c) = 0 by the intermediate value theorem. Hence f has exactly one
root on [5, ∞).

In conclusion, equation (5.6) has exactly three real solutions, one in each of the intervals [−5, −1],
[−1, 5] and [5, 10].

Guided by our solution to Example 5.8.1, we now outline a general approach for solving problems
of this type.
Suppose that f is continuous and differentiable everywhere and that its derivative is also con-
tinuous. To determine how many real solutions the equation f (x) = 0 has, one may follow the
procedure below.

1. Calculate f ′ and solve f ′ (x) = 0.

2. Determine the intervals where the derivative is positive and the intervals where it is neg-
ative.

3. By step 2 we know the intervals where the function f is monotonically increasing, and the
intervals where f is monotonically decreasing.

4. Evaluate the function f at the endpoints of each interval. If f changes sign on the interval,
there is exactly one root on that interval. If it does not change sign, there are no roots
on that interval. (This step must be modified slightly for intervals of the form (−∞, a] or
[b, ∞).)

Example 5.8.2. Determine how many real numbers satisfy the equation

2x3 − 9x2 + 12x − 1 = 0. (5.7)

Solution. We follow the steps outlined above. Suppose that

f (x) = 2x3 − 9x2 + 12x − 1.

By differentiating f we obtain

f ′ (x) = 6x2 − 18x + 12 = 6(x − 1)(x − 2).

The stationary points of f are 1 and 2. By studying the sign of f ′ , it is clear that f is increasing
on the intervals (−∞, 1) and (2, ∞) while it is decreasing on (1, 2). Now

f (1) = 4 and f (2) = 3,

as illustrated in the sketch below.

c
2020 School of Mathematics and Statistics, UNSW Sydney
5.9. ANTIDERIVATIVES 101
f (x)

(1, 4)
b
b

(2, 3)
x

So it is clear that f has no zeros on [1, 2] or on [2, ∞). On (−∞, 1), the function f has no more
than one zero (since it is increasing on this interval). Moreover, f changes sign on the interval [0, 1]
(since f (0) = −1 < 0 and f (1) > 0). Hence f has exactly one root on [0, 1] by the intermediate
value theorem.
In conclusion, equation (5.7) has exactly one real solution and this solution lies somewhere in
the interval [0, 1].

5.9 Antiderivatives
We open with an example.
Example 5.9.1. While filming a movie, a stunt man jumps out of a stationary helicopter 900
meters above the ground. The scriptwriter wants the man to make a 20 second mobile phone call
before opening the parachute. Physical considerations (gravity and air resistance) suggest that the
man’s velocity f (t) (in metres per second) without an open parachute is given by
f (t) = 50(1 − e−t/5 ),
where t is the number of seconds after jumping from the helicopter. How far will the man fall 20
seconds?
Solution. Write F (t) for the distance fallen (in metres) after t seconds. We note that F (0) = 0.
Since velocity is the rate of change of displacement, F satisfies the equation
F ′ (t) = f (t).
(Any equation like this involving differentiation is called a differential equation). The differential
equation may be rewritten as
dF
= 50 − 50e−t/5 . (5.8)
dt
One can check by differentiation that if
F (t) = 50t + 250e−t/5 + C, (5.9)
where C is a real constant, then F satisfies (5.8). Let’s assume (for the moment) that all possible
solutions to the differential equation (5.8) are of the form (5.9). Imposing the condition that
F (0) = 0 allows us to evaluate the unknown constant C:
0 = F (0) = 50 × 0 + 250e0 + C,

c
2020 School of Mathematics and Statistics, UNSW Sydney
102 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS

whence C = −250. Therefore F (t) = 50t + 250e−t/5 − 250. To complete the solution,

F (20) = 50 × 20 + 250e−4 − 250 ≈ 754.6.

Hence the stunt man will fall about 755 metres in the first 20 seconds.

To be certain of this solution, we need to show that every function F satisfying the differential
equation (5.8) is of the form (5.9). This can be done using the mean value theorem. We begin by
introducing the notion of an antiderivative.

Definition 5.9.2. Suppose that f is continuous on an open interval I. A function

F is said to be an antiderivative (or a primitive) of f on I if F ′ (x) = f (x) for all x in
I. The process of finding an antiderivative of a function is called antidifferentiation.

Example 5.9.3. (a) Suppose that

f (t) = 50 − 50e−t/5 ∀t ∈ R

Then the function F , given by

F (t) = 50t + 250e−2t − 250

is an antiderivative of f on (−∞, ∞).

(b) Suppose that n is a positive integer and f (x) = xn . If

xn+1 xn+1
F (x) = and G(x) = +5
n+1 n+1
then F and G are both antiderivatives of f on the interval (−∞, ∞). So f has more than
one antiderivative on (−∞, ∞). In fact, if C is any real number and

H(x) = xn+1 /(n + 1) + C

then H is an antiderivative of f on (−∞, ∞). Thus the function f has infinitely many
antiderivatives on (−∞, ∞).

Example 5.9.3 (b) illustrates the general principle that if F is an antiderivative of f on I then

F +C

is also an antiderivative of f on I, for any real constant C. The next theorem says that all
antiderivatives are of this form.

Theorem 5.9.4. Suppose that f is a continuous function on an open interval I and that F and G
are two antiderivatives of f on I. Then there is a real constant C such that

G(x) = F (x) + C

for all x in I.

c
2020 School of Mathematics and Statistics, UNSW Sydney
5.10. L’HÔPITAL’S RULE 103

Proof. Suppose that F and G are two antiderivatives of f on I and let H denote the function given
by
H(x) = G(x) − F (x)
for all x in I. Then H is differentiable on I and

H ′ (x) = G′ (x) − F ′ (x)

= f (x) − f (x)
=0

for all x in I. Hence there is a constant C such that H(x) = C for all x in I (by Theorem 5.5.3
(c)). Therefore
G(x) = F (x) + H(x) = F (x) + C
for all x in I.

Theorem 5.9.4 justifies the assumption made in the solution to Example 5.9.1. It also enables
us to write down all possible antiderivatives of some well-known functions. In the table below, C
is any real constant.

Function Antiderivative

1
xr , where r is rational and r 6= −1 r+1 x
r+1 +C

sin x − cos x + C

cos x sin x + C

1 ax
eax ae +C
f ′ (x)
ln |f (x)| + C
f (x)

5.10 L’Hôpital’s rule

(Ref: SH10 §§11.5, 11.6)
Consider the limit
ex − 1
lim .
x→0 5x + x2

Since both the numerator and denominator approach 0 as x → 0+ , the limit is an example of an
indeterminate form of type 00 . None of the limit rules of Chapter 2 can be easily applied to evaluate
the limit. However, the mean value theorem gives another rule which helps in this situation.
Suppose that f (x) = ex − 1 and g(x) = 5x + x2 . Instead of considering the quotient f (x)/g(x)
(whose limit is difficult to calculate), we consider the quotient f ′ (x)/g ′ (x) of derivatives (whose
limit is easier to calculate):
f ′ (x) ex 1
lim ′ = lim = .
x→0 g (x) x→0 5 + 2x 5

c
2020 School of Mathematics and Statistics, UNSW Sydney
104 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS

Since this second limit exists, l’Hôpital’s rule (which is stated below) implies that

f (x) f ′ (x)
lim = lim ′ .
x→0 g(x) x→0 g (x)

Hence
ex − 1 1
lim = .
x→0 5x + x2 5
Theorem 5.10.1 (l’Hôpital’s rule). Suppose that f and g are both differentiable functions and a
is a real number. Suppose also that either one of the two following conditions hold:
• f (x) → 0 and g(x) → 0 as x → a;

• f (x) → ∞ and g(x) → ∞ as x → a.

If
f ′ (x)
lim
x→a g ′ (x)

exists then
f (x) f ′ (x)
lim = lim ′ .
x→a g(x) x→a g (x)

Remark 5.10.2. The theorem also holds for

• limits as x → ∞ or x → −∞;

• one-sided limits (that is, as x → a+ or as x → a− ).

L’Hôpital’s rule is proved using the mean value theorem (see the sketch proof at the end of this
section).
ln x
Example 5.10.3. Calculate lim .
x→1 1 − x

Solution. If f (x) = ln x and g(x) = 1 − x then both f (x) and g(x) approach 0 as x → 1. Moreover,
f and g are both differentiable. So we look at the quotient of the derivatives:
f ′ (x) 1/x 1
′
= = − → −1
g (x) −1 x
as x → 1. We conclude by l’Hôpital’s rule that
f (x)
lim = −1.
x→1 g(x)

(Note: for short we may write

ln x 1/x
lim = lim (5.10)
x→1 1 − x x→1 −1
1
= lim −
x→1 x
= −1

provided that we check that our application of l’Hôpital’s rule in (5.10) is valid.)

c
2020 School of Mathematics and Statistics, UNSW Sydney
5.10. L’HÔPITAL’S RULE 105

Sometimes l’Hôpital’s rule must be applied more than once to calculate a limit. The next
example illustrates this.
x2
Example 5.10.4. Determine the limiting behaviour of as x → ∞.
e2x
Solution. Observe that both the numerator and denominator approach ∞ as x → ∞. Differenti-
ating the numerator and denominator gives
2x
.
2e2x
∞
Again, as x → ∞ we have an indeterminate form of type ∞ . If we differentiate the numerator and
denominator once more we have
2
4e2x
and this limit can be evaluated. Piecing this together, we have

x2 2x 2
lim 2x
= lim 2x
= lim =0
x→∞ e x→∞ 2e x→∞ 4e2x

by l’Hôpital’s rule.

As the next example shows, l’Hôpital’s rule can be used to solve limits that are indefinite forms
of type 0 × ∞.
Example 5.10.5. Determine the limiting behaviour of x ln x as x → 0+ .
The next example warns against the improper application of l’Hôpital’s rule.
2x − cos x
Example 5.10.6. Determine the limiting behaviour of as x → ∞.
3x + cos x
We shall now sketch the proof of l’Hôpital’s rule for the case when x → a+ . Other cases are done
similarly. (A complete proof uses a generalisation of the mean value theorem known as Cauchy’s
mean value theorem.)

Sketch proof of Theorem 5.10.1. Suppose that f and g are differentiable (and hence continuous)
everywhere, that f (x) → 0 and g(x) → 0 as x → a+ , and that

f ′ (x)
lim
x→a+ g ′ (x)
exists. We need to show that
f (x) f ′ (x)
lim = lim ′ . (5.11)
x→a+ g(x) x→a+ g (x)
We begin by applying the mean value theorem to both functions f and g on the interval [a, x].
So there are real numbers c and d in (a, x) such that
f (x) − f (a) g(x) − g(a)
= f ′ (c) and = g ′ (d). (5.12)
x−a x−a
Since f is continuous and f (x) → 0 as x → a+ , we have f (a) = 0. Similarly g(a) = 0. So by (5.12)
we have
f (x) = (x − a)f ′ (c) and g(x) = (x − a)g′ (d)

c
2020 School of Mathematics and Statistics, UNSW Sydney
106 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS

for some c and d in (a, x). Hence

f (x) (x − a)f ′ (c) f ′ (c)

= = ′ . (5.13)
g(x) (x − a)g(d) g (d)

Now c → a+ and d → a+ as x → a+ . Hence (5.13) becomes (5.11) as required.

c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 5 107

Problems for Chapter 5

Problems 5.1 : The mean value theorem

1. [R] Find a real number c which satisfies the conclusions of the mean value theorem for
each function f on the given interval.
√
a) f (x) = x3 on [1, 2] b) f (x) = x on [0, 2].

2. [R] Suppose that f (x) = 1/x. Show that there is no real number c in [−1, 2] such that
f (2) − f (−1)
f ′ (c) = .
2 − (−1)
Why does this not contradict the mean value theorem?

3. [R] Consider the function f given by f (x) = (x − 2)4 cos(x2 − 4x + 4). Use the mean value
theorem to show that f ′ has a zero on the interval [1, 3].

Problems 5.3 : Proving inequalities using the mean value theorem

4. [R] [V] By using the mean value theorem, show that

a) ln(1 + x) < x whenever x > 0;
b) − ln(1 − x) < x/(1 − x) whenever 0 < x < 1;
c) 1 + x < ex whenever x > 0.

5. [R]
a) Use the mean value theorem to show that sin t ≤ t whenever t > 0.
b) Hence show that sin t < t whenever t > 0.
1
c) Using the pinching theorem and part (a), evaluate the limit lim sin .
x→∞ x

6. [H] Prove that

x √ x
1+ √ < 1+x <1+ whenever x > 0.
2 1+x 2

Problems 5.4 : Error bounds

7. [R] [V] Use the mean value theorem to find an upper bound for the error involved if we
approximate
√ √
a) 17 by 16 = 4;

1998 2
b) by 22 = 4;
1000
1 1
c) by .
1002 1000
c
2020 School of Mathematics and Statistics, UNSW Sydney
108 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS

Problems 5.5 : The sign of a derivative,

5.6 : The second derivative and applications and
5.7 : Critical points, maxima and minima

8. [R] The derivative of a function f : R → R is given by f ′ (x) = 3(x + 1)(x − 1)2 (x − 4)3 .
Locate all stationary points of f and identify any local maximum or minimum points of
f.

9. [X] The function f : R → R, given by

(
x sin x1 if x 6= 0
f (x) =
0 if x = 0,
is continuous but not differentiable at 0. Does f have a local maximum or a local minimum
at 0? Prove your answer.

10. [R] Find the maximum and minimum values for each function f over the given interval.
a) f (x) = 3 − x3 over [−2, 4] b) f (x) = 3 − x4 over [−2, 4]
c) f (x) = x3 − x4 over [−5, 5] d) f (x) = 2x(x + 4)3 over [−2, 1]
e) f (x) = |x2 − 3x + 2| over [0, 3]

11. [R] Find the point on the straight line 2x + 3y = 6 which is closest to the origin.

x2 x3
12. a) i) [R] Show that the polynomial p3 , where p3 (x) = 1 + x + + , has at least
2! 3!
one real root.
x2
ii) [H] Show that the polynomial p2 , where p2 (x) = 1 + x + , has no real roots
2!
and deduce that p3 has exactly one real root.
x2 x3 x4
iii) [X] Deduce that p4 (x) = 1 + x + + + > 0 for all real numbers x.
2! 3! 4!
n
X xk
b) [X] Suppose that pn (x) = whenever n = 1, 2, 3, . . . . Use induction to prove
k!
k=0
that
i) if n is even then pn (x) > 0 for all real numbers x, and
ii) if n is odd then pn (x) has exactly one real root and this root is negative.

13. [R] A wire of length 100 cm is cut into two pieces of length x cm and y cm. The piece of
length x cm is bent into the shape of a square and the piece of length y cm into the shape
of a circle. Find x and y so that the sum of the areas enclosed by the shapes will be
a) a minimum b) a maximum.

14. [X] Suppose that a ≥ 0. Find the greatest and least distances from the point (a, 0) to
the ellipse
x2 y 2
+ = 1.
4 1
(Have a precise answer before comparing with the given answer.)

c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 5 109

15. [X] Find all the values of a and x, both in [0, 2π], where

f (x) = cos a + 2 cos(2x) + cos(4x − a)

has a horizontal point of inflexion.

Problems 5.8 : Counting zeros

16. [R] Show that x3 + x − 9 = 0 has only one real solution.

17. [R] [V] Suppose that p(x) = x3 − 12x2 + 45x − 51 whenever x ∈ R. How many real zeros
does p have?

Problems 5.9 : Antiderivatives

18. [R]

a) Find a function f that has the following properties:

f ′ (t) = sin t + t whenever t ∈ R,

f (0) = 2.

b) Are there any other functions with these properties? Explain your answer.

19. [R] [V] A particle moving along the x-axis has velocity 2t − t2 units per second after t
seconds. Find

a) the distance from the starting point after three seconds;

b) the total distance travelled after three seconds.

Problems 5.10 : L’Hôpital’s rule

20. [R] Calculate the following limits.

ex − 1 xm − 1 x − π/2
a) lim b) lim , n 6= 0 c) lim
x→0 x(3 + x) x→1 xn − 1 x→π/2 cos x

ln (1 + x) − x 1 − sin x tan x − x
d) lim e) lim f) lim
x→0 x2 x→π/2 1 + cos 2x x→0 x3

21. [R] Determine the limiting behaviour in the following cases.

x3 + 1 e5x
a) as x → ∞ b) as x → ∞
x4 + 1 x3
e5x
c) as x → −∞ d) x sin(1/x) as x → ∞
x3
√
x4 + 1 ln(x3 + 1)
e) √ as x → ∞ f) as x → ∞
3
x6 + 1 ln(x2 + 1)

c
2020 School of Mathematics and Statistics, UNSW Sydney
110 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS

1 1
22. [H] Find the value of lim + .
t→0 ln(1 + t) ln(1 − t)

ax − 1 + ebx
23. [H] Find (a, b) such that lim = 1.
x→0 x2

4x + sin x
24. [R] Explain why l’Hôpital’s Rule cannot be used to find lim . Use another
x→∞ 2x − sin x
method to find this limit.

25. [R] [V] Show that the function f , given by

(
e2x if x ≥ 0
f (x) =
2x + 1 if x < 0,

is differentiable at 0.

26. [R]
√
cos
h−1
a) Evaluate lim .
h→0+ h
b) A function f is defined by
( √
cos x if x ≥ 0
f (x) =
ax + b if x < 0,

where a and b are real numbers. By using the limit calculated in (a), find all possible
values of a and b such that f is differentiable at 0.

27. [H] [V]

a) Use l’Hôpital’s rule to show that lim x ln x = 0.

x→0+

b) By using part (a), or otherwise, show that lim x2 ln x = 0.

x→0+
c) A function f is defined by
(
x2 ln x if x > 0
f (x) =
ax + b if x ≤ 0,

where a and b are real numbers. Find all possible values of a and b such that f is
differentiable at 0.

c
2020 School of Mathematics and Statistics, UNSW Sydney
111

Chapter 6

Inverse functions

We return again to the pollution example (see the introductions to Chapter 2). According to our
model, the volume of pollution P (t) in the lake after t days of the factory’s operation is given by
109 5
P (t) = 1 − e−101t/10 .
101
Environmental authorities, after hearing that the eventual amount of pollution in the lake will be
109 /101 litres, determine that this level would be environmentally devastating. Recommendations
are made that the amount of pollution should not exceed 8 × 106 litres. If factory operations do
not change, in how many days will pollution levels in the lake exceed this amount?
Note that the function P expresses pollution as a function of time. One approach to solving
this problem is to find a function T that expresses time as a function of pollution. Such a function
is called an inverse function of P . This chapter is devoted to a study of inverse functions.

6.1 Some preliminary examples

The examples of this section give an introduction to some of the key concepts appearing in Chapter
6. (A rigorous treatment of these concepts will be given in later sections.)
Example 6.1.1. Jack and Jill are playing a number game. Jack says, ‘A positive number, when
squared, is equal to 16. What was the original number?’ Jill correctly answers that the number
was 4.
Jack goes again: ‘A positive number, when squared, is equal to 49. What was the original
number?’ This time Jill answers ‘7.’
This game may be modelled using two functions. Jack’s function f takes a positive number and
squares it. That is, f is given by

f : (0, ∞) → (0, ∞), f (x) = x2 .

The function g that Jill uses to answer Jack’s question is given by

√
g : (0, ∞) → (0, ∞), g(x) = x.

So f takes an initial number and squares it, while g takes the square root and recovers the original
number. A way of writing down the fact that g undoes (or reverses) what f does to any positive
number x is:
g(f (x)) = x ∀x ∈ Dom(f ). (6.1)

c
2020 School of Mathematics and Statistics, UNSW Sydney
112 CHAPTER 6. INVERSE FUNCTIONS

f (x) f (x) f (x)

output

output output

input x input x input input x

(a) Example 6.1.1 (b) Example 6.1.2 (c) Example 6.1.3

Figure 6.1: Correspondence between inputs and outputs

In fact, f also undoes what g does to any positive number. This is easily verified by checking that

f (g(x)) = x ∀x ∈ Dom(g). (6.2)

Because f and g have this special relationship, they are called inverses of each other.
The relationship between any function f and its inverse function g can also be viewed in the
following way. If y = f (x) and we want to express x as a function of y, then we find that x = g(y).
For the functions of Example 6.1.1, this is easily verified by the following calculation:
√
y = f (x) ⇐⇒ y = x2 ⇐⇒ y=x ⇐⇒ x = g(y).

The next example shows that changing the domain of f gives a different inverse function g.

Example 6.1.2. Jack changes the game slightly. He says, ‘A negative number, when squared, is
equal to 36. What was the original number?’ Jill correctly answers that the number was −6.
Again, we model the game with two functions. Jack’s function f is given by

f : (−∞, 0) → (0, ∞), f (x) = x2 ,

while Jill’s function g is given by

√
g : (0, ∞) → (−∞, 0), g(x) = − x.

Note that g undoes what f does to any negative number, and vice-versa. That is, f and g satisfy
equations (6.1) and (6.2) and are therefore inverse functions of each other.

Each function f in Example 6.1.1 and Example 6.1.2 had the property that for every output
there is a unique corresponding input (See Figure 6.1 (a) and (b)). However, the function f graphed
in Figure 6.1 (c) does not have a one-to-one correspondence between its outputs and inputs. As
we see in the next example, this function f has no inverse function.

Example 6.1.3. Jack changes the game one more time. This time he says, ‘A real number, when
squared, is equal to 64. What was the original number?’ This time Jill cannot give a definite
answer to his question. The answer is either 8 or −8, but she has no way of telling which it is.
The model for this situation is the following. Jack’s function f is now given by

f : R → [0, ∞), f (x) = x2 ;

c
2020 School of Mathematics and Statistics, UNSW Sydney
6.2. ONE-TO-ONE FUNCTIONS 113

but there is no function g that undoes what f does to a real number x. That is, there is no function
g that satisfies the equation (6.1). Therefore f has no inverse function. The reason why this is the
case is clear: the domain of f now includes the positive and negative real numbers, and therefore
some (in fact most) outputs of f have two possible corresponding inputs. That is, there is no
one-to-one correspondence between inputs and outputs of f .

The ideas introduced through these examples are summarised below.

• If f has an inverse function g, then g undoes (or reverses) what f does to elements of
Dom(f ).

• Suppose f and g are inverse functions. Then y = f (x) if and only if x = g(y).

• A function f has an inverse function only if there is a one-to-one correspondence between

inputs and outputs of f .

• The domain of a function f plays an important role in determining whether f has an

inverse function g, and if so, exactly what the inverse function is.

• If a function f does not have an inverse function, then we can sometimes modify the
domain of f so that it does have an inverse function. (For example, if f (x) = x2 then
f : R → [0, ∞) does not have an inverse function while the function f : (0, ∞) → (0, ∞),
with domain restricted to the positive real numbers, does have an inverse function.)

In the following sections we treat these ideas more rigorously.

6.2 One-to-one functions

(Ref: SH10 §7.1)
As was demonstrated in Section 6.1, the property that a function has a one-to-one correspondence
between inputs and outputs is critical to our discussion.

Definition 6.2.1. A function f is said to be one-to-one if

f (x1 ) = f (x2 ) implies that x1 = x2
whenever x1 , x2 ∈ Dom(f ).

In other words, a function is one-to-one if every output has a unique input. Equivalently, f is
one-to-one if
f (x1 ) 6= f (x2 ) whenever x1 6= x2 ,

provided that both x1 and x2 belong to Dom(f ). One-to-one functions are sometimes called
injective functions.

Example 6.2.2. Show that the function f : R → R, given by f (x) = 2x − 3, is one-to-one.

c
2020 School of Mathematics and Statistics, UNSW Sydney
114 CHAPTER 6. INVERSE FUNCTIONS

Solution. Suppose that f (x1 ) = f (x2 ). Then

2x1 − 3 = 2x2 − 3.

If we add 3 to both sides and afterwards divide by 2 then we obtain

x1 = x2 .

Hence f is one-to-one.

Example 6.2.3. Show that the function f : R → R, given by f (x) = x2 , is not one-to-one.

Proof. Clearly f (−2) = f (2). However, −2 6= 2 and so f is not one-to-one.

There are other ways of identifying one-to-one functions. The following method is a simple
geometric test.

Proposition 6.2.4 (The horizontal line test). Suppose that f is a real-valued function defined on
some subset of R. Then the following statements are equivalent:

(a) f is one-to-one;

(b) every horizontal line in the Cartesian plane intersects the graph of f at most once.

Example 6.2.5. Consider the functions graphed below.

f (x) g(x) h(x)

A
x x x

• f is not one-to-one because the horizontal line passing through the point A cuts the graph
of f more than once;

• g is one-to-one (in fact, since g is increasing, every horizontal line can cut the graph of g
graph no more than once);

• h is also one-to-one (even though it is not always increasing).

Although not every one-to-one function is increasing (or decreasing), it is true that every in-
creasing function is one-to-one.

Proposition 6.2.6. If a function f is either increasing or decreasing, then f is one-to-one.

Sketch proof. If f is increasing and x1 , x2 ∈ Dom(f ), then

x1 < x2 ⇐⇒ f (x1 ) < f (x2 ). (6.3)

Now compare (6.3) with Definition 6.2.1.

c
2020 School of Mathematics and Statistics, UNSW Sydney
6.3. INVERSE FUNCTIONS 115

Example 6.2.7. Show that the function f : R → R, given by

f (x) = 2x5 + x3 + x − 10,

is one-to-one.
Solution. The derivative of f is given by f ′ (x) = 10x4 + 3x2 + 1. Since f ′ (x) ≥ 1 for all real x, the
function f is increasing and hence one-to-one.

Remark 6.2.8. Not every function whose derivative is only positive (or only negative) on its
domain is one-to-one. For example, on Dom(tan)
d
tan x = sec2 x ≥ 1,
dx
but tan is not one-to-one, nor is it increasing on its domain. The problem here is that the domain
of tan has gaps.

6.3 Inverse functions

(Ref: SH10 §7.1)
We have already mentioned the term inverse function in Section 6.1. In this section we define
precisely what is meant by an inverse function and explore some of the relationships between a
function and its inverse. Our starting point is the following theorem. It says that if f is one-to-one,
then there is a function g which undoes what f does to any input x (c.f. equation (6.1)).
Theorem 6.3.1. Suppose that f is a one-to-one function. Then there is a unique function g with
Dom(g) = Range(f ) satisfying

g(f (x)) = x ∀x ∈ Dom(f ) (6.4)

and
f (g(x)) = x ∀x ∈ Range(f ). (6.5)
Moreover,
Range(g) = Dom(f )
and g is one-to-one.
Proof. Suppose that f is one-to-one and let D and R denote the domain and range of f respectively.
Since f is one-to-one, for every y in R there is a unique x in D such that y = f (x).
Now define a function g : R → D by the following rule: if y ∈ R then g(y) is the unique x in D
such that f (x) = y.
It is left as an exercise to show that g is one-to-one and that equations (6.4) and (6.5) hold.

The theorem allows us to define the term inverse function.

Definition 6.3.2. Suppose that f is a one-to-one function. Then the inverse func-
tion of f is the unique function g given by Theorem 6.3.1. The inverse function for
f is often written as f −1 .

c
2020 School of Mathematics and Statistics, UNSW Sydney
116 CHAPTER 6. INVERSE FUNCTIONS

Remark 6.3.3. If f −1 denotes the inverse function of a one-to-one function f , then equations (6.4)
and (6.5) can be expressed as

f −1 (f (x)) = x ∀x ∈ Dom(f )

and
f (f −1 (x)) = x ∀x ∈ Range(f ).
(Warning: This notation is potentially confusing if not understood correctly. The function f −1
is not the reciprocal function of f (that is, 1/f (x)). To denote the reciprocal of f (x) using index
notation we write [f (x)]−1 rather than f −1 (x).)

Remark 6.3.4. If f is a bijection and g is the inverse of a function f , then f is the inverse of g.
This is due to the symmetry between f and g in the statements of Theorem 6.3.1.

If f is a one-to-one function then equation (6.5) can often be used to find an explicit formula
for its inverse g.

Example 6.3.5. Suppose that f : R → R is given by f (x) = 4 − 13 x3 .

(a) Explain why f has an inverse function.

(b) Find the inverse function g of f .

(c) Sketch f and g on the Cartesian plane.

(d) Use the inverse function g to solve the equation f (x) = 13.

Solution. (a) Since f ′ (x) = −x2 ≤ 0 for all x in R (with f ′ (x) = 0 only when x = 0), the function
f is decreasing and is therefore one-to-one. Hence f has an inverse function.
(b) Let g denote the inverse function of f . Then Dom(g) = Range(f ) = R and Range(g) =
Dom(f ) = R. To find an explicit formula for g(x), suppose that

y = g(x).

If we apply f to both sides of the equation, and use (6.5), then

f (y) = f (g(x)) = x.

Using the definition of f , this simplifies to

1
4 − y 3 = x,
3
whereupon rearrangement gives √
3
y= 12 − 3x.
Therefore g : R → R is given by √
3
g(x) = 12 − 3x. (6.6)
(c) The graphs of f and g are shown below. Note that each graph is the reflection the other in
the line y = x.

c
2020 School of Mathematics and Statistics, UNSW Sydney
6.3. INVERSE FUNCTIONS 117
y
y=x

y = g(x)

y = f (x)

(d) To solve the equation f (x) = 13, apply g to both sides. This gives

g(f (x)) = g(13).

By (6.4) and (6.6) this simplifies to

√
3
x= 12 − 3 × 13
= −3.

Remark 6.3.6. As was illustrated in Example 6.3.5 (c), the graph of a one-to-one function f and
its inverse g are reflections of each other in the line y = x. We give a proof of this fact below:

(x, y) lies on the graph of f ⇐⇒ y = f (x)

⇐⇒ g(y) = g(f (x))
⇐⇒ g(y) = x
⇐⇒ (y, x) lies on the graph of g.

If a function f is not one-to-one, then we can sometimes restrict the domain of f so that it
becomes one-to-one. The function f with restricted domain then has an inverse function.

Example 6.3.7. Consider the function f : R → R given by

f (x) = 4 − x2 .

Find a restriction of f such that f becomes one-to-one. Find the inverse function g of this restriction.

Solution. A quick sketch of the graph of f (see Figure 6.2 (a)) shows that f is not one-to-one. If we
restrict the domain of f to [0, ∞), then the restricted function f : [0, ∞) → R passes the horizontal
line test and is therefore one-to-one. Let g denote the inverse of f : [0, ∞) → R. Then

g : (−∞, 4] → [0, ∞)

c
2020 School of Mathematics and Statistics, UNSW Sydney
118 CHAPTER 6. INVERSE FUNCTIONS
y y y
4 y = g(x) 4 y = f (x) 4
−2 2 4 4
x x x
y = g(x)
y = f (x) y = f (x)

(a) (b) (c)

Figure 6.2: Two different restrictions of the function f , and their corresponding inverses (see
Example 6.3.7).

since Dom(g) = Range(f ) = (−∞, 4] and Range(g) = Dom(f ) = [0, ∞). We find an explicit
formula for g:

y = g(x)
f (y) = x (by equation 6.5)
4 − y2 = x
√
y = 4−x (taking the positive root since Range(g) = [0, ∞)).
√
Hence g(x) = 4 − x. The graphs of f and g are sketched in Figure 6.2 (b).

Remark 6.3.8. In the solution to Example 6.3.7, one could have instead chosen the restriction
f : (−∞, 0] → R. The corresponding inverse g is given by
p
g : (−∞, 4] → (−∞, 0], g(x) = − 4 − x2 .

The graphs of f and g are given in Figure 6.2 (c).

Example 6.3.9. Consider the function f : R → R, given by

f (x) = −x3 + 3x2 + 24x − 13.

Find all intervals I, each as large as possible, such that the function f : I → R (with domain
restricted to I) has an inverse function.
Remark 6.3.10. If you study MATH1081 you will see that an abstract function f : A → B is said
to be invertible if it is a bijection from the set A onto the set B. That is, it is both one-to-one and
its range is all of its codomain B and so the problem of solving f (x) = b has a unique solution for
all b ∈ B. In the algebra strand of this course you will see that an m × n matrix A is invertible
if the matrix equation Ax = b has a unique solution for all b ∈ Rm , and we’ll see the idea again
with linear transformations in Mathematics 1B.
In calculus we often initially define a function without precisely knowing its range, and so we
usually say,
“Define f : (a, b) → R by ...”. Rx 2
For example, a very important function is f : R → R, f (x) = 0 e−t dt. It is easy to check that this
√ √
function f is one-to-one, but it is hard to see that the range of f is (− π/2, π/2). In calculus,
when we say that this function f is invertible, we are not saying that f is a bijection from its
domain onto it’s codomain R, but rather that f is a bijection from it domain onto its range.

c
2020 School of Mathematics and Statistics, UNSW Sydney
6.4. THE INVERSE FUNCTION THEOREM 119

6.4 The inverse function theorem

(Ref: SH10 §§7.1, B.3)
If a function f is one-to-one then its inverse function g is also one-to-one (see Theorem 6.3.1).
What other properties of f does its inverse function g inherit?
First, if f is continuous, then so is g. (A rough argument for this fact would go like this: if f
is continuous then its graph can be drawn as an unbroken line. Now reflect this graph in the line
y = x. The reflected graph is also an unbroken line. Moreover it corresponds to the graph of g.
Hence g is continuous.)
What about differentiability? If a one-to-one function f is differentiable then its inverse g is
not always differentiable. For example, if f : R → R is given by f (x) = x3 , then f is differentiable
everywhere. Its inverse function g is given by
√
3
g : R → R, g(x) = x.

Since
1
g′ (x) = √
3
,
x2
g is not differentiable at 0. So while f is differentiable everywhere, its inverse g is not.
The problem with the above example occurs where the derivative of f is zero. When a horizontal
tangent to f is reflected in the line y = x, it becomes a vertical tangent to g and hence g is not
differentiable at this point.
y

However, if f is a differentiable function whose derivative is never zero, its inverse g will also be
differentiable.

Theorem 6.4.1 (The inverse function theorem). Suppose that I is an open interval, f : I → R is
differentiable and f ′ (x) 6= 0 for all x in I. Then

(i) f is one-to-one and has an inverse function g : Range(f ) → Dom(f ),

(ii) g is differentiable at all points in Range(f ), and

(iii) the derivative of g is given by the formula

1
g′ (x) =
f ′ (g(x))

for all x in Range(f ).

Some parts of the inverse function theorem are easy to prove while others are hard. Statement
(i) is true since f ′ (x) 6= 0 for x in I implies that f is either increasing on I or decreasing on I (by

c
2020 School of Mathematics and Statistics, UNSW Sydney
120 CHAPTER 6. INVERSE FUNCTIONS

the mean value theorem). Hence f is one-to-one. Statement (ii) is difficult to prove and involves a
delicate limiting argument with the difference quotients of f and g.
It is important that students understand the proof of statement (iii). Suppose (by statement
(ii)) that g is differentiable. Beginning with the equation

f (g(x)) = x

(see 6.5), differentiation with respect to x gives

f ′ (g(x)).g ′ (x) = 1

by the chain rule. Since f ′ is never zero on I, we can divide by f ′ (g(x))) to obtain

1
g′ (x) =
f ′ (g(x))

as required.
In the next section the inverse function theorem will be applied to the trigonometric functions.

6.5 Applications to the trigonometric functions

(Ref: SH10 §7.7)
b
In elementary trigonometry, the sine function gives the ratio a of two side lengths of a right-angle
triangle corresponding to the angle θ.

c
b

θ
a
However, if we know the ratio and instead want to find the corresponding angle, we use the inverse
sine function. Thus relationship between the sine and inverse sine functions is given by

b b
sin θ = , sin−1 = θ. (6.7)
c c
This relationship works in elementary trigonometry where the angle θ is acute. However, if θ is
allowed to be any real number then the relationship between sin and sin−1 is not so straightforward.
This is because the function sin : R → R is not one-to-one and therefore has no inverse (see Figure
6.3).
In the next example, we define exactly what we mean by sin−1 and compute its derivative.

Example 6.5.1 (The inverse sine function). Since the usual sine function is not a one-to-one
function, we consider instead a restricted sine function

sin : [− π2 , π2 ] → [−1, 1]

which is one-to-one (see Figure 6.3 (a)). This restricted function has an inverse

sin−1 : [−1, 1] → [− π2 , π2 ],

c
2020 School of Mathematics and Statistics, UNSW Sydney
6.5. APPLICATIONS TO THE TRIGONOMETRIC FUNCTIONS 121
sin x sin−1 x
(1, π2 )
( π2 , 1) b

b
x x
(− π2 , −1) b

(−1, − π2 )

(a) (b)

Figure 6.3: Graphs of sin and sin−1

which is graphed in Figure 6.3 (b).

We note that sin−1 is an odd function, that is,

sin−1 (−x) = − sin−1 x ∀x ∈ [− π2 , π2 ].

We now examine the differentiability of sin−1 . If I = (− π2 , π2 ) then the sine function is differen-
tiable on I and
d
(sin x) = cos x 6= 0
dx
for all x in I. So by the inverse function theorem (Theorem 6.4.1), sin−1 is differentiable on (−1, 1).
While the derivative of sin−1 could be computed using the formula of Theorem 6.4.1 (iii), we prefer
to compute it directly by implicit differentiation.
If y = sin−1 x then

sin y = x (by applying sin to both sides)

dy
cos y =1 (by implicit differentiation)
dx
dy 1
= (since cos y > 0 when − π2 < y < π2 ). (6.8)
dx cos y

We seek a simple expression for cos y in terms of x. By the trigonometric identity sin2 y +cos2 y = 1,
and the fact that y = sin−1 x, we have
q q p
cos y = 1 − sin2 y = 1 − (sin(sin−1 x))2 = 1 − x2 , (6.9)

where the positive square root was taken since cos y > 0. Substituting (6.9) into (6.8) gives

dy 1
=√
dx 1 − x2
and hence
d 1
(sin−1 ) = √
dx 1 − x2
whenever −1 < x < 1.
As a corollary, note that the derivative of sin−1 is positive, so sin−1 is an increasing function.

The following table summarises the domain, range and derivatives of the restricted (and hence
one-to-one) trigonometric functions and their inverses. The derivatives exist on the largest open

c
2020 School of Mathematics and Statistics, UNSW Sydney
122 CHAPTER 6. INVERSE FUNCTIONS
(−1, π) cos−1 x tan−1 x
b

π
2
b
(0, π2 ) b

(1, 0) x x
b

− π2

Figure 6.4: Graphs of cos−1 and tan−1

intervals contained in the domain of each function. Proofs of the derivatives for the inverse trigono-
metric functions are done in much the same way as illustrated in Example 6.5.1. It is expected that
students will be able to derive these derivatives, using the method of Example 6.5.1, if required.
The graphs of cos−1 and tan−1 are shown in Figure 6.4.

Function Domain Range Derivative

d
sin [− π2 , π2 ] [−1, 1] (sin x) = cos x
dx
d 1
sin−1 [−1, 1] [− π2 , π2 ] (sin−1 x) = √
dx 1 − x2
d
cos [0, π] [−1, 1] (cos x) = − sin x
dx
d 1
cos−1 [−1, 1] [0, π] (cos−1 x) = − √
dx 1 − x2
d
tan (− π2 , π2 ) (−∞, ∞) (tan x) = sec2 x
dx
d 1
tan−1 (−∞, ∞) (− π2 , π2 ) (tan−1 x) =
dx 1 + x2

We note also that sin−1 and tan−1 are both odd functions, but that

cos−1 (−x) = π − cos−1 x ∀x ∈ [−1, 1].

The next example illustrates that one must be careful with the notation sin and sin−1 . It is
not always true that sin−1 (sin x) = x. This is because sin−1 is not the inverse of sin : R → R. It is
only the inverse of sin : [− π2 , π2 ] → R. In particular,

sin−1 (sin x) = x

only when x ∈ [− π2 , π2 ]. Similar comments apply to the other trigonometric functions.

Example 6.5.2. Evaluate the following numbers without using a calculator:

(a) sin−1 (sin 5π

4 ),

(b) sin(cos−1 53 ).

c
2020 School of Mathematics and Statistics, UNSW Sydney
6.5. APPLICATIONS TO THE TRIGONOMETRIC FUNCTIONS 123

Solution. (a) We cannot write sin−1 (sin 5π

4 )=
5π
4 , since 5π
4 is not in the interval [− π2 , π2 ]. However,
since sin−1 is an odd function,

5π 1 1 π
sin−1 sin = sin−1 − √ = − sin−1 √ =− .
4 2 2 4

(b) Let θ denote cos−1 35 . Then we seek sin θ. By the trigonometric identity

sin2 θ + cos2 θ = 1,

we have p
sin θ = ± 1 − cos2 θ,

which simplifies to
s 2
3 4
sin θ = ± 1− =± .
5 5
3
Since θ = cos−1 5 > 0 we have sin θ > 0 and hence

4
sin θ = + ,
5

completing the question.

An alternate solution to (b) is the following. Once again, let θ denote the angle cos−1 53 . This
means that cos θ = 53 , whereupon we construct the right-angled triangle shown below.

5
4

θ
A 3 C

By Pythagoras’ theorem, the length of side AB is 4. Hence sin θ = 54 .

The final example uses theory from all chapters covered in the course so far.

Example 6.5.3. A man and a woman sit on opposite edges of a canyon which is 50 metres wide
and 100 metres deep. The man drops a small stone from rest into the canyon. The woman watches
the stone fall to the bottom of the canyon through a telescope. At what time during the stone’s
fall will the angle that the telescope makes with the horizontal change the fastest? (That is, when
will the angular velocity of the telescope be greatest?) Assume that the air resistance experienced
by the stone is negligible.

Solution. Let x(t) denote the distance (in metres) that the stone has travelled exactly t seconds
after the stone was dropped. Let θ(t) denote the angle that the telescope makes with the horizontal
at t seconds. Let g denote acceleration due to gravity (g ≈ 9.8 ms−2 ).

c
2020 School of Mathematics and Statistics, UNSW Sydney
124 CHAPTER 6. INVERSE FUNCTIONS
man woman
θ(t)

x(t)

100 m

r
stone

50 m

If v(t) denotes the velocity of the stone at time t, then we have

dx
= v, x(0) = 0, (6.10)
dt
dv
= g, v(0) = 0. (6.11)
dt
From (6.11) we obtain v(t) = gt + C1 . By imposing the initial condition that v(0) = 0 we see that
C1 = 0. Hence v(t) = gt and substituting this into (6.10) gives

dx
= gt, x(0) = 0.
dt
It follows that x(t) = 12 gt2 + C2 and one easily shows that C2 = 0.
To find out when the stone hits the canyon floor, we solve the equation
1
100 = gt2
2
p
for t. Hence t = 200/g ≈ 4.5. So it takes about 4.5 seconds for the stone to hit the canyon floor.
By examining the triangle in the above diagram, we see that

x(t) gt2
tan θ = = .
50 100
dθ
To solve the problem, we need to determine the value of t when dt is a maximum. Now

−1 gt2
θ = tan
100

and so
dθ 1 gt
= 2 ×
dt gt2 50
1+ 100

by the derivative of tan−1 and the chain rule. This simplifies to

dθ 200gt
= 4 .
dt 10 + g2 t4

c
2020 School of Mathematics and Statistics, UNSW Sydney
6.6. MAPLE NOTES 125

For simplicity, denote dθ

dt by ω. We need to find when ω attains its maximum on the interval
p
[0, 200/g] (that is, over the time interval when the stone is falling). To locate the critical points
of ω, we calculate dω
dt :

dω d2 θ
= 2
dt dt
(104 + g2 t4 )200g − 200gt(4g2 t3 )
=
(104 + g 2 t4 )2
(10 )200g − 600g3 t4
4
= .
(104 + g2 t4 )2

The stationary points of ω occur when

(104 )200g − 600g3 t4 = 0,

that is, when

104
t4 = .
3g2

p p
Since t ≥ 0, we obtain t = 10/ 4 3g2 ≈ 2.42. So the critical points of ω on [0, 200/g] are

p p
0, 10/ 4 3g2 and 200/g.

The value of ω at each of these points is calculated below:

p p
ω(0) = 0, ω(10/ 4 3g 2 ) ≈ 1.10, ω( 200/g) ≈ 0.26.

p
Clearly ω(t) attains its maximum when t = 10/ 4 3g2 .
p
In conclusion, the angular velocity of the telescope is greatest 10/ 4 3g2 (or about 2.26) seconds
after the stone is dropped. At this time, the angular velocity is about 1.10 radians per second (that
is, about 62.9 degrees per second).

6.6 Maple notes

Maple knows about the inverse trigonometric functions. The functions sin−1 , cos−1 , tan−1 , cosec−1 ,
sec−1 , cot−1 are written respectively as: arcsin, arccos, arctan, arccsc, arcsec and arccot.
For example,

> plot([arctan(x), arccsc(x)], x=-10..10, y=-2..2, linestyle=[dash,solid]);

c
2020 School of Mathematics and Statistics, UNSW Sydney
126 CHAPTER 6. INVERSE FUNCTIONS

> # Note the gap around zero in the graph of arccsc(x) --- please explain!

c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 6 127

Problems for Chapter 6

Problems 6.1 : Some preliminary examples

1. [R] [V] Suppose
√ that the functions
√ f : [0, ∞) → [1, ∞) and g : [1, ∞) → [0, ∞) are given
by f (x) = 1 + x2 and g(x) = x2 − 1.

a) By calculating (f ◦ g)(x) and (g ◦ f )(x), verify that g is the inverse function to f .

b) What are the domains of f ◦ g and g ◦ f ?

2. [R]

a) Suppose that f : R → R is given by f (x) = 3x + 1. Find f −1 (x). Sketch the graph

of f and the graph of its inverse function, f −1 , on the same diagram.
b) The function g : (−∞, 0] → R is defined by g(x) = x2 +1. Write down the domain and
range of the inverse function g−1 and find a formula for g−1 (x). Find the derivative
of g−1 .

Problems 6.2 : One-to-one functions and

6.3 : Inverse functions

3. [R] Show that the function f : R → R, given by f (x) = x3 +3x+1, has an inverse function
whose domain is R.

4. [R] [V] Suppose that f : R → R is given by f (x) = 4x + cos x.

a) Show that f has an inverse function g.

b) By using the inverse function theorem, find g ′ (2π).

5. [R] Suppose that f : R → R is defined by f (x) = x3 − 3x + 1.

a) Show that f : R → R is not a one-to-one function.

b) Find all possible intervals I of R, each as large as possible, such that the restricted
function f : I → R has an inverse. What is the domain of each of corresponding
inverse function?

6. [H]

a) Can you find a quadratic function from R to R which is one-to-one?

b) Can you find a cubic function from R to R which is not one-to-one?

Problems 6.4 : The inverse function theorem

7. [H] For each function f : R → R given below, find all possible intervals I of R, each as
large as possible, such that the restricted function f : I → R is one-to-one. State the
range of each restricted function f : I → R. What can you say about existence, domain
of definition, continuity and differentiability of the corresponding inverse functions?

c
2020 School of Mathematics and Statistics, UNSW Sydney
128 CHAPTER 6. INVERSE FUNCTIONS

a) f (x) = x(x2 − 1)(x + 2)

b) f (x) = (x + 1)17
c) f (x) = |x| |x + 1|

Problems 6.5 : Applications to the trigonometric functions

8. [R] Simplify each expression without using a calculator.

√
a) sin−1 ( 3/2) b) cos(cos−1 (2/5)) c) sin−1 (sin(5π/3))
d) cos−1 (cos(−π/3)) e) cos(sin−1 (3/5)) f) sin(tan−1 (3/5))
π 3π
g) sec−1 (2) h) sin−1 (sin x) when 2 ≤x≤ 2

9. [R] Sketch the graph of f : [1, 3] → R, where f (x) = cos−1 (x − 2).

10. [R] Show that

d −1 d 1
a) cos−1 x = √ b) tan−1 x = .
dx 1 − x2 dx 1 + x2

11. [R] Differentiate

√
a) cos−1 (2x) b) sin−1 x c) tan−1 (2x − 3).

12. [R] Prove that sin−1 x + cos−1 x is constant. For what values of x is this valid and what
is the constant?

13. [H] [V] Suppose that f (x) = tan−1 x + tan−1 (1/x) whenever x 6= 0.
a) Show that f ′ (x) = 0 whenever x 6= 0.
b) Hence evaluate f on the intervals (0, ∞) and (−∞, 0).
c) How do you account for this result geometrically?

14. [H]
a) Draw the graph of cosec x.
b) Show that cosec restricted to the interval (0, π2 ] has an inverse function. Sketch the
graph of the inverse and calculate its derivative.

15. [X]
a) Show that 2 tan−1 2 = π − cos−1 (3/5).
b) Show that cos−1 (1 − 2x2 ) = 2 sin−1 x whenever 0 ≤ x ≤ 1.
c) Suppose that q(x) = cos−1 (1 − 2x2 ). Is q differentiable at 0?

16. [H] A function f : R → R is defined by

(
x tan−1 √1x if x > 0
f (x) =
ax + b if x ≤ 0,

c
2020 School of Mathematics and Statistics, UNSW Sydney
6.6. MAPLE NOTES 129

where a and b are real numbers. Find all values of a and b such that f is differentiable
at 0.

17. [H] A lighthouse containing a revolving beacon is located 3 km from P , the nearest point
on a straight shoreline. The beacon revolves with a constant rotation rate of 4 revolutions
per minute and throws a spot of light onto the shoreline. How fast is the spot of light
moving when it is (a) at P and (b) at a point on the shoreline 2 km from P ?

18. [H] [V] A picture 2 metres high is hung on a wall with its bottom edge 6 metres above
the eye of the viewer. How far from the wall should the viewer stand for the picture to
subtend the largest possible vertical angle with her eye?

c
2020 School of Mathematics and Statistics, UNSW Sydney
130 CHAPTER 6. INVERSE FUNCTIONS

c
2020 School of Mathematics and Statistics, UNSW Sydney
131

Chapter 7

Curve sketching

There are a variety of ways to describe curves that lie in the plane. In this chapter we study curves
which are described by using
√
• a Cartesian equation (for example, y = x2 or y = 1 − x2 ),

• parametric equations, and

• polar coordinates.

Each section of this chapter is devoted to one of these methods. The use of Cartesian equations
will already be familiar. However, many curves cannot be easily described by Cartesian equations.
The following diagrams illustrate two curves that are better described by a parameter or with polar
coordinates.

y y

An example of a parametric curve: An example of a polar curve:


r = | cos(4θ)|
x(t) = p
 sin t cos t ln |t|
y(t) = |t| cos t


t ∈ [−1, 1], t 6= 0

7.1 Curves defined by a Cartesian equation

(Ref: SH10 §§4.7, 4.8)

c
2020 School of Mathematics and Statistics, UNSW Sydney
132 CHAPTER 7. CURVE SKETCHING

In this section we survey techniques for sketching curves that are described by a Cartesian equation
of the form y = f (x).

7.1.1 A checklist for sketching curves

Students will already be familiar with techniques for sketching the graph of a function f . For
convenience we summarise some of the most useful here.
• Identify the domain of f . (In particular, Dom(f ) should not include any points that lead
to division by 0 or the square root of a negative number.)
• Identify any symmetries:
f is odd if f (−x)= −f (x) ∀x ∈ Dom(f )
f is even if f (−x) = f (x) ∀x ∈ Dom(f ).

• Find x- and y-axis intercepts.

• Identify vertical asymptotes.
• Examine the behaviour of f (x) as x → ±∞. (This will identify any other asymptotes that
exist).
• Use calculus to identify stationary points and other features, if necessary.

7.1.2 Oblique asymptotes

Consider the graph of the function f that is sketched below.
y
y = ax + b

y = f (x)
x

It appears that the graph of f approaches the line y = ax + b as x → ∞. If this is indeed the case,
we say that f is asymptotic to the line. Since this line is neither vertical, nor horizontal, we call it
an oblique asymptote to the function f .

Definition 7.1.1. Suppose that a and b are real numbers and that a 6= 0. We say
that a straight line, given by the equation
y = ax + b,
is an oblique asymptote for a function f if

lim f (x) − (ax + b) = 0.
x→∞

c
2020 School of Mathematics and Statistics, UNSW Sydney
7.1. CURVES DEFINED BY A CARTESIAN EQUATION 133

Example 7.1.2. Suppose that f is defined by the equation

2
f (x) = x + 1 +
x−2
whenever x 6= 2. Identify all asymptotes to f and hence sketch its graph.
Solution. Clearly f (x) → ∞ as x → 2+ , therefore there is a vertical asymptote when x = 2.
To identify the other asymptotes (if any), we examine the behaviour of f (x) as x → ∞ and as
x → −∞.
2
Now as x → ∞, x−2 → 0 from above. Hence we conclude that f (x) approaches x + 1 from
above. Therefore the line y = x + 1 is an oblique asymptote to f . (The calculation
2
lim f (x) − (x + 1) = lim =0
x→∞ x→∞ x−2
demonstrates from Definition 7.1.1 that this is the case.)
2
On the other hand, as x → −∞, x−2 → 0 from below. Therefore f (x) approaches x + 1 from
below.
The asymptotes for f help us construct the sketch (which is pictured below). To complete the
sketch, we identify the axes intercepts. For the y-axis, we have
f (0) = 0.
For the x-axis,
f (x) = 0 ⇒ (x + 1)(x − 2) + 2 = 0 ⇒ x2 − x = 0 ⇒ x = 0, 1.
Thus we have the following sketch:
y y = f (x)

y =x+1

1
x
x=2

(Note that we have obtained sufficient information to make a useful sketch of the graph without
resorting to calculus. However, calculus would help to refine this by locating the turning points
and confirming where the graph is increasing or decreasing.)

If f is a rational function of the form p/q, where p and q are polynomials and deg(p) > deg(q),
then the asymptotic behaviour of f may be determined by polynomial division.
Example 7.1.3. Suppose that f is defined by the equation
x2 − 4
f (x) =
x+1
whenever x 6= −1. Identify all asymptotes to f and hence sketch its graph.

c
2020 School of Mathematics and Statistics, UNSW Sydney
134 CHAPTER 7. CURVE SKETCHING

Solution. As in the previous example, clearly there is a vertical asymptote when x = −1. It is also
clear that f (x) → ∞ as x → ∞, but the precise asymptotic behaviour of f (x) as x → ∞ is still
not clear. However, polynomial division gives

x−1
x+ 1 ) x2 −4
x2 + x
−x − 4
−x − 1
−3.

Hence
3
f (x) = x − 1 − . (7.1)
x+1
(See Remark 7.1.4 for an alternative method of establishing (7.1).)
3
Since x+1 → 0 from above as x → ∞, we conclude that f (x) approaches x − 1 from below as
x → ∞. A similar observation shows that f (x) approaches x − 1 from above as x → −∞. So the
line y = x − 1 is an oblique asymptote to the graph of f at both ∞ and −∞.
Finally, we calculate the axes intercepts:

f (0) = −4

and
f (x) = 0 ⇒ x2 − 4 = 0 ⇒ x = ±2.

Hence we obtain the following sketch.

y =x−1

−2 2 x

y = f (x) −4

x = −1

(Again, we have obtained sufficient information for a useful sketch of the graph without the use of
calculus.)

c
2020 School of Mathematics and Statistics, UNSW Sydney
7.1. CURVES DEFINED BY A CARTESIAN EQUATION 135

Remark 7.1.4. We give a method for establishing equation (7.1) that uses an algebraic ‘trick’
rather than polynomial ‘long division’:
x2 − 4
f (x) =
x+1
x2 − 1 − 3
=
x+1
(x − 1)(x + 1) − 3
=
x+1
3
= (x − 1) − .
x+1

7.1.3 Examples
We complete this section with three examples.
Example 7.1.5. Sketch the graph of f , where f is defined by
2
f (x) = xe−x .
Solution. It is clear that Dom(f ) = R. Moreover,
2
f (−x) = −xe−x = −f (x) ∀x ∈ R,
so f is an odd function. Therefore, it suffices to consider x ≥ 0; the rest of the graph can be
obtained by symmetry.
2
Since e−x > 0, it is easy to see that the only axis intercept occurs at (0, 0). We look now at
the behaviour of f (x) as x → ∞. By l’Hôpital’s rule,
x 1
lim f (x) = lim x 2 = lim 2 = 0.
x→∞ x→∞ e x→∞ 2xex
So y = 0 is an horizontal asymptote.
Finally, we use calculus to determine the shape of the curve. By the product rule,
2 2
f ′ (x) = x − 2xe−x + e−x (1)
2
= (1 − 2x2 )e−x .
2
Since e−x > 0 for all x in R, we conclude that
1
f ′ (x) = 0 when x = √ ,
2
while
1 1
f ′ (x) > 0 when 0 ≤ x < √ and f ′ (x) < 0 when x > √ .
2 2
So f is increasing on the interval (0, √12 ), stationary at √1 and decreasing on ( √1 , ∞). Now
2 2
√ 1 1
f (1/ 2) = √ e−1/2 = √ ≈ 0.429.
2 2e
Moreover, f ′ (0) = 1, implying that the tangent line to the graph at the origin has a gradient of 1.
This information can be used to sketch the graph when x ≥ 0, and the fact that f is odd allows
us to deduce the shape of the graph when x < 0.

c
2020 School of Mathematics and Statistics, UNSW Sydney
136 CHAPTER 7. CURVE SKETCHING
f (x)

√1

|
2e
−2 − √12
| | | |

√1 2 x
2
− √12e

|
Example 7.1.6. Sketch the graph of f , where f is defined by

3
f (x) = x2 − 4 + .
x2

The next function is important since it is later used to define the Si function (see Example
8.12.1). The Si function is used by electrical engineers for digital signal processing and by surveyors
using GPS (Global Positioning System).

Example 7.1.7. Sketch the graph of f , where f is defined by


 sin x if x 6= 0
f (x) = x
1 if x = 0.

7.2 Parametrically defined curves

(Ref: SH10 §§10.5, 10.6)
Many moving bodies trace out a path that lies in a plane. An important problem is to describe
the curve of the trajectory. We give an example.

Example 7.2.1. Suppose that a stone is thrown horizontally from the deck of Sydney Harbour
Bridge at 20 metres per second, and that the air resistance experienced by the stone is negligible.
Let x(t) denote the horizontal distance (in metres) travelled by the stone t seconds after being
thrown and let y(t) denote its height (in metres) above the water. The deck is 50 metres above sea
level.

deck
b
stone
x(t)

50m
y(t)

water

c
2020 School of Mathematics and Statistics, UNSW Sydney
7.2. PARAMETRICALLY DEFINED CURVES 137

y = f (x)

√ √
−43 1 3
| √
2 3−4 x

y = x2 − 4
−4

Figure 7.1: Sketch for Example 7.1.6.

c
2020 School of Mathematics and Statistics, UNSW Sydney
138 CHAPTER 7. CURVE SKETCHING

If we take the acceleration due to gravity to be 10 m/s2 , then Newton’s laws of motion show that

x(t) = 20t and y(t) = 50 − 5t2 (7.2)

√
when 0 < t < 10 (that is, after the stone
√ is thrown and just before it hits the water). Sketch the
trajectory of the stone when 0 < t < 10.

Solution. We want to express y as a function of x. By (7.2) we have

x(t)
t= and y(t) = 50 − 5t2 .
20
If we substitute the equation on the left into the equation on the right then

x(t) 2
y(t) = 50 − 5
20
which becomes
x2
y = 50 − .
80
√
So the trajectory is a parabola. It is easy to see that y = 50 when x = 0 and that x = 20 10 when
y = 0. We therefore obtain the following sketch.
y

√ x
20 10

Remark 7.2.2. The equation

x(t), y(t) = 20t, 50 − 5t2 , t∈R

is called a parametrisation of the parabola

x2
y = 50 − .
80
The variable t is called a parameter.

Parametrisations of curves are important to study for several reasons.

• They arise naturally with moving bodies, as illustrated in Example 7.2.1. In this con-
text they also contain more information than a corresponding Cartesian equation (since a
Cartesian equation only records the trajectory of the body, not the position of the body
at any given time).

c
2020 School of Mathematics and Statistics, UNSW Sydney
7.2. PARAMETRICALLY DEFINED CURVES 139

• Many curves that cannot be described by an equation of the form y = f (x) can be described
parametrically.

• Even if a curve is the graph of a function, it may be much easier to describe the curve
parametrically than to write down an equation describing the function. (This is the case
with the cycloid, which will be studied in Example 7.2.7.)

• Curves in three (or higher) dimensional space are most easily described using parameters.

7.2.1 Parametrisation of conic sections

The trajectories of many heavenly bodies (such as planets and comets) can be modelled as conic
sections. In this section we will give some standard parametrisations of conic sections.
Example 7.2.3. Show that the curve given by the functions

x(t) = a cos t, y(t) = b sin t, 0 ≤ t ≤ 2π (7.3)

is an ellipse. Sketch the curve, showing how the point (x(t), y(t)) moves as t varies from 0 to 2π.
Solution. Suppose that t ∈ [0, 2π]. From (7.3) we see that

x(t)2 y(t)2
= cos2 t, = sin2 t.
a2 b2
Since cos2 t + sin2 t = 1,
x(t)2 y(t)2
+ 2 = cos2 t + sin2 t = 1.
a2 b
Hence the point (x(t), y(t)) lies on the ellipse

x2 y 2
+ 2 = 1.
a2 b
(Conversely, any point (x, y) that satisfies this equation can be expressed as x = a cos t, y = b sin t
for some t in R.)
In fact, it is easy to see that, as t moves from 0 to 2π, (x(t), y(t)) traces out the entire ellipse.
This is illustrated in the diagram below.

y(t)
π
t= 2
π b
π
2 <t<π b 0<t< 2

t=π t = 0, 2π
b b

−a a
x(t)

π<t< 3π −b b
3π
< t < 2π
2 2
3π
t= 2

c
2020 School of Mathematics and Statistics, UNSW Sydney
140 CHAPTER 7. CURVE SKETCHING

The table below lists some commonly used parametrisations of conic sections.

Conic section Cartesian equation Parametric equation

Parabola 4ay = x2 x(t) = 2at

y(t) = at2

Circle x 2 + y 2 = a2 x(t) = a cos t

y(t) = a sin t
x2 y2
Ellipse + =1 x(t) = a cos t
a2 b2
y(t) = b sin t
x2 y2
Hyperbola − =1 x(t) = a sec t
a2 b2
y(t) = b tan t

There are also other parametrisations for each of these curves. (We give a second parametrisation
of the hyperbola in Chapter 10.)
In Examples 7.2.1 and 7.2.3, we sketched a parametrically defined curve by first eliminating the
parameter t. It is not always possible to do this, as the next example illustrates. Such curves can
be sketched with the aid of a table of values.

Example 7.2.4. Sketch the curve defined parametrically by

x(t), y(t) = (t2 cos t, t2 sin t), 0 ≤ t ≤ 3π. (7.4)

Solution. We begin by noting that the above parametric equation is similar to the parametrisation

x(t), y(t) = (a cos t, a sin t) (7.5)

of the circle. The only difference is that the points in (7.5) have a fixed distance a from the origin,
whereas the points in (7.4) have a variable distance t2 from the origin. So intuitively, as t increases,
the point (x(t), y(t)) in (7.4) rotates about origin, while at the same time getting further away from
the origin. Therefore it is likely that the curve is a spiral.
The following table of values helps verify this reasoning.

π 3π 5π
t 0 2 π 2 2π 2 3π
x 0 0 −π 2 0 4π 2 0 −9π 2
π2 9π 2 25π 2
y 0 4 0 4 0 4 0

We plot these points on the Cartesian plane and interpolate between them to produce the following
sketch.

c
2020 School of Mathematics and Statistics, UNSW Sydney
7.2. PARAMETRICALLY DEFINED CURVES 141
y(t)
b

25π 2
4

π2
4
−π 2 b
b b b b
x(t)
−9π 2 4π 2

b
2
− 9π4

7.2.2 Calculus and parametric curves

Given a smooth curve defined using a parameter t, one would like to calculate its gradient at any
point along the curve. The following proposition enables us to do this without having to eliminate
t.

Proposition 7.2.5. Suppose that x and y are both differentiable functions of t and that y is a
function of x. If x′ (t) =
6 0 then
dy
dy dt y ′ (t)
= dx = ′ . (7.6)
dx dt
x (t)

Proof. Since y is a function of x, we write y = f (x). Hence

y(t) = f (x(t)).

Differentiating both sides with respect to t gives

df
y ′ (t) = (x(t)) x′ (t)
dx
by the chain rule. Since x′ (t) 6= 0,
df y ′ (t)
(x(t)) = ′
dx x (t)
which is equivalent to (7.6).

Example 7.2.6. A curve is described parametrically by

x(t), y(t) = (t2 − 1, t3 + 2t), t ∈ R.

Find the equation of the tangent to the curve at the point (0, 3).

c
2020 School of Mathematics and Statistics, UNSW Sydney
142 CHAPTER 7. CURVE SKETCHING

Solution. By Proposition 7.2.5,

dy y ′ (t)
= ′
dx x (t)
3t2 + 2
= (7.7)
2t
provided that t 6= 0.
It is easy to see that
t2 − 1 = 0 and t3 + 2t = 3
only when t = 1. By substituting this into (7.7), we find that the gradient of the curve at (0, 3) is
5/2. Hence the equation of the tangent at (0, 3) is given by
5x
y= + 3.
2

7.2.3 The cycloid and curve of fastest descent

Suppose that a point A is higher than a point B. What curve, starting at A and ending at B,
would enable a particle moving under the influence of gravity (and ignoring friction) to move from
A to B in the shortest time?

A b

b
B

Curve of fastest descent.

Such a curve is known as a curve of fastest descent or a brachistochrone (which, in Greek, means
‘shortest time’).
In 1638 Galileo stated that the curve of fastest descent is the arc of a circle. This was shown to
be false by Johann Bernoulli who found the correct description of the curve. In 1696 he issued a
public challenge for others to find this description. Four mathematicians responded with a correct
answer: Jakob Bernoulli (Johann’s brother), Gottfried Leibniz, Isaac Newton and Guillaume de
l’Hôpital. The curve is the arc of a cycloid.
Example 7.2.7 (The cycloid). A circle of radius r rolls along the x-axis, starting from the origin
as shown below. Describe the locus (x(t), y(t)) of the point P on the edge of the circle that satisfies
(x(0), y(0)) = (0, 0)
y
P
P
P
P x
The cycloid.

c
2020 School of Mathematics and Statistics, UNSW Sydney
7.2. PARAMETRICALLY DEFINED CURVES 143

Solution. The solution will be built up over three steps, of which the first two correspond to the
diagrams below.
y y

θ rt

t x t x

P (x, y) P (x, y)

rt rt y = −r
Step 1. Step 2.

Step 1. First we consider a circle of radius r centred at the origin. If the circle rotates about
its centre by t radians in a clockwise direction (as shown) then

x(t) = r cos θ = r cos( 3π
2 − t) = r cos 3π
2 cos t + sin 3π
2 sin t = −r sin t,
3π 3π 3π

y(t) = r sin θ = r sin( 2 − t) = r sin 2 cos t − cos 2 sin t = −r cos t

and the point P sweeps out a distance of rt.

Step 2. We now consider what happens when the circle rolls along the line y = −r. If the circle
rotates by t radians, then the centre of the circle moves to the right by rt (corresponding to the
distance that P sweeps out – see Step 1). Hence

x(t) = −r sin t + rt.

By Step 1 we still have

y(t) = −r cos t.
Step 3. We retain the same situation as in Step 2, except that we shift the circle up by r (so
that the circle rolls along the line y = 0 rather than along the line y = −r. Thus

y(t) = −r cos t + r.

By Step 2 we still have

x(t) = −r sin t + rt.
Final solution. The locus of the curve is given by

x(t) = r(t − sin t),

y(t) = r(1 − cos t)

where t ≥ 0.

Remark 7.2.8. The curve of fastest descent from A to B is the unique arc of an (inverted) cycloid
whose tangent at A is vertical.

c
2020 School of Mathematics and Statistics, UNSW Sydney
144 CHAPTER 7. CURVE SKETCHING

7.3 Curves defined by polar coordinates

(Ref: SH10 §§10.2, 10.3)
In this final section, we introduce a different coordinate system for the plane. Many equations in
this system correspond to beautiful curves.

7.3.1 Polar coordinates

Every point P in plane can be specified by

• (x, y), where x is the horizontal signed distance of P from the origin, and y is the vertical
distance, or

• (r, θ), where r is the distance of P from the origin and θ is the angle (taken in the
anticlockwise direction) between OP and the positive horizontal axis.

x b
P b
P
r
y
θ
O O

The pair (x, y) is called the Cartesian coordinates of P and the pair (r, θ) are polar coordinates of
P . (If P is the origin then r = 0 and θ is not defined.)
The polar coordinates (r, θ) of P are related to the Cartesian coordinates (x, y) of P by the
formulae p
x = r cos θ r = x2 + y 2
y (7.8)
y = r sin θ tan θ = , provided x 6= 0.
x
This is easily seen using trigonometry, Pythagoras’ theorem and the diagram below.

P (x, y)
b

r
r sin θ
θ
r cos θ

When converting between the polar coordinates and Cartesian coordinates, it is best to draw a
diagram.

Example 7.3.1. Suppose that the polar coordinates of P and Q are given by (2, π4 ) and (3, 7π
6 )
respectively. Find the Cartesian coordinates for P and Q.

Solution. First we plot these points on the Cartesian plane.

c
2020 School of Mathematics and Statistics, UNSW Sydney
7.3. CURVES DEFINED BY POLAR COORDINATES 145

y
P (2, π4 )

2
13π
6 π
4
π x
6

3
7π
Q(3, 6 )

From the diagram, it is clear that the Cartesian coordinate of P is given by

π π 2 2 √ √
2 cos , 2 sin = √ ,√ = ( 2, 2),
4 4 2 2
while the Cartesian coordinate of Q is given by

π π √ 3
−3 cos , −3 sin = − 3, − .
6 6 2

√
Example 7.3.2. Suppose that the Cartesian coordinate of R is given by (−2, 2 3). Find polar
coordinates for R.

Solution. We begin with the following diagram.

√
R(−2, 2 3) y

√ r
2 3
θ
α
2 x

√
2 3
√
In the diagram, tan α = 2 = 3, so α = π3 . Hence

2π
θ =π−α= .
3
By Pythagoras’ theorem, √
r 2 = 22 + (2 3)2 = 16.
Hence in polar coordinates, R can be written as (4, 2π
3 ).

Remark 7.3.3. Note that the point R does not have unique polar coordinates. For example

(4, 2π
3 + 2π) and (4, 2π
3 − 2π)

are also polar coordinates for R. In practice, we often choose the value for θ such that −π < θ ≤ π.

c
2020 School of Mathematics and Statistics, UNSW Sydney
146 CHAPTER 7. CURVE SKETCHING

7.3.2 Basic sketches of polar curves

Many curves can be described by an equation relating the polar variables r and θ. We begin with
the simplest case: when either r or θ are constant.

Example 7.3.4. Sketch the curves corresponding to the the equations

(a) r = 3
3π
(b) θ = 4 .

Solution. (a) Since r = 3, we seek all points in the plane whose distance from the origin is 3. This
is a circle of radius three centred at (0, 0).
(b) This is the collection of all points P such that the angle between OP and the positive x-axis
is 3π
4 . Hence the curve is the ray sketched below with the origin excluded.

y y
3π
3 4

3 x x

r=3 θ= 3π
4

To sketch a curve, described by the polar equation r = f (θ), in the xy-plane, it may be helpful
to first sketch r = f (θ) in the rθ-plane.

Example 7.3.5. Sketch the curve whose polar equation is

r = 2 − cos θ.

Solution. First we graph r against θ.

r
e
3 b
|

√1
b b
2+
|

2 d f r = 2 − cos θ
2 b b
|

c g
2− √1
2 b b
|

b h
1 a b

i
b

The r-value of each point a, b, c, ..., i in the rθ-plane corresponds to the distance from the origin to
each point a′ , b′ , ..., i′ in the xy-plane.

c
2020 School of Mathematics and Statistics, UNSW Sydney
7.3. CURVES DEFINED BY POLAR COORDINATES 147
y
θ = π2
θ = 3π
4 θ= π
4
b c′ r = 2
b

d′
b′ b

r=3 a′ r = 1 θ=0
θ=π b b

e ′ i′ x
b

h′

b
f′ g′ r = 2
b

5π 7π
θ= 4 θ= 4
3π
θ= 2
Our task now is to interpolate between the points a′ , b′ , ..., i′ to obtain a sketch of the curve.
Consider what happens to r as θ moves from 0 to 2π.

• On the rθ-plane, as θ moves from 0 to π, r increases from 1 to 3. So on the xy-plane, if

we travel along the curve in an anticlockwise direction from the positive to the negative
x-axis, then the distance r from the origin to points on the curve increases from 1 to 3.

• On the rθ-plane, as θ moves from π to 2π, r decreases from 3 to 1. So on the xy-plane,

if we travel along the curve in an anticlockwise direction from the negative to the positive
x-axis, then the distance r from the origin to points on the curve decreases from 3 to 1.

These considerations lead to the final sketch.

y
3π π
θ= 4 θ= 4
b
2
b

−3 b b

1 x
b

b
b

5π −2 7π
θ= 4 θ= 4

r = 2 − cos θ

We list a few more helpful tips for sketching a curve described by the polar equation r = f (θ).

• If f is an even function (that is, f (−θ) = f (θ)) then the polar curve is symmetric about
the x-axis.

c
2020 School of Mathematics and Statistics, UNSW Sydney
148 CHAPTER 7. CURVE SKETCHING

• If f (π − θ) = f (θ) then the polar curve is symmetric about the y-axis.

• If f is 2π–periodic then it suffices to consider θ in the range 0 ≤ θ < 2π.

• Sometimes it is possible to rewrite the equation r = f (θ) in terms of x and y only.
The use of calculus will be discussed in the next subsection.
Example 7.3.6. Sketch the curve that is described by the polar equation

r = 2 sin θ, 0 ≤ θ ≤ π.

Solution. Graphing r against θ, followed by y against x leads to the following two diagrams.
r y
r=2
2 b
θ= 43π
θ = π4
b
|

√2 b b
|

2
b b
√ √
r= 2 r= 2

b | | | |b b

π π 3π π θ r=0 x
4 2 4

r = 2 sin θ r = 2 sin θ
The curve looks as though it is a circle, an observation that can be confirmed by rewriting r = 2 sin θ
in terms of x and y:

r = 2 sin θ
r 2 = 2r sin θ (multiply by r)
2 2
x + y = 2y (by (7.8))
x2 + y 2 − 2y + 1 = 1 (completing the square)
2 2
x + (y − 1) = 1.

Therefore the polar curve is a circle with Cartesian centre (0, 1) and radius 1.

Example 7.3.7. Sketch the curve that is described by the polar equation

r = 4| sin(3θ)|.

7.3.3 Sketching polar curves using calculus

Suppose that a curve can be expressed in polar form as r = f (θ). Since x = r cos θ and y = r sin θ,
the curve can also be written in parametric form (with parameter θ) as

x(θ) = f (θ) cos θ and y(θ) = f (θ) sin θ.

Hence,
dy
dy dθ y ′ (θ)
= dx
=
dx dθ
x′ (θ)
f (θ) cos θ + f ′ (θ) sin θ
=
−f (θ) sin θ + f ′ (θ) cos θ

c
2020 School of Mathematics and Statistics, UNSW Sydney
7.3. CURVES DEFINED BY POLAR COORDINATES 149

by the product rule. Since r = f (θ), this is usually written as

dr
dy r cos θ + dθ sin θ
= dr
.
dx −r sin θ + dθ cos θ

(Rather than memorising this formula, we recommend that students understand how it is obtained.)
Knowing the value of derivative of the curve at various points allows for greater accuracy in
sketching. We illustrate with one example.

Example 7.3.8. Sketch the curve described by the polar equation r = 1 − sin θ.

Solution. First we graph r against θ and then give a preliminary sketch of y against x.

r θ= 3π y θ= π
4 4
2 b
|

√1
b b
1+
|

2
−1 b b
1
1 b b b b b b
|

x
1− √1
2 b b
|

b b
| |b | | | | | |
π π 3π π 5π 3π 7π 2π θ
4 2 4 4 2 4 5π b 7π
θ= 4 −2 θ= 4

r = 1 − sin θ r = 1 − sin θ
dy
To obtain a clearer idea of the behaviour of the curve, we calculate dx . We have

x = r cos θ = (1 − sin θ) cos θ

and
y = r sin θ = (1 − sin θ) sin θ.
Hence

x′ (θ) = cos θ(− cos θ) + (1 − sin θ)(− sin θ)

= − cos2 θ − sin θ + sin2 θ
= 2 sin2 θ − sin θ − 1
= (2 sin θ + 1)(sin θ − 1)

and

y ′ (θ) = cos θ − 2 sin θ cos θ

= cos θ(1 − 2 sin θ).

Thus
dy y ′ (θ) cos θ(1 − 2 sin θ)
= ′ = .
dx x (θ) 2 sin2 θ − sin θ − 1
To locate horizontal and vertical tangents, we examine where

x′ (θ) = 0 and y ′ (θ) = 0.

c
2020 School of Mathematics and Statistics, UNSW Sydney
150 CHAPTER 7. CURVE SKETCHING

Solving y ′ (θ) = 0 gives

1
cos θ = 0 or sin θ = ,
2
that is,
π 3π π 5π
θ= , , , .
2 2 6 6
Solving x′ (θ) = 0 gives
1
sin θ = 1 or sin θ = − ,
2
that is
π 7π 11π
θ= , , .
2 6 6
So horizontal tangents occur when
3π π 5π
θ= , ,
2 6 6
and vertical tangents occur when
7π 11π
θ= , .
6 6
π
(Understanding how the derivative behaves when θ is close to 2 is harder since

x′ ( π2 ) = y ′ ( π2 ) = 0.

A full analysis is given in Remark 7.3.9.)

dy dy
Finally, we observe that dx = −1 when θ = 0 and that dx = 1 when θ = π.
Hence we obtain the following sketch. (The abbreviations HT and VT, which appear in the
diagram, stand for ‘horizontal tangent’ and ‘vertical tangent.’)

5π
y π
θ= 6
θ= 6

HT HT
−1 1
x
gradient of tangent is 1 gradient of tangent is −1

VT VT
7π 11π
θ= 6 θ= 6

−2 HT

r = 1 − sin θ

c
2020 School of Mathematics and Statistics, UNSW Sydney
7.4. MAPLE NOTES 151

Remark 7.3.9. (MATH1141 only) In the above example, the curve has a ‘vertical cusp’ at the
dy
origin. This is seen by analysing dx when θ is close to π2 . By l’Hôpital’s rule we see that

x′ (θ) 2 sin2 θ − sin θ − 1

lim = lim
θ→ π2 − y ′ (θ) θ→ π2 − cos θ − 2 sin θ cos θ
4 sin θ cos θ − cos θ
= lim
θ→ π2 − − sin θ − 2(cos θ cos θ − sin θ sin θ)
0
= lim
θ→ π2 − 1
= 0.

Hence
dy y ′ (θ) π−
= ′ →∞ as θ→ .
dx x (θ) 2
By symmetry,
dy y ′ (θ) π+
= ′ → −∞ as θ→ .
dx x (θ) 2
π
Therefore the derivative has an infinite discontinuity when θ = 2. Such a point is called a ‘cusp
point.’

Remark 7.3.10. In most cases, a reasonable sketch of a polar curve can be made without appealing
to calculus. As seen in the example above, while employing calculus gives a more accurate sketch,
it also dramatically increases the amount of work and time required.

7.4 Maple notes

It is easy to draw parametric curves using Maple. Maple also knows about polar coordinates, but
you should be aware that Maple allows r < 0. For example,
> # Draw a hyperbola via parametric specification

> plot([3sec(t), 2tan(t), t=-Pi/2..3*Pi/2], x=-10..10, y=-8..8, numpoints=100);

c
2020 School of Mathematics and Statistics, UNSW Sydney
152 CHAPTER 7. CURVE SKETCHING

> # What is the reason for the straight lines through the origin in graph above?
> plot([sin(5*theta), theta, theta=-Pi..Pi], coords=polar);

> plot([2+cos(5*t), t, t=-Pi..Pi], coords=polar);

c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 7 153

Problems for Chapter 7

Problems 7.1 : Curves defined by a Cartesian equation

√
1. [R] Find the maximal domain and range of the function f , given by f (x) = 5 + 4x − x2 ,
and sketch its graph.

2. [R] Write down the period of each of the following functions f (where possible). Determine
which are odd or even. Sketch the graph of each function.
a) f (x) = sin 3x b) f (x) = 1 + sin(2x/3) c) f (x) = x sin x
d) f (x) = tan 3x e) f (x) = cos2 x f) f (x) = sin x + cos x

3. [R] Suppose that f is an odd function (not everywhere zero). Determine whether each
function g below is odd, even or neither.
a) g(x) = x2 f (x) b) g(x) = x3 f (x) c) g(x) = x2 + f (x)
d) g(x) = x3 + f (x) e) g(x) = sin(f (x)) f) g(x) = f (cos x)

4. [R] For each function f , identify any vertical and oblique asymptotes and hence sketch
the graph. (Do not use calculus.)
1 x2 − 2 x3 − 7x + 8
a) f (x) = x + 2 + b) f (x) = c) [H] f (x) =
x−3 x+1 x2 + x − 6

5. [R] Sketch the following curves, showing their main features.

1 x−1 2
a) y = x2 + 2 b) [V] y = c) y = e−x /2
x x−2
x2
d) y = xe−x e) y 2 = x(x − 4)2 f) y =
x−2
2
x −1
g) y = 2 h) y = x cos−1 x
x − 2x

3x2 − 10x + 3
6. [H] (Longer rather than difficult) Suppose that y = .
3x2 + 10x + 3
a) Find the values of x for which y ≥ 0. b) Find the asymptotes.
c) Find the turning points. d) Find the domain and range.
e) Sketch the graph.

Problems 7.2 : Parametrically defined curves

7. [R] Sketch the curves given by the following parametric equations. Also find, where
possible, a Cartesian equation for the curve.
a) x = 4 cos t, y = 5 sin t
b) x = 3 sec t, y = 2 tan t
.
c) x = t3 , y = t2
d) x = et cos t, y = et sin t

c
2020 School of Mathematics and Statistics, UNSW Sydney
154 CHAPTER 7. CURVE SKETCHING

8. [R] For each of the curves given in parametric form by

x=1−t x = 3t + 2 x = cos t
a) b) c)
y =1+t y = t4 − 1 y = sin t,

i) find the points on the curve corresponding to t = −1, 0, 1, and 2;

ii) find any point on the curve where y = 0;
dy
iii) find as a function of t.
dx

9. [R] [V]

t t
a) Find the equation of the normal to the curve x = , y = at the point P
t+1 t−1
when t = 2.

b) Eliminate t from the above equations and find the gradient of the normal at P using
the Cartesian form.

10. [X] A curve is given in terms of the parameter t by x = t3 , y = 3t2 .

a) What is the equation of the curve? Can you sketch it?

b) Show that the equation of the chord joining the points with parameters t1 , t2 is

(t21 + t1 t2 + t22 )y = 3(t1 + t2 )x + 3t21 t22 .

c) Show that the equation of the tangent at t is ty = 2x + t3 .

d) Suppose that P is a point with coordinates (a, b) and that P does not lie on on the
curve or on the y-axis.

i) Show that either one or three tangents may be drawn from P to the given curve.
Illustrate on a sketch the region in which P must lie so that there are three
tangents from P to the curve.
ii) Assume that P lies in this region and let Q1 , Q2 , Q3 denote the points of contact
of the tangents from P to the curve. Show that the centroid of the triangle
Q1 Q2 Q3 is the point (−2a, 2b).

11. [H] Consider a fixed circle of radius 1 centred at the origin and a smaller circle of radius 14
initially centred at ( 43 , 0). The smaller circle rolls (without slipping) around the inside rim
of the larger circle such that the centre Q of the smaller circle moves in an anticlockwise
direction. A point P , fixed on the rim of the smaller circle and initially with coordinates
(1, 0), traces out a curve as the smaller circle moves inside the larger circle.

c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 7 155
y y

x θ P x
O Q P O

Initial configuration Configuration after motion has begun

The goal of this question is to find the Cartesian form of the trajectory of P . Let θ denote
the angle (in radians) between OQ and the positive x-axis, as shown in the above diagram.
⇀
a) Explain why OQ = 43 (cos θ, sin θ).
⇀
b) [X] Explain why QP = 14 (cos(−3θ), sin(−3θ)).
⇀
c) Show that OP = (cos3 θ, sin3 θ).
(You may find techniques from MATH1131 Algebra useful here.)
d) Hence the trajectory of P is given by

x = cos3 θ, y = sin3 θ, 0 ≤ θ ≤ 2π.

By using an appropriate trigonometric identity, eliminate θ to find the cartesian

equation of the trajectory of P .
e) Sketch the curve corresponding to this equation. (This curve is called an astroid after
the Greek word for ‘star’.)

12. [R] [V] In the 1960s two French car engineers, Paul de Casteljau and Pierre Bezier,
independently discovered a remarkable new approach to parameterising curves. Let’s in-
troduce their approach by finding a quadratic curve determined by the control points
A(1, 1), B(2, 2) and C(3, 1) with respecitve coordinate vectors a, b and c.

a) The linear de Casteljau Bezier curve determined by A and B is given parametrically

as
p(t) = (1 − t)a + tb.
This is called the linear interpolation of the vectors a and b. Find a parametric vector
equation for p(t) and evaluate p(0) and p(1).
b) Similarly, the linear de Casteljau Bezier determined by B and C is

q(t) = (1 − t)b + tc.

Find a Cartesian equation for this line, and evaluate q(1/2).

c
2020 School of Mathematics and Statistics, UNSW Sydney
156 CHAPTER 7. CURVE SKETCHING

c) The quadratic de Casteljau Bezier curve r(t) determined A, B and C, is the linear
interpolation of the vectors p(t) and q(t):
r(t) = (1 − t)p(t) + tq(t).
Find an explicit expression for r(t) and find the polynomials p0 , p1 and p2 such that
r(t) = p0 (t)a + p1 (t)b + p2 (t)c.

Problems 7.3 : Curves defined by polar coordinates

13. [R] The following points are given in polar coordinate form. Plot them on a diagram and
find their Cartesian coordinates.
a) (3, 0) b) (6, 7π/6) c) (2, 7π/4)

14. [R] Convert these Cartesian coordinates into polar forms with r ≥ 0 and −π < θ ≤ π.
√
a) (−3, 0) b) (−1, −1) c) (−2, 2 3)
√ √
d) (0, 1) e) (−2 3, 2) f) (−2 3, −2)

15. [R] Sketch the graph corresponding to each polar equation.

a) r = 4 b) θ = 2 c) r = 3θ, for θ ≥ 0.

16. [R]
a) Express r = 6 sin θ, where 0 ≤ θ ≤ π, in Cartesian form and hence draw its graph.
b) Repeat this for r = 2 cos θ, where −π/2 ≤ θ ≤ π/2.

17. [R] Sketch the graph corresponding to each polar equation.

a) r = 2 + sin θ b) r = 3 + cos θ c) [V] r = 2 − 2 cos θ
d) r = 2| cos θ| e) r = 3| sin 6θ| f) r = | tan 2θ | (−π < θ < π)

18. [H] [V] The hyperbolic spiral is described by the equation rθ = a whenever θ > 0, where
sin θ
a is a positive constant. Using the fact that lim = 1, show that the line y = a is a
θ→0 θ
horizontal asymptote to the spiral. Sketch the spiral.

5
19. [H] Show that r = is the polar equation of an ellipse by finding the Cartesian
3 − 2 cos θ
equation of the curve (and completing the square).

20. [X]
a) For what values of θ is r 2 = 25 cos 2θ defined?
b) Sketch the graph of this curve. What difference would it make if you allowed negative
values of r?
dy
21. [H] Find dx for the curves in Q16a,b,c.

c
2020 School of Mathematics and Statistics, UNSW Sydney
157

Chapter 8

Integration

The problem of understanding and calculating the area of regions in the plane has a long history.
Some of the main contributors to this field are Archimedes (the great Greek mathematician of
antiquity), Isaac Barrow (Isaac Newton’s mentor), Isaac Newton (one of the greatest mathemati-
cians and physicists of all time), Gottfried Leibniz (Newton’s contemporary), Bernhard Riemann
(a German mathematician of the nineteenth century) and Henri Lebesgue (a French mathemati-
cian of the early twentieth century). In this chapter we see a method for calculating the area of
regions with curved boundaries known as Riemann integration. We then give a startling result,
known as the fundamental theorem of calculus, which connects the problem of calculating areas
with antidifferentiation. This theorem was known to Barrow and its implications were developed
by Newton, Leibniz and their disciples. A consequence of this theorem and other related results,
is that the problem of calculating area is much the same as that of calculating mass, volume, work
and probability. The unifying feature is a body of theory known as the integral calculus.

8.1 Area and the Riemann Integral

(Ref: SH10 §§5.1, 5.2, B.5)
In this section we examine the problem of defining the area of regions with curved boundaries. The
main technique introduced is called integration, which leads to defining ‘the area under the graph
of a function’ via the Riemann integral.

8.1.1 Area of regions with curved boundaries

We all know how to calculate the area of a rectangle or a triangle. The area of more exotic polygons
can be computed by partitioning the polygon into triangles and then summing the areas of these
triangles (see Figure 8.1 (a)). Such calculations are based on an intuitive understanding of area
that takes for granted that

• the area of a rectangle is the product of its length and height,

• areas of congruent regions are equal, and

• the area of a whole region is the sum of the areas of its parts.

When precisely formulated, these points could be taken together as a definition of area. With this
definition, one can derive a formula for the area of a right-angled triangle (see Figure 8.1 (b)), then

c
2020 School of Mathematics and Statistics, UNSW Sydney
158 CHAPTER 8. INTEGRATION

h h

b b1 b2
b
(a) Partition of a polygon (b) A = 12 bh (c) A = 12 bh

Figure 8.1: Calculating the area of polygons

for an arbitrary triangle (see Figure 8.1 (c)) and then calculate the area of an arbitrary polygon
(as in Figure 8.1 (a)).
When it comes to calculating the area of a region with curved boundaries (see, for example,
Figures 8.2 and 8.3), a more sophisticated definition of area is needed. Before asking what shape
this definition might take, it is worthwhile asking the question, What is area?
First, we recognise that the area of a region is a non-negative real number. Also, our intuition
of area informs us that it should satisfy the following properties (see Figure 8.2 for a diagram
illustrating each property). We assume throughout that the regions considered are bounded.
(A1) If Ω is a region of the plane then
area(Ω) ≥ 0.
(A2) If one region Ω1 is contained in another region Ω2 , then
area(Ω1 ) ≤ area(Ω2 ).

(A3) If the area of a region Ω is partitioned into two smaller disjoint regions Ω1 and Ω2 , then
area(Ω) = area(Ω1 ) + area(Ω2 ).

(A4) If Ω1 and Ω2 are congruent regions then

area(Ω1 ) = area(Ω2 ).

(A5) If Ω is a rectangle of length a and height b then

area(Ω) = ab.

These properties are axioms that any definition of area should satisfy.
Rather than give a definition for the area of any bounded region of the plane, we shall only
give a definition for ‘the area under the graph of a function.’ The following example gives some
justification for this decision.
Example 8.1.1. Consider the region Ω of Figure 8.3. By partitioning Ω with straight lines as
shown, one can see by axiom (A3) that
area(Ω) = area(Ω1 ) + area(Ω2 ) + area(Ω3 ).
By rotating and translating each subregion, we see by axiom (A4) that each of area(Ω1 ), area(Ω2 )
and area(Ω3 ) is equal to the area under the graph of a function.
This procedure can be done for any region in the plane with a ‘reasonable’ boundary. Hence
the problem of defining the area of a region with a curved boundary can be reduced to defining
what is meant by ‘the area under the graph of a function.’

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.1. AREA AND THE RIEMANN INTEGRAL 159

Ω Ω2 Ω

Ω1 Ω1 Ω2 Ω2

Ω1
(a) (A1) (b) (A2) (c) (A3) (d) (A4)

a
(e) (A5)

Figure 8.2: Properties of area

Ω1

Ω Ω3
Ω2

(a) (b)
y y y
f1
f2 f3

x x x
(c)

Figure 8.3: Partitioning of a region with curved boundary

c
2020 School of Mathematics and Statistics, UNSW Sydney
160 CHAPTER 8. INTEGRATION
f (x) f (x) f (x)

Ω Ω1 Ω2

1 x 1 x 1 x
(a) (b) (c)

Figure 8.4: Area under the graph of f (x) = x2 .

8.1.2 Approximations of area using Riemann sums

To guide us towards a definition for ‘the area under the graph of a function,’ we study the following
example.
Consider the function f : R → R, given by the rule f (x) = x2 . Let Ω denote the region bounded
by the graph of f , the x-axis and the lines x = 0 and x = 1. The region Ω is shaded in Figure
8.4 (a). The region Ω may be ‘approximated’ with rectangles as in Figure 8.4 (b) and (c). Since
the regions Ω1 and Ω2 are composed of disjoint rectangles, their areas are easily calculated using
axioms (A3) and (A5) (see Subsection 8.1.1). Moreover, however we eventually choose to define
area(Ω), it is clear by axiom (A2) that we want

area(Ω1 ) ≤ area(Ω) ≤ area(Ω2 ).

This gives a lower and upper bound for area(Ω).

The regions Ω1 and Ω2 each consist of ten rectangles. By changing the number of rectangles
used, we obtain different upper and lower bounds for area(Ω).

Example 8.1.2. Find different upper and lower bounds for the area of the shaded region Ω of
Figure 8.4 (a).

The following formula will help our solution:

n
X 1
k2 = 12 + 22 + 32 + · · · + n2 = n(n + 1)(2n + 1). (8.1)
6
k=1

(This may be proved using mathematical induction. If students need to use such a formula in an
exam, they will be given it).

Solution to Example 8.1.2. Let A denote the (as yet undefined) area of the region Ω. Rather than
calculating an upper bound for A by using 10 rectangles (as in Figure 8.4 (c)), we use n rectangles,
where n is a positive integer (see Figure 8.5). The bases of the rectangles are constructed by
subdividing the interval [0, 1] into n subintervals

[0, n1 ], [ n1 , n2 ], [ n2 , n3 ], . . . , [ n−1
n , 1].

The set Pn , given by

Pn = {0, n1 , n2 , n3 , . . . , n−1
n , 1},

that divides the interval [0, 1] into these subintervals is called a partition of [0, 1].

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.1. AREA AND THE RIEMANN INTEGRAL 161
y y = f (x)

Rk
··· ···

1 2 3 4 ··· k−1 k ··· n−1 n x

n n n n n n n n

Figure 8.5: An upper Riemann sum for Example 8.1.2.

Let Rk denote the area of the kth rectangle in Figure 8.5. Then

Rk = width × height
1
= × f ( nk )
n
2
1 k
= ×
n n
1
= 3 × k2 .
n
If S Pn (f ) denotes the total area of the shaded region in Figure 8.5, then
n
X
S Pn (f ) = Rk
k=1
n
X 1
= × k2
n3
k=1
n
1 X 2
= 3 k
n
k=1
1 2
= 3
1 + 22 + 32 + · · · + (n − 1)2 + n2
n
1 1
= × n(n + 1)(2n + 1) (by formula (8.1))
n3 6
1 1 1
= 1+ 2+
6 n n
1 1 1
= + + .
3 2n 6n2

c
2020 School of Mathematics and Statistics, UNSW Sydney
162 CHAPTER 8. INTEGRATION

The quantity S Pn (f ) is called the upper Riemann sum of f with respect to the partition Pn . Axiom
(A2) imples that
1 1 1
A ≤ S Pn (f ) = + + 2,
3 2n 6n
and by taking different values of n we obtain different upper bounds for A.
To compute a lower bound corresponding to n rectangles, consider Figure 8.6. The area Rk of
the kth rectangle is given by
1 1
Rk = × f ( k−1
n ) = 3 (k − 1)2 .
n n
The sum of all the areas of the rectangles is called the lower Riemann sum for the function f over
the partition Pn and is denoted by S Pn (f ). We see that
n
X
S Pn (f ) = Rk
k=1
n
X 1
= × (k − 1)2
n3
k=1
n
1 X
= (k − 1)2
n3
k=1
1 2
= 3
1 + 22 + 32 + · · · + (n − 1)2
n
1 1
= 3
× (n − 1)n(2n − 1) (by formula (8.1))
n 6
1 1 1
= 1− 2−
6 n n
1 1 1
= − + 2.
3 2n 6n
By axiom (A2),
1 1 1
A ≥ S Pn (f ) =− + 2.
3 2n 6n
Hence for each positive integer n, the inequality
1 1 1 1 1 1
− + 2 ≤A≤ + + 2 (8.2)
3 2n 6n 3 2n 6n
gives an upper and lower bound for A.

From inequality (8.2) we see that, however the area A of Ω is eventually defined, it must be
squeezed between
1 1 1 1 1 1
− + 2 and + + 2
3 2n 6n 3 2n 6n
for every value of n. But as n → ∞,
1 1 1 1 1 1 1 1
− + 2 → and + + 2 → . (8.3)
3 2n 6n 3 3 2n 6n 3
So there is only one number A that satisfies (8.2) for every positive integer n, namely A = 31 . Our
definition of the area under the graph of a function must agree with this calculation.

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.1. AREA AND THE RIEMANN INTEGRAL 163
y y = f (x)

Rk
··· ···

1 2 3 4 ··· k−1 k ··· n−1 n x

n n n n n n n n

Figure 8.6: A lower Riemann sum for Example 8.1.2.

Remark 8.1.3. The process of calculating upper and lower Riemann sums and taking a limit as
in (8.3) is called integration.

Guided by the ideas introduced in this subsection, we are now ready to give a definition for the
area under the graph of a function.

8.1.3 The definition of area under the graph of a function and the Riemann
integral
Suppose that f is a bounded function on [a, b] and that f (x) ≥ 0 for all x in [a, b]. In this subsection
we define what is meant by ‘the area under the graph of f from a to b’. This is done by constructing
upper and lower Riemann sums with respect to partitions of [a, b].

Definition 8.1.4. A finite set P of points in R is said to be a partition of [a, b] if

P = {a0 , a1 , a2 , . . . , an }
and
a = a0 < a1 < a2 < . . . < an = b.

Suppose that P is a partition of [a, b]. In a manner similar to Example 8.1.2, P is used
to construct rectangles that approximate the region under the graph of f (see Figure 8.7). As
illustrated in the diagram, the points of P need not be evenly spaced.
The area of the kth rectangle in Figure 8.7 is

width × height = (ak − ak−1 ) × f k ,

c
2020 School of Mathematics and Statistics, UNSW Sydney
164 f (x) CHAPTER 8. INTEGRATION

··· ···

a0 a1 a2 a3 · · · ak−1 ak ··· an−2 an−1 an x

Figure 8.7: An example of an upper Riemann sum

where f k is the maximum value of the function f on the subinterval [ak−1 , ak ]. The upper Riemann
sum S P (f ) for f with respect to the partition P is defined by the formula
n
X
S P (f ) = (ak − ak−1 )f k . (8.4)
k=1

Hence the upper Riemann sum corresponds to the total area of the rectangles in Figure 8.4. Like-
wise, the lower Riemann sum S P (f ) for f with respect to the partition P is defined by the formula
n
X
S P (f ) = (ak − ak−1 )f k , (8.5)
k=1

where f k is the minimum value of f on the subinterval [ak−1 , ak ].

We are now ready to give a definition of the area under the graph of a function f .

Definition 8.1.5 (Definition of the area under the graph of a function). Suppose
that a function f is bounded on [a, b] and that f (x) ≥ 0 for all x in [a, b]. If there
exists a unique real number A such that
S P (f ) ≤ A ≤ S P (f ) for every partition P of [a, b], (8.6)
then we say that A is the area under the graph of f from a to b.

It may not seem obvious now but, as we shall see in Section 8.2, not every bounded function f
has an area under its graph in the sense of Definition 8.1.5. The following definition gives a special
name to those functions that do have a well-defined area under their graph. At this point, we lift
the restriction that f (x) ≥ 0 for all x in [a, b].

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.2. INTEGRATION USING RIEMANN SUMS 165

Definition 8.1.6 (Definition of the Riemann integral). Suppose that a function f

is bounded on [a, b]. If there exists a unique real number I such that
S P (f ) ≤ I ≤ S P (f ) for every partition P of [a, b], (8.7)
then we say that f is Riemann integrable on the interval [a, b]. If f is Riemann
integrable, then the unique real number I satisfying (8.7) is called the definite integral
of f from a to b and we write
Z b
I= f (x) dx.
a
The function f is called the integrand of the definite integral, while the points a and
b are called the limits of the definite integral.

Rb
Remark 8.1.7. The notation a f (x) dx is due to Leibniz. It evolved from a slightly different way
of writing down lower and upper Riemann sums. For example, S P (f ) may be written as
n
X
S P (f ) = f (xk )∆xk ,
k=1

where ∆xk = ak − ak−1 and f attains its maximum value on [ak−1 , ak ]Pat the point xk . When
taking a limit, as in (8.3), ∆xk was replaced with dx and the symbol was replaced with an
elongated ‘S’ (‘S’ stands for ‘sum’).
Rb
Remark 8.1.8. If f is Riemann integrable on [a, b], then the real number a f (x) dx is the area
under the graph of f from a to b, provided that f (x) ≥ 0 for all x in [a, b]. A geometric interpretation
Rb
of the real number a f (x) dx when f takes negative values will be discussed in Section 8.3.
Remark 8.1.9. (MATH1141 only.) If f is bounded but not continuous, then f may not attain
a maximum or minimum value on the closed interval [ak , ak−1 ]. This technical difficulty can be
overcome by defining the upper and lower Riemann sums in terms of suprema and infima rather than
maxima and minima. The concept of a supremum and infimum will be introduced in MATH1241.
Remark 8.1.10. One should think about why Definition 8.1.5 is a good definition for the area
under the graph of a function. The following questions need to be answered:
• If the area under the graph of f is well-defined by Definition 8.1.5, would it agree with
any other reasonable definition for the area under the graph of f ?
• Is Definition 8.1.5 consistent with the axioms of area listed in Subsection 8.1.1?
The answer to each of these questions is ‘yes,’ but we leave it to the reader to ponder why this
might be so. Some of the results given over the next few sections do confirm that our definition of
area under a graph is consistent with the axioms of area.

8.2 Integration using Riemann sums

(Ref: SH10 §5.2)
In this section we calculate the area under the graph of functions by computing Riemann sums.
Our notation for the upper and lower Riemann sums follows that of equations (8.4) and (8.5).

c
2020 School of Mathematics and Statistics, UNSW Sydney
166 CHAPTER 8. INTEGRATION
f (x) f (x) f (x) f (x)

2
C
1

|
a bx 1 x 1 2 x 1 x
(a) (b) (c) (d)

Figure 8.8: Diagrams for examples in Section 8.2.

Example 8.2.1. Suppose that f (x) = C where C is a positive constant. Show that f is Riemann
integrable on [a, b] and calculate the area under the graph of f from a to b (see Figure 8.8 (a)).
Solution. Suppose that P is a partition of [a, b] and that P = {a0 , a1 , . . . , an }. Since f is constant,
f k = f k = C for every k between 1 and n. Therefore
n
X
S P (f ) = f k (ak − ak−1 )
k=1
n
X
= C(ak − ak−1 )
k=1
= C(a1 − a0 + a2 − a1 + a3 − a2 + · · · + an−1 − an−2 + an − an−1 )
= C(ak − a0 )
= C(b − a).

The same calculation gives S P (f ) = C(b − a). Since P is an arbitrary partition of [a, b], we have

S P (f ) = C(b − a) = S P for every partition P of [a, b].

Hence there is a unique number A satisfying (8.7), namely A = C(b − a). Therefore f is Riemann
integrable,
Z b
f (x) dx = C(b − a)
a
and the area under the graph of f is C(b − a).

Remark 8.2.2. The area under the graph of f is consistent with axiom (A5).
There are many other kinds of functions which are Riemann integrable. One of the largest
classes consists of those functions that are bounded and piecewise continuous.

Definition 8.2.3. A function f : [a, b] → R is said to be piecewise continuous if it

is continuous on [a, b] at all except perhaps a finite number of points.

Figure 8.9 illustrates the graphs of two functions f and g that are piecewise continuous on [a, b].
The function f is bounded on [a, b] while the function g is not. According to the next theorem, the
area of the shaded region under the graph of f is well-defined.

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.2. INTEGRATION USING RIEMANN SUMS 167
f (x) g(x)
b

b b

bc
b

| | | |
a b x a b x

Figure 8.9: Piecewise continuous functions

Theorem 8.2.4. If f is bounded and piecewise continuous on [a, b] then f is integrable on [a, b].

Since the proof of this theorem is difficult, we omit it.

In practice, it often hard to prove from first principles that a function f is Riemann integrable.
However, if we know in advance that f is integrable on [a, b] then it is much easier to calculate its
definite integral from a to b. All one needs to do is show that

lim S Pn (f ) = lim S Pn (f )
n→∞ n→∞

for some sequence of partitions Pn of [a, b]. If I denotes this common limit, then
Z b
f (x) dx = I.
a

Example 8.2.5. If f (x) = x2 , find the area under the graph of f from 0 to 1 (see Figure 8.8 (b)).

Solution. We first note that f is integrable on [0, 1] since it is bounded and continuous on [0, 1].
Suppose that
Pn = {0, n1 , n2 , n3 , . . . , n−1 n
n , n .}

Then
1 1 1 1 1 1
S Pn (f ) = + + 2 and S Pn (f ) = − + 2
3 2n 6n 3 2n 6n
(see Example 8.1.2 for these calculations). Since

1
lim S Pn (f ) = = lim S (f ),
n→∞ 3 n→∞ Pn
we conclude that Z 1
1
f (x) dx = .
0 3
So the area underneath the graph of f from 0 to 1 is 31 .
R2
Example 8.2.6. Suppose that f (x) = 3 − x. Calculate 1 f (x) dx (see Figure 8.8 (c)).

Our final example shows that some functions are not Riemann integrable. For such a function
f , the area under the graph of f is not defined according to Riemann integration.

c
2020 School of Mathematics and Statistics, UNSW Sydney
168 CHAPTER 8. INTEGRATION

Example 8.2.7. Suppose that f : [0, 1] → R is defined by

(
2 if x ∈ Q
f (x) =
1 if x ∈/ Q.
(see Figure 8.8 (d)). Show that f is not Riemann integrable.
Solution. Suppose that Pn is an arbitrary partition of [0, 1]. Then f k = 2 while f k = 1 for every k
between 1 and n. Hence
S Pn (f ) = 1 and S Pn (f ) = 2.
(Exercise: show that this is the case by following a method similar to Example 8.2.1). Therefore
there is no unique real number I satisfying (8.7). We conclude that f is not Riemann integrable.
Remark 8.2.8. A more sophisticated definition of the integral, known as the Lebesgue integral, was
developed at the beginning of the twentieth century by the French mathematician Henri Lebesgue.
Lebesgue’s definition of the integral allows us to integrate some functions which are not Riemann
integrable. For example, if f is the function of Example 8.2.7 then
Z 1
• f is not Riemann integrable and the Riemann integral f (x) dx is not defined;
0
Z 1
• f is Lebesgue integrable and the Lebesgue integral f (x) dx is equal to 1.
0
The Lebesgue integral is studied in some third year mathematics courses.

8.3 The Riemann integral and signed area

(Ref: SH10 §5.5)
The definite integral
Z b
f (x) dx
a
can only be interpreted as the area under the graph of f from a to b if f (x) ≥ 0 for all x in [a, b].
How should we interpret the definite integral if the integrand f has negative values on [a, b]?
Consider the function f graphed in Figure 8.10 (a). The upper sum from a to b, illusrated in
Figure 8.10 (b), approximates area(Ω1 ). However, since f (x) < 0 when b < x < c, each term in the
upper sum from b to c will be negative. Hence the upper sum from b to c approximates −area(Ω2 ).
By increasing the number of subdivisions of [a, c] and decreasing the size of each subdivison, these
approximations get better and better. It follows that
Z c
f (x) dx = area(Ω1 ) − area(Ω2 ).
a
We call this quantity the signed area under the graph of f from a to c.
Thus there are two ways to calculate the (unsigned or absolute) area of the entire shaded region
Ω of Figure 8.10 (a):
Z c
• area(Ω) = |f (x)| dx; or
a
Z b Z c
• area(Ω) = f (x) dx − f (x) dx.
a b
This kind of calculation will be illustrated in Section 8.6.

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.4. BASIC PROPERTIES OF THE RIEMANN INTEGRAL 169
f (x) f (x)

Ω1
c c
a b x a b x
Ω2

(a) (b)

Figure 8.10: Signed area and the definite integral

8.4 Basic properties of the Riemann integral

(Ref: SH10 §5.8)
The following proposition gives a list of some of the basic properties of the Riemann integral.
Proposition 8.4.1 (Basic properties of the Riemann integral). Suppose that f and g are integrable
functions over [a, b].
(i) If α and β are real numbers then αf + βg is integrable and
Z b Z b Z b
(αf + βg)(x) dx = α f (x) dx + β g(x) dx.
a a a

(ii) If a < c < b then

Z b Z c Z b
f (x) dx = f (x) dx + f (x) dx.
a a c
Z b
(iii) If f (x) ≥ 0 for all x in [a, b] then f (x) dx ≥ 0.
a
Z b Z b
(iv) If f (x) ≤ g(x) for all x in [a, b] then f (x) dx ≤ g(x) dx.
a a

(v) If m ≤ f (x) ≤ M for all x in [a, b] then

Z b
m(b − a) ≤ f (x) dx ≤ M (b − a).
a

(vi) If |f | is integrable on [a, b] then

Z b Z b

f (x) dx ≤
|f (x)| dx.
a a

Each of these properties can be interpreted geometrically (see Figure 8.11). For example, under
the assumption that f (x) ≥ 0, property (iv) says that if the graph of f lies beneath the graph of g
then the area under the graph of f will be less than the area under the graph of g. Except for (i),
each of these properties is easy to prove.

c
2020 School of Mathematics and Statistics, UNSW Sydney
170 CHAPTER 8. INTEGRATION
f (x) y g y
f
f M

a c b x a b x a b x
(a) Prop 8.4.1(ii) (b) Prop 8.4.1 (iv) (c) Prop 8.4.1 (v)

Figure 8.11: Geometric interpretation of Proposition 8.4.1.

Proof. We omit the proof of (i) and (ii).

(iii) If f (x) ≥ 0 for all x in [a, b], then each term in lower Riemann sum (8.5) is non-negative
for any partition P of [a, b]. Hence
Z b
0 ≤ S P (f ) ≤ f (x) dx
a
by Definition 8.1.6.
(iv) Suppose that h = g − f . Then h(x) ≥ 0 for all x in [a, b] and by applying (iii) and (i) we
have Z b Z b Z b Z b

0≤ h(x) dx = g(x) − f (x) dx = g(x) dx − f (x) dx.
a a a a
Rearranging the inequality gives the result.
(v) By (iv) we have
Z b Z b Z b
m dx ≤ f (x) dx ≤ M dx.
a a a
By Example 8.2.1, the integral on the left is m(b − a) and the integral on the right is M (b − a),
completing the proof.
(vi) Now
−|f (x)| ≤ f (x) ≤ |f (x)| ∀x ∈ [a, b],
so Z Z Z
b b b
−|f (x)| dx ≤ f (x) dx ≤ |f (x)| dx
a a a
Rb Rb
by (iv). By (i), a −|f (x)| dx = − a |f (x)| dx and so
Z b Z b Z b
− |f (x)| dx ≤ f (x) dx ≤ |f (x)| dx
a a a
which is equivalent to Z b Z b

f (x) dx ≤ |f (x)| dx
a a
by a property of the absolute value function.

So far the definite integral Z b

f (x) dx
a
has only been defined when a < b. We now extend this definition.

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.5. THE FIRST FUNDAMENTAL THEOREM OF CALCULUS 171

Definition 8.4.2. Suppose that b < a and that f in integrable on [b, a]. Then we
define Z b
f (x) dx
a
to be the real number Z a
− f dx
b
and we define Z a
f (x) dx
a
to be 0.

This definition gives a more general version of Proposition 8.4.1 (ii). We omit the proof.
Proposition 8.4.3. Suppose that a, b and c are real numbers and that f is integrable over some
interval containing a, b and c. Then
Z b Z c Z b
f (x) dx = f (x) dx + f (x) dx.
a a c

8.5 The first fundamental theorem of calculus

In this section we present a surprising theoretical result that indicates a strong connection between
integration (calculating areas) and differentiation (calculating gradients of tangents). This result
has enormous ramifications for the problem of calculating area.
We begin by studying the area under the graph of a function. Suppose that f is a continuous
function on the interval [a, b]. Since f is integrable on [a, b] (see Theorem 8.2.4) we can define the
function F : [a, b] → R by the formula
Z x
F (x) = f (t) dt ∀x ∈ [a, b]. (8.8)
a

For x in [a, b], the number F (x) represents the (signed) area captured underneath the graph of x
on the interval [a, x] (see Figure 8.12 (a)). Thus F is an ‘area function’.
(Note that the limit x in (8.8) is a different symbol from the ‘dummy variable’ t. It would be
incorrect and confusing to write Z x
F (x) = f (x) dx
a
for formula (8.8).)
We investigate some of the properties of F . First, it is easy to see that if f (t) ≥ 0 for all t in
[a, b] then the shaded area under f increases as x increases. That is, if f is a non-negative function
then F is an increasing function.
Second, we ask whether F continuous on [a, b] and differentiable on (a, b). Since differentiability
on (a, b) implies continuity on (a, b), we address the question of differentiability first. Consider the
difference quotient
F (x + h) − F (x)
h
c
2020 School of Mathematics and Statistics, UNSW Sydney
172 CHAPTER 8. INTEGRATION
y y = f (t) y y = f (t)

F (x)
| | | | | | |
a x b t a x x+h b t
(a) (b)

Figure 8.12: The area function F

when x ∈ (a, b) and h is small. Now, if h is a small positive number then Figure 8.12 (b) shows
that

F (x + h) − F (x) = (area under f on the interval [x, x + h])

≈ f (x) × h,

since f (t) ≈ f (x) for all t in [x, x + h] by the continuity of f . Dividing both sides by h gives
F (x + h) − F (x)
≈ f (x).
h
As h gets smaller, this approximation gets better and so we expect that
F (x + h) − F (x)
lim = f (x).
h→0+ h
A similar argument may be repeated when h is a small negative number to obtain the corresponding
left-hand limit. Therefore it seems true that
F (x + h) − F (x)
F ′ (x) = lim = f (x).
h→0 h
This intuitive argument can be made rigorous (see the proof at the end of this section). Hence we
conclude that F is continuous and differentable on (a, b) and that F ′ (x) = f (x) for all x in (a, b).
The fact that a continuous function f and its ‘area function’ F are related by the differential
equation
F ′ (x) = f (x)
has far-reaching consequences. This result is so important that it has come to be known as the
fundamental theorem of calculus.
Theorem 8.5.1 (The first fundamental theorem of calculus). If f is continuous function defined
on [a, b] then the function F : [a, b] → R, defined by
Z x
F (x) = f (t) dt, (8.9)
a

is continuous on [a, b], differentiable on (a, b) and has derivative F ′ given by

F ′ (x) = f (x)

for all x in (a, b).

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.5. THE FIRST FUNDAMENTAL THEOREM OF CALCULUS 173

The first fundamental theorem of calculus is a deep mathematical result which relates differen-
tiation to integration. From the statement of the theorem we note the following points.
• The fact that F satisfies the equation F ′ = f on (a, b) says that F is an antiderivative of
f on (a, b). Hence every continuous function f has an antiderivative F , given by formula
(8.9).

• Since any two antiderivatives of f differ by a constant (see Theorem 5.9.4), every an-
tiderivative of f is of the form
F + constant,
where F is given by the integral formula (8.9). Hence the process of integration and of
antidifferentiation are essentially the same. (This is surprising from a geometric point of
view, where integration is used to calculate areas while differentiation is used to calculate
gradients of tangents.)

• The above point suggests that integration and differentiation are inverse processes. If one
takes a function f , integrates it and differentiates the result, then one obtains f again.
That is, differentiation undoes what integration does to f . This is precisely expressed by
the formula Z x
d
f (x) = f (t) dt .
dx a
Whether or not the converse statement is true (that is, that integration undoes what
differentiation does to a function f ) will be discussed in the next section.
Thus the first fundamental theorem of calculus suggests that there may be a different method for
calculating the area under the graph of f ; rather than integrating via limits of Riemann sums, try
integrating via antidifferentiation. This approach is investigated in the next section.
We end this section by presenting a proof of the first fundamental theorem of calculus. The
proof of the differentiability of F is based on the intuitive argument given immediately prior to the
statement of Theorem 8.5.1.

Proof of Theorem 8.5.1. We begin by showing that F is differentiable on (a, b). Suppose that
x ∈ (a, b) and consider a small positive number h. By Proposition 8.4.1 (ii) one can show that
Z x+h
F (x + h) − F (x) = f (t) dt. (8.10)
x

Since f is continuous on [a, b], it attains a minimum value mh and a maximum value Mh on [x, x+h];
that is,
mh ≤ f (t) ≤ Mh ∀t ∈ [x, x + h].
By Proposition 8.4.1 (v) (see also Figure 8.13),
Z x+h
mh h ≤ f (t) dt ≤ Mh h
x

and hence
mh h ≤ F (x + h) − F (x) ≤ Mh h.
by (8.10). Now h > 0 so
F (x + h) − F (x)
mh ≤ ≤ Mh .
h
c
2020 School of Mathematics and Statistics, UNSW Sydney
174 CHAPTER 8. INTEGRATION
y

Mh y = f (x)
mh

x x+h x

Figure 8.13: Diagram for the proof of Theorem 8.5.1.

Since f is continuous on [x, x + h],

lim mh = f (x) = lim Mh
h→0+ h→0+
and hence
F (x + h) − F (x)
lim = f (x)
h→0+ h
by the pinching theorem (Theorem 2.5.9).
In a similar manner, one can show that
F (x + h) − F (x)
lim = f (x).
h→0− h
Hence F is differentiable on (a, b) and F ′ (x) = f (x) for all x in (a, b).
To complete the proof, we need to show that F is continuous on [a, b]. Since F is differentiable
on (a, b), it is also continuous on (a, b) by Theorem 4.5.1. So it suffices to show that F is continuous
at the endpoints of [a, b].
We will show that F is continuous at a. Since f is continuous on [a, b] there is a positive
constant M such that |f (x)| ≤ M for all x in [a, b] (see Corollary 3.4.9). Suppose that x ∈ (a, b).
By using the properties of Proposition 8.4.1,
Z x Z a

|F (x) − F (a)| = f (t) dt − f (t) dt
Zax a

= f (t) dt (Definition 8.4.2)
Z ax
≤ |f (t)| dt (Proposition 8.4.1 (vi))
a
Z x
≤ M dt (f is bounded)
a
= M |x − a| (Example 8.2.1).
Hence
|F (x) − F (a)| ≤ M |x − a| → 0
as x → a+ , or in other words, lim F (x) = F (a). So F is continuous at a.
x→a+
The proof of continuity at the endpoint b is similar.

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.6. THE SECOND FUNDAMENTAL THEOREM OF CALCULUS 175

8.6 The second fundamental theorem of calculus

(Ref: SH10 §5.4)
The first fundamental theorem of calculus says that the processes of integration and antidifferen-
tiation are essentially the same. The second fundamental theorem of calculus gives a fast way of
calculating integrals by exploiting this similarity.

Theorem 8.6.1 (The second fundamental theorem of calculus). Suppose that f is a continuous
function on [a, b]. If F is an antiderivative of f on [a, b] then
Z b
f (t) dt = F (b) − F (a). (8.11)
a

The second fundamental theorem of calculus is an astonishing result. To calculate the signed
area under the graph of a continuous function f , all one needs to know is the value of any an-
tiderivative of f at the endpoints a and b. To illustrate the power of this result, compare the
lengthy calculation of Riemann sums in Example 8.1.2 with the example below.

Example 8.6.2. Suppose that f (x) = x2 for all real numbers x. Calculate the area of the region
bounded by the x-axis, the graph of f and lines x = 0 and x = 1.

Solution. Essentially we are being asked to evaluate

Z 1
f (t) dt
0

t3
where f (t) = t2 (see Figure 8.4 (a)). An antiderivative F of f is given by F (t) = 3. Hence, by the
second fundamental theorem of calculus,
Z 1
13 03 1
f (t) dt = F (1) − F (0) = − = .
0 3 3 3

So the area of the region is 1/3.

Remark 8.6.3. The expression F (b) − F (a) appearing in (8.11) is used so frequently that it is
often abbreviated to b h ib

F (x) or F (x) .
a a

Thus the calculation of Example 8.6.2 may be written as

Z 1 1
2 t3 1 1
t dt = = −0= .
0 3 0 3 3

When calculating areas, it is important to bear in mind that the integral in (8.11) represents
the signed area.

Example 8.6.4. Calculate the area of the region bounded by the x-axis, the lines x = 0 and x = 4
and the function f given by
f (x) = (x − 2)(x − 4).

c
2020 School of Mathematics and Statistics, UNSW Sydney
176 CHAPTER 8. INTEGRATION
y

y = f (x)

2 4 x

Solution. The above sketch shows that part of the region lies beneath the x-axis. So the area we
seek corresponds to the integral
Z 4 Z 2 Z 4
|f (x)| dx = f (x) dx + (−f (x)) dx
0 0 2
Z 2 Z 4
= x2 − 6x + 8 dx − x2 − 6x + 8 dx
0 2
4 2
x3 x32 2
= − 3x + 8x − − 3x + 8x
3 3
0 2
8
= − 12 + 16 − 0
3

64 8
− − 48 + 32 − − 12 + 16
3 3
= 8.
So the area is 8 square units.
Z 4
Note that this is a different problem to: Evaluate f (x) dx, whose solution is:
0
Z 4 Z 4
f (x) dx = x2 − 6x + 8 dx
0 0
4
x3
= − 3x2 + 8x
3 0
16
= .
3

We now present a proof of the second fundamental theorem of calculus. The tools used in the
proof are the first fundamental theorem of calculus and a corollary of the mean value theorem.
Proof of Theorem 8.6.1. Suppose that f is continuous on [a, b] and that F is an antiderivative of
f on [a, b]. Define a function G : [a, b] → R by the formula
Z x
G(x) = f (t) dt ∀x ∈ [a, b].
a

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.6. THE SECOND FUNDAMENTAL THEOREM OF CALCULUS 177

By the first fundamental theorem of calculus (see Theorem 8.5.1), G is continuous on [a, b] and
G′ (x) = f (x) for all x in (a, b). Hence G is also an antiderivative of f on [a, b]. By Theorem 5.9.4,
there is a constant C such that
G(x) = F (x) + C (8.12)
for all x in [a, b]. Note that Z a
G(a) = f (t) dt = 0.
a

So if x = a then equation (8.12) becomes

0 = F (a) + C.

Hence C = −F (a). Therefore

Z b
f (t) dt = G(b)
a
= F (b) + C (by (8.12))
= F (b) − F (a)

as required.

We return to the question of whether integration and differentiation are inverse processes.
Suppose that f is continuous on [a, b]. The first fundamental theorem of calculus states that if one
integrates a function f and differentiates the result, then one obtains f again. This is expressed by
the formula Z x
d
f (x) = f (t) dt ∀x ∈ (a, b).
dx a

To be able to say that integration and differentiation are inverses operations of each other, we must
ask whether the converse is true. That is, if one differentiates a function f and integrates the result,
then does one obtain f again? In precise terms, we would like the formula
Z x
f (x) = f ′ (t) dt ∀x ∈ (a, b) (8.13)
a

to hold. Unfortunately, this converse is false. The following corollary to the second fundamental
theorem of calculus says that one obtains f again only when f (a) = 0. Otherwise formula (8.13) is
‘out by a constant.’

Corollary 8.6.5. Suppose that f is continuous on [a, b] and has a continuous derivative on (a, b).
Then Z x
f ′ (t) dt = f (x) − f (a)
a

for all x in [a, b]

Proof. Apply the second fundamental theorem of calculus to the function f ′ on the interval [a, x].

The second fundamental theorem of calculus allows a fruitful interaction between Riemann
sums and antidifferentiation.

c
2020 School of Mathematics and Statistics, UNSW Sydney
178 CHAPTER 8. INTEGRATION
1
Example 8.6.6. Suppose that f (x) = . By considering the lower Riemann sum of f with
1 + x2
respect to the partition
{0, n1 , n2 , . . . , n−1 n
n , n}

of [0, 1], show that

n
π X n n n n n
= lim = lim + 2 + 2 + ··· + 2 .
4 n→∞ n + k2 n→∞
2 2
n +1 2 n +2 2 n +3 2 n + n2
k=1

Solution. Let Pn denote the above partition of [0, 1]. Since f is a decreasing function on [0, 1], the
lower Riemann sum is given by
n
X 1 1
S Pn (f ) =
n 1 + (k/n)2
k=1
n
X n
= .
n2 + k2
k=1

Since f is bounded and continuous, it is integrable on [0, 1] and hence

Z 1
1
lim S Pn (f ) = dx
n→∞ 0 1 + x2

(see Remark 8.6.7). But by the second fundamental theorem of calculus,

Z 1 h i1 π
1 −1
dx = tan (x) = .
0 1 + x2 0 4

Hence
n
π X n
= lim
4 n→∞ n + k2
2
k=1

as desired.

Remark 8.6.7. The solution to Example 8.6.6 uses the following result. Suppose that f is Riemann
integrable on [a, b] and that {Pn }∞
n=1 is a sequence of partitions of [a, b]. Let sn denote the maximum
size of the subintervals generated by Pn . If lim sn = 0 then
n→0
Z b
lim S Pn (f ) = f (x) dx = lim S Pn (f ).
n→∞ a n→∞

8.7 Indefinite integrals

(Ref: SH10 §5.6)
The fundamental theorems of calculus say that integration and antidifferentiation are, in a certain
sense, the same process. In particular, if a F is an antiderivative of f then
Z x
f (t) dt = F (x) + C
a

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.7. INDEFINITE INTEGRALS 179

for some suitable constant C. If we have no particular interest in the interval [a, x] and merely
want to indicate that F is an antiderivative of f , then we write
Z
f (t) dt = F (t) + C. (8.14)

An integral expressed in this way, without limits, is called an indefinite integral. The constant C
in (8.14) is called the constant of integration.
The table of antiderivatives given in Section 5.9 is equivalent to the following table of indefinite
integrals.

Indefinite integrals
Z
1
xr dx = xr+1 + C, where r is a rational number and r 6= −1
r+1
Z
sin x dx = − cos x + C
Z
cos x dx = sin x + C
Z
1
eax dx = eax + C
a
Z ′
f (x)
dx = ln |f (x)| + C
f (x)

Since differentiation and indefinite integration (or antidifferentiation) are inverse processes, they
have many analogous properties. For example, differentiation behaves linearly, which means that

(f + g)′ (x) = f ′ (x) + g ′ (x)

and
(αf )′ (x) = α × f ′ (x)
for every real number α and every differentiable function f and g. The next proposition says that
indefinite integration is also behaves linearly. (We have already shown in Proposition 8.4.1 that
definite integration is linear.)

Proposition 8.7.1. Suppose that f and g are integrable functions and that α is a real constant.
Then Z Z Z
(f + g)(x) dx = f (x) dx + g(x) dx (8.15)

and Z Z
(αf )(x) dx = α f (x) dx. (8.16)

Proof. We prove (8.15) only and leave the proof of 8.16 as an exercise. Suppose that F , G and H
are antiderivatives of f , g and f + g respectively. Then

H ′ (x) = (f + g)(x) = f (x) + g(x) = F ′ (x) + G′ (x)

by the linearity of differentiation. By expressing the equation H ′ = F ′ + G′ in terms of indefinite

integrals we obtain (8.15) as desired.

c
2020 School of Mathematics and Statistics, UNSW Sydney
180 CHAPTER 8. INTEGRATION

Example 8.7.2. Find Z

(10x4 + 8 sin x − 9ex ) dx.

Proof. By Proposition 8.7.1,

Z Z Z Z
(10x4 + 8 sin x − 9ex ) dx = 10 x4 dx + 8 sin x dx − 9 ex dx

1 5
= 10 x + C1 + 8 (− cos x + C2 ) − 9 (ex + C3 )
5
= 2x5 − 8 cos x − 9ex + C

where C1 , C2 and C3 are constants of integration and C = 10C1 + 8C2 − 9C3 .

Remark 8.7.3. It is standard practice to combine of all constants of integration and simply write
‘+C’. Thus the calculation in Example 8.7.2 may be written as
Z Z Z Z
(10x + 8 sin x − 9e ) dx = 10 x dx + 8 sin x dx − 9 ex dx
4 x 4

1
= 10 × x5 + 8(− cos x) − 9ex + C
5
= 2x − 8 cos x − 9ex + C.
5

8.8 Integration by substitution

(Ref: SH10 §5.7)
When differentiating the composition of two functions, we apply the chain rule. By reversing this
process with integration, we obtain a technique known as integration by substitution.
The basic idea is the following. Suppose that F is an antiderivative for f . Then
Z
f (g(x)).g ′ (x) dx = F (g(x)) + C, (8.17)

as can be easily verified by differentiating the right-hand side via the chain rule. On the other
hand, Z
f (u) du = F (u) + C. (8.18)

If we make the substitution u = g(x) then the right-hand sides of (8.17) and (8.18) are the same.
This suggests the substitution du = g′ (x) dx, so that the left-hand side of (8.17) is transformed
into the left-hand side of (8.18). These mechanical substitutions can be used to transform the
complicated integral into a simpler one.

Example 8.8.1. Calculate the indefinite integral

Z
(2x − 3) cos(x2 − 3x + 4) dx

by the method of substitution.

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.8. INTEGRATION BY SUBSTITUTION 181

Solution. We begin with the substitution

u = x2 − 3x + 4.

Since
du
= 2x − 3,
dx
we also make the substitution
du = (2x − 3) dx.
Then
Z Z
2
(2x − 3) cos(x − 3x + 4) dx = cos(u) du

= sin u + C
= sin(x2 − 3x + 4) + C.

We can verify our answer by differentiating sin(x2 − 3x + 4) via the chain rule.

A similar technique can be used to evaluate definite integrals.

Example 8.8.2. Evaluate the definite integral

Z 2 p
3x2 1 + x3 dx.
0

du
Solution. If u = 1 + x3 then dx = 3x2 . Hence we make the following substitutions:

u = 1 + x3
du = 3x2 dx
u = 1 (when x = 0)
u = 9 (when x = 2).

Thus
Z 2 p Z 9√
3x2 1 + x3 dx = u du
0 1
9
2 3/2
= u
3 1
2
= (93/2 − 13/2 )
3
52
= .
3

In general, the reversal of the chain rule via integration gives rise to the change of variables
formula
Z b Z g(b)
′
f (g(x))g (x) dx = f (u) du, (8.19)
a g(a)

c
2020 School of Mathematics and Statistics, UNSW Sydney
182 CHAPTER 8. INTEGRATION

The substitutions
u = g(x), du = g ′ (x) dx,
are only to be understood within this context. Precise conditions under which formula (8.19) holds
is given by the following theorem.
Theorem 8.8.3 (Change of variables formula). Suppose that g is a differentiable function such
that g′ is continuous on [a, b]. If f is continuous on any interval I containing g(a) and g(b) then
the change of variables formula (8.19) holds.
Proof. Since f is continuous on I it has an antiderivative F : I → R by the first fundamental
theorem of calculus. By the chain rule (Theorem 4.2.2),
(F ◦ g)′ (x) = F ′ (g(x))g ′ (x)
= f (g(x))g ′ (x).
So by two applications of the second fundamental theorem of calculus,
Z b
f (g(x))g ′ (x) dx = (F ◦ g)(b) − (F ◦ g)(a)
a
= F (g(b)) − F (g(a))
Z g(b)
= f (u) du,
g(a)

completing the proof.

Finding a fruitful substitution is not always easy (and sometimes not even possible). In principle,
one looks for a function g in the integrand whose derivative g ′ is also in the integrand. Once such
a function g is identified, try the substitution u = g(x). As illustrated below, it may be necessary
to manipulate the integrand to implement this strategy.
Example 8.8.4. Evaluate √
Z
e x
√ dx.
x
Solution. Note that
d √ 1
x = √
dx 2 x
√
and so if g(x) = x then g ′ (x) appears in the integrand (up to the constant factor 1/2). So the
substitution we use is
√ 1
u = x, du = √ dx.
2 x
Hence
Z √x Z √x
e e
√ dx = 2 √ dx
x 2 x
Z
= 2 eu du

= 2eu + C
√
x
= 2e + C.

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.8. INTEGRATION BY SUBSTITUTION 183

There are occasions when we make a substitution of the form x = g(u) rather than u = g(x).
Example 8.8.5. Evaluate the integral I, where
Z 9
dx
I= √ .
4 2+ x
Solution. If the integral were Z 9
dx
,
4 2+x
then it would be easy to evaluate. The difficulty with I clearly lies with the square root appearing
in the integrand. To remove the square root, we use the substitution x = u2 . This gives

x = u2
dx = 2u du
u = 2 when x = 4
u = 3 when x = 9.

(Note that we are implicitly assuming that u > 0. It would be incorrect to take the limits in the
variable u to be −2 and 3, for example.) Hence
Z 3
2u du
I=
2 2+u
Z 3
u
=2 du
2 2 + u
Z 3
2+u−2
=2 du
2 2+u
Z 3
2
=2 1− du
2 2+u
h i3
= 2 u − 2 ln |2 + u|
2

4
= 2 + 4 ln .
5

If an integrand possesses symmetry, then the corresponding definite integral may be easy to
evaluate. For example, if f is an odd function and we integrate f over a balanced interval [−a, a],
then the ‘negative area’ cancels out the ‘positive area’ and consequently the integral is zero (see
Figure 8.14). This result is proved using integration by substitution.
Proposition 8.8.6. Suppose that f is a continuous function and a is a real number.
(i) If f is even then Z Z
a a
f (x) dx = 2 f (x) dx.
−a 0

(ii) If f is odd then Z a

f (x) dx = 0.
−a

c
2020 School of Mathematics and Statistics, UNSW Sydney
184 CHAPTER 8. INTEGRATION
f (x)

−a
| |
a x

Figure 8.14: Integration with an odd function f over [−a, a]

(iii) If f is periodic with period T then

Z a+T Z T
f (x) dx = f (x) dx.
a 0

Proof. We prove (ii) only and leave the proofs of (i) and (iii) as an exercise. Suppose that f is an
odd function. By breaking up the integral and using a substitution we have
Z a Z 0 Z a
f (x) dx = f (x) dx + f (x) dx
−a −a 0
Z 0 Z a
=− f (−u) du + f (x) dx (substituting u = −x)
Z aa Z a0
= f (−u) du + f (x) dx
0
Z a Z0 a
=− f (u) du + f (x) dx (since f (−u) = f (u))
0 0
= 0,

completing the proof of (ii).

Example 8.8.7. Find

Z π
17
+2π
sin5 x dx.
π
17

Solution. If f (x) = sin5 x then f is both odd and periodic with period 2π. Hence
Z π Z
17
+2π 2π
5
sin x dx = sin5 x dx (by Proposition 8.8.6 (iii) when a = π
17 )
π
17
0
Z π
= sin5 x dx (by Proposition 8.8.6 (iii) when a = −π)
−π
=0 (by Proposition 8.8.6 (ii)).

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.9. INTEGRATION BY PARTS 185

8.9 Integration by parts

(Ref: SH10 §8.2)
Suppose that u and v are two functions of a variable x. The product rule for differentiation gives
d du dv
(uv) = v +u .
dx dx dx
By integrating both sides with respect to x we obtain
Z Z
du dv
uv = v dx + u dx.
dx dx
Rearranging this equation gives the integration by parts formula
Z Z
dv du
u dx = uv − v dx. (8.20)
dx dx

For definite integrals the formula becomes

Z b h ib Z b
dv du
u dx = uv − v dx. (8.21)
a dx a a dx

by the second fundamental theorem of calculus.

Integration by parts allows one to evaluate the integral of a product of two functions. Its success
depends on choosing u and v such that
Z b Z b
du dv
v dx is easier to calculate than u dx.
a dx a dx
Once u and v are chosen, simply apply formula (8.20) or (8.21).
Remark 8.9.1. The shorthand version
Z Z
uv ′ = uv − vu′

of formulae (8.20) and (8.21) is easier to remember.

Example 8.9.2. Evaluate the integral Z
xe2x dx.

Solution. We set

u=x v = 12 e2x
u′ = 1 v ′ = e2x .

Then integration by parts (see formula (8.20)) gives

Z Z
2x 1 2x 1 2x
xe dx = 2 xe − 2e × 1 dx

= 12 xe2x − 14 e2x + C,

c
2020 School of Mathematics and Statistics, UNSW Sydney
186 CHAPTER 8. INTEGRATION

completing the solution.

(Note that if we had instead set
u = e2x v = 21 x2
u′ = 2e2x v ′ = x.
then integration by parts gives
Z Z
2x 1 2 2x
xe dx = 2x e − x2 e2x dx

and the integral on the right-hand side is harder to evaluate than the one we started with.)

Sometimes integration by parts must be applied several times to evaluate the integral.
Example 8.9.3. Find the integral I, where
Z π
I= x2 sin x dx.
0

Example 8.9.4. Evaluate the indefinite integral I, where

Z
I = ex sin x dx
R
Integration by parts can also be used to evaluate some integrals of the form f (x) dx. The
trick is to rewrite the integral as Z
f (x) × 1 dx.

Example 8.9.5. Evaluate the integral

Z
cos−1 x dx.

8.10 Improper integrals

(Ref: SH10 §11.7)
So far we have defined definite integrals for bounded functions over finite intervals. In this section
we look at integrals over infinite intervals. Such integrals have many applications in probability,
statistics, physics and engineering.
To give meaning to the improper integral,
Z ∞
f (x) dx,
a
RR
we examine the behaviour of a f (x) dx as R → ∞, as illustrated below.
f (x)

| |
a R x

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.10. IMPROPER INTEGRALS 187

Definition 8.10.1. (a) Suppose that there is a real number L such that
Z R
f (x) dx → L
a

Ras∞R → ∞. Then f is said to be integrable over [a, ∞) and the integral

a f (x) dx is said to be convergent.
Z ∞
Morever, we write

f (x) dx = L.
a

(b) Suppose that

Z R
f (x) dx
a
does not have a limit as R → ∞. Then
R ∞ f we say that f not integrable over
(a, ∞) and we say that the integral a f (x) dx is divergent.

Rb
One can write down a similar definition for improper integrals of the form −∞ f (x) dx.

Example 8.10.2. Evaluate the following improper integrals or show that they diverge:
Z ∞
1
(a) 2
dx;
0 x +1
Z ∞
1
(b) √ dx;
1 x
Z 0
(c) e2x dx.
−∞

Solution. (a) If R is a real number then

Z R h iR
1 −1
dx = tan
0 x2 + 1 0
−1
= tan R − 0
π
→
2
as R → ∞. Hence the improper integral converges and
Z ∞
1 π
2
dx = .
0 x +1 2
(b) If R is a real number then
Z R h iR
x−1/2 dx = 2x1/2
1 1
√
=2 R−2
→∞

as R → ∞. Hence the improper integral diverges.

c
2020 School of Mathematics and Statistics, UNSW Sydney
188 CHAPTER 8. INTEGRATION

We now consider improper integrals whose interval of integration is the entire real line.

Definition 8.10.3. We say f is integrable over (−∞, ∞) if f is integrable over both

(−∞, 0) and (0, ∞). In this case we write
Z ∞ Z 0 Z ∞
f (x) dx = f (x) dx + f (x) dx.
−∞ −∞ 0
If f is not integrable on either of the intervals (−∞, 0) or (0, ∞), then we say that
the improper integral Z ∞
f (x) dx
−∞
diverges.

Z ∞
The following example shows that the improper integral f (x) dx may diverge even if
−∞
Z R
lim f (x) dx exists.
R→∞ −R

Example 8.10.4. Consider the improper integral

Z ∞
x dx.
−∞

If R is a positive number then Z R

x dx = 0
−R
since the function f , given by f (x) = x, is odd (see Proposition 8.8.6). Hence
Z R
lim x dx = 0.
R→∞ −R

However Z R h
1 2
iR R2
x dx = 2 x = →∞
0 0 2
as R → ∞. Hence Z ∞
x dx
0
diverges, and by Definition 8.10.3 Z ∞
x dx
−∞
diverges also.
The following proposition determines the convergence or divergence of improper integrals whose
integrand is a power of x. Such integrals will be used frequently in Section 8.11
Proposition 8.10.5 (Convergence and divergence of p-integrals). The improper integral
Z ∞
1
dx
1 xp
is convergent if p > 1 and divergent if p ≤ 1.

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.11. COMPARISON TESTS FOR IMPROPER INTEGRALS 189

Proof. If p 6= 1 then
Z R R
x1−p
x−p dx =
1 1−p 1
R1−p − 1
=
1−p
(
1
when 1 − p < 0
→ p−1
∞ when 1 − p > 0

as R → ∞. Hence the integral converges when 1 − p < 0 (that is, when p > 1) and diverges when
1 − p > 0 (that is, when p < 1).
In the case when p = 1, we have
Z R h iR
1
dx = ln x = ln R − ln 1 → ∞ as R → ∞.
1 x 1

Hence the integral diverges when p = 1.

Remark 8.10.6. While you may be familiar with the fact that the ln function is unbounded, the
proof that ln R → ∞ as R → ∞ requires the use of Riemann sums and is given later in §9.2.

8.11 Comparison tests for improper integrals

(Ref: SH10 §11.7)
Some of the most important improper integrals, such as
Z ∞
2
e−x dx
0

have integrands without any antiderivative in the elementary functions. In these cases, determining
convergence via a computation like
Z ∞ Z R
f (x) dx = lim f (x) dx = lim F (R) − F (a),
a R→∞ a R→∞

(where F is an antiderivative
R∞ of f ) is not a viable strategy. Instead, we compare the integral of f
with another integral a g(x) dx whose behaviour is known. The basic idea is as follows.
Suppose that 0 ≤ f (x) ≤ g(x) whenever x > a, as illustrated below.
y
g

a x

c
2020 School of Mathematics and Statistics, UNSW Sydney
190 CHAPTER 8. INTEGRATION

Obviously the area under the graph of g is greater than that under the graph of f . Hence if the
area under g is finite then the area under f is also finite. If the area under f is infinite then so too
is the area under g. The next theorem expresses this conclusion in terms of improper integrals.

Theorem 8.11.1 (The comparison test). Suppose that f and g are integrable functions and that
0 ≤ f (x) ≤ g(x) whenever x > a.
Z ∞ Z ∞
(i) If g(x) dx converges then f (x) dx converges.
a a
Z ∞ Z ∞
(ii) If f (x) dx diverges then g(x) dx diverges.
a a

Proof. We begin by proving R(i). Suppose that f and g are integrable functions, that 0 ≤ f (x) ≤ g(x)
∞
for all x in [a, ∞) and that 0 g(x) dx converges. Then
Z R Z R
0≤ f (x) dx ≤ g(x) dx
a a

whenever R > 0 by Proposition 8.4.1. Since the integral

Z R
f (x) dx
a
R∞
Rincreases
∞
as R increases and is bounded above by a g(x) dx, it has a limit as R → ∞. Hence
a f (x) dx is also convergent.
The proof of (ii) is left as an exercise.

When applying the comparison test, we often compare an improper integral I with
Z ∞
1
dx,
1 xp

since the behaviour of this second integral is known (see Proposition 8.10.5). The value of p is chosen
by analysing the dominant terms appearing in the integrand of I. The next example illustrates
this procedure.

Example 8.11.2. Determine whether or not the following improper integrals converge.
Z ∞
1
(a) 3
dx
1 x +1
Z ∞
x+2
(b) 3/2
dx.
2 x −1
Solution. (a) When x is large, the dominant term in the denominator of

1
x3 + 1

is x3 . So intuitively,
1 1
≈ 3
x3 + 1 x

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.11. COMPARISON TESTS FOR IMPROPER INTEGRALS 191

for large x. Consequently we aim to compare

Z ∞ Z ∞
1 1
3
dx with dx.
1 x +1 1 x3

To make this comparison rigorous, we seek an appropriate inequality. Note that x3 + 1 > x3
for all x > 1 and so
1 1
< 3
x3 + 1 x
for all x > 1. Since Z ∞
1
dx
1 x3
converges (see Proposition 8.10.5), Z ∞
1
dx
1 x3 +1
also converges by the comparison test.

There are many examples where the ‘dominant term analysis’ is straightforward but it is difficult
to obtain an appropriate inequality for successful use of the comparison test. This difficulty can
sometimes be overcome by multiplying one of the integrands by a ‘fudge factor’. The name ‘fudge
factor’ might sounds like we’re about to ‘cheat’ and do something that is not mathematically
rigourous, but despite the name we really will be finding an inequality that will rigourously prove
convergence or divergence of our improper integral.

Example 8.11.3. Determine whether or not the the improper integral

Z ∞
1
3/2
dx.
2 x −1
converges.

Solution. Dominant term analysis suggests that

1 1
≈ 3/2
x3/2 −1 x
when x is large. So we compare
Z ∞ Z ∞
1 1
3/2
dx with dx.
2 x −1 2 x3/2

Now the integral to the right converges (by Proposition 8.10.5) but it is clear that

1 1
> 3/2
x3/2 −1 x
when x > 2, so we cannot immediately apply the comparison test.
However, by multiplying the integrand on the right with the ‘fudge factor’ 2, we see that
1 2
< 3/2 (8.22)
x3/2 − 1 x

c
2020 School of Mathematics and Statistics, UNSW Sydney
192 CHAPTER 8. INTEGRATION

when x > 2. (To see why, observe that when x > 2,

x3/2 − 1 > x3/2 − 21 x3/2 = 12 x3/2

whence (8.22) follows.) Since Z ∞

2
dx
2 x3/2
converges, Z ∞
1
dx.
2 x3/2 −1
must also.

Finding a fudge factor can be a lot of work in itself. Later we give a method for determining
the convergence of this integral that does not rely on the use of fudge factors.
Dominant term analysis is not always straightforward, and at times may not even be useful.
One must develop an intuition of how functions decay at infinity and of what comparisons to use.

Example 8.11.4. Determine whether or not the following improper integrals converge.
Z ∞ √
(a) e− x dx
1
Z ∞
sin x + 2
(b) √ dx.
1 x+1

Solution. (a) Dominant term analysis does not apply since there is only
√ one term appearing in the
integrand. Instead, we recognise that the decay of the exponential e− x is very rapid; it is certainly
much faster than the decay of x12 . This can be verified by the following limit calculation:
√
e− x x2
lim 1 = lim
x→∞ x→∞ ex1/2
x2
2x
= lim (by l’Hôpital’s rule)
x→∞ 1 x−1/2 ex1/2
2
4x3/2
= lim 1/2
x→∞ ex
4. 23 x1/2
= lim (by l’Hôpital’s rule)
x→∞ 1 x−1/2 ex1/2
2
12x
= lim
x→∞ ex1/2
..
. (two more applications of l’Hôpital’s rule)
24
= lim
x→∞ ex1/2
= 0.

Hence we conclude that

√ 1
e− x
<
x2
c
2020 School of Mathematics and Statistics, UNSW Sydney
8.11. COMPARISON TESTS FOR IMPROPER INTEGRALS 193

whenever x is sufficiently large. Since Z ∞

1
dx
1 x2
converges, Z ∞ √
e− x
dx
1
also converges by the comparison test.
(b) While the numerator of
sin x + 2
√
x+1
has no dominant term, intuitively we see that, since the numerator is bounded and the dominant
√
term of the denominator is x, we should compare the integrand with

1
√ .
x

Since Z ∞
1
√ dx
1 x
diverges, we seek an inequality of the form

sin x + 2
f (x) ≤ √
x+1
√
where f (x) is 1/ x with an appropriate fudge factor.
Now, −1 ≤ sin x, and so

sin x + 2 1
√ ≥√
x+1 x+1
1
≥ √
2 x

for all x ≥ 1. Since Z ∞

1
√ dx
1 2 x
diverges (see Proposition 8.10.5), Z ∞
sin x + 2
√ dx
1 x+1
also diverges by the comparison test.

The next theorem provides an alternative approach for constructing a comparison of integrals.
It is particularly useful if dominant term analysis is straightforward but one wants to avoid the use
of fudge factors.

Theorem 8.11.5 (The limit form of the comparison test). Suppose that f and g are non-negative
and bounded on [a, ∞). If
f (x)
lim =L
x→∞ g(x)

c
2020 School of Mathematics and Statistics, UNSW Sydney
194 CHAPTER 8. INTEGRATION

and 0 < L < ∞ then either

Z ∞ Z ∞
f (x) dx and g(x) dx both converge
a a
or Z ∞ Z ∞
f (x) dx and g(x) dx both diverge.
a a

The hypothesis
f (x)
lim =L
x→∞ g(x)

indicates that f and g exhibit similar behaviour at infinity. The proof of the theorem will be given
at the end of this section.
Example 8.11.6. Determine whether or not the following improper integrals converge.
Z ∞
1
(a) √ dx
1 1 + x + x2
Z ∞ 2
x + 3x
(b) dx.
2 5x4 − 2
1
Solution. (a) Suppose that f (x) = √ . We need to find a suitable function g to compare
1 + x + x2
f with. As x gets large, the dominant term under the square root is x2 . Therefore it seems that
1 1 1
√ ≈√ =
1 + x + x2 x2 x

when x is large. So define g by g(x) = x1 . The following calculation verifies that our choice of g is
suitable:
r s
f (x) x x2 1
lim = lim √ = lim = lim 1 = 1.
x→∞ g(x) x→∞ 1+x+x 2 x→∞ 1+x+x 2 x→∞
x2
+ x1 + 1

Since this limit is a Rpositive real number, we can

R∞apply the limit form of the comparison test. By
∞
Proposition 8.10.5, 1 g(x) dx diverges. Hence 1 f (x) dx also diverges.

Remark 8.11.7. As a general rule, students will find the limit form of the comparison test easier
to use than the inequality form of the comparison test when the integrand is a rational function,
or is a ratio of roots of polynomials.
Proof of Theorem 8.11.1 (MATH1141 only). Suppose that f and g are non-negative and bounded
on [a, ∞). (It follows, for later use, that if a < M < R and R is increasing then the integrals
Z R Z R
f (x) dx and g(x) dx,
M M

are also increasing.) Suppose also that

f (x)
lim =L
x→∞ g(x)

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.11. COMPARISON TESTS FOR IMPROPER INTEGRALS 195

where 0 < L < ∞. Choose a positive number ǫ such that 0 < ǫ < L. By Definition 2.2.1, there is
a positive real number M such that

f (x)
g(x) − L < ǫ

whenever x > M . Hence

f (x)
−ǫ < −L<ǫ ∀x > M
g(x)

and rearranging gives

0 < (L − ǫ)g(x) < f (x) < (L + ǫ)g(x) ∀x > M.

By integrating with respect to x we obtain

Z R Z R Z R
0 < (L − ǫ) g(x) dx < f (x) dx < (L + ǫ) g(x) dx
M M M

whenever R > M . By letting R approach infinity it is seen that

R∞ RR RR
• if M g(x) dx is convergent then M f (x) is also
R ∞ convergent (since M f (x) increases as R
increases and it is bounded above by (L + ǫ) M g(x) dx);
R∞ RR RR
• if M f (x) dx is convergent then M g(x) is
R also convergent (since M g(x) is increases as
1 ∞
R increases and is bounded above by L−ǫ M f (x) dx);

R∞ RR
• if M f (x) dx is divergent then M g(x) dx → ∞ as R → ∞; and
Z ∞ RR
• if g(x) dx is divergent then M f (x) dx → ∞ as R → ∞.
M

Hence either Z Z
∞ ∞
f (x) dx and g(x) dx both converge
M M

or Z Z
∞ ∞
f (x) dx and g(x) dx both diverge.
M M

The proof of the theorem now easily follows by first observing that
Z R Z M Z R
f (x) dx = f (x) dx + f (x) dx
a a M

and
Z R Z M Z R
g(x) dx = g(x) dx + g(x) dx
a a M

whenever a < M < R, and then letting R approach infinity.

c
2020 School of Mathematics and Statistics, UNSW Sydney
196 CHAPTER 8. INTEGRATION
Si(x)
π

|
2

| |

−10π 10π x

− π2

|
Figure 8.15: The Si function.

8.12 Functions defined by an integral

Many important functions are defined by an integral. If f is a continuous function then we may
define a function F by Z x
F (x) = f (t) dt.
a
We note the following points.

• The derivative of F may be calculated by the first fundamental theorem of calculus.

• The value of F (x) may be approximated by computing Riemann sums.

• Symmetries of f can help in understanding the behaviour of F (see, for example, Propo-
sition 8.8.6).

• The behaviour of F (x) as x → ∞ or as x → −∞ may be ascertained by studying an

improper integral of f .

We give one example. Others may be found in the problem set for Chapter 8 and at the beginning
of Chapter 9.
Consider the function f : R → R given by
(
sin t
t if t 6= 0
f (t) =
1 if t = 0.

sin t
Since lim = 1, the function f is continuous everywhere. We can now define the function
t→0 t
Si : R → R by Z x
Si(x) = f (t) dt ∀x ∈ R.
0
The name Si comes from ‘sine integral.’ This function is used in electrical engineering for signal
processing and in surveying for the Global Positioning System (GPS). Its graph is plotted in Figure
8.15.

Example 8.12.1. Consider the functions f and Si defined above.

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.12. FUNCTIONS DEFINED BY AN INTEGRAL 197

(a) Find the value of Si(0).

(b) Show that Si is an odd function.

(c) Find and classify the critical points of Si on (0, 3π).

(d) A function G : R → R is defined by the formula

Z x2
G(x) = f (t) dt.
0

Calculate G′ (x).
Z 0
Solution. (a) Si(0) = f (t) dt = 0.
0
(b) We need to show that Si(−x) = − Si(x). Now
Z −x
sin t
Si(−x) = dt
0 t
Z x
sin(−u)
= (−du) (using the substitution t = −u)
0 −u
Z x
− sin u
=− du
−u
Z0 x
sin u
=− du
0 u
= − Si(x).

Hence Si is an odd function.

(c) By the first fundamental theorem of calculus,
sin x
Si′ (x) =
x
for all x in (0, 3π). Critical points occur when Si′ (x) = 0, that is, when x = π or when x = 2π. To
classify the critical points, we calculate the second derivative:
x cos x − sin x
Si′′ (x) = .
x2
Now Si′′ (π) = − π1 and Si′′ (2π) = 2π
1
. Therefore Si has a local maximum point when x = π and a
local minimum point when x = 2π.
(d) Note that G(x) = Si(x2 ), so to calculate G′ (x) we use the chain rule:

d 2x sin(x2 ) 2 sin(x2 )
G′ (x) = Si(x2 ) = 2x Si′ (x2 ) = 2x × f (x2 ) = = .
dx x2 x
(If this calculation is too dense, then consider

y = G(x), u = x2

so that Z u
dy
y= f (t) dt and = f (u)
0 du

c
2020 School of Mathematics and Statistics, UNSW Sydney
198 CHAPTER 8. INTEGRATION

by the first fundamental theorem of calculus. Then by the chain rule,

dy dy du 2x sin(x2 ) 2 sin(x2 )
= = f (u) × 2x = 2xf (x2 ) = = .
dx du dx x2 x
2 sin(x2 )
Hence G′ (x) = x .)

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.13. MAPLE NOTES 199

8.13 Maple notes

Z
The command int(f(x),x); will cause Maple to attempt to find the indefinite integral f (x) dx,
Z b
and the command int(f(x),x=a..b); is used to find definite integrals f (x) dx. If Maple is
a
unable to find a primitive, then numerical integration can be carried out by combining the int and
evalf commands.
For example,
> int(1/(x*log(x)^2),x);

− (ln (x))−1
> int(1/(x*log(x)^2),x=2..infinity);
(ln (2))−1
> int( exp(-x^2)*ln(x), x=0..1 );
R 1 −x2
0e ln (x) dx
> evalf(%);
−0.9059404763

c
2020 School of Mathematics and Statistics, UNSW Sydney
200 CHAPTER 8. INTEGRATION

Problems for Chapter 8

Problems 8.1 : Area and the Riemann Integral and

8.2 : Integration using Riemann sums
1. [R] [V]

1 2
a) By taking the partition Pn = 0, , , . . . , 1 of the interval [0,1], calculate the
n n
lower sum S Pn (f ) and the upper sum S Pn (f ) for each function f .
i) f (x) = 1
ii) f (x) = x
iii) f (x) = x2
n
X
[You may need k2 = 16 n(n + 1)(2n + 1).]
k=1
iv) f (x) = x3
n
X
[You may need k3 = 14 n2 (n + 1)2 .]
k=1
(
1 if x ∈ Q
v) f (x) =
0 if x ∈
/Q
b) By taking the limit as n → ∞ for each sum S Pn (f ) and S Pn (f ) calculated in (a),
Z 1
either calculate f (x) dx, or show that f is not Riemann integrable.
0

1
2. [R] An electrical signal S(t) has its amplitude |S(t)| tested (sampled) every 10 of a second.
It is desired to estimate the energy over a period of half a second, given exactly by

Z 1
!1
2
2
|S(t)|2 dt .
0

The results of the measurement are shown in the following table:

t .1 .2 .3 .4 .5
|S(t)| 60 50 50 45 55
e(t) 5 3 7 4 10

a) Using the above data for S(t), set up an appropriate Riemann sum and compute an
approximate value for the energy.
b) It is known that the signal varies by an amount of at most ±e(t), as shown above, in
1
each 10 second period. Calculate upper and lower bounds for the energy.

c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 8 201

3. [X] Consider the partition Pn of [1, 2], given by Pn = {q 0 , q 1 , q 2 , . . . , q n } where q n = 2.

(Notice that (i) the divisions are not of equal width and (ii) 1 < q < 2 and q → 1 as
n → ∞.) If f (x) = xj for some positive integer j, then evaluate the integral
Z 2
f (x) dx
1

by calculating the limit lim S Pn (f ) of the corresponding lower Riemann sums.

n→∞

Problems 8.3 : The Riemann integral and signed area and

8.4 : Basic properties of the Riemann integral

4. [R] Find the area of the region bounded by the line y = x and the parabola y = x2 − 2.

5. [R] Find
Z 9 Z 2
x3 − x
a) dx b) |x| dx.
4 x3/2 −4

6. [H] Find a function f which satisfies the integral equation

Z x Z 0
tf (t) dt = (t2 + 1)f (t) dt + x.
0 x

1 1
Z 1
1
7. [R] Explain why dx = − = −1 − 1 = −2 is not valid.
−1 x2 x −1

8. [H]

a) Suppose that f is a continuous increasing (and hence invertible) function on [a, b]. If
c = f (a), d = f (b) and a, b, c, d ≥ 0, then explain why
Z d Z b
f −1 (t) dt = bd − ac − f (x) dx.
c a

Z 1
b) Use this to find sin−1 x dx.
1/2

9. [H] Suppose that U ′ (x) = u(x).

Z x
a) Find V ′ (x) if V (x) = (a − x)U (x) + U (t) dt where a is a constant.
0
Z a Z a
b) Hence show that U (x) dx = aU (0) + (a − x)u(x) dx.
0 0

c
2020 School of Mathematics and Statistics, UNSW Sydney
202 CHAPTER 8. INTEGRATION

Problems 8.5 : The first fundamental theorem of calculus

Z x
10. [H] Suppose that f (t) = ⌊t⌋ and F (x) = f (t) dt, where ⌊t⌋ is the greatest integer less
0
than or equal to t. Use a graph of f to sketch F on the interval [−1, 3]. Is F continuous?
Where is F differentiable?

11. [H] Suppose that f (t) = sin(t2 ). Sketch the graph of f on Zthe interval [0, 3]. Use this to
x
sketch the graph of F on the interval [0, 3], where F (x) = f (t) dt. Indicate where F
0
has local maxima and minima.

12. [R] [V] Find F ′ (x) for each function F : R → R given below.
Z x Z x3
2
a) F (x) = sin(t ) dt b) F (x) = sin(t2 ) dt
0 0
Z 1 Z x3
2
c) F (x) = sin(t ) dt d) F (x) = sin(t2 ) dt
x3 x

Z 4
d
13. [R] Find (5 − 4t)5 dt.
dx x

Problems 8.6 : The second fundamental theorem of calculus

14. [R]
1
a) Suppose that f (x) = . By considering the lower Riemann sum for f with respect
x
to the partition
n n+1 n+2 2n
, , ,...,
n n n n
of [1, 2], show that

1 1 1
ln 2 = lim + + ··· + .
n→∞ n+1 n+2 2n

1
b) Suppose that f (x) = √ .
1 − x2
i) Show that f is increasing on the interval [0, 12 ].
ii) Find the upper Riemann sum for f with respect to the partition

0 1 2 3 n
, , , ,...,
2n 2n 2n 2n 2n

of [0, 21 ].
iii) Hence evaluate

1 1 1 1
lim √ +√ +√ + ··· + √ .
n→∞ 4n2 − 12 4n2 − 22 4n2 − 32 4n2 − n2

c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 8 203

Problems 8.7 : Indefinite integrals

15. [R] Evaluate the following integrals by inspection.

Z Z √
x2 sin x
a) x e dx b) √ dx
x
Z 1 Z a p
2 3
c) 2x(1 + x ) dx d) x2 a3 − x3 dx (a > 0)
0 −a
Z π/2 Z 0 p
e) cos3 x sin x dx f) [H] t2 + t4 dt
0 −1

Problems 8.8 : Integration by substitution

16. [R] Use a substitution to evaluate the following integrals.

Z Z
dx
a) √ b) x(5x − 1)19 dx
1+ x
Z Z 4
1−x dx
c) [V] 3
dx d) [V] √
(1 + x) 0 5+ x
Z
1 + t2
17. [X] Use the substitution u = t − t−1 to find dt.
1 + t4

Problems 8.9 : Integration by parts

18. [R] Use integration by parts to evaluate the following integrals.

Z 1 Z Z
a) x e5x dx b) x2 cos x dx c) ln x dx
0
Z 0.5 Z e Z π
−1 7
d) sin x dx e) x ln x dx f) x2 cos 2x dx
0 1 0
Z Z Z π/4
x −1
g) [V] e cos x dx h) tan x dx i) [H] sec3 θ dθ
0

Problems 8.10 : Improper integrals

19. [R] [V] Evaluate the following improper integrals or show that they diverge.
Z ∞ Z 1 Z ∞
dx
a) e−5x dx b) e−0.01x dx c)
0 −∞ 0 4 + x2
Z ∞ Z ∞ Z ∞
4 dx dx
d) x3 e−x dx e) 3/2
f)
−∞ 2 (x − 1) e x ln x
Z ∞
20. [H] Prove that xn e−x dx = n! whenever n = 0, 1, 2, . . . .
0

21. [H]

c
2020 School of Mathematics and Statistics, UNSW Sydney
204 CHAPTER 8. INTEGRATION
Z R
x
a) Find lim dx.
R→∞ −R 1 + x2
Z 2R
x
b) Find lim dx.
R→∞ −R 1 + x2
Z ∞
x
c) Does dx converge? Explain.
−∞ 1 + x2

Problems 8.11 : Comparison tests for improper integrals

22. [R] Use the inequality form of the comparison test to determine whether or not the
following improper integrals converge.
Z ∞ Z ∞ Z ∞
1 1 1
a) [V] √ dx b) √ dx c) dx
1 1+x 4
2
3 2
x −x 2 ln x

23. [R] Use the limit form of the comparison test to determine whether or not the following
improper integrals converge.
Z ∞ Z ∞ Z ∞
x 2x − 1 1
a) 3−1
dx b) 2+2
dx c) √ dx
2 2x 1 x 2
6
x −1

24. [R] Use a comparison test to determine whether or not the following improper integrals
converge.
Z ∞ Z ∞ 3 Z ∞
3x + sin x + 2 4x − x + 5 ln t
a) 3
dx b) 4 2
dx c) [H] dt
1 2x − x + 8 4 x −x +1 2 t3/2

25. [H] Find all real numbers s such that the improper integral
Z ∞
xs
dx
1 1+x

is convergent.
Z ∞
1
26. [H] Find all real numbers p such that dx converges.
2 x(ln x)p
Z ∞
xb
27. [H] For which pairs of numbers (a, b) does the improper integral dx con-
1 (1 + x2 )a
verge?

Problems 8.12 : Functions defined by an integral

28. [R] [V] Given a positive real number x, let π(x) denote the number of primes less than
or equal to x. The function Li with domain (1, ∞) is given by
Z x
1
Li(x) = dt
2 ln t

c
2020 School of Mathematics and Statistics, UNSW Sydney
8.13. MAPLE NOTES 205

and is known as the ‘logarithmic integral function’. It has the property that

Li(x)
≈1
π(x)

when x is sufficiently large.

a) Evaluate π(10), π(20) and π(3.14159).

π(x)
b) Suppose that x > 0. What does represent?
x
d
c) Find dx Li(x) and Li(2).
d) By applying the mean value theorem to Li on the interval [2, 106 ], find a lower bound
for Li(106 ).
e) If x is large then
π(x) π(x) Li(x) Li(x)
≈ = .
x x π(x) x
Using this approximation and your answer to part (d), find an approximate lower
π(106 )
bound for .
106
π(106 )
Note: There are 78, 498 primes less than one million so the actual value of is
106
0.078498.

29. [R] The function erf : R → R is defined by the formula

Z x
2 2
erf(x) = √ e−t dt.
π 0

The function erf is an error function and can be used to calculate the probability that a
measurement has an error in a given range of values.

a) Calculate erf ′ (x).

b) Explain why erf is an increasing function on R.
c) [H] Show that erf is an odd function.
d) i) By calculating Riemann sums with respect to the partition {0, 41 , 12 , 34 , 1}, find
upper and lower bounds for erf(1).
2
ii) Explain why e−t < e−t whenever t > 1.
Z ∞
2
iii) Hence show that e−t dt converges and find an upper bound for this improper
1
integral.
iv) Using your answers to (i) and (iii), find an upper bound for lim erf(x). (In fact,
x→∞
lim erf(x) = 1 but this is not so easy to prove.)
x→∞
e) Sketch the graph of erf.
f) Explain why erf has an inverse function erf −1 and sketch its graph.

c
2020 School of Mathematics and Statistics, UNSW Sydney
206 CHAPTER 8. INTEGRATION

c
2020 School of Mathematics and Statistics, UNSW Sydney
207

Chapter 9

The logarithmic and exponential

functions

The volume of a colony of bacteria on an agar plate is 1 cubic millimetre and doubles every day.
Let V (t) denote the volume of bacteria after t days. It is clear that

V (0) = 1, V (1) = 2, V (2) = 4, V (3) = 8, V (4) = 16,

and so on. In general

V (t) = 2t

if t is a nonnegative integer. The corresponding points are plotted in Figure 9.1 (a).
If one wanted to determine the volume of the colony part-way through a day, it would be natural
to use the formula
V (t) = 2t ,

where t is a nonnegative real number. This formula makes sense when t is a rational number. If
t = p/q where p and q are integers and q > 0, then 2t is defined to be the unique positive qth root
of 2p√. However, if t is an irrational number, this definition does not apply. So what do we mean
by 2 3 or 2π ? We have not yet seen a definition for such numbers.
That this shortcoming in our current definition of 2t is a serious problem can be illustrated
graphically. The graph corresponding to V (t) = 2t , where t is confined to the nonnegative rational
numbers, has an infinite number of ‘gaps’ (see Figure 9.1 (b)). Clearly this is not a satisfactory
state of affairs. One of the aims of this chapter is to rectify this deficiency and show that, when
t is an irrational number, 2t can be defined in such a way that the graph of V : [0, ∞) → R is
continuous (see in Figure 9.1 (c)).
Our plan of attack is the following.

1. In Section 9.2, we define the function ln as an integral (by the fundamental theorem of
calculus).

2. In Section 9.3, we define the function exp as the inverse of ln (by the inverse function
theorem).

3. In Section 9.4, we define the number bt , where b > 0 and t is an irrational number, by a
formula involving both ln and exp.

c
2020 School of Mathematics and Statistics, UNSW Sydney
208 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS
V (t) V (t) V (t)
16 b
16 16
|

|
b

b
b

| | |

(a) 4 t (b) 4 t (c) 4 t

Figure 9.1: Growth in volume of bacteria on an agar plate.

We shall see that when the definition of bt (for rational t) is combined with the definition of bt (for
irrational t), the resulting function f : R → R, given by

f (t) = bt ∀t ∈ R,

is continuous.

9.1 Powers and logarithms

The content of this section is revision, and is placed here primarily as reference. It summarises a
few known definitions and facts about powers and logarithms.

Definition 9.1.1. Suppose that b is a positive real number. If p and q are integers
and q > 0 then bp/q is defined to be the unique positive qth root of bp .

The following is an elementary definition of the logarithm to the base b.

Definition 9.1.2. Suppose that b is a positive real number not equal to one, c is a
rational number and a = bc . Then logb a is defined by the formula
c = logb a.

Note that, according to Definition 9.1.2, one cannot write down logb a for any positive number
a (even if a is rational); logb a is only defined if a = bc for some rational number c. (This short-
coming in the domain of the function logb will be rectified later in the chapter when Definition
9.1.2 is superseded by a superior definition.)

c
2020 School of Mathematics and Statistics, UNSW Sydney
9.2. THE NATURAL LOGARITHM FUNCTION 209

The elementary properties of logarithms are summarised below: if x > 0, y > 0 and b > 0 with
b 6= 1 then

logb 1 = 0, logb b = 1,

logb (xy) = logb x + logb y, logb xy = logb x − logb y,
logb (br ) = r, logb (xr ) = r logb x. (9.1)

Remark 9.1.3. In the properties listed above, it is implicitly assumed that r is a rational number,
and that x and y are of the form bc for some rational number c. One of the aims of this chapter is
to remove these awkward assumptions.

9.2 The natural logarithm function

(Ref: SH10 §7.2)
The ln function will already be familiar to you from high school. However, it was probably never
defined rigorously. In order to address the problems identified in the introduction to Chapter 9, it
will be necessary to start from scratch. Fortunately, by the fundamental theorem of calculus, this
is not difficult.

Definition 9.2.1. The function ln : (0, ∞) → R is defined by the formula

Z x
1
ln(x) = dt.
1 t

We read ln as either ‘ell en’ or as ‘log’.

Since the function ln is defined by an integral, ln x is simply the area of the shaded region shown
below in the case when x > 1.
1/t

1 x t
One can also immediately see, by the first fundamental theorem of calculus, that
d 1
ln x = .
dx x
This and other properties of ln are stated in the proposition below.

c
2020 School of Mathematics and Statistics, UNSW Sydney
210 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS

Proposition 9.2.2. The function ln has the following properties:

d 1
(i) ln is differentiable on (0, ∞) and ln x = ;
dx x
(ii) ln x > 0 if x > 1, ln 1 = 0 and ln x < 0 if 0 < x < 1;

(iii) ln x → −∞ as x → 0+ and ln x → ∞ as x → ∞;

(iv) ln(xy) = ln(x) + ln(y) for all positive real numbers x and y;

(v) ln xy = ln(x) − ln(y) for all positive real numbers x and y; and

(vi) ln(xr ) = r ln(x) whenever r is a rational number and x is a positive real number.

Proof. (i) Apply the first fundamental theorem of calculus (Theorem 8.5.1) to the definition of ln.
1
R ∞ 1t > 0 when t > 0.
(ii) This follows from Definition 8.4.2 and the fact that
(iii) We need to show that the improper integral 1 t dt diverges to infinity. The diagram
below shows that
Z 2 Z 4 Z 8
dt 1 dt 1 dt 1
≥1× , ≥2× , ≥4×
1 t 2 2 t 4 4 t 8

and so on.
y 1
y=
t

1 2 4 8 t

Hence
Z 2n
dt 1 1 1
≥ + + ··· +
1 t |2 2 {z 2}
n terms
n
=
2
→∞

as n → ∞. Hence the integral diverges and therefore ln x → ∞ as x → ∞.

This argument can be adapted to show that
Z 1
dt
→ −∞
2−n t

as n → ∞. Hence ln x → −∞ as x → 0+ .
(iv) Suppose that y is some fixed positive real number and that x ∈ (0, ∞). Then, by the chain
rule and part (i),
d 1 1 d
ln(xy) = y ln′ (xy) = y × = = ln(x) .
dx xy x dx

c
2020 School of Mathematics and Statistics, UNSW Sydney
9.2. THE NATURAL LOGARITHM FUNCTION 211

Hence
ln(xy) = ln(x) + C

for some constant C. When x = 1 we obtain

ln(y) = ln(1) + C = 0 + C = C

by part (ii). Hence

ln(xy) = ln(x) + C = ln(x) + ln(y)

as required.
(v) The proof uses the same technique as the proof of (vi) and is left to the reader.
(vi) This proof also uses the same technique as the proof of (iv).

The fact that ln(1) = 0 and that ln satisfies properties (iv), (v) and (vi) suggests that ln coincides
with one of the logarithm functions logb for some base b (with the advantage that Dom(ln) =
(0, ∞)). We now aim to identify the base.
The function ln is increasing and continuous with Range(ln) = R (see Proposition 9.2.2 (i) and
(ii)). Hence, by the intermediate value theorem, there is exactly one real number x that satisfies
the equation
ln(x) = 1.

We call this solution e (after Leonard Euler, who was probably the greatest mathematician of the
eighteenth century).

Definition 9.2.3. The real number e is defined to be the unique number x that
satisfies the equation Z x
1
dt = 1.
1 t

Thus ln(e) = 1. One can show that e is irrational and that e ≈ 2.71828. From Proposition
9.2.2, we see now that
ln(er ) = r

for every rational number r. This equation shows that ln coincides with the logarithm to the base
e. Since the number e arises naturally in many contexts, ln is usually called the natural logarithm.
By using the properties of ln given by Propoistion 9.2.2, one can draw the graph of ln.

c
2020 School of Mathematics and Statistics, UNSW Sydney
212 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS
ln x

1 e x

d
We note that, although ln(x) → ∞ as x → ∞, we have dx ln(x) = x1 → 0 as x → ∞. This means
the graph of ln becomes flatter towards infinity and hence approaches infinity very slowly.

9.3 The exponential function

(Ref: SH10 §7.4)
d
In Section 9.2 it was shown that dx (ln x) > 0 for all x > 0. Therefore, by the inverse function
theorem (Theorem 6.4.1), ln has an inverse function.

Definition 9.3.1. The function exp : R → (0, ∞) is defined to be the inverse

function of ln : (0, ∞) → R.

By reflecting the graph of ln in the line y = x, one obtains the graph of exp.
y

(1, e)
b

1 b

y = exp x (e, 1)
1 x

y = ln x

Proposition 9.3.2. The function exp : R → (0, ∞) has the following properties:

(i) exp(ln x) = x for all x in (0, ∞) and ln(exp x) = x for all x in R;

c
2020 School of Mathematics and Statistics, UNSW Sydney
9.3. THE EXPONENTIAL FUNCTION 213

(ii) exp(1) = e and exp(0) = 1;

(iii) exp x → ∞ as x → ∞ and exp x → 0 as x → −∞.
d
(iv) exp is differentiable on R and dx (exp x) = exp x for all x in R;
(v) exp(x + y) = exp(x) exp(y) for all x and y in R; and
(vi) exp(rx) = (exp x)r for every real number x and every rational number r.
Proof. The proof of (i) follows immediately from Definition 9.3.1 and the proof of (ii) and (iii) then
follow from (i) and the continuity of ln and exp.
That exp is differentiable on R follows from the inverse function theorem. To complete the
proof of (iii), we differentiate both sides of the equation
ln(exp x) = x
with respect to x to obtain
1 d
× (exp x) = 1
exp x dx
d
by the chain rule. Rearranging gives dx (exp x) = exp x as required.
If x and y are real numbers then

exp(x + y) = exp ln(exp x) + ln(exp y) (by part (i))

= exp ln(exp x × exp y) (by Proposition 9.2.2 (iv))
= exp x × exp y (by part (i)).
This completes the proof of (iv)
To prove (v), suppose that r is a rational number and x is a real number. Then

exp(rx) = exp r ln(exp x) (by part (i))

= exp ln (exp x)r (by Proposition 9.2.2 (v))
(exp x)r (by part (i)).
This completes the proof of the proposition.
Note that by Proposition 9.3.2 (ii) and (v) we have
exp(r) = (exp 1)r = er (9.2)
for every rational number r. This inspires the following definition.

Definition 9.3.3. Suppose that x is an irrational number. Then we define the real
number ex by the formula
ex = exp x.

Thus Definition 9.3.3 extends Definition 9.1.1, in the case when b = e, to include irrational
powers. Moreover, since exp x = ex and exp is a continuous function, the function f : R → R, given
by f (x) = ex , is also continuous. We have thus made significant progress towards the problem
outlined at the beginning of the chapter. Finally, since exp x = ex , we call exp the exponential
function.

c
2020 School of Mathematics and Statistics, UNSW Sydney
214 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS

9.4 Exponentials and logarithms with other bases

(Ref: SH10 §7.5)
At the end of the last section, we gave a definition of ex , where x is an irrational number. Moreover,
if f : R → R is given by f (x) = ex , then f is a continuous and differentiable function. In this
section, we do the same for bx , where b is any positive real number.
To begin, consider a rational number r and any positive real number b. Then

br = exp ln(br ) = exp r ln b = er ln b (9.3)

by the definitions and properties established in the last two sections. This inspires the following
definition.

Definition 9.4.1. Suppose that b is a positive real number and x is an irrational

number. We define the number bx by the formula
bx = ex ln b .

By combining equation (9.3) and Definition 9.4.1 we see that

bx = ex ln b ∀x ∈ R. (9.4)

Since exp and ln are continuous functions, the function f : R → R, given by

f (x) = bx ∀x ∈ R,

is also continuous (see Proposition 3.1.2). This solves the problem articulated at the beginning of
the chapter. Thus
√ √
3 3 ln 2
2 =e ≃ 1.492106 . . . .

Moreover, we can discard our old definition of logb (see Definition 9.1.2), together with its
technical difficulties (see Remark 9.1.3), and replace it with a simpler and more powerful definition.

Definition 9.4.2. Suppose that b is a positive real number, b 6= 1. Then the function
logb : (0, ∞) → R is defined to be the inverse of the function f : R → (0, ∞), where
f (x) = bx
for all x in R.

We leave it as an exercise to show that the function f of Definition 9.4.2 is one-to-one and
hence invertible. The graphs of various exponential functions and their corresponding logarithmic
functions are shown below.

c
2020 School of Mathematics and Statistics, UNSW Sydney
9.5. INTEGRATION AND THE LN FUNCTION 215
y= 8x y= 3x
y y = 2x

y = log2 x
y = log3 x
1
y = log8 x

1 x

The following proposition gives a formula for logb in terms of ln. Its proof is left as an exercise.
Proposition 9.4.3. Suppose that b is a positive real number, b 6= 1. Then
ln x
logb x = ∀x ∈ (0, ∞). (9.5)
ln b
Using equations (9.4) and (9.5), one can easily establish familiar properties for the exponentials
and logarithms to the base b, with the advantage that we no longer need the technical restrictions
of Remark 9.1.3.

9.5 Integration and the ln function

We have already seen that
d 1
(ln x) =
dx x
provided that x > 0. It follows from the chain rule that
d 1 f ′ (x)
ln(f (x)) = × f ′ (x) = ,
dx f (x) f (x)
provided that f is differentiable and f (x) > 0. Hence
Z ′
f (x)
dx = ln(f (x)) + C.
f (x)
It is not difficult to remove the restriction that f (x) > 0. To take a simple example, suppose
that x 6= 0. Then
(
d
d dx ln x if x > 0
ln |x| = d
dx dx ln(−x) if x < 0
(
1
if x > 0
= x1
−x × (−1) if x < 0
1
= .
x

c
2020 School of Mathematics and Statistics, UNSW Sydney
216 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS

Thus Z
1
dx = ln |x| + C
x
provided x is restricted to an interval not containing zero. By generalising this calculation we
obtain the formula
Z ′
f (x)
dx = ln |f (x)| + C,
f (x)
provided that f is never zero over the interval of integration.

Example 9.5.1. Find

Z
(a) tan x dx
Z
(b) sec x dx.

sin x
Solution. (a) By using the fact that tan x = cos x we have
Z Z
− sin x
tan x dx = − dx
cos x
= − ln | cos x| + C.

(The indefinite integral of cot can be found similarly.)

(b) This example involves a special trick:
Z Z
tan x + sec x
sec x dx = sec x dx
tan x + sec x
Z
tan x sec x + sec2 x
= dx
sec x + tan x
= ln | sec x + tan x| + C.

(A similar trick can be used to find the indefinite integral of cosec.)

9.6 Logarithmic differentiation

(Ref: SH10 §7.3)
Logarithms are powerful because they transform powers into products, products into sums and
quotients into differences. In this section, we illustrate the use of the logarithm for differentiating
functions that are defined by (combinations of) powers, products or quotients.
dy
Example 9.6.1. Find dx in each case:

(a) y = 10x
3/5
(3x2 + 4)(x + 2)
(b) y =
x3 + 5x
(c) y = (sin x)cos x .

c
2020 School of Mathematics and Statistics, UNSW Sydney
9.7. INDETERMINATE FORMS WITH POWERS 217

Solution. (a) If we take ln of both sides of the equation y = 10x then

ln y = ln 10x
= x ln 10.

By differentiating with respect to x, we obtain

1 dy
= ln 10
y dx

and hence

dy
= (ln 10) × y
dx
= (ln 10)10x .

d x d x ln 10 )
Alternatively: dx (10 ) = dx (e = ln 10ex ln 10 = (ln 10)10x .

Remark 9.6.2. Recall that one can only take the logarithm of a positive number. Hence our
solution to Example 9.6.1 (b) and (c) is only valid when y > 0.

9.7 Indeterminate forms with powers

(Ref: SH10 §11.6)
Consider the limits
lim xx and lim x1/x . (9.6)
x→0+ x→∞

The first limit is of the form 00 while the second is of the form ∞0 . Both of these forms are
examples of indeterminate forms and hence we cannot say what the corresponding limits are in
each case without further calculation. Since each limit involves a power, it is natural to first take
the logarithm of the limit and then bring l’Hôpital’s rule into play.

Example 9.7.1. Evaluate the limit

lim x2x .
x→0+

Solution. The limit is an indeterminate form of the type 00 . By taking the natural logarithm, we
∞
can transform the limit into an indeterminate form of the type ∞ :

lim x2x = lim exp ln x2x (since ln and exp are inverses)
x→0+ x→0+

= lim exp 2x ln x
x→0+

= exp lim 2x ln x (since exp is continuous)
x→0+
ln x
= exp lim .
x→0+ 1/(2x)

c
2020 School of Mathematics and Statistics, UNSW Sydney
218 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS

We can now apply l’Hôpital’s rule to the problem. By differentiating the numerator and denomi-
nator and then simplifying we obtain
1/x
lim x2x = exp lim
x→0+ x→0+ −1/(2x2 )

= exp lim −2x
x→0+
= exp(0)
= 1.

Each of the limits in (9.6) can be evaluated by using the ideas in this example.

Remark 9.7.2. The limit of Example 9.7.1 is of the form 00 and has the value 1. The limit
1 x
lim e− x
x→0+

is also of the form 00 . However, its value is not 1:

1 x 1
lim e− x = lim e−1 = .
x→0+ x→0+ e

The fact that limits of the type 00 can have different values explains why 00 is an indeterminate
form.

c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 9 219

Problems for Chapter 9

Problems 9.1 : Powers and logarithms and

9.2 : The natural logarithm function
1. [R]

a) Write down the definition of ln x, where x > 0.

d 1
b) Explain why ln x = whenever x > 0.
dx x
c) Suppose that r is a rational number and that x and y are positive real numbers.
i) By first differentiating ln(xy) with respect to x, show that ln(xy) = ln x + ln y.

ii) Use the same technique to show that

x
ln = ln x − ln y and ln(xr ) = r ln x.
y

2. [R]

a) Prove, using upper and lower Riemann sums and the definition of ln x, that ln 2 <
1 < ln 4, and hence that 2 < e < 4.
5
b) [H] Use Maple and the method of part (a) to prove that 2 < e < 3. How many
partition points do you need?

3. [R] Find the derivatives of

√
a) f (x) = ln x3 + 1 b) g(x) = e|x|
5 +6)
c) h(x) = ln(ln(ln x)) d) q(x) = eln(x

Problems 9.3 : The exponential function and

9.5 : Integration and the ln function

4. [R] Find
Z Z Z
e2x e1/x
a) dx b) dx c) 3x dx
1 + e2x x2
Z √x Z Z
e ln x
d) √ dx e) dx f) cot x dx.
8 x x

(Hint for part (f ): express cot in terms of sin and cos.)

5. [R] Sketch the curves

(ex + x)
a) y = ln(1 + ex ) b) y = .
(ex − x)

c
2020 School of Mathematics and Statistics, UNSW Sydney
220 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS

6. [R]
ln x
a) Sketch the curve y = , noting any turning points and asymptotes.
x
b) By using (a) or otherwise, prove that π e < eπ .

7. [R] [V]
y
1
y= x

1
t
1+t

1 1+ 1 x
t

1 1 1
a) From the graph, explain why ≤ ln 1 + ≤ whenever t ≥ 0.
1+t t t
t
1 1 t
b) Deduce that lim ln 1 + = 1 and hence find the value of lim 1 + .
t→∞ t t→∞ t

Problems 9.6 : Logarithmic differentiation

dy
8. [R] Use logarithmic differentiation to find if
dx
1/5
x x3 − 3
a) y = 3 b) y =
1 + x2
c) y = (sin x)sin x d) y = sin(xsin x ).

Problems 9.7 : Indeterminate forms with powers

9. [R] Calculate the following limits:

ln x
a) [V] lim a , a > 0 b) lim xa ln x, a>0
x→∞ x x→0+

c) lim xx d) lim x2/ ln x

x→0+ x→0+

e) lim x1/x f) lim a1/x , a > 0

x→∞ x→∞
a x
g) lim 1 + h) lim x100 e−x
x→∞ x x→∞
−x
i) lim p(x) e , where p is any polynomial.
x→∞

10. [H] Prove that the functions f : (−1, ∞) → R and g : (−1, ∞) → R, given by

x2 x2 x3
f (x) = ln(1 + x) − x − and g(x) = x − + − ln(1 + x),
2 2 3

c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 9 221

are increasing on (0, ∞). Deduce that

x2 x2 x3
x− < ln(1 + x) < x − +
2 2 3
whenever x > 0.

c
2020 School of Mathematics and Statistics, UNSW Sydney
222 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS

c
2020 School of Mathematics and Statistics, UNSW Sydney
223

Chapter 10

The hyperbolic functions

A telecommunications company wishes to suspend a cable from a sequence of telegraph poles in

a cost effective manner. There are at least two competing considerations. First, it is desirable to
minimise the length of cable used. Second, it is desirable to minimise the amount of tension in
the cable (since an increase in tension requires an increase in cable strength). The configuration
in Figure 10.1 (a) requires a short length of strong cable while that in Figure 10.1 (b) reduces the
tension in the cable at the cost of increasing its length. Is there an optimal configuration for this
problem?
In order to solve this problem, one needs an equation which describes the curve of a suspended
cable. Such a curve is called a catenary, after the Latin word for ‘chain.’ Galileo claimed that
every catenary was a parabola, but this was later disproved in 1669. Two decades later, the Swiss
mathematician Jakob Bernoulli issued a public challenge to find the general equation that describes
a catenary. Three mathematicians Gottfried Leibniz, Christiaan Huygens and Johann Bernoulli
(Jakob’s brother) independently derived the correct equation:

a x/a
y= e + e−x/a (10.1)
2

(see Figure 10.1 (c)). In this chapter we study the function described by equation (10.1) and
other functions related to it. Such functions are important to mathematics, engineering, physics
(especially the theory of relativity) and architecture.
We note from the outset that a large number of new formulae will be introduced in this chapter.
However, students do not need to memorise all of them. Those which should be memorised are
listed in Section 10.7.

a
x
(a) (b) (c)

Figure 10.1: Catenaries.

c
2020 School of Mathematics and Statistics, UNSW Sydney
224 CHAPTER 10. THE HYPERBOLIC FUNCTIONS

10.1 Hyperbolic sine and cosine functions

(Ref: SH10 §7.8)
We mentioned in the introduction that the equation describing a freely suspended chain is given
by y = f (x), where
a
f (x) = ex/a + e−x/a .
2
To begin we study the simplest case: when a = 1.

Definition 10.1.1. The hyperbolic cosine function cosh : R → R is defined by the

formula
cosh x = 21 (ex + e−x ) ∀x ∈ R.

The reasoning behind the name ‘hyperbolic cosine’ will gradually become apparent. Since cosh
is a linear combination of exponential functions, it is differentiable and
d 1 d x 1
cosh x = e + e−x = (ex − e−x ).
dx 2 dx 2
The derivative of cosh is important in its own right and is given a special name.

Definition 10.1.2. The hyperbolic sine function sinh : R → R is defined by the

formula
sinh x = 21 (ex − e−x ) ∀x ∈ R.

Some people pronounce sinh as ‘shine.’ One can easily show that the derivative of sinh is cosh.
Thus we have
d d
(sinh x) = cosh x and (cosh x) = sinh x. (10.2)
dx dx
We now consider the graphs of these functions.

Example 10.1.3. Sketch the graph of sinh.

Solution. We begin by identifying a few key features of the sinh function.

• Since
sinh(−x) = 21 (e−x − ex ) = − 12 (ex − e−x ) = − sinh x,
we see that sinh is an odd function. As a corollary, sinh(0) = 0.

• Since
d
(sinh x) = cosh x = 12 (ex + e−x ) > 0,
dx
sinh is an increasing function.

• Since lim e−x = 0, sinh x gets arbitrarily close to 12 ex as x → ∞.

x→∞

c
2020 School of Mathematics and Statistics, UNSW Sydney
10.1. HYPERBOLIC SINE AND COSINE FUNCTIONS 225
y y y = cosh x

y = sinh x 1 y = 21 ex
y = 21 ex
x x

(a) (b)

Figure 10.2: Graphs of sinh and cosh.

From these observations we obtain a sketch of the graph of sinh (see Figure 10.2).

Example 10.1.4. Sketch the graph of cosh.

Solution. Some properties of the cosh function are listed below:

• cosh is an even function;

• cosh(0) = 1;

• cosh is decreasing on (−∞, 0), stationary at 0 and increasing on (0, ∞);

• cosh x ≥ 1 for all x in R;

• cosh x gets arbitrarily close to 12 ex as x → ∞.

The reader should verify each of these properties by appealing to Definition 10.1.1. From these
observations, we obtain a sketch of the graph of cosh (see Figure 10.2).

The functions cosh and sinh are related by the following important identity.

Proposition 10.1.5. If x is a real number then

cosh2 x − sinh2 x = 1.

Proof. If x ∈ R then
2 2
cosh2 x − sinh2 x = 1 x
2 (e + e−x ) − 21 (ex − e−x )
1
= × 2ex × 2e−x difference of two squares
4
= 1,

completing the proof.

c
2020 School of Mathematics and Statistics, UNSW Sydney
226 CHAPTER 10. THE HYPERBOLIC FUNCTIONS

We are now in a position to appreciate why these functions are called ‘hyperbolic cosine’ and
‘hyperbolic sine.’ First, we examine their similarity to the trigonometric cosine and sine functions.
The sine and cosine functions have the following properties:

d d
(cos x) = − sin x (sin x) = cos x
dx dx
cos is an even function sin is an odd function
cos2 x+ sin2 x = 1.

The functions sinh and cosh have analogous properties (with the occasional adjustment of a negative
sign):

d d
(cosh x) = sinh x (sinh x) = cosh x
dx dx
cosh is an even function sinh is an odd function
cosh2 x− sinh2 x = 1.

The next example explains the origin of the name hyperbolic in ‘hyperbolic cosine.’

Example 10.1.6. Sketch the curve defined by the parametric equation

(
x(t) = cosh t
(10.3)
y(t) = sinh t,

where t ∈ R.

Solution. The parameter t in equation (10.3) can be eliminated by Proposition 10.1.5 to obtain

1 = cosh2 t − sinh2 t
2 2
= x(t) − y(t) ,

or simply
x2 − y 2 = 1.

This equation describes an hyperbola. Since cosh t > 0 for every real number t, it follows that
x > 0 and hence the curve is the branch of the hyperbola that lies in the right-half plane (see the
black curve in Figure 10.3 (b)). Its asymptotes are the lines y = x and y = −x.
(The other branch of the hyperbola is parameterised by
(
x(t) = − cosh t
y(t) = sinh t

where t ∈ R, and is shown in Figure 10.3 (b) in gray.)

Hence we use trigonometric sine and cosine to paramaterise a circle or ellipse, and hyperbolic
sine and cosine to parameterise an hyperbola.

c
2020 School of Mathematics and Statistics, UNSW Sydney
10.2. OTHER HYPERBOLIC FUNCTIONS 227

1
x

Figure 10.3: Sketch for Example 10.1.6.

10.2 Other hyperbolic functions

(Ref: SH10 §7.9)
Other hyperbolic functions are defined in analogy to the trigonometric functions. Thus we have
sinh x cosh x
tanh x = , coth x = ,
cosh x sinh x
1 1
sech x = , cosech x = .
cosh x sinh x
Some people pronounce tanh as ‘than.’
The derivative of each of these functions is easy to compute.
Example 10.2.1. Compute the derivative of tanh.
Solution. By the definition of tanh,

d d sinh x
(tanh x) =
dx dx cosh x
d d
cosh x dx (sinh x) − sinh x dx (cosh x)
= 2 (by the quotient rule)
cosh x
2 2
cosh x − sinh x
= (by (10.2))
cosh2 x
1
= (by Proposition 10.1.5)
cosh2 x
= sech2 x.

A complete list of derivatives will be given in Section 10.4.

Example 10.2.2. Sketch the graph of the function tanh.

c
2020 School of Mathematics and Statistics, UNSW Sydney
228 CHAPTER 10. THE HYPERBOLIC FUNCTIONS
tanh x
1

−1

Figure 10.4: The graph of tanh.

sinh x
Solution. Since tanh x = cosh x and cosh x > 0 for all x in R, we see that Dom(tanh) = R. Some
properties of tanh are listed below.

• By using the fact that cosh is even and sinh is odd, we see that

sinh(−x) − sinh x
tanh(−x) = = = − tanh(x).
cosh(−x) cosh x

Hence tanh is an odd function. It follows that tanh(0) = 0.

• Since d
dx (tanh x) = sech2 x > 0 for all x in R, the function tanh is increasing everywhere.

• By writting tanh x in terms of exponentials we find that

sinh x
tanh x =
cosh x
ex − e−x
= x
e + e−x
1 − e−2x
=
1 + e−x
→1

as x → ∞.

• Finally, the slope of the graph of tanh at the origin is

d
= sech2 0 = 1.

(tanh x)
dx x=0

These features are shown in the following sketch in Figure 10.4.

10.3 Hyperbolic identities

(Ref: SH10 §7.8)
It has already been observed that some identities involving cosh and sinh are almost analogous to

c
2020 School of Mathematics and Statistics, UNSW Sydney
10.3. HYPERBOLIC IDENTITIES 229

those for cos and sin. This analogy can also be seen in the following larger list of identities. There
are the ‘difference of squares’ identities

cosh2 x − sinh2 x = 1,
1 − tanh2 x = sech2 x,
coth2 x − 1 = cosech2 x,

the ‘sum and difference’ formulae

sinh(x ± y) = sinh x cosh y ± cosh x sinh y,

cosh(x ± y) = cosh x cosh y ± sinh x sinh y,
tanh x ± tanh y
tanh(x ± y) = ,
1 ± tanh x tanh y
and the ‘double-angle’ formulae

sinh(2x) = 2 sinh x cosh x,

cosh(2x) = cosh2 x + sinh2 x,
2 tanh x
tanh(2x) = .
1 + tanh2 x
The following remark gives an easy method for remembering these identities.
Remark 10.3.1. There is a convenient mnemonic for these hyperbolic identities, provided that
you know the corresponding trigonometric identities. Simply replace cos with cosh and sin with
i sinh. For example,

cos2 x + sin2 x = 1 −→ cosh2 x + i2 sinh2 x = 1

−→ cosh2 x − sinh2 x = 1,

and

sin(x + y) = sin x cos y + cos x sin y

−→ i sinh(x + y) = i sinh x cosh y + i cosh x sinh y
−→ sinh(x + y) = sinh x cosh y + cosh x sinh y.
sin i sinh
Since tan = cos we replace tan with cosh = i tanh. Similarly sec is replaced with sech. Thus

1 + tan2 x = sec2 x −→ 1 + i2 tanh2 x = sech2 x

−→ 1 − tanh2 x = sech2 x.

We emphasise that Remark 10.3.1 provides a way to remember each formula, not to prove it.
Proofs are constructed using techniques that are illustrated in the following example.
Example 10.3.2. (a) Prove the identity

cosh(x + y) = cosh x cosh y + sinh x sinh y.

(b) Hence show that

cosh(x − y) = cosh x cosh y − sinh x sinh y.

c
2020 School of Mathematics and Statistics, UNSW Sydney
230 CHAPTER 10. THE HYPERBOLIC FUNCTIONS

(c) By using the result of (a), prove the double-angle formula for cosh.
(d) By assuming that
sinh(x + y) = sinh x cosh y + cosh x sinh y, (10.4)
use the result of (a) to show that
tanh x + tanh y
tanh(x + y) = .
1 + tanh x tanh y
Solution. (a) By the definition of cosh,

LHS = 1
2 ex+y + e−(x+y)
while
1 x

−x 1 y −y
1 x

−x 1 y −y

RHS = 2 e +e 2 e +e +2 e − e 2 e −e
1 x+y

= 4 e + ex−y + e−x+y + e−x−y + 14 ex+y − ex−y − e−x+y + e−x−y
1 x+y
= 2e + 12 e−x−y
= LHS.
This proves the identity.

Remark 10.3.3. Note that in Example 10.3.2, each proof started by simplifying either the left-
hand side of the equation, or the right-hand side of the equation. It would be wrong to begin the
proof of (a) by
cosh(x + y) = cosh x cosh y + sinh x sinh y
x+y

1
2 e + e−(x+y) = 21 ex + e−x 12 ey + e−y + 21 ex − e−x 12 ey − e−y
··· = ···
Such a method is flawed and can be used to ‘prove’ erroneous statements such as ‘1 = 0’.

10.4 Hyperbolic derivatives and integrals

(Ref: SH10 §§7.8, 7.9)
Below is a list of derivatives for the hyperbolic functions:
d d
sinh x = cosh x cosh x = sinh x
dx dx
d d
tanh x = sech2 x coth x = − cosech2 x
dx dx
d d
sech x = − sech x tanh x cosech x = − cosech x coth x.
dx dx
These derivatives can be established by using the definition of each function, as in Example 10.2.1.
The derivatives of sinh, cosh and tanh are the most important to remember.
Corresponding to these derivatives are the indefinite integrals
Z Z
sinh x dx = cosh x + C, sech2 x dx = tanh x + C

and so on.

c
2020 School of Mathematics and Statistics, UNSW Sydney
10.5. THE INVERSE HYPERBOLIC FUNCTIONS 231

Example 10.4.1. Calculate the derivative of the function f given by

f (x) = ln(sech(2x)).

Solution. First we have

d
sech 2x = −2 sech(2x) tanh(2x).
dx
Hence the chain rules gives

−2 sech(2x) tanh(2x)
f ′ (x) = = −2 tanh(2x).
sech(2x)

Example 10.4.2. Find the following integrals:

Z 1
5
ln 3
(a) sinh 5x dx
0
Z
(b) x cosh x dx

Z √
sech2 ( x)
(c) √ dx
x
Z
(d) ex sinh x dx.

Solution. (a) We have

Z 1
5
ln 3 h i 1 ln 3
1 5
sinh 5x dx = 5 cosh 5x
0 0
h i 1 ln 3
1
e5x + e−5x
5
= 10 0

ln 3 − ln 3
1
= 10 e + e − e0 − e0
1

= 10 3 + 31 − 1 − 1
2
= 15 .

10.5 The inverse hyperbolic functions

(Ref: SH10 §7.9)
The functions sinh : R → R and tanh : R → (−1, 1) are both increasing on R and hence have
well-defined inverses. However, cosh : R → [1, ∞) is not one-to-one on R. To obtain an inverse,
one must restrict the domain of cosh. The following definition gives the conventional restriction.

c
2020 School of Mathematics and Statistics, UNSW Sydney
232 CHAPTER 10. THE HYPERBOLIC FUNCTIONS
y y y

1 −1
x 1 x 1 x

(a) sinh and sinh−1 (b) cosh and cosh−1 (c) tanh and tanh−1

Figure 10.5: Graphs of the hyperbolic functions (in gray) and their inverses (in black).

Definition 10.5.1. The function cosh−1 : [1, ∞) → [0, ∞) is defined to be the

inverse function of the restricted hyperbolic cosine function
cosh : [0, ∞) → [1, ∞).

Graphs of the functions sinh−1 , cosh−1 and tanh−1 are illustrated in Figure 10.5.
Expressions involving the hyperbolic functions and their inverses can be simplified using the
identities of Section 10.3. It is important to note that

cosh−1 (cosh x) = x

only when x ≥ 0.

Example 10.5.2. Simplify

(a) sinh cosh−1 34 ,

(b) cosh−1 (cosh(−7)), and

(c) cosh(2 sinh−1 3).

Solution. (a) The identity
cosh2 t − sinh2 t = 1
can be rearranged to give
sinh2 t = cosh2 t − 1.
By substituting cosh−1 4
3 for t, we obtain

4 2
sinh2 cosh−1 4
3 = 3 − 1 = 97 .

We need to decide whether to take the positive or negative square root. Since cosh−1 is a nonneg-
ative function, we have cosh−1 43 > 0 and hence

sinh(cosh−1 34 ) > 0.

Therefore √
sinh cosh−1 4
3 =+ 7
3 .

c
2020 School of Mathematics and Statistics, UNSW Sydney
10.5. THE INVERSE HYPERBOLIC FUNCTIONS 233

(b) We cannot write cosh−1 (cosh(−7)) = −7 because cosh−1 is only the inverse function of cosh
with domain restricted to [0, ∞). However, since cosh is an even function,
cosh−1 (cosh(−7)) = cosh−1 (cosh(7)) = 7.
(c) We use the double-angle formula
cosh(2t) = 1 + 2 sinh2 t
for cosh:
2
cosh(2 sinh−1 3) = 1 + 2 sinh(sinh−1 3)
= 1 + 2 × 32
= 19.

Each of the functions sinh, cosh and tanh can be expressed in terms of the exponential function.
It is not surprising, then, that their inverses can be expressed in terms of the natural logarithm.
Proposition 10.5.3. The following identities hold:
p
sinh−1 x = ln(x + x2 + 1) ∀x ∈ R,
p
cosh−1 x = ln(x + x2 − 1) ∀x ∈ [1, ∞),

−1 1 1+x
tanh x = ln ∀x ∈ (−1, 1).
2 1−x
Proof. We prove the formula for sinh−1 only; the others can be proved similarly.
Suppose that y = sinh−1 x. Then
sinh y = x,
which means that
1 y −y
2 (e − e ) = x.
If we multiply this equation through by 2ey then we obtain
e2y − 1 = 2xey ,
which is a quadratic equation in ey . Further rearrangement gives
(ey )2 − 2x(ey ) = 1,
to which we complete the square to obtain
(ey − x)2 = 1 + x2 .
Since ey > 0, we take the positive square root. Thus
p
ey = x + x2 + 1.
(Note that this may also be obtained by using the quadratic formula rather than completing the
square). Taking the natural logarithm of both sides of this equation gives
p
y = ln(x + x2 + 1)
as required.

c
2020 School of Mathematics and Statistics, UNSW Sydney
234 CHAPTER 10. THE HYPERBOLIC FUNCTIONS

By the inverse function theorem, the functions

sinh−1 : R → R, cosh−1 : (1, ∞) → (0, ∞) and tanh−1 : (−1, 1) → R

are differentiable on their respective domains. A formula for each derivative is given in the next
proposition.

Proposition 10.5.4. The derivatives of sinh−1 , cosh−1 and tanh−1 are given by
d 1
sinh−1 x = √ ,
dx 2
x +1
d 1
cosh−1 x = √ ,
dx 2
x −1
d 1
tanh−1 x = .
dx 1 − x2
d
Proof. We prove the formula for dx (sinh−1 x) only; the others can be proved similarly.
−1
Suppose that y = sinh x, Then
sinh y = x
and differentiating with respect to x gives

dy
cosh y = 1.
dx
Hence
dy 1 1
= = . (10.5)
dx cosh y cosh(sinh−1 x)
The identity cosh2 t − sinh2 t = 1 gives
p
cosh t = 1 + sinh2 t

(where we have taken the positive square root since cosh t > 0 for all t). Hence
p
cosh(sinh−1 x) = 1 + x2 .

If we substitute this into (10.5) then we obtain

d 1
sinh−1 x = √
dx 1 + x2
as required.

Remark 10.5.5. Students should be familiar with the proofs of Propositions 10.5.3 and 10.5.4.

10.6 Integration leading to the inverse hyperbolic functions

When propositions 10.5.3 and 10.5.4 for sinh−1 are restated in terms of indefinite integration we
obtain Z
dx p
√ = sinh−1 x + C = ln(x + x2 + 1) + C.
x2 + 1

c
2020 School of Mathematics and Statistics, UNSW Sydney
10.6. INTEGRATION LEADING TO THE INVERSE HYPERBOLIC FUNCTIONS 235

A more general version may be obtained by observing that, when a > 0,

d x 1 1 1
sinh−1 = q =√ .
dx a a
x 2 x 2 + a2
a +1

In this manner, one is able to prove the formulae

Z
dx x
√ = sinh−1 + C
2
x +a 2 a
p
= ln(x + x2 + a2 ) + (C − ln a), a > 0,

Z
dx x
√ = cosh−1 + C
2
x −a 2 a
p
= ln(x + x2 − a2 ) + (C − ln a), x ≥ a > 0,

Z (
dx tanh−1 xa + C, |x| < a
1
a
=
a − x2
2
coth−1 xa + C, |x| > a > 0
1
a

1 a + x
= ln + C, x2 6= a2 .
2a a − x

These formulae are included in the table of standard integrals that is issued at the final examination.

Example 10.6.1. Find

Z
dx
(a) √
1 + 9x2
Z
dx
(b) √ .
2
x − 2x + 10
Solution. (a) A little algebraic manipulation shows that

1 1
√ = p .
1 + 9x 2 3 (1/3)2 + x2

Hence
Z Z
dx 1 dx
√ = p
1 + 9x 2 3 (1/3)2 + x2
1 x
= sinh−1 +C
3 1/3
1
= sinh−1 (3x) + C.
3
Alternatively, one could approach this problem by using the substitution u = 3x.
(b) The technique we apply to this example is known as ‘completing the square’:

x2 − 2x + 10 = x2 − 2x + 1 + 9 = (x − 1)2 + 32 = u2 + 32 ,

c
2020 School of Mathematics and Statistics, UNSW Sydney
236 CHAPTER 10. THE HYPERBOLIC FUNCTIONS

where u = x − 1. Hence
Z Z
dx du
√ = √
2
x − 2x + 10 + 32u2
u
= sinh−1 + C
3
−1 x − 1
= sinh + C.
3

10.7 A summary of important hyperbolic formulae

A large number of formulae was introduced in this chapter. Below is a list of formulae, with the
more important appearing earlier in the list, that students are expected to know.
• The definitions of sinh and cosh:

sinh x = 21 (ex − e−x ) cosh x = 12 (ex + e−x )

• The derivatives of sin and cosh:

d d
(sinh x) = cosh x (cosh x) = sinh x
dx dx

• The definition of tanh:

sinh x
tanh x =
cosh x
• The hyperbolic identity
cosh2 x − sinh2 x = 1

• The derivative of sinh−1 :

d 1
(sinh−1 x) = √
dx 1 + x2
• The derivative of tanh:
d
(tanh x) = sech2 x.
dx
Other formulae need not be memorised, though students will do well to note the following
points.
• Derivatives for the inverse hyperbolic functions can be read from the table of standard
integrals.

• Identities which express the inverse hyperbolic functions in terms of the natural loga-
rithm (see Proposition 10.5.3) can be read from the table of standard integrals.

• The definition of coth, sech and cosech are in exact analogy with the definition of cot,
sec and cosec.

• The hyperbolic identities listed in Section 10.3 are easily remembered from their trigono-
metric counterparts (see Remark 10.3.1).

c
2020 School of Mathematics and Statistics, UNSW Sydney
10.8. (APPENDIX):THE CATENARY 237
y

suspended cable
T

T sin θ
P b
θ
T cos θ

A
H b

w ℓ(x)

Figure 10.6: Forces acting on a suspended cable.

• It is not expected that students will memorise the derivatives of coth, sech and cosech.
However, they can be easily derived from first principles provided that the derivatives
of sinh and cosh are known.
A table of standard integrals will be issued at the final examination.

10.8 (Appendix):The catenary

We end this chapter showing that the cosh function describes the curve of a suspended cable.
Suppose that a cable of uniform mass is suspended from two fixed points. To describe the
equation of the resulting curve in terms of x and y, we fix a coordinate system such that vertex
of the curve passes through the point A(0, y0 ) as in Figure 10.6. (The ordinate y0 can be any real
number; its choice only fixes the coordinate system and does not alter the shape of the catenary.)
Consider a point P (x, y) that lies on the curve and suppose that
• T is the tension in the cable at P (x, y),
• H is the tension in the cable at A(0, y0 ),
• ℓ(x) is the length of the cable from A(0, y0 ) to P (x, y), and
• w is the weight per unit length of the cable.
(see Figure 10.6). Thus the gravitational force acting on the cable along the segment AP is given
by w ℓ(x). We note that the quantities T , θ and ℓ depend on x but that w and H do not. We also
make the natural assumption that the catenary is described by a differentiable function.
Theorem 10.8.1. There is a choice of coordinate system such that the catenary in Figure 10.6 is
described by the equation
H wx
y= cosh .
w H
c
2020 School of Mathematics and Statistics, UNSW Sydney
238 CHAPTER 10. THE HYPERBOLIC FUNCTIONS

To prove the theorem, it suffices to show that if the catenary is described by y = f (x), where
f is a differentiable function, then
H wx
f (x) = cosh
w H
for a particular choice of y0 .
We begin with a general formula that relates a function f to its arc length function ℓ by a
differential equation.
Lemma 10.8.2. Suppose that f is a differentiable function. Then the arc length function ℓ is also
differentiable and q
′
2
ℓ (x) = 1 + f ′ (x) (10.6)
Sketch proof. Suppose that h is a small nonzero real number and consider the diagram below.
f (t)

b
x + h, f (x + h)
ℓ(x + h) − ℓ(x)
f (x + h) − f (x)

x, f (x) b

By Pythagoras’ theorem,
q 2
ℓ(x + h) − ℓ(x) ≈ h2 + f (x + h) − f (x) .
If we divide both sides by h then
s 2
ℓ(x + h) − ℓ(x) f (x + h) − f (x)
≈ 1+ .
h h
As h → 0 the approximation gets better and we obtain
q 2
ℓ′ (x) = 1 + f ′ (x)
as required.

The next lemma uses facts specific to the catenary.

Lemma 10.8.3. If the equation of the catenary in Figure 10.6 is given by y = f (x), then f satisfies
the following conditions:
f (0) = y0 , (10.7)
′
f (0) = 0, (10.8)
q
′′ w 2
f (x) = 1 + f ′ (x) . (10.9)
H
c
2020 School of Mathematics and Statistics, UNSW Sydney
10.8. (APPENDIX):THE CATENARY 239

Proof. The first two conditions are a result of our choice of coordinate system and the assumption
that f is differentiable. It remains to establish the third condition.
Since the cable is stationary, the forces w ℓ(x) and T sin θ that act vertically cancel each other
out. That is,
T sin θ = w ℓ(x).
Similarly, the horizontal forces H and T cos θ also cancel each other out and so
T cos θ = H.
Using these equations, and the fact that f ′ (x) = tan θ at the point P (x, y), gives
T sin θ w ℓ(x)
f ′ (x) = tan θ = = .
T cos θ H
So far we have
w ℓ(x)
f ′ (x) = .
H
Since w and H are constants, differentiating this equation with respect to x gives
w ′
f ′′ (x) = ℓ (x).
H
Therefore q
′′ w 2
f (x) = 1 + f ′ (x) .
H
by equation (10.6).
Proof of Theorem 10.8.1. It remains to find all functions f that satisfy the conditions of Lemma
10.8.3. If we let u denote f ′ (x) and b denote w/H, then equation (10.9) becomes
du p
= b 1 + u2 .
dx
Hence
dx 1
= √ .
du b 1 + u2
Every solution to this differential equation is of the form
1
x = sinh−1 u + C1 .
b
Hence we have
f ′ (x) = u = sinh(bx − bC1 ).
Enforcing condition (10.8) shows that either b = 0 or C1 = 0. Now if b = 0 then f ′ (x) = 0 and f
is simply a constant function (that is, the cable is pulled tight and exhibits no sag). On the other
hand, if b 6= 0 then C1 = 0 and hence
f ′ (x) = sinh(bx).
Therefore
cosh(bx)
f (x) = + C2
b
and enforcing the condition (10.7) yields C2 = 1b − y0 . If we choose y0 to be 1/b then C2 = 0 and
the function f that describes the catenary is given by
cosh(bx) H wx
f (x) = = cosh ,
b w H
proving the theorem.

c
2020 School of Mathematics and Statistics, UNSW Sydney
240 CHAPTER 10. THE HYPERBOLIC FUNCTIONS

10.9 Maple notes

Maple knows about the hyperbolic functions sinh, cosh, tanh, csch, sech, and coth, and inverse
hyperbolic functions arcsinh, arccosh, arctanh, arccsch, arcsech, and arccoth. For example,
> int(arctanh(x),x);
1
x arctanh (x) + ln 1 − x2
2
> # which is closely analogous to
> int(arctan(x),x);
1
x arctan (x) − ln 1 + x2
2
> # BUT the following result does not follow the analogy between trig and hyperbolic
functions
> int(sech(x),x);

arctan (sinh (x))

> plot(arccsch(x),x=-6..6, y=-6..6, discont=true, numpoints=100);

c
2020 School of Mathematics and Statistics, UNSW Sydney
PROBLEMS FOR CHAPTER 10 241

Problems for Chapter 10

Problems 10.1 : Hyperbolic sine and cosine functions and

10.2 : Other hyperbolic functions
1. [R] [V] Define sinh x and cosh x. Hence show that
d
a) (cosh 6x) = 6 sinh 6x;
dx
b) ln(sinh x) < x − ln 2 whenever x > 0.

2. [R] By expressing the following hyperbolic functions in terms of sinh x and cosh x, find
the derivative of each function f given below.
a) f (x) = tanh x b) f (x) = sech x c) f (x) = coth x

3. [R] In each case, find f ′ (x).

a) f (x) = sinh(3x2 ) b) f (x) = cosh( x1 ) c) f (x) = sinh(ln x)

Problems 10.3 : Hyperbolic identities

4. [R]
a) Given the formula sinh(A + B) = sinh A cosh B + cosh A sinh B, find a formula for
sinh 2x. By differentiation or otherwise, find a formula for cosh 2x.
b) [H] Using the results of part
Z (a), express sinh 3x as a cubic polynomial in sinh x.
Hence, or otherwise, find sinh3 x dx.

5. [R] Show that cosh x+sinh x = ex . Deduce that (cosh x+sinh x)n = cosh nx+sinh nx.

6. [H] [V] Consider the hyperbola x2 − y 2 = 1, where x ≥ 1.

(cosh t, sinh t)

A(t)
0 1 x

a) Using the definitions of cosh and sinh, prove that, for every real number t, the
point (cosh t, sinh t) lies on the hyperbola.

c
2020 School of Mathematics and Statistics, UNSW Sydney
242 CHAPTER 10. THE HYPERBOLIC FUNCTIONS

b) When t > 0, let A(t) denote the shaded region in the diagram. Explain why
Z cosh t p
1
A(t) = cosh t sinh t − x2 − 1 dx.
2 1
t
c) By first calculating A′ (t), prove that A(t) = .
2

Problems 10.4 : Hyperbolic derivatives and integrals

7. [R] Evaluate the following integrals.

Z Z 1 ln 2
3
a) cosh(4x) dx b) sinh 3x dx
0
Z Z √
sinh( x)
c) cosh2 x dx d) √ dx
x

Problems 10.5 : The inverse hyperbolic functions

8. [R] Simplify cosh(sinh−1 (3/4)), cosh−1 (cosh(−3)) and sinh(tanh−1 (5/13)).

9. [R] Show that

d 1 d 1
a) cosh−1 x = √ , for x > 1 b) tanh−1 x = .
dx 2
x −1 dx 1 − x2

10. [R] Show that

p
a) cosh−1 x = ln(x + x2 − 1) ∀x ∈ [1, ∞)

−1 1 1+x
b) [V] tanh x = ln ∀x ∈ (−1, 1).
2 1−x

dy
11. [R] Find if
dx
a) y = sinh−1 (2x)
b) y = tanh−1 (1/x)
c) y = cosh−1 (sec x) whenever 0 < x < π/2.

Problems 10.6 : Integration leading to the inverse hyperbolic functions

12. [R] Find

Z Z 1/2 Z
dx dx dx
a) [V] √ b) c) √ .
1 + 4x2 0 1 − x2 x2 + 4x + 13

13. [X] Sketch the function sech−1 . What is its maximal domain? For y = sech−1 x, show
that
√ !
dy −1 1 + 1 − x2
a) = √ b) y = ln .
dx x 1 − x2 x

c
2020 School of Mathematics and Statistics, UNSW Sydney
243

Answers to selected problems

Chapter 1

1. a) The set of integers between −π and π.

c) The empty set.

3. Answer for both: the interior and boundary of the triangle with vertices at (0, 0), (2, 0) and
(2, 4).

4. a) x < 0 or x > 1 b) 1 < x < 2 c) x < −2 or x > 0

d) −1 < x < 1 e) −2 ≤ x < 1 or x ≥ 3

5. a) −4 < x < 2 b) x < −5 or x > 1

c) −1 < x < −1/3 d) x > 0

1
6. c) From (a) we have x2 + ≥ 2 with equality if and only if x = ±1.
x2
7. a) F b) F c) T d) T e) F

8. Hint: (x2 + y 2 )2 ≥ 4x2 y 2 .

√ √ √
10. a) − 5 ≤ x ≤ 5; 0 ≤ y ≤ 5
√ √
b) x ≤ − 5 or x ≥ 5; y ≥ 0
c) x 6= 8; y 6= 0
d) [1, ∞); [0, ∞)
e) (1, ∞); (0, ∞)
f) {x ∈ R : 2nπ ≤ x ≤ (2n + 1)π; n ∈ Z}; [0, 1]
√
g) The union of the intervals [− 7π π
6 + 2kπ, 6 + 2kπ] where k ∈ Z; 0≤y≤ 3
h) {x ∈ R : x 6= (2n + 1)π/2, n an integer}; [1, ∞)
i) R; [−1, ∞)

11. a) 22 b) x2 + 10x + 22 c) 6 d) x2 + 2
√ √ √
12. a) x − 1 + 1/ x − 1 b) x−1 c) (x − 1)3/2 d) (1/ x − 1) − 1

16. [4, 13]

17. x = 1, 7

c
2020 School of Mathematics and Statistics, UNSW Sydney
244 CHAPTER 3

18. a) If p(x) = a0 + a1 x + · · · + an xn then p(q(x)) = a0 + a1 q(x) + a2 (q(x))2 + · · · + an (q(x))n .

Products and sums of polynomials are again polynomials.
b) Yes.

Chapter 2

1. a) 1 b) 2 c) 0
d) Doesn’t exist (→ ∞). e) 5 f) Doesn’t exist.

2. a) 0 b) 0

4. b) 0
√
5. a) 1 b) M = 10 (best possible) c) M = 1/ ǫ will do.

6. a) 4 b) 0 c) 0 d) 0 e) 0

7. a) Not necessarily, as the information given indicates only that the inequality holds for a
subset of (ǫ−1 , ∞).
b) Yes. In fact one can prove that lim g(x) = 5 from the definition of the limit by taking
x→∞
M to be 1ǫ .

8. a) 50 metres per second b) 5 ln 50 ≈ 19.56 seconds after leaving the plane.

9. a) Yes. If limit of f (x) as x → ∞ does not exist and f (x) 6= 0, then

lim (f (x) − f (x)) = 0 and lim (f (x)/f (x)) = 1.
x→∞ x→∞

b) Yes, since g(x) = (f (x) + g(x)) − f (x).

c) No, as in (b).
d) No. For example if f (x) = 0 for all x and lim g(x) does not exist, we have
x→∞
lim (f (x)g(x)) = 0.
x→∞

10. a) 10 b) 4 c) 3 d) −1/9

11. a) −1 b) 1 c) No

12. a) Doesn’t exist. b) Doesn’t exist. c) Doesn’t exist. d) Doesn’t exist.

13. a) 0 b) 0

14. a) |CB| = θ, |CA| = sin θ, |DB| = tan θ.

15. Neither the left-hand nor right-hand limits exist due to wild oscillatory behaviour.

c
2020 School of Mathematics and Statistics, UNSW Sydney
ANSWERS 245

Chapter 3

1. b) Yes

2. a) Continuous everywhere. b) Continuous everywhere except at π/2.

3. k = 8

5. Use the intermediate value theorem.

9. a) Yes b) Yes c) No d) Yes

Chapter 4

2. a) 5(4x3 + 21x6 ) b) (4x3 −2)(4x2 +2x+4)+(x4 −2x)(8x+2)

c) (16y − y 4 )/(y 3 + 8)2 d) (2x2 − 4)/(x2 − 4)1/2
e) −4/(t2 − 4)3/2 f) 3 cos 3y + 12 cos 2y sin 2y
g) (4x3 − x4 )e−x h) x ln(x3 + 1) + 3x2 (x2 + 1)/2(x3 + 1)
i) sec2 x j) − tan x

3. a) 0 b) 0 c) f ′ (0) = 0

4. a) i) x 6= 0 ii) all x b) i) all x ii) all x

c) i) x 6= −2 ii) x 6= −2

7. 2pf ′ (a)

8. a) x + 17π + cos 2x b) 1 − 2 sin 2x c) 2−x2 +cos 2(2−x2 )

d) 1 − 2 sin 2(2 − x2 ) e) −2x(1 − 2 sin 2(2 − x2 ))

dy 3x2 − y dy √ √
9. a) = b) = (y − 4x xy)/(4y xy − x)
dx x − 3y 2 dx

11. y = 2

12. a) (i) b = 0 (ii) a = 1, b = 0

b) (i) b = 1 (ii) a = 2, b = 1.

13. a = 1, b = 0

14. a) f (8.01) ≈ f (8) = 2

b) i) y = (x − 8)/12 + 2
1
ii) f (8.01) ≈ (8.01 − 8)/12 + 2 = 2 + 1200
c) The approximation in (b) is much better.
√
15. 3 ar/2

16. 7/8 m2 /s

c
2020 School of Mathematics and Statistics, UNSW Sydney
246 CHAPTER 5
1
17. a) 8π cm/s
32000π 3
b) 81 cm

dh 2
18. a) dt = 125π mm/s when h = 50 mm.

Chapter 5
q
7 1
1. 3 b) 2

5. c) 0
√ √ 1
7. a) By the Mean Value Theorem, for some c with 16 < c < 17, 17 − 16 = √ < √1 =
2 c 2 16
0.125.
b) 0.008
c) 2 × 10−6 .

8. −1, 1 and 4 are stationary points; 4 is a local minimum point;−1 is a local maximum point.

9. No

10. a) 11, −61 b) 3, −253 c) 27/256, −750

d) 250, −54 e) 2, 0

11. (12/13, 18/13)

12. p′n (x) = pn−1 (x), and if pn−1 (x) = 0 then pn (x) = xn /n!. These hints are all you need!

13. a) (400)/(4 + π), 100π/(4 + π) b) 0, 100

(p
1 − a2 /3 if 0 ≤ a ≤ 3/2
14. The greatest distance is a + 2; the least distance is
|a − 2| if a > 3/2.

15. a = π2 , x = 3π 7π 3π π 5π
4 , 4 ; a = 2 , x = 4 , 4 . The Maple commands
with(plots):
animate(plot,[cos(a) + 2*cos(2*x) + cos(4*x-a), x=0..2*Pi],a=0..2*Pi);
should confirm your answers.

17. Three real zeros

18. a) f (t) = − cos t + t2 /2 + 3 b) No

19. a) 0 b) 8/3

1 m
20. a) 3 b) n c) −1 d) − 21 e) 1
4 f) 1
3

21. a) → 0 b) → ∞ c) → 0
d) → 1 e) → 1 f) → 32

c
2020 School of Mathematics and Statistics, UNSW Sydney
ANSWERS 247

22. Combine the two fractions and apply l’Hôpital twice only. You will need to simplify the quotient
obtained after the first application of l’Hôpital. Maple can confirm your answer.
√ √ √ √
23. (a, b) = (− 2, 2) or ( 2, − 2)

26. a) −1/2
b) a = −1/2, b = 1

27. c) a = b = 0

Chapter 6

1
2. a) f −1 (x) = (x − 1)
3
√
b) g −1 (x) = − x − 1, Dom(g −1 ) = [1, ∞),
−1
Range(g ) = (−∞, 0], (g −1 )′ (x) = 2√−1
x−1

4. b) 1/3

5. b) The restriction of f to (−∞, −1] has an inverse with domain (−∞, 3],
the restriction of f to [−1, 1] has an inverse with domain [−1, 3], and
the restriction of f to [1, ∞) has an inverse with domain [−1, ∞).

6. a) No b) Yes

7. a) The graph is symmetric about x = − 21 , which surely gives a local maximum of f (x).
There will be four (maximal) intervals where f will have an inverse. Try this exercise on
Maple. The commands plot, diff and solve should suffice.
b) f is one-to-one; f −1 (x) = x1/17 − 1 is not differentiable when x = 0.
c) I can be one of four intervals.

8. a) π/3 b) 2/5 c) −π/3 d) π/3

√
e) 4/5 f) 3/ 34 g) π/3 h) π − x
√ √
11. a) −2/ 1 − 4x2 b) 1/(2 x − x2 ) c) 2/(4x2 − 12x + 10)

12. Differentiate; −1 ≤ x ≤ 1; π/2.

13. b) f (x) = π/2 when x > 0 and f (x) = −π/2 when x < 0.
√
14. b) The derivative of the inverse is −1/x x2 − 1 when x > 1.

16. a = π/2, b = 0

17. a) 24π km/min b) 104π/3 km/min

√
18. 48 metres

c
2020 School of Mathematics and Statistics, UNSW Sydney
248 CHAPTER 7

Chapter 7

1. [−1, 5], [0, 3], upper half of circle.

2. a) period 2π/3, odd b) period 3π, neither

c) not periodic, even d) period π/3, odd
e) period π, even f) 2π

3. odd, even, neither, odd, odd, even.

4. The asymptotes are

a) x = 3, y = x + 2 b) x = −1, y = x − 1 c) x = −3, x = 2, y = x − 1.

6. a) x ≥ 3, − 13 < x ≤ 13 b) x = − 31 , x = −3, y = 1 c) (1, − 41 ), (−1, −4) d) Domain:

x 6= 3, − 3 , Range: (−∞, −4], [− 41 , ∞).
1

x2 y2 x2 y2
7. a) + = 1, ellipse b) − = 1, hyperbola
16 25 9 4
2/3 d) spiral
c) y = x

8. a) ii) (2, 0) iii) −1

b) ii) (5, 0), (−1, 0) iii) 4t3 /3
c) ii) (1, 0), (−1, 0) iii) − cot t

1
9. a) 3x − 27y + 52 = 0 b) 9

2
10. a) y = 3x 3 .

11. b) Hint: the length of one particular arc of the larger circle equals the length of one arc on
the smaller circle.
d) x2/3 + y 2/3 = 1

12. a) p(t) = a + t(b − a), p(0) = a, p(1) = b b) y = 4 − x, q(1/2) is the coodinate vector for
the midpoint of B and C c) p0 (t) = (1 − t)2 , p1 (t) = 2t(1 − t), p2 (t) = t2
√ √ √
13. a) (3, 0) b) (−3 3, −3) c) ( 2, − 2)
√
14. a) (3, π) b) ( 2, −3π/4) c) (4, 2π/3)
d) (1, π/2) e) (4, 5π/6) f) (4, −5π/6)

15. a) Circle, centre (0,0), radius 4

b) A ray in the second quadrant
c) A spiral of Archimedes

16. a) Circle, centre (0,3), radius 3

b) Circle, centre (1,0), radius 1

c
2020 School of Mathematics and Statistics, UNSW Sydney
ANSWERS 249

17. The following sketches are a guide to shape only.

y y y

x x
x
a) b) c)
y
y y

x x x
d) e) f)

(x − 2)2 y2
19. + =1
9 5

Chapter 8

1. a) i) S Pn (f ) = S Pn (f ) = 1

ii) S Pn (f ) = 21 1 − n1 , S Pn (f ) = 12 1 + n1

iii) S Pn (f ) = 61 1 − n1 2 − n1 , S Pn (f ) = 16 1 + n1 2 + n1
v) S Pn (f ) = 1, S Pn (f ) = 0
1 1 1
b) i) 1 ii) 2 iii) 3 iv) 4 (v) Not Riemann integrable
√
2. a) 1365 = 36.95
√ √
b) 1690.9 = 41.12 and the lower bound is 1078.9 = 32.85

4. 4.5

5. a) 82.4 b) 10

1
6. f (x) =
x2 +x+1
1
7. x is not differentiable on all of [−1, 1] so the FTC doesn’t apply.
√
8. a) Draw a picture! b) 5π/12 − 3/2

10. F is continuous everywhere, but not differentiable at the integers.

12. a) sin x2 b) 3x2 sin x6 c) −3x2 sin x6 d) 3x2 sin x6 − sin x2

13. −(5 − 4x)5

π
14. biii) 6.

2 √
15. a) 12 ex + C b) −2 cos x + C c) 15/4
√ √
d) 4 2 a9/2 9 e) 1/4 f) (2 2 − 1)/3
√ √ 1 1 21 1

16. a) 2 x − 2 ln(1 + x) + C b) 25 21 (5x − 1) + 20 (5x − 1)20 + C
c) x/(x + 1)2 + C d) 4 − 10 ln(7/5)

c
2020 School of Mathematics and Statistics, UNSW Sydney
250 CHAPTER 9

1 t2 − 1
17. √ tan−1 √ for t 6= 0
2 2t
5
18. a) 4e25+1 b) x2 sin x + 2x cos x − 2 sin x + C
√
π
c) x(ln(x) − 1) + C d) 12 + 23 − 1
7e8 +1 π
f) 2
e)
ex
64 √
g) (cos x + sin x) h) x tan−1 x − ln 1 + x2 + C
√2 √
2 1
i) 2 + 2 ln(1 + 2)

19. a) 1/5 b) diverges c) π/4

d) 0 e) 2 f) diverges

21. a) 0 b) ln 2 c) No

22. a) convergent b) divergent c) divergent

23. a) convergent b) divergent c) convergent

24. a) convergent b) divergent c) convergent

25. s < 0

26. p > 1

27. The integral converges whenever 2a − b > 1.

28. a) 4, 8, 2
c) Li′ (x) = 1
ln x > 0 so Li is an increasing function; Li(2) = 0.
6
10 −2
d) Li(106 ) ≥ 6 ln 10 .
π(106 )
e) x ' 0.07238.
2
29. a) √2 e−x
π

d) (i) 0.749 < erf(1) < 0.928 (iii) 1/e (iv) 1.344

Chapter 9

2. a) A partition into 7 equal parts will suffice

3. a) 3x2 /2(x3 + 1) b) ex for x > 0, −e−x for x < 0

1
c) (ln(ln x))(ln x)x d) 5x4 (where x > −61/5 )

4. a) 12 ln(1 + e2x ) b) −e1/x

√
c) 3x / ln 3 d) e 4
x

2
(ln x) f) ln | sin x|
e) 2

7. b) e

c
2020 School of Mathematics and Statistics, UNSW Sydney
ANSWERS 251
1/5
x3 − 3 3x2 2x
8. a) 3x ln 3 b) −
x2 + 1 5(x

3 − 3) 5(1 + x2 )
sin x
c) (sin x)sin x cos x (1 + ln(sin x)) d) cos(xsin x ) xsin x cos x ln x +
x

9. a) 0 b) 0 c) 1 d) e2 e) 1
f) 1 g) ea h) 0 i) 0

Chapter 10

2. a) sech2 x b) −sechx tanh x c) −cosech2 x

− sinh(1/x) 1 1
3. a) 6x cosh(3x2 ) b) x2 c) 2 + 2x2

4. a) sinh 2x = 2 cosh x sinh x ; cosh 2x = cosh2 x + sinh2 x

b) 1 1
4(3 cosh 3x − 3 cosh x) or 1
3 cosh3 x − cosh x

sinh 4x 1 √
7. a) 4 b) 12 c) (2x + sinh 2x)/4 d) 2 cosh x

8. 5/4, 3, 5/12
√ 1
11. a) 2/ 1 + 4x2 b) 1−x2 for |x| > 1 c) sec x

12. a) 1
2 sinh−1 2x b) tanh−1 1
2 = 1
2 ln 3 c) sinh−1 x+2
3

c
2020 School of Mathematics and Statistics, UNSW Sydney
252

c
2020 School of Mathematics and Statistics, UNSW Sydney
INDEX 253

Index

absolute maximum point, 57 continuous functions, 21

absolute maximum value, 57 and limits, 40
absolute minimum point, 57 combination of, 51
absolute minimum value, 57 piecewise, 166
absolute value, 7 critical point, 96
answers, 243 cubic, 13
antiderivative, 102 curve of fastest descent, 142
antidifferentiation, 102, 177 cusp point, 151
area, 157 cycloid, 142
absolute, 168
signed, 168 decreasing function, 91
under the graph of f , 164 definite integral, see integral(s), definite
unsigned, 168 derivative(s), 65, 67
astroid, 155 definition of, 67
asymptote, 132 difference of squares identities, 229
oblique, 132 difference quotient, 67
vertical, 132 differentiability, 75
and continuity, 75
Barrow, Isaac, 157
local extrema, 79
Bernoulli, Jakob, 142, 223
differentiable, 74
Bernoulli, Johann, 142, 223
at a point, 67
Bolzano, 34
local approximation, 77
bounded function, 59
maximum point, 79
brachistochrone, 142
mean value theorem, 85
Cartesian coordinates, 144 minimum point, 79
catenary, 223, 237 twice, 93
chain rule, 70, 78 differential calculus, 1
change of variables formula, 181, 182 differential equation, 101
circle, 20, 140 differentiation, 69
closed interval, 3 chain rule, 70
codomain, 9 product rule, 72
composition (of functions), 12 quotient rule, 70
conic sections, 20 discontinuity, 21
contains, 3 distance, 7
continuity, 40, 75 domain, 9
and differentiability, 75 maximal, 10
at an end point, 54 natural, 10
everywhere, 41 dominant term analysis, 190, 193
on an interval, 53, 54 double-angle formulae, 229
continuous, 74 dummy variable, 171

c
2020 School of Mathematics and Statistics, UNSW Sydney
254 INDEX
e, 211 hyperbolic sine, 224
elementary functions, 18 sum and difference formulae, 229
elements (of set), 2
ellipse, 20, 140 improper integral
endpoints p-integrals, 188
critical points, 96 convergence of, 187
endpoints of intervals, 3 indeterminate form(s), 31, 103, 217, 218
equations inequalities, 4
solutions, 98 and absolute value, 7
erf, 205 triangle inequality, 7
error bounds, 90 inequality, 4
error function, 205 infinity, symbol for, 3
even function, 132 injective, 113
exp, 212 integers, 2
exponential function, 214 integral calculus, 1, 157
integral(s), 165
fastest growing term, 30 definite
function(s), 1, 9 limits of, 165
composition of, 12, 70 improper
decreasing, 91 p-integrals, 186
error function, 205 comparison test, 190
even, 132 convergence of, 186
exponential, 213 limit comparison test, 193
implicitly defined, 19 indefinite, 179
increasing, 91 over (a, ∞), 187
inverse function, 115 Riemann sum, 162, 164
limit at infinity, 33 table of, 237
local approximation, 77 integrand, 165
natural logarithm, 209 integration, 163
odd, 132 and differentiation, 177
quadratic, 13 by parts, 185
range of, 10 by substitution, 180
rational, 14 constant of, 179
real-valued, 9 dummy variable, 171
sine integral, 196 intermediate value theorem, 55
value of, 9 number of solutions, 100
fundamental theorem of calculus, 1, 196 intervals, 3
first fundamental theorem, 172 closed, 3
second fundamental theorem, 175 open, 3
inverse function(s), 115
Galileo, 142, 223 hyperbolic functions, 231
inverses of, 231
horizontal line test, 114 restriction, 118
horizontal point of inflexion, 93 trigonometric functions, 120, 121
Huygens, Christiaan, 223 irrational numbers, 2
hyperbola, 20, 140, 226
hyperbolic functions, 227 l’Hôpital’s rule, 104, 217
double-angle formulae, 229 l’Hôpital, Guillaume, 142
hyperbolic cosine, 224 least upper bound property, 55, 57

c
2020 School of Mathematics and Statistics, UNSW Sydney
INDEX 255
Lebesgue, Henri, 157, 168 polar coordinates, 144
left-hand limits, 39 polar curves, 146
Leibniz, Gottfried, 1, 142, 157, 223 polynomial, 13
Leonard Euler, 211 coefficients of, 13
limit(s), 1, 27 degree of, 13
at infinity, 33 leading coefficient, 13
composition of function, 42 polynomial division, 134
left-hand, 39 primitive, 102
of continuous functions, 40 product rule, 72
of the form lim f (x), 27
x→∞ quadratic function, 13
pinching theorem, 37, 43
quartic, 13
right-hand, 39
quintic, 13
two-sided, 39
ln, 209 radians, 14
logarithmic functions, 214 range of a function, 10
logarithms, 209 rates of change, 78
rational function, 14
maximum rational numbers, 2, 55
global, 96 real numbers, 2, 55
local, 79, 94 Riemann integrable, 165
second derivative test, 94 Riemann integral, 163
maximum-minimum theorem, 56, 57 area under the graph, 163
mean value theorem, 85, 86 basic properties of, 169
members (of set), 2 Riemann sum
minimum upper, 162, 164
global, 96 Riemann sums, 177, 196
local, 94 lower, 162, 164
second derivative test, 94 Riemann, Bernhard, 157
right-hand limits, 39
natural domain, 10 Rolle’s theorem, 87
natural logarithm, 211, 233
natural numbers, 2 secant, 64
Newton, Isaac, 1, 142, 157 second derivative, 93, 95
set(s), 2
oblique asymptote, 132 sine integral, 196
odd function, 132 stationary point, 80
one-to-one correspondence, 112, 113 classification of, 92
one-to-one function, 113 subset, 3
open interval, 3 sum and difference formulae, 229

parabola(s), 20, 140 tangent to a curve, 65

parametric curves, 136 triangle inequality, 7
conic sections, 139 trigonometric functions, 14
parameter, 138 inverse functions, 121
parametrisation, 138
polar curves, 146 vertical asymptotes, 132
partition, 160, 163 Weierstrass, 34
piecewise continuous, 166
pinching theorem, 29, 37, 43 Zeno of Elea, 34

c
2020 School of Mathematics and Statistics, UNSW Sydney

Calculus I Lecture Notes Overview
No ratings yet
Calculus I Lecture Notes Overview
265 pages
Introduction to Ordinary Differential Equations
100% (3)
Introduction to Ordinary Differential Equations
388 pages
MATH1231-1241-Calculus-Notes-2020T1 (2020 - 10 - 19 10 - 35 - 29 UTC)
No ratings yet
MATH1231-1241-Calculus-Notes-2020T1 (2020 - 10 - 19 10 - 35 - 29 UTC)
213 pages
MA100 at Lecture Notes
No ratings yet
MA100 at Lecture Notes
252 pages
Further Pure 2 and 3 For Ocr
33% (3)
Further Pure 2 and 3 For Ocr
14 pages
Differential Equations Course
No ratings yet
Differential Equations Course
4 pages
Functions of Several Variables
No ratings yet
Functions of Several Variables
16 pages
Phsl2101 Exam 2 Notes 1
No ratings yet
Phsl2101 Exam 2 Notes 1
30 pages
Revision Proof by Induction
No ratings yet
Revision Proof by Induction
15 pages
MATH1231-1241-Algebra-Notes-2020T1 (2020 - 10 - 19 10 - 35 - 29 UTC)
No ratings yet
MATH1231-1241-Algebra-Notes-2020T1 (2020 - 10 - 19 10 - 35 - 29 UTC)
254 pages
Area Under The Curve
No ratings yet
Area Under The Curve
51 pages
Fourier Series Tutorial by Dr. Tisdell
No ratings yet
Fourier Series Tutorial by Dr. Tisdell
8 pages
MT105b - Mathematics 2 - 2017 Exam - Zone-A
No ratings yet
MT105b - Mathematics 2 - 2017 Exam - Zone-A
3 pages
Advanced Integration Techniques
100% (1)
Advanced Integration Techniques
155 pages
Lecture Notes: Divergence Theorem and Stokes' Theorem
No ratings yet
Lecture Notes: Divergence Theorem and Stokes' Theorem
6 pages
Integration by Parts Examples
No ratings yet
Integration by Parts Examples
5 pages
ct1 Financial Math Notes
No ratings yet
ct1 Financial Math Notes
48 pages
Factoring and Solving Polynomials
No ratings yet
Factoring and Solving Polynomials
13 pages
Arjudin - Differential Equations
No ratings yet
Arjudin - Differential Equations
84 pages
Ra JPG
No ratings yet
Ra JPG
179 pages
ريض 101 (@ksupdf)
No ratings yet
ريض 101 (@ksupdf)
357 pages
Functions of Several Variables PDF
No ratings yet
Functions of Several Variables PDF
21 pages
Implicit Differentiation Guide
No ratings yet
Implicit Differentiation Guide
13 pages
Multivariable Calculus
100% (2)
Multivariable Calculus
158 pages
Indefinite Integral
No ratings yet
Indefinite Integral
88 pages
Double Integrals
No ratings yet
Double Integrals
29 pages
ST104A 03 June
No ratings yet
ST104A 03 June
21 pages
Line and Surface Integrals Guide
No ratings yet
Line and Surface Integrals Guide
33 pages
MATH1231 Course Pack
No ratings yet
MATH1231 Course Pack
616 pages
Calculus of Vector Valued Functions
100% (1)
Calculus of Vector Valued Functions
18 pages
Advanced Integration Techniques Guide
No ratings yet
Advanced Integration Techniques Guide
10 pages
Integral Calculus: Solved Exercises
No ratings yet
Integral Calculus: Solved Exercises
14 pages
Linear Algebra MAT223 Instructor's Guide by Jason Siefken
100% (1)
Linear Algebra MAT223 Instructor's Guide by Jason Siefken
108 pages
Differential Geometry of Curves and Surfaces by Manfredo Perdigão Do Carmo Homework4
100% (1)
Differential Geometry of Curves and Surfaces by Manfredo Perdigão Do Carmo Homework4
4 pages
Mte 101 Chapter 4 Oct 2020
No ratings yet
Mte 101 Chapter 4 Oct 2020
22 pages
AP GP Summary
No ratings yet
AP GP Summary
21 pages
Partial Differentiation Guide
No ratings yet
Partial Differentiation Guide
47 pages
Mathematics Extension 1 Notes (UPDATED)
No ratings yet
Mathematics Extension 1 Notes (UPDATED)
12 pages
Logistic Function Calculus Notes
No ratings yet
Logistic Function Calculus Notes
6 pages
Calculus 08 Techniques of Integration 4up
No ratings yet
Calculus 08 Techniques of Integration 4up
7 pages
Introduction to Differentiation Techniques
No ratings yet
Introduction to Differentiation Techniques
36 pages
Partial-Derivative-of-Composite and Implicit Function
100% (1)
Partial-Derivative-of-Composite and Implicit Function
5 pages
Understanding Partial Derivatives in Multivariable Functions
No ratings yet
Understanding Partial Derivatives in Multivariable Functions
9 pages
Integration Techniques Worksheet
No ratings yet
Integration Techniques Worksheet
11 pages
Special Function and Series Solution of Differential Equation
No ratings yet
Special Function and Series Solution of Differential Equation
83 pages
Double and Triple Integrals Solutions
100% (1)
Double and Triple Integrals Solutions
18 pages
Differential Equations Cheatsheet
No ratings yet
Differential Equations Cheatsheet
9 pages
Multivariable Calculus Lecture Notes
100% (1)
Multivariable Calculus Lecture Notes
296 pages
MAT503 - Ordinary Differential Equations
100% (1)
MAT503 - Ordinary Differential Equations
156 pages
2021 Lecture Notes MAT1330
No ratings yet
2021 Lecture Notes MAT1330
246 pages
Calc T1 Condensed
No ratings yet
Calc T1 Condensed
9 pages
Calculus 1
No ratings yet
Calculus 1
103 pages
131 Notes and Practice Problems
No ratings yet
131 Notes and Practice Problems
243 pages
Student Manual Mathematics
No ratings yet
Student Manual Mathematics
100 pages
Student's Manual Essential Mathematics For Economic Analysis
100% (2)
Student's Manual Essential Mathematics For Economic Analysis
100 pages
Advanced Calculus MAST10021 Guide
No ratings yet
Advanced Calculus MAST10021 Guide
209 pages
Calculus Problems
No ratings yet
Calculus Problems
242 pages
Calculus Assignment PDF
No ratings yet
Calculus Assignment PDF
329 pages
CalculusI LectureNOTES 230929 113727 240717 214458
No ratings yet
CalculusI LectureNOTES 230929 113727 240717 214458
229 pages
Introduction to Calculus Concepts
No ratings yet
Introduction to Calculus Concepts
6 pages
College Algebra 1
No ratings yet
College Algebra 1
16 pages
FLCD Mod - FUNCTION, LIMIT, CONTINUITY & DERIVABILITY METHOD OF DIFFERENTIATION INVERSE TRIGONOMETRIC FUNCTION
No ratings yet
FLCD Mod - FUNCTION, LIMIT, CONTINUITY & DERIVABILITY METHOD OF DIFFERENTIATION INVERSE TRIGONOMETRIC FUNCTION
42 pages
4th Order Runge-Kutta Method Explained
No ratings yet
4th Order Runge-Kutta Method Explained
4 pages
TMUA 2025 Topic Checklist
No ratings yet
TMUA 2025 Topic Checklist
2 pages
Matrix Operations and Properties
No ratings yet
Matrix Operations and Properties
51 pages
SPM 2305 Complex Analysis 1 Year III Semester I
No ratings yet
SPM 2305 Complex Analysis 1 Year III Semester I
2 pages
Functions of Bounded Variation
No ratings yet
Functions of Bounded Variation
5 pages
Graph Theory and Matrix Algebra Basics
No ratings yet
Graph Theory and Matrix Algebra Basics
79 pages
HW9 Soln Fall18
No ratings yet
HW9 Soln Fall18
3 pages
Btech Ee 3 Sem Basic Signals and Systems Ree303 2022
No ratings yet
Btech Ee 3 Sem Basic Signals and Systems Ree303 2022
2 pages
On Faddeev-Leverrier's Method For The Computation of The Characteristic Polynomial of A Matrix and of Eigenvectors
No ratings yet
On Faddeev-Leverrier's Method For The Computation of The Characteristic Polynomial of A Matrix and of Eigenvectors
15 pages
Module 6 Flat
No ratings yet
Module 6 Flat
4 pages
Quadratic Functions Problems Guide
No ratings yet
Quadratic Functions Problems Guide
3 pages
Grade 11 General Math First Exam Guide
83% (65)
Grade 11 General Math First Exam Guide
6 pages
L4 Linear Operators in Quantum Mechanics
No ratings yet
L4 Linear Operators in Quantum Mechanics
16 pages
Test2 Solutions-1
No ratings yet
Test2 Solutions-1
4 pages
Graphing Using Vertex Form
No ratings yet
Graphing Using Vertex Form
5 pages
Derivatives - Practice - Sheet - (1) - 1
No ratings yet
Derivatives - Practice - Sheet - (1) - 1
8 pages
Algorithm Design by Jon Kleinberg, Eva Tardos (1) - 389
No ratings yet
Algorithm Design by Jon Kleinberg, Eva Tardos (1) - 389
1 page
AP Calculus AB Practice Test
No ratings yet
AP Calculus AB Practice Test
6 pages
Geometric Transformations Guide
No ratings yet
Geometric Transformations Guide
7 pages
5.14 Glossary: Fundamental Relations and Functions
No ratings yet
5.14 Glossary: Fundamental Relations and Functions
1 page
Maths - 1a Half Yearly Syllabus
No ratings yet
Maths - 1a Half Yearly Syllabus
91 pages
Graph Theory Basics: Definitions & Examples
No ratings yet
Graph Theory Basics: Definitions & Examples
118 pages
Mathematics T 2 BAS 203 Dr. Mahtab Alam
No ratings yet
Mathematics T 2 BAS 203 Dr. Mahtab Alam
5 pages
Advanced Trace Formula Analysis
No ratings yet
Advanced Trace Formula Analysis
74 pages
Spectral Theory of Operators On Hilbert Spaces 1st Edition Carlos S. Kubrusly (Auth.) PDF Download
No ratings yet
Spectral Theory of Operators On Hilbert Spaces 1st Edition Carlos S. Kubrusly (Auth.) PDF Download
139 pages
1.2 Finite Differences of Polynomial Functions
No ratings yet
1.2 Finite Differences of Polynomial Functions
2 pages
1720And1760Practice Midterm F23
No ratings yet
1720And1760Practice Midterm F23
3 pages