Units1 2 3 4Block1MST2092ndedOU2008
Units1 2 3 4Block1MST2092ndedOU2008
MST209 Block 1
MST209 Mathematical
methods and models
Block 1
Block 1
Contents
UNIT 1 Getting started 5
Introduction 6
4 Complex numbers 27
4.1 The arithmetic of complex numbers 28
4.2 Polar form 29
5 Differentiation 31
5.1 Rates of change 31
5.2 Differentiating combinations of functions 35
5.3 Investigating functions 38
6 Integration 41
6.1 Reversing differentiation 41
6.2 Evaluating integrals 43
6.3 Integration by parts and by substitution 45
6.4 Definite integrals 47
7 Computer activities 50
Outcomes 52
1 Some basics 63
1.1 Why differential equations? 63
1.2 Differential equations and solutions 65
1.3 Approximations in calculations 70
Introduction 110
1 Homogeneous differential equations 111
1.1 First thoughts 111
1.2 Method of solution 114
1.3 The general solution 121
Outcomes 145
Index 198
1.1
UNIT 1 Getting star ted
1
7
PC
5
Unit 1 Getting started
Introduction
The main purpose of this unit is to review ideas that you should have met
before, and that you will need as a basis for your study of MST209. The unit
focuses mainly on mathematical techniques, but also covers some examples
involving skills in the application of mathematics. The use of mathemat-
ics to investigate questions arising in non-mathematical contexts is broadly
referred to as ‘mathematical modelling’. In this course, the study of math-
ematical techniques will quite often be separated from their use in models,
as this enables you to practise the mathematical methods before you go on
to use them. However, the methods introduced in the course are chosen
because of their wide application in modelling.
The unit contains a number of ‘standard formulae’: for example, for the
solution of a quadratic equation, for expanding sin(a + b) and cos(a + b),
and for the derivatives and integrals of standard functions. It is helpful if
you are able to remember such formulae, but not essential; they are all given
in the course Handbook. You do need to be aware that the formulae exist,
however, and to be able to find them in the Handbook and to apply them.
The computer algebra package for the course can be used to help with much
of the work in this unit, and the unit reviews how this is done. However, the
majority of the unit concentrates on how the mathematical techniques can
be performed ‘by hand’; that is, without use of a computer (or calculator). Generally, we advise you to
This is because, in the long run, a familiarity and confidence with using first attempt all exercises
common mathematical formulae and techniques by hand will speed up your without using a computer,
unless they are specifically
study, and because it is not always convenient to resort to a computer. marked as computer
(Having said that, the computer remains a valuable tool, both for checking activities.
hand calculations and for addressing problems too complicated or time-
consuming to be worked on by hand.)
Section 1 starts by reviewing some basic points about numbers. Sections 2
and 3 cover a number of important standard functions: linear, quadratic,
logarithmic and exponential functions in Section 2, and trigonometric func-
tions in Section 3. All these functions occur frequently. These sections also
remind you of some important mathematical techniques: for example, for
manipulating algebraic expressions, for solving a quadratic equation, and for
manipulating expressions involving functions such as sin and cos. Section 4
covers some basic ideas about complex numbers.
Sections 5 and 6 discuss the fundamental concepts of calculus: differentiation
and integration. It is important that you understand what these ideas are,
and how they arise in models. These sections also provide plenty of exercises
on performing basic calculus operations by hand, as being able to do this Note, however, that the
quickly will stand you in good stead for the rest of the course. computer algebra package for
the course can be used to find
The final section shows how the computer algebra package for the course most derivatives and many
may be used in many of the techniques in the unit. integrals.
6
Section 1 Numbers, measurement and accuracy
We distinguish various types of number. The integers are the positive and
negative whole numbers, together with zero:
. . . , −4, −3, −2, −1, 0, 1, 2, 3, 4, . . . .
We denote the set of all integers by Z. These are fine for counting, but
insufficient for measuring lengths, for example. Non-integer quantities can
sometimes be represented exactly, for example as fractions (such as 73 ) or
√
roots (such as 5). Often in this course, however, decimals will be used. In
decimal notation, we can sometimes express a number exactly, but frequently
we need to approximate. We may write a number correct to a particular
number of decimal places, as in, for example,
7
3 = 2.33 (to two decimal places),
√
2 = 1.4142 (to four decimal places).
For large or small numbers we extend decimal notation by ‘taking out’ pow-
ers of 10, as in, for example,
3.414 × 106 (for 3 414 000),
3.42 × 10−7 (for 0.000 000 342).
In this course we shall use the convention that a non-zero number is ex-
pressed in scientific notation as Other conventions for
c scientific notation exist; for
±b × 10 , example, choosing b to satisfy
where 1 ≤ b < 10 and c is an integer. 0.1 ≤ b < 1.
7
Unit 1 Getting started
If we know that a real number x is 1.274 to three decimal places, then x lies
between 1.2735 and 1.2745; that is, x lies in the interval
[1.2735, 1.2745].
This ‘closed interval’ notation represents the set of real numbers between
1.2735 and 1.2745, inclusive of the endpoints. We can actually say slightly
more, for we know that x is not 1.2745, since that would round up to 1.275.
So x is in fact in the interval
[1.2735, 1.2745), Note the round bracket on
the right.
which represents the set of real numbers between 1.2735 and 1.2745, with
1.2735 included, but 1.2745 not included. Other, similar, ‘round bracket’
notations are also used.
[a, b] means all real numbers x between a and b, that is, a ≤ x ≤ b
with a and b both included.
[a, b) means all real numbers x between a and b, that is, a ≤ x < b
with a included and b excluded.
(a, b] means all real numbers x between a and b, that is, a < x ≤ b
with a excluded and b included.
(a, b) means all real numbers x between a and b, that is, a < x < b
with a and b both excluded.
The interval [a, b] is sometimes referred to as a closed interval, the interval
(a, b) as an open interval, and the intervals [a, b) and (a, b] as half-open
intervals. As you will see later, interval notation is useful in expressing
domains of functions and in discussing the accuracy of calculations, and
sometimes it is important to be able to say whether or not the endpoints
are to be included.
It is common scientific practice to quote measurements with ‘error bounds’.
8
Section 2 Some standard functions
9
Unit 1 Getting started
To define a function, a process must produce a unique output value for each
allowed input.
10
Section 2 Some standard functions
Exercise 2.1
Suppose that, at midday on 1 May, a reservoir is at 60% of its capacity. The
forecast for the next 50 days suggests that 30 000 m3 of water will be removed The notation m3 is shorthand
each day for consumption, while only 15 600 m3 will be added to the reservoir for ‘cubic metres’.
each day, so that on average the quantity of water in the reservoir reduces by
(30 000 − 15 600)/(24 × 60) = 10 cubic metres each minute. Crisis measures
will be introduced when the reservoir falls to 20% of its capacity.
Let V (measured in m3 ) be the volume of water t minutes after midday on
1 May, and let C (measured in m3 ) be the reservoir’s overall capacity. The
volume of water can be modelled using
V = V0 − 10t (0 ≤ t ≤ 72 000),
with a suitable choice of V0 .
Determine a suitable expression for V0 , and hence use the model to obtain
an expression in terms of C for the time at which crisis measures will be
needed (according to this model).
11
Unit 1 Getting started
y
( x 2, y 2)
y2 − y1
gradient m = tan q =
y2 – y1 x2 − x1
( x1, y1)
x2 – x1
c
q
x
Y
X
7 m s –1 the 5 m s –1
buoy The notation m s−1 is
shorthand for ‘metres per
second’.
the A B Z
port 2000 m
Figure 2.2
Suppose that we choose to measure time in seconds, starting from midnight, In this course, we shall
and distance in metres, measured from A. Let X metres be the distance usually use SI units. The SI
of the coastguard cutter from A at time t seconds after midnight, and let units of distance and time are
metres and seconds (denoted
Y metres be the distance of the boat from A at the same time. We can by m and s, respectively).
readily obtain an expression for X in terms of t, since X = 0 when t = 0, Commonly used SI units are
and the cutter travels at a constant speed of 7 metres per second: we have given in the Handbook.
X = 7t.
We also want an expression giving Y in terms of t. We know that (at
point B) Y = 2000 at 11.00 pm, which is 1 hour, or 602 seconds, before
midnight and so corresponds to t = −3600. Also, as the boat is moving at
a constant speed of 5 metres per second, Y will be related to t by a linear
function of the form
Y = 5t + c.
12
Section 2 Some standard functions
*Exercise 2.2
(a) Find the value of c such that Y = 5t + c satisfies the condition Y = 2000
at t = −3600.
(b) (i) When will the coastguard cutter catch the boat?
y
(ii) In the direction that the boat is travelling, the limit of territorial
waters is 100 kilometres from A. Will the cutter catch the boat within
territorial waters?
4x + 3y = –1
3x + y = 3
Simultaneous linear equations
x
Suppose that we know the paths of two aircraft, each of which is travelling
in a straight line, and wish to know where these paths cross. This is one
of a wide variety of situations where we need to find the intersection of two
straight-line graphs, which is equivalent to the algebraic problem of solving
two simultaneous linear equations. Consider, for example, the following Figure 2.3
linear equations (see Figure 2.3):
4x + 3y = −1, (2.4) These equations are linear,
since they can be rewritten in
3x + y = 3. (2.5)
the form y = mx + c.
There are many ways of solving these equations: one quick method is
Gaussian elimination. Let us see how this works for (2.4) and (2.5). This method was known to
Chinese mathematicians in
The aim of the method is to subtract a multiple of the first equation from about 100 bc, but not to
the second in order to eliminate the x terms. First, we multiply (2.4) by 34 , European ones until it was
to obtain an equation with the same coefficient of x as in (2.5): discovered by the German
mathematician Carl Friedrich
3
4 × 4x + 3
4 × 3y = 34 (−1), Gauss (1777–1855). (‘Gauss’
is pronounced as ‘gowce’.)
which simplifies to
3x + 94 y = − 34 . (2.6)
Now we subtract (2.6) from (2.5), to eliminate x, and obtain
y − 94 y = 3 − (− 34 ) = 3 + 3
4 = 15
4 ,
that is, − 54 y = 15
4 , so y = −3.
To find x, substitute this value of y into (2.4), to obtain
4x + 3(−3) = −1,
which gives 4x = −1 + 9 = 8, and hence x = 2.
So the solution of Equations (2.4) and (2.5) is x = 2, y = −3. You may like to check these
values, by substitution
*Exercise 2.3 into (2.4) and (2.5).
Use Gaussian elimination to solve the equations below for u and v:
2u − 5v = 19,
3u + 4v = −29.
13
Unit 1 Getting started
y y
a>0 a<0
x
x
14
Section 2 Some standard functions
Here the solution t = −0.18 refers to a time before the ball is thrown, so can
be discarded. The ball hits the ground about 2.2 seconds after it is thrown.
In this example, the quadratic equation has two solutions. Look at the
graphs in Figure 2.4, and imagine moving them up and down (which corre-
sponds to varying the value of c). The x-axis may meet a quadratic graph
in two places, or not at all, or it may happen just to touch the minimum
(or maximum) point of the graph √ (see Figure 2.6). In formula (2.8), we
need to find the square root b2 − 4ac. If b2 − 4ac > 0, then we find a
real value, greater than 0, for this square root, and there are two different
solutions to the quadratic equation. If b2 − 4ac = 0, then there is just one Complex numbers, which are
solution (though, for reasons given below, this one solution is sometimes discussed in Section 4, enable
considered as two equal solutions). If b2 − 4ac < 0, then there are no (real) us to express square roots of
negative numbers and hence
solutions. The quantity b2 − 4ac is often referred to as the discriminant of to produce (complex)
the quadratic equation because it discriminates between the cases shown in solutions to a quadratic
Figure 2.6. equation when b2 − 4ac < 0.
y y y
x x x
Figure 2.6
*Exercise 2.4
Solve for x the following equations.
(a) 2x2 + 7x − 4 = 0 (b) x2 + x − 6 = 0
Sometimes, you may find that you need to solve a quadratic equation where
the coefficients are letters rather than numbers.
Exercise 2.5
Show that the solutions (for x) of
mx2 + 2kx + mw2 = 0 (m = 0)
√
are x = −K ± K 2 − w2 , where K = k/m.
15
Unit 1 Getting started
One point of caution: if you want to factorize a quadratic function, you can
do this by first solving the equation (e.g. by using formula (2.8)), but you
need to be careful to match the coefficient of x2 in the original quadratic
function with that in the factorization. For example, 2x2 + 7x − 4 = 0 has
solutions x = 12 and x = −4, but to factorize 2x2 + 7x − 4 we write
2x2 + 7x − 4 = 2(x − 12 )(x + 4) = (2x − 1)(x + 4),
where the 2 is needed to ensure that the coefficients of x2 are the same on
each side.
There are some particular factorizations that it is helpful to recognize. Two
useful ones are Note that if we allow A to be
2 2 2 2 2 2 positive or negative, then
(x + A) = x + 2Ax + A and (x − A) = x − 2Ax + A . both cases can be written as
So, for example, − 6x + 9 = (x − 3)2 . We refer to such quadratics as
x2 (x + A)2 = x2 + 2Ax + A2 .
perfect squares. Perfect squares correspond to quadratic equations in
which the discriminant is b2 − 4ac = 0. (You may like to check this for
yourself.) Thus equations in which the discriminant is zero can be written in
the form (x + A)(x + A) = 0 or (x − A)(x − A) = 0, and these factorizations
lead us sometimes to consider such equations as having two equal roots
x = −A and x = −A, or x = A and x = A, rather just one root.
Another useful factorization is
(x + A)(x − A) = x2 − A2 .
So, for example, x2 − 16 = (x + 4)(x − 4). We refer to such a quadratic as
a difference of two squares.
One needs to be particularly careful when solving a quadratic equation that
involves the same letters as appear in the standard formula (2.8), but in a
different way.
Example 2.1
Solve for x the equation
abx2 − (a + b)x + 1 = 0,
where a and b are non-zero constants.
Solution
You need to keep a cool head here, because the letters used in formula (2.8)
are used in a different way in the given equation. In (2.8), we need
ab for a, −(a + b) for b, 1 for c.
So we obtain the solutions
a + b ± (a + b)2 − 4ab
x= .
2ab
This expression gives the solutions, but it turns out to be possible to express You will find it advantageous
them in a much simpler form. We have (a + b)2 = a2 + 2ab + b2 , so the in this course to be able to
discriminant can be written as perform manipulations like
this by hand. If you find
(a + b)2 − 4ab = (a2 + 2ab + b2 ) − 4ab = a2 − 2ab + b2 = (a − b)2 . them difficult, however, you
may like to make use of the
Therefore computer algebra package for
the course.
a+b± (a + b)2 − 4ab a + b ± (a − b)2 a + b ± (a − b)
= = .
2ab 2ab 2ab
Now (a + b) + (a − b) = 2a and (a + b) − (a − b) = 2b, so the two solutions
are 1/a and 1/b.
16
Section 2 Some standard functions
Powers
You will be familiar with the meaning of a positive integer power of a num-
ber, such as 105 = 10 × 10 × 10 × 10 × 10. In general, an means the product In an , a is called the base,
of n copies of a (for any real number a and any positive integer n). In and n may be referred to as
particular, a1 = a. the power, the index or the
exponent.
For positive integers m and n, we have the property
an × am = an+m , (2.9)
since each side is the product of n + m copies of a. For example,
102 × 105 = 107 .
Consequently, if we multiply m copies of an , we obtain
m times
n n n n n+n+n+···+n
a
× a × a
× · · · × a = a ;
m times
that is,
(an )m = an×m . (2.10)
For example, (102 )3 = 106 .
The definition of an can be extended to cases where n is not a positive
integer by assuming that (2.9) and (2.10) hold more generally. For a = 0,
this assumption leads to the definition of a0 as 1, and a−n as 1/an ; and, for
a > 0, to the definition of a1/n as the nth root of a, and am/n as the nth Recall that the nth root of a
root of am . So, for example: number a is a number b such
bn = a, and we write
that √
10−4 = 1/104 = 0.0001; n
b = a.
√
3
271/3 = 27 = 3 (since 33 = 27);
1 1 1 1
4−3/2 = 3/2 = √2 3 =
√ = .
4 4 64 8
It is conventional to take fractional powers to
√ mean positive roots (where
there is a choice). So, for example, 91/2 = 9 means 3 rather than −3. The negative square root of 5,
In general, roots of negative numbers do not necessarily exist (at least, for example, would
√ be written
not as real numbers); but√where they do, we use the same notation. So, as −51/2 or − 5.
for example, (−27)1/3 = 3 −27 = −3 (since (−3)3 = −27, and there is no
positive cube root in this case).
17
Unit 1 Getting started
We can define ax for a > 0 and for irrational values of x by means of a For a = 0, 0x is taken to
limiting process that need not concern us here. (The value of ax for any equal 0. For a < 0, the
particular a > 0 and x can be found using your calculator.) This definition definition of ax involves
complex numbers and need
of ax leads to the following properties of powers that hold for all real numbers not concern us here.
a > 0 and all real exponents x and y:
ax > 0, We have not proved these
properties, but we shall make
a−x = 1/ax ,
use of them as necessary.
ax+y = ax × ay ,
(ax )y = ax×y ,
ax /ay = ax−y .
Finally, note that for powers of a product or a quotient, we have
(ab)x = ax bx and (a/b)x = ax /bx .
For example, 157 = 37 57 and (5/3)4 = 54 /34 .
*Exercise 2.6
Use the properties of indices to simplify each of the following.
(a) a3 a5 (b) a3 /a5 (c) (a3 )5 (d) (2−1 )4 × 43
3/2
(e) 8−1/3 (f ) 163/4 (g) 49 (h) (16x4 )1/2
Before the advent of calculators and computers, these properties were com-
monly used in calculating powers, reciprocals and products ‘using loga-
rithms’. Such applications are no longer needed, but these properties of
logarithms are still important in the manipulation of expressions involving
exponentials and logarithms.
18
Section 2 Some standard functions
*Exercise 2.7
Simplify each of the following (where a > 0, b > 0 and x > 0).
(a) ln 7 + ln 4 − ln 14 (b) ln a + 2 ln b − ln(a2 b)
(c) ex × (ey )2 ÷ e2x (d) ln(ex × ey ) (e) e2 ln x (f) e−2 ln x
(g) exp(2 ln x + ln(x + 1))
Exercise 2.8
By taking logs of both sides of the equation
ax = ekx ,
where a > 0, show that we can find a value for k so that this equation holds
for all values of x.
y y
k>0 k<0
1 1
x x
Log plots
Suppose that you have data on some quantity y at various times t, and you
believe that y is an exponential function of t. To test such a hypothesis, you
can plot ln y against t. If such a plot gives a straight line, then this confirms Such plots are often referred
that y is of the form Aekt . For example, suppose that a plot of ln y against to as log–linear plots.
t suggests the linear relationship
ln y = 1.47t + 3.82.
Then, taking exponentials of each side, we have
y = exp(1.47t + 3.82)
= exp(1.47t) × exp(3.82)
= 45.6e1.47t .
The next exercise shows how we can test data (on variables x and y) for a
different form of relationship by plotting ln x against ln y.
19
Unit 1 Getting started
Exercise 2.9
(a) Suppose that
Plots of ln y against ln x are
ln y = 2.83 ln x + 0.37. called log–log plots.
Using the properties of exp and ln, express y as a function of x.
(b) In general, suppose that
ln y = a ln x + b,
where a and b are constants. What form of relationship is there between
y and x?
20
Section 2 Some standard functions
Example 2.2
If f (x) = ex (x ∈ R) and g(x) = 1 + x2 (x ∈ R), what are the following?
(a) g(f (x)) (b) f (g(x))
Solution
(a) g(f (x)) = g(ex ) = 1 + (ex )2 = 1 + e2x In this example, both
1+x2 functions f and g are defined
(b) f (g(x)) = f (1 + x2 ) = e for all x in R, so we need not
worry about domains.
Notice that g(f (x)) and f (g(x)) are different. The function g(f (x)) is ‘apply
f first, then g’, while f (g(x)) is ‘apply g first, then f ’; the order in which f
and g are applied matters!
21
Unit 1 Getting started
Example 2.3
Express the function
1
h(x) = √ 3 (2.15)
1 + 2x2
as a composite of a quadratic function and a power function.
Solution
Note first that 1 + 2x2 is a quadratic function and that, writing y = 1 + 2x2 ,
1
the right-hand side of (2.15) becomes √ 3 = y −3/2 (a power function). So
( y)
we can obtain h(x) in two steps.
Step 1 Calculate y = 1 + 2x2 .
1
Step 2 Apply √ 3 = y −3/2 to the result of Step 1.
( y)
So if f (x) = 1 + 2x2 and g(x) = x−3/2 , then h(x) = g(f (x)). Here, the domain of g is
x > 0, but since
f (x) = 1 + 2x2 is always
Exercise 2.10 greater than 0, there is no
(a) If f (x) = e−x and g(x) = 1 − x3 , find the following. problem.
(i) f (g(x)) (ii) g(f (x))
(b) Express
1
h(x) =
(4 + 9x2 )4
as a composite of a quadratic function and a power function.
3 Trigonometric functions
In this section, we add another class of functions to the ‘library’ developed
in Section 2. These are the trigonometric functions. They originate in the
geometry of right-angled triangles, but in this course we are equally often
concerned with their use in modelling repetitive or oscillatory behaviour. In
particular, they arise as solutions of certain differential equations.
You will have met sin θ = a/h, cos θ = b/h and tan θ = a/b as ratios in a The Greek letter θ is read as
right-angled triangle (see Figure 3.1). However, these definitions of the sine, ‘theta’. The Greek alphabet
cosine and tangent functions work only for 0 < θ < π2 . (Note that we shall is given in the Handbook.
almost always express angles in radians in this course.) Recall that 180◦ = π radians.
22
Section 3 Trigonometric functions
To define the sine and cosine functions for a general value of θ, we can
use Figure 3.2, which shows a circle of radius 1. Imagine that the line OA
started along the x-axis, and was then rotated anticlockwise through an
angle θ. Then the point A has coordinates (cos θ, sin θ). Here θ may be any
value, positive or negative. (A negative value of θ corresponds to a rotation
clockwise.)
y
A
h 1 q
a
O x
q
b
y y
1 1
–2π 3π –π π 0 π π 3π 2π q –2π
3π
− 2 –π
π
−2 0 π
π 3π
2π q
− 2 −2 2 2 2 2
–1 –1
Figure 3.3
Other trigonometric functions can be defined in terms of sin and cos. You
will have met tan θ = sin θ/ cos θ. This is defined for all real θ except where
cos θ = 0 (i.e. at θ = ± π2 , ± 32π , and so on). You may also have met These functions may be
referred to as tangent,
1 1 1 cos θ secant, cosecant and
sec θ = , cosec θ = and cot θ = = .
cos θ sin θ tan θ sin θ cotangent.
We need to restrict the domains of cosec and cot to exclude points where
sin θ = 0, and the domains of sec and tan to exclude points where cos θ = 0.
*Exercise 3.1
π
(a) Find the values of sin θ and cos θ for θ = 0 and θ = π2 . 6
2
(b) Hence find the values of tan θ, sec θ, cosec θ and cot θ for θ = 0 and
3
θ = π2 , where they are defined. 2
1
(c) Two right-angled triangles are shown in Figure 3.4. Use these to calcu- π π
4 3
late the values of sin θ, cos θ, tan θ, cosec θ, sec θ and cot θ for θ equal to 1 1
each of π6 , π4 and π3 .
(d) For what values of θ is sin θ = 0? (Refer to Figure 3.3(a).) Figure 3.4
23
Unit 1 Getting started
The function tan has the graph shown in Figure 3.5. Notice that tan ac-
tually repeats its values every π. (This is because sin(θ + π) = − sin θ and
cos(θ + π) = − cos θ, so that tan(θ + π) = tan θ.)
y
y = tan q
The graphs of sec, cosec and
cot are given in the
Handbook (as well as those of
sin, cos and tan).
3π
− 2 –π π
− 2 0 π π 3π
q
2 2
Figure 3.5
24
Section 3 Trigonometric functions
3π
2
π
π
2
Table 3.1
Function Inverse Codomain of Domain of
inverse function inverse function
y = sin x x = arcsin y − π2 ≤ x ≤ π2 −1 ≤ y ≤ 1 Some texts use sin−1 , cos−1
y = cos x x = arccos y 0≤x≤π −1 ≤ y ≤ 1 and tan−1 rather than arcsin,
arccos and arctan.
y = tan x x = arctan y − 2 < x < π2
π
R
25
Unit 1 Getting started
y
Figure 3.7 shows the relation between a clockwise rotation through θ (re- sin q
(cos q, sin q)
A
garded as a rotation through −θ) and an anticlockwise rotation through θ.
1
Notice that such rotations lead to equal x-coordinates but to y-coordinates
of opposite signs. So we have q X cos q =
O q cos(–q ) x
cos(−θ) = cos θ and sin(−θ) = − sin θ.
1
These relations hold for all values of θ, and are examples of trigonometric B
sin(– q)
identities. These can be useful in a variety of contexts, such as simplifying (cos(–q), sin(– q))
expressions involving trigonometric functions.
To derive one particularly useful identity, apply Pythagoras’s Theorem to Figure 3.7
the right-angled triangle OAX in Figure 3.7. This leads to
cos2 θ + sin2 θ = 1. (3.2) Notice that we write (sin θ)2
as sin2 θ.
If we divide each side of (3.2) by cos2 θ, we obtain the identity
1 + tan2 θ = sec2 θ. Strictly speaking, this and the
following identities hold only
Similarly, if we divide each side of (3.2) by sin2 θ, we obtain where the functions tan, sec,
etc. are defined.
cot2 θ + 1 = cosec2 θ.
You may also have met previously the identities These identities can be
derived using transformation
sin(θ + φ) = sin θ cos φ + cos θ sin φ, (3.3) matrices, for example, but we
cos(θ + φ) = cos θ cos φ − sin θ sin φ. (3.4) shall not discuss here how
this is done.
Replacing φ by −φ in these identities (and using cos(−φ) = cos φ and
sin(−φ) = − sin φ), we obtain
sin(θ − φ) = sin θ cos φ − cos θ sin φ,
cos(θ − φ) = cos θ cos φ + sin θ sin φ.
We can obtain an identity for tan(θ + φ) by dividing those for sin(θ + φ)
and cos(θ + φ):
sin(θ + φ) sin θ cos φ + cos θ sin φ
tan(θ + φ) = = ,
cos(θ + φ) cos θ cos φ − sin θ sin φ
and dividing top and bottom by cos θ cos φ gives
tan θ + tan φ
tan(θ + φ) = ,
1 − tan θ tan φ
All these identities can be useful when manipulating expressions involving
trigonometric functions. One situation where such manipulations are needed
is when performing certain integrations by hand (as you will see in Section 6),
and there expressions for sin 2θ and cos 2θ can be particularly useful. We
ask you to obtain such expressions in the next exercise.
(Note that it is usual to write sin 2θ rather than sin(2θ) — the spacing
makes the meaning clear. However, when using the course computer algebra
package you will have to include the brackets. When there is any danger of
ambiguity, we shall also use brackets in the text.)
26
Section 4 Complex numbers
*Exercise 3.3
(a) By putting φ = θ in the expressions for sin(θ + φ) and cos(θ + φ),
establish the following identities.
(i) sin 2θ = 2 sin θ cos θ (ii) cos 2θ = cos2 θ − sin2 θ
(b) Using trigonometric identities, and particular values of sin and cos,
simplify each of the following.
(i) sin(2π − θ) (ii) cos(2π − θ) (iii) sin(π − θ)
(iv) cos(π − θ) (v) sin( π2 − θ) (vi) cos( π2 − θ)
3π
(vii) cos 2 + x
The most useful identities to remember are (3.2), (3.3) and (3.4), as most
of the others can be derived from these.
4 Complex number s
Complex numbers provide a system within which we can solve any quadratic
equation (and, indeed, any polynomial equation). They are helpful in some
of the mathematical techniques introduced in this course, although the use
we shall make of them is quite limited.
There is no real number x satisfying the equation
x2 = −1.
However, there are circumstances where it is convenient to have a system
of ‘numbers’ in which such an equation can be solved. Such a system is the
system of complex numbers. A complex number is one of the form
Engineers √
commonly use j to
z = a + bi (or, equivalently, z = a + ib), represent −1.
√
where i = −1, and a and b are real numbers. We refer to a as the real part
of z, written Re(z), and to b as the imaginary part of z, written Im(z). Note that Im(z) is the real
A complex number of the form a + 0i is, in effect, just the real number a; number b; Im(z) is not equal
so the real numbers are seen as part of (a subset of) the complex numbers. to bi.
We denote the set of all complex numbers by C. Within C, we can solve any
quadratic
√ equation, since the formula will always give a solution once we
can use −1. For example, the equation x2 − 2x + 2 = 0 has the solutions
√ √ √ √
2 ± 22 − 4 × 2 2 ± −4 2 ± 4 × −1 2 ± 2i
x= = = = = 1 ± i,
2 2 2 2
and the equation x2 = −1 has the solutions x = ±i.
27
Unit 1 Getting started
An nth-order polynomial with real coefficients is a function of the form An nth-order polynomial is
n n−1 sometimes referred to as a
p(x) = an x + an−1 x + · · · + a1 x + a0 , polynomial of degree n.
where an = 0 and each coefficient ak (k = 0, 1, . . . , n) is a constant in R.
Within the complex numbers, any such polynomial can be written as a In fact, this result also holds
product of an and n factors of the form x − ck (k = 1, 2, . . . , n), with each if the coefficients ak are
ck in C. These n factors correspond to the n roots (i.e. solutions) x = ck complex.
(k = 1, 2, . . . , n) of the corresponding polynomial equation p(x) = 0; if a fac-
tor x − c occurs more than once, then the root x = c is a repeated root. Repeated roots are sometimes
In Subsection 2.3 we saw that for second-order (i.e. quadratic) polynomi- referred to as equal roots or
als, repeated roots correspond to perfect squares (i.e. factorizations such as coincident roots.
x2 − 2cx + c2 = (x − c)2 ).
28
Section 4 Complex numbers
√
The modulus of a complex number z = a + bi is a2 + b2 , written |z|, so
the rule for division can be written, for complex numbers u and v, as
u uv
= 2.
v |v|
*Exercise 4.1
Let v = 3 − 4i and w = 2 − i. Evaluate each of the following.
(a) v (b) |v| (c) v − w (d) vw
(e) w/v (f) 1/w (g) w2 (h) 2w − 3v
*Exercise 4.2
Solve (for x in C) the quadratic equation 2x2 + 2x + 1 = 0.
If a quadratic equation with real coefficients has complex roots (as in Exer-
cise 4.2), then these always form a pair of complex conjugates (of the form This follows from the
a ± bi). formula (2.8) for the solution
of a quadratic equation.
Polar coordinates
Polar coordinates provide an alternative way of representing points in We shall use angle brackets to
the plane. Figure 4.1 shows a point A with Cartesian coordinates (x, y) and distinguish polar from
polar coordinates r, θ. The quantity r is the distance from A to the origin, Cartesian coordinates. (This
is not a universal convention.)
so r ≥ 0. The angle θ is measured anticlockwise from the x-axis. (Negative
angles correspond to measuring clockwise from the x-axis.) It is convenient
to allow θ to take any real value, but this has the consequence that the polar y
representation of a point is not unique. For example, r, θ and r, θ + 2π y = r sin q
A (x, y)
provide polar coordinates of the same point. We can see from Figure 4.1
that if a point has polar coordinates r, θ and Cartesian coordinates (x, y), r
then
q
x = r cos θ and y = r sin θ. x = r cos q x
These equations allow us to translate from polar to Cartesian coordinates.
Figure 4.1
To translate from Cartesian to polar coordinates, we can use (see Figure 4.1)
r = x2 + y 2 , cos θ = x/r, sin θ = y/r (r = 0). (4.1) If r = 0, then we can choose
any value for θ.
Equations (4.1) do not have a unique solution for θ in R, but they do have
a unique solution in the range −π < θ ≤ π.
*Exercise 4.3
What are the polar coordinates r, θ of each of the following points, for θ
in the range −π < θ ≤ π?
(a) (−2, 0) (b) (1, 1) (c) (−1, −1)
√
(d) (4, 0) (e) (0, 4) (f ) (− 3, 1)
29
Unit 1 Getting started
Exercise 4.4
If a complex number z has polar form 2, − π4 , what is its Cartesian form?
Exercise 4.5
Express each of the following complex numbers in polar form, choosing the
principal value of the argument.
(a) −2 (b) 1 + i (c) −1 − i
√
(d) 4 (e) 4i (f) − 3 + i
30
Section 5 Differentiation
Complex exponentials
For a complex number z = x + iy, we can define the complex exponential
ez by the formula
ez = ex (cos y + i sin y).
We choose this definition because it works! That is, this complex exponential
behaves, as we would hope, like the real exponential function. In particular,
it retains the property that, for any complex numbers u and v,
eu × ev = eu+v .
In the case when x = 0, the definition of the complex exponential gives
This equation is known as
eiy = cos y + i sin y. Euler’s formula.
This leads us to a third way of expressing a complex number, which is often Leonhard Euler (1707–1783)
convenient. If z has polar form r, θ, then we have was a prolific mathematician,
making many fundamental
z = r(cos θ + i sin θ) = reiθ , contributions to diverse areas
of mathematics and science.
where reiθ is referred to as the exponential form of the complex number z.
In this form, r is the modulus of z and θ is the argument of z. As with the
polar form, the value of θ is not unique, but there is a unique choice of θ in
the range −π < θ ≤ π.
*Exercise 4.7
Let z = r, θ. Use the exponential form of z to find Re(zeiωt ).
5 Differentiation
The concepts and techniques of calculus are central to many of the mathe-
matical methods discussed in this course. In this section, we consider dif-
ferentiation.
31
Unit 1 Getting started
32
Section 5 Differentiation
ds d d d
= 12 (−9.81) (t2 ) + 10 (t) + (2)
dt dt dt dt
= 12 (−9.81)(2t) + 10(1) + 0
= −9.81t + 10
(using the derivative of the standard function tn for n = 2 and n = 1, and the
fact that the derivative of a constant is 0). Similarly, if V = 2 × 106 − 15t,
we find its derived function to be V = −15. This is constant (not dependent
on t), corresponding to the fact that this function has a straight-line graph
(with constant gradient).
There are various notations for derivatives, some of which we have used
above. We shall use whichever is convenient in a particular context. No-
ds
tation expressed purely in terms of variables, such as ds/dt, is referred to In text, may be written as
dt
as Leibniz notation (after its inventor, G. W. Leibniz (1646–1717)). This ds/dt, to save space.
notation is extended to write, for example,
d d
(3t + 5 sin 2t) or (ax + bx2 ).
dt dx
Leibniz notation is sometimes a little clumsy, and may be inconvenient in
situations where the role of functions is prominent. There the more modern
function notation of adding a prime ( ) to the function name is preferred. We
can then write, for example, f (3) to mean the value of the derived function
of f at 3. Sometimes we find it convenient to mix function and variable
names, and write, for example, s , rather than introducing a separate name We may sometimes write s (t)
for the function relating the variables s and t. When using Leibniz notation, if we want to emphasize that
ds s is a function of t.
we may sometimes write (t) to emphasize that this derivative is a function
dt
ds
of t, or (4) to mean the value of the derivative when t = 4. Some of the
dt
simpler forms of notation are open to ambiguity if used in inappropriate
contexts, and at times we need to be careful about how we express things,
for example by using the function notation in a precise manner.
Differentiation of a derivative produces the so-called ‘higher’ derivatives.
If, for example, we have f (x) = x3 + 5x, then differentiation gives f (x) =
3x2 + 5. This derivative is itself a function of x, and can be differentiated
again. This gives the second derivative as f (x). (In the above example, The derivative dy/dx is
d dy sometimes referred to as the
f (x) = 6x.) In Leibniz notation, we write the second derivative, , first derivative.
dx dx
d2 y
as . Differentiating yet again leads to the third derivative, written
dx2
3
d y
, or f (x), which may also be written f (3) (x). (f (x) = 6 in this case.)
dx3
The process can be continued, and a general nth derivative may be written
dn y
as or f (n) (x), where n is referred to as the order of the derivative.
dxn
There is one final piece of notation to mention. We so often find that time
(habitually denoted by t) is the independent variable that there is a separate
notational convention for differentiation with respect to time. We use a dot This notation is attributed to
over the variable to indicate a first derivative with respect to t, and two dots Isaac Newton (1642–1727),
to indicate a second derivative. So if x(t) is the position of an object as a and so is sometimes referred
to as Newtonian notation.
function of time t, then ẋ(t) means the same as x (t), and is the velocity of
¨ means the same as x (t), and is its acceleration.
the object, while x(t)
33
Unit 1 Getting started
Exercise 5.1
Suppose that an object is moving in a straight line so that its position x
(measured from a chosen origin) is related to time t by the equation
x = 5 + 7 cos(3t + 2).
(a) Find expressions in terms of t for the velocity ẋ(t) and acceleration ẍ(t)
of the object.
(b) Use the above equation to eliminate t from your expression for ẍ(t), and
hence find a relationship between the position and acceleration of the
object that holds at all times.
Exercise 5.2
The weekly wage bill of a company, t years in the future, is projected to be
£B, where
B = 105 exp(0.04t).
Find an expression for the rate at which the wage bill will be rising in t years’
time. What will this rate of rise be as a percentage of the wage bill at the
time?
*Exercise 5.3
Calculate the following derivatives.
dy
(a) , where y = 1 − 0.9 exp(−0.5x).
dx
(b) F (2), where F (x) = 3x4 − 4x + 1.
d2 y
(c) , where y = ln t (t > 0).
dt2
(d) F ( π6 ), where F (x) = 3 sec(2x) − 4 cos(−3x).
(e) g (0), where g(t) = a cos(3t + φ) + b sin(3t + φ) (and a, b and φ are
constants).
Exercise 5.4
Calculate the following derivatives.
(a) v (z), where v = 3 tan z + 2 cos z.
dy π
(b) at t = 12 , where y = A sin 3t + B cos 3t (and A and B are constants).
dt
(c) f (4) ( π2 ), where f (t) = 2 sin 3t.
(d) f (y), where f (y) = arctan(3y).
dz
(e) when x = 0, where z = ln(cx + d) (and c and d are constants, with
dx
d > 0).
34
Section 5 Differentiation
f f g − f g
Quotient Rule = . A useful way of remembering
g g2 this rule is: derivative of top
Or, in Leibniz notation, times bottom, minus top
times derivative of bottom, all
du dv over bottom squared.
d u dx
v−u
dx
= .
dx v v2
Example 5.1
Find h (x), where h(x) = x3 cos 2x.
Solution
The function h(x) is a product, f (x)g(x), with f (x) = x3 , g(x) = cos 2x.
We have
f (x) = 3x2 and g (x) = −2 sin 2x.
So, using the Product Rule, we have
h (x) = 3x2 cos 2x − 2x3 sin 2x.
Exercise 5.5
dy ln x
*(a) Find , where y = 2 .
dx x +1
*(b) Find f (t), where f (t) = t5 ln(3t + 4).
(c) Find g (0) (in terms of the constants A, B and C), where
g(t) = (At + B) sin(At + C).
*(d) If the position of an object at time t is given by e−3t sin 4t, find its
velocity and acceleration.
35
Unit 1 Getting started
Example 5.2
Find f (x), where f (x) = sin3 x.
Solution
If we let u = sin x, then we have f (x) = u3 . The recognition of sin3 x as a
composite function, and of
We then have how to break it down into two
df du parts, each consisting of a
f (x) = = 3u2 cos x = 3 sin2 x cos x, standard function, is the key
du dx
to differentiating it.
replacing the variable u by sin x.
Your proficiency with differentiation will depend on your experience and will
develop with practice. It will be helpful for your study of MST209 if you
are able to differentiate expressions such as that in Example 5.2 without
recourse to the computer algebra package for the course — and, ideally,
without even needing to refer to the Handbook. But such proficiency may
take some time to develop, and while it is developing feel free to check your
work using the computer.
36
Section 5 Differentiation
Exercise 5.6
Use the Composite Rule to differentiate each of the following.
*(a) y = exp(t2 ) (b) f (x) = (3x3 + 4)6
√
*(c) z = tan(3v + 4) (d) g(z) = 4 − z 2
1
*(e) f (x) = √ 3
1 + 2x2
Exercise 5.7
Differentiate the following functions. These differentiations involve
x more than one rule.
(a) y = sec (b) z = t2 exp(t3 + 1)
x2 + 1
Suppose that we want to find the gradient at the point (2, 1) of the tangent
to the ellipse with equation
x2 + 4y 2 = 8. (5.2)
We want dy/dx at x = 2. We could start by expressing y as a function
of x, but a more convenient approach is to differentiate the equation as it
stands. To differentiate y 2 with respect to x, we use the Composite Rule,
d(y 2 ) dy dy
and obtain = 2y . So, differentiating both sides of Equation (5.2)
dy dx dx
with respect to x, we obtain
dy
2x + 4(2y) = 0.
dx
When x = 2 and y = 1, this gives 4 + 8 dy/dx = 0, so dy/dx = − 12 . There-
fore the gradient of the tangent to this ellipse at (2, 1) is − 12 .
Differentiation with respect to x of an expression such as x2 + 4y 2 , where y
is a function of x, is known as implicit differentiation.
*Exercise 5.8
(a) Use the Product and Composite Rules to find the following in terms of
x, y and dy/dx.
d 2 d 3
(i) (x y) (ii) (y )
dx dx
(b) Find the gradient at the point (−1, 1) of the tangent to the curve
x3 + x2 y + y 3 = 1.
37
Unit 1 Getting started
Exercise 5.9
Find the second derivative of the function f (t) = cos 2t + i sin 2t.
38
Section 5 Differentiation
y
A
local
maximum gradient is 0,
but neither
maximum nor
minimum
local
minimum
x
global
B minimum
Figure 5.2
*Exercise 5.10
Find any stationary points of the function
1
y(x) = 5 − 2(x + 1)e− 2 x (x ≥ 0).
Classify these as local mimima or local maxima or neither, and evaluate
y(x) at these points.
Example 5.3
Suppose that
(x2 − 3)y = x − 2.
Sketch a graph of y against x.
Solution
x−2
We have y = , but need to note that this expression for y is not
x2 − 3 √
defined if x2 − 3 = 0, i.e. if x = ± 3. We can see that y = 0 if (and only if)
x = 2, so the graph crosses the x-axis at this one point. If x is large (positive
or negative), then y will be close to zero.
39
Unit 1 Getting started
Again
√ we refer to the
√ lines
x = 3 and x = − 3 as
asymptotes of this graph.
– 3 1 3 2 3 x
Figure 5.3
A continuous function is one whose graph can be drawn without lifting your
(x − 2)/(x2 − 3) is not continuous
pen from the paper. The function√f (x) = √
on any domain containing either 3 or − 3.
A smooth function is continuous and has a continuous derivative. For
example, the function
3x, x ≥ 0,
f (x) =
−2x, x < 0,
is continuous, but is not smooth (since its derivative is not continuous at
x = 0).
Exercise 5.11
(a) Sketch a graph of the function
v
f (v) = (v ≥ 0).
4 + 1.5v + 0.008v 2
In particular, find: any values of v for which f (v) is zero; any values
of v for which f (v) is not defined; and any local maxima or minima of
f (v). Also, indicate how f (v) behaves as v becomes large.
(b) Find the global maximum and minimum of f .
40
Section 6 Integration
6 Integration
Subsection 6.1 provides a reminder of the basic idea of integration as ‘revers-
ing differentiation’. Subsection 6.2 discusses how we may calculate relatively
simple integrals by hand. In Subsection 6.3, we look at two techniques for
finding more complicated integrals by hand.
As well as ‘reversing differentiation’, integrals also arise as the limits of
certain sums. In Subsection 6.4, we see how this can lead to integrals arising
in models.
s= (5t + 7) dt.
To evaluate this integral, you can use the table of standard integrals in the
Handbook. These show that t dt = 12 t2 and 1 dt = t, and on integration
we obtain
s = 52 t2 + 7t + c, (6.2)
where c may be any constant. To confirm this, note that with s given The constant c is often
by (6.2), we have referred to as an arbitrary
constant or a constant of
ds d 5 2 integration.
= 2 t + 7t + c = 5t + 7,
dt dt
as required by (6.1). Since c may be any constant, we see that the differential
equation (6.1) does not have a unique solution.
Generalizing, suppose that f is a known function, and
F (x) = f (x).
We write the general solution of this differential equation as
41
Unit 1 Getting started
exp(−x2 ) dx.
At first sight, this might seem no harder a problem to solve than (6.3).
In fact, however, it is impossible! To be more precise, there is no simple
combination of the standard functions (polynomials, sin, cos, exp and ln)
that when differentiated gives exp(−x2 ).
Finding explicit expressions for integrals is a much harder task than finding
derivatives. The rules of differentiation ensure that we can, in principle,
find an explicit expression for the derivative of any combination of standard
functions. The equivalent is not true for integrals. What is more, even
where integrals can be found, this may be a messy process. The methods
whereby particular functions can be integrated depend on recognizing what
happens to work in various particular cases.
Integrals are readily foundif they appear
in7 the table of standard integrals
2x
in the Handbook (as do e dx and x dx, for example). So are the
integrals
of constant multiples and sums of constant multiples of these, such
as (4e2x + 9x7 ) dx. There are also integration techniques that enable you
to find some more complicated integrals. Integration by substitution is based
on the rule for differentiating composite functions, while integration by parts
is based on the rule for differentiating a product. There are brief reminders
of these two techniques in Subsection 6.3.
The table of standard integrals in the Handbook contains quite a wide se-
lection of integrals. Some of these integrals are deduced from the table of
standard derivatives, others from using integration by parts or substitution.
You can regard them all as the fruit of others’ experience, and draw on
them as needed. The correctness of an integral, obtained either by using
the Handbook or from the computer algebra package for the course, can be
verified by differentiation.
*Exercise 6.1
Use differentiation to verify that the following integrals are correct (where
a = 0 is a constant and c is an arbitrary constant).
x 1
(a) x sin ax dx = − cos ax + 2 sin ax + c
a a
1 π
(b) tan ax dx = − ln(cos ax) + c − 2 < ax < π2
a
42
Section 6 Integration
*Exercise 6.2
(a) Use implicit differentiation (and a trigonometric formula) to show that
if x = tan y, then
dy 1
= .
dx 1 + x2
1
Hence confirm that dx = arctan x + c.
1 + x2
(b) Use implicit differentiation of x = sin y (and a trigonometric formula) Here −1 < x < 1,
to deduce an expression for the indefinite integral − π2 < y < π2 .
1
√ dx.
1 − x2
Check that your result agrees with the one in the Handbook.
Your first recourse for finding an integral by hand is the table of standard Or, preferably, use your
integrals in the Handbook. If the integrals of functions f and g are known, memory!
then the integral of af + bg, where a and b are constants, is readily found,
using the rule
= 4( 12 e2x ) + 9( 81 x8 ) + c
= 2e2x + 98 x8 + c.
Sometimes algebraic manipulation can transform an expression to be inte-
grated into a more amenable form. For example, the manipulation
3x2 + 2x 3x2 2x
√ = √ + √ = 3x3/2 + 2x1/2
x x x
transforms the expression on the left into a sum of constant multiples of func-
tions in the Handbook table. Less obvious transformations can be achieved
using trigonometric formulae. For example, using cos 2x = cos2 x − sin2 x
(see Exercise 3.3(a)(ii)) and sin2 x + cos2 x = 1 (Identity (3.2)), we obtain
cos 2x = 2 cos2 x − 1. Rearranging this gives
cos2 x = 21 (1 + cos 2x),
which enables us to integrate cos2 x.
43
Unit 1 Getting started
Exercise 6.3
Use the identity
cos2 ax = 21 (1 + cos 2ax)
*Exercise 6.4
Find the following integrals. Use the standard integrals
1 given in the Handbook as
(a) e5x dx (b) 6 sec2 (3t) dt (c) dv necessary.
36 + 4v 2
1 1
(d) dy y < 32 (e) dy y > 32
3 − 2y 3 − 2y
Exercise 6.5
Find the following integrals. Use the standard integrals
1 given in the Handbook as
(a) (6 cos(−2t) + 8 sin 4t) dt (b) √ dt (−3 < t < 3) necessary.
9 − t2
5t3 + 7 2
(c) dt (t < 0) (d) 2 ln(4t) − dt (t > 0)
t t
1
(e) dx (−1 < x < 1)
(x − 1)(x + 1)
The next example again uses a standard integral from the Handbook, but
requires careful matching of parameters and attention to domains.
Example 6.1
1 1
For A > 0, x > , find dx.
A x(1 − Ax)
Solution
We can match the integrand with that of a standard integral by writing it
in the form
1 −1
= .
x(1 − Ax) A(x − 0)(x − A1 )
44
Section 6 Integration
1
The Handbook gives the integral dx for b < a. To match
(x − a)(x − b)
that, choose b = 0 and a = A1 . Since x > A1 , we must choose the standard
integral for the case a < x. So we obtain
1 1 1
dx = − dx
x(1 − Ax) A (x − 0)(x − A1 )
1 1 x − A1
=− ln +c
A A1 − 0 x−0
x − A1
= − ln +c
x
x Ax
= ln + c = ln + c.
x− A 1 Ax − 1
*Exercise 6.6
1
(a) For k > 0, −k < v < k, find dv.
v2 − k2
(Hint : Remember that v 2 − k 2 = (v − k)(v + k).)
a a 1
(b) For a > 0, b > 0, − <v< , find dv.
b b a − bv 2
In this subsection, we look briefly at two methods for evaluating more com-
plicated integrals. You will use these methods later in the course. For
now, it is particularly useful to recognize the type of integral illustrated in
Equation (6.5).
Integration by substitution
The formula for integration by substitution is
45
Unit 1 Getting started
Example 6.2
Find x sin(2 + 3x2 ) dx.
Solution
Let u = 2 + 3x2 , so du/dx = 6x. In (6.4), we are taking
g(x) = 2 + 3x2 .
We have
x sin(2 + 3x2 ) dx = 1
6 6x sin(2 + 3x2 ) dx Now g (x) = 6x.
1 du
= 6 sin u dx
dx
1 du
= 6 sin u du Note how, in effect, dx is
dx
replaced by du.
= − 16 cos u + c
= − 16 cos(2 + 3x2 ) + c,
substituting at the end for u in terms of x.
*Exercise 6.7
Find the following integrals.
46
Section 6 Integration
Integration by parts
The formula for integration by parts is
Example 6.3
Find xe−2x dx.
Solution
In the formula, take f (x) = x and g (x) = e−2x . Then f (x) = 1 and
g(x) = − 12 e−2x , so Note that an arbitrary
constant need not be included
xe−2x dx = x(− 12 e−2x ) − 1(− 21 e−2x ) dx in the expression for g(x).
= − 12 xe−2x + 1
2 e−2x dx
= − 12 xe−2x − 41 e−2x + c
= − 14 (2x + 1)e−2x + c.
*Exercise 6.8
(a) Use integration by parts to find xe−x dx.
(b) Find x2 e−x dx. (Use integration by parts, then the result of part (a).)
4 − 4) − ( 4 − 2) = 4 .
= ( 16 1 7
47
Unit 1 Getting started
4 − 4) − (0 − 0) = 0,
= ( 16
or use parts (a) and (b):
2 1 2
(x3 − 2) dx = (x3 − 2) dx + (x3 − 2) dx
0 0 1
= − 74 + 7
4 = 0.
Exercise 6.9
3/2
1
Evaluate dz.
0 9 + 4z 2
b
A rough-and-ready way of thinking of a definite integral a f (x) dx is as ‘the
accumulation of the values taken by f (x) as x runs from a to b’. This ties
up with a useful way of visualizing definite integrals, as areas. If f (x) ≥ 0
b
for a ≤ x ≤ b, then the definite integral a f (x) dx is equal to the area under
the graph of f (x) between x = a and x = b (see Figure 6.1(a)). There is one
point to be careful about here. If f (x) < 0, corresponding to a region below
the x-axis, then we have a negative contribution to the integral, whereas
area is always a positive quantity. Thus for a function f as pictured in
b
Figure 6.1(b), a f (x) dx = area A1 − area A2 .
y y
y = f (x)
A1
A2
a b x
a b x y = f (x)
(a) (b)
b
Figure 6.1 a
f (x) dx as an area: (a) with f (x) > 0; (b) in general.
48
Section 6 Integration
Example 6.4
An island is modelled as a circle of radius 500 metres. The density of nests
is greatest on the edge of the island, and least at the centre. (The birds
prefer ready access to the sea.) The density is modelled as D, measured in
nests per square metre, where
D = 0.1 + r/500 = 0.1 + 0.002r,
where r is the distance from the centre of the island, measured in metres.
Estimate the number of nests on the island.
Solution
50
Imagine the island divided by concentric circles into narrow ‘annular’ strips,
0
m
each of width δr. Figure 6.2 shows a ‘typical’ strip, between circles of radius δr
r and r + δr.
r
The area of this typical strip is approximately 2πrδr (‘length × width’). The
number of nests within this strip is (the area) × (the density of nests), and
so is approximately
2πrδrD = 2πr(0.1 + 0.002r)δr.
The total number of nests on the island is the sum of the number of nests
in all the strips. If we take the limit of this sum as δr → 0, then the sum Figure 6.2
converges to the definite integral of 2πr(0.1 + 0.002r) between r = 0 (the
centre of the island) and r = 500 (at the edge). That is, an estimate of the
number of nests is
500
2πr(0.1 + 0.002r) dr = 6.021 × 105 ,
0
so there are approximately 600 000 nests.
End-of-section Exercise
Exercise 6.10
Find the following integrals (where a is a constant).
500
(a) 2πr(0.1 + 0.002r) dr (b) exp(a − 2y) dy
0
1 −2
1 1
(c) du (d) du (e) ln(3t + a) dt
0 3u + 5 −3 3u + 5
π
4 sin 2x
(f) t cos(3t + a) dt (g) dx
0 3 + cos 2x
49
Unit 1 Getting started
7 Computer activities
This section demonstrates how the computer algebra package for the course
can assist in performing various tasks that have been studied in the unit. In
particular, we can easily perform algebraic manipulations, such as simpli-
fying expressions and solving quadratic equations. We can also solve equa-
tions numerically when an algebraic solution cannot be found, and handle
the arithmetic of complex numbers. Turning to calculus, you will see how
to produce graphs, and how to differentiate and integrate.
Activity 7.2
Expand each of the following.
(a) (x + 2)3 (b) (2x + 3)10
Activity 7.3
Suppose that ln y = 2.83 ln x + 0.37. Express y as a function of x.
Activity 7.4
(a) Solve for x the following equations.
(i) 2x2 + 7x − 4 = 0 (ii) x2 + x − 6 = 0 (iii) x2 − 2x + 2 = 0
(b) Solve for x the quadratic equation abx2 − (a + b)x + 1 = 0.
Activity 7.6
Evaluate the following.
(a) (1 + i)(1 − 2i) (b) |4 + i|
50
Section 7 Computer activities
Activity 7.7
(a) Expand (cos x + i sin x)4 . Find Re (cos x + i sin x)4 , and hence use De
Moivre’s Theorem to obtain an expression for cos 4x in terms of sin x
and cos x.
(b) Expand cos 4x. Compare the result with your answer to part (a).
Activity 7.8
Consider the function
y(x) = 5 − 2(x + 1) exp(− 12 x) (x ≥ 0).
(a) Obtain a graph of y(x) against x.
(b) How does y(x) appear to behave as x becomes large?
(c) Can you find the global maximum and global minimum values taken by You may wish to refer back to
y(x)? Exercise 5.10 in answering
this.
Activity 7.9
dy
(a) Find , where y = 1 − 0.9 exp(−0.5x). The functions in parts (a)
dx and (c) were also considered
dy x in Exercise 5.3.
(b) Find , where y = 2 .
dx x +1
d2 y
(c) Find 2 , where y = ln t.
dt
Activity 7.11
Evaluate the following definite integrals, first numerically and then symbol-
ically.
4 1
1
(a) dx (b) exp(−x2 ) dx
1 x 0
Activity 7.12
Evaluate the following.
b 2
1 1
(a) dx, symbolically (b) dx, numerically
a x −1 x
51
Unit 1 Getting started
Outcomes
After studying this unit you should be able to do the following.
• Interpret the following notation: scientific notation, Z, R, C, [a, b], (a, b],
[a, b), (a, b), |x|, f (x), exp(x), ex , ln x,√sin x, cos x, tan x, sec x, cosec x,
cot x, arccos x, arcsin x, arctan x, i (= −1), Re(z), Im(z), z, |z|, ez (for
dy d2 y dn y
z in C), r, θ, lim f (h), f (x), f (x), f (n) (x), , , , ẋ(t), x(t),
¨
b h→0 dx dx2 dxn
f (x) dx, a f (x) dx, [F (x)]ba .
• Interpret the following terminology: integer, real number, interval, deci-
mal places, significant figures, rounding; variable, dependent variable, in-
dependent variable, parameter; function, domain, image set, codomain;
linear function, quadratic function, polynomial function (and root of a
polynomial equation); exponential function, logarithm function, power
function, trigonometric function; composite function; complex number,
complex conjugate, real and imaginary parts (of a complex number),
modulus and argument (of a complex number); polar coordinates, polar
form and exponential form (of a complex number), De Moivre’s Theo-
rem, Euler’s formula; differentiation, derivative, Leibniz notation, Chain
Rule (for differentiation), implicit differentiation, higher derivative; gra-
dient of a function, stationary point, local maximum, local minimum,
global maximum, global minimum, derivative of a complex-valued func-
tion; continuous function; integration, integral, integrand, indefinite in-
tegral, definite integral, arbitrary constant.
• Use the formulae for: the solution of a quadratic equation; the alge- The formulae are all given in
braic properties of indices (and the exponential function); the algebraic the Handbook.
properties of logarithm functions; various trigonometric identities; mul-
tiplying complex numbers in polar form; finding powers in polar form;
derivatives of standard functions; differentiating products, quotients and
composite functions; standard integrals; integration by parts and by sub-
stitution.
• Solve two simultaneous linear equations by Gaussian elimination.
• Factorize quadratic functions.
• Sketch the graphs of linear, quadratic, exponential and trigonometric
functions.
• Sketch the graphs of more general functions, including identifying sta-
tionary points and asymptotes.
• Use log–linear plots to recognize relationships of the form
y = Aekx .
• Use log–log plots to recognize relationships of the form y = Axb .
• Add, subtract, multiply and divide complex numbers, and move between
Cartesian, polar and exponential forms of a complex number.
• Find derived functions, using the table of standard derivatives and the
rules for differentiating various types of combination of functions.
• Find indefinite integrals, using the table of standard integrals and (in
simple cases) the rules for integration by substitution and for integration
by parts.
• Find definite integrals (of suitable functions).
• Use the computer to: simplify and expand algebraic expressions, graph
a function, solve an equation (numerically or symbolically), find deriva-
tives, find integrals.
52
Solutions to the exercises
53
Unit 1 Getting started
54
Solutions to the exercises
w wv (2 − i)(3 + 4i) 10 5 2
(e) = 2 = = 25 + 25 i = 5 + 51 i
y v |v| 25
y = sin q 1 w 2+i 2
0.8 (f ) = 2
= = 5 + 51 i
w |w| 5
(g) w2 = (2 − i)(2 − i) = 3 − 4i
–π 0 0.93 π π –0.93 π 2π q (h) 2w − 3v = 4 − 2i − (9 − 12i) = −5 + 10i
2
4.2 We obtain
√
−2 ± 4−8
x= = − 12 ± 21 i.
4
(b) We saw in Exercise 3.1(c) that tan π4 = 1, so θ = π4
is one solution. We can see from the graph of tan (Fig-
4.3
ure 3.5) that there is one solution of this equation in the y
range − π2 to π2 , and that other solutions are obtained (0,4)
by adding multiples of π to this. We can express the
full set of solutions as
π
4+ nπ,
where n ∈ Z.
(− 3,1) (1,1)
3.3 (a) (i) sin 2θ = sin(θ + θ) π
f 4
= sin θ cos θ + cos θ sin θ (–2,0) (4,0) x
π
= 2 sin θ cos θ (–1,–1) 4
55
Unit 1 Getting started
56
Solutions to the exercises
5.8 (a) (i) The Product Rule gives (b) From the graph we see that the global√maximum
d 2 dy of f occurs at the local maximum, i.e. v = 500. The
(x y) = 2xy + x2 . global minimum occurs at the endpoint of the domain,
dx dx
i.e. v = 0.
(ii) The Composite Rule gives
d 3 dy
(y ) = 3y 2 .
dx dx
(b) Using implicit differentiation, we obtain Section 6
dy dy
3x2 + 2xy + x2 + 3y 2 = 0. In all solutions for this section, c is an arbitrary con-
dx dx
When x = −1 and y = 1, this gives stant.
dy dy
3−2+ +3 = 0. 6.1 (a) Using the Product Rule for derivatives,
dx dx
Hence dy/dx = − 14 , and the required gradient is − 14 . d x 1
− cos ax + 2 sin ax + c
dx a a
1 x a
5.9 We have f (t) = −2 sin 2t + 2i cos 2t. = − cos ax + − (−a sin ax) + 2 cos ax
a a a
Then f (t) = −4 cos 2t − 4i sin 2t. = x sin ax.
Therefore
5.10 To test for stationary points, use the Product x 1
x sin ax dx = − cos ax + 2 sin ax + c,
Rule to find a a
1 1 1
y (x) = −2e− 2 x + 12 (2(x + 1))e− 2 x = (x − 1)e− 2 x . so verifying the given integral.
So y (x) = 0 only at x = 1. The derivative is negative if (b) Using the Composite Rule for derivatives,
x < 1 and positive if x > 1, so this is a local minimum. d 1
We have y(1) = 2.574 (to three decimal places). − ln(cos ax) + c
dx a
1 1 d
= − (cos ax)
5.11 (a) f (v) = 0 only when v = 0. a cos ax dx
The denominator 4 + 1.5v + 0.008v 2 is positive for all 1 −a sin ax
= −
v ≥ 0, so f (v) is defined for all v ≥ 0. a cos ax
As v → ∞, f (v) → 0. = tan ax,
To find any stationary points, differentiate f (v) using so verifying the given integral.
the Quotient Rule, to obtain
(4 + 1.5v + 0.008v 2 ) − v(1.5 + 0.016v) 6.2 (a) If x = tan y, then, differentiating with respect
f (v) = to x, we obtain
(4 + 1.5v + 0.008v 2 )2
d d dy dy
4 − 0.008v 2 1= (tan y) = (tan y) = sec2 y .
= . dx dy dx dx
(4 + 1.5v + 0.008v 2 )2
√ Then, using the formula sec2 y = 1 + tan2 y, and the
This is 0 when v = ± 500 = ±22.36 (to two decimal fact that x = tan y, we obtain
places).
dy dy
The negative stationary point is outside the domain 1 = (1 + tan2 y) = (1 + x2 ) .
dx dx
(v ≥ 0), so we need consider only v = 22.36. For Therefore
v < 22.36, f (v) > 0, while for v > 22.36, f (v) < 0. dy 1
Therefore v = 22.36 is a local maximum. We have = .
dx 1 + x2
f (22.36) = 0.538 (to three decimal places). Now, if x = tan y, then y = arctan x, so
A sketch graph of f is shown below. 1
dx = arctan x + c.
1 + x2
57
Unit 1 Getting started
58
Solutions to the exercises
(e) If u = 1 + sin2 t, then du/dt = 2 sin t cos t = sin 2t, (b) Since exp(a − 2y) = ea e−2y , we have
using the trigonometric formula for sin 2t. So the in-
tegrand is of the form g (x)/g(x) with g(x) > 0, and exp(a − 2y) dy = ea e−2y dy
hence
= − 12 ea e−2y + c
sin 2t
dt = ln(1 + sin2 t) + c. = − 21 exp(a − 2y) + c.
1 + sin2 t
(f ) Using Equation (6.5) with g(y) = 1 − y 2 , so that (The same result can be obtained using integration by
g (y) = −2y, we have (for y =
±1) substitution with u = a − 2y.)
y −2y (c) For 0 ≤ u ≤ 1, we have 3u + 5 > 0, so
dy = − 12 dy = − 12 ln |1 − y 2 | + c. 1
1 − y2 1 − y2 1 1
du = 13 ln(3u + 5) 0
0 3u + 5
= 13 (ln 8 − ln 5) = 13 ln 85 .
6.8 (a) Take f (x) = x and g (x) = e−x , so f (x) = 1
and g(x) = −e−x . Then (d) For −3 ≤ u ≤ −2, we have 3u + 5 < 0, so
−2 −2
1
xe−x dx = x(−e−x ) − (−e−x ) dx du = 13 ln(−3u − 5) −3
−3 3u + 5
3/2 3/2
(g) We have
1 1 1
6.9 dz = 4 9 dz d
(3 + cos 2x) = −2 sin 2x
0 9 + 4z 2 0 4 + z2 dx
3/2 and 3 + cos 2x > 0 for all x, so
1 1 x
= arctan π π
4 3 3 4 sin 2x 4 −2 sin 2x
2 2 0 dx = − 12 dx
= 16 (arctan 1 − arctan 0) 0 3 + cos 2x 0 3 + cos 2x
π
59
UNIT 2 First-order differential equations
1
61
Unit 2 First-order differential equations
Introduction
An important class of the equations that arise in mathematics consists of
those that feature the rates of change of one or more variables with respect to
one or more others. These rates of change are expressed mathematically by
derivatives, and the corresponding equations are called differential equations.
Equations of this type crop up in a wide variety of situations. They are
found, for example, in models of physical, electronic, economic, demographic
and biological phenomena.
First-order differential equations, which are the particular topic of this unit,
feature derivatives of order one only; that is, if the rate of change of variable
y with respect to variable x is involved, the equations feature dy/dx but not
d2 y/dx2 , d3 y/dx3 , etc.
When a differential equation arises, it is usually an important aim to solve
the equation. For an equation that features the derivative dy/dx, this entails
expressing the dependent variable y directly in terms of the independent
variable x. The solution process requires the effect of the derivative to be
‘undone’. The reversal of differentiation is achieved by integration, so it is
to be expected that integration will feature significantly in the methods of
solution for differential equations.
Integration can be attempted either symbolically, to obtain an exact formula
for the integral, or numerically, to give approximate numerical values from
which the integral can be tabulated or graphed. The same two approaches
can therefore be tried to obtain solutions of differential equations, and both
are introduced in this unit.
Section 1 considers in detail one example of how a differential equation
arises in a mathematical model. This is followed by some basic definitions
and terminology associated with differential equations and their solutions.
We also note how errors and accuracy are defined.
Section 2 starts by looking at the direction field associated with a first-order
differential equation. This is a device for visualizing the overall behaviour of
the differential equation and of its solutions, and leads to a basic numerical
method of solution known as Euler’s method. Both direction fields and
Euler’s method are implemented in a computer subsection.
Section 3 turns to analytic (that is, symbolic) methods of solution, consid-
ering first direct integration and then separation of variables.
Section 4 describes a further analytic approach to solving differential equa-
tions, called the integrating factor method. It applies only to equations that
are linear. Linear first-order differential equations are important in their
own right, but also give valuable clues on how to solve linear second-order
differential equations, which are the subject of Unit 3.
In Section 5 you will see how each of the analytic methods from Sections 3
and 4 can be implemented on your computer.
62
Section 1 Some basics
1 Some basics
Subsection 1.1 develops a mathematical model for a specific situation which
leads naturally to a first-order differential equation. A key step in deriving
this equation is to apply the input–output principle, which is a useful device
for building relations between variables.
Subsection 1.2 addresses what is meant by the term ‘solution’ in the context
of first-order differential equations, and brings out the distinction between
the general solution and the various possible particular solutions. The spec-
ification of a constraint, or initial condition, usually permits us to find a
unique function that is a particular solution of the differential equation and
also satisfies the constraint.
The short Subsection 1.3 provides the definition and description of numerical
errors, in anticipation of Euler’s method in Section 2.
63
Unit 2 First-order differential equations
This principle applies to any quantity whose change, over a given time in-
terval, is due solely to the specified input and output.
The accumulation δP of population over the time interval δt is the popu-
lation at the end of the interval minus the population at the start of the
interval; that is,
δP = P (t + δt) − P (t).
The input is the number of births (Equation (1.1)), and the output is the
number of deaths (Equation (1.2)). The input–output principle now enables
us to express the accumulation δP of the population over the time interval
δt as
δP bP (t)δt − cP (t)δt = (b − c)P (t)δt.
Dividing through by δt, we obtain
δP
(b − c)P (t).
δt
The approximations involved in deriving this equation become progressively
more accurate for shorter time intervals. So, finally, by letting δt tend to
zero, we obtain This is the step that requires
P to be a continuous (rather
dP
= (b − c)P (t). than discrete) function of t.
dt
(This follows because
dP P (t + δt) − P (t)
= lim
dt δt→0 δt
is the definition of the derivative of P .)
This is a differential equation because it describes dP /dt rather than the
eventual object of our interest (which is P itself). The purpose of this unit
is to enable you to solve a wide variety of such equations.
Of course, we can simplify the above equation slightly by using the pro-
P
portionate growth rate r, which is the difference between the proportionate
birth and death rates: r = b − c. Then our model becomes
dP
= rP.
dt r>0
For very simple population models, r is taken to be a constant. As we shall
see, this leads to a prediction of exponential growth (or, if r < 0, decay)
in population size with time, as illustrated in Figure 1.1. This may be a t
very good approximation for certain populations, but it cannot be sustained
indefinitely if r > 0. Figure 1.1
In practice, both the proportionate birth rate and the proportionate death
rate will vary, and so therefore will the proportionate growth rate. It turns
out to be convenient to model these changes as being dependent on the
population size, so that the proportionate growth rate r becomes a function
of P . The justification for this is as follows. When the population is low, one
may assume that there is potential for it to grow (assuming a reasonable
environment). The proportionate growth rate should therefore be high.
However, as the population grows, there will be competition for resources.
64
Section 1 Some basics
Thus the proportionate growth rate will decline, and in this way unlimited
(exponential) growth does not occur.
A particularly useful model arises from taking r(P ) to be a decreasing linear
function of P . We shall write this as
P You will see later why this
r(P ) = k 1 − , (1.3) particular form is chosen.
M
where k and M are positive constants. Looking at this formula, you can
see that the proportionate growth rate r decreases linearly with P , from the
value k (when P = 0) to 0 (when P = M ).
Using this expression for r, the above differential equation satisfied by P
becomes
dP P
= kP 1 − . (1.4)
dt M
This is well known to biologists as the logistic equation — we shall consider
it further in Section 2, and see how to solve it in Section 3. For now, we have
achieved our objective of showing that differential equations arise naturally
in modelling the real world.
Exercise 1.1
Suppose that a population obeys the logistic model (with the proportionate
growth rate given by Equation (1.3)), and that you are given the following
information. When P = 10 the proportionate growth rate is 1, and when
P = 10 000 the proportionate growth rate is 0. Find the corresponding
values of k and M .
65
Unit 2 First-order differential equations
dy
= f (x, y). Equation (1.4) is of this form,
dx P
The right-hand side here stands for an expression involving both, either or with f (t, P ) = kP 1 − .
M
neither of the variables x and y, but no other variables and no derivatives.
According to the definition above, a function has to satisfy a differential
equation in order to be regarded as a solution of it. The differential equation
is satisfied by the function provided that when the function is substituted This substitution includes the
into the equation, the left- and right-hand sides of the equation give an requirement that the function
identical expression. should be differentiable
(i.e. that it should have a
You are asked to verify in the next exercise that several functions are solu- derivative) at all points where
tions of corresponding first-order differential equations. Later in the unit, it is claimed to be a solution.
66
Section 1 Some basics
you will see how all of these differential equations may be solved; but even
when a solution has been deduced, it is worth checking in the manner of this
exercise (i.e. by substitution) that the supposed solution is indeed correct.
Exercise 1.2
Verify that each of the functions given below is a solution of the correspond-
ing differential equation.
dy
*(a) y = 2ex − (x2 + 2x + 2); = y + x2 . Remember that an asterisk
dx denotes an exercise (or part
dy of one) that is considered
(b) y = 12 x2 + 32 ; = x.
dx particularly important.
2
*(c) u = 2ex /2 ; u = xu.
27 − x2 √ √ dy x
(d) y = (−3 3 < x < 3 3); =− (y = 0). Note that the restriction
3 dx 3y y= 0 placed on the
*(e) y = t + e−t ; ẏ = −y + t + 1. differential equation in
part (d) is necessary to ensure
*(f) y = t + Ce−t ; ẏ = −y + t + 1. (Here C is an arbitrary constant.) that −x/3y is well defined.
In the last two parts of Exercise 1.2 you were asked to verify that
y = t + e−t and y = t + Ce−t
are solutions of the differential equation ẏ = −y + t + 1, where in the second
case C is an arbitrary constant. In saying that C is arbitrary, we mean that
it can assume any real value. Whatever number is chosen for C, the corre-
sponding expression for y(t) is always a solution of the differential equation.
The particular function y = t + e−t is just one example of such a solution,
obtained by choosing C = 1.
This demonstrates that solutions of a differential equation can exist in pro-
fusion; as a result, we need terms to distinguish between the totality of all
these solutions for a given equation and the individual solutions that are
completely specified.
Definitions
(a) The general solution of a differential equation is the collection
of all possible solutions of that equation.
(b) A particular solution of a differential equation is a single solu-
tion of the equation, and consists of a solution function whose rule
contains no arbitrary constant.
67
Unit 2 First-order differential equations
*Exercise 1.3
(a) Verify that, for any value of the constant C, the function y = C − 13 e−3x
is a solution of the differential equation
dy
= e−3x .
dx
(b) Verify that, for any value of the constant C, the function u = Cet − t − 1
is a solution of the differential equation
u̇ = t + u.
(c) Verify that, for any value of the constant C, the function
CM ekt
P =
1 + Cekt
is a solution of Equation (1.4) on page 65.
As you have seen, there are many solutions of a differential equation. How-
ever, a particular solution of the equation, representing a definite relation-
ship between the variables involved, is often what is needed. This is achieved
by using a further piece of information in addition to the differential equa-
tion. Often the extra information takes the form of a pair of values for the
independent and dependent variables.
For example, in the case of a population model, it would be natural to
specify the starting population, P0 say, and to start measuring time from
t = 0. We could then write
P = P0 when t = 0, or, equivalently, P (0) = P0 .
A requirement of this type is called an initial condition.
Definitions
(a) An initial condition associated with the differential equation
dy
= f (x, y)
dx
specifies that the dependent variable y takes some value y0 when
the independent variable x takes some value x0 . This is written
either as
y = y0 when x = x0
or as
y(x0 ) = y0 .
The numbers x0 and y0 are referred to as initial values.
(b) The combination of a first-order differential equation and an initial
condition is called an initial-value problem.
The word ‘initial’ in these definitions arises from those (frequent) cases in
which the independent variable represents time. In such cases, the differen-
tial equation describes how the system being modelled behaves once started,
while the initial condition specifies the configuration in which the system is
started off. In fact, if the initial condition is y(x0 ) = y0 , then we are often
interested in solving the corresponding initial-value problem for x > x0 . If x represents time, then
x > x0 is ‘the future’ after the
system has been started off.
68
Section 1 Some basics
Example 1.1
Using the result given in Exercise 1.3(b), solve the initial-value problem
dy
= x + y, y(0) = 1.
dx
Solution
From Exercise 1.3(b), on replacing the variables t, u by x, y, respectively,
the general solution of the differential equation here is
y = Cex − x − 1.
The initial condition says that y = 1 when x = 0, and on feeding these values
into the general solution we find that
1 = Ce0 − 0 − 1 = C − 1.
Hence C = 2, and the particular solution of the differential equation that
solves the initial-value problem is
y = 2ex − x − 1.
Exercise 1.4
The size of a population (measured in hundreds of thousands) is modelled
by the logistic equation
dP P
= kP 1− , P (0) = 1,
dt M
where k = 0.15 and M = 10.
*(a) Use your answer to Exercise 1.3(c) to find a solution to this initial-value
problem.
(b) Can you predict the long-term behaviour of the population size from
your answer?
y
Finally in this subsection, note that one needs to keep an eye on the domain 4
of the function defining the differential equation. ‘Gaps’ in the domain 2
usually show up as some form of restriction on the nature of a solution –10 –5 5 10 x
–2
curve. For example, consider the differential equation
–4
dy 1 –6
= . (1.5)
dx x
It turns out that there are two distinct families of solutions of this equation, Figure 1.2
given by y = ln x + C (if x > 0) and y = ln(−x) + C (if x < 0). These two Since |x| = −x if x < 0, you
families of solutions are illustrated in Figure 1.2. Notice that the right-hand can see that this agrees with
side of Equation (1.5) is not defined at x = 0, and that there is no solution what we know from Unit 1,
that crosses the y-axis. namely that
1
This unit deals with numerical and analytic (symbolic) methods of solving dx = ln |x|.
x
differential equations. However, before we can discuss numerical methods,
we need to know something about the way that errors and accuracy are
described: this is the topic of the next subsection.
69
Unit 2 First-order differential equations
Exercise 1.5
For each of the functions f given below, use your calculator to find f (3.142). You are welcome to use your
Then estimate f (π) more accurately, using π = 3.141 592 6 (to seven decimal computer rather than a
places). Hence estimate the error in using f (3.142) as an approximation calculator if you prefer.
to f (π).
(a) f (x) = x3 (b) f (x) = e10x
In Exercise 1.5(b), you saw that an absolute error of less than 0.0005 in
the value of π results in an absolute error of the order of 1.8 × 1011 in
the calculated value of f (π), for f (x) = e10x . For this function f , errors
are severely magnified! However, the situation is not quite so bad as this
statement might suggest. The calculated value of f (π) is not completely
unreliable — it is accurate to two significant figures. The value of f (π) is
itself very large (4.4 × 1013 ), so an error of 1.8 × 1011 is not quite so serious
as it sounds. We often want a measure of ‘error’ that takes into account the
size of the error relative to the size of the number being calculated.
70
Section 2 Direction fields and Euler’s method
Exercise 1.7
(a) Verify that, for any value of the constant C, the function
π
x = tan(t + C) − 2 < t + C < π2
is a solution of the differential equation
ẋ = 1 + x2 .
(b) Using the result of part (a), find the solution of the initial-value problem
ẋ = 1 + x2 , x π4 = 1.
71
Unit 2 First-order differential equations
Definition
A direction field associates a unique direction to each point within
a specified region of the (x, y)-plane. The direction corresponding to
the point (x, y) may be thought of as the slope of a short line segment
through the point.
In particular, the direction field for the differential equation
dy
= f (x, y)
dx
associates the direction f (x, y) with the point (x, y).
72
Section 2 Direction fields and Euler’s method
0 1 2 x
–2 –1
Figure 2.2
From this diagram, we can gain a good qualitative impression of how the
graphs of particular solutions of Equation (2.3) behave. The aim is mentally
to sketch curves on the diagram in such a way that the tangents to the curves
are always parallel to the local slopes of the direction field. For example,
starting from the point (−1, 0.5) (that is, taking the initial condition to be
y(−1) = 0.5), we expect the solution graph initially to fall as we move to
the right. The magnitude of the negative slope decreases, however, and
eventually reaches zero, after which the slope becomes positive and then
increases. On this basis, we could sketch the graph of the corresponding
particular solution and obtain something like the curve shown in Figure 2.3.
(– 1 , 0 . 5)
0 1 2 x
–2 –1
Figure 2.3
73
Unit 2 First-order differential equations
Exercise 2.1
(a) Part of the direction field for the logistic equation
dP P
=P 1− This is Equation (2.2) with
dt 1000 k = 1 and M = 1000.
is sketched in Figure 2.4. Using this diagram, sketch the solution curves
that pass through the following points:
(0, 1500), (0, 1000), (0, 100), (0, 0), (0, −100).
From your results, describe the graphs of particular solutions of the
differential equation.
P
1500
1000
500
0
2 4 6 8 10 t
–500
Figure 2.4
(b) What does your answer to part (a) tell you about the predicted be-
haviour of a population whose size P (t) at time t is modelled by this
logistic equation?
74
Section 2 Direction fields and Euler’s method
slope = f ( x 0, y0)
P1 ( x1, Y1)
P0 Y 1 – y0
(x 0, y0)
h
0 x0 x1 x
Figure 2.5
The idea is that the point P1 , whose coordinates have been denoted by
(x1 , Y1 ), provides an approximate value Y1 of the solution function y(x) at The reason for using Y1 here,
x = x1 . Now, unless the solution function follows a straight line as x moves rather than y1 , will be
from x0 to x1 , Y1 is unlikely to give the exact value of y(x1 ). However, the explained shortly.
hope is that, because we headed off from x0 along the correct slope, as given
by the direction field, Y1 will be reasonably close to the exact value. Before
worrying about accuracy, let us continue with the construction of the points
in our sequence.
The next thing that we need to do, before proceeding to the second step
in the construction process, is determine formulae for x1 and Y1 in terms
of x0 , y0 , h and f (x0 , y0 ). By the construction described, as the point P1
is reached from P0 by taking a step to the right of horizontal length h, we
have
x1 = x0 + h. (2.5)
We can express Y1 in terms of other quantities by equating two expressions
for the slope of the line segment P0 P1 ,
Y 1 − y0
= f (x0 , y0 ),
h
and then rearranging to give
Y1 = y0 + hf (x0 , y0 ). (2.6)
This completes the first step, and we now take a second step to the right.
75
Unit 2 First-order differential equations
The direction of the second step is along the line with slope defined by the
direction field at the point P1 , namely f (x1 , Y1 ). The second step moves us
from P1 through a further horizontal distance h to the right, to the point
labelled P2 , as illustrated in Figure 2.6. This point provides an approximate
value Y2 of the solution function y(x) at x = x2 .
y
slope = f ( x1, Y1) P2 ( x2, Y2)
Y 2 – Y1
( x1, Y1) P1
h
P0
0 x0 x1 x2 x
Figure 2.6
y
P i+1 ( xi+1 , Y i+1 )
slope = f ( x i , Y i )
Y i+1 – Yi
Pi
(xi , Yi )
h
0 xi xi+1 x
Figure 2.7
76
Section 2 Direction fields and Euler’s method
y = y(x) P6
P5
P4
P3
P2
P1
P0
0 x0 x1 x2 x3 x4 x5 x6 x
Figure 2.8
77
Unit 2 First-order differential equations
Nevertheless, the formulae (2.9) and (2.10) provide a method for finding ap-
proximate solutions to the initial-value problem (2.4), in terms of numerical The accuracy of such
estimates Y1 , Y2 , Y3 , . . . at the respective domain values x1 , x2 , x3 , . . .. This approximate solutions, and
is called Euler’s method, and is summarized below. ways of improving accuracy,
will be considered shortly.
Example 2.1
Consider the initial-value problem
dy
= x + y, y(0) = 1.
dx
Use Euler’s method, with step size h = 0.2, to obtain an approximation to
y(1).
Solution
We have x0 = 0, Y0 = y0 = 1, and f (xi , Yi ) = xi + Yi . The step size is given
as h = 0.2. Equation (2.9) with i = 0 gives
x1 = x0 + h = 0 + 0.2 = 0.2,
and Equation (2.10) with i = 0 gives
Y1 = Y0 + hf (x0 , Y0 ) = 1 + 0.2 × (0 + 1) = 1.2.
For the second step, we have (from Equation (2.9) with i = 1)
x2 = x1 + h = 0.2 + 0.2 = 0.4,
and (from Equation (2.10) with i = 1)
Y2 = Y1 + hf (x1 , Y1 ) = 1.2 + 0.2 × (0.2 + 1.2) = 1.48.
If more than a couple of steps of such a calculation have to be computed by
hand, then it is a good idea to lay out the calculation as a table. In this
case, by continuing as above and putting i in turn equal to 2, 3 and 4, we
obtain Table 2.1.
78
Section 2 Direction fields and Euler’s method
Table 2.1
i xi Yi f (xi , Yi ) = xi + Yi Yi+1 = Yi + hf (xi , Yi )
0 0 1 1 1.2 After each value of Yi+1 has
1 0.2 1.2 1.4 1.48 been calculated from the
2 0.4 1.48 1.88 1.856 formula and entered in the
3 0.6 1.856 2.456 2.347 2 last column, it is transferred
to the Yi column in the next
4 0.8 2.347 2 3.147 2 2.976 64
row.
5 1.0 2.976 64
So, at x = 1, Euler’s method with step size h = 0.2 gives the approximation
y(1) 2.976 64.
*Exercise 2.2
Use Euler’s method, with step size h = 0.2, to obtain an approximation to
y(1) for the initial-value problem
dy
= y, y(0) = 1.
dx
79
Unit 2 First-order differential equations
y
estimate with
h = 0.4
with h = 0.2
y = y (x)
with h = 0.1
exact solution
at x = 0.4
Figure 2.9
In fact, it can be shown that the accuracy of Euler’s method does indeed
usually improve when we take a smaller step size.
To demonstrate this, consider the initial-value problem from Exercise 2.2.
This has the exact solution y = ex (as you can verify), and its value at
x = 1 is y(1) = e = 2.718 282, to six decimal places. In Exercise 2.2 you
showed that with a step size h = 0.2, Euler’s method gives the approximation
2.488 32 for y(1). Table 2.2 shows the corresponding results (to six decimal
places) obtained when we apply Euler’s method to this same initial-value
problem but with successively smaller step sizes h. In Exercise 2.2, where
h = 0.2, the value of y(1) was
approximated by Y5 . From
Table 2.2 the column for ‘number of
steps’ in Table 2.2, you can
h Approximation Absolute Number of see that y(1) is approximated
to y(1) error steps by:
0.1 2.593 742 0.124 539 10 Y10 when h = 0.1;
Y100 when h = 0.01;
0.01 2.704 814 0.013 468 100
Y1000 when h = 0.001;
0.001 2.716 924 0.001 358 1000
Y10000 when h = 0.0001.
0.0001 2.718 146 0.000 136 10000
As expected, the absolute errors in the third column of the table become
progressively smaller as h is reduced.
Looking more carefully at these absolute errors, we notice that they seem
to tend towards a sequence in which each number is a tenth of the previous
one. Since each value of h in the table is a tenth of the previous one, this
suggests that:
absolute error is approximately proportional to step size h.
This turns out to be a general property of Euler’s method, for sufficiently You will see this property
small values of the step size. So, not only do we know that accuracy can be stated formally in Unit 26,
improved by decreasing the step size h, but this general property also tells us where you will also see how
the property can be used to
that, by making h small enough, the absolute error in an approximation can estimate the size of absolute
errors.
80
Section 2 Direction fields and Euler’s method
*Exercise 2.3
Suppose that when Euler’s method is applied to the problem in Exercise 2.2,
the absolute error in approximating y(1) is proportional to the step size h,
for sufficiently small h.
Use the last row of Table 2.2 to estimate the constant of proportionality, k
say, and hence to estimate the step size required to compute y(1) correct to
five decimal places (that is, so that the absolute error is less than 5 × 10−6 ).
A few words of caution are necessary at this point. Although the absolute
error can be made as small as we please by making the step size h sufficiently
small, this is valid only if the arithmetic is performed using sufficient decimal
places. Where a calculator or computer is involved, the number of decimal
places that can be used is limited, and as a result rounding errors may
be introduced into the calculations. After a certain point, any increase in
accuracy brought about by reducing the size of h may be swamped by these
rounding errors.
Moreover, rounding errors are not the only problem. Before concluding that
h should always be chosen to be very small, we must also consider the cost
of this additional accuracy. Now, by cost is meant the effort involved, which
can be measured in a variety of ways; commonly for iterative methods (such
as Euler’s method) it is measured by the number of steps taken. In general
for numerical methods, the greater the accuracy required, the greater the
cost. To illustrate this, look back at Table 2.2. The last column of the
table shows how the number of steps required for the calculation goes up
in inverse proportion to the step size: to move from x = 0 to x = 1, it In general, to move from a
takes 10 steps with step size h = 0.1, 100 steps with step size h = 0.01, to b (where b > a) with step
and so on. Since, for sufficiently small h, the error in Euler’s method is size h takes (b − a)/h steps.
approximately proportional to the step size, it follows that for this method
a ten-fold improvement in accuracy is paid for by a ten-fold increase in the
number of steps required.
So, for Euler’s method and similar methods, the choice of step size has to be
based on a compromise between the two opposing requirements of accuracy
and cost. Methods for choosing the step size are discussed in Unit 26, which
also introduces other numerical methods for solving initial-value problems
that are considerably more efficient than Euler’s method. In fact, Euler’s Greater efficiency means that
method is not suitable for high-accuracy work. Its virtue lies rather in its the same or better numerical
simplicity and its clear illustration of the basic principles of how differential accuracy is achieved with
fewer numerical
equations may be solved numerically. computations.
In any practical case, calculations of the type described in this subsection
are ideally suited to being performed on a computer, as you will see in the
next subsection.
81
Unit 2 First-order differential equations
*Activity 2.2
Use Euler’s method to obtain approximations to y(1) for the initial-value
problem
dy
= x + y, y(0) = 0,
dx
using step sizes h = 1, 0.5, 0.2, 0.1, 0.01, 0.001, 0.0001, in turn. In each case,
plot the graph of the solution on an appropriate direction field diagram, and
observe how each graph compares with the previous one.
*Activity 2.3
Euler’s method is to be used to estimate the value of the function y(x) at
x = 0.1, 0.2, . . . , 1 for the initial-value problem
dy
= x + y, y(0) = 0.
dx
(a) Use the step sizes h = 0.1, 0.01, 0.001, 0.0001, in turn. Compare the
results in each case with the exact solution y = ex − x − 1, and comment
on how the size of the absolute error varies with h.
(b) Compare your estimates for the step sizes h = 0.1 and h = 0.01. Then
compare your estimates for all four step sizes. What can you conclude?
End-of-section Exercise
Exercise 2.4
(a) Without plotting the direction field, say what you can about the slopes
defined by the differential equation
dy
= f (x, y) = y + x2 .
dx
(b) Verify that your conclusions are consistent with the direction field dia-
gram in Figure 2.10.
82
Section 3 Finding analytic solutions
–2 –1 0 1 2 x
–1
–2
Figure 2.10
(c) On the basis of the direction field, what can be said about the graphs
of solutions of the differential equation?
(d) Write down the formulae required in order to apply Euler’s method to
the initial-value problem
dy
= y + x2 , y(−1) = −0.2,
dx
using a step size h = 0.1.
83
Unit 2 First-order differential equations
f (x) dx = F (x) + C.
This means that the general solution of Equation (3.2) can be written down
directly as an indefinite integral; and, if the integration can be performed,
then the equation is solved.
Once the general solution has been found, it is possible to single out a
particular solution by specifying a value for the constant C. As before, this
value may be found by applying an initial condition.
Example 3.1
(a) Find the general solution of the differential equation
dy
= e−3x .
dx
84
Section 3 Finding analytic solutions
(b) Find the particular solution of this differential equation that satisfies
the initial condition y(0) = 53 .
Solution
(a) On applying direct integration, we obtain the general solution
y= e−3x dx = − 13 e−3x + C,
Procedure 3.1 uses x for the independent variable and y for the dependent
variable. As usual, you should be prepared to translate this into situations
where other symbols are used for the variables. But remember that the
method of direct integration applies solely to first-order differential equations
for which the derivative is equal to a function of the independent variable
alone. Thus direct integration can be applied, for example, to the differential
equation
dx
= cos t,
dt
to give the general solution
x= cos t dt = sin t + C,
Exercise 3.1
Solve each of the following initial-value problems.
dy
(a) = 6x, y(1) = 5.
dx
dv
(b) = e4u , v(0) = 2.
du
Remember that ẏ stands for
*(c) ẏ = 5 sin 2t, y(0) = 0. dy/dt, where t denotes time.
85
Unit 2 First-order differential equations
Exercise 3.2
Find the general solution of each of the following differential equations.
dy
(a) = xe−2x
dx
t
*(b) ṗ = (Hint : For the integral, try the substitution u = 1 + t2 .)
1 + t2
The answer to Exercise 3.2(b) can be generalized to any differential equation This is a simple extension of
of the form the result from Unit 1 that
f (x)
dy f (x) dx = ln |f (x)| + C,
=k (f (x) = 0), f (x)
dx f (x) for f (x) = 0.
where k is a constant, to give the general solution
y = k ln |f (x)| + C,
where C is an arbitrary constant.
86
Section 3 Finding analytic solutions
where C is an arbitrary constant. Making y the subject of the equation, we Note that one arbitrary
obtain the solution expression constant suffices.
y = tan( 12 x2 + C).
The approach just demonstrated applies more widely. In principle, it works
for any differential equation of the form
dy
= g(x)h(y). (3.3)
dx
On dividing this equation through by h(y) (for all values of y other than
those where h(y) = 0), we obtain
1 dy
= g(x).
h(y) dx
Integration with respect to x on both sides gives
1 dy
dx = g(x) dx,
h(y) dx
and, on applying the rule for integration by substitution to the left-hand
side, this becomes
1
dy = g(x) dx. (3.7) This is the form that you
h(y) need to remember! Note that
If the two integrals can be evaluated at this stage, then we reach an equation you can obtain it ‘informally’
that relates x and y and features an arbitrary constant. This equation is by dividing Equation (3.3) by
the general solution of the differential equation (for values of y other than h(y), ‘multiplying through
by dx’, and then adding the
those where h(y) = 0); but usually y will not be the subject of this equation. two integral signs.
It is a form of the general solution called an implicit (general) solution of
the differential equation. (An example of an implicit solution is provided
by Equation (3.6).) Usually, the final aim is to make y the subject of the
equation, if possible — that is, to manipulate the equation into the form
y = function of x.
This is called the explicit (general) solution of the differential equation.
In either case (implicit or explicit), a particular solution may be obtained
from the general solution as before, by applying an initial condition.
The method just described for solving differential equations of the form (3.3)
is called the method of separation of variables since, in Equation (3.7), we
have separated the variables to either side of the equation, with only the
dependent variable appearing on the left and only the independent variable
on the right. The method is summarized below.
87
Unit 2 First-order differential equations
The separation of variables method is useful, but there are some difficulties
with it. First, it may not be possible to perform the necessary integrations.
Second, the general solution obtained is restricted to those values of y such
that h(y) = 0. Third, it may not be possible to perform the necessary
manipulations to obtain an explicit solution.
Of these difficulties, the first can be overcome by use of a numerical method,
such as Euler’s method. The second will be discussed shortly. The third
will usually also need numerical techniques.
It is necessary to be careful about the domain or image set of the solution
obtained, as the following example illustrates.
Example 3.2
(a) Find the general solution of the differential equation
dy x
=− (y > 0).
dx 3y
(b) Find the particular solution that satisfies the initial condition y(0) = 3.
Solution
(a) The equation is of the form
dy
= g(x)h(y),
dx
where the obvious choices for g and h are
g(x) = −x and h(y) = 1/(3y). Notice that since y > 0,
h(y) is never zero.
We now apply Procedure 3.2. On dividing through by h(y) = 1/(3y)
(that is, multiplying through by 3y) and integrating with respect to x,
the differential equation becomes
88
Section 3 Finding analytic solutions
*Exercise 3.3
A mass m(t) of a uranium isotope, which is present in an object at time t,
declines over time due to radioactive decay. Its behaviour is modelled by
the differential equation
dm
= −λm (m > 0), This model can be applied to
dt other radioactive substances
where the decay constant λ is a positive constant characteristic of the ura- by selecting the appropriate
nium isotope. value of the parameter λ.
(a) Find the general solution of this differential equation.
(b) Find the particular solution for which the initial amount of uranium
present (at time t = 0) is m0 .
The condition m > 0 in Exercise 3.3 arose from the modelling context. This
condition enabled us to find the general solution without needing to worry
about dividing by zero at Step (a) of the separation of variables method
(and hence without needing to restrict the image set further). Suppose we
were to forget the modelling context — that is, suppose we were to remove
the restriction m > 0. How does this affect the solution process? And how
do we cope with the case where m = 0? These questions are answered in the
following example where, to emphasize the absence of the previous modelling
context, the variables used are x and y.
Example 3.3
Find the general solution of the differential equation
dy
= −λy,
dx
where λ is a non-zero constant.
Solution
To apply the separation of variables method, we need to exclude the cases
where y = 0. So, for y = 0, on dividing through by y, integrating with
respect to x, and using the rule for integration by substitution on the left-
hand side, we obtain
1
dy = (−λ) dx. (3.8)
y
Integrating, we obtain You saw in Unit 1 that
1
ln |y| = −λx + B, dy = ln |y| (y =
0).
y
where B is an arbitrary constant. Taking exponentials gives
|y| = e−λx+B
or, removing the modulus sign,
y = ±e−λx+B = ±eB e−λx = Ce−λx ,
where C = ±eB is a non-zero but otherwise arbitrary constant.
89
Unit 2 First-order differential equations
This is not quite the general solution, as we have to consider what happens
when y = 0. Now, looking at the above solution, it is natural to ask what
happens when C = 0. This gives the zero function, y = 0 for all x, and
inspection of the differential equation shows that this is a particular solution.
So we now have the general solution
y = Ce−λx ,
where C is an arbitrary constant. (Positive C corresponds to y > 0, negative
C to y < 0, and C = 0 to the particular solution y = 0.)
End-of-section Exercises
Exercise 3.6
Find the general solution of each of the following differential equations,
where a is a non-zero constant.
dy 1
(a) = (u = a)
du u−a
dy 1
(b) = (x = 0, x = 1/a)
dx x(1 − ax)
Exercise 3.7
Find the general solution of each of the following differential equations.
(a) u = xu (b) ẋ = 1 + x2
Exercise 3.8
(a) Solve the initial-value problem The differential equation here
is the logistic equation
dP P
= kP 1 − , P (0) = P0 (where P0 > 0), (Equation (2.2)) which, as
dt M was pointed out earlier, may
where k and M are positive constants. be used as a model for the
size P (t) of a population at
(Hint : For the integral involving P , the solution for Exercise 3.6(b) time t. The direction field of
should be of use.) this equation in a specific case
(b) Describe what happens to the solution P (t) as t becomes large. was examined in Exercise 2.1
(page 74).
90
Section 4 Solving linear differential equations
Definitions
(a) A first-order differential equation for y = y(x) is linear if it can be
expressed in the form
dy
+ g(x)y = h(x), (4.1) This differential equation can
dx be written in the general form
where g(x) and h(x) are given functions. dy
= f (x, y)
(b) A linear first-order differential equation is said to be homoge- dx
that we have been using by
neous if h(x) = 0 for all x, and inhomogeneous or non- putting
homogeneous otherwise. f (x, y) = −g(x)y + h(x).
*Exercise 4.1
Decide whether or not each of the following first-order differential equations
is linear.
dy dy dz
(a) + x3 y = x5 (b) = x sin x (c) = −3z 1/2
dx dx dt
dy dy
(d) ẏ + y 2 = t (e) x + y = y2 (f) (1 + x2 ) + 2xy = 3x2
dx dx
91
Unit 2 First-order differential equations
Theorem 4.1
If the functions g(x) and h(x) are continuous throughout an interval
(a, b) and x0 belongs to this interval, then the initial-value problem This includes the possibility
that either a = −∞ or b = ∞,
dy so the interval might be all of
+ g(x)y = h(x), y(x0 ) = y0 ,
dx the real line.
has a unique solution throughout the interval.
This is a very powerful result, since it means that once you have found a
solution in a particular interval, that solution will be the only one.
There is a particularly useful technique for solving linear differential equa-
tions, to which we turn next.
(1 + x2 )y = 3x2 dx = x3 + C,
92
Section 4 Solving linear differential equations
This solution was arrived at by noting that the left-hand side of Equa-
tion (4.2) is of the form
dy dp
p + y, (4.4)
dx dx
where p = 1 + x2 , and that this form can be re-expressed, using the Product
Rule, as
d
(py).
dx
Linear differential equations need not come in this convenient form. For
example, the left-hand side of the equation
dy 2x 3x2
+ y= (4.5)
dx 1 + x2 1 + x2
is not of the form (4.4). However, Equation (4.2) can be obtained from
Equation (4.5) on multiplying through by p = 1 + x2 . For this reason,
p = 1 + x2 may be called an integrating factor for Equation (4.5): it is
the factor by which Equation (4.5) needs to be multiplied in order that the
resulting differential equation has a left-hand side of the form (4.4), enabling
direct integration to be performed.
This leaves the question of how such an integrating factor can be found,
starting from Equation (4.5). The answer comes from writing down the two
properties that such a function p = p(x) must satisfy, as follows.
• Multiplying Equation (4.5) by p gives, on the left-hand side,
dy 2x
p +p y.
dx 1 + x2
• The left-hand side must be of the form
dy dp
p + y. (4.4)
dx dx
Comparison of these two expressions shows that p must itself be a particular
solution of the differential equation
dp 2x
= p. (4.6)
dx 1 + x2
This is a homogeneous linear first-order differential equation, and we can
solve it by separation of variables. Indeed, following Procedure 3.2, the
0)
equation becomes (for p =
dp 2x
= dx.
p 1 + x2
Performing the left-hand integral gives
2x
ln |p| = dx,
1 + x2
so
2x
|p| = exp dx . (4.7)
1 + x2
93
Unit 2 First-order differential equations
94
Section 4 Solving linear differential equations
Example 4.1
Use the integrating factor method to find the general solution of each of the The first example cannot be
following differential equations. solved by separation of
variables. The latter two can,
dy 2xy dy y−1 dy 2y
(a) =x− 2 (b) = (x > 0) (c) = as you saw in Exercise 3.4.
dx x +1 dx x dx 1 + x2 You can compare these
answers with those obtained
Solution earlier.
(a) On rearranging the differential equation as
dy 2xy
+ 2 = x,
dx x + 1
we see that it is in the form of Equation (4.1) with
2x
g(x) = and h(x) = x.
+1x2
The integrating factor (from Equation (4.9)) is therefore
2x
p = exp dx
x2+1
= exp(ln |x2 + 1|)
= exp(ln(x2 + 1)) (since 1 + x2 > 0)
= x2 + 1. Checking, we see that
dp
= 2x = g(x)p(x).
dx
95
Unit 2 First-order differential equations
= (x3 + x) dx
= 14 x4 + 12 x2 + C,
where C is an arbitrary constant. Finally, to obtain an explicit solution
we divide by x2 + 1 to obtain
x4 + 2x2 + 4C
y= .
4(x2 + 1)
(b) On rearranging the differential equation as
dy 1 1
− y=− ,
dx x x
we see that it is in the form of Equation (4.1) with g(x) = h(x) = −1/x.
The integrating factor (from Equation (4.9)) is therefore
1
p = exp − dx
x
= exp(− ln x) (since x > 0) Recall that a ln x = ln(xa )
1 and hence, in particular,
= exp ln − ln x = ln(x−1 ) = ln(1/x).
x
1
= . Checking, we see that
x
dp 1
Multiplying through the equation by p(x) = 1/x gives = − 2 = g(x)p(x).
dx x
1 dy 1 1
− y = − 2,
x dx x2 x
and the differential equation becomes
d 1 1
y =− .
dx x x2
Integration then gives
y 1
= − dx
x x2
1
= + C,
x
where C is an arbitrary constant. The general solution is therefore
y = 1 + Cx,
where C is an arbitrary constant.
96
Section 4 Solving linear differential equations
(c) In order to put the given differential equation into the form (4.1), we
need to bring the term in y to the left-hand side to obtain
dy 2
− y = 0. (4.12)
dx 1 + x2
Hence, in this case, we have g(x) = −2/(1 + x2 ) and h(x) = 0. The The equation is homogeneous.
integrating factor is
2
p = exp − dx = exp(−2 arctan x) = e−2 arctan x . Checking, we see that
1 + x2
dp −2e−2 arctan x
Multiplying through by the integrating factor gives =
dx 1 + x2
dy 2y −2 arctan x 2
e−2 arctan x− e = 0. = e−2 arctan x −
dx 1 + x2 1 + x2
Thus the differential equation can be rewritten as = p(x)g(x).
d −2 arctan x
e y = 0.
dx
It follows, on integrating, that
e−2 arctan x y = C, or, equivalently, y = Ce2 arctan x ,
where C is an arbitrary constant. This is the general solution.
*Exercise 4.2
Find the general solution of each of the following differential equations. For part (b), see
dy dy Examples 1.1 and 2.1.
(a) − y = ex sin x (b) =y+x A direction field diagram is
dx dx shown in Figure 2.2.
Exercise 4.3
Use the integrating factor method to solve each of the following initial-value You saw the differential
problems. equation in part (a) in
Exercise 3.7(a), where you
(a) u = xu, u(0) = 2. solved it using separation of
(b) tẏ + 2y = t2 , y(1) = 1. variables.
End-of-section Exercises
Exercise 4.4
Which method would you use to try to solve each of the following linear
first-order differential equations?
dy dy
(a) + x3 y = x5 (b) = x sin x
dx dx
dv dy
(c) + 5v = 0 (d) (1 + x2 ) + 2xy = 1 + x2
du dx
Exercise 4.5
Solve each of the following initial-value problems. The differential equation in
part (a) is equivalent to that
(a) ẏ + y = t + 1, y(1) = 0. considered in parts (e) and (f)
(b) e3t ẏ =1 − e3t y, y(0) = 3. of Exercise 1.2.
97
Unit 2 First-order differential equations
Exercise 4.6
Find the general solution of each of the following differential equations.
dy
(a) x − 3y = x (x > 0)
dx
dv
(b) + 4v = 3 cos 2t
dt
(Hint : If a and b are non-zero constants, then
eat
eat cos bt dt = (a cos bt + b sin bt) + C,
a2 + b 2
where C is an arbitrary constant.)
*Activity 5.2
Use separation of variables to find the general solution of each of the follow-
ing differential equations.
dy dy 2y dy
(a) = −λy (b) = 2 (c) = 1 + y2
dx dx x +1 dx
Compare your solutions with those obtained in Example 3.3 and Exer-
cises 3.4(b) and 3.7(b), respectively.
*Activity 5.3
Use the integrating factor method to solve the following initial-value prob-
lems.
dy
(a) = x + y, y(0) = 0.
dx
dy
(b) x + 2y = x2 , y(1) = 1.
dx
Compare your solutions with those obtained in Exercises 4.2(b) and 4.3(b),
respectively.
98
Outcomes
*Activity 5.4
Use the integrating factor method to find the general solution of each of the
following differential equations.
dy dy
(a) x − 3y = x (b) + 4y = 3 cos 2x
dx dx
Compare your solutions with those obtained in Exercises 4.6(a) and 4.6(b),
respectively.
Outcomes
After studying this unit you should be able to:
• understand and use the basic terminology relating to differential equa-
tions and their solutions;
• check by substitution whether a given function is a solution of a given
first-order differential equation or initial-value problem;
• find from the general solution of a first-order differential equation the
particular solution that satisfies a given initial condition;
• appreciate the difficulties with domains and image sets for the solution
of some differential equations;
• deduce the qualitative behaviour of solutions from consideration of a
first-order differential equation itself, as visualized from its direction
field;
• set up the formulae required by Euler’s method for solving an initial-
value problem, carry out a few steps of the method by hand, and use
the computer to deal with large numbers of steps;
• recognize when a first-order differential equation is soluble by direct
integration, and carry out that integration when appropriate, by hand
in simple cases and otherwise on the computer;
• recognize when a first-order differential equation is separable, and ap-
ply the method of separation of variables by hand in simple cases and
otherwise on the computer;
• recognize when a first-order differential equation is linear, and solve such
an equation by the integrating factor method, by hand in simple cases
and otherwise on the computer.
99
Unit 2 First-order differential equations
100
Solutions to the exercises
1.5 Calculator results are given to eight significant fig- The graphs of solutions through a starting point above
ures. (Different calculators may give slightly different the line P = 1000 appear to decrease, but at a slower
results.) and slower rate, tending from above towards the limit
P = 1000 as t increases.
(a) f (3.142) = 31.018 339,
The graphs of solutions through starting points in the
f (3.141 592 6) = 31.006 275.
region 0 < P < 1000 are increasing, with slope grow-
So we have ing before the level P = 500 is reached and declining
error = f (3.142) − f (π) thereafter. For large values of t, these graphs tend from
f (3.142) − f (3.141 592 6) below towards the limit P = 1000.
0.012. For a starting point in the region P < 0, the graphs de-
crease without limit and with steeper and steeper slope.
(b) f (3.142) = 4.421 123 2 × 1013 ,
These various cases are illustrated by typical graphs in
f (3.141 592 6) = 4.403 148 2 × 1013 . the figure below.
So we have
error = f (3.142) − f (π) P
f (3.142) − f (3.141 592 6)
1500
1.8 × 1011 .
1000
1.6 (a) If y = arcsin x + C (−1 < x < 1), then differ-
entiating gives
dy 1 500
=√ ,
dx 1 − x2
so y satisfies the given differential equation.
0
(b) The initial condition is y( 12 ) = π2 , i.e. y = π2 when 2 4 6 8 x
x = 12 . On substituting these values into the solution
from part (a), we have –500
π 1 π
2 = arcsin 2 +C = 6 + C.
π
This gives C = 3, so the solution of the initial-value (b) If the differential equation is considered as a model
problem is of population behaviour, then the region P < 0 must be
π excluded. The analysis above leads to the following pre-
y = arcsin x + 3 (−1 < x < 1).
dictions for the population.
1.7 (a) If x = tan(t + C) (− π2 < t + C < π2 ), then dif- • If the population is zero at the start, then it remains
ferentiating gives zero.
dx • If the population size starts at 1000, then it remains
ẋ = = sec2 (t + C) = 1 + tan2 (t + C) = 1 + x2 , fixed at this level.
dt
so x satisfies the given differential equation. • If the population starts at a level higher than 1000,
then it declines (more and more gradually) towards
(b) The initial condition is x π4 = 1, i.e. x = 1 when
t = π4 . On substituting these values into the solution 1000.
from part (a), we have • If the population starts at a level below 1000 (but
above 0), then it increases and eventually tends
1 = tan π4 + C .
gradually towards 1000.
This gives C = arctan 1 − π4 = 0, so the solution of the
initial-value problem is
π
x = tan t − 2 < t < π2 . 2.2 For the initial-value problem
dy
= y, y(0) = 1,
dx
we have x0 = 0, Y0 = y0 = 1 and f (xi , Yi ) = Yi . The
Section 2 step size is given as h = 0.2. Equation (2.9) with i = 0
gives
x1 = x0 + h = 0 + 0.2 = 0.2,
2.1 (a) The slope is shown to be zero at all points on
the horizontal lines P = 0 and P = 1000, so these corre- and Equation (2.10) with i = 0 gives
spond to constant solutions of the differential equation. Y1 = Y0 + hf (x0 , Y0 ) = 1 + 0.2 × 1 = 1.2.
(As pointed out earlier in the text, these two solutions
can also be spotted directly from the form of the differ-
ential equation.)
101
Unit 2 First-order differential equations
–1
2.3 Since we are told that, for sufficiently small h, the
absolute error is proportional to the step size h, we can
deduce from the last row of Table 2.2 that there exists
a constant k such that –2
0.000 136 = 0.0001k,
so k = 1.36. In order to determine y(1) correct to five (c) It appears from the direction field that there are
decimal places, h must be such that several types of solution. Any solution whose graph
1.36h < 5 × 10−6 cuts the y-axis above the origin has positive slope at all
or points. The solution graph that passes through the ori-
5 × 10−6 gin has zero slope there, but positive slope everywhere
h< 3.7 × 10−6 . else. Any solution graph that cuts the y-axis below
1.36
So a suitable choice of h would be 10−6 = 0.000 001. the origin has a maximum (where it meets y = −x2 for
x < 0). Some of these graphs also have a minimum
(In fact, using this value of h gives an approximation to
(where they meet y = −x2 for x > 0). Others have no
y(1) of 2.718 280, which is correct to 5 decimal places.)
minimum (though this is not clear from the diagram
given). A solution graph of each type is sketched be-
2.4 (a) The slope defined by the direction field low.
f (x, y) = y + x2 is zero when y = −x2 , which is a
parabola in the lower half-plane with vertex at the y
origin. Below this parabola we have y < −x2 and 2
f (x, y) < 0, while above the parabola we have y > −x2
and f (x, y) > 0. Thus all slopes for points of the plane
below the parabola y = −x2 are negative, and all slopes
for points above it are positive. 1
Also, if x is fixed, then f (x, y) = y + x2 is an increasing
function as y increases. If instead y is fixed, then for
x > 0, f (x, y) increases as x increases, and for x < 0,
f (x, y) increases as x becomes more negative. These –1 0 x
–2 1 2
observations indicate that the slope given by the di-
rection field increases as we move from bottom to top
along any vertical line, whereas on moving along any
horizontal line, the slope increases with distance from –1
the y-axis.
(b) The features described in the solution to part (a)
are all apparent on the direction field diagram. This –2
direction field diagram is repeated below, with the
parabola y = −x2 superimposed upon it. (Note that
this parabola does not represent a solution of the dif- (d) The initial-value problem is
ferential equation.) dy
= y + x2 , y(−1) = −0.2.
dx
From Equations (2.9) and (2.10), the necessary formu-
lae are
xi+1 = xi + h,
Yi+1 = Yi + hf (xi , Yi ).
102
Solutions to the exercises
For the current problem, x0 = −1, Y0 = y0 = −0.2, (b) The differential equation ṗ = t/(1 + t2 ) has general
f (xi , Yi ) = Yi + x2i and h = 0.1. The particular formu- solution
lae needed here are therefore: t
p= dt.
xi+1 = xi + 0.1, where x0 = −1; 1 + t2
Yi+1 = Yi + 0.1(Yi + x2i ), where Y0 = −0.2. Using the hint provided, we make the substitution
u = 1 + t2 , for which du/dt = 2t. This gives
The second of these formulae can also be written as
t 1
Yi+1 = 1.1Yi + 0.1x2i , where Y0 = −0.2. 2
dt = 12 (2t) dt
1+t 1 + t2
1
= 12 du
u
Section 3 = 12 ln u + C (since u = 1 + t2 > 0)
= 1
2 ln(1 + t2 ) + C,
3.1 We apply direct integration to find the general so- where C is an arbitrary constant. The general solution
lution. In each case, C is an arbitrary constant. of the differential equation is therefore
(a) The differential equation dy/dx = 6x has general p = 12 ln(1 + t2 ) + C.
solution
y= 6x dx = 3x2 + C. 3.3 (a) The differential equation is dm/dt = −λm,
where m > 0. Following Procedure 3.2, we obtain
From the initial condition y(1) = 5, we have 5 = 3 + C, 1
so C = 2. The solution of the initial-value problem is dm = (−λ) dt
m
therefore
and, since m > 0, integration produces
y = 3x2 + 2.
ln m = −λt + B,
(b) The differential equation dv/du = e4u has general where B is an arbitrary constant. On solving this equa-
solution tion for m, by taking the exponential of both sides, we
v= e4u du = 14 e4u + C. obtain
m = e−λt+B = eB e−λt = Ce−λt ,
From the initial condition v(0) = 2, we have 2 = 14 + C,
so C = 74 . The solution of the initial-value problem is where C = eB is a positive (since eB > 0 for all B), but
therefore otherwise arbitrary, constant. The general solution is
therefore
v = 14 e4u + 74 .
m = Ce−λt ,
(c) The differential equation ẏ = 5 sin 2t has general
where C is a positive but otherwise arbitrary constant.
solution
(b) The initial condition is m(0) = m0 , from which we
y= 5 sin 2t dt = − 52 cos 2t + C. have m0 = Ce0 , so C = m0 . The required particular
From the initial condition y(0) = 0, we have solution is therefore
0 = − 25 + C, so C = 52 . The solution of the initial-value m = m0 e−λt .
problem is therefore
y = 52 (1 − cos 2t). 3.4 (a) The differential equation is
dy y−1
= , where x > 0.
3.2 (a) The differential equation dy/dx = xe−2x has dx x
general solution In order to apply the separation of variables method, we
need to exclude the cases where y = 1. So, for y = 1,
y= xe−2x dx. on applying Procedure 3.2 we have
The integral may be found using integration by parts. 1 1
dy = dx.
Taking f (x) = x and g (x) = e−2x , and using the for- y−1 x
mula Since x > 0, for y = 1 (so that y − 1 = 0), integration
produces
f (x)g (x) dx = f (x)g(x) − f (x)g(x) dx,
ln |y − 1| = ln x + B,
we have where B is an arbitrary constant. On solving this equa-
−2x
xe dx = − 21 xe−2x + 1 −2x
2e dx tion for y, by first taking the exponential of both sides,
we obtain
= − 21 xe−2x − 41 e−2x + C, y = 1 ± eln x+B = 1 ± eB eln x = 1 + Cx,
where C is an arbitrary constant. The general solution where C = ±eB is a non-zero but otherwise arbitrary
of the differential equation is therefore constant.
y = − 14 (2x + 1)e−2x + C.
103
Unit 2 First-order differential equations
Examination of the differential equation shows that Integration produces the general solution
C = 0 also gives a solution (the constant function y = ln |u − a| + C,
y = 1). The general solution is therefore
where C is an arbitrary constant.
y = 1 + Cx,
(b) The general solution of dy/dx = 1/(x(1 − ax)),
where C is an arbitrary constant.
where x = 0, x = 1/a, is given by
(If you cannot convince yourself that this is the general 1
solution for all y, including y = 1, then you will see it y= dx.
x(1 − ax)
proved in Example 4.1.)
This can be solved by using the substitution u = 1/x.
(b) The differential equation is dy/dx = 2y/(x2 + 1). Alternatively, if the integral is rearranged as
In order to apply the separation of variables method, 1 1
0,
we need to exclude the cases where y = 0. So, for y = y=− dx,
a (x − 0)(x − 1/a)
on applying Procedure 3.2 we have
then we can use the table of standard integrals in the
1 2
dy = dx. Handbook. Either method of integration gives the gen-
y 2
x +1 eral solution
Since y = 0, integration produces 1
y = C − ln − a ,
ln |y| = 2(arctan x + B), x
where B is an arbitrary constant. On solving this equa- where C is an arbitrary constant.
tion for y, we obtain
y = ±e2 arctan x+2B = ±e2B e2 arctan x = Ce2 arctan x , 3.7 Each of the differential equations can be solved by
where C = ±e2B is a non-zero but otherwise arbi- separation of variables.
trary constant. Examination of the differential equa- (a) The differential equation is u = du/dx = xu. For
tion shows that C = 0 also gives a solution (the zero the cases where u = 0, we divide through by u and in-
function y = 0). The general solution is therefore tegrate with respect to x. This gives
y = Ce2 arctan x , 1
du = x dx.
where C is an arbitrary constant. u
(If you cannot convince yourself that this is the general Integration produces
solution for all y, including y = 0, then you will see it ln |u| = 12 x2 + B,
proved in Example 4.1.) where B is an arbitrary constant. On solving this equa-
3.5 The differential equation is dv/du = eu+v = eu ev . tion for u, we obtain
2 2 2
Dividing through by ev and integrating with respect u = ±ex /2+B = ±eB ex /2 = Cex /2 ,
to u, we obtain where C = ±eB is a non-zero but otherwise arbitrary
e−v dv = eu du. constant. However, the case C = 0 can be added, since
it can be seen by inspection of the differential equation
Integration produces that the zero function u = 0 is a solution. Hence the
−e−v = eu + B, general solution is
2
where B is an arbitrary constant. On solving this equa- u = Cex /2
,
tion for v, we obtain where C is an arbitrary constant.
v = − ln(−eu − B) = − ln(C − eu ),
2
(You verified that u = 2ex /2 is a particular solution of
where C = −B. We need C > 0 and u < ln C in order this differential equation in Exercise 1.2(c).)
for the argument of ln here to be positive. Hence the (b) The differential equation is ẋ = dx/dt = 1 + x2 .
general solution is We divide through by 1 + x2 and integrate with respect
v = − ln(C − eu ) (u < ln C), to t. This gives
where C is a positive but otherwise arbitrary constant. 1
dx = 1 dt.
The initial condition v(0) = 0 gives 0 = − ln(C − e0 ), 1 + x2
so C − e0 = 1 and hence C = 2. The solution of the Integration produces
initial-value problem is therefore arctan x = t + C,
v = − ln(2 − eu ) (u < ln 2). where C is an arbitrary constant. On solving for x, we
have
3.6 Each of the differential equations is soluble by di-
x = tan(t + C).
rect integration.
A suitable domain for the solution is − π2 < t + C < π2 ,
(a) The general solution of dy/du = 1/(u − a), where
since the image set of arctan is the interval − π2 , π2 .
u= a, is given by
1
y= du.
u−a
104
Solutions to the exercises
3.8 (a) The given equation is dP/dt = kP (1 − P/M ). (c) The equation dz/dt = −3z 1/2 is not linear (because
First, note that (as remarked upon in Section 2) the of the z 1/2 term).
constant functions P = 0 and P = M are both solu- (d) The equation ẏ + y 2 = t is not linear (because of
tions. Assuming that we are considering neither of these the y 2 term).
possibilities (we are certainly not interested in P = 0
(e) The equation x(dy/dx) + y = y 2 is not linear (be-
since we know that P0 > 0), we can use the separation
cause of the y 2 term).
of variables method to obtain
1 1 (f ) The equation (1 + x2 )(dy/dx) + 2xy = 3x2 is lin-
dP = 1 dt.
k P (1 − P/M ) ear, since we can divide through by 1 + x2 to obtain
The integral on the left-hand side is of the form evalu- dy/dx + 2xy/(1 + x2 ) = 3x2 /(1 + x2 ), which is of the
ated in Exercise 3.6(b), with 1/M in place of a. Hence defined form with g(x) = 2x/(1 + x2 ) and h(x) =
we have 3x2 /(1 + x2 ).
1 1 1
− ln − = t + B, 4.2 (a) The given equation is dy/dx − y = ex sin x.
k P M
Comparison with Equations (4.1) and (4.9) shows that
where B is an arbitrary constant. On solving for P , we
the integrating factor is
find first that
1 1 = exp(−x) = e−x .
− = ±e−k(t+B) p = exp (−1) dx
P M
= ±e−kB e−kt Multiplying through by p(x) gives
= Ce −kt
, dy
e−x − e−x y = sin x.
where C = ±e −kB
is a non-zero but otherwise arbitrary dx
Thus the differential equation can be rewritten as
constant. However, note that C = 0 corresponds to the
constant solution P = M already noted, so the restric- d −x
(e y) = sin x.
0 may be dropped. Hence we obtain
tion C = dx
−1 On integrating, we find the general solution
1
P = + Ce−kt (ekt = −M C), e−x y = − cos x + C,
M
or, equivalently,
where C is an arbitrary constant.
y = ex (C − cos x),
From the initial condition P (0) = P0 , we have
−1 where C is an arbitrary constant.
1 1 1
P0 = + Ce0 , so C = − . (b) The given equation, when rearranged into
M P0 M
The solution of the initial-value problem is therefore form (4.1), is dy/dx − y = x. This has the same left-
−1 hand side as the differential equation in part (a), and
1 1 1 hence the same integrating factor, p = e−x . Multiplying
P = + − e−kt ,
M P0 M through by p(x) gives
which yields dy
M e−x − e−x y = xe−x .
P = . dx
1 + (M/P0 − 1)e−kt Thus the differential equation can be rewritten as
Finally, we rewrite this in the more familiar form d −x
(e y) = xe−x .
M ekt dx
P = kt . On integrating (by parts on the right-hand side), we
e + (M/P0 − 1)
find
(b) As t → ∞ we have e−kt → 0, and consequently the
value of P (t) approaches M . e−x y = xe−x dx
(Note that this is true whether the starting value P0 is
greater than or less than M . This result is consistent = −xe−x + e−x dx
with the specific direction field shown in Figure 2.4.)
= −xe−x − e−x + C
= C − (x + 1)e−x ,
where C is an arbitrary constant. After multiplying
through by ex , the general solution in explicit form is
y = Cex − (x + 1).
105
Unit 2 First-order differential equations
4.3 (a) The given equation, when rearranged into 4.5 (a) The given equation is dy/dt + y = t + 1. Com-
form (4.1), is du/dx − xu = 0. The integrating factor is parison with Equations (4.1) and (4.9) shows that the
integrating factor is
p = exp (−x) dx
p = exp 1 dt
= exp(−x2 /2)
2 = exp(t)
= e−x /2
.
= et .
Multiplying through by p(x) gives
Multiplying through by p(t) gives
2 du 2
e−x /2 − xe−x /2 u = 0. dy
dx et + et y = (t + 1)et .
Thus the differential equation can be rewritten as dt
d −x2 /2 Thus the differential equation can be rewritten as
(e u) = 0. d t
dx (e y) = (t + 1)et .
On integrating, we find the general solution dt
2 2 On integrating (by parts on the right-hand side), we
e−x /2 u = C, or, equivalently, u = Cex /2
, find
where C is an arbitrary constant.
et y = (t + 1)et dt
From the initial condition u(0) = 2, we have 2 = Ce0 , so
C = 2. Hence the solution of the initial-value problem
= (t + 1)et − et dt
is
2
u = 2ex /2 . = (t + 1)et − et + C
(b) After division by t, the given equation can be writ- = tet + C,
ten as dy/dt + (2/t)y = t. (To avoid division by zero, where C is an arbitrary constant. After multiplying
we take t > 0, say, which is consistent with the initial through by e−t , the general solution in explicit form is
condition.) The integrating factor is y = Ce−t + t.
2
p = exp dt From the initial condition y(1) = 0, we have
t 0 = Ce−1 + 1, so C = −e. Hence the solution of the
= exp(2 ln t) initial-value problem is
= exp(ln(t2 )) y = t − e1−t .
2
=t . (b) After division by e3t and rearrangement, the given
Multiplying through by p(t) gives equation becomes dy/dt + y = e−3t . This has the same
dy left-hand side as the differential equation in part (a),
t2 + 2ty = t3 . and hence the same integrating factor, p = et . Multi-
dt
Thus the differential equation can be rewritten as plying through by p(t) gives
d 2 dy
(t y) = t3 . et + et y = e−2t .
dt dt
On integrating, we find the general solution Thus the differential equation can be rewritten as
t2 y = 14 t4 + C, or, equivalently, y = 41 t2 + Ct−2 , d t
(e y) = e−2t .
where C is an arbitrary constant. dt
On integrating, we find the general solution
From the initial condition y(1) = 1, we have 1 = 14 + C,
et y = − 12 e−2t + C,
so C = 34 . Hence the solution of the initial-value prob-
lem is or, equivalently,
y = 14 (t2 + 3t−2 ). y = Ce−t − 12 e−3t ,
where C is an arbitrary constant.
4.4 (a) and (d) require the integrating factor method. From the initial condition y(0) = 3, we have
(b) is best solved by direct integration. (c) can be 3 = Ce0 − 12 e0 , so C = 72 . Hence the solution of the
solved by separation of variables or the integrating fac- initial-value problem is
tor method.
y = 12 (7e−t − e−3t ).
106
Solutions to the exercises
p = exp 4 dt
= exp(4t)
= e4t .
Multiplying through by p(t) gives
dv
e4t + 4e4t v = 3e4t cos 2t.
dt
Thus the differential equation can be rewritten as
d 4t
(e v) = 3e4t cos 2t.
dt
On integrating (using the hint for the right-hand side,
with a = 4 and b = 2), we find
e4t v = 20
3 4t
e (4 cos 2t + 2 sin 2t) + C,
where C is an arbitrary constant. After multiplying
through by e−4t , the general solution in explicit form is
v= 3
20 (4 cos 2t + 2 sin 2t) + Ce−4t .
107
UNIT 3 Second-order differential
equations 1
109
Unit 3 Second-order differential equations
Introduction
Unit 2 introduced you to differential equations, and in particular to first-
order differential equations that can be written in the form
dy
= f (x, y).
dx
Such an equation is said to be of first order because it involves only the first
derivative dy/dx of the function y = y(x).
This unit considers second-order differential equations, that is, differential The order of a differential
equations that involve a second (but no higher) derivative. Examples of equation was defined in
second-order differential equations are Subsection 1.2 of Unit 2.
d2 y dy x d2 y
− 3 + 2y = 4e and 3 + y 2 sin x = x2 . A second-order differential
dx2 dx dx2 equation may or may not
As in the case of first-order differential equations, second-order differential include a first derivative.
equations, and in particular the derivatives in such equations, can be written
in a variety of notations. For example, the second derivative of a dependent
variable y with respect to an independent variable t (representing time) may Of course, the dependent
be written as d2 y/dt2 , ÿ, y , and so on. Also, as in the case of first-order variable is not always y!
equations, the dependent variable and sometimes the independent variable
can be considered as functions, and the same symbol is frequently used for
both the variable and the corresponding function.
One particularly simple example of a second-order differential equation, with
dependent variable s and independent variable t, is
d2 s
= a, (0.1)
dt2
where a is a given constant. This equation can be solved by applying direct See Subsection 3.1 of Unit 2.
integration twice. One application gives
ds
= a dt = at + C,
dt
where C is an arbitrary constant. Integrating a second time gives
where D is another arbitrary constant. Equation (0.2) is the general solution Recall, from Unit 2, that the
of the second-order differential equation (0.1). general solution of a
differential equation is the
Now, in Unit 2 you saw that the general solution of a first-order differential collection of all possible
equation usually involves just one arbitrary constant. But here, even for solutions of that equation.
such a simple second-order differential equation, the general solution involves
two arbitrary constants (namely C and D). It is a property of second-order
differential equations that the general solution usually involves two arbitrary
constants.
The remainder of the unit proceeds as follows. Section 1 concentrates on
homogeneous linear constant-coefficient second-order differential equations,
leaving inhomogeneous equations to Section 2. Section 3 considers the types
of condition needed to move from a general to a particular solution. Finally,
Section 4 uses the computer to examine the nature of solutions.
110
Section 1 Homogeneous differential equations
Example 1.1
Suppose that a car is travelling with constant acceleration a along a straight
road. If, at time t, its distance from a fixed point is s, then its velocity
is given by ds/dt, its acceleration is given by d2 s/dt2 , and its motion is
modelled by
d2 s
= a. (0.1)
dt2
If the car is initially stationary at position s = 0 and thereafter has a con-
stant acceleration of 2 m s−2 , how long does it take for the car to attain a
velocity of 30 m s−1 , and what distance has it travelled in that time?
Solution
You saw in the Introduction that integrating Equation (0.1) leads to
ds
= at + C and s = 12 at2 + Ct + D,
dt
where C and D are arbitrary constants. To find these constants (and hence
answer the questions asked), we need to make use of the conditions given.
These are that the car is initially stationary (i.e. ds/dt = 0 when t = 0) at
position s = 0 (i.e. s = 0 when t = 0). The first of these conditions, together
with the equation ds/dt = at + C, tells us that C = 0. With C = 0, the
second equation becomes s = 12 at2 + D, and this together with the second
condition tells us that D = 0.
Therefore, when a = 2, we have
d2 s ds
2
= 2, = 2t, s = t2 .
dt dt
So the velocity is ds/dt = 30 when 2t = 30, i.e. after 15 seconds, and in this
time the car has travelled s = 152 = 225 metres.
111
Unit 3 Second-order differential equations
Definitions
(a) A second-order differential equation for y = y(x) is linear if it can Compare the definitions for
be expressed in the form first-order equations in
Subsection 4.1 of Unit 2. The
d2 y dy important feature is the
a(x) + b(x) + c(x)y = f (x), linear combination of y and
dx2 dx
its derivatives on the
where a(x), b(x), c(x) and f (x) are given continuous functions. left-hand side.
(b) A linear second-order differential equation is constant-coefficient
if the functions a(x), b(x) and c(x) are all constant, so that the
equation is of the form
d2 y dy
a 2
+b + cy = f (x), (1.1)
dx dx
where a = 0. If a = 0, then the equation is
first-order.
(c) A linear constant-coefficient second-order differential equation is
said to be homogeneous if f (x) = 0 for all x, and inhomoge-
neous (or non-homogeneous) otherwise.
*Exercise 1.1
Consider the following second-order differential equations.
d2 y d2 y dy d2 y dy
(i) = x2 (ii) 3 + 4 + y = x2
(iii) 3 +4 +y =0
dx2 dx 2 dx dx 2 dx
d2 y dy d2 y dy
(iv) xy + x2 y = 0 (v) 2y 2 + xy = 3 (vi) 2y 2 + 4y = 3
dx dx dx dx
d2 t dt
(vii) 2 2 + 3 + 4t = sin θ (viii) x ¨ = −4t (ix) x ¨ = −4x
dθ dθ
(a) Which of the equations are linear and constant-coefficient?
(b) Which of the linear constant-coefficient equations are homogeneous?
(c) For each equation, identify the dependent and independent variables.
112
Section 1 Homogeneous differential equations
113
Unit 3 Second-order differential equations
Definition
The auxiliary equation of the homogeneous linear constant-coefficient The auxiliary equation is
second-order differential equation sometimes called the
characteristic equation.
d2 y dy
a
2
+b + cy = 0
dx dx
is the quadratic equation
aλ2 + bλ + c = 0. (1.5)
114
Section 1 Homogeneous differential equations
Example 1.2
Write down the auxiliary equation of the differential equation
d2 y dy
3 −2 + 4y = 0.
dx2 dx
Solution
The auxiliary equation is
3λ2 − 2λ + 4 = 0.
*Exercise 1.2
Write down the auxiliary equation of each of the following differential equa-
tions.
d2 y dy
(a) 2
−5 + 6y = 0 (b) y − 9y = 0 (c) x
¨ + 2ẋ = 0
dx dx
115
Unit 3 Second-order differential equations
In Subsection 1.3 we shall prove that if λ1 and λ2 are distinct roots of the
auxiliary equation of a homogeneous linear constant-coefficient second-order
differential equation, then any solution is of the form
y = Ceλ1 x + Deλ2 x , (1.6)
for some choice of constants C and D.
*Exercise 1.3
Use the auxiliary equation to find the general solution of each of the following
differential equations.
d2 y dy d2 y dy d2 z
(a) + 5 + 6y = 0 (b) 2 + 3 = 0 (c) − 4z = 0
dx2 dx dx2 dx du2
We now consider an example where the two roots of the auxiliary equation
are equal, in which case the above recipe does not work! Indeed, in light
of the earlier discussion, you might expect the solution always to be of the
form y = Aeλ1 x + Beλ2 x , where A and B are arbitrary constants. But if
λ1 = λ2 , this reduces to y = (A + B)eλ1 x = Ceλ1 x , where C = A + B is a
single arbitrary constant, so this cannot be the general solution of a second -
order differential equation.
Example 1.4
(a) Write down the auxiliary equation of the differential equation
d2 y dy
2
+6 + 9y = 0,
dx dx
and find its roots λ1 and λ2 .
(b) Deduce that y1 = Ce−3x is a solution of the differential equation, for
any value of the constant C.
(c) Show that y2 = Dxe−3x is also a solution, for any value of the con-
stant D.
(d) Deduce that y = (C + Dx)e−3x is also a solution of the differential equa-
tion, for any values of the two constants C and D.
Solution
(a) The auxiliary equation is
λ2 + 6λ + 9 = 0.
The left-hand side is the perfect square (λ + 3)2 , so the auxiliary equa-
tion has equal roots λ1 = λ2 = −3.
(b) Since λ1 = −3 is a root of the auxiliary equation, y1 = Ce−3x is a solu- Note that the ‘other’ root
tion of the differential equation, for any value of C. λ2 = −3 gives the same
solution.
116
Section 1 Homogeneous differential equations
The solution in Example 1.4 is of the form y = Ceλ1 x + Dxeλ1 x . The extra x
in the second term, Dxeλ1 x , is needed, in this special case, to incorporate the
second arbitrary constant required by the general solution of a second-order
differential equation.
In general, when λ1 = λ2 , y = xeλ1 x is a solution of Equation (1.3) (page 114).
To see this, differentiate twice to obtain
dy
= eλ1 x + λ1 xeλ1 x = (1 + λ1 x)eλ1 x ,
dx
d2 y
= λ1 eλ1 x + λ1 (1 + λ1 x)eλ1 x = (2λ1 + λ21 x)eλ1 x ,
dx2
and substitute into the left-hand side of Equation (1.3) to obtain
d2 y dy
a 2
+b + cy
dx dx
= a (2λ1 + λ21 x)eλ1 x + b (1 + λ1 x)eλ1 x + c xeλ1 x
= eλ1 x a(2λ1 + λ21 x) + b(1 + λ1 x) + cx
= eλ1 x (2aλ1 + b) + (aλ21 + bλ1 + c)x . (1.7)
Since λ1 is the solution of the auxiliary equation, we have aλ21 + bλ1 + c = 0.
Also, the formula method for solving the auxiliary equation aλ2 + bλ + c = 0
gives
√
−b ± b2 − 4ac
λ= ;
2a
since in this case we have equal roots, we must have b2 − 4ac = 0, so
λ1 = −b/2a, and therefore 2aλ1 + b = 0. Thus the right-hand side of Equa-
tion (1.7) is zero, and y = xeλ1 x is indeed a solution of Equation (1.3).
Therefore, when λ1 = λ2 , by the principle of superposition,
y = Ceλ1 x + Dxeλ1 x = (C + Dx)eλ1 x , (1.8)
where C and D are arbitrary constants, is always a solution of Equa-
tion (1.3). In fact, as you will see in Subsection 1.3, Equation (1.8) gives
the general solution.
117
Unit 3 Second-order differential equations
*Exercise 1.4
Use the auxiliary equation to find the general solution of the following dif-
ferential equations.
d2 y dy
(a) 2
+2 +y =0 (b) s̈ − 4ṡ + 4s = 0
dx dx
Equations (1.6) and (1.8) give us the general solution of Equation (1.3) for
the cases where the roots λ1 and λ2 of the auxiliary equation are distinct
and equal, respectively. However, the distinct roots of a quadratic equation
may not be real — they could consist of a pair of complex conjugate roots Recall that the complex
λ1 = α + βi and λ2 = α − βi. If the auxiliary equation has such a pair of conjugate of the complex
roots, we can still write the general solution in the form number α + βi is α − βi.
y = Aeλ1 x + Beλ2 x = Ae(α+βi)x + Be(α−βi)x , You will soon see why we use
A and B for the arbitrary
but we now have a complex-valued solution. constants (rather than our
usual choice of C and D).
Since Equation (1.3) has real coefficients, we would like a real-valued solu-
tion. In order to achieve this, we shall need to allow A and B to be complex.
Then we can use Euler’s formula, which tells us that Euler’s formula was discussed
in Unit 1.
eiβx = cos βx + i sin βx and e−iβx = cos βx − i sin βx.
Now
y = Aeλ1 x + Beλ2 x
= Ae(α+βi)x + Be(α−βi)x
= Aeαx eiβx + Beαx e−iβx
= Aeαx (cos βx + i sin βx) + Beαx (cos βx − i sin βx)
= eαx ((A + B) cos βx + (Ai − Bi) sin βx)
= eαx (C cos βx + D sin βx), The constants in the final
expression are now C and D,
where C = A + B and D = (A − B)i. Provided that any initial conditions in keeping with our previous
are real-valued, C and D are real, and this is the required real-valued solution solutions.
containing two arbitrary constants.
Example 1.5
(a) Write down the auxiliary equation of the differential equation
d2 y dy
−6 + 13y = 0,
dx2 dx
and show that its roots are λ1 = 3 + 2i and λ2 = 3 − 2i.
(b) Confirm that y1 = e3x cos 2x and y2 = e3x sin 2x are both solutions of
the differential equation.
(c) Deduce that y = e3x (C cos 2x + D sin 2x) is also a solution of the differ-
ential equation, for any values of the two constants C and D.
Solution
(a) The characteristic equation is
λ2 − 6λ + 13 = 0.
The formula method gives
√ √
6 ± 36 − 4 × 1 × 13 6 ± −16
λ= = = 3 ± 2i,
2 2
so the two complex conjugate roots are λ1 = 3 + 2i and λ2 = 3 − 2i. With the previous notation
we have α = 3 and β = 2.
118
Section 1 Homogeneous differential equations
*Exercise 1.5
Use the auxiliary equation to find the general solution of each of the following
differential equations.
d2 y dy
(a) +4 + 8y = 0
dx2 dx
d2 θ
(b) + 9θ = 0
dt2
119
Unit 3 Second-order differential equations
It is worth noting that the three cases in part (b) of Procedure 1.1 corre-
spond to three different possibilities that arise when solving the characteris-
tic equation aλ2 + bλ + c = 0. These three different possibilities relate to the
value of the discriminant b2 − 4ac: b2 − 4ac > 0 corresponds to case (i),
b2 − 4ac = 0 to case (ii), and b2 − 4ac < 0 to case (iii).
*Exercise 1.6
Find the general solution of each of the following differential equations.
d2 y
(a) + 4y = 0
dx2
(b) u (x) − 6u (x) + 8u(x) = 0
d2 y dy
(c) +2 =0
dx2 dx
d2 y dy
(d) −2 +y =0
dx2 dx
d2 y
(e) − ω2 y = 0, where ω is a real constant
dx2
d2 y dy
(f) +4 + 29y = 0
dx2 dx
120
Section 1 Homogeneous differential equations
Exercise 1.7
Small oscillations of the pendulum of a clock can be modelled by the differ-
ential equation
g
θ¨ = − θ,
l θ
l
where g is the magnitude of the acceleration due to gravity, l is the length
of the pendulum, and θ is the angle the pendulum makes with the vertical
(see Figure 1.1). Solve the differential equation to obtain an expression for
θ in terms of g and l.
Theorem 1.2
Suppose that the roots of the equation
aλ2 + bλ + c = 0
are λ1 and λ2 . Then the general solution of the second-order linear
constant-coefficient differential equation
d2 y dy
a +b + cy = 0 (1.3)
dx2 dx
can always be written in the form
Ceλ1 x + Deλ2 x (λ1 = λ2 ),
y(x) =
(C + Dx)eλx (λ1 = λ2 = λ).
121
Unit 3 Second-order differential equations
122
Section 1 Homogeneous differential equations
or
d −λ1 x
ye = Ae(λ2 −λ1 )x . (1.13)
dx
We now have two cases to consider. If λ2 − λ1 = 0, then integrating both This is the case when the
sides yields auxiliary equation has
distinct roots.
A
ye−λ1 x = e(λ2 −λ1 )x + C,
λ2 − λ1
where C is an arbitrary constant. Multiplying through by eλ1 x , we arrive at
the explicit form of the general solution
A
y= eλ2 x + Ceλ1 x .
λ2 − λ1
Finally, replacing the arbitrary constant A/(λ2 − λ1 ) by D gives the required
result:
y = Ceλ1 x + Deλ2 x .
If λ1 = λ2 = λ, say, then Equation (1.13) becomes This is the case when the
d −λx
auxiliary equation has equal
ye = A, roots.
dx
and integrating gives
ye−λx = Ax + C,
where C is an arbitrary constant. Re-labelling A as D, and multiplying
through by eλx , we arrive at the explicit form of the general solution
y = (C + Dx)eλx .
This completes the proof of Theorem 1.2.
End-of-section Exercises
Exercise 1.8
(a) Write down the auxiliary equation of the differential equation
dy d2 y
3 − y − 2 2 = 0.
dx dx
(b) Solve this auxiliary equation.
(c) Write down the general solution of the differential equation.
Exercise 1.9
Find the general solution of each of the following differential equations.
d2 y dy d2 y
(a) + 2 + 2y = 0 (b) − 16y = 0
dx2 dx dx2
d2 y dy d2 θ dθ
(c) 2
+ 4y = 4 (d) 2
+3 =0
dx dx dt dt
Exercise 1.10
For which values of the constant k does the differential equation
d2 y dy
2
+ 4k + 4y = 0
dx dx
have a general solution with oscillating behaviour, that is, a general solution
which involves sines and cosines?
123
Unit 3 Second-order differential equations
Example 2.1
Show that y = Ce−2x + De−3x + 2 is a solution of the inhomogeneous dif-
ferential equation
d2 y dy
+5 + 6y = 12 (2.2)
dx2 dx
for any values of the constants C and D.
Solution
We know from Exercise 1.3(a) that the homogeneous differential equation
d2 y dy
2
+5 + 6y = 0 (2.3)
dx dx
has a general solution yc = Ce−2x + De−3x , where C and D are arbitrary
constants.
Now consider the constant function yp = 2. This is a particular solution of The notation yc and yp will
Equation (2.2) since d2 yp /dx2 = dyp /dx = 0 and 6yp = 12. be explained shortly.
124
Section 2 Inhomogeneous differential equations
have special names in this context: yc , the general solution of the asso-
ciated homogeneous equation (2.3), is called the complementary function,
and yp , a particular solution of the inhomogeneous equation (2.2), is called
a particular integral.
Definitions
Let
d2 y dy
a 2
+b + cy = f (x) (2.1)
dx dx
be an inhomogeneous linear constant-coefficient second-order differen-
tial equation.
(a) Its associated homogeneous equation is
d2 y dy
a 2
+b + cy = 0.
dx dx
(b) The general solution yc of the associated homogeneous equation is
known as the complementary function for the original inhomo-
geneous equation (2.1).
(c) Any particular solution yp of the original inhomogeneous equa-
tion (2.1) is referred to as a particular integral for that equation. The term particular integral
is used here, rather than the
term particular solution used
Later in this section we shall show how to find particular integrals for a wide in some other texts, to
variety of equations. Before we do that, it is important to realize the full distinguish it from the
particular solution to
significance of finding just one particular integral. Equation (2.1) that satisfies
given initial or boundary
*Exercise 2.1 conditions (see Section 3).
Suppose that we have found two different particular integrals yp1 , yp2 for
Equation (2.1). Use the principle of superposition to show that the function
yp1 − yp2 is then a solution of the associated homogeneous equation.
The result of Exercise 2.1 shows the true significance of finding a particular
integral. For if we do so then, since from Section 1 we know how to solve That is, we can find the
the associated homogeneous equation, we can find all particular integrals complementary function.
simply by adding the complementary function. We have proved the following
important result.
Theorem 2.1
If yc is the complementary function for an inhomogeneous linear
constant-coefficient second-order differential equation, and yp is a par-
ticular integral for that equation, then yc + yp is the general solution
of that equation.
Note that yc , being the general solution of the associated homogeneous equa-
tion, will contain two arbitrary constants, whereas yp , being a particular
solution, will contain none.
Let us now see how the method based on Theorem 2.1 can be applied.
125
Unit 3 Second-order differential equations
Example 2.2
Find the general solution of the differential equation
d2 y
+ 9y = 9x + 9. (2.4)
dx2
Solution
The associated homogeneous equation is
d2 y
+ 9y = 0,
dx2
which has the general solution
yc = C cos 3x + D sin 3x. See Exercise 1.5(b), although
there different symbols were
This is the complementary function for Equation (2.4). used for the variables.
A particular integral for Equation (2.4) is You will see in the next
subsection how to find such a
yp = x + 1. particular integral.
This may be verified by differentiation and substitution: yp = 1 and yp = 0,
and substituting into the left-hand side of Equation (2.4) gives
yp + 9yp = 0 + 9(x + 1) = 9x + 9,
which is the same as the right-hand side of Equation (2.4), as required.
The general solution of Equation (2.4) is, therefore, by Theorem 2.1,
y = yc + yp = C cos 3x + D sin 3x + x + 1,
where C and D are arbitrary constants.
126
Section 2 Inhomogeneous differential equations
In that case, the form of the general solution would have been obtained as
y = C cos 3x + D sin 3x + x + 1 + sin 3x. This looks a little different, but it
may be written in the form y = C cos 3x + (D + 1) sin 3x + x + 1; and, since
C and D are arbitrary constants, this form of the general solution represents
exactly the same family of solutions.
*Exercise 2.2
Consider the following differential equations.
d2 y d2 y dy
(a) 2
+ 4y = 8 (b) −3 + 2y = 6 See Exercise 1.6(a) and
dx dx2 dx Example 1.3.
For each equation:
• write down its associated homogeneous equation and its complementary
function yc ;
• find a particular integral of the form yp = p, where p is a constant;
• write down the general solution.
127
Unit 3 Second-order differential equations
Example 2.3
Find a particular integral for
d2 y dy
3 2
−2 + y = 4x + 2.
dx dx
Solution
We try a solution of the form
y = p1 x + p0 ,
where p1 and p0 are coefficients to be determined so that the differential
equation is satisfied. To try this solution, we need the first and second
derivatives of y:
dy d2 y
= p1 , = 0.
dx dx2
Substituting these into the left-hand side of the differential equation gives
d2 y dy
3 2
−2 + y = 3 × 0 − 2p1 + (p1 x + p0 ) = p1 x + (p0 − 2p1 ).
dx dx
Therefore, for y = p1 x + p0 to be a solution of the differential equation, we
require that
p1 x + (p0 − 2p1 ) = 4x + 2 for all x. (2.5)
To find the two unknown coefficients p1 and p0 , we compare the coefficients Comparing coefficients works
on both sides of Equation (2.5). Comparing the terms in x gives p1 = 4. because two polynomials are
Comparing the constant terms gives p0 − 2p1 = 2, so that p0 = 2 + 2p1 = equal if and only if all their
corresponding coefficients are
2 + 2 × 4 = 10. Therefore we have the particular integral the same.
yp = 4x + 10.
Checking : if y = 4x + 10, then dy/dx = 4, d2 y/dx2 = 0, and substituting
into the left-hand side of the differential equation gives
d2 y dy
3 2
−2 + y = 3 × 0 − 2 × 4 + (4x + 10) = 4x + 2,
dx dx
as required.
You will have noticed in Example 2.3 that substituting a linear trial solution
y = p1 x + p0 into the left-hand side of the differential equation resulted
in a linear function, namely p1 x + (p0 − 2p1 ), whose coefficients could be
compared with those of the linear target function 4x + 2 to obtain values
for p1 and p0 . This is really the key to the method. If the target function
is linear, choosing a linear trial solution ensures that substituting into the
left-hand side of the differential equation results in a linear function whose
coefficients can be compared with those of the target function. Similarly,
as you will see below, if the target function belongs to certain other classes
of functions, choosing as a trial solution a general function from that class
ensures that substitution into the left-hand side of the differential equation
produces another function from the same class, whose coefficients can be
compared with those of the target function, thus enabling values to be given
to the coefficients of the trial solution. The method will work provided that
all the derivatives of functions in the class are also in the class.
128
Section 2 Inhomogeneous differential equations
*Exercise 2.3
Find particular integrals of the form y = p1 x + p0 for each of the following
differential equations.
d2 y dy d2 y dy
(a) 2
− 2 + 2y = 2x + 3 (b) 2
+2 + y = 2x
dx dx dx dx
*Exercise 2.4
Find a particular integral for
y − y = t2 .
An exponential function f (x) = mekx
Example 2.4
Find a particular integral for
d2 y
+ 9y = 2e3x .
dx2
Solution
We try a solution of the form
y = pe3x , Since the derivative of e3x is
3e3x , the exponent (3x)
where p is a coefficient to be determined so that the differential equation is appearing in y(x) should be
satisfied. Differentiating y = pe3x gives the same as that appearing in
f (x), and only the coefficient
dy d2 y
= 3pe3x , = 9pe3x . p is to be determined.
dx dx2
Substituting these into the left-hand side of the differential equation gives
d2 y
+ 9y = 9pe3x + 9pe3x = 18pe3x .
dx2
Therefore, for y = pe3x to be a solution of the differential equation, we
require that 18pe3x = 2e3x for all x. Hence p = 19 , and
yp = 19 e3x
is a particular integral for the given differential equation.
129
Unit 3 Second-order differential equations
*Exercise 2.5
Find a particular integral for
d2 y dy
2 −2 + y = 2e−x .
dx2 dx
A sinusoidal function (f (x) = m cos Ωx + n sin Ωx) This type of function f (x) is
particularly important in
Following on from earlier ideas, the trial solution must contain terms like many practical applications.
those in f (x) and all its derivatives; so, even if f (x) contains only a sine or
only a cosine, the trial solution y(x) must contain both a sine and a cosine.
However, the value of the parameter Ω appearing in y(x) should be the same
as that appearing in f (x).
Example 2.5
Find a particular integral for
d2 y dy
2
+2 + 2y = 10 sin 2x.
dx dx
Solution
We try a solution of the form
y = p cos 2x + q sin 2x,
where p and q are coefficients to be determined so that the differential equa-
tion is satisfied. Differentiating y gives
dy d2 y
= −2p sin 2x + 2q cos 2x, = −4p cos 2x − 4q sin 2x.
dx dx2
Substituting these into the left-hand side of the differential equation gives
d2 y dy
2
+2 + 2y = (−4p cos 2x − 4q sin 2x)
dx dx
+ 2 (−2p sin 2x + 2q cos 2x)
+ 2(p cos 2x + q sin 2x)
= (−2p + 4q) cos 2x + (−4p − 2q) sin 2x.
Therefore, for y = p cos 2x + q sin 2x to be a solution of the differential equa-
tion, we require that
(−2p + 4q) cos 2x + (−4p − 2q) sin 2x = 10 sin 2x for all x. (2.6)
To find p and q, we compare the coefficients of cos and sin on both sides Comparing coefficients works
of Equation (2.6). Comparing cos terms gives −2p + 4q = 0. Comparing because the cosine and sine
sin terms gives −4p − 2q = 10. Solving these simultaneous equations gives functions are linearly
independent :
p = −2, q = −1. Hence if a sin rx + b cos rx = 0 for
yp = −2 cos 2x − sin 2x all x, then a = b = 0.
*Exercise 2.6
Find a particular integral for
d2 y dy
− = cos 3t + sin 3t.
dt2 dt
130
Section 2 Inhomogeneous differential equations
Exercise 2.7
What form of trial solution y should you use in order to find a particular
integral for each of the following differential equations?
d2 y d2 y dy
(a) − y = e 3x (b) +2 − 4y = sin 3x
dx2 dx2 dx
*Exercise 2.8
Find the general solution of each of the following differential equations. The complementary functions
were obtained in Exercise 1.9
d2 y dy d2 θ dθ
(a) + 2 + 2y = 4 (b) + 3 = 9 cos 3t parts (a) and (d).
dx2 dx dt2 dt
Exercise 2.9
A long horizontal rectangular beam of length l rests on rigid supports at
each end. It is important in civil engineering to determine how much such a l
beam ‘sags’. A simple model of this ‘sag’, or vertical displacement y, is the
differential equation
Ry − Sy + 12 Q(l − x)x = 0, y
x displacement
where R, S and Q are constants related to the structure of the beam, and
x is the distance from one end of the beam (as illustrated in Figure 2.1).
Find the general solution of the differential equation in the case where R, S Figure 2.1
and Q are all equal to 1.
In Subsection 2.4 you will see how the principle of superposition can be
used in combination with Procedure 2.2 to solve differential equations whose
right-hand-side functions f (x) are sums of polynomial, exponential and
sinusoidal functions. But first let us look at some exceptional cases for
which Procedure 2.2 does not work and needs to be adapted.
131
Unit 3 Second-order differential equations
Example 2.6
Find a particular integral for
d2 y
− 4y = 2e2x .
dx2
Solution
Using Procedure 2.2, let us try y = pe2x . Differentiating this gives
dy d2 y
= 2pe2x , = 4pe2x .
dx dx2
Substituting these into the left-hand side of the differential equation gives
d2 y
− 4y = 4pe2x − 4pe2x = 0.
dx2
So there is no value of p that gives a particular integral of the form y = pe2x .
The trouble is that the complementary function, i.e. the general solution of
the associated homogeneous equation
d2 y
− 4y = 0,
dx2
is y = Ce−2x + De2x , thus the trial solution is a solution of the associated See Exercise 1.3(c).
homogeneous equation (with C = 0, D = p). Hence, on substituting the
trial solution y = pe2x into the inhomogeneous equation, the left-hand side
is zero for any value of p, so it cannot be equal to the non-zero right-hand
side.
In such circumstances, the difficulty can generally be overcome by multiply-
ing the trial solution suggested in Procedure 2.2 by x. Thus, in this case, the There is an analogy here with
trial solution should be modified to take the form y = pxe2x . Differentiating the case of the homogeneous
this gives differential equation when the
characteristic equation has
dy equal roots; in that case,
= pe2x + 2pxe2x = p(1 + 2x)e2x , when eλx is one solution of
dx
d2 y the equation, another solution
= 2pe2x + 2p(1 + 2x)e2x = 4p(1 + x)e2x . is given by xeλx .
dx2
Substituting these into the left-hand side of the differential equation gives
d2 y
− 4y = 4p(1 + x)e2x − 4pxe2x = 4pe2x .
dx2
Therefore y = pxe2x is a solution of the differential equation provided that
4pe2x = 2e2x for all x. Hence p = 12 , and
yp = 12 xe2x
is a particular integral for the given differential equation.
The problem with the trial solution being a solution of the associated ho-
mogeneous equation can occur irrespective of the form of the trial solution
(i.e. polynomial, exponential or sinusoidal), but in most cases it can be
overcome by multiplying the trial solution suggested in Procedure 2.2 by x.
132
Section 2 Inhomogeneous differential equations
When using Procedure 2.2, you should check whether the proposed trial
solution is a solution of the associated homogeneous equation, and if so try
multiplying it by x. (This is why it is important to find yc before yp in
Procedure 2.1.)
Exercise 2.10
Find a particular integral for each of the following differential equations. The complementary functions
are given in the solutions to
d2 y dy x d2 y dy
(a) 2
− 3 + 2y = 4e (b) 2 2
+3 =1 Example 1.3 and
dx dx dx dx Exercise 1.3(b).
Exercise 2.11
The motion of a marble dropped from the Clifton Suspension Bridge into
the River Avon can be modelled by the differential equation
mẍ + rẋ − mg = 0,
x
where m is the mass of the marble, r is a constant related to air resistance,
g is the magnitude of the acceleration due to gravity, and x is the vertical
distance from the point of dropping (as shown in Figure 2.2). Find an
expression for x in terms of the time t. Figure 2.2
We have seen that Procedure 2.2 fails if the trial solution is a solution of
the associated homogeneous differential equation: in such cases we multiply
the suggested trial solution by x and use this as the trial solution. Another
situation in which it is necessary to multiply the suggested trial solution by
x is illustrated in the following example.
Example 2.7
Find a particular integral for
d2 y dy
2
+2 = 2x + 2.
dx dx
Solution
Using Procedure 2.2, let us try y = p1 x + p0 . Differentiating this gives
dy d2 y
= p1 , = 0.
dx dx2
Substituting these into the left-hand side of the differential equation gives
d2 y dy
+2 = 2p1 .
dx2 dx
But there is no value of p1 that satisfies 2p1 = 2x + 2 for all x.
The problem this time is that the complementary function is y = C + De−2x , See Exercise 1.6(c).
so the p0 part of the trial solution is a solution of the associated homogeneous
equation (with C = p0 , D = 0). Hence, on substituting the trial solution
y = p1 x + p0 into the inhomogeneous equation, the p0 part disappears, and
the trial solution effectively reduces to y = p1 x. The result in this case is
that, after substituting the trial solution and its derivatives into the left-
hand side of the equation, there are not enough terms on the left-hand side
to compare with the terms in the right-hand-side function.
Again, the difficulty can be overcome by multiplying the trial solution sug-
gested by Procedure 2.2 by x, to give y = p1 x2 + p0 x. Differentiating this
gives
dy d2 y
= 2p1 x + p0 , = 2p1 .
dx dx2
133
Unit 3 Second-order differential equations
Substituting these into the left-hand side of the differential equation gives
d2 y dy
2
+2 = 2p1 + 2(2p1 x + p0 ) = 4p1 x + (2p1 + 2p0 ).
dx dx
Therefore y = p1 x2 + p0 x is a solution of the differential equation provided
that 4p1 x + (2p1 + 2p0 ) = 2x + 2 for all x. This gives p1 = 12 , p0 = 12 , so
yp = 12 (x2 + x)
is a particular integral for the given differential equation.
To summarize, Procedure 2.2 will fail if all or part of the suggested trial
solution is a solution of the associated homogeneous equation. In such cases,
a particular integral can usually be found by multiplying the trial solution
by x.
However, it may sometimes be necessary to multiply the trial function by x
more than once, as explained in Procedure 2.3 and Exercise 2.12.
Exercise 2.12
Find a particular integral for You found the complementary
function in Exercise 1.6(d).
d2 y dy
−2 + y = ex .
dx2 dx
Example 2.8
Find a particular integral for
d2 y
+ 9y = 2e3x + 18x + 18. (2.7)
dx2
Solution
In Example 2.4 (page 129) you saw that yp = 19 e3x is a particular integral
for
d2 y
+ 9y = 2e3x ,
dx2
134
Section 3 Initial conditions and boundar y conditions
Exercise 2.13
Find particular integrals for each of the following differential equations.
d2 y dy
(a) 2
−2 + y = 4ex − 3e2x See Exercise 2.12.
dx dx
d2 x dx
(b) 2 2 + 3 + 2x = 12 cos 2t + 10
dt dt
End-of-section Exercise
*Exercise 2.14
Find the general solution of each of the following differential equations. You will find some help for
d2 θ parts (a), (d), (e) and (f) in
(a) + 4θ = 2t (b) u (t) + 4u (t) + 5u(t) = 5 Exercises 1.3 and 1.6, and
dt2 Example 1.3.
d2 Y dY d2 y
(c) 3 2 − 2 − Y = e2x + 3 (d) − 4y = e−2x
dx dx dx2
d2 y d2 y dy
(e) 2
+ 4y = sin 2x + 3x (f) 2
−3 + 2y = 2ex − 5e2x
dx dx dx
135
Unit 3 Second-order differential equations
Definitions
(a) Initial conditions associated with a second-order differential equa-
tion with dependent variable y and independent variable x specify
that y and dy/dx take values y0 and z0 , respectively, when x takes
the value x0 . These conditions can be written as
dy
y = y0 and = z0 when x = x0
dx
or as
y(x0 ) = y0 , y (x0 ) = z0 .
The numbers x0 , y0 and z0 are often referred to as initial values.
(b) The combination of a second-order differential equation and initial
conditions is called an initial-value problem.
Let us now see how initial conditions can be used to determine values for
the two arbitrary constants and hence find a particular solution.
Example 3.1
Find the particular solution of the differential equation
d2 y dy
2
−3 + 2y = 0
dx dx
that satisfies the initial conditions y = 0 and dy/dx = 1 when x = 0.
136
Section 3 Initial conditions and boundar y conditions
Solution
From Example 1.3 we know that the general solution is
y = Cex + De2x , (3.1)
where C and D are two arbitrary constants. One of the initial conditions
involves the derivative of the solution, so we need to obtain the derivative
of the general solution:
dy
= Cex + 2De2x . (3.2)
dx
The initial conditions state that y(0) = 0, y (0) = 1. Substituting x = 0,
y = 0 into Equation (3.1) gives
0 = Ce0 + De0 = C + D,
while substituting x = 0, dy/dx = 1 into Equation (3.2) gives
1 = Ce0 + 2De0 = C + 2D.
Solving these equations gives C = −1, D = 1, so the required particular Note that, when checking a
solution is particular solution, you
should check that it satisfies
y = −ex + e2x . the initial or boundary
conditions as well as the
differential equation.
*Exercise 3.1
Find the solutions of the following initial-value problems.
(a) u (t) + 9u(t) = 0, u π2 = 0, u π2 = 1. See Exercise 1.5(b).
d2 y dy
(b) −3 + 2y = 4ex , y(0) = 4, y (0) = 2. See Example 1.3 and
dx2 dx Exercise 2.10(a).
d2 y dy
(c) 2
−2 + y = 4ex − 3e2x , y(0) = 4, y (0) = −1. See Exercises 1.6(d)
dx dx and 2.13(a).
Theorem 3.1
The initial-value problem
d2 y dy
a +b + cy = f (x), y(x0 ) = y0 , y (x0 ) = z0 ,
dx2 dx
where a, b, c are real constants with a = 0, and f (x) is a given contin-
uous real-valued function on an interval (r, s), with x0 ∈ (r, s), has a
unique solution on that interval.
Note that one consequence of this theorem is that if the differential equation
is homogeneous and the initial conditions are of the form y(x0 ) = 0 and
y (x0 ) = 0, then the unique solution must be the zero function y = 0, since
it satisfies the differential equation and the initial conditions.
137
Unit 3 Second-order differential equations
Definitions
(a) Boundary conditions associated with a second-order differential
equation with dependent variable y and independent variable x
specify that y or dy/dx (or some combination of the two) takes
values y0 and y1 at two different values x0 and x1 , respectively, of x.
The numbers x0 , x1 , y0 and y1 are often referred to as boundary
values.
(b) The combination of a second-order differential equation and bound-
ary conditions is called a boundary-value problem.
Example 3.2
Find the particular solution of the differential equation
d2 y
+ 9y = 0
dx2
that satisfies the boundary conditions y = 0 when x = 0 and dy/dx = 1
when x = π3 .
Solution
From Exercise 1.5(b), we know that the general solution is
y = C cos 3x + D sin 3x, (3.3)
where C and D are two arbitrary constants.
One of the boundary conditions involves the derivative of the solution, so
we need to obtain the derivative of the general solution:
dy
= −3C sin 3x + 3D cos 3x. (3.4)
dx
The boundary conditions state that y(0) = 0, y π3 = 1. Substituting x = 0,
y = 0 into Equation (3.3) gives
0 = C cos 0 + D sin 0 = C,
138
Section 3 Initial conditions and boundar y conditions
Exercise 3.2
Find the solution of the following boundary-value problem:
d2 y dy
−3 + 2y = 4ex , y (0) = 2, y(1) = 0. See Exercise 3.1(b).
dx2 dx
Exercise 3.3
Use the differential equation of Exercise 2.9, with R = S = Q = 1, namely
y − y + 12 (l − x)x = 0, (3.5)
to determine the vertical displacement at the centre of a beam of length
2 metres resting on rigid supports at its ends.
Example 3.3
Try to find a solution of the boundary-value problem
d2 y π
+ 4y = 0, y(0) = 0, y 2 = 1.
dx2
Solution
From Exercise 1.6(a), the general solution is
y = C cos 2x + D sin 2x,
where C and D are two arbitrary constants.
π
The boundary conditions state that y(0) = 0, y 2 = 1. Substituting each
of these into the general solution in turn gives
0 = C cos 0 + D sin 0 = C,
1 = C cos π + D sin π = −C.
There is no solution for which C = 0 and C = −1, so there is no solution of
the differential equation that satisfies the given boundary conditions.
139
Unit 3 Second-order differential equations
Example 3.4
Find the solution of the boundary-value problem
d2 y dy
+4 + 5y = 5, y(0) = 1, y(π) = 1.
dx2 dx
Solution
From Exercise 2.14(b), the general solution is
y = e−2x (C cos x + D sin x) + 1,
where C and D are two arbitrary constants.
The boundary conditions state that y(0) = 1, y(π) = 1. Substituting each
of these into the general solution in turn gives
1 = e0 (C cos 0 + D sin 0) + 1 = C + 1,
1 = e−2π (C cos π + D sin π) + 1 = −Ce−2π + 1.
Both of these equations reduce to C = 0, but D can take any value, so any
solution of the form
y = De−2x sin x + 1
satisfies the differential equation and the boundary conditions.
End-of-section Exercises
*Exercise 3.4
For each of the following problems, identify the conditions as either initial You found the general
conditions or boundary conditions, and find the solution of each problem. solution of the differential
equation in Exercise 1.6(a).
(a) u (x) + 4u(x) = 0, u(0) = 1, u (0) = 0.
(b) u (x) + 4u(x) = 0, u(0) = 0, u π2 = 0.
(c) u (x) + 4u(x) = 0, u π2 = 0, u π2 = 0.
(d) u (x) + 4u(x) = 0, u(−π) = 1, u π4 = 2.
(e) u (x) + 4u(x) = 0, u (0) = 0, u π4 = 1.
Exercise 3.5
Find the solution (if any) of each of the following problems.
(a) u (t) + 4u (t) + 5u(t) = 0, u(0) = 0, u (0) = 2. See Exercise 2.14(b).
d2 y dy dy
(b) +2+ 2y = 0, where y = 0 and = 0 when x = 0. See Exercise 1.9(a).
dx2 dx dx
¨ + 9x = 3(1 − πt), x(0) = 13 , ẋ π3 = 0.
(c) x See Exercise 1.5(b).
140
Section 4 The nature of solutions
4.1 Transients
Consider the equation
d2 y dy
a 2 +b + cy = 0 with a > 0, b > 0, c > 0.
dx dx
The nature of the general solution depends on the nature of the roots of the
auxiliary equation aλ2 + bλ + c = 0. More specifically, you saw in Proce-
dure 1.1 that
λ1 , λ2 real and distinct ⇒ solution y(x) = Ceλ1 x + Deλ2 x ,
λ1 , λ2 real and equal ⇒ solution y(x) = (C + Dx)eλ1 x ,
λ1 , λ2 complex (= α ± iβ) ⇒ solution y(x) = eαx (C cos βx + D sin βx),
where in each case C and D are arbitrary real constants.
Since the auxiliary equation has λ1 and λ2 as roots, it may be written as
(λ − λ1 )(λ − λ2 ) = 0,
or
λ2 − (λ1 + λ2 )λ + λ1 λ2 = 0.
When we divide through the original auxiliary equation by a, we obtain
b c
λ2 + λ + = 0.
a a
Comparing the coefficients of λ in these two versions of the same quadratic
equation, we find that
b c
λ1 + λ2 = − and λ1 λ2 = .
a a
Now, using the fact that a, b and c are positive, we can make some interesting
deductions. First, c/a must be positive so, if they are real, λ1 and λ2
have the same sign. Also, since −b/a is negative, the sum λ1 + λ2 must
be negative. There is only one conclusion to draw from this: if λ1 and λ2
are real, then both are negative. If on the other hand λ1 and λ2 form the
complex conjugate pair α ± iβ, then their sum is
λ1 + λ2 = 2α.
Now λ1 + λ2 = −b/a being negative implies that α must be negative.
The upshot is that, in the above list of solutions, all the exponential terms
have a negative index. Thus for large values of x, the magnitude of all the
above solutions will become small. This phenomenon represents damping,
and you will meet it again in Unit 17.
The graphs of typical complementary functions in the three cases are shown
in Figure 4.1.
141
Unit 3 Second-order differential equations
Figure 4.1
142
Section 4 The nature of solutions
For this reason, a particular integral not involving part of the transient
solution is known as the steady-state solution. In most cases, for large x,
because the transient solution then has little effect, the solution settles down However, it is still possible
to a ‘steady state’ given by the contribution from that particular integral. that a particular integral may
Two typical solutions to initial-value problems of this type are sketched at decay at an even faster rate
than the complementary
the top of Figure 4.2. Here you can see the two examples of a particular function!
solution, followed by the contributions made to each by the transient and
the steady-state solution.
y y
2 particular solution 2 particular solution
1 1
0 x 0 x
10 20 30 40 50 10 20 30 40 50
–1 –1
–2 –2
y y
2 transient solution 2 transient solution
1 1
0 x 0 x
10 20 30 40 50 10 20 30 40 50
–1 –1
–2 –2
y y
2 steady-state solution 2 steady-state solution
1 1
0 x 0 x
10 20 30 40 50 10 20 30 40 50
–1 –1
–2 –2
Figure 4.2
Example 4.1
Consider the differential equation of Example 2.5 (page 130),
d2 y dy
+2 + 2y = 10 sin 2x,
dx2 dx
together with the initial conditions y(0) = −1 and y (0) = −2. Find the
particular solution and the steady-state behaviour of the solution.
Solution
The auxiliary equation is
λ2 + 2λ + 2 = 0,
with roots −1 ± i. In Example 2.5 we found a particular integral
y(x) = −2 cos 2x − sin 2x.
143
Unit 3 Second-order differential equations
In the remainder of this section you are invited to investigate these ideas
using the computer algebra package.
144
Outcomes
Outcomes
After studying this unit you should be able to:
• understand and use the terminology relating to linear constant-coefficient
second-order differential equations;
• understand the key role of the principle of superposition in the solution
of linear constant-coefficient second-order differential equations;
• obtain the general solution of a homogeneous linear constant-coefficient
second-order differential equation using the solutions of its auxiliary
equation;
• use the method of undetermined coefficients to find a particular integral
for an inhomogeneous linear constant-coefficient second-order differential
equation with certain simple forms of right-hand-side function;
• obtain the general solution of an inhomogeneous linear constant-
coefficient second-order differential equation by combining its comple-
mentary function and a particular integral;
• use the general solution together with a pair of initial or boundary condi-
tions to obtain, when possible, a particular solution of a linear constant-
coefficient second-order differential equation;
• understand the nature of the solutions of linear constant-coefficient
second-order differential equations with positive coefficients, particularly
those involving transient and steady-state parts;
• use the computer algebra package for the course to solve a second-order
differential equation.
145
Unit 3 Second-order differential equations
146
Solutions to the exercises
1.9 (a) λ2 + 2λ + 2 = 0 has solutions −1 ± i, so the (b) Substituting y = p1 x + p0 and its derivatives into
general solution is y = e−x (C cos x + D sin x). the differential equation gives
(b) λ2 − 16 = 0 has solutions ±4, so the general solu- 0 + 2p1 + (p1 x + p0 ) = p1 x + (2p1 + p0 ) = 2x.
tion is y = Ce4x + De−4x . Hence p1 = 2, p0 = −4, and a particular integral is
2 yp = 2x − 4.
(c) λ − 4λ + 4 = 0 has solutions λ1 = λ2 = 2, so the
general solution is y = (C + Dx)e2x .
(d) λ2 + 3λ = 0 has solutions λ1 = 0 and λ2 = −3, so 2.4 We try y = p2 t2 + p1 t + p0 , which has derivatives
the general solution is θ = C + De−3t . y = 2p2 t + p1 , y = 2p2 . Substituting these into the
differential equation gives
1.10 The auxiliary equation λ2 + 4kλ + 4 = 0 can be 2p2 − (p2 t2 + p1 t + p0 ) = −p2 t2 − p1 t + (2p2 − p0 )
solved
√ using the formula method to give λ = −2k ±
2 k 2 − 1. So there are complex conjugate solutions, = t2 .
leading to a general solution involving sines and cosines, Hence p2 = −1, p1 = 0, p0 = −2, and a particular inte-
when k 2 < 1, i.e. when |k| < 1. gral is
yp = −t2 − 2.
147
Unit 3 Second-order differential equations
(b) From Exercise 1.9(d), θc = C + De−3t . To find a Hence p = −4, and a particular integral is
particular integral, try θ = p cos 3t + q sin 3t. Differen- yp = −4xex .
tiating gives
dθ (b) From Exercise 1.3(b), the associated homogeneous
= −3p sin 3t + 3q cos 3t, equation has general solution y = C + De−3x/2 , and the
dt
trial solution y = p0 suggested by Procedure 2.2 is a so-
d2 θ
= −9p cos 3t − 9q sin 3t. lution of this equation (with C = p0 , D = 0). So we try
dt2 y = p0 x. Differentiating twice gives
Substituting into the differential equation gives
dy d2 y
(−9p cos 3t − 9q sin 3t) + 3(−3p sin 3t + 3q cos 3t) = p0 , = 0.
dx dx2
= (9q − 9p) cos 3t − (9q + 9p) sin 3t Substituting into the left-hand side of the differential
= 9 cos 3t. equation gives 3p0 = 1, so p0 = 13 , and a particular in-
This gives a pair of simultaneous equations to solve: tegral is
−9p + 9q = 9, yp = 13 x.
−9p − 9q = 0.
Hence p = − 21 , q = 12 , and a particular integral is 2.11 The associated homogeneous equation is
θp = − 12 cos 3t + 12 sin 3t. Therefore the general solu- mλ2 + rλ = 0,
tion is with solutions λ = 0 and λ = −r/m. The complemen-
θ = C + De−3t − 1
2 cos 3t + 1
2 sin 3t. tary function is therefore
xc = C + De−rt/m .
2.9 Putting the equation into standard form and using The inhomogeneous term is mg, so Procedure 2.2 sug-
R = S = Q = 1 gives gests a trial solution x = p0 . However, this is a solution
y − y = − 12 (l − x)x = − 12 lx + 21 x2 . of the associated homogeneous equation (with C = p0 ,
D = 0). Hence we try x = p0 t instead. Differentiating
The associated homogeneous equation is y − y = 0, and substituting gives
which has auxiliary equation λ2 − 1 = 0. This has roots
λ = ±1, so the complementary function is rp0 = mg,
yc = Cex + De−x . so
mg
To obtain a particular integral, we try a function of p0 = .
r
the form y = p2 x2 + p1 x + p0 . Its derivatives are y = Hence a particular integral is
2p2 x + p1 , y = 2p2 . Substituting into the differential mgt
equation gives xp = ,
r
2p2 − (p2 x2 + p1 x + p0 ) = −p2 x2 − p1 x + (2p2 − p0 ) and the general solution is
= 12 x2 − 12 lx. mgt
x = C + De−rt/m + .
r
Hence p2 = − 12 , p1 = 12 l, p0 = −1, and a particular in-
tegral is
2.12 From Exercise 1.6(d), the associated homoge-
yp = − 12 x2 + 21 lx − 1.
neous equation has general solution y = (C + Dx)ex ,
Therefore the general solution is so not only is the trial solution y = pex suggested by
y = Cex + De−x − 12 x2 + 21 lx − 1. Procedure 2.2 a solution of the associated homogeneous
differential equation (with C = p, D = 0), but so is
y = pxex (with C = 0, D = p). So we try y = px2 ex .
2.10 (a) From Example 1.3, the associated homoge- Differentiating twice gives
neous equation has general solution y = Cex + De2x ,
dy
and the trial solution y = pex suggested by Proce- = 2pxex + px2 ex = p(2x + x2 )ex ,
dure 2.2 is a solution of this equation (with C = p, dx
D = 0). So we try y = pxex instead. Differentiating d2 y
= p(2 + 2x)ex + p(2x + x2 )ex
twice gives dx2
= p(2 + 4x + x2 )ex .
dy
= pex + pxex = p(1 + x)ex , Substituting into the differential equation gives
dx
d2 y p(2 + 4x + x2 )ex − 2p(2x + x2 )ex + px2 ex = 2pex
= pex + p(1 + x)ex = p(2 + x)ex .
dx2 = ex .
Substituting into the left-hand side of the differential Hence p = 12 , and a particular integral is
equation gives
yp = 12 x2 ex .
p(2 + x)ex − 3p(1 + x)ex + 2pxex = −pex = 4ex .
148
Solutions to the exercises
2.13 (a) From Exercise 2.12, yp = 12 x2 ex is a partic- Hence p1 = 12 , p0 = 0, and a particular integral is
ular integral for θp = 12 t.
d2 y dy
−2 + y = ex . Therefore the general solution is
dx2 dx
So, using the principle of superposition, we can find a θ = C cos 2t + D sin 2t + 12 t.
particular integral for the given differential equation if (b) The auxiliary equation is λ2 + 4λ + 5 = 0, which
we can find one for has solutions λ = −2 ± i. So the complementary func-
d2 y dy tion is
−2 + y = −3e2x .
dx dx uc = e−2t (C cos t + D sin t).
We try y = pe2x , which has derivatives
To find a particular integral, try u = p0 . Substituting
dy d2 y
= 2pe2x , = 4pe2x . gives 5p0 = 5. Hence p0 = 1, and a particular integral
dx dx2 is
Substituting into the differential equation gives
up = 1.
4pe2x − 4pe2x + pe2x = pe2x = −3e2x .
Therefore the general solution is
Hence p = −3, and yp = −3e2x is a particular integral
u = e−2t (C cos t + D sin t) + 1.
for the differential equation with right-hand side −3e2x .
Thus, using the principle of superposition, a particular (c) The auxiliary equation is 3λ2 − 2λ − 1 = 0, which
integral for the given differential equation is has solutions λ1 = 1 and λ2 = − 13 . So the complemen-
tary function is
yp = 4( 12 x2 ex ) − 3e2x = 2x2 ex − 3e2x .
Yc = Cex + De−x/3 .
(b) This time we do not have a particular integral for
Consider first the e2x term on the right-hand side
any part of the right-hand-side function, so we need to
of the equation. To find a particular integral, try
start from scratch.
Y = pe2x . The derivatives are dY /dx = 2pe2x and
First consider the 12 cos 2t term on the right-hand side, d2 Y /dx2 = 4pe2x . Substituting gives
and try x = p cos 2t + q sin 2t as a trial solution. This
has derivatives 3(4pe2x ) − 2(2pe2x ) − pe2x = 7pe2x = e2x .
dx Hence p = 17 , and a particular integral is
= −2p sin 2t + 2q cos 2t,
dt Yp = 17 e2x .
2
d x Now consider the 3 term on the right-hand side of the
= −4p cos 2t − 4q sin 2t.
dt2 equation, and try Y = p0 . Substituting gives −p0 = 3,
Substituting into the differential equation gives so p0 = −3, and a particular integral is
2(−4p cos 2t − 4q sin 2t) + 3(−2p sin 2t + 2q cos 2t) Yp = −3.
+ 2(p cos 2t + q sin 2t)
Therefore, using the principle of superposition, a
= 6(q − p) cos 2t − 6(p + q) sin 2t particular integral for the differential equation with
= 12 cos 2t. f (x) = e2x + 3 is
So p + q = 0, q − p = 2, hence p = −1, q = 1, and a Yp = 17 e2x − 3.
particular integral is Therefore the general solution is
xp = − cos 2t + sin 2t. Y = Cex + De−x/3 + 17 e2x − 3.
Now consider the 10 term, and try x = p0 . Substituting
(d) From Exercise 1.3(c), the complementary function
into the differential equation gives 2p0 = 10, so p0 = 5,
is
and a particular integral is
yc = Ce−2x + De2x .
xp = 5.
To find a particular integral, since e−2x is a so-
Therefore, using the principle of superposition, a
lution of the associated homogeneous equation, try
particular integral for the differential equation with
y = pxe−2x . The derivatives are dy/dx = p(1 − 2x)e−2x
f (t) = 12 cos 2t + 10 is
and d2 y/dx2 = 4p(x − 1)e−2x . Substituting gives
xp = − cos 2t + sin 2t + 5.
4p(x − 1)e−2x − 4pxe−2x = −4pe−2x = e−2x .
Hence p = − 14 , and a particular integral is
2.14 (a) From Exercise 1.6(a), the complementary
yp = − 41 xe−2x .
function is
Therefore the general solution is
θc = C cos 2t + D sin 2t.
y = Ce−2x + De2x − 14 xe−2x .
To find a particular integral, try θ = p1 t + p0 . Substi-
tuting this and its derivatives into the differential equa-
tion gives
4(p1 t + p0 ) = 2t.
149
Unit 3 Second-order differential equations
150
Solutions to the exercises
Substitution of these into the differential equation gives 3.5 Parts (a) and (b) are initial-value problems, and
C + D − 1 = 0 and Ce2 + De−2 − 1 = 0. Multiply- therefore by Theorem 3.1 each has a unique solution.
ing the second equation by e2 gives C + D = 1 and However, part (c) is a boundary-value problem, which
Ce4 + D = e2 as the equations to solve. Subtracting may have no solution, a unique solution, or an infinite
the equations gives C(e4 − 1) = e2 − 1. This gives number of solutions.
e2 − 1 e2 − 1 1 (a) From Exercise 2.14(b), the general solution is
C= 4 = 2 2
= 2 ,
e −1 (e + 1)(e − 1) e +1 u = e−2t (C cos t + D sin t).
1 e2 + 1 − 1 e2 Its derivative is
D =1−C =1− 2 = = .
e +1 e2 + 1 e2 + 1
u = e−2t ((−2C + D) cos t − (C + 2D) sin t).
Hence the required particular solution is
1 The condition u(0) = 0 gives C = 0. The condition
y= 2 (ex + e2−x ) − 12 x2 + x − 1. u (0) = 2 gives D = 2. The solution is therefore
e +1
At the centre of the beam, x = 1, so y 0.148. The u = 2e−2t sin t.
displacement or ‘sag’ at the centre of the beam is ap- (b) The differential equation is homogeneous and the
proximately 0.148 m or about 14.8 cm. initial conditions are both equal to zero. Hence the so-
lution is the zero function y = 0.
3.4 Problems (a) and (c) are initial-value problems; (c) From Exercise 1.5(b), the complementary function
problems (b), (d) and (e) are boundary-value problems. is
The differential equation is the same in each case, and xc = C cos 3t + D sin 3t.
from Exercise 1.6(a) its general solution is To find a particular integral, try x = p1 t + p0 . Substi-
u = C cos 2x + D sin 2x. tuting into the differential equation gives
The derivative is 9(p1 t + p0 ) = 3(1 − πt).
u = −2C sin 2x + 2D cos 2x. Hence p1 = − π3 , p0 = 13 , and a particular integral is
xp = − π3 t + 31 .
(a) The condition u(0) = 1 gives C = 1. The condi-
tion u (0) = 0 gives D = 0. The required solution is Therefore the general solution is
therefore x = C cos 3t + D sin 3t − π3 t + 31 ,
u = cos 2x. and its derivative is
(b) The ẋ = −3C sin 3t + 3D cos 3t − π3 .
condition u(0) = 0 gives C = 0. The condi-
tion u π2 = 0 gives C = 0 also. D therefore remains The 1
arbitrary, so there is an infinite number of solutions, of π condition x(0) = π3 gives C = 0. The condition
ẋ 3 = 0 gives D = − 9 . The solution is therefore
the form
x = − π9 sin 3t − π3 t + 31 .
u = D sin 2x.
(c) The condition
u π2 = 0 gives C = 0. The condi-
tion u π2 = 0 gives D = 0. The required solution is
therefore the zero function
u = 0.
(Alternatively, since the differential equation is homoge-
neous and the initial conditions are both equal to zero,
by the remarks after Theorem 3.1 the solution is the
zero function u = 0.)
(d) The condition
u(−π) = 1 gives C = 1. The con-
dition u π4 = 2 gives D = 2. The required solution is
therefore
u = cos 2x + 2 sin 2x.
(e) The condition u (0) = 01 gives D = 0. The condi-
π
tion u 4 = 1 gives C = − 2 . The required solution is
therefore
u = − 12 cos 2x.
151
UNIT 4 Vector algebra
1
153
Unit 4 Vector algebra
Introduction
We often need to represent physical quantities such as mass, force, velocity,
acceleration, time, etc., mathematically. Most of the physical quantities
that we need can be classified into two types: scalars and vectors. Scalar
quantities are quantities, like mass, temperature, energy, volume and time,
that can be represented by a single real number. Other quantities, like
force, velocity and acceleration, possess magnitude and direction in space,
and cannot be represented by a single real number; they are called vector
quantities.
The definitive vector quantity is displacement. The displacement of a point
specifies the position of the point in space relative to some reference point.
We use the concept of displacement whenever we want to describe spatial
relationships. Consider, for example, the instructions written in blood on a
pirate’s treasure map:
Take five paces due north from the big oak tree, then seven paces due
west, and then dig down for three metres.
This is a specification of a displacement vector — the displacement of the
treasure from a reference point (the big oak tree). In fact, this particular
way of specifying the displacement of the treasure is known as the Cartesian
description of a displacement, although the pirate probably didn’t know
that. Alternatively, there is the so-called polar description of the same
displacement (equating paces with metres):
Starting at the big oak tree, dig for 9.1 paces along a straight sloping
line inclined at 19◦ below the horizontal at a bearing of 54◦ west of north.
This ‘distance (or magnitude) plus direction’ specification will also get you
to the treasure, although less conveniently because it is more difficult to
dig along a sloping line. These two different specifications are shown in
Figure 0.1.
N
5 paces
7 paces
54°
19°
3 metres
9.1 paces
Figure 0.1
154
Section 1 Describing and representing vectors
155
Unit 4 Vector algebra
The modulus or magnitude of the vector v is denoted by |v| or, sometimes, We read |v| as ‘the modulus
where there is no possibility of ambiguity, by v; |v| is a non-negative scalar. of v’ or ‘the magnitude of v’,
or simply ‘mod v’.
of the vector. In Figure 1.1, which uses the origin of the Cartesian coor- O 1 2 x
dinate system as the fixed origin, the shorter arrow represents a vector of
magnitude 1 in the positive
√ x-direction, and the longer arrow represents Figure 1.1
a vector of magnitude 2 2 in a direction at π4 radians (45◦) to the posi-
tive x-direction. (Note that we use the convention that positive angles are y
measured anticlockwise.) If we decide to denote these vectors by letters a 2 b
and b, respectively, then we can also put this information on the diagram,
by writing a and b near the arrowheads, as shown in Figure 1.2. 1
Note that in this course, and commonly elsewhere, the arrows representing a
O 1 2 x
vectors are drawn using thick lines. This helps to distinguish vector arrows
from other arrowed lines such as those representing the coordinate axes
(e.g. Figures 1.1 and 1.2) or those representing compass directions (e.g. Figure 1.2
Figure 1.3).
*Exercise 1.1
Represent the following two vectors on a diagram by arrows:
• vector a has magnitude 3 units and points in the positive y-direction;
• vector b has magnitude 4 units and points in the direction at π3 radians
(60◦) to the positive x-direction.
Vector notation and the use of arrows in diagrams is now illustrated further
by specific reference to displacement vectors and velocity vectors.
Displacement is the position of a point in space relative to some reference
point or origin. For example, the city of Leeds is 296 km from the city of
Bristol in the direction of 15◦ east of north (N 15◦ E). The displacement of
Leeds from Bristol can be specified as the vector It would be wrong to write
s = 296 km, because the
s = 296 km N 15◦ E. left-hand side is a vector
symbol and the right-hand
Here the bold symbol s has been used to denote the displacement. Note that
side is a scalar.
both magnitude and direction are specified: the magnitude of the displace-
ment is |s| = 296 km, and the direction is specified by the compass bearing
N 15◦ E. N s
→
Displacement vector PQ
Q
−−→
The displacement vector P Q is the vector whose magnitude is the dis-
tance from P to Q and whose direction is the direction of the straight P
line from P to Q.
Figure 1.4
One query may have occurred to you. What is the displacement vector of a
−−→
point from itself? In other words, what is the vector P P ? Clearly its length
is zero, but what is its direction? The answer is that it does not have one!
We define the zero vector to be the unique vector with magnitude zero
−−→
and no direction. It is denoted by 0. Thus we can conclude that P P = 0.
Zero vector
The zero vector is the unique vector with magnitude zero and no Be particularly careful to
direction. It is denoted by 0. underline the zero vector
(0 or 0 ) in your written work!
∼
*Exercise 1.2
The displacement of Birmingham from Derby is 57 km in the direction
S 30◦ W. The displacement of Leicester from Derby is 32 km in the direction
S 45◦ E.
Draw a diagram, to a suitable scale, representing these two displacements
by arrows.
Exercise 1.3
A car travelling from London along the M1 with speed 70 mph heads in the
direction N 60◦ W near Junction 14.
Represent the velocity of the car by an arrow, drawn to a suitable scale.
157
Unit 4 Vector algebra
Definition
Two vectors are said to be equal if they have the same magnitude and
the same direction.
y
You have seen how to represent a vector by an arrow. This definition of
equality of vectors tells us that the two features needed to define a vector 3
uniquely are its magnitude and direction. This means that any two arrows b
2
drawn at different places on the page but which are equal in length, parallel
and have the same sense, can be used to represent the same vector. For 1
instance, the two arrows in Figure 1.6 are each of length 2 units and point d
in the positive x-direction. They represent two equal vectors, and we write O 1 2 3 x
b = d. In other words, the arrow representing a vector does not have to be
drawn so that its tail is at any particular point. Figure 1.6
Example 1.1
Figure 1.7 shows several vectors represented by arrows drawn to scale. Find
the vector equal to the vector a.
y
c
3 b
2 e
f
d
1
–4 –3 –2 –1 O 1 2 3 4 5 6 x
–1
a
g
–2
–3
h
Figure 1.7
Solution
We are looking for a vector that is equal in length to a (i.e. one unit), parallel
to a and points in the same direction (i.e. the positive x-direction). There
are two arrows (and thus vectors) other than a that point in the positive
x-direction; they are c and h. (The arrow representing d points in the
negative x-direction.) The magnitudes of c and h are 1 unit and 3 units,
respectively. Since the magnitude of a is 1 unit, c = a but h = a.
Note that although a and c are drawn at different places in the (x, y)-plane,
they are equal in magnitude and have the same direction, so they are equal
vectors.
*Exercise 1.4
Which vector in Figure 1.7 is equal to vector b?
158
Section 1 Describing and representing vectors
y
r
y = r sin φ P
r = |r|
φ
O x = r cos φ x
Figure 1.8
We have not quite finished the description, because a vector now has many
representations (since rotating the line segment OP through 2nπ, where n is In fact, a vector has an
any integer, leaves it unchanged). To avoid this ambiguity, we shall normally infinite number of
take φ to lie in the range −π < φ ≤ π. (Note that under this convention a representations!
vector below the x-axis has a negative value for φ — see Figure 1.9.)
Thus the endpoint P of a vector r is specified by the two numbers r (a dis-
tance) and φ (an angle). These two numbers r and φ are the plane polar y positive φ
coordinates (or simply polar coordinates) of the endpoint P of the vec-
tor r, when the tail of its arrow is at O. We use the notation r, φ in order
to distinguish polar coordinates from the Cartesian variety, so the vector is φ
now specified as x
r = r, φ. y
You can see from Figure 1.8 that the polar coordinates r, φ of P are related x
to the Cartesian coordinates (x, y) of P by the following formulae: φ
x = r cos φ, y = r sin φ;
2 2 1/2 negative φ
r = (x + y ) , tan φ = y/x.
However, the statement tan φ = y/x does not define φ uniquely since, for Figure 1.9
example, tan φ = tan(π + φ). To pin down the value of φ in the range
−π < φ ≤ π, we can use the two equations
y
sin φ = y/r and cos φ = x/r. quadrant 2 quadrant 1
S A
In practice, when finding the angle φ from the values of x and y, it usually sin > 0 sin > 0
helps to sketch the Cartesian coordinates in the (x, y)-plane so that you can cos < 0 cos > 0
see in which quadrant φ must lie. The signs of sin and cos for angles in sin < 0 sin < 0 x
the four quadrants are shown in Figure 1.10: you will find it useful to know cos < 0 cos > 0
these. (A simple acronym to aid the memory is ‘CAST’: starting from the T C
lower right, and working anticlockwise round the quadrants, the following quadrant 3 quadrant 4
are positive: Cos, All (of sin, cos, tan), Sin and Tan.)
Figure 1.10
159
Unit 4 Vector algebra
Example 1.2
Give the polar representation of the vectors a, b, e and g in Figure 1.11.
e 2
b
g
–1 O 1 x
–1 a
Figure 1.11
Solution
In Figure 1.11 each vector is drawn as an arrow from the origin, so the
polar representation of each vector is given by the polar coordinates of its
endpoint. In some cases we can specify r and φ simply from inspection of
Figure 1.11, but we shall use the above formulae in order to illustrate the
general method.
The endpoint of a has Cartesian coordinates (1, −1), so we have
√
r = (12 + (−1)2 )1/2 = 2,
√ √
sin φ = y/r = −1/ 2 and cos φ = x/r = 1/ 2.
√
Thus φ = − π4 radians, i.e. a = 2, − π4 . (Since − π2 < − π4 < 0, the angle
coordinate − π4 indicates that a should lie in the fourth quadrant, which is
confirmed by Figure 1.11.)
Vector b is of length 2 units and points in the positive y-direction. The
Cartesian coordinates of its endpoint are (0, 2), so we have
r = (02 + 22 )1/2 = 2,
sin φ = y/r = 2/2 = 1 and cos φ = x/r = 0/2 = 0.
π
Hence φ = radians (which is obvious from the fact that b points in the
2
positive y-direction). Hence b = 2, π2 .
The endpoint of vector e has Cartesian coordinates (−1, 2), so
√ √ √
r = ((−1)2 + 22 )1/2 = 5, sin φ = 2/ 5 and cos φ = −1/ 5,
√
giving φ = 2.034 radians. Hence e = 5, 2.034. (Since π2 < 2.034 < π, the
angle coordinate 2.034 indicates that e should lie in the second quadrant,
which is confirmed by Figure 1.11.)
Finally, g is of unit length (i.e. of length 1 unit) and points in the positive
x-direction. The Cartesian coordinates of its endpoint are (1, 0), so
r = (12 + 02 )1/2 = 1, sin φ = 0 and cos φ = 1.
Thus g = 1, 0. (This is an exceptional case where the numerical values of
the coordinates are the same in the two coordinate systems.)
*Exercise 1.5
Complete Table 1.1. Each row should show the Cartesian and corresponding
polar coordinates of a particular point. If any entry is invalid, say so and
explain why.
160
Section 2 Scaling and adding vectors
Table 1.1
Cartesian Polar
coordinates (x, y) coordinates r, φ
(0, −1) 1, − π2
(1, 1)
4, − π4
6, π
(−1, −1)
−1, π
108 , exp(0.1π)
*Exercise 1.6
As you saw earlier, the displacement of Leeds from Bristol can be expressed
as s = 296 km N 15◦ E (see Figure 1.3 on page 156). Express this vector in
polar form r, φ using a suitable coordinate system.
End-of-section Exercises
Exercise 1.7
The following is a list of some physical quantities: temperature, velocity,
volume, energy, force, displacement, time, acceleration. Decide which are
scalar and which are vector quantities.
Exercise 1.8
What are the polar coordinates of a point Q whose Cartesian coordinates
−−→
are (0, −3)? What is the magnitude of the vector OQ where O is the origin
of coordinates?
161
Unit 4 Vector algebra
Figure 2.1 v O
Definition
For any vector v and any real number m, the scalar multiple mv is
the vector with magnitude |m||v| which is:
• in the same direction as v if m > 0;
• in the opposite direction to v if m < 0;
• the zero vector (i.e. with unspecified direction) if m = 0.
The multiplication of v by m is called scaling or scalar multiplica-
tion.
Example 2.1
(a) Let u represent the velocity of my car travelling with a speed of 30 mph
along a straight road due north. Write down, in terms of u, the velocity
A B
of a car overtaking me and travelling at 45 mph. If another car is trav-
elling in the opposite direction to me with speed 60 mph, write down
this car’s velocity in terms of u. F C
−−
→
(b) If ABCDEF is a regular hexagon (Figure 2.3) and, for example, AB
represents the displacement vector from A to B, write down algebraic
E D
relations connecting:
−−→ −−→ −→ −−→
(i) AB and ED; (ii) AF and DC. Figure 2.3
Solution
(a) The velocity vector u has magnitude 30 mph and points due north. The
car overtaking me is travelling in the same direction but has a velocity
of magnitude 45 mph; suppose that its velocity vector is denoted by v
(see Figure 2.4). Then v is parallel to u and has the same sense as u, Two vectors are parallel if
and |v| = 45
30 |u|. Therefore v is just a scaling of u, i.e.
they have either the same, or
opposite, directions.
v = 1.5u.
162
Section 2 Scaling and adding vectors
Now suppose that the velocity of the car travelling in the opposite di-
rection is denoted by w. Then w is parallel to u but has the opposite N
30 |u|. So we can write
sense to u, and |w| = 60
50
w = −2u, v
40
where the negative sign indicates the opposite sense.
30 u
(b) The opposite sides of a regular hexagon are parallel, and all the sides
have the same length. 20
−−
→ −−→
(i) Thus the displacement vectors AB and ED have equal magnitudes 10
and the same direction. So we have
O
−−
→ −−→
AB = ED.
−→ −−→ Figure 2.4
(ii) The displacement vectors AF and DC have equal magnitudes but
opposite directions, thus
−→ −−→ −−→ −→
AF = −DC (or, equivalently, DC = −AF ).
*Exercise 2.1
(a) If d is the displacement vector from Bristol to Leeds, write down in
terms of d the displacement vectors from Leeds to Bristol and from
Leeds to Leeds.
(b) If v represents the velocity of a wind of 35 knots from the north-east,
what vectors represent the following?
(i) A wind of 70 knots from the north-east.
(ii) A wind of 35 knots from the south-west.
(iii) Still air.
(c) Relate the direction and magnitude of −1.5v to those of v, where v is
any given non-zero vector. Do the same for −kv, where k is an arbitrary
positive number. B C
−−→
(d) If ABCD is a parallelogram (Figure 2.5) and, for example, AB repre-
sents the displacement vector from A to B, write down algebraic rela-
A D
tions connecting:
−−
→ −−→ −−→ −−→
(i) AB and DC; (ii) BC and DA. Figure 2.5
(e) If v is any non-zero vector, what are the magnitude and direction of the
1
vector v?
|v|
Unit vector s
1
The vector v in Exercise 2.1(e) is a vector that has magnitude 1 and
|v|
points in the direction of v. It is called the unit vector in the direction of v.
The unit vector in the direction of v is often denoted by the symbol v .
Definition
For any non-zero vector v, the unit vector in the direction of v is the
vector
1
=
v v.
|v|
163
Unit 4 Vector algebra
Unit vectors are often used to denote directions in the plane, or in space.
A particular example is provided by the unit vectors in the positive directions
of the x- and y-axes in the plane Cartesian coordinate system. These unit We shall develop the
vectors are denoted by i and j, respectively, and are called Cartesian unit Cartesian representation of
vectors. vectors in Section 3.
Figure 2.6 shows these Cartesian unit vectors and two other vectors, a and b. y
The vector a has magnitude 2 and points in the positive x-direction; b has
magnitude 3.5 and points in the positive y-direction. The unit vector i has 3.5 b
magnitude 1 and points in the same direction as a. Thus we can write a in 3
terms of i by a scaling:
2
a = 2i. a
1 j
Similarly, we can write b in terms of j:
i
b = 3.5j. O 1 2 3 4 x
–4 –3 –2 –1 1 2 3 4 x
–1 j
d –2
i
–3
Figure 2.7
Exercise 2.3
Let the unit vectors i and j denote the directions of east and north, respec-
tively. Specify the following vectors as scalings of i and j.
(a) The wind velocity of 35 km per hour due south.
(b) The displacement of Bristol from London (112 miles due west).
(c) The displacement of London from Bristol.
164
Section 2 Scaling and adding vectors
Note that the sum of two displacement vectors can also be written using the
notation
−−→ −−→ −−→
OP + P Q = OQ.
Now recall that when discussing displacements we mentioned the zero vector
0 (representing no displacement). Once addition of vectors is introduced, we y
need the zero vector in order to answer questions such as ‘what is i + (−1)i?’. c
Geometrically, no construction is needed when adding the zero vector, which
obeys the rather obvious rule
a + 0 = a.
π
a
Exercise 2.4 O
3
π
4
x
Three vectors a, b and c of magnitudes 3, 2 and 4 are shown in Figure 2.10.
(a) Draw a rough sketch to show the vectors a + b and a + c. b
(b) Sketch the vector −b, and draw a rough sketch to show the addition of
a and −b. Figure 2.10
*Exercise 2.5
A vector a has magnitude 3 units and points in the positive x-direction. A
vector b has magnitude 4 units and points in the positive y-direction. Draw
a diagram showing the vectors a + b and a − b.
165
Unit 4 Vector algebra
Vector addition is commutative, i.e. the order in which we add two vectors
does not matter. This can be illustrated by reference to vectors a and c of y
Exercise 2.4 (see Figure 2.11). The triangle OP1 Q illustrates the addition a Q
−−→ P2
a + c, while triangle OP2 Q illustrates c + a. The same resultant OQ is c a c
obtained in both cases. Thus c ++ c
a
a + c = c + a.
Exercise 2.6
For the particular cases of the vectors a, b, c defined in Exercise 2.4, and O a P1 x
for the scalar m = 2, draw sketches to illustrate the associative property of
vector addition, Figure 2.11
(a + b) + c = a + (b + c),
and the distributive property of scaling over vector addition,
m(a + b) = ma + mb.
Example 2.2
Simplify the expression
2(a + b) + 3(b + c) − 5(a + b − c).
166
Section 2 Scaling and adding vectors
Solution
Strict use of the rules requires us to write the expression solely in terms of
addition. So we have
2(a + b) + 3(b + c) − 5(a + b − c)
= 2(a + b) + 3(b + c) + (−5)(a + b + (−1)c) (using Rule 8)
= 2(a + b) + 3(b + c) + (−5)((a + b) + (−1)c) (using Rule 2)
= 2a + 2b + 3b + 3c + (−5)(a + b) + (−5)((−1)c) (using Rule 6 three times)
= 2a + 2b + 3b + 3c + (−5)(a + b) + 5c (using Rule 4)
= 2a + 2b + 3b + 3c + (−5)a + (−5)b + 5c (using Rule 6)
= 2a + (−5)a + 2b + 3b + (−5)b + 3c + 5c (using Rule 1 several times)
= (2 − 5)a + (2 + 3 − 5)b + (3 + 5)c (using Rule 5 four times)
= (−3)a + 0 + 8c (using Rule 7)
= (−3)a + 8c (using Rule 7)
= 8c + (−3)a (using Rule 1)
= 8c + (−1)(3a) (using Rule 4)
= 8c − 3a (using Rule 8).
However, because Rules 1, 2, 4, 5, 6, 7 and 8 are exactly the same as
the familiar rules for manipulating algebraic expressions involving scalar
quantities, we would usually write the solution more succinctly as
2(a + b) + 3(b + c) − 5(a + b − c)
= 2a + 2b + 3b + 3c − 5a − 5b + 5c
= (2 − 5)a + (2 + 3 − 5)b + (3 + 5)c
= 8c − 3a.
Exercise 2.7
Simplify the expression 4(a − c) + 3(c − b) + 2(2a − b − 3c).
*Exercise 2.8
Find the vector x in terms of a and b in the following vector equations.
(a) 2b + 4x = 7a (b) n(b − a) + x = m(a − b)
Vectors
• A vector has magnitude and direction.
• Any two vectors can be added by the triangle rule.
• A vector can be scaled by a real number in such a way that the
above rules apply.
167
Unit 4 Vector algebra
End-of-section Exercises y
b
Exercise 2.9
The vectors a and b are represented by the arrows shown in Figure 2.12.
The magnitudes of a and b are 4 and 6, respectively. Draw a sketch to show
the vectors a + b, a − b and 2a + 12 b.
π
a
Exercise 2.10 4
O x
If v = −4.7u, what can you say about the magnitude and direction of v in
terms of the magnitude and direction of the non-zero vector u? Figure 2.12
Exercise 2.11
−−
→
If ABCD is a quadrilateral, with AB denoting the displacement vector from
−−→ −−→ −−→
A to B, and BC, CD, DA defined similarly, show that
−−
→ −−→ −−→ −−→
AB + BC + CD + DA = 0.
*Exercise 2.12
Two vectors p and q are defined in polar form: p = 3, π2 , q = 4, π. Sketch
p, q and p + q, and give the polar forms of 5p, −q and p + q.
Exercise 2.13
(a) Which of the following proposed general rules is true for the scalar
multiplication of a vector in polar form? (Assume m > 0.)
?
mr, φ = mr, φ,
?
mr, φ = mr, mφ.
(b) Does the following proposed general rule hold for the addition of vectors
in polar form?
?
r1 , φ1 + r2 , φ2 = r1 + r2 , φ1 + φ2
−−→ −−→
Furthermore, the vectors OB and OC can be written as scalings of the
Cartesian unit vectors i and j:
−−→ −−→
OB = a1 i and OC = a2 j.
Hence the triangle rule (or parallelogram rule) for the addition of vectors
−−→ −−→
allows the vector a to be expressed as the sum of OB and OC, i.e. as
−−→ −−→
a = OB + OC or a = a1 i + a2 j.
The latter is called the component form of a, and the numbers a1 and a2 You may also see these
are called the i- and j-components of a, respectively. numbers referred to as the
x- and y-components of a.
When the tail of the vector a is not at the origin, its components are defined
in an obvious way.
y
q2 C Q
a
a2
p2 B
a1
P
O p1 q1 x
Figure 3.2
Definition
−−→
A vector a = P Q in the (x, y)-plane, where P is the point (p1 , p2 ) and
Q is the point (q1 , q2 ), has component form
a = a1 i + a2 j,
where a1 = q1 − p1 and a2 = q2 − p2 , and i and j are the Cartesian unit
vectors.
The component form may also be written as
a1
a= or a = [a1 a2 ]T .
a2
The numbers a1 and a2 are the (Cartesian) components of a.
169
Unit 4 Vector algebra
*Exercise 3.1
Write each of the vectors in Figure 1.7 (page 158) in the form a = a1 i + a2 j
and as a column vector.
Vectors in component form can also be added and scaled very easily, by
making use of the algebraic rules for the scaling and adding of vectors. For
example,
a + b = (a1 i + a2 j) + (b1 i + b2 j)
= a1 i + a2 j + b 1 i + b 2 j
= (a1 i + b1 i) + (a2 j + b2 j)
= (a1 + b1 )i + (a2 + b2 )j.
So, to add two vectors one adds their respective components. Similarly,
ma = m(a1 i + a2 j)
= (ma1 )i + (ma2 )j,
so scaling a vector is achieved by scaling its components.
Exercise 3.2
Figure 3.3 shows four vectors in the (x, y)-plane.
170
Section 3 Cartesian components of a vector
y
3 b
a
1
c
–4 –3 –2 –1 1 2 3 4 5 6 x
–1
d
–2
Figure 3.3
*Exercise 3.3
p+q −3
(a) Find the numbers p and q if r = ,s= and r = s.
p−q 7
1 1
(b) Find the magnitude of the vector t if t = u + v, where u = √
2 1
1 1
and v = √ .
2 −1
Exercise 3.4 y
The three vectors a, b and c in Figure 3.4 are specified in polar coordinates b
by a
a = 2, π3 , b = 3, 34π , c = 1, π6 .
c
(a) What are the magnitudes of the three vectors? x
O
(b) Write down the vectors a, b and c in terms of i and j.
(c) Obtain the vector a + c in terms of i and j. Figure 3.4
y
In Exercise 3.4 the vectors were given in polar coordinate form, which is a
just a systematic way of specifying magnitude and direction. The process
of finding the Cartesian components of a vector given its magnitude and
direction is known as resolving a vector into its components. This is j
essentially what you did in Exercise 3.4(b). Thus given the magnitude |a| φ
of the vector a in Figure 3.5, and its direction φ, we can resolve it into its x
i
components:
a1 = |a| cos φ and a2 = |a| sin φ. Figure 3.5
Conversely, given the components a1 and a2 of a vector, we can specify its You had practice at doing
magnitude and direction: these calculations in
Subsection 1.5.
|a| = (a21 + a22 )1/2 , cos φ = a1 /|a|, sin φ = a2 /|a|.
171
Unit 4 Vector algebra
You will see this idea again in Section 4. For now, note that if you wish to
add two vectors in polar form it will be necessary first to resolve them into y
their Cartesian components (since there is no convenient formula for vector
addition in polar coordinates). v
Exercise 3.5 5
(a) Resolve the vector v of magnitude 5 shown in Figure 3.6 into its Carte- 5π
sian components. 18
Thus far we have discussed vectors in the plane, reaching the component
representation of such vectors in the previous subsection. However, the
world is three-dimensional, and few real problems are restricted to a plane
surface. For example, starting at point A at one corner of the cube shown in B
Figure 3.7, you can reach the opposite corner S by three successive displace-
−→ −−→ −→
ments: AQ + QB + BS. In order to work with such addition of displace- A Q
ments in three dimensions, it is necessary to introduce a three-dimensional
coordinate system. Figure 3.7
x-axis x
172
Section 3 Cartesian components of a vector
p1 = SP = OA,
x
p2 = RP = OB,
p3 = QP = OC. y
For example, the point (2, 3, 4) is shown in Figure 3.9.
When drawing Figure 3.9 it was necessary to choose one of two possible ways x
for the positive z-direction to be defined; these are shown in Figure 3.10,
where in both cases the y-axis is meant to point into the plane of the page,
away from you. positive z
The usual convention for relating the positive directions of x, y and z is given Figure 3.10
by the following rule, called the right-hand rule. The right hand is held
with the middle finger, first finger and thumb placed (roughly) perpendicular
to each other, and the other two fingers closed (see Figure 3.11). If the
thumb and first finger are pointing in the directions of the positive x- and
y-axes, respectively, then the middle finger is pointing in the direction of
the positive z-axis.
Alternatively, you can think of Figure 3.9 as showing a corner of a room
(with the z-axis pointing upwards). If you are standing in the corner facing
outwards, then the left-hand edge of the floor is the y-axis, and the right-
hand edge is the x-axis. A coordinate system defined in this way is called
a right-handed system. Only right-handed systems will be used in this
course. The systems drawn in Figure 3.9 and the top of Figure 3.10 are
right-handed systems.
clockwise screw
positive z-direction turn moves
in
positive y-direction
(a)
positive x-direction
z
Figure 3.11
173
Unit 4 Vector algebra
Exercise 3.6
Decide which of the sets of perpendicular axes in Figure 3.13 define right-
handed coordinate systems.
y y x y
z
O O O O
z z y x
x x z
(a) (b) (c) (d)
Figure 3.13
(The x-axis points out of the plane of the paper in (a) and (b). The z-axis
points into and out of the plane of the paper in (c) and (d), respectively.)
j
i
Definition
The position vector of a point A relative to the origin O of three- Figure 3.14
dimensional space is the displacement of A from O, i.e. the vector
−→
a = OA.
The i-, j- and k-components of the position vector a are the coordinates These may sometimes be
a1 , a2 and a3 of the point A, respectively. referred to as x-, y- and
z-components.
The components of vectors not based at the origin are defined similarly, as
follows.
Definition
−−→
A vector a = P Q in three-dimensional space, where P is the point Note that the component
(p1 , p2 , p3 ) and Q is the point (q1 , q2 , q3 ), has component form form may also be written as
a1
a = a1 i + a2 j + a3 k, a = a2
a3
where a1 = q1 − p1 , a2 = q2 − p2 , a3 = q3 − p3 , and i, j, k are the or
Cartesian unit vectors. The numbers a1 , a2 , a3 are the (Cartesian)
a = [a1 a2 a3 ]T .
components of a.
174
Section 3 Cartesian components of a vector
A
a
a3 k
a3
O
a1 y
a12+ a 2
a 1i 2
a2 a2 j N
x
Figure 3.15
*Exercise 3.7
Given vectors a = i + j + k, b = 2i − 3j − k and c = 3i + k:
(a) express d = 2a − 3b and e = a − 2b + 4c in component form;
(b) find the magnitudes of the vectors d and e;
(c) evaluate |a|, and write down a unit vector in the direction of a;
(d) find the components of a vector x such that a + x = b.
Exercise 3.8
⎡ ⎤ ⎡ ⎤
1 2
Find the magnitude of the vector p = 3 ⎣ 0 ⎦ − ⎣ 3 ⎦.
6 −1
175
Unit 4 Vector algebra
Note that if the parameter s in Example 3.1 is allowed to range over all the
real numbers (−∞ < s < ∞), then the point T traces out the entire straight
line of which P Q is a segment. Also note that the ideas in Example 3.1 are
easily extended to three dimensions.
*Exercise 3.9
Write down, in component form, the vector equation of the straight line on
which lie the points with Cartesian coordinates (1, 1, 2) and (2, 3, 1).
End-of-section Exercises
Exercise 3.10
Let a = 2i − j, b = i + 3j + 5k and c = j − 2k.
(a) Find the magnitudes of a and b, and describe the direction of a.
(b) Find the vectors a + b, 2a − b and c + 2b − 3a in component form.
(c) What is the endpoint Q of the displacement represented by the vector
2a − b if (0, 2, 3) is its beginning point P ?
Exercise 3.11
Write the vectors 0, i, j and k as column vectors in three dimensions.
176
Section 4 Products of vectors
4 Products of vector s
So far in this unit we have defined two algebraic operations: vector addition
(by the triangle rule) and the scaling of a vector. The addition of vectors can
be usefully applied only to two vectors representing the same type of physical
quantity. For example, the addition of a displacement and a velocity has no
physical meaning. However, vectors representing the same or different types
of physical quantities can be combined in operations that are called the dot
product and the cross product. They are called products because in some
respects they behave like ‘multiplications’ in the algebra of real numbers.
Dot products and cross products of vectors have numerous applications in
geometry, mechanics and electromagnetism.
In this section the dot product and cross product are defined geometrically
and also in terms of components of vectors. The dot product of two vectors
is interpreted in terms of projecting a shadow of one vector onto another,
and is applied to the problem of finding the angle between two vectors or
lines. The cross product of two vectors is interpreted as a vector whose
magnitude is an area. Both dot and cross products can be used in problems
involving finding the areas of plane figures and the volumes of solid objects.
Definition
The dot product of two vectors a and b is
The product a . b is read as
a . b = |a| |b| cos θ, ‘a dot b’.
where θ (0 ≤ θ ≤ π) is the angle between the directions of a and b (see
Figure 4.1). b
The dot product of two vectors is a scalar quantity, i.e. it is a real number:
a . b is the product of the three scalars |a|, |b| and cos θ. So the operation of θ
the dot product combines two vectors to define a scalar, and for this reason
the dot product is also called the scalar product. The angle θ lies in the
range 0 ≤ θ ≤ π: the value of a . b is positive for 0 ≤ θ < π2 , i.e. when θ is a
an acute angle; the value of a . b is negative for π2 < θ ≤ π, i.e. when θ is
obtuse; the value of a . b is zero for θ = π2 , i.e. when θ is a right angle. Figure 4.1
It is important, when writing a dot product, to make sure that the dot
between the vectors is clear.
b
*Exercise 4.1
Three vectors a, b and c of magnitudes 2, 4 and 1 units, respectively, lying
in the same plane, are represented by arrows as shown in Figure 4.2. The 4
angle between the vectors a and b is π3 radians, and that between the vectors c π
b and c is π6 radians. Use the definition of dot product to find the values of 6
1
a . b, b . c, a . c and b . b. π
3
2 a
Figure 4.2
177
Unit 4 Vector algebra
The following example shows how these properties can be used to simplify
expressions.
Example 4.1
Expand the expression x . y, given that x = 2u + v and y = u − 5v. Cal-
culate its value when u and v are perpendicular unit vectors.
Solution
x . y = (2u + v) . (u − 5v)
= (2u) . (u − 5v) + v . (u − 5v) (Property 3)
= (2u) . u + (2u) . (−5v) + v . u + v . (−5v) (Property 3)
= 2(u . u) − 10(u . v) + v . u − 5(v . v) (Property 4)
= 2(u . u) − 9(u . v) − 5(v . v) (Property 2)
Now u . u = |u|2 = 1 and v . v = |v|2 = 1 when u and v are unit vectors.
Furthermore, u . v = 0 when u and v are perpendicular vectors. So when u
and v are perpendicular unit vectors, we have
x . y = 2 − 0 − 5 = −3.
178
Section 4 Products of vectors
*Exercise 4.2
(a) Expand the expression (a + b) . (a − b).
(b) Expand the expression |a + b|2 . Recall that |a|2 = a . a.
Exercise 4.3
Given that a and b are perpendicular unit vectors:
(a) find the value of m such that the two vectors 2a + 3b and ma + b are
perpendicular;
(b) find the value of |c| if c = 3a + 5b.
Finally, a word of caution: (a . b)c is not in general the same as a(b . c). In general, if m is a scalar
The vector (a . b)c is a scaling of c by the number a . b, whereas a(b . c) is a and a is a vector, we can
scaling of a by the number b . c. Clearly these two vectors are not generally write ma or am as
convenient, although ma is
even parallel, let alone equal. For example, if a = b = i and c = j, then more usual; thus a(b . c)
(a . b)c = (i . i)j = j but a(b . c) = i(i . j) = 0. means the same as (b . c)a.
179
Unit 4 Vector algebra
*Exercise 4.4
If a = 4i + j − 5k and b = i − 3j + k, show that a . b = −4. What does the
negative sign tell us?
Example 4.2
√
(a) Find the angle between the vector a = i + 3k and the x-axis.
√ √
(b) Find the angle between the vectors a = i + 3k and b = 3i − 2j + 3k.
√ √
(c) Show that c = −2 3i + 2k is perpendicular to a = i + 3k.
Solution
(a) The direction of the x-axis is the same as the direction of i, and the
angle θ between a and i is given by
a.i a1 1 1
cos θ = = =√ = .
|a| |i| |a| 1+3 2
Thus the angle between a and the x-axis is π3 radians.
√ √
(b) We have |a| = 1 + 3 = 2, |b| = 3 + 4 + 9 = 4 and
√ √ √
a . b = (1 × 3) + (0 × −2) + ( 3 × 3) = 4 3.
Therefore the angle θ between a and b is given by
√ √
4 3 3
cos θ = = ,
2×4 2
so θ = π6 radians.
180
Section 4 Products of vectors
(c) To test whether a and c are perpendicular, we calculate their dot prod-
uct:
√ √
a . c = (i + 3k) . (−2 3i + 2k)
√ √
= (1 × −2 3) + (0 × 0) + ( 3 × 2)
= 0.
Since a . c = 0 and a and c are non-zero vectors, c is perpendicular
to a.
*Exercise 4.5
Consider the vectors
a = 2i − 3j + k and b = −i + 2j + 4k.
Find the magnitudes of a and b, and the angle between them.
*Exercise 4.6
If a = a1 i + a2 j + a3 k, find the values of a . i, a . j and a . k.
The solution to Exercise 4.6 shows the important fact that the i-component
of any vector a may be found by taking the dot product a . i. The j- and a A
k-components can be found similarly (by taking dot products with j and k,
respectively).
We can also find the components of a vector in other directions. Suppose P
−→ q
u
that a vector a, represented by OA, makes an angle θ with a unit vector u
(see Figure 4.4). Draw the line AP perpendicular to the direction of u. O
Then the distance OP is seen from simple trigonometry to be |a| cos θ. Now
observe that the dot product of a and u is Figure 4.4
a . u = |a| |u| cos θ = |OP | (since |u| = 1).
The distance OP represents the component of a in the direction of u. Note that a . u will be
negative if θ > π2 , i.e. if P and
u lie on opposite sides of O.
Definition
The component of a vector a in the direction of an arbitrary unit
vector u is a . u.
*Exercise 4.7
Consider the vectors
a = 2i − 3j + k and b = −i + 2j + 4k.
(a) Which of the following vectors is perpendicular to a?
c = −i + j + 3k, d = −2i + k, e = −i − j − k.
(b) Find the component of the vector a + 2b in the direction of the line
joining the origin to the point (1, 1, 1).
181
Unit 4 Vector algebra
The following example uses Cartesian unit vectors that are not horizontal
and vertical.
Example 4.3
Suppose that the unit vector i points up a plane which is inclined at an angle
α to the horizontal, and the unit vector j is perpendicular to the plane, as
shown in Figure 4.6. Find the i- and j-components of the vectors N and W. N
π−a π
Solution 2
+a
a j
It is easy to resolve the vector N into its component form:
a W i
N = 0i + |N|j = |N|j.
For the vector W, we note from the geometry of the diagram that the angles Figure 4.6
between W and i, and W and j, are given by π2 + α and π − α, respectively.
Applying Procedure 4.1 twice (with u = i and then u = j) allows us to j
resolve W into its component form: v
W= |W| cos( π2
+ α) i + |W| cos(π − α) j π
6
= −|W| sin α i − |W| cos α j. i
π
So the i- and j-components of N are 0 and |N| respectively, and those of W 6
are −|W| sin α and −|W| cos α.
*Exercise 4.8 w
The two-dimensional vectors v and w in Figure 4.7 have magnitudes 1.5
and 2, respectively. Resolve v and w into their i- and j-components. Figure 4.7
182
Section 4 Products of vectors
Exercise 4.9
In Figure 4.8 the point P lies on a line making an angle α with the x-axis.
The vectors a, b, c, d have magnitudes 1, 1.5, 1.5 and 2, respectively, and
point in the directions shown.
c
b
P
d
j
α a
i x
Figure 4.8
*Exercise 4.10
Figure 4.9 shows a configuration similar to Figure 4.8, but with the unit
vectors i and j aligned along and perpendicular to the line, respectively.
j
y
i
c
b
P
d
α a
x
Figure 4.9
*Exercise 4.11
The vectors p, q and r in Figure 4.10 have magnitudes 2.5, 3 and 2.5,
respectively. Resolve p, q and r into their i- and j-components.
p j
α
β γ i x
q
r
Figure 4.10
183
Unit 4 Vector algebra
Exercise 4.12
j
The sum of the two-dimensional vectors a, b, c in Figure 4.11 is the zero vec- a
tor, and |c| = 2. By resolving the vectors into their components, determine
the magnitudes of a and b.
i
π
4.2 The cross product b 4
You have seen that the dot product of two vectors is a scalar (i.e. a real
number). In contrast, the cross product of two vectors is a vector, whose
direction is perpendicular to both. The cross product has numerous appli- c
Figure 4.11
cations in geometry and mechanics, as you will see later in the course.
Definition
The cross product of two vectors a and b is
The product a × b is read as
a × b = (|a| |b| sin θ)
c, ‘a cross b’.
where θ (0 ≤ θ ≤ π) is the angle between the directions of a and b, and
c is a unit vector perpendicular to both a and b, whose sense is given
by the right-hand screw rule as shown in Figure 4.12. direction
of screw’s
motion
The angle θ between two vectors a and b lies in the range 0 ≤ θ ≤ π, so
sin θ ≥ 0 and hence |a| |b| sin θ ≥ 0. So the cross product of a and b is a
vector with magnitude |a| |b| sin θ and direction defined by
c. The direction
of
c is the direction in which the screw in Figure 4.12 would advance when
turned from a towards b through the angle θ. Notice that c is not defined ^
c b
if a and b are parallel or if a or b is the zero vector; but in these cases
|a| |b| sin θ = 0 so we take a × b = 0. The cross product is also called the
vector product, which stresses the fact that a × b is a vector. θ turn
a to b
The order of writing down a and b is very important. According to the
screw rule, b × a is a vector in the direction opposite to a × b. Figure 4.13 a
shows what would happen to the screw in Figure 4.12 if we turned from b
to a: it would ‘unscrew’. The unit vector d in the direction of b × a is in
Figure 4.12
the opposite sense to c, i.e. d = −
c. Hence
b × a = (|b| |a| sin θ) d = −(|b| |a| sin θ)
c = −(a × b).
direction
*Exercise 4.13 of screw’s
motion
Three vectors u, v and w lie in the (x, y)-plane. Their magnitudes are 2, 3
and 4 units, respectively, their directions make angles π6 , π3 and π6 radians,
respectively, with the positive x-axis, and they have positive j-components.
Use the definition of the cross product to find the vectors u × v, u × w and
v × w. b
Exercise 4.13 illustrates an important property of the cross product. If two θ turn
vectors a and b are parallel, then the angle θ between their directions is zero b to a
or π radians, so the cross product of a and b is the zero vector, because the ^
magnitude of the vector, i.e. |a| |b| sin θ, is zero. The converse also holds: if d a
a and b are two non-zero vectors such that a × b = 0, then the definition
of the cross product tells us that sin θ = 0; therefore θ = 0 or θ = π, and the Figure 4.13
vectors are parallel. We can also deduce that
a × a = 0 for any vector a.
184
Section 4 Products of vectors
So we can test for perpendicular vectors by using the dot product and for
parallel vectors by using the cross product.
These properties can all be derived from the definition of the cross product,
but the derivations are not given here. Note in particular Property 2: the
cross product is not commutative — the order does matter.
*Exercise 4.14
(a) Show that i × j = k, j × k = i and k × i = j.
(b) Calculate j × i, k × j and i × k.
(c) Calculate i × i, j × j and k × k.
(d) Expand and simplify
(i + k) × (i + j + k) and (i × (i + k)) − ((i + j) × k).
185
Unit 4 Vector algebra
a × b = (a1 i + a2 j + a3 k) × (b1 i + b2 j + b3 k)
= a1 i × (b1 i + b2 j + b3 k)
+ a2 j × (b1 i + b2 j + b3 k)
+ a3 k × (b1 i + b2 j + b3 k) (using Property 3)
= a1 i × b 1 i + a1 i × b 2 j + a1 i × b 3 k
+ a2 j × b 1 i + a2 j × b 2 j + a2 j × b 3 k
+ a3 k × b 1 i + a3 k × b 2 j + a3 k × b 3 k (using Property 3)
= a1 b2 (i × j) + a1 b3 (i × k)
+ a2 b1 (j × i) + a2 b3 (j × k)
+ a3 b1 (k × i) + a3 b2 (k × j) (using Properties 4 and 6)
= (a2 b3 − a3 b2 )i + (a3 b1 − a1 b3 )j + (a1 b2 − a2 b1 )k (using the results above).
We highlight this important formula.
This formula is not easy to remember or use. For this reason, simpler meth- Another quick way to
ods have been devised, such as the following, which is known as Sarrus’s evaluate cross products is to
Rule. Given two vectors a = [a1 a2 a3 ]T and b = [b1 b2 b3 ]T , draw a use determinants. This
method is introduced in
tableau with i, j and k in the top row, then repeat i and j. In the second Unit 9 when we discuss
row do the same with the components of a, and in the third row with those determinants. If you already
of b. Then following the diagonal lines as shown, and multiplying the en- know this method, then we
tries, gives the corresponding components of the cross product a × b, which suggest that you continue to
are the elements on the fourth row of the tableau. use it.
i j k i j
b b " b " "
b b" b" "
bb "" bb " "" bb
a1 a2 a3 a1 " a2
" b " b " b
" b" b" b
" "b "b bb
b1 " b2 " b b3 " b b1 b2
" " " b b b
" " " b b b
"" "" "
" b
b bb bb
−a2 b1 k −a3 b2 i −a1 b3 j a2 b 3 i a3 b 1 j a1 b 2 k
(The diagonals pointing to the right yield positive terms, while those point-
ing to the left have a minus sign.)
Example 4.4
If a = 2i + j − k and b = i − 3j + 4k, find a × b.
Solution
Since a1 = 2, a2 = 1, a3 = −1 and b1 = 1, b2 = −3, b3 = 4, the formula
above gives
a × b = ((1 × 4) − (−1 × −3))i
+ ((−1 × 1) − (2 × 4))j
+ ((2 × −3) − (1 × 1))k
= i − 9j − 7k.
186
Section 4 Products of vectors
i j k i j
b b " b " "
b b" b" "
bb "
" bb "
" bb ""
2 1 −1 2 1
" b " b " b
" b" b" b
"" "" bb "" bb bb
1 −3 4 1 −3
" " " b b b
" " " b b b
"" "" "" bb bb bb
−k −3i −8j 4i −j −6k
so a × b = i − 9j − 7k, as before.
*Exercise 4.15
If a = 2i − 3j + k, b = −i + 2j + 4k and c = −4i + 6j − 2k, find a × b, a × c
and b × c. From your results, what can you say about a and c?
*Exercise 4.16
If a = 2i + 2j + k and b = 4i + 4j − 7k, find a unit vector whose direction
is perpendicular to the directions of both a and b.
b
We close the section, and the unit, with some useful geometric applications
of the cross product. The following example is the first step.
Example 4.5 θ
Any two non-zero and non-parallel vectors a and b define a parallelogram, a
as shown in Figure 4.15. Express the area of the parallelogram in terms of
a × b. Figure 4.15
Solution
The area A of the parallelogram defined by the two vectors a and b is a
b
the same as the area of the rectangle of height |b| sin θ and width |a| (see
Figure 4.16). Thus A = |a| |b| sin θ, and this is the magnitude of a × b. So b sin θ
A = |a × b|.
θ
a
Area of a parallelogram
The area of a parallelogram with sides defined by vectors a and b is Figure 4.16
|a × b|.
b
This idea is easily extended for the area of a triangle. Any two non-zero,
non-parallel vectors a and b define a triangle (see Figure 4.17). The area of
this triangle is half that of the corresponding parallelogram, so it is 12 |a × b|.
a
Area of a triangle
The area of a triangle with sides defined by vectors a and b is 12 |a × b|. Figure 4.17
187
Unit 4 Vector algebra
Using the formula for the area of a parallelogram, it is easy to find the
volume of a parallelepiped (see Figure 4.18). This is given by A parallelepiped is like a
distorted brick. All of its
volume of parallelepiped = base area × vertical height h faces are parallelograms.
= |(a × b) . c|.
Here we have made use of the fact that the base is a parallelogram (assumed
to be in the (x, y)-plane) defined by the vectors a and b. The base therefore c
has an area equal to the magnitude of a × b. Now the vertical height h h
is the component of the vector c in the direction of the Cartesian unit b
vector k pointing vertically upwards, i.e. it is the z-component of c, given
a
by c . k = k . c. So the volume of the parallelepiped is |a × b|(k . c). But
the vector product a × b points vertically upwards and can therefore be
Figure 4.18
expressed as |a × b| k. Hence the volume of the parallelepiped is
|a × b|(k . c) = (|a × b| k) . c = (a × b) . c.
Of course, the scalar (a × b) . c can be negative if one of the defining vec- The scalar quantity
tors a or b is chosen to be in the opposite direction to the one chosen in (a × b) . c is an example of a
Figure 4.18, or if the order of the cross product is reversed. The modulus scalar triple product.
signs in the formula |(a × b) . c| ensure that the volume comes out positive.
End-of-section Exercises
Exercise 4.17
(a) Is a . b a vector?
(b) Can a . b be negative?
(c) What is special about a and b if a . b = |a| |b|?
(d) If a . b = 0, what can you say about a and b?
(e) If a × b = 0, what can you say about a and b?
Exercise 4.18
Suppose that the vectors r and s are directed towards north and north-east,
respectively, and define r × s = t.
(a) What is the direction of t?
(b) In what direction is s × r?
(c) In what direction is t × r?
(d) If |r| = |s| = 1, what is |t|?
(e) Calculate the vector t × (r × s).
(f) If |r| = |s| = 1, what is the value of r . s?
(g) If |r| = |s| = 1, what is the value of s . (t × r)?
Exercise 4.19
Find the value of (a × b) . a for any non-zero vectors a and b.
188
Outcomes
Outcomes
After studying this unit you should be able to:
• understand the meaning of the terms scalar, vector, displacement vector,
unit vector and position vector, and know what it means to say that two
vectors are equal;
• use vector notation and represent vectors as arrows on diagrams;
• use the plane polar coordinate representation of the magnitude and di-
rection of a vector, and convert between the polar coordinates and the
Cartesian coordinates of the endpoint of a vector drawn from the origin;
• scale a vector by a number, and add two vectors geometrically using the
triangle rule (or the parallelogram rule);
• resolve a vector into its Cartesian components, and scale and add vectors
given in Cartesian component form;
• calculate the dot product (scalar product) and cross product (vector
product) of two given vectors;
• determine whether or not two given vectors are perpendicular or parallel
to one another;
• determine the magnitude of a vector and the angle between the directions
of two vectors;
• write down the vector equation of a given straight line;
• resolve a vector in a given direction;
• manipulate vector expressions and equations involving the scaling, ad-
dition, dot product and cross product of vectors;
• use the cross product to determine the area of a parallelogram or triangle.
189
Unit 4 Vector algebra
Section 1
Cartesian Polar
1.1 coordinates (x, y) coordinates r, φ
y (0, −1) 1, − π
√ π2
b (1, 1) 2, 4
3 a √ √
(2 2, −2 2) 4, − π4
(−6, 0) 6, π
2 √
(−1, −1) 2, − 34π
1 −1, π
π (2.003 × 107 , 9.797 × 107 ) 8
10 , exp(0.1π)
3
1.8 Since (0, −3) lies on the negative part of the y-axis,
we can immediately write down the polar coordinates
−−→ −−→
0 10 20 30 of OQ as 3, − π2 , so |OQ| = 3.
Birmingham Scale (km) Alternatively, using the formulae
r = (02 + (−3)2 )1/2 = 3,
sin φ = −3/3 = −1, cos φ = 0
1.3 gives the same results.
N
Section 2
70 mph
60°
2.1 (a) Leeds to Bristol: −d.
Leeds to Leeds: 0.
(b) (i) 2v (ii) −v (iii) 0
(c) The vector −1.5v has magnitude 1.5|v| and direc-
tion opposite to v.
0 10 20 30 40 50 60 70
The vector −kv (k positive) has magnitude k|v| and
Scale (mph) direction opposite to v.
−−→ −−→
(d) (i) The vectors AB and DC are equal in length
and parallel, and point the same way (i.e. have the same
1.4 f = b, as both are of length 2 units and both point direction). Thus
−−→ −−→
in the positive y-direction. AB = DC.
−−→ −−→
(ii) The vectors BC and DA are equal in length and
parallel, but point in opposite directions. Thus
1.5 The completed table is as follows. −−→ −−→ −−→ −−→
BC = −DA (or, equivalently, DA = −BC).
190
Solutions to the exercises
O a x
–b a–b –b
191
Unit 4 Vector algebra
2.9 2.13 (a) The first rule is true and the second is false
y (scalar multiplication does not change direction).
a+b
b b (b) The proposed rule does not hold. (Consider r + r,
for example, where r = r, φ. The proposed rule gives
2a + 12 b r + r = 2r, 2φ, whereas actually r + r = 2r = 2r, φ.)
1
2b (There is an algebraic rule for adding vectors in polar
π π
a 4 4 form, but it is rather unwieldy. This is one reason why
O
2a x Section 3 introduces the Cartesian representation of a
vector, for which there is a simple algebraic rule for the
addition of vectors.)
a–b –b Section 3
3.1 a = i = [1 0]T
b = 2j = [0 2]T
2.10 The magnitude of v is 4.7 times the magnitude c = i = [1 0]T (= a)
of u. v is parallel to u, but the sense of v is opposite
to the sense of u, i.e. v and u have opposite directions. d = −2i = [−2 0]T
e = −i + 2j = [−1 2]T
2.11 By the triangle rule, f = 2j = [0 2]T (= b)
−−→ −−→ −→ g = i − j = [1 −1]T
AB + BC = AC,
−−→ −−→ −→ h = 3i = [3 0]T
CD + DA = CA.
−→ −→
But CA = −AC, so we have
−−→ −−→ −−→ −−→ −→ −→ 3.2 (a) a = 3i + j, b = −i + 3j, c = −3i − 2j,
AB + BC + CD + DA = AC − AC = 0. d = 3i + j (= a).
C (b)
y
B
3.5
3.5 a
A
D
−
−→ −−→ −→ 1
Alternatively, AB + BC = AC. Hence a
−−→ −−→ −−→ −→ −−→ −−→
AB + BC + CD = AC + CD = AD,
3 10.5 x
so
−−
→ −−→ −−→ −−→ −−→ −−→
AB + BC + CD + DA = AD + DA = 0, 3.5a = 3.5(3i + j) = 10.5i + 3.5j, as in the diagram.
−−→ −−→
since AD = −DA.
(There are various other possible arguments!) (c) y
4
b
2.12 The vectors p, q and p + q are sketched below. a+b
3
y
p+q p 〈3, π 〉 2
2
φ 1
q a
〈4, π 〉 x
O 1 2 3 4 x
π
5p = 15, 2 , −q = 4, 0.
a + b = (3i + j) + (−i + 3j) = 2i + 4j, as in the diagram.
Since the √ directions of p and q are at right angles,
|p + q| = 32 + 42 = 5 and φ = π2 + arctan 43 = 2.498 (d) 2a + b − c = 2(3i + j) + (−i + 3j) − (−3i − 2j)
radians, so = 8i + 7j.
√ √
p + q = 5, 2.498. Thus |2a + b − c| = 82 + 72 = 113.
192
Solutions to the exercises
3.3 (a) Given r = s, we can equate the corresponding 3.7 (a) d = 2(i + j + k) − 3(2i − 3j − k)
components. Thus = −4i + 11j + 5k,
p + q = −3 and p − q = 7,
e = (i + j + k) − 2(2i − 3j − k) + 4(3i + k)
which gives p = 2 and q = −5. = 9i + 7j + 7k.
√ √
1 1 1
(b) t = u + v = √ + (b) |d| = (−4)2 + 112 + 52 = 162 (= 9 2),
2 1 −1 √
√ |e| = 92 + 72 + 72 = 179.
1 2 2 √ √
=√ = .
2 0 0 (c) |a| = 12 + 12 + 12 = 3.
√
Hence |t| = 2. A unit vector in the direction of a is
1 1
a = √ (i + j + k).
|a| 3
3.4 (a) |a| = 2, |b| = 3, |c| = 1. (d) If a + x = b, then
(b) Use the formulae x = r cos φ and y = r sin φ (see x = b − a = (2i − 3j − k) − (i + j + k)
Subsection 1.5). = i − 4j − 2k.
First consider the vector a. The Cartesian components Thus the components of x are 1, −4 and −2.
of a are the numbers a1 and a2 given by
√
a1 = 2 cos π3 = 1, a2 = 2 sin π3 = 3. ⎡ ⎤ ⎡ ⎤ ⎡ ⎤
3 2 1
Thus
√ 3.8 p = ⎣ 0 ⎦ − ⎣ 3 ⎦ = ⎣ −3 ⎦, so
a = a1 i + a2 j = i + 3j. 18 −1 19
Similarly for b and c:
|p| = (12 + (−3)2 + 192 )1/2 = 3711/2 ( 19.26).
b = 3 cos 3π 3π √3
4 i + 3 sin 4 j = − 2 i +
√3 j,
2
√
c = cos π6 i + sin π6 j = 23 i + 12 j.
3.9 Relative to the origin of the Cartesian coordinate
(c) We now have system, the two points have position vectors i + j + 2k
√ √
and 2i + 3j + k. Thus the vector equation of the line is
a + c = i + 3j + 23 i + 12 j
√ √ t = (1 − s)(i + j + 2k) + s(2i + 3j + k)
= 1 + 23 i + 3 + 21 j = (1 + s)i + (1 + 2s)j + (2 − s)k,
1.866i + 2.232j. where −∞ < s < ∞.
√
3.5 (a) The components are 3.10 (a) |a| = 22 + (−1)2 = 5,
√
v1 = 5 cos 5π
18 3.214, |b| = 12 + 32 + 52 = 35.
5π The vector a lies in the (x, y)-plane, and the angle√φ
v2 = 5 sin 18 3.83.
(b) The magnitudes are that it makes with
√ the x-axis is given by cos φ = 2/ 5
√ and sin φ = −1/ 5. Hence φ −0.4636 radians.
|a| = (( 3)2 + (−1)2 )1/2 = 2,
√ (b) a + b = 3i + 2j + 5k,
|b| = ((−3)2 + 32 )1/2 = 3 2 4.243.
2a − b = 3i − 5j − 5k,
To specify the directions, we need a reference direction.
Using the plane polar coordinate convention, we can c + 2b − 3a = −4i + 10j + 8k.
−−→
specify the angle φ with respect to the positive x-axis. (c) The vector P Q is equal to 2a − b. The point Q is
Thus, for vector a, −−→
√ the end of the vector OQ, which is given by
cos φ = a1 /|a| = 3/2, −−→ −−→ −−→
OQ = OP + P Q
sin φ = a2 /|a| = −1/2, = (2j + 3k) + (3i − 5j − 5k)
hence φ = − π6 . = 3i − 3j − 2k,
For vector b,
√ √ so Q is the point (3, −3, −2).
cos φ = −3/(3 2) = −1/ 2,
√ √
sin φ = 3/(3 2) = 1/ 2,
3.11 0 = [0 0 0]T ,
hence φ = 3π
4 . i = [1 0 0]T ,
j = [0 1 0]T ,
3.6 Systems (b), (c) and (d) are right-handed. k = [0 0 1]T .
193
Unit 4 Vector algebra
194
Solutions to the exercises
4.9 The technique here is the same for all the vectors. and the j-component of d is
One must find the angle between the vector in question |d| cos( π2 + α) = −2 sin α
and the unit vectors i and j.
(using the usual trigonometric formulae). So
Vector a points vertically downwards, in the direc-
d = −2 cos α i − 2 sin α j.
tion −j. Hence i and a are perpendicular, and
a = 0i − j = −j. 4.10 As in the previous exercise, we must find the
The angle between i and b is α, and the angle between angles between a, b, c, d and the unit vectors i and j.
j and b is π2 − α. Hence the i-component of b is First notice that b points in the direction i, so b = 1.5i.
|b| cos α = 1.5 cos α, Similarly, d points in the direction −i, so d = −2i.
and the j-component of b is Also, c points in the direction j, so c = 1.5j.
|b| cos( π2 − α) = 1.5 sin α,
y
where we have used the formula
cos(β − α) = cos β cos α + sin β sin α
to evaluate cos( π2 − α) (see the Handbook). So
b = 1.5 cos α i + 1.5 sin α j. P α
π
The angle between i and c is 2 + α, and the angle be- π
−a
tween j and c is α. j 2
α a
i
y x
195
Unit 4 Vector algebra
4.12 The i-component of a is clearly zero, while the 4.14 (a) i, j and k are unit vectors forming a right-
j-component is simply |a|. Similarly, the i-component handed system.
of b is −|b| while the j-component is zero. Hence
a = |a| j and b = −|b| i. k
The i- and j-components of c are, respectively,
√
|c| cos π4 = 2 × √12 = 2
and
√ j
|c| cos( π2 + π4 ) = −2 sin π4 = − 2,
so
√ √ i
c= 2i − 2j.
Given that a + b + c = 0, the sum of all the i-
components of a + b + c must be zero, and so must the Thus, using the definition of the cross product,
sum of all the j-components. Therefore (i-components) i × j = (|i| |j| sin π2 )k = k.
√
0 − |b| + 2 = 0 Similarly,
and (j-components) j×k=i and k × i = j.
√
|a| + 0 − 2 = 0. (b) Since (a × b) = −(b × a) for any vectors a and b,
√ we have
Thus we see that |a| = |b| = 2.
j × i = −k, k × j = −i and i × k = −j.
(c) Since a × a = 0 for any vector a, we have
4.13 For the sake of clarity, here is a diagram showing
i × i = j × j = k × k = 0.
u, v and w (where all three vectors start at O) drawn
in the (x, y)-plane. (The z-axis points out of the page.) (d) (i + k) × (i + j + k)
= (i × (i + j + k)) + (k × (i + j + k))
= (0 + k + (−j)) + (j + (−i) + 0)
y
v = −i + k,
(i × (i + k)) − ((i + j) × k)
w = (0 + (−j)) − (−j + i)
= −i.
u
π
6
π
4.15 To compute a × b, we use Sarrus’s Rule:
6
i j k i j
O x
2 −3 1 2 −3
−1 2 4 −1 2
The cross products are all perpendicular to the (x, y)- −3k −2i −8j −12i −j 4k
plane.
so a × b = −14i − 9j + k.
A unit vector in the direction of u × v is k, so
Similarly for a × c:
u × v = |u| |v| sin π6 k = (2 × 3 × 12 )k = 3k.
The angle between u and w is zero, so i j k i j
2 −3 1 2 −3
u × w = (|u| |w| sin 0)
c = (2 × 4 × 0)
c = 0
c = 0.
−4 6 −2 −4 6
A unit vector in the direction of v × w is −k, so
−12k −6i 4j 6i −4j 12k
v × w = |v| |w| sin π6 (−k) = (3 × 4 × 12 )(−k)
so a × c = 0.
= −6k.
Finally, for b × c:
i j k i j
−1 2 4 −1 2
−4 6 −2 −4 6
8k −24i −2j −4i −16j −6k
so b × c = −28i − 18j + 2k.
Since a × c = 0, and neither vector is zero, the vectors
a and c are parallel. In fact, c = −2a.
196
Solutions to the exercises
(e) t × (r × s) = t × t = 0
(f ) r . s = |r| |s| cos π4 = √1
2
197
Block 1
Index
absolute error 70, 80 cost 81
absolute error bound 70 cotangent 23
absolute value 9 cross product 184, 185
accumulation 64
addition of vectors 165, 170, 175 De Moivre’s Theorem 31
algebraic rules for scaling and adding vectors 166 death rate 63
analytic solution 83 decay constant 89
angle between vectors 180 decimal places 7
approximate solution 75 decreasing function 38
arbitrary constant 41, 67, 94 definite integral 48
arccos 25 dependent variable 10, 65, 85
arcsin 25 derivative 32, 65
arctan 25 derived function 32
area of a parallelogram 187 difference of two squares 16
area of a triangle 187 differentiable 66
Argand diagram 30 differential equation 66
argument of a complex number 30 explicit solution 87
associated homogeneous equation 125 first-order 66
associativity 166 general solution 67
asymptote 39 homogeneous 91
auxiliary equation 114 implicit solution 87
inhomogeneous 91
base of an exponential 17 linear 91
birth rate 63 non-homogeneous 91
boundary condition 138 order 66
boundary value 138 particular solution 67
boundary-value problem 138 solution 66
bounded above 39 differentiation 31
bounded below 39 of a complex-valued function 37
direct integration 84
Cartesian components of a vector 169, 174 direction field 72
Cartesian coordinates 159, 164, 172 direction of a vector 155, 157
Cartesian unit vectors 164, 174 discrete model 10
Chain Rule 36 discriminant 15, 120
characteristic equation 114 displacement 154, 156
closed form of a recurrence system 10 displacement vector 156
closed interval 8 distributivity 166
codomain of a function 25 division of complex numbers 28
coincident roots 28 domain 69
column vector 169 domain of a function 10, 44
commutativity 166 dominate 144
complementary function 125 dot product 177, 178
complex conjugate 28
complex exponential 31 efficiency 81
complex number 27 equal roots 28
complex-valued function 37 equal vectors 158
component form of a vector 169, 174 equation of a straight line 176
component form of cross product 186 error 70
component form of dot product 180 error bound 9
component of a vector 169, 174, 181 Euler’s formula 31
composite function 21 Euler’s method 78
Composite Rule 36 explicit solution 87
composition of functions 21 exponent of an exponential 17
constant of integration 41 exponential form of a complex number 31
constant-coefficient equation 112 exponential function 17, 18, 129
continuous function 40
continuous model 10 factorization 15
cosecant 23 first derivative 33
cosine 23 first-order differential equation 66
198
Index
formula method for a quadratic equation 14 method of undetermined coefficients 127, 131
function 10 minimum 39
function notation for derivatives 33 modulus of a complex number 29
modulus of a number 9
Gaussian elimination 13 modulus of a real number 155
general solution 110, 120, 121, 125, 126 modulus of a vector 155
general solution of a differential equation 67 multiplication of a vector by a scalar 162
global maximum 39 multiplication of complex numbers 30
global minimum 39
gradient 12, 32, 72 natural logarithm function 18
Newtonian notation 33
half-open interval 8 non-homogeneous differential equation 91
homogeneous differential equation 91 non-homogeneous equation 112
homogeneous equation 112 nth derivative 33
image set of a function 10 nth root 17
imaginary part of a complex number 27 nth-order polynomial 28
implicit differentiation 37
implicit solution 87 open interval 8
increasing function 38 order of a derivative 33
indefinite integral 41, 84 order of a differential equation 66
independent variable 10, 65, 85 orientation of a vector 157
index of an exponential 17 output 64
inhomogeneous differential equation 91
inhomogeneous equation 112, 124 parallel vectors 162
initial condition 68, 136 parallelepiped 188
initial value 68, 136 parallelogram rule 166
initial-value problem 68, 136 parameter 11
input 64 particular integral 125, 131
input–output principle 64 particular solution 111, 135
integer 7 particular solution of a differential equation 67
integral 42, 84 perfect square 16
integrand 41 periodic function 23
integrating factor 94 perpendicular vectors 178
integrating factor method 95 plane polar coordinates 159
integration 41 polar coordinates 29, 159
by parts 47 polar form of a complex number 30
by substitution 45 polynomial function 128
interval 8 polynomial of degree n 28
inverse function 18 population model 63
inverse trigonometric functions 25 position vector 168, 174
irrational number 7 power function 20
power of an exponential 17
Leibniz notation 33 principal value of the argument 30
linear constant-coefficient second-order differential principle of superposition 113
equation 112 Product Rule 35
linear differential equation 91 proportionate birth rate 63
linear function 12 proportionate death rate 63
local maximum 38 proportionate growth rate 64
local minimum 38
log plot 19 quadratic equation 14
logarithm function 18 quadratic function 14
logistic equation 65, 72, 90 Quotient Rule 35
log–linear plot 19
log–log plot 20 rate of change 32
lower bound 39 rational number 7
real number 7, 155
magnitude of a number 9 real part of a complex number 27
magnitude of a real number 155 recurrence system 10
magnitude of a scalar 155 reducing the step size 79
magnitude of a vector 155, 170, 175 relative error 71
maximum 39 repeated roots 28
199
Block 1
resolving a vector into components 171, 181 solving inhomogeneous equations 126
resultant 165 speed 155
right-hand rule 173 stationary point 38
right-handed system 173 steady-state solution 143
roots step length 77
of a polynomial equation 28 step size 77, 79
of a quadratic equation 14 subtraction of vectors 165
sum of 14 sum of vectors 165
rounding 8
rounding error 81 tangent function 23
third derivative 33
Sarrus’s Rule 186 transient 142
scalar 155 transient solution 142
scalar multiple 162 transpose symbol 169
scalar multiplication 162 trial solution 127
scalar product 177 triangle rule 165
scalar triple product 188 trigonometric functions 22
scaling 162, 166 trigonometric identities 26
scaling of a vector 162, 170, 175
scientific notation 7 undetermined coefficients, method of 127, 131
screw rule 173 unit vector 163, 174
secant 23 upper bound 39
second derivative 33
sense of a vector 157 variable 10
separable differential equation 87 vector 155, 167
separation of variables method 87 vector addition 165, 170, 175
sign of a real number 155 vector addition rule 165
significant figures 7 vector equation of a straight line 176
simultaneous linear equations 13 vector product 184
sine 23 vector subtraction 165
sinusoidal function 130 velocity 155, 157
slope 12, 32, 72, 75 volume of a parallelepiped 188
smooth function 40
solution of a differential equation 66 (x, y)-plane 172
solution of a quadratic equation 14
solving homogeneous equations 120 zero vector 157, 165, 166
200