MME R Lee: Eek Noe
MME R Lee: Eek Noe
Mathematics
eek noe
MMErlee
Edexcel
Edexcel
Edexcel i
Edexcel | 4
A-Level
MELE Cece
UXBRIDGE COLLEGE
LEARNING CENTRE
2. Linear Regression
Explanatory and response variables 114
Regression lines 115
Interpolation and extrapolation 118
Linear regression with coded data 121
Review Exercise — Chapter 4 123
Exam-Style Questions — Chapter 4 124
eye ay
The Normal Distribution
1. The Normal Distribution
The normal distribution 126
Chapter 2 Probability
In the next example, you're asked to find the probability that Z takes a value
between two limits. You can do this by subtracting one area from another.
The speeds of 60 cars travelling in a 40 mph speed limit area were measured to the nearest mph.
The data is summarised in this table.
Calculate estimates of the mean and median, pecan pul
and state the modal class. Frequenc
Find the mean and standard deviation of the following numbers: 11, 12, 14, 17, 21, 23, 27.
The scores in an IQ test for 50 people are recorded in the table below.
Score [100
-106 | 107-113
|114-120 | 121
-127 | 128
-134
Frequency 6 11 I 22 9 2
Estimate the mean and variance of the distribution,
Exam-Style Questions A group of 19 people played a game. The scores, x, that the people achieved are
summarised by:
D(x — 30) = 228 and (x — 307 = 3040
Questions in the same style as the a) Calculate the mean and the standard deviation of the 19 scores. (3 marks)
ones you'll get in the exam, with b) Show that }).x = 798 and >)x’ = 33820. (3 marks)
Containing all the formulas and statistical b) Find the standard deviation ofthe masses ofthe brand A chocolate drops. (3 marks)
d) Find the standard deviation of the masses ofall 50 chocolate drops. (4 marks)
Chapter 1 Data
Glossary
All the definitions you need to know for
the exam, plus other useful words.
A-Level
“Mathematics
for Edexcel
With thanks to Mona Allen and Glenn Rogers for the proofreading.
With thanks to Alastair Duncombe for the reviewing.
_Photocopying — it's dull, grey and sometimes a bit naughty. Luckily, it's dead cheap, easy and
quick to order more copies of this book from CGP — just call us on 0870 750 1242. Phew!
Text, design, layout and original illustrations © Coordination Group Publications Ltd. (CGP) 2012
All rights reserved.
Introduction to Mathematical NOR ente
1. Mathematical Models in
Probability and Statistics
Mathematical models are basically descriptions of things in the real world, Learning Objectives:
and are usually written to help solve some kind of problem. They’re usually Be able to explain
much simpler than the real-world situations they describe, but are still useful. the mathematical
modelling process
The benefits of mathematical models Be able to describe
some of the reasons
Understanding real-world problems for carrying out
= A technology company is going to launch a new product mathematical
— the USB-Penguin. modelling
The company’s managers would like to maximise profits from
USB-Penguin sales. (At the very least, they need to sell enough
USB-Penguins to recover the money spent on development.)
= They are trying to decide what would be the best price to charge
for the USB-Penguin. They realise that the variables ‘price’, ‘sales’,
and ‘profit’ are linked.
For example:
= If the price is low, then sales should be higher.
(i) However, if sales aren’t as high as expected, the low price may mean
that the company will not be able to recover the development costs.
(ii) But if the low price encourages lots of people to buy the
USB-Penguin, then profits could be high.
= Ifthe price is high, then they can expect to sell fewer USB-Penguins.
(i) However, if sales turn out to be higher than expected, then the
company will make a very big profit.
(ii) But if the high price puts people off buying, then the company won't
recover the development costs.
= The managers need to answer the following question:
What is the best price to charge for the USB-Penguin to maximise profits
(or to have the best chance of recovering the development costs)?
1. Representing Data
SEES GSS SUS
Data is to statistics what fuel is to a car — without data, all the statistics Learning Objectives:
knowledge in the world won’t be much use. This chapter covers the essentials
" Be able to recognise
about data — from graphs, through to location, dispersion and skewness. different types of
variables.
Data basics " Be able to interpret :
A lot of the subject of statistics involves analysing data. frequency tables and
grouped frequency
Data consists of a number of observations (or measurements). tables.
Each of these observations records the value of a particular variable. Be able to draw and
There are different kinds of variables. interpret histograms,
= Variables that take non-numerical values (i.e. they’re not numbers) and stem and leaf
diagrams. '
— these are called qualitative variables. espeene gn sr orem gr ;
elem a es
An employer collects information about the computers in his office.
He gathers observations of the 5 variables shown in this table.
1. Manufacturer Bell _|Banana | Deucer | Deucer
2. Processor speed (in GHz) 2.6 Dive) oeies
3. Year of purchase 2009 | 2010 | 2011
4. Memory (in MB) . 2 3 Jel
5. Colour Grey Grey
Which of the five variables are:
a) qualitative?
® The variables ‘Manufacturer’ and ‘Colour’ take values that are not
numbers.
= So there are two qualitative variables: ‘Manufacturer’ and ‘Colour’ .
b) quantitative?
= The variables ‘Processor speed’, ‘Year of purchase’ and ‘Memory’
take values that are numbers.
= So there are three quantitative variables:
, ‘Processor speed’, ‘Year of as * and ‘Memory’ .
Chapter1 Data =
The variables below are all quantitative.
(i) length, (ii) weight, (iii) number of brothers, (iv) time,
(v) total value of 6 coins from down the back of my sofa
Tip: ‘Number of a) Which of these 5 quantitative variables are continuous?
brothers’ can only take = ‘Length’, ‘weight’ and ‘time’ can all take any value in a range.
whole-number values.
= So the continuous variables are: ‘length’, ‘weight’ and ‘time’.
‘Total value of 6 coins’
can only take particular b) Which of these 5 quantitative variables are discrete?
values. For example, = ‘Number of brothers’ and ‘total value of 6 coins’ can
they could be worth 12p only take certain values.
or eanbut not 12.
iei
= So there are two discrete variables — these are: ‘number of brothers’
and ‘total value of 6 coins’ .
Tip: Frequency tables = Grouped frequency tables show the number of observations whose
and grouped frequency values fall within certain classes (i.e. ranges or groups of values).
tables can also be They're often used when there is a large range of possible values.
drawn ‘vertically’, like For example, this grouped frequency table shows
this:
the number of potatoes in thirty 25 kg sacks.
Number | Frequency
of Muriberof
pote 50 55 260| 61-65 | 66-70| 71-75
bananas
Freguenc 12
= Notice how grouped See tables don’t tell you the exact value
of the observations — just the most and the least they could be. ,
» And notice how the different classes don’t overlap. In fact, there are
‘gaps’ between the classes because this is discrete data.
Grouped frequency tables are also used for continuous data. Since there are
no ‘gaps’ between possible data values for continuous variables, there can be
no gaps between classes in their grouped frequency tables either.
For example, this grouped frequency table
shows the masses of 50 potatoes.
= Inequalities have been used to define
Mass of
Tip: You don’t always the class boundaries (the upper and
potato (m, in g)
a
need to leave the top lower limits of each class). There are no
and bottom classes ‘gaps’ and no overlaps between classes.
without a lower and
= The smallest class doesn’t have a lower 100 nara |
< m < 200
upper limit — e.g. if you
know for a fact that very
small or very large data
limit — so very small potatoes can
still be put into one of the classes.
[200sm<300[16 |
isse «00a a
values are impossible. Similarly, the largest class doesn’t have
an upper limit.
Chapter 1 Data
This grouped frequency table shows the
lengths (to the nearest cm) of the same
Length of Frequency
potato (/, in cm)
50 potatoes.
= The shortest potato that could go in the aoe 3 Tip: Even though a
6-7 class would actually have a length of 6-7 u potato of length 7.5 cm
5.5 cm (since 5.5 cm would be rounded up 8-9 15 would go in the 8-9
to 6 cm when measuring to the nearest cm). 10-11 class, this is still the
| 16
upper class boundary of
So the lower class boundary of the 12-13 | 3 the 6-7 class.
6-7-Class-is-5.5-cm.
= The upper class boundary of the 6-7 class
is the same as the lower class boundary of Tip: The upper class
boundary of the 12-13
the 8-9 class — this is 7.5 cm. This means
class will be 13.5 cm.
there are never any gaps between classes.
= For each class, you can find the class width using this formula:
class width = upper class boundary — lower class boundary Tip: A class with a
lower class boundary
of 50 g and upper class
= And you can find the mid-point of a class using this formula:
boundary of 250 g can
lower class boundary + upper class boundary be written in different
mid-point = 5 ways.
So you might see:
e ‘100 — 200, to the
nearest 100 g’
Example 1 or eee
e ‘50 < mass < 250’
A researcher measures the length (to the nearest 10 cm) of 40 cars. e ‘50’, followed by
Her results are shown in the table. reneinn on ‘250— for the next
Frequency class, and so on.
Add four columns to the table to show: &
(i) the lower class boundaries 250-350 They all mean the same.
360-410
(ii) the upper class boundaries
(iii) the class widths 420-450
(iv) the class mid-points 460-500
=» The shortest car that measures 250 cm (to the nearest 10 cm) is 245 cm
long. So the lower class boundary of the 250-350 class is 245 cm.
= The upper class boundary of the 250-350 class is 355 cm (even though Tip: 355 cm must be the
a car measuring 355 cm would actually go into the 360-410 class). upper class boundary
for the 250-350 class,
= Once you have the class boundaries, use the above formulas to find the because no number less
class widths and the mid-points. than this would work.
Length |Frequency| lower class | upper class | class width |mid-point For example, the upper
(cm) boundary (cm)|boundary (cm) (cm) (cm) class boundary can’t
250-350 5 245 355 110 300 be 354.99 cm, because
360-410] 11 355 415 60 385 then a car of length
435 354.999 cm wouldn't fit
ACO ee 415 455 40 into any of the classes.
ACOs OOMe oem 455 505 50 480 And the upper class
boundary can’t be
354.999 cm, because
then a car of length
354.9999 cm wouldn’t
fit into any of the
classes. And so on.
Chapter 1 Data
A researcher measures the (nose-to-tip-of-tail) length of 40 cats.
His results are shown in the table. Length, /(mm)_| Frequency
Add two columns to the table to show:
: 3
(i) the lower class boundaries 250 = = a 7.
(ii) the upper class boundaries 350 s/<
410 <1< 450 iv
450 <1<500 5
eee
Q1 Amechanic collects the following information about cars he services:
Make, Mileage, Colour, Number of doors, Cost of service
Write down all the variables from this list that are:
a) qualitative
b) quantitative
»
Chapter 1 Data
Histograms
Histograms look like bar charts. However, because they're used to show Tip: The formula for
frequencies of continuous variables, there are no gaps between the bars. frequency density (f.d.)
can actually be written:
To plot a histogram, you plot the frequency density rather than the frequency
(as you would in a bar chart). Use this formula to find frequency density: _ 1. frequency
oe k ~* Class width’
frequency where k can be any
Frequency density Eee
= wiih number.
However, it usually
= Here’s some data showing the heights of 24 people. makes sense to use k = 1
when drawing your own
|Height (cm) lower class | upper class | class | Frequency | Frequency histograms.
boundary | boundary | width density
(cm) (cm ) You'll see on the next
130<h<150| 130 DO 11h Cees 0.15 page how to interpret
150<h<160| 150 roo ars histograms where
different values for k
160<h<165] 160 5 5
have been used.
.
170 (20
Tip: If you just plotted
= Here’s the same data
the frequency (rather
plotted as a histogram. than the frequency
1.0
density), then your graph
would look like this:
density
Frequency
noon
© Oo+nNme
oO
130 140 150 160 170 180 190 — = -
130 140 150 160 170 180 190
Height (cm) Height(c
= First draw a table showing the upper and lower class boundaries,
plus the frequency density:
Mass of | Lower class | Upper class | Class Frequency
parcel boundary aay oa ee density
100 - 200 50 250 | 200 100 0.5
300 - 400 250 |. 450 ~—'|~-200 250 ies
~ [500-700 | 450 - 750 a 600 2
|800-1100| 750 =o 1150 7 400 50 0.125
Chapter 1 Data
= Now you can draw the histogram.
Density
Frequency
- — tt ee
0 200 250 400 450 600
800
Mass of parcel (g)
Tip: In a bar chart, it’s On a histogram, the frequency ina class is proportional to the area of its bar.
the height of a bar that In other words, frequency = k x area of bar (where k is a number).
represents the frequency.
Example 2
This histogram shows
the heights of a group
of people.
There were 6 people between
155 and 160 cm tall.
Tip: You always need
to go through an initial
stage of working out
how area and frequency
are related — basically, Height (cm)
you're working out the
value of k that’s been a) How many people in the group are between 130 cm and 155 cm tall?
used. (See the ‘Tip’ Before looking at the 130-155 class, you need to use
about the frequency the information you have about the 155-160 class.
density formula on the
previous page.) # Work out the area of the bar for Width of bar = 160-155 =5
155 cm to 160 cm to find how Height of bar = 0.6 :
area and frequency are related. So area. of bar=5 x0.6=3
Tip: Or you can say that = An area of 3 represents a frequency of 6 — so ‘frequency = 2 x area’.
1 square unit represents
2 people. * Now you need to find the area of | Width of bar = 155 — 130 =25
the bar from 130 cm to 155 cm. Height of bar = 0.2
So area of bar=25x0.2=5
Chapter 1 Data
Example 3 eee
The histogram below shows the speeds of cars along astretch of road.
There were 26 cars travelling between 50 and 60 mph.
10
©
density
Frequency
You now need to find the area of the bars to the left of 25 mph
on the horizontal axis (because you need the
frequency of cars travelling at 25 mph or less).
« First, find the area of Width of bar = 20 —0 = 20
the 0-20 mph bar. Height of bar = 0.4
So area Onpah=20pe0.4.—= 8
This represents 8 cars. Tip: Finding half of the
area under the 20-30 bar
= Now find half the area of ‘ Width of bar = 30—20= 10 kes aasecmpucn
the 20-30 mph bar — this Height of bar = 1.8 eaviaate nomen
represents speeds from Total area of bar = 10 x 1.8= 18 travelling at 20-30 mph
20 mph up to 25 mph. So half the bar’s area = 18 = 2 =9 were going 25 mph or
This represents 9 cars. less while the other half
were travelling faster
, than 25 mph.
= Add these figures together Total travelling at 25 mph or less
to estimate the total number —~3+9=- 8
travelling at 25 mph or less.
Chapter 1 Data
Exercise 1.2
Q2 The histogram below shows the audition times (in seconds) for
contestants applying for a place on a television talent show.
The auditions for 54 contestants lasted between 30 and 45 seconds.
ioe)On
Qo
N Oo
on
NO °
ee w
density
Frequency
1.0
OS
(0) 1) AY 380 40 SO 6 70 80 90
Time (seconds)
Chapter 1 Data
Stem and leaf diagrams
Tip: The stem represents
Stem and leaf diagrams are another way to represent data. a much bigger part of
In fact, the data values themselves appear in a stem and leaf diagram. the value than the leaf.
= Each data value is split into a ‘stem’ and a ‘leaf’. i ea OAs
= A complete stem and leaf diagram looks something like this: stem might be 2 (which
represents 20), while the
These are the stems. => 1| 1 3 7 +—sJlhese are the lea leaf might be 8 (which
we 23 | Ae oo 758 a oh represents 8).
Bo | COON - Or with the data value
a) | ioe Key: 2 | 8 means 28 3.7, the stem might be
3 (representing 3), while
5 the leaf might be 7
= A stem and leaf diagram always needs a key to tell you how to read it. (representing 0.7).
So in the above stem and leaf diagram, the first row represents the values
11, 13 and 17, while the second row represents the values 24, 25, 25, 27
; a Tip: The leaves in each
and 28. You can read the other two rows in a similar way. rowi are usually sorted
into the correct order,
from lowest to highest.
Example 1
The lengths in metres of cars in a car park were measured to the nearest
10 cm. The results were: 2.9, 3.5, 4.0, 2.8, 4.1, 2.7, 3.1, 3.6, 3.8 and 3.7.
Draw a stem and leaf diagram to show the data.
y
» Use the numbers before the decimal point 3
as the stems. So the stems will look like this: ——— 4
# Use the numbers after the decimal point as the leaves. Tip: It’s best to do a
Add them to the correct row of the diagram one at a : : i é oo rough version first, and
time. It’s a good idea to cross out the numbers in the ie then put the leaves in
list (in pencil) as you add them to your diagram. order afterwards.
= Then put the digits after the decimal point in order. —,» 2|7 8 9
;
= And finally, add a key. Key 2|9 means2.9m] Dia els Daler
4|o 4
Example 2
The results of a survey of people’s heights are shown below in the form of
a stem and leaf diagram.
a) How many people’s heights were measured? 14/1 3779 (5) Paes on
: :
Count the leaves — thiswill be the number 15 22.255 (4) ee
Ip:
ee
e numbers
In
of people who had their height recorded. 1610-0 J lea (5) Het aes
This is 22 people . 17}5 6 89 (4) row. These aren’t always
all ener ae (4)
b) What was the:
[Key 15|2 means 152 cm| etme
(i) minimum height recorded?
The minimum height recorded was 141 cm.
(ii) maximum height recorded?
The maximum height recorded was 183 cm.
Chapter 1 Data
A back-to-back stem and leaf diagram is basically two normal stem and leaf
diagrams drawn either side of the same stems.
For example, this back-to-back stem and leaf diagram shows the ages of
two sets of newspaper readers — for the Daily Rant and the Daily Pry.
The stems are the numbers in the middle, between the two vertical lines.
Example 2
Construct a back-to-back stem and leaf diagram to represent >
the following data:
Boys’ test marks: 34, 27, 15, 39, 20, 26, 32, 37, 19, 22
Girls’ test marks: 21, 38, 37, 12, 27, 28, 39, 29, 25,5, 24, 31, 36
= Use the tens digits as the stems. Boys G irls
So the stems will look like this: 1
2
3
Tip: The smallest leaf = Add the units digits one Boys Girls
should be the one ata time as the leaves. OS eetwie
nearest the stem. So 2,6 0 74); 25) leet Seo. 5 24
the leaves for the boys
Fs Deets Avanl. SBinal aBtighes Ogle:
are in descending order
(getting smaller from = Then put the leaves Boys Girls
left to right), while the in order, and add a key. Husa) Tine
leaves for the girls are in EF iGyay: beledWi leutine ius Zp
ascending order (getting
bigger from left to right). ee Se ah ee ke te
Key 0|2|1 means 20 for Boys and 21 for Girls
Chapter 1 Data
Exercise 1.3
Q1__ The stem and leaf diagram 1 246 79 (4)
shows the number of members Palle De Je eb Gok 8. (8)
in each of the choirs taking SSS a7 20 (5)
par
ina t
choir competition. 4/145578 (6)
Deine Ome eo (5)
[Key: 1|2=12 members] 6 (0)
Fa Le (2)
a) How many choirs were competing in this competition altogether?
b) How many choirs had 45 members?
c) How many choirs had more than 52 members?
d)) What was the size of the largest choir?
Q2__ The stem and leaf diagram shows the Cole (2)
minimum recorded temperature (to 9147899 (5)
the nearest 0.1 °C) in 24 towns one POG tari a7 OPO)
a
June night. | Hl 0.4 5a (4)
Wo 2 ey # (3)
If you have two sets of data and then combine them, you can
find the mean of the combined set of data in the following way.
= Chapter 1 Data
b) Find the mean (x,) of the data values 10, 12, 14.
The mean of this data is x, — Dx
: wey lLOsrl 2c 1A.= 36
c) Find the mean (x) of the combined data set 2, 3,4, 5, 6, 10, 12, 14. Tipsevoulcanitinediida
The Se of the combined data set is X, and ¥, together.
=_ 2* _ 24+34+4+4+546+10+1
nonin = tpt tt * =2 And you can’t just find
the mean of xX, and x).
If you know adata set of size n, has mean x, and another data set
of size n, has mean X,, then the combined mean is ¥, where:
Example 4
A data set consisting of 5 values has mean X, = 4.
A second data set consisting of 3 values has mean x, = 12. Tip: This is the same
problem as in
Find the mean (x) of the combined data set.
Example 3, only it’s
= Here, n, = 5 and-n, = 3. been solved using the
Exercise 2.1
Q1_ Katia visits 12 shops and records the price of a loaf of bread.
Her results are shown in the table below.
£1.08 | £1.15 | £1.25 | £1.19 | £1.26
|£1.24 |
£1.15 | £1.09 | £1.16 | £1.20 | £1.05
| £1.10 |
Work out the mean price of a loaf of bread in these shops.
Q2 Twenty students sit a maths exam. The teacher records their marks
{x,,.-+; X94 and calculates that Die 07
Work out the mean mark for these students.
Chapter 1 Data
The mode and the median
There are two other important measures of location you need to know about
— the mode and the median.
Tip: The mode is often Mode = most frequently occurring data value. } _
called the modal value.
eT a ————-
First put your n data values in order, then find the position of the median in
the ordered list. There are two possibilities:
i) sur S is a whole number (i.e. 1 is even), then the median is halfway
between the values in this position and the position above.
(ii) if 5 is not a whole number (i.e. 7 is odd), round it up to find the
position of the median. :
Examples
Find the medians of the following data sets.
a). 2;3;6, 276905 87
= Put the values in order first: 2, 2, 3, 3, 6, 6, 7, 8,9
« There are 9 values, and 5 = : = 4.5. Rounding this up to 5 means
that the median is the 5th value in the ordered list — median=6.
b) 4,3, 11,
4,10, 9, 3,8, 7,8
" Put the values in order first: 3, 3, 4, 4, 7, 8, 8,9, 10, 11
" There are 10 values, and 5 = u = 5,
Tip: The value halfway * So the median is halfway between the 5th value in the ordered list fe—7 }
between two numbers is
their mean. and the 6th value (= 8). So the median=7.5.
Chapter 1 Data
If your data is in a frequency table, then the mode and the median are still
easy to find as long as the data isn’t grouped.
Finding the median and mode from a stem and leaf diagram also shouldn't
cause too many problems.
Chapter 1 Data
Exercise 2.2 f
a
Q5__ Kwasi and Ben each check the download speeds on their computers
on a number of different occasions.
Their results are shown in this back-to-back stem and leaf diagram.
Kwasi Ben
3 5 gate (3)
(6) 7 APD OF 0 a ow eee (5)
(5) G28 4.3.0 |esis oO (4)
(6) OOS 7 61 G1 ee ae
(2) G7 A eee (3)
Key: 3 |5|9 represents a speed of 5.3 Mbit/s for Kwasi
and a speed of 5.9 Mbit/s for Ben.
a) Write down the modal download speed(s) for: (i) Kwasi (ii) Ben
b) Find the median download speed for: i
(i) Kwasi ii) Ben
Chapter 1 Data
Averages of grouped data
If you have grouped data, you can only estimate the mean and median.
This is because the grouping means you no longer have the exact data values.
And instead of a mode, you can only find a modal class.
Modal class
= To find a modal class, you need to find the class with Tip: See p7 for more
the highest frequency density. about frequency density.
= If all the classes are the same width, then this will just
be the class with the highest frequency.
Find the modal class for this data showing the heights of various shrubs.
Height of shrub
to nearest cm
i
# In this example, all the classes are the same width (= 10 cm).
= So the modal class is the class with the highest frequency.
» This means the modal class is 31-40 cm.
Mean
= To find an estimate of the mean, you assume that every reading in a class
takes the value of the class mid-point.
= Then you can use the formula ¥ = x a Tip: This is the formula
from page 14.
Chapter 1 Data
Estimate the median length of the newts described in this table. Tip: You don’t always
need to add a row
Length (to nearest cm) °-
2 Yo
7 to the table to show
Number of newts cumulative frequency.
Here, the numbers are
« There aren =3+18+12 + 4 = 37 data values in total. small enough that you
So 5 = 18.5 — meaning the median will be in the 3-5 class. can quickly tell which
class the median lies in.
= Now draw the picture of the eS
class containing the median. —.2.5 & m 55 You can see that 5
is greater than the
= So you need to solve: 3 a, 18.5 21 cumulative frequency
m=2.5) 2 le5
—3 of the first class, but
Sen LSD °, less than the cumulative
frequency of the second
This gives moe. — 15) Ong ==) x D2 +25 = 5.1cm(to 1d.p.) — so the median is in
the second class.
Chapter 1 Data
3. Dispersion and Skewness
SESS
A measure of location tells you roughly where the centre of the data lies. Learning Objectives:
Dispersion, on the other hand, tells you how spread out the data values are.
And skewness tells you how symmetrically distributed they are. " Be able to calculate
the range, interquartile
range, and
Range, interquartile range interpercentile range.
and interpercentile range Be able to determine
whether a reading is
Range an outlier.
The range is about the simplest measure of dispersion you can imagine. Be able to draw and
interpret box plots.
Range = highest value — lowest value = Be able to calculate
and interpret variance
But the range is heavily affected by extreme values, so it isn’t really the most
and standard
useful way to measure dispersion. deviation (including
with the use of
interquartile range coding). _
A more useful way to measure dispersion,is to use the interquartile range Be able to calculate
— but first you have to find the quartiles. You’ve already seen how the and interpret
median divides a data set into two halves. The quartiles are similar — there coefficients of
are three quartiles altogether (usually labelled Q,, Q, and Q,) and they divide skewness.
the data into four parts.
= Q, is the lower quartile — 25% of the data is less than or
equal to the lower quartile.
= Q, is the median — 50% of the data is less than or equal to the median.
= Q, is the upper quartile — 75% of the data is less than
or equal to the upper quartile.
For example, the values in the data set below have been sorted so that they’re
Tip: There are various
in numerical order, starting with the smallest. The three quartiles are shown.
ways you can find the
quartiles, and they
Paes 444 5[5 5 6 § 7 7.9 Be sometimes give different
Oe results. But if you use
Q, Q, Q,
the methods below,
you'll be fine.
The quartiles are worked out in a similar way to the median
— by first finding their position in the ordered list of data values.
Tip: This is the same
To find the position of the lower quartile (Q,), first work out r as the method used on
page 16 for finding the
= if a is a whole number, then the lower quartile is halfway median, only with na5
between the values in this position and the position above. replaced by 7.
= if a is not a whole number, round it up to find the position of
the lower quartile.
Tip: This is the same
as the method used on
To find the position of the upper quartile (Q,), first work out an
page 16 for finding the
a if = is a whole number, then the upper quartile is halfway median, only with 4
between the values in this position and the position above. replaced by on,
a) Estimate the lower and upper quartiles for the tree heights in this table.
H eight of tree 0-5 11-15
Tip: This is the same to nearest m
data as on p19-20.
Number of trees Lindl il
Chapter 1 Data
" Now draw a picture of the class containing Q, — just show
the important numbers you're going to need.
Put heights one side of the ‘ b, >
line (I’ve put them on top) 0 a, Q
Tip: Remember... 5,
5.5 should match the class
and cumulative frequencies
[+ ________
on the other side. frequency from the
0 —_— a, 15 (=3) 2° original frequency table.
+ ——->
b,
= Then solve the equation 4 = &.,
b, by
# First, calculate = = fe = A.
aoc OMisin ihe class 11-15.
= Now draw your picture of rl
the class containing Q,. 10.5 a, Oo. 155
{2
__—_-}—____________
43 a, 45 a4
Se ee es
b,
2 eubeathe eG eee
ubstituting numbers into Re. oe a
ee a5
UL 45 43
ess
- Or 105° 62 i Ds
And so ~—— = 74,0 Qa = 10.5 +5 x & = 11.4m(to 1d.p.)
Interpercentile range
You've seen how the quartiles divide the data into four parts, where each part Tip: The median is the
50th percentile and Q,
contains the same number of data values. Percentiles are similar, but they
is the 25th percentile,
divide the data into 100 parts. and so on.
The position of the xth percentile (P_) is700 x total frequency (7).
For example, to find the 11th percentile in a data set containing a total
of 200 values:
= 11
Calculate 700 200 = 22.
Tip: When you're
= Use linear interpolation to estimate the value in this position in the finding percentiles, the
ordered list of data values. data set is usually large,
and will probably be
You can find interpercentile ranges by subtracting two percentiles. grouped.
The a% to b% interpercentile range is P, — P..
= For example, the 20% to 80% interpercentile range is P,, — P,o.
Chapter 1 Data
A reptile specialist records the mass
erred
(m, in kilograms) of 150 tortoises.
0.67 ma 110)eSTa
Her results are shown in the table.
ALOEme 4[7 GEE |
14<m<18Se
i
ee
a) Estimate the 10th percentile for this data.
» It'll help to add a column showing
cumulative frequency. Mass (kg) | Frequency| frequenc
. AO! Ol - 02 kin50:6 | 27
Now calculate T00
0.6sm<10| 43 |
1.0<m<1.4 a
* Using the cumulative frequency,
ei - 136
you can see that this will be in
the ‘0.2 <m< 0.6’ class.
* So draw a picture of this class b,
Tip: As always with showing the important masses ——> Ae
linear ee and cumulative frequencies. OP 2 ae oe
Ao
solve2 = 2, tae
«>
15 7
FE b;
>
b,
= The equation45 =n
= gives 02 eS
5— which means:
1 . a, .
by
142 & P,, “18 R
105, 5, 135 136
b,
= Th e€ equation
tion if
2 =&s gi
gives
Po— 1.4 _ Le
13
ea,—5
136= 105
105°
AS ae1
Chapter 1 Data
Exercise 3.1
Ql The diameters (in miles) of the eight planets in the Solar System are
given below:
3032, 7521, 7926, 4222, 88846, 74898, 31763, 30778
For this data set, calculate:
a) the range
b) (i) the lower quartile (Q))
(ii) the upper quartile (Q,)
(iii) the interquartile range (IQR)
Q2 Each of the three data sets below shows the speeds (in mph) of
18 different cars observed at a certain time and place.
In town at 8:45 am: ;
Peon la loool, 10,4 oy tor tonto la, Sala, 17, 18, 15
In town at 10:45 am:
BO 3 1 29,3025, 29, 33e4e son al
On the motorway at 1 pm:
CO OM ae OOa) 4, (O97 1 5, Oy 99 0) ly OF nO, / 3.7 Opel Dy 4
UXBRIDGE COLLEGE
LEARNING CENTRE
Chapter 1 Data
Outliers and box plots
Outliers
Tip: There are various An outlier is a freak piece of data that lies a long way from the
ways to calculate the majority of the readings in a data set.
values of the fences, To decide whether a reading is an outlier, you have to test whether it
but in an exam you will falls outside certain limits, called fences.
always be told what
method to use. A very common way to test for outliers is to use the following values
for the fences:
= for the lower fence, use the value Q, — (1.5 x IQR)
= for the upper fence, use the value Q, + (1.5 x IQR)
The lower and upper quartiles of a data set are 70 and 100.
Use the fences Q, - (1.5 x IQR) and Q, + (1.5 x IQR) to
decide whether the data values 30 and 210 are outliers.
= First work out the IQR: Q, - Q, = 100 — 70 = 30
= Then you can find where your fences are.
Lower fence: Q>— (1.5 x IQR) = 70 =(1.5 x 30) =25
Upper fence: Q, + (1.5 x IQR) = 100 + (1.5 x 30) = 145
# 30 is inside the lower fence, so it is not an outlier.
210 is outside the upper fence, so it is an outlier.
Box plots
Tip: Box plots are
sometimes called ‘box A box plot is a kind of ‘visual summary’ of a set of data. Box plots show the
and whisker diagrams’ median, quartiles and outliers in an easy-to-look-at kind of Way.
(where the whiskers are They look like this:
the horizontal lines at
either end of the box).
Lowest value Lower Median Upper quartile
(that isn’t an outlier) quartile Highest value
a
Outliers marked \ (that isn’t an outlier)
with crosses
Chapter 1 Data
The back-to-back stem and leaf diagram below shows the IQs of year 11
students at two schools, Vale Academy and Cherry Comprehensive.
Vale Academy Cherry Comprehensive
(1) BAO 3 (1)
(8) 9 8 7 665 3.1/10/5 8 9 (3)
(5) SO 4 21 tl OM 2° 3"5' 76'S *9"(7)
(3) ime AS 22.) 0. 0 153 4 678 (7)
(2) 61/13 |2 4 (2)
(1) 0|14/ 4 (1)
Chapter 1 Data
. i== 5, so the lower quartile is halfway between
the 5th and 6th data values — Q, = 106.
= 15, so the upper quartile is halfway between
“B's
the 15th and 16th data values — Q,= 123.
=» The interquartile range, IQR is Q, — Q, = 123 — 106 = 17.
= This gives a lower fence of Q, — (1.5 x IQR) = 106 — (1.5 x 17) = 80.5.
And an upper fence of Q, + (1.5 x IQR) = 123 + (1.5 x 17) = 148.5.
This means there are no outliers at Vale Academy.
® So the box plot PLDT
mel
ee Fae
PEEP CE EEE: PAPE ERP
SEER
EEE
Re
EEEEEE
ae
OOKS IIKe MIs:
Exercise 3.2
Q1__The lower and upper quartiles of a data set are 19 and 31.
Use the fences Q, — (1.5 x IQR) and Q, + (1.5 x IQR) to decide
whether the data values 4 and 52 are outliers.
Q2__ A set of data was analysed and the following values were found.
minimum value = 4, maximum value = 49
Q, = 16, median = 24, Q, = 37
a) Find the interquartile range.
b) Are there any outliers in this data set?
Use the values Q, — (1.5 x IQR) and Q, + (1.5 x IQR) as fences.
c) Draw a box plot to illustrate the data set.
I Chapter 1 Data
Q4 The numbers of items of junk mail received in a day by people living
in the towns of
Goossea and Pigham Goossea Pigham
are shown in this (3) Oiee 0-0
back-to-back stem Cea 7 (2)
andleaf diagram. Se rae) 2) 00. 2.6 G (4)
(3) Dee em so) 5 OS)
(2) AWA) 1 2 6 8 (4)
(1) O52 455 (3)
6
(1) BZ
Key: 4|2|0 represents 24 items of junk mail in Goossea
and 20 items of junk mail in Pigham
variance =
De — ¥F Or
,
Varlance =
ee
Tip: Although the
formula on the right is
= Here, the x-values are the data, ¥ is the mean, and 7 is the total number easier to use, the one on
of data values. the left makes it easier
= The two formulas above are equivalent to each other — you can rearrange to understand what the
one to get the other (although you won't be asked to do this in the exam). variance actually is —
it’s ‘the average of the
squared deviations from
Dees = 1 (x? — 2+ F) (multiplying out brackets)
the mean’.
=15=2: ye Ly;x (writing as 3 summations)
BIE cscs
Example
Find the variance and standard deviation of the following data set:
Tip: The alternative 2,3, 4,4, 6911712
If your data is given in a frequency table, then the variance formula can be
written like this, where fis the frequency of each x.
= Then take the square root of the variance to find the standard deviation:
Standard deviation = /1.666... = 1.29 (to 3s.f.)
If your data is grouped, then you can only estimate the variance and standard Tip: This is the same
assumption you used
deviation (because you don’t know the actual data values — see p19).
when you found the
In this case, assume that each data value is equal to the class mid-point. mean of grouped data
Then go through the same steps as in the example above. on p19.
Chapter 1 Data
sets to
You can use the variances (or standard deviations) of two small data
ed set
find the overall variance (or standard deviation) of the larger, combin
of data. But it’s a bit fiddly, and so you need to do this very carefully.
Tip: You could use the = Call the boys’ heights x and the girls’ heights y.
formula from p15 | = First, write down the formula for the
to find the combined mean, and substitute in the mean for . yx
Meanie Gens ls | the boys (x) and the total number of Beg eed a
| oA et h m = — 1800
=c=_ 1X + h% values (n). This gives you the su
a of all the boys’ heights, x. 2
& See y= 22 5165 = 4% = py = 1485
Tip: Round the fraction of the girls’ heights, D/y.
one pase our = So the sum of the heights
NSS). URSIN, of the combined group pete Dy = 01800 451485 285
you use the mean in
more calculations, use
of boys and girls is:
the fraction (or your : The total number of boys and girls is10+9=19. 3985
calculator’s memory) so “T9 = 172.9cm
So the mean height of the combined group is:
you don't lose accuracy.
Now you need to go through a very similar process to find the sum of
the squares of the combined set of heights.
* The standard deviation for the boys is 10, so the variance = 10? = 100.
The standard deviation for the girls is 8, so the variance = 8? = 64.
Tip: Here, variance. is : * Substitute the boys’ variance into the formula for the variance.
the Warianeetontne boys’ This will give you the sum of the squares of their heights, >)x’.
heights, and variance, is ee “s ye
the variance of the girls’ Vallance t= 1 Oa 130100
heights.
i So )ix?= 10 x (180?+ 100)= 325000
= Now do the same for the girls to find >)y’.
variance, = Dow
fi 7 ee
2
64
So >}y = 9 x (165’+ 64) = 245601
= So the sum of the squares for the combined group of boys and girls is:
Dx + yy’ = 325000 + 245601 = 570601
Chapter 1 Data
* You now have all the information you need to find the variance for the
combined group. Tip: Don't use the
{ 570601 7 rounded mean (172.9)
variance = “a — (2282) = 139.0415... =(139°0emr (to 1d.p.) — you'll lose accuracy.
= And finally the standard deviation of the boys and the girls is:
standard deviation = /139.0415... = 11.8 cm (to 1d.p.)
You've seen that different measures of location are useful in different ways
(see page 22). The same is true of measures of dispersion — the range,
interquartile range, variance and standard deviation all have pros and cons.
Range
= The range is the easiest measure of dispersion to calculate.
= But it’s heavily affected by even a single extreme value / outlier.
And it depends on only two data values — it doesn’t tell you anything
about how spread out the rest of the values are.
interquartile range
= |t’s not affected by extreme values — so if your data contains outliers, then
the interquartile range is a good measure of dispersion to use.
® It’s fairly tricky to work out.
Variance
= The variance depends on all the data values — so no values are ‘ignored’.
= But it’s tricky to work out, and is affected by extreme values / outliers
(meaning that ‘freak’ values have more influence than they deserve).
= It’s also expressed in different units from the actual data values, so it can
be difficult to interpret.
Standard deviation
= Like the variance, the standard deviation depends on all the data values —
so no values are ‘ignored’.
= |t has the same units as the data values themselves, and so is easier to
understand and interpret. »
= But it is also tricky to work out, and affected by extreme values / outliers.
Exercise 3.3
Q1_ The attendance figures (x) for Wessex Football Club’s first six matches
of the season were: 756, 755, 764, 778, 754, 759.
a) Find the mean (x) of these attendance figures.
b) Calculate the sum of the squares of the attendance figures, Die
c) Use your answers to find the variance of the attendance figures.
d) Hence find the standard deviation of the attendance figures.
e) ) Explain why the standard deviation is a reasonable measure of
dispersion to use with this data.
Chapter 1 Data
Q2 The number of runners in the first eight Broughton marathons were:
322107 5722).88, 998R7Age
a) Calculate the variance for these figures.
b) Find the standard deviation of the number of runners.
Q3 The figures for the number of TVs (x) in the households of 20 students
are shown in the table below.
frequency,
f]7|8[4 |1)
a) Find the mean number of TVs (x) in the 20 households.
b) By adding rows showing x? and fx? to the table, find >) fi’.
c) Calculate the variance for the data above.
d ) Hence find the standard deviation.
frequency]
1 [627 sear a ee
a) Add four extra rows to the table showing:
(i) the class mid-points (x)
(ii) fc (ili) x? (iv) fx?
b) Use your table to find: (i) }}f (ii) peyitibe. (iii) ope
c) Use your answers to estimate the variance of the pulse rates.
[s0sw<90
|6
a) Estimate the variance for this data.
b) Estimate the standard deviation.
Find the mean and standard deviation of: 1862 020, 1862040, 1862010
and 1862050.
« All the original data values (call them x) start with the same four digits
(1862) — so start by subtracting 1 862 000 from every reading to leave
20, 40, 10 and 50.
= You can then make life even simpler by dividing by 10 — giving 2, 4, 1
and 5. These are the coded data values (call them y).
= So putting those steps together, each x-value is ey = 1862000 Tip: This means
related to a corresponding y-value by the equation: y = “——zy5———— a = 1862000 and
bieallO:
= Now work out the mean and
standard deviation of the y= 2+ 44145512
4 ad
(easy-to-use) coded values.
2 2 2 2
Tip: Remember...
standard deviation of y = j2teet te — 3? standard deviation
= /46
— 8fig.
to3sig.
9 = /25 = 1.5
= Then find the mean and standard deviation of the original values
using the formulas above.
y= oe a + by. This means: x = 1862000+ 10y
— 1862000
+(10 x 3)
= 1862030
Chapter 1 Data
ae f
» And standard deviation ofy = standard deviationofx
So standard deviation ofx = b x standard deviation of y
= 10x 1.58 = 15.8 (to3 sig. fig.)
Carry out the method in exactly the same way with grouped data. Remember,
with grouped data, you assume that all the readings equal the class mid-point,
and so this is the x-value that you use with the coding equation y = ~ :
. . . — a
Example oe oes
* Make a new table showing the class mid-points (x) of the original data,
and the coded class mid-points (vy). Also include rows for fy, ? and fy”.
Length of interview, to nearest minute
Now use your table to find the mean of the coded values (7).
« The number of coded values is: })f = 17 + 21+27+15 =80
= The sum of the coded values is: }\fy =0 + 21+54+45 = 120
= So the mean of the coded values is: y = dS = fe = 1.5
ae
Now for the standard deviation...
2S fy =0+ 21 = 108+
135 = 264
2
Chapter 1 Data
Sometimes, you won't have the data itself — just some summations.
Now you can find the mean and standard deviation of the original data.
= d= 10(¢— 93.5), sot = 93.5 + 4 = 93.5, ie. & = £93.50
= standard deviation of d = 10 x standard deviation of c,
so standard deviation of c =
standard deviation
10 ofd _ 316.0
ia _ 31 69
There are lots of different ways to ask questions about this topic.
Example ee
Chapter 1 Data
Exercise Cy
tion of the
Ql In each case below, find the mean and standard devia
original data sets.
a) A set of data values (x) are coded usingy = x — 500.
ard deviation
The mean of the coded data (7) is 12, and the stand
of the coded data is 4.22.
Q1 Hint: In part b), b) A set of data values (x) are coded usingy = 4x.
tion
=| The mean of the coded data (7) is 6, and the standard devia
of the coded data is 2.14.
x — 20 000
c) A set of data values (x) are coded using y = TS
The mean of the coded data (7) is 12.4, and the standard
deviation of the coded data is 1.34.
Q2 In each case use the given coding to find the mean and standard
deviation of the given data.
a) 2003, 2007, 2008" Use the coding: y = x — 2000
Q3 The widths (in cm) of 10 sunflower seeds in a packet are given below.
0.61, 0.67, 0.63, 0.63, 0.66, 0.65, 0.64, 0.68, 0.64, 0.62
a) Code the data values above (x) to form a new data set consisting
of integer values (vy) between 1 and 10.
b) Find the mean and standard deviation of your coded values (y).
c) Use your answers to find the mean and standard deviation of the
original values (x).
»
Q4 The table below displays the weight (x, to the nearest gram) of 12
items on a production line.
Weight (to nearest g)
Frequenc
Chapter 1 Data
Skewness
Skewness tells you how symmetrical or asymmetrical (lopsided) the
distribution of your data is.
A primate keeper at a zoo records the mass (in kilograms) of fruit eaten by
50 monkeys and apes in one morning. Her results are shown in the table.
Mass (kg) | 0.2-0.6|0.6-1.0| 1.0-1.4 |1.4-1.8 |1.8-2.2 |2.2-2.6 |2.6-3.0
[Frequency| 10 14 11 8 4 2 1
Draw a histogram of the data, and state the type of skewness present.
14 Tip: All the classes are
# Draw the histogram. 12 the same width, so I’ve
plotted the frequency,
# This has a ‘tail’ on the =°
rather than the
right-hand side. © frequency density.
Frequency
= Most of the values are a
Chapter 1 Data
You don’t always have to judge skewness by eye, though.
p-one-end’
You can use a coefficient of skewness to measure how ‘all-u
Tip: These two your data is. There are a couple of important coefficients of skewness
definitions should give — the first of these is Pearson’s coefficient of skewness.
approximatel y the same
3(me an — median)
answer, because for
nearly all distributions,
icient t of
Pearson’s coefficien o skewness = —— 7_
standard 4devia
_7 tion
‘mean — mode’ and
_ mean — mode
‘3 x (mean — median)’
standard deviation
are approximately equal
(see p41).
(In the exam, you'll be Pearson’s coefficient of skewness usually lies between —3 and +3.
told what formula to = Positive values mean that your data is positively skewed.
use.)
= Negative values mean that your data is negatively skewed.
= Asymmetrical distribution would give a skewness coefficient of zero.
= The more highly skewed your data is, the closer to 3 or -3 Pearson’s
coefficient of skewness will be.
Although it’s calculated using a different formula, you interpret it in much the
same way as Pearson’s coefficient of skewness.
= A positive quartile coefficient of skewness means your data is positively
skewed, while a negative quartile coefficient of skewness means your data
is negatively skewed.
= The larger the quartile coefficient of skewness, the more highly skewed
your data is.
= A value of zero means your data is symmetrically distributed.
Chapter 1 Data
A box plot can quickly tell you the skew of a data set. Each of these three
histograms has its corresponding box plot shown underneath.
Chapter 1 Data
1 |9 e
The ages of 15 lottery es
winners are given in this : ;:,
t
7a Semen
d leaf diagram.
4|1479 (4)
Sil ez ese/, (4)
Key 3|8 3 means 38 years of age
= The sum of the ages is })x = 600. So the mean age is a = 40.
= The sum of the squares is })x° = 26102.
So the standard deviation is oa — 40? = 11.84 (to 2 d.p.).
= There are 15 values, so the median is the 8th value in the ordered list.
This means the median = 41.
; — 4]
= So Pearson’s coefficient of skewness isSGU = -0.253 (to 3 dip.)
Exercise 3.5
Chapter 1 Data
Q2__ Find Pearson’s coefficient of skewness for each distribution below,
and use your answer to say whether the distribution is positively
skewed, negatively skewed or symmetrical.
a) mean=7 mode = 5 standard deviation = 1.6
b) mean = 12 median = 13.5 standard deviation = 1.8
c). mean = 14 mode = 14 standard deviation = 2.37
d)) mean =—-9 median = —8 standard deviation = 2.5
Q3__ Find the quartile coefficient of skewness for each distribution below,
and use your answer to say whether the distribution is positively
skewed, negatively skewed or symmetrical.
Se) Ag MC 15
b) Q,=35 Q.=119 Q,=194
c) lower quartile =—28 median=-23 — upper quartile =-10
d) lower quartile= 2.4 median =5.3 upper quartile = 8.2
Q4_ The following data shows the winnings (in pounds)
of a bingo player one summer: 1, 6, 9, 10, 10, 10, 11, 11
a) Find the mean, median and standard deviation of the data.
3(mean — median)
b) Find Pearson’s coefficient of skewness using
standard deviation ©
c) Use your answer to describe the skewness of the distribution.
Q5__ The salaries (in thousands of pounds) of a small firm’s ten
employees are shown below.
2A Bil G2 24 22,722, D4, 24127;,60
mean — mode
a) Calculate Pearson’s coefficient of skewness using standard deviation’
b) Use your answer to describe the skewness of the distribution.
Comparing distributions
It’s really common in the exam to be asked to compare two distributions.
To do this, there are different kinds of things you can say, depending on what
information you have about the distributions. You can:
= Compare measures of location, such as the mean, median or mode.
— You'll need to say which distribution has the higher mean/median/mode,
Tip: ‘...in the context of
and by how much. the question’ means you
— Then say what this means in the context of the question. need to use the same
= Compare measures of dispersion, such as variance, standard deviation, ‘setting’ in your answer
as the question uses.
range, interquartile range, or interpercentile range.
— You'll need to say which distribution’s data values are more ‘tightly For example, if the
packed’, or which distribution’s values are more spread out. question is all about the
weights of tigers in a
— Then say what this means in the context of the question. zoo, then you need to
= Compare measures of skewness — such as Pearson's coefficient of talk about the weights of
skewness or the quartile coefficient of skewness. tigers in the zoo in your
answer as well.
— For each distribution, say whether it is symmetrical, positively skewed,
or negatively skewed, and whether it is highly skewed, or just slightly.
— ‘Then say what this means in the context of the question.
Chapter 1 Data
This table summarises the marks obtained by a group of students in
Maths ‘Calculator’ and ‘Non-calculator’ papers.
a) Comment on the location and dispersion of the distributions.
Calculator paper oo, Sen Non-calculator paper
Sais hecasesdl Lower quartile, Q, | 35
35
ri nae
Location:
« The mean and the median are both higher for the Calculator paper
(the mean is approximately 9 marks higher, while the median is 16
marks higher).
= This means that scores were generally higher on the Calculator paper.
Dispersion:
= The interquartile range (IQR) for the Calculator paper is
Q,
-Q, = 70
-40 = 30.
= The interquartile range (IQR) for the Non-calculator paper is
Q,=Q, 256-35 =21.
= So the IQR and the standard deviation are both higher for the
Calculator paper.
« This means the scores on the Calculator paper are more spread out
than those for the Non-calculator paper.
Chapter 1 Data
You may have to base your comparison on a graph instead of numbers.
Location:
= The median is more than 25 g higher for Harvest 1.
« This shows that the tomatoes in Harvest 1 were generally heavier.
Dispersion:
« The interquartile range (IQR) and the range for Harvest 1 are higher
than those for Harvest 2.
# This shows that the masses of the tomatoes in Harvest 1 were more
varied than the masses of the tomatoes in Harvest 2. ‘ nes
Tip: This question is
Skewness: to do with harvests of
=" The distribution of the tomatoes’ masses in Harvest 1 is negatively tomatoes, so you need
skewed, showing that there isa tail on the left of this distribution. ee INES Se2
= This means the masses of the tomatoes were generally relatively high,
but there were some much lighter tomatoes as well.
» Harvest 2 is very different — for this harvest, the distribution of the
tomatoes’ masses is positively skewed, showing that there is a tail on
the right of this distribution.
# This means the masses of the tomatoes were generally relatively low,
but there were some much heavier tomatoes as well.
Exercise 3.6
Q1__The box plots below represent the prices of shoes (in £) from two
different shops.
A et
oe eee ee
OS eta le. ee
Ome Ome On eoC 1405950) 460" 70) 80
Price (£)
Chapter 1 Data
Q2 This back-to-back stem and leaf diagram shows the daily sales of two
competing small cafés over a 14-day period.
Café A CaféB
(0) 5 Ss 4.9 (3)
(1) BoiNGeH ht" 2 (2)
(6) Oc Fre | 7 Ow (2)
(7) Qe Zeats 4 4 Aug de le Be 86. (6°%7 (3)
(0) 9/05 (2)
(0) 10 | 8 (1)
(0) 1S (1)
Key: 8|6]|1 represents £68 of sales in CaféA
and £61 of sales in Café B
a) )( i) Calculate the mean sales and the median sales for both cafés.
(ii) Use your answers to compare the locations of the two data sets.
b) (i) Calculate the standard deviation for both cafés’ sales.
(ii) Use your answers to compare the dispersion of the two data sets.
c) )( (i) Find the quartiles for both cafés.
Hence find the quartile coefficient of skewness of each data set.
(ii) Use your answers to compare the skewness of the two data sets.
Q3 10 men and 10 women were asked how many hours of sleep they got
on a typical night. The results are shown below.
Men: 0.9, OO, Oe OneD
Wroirnen?s 950 67825 eld 2).0,nGal ar
a) (i) Calculate the mean and median of the individual data sets.
(i!) Use your answers to compare the locations of the two data sets.
b) (i) Calculate the standard deviation of the individual data sets.
(ii) Use your answers to compare the dispersion of the two data sets.
) i) Find Pearson’s coefficient of skewness
cM
for each data set, using 3(mean — median)
standard deviation *
(ii) Use your answers to compare the skewness of the two data sets.
Chapter 1 Data
Review Exercise — Chapter 1
_ QI The stem and leaf diagram on the right represents the lengths Taee
(in cm) of 15 bananas. Write down the original data asalist. S| F295
TA} 3 (3256988
Tea \e2e29
Key 12|8 means 12.8cm| 16] 1 1 2 3
TS Or 2
Q4 The speeds of 60 cars travelling in a 40 mph speed limit area were measured to the nearest mph.
The data is summarised in this table.
Calculate estimates of the mean and median, po ea
Een 34 |35- IOs =|
and state the modal class. Frequency 2
Q5__ Find the mean and standard deviation of the following mumbers> it, F214, 79 21R ose.
Q6 The scores in an IQ test for 50 people are recorded in the table below.
Score cee vats Pie 20 |121 27 L281
Frequenc 22 9 Z
Q7 For aset of data, n = 100, }(x — 20) = 125, and > (x — 207 = 221.
Find the mean and standard deviation of x.
Chapter 1 Data
Q8_ The time taken (to the nearest minute) for a commuter to travel to work on
20 consecutive work days is recorded in the table.
Use the codingy = x — 35.5 to estimate the mean and standard deviation of
the times, where x is the class mid-point.
Time to nearest minute | 30-33 | 34-37
Frequencyi Liat |. @ Be Foun eee pa
Q9 A data value is considered an outlier if it’s more than 1.5 times the IQR above the upper quartile
or more than 1.5 times the IQR below the lower quartile. If the lower and upper quartiles of a data
set are 62 and 88, decide which of the following data values are outliers:
a) 124 b) 131 c) 28
Q11 A set of data has a median of 132, a lower quartile of 86 and an upper quartile of 150.
Calculate the quartile coefficient of skewness, and draw a possible sketch of the distribution.
Q12 Two workers iron clothes. Each irons 10 items, and records the time it takes for each,
to the nearest minute:
WorkerA: 35271045
5 4 12
WorkerBey 494<8 677.8.9.10 14 9
a) For worker A’s times. Find:
(i) the median
(ii) the lower and upper quartiles
(iii) whether any of these values are outliers, using the values
Q, — 1.5 x IQR and Q, + 1.5 x IQR as fences.
b) On graph paper draw, using the same scale, two box plots
to represent the times of each worker.
c) Make one statement comparing the two sets of data.
d) Which worker would be better to employ? Give a reason for your
answer.
Chapter 1 Data
Exam-Style Questions — Chapter 1
The profits of 100 businesses are given in the table.
Profit, £x million _| Number of businesses
4.0<x<5.0 24
5.0<x< 5.5
5.5<x< 6.0 21
6.0<x<6.5
65<x<8.0 |
a) Represent the data in a histogram. (3 marks)
A group of 19 people played a game. The scores, x, that the people achieved are
summarised by:
>\(x — 30) = 228 and +(x — 30 = 3040
a) Calculate the mean and the standard deviation of the 19 scores. (3 marks)
b) Find the standard deviation of the masses of the brand A chocolate drops. rrens)
d) Find the standard deviation of the masses of all 50 chocolate drops. (4 marks)
Chapter 1 Data
4 The stem and leaf diagram shows the test marks for 30 male students
and 16 female students.
Male students Female students
eee
5
OFT. 6 6 l 3 4x5. 60729
99 1 9
Key 5/6|2 means Male student test mark 65
and Female student test mark 62
The table shows the number of hits received by people at a paint ball party.
[Prequendy.)2
1/74] 65k78) 6 |Be aR ae | oO Oe
Find the median and mode number of hits. (3 marks)
An outlier is a data value which is more than 1.5 x (Q, — Q,) above Q, or
more than 1.5 x (Q,—Q,) below Q.. Is 25 an outlier? Show your working. peri
Sketch a box plot of the distribution and comment on any skewness. Oman
How would the shape of the distribution be affected if the value of 25 was
removed? (Priel)
2
density
Frequency
150 160 170 180 190 200 210 220 230 240 250 260 270
Length (cm)
Use the histogram to find the number of lions who measured
over 220 cm from nose to tail.
(5 marks)
Chapter 1 Data
Chapter 2 Probability
1. Elementary Probability
Probability is a measure of how likely events are to happen. We’re errs Learning Objectives:
with a reminder of the basics, which you'll have seen before. But it’s all
econenpeanae!
A bag contains 15 balls — 5 are red, 6 are blue and 4 are green.
If one ball is selected from the bag at random, find the probability that:
a) the ball is red
= The event is ‘a red ball is selected’.
# There are 5 red balls, so there are 5 outcomes where the event happens
and 15 possible outcomes altogether.
» So: P(red ball) = ae= 3 ‘ It’s usually best to simplify your
answer as much as possible.
b) the ball is blue
= The event is ‘a blue ball is selected’.
= There are 6 blue balls, so there are 6 outcomes where the event happens
and 15 possible outcomes altogether.
: Tip: Always check
» So: P(blue ball)= Be= z that your probability is
on between 0 and 1. |
c) the ball is red or green
= The event is ‘a red ball or a green ball is selected’.
= There are 5 red balls and 4 green balls, so there are 9 outcomes where
the event happens and 15 possible outcomes altogether.
= So: P(red or green ball) = z:= 2
Chapter 2 Probability
The sample space
The sample space (called S) is the set of all possible outcomes ofatrial.
Drawing a diagram of the sample space can help you to count the outcomes
you're interested in. Then it’s an easy task to find probabilities using the
formula on the previous page. |
lf atrial consists of two separate activities, then a good way to draw your
sample space is as a grid.
Tip: A sample-space Two bags each contain five cards. Bag A contains cards numbered
diagram can also be rt 1, 3, 3, 4 and 5, and bag B contains cards numbered 1, 2, 4, 4 and 5.
drawn as a table — with | A card is selected at random from each bag and the numbers on the
INS SNCOESa VO: two cards are added together to givea total score.
activity along the top
and the outcomes for the — _Use a sample-space diagram to find the probability that the total
other activity down the | score is no more than 6.
left-hand side. E.g:
y Bona = Start by drawing a sample-space diagram
sPiltineseee Se ho 0G showing all the possible total scores.
. : ; ‘ : ; : * This time you need to show the total
lew score for each pair of numbers at
Sl ae ee each intersection. e.g.14+2=3
516 8 8 9 10 * Circle all the scores of 6 or less.
5 x 5 = 25 outcomes altogether
5A
Chapter 2 Probability
_" So now you can use the probability formula. There are 12 outcomes
where the event ‘total score is no more than 6’ happens and
25 outcomes altogether.
So, P(total score is no more than 6) = 1N
i) o
Exercise 1.1
Q5 Spinner 1 has five equal sections, labelled 2,3, 5, 7 and 11, and
spinner 2 has five equal sections, labelled 2, 4, 6, 8 and 10.
If each spinner is spun once, find the probability that the number on
spinner 2 is greater than the number on spinner 1.
Chapter 2 Probability
2. Venn Diagrams ENURESIS SESS
Learning Objectives: Venn diagrams are a really useful tool for solving probability problems.
= Be able to draw Venn Here you'll see how to draw them to represent two or three events,
diagrams for two or and how to use them to find probabilities.
three events.
Use the correct Using Venn diagrams
notation to describe
A Venn diagram shows howa collection of objects is split up into different
the different areas of a
Venn diagram. groups, where everything in a group has something in common.
Use Venn diagrams to = Here, for example, the objects are outcomes and the groups are events.
find probabilities. So the collection of objects, represented by the rectangle, is the
Understand and sample space (S). Inside the rectangle are two circles representing
use complementary two events, A and B.
events.
= The circle A represents all the outcomes
corresponding to event A, and the circle B
represents all the outcomes
corresponding to event B.
s The diagram is usually labelled with the CO
number of outcomes (or the probabilities) represented by each area.
= Since Sis the set of all possible outcomes,
the total probability in S equals 1.
So that’s the basic idea of Venn diagrams. To describe the different areas
of the diagram, you can use set notation.
Chapter 2 Probability
The shaded area represents all the outcomes
corresponding to event B happening and event
A not happening. This area is written A’ MB.
S
The shaded area represents all the
outcomes corresponding to event A not
happening and event B not happening.
This area is written A’ M B’.
Here’s an example where the objects are people, and they're divided into
groups based on whether they have certain characteristics in common.
Example 1 ee
There are 30 pupils in a class. 14 of the pupils are girls and 11 of the Tip: Each of the groups
pupils have brown hair. Of the pupils with brown hair, 6 are boys. needs to be based on a
a) Show this information on a Venn diagram. Se aa
person either has or
= OK, so first you need to identify the groups. Let G be the group of doesn’t have. Then if a
girls and BH be the group of pupils with brown hair. person has that
* Draw aVenn diagram to represent the groups G and BH. characteristic, they're
ne included inside the
= Now you need to label it with the numbers circle. Ifthey don’t have
of pupils in each part of the diagram. the characteristic, they're
= You're told that there are 6 boys with brown outside the circle. E.g.
hair, which means there are 6 pupils who all the girls are included
in circle G, and all the
have brown hair and aren't girls.
boys (‘not girls’) are
So label the area BH M G’ with 6 pupils. outside G.
« There are 11 pupils in total with brown hair,
so there are 11 — 6 = 5 girls with brown hair.
So label the area G M BH with 5 pupils. Tip: Remember, ’N
means ‘and’, so the
= Now, there are 14 girls in total, so the number who don’t have
area GM BH is the area
brown hair = 14 —5 = 9. So label the area G N BH’ with 9 pupils. ‘girls and brown hair’.
= Finally, don’t forget the pupils that aren’t in G or BH. There are 30
pupils altogether, so the number of boys who don't have brown hair
= 30-(9 +5 + 6) = 10. So label the area G’ M BH’ with 10 pupils.
b) A pupil is selected at random from the class. Find the probability that
the pupil is a girl who doesn't have brown hair. Tip: All the outcomes
Using the Venn diagram, there are 9 girls who don’t have brown hair, are equally likely, so you
out of the 30 pupils in the class. can use the probability
eo. formula on page 53.
So, P(girl who doesn't have brown hair) = 30 — 10
Chapter 2 Probability
rs
The Venn diagram below represents two events, A and B. The numbe
two events .
show how many outcomes correspond to each of the
b) P(B)
= Use the Venn diagram to work out the number of outcomes in B.
There are 7 + 12 = 19 outcomes in B, and 40 outcomes in total.
19
" So using the formula, P(B) = 7q
ot c) P(A’)
* Use the Venn diagram to work out the number of outcomes in A’.
Tip: Remember...
A’ includes all the There are 12 + 6 = 18 outcomes in A’, and 40 outcomes in total.
outcomes that are
= So using the formula, P(A’) = AO ge
not inA. 40 2 .
d) P(A U B’)
# P(A U B’) is the probability of either event A happening or event B not
happening or both of the above.
Tip: A U B’ includes all
the outcomes that are « Use the Venn diagram to work out the number of outcomes in A U B’.
either in A, There are 15 + 7 = 22 outcomes in A, plus another 6 outcomes that
or not in B, aren't in B. So that’s 22 + 6 = 28 outcomes in A U B’,
or both of the above (i.e. and 40 outcomes in total. ;
in A but not in B).
* So using the formula, P(A U B’) = = os a
You also need to be able to draw and use Venn diagrams for three overlapping
groups (or events). There’s an example on the next page. Instead of showing
the ‘number of objects’ (or outcomes), the numbers show proportions, but the
ideas are exactly the same.
Chapter 2 Probability
A survey was carried out to find out what pets people like.
The proportion who like dogs is 0.6, the proportion who like cats is 0.5,
and the proportion who like gerbils is 0.4. The proportion who like dogs
and cats is 0.4, the proportion who like cats and gerbils is 0.1, and the
proportion who like gerbils and dogs is 0.2. Finally, the proportion who
like all three kinds of animal is 0.1.
a) Draw a Venn diagram to represent this information.
First, identify the groups. Let C be the group ‘likes cats’, D be the group
‘likes dogs’ and G be the group ‘likes gerbils’.
Draw a Venn diagram to represent the groups C, D and G. Tip: Label the diagram
Now you need to label it — this time with the with proportions this
time, because that’s the
proportions for each area. It’s best to start in the information you're given
CR
middle and work outwards... _ in the question.
The proportion who like all 3 animals is 0.1.
So label the area CM DM G with 0.1.
Next do the ‘likes 2 animals’ areas, by subtracting 0.1 Tip: If they like 3
0.1 from each of the given proportions. animals, they’II also be
SoC NDNG’=0.3,CNGND’=0, andDAGNC =0.1. in the ‘likes 2 animals’
Then complete each circle by making sure that the proportions add up bits. So subtracting this
proportion gives us the
to the proportion given in the question for each animal.
proportions of liking just
CND/nNG =0.5 —-(0.3 + 0.1 + 0) = 0.1, 2 of the animals.
DNC’ NG’ =0.6 - (0.3 + 0.1 + 0.1) = 0.1, and
GNC’ND’ =0.4-(0.1 + 0.1 + 0) =0.2.
Finally, subtract all the proportions from 1 to find the proportion who
like none of these animals: C’ M D’M G’ = 1-0.9 = 0.1.
One person who completed the survey is chosen at random. Tip: When you select a
person at random, the
b) Find the probability that this person likes dogs or cats (or both). above proportions equal
the probabilities of that
This is represented by the shaded area. person being in each
The probability of the person being in this
area is equal to the proportion of people
E>
Ae
area of the diagram.
C) Find the probability that this person likes gerbils, but not dogs.
This is represented by the shaded area.
So add up all the probabilities in this area.
So, P(likes gerbils, but not dogs)
=0+0.2 =O2
Chapter 2 Probability
d) Find the probability that a dog-lover also likes cats.
= This is the proportion of dog-lovers who also like cats.
Tip: You can also D 1 C is represented by the shaded area,
think of this as ‘the
probability that the which forms a proportion of 0.3 + 0.1 = 0.4
person likes cats, given of the whole group of people.
that they like dogs’. But we're only interested in the ‘likes dogs’ circle,
This is an example of a not the whole group, so that means we need to
conditional probability. divide by 0.6.
Conditional probabilities
are covered in detail on So, P(dog-lover also likes cats) = az = 2 = 5
pages 65-67.
Exercise 2.1
QI For events A and B, P(A) = 0.4, P(B) = 0.5 and P(AN B) = 0.15.
Q1 Hint: You're given
the information as a) Draw a Venn diagram to represent events A and B.
probabilities, rather than b) What is P(A’ MB’)?
numbers of outcomes
corresponding to Q2 A sixth form college has 144 students — 46 of the students study
each event. So label maths, 38 study physics and 19 study both.
your diagram with
probabilities. a) Represent the information given above using a Venn diagram.
b) Find the probability that a randomly selected student from the
college studies at least one of either maths or physics.
c) Given that a student studies maths, what is the probability that
they also study physics?
Chapter 2 Probability
3. Laws of Probability
There are two main probability laws you need to know — the addition law Learning Objectives:
and the product law (or multiplication law). You'll see how to use these laws
Be able to use the
to find probabilities, and how you can adapt them in different circumstances.
addition law to find
probabilities.
The addition law Be able to recognise
For two events, A and B, there’s'a nice formula linking the union of A and B mutually exclusive
and the intersection of A and B — the addition law: events.
Be able to use the
P(A U B) = P(A) + P(B) — P(AN B) product law to find
probabilities.
Be able to recognise
You can see why this is true, using Venn diagrams. independent events.
" Be able to understand
and use tree diagrams.
Chapter 2 Probability
On any given day, the probability that Jason eats an apple is 0.6, the
probability that he eats a banana is 0.3, and the probability that he eats
both an apple and a banana is 0.2.
a) Find the probability that he eats an apple or a banana (or both).
« Let A be the event ‘eats an apple’ and B be the event ‘eats a banana’.
* You want to find P(A U B), so use the addition law:
P(A U B) = P(A)+ P(B) — P(A.G.B) = 0.6 +0.3—-02—07
= So P(he eats an apple or a banana, or both) = 0.7
b) Find the probability that he either doesn’t eat an apple,
or doesn’t eat a banana.
* You want to find P(A’ U B’). You can either do this using a Venn
diagram, or you can use the addition law by replacing A with A’ and
B with B’. Like this: :
Tip: A’ U B’ is also the » P(A’ UB’) = P(A’) + P(B’) — P(A’ 9 B’)
complement of A M B, so = [TS PAS Te PB) fl SPA)
you could work out: =(1-0.6) + (1 —0.3)-—(1 —0.7) = 0.4+ 0.7 -0.3 =0.8
P(A’ U B’) = 1 — P(A B) # So P(he either doesn’t eat an apple, or doesn’t eat a banana) = 0.8
=1-0.2
= 0.8
Exercise 3.1
Hint: Remember, you Ql HPA) =0.3,.P(8) = 0.5 and PYA 1) By) =0.15, tind:
can always draw a Venn a) P(A’) b) PRA. B) Che PAs
diagram to help you. |
Q4_ If P(X UY) = 0.77, P(X) = 0.43 and P(Y) = 0.56, find:
a) P(Y’) b) P(X NY) Cc) PX iY) dP”)
Q5_ If P(C’ UD) = 0.65, P(C) = 0.53 and P(D) = 0.44, find:
a) PCD) b) PIC aD) Cc) PC Gy) d) PCD)
coe
S
We defined the addition law on page 61 as P(A U B) = P(A) + P(B) — P(A B).
When A and B are mutually exclusive, we can substitute P(A B) = 0,
to give a slightly simpler version.
For two events, A and B,
where A and B are mutually exclusive:
Tip: With n exclusive
And you can write a general form of this for n exclusive events.
events, AU, aX,
For exclusive events PLS ese: none of the events can
happen at the same time
P(A, UA, U... UA) = P(A,) + P(A,) +... + P(A) as any of the others.
In other words, only
one out of A,, ..., A, can
eae. happen at a time.
Example
= Start by defining the two events. Let A be the event ‘select a picture
card’ and B be the event ‘select the 7, 8 or 9 of clubs’.
= You want to find the probability ofA or B, P(A U B). The card can’t be
both a picture card and the 7, 8, or 9 of clubs, so A and B are mutually
exclusive, which means that P(A U B) = P(A) + P(B).
=» Using the formula for equally likely outcomes:
P(A)=
15 and P(B)=
3
12 outcomes where event 3 outcomes where event
happens out of a total of 52 happens out of a total of 52
‘To show whether or not events A and B are mutually exclusive, you just
o
need to show whether the intersection of A and Bis zero or non-zer
—j.e. whether P(A MB) = 0 or P(A N B) # 0.
Chapter 2 Probability
a) For two events, A and B, P(A) = 0.38, P(B) = 0.24 and P(A UB) = 0.6.
Tip: You could also Show whether events A and B are mutually exclusive.
show that
= The easiest way to do this is to use the addition law to find P(A / B).
P(A U B) # P(A) + P(B)
P(A U B) = P(A) + P(B) — P(A B)
— that’s the same = P(A MB) = P(A) + P(B) — P(A U B) = 0.38 + 0.24 — 0.6 = 0.02
as showing that
P(A M B) # O. = So, P(A B) #0, which means that A and B are not mutually exclusive.
b) For two events, A and B, P(A) = 0.75 and P(A M B’) = 0.75.
Tip: Drawing a Venn Show whether events A and B are mutually exclusive.
diagram can be helpful = This one looks abit trickier, but you just need to think about the
to see areas.
different areas that make up event A.
P(A) = P(A. B) + P(A mB’)
= P(A. B)= P(A) — PIA B) = 0.75 —0.75— 0
So, P(A M B) = 0, which means that A and B are mutually exclusive.
Exercise 3.2
Q1__If X and Y are mutually exclusive events,
with P(X) = 0.48 and P(Y) = 0.37, find:
a) P(XNY) b) P(X UY) CPX TY)
Q2_ P(L) = 0.28, P(M) = 0.42 and P(N) = 0.33. If the pairs
of events (L and M) and (L and N) are mutually exclusive,
and P(M MN) = 0.16, find:
a) P(LUM) b) P(LUN) c) P(MUN)~ d) P(ILMA MAN)
e) Draw and label a Venn diagram to show events L, M and N.
Q5__ For events C and D, P(C’) = 0.6, P(D) = 0.25 and (CN D’) = 0.4.
a) Show that C and D are mutually exclusive. b) Find.P(C U D)
P(B| A) = an
Tip: The probability of
A given Bis:
Here’s an explanation of where this formula comes from... P(A 1 B)
P(A|B) = P(B)
Event A has happened and for BJA to happen, B will also happen.
= If you know that A has already happened,
then the only remaining possible outcomes c
must be the ones corresponding to A: 6
And the only remaining possible outcomes ee
corresponding to B also happening a
must be the ones in A 1 B.
ei
Using the probability formula
(and assuming event A has already happened):
number of possible outcomes corresponding to B
PUB A= total number of possible outcomes
— number
of outcomes in AB
number of outcomes in A
a) Find P(A|B).
Using the formula for conditional probability:
AB i 8-3 = 0.6
P(A|B) = Cm
The Venn diagram below represents two events, C and D. The numbers
of equally likely outcomes corresponding to the events are shown.
a) Find P(C|D).
« Here you’re given the numbers of outcomes, so you can find the
Tip: Doing it this way probability by using the original probability formula from page 53.
is a bit simpler than
using the conditional = Once event D has happened, there are 5 possible
probability formula. outcomes corresponding to event C happening «—____5 outcorhes
But if you did use the
formula, you’d work out:
and 12 possible remaining outcomes in total. inCnD
Chapter 2 Probability
Vikram either walks or runs to the bus stop. The probability that he walks
is 0.4. The probability that he catches the bus is 0.54. If he walks to the
bus stop, the probability that he catches the bus is 0.3.
ik me P(C) — PPC N W)
b Find the probability that Vikram catches the bus, given that he runs to
—
= So P(Vikram catches the bus, given that he runs to the bus stop) = 0.7
3 If the student is studying Art, what is the probability that they are
also studying Maths?
S If the student is studying English and Maths, what is the
probability that they are also studying Art?
os If the student is not studying Maths, what is the probability that
they are studying English? .
d Find P(A|E’).
e SSFind P(M |AN E).
Q5_ Given that P(X) = 0.44, PCY’) =0-72,, PZy-=0'61, PIXIY= 0.75,
P(Z|X) = 0.25, P(Y NZ’) = 0.2 and P(X NY |Z) = Z, find:
a) P(Y) by PX) cy PXNZD
d) PLY |Z4 e) PIXNYNZ)
Independent events
If the probability of an event B happening doesn’t depend on whether an
event A has happened or not, events A and B are independent.
= For example, if a dice is rolled twice, the events A = ‘first roll is a 4’ and
B = ‘second roll is a 4’, are independent, because the number rolled on
the second roll doesn’t depend on the number rolled on the first roll.
Tip: If the first card isn’t = Or, suppose a card is selected at random from a pack of cards, °
replaced, then A and B then replaced, then a second card is selected at random. The events
are not independent A = ‘first card is a 7’ and B = ‘second card is a 7’, are independent
(B is conditional on A). because P(B) is unaffected by what was selected on the first pick.
IfA happens, P(B) = 2,
but if A doesn’t happen,
4 If A and B are independent, then P(B) is the same, whether A happens or not.
P(B)i= By: And that means you have the following results:
= P(B |A) = P(B |A’) = P(B). Similarly, P(A | B) = P(A | B’) = P(A).
P(A M B) = P(A)P(B)
Chapter 2 Probability
V and W are independent events, where P(V) = 0.2 and P(W) = 0.6.
a) Find P(V MW).
Just put these probabilities into the product law for independent events:
P(V 1 W) = P(V)P(W) = 0.2 x 0.6 = a
To show that events A and B are independent, you just need to show that one
of the following statements is true:
= P(B|A) = P(B) [or P(A|B) = P(A)] Tip: If one of these is
true, then the other one
m P(A) x P(B) = P(A NM B) will also be true.
So, since P(A) x P(B) # P(A 12 B), the events ‘catch first disease’ and or P(A|B)=
‘catch second disease’ are not independent. P(A 1B). SFOs
ar oe 0.4
22 PUA) = O25.
For events A and B, P(A) = 0.4, P(B |A) = 0.25 and P(A’ M B) = 0.2.
a) Find: (i) P(ANMB), (ii) P(A’), |A),
(iii) P(B’
(iv) (B| A’), (v) P(B), | B).
(vi) P(A
(i) Using the product law: P(A M B) = P(A)P(B |A) = 0.4 x 0.25 = 0.1
(ii) P(A’) = 1-— P(A) = 1 -0.4 = 0.6
Chapter 2 Probability
Now at this stage, things are starting to get a bit trickier. It'll help a lot if
you draw a Venn diagram showing what you know so far.
* You know from the question that:
P(A) = 0.4, P(B| A) = 0.25 and P(A’ M B) = 0.2
# And you've found:
P(A 1 B) = 0.1 and P(A’) = 0.6 P(A) — P(A NB) Ee 0}
Tip: You can also find | (v) You can see from the Venn diagram that:
P(B) by doing: | P(B) = (BNA) +P(BNA) =0.1 +0.2 = 03
P(B|A)P(A) + P(B|A’)P(A’) |
That's because... And finally...
P(B |A)P(A) = P(B iA),
=PBA) ,
P(BIA)P(X) (vi): P(A | B) B 8-8
And... ;
P(BMA) + P(BNA’) = P(B)
b) Say whether or not A and Bare independent.
Again, there are different ways you can do this. For example, if you
Tip: Or you can show: compare the values of P(B|A) and P(B), you can see that:
P(A| B) # P(A), or
P(A) x P(B) # P(A B) P(B|A) = 0.25 # P(B) = 0.3, so A and B are not independent.
Exercise 3.4
Q1_ If X and are independent events, with P(X) = 0.62 and P(Y) = 0.32,
calculate P(X 1 Y).
oar ~Q3__ Events M and N are independent, with P(M) = 0.4 and P(N) = 0.7.
Q3 Hint: IfM and N Calculate the following probabilities:
are independent, then
M and N’ are also a) PMN) b) P(MUN) c) P(IMAN’)
independent.
vernon erm Q4 Acard is picked at random from a standard pack of 52 cards.
The card is replaced and the pack is shuffled, before a second card is
picked at random.
a) What is the probability that both cards picked are ‘hearts’?
b) Find the probability that the ‘ace of hearts’ is chosen both times.
Chapter 2 Probability
Q5 For events A, B and C: P(A) = o P(B) = i, P(C) = are
OGaex , Y and Z are independent events, with P(X) = 0.84, P(Y) = 0.68 and
P
(Z) = 0.48. Find the following probabilities:
Jer (oye wi Z) ey PY |Z) “dy PZ" IN"). ~e) PY TX’)
Tree diagrams
Tree diagrams show probabilities for sequences of two or more events.
Here’s a tree diagram representing two trials. There are two possible
results for the first trial — events A and A’, and there are two possible
results for the second trial — events B and B’.
‘End’ probabilities
1st trial 2nd trial
P(BJA) -~B P(AMB)
Tip: The ‘end’
Probabilities P(A) A <i
P(B’|A) ~B’ P(A MB’) probabilities are the
on each set x probabilities of the
P(BIA)-B PIYOB
of branches P(A) tet ( ) different sequences of
add up to 1. A, ee
Chapter 2 Probability
eT Cs
purple.
A bag contains 10 balls, 6 of which are red and 4 of which are
One ball is selected from the bag at random, then replaced. A second
ball is then selected at random.
/) 04 P,
0.6 R, P, = ‘second ball is purple’
Tip: You can answer 0.6+0.4=1
questions like b) and 0.4 P
c) without using a tree
diagram, but very often
if you draw one, it helps
to make things clearer. a— Find the probability that both balls are red.
There is 1 ‘path’ along the branches that gives the result ‘red and red’.
Tip: This is the same as Multiply along the branches R, and R, to give:
using the product rule P(R, A R,) = 0.6 x 0.6 = 0.36
for independent events
R, and R,.
Find the probability that one ball is red and the other is purple.
There are 2 ‘paths’ along the branches that give the result
‘red and purple’ — (R, and P.) and (P, and R,).
Tip: This is the same
as using the product Multiply along these pairs of branches to give:
rule for independent P(R, MP.) = 0.6 x 0.4=0.24 and P(P, MR) =0.4 x 0.6 = 0.24
events twice, followed
by the addition rule Now, you want to find the probability of (R, and P,) or (P, and R,),
for mutually exclusive so you add these two probabilities together:
events. P(1 red and 1 purple) = P(R, 1 P,) + P(P, A R,) = 0.24 + 0.24 = 0.48
Chapter 2 Probability
Example 1
A box of 6 biscuits contains 5 chocolate biscuits and 1 lemon biscuit.
George takes out a biscuit at random and eats it. He then takes out
another biscuit at random.
ee ‘
b) Find the probability that George takes out two chocolate biscuits.
= There is 1 ‘path’ along the branches that gives this result.
= Multiply along the branches a and C, to give:
P(C, NC) 2= PIC,)P(C,(C,)= Sy2 x 3 202
S053
Chapter 2 Probability
Horace is either late for school or on time for school, and when he gets to
school he is either shouted at or not shouted at. The probability that he’s
late for school is 0.4. If he’s late, the probability that he’s shouted at is
0.7. If he’s on time, the probability that he’s shouted at is 0.2.
Given that Horace is shouted at, what is the probability that he was late?
=
It’s best to take complicated questions like this step by step.
You're given information about two events:
Let L = ‘Horace is late’ and let § = ‘Horace is shouted at’.
* Start by writing down the probability you want to find: that’s P(L |S).
P(LNS
Using the conditional probability formula: P(L |S) = -
Tip: Be careful with
questions like this... * So you need to find P(L 4S) and P(S) — and the easiest way is by
The question tells you drawing a tree diagram using the information in the question:
the probability of S
conditional on L (and L’).
But you need to think of
the situation the ‘other eh
way round’ — with L
0.4 gr P(S"|L) = 1 - P(S|L)
conditional on S. Ss
So don’t just rush in. PL)=1-P)—>06 ~,, 92
0.8 S’ PSS 1 =P(S 1)
Tip: In general, if B
depends on A, then A
= Using the tree diagram:
depends on B. P(L M S) = P(L)P(S|L) = 0.4 x 0.7 = 0.28
Here, S depends on L, P(S) = P(LM S) + P(L’ MS) = 0.28 + P(L)(S|L)
so L depends on S. = 0.28+0.6x0.2
= 0.4
PLAS — 928 _ 97
* So,P(L|S) =“)
This means P(Horace was late, given he is shouted at)= 0.7. >
For events M and N: P(M) = 0.2, P(N |M) = 0.4 and P(N’ |M’)
= 0.7.
'a) Draw a tree diagram representing events M and N.
P(N|M) 9.4 N
02,.-—M ae N’
N
oe
M’
P(N’|M’) 0.7 yp
04-7 N
M
es 0.6 nv
N
oy
0.7 N’
b) Find P(N).
= P(M)P(N|M)
P(N) = P(MA.N) + P(M’ 9 N) MeO + P(M)P(N |M’) Tipe Thete ae spain
ee OF giving N, sofind the
a, i : probability of each, then
-Cs add them together.
= 7 Fe
Using the conditional probability formula: N’)= teeP(N)
P(M’ |iNes
iSabdigu malas Md
» Using the probabilities you found above:
P(N’) = 1 — P(N) = 0.68
P(M’ 7.N’) = P(M’)P(N’ |M’) = 0.8 x 0.7 = 0.56
Chapter 2 Probability
Exercise 3.5
Ql The probability that Jake will win two consecutive erie
darts matches is shown on the tree diagram. ist”
a) Explain whether the events ‘wins 1st match’. ei (ote
and ‘wins 2nd match’ are independen t? rae Win
b) Find the probability that Jake will win: <a
(i) both matches (ii) at least one match
Chapter 2 Probability
Review Exercise — Chapter 2
ITD
Q1_ Afair, six-sided dice and a fair coin are thrown and a score is recorded.
If a head is thrown, the score is double the number on the dice.
If a tail is thrown, the score is the number on the dice plus 4.
a) Draw a sample-space diagram to represent all the possible outcomes.
b) What is the probability of scoring more than 5?
c) Ifa tail is thrown, what is the probability that the score is an even number?
Q2 Half the students in a sixth-form college eat sausages for dinner and 20% eat chips.
2% eat sausages and chips together.
a) Draw a Venn diagram to show this information.
b) Find the percentage of students who eat chips but not sausages.
c) Find the percentage of students who eat either chips or sausages but not both.
Q3 Arabella rolls two fair, six-sided dice and calculates her score by adding
the two results together.
a) What is the probability that her score is a prime number?
b) What is the probability that her score is a square number?
Let P be the event ‘Arabella’s score is a prime number’ and S be the event
‘Arabella’s score is a square number’.
c) Explain whether or not the events P and S are mutually exclusive.
d) Find. P(P U.S).
Arabella carries out the experiment twice. Let S, be the event ‘score from first pair of rolls is
a square number’ and S, be the event ‘score from second pair of rolls is a square number’.
e) Explain whether or not the events S, and S, are independent.
Dey Find P(S, 1 5,).
Q4 Aschool orchestra is made up of pupils in either the upper or the lower school.
40% of the musicians are boys. Of the boys, 30% are in the upper school.
Of the girls in the orchestra, 50% are in the upper school.
a) Draw a tree diagram to show this information.
b) A musician is chosen at random. Find the probability that the musician is:
(i) in the lower school, given that they are a boy
(ii) a girl in the lower school
(iii) in the upper school
Chapter 2 Probability
Exam-Style Questions — Chapter 2
A soap company asked 120 people about the types of soap (from Brands A, B and C) they
bought. Brand A was bought by 40 people, Brand B by 30 people and Brand C by 25.
Both Brands A and B (and possibly C as well) were bought by 8 people,
B and C (and maybe A) were bought by 10 people, and A and C (and maybe B) by 7 people.
All three brands were bought by 3 people.
(iii) they buy soap B, given that they buy only one of the soaps fan)
A jar contains counters of 3 different colours. There are 3 red counters, 4 white counters
and 5 green counters. Two random counters are removed from the jar one at a time.
Once removed, the colour of the counter is noted. The first counter is not replaced before
the second one is drawn.
a) Draw atree diagram to represent this situation. (3 marks)
) Find the probability that both the counters are red. (2 marks)
d) Find the probability that the two counters are not both the same colour. Gene
Event J and Event K are independent events, where P(J) = 0.7 and P(K) = 0.1.
a) Find:
Gy Pik)
(1 mark)
Gi) POW)
(2 marks)
b) IfL is the event that neither J or K occurs, find P(L|K’). Bane)
marks
Chapter 2 Probability
When Albert eats in a certain restaurant, he always eats either chicken or beef for his main
course. And he always eats either sponge pudding or ice cream for his dessert.
The probability that he eats chicken is 1 the probability that he eats ice cream given that he
eats chicken is z, and the probability that he eats ice cream given that he eats beef is 3
a) Find the probability that he eats either chicken or ice cream, but not both.
(3 marks)
by) Fj ae :
) ind the probability that he eats ice cream. (3 marks)
A film club with 20 members meets once a week. 14 of the members go every week and
13 plan to renew their membership for another year. Of those planning to renew their
membership, 10 go every week.
One member of the club is selected at random.
a) Find the probability that the person selected plans to renew their membership
and goes to the club every week. seman)
b) Find the probability that the person selected goes to the club every week,
but doesn’t plan to renew their membership. (2 marks)
Show whether or not going to the club every week and renewing membership
are independent. (ame)
For a particular biased dice, the event ‘roll a 6’ is called event B. P(B) = 022.
This biased dice and afair, six-sided dice are rolled together.
a) Write down the probability that the biased dice doesn’t show a 6. men)
b) Find the probability that at least one of the dice shows a 6. Cane)
Chapter 2 Probability
Chapter 3 Discrete Random Variables
1. Probability Distributions
Learning Objectives: Probability distributions show the probability of a discrete random variable
Understand what is taking certain values. They can be used to make predictions about the
meant by a discrete outcomes of random experiments.
random variable.
Be able to find and Discrete random variables
First things first, you'll need to know what a discrete random variable is:
distributions and
probability functions. = A variable is just something that can take a variety of values —
Be able to find and
its value isn’t fixed.
= A random variable is a variable that takes different values with
distribution functions. different probabilities.
= A discrete random variable is a random variable which can
only take a certain number of values.
Tip: Remember that
discrete data can only A discrete random variable is usually represented by an upper
take certain values case letter such as X. The particular values that_X can take are
(unlike continuous represented by the lower case letter x.
data which can take all
the values in a certain These examples should help you to get used to the difference between x and_X.
range). See page 3.
Rolling a fair dice and recording the score: Tossing a fair coin twice
= X is the name of the random variable. and counting the number
Tip: Notice that in
these two examples, Here it’s ‘score on dice’. of
heads:
the discrete random ® x is a particular value that
XYcan take. = Xis ‘number of heads’.
variable can only take a Here x could be 1, 2, 3, 4, 5 or 6. = x could be 0, 1 or 2.
few different values. The
possible values are all
whole numbers — but
they don’t have to be. Probability distributions and functions
A probability distribution is a table showing all the possible values a discrete
random variable can take, plus the probability that it’ll take each value.
We can check this works for the fair coin in Example a) above:
+P= N=5h+z=
YSPX=xn = YP K=x =PX=0) 1Y
4
all x x=0,1
You may be asked to find the probability thatX is greater or less than a value,
or lies between two values. You just need to identify all the values that XYcan
now take and then it’s a simple case of adding up all their probabilities.
a ee ee)
Find: a) the value of a, b) P(2<X <4), c) the mode
a) Use })P(X
all x
= 1 again. RC
From theitable?” 04-0:.2: 502430 Deane
=> 0.8+a=1
=> hae 02
Score on rolt 1
$441 242 3
So 2 4
G1) 279 4
S122 4
S913 3 5
DOs
WY
3 5
314 4 6
= From the diagram you can see that there are only five values that
A can takes = 23 2417.0:
=» Since all 36 outcomes are equally likely, you can find Tip: See p53 for the
the probability of each value by counting how many equally likely outcomes
times it occurs in the diagram and dividing by 36. formula.
9 out of the 36 outcomes give a score of 2.
SoP\X=2)= =F
12 out of the 36 outcomes give a score of 3. Tip: Don’t forget to
change the fractions into
So P\X= 3) = 42 =4 their simplest form.
Similarly,
PUX= 4)=70 = 3, PUY= 5) = se = 4,
9 21% 4 5
ous 4 5 6
oe 5 6 i
&5 6 7. 8 10
6 be 8 97410 bis?
b) The game costs 15p to play. Find the probability of making a profit.
A player will make a profit if they win more than 15p — so
if they win 20p or 50p.
= So the probability of making a profit is
POXS15) = PX 720) 4 Pax = 50) 3+ = 5
Q2_ Afair six-sided dice is rolled. Write down the probability distribution
for the following random variables:
a) A =‘score rolled on the dice’.
b) B= ‘1 if the score is even, 0 otherwise’.
c) C= ‘5 times the score rolled on the dice’.
rov-o[o2
[oaforfe
(i) Finda (ii) Find P(X2 2)
Sometimes you'll be asked to work backwards — you can work out the
probability function, given the cumulative distribution function.
The probability that Xis equal to a certain x value is the same as the
probability thatXis less than or equal to that x value, but not less than
or equal to the next lowestx value.
To describe these x values, it can be useful to use the notation x,. Tip: You might have
For example, if Xcan take the values x = 2, 4, 6, 9, then you can seen this x, notation
label these as x, = 2, x, =4,x, =6 and x, = 9. in C1 — it's used for
sequences and series.
Using this notation, the probability that ‘Xx= x, is the same as the probability 4 be ee
that ‘X is less than or equal to x,, but NOT less than or equal to x,_,’.
This clever trick can be written:
F(1)=P(¥< 1)=1xk= ~
FQ)= Pixs 2=2K R= ;
F(3) = P(X <3)=3xk= enSS I ES) lA Bs i
BIO
luo
—
Exercise 1.2
poo |0.1 |
|x |-2|-1[o]1
[2]
ia
|x [2/4/98
[i6[32]64)
1] ts ta a fs
veo] 3/3
[8]ro]3535
Chapter 3 Discrete Random Variables
Q2 Each of a)-b) shows the probability distribution for a discrete random
variable, X. For each part, draw up a table showing the cumulative
distribution function, F(x), and use it to find the required probabilities.
a
Find i) .PXs 3) Find( P(X< 0)
i: P(X> 0)
X a2 aly OI
ECON Or OZ
roa)o3|« [oe
probabilities P(X= x) for
each x as a function of
a, and then use the fact
Given that P(X= 2) = P(¥= 3), tilat ick Xe)
draw a table showing the probability distribution of X. to work out the value
of a.
a) F(x) =
(x 25
+ ky / awl pee p>
Q7 Hint: You need
osha
3
Tip: Remember that x, So all you do is multiply each x-value by its probability and add them all
is just notation for all together.
the different x-values,
X,, X, etc.
SO >)%PLX= 2)
= Gee ®)
The probability distribution of X, the number of daughters in a family of 3
+ P(X= x) +...
children, is shown in the table. Find the expected number of daughters.
And p, is just short for
P(X= x).
. If the discrete random variableX can take values x,, x,, X3,.-.
then the expected value of X° is:
Exercise 2.1 :
re 0
If E(X)= 6.5 find the value of p.
[en Fd Ble By
Q?2 b) Hint: Find afirst. |Pix=x)
105]02|a }02
If E(X) = 7.5 find the values of a and p.
eee Pm =| 3 [7|8 ae
Povss) 02]a |ot]b Pars fot]
a| bs}.|
where a=o225 where E(X) = 7.8
Q4 In parts a)-c), the tables show the probability distribution for a discrete
random variable,
X. For each part, find: (i) EQ) (it) EQ)
Now then... suppose a discrete random variable X can take values x,, ye x,,Mor x Agrees
The variance of X is ‘the expected squared deviation from the mean (i) Tip: You can remember
— in other words: — pny] = Dae — YD; (see Tip above).
Var(X) = E[(X¥ the variance formula as:
But there’s a different way to write the formula for the variance ‘mean of the square
that’s much easier to use: minus the
square of the mean.’
VarlX) = EX?) — [EX = Yxtp, — [xpi
Tip: Remember...
(i) P(X¥=x) =p,
In fact, the formula for the variance can be written in various ways.
For example, on the formula sheet you'll get in the exam, it’s written as: (ii) E(X) = w= Dpxpi
(iii) E(X?) = >) xp:
Var(X) = (4 — BYP(X = x) = Dox P(X =x) —
But these formulas are all equivalent — they'll always give the same answers.
Tip: You'll see on
And you can always take one of these formulas and ‘turn it into’ any of the p96 that
others — just by rearranging it, and using the definitions on the right. E(aX + b) = aE(X) +b
For example:
This tells you that
you can ‘break apart’
Var(X) = E[(X— 47] an expected value
= ELX? — 2uX+ y’] (multiplying out the brackets) involving addition, or
multiplication by a
= E(X?)— 2uE(X) + wv? (see the bottom Tip for more info)
number (not by another
= £(X2)—2[EQO))? + (E(x)? (using the definition 4 = E(X)) variable, though). This
means you can write
=r E[X? — 2X+ p’] as
E(X?) — 2wE(X) + pv’.
Similarly, P(X = 1) = 2k
PX= 2.23%
P(X= 3) = 4k
P(X= 4) = 5k
Go k+ 2k+ 3k+4k =1,1.
45k andso k= ae
1,=
15k
0.2+a+05+0.2 =1
>0.9+a =1
=> a=0.1
So Var(X)
= EQ’) — [E(X)P
= 32.6 —5.2?= 32.6— 27.04 =—aaas
Exercise 2.2
“ ee ee : pete
C)
Leee eee
Q2__ Each of the probability functions below defines a discrete random
variable, X. Calculate the mean and variance of X.
a) x pie) Bay 0
P(Xx=x) |0.2/a)0.4| 0.1 =
P(X ra 0.1
* EatsIf EX)= 5.2, find the value of p and the variance One
Var(X— 2) = 1? Var(X) = 7
c) Next work out E(X’) so you can use it to find the variance:
EX = > ae
me a0 is be 01a 14 G0 |
+[5? x 0.2] + [6° x 0.2]
==» (,9:2
Then the variance is easy:
Var(X) = E(X?) — [E(X)]? = 19.2 — 4.2? ="
Exercise 2.3
Q2 Hint: The
probability distribution
E(X) = 3.2 and Var(X) = 1.56. of a function of X is
easy to find — just
For each of the random variables in a)-d) below, find:
apply the function to the
(i) the probability distribution, values of x, but keep the
(ii) the mean and variance directly from the probability distribution, probabilities the same
given.
(iii) the mean and variance using the values of E(X) and Var(X) (just like we did with
a) Y=3X+4 = b) Z=3X-4 ey V=20=3X “d) W=20+4 3X x2 on p91):
eli
olom
Find the mean and variance of the score on an unbiased,
standard six-sided dice. c
Mean Pedi
= tkvie \otO weeees 3.5
Example 2
A lottery involves a ball being picked at random from a box of 30 balls
numbered from 11 to 40. The random variable X represents the number
on the first ball to be picked. Write down the probability function of X,
and find its mean and variance.
Each ball has a probability of<n of being picked first,
so the probability function Is:
Pave= 1112 40
The smallest value is 11 and the largest is 40, so a= 11 and b= 40.
Using the formulas:
cy
Chapter 3 Discrete Random Variables
Review Exercise — Chapter 3
Ql The probability distribution ofYis:
menleal Pe
a) Find the value of k. ) Find P(Y< 2).
Q2 The probability distribution for the random variable Wis given in the table.
Draw up atable to show the cumulative distribution function.
0.2 | 0.3 | 0.4 | 0.5
DEP =x) = =a ss 4158 6,7,Ore Find (ip PX <a) (i) PG <x 37)
Q4 The cumulative distribution function for a random variable Ris given in the table.
Work out the — distribution for R. Find PO < R < 1).
Q6 A discrete random variable X has the probability distribution shown in the table,
where & is a constant.
Q7 A discrete random variable X has the probability distribution shown in the table.
a) Find E(X)
ofaas
5 ffosfo b) Find Var(X) ;
Q8 Hint: You're looking
’ : / : for F@)= P(X sx). Find
Q8 Xis the random variable ‘highest score showing when three fair . the probability that a
six-sided dice are rolled’. Find the cumulative distribution function single fair six-sided dice
for X, and then use it to find the probability distribution for X. bac a ccoremomnore than
x and remember that dice
rolls are independent.
b) Use the probability distribution to find the probability that the player wins
(i.e. gets more money than they pay to play) in one game. (2 marks)
eeeee ae
The random variable X takes the following values with equal probability:
0,123 245576. Ss 9
a) Write down the probability distribution of X.
(1 mark)
b) Find the mean and variance of X.
(3 marks)
c) Calculate the probability that _X is less than the mean.
(2 marks)
b) Find E(X). ne
c) If Var(X) = 5 find E(X?). (2 marks)
1. Correlation
Learning Objectives: Correlation is all about how closely two quantities are linked.
Be able to plot scatter For example, as one quantity grows, the other might grow as well.
diagrams using paired Or it might shrink. Or there might be no pattern at all to what it does.
observations of two
variables. Scatter diagrams and correlation
Be able to tell from Sometimes variables are measured in pairs — maybe because you want to
a scatter diagram
investigate whether they’re linked.
whether two variables
are positively These pairs of variables might be things like:
correlated, negatively = ‘my age’ and ‘length of my feet’,
correlated, or not
correlated. = ‘temperature’ and ‘number of accidents on a stretch of road’.
Be able to calculate Data made up of pairs of values (x, y) is called bivariate data. You can plot
a product moment bivariate data on a scatter diagram — where each variable is plotted along
correlation coefficient. one of the axes. The pattern of points on a scatter diagram can tell you
Be able to use something about the data.
coding to simplify = For example, on this scatter
the calculation €25
of correlation
diagram, the variables ‘my age’
coefficients. and ‘length of my feet’ seem linked
SRO TPT EEO PERETTI
OIRO — you can tell because nearly all the
points lie close to a straight line.
= As| got older, my feet got bigger
of
Length
feet
(cmy
and bigger (though | stopped
Tip: The line of best fit measuring when | was 10 years old).
doesn’t need to actually
pass through any of = The line of best fit on this scatter
the data points — it :
FE
diagram lies close to most of the
just needs to be near to points. My age (years)
them.
# The circled point doesn’t fit the pattern of the rest of the data at all —
so the line of best fit doesn’t need to pass close to it. A point like this
could show a measurement error (like here), or just a ‘freak’ observation.
E
Chapter 4 Correlation and Regression
Correlation is all about whether points on a scatter diagram lie close to a
straight line.
Exercise 1.1
Q1 Owen asked a group of people how far from the town centre they
live, and how many visits they made to the cinema in the previous
year. The table shows his results.
|Distance from centre (km) |0.6 |7.0 |1.4 sufio ufo
|Cinema visits |14 ens ea 11
Q2 This table shows the average length and the average circumference of
eggs for several species of bird. (All distances are measured in cm.)
|Length 6.2 |2.1 sf s7fiol 7.2145 |68 |
|Circumference 14 |6.3 Aes 24 18.7|11.3|18.4|
a) Plot a scatter diagram to show this data.
b) Describe the type of correlation shown.
c) One of the measurements was recorded incorrectly.
Use your scatter diagram to determine which one.
Chapter 4 Correlation and Regression
Si, 5,, and S,,,
Tip: The formula for
Suppose you've got a set of bivariate data written as 7 pairs of values (x, y).
variance on page 31 is:
_ Sx
d(x — XY The formula for the variance of the x-values can be written:| variance, = >>
variance =
2
Tip: The form There's also a similar quantity S. that involves both x and y.
i
8,= Soy — 2A?
is easier to use.
Dy he NYY) = ae Dxdy
To get a feel for what S,, means, look at the scatter diagrams below.
Tip: The formulas for The dotted lines show the value of x (the mean of the x-values) and y (the
S., Sand S., will be mean of the y-values).
important for the rest of
this chapter. = This scatter diagram shows a pair of variables Seen
(x and y) that are positively correlated — so
most of the values fall in the shaded squares.
Tip: On the graph,
= For each point in the top-right square,
there are two points for
which (x— x )(y — p) is
(x-x)y- y)>O0 (sincex> x andy> y).
negative, but there are # Similarly, for each point in the bottom-left
far more for which square, (x — x )(y— y) > O (sincex < ¥ andy< jy).
(x — ¥)(v — 7) is positive. = This means that for positively correlated variables,
So overall, S_xy is positive. S,= D(x — ¥)(y — 7) will be positive.
= Now you can use the formulas for S_, Sand S..
Tip: Remember... 7 is
= Sin DY
2) ='533 4 — a 81 _ @yere the number of pairs of
data values.
el 1331-8= 9375
2 2
Exam questions won't always use the letters x and y to label bivariate data.
But you can still use the formulas for S_, S,, and S., by replacing x and y
with the letters in the question.
= To find S,, use the formula for S,, — replacing x with p and y with q.
a ZPD _ 2 —15.2x 36 _ a9184
= To find S,, use the formula for S.,— but wherever you see x, write p.
S
The formula for the PMCC involves S., S_andS.;yy | 4 = =
Hy 29 Say
eS
Uc
Wess
Chapter 4 Correlation and Regression
The following data show the score (x) of 10 students in a reading test,
and the time in seconds (y) it took them to run 40 metres.
feo aes5 ft Ak ee Me
y| 9.8 5.8 |8. 8.;
Illustrate the data with a scatter ree and = the are moment
correlation coefficient (r) between the variables x and y.
= The scatter diagram’s the easy bit
— just plot the points.
* Now for the correlation coefficient.
From the scatter diagram, the points
look pretty randomly scattered
— so you'd expect a correlation
Oe 2 ae eA ee Om 7
coefficient fairly close to zero.
= You need to find S., Sand S_, so add a few extra rows to your table.
Sea x
: ey
x)oe et = a0
om tales
Sy = ie ==
ux
i = 596.82 —
75.67
0
_= 5264 Tip: This is very close
to zero — there seems
to be little (if any)
=» Now use the formula for r: r = correlation between
these two variables.
=. 0.) _ss _- Gi
J16.24 X 25.284
Exercise 1.3
Q2 The table below shows the heights and weights of 8 teenage boys.
Height in cm, x |180]171 |182
Weiginkgy
[7067[66[50[or[75[os[56
a) Calculate the values of S,, Sand S_.
b) Find the value of the product moment correlation coefficient (r)
for x and y.
79
Testscore,s _[8.9]40]54|2.8|7.1[75]31
[62[04]
Calculate the correlation coefficient r and interpret your result.
a
Chapter 4 Correlation and Regression
Qé A doctor checked the kidney function of some of his patients to see
if it was related to their weight. Here, 0 indicates no kidney function
at all and 10 indicates that a patient's kidneys are functioning at
maximum efficiency.
The table shows the results.
|Weight inkg w | ,66| 74 | 96 | 83 |79|54] 64 |71 | 88 |
|Kidney function, k[ 7.9 [8.2 |2.3[7.1 [48/91 [8.4[68 [3.71
a) Find the value of the product moment correlation coefficient (r)
for w and k.
b) Interpret the value of this correlation coefficient.
Coded data
Tip: If your data consists
You've seen the idea of coding before (see p37). The idea is to make your life of really huge numbers,
easier by making numbers less fiddly to work with. you could use coding to
The coding you’ll be expected pg produce smaller, more
to use is of the form: A b manageable numbers.
where x is an original data value, u is the corresponding But if your data values
coded data value, and a and b are numbers to be chosen. are tiny, you’d code
them to produce larger
This is a linear transformation — if you were to draw a graph of w against numbers. And so on.
x, it would beastraight line.
Tip: The reason why the
Using a linear transformation to code data values doesn’t affect the value of correlation coefficient
the product moment correlation coefficient at all — r for the coded data is isn’t affected is because
exactly the same as the value of r for the uncoded data. coding a set of values in
this way doesn’t change
the pattern of points
Example ee ee on a scatter diagram.
The effect of coding a
The table below shows the annual salaries (s, in £) of six shipping variable is the same as
clerks, see with the number of years experience (x) they have. relabelling one of the
scatter diagram’s axes.
Q1 Hint: The
transformation
u =x —20 is of the form
_——
weal Exercise 1.4
b /
but with b = 1. Q1__ Find the product moment correlation coefficient for x and y in the
table below by using the coding u =x — 20 andv=y — 60.
Time in secs, s
Find the product moment correlation coefficient for w and s using
the coding uv = w- 60 and v= 10(s— 11).
Be able to transform
a regression equation
Linear regression is a method for finding the equation of a line of best fit
relating coded ona scatter diagram. Or you can think of it as a method for modelling the
variables to an relationship between two variables.
equation relating the
original variables. Before carrying out a linear regression, you first have to decide which variable
is the explanatory variable, and which is the response variable.
= The explanatory variable (or independent variable) is the variable
you can directly control, or the one that you think is affecting the other.
In the above example, ‘load’ is the explanatory variable.
The explanatory variable is always drawn along the horizontal axis.
= The response variable (or dependent variable) is the variable you think
is being affected. In the above example, ‘fuel efficiency’ is the response
variable. The response variable is always drawn up the vertical axis.
Seley ———$___.
For each situation below, explain which quantity would be the
explanatory variable, and which would be the response variable.
Tip: A place’s latitude is
an angle showing how a) A scientist is investigating the relationship between the amount of
far north or south of the fertiliser applied to a tomato plant and the eventual yield.
equator it lies. = The scientist can directly control the amount of fertiliser she gives
The latitude of the North
each plant — so ‘amount of fertiliser’ is the explanatory variable.
Pole is 90° north, while _ ® She then measures the effect this has on the plant's yield — so
the latitude of the South ‘yield’ is the response variable.
Pole is 90° south.
b) A researcher is examining how a town’s latitude and the number of
North Pole days last year when the temperature rose above 10 °C are linked.
= Although the researcher can’t control the latitude of towns, it would be
the difference in latitude that leads to a difference in temperature, and
not the other way around.
= So the explanatory variable is ‘the town’s latitude’, and the response
South Pole variable is ‘the number of days last year when the temperature rose
above 10 °C’.
Regression lines
Here’s the ‘fuel efficiency v load’ scatter diagram from the previous page.
The regression line (which is what I’m going to call the ‘line of best fit’
from now on) is marked on again. The equation of this regression line is Tip: b is the gradient of
y =a + by, where a and b are numbers to be found. the regression line, and
a is its intercept on the
There are also some dotted lines vertical axis.
showing the vertical distance 10 4
between each data value and oo x
the regression line. 3 8] Xx WwW
Each of these small distances ued
is called a residual (e). aie Kye
The residuals show the errors in 2 :
the model (the regression line) sy .
— they show how the real-life ad a. ekas GAR oeL AG
observations differ from what the Load (tonnes)
model predicts.
Linear regression involves finding the equation of the line that minimises Tip: This is why this
the sum of the squares of the residuals, >}e7. kind of regression is
sometimes called
This will mean the regression line is as close as possible to your points. least-squares regression.
The formulas below tell you how to find the regression line of y on x
(the ‘...ofy on x’ part means that x is the explanatory variable,
and y is the response variable).
8,=Dy
—See n
yx ia (23)
S re
summations
ae ye and =- 17.914
>)xv, rather than the
individual data values.
And) = S.= ut— (Dux) i723
2 = SA Lay
= 22.081
= This means that the regression line ofy on x is: y= 12.5 —0.811x
To plot a regression line Hever Lx = 44:3 7.93 'and p = dy BOS S96 %68.
you can choose any two
So the regression line
points to plot. Or you
can draw the line using must go through the
the y-intercept and the point (7.23, 6.68). uth
€) So
gradient.
" By puttingx = 0 into the
It’s a good idea to make equation, you can see the
the points you're plotting line must also go through
for the regression line the point (0, 12.5).
look different from your
actual data points. = So draw the regression line efficiency
Fuel
(km
litr
perOaNWAUDNDWO
through these two points. : : ie e 8 iO pagei2
Load (tonnes)
In the above example, the regression line was y = 12.5 — 0.811x, where x
was the load (in tonnes) andy was the fuel efficiency (in km per litre).
' These values for a (= 12.5) and b (= —0.811) tell you:
= for every extra tonne carried, you'd expect the lorry’s fuel efficiency to fall
by 0.811 km per litre (since when x increases by1, y falls by 0.811).
= with no load (x = 0), you'd expect the lorry to do 12.5 km per litre of fuel
(assuming it was reasonable to use the line down to x = 0 — see p118).
b) Interpret your values of a and b in this context. Tip: Make sure you
= The value of b tells you that the fare will increase by give your explanations
approximately £1.20 for every extra mile travelled in the context of the
(since when d increases by 1, fincreases by 1.20). question — so here you
need to talk about taxis,
= The value of a tells you that a journey of 0 miles costs £5.19 distances and fares.
— this is a fixed part of the fare that doesn’t depend on how
far you travel (so the taxi’s meter will show approximately
£5.19 before you've even gone anywhere).
Example eee OF Fe
The length of a spring (y, in cm) when loaded with different masses
(m, in g) is shown in the table below.
Mass,m | 200 | 250 | 300 | 400 | 450 | 500 |
[Length,
y|9.8 | 10.7 | 10.8 | 11.8 | 12.4 | 13.2|
a) Calculate the equation of the regression line ofy on m.
= Extend the table to include m and my, so that you can find Sand S._.
aa
a 24775 — oe 730
. ear
ES
Sam = Sym Pas yee = 805 000 —
2100”
i = 70000
b) Use your regression line to estimate the length of the spring when
loaded with a mass of: (i) 370g (li) 6708
= (i) m= 370, soy=7.8 + 0.01043 x 370 = 11.7 cm (to 1 d.p.).
= (ii) m= 670, soy=7.8 + 0.01043 x 670 = 14.8 cm (to 1 d.p.).
A company collects data on the age (g) and salary (s) of its senior
managers. The data is coded in the following way:
x=g-50 y= s — 50000
1000
If the regression line ofy on x is given byy = 1.24x + 5.44, find the
regression line of s on g.
= Substitute the expressions defining
x and y into the equation of the s — 50 000 = 1.24(g—50) + 5.44
regression line fory on x. This gives 1000
you an equation involving just s and g.
= Rearrange this so it is in the
form s = a+ bg, where a and s — 50000 = 1240g- 62000 + 5440
b are constants. This is the s = 1240¢ —6560
regression line of s on g.
The annual heating bill (4, in £) for 8 office buildings is shown below,
along with the total floor area (f, in m’) of each building. Tip: If you work out
the means of these
variables, you'll find
Exercise 2.4
Q1. Aset of bivariate data (x, y) has been coded usingp = x — 7 and
q=y-—50. The regression line of g on p is given by g = 40°+ 2p.
Find the equation of the regression line ofy on x.
Q4 Mrs Brown put out different amounts of birdseed (s, in grams) each
day, and then counted the number of birds (v) in her garden at noon.
She coded her data in the following way:
X= 759 —2andy=v-7
The summary statistics for the coded data are as follows:
+ = 28, Dy= 124, ox? = 140, xy = 618 andn = 10,
a) Find the equation of the regression line of y on x.
b) Hence find the equation of the regression line of v ons.
Q5__ The time (¢, in hours) that seedlings could survive in water containing
different concentrations of salt (s, in mg per litre) was recorded.
The results were coded such thatx = -& and y = ¢-29.
10
The coded data is in the table below.
Ee bd el aaa
yo, ti oer aro
a) Find the equation of the regression line of y on x.
b) Hence find the equation of the regression line of ton s.
Q2 For each pair of variables below, state which would be the explanatory variable
and which would be the response variable.
a) © the annual number of volleyball-related injuries
e the annual number of sunny days
b) © the annual number of rainy days
e the annual number of Monopoly-related injuries
c) ® aperson’s disposable income
* a person’s spending on luxuries
d) ¢ the number of trips to the loo per day
e the number of cups of tea drunk per day
e) ¢ the number of festival tickets sold
e the number of pairs of Wellington boots bought
Q3 The radius in mm, r, and the weight in grams, w, of 10 randomly selected blueberry pancakes
are given in the table below.
| [48.0| 51.0 | 52.0 | 54.5 ee an
mee 100 | 105 | 108 | 120 1
=> Comment on the reliability of your estimate, giving a reason for your answer.
pe [2s es eee
|y | 050 | 070 | 010 | 082 | oso [ 036 | 016 | 0.80_|
a) Represent this data on a scatter diagram. (2 marks)
c) What does this value of the PMCC tell you about these variables? (1 mark)
The following times (in seconds) were taken by eight different runners to complete
distances of 20 metres and 60 metres.
mea ee et Te
Find the equation of the regression line of y on x, and plot it on your scatter
diagram. (8 marks)
Use the equation of the regression line to estimate the value of y when:
(jae 35; (11) x = 3.88.
Comment on the reliability of your estimates. (4 marks)
(4 marks)
Ten athletes all trained for the same 10 000 m race. On the day before the race, they each
ran different distances. The distance (d, in km) each athlete ran the day before the race, and
the time (¢, in seconds) that each athlete recorded in the race itself were recorded.
The results were then coded using p = d and q = t— 2367.
The values of p and q are shown in the table below.
Use your answer to part (c) to comment on how the length of a run on the day
before a race affects the performance of the athletes. (2 marks)
A scientist is investigating the link between the fat content of different brands of burgers
and the price. She measures the amount of fat (x, in grams) in 100 g of each type of burger
(when raw), and calculates the price for 100 g of each burger (y, in pence).
Her data is shown in this table.
Brand | A E FiieG | H |
1) 12 [email protected] | 25
106 | 94 Se |
= 1975, ae= 43290 and >)xy= 6446)
Draw a scatter vee showing this data. (2 marks)
The scientist models the relationship between x and y with the equation
rene)
y=a+bx. Use linear regression to find the values of a and b.
iced.
Use the scatter diagram to say which brand of burger appears to be overpr
eH)
Explain your answer.
There are some facts about the area under the curve
that apply to all normal distributions.
Tip: You don’t need to
= 68% of the total area lies within +1
learn these facts, but
standard deviation (+o) of the mean. they should give you an
® 95% of the total area lies within +2 idea of how values close
standard deviations (+20) of the mean. to the mean are much
more likely than those
= 99.7% of the total area lies within +3 further away.
standard deviations (+30) of the mean.
Working out the area under a normal distribution curve is usually difficult.
But the reason why Z is so important is that there are tables that list areas
under the curve for N(O, 1). The main table you'll use (see page 150) shows
values of the normal cumulative distribution function, ®(z). These are the
areas under the curve to the left of z (the probability that Z < z) for different
values ofz —1.e. ®(@) = P(Z < z).
So you can often find P(Z< z) by looking up the value of z in the table
Tip: The table only gives
and reading off the value for ®(z). And since Z is a continuous distribution,
P(Z < z) forz = 0.
P(Z = z) = 0, which means that P(Z < z) and P(Z < z) are the same thing.
So you can interchange the < and < signs — i.e. ®(z) = P(Z < z) as well.
Tip: For any continuous
random variable x,
P(X= a) = 0 for any
value of a.
That’s because the area Find the following probabilities.
under a graph at a single
point is zero.
a) P(Z < 0.64)
This is a nice, straightforward one — all you have
to do is look up z = 0.64 in the table for ®(z).
Tip: The table for ®(z) is So, P(Z < 0.64) = 7389
on page 150.
= (01251
Now then, things start to get a bit trickier when z takes a negative value.
To work these out, you need to use the symmetry of the curve. The best thing
to do is to start by drawing a sketch and shading the area you want to find.
Example 3
Find the following probabilities. f@)
a) P(Z>-0.42)
= Shade the area you want to find. ———_
= Because z is negative, you can’t look it up in the aa ty
= You still can’t get your answer from the table directly, ss up in the ee
but using the fact that the area under the curve is 1,
P(Z> 1.94) =1-P(Z< 1.94).
aca Fey 94) =PZ = 1.04) 21 PZ < 1,94) =0.9738
P(Z < 1.9 4)
al 009738
= (00262
Exercise 2.1 = ss
Q3___By using sketches and the table of the normal cumulative distribution
function, ®(z), find the following probabilities.
If P(Z < z) is less than 0.5, then z will be negative, and these values aren’t
listed in the table. But you can still use the table — you just need to doa bit
of thinking and sketching to decide what to look up.
elec eee
Okay, so the ®(z) table contains values for P(Z< z) and the percentage-points
table contains values for P(Z > z). But it really depends on what sort of
decimal the probability is as to which table you should use to find z.
The final type of example we need to go through is when you‘re given the
probability that Z takes a value between two limits, and you're asked to
find the missing limit.
Examples
a) If P(0 < Z <z) = 0.4452, find the value of z.
= Start by drawing a sketch, showing what you know. ——>
= If you find P(Z < z), you can use the (2) table to get z. N
Exercise 2.2
|
For example: f(z) Z ~ NO, 1)
x= 23. becomes
2= a =
= 1 -0.8413
= HEI Oe
As you saw in the previous section, you often need to use the symmetry
Tip: See page 129. of the curve to answer questions on the normal distribution.
sone oe We'll be using the same methods here as in Section 2, but we need to
do the extra step of transforming X to Z first.
Examples
If X ~ N(102, 144):
a) Find P(X> 78).
You also need to be able to find values, when you're given probabilities.
For example, find a, given that P(X< a) = 0.6. Tip: You can look up
to Z.
= As usual, the first step is to transform _X probabilities in the ®(z)
table, or the
X~ N(u, o”), the equation P(X< a) = 0.6 becomes P(Z
If eeoO )= 0.6. percentage-points table.
= Then you use the same methods as on pages 131-134, and you end up See p131-134 fora
with an equation in a to solve. reminder and some
examples.
Examples —— tance
* Start by transforming X to Z.
BT 4 a—8>) = 0.9192
= Draw a sketch to show the information. ———————_>
= Use the &(z) table to find z for which ®(z) = 0.9192.
85 — 1.40 See p150
(z) = 0.9192 forz= 1.40, so 2
= Now, just solve for a: Equation in a.
a— 83 -1.44-85=7> a= 92
Chapter 5 The Normal Distribution
b) X ~ N(85, 25). If P(X< b) = 0.0179, find the value of b.
= Start by transforming Xto Z.
bee ISL aa ny = 0.0179
= Draw a sketch to show the information — the area is less
than 0.5,s0 b a 85 must be negative. Now mark on the
Tip: The equivalent : (ph Dez OD,
preasie to theanehtc) equivalent area to the right o ie
5 nea
Fe * You can see that P(Z 2 Mt = O.0079:
: : b= 0 S5=b 5” @
Clee b
ee So P(Z < 88
=4)
D
= 1 - 0.0179 = 0.9821
5
= Using the ®(z) table, &(z) = 0.9821 for z = 2.10, so 85 = D340,
A machine which fills boxes of cereal is set so that the mass of cereal
going into the boxes follows a normal distribution with mean 766 g and
standard deviation 8 g.
a) Find the probability that a randomly selected box of cereal contains
less than 780 g of cereal. Tip: So the variable here |
« First, define a random variable to represent the mass of cereal in a box. is the mass of cereal ina |
box.
If X represents the mass of cereal in g, then X ~ N(766, 64).
* Next, turn the question into a probability for xX. _@
So you want to find, P(X < 780).
= Now you can transform _Xto Z in the usual way.
PUK< 780) = P(Z < 2B0—766) ;,
= (2 <ale/9) ‘ Tip: See p150 for the
®(z) table.
= Draw a sketch showing the area you need to find. ——>
» Using the ®(z) table, P(Z < 1.75) = 0.9599 4
Tip: Always check the
» So P(a random box of cereal contains less than 780 g) = 0.9599 answer seems sensible.
Using the rules on p127,
you know that 97.5% of
b) Find the probability that a randomly selected box of cereal contains values are less than
2 x o above the mean
—
='(00256
» This means:
P(a randomly selected person took more than 620 seconds) = 0.0256
re
ane
Beam Chapter 5 The Normal Distribution
The forces needed to snap lengths of a certain type of elastic are normally
distributed with ,.= 13 N and o = 1.8N.
41-07 Sa-13=1.265 oT OG
—
Find the range of values that includes the middle 80% of forces needed.
It’s difficult to know where to start with this one. So it’s a good idea to
sketch the distribution of F, to show the range you need to find.
Ay (1 —0.8)+2=0.1
So, pe Se iO
Tip: The range is
13
And using symmetry, “-g= = —1.2816 = a= 10.69 symmetrical about z = 0,
ean Oe eens
Sica 7.8
So the range of values is 10.69 N to 15.31 N
Q4 A particular type of toy car uses two identical batteries. The lifetimes
of individual batteries can be modelled by a normal distribution with
a mean of 300 hours and a standard deviation of 50 hours.
a) What is the probability that a battery of this type lasts less than
200 hours?
b) What is the probability that a battery of this type lasts at least 380
hours?
Q4 Hint: See p68 for c) Stating any assumptions you make, find the probability that both
a reminder of finding of the batteries in a car last at least 380 hours.
probabilities of multiple
events. d) The probability that a randomly selected battery lasts more than
160 hours, but less than A hours, is 0.9746. Find the value of h.
Q5 The masses of the eggs laid by the hens on farmer Elizabeth’s farm
are assumed to follow a normal distribution with mean 60 g and
standard deviation 3 g.
a) The probability that a randomly selected egg has a mass of at least
60 — m grams is 0.9525. Find the value of m to the nearest gram.
b) Farmer Elizabeth keeps the lightest 10% of eggs for herself and
uses them to make sponge cakes. Find the maximum mass of an
egg that could end up in one of farmer Elizabeth’s sponge cakes.
Py 23)= P(Z< Zo 2
ph
)=0.9015
f(z)
Example 2
If the random variable X ~ N(1, 42) and P(X > 19.84) = 0.025, find .
If the random variable X ~ N(53, 0?) and P(X < 50) = 0.2, find o.
= Again, start by transforming the probability you're given forX into a
probability for Z. The s.d. is unknown, so just leave it as o for now.
<20 =23) Ah
PUX < 50) = P(Z Bie o3)=02
= Draw asketch to show the information. The area —___»
is less than 0.5, so mark on the equivalent area to
the right 3
i of — i= ae
oO es =(=3/6) = 3/01
=02
See z
When you’re asked to find the mean and the standard deviation, the method is
eee renee ol a little bit more complicated.
can solve simultaneous You start off as usual, but instead of getting one equation in one unknown to
equations by adding or solve, you end up with two equations in two unknowns, «. and o. In other
subtracting them to get | words, you have simultaneous equations, which you solve to find j: and o.
rid of one unknown. |
= Now, if you use the ®(z) table to find z for which ®(z)
= 0.5596, you can form an equation in ,1 and o.
From the table, @(z) = 0.5596 for z = 0.15. See p150
So, 2 = 0.193 9-110:
a Ven Ce :
1st equation in. and o
eerie F) = 0.0322
Tip: p = 0.0322 isn’t in
= Draw a sketch to show the information. ——--—> the percentage-points
table, so it might help to
= You can see from the graph that: Cente use a sketch.
= Each equation has one ‘1’, so you can subtract them to get rid of 1,
which will leave you with an equation ino to solve.
@ - @ gives 14-9 -—p-() = 1.850 - 0.150
=> 5 = 1.76
= 0 =2.941176...> 6=2.94 + Jo3f.
Exercise 3.3
Q1_ For each of the following, use the information to find pL.
a) X~ N(u, 6?) and P(X< 23) = 0.9332.
b)X~ N(u, 82) and P(X< 57)= 0.9970.
c) X ~ Niu, 100?) and PX> 528)= 0.1292.
d) X ~ N(u, 0.42) and P(X< 11.06) = 0.0322.
e) X~ N(u, O.0”) and PU> 1.52)= 0.9938.
ee 4: Q2?2 Hint: Start by
renmeete
Q2 X~ N(u, 3.52). If the middle 95% of the distribution lies between en
Ehowinenhe Geinbun
6.45 and 20.17, find the value of 1.
of X.
Q5 X~N(80, 15) |
ERGs a) = 0.99, finda. Q5 Hint: |X— 80] <b means
b) If P(LX— 80| <b) = 0.8, find b. Lae eae
Q6 The mass of items produced by a factory is normally distributed with a mean of 55 grams and
a standard deviation of 4.4 grams. Find the probability of a randomly chosen item having a
mass of:
a) less than 55 grams b) less than 50 grams c) more than 60 grams
Q7 The mass of eggs laid by an ostrich is normally distributed with a mean of 1.4 kg and a
standard deviation of 300 g. If 88.3% of the eggs laid by this ostrich have a mass of less than
a kg, find the value of a.
The exam marks for 1000 candidates can be modelled by a normal distribution
with mean 50 marks and standard deviation 15 marks.
a) One candidate is selected at random. Find the probability that they scored less
than 30 marks on this exam. (3 marks)
Find the mark needed for a distinction if the top 10% of the candidates achieved
a distinction. (3 marks)
The random variable X follows a normal distribution with mean \1 and standard deviation 6.
The probability thatX takes a value of less than 50 is 0.123.
a) Find the mean of this distribution. (4 marks)
The lifetimes of a particular type of battery are normally distributed with mean LL
and standard deviation o. A student using these batteries finds that 40% last less
than 20 hours and 80% last less than 26 hours.
a) Find and o.
(7 marks)
b) Find the probability that a randomly selected battery of this type has a lifetime _
of at least 15 hours.
(3 marks)
The diameters of the pizza bases made at a restaurant are normally distributed.
The mean diameter is 12 inches, and 5% of the bases measure more than 13 inches.
a) Write down the median diameter of the pizza bases. (1 mark)
b) Find the standard deviation of the diameters of the pizza bases. (4 marks)
A garden centre sells bags of compost. The volume of compost in the bags is normally
distributed with a mean of 50 litres.
a) If the standard deviation of the volume is 0.4 litres, find the probability that a
randomly selected bag will contain less than 49 litres of compost. (a mane)
b) If 1000 of these bags of compost are bought, find the expected number of bags
containing more than 50.5 litres of compost. ier
A different garden centre sells bags of similar compost. The volume of compost, in litres,
that
in these bags is described by the random variable Y, where Y ~ N(75, 6”). It is found
.
10% of the bags from this garden centre contain less than 74 litres of compost
cjaeeFind 0: (3 marks)
$1 Statistical Tables
$1 Statistical Tables
Percentage points of the normal distribution
The z-values in the table are those which a random variable
Z ~ N(O, 1) exceeds with probability p16. PZ > Zz) == oC ee:
S1 Statistical Tables
e Q2 a) First you need to work out how many people are
Chapter 1 ss Data represented by each square unit on the graph
— use the information that the bar for 30-45
seconds represents 54 contestants.
1. Representing Data Bar for 30-45 seconds:
Exercise 1.1 — Data basics Width = 45 — 30 = 15 and height = 1.8
So area = 15 x 1.8 =27
See ee — this represents 54 contestants.
b) Mileage, Number of doors, Cost of service So each square unit represents
Q2 a) Number of medals won last season, Shoe size 54 + 27 =2 people.
b) Height, Mass Or frequency = 2 x area’
Q3 a) Th ere are no : gaps bet
etween Possl ible
ie heights.
elg S
Bar for 10-30 seconds:
Width Ak 10 = 20 et height = 03 ‘
as So area = 20 x 0.3 = 6
— this represents 6 x 2 = 12 contestants
{em)_|_ (cm) fem) So 12 contestants’ auditions lasted less than
tears
b) Now you need to add on the frequencies
represented by the other bars as well.
2 | 1708 180) ter) 175 Area of ‘45-55’ bar = 10 x 2.6 = 26, which
180 | 190 represents 26 x 2 = 52 contestants.
Area of ‘55-60’ bar = 5 x 3.0 = 15, which
represents 15 x 2 = 30 contestants.
: : Area of ‘60-75’ bar = 15 x 1.0 = 15, which
Exercise 1.2 — Histograms represents 15 x 2 = 30 contestants.
Q1 First add columns to the table to show class Area of ‘75-90’ bar = 15 x 0.4 = 6, which
boundaries, the class widths and the frequency represents 6 x 2 = 12 contestants.
densities. So the total number of contestants who
Mass (m, in g) class |Freq- auditioned was:
width }uency uency 12+54+52+30+30+12=190
Q3 a) The area of the bar is 1.5 x 9 = 13.5 cm?.
Ra This represents 12 butterflies.
ol So each butterfly is represented by
13.5 + 12 = 1.125 cm? on the histogram
b) 22.5 + 1.125 = 20, so the frequency was 20.
: c) The first bar, representing a class of width 3, was
1.5 cm wide. So the bar representing the class
53.5 < w < 58.5, which has a width of 5, must be
2.5 cm wide. And because it needs to represent
a frequency of 14, its area must be
14 x 1.125 = 15.75 cm’. This means it must be
15.75 + 2.5 = 6.3 cm high.
Answers
Q2 a) 2+5=7 towns Q3 a) 5
b) 10.7°C b) There are 176 ratings in total.
OF 12a Sas °C 176 + 2 = 88, so the median is midway between
O37 a) 0} 8 (1) the 88th and 89th values.
ieee (3) Add a column to the table to show cumulative
Oat 4ae 5. G7 9.6) persuas
3 2 (2) Number of | Cumulative
AM 0 3 (2) customers recwueney
Saab (1)
6 |4 (1) ey ay eee
ce eae)See eee
2a a ae
b) (i) 6.4km (ii) 4 children anne hans|1043)
Q4 a) 60 minutes een eto
b) 17 From the cumulative frequencies, the 88th and
89th values are both 4, so the median = 4.
Cc) 6
Q4 a) 5 seats
There are 35 values altogether.
2. Location: Mean, Median
e e b)
Answers
b) Add some extra columns to the table: So . = 30, meaning the median must lie in the class
‘270 < t< 300’. Now you need to sketch that class.
letters houses, f G b, 6
270% m 300
arenesA 7
+
30 48
Pree ee wo}; >Eee P.
b,
>
RID
GU}
ri a a2 3 . 2
Finally, solve =!i = =.
mes This g gives:
vf =50, fe = 209 m2270, 30 =27
So estimate of mean = 209 = 50 = 4.18 letters 300-270 Ara 27
c) Since ))f +2 =50 +2 =25, the median is or m = 30 X fl4270 = 274.3 mins (to 1d-p.)..
halfway between the values in this position (25)
and the next position (26) in the ordered list.
So the median must be in the class 3-5. Exercise 2.4 — Comparing measures of
Q3 Add a column showing cumulative frequency to the
table:
location
Q1 a) Median — most employees will earn relatively
Rainfall (r, mm) | Frequency | Cumulative low salaries but a few may earn much higher
frequency salaries, so the mean could be heavily affected by
5 a few high salaries.
40<r<50 I 12 b) Mean — the data should be reasonably
50 <r<60 9 symmetrical so the mean would be a good
60 <r< 80 i 36 measure of location. The median would be good
Bee |oe el as well (for a symmetric data set, it should be
roughly equal to the mean).
c) Mode — make of car is qualitative data so the
So 5 = a = 23, meaning the median must lie in
mode is the only average that can be found.
the class ‘60 < r < 80’. Now you need to sketch that
d) Mean — the data should be reasonably
class. symmetrical so the mean would be a good
60 ai
ay m
>
80
measure of location. The median would be good
as well (for a symmetric data set, it should be
2 23 36 roughly equal to the mean).
4 as > e) Median — most employees will perhaps travel
fairly short distances to work but a few employees
Finally, solve 2 = @. This gives: may live much further away. The median would
bh bp not be affected by these few high values.
60s ees 2 The mode is unlikely to be suitable in b), d) and e) (and
6060 — 3621 possibly a) as well) because all the values may well be
or m = 20 x a+ 60 = 62.7mm(to1d.p.). different.
Q2 There is a very extreme value of 8 that would affect
Q4 a) Estimated mean = 16740 + 60 = 279 minutes the mean quite heavily.
b) Add a column showing cumulative frequency to
the table:
Frequency, | Cumulative
3. Dispersion and Skewness
ie frequency Exercise 3.1 — Range, interquartile
iaoeremo| 8 |8 | range and interpercentile range
Q1 a) Highest value = 88846 miles
5 OE Pee Oe ane
360< t< 480
7 Lowest value = 3032 miles
So range = 88 846 — 3032 = 85814 miles
b) (i) There are 8 values, and the ordered list is
3032, 4222, 7521, 7926, 30 HIS), 3)\\HESS,
74898, 88 846
Answers
(ii) Since an = 6, the upper quartile (Q,) is Finally, solve 4 = a This gives:
halfway between the values in this position b, 2
—— ms
10 4, Q, A
B 4
22 ae 63 97
{22 > z >
b,
Answers
b) The position of the 80th percentile is (vi) The upper fence is
i x 200 = 160, meaning that P,, must lie in
=56+ (1.5 x 21)
Q, + (1.5 x IQR)
S85) EUS
the class ‘16-30’. Now you need to sketch that
b) The value 1 is outside the lower fence, so that is
class. an outlier. Similarly, the values 91 and 99 are
5, — outside the upper fence, so these values are also
15.5 4% P,, 30.5 outliers.
]
154 a, 160 184 C)
>
b,
Finally, solve 4 = ©. This gives:
bby Pheer
0 10 20 30 40 50 60 70 80 90 100
Po— 15.5 _ 160 — 154
30.5.2 wh5e5 ae (64a) 54
Q4 a) (i) There are 18 data values for Pigham.
oF Pao = 15 x B+ 15.5 = 18.5mm Since 18 + 2 = 9, the median is halfway
between the 9th and 10th data values (which
C) So the 20% to 80% interpercentile range are both 35). So the median = 35. °
=P, — P.) = 18.5 —2.7 = 15.8 mm (to 1 d.p.).
(ii) Since 18 + 4 = 4.5, the lower quartile (Q,) is
the 5th data value.
Exercise 3.2 — Outliers and box plots So the lower quartile = 23.
QI IQR=Q,-Q,=31-19=12 (iii) Since 3 x 18 + 4 = 13.5, the upper quartile
Lower fence = Q, — (1.5 x IQR) = 19-(1.5 x 12) =1
(Q,) is the 14th data value.
Upper fence = Q, + (1.5 x IQR) So the upper quartile = 46.
Sil $215
12) =4) (iv) The interquartile range = Q, — Q,
The value 4 is inside the lower fence, so 4 is not an = 46 — 23 = 23
outlier. The value 52 is outside the upper fence, so (v) the lower fence is
52 is an outlier. QO, ~ (125eQk)=23 = (So822)
Q2 a) IQR
=Q,-Q, =37-16=21 =-11.5
(vi) the upper fence,
b) Lower fence = Q, — (1.5 x IQR)
ml r= Geo eal
t= es Q, + (1.5 x IQR) = 46
+ (1.5 x 23)
Upper fence = Q) + (le5 2alQR) = 60.5
= O77 leo xl)
= 6885 b) No, none of the values fall outside the fences.
Since the minimum value and the maximum C)
value both fall inside the fences, there are no
outliers in this data set. elles
C)
ansread BAagaah ES 0 10 20 30 40 50 60 70 80
Answers
So the box plot looks like this:
—I}- «
Extend the table to include fx, x? and fi’.
0 10 20 30 40 50 60 70 80
50 <w<60
Then x* = Si tS leo
yo 60 <w< 70
70 < w < 80 1125|5625| 84375
b) Now add two more rows to the table.
80 < w < 90 510 |7225| 43350
xX
90<w<100]2] 95 |190 |9025| 18050
frequency, f
Lr So ys == EXS), ae = 07,0) yee = 266050.
Le Oe Then:
eae Variance =
Be (Ba)
So) fi = 7 32 e364 16 =91 yas
_ 266050 (3870
Ss — ("53)
Cc) variance= Ae == a =e 5e = 0.7475
= 134.95838... = 135kg’ (to 3sig. fig.)
Answers
Q7 a) Work out the total duration of all the 23 eruptions Q2 a) The coded data values are 3, 7 and 8.
that Su has timed. The mean of these is y = (3 + 7+ 8)+3 =6.
This is > x =x = 23 = 3:42 =" 76.66 minutes. The standard deviation of the coded values is
Work out the total duration of all the 37 eruptions 7°+8" _ 6? — 2.16 (to 3sig. fig.).
3+
that Ellen has timed.
So the mean of the original data values is
3 .
variance =
n 28
So the mean of the original data values is
So oan == 23) (07-5 3-42-)) = 295.3499 x=—y — 100] s— 100 — 0:08:
Do the same for the 37 eruptions that Ellen has The standard deviation of x
timed — use the formula for variance. = standard deviation ofy + 100,
so standard deviation ofx = 5.51 + 100
2 2!
Answers
b = Eve ee eee es eee
Ace 2 :
ae 10 Exercise 3.5 — Skewness
a a A Q1 a) symmetrical
, b) positively skewed
Find the sum of squares of the coded values, . Hl
y?. This is 2 999. c) negatively skewed
ys Ly ; Here, you can see there's a bit of a tail on the left’ of the
Cavaronee Das sta 5/229 68, 33 distribution (though you have to imagine rotating the
n : 10 ; stem and leaf diagram 9O° anticlockwise).
= 4.41 Q2 a) Pearson's coefficient of skewness
This gives a standard deviation of /4.41 = 2.1 =p node
ean 25
©) Since y = 100(x — 0.6), 7 = 100(¥ - 0.6 standard deviation 1.6
esat Sete ee BONE This is positive, so the distribution is positively
This means:
skewed.
= _ 4.3 oes
ai = “1 0.6 100 + 0.6 = 0.643 cm b) Pearson’s coefficient of skewness
Since
y = 100(%
— 0.6), » stmean = median) an jax (le. 13.5) ae 45
stan. dev. ofy = 100 x stan. dev. ofx ~ standarddeviation 1.8 ‘
So stan. dev. ofx = stan. dev. ofy + 100 This is negative, so the distribution is negatively
= 2.1 + 100 =0.021 cm skewed.
Q4 Make a new table showing the class mid-points (x) ©) Pearson’s coefficient of skewness
and their corresponding coded values (y), as well as = “mean mode 14a a
fy, ¥ and fy’. ~ standarddeviation =—-.2.37
Weight (to nearest g) |100-104] 105-109] 110-114] 115-119 This is zero, so the distribution is symmetrical.
equencif 3 d) Pearson’s coefficient of skewness
112 ~ 3(mean = median) 3°x (—9 = 8))
~ standarddeviation 25
=-1.2
This is negative, so the distribution is negatively
skewed.
Q3 a) Quartile coefficient of skewness
One 2 Opa O ie x8+3
= Q;—-Q, 15 =3
> h 675 = 0.167 (to 3sig. fig.).
variance of y = Sp wee = 7 ae 25° This is positive, so the distribution is positively
— 17.1875 skewed.
— ui 7 b) Quartile coefficient of skewness
This means standard BRNO MN Oy 20.4 0) 164 oe ones
4.15 (to3 sig. fig.).
= SO oeOi 194 35
Now you can convert these back to values for x. = — 0,0566 (to 3 sig. fig.).
Sincey =x — 102: This is negative, so the distribution is negatively
¥ =y+ 102 = 108.25 skewed.
pea ° es Bye a Only very slightly skewed, though.
stan. dev. ofx = stan. dev. of y = 4.15 g (to3 sig. fig.). ©) Quartile coefficient of skewness =
Q5_ Using the coding y= * +2. Qs = 205 Om 10 = 2x (= 23) = 28)
ee =O. ON flat = ijn)
Tee = = 0.444 (to 3 sig. fig.).
PO amen =% mee This is positive, so the distribution is positively
And the variance of y is: eeaven
De — i oe — 0.352 = 3.8775 d) Quartile coefficient of skewness =
Qr=2Qrr Cimr 82 ae 24 _ 9
This gives a standard deviation fory Q;— Q;
of 43.8775 = 1.97 (to 3 sig. fig.)
6 FS S085 =A 1.65: This means that the distribution is symmetrical.
And standard deviation of x ae
= standa rd deviati on ofy = 1.97 (to 3 sig. fig.)
Answers
For Café B:
Q4 a) mean = 2s = w = ts, 50
mean = Ze = me == ff)
There are 8 values, so the median is halfway
between the 4th and 5th values in the ordered There are 14 values, so the median is halfway
list. So the median is £10. between the 7th and 8th values. So the
median is £81.50.
stan. dev. = x a a — 8.5° (ii) The mean is the same for both data sets, and
nA
= 3.20156... = £3.20 (to 3 sig. fig.). the median is only slightly different, so the
average daily sales in both cafés are very
b) Pearson’s coefficient of skewness similar.
3(mean — median) _ 3 x (8.5 — 10)
~ “standard deviation 3.20 b) (i) For Café A:
=-1.41 (to3 sig. fig.) OLI0l ae
stan. dev.= 2 =f 74
C) This is negative, so the distribution is negatively = eOLoe (tonearest penny)
skewed.
For Café B:
Q5 a) mean = 2% See
10
4.9
The mode is 22. stan. dev.= / akah =,/ 22074 7
=e IS bie(to nearest penny)
stan. dev. as
= De
as ase: =Sal 72>:
TO — 24.9°
(ii) The standard deviation is a lot higher for Café
= 12.3486... B, so sales are a lot more varied there (or
Pearson’s coefficient of skewness sales are a lot more consistent in Café A).
_ _mean—mode_ _ 24.9 — 22 c) (i) For Café A:
~ standarddeviation 12.3486...
There are 14 values, so Q, is the 4th value
= 0.235 (to 3 sig. fig.) and Q, is the 11th value from the ordered list.
b) This is small but positive, so the distribution is So Q, = £72 and Q, = £84.
slightly positively skewed. This gives:
Quartile coefficient of skewness =
Answers
(ii) The mean and median are both higher for the Q3 Add a row to the table showing fc.
women, so they get between 1 and 1.5 hours
more sleep per night, on average, than the
men.
b) (i) Forthe men:
Dawe = (MS, ape = 22" so mean
= 22.) |6= 17375
stan. nee = (3-73
Since there are 16 values, the median will be halfway
= 1.42 La (to 3 sig. fig.) between the 8th and the 9th values (which are both
For the women: 1), so median = 1
Mode = 0
stan. dev.= 2 x am” — 8.5? Q4 Extend the table to show the class mid-points (x) and
=' 1.63 foe? (to 3 sig.fig) the values of fx.
(ii) The standard deviation is slightly higher for
the women, so the number of hours of sleep
for the women varies slightly more from the
average than it does for the men.
c) (i) For the men: vf= 60, ¥ fe= 2226, so
Pearson’s coefficient of skewness estimated mean = 2226 + 60 = 37.1 mph.
~.3(mean = median) 3 x (7.3 — 7) Use linear interpolation to find the median.
standard deviation 1.42 Since there are n = 60 values, this means that
n + 2 = 30, and so the median will be in the
= 0.63 (to 2 sig. fig.).
class 35 - 39.
For the women: Now draw a diagram of that class showing the
Pearson’s coefficient of skewness cumulative frequencies at either end, as well as the
= 3(mean — median) | 3 x (8.5 = 8.5) _9 class boundaries.
standard deviation 1.63
eas <a eee
(ii) The distribution for the men is slightly 34.5 gi
ee eS
m 39
positively skewed, so there are a few 12 a, 30 49
relatively high values for the men, while most eee >
are relatively low. The distribution for the b,
women, on the other hand, is symmetrical. So you need to solve: m234,5 _ 30=12
39.5—345 49—12
This gives m = 5 X + 34.5
Review Exercise — Chapter 1 = 36.9 mph (to3 sig. fig.).
Ol 12-8513.2, 13.5, 1413, 14.3, 14.6, 14.8, 15.2, 15.9,
161, 16:1 16.2.6 917.0; .1722. “(all iniem) The modal class is 35 - 39 mph.
Always work out the class mid-points very carefully.
Q2_ Lots of fiddly details here — a table helps you get
them right. 5 MeanULtee 14 size
= 7
Length of calls
Number of calls =122=17.9
(to3sig. fig.).
Lower class boundary
Upper class boundary Stan. dev. = z
ee Bey Acoa1 2
Answers ail
Q7 Let
y=x — 20. b) IORH13 5513s:
So lower fence = 5 — 1.5 x 8.5 =-7.75
Then:
y= x— 20 of se—y-, 20 and upper fence = 13.5 + 1.5 x 8.5 = 26.25.
So the values 30 and 50 are outliers.
dy = 125 and Yo yr= 221
C) This means the box plot looks like this:
So y= 13 = Seance 10 dees —{] }— x x
And:
Daa
variance for y = Ta 1.25* = 0.6475 and so OS
+++
10.1 2045
30 35 40) 25> 50
3
Pocket money (£)
standard deviation for y = 0.805 to3sig. fig.
This data is positively skewed. Most of the
But since y = x — 20, this must also equal 15-year-olds earned a relatively small amount of
the standard deviation for x. pocket money. A few got very large amounts.
So standard deviation forx = 0.805 (to 3 sig. fig.). Q11 Quartile coefficient of skewness =
Q:==207-POr 150= 2 x 13256 36
O7=Q; (S036
— ULSD
A possible sketch of the distribution would need to
be negatively skewed — something like this:
Answers
c) You could say various things — here, you can b)<x=
choose whether to say something about the
location or dispersion or skewness. And so })x = 42 x 19 = 798 [1 mark]
E.g. the times for Worker B are 3 minutes longer Dae es
than those for Worker A, on average. Variance of x =
190, ae.
The IQR for both workers is the same — generally
they both work with the same consistency. ee
at.Ce eee
= [1 mark] ee
The range for Worker A is larger than that for And so >)x? = (16 + 427) x 19
Worker B. Worker A had a few items he/she
= 33820 [1 mark]
could iron very quickly and a few which took a
long time. C) New >)x = 798 + 32 = 830 [1 mark]
Worker A’s times are positively skewed, whereas So new
xX= we = 41.5 [1 mark]
those for Worker B are negatively skewed.
New ae = 33820 + 32* = 34844 [1 mark]
d) Worker A would be better to employ. The median
time is less than for Worker B, and the upper So new variance = ae = 41.57 = 19.95
quartile is less than the median of Worker B. and new stan. dev. = 4.47 to3 sig. fig. [7 mark]
Worker A would generally iron more items in a
given time than worker B. 60.3 = 3.015g [1 mark]
a) = 0
Answers
b) The female median is halfway between the 8th
and 9th readings, so median = 64.5. This means
Chapter 2: Probability
the female median is higher than the male
median by 1 mark, so the females scored better
than the males on average.
1. Elementary Probability
Female range = 79 — 55 = 24. Exercise 1.1 — The sample space
Male range = 79 — 43 = 36 -
Q1 a) There is 1 outcome corresponding to the 7 of
The female range is less than the male range. .
Their scores are more consistent than the males’. diamonds, and 52 outcomes in total
[Up to 2 marks available for any sensible So, P(7 of diamonds) = 5
comments] b) There is 1 outcome corresponding to the queen of
You could also have compared the figures for the spades, and 52 outcomes in total.
mean, interquartile range or skewness. So, P(queen of spades) = a
a) Total number of people = 38 c) There are 4 outcomes corresponding to a ‘9’, and
So median is halfway between the 19th and 20th 52 outcomes in total.
So, P(9 of any suit) = s = at
values in the ordered list [7 mark].
19th value = 15; 20th value = 16, 13 '
so median = 15.5 hits [7 mark]. d) There are 26 outcomes corresponding to a heart
or a diamond, and 52 outcomes in total.
Mode = 15 hits [7 mark]
So, P(heart or diamond) = a — :
b) 38 +4 =9.5, so lower quartile = 10th value = 14
38 +4 x 3 = 28.5, so upper quartile = 29th value Q2 a) 6 of the 36 outcomes are prime numbers.
=i So, P(product is a prime number) = ~ oe i
[1 mark for both]
b) 14 of the 36 outcomes are less than 7.
So interquartile range = 17 — 14 = 3, So, P(product is less than 7) = tn — a
and upper fence = 17 + (1.5 x 3) = 21.5.
This means that 25 is an outlier [7 mark]. C) 6 of the 36 outcomes are multiples of 10.
° ~[T}— .~
So, P(product is a multiple of 10) = z = z
Q3 a) E.g.
(2
eeee re
AS Oreo 20) 22 24726
Number of hits [1 mark]
As Q, - Q, > Q, - Q,, and the line extended i 2 3:SA EA Ca eae Oe LO)
from the box is longer on the right-hand side,
the distribution seems to be slightly positively b) There are 5 ways of getting an even number and
skewed. [7 mark]
‘tails’, and 20 outcomes altogether.
D
# P(even number and tails) = 30-4 1
Make sure you can tell positive skew and negative skew
apart — from a histogram and from a box plot. Q4 a) g.
d) lf 25 was removed then the distribution would
be more symmetrical (but still slightly positively
skewed) [7 mark].
Find the total area underneath the histogram using
the grid squares [1 mark]:
24+1.5424+24+154+44+5+4+3+4=25 [1 mark]
So each grid square represents 2 lions [17 mark].
The number of squares for lengths above 220 cm is 7
[1 mark], which represents 7 x 2 = 14 lions [17 mark].
6 of the 36 outcomes are zero.
So, P(score is zero) = Oo een
A. 6
C) None of the outcomes are greater than 5.
So, P(score is greater than 5) = 0
d) The most likely score is the one corresponding to
the most outcomes — so it’s 1.
10 of the 36 outcomes give a score of 1, so:
30 wots
Answers
Q5 Start by drawing a sample-space diagram to show all
d) P(U AM’ AN’) = 1-(0.25 + 0.1 + 0.15 + 0.25)
the possible outcomes for the two spins combined.
= 0.25
Then circle the ones that correspond to the event
‘number on spinner 2 is greater than number on e) P(LU M) = 0.25 + 0.1 +0.15 = 0.5
spinner 1’. E.g.
P(M’) = 0.25 + 0.25 + 0.25 =0.75
10}®@ @ Don't forget to include P(L'’ 1 M’ MN’). You could
also find P(M’) by doing 1 — P(M) = 1— (01 + 0.15).
Q4 a) Number of outcomes in S’M F’ A G’
= 200-(17+
18+ 49+284+11+34+6)=37
Spinner
2
So P(S’ MF’ AG)= ia
b) You're only interested in those people who have
De? oe we || been to France — 49 + 28 + 34+11=122
Spinner1 people. The number of people who have been to
There are 13 outcomes that correspond to the event France and Germany = 28 + 34 = 62.
‘number on spinner 2 is greater than number on So P(G, given F) = oS = 4
spinner 1’, and 25 outcomes altogether.
C) Number of outcomes in SM F’ = 17 + 18 = 35.
So, P(spinner 2 > spinner 1) = i
35
So PS NF) = #2 = 7
Q5 a) Start by drawing a Venn diagram to represent
2. Venn Diagrams the information. If H = ‘goes to home league
matches’, A = ‘goes to away league matches’ and
Exercise 2.1 — Using Venn diagrams C = ‘goes to cup matches’, then:
ao
Q1 a) Label the diagram by starting in the middle with
the probability for AM B. Then subtract this
probability from P(A) and P(B). And remember
to find P(A’ M B’) by subtracting the other
probabilities from 1. So:
Answers
Q4 a) P(Y’)=1-—P(Y) =1-0.56 = 0.44 Q4 a) P(AMB) = P(A) + P(B)— P(A U B)
b) P(X MY) = P(X) + P(Y) — P(X UY) = 0.28 + 0.66 — 0.86 = 0.08
= 0.43 + 0.56 — 0.77 = 0.22 P(A MB) # 0, so A and B are not mutually
exclusive.
@) ROM ING) = 1 SO UING) = 1 = O77 OS
d) PX UY) =1-P(IXNY)=1-0.22 =0.78 B) =PAN Ce Oe ee
0.28 + 0.49 —0.77 =0
Q5 a) PC’ ND)=P(C) + P(D)- PIC UD) P(A 9 C) = 0, soA and C are mutually exclusive.
= (1 —0.53) + 0.44 — 0.65 = 0.26 ©) PIBAC)=P(B) +P(C-PBUC)
e108
Fe Answers
DYSON Pry) PCAN EAM) te
Se
el) _ 100
755 2
Q5 A and B:
cl ) P(EM M) He P er ae ple
100
ge ras as nn
ey Pee) P(A) x P(B) = = = P(A 1B),
= eM) 700 _ 39 _ 13
P(M’y ss60 *—3O
«C200 so A andB are independent.
100
A and C:
d) P(A|E’) PANE) aa 4 _
OVE = sche =paps 18
eo= 36 100
P(A) x P(C)= 3
; x 78=— AS
308
There are lots of ways to write an expression for d) Since Z’ and Y’ are independent,
POX MYA) eg) PZ XA YYP(XM Y) or RZ lea = |Z) = 11 =0418 =O. 52
PIX|YN Z)P(YM Z). You have to choose the way that e) SinceY and X’ are independent,
makes best use of the information in the question. P(Y |X= R= OF68
Q7 LetJ = ‘Jess buys a DVD’, K = ‘Keisha buys a DVD’
Exercise 3.4 — Independent events and L = ‘Lucy buys a DVD’.
QI P(X MY) = P(X)P(Y) = 0.62 x 0.32 = 0.1984 a) The probability that all 3 buy a DVD is
Q2 P(A)P(B) = P(A NB),s PUMKAL). Since the 3 events are independent,
p(ay= PAO
PB)
B) 0.45 you can multiply their probabilities together to
~ T=0.25= 0.6 get: PU) x P(K) x P(L) = 0.66 x 0.5 x 0.3 = 0.099
Q3 a) P(MAN) = P(M)P(N) = 0.4 x 0.7 = 0.28 b) The probability that at least 2 of them buy a DVD
will be the probability that one of the following
b) P(MUN) = P(M) + P(N) — P(MMN)
happens: JN KN LorJQAKNU orJNK’ AL or
= 0.4 + 0.7 — 0.28 = 0.82
Jemkeiag
c) P(MAN’) = P(M)P(N’) = 0.4 x 0.3 = 0.12 Since these events are mutually exclusive, you
Q4 a) LetA= ‘Ist card is hearts’ and B = ‘2nd card can add their probabilities together to give:
is hearts’. Then, since the first card is replaced 0.099 + (0.66 x 0.5 x 0.7) + (0.66 x 0.5 x 0.3) +
before the second is picked, A and B are (0.34 x 0.5 x 0.3)
independent events. = 0.099 + 0.231 + 0.099 + 0.051
So, P(A Mm B) = P(A) x P(B) = 0.48
There are 13 hearts ps of the 52 cards, so P(A)
= itq
and P(B) both equal 5
2 Exercise 3.5 — Tree diagrams
So, P(A B) Ee= h5
ligy x ~ ayPras
ie Q1 a) The events are not independent because the
probability that Jake wins his 2nd match depends
b) Let A = ‘1st card is ace of hearts’ and B = ‘2nd
on whether or not he won his 1st match.
card is ace of hearts’. Then, since the first card is
replaced before the second is picked, A and B are b) (i) P(Win then Win) = 0.6 x 0.75 = 0.45
independent events. (ii) P(Wins at least 1) = P(Win then Win) +
So, P(A M B) = P(A) x P(B) P(Win then Lose) + P(Lose then Win)
There is 1 ‘ace of hearts’ out of the 52 cards, so = 0.45 + (0.6 x 0.25) + (0.4 x 0.35)
P(A) and P(B) both equal 55 Oy
Or you could find 1 — P(Lose then Lose).
1 [eegee |
So, BAW By 55% 55 = 5704 -
Answers
Q2 a) 1 Heads Q6 a) Let R, = ‘ball i is red’, Y, = ‘ball i is yellow’ and
2 G, = ‘ball i is green’, for i= 1 and 2. Since the
Ye ae first ball isn’t replaced, the second pick depends
= 2 Tails on the first pick and you get the following tree
S\Not 31 Heads
diagram:
ee | \
2 Tails
Sl 1
b) P(wins) = Pn Tails) = +x
Onan
= 1+
mel =heYf\= RLS!
pS
On \
Q3 a) Let D = ‘passed driving test’ and U = ‘intend to go
to university’. Since D and U are independent, ale
eS
P(U |D’) = P(U|D), and so:
lua \
nN
O75 9)
als 1
INAS
03220 au eS
@)
a
72
Ou
y
xe 025 >
== nN
Since a is
72 ~ 66’ a player is more likely to win
now that the game has been changed.
You could also answer this question by explaining that
when a ball of one colour is selected, then replaced, the
proportion of balls of that colour left for the second pick
SO BVA) = P(RIGWA) = RUR nA)
is higher than ifit isn’t replaced. So the probability of
= (0.56 x 0.72) + (0.44 x 0.33)
picking the colour again is higher.
= 0.5484
Q5 Let S = ‘owns smartphone’ and let C = ‘has contract
costing more than £25 a month’. Then you know the Review Exercise — Chapter 2
following probabilities: P(S) = 0.62, P(C) = 0.539 and Q1 a) The sample space looks like this:
P(C’ |S) = 0.29, and you want to find P(S|C).
Using the conditional probability formula: Dice
1 2 3 4 5 6
P(S iC)
PS |C)=
rors Eth |e 4 6 8 10 12
Use a tree diagram to help you find P(S MC). You CoinT 6 Hi 8 9 10
don’t need to label all the branches, just the ones that
help you answer the question: b) There are 12 outcomes in total, and 9 are more
O71 C than 5, so P(score more than 5) = =F:3 9
Answers
Q2 a) 50% of the students eat sausages (S), 20% eat (iii) Choosing an upper school pupil means either
chips (C) and 2% eat sausages and chips, so you ‘boy and upper’ or ‘girl and upper’.
get the following Venn diagram: P(B MU) = 0.4 x 0.3 = 0.12
ACG iO) SOS S105 SO sx0)
So P(U) = 0.12 + 0.30 = 0.42
Ss
b) Using the Venn diagram, 18% of students eat
chips but not sausages.
ie
c) Using the Venn diagram, 18% + 48% = 66% eat
chips or sausages, but not both.
Q3 It’s best to start by drawing a sample-space diagram:
[1 mark for the central figure correct, 2 marks for ‘5;
‘7’ and ‘4’ correct (get 1 mark for 2 correct), 1 mark
for ‘287 ‘15’ and ‘11’ correct, plus 1 mark for a box
with ‘47’ outside the circles.]
b) (i) Add up the numbers in all the circles to get
73 people out of 120 buy at least 1 of the
soaps [7 mark].
Or you could do 120 — 47 = 73...
So the probability = x [1 mark]
(ii) Add up the numbers in the intersections
a) to get5+3+4+7=19,
meaning that 19
people buy at least two of the soaps
; [1 mark].
b) 7 of the 36 outcomes are square pes (since 4 Sethe Drobabiliuea Dereon OU eat eset Wo
and 9 are square), so P(square) = 36 See as Tt marke
c) It isn’t possible for her score to be both a prime (iii) 28 + 11 + 15 = 54 people buy only1 of the
number and a square number, so the events P and Gone b 5
S are mutually exclusive. soaps [7 mark], and of these 15 buy soap
[1 mark].
d) Since P and S are mutually exclusive, So the probability of a person who only buys
PS 7SEEN ayo 5 67
36 p=— 22
36 ti
4 : type B is lees
one of the soaps buying = = +5
e) The score from the second experiment is [1 mark].
unaffected by the score from the first experiment, Let R, = ‘Ist red’, W, = ‘Ist white’,
so the events S, and S, are independent. G, = ‘Ist green’, R, = ‘2nd red’, W, = ‘2nd white’
79 49 and G, = ‘2nd green’. Then:
a) rea 36, 36 1296 ALR,
Q4 a) Let B=boy, G= girl, U = ‘in upper school’ and 4, W,
L = ‘in lower school’, then: Wy pe V1 G
0.3-U hy R,
0.4 B : Yi a
Oxiral: See
0.5 U
Y,12 : ;
4 Ww,
0.6 G Z11 G
b) (i) Reading from the tree diagram, P(L|B) = 0.7. branches on the right-hand side correct]
(ii) PCG OL) = 0.6 x 0.5 = 0.3
Answers
b) The second counter is green means one of three a) P(chicken or ice cream but not both)
results: ‘red then green’ or ‘white then green’ or = P(C NS) + P(BAI) [1 mark]
‘green then green’. So P(2nd is green) = = (3 x 2)+ (3 x 4)[1 mark]
3 y 5 pats a
(53 * ta)+ Ga * Ta) + aa * Ta) At marl Sein 6: ole
IS i 20 20
2. _ 55. es is? ire ne
= 732 + 732 + 732 = T32 ~ 12 [FPmark b) P(ice cream) = P(C M1) + P(B. 1) [7 mark]
c) For both to be red there's only one possibility: aol? Do:
‘red then red’ [1 mark]. = (3 x §) + ( x g) [1 mark]
P(both red) = 5 x 4 = 25 = x, [1 mark] - oe SO.
= 7s a=
soll mand
d) ‘Both same colour’ is the complementary event Cc) P(chicken |ice cream)
of ‘not both same colour’. So P(not same colour)
= 1 —P(both same colour) [7 mark]. Both same = P(C|I) = on 2 [1 mark]
colour is either R, and R,, or W, and W,, or
G, and G,. So P(not same colour)
pCAh= « and P(l) = 4 (both from b) above)
PEMA Pe TS
erSes eet a) eo ° PCE = 75 * 309 [1 mark]
=1-[3 <4) + (aX A) + Ga * 7)
[1 mark] pen
= 79 [1 mark]
ee OO ee a
=) — i397 = Too eGo tt mark! a) Let E = ‘goes every week’ and R = ‘plans to renew
membership’. Then P(E) = 14 P(R) = A: and
(Alternatively, 1 mark for showing P(R,W, or = 20" ~ 520
R_G, or W,R, or W,G, or G,R, or GW), 1 mark P(E|R)= 73.
=10
for adding the 6 correct probabilities and 1 mark PiRia 2) = P(E |R)P(R) [1 mark]
for the correct answer.)
a) (i) J and K are independent, so
OA
= 3 [1 mark]
x x0
Pin Kyr= RO) Pik)! = O07 < OF) = 0:07
ma=aq'=
eal) 3 [1 mark]
[1 mark]
(ii) PJ UK) = PJ) + P(K) — PU 9K) [7 mark] b) P(E MQ R’) = P(E)— P(E R) [7 mark]
= 0.7 + 0.1 — 0.07 = 0.73 [1 mark] welt alge
b) Drawing a quick Venn Diagram often helps: ye aS 91 1
Cc)
ty
[1 mark]
So going to the club every week and renewing
membership are not independent [7 mark].
Or you could say that P(E) # P(E|R).
Eo27,
a) P(B’) = 1 — 0.2 = 0.8 [1 mark]
P(L|K’) = P(L 9 K’) = P(K’)
Now Ln kK’ = L — think about it — all of L is b) It’ll help if you draw a tree diagram before going
contained in K’, so L 1 K’ (the ‘bits in both L and
any further. If F = ‘roll a 6 on the fair dice’:*
K’ ) are just the bits in L. YEE
Therefore P(L M K’) = P(L)
\2 B Se
=1-P(K UJ) =1-0.73 = 0.27 [1 mark] 76 SE
P(K’) = 1 — P(K) = 1 — 0.1 = 0.9 [7 mark]
And so P(L|K’) = 0.27 + 0.9 = 0.3 [1 mark]
4 Start by drawing a tree diagram to make things easier.
Let C = ‘eats chicken’, B = ‘eats beef’, | = ‘eats ice You want to find P(B U F). Now, either at least
cream’ and S = ‘eats sponge pudding’. one of the dice shows a 6 or neither of them do,
so these are complementary events.
leet So P(B U F) = 1 — P(B’ MN F’) [7 mark]
es = 1 —($x2)= 1 —=4 [1 mark]
yl
4% ~B Or you can use the addition rule:
P(B U F) = P(B) + P(F) — P(BN F)
Answers
c) P(exactly one 6 | at least one 6)
_ P(exactly one 6 1 at least one 6) b) The probability of the score being even is 2 =
P(at least one 6) pa vared
and the probability of ‘otherwise’ (the score bei SNIA
8
The next step might beabit easier to figure out if odd) is the same. The probability distribution is:
you draw a Venn diagram:
x
Son (6k="1 apie
sok = ze.
Answers
F(S) Ree)
= P(X = 5) + P(X = 4) + P(X = 3)
+ P(X = 2) + PX = 1)
=0.2+0.2+0.3+4+02+01=1
Using all this information, the cumulative
Q5 a) Drawasample-space diagram to show all the en ves va
possible outcomes:
Score on dice 1 a
N Cia wh ea 0.ne0.3 (|
S
7
Ms Y=
1 On6 F(2) = P(X < 2) = P(XY= 2) + P(X= 1)
16 20.2 02205
c) PB <X<10) F(3) = PW < 3) = P(X = 3) + P(X =2) + PX = 1)
= P(X = 4) + P(X = 6) + P(X = 8) + P(X=
9) = 03 023032078
Bt el he ees oh
= 16%
@ PBeulE eo F(4) SS)
=:|IHooa P(X
<4)
= P(X = 4) + P(X = 3) + P(X = 2) + P(X =1)
Exercise 1.2 — The cumulative =0.24 0.3 40.25 03.= 1
distribution function Using all this information, the cumulative
Q1 a) Add up the probabilities to work out CistriburiopiTag ction ty
the values of F(x): : x 1 2 |3 4
F(1) = P< 1) =P(X=1)=0.1 F(x) | 0:3) |lo.5 [08 1
(2) sR OX ap (XG)
Xe)
= 02 = 0.1 2088 d) Add up the probabilities to work out the
F(3) = P(X< 3) = P(X= 3) + PIX= 2) + P(X=1) vel Oho)
=0.3+0.2+0,1=0.6 F(2) = PX < 2) = P(X=2) = 4
F(4) = P(X< 4) e eae =
= P(X= 4) + P(X= 3) + P(X= 2) + P\X= 1) os - oe Se
=0.2+03 +02 201 =08 hr aay
Answers
iy = — me ae 8) + P(X= 4) + P(X =2) So the cumulative distribution function
= Bae Se looks like this:
+ P(X
=4)+P(X =2
aay Sel oe Pe 31
Dettmann Open Aare 2p Se. Here you're just using the fact that
F(64) = P(X< 64) P(X< O) + P(X > O) = 1 (all the
= P(X= 64) + P(X= 32) + P(X= 16) probabilities add up to 1).
as a :+ ee q ¥ Soy Q3 a) Use the probability function to work out the
ch ayUae pealinee ieNy elect ty | F(x).
Se iba A ales. ’
Using all this information, the cumulative EG) Oa) 8
distribution function is: Oe eee b ees
(2) = P(X¥= 2) + CESS gag tats ay
F(3) = P(X¥
= 3) + P(X
= 2) + P(X¥=1)
Sy ieee eee
“88.88
ae
BecauseE you are adding on a constant term of 4
Q2 a) You need to draw up P a table showing 6 the each time, the rest are easy to work out:
cumulative distribution function, so work
out the values of F(x): F(4) = 4 = 3 FS)= 2, EG) 8 = 3,
Fi y= P= 1)=028
F(2) = P(X = 2) + P(X = 1) =0.14+0.3=0.4 F7)=4,8° Fe)=8=1
8
F(3) = P(X= 3) + P(W= 2) + P(X=1) So the cumulative distribution function looks like
= 0.45 + 0.1 + 0.3 = 0.85 this:
F(4) = PX = 4) + P(X = 3) + PX = 2) + PX = 1)
= 0/5 O45 + 0. +: 0.3 =|
So the cumulative distribution function looks like
this:
Patt Peeety
a a
apy = Dia Be 4 ON
= Iee
F(O) = P(X = 0) + P(X =—1) + PX =—2) —
Se eee Always check that the probabilities add up to 1 — if
> 1Oee neOon2> they don't, you know for sure that you've gone wrong.
a qo Waa et b) PX =1) =P(X< 1)-PIX< 0) =1-0.7=03
=i+t+2+5-2 P(X = 0)= P(X <0) - PIX <-1) = 0.7 -0.2 =0.5
: P(X =-1)
= P(X <-1) —-P(X<-2) =0.2-0.1 =0.1
BOS te +2)P(X
=-2)
r= 0) P(X
=-2) = P(X
<- -2) = 0.1 om
P(X=-1)
_ i 1 ;i Ze gl 25 So the probability distribution is:
= 5 Ge 5 oP To ateie sr =
; nee 1 ce
p(x) | 0.1 | 0.1 | 0.5 | 0.3
Answers
P(X = 2) = P(X < 2) —P(X< 1) = F(2) - F(1)
<32) =1 —
c) P(X= 64) = P(X< 64) - P(X (26 1, Chee We to7 Bop 2 19
=~ 164 (A= 4964 6
=32) = PX<32)
P(X -Ps 16)= $—
P(X = 16) = P(X <16) —P(X< 8) = 37 7
P(X= 8)= PUY<8) Ps 4)= 4-9 =
<4) - ee 2=4-35
=4) = P(X
P(X
P(X = 2) =P(X< 2) = 30
So the probability distribution
Sw is:
oo[so]s]3]3|a1a
=4)=P(X< 4)-P(X<
1
3) =1-0.8=0.2
S072 ea.
For 2 to the power of something to be equal to 1,
the power must be 0, so3 -k=0 > k=3.
P(X = 3) = PX s 3)—=PXs2)= i a
2) = P(X
< 2) —P(X <1) =a— To find the probability distribution, you need to
POG ai) =e Gai) 083 find the probability of each outcome:
Now P(X
= 2) = P(X= 3) so 0.8-a=a-0.3, P(X= 3) = P(X <3).— PW s2) = FB)—F2)
soZa= 1.1, soa= 0:55. = 90-9)L 92-3 99 _ pat
SOIRCG=2) =) Be 3) = 0225 1 SS 1
S|
So the ee distribution is: De
POXG
2) = POD)
= PX 1) = FO) F(a)
= 92-3) __ 911-3
=) 9-1 9-2
oul eb zeta,4
2 4
Q6 a) F(3) = P(X <3) = HPUX
Il
PX = 1) POS) Ft) a2 aie?
So (3 +k
SF = 133ns vi
4+h=
So the table looks like:
>34+k=5>k=2
To find the probability distribution, you need to
find the probability of each outcome:
P(X= 3) = PW< 3) — PWs 2) = FG) = FQ)
_G+2P_@+27_25
2 DS DiS
4 F(x) means the probability that the larger score on
the dice is no larger than x. This means both dice
=) se ae) Cel) 0) must score no more than x, where x = 1, 2, 3, 4,
C2) leo) 6
i ay ae 5 or 6. The ees) that one dice will score
no more than x is=, so the ee that Hoth
will score no more ohar rid Se Se
6 6nd,
Two dice rolls are completely independent, so the
Cae as can just be multiplied together.
SOIR)== ax = 12a, SG.
36
To find the probability distribution you need to
by FG) BOS 3a) R(X a find the probability of each outcome:
5 allllke
go BEN
1s G+ipa 64 PU =16) = POs BlPOs Oe
2
363
>34+k=4>5>k=1 = 36 25 Sa
36 36 36
To find the probability distribution you need to Sg oeae
P(X = 5) POTS 5) BOGS diate
find the probability of each outcome:
=o = 10293
P(X= 3) = P(X< 3) — P(X< 2) = F(3) — F(2) 36, Gb 936.14 =e
hoa) ea) eso PIX= 4)= Ps 4)- PIX<3) = 2 —
64 64 64 64 64 = dee
36 36 36
Answers
Q2 a) E(X) =[2 x 0.2] +[5 x 0.3]
~ eeoy a Nie
|S |& |
ale aly AIA
[oO |
als + [6 x 0.1]+[p x 0.4]
= 2.5 + 0.4p
SO 6.5=2.5+04p> p=10.
2. Expected Values, Mean and b) The probabilities add up to 1 so
Variance 0.5 + 02 -a+-02=>i1>a=0.1
E(X) =[4 x 0.5] +[8 x 0.2]
Exercise 2.1 — The expected value + [px a)+[15 x02]
Q1 a) E(X)= [0 x 0.2] + [1 x 0.2] + [2 x 0.2] = 6.6+0.1p
+ [3 x 0.2] + [4 x 0.2] =2
So 7.5=6.6+0.1p > p=9.
b) E(X)= [1 x ay]
14 t[2 x 14
G]+B x3]
14 Q3 a) The sum of the probabilities is 1 so
20s=a
TO" =D) STISCt, 02+a+01+b=15>a+b=0.7
=A
E(X) is 2.5 so
CMEC)= 320i 18 084 |= x 015)
= 1-9 02>, 2a-- 0:35 4b — 225
d) E(X) =[-2 x 0.1] + +1 x 0.2] + [1 x 0.2] = 2a+ 4b=2>a+2b=1
2px Oe Ob0s i)— 02 Rearranging the second equation to get a= 1 — 2b
e) First work out the value of k: and substituting this into the first equation we get:
The probabilities add up to 1 so (1 —2b)+b=0753b=03
3k+4k+5k+ 6k+7k=1
and putting b = 0.3 into the first equation we get
be | = i5=
Dijcs yee i
75° a=0.4.
Now you can work out E(x):
1 1 b) The sum of the probabilities is 1 so
EO x ae(l
D5 + 2)|462 M552
25 t 2) Oi a aseo4-
ON = il =>>asbb
= O06
Answers
©) (i) E(X)=[3 x 0.1] + [4 x 0.25] + [5 x 0.15] b) (i) The probabilities must add up to 1:
G0 aa Onle=nle=niai—Onoe
+ [7 x 0.3] + [9 x 0.2]
= 5.95 Gi) EX) = 3 x 0.31 #42 «023!
+ [-1 x 0.3] + [0 x 0.1]
(ii) E(X2) = [9 x 0.1] + [16 x 0.25] + [25 x 0.15]
eo
+ [49 x 0.3] + [81 x 0.2]
= 39) 55 E(X2) = [9 x 0.3] + [4 x 0.3]
+ {1 xi0.3] + [0.0.1]
=4.2
Exercise 2.2 — Variance Var) = 4.2 — (-1.87 = 0.96
1 a) E(X)=[1 x0.2] +[2x0.1)+ 3 x0.2 Q4 a) EX) =[3 x 0.2] + [4 x 0.3] + [6 x 0.1] + 0.4p
a a : mn i i 4 ue aka
seb=2.4+04
pee
E(X2) = [1 x 0.2] + [4 x 0.1] + [9 x 0.2]
+ [16 x 0.1] + [25 x 0.4] = 14
So Var(X) = E(X2) - (EW)? = 14 - 3.4? = 2.44 EUie= 0.2] Oana
+ [49
x0.4]
0.3] + [36 x 0.1
b) EW)=[1 x 5)+B x G+ 6x 9] = 29.8
+18 x ab} +19 x 35] + 10x 351 So Var(X)= 29.8— 5.2?= 2.76
99 b) The probabilities add up to 1:
to3sf)
= 35 (= 3.09 0.2+a+044+03=1 >a=0.1
2 = [1 x ft ‘ll
crqg!+ Bex“3! 0.2] + [4 x 0.1] + 0.4p+ [9 x 0.3]
= [1 x oe
-E(X) bas
E(X*) Zales
+ [64 x =] + (81 x 35] + [100 x 35] 5.7=3.3+04p > p=6
x 0.1]
E(X?) = [1 x 0.2]+ [16
541
oa Oe) + [36 x0.3]
x 0.4] + [81
So Var(X) = E(X?)— (E(X))? = 40.5
= 541_ (99) = -7.33to3s.4) Var(X) = 40.5 — 5.7? = 8.01
32 024 Q5 a) (i) All the probabilities add upto 1:
c) E(X)=[-2 x 0.2]+ ree arola) talear
+ [1x 0.1] + [12 x 0.4]= ot Se ae
Answers
C) i) V=20- s
20
3
20
7 25 x {19)} 36 x {19)|
4
rasta
(ii)EV) = (17
x0.1] + 114
x 0.2]
fa
|0.2 0.2
erm
ese enc = taexen al) ee
Var(Z)= Var(50— 2X)= (-2)?Var(X)
= 4 x 26.49
=105.96
(ii) E(Z) = [-1 x 0.1] + [2 x 0.2] b) (i) EX)= [-4x 4]+ ]
1 1
+ [5 x 0.3] + [8 x 0.2] + [11 x 0.2] = 5.6 + [2
Answers
1 1 1
—2X) = 7 — 2E() = 7 - (2 x (-1.78))
(ii) E(Y) = E(7 Q4 a) ED) = [1 x a [3 x z|+ [5 x Al
= 10.56 1 1 1
ee xe]
Var(¥)=Var(7 — 2X)= (-2)VarlX) = 4 x 7.23 +[7 =xg]+[9OK at
xe] +[1
= 28.9 to 3 sf. =6
(iii) E(Z) = E(7 + 2X) = 7 + 2E(X) ; ;
= 7 +(2 x (-1.78)) - b) ED) =[1 x 2]+[9 x g]+ [25 xs]
= 3.44 to 3 sf. : ;
Var(Z) =Var(7 + 2X) = (+2)°Var(X) = 4 x 7.23 + [49 x 2] +[81 xg] +[121 xg]
ee as = 143 _ 47.7t03sf.
Q4 a) E(X)=[10 x 0.2] + [12 x 0.3] + [13 x 0.1] 3
+ [15 x 0.3] + [16 x 0.1] = 13 Var(D) = 47.7— 6 = 11.7 to3.s4.
E(X2) = [100 x 0.2] + [144 x 0.3] + [169 x 0.1] Aevihigass
+ (225 x 0.3] + (256 < 0.1] = 173.2 Q5. a) = EO) =
Var(X)= 173.2 — 132= 4.2 (6 =a+ iy =—1
b) E(Y) = E(26 — mX) = 26 — mE(X) = 26 - 13m g? = Val) ae
So 0 =26-13m > m=2. (8-1+1¥-1_ 63
Var(¥) = Var(26 — mX) = (-m)*Var(X) = 4 x 4.2 =F etiatot eee
= 16.8 b) Substitute your values for the mean and variance
Cc). EQSEBxH=e) =3EX) c= 6.x 13)-e=39-e¢ into the inequality and rearrange — then it’s just a
So 30 =39-e>c¢=9 simple probability question.
Var(Z) = Var(3X — c) = 9Var(X) = 9 x 4.2 = 37.8 i P(X — |< 0) = P(X - 4.5\< 55)
= P(|\X-4.5|< 2.291)
Exercise 2.4 — The discrete uniform Hi the modulumefiy< 4:5 ss less tham zy Vint
Cieues : means that the size (i.e. ignoring any minus signs)
distribution of X— 4.5 is less than 2.291. For example, when
X = 3, X-4.5 =-1.5. The modulus of this is 1.5,
which is less than 2.291.
|X— 4.5|< 2.291 is satisfied by
X = 3, 4, 5 and 6.
So P(X
— |< 0) = PLX= 3, 4, 5, 6)
Seat) ae here ear
ips oyee ete
ee ee eA ae) 2A Q6 a) Any single digit number means that Y could take
> 2 = iw 2) any value between 0 and 9 inclusive.
oy r Gee
pak = S$" = OE2 Seaao ee45
o? =Var(Y)=Cae
1
4 &= (SOHRI
T2 bo 8=ecore
77 = 8.25
EY)= 24°Cee
= + = 135 b) P(\¥—n|
>0)=P(\Y-4.5| > ¥8.25)
= P(|Y—4.5| > 2.87
ek Saal (5—12+ 1f—1 45 : | : ae
Vi aoe ee = |Y—4.5| > 2.87 is satisfied by Y= 0, 1, 8 and 9,
qe so P(|Y — | > o) = P(Y=0, 1, 8,9)
Siehae lee eee
Pizimeh hades DDL Lie When, Te |05) c) P(|\¥—p| > 20)= P(\Y—4.5|
>278.25)
T0 | 10} 10| 10] 10|10|10| 10 | 10 | 10 = P(|¥-4.5|
>5.74)
Be 91 S26 : There are no values
that Y can take such
E(2)= 25" = US = 155 that|¥— 4.5]
>5.74, 50
PY —al > 20)=0.
Varig)
JO mes ee oP
OS A
ety Dae=1 _
99 | Q7 ate ~
ee
233 DEs ieee
Be 2 2 ile
= 6, a Vata
So the standard deviation is the square root of the 12
variance, i.e. /8.25
= 2.87 to 3s.f. ._ 7 a1 a eee
7 12 asbees
b) EC) SECS 27 = VEZ) Ok) =
Var(X) = Var(2 — 2Z) = (-2)*Var(Z) =4 x 4=16
Answers
Review Exercise — Chapter 3 = 0)=P(R <0)=011
Q1 a) All the probabilities have to add up to 1. PRS = PRS 1) PR = OVS 0520 = 0.4
So 0.5+k+k+3k= Oot okay Wes 5k
= 0.5, R= PR = 2) PR = S105 = 05
hese Onl
So the probability distribution ofR is:
b) P(Y <2) =P(Y=0)+P(¥=1)=0.5+0.1 =06.
r 0 | 2
Q2 PW = 0.2)=PW=0.2) = 0.2 PUR") Ol Ora lmOn5
PW < 0.3) = PW = 0.2) + PW =03)=0.4
POSRS)=.05
P(W < 0.4) = P(W = 0.2) + P(W = 0.3)
+ P(W = 0.4) = 0.7 There are 5 possible outcomes, and the probability of
P(W < 0.5) = P(W = 0.2) + P(W = 0.3) each of them is k, sok=1+5=0.2.
+ P(W = 0.4) + P(W = 0.5) = Mean of
X =on 4 =),
So the cumulative distribution function of W is : (Ae Ois let tk geil
Naa ON i a ae
Q3
cater [erfaTe
a) P(X<1)=P(X¥=1)=
a) As always, the probabilities have to add up to 1:
sosbvnc pp
ae (ore
5 of a
54 A aes
PX <2) = P(X¥=2)+ PiX= )=se+H= b) E(x) =(1 x 2)+ (2x4)
P(X
s 3) = P(X = 3) + PX = 2) + P(X¥=1)
Baer dis os +(3 xg)
+(4x 54)
5s aE STs25 145 24359 490.. Sia)
57
P(X s 4) = P(X = 4) + P(X =3) 24 cr2dee
+ P(X = 2) + P(X = 1)
orem, 6 14 31, 16 E(X’) = (1? x 2)+ (2? x 4)
BS 2 be AT 2S. By 25
+ (3°
3)+(# x33)
AS OE 27ia BONES 1 DONE 53
Se
WISETO
BSS 5 a5
Olin 251 gen 5 os 22
S55 24 ieee
So the cumulative FST function is:
Var(x) = EC) — [e(ap = 33— (12)
_ 424 — 361_ 63
64 ~+~«64
E(2X — 1) = 2E(X)—1
19, cons
Var(2X — 1) a= 2Nat(X) =A xX oe
036316
Answers
Q8_ To find the cumulative distribution function, you need eZ a) All the probabilities must add up to 1, so
to find P(X< x). This is the probability that the highest 2k+3k+k+k=1, i.e. 7k =1, and so k=
score rolled is no more than x. So all 3 dice must roll [1 mark]
a score no more than x. The individual probability of
one dice rolling a score of no more than x is - — so b) P(X< 0) = P(X= 0) =4 [1 mark]
the probability iseall 3 dice roll a score no more P(X< 1) = P(X= 0) + P(X= 1) = 2 [1 mark]
than x is (z) = (since the probabilities of the
316
6 P(X< 2)= P(X = 0) + P(X= 1)
individual dice rolls are ia [1 mark]
+ P(X =)e=2
SO) Fs) (G78)= Sl,
DpSey Oy 8 P(X< 3)= PX= 0) 4+ P(X= 1)
316
6
Now to find the probability distribution, you need to + P(X =2)+P(X=3)=1
work out P(X= x) for each x. [1 mark]
wap 5
AQ. |) 3Z |) iL
a6 316 21GH 2061 216.) 216 [1 mark]
5Teageabel
X=5 x =3 3 mark] c) P(X < 4.5) = P(X= 0) + P(X=1)
The probablyof sete 2 heads is: + P(X= 2) + P(X = 3) + P(X= 4) 7 mark]
3X- 5x 4 x 3-3 a (multiply by 3 because =0.5 [1 mark]
any of the three coins could be the tail — the a) PX=1)=a, PX = 2) =2a, P(X =3)=3a. *
order in which the heads and the tail occur isn’t Therefore the total probability is 3a + 2a + a = 6a.
important). [7 mark] This must equal 1, so a = z. [1 mark]
Similarly the probability of getting 1 head is: a a e(2 xe)
2
b) E(X) =(1 Xe) x)3
Led § lee
A ERBGA GER =seats [1 mark]
And the probability of getting no heads is:
tlie le eel a 5 [1 mark]
Do 7S Deeg
So the pera of 1 or no heads c) E(X?) = Var(X) + [E(X)P
2 aar = ant3 mark] = 2 + (5)= 2 5 a
: [1 mark]
cae the ee glistDungD of X is:
= 34 = 6 [1 mark]
d) E(3X+ 4) = 3E(X)+4=3 xf+4= 11
b) You need the probability that ¥>10p [1 mark] [1 mark]
180 Answers
E(X) = (0 x 0.4) +(1 x 0.3)
a)
+ (2X 0.2) + (3 x 0.1)
Chapter 4: Correlation and
=0+03+40.4+4 0.3 [1 mark] Regression
=1 [1 mark]
b) E(6X + 8) = 6E(X¥)+8=6+48 [1 mark] 1. Correlation
= 14 [1 mark]
C) The formula for variance is
Exercise 1.1 — Scatter diagrams and
Mant x)= EC) — [Ey correlation
So first work out E(X?): QI a) 25
E(X*) = (0? x 0.4) + (1? x 0.3) oOo
Then complete the formula by using your answer to Distance from centre, km
part a): ‘
b) Negative correlation
Var(X) = EX’) — [E(X)P = 2 — (1?) [1 mark]
Q2 a) ~30
=1 [1 mark]
d) Var(aX+ b) = a’Var(X)
Var(> — 3X)
= (-3)Var(X) = 9Var(X) [1 mark]
= ox
QO 2 & & & 10 12
=9 [1 mark] Length (cm)
a) E(X) = [0 x a] + [1 x 5d] + [2 x 5] + [3 x 0.2] b) Positive correlation
=7b+0.6 [1 mark] c) The circumference of 3.5 cm
$0 1.3 =7b+0.6 [1 mark] => b=0.1 [1 mark] or the length of 11.9 cm.
All the probabilities must add up to 1 so:
a+0.5+0.1+0.2=1
=> a=0.2 [1 mark]
[1 mark]
Exercise 1.2 — S,,, S,, and S,,
Q1 a) Add rows to the table for x”, y? and xy. And
b) EGS x+ 3) = 73E) +3 [1 mark] include a column showing the total of each row.
= (79 x 1.3)+3=4 [1 mark] actetal Gat! OralPace seeee 7See ae oe
12 O i)
C) E(X2) = [0? x 0.2] + [12 x 0.5] + [22 x 0.1] |x2|
1121 ||36 | |81 ||16 ||64| a | 25 | 347
+ [32 x 0.2] = 2.7 [1 mark] 1600
Var(X) = Bt*) — (E(X))? [7 mark] iy [26478 |162)20715221
6017385]
—1.32=1.01 [1 mark].
SO >) = 45, yp = 92, Doe S847,
d) Var(4fia 1 Var(X) [1 mark]
dv = 1600 and >)xy = 738.
=1.01 [1 mark]
Sy ae A5 x 92
b) Sa ey 2 2 738 5
1 pe B aw —§
= 146.57 (to 2 d.p.)
; a) _ 45°
= 57.71 (to 2 dp.)
Sa ae
eae
apres = 1600 —
92?
-
Ea 29 x- 109
Q2 a) S,=Yiy z 589
= 43.2
S =x nes 167 — 97
22 =-1.2
swy Soe le 2031 — 102 = 345.2
Answers
Q3 a) Make atable including p’, q* and pq.
pee xD ye 206x 50" Include a final column showing the
total of each row.
¢ Os 2062 _
xX fafa tefistte
7e pee arabe
gts. he
5, i pane (29) = 326 at = 76 lp?|169|81|225|100|64|121|
9
r25149| 4 |16/| 64[ 1 |
Q3_ S, n= ee = 1880 — 1156114,
pq - 4" 2X14 = 305 350
7 OTD 115?
=) pS 208s 6
oY = 172, >)pq = 350
= 178.83 (to 2 d.p.)
ie og — 4
S,,= yay = 2762 — 1142
14 _=596 e 2
S = pt — 22) = 1100 — 2 = 42
PP
s = ig
99
2D = 177 — 32 = 44
correlation coefficient
c) This means:
Q1 SF
DAY
te
BIS OCS.
ee ee a ea
—
WS = — 18
SS OR 04.19 io Cosisioanio
fi
V¥42 x 44 teases
= 343.7333...
Q4 a) First find >)/ = 43 and dw = 30.4.
S,= yy“OD
— 2 saad ZZ
473 752
—£2- = 98
oa
Then:
This means:
I
S. = Sipser = 181,75 - Bx304
oe Sey
¥ SxxSyy
= 18.35
=50 ey fig. 1y = 258 — 432 = 26.875
= — 0.2 72 (to
(to33sig.
sig. fig.) 5,= yp
343.7333... x 98
=
Q2 a) Make atable including x’, y° and xy. Include a (uM _ 198.2 304° = 12.68
final column showing the total of each row.
Sap = 4 w
b) This means:
Siw
i
18.35
= (26.875
1839. eesipttig:
X 12.68 2 0 994ttouO 23 Steteo
SS eae = 91148 — 1403 x 519 c) This value for r is very close to 1, which shows
n 8 that there is a very strong positive correlation
= 128.375 between the length ofa leaf (/) and its width (w).
The longer a leaf is (i.e. the higher the value of J),
S.=7 DxOre 246395 — OS
—“= 1403?
the wider it tends to be (i.e. the higher the value
of w).
= 343.875
S,=Lee
Dy 2 5192 _= 262.875
= 33933 — 312
b) This means:
——
Se
Sia Se
2 128.375 = ADT G08 sig.fig.)
V¥343.875 X 262.875
Ee
Answers
Q5_ Make atable including a2, s? and as.
b) This is fairly close to -1, and so there is a strong
Include a final column showing the
negative correlation between a patient's weight
total of each row. (w) and their kidney function (k). The higher a
patient's weight is (i.e. the higher the value of w),
3249 | 4225 |8836 7744
the lower their kidney function tends to be (i.e.
79.21 | 23.04] 29.16] 7.84 the lower the value of 4).
507.3| 312 [507.6 |246.4
V Sages
= 0,5
— 181.9444...
Vvl663.5559.5 30 lllooe.
2.
of
This means:
their score on the memory test tends to be (i.e. the
lower the value of s). —
Suis
Q6 a) Make atable including w”, k* and wk.
Include a final column showing the Se S00 eB dip)
A/a eS 42 0930 8
total of each row.
Because wu and v are related by linear transformations
[ow [Meet [e742] 56!|p3) 79 [esa | 64)|71s |80
88 |
to x and y, this means that the PMCC between x and y
|| 79 | 62 |23 [70] 48 [91 | a4|68 |37| must also be 0.012 (to 3 d.p.)
46.24 Create a new table showing the coded data values u
[wk|521.4|606.8|220.8|589.3[379.2 |491.4]537.6|482.8 |325.6 and v. Include extra rows for wv’, v and uv.
So Dw = 675, ik = 58.3, hw =51 975, | 1 48
om TOM Sm om Mom 0
ee = 421.69, wk = 4154.9 a 81 | 121] 64 | 356
See Dds"
ae eee oe x 58.3
= 217.6
Sa" =
Oe. AS ars 6757
9 So Diu = 48, v= 120; or = 6,
div? = 1982, Suv = 759
= 1350
Then:
Se (Qa a9 16912283
n 9
5,= Yiw mud = 759 — 48 X120 _ 39
= 44.0355...
aa are 48" = 68
This means: 2
Answers
This means: This means:
San
i
V Saga
See
vw
care
r= aia 5.oso tari ie:
JOO 835.5555. G20 Ma 2ete NE.) This means:
Because wu and v are related by linear transformations a ‘Sigs
Answers
Q3 Explanatory variable: the population of a town Call the equation of the regression liney = a + bx.
Response variable: the number of phone calls made Then the gradient of the regression line is b,
in a town in a week where:
It is the population that would affect the number of
calls, not the other way around. b=ee
S. = Saao —0.44457...
Q4 Explanatory variable: the amount of sunlight falling = -0.445 (to 3 sig. fig.)
on a plant in an experiment And the intercept of the regression line is a,
Response variable: the growth rate of a plant in an where:
experiment
It is the amount of sunlight that would affect the
a=>y 7 —br = trae ee
growth rate, not the other way around. = Be — (— 0.44457...) x 330
228
(Or you could say that the amount of sunlight can be
directly controlled, as this is an experiment.) =wit pantse ant fig.).
So the equation of the regression line of
yonx is: y= 31.2 —0.445x
_ Exercise 2.2 — Regression lines C) The value ofb tells you that the temperature
Q1 Call the equation of the regression liney = a + bx. decreases by approximately 0.445 °C for every
Then the gradient of the regression line is b, where: extra degree of latitude (since when x increases
E eSll 95 4oy by 1, y decreases by 0.445).
b= 5 = 739-4 = ':82855.. = 1.83 (to3 sig. fig.) The value of a suggests that on the equator (when
And the intercept of the regression line is a, where: x = 0) the mean annual temperature is about
dy = pet Sd 2E
a2=y—br=
d) (i) When x = 19, the residual is:
-us — (1.82855...) x e = 24 —(31.16006... — 0.44457... x 19)
= 1.29 (to 3 sig. fig.)
= ee(to 3 sig. fig.).
(ii) When x = 41, the residual is:
So the equation of the regression line of e = 10—(31.16006... — 0.44457... x 41)
NOM MIS2 0) — 223 2a Oon = -2.93 (to3 sig. fig.)
Q2 a) Q3 a) You first need to find S,, and S.,.
eeDee 92 x 264
ee
= 167.2
pe?
= 2
= ey.ce (072 92? 725.6
OF 10) 20 308 e240 50) COR 70 If the equation of the regression line is P = a + be,
b) Start by working out the four summations >)x, then the gradient of the regression line is b,
yD, ee and >ay: where:
It’s best to draw a table. == Be
Se —905.6
1672>.=O
Te pepey
a[|
41345
Answers
Exercise 2.3 — Interpolation and - q=-0.9 + 0.1p, x — => 20
and so 10y—3 =—0.9 + 0.1 x
extrapolation
Now rearrange this into the formy = a + bx.
Ql a) y is the response variable (since this is the
regression line ofy on x). So 10y = -0.9 + 0.05x — (0.05 x 20) + 3
j.e. 10y = 0.05x + 1.1
b) (i) 1.67 +0.107 x 5 = 2.205 Ory = 0.005x + 0.11
(ii) 1.67 + 0.107 x 20 = 3.81
3 y = 17.4 -0.78x, so t—45 = 17.4 — 0.78(w— 60)
Q2 a) 103 — 4.57 x 4 = 84.72 2 Rearrange this to give:
This is interpolation (since 4 is between 2 and
t= 17.4 —0.78w + (0.78 x 60) + 45,
15). This estimate should be reliable. or t = -0.78w + 109.2
b) 103 —4.57 x 20=11.6
This is extrapolation (since 20 is greater than the Q4 a) Sy LLY — 618 28 x 124
10
largest observed value of x, which was 15).
This estimate may not be reliable. = 270.8
CM 103457 <7 = 71-01
This is interpolation (since 7 is between 2 and
—(xy = 140
S,.= Dx 26° =6 61.6
—4
15). This estimate should be reliable. Call the equation of the regression inay =a-+bx.
Q3 a) VA 20, =139 Then the gradient of the regression line is b,
This is interpolation (since 20 is between 17 and where:
35, which are the values ofx between which data
was collected). This estimate should be reliable. oe_ Spall
oe2708:
Gs = 4.396103..
b) 1.4x50+7=77 = 4.396 (to4 sig. fig.)
This is extrapolation (since 50 is greater than the And the intercept of the regression line is a,
largest value ofx for which data was collected). where:
This estimate may not be reliable.
Q4 a) 58.8 — 2.47 x 7 = 41.51 — so the volunteer a=y—br= uy — ue
would be predicted to have approximately 42
spots. This is interpolation (since 7 is between 2 _ 124.— (4.396103...) x 28
and 22, which are the values ofx between which == Saree = ae ae fig.).
data was collected). This estimate should be
So the equation of the regression line of
reliable.
yonx is: y= 0.09091 + 4.396x
b) 58.8 —2.47 x 0 = 58.8 — so the volunteer would
be predicted to have approximately 59 spots. b) v-7= 0.09091 + 4.396 x (75 — 2)
This is extrapolation (since 0 is less than 2, which So v =-1.70 + 0.0440s
was the smallest value of d for which data was
collected). This estimate may not be reliable.
Q5 a) Start by working out the four summations >)x,
DW ee and oe
c) Using the formula for d = 30 is extrapolation, It's best to draw a table.
since 30 is greater than 22, the largest value of
d for which data was collected. The model isn’t
valid for d = 30, since you can’t have a negative
number of spots. But this doesn’t mean that the
regression equation is wrong.
_ Sy =275 _
b= <= 573337 = 4.796511...
= -4.80 (to3 sig. fig.)
Answers
And the intercept of the regression line is a, c) Explanatory: a person’s disposable income
where: Response: a person’s spending on luxuries
d) Explanatory: the number of cups of tea drunk per
= §0 ~ (4.796511...) x 28
day
Response: the number of trips to the loo per day
= 22.4 (to 3sig. fig.). e) Explanatory: the number of festival tickets sold
So the equation of the regression line of Response: the number of pairs of Wellington
yon iS y= 22.4 — 4.80% boots bought
b) Substitute expressions for x and y into your Q3 a) cil)=Sy 68e za aot = 46.504
regression equation: t— 29 = 22.4 — 4.8035
This means that: t= 51.4 — 0.480s (ies = 570455 — ted 10") = 183.24
b) beeSee ena
esas = oes...
Review Exercise — Chapter 4 = 3.94 (to3 sig. fig.).
Q1 a) 4547
40 x
35 x c) a=w—br, wherew= uu = 109.9
30 x
25 x x x x
and7 = 2 =e 4
20 x
15
x
10
Soa = 109.9 — 3.9403... x 51.74
5
0 7 : : - 7
x
- x
= — 93.9711...— 94.0
(to3sig. fig.).
50 100 150 200 250 300.
d) The equation of the regression line is:
b) First you need to find these values: w = 3.94r — 94.0
dix = 1880, Dy = 247, x’ = 410400, e) When r= 60, the regression line gives an estimate
for w of: w = 3.94 x 60 — 94.0 = 142.4 ¢
> Y = 6899 and >)xy = 40600.
f) This estimate might not be very reliable because
Then it uses an r-value from outside the range of the
Sits a
= De Se OO
1880 x 247
ah
original data. It is extrapolation.
You'll be given the equations for finding a regression line
= —5836 — but you still need to know how to use them, otherwise
the formula booklet will just be a blur.
S
u (ux)es =_ 410 400 io
1880’ And you need to practise USING them of course...
= 56960 Q4 Rearrange the expressions P = r—5 and QO= 8Ww to get
formulas for r and w. These arer =P +5
Wy
Dy r= a ; cit iden
LIF eB 6899 —
Da7F
OM = 798.1 and w = 80.
Substitute these expressions for r and w into the
This means: regression line’s equation from Question 3.
a= De Wi= S94 03s xr 93-97 Iles,
Vv Sa So and so 8Q = 3.9403... x (P + 5) — 93.9711...
OD
= F56560
x7581 0.866 (to 3sig.fig.) Now rearrange to get an equation of the form
Q=a+bP:
c) The PMCC tells you that there is a strong negative SO 309403e. x Pi GA9403m 5 SI 93,9 /Alilnes)
correlation between drink volume and alcohol i.e. O=0.493P — 9.28.
concentration — cocktails with smaller volumes
tend to have higher concentrations of alcohol.
Don't panic about that PMCC equation — you need
Exam-Style Questions — Chapter 4
1 a) 0.90 7»
to know how to USE it, but they give you the formula in 0.80 x x
the exam, so you don't need to REMEMBER tt. 0.70 x
Answers
b) You need to work out these sums: Then the gradient b is given by:
dix = 36, Hy = 3.94, be oe
Se
— 0.29015
0.1226
25 6ees.
ex = 204, = 246076, ay — 17.66 = 2,367 (to 3d.p.).
[1 mark]
Then:
And the intercept a is given by:
Sy = Dixy —Ld 3.04
= 17.66 ex at BO
= = 0.07
Al == may
¥ Para
_ 67.49
Ser ee 2) 3006824 26.12
8
Ot) 36F
= 0.709 (to 3d.p.).
[1 mark]
(24676 = aa = 0.52715
2 2
Answers
b) There is a strong positive correlation between
the miles cycled in the morning and calories
consumed for lunch. Generally, the further they
cycled, the more they ate [7 mark].
Answers
e) The &(z) table doesn’t contain negative values of
Chapter 5: The Normal z, $O use a ibe to aesan equivalent area:
Distribution
2. The Standard Normal S1a74i0' 01.14
-0.629 00.62
Ee
Answers
Exercise 2.2 — Using the tables in b) P(0<Z<z) =0.4854
reverse = P(Z < z) = 0.4854 + P(Z < 0)
= 0.4854 + 0.5 = 0.9854
Q1 a) Using the ®(z) table, ®(z) = 0.8577 for z = 1.07
®(z) = 0.9854 for z = 2.18
b) Using the ®(z) table, ®(z) = 0.8264 for z = 0.94 Q5 a) Tala Zea) nD
c) P(Z<z) = 0.3783 = P(Z>-z) = 0.3783 SHO<6g<4 <7) S024
= P(Zs -z) = 0.6217 > z=0.31 >z=-0.31 = P(Z < -2) = 0.2422 + 0.5 = 0.7422
d) P(Z<z)=0.004 > P(Z>~-z) = 0.004 ®(z) = 0.7422 tells you that -z = 0.65 = z =-0.65
=> P(Z< -z) = 0.996 > ~=2.65 > z =-2.65 b) PzZ<Z<0)=0.1443
e) PZ > 2) = 0.758 P(Z <—2) = 0:758 = P(0<Z<-z) =0.1443
= —z = 0.70 = z=-0.70 = P(Z < -z) = 0.1443 + 0.5 = 0.6443
®(z) = 0.6443 tells you that -z = 0.37 = z = -0.37
f) P(Z>z)=0.9441 = P(Z<-z) = 0.9441
2s al a) P(1.5 <Z<z)=0.0406
26
= P(Z <z)—P(IZs 1.5) = 0.0406
2) R(Z> Zz) = 04801 = PZ =z) = 0.5199
= P(Z < z) = 0.0406 + P(Z < 1.5)
= 4 = (0.05 = PZ <z) = 0.0406 + 0.9332 = 0.9738
h) P(Z>z) =0.0951 = P(Z < z) = 0.9049 ®(z) = 0.9738 for z = 1.94
eo | b) POM8= Zz) = 10,0691
Q2 a) Using the percentage-points table, p = 0.005 for | => P(Z< z) - P(Z< 0.58) = 0.0691
Z=)2:5758 = P(Z< z) = 0.0691 + P(Z< 0.58)
b) Using the percentage-points table, p = 0.2 for => PZ <z) = 0.0691 + 0.7190 = 0.7881
z= 0.8416 (z) = 0.7881 for z = 0.80
C) A) On (Zz) = 3 = 244 C) ae1.3 <Z <z)= 0.871
P(GA<2z)—= PZ s—1-3)= 0.871
dR Az) = 0S RZ)
= 015 = 2 —1.0364
P(Z
< z) = 0.871
+ P(Z < -1.3)
©) ela Z4 7) le—alaZe)0a PZ <z)=0:871 +l —PZ<13))
— 420 ol— 2 —— 126 Ilo ate(Z<z)= 0.871 + (1 — 0.9032) = 0.9678
=>
Pre 2 Oo PZ = =z) = 0.15 (z) = 0.9678 for z= 1.85
= 7 = 1.0364 > z=-1.0364 d) P(-0.54 < Z<z)=0.667
gs) P\ZSz)= 0.6 => PZ <2) = 0:6 => PiZe =z) =0.4 P(Z < z)— P(Z < -0.54)= 0.667
7 0223) = 0.2539 P(Z< z)= 0.667 + P(Z< -0.54)
P(Z
( < z))== 0.667 + [1 —-P(Z< 0.54)]
h) RZ 2) — 0199 — RZ <7) 099
=> PZ( <z)= 0.667 + (1 — 0.7054)= 0.9616
iZiee2) 0.0 23.63) — 7 =| 283263
®(z) = 0.9616 for z = 1.77
Q3 a) P(-z < Z<z) =0.599, so the remaining area is
e) PGaZ = 027) 05458
1 — 0.599 = 0.401. Using symmetry,
— (AO) Az Ona4 8
P(Z =z) = 0.401 = 2 = 0.2005
= P(Z < z) = P(Z < 0.27) — 0.5458
= 4<— 4) = OS = 2 = One
= P(Z < z) = 0.6064 — 0.5458 = 0.0606
An alternative way of doing this is to divide O.599 by
But you can't look up O.0606 in the table, so there's
2 to get PCO < Z<z) =O2995. Then add O.5 to get
still some work to do...
AE < B) = OVS = P(Z = -z) = 0.0606 = PZ < -z) = 0.9394
b) Piz <Z<z)=0.9426, so the remaining area is ®(z) = 0.9394 tells you that -z = 1.55 =>z=-1.55
1 — 0.9426 = 0.0574. Using symmetry, P(g < Z<-1.25) = 0.0949
RiGee) 0)05745 2 = 010237 et
> ie -1.25)— P(Z< z) = 0.0949
=a) 10 Ole | 2) P(Z < z) = PZ < -1.25) — 0.0949
c) Pi-z<Z<z)=(0.4, so the remaining area is P(Z < z)= PZ 1.25) — 0.0949
1—0.4=0.6. Using symmetry, P(Z < z) =[1 —P(Z< 1.25)] — 0.0949
P(Z =z) = 0.6 +2 =0.3. Using the P(Z< z)= (1 — 0.8944)— 0.0949 = 0.0107
percentage-points table, p = 0.3 for z = 0.5244 = Pg = -z)= 0.0107 = P\Z < =z) = 0.9893
a
d) P(-z < Z<z) = 0.98, so the remaining area is ®(z) = 0.9893 tells you that —z = 2.30 > z= -2.30
1 — 0.98 = 0.02. Using symmetry,
P(Z > z) = 0.02 + 2 =0.01. Using the
percentage-points table, p = 0.01 for z = 2.3263
Q4 a) P(O < Z< z) = 0.3869
= P(Z < z) = 0.3869 + P(Z< 0)
= 0.3869 + 0.5 = 0.8869
@(z) = 0.8869 for z = 1.21
Answers
3. Normal Distributions and Q7 a) P(240 < X< 280)
- p/240 260 280
— 260
Z-Tables ieemip Le
ee ee
Exercise 3.1 — Converting to the P( Z < 1.33)— P(Z< -1.33)
Z distribution P(Z<133)-PZ21.35),
PZ
P( <1.33) =11 = PZ =1.33)
QI a) PIX<50)=P(Z< wt = P(Z < 2) =0.9772 = 9 x PIZ< 1.33) —1
= 2 x 0.9082 — 1 = 0.8164
b) PX< 43) = P(Z < ae
b) P(232 <X< 288)
=P(Z< 0.6)= 0.7257 " pilee 00 288— 260
= P/z> OG
PY > 28) 2A = P( iS teats )
Q2 a)
Xr> 28)= P(Z> AEA)=PP(Z
> 1.63)
= Peron = ee)
= 1—P(Z< 1.63) = 1— 0.9484 = 0.0516 = P(Z < 1.87) — P(Z< -1.87)
Answers
Q11 a) PUY > w) = oe
0.9177 = P(Z > ¥— 23) - 0.9177 c) P(6<L< 11) =P(2584 <7<U— 84)
al
| tO 7 aS 1.10272 0.84)
Using the ®() table, ®(z) = 0.9177 forz = 1.39. P(Z< 0.84)— P(Z< -1.10)
= P(Z < 0.84)— P(Z= 1.10)
So,BAW = 139 > w=73 -1.39 x 6 = 64.66 P(Z< 0.84)— [1 — P(Z< 1.10)]
b) PUY >) = 0.6664 = P(Z > £73) = 0.6664 nen n ME ee
Vie ead 6 Q2 a) Let F ~ N(36, 6?) represent the length of time
oF P(Z Se )= 0.6664 taken to replace red blood cells, in days.
Using the ®(z) table, ®(z) = 0.6664 for z = 0.43. P(T< 28) = P(Z<2838)
ViMeet
50) = CiKe = 0.43 =>k= 73 -0.43 x 6 = 70.42 = P(Z < -1.33)= P(Z> 1.33)= 1 — P(Z< 1.33)
= 1 — 0.9082 = 0.0918
Q12 P(.18-a<X<
18+ a) =0.899
b) Let b = the number of days taken by Bella.
= p(Bas—18 <7 < oewe
18 —a—18 18
Then, P(T'> b) = 0.063.
a
= Pe a 2 < wr) = 0.899 = IOV =— 0.063
= P(Z > 22)
The remaining area = 1 — 0.899 = 0.101. = P(Z aUre 0.9370
So, P(Z
(
= 05)
4.) =
O:
0.101 + 2 = 0.0505. 4
= Using the ®(z) table, ®(z) = 0.9370 for z = 1.53.
= P(Z< 24.)= 0.9495 So, 2=38
36 = 1.53
pee the ®(z) table, ®(z) = 0.9495 forz = 1.64. => b=1.53 x 6 436 = 45.18 days
So, ae
4 Ke7.642995 1.645004 = 0.82 Don't forget to say days’ in your answer — you need to
answer the question in the context it was asked.
Q13 P(170 <X <2) =0:377
170 = 170 Peng Q3 a) Let 7 ~ N(51, 2.1’) represent the ‘personal best’
= P( AO <Z< AO )= 0.377 time taken to run 400 m in seconds.
= P(0<Z< $510) - 0377 ee)
= P(Z> -0.81) = P(Z< 0.81) = 0.7910
= P(z< tle
ay) - P(Z < 0) = 0.377 So, 79.1% are slower than Gary.
PZ = ai = 0377 = 05 = 0.877 b) Let a= the time to beat.
a Then, P(T< a)= 0.2.
Using the ®(z) table, ®(z) = 0.877 for
z = 1.16.
P(Z
< 4") = 0.2
So, 0 = 1.16 = t= 1.16 x 40 + 170 = 216.4 zi, zi)=
Q14 P(107.6 < X< v) = 0.1677
rz > 2h=4) = 0.2
107.6 Using the percentage-points table,
= eae— 98 4 yot08\
ae )= 0.1677
p = 0.2 forz = 0.8416
= P(0.64 < Z< = ie28) = 0.1677 So,
254; = 0.8416
— 98
= P(Z < T=") — PZ< 0.64) = 0.1677 = a=51-2.1 x 0.8416 = 49.2 s (to 3 s.f.)
= P(Z Sec 1 8) = 0.1677 + 0.7389 = 0.9066 Q4 a) Let Z ~ N(300, 50?) represent the lifetime of a
Using the ®(z) table, ®(z) = 0.9066 for z = 1.32. battery in hours. a
200 —
So, X98 = 1.32 3 y= 1.32 x15 +98= 117.8 <200) = P(Z < “aa==)
P(L
=a Da P4sS DSi S4s 2)
° : ° ; = | — 019772 — 0,0228
Exercise 3.2 — The normal distribution
f . ° °
in real-life situations 80 — 300)
b) PUL > 380) = P(z> 380 300
Answers
d) P(160<L<h) = 0.9746 c) P(X> 528) =0.1292
160 — 300
> Reg a h — 300 5 0.9 746 eS p(Z> a
100
= 0.1292
h — 300 a Pp(Z < eal = 0.8708
ea 2:8 G2 ee a 0.9746
Using the ®(z) table, ®(z) = 0.8708 forz = 1.13.
=a p(z< 2= 300 ‘ P(Z<-2.8)= 0.9746 528 —
eles
rene7 < b=300Ms P(Z >2.8) = 0.9746 100
=> p= 528-1.13 x 100=415
seppieaaonyiin it (Z < 2.8)] = 0.9746
d) P(X< 11.06) = 0.0322
= P(Za he aT )= 0.9746
+(1 —0.9974) Thi) Oley poy
=>Pp(Z<d ee = 0.0322
=> P(Z< h =300)- 0.9772
Using the ®(z) table, ®(z) = 0.9772 for z = 2.0.
adPp(Z= ro = 0.0322
So,h— 300 2 n=? x 50 + 300
= 400 =P(z< a = 0.9678
Q5 a) Let M~ N(60, 3’) represent the mass of an egg in Using the ®(z) table, ®(z) = 0.9678 for z = 1.85.
grams. 11,06 —= 185
Oa
P(M > 60 — m) = 0.9525
== 1-05 x0:4 1-06 —sies
= P(Z >o0 60) = 019525
= P(Z > =r) = 0.9525 e) P(X> 1.52) =0.9938
Answers
b) PY< 528) = 0.7734 Subtracting equation 1 from equation 2 gives:
= P(Z < 228 =510) _ P(Z < 8) = 0.7734 36 —30-- (1) = 2.60- 1.460
=O oo 5
Using the ®(z) table, ®(z) = O:7734 tor z = 0175. Putting o = 5 into equation 1 gives:
= 18-075 30=18:0.75 =24 w=30—=1.4x5 = 23
SO =28 aie! ora. 5
C) P(X> 24) = 0.0367 b) P(X< 4) = 0.9332 = P/ et= | ZA 93380
= p(Z > 2412) x P(z>Z2)= 0.0367 ®(z) = 0.9332 for
z= 1.50
= P(Z <7) 7 = 0.9633 4— wu
See es 1.5 > 4-= . 1)
1.50 (equation
Using the ®(z) table, ®(z) = 019633 forz= 1.79.
P(X < 4.3)= 0.9987 => P(Z = Enea
3 = 0.9987
aTee oo a= = 1.79 = 3.91 (to3 sf.) ®(z) = 0.9987 for z = 3.00
d) P(X< 0.95) = 0.3085 = =F
43 —p
= 3 > 4.3 - 1 = 30 (equation 2)
= P(z< 0.95 —0.98) = P(z< — 9.03) Subtracting equation 1 from equation 2 gives:
4.3-4--(-) =30-1.56
= 0.3085 = 055 = SoS
sor= 0)
= P(Z> 2.03) = 0.3085 Putting o = 0.2 into equation 1 gives:
= 41 sc0.2ee 7
= P(Zs 2.03) = 0.6915 SOIL = 3.7 ance =O2
Using the ®(z) table, &(z) = 0.6915 forz=0.5.
C) P(X< 20) = 0.7881 = P(Z< sar =) = 0.7881
0.03
Se Cath = 0.5 > 0 = 0.03 + 0.5 = 0.06 ®(z) = 0.7881 forz = 0.80
Answers
e) P(X> 33) =0.1056 a) Let H=height incm. Then H ~ N(175, oe
= P(Z> 33 = BNE
)= 0.1056 P(H> 170) = 0.8
Don! )
—p(z>
= p(z> WO —2)=08
=> P(Z = a) = 0.8944
5 =0.8 > P(Z2 5)
= P(Z< 2) 5 = 0.2
&(z) = 0.8944 for z= 1.25
oe = 1.25 = 33 —p = 1.250 (equation 1) p= 0.2forz = 0.8416 = 2 = 0.8416
>0=5+
0.8416 = 5.94 cm (to 3sf.)
BOGS 21) = 019599
b) P(171 <H<179)
=P Ze ZI = =) = 0.9599 ens 179
—175
= P(Z <
p—21
)= 0.9599 =2( aac 52 ae)
a
= PE067 <=Z < 0.67)
(2) = 0.9599 for
z= 1.75 = P(Z < 0.67) — P(Z < -0.67)
= P(Z < 0.67) — [1 — P(Z< 0.67)]
e ae = 1.75 > p—21 = 1.750 (equation 2) = 2 x 0.7486 - 1
Adding equations 1 and 2 gives: = 0.4972
33 =21-—p+p=1.250+ 1.750 Let R = rainfall in cm. Then R ~ N(u, 07).
sS12=305>0=4
Putting o = 4 into equation 2 gives: P(R < 4) = 0.102 > P(Z< Le 20.102
w=1.75x4+21=28
So = 28 anda
=4 i — =0.102 > Piz < — = 0.898
f) P(X> 66) = 0.3632 ®(z) = 0.898 for
z= 1.27
=P(Z> nSa E) = 0.3632 > uae = 1.27 = p—4 = 1.276 (equation 1)
=> P(Z =
66 —
oe = 0.6368 P(R > 7) = 0.648 = P(z > 2—#) = 0.648
®(z) = 0.6368 forz = 0.35
66 —
= P(z<4#=*) = 0.648
= 0.35 = 66 — p= 0.350 (equation 1)
o ®(z) = 0.648 for z = 0.38
P(X< 48) = 0.3446
= eo = 0.38 > p-7 = 0.380 (equation 2)
48 — uw
=> P(Z z o )= 0.3446 Subtracting equation 2 from equation 1 gives:
p— 48 b= p—4 = (7) = 1.270 —0.386
=> P(Z > o )= 0.3446
= 3 =O0.0%e SoS 3.37 (to 3 Se)
u— 48
=> P(Z = ae = 0.6554 Putting o = 3.37 cm into equation 1 gives:
i= Ix B37 A =8.2 Brom (tolsisa)
®(z) = 0.6554 for z = 0.40 So 1 = 8.28 cm and o = 3.37 cm (to 3 sf.)
e aes = 0.4 = 1 - 48 = 0.40 (equation 2)
Adding equations 1 and 2 gives: Review Exercise — Chapter 5
66 — 48-11 + p = 0.350 + 0.40
Using the ®(z) table:
= 18=0./50 >0=24
Putting o = 24 into equation 2 gives: a) P(Z< 0.84) = 0.7995
p= 0.4 x 24+ 48 = 57.6 b) P(Z < 2.95) = 0.9984
So , = 57.6 and o = 24
c) P(Z> 0.68) = 1 —P(Z< 0.68) = 1 —0.7517
Q6 a) Let V= volume of vinegar in ml. = 0.2483
Then V ~ N(u, 52).
d) PZ2 155)] 1 = PZ <1 55)
DOUG
PWV= 506) =07/ 19 Pp(Za )=o.719 = 1-0.9394 = 0.0606
5
®(z) = 0.719 for z= 0.58 ©) RZ <2 0 Pie 2 10) = i Piano)
te DOGr = 1-—0.9821 = 0.0179
»
P=0.58 f) P(Z<-0.01) = P(Z> 0.01)
=.= 506—0:58 x 5 =503.1 ml = 1-P(Z< 0.01) = 1 — 0.5040 = 0.4960
b) P(V< 500) = P(Z< es g) PZS010)=1=PZ <010) =1 = 0.5398
= 0.4602
= P(Z <=0.62) = P(Z= 0:62) == PGs 0262)
= 1 -—0.7324 = 0.2676
h) P(Z< 0.64) = 0.7389
So, 26.76% of bottles contain less than 500 ml. i) P(Z>0.23)=1-P(Z< 0.23) = 1 — 0.5910
= 0.4090
Answers
P(0.10 < Z< 0.50) = P(Z< 0.50) — P(Z< 0.10)
= 0.6915 — 0.5398 = 0.1517 d) P(47 <X<57)= p(2 20 <7 < ey
P(-0.62:s Z < 1,10).= P(Z< 1.10) — P(Z < -0.62) = P(-0.75
< Z< 1.75)
= P(Z< 1.10) — P(Z> 0.62) = P(Z< 1.75)- P(Z< 0.75)
= P(Z< 1.10) — (1 — P(Z< 0.62)) = P(Z< 1.75)- P(Z> 0.75)
= 0.8643 — (1 — 0.7324) = 0.5967 = P(Z<1.75)— [1 - P(Z< 0.75)
P{-0;99 <7 <'—0174) = P(0.74 < Z < 0.99) = 0.9599— (1 - 0.7734)= 0.7333
= P(Z< 0.99) — P(Z < 0.74) Q4 a) P(X <0) = P(z<2=2)
= 0.8389 — 0.7704 = 0.0685
= P(Z< -0.71) = P(Z> 0.71)
Q2 If P(Z < z) = 0.9131, then from the &(z) table, =1-P(Z< 0.71) =1-0.7611 = 0.2389
Zee Or
b) P(X <1) = P(Z< 42)
b) If P(Z < z) = 0.5871, then from the &(z) table,
O22. = P(Z< -0.57) = P(Z> 0.57)
C) If P(Z> z) = 0.0359, then P(Z < z) = 0.9641. =1-P(Z< 0.57) =1-0.7157 = 0.2843
From the ®(z) table, z = 1.80.
d) If P(Z > z)= 0.01, then from the percentage-points
C) P(X> 7) = P(Z > 2>2) = PZ> 0.29)
table, z= 2. 3263: = 1-P(Z< 0.29) =1-0.6141 = 0.3859
) If P(Z < z) = 0.4013, then z must be negative (and d) P(2<X <4) = P(As2<7< 42) i
so won't be in the ®(z) table). '
= P(-0.43 < Z<-0.14)
But this means P(Z= —z) = 0.4013, and so
=P(0.14<Z< 0.43)
P(Z< -z) = 1 — 0.4013 = 0.5987.
= P(Z< 0.43) — P(Z < 0.14)
Using the ®(z) table, —z = 0.25, soz =-0.25.
= 0.6664 — 0.5557
Ifyou need to draw a graph here to make it a bit
= 0.1107
clearer what's going on, then draw one.
If P(Z = z) = 0.995, then P(Z< -z) = 0.995 OS a) Here
X ~ N(80, 15).
Answers
Q7 If X represents the mass of an egg in kilograms, Exam-Style Questions — Chapter 5
then XY ~ N(1.4, 0.37).
1 Let Y represent the exam marks. Then XY~ N(50, 157).
Watch out for the units here — you need to make them the
same for both the mean and the variance. a) PIX <30)= Piz 409) 2 PiZehs3)
[1 mark] :
PUK <a) = 0.8830 = P(Z < a4) = 0.8830
= P(Z> 1.33) =1-—P(Z< 1.33) [7 mark]
&(z) = 0.8830forz= 1.19 = 1 — 0.9082 = 0.0918 [7 mark]
Answers
3 a) Let X represent the lifetime of a battery in hours. For a normal distribution, mean = median,
a)
Then X ~ N(, 0°). so median = 12 inches [7 mark].
PX< 20) =0.4=5 P(Z < a E) = 0.4 [1 mark] b) Let X represent the base diameters.
20 Then X ~ N(12, o?). P(¥> 13) = 0.05 [1 mark],
= P(z>F— )=04 13 — 12 1
sO P(Z >aoe = P(Z > 5 = 0.05 [1 mark]
Using the percentage-points table,
p= OA Torz = 02533 Using the percentage-points table,
p = 0.05 for z = 1.6449
fu — 20
ar = 0.2533 [1 mark]
= . = 1.6449 [7 mark]
= 1 — 20 = 0.25336 (equation 1)
=>o0=1 + 1.6449= 0.608 (to 3 s.f.) [7 mark]
P(X< 26)=0.8 5 ieee )= 0.8 [1 mark]
26
= fi
C) P(X< 10.8) g= P(Z< (0.8aoe— 12 )
= P(Z= o P)=0.2 = P(Z< -1.97) [1 mark]
Using the percentage-points table, SHZS Ir)= 1=(4S< or
p= 02 1Oez = 0.8416 = 1 - 0.9756 = 0.0244 [7 mark]
So you would expect 0.0244 x 100 = 2 pizza
a F — 0.8416 [1 mark] bases to be discarded [1 mark].
=> 26 — 1 = 0.84160 (equation 2) d ~S P(at least 1 base too small)
Adding equations 1 and 2 gives: = 1 — P(no bases too small).
pb — jt — 20 + 26 = 0.25330 + 0.84160 [1 mark] P(base not too small)= 1 — 0.0244= 0.9756.
= 6 = 1.09496 => o = 5.47995... = 5.48 (to 3 s.f.) P(no bases too small)= 0.9756?= 0.9286
Putting o = 5.47... into equation 1 gives: [1 mark]
hi O25
3 Sao Are ate Ole? 1-4 (tO. 3S...) P(at least 1 base too small) = 1 — 0.9286 [1 mark]
So = 21.4 hours [7 mark] = 0.0714 [7 mark].
and o = 5.48 hours [7 mark] (to 3 s.f.) a) LetX represent the volume of compost in a bag.
Then X ~ N(50, 0.47).
b) From part a), X ~ N(21.4, 5.48?)
P(X< 49) = P(Z< a9
=
a 50 )
So P(X= 15) ba 15—21.4 )[1 mark]
= P(Z 23a
= P(Z < —2.5) [1 mark]
= P(Z==1.17) = PZ< 1.17) [1 mark] = P(Z> 2.5) = 1—-—P(Z< 2.5) [1 mark]
= 0.8790 [1 mark] = 1 — 0.9938 = 0.0062 [7 mark]
a) P(X> 145) = P(Z > Hehe b) P(X> 50.5) = P(Z > 282
= 20)
= P(Z> 1) [1 mark] = P(Z> 1.25) [1 mark]
= 1—P(Z< 1) [1 mark] = 1-—P(Z< 1.25) [1 mark]
= 1 — 0.8413 = 0.1587 [1 mark] = 1-—0.8944 = 0.1056 [7 mark]
So in 1000 bags, 0.1056 x 1000 [7 mark]
b) P(120 < X <j) = 0.4641 ~ 106 bags [7 mark] (approximately) would be
= p(20120 e7e ae = 0.4641 expected to contain more than 50.5 litres of
compost.
= p(o<z<L7
i200 )= 0.4641 ff mark] C) P(Y< 74) =0.1
= P(Z< See
Stee P(Z < 0) = 0.4641 = P(z< A=) = p(z<—3) =0.1
Se 3) = 0.1 [1 mark]
= P(Z< i).
use 0.4641 +0.5 Using the percentage-points table,
e ae
asi) - 0.9641 [1 mark] p =0.1 forz= 1.2816
= a = 1.2816 [1 mark]
a = 0.9641 for
z = 1.80
= 120 =o =i = lao
= 0.780 litres (to 3 s.f.) [7 mark]
= j=1.8x
25 +120 =165 [1 mark]
Answers
Addition law Dependent variable Histogram
A formula linking the probability of Another name for the response A diagram showing the frequencies
the union and the probability of the variable. with which a continuous variable falls
intersection of events A and B. in particular classes — the frequency
Discrete random variable of a class is proportional to the area
A random variable with ‘gaps’ of a bar.
between its possible values.
Bivariate data Discrete uniform distribution
Data that comes as an ordered pair of A discrete random variable has a
variables (x, y). discrete uniform distribution if all the Independent events
possible values it can take are equally If the probability of an event B
Boxplot happening doesn’t depend on
A diagram showing the median, likely.
whether or not an event A happens,
quartiles and greatest/least values of a Dispersion events A and B are independent.
data set, as well as any outliers. Measures of dispersion describe how
spread out data values are. Independent variable
Another name for the explanatory
variable.
Coding
Coding means transforming all the Interpolation
Event
readings in a data set to make the Predicting a value ofy corresponding
An event is a ‘group’ of one or more
numbers easier to work with. to a value of x within the range for
possible outcomes.
which you have data.
Complement (of an event A) Expected value
The group of all outcomes interquartile range
The expected value of a random
corresponding to event A not A measure of dispersion. It’s the
variable is the ‘expected’ mean of a
happening. difference between the upper quartile
large number of readings. It’s the sum
and the lower quartile.
Conditional probability of all possible values multiplied by
A probability is conditional if it their probability. Intersection (of events A and B)
depends on whether or not another The set of outcomes corresponding
Explanatory variable
event happens. to both event A and event B
In an experiment, the variable you
happening.
Continuous random variable can control, or the one that you think
A random variable which is measured is affecting the other.
on a continuous scale. It may take Extrapolation
any value in a given range (i.e. with Predicting a value ofy corresponding Linear regression
no ‘gaps’ between possible values). to a value of x outside the range for A method for finding the equation of
Correlation which you have data. a line of best fit on a scatter diagram.
A linear relationship between two Location
variables showing that they change Measures of location show where the
together to some extent. ‘centre’ of the data lies.
Fence
(A correlation does not necessarily
mean a causal relationship.)
If a data value lies outside a fence, Lower quartile
then it is an outlier. The value that 25% of data values in a
Cumulative distribution function data set are less than or equal to.
A function, F(x), which gives the
Frequency density
probability that a random variable, The frequency of a class divided by its
X, will be less than or equal to a class width.
particular value, x. Mean
A measure of location — it’s the sum
of a set of data values, divided by the
number of data values. ~
Glossary
Median Product moment correlation Standard normal variable, Z
A measure of location — it’s the value coefficient f# A random variable that follows a
in the middle of the data set when all A measure of the strength of the normal distribution with mean O and
the data values are in order of size. correlation between two variables. variance 1.
Mode
Stem and leaf diagram
A measure of location — it’s the most
A way of displaying data in which the
frequently occurring data value. data values are split into a ‘stem’ and
Qualitative variable
Mode ofa discrete random variable A variable that takes non-numerical a ‘leaf’. Data values with the same
The value that the random variable is values. stem are shown in the same row of
most likely to take — i.e. the one with the diagram.
Quantitative variable
the highest probability.
A variable that takes numerical
Mutually exclusive values.
Events are mutually exclusive (or just
Quartiles Tree diagram
‘exclusive’) if they have no outcomes The three quartiles Q,, Q, and Q, Tree diagrams show probabilities for
in common, and so can’t happen at divide an ordered data set into four
sequences of two or more events.
the same time. parts. Trial
A process (e.g. an experiment) with
different possible outcomes.
Normal distribution Random variable
A continuous ‘bell-shaped’ probability A variable taking different values with
distribution where the further from the different probabilities. Union (of events A and B)
mean a value is, the less likely it is to The set of outcomes corresponding
occur. Range
to either event A or event B (or both)
A measure of dispersion. It’s the
happening.
difference between the highest value
and the lowest value. Upper quartile
Outcome The value that 75% of data values in a
Regression line
One of the possible results of a trial data set are less than or equal to.
A line of best fit found using linear
or experiment.
regression.
Outlier
Residual
A freak piece of data lying a long way Variable
The difference between areal-life
from the majority of the values in a A quantity that can take a variety of
observation and what a regression
data set. values — its value is not fixed.
line predicts.
Response variable Variance
In an experiment, the variable you A measure of dispersion from the
Percentiles think is being affected. mean.
The percentiles (P,-P,,) divide an
Variance of a discrete random variable
ordered data set into 100 parts.
The ‘expected’ variance of a large
Probability distribution for a number of readings.
Sample space
discrete random variable
The set of all possible outcomes of a Venn diagram
A table showing all the possible A Venn diagram shows how a
trial.
values that the random variable can
collection of objects is split up into
take, plus the probability that it takes Scatter diagram different groups, where everything in
each value. Graph showing the two variables in a
a group has something in common.
bivariate data set plotted against each In probability, the objects are
Probability function
other. outcomes, and the groups are events.
A function that generates the
probabilities of a discrete random Skewness
variable taking each of its possible Measures of skewness tell you how
values. symmetrically distributed a data set is.
Z-tables
Product law Standard deviation Tables relating to the standard normal
A formula used to work out the A measure of dispersion calculated by variable (Z) — such as the cumulative
probability of two events both taking the square root of the variance. distribution function ®(z), and the
happening. percentage-points table.
Glossary
A D
addition law 61-64 Gata™ 3 independent events 68-70
assumptions 1, 2 dependent variables 114 testing for independence 69
discrete random variables 80-82 independent variables 114
discrete uniform distribution interpercentile range 25, 26
B 938799
discrete variables 3
interpolation 20, 21, 24-26,
118, Aig
back-to-back stem and leaf ‘interquartile range 23-25, 35
dispersion 23-26, 31-35,
diagrams 12 intersection (of two events)
45-47, 93
bivariate data 104, 106
grouped data 24, 25, 33 56-50 Olle oo 68
box plots 28-30
C E L
events 53 linear interpolation 20, 21,
causation 113
exclusive events 63, 64 24-26
class boundaries 4-7
expected value 90-92 linear regression 114-117
class mid-points 5
of a discrete random with coded data 121, 122
class width 5
variable 90 lines of best fit 104, 105, 114
coding data
of a function of a discrete location 14-17, 19-22,
finding mean and standard
random variable 96, 97 45-47
deviation 37-39
of the discrete uniform comparing measures 22
linear regression 121, 122
distribution 99 grouped data 19-21
product moment correlation
experiments 53 lower class boundary 5-7
coefficient 111, 112
comparing distributions 45-47 explanatory variables 114 lower fence 28-30
complement (of an event) 56-58
extrapolation 118, 119 lower quartile 23-25, 43, 44
conditional probability 65-68,
72-75
continuous variables 3
-
M
converting variables to Z 135 fences 28-30 mathematical models 1, 2
correlation 104-106 frequency density 7-9 mean 14, 15, 22,31-34, 41
limitations 113 frequency tables 4, 5 of a discrete random
cumulative distribution function variable 90
discrete random variables of anormal distribution 126,
86, 87
normal distribution 128, 150
G 143-145
of grouped data 19, 38
grouped data
cumulative frequency 17, 20, of the discrete uniform
mean 19, 38
21, 24-26
median 20, 21 distribution 99
modal class 19 using coding 37-39
quartiles 24, 25 medial 16," 17, 20-22) 41
standard deviation 33, 38 mid-points of classes 5
variance 33, 38 modal class 19
grouped frequency tables 4, 5 mode 16, 17, 22, 41
modelling process 2
multiplicationlaw 65-68
H mutually exclusive events 63, 64
histograms 7-9, 126
Index
N T
negative correlation qualitative variables 3 transforming variables to Z 135
105, 106, 108 quantitative variables 3 tree diagrams 71-75
negative skew 41-44 quartile coefficient of skewness for dependent events 72-75
normal distributions 126, 127 42-44 for independent events 71, 72
cumulative distribution function quartiles 23-25, 42-44 trials 53
12350
mean 126, 143-145
shape 126, 127
standard deviation
R U
random variables union (of two events)
127, 143-145
continuous 126 56) 50,50 1) eGs
standard normal distribution
discrete 80 upper class boundary 5-7
128-134
ranger 23735 upper fence 28-30
table of percentage points
regression lines 115-117 upper quartile 23-25, 43, 44
(eo
2 et
interpreting 117
using Z-tables 128-134
residuals 115
variance 126, 127
response variables 114
risk 1
V
variables 3, 80, 104, 105, 114,
O 126
variance 31-35
observations 3
outcomes 53, 54, 56
S of a discrete random variable
sample space 54, 56 93-95
outliers 28-30
sample-space diagrams of a function of a discrete
54, 83, 84 random variable 96, 97
scatter diagrams 104-106, 108 of grouped data 33, 38
P set notation 56 of the discrete uniform
Pearson’s coefficient of skewness simplifications 2 distribution 99
42, 44 skewness 41-47 Venn diagrams 56-60
percentage-points table 132, 151 Pearson's coefficient of 42, 44
percentiles 25, 26 quartile coefficient of 42-44
PMCC 108-113
positive correlation
standard deviation 32-35
of a normal distribution
Z
Z (standard normal variable) 128
TOS 1067108 127, 143-145
Z-tables 150, 151
positive skew 41-44 of grouped data 33, 38 using them 128-134
predictions 2 using coding 37-39
probability 53-75 standard normal distribution
formula for equally likely — 128-134
outcomes 53 statistical tables 150-151
probability distributions using them 128-134
80-84, 86-88 stem and leaf diagrams
discrete uniform distribution Tagen 7
98799 back-to-back 12
probability functions S. 106-108
31,782) 87, Go S. 106-108
product law 65-68 S,, 106-108
product moment correlation
coefficient 108-113
limitations 113
with coded data 111, 112
Index
S1 Formula Sheet
These are the formulas you'll be given in the exam, but make sure you know exactly
when you need them and how to use them.
Continuous Distributions
Standard continuous distribution:
www.cgpbooks.co.uk
0412 - 9870
A-Level Maths for Edexcel S1
This student book is a comprehensive guide to success in the Statistics 1 Module
of A-Level Maths, matched exactly to the Edexcel course requirements.
Features include:
os Straightforward EMORGR ene supported by clear examples.
= Hundreds of helpful tips throughout the book. |
* Practice questions and exam-style questions for each chapter.
* Detailed, step-by-step answers to every question.
* Iwo printable practice exam papers on the CD-ROM.
And since it’s a CGP book, rest assured it covers Shed you need
as painlessly as possible.
£13.99 www.cgpbooks.co:uk .
(Retail Price) ; . , ee
P es bs
De aw &
a = = nak. . aa. 5 " ‘ “
4 sr a i . eaOe
are