CH 1 Collecting and Presenting Data
CH 1 Collecting and Presenting Data
STATISTICAL ANALYSIS
COLLECTING AND
PRESENTING DATA
Today, we live in the Information Age. The world economy has progressed from manufacturing
products to the complex task of processing data to manage the needs of a growing population.
Statistics are used by governments to plan for community facilities, by scientists and economists
to predict the future, by businesses to forecast sales and by advertisers to help sell products.
CHAPTER OUTLINE
S1.1 1.01 Interpreting graphs
S1.1 1.02 Misleading graphs
S1.1 1.03 Types of data
S1.1 1.04 Sampling techniques
S1.1 1.05 Constructing graphs
S1.1 1.06 Frequency histograms and polygons
S1.1 1.07 Dot plots and stem-and-leaf plots
IN THIS CHAPTER YOU WILL:
• interpret and construct a variety of statistical graphs and displays: bar charts, sector graphs,
divided bar graphs, line graphs, Pareto charts, tables, frequency histograms and polygons, dot
plots and stem-and-leaf plots
• identify graphs that mislead and misrepresent
• classify data as numerical (either discrete or continuous) or categorical (either nominal or ordinal)
• distinguish between a sample and a population, and the collection of data for each
• distinguish between random, systematic, stratified and self-selected sampling, and determine the
appropriateness of each type for a given situation
• identify bias in sampling
iStockphoto/Pinkypills
TERMINOLOGY
bar chart bias biased sample
categorical data census class centre
class interval cluster continuous data
discrete data divided bar graph dot plot
frequency histogram frequency polygon line graph
nominal data numerical data ordinal data
Pareto chart population random sample
sample sample size sector graph
self-selected sample statistics stem-and-leaf plot
stratified sample survey systematic sample
SkillCheck
1 Match each statistical display to its correct name from this list:
WS
bar chart (column graph) divided bar graph dot plot
Assignment
Homework1
line graph sector graph
stem-and-leaf plot
a b
3 044
4 36668
5 2457
6 133
c 100 d
Rugby Union
80
Hockey
Netball
League
Soccer
Rugby
AFL
60
40
20
0
0 5 10 15 20 25
2 Draw the two statistical displays from the list in Question 1 that were not illustrated
in the question.
EXAMPLE 1
Solution
This sector graph shows the overseas countries Australian students overseas
where Australian students studied during Canada
6% Germany
one year.
4%
a What was the second most popular overseas
Other
country for Australian students to study in? 17%
b What percentage of students chose neither New
the United Kingdom (UK) nor the United Zealand
28%
States of America (USA)? USA
29%
c If a total of 3685 students was surveyed, UK
how many of them went to Canada? 16%
Solution
a New Zealand
b 100% − 16% − 29% = 55% Or 17% + 6% + 4% + 28% = 55%
c 6% × 3685 = 221.1 ≈ 221
221 students went to Canada.
d 17% × 360° = 61.2° ≈ 61° There are 360° in a circle.
Example
1 This bar chart shows the percentage Risky/High Risk Alcohol Consumption
18
1 of Australians drinking alcohol at a
16
‘risky’ or ‘high risk’ level in five different
years. 14
12
a What percentage of people were
Percentage
10
drinking at a risky/high risk level
8
in 2013?
6
b What was the percentage increase
4
in risky/high risk drinking in the
2
4 years after the 2009 survey?
0
1995 2001 2005 2009 2013
Year
2 This sector graph shows how Frank’s Pizza Frank’s Pizza Parlour: Expenses Example
Parlour spends its budget. 2
a What fraction of his budget does Frank
spend on ingredients? Ingredients
Labour 25%
b What is the largest expense? 35%
3 This divided bar graph shows causes of bushfires during one summer.
Causes of bushfires
Unknown
Natural
causes
Prevalence of smoking
30
Males
Females
25
15
10
0
18–24 25–34 35–44 45–54 55–64 65–74 75 and over
Age group (years)
5 This clustered bar chart shows the different ways people shop for sports/outdoor
equipment, groceries and home entertainment.
50
40
30
20
10
0
Wearable Mobile Tablet Computer TV Catalogue/ In-store Do not
technology* phone device shopping magazine buy
* e.g. watch, glasses
6 The graphs below illustrate healthy weight ranges, according to gender, age and body
mass index (BMI). Your BMI is calculated by dividing your weight (in kilograms) by your
height (in metres) squared.
over igh
26 m i n g 26 v e rwe
Beco o
ing
24 24 om
Bec
22 22
Healthy Healthy
20 20
18 18
16 16
14 Underweight 14 Underweight
9 10 11 12 13 14 15 16 17 18 9 10 11 12 13 14 15 16 17 18
Age Age
25
20
Number of raindays
20
Rainfall (mm)
15
15
10
10
5
5
0 0
Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec
Average monthly rainfall Number of raindays
© Copyright Commonwealth of Australia 2017, Bureau of Meteorology.
Attribution 3.0 Australia (CC BY 3.0 AU).
b What is the size of one unit on the vertical axis of the NCM Bank graph?
c What was the Optel share price on Monday?
d On which day was the NCM Bank share price $25.10?
e Describe what happened to the Optel share price over the week.
f What would have been the worst day to sell NCM Bank shares? Why?
g Which company’s shares showed more changes in price over the week?
10 These line graphs show the pollution index in three Sydney regions over a week.
Pollution index
Regional pollution index
25
South-west
20 North-west
East
15
10
5
0
Mon Tues Wed Thurs Fri Sat Sun
EXAMPLE 3
Three misleading graphs are shown below, used by the marketing manager of a computer
store to highlight the increase in the sales of notebook computers over five weeks.
Week 1 2 3 4 5
Computers sold 16 21 27 30 35
Solution
a In Graph A, the vertical axis begins at 10 instead of 0, so the section of the graph
between 10 and 40 has been exaggerated or ‘stretched up’ to give the impression
that sales have increased greatly.
In Graph B, the scale on the vertical axis is not even, which gives the false
impression that the increases in sales are greater than they actually are.
In Graph C, the horizontal axis has been compressed or ‘shrunk’ to make it look like
the sales have increased quicker.
Sales of computers
30
20
10
0 1 2 3 4 5
Week
Misleading graphs
A misleading graph can give a wrong impression by:
• not having a scale
• having an uneven scale or showing only part of the scale
• not showing the correct position of zero on the scale.
1.80
1.70
1.60
1.50
2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018
Year
2016
2018
Sales
b What message is the ex-director trying to convey
New company
to the shareholders? director appointed
c What information is missing from the graph?
Months
5 This bar chart appeared in a news article with the Votes to increase childcare funding
headline ‘Labor votes overwhelmingly to increase 63
childcare funding.’ 62
a How are Labor’s votes exaggerated on this 61
graph? 60
Number of votes
59
b Redraw the graph correctly.
58
c Do you agree with the headline? Justify your 57
answer. 56
55
54
53
Labor Liberal Greens
Political party
7 Joe drew this bar chart on the How Joe spends his day
right to illustrate how he spends 7
a typical weekday.
6
a How many hours does Joe
spend sleeping? 5
Hours
EXAMPLE 4
Solution
Nominal data
(cannot be ordered)
Categorical data
(words or symbols)
Ordinal data
(can be ordered)
Data
Discrete data
(counted: separate values, gaps)
Numerical data
(numbers)
Continuous data
(measured: smooth scale, no gaps)
Colour of hair is an example of nominal data because the categories cannot be ordered, for
example, blonde does not come before or after brown.
Month of birth is an example of ordinal data because the categories can be ordered, for
example, May comes before September.
Number of tries scored is an example of discrete data because they are numbers that are
obtained by counting. Discrete values are separate numbers (in this case, whole numbers) with
1
‘gaps’ between the possible values. You cannot have 2 2 tries or 1.8 tries scored.
Amount of petrol is an example of continuous data because they are numbers that are
obtained by measuring. Continuous values lie on a smooth scale with no ‘gaps’ between the
possible values. You can have 43.1 L, 43.15 L or 43.02 L of petrol.
EXAMPLE 5
Solution
a nominal
b ordinal For example, outstanding, satisfactory
c nominal
d ordinal For example, low, medium
Solution
2 What is the difference between categorical data and numerical data? Give an example of
each type.
3 Classify each type of categorical data as nominal (N) or ordinal (O). Example
a brand of car 5
b level of manager
c level of satisfaction with the government
d marital status
e favourite football team
f colour of Olympic medal
g size of soft drink sold at a fast food store
h a person’s gender
Determine whether sampling or taking a census is more appropriate for each study below.
a finding the most popular radio station in Newcastle
b conducting a survey on whether Australians want a new national flag
c finding the exact number of young women living on the South Coast, with the view
to building a new maternity hospital
d selecting the politicians to form a new state government
Solution
Types of samples
If a sample is to be truly representative of a population, then each item in the population must
have an equal chance of being chosen. Also, the sample size is important. The larger the sample,
the more accurate the results of the study will be. There are four types of sample, described below.
In a random sample, each item is chosen completely at random from the population (for
example, selecting names out of a box, or setting a computer to create random numbers to
choose people or items).
In a systematic sample, the first item is chosen at random and all other items are chosen at
regular intervals. For example, every 20th mobile phone is taken from a factory for testing, or
the driver of every 8th car on the road is selected for random breath testing. ‘Systematic’ means
a ‘system’ is used to select items.
In a stratified sample, the population is divided into
strata (layers) according to some characteristic (for
example, gender or age group) and a random sample is
taken from each layer using representative proportions
or percentages. For example, if 28% of all shoppers in
[Link]/Dwight Smith
EXAMPLE 8
Solution
EXAMPLE 9
At Newcent High School, there are 126 senior students and 572 junior students. The
student council wants to survey 80 students about the school’s environmental program.
How many senior students should be selected for a stratified sample?
Solution
EXAMPLE 10
Solution
a The group members are likely to be friends with similar opinions, so this sample may
not be representative of all Year 12 students.
b This is biased against people who are not at home during the day.
c The sample would contain only (self-selected) viewers of the program who have a
strong opinion about the subject or who make the effort to vote.
study below. 7
a counting the number of students learning Japanese at a school
b finding the number of people who watch soccer on SBS TV
c judging the most popular supermarket in town
d finding the number of Australians who were born in Europe
e investigating people’s opinions on whether Australia should become a republic
f determining the number of people in NSW who have red hair
g finding the exact number of schoolchildren living in Parramatta
h testing whether a brand of mobile phone battery lasts 50 hours
Example 4 A department store has a customer database of 3347 female shoppers and 1504 male
9 shoppers. It wants to introduce a new credit card to a stratified sample of 200 shoppers.
How many female shoppers should be included in the sample?
5 A public transport department surveys a sample of 600 commuters about the efficiency
of the city’s rail network. It is known that each region of the city has the following
number of commuters:
If the sample is to be stratified, how many commuters from the West should
be included?
ESTIMATING POPULATIONS
How many students at your school:
a have red hair? b have brown eyes? c are left-handed?
d are vegetarian? e walk to school? f have a dog as a pet?
A survey is often conducted using a questionnaire, which asks for details about a person’s
lifestyle, beliefs, attitudes and behaviours. The table below describes the features of a
good questionnaire, giving examples.
Estimate the answers to these questions by surveying a sample of 50 students from your
school and making appropriate calculations. How could you improve on your estimates?
INVESTIGATION
WS
DESIGNING A QUESTIONNAIRE
A survey is often conducted using a questionnaire, which asks for details about a person’s
Student survey form
Homework
lifestyle, beliefs, attitudes and behaviours. The table below describes the features of a
good questionnaire, giving examples.
WS
Feature Bad example Good example
2 Unambiguous questions ‘What type of car do you own?’ ‘What make of car do
Questions must not have Does the question mean: you own? Tick one box
more than one meaning or from the following list:
• the make of car (for example,
interpretation. …’ (This is followed by
Toyota)?
a list of different makes
• the style of car (for example, of cars.)
hatchback)?
• the size of car (for example,
small)?
3 Respect for privacy ‘What is your email address?’ ‘In case we need to
A survey should be People are concerned about contact you further
anonymous and not require receiving junk email if they regarding this survey,
personal details. Questions give out their email address. please supply your email
that are too personal should address. This is optional
be optional. and your details will
not be forwarded to
other parties or used for
promotional purposes.’
5 Consideration of a ‘How well did you enjoy your ‘How satisfied were
number of choices meal?’ (from a restaurant you with your meal?
A question may have many survey). Choose from one of the
different answers, so either Too open ended. There would following:
make the question very be a wide range of possible Very satisfied �
specific or provide a list answers that would be difficult Satisfied �
of answers from which to to organise and analyse.
choose. Not satisfied �
Very unsatisfied �
No opinion �
1 Explain what is wrong with each survey question and write a better question
each time.
a Are politicians paid too much?
b What is your favourite food?
c How often do you smoke?
d Which mode of transport do you employ to commute to your place of work?
e Are you happy at school?
f How satisfied are you with the level of customer service at the New Century
Bank?
2 a In groups of 2 to 4 students, design a questionnaire of 10 to 12 questions about
one of the following topics:
• TV viewing habits of people
• spending habits of teenagers
• students’ use of smartphones and/or tablets
• family and household details
• favourite sports to watch and play.
b Decide on the type of facts you wish to collect, then determine suitable wording
for each question and the format of each answer. Make sure that your questions
reflect the features of an effective questionnaire.
EXAMPLE 11
Jewish 91 022
a
First calculate the angle of each sector, rounded to the nearest degree.
563 674 Write each religion as a fraction and
Buddhist: × 360° ≈ 9°
23 401 892 multiply by 360°.
12 201 600
Christian: × 360° ≈ 188°
23 401 892
440 300
Hindu: × 360° ≈ 7°
23 401 892
604 240 Note: Due to rounding, the sector
Islamic: × 360° ≈ 9°
23 401 892 angles may not add exactly to 360°.
91 022
Jewish: × 360° ≈ 1°
23 401 892
2 567 348
Other religion: × 360° ≈ 39°
23 401 892
6 933 708
No religion: × 360° ≈ 107°
23 401 892
Religions in Australia
Buddhist
Buddhist Instead of labelling the
Christian sectors, you could provide a ‘key’
Hindu like this.
Islamic
No religion
Other
Jewish
Christian No religion
Other
b Draw a long rectangle (for example, 12 cm long), then calculate the length of the
section representing each religion, rounded to the nearest millimetre.
563 674 Write each religion as a fraction
Buddhist: × 12 cm ≈ 0.3 cm
23 401 892 and multiply by 12 cm.
12 201 600
Christian: × 12 cm ≈ 6.3 cm
23 401 892
440 300
Hindu: × 12 cm ≈ 0.2 cm
23 401 892
604 240
Islamic: × 12 cm ≈ 0.3 cm
23 401 892
91 022 ‘Jewish’ will be too small to show as a
Jewish: × 12 cm ≈ 0.05 cm
23 401 892 section.
Religions in Australia
Pareto charts
A Pareto chart contains a bar chart and a line graph combined. Individual values are
represented in descending order with bars. The cumulative total (or cumulative frequency)
of the data is represented by the line. For example, the Pareto chart below explains the
decrease in profits at a shoe store after surveying customers.
60 50%
40%
40
30%
20%
20
10%
0 0%
ult
le
es
es
ate
vic
tiv
lab
sho
sho
fic
qu
eti
ser
vai
dif
de
ty
of
mp
or
sa
ina
ali
ng
ip
co
Po
ize
qu
nsh
rki
ys
ot
es
or
pla
Pa
ma
sn
ho
Po
dis
ork
ice
ds
re
Pr
W
ite
Sto
Lim
2 Georgie won $800 000 on a quiz show and spent it as shown in the table. Example
a Why is a sector graph or divided bar graph suitable for this information?
b Draw a sector graph for this information.
c What did Georgie do with one-quarter of her winnings?
d What fraction of Georgie’s winnings was spent on home renovations?
e What percentage of Georgie’s winnings was spent on a new car?
ABC 3460
SBS 855
Seven 4140
Nine 4215
Ten 2300
4 The teachers at Summer Heights High School were surveyed on the number of cups of
coffee they drank yesterday.
Cups of coffee 0 1 2 3 4 5 6
Frequency 7 5 11 21 15 9 4
a How many teachers are there at the school?
b What type of graph can be used to illustrate this data? Give a reason for your
answer.
c What percentage of the teachers did not drink coffee that day? Answer correct to
one decimal place.
d If the recommended healthy coffee intake is no more than three cups of coffee per
day, what fraction of teachers had a healthy intake of coffee?
5 Serena wants to graph the results of a survey about people’s favourite ice cream flavour.
Which of the following graphs can she use? Select A, B, C or D.
A bar chart
B sector graph
C divided bar graph
D all of the above
600 50%
40%
400
30%
20%
200
10%
0 0%
ed
e
t
y
a
led
ho
es
erl
pin
ere
izz
lay
he
bil
rop
t
dp
liv
top
no
de
hc
ly
de
ge
dp
za
ery
ect
ate
ug
ma
ize
Piz
ake
no
orr
liv
qu
Da
gs
te
De
tb
de
Inc
ron
No
Ina
No
W
350 90%
200 50%
150 40%
30%
100
20%
50 10%
0 0%
Insects
Too warm/cold
Furniture worn-out
Damaged parts 90
Dirty 55
Missing instructions 45
Missing parts 15
Unpainted 10
Poor quality 5
a Download the ‘Pareto chart template’ worksheet from NelsonNet to draw a Pareto
chart for the data shown.
b What are the two most common complaints?
c Name two ways I-AKE could improve the quality of their furniture.
United Kingdom 20
New Zealand 35
Middle East 7
India 10
Africa 3
Asia 12
Other regions 6
a Draw line graphs showing both types of theft on the same axes. Include labels or a
key.
b In which year was home theft highest?
c In which year was motor vehicle theft lowest?
d Which type of theft is generally more common?
e Write a sentence to describe what happened to the number of home thefts over the
5 years.
f What do you think happened to motor vehicle theft in 2016? Justify your answer.
Cycle
8%
Bus
9% Train
Train Car
36% Walk
Walk
18% Bus
Cycle
Car
29%
Step 5: Format any features of your graph, including colour and design, by clicking on
it and using Format or Design from the Toolbar. You may also resize the graph.
Further investigations
• Repeat the steps above for other data, creating different sector graphs and
layout options.
• Graph the same data as a bar chart by clicking Design and Change Chart Type.
8 8
Frequency
Frequency
6 6
4 4
2 2
0 0
1 2 3 4 5 1 2 3 4 5
Number of phones Number of phones
The ages of the patients at a medical centre in one afternoon were recorded:
37 12 19 65 44 30 1 8 14 36 25 57 21 16
4 30 40 49 38 69 33 55 16 4 8 16 6 28
29 54 43 48 33 18 9 59 21 70 22 33 2 60
a Copy and complete this frequency distribution table for the data.
0–9
10–19
20–29
30–39
40–49
50–59
60-69
70–79
Total
Solution
These are discrete class intervals.
a
0–9 4.5 8
10–19 14.5 7
20–29 24.5 6
30–39 34.5 8
40–49 44.5 5
50–59 54.5 4
60-69 64.5 3
70–79 74.5 1
Total 42
6
Frequency
OR:
6
Frequency
The monthly call costs for a group of 120 mobile phone users were recorded and grouped
into class intervals as shown.
0–< 20 10 6
20–< 40 30 8
40–< 60 50 13
60–< 80 70 17
80–< 100 90 23
Total 120
a When graphing data grouped into continuous class intervals, either the class
boundaries or class centres are shown on the horizontal axis, like this:
Monthly mobile phone costs
25
20
Frequency
15
10
0
0 20 40 60 80 100 120 140 160 180 200
Cost ($)
class boundary
or like this:
5
0
10 30 50 70 90 110 130 150 170 190
Cost ($)
class centre
4 The number of words in each sentence of a magazine article was counted, with the
results shown below.
27 22 15 8 14 7 9 25 15 17 5 24
9 11 22 8 5 15 25 18 10 21 24 13
9 14 18 11 9 23 15 19 10 8 14 17
Frequency
a How many students made 22 calls? 5
4
b How many students were surveyed?
3
c What percentage of students made 2
fewer than 20 calls? Answer correct 1
to one decimal place. 0
fewer 21 23 25 27 29 more
d Can you tell how many students than 20 than 30
Number of phone calls
made 32 calls? Why?
7 The number of kilograms lost by each person attending Julie’s Weight Loss Centre is Example
shown below. 13
1.8 2.5 4.3 6.5 2.7 4.6 11.0 10.8 0.3 8.2
2.1 3.8 4.4 5.8 1.6 5.9 7.6 9.3 4.8 3.4
12.5 4.6 2.5 6.9 7.5 3.5 4.8 12.2 4.3 3.7
0.0 0.9 2.6 7.8 4.9 7.4 9.8 10.4 2.6 8.2
a Is this data discrete or continuous?
b Complete a frequency table for this data using class intervals 0−< 2, 2−< 4 etc.
c Draw a frequency histogram for this data.
d What is the modal class?
e Use your histogram to comment on the effectiveness of the weight loss program.
a Using class intervals of 160 −< 165, 165 −< 170 etc., complete a frequency table for
this data.
b Draw a frequency polygon for the data.
c What is the modal class interval?
d What is the lowest class interval?
e Which class interval is least common?
nter the data about the cost of mobile phone calls from Example 13 into a
Step 1: E
spreadsheet but add a last row (210, 0) as shown.
This allows you to draw a frequency polygon which touches the x-axis.
Step 2: S
elect B2:B13 the values in the ‘Number of users’ column, as the ‘y-values’.
(Do not select B1 because we will add the titles later.)
Step 3: Click Insert then click Insert Column or Bar Chart and Clustered
Column.
Step 4: Right-click on the graph and click Select Data. Click Horizontal (Category)
Axis Labels then Edit and highlight A2:A13.
Step 5: Click Add Chart Element and Axes Titles Primary Horizontal and
Primary Vertical and enter appropriate axes titles.
Step 6: Right-click on the bar chart and choose Format Data Series. Change Gap
Width to 0%.
Step 7: Click on Chart Title on the graph and enter an appropriate title for the data in
the frequency histogram.
Step 9: Click Insert then click Insert Line or Area Chart and click Line with
Markers.
Step 10: Right-click on the graph and click Select Data. Click Horizontal (Category)
Axis Labels then Edit and highlight A2:A13.
Step 11: Click Add Chart Element and Axes Titles Primary Horizontal and
Primary Vertical and enter appropriate axes titles. Then click Chart Title
and enter an appropriate title for the polygon.
36 37 38 39 40 41 42 °C
Solution
a 10 10 dots in total
b 2 patients There are two dots at 37°C
c 38°C The score with the most dots
d From 37°C to 39°C.
e The outlier temperature is 42°C and a patient with this temperature would be very
hot and ill.
EXAMPLE 15
Stem Leaf
4 37
5 02248
6 1333669 The stem shows the tens digit, and the leaf
shows the units digit: 6 | 1 means 61.
7 578
8 46
9 2
a 92
b
20 The number of values in the ‘Leaf’ column
c
63 This score occurs the most, 3 times
d Number of students who scored over 60 = 13
13
Percentage of students who scored over 60 = × 100% = 65%
20
14
1 A class was surveyed to find out how many hours each student spent on maths
homework each week. The results are shown below.
7 6 8 9 5 10 6 9 9 0 9 8
18 7 5 3 4 9 6 7 8 10 7 8
3 The Bennelong Bears scored the following numbers of goals per match in their
first season.
4 0 2 3 7 1 2 4 5 0 3 5
a Display the data as a dot plot.
b What was the lowest score?
c Comment on any clusters or outliers.
4 A survey was conducted to determine the number of people in each car in a sample of
cars on a road. The results are shown below.
6 3 1 2 3 1 1 3 4 2
5 3 1 1 2 3 4 5 6 4
a Display the data as a dot plot.
b How many cars were surveyed?
c What was the highest score?
d Comment on any clusters or outliers.
5 This dot plot shows the scores (out of 10) of students Words correct
competing in a spelling bee.
a How many students competed in the spelling bee?
b What percentage of students scored more than 8?
c Where were the scores clustered?
d Comment on the general performance of the 3 4 5 6 7 8 9 10
contestants in the spelling bee.
Example 6 This unordered stem-and-leaf plot represents the number of points scored per match by
15 the Sharks in a football season.
Stem Leaf
a Redraw the data as an ordered stem-and-leaf plot.
4 539
b How many matches were played in the season? 5 7208
c What was the Sharks’ highest score for a match? 6 478512
d For what percentage of matches did the Sharks score 7 2930
below 56 points? 8 942
66 64 28 93 47 110 53 68 117 43
72 68 84 103 59 82 78 61 104 79
51 63 112 81 79 94 42 57 83 100
8 This plot shows the times (in seconds) in a slalom ski race.
Stem Leaf
9 1579
10 245668
11 0223445
12 1233379
13 234577
14 369
15 012
9 A nurse at Greenacres Hospital took the following pulse rates (in heartbeats per minute)
of 40 patients.
71 81 63 55 93 52 69 78 84 65
72 80 68 74 85 79 90 84 76 68
58 64 60 97 83 69 74 56 64 89
94 81 63 60 76 72 110 83 90 64
a Draw a stem-and-leaf plot for this data.
b If the average pulse is from 65 to 75 beats per minute inclusive, how many patients
had an average pulse?
c Find the outlier and describe the patient that has this pulse.
d A very fit person has a low resting pulse (say, less than 60). How many patients were
in this category?
Stem Leaf
2 04 10 17 18 26 35 37 76 77
3 10 26 42 66 54 76 85 85 85
4 02 53 62 65 73 75 76 85
5 03 10 24 60 68 92
6 14 29 32 34 35 35 54
7 18 52 63 85
8 12 25 46 72 93
9 38 75
11 At the school athletics carnival, runners recorded the following times (in seconds) in the
100-metre sprint.
12.1 13.6 11.8 18.1 12.0 15.6 13.9 17.2
14.5 18.7 15.7 14.6 16.3 11.6 17.7 15.6
14.6 15.4 12.4 16.5 17.4 14.6 16.8 14.3
a Represent this data on a stem-and-leaf plot using stems of 11, 12 etc.
b How many runners were there?
c What was:
i the best time recorded?
ii the worst time recorded?
d What percentage of students took more than 15 seconds to run the distance?
Answer correct to one decimal place.
e To qualify for the regional carnival, a runner’s time must be less than 13.5 seconds.
What fraction of students qualified for the regional carnival?
A sample of 3000 rugby league fans was surveyed on their main concerns about the
national competition. The results of the survey are shown by the sector graph below.
Fans concerns about rugby league
• Do your homework.
• Learning maths is about mastering a collection of skills.
• You become successful at maths by doing it more, through regular practice and training.
• Aim to achieve a high level of understanding.
• Once you have mastered the maths skills, there is no further learning or reading needed.
• Compared to other subjects, the types of questions asked in maths exams are more
conventional and predictable.
• Test your understanding with revision exercises, practice papers and past exams.
• Develop your exam technique and problem-solving skills.
• Go back to Steps 1 to 3 to improve your study.
This chapter, Collecting and presenting data, looked at the graphing of data, sampling
techniques and the methods of data collection. You should be able to interpret and construct
various graphs and displays, know the different types of data and samples, and be familiar
with statistical terminology.
Make a summary of this topic. Use the chapter outline at the beginning of this chapter
and the mind map below as a guide. Use your own words, symbols, diagrams, boxes and
reminders. Gain a ‘whole picture’ view of the topic and identify any weak areas.
Types of data
Statistical Sampling
graphs techniques
COLLECTING AND
PRESENTING DATA
Exercise 1 This bar graph shows the ten most fatal types of cancer for males in Australia in 2013.
1.01
Types of cancer deaths for males
Stomach
Oesophageal
Colon
Liver
Melanoma
Pancreatic
Colorectal
Prostate
Bowel
Lung
0 1000 2000 3000 4000 5000 6000
5 For each of the numerical data identified in Question 3, classify as discrete (D) or Exercise
continuous (C). 1.03
6 The manufacturers of Burp Cola want to conduct a quality control study to ensure that each Exercise
bottle of drink made does, in fact, contain 1.25 litres. Which type of sample would be most 1.04
appropriate: random, systematic, stratified or self-selected? Give a reason for your answer.
7 a To investigate the types of music to play at the school disco, Tasha surveyed a Exercise
sample of 50 students visiting the canteen. Why might this sample be biased? 1.04
b Suppose Tasha uses a stratified sample instead, based on the following numbers of
school students in each Year group:
Year 7 8 9 10 11 12
Number of students 114 120 114 128 105 96
If she surveys a sample of 50 students, how many Year 10 students should be in the
sample?
8 Explain what is wrong with each survey question below and write a better question each time. Exercise
Exercise
10 A company surveyed its employees on reasons for arriving late to work, to help assist
1.05 people to be more punctual. The results are shown on the Pareto chart.
30 50%
40%
20
30%
20%
10
10%
0 0%
Traffic Child Public Weather Overslept Emergency
care transport
a What are the two main reasons employees give for being late to work?
b What is the least common reason for being late to work?
c How many employees are late due to using public transport?
d What percentage of employees are late due to using public transport?
e What are two possible ways this company might assist its employees to improve
their ability to arrive at work on time?
1.06
58 63 77 82 53 69 65 80 96 105
79 63 52 90 104 85 65 87 68 105
65 87 109 84 62 75 102 78 93 84
68 105 74 59 68 74 88 66 70 62
a Are people’s masses discrete data or continuous data?
b Organise this data into a frequency distribution table with class intervals 50−< 60,
60−< 70 etc.
c Draw a frequency histogram and polygon to represent the data.
d Which class interval had the highest frequency?
e What fraction of skydivers were in the 80–< 90 kg class?
12 The ages of a sample of children at a Wiggles concert are listed below. Exercise
1.07
6 0 4 1 5 6 4 2 8 6 4 4
3 6 5 5 2 5 3 1 5 6 3 3
a Draw a dot plot for this data.
b How many children were in the sample?
c What fraction of children were over 5 years old?
d Comment on any clusters.
13 This stem-and-leaf plot shows the ages of visitors entering Stem Leaf Exercise