0% found this document useful (0 votes)
4K views7 pages

Variance and Standard Deviation Exercises

This document provides information and examples on measures of dispersion, including standard deviation and variance, for both ungrouped and grouped data. It includes definitions of key terms like mean, standard deviation, and variance. It then provides 10 exercises calculating these measures of dispersion for various data sets. Exercises include finding standard deviation and variance, determining missing values, and combining data sets to calculate overall mean and standard deviation.

Uploaded by

Teoh Siang Yew
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4K views7 pages

Variance and Standard Deviation Exercises

This document provides information and examples on measures of dispersion, including standard deviation and variance, for both ungrouped and grouped data. It includes definitions of key terms like mean, standard deviation, and variance. It then provides 10 exercises calculating these measures of dispersion for various data sets. Exercises include finding standard deviation and variance, determining missing values, and combining data sets to calculate overall mean and standard deviation.

Uploaded by

Teoh Siang Yew
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

016-411 9575

Form Six Mathematics M & T


Descriptive Statistics (I)
Measures of Dispersion
Ungrouped data
𝑛
Mean 1
𝑥 = ∑ 𝑥𝑙
𝑛
𝑖=𝑙

Standard deviation 2
1 (∑ 𝑥)
𝜎 = √ [∑ 𝑥 2 − ]
𝑛 𝑛
1
OR = √𝑛 [∑(𝑥 − 𝑥)2 ]
OR
∑ 𝑥2
√ − (𝑥)2
𝑛
Variance = (𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛)2
Exercise 1
1. Find the mean and the standard deviation of the following sets of numbers.
a) 2, 4, 5, 6, 8 b) 6, 8, 9, 11 c)11, 14, 17, 23, 29 d) 5,13, 7, 16, 15
e) 4,6 ,2.7, 3.1, 0.5, 6.2 f) 200, 203, 206, 207, 209
2. The mean of the numbers 3, 6, 7, a, 14 is 8. Find the standard deviation of the set of numbers.
3. For a set of 10 numbers ∑ 𝑥 = 290 and ∑ 𝑥 2 = 8469. Find the mean and the variance.
2
4. For a set of 9 numbers ∑(𝑥 − 𝑥) =234. Find the standard deviation of the numbers.
5. For a sets of 9 numbers ∑(𝑥 − 𝑥) = 60 and ∑ 𝑥 2 = 285. Find the mean of the numbers.
6. The numbers 𝑎, 𝑏, 8,5,7 have mean 6 and variance 2. Find the values of 𝑎 and 𝑏, if 𝑎 > 𝑏.
7. Find the mean and standard deviation of the set of integers 1, 2, 3, …, 20.
8. From the information given about each of the following sets of data, work out the missing
values in the table:
n ∑𝑥 ∑ 𝑥2 x s
(a) 63 7623 924800
(b) 152.6 10.9 1.7
(c) 52 57300 33
(d) 18 57 4
9. Calculate the mean and the standard deviation of the four numbers 2, 3, 6, 9.
Two numbers, a and b, are to be added to this set of four numbers, such that the mean is
increased by 1 and the variance is increased by 2.5. Find a and b.
10. For a particular set of data 𝑛 = 100, ∑(𝑥 − 50) = 123.5, ∑(𝑥 − 50)2 = 238.4. Find
the mean and the standard deviation.
11. Find the variance of x if
∑ 𝑓(𝑥 − 100) = 127 ∑ 𝑓(𝑥 − 100)2 = 2593 ∑ 𝑓 = 20
Group data Mean ∑ 𝑓𝑥
𝑥=
∑𝑓

Standard deviation
1 (∑ 𝑓𝑥)2
𝜎=√ 2
[∑ 𝑓𝑥 − ]
∑𝑓 ∑𝑓
∑ 𝑓𝑥 2
OR = √ ∑𝑓
− (𝑥)2
Exercise 2
Find the mean and the standard deviation for each of the following sets of data. Giving your answers,
correct to one decimal places.
a) b)
Interval Frequency Current (xA) No. of fuses
150 – 154 3 0 < 𝑋 ≤ 120 1
155 – 159 7 120 < 𝑋 ≤ 180 9
160 – 164 10 180 < 𝑋 ≤ 240 15
165 – 169 15 240 < 𝑋 ≤ 300 17
170 – 174 25 300 < 𝑋 ≤ 360 13
175 – 179 12 360 < 𝑋 ≤ 600 5
180 – 184 6
185 – 189 2
c) d)
Mark No. of students Time (min) Frequency
0–9 0 0.0 – 2.9 12
10 – 19 0 3.0 – 5.9 33
20 – 29 5 6.0 – 8.9 45
30 – 39 8 9.0 – 11.9 38
40 – 49 16 12.0 – 14.9 19
50 – 59 10 15.0 – 17.9 7
60 – 69 6 18.0 – 20.9 6
70 – 79 4
80 – 89 1
90 – 99 0
e) f)
Range Cumulative frequency x No. of car with width ≤ x meter
𝑥 ≤ 400 000 0 5 0
𝑥 ≤ 450 000 2 8 29
𝑥 ≤ 500 000 15 9 138
𝑥 ≤ 550 000 39 10 192
𝑥 ≤ 600 000 42 12 215
𝑥 ≤ 700 000 48 15 235
𝑥 ≤ 800 000 50 20 240
g) h)
Interval f Interval f
0– 5 -200 0
5– 10 -250 20
10 – 13 -300 33
15 – 18 -350 35
40 – 12 -400 25
15 – 6 -450 14
30 – 6 -600 3
35 – 0
i) (giving your answer, correct to 3 significant figures) j)
Range Frequency Range Cumulative frequency
0 < 𝑤 ≤ 0.000 05 30 0 < 𝑤 ≤ 25 12
0.000 05 < 𝑤 ≤ 0.0001 68 25 < 𝑤 ≤ 50 23
0.0001 < 𝑤 ≤ 0.0002 21 30 < 𝑤 ≤ 35 48
0.0002 < 𝑤 ≤ 0.0003 40 35 < 𝑤 ≤ 45 15
0.0003 < 𝑤 ≤ 0.0005 26 45 < 𝑤 ≤ 60 3
0.0005 < 𝑤 ≤ 0.001 15
𝑤 > 0.001 0
Exercise 3

1. For the set of numbers 3, 6, 7, 9, 10 the mean is 7 and the standard deviation is √6. If each
numebr in the set is increased by 3. Find the new mean and standard deviation. Comment on
your answers.

2. a) Find the mean and the standard deviation of the set of numbers 4, 6, 9, 3, 5, 6, 9.
b) Deduce the mean and the standard deviation of the set of numbers 514, 516, 519, 513, 515,
516, 519
c) Deduce the mean and the standard deviation of the set of numbers 52, 78, 117, 39, 65, 78, 117.

3. Find the mean and the variance of the ordered set of numbers A = {1, 2, 3, 4, 5, 6, 7}
Hence find the mean and the variance of the following ordered sets
B = {4, 5, 6, 7, 8, 9, 10 } ; C = {10, 20, 30, 40, 50, 60, 70} ; D = {13, 23, 33, 43, 53, 63, 73}

4. Show that the standard deviation of the integers 1, 2, 3, 4, 5, 6, 7 is 2. Using this result find
the standard deviation of the numbers.
a) 101, 102, 103, 104, 105, 106, 107 b) 100, 200, 300, 400, 500, 600, 700
Exercise 4
1. The mean height of a sample of 15 boys is 1.38m and the mean height of a sample of 20 girls
is 1.22m. Find the mean height of the combined sample of boys and girls.

2. A class consists of 18 boys and 12 girls. In the meaths test average mark is 56. If the mean
marks are obtained by the boys is 60, determine the mean mark obtained by the girls.

3. A typist spends 30% of her time typing letters, 40% of her time preparing accounts and 30 %
of her time working on legal documents. On average the makes 5 mistakes per hour when
typing letters, 12 per hour working an accounts and 10 per hour with legal documents.
Determine her mean number of istakes per hour.

4. For each of the following sets of data, find the mean and stand deviation of the combined set
a) 𝑛1 = 12, 𝑥1 = 6, 𝑠1 = 2 b) 𝑛1 = 30, 𝑥1 = 27, 𝑠1 = 5.6
𝑛2 = 8, 𝑥2 = 10, 𝑠2 = 3 𝑛2 = 40, 𝑥2 = 33, 𝑠2 = 6.4
c) 𝑛1 = 12, 𝑥1 = 15, 𝑠1 = 2.7
𝑛2 = 15, 𝑥2 = 14, 𝑠2 = 3.1
𝑛3 = 13, 𝑥3 = 12, 𝑠3 = 2.4

5. For a batch of 60 men, the mean weight is 72 kg witg population standard deviation 5 kg. For
a batch of 90 women, the mean weight is 58 kg with populations tandard deviation 7 kg.
Calculate the mean and the population standard deviation of the weights of the combined
batch of 150 people.
6. The number of subject, population mean and standard deviation of three different batches are
as follows:
Batch No. of student Mean Standard deviation
I 50 42 5
II 100 45 8
III 150 50 10
Calculate the population mean and standard deviation of the distribution formed by taking the
three together.

7. The manager of a car showroom monitored the numbers of cars sold during two successive
five-day period. During the first five days the numbers of cars sold per day had mean 1.8 and
variance 0.56. during the next five days the numbers of cars sold per day had mean 2.8 and
variance 1.76. Find the mean and variance of the numbers of cars sold per day during the full
ten days.

8. The table shows the distribution of the number of droplets in a bag of sugar:
No. of droplets 44 45 46 47 48 49 50
No. of bags 28 28 14 15 8 5 2
a) Calculate the population mean and the standard deviation of the number of droplets per bag
in these bags.
b) A second bath of 50 bags is found to have a mean of 47.2 and a population standard
deviation of 1.5. Calculate the population mean and standard deviation of the combined batch
of 150 bags.

9. Sixty boys and forty girls took an oral examination. The population means and standard
deviations of marks obtained by the two groups were as follows:
Mean Standard deviation
Boys 48 12
Girls 53 10
a) Calculate the population mean and standard deviation of the combined group of 100
candidates.
b) One of the candidates was awarded 100 marks and another, 0 marks. If it were decided to
ignore these two marks, calculate the mean and standard deviation of the remaining 98
candidates.

10. The number, population means and standard deviation of three different samples are as
follows:
Sample No. in sample Mean Standard deviation
I 250 45 8
II 350 55 10
III 400 50 9
Calculate the mean and standard deviation of the distribution formed by the three taken
together

11. A newspaper conducted a survey into the sttitudes of teenage girls. One of the questions asked
was ‘What is the ideal age for a girl to have her first baby?’ In reply. The batch of 165 girls from
Hong Kong Island gave a mean of 24.2 years and a population standard deviation of 1.6 years; the
batch of 180 girls from Kowloon gave a mean of 23.8 years and a population standarddeviation of
1.8 years. Subsequently, the overall batch of 500 girls gave a mean of 24.0 years and a population
standard deviation of 1.9 years. Assuming that no girl was consulted twice, calculate the mean and
population standard for the 155 girls from New Territories.
Exercise 5
1. A summary of 30 values of x gave the following information:
∑(𝑥 − 𝑐) = 234, ∑(𝑥 − 𝑐)2 = 1957.5,
i) Find the standard deviation of these values of x.
ii) Given that the mean of these values is 86, find the value of c.

2. A summary of speeds, x kilometers per hour, of 22 cars passing a ceretain point gave the
following information:
∑(𝑥 − 50) = 81.4 and ∑(𝑥 − 50)2 = 671.0
Find the variane of the speeds and hence find the value of ∑ 𝑥 2 .

3. The amounts of money, x dollars, that 24 people had in their pockets are summarized by
∑(𝑥 − 36) = −60 and ∑(𝑥 − 36)2 = 227.76. Find ∑ 𝑥 and ∑ 𝑥 2 .

4. The ages, x years, of 150 cars are summarized by ∑ 𝑥 = 645 and ∑ 𝑥 2 = 8287.5. Find
2
∑(𝑥 − 𝑥) , where 𝑥 denotes the mean of x.

5. The heights, x cm, of a group of young children are summarised by


∑(𝑥 − 100) = 72, ∑(𝑥 − 100)2 = 499.2
The mean height is 104.8cm.
i) Find the number of children in the group.
ii) Find ∑(𝑥 − 104.8)2.

6. The values, x, in a particular set of data are summarized by


∑(𝑥 − 25) = 133, ∑(𝑥 − 25)2 = 3762.
The mean, x, is 28.325.
i) Find the standard deviation of x.
ii) Find ∑ 𝑥 2 .

7. A sample of 36 data values, x, gave ∑( 𝑥 − 45) = −148 and ∑(𝑥 − 45)2 = 3089.
i) Find the mean and standard deviation of the 36 values.
ii) One extra data value of 29 was added to the sample. Find the standard deviation of all 37
values.

8. Red Street Garage has 9 used cars for sale. Fairwheel Garage has 15 used cars for sale. The mean
age of the cars in Red Street Garage is 3.6 years and the standard deviation is 1.925 years. In
Fairwheel Garage, ∑ 𝑥 = 64 and ∑ 𝑥 2 = 352, where x is the age of a car in years.
i) Find the mean age of all 24 cars.
ii) Find the standard deviation of the ages of all 24 cars.

9. Anita made observations of the maximum temperature, 𝑡℃, on 50 days. Her results are by ∑ 𝑡 =
2
910 and ∑(𝑡 − 𝑡) = 876, where 𝑡 denotes the mean of the 50 observations. Calculate the
standed deviation of the observations.

10. Esme noted the test marks, x, of 16 people in a class. She found that ∑ 𝑥 = 824 and that the
standard deviation of x was 6.5.
i) Calculate ∑(𝑥 − 50) and ∑(𝑥 − 50)2,
ii) One person did the test later and her mark was 72. Calculate the new mean and standard
deviation of the marks of all 17 people.

11. Delip measured the speeds, x km per hour, of 70 cars on a road where the speed limit is 60 km per
hour. His results are summarized by ∑(𝑥 − 60) = 245.
i) Calculatethe mean speed of these 70 cars.
His friend Sachim used values of (𝑥 − 50) to calculate the mean.
ii) Find ∑(𝑥 − 50).
iii) The standard deviation of the speeds is 10.6 km per hour. Calculate ∑(𝑥 − 50)2 .

12. The heights, x cm, of a group of 82 children are summarised as follows:


∑(𝑥 − 130) = −287, standard deviation of 𝑥 = 6.9.
i) Find the mean height. Ii) Find ∑(𝑥 − 130)2 .

13. A summary of 24 observations of x gave the following information:


∑(𝑥 − 𝑎) = −73.2 and ∑(𝑥 − 𝑎)2 = 2115.
The mean of these values of values of x is 8.95.
i) Find the value of constant a.
ii) Find the standard deviation of these values of x.

14. The length of time, t minutes, taken to do the crossword in a certain newspaper was observed on
12 occasions. The results are summarized below.
∑(𝑡 − 35) = −15 ∑(𝑡 − 35)2 = 82.23
Calculate the mean and standard deviation of these times taken to do the crossword.

15. The lengths of time in minutes to swim a certain distance by the members of a class of twelve 9-
year-olds and by the members of a class of eight 16-year-olds are shown below.
9-year-olds: 13.0 16.1 16.0 14.4 15.9 15.1 14.2 13.7 16.7 16.4 15.0 13.2
16-year-olds: 14.8 13.0 11.4 11.7 16.5 13.7 12.8 12.9
i) Draw a back-to-back stem-and-leaf diagram to represent the information above.
ii) A new pupil joined the 16-year-old class and swam the distance. The mean time for the class of
nine pupils was now 13.6 minutes. Find the new pupil’s time to swim the distance.

16. A group of 10 married couples and 3 single mean found that the age 𝑥𝑤 of the 10 women was
41.2 years and the standard deviation of the women’s ages was 15.1 years. For the 13 men, the
mean age 𝑥𝑚 was 46.3 years and the standard deviation was 12.7 years.
i) Find the mean age of the whole group of 23 people.
ii) The individual women’s ages are denoted by 𝑥𝑤 and the individual men’s ages by 𝑥𝑚
By first finding ∑ 𝑥𝑤 2 and ∑ 𝑥𝑚2, find the standard deviation for the whole group.

17. The following table shows the results of a survey to find the average daily time, in group of school
children spent in internet chat rooms.
Time per day (t minutes) Frequency
0 ≤ 𝑡 < 10 2
10 ≤ 𝑡 < 20 F
20 ≤ 𝑡 < 40 11
40 ≤ 𝑡 < 80 4
The mean time was calculated to be 27.5 minutes.
i) Form an equation involving f and hence show that the total number of chilren was 26.
ii) Find the standard deviation of these times.

18. The age, x years, of 18 people attending an evening class is summarized by the following
∑ 𝑥 = 745, ∑ 𝑥 2 = 33951.
i) Calculate the mean and the standard deviation of the ages of this group of people.
ii) One person leaves the group and the mean age of the remaining 17 people is exactly 41 years.
Find the age of the person who left and the standard deviation of the ages of 17 people.
19. Two cricket teams kept records of the number of runs scored by their teams in 8 matches. The
score are shown in the following table.
Team A 150 220 77 30 298 118 160 57
Team B 166 142 170 93 111 130 148 86
i) Find the mean and standard deviation of the scores for team A.
The mean and standard deviation for team B are 130.75 and 29.63 respectively.
ii) State with a reason which team has the more consistent scores.

20. In a recent survey, 640 people were asked about the length of time each week that they spent
watching television. The median time was found to be 20 hours, and the lower and upper quartiles
were 15 hours and 35 hours respectively. The least amount of time that anyone spent was 3 hours,
and the greatest amount was 60 hours.
i) On graph paper, show these results using a fully labeled cumulative frequency graph.
ii) Use your graph to estimate how many people watched more than 50 hours of television each
week.

21. A computer can generate random numbers which are either 0 or 2. On a particular computer
generates a set of numbers which consists of 23 zeros and 17 twos. Find the mean and variance of
this set of 40 numbers.

22. i) In a spot check of the speeds 𝑥 𝑘𝑚ℎ−1 of 30 cars on a motorway, the data were summarised by
∑(𝑥 − 110) = −47.2 and ∑(𝑥 − 110)2 = 5460. Calculate the mean and standard deviation of
these speeds.
ii) On another day the mean speed of cars on the motorway was found to be 107.6 𝑘𝑚ℎ−1 and the
standard deviation was 13.8 𝑘𝑚ℎ−1. Assuming these speeds follow a normal distribution and that
the speed limit is 110 𝑘𝑚ℎ−1 , find what proportion of cars exceed the speed limit.

ANSWER
Exercise 1
1. a) 5, 2 b) 8.5, 1.80 c) 18.8, 6.46 d) 10(5/6), 4.10 e) 3.42, 1.911 f) 205, 3.16
2. 3.74 3. 29, 5.9 4.5.10 5. 5 6. 6,4 7. 10.5, 5.77
8. a) 121, 6.19 b) 14,1703.8 c) 1716, 3.59 d) 1026, 58770
9. 5, √7.5; 5, 11 10. 51.235, 0.927 11. 89.3275
Exercise 2
a) 169.6, 79 b) 264, 90.5 c) 48.5, 14.6 d) 8.65, 4.30 e) 535000, 70710
f) 8.90, 2.39 g) 17.1, 8.18 h) 321.9, 68.1 i) 0.000203, 0.000196 j) 36.7, 8.42
Exercise 3
1. 10, √6 2.a) 6, 2.14 b) 516, 2.14 c) 78, 27.8
3. 4, 4; 7, 4; 40, 400; 43, 400 4. a) 2 b) 200
Exercise 4
1.1.29m 2. 50 3. 9.3 4.a) 7.6, 3.14 b) 30.4, 6.76 c) 13.65, 3.02
5. 63.6, 9.2973 6. 47, 9.2466 7. 2.3, 1.41 8.a) 45.7, 1.5969 b) 46.2, 1.7176
9.a) 50, 11.5065 b) 50, 9.1696 10. 50.5, 9.907111. 24.0194, 2.2443
Exercise 5
1. i) 2.1 ii) 78.2 2. 16.8, 63811 3. 804, 27011.76
4. 5514 5.i) 15 ii) 153.6 6. i) 9.11 ii) 35412
7.i) 40.9, 8.30 ii) 8.41 8.i) 4.02 ii) 2.19 9. 18.2, 4.19
10.i) 24, 712 ii) 52.7, 7.94 11.i) 63.5 ii) 945 iii) 20623
12. i) 126.5 ii) 4908.5 13. i) 12 ii) 8.88
14. 33.75, 2.3 15.b) 15.6 16.i) 44.1 ii) 19254.5, 29964.74, 14.0 17. ii) 16.1
18.i) 41.4, 13.2 ii) 48, 13.4
19. i) 139 (138.75), 83.1 ii) team B, smaller standard deviation
20.60-70 for straight lines, 40-70 for curve 21. 0.850, 0.978 22. i) 108, 13.4 ii) 0.431

Common questions

Powered by AI

The overall mean of combined samples can be calculated by taking the weighted average of the means of the subgroups, using their sample sizes as weights. The standard deviation requires calculating the pooled variance, which involves both the variances of the subgroups and their means. It is important to use these concepts to accurately represent the whole group's behavior, respecting the distribution and variability within each subgroup .

Accuracy in calculating the standard deviation of grouped data requires choosing adequate class intervals that appropriately represent the data's distribution, and accurately estimating midpoints for those intervals. Proper handling of frequencies and ensuring the distribution assumptions align with class widths are fundamental. Addressing these factors ensures that the dispersion measure reflects true variability. This method integrates concepts of continuity and central tendency which are crucial for representative analysis .

To calculate the mean of a frequency distribution, multiply each interval's midpoint by its frequency, sum these products, and divide by the total frequency. The variance involves first finding the mean as described, then summing the squared differences between each midpoint and the mean weighted by the frequency, and dividing by the total frequency. Accurate calculations are crucial to ensuring reliable statistical representation, which affects conclusions drawn from data analysis .

Examining both mean and standard deviation is crucial as the mean provides information about the central tendency, while the standard deviation offers insights into variability. Relying solely on the mean can be misleading if the variances differ significantly, as a similar mean could represent very diverse distributions. Inconsistencies might arise in decision-making or interpretations if variability isn't considered, potentially masking or overstating trends or differences within the data .

Population calculations utilize the totality of data without bias correction, while sample statistics involve Bessel's correction (n-1) for standard deviation to mitigate bias in estimating population parameters. These distinctions are vital as they affect precision, generalizability, and reliability of conclusions derived from data analyses in research, policy-making, and business strategies .

A major challenge is avoiding double-counting when samples overlap. One solution involves careful bookkeeping of shared elements and subtracting their influence using weighted averages or revising pooled variances for distinct groups. Understanding how shared variations affect the standard deviation and mean is vital to adjustments ensuring accuracy in representing the true distributions .

To solve for unknown values, one must first set up equations based on the given conditions, such as setting up the equation for the mean and then an equation for the variance or standard deviation if necessary. Solving these equations simultaneously will reveal the unknowns. Understanding this process enhances skills in forming equations from statistical properties, thereby boosting problem-solving abilities in real-world applications .

The standard deviation quantifies the amount of variation or dispersion in a set of values, representing how much the values typically diverge from the mean. Unlike range or quartiles, which are less sensitive to each data point, the standard deviation considers all data points, offering a comprehensive measure. It is preferred because it complements the mean, using the same units, and provides clearer insights in normally distributed data, allowing better predictions and analyses .

First, identify how the transformation affects the data (e.g., scaling vs. shifting). For scaling, multiply the original standard deviation by the absolute value of the scaling factor. Shifts do not affect standard deviation. Past conditions, like the original mean and standard deviation, influence the computation by determining the baseline variations. Logical sequence helps ensure clarity and precision in predictive analytics and corrects for transformation bias .

Adding a constant to each element in a data set increases the mean by that constant but does not affect the standard deviation. The mean is affected because every data point shifts by the same amount, so the overall center of the distribution shifts by that amount. However, standard deviation, which measures how spread out the numbers are relative to the mean, remains unchanged because the distances between the numbers remain the same. This is supported by the properties of the mean and standard deviation formulas as shown in exercise 3, Source 2.

You might also like