Descriptive statistics (Unit 1)
1. One of the major measures of the quality of the service provided by an organization is the speed
with which the organization responds to customer complaints. During a recent year a carpet industry
got 50 complaints concerning carpet installation. The following data represent the number of days
between the receipt of the complaint and the resolution of the complaint.
87 76 67 58 92 59 41 50 90 75
80 81 70 73 69 61 88 46 85 97
50 47 81 87 75 60 65 92 77 71
70 74 53 43 61 89 84 83 70 46
84 76 78 64 69 76 78 67 64 74
a. Form stem and leaf display of the given data set and interpret the result.
b. Classify this data set into six classes of equal width and construct frequency distribution and
relative frequency distribution and percentage distribution table.
c. Construct a box and whisker plot from group data set prepared in (b) and then describe nature
of the distribution of data points.
d. Plot the less than ogive of the percentage distribution and determine the third quartile.
e. Compute the mean and coefficient of the variation from group data prepared in (b).
2. In a survey, it was found that 50 small tea shops bought milk in the following (litters) in a particular
day:
19 16 22 9 22 39 14 23 19 12
24 6 16 18 7 17 20 25 28 18
20 10 24 21 10 7 18 28 24 20
14 23 25 34 22 5 33 23 26 29
36 13 11 11 26 37 30 13 8 15
a) Form stem and leaf display of the given data set and interpret the result.
b) Construct the frequency distribution table by classifying the data into seven classes of equal width and
construct histogram and polygon.
c) Construct a box and whisker plot and describe the shape of the distribution.
d) Compute the mean and standard deviation from group data set prepared in (b).
e) Construct cumulative frequency curve and find the minimum amount of milk bought by top 15% tea
shops.
3. The President of Ocean Airlines is trying to estimate when the Federal Aviation Administration (FAA)
is most likely to rule on the company’s application for a new flight between Charlotte and Nashville.
Assistants to the president have assembled the following waiting times for applications field during the
past year. The data are given in days from the date of application until an FAA ruling.
14 40 13 48 31 40 25 33 62 12
44 34 68 11 33 42 26 55 47 11
29 40 41 30 34 31 64 35 57 63
44 44 17 52 32 36 34 53 41 39
29 22 28 44 51 31 44 28 56 53
a. Arrange above data in ascending order by preparing stem and leaf display.
b. Construct a frequency distribution using 6 intervals of equally spaced. Also construct
histogram and comment on the shape of the distribution.
c. Construct a box and whisker plot from group data set prepared in (b) and then describe nature
of the distribution of data points.
d. Compute the mean and coefficient of the variation from group data prepared in (b).
e. Detect the outlier if any using exploratory data analysis.
4. In a sate, saving banks are permitted to sell a form of life insurance called saving bank life
insurance (SBLI). The approval process consists of underwriting, which includes a review of the
application, a medical information bureau check, possible requests for additional medical
information and medical exams, and a policy compilation stage during which the policy pages are
generated and sent to the bank for delivery. The ability to deliver approved policies to customers
in a timely manner is critical to the profitability of this service to the bank. During a period of one
month, a random sample of 40 approved policies was selected, and the following total processing
times in days were recorded.
73 19 16 64 28 29 31 80 60 56
22 18 45 48 17 71 17 62 17 63
50 51 69 16 17 32 24 42 38 51
68 25 53 77 26 13 38 42 31 56
a. Form stem and leaf display of the given data set and interpret the result.
b. Construct a frequency distribution table using appropriate class interval and calculate the
relative frequency and cumulative relative frequency distribution.
c. Draw a box and whisker plot from group data set prepared in (ii) and then describe nature of
the distribution of data points.
d. Plot the less than ogive of the frequency distribution and determine the third quartile.
e. Compute the mean and coefficient of the variation from group data prepared in (ii).
f. Detect the outlier if any using exploratory data analysis.
5. The following data represent the cost of electricity during July 2018 for a random sample of 50 one-
bedroom apartment in a large city:
Raw Data on Utility Charges ($)
96 171 202 178 147 102 153 197 127 82
157 185 90 116 172 111 148 213 130 165
141 149 206 175 123 128 148 168 109 167
95 163 150 154 130 143 187 166 139 149
108 119 183 151 141 135 191 137 129 158
a) Form stem and leaf display of the given data set and interpret the result.
b) Construct the frequency distribution table by classifying the data into appropriate number of classes.
Around what amount does the monthly electricity cost seem to be concentrated?
c) Construct a box and whisker plot and describe the shape of the distribution.
d) Represent the data by means of less than cumulative frequency curve. Identify third quartile from the
curve and interpret the result.
e) Compute coefficient of variation from the grouped data set prepared in (b).
6. The following data is 45 days of 5 GB broadband internet prepaid card sales in a town by space
communication, a dealer of communication service.
92 85 87 109 115 102 120 112 95
99 80 84 90 119 125 99 124 117
113 110 105 120 89 86 98 108 112
115 115 130 127 114 118 120 103 95
100 81 84 92 118 109 115 138 120
a. Form stem and leaf display of the given data set and interpret the result.
b. Classify the data into six classes of suitable length and construct frequency, relative frequency and
cumulative frequency table.
c. Construct a box and whisker plot and then describe nature of the distribution of data points.
d. Plot the less than cumulative frequency curve of the distribution and determine the third quartile.
e. Compute the mean and standard deviation of the internet card sales of the communication.
f. The mean and standard deviation of the sales of competitive brand by another distributer in the same
town is 111 and 15 respectively. Use appropriate statistical tool and explain your ratings of the two
distributers.
7. The New Mexico State Highway Department is charged with maintaining all state roads in good
condition. One measure of condition is the number of cracks present in each 1000feet of roadway. From
the department’s yearly sample, the following data were obtained:
87 76 67 58 92 59 41 50 90 75
80 81 70 73 69 61 88 46 85 97
50 47 81 87 75 60 65 92 77 71
70 74 53 43 61 89 84 83 70 46
84 76 78 64 69 76 78 67 64 74
a) Prepare a stem-and-leaf display of the given data set and interpret the result.
b) Classify the data set into six classes of equal width and construct the frequency distribution.
c) Prepare a box-and –whisker plot of the grouped data set prepared in (b) and discuss the shape of the
distribution.
d) Plot a cumulative frequency curve and obtain the approximate value of third quartile.
e) Calculate the mean and standard deviation of the grouped data set prepared in (b).
f) Find the outlier observations from above data set if any?
9. A national associations of real states sellers have collected these data on a sample of 130
Salespeople representing their total commission earning annually.
Commission 0-5 5-10 10-15 15-20 20-30 30-40 40-50 50-60
($000)
Frequency 5 9 11 33 37 19 9 7
Construct an Ogive that will help you answer these questions
a. About what proportion of the salespeople earns more than $25000
b. About what does the middle salesperson in the sample earn?
c. Approximately how much could real estate salespersons whose performance was about
25% from the top expect to earn annually.
10. BMT manufactures performance equipment for cars used in various types of racing. It has gathered
the following information on the number of models of engines in different size categories used in the
racing market it serves:
Engine size in cubic Frequency(of models)
inches
100-150 1
150-200 7
200-250 7
250-300 8
300-350 17
350-400 16
400-450 15
450-500 7
Construct a cumulative relative frequency distribution that will help answer these questions
i. Seventy percent of the engine models available are larger than about what size?
ii. What was the approximate middle value in the original data set?
iii. If BMT has designed a fuel injection system that can be used on racing engines up to 400
cubic inches about what percentage of the engine models available will not be able to use
BMT‘s systems ?
11. There are a number of possible measures of sales performance, including how consistent a
salesperson is in meeting established sales goals. The data that follow represent the percentage of
goal met by each of three salespeople over the last 5 years.
Patricia: 88 68 89 92 103
John: 76 88 90 86 79
Frank: 104 88 118 88 123
Which salesperson is the most consistent?
13. The University has decided to test three new kinds of light bulbs. They have three identical
rooms to use in the experiment. Bulb 1 has an average lifetime of 1470 hours and a variance of
156. Bulb 2 has an average lifetime of 1400 hours and a variance of 81. Bulb 3 has an average
lifetime of 1350 hours and a standard deviation of 6 hours. Rank the bulbs in terms of relative
variability. Which was the best bulb?
14. Student ages in the regular daytime M.B.A. program and the evening program of Central
University are described by these two samples:
Regular M.B.A.:23 29 27 22 24 21 25 26 27 24
Evening M.B.A.:27 34 30 29 28 30 34 35 28 29
If homogeneity of the class is a positive factor in learning, use a measure of relative variability to
suggest which of the two groups will be easier to teach.
15. Suppose that a prospective buyer tests bursting pressure of samples of polythene bags received
from two manufactures A and B. The test revealed the following results:
Bursting Pressure(Ibs) 5-10 10-15 15-20 20-25 25-30
Number of bags-A 2 9 29 54 6
Number of bags-B 9 15 30 32 14
Which manufacturer’s bags have the higher average bursting pressure? Which of them is more
uniform in bursting pressure?
16. The New Mexico State Highway Department is charged with maintaining all state roads in good
condition. One measure of condition is the number of cracks present in each 100feet of roadway.
From the department’s yearly sample, the following data were obtained:
4 7 8 9 9 10 11
12 12 13 13 13 13 14
14 14 14 15 15 16 16
16 16 16 17 17 17 18
18 19 19 20
a. Calculate the inter-quartile range
b. Plots the box and whisker and discuss the shape of the above distribution.