Measures of Disperssion and Skewness
Measures of Disperssion and Skewness
360 DECISIONS -|
between diagrams &graphs?
38. Which are thetwo differences
Oct. 89,
Oct. 98 &Apr. 2000 CHAPTER 4
39. What are the utilities of diagrams?
Oct. 91
graphs? MEASURES OF DISPERSION AND SKEWNESS
40. Compare diagrams & Ap. 92
41. What are the advantages of diagrammatic presentation of data? The various measures off central tendency give us one single figure,
the entire data. But the averages alone cannct adequately
that represents
Nov. 92 describe a set of observations, unless all the observations are the same. It
necessary to describe the variability or dispersion of the observations. It
42. State the generalrules for constructing diagrams? Ap. 93
is
ia alwavs found, that in two or more distributions, the central value may be
43. What do youmean bya bar diagram? What are it's special the same., but still there can be wide disparities in the formation of the
merits? distribution. Measures of variation help us in studying the important
characteristic of a distribution, that is the extent to which the itens vary
Nov. 93
from one another and from the central value.
44. What is a bar diagram? Nov. 94 Kafka defines measures of variation as, "the measurement of the
Scatterness of the mass of fiqures in a series about an average, is called
45. When an ogive curve is given, how do you locate median? Nov. 94 measure of variation or dispersion". So it is ciear from the explanation that
dispersion is the spread or scatter of item values from a measure of central
46. Distinguish between diagrams andgraphs? Ap. 97
tendency.
47. What are less than and more than ogives? Ap. 96 The significance of measures of variation can be illustrated with the
help of the following example.
48. What is meant by ogive? What are its uses? Oct. 98
Series A Series B Series C
100 100 1
100 103 3
100 90 5
Total |500 500 500
values differ
arithmetic mean is the same in all cases, but the
widely tromHere,onetheanother. In seriesA, each and veryitemis perfecty represented
by the arithmetic mean, or in other words,, none of the items of series Adeviates
Irom the In series B, the
tems arithmetic
asingleVary,
itembut
isthe
mean and hence there is no dispersion.
variation isbyverytheSmall
repreesented compared
arithmetic
seriesC. in series C, not
meantoand theitems vary widely
Irom one B.In measuring greatercompared
muchamount
to series
another.
only and not t in
In otherwords,
dispersion, We are
in series C, interested is the
dispersionin of variation
Ofthese, first four are mathematical methods and the last is a Discrete Series
one. Accordingto the syllabus
Calculating Range for
first four methods only.
requirements, we have to studygraphithec For calculting
range for discreteseries, frequency column isignored.
values of the Xvariable only. This is explained
calculationis based on the
1. RANGE The tollowing example.
with the help of the
78 96 100 150 230 240 261
It is the simplest measurement of dispersion. The 64
the largest item and the smallest item is known as range. difference
between X 43
105 91 82 61 32 70 88 67
f 100
It is defined as "the difference between the two extreme items of tha
distribution". Inother words, range is the difference between the largest and is based on the X values only. In
As explained earlier, calCulation smallest item is 43
smallest items of the distribution. is 261, and the
this range the largest item
R = L-S = L- S
Range
L =Largest item = 261 - 43 = 218
S = Smallest item L - S
Coefficient of Range =
L +S
The relative measure corresponding to range is called the coefficient 261 - 43
of range, which is obtained by applying the formula.
261 + 43
Coeficient of Aange L+ S 218
a. Calculating Range for Individual Observations 304
= 0.7171
rom the given values, the difference between the largest value and
the smallest value willgive the value of range. It is explained with the help of
the following example. C. Calculating Range for Continuous Series
Problem range is calculated by taking the
In the Continuous series. the lowest class and upper limit of the
Find out the range and coefficient of range from the folowing Uiierence between the lower limit of the the following example.
of
Values of X- 42, 47, 54,64, 72, 94, 96, 45. gnest class. This method is explained with the help
Here L= 96 and
Problem
the following data.
S= 42 balculate Range and coefficient of range from
So R= L-S =96 - 42 Number ofWorkers
Wages (in Rs)
= 54 8
10-20
Coefficient of Range = 20-30
10
Subsituting the values in the eguation, we get 30-40 12
Coefficient ot Range =96 - 42 54 = 0.3913
40-50
8
96 + 42 138
4
50-60
QUANTITATIVE ANALYSIS FOR BUSINESS MEASURES OF DISPERSION AND SKEWNESS
366 DECISIONS -| 367
lowest class = 10 and upper limit of the highest Range is used in industry for the statistical quality control of the
Here lower limit of the
class = 60
manufactured products by the inspection at regular intervals. For
example, when a machine is turning out large number of a particular
60- 10 = 50
So range = L- S= component, a small sample of four or five is taken at half an interval
L - S examine whetherthe process remains constant within the limits
Coeficient of Range = of error.
L +S
EINTERQUARTILE RANGE ANDTHE QUARTILE DEVIATION
Substituting the values in the equation,we get 2. THE
60- 10 Interquartilerange includes the middle 50 percent of the distribution.
of the observations at the lower end and another quarter
Coefficient of Range = 60+10 That is one quarter
at the upper end of the distribution are excluded in
of the observations
50
= 0.7143 computing the interquartile range.
70 In other words, interquartile range represents, the difference between
Merits of Range the third quartile and the first quartile. Symbolicaly.
1. Range is simple to understand. where Q.D. Quartile deviation
Q.D.
2 Range is easy to calculate.
3 Ifone is interested in getting a quick rather than a very acCurate Quartile deviation gives the amount by which the two quartiles differ
picture of variability one may compute range. from the median. In a symmetrical distribution, the two quartiles Q, and Q,
Limitations of Range will be egquidistant from the median. But nearly all distributions of social
1. Range is not based on each and every item of the distribution. Science series are asymmetrical. In an asymmetrical distribution Q,and Q,
are not equidistant from the median. As a result, an asymmetrical distribution
2 Range iS affected by extreme values in the series, because it takes indludes only approximately 50 percent of the observations.
into account only the largest and smallest items.
When quartile deviation is very small, it describes high uniformity or
3. Range cannot be calculated for open-ended classes. Smallvarition of the central 50 per cent items and a high quartile deviation
4. The valuethe of means that the variation among the central items is large.
Range is highly affected by fluctuations of sampling.increases,
range is never stable. As the size of the sample Quartile deviation is an absolute measure of dispersion. The relative
range tends to increase though not in the same ratio. ure Corresponding to this measure is called the coefficient or quartile
5.
Calculation of rrange does not take into account the Composition
of deviation and is calculated as follows,
the series. The calculation is done using the smallest andlargest Q, -Q,
items of the series. So the value of range is considerably changed, Coefficient of Quartile Deviation.=
if any item is added or deleted on either side of extreme valu. Q, + Q,
Uses of Range Coefficient
Ofvariation in of iquartile deviation can be used to Compare the degree
1
Range is prices of
shares
a. different distributions.
and stock, useful in
gold, studying
the variations in the
Computation of Quartile Deviation for rIndividual Observations
2 In weather jewellery etC. difference For computing quartile deviation for individual observations, it is
ornecessary
der. that the data should be arrangeddin
forecasts also, range is the or descending
between the used to determine to varyon
likely t the ascending
a particular day.
minimum and maximum temperature The Q, - Q,
iormula used for Quartile Deviation = 2
ANALYSIS FOR BUSINESS MEASURES OF DISPERSION AND SKEWNESS
368
QUANTITATIVE DECISIONS -| 369
The calculations are explained and made Clear with the hel. Substituting the values in the equation, we get
following examples. 40 - 15
Coefficient of Q.D. = 25 = 0.4545
PROBLEM 40 +15 55
Runs scroed by a batsman in acricket match - 20, 28, 40, 12 20 4e PROBLEM
Q, is calculated using the formula term 8,10, 14, 22, 26, 28, 30, 39, 44, 59, 64
N= the number of observations SOLUTION
The data is given in the ascending order. So in order to calculate Q,.
=2nd term = 15 runs
The formula is term = term
Similarly Q, can be calculated using the formula.
= 3rd term = 14
= term
Similarly Q,can be calculated using the equation
= 3x2 = 6h term = 40 runs term
b. Computation of
Quartile Deviation for Discrete Series.
= 3x11 =33d term = 40
The formula to be used is the same i.e.,
Coefficient of Quartile Deviation = So Q, = Rs. 40
Q.D. =
2
Q,+Q, NowQ.D. can be calculated using the equation
The following examples clearly explains the calculation
Q.D.
PROBLEM 2
Substituting the values in the equation, we get
Compute Quartile Deviation and Coefficient of Quartile Deviation 40- 20 20
= Rs. 10
using the following data. Q.D.
2
Coefficient of quartile deviation can be calculated using the equation,
Wages in Rs. 10 20 30 40 50 60
i.e.,
Number of Workers 4 7 15 8 7 2
30 15
26
b Interquartile range
40
34 Semiiinterquartile range (Quartile deviation), and
C.
50 7
41
43
Coeficient of Quartile deviation of the marks of students
60 2
Marks 40 45 52 53 54 55 56 57 68 70
th Number of
Q,is calculated using the
equation Q, =
term
Students 10 12 14 10 11 12 11
52 14 36 PROBLEM
10-20 12 12
Q, is calculated using the equation,
20-30 19 31
Q, = term, where N = frequency total
30-40 5 36
Substituting the values in the equation,we get 40-50 10 46
106 + J th
50-60 55
26.75hterm
60-70 6 61
52 Marks 70-80 6 67
Similarly Q, can be calculated using the equation
Q, class can be calculated using the equation term
O, = 3 term
Where N -0 frequency total
= 3x 26.75 = 80.25th term = 57 Marks 4
So N
Q, = 4
=6.75h term
Interquartile range is calculated using the equation,
a, -Q, =57 - 52 = 5 marks
b
Semi interquartile range or Quartile deviation isScalculated
usingthe Remember that it is only
equation.
57 - 52
Now Q, class is 20 - 30
) 2 . 5 Marks ,can be calculated using the formula
C formula
Coefficient of Quartile deviation is calculated using the Q, = L+ N
, -Q,57 - 52 - c.f X0
0.0458 4
, +Q, 57 +52 109
BUSINESS MEASURES OF
QUANTITATIVE ANALYSIS FOR DECISIONS -| DISPERSION AND SKEWNESS 375
374
c.f = cumulative frequency of the preceeding class,,i.e., the Where L= Lower limit of the Q, class
class preceeding the quartle class. PROBLEM
f = simple frequency of the quartile class Calculate the
class interval of the quartile class. interquartile range
Substituting the values in the equation we get.
a.
a. Quartile deviation can now be calculated using the equation - 29.5th term
Q.D. = , -Q, 54.72-22.5 Inerefore 20-30 is the Q. class. Q. is calculated using the formula
2 2 N
-c.f
32.22
= 16.11 marks =L+ f
xi
So Q, class is 40-50. Q, can be calculated using the equation; measuring dispersion. An ideal method shouid be based on each and
everyitem.
argument against quartile deviation is that, it is not ameasure
2. Another as it does notshowthe scatter around an averages, rather
x0 of dispersion
f is a distance
on a scale. This means that quartile deviation is not itself
Substituting the values in the equation, we get measured from an average. But it is a positional average. Hence, some
economists and statisticians are of the viewthat quartile deviation is a
measure of partition rather than a measure of dispersion.
88.5 -76
x 10
a, =40 + 24 treatment.
oOuartile deiviation is not amenable to further algebraic both ends.
hecause it ignores the first and last 25 percent items on
= 40 + 5.21
values of central items.
AQuartile deviation is.basically dependent on the affected adversely.
= 45.21 marks
If these values are irregular, then the results are
fluctuations.
a Interquartile range =Q, -Q, 5. Quartile deviation is highly affected by sampling
the distribution of items
= 45.21 - 22.5 6. Lastly quartile deviation is not affected by
the quartiles. That
= 22.71 marks
between Q, and Q, or by the distribution outside
be the same for two dissimilar
Semi interquartile range or Quartile deviation means, the values of the quartiles may
b
series.
45.21 - 22.5 DEVIATION
=11.36 marks 3. THE AVERAGE DEVIATION OR MEAN
2 2
Range and
methods of dispersion discussed above namely
TWo strict sense of the
Coefficient of Quartile Deviation Quartile Deviation are not measures of dispersion in the
C
ignoring signs
Where IDl denotes deviations trom median SOLUTION
Computation of Mean Deviation for the First Set ofValues
Steps in Calculation
X D= X - 400
1 Compute the medianor mean according to the averane
100 300
2 Take deviaions of items from the average ignoring signs and deno 200 200
these deviations by |D|
400
3 Obtain the total of these deviations i.e.. IDI 600 200
ED| where N is the number of 800 400
4 Apply the formula M.D. =
observations. N 1,100
If the deviation is small, the distribution is highly uniform, since th
term = term
more than half of the values are concentrated within a small range around The median is given by
the mean or median according to the average selected.
3rd term = 400
The relative measure corresponding to the mean deviation is called
coefficient of mean deviation and it is obtained by dividing mean deviation Mean deviation can be calculated using the formula;
by the particular average (i.e., either median or mean) used in computing
mean deviation. M.D. = substituting the values, we get
N
That is, if median is used in computing mean deviation, then M.D. = 1,100 = 220
Mean Deviation 5
Coefficient of Mean Deviation Median Computation of Mean Deviation for the Second Set ofValues
If mean is used, then, X |D|= X - 4,000
Mean Deviation
Coefficient of Mean Deviation 3,000 1,400
Mean
4,000 400
observations is
The computation of mean deviation for individual 4,200 200
made clear with the following examples.
4,400
4,600 200
PROBLEM
valuesfor 4,800 400
Calculate mean deviation using median from the giiven 5,800 1,400
two sets of observations. 4,000
100 800
200 400 600 Median ot the series can be calculated using the formula,
X 3,000 4,800 5,800 term, term
4,000 4,2004,400 4,600
4th term
4,400
QUANTITATIVE ANALYSIS
FOR BUSINESS DECISIONS MEASURES OF DISPERSION AND SKEWNESS 381
380
Deviations = (X - 80)
Now mean deviation can be calculated using the eguatios
Mean Deviation =
ED 30 20 15 10 10 20 30 40 160
PROBLEM
Wages in Rs 80 100 120240 11o70 60 80 90 65 50
na survey regarding the number of home appliances they have in
245
Meanhouseholds are shown in the following frequency
deviation about distribution. Compute
SOLUTION median.
In order to calculate median, the data has to be arranged in
the Number of Home
ascending or descending order. If arrangedIin
t the ascending
orderthevalues Appliances 1 2 4 5 6 7
will be:
50, 60, 65, 70, 80, 80, 90, 100, 110, 120,
Frequency 14 21 25 43 51 40 39 12
frequency total. 10 15
Here N is the Runs Scored 5 20 25 30 35 40
Computation of Median 36
Numberof Matches 16 32 44 28 18 12 14
The calculation is made clear with the help of the following example 60-70 65 30 3 18 54
70-80 75 40 9 36
m-A 80-90 85 50 5 5 25
Marks f m D|= f|D| 110 154
10
12 Here A = 35
10-20 4 15
2 12
20-30 25 Mean can be calculated using the equation
10
30-40 10 35 Efd'
X A+
40-50 20 451 10 N
50-60 10 55 1 Substituting the values in the equation, we get
2 12
60-70 6 65 154
3 12 35 +
70-80 4 75 x10
68 110
60
35 + 14 = 49 marks
Here A = 45 Computation of mean devlation
The formula to calculate Mean Deviation is omputation of Mean Deviation from Mean is explained below.
M.D. N Xi, Marks (X) m
D|= m - 49 f
Substitutingthe values in the equation, we get 10-20 15 272
34 8
M.D.
68
x 10 = 11.33 marks 20-30 25 24 10 240
60 30-40 35 14 15 210
PROBLEM
Meanfrom
the 40-50 45 4 25 100
Calculate Mean Deviation and its Coefficient from 30-60 55 6 20 120
following data.
l80-90
60-70 65 16 18 288
Marks 10-20 20-3030-40 40-50 50-60 60-70
|70-80 70-80 75 234
Number of
80-90 85
26
36 5 180
8 10 15 25 20 18
Students 110 1644
QUANTITATIVE ANALYSIS FOR BUSINESS DECISIONS -|
MEASURES OF DISPERSION AND SKEWNESS 399
388
So the |D|is calculated as m- 49. We have not
Here,mean = 49, Mean can be calculated using the equation
the class interval. because it will make the values in
dividedthe values with Efd'
fractions. X=A + XI
Mean Deviation can now be calculated using the fornula
N
Substituting the values in the eugation, we get
M.D. 20
X= 50 + X 20
Rememberthat, here for calculating |D|, we have not 100
divided theuie
Values with the class interval, hence for calculating mean derivation
= 50 + 4
not have to multiply the clasS interval, Substhtutng the values in the eouuatie = Rs.54
for Mean Deviation, we get Computation of Mean Deviation from Mean
1,644 = 14.945 marks
M.D. 3
110
Wages |D= m -54
Coetficient ofMean Deviation can be claculated using the formula;
Mean Deviation 0-20 10 44 10 440
Coefficient Mean Deviation =
Mean 20-40 30 24 16 384
14.95 =0.305 40-60 50 4 30 120
Substituting the values, we get 49 60-80 70 16 32 512
PROBLEM
80-100 90 36 12 432
Calculate Mean Deviation from Mean and Median for the following. 100 1,888
0-20 20-40 40-60 60-80 80-100
Wages (Rs.)
30 32 12 Mean Deviation from Mean can be calculated using the equation
Number ofWorkers 10 16
0-20 10 -40 -2
10
-16 20-40 16 26
20-40 30 -20 -1 16 40-60 30 56
40-60 50) 30
32 60-80 32 88
60-80 70 20 1
32
24 80-100 12 100
80-100 12
90 40 2
100
20 Median classis given by N th
term =
100
= 50" term. This