DIRT Meee OG u eens TSE
TEXT BOOK OF
UTNE
SPUN TLFirst Published-2005
ISBN 81-7141-953-4
© Author
Published by
DISCOVERY PUBLISHING HOUSE
4831/24, Ansari Road, Prahlad Street,
Darya Ganj, New Delhi-110002 (India)
Phone: 23279245 « Fax: 91-11-23253475
E-mail:
[email protected]
Printed at
Arora Offset Press
Laxmi Nagar, Delhi-92Contents
Pages
Preface
1.__Classification and Tabulation of Data =
Introduction, Meaning of Classification, Formation of A
Discrete Frequency Distribution, Formation of Continuous
Frequency Distribution, Tabulation of Data., Parts of a Table,
General Rules of Tabulation, Considerations in the
Construction of Frequency Distributions.
2. Graphic Presentation of Data 23
Introduction, Significance of Diagrams and Graphs, General
Rule for Constructing Diagrams, Types of Diagrams, Two-
Dimensional Diagrams, Pie Diagram, Graphs, Range Chart,
Graphs of Frequency Distributions, Smoothed Frequency
Curve, Cumulative Frequency Curves or Ogives.
3.__Measure of Central Value 87
Introduction, Average Defined, Types of Averages, Arithmetic
Mean, Merits and Limitations of Arithmetic Mean,
Determination of Median, Quartiles, etc., Graphically, Related
Positional Measure, Calculation of Mode—Continuous Series,
Weighted Geometric Mean, Compound Interest Formula,
Harmonic Mean, Weighted Harmonic Mean, Relationship
Among the Averages, Harmonic Mean.
4. Measures of Dispersion \ 213
Introduction, Mertis and Limitations, The Mean Elevation,
Tchebycheff's Theorem, Mertis and Limitations, The Standard
Deviation, Variance and Standard Deviation Compared, Merits
and Limitations, Correcting Incorrect Values of Mean and1
CLASSIFICATION AND TABULATION
OF DATA
INTRODUCTION
The phrase “classification and tabulation” has been used, classification
is, in effect, only the first step in tabulation, for, in general, items having
common characteristics must be brought together before the data can be
displayed in tabular form.
MEANING OF CLASSIFICATION
Sorting facts on one basis of classification and then on another basis is
called cross classification. This process can be repeated as many times as
there are possible bases of classification. Classification of data is a function
very similar to that of sorting letters in a post-office. It is well known that
the letters collected in a post-office are sorted into different lots on a
geographical basis.
Objects of Classifications
The principal objective of classification data are:
1. To pinpoint the most significant features of the data at a glance.
2. To give prominence to the important information gathered while
dropping out the unnecessary elements.
3. To enable a statistical treatment of the material collected.
To condense the mass of data in such a manner that similarities and
dissimilarities can be readily apprehended. Millions of figures can thus
be arranged in a few classes having common features.
5. To facilitate comparison.
Types of Classification
The data can be classified on the following four basis :
1. Qualitative, ie, according to some attributes.nN
Textbook of Elementry Statistics
2. Quantitative, ie, in terms or magnitudes.
3. Geographical, ie., area-wise, e.g., cities, districts, etc.
4. Chronological, ie, on the basis of time.
1. Qualitative Classification
In qualitative classification data are classified on the basis of some
attribute or quality such as sex, color of hair, literacy, religion, etc.
The type of classification where only two classes are formed is also
called two-fold or dichotomous classification. If instead of forming only two
classes we further divide the data on the basis of some attribute or attributes
so as to form several classes, the classification is known as manifold
classification.
2. Quantitative Classification
Quantitative classification refers to the classification of data according
to some characteristics that can be measured, such as height, weight, etc. For
Ex. the students ofa college may be classified according to weight as follows:
Weight (in Ib.s) No. of Students
90-100 100
100-110 220
100-120 250
120-130 370
130-140 80
140-150 60
Total 1,080
Such a distribution is known as empirical frequency distribution or
simple frequency distribution.
The following are two Ex.s of discrete and continuous frequency
distributions :
No, of Children No, of Families © Weight (Ib.s) No. of persons
0 15 100-110 15
1 20 110-120 20
3 70 120-130 50
4 120 130-140 55
5 270 140-150 15
6 70 150-160 5
Total 575 Total 160
(a) Discrete Frequency Distribution. (b) Continuous Frequency Distribution.co
Classification and Tabulation of Data
3. Geographical Classification
In this type of classification data are classified on the basis of geographical
or locational differences between the various items. For instance, the production
of sugarcane in India may be presented State-wise in the following manner:
Production of Sugarcane for the Year 1976 (Figures imaginary)
Name of State Sugarcane Production (in million tonnes)
Uttar Pradesh 68
Bihar 28
Tamil Nadu 18
Maharashtra. 14
Other States 12
Total 140
4. Chronological Classification
When data are observed over a period of time the type of classification
is known as chronological classification. For Ex., we may present the figures
of population (or production, sales, etc.) as follows :
Population of India from 1921 to 1971
Year Production of india Year Production of India
(in millions tonnes) {in millions tonnes)
1921 348 1951 538
1931 376 1961 576
194] 413 1971 678
FORMATION OF A DISCRETE FREQUENCY DISTRIBUTION
In the formation of discrete frequency distribution, we have just to count
the number of times a particular-value is repeated which is called the frequency
of that class. In order to facilitate counting, prepare a column of “tallies”.
In another column, place all possible values of variables from the lowest to
the highest. Then, put a bar (vertical line) opposite the particular value to
which it relates. To facilitate counting, blocks of five bars are prepared and
some space is left in between each block. We finally count the number of
bars corresponding to each value of the variable and place it in the column
entitled ‘frequency’.
Example:
ina survey of 35 families in a village, the number of children per family
was recorded and the following data obtained :Textbook of Elemeniry Statistics
I 3 2 3 4 5 6
7 2 3 4 5 2 5
8 4 5 10 6 3 2
Fe 6 5 3 7 8
vo D 1 9 5 4 3
Solution:
Frequency Distribution of the Number of Children
Ne. of Children Tallies Frequency
1 | 1
2 i 4
3 tH | 6
4 IH | 6
5 tH 6
6 Ill 3
7 lil 3
8 ll 2
9 | 1
10 ll 2
Total 35
FORMATION OF CONTINUOUS FREQUENCY DISTRIBUTION
The following tectmical terms are important when a continuous frequency
distribution is found.
@ Class Limits : The class limits are the lowest and the highest values
(ii)
that can be included in the class. For Ex., take the class 10-20. The
lowest value of the class is 10 and the highest 20. The two boundaries
ofa class are known as the lower limit and the upper limit of the class.
Class Intervals : The spare of a class, that is the difference between
the Upper and Lower Limit is known as class interval for Ex., in the
class 50-100, the class interval is 100 (i.¢., 100 minus 50). An important
decision while constructing a frequency distribution is about the width
of the class interval. A simple formula to obtain the estimate of
appropriate class interval i.e., i is.
._L-S
i=——
kClassification and Tabulation of Data 5
where, L = largest item,
S = smallest item,
k = the number of classes
(iii) Class Frequency : The number of observations corresponding to a
particular class i known as the frequency of that class or the class
frequency. In the following Ex., the frequency of the class 0-50 is 50
which implies that there are 50 persons having income between Rs.
0-50.
Class Mid-point or Class Mark ; It is the value lying half-way between
the lower and upper class limits of a class-interval, Mid-point of a class is
ascertained as follows :
Upper limit of the class -
Lower limit of the class
2
There are two methods of classifying the data according to class-intervals,
namely:
(i) ‘Executive’ method, and
(ii) ‘Inclusive’ method.
(i) ‘Exclusive’ Method : When the class intervals are so fixed that the
upper limit of one class is the lower limit of the next class it is known
as the ‘exclusive’ method of classification. The following data are
classified on this basis :
Mid — point of a class =
income (Rs.) per day No, of Persons
0-50 50
50-100 100
100-150 200
150-200 150
200-230 40
250-300 10
Total 550
It is clear that the ‘exclusive’ method ensures continuity of data inasmuch
as the upper limit of one class is that lower limit of the next class. Thus,
in the above Ex., there are 50 persons whose income is between Rs. 0-49.99.
A person whose income is Rs. 50 would be included in the class
Rs. 50-100.
A better way of expressing the classes when exclusive method is followed
is :6 Textbook of Elementry Statistics
Income (Rs.) No. of Persons
0 but under 50 50
50 but under 100 100
100 but under 150 ' 200
150 but under 200 150
200 but under 250 40
250 but under 300 10
Total 550
(ii) *Inetusive’ Method. : Under the ‘inclusive’ method of classification,
the upper limit of one class it included in that class itself. The following
Ex. illustrates the method :
Income (Rts.) No. of Persons
0-49 50
50-99 100
100-149 200
150-199 ~ 150
200-249 50.
250-299 10
Totai 560
In the class 100-149 we include persons whose income is between
Rs. 100 and Rs. 149. It the income of a person is exactly Rs. 150 in the next
class.
TABULATION OF DATA
A table is a systematic arrangement of statistical data in columns and
rows. Rows are horizontal arrangements whereas columns are vertical ones.
The purpose of a table is to simplify the presentation and to facilitate
comparisons. The simplification results from the clear-cut and systematic
arrangement, which enables the reader to quickly locate desired information.
Comparison is facilitated bringing related items of information close together.
Role of Tabulation
Tables make it possible for the analyst to present a huge mass of data
in a detailed orderly manner within a minimum of space. Because of this,
tabular presentation is the cornerstone of statistical reporting. The significance
of tabulation will be clear from the following points:
|. Jt simplifies complex data : When data are tabulated all unnecessary
details and repetitions are avoided. Data are presented systematically
in columns and rows. Hence, the reader gets a very clear idea of whatClassification and Tabulation of Data 7
the table presents. There is thus a considerable saving in time taken
in understanding what is represented by the data and all confusion is
avoided. Also a large amount of space is saved because of non-
duplicating of his headings and designations; the description at the top
of a column serves for all the terms beneath it.
It facilitates comparison : Tabulation facilitates comparison. Since a
table is divided into various parts and for each part there are totals
and sub-totals, the relationship between different parts of data can be
studied much more easily with the help of a table than without it.
It reveals patterns : Tabulation reveals patterns within the figures
which cannot be seen in the narrative form. It also facilitates the
summation of the figures if the reader desires to check the totals.
It gives identity to the data ; When the data are arranged in-a table
with a title and number they can be distinctly identified and can be
used as a source reference in the interpretation of a problem.
PARTS OF A TABLE
1.
The main parts of a table in general are following :
1. Table number 2. Title of the table
3. Caption 4, Stub
5. Body of the table 6. Headnote
7. Footnote
Table Number : Each table should be numbered. There are different
practices with regard to the place where this number is to be given.
The number may be given either in the centre at the top above the
title or Inside of the title at the top or in the bottom of the table on
the left-hand side. However if space permits the table number should
be given in the, centre as is shown in the specimen table given.on page.
Where there are many columns, it is also desirable to number each
column so that easy reference to it is possible.
Title of the Table : Every table must be given suitable title. The title
is a description of the contents of the table. A complete title has to
answer the questions what, where and when in that sequence. In other
words:
(i) what precisely are the data in the table (/.e., what categories of
statistical data are shown)?
(ii) when the data occurred (i.¢., the specific time or period covered
by the statistical materials in the table)?Textbook of Elementry Statistics
(iii) where the data occurred (i.¢., the precise geographical, political
or physical area covered)?
The title should be clear, brief and self-explanatory. However,
clarity should not be sacrificed for the sake of brevity. Long titles
cannot be read as promptly as short titles, but at times they may
have to be used for the sake of clarity. The title should be so
worded that it permits one and only one interpretation. It should
be in the form of a series of phrases rather than complete
sentences. Its lettering should be the most prominent of any
lettering on the table.
Caption : Caption refers to the column headings. It explains, what
the column represents. It may consist of one or more column headings.
Under a column heading there may be sub-heads. The caption should
be clearly defined and placed at the middle of the column. If the
different columns are expressed in different units, the units should be
mentioned with the captions. As compared with the main part of the
table the caption should be shown in smaller letters. This helps in
saving space.
Stub : As distinguished from caption, stubs are the designations of
the rows or row headings. They are at ‘the extreme left and perform
the same function for the horizontal rows of numbers in the table as
the column headings do for the vertical columns of numbers. The stubs
are usually wide than column headings but should be kept as narrow
as possible without sacrificing precision and clarity of statements.
Body : The body of the table contains the numerical information. This
is the most vital part of the table. Data presented in the body arranged
according to description are classifications of the captions and stubs.
Headnote : It is a brief explanatory statement applying to all or a
major part of the material in the table, and is placed below the point
centered and enclosed in brackets. It is used to explain certain points
relating to the whole table that have not been included in the title nor
in the captions or stubs. For example, the unit of measurement is
frequently written as a headnote, such as “in thousands” or “in million
tonnes” or “in crores”, etc.
Footnotes : Anything in a table which the reader may find difficult
to understand from the title, captions and stubs should be explained
in footnotes. If footnotes are needed they are placed directly below
the body of the table. Footnotes are used for the following main
purposes:Classification and Tabulation of Data 9
(i) To clarify anything in the table.
(ii) To point out any exceptions as to the basis of arriving at the data,
for example, sales recorded at ‘ex-factory price’ for some of
the entries and at ‘delivered price” for others. Any heterogeneity
in the data recorded must be disclosed to avoid wrong conclusions.
(iii) Any special circumstances affecting the data, for example, strike,
lock-out, fire, etc.,
(iv) To give the source in case of secondary data, The reference to
the source should be complete in itself. For example, if the data
is obtained from some periodical, its name, date of publication,
page number, table number, etc., should be mentioned so that
if the user wishes to check the data from the original source or
considers later data from the same source he will know where
to look for the information.
GENERAL RULES OF TABULATION
“In collection and tabulation common sense is the chief requisite and
experience the chief teacher.” However, the following general considerations
may be kept in view while tabulating data:
1.
The table should suit the size of the paper usually with more rows than
columns. In making a suitable layout it may be necessary to alter the
original design. The alteration often consists In changing the rows to
columns or the other way round. For this reason, it Is desirable to make
a rough draft of the table before the figures are entered In it. Space
must be allowed for reference or any other matter which is to be
Included in the table.
The table should not be overloaded with details. If many characteristics
are to be shown it is not necessary to load them all in one table; rather
a number of tables should be prepared, each table complete in itself
and serving a particular purpose.
Indicate a zero quantity by a zero, and do not use zero to indicate
information which is not available. If it is not available, show this fact
by the letter N.A. or dash (—).
A column entitled ‘miscellaneous’ should be added for data which do
not fit in the classification made.
Be explicit. The expression “etc.” is bad form in a table, since the
reader may not readily discover what it refers to. In fact clarity is the
most Important feature of tabular presentation of any kind of statistical
data.10
6.
Textbook af Elementry Statistics
The arrangement of the table should be logical and Items related to
each other should be placed nearabout and. If possible. In the same
group. Derivative figures such as totals, averages and percentages
should be placed near the original figures, Columns and rows should
be numbered for identification since reference is more easily given
by quoting numbers than the title of the column.
Abbreviations should be avoided especially in titles and headings. For
example “yr’’ should not be used for ‘year’.
In all tables the captions and stubs should be arranged in some systematic
order. It would make the table easier to read and allow more important
items to be emphasised. The arrangement of items basically depends
upon the type of data. However, the principal bases for arranging items
are the following:
(a) Alphabetical, ie, arrangement according to alphabets. The type -
of arrangement is very common in general purpose or reference
tables.
(b) Chronological, i.2., arrangement according to time. This type of
arrangement is of particular value in presenting historical data.
(c) Geographical, £.¢., arrangement of data in certain territorial units
such as countries, cities, districts, etc.
(d) Conventional, or arrangement in a customary order such as men,
women and children or Hindus, Muslims, Sikhs and Christians.
(e) Items may be arranged according to size, i.e, the numerical
importance of the items, the largest items being given first and
the smallest in the last. This arrangement may be reversed, if
necessary.
Where standard classifications have been prepared it is usually desirable
to employ them, as they are superior to hastily constructed individual
classification.
If certain figures are to be emphasized they should be in distinctive
type or in a ‘box’ or circle or between thick lines.
Percentage and ratios should be computed and shown, if necessary.
Frequently, figures in tables become more meaningful if expressed as
percentages or (less often) as ratios. In constructing a table, therefore,
it is important to decide whether or not it can be Improved in this way.
If it can, additional column should be Inserted in the table and the
percentages (or ratios) computed and entered. Such percentages and
ratios are sometimes called derived statistics.Classification and Tabulation of Data i
12.
13.
Figures should be rounded off to avoid unnecessary details in the table
and a footnote to this effect should be given. For example, the figures
may be taken to the nearest rupees and paise be eliminated.
The point of measurement should be clearly defined and given in the
table such as income in rupees or weight in pounds, etc.
CONSIDERATIONS IN THE CONSTRUCTION OF FREQUENCY
DISTRIBUTIONS
It is difficult to lay down any hard and fast rules for constructing a
frequency distribution much depends on the nature of the given data and the
object of classification.
However, the following general considerations may be borne in mind
for ensuring meaningful classification of data :
1.
The starting point, ie., the lower limit of the first class, should either
be zero or 5 or multiple of 5.
The number of classes should preferably be between 5 to 20. However,
there is no rigidity about it. The classes can be more than 20 depending
upon the total number of items in the series and the details required,
but they should not be less than five because in'that case the classification
may not reveal the essential characteristics. The choice of number of
classes basically depends upon:
(a) the number of figures to be classified,
(b) the magnitude of the figures,
(c) the details required, and
(d) case of calculation of further statistical work.
As far as possible one should avoid such values of class-intervals, as
3, 7, 11, 26, 39, etc. Preferably, one should have class-intervals of
either five or multiples of 5 like 10, 20, 25, 100 etc. The reason is
that the human mind is accustomed more to think in terms of certain
multiples of 5, 10 and the like. However, where the data necessitate
a class-intervals of less than 5 it can be any value between | and 4.
The ensure continuity and to get correct class-interval we should,
adopt ‘exclusive’ method of classification. However, were ‘inclusive’
method has been adopted it is necessary to make an adjustment to
determine the correct class-interval and to have continuity. The
adjustment consists of finding the difference between the lower limit
of the second class and the upper limit of the first class, dividing the
difference by two, subtracting the value so obtained from all lower12 Textbook af Elementry Statistics
limits and adding the value to all upper limits. This can be expressed
in the form of a formula as follows :
Lower limit of the 2nd class —
Upper limit of the Ist class
2
How the adjustment is made when data are given by inclusive method
can be seen from the following Ex.s:
Correction iactor =
Weekly Wages No. of Weekly Wages No. of
(in Rs.) Workers (in Rs.) Workers
10-19 5 40-49 8
20-29 10 50-59 2
30-39 15
To adjust the class limits, we take here the difference between 20 and
19, which is one. By dividing it by two we get % or 0.5. This (0.5) is called
the correction factor. Deduct 0.5 from the Jower limits of all classes and add
0.5 to upper limits. The adjusted classes would then be as follows :
Weekly Wages No. of Weekly Wages No. of
(in Rs.) Workers {in Rs.) Workers
9.5-19.5 3 39.5-49.5 8
19.5-29.5 10 49.5-59.5 2
29.5-39.5 15
It should be noted that before adjustment the class-intervals was 9 but
after adjustment, it is 10. Observe another case.
Variable Frequency
5-9.5 8
10-14.5 . 10
15-19.5 2
(10-95) _
The correction factor here is 0.25.
After adjustment the classes will be :
The class-interval now is 5 and not 4.5, Taking a third Ex., if the class
limits are :Classification and Tabulation of Data 13
Variable Frequency
5-9.99 8
10-14.99 10
15-19.99 2
10-999) 0.01
The correction factor would be (0 999) =— = 0.005.
2 2
After adjustment the classes will become :
Variable Frequency
4.995~9.995 8
9.995-14.995 . 10
14.995-19.995 2
5. Wherever possible, it is desirable to use class intervals of equal sizes
because comparisons of frequencies among classes are facilitated and
subsequent calculations from the distribution are simplified. However,
this is not always a practical procedure.
Open-end distribution presents problems of graphing and further analysis.
When the frequency distribution is being employed as the only technique
of presentation, open-end classes do not seriously reduce its usefulness as
long as only a few items fall in these classes. However, use of the distribution
for purposes of further mathematical computation is difficult because a mid-
point value, which can be used to present the class, cannot be determined
for an open-end class.
6. In any frequency distribution the sizes of items or the values are
indicated on the left-hand side and the number of times the items in
those sizes or values have repeated are indicated by frequencies on
the right-hand side opposite to the respective sizes or values.
SOLVED EXAMPLES
Example 1:
Point out the mistakes in the following table drawn to show the distribution
of population, according to sex, age and literacy :
Sex. Oto 25 25 to 50 50 to 75 75 ta 100
Males _ — _ -
Females = — oe a14 Texthook of Elementry Statistics
Solution:
All the characteristics are not revealed in the above table; the characteristic
of literacy has been completely ignored. Even otherwise the table need to
be re-arranged as follows :
Table Showing the Distribution of Population According
to Age, Sex and Literacy
Literates Mliterates Total
Age group M F Total M F Total M F Total
0 to 25 - - = See
25 to 50 —- - = —--—- — SS eS
50 to 75 - - = -- — - - -—
75 to 100 - - — - -—- — —- — =
Total —- —- = — — — —_- — —
Example 2:
Draft a blank table to show the distribution of personnel in a
manufacturing concern according to :
(a) Sex : males and females.
(b) Three grades of salary : below Rs. 300, Rs. 300-500, Rs. 500 and
above.
(c) Two period ; 1978 and 1979
(d) Three age-groups : below 25, 25 and under 40, 40 and over.
Solution:
Table Showing Distribution of Personnel According to Sex, Salary
and Age-Groups for Two Years
Salary Grade
Year Age Below Rs. 300 Rs. 300-500 Rs. 500 & above = Total
Groups M F TotalM F Total M F Total M F Total
Blw25 — — — —— — ~~ ~~ ~~ ~—
25 and
1978 under 40 — — — — — — ~~ ~~ ~~ ~
40 and
above — — — — — — —-_- - -- -
Tal —-— -——-—-— —-— — —-——Classification and Tabulation of Data 15
Below 25 — — — — — — —--—- | Fe
25 and
1979 under 40 — — — — — — —-——- - -- -
40 and
above — — — — — — -——- -—- -—-- —
Tout — — — — — — —_-—_—- —_ —— —
Example 3:
Following figures give the ages of newly married husbands and their
wives in years. Represent the data by a frequency distribution.
Age of husband Age of Wife Age of husband Age of Wife
24 17 25 17
26 18 26 18
27 19 27 19
25 i7 25 19
28 20 27 20
24 ig 26 19
27 ig 25 17
28 ig 26 20
25 i8 26 i7
26 19 26 18
Solution:
Frequency Distribution of the Age of Husbands and Wives
Age of husbands Total
Age of wives 24 25 26 27 28
17 1d) me) 1a) _ _ 5
18 (1) 10g) WG) IW _ 6
19 _— 11) Il (2) II (2) 1(1) 6
20 - - 1 (1) 1(1) 1 (1) 3
Total 2 5 7 4 2 20
Example 4:
Out of a total number of 1,807 women who were interviewed for
employment in a textile factory of Bombay, 512 were from textile areas and
the rest from the non-textile areas. Amongst the married women who belonged
to textile areas, 247 were experienced and 73 inexperienced, while for non-16 Textbook of Elementry Statistics
textile areas, the corresponding figures were 49 and 520. The total number
of inexperienced women 918 were unmarried, and of these the number of
experienced women in the textile and non-textile areas was 154 and 16
respectively, Tabulate.
Solution:
Table Showing the Marital Status of 1,807 Women Residing in
Textile and Non-Textile Areas
Textile Areas Non-textile Areas Total
M U Total M U_ ‘Total M U_ Total
Experienced 247 154 401 49 16 65 296 170 466
Inexperienced 73 38 111 $20 710 = 1,230 593 748 1,341
Total 320-192 512) «$67: 726 «(1,295 889 «(918 «1,807
Example §:
Inasample study about coffee habit in two tons, the following information
were received :
Town A Females were 40%, Total coffee drinkers were 45% and Male
non-coffee drinkers were 20%.
Town B Males were 55%; Males non-coffee drinkers were 30% and
Females coffee drinkers were 15%.
Represent the above data in a tabular form.
Solution:
Table Showing the Coffee Drinking Habit of Town A & B
(in percentage)
Attribute Town A Town B
Males Females Total Males Females Total
Coffee Drinkers 40 5(a) 45 25c} 15 40(e)
Non-coffee Drinkers 20 35(b) 55 30 30(d) 6 0(f)
“Toul sts=<‘“‘é‘ OS!”*”*~«KC*dOSC*é‘“(“ ®SS!SC«MSSSC«*
Figures (a), (b), (c), (d), (e) and (f), are obtained by simple process of
deduction after taking into consideration the given information.
Example 6:
Represent the following information in suitable tabular form with proper
rulings and headings :Classification and Tabulation of Data 17
The Annual Report of the Ishapore Public Library reveals the following
points regarding the reading habits of its members :
Out of the total 3,713 book issued to the members in the month of June
1970, 2,100 were fictions. There were 467 members of the library during
the period and they were classified into five classes A, B, C, D, and E. The
number of members belonging to the first four class were respectively 15,
176, 98 and 129; and the number of fictions issued to them were 103, 1,187,
647 and 58 respectively. Number of books, other than textbooks and fictions,
issued to these four classes of members were respectively 4,390, 217 and 341,
The textbooks were issued only to members belonging to he classes C, D and
E and the number of textbooks issued to them were respectively t3, 317 and
160.
During the same period, 1,246 periodicals were issued. Thee included
396 technical journals of which 36 were issued to members of class B, 45
to class D and 315 to class E.
To members of the classes B, C, D and E the number of other journals
issued were 419, 26, 231 and 99 respectively.
The report, however, showed an increased by 3.9% in the number of
books issued over last month, though was a corresponding decrease by 6.1%
in the number of periodicals and journals issued to members.
Solution:
Table Showing the Reading Habits of People
Numbers of books issued Numbers of periodicals
issued
Class of No.of Fiction Text Other Total Tech- Others Total
members members books than nical
in each fiction
class & text-
books
A 15 103 a 4 107 _ 75 75
B 176 187 — 390 = (1,577 360-419 455
c 98 647 3 217-867 _ 26 26
D 129 58 317 341 716 45 231 276
E 49 105 160 181 446 315 99 = 414
Total 467 1,200 480 1,133. 3,713 396 = 850 1,246
%age increase
(+) _ — _ — +39 _ — -61
decrease (—)
May, 197018 Textbook of Elementry Statistics
The main principles of classification adopted are :
(i) One major group is the class of members.
(ii) Among reading matter, the two major classifications are—Books and
Periodicals.
(iii) The sub-groups under each of the two groups under (ii) above are (a)
fiction, (b) text books, and (c) others under books and (a) technical,
and (b) other under periodicals.
(iv) The number of members belonging to class ‘E’ is derived by deducting
the sum of the members (given) under the four classes A, B, C and
D from the total number of members.
Example 7:
A sample consists of 34 observations recorded correct to the nearest
integer, ranging in value from 201 to 337. If it is decided to use seven classes _
of width 20 integers and to begin the first class at 199.5, find the class limits
and class marks of the seven.classes.
Solution:
Since it is decide to begin with 199.5 and take a class interval of 20,
the first class be 199.5-219.5, the second 219.5-239.5 and so on. The class
mark shall be obtained by adding the lower and upper limits and dividing
it by 2. Thus for the first class, the class mark shall be (199.5 + 21.5)/2 =
209.5. Since class interval is equal throughout the other class marks can be
obtained simply by adding 20 to the preceding class mark. The following
table gives the class limits and class marks of the seven classes :
Class limits Class marks
199.5-219.5 209.5
219.5—239.5 229.5
239.5-259.5 249.5
259.5-279.5 269.5
279,5-299.5 289.5
299,5-319.5 309.5
319,5-339.5 329.5
Example 8:
Present the following information in a suitable tabular form:
In 1965 out of a total of 1,750 workers of a factory, 1,200 were members
of a trade union.Classification and Tabulation of Data 19
The number of women employed was 200, of which 175 did not belong
to a trade union. In 1970 the number of union workers increased to 1,580
of which 1,290 were men. On the other hand, the number of non-union
workers fell down to 208, of which 180 were men.
In 1975, there were 1,800 employees who belonged to a trade union and
50 who did not belong to a trade union. Of all the employees in 1970, 300
were women of whom only 8 did not belong to a trade union.
Solution:
Table Showing the Sex-wise Distribution of Union and Non-union
Members for 1965, 1970 and 1975
1968 1970 1978
Category MF Total M F Total MF Total
Members 1,175 25 1,200 1,290 290 1,580 1,508 292 1,800
Non-members 375 175 550 180 28 208 42 8 50
“Total __'1,580 200 1,750 1,470 318 1,788 1,550 300 1,850.
Example 9:
In a trip organised by a college there were 80 persons, each of whom
paid Rs. 15.50 on an average. There were 60 students each of whom paid
Rs. 16. Members of the teaching staff were charged at a higher rate. The
number of servants was 6 (all males) and they were not charged anything.
The number of ladies was 20% of the total of which one was a lady staff
member.
Solution:
Table Showing the Type of Participants, Sex and Contribution Made
Sex Contribution Total
Types per member Contribution
Participants Males Females Total (Rs.) (Rs.)
Students 45 15 60 16.00 960
Teaching Staff 3 1 4 20.00 280
Servants 6 _— 6 — —-
Total 64 16 80 _ 1,400
Notes: |. Total contribution = Average contribution
* No. of person who joined the trip
= 15.5 = 80 = 1,24020 Textbook of Elementry Statistics
2. Contribution of the staff per head has been obtained by deducting the
contribution of students from the total dividing the difference by the
number of teaching staff, i.e.,
(1240 —(60 = 16) — 1240-960 280
—=Rs.20,
14 14
14
Example 10:
Draw a blank table to present the information regarding the college
students according to :
(a) Faculty — Art, Commerce, Science.
(b) Class — Degree and Pre-University Class.
(c) Sex — Male and Female.
(d) Age — below 20, above 20,
(e) For 2 years — 1978 and 1979.
Solution:
Table Showing the Sex-wise Distribution of the People of Different
Religions, Localities and Age-groups
Degree Classes
Males Females Total
Years Faculties Below Above Total Below Above Total M F Total
20 20 20 20
1978 Art -_ = — - -—- re rT
Commerce — — — — — — —— =
Science -_ = _ —_— SOS rr
Total _- =— _— -— —- Or
pn At —- —- —- — — — —— —~
Commerce — ~— — — — — ~—— —
Science —- - F- rF Or Olrrerer
Total
Pre-University Classes
Males Females Total
Years Faculties Below Above Total Below Above Total M F Total
20 «20 20 «20
jo AN
Commerce — — —~— —~— ~— — —— —
Science - —- — -—- -—- —- —— —
Total —-— —- - --_- ----Classification and Tabulation of Data 21
1979 Art = - =—-— —-— -—- - SOC
Commerce — — S—SSSSaee
Science es
Total ae ae
M = Males, F = Females
Hand tabulation is difficult to use when the field of investigation is vast
and a number of characteristics are studied simultaneously. In such a case
we can make use of machine tabulation.
Example U1:
in certain data, the following four main characteristics with the sub-
characteristics are present :
Main Characteristics Sub-characteristics
Locality Urban Rural
Religion Hindu, Non-Hindu
Sex Males, Females
Age 0-30, 30-60, 60 and over
Prepare a suitable form of table.
Solution:
Table Showing the Sex-wise Distribution of the People of Different
Religions, Localities and Age-groups
Hindus
Age Urban Rural Total
(In years) M F Total M F Total M F Total
0-30 _- =
30-60 _- _ _ — - = _—
60 7 Over - = _— -_- = = =- = _
Total — = _ - = _ _- = _
Non-Hindus
Age Urban Rural Total
(In years) M F_ Total M F Total M F _ Total
0-30 - — _ - — — - = _
30-60 -_- _ -_ _— - = _
60 7 Over - - — —_ — —_ — _
Total —— SF EE22 Textbook of Elementry Statistics
Example 12:
The data given below relate to the heigits and weights of 20 persons,
You are required to form a two-way frequency table with class interval 62"
to 64", 61” to 66", and so on and 115 to 123 Ib, 125 to 135 Ib., ete.
S. No. Weight Height &. No. Weight Height
1 170 70 I] 163 70
21 135 65 12 139 7
3 136 65 13 122 63
4 13? 64 Id 134
i 148 69 15 140 67
6 124 63 16 132 69
7 i? 65 17 120 66
8 128 70 18 148 68
9 143 a 19 129 67
10 129 62 20 152 67
Using standard deviation and its coefficient, state whether there is a
greater variation in height or weight.
Solution:
As per the requirements of the question, the population is to be divided
into five classes according to the heights of the persons included in each
group and six classes according to the weight. There will be thus 5 x 6 =
30 cells.
For tabulating the information in appropriate cells, first, the row to which
the height measurement (say, X) should belong is determined. Afterwards
on a consideration of the weight (say, Y), the column in which it should be
includéd is determined. The tabulation is recorded by tally bars. Thus the
two way table shall be prepared like this.
Two-way Frequency Table Showing Weight and Height of 20 Persons
Weight
in Ibs. (Y)
118-125 125-135 135-145 145-155 155-165 165-175 Total
Height in
inches (X)
62-64 fH (2) 1d) 3
64-66 (a) i @) 4
66-68 La) | dy iQ) 1a 5
68-70 I @) 2) 4
70-72 1a) [dy i) |} 4
Total 4 5 6 3 1 1 202
GRAPHIC PRESENTATION OF DATA
INTRODUCTION
Here we will discussed some method by tabulation and classification of
data with the help of graph.
The most convincing and appealing ways in which statistical results may
be presented is through diagrams and graphs. Evidence of this can be found
in newspapers magazines, journals, advertisements, etc. There are numerous
ways in which statistical data may be displayed pictorially such as different
types of diagrams, graphs and maps. Very often the problem is that of
selecting the best out of several methéds that may be available. This is a
difficult task and requires a great deal of artistic talent and imagination on
the part of the individual or agency engaged in the preparation of diagrams
and graphs. It is not practicable to discuss all the possible forms of charts
here. An attempt is made in this chapter to illustrate some of the major types
of diagrams, graphs and maps frequently used in presenting statistical data.
SIGNIFICANCE OF DIAGRAMS AND GRAPHS
Graphs are very useful due to following reasons:
(a) They facilitate comparison of data relating to different periods of time
of different regions. Diagrams help one in making quick and accurate
comparison of data. They bring out hidden facts and relationship and
can stimulate as well as aid analytical thinking and investigation.
(b) They have a great memorising effect. The impressions created by
diagrams last much longer than those created by the figures presented
in a tabular form.
(c) They are attractive to the eye. Figures are dry but diagrams delight
the eye. For this reason diagrams create greater interest than cold
figures. Thus, while going though journals and newspapers the readers
generally skip over the figures but most of them do look at the
diagrams and graphs. Since diagrams have attraction value, they-.are
very popular in exhibitions, fairs, conferences, board meetings and
public functions.24 Textbook of Eiementry Statistics
(d) It is a fact that as the number and magnitude of figures increases they
become more confusing and their analysis tends to be more strenuous.
Pictorial presentation helps in proper understanding of the data as it
gives an interesting form to it. The old saying A picture is worth
10,000 words is very true. The mind through the eye can more readily
appreciate the significance of figure in the form of pictures than it can
follow the figures themselves.
Comparison of Tabular and Diagrammatic Presentation
Tabulat and diagrammatic presentation have their own usefulness for
particular purposes. Hence, the choice of the form of presentation must be
made with due thought and care. We should kept following point in mind:
(a) Tables contain precise figures whereas diagrams give only an
approximate idea. Exact values can be read from a table.
(b) Graphs and diagrams have a visual appeal and, therefore, prove to be
more impressive to laymen.
(c) More information can be presented in one table than either in one
graph or diagram.
(d) Tables usually require much close reading and are more difficult to
. interpret than diagrams.
Difference Between Diagrams and Graphs
(c) Diagrams are more attractive to the eye and as such are better suited
for publicity and propaganda. They do not add anything to the meaning
of the data.
(a) A diagram is generally constructed on plain paper. In other words, a
graph represents mathematical relationship (though not necessarily
functional) between two variables whereas a diagram does not.
(c) For representing frequency distributions and time series graphs are
more appropriate than diagrams. In fact for presenting frequency
distributions diagrams are rarely used.
GENERAL RULES FOR CONSTRUCTING DIAGRAMS
We use the following rules to construc the diagrams:
1. Footnotes: In order to clarify certain point about the diagram footnote
may be given at the bottom of the diagram.
2. Index: An index illustrating different types of lines or different shades,
colours should be given so that the reader can easily make out the
meaning of the diagram.
3. Neatness and Cleanliness: Diagrams should be absolutely neat and
clean.Graphic Presentation of Data 25
4. Simplicity: Diagrams should be as simple as possible so that the
reader can understand their meaning clearly and easily.
5. Title: Every diagram must be given a suitable title. The title should
convey in as few words as possible the main idea that the diagrams
intend to portray.
6. Proportion between width and height: We should maintain the
proper proportion between heighest and width. If either the height and
width is too short or too long in proportion, the diagram would give
an ugly look. "Graphic Presentation" may be adopted for general use
It is know as "Root-two" that is, a ration of | (short side) to 1.414
(long side). Modifications may no doubt, made to accommodate a
diagram in the space available.
7. Selection of Scale: The scale showing the values should be in even
numbers or in multiples of five or ten ¢.g., 25, 50, 75, or 20, 40, 60.
Odd values like 1. 3. 5, 7 should be avoided. Again no rigid rules can
be laid down about the number of rulings on the amount scale, but _
ordinarily it should not exceed five. The scale should also specify the
size of the unit and what it represents;
TYPES OF DIAGRAMS
Diagrams can be divided in the following ways:
1. Three-dimensional diagrams, ¢.g., cubes, cylinders and spheres.
2. Two-dimensional diagrams, e.g., rectangles, squares and circles.
3. One-dimensional diagrams, e.g. bar diagrams.
4. Pictograms and cartograms.
One-dimensional or Bar Diagranis
A bar is a thick line whose width is shown merely for attention. They
are called one-dimensional because it is only the length of the bar that matters
and not the width. When the number of items is large lines may be drawn
instead of bars to economise space. The special merits of bar diagrams are
the following.
(i) They are readily understood even by those unaccustomed to reading
charts or those who are not chart-minded.
(ii) They possess the outstanding advantage that they are the simplest and
the easiest to make.
(iii) When a large number of items are to be compared they are the only
form that can be used effectively.
To construct the bar kept the following points in mind.
(i) The gap between one bar and another should be uniform throughout.