Collection of Data
Statistics
Class - XI
MS. RIDHIKA LUTHRA
Synopsis
1. Key terms
2. Sources of data
3. Surveys
4. Census or complete Enumeration
5. Sample survey
6. Questionnaire
7. Types of questions
8. Open ended Vs Close ended
9. Mode of data collection
10. Pilot survey
11. Methods of sampling
12. Important agenciesMS. RIDHIKA LUTHRA
Key Terms
Variable: The values which change, such as production of food grains
per annum, temperature of a city, etc. They are represented by the
letters X,Y or Z.
Observation: The value of an variable.
Data: Observations corresponding to different variables.
Statistical Investigation: It means search for information conducted by
using statistical methods.
MS. RIDHIKA LUTHRA
Key Terms
Investigators: The person who
conducts statistical investigation.
Enumerators: A person who helps
investigator in the collection of data.
Respondents: The persons from
whom statistical information is
collected.
Statistical Unit: The items on which
measurements are taken. Example;
weight in kgs.
MS. RIDHIKA LUTHRA
Key Terms
Population or the Universe: it
means totality of the items under
study.
Sample: It refers to a group or
section of the population from
which information is to be
obtained.
Good sample: It is smaller than
the population and is capable of
providing accurate information
about the population at lower
cost and lesser time. MS. RIDHIKA LUTHRA
Sources of Data
1. Primary Data: when the enumerator collect the data by
conducting an enquiry or an investigation. They are based on
first hand information.
For example, you will have to enquire from a large number
of school students, by asking questions from them to collect
the desired information.
2. Secondary Data: When the data have been collected and
processed by some other agency. It is based on second hand
information.
For example, information obtained from publish sources
such as government reports, newspaper.
MS. RIDHIKA LUTHRA
Surveys
Survey is a method of gathering information from
individuals. The surveys are done to describe some
characteristics like price, quality, usefulness and popularity,
etc.
The purpose is to collect data.
On the basis of area covered there are two methods of
survey:
1. Census Survey
2. Sample Survey
MS. RIDHIKA LUTHRA
Census or Complete Enumeration
A survey which includes every element of the population. It
covers every individual unit in the entire population. The
example includes Census of India, which is carried out every ten
years. This surveys are carried for demographic data on birth,
death, literacy.
Advantages Disadvantages
Results are absolutely correct, A lot of time, energy and money is
accurate and reliable required to collect data
Less chances of biasness Suitable for certain specific cases
Data related to each element is Large number of enumerators are
collected required for collecting data
MS. RIDHIKA LUTHRA
Sample Survey
In this a sample from the population is surveyed. The first step is
selecting a sample to identify the population. Than select
Representative Sample, as it is difficult to study entire
population.
Example: Population, research, etc.
Advantages Disadvantages
Economical as only some units are
studied Partial investigation of the universe
Not easy to select a sample which
Not time consuming represent whole population
Less efforts are required as small It is complicated process and
portion is studied difficult
MS. RIDHIKA LUTHRA
Questionnaire
The most common type of instrument used in surveys for
collecting primary data is questionnaire.
While preparing the same the following points are kept in
mind:
• It should not be too long
• The series of questions should move from general to
specific
• The questions should be precise and clear
• The questions should not be ambiguous
• The question should not use double negatives
• The question should not be a leading question
• The question should notMS. indicate
RIDHIKA LUTHRA
alternatives to the
METHODS OF COLLECTING PRIMARY
DATA
Direct personal observation
Indirect oral interview
Information through agencies
Mailed questionnaires
Schedules sent through enumerator
MS. RIDHIKA LUTHRA
Mode of Data Collection
1. Direct Personal Interviews: In this method, The investigator conducts interviews
with the respondents and obtained required information.
The data is collected by the investigator personally, he/she must be a keen
observer, tactful and courteous in behavior.
He asks or cross-examines the informant and collects necessary
information.
It is original in character
MS. RIDHIKA LUTHRA
Personal Interviews
Advantages Disadvantages
High degree of originality Costly and time consuming
Cannot be used where area of study
Information is reliable is large
Easy to administer Highly prone to personal biasness
Elastic method as necessary
adjustments in the set of questions It requires that the investigator is
can be made skilled and trained
MS. RIDHIKA LUTHRA
2. INDIRECT ORAL INVESTIGATION
The investigator approaches the witness or third parties, who
are in touch with the informant.
The enumerator interviews the people, who are directly or
indirectly connected with the problem under the study.
Generally this method is employed by different enquiry
committees and commissions .
The police department generally adopts this method to get clues of
thefts, riots , murders, etc.
MS. RIDHIKA LUTHRA
Merits
It is simple and convenient.
It saves time, money and labor.
It can be used in the investigation of a large area.
Adequate information can be had.
Demerits
The information cannot be relied because of absence of
direct contact.
Interview with an improper man will spoil the results.
In order to get the real position, a sufficient no of people are to
be interviewed
The careless attitude of the informant will affect the degree of
accuracy
MS. RIDHIKA LUTHRA
3. INFORMATION FROM
CORRESPONDANTS
The local agents or correspondents will be appointed, they collect
the information and transmit it to the office or person.
They do according to their own ways and tastes.
This system is adopted by newspapers, agencies, etc., when
information is needed in different fields.
The informants are generally called correspondents.
MS. RIDHIKA LUTHRA
Information through agencies
Merits
Extensive information can be had.
It is the most cheap and economical method.
Speedy information is possible.
It is useful where information is needed regularly.
Demerits
The information may be biased.
Degree of accuracy cannot be maintained.
Uniformity cannot be maintained.
Data may not be original.
MS. RIDHIKA LUTHRA
4. Telephone Interviews: In this method, the investigator asks
questions over the telephone.
Advantages Disadvantages
Cheaper and takes shorter time Obstruct visual reactions.
They allow researcher to assist Excludes the population who are not
respondents by clarifying questions having telephone connection.
Helpful where respondents are
reluctant to answer questions in
personal interview.
MS. RIDHIKA LUTHRA
5. Mailing Questionnaire: In this method, the required data
are collected by mail. The questionnaire is sent to each
respondent by mail with a request to complete and return it
by a given date.
Advantages Disadvantages
Possibility of misunderstanding the
Less expensive questions
Allows access to researchers in
remote areas. Produces low response rate
Allows respondents to take sufficient
time to answers Data are not reliable
MS. RIDHIKA LUTHRA
Mode of Data Collection
MS. RIDHIKA LUTHRA
Pilot Survey/ Pre- Testing
It is a trial survey which helps to test the effectiveness of
the questionnaire on a small group.
Importance:
1. it helps in pre-testing of the questionnaire, so as to
know the shortcomings and drawbacks of the
questions.
2. It also helps in accessing the suitability of questions,
clarity of instructions, performance of enumerators and
cost and time involved.
MS. RIDHIKA LUTHRA
Secondary Data
Secondary data are those data which have been already collected and
analysed by some earlier agency for its own use and later the same
data are used by a different agency.
MS. RIDHIKA LUTHRA
Published sources:
Various governmental, international and local agencies publish
statistical data, and chief among them are:
International publications: They are U.N.O, IM.F etc.
Official publications of Central and State Govt.: Reserve Bank of
India Bulletin, Census of India, Indian Trade Journal, etc.
Semi-Official publications: Semi-Govt. institutions like
Municipal Corporation, District Board, Panchayat, etc. publish
reports.
Publications of Research Institutions: Indian Statistical
Institutions (I.S.I), Indian Council of Agricultural Research
(I.C.A.R) etc. publish the finding of their research programmes.
Journals and Newspapers: Current and important materials on
statistics and socio-economic problems can be obtained from journals
and newspapers like, Economic Times, Commerce, Indian Finance etc.
MS. RIDHIKA LUTHRA
Unpublished sources:
There are various sources of unpublished data. They are the records
maintained by various government and private offices, the researches
carried out by individual research scholars in the universities or
research institutes.
MS. RIDHIKA LUTHRA
Precautions in the use of Secondary
Data:
Before using the secondary data, the investigators should consider the
following factors:
Suitability of the data
Adequacy of the data
Reliability of data
MS. RIDHIKA LUTHRA
Methods Of Sampling
1. Random Sampling: Method where samples are selected at
random. In this method, every individual unit has an equal
chance of being selected. Methods under random sampling:
a) Lottery Method: In this method all the items in the
populations are assigned a distinct number and these are
written on identical piece of paper and put in a bowl.
Samples are selected on random.
b) Table of Random Numbers: In this random numbers are
arranged in rows and column which are selected on
population size.
c) Exit Polls: it is used to predict election results. In this
technique a random sample of voters, who exit from the
polling booths are asked who they voted for.
MS. RIDHIKA LUTHRA
Methods Of Sampling
2. Non-Random Sampling: In this all the units of the
population do not have an equal chance of being selected.
Methods under this are:
a) Judgement/Purposive/Deliberate Sampling: Here
sample units are selected consciously by the investigator
on the basis of his judgement. This method is subject to
personal bias of investigator.
b) Quota Sampling: Here the investigator is allotted definite
quota and he is required to collect the required data from
a specific numbers of unit of each quota.
c) Convenience Sampling: Here the investigator collects the
sample units on the basis of his convenience.
MS. RIDHIKA LUTHRA
Sampling and Non Sampling Errors
Error in statistics is used to denote the difference between the
true value and the estimated value. Errors can be classified as:
1. Sampling Errors: the difference between the actual value of
a parameter of the population( which is not known) and its
estimate( known). It is possible to reduce sampling error by
increasing the size of the sample.
2. Non- Sampling Errors: It includes:
• Errors in Data Acquisition: From recording incorrect
response
• Non-Response: It occurs when interviewer is unable to
contact person listed in the sample
• Sampling Bias: It occurs when in a sampling plan some
members of target population could not included
MS. RIDHIKA LUTHRA
Important Agencies of Secondary Data
1.Census of India: It provides the
most important and complete
demographic record of population.
These are conducted every 10
years. The census official collects
information on various aspects of
population such as sex ratio,
literacy, migration, etc.
Which are used to interpret and
analyse to understand many
economic and social issues in India
and accordingly plans and policies
are formulated.
MS. RIDHIKA LUTHRA
Important Agencies of Secondary Data
2.National Sample Survey
Organisation: it was
established in 1950 under the
Ministry of Finance to
conduct surveys and collect
data on estimates of literacy,
school enrolments,
maternity, PDS, etc to publish
surveys through reports.
NSSO conducts continuous
surveys on various problems
in successive rounds.
MS. RIDHIKA LUTHRA
Thank You!
Lesson by
R Murali