Sampling
Sampling
Distributions
Statisticians use the word population to refer not only to people but to all
items that have been chosen for study
Statisticians use the word sample to describe a portion chosen from the
population
Mathematically, we can describe samples and populations by using
measures such as the mean, median, mode, Standard deviation etc.
When these terms describe the characteristic of a sample, they are called
Statistics
When they describe the characteristics of a population, they are called
parameters.
Response errors
Response errors result from the data that have been requested,
provided, received or recorded incorrectly. The response errors may
occur because of inefficiencies with the questionnaire, the
interviewer, the respondent or the survey process.
Poor questionnaire design
It is essential that sample survey or census questions are worded
carefully in order to avoid introducing bias. If questions are
misleading or confusing, then the responses may end up being
distorted.
Interview bias
An interviewer can influence how a respondent answers the survey
questions. This may occur when the interviewer is too friendly or
aloof or prompts the respondent. To prevent this, interviewers must
be trained to remain neutral throughout the interview. They must
also pay close attention to the way they ask each question. If an
interviewer changes the way a question is worded, it may impact the
respondent's answer.
Respondent errors
Respondents can also provide incorrect answers. Faulty
recollections, tendencies to exaggerate or underplay events,
and inclinations to give answers that appear more 'socially
desirable' are several reasons why a respondent may provide a
false answer.
Problems with the survey process
Errors can also occur because of a problem with the actual
survey process. Using proxy responses (taking answers from
someone other than the respondent) or lacking control over the
survey procedures are just a few ways of increasing the
possibility for response errors.
Non-response errors
Non-response errors are the result of not having obtained
sufficient answers to survey questions. There are two types of
non-response errors: complete and partial.
Sampling Method
Systematic Sampling
Systematic sampling, sometimes called interval sampling,
means that there is a regular gap, or interval, between each
selection. This method is often used in industry, where an item
is selected for testing from a production line (say, every fifteen
minutes) to ensure that machines and equipment are working
to specification.
For example, the manufacturer might decide to select every
20th item on a production line to test for defects and quality.
This technique requires the first item to be selected at random
as a starting point for testing and, thereafter, every 20th item is
chosen. Another example is, when questioning people in a
sample survey. A market researcher might select every 10th
person who enters a particular store, after selecting a person at
random as a starting point; or interview occupants of every 5th
house in a street, after selecting a house at random as a starting
point.
Stratified Sampling
A general problem with random sampling is that you could, by
chance, miss out a particular group of units in the sample.
However, if you form the population into groups, and a sample
contains units from each group, you can make sure the sample
is representative.
Stratified random sampling is a random sampling method,
which uses the available information relating to population for
designing a more efficient sample. In stratified random
sampling, a given population to be sampled is sub-divided into
number of sub-groups or sub-population known as Strata.
The units of each stratum are homogeneous and differ as
widely as possible between strata
Cluster Sampling
Sometimes it is too expensive to spread a sample across the
population as a whole. Travel costs can become expensive if
interviewers have to survey people from one end of the
country to the other. To reduce costs, statisticians may choose
a cluster sampling technique.
Cluster sampling divides the population into groups or
clusters. A number of clusters are selected randomly to
represent the total population, and then all units within
selected clusters are included in the sample. No units from
non-selected clusters are included in the samplethey are
represented by those from selected clusters. This differs from
stratified sampling, where some units are selected from each
group.
Non-probability Sampling
Convenience sampling
Judgment sampling
Quota sampling
Principles of Sampling
Law of Statistical Regularity
Law of Inertia of Large Numbers.
The law of statistical regularity owes its origin to the
mathematical theory of probability. According to Conner, The
law of statistical regularity lays down that a group of objects
chosen at random from a larger group tends to possess the
characteristics of that large group (universe).
In simple words, the law states that if the sample is drawn
from the population at random, is likely to have the same
characteristics as that of the population.