INTRODUCTION TO SAMPLING
METHODS/THEORY
SAMPLING DISTRIBUTION
...shows the frequency with which
values of statistics are observed
when all the possible random
samples are drawn from a given
population.
POPULATION...
refers to the totality of
observations or elements from a
set of data
SAMPLE...
refers to one or more elements
taken from the population for a
specific purpose
2
Venn Diagram
Sample
Population
U - Research
Parameter versus statistic
Population: the entire Sample: the part
group of individuals in
of the population
which we are interested
but can’t usually assess
we actually
directly. examine and for
which we do have
A parameter is a number data.
describing a A statistic is a
characteristic of the
number describing
population. Parameters
a characteristic of a
are usually unknown.
sample. We often
use a statistic to
EXAMPLE
A teacher conducts a study on the correlation of the
students' entrance examination scores and their
respective academic performance. To ensure the validity
of his findings, he decided to include all the students
who are enrolled for the current school year under a
certain program, hence the entire population.
But because of budget issue and feasibility concerns,
the teacher decided to include only a group of 200
students.
PARAMETER
a numerical measure that
describes the whole population
If all the students in a school are
surveyed about heir heights and an
average of 65 inches was
determined, then 65 in is called
population parameter
STATISTIC
a numerical description of the
sample
65 in is called a sample statistic
when only 50 students out of 230
students are surveyed
Identify the parameter & statistic.
1. The Food and Nutrition Research
Institute of the Dept. of Science &
Technology surveyed 14 million Filipino
adults aged 20 and above and
determined 80% of Filipino adults are at
risk of hypertension.
Parameter: Percentage of adults at risk of
hypertension out of all aged 20 and above
Statistic: 80% obtained from 14 million adult sample
2 KINDS OF SAMPLING
1. Probability Sampling
2. Nonprobability Sampling
PROBABILITY SAMPLING
A probability sampling scheme
is one in which every unit in the
population has a chance of being
selected in the sample, and this
probability can be accurately
determined.
.
PROBABILITY SAMPLING INCLUDES:
Simple Random Sampling,
Systematic Sampling,
Stratified Random Sampling,
Cluster Sampling
Multistage Sampling.
Multiphase sampling
1. SIMPLE RANDOM SAMPLING
• Applicable when population is small,
homogeneous & readily available
• All subsets of the frame are given an
equal probability. Each element of the
frame thus has an equal probability of
selection.
• It provides for greatest number of
possible samples. This is done by
assigning a number to each unit in the
sampling frame.
• A table of random number or lottery
system is used to determine which
units are to be selected.
2. SYSTEMATIC SAMPLING
Systematic sampling relies on arranging
the target population according to some
ordering scheme and then selecting
elements at regular intervals through
that ordered list.
Systematic sampling involves a random
start and then proceeds with the
selection of every kth element from then
onwards. In this case, k=(population
size/sample size).
It is important that the starting point is
not automatically the first in the list, but
is instead randomly chosen from within
the first to the kth element in the list.
A simple example would be to select
every 10th name from the telephone
directory (an 'every 10th' sample, also
referred to as 'sampling with a skip of
10').
3. STRATIFIED SAMPLING
Where population embraces a number of distinct
categories, the frame can be organized into separate
"strata." Each stratum is then sampled as an
independent sub-population, out of which individual
elements can be randomly selected.
Every unit in a stratum has same chance of being
selected.
Using same sampling fraction for all strata ensures
proportionate representation in the sample.
Adequate representation of minority subgroups of
interest can be ensured by stratification & varying
sampling fraction between strata as required.
Draw a sample from each stratum
4. CLUSTER SAMPLING
Cluster sampling is an example of 'two-stage
sampling' .
First stage a sample of areas is chosen;
Second stage a sample of respondents within
those areas is selected.
Population divided into clusters of homogeneous
units, usually based on geographical contiguity.
Sampling units are groups rather than individuals.
A sample of such clusters is then selected.
All units from the selected clusters are studied.
• Identification of clusters
– List all cities, towns, villages & wards of cities with
their population falling in target area under study.
– Calculate cumulative population & divide by 30, this
gives sampling interval.
– Select a random no. less than or equal to sampling
interval having same no. of digits. This forms 1st cluster.
– Random no.+ sampling interval = population of 2nd cluster.
– Second cluster + sampling interval = 4th cluster.
– Last or 30th cluster = 29th cluster + sampling interval
Two types of cluster sampling methods.
One-stage sampling. All of the elements within
selected clusters are included in the sample.
Two-stage sampling. A subset of elements
within selected clusters are randomly selected
for inclusion in the sample.
Difference Between Strata and Clusters
Although strata and clusters are both non-
overlapping subsets of the population, they
differ in several ways.
All strata are represented in the sample; but
only a subset of clusters are in the sample.
With stratified sampling, the best survey
results occur when elements within strata are
internally homogeneous. However, with cluster
sampling, the best results occur when elements
within clusters are internally heterogeneous
What is the difference between
Stratified Sampling and Cluster
Sampling?
In stratified sampling, the
population is divided into
homogeneous groups called strata,
using an attribute of the samples.
Then members from each stratum
are selected, and the number of
samples taken from those strata
is proportional to the presence of
the strata within the population.
What is the difference between
Stratified Sampling and Cluster
Sampling?
In stratified sampling, each
group used (strata) include
homogenous members while, in
cluster sampling, a cluster is
heterogeneous.
5. MULTISTAGE SAMPLING
Complex form of cluster sampling in which two or more levels of
units are embedded one in the other.
First stage, random number of districts chosen in all
states.
Followed by random number of talukas, villages.
Then third stage units will be houses.
All ultimate units (houses, for instance) selected at last step are
surveyed.
This technique, is essentially the process of taking
random samples of preceding random samples.
Not as effective as true random sampling, but
probably solves more of the problems inherent to
random sampling.
An effective strategy because it banks on multiple
randomizations. As such, extremely useful.
Multistage sampling used frequently when a complete
list of all members of the population not exists and is
inappropriate.
Moreover, by avoiding the use of all sample units in all
selected clusters, multistage sampling avoids the
large, and perhaps unnecessary, costs associated with
traditional cluster sampling.
6. MULTI PHASE SAMPLING
Part of the information collected from whole
sample & part from subsample.
In Tb survey MT in all cases – Phase I
X –Ray chest in MT +ve cases – Phase II
Sputum examination in X – Ray +ve cases - Phase
III
Survey by such procedure is less costly, less
laborious & more purposeful
NONPROBABILITY SAMPLING
Any sampling method where some
elements of population have no chance of
selection (these are sometimes referred
to as 'out of coverage'/'undercovered'),
or where the probability of selection
can't be accurately determined.
It involves the selection of elements
based on assumptions regarding the
population of interest.
NON PROBABILITY SAMPLING
Example: We visit every household in a
given street, and interview the first
person to open the door. In any household
with more than one occupant, this is a
nonprobability sample, because some
people are more likely to open the door
(e.g. an unemployed person who spends
most of their time at home is more likely
to answer than an employed housemate
who might be at work when the
interviewer calls) and it's not practical to
calculate these probabilities.
NONPROBABILITY SAMPLING…….
Nonprobability Sampling includes:
Quota Sampling
Convenience Sampling
Purposive or Judgmental Sampling
QUOTA SAMPLING
The population is first segmented into mutually exclusive
sub-groups, just as in stratified sampling.
Then judgment used to select subjects or units from
each segment based on a specified proportion.
For example, an interviewer may be told to sample 200
females and 300 males between the age of 45 and 60.
It is this second step which makes the technique one of
non-probability sampling.
In quota sampling the selection of the sample is non-
random.
For example interviewers might be tempted to interview
those who look most helpful. The problem is that these
samples may be biased because not everyone gets a
chance of selection. This random element is its greatest
weakness and quota versus probability has been a matter
of controversy for many years
CONVENIENCE SAMPLING
Sometimes known as grab or opportunity sampling
or accidental or haphazard sampling.
A type of nonprobability sampling which involves
the sample being drawn from that part of the
population which is close to hand. That is, readily
available and convenient.
For example, if the interviewer was to
conduct a survey at a shopping center early in
the morning on a given day, the people that
he/she could interview would be limited to
those given there at that given time, which
would not represent the views of other
members of society in such an area, if the
survey was to be conducted at different times
of day and several times per week.
CONVENIENCE SAMPLING…….
Use results that are easy to get
33
Judgmental Sampling or Purposive
sampling
-The researcher chooses the
sample based on who they think
would be appropriate for the
study. This is used primarily when
there is a limited number of
people that have expertise in the
area being researched