DATA ANALYSIS
❖is a process of inspecting, cleansing, transforming and
modeling data with the goal of discovering useful information,
informing conclusion and supporting decision-making,
(Wikipedia)
❖involves sorting through massive amounts of unstructured
information and deriving key insights from it. These insights are
enormously valuable for decision-making at companies of all
sizes.
DATA ANALYSIS PROCESS
1. Data Requirement Gathering
➢ decide what to analyze and how to measure it
2. Data Collection
➢ collect your data based on requirements
3. Data Cleaning
➢ data should be cleaned and error free.
DATA ANALYSIS PROCESS
4. Data Analysis
➢ Use the data analysis tools and software which will
help you understand, interpret, and derive
conclusions based on the requirements
5. Data Interpretation
➢ Express or communicate your data analysis
6. Data Visualization
➢ They often appear in the form of charts and graphs
2 CORE AREAS OF DATA ANALYSIS:
Quantitative Data Qualitative Data
Analysis Methods Analysis Methods
Quantitative Data Analysis Methods
Quantitative data is defined as the value of data in the form of counts
or numbers where each data-set has a unique numerical value
associated with it.
Types of Quantitative Data:
Counter: Count equated with entities.
Measurement of physical objects: Calculating measurement of any
physical thing.
Sensory calculation: Mechanism to naturally “sense” the measured
parameters to create a constant source of information.
Projection of data: Future projection of data can be done using
algorithms and other mathematical analysis tools
Quantification of qualitative entities: Identify numbers to qualitative
information.
Qualitative Data Analysis Methods
Qualitative data analysis is made up of words, observations, images, and
even symbols
Measurement Scales:
✓ Nominal Scale
✓ Ordinal Scale
✓ Interval Scale
✓ Ratio Scale
Measurement Scales:
NOMINAL SCALE numbers serve as “tags” or “labels” only, to identify
or classify an object
Examples:
How would you describe
Could you please select an your behavioral pattern?
option from below to describe E – Extroverted
your hair color. I – Introverted
1 – Black A – Ambivert
2 – Brown
3 – Burgundy What is your gender?
4 – Auburn M- Male
5 – Other F- Female
dichotomous nominal scale
ORDINAL SCALE reports the ranking and ordering of the data
without actually establishing the degree of
variation between them.
Examples:
“How satisfied are you with “How happy are you with
our products?” the customer service?”
1- Totally Satisfied 1- Very Unhappy
2- Satisfied 2- Unhappy
3- Neutral 3- Neutral
4- Dissatisfied 4- Unhappy
5- Totally Dissatisfied 5- Very Unhappy
The Likert scale is a variant of the ordinal scale that is used
to calculate customer or employee satisfaction
The frequency of occurrence – Questions such as “How
frequently do you have to get the phone repaired?”
Very often
Often
Not Often
Not at all
Understanding preferences: If a marketer
conducts surveys to understand which laptop brand do
their respondents do not prefer, they can use the
ordinal scale. Out of the five mentioned laptop brand,
rate the order of preference –
HP _____
Apple _____
Lenovo _____
Dell _____
Acer _____
INTERVAL SCALE scale is a quantitative measurement scale where
there is order, the difference between the two
variables is meaningful and equal, and the
presence of zero is arbitrary.
Examples:
Time is also one of the
most popular interval data
examples measured on an
interval scale where the
values are constant,
known, and measurable.
Temperature
RATIO SCALE helps to understand the ultimate-order, interval,
values, and the true zero characteristic is an essential
factor in calculating ratios.
Examples:
What is your height in feet
and inches?
• Less than 5 feet.
• 5 feet 1 inch – 5 feet 5 inches
• 5 feet 6 inches- 6 feet
• More than 6 feet
How much time do you spend
What is your weight in kgs?
daily watching television?
• Less than 50 kgs
• Less than 2 hours
• 51- 70 kgs
• 3-4 hours
• 71- 90 kgs
• 4-5 hours
• 91-110 kgs
• 5-6 hours
• More than 110 kgs
• More than 6 hours
SUMMARY of Measurement Scales:
POPULATION AND
SAMPLE
POPULATION AND SAMPLE
Population (N) is a collection, or set of individuals, objects, or
measurements whose properties are to be analyzed. It is the totality of
the observation.
Sample (n) is a subset of a population. It is a smaller group representing
the population having identical characteristics from which it was taken.
Slovin’s Formula
Formula: n = sample
𝑵
𝒏= N = population
𝟏 + 𝑵𝒆𝟐 e = marginal error
Marginal error is the probability of committing a mistake
Confidence level is the probability of getting the correct result
SAMPLE (n)
Example: In a population of 5,000 students, how many samples will you
take to get an accurate result of a study?
Solution: N = 5,000 students 𝑵
n=? 𝒏=
𝟏 + 𝑵𝒆𝟐
If e = 1% If e = 5% If e = 10%
5,000 5,000 5,000
𝑛= 𝑛= 𝑛=
1 + (5,000)(0.01)2 1 + (5,000)(0.05)2 1 + (5,000)(0.10)2
𝑛 = 3,333 𝑠𝑡𝑢𝑑𝑒𝑛𝑡𝑠 𝑛 = 370 𝑠𝑡𝑢𝑑𝑒𝑛𝑡𝑠 𝑛 = 98 𝑠𝑡𝑢𝑑𝑒𝑛𝑡𝑠
SAMPLING TECHNIQUES
SIMPLE
RANDOM CONVENIENCE
SAMPLING SAMPLING
CLUSTER PURPOSIVE
SAMPLING SAMPLING
PROBABILITY SAMPLING NON-PROBABILITY
SAMPLING SAMPLING
METHODS
METHOD METHOD
SNOWBALL
SYSTEMATIC SAMPLING
SAMPLING
STRATIFIED QUOTA
RANDOM SAMPLING
SAMPLING
SAMPLING TECHNIQUES
Sampling Techniques Or Sampling Plan is the process of gathering
sampling units from the population.
1. Probability Sampling Technique is a sampling technique wherein each of
the population unit has an equal chance of being included in the sample.
a. Random Sampling is also called “Lottery Sampling” or
“Raffle Sampling”.
Example: electronic raffle system, random picker, etc.
b. Cluster Sampling . The process of randomly selecting intact groups,
not individuals, with the defined population sharing similar
characteristics
c. Systematic Sampling in this technique, elements are numbered
for identification and samples are selected at regular intervals.
Example: If N = 300, e = 10% and n = 75, which of the
300 will be Included in the sample?
𝑁 𝟑𝟎𝟎
Use: 𝑖𝑛𝑡𝑒𝑟𝑣𝑎𝑙 = = =𝟒
𝑛 𝟕𝟓
Therefore, from the population, take the
elements tagged as the 4th, 8th, 12th, 16th,…
d. Stratified Sampling in this technique, the set of interests is divided
into groups or aggregates from which the actual sampling is done.
Example: College # of
computation Sample
If e = 5%, n = 370. Using Students
stratified sampling, how CED 900 900 67
× 370
will you get the 370 5000
CEAT 2,000 2000 148
sample from the × 370
5000
5,000population? CAS 1,000 1000 74
× 370
Use: 5000
CBET 1,100 1100 81
× 370
# 𝒐𝒇 𝒔𝒕𝒖𝒅𝒆𝒏𝒕𝒔 𝒑𝒆𝒓 𝒄𝒐𝒍𝒍𝒆𝒈𝒆 × 𝒏 5000
𝑵 N=5,000 n=370
2. Non-Probability Sampling Technique is a sampling technique wherein
each of the population unit has a n equal chance of being included in the
sample.
a. Convenience Sampling in this technique, the samples are simply
those chosen to respond.
b. Purposive Sampling in this technique, the samples are chosen
based on specific objectives.
c. Snowball Sampling in this technique, the researcher can rely on
his initial respondents to refer him to the next respondents
d. Quota Sampling in this technique, the researcher characterizes
the population based on desired features.
DATA COLLECTION
DATA COLLECTION METHODS:
1. Direct or Interview – researcher prepares a set of questions
and respondents will answer verbally and directly.
2. Indirect or Questionnaire – researcher prepares a well- planned,
written questions.
Useful steps in designing a survey:
➢ Determine the goal of your survey
➢ Identify the sample population
➢ Choose an interviewing method
➢ Decide what questions you will ask in what order, and how to
phrase them.
➢ Conduct the interview and collect the information.
➢ Analyze the results by making graphs and drawing conclusions.
Example of Planning and Conducting Surveys:
We want to construct a survey that shows which sports students at
your school like to play the most.
(a) List the goal of the survey
✓ The goal of the survey is to find the answer to the question:
“Which sports do students at your school like to play the most?”
(b) What population sample should we interview?
✓ A sample of the population would include a random sample of the
student population in your school. A good strategy would be to
randomly select students
FOCUS GROUP DISCUSSION
FGD is an in-depth field method that brings together a
small homogeneous group (usually six to twelve persons) to
discuss topics on a study agenda.
General Principles to consider:
➢ Standardization of questions
➢The number of focus groups conducted
➢Number of participants in a group
➢Level of moderator involvement
3. Registration – document and records
4. Observation – collects information on the characteristics of the
units under study by actual measurement.
5. Experimental Method – makes trials and tests; it is used to
describe any process that generates a set of data
Activity #1
Gform for Activity #1 will be posted in our Gclassroom.