3 Summarizing Data

The document discusses different measures of central tendency including the mean, median, and mode. It provides formulas to calculate each and examples of when each measure is most appropriate based on the distribution and scale of the data. Key measures discussed are the population mean, sample mean, weighted mean, and using the appropriate central tendency for normal vs. non-normal distributions.

Uploaded by

Joevyvamae Torre

Summarizing Data: Central Tendency
Kristoffer Ryan T. Gidaya, PhD, RGC
Learning Objectives
• After reading this chapter, you should be able to:
1. Distinguish between a population mean and sample mean.
2. Calculate and interpret the mean, the median, and the mode.
3. Calculate and interpret the weighted mean for two or more samples with unequal sample
sizes.
4. Identify the characteristics of the mean.
5. Identify an appropriate measure of central tendency for different distributions and scales
of measurement.
6. Compute the mean, the median, and the mode using SPSS.
INTRODUCTION TO CENTRAL TENDENCY
• Suppose before registering for a statistics class, a friend told you that
students in Professor Smith's class earned higher grades on average than
those in Professor Jones's class. On the basis of this information, you
decided to register for Professor Smith's class. You did not need to know all
the individual grades for each student in both classes to make your decision.
Instead, your decision was based on knowledge of a single score, or in this
case, a class average.
• The class average in this example is a measure of central tendency.
• Measures of central tendency are statistical measures for locating a single
score that is most representative or descriptive of all scores in a distribution.
• Measures of central tendency are single values that have a "tendency" to be
near the "center" of a distribution.
• Statistical measures of central tendency ensure that the single score
meaningfully represents a set of data.
• MEAN, MEDIAN & MODE
• Measures of central tendency are stated differently for populations and
samples.
• Calculations of central tendency are largely the same for populations and
samples of data, except for the notation used to represent population size
and sample size.
• The population size is the number of individuals who constitute an entire
group or population. The population size is represented by a capital N.
• The sample size is the number of individuals who constitute a subset of
those selected from a larger population. The sample size is represented by a
lowercase n.
MEASURES OF CENTRAL TENDENCY

Although we use different symbols to represent the number of scores (x)
in a sample (n) versus a population (N), the computation of central
tendency is the same for samples and populations.
The Mean
The formulas for the population mean and
the sample mean are as follows.
• The population mean (μ) is the sum of N scores divided by N:

\[ \mu = \frac{\sum x}{N} \]

• The sample mean (M) is the sum of n scores divided by n:

\[ M = \frac{\sum x}{n} \]

The mean is often referred to as the "balance
point" in a distribution. The balance point is not
always at the exact center of a distribution.
• Remember that the computation of the mean does not change for
samples and populations, just the notation used in the formula.
• To calculate the mean of a sample or a population, we do the same thing:
We sum a set of scores and divide by the number
of scores summed.
Example 1
Example 2
Activity
• A scientist records the following sample of scores (n = 6) : 3, 6, 4, 1, 10, and
12. What is the sample mean of these scores?
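As a minimal sketch of the rule "sum the scores and divide by the number of scores summed," the activity scores can be checked in Python. The function name `mean` and the variable names are illustrative choices, not part of the original slides.

```python
# Scores from the activity above (a sample, so the count is n).
scores = [3, 6, 4, 1, 10, 12]


def mean(values):
    """Sum a set of scores and divide by the number of scores summed."""
    return sum(values) / len(values)


n = len(scores)             # sample size
sample_mean = mean(scores)  # M = (sum of x) / n

print(f"n = {n}, sum = {sum(scores)}, M = {sample_mean}")
# prints: n = 6, sum = 36, M = 6.0
```

The same function works for a population; only the notation changes (N and μ instead of n and M).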
The Weighted Mean
• The weighted mean (denoted Mw) is the combined mean of two or more groups of scores in
which the number of scores in each group is disproportionate or unequal.
• A common application of this in behavioral science is when scores are measured in two
or more samples with unequal sample sizes.
• The term disproportionate refers to the fact that some samples have more scores than
others (the samples are of disproportionate sizes).
The formula for the weighted mean for samples of
unequal size can be expressed as follows:

\[ M_w = \frac{\sum (M \times n)}{\sum n} \]

• M represents the mean of each sample,
and n represents the size of each
sample.
• In this formula, the sample size (n) is
the weight for each mean. Using this
formula, we will compute the
combined mean for two or more
samples of scores in which the
number of scores in each sample is
disproportionate or unequal.
• Notice that the sample size for each group is not the same; more scores were
used to compute the mean for some samples than others. If we ignored sample
size and computed the arithmetic mean of the sample means, we would get 59.0.
To compute the weighted mean, we find the product, M × n, for each sample. This gives us a weight for
the mean of each sample. By adding these products, we arrive at the weighted sum.

Then, we divide the weighted sum by the combined sample size (n), which is computed by adding the
sample sizes in the denominator:
• The weighted mean for these samples is 63.5.
• The weighted mean is larger than the arithmetic mean (63.5 vs. 59.0) because
the larger sample (the sample of obese participants) scored higher on the
fitness measure. Hence, the value of the weighted mean shifted toward the
mean from the larger sample (or the sample with more weight).
• This makes the weighted mean an accurate statistic for computing the mean
for samples with unequal sample sizes.
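The two steps described above (add the M × n products, then divide by the combined sample size) can be sketched in Python. The sample means and sizes below are hypothetical values chosen only for illustration; they are not the data from the slides' example, and the function name `weighted_mean` is likewise an assumption.

```python
def weighted_mean(means, sizes):
    """Combine sample means, weighting each mean by its sample size."""
    # Step 1: weighted sum = sum of (M x n) across samples.
    weighted_sum = sum(m * n for m, n in zip(means, sizes))
    # Step 2: divide by the combined sample size (sum of n).
    return weighted_sum / sum(sizes)


# Hypothetical data: two samples with unequal sizes.
means = [70.0, 80.0]  # M for each sample
sizes = [10, 30]      # n for each sample

arithmetic = sum(means) / len(means)   # ignores sample size
weighted = weighted_mean(means, sizes)

print(arithmetic)  # 75.0
print(weighted)    # 77.5
```

Here the weighted mean (77.5) sits closer to 80 than the arithmetic mean (75.0) does, because the sample with the mean of 80 contributes three times as many scores.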
The Median
• The median is the middle value in a distribution of data listed in numeric order.
• Suppose you measure the following set of scores: 2, 3, 4, 5, 6, 6, and
100.
• The mean of these scores is 18 (add up the seven scores and divide by 7).
• Yet, the score of 100 is an outlier in this data set, which causes the mean value to increase so much that the mean fails to
reflect most of the data.
• The mean can be misleading when a data set has an outlier because the mean
will shift toward the value of that outlier. For this reason, there is a need for alternative measures of
central tendency.
• One measure is the median, which is the middle value in a distribution. The median value represents the
midpoint of a distribution of scores where half the scores in a distribution fall above and
half below its value.
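To make the effect of the outlier concrete, the following sketch compares the mean and the median for the seven scores above, using Python's standard `statistics` module. The variable names are illustrative.

```python
import statistics

# The seven scores from the example; 100 is the outlier.
scores = [2, 3, 4, 5, 6, 6, 100]
without_outlier = [x for x in scores if x != 100]

mean_all = statistics.mean(scores)        # pulled toward the outlier
median_all = statistics.median(scores)    # middle score of the ordered list

print(mean_all, median_all)  # 18 5
print(statistics.median(without_outlier))  # 4.5
```

Dropping the outlier moves the median only from 5 to 4.5, while the mean of 18 is larger than six of the seven scores it is meant to summarize.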
To find the median position, list a set of scores
in numeric order and compute this formula:

\[ \text{Median position} = \frac{n + 1}{2} \]

• Locating the median is a little different for odd- and even-numbered sample
sizes (n).
Activity 1
• When the number of scores in a distribution is odd, order the set of scores
from least to most (or vice versa) and find the middle number. Let us find
the median for each of these lists.
a) 3, 6, 5, 3, 8, 6, 7 (n = 7)
b) 99, 66, 44, 13, 8 (n = 5)
c) 51, 55, 105, 155, 205, 255, 305, 355, 359 (n = 9)
Activity 2
• When the number of scores in a distribution is even, list the scores in
numeric order and then average the middle two scores. Let us find the
median for each of these lists.
• (a) 3, 6, 5, 3, 8, 6 (n = 6)
• (b) 99, 66, 44, 13 (n = 4)
• (c) 55, 105, 155, 205, 255, 305 (n = 6)
• Notice that to find the median, we find the middle score.
• Graphically, the median can be estimated by a cumulative percent distribution.
Because the median is located in the middle of a distribution, it is
approximately at the 50th percentile of a cumulative percent distribution.
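The odd and even cases can be sketched in one Python function built on the median position (n + 1) / 2. This is one possible implementation written for illustration (the function name `median` is an assumption); it is applied to the lists from both activities.

```python
def median(values):
    """Find the median from the 1-based median position (n + 1) / 2."""
    if not values:
        raise ValueError("median requires at least one score")
    ordered = sorted(values)   # list the scores in numeric order
    n = len(ordered)
    position = (n + 1) / 2     # e.g. 4.0 when n = 7, 3.5 when n = 6
    if n % 2 == 1:
        # Odd n: the position is a whole number; take that score.
        return ordered[int(position) - 1]
    # Even n: the position falls between two scores; average them.
    lower = ordered[int(position) - 1]
    upper = ordered[int(position)]
    return (lower + upper) / 2


# Odd-numbered lists (Activity 1)
print(median([3, 6, 5, 3, 8, 6, 7]))                          # 6
print(median([99, 66, 44, 13, 8]))                            # 44
print(median([51, 55, 105, 155, 205, 255, 305, 355, 359]))    # 205

# Even-numbered lists (Activity 2)
print(median([3, 6, 5, 3, 8, 6]))                # 5.5
print(median([99, 66, 44, 13]))                  # 55.0
print(median([55, 105, 155, 205, 255, 305]))     # 180.0
```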
The Mode
• The mode is the value in a data set that occurs most often or most frequently.
• One advantage of the mode is that it is simply a count; no calculations or formulas are
necessary to compute a mode.
• To find the mode, list a set of scores in numeric order and count the score that occurs
most often.
• The following is a list of 20 golfers' scores on a difficult par-4 golf hole: 2, 3,
3, 3, 3, 3, 4, 4, 4, 4, 4, 4, 4, 4, 5, 5, 5, 5, 6, and 7. What score did these golfers
card the most (mode) on this hole?
Table 3.4 lists these scores in a frequency distribution table. From this table, it
is clear that most golfers scored a par 4 on this difficult hole. Therefore, the
mode, or most common score on this hole, was par.
Activity 2
• A researcher recorded the number of symptoms for major depressive
disorder (MDD) expressed in a small sample of 20 "at-risk" participants: 0, 4,
3, 6, 5, 2, 3, 3, 5, 4, 6, 3, 5, 6, 4, 0, 0, 3, 0, and 1. How many symptoms of
MDD did participants in this sample most commonly express?
• First list these scores in numeric order: 0, 0, 0, 0, 1, 2, 3, 3, 3, 3, 3, 4, 4, 4, 5, 5,
5, 6, 6, and 6. In doing so, we find that 3, which occurred five times, is the
mode in this data set. Participants in this "at-risk" sample most often
reported three symptoms of MDD.
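Because the mode is simply a count, it can be sketched with a frequency tally. The helper below is an illustrative assumption (the name `modes` is not from the slides); it returns every value tied for the highest frequency, since a data set can have more than one mode.

```python
from collections import Counter


def modes(values):
    """Return the most frequent value(s) and how often they occur."""
    counts = Counter(values)          # frequency of each score
    highest = max(counts.values())    # largest frequency
    most_common = sorted(v for v, c in counts.items() if c == highest)
    return most_common, highest


# Golf scores from the par-4 example
golf = [2, 3, 3, 3, 3, 3, 4, 4, 4, 4, 4, 4, 4, 4, 5, 5, 5, 5, 6, 7]
# MDD symptom counts from the activity
mdd = [0, 4, 3, 6, 5, 2, 3, 3, 5, 4, 6, 3, 5, 6, 4, 0, 0, 3, 0, 1]

print(modes(golf))  # ([4], 8)
print(modes(mdd))   # ([3], 5)
```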
CHARACTERISTICS OF THE MEAN
CHOOSING AN APPROPRIATE
MEASURE OF CENTRAL TENDENCY
• The choice of which measure to select depends largely on the type of
distribution and the scale of measurement of the data.
Using the Mean to Describe Data

• The mean is typically used to describe data that are normally distributed and
measured on an interval or ratio scale.
Describing Normal Distributions

• The mean is used to describe data that are approximately normally distributed. The normal
distribution is a symmetrical distribution in which scores are similarly distributed above and
below the mean, the median, and the mode at the center of the distribution.
• The general structure and approximate shape of this distribution is shown in Figure 3.2.
• For cases in which the data are normally distributed, the mean is used to summarize the data.
• We could choose to describe a normal distribution with the median or mode, but the mean is
most often used because all scores are included in its calculation (i.e., its value is most reflective
of all the data).
• The normal distribution (also called the symmetrical, Gaussian, or bell-
shaped distribution) is a theoretical distribution in which scores are
symmetrically distributed above and below the mean, the median, and the
mode at the center of the distribution.
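The claim that the mean, the median, and the mode coincide at the center of a normal distribution can be checked with `statistics.NormalDist` from the Python standard library (Python 3.8 or later). The parameters below (a mean of 100 and a standard deviation of 15) are hypothetical values picked for illustration only.

```python
from statistics import NormalDist

# A theoretical normal distribution; mu and sigma are illustrative choices.
dist = NormalDist(mu=100, sigma=15)

# All three measures of central tendency fall at the same point.
print(dist.mean, dist.median, dist.mode)  # 100.0 100.0 100.0

# Symmetry: half of the distribution lies below the center.
print(dist.cdf(dist.mean))  # 0.5
```

Real data are only approximately normal, so the three measures computed from a sample will usually be close to one another rather than identical.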
