MODULE 2 - Frequency Distribution
MODULE 2 - Frequency Distribution
https://www.tes.com/teaching-resource/review-of-data-averages-graphs-and-an-introduction-to-frequency-distribution-tables-
11831526
Module Introduction
In this module, we present techniques for organizing data into tables and graphs so
that an entire set of scores can be presented in a relatively simple display or illustration.
Module Content
2.1 FREQUENCY DISTRIBUTION AND FREQUENCY DISTRIBUTION TABLE
FREQUENCY DISTRIBUTION
A frequency distribution is an organized tabulation of the number of individuals
located in each category on the scale of measurement. A frequency distribution can be
structured either as a table or as a graph, but in either case, the distribution presents the
same two elements:
1. The set of categories that make up the original measurement scale.
2. A record of the frequency, or number of individuals in each category.
• Frequency (f) - Refers to the number of values that fall in a given interval.
• Midpoint or Class Mark (x) - Serves as the representative of the given interval.
L+U
X=
2
where: L – lower class limit; U – upper class limit
Example: In class interval 20-24, the midpoint is 22.
• Class Boundaries - Refers to the value midway between the upper limit of a certain
interval and the lower limit of the next.
o Example: In the class interval 35-39, the class boundary is 34.5 – 39.5
• Relative frequency (rel.f) / Proportion (p) - Refers to the proportion of time the
score occurs
rel. f = f/N OR p = f/N
where: f – frequency; N – total population
2. Get the value of the range. The range denoted by R, refers to the difference between
the highest value and the lowest value in the distribution.
R=H–L
where: R – range; H – highest value; L – lowest value
5. Construct a frequency distribution table by making class intervals with the lowest
value in the lower limit of the next class interval. Continue adding the class size of
the lower limit of each class interval and subtracting one from the lower limit of
the next class interval.
8. Determine the class boundaries by subtracting 0.5 from every lower limit and
adding 0.5 from every higher limit.
A. Histograms. To construct a histogram, you first list the numerical scores (the categories
of measurement) along the X-axis. Then you draw a bar above each X value so that
a. The height of the bar corresponds to the frequency for that category.
b. For continuous variables, the width of the bar extends to the real limits of the
category. For discrete variables, each bar extends exactly half the distance to the
adjacent category on each side.
For both continuous and discrete variables, each bar in a histogram extends to the
midpoint between adjacent categories. As a result, adjacent bars touch and there are no
spaces or gaps between bars. An example of a histogram is shown in Figures 2.1 and 2.2.
Figure 2.1
An example of a frequency distribution histogram. The same set of quiz scores is presented
in a frequency distribution table and in a histogram.
Figure 2.2
An example of a frequency distribution histogram for grouped data. The same set of
children’s heights is presented in a frequency distribution table and in a histogram.
B. Polygons. The second option for graphing a distribution of numerical scores from an
interval or ratio scale of measurement is called a polygon. To construct a polygon, you
begin by listing the numerical scores (the categories of measurement) along the X-axis.
Then,
a. A dot is centered above each score so that the vertical position of the dot
corresponds to the frequency for the category.
b. A continuous line is drawn from dot to dot to connect the series of dots.
c. The graph is completed by drawing a line down to the X-axis (zero frequency) at
each end of the range of scores. The final lines are usually drawn so that they reach
the X-axis at a point that is one category below the lowest score on the left side and
one category above the highest score on the right side.
A polygon also can be used with data that have been grouped into class intervals. For
a grouped distribution, you position each dot directly above the midpoint of the class
interval. The midpoint can be found by averaging the highest and the lowest scores in the
interval. For example, a class interval that is listed as 20–29 would have a midpoint of 24.5.
An example of a frequency distribution polygon is shown in Figures 2.3 and 2.4.
Figure 2.3
An example of a frequency distribution polygon. The same set of data is presented in a
frequency distribution table and in a polygon.
College of Arts and Sciences - Psychology Department
Instructional Material: PSY 303 Psychological Statistics
2nd Semester, A.Y. 2020-2021 5
Republic of the Philippines
BATANGAS STATE UNIVERSITY JPLPC-Malvar
Malvar, Batangas
Tel. Nos.: (043) 778-2170/ (043) 406-0830 loc. 122
Website Address: http://www.batstate-u.edu.ph
Figure 2.4
An example of a frequency distribution polygon for grouped data. The same set of data is
presented in a frequency distribution table and in a polygon.
Figure 2.5
A bar graph showing the distribution of personality types in a sample of college students.
Because personality type is a discrete variable measured on a nominal scale, the graph is
drawn with space between the bars.
Figure 2.6
Examples of different shapes for distributions.
In 1977, J.W. Tukey presented a technique for organizing data that provides a
simple alternative to a grouped frequency distribution table or graph (Tukey, 1977). This
technique, called a stem and leaf display, requires that each score be separated into two
parts: The first digit (or digits) is called the stem, and the last digit is called the leaf. For
example, X = 85 would be separated into a stem of 8 and a leaf of 5. Similarly, X = 42
would have a stem of 4 and a leaf of 2. To construct a stem and leaf display for a set of
data, the first step is to list all the stems in a column. For the data in Table 2.1, for example,
the lowest scores are in the 30s and the highest scores are in the 90s, so the list of stems
would be.
The next step is to go through the data, one score at a time, and write the leaf for
each score beside its stem. For the data in Table 2.1, the first score is X = 83, so you would
write 3 (the leaf) beside the 8 in the column of stems. This process is continued for the
entire set of scores. The complete stem and leaf display is shown with the original data in
Table 2.1.
Table 2.1
A set of N = 24 scores presented as raw data and organized in a stem and leaf display.
Relative
Frequency Class
Class Interval Tally Class Boundaries <cf >cf Frequency
(f) Mark (x)
(%)
Lower Upper
Limit Limit Lower Upper
Other Assessments:
• Online Recitation – This will incorporate a video conference within online
teaching to give learning a more personal touch. During scheduled brief online
interviews, students can demonstrate their proficiency in most essential topics.
(Google Meet/FB Messenger)
• Online Activities – These are integral part of the course. This may come in various
tasks such as group work, individual activity, research work, extended reading and
the like. This will provide opportunities for the students to transfer the concepts
they have learned in class to a more concrete situation and to equally participate in
class discussion.
Learning Reference
Gravetter, F.J. & Wallnau, L.B. (2016). Statistics for the Behavioral Sciences, 10th Edition;
Boston, MA: Cengage