UNIT 3 Methods of Organizing and Presenting Data
UNIT 3 Methods of Organizing and Presenting Data
You have learned the different ways on how to gather data and the sampling
techniques from which you can choose the one that you will employ in your research.
Now is the time for you to know what to do with the data that you have gathered. It is
essential to organize your data so that you can easily interpret them.
Data may be ungrouped or grouped. Ungrouped data are unsorted or raw data.
This means that the data have not been grouped or classified according to any
characteristic. On the other hand, grouped data are data that have been organized
or grouped.
1. By forming an array
An array is an arrangement of numbers in increasing or decreasing
order.
The three ways of organizing a set of ungrouped data are shown using the
example below.
42, 51, 44, 28, 32, 24, 30, 25, 24, 35, 43, 37, 28,
28, 22, 45, 29, 28, 36, 35, 50, 25, 25, 46, 44
42, 51, 44, 28, 32, 24, 30, 25, 24, 35, 43, 37, 28,
28, 22, 45, 29, 28, 36, 35, 50, 25, 25, 46, 44
In the first value, 42, the digit 4 is the stem, and the digit 2 is the leaf. In the
second value, 51, the digit 5 is the stem, and the digit 1 is the leaf. Continue plotting
all the ages of the employees. After all the ages have been plotted, make another table
and arrange the leaves in increasing order.
Draft: (as the data is given) Final: (after arranging the leaves from
The lowest to the highest)
Table No. 1
Ages of 25 Employees in a Supermarket
Ages Frequency
22 1
24 2
25 3
28 4
29 1
30 1
32 1 Note: You may first form an
35 2 array or a stem-leaf plot so
36 1 that it would be easier to
37 1 construct the frequency
42 1 distribution table.
43 1
44 2
45 1
46 1
50 1
51 1
25
Interpretation of the data may be made in this way.
The table shows that of the twenty-five employees of the supermarket, the
youngest is twenty-two years old while the oldest is fifty-one years old. Most of the
employees are in their twenties, six are in their thirties and their forties, and only two
are in their fifties.
Table No 2
Scores of a Sample of 40 Students in a Biology Test
You have to remember the definition of the following terms that are found in
the frequency distribution table:
Class interval refers to the grouping bounded by the lower limit (LL) and upper limit
(UL).
Class size (c) is the length or width of the class.
Class frequency (f) is the number of observations falling within a class interval.
Class boundaries refer to the true boundaries (true limits) of a class interval
Class
Intervals In this example, 17-21 is the first class interval
(Scores) where 17 is the lower limit and 21 is the upper
17-21 limit. The lower limit of the first class interval is
17+5 22-26 21+5 usually the lowest value in the data. The upper
limit 21 was obtained by counting 5 units (since
22+5 27-31 26+5
c=5) starting from the lower limit 17 (17, 18, 19,
27+5 32-36 31+5
20, 21). To get the succeeding lower limits, just
32+5 37-41 36+5
add 5 which is the class size. Do the same for the
37+5 42-46 41+5 upper limits.
42+5 47-51 46+5
c=5
*21+0.5=21.5
Class mark or class midpoint refers to the representative of the class interval.
5, 6, 7, 8, 9, 10
You have to follow the steps to construct a frequency distribution table. To show
you these steps, let us consider the test scores of 50 students in Statisticsrecorded as
follows:
Table 1
Test Score of 50 Students in Statistics
48 39 55 65 51
79 63 89 29 54
65 58 64 76 90
30 84 50 55 59
69 43 79 44 40
49 50 24 78 71
63 64 73 35 65
58 36 47 86 46
85 74 64 72 54
38 52 33 53 42
Step 1: Determine the Range (R) of the distribution. The range is equal to the
highest score minus the lowest score.
R = 90 - 24
R = 66
Step 2: Determine the class size by dividing the range by the desired number of
classes. (The number of classes must not be too few nor too many. Too many
class intervals may result in classes with zero frequencies.) Let us have ten
classes on this problem. In some cases, the class size is already given.
𝑅𝑎𝑛𝑔𝑒
Class size or class width( c ) = (if the class size is not
numbe𝑟𝑜𝑓𝑐𝑙𝑎𝑠𝑠𝑒𝑠
(Formula 3) exact, round it off to the
66 nearest whole number)
c= = 6.6 ≈ 7
10
Step 3: Unless otherwise specified, always start the lowest class limit by the lowest
value of the given data (raw data). For the second lower limit, just add the
class size and then continue to add the class size to this lower limit to get the
rest of the lower limits. To get the first upper limit, subtract one (1) from the
second lower limit. For the second upper limit, just add the class size continue
to add the class size to this upper limit to get the rest of the upper limits.
Note: The last class interval should contain the highest value.
Constructing the Class Limits Resulting Class Limits/Class
Intervals
LL - UL
Step 4: Determine the class boundaries by subtracting 0.5 from each of the lower
class limits and adding 0.5 to each of the upper class limits.
Lower Upper
Class
Boundarie Boundarie
Boundaries
Lower Limit s Upper Limit s
LB - UB
- 0.5 (LB) + 0.5 (UB)
24 - 0.5 23.5 30 + 0.5 30.5 23.5 – 30.5
31 - 0.5 30.5 37 + 0.5 37.5 30.5 – 37.5
38 - 0.5 37.5 44 + 0.5 44.5 37.5 – 44.5
45 - 0.5 44.5 51 + 0.5 51.5 44.5 – 51.5
52 - 0.5 51.5 58 + 0.5 58.5 51.5 – 58.5
59 - 0.5 58.5 65 + 0.5 65.5 58.5 – 65.5
66 - 0.5 65.5 72 + 0.5 72.5 65.5 – 72.5
73 - 0.5 72.5 79 + 0.5 79.5 72.5 – 79.5
80 - 0.5 79.5 86 + 0.5 86.5 79.5 – 86.5
87 - 0.5 86.5 93 + 0.5 93.5 86.5 – 93.5
Step 5: Calculate the class marks or class midpoints. It is the numerical location
of the center of the class and is computed as follows:
Formula 4
LL + Class Class
LL + UL
UL Midpoint Midpoint
2
2 Xi Xi
24 + 30 24 + 30
27 27
2 2
31 + 37 shortcut: 1st midpoint +
34 class size 34
2 27 + 7
38 + 44
41 34 + 7 41
2
45 + 51
48 OR 41 + 7 48
2
52 + 58
55 48 + 7 55
2
59 + 65
62 55 + 7 62
2
66 + 72
69 62 + 7 69
2
73 + 79
76 69 + 7 76
2
80 + 86
83 76 + 7 83
2
87 + 93
90 83 + 7 90
2
Step 6: Tally the data. Write the numerical equivalent of the tally on the column for
frequency. The best example of tallying the scores is counting the votes cast
in an election.
From table 1
Class Tally
Interval (Test Scores)
Tally f
s First 2nd 3rd 4th 5th
LL- UL Column Column Column Column Column
l l l
24 - 30 III 3
(30) (24) (29)
l l l
31 - 37 III 3
(36) (33) (35)
lI II
l l
38 - 44 (39),(43 (40), IIII-I 6
(38) (44)
) (42)
ll II
lI l
45 - 51 (50),(47 (51), IIII-II 7
(48),(49) (50)
) (46)
ll II II
l l
52 - 58 (52),(58 (55), (54), IIII-III 8
(58) (55)
) (53) (54)
lI lI II
ll I
59–65 (63),(64 (64),(64 (59), IIII-IIII 9
(65, 63) (65)
) ) (65)
l l I
66 - 72 III 3
(69) (72) (71)
lI Il
l I
73 - 79 (79),(73 (76), IIII-I 6
(79) (74)
) (78)
l I I
80 - 86 III 3
(85) (84) (86)
I I
87 - 93 ll 2
(89) (90)
Applying the steps, table 2 shows how the frequency distribution table looks like.
Table No. 2
Frequency Distribution of the 50 Test Scores in Statistics
Class
Class Class Frequen
Boundarie
Intervals Marks cy
s
LL - UL Xi f
LB – UB
24 - 30 23.5 - 30.5 27 3
31 – 37 30.5 - 37.5 34 3
38 – 44 37.5 - 44.5 41 6
45 – 51 44.5 - 51.5 48 7
52 – 58 51.5 - 58.5 55 8
59 – 65 58.5 - 65.5 62 9
66 – 72 65.5 - 72.5 69 3
73 – 79 72.5 - 79.5 76 6
80 – 86 79.5 - 86.5 83 3
87 - 93 86.5 - 93.5 90 2
c=7 n = 50
Illustration 1
(Data from Test Scores of 50 Students in Statistics)
Resulting "Less
Successive addition of Successive addition of than" and
frequencies from top to frequencies from "Greater than"
bottom bottom to top Cumulative
Frequencies
Greater Cumulati
Less Than
Frequenc than ve
Cumulative Frequency
y Cumulative f Frequenc
Frequency f
f Frequency y
(<cf)
(>cf) <cf >cf
3 3 3 47 + 3 50 3 3 50
3 3+3 6 3 44 + 3 47 3 6 47
6 6+6 12 6 38 + 6 44 6 12 44
7 12 + 7 19 7 31 + 7 38 7 19 38
8 19 + 8 27 8 23 + 8 31 8 27 31
9 27 + 9 36 9 15 + 9 23 9 36 23
3 36 + 3 39 3 11 + 3 14 3 39 14
6 39 + 6 45 6 5 + 6 11 6 45 11
3 45 + 3 48 3 2+3 5 3 48 5
2 48 + 2 50 2 2 2 50 2
n=5
n = 50 n = 50
0
𝑐𝑓 Formula 5
Cumulative Percentage Frequency (cpf) = 𝑥 100
𝑛
Cumulative Percentage
Frequency Cumulative Frequency
Frequency
f
<cf >cf <cpf >cpf
3 50
3 3 50 𝑥100 = 6 𝑥100 = 100
50 50
6 48
3 6 48 𝑥100 = 12 𝑥100 = 96
50 50
Now our table looks like this with the addition of the column on cumulative
percentage frequency.
Table 3
Cumulative Percentage Distribution of 50 Test Scores in Statistics
Cumulative
Class Class Cumulative
Frequency Percentage
Intervals Boundaries Frequency
f Frequency
LL - UL LB - UB
<cf >cf <cpf >cpf
24 - 30 23.5 - 30.5 3 3 50 6 100
31 – 37 30.5 - 37.5 3 6 48 12 94
38 – 44 37.5 - 44.5 6 12 44 24 88
45 – 51 44.5 - 51.5 7 19 38 38 76
52 – 58 51.5 - 58.5 8 27 31 54 62
59 – 65 58.5 - 65.5 9 36 23 72 46
66 – 72 65.5 - 72.5 3 39 14 78 28
73 – 79 72.5 - 79.5 6 45 11 90 22
80 – 86 79.5 - 86.5 3 48 5 96 10
87 - 93 86.5 - 93.5 2 50 2 100 4
c=7 n= 50
This is how to interpret the cumulative frequency and the cumulative
percentage frequency.
Remember: Use the upper class boundaries in interpreting the <cf and
the <cpf. (lower than the upper class boundaries)
Use the lower class boundaries in interpreting the >cf and the
>cpf. (higher than the lower class boundaries)
(For less than cumulative frequency and less than cumulative percentage frequency,
use the numbers colored yellow in the table.)
As seen in table 3, of the 50 students who took the test in Statistics, three
(3) students scored lower than 30.5 while six (6) students scored lower than 37.5.
Twenty-seven (27) or more than half of them, scored lower than 58.5. Moreover,
24% of the students scored lower than 44.5, while 78% scored lower than 72.5.
(For greater than cumulative frequency and greater than cumulative percentage
frequency, use the numbers colored green in the table.)
𝑓 Formula 6
Relative Frequency (rf) = 𝑛 𝑥 100%
Resulting
Relative Frequency
lustration No.3 Distribution of 50 Test
Scores in Statistics
Relative Relative
Frequency Frequency
Frequency Frequency
f f
(rf) (rf)
3 3 6
𝑥100% 6 3
50
3 3 6
𝑥100% 6 3
50
6 6 12
𝑥100% 12 6
50
7 14
8 16
9 18
3 6
6 12
3 6
2 4
n = 50 100%
Table 4
Cumulative Percentage and Relative Frequency Distribution of 50 Test Scores
in Statistics
Cumulative
Class Class Frequen Cumulative Relative
Percentage
Intervals Boundaries cy Frequency Frequency
Frequency
LL – UL LB - UB f (RF)
<cf >cf <cpf >cpf
24 – 30 23.5 - 30.5 3 3 50 6 100 6
31 – 37 30.5 - 37.5 3 6 48 12 94 6
38 – 44 37.5 - 44.5 6 12 44 24 88 12
45 – 51 44.5 - 51.5 7 19 38 38 76 14
52 – 58 51.5 - 58.5 8 27 31 54 62 16
59 – 65 58.5 - 65.5 9 36 23 72 46 18
66 – 72 65.5 - 72.5 3 39 14 78 28 6
73 – 79 72.5 - 79.5 6 45 11 90 22 12
80 – 86 79.5 - 86.5 3 48 5 96 10 6
87 – 93 86.5 - 93.5 2 50 2 100 4 4
c=7 n = 50 100%
Collected data remains useless and invalid unless presented effectively for
analysis and interpretations. Data are presented in three general methods as
enumerated below:
1.) Textual Method. By textual or paragraph method, the reader gets information by
merely reading the gathered data. Readers are directed to pay particular attention on
specific data such as comparisons, contrasts, synthesis, generalizations, or findings.
This method, however, when employed alone elicit its boredom. It is a weak means of
showing the quantitative comparisons or relations among quantitative or numerical
data attractively and interestingly. This method of presenting data is simple when there
are only a few numbers to be presented. However, it is not desirable to include a big
mass of quantitative data in a “text” or paragraph as the presentation becomes almost
incomprehensible.
Example:
The present enrollment of the Bulacan State University is 19,900 students.
About 12,077 or 63.56% are female or 36.44 % are male. Of the 19,000 students, 950
or 5% are scholars sponsored by different benefactors.
Example:
“Studies were made to evaluate the performance of students regarding the use
of modules (an innovative method) in teaching algebra against the traditional method.
A sample of 400 students were subjective in both methods. Two hundred of this
sample uses the modular approach and the other 200 of the students underwent the
traditional method. The groups were handled by two different professors. At the end
of the semester the students took the final examinations. The recorded results of the
examinations were as follows: 175 or 87.5% of the students who took the traditional
method passed the final exams and the remaining 25 or 12.5% of them failed the
exam. While for the group exposed to who uses the modular approach, 190 or 95% of
the students passed the examination and only 10 or 5% did not make it.
2.) Tabular Method. The tabular method is a more effective way of showing
relationships or comparisons of numerical data. It gives a more precise, systematic
and orderly presentation of data in rows and columns. This makes comparisons of
figures easy and comprehensive.
Example:
Table 3.4
Comparison of Final Grades of the 400 students between the
Traditional and Modular Method
Semi-tabular method. The semi-tabular method employs both the textual and tabular
method. This method is used only if there are few figures to be tabulated. The tables
are followed by narrative explanations to make the facts more understandable.
3.) Graphical Presentation. The use of graphs is the most effective method of
presenting statistical result or findings. It gives the relationships of data in pictorial
form. Presentations of facts are made attractive and meaningful especially if colors
are made and pictures are used, making it easy for important information to be grasped
by the reader. Its only limitations are 1.) graphs are not precise as the tables. 2.)
graphs require more skills and time to prepare, 3.) graphs can only be made after data
have been presented in tabular forms.
a.) Line graph. A line graph is a type of chart used to show information that changes
over time. We plot line graphs using several points connected by straight lines. We
also call it a line chart. The line graph comprises of two axes known as ‘x’ axis and ‘y’
axis.
Source:
https://ph.images.search.yahoo.com/search/images?p=line+graph+example&fr=mcafee&type=
E210PH91215G91490&imgurl=https%3A%2F%2F2.zoppoz.workers.dev%3A443%2Fhttps%2Fimages.edrawmax.com%2Fimages%2Fknowle
dge%2Fline-graph-5-example-2.jpg#id=8&iurl=https%3A%2F%2F2.zoppoz.workers.dev%3A443%2Fhttps%2Fd77da31580fbc8944c00-
52b01ccbcfe56047120eec75d9cb2cbd.ssl.cf6.rackcdn.com%2F2691abf7-b6f8-45dc-83ba-
417e3224dc9a%2Fslide11.jpg&action=click
b.) Bar Graph. The bar graph is used to represent discrete data, so instead of being
joined like in the histogram, the bars are separated. The length of each bar also
represents the frequency within the given class. The width of the bar is arbitrary. The
bars, however must be of the same width. Thus, the bar graph is almost the same as
the histogram, the only difference is that the bars of the histogram are joined.
1200
1000
800
600
400
200
0
COEd CAS CIT COE CAFA OTHERS
c.) The Pie Diagram. A pie diagram or a pie chart is used to present
percentage distribution. It consists of a circle subdivided into sectors
proportional in size to the quantities or percentages they represent.
d.) Statistical maps. A special type of map in which the variation in quantity of a factor
such as rainfall, population, or crops in a geographic area is indicated.
Pictogram. A pictogram is a chart that uses pictures to represent data. Pictograms
are set out in the same way as bar charts, but instead of bars they use columns of
pictures to show the numbers involved.
Example of statistical Map
The histogram is composed of vertical bars that are joined together, making it
an appropriate graph for continuous data. The basis of each bar or rectangle is equal
to the class boundaries with the height corresponding to its class frequency.
The column of the class boundaries are obtained by subtracting point 5 from all
the lower limits and adding point 5 to all upper limits. The amount to be subtracted or
added is exactly half the gap between to class intervals. For instance, in the interval
15-21 and 22-28, the gap is 1, so half or 1 is 0.5.
Step 1. Lay off the x-axis and the y-axis to represent the class intervals and the class
frequencies, respectively.
Step 2. Determine the most appropriate scale for the class frequencies.
Step 3. Draw each bar with a height equal to the class frequency of each class
boundary. Class marks may replace the class boundaries. So just enter the base of
each bar on its corresponding class mark.
Step 4. Label x and y-axis correctly.
Step 1. Represent the units on the x-axis by the exact upper limit of each class
interval.
Step 2. Represent the y-axis by the cumulative frequency.
Step 3. Plot the point above the exact upper limit, which corresponds to its
cumulative frequency.
Step 4. Connect the points by straight lines.
Class Interval Frequency Class Boundaries <Cf >Cf
LL-UL f LCB-UCB
15-21 2 14.5-21.5 2 50
22-28 2 21.5-28.5 4 48
29-35 4 28.5-35.5 8 46
36-42 8 35.5-42.5 16 42
43-49 11 42.5-49.5 27 34
50-56 9 49.5-56.5 36 23
57-63 4 56.5-63.5 40 14
64-70 6 63.5-70.5 46 10
71-77 2 70.5-77.5 48 4
78-84 2 77.5-84.5 50 2
c=7 n=50