MMW Module 4 Lesson 1
MMW Module 4 Lesson 1
4
Data
Management
Data Management
94
Learning Outcomes
95
Lesson 1. Introduction to Data Management
Statistics play a very vital role in our society today, especially this time
of pandemic (COVID-19). All should be included, be counted and accountable for.
No one should be left behind. Because of the usefulness of statistics in almost all
fields of endeavor, some cautions should also be considered. Impressive figures
can be blown out of proportions of their real or imagined importance.
Unscrupulous minds with vested interests make improper or unethical use of
different statistical methods. Questionable and even conflicting claims backed
up with “statistics” can be accepted as true which leads one to believe that
anything can be proven statistically. Moreover, faulty researchers maybe slanted
to produce a particular outcome, that is, statistical analyses are chosen to
produce such outcomes.
Most importantly, for the above reasons, for the statistics users or the
researchers that they clearly understand the statistical tools or techniques being
used in their researches. Thus, in this module, careful attention will be given to
the role of statistics as a tool in research.
96
Statistics is being divided into two (2) categories or branches called
descriptive and inferential statistics. We can differentiate the two using the
definition of statistics.
COLLECTING
ORGANIZING DATA DESCRIPTIVE STATISTICS
PRESENTING
ANALYSIS
DATA INFERENTIAL STATISTICS
INTERPRETING
PARAMETER
A parameter is a value, usually a numerical value that describes a
population. It may be obtained from a single measurement, or it may be derived
from a set of measurements from the population. (µ-population mean; δ-
population standard deviation)
STATISTIC
A statistic is a value, usually a numerical value that describes a sample.
It may be obtained from a single measurement, or it may be derived from a set
of measurements from the sample. (Ẍ-sample mean; s-sample standard
deviation)
VARIABLE
A variable is any information that differs from one member to another in
a population or sample. It is a characteristic of interest for the elements. The
weight (kg) in Table 1.1 served as the variable.
97
Table 1.1 Weights of Randomly selected Grade IV pupils in AES, 1st Quarter of 2020
Section Weights (Kg)
IV - 1 50 41 36 34 54 60 51 37
IV - 2 22 39 42 42 45 38 38 40
IV - 3 38 28 32 44 42 47 37 28
IV - 4 27 27 40 41 39 32 36 24
IV - 5 40 39 33 33 27 30 31 45
Each weight of pupils included in the data set is called an element. An entity on
which data are collected. Collected measurements on each variable for every
element in a study provide the data. The set of measurements obtained for
particular element is called observation.
In Table 1.1, we see the different measurements for the first observations
(IV-1) are 50, 41, 36, 34, 54, 60, 51, 37. For the second observations (IV-2) are
22, 39, 42, 42, 45, 38, 38, 40, and so on. A data set with 40 elements contains
40 observations.
CONSTANT
A constant is an information about the population or sample that is true
to all members. The value of pi, temperature (Celsius to Fahrenheit and vice
versa), number of days in a week, and different forms of measurements e.g. 12
inches = I foot, are some examples of constant.
1. Qualitative Data
Qualitative data describes qualities or characteristics. It is mostly non-
numerical and descriptive in nature. It often but not always captures
emotions, feeling and subjective perception of something.
Qualitative method of research is characterized by the following:
2. Quantitative Data
Quantitative data deals with things that are measurable and can be
expressed in number and figures. It is usually expressed in numerical form
and can be mathematically computed. Qualitative data can be collected
using:
● Experiments/clinical trials
● Observing and recording well-defined objects such as number of
cars which participated in a motorcade.
● Administering surveys with closed-ended questions.
● Paper-pencil questionnaires
Example:
1. Number of siblings
2. Height and weight
3. Temperature in degree Celsius
For example, if you would describe a house, your description can either be
qualitative or quantitative. Here are some descriptions:
99
Qualitative Quantitative
The house is located in Baguio City. The house is 8.5 meters high.
The house is mostly made of cement. The house has 3 bedrooms.
The color of the house is green. The house’s floor area is 125 square
The door is made of oak tree. meters.
1. Nominal data
This level of data is categorical in nature; none is greater than or less than
the other, and it is not in any particular order. Also, the categories are
exclusive and exhaustive, meaning, the response can neither be ‘both’ nor
‘neither’.
2. Ordinal data
Ordinal data must also be exclusive and exhaustive, but the difference is
that the responses are ranked or it has order. Here, you can say that one
response is higher or better than the other.
3. Interval
Here, interval of equal length signifies equal differences in the data.
Difference makes sense but ratios do not. An example is temperature, 30 oC
is not twice as hot as 15oC. Also, the ‘true zero’ start point is not applicable.
This means that zero does not signify the absence of the measurement. Zero
degree Celsius does not mean that there is no temperature.
Example: Temperature
4. Ratio
At this level, both differences and ratios are meaningful. Example, 4
Liters of water is twice as much as 2 Liters of water. There also exists the
‘true zero’ start point in which zero means nothing or the absence of the
measurement. Zero liter of water means there is no water.
100
Example: Weight, Height, Number of children
Data
Qualitative Quantitative
Data can also be classified according to who collected the data. It can be
a primary data or secondary data.
Primary data – These are data which were collected first hand. It is more
authentic, reliable and objective as compare to secondary data. Primary data can
be obtained through experiments, surveys, questionnaires, interviews and
observations.
Secondary data – These data are collected from already published in any form.
The review of literature of research is based on secondary sources. The
importance of secondary data is when you do not need to go through the hassle
of collecting data when it is already available and published. It will save time,
effort and money in the part of the researcher. Secondary data can be collected
from books, records, magazines, research articles, newspapers, biographies,
databases, etc.
Data Presentation
1. Textual Presentation
In textual or descriptive presentation, the data are presented using texts
or paragraphs. This is usually used when the number of data is not too large.
For example:
The population of Region I as of May 1, 2020 is 5,301,139 based on the
2020 Census of Population and Housing (2020 CPH). This accounts for about
4.86 percent of the Philippine population in 2020. The 2020 population of the
region is higher by 275,011 from the population of 5.03 million in 2015, and
552,767 more than the population of 4.75 million in 2010. Moreover, it is higher
101
by 1,100,661 compared with the population of 4.20 million in 2000.
(psa.gov.ph)
2. Tabular Presentation
In tabular presentation, data are presented using tables to represent even
a large number of data to make it engaging and easier to read. The data are
arranged in rows (horizontal) and columns (vertical). Tabular presentation
avoids unnecessary details and repetitions of data. It reveals patterns which
cannot be seen when it is presented in textual form.
102
3. Diagrammatical or Graphical Presentation
This type of presentation uses graphs or diagrams such as bar graph, pie
graph, line graph and scatter diagram. Diagrams give a bird’s eye view of the
data and can be easily understood just by looking at the graph.
Some of the charts or graphs which are commonly used are the following:
1. Pie chart
The following pie graph illustrates the population of Region I per province
for the year 2020 using the data in table 2.
It can be seen in the graph that Pangasinan constitutes 60% of the total
population of Region I.
2. Bar graph
The following bar graph shows the comparison among the population of
the provinces in Region I from 2000, 2010, 2015 and 2020 as seen in
Figure 2.
103
The bar graph shows that Pangasinan dominates the population of
Region I from year 2000 to 2020. The province with the least population
is Ilocos Norte.
3. Column chart
4. Line graph
104
The line graph illustrates that the population of the provinces from
Region I continuously increased from year 2000 to 2020.
5. Scatterplot
The scatter plot shows an almost perfect linear relationship between the
year and the population of the Philippines.
Looking at the given examples, diagrams are mostly used as visual aids. It
cannot be considered as alternatives for numerical data. Diagrams and graphs are
not as accurate as tabular data. Only tabular data can be used for further analysis.
105
MODULE IV – Data Management
Learning Activity 1 – Introduction to Data Management
Name: ______________________________________________________
Course, Year and Section: _____________________________________
Classify the following data whether they are qualitative or quantitative and
nominal, ordinal, ratio or interval.
106