Lesson 5.1
DATA
MANAGEMENT
LEARNING OUTCOMES:
1. Use a variety of statistical tools to
process and manage numerical
data.
2. Use the methods of linear
regression and correlations to
predict the value of a variable given
certain conditions.
3. Advocate the use of statistical data
in making important decisions.
LESSON 5.1
THE DATA
SPECIFIC OBJECTIVES:
1. To understand the nature of
statistics.
2. To gain deeper insights on the
different levels of measurement.
3. To clarify the meaning of some
important key concepts.
4. To explore the strengths and
limitations of graphical
representation.
It is written in the Holy Book that
“the truth shall set us free”;
therefore, understanding statistics
paves the way towards intellectual
freedom.
Without sufficient knowledge
of it, we may be doomed to a life
of half-truths.
Statistics will provide deeper
insights to critically evaluate
information and to bring us to the
well-lit arena of practicality.
GENERAL FIELDS OF STATISTICS
Descriptive Statistics
It is about “describing” data in symbolic
form and in abbreviated fashion.
Sometimes we deal with such a large
amount of data that it is impossible to
describe it as it is. Descriptive statistics
provides tools that make the data
manageable to handle and conveniently
neat to describe.
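As a minimal sketch of what such tools look like in practice, the Python snippet below condenses the fifteen scores used in the Graphs part of this lesson into a few summary numbers. The choice of Python and of these particular summaries (mean, median, mode, range) is an assumption made for illustration; the lesson itself does not prescribe them.

```python
import statistics

# Fifteen raw scores, taken from the Graphs part of this lesson.
scores = [120, 65, 110, 75, 105, 80, 105, 85, 100, 85, 100, 90, 95, 90, 90]

# A handful of numbers that stand in for the whole list.
summary = {
    "count": len(scores),
    "mean": statistics.mean(scores),
    "median": statistics.median(scores),
    "mode": statistics.mode(scores),
    "range": max(scores) - min(scores),
}

for name, value in summary.items():
    print(f"{name}: {value}")
```

Five numbers are far easier to report than fifteen, and the saving grows with the size of the data set.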
Inferential Statistics
It has the ability to “infer” and to
generalize, and it offers the right tools to
predict values that are not actually known.
Inference or generalization is a risky
process, which is why we need to ensure that
the small group we select, for example a
sample of workers, is approximately
representative of the workers in the entire
region. Nevertheless, an inference or
prediction made this way is better than
chance accuracy.
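The idea of estimating an unknown value from a small group can be sketched as follows. Everything in this snippet is hypothetical: the population of daily wages is generated by the program, the sample size of 20 is arbitrary, and the fixed seed exists only to make the run repeatable. In a real study the population mean would not be available for comparison.

```python
import random
import statistics

# Hypothetical population: daily wages (in pesos) of 1,000 workers in a region.
# These values are generated for illustration only; they are not real data.
rng = random.Random(42)  # fixed seed so the sketch is repeatable
population = [rng.randint(400, 599) for _ in range(1000)]

# Draw a simple random sample of 20 workers, without replacement.
sample = rng.sample(population, 20)

population_mean = statistics.mean(population)  # normally unknown
sample_mean = statistics.mean(sample)          # our estimate of it

print(f"sample mean:      {sample_mean:.2f}")
print(f"population mean:  {population_mean:.2f}")
print(f"estimation error: {abs(sample_mean - population_mean):.2f}")
```

Because the sample is drawn at random, every worker has the same chance of being selected, which is one simple way of making the small group roughly representative.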
MEASUREMENT
It essentially means quantifying an
observation according to a certain rule.
For instance, the presence of fever can
be quantified by using a thermometer,
and body weight can be determined by
using a weighing scale. Mental ability
can likewise be quantified by using a
written examination that generates
scores.
TWO TYPES OF QUANTITATIVE
INFORMATION:
(1) (2)
In the nominal scale, numbers such as
1 for single and 2 for married serve
only as labels and do not carry any
numerical weight.
Thus, we cannot say that
married people (having been
labelled 2) have more marital
status than single people
(having been labelled 1).
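A short sketch can make the point concrete. The coding scheme (1 for single, 2 for married) follows the text above, while the ten survey responses are hypothetical. Counting labels is meaningful; averaging them is not.

```python
from collections import Counter

# Coding scheme from the text: the numbers are labels only.
STATUS_LABELS = {1: "single", 2: "married"}

# Hypothetical survey responses, recorded as codes.
responses = [1, 2, 2, 1, 2, 1, 1, 2, 2, 2]

# Counting how often each label occurs is meaningful for nominal data.
counts = Counter(STATUS_LABELS[code] for code in responses)
print(counts)

# Arithmetic on the codes runs without error, but the result has no
# interpretation: an "average marital status" of 1.6 describes nobody.
meaningless_average = sum(responses) / len(responses)
print(meaningless_average)
```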
ORDINAL SCALE
There are instances wherein comparison
is necessary and cannot be avoided.
The ordinal scale ranks the observations
in order to generate information to the
extent of “greater than” or “less than.”
But ranked data is also limited to that
extent: it cannot tell us how much
greater or how much less.
EXAMPLE:
The ordinal scale is best illustrated
in sports activities like a fun run.
Finding the order of finish among the
participants in a fun run always
produces a ranking.
However, ranked data cannot
provide information about the
difference in time between the 1st
placer and the 2nd placer.
INTERVAL SCALE
In the nominal scale, we use numbers to
label categories while in the ordinal scale
we use numbers to merely provide
information regarding greater than or less
than.
However, in the interval scale we assign
numbers in such a way that the distance
between points on the scale has meaning
and weight. This scale of measurement
therefore provides more information
about the data.
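A brief sketch of what "meaning between points" allows: with interval data, differences can be computed and compared. The four students and their examination scores below are hypothetical and are not the lesson's own example.

```python
# Hypothetical examination scores for four students (not from the lesson).
scores = {"Student A": 90, "Student B": 80, "Student C": 70, "Student D": 65}

# On an interval scale, differences between values carry meaning.
gap_ab = scores["Student A"] - scores["Student B"]
gap_bc = scores["Student B"] - scores["Student C"]
gap_cd = scores["Student C"] - scores["Student D"]

print(gap_ab == gap_bc)  # True: both gaps are 10 points
print(gap_cd)            # 5: a smaller gap
```

A ranking of the same students would show only 1st through 4th place and would hide the fact that the last gap is half the size of the others.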
EXAMPLE:
Academic performance of five students in a
certain class
Population
It can be defined as an entire
group of people, things, or events
having at least one trait in
common.
A common trait is the binding
factor in order to group a cluster
and call it a population.
Merely having a clustering of
people, things or events
cannot be considered as a
population. At least one
common trait must be
established to make a
population.
But, on the other hand, adding too many
common traits can also limit the size of
the population.
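The effect of adding common traits can be sketched with a small set of hypothetical records (the people, ages, and cities below are made up, and `population` is an illustrative helper, not a standard function). Each extra required trait can only keep the group the same size or shrink it.

```python
# Hypothetical records; names and traits are made up for illustration.
people = [
    {"name": "P1", "student": True,  "age": 19, "city": "Manila"},
    {"name": "P2", "student": True,  "age": 20, "city": "Manila"},
    {"name": "P3", "student": True,  "age": 19, "city": "Cebu"},
    {"name": "P4", "student": False, "age": 19, "city": "Manila"},
    {"name": "P5", "student": True,  "age": 21, "city": "Davao"},
    {"name": "P6", "student": False, "age": 30, "city": "Cebu"},
]

def population(records, **traits):
    """Return the records that share every one of the given traits."""
    return [r for r in records if all(r[k] == v for k, v in traits.items())]

one_trait = population(people, student=True)
two_traits = population(people, student=True, city="Manila")
three_traits = population(people, student=True, city="Manila", age=19)

print(len(one_trait), len(two_traits), len(three_traits))  # 4 2 1
```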
Example:
Graphs
A graph is another way to visually
show the behavior of data. To
create a graph, the distribution
of scores must first be organized.
For instance, presenting the
scores below in an unorganized
manner can provide confusing
information or no information at
all. Reporting raw scores can even
keep some significant scores from
being noticed:
120, 65, 110, 75, 105, 80, 105,
85, 100, 85, 100, 90, 95, 90, 90
But when we arrange the scores from highest
to lowest, which is a form of score
distribution, some pieces of information are
gradually brought forth and exposed.
Distribution of Scores
120, 110, 105, 105, 100, 100, 95, 90,
90, 90, 85, 85, 80, 75, 65
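Arranging the scores from highest to lowest takes one line of Python. This is only a sketch of the step described above, using the same fifteen scores.

```python
# The fifteen raw scores, in the order they were reported.
scores = [120, 65, 110, 75, 105, 80, 105, 85, 100, 85, 100, 90, 95, 90, 90]

# Arrange from highest to lowest to form the score distribution.
distribution = sorted(scores, reverse=True)
print(distribution)

# The extremes, hard to spot in the raw list, are now at the two ends.
print("highest:", distribution[0], "lowest:", distribution[-1])
```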
The score distribution can be
organized further in the form of a
frequency distribution.
A frequency distribution lists
each raw score together with its
frequency of occurrence, and so
provides clearer insights about
the behavior of the scores.
X              f
(raw score)    (frequency of occurrence)
120            1
110            1
105            2
100            2
95             1
90             3
85             2
80             1
75             1
65             1
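The same frequency distribution can be built programmatically. The sketch below uses `collections.Counter` from the Python standard library, which is one convenient option among several.

```python
from collections import Counter

# The fifteen raw scores from the example.
scores = [120, 65, 110, 75, 105, 80, 105, 85, 100, 85, 100, 90, 95, 90, 90]

# Count how many times each raw score occurs.
frequency = Counter(scores)

# Print the table from the highest score to the lowest.
print("X    f")
for score in sorted(frequency, reverse=True):
    print(f"{score:<4} {frequency[score]}")
```

The frequencies add up to 15, the number of scores, which is a quick check that no score was missed.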
Another way of presenting a
frequency distribution is in
tabular form. A tabular form has
the advantage of giving a visual
representation of the data, and
this kind of presentation is more
appealing to a general audience.
The data can also be shown in graphical
form using Microsoft Excel, as illustrated
in the graphs below, which show the
frequency polygon of the scores in the
example above.
Notice in the illustration of the
frequency polygon that the two graphs
may appear different, but they are
actually the same and disclose the
same information.
This illustration should allow you
to realize that unless you see things
with a critical eye, a graph can
create a false impression of what
the data really reveal.
This is an obvious situation showing
how graphs can be used to distort
reality if you are not equipped with a
critical statistical mind. This kind of
deceitful cleverness in distorting
graphs is common in some
corporations, which devise such tinsel
to camouflage the real figures and to
portray gigantic leaps in sales in
order to attract more clients or buyers.
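One way such a distortion can arise is sketched below. It assumes that the graph's vertical axis starts above zero, which is one common trick but not necessarily the one used in the lesson's own illustration. The sales figures are hypothetical, and `drawn_height` is an illustrative helper.

```python
def drawn_height(value, axis_start):
    """Height of a bar on the page when the vertical axis begins at axis_start."""
    return value - axis_start

# Hypothetical sales figures for two years (made up for illustration).
last_year, this_year = 100, 110

true_ratio = this_year / last_year  # a 10% increase

# Honest graph: the axis starts at zero, so bar heights match the values.
honest_ratio = drawn_height(this_year, 0) / drawn_height(last_year, 0)

# Distorted graph: the axis starts at 95, so the bars are 5 and 15 units tall.
distorted_ratio = drawn_height(this_year, 95) / drawn_height(last_year, 95)

print(honest_ratio)     # 1.1
print(distorted_ratio)  # 3.0
```

The data are identical in both cases, yet the second bar looks three times as tall as the first when the axis is truncated. Checking where the axis starts is a simple habit of the critical eye the lesson calls for.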
ACTIVITY
Indicate which scale of
measurement (nominal,
ordinal, or interval) is being
used.
1. The Globe and Smart
phone number prefixes 0917
and 0923 serve 1 million
and 2.5 million subscribers,
respectively.
2. The Philippine Statistics
Office announces that the
average height of Filipino
males is 156.41 cm.
3. The Postal Office reports
that 4,231 individuals
have a zip code of
4231.
4. The Sportsfest committee
posted the names of
individuals with their order of
finish for the first 50 runners
to reach the finish line.
5. The University Admission
Office posted the names and
scores of student applicants
who took the entrance
examination.