100% found this document useful (2 votes)
1K views

Statistics and Probability

This document provides information about probability distributions and random variables. It discusses discrete and continuous random variables and how to determine the possible values of a random variable based on a given experiment or situation. Examples are provided to illustrate finding the sample space, possible values of a random variable, and constructing the probability distribution in graphical form using a probability histogram. The key properties of discrete probability distributions are also outlined.

Uploaded by

Joh Tayag
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
100% found this document useful (2 votes)
1K views

Statistics and Probability

This document provides information about probability distributions and random variables. It discusses discrete and continuous random variables and how to determine the possible values of a random variable based on a given experiment or situation. Examples are provided to illustrate finding the sample space, possible values of a random variable, and constructing the probability distribution in graphical form using a probability histogram. The key properties of discrete probability distributions are also outlined.

Uploaded by

Joh Tayag
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 81

11

STATISTICS AND
PROBABILITY
Learning Activity Sheets
Quarter 3: Week 1-8

Division of Angeles City


Name of learner: ___________________________ Section: ___________________

WEEK 1: RANDOM VARIABLES AND PROBABILITY DISTRIBUTION


Background Information
In the study of basic probability, you have learned that an experiment is any activity
which can be done repeatedly under similar condition. The set of all possible outcomes of an
experiment is called the sample space. You have also learned how to mathematically list the
possible outcome of a given experiment. In tossing a coin, for instance, the possible outcomes
are turning up a head or a tail.

For you to begin let us all understand that probability distributions can be illustrated or
classified as discrete probability distributions or as continuous probability distributions,
depending on whether they define probabilities associated with discrete variables and
continuous variables.

RANDOM VARIABLE

A variable 𝑋 whose value depends on the outcome of a random process is called a


random variable. A random variable is a variable whose value is a numerical outcome of a
random phenomenon. A random variable is denoted with a capital letter. The probability
distribution of a random variable 𝑋 tells what the possible values of 𝑋 are and how probabilities
are assigned to those values.

A random variable can be discrete or continuous.

A discrete variable is a numerical A continuous variable has infinite


value associated with the desired numerical values associated with
outcomes. It is also either a finite or any interval on the number line
infinite number of values but system without any gaps or breaks.
countable such as whole numbers
0,1,2,3. Examples:
Examples: • height
• number of siblings • weight
• admissions in a school • skull circumference
• total number of teachers

Example 1
Suppose we toss a fair coin 3 times. The sample space for this chance process is :
{ HHH HHT HTH THH HTT THT TTH TTT}

1
Since there are 8 possible outcomes then probability of each outcome is .
8
Define the variable 𝑋 = “the number of heads obtained”.
What are the outcomes of possible heads?

2
𝑋 =0: TTT
𝑋 =1: HTT , THT , TTH
𝑋 =2: HHT , HTH , THH
𝑋 =3: HHH

The random variable in the activity above is a discrete random variable because the
set of possible outcomes is countable. The possible values of random variable 𝑋 = {0, 1, 2, 3}

PROBABILITY DISTRIBUTION
Probability distribution of a random variable tells all of its possible values along with
their associated probabilities.

Example 1: The random variable X is the number of tails in two tosses of a coin. What are
the probabilities of the possible outcomes? What is the probability distribution of X?

Solution:
Possible outcomes: {HH HT TH TT}
1
Each has probability so the probability distribution is:
4

𝑿 = No. of Tails 0 1 2
1 2 1
𝑷(𝑿)
4 4 4

In Graphical Form:
We can use the probability distribution to answer questions about variable x. In symbols, we
want to find 𝑃(𝑥 ≥ 1). We could add probabilities to find the answer:

𝑃(𝑥 ≥ 1) = 𝑃(1) + 𝑃(2) or 𝑃(𝑥 ≥ 1) = 1 − 𝑃(𝑥 < 1)


2 1
𝑃(𝑥 ≥ 1) = + 𝑃(𝑥 ≥ 1) = 1 − 𝑃(0)
4 4
3 1
𝑃(𝑥 ≥ 1) = 𝑃(𝑥 ≥ 1) = 1 −
4 4
3
𝑃(𝑥 ≥ 1) =
4

3
FINDING POSSIBLE VALUES OF A RANDOM VARIABLE

To understand how one finds the possible values of random variables, study the
example below...

Suppose two cellphones are tested at random. We want to find out the number of
defective cellphones. Thus, to each outcome in the sample space we shall assign a value.
These are 0, 1, or 2. If there is no defective cellphone, we assign the number 0; if there is 1
defective cellphone, we assign the number 1; and 2, if there are two defective cellphones. The
number of defective cellphones is a random variable.

The possible values of this random variable are 0, 1, and 2. Read and analyze the
given situation below.

Illustration:
Let D represent the defective cellphones and N for the non-defective cellphones. If we
let Y be the random variable representing the number of defective cellphones, show the values
of the random variable Y. The table below shows the values of the random variable.

Value of the Random


Possible Outcomes Variable Y (number of
defective cellphone)
DD 2
NN 0
DN 1
ND 1

Example 2. Suppose two coins are tossed, let Z be the random variable representing the
number of heads that occur. Find the values of the random variable Z.

1. Determine the sample space. Let H represent head and T for tail

Solution: The sample space for this experiment is: S = {TT, TH, HH, HT}

2. Count the number of heads in each outcome in the sample space and assign this
number to this outcome.

Value of the Random


Possible Outcomes variable Z (Number of
heads)
TT 0
TH 1
HH 2
HT 1

So, the possible values of the random variable 𝑍 = {0, 1, 2 }

4
ILLUSTRATING A PROBABILITY DISTRIBUTION FOR A DISCRETE RANDOM
VARIABLE AND ITS PROPERTIES

A random variable is a set whose elements are the numbers assigned to the
outcomes of an experiment. It is usually denoted by uppercase letters such as 𝑋, whose
elements are denoted by lower case letters 𝑥1 , 𝑥2 , 𝑥3 and so on.

The probability distribution of a discrete random variable specifies the probability


of each possible value of the random variable. The distribution functions of discrete random
variables are concentrated as a mass for a particular value, and generally known as
probability mass function.

Properties of Discrete Probability Distribution

1. The probability of each value of the random variable must be between or equal to 0
and 1. In symbol 0 < 𝑃(𝑋) < 1.
2. The sum of all the probabilities of all values of the random variable must be equal to 1.
In symbol, we write it as ∑ 𝑃(𝑋) = 1.

Probability Histogram

We can represent a probability distribution graphically by constructing a type of bar


graph called a probability histogram. This is constructed by displaying the possible distinct
values of the random variable along a horizontal axis. Above each value x of the random
variable, we draw vertical bar having height equal to the probability P(x).

Example 1: Number of Green Balls

Write the probability distribution of a random variable Y representing the number of


green balls when 2 balls are drawn in succession without replacement from a jar containing 4
red and 5 green balls. Construct a histogram for this probability distribution.

Step 1: Determine the sample space. Let R represent the red ball and G represent the green
ball.

Solution : The sample space for this experiment is: 𝑆 = { 𝑅𝑅, 𝑅𝐺, 𝐺𝑅, 𝐺𝐺}

Step 2: Count the number of green balls in each outcome in the sample space and assign this
number to this outcome.

Value of the Random


Possible
Variable Y (No. Of
Outcomes
Green Balls)

RR 0
RG 1
GR 1
GG 2

5
Step 3: Assign probability values 𝑃(𝑌) to each value of the random variable.

• There are 4 possible outcomes, if no green ball occurs once, the probability that
1
we shall assign to the random variable 0 is .
4
• If 1 green ball occurs two times, the probability that we shall assign to the
2 1
random variable 1 is or .
4 2
• If 2 green balls occur once, the probability that we shall assign to the random
1
variable 1 is .
4

Number of Green
Probability P(Y)
Balls Y
1
0
4
1
1
2
1
2
4

Histogram

Construct a histogram for this probability distribution. A histogram is a bar graph. To


construct a histogram for a probability distribution, follow these steps:
1) Plot the values of the random variable along the horizontal axis.
2) In plotting the probabilities along the vertical axis, you can change fractions into
1 1
decimals (ex. = 0.25 and = 0.5)
4 2

0.6

0.5

0.4
P(Y)

0.3

0.2

0.1

0
0 1 2
No. of Green Balls (Y)

Example 2
Let 𝑋 be a random variable giving the number of girls in a randomly selected three-
child family. Assuming that boys and girls are equally likely, construct the probability
distribution of 𝑋 and its corresponding histogram.

6
a.
Possible Outcomes Value of the Random Variable X
(No. of Girls)

BBB 0

BBG 1

BGB 1

GBB 1

BGG 2

GBG 2

GGB 2

GGG 3

b. No. of Girls 0 1 2 3

1 3 3 1
Probability 𝑷(𝑿) or 0.125 or 0.375 or 0.375 or 0.125
8 8 8 8

1 3 3 1 8
∑ 𝑃(𝑋) = + + + = =1
8 8 8 8 8
c. Histogram
0.4
0.3
P(X)

0.2
0.1
0
0 1 2 3

No. of Girls (X)

Learning Competencies with code:

➢ Illustrate a random variable (discrete and continuous) M11/12SP – IIIa-1


➢ Distinguish between a discrete and a continuous random variable(M11/12SP-IIIa2)
➢ Find the possible values of a random variable. (M11/12SP-IIIa-3)
➢ Illustrate a probability distribution for a discrete random variable and its properties.
(M11/12SP-IIIa-4)

7
GENERAL INSTRUCTION: Write your answer on a separate paper
Practice A
Classify whether the given experiment implies a discrete random variable or a
continuous random variable. Write D if discrete and C if continuous.
_____ 1. The temperature of a solution in the laboratory
_____ 2. Collecting data about the heights of students in senior high schools.
_____ 3. Recording the distance travelled by the bus
_____ 4. Surveying about the number of cases due to COVID-19 pandemic in Angeles City
_____ 5. Number of promoted students at the end of school year

Practice B.
Complete the given table by finding the possible values of random variable.
1. In a box there are 2 balls - one white and one yellow. Two balls are picked one at a
time with replacement. Let X be the random variable representing the number of white
balls. Find the values of the random variable X.

Possible Outcomes Value of the random variable X


(Number of white balls)

2. Four coins are tossed. Let Y be the random variable representing the number of tails
that occur. Find the values of the random variable Y.

Possible Outcomes Value of Random Variable Y


(number of tails)

8
Practice C.
Determine whether the given values can serve as the values of a probability distribution.
1. 𝑃(1) = 0.05, 𝑃(2) = 1.01, 𝑃(3) = 0.2
3 7 1
2. 𝑃(1) = , 𝑃(2) = , 𝑃(3) =
20 20 2

3. 𝑃(1) = 0.35 , 𝑃(3) = 0.25 , 𝑃(5) = 0.28 , 𝑃(7) = 0.12

X 1 5 7 8
1.
1 1 1 1
P(x)
4 8 4 8

X 1 3 5 7
2.
P(x) 0.35 0.25 0.22 0.12

Practice D. Fill in the blanks with the correct answer.


1. It is the list of possible values of a random variable X and the corresponding
probabilities of the values. _______________________________.
2. What is the sum of all the probabilities of all values of the random variable?
______________________________________________________________.
3. What should be the probability of each value of the random variable? Write your
answer in symbol. __________________________________________.
4. How will you determine if the given distribution is a probability distribution?
__________________________________________________________.
5. Enumerate the steps on how to illustrate the probability distribution of a discrete
random variable by completing the statements given below.
Determine first the _____________space in the given experiment. Then, find the
possible values of the _____________________. Illustrate the probability distribution
by assigning ________________ to each value of the random variable.

References
Albert, J. R. G. (2008). Basic Statistics for the Tertiary Level (ed. Roberto Padua,
Welfredo Patungan, Nelia Marquez), published by Rex Bookstore.

Handbook of Statistics 1 (1st and 2nd Edition), Authored by the Faculty of the
Institute of Statistics, UP Los Baños, College Laguna 4031

R Pierce, Rod. (3 Feb 2020). "Random Variables". Math Is Fun. Retrieved 24 May 2020
from http://www.mathsisfun.com/data/random-variables.html
Malate, Jose S. Statistics and Probability for Senior High School. Vicarisg Publications
and Trading, Inc, 2017.

Lim, Yvette F., Nocon, R., Nocon, E., Ruivivar, L. Math for Engaged Learning Statistics
and Probability. Sibs Publishing House, Inc. 2016.

9
ANSWER KEY
Practice A Practice B Practice C Practice D
1. 1. Probability
1 C 1. not a Distribution
2. 1
2 C Possible Value of probability 3. 0 ≤ P(x) ≤ 1
3 C Outcomes the distribution 4. ∑ P(x) = 1 and
4 D Random 2. probability 0 ≤ P(x) ≤ 1
5 D Variable Y distribution 5. Sample
3. probability Random variable
(No. Of
Probability values
white distribution or P(x)
balls) 4. not a
probability
WW 2 distribution
YY 0 5. not a
probability
WY 1 distribution
YW 1
Possible value of the random
X = { 0,1,2}

2.

Possible value of the random


X= { 0,1,2,3,4}

Prepared

JOANAH S. BAUTISTA
Secondary School Teacher I - FGNMHS

10
WEEK 2: COMPUTING PROBABILITY CORRESPONDING TO A GIVEN RANDOM
VARIABLE

Background Information

RANDOM VARIABLES

A random variable is a variable whose possible values are numerical outcomes of a


random experiment.

Example: Tossing a coin: we could get Heads or Tails.

In short:
𝑋 = {0, 1}
Note:
We could choose Heads = 100 and
Tails = 150 or other values if we want! It
is our choice

So:
• We have an experiment such as tossing coin.
• We give values to each event.
• The set of values is a Random Variable.

A random variable has a whole set of values, and it could take on any of those values,
randomly. We use a capital letter, like X or Y, to avoid confusion with the Algebra type of
variable. A Random Variable’s set of values is the sample space.
Example: Throw a die once
Random Variable X : The score shown on the top face.
X : 1, 2, 3, 4, 5 or 6
Sample space : {1, 2, 3, 4, 5, 6}

PROBABILITY
We can show the probability of any one value using this style:
𝑃(𝑣𝑎𝑙𝑢𝑒 𝑜𝑓 𝑋) = 𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑜𝑓 𝑡ℎ𝑎𝑡 𝑣𝑎𝑙𝑢𝑒

Example (continued): Throw a die once


𝑋 = {1, 2, 3, 4, 5, 6}
In this case they are all equally likely, so the probability of landing on any face when rolling a
die is 1/6
1 1
▪ 𝑃(1) = ▪ 𝑃(4) =
6 6
1 1
▪ 𝑃(2) = ▪ 𝑃(5) =
6 6
1 1
▪ 𝑃(3) = ▪ 𝑃(6) =
6 6

Note that the sum of the probabilities = 1.

11
Example:
How many heads (H) do you get when we toss three coins?
X = “The number of the Heads” is the random variable
In this case, there could be zero H (if all the coins land tails
up), 1H, 2H or 3H.

So the sample space = {0, 1, 2, 3}

But this time the outcomes are NOT all equally likely.

The three coins can land in eight possible ways:

X = “number of heads”

Looking at the table we can just see 1 case of


three heads, but 3 cases of 2H, 3 cases of 1H,
and 1 case of 0H. So:

1
• 𝑃(3) =
8

3
• 𝑃(2) =
8

3
• 𝑃(1) =
8

1
• 𝑃(0) =
8

Example: two dice are tossed.


The random variable is X = “The sum of the scores on the two dice”.

Let’s make a table of all the possible values.

There are 6 × 6 = 36 possible outcomes, and the


sample space (which is the sum of the scores on
the two dice) is {2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12}

12
Let’s count how often each value occurs, and work out the probabilities:
1
• 2 occurs just once, so 𝑃(2) =
36
2 1
• 3 occurs twice, so 𝑃(3) = or
36 18
3 1
• 4 occurs three times, so 𝑃(4) = or
36 12
4 1
• 5 occurs four times, so 𝑃(5) = or
36 9
5
• 6 occurs five times, so 𝑃(6) =
36
6 1
• 7 occurs six times, so 𝑃(7) = or
36 6
5
• 8 occurs five times, so 𝑃(8) =
36
4 1
• 9 occurs 4 times, so 𝑃(9) = or
36 9
3 1
• 10 occurs thrice, so 𝑃(10) = or
36 12
2 1
• 11 occurs twice, so 𝑃(11) = or
36 18
1
• 12 occurs just once, so 𝑃(12) =
36

THE PROBABILITY SCALE

Probabilities are given on a scale of 0 to 1, as decimals or as fractions; sometimes probabilities


are expressed as percentages using a scale of 0% to 100%, particularly on weather forecasts.

0 : This is the probability of something that is impossible.


1 : This is the probability that something is certain.
1
: This is the probability of something that is likely to happen as it is not to happen.
2

THE PROBABILITY OF A SINGLE EVENT

For equally likely events:


𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑤𝑎𝑦𝑠 𝑜𝑏𝑡𝑎𝑖𝑛𝑖𝑛𝑔 𝑜𝑢𝑡𝑐𝑜𝑚𝑒𝑠
𝑃(𝑎 𝑝𝑎𝑟𝑡𝑖𝑐𝑢𝑙𝑎𝑟 𝑜𝑢𝑡𝑐𝑜𝑚𝑒) =
𝑡𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑢𝑡𝑐𝑜𝑚𝑒𝑠

Example
1) A card is taken at random from a full pack of 52 playing cards. What is the probability
that the card taken is:
(a) red card?
(b) a queen?
(c) a red ace?
(d) the seven of hearts?

Solution
As each card is equally likely to be drawn from the pack there are 52 equally likely
outcomes.

(a) There are 26 red cards in the pack, so:


26 1
𝑃(𝑟𝑒𝑑) = =
52 2

13
(b) There are 4 queens in the pack, so:
4 1
𝑃(𝑞𝑢𝑒𝑒𝑛) = =
52 13
(c) There are 2 red aces in the pack, so:
2 1
𝑃(𝑟𝑒𝑑 𝑎𝑐𝑒) ==
52 26
(d) There is only 1 seven of hearts in the pack, so:
1
𝑃(𝑠𝑒𝑣𝑒𝑛 𝑜𝑓 ℎ𝑒𝑎𝑟𝑡𝑠) =
52
2) A packet of sweets contains 18 red sweets, 12 green sweets and 10 yellow sweets. A
sweet is taken at random from the packet. What is the probability that the sweet is:
(a) red?
(b) not green?
(c) green or yellow?

Solution
The total number of sweets in the packet is 40, so there are 40 equally likely outcomes
when one is taken at random.
(a) There are 18 red sweets, so:
18 9
𝑝(𝑟𝑒𝑑) = =
40 20
(b) There are 28 sweets that are not green in the packet, so:
28 7
𝑝 (𝑛𝑜𝑡 𝑔𝑟𝑒𝑒𝑛) = =
40 10
(c) There are 22 sweets that are green or yellow in the packet, so:
22 11
𝑝 (𝑔𝑟𝑒𝑒𝑛 𝑜𝑟 𝑦𝑒𝑙𝑙𝑜𝑤) = =
40 20

3) You roll a fair dice 120 times. How many times would you expect to obtain:
(a) a 6?
(b) an even score?
(c) a score of less than 5?

Solution:
1
(a) 𝑃(6) =
6
1
𝐸𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 6𝑠 = × 120 = 20
6
3 1
(b) 𝑃(𝑒𝑣𝑒𝑛 𝑛𝑢𝑚𝑏𝑒𝑟) = =
6 2
1
𝐸𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑒𝑣𝑒𝑛 𝑠𝑐𝑜𝑟𝑒𝑠 = 𝑥 120 = 60
2
4 2
(c) 𝑃(𝑠𝑐𝑜𝑟𝑒 𝑙𝑒𝑠𝑠 𝑡ℎ𝑎𝑛 5) = =
6 3
2
𝐸𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟𝑜𝑓 𝑠𝑐𝑜𝑟𝑒𝑠 𝑙𝑒𝑠𝑠 𝑡ℎ𝑎𝑛 5 = 𝑥 120 = 80
3

A RANGE OF VALUES
We could also calculate the probability that a random variable takes on a range of
values.

Example:
You roll two dice, what is the probability that the sum of the scores is 5, 6, 7, or 8?
In other words: What is 𝑃(5 ≤ 𝑋 ≤ 8)?

14
Solution:
𝑃(5 ≤ 𝑥 ≤ 8) = 𝑃(5) + 𝑃(6) + 𝑃(7) + 𝑃(8)
4+5+6+5
=
36
20
=
36
5
=
9

Expected Value (or Mean) of a Discrete Random Variable


For a discrete random variable, the expected value, usually denoted as µ or ∑(𝑋), is
calculated using:
µ = ∑(𝑋) = ∑ 𝑥𝑖 𝑓 (𝑥𝑖 )
The formula means that we multiply each value, 𝑥, by its corresponding probability,
f(x), and then add them altogether. It can be seen as an average value but weighted by the
likelihood of the value.
Example:
𝑥 0 1 2 3 4
1 1 1 1 1
𝑓(𝑥)
5 5 5 5 5
What is the expected value?

Answer
1 1 1 1 1
µ = ∑(𝑋) = ∑ 𝑥𝑓(𝑥) = 0 ( ) + 1 ( ) + 2 ( ) + 3 ( ) + 4 ( ) = 2
5 5 5 5 5

For this example, the expected value is equal to a possible value of X. This may not
always be the case. For example, if we flip a fair coin 9 times, how many heads should we
expect? We will explain how to find this later but we should expect 4.5 heads. The expected
value in this case is not a valid number of heads.

VARIANCE OF A DISCRETE RANDOM VARIABLE

The variance of a discrete random variable is given by:


2
σ² = Var(X) = ∑ 𝑥 2 𝑖 𝑓(𝑥𝑖 ) − [∑(𝑋)] = ∑ 𝑥 2 𝑖 𝑓(𝑥𝑖 ) − µ2
The formula means that first, we add the square of each value times its probability then
subtract the square of the mean. We will use this form of the formula in all of our examples.

STANDARD DEVIATION OF A DISCRETE RANDOM VARIABLE

The standard deviation of a random variable, X, is the square root of the variance.
σ = SD(X) = √𝑉𝑎𝑟(𝑋) = √σ²

Example
𝑥 0 1 2 3 4
1 1 1 1 1
𝑓(𝑥)
5 5 5 5 5

Find the variance and the standard deviation of X.

15
Solution:
σ = SD(X) = √𝑉𝑎𝑟(𝑋) = √σ²
1 1 1 1 1
𝑉𝑎𝑟(𝑋) = [02 ( ) + 12 ( ) + 22 ( ) + 32 ( ) + 42 ( )] − 22 = 6 − 4 = 2
5 5 5 5 5
𝑆𝐷(𝑋) = √2 = 1.4142

Learning Competencies with code


• Compute probabilities corresponding to a given random variable. (M11/12SP-IIIa-6)
• Illustrate the mean and variance of a discrete random variable. (M11/12SP-IIIb-2)
• Calculate the mean and the variance of a discrete random variable. (M11/12SP-IIIb-2)

GENERAL INSTRUCTION: Write on your answer on a separate paper


Activity 1: Solve for the probability of the following
1. A coin is tossed 125 times and head is obtained 64 times. If a coin is tossed at
random, what is the probability of getting a head?
2. A coin is tossed 200 times and tails have obtained 128 times. If a coin is tossed at
random, what is the probability of getting a tail?
3. In 60 throws of a dice, 6 is obtained 24 times. In a random throw of a dice, what is
the probability of getting 6?
4. In 100 throws of a dice, 5 is obtained 42 times. In a random throw of a dice, what is
the probability of getting 5?
5. In 150 throws of a dice, 2 is obtained 76 times. In a random throw of a dice, what is
the probability of getting 2?

Activity 2: Practice calculating and interpreting the mean and standard deviation of a
discrete random variable.

1. Organizers of a concert are limiting tickets sales to a maximum of 444 tickets per
customer. Let T be the number of tickets purchased by a random customer. Here is
the probability distribution of T:
T=# of tickets 1 2 3 4
P(T) 0.1 0.3 0.2 0.4
Calculate the expected value of T.

2. Choose the best interpretation of the mean (or expected value) that you found in the
previous question.
a. Most customers will purchase 3 tickets.
b. A group of 10 customers will purchase exactly 29 tickets.
c. If we look at a large number of customers, then each customer, on average, will
have purchased about 2.9 tickets.

3. Calculate the variance and the standard deviation of T.


Round your answer to three decimal places.

Reference
https://online.stat.psu.edu/stat500/lesson/3/3.2/3.2.1

16
ANSWER KEY

Activity 1
1. 64/125
2. 16/25
3. 2/5
4. 21/50
5. 38/75

Activity 2
1. 2.9 tickets
2. C
3. σ2 = 1.09 ; σ = 1.044

Prepared by

APRIL JOY S. MANIALUNG


Teacher II

17
WEEK 3: MEAN AND VARIANCE OF A DISCRETE RANDOM VARIABLE

Background Information
The mean or expected value of a probability distribution tells the value of a random
variable that we expect to get if the experiment is done repeatedly. In other words, if the
experiment or process is repeated long enough, there is a likelihood or chance that the
average of the outcomes will begin to approach the expected value or the mean value.

The standard deviation of a probability distribution indicates how far, on the average,
is an observed value of a random variable X from its mean. The smaller the standard deviation,
the more likely that an observed value of the random variable will be close to its mean.

Use these concepts to solve problems involving the mean and the variance of
probability distribution.

Example 1:

The following data show the probability distribution of the number of computers sold
daily in a computer shop during the past several months.

Number of Computers X Probability P(X)


0 0.10
1 0.20
2 0.45
3 0.15
4 0.05
5 0.05

Questions:

1. What is the probability that on a given day…


a. fewer than three computers will be sold?
b. at most, two computers will be sold?
c. at least, four computers will be sold?
d. exactly three computers will be sold?
Solution:

To find the probabilities…

a. To find the probability that fewer than three computers will be sold, use X = 0, 1, and 2.
𝑃(𝑋 < 3) = 𝑃(0) + 𝑃(1) + 𝑃(2)
= 0.10 + 0.20 + 0.45
𝑃(𝑋 < 3) = 0.75

18
b. To find the probability that, at most, two computers will be sold, use X = 0, 1, and 2.

𝑃(𝑋 ≤ 2) = 𝑃(0) + 𝑃(1) + 𝑃(2)


= 0.10 + 0.20 + 0.45
𝑃(𝑋 ≤ 2) = 0.75

c. To find the probability that, at least, four computers will be sold, use X = 4 and 5.
𝑃(𝑋 ≥ 4) = 𝑃(4) + 𝑃(5)
= 0.05 + 0.05
𝑃(𝑋 ≥ 4) = 0.10

d. To find the probability that exactly three computers will be sold, use X = 3.
𝑃(3) = 0.15

2. The mean of the probability distribution is computed as follows:

X P(X) X • P(X)
0 0.10 0.00
1 0.20 0.20
2 0.45 0.90
3 0.15 0.45
4 0.05 0.20
5 0.05 0.25

𝜇 = ∑[𝑋 • 𝑃(𝑋)]

𝜇 = 0.00 + 0.20 + 0.90 + 0.45 + 0.20 + 0.25

𝜇 = 2.00

The mean of the probability distribution is 2. This implies that in the long run,
we expect that the average number of computers that will be sold on a given day will
be two.

3. The standard deviation is computed as follows:

X P(X) X2 X2 • P(X)
0 0.10 0 0.00
1 0.20 1 0.20
2 0.45 4 1.80
3 0.15 9 1.35
4 0.05 16 0.80
5 0.05 25 1.25

19
𝜎 = √[𝑋2 • 𝑃(𝑋)] − 𝜇2

𝜎 = √[0.00 + 0.020 + 1.80 + 1.35 + 0.80 + 1.25] − 22

𝜎 = √5.40 − 4

𝜎 = √1.40

𝜎 = 1.18

The standard deviation of the probability distribution is 1.18. This implies that on the
average, the number of computers sold is 1.18 units from the mean 2.

Example 4:

In a convenient store, the number of tellers (X) busy with customers at 12:00 noon
varies from day to day. Past records indicate that the probability distribution of X is as
follows:

Number of Tellers X Probability P(X)


0 0.029
1 0.049
2 0.078
3 0.155
4 0.212
5 0.262
6 0.215

Questions:

1. What is the probability that exactly four tellers are busy at 12:00 noon?
2. What is the probability that, at least, two tellers are busy at 12:00 noon?
3. What is the probability that, fewer than five tellers are busy at 12:00 noon?
4. What is the probability that at least two, but fewer than five tellers are busy at
12:00 noon?
5. What is the mean of the probability distribution?
6. What is the standard deviation of the probability distributions?

Solution:

1. To find the probability that exactly four tellers are busy, use X = 4.
𝑃(𝑋 = 4) = 0.212

2. To find the probability that at least two tellers are busy, use X = 2, 3, 4, 5, and 6.
𝑃(𝑋 ≥ 2) = 𝑃(2) + 𝑃(3) + 𝑃(4) + 𝑃(5) + 𝑃(6)
= 0.078 + 0.155 + 0.212 + 0.262 + 0.215
𝑃(𝑋 ≥ 2) = 0.922

20
3. To find the probability that fewer than five tellers are busy,
use X = 0, 1, 2, 3, and 4.
𝑃(𝑋 < 5) = 𝑃(0) + 𝑃(1) + 𝑃(2) + 𝑃(3) + 𝑃(4)

= 0.029 + 0.049 + 0.078 + 0.155 + 0.212

𝑃(𝑋 < 5) = 0.523

4. To find the probability that at least two, but fewer than five tellers are busy,
use X = 2, 3, and 4.
𝑃(2 ≤ 𝑋 < 5) = 𝑃(2) + 𝑃(3) + 𝑃(4)
= 0.078 + 0.155 + 0.212
𝑃(2 ≤ 𝑋 < 5) = 0.445

5. The mean of the probability distribution is computed as follows:

X P(X) X • P(X)
0 0.029 0.000
1 0.049 0.049
2 0.078 0.156
3 0.155 0.465
4 0.212 0.848
5 0.262 1.310
6 0.215 1.290

𝜇 = ∑[𝑋 • 𝑃(𝑋)]
= 0.00 + 0.49 + 0.156 + 0.465 + 0.848 + 1.310 + 1.290
𝜇 = 4.118
The mean of the probability distribution is 4.118. This implies that in the long
run, we expect that the average number of tellers who are busy at 12:00 noon
will be 4.118 or 4.

6. The standard deviation of the probability distribution is computed as follows:

X P(X) X2 X2 • P(X)
0 0.029 0 0.000
1 0.049 1 0.049
2 0.078 4 0.312
3 0.155 9 1.395
4 0.212 16 3.392
5 0.262 25 6.550
6 0.215 36 7.740

21
𝜎 = √[𝑋2 •𝑃(𝑋)] − 𝜇2

𝜎 = √[0.00 + 0.049 + 0.312 + 1.395 + 3.392 + 6.550 + 7.740] − (4.118)2

= √19.438 − 16.958

= √2.48

𝜎 = 1.57

NORMAL PROBABILITY DISTRIBUTION

The probability distribution for a continuous random variable X is a bell-shaped curve.


The curve is a function of the random variable X and is sometimes called a probability density
function or probability distribution. The probability values under this curve enable us to make
probability statements about a random variable. Hence, it is usually used as a mathematical
model in inferential statistics.
The graphical form of the probability distribution for a continuous random variable X,
where X is the number of heads occurring, is a smooth bell-shaped curve. The graph is a
pictorial representation of discrete number. If we continue generating data for the random
variable X, we shall observe an emerging pattern.
Normal Probability Distribution is a probability distribution of continuous random
variables. It shows graphical representations of random variables obtained through
measurement like the height and weight of the students, the percentile ranks of the A&E
(Accreditation and Equivalency) Test result of the Alternative Learning System students, or
any data with infinite values.

It is used to describe the characteristics of populations and help us visualize the


inferences we make about the population. It also used to determine the probabilities and
percentile of the continuous random variables in the distribution.

For example, your grade in Mathematics is one of the scores in the distribution, you
can predict the location of that score in the distribution and interpret it with regards to the mean
and standard deviation.

Properties of the Normal Curve

The graphical representation of the normal distribution is popularly known as a normal


curve. The normal curve is described clearly by the following characteristics:

1. The normal curve is bell-shaped.


2. The curve is symmetrical about its center. This means that, if we draw a segment from
the peak of the curve down to the horizontal axis, the segment divides the normal curve
into two equal parts or areas.
3. The mean, median, and mode coincide at the center. This also means that in a normal
distribution, or a distribution described by a normal curve, the mean, median, and mode
are equal.

22
4. The width of the curve is determined by the standard deviation of the distribution.
5. The tails of the curve are plotted in both directions and flatten out indefinitely along the
horizontal axis. The tails are thus asymptotic to the baseline. A portion of the graph
that is asymptotic to a reference axis or another graph is called an asymptote, always
approaching another but never touching it.
6. The total area under a normal curve is 1. This means that the normal curve represents
the probability, or the proportion, or the percentage associated with specific sets of
measurement values.
To visualize and understand the properties/characteristics of a normal curve, please
refer to figure as shown below.

Graph of a NORMAL CURVE

A normally distributed random variable with a mean µ = 0 and standard deviation 𝜎 =


1 is called a standard normal variable. It is presented using standard normal distribution where
the center of the curve is zero and add one unit from the center to the right and subtract one
unit from the center to the left. This is shown below in the figure.

The shape of a normal curve is based on the two given parameters, the mean and the
standard deviation of the distribution. When comparing two distributions each described by
the normal curve, the following are the three situations based on the said parameters

a. When the means are not equal, but the standard deviations are equal. (µ1 ≠ µ2 ; 𝜎1 = 𝜎2 ),
the curves have a similar shape but centered at different points, as shown in the figure.

23
b. When the means are equal, but the standard deviations are not equal. (µ1 = µ2 ; 𝜎1 ≠ 𝜎2 ),
the curves are centered at the same point but they have different height and spreads. as
shown in the figure.

c. When the means are different and the standard deviations are also different (µ1 ≠ µ2 ; 𝜎1 ≠
𝜎2 ), the curves are centered at different points and vary in shapes, as shown in the figure.

Every normal curve (regardless of its mean or standard deviation) conforms to the
following "empirical rule"

The empirical rule is better known as 68% - 95% - 99.70% rule. This rule states that
the data in the distribution lies within one (1), two (2), and three (3) of the standard deviation
from the mean are approximately 68%, 95%, and 99.70%, respectively. Since the area of a
normal curve is equal to 1 or 100% as stated on its characteristics, there are only a few data
which is 0.30% falls outside the 3-standard deviation from the mean.

24
Example 1:

The scores of the Senior High School students in their Statistics and Probability quarterly
examination are normally distributed with a mean of 35 and a standard deviation of 5.

Questions:
a. What percent of the scores are between 30 to 40?
b. What scores fall within 95% of the distribution?

Solution:

Draw a standard normal curve and plot the mean at the center. Then, add the standard
deviation to the mean once and mark it to the right of the mean. Add twice the standard
deviation to the mean and put it to the right of the first sum. Then, add thrice the standard
deviation to the mean and mark it to the right of the second sum. Do the same to the left.
This time, instead of adding, subtract the standard deviation from the mean.

Answer:

a. The scores 30 to 40 falls within the first standard deviation from the mean.
Therefore, the scores that fall between 30 and 40 is approximately 68% of the
distribution.

b. Since 95% of the distribution lies within 2 standard deviations from the mean, then
the scores corresponding to this area of the distribution are scores from 25 up to
45.

Example 2:

The district nurse of “Ospital ning Angeles” needs to measure the BMI (Body Mass
Index) of the Alternative Learning System students. She found out that the heights of male
students are normally distributed with a mean of 160 cm and a standard deviation of 7 cm.
Find the percentage of male students whose height is within 153 cm to 174 cm.

25
The mean of this problem is 160, it is subtracted by 7 to the left (160 − 7 = 153; 153 −
7 = 146; 146 − 7 = 139) and add 7 to the right. (e.g. 160 + 7 = 167; 167 + 7 = 174 + 7 =
181).

153 cm falls at 1 standard deviation from the mean to the left and the height of 174 cm
falls at 2 standard deviations from the mean to the right.

Therefore, it covers the whole 68% and 13.5%. of the distribution and the sum of it is 81.5%

ANSWER: 81.5% of the male students have a height between 153 cm to 174 cm.

Learning Competencies with code


➢ Interpret the mean and the variance of a discrete random variable. (M11/12SP-IIIb-3)
➢ Solve problems involving mean and variance of probability distributions. (M11/12SP-IIIb-4)
➢ Illustrate a normal random variable and its characteristics (M11/12SP-IIIc-1)

GENERAL INSTRUCTION: Write your answer on a separate paper


Activity 1: Solve the following problem

The probabilities that a market inspector will discover violations of the public health code in a
public market are given in the following table.

Number of Violations X Probability P(X)


0 0.41
1 0.22
2 0.17
3 0.13
4 0.05
5 0.02

a. What is the probability that a market inspector will discover, at least, two violations
of the public health code?
b. What is the probability that a market inspector will discover, at most, three
violations of the public health code?

26
c. What is the probability that a market inspector will discover at least two, but fewer
than four violations of the public health code?
d. What is the mean of the probability distribution?
e. What are the variance and the standard deviation of the probability distribution?

Activity 2: Solve the following problem.

The following are the number of cars sold per day at ABC Motor Corporation during
the past 500 working days.

Number of Cars Sold Per Day (X) Frequency of Occurrence


0 40
1 100
2 142
3 66
4 36
5 30
6 26
7 20
8 16
9 14
10 8
11 2
Total 500

a. Construct the probability distribution of the random variable X.


b. What is the probability that exactly four cars will be sold at a given day?
c. What is the probability that, at most, four cars will be sold at a given day?
d. What is the probability that at least six, but at most ten cars will be sold at a given
day?
e. What is the probability that fewer than eight cars, but greater than three cars will be
sold at a given day?
f. What is the expected number of cars that will be sold at a given day? Explain the
meaning of this value.
g. What is the standard deviation of the number of cars that will be sold at a given day?

Activity 3: True or False. Write the word TRUE if the statement is correct and FALSE if the
statement is incorrect

_______1. The total area under the normal curve is 1.


_______2. Normal probability distribution is used to determine the probabilities of a discrete
random variable.
_______3. The mean, median and mode in a normal distribution have the same value.

27
_______4. The distance of the number in the horizontal line or the x-axis should always
proportional.
_______5 The normal probability distribution has a mean µ = 1 and standard deviation ơ = 0.

Activity 4

A. Use Empirical rule to complete the following table.


B. Write on the respective column the range or interval of the scores based on the given
parameters.
C. Illustrate the distribution through a diagram.
Standard
Mean 68% 95% 99.7%
Deviation
Ex 50 3 47 to 53 44 to 56 41 to 59
1 135 28
2 87 5.5
3 213 15
4 567 20
5 785 29

Activity 5: Read and analyze the following problem to answer the question.

The government launched Enhanced Community Quarantine on March 15, 2020, as a


preventive measure to fight the spread of COVID-19 in the country. Many of us experienced
difficulties in getting our daily needs like food or getting money to pay for our bills and rentals.

Our government gave financial support and food, but it is not enough for the people.
Fortunately, we are blessed because the spirit of love and generosity is still present in the
heart of our fellow countrymen. They donated money, food, time, and other essential things.

Suppose you and your friends conducted a donation-drive in your barangay. Each of
the donors gave a certain amount with an average of Php 320.00 and a standard deviation of
Php 60.00. Assuming, it is normally distributed,

1. What percentage of the distribution that the amount falls between Php. 260.00 and
Php 380.00?
2. How much of the donations fall in 95% of the distributions?
3. How much of the donations fall in 99.70% of the distributions?

References
Department of Education. Statistics and Probability: Teacher’s Guide. pp. 117-181.

Department of Education. Statistics and Probability: Learner’s Material. pp. 119-131.


Ocampo, Jose M., and Wilmer G. Marquez. 2016. Conceptual Math and Beyond: Statistics
and Probability. Quezon City: Brilliant Creations Publishing Inc.

Belecina, Rene R., Elisa S. Baccay, and Efren B. Mateo. 2016. Statistics and Probability, 1st
ed. Manila: Rex Bookstore Inc.

28
Answer Key
Activity 1 Activity 2 Activity 3

a. 𝑃(𝑋 ≥ 2) = 0.37 a. Tabulate 1. True

b. 𝑃(𝑋 ≤ 3) = 0.93 b. 𝑃(𝑋 = 4) = 0.072 2. False

c. 𝑃(2 ≤ 𝑋 ≤ 4) = 0.30 c. 𝑃(𝑋 ≤ 4) = 0.768 3. True

d. µ = 1.25 d. 𝑃(6 ≤ 𝑋 ≤ 10) = 0.168 4. True

e. 2 = 2.12 e. 𝑃(3 < 𝑋 < 8) = 0.224 5. False

 = 1.456 f. µ = 3.056

g. 2 = 12.352

 = 3.515

Activity 4 Activity 5

1. 68%
2. 200 to 440
3. 140 to 500

Prepared by:

ERVIN A. PEREZ
Malabanias Integrated School
SHS Department

29
WEEK 4: IDENTIFYING REGIONS UNDER NORMAL CURVE CORRESPONDS TO
DIFFERENT STANDARD NORMAL VALUES

Background Information

We have learned that the total area under the curve is equal to 1. Since the standard
normal distribution is defined as normally distributed random variable having a mean of zero
(µ = 0) and standard deviation of one (𝜎 = 1), the fifty (50) percent of the region under the
curve is below the mean and the other 50% is above the mean. Figure 1, as shown below,
illustrates the standard normal curve.

Figure 1: Standard Normal Curve

You can easily identify the area of the regions under the normal curve by using the
Table of Areas under the Normal Curve which is also known as z-table (see attachment at
the last page of this module). This table gives the area of any value of z from -3.99 to 3.99.
The value from this table will describe the area of the specific region of the curve to the left of
the given z-value.

How to use the Table of Areas under the Normal Curve?

Example: Find the area to the left of −1.69

Solution: First, split the given z-value into hundredths. We can find the whole
number and the tenths digit (-1.6) at the left side of the table while the hundredths (0.09)
located at the uppermost of the table. Then the intersection of these numbers will be the
area of the normal curve to the left of the z-value.

Answer: The area to the left of 𝑧 = −1.69 is 0.0455

30
STEPS IN FINDING THE AREA THAT CORRESPONDS TO Z –VALUE

1. Draw/sketch a normal curve and locate the given z-value on the normal curve

2. Shade the region of the curve according to the condition of z-value whether it is below,
above, or between.

3. Use the table of the area under the normal curve to find the corresponding area.

4. Choose the appropriate operation based on step 2 and 3

4.1. When the z-value is to the left or any related terms (e.g. below, less than) just write the
value we obtained in step 3

4.2. When the z-value is to the right or any related terms (e.g. above, greater than), subtract
1 by the obtained value in step 3

4.3. When the shaded region is in between of the two z-value, subtract the biggest by the
smallest value obtained in step 3

5. Label the shaded region and draw a conclusion

Example 1: Find the area that corresponds below 𝑧 = −1.35

Solution:

Step 1. Draw/sketch a normal curve and locate the given z-value.

Step 2: Shade the region of the curve according to the condition of z-value whether it is
below, above, or between.

31
Step 3. Use the table of the area under the normal curve to find the corresponding area

The intersection between -1.3 and 0.05 is 0.0885

Step 4. Choose the appropriate operation based on step 2 and 3

4.1 When the z-value is to the left or any related terms (e.g. below, less than) just write
the value we obtained in step 3 Since the shaded region of the curve is to the left of the
𝑧 = −1.35 and the intersection between -1.3 and 0.05 is 0.0885. Therefore, the area of
the shaded region is 0.0885

Step 5. Label the shaded region and draw a conclusion.

Therefore, the area that corresponds below z = -1.35 is 0.0885

32
0.9115
0.9115

0.9115

33
CONVERTING A NORMAL RANDOM VARIABLE TO A STANDARD NORMAL
VARIABLE AND VICE-VERSA

In solving real-life problems involving the normal curve, it is particularly important to


convert a random variable x to a standard normal variable or z-score. This procedure is
known as “standardizing” or “standardization” of a random variable, where a
standardized value is called a z-score. A z-score is a measure of the number of standard
deviations (𝝈) a particular data value is away from the mean (𝝁).

Suppose your score on a test in Probability and Statistics is 39 and the scores are
normally distributed with a mean of 33 and standard deviation of 3, then your score is exactly
2 standard deviations above the mean. If you scored 30, then it is exactly 1 standard deviation
below the mean. All values that are above the mean have positive z-scores and all values that
are below the mean have negative z-scores. If you obtained a z-score of -3, this means that
your score is 3 standard deviations below the mean, that is, 33 – 3(3) = 33 – 9 = 24.

Given any value x from a normal distribution with mean μ and standard deviation σ, to
convert x to a z-score (standard normal score), you need to;

(a) Subtract the mean μ from x.

(b) Divide this quantity, 𝒙 – 𝝁, by the standard deviation σ.

The formula used in converting a random variable x to a standard normal variable z


is:
𝑥−𝜇
𝑧 =
𝜎
where: 𝑧 – standard normal score or z – score
𝑥 – any data value in a normal distribution
𝜇 - mean
𝜎 - standard deviation

34
Example 1.
Suppose IQ scores are normally distributed with a mean of 100 and standard
deviation of 10. If your IQ is 85, what is your z-score? (Round off your answer to the nearest
hundredths)
Solution:

The z-score can be computed using the formula;


𝑥−𝜇
𝑧 =
𝜎

85 − 100 15
𝑧 = = − = −𝟏. 𝟓𝟎
10 10

Example 2.
On a nationwide placement test that is normally distributed, the mean was 125 and
standard deviation was 15. If you scored 149, what was your z-score? (Round off your
answer to the nearest hundredths)

Solution:

Using the formula:


𝑥−𝜇
𝑧 =
𝜎

149 − 125
𝑧 =
15

24
= = 𝟏. 𝟔𝟎
15

Example 3.
The heights of teachers in Sta. Catalina National High School are normally
distributed with a mean of 150 cm and standard deviation of 15 cm. The height of Sir Victor
has a z-score of 3.25. What is the actual height of Sir Victor? (Round off your answer to the
nearest hundredths)
Solution:
In this example, z-score is given while the normal random variable is unknown. To
solve for the normal random variable x, multiply the z-score (z) by the standard deviation σ,
then add the mean μ. Therefore;
𝑥 = zσ + 𝜇

𝑥 = 3.25(15 cm) + 150𝑐𝑚

𝑥 = 48.75 cm + 150 𝑐𝑚

𝒙 = 𝟏𝟗𝟖. 𝟕𝟓 𝐜𝐦

35
Example 4.
The time it takes for a cell to divide is normally distributed with an average of 60
minutes and standard deviation of 5 minutes. How long will it take for a given cell to divide if
its “mitosis” has a z-score of -1.35?
Solution:
𝑥 = zσ + 𝜇
𝑥 = −1.35(5 min) + 60𝑚𝑖𝑛
𝑥 = −6.75 min + 60 𝑚𝑖𝑛
𝑥 = 𝟓𝟑. 𝟐𝟓 𝐦𝐢𝐧𝐮𝐭𝐞𝐬

Example 5.
Suppose you have the population values 40 and 70 and that their corresponding z –
scores are -1 and 2, respectively.
1. Is it possible to determine the population’s mean and standard deviation?
2. If so, what are these values?

Solution:
To begin, let us first identify the given. Let
𝑥1 = 40 𝑥2 = 70
𝑧1 = −2 𝑧2 = 1
Use these given and substitute to the formula
𝑥1 = 𝑧1 σ + 𝜇 𝑥2 = 𝑧2 σ + 𝜇
40 = −2σ + 𝜇 70 = 1σ + 𝜇

Note: In this example, we do not have 𝜎1 and 𝜎2 , and 𝜇1 and 𝜇2 since we are
describing a single population. Thus, 𝜎1 = 𝜎2 = 𝜎 and 𝜇1 = 𝜇 2 = 𝜇. Furthermore, we need to
review our algebra specifically on systems of linear equation. So,

40 = −2σ + 𝜇 equation 1
70 = 1σ + 𝜇 equation 2
Subtract equation 1 from equation 2, we have

40 = −2σ + 𝜇 equation 1
70 = 1σ + 𝜇 equation 2
30 = 1σ − (−2 σ)
30 =3 σ the difference
30 3𝜎
=
3 3

10= σ
The standard deviation of the population is 10. To obtain the mean, substitute it to either
equation 1 or 2.
40 = −2σ + 𝜇 ⟹ 40 = −2(10) + 𝜇 ⟹ 60 = 𝜇
70 = 1σ + 𝜇 ⟹ 70 = 1(10) + 𝜇 ⟹ 60 = 𝜇

Both agrees that the mean of the population is 60.


Therefore,
1. It is possible to determine the population’s mean and standard deviation given only
the raw score (x) and its corresponding standard score (z).
2. The mean and standard deviation of the population is 60 and 10,
respectively.

36
COMPUTING PROBABILITIES AND PERCENTILES USING THE STANDARD NORMAL
TABLE

For any set of measurements (arranged in ascending or descending order), a


percentile is a measure of relative standing. It is a descriptive measure of the relationship of
a measurement to the rest of the data.
For example, in a test in Algebra, you got a score of 82 and you want to know how
fared in comparison with your classmates. If your teacher tells you that you scored at 90 th
percentile, it means that 90% of the grades were lower than yours and 10% were higher. If
the relative frequency histogram were used to show the scores, the 90 th percentile would be
located at a point such that 90% of the total area under the relative frequency histogram lies
below the 90th percentile and 10% lies above.
In a previous lesson, we learned how to compute the z-scores based on means and
standard deviations for both sample and population. The resulting z – scores allowed us to
read the corresponding areas (or probabilities) using the standard normal table.

Suppose that we know the probability this time. What is the corresponding z-score?
For example, where is 95% under the normal curve?

There are three important things to remember when we are given probabilities and
we want to know their corresponding z-scores.

• First, a probability value corresponds to an area under the normal curve.


• Second, in the table of Areas Under the Normal Curve, the numbers in the
extreme left across the top are z-scores, which are the distances along the
horizontal scale. The number in the body of the table are areas or
probabilities.
• Third, the z-scores to the left of the mean are negative values.

Thus, when we are given the area and we wish to find the corresponding z-value, we locate
the given area at the body of the table. If the exact area is not available, we take the nearest
area. Then, we look up the corresponding z-value in the Table of Areas Under the Normal
Curve or z- Table.

A normal distribution curve can be used as a probability distribution curve for normally
distributed variables. The area under the standard normal distribution curve can also be
thought of as a probability. That is, if it's possible to select any 𝑧 value at random, the
probability of choosing one, say, below 1.45 would be the same as the area under the curve
at the left of 1.45. In this case, the area is 0.9265. Therefore, the probability of randomly
selecting a 𝑧 value below of 1.45 is 0.9265 or 92.65%. The problems involving probabilities
and percentiles are solved in the same manner as finding the areas under a normal curve.

In finding probabilities, the following notations will be used:

(a) 𝑃(𝑍 < 𝑧) – probability at the left of 𝑧


(b) 𝑃(𝑍 > 𝑧) = 1 − 𝑃(𝑍 < 𝑧) – probability at the right of 𝑧
(c) 𝑃(𝑎 < 𝑧 < 𝑏) - the probability of 𝑧 that is in between two other 𝑧 values, say 𝑎 and 𝑏
(d) 𝑃(𝑧 < 𝑎) ∪ 𝑃(𝑧 > 𝑏) – the probability of 𝑧 is in the opposite direction of two values,
say 𝑎 and 𝑏
(e) 𝑃(𝑋 < 𝑥) – probability at the left of a normal random variable 𝑥

37
(f) 𝑃(𝑋 > 𝑥) – probability at the right of a normal random variable 𝑥
(g) 𝑃(𝑎 < 𝑋 < 𝑏) – the probability of a normal random variable 𝑋 that is in between two
other normal random variables, say 𝑎 and 𝑏.
(h) 𝑃(𝑋 < 𝑎) ∪ 𝑃(𝑋 > 𝑏 ) – the probability of 𝑋 is in the opposite direction of two values,
say 𝑎 and 𝑏

Example 1.
Find the probabilities for each of the following.
(a) 𝑃(𝑍 < 1.32)
(b) 𝑃(𝑍 < −1.05)
(c) 𝑃(−0.75 < 𝑍 < 1.56)
(d) 𝑃(𝑍 > −0.88)

Solution:
(a) The probability 𝑃(𝑍 < 1.32) means to find
the area under the normal curve at the left of
𝑧 = 1.32. From the table, find the intersection on
row 1.3 and column 0.02. Thus, the probability
is 𝟎. 𝟗𝟎𝟔𝟔 or 𝟗𝟎. 𝟔𝟔%.

(b) The probability 𝑃(𝑍 < −1.05) is the same as


the area under the normal curve at the left of 𝑧 =
−1.05. From the table, the area is 0.1469. Thus,
the probability is 𝟎. 𝟏𝟒𝟔𝟗 𝒐𝒓 𝟏𝟒. 𝟔𝟗%.

(c) The probability 𝑃(−0.75 < 𝑍 < 1.56) is the


same as the area under the normal curve between
𝑧 values -0.75 and 1.56. To get the area, subtract
the area of 𝒛 = −𝟎. 𝟕𝟓 from the area of 𝒛 = 𝟏. 𝟓𝟔.
Thus,𝑃(−0.75 < 𝑍 < 1.56) = 𝑃(𝑍 < 1.56) −
𝑃(𝑍 < −0.75) = 0.9406 − 0.2266 =
𝟎. 𝟕𝟏𝟒𝟎 𝒐𝒓 𝟕𝟏. 𝟒𝟎%.

(d) The probability 𝑃(𝑍 > −0.88) is the same as


the area under the normal curve at the right of
𝑧 = −0.88. To get the area, subtract the area
of 𝑧 = −0.88 from 1. Thus
, 𝑃(𝑍 > −0.88) = 1 − 𝑃(𝑍 < −0.88) =
1 – 0.1894 = 𝟎. 𝟖𝟏𝟎𝟔 𝑜𝑟 𝟖𝟏. 𝟎𝟔%.

Example 2.
Let 𝑋 be a normal random variable with mean 𝜇 = 15 and standard deviation 𝜎 = 3. Find the
probabilities of the following:
(a) (𝑋 < 19)
(b) (10 < 𝑋 < 19)
Solution:

38
(a) To find the probability, convert the normal random variable 19 to a z-score.
𝑥−𝜇 19−15
Thus, (𝑋 < 19) = 𝑃 (𝑍 < ) = 𝑃 (𝑍 < )
𝜎 3
4
= (𝑍 < ) = 𝑃(𝑍 < 1.33).
3

From the table, the probability at the left of 𝒛 = 𝟏. 𝟑𝟑 is equal to 𝟎. 𝟗𝟎𝟖𝟐 𝒐𝒓 𝟗𝟎. 𝟖𝟐%.
(b) To find the probability, convert the normal random variables 10 and 19 to z values.
10−15 19−15
Thus, 𝑃(10 < 𝑋 < 19) = 𝑃 ( <𝑍< )
3 3

= (−1.67 < 𝑍 < 1.33)

= (𝑍 < 1.33) − (𝑍 < −1.67).

From the table, we have 0.9082 – 0.0475 = 𝟎. 𝟖𝟔𝟎𝟕 𝒐𝒓 𝟖𝟔. 𝟎𝟕%. These probabilities can also
be applied to real-life problems.

Learning Competencies with code


➢ Identify regions under the normal curve corresponding to different standard normal
values. (M11/12SP-IIIc-3)
➢ Convert a normal random variable to a standard normal variable and vice versa
(M11/12SP-IIIc-4)
➢ Compute probabilities and percentiles using the standard normal table. (M11/12SP-IIIc-d-1)

Activity 1.
Identify the area under the normal curve of the following standard normal values
using the z table (refer to the last page of this module).
1. 𝑧= −3.25 _____________ 6. 𝑧 = −1.33 _________
2. 𝑧= 1.25 _____________ 7. 𝑧 = −2.5 _________
3. 𝑧= 0.56 _____________ 8. 𝑧 = 0.29 _________
4. 𝑧= 3 _____________ 9. 𝑧 = −0.13 _________
5. 𝑧= 1.96 _____________ 10. 𝑧 = −1.96 _________

Activity 2.
Find the area or region that corresponds to each of the following z values. Sketch
and label each curve.
1. to the right of 𝑧 = 2.18
2. below 𝑧 = −1.40

3. above 𝑧 = 0.57

4. between 𝑧 = −2.34 and 𝑧 = 1.57


5. between 𝑧 = 0.32 and 𝑧 = 2.42

39
Activity 3.
Supply the needed information by solving the missing value.
1. 𝑥 = 25, 𝜇 = 40, σ = 6, 𝐳 = ?
2. 𝑧 = −2.25, 𝜇 = 218, σ = 36, 𝐱 = ?
3. 𝑥 = 39, 𝜇 = 30, σ = 3, 𝐳 = ?
4. 𝑥 = 125, 𝜇 = 105, σ = 8, 𝐳 = ?
5. 𝑧 = 1.11, 𝜇 = 145, σ = 20, 𝐱 = ?

Activity 4.
Solve the following problems.
1. The mean number of hours a Filipino worker spends on the computer is 3.1 hours per
workday. Assume the standard deviation is 0.5 hour and is normally distributed, how long
does a worker spend on the computer if his z-score is 1.2?
2. Each month, a Filipino household generates an average of 28 pounds of newspaper for
garbage or recycling. Assume the standard deviation is 2 pounds. Determine the z-score of
a household that generates 22 pounds of newspaper.
Activity 5.
A. Use the Table of Standard Normal distribution to determine corresponding
probabilities of the following:
1. 𝑃(𝑍 < −1.45)
2. 𝑃(0.5 < 𝑍 < 2.33)
3. 𝑃(𝑍 > 1.78)

B. 𝑋 is a normally distributed random variable with a mean of 60 and a standard


deviation of 8. Find the probabilities indicated by using the table.
4. 𝑃(𝑋 < 52)
5. 𝑃(48 < 𝑋 < 64)
6. 𝑃(𝑋 > 57)

References

Department of Education. Statistics and Probability-PIVOT: Learner’s Material. pp. 132-


168.
Belecina, Rene R., Elisa S. Baccay, and Efren B. Mateo. 2016. Statistics and Probability, 1st
ed. Manila: Rex Bookstore Inc.

40
Answer Key
Activity 1
1. 0.0006 6. 0.0918
2. 0.8944 7. 0.0062
3. 0.7123 8. 0.6141
4. 0.9987 9. 0.4483
5. 0.9750 10. 0.0250

Activity 2

1. to the right of z = 2.18 4. between z = -2.34 and z = 1.57

0.0146 0.9322

2. below z= -1.40

5. between z = 0.32 and z = 2.42


0.0808

0.3667

3. above z= 0.57

0.2843

`Activity 3 Activity 4 Activity 5


1. -2.5 1. 3.7 hrs A.
2. 137 2. -3
1. 0.0735
3. 3
2. 0.2986
4. 2.5
3. 0.0375
5. 167.2
B.
4. 0.1587
5. 0.6247
6. 0.6480

41
STANDARD NORMAL TABLE: Table Values Represent AREA to the LEFT of Z-score

42
Prepared by:

JELENE V. BASILIO
Sto. Domingo Integrated School
SHS Department

43
WEEK 5: RANDOM SAMPLING

Background Information

Random Sampling is a sampling method of choosing representatives from the


population wherein every sample has an equal chance of being selected. Accurate data can
be collected using random sampling techniques.

A population includes all of its elements from a set of data. The size of the
population is the number of observations in the population. For example, if ABS-CBN
network has 11,000 employees having the required blood type in a certain study, then we
have a population size of 11,000.

Sample consists of one or more data drawn from the population. It is a subset, or an
incomplete set taken from a population of objects or observations. Taking samples instead of
the population is less time-consuming and cost-effective. Although sampling has
advantages, it can also be a source of bias and inaccuracy

Four Types of Random Sampling


1. Simple Random Sampling is the basic
sampling technique where we select a
group of subjects (a sample) for study
from a larger group (a population). Each
individual is chosen entirely by chance
and each member of the population has
an equal chance of being included in the
sample. Every possible sample of a given
size has the same chance of selection.

2. Systematic Random Sampling involves


choosing your sample based on a regular
interval, rather than a fully random selection.
It can also be used when you don’t have a
complete list of the population.

44
3. Stratified Random Sampling is used when
you want to ensure that specific charac-
teristics are proportionally represented in the
sample. You split your population into strata
(for example, divided by gender or race), and
then randomly select from each of these
subgroups.
4. Cluster Random Sampling is used when
you are unable to sample from the entire
population. You divide the sample into clus-
ters that approximately reflect the whole
population, and then choose your sample
from a random selection of these clusters.

PARAMETER AND STATISTICS

Parameter
A parameter is a measure that is used to describe the population while statistic is a
measure that is used to describe the sample. To understand more, let us discuss the two
measures. Population mean (µ), population variance (𝜎2) and population standard deviation
(𝜎) are what we called parameters.

Population Mean
The mean is the sum of the data divided by the number of data. The mean is used to
describe where the set of data tends to concentrate at a certain point.
Below are grades in Statistics of Grade 11 students during the third quarter.

94 85 88 79 78

75 89 91 84 77

Let us compute the population mean, population variance, and population standard
deviation.

Grades in Statistics of Grade 11 Students during the Third Quarter


Student’s number Grade (X)
1 94
2 85
3 88
4 79
5 78
6 75
7 89
8 91
9 84
10 77
N = 10 ∑X = 840

45
ΣX
To compute the population mean (𝜇), we can use this equation 𝜇 = whereas ∑𝑋 is
𝑁
the summation of all values of grades (x) and N is the total number of population therefore,
840
𝜇= = 84 the computer population mean (𝜇) is 84.
10

POPULATION VARIANCE AND POPULATION STANDARD DEVIATION

Variance and standard deviation determine how to spread or to scatter each data on
the set from the mean.
To compute for the population variance, we apply the formula:
∑(𝑋 − 𝜇)2
𝜎2 =
𝑁
To compute for the population standard deviation, we use the formula:

∑(𝑋 − 𝜇)2
𝜎= √
𝑁

whereas:
𝑿 is the given data,
µ is the population mean
𝑵 is the population size.

Consider the data given above, to solve for the population variance and population
standard deviation, we have this table:

Student number Grade (X) 𝑿−𝝁 (𝑿 − 𝝁)𝟐


1 94 94 − 84 = 10 102 = 100
2 85 85 − 84 = 1 12 = 1
3 88 88 − 84 = 4 42 = 16
4 79 79 − 84 = – 5 (– 5)2 = 25
5 78 78 − 84 = – 6 (– 6)2 = 36
6 75 75 − 84 = – 9 (– 9)2 = 81
7 89 89 − 84 = 5 52 = 25
8 91 91 − 84 = 7 72 = 49
9 84 84 − 84 = 0 02 = 0
10 77 77 − 84 = 7 72 = 49
𝑁 = 10 ∑𝑋 = 840 ∑(𝑋 − µ)2 = 382

46
To solve for the population variance and population standard deviation we have to
use these equations:

Population Mean Population Variance Population Standard Deviation

ΣX ∑(𝑋 − 𝜇)2 ∑(𝑋 − 𝜇)2


𝜇= 𝜎2 = 𝜎= √
𝑁 𝑁 𝑁

Solution:
ΣX 840
Population mean: 𝜇 = = = 84
𝑁 10
∑(𝑋− 𝜇)2 382
Population variance: 𝜎2 = = = 38.2
𝑁 10
∑(𝑋− 𝜇)2 382
Population Standard Deviation: 𝜎 = √ = √ = √38.2 ≈ 6.18
𝑁 10

STATISTIC

Statistic or sample statistic is any quantity computed from values in a sample which
is considered for a statistical purpose. Sample mean (𝑥̅ ), sample variance (s2) and sample
standard deviation (s) are what we call statistic.

From the previous data of the population, suppose that we randomly select only 7 data
(numbers with check) out of the total 10 data in the population.

✓ 94 85 ✓ 88 ✓ 79 78
75 ✓ 89 ✓ 91 ✓ 84 ✓ 77

Compute the sample mean, sample variance, and sample standard deviation. Here is the
result:

SAMPLE MEAN

The sample mean is the average of all the data of the samples. The symbol 𝑥̅ , (read
as “x bar”) is used to represent the sample mean. To compute for the sample mean 𝑥̅ we can
ΣX
use this equation: 𝑥̅ =
𝑛
Where: 𝑥̅ is the sample mean
∑𝑥 is the summation of x (sum of the measures)

𝑛 is number of elements in the sample

47
Grades in Statistics of Grade 11 Students during the Third Quarter
Population Sample
Student’s number
(Student’s grade) (Student’s grade)
1 94 94
2 85
3 88 88
4 79 79
5 78
6 75
7 89 89
8 91 91
9 84 84
10 77 77
N = 10; n = 7 ∑X = 840 ∑X = 602

To get the sample mean:


ΣX 602
We substitute to the formula 𝑥̅ = = = 86
𝑛 7
Therefore, the computed sample mean 𝑥̅ is 86.

SAMPLE VARIANCE AND SAMPLE STANDARD DEVIATION

Sample variance is the computed variance of the elements of the sample. s2 is used
to represent sample variance.

Sample standard deviation is the computed standard deviation of the elements of


the sample. 𝒔 is used to represent sample standard deviation.

To compute for the sample variance, we apply the formula:


2
∑(𝑥 − 𝑥̅ )2
𝑠 =
𝑛−1
To compute for the sample standard deviation, we use the formula:
∑(𝑥 − 𝑥̅ )2
𝑠= √
𝑛−1
Where: x is the given data
𝑥̅ is the sample mean
𝑛 is number of elements in the sample

48
To apply this here’s another example using the previous example.

Student’s Student’s grade


𝑥 − 𝑥̅ (𝑥 − 𝑥̅ )2
number Population Sample
1 94 94 94 − 86 = 8 82 = 64
2 85
3 88 88 88 − 86 = 2 22 = 4
4 79 79 79 − 86 = – 7 (– 7)2 = 49
5 78
6 75
7 89 89 89 − 86 = 3 32 = 9
8 91 91 91 − 86 = 5 52 = 25
9 84 84 84 − 86 = – 2 (– 2)2 = 4
10 77 77 77 − 86 = – 9 (– 9)2 = 81
∑𝑋 = 840 ∑𝑋 = 602 ∑(𝑥 − 𝑥̅ )2 = 236

To solve for the sample variance and sample standard deviation we have to use
these equations:
Sample Mean Sample Variance Sample Standard Deviation
ΣX ∑(𝑥 − 𝑥̅ )2
𝑥̅ = 𝑠2 = ∑(𝑥 − 𝑥̅ )2
𝑛 𝑛−1 𝑠= √
𝑛−1

Solution:
ΣX 602
Sample mean: 𝑥̅ = = = 86
𝑛 7
2 ∑(𝑥− 𝑥̅ )2 236 236
Sample Variance: 𝑠 = = = ≈ 39.33
𝑛−1 7−1 6
∑(𝑥− 𝑥̅ )2 236
Sample Standard Deviation: 𝑠 = √ =√ = √39.33 = 6.27
𝑛−1 7−1

Remember that parameters are for population while statistics are for sample.

IDENTIFYING SAMPLING DISTRIBUTION OF STATISTICS

Sampling distribution of the sample means is a frequency distribution using the


computed sample mean from all the possible random samples of a particular sample size
taken from the given population.

Steps to follow in making a sampling distribution of the sample mean:


1. Determine the number of sets of all possible random samples that can be drawn from
the given population by using the formula, .𝑁 𝐶𝑛 , where N is the population size and n
is the sample size.

49
Example:
The given population is 1, 2, 3, 4, 5 and sample size of 3, using the formula .𝑁 𝐶𝑛
𝑁! 5! 5! 5∙4∙3∙2∙1 20
.𝑁 𝐶𝑛 = .5 𝐶3 = [𝑛!(𝑁−𝑛)!]
= = = (3∙2∙1)(2∙1∙) = = 10
3!(5−3)! 3!2! 2
2. List all the possible random samples and solve for the sample mean of each set of
samples.
Sample Mean
1,2,3 2
1,2,4 2.33
1,2,5 2.67
1,3,4 2.67
1,3,5 3
1,4,5 3.33
2,3,4 3
2,3,5 3.33
2,4,5 3.67
3,4,5 4

3. Construct a frequency and probability distribution table of the sample means indicating
its number of occurrence or the frequency and probability.
Probability P(x)
Sample Means Frequency 𝑭𝒓𝒆𝒒𝒖𝒆𝒏𝒄𝒚
𝑵
1
2 1 = 0.10
10
1
2.33 1 = 0.10
10
2
2.67 2 = 0.20
10
2
3 2 = 0.20
10
2
3.33 2 = 0.20
10
1
3.67 1 = 0.10
10
1
4 1 = 0.10
10

Learning Competencies with code:


• Illustrate random sampling. (M11/12SP-IIId-2)
• Distinguish between parameter and statistic.(M11/12SP-IIId-3)
• Identify sampling distributions of statistics (sample mean). (M11/12SP-IIId-4)

GENERAL INSTRUCTION: Write your answer on a separate paper


Practice A
Choose the letter of the best answer. Encircle the letter of the correct answer.

50
1. What do you call the set of all data that researchers want to investigate?
a. data b. population
c. random d. sample
2. Which of the following can be considered as the best sample?
a. conveniently sample b. selected at random
c. purposively selected d. self-selected
3. From the given populations, which of the following samples is most likely to be
representative of the population from which it is drawn?
a. population: SAP beneficiaries b. population: lawmakers
sample: car owners sample: students
c. population: online shoppers d. population: graduating students
sample: cellphone users sample: grade 8 students
4. In random sampling, the sample should be_______ the population
a. as large as b. different from
c. not taken from d. representative of
5. Which of the following is NOT a type of random sampling?
a. cluster b. purposive
c. simple d. systematic
6. Which of the following is random sampling?
a. convenience b. non-probability
c. purposive d. stratified
7. As a researcher, you ask the people of your barangay on how they spent their money
during the community quarantine. You decided to group them according to their
socio-economic status and then proportionately and randomly selecting samples
from these groups. Which type of sampling are you using?
a. convenience b. purposive
c. stratified d. systematic
8. As a researcher, you want to know the plans of graduating senior high school
students. You ask all by-passers to participate in your research. Which type of
sampling are you using?
a. convenience b. purposive
c. stratified d. systematic
9. You wish to examine the effects of community quarantine on your fellow students.
You decided to borrow a list from the guidance office and select every 10th student
on it. What sampling technique are you using?
a. convenience b. purposive
c. stratified d. systematic
10. You want to know the effects of the COVID-19 to the Filipino people and you will ask
the medical frontliners. Which type of sampling will you use?
a. convenience b. purposive
c. stratified d. systematic

Practice B
Choose the letter of the best answer. Encircle the letter of the correct answer.

1. What do you call a number that describes a sample characteristic?


a. data b. sample
c. parameter d. statistic

51
2. What do you call a number that describes a population characteristic?
a. data b. sample
c. parameter d. statistic
3. Which of the following is a statistic?
a. 𝜎2 b. 𝜎
c. µ d. s

For numbers 4-9, the following are scores of students in their quiz:
10, 8, 7, 20, 11, 8, 15, 10, 6, 14, 16, 20
4 – 5. What is the value of µ? (2 points)
a. 10.4 b. 11
c. 12.08 d. 14.4
6 – 7. What is the value of 𝜎? (2 points)
a. 4.6 b. 5
c. 6 d. 5.6
8– 9. What is the value of the population variance? (2 points)
a. 18.7 b. 25
c. 20 d. 21.5
For numbers 10-15, the following are the seven scores of the students above that are
randomly selected: 8, 7, 20, 15, 10, 14, 16

10 – 11. What is the value of the sample mean? (2 points)


a. 15.2 b. 12.9
c. 13.5 d. 14.7
12 – 13. What is the value of s2? (2 points)
a. 20.8 b. 19.7
c. 22.1 d. 25.4
14 – 15. What is the sample standard deviation? (2 points)
a. 6.2 b. 4.7
c. 8.3 d. 5.6

Practice C
Choose the letter of the best answer. Encircle the letter of the correct answer.

1. What distribution pertains to the frequency distribution of the sample mean from all
the possible random samples of a particular sample size n taken from the given
population?
a. frequency b. normal
c. population d. sampling
2. Which of the following is NOT a step-in creating sampling distribution of the sample
mean?
a. Determine the number of sets of all b. Compute for the standard
possible random samples. deviation and variance of the
samples.
c. Construct a frequency distribution d. List all the possible random
table of the sample means and samples and solve for the sample
probability. mean of each set of samples

52
3. Which of the following is the mean of sample 6, 10, 21, 25, and 28?
a. 15.17 b. 18
c. 21.2 d. 22
For numbers 4-9, refer to the following set of data of a population {5, 6, 7, 8}
4. How many different samples of size n = 2 can be drawn from the population?
a. 6 b. 5
c. 4 d. 3
5. Which of the following sample mean appears most frequent in the sampling
distribution?
a. 5 b. 5.5
c. 6 d. 6.5
6. What is the lowest value of the sample mean in this sampling distribution?
a. 5.5 b. 6.5
c. 7 d. 8
7. What is the frequency of the sample mean 7?
a. 0 b. 1
c. 2 d. 3
8. What is the probability of the sample mean 6 in the sampling distribution?
a. 1/6 b. 2/3
c. 1/2 d. 2/5
For numbers 9-10, refer to the following population consists of the data {3,6,9,12,15}.
9. How many different samples of size n = 3 can be drawn from the population?
a. 6 b. 12
c. 8 d. 10
10. Which of the following sample mean appears most frequent in the sampling
distribution?
a. 6 b. 7
c. 8 d. 12

References:
• Statistics and Probability 11, Pivot 4A Learners Material (N/A) Pages 172–206
• https://www.scribbr.com/methodology/simple-random-sampling/
• http://www.stat.yale.edu/Courses/199798/101/sample.htm#:~:text=Simple%20rando
m%20sampling%20is%20the,being%20included%20in%20the%20sample.

53
Answer Key:

Practice A: Practice B: Practice C:


1. b 1. d 1. d
2. b 2. c 2. b
3. c 3. d 3. b
4. d 4– 5. c 4. a
5. b 6-7. a 5. d
6. d 8-9. d 6. a
7. c 10-11. b 7. c
8. a 12-13. c 8. a
9. d 14-15. b 9. d
10. b 10. c

Prepared by:

Jeral S. Reyes

Teacher – II

54
WEEK 6: MEAN AND VARIANCE OF SAMPLING DISTRIBUTION OF THE SAMPLE MEAN

Background Information
The sampling distribution of the sample mean is a frequency distribution using the
means computed from all possible random samples of a specific size taken from a population.

MEAN
The mean of the sampling distribution of the mean is the mean of the population from
which the scores were sampled. Therefore, if a population has a mean μ, then the mean of
the sampling distribution of the mean is also 𝜇. The symbol 𝜇𝑥̅ is used to refer to the mean of
the sampling distribution of the mean. Therefore, the formula for the mean of the sampling
distribution of the mean can be written as:

𝜇𝑥̅ = 𝜇
VARIANCE
The variance of the sampling distribution of the mean is computed as follows:

2
𝜎2
𝜎 𝑥̅ =
𝑁
That is, the variance of the sampling distribution of the mean is the population
variance divided by N, the sample size (the number of scores used to compute a mean).
Thus, the larger the sample size, the smaller the variance of the sampling distribution of the
mean.

The standard error of the mean is the standard deviation of the sampling
distribution of the mean. It is therefore the square root of the variance of the sampling
distribution of the mean and can be written as:

𝜎
𝜎𝑥̅ =
√𝑁

Example 1 : Mean and Variance of Sampling Distribution of Sample Means


Consider a population consisting of 1,2,3,4, and 5. Suppose samples of size 2 are
drawn from this population. Describe the sampling distribution of the sample means. What is
the mean and variance of the sampling distribution of the sample mean?

Steps Solution
1. Compare the mean of the ∑𝑥
population (𝜇). 𝜇=
𝑁
1+2+3+4+5
=
5
= 3.00

55
2. Compute the variance of the 𝑥 𝑥−𝜇 (𝑥 − 𝜇)2
population (𝜎 2 )
1 -2 4
2 -1 1
3 0 0
4 1 1
5 2 4
∑(𝑥 − 𝜇)2 = 10

∑(𝑥 − 𝜇)2
𝜎2 =
𝑁
10
=
5
= 2

So, the variance of the population is 2.

3. Determine the number of Use the formula NCn. Here n =2


possible outcomes 5C2 = 10
So, there are 10 possible samples of size 2 that can
be drawn.

4. List all possible samples


Samples Mean

1,2 1.50

1,3 2.00

1,4 2.50

1,5 3.00

2.3 2.50

2,4 3.00

2,5 3.50

3,4 3.50

3,5 4.00

4,5 4.5

56
5. Construct the sampling
Distribution of the sample Sampling Distribution of Sample Mean
mean Sample Frequency Probability
Mean ( ) P( )

1.50 1 1
10
2.00 1 1
10
2.50 2 1
5
3.00 2 1
5
3.50 2 1
5
4.00 1 1
10
4.50 1 1
10
Total 10 1.00

6. Compute the mean of the


sampling distribution of the
sample means 𝜇𝑥̅ Sample Probability
𝑋̅ ∙ 𝑃(𝑋̅)
Mean (𝑋̅) 𝑃(𝑋̅)
Follow the steps: 1
1.50 0.15
10
1
a. Multiply the sample mean 2.00 0.20
10
by the corresponding 1
2.50 0.50
probability. 5
1
3.00 0.60
5
b. Add the results. 1
3.50 0.70
5
1
4.00 0.40
10
1
4.50 0.45
10
Total 1.00 3.00

∑ 𝑋̅ ∙ 𝑃(𝑋̅) = 3.00

So, the mean of the sampling distribution 𝜇𝑥̅ of the


sample mean is 3.00.

57
7. Compute the variance of the ̅
𝒙 ̅)
𝑷(𝒙 ̅−𝝁
𝒙 ̅ − 𝝁)𝟐
(𝒙 ̅ − 𝝁)𝟐
̅) ∙ (𝒙
𝑷(𝒙
sampling distribution of the
1
sample means 𝜎 2 𝑥̅ . 1.50 -1.50 2.25 0.225
10
1
Follow these steps: 2.00 -1.00 1.00 0.100
10
1
a. Subtract the population 2.50 -0.50 0.25 0.050
5
mean (𝜇). Label it as 𝑥̅ − 𝜇. 1
3.00 0.00 0.00 0.000
5
b. Square the differences. 1
3.50 0.50 0.25 0.050
Label this column as 5
(𝑥̅ − 𝜇)2 1
4.00 1.00 1.00 0.100
10
1
c. Multiply the results in (b) by 4.50 1.50 2.25 0.225
10
each corresponding
probability. 𝑃(𝑥̅ ) ∙ (𝑥̅ − 𝜇)2 Total 1.00 0.750

d. Add the results. 𝜎 2 𝑥̅ = 𝑃(𝑥̅ ) ∙ (𝑥̅ − 𝜇)2

𝜎 2 𝑥̅ = 0.75

𝜎𝑥̅ = 0.85

So, the variance of the sampling distribution is 0.75


and the standard deviation is 0.87

Example 2: Describing the sampling Distribution of the sample means from an Infinite
Population

A population has a mean of 60 and a standard deviation of 5. A random sample of 16


measurements is drawn from this population. Describe the sampling distribution of the sample
means by computing its mean and standard deviation.

Steps Solution
1. Identify the given information Here 𝜇 = 60, 𝜎 = 5 and 𝑛 = 16.
2. Find the mean of the sampling 𝜇𝑥̅ = 𝜇
distribution. Use the property that
𝜇𝑥̅ = 60
𝜇𝑥̅ = 𝜇
3. Find the standard deviation of the 𝜎
𝜎𝑥̅ =
sampling distribution. Use the √𝑁
𝜎
property that 𝜎𝑥̅ = 5
√𝑁 =
√16
5
=
4
𝜎𝑥̅ = 1.25

58
Observe that the mean of the sampling distribution of the sample mean is always equal
to the mean of the population. The variance of the sampling distribution is obtained by using
𝜎 2 𝑁−𝑛
the formula, 𝜎 2 𝑥̅ = ( ). This formula holds when the population is finite.
𝑛 𝑁−1
A finite population is one that consists of a finite or fixed number of elements,
measurements, observations; while an infinite population contains, hypothetically at least,
infinitely elements.
The standard deviation (𝜎𝑥̅ ) of the sampling distribution of the sample means is also
known as the standard error of the mean. It measures the degree of accuracy of the sample
mean (𝜇𝑥̅ ) as an estimate of the population mean (𝜇).
A good estimate of the mean is obtained if the standard error of the mean is smaller or
close to zero, while a poor estimate, if the standard error of the mean is large.

DEFINING THE SAMPLING DISTRIBUTION OF THE SAMPLE MEAN FOR NORMAL


POPULATION WHEN THE VARIANCE IS: (A) KNOWN; (B) UNKNOWN

The definition of the sampling distribution of the sample mean for the normal population
when the variance is known or unknown is used to determine the probability value of a certain
event for small and large samples. This serves as a tool for statisticians or any interesting
group who wants to test the sample mean using statistical formulas and to make a rightful
decision in the future. If the population standard deviation is known, use the normal distribution
and if the population standard deviation is unknown, use t-distribution.

Example 3: Read and analyze the following situations. Determine whether the following
statements have a known or unknown population variance.
a. Because of Inclusive Education, learners with disabilities are also part of the normal
students. Consider a population of the PWD learners consisting of 1, 2, 3, 4, and 5.
Samples of size 2 are drawn from this population.
b. Given 𝜇 = 15, 𝑠 = 6 𝑎𝑛𝑑 𝑛 = 16.
c. IQ scores are normally distributed with standard deviation 𝜎 = 15 𝑝𝑜𝑖𝑛𝑡𝑠. You wish to
determine the 𝜇 IQ of all students in Matiyaga National High School by taking a random
sample of 16 students.
d. The general weighted average (GWA) of the Grade 7 applicants for admission in
QSHS over the past 7 years has been consistently 𝜇 = 92.78. This year a sample data
of the applicant’s GWA are 98, 89, 92, 95, 96, 90, 91, 92, 97 and 93
e. A finite population composed of six items whose values are 2, 3, 6, 7, 8, and 10.
Samples of 3 items are drawn at random without replacement.

Solution:
a. Known – This is a population data. Although the value of the variance is not given, you can
still determine the population mean and population variance of the data using the formulas 𝜇 =
∑𝑋 ∑(𝑥−𝜇)2
, 𝜎2 = respectively. So, this situation is an example of the sampling distribution when
𝑁 𝑁
𝜎
the variance is known. Since the population variance can be calculated, used 𝜎𝑥̅ = in the
√𝑛
computation of the standard error of the mean.

59
b. Unknown- The population variance is unknown. The given values are the population mean
𝜇 = 15, the sample standard deviation 𝑠 = 6, and the sample size 𝑛 = 16. Use the formula 𝑠𝑥̅ =
𝑠
for the standard error of the mean.
√𝑛

c. Known- the population variance is given with the value of the square of the standard
𝜎
deviation 𝜎 2 = 152 . Apply the formula 𝜎𝑥̅ = in computing for the standard error of the mean.
√𝑛

d. Unknown- The population variance is unknown. The only given values are the population
𝑠
mean 𝜇 = 92.78 and the sample size 𝑛 = 10. Use the formula 𝑠𝑥̅ = for the standard error of
√𝑛
the mean.

e. Known – This is a population data, you can determine the population mean and variance of
∑𝑋 ∑(𝑥−𝜇)2
the data using the formulas 𝜇 = , 𝜎2 = . So, this situation is an example of the
𝑁 𝑁
sampling distribution where the population variance can be computed. Apply the formula 𝜎𝑥̅ =
𝜎
to solve
√𝑛

Learning Competencies with code:


• Find the mean and variance of sampling distribution of the sample mean (M11/12SP-IIId-5)
• Define the sampling distribution of the sample mean for normal population when the
variance is: (a) known; (b) unknown (M11/12SP-IIIe-1)

GENERAL INSTRUCTION: Write your answer on a separate paper


Practice A
Find the finite population factor given the following.
1. N = 10 , n = 2
2. N = 10 , n = 3
3. N = 9 , n = 2
4. N = 8 , n = 3
5. N = 7 , n = 4

Practice B
Given the population 1,4,5,7, 9. Suppose sample size of 3 are drawn from this population.
1. What is the mean (𝜇), variance (𝜎 2 ) and standard deviation (𝜎) of the population?
2. How many different samples of size n = 3 can be drawn from the population? List them
with their corresponding means.
3. Construct the sampling distribution of the sample mean.
4. What is the mean (𝜇𝑥̅ ) of the sampling distribution mean? Compare to the mean of the
population.
5. What is the variance (𝜎 2 𝑥̅ ) and standard (𝜎𝑥̅ ) of the sampling distribution of the sample
means?

60
Practice C
Solve the following problems. Assume that the population is infinite in each case.
1. The scores of individual students on a national test have a normal distribution with
mean 18.6 and standard deviation 5.9. At Federico Ramos Rural High School, 76
students took the test. If the scores at this school have the same distribution as national
scores, what are the mean and standard of the sample mean for 76 students?
2. A molding machine prepares a certain kind of car spare part with a target diameter 𝜇 =
40.265 millimeters. The machine has some variability, so the standard deviation of the
diameter is 𝜎 = 0.004 millimeters. A sample of 6 spare parts is inspected each hour
for process control purposes and records are kept of the sample mean diameter. What
will be the mean and standard deviation of the numbers recorded?

Practice D
Read and analyze the following situations. Determine whether the following statements have
a known or unknown population variance. Write your answer on the space provided before
the number.
_________1. A population has a mean of 85 and a standard deviation of 6. A random sample
of 22 measurements is drawn from this population.
_________2. The weights of female SHS students are normally distributed with a mean of
45kg. Twenty samples consisting of 10 students are drawn from the population and found to
have a mean of 46 kg and a standard deviation of 1.24kg. 232

_________3. A random of 𝑛 = 70 measurements are obtained from a population with a mean


of 156 and a standard deviation of 23.
_________4. A manufacturer of nylon synthetic fibers for fishing line, claims that the mean
tensile strength of their product is 30 lb. of a 20 meter-line. To test the claim of the
manufacturer, a sample of 25, the 20- meter line was tested and found to have a mean weight
of 32 lbs and a standard deviation of .78 lb.
__________5. Given population mean of 200, a sample of 25 has a standard deviation of 3.
Practice E
Consider all sample of size 5 from this population 3,5,7,8,9,10
1. Compute mean (𝜇), variance (𝜎 2 ) and standard deviation (𝜎) of the population?
2. List of all sample of size 5 and compute the mean (𝑥) for each sample.
3. Construct the sampling distribution of the sample mean.
4. Calculate the mean (𝜇𝑥̅ ) of the sampling distribution mean?
5. What is the variance (𝜎 2 𝑥̅ ) and standard (𝜎𝑥̅ ) of the sampling distribution of the sample
means?

61
References
Belecina, Rene, Baccay Elisa and Mateo Efren: Statistics and Probability,Manila, Philippines:
REX Book Store, 2016
Alferez, Merle and Duro, Ma. Cecilia: Statistics and ProbabilityQuezon City, Philippines :MSA
Publishing House
Pineda, Hermenegildo, Statistics and Probability, Cainta, Rizal, PhilippinesPublished by the
Department of Education
http://onlinestatbook.com/2/sampling_distributions/samp_dist_mean.html
https://stattrek.com/sampling/sampling-distribution.aspx

Answer key
Practice A
1. 45
2. 120
3. 36
4. 56
5. 35

Practice B
1. 𝜇 = 5.2

𝜎 2 = 7.36

𝜎 = 2.71

2. 10 samples

Samples Mean
1,4,5 3.33
1,4,7 4.00
1,4,9 4.67
1,5,7 4.33
1,5,9 5.00
1,7,9 5.67
4,5,7 5.33
4,5,9 6.00
4,7,9 6.67
5,7,9 7.00

62
Sampling Distribution of Sample Mean

̅)
Sample Mean (𝑿 Frequency ̅)
Probability P(𝑿
1
3. 1.50 1
10
1
2.00 1
10
1
2.50 2
5
1
3.00 2
5
1
3.50 2
5
1
4.00 1
10
1
4.50 1
10
Total 10 1.00

4. 𝜇𝑥̅ = 5.2
The sampling distribution of the sample mean is equal to the mean of the population

5. 𝜎 2 𝑥̅ = 1.23

𝜎𝑥̅ = 1.11
Practice C
1. 𝜇𝑥̅ = 18.6

𝜎𝑥̅ = 0.68

2. 𝜇𝑥̅ = 40.265

𝜎𝑥̅ = 0.0016

Practice D

1. Known
2. Unknown
3. Known
4. Unknown
5. Unknown

Practice E
1. 𝜇 = 7.00

𝜎 2 = 5.67

𝜎 = 2.38

63
2.

Samples Mean
3, 5, 7,8,9 6.4
3,5,7,8,10 6.6
3,5,7,9,10 6.8
3,5,8,9,10 7.0
3,7,8,9,10 7.4
5,7,8,9,10 7.8

3.

Sample Probability
̅) Frequency ̅)
Mean (𝑿 P(𝑿
3, 5, 7,8,9 1 1/6
3,5,7,8,10 1 1/6
3,5,7,9,10 1 1/6
3,5,8,9,10 1 1/6
3,7,8,9,10 1 1/6
5,7,8,9,10 1 1/6
Total 10 6/6 =1.00

4. 𝜇𝑥̅ = 7.00
The sampling distribution of the sample mean is equal to the mean of the population

5. 𝜎 2 𝑥̅ = 0.23 , 𝜎𝑥̅ = 0.48

Prepared :

JOSE P. AGTARAP JR.


Teacher III

64
WEEK 7 : CENTRAL LIMIT THEOREM AND PROBLEMS INVOLVING SAMPLING
DISTRIBUTIONS OF THE SAMPLE MEAN

Background Information

Many common statistical procedures require data that approximately follows a normal
distribution. However, normally distributed data isn’t always the case. There are cases wherein
the population is skewed. This is where the Central Limit Theorem becomes useful.

The central Limit Theorem is a fundamental importance is statistics because it justifies


the use of normal curve method for a wide range of problems. This theorem applies
automatically to sampling form infinite population. It also assures us that no matter what the
shape of the population distribution of the mean is, the sampling distribution of the sample
means is closely normally distributed whenever n is large. Consequently, it justifies the use of
𝑥̅ −𝜇
the formula 𝑧 = 𝜎 when computing for the probability that 𝑥̅ will take on the value within a
√𝑛
𝑋−𝜇
given range in the sampling distribution of 𝑥̅ . We can also use 𝑧 = for individual score /
σ
observation.
where 𝑥̅ = sample mean
µ = population mean
σ = population standard deviation
n = sample size
X = given score / measurement

Many statistics have distributions that are approximately normal for large sample sizes,
even when we are sampling from a distribution that is not normal. This means that we can
develop statistical inference procedures that are based on the normal distribution, even if we
are sampling from a population that is not normal provided, we have a large sample size.

Let us illustrate this through this example:


Suppose that the average age of the people living in a Barangay is 34 with a standard
deviation of 4. If 100 residents of a certain Barangay decided to take summer outing after
COVID-19 pandemic and Enhanced Community Quarantine has been lifted for bonding and
relaxation, what is the probability that the average age of these residents is less than 35?

Solution:
It is not given that the population is normally distributed but since 𝑛 > 30, then you can
assume that the sampling distribution of the mean ages of 100 barangay residents is normal
according to the Central Limit Theorem.
The Central Limit Theorem describes the normality of the distribution of the sample
mean taken from a population that is not normally distributed.

Step 1: Identify the given information


𝜇 = 34, 𝜎 = 4, 𝑋̅ = 35, 𝑛 = 100

65
Step 2: Identify what is asked.
𝑃(𝑥 < 35)

Step 3: Identify the formula to be used.


Here we are dealing with data about the sample means. So, we will use the formula
̅−𝝁
𝒙
𝑧= 𝝈 to standardize 35.
√𝒏
Step 4: Solve the problem
𝑋̅−𝜇
𝑧 = 𝜎
√𝑛

35−34
𝑧 = 4
√100

1
= 4
10

= 𝟐. 𝟓
We shall find 𝑃(𝑋 < 35) by getting the area under the normal curve.
𝑃(𝑋 < 35) = 𝑃(𝑧 = 2.5)
= 0.5 + 0.4938
= 𝟎. 𝟗𝟗𝟑𝟖

Step 5 : State your final answer


Therefore, the probability that the random sample of 100 persons has an average age
of fewer than 35 is 0.9938 or 99.38%

A sampling distribution of the sample mean is a frequency distribution of the sample


mean computed from all possible random samples of a specific size n taken from a population.
The probability distribution of the sample mean is also called the sampling distribution of
the sample mean. The standard distribution of the sampling distribution of the sample mean
is also known as the standard error of the mean.

Example A.
The mean it takes a group of senior high school students to complete a certain
examination is 50.6 minutes. The standard deviation is 6 minutes. Assume that the variable is
normally distributed.

1. What is the probability that a randomly selected senior high school student will complete
the examination in less than 48 minutes?

Solution:
Step 1. Identify the given information:
𝜇 = 50.6, 𝜎 = 6, X = 48

Step 2. Identify what is asked:


𝑃(𝑋˂48)

Step 3. Identify the formula to be used:


The problem is dealing with an individual data obtained from the population so the formula
𝑋− 𝜇
to be used is 𝑧 = convert 48 to standard score.
𝜎

66
Step 4. Solve the problem
𝑋− 𝜇
𝑧=
𝜎
48− 50.6
𝑧=
6
𝑧 = − 𝟎. 𝟒𝟑

Find 𝑃(𝑋 < 48) by getting the area under the normal curve.
𝑃(𝑋˂48) = 𝑃(𝑧˂ − 0.43)
= 0.5 − 0.1664
= 𝟎. 𝟑𝟑𝟑𝟔

Step 5: State your final answer


Therefore, the probability that a randomly selected college student will complete the
examination in less than 48 minutes is 0.3336 or 33.36%

2. If 49 randomly selected senior high school students take the examination, what is the
probability that the mean time it takes the group to complete the test will be less than 48
minutes?

Solution
Step 1: Identify the given information:
𝜇 = 50.6, 𝜎 = 6, 𝑋̅ = 48, 𝑛 = 49

Step 2: Identify what is asked.


𝑃(𝑋̅˂48)

Step 3: Identify the formula to be used.


The problem is dealing with data about the sample mean or n observations, so the formula
𝑋̅−𝜇
to be used to standardize 48 is 𝑧 = 𝜎
√𝑛
Step 4: Compute for the Probability:
𝑋̅−𝜇
𝑧 = 𝜎
√𝑛
48 − 50.6
=
6
√49
= − 𝟑. 𝟎𝟑

Find 𝑃(𝑋̅˂48) by getting the area


under the normal curve.
𝑃(𝑋̅˂48) = 𝑃(𝑧˂ − 3.03)
= 0.5 − 0.0478
= 𝟎. 𝟎𝟎𝟏𝟐

67
Step 5 : State your final answer.
The probability that 49 randomly selected senior high school students will complete
the test in less than 48 minutes is 0.0012 or 0.12%

3. If 49 randomly selected senior high school students take the examination, what is the
probability that the mean time it takes the group to complete the test will be more than 51
minutes?

Solution
Step 1: Identify the given information:
𝜇 = 50.6, 𝜎 = 6, 𝑋̅ = 51, 𝑛 = 49

Step 2: Identify what is asked.


𝑃(𝑋̅ > 51)

Step 3: Identify the formula to be used.


The problem is dealing with data about the sample mean or n observations, so the formula
𝑋̅−𝜇
to be used to standardize 51 is 𝑧 = 𝜎
√𝑛
Step 4: Compute for the Probability:
𝑋̅−𝜇
𝑧 = 𝜎
√𝑛
51−50.6
= 6
√49
= 𝟎. 𝟒𝟕
̅
Find 𝑃(𝑋 > 51) by getting the area under the
normal curve.
𝑃(𝑋̅ > 51) = 𝑃(𝑧 > 0.47)
= 0.5 − 0.1808
= 𝟎. 𝟑𝟏𝟗𝟐

Step 5: State your final answer


The probability that 49 randomly selected senior high students will complete
the test in more than 51 minutes is 0.3192 or 31.92%

4. If 49 randomly selected senior high school students take the examination, what is the
probability that the mean time it takes the group to complete the test is between 47.8 and 53
minutes?

Solution
Step 1: Identify the given information:
𝜇 = 50.6, 𝜎 = 6, 𝑋̅ = 47.8 and 53, 𝑛 = 49

Step 2: Identify what is asked.


𝑃(47.8 < 𝑋̅ < 53)

Step 3: Identify the formula to be used.

68
The problem is dealing with data about the sample mean or n observations, so the
𝑋̅ −𝜇
formula to be used to standardize 47.8 and 53 is 𝑧 = 𝜎
√𝑛

Step 4: Compute for the Probability:


𝑃(𝑋̅ > 47.8) 𝑃(𝑋̅ < 53)
𝑋̅ −𝜇 𝑋̅ −𝜇
𝑧 = 𝜎 𝑧 = 𝜎
√𝑛 √𝑛

47.8−50.6 53−50.6
= 6 = 6
√49 √49

= −𝟑. 𝟐𝟕 = 𝟐. 𝟖

Find 𝑃(𝑋̅ < 47.8) by getting the Find 𝑃(𝑋̅ < 53) by getting the
area under the normal curve. area under the normal curve
𝑃(𝑋̅ < 47.8) = 𝑃(𝑧 < −3.27) 𝑃(𝑋̅ < 53) = 𝑃(𝑧 < 2.8)
= 0.5 − 0.4995 = 0.5 + 0.4974
= 𝟎. 𝟎𝟎𝟎𝟓 = 𝟎. 𝟗𝟗𝟕𝟒
Note: We use 𝑃(𝑋̅ < 47.8) instead of 𝑃(𝑋̅ > 47.8) so that we can subtract on the later part

To find the probability that 49 randomly selected senior high school students will
complete the test between 47.8 and 53 minutes, subtract the smaller area from the bigger
area under the normal curve.

That is 𝟎. 𝟗𝟗𝟕𝟒 − 0.0005 = 𝟎. 𝟗𝟗𝟔𝟗 or 99.69%

Step 5: State your final answer


Therefore, the probability that 49 randomly selected senior high school students will
complete the test between 47.8 and 53 minutes is 0.9969 or 99.69%

69
Example B
The average number of milligrams (mg) of cholesterol in a cup of a certain brand of ice
cream is 660mg, and the standard deviation is 35mg. Assume the variable is normally
distributed.
a. If a cup of ice cream is selected, what is the probability that the cholesterol content
will be more than 670mg?

Solution
Step 1: Identify the given information
µ = 660, 𝜎 = 35, 𝑋 = 670
Step 2: Identify what is asked.
𝑃(𝑋 > 670)
Step 3: Identify the formula to be used.
Here we are dealing with an individual data obtained from the population. So, we
𝑿− 𝝁
will use the formula 𝒛 = to standardize 670.
𝝈

Step 4: Solve the problem


𝑋− 𝜇
𝑧=
𝜎
670 − 660
=
35
= 𝟎. 𝟐𝟗
We shall find P(X>670) by getting the area under 0.3859
the normal curve.
𝑃(𝑋 > 670) = 𝑃(𝑧 > 0.29)
= 0.5 − 0.29
= 𝟎. 𝟑𝟖𝟓𝟗
0.29
Step 5: State your final answer
Therefore, the probability that a cholesterol content will be more than 670mg is 0.3859 or
38.59%.

b. If a sample of 10 cups of ice cream is selected, what is the probability that the mean
of the sample will be larger than 670 mg?
Solution
Step 1: Identify the given information
µ = 660, 𝜎 = 35, 𝑋̅ = 670, 𝑛 = 10

Step 2: Identify what is asked.


𝑃(𝑋̅ > 670)

Step 3: Identify the formula to be used.


Here we are dealing with data about the sample means. So we will use the formula
𝑥̅ −𝜇
𝑧= 𝜎 to standardize 670.
√𝑛

Step 4: Solve the problem


𝑥̅ −𝜇
𝑧= 𝜎
√𝑛

70
670−660
= 35
√10
= 𝟎. 𝟗𝟎
We shall find 𝑃(𝑋̅ > 670) by getting the area under 0.1841
the normal curve.
𝑃(𝑋̅ > 670) = 𝑃(𝑧 > 0.90)
= 0.5 − 0.3159
= 𝟎. 𝟏𝟖𝟒𝟏
0.90
Step 5: State your final answer
Therefore, the probability that the mean cholesterol content of 10 randomly selected
cups of ice cream will be more than 670 mg is 0.1841 or 18.41%.

Learning Competencies with code


• Illustrate the Central Limit Theorem (M11/12SP-IIIe-2),
• Define the sampling distribution of the sample mean using the Central Limit Theorem
(M11/12SP-III-3) and
• Solve problems involving sampling distributions of the sample mean (M11SP-IIIe-f-1)

GENERAL INSTRUCTION: Write your answer on a separate paper


Activity 1: Multiple Choice
Choose the best answer in each question. Write the letter of your choice on a paper.

For numbers 1 - 4, refer to the problem below.


The score of the students in Mathematics for the midyear examination has a mean of
40 and a standard deviation of 5.
1. What is the value of 𝜇?
A. 5 B. 25 C. 30 D. 40
2. What is the value of 𝜎?
A. 5 B. 25 C. 30 D. 40
3. If the score of one of the students is 41, what is his standard score?
A. -0.22 B. -0.2 C. 0.2 D. 0.22
4. What is the rank of the student who got 41 in score?
A. 58.71% B. 57.93% C. 8.71% D. 7.93%
For numbers 5 – 7, refer to the problem below.
The GPAs of all students enrolled at a large university have an approximately normal
distribution with a mean of 3.05 and a standard deviation of 0.29. Find the probability
that the mean GPA of a random sample of 20 students selected from this university
is:
5. less than 2.98?
A. 12.58% B. 14.01% C. 15.62% D. 85.23%
6. greater than 3.01?
A. 19.32% B. 22.41% C. 55.56% D. 73.24%
7. between 2.90 and 3.07?
A. 62.17% B. 61.13% C. 42.34% D. 1.04%

71
8. Which formula can standardize an individual data?
𝑥̅ −𝜇 𝑥−𝜇 𝑥−𝜇
A. 𝜎 B. 𝑛𝜋(1 − 𝜋) C. D.
𝜎 𝑛
√𝑛
9. Which formula is applicable when dealing with data about the sample means?
𝑥̅ −𝜇 𝑥−𝜇 𝑥−𝜇
A. 𝜎 B. 𝑛𝜋(1 − 𝜋) C. D.
𝜎 𝑛
√𝑛
10. The number of total carbohydrates in an order of camote fries in a native restaurant is
approximately normally distributed with a mean of 240 and a standard deviation of 20. What
is the probability that a randomly selected order has at least 265 total carbohydrates?
A. 54.13% B. 34.67% C. 21.56% D. 10.56%

Activity 2: Let’s Standardize!


Standardize the following data. Round off your final answer into nearest hundredths. Use a
paper for your answer.

1. µ = 60, 𝜎 = 8, and 𝑋 = 64
2. µ = 52.9 𝜎 = 7.2 𝑋 = 48.5
3. µ = 12.5, 𝜎 = 5, 𝑋̅ = 11.8 and 𝑛 = 50
4. µ = 62, 𝜎 = 6, 𝑋̅ = 59, and 𝑛 = 30
5. µ = 245, 𝜎 = 20, 𝑋̅ = 248, and 𝑛 = 25

Activity 3: Problem Solving


Use a paper for your answer. Show your solution using 5 steps.
1. The height of pupils of Josel L. Dizon Elementary School has a mean of 121cm with
standard deviation of 5. If 50 of them are taken samples, what is the probability that
their mean height is less than 120cm?
2. The mean NAT scores of Grade 10 students is 65. Sixty (60) students were chosen
and found that the standard deviation of their score is 5. What is the probability that
their mean is between 64 and 67?
3. The average time it takes a group of college students to complete a certain
examination is 46.2 minutes. The standard deviation is 8 minutes. Assume that the
variable is normally distributed.
a. What is the probability that a randomly selected college student will complete the
examination in less than 43 minutes?
b. If 50 randomly selected college students take the examination, what is the
probability that the mean time it takes the group to complete the test will be less
than 43 minutes?

References:
Rene R. Belecina, et.al, 2016. “Statistics and Probability.” First Edition. pgs.126-132

Ricardo B. Banigon, et.al., 2016. “Statistics and Probability for Senior High School”. pgs.52-54

Roland V. Magsino, "Learners Material in Statistics and Probability". Accessed on August


22, 2020. https://www.dummies.com/education/math/calculus/how-to-identify-the-four-
conic-sections-in-equation-form/

72
Answer Key
Activity A Activity B
1. D 6.D 1. 0.5
2. A 7. B 2. -0.61
3. C 8. C 3. -0.99
4. B 9. A 4. -2.74
5. B 10. D 5. 0.75

Activity C
1. Step 1: µ = 121, 𝜎 = 5, 𝑋̅ = 120, 𝑛 = 50 2. Step 1: µ = 65, 𝜎 = 5, 𝑋̅ = 64 & 67, 𝑛 = 60
Step 2: P(𝑋̅ < 120) Step 2: P(64 < 𝑋̅ < 67)
𝑥̅ −𝜇 𝑥̅ −𝜇
Step 3: 𝑧 = 𝜎 Step 3: 𝑧 = 𝜎
√𝑛 √𝑛
Step 4: 𝑧 = −1.41 Step 4: 𝑧 = −1.55 and 3.1
Step 5: P(𝑋̅ < 120) = 0.0793 or 7.93% Step 5: P(64 < 𝑋̅ < 67) = 0.9384 or 93.84%

3.a. 3.b.
Step 1: µ = 46.2, 𝜎 = 8, 𝑋 = 43 Step 1: µ = 46.2, 𝜎 = 8, 𝑋̅ = 43, 𝑛 = 50
Step 2: 𝑃(𝑋 < 43) Step 2: 𝑃(𝑋̅ < 43)
𝑋− 𝜇 𝑥̅ −𝜇
Step 3: 𝑧 = Step 3: 𝑧 = 𝜎
𝜎
Step 4: 𝑧 = −0.40 √𝑛

Step 5 : 𝑃(𝑋 < 43) = 0.3446 or 34.46%. Step 4: 𝑧 = −2.83


Step 5 : 𝑃(𝑋̅ < 43) = 0.0023 or 0.23%.

Prepared:

JOSEL L. DIZON
Senior High School Teacher III
Angeles City Senior High School

73
WEEK 8: T-DISTRIBUTION AND ESTIMATION OF PARAMETERS

Background Information
According to W.S. Gosset in 1908, even the standard deviation was unknown, the
distribution of the sample means was symmetric and similar to the normal distribution. He
stated that as the sample size increases, the distribution as well approaches the standard
normal distribution.
Definition:
The t-distribution is a probability distribution that is used to estimate population
parameters when the sample size is small (n ≤ 30). It can also be used when the population
is unknown.
If the distribution of a random variable x is approximately normal, then the formula
𝑋̅−µ
𝑡 = 𝑠
√𝑛
Characteristics of a t-distribution

1. The t-distribution is bell-shaped and symmetric about 𝑡 = 0, similar to the normal


distribution.
2. The total area under a t-curve is 1 (or 100%, similar to the normal distribution).
3. The mean, median, and mode of the t-distribution are all equal to zero.
𝑛−1
4. The variance of the t-distribution is equal to which approaches 1 as 𝑛 increases
𝑛−3
infinitely.

5. The t-distribution is a family of curves, each determined by a parameter known as


degrees of freedom (𝑑𝑓). This term refers to the number of free choices left after a
sample statistic such as 𝑋̅ is calculated and is given by
𝑑𝑓 = 𝑛 − 1 for a single group, or
𝑑𝑓 = 𝑛1 + 𝑛2 − 2 for two groups, where 𝑛, 𝑛1 , 𝑛2 are the sample sizes.
6. As the sample size 𝑛 gets larger, the t-distribution gets closer to the normal
distribution.

74
The critical values of 𝑡 allows us to make reliable inferences about the population
mean. The t-distribution can also be used to test hypotheses concerning population mean and
sample mean, and the difference between two means. However, we need to make sure that
the sample meets the following conditions:
1. The sample is small (𝑛 ≤ 30) and randomly selected.
2. The population standard deviation σ is unknown.
3. The parent population, the population from which you are sampling, is essentially
normal.
For a single group of samples, the degrees of freedom (𝑑𝑓) is equal to the sample
size n minus one:
𝑑𝑓 = 𝑛 − 1

For other situations, the degrees of freedom can be calculated differently. The degrees
of freedom may change for different applications and, since they specify a correct t-distribution
to use, it is important to calculate the degrees of freedom correctly. For example, when dealing
with two groups of samples, the degrees of freedom is calculated as
𝑑𝑓 = 𝑛1 + 𝑛2 − 2

where 𝑛1 and 𝑛2 are the two sample sizes.

75
Examples:
1. Find the 99th percentiles of t-distribution with 4 degrees of freedom.
df = 4
t = 4.604
2. Find the 90th percentiles of t- distribution with 6 degrees of freedom.
df = 6
t = 1.943
3. n = 21 , 99% confidence.
t = 2.845
4. n = 24, 95% confidence
df = 24 – 1 = 23
t = 2.069
5. df = 15, 90% confidence
t = 1.753

• Population follows normal distribution if random sample size n is considered and


determined using:
𝑋̅ − µ
𝑡 = 𝑠
√𝑛
Therefore, the t-interval estimate is,
𝑠 𝑠
𝑋̅ − 𝑡 ( ) < µ < 𝑋̅ + 𝑡 ( )
√𝑛 √𝑛

Population Standard Deviation Σ is Unknown

If 𝜎 is unknown it can be estimated by 𝑠𝑥 or the sample standard deviation,

𝑛 2
(∑𝑖=1 𝑥)
∑𝑛 𝑋−𝑋
√ 𝑖=1
̅ √∑𝑛𝑖=1 𝑥𝑖 2 − 𝑛
𝑆 = 𝑛−1
or 𝑠= 𝑛−1

It is similar to the standard deviation of normal distribution in a unimodal, bell-shaped,


symmetrical and extends infinitely in either direction approaching the horizontal axis

-3 -2 -1 0 1 2 3
Example:
1. The 28 employees of XYZ Company has a mean travel time of 14.3 kilometer to their
work. The standard deviation of the travel time is 2km. determine the 95% confidence
interval population mean.
𝑠 𝑠
Since the population standard deviation is not given then, 𝑋̅ − 𝑡 (√𝑛 ) < µ < 𝑋̅ + 𝑡 ( ) will
√𝑛
be applied.

76
Given: 𝑋̅ = 14.3 𝑑𝑓 = 27
𝛼
𝑠 = 2 = 0.025
2
𝑛 = 28 𝑡 = 2.052

Solution:
2 2
14.3 − 2.052 ( ) < µ < 14.3 + 2.052 ( )
√28 √28
14.3 − 2.052(0.378) < µ < 14.3 − 2.052(0.378)

14.3 − 0.776 < µ < 14.3 + 0.776

13.5 < µ < 15.1


Definition:
Population Proportion – it is the fraction or part of the population that has a certain
characteristic in a binomial experiment denoted by p and the
value of the parameter p is unknown.
Sample Proportion – it is the fraction or part of the sample that contain characteristic 𝑝
It is denoted by the statistic 𝑝̂ .
Point Estimate – a single number to estimate a population parameter.
where 𝑝̂ = for one proportion
𝑝̂1 – 𝑝̂2 for difference in two proportion
Interval Estimate – an interval of values used to estimate a population parameter for
proportion.
Confidence Level – A confidence interval for a population proportion based on the
sample size n. Most commonly used are 0.95, 0.90 and 0,99

Formula for One proportion

𝑝̂𝑞̂
𝐸 = 𝑧𝑎 √
2 𝑛

𝑝̂ 𝑞̂ 𝑝̂ 𝑞̂
𝑝̂ − 𝑧𝑎 √ < 𝑝 < 𝑝̂ + 𝑧𝑎 √
2 𝑛 2 𝑛

𝑥
where: 𝑝̂ =
𝑛

𝑞̂ = 1 − 𝑝̂

Example:
1. There are 55 college students chosen randomly from 450 enrolled in the first semester
who wish to enroll a summer program in their major subjects. Estimate the population
proportion of students that 11% will take summer class. Use 95% confidence level.

Given: 𝑥 = 55 , 𝑛 = 450, 95% = 0.95 , 𝑧 = 1.96


Solution:
55
𝑝̂ = = 0.1222
450
𝑞̂ = 1 − 0.1222 = 0.8778

77
Then,
𝑝̂ 𝑞̂ 𝑝̂ 𝑞̂
𝑝̂ − 𝑧√ < 𝑝 < 𝑝̂ + 𝑧√
𝑛 𝑛

0.1222(0.8778) 0.1222(0.8778)
0.1222 − 1.96√ < 𝑝 < 0.1222 + 1.96√
450 450
0.1222 − 1.96(0.015439283) < 𝑝 < 0.1222 + 1.96(0.015439283)
0.1222– 0.0302360995 < 𝑝 < 0.1222 + 0.0302360995
0.092 < 𝑝 < 0.1524
9.2% < 𝑝 < 15.24%
Therefore an estimated 11% will enroll this summer is correct since it is included
within the confidence interval estimate.

2. An LTO researcher wants to know the percentage of drivers using GPS systems in
their car. He wishes to be 99% confident that the estimate is within 2 percentage points
of the proportion. A recent study of 180 drivers showed that 25% used GPS systems.
How large is the sample size?
Given: 𝑝̂ = 0.25 , 𝐸 = 2% = 0.02
𝑞̂ = 1 − 0.25 = 0.75 𝑧𝑎 = 0.005
2
Solution:
Derived the formula for n
𝑝̂𝑞̂
𝐸 = 𝑧𝑎 √
2 𝑛

𝐸 𝑝̂𝑞̂
=√
𝑧𝑎 𝑛
2

2
𝐸 𝑝̂𝑞̂
( ) =
𝑧𝑎 𝑛
2

2
𝐸
𝑛= 𝑝̂ 𝑞̂ ( )
𝑧𝑎
2

0.005 2
𝑛 = 0.25(0.75) ( )
0.02
𝑛 = 0.1875(129)2
𝑛 = 0.1875(16,641)
𝑛 = 3,120.1875
Therefore, there are 3121 drivers considered in the study.

Learning Competencies with Code:


• Illustrate the t-distribution. (M11/12SP-lllg–2)
• Identify percentiles using the t-table. (M11/12SP-lllg–5)
• Identify the length of a confidence interval. (M11/12SP-lllj–1)
• Compute for the length of the confidence interval. (M11/12SP-lllj–2)
• Compute for an appropriate sample size using the length of the interval. (M11/12SP-lllj–3)
• Solve problems involving sample size determination. (M11/12SP-lllj–4)

78
Practice A.
1. Give the confidence coefficient of the following, using the t-table.
1. n = 6, 90% confidence
2. df = 7, 90% confidence
3. n = 12, 95% confidence
4. df = 24, 99% confidence
5. n = 8, 99% confidence

2. Find E given the following.


1. n = 6, s = 2, 90% confidence
2. n = 9, s = 2.8, 90% confidence
3. n = 13, s = 4.5, 95% confidence
4. n = 16, s = 3.1, 95% confidence
5. n = 21, s = 5, 95% confidence
Practice B
1. An admission officer of an educational institution wants to know the mean age of all
entering mathematics majors. He computed a mean age of 18 years and a standard
deviation of 1.2 years on a random sample of 25 entering mathematics majors
purportedly coming from a normally distributed population. with 99% confidence, find
the point estimate and the interval estimate of the population mean.
2. Students nowadays spend most of their time in social media aside from studying their
lessons. Suppose the researcher wants to know the habitual number of hours that
students spend on-line and she considered 20 students. A mean of 16.3 hours with
standard deviation of 5.4 hours were obtained. Construct a 95% confidence interval
for the average of all students who spend on-line.
3. The average weight of 20 students in a certain school was found to be 165 lbs with a
standard deviation of 4.5. Construct a 95% confidence interval for the population mean

Practice C
Find 𝑝̂ and 𝑞̂ given x and n.
1. x = 28, n = 100
2. x = 45, n = 240
3. x = 120, n = 1000
4. x = 234, n = 1500
5. x = 318, n = 2300

Practice D
Choose the letter of the correct answer
1. When x = 50 and n = 215, what is the value of 𝑝̂ ?
a. 0.23 b. 4.3 c. 50 d. 215
2. When x = 120 and n = 406, what is the value of 𝑞̂
a. 0.70 b. 20.3 c. 120 d. 406
3. Given: n = 20, s = 1.2 and 𝛼 = 0.1, the maximum error E is approximately
a. 0.33 b. 0.39 c. 0.45 d. 0.50
4. Which of the following illustrates confidence level?
a. 32.8 b. 30.39 c. 90% d. 90%-94%

79
For nos. 5 – 10.
The mean gasoline consumption of 10 cars is 28 liters with a standard deviation of 1.6 liters.
Find the point and the interval estimates using the confidence level.
5. What is α?
a. 95% b. 28 c. 1.6 d. 0.05
6. What is the point estimate of µ?
a. 95% b. 28 c. √10 d. 1.6
7. What is the appropriate distribution?
a. p b. s c. t d. z
8. what are the confidence coefficient?
a. +−2.58 b. +
−2.33 c. +
−2.26 d. +
−1.96
9. The lower limit is approximately
a. 27 b. 26.85 c. 26.81 d. 26.68
10. The upper limit is
a. 29 b. 29.15 c. 19.19 d. 29.32
Practice E
Solve the following problem
1. Mary wants to know, with 95% confidence, the proportion of households who like to
use detergent X. a previous survey showed that 42% like to use detergent X. Mary like
to be accurate within 2% of the true proportion. What sample size does Mary need?
2. The Dean of a college wants to use the proportion of a population to determine the
sample size needed to interview regarding their thoughts about the new school
structure. She wants to be able to assert with a probability 0.95 that her error will be
at most 0.05. Similar polls in the past showed that 65% approved the new structure.
How large a sample does the Dean need?

References

Blay, B.E. Elementary Statistics. Mandaluyong City ANVIL Publishing Inc.: 2007
Johnson, R,R. and Kuby, P. Statistics 2nd Edition. Cengage Learning Singapore, 2013

Link.quipper.com. Accessed July 16, 2020.


https://link.quipper.com/en/organizations/547ff959d2b76d0002001b69/curriculum#curric
ulum.

80
Answer Key
Practice A Practice B
A.1 A.2
1. t = 2.015 1. t = 2.015 1.t = 2.797
2. t = 1.895 E = 1.65 17.33 < µ < 18.67
3. t = 2.201 2. t = 1.861 2. t = 2.093
4. t = 2.797 E = 1.74 13.77 < µ < 18.83
5. t = 3.499 3. t = 2.179 3. t = 2.093
E = 2.72 162.894 < µ < 167.106
4. t = 2.131
E= 1.65
5. t = 2.086
E = 2.276
Practice C Practice D Practice E

1. 𝑃̂ = 0.28 1. A 1. Z = 1.96
𝑞̂ = 0.72 2. A E = 0.02
2. 𝑃̂ = 0.2875 3. C n = 2339.53 or 2340
𝑞̂ = 0.8125 4. C
3. 𝑃̂ = 0.12 5. D 2. Z = 1.96
𝑞̂ = 0.88 6. B E = 0.05
4. 𝑃̂ = 0.156 7. C n = 349.58 or 350
𝑞̂ = 0.844 8. C
9. B
5. 𝑃̂ = 0.138
10. B
̂𝑞 = 0.862

Prepared by

LEDA A. CABANILLA JESSA CHRISTE VENASQUEZ-PINEDA


Teacher III Teacher II

81

You might also like