Statistics and Probability
Statistics and Probability
STATISTICS AND
PROBABILITY
Learning Activity Sheets
Quarter 3: Week 1-8
For you to begin let us all understand that probability distributions can be illustrated or
classified as discrete probability distributions or as continuous probability distributions,
depending on whether they define probabilities associated with discrete variables and
continuous variables.
RANDOM VARIABLE
Example 1
Suppose we toss a fair coin 3 times. The sample space for this chance process is :
{ HHH HHT HTH THH HTT THT TTH TTT}
1
Since there are 8 possible outcomes then probability of each outcome is .
8
Define the variable 𝑋 = “the number of heads obtained”.
What are the outcomes of possible heads?
2
𝑋 =0: TTT
𝑋 =1: HTT , THT , TTH
𝑋 =2: HHT , HTH , THH
𝑋 =3: HHH
The random variable in the activity above is a discrete random variable because the
set of possible outcomes is countable. The possible values of random variable 𝑋 = {0, 1, 2, 3}
PROBABILITY DISTRIBUTION
Probability distribution of a random variable tells all of its possible values along with
their associated probabilities.
Example 1: The random variable X is the number of tails in two tosses of a coin. What are
the probabilities of the possible outcomes? What is the probability distribution of X?
Solution:
Possible outcomes: {HH HT TH TT}
1
Each has probability so the probability distribution is:
4
𝑿 = No. of Tails 0 1 2
1 2 1
𝑷(𝑿)
4 4 4
In Graphical Form:
We can use the probability distribution to answer questions about variable x. In symbols, we
want to find 𝑃(𝑥 ≥ 1). We could add probabilities to find the answer:
3
FINDING POSSIBLE VALUES OF A RANDOM VARIABLE
To understand how one finds the possible values of random variables, study the
example below...
Suppose two cellphones are tested at random. We want to find out the number of
defective cellphones. Thus, to each outcome in the sample space we shall assign a value.
These are 0, 1, or 2. If there is no defective cellphone, we assign the number 0; if there is 1
defective cellphone, we assign the number 1; and 2, if there are two defective cellphones. The
number of defective cellphones is a random variable.
The possible values of this random variable are 0, 1, and 2. Read and analyze the
given situation below.
Illustration:
Let D represent the defective cellphones and N for the non-defective cellphones. If we
let Y be the random variable representing the number of defective cellphones, show the values
of the random variable Y. The table below shows the values of the random variable.
Example 2. Suppose two coins are tossed, let Z be the random variable representing the
number of heads that occur. Find the values of the random variable Z.
1. Determine the sample space. Let H represent head and T for tail
Solution: The sample space for this experiment is: S = {TT, TH, HH, HT}
2. Count the number of heads in each outcome in the sample space and assign this
number to this outcome.
4
ILLUSTRATING A PROBABILITY DISTRIBUTION FOR A DISCRETE RANDOM
VARIABLE AND ITS PROPERTIES
A random variable is a set whose elements are the numbers assigned to the
outcomes of an experiment. It is usually denoted by uppercase letters such as 𝑋, whose
elements are denoted by lower case letters 𝑥1 , 𝑥2 , 𝑥3 and so on.
1. The probability of each value of the random variable must be between or equal to 0
and 1. In symbol 0 < 𝑃(𝑋) < 1.
2. The sum of all the probabilities of all values of the random variable must be equal to 1.
In symbol, we write it as ∑ 𝑃(𝑋) = 1.
Probability Histogram
Step 1: Determine the sample space. Let R represent the red ball and G represent the green
ball.
Solution : The sample space for this experiment is: 𝑆 = { 𝑅𝑅, 𝑅𝐺, 𝐺𝑅, 𝐺𝐺}
Step 2: Count the number of green balls in each outcome in the sample space and assign this
number to this outcome.
RR 0
RG 1
GR 1
GG 2
5
Step 3: Assign probability values 𝑃(𝑌) to each value of the random variable.
• There are 4 possible outcomes, if no green ball occurs once, the probability that
1
we shall assign to the random variable 0 is .
4
• If 1 green ball occurs two times, the probability that we shall assign to the
2 1
random variable 1 is or .
4 2
• If 2 green balls occur once, the probability that we shall assign to the random
1
variable 1 is .
4
Number of Green
Probability P(Y)
Balls Y
1
0
4
1
1
2
1
2
4
Histogram
0.6
0.5
0.4
P(Y)
0.3
0.2
0.1
0
0 1 2
No. of Green Balls (Y)
Example 2
Let 𝑋 be a random variable giving the number of girls in a randomly selected three-
child family. Assuming that boys and girls are equally likely, construct the probability
distribution of 𝑋 and its corresponding histogram.
6
a.
Possible Outcomes Value of the Random Variable X
(No. of Girls)
BBB 0
BBG 1
BGB 1
GBB 1
BGG 2
GBG 2
GGB 2
GGG 3
b. No. of Girls 0 1 2 3
1 3 3 1
Probability 𝑷(𝑿) or 0.125 or 0.375 or 0.375 or 0.125
8 8 8 8
1 3 3 1 8
∑ 𝑃(𝑋) = + + + = =1
8 8 8 8 8
c. Histogram
0.4
0.3
P(X)
0.2
0.1
0
0 1 2 3
7
GENERAL INSTRUCTION: Write your answer on a separate paper
Practice A
Classify whether the given experiment implies a discrete random variable or a
continuous random variable. Write D if discrete and C if continuous.
_____ 1. The temperature of a solution in the laboratory
_____ 2. Collecting data about the heights of students in senior high schools.
_____ 3. Recording the distance travelled by the bus
_____ 4. Surveying about the number of cases due to COVID-19 pandemic in Angeles City
_____ 5. Number of promoted students at the end of school year
Practice B.
Complete the given table by finding the possible values of random variable.
1. In a box there are 2 balls - one white and one yellow. Two balls are picked one at a
time with replacement. Let X be the random variable representing the number of white
balls. Find the values of the random variable X.
2. Four coins are tossed. Let Y be the random variable representing the number of tails
that occur. Find the values of the random variable Y.
8
Practice C.
Determine whether the given values can serve as the values of a probability distribution.
1. 𝑃(1) = 0.05, 𝑃(2) = 1.01, 𝑃(3) = 0.2
3 7 1
2. 𝑃(1) = , 𝑃(2) = , 𝑃(3) =
20 20 2
X 1 5 7 8
1.
1 1 1 1
P(x)
4 8 4 8
X 1 3 5 7
2.
P(x) 0.35 0.25 0.22 0.12
References
Albert, J. R. G. (2008). Basic Statistics for the Tertiary Level (ed. Roberto Padua,
Welfredo Patungan, Nelia Marquez), published by Rex Bookstore.
Handbook of Statistics 1 (1st and 2nd Edition), Authored by the Faculty of the
Institute of Statistics, UP Los Baños, College Laguna 4031
R Pierce, Rod. (3 Feb 2020). "Random Variables". Math Is Fun. Retrieved 24 May 2020
from http://www.mathsisfun.com/data/random-variables.html
Malate, Jose S. Statistics and Probability for Senior High School. Vicarisg Publications
and Trading, Inc, 2017.
Lim, Yvette F., Nocon, R., Nocon, E., Ruivivar, L. Math for Engaged Learning Statistics
and Probability. Sibs Publishing House, Inc. 2016.
9
ANSWER KEY
Practice A Practice B Practice C Practice D
1. 1. Probability
1 C 1. not a Distribution
2. 1
2 C Possible Value of probability 3. 0 ≤ P(x) ≤ 1
3 C Outcomes the distribution 4. ∑ P(x) = 1 and
4 D Random 2. probability 0 ≤ P(x) ≤ 1
5 D Variable Y distribution 5. Sample
3. probability Random variable
(No. Of
Probability values
white distribution or P(x)
balls) 4. not a
probability
WW 2 distribution
YY 0 5. not a
probability
WY 1 distribution
YW 1
Possible value of the random
X = { 0,1,2}
2.
Prepared
JOANAH S. BAUTISTA
Secondary School Teacher I - FGNMHS
10
WEEK 2: COMPUTING PROBABILITY CORRESPONDING TO A GIVEN RANDOM
VARIABLE
Background Information
RANDOM VARIABLES
In short:
𝑋 = {0, 1}
Note:
We could choose Heads = 100 and
Tails = 150 or other values if we want! It
is our choice
So:
• We have an experiment such as tossing coin.
• We give values to each event.
• The set of values is a Random Variable.
A random variable has a whole set of values, and it could take on any of those values,
randomly. We use a capital letter, like X or Y, to avoid confusion with the Algebra type of
variable. A Random Variable’s set of values is the sample space.
Example: Throw a die once
Random Variable X : The score shown on the top face.
X : 1, 2, 3, 4, 5 or 6
Sample space : {1, 2, 3, 4, 5, 6}
PROBABILITY
We can show the probability of any one value using this style:
𝑃(𝑣𝑎𝑙𝑢𝑒 𝑜𝑓 𝑋) = 𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑜𝑓 𝑡ℎ𝑎𝑡 𝑣𝑎𝑙𝑢𝑒
11
Example:
How many heads (H) do you get when we toss three coins?
X = “The number of the Heads” is the random variable
In this case, there could be zero H (if all the coins land tails
up), 1H, 2H or 3H.
But this time the outcomes are NOT all equally likely.
X = “number of heads”
1
• 𝑃(3) =
8
3
• 𝑃(2) =
8
3
• 𝑃(1) =
8
1
• 𝑃(0) =
8
12
Let’s count how often each value occurs, and work out the probabilities:
1
• 2 occurs just once, so 𝑃(2) =
36
2 1
• 3 occurs twice, so 𝑃(3) = or
36 18
3 1
• 4 occurs three times, so 𝑃(4) = or
36 12
4 1
• 5 occurs four times, so 𝑃(5) = or
36 9
5
• 6 occurs five times, so 𝑃(6) =
36
6 1
• 7 occurs six times, so 𝑃(7) = or
36 6
5
• 8 occurs five times, so 𝑃(8) =
36
4 1
• 9 occurs 4 times, so 𝑃(9) = or
36 9
3 1
• 10 occurs thrice, so 𝑃(10) = or
36 12
2 1
• 11 occurs twice, so 𝑃(11) = or
36 18
1
• 12 occurs just once, so 𝑃(12) =
36
Example
1) A card is taken at random from a full pack of 52 playing cards. What is the probability
that the card taken is:
(a) red card?
(b) a queen?
(c) a red ace?
(d) the seven of hearts?
Solution
As each card is equally likely to be drawn from the pack there are 52 equally likely
outcomes.
13
(b) There are 4 queens in the pack, so:
4 1
𝑃(𝑞𝑢𝑒𝑒𝑛) = =
52 13
(c) There are 2 red aces in the pack, so:
2 1
𝑃(𝑟𝑒𝑑 𝑎𝑐𝑒) ==
52 26
(d) There is only 1 seven of hearts in the pack, so:
1
𝑃(𝑠𝑒𝑣𝑒𝑛 𝑜𝑓 ℎ𝑒𝑎𝑟𝑡𝑠) =
52
2) A packet of sweets contains 18 red sweets, 12 green sweets and 10 yellow sweets. A
sweet is taken at random from the packet. What is the probability that the sweet is:
(a) red?
(b) not green?
(c) green or yellow?
Solution
The total number of sweets in the packet is 40, so there are 40 equally likely outcomes
when one is taken at random.
(a) There are 18 red sweets, so:
18 9
𝑝(𝑟𝑒𝑑) = =
40 20
(b) There are 28 sweets that are not green in the packet, so:
28 7
𝑝 (𝑛𝑜𝑡 𝑔𝑟𝑒𝑒𝑛) = =
40 10
(c) There are 22 sweets that are green or yellow in the packet, so:
22 11
𝑝 (𝑔𝑟𝑒𝑒𝑛 𝑜𝑟 𝑦𝑒𝑙𝑙𝑜𝑤) = =
40 20
3) You roll a fair dice 120 times. How many times would you expect to obtain:
(a) a 6?
(b) an even score?
(c) a score of less than 5?
Solution:
1
(a) 𝑃(6) =
6
1
𝐸𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 6𝑠 = × 120 = 20
6
3 1
(b) 𝑃(𝑒𝑣𝑒𝑛 𝑛𝑢𝑚𝑏𝑒𝑟) = =
6 2
1
𝐸𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑒𝑣𝑒𝑛 𝑠𝑐𝑜𝑟𝑒𝑠 = 𝑥 120 = 60
2
4 2
(c) 𝑃(𝑠𝑐𝑜𝑟𝑒 𝑙𝑒𝑠𝑠 𝑡ℎ𝑎𝑛 5) = =
6 3
2
𝐸𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟𝑜𝑓 𝑠𝑐𝑜𝑟𝑒𝑠 𝑙𝑒𝑠𝑠 𝑡ℎ𝑎𝑛 5 = 𝑥 120 = 80
3
A RANGE OF VALUES
We could also calculate the probability that a random variable takes on a range of
values.
Example:
You roll two dice, what is the probability that the sum of the scores is 5, 6, 7, or 8?
In other words: What is 𝑃(5 ≤ 𝑋 ≤ 8)?
14
Solution:
𝑃(5 ≤ 𝑥 ≤ 8) = 𝑃(5) + 𝑃(6) + 𝑃(7) + 𝑃(8)
4+5+6+5
=
36
20
=
36
5
=
9
Answer
1 1 1 1 1
µ = ∑(𝑋) = ∑ 𝑥𝑓(𝑥) = 0 ( ) + 1 ( ) + 2 ( ) + 3 ( ) + 4 ( ) = 2
5 5 5 5 5
For this example, the expected value is equal to a possible value of X. This may not
always be the case. For example, if we flip a fair coin 9 times, how many heads should we
expect? We will explain how to find this later but we should expect 4.5 heads. The expected
value in this case is not a valid number of heads.
The standard deviation of a random variable, X, is the square root of the variance.
σ = SD(X) = √𝑉𝑎𝑟(𝑋) = √σ²
Example
𝑥 0 1 2 3 4
1 1 1 1 1
𝑓(𝑥)
5 5 5 5 5
15
Solution:
σ = SD(X) = √𝑉𝑎𝑟(𝑋) = √σ²
1 1 1 1 1
𝑉𝑎𝑟(𝑋) = [02 ( ) + 12 ( ) + 22 ( ) + 32 ( ) + 42 ( )] − 22 = 6 − 4 = 2
5 5 5 5 5
𝑆𝐷(𝑋) = √2 = 1.4142
Activity 2: Practice calculating and interpreting the mean and standard deviation of a
discrete random variable.
1. Organizers of a concert are limiting tickets sales to a maximum of 444 tickets per
customer. Let T be the number of tickets purchased by a random customer. Here is
the probability distribution of T:
T=# of tickets 1 2 3 4
P(T) 0.1 0.3 0.2 0.4
Calculate the expected value of T.
2. Choose the best interpretation of the mean (or expected value) that you found in the
previous question.
a. Most customers will purchase 3 tickets.
b. A group of 10 customers will purchase exactly 29 tickets.
c. If we look at a large number of customers, then each customer, on average, will
have purchased about 2.9 tickets.
Reference
https://online.stat.psu.edu/stat500/lesson/3/3.2/3.2.1
16
ANSWER KEY
Activity 1
1. 64/125
2. 16/25
3. 2/5
4. 21/50
5. 38/75
Activity 2
1. 2.9 tickets
2. C
3. σ2 = 1.09 ; σ = 1.044
Prepared by
17
WEEK 3: MEAN AND VARIANCE OF A DISCRETE RANDOM VARIABLE
Background Information
The mean or expected value of a probability distribution tells the value of a random
variable that we expect to get if the experiment is done repeatedly. In other words, if the
experiment or process is repeated long enough, there is a likelihood or chance that the
average of the outcomes will begin to approach the expected value or the mean value.
The standard deviation of a probability distribution indicates how far, on the average,
is an observed value of a random variable X from its mean. The smaller the standard deviation,
the more likely that an observed value of the random variable will be close to its mean.
Use these concepts to solve problems involving the mean and the variance of
probability distribution.
Example 1:
The following data show the probability distribution of the number of computers sold
daily in a computer shop during the past several months.
Questions:
a. To find the probability that fewer than three computers will be sold, use X = 0, 1, and 2.
𝑃(𝑋 < 3) = 𝑃(0) + 𝑃(1) + 𝑃(2)
= 0.10 + 0.20 + 0.45
𝑃(𝑋 < 3) = 0.75
18
b. To find the probability that, at most, two computers will be sold, use X = 0, 1, and 2.
c. To find the probability that, at least, four computers will be sold, use X = 4 and 5.
𝑃(𝑋 ≥ 4) = 𝑃(4) + 𝑃(5)
= 0.05 + 0.05
𝑃(𝑋 ≥ 4) = 0.10
d. To find the probability that exactly three computers will be sold, use X = 3.
𝑃(3) = 0.15
X P(X) X • P(X)
0 0.10 0.00
1 0.20 0.20
2 0.45 0.90
3 0.15 0.45
4 0.05 0.20
5 0.05 0.25
𝜇 = ∑[𝑋 • 𝑃(𝑋)]
𝜇 = 2.00
The mean of the probability distribution is 2. This implies that in the long run,
we expect that the average number of computers that will be sold on a given day will
be two.
X P(X) X2 X2 • P(X)
0 0.10 0 0.00
1 0.20 1 0.20
2 0.45 4 1.80
3 0.15 9 1.35
4 0.05 16 0.80
5 0.05 25 1.25
19
𝜎 = √[𝑋2 • 𝑃(𝑋)] − 𝜇2
𝜎 = √5.40 − 4
𝜎 = √1.40
𝜎 = 1.18
The standard deviation of the probability distribution is 1.18. This implies that on the
average, the number of computers sold is 1.18 units from the mean 2.
Example 4:
In a convenient store, the number of tellers (X) busy with customers at 12:00 noon
varies from day to day. Past records indicate that the probability distribution of X is as
follows:
Questions:
1. What is the probability that exactly four tellers are busy at 12:00 noon?
2. What is the probability that, at least, two tellers are busy at 12:00 noon?
3. What is the probability that, fewer than five tellers are busy at 12:00 noon?
4. What is the probability that at least two, but fewer than five tellers are busy at
12:00 noon?
5. What is the mean of the probability distribution?
6. What is the standard deviation of the probability distributions?
Solution:
1. To find the probability that exactly four tellers are busy, use X = 4.
𝑃(𝑋 = 4) = 0.212
2. To find the probability that at least two tellers are busy, use X = 2, 3, 4, 5, and 6.
𝑃(𝑋 ≥ 2) = 𝑃(2) + 𝑃(3) + 𝑃(4) + 𝑃(5) + 𝑃(6)
= 0.078 + 0.155 + 0.212 + 0.262 + 0.215
𝑃(𝑋 ≥ 2) = 0.922
20
3. To find the probability that fewer than five tellers are busy,
use X = 0, 1, 2, 3, and 4.
𝑃(𝑋 < 5) = 𝑃(0) + 𝑃(1) + 𝑃(2) + 𝑃(3) + 𝑃(4)
4. To find the probability that at least two, but fewer than five tellers are busy,
use X = 2, 3, and 4.
𝑃(2 ≤ 𝑋 < 5) = 𝑃(2) + 𝑃(3) + 𝑃(4)
= 0.078 + 0.155 + 0.212
𝑃(2 ≤ 𝑋 < 5) = 0.445
X P(X) X • P(X)
0 0.029 0.000
1 0.049 0.049
2 0.078 0.156
3 0.155 0.465
4 0.212 0.848
5 0.262 1.310
6 0.215 1.290
𝜇 = ∑[𝑋 • 𝑃(𝑋)]
= 0.00 + 0.49 + 0.156 + 0.465 + 0.848 + 1.310 + 1.290
𝜇 = 4.118
The mean of the probability distribution is 4.118. This implies that in the long
run, we expect that the average number of tellers who are busy at 12:00 noon
will be 4.118 or 4.
X P(X) X2 X2 • P(X)
0 0.029 0 0.000
1 0.049 1 0.049
2 0.078 4 0.312
3 0.155 9 1.395
4 0.212 16 3.392
5 0.262 25 6.550
6 0.215 36 7.740
21
𝜎 = √[𝑋2 •𝑃(𝑋)] − 𝜇2
= √19.438 − 16.958
= √2.48
𝜎 = 1.57
For example, your grade in Mathematics is one of the scores in the distribution, you
can predict the location of that score in the distribution and interpret it with regards to the mean
and standard deviation.
22
4. The width of the curve is determined by the standard deviation of the distribution.
5. The tails of the curve are plotted in both directions and flatten out indefinitely along the
horizontal axis. The tails are thus asymptotic to the baseline. A portion of the graph
that is asymptotic to a reference axis or another graph is called an asymptote, always
approaching another but never touching it.
6. The total area under a normal curve is 1. This means that the normal curve represents
the probability, or the proportion, or the percentage associated with specific sets of
measurement values.
To visualize and understand the properties/characteristics of a normal curve, please
refer to figure as shown below.
The shape of a normal curve is based on the two given parameters, the mean and the
standard deviation of the distribution. When comparing two distributions each described by
the normal curve, the following are the three situations based on the said parameters
a. When the means are not equal, but the standard deviations are equal. (µ1 ≠ µ2 ; 𝜎1 = 𝜎2 ),
the curves have a similar shape but centered at different points, as shown in the figure.
23
b. When the means are equal, but the standard deviations are not equal. (µ1 = µ2 ; 𝜎1 ≠ 𝜎2 ),
the curves are centered at the same point but they have different height and spreads. as
shown in the figure.
c. When the means are different and the standard deviations are also different (µ1 ≠ µ2 ; 𝜎1 ≠
𝜎2 ), the curves are centered at different points and vary in shapes, as shown in the figure.
Every normal curve (regardless of its mean or standard deviation) conforms to the
following "empirical rule"
The empirical rule is better known as 68% - 95% - 99.70% rule. This rule states that
the data in the distribution lies within one (1), two (2), and three (3) of the standard deviation
from the mean are approximately 68%, 95%, and 99.70%, respectively. Since the area of a
normal curve is equal to 1 or 100% as stated on its characteristics, there are only a few data
which is 0.30% falls outside the 3-standard deviation from the mean.
24
Example 1:
The scores of the Senior High School students in their Statistics and Probability quarterly
examination are normally distributed with a mean of 35 and a standard deviation of 5.
Questions:
a. What percent of the scores are between 30 to 40?
b. What scores fall within 95% of the distribution?
Solution:
Draw a standard normal curve and plot the mean at the center. Then, add the standard
deviation to the mean once and mark it to the right of the mean. Add twice the standard
deviation to the mean and put it to the right of the first sum. Then, add thrice the standard
deviation to the mean and mark it to the right of the second sum. Do the same to the left.
This time, instead of adding, subtract the standard deviation from the mean.
Answer:
a. The scores 30 to 40 falls within the first standard deviation from the mean.
Therefore, the scores that fall between 30 and 40 is approximately 68% of the
distribution.
b. Since 95% of the distribution lies within 2 standard deviations from the mean, then
the scores corresponding to this area of the distribution are scores from 25 up to
45.
Example 2:
The district nurse of “Ospital ning Angeles” needs to measure the BMI (Body Mass
Index) of the Alternative Learning System students. She found out that the heights of male
students are normally distributed with a mean of 160 cm and a standard deviation of 7 cm.
Find the percentage of male students whose height is within 153 cm to 174 cm.
25
The mean of this problem is 160, it is subtracted by 7 to the left (160 − 7 = 153; 153 −
7 = 146; 146 − 7 = 139) and add 7 to the right. (e.g. 160 + 7 = 167; 167 + 7 = 174 + 7 =
181).
153 cm falls at 1 standard deviation from the mean to the left and the height of 174 cm
falls at 2 standard deviations from the mean to the right.
Therefore, it covers the whole 68% and 13.5%. of the distribution and the sum of it is 81.5%
ANSWER: 81.5% of the male students have a height between 153 cm to 174 cm.
The probabilities that a market inspector will discover violations of the public health code in a
public market are given in the following table.
a. What is the probability that a market inspector will discover, at least, two violations
of the public health code?
b. What is the probability that a market inspector will discover, at most, three
violations of the public health code?
26
c. What is the probability that a market inspector will discover at least two, but fewer
than four violations of the public health code?
d. What is the mean of the probability distribution?
e. What are the variance and the standard deviation of the probability distribution?
The following are the number of cars sold per day at ABC Motor Corporation during
the past 500 working days.
Activity 3: True or False. Write the word TRUE if the statement is correct and FALSE if the
statement is incorrect
27
_______4. The distance of the number in the horizontal line or the x-axis should always
proportional.
_______5 The normal probability distribution has a mean µ = 1 and standard deviation ơ = 0.
Activity 4
Activity 5: Read and analyze the following problem to answer the question.
Our government gave financial support and food, but it is not enough for the people.
Fortunately, we are blessed because the spirit of love and generosity is still present in the
heart of our fellow countrymen. They donated money, food, time, and other essential things.
Suppose you and your friends conducted a donation-drive in your barangay. Each of
the donors gave a certain amount with an average of Php 320.00 and a standard deviation of
Php 60.00. Assuming, it is normally distributed,
1. What percentage of the distribution that the amount falls between Php. 260.00 and
Php 380.00?
2. How much of the donations fall in 95% of the distributions?
3. How much of the donations fall in 99.70% of the distributions?
References
Department of Education. Statistics and Probability: Teacher’s Guide. pp. 117-181.
Belecina, Rene R., Elisa S. Baccay, and Efren B. Mateo. 2016. Statistics and Probability, 1st
ed. Manila: Rex Bookstore Inc.
28
Answer Key
Activity 1 Activity 2 Activity 3
= 1.456 f. µ = 3.056
g. 2 = 12.352
= 3.515
Activity 4 Activity 5
1. 68%
2. 200 to 440
3. 140 to 500
Prepared by:
ERVIN A. PEREZ
Malabanias Integrated School
SHS Department
29
WEEK 4: IDENTIFYING REGIONS UNDER NORMAL CURVE CORRESPONDS TO
DIFFERENT STANDARD NORMAL VALUES
Background Information
We have learned that the total area under the curve is equal to 1. Since the standard
normal distribution is defined as normally distributed random variable having a mean of zero
(µ = 0) and standard deviation of one (𝜎 = 1), the fifty (50) percent of the region under the
curve is below the mean and the other 50% is above the mean. Figure 1, as shown below,
illustrates the standard normal curve.
You can easily identify the area of the regions under the normal curve by using the
Table of Areas under the Normal Curve which is also known as z-table (see attachment at
the last page of this module). This table gives the area of any value of z from -3.99 to 3.99.
The value from this table will describe the area of the specific region of the curve to the left of
the given z-value.
Solution: First, split the given z-value into hundredths. We can find the whole
number and the tenths digit (-1.6) at the left side of the table while the hundredths (0.09)
located at the uppermost of the table. Then the intersection of these numbers will be the
area of the normal curve to the left of the z-value.
30
STEPS IN FINDING THE AREA THAT CORRESPONDS TO Z –VALUE
1. Draw/sketch a normal curve and locate the given z-value on the normal curve
2. Shade the region of the curve according to the condition of z-value whether it is below,
above, or between.
3. Use the table of the area under the normal curve to find the corresponding area.
4.1. When the z-value is to the left or any related terms (e.g. below, less than) just write the
value we obtained in step 3
4.2. When the z-value is to the right or any related terms (e.g. above, greater than), subtract
1 by the obtained value in step 3
4.3. When the shaded region is in between of the two z-value, subtract the biggest by the
smallest value obtained in step 3
Solution:
Step 2: Shade the region of the curve according to the condition of z-value whether it is
below, above, or between.
31
Step 3. Use the table of the area under the normal curve to find the corresponding area
4.1 When the z-value is to the left or any related terms (e.g. below, less than) just write
the value we obtained in step 3 Since the shaded region of the curve is to the left of the
𝑧 = −1.35 and the intersection between -1.3 and 0.05 is 0.0885. Therefore, the area of
the shaded region is 0.0885
32
0.9115
0.9115
0.9115
33
CONVERTING A NORMAL RANDOM VARIABLE TO A STANDARD NORMAL
VARIABLE AND VICE-VERSA
Suppose your score on a test in Probability and Statistics is 39 and the scores are
normally distributed with a mean of 33 and standard deviation of 3, then your score is exactly
2 standard deviations above the mean. If you scored 30, then it is exactly 1 standard deviation
below the mean. All values that are above the mean have positive z-scores and all values that
are below the mean have negative z-scores. If you obtained a z-score of -3, this means that
your score is 3 standard deviations below the mean, that is, 33 – 3(3) = 33 – 9 = 24.
Given any value x from a normal distribution with mean μ and standard deviation σ, to
convert x to a z-score (standard normal score), you need to;
34
Example 1.
Suppose IQ scores are normally distributed with a mean of 100 and standard
deviation of 10. If your IQ is 85, what is your z-score? (Round off your answer to the nearest
hundredths)
Solution:
85 − 100 15
𝑧 = = − = −𝟏. 𝟓𝟎
10 10
Example 2.
On a nationwide placement test that is normally distributed, the mean was 125 and
standard deviation was 15. If you scored 149, what was your z-score? (Round off your
answer to the nearest hundredths)
Solution:
149 − 125
𝑧 =
15
24
= = 𝟏. 𝟔𝟎
15
Example 3.
The heights of teachers in Sta. Catalina National High School are normally
distributed with a mean of 150 cm and standard deviation of 15 cm. The height of Sir Victor
has a z-score of 3.25. What is the actual height of Sir Victor? (Round off your answer to the
nearest hundredths)
Solution:
In this example, z-score is given while the normal random variable is unknown. To
solve for the normal random variable x, multiply the z-score (z) by the standard deviation σ,
then add the mean μ. Therefore;
𝑥 = zσ + 𝜇
𝑥 = 48.75 cm + 150 𝑐𝑚
𝒙 = 𝟏𝟗𝟖. 𝟕𝟓 𝐜𝐦
35
Example 4.
The time it takes for a cell to divide is normally distributed with an average of 60
minutes and standard deviation of 5 minutes. How long will it take for a given cell to divide if
its “mitosis” has a z-score of -1.35?
Solution:
𝑥 = zσ + 𝜇
𝑥 = −1.35(5 min) + 60𝑚𝑖𝑛
𝑥 = −6.75 min + 60 𝑚𝑖𝑛
𝑥 = 𝟓𝟑. 𝟐𝟓 𝐦𝐢𝐧𝐮𝐭𝐞𝐬
Example 5.
Suppose you have the population values 40 and 70 and that their corresponding z –
scores are -1 and 2, respectively.
1. Is it possible to determine the population’s mean and standard deviation?
2. If so, what are these values?
Solution:
To begin, let us first identify the given. Let
𝑥1 = 40 𝑥2 = 70
𝑧1 = −2 𝑧2 = 1
Use these given and substitute to the formula
𝑥1 = 𝑧1 σ + 𝜇 𝑥2 = 𝑧2 σ + 𝜇
40 = −2σ + 𝜇 70 = 1σ + 𝜇
Note: In this example, we do not have 𝜎1 and 𝜎2 , and 𝜇1 and 𝜇2 since we are
describing a single population. Thus, 𝜎1 = 𝜎2 = 𝜎 and 𝜇1 = 𝜇 2 = 𝜇. Furthermore, we need to
review our algebra specifically on systems of linear equation. So,
40 = −2σ + 𝜇 equation 1
70 = 1σ + 𝜇 equation 2
Subtract equation 1 from equation 2, we have
40 = −2σ + 𝜇 equation 1
70 = 1σ + 𝜇 equation 2
30 = 1σ − (−2 σ)
30 =3 σ the difference
30 3𝜎
=
3 3
10= σ
The standard deviation of the population is 10. To obtain the mean, substitute it to either
equation 1 or 2.
40 = −2σ + 𝜇 ⟹ 40 = −2(10) + 𝜇 ⟹ 60 = 𝜇
70 = 1σ + 𝜇 ⟹ 70 = 1(10) + 𝜇 ⟹ 60 = 𝜇
36
COMPUTING PROBABILITIES AND PERCENTILES USING THE STANDARD NORMAL
TABLE
Suppose that we know the probability this time. What is the corresponding z-score?
For example, where is 95% under the normal curve?
There are three important things to remember when we are given probabilities and
we want to know their corresponding z-scores.
Thus, when we are given the area and we wish to find the corresponding z-value, we locate
the given area at the body of the table. If the exact area is not available, we take the nearest
area. Then, we look up the corresponding z-value in the Table of Areas Under the Normal
Curve or z- Table.
A normal distribution curve can be used as a probability distribution curve for normally
distributed variables. The area under the standard normal distribution curve can also be
thought of as a probability. That is, if it's possible to select any 𝑧 value at random, the
probability of choosing one, say, below 1.45 would be the same as the area under the curve
at the left of 1.45. In this case, the area is 0.9265. Therefore, the probability of randomly
selecting a 𝑧 value below of 1.45 is 0.9265 or 92.65%. The problems involving probabilities
and percentiles are solved in the same manner as finding the areas under a normal curve.
37
(f) 𝑃(𝑋 > 𝑥) – probability at the right of a normal random variable 𝑥
(g) 𝑃(𝑎 < 𝑋 < 𝑏) – the probability of a normal random variable 𝑋 that is in between two
other normal random variables, say 𝑎 and 𝑏.
(h) 𝑃(𝑋 < 𝑎) ∪ 𝑃(𝑋 > 𝑏 ) – the probability of 𝑋 is in the opposite direction of two values,
say 𝑎 and 𝑏
Example 1.
Find the probabilities for each of the following.
(a) 𝑃(𝑍 < 1.32)
(b) 𝑃(𝑍 < −1.05)
(c) 𝑃(−0.75 < 𝑍 < 1.56)
(d) 𝑃(𝑍 > −0.88)
Solution:
(a) The probability 𝑃(𝑍 < 1.32) means to find
the area under the normal curve at the left of
𝑧 = 1.32. From the table, find the intersection on
row 1.3 and column 0.02. Thus, the probability
is 𝟎. 𝟗𝟎𝟔𝟔 or 𝟗𝟎. 𝟔𝟔%.
Example 2.
Let 𝑋 be a normal random variable with mean 𝜇 = 15 and standard deviation 𝜎 = 3. Find the
probabilities of the following:
(a) (𝑋 < 19)
(b) (10 < 𝑋 < 19)
Solution:
38
(a) To find the probability, convert the normal random variable 19 to a z-score.
𝑥−𝜇 19−15
Thus, (𝑋 < 19) = 𝑃 (𝑍 < ) = 𝑃 (𝑍 < )
𝜎 3
4
= (𝑍 < ) = 𝑃(𝑍 < 1.33).
3
From the table, the probability at the left of 𝒛 = 𝟏. 𝟑𝟑 is equal to 𝟎. 𝟗𝟎𝟖𝟐 𝒐𝒓 𝟗𝟎. 𝟖𝟐%.
(b) To find the probability, convert the normal random variables 10 and 19 to z values.
10−15 19−15
Thus, 𝑃(10 < 𝑋 < 19) = 𝑃 ( <𝑍< )
3 3
From the table, we have 0.9082 – 0.0475 = 𝟎. 𝟖𝟔𝟎𝟕 𝒐𝒓 𝟖𝟔. 𝟎𝟕%. These probabilities can also
be applied to real-life problems.
Activity 1.
Identify the area under the normal curve of the following standard normal values
using the z table (refer to the last page of this module).
1. 𝑧= −3.25 _____________ 6. 𝑧 = −1.33 _________
2. 𝑧= 1.25 _____________ 7. 𝑧 = −2.5 _________
3. 𝑧= 0.56 _____________ 8. 𝑧 = 0.29 _________
4. 𝑧= 3 _____________ 9. 𝑧 = −0.13 _________
5. 𝑧= 1.96 _____________ 10. 𝑧 = −1.96 _________
Activity 2.
Find the area or region that corresponds to each of the following z values. Sketch
and label each curve.
1. to the right of 𝑧 = 2.18
2. below 𝑧 = −1.40
3. above 𝑧 = 0.57
39
Activity 3.
Supply the needed information by solving the missing value.
1. 𝑥 = 25, 𝜇 = 40, σ = 6, 𝐳 = ?
2. 𝑧 = −2.25, 𝜇 = 218, σ = 36, 𝐱 = ?
3. 𝑥 = 39, 𝜇 = 30, σ = 3, 𝐳 = ?
4. 𝑥 = 125, 𝜇 = 105, σ = 8, 𝐳 = ?
5. 𝑧 = 1.11, 𝜇 = 145, σ = 20, 𝐱 = ?
Activity 4.
Solve the following problems.
1. The mean number of hours a Filipino worker spends on the computer is 3.1 hours per
workday. Assume the standard deviation is 0.5 hour and is normally distributed, how long
does a worker spend on the computer if his z-score is 1.2?
2. Each month, a Filipino household generates an average of 28 pounds of newspaper for
garbage or recycling. Assume the standard deviation is 2 pounds. Determine the z-score of
a household that generates 22 pounds of newspaper.
Activity 5.
A. Use the Table of Standard Normal distribution to determine corresponding
probabilities of the following:
1. 𝑃(𝑍 < −1.45)
2. 𝑃(0.5 < 𝑍 < 2.33)
3. 𝑃(𝑍 > 1.78)
References
40
Answer Key
Activity 1
1. 0.0006 6. 0.0918
2. 0.8944 7. 0.0062
3. 0.7123 8. 0.6141
4. 0.9987 9. 0.4483
5. 0.9750 10. 0.0250
Activity 2
0.0146 0.9322
2. below z= -1.40
0.3667
3. above z= 0.57
0.2843
41
STANDARD NORMAL TABLE: Table Values Represent AREA to the LEFT of Z-score
42
Prepared by:
JELENE V. BASILIO
Sto. Domingo Integrated School
SHS Department
43
WEEK 5: RANDOM SAMPLING
Background Information
A population includes all of its elements from a set of data. The size of the
population is the number of observations in the population. For example, if ABS-CBN
network has 11,000 employees having the required blood type in a certain study, then we
have a population size of 11,000.
Sample consists of one or more data drawn from the population. It is a subset, or an
incomplete set taken from a population of objects or observations. Taking samples instead of
the population is less time-consuming and cost-effective. Although sampling has
advantages, it can also be a source of bias and inaccuracy
44
3. Stratified Random Sampling is used when
you want to ensure that specific charac-
teristics are proportionally represented in the
sample. You split your population into strata
(for example, divided by gender or race), and
then randomly select from each of these
subgroups.
4. Cluster Random Sampling is used when
you are unable to sample from the entire
population. You divide the sample into clus-
ters that approximately reflect the whole
population, and then choose your sample
from a random selection of these clusters.
Parameter
A parameter is a measure that is used to describe the population while statistic is a
measure that is used to describe the sample. To understand more, let us discuss the two
measures. Population mean (µ), population variance (𝜎2) and population standard deviation
(𝜎) are what we called parameters.
Population Mean
The mean is the sum of the data divided by the number of data. The mean is used to
describe where the set of data tends to concentrate at a certain point.
Below are grades in Statistics of Grade 11 students during the third quarter.
94 85 88 79 78
75 89 91 84 77
Let us compute the population mean, population variance, and population standard
deviation.
45
ΣX
To compute the population mean (𝜇), we can use this equation 𝜇 = whereas ∑𝑋 is
𝑁
the summation of all values of grades (x) and N is the total number of population therefore,
840
𝜇= = 84 the computer population mean (𝜇) is 84.
10
Variance and standard deviation determine how to spread or to scatter each data on
the set from the mean.
To compute for the population variance, we apply the formula:
∑(𝑋 − 𝜇)2
𝜎2 =
𝑁
To compute for the population standard deviation, we use the formula:
∑(𝑋 − 𝜇)2
𝜎= √
𝑁
whereas:
𝑿 is the given data,
µ is the population mean
𝑵 is the population size.
Consider the data given above, to solve for the population variance and population
standard deviation, we have this table:
46
To solve for the population variance and population standard deviation we have to
use these equations:
Solution:
ΣX 840
Population mean: 𝜇 = = = 84
𝑁 10
∑(𝑋− 𝜇)2 382
Population variance: 𝜎2 = = = 38.2
𝑁 10
∑(𝑋− 𝜇)2 382
Population Standard Deviation: 𝜎 = √ = √ = √38.2 ≈ 6.18
𝑁 10
STATISTIC
Statistic or sample statistic is any quantity computed from values in a sample which
is considered for a statistical purpose. Sample mean (𝑥̅ ), sample variance (s2) and sample
standard deviation (s) are what we call statistic.
From the previous data of the population, suppose that we randomly select only 7 data
(numbers with check) out of the total 10 data in the population.
✓ 94 85 ✓ 88 ✓ 79 78
75 ✓ 89 ✓ 91 ✓ 84 ✓ 77
Compute the sample mean, sample variance, and sample standard deviation. Here is the
result:
SAMPLE MEAN
The sample mean is the average of all the data of the samples. The symbol 𝑥̅ , (read
as “x bar”) is used to represent the sample mean. To compute for the sample mean 𝑥̅ we can
ΣX
use this equation: 𝑥̅ =
𝑛
Where: 𝑥̅ is the sample mean
∑𝑥 is the summation of x (sum of the measures)
47
Grades in Statistics of Grade 11 Students during the Third Quarter
Population Sample
Student’s number
(Student’s grade) (Student’s grade)
1 94 94
2 85
3 88 88
4 79 79
5 78
6 75
7 89 89
8 91 91
9 84 84
10 77 77
N = 10; n = 7 ∑X = 840 ∑X = 602
Sample variance is the computed variance of the elements of the sample. s2 is used
to represent sample variance.
48
To apply this here’s another example using the previous example.
To solve for the sample variance and sample standard deviation we have to use
these equations:
Sample Mean Sample Variance Sample Standard Deviation
ΣX ∑(𝑥 − 𝑥̅ )2
𝑥̅ = 𝑠2 = ∑(𝑥 − 𝑥̅ )2
𝑛 𝑛−1 𝑠= √
𝑛−1
Solution:
ΣX 602
Sample mean: 𝑥̅ = = = 86
𝑛 7
2 ∑(𝑥− 𝑥̅ )2 236 236
Sample Variance: 𝑠 = = = ≈ 39.33
𝑛−1 7−1 6
∑(𝑥− 𝑥̅ )2 236
Sample Standard Deviation: 𝑠 = √ =√ = √39.33 = 6.27
𝑛−1 7−1
Remember that parameters are for population while statistics are for sample.
49
Example:
The given population is 1, 2, 3, 4, 5 and sample size of 3, using the formula .𝑁 𝐶𝑛
𝑁! 5! 5! 5∙4∙3∙2∙1 20
.𝑁 𝐶𝑛 = .5 𝐶3 = [𝑛!(𝑁−𝑛)!]
= = = (3∙2∙1)(2∙1∙) = = 10
3!(5−3)! 3!2! 2
2. List all the possible random samples and solve for the sample mean of each set of
samples.
Sample Mean
1,2,3 2
1,2,4 2.33
1,2,5 2.67
1,3,4 2.67
1,3,5 3
1,4,5 3.33
2,3,4 3
2,3,5 3.33
2,4,5 3.67
3,4,5 4
3. Construct a frequency and probability distribution table of the sample means indicating
its number of occurrence or the frequency and probability.
Probability P(x)
Sample Means Frequency 𝑭𝒓𝒆𝒒𝒖𝒆𝒏𝒄𝒚
𝑵
1
2 1 = 0.10
10
1
2.33 1 = 0.10
10
2
2.67 2 = 0.20
10
2
3 2 = 0.20
10
2
3.33 2 = 0.20
10
1
3.67 1 = 0.10
10
1
4 1 = 0.10
10
50
1. What do you call the set of all data that researchers want to investigate?
a. data b. population
c. random d. sample
2. Which of the following can be considered as the best sample?
a. conveniently sample b. selected at random
c. purposively selected d. self-selected
3. From the given populations, which of the following samples is most likely to be
representative of the population from which it is drawn?
a. population: SAP beneficiaries b. population: lawmakers
sample: car owners sample: students
c. population: online shoppers d. population: graduating students
sample: cellphone users sample: grade 8 students
4. In random sampling, the sample should be_______ the population
a. as large as b. different from
c. not taken from d. representative of
5. Which of the following is NOT a type of random sampling?
a. cluster b. purposive
c. simple d. systematic
6. Which of the following is random sampling?
a. convenience b. non-probability
c. purposive d. stratified
7. As a researcher, you ask the people of your barangay on how they spent their money
during the community quarantine. You decided to group them according to their
socio-economic status and then proportionately and randomly selecting samples
from these groups. Which type of sampling are you using?
a. convenience b. purposive
c. stratified d. systematic
8. As a researcher, you want to know the plans of graduating senior high school
students. You ask all by-passers to participate in your research. Which type of
sampling are you using?
a. convenience b. purposive
c. stratified d. systematic
9. You wish to examine the effects of community quarantine on your fellow students.
You decided to borrow a list from the guidance office and select every 10th student
on it. What sampling technique are you using?
a. convenience b. purposive
c. stratified d. systematic
10. You want to know the effects of the COVID-19 to the Filipino people and you will ask
the medical frontliners. Which type of sampling will you use?
a. convenience b. purposive
c. stratified d. systematic
Practice B
Choose the letter of the best answer. Encircle the letter of the correct answer.
51
2. What do you call a number that describes a population characteristic?
a. data b. sample
c. parameter d. statistic
3. Which of the following is a statistic?
a. 𝜎2 b. 𝜎
c. µ d. s
For numbers 4-9, the following are scores of students in their quiz:
10, 8, 7, 20, 11, 8, 15, 10, 6, 14, 16, 20
4 – 5. What is the value of µ? (2 points)
a. 10.4 b. 11
c. 12.08 d. 14.4
6 – 7. What is the value of 𝜎? (2 points)
a. 4.6 b. 5
c. 6 d. 5.6
8– 9. What is the value of the population variance? (2 points)
a. 18.7 b. 25
c. 20 d. 21.5
For numbers 10-15, the following are the seven scores of the students above that are
randomly selected: 8, 7, 20, 15, 10, 14, 16
Practice C
Choose the letter of the best answer. Encircle the letter of the correct answer.
1. What distribution pertains to the frequency distribution of the sample mean from all
the possible random samples of a particular sample size n taken from the given
population?
a. frequency b. normal
c. population d. sampling
2. Which of the following is NOT a step-in creating sampling distribution of the sample
mean?
a. Determine the number of sets of all b. Compute for the standard
possible random samples. deviation and variance of the
samples.
c. Construct a frequency distribution d. List all the possible random
table of the sample means and samples and solve for the sample
probability. mean of each set of samples
52
3. Which of the following is the mean of sample 6, 10, 21, 25, and 28?
a. 15.17 b. 18
c. 21.2 d. 22
For numbers 4-9, refer to the following set of data of a population {5, 6, 7, 8}
4. How many different samples of size n = 2 can be drawn from the population?
a. 6 b. 5
c. 4 d. 3
5. Which of the following sample mean appears most frequent in the sampling
distribution?
a. 5 b. 5.5
c. 6 d. 6.5
6. What is the lowest value of the sample mean in this sampling distribution?
a. 5.5 b. 6.5
c. 7 d. 8
7. What is the frequency of the sample mean 7?
a. 0 b. 1
c. 2 d. 3
8. What is the probability of the sample mean 6 in the sampling distribution?
a. 1/6 b. 2/3
c. 1/2 d. 2/5
For numbers 9-10, refer to the following population consists of the data {3,6,9,12,15}.
9. How many different samples of size n = 3 can be drawn from the population?
a. 6 b. 12
c. 8 d. 10
10. Which of the following sample mean appears most frequent in the sampling
distribution?
a. 6 b. 7
c. 8 d. 12
References:
• Statistics and Probability 11, Pivot 4A Learners Material (N/A) Pages 172–206
• https://www.scribbr.com/methodology/simple-random-sampling/
• http://www.stat.yale.edu/Courses/199798/101/sample.htm#:~:text=Simple%20rando
m%20sampling%20is%20the,being%20included%20in%20the%20sample.
53
Answer Key:
Prepared by:
Jeral S. Reyes
Teacher – II
54
WEEK 6: MEAN AND VARIANCE OF SAMPLING DISTRIBUTION OF THE SAMPLE MEAN
Background Information
The sampling distribution of the sample mean is a frequency distribution using the
means computed from all possible random samples of a specific size taken from a population.
MEAN
The mean of the sampling distribution of the mean is the mean of the population from
which the scores were sampled. Therefore, if a population has a mean μ, then the mean of
the sampling distribution of the mean is also 𝜇. The symbol 𝜇𝑥̅ is used to refer to the mean of
the sampling distribution of the mean. Therefore, the formula for the mean of the sampling
distribution of the mean can be written as:
𝜇𝑥̅ = 𝜇
VARIANCE
The variance of the sampling distribution of the mean is computed as follows:
2
𝜎2
𝜎 𝑥̅ =
𝑁
That is, the variance of the sampling distribution of the mean is the population
variance divided by N, the sample size (the number of scores used to compute a mean).
Thus, the larger the sample size, the smaller the variance of the sampling distribution of the
mean.
The standard error of the mean is the standard deviation of the sampling
distribution of the mean. It is therefore the square root of the variance of the sampling
distribution of the mean and can be written as:
𝜎
𝜎𝑥̅ =
√𝑁
Steps Solution
1. Compare the mean of the ∑𝑥
population (𝜇). 𝜇=
𝑁
1+2+3+4+5
=
5
= 3.00
55
2. Compute the variance of the 𝑥 𝑥−𝜇 (𝑥 − 𝜇)2
population (𝜎 2 )
1 -2 4
2 -1 1
3 0 0
4 1 1
5 2 4
∑(𝑥 − 𝜇)2 = 10
∑(𝑥 − 𝜇)2
𝜎2 =
𝑁
10
=
5
= 2
1,2 1.50
1,3 2.00
1,4 2.50
1,5 3.00
2.3 2.50
2,4 3.00
2,5 3.50
3,4 3.50
3,5 4.00
4,5 4.5
56
5. Construct the sampling
Distribution of the sample Sampling Distribution of Sample Mean
mean Sample Frequency Probability
Mean ( ) P( )
1.50 1 1
10
2.00 1 1
10
2.50 2 1
5
3.00 2 1
5
3.50 2 1
5
4.00 1 1
10
4.50 1 1
10
Total 10 1.00
∑ 𝑋̅ ∙ 𝑃(𝑋̅) = 3.00
57
7. Compute the variance of the ̅
𝒙 ̅)
𝑷(𝒙 ̅−𝝁
𝒙 ̅ − 𝝁)𝟐
(𝒙 ̅ − 𝝁)𝟐
̅) ∙ (𝒙
𝑷(𝒙
sampling distribution of the
1
sample means 𝜎 2 𝑥̅ . 1.50 -1.50 2.25 0.225
10
1
Follow these steps: 2.00 -1.00 1.00 0.100
10
1
a. Subtract the population 2.50 -0.50 0.25 0.050
5
mean (𝜇). Label it as 𝑥̅ − 𝜇. 1
3.00 0.00 0.00 0.000
5
b. Square the differences. 1
3.50 0.50 0.25 0.050
Label this column as 5
(𝑥̅ − 𝜇)2 1
4.00 1.00 1.00 0.100
10
1
c. Multiply the results in (b) by 4.50 1.50 2.25 0.225
10
each corresponding
probability. 𝑃(𝑥̅ ) ∙ (𝑥̅ − 𝜇)2 Total 1.00 0.750
𝜎 2 𝑥̅ = 0.75
𝜎𝑥̅ = 0.85
Example 2: Describing the sampling Distribution of the sample means from an Infinite
Population
Steps Solution
1. Identify the given information Here 𝜇 = 60, 𝜎 = 5 and 𝑛 = 16.
2. Find the mean of the sampling 𝜇𝑥̅ = 𝜇
distribution. Use the property that
𝜇𝑥̅ = 60
𝜇𝑥̅ = 𝜇
3. Find the standard deviation of the 𝜎
𝜎𝑥̅ =
sampling distribution. Use the √𝑁
𝜎
property that 𝜎𝑥̅ = 5
√𝑁 =
√16
5
=
4
𝜎𝑥̅ = 1.25
58
Observe that the mean of the sampling distribution of the sample mean is always equal
to the mean of the population. The variance of the sampling distribution is obtained by using
𝜎 2 𝑁−𝑛
the formula, 𝜎 2 𝑥̅ = ( ). This formula holds when the population is finite.
𝑛 𝑁−1
A finite population is one that consists of a finite or fixed number of elements,
measurements, observations; while an infinite population contains, hypothetically at least,
infinitely elements.
The standard deviation (𝜎𝑥̅ ) of the sampling distribution of the sample means is also
known as the standard error of the mean. It measures the degree of accuracy of the sample
mean (𝜇𝑥̅ ) as an estimate of the population mean (𝜇).
A good estimate of the mean is obtained if the standard error of the mean is smaller or
close to zero, while a poor estimate, if the standard error of the mean is large.
The definition of the sampling distribution of the sample mean for the normal population
when the variance is known or unknown is used to determine the probability value of a certain
event for small and large samples. This serves as a tool for statisticians or any interesting
group who wants to test the sample mean using statistical formulas and to make a rightful
decision in the future. If the population standard deviation is known, use the normal distribution
and if the population standard deviation is unknown, use t-distribution.
Example 3: Read and analyze the following situations. Determine whether the following
statements have a known or unknown population variance.
a. Because of Inclusive Education, learners with disabilities are also part of the normal
students. Consider a population of the PWD learners consisting of 1, 2, 3, 4, and 5.
Samples of size 2 are drawn from this population.
b. Given 𝜇 = 15, 𝑠 = 6 𝑎𝑛𝑑 𝑛 = 16.
c. IQ scores are normally distributed with standard deviation 𝜎 = 15 𝑝𝑜𝑖𝑛𝑡𝑠. You wish to
determine the 𝜇 IQ of all students in Matiyaga National High School by taking a random
sample of 16 students.
d. The general weighted average (GWA) of the Grade 7 applicants for admission in
QSHS over the past 7 years has been consistently 𝜇 = 92.78. This year a sample data
of the applicant’s GWA are 98, 89, 92, 95, 96, 90, 91, 92, 97 and 93
e. A finite population composed of six items whose values are 2, 3, 6, 7, 8, and 10.
Samples of 3 items are drawn at random without replacement.
Solution:
a. Known – This is a population data. Although the value of the variance is not given, you can
still determine the population mean and population variance of the data using the formulas 𝜇 =
∑𝑋 ∑(𝑥−𝜇)2
, 𝜎2 = respectively. So, this situation is an example of the sampling distribution when
𝑁 𝑁
𝜎
the variance is known. Since the population variance can be calculated, used 𝜎𝑥̅ = in the
√𝑛
computation of the standard error of the mean.
59
b. Unknown- The population variance is unknown. The given values are the population mean
𝜇 = 15, the sample standard deviation 𝑠 = 6, and the sample size 𝑛 = 16. Use the formula 𝑠𝑥̅ =
𝑠
for the standard error of the mean.
√𝑛
c. Known- the population variance is given with the value of the square of the standard
𝜎
deviation 𝜎 2 = 152 . Apply the formula 𝜎𝑥̅ = in computing for the standard error of the mean.
√𝑛
d. Unknown- The population variance is unknown. The only given values are the population
𝑠
mean 𝜇 = 92.78 and the sample size 𝑛 = 10. Use the formula 𝑠𝑥̅ = for the standard error of
√𝑛
the mean.
e. Known – This is a population data, you can determine the population mean and variance of
∑𝑋 ∑(𝑥−𝜇)2
the data using the formulas 𝜇 = , 𝜎2 = . So, this situation is an example of the
𝑁 𝑁
sampling distribution where the population variance can be computed. Apply the formula 𝜎𝑥̅ =
𝜎
to solve
√𝑛
Practice B
Given the population 1,4,5,7, 9. Suppose sample size of 3 are drawn from this population.
1. What is the mean (𝜇), variance (𝜎 2 ) and standard deviation (𝜎) of the population?
2. How many different samples of size n = 3 can be drawn from the population? List them
with their corresponding means.
3. Construct the sampling distribution of the sample mean.
4. What is the mean (𝜇𝑥̅ ) of the sampling distribution mean? Compare to the mean of the
population.
5. What is the variance (𝜎 2 𝑥̅ ) and standard (𝜎𝑥̅ ) of the sampling distribution of the sample
means?
60
Practice C
Solve the following problems. Assume that the population is infinite in each case.
1. The scores of individual students on a national test have a normal distribution with
mean 18.6 and standard deviation 5.9. At Federico Ramos Rural High School, 76
students took the test. If the scores at this school have the same distribution as national
scores, what are the mean and standard of the sample mean for 76 students?
2. A molding machine prepares a certain kind of car spare part with a target diameter 𝜇 =
40.265 millimeters. The machine has some variability, so the standard deviation of the
diameter is 𝜎 = 0.004 millimeters. A sample of 6 spare parts is inspected each hour
for process control purposes and records are kept of the sample mean diameter. What
will be the mean and standard deviation of the numbers recorded?
Practice D
Read and analyze the following situations. Determine whether the following statements have
a known or unknown population variance. Write your answer on the space provided before
the number.
_________1. A population has a mean of 85 and a standard deviation of 6. A random sample
of 22 measurements is drawn from this population.
_________2. The weights of female SHS students are normally distributed with a mean of
45kg. Twenty samples consisting of 10 students are drawn from the population and found to
have a mean of 46 kg and a standard deviation of 1.24kg. 232
61
References
Belecina, Rene, Baccay Elisa and Mateo Efren: Statistics and Probability,Manila, Philippines:
REX Book Store, 2016
Alferez, Merle and Duro, Ma. Cecilia: Statistics and ProbabilityQuezon City, Philippines :MSA
Publishing House
Pineda, Hermenegildo, Statistics and Probability, Cainta, Rizal, PhilippinesPublished by the
Department of Education
http://onlinestatbook.com/2/sampling_distributions/samp_dist_mean.html
https://stattrek.com/sampling/sampling-distribution.aspx
Answer key
Practice A
1. 45
2. 120
3. 36
4. 56
5. 35
Practice B
1. 𝜇 = 5.2
𝜎 2 = 7.36
𝜎 = 2.71
2. 10 samples
Samples Mean
1,4,5 3.33
1,4,7 4.00
1,4,9 4.67
1,5,7 4.33
1,5,9 5.00
1,7,9 5.67
4,5,7 5.33
4,5,9 6.00
4,7,9 6.67
5,7,9 7.00
62
Sampling Distribution of Sample Mean
̅)
Sample Mean (𝑿 Frequency ̅)
Probability P(𝑿
1
3. 1.50 1
10
1
2.00 1
10
1
2.50 2
5
1
3.00 2
5
1
3.50 2
5
1
4.00 1
10
1
4.50 1
10
Total 10 1.00
4. 𝜇𝑥̅ = 5.2
The sampling distribution of the sample mean is equal to the mean of the population
5. 𝜎 2 𝑥̅ = 1.23
𝜎𝑥̅ = 1.11
Practice C
1. 𝜇𝑥̅ = 18.6
𝜎𝑥̅ = 0.68
2. 𝜇𝑥̅ = 40.265
𝜎𝑥̅ = 0.0016
Practice D
1. Known
2. Unknown
3. Known
4. Unknown
5. Unknown
Practice E
1. 𝜇 = 7.00
𝜎 2 = 5.67
𝜎 = 2.38
63
2.
Samples Mean
3, 5, 7,8,9 6.4
3,5,7,8,10 6.6
3,5,7,9,10 6.8
3,5,8,9,10 7.0
3,7,8,9,10 7.4
5,7,8,9,10 7.8
3.
Sample Probability
̅) Frequency ̅)
Mean (𝑿 P(𝑿
3, 5, 7,8,9 1 1/6
3,5,7,8,10 1 1/6
3,5,7,9,10 1 1/6
3,5,8,9,10 1 1/6
3,7,8,9,10 1 1/6
5,7,8,9,10 1 1/6
Total 10 6/6 =1.00
4. 𝜇𝑥̅ = 7.00
The sampling distribution of the sample mean is equal to the mean of the population
Prepared :
64
WEEK 7 : CENTRAL LIMIT THEOREM AND PROBLEMS INVOLVING SAMPLING
DISTRIBUTIONS OF THE SAMPLE MEAN
Background Information
Many common statistical procedures require data that approximately follows a normal
distribution. However, normally distributed data isn’t always the case. There are cases wherein
the population is skewed. This is where the Central Limit Theorem becomes useful.
Many statistics have distributions that are approximately normal for large sample sizes,
even when we are sampling from a distribution that is not normal. This means that we can
develop statistical inference procedures that are based on the normal distribution, even if we
are sampling from a population that is not normal provided, we have a large sample size.
Solution:
It is not given that the population is normally distributed but since 𝑛 > 30, then you can
assume that the sampling distribution of the mean ages of 100 barangay residents is normal
according to the Central Limit Theorem.
The Central Limit Theorem describes the normality of the distribution of the sample
mean taken from a population that is not normally distributed.
65
Step 2: Identify what is asked.
𝑃(𝑥 < 35)
35−34
𝑧 = 4
√100
1
= 4
10
= 𝟐. 𝟓
We shall find 𝑃(𝑋 < 35) by getting the area under the normal curve.
𝑃(𝑋 < 35) = 𝑃(𝑧 = 2.5)
= 0.5 + 0.4938
= 𝟎. 𝟗𝟗𝟑𝟖
Example A.
The mean it takes a group of senior high school students to complete a certain
examination is 50.6 minutes. The standard deviation is 6 minutes. Assume that the variable is
normally distributed.
1. What is the probability that a randomly selected senior high school student will complete
the examination in less than 48 minutes?
Solution:
Step 1. Identify the given information:
𝜇 = 50.6, 𝜎 = 6, X = 48
66
Step 4. Solve the problem
𝑋− 𝜇
𝑧=
𝜎
48− 50.6
𝑧=
6
𝑧 = − 𝟎. 𝟒𝟑
Find 𝑃(𝑋 < 48) by getting the area under the normal curve.
𝑃(𝑋˂48) = 𝑃(𝑧˂ − 0.43)
= 0.5 − 0.1664
= 𝟎. 𝟑𝟑𝟑𝟔
2. If 49 randomly selected senior high school students take the examination, what is the
probability that the mean time it takes the group to complete the test will be less than 48
minutes?
Solution
Step 1: Identify the given information:
𝜇 = 50.6, 𝜎 = 6, 𝑋̅ = 48, 𝑛 = 49
67
Step 5 : State your final answer.
The probability that 49 randomly selected senior high school students will complete
the test in less than 48 minutes is 0.0012 or 0.12%
3. If 49 randomly selected senior high school students take the examination, what is the
probability that the mean time it takes the group to complete the test will be more than 51
minutes?
Solution
Step 1: Identify the given information:
𝜇 = 50.6, 𝜎 = 6, 𝑋̅ = 51, 𝑛 = 49
4. If 49 randomly selected senior high school students take the examination, what is the
probability that the mean time it takes the group to complete the test is between 47.8 and 53
minutes?
Solution
Step 1: Identify the given information:
𝜇 = 50.6, 𝜎 = 6, 𝑋̅ = 47.8 and 53, 𝑛 = 49
68
The problem is dealing with data about the sample mean or n observations, so the
𝑋̅ −𝜇
formula to be used to standardize 47.8 and 53 is 𝑧 = 𝜎
√𝑛
47.8−50.6 53−50.6
= 6 = 6
√49 √49
= −𝟑. 𝟐𝟕 = 𝟐. 𝟖
Find 𝑃(𝑋̅ < 47.8) by getting the Find 𝑃(𝑋̅ < 53) by getting the
area under the normal curve. area under the normal curve
𝑃(𝑋̅ < 47.8) = 𝑃(𝑧 < −3.27) 𝑃(𝑋̅ < 53) = 𝑃(𝑧 < 2.8)
= 0.5 − 0.4995 = 0.5 + 0.4974
= 𝟎. 𝟎𝟎𝟎𝟓 = 𝟎. 𝟗𝟗𝟕𝟒
Note: We use 𝑃(𝑋̅ < 47.8) instead of 𝑃(𝑋̅ > 47.8) so that we can subtract on the later part
To find the probability that 49 randomly selected senior high school students will
complete the test between 47.8 and 53 minutes, subtract the smaller area from the bigger
area under the normal curve.
69
Example B
The average number of milligrams (mg) of cholesterol in a cup of a certain brand of ice
cream is 660mg, and the standard deviation is 35mg. Assume the variable is normally
distributed.
a. If a cup of ice cream is selected, what is the probability that the cholesterol content
will be more than 670mg?
Solution
Step 1: Identify the given information
µ = 660, 𝜎 = 35, 𝑋 = 670
Step 2: Identify what is asked.
𝑃(𝑋 > 670)
Step 3: Identify the formula to be used.
Here we are dealing with an individual data obtained from the population. So, we
𝑿− 𝝁
will use the formula 𝒛 = to standardize 670.
𝝈
b. If a sample of 10 cups of ice cream is selected, what is the probability that the mean
of the sample will be larger than 670 mg?
Solution
Step 1: Identify the given information
µ = 660, 𝜎 = 35, 𝑋̅ = 670, 𝑛 = 10
70
670−660
= 35
√10
= 𝟎. 𝟗𝟎
We shall find 𝑃(𝑋̅ > 670) by getting the area under 0.1841
the normal curve.
𝑃(𝑋̅ > 670) = 𝑃(𝑧 > 0.90)
= 0.5 − 0.3159
= 𝟎. 𝟏𝟖𝟒𝟏
0.90
Step 5: State your final answer
Therefore, the probability that the mean cholesterol content of 10 randomly selected
cups of ice cream will be more than 670 mg is 0.1841 or 18.41%.
71
8. Which formula can standardize an individual data?
𝑥̅ −𝜇 𝑥−𝜇 𝑥−𝜇
A. 𝜎 B. 𝑛𝜋(1 − 𝜋) C. D.
𝜎 𝑛
√𝑛
9. Which formula is applicable when dealing with data about the sample means?
𝑥̅ −𝜇 𝑥−𝜇 𝑥−𝜇
A. 𝜎 B. 𝑛𝜋(1 − 𝜋) C. D.
𝜎 𝑛
√𝑛
10. The number of total carbohydrates in an order of camote fries in a native restaurant is
approximately normally distributed with a mean of 240 and a standard deviation of 20. What
is the probability that a randomly selected order has at least 265 total carbohydrates?
A. 54.13% B. 34.67% C. 21.56% D. 10.56%
1. µ = 60, 𝜎 = 8, and 𝑋 = 64
2. µ = 52.9 𝜎 = 7.2 𝑋 = 48.5
3. µ = 12.5, 𝜎 = 5, 𝑋̅ = 11.8 and 𝑛 = 50
4. µ = 62, 𝜎 = 6, 𝑋̅ = 59, and 𝑛 = 30
5. µ = 245, 𝜎 = 20, 𝑋̅ = 248, and 𝑛 = 25
References:
Rene R. Belecina, et.al, 2016. “Statistics and Probability.” First Edition. pgs.126-132
Ricardo B. Banigon, et.al., 2016. “Statistics and Probability for Senior High School”. pgs.52-54
72
Answer Key
Activity A Activity B
1. D 6.D 1. 0.5
2. A 7. B 2. -0.61
3. C 8. C 3. -0.99
4. B 9. A 4. -2.74
5. B 10. D 5. 0.75
Activity C
1. Step 1: µ = 121, 𝜎 = 5, 𝑋̅ = 120, 𝑛 = 50 2. Step 1: µ = 65, 𝜎 = 5, 𝑋̅ = 64 & 67, 𝑛 = 60
Step 2: P(𝑋̅ < 120) Step 2: P(64 < 𝑋̅ < 67)
𝑥̅ −𝜇 𝑥̅ −𝜇
Step 3: 𝑧 = 𝜎 Step 3: 𝑧 = 𝜎
√𝑛 √𝑛
Step 4: 𝑧 = −1.41 Step 4: 𝑧 = −1.55 and 3.1
Step 5: P(𝑋̅ < 120) = 0.0793 or 7.93% Step 5: P(64 < 𝑋̅ < 67) = 0.9384 or 93.84%
3.a. 3.b.
Step 1: µ = 46.2, 𝜎 = 8, 𝑋 = 43 Step 1: µ = 46.2, 𝜎 = 8, 𝑋̅ = 43, 𝑛 = 50
Step 2: 𝑃(𝑋 < 43) Step 2: 𝑃(𝑋̅ < 43)
𝑋− 𝜇 𝑥̅ −𝜇
Step 3: 𝑧 = Step 3: 𝑧 = 𝜎
𝜎
Step 4: 𝑧 = −0.40 √𝑛
Prepared:
JOSEL L. DIZON
Senior High School Teacher III
Angeles City Senior High School
73
WEEK 8: T-DISTRIBUTION AND ESTIMATION OF PARAMETERS
Background Information
According to W.S. Gosset in 1908, even the standard deviation was unknown, the
distribution of the sample means was symmetric and similar to the normal distribution. He
stated that as the sample size increases, the distribution as well approaches the standard
normal distribution.
Definition:
The t-distribution is a probability distribution that is used to estimate population
parameters when the sample size is small (n ≤ 30). It can also be used when the population
is unknown.
If the distribution of a random variable x is approximately normal, then the formula
𝑋̅−µ
𝑡 = 𝑠
√𝑛
Characteristics of a t-distribution
74
The critical values of 𝑡 allows us to make reliable inferences about the population
mean. The t-distribution can also be used to test hypotheses concerning population mean and
sample mean, and the difference between two means. However, we need to make sure that
the sample meets the following conditions:
1. The sample is small (𝑛 ≤ 30) and randomly selected.
2. The population standard deviation σ is unknown.
3. The parent population, the population from which you are sampling, is essentially
normal.
For a single group of samples, the degrees of freedom (𝑑𝑓) is equal to the sample
size n minus one:
𝑑𝑓 = 𝑛 − 1
For other situations, the degrees of freedom can be calculated differently. The degrees
of freedom may change for different applications and, since they specify a correct t-distribution
to use, it is important to calculate the degrees of freedom correctly. For example, when dealing
with two groups of samples, the degrees of freedom is calculated as
𝑑𝑓 = 𝑛1 + 𝑛2 − 2
75
Examples:
1. Find the 99th percentiles of t-distribution with 4 degrees of freedom.
df = 4
t = 4.604
2. Find the 90th percentiles of t- distribution with 6 degrees of freedom.
df = 6
t = 1.943
3. n = 21 , 99% confidence.
t = 2.845
4. n = 24, 95% confidence
df = 24 – 1 = 23
t = 2.069
5. df = 15, 90% confidence
t = 1.753
𝑛 2
(∑𝑖=1 𝑥)
∑𝑛 𝑋−𝑋
√ 𝑖=1
̅ √∑𝑛𝑖=1 𝑥𝑖 2 − 𝑛
𝑆 = 𝑛−1
or 𝑠= 𝑛−1
-3 -2 -1 0 1 2 3
Example:
1. The 28 employees of XYZ Company has a mean travel time of 14.3 kilometer to their
work. The standard deviation of the travel time is 2km. determine the 95% confidence
interval population mean.
𝑠 𝑠
Since the population standard deviation is not given then, 𝑋̅ − 𝑡 (√𝑛 ) < µ < 𝑋̅ + 𝑡 ( ) will
√𝑛
be applied.
76
Given: 𝑋̅ = 14.3 𝑑𝑓 = 27
𝛼
𝑠 = 2 = 0.025
2
𝑛 = 28 𝑡 = 2.052
Solution:
2 2
14.3 − 2.052 ( ) < µ < 14.3 + 2.052 ( )
√28 √28
14.3 − 2.052(0.378) < µ < 14.3 − 2.052(0.378)
𝑝̂𝑞̂
𝐸 = 𝑧𝑎 √
2 𝑛
𝑝̂ 𝑞̂ 𝑝̂ 𝑞̂
𝑝̂ − 𝑧𝑎 √ < 𝑝 < 𝑝̂ + 𝑧𝑎 √
2 𝑛 2 𝑛
𝑥
where: 𝑝̂ =
𝑛
𝑞̂ = 1 − 𝑝̂
Example:
1. There are 55 college students chosen randomly from 450 enrolled in the first semester
who wish to enroll a summer program in their major subjects. Estimate the population
proportion of students that 11% will take summer class. Use 95% confidence level.
77
Then,
𝑝̂ 𝑞̂ 𝑝̂ 𝑞̂
𝑝̂ − 𝑧√ < 𝑝 < 𝑝̂ + 𝑧√
𝑛 𝑛
0.1222(0.8778) 0.1222(0.8778)
0.1222 − 1.96√ < 𝑝 < 0.1222 + 1.96√
450 450
0.1222 − 1.96(0.015439283) < 𝑝 < 0.1222 + 1.96(0.015439283)
0.1222– 0.0302360995 < 𝑝 < 0.1222 + 0.0302360995
0.092 < 𝑝 < 0.1524
9.2% < 𝑝 < 15.24%
Therefore an estimated 11% will enroll this summer is correct since it is included
within the confidence interval estimate.
2. An LTO researcher wants to know the percentage of drivers using GPS systems in
their car. He wishes to be 99% confident that the estimate is within 2 percentage points
of the proportion. A recent study of 180 drivers showed that 25% used GPS systems.
How large is the sample size?
Given: 𝑝̂ = 0.25 , 𝐸 = 2% = 0.02
𝑞̂ = 1 − 0.25 = 0.75 𝑧𝑎 = 0.005
2
Solution:
Derived the formula for n
𝑝̂𝑞̂
𝐸 = 𝑧𝑎 √
2 𝑛
𝐸 𝑝̂𝑞̂
=√
𝑧𝑎 𝑛
2
2
𝐸 𝑝̂𝑞̂
( ) =
𝑧𝑎 𝑛
2
2
𝐸
𝑛= 𝑝̂ 𝑞̂ ( )
𝑧𝑎
2
0.005 2
𝑛 = 0.25(0.75) ( )
0.02
𝑛 = 0.1875(129)2
𝑛 = 0.1875(16,641)
𝑛 = 3,120.1875
Therefore, there are 3121 drivers considered in the study.
78
Practice A.
1. Give the confidence coefficient of the following, using the t-table.
1. n = 6, 90% confidence
2. df = 7, 90% confidence
3. n = 12, 95% confidence
4. df = 24, 99% confidence
5. n = 8, 99% confidence
Practice C
Find 𝑝̂ and 𝑞̂ given x and n.
1. x = 28, n = 100
2. x = 45, n = 240
3. x = 120, n = 1000
4. x = 234, n = 1500
5. x = 318, n = 2300
Practice D
Choose the letter of the correct answer
1. When x = 50 and n = 215, what is the value of 𝑝̂ ?
a. 0.23 b. 4.3 c. 50 d. 215
2. When x = 120 and n = 406, what is the value of 𝑞̂
a. 0.70 b. 20.3 c. 120 d. 406
3. Given: n = 20, s = 1.2 and 𝛼 = 0.1, the maximum error E is approximately
a. 0.33 b. 0.39 c. 0.45 d. 0.50
4. Which of the following illustrates confidence level?
a. 32.8 b. 30.39 c. 90% d. 90%-94%
79
For nos. 5 – 10.
The mean gasoline consumption of 10 cars is 28 liters with a standard deviation of 1.6 liters.
Find the point and the interval estimates using the confidence level.
5. What is α?
a. 95% b. 28 c. 1.6 d. 0.05
6. What is the point estimate of µ?
a. 95% b. 28 c. √10 d. 1.6
7. What is the appropriate distribution?
a. p b. s c. t d. z
8. what are the confidence coefficient?
a. +−2.58 b. +
−2.33 c. +
−2.26 d. +
−1.96
9. The lower limit is approximately
a. 27 b. 26.85 c. 26.81 d. 26.68
10. The upper limit is
a. 29 b. 29.15 c. 19.19 d. 29.32
Practice E
Solve the following problem
1. Mary wants to know, with 95% confidence, the proportion of households who like to
use detergent X. a previous survey showed that 42% like to use detergent X. Mary like
to be accurate within 2% of the true proportion. What sample size does Mary need?
2. The Dean of a college wants to use the proportion of a population to determine the
sample size needed to interview regarding their thoughts about the new school
structure. She wants to be able to assert with a probability 0.95 that her error will be
at most 0.05. Similar polls in the past showed that 65% approved the new structure.
How large a sample does the Dean need?
References
Blay, B.E. Elementary Statistics. Mandaluyong City ANVIL Publishing Inc.: 2007
Johnson, R,R. and Kuby, P. Statistics 2nd Edition. Cengage Learning Singapore, 2013
80
Answer Key
Practice A Practice B
A.1 A.2
1. t = 2.015 1. t = 2.015 1.t = 2.797
2. t = 1.895 E = 1.65 17.33 < µ < 18.67
3. t = 2.201 2. t = 1.861 2. t = 2.093
4. t = 2.797 E = 1.74 13.77 < µ < 18.83
5. t = 3.499 3. t = 2.179 3. t = 2.093
E = 2.72 162.894 < µ < 167.106
4. t = 2.131
E= 1.65
5. t = 2.086
E = 2.276
Practice C Practice D Practice E
1. 𝑃̂ = 0.28 1. A 1. Z = 1.96
𝑞̂ = 0.72 2. A E = 0.02
2. 𝑃̂ = 0.2875 3. C n = 2339.53 or 2340
𝑞̂ = 0.8125 4. C
3. 𝑃̂ = 0.12 5. D 2. Z = 1.96
𝑞̂ = 0.88 6. B E = 0.05
4. 𝑃̂ = 0.156 7. C n = 349.58 or 350
𝑞̂ = 0.844 8. C
9. B
5. 𝑃̂ = 0.138
10. B
̂𝑞 = 0.862
Prepared by
81