The Normal Distribution and
Other Continuous Distributions
Goals
Describe the characteristics of the normal
distribution
Translate normal distribution problems into
standardized normal distribution problems
Find probabilities using a normal distribution table
Evaluate the normality assumption
Recognize when to apply the uniform and
exponential distributions
Probability Distributions
Probability
Distributions
Discrete Continuous
Probability Probability
Distributions Distributions
Binomial Normal
Poisson Uniform
Hypergeometric Exponential
Continuous Probability Distributions
A continuous random variable is a variable that
can assume any value on a continuum (can
assume an uncountable number of values)
thickness of an item
time required to complete a task
temperature of a solution
height, in inches
These can potentially take on any value,
depending only on the ability to measure
accurately.
The graphical form of the probability distribution for a
continuous random variable forms a smooth curve.
This curve, which is a function of X, is called a
probability density function (pdf). It is denoted by
f(x).
Two requirements must be satisfied by all
probability distributions for continuous random
variables.
f(y) > 0 for all possible intervals of y.
. f ( y ) dy 1
Definition
X is a continuous random variable if there is a
function f(x) so that for any constants a and b,
Example
Calculation of the expected value
Discrete case:
E( X ) = ∑ x i p(x i )
all x
Continuous case:
E( X )
all x
xi p(xi )dx
Example
Solution
Example
The random variable X is given by the following
PDF.
Check that this is a valid PDF and calculate the
mean of X.
Variance
Discrete case:
Var ( X ) 2
(x )
all x
i
2
p(xi )
Continuous case:
Var ( X ) ( xi ) p( xi )dx
2 2
The Normal Distribution
Probability
Distributions
Continuous
Probability
Distributions
Normal
Uniform
Exponential
The Normal Distribution
The normal distribution was introduced by the French mathematician Abraham
DeMoivre in 1733. DeMoivre, who used this distribution to approximate
probabilities connected with coin tossing, called it the exponential bell-shaped
curve. Its usefulness, however, became truly apparent only in 1809, when the
famous German mathematician Karl Friedrich Gauss (1777-1855) used it as an
integral part of his approach to predicting the location of astronomical entities. As
a result, it became common after this time to call it the Gaussian distribution.
During the mid-to late 19th century, it became accepted that it was “normal” for
any well-behaved data set to follow this curve. As a result, following the lead of
the British statistician Karl Pearson, people began referring to the Gaussian
curve by calling it simply the normal curve.
Normal probability distribution
• The normal distribution is the most important of all the
distributions.
• Its graph is bell-shaped.
• psychology, business, economics, natural sciences, nursing,
mathematics
• marks on a test
• IQ scores
• real estate prices
• the height of a man
• blood pressure
• errors in measurements
Properties of the Normal Distribution
It is bell-shaped.
Properties of the Normal Distribution
The normal curve is symmetrical about the mean μ.
Properties of the Normal Distribution
Mean, Median and Mode are Equal.
Properties of the Normal Distribution
Total area under the curve is equal to 1.
The Normal Distribution
(continued)
Location is
determined by the
mean, μ f(x)
Spread is determined σ
by the standard
deviation, σ x
μ
Mean
The random variable = Median
= Mode
has an infinite
theoretical range:
+ to
The Normal Distribution
The normal distribution has two parameters
(two numerical descriptive measures), the mean
() and the variance (2). If X is a quantity to be
measured that has a normal distribution with
mean () and the standard deviation (2), we
designate this by writing
NORMAL:XN (, 2)
Or XN (, )
Figure 5.7 Several normal distributions with different
means and Standard deviations
Parameters μ and σ
Normal pdfs have two parameters
μ - expected value (mean “mu”)
σ - standard deviation (sigma)
μ controls location σ controls spread
The Normal Distribution
Shape
f(X) Changing μ shifts the
distribution left or right.
Changing σ increases
or decreases the
σ spread.
μ X
Many Normal Distributions
By varying the parameters μ and σ, we obtain different
normal distributions.
The Normal Probability
Density Function
The formula for the normal probability density
function is
1 (1/2)[(X μ)/σ] 2
f(X) e
2π
Where e = the mathematical constant approximated by 2.71828
π = the mathematical constant approximated by 3.14159
μ = the population mean
σ = the population standard deviation
X = any value of the continuous variable
Cumulative Normal Distribution
For a normal random variable X with mean μ and
variance σ2, i.e., X~N(μ,σ2), cumulative distribution
function is
F(x 0 ) P(X x 0 )
f(x)
P(X x 0 )
0 x0 x
Finding Normal Probabilities
Probability is the
Probability is measured by the area
area under the
curve! under the curve
f(X) P (a ≤ X ≤ b)
= P (a < X < b)
(Note that the
probability of any
individual value is zero)
a b X
Finding Normal Probabilities
(continued)
F(b) P(X b)
a μ b x
F(a) P(X a)
a μ b x
P(a X b) F(b) F(a)
a μ b x
Areas to the left and Right of x
Probability as
Area Under the Curve
The total area under the curve is 1.0, and the curve is
symmetric, so half is above the mean, half is below
f(X) P( X μ) 0.5
P(μ X ) 0.5
0.5 0.5
μ X
P( X ) 1.0
The Standardized Normal
Any normal distribution (with any mean and
variance combination) can be transformed into the
standardized normal distribution (Z), with mean 0
and variance 1
f(Z)
Z ~ N(0,1) 1
Z
0
Need to transform X units into Z units by subtracting the
mean of X and dividing by its standard deviation
X μ
Z
σ
The Standardized Normal
Any normal distribution (with any mean and
standard deviation combination) can be
transformed into the standardized normal
distribution (Z)
Need to transform X units into Z units
Translation to the Standardized
Normal Distribution
Translate from X to the standardized normal
(the “Z” distribution) by subtracting the mean
of X and dividing by its standard deviation:
X μ
Z
σ
Z always has mean = 0 and standard deviation = 1
The Standardized Normal
Probability Density Function
The formula for the standardized normal
probability density function is
1 (1/2)Z 2
f(Z) e
2π
Where e = the mathematical constant approximated by 2.71828
π = the mathematical constant approximated by 3.14159
Z = any value of the standardized normal distribution
The Standardized
Normal Distribution
Also known as the “Z” distribution
Mean is 0
Standard Deviation is 1
f(Z)
1
Z
0
Values above the mean have positive Z-values,
values below the mean have negative Z-values
Definition
The standard normal distribution is a special case of the
normal distribution.
Figure 5.8 Standart normal distribution
0, 1
Finding Normal Probabilities
a μ b μ
P(a X b) P Z
σ σ
f(x) b μ a μ
F F
σ σ
a µ b x
a μ b μ
0 Z
σ σ
Table 5.1
P(0 Z z)
Example
If X is distributed normally with mean of 100
and standard deviation of 50, the Z value for
X = 200 is
X μ 200 100
Z 2.0
σ 50
This says that X = 200 is two standard
deviations (2 increments of 50 units) above
the mean of 100.
Normal Distribution: Empirical
Rule for Any Normal Curve
≈ 68% of the observations fall within one standard deviation of the mean.
≈ 95% of the observations fall within two standard deviations of the mean.
≈ 99.7% of the observations fall within three standard deviations of the mean.
The probability equals approximately 0.68 within 1 standard deviation of the mean,
approximately 0.95 within 2 standard deviations, and approximately 0.997 within 3
standard deviations. Question: How do these probabilities relate to the empirical rule?
Normal Distribution: Empirical
Rule for Any Normal Curve
Comparing X and Z units
100 200 X (μ = 100, σ = 50)
0 2.0 Z (μ = 0, σ = 1)
Note that the distribution is the same, only the
scale has changed. We can express the problem in
original units (X) or in standardized units (Z)
The Standardized Normal Table
The Table gives the probability F(a) for any value of a.
.9772
0 2.00 Z
P(Z < 2.00) = 0.5 + P(0<Z<2)
= 0.5+0.4772
= 0.9772
The Standardized Normal Table
The Standardized Normal table gives the
probability less than a desired value for Z
(i.e., from negative infinity to Z)
.5 .4772
Example:
P(Z < 2.00) = .9772
0 2.00 Z
The Standardized Normal Table
(continued)
The column gives the value of
Z to the second decimal point
Z 0.00 0.01 0.02 …
The row shows 0.0
the value of Z 0.1
. The value within the
to the first .
decimal point . table gives the
2.0 .4772 probability from Z = 0 up
to the desired Z value
2.0
P(Z < 2.00) = .5+.4772=.9772
Looking up probabilities in the
standard normal table
What is the area to
the left of Z=1.51 in a
standard normal
curve?
Area is 93.45%
Z=1.51
Z=1.51
The Standardized Normal Table
(continued)
For negative Z-values, use the fact that the
distribution is symmetric to find the needed
probability:
.9772
.0228
Example:
0 2.00 Z
P(Z < -2.00) = 1 – 0.9772
= 0.0228 .9772
.0228
-2.00 0 Z
General Procedure for
Finding Probabilities
To find P(a < X < b) when X is
distributed normally:
Draw the normal curve for the problem in
terms of X
Translate X-values to Z-values
Use the Standardized Normal Table
Finding Normal Probabilities
Suppose X is normal with mean 8.0 and
standard deviation 5.0
Find P(X < 8.6)
X
8.0
8.6
Finding Normal Probabilities
(continued)
Suppose X is normal with mean 8.0 and
standard deviation 5.0. Find P(X < 8.6)
X μ 8.6 8.0
Z 0.12
σ 5.0
μ=8 μ=0
σ = 10 σ=1
8 8.6 X 0 0.12 Z
P(X < 8.6) P(Z < 0.12)
Solution: Finding P(Z < 0.12)
Standardized Normal Probability P(X < 8.6)
Table (Portion) = P(Z < 0.12)
Z .00 .01 .02 =.5+.0478
0.0 .0000 .0040 .0080 =.5478
0.1 .0398 .0438 .0478
0.2 .0793 .0832 .0871
Z
0.00
0.3 .1179 .1217 .1255
0.12
Upper Tail Probabilities
Suppose X is normal with mean 8.0 and
standard deviation 5.0.
Now Find P(X > 8.6)
X
8.0
8.6
Upper Tail Probabilities
(continued)
Now Find P(X > 8.6)…
P(X > 8.6) = P(Z > 0.12) = 1.0 - P(Z ≤ 0.12)
= 1.0 - .5478 = .4522
.5478
1.000 1.0 - .5478
= .4522
Z Z
0 0
0.12 0.12
Probability Between
Two Values
Suppose X is normal with mean 8.0 and
standard deviation 5.0. Find P(8 < X < 8.6)
Calculate Z-values:
X μ 8 8
Z 0
σ 5
8 8.6 X
X μ 8.6 8 0 0.12 Z
Z 0.12
σ 5 P(8 < X < 8.6)
= P(0 < Z < 0.12)
Solution: Finding P(0 < Z < 0.12)
Standardized Normal Probability P(8 < X < 8.6)
Table (Portion) = P(0 < Z < 0.12)
=.0478
Z .00 .01 .02
0.0 .0000 .0040 .0080 .0478
0.1 .0398 .0438 .0478
0.2 .0793 .0832 .0871
0.3 .1179 .1217 .1255 Z
0.00
0.12
Probabilities in the Lower Tail
Suppose X is normal with mean 8.0 and
standard deviation 5.0.
Now Find P(7.4 < X < 8)
X
8.0
7.4
Probabilities in the Lower Tail
(continued)
Now Find P(7.4 < X < 8)…
P(7.4 < X < 8)
= P(-0.12 < Z < 0) .0478
= .0478
The Normal distribution is
symmetric, so this probability
is the same as P(0 < Z < 0.12)
7.4 8.0 X
-0.12 0 Z
Areas under the standard normal curve
A1 = A2 = .4082
P(-1.33 < z < 1.33)=P(-1.33 < z < 0) + P( 0< z < 1.33)=A1 + A2
= .4082*2=.8164
Find the probability that a standard normal random
variable exceeds 1.96 in absolute value.
P(|z|> 1.96)=P(z<-1.96 or z>1.96)
Since the area between z=0 and z=1.96 is .4750,
P(|z|> 1.96)=A1 + A2 = .0250 + .0250=.05
Example 5.10: Find the value of z0 such that 95% of the standard
normal Z values lie between - z0 and + z0; that is, find P(- z0 ≤ Z ≤ z0) =
0.95.
This means that the area on each side of the mean will be equal to 0.95 / 2=0.475.
The area between z=0 and z0 is 0.475. We look up 0.475 in the body of Table IV to
find the value z0 = 1.96.
It means 95% of a normal distribution lies between
+1.96 and -1.96 standard deviations of the mean.
Finding the X value for a
Known Probability
Steps to find the X value for a known
probability:
1. Find the Z value for the known probability
2. Convert to X units using the formula:
X μ Zσ
Finding the X value for a
Known Probability
(continued)
Example:
Suppose X is normal with mean 8.0 and
standard deviation 5.0.
Now find the X value so that only 20% of all
values are below this X
.2000
? 8.0 X
? 0 Z
Find the Z value for
20% in the Lower Tail
1. Find the Z value for the known probability
Standardized Normal Probability 20% area in the lower
Table (Portion) tail is consistent with a
Z … .03 .04 .05 Z value of -0.84
0.7 … .2673 .2704 .2734
.2000
0.8 … .2967 .2995 .3023
0.9 … .3238 .3264 .3289
? 8.0 X
-0.84 0 Z
Finding the X value
2. Convert to X units using the formula:
X μ Zσ
8.0 ( 0.84 )5.0
3.80
So 20% of the values from a distribution
with mean 8.0 and standard deviation
5.0 are less than 3.80
Question: What gestational length is smaller than
97.5% of gestations?
Let X represent gestations length
We know from prior research that
X ~ N(39, 2)
A value that is smaller than .975 of gestations
has a cumulative probability of.025
z - percentile
Less than 97.5%
(right tail) = greater
than 2.5% (left tail)
z lookup:
z.025 = −1.96
z .00 .01 .02 .03 .04 .05 .06 .07 .08 .09
–1.9 .0287 .0281 .0274 .0268 .0262 .0256 .0250 .0244 .0239 .0233
Unstandardize and sketch
x z p 39 (1.96)(2) 35
The 2.5th percentile is 35 weeks
Practice problem
If birth weights in a population are normally
distributed with a mean of 109 oz and a standard
deviation of 13 oz,
a. What is the chance of obtaining a birth weight of
141 oz or heavier when sampling birth records at
random?
b. What is the chance of obtaining a birth weight of
120 or lighter?
Answer
a. What is the chance of obtaining a birth weight of
141 oz or heavier when sampling birth records at
random?
141 109
Z 2.46
13
From the chart or SAS Z of 2.46 corresponds to a right tail (greater
than) area of: P(Z≥2.46) = 1-(.9931)= .0069 or .69 %
Answer
b. What is the chance of obtaining a birth weight of
120 or lighter?
120 109
Z .85
13
From the chart or SAS Z of .85 corresponds to a left tail area of:
P(Z≤.85) = .8023= 80.23%
Example 5.11: College Entrance Exam Application
Suppose the scores X on a college entrance
examination are normally distributed with a mean of 550
and a standard deviation of 100.
A certain prestigious university will consider for
admission only those applicants whose scores exceed
the 90th percentile of the distribution.
Find the mininmum score an applicant must achieve in
order to receive consideration for admission to the
university.
Example 5.11: College Entrance Exam Application
Solution: P(X ≤ x) = 0.90
P(X ≤ x) = P(Z ≤ (x-µ)/σ)
=P(Z ≤ (x-550)/100)=0.90
z= (x-550)/100=1.28
X=550+1.28(100)=550+128=678
• The 90th percentile of the test score distribution
is 678.
Figure 5.18 Area under the normal curve
for Example 5.11
That is to say, an applicant must score at least 678 on the
entrance exam to receive consideration for admission by
the university.
Exercise 1
A radar unit is used to measure speeds of cars
on a motorway.
The speeds are normally distributed with a
mean of 90 km/hr and a standard deviation of
10 km/hr.
What is the probability that a car picked at
random is travelling at more than 100 km/hr?
Exercise 2
For a certain type of computers, the length of
time between charges of the battery is normally
distributed with a mean of 50 hours and a
standard deviation of 15 hours.
John owns one of these computers and wants
to know the probability that the length of time
will be between 50 and 70 hours.
Exercise 3
Entry to a certain University is determined by a
national test.
The scores on this test are normally distributed
with a mean of 500 and a standard deviation of
100.
Tom wants to be admitted to this university and
he knows that he must score better than at least
70% of the students who took the test.
Tom takes the test and scores 585.
Will he be admitted to this university?
Exercise 4
The annual salaries of employees in a large
company are approximately normally distributed
with a mean of $50,000 and a standard
deviation of $20,000.
What percent of people earn between $45,000
and $65,000?
Exercise 5
The time taken to assemble a car in a certain
plant is a random variable having a normal
distribution of 20 hours and a standard
deviation of 2 hours.
What is the probability that a car can be
assembled at this plant in a period of time
between 20 and 22 hours?
Exercise 6
Let’s suppose that the income of families are
normally distributed with N($25000, $100002).
If the poverty level is $10,000, what percentage
of the population lives in poverty?
Problems
1. If X is a random variable with a distribution of N(μ, σ), find:
p(µ−3σ ≤ X ≤ µ+3σ)
2. Calculate the value of a in a normal distribution with a mean of 4 and a
standard deviation of 2 for which:
P(4−a ≤ x ≤ 4+a) = 0.5934
3. In a city, it is estimated that the maximum temperature in June is normally
distributed with a mean of 23º and a standard deviation of 5°. Calculate the
number of days in this month in which it is expected to reach a maximum of
between 21° and 27°.
Answer 1:
If X is a random variable with a distribution of
N(μ, σ), find:
p(µ−3σ ≤ X ≤ µ+3σ)
Approximately 99.74% of the X values are less
than three standard deviations from the mean.
Answer 2:
Calculate the value of a in a normal distribution
with a mean of 4 and a standard deviation of 2
for which:
P(4−a ≤ x ≤ 4+a) = 0.5934
Answer 3
4. The mean weight of 500 college students is 70 kg and the standard
deviation is 3 kg. Assuming that the weight is normally distributed,
determine how many students weigh:
a. Between 60 kg and 75 kg.
b.More than 90 kg.
c.Less than 64 kg.
d.Exactly 64 kg.
e.64 kg or less.
5. It is assumed that the test results for a class follow a normal distribution
with a mean of 78 and a standard deviation of 36.
a. What is the probability of a student obtaining the a grade higher than
72?
b.Calculate the percentage of students who have scores that are at least
five points higher than the score that marks the border between pass and
fail (a failing grade is defined as 25% of the tests with the lowest scores) .
c. If you know that a student's grade is greater than 72, what is the
probability that it is greater than 84?
Answer 4
Answer 5
6. An IQ test shows that the scores follow a distribution of N(65, 18). It is
desirable to arrange the participants into three groups (low intelligence,
average intelligence and high intelligence) so that 20% of the partipants
are filled under the first group, 65% in the second and 15% in the third.
What are the scores that differentiate each intelligence group from one
another?
7. Several intelligence tests follow a normal distribution with a mean of 100
and a standard deviation of 15.
a. Determine the percentage of the population that would obtain a score
between 95 and 110.
b. What interval centered at a score of 100 contains 50% of the
population?
c. For a population of 2,500, how many are expected to have a score
above 125?
Answer 6
Answer 7