0% found this document useful (0 votes)
25 views25 pages

Final Tests Prob Stat 2021-PRINT

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
25 views25 pages

Final Tests Prob Stat 2021-PRINT

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 25

HCMC UNIVERSITY OF TECHNOLOGY FINAL EXAM, SEMESTER 2, 2017-2018

AND EDUCATION Subject: Mathematical Statistics for Engineers


HIGH QUALITY TRAINING FACULTY Course code: MATH131401E
Number of pages: 02 pages.
FOUNDATION SCIENCE GROUP
Duration: 90 minutes.
------------------------- Materials are allowed during the exam.
Question I (4.5 points)
1. A car is selected at random from a car park. The probability of the car being blue is
0.25 and the probability of it being an estate is 0.15. The probability of the car being a
blue estate is 0.08. What is the probability of the car being neither blue nor an estate?
2. A box of 10 biscuits chocolate biscuits and 1 lemon biscuit. Jenny takes out a biscuit
at random and eats it. She then takes out another biscuit at random.
a. Draw a tree diagram to show this information.
b. Find the probability that the second biscuit she takes is chocolate.
3. Let X denote the amount of space occupied by an article placed in a 1-packing
container and have range [0,1]. The pdf of X is f(x) = kx8 (1-x). Find k, EX, VX, and
the 70th percentile of X.
4. A multiple-choice test has three possible answers to each question, only one of
which is correct. A student guesses the answer to each of the twelve questions at
random. The random variable X is the number of correct answers.
a. State the distribution of X and explain why this model is suitable.
b. Find the probability that the student gets fewer than three questions correct.

Question II (5.5 point)


1. A survey of 2253 adults of a country revealed that 1262 of the respondents had at
some point used wireless means for online access. Calculate a 97% confidence interval
for the proportion of all adults in the country who at the time of the survey had used
wireless means for online access.
2. For a sample of 10 water specimens selected for treatment by coagulation, the
sample mean arsenic concentration was 24.3 mg/L, and the sample standard deviation
was 4.1. Using t-based methods to analyze the data to calculate and interpret a 95% CI
for true average arsenic concentration in all such water specimens. Assume that the
distribution of arsenic concentration was normal.
3. A hot-tub manufacturer advertises that with its heating equipment, a temperature of
100°F can be achieved in at most 15 min. A random sample of 42 tubs is selected, and
the time necessary to achieve a 100°F temperature is determined for each tub. The
sample average time and sample standard deviation are 16.5 min and 2.2 min,
respectively. Does this data cast doubt on the company’s claim? (using α = 0.05).
4. Urban storm water can be contaminated by many sources, including discarded
batteries. When ruptured, these batteries release metals of environmental significance.
A test for two different brands of size D batteries gave the following summary data on
zinc mass (g):
Assuming that both zinc mass
Brand Duracell Energizer
distributions are at least
Sample size 41 51
approximately normal, carry
Sample Mean 17.9 19.4 out a test at significance level
Sample Standard Deviation 2.5 3.4 .02 using the P-value approach
to decide whether true average zinc mass is different for the two types of batteries.

Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2


5. Toughness and fibrousness of asparagus are major determinants of quality. This was
the focus of a study reported in “Post-Harvest Glyphosphate Application Reduces
Toughening, Fiber Content, and Lignification of Stored Asparagus Spears” (J. of the
Amer. Soc. of Hort. Science, 1988: 569–572). The article reported the accompanying
data (read from a graph) on x shear force (kg) and y percent fiber dry weight
x 48 57 60 72 81 85 109 121 137 148 149 184 185 187
y 2.1 2.28 2.34 2.53 2.28 2.62 2.5 2.66 2.8 3.01 2.98 3.34 3.49 3.26
Determine the correlation coefficient for the above set of results and the equation of
the least squares regression line for predicting the percent fiber dry weight. Use your
regression line to estimate the value of the percent fiber dry weight when shear force is
132 kg?
Notice: Invigilators should not explain the questions on the exam papers.

Expected Learning Outcomes Questions


[ELO 2.1.1]: Compute mean, median, mode, standard deviation, Question I
variance, and know their function.
[ELO 2.4.3; 2.4.4]: Become familiar with various graphical
representations of data and learn to recognize misleading graphs.
[ELO 2.4.3; 2.4.4]: Use binomial, normal, Poisson distributions,
Hyper geometric distribution and their relationships.
[ELO 2.1.1, 2.1.2]: Draw probability distribution table for discrete
random variables.
[ELO 2.1.1, 2.1.2]: Calculate the confidence intervals for Question II
proportion, mean, variance based on a sample collected.
[ELO 2.1.3, 2.1.4]: Use test procedures to solve and develop
proficiency in its applications
[ELO 2.4.4]: Use linear regression model.

May 30th, 2017

Head of foundation science group

Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2


HCMC UNIVERSITY OF TECHNOLOGY FINAL EXAM, SEMESTER 2, 2018-2019
AND EDUCATION Subject: Mathematical Statistics for Engineers
HIGH QUALITY TRAINING FACULTY Course code: MATH131401E
Number of pages: 02 pages.
FOUNDATION SCIENCE GROUP
Duration: 90 minutes.
------------------------- Materials are allowed during the exam.
Question I (4.5 points)
1. A class has 20 male students and 30 female students. Choose randomly 5 students from this
class. Find the probability that there are both genders (male and female) in 5 students.
2. A test gives a positive result with probability 97% when the patient is indeed affected by
H1N1, while it gives a negative result with 98% probability when the patient is not affected
by H1N1. If a patient is selected at random from a population in which 0,5% of individuals
are affected by H1N1 and he is found positive, what is the probability that he is indeed
affected by H1N1?
3. The weight of a duck is a continuous random variable X (kg) . Suppose that X has the

kx( x 2  8) , if x  [1, 4]
following pdf: f ( x)  


0
 , if x  [1, 4]
a. Find the value of k
b. Find the probability that in 10 ducks, there are 4 ducks which are greater than 3 kg in
weight.
4. The lifetime of a light bulb is a continuous random variable X (years) normally distributed
with N (6; 1.25) . Find the probability that in 100 light bulbs there are at most 25 bulbs which
have lifetime greater than 7 years?

Question II (5.5 points)


1. Observe the salary X (million/month) of some people in Company A. We have the
following data.
X 4.0 - 4.5 4.5 – 5.0 5.0 – 5.5 5.5 – 6.0 6.0 – 6.5 6.5-7.0
The number 23 33 55 73 45 28
of people
a/ Calculate a 99% CI for true average salary of a person in Company A ? ( X has normal
distribution)
b/ It was also stated that the salary of a person in Company A was 6 million/month. Does the
data strongly suggest that the true average salary of a person in Company A is something
other than the stated value? Test the opinion using the significance level 0.02
2. In one experiment, 110 questionnaires with no incentive resulted in 75 respones, whereas
98 questionnaires that included a chance to win a lottery yielded 66 responses.
a/ Does this data suggest that the proportion of respones with no incentive resulted is less than
the proportion of response including a chance to win a lottery ? Test the hypotheses at
significance level 0.05
b/ What the confidence level is required if the precision of one-sided confidence for response
with no incentive resulted is 0.10347 ?
3. Let X (million Euro) be the money spending on advertising of a company. Let Y (million
euro) be the revenue of this company. Observe a sample (X,Y):
X 23 26 30 41 43 48 52 57
Y 651 762 856 1063 1190 1298 1421 1440
Do two variables X and Y have a linear regression relationship ? Find the linear regression
equation of X and Y? When X=70 , find Y=?

Notice: Invigilators should not explain the questions on the exam papers.

Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2


Expected Learning Outcomes Questions
[ELO 2.1.1]: Compute mean, median, mode, standard deviation, Question I
variance, and know their function.
[ELO 2.4.3; 2.4.4]: Become familiar with various graphical
representations of data and learn to recognize misleading graphs.
[ELO 2.4.3; 2.4.4]: Use binomial, normal, Poisson distributions,
Hyper geometric distribution and their relationships.
[ELO 2.1.1, 2.1.2]: Draw probability distribution table for discrete
random variables.
[ELO 2.1.1, 2.1.2]: Calculate the confidence intervals for Question II
proportion, mean, variance based on a sample collected.
[ELO 2.1.3, 2.1.4]: Use test procedures to solve and develop
proficiency in its applications
[ELO 2.4.4]: Use linear regression model.

June 10th, 2019

Head of foundation science group

Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2


HCMC UNIVERSITY OF TECHNOLOGY FINAL EXAM, SEMESTER 3, 2018-2019
AND EDUCATION Subject: Mathematical Statistics for Engineers
HIGH QUALITY TRAINING FACULTY Course code: MATH131401E
Number of pages: 02 pages.
FOUNDATION SCIENCE GROUP
Duration: 90 minutes.
------------------------- Materials are allowed during the exam.
Question I (4.5 points)
1. A class has 30 students. In this class, there are 8 students who can understand Japanese, 7
students who can understand Chinese and 3 students who can understand both Japanese and
Chinese. Choose randomly 4 students from this class to play a game. The rules of the game are
written a half in Japanese and a half in Chinese. Find the probability that this group can
understand the rules.
2. A certain company sends 30% of its overnight mail parcels via express mail service E1; 50%
of the overnight parcels via express mail service E2 and the remaining 20% via E3. Of those
sent via E1, only 3% arrive late, whereas 1% of the parcels handled by E2 arrive late and 2% of
the parcels handled by E3 arrive late.
If a record of an overnight mailing is randomly selected from the company’s file, what is the
probability that the parcel arriving late was E2 ?
3. Let X denote the temperature at which a certain chemical reaction takes place. Suppose that X

 2

 k(5  x 2 ) , if x  [1, 2]
has pdf : f ( x)   7



0 , if x  [1, 2]
a. Find the value of k
b. Suppose this reaction is independently carried out once in each of fourteen different labs
and that the pdf of reaction time in each lab is as given. Find the probability that there are at least
2 labs which the temperature exceeds 1.
4. An article suggests a normal distribution with mean 130.2 oz and standard deviation 1.2 oz
for the actual contents of jars of a certain type. Stop selecting randomly jars until get 4 jars more
than 132 oz. Find the probability that the total number of jars is selected is 16 .
Question II (5.5 points)
1. Electric energy consumption ( X ) of a household in city A has the normal distribution.
Observe some households in city A randomly:
X(kwh/month) 60-110 110-170 170-220 220-270 270-320 320-370
The number of 20 32 62 71 80 115
households
a/ Calculate a 97% CI for the true average electric energy consumption of a household in city A.
b/ It was also stated that the electric energy consumption of a household in city A was 250
kwh/month. Does the data strongly suggest that the true average electric energy consumption of
a household is greater than the stated value? Test the opinion using the significance level 0.02
2. Researchers sent some resumes in response to job ads that appeared in the Boston Globe and
Chicago Tribune. The resumes were identical except that 2400 of them had “white sounding”
first names, such as Brett and Emily, whereas the other 2600 had “black sounding” names such
as Tamika and Rasheed. The resumes of the first type elicited 220 responses and the resumes of
the second type only 160 responses .
a/ Does this data strongly suggest that a resume with a “black” name is less likely to result in a
response than is a resume with a “white” name? Test the hypotheses at significance level 0.05
b/ Calculate a 99% lower bound for the population proportion of responses of the first type.
3. Let X (people) be the number of people attending a trip to Europe. Let Y (million VND) be
the revenue of the travel agency. Observe a sample (X,Y):
X 12 15 17 19 20 25 27 29
Y 720 895 1053 1100 1210 1480 1590 1780

Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2


Do two variables X and Y have a linear regression relationship ? Find the linear regression
equation of X and Y? Calculate a point estimate of the revenue of the travel agency when
X=30.

Notice: Invigilators should not explain the questions on the exam papers.

Expected Learning Outcomes Questions


[ELO 2.1.1]: Compute mean, median, mode, standard deviation, Question I
variance, and know their function.
[ELO 2.4.3; 2.4.4]: Become familiar with various graphical
representations of data and learn to recognize misleading graphs.
[ELO 2.4.3; 2.4.4]: Use binomial, normal, Poisson distributions,
Hyper geometric distribution and their relationships.
[ELO 2.1.1, 2.1.2]: Draw probability distribution table for discrete
random variables.
[ELO 2.1.1, 2.1.2]: Calculate the confidence intervals for Question II
proportion, mean, variance based on a sample collected.
[ELO 2.1.3, 2.1.4]: Use test procedures to solve and develop
proficiency in its applications
[ELO 2.4.4]: Use linear regression model.

August 2st, 2019

Head of foundation science group

Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2


HCMC UNIVERSITY OF TECHNOLOGY FINAL EXAM, SEMESTER 1, 2020-2021
AND EDUCATION Subject: Mathematical Statistics for Engineers
HIGH QUALITY TRAINING FACULTY Course code: MATH131401E
Number of pages: 02 pages.
FOUNDATION SCIENCE GROUP
Duration: 90 minutes.
------------------------- Materials are allowed during the exam.
Question I (4.5 points)
1. The line  1 is parallel to  2 . Let A1 , B1 , C1 , D1 , E1 , F1 be on the line  1 and
A2 , B2 , C 2 , D2 , E 2 , F2 , G 2 be on the line  2 . Select randomly 3 points on these lines.
Find the probability that we get a triangle.
2. At a certain gas station, 40% of the customers use A95, 35% use A92, and 25% use E5.
Of those customers using A95, only 35% fill their tanks. Of those customers using A92,
65% fill their tanks, whereas of those using E5, 55% fill their tanks.
If the next customer fills the tank, what is the probability that A92 is requested?
3. Let X (gram) denote the weight of a product type A. The pdf of X is
k ( x 2  10 x  24) ; 4  x  6
f ( x)  
 0 ; else
a/ Find the value of k
b/ Select randomly the products until finding a product less than 5 gram. What is the
probability that we must select 12 products?
4.It is known that the cholesterol level in males 30 years old follows a normal distribution
with mean 220 mg/dl and standard deviation 30 mg/dl. Find the probability that at most 2
males in 20 males has cholesterol level greater than 240 mg/dl.
Question II(5.5 points)
1.An article reported that for a sample of 70 kitchens with gas cooking appliances
monitored during a one-week period, the sample mean CO2 level (ppm) was 620.12 , and
the sample standard deviation was 162.
a/ Calculate a 95% confidence interval for true average CO2 level in the population of
all homes from which the sample was selected.
b/ It was also stated that the CO2 level in the population of all homes was 600 (ppm).
Does the data strongly suggest that the true average CO2 level in the population of all
homes is something other than the stated value? Test the opinion using the significance
level 0.02
2. Last year, in a sample of 1200 randomly selected consumers who had opportunities to
send in a rebate claim form after purchasing a product A, there are 300 of these people said
they never did. This year, if a consumer sends in a rebate claim form, he will get 5 reward
points. Observe 1000 randomly selected consumers who had opportunities to send in a
rebate claim form after purchasing a product A this year, 200 of these people said they never
did.
a/ Does this data strongly suggest that the true proportion of such consumers who never
apply for a rebate claim after purchasing the product A last year is greater than this
year? Test the hypotheses at significance level 0.05
b/ Calculate an lower confidence bound at the 97% confidence level for the true
proportion of such consumers who never apply for a rebate claim after purchasing the
product A this year.
4. Let X (dollars) be the expenditure per month and Y (dollars) be the income per month
of an employee at a certain company. Observe a sample (X,Y):

X 2400 2600 2300 4900 3100 2500 5100 3200


Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2
Y 41000 45000 39000 59000 44000 43000 60000 48000
Do two variables X and Y have a linear regression relationship ? Find the linear
regression equation of X and Y?

Notice:Invigilators should not explain the questions on the exam papers.

Expected Learning Outcomes Questions


[ELO 2.1.1]: Compute mean, median, mode, standard deviation, Question I
variance, and know their function.
[ELO 2.4.3; 2.4.4]: Become familiar with various graphical
representations of data and learn to recognize misleading graphs.
[ELO 2.4.3; 2.4.4]: Use binomial, normal, Poisson distributions,
Hyper geometric distribution and their relationships.
[ELO 2.1.1, 2.1.2]: Draw probability distribution table for discrete
random variables.
[ELO 2.1.1, 2.1.2]: Calculate the confidence intervals for Question II
proportion, mean, variance based on a sample collected.
[ELO 2.1.3, 2.1.4]: Use test procedures to solve and develop
proficiency in its applications
[ELO 2.4.4]: Use linear regression model.

January8th, 2021

Head of foundation science group

Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2


HCMC UNIVERSITY OF FINAL EXAMINATION
TECHNOLOGY AND EDUCATION SEMESTER 1 - ACADEMIC YEAR 2019-2020
Faculty for High Quality Training Course name: Mathematical Statistics for Engineers
------------------- Course ID: MATH132901E
Duration: 90 minutes. Number of pages: 02 pages
Open book.

Question 1. (1.5/10) Consider randomly selecting a single individual and having that person
test drive 3 different vehicles. Define events A1 , A2 , and A3 by A1 = likes vehicle #1, A2 = likes
vehicle #2, A3 = likes vehicle #3. Suppose that

P (A1 ) = 0.63, P (A2 ) = 0.58, P (A3 ) = 0.72, P (A1 ∪ A2 ) = 0.83,

P (A2 ∩ A3 ) = 0.45, and P (A1 ∪ A2 ∪ A3 ) = 0.90.


a. What is the probability that the individual likes both vehicle #1 and vehicle #2?
b. Are A2 and A3 independent events? Justify your answer.
c. If you learn that the individual did not like vehicle #1, what now is the probability that he/she
liked at least one of the other two vehicles?
Question 2. (1.0/10) Seventy percent of the light aircraft that disappear while in flight in a
certain country are subsequently discovered. Of the aircraft that are discovered, 66% have an
emergency locator, whereas 95% of the aircraft not discovered do not have such a locator. Suppose
a light aircraft has disappeared.
a. If it has an emergency locator, what is the probability that it will not be discovered?
b. If it does not have an emergency locator, what is the probability that it will be discovered?
Question 3. (1.0/10) Suppose that 20% of all students who have to buy a text for a particular
course want a new copy, whereas the other 80% want a used copy. Consider randomly selecting 30
purchasers.
a. What is the probability that the number who want new copies is more than two standard de-
viations away from the mean value?
b. The bookstore has 20 new copies and 20 used copies in stock. If 30 people come in one by one
to purchase this text, what is the probability that all 30 will get the type of book they want from
current stock?
Question 4. (1.5/10) The heights (X) of adult males in Vietnam are normally distributed with
mean of 65 inches and standard deviation of 6 inches.
a. What proportion of adult males in Vietnam are taller than 70 inches?
b. If nine people are independently selected, what is the probability that at least one has a height
exceeding 70 inches?
Question 5. (2.0/10)
a. A random sample of 120 lightning flashes in a certain region resulted in a sample average radar
echo duration of 0.88 sec and a sample standard deviation of 0.35 sec. Calculate a 98% (two-sided)
confidence interval for the true average echo duration µ, and interpret the resulting interval.
b. An April 2009 survey of 2353 American adults conducted by the Pew Research Center’s Internet
& American Life Project revealed that 1272 of the respondents had at some point used wireless
means for online access. Calculate and interpret a 99% CI for the proportion of all American adults
who at the time of the survey had used wireless means for online access.

Document ID: BM1/QT-PĐBCL-RĐTV-E Page: 1/2


Question 6. (2.0/10)
a. A manufacturer of nickel-hydrogen batteries randomly selects 120 nickel plates for test cells,
cycles them a specified number of times, and determines that 11 of the plates have blistered. Does
this provide compelling evidence for concluding that more than 8% of all plates blister under such
circumstances? State and test the appropriate hypotheses using a significance level of 0.05.
b. Is there any systematic tendency for part-time college faculty to hold their students to differ-
ent standards than do full-time faculty? The article “Are There Instructional Differences Between
Full-Time and Part-Time Faculty?” reported that for a sample of 132 courses taught by full-time
faculty, the mean course GPA was 2.7186 and the standard deviation was 0.63342, whereas for
a sample of 90 courses taught by part-timers, the mean and standard deviation were 2.8639 and
0.49241, respectively. Does it appear that true average course GPA for part-time faculty differs
from that for faculty teaching full-time? Test the appropriate hypotheses at significance level 0.01.

Question 7. (1.0/10) Toughness and fibrousness of asparagus are major determinants of qual-
ity. This was the focus of a study reported in “Post-Harvest Glyphosphate Application Reduces
Toughening, Fiber Content, and Lignification of Stored Asparagus Spears”. The article reported
the accompanying data (read from a graph) on x shear force (kg) and y percent fiber dry weight.
x 48 57 60 72 81 85 109 121 137 148 149 184 185 187
y 2.10 2.28 2.53 2.29 2.62 2.50 2.66 2.82 3.01 2.98 3.34 3.49 3.48 3.26
Determine the correlation coefficient for the above set of results and the equation of the least
squares regression line for predicting the percent fiber dry weight. Use your regression line to
estimate the value of the percent fiber dry weight when shear force is 132 kg?

Note: Proctors are not allowed to give any unauthorised explaination.

Learning outcome mapping Assessed in


[LO 2.1.1]: Compute mean, median, mode, standard deviation, vari-
Question 1
ance, and know their function
[LO 2.4.3; 2.4.4]: Become familiar with various graphical represen-
Question 2
tations of data and learn to recognize misleading graphs.
[LO 2.1.1; 2.1.2; 2.4.3; 2.4.4]: Use binomial, normal, Poisson distri-
Questions 3,4
butions, Hyper geometric distribution and their relationships.
[LO 2.1.1, 2.1.2]: Calculate the confidence intervals for proportion,
Question 5
mean, variance based on a sample collected.
[LO 2.1.3, 2.1.4]: Use test procedures to solve and develop profi-
Question 6
ciency in its applications.
[LO 2.4.4]: Use linear regression model. Question 7

December 10th , 2019


Approved by program chair

Lưu Việt Hùng

Document ID: BM1/QT-PĐBCL-RĐTV-E Page: 2/2


HCMC UNIVERSITY OF FINAL EXAMINATION
TECHNOLOGY AND EDUCATION SEMESTER 2 - ACADEMIC YEAR 2019-2020
Faculty of Applied Sciences Course name: Mathematical Statistics for Engineers
------------------- Course ID: MATH132901E
Duration: 90 minutes. Number of pages: 02 pages
Open book.

Question 1. (1.0/10) Three plants manufacture hard drives and ship them to a warehouse for
distribution. Plant I produces 54% of the warehouse’s inventory with a 4% defect rate. Plant II
produces 35% of the warehouse’s inventory with an 8% defect rate. Plant III produces the remain-
der of the warehouse’s inventory with a 12% defect rate.
a. What is the probability that a randomly selected hard drive is defective?
b. Suppose a hard drive is defective. What is the probability that it came from Plant II?
Question 2. (1.0/10) A large lot of tires contains 5% defectives. 4 tires are to be chosen for a
car.
a. Find the probability that you find at most 2 defective tires before 4 good ones.
b. Find the mean and variance of the number of defective tires you find before finding 4 good tires.
Question 3. (1.0/10) A shipment of 25 integrated circuits (ICs) arrives at an electronics man-
ufacturing site. The site manager will randomly select 4 ICs and test them to see whether they
are faulty. Unknown to the site manager, 5 of these 25 ICs are faulty. Suppose the shipment will
be accepted if and only if at most one of the inspected ICs is faulty. What is the probability this
shipment of 25 ICs will be accepted?
Question 4. (1.5/10) Based on extensive data from an urban freeway near Toronto, Canada,
“it is assumed that free speeds can best be represented by a normal distribution”. The mean and
standard deviation reported in the article were 118 km/h and 13.1 km/h, respectively.
a. The posted speed limit was 100 km/h. What percentage of vehicles was traveling at speeds
exceeding this posted limit?
b. If five vehicles are randomly and independently selected, what is the probability that at least
one is not exceeding the posted speed limit?
Question 5. (2.0/10)
a. A randomly selected sample of n = 16 students at a university is asked, “How much did you
spend for textbooks this semester?” The responses, in dollars, are

380, 290, 310, 200, 175, 450, 300, 350, 250, 150, 200, 320, 370, 404, 250, 420

Calculate a 95% confidence interval for the population mean.


b. A soft drink manufacturer wants to determine if one of its fill operations is working properly.
A random sample of twenty "1 liter" bottles of the same soft drink showed an average file of 0.996
liters with a standard deviation of 0.008 liters. At a 1% level of significance, is it true that the
average fill is under 1 liter? Show your working and reasoning.
Question 6. (2.5/10)
a. A study was conducted to determine if office background noise at an office decreases productivity.
Fifteen office workers were studied at work for one week both before and after the wearing of
noise-reduction head gear. Their productivity on a 100 point scale as determined by the company

Document ID: BM1/QT-PĐBCL-RĐTV-E Page: 1/2


conducting the study were recorded as follow:

Subject 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Before 91 88 95 81 86 79 84 79 87 86 75 87 93 85 77
After 92 90 94 83 84 89 85 78 87 91 74 89 86 89 78

At a 5% level of significance is there evidence that wearing the noise reduction head gear increases
worker productivity?
b. “Would you marry a person from a lower social class than your own?” Researchers asked this
question of a sample of 685 never-married students at two historically colleges in the South. Of the
349 men in the sample, 291 said “Yes.” Among the 336 women, 217 said “Yes.” At a 2% level of
significance is there reason to think that different proportions of men and women in this student
population would be willing to marry beneath their class?
Question 7. (1.0/10) Do heavier people burn more energy? We have data on the lean body mass
and resting metabolic rate for 12 women who are subjects in a study of dieting. Lean body mass,
given in kilograms, is a person’s weight leaving out all fat. Metabolic rate, in calories burned per
24 hours, is the rate at which the body consumes energy.
Mass 36.1 54.6 48.5 42.0 50.6 42.0 40.3 33.1 42.4 34.5 51.1 41.2
Rate 995 1425 1396 1418 1502 1256 1189 913 1124 1052 1347 1204
Determine the correlation coefficient for the above set of results and the equation of least-squares
regression line for predicting metabolic rate from body mass. Another woman has lean body mass
45 kilograms. What is her predicted metabolic rate?

Note: Proctors are not allowed to give any unauthorised explaination.

Learning outcome mapping Assessed in


[LO 2.1.1]: Compute mean, median, mode, standard deviation, vari-
Question 1
ance, and know their function
[LO 2.4.3; 2.4.4]: Become familiar with various graphical represen-
Question 2
tations of data and learn to recognize misleading graphs.
[LO 2.1.1; 2.1.2; 2.4.3; 2.4.4]: Use binomial, normal, Poisson distri-
Questions 3,4
butions, Hyper geometric distribution and their relationships.
[LO 2.1.1, 2.1.2]: Calculate the confidence intervals for proportion,
Question 5
mean, variance based on a sample collected.
[LO 2.1.3, 2.1.4]: Use test procedures to solve and develop profi-
Question 6
ciency in its applications.
[LO 2.4.4]: Use linear regression model. Question 7

July 16th , 2020


Approved by program chair

Nguyễn Văn Toản

Document ID: BM1/QT-PĐBCL-RĐTV-E Page: 2/2


HCMC UNIVERSITY OF TECHNOLOGY SOLUTION For FINAL EXAM, SEMESTER 2, 17-18
AND EDUCATION Subject: Mathematical Statistics for Engineers
HIGH QUALITY TRAINING FACULTY Course code: MATH130401E
-------------------------
Q Id Content pts
ea
1 B: “Car is blue” ; E: “Car is an estate”
P  B or E   P  B  E   P  B   P  E   P  B  E  0,5
0,5
 0.25  0.15  0.08  0.32.
2 0.5

P  second biscuit is chocolate   P  C1C2   P  L1C2 


10 9 1 10 0.5
    1
11 10 11 11

I
3 1

 kx (1  x ) dx  1  k  90
8
(4,5
0
pts) 1
9 0.5
EX   90 x 9 (1  x )dx  ;
0
11
0.5
1
15
 VX  EX 2   EX   0.0124.
2
EX 2   90 x10 (1  x ) dx 
0
22
y
14 0.5
Let the 70th percentile of the thickness be y. 90 x8 (1  x ) dx  0.7  y   0.56
0
25
4. a. X~B(12,1/3) because these following reason:
- 12 answers: either ‘correct’ or ‘incorrect’. 0,5
- Questions are answered independently of each other
- Probability of a correct answer is 1/3.
0,5
b. P(X<3)=P(X=0)+P(X=1)+P(X=2)=0.181 (to 3 s.f)
1 Let p be the proportion of all adults in the country who at the time of the survey had
used wireless means for online access.

II
p  1262
  2.17
p 1  p

 0.0227
 0,5
(5,5
2253 n
pts)
A 97% confidence interval for the proportion of all adults in the country who at the 0,5
time of the survey had used wireless means for online access is  0.5374;0.5828 
2 Let µ be the true average arsenic concentration in all such water specimens.
n  10, s  4.1, x  24.3
0,5
s
  2.262  2.9328
n
Hence, the 95% confidence interval for the true average arsenic concentration in all 0.5

 
such water specimens is x   ; x     21.3672; 27.2328  .

3 Let µ be the true average time necessary to achieve a 100°F temperature of the
heating equipment.
H 0 :   15 ; H a :   15 .
0,5
n  42, s  2.2, x  16.5

z
 x  15 42  4.4187 0.5
s
 P _ val  0  0.05    or z  ztb  1.96  0.5
We reject H0. There is the evidence to reject the claim.
Hence, the data cast doubt on the company’s claim.
4 Let µ1 , µ2 be the true average zinc mass of the two brands of size D batteries: 0,25
Duracell and Energizer, respectively.
H 0 : 1  2 ; H a : 1  2 .
0,5
n1  41, s1  2.5, x1 17.9 ; n2  51, s2  3.4, x2  19.4
x  x 
0.25
1 2
z  3.9567  P_value  0    0.02
2 2
s s
1
 2

n1 n2
We reject H0 at  = .02. There is evidence of a difference in true average zinc mass
for two types of batteries.
5 r = 0.9623895711: Strong relationship 0,25
Slope: B=0.008349892; y-Intercept: A= 1.759866075
The “best-fit” linear model is: y=1.759866075+0.008349892x; 0,25
Interpretation of the slope: Slope: B=0.008349892>0: Direct relationship 0,25
For every additional shear force, the fiber dry weight acquires an estimated
1.759866075 percent per kilogram.
If x = 132, the point estimate for the percent fiber dry weight is y=2.862051838. 0,25
HCMC UNIVERSITY OF TECHNOLOGY SOLUTION For FINAL EXAM, SEMESTER 2, 18-19
AND EDUCATION Subject: Mathematical Statistics for Engineers
HIGH QUALITY TRAINING FACULTY Course code: MATH130401E
-------------------------
Q Content pts
1 Let A be an event that there are both genders (male and female) in 5 students.
There are   C505 ways to choose 5 students randomly.
0,5
There are A  C205  C305 ways to choose 5 male or 5 female students.
A 5
C20  C305
The probability : P( A)  1   1  0.92542
 C505 0,5
2 Let A1 be an event of selecting an individual affected by H1N1.
Let A2 be an event of selecting an individual not affected by H1N1.
Let C be an event of positive result.
P( A1 )  0.005 ; P( A2 )  0.995
P(C | A1 )  0.97 ; P(C | A2 )  0.02
By the law of total probability we find the probability of positive result:
0.5
P(C)  P( A1 ) P (C | A1 )  P( A2 ) P(C | A2 )
 (0.005)(0.97)  (0.995)(0.02)  0.02475
Using Bayes formula, we find the probability that he is indeed affected by H1N1:
I P( A1.C) P( A1 ) P (C | A1 ) (0.005)(0.97) 0.5
P ( A1 | C)     0.1959
P(C) P (C) 0.02475
(4,5 
3 4
4
pts) a/ 

f ( x)  1   k .x.( x 2  8)  1  k 
1
495 0.5
4
4
b/ P ( X  3)   x( x 2  8)dx  0.5798
3
495 0.5
Let Y be the number of ducks greater than 3 kg in 10 ducks. Then Y has the
binomial distribution with n=10; p= 0.5798
P (Y  4)  C104 (1  0.5798)6 (0.5798) 4  0.13063 0.5
4. X  N (6; 1.25) ;   6 ;   1.25
The probability that a light bulb has lifetime greater than 7 years
 X  76  X 
P ( X  7)  P    with Z   N (0,1)
  1.25  
 P ( Z  0.89)  1   (0.89)  0.1867 0,5
Let Y be the number of light bulbs greater than 7 years in 100 light bulbs. Then Y
has the binomial distribution with n=100; p=0.1867
   n. p  18.67
Approximate to normal distribution  2 0,5
  n. p.(1  p)  15.1843
 X   25.5  18.67  X 
P ( X  25)  P ( X  25.5)  P    with Z   N (0,1)
  15.1843  
 P ( Z  1.75)   (1.75)  0.9599
1a n  257; x  5.5768 ; s  0.7127 0,5
Let µ be the true average salary of a person in Company A.
s
  2.58  0.1147 0,5
n
Hence, a 99% CI for true average salary of a person in Company A is
 
  x   ; x     5.4621;5.6915  .
0,5

1b Let µ be the true true average salary of a person in Company A


H0 :   6; H a :   6 .

z
 x  6 n  9.519
s 0.5

  0.02   ( z /2 )  1   0.99  z /2  2.33
2
Because z  z /2 : We reject H0 ; accept H a .
There is the evidence to reject the claim.
Hence, true average salary of a person in Company A is not 6 million/ month. 0.5
II 2a Let p1 be the proportion of respones with no incentive resulted.
(5,5 p2 the proportion of response including a chance to win a lottery .
pts)
H 0 : p1  p2 ; H a : p1  p2
75 66 75  66 141
n1  110 ; n2  98 ; f1  ; f2  ; f  
110 98 110  98 208

z
 f1  f 2   0.1286
1 1 0.5
f (1  f )   
 n1 n2 
  0.05 ;  ( z )  1    0.95  z  1.65
Because z   z : We accept H0 .
0.5
The claim is wrong. The proportion of respones with no incentive resulted is equal to the
proportion of response including a chance to win a lottery .
2b Let p be the proportion of respones with no incentive resulted.
p  75
110 0.5

  z

p 1  p
  0.10347  z  2.33

n
 ( z )     (2.33)    0.99   0.5

3 r = 0.99238: two variables X and Y have a linear regression relationship 0.5


Slope: B=24.0421; y-Intercept: A= 123.44
The linear regression equation is: y=123.44+24.0421x;
If x = 70, then y=1806.382 0.5
HCMC UNIVERSITY OF TECHNOLOGY SOLUTION For FINAL EXAM, SEMESTER 3, 18-19
AND EDUCATION Subject: Mathematical Statistics for Engineers
HIGH QUALITY TRAINING FACULTY Course code: MATH130401E
-------------------------
Q Content Pts
1 Let A be an event that this group can understand the rules.
There are   C304 ways to choose 4 students randomly. 0,25

There are A  C234  C224  C184 0,5


A 0,25
The probability : P( A)  1   0.47837

2 Let A1 be an event of the overnight mail parcels via express mail service E1.
Let A2 be an event of the overnight parcels are sent via express mail service E2.
Let A3 be an event of the overnight parcels are sent via express mail service E3.
Let C be an event of the parcel arriving late.
P( A1 )  0.3 ; P( A2 )  0.5 ; P( A3 )  0.2
P(C | A1 )  0.03 ; P(C | A2 )  0.01 ; P(C | A3 )  0.02
0.5
By the law of total probability we find the probability of the parcel arriving late:
P(C)  P( A1 ) P(C | A1 )  P ( A2 ) P (C | A2 )  P( A2 ) P(C | A2 )  0.018
Using Bayes formula, we find the probability that the parcel arriving late was E2:
P( A2 .C) P( A2 ) P(C | A2 ) (0.5)(0.01) 5 0.5
I P( A2 | C)    
P(C) P(C) 0.018 18
(4,5 3  2
2 7
 f ( x)  1   7 (5  x ) 1 k 
2
pts) a/ 0.5
 1
24
2
0.25
2 7 2
b/ P( X  1)   . (5  x 2 ) 
1
7 24 9
Let Y be the number of labs which the temperature exceeds 1. 0.5
Then Y has the binomial distribution with n=14; p= 2/9.
P (Y  2)  1  P (Y  1)  P(Y  0)  1  C114 (2 / 9)(7 / 9)13  (7 / 9)14  0.851766
0.25
4. Let X be actual contents of jars of a certain type.
X  N (130.2; 1.2 2 ) ;   130.2 ;   1.2
The probability thata jar more than 132 oz.
 X   132  130.2  X 
P ( X  132)  P    with Z   N (0,1)
  1.2   0,5
 P ( Z  1.5)  1   (1.5)  1  0.9332  0.0668
Let Y be the jars total number of jars is selected. Then Y has the negative binomial
distribution with p=0.0668 ; r=4
0,5
P(Y  16)  C153 (0.0668)3 (1  0.0668)12 (0.0668)  0.0039519
II 1a n  380; x  260.3684 ; s  77.5626 0,5
(5,5
pts) Let µ be the true average electric energy consumption of a household in city A.
  97%  z /2  2.17 0,5
s
  2.17  8.6341
n 0,5
Hence, a 97% CI for the true average electric energy consumption of a household in city A
 
is   x   ; x     251.7343; 269.0025  .
1b Let µ be the true average electric energy consumption of a household in city A.
H 0 :   250 ; H a :   250 .

z
 x  250 n  2.6058
s 0.5
  0.02   ( z )  1    0.98  z  2.06
Because z  z : We reject H0 ; accept H a .
The true average electric energy consumption of a household is greater than the stated
value . The claim is right.
0.5
2a Let p1 be the proportion of respones of resumes with “white” name.
p2 the proportion of response of resumes with “black” name.
H 0 : p1  p2 ; H a : p1  p2
220 160 220  160 19
n1  2400 ; n2  2600 ; f1  ; f2  ; f  
2400 2600 2400  2600 250

z
 f1  f 2   4.0164
1 1 0.5
f (1  f )   
 n1 n2 
  0.05 ;  ( z )  1    0.95  z  1.65
Because z  z : We reject H0 ; accept H a .
0.5
The claim is right. The resume with a “black” name is not less likely to result in a
response than is a resume with a “white” name.
2b Let p be the proportion of responses of the first type. (“white” name)
p  220 ; z  z  2.33
 
2400 0.5

  z

p 1  p
 0.01372
n
The 99% lower bound for the population proportion of responses of the first type: 0.5
p  p    p  0.105386
3 r = 0.996885: two variables X and Y have a linear regression relationship 0.5
Slope: B=60.0873; y-Intercept: A= -3.28968
The linear regression equation is: y=-3.28968+60.0873;
If x = 30, then y=1799.32932 0.5
HCMC UNIVERSITY OF TECHNOLOGY SOLUTION For FINAL EXAM, SEMESTER 1, 2020-2021
AND EDUCATION Subject: Mathematical Statistics for Engineers
HIGH QUALITY TRAINING FACULTY Course code: MATH130401E
-------------------------
Q Content Pts
1 Let A be an event that we get a triangle.
There are   C133 ways to choose 3 points randomly. 0,25
There are A  C62 .C71  C61 .C72 =231 0,5
A 21
The probability : P( A)    0.8076923077 0,25
 26
2 Let A1 , A2 , A3 be the event of the customers use A95; A92; E5, respectively
Let C be the event thatthe customers fill their tanks
P( A1 )  0.4 ; P( A2 )  0.35 ; P( A3 )  0.25
P(C | A1 )  0.35 ; P(C | A2 )  0.65 ; P(C | A3 )  0.55
By the law of total probability we find the probability that the customers fill their
tanks:
P(C)  P ( A1 ) P(C | A1 )  P( A2 ) P(C | A2 )  P( A2 ) P(C | A2 )  0.505 0.5
Using Bayes formula, we find the probability that these customers use A92:
P( A2 .C) P( A2 ) P(C | A2 ) (0.35)(0.65)
P( A2 | C)     0.4504950495
P(C) P(C) 0.505 0.5
I
3  6
3
(4,5
a/ 

f ( x)  1   k ( x 2  10 x  24)dx  1  k 
4
4
0.5
pts) 5
3
b/ P( X  5)   ( x 2  10 x  24) dx  0.5
4
4
Let Y be the number of the products that we must select until finding a product less 0.5
than 5 gram.

Then Y has the negative binomial distribution with p= 0.5


0.5
1
P (Y  12)  (1  0.5)11 0.5 
4096
4. Let X be the cholesterol level in males 30 years old.
X  N (220; 302 ) ;   220 ;   30
The probability that males have cholesterol level greater than 240 mg/dl
 X   240  220  X 
P ( X  240)  P    with Z   N (0,1)
  30   0,5
 1   (0.67)  1  0.7486  0.2514
Let Y be the number of males having cholesterol level greater than 240 mg/dl. Then Y
has the binomial distribution with p=0.2514 ; n=12
0,5
2
P(Y  2)   C202 (0.2514)i (1  0.2514)20i  0.0089
i 0
1a n  70; x  620.12 ; s  162
Let µ be the true average CO2 level in the population of all homes
  95%  z /2  1.96
0,5
s
  1.96  37.9508988
n
Hence, a 95% CI for the true average CO2 level in the population of all homesis 0,5

 
  x   ; x     582.1691012;658.0708988 .
1b Let µ be the true average CO2 level in the population of all homes
H 0 :   600 ; H a :   600 .
0.25
z
 x  600  n  1.039111095
s 0.5

  0.02   ( z /2 )  1   0.99  z /2  2.33
2
Because z  z /2 : We accept H0.
the true average CO2 level in the population of all homes is equal to the stated value. The claim
is wrong. 0.5
II 2a Let p1 , p2 be the true proportion of such consumers who never apply for a rebate claim
(5,5 after purchasing the product A last year and this year, respectively.
pts)
H 0 : p1  p2 ; H a : p1  p2
300 200 300  200 5
n1  1200 ; n2  1000 ; f1  ; f2  ; f  
1200 1000 1200  1000 22 0.25

z
 f1  f 2   2.786522184
1 1 0.5
f (1  f )   
 n1 n2 
  0.05 ;  ( z )  1    0.95  z  1.65
Because z  z : We reject H0 ; accept H a .
0.5
The claim is right. The true proportion of such consumers who never apply for a rebate
claim after purchasing the product A last year is greater than this year.
2b Let p be the true proportion of such consumers who never apply for a rebate claim after
purchasing the product A this year
p  200 ; ( z )  0.97  z  1.89 0.5
 
1000

0.5
  z

p 1  p
  0.02390681911
n
The 97% lower bound for the population proportion of consumers who never apply for a
rebate claim after purchasing the product A this year:

p  p    p  0.1760931809
3 r = 0.98282017: two variables X and Y have a linear regression relationship 0.5
Slope: B=6.972803645; y-Intercept: A= 24626.22811
The linear regression equation is: y=24626.22811+6.972803645x;
0.5
SOLUTION TO FINAL EXAMINATION
Course name: MATHEMATICAL STATISTICS FOR ENGINEERS
SEMESTER 2 – ACADEMIC YEAR 2019-2020
---------------------------------------------------------------------------------


Q1: Let Ai = hard drives from plant i , i = 1,2, 3 , 

B = a randomly selected hard drive is defective  .25 points

3
157
( )  P (A ) P (B | A ) = .54  .04 + .35  .08 + .11  .12 = 2500
a. P B =
i =1
i i
= .0628 .25 points

P ( A ) P ( B | A ) .35  .08 70
P (A | B ) =
2 2
b. = =  .4459 .5 points
P (B )
2
.0628 157

Q2:

X = number of defective tires you find before you finding 4 good tires
a.
→ X ~ NegBin 4,.95 ( )
2

(
P X 2 = ) C x =0
4 −1
x + 4 −1
 .954  .05x  .99777 .5 point

r (1 − p ) 4 r 1−p( )= 80
b. E X ( ) = = ,
p 19
( )
V X =
p 2
361
.5 points

Q3: Let X = number of integrated circuits ICs is faulty → X ~ H 25, 5, 4 ( ) ( ) .5 points

The probability this shipmet of 25 ICs will be accepted:

1
C 5xC 204 −x 2109
(
P X 1 = )  x =0 C 4
=
2530
 .8336 .5 points
25

X − 118
Q4: Let X = free speeds can best → X ~ N 118,13.12 , Z = ( ) 13.1
( )
→ Z ~ N 0,1 .25 points

 180 
(
a. P X  100 = 1 − −)  = .915286 .25 points
 131 
b. Y = number of vehicles is not exceeding the posted speed limit

( )
→ Y ~ Bin 5, p , with p = P(X  100) = .084714 .5 points

( ) C ( )
5 −x
P Y 1 = x
5
px 1 − p  .3576 .5 points
x =1
Q5:

a. n = 16, x = 301.1875, sx = 91.3668202 .25 points


= .95 → = .05 → t /2,n −1
= 2.131 .25 points
A 95% confidence interval for the population mean is:
s
x t
= 252.5118265, 349.8631735
/2,n −1 ( ) .5 points
n
b. Let = theaverageamount of water is filled in 1liter bottles .25 points

Null hypothesis H 0 : =1 Alternative hypothesis Ha : 1 .25 points


= .01 → z = 2.3265

20
(
z = .996 − 1 ) .008  −2.2361 Reject H a .25 points

That the average fill is 1 liter. .25 points


Q6:

a. A
= average worker productivity after wearing noise reduction device

B
= average worker productivity before wearing noise reduction device .25 points

Let d
= A
− B

Null hypothesis H 0 : d
=0 Alternative hypothesis Ha : d
 0 .25 points
we have: nd = 15, d = 1.066666667, sd = 3.731462116
= .05 → t ,n −1
= 1.761

( )s
nd
t = d −0  1.1071  t ,n −1
→ Reject H a .5 points
d
That wearing the noise reduction head gear increases worker productivity is
wrong. .25 points
b. pm = proportions of men said " yes ", pw = proportions of women said " yes " .25 points

291 217 508


We have: fm = , fw = ,f =
349 336 685
Null hypothesis H 0 : pm = pw Alternative hypothesis Ha : pm  pw .25 points
= .02 → z /2
 2.3265
fm − fw
z =
 1
 5.6184  −z ( /2
,z /2 )→ Reject H 0 .5 points
1 
(
f 1− f  + )
 349 336 
That different proportions of men and women in this student population would
be willing to marry beneath their class. .25 points

Q7: The correlation coefficient: r = 0.8764526758 .25 points


The equation of least-squares regression line for predicting metabolic rate
from body mass: .25 points

y = 201.1615996 + 24.02606662x .25 points

When x = 45 → y = 1282.334598 .25 points

You might also like