Statistics Workbook
Statistics Workbook
of
Statistics
1
Descriptive Statistics
1. In a study on the growth of male children in the age range of 1
At the age of 2, a sample was collected with the weight (in kg) of 50 boys. In the table
The following are the collected data.
13.6 13.2 14.8 15.5 13.7 14.0 14.2 13.5 11.5 15.0
11.9 12.7 13.8 12.9 14.6 13.7 14.2 15.1 12.7 13.5
13.4 15.7 11.5 12.7 11.9 15.7 14.8 10.9 12.7 14.0
12.4 14.8 15.3 14.0 15.2 13.8 13.5 12.7 15.0 11.9
12.4 12.0 12.0 11.4 13.8 11.1 11.7 13.7 10.0 13.2
a) Using class intervals with equal amplitudes of 0.9, determine the average,
median, mode, and standard deviation of the set of recorded values.
93 76 68 84 75 82 68 90 62 88 75 85 73 79 88 73 60 93 71 59
72 63 61 65 75 87 74 62 95 78 60 68 66 78 82 75 94 77 69 74
71 83 96 78 89 61 75 95 60 79 75 71 79 62 67 97 78 85 76 65
74 53 65 80 73 57 88 78 62 76 77 85 86 67 73 81 72 63 76 75
mode
00200033001850043062231101011012120
01643312400312000001102024402202000
2
a) Build the absolute frequency distribution of the number of absences per day.
c) Determine the median, variance, and standard deviation of the number of absences per day.
f) What conclusions can you draw about the symmetry of the distribution?
752 755 725 753 764 738 757 744 747 754 741 750 757 745 754 750 729 742 754 747
755 736 744 753 721 738 732 752 745 758 740 751 746 736 741 748 735 747 727 750
743 750 732 749 745 736 733 741 749 743 748 749 737 737 749 740 724 753 738 752
747 735 743 751 726 749 741 751 745 754 753 745 749 731 746 737 741 728 750 747
747 740 741 730 739 754 739 744 755 748 759 750 756 740 745 742 730 736 750 754
a) Consider the data presented in the table and group them into cells, calculating the
respective absolute and relative frequencies. Justify the decision regarding the number
of adopted classes.
(a) 50 52 54 56 58 60 62
(b) 41 46 51 56 61 66 71
Determine, for each of these sets, the mean, variance, and standard deviation.
3
6. Consider two classes for which the ages of their students are indicated.
Idade: 17 18 19 20 21 22 23 24 25 26 27
0 4 4 6 12 10 8 0 0 0 0
For each class, determine: mean, mode, median, variance, standard deviation
What conclusions can you draw about the symmetry of the distribution?
7. The following table indicates the distances (in Km) to be covered by an athlete.
of marathon during training, in the 60 weeks leading up to an event
important.
93 76 68 84 75 82 68 90 62 88 75 85
73 79 88 73 60 93 71 59 72 63 61 65
75 87 74 62 95 78 60 68 66 78 82 75
94 77 69 74 71 74 53 65 80 73 57 89
74 65 63 72 81 83 57 64 59 76 72 60
a) Define the intervals for the classes and determine: mode, median, average, variance and
standard deviation.
8. The number of phone calls (per minute) received by a certain company was
registered over a period of 50 minutes, observing the following values:
1 0 1 1 0 0 2 2 1 1 0 1 4 0 3 1 3 1 0 1 4 0 1 0 2
0 1 0 2 0 0 1 1 0 0 1 2 1 0 0 1 1 0 2 0 0 1 2 2 1
b) Calculate approximations for the mean, median, mode, and standard deviation of this.
sample.
c) Draw the frequency histogram. Does the sample show any skewness?
Justify.
4
36 people were asked to classify the Health System in Portugal accordingly.
with the following scale: 1 (terrible), 2 (bad), 3 (slightly reasonable), 4 (reasonable), 5 (very
reasonable), 6 (good), 7 (very good), 8 (excellent). The ratings were:
5 2 7 6 3 7 8 3 2 6 3 6 3 7 5 3 6 7
3 7 6 4 3 5 8 6 5 4 3 6 6 5 7 8 4 3
b) Calculate approximations for the mean, median, mode, and standard deviation of this.
sample.
Road 6.7 6.7 7.1 7 6.9 7.3 7.4 7.2 7.1 6.5
City 7.4 7.1 8.1 8.3 8 8.8 8.9 8.7 8.4 8.1
5
a) Calculate the average and variance of daily consumption in the city.
c) Calculate the sample correlation coefficient for the linear adjustment between the
kilometers traveled (response variable Y) and highway consumption (variable
independent X).
12. The annual production of strawberries from a greenhouse depends on the percentage of
existing humidity. The following table shows the value of this production, in
tons, depending on humidity:
Humidity (%) 20 25 30 35 40
a) Quantify the quality of the adjustment made (sample correlation coefficient) and
present the conclusions.
6
Correlation and linear regression analysis
The 4 pairs of values are related to the variables: memory size (Mbytes)
the transferred bytes (Mbytes).
a) Create the scatter plot. Conclude about the correlation between the two.
variables.
b) Calculate the Pearson correlation coefficient. Conclude about the correlation.
between the two variables.
c) Estimate the linear regression line and the coefficient of determination.
a) Make the scatter plot. Conclude about the correlation between the two.
variables.
b) Calculate the Pearson correlation coefficient. Conclude about the correlation.
between the two variables.
c) Estimate the linear regression line and the coefficient of determination.
7
To study the pollution of a river, a scientist measured the concentration of a
determined organic compound (Y) and the rainfall precipitation in the week
anterior (X):
X Y
0.91 0.10
1.33 1.10
4.19 3.40
2,68 2.10
1.86 2.60
1.17 1.00
8
5. The table below shows the frequency of the average pulse in different periods.
age groups:
There are suspicions that the quality of the medicine depends on the maturation time.
spent in its production. To verify this, a pharmaceutical laboratory
collected the following data:
Tempo-X Quality-Y
1 23
2 31
3 40
4 46
5 52
6 63
9
Probabilities
Two events have P(A)=1/4, P(B|A)=1/2, and P(A|B)=1/3.
2. Given two events A and E for which P(A)=0.67, P(E)=0.23, and P(A∩E)=0.12,
find P(Ā) E).
P(E|F) F).
5. Consider the random experiment that consists of the simultaneous throwing of two
perfect data and the following events
6. In the roll of a die, determine the probability that it will land on:
b) Face by a multiple of 3
10
7. Let A and B be independent events and P(A) = 1/6 and P(B) = 1/4. Determine:
8. Let A and B be two events such that P(A)=1/4, P(B)=2/3 andP(A∩ B)= 1/6.
Determine
P(B A)
10. A city of 200,000 inhabitants has two daily newspapers available: 'The
"Aurora" and "The Knower". An investigation revealed the following data:
-50,000 people read 'O Aurora' daily;
-40,000 people read 'O Conhecedor' daily;
-5000 people read both newspapers daily.
What is the probability that when we randomly choose an inhabitant of this city, they are
reader:
from at least one of the newspapers.
11
b) Product 2 is sold because product 1 is sold.
At least one of the products must be sold.
13. After some tests carried out on the quality of a certain component
electric, it was concluded that this has a type I defect with an equal probability
a 0.6, a type II defect with a probability of 0.7 and no type I defects
neither type II defects with a probability of 0.25.
a) Determine the probability of the component having type I defect and type II defect.
b) Determine the probability that the component has only type I defect or only
type II defect.
c) Determine the probability of the component having a type II defect, knowing that it
there are no type I defects.
15. An accident can be caused by human error, brake failure, or tire blowout.
of tire, with the 1st cause being twice as likely as each of the others.
16. In one hospital, 50% of individuals had disease K, and 30% had disease L.
and 20% with disease M. The probability of curing disease K is 0.7; for diseases L and
The probabilities are respectively 0.8 and 0.9. A patient admitted was given
high. Calculate the probability that this individual has suffered from disease K.
12
serves 20% of the clients and makes a mistake 8% of the times, and Paulo serves 50% of the clients
of 20%, the probability of complaining solely due to the poor quality of the products is
15% is the probability of a customer complaining due to a delay in the delivery deadline.
the product quality is 15%.
a) Calculate the probability that a randomly chosen customer has complained due to poor service.
b) Calculate the probability that a randomly chosen customer has only complained due to
to the delay in delivery.
19. The analysis of road accident claims from the last year revealed that, among the victims of
accidents where they wore seat belts, 10% suffered serious injuries while
that, among the victims who were not wearing seatbelts, 5% sustained injuries
lightweights. It was found that 75% of drivers were using the seat belt.
a) Considering an accident in which the victim was wearing a seatbelt, what is the
probability of it being a minor injury?
b) Considering an accident that resulted in a serious injury, what is the probability
Did the victim not wear a seatbelt?
13
of 20%, the probability of complaining solely due to poor product quality is
15% is the probability of a customer complaining due to a delay in the delivery deadline.
the product quality is 15%.
a) Calculate the probability of a randomly chosen customer having complained due to poor service.
quality of products.
b) Calculate the probability that a randomly chosen customer only complained due to
to the delay in the delivery deadline.
21. The incidence of a certain disease in the population of a country in the fourth
the world is 3%. A test available to detect the disease shows a result
positive in 96% of people who are actually sick. Unfortunately, the test
it also gives a positive result in some people who do not have the disease but
only in 8% of these.
a) A certain individual from that country underwent the test and the result was
negative. What is the probability that he actually has the disease?
b) If the test is positive, what is the probability that he does not have the disease?
23. From the set of large companies in a given industrial sector, it is known that 60%
Of these companies, 40% have a quality control department.
of human resources and 20% have both departments. A company is
randomly selected from the file of companies in the referred sector. Calculate the
probability of this company:
14
Solutions
1.1/8; 3/8; ½
5.8/36
1/24
3/4
9.0.75
17/40
0.5
16.0.4545
15
Random Variables
1. Determine if the given values can be used as values of a function of
probability of a random variable in the range of values x=1,2,3,4.
−2
a)f(x)= x= 1,2,3,4,5
5
x2
f(x)= x= 0,1,2,4
30
1
c)f(x)= x= 0,1,2,3,4,5
5
4. Find the cumulative distribution function of the random variable that has the
probability distribution:
x
f(x)= x= 1,2,3,4,5
15
16
⎧0 x< 1
⎪1 3 1 ≤ x< 4
⎪⎪
F ( x ) = ⎨1 2 4≤ x< 6
⎪5 6 6≤ x< 10
⎪
⎪⎩ 1 x≥ 10
a)P(2< x≤ 6);
b)P(X= 4);
c)f(x).
⎧1 2< x< 7
⎪
f(x)= ⎨ 5
⎪⎩ 0 for other values
a) Draw the graph and verify that the total area under the curve is equal to 1.
b) Calculate aP(3< x≤ 7) .
c) Determine the cumulative distribution function and use it to calculate item b).
⎧ c 0 < x< 4
⎪
f(x)= ⎨ x
⎪0 for other values
⎩
c) Determine the cumulative distribution function and use it to calculate part b).
8. Find the expected value of the discrete random variable X with the following
probability distribution
x− 2
f(x)= x= −1,0,1,3
7
17
9. Find the expected value of the continuous random variable X with the following function
probability density
⎧x
⎪ 0< x< 2
f(x)= ⎨ 2
⎪⎩ 0 other values
Find:
a) E[X];
b) VAR[X].
11. Consider the random variable that represents the number of men who do
part of a group of 6 people randomly selected from 4 women and 5
men that is defined by the following probability function:
x p(x)
2 5/42
3 20/42
4 15/42
5 2/42
a) Determine the location parameters (mean, median, and mode) and the dispersion
(variance and standard deviation) of that variable.
b) Calculate the probability of having at least 3 men in the referred group and the
probability of having exactly zero women.
18
a) Determine the parameters a and b.
b) Graphically represent the probability density function and the function
Probability distribution of the variable t.
c) Determine the location parameters (mean, median, and mode) and dispersion.
(variance) for the variable t.
d) CalculateP(t≥ 2).
13. A box contains the following 4 cards: the duke of clubs, the suit of hearts, the
deck of spades and the ace of diamonds. Two cards are drawn from the box, with replacement
from the first before the second is extracted. Consider the random variable X, which
represents the sum of the points obtained on the two cards.
c) Calculate the probability that the sum of the points obtained does not exceed 5.
d) Solve again the previous items, assuming that the first card is not replaced.
in the box before extracting the second one.
No;
No
⎧0 x< 1
⎪ 1/15 1≤ x< 2
⎪
⎪⎪ 3/15 2≤ x< 3
4.F ( x ) = ⎨
⎪ 6 / 1 5 3 ≤ x< 4
⎪ 10/15 4≤ x< 5
⎪
⎪⎩ 1 x≥ 5
19
⎧0 x <1
⎪ 1/3 1≤ x < 4
⎪⎪
1/2f(x)= 1/6 ⎨ 4≤ x< 6
⎪ 26/ 6≤ x< 10
⎪
⎪⎩ 1/6 x≥ 10
⎧0 x< 2
7 ⎪ x−2
6.a)15/dx=∫ 1 ; b)4/5; c)F ( x ) = ⎪ 2≤ x< 7
⎨
2 ⎪ 5
⎪⎩ 1 x≥ 7
⎧0 x< 0
⎪
7.a) 1/4; b) 1/4; 1/2; c) F ( x⎪⎨) =
x
0 ≤ x< 4
⎪ 2
⎪1 x≥ 4
⎩
8.1/7
9.37/12
4/3
a=7/18 b=-1/18
⎧0 x< 4
⎧ 1/16x= 4 ⎪ 1/16 4≤ x< 5
⎪ 4/16x= 5 ⎪
⎪ ⎪ 5 / 1 6 5 ≤ x< 6
⎪⎪ 4 / 1 6 x = 6 ⎪
13.a)f(x)= ⎨ F ( x ) = ⎨9 / 1 6 6 ≤ x< 7b) 6.5; 1.53;
⎪ 2/16x= 7 ⎪ 11/16 7≤ x< 8
⎪ 4/16x= 8 ⎪
⎪ ⎪ 15/16 8≤ x< 10
⎪⎩ 1/16x= 10 ⎪1
⎩ x≥ 10
c)5/16; d)6.5; 1.258; 4/12.
20
Probability Distributions – discrete and continuous
1 defective screw.
b) 0 defective screws.
d) How many questions should the test consist of for the probability of
you get three of the questions right, which is approximately 25%.
21
The probability of a student entering university graduating is
0.4. Determine the probability that, among 5 students:
b) To graduate.
7. If the probability of an individual suffering from a harmful reaction, resulting from the
the injection of a certain serum is 0.001, determine the probability of among
2000 individuals:
i. P(X>t)=0.2514
ii. P(X<t)=0.97982
b) Between 5 x 10310x103hours?
22
10. Suppose that the time an operator takes to fill out a form
electronic is uniformly distributed between 1.5 and 2.2 minutes.
a) Calculate the average and variance of the time that the operator needs to fill out
the form.
b) What is the probability that the operator will need less than 2 minutes to
fill out the form?
a) Determine the probability that on any given day, the train will be late.
maximum 8 minutes.
b) If a customer arrives at the train station at 8:10, what is the probability of that
client to catch the train?
13. If the heights of 300 students are normally distributed, with an average of
172.72 cm and a standard deviation of 7.62 cm, how many students have heights:
23
Solutions to the exercises on Probability Distributions
1.0.8418
2.0.0076
3.0.4762
80.04
0.04
24
Confidence Intervals
2. Let X be a population with a normal distribution of mean μ and standard deviation equal to
2. A random sample of size n = 25 was taken from this population and
revealed an averagex78.3.
b) What is the width of the confidence interval (at 99% confidence) when estimating μ by
x= 78.3?
c) What should be the sample size for the range (at 99% of
confidence), when estimating μ byxdo not exceed 0.1?
f) What should be the sample size for the range, at 95% confidence, at
estimate μ forxDon'texceed 0.1? And at 99.9% confidence? Interpret the
results.
From a normal population with mean μ and variance equal to 4, a sample was taken.
of 9 observations in whichx= 4.
4. Consider a normal random variable with a variance of 4. The following sample was collected:
b) What should be the level of confidence to use for the range of the interval
was 2.77?
c) Indicate the sample size you would consider for the range of the
the interval should be less than one, under the conditions of paragraph 1.
d) Briefly explain what would happen if the degree of increase were raised to 99%
trust, maintaining the sample.
25
5. Consider the sample of a normal random variable with variance equal to 4:
a) Deduce and indicate a confidence interval, at 80%, for the population mean.
b) What should be the level of confidence to use for the range of the interval
was 2.77?
6. A paper company wants to estimate the average time required for a new
machine produces a ream of paper. It is known that a sample of 36 reams
It averages about 1.5 minutes per ream of the machine. Assuming that σ = 0.30
minutes, construct a 95% confidence interval.
The owner of a café wants to calculate the average daily profit per customer. In a sample of
100 clients verified that the average spending per client was 350 units.
monetary units (u.m.), with a standard deviation of this sample being 75 u.m. Estimate a
confidence interval for the true average spending with 90% confidence.
b) Considering that the estimate for the average value is not accurate enough
(given that the confidence interval is too large), one wonders: what should it be
the sample size so that the 95% confidence interval has a
amplitude of 3 kilos?
26
a) Deduce a (1− α)100% confidence interval for the average flow of the river.
b) The following confidence interval was obtained for the average flow rate
]6.503; 7.376[
12. We weighed 16 bags of coffee and with the observed weights, in grams, we constructed
the following 95% confidence interval for the average weight of a bag:
1000.74; 1009.26
a) Deduce the average value and the standard deviation of the weight of the bags that make up the
sample, assuming the normality of the population.
Cat. 1 57.9 66.2 65.4 65.2 62.6 67.6 63.7 67.2 71.0 65.4
Cat. 2 66.4 71.7 70.3 69.3 64.8 69.6 68.6 69.4 65.3 68.8
Determine a 95% confidence interval for the difference of means of the data.
obtained from the two catalysts.
14. It is intended to investigate the level of salary remuneration of men and women.
certain professional category. From two samples obtained between two groups, stand out-
if the following results (in u.m.):
Construct a 99% confidence interval for the average salary differences between
both sexes and conclude about the possible existence of sexual discrimination in the
assignment of compensations.
15. A trader intends to acquire fruits from one of the orchards A or B. How the
the weight of the fruits is a preferential factor, the merchant takes a casual sample of
36 fruits (in each orchard) and choose the orchard that corresponds to the sample with
greater average weight. If the weight of the fruits is normally distributed, being:
27
Pomar: Average (g): Standard deviation (g) :
A 20 2
B 18 5
What is the probability of the merchant choosing orchard B?
[7.398, 12.602[
a) Knowing that, with the sample information, s = 4 was obtained, what is the degree of
confidence that can be attributed to the aforementioned interval?
b) Based on the same sample, build a 95% confidence interval for the
population variance.
c) Suppose that the true population variance is 44. If you intend to construct
a confidence interval (at 95% for the population mean) whose width does not
exceed 6.5, what should be the sample size to consider?
28
19. Collected were 9 observations of a random variable N (8;σ) obtaining the following values:
I =1 i =1
The height (in mm) of the soap foam in a basin is important for the
detergent manufacturers and it is assumed that their behavior is Normal. It was
an experiment was conducted, placing the same amount of detergent in 10
standard size basins and, after a certain agitation of the water, it was measured
height of the foam. The following results were obtained:
10 10
∑ x=i 229 ∑ (x − x=) 1553
i
2
i =1 i =1
a) Determine a point estimate for the mean and for the standard deviation.
c) Comment on the two types of estimates obtained (in the previous sections) for the
standard deviation.
c) The market share could be, with 95% confidence, 56.5% (value
midpoint of the interval;
The obtained result indicates only that it is timely to proceed with the launch of the
new bath foam.
29
23. In a region affected by an epidemic outbreak, a sample of 2500 was observed.
individuals, having found 850 contaminated.
24. A sample of 40 first-year students was collected. othe ULusíada in the academic year of
2005/2006, it was found that 10 of these students attend the courses that
selected as the first option.
ii. and to maintain the level of confidence, indicate the sample size that should
to use?
c) If 200 samples of size 40 were collected from the same population, from
So that you would build 200 confidence intervals at 99%, how many of these
intervals I would expect to contain the true value of the proportion of
students in analysis?
c) If 4000 voters had been surveyed and 1680 had declared a preference for the
Plan A, what would the 95% confidence interval be now? Comment on the results.
26. Two surveys conducted (in 1990 and 1999) regarding consumption of
alcoholic beverages, for ages between 15 and 35, provided the following
data:
Through a confidence interval, at 98%, indicate the truth of the statement: 'The
percentage of consumers of alcoholic beverages, in individuals of ages
aged between 15 and 35 years, recorded a significant increase in the decade
of 90.
30
27. With the aim of identifying risk factors for coronary disease, the following were analyzed.
two samples of 215 men and 1140 women, it was recorded that 58 of
men and 217 of the women had diabetes.
28. Two samples drawn from two normal populations consisting of 21 and 9.
observations have variances given bys12= 24 es22= 9 respectively. Prepare a
confidence interval for the ratio of variances at 95% confidence.
a) Construct a 95% confidence interval for the ratio of variances and remove
conclusions about the dispersions of grades.
30. To find out the degree of preference of consumers from two cities in
Regarding a brand of detergent, a survey was conducted where the
respondents rated the product on a scale of 0 to 20. Thus, data was collected
randomly 21 consumer opinions from city A and 11 from city B, being
the results obtained are the following:
x A12.9sA2= 2.1
xB14.7sB2= 1.8
a) Deduce and calculate a 95% confidence interval for the quotient of the
variances of opinions and comment on the result obtained.
b) Indicate, justifying, how you could reduce the range of the interval.
anterior
31. Define point estimation and interval estimation. Say which of the
two will be better, justifying.
31
Solutions
⎤ σ σ ⎡
1. x−
⎥ zα / 2 ≤ µ ≤ x+ zα / 2
⎦ n n ⎢⎣
]77.2696; 79.3304[
n≥17330.
6.]1.402; 1.598[.
7.]337.6625; 362.3375[.
8.]0.4685; 0.9315[.
9.]26.3919; 37.6081[.
13.]−5.830,−0.570[.
14.]2.021, 3.579[ .
15.0.9742
16.]62.491, 143.51[ .
19.]0.4373; 3.5179[ .
20.]1.5609; 2.7824[.
False.
32
−0.037, 0.012
27.]0.0265, 0.1335[
28.]0.667, 7.760[ .
[1.0385, 3.2526]
[0.341, 3.236[
31.-
33
Hypothesis Testing
a) Test at a significance level of 5% if the company should be punished for requiring that
your employees work more than they should.
b) What kind of mistake can you make regarding the decision you made?
Assuming the normality of weights, what is your opinion, at the level of significance of
1%, relating to the manufacturer's statement.
a) Assuming that the average weight of the 10 eggs is equal to 57 grams, what decision should he make?
take?
b) Calculate the probability of a Type II error, considering the true average weight of
eggs should weigh 50g.
34
a) Assuming that the population standard deviation, considered normal, is equal to 5ml,
test, at the level of 5%, the hypothesis that the average volume of all milk packages is
equal to 1 liter.
b) Assuming that the population mean is equal to 998ml, calculate the probability of
accept the hypothesis tested in the previous item.
6. Suppose a certain television channel wants to know what had been the
percentage of people who watched a certain program. To this end, a
survey having interviewed 220 people, of which 132 said they had seen the
referred program.
A certain brand of motor oil claims that its oil is known for
last on average 5000 km with a variance of 250,000 km2Assuming that the
duration follows a normal distribution, test the statement regarding
variance, at a significance level of 5%, based on the following values of the number of
kilometers that 6 cars traveled before the oil burned:
35
The administrator of a Hospital intends to estimate the outstanding accounts (not yet
charged), pertaining to treatments and hospitalizations. Assume that the amount for each
debt follows a normal distribution and that the experience of previous years allows
consider a value for the standard deviation equal to 50 euros. To carry out your
objective, the administrator randomly selected 25 of these accounts and evaluated the
your value, having incurred a total debt of 1850 euros.
a) Determine a 95% confidence interval for the average amount of each account
not settled.
b) It can be statistically asserted, at the 1% level, that the average value of each account
Is the unpaid amount less than 100 euros? What mistake might you be making with your decision?
c) Assuming that the true average value of each outstanding account is equal to 85
euros, calculate the probability of making a type II error.
a sample standard deviation value, based on the 25 accounts, equal to 60
euros, could it endanger, at the level of 5%, the value previously considered?
10. Admit that a random sample of 400 households from a certain city
it revealed that 8% of these are rental houses, while in another city, one
A sample of 270 households revealed that 37 were rental houses.
11. The Duramais and Duramuito batteries cost the same price. To test if
both have the same duration, two samples of 100 batteries of each were collected
mark, having obtained the following results:
36
Brand Dimension of Average Standard deviation
sample
Duramais 100 1180 120
Duramuito 100 1160 40
12. Suppose that the average fabric production (in grams) of two looms of a
factories can be considered normal. Admit it, even in an experience conducted
with the aim of comparing these two looms, in terms of their average productions,
and obtained the following results:
8 8
1 ∑ i ∑ x = 816.664 grams
Tear:x= 80.8g i
2 2
i=1 i=1
9 9
∑ i
Tear=2 96.3g ∑ y = 1030.959 grams
i
2 2
i=1 i=1
a) At a significance level of 5%, test the equality of the variances of the looms.
b) Compare, at a significance level of 1%, the average productions of the two looms.
Admit the necessary assumptions.
37
b) Assuming that the service times of each of the employees follow a
normal distribution, construct a 99% confidence interval for the difference between
the average service times of the two employees. Comment on the results obtained.
arrived.
c) Assuming that the service times of each of the employees follow a
normal distribution, decide whether the director should agree with the complaints of
users, at a significance level of 5%.
d) Test the hypothesis that the variance of Mr. Snail's service time is equal to
350, at a significance level of 1%.
Process A 24 27 26 21 24
Process B 27 28 23 31 26
Assuming that the nicotine contents, per mg, follow normal distributions with
standard deviations equal to 2 for process A and 2.5 for process B, which can
conclude at the 5% level? And at the 1% level?
What to conclude about the magazine's news at a significance level of 10%? And at 5%?
38
Solutions
1.a) Z = 12.1, H is rejected0at the level of 5%.
b) Type I error.
8.a) IC95%(µ)=(54.4;93.6)
b) Z = -2.6, H is rejected0at the level of 1%. Type I error.
c)βP(Type II error) = 0.7967
d) Q = 34.56, H is not rejected0at a level of 5%.
9.Z = 2.5, Reject H0at the level of 5% and at the level of 1%.
11.Z = 1.58, Do not reject H0at the level of 5% and, consequently, at the level of 1%.
39
14.Z = -1.82, H is not rejected0at the level of 5% and, consequently, at the level of 1%.
Note: In hypothesis testing, it must always be stated what it means in each context.
to reject or not to reject the tested hypothesis.
40