Hypothesis Testing
Hypothesis Testing
Learning Objectives: To explain why there is a need to test the hypothesis and to identify the
test statistics to be used in testing the significance of difference between means.
Performance Standard:
· able to formulate null and alternative hypotheses,
· identify the appropriate test statistics to be used, and
· be able to perform simple tests of hypotheses.
Topics:
• Test of Significance
• Hypothesis: Null and Alternative
• Hypothesis Testing
• Level of Significance, Errors & Rejection/Acceptance Region
• Hypothesis Testing Approaches: Critical Value, p-Value
• The 5-step Solution
• Z-test and t-test
Test of Significance
Once sample data has been gathered, statistical inference allows researchers to assess
evidence in favor of some claims about the population from which the sample has been drawn.
The methods of inference used to support or reject claims based on sample data are known as
test of significance. In English, “significant” means important, while in Statistic “significant”
means probably true. When Statisticians say the result is “highly significant” they mean it is
probably true. They do not necessarily mean it is highly important. (Blay, n.d.)
In Descriptive Statistics, we have learned that when Group 1 for instance got an average
score of 90 while Group 2 has an average score of 85, we readily conclude that Group 1
performed better than Group 2. It is because Group 1 has a higher mean score than Group 2. But
it is a different thing in Inferential Statistics, we need to search for more evidence for saying so
beyond what numbers tell us. We need to find out the difference “5” (90-85) is big enough for us
to conclude that Group 1 really performed better than Group 2 by TESTING THE SIGNIFICANCE
OF DIFFERENCE between these two mean scores using some test statistic which we are about
to discuss. If the difference is found to be significant, it is only then that we can make a conclusion
that Group 1 performed better than Group 2.
Testing the significance of difference can also be done between two portions. Forexample,
if survey says 40% of respondents are in favor of divorce while 47% are not in favor, with 13%
undecided, we cannot just conclude that the difference between the proportion of favorand not in
favor is significant unless we do testing. But we cannot just test this difference. We need to write
it in the form of HYPOTHESIS, thus the process is called “HYPOTHESIS TESTING”.
Hypothesis tests are not limited to means and proportions. It is also conducted in
regression and correlation analysis to determine if the regression relationship and the correlation
coefficient (r) are statistically significant. A goodness of fit test refers to hypothesis test in which
the null hypothesis is that the population has a specific probability distribution, such as a normal
probability distribution. Nonparametric statistical methods involve a variety of hypothesis-testing
procedures.
Null Hypothesis (Ho) is always hoped to be rejected. It always contains the “=” sign (≤ or
≥). Whilethe Alternative Hypothesis (Ha) challenges Ho. It never contains the “=” sign. Uses “< or
> or ≠. Itgenerally represents the idea which the researcher wants to prove.
Every test of significance begins with a Null Hypothesis. It represents a theory that has
been put forward, either because it is believed to be true or because it is to be used as a basis
for argument but has not been proved. The Alternative Hypothesis is a statement of what a
statistical hypothesis test is set up to establish.
The logic of traditional hypothesis testing requires that we set up two competing
statements in the form of null hypothesis and alternative hypothesis. These hypotheses are
mutually exclusive and exhaustive. You will have no problem in the null hypothesis since it always
uses “=” sign. Your little trouble is the alternative, since you have three choices, “< or > or ≠. But
do not worry there are glaring clues.
Hypothesis Testing
Hypothesis testing is a procedure for deciding if the null hypothesis should be rejected in
favor of an alternative hypothesis or will not be rejected. There are types of hypothesis tests that
are determined by the alternative hypothesis. An alternative hypothesis may be one-sided
(one-tail right or left directional) or two-sided (two-tail non directional) depending upon the
problem. A one-tail left/right test is used if Ha claims that a parameter is smaller/greater than the
value given by the null hypothesis. A two-tail test claims that a parameter is simply not equal to
the value given by the null hypothesis, thus the direction does not matter. In short, If Ha uses >
or <, then the hypothesis test is One-Tail right or left directional respectively, and if Ha uses ≠, the
hypothesis test is Two-Tail Non-directional.
Hypotheses are always stated in terms of population parameters, such as “μ” for mean
and “P” for proportion.
2. The Effects of To determine if there is Ho: μ after = μ before Ha: μ after ≠ μ before
New a significant difference There is no significant There is a significant
Compensation on the job satisfaction difference between difference between the
Package on the level of employees the level of job level of job satisfaction
Job Satisfaction of before and after the satisfaction of of employees in Banko
Banko Metro new compensation employees in Banko Metro before and after
Employees package is introduced. Metro before and the new compensation
after the new package is introduced.
compensation
package is
introduced.
While the study number 2 the hypothesis on means is two-tail non-directional. Ha uses ≠,
because the manager is just interested to know if there is a significant difference on the job
satisfaction level of employees before and after the new compensation package is introduced.
Whether it has increased or decreased will be seen after the test has been done.
On the other hand, the study number 2 the hypothesis on proportion is one-tail left-
directional. Ha uses < because the human resource researcher wants to find out if the number of
dissatisfied customers is significantly lower than 10%.
Once the test has been carried out the decision is always given the Null Hypothesis. We
either “Reject Ho in favor of Ha” or “Do not Reject Ho”; we never say, “reject Ha”, or even
“accept Ha”. If we decide “not to reject”, this does not necessarily mean that the null hypothesis
is true, it only suggests that there is not sufficient evidence against Ho in favor of Ha. Rejecting
the null hypothesis then, suggests that the alternative hypothesis may be true.
The significance level is the probability of rejecting the null hypothesis when it is true. For
example, a significance level of 0.05 indicates a 5% risk of concluding that a difference exists
when there is no actual difference. Lower significance levels indicate that you require stronger
evidence before you will reject the null hypothesis.
Use significance levels during hypothesis testing to help you determine which hypothesis
the data support. Compare your p-value to your significance level. If the p-value is less than your
significance level, you can reject the null hypothesis and conclude that the effect is statistically
significant. In other words, the evidence in your sample is strong enough to be able to reject the
null hypothesis at the population level.
So, it is very clear that the Level of Significance is the probability of committing Type I
Error, this is the basis to Reject or Not to Reject Ho and in short it is the area of the Rejection
Region.
Usually, researchers use either the 0,05 level or sometimes called 5% level, written as α
=0.05 or the 0.01 level or 1% level, written as α =0.01, although the choice of levels is largely
subjective. The lower the significance level, the more the data must diverge from the null
hypothesis to be significant. Therefore, the 0.01 level is more conservative than the 0.05 level.
The next illustrations show most of the concepts that we defined and discussed previously. These
include: Types of hypothesis Test (One-Tail/Two Tail Test), Level of Significance “α“,Rejection
and Acceptance Regions.
Acceptance Rejection
Rejection
Region 95% Region
Region
Area = 0.025 Area = 0.025
2. The P-Value Approach - We compare p-value with the level of significance or alpha, thus
making our simpler. The p-value is often called the observed level of significance for the test.
Again, p-value refers to probability or the expected value that the phenomenon is likely to
occur. If the hypothesis is to be tested at α = 0.05, then the area of the rejection is 0.05. In this
approach, comparison is made between “α” and p-value. We reject Ho if the p-value ≤ α
(0.05), otherwise Ho cannot be rejected. The decision rule is in this process is “Reject Ho if
the p-value is less than or equal to α.
Please watch this video to fully understand this concept
https://www.youtube.com/watch?v=-FtlH4svqx4
3. The 5-Step Solution - These five steps will guide us in the hypothesis testing process.
a. Critical Value Approach
1. Ho:
Ha:
2. α = ; critical value = ; 1-T/2-T
3. Test Statistic Reject Ho if the [computed-value] ≥ [critical-value]
4. Decision: Reject/Do not reject Ho because………
5. Conclusion:
b. p-value vs α
1. Ho:
Ha:
2. α = ; critical value = ; 1-T/2-T
3. Decision rule: Reject Ho if p-value ≤ α
4. Decision: Reject/Do not reject Ho because………
5. Conclusion:
Please watch this video to supplement your understand about critical value approach and
p-value approach part 1: https://youtu.be/4mDemwLHzbo and part 2:
https://youtu.be/JkLCPOZcc4A
Z – test Normal
n ≥ 30 means Z- distribution
t-test Student
n < 30 & n ≥ 30 2 means T- distribution
F-test 2 or more F
(ANOVA) means distribution
Z-test is a statistical hypothesis test that follows a normal distribution, the T- test follows a
Student’s T-distribution while the F-Test or ANOVA follows an F-distribution.
Z-test & T-test are both basically the same. They compare between two means to suggest
whether one or both samples come from the same population. Z-tests are often applied to
enlarged samples n>30 if the distribution is normal. It is used if these conditions are met;
otherwise, other statistical tests like T-test are applied in substitute. Z-test is used to test if there
is a significant difference between one sample mean and population mean and for two sample
means. Please watch this video to fully understand this concept
https://www.youtube.com/watch?v=5ABpqVSx33I
T-test is also referred to as “Student T-test”. It is the most used in Statistical Data Analysis
procedure for hypothesis testing since it is straightforward and easy to use. Additionally, it is
flexible and adaptable to a broad range of circumstances. T-test is best applied, if you have a
limited sample size n < 30 if the variables are approximately normally distributed and the variation
of scores in the two groups is not reliably different. It is however noted that if ‘n'' becomes larger,
the T-distribution becomes close to Z- distribution. Thus, T-test can be used not only if n<30 but
also if n is large or n ≥ 30 and if the populations’ standard deviation (σ) is not known.
The t-distribution is like the Z-distribution. They are both symmetrical about the mean.
Both are bell shaped, but the t-distribution is more variable since t values depend on the
fluctuation of the mean and standard deviation, whereas the Z values depend only on the
fluctuation of the mean from sample to sample. The two distributions differ in standard deviation,
Z has a standard deviation of 1 while t has a standard deviation which is always greater than 1.
The divisor n-1 in the formula for variance and standard deviation is called the degrees of
freedom (df). Degrees of freedom is the number of variables which are free to vary. If the mean
and standard deviation are computed from samples of size n, the values t is said to belong to a t
distribution with df = n-1. Therefore, we have a different t curve for each possible sample size,
such that a curve becomes more and more like the standard normal curve as n becomes larger
or as n approaches infinity.
References:
Frost, J. (n.d.). Significance level - Statistics By Jim. Retrieved October 22, 2020, from
http://statisticsbyjim.com/glossary/significance-level/
significance/