PSY1004 Session 08
PSY1004 Session 08
Session 8
1
Confidence interval (of a population mean)
• Calculating the confidence interval (CI) using the
formula, CI = sample mean ± Z * (s /N), as introduced
in the previous session, requires the population SD (s)
value to be known
• To work out the CI when the population SD is unknown
(as in most research situations), the s can be estimated
as the sample SD (s) with the formula changed to:
Confidence interval (CI) = sample mean ± T * (s/N) ,
N = sample size; and
T is a value such that C% (confidence level) of the area
under the corresponding t distribution function (see
next page) is within ± T around the distribution’s mean.
2
t distribution (source: sphweb.bumc.bu.edu)
• A family of theoretical
probability distributions
mathematically derived
• Each t distribution is
symmetrical about its mean
and extends to infinity and
negative infinity
5
t distribution table
• For this course’s assignments/quizzes, when the df of a distribution in use is not
available from a statistical table (like the table below), use the table row of the
next lower df number, e.g., for a t distribution with 39 degrees of freedom, use
the table row for df = 30 (not 40). Note: this method represents a conservative
approach, and may not give the most accurate value required.
• For other applications, a more accurate value can be obtained from a reliable
software/website, e.g., the Excel formula “= TINV(0.05, 39)” gives 2.023 as the
(two-tailed) critical t value for df = 39.
6
Hypothesis testing (example)
Example (Caldwell):
• The population productivity (unit/day) of all workers (based
on 3 years’ data): mean = 193.80, SD = 31.55
• The productivity of a random sample of 50 workers who
were given flextime arrangement = 202.94
s = 31.55
9
Hypothesis testing (example)
• Assuming that the sampling distribution of the mean is normally
distributed, only 5 times out of 100 will a z-value of more than
+1.96 or less than –1.96 be expected
• Since the sample’s z-value of 2.05 is beyond the range of ±1.96, the
probability of such a z-score occurring is below 5%
• If this probability criterion (< 5%) was preset for rejecting the null
hypothesis, it can be rejected
• Interpretation: the productivity level of flextime workers is
significantly greater than the productivity level of all workers over
the past 3 years. 10
Statistical significance (example)
Population mean ( SD): 193.8 (31.55)
In the example above: Sample N: 50
11
Statistical significance (example)
Population mean ( SD): 193.8 (31.55)
In the example above: Sample N: 50
12