Polynomial Regression Models: Possible Models For When The Response Function Is "Curved"
Example
What is the impact of exercise on the human immune system?
Is the amount of immunoglobin in blood (y) related to maximal oxygen uptake (x) in a curved manner?
[Figure: scatter plot of immunoglobin (mg) against maximal oxygen uptake]
$$Y_i = \beta_0 + \beta_1 X_i + \beta_{11} X_i^2 + \varepsilon_i$$
where:
$Y_i$ = amount of immunoglobin in blood (mg)
$X_i$ = maximal oxygen uptake (ml/kg)
with the typical assumptions about the error terms $\varepsilon_i$: independent, normally distributed, equal variance (INE)
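A minimal sketch of fitting this second-order model by least squares, assuming NumPy is available. The data are synthetic and noise-free, and the coefficient values are hypothetical (they are not the immunoglobin data); the point is only the design matrix with columns 1, X, and X squared.

```python
import numpy as np

def fit_quadratic(x, y):
    """Least-squares estimates (b0, b1, b11) for Y = b0 + b1*X + b11*X^2."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    # Design matrix: intercept column, X, and X squared.
    design = np.column_stack([np.ones_like(x), x, x ** 2])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef

# Hypothetical, noise-free data generated from a known curve.
x_demo = np.linspace(30.0, 70.0, 9)
y_demo = 100.0 + 5.0 * x_demo - 0.04 * x_demo ** 2
b0, b1, b11 = fit_quadratic(x_demo, y_demo)
```

Because the demo data contain no noise, the fit should return the generating coefficients up to floating-point error.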
[Figure: regression plot of igg versus oxygen with the fitted quadratic curve; R-Sq = 93.8 %, R-Sq(adj) = 93.3 %]
Predictor   SE Coef        T       P    VIF
Constant      411.4    -3.56   0.001
oxygen        16.47     5.36   0.000   99.9
oxygensq     0.1582    -3.39   0.002   99.9

R-Sq = 93.8%   R-Sq(adj) = 93.3%

Analysis of Variance
Source           DF        SS        MS        F       P
Regression        2   4602211   2301105   203.16   0.000
Residual Error   27    305818     11327
Total            29   4908029

Source      DF    Seq SS
oxygen       1   4472047
oxygensq     1    130164
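The summary statistics in this output follow directly from the Analysis of Variance sums of squares. A short arithmetic sketch (variable names are illustrative) that recomputes them from the table values:

```python
import math

# Values copied from the Analysis of Variance table.
ss_regression, df_regression = 4602211, 2
ss_error, df_error = 305818, 27
ss_total, df_total = 4908029, 29

ms_regression = ss_regression / df_regression
ms_error = ss_error / df_error

r_sq = ss_regression / ss_total                  # SSR / SSTO
r_sq_adj = 1 - ms_error / (ss_total / df_total)  # 1 - MSE / (SSTO / (n - 1))
f_stat = ms_regression / ms_error                # overall F = MSR / MSE
s = math.sqrt(ms_error)                          # residual standard deviation

print(f"R-Sq = {100 * r_sq:.1f}%, R-Sq(adj) = {100 * r_sq_adj:.1f}%")
print(f"F = {f_stat:.2f}, S = {s:.1f}")
```

The recomputed values agree with the reported R-Sq, R-Sq(adj), and F.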
A multicollinearity problem
[Figure: scatter plot of oxygensq versus oxygen]
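With only two predictors in the model, each VIF equals 1/(1 - r^2), where r is the correlation between the two predictors. A small sketch (the function names are illustrative, not from any library) that inverts this relation for the reported VIF of 99.9:

```python
import math

def vif_two_predictors(r):
    """VIF when the model has exactly two predictors with correlation r."""
    return 1.0 / (1.0 - r ** 2)

def implied_correlation(vif):
    """Absolute correlation implied by a two-predictor VIF."""
    return math.sqrt(1.0 - 1.0 / vif)

r_raw = implied_correlation(99.9)  # VIF reported for oxygen and oxygensq
print(f"implied |r| = {r_raw:.3f}")
```

The implied correlation between oxygen and oxygensq is about 0.995, which is the multicollinearity problem in numbers.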
oxygen    oxcent   oxcentsq
  34.6   -16.037    257.185
  45.0    -5.637     31.776
  62.3    11.663    136.026
  58.9     8.263     68.277
  42.5    -8.137     66.211
  44.3    -6.337     40.158
  67.9    17.263    298.011
  58.5     7.863     61.827
  35.6   -15.037    226.111
  49.6    -1.037      1.075
  33.0   -17.637    311.064
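A sketch of why centering helps, using only the eleven oxygen values listed in the table (not the full data set). The mean 50.637 is inferred from the table itself (34.6 - (-16.037)); the helper `pearson` is written out here for illustration.

```python
import math

def pearson(u, v):
    """Sample Pearson correlation between two equal-length sequences."""
    mean_u = sum(u) / len(u)
    mean_v = sum(v) / len(v)
    s_uv = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    s_uu = sum((a - mean_u) ** 2 for a in u)
    s_vv = sum((b - mean_v) ** 2 for b in v)
    return s_uv / math.sqrt(s_uu * s_vv)

oxygen = [34.6, 45.0, 62.3, 58.9, 42.5, 44.3, 67.9, 58.5, 35.6, 49.6, 33.0]
X_BAR = 50.637  # inferred from the table: 34.6 - (-16.037)

oxygensq = [x ** 2 for x in oxygen]
oxcent = [x - X_BAR for x in oxygen]
oxcentsq = [x ** 2 for x in oxcent]

r_raw = pearson(oxygen, oxygensq)        # predictor vs. its square
r_centered = pearson(oxcent, oxcentsq)   # same, after centering
print(f"raw r = {r_raw:.3f}, centered r = {r_centered:.3f}")
```

On these listed values the raw correlation is above 0.99, while the centered one is much closer to zero.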
[Figure: scatter plot of oxcentsq versus oxcent]
$$Y_i = \beta_0^* + \beta_1^* x_i + \beta_{11}^* x_i^2 + \varepsilon_i$$
where
$$x_i = X_i - \bar{X}$$
Predictor      Coef   SE Coef       T       P   VIF
Constant    1632.20     29.35   55.61   0.000
oxcent       34.000     1.689   20.13   0.000   1.1
oxcentsq    -0.5362    0.1582   -3.39   0.002   1.1

S = 106.4   R-Sq = 93.8%   R-Sq(adj) = 93.3%

Analysis of Variance
Source           DF        SS        MS        F       P
Regression        2   4602211   2301105   203.16   0.000
Residual Error   27    305818     11327
Total            29   4908029

Source      DF    Seq SS
oxcent       1   4472047
oxcentsq     1    130164
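The sequential sum of squares for the squared term gives the partial F statistic for adding it to a model that already contains the linear term, and that F equals the square of the squared term's t statistic. A quick arithmetic check with the table values (variable names are illustrative):

```python
import math

seq_ss_squared_term = 130164   # Seq SS for oxcentsq (1 DF)
ms_error = 305818 / 27         # Residual Error SS / DF

partial_f = seq_ss_squared_term / ms_error
t_from_f = -math.sqrt(partial_f)  # sign taken from the negative coefficient

print(f"partial F = {partial_f:.2f}, t = {t_from_f:.2f}")
```

The resulting t agrees with the reported T = -3.39 for oxcentsq.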
[Figure: regression plot of igg versus oxcent with the fitted quadratic curve; R-Sq = 93.8 %, R-Sq(adj) = 93.3 %]
Similar estimates
[Figure: regression plot of igg versus oxcent, first-order fit]
igg = 1557.63 + 32.7427 oxcent
S = 124.783   R-Sq = 91.1 %   R-Sq(adj) = 90.8 %
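Original model:
$$\hat{Y}_i = b_0 + b_1 X_i + b_{11} X_i^2$$
Centered model:
$$\hat{Y}_i = b_0^* + b_1^* x_i + b_{11}^* x_i^2$$
Where:
$$b_0 = b_0^* - b_1^* \bar{X} + b_{11}^* \bar{X}^2, \qquad b_1 = b_1^* - 2 b_{11}^* \bar{X}, \qquad b_{11} = b_{11}^*$$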
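Substituting x = X - X-bar into the centered fit and collecting powers of X converts the centered estimates back to the original scale. A minimal sketch (the function name is illustrative), applied to the centered estimates reported earlier and the mean 50.637 inferred from the oxygen/oxcent table:

```python
def uncenter_quadratic(b0_star, b1_star, b11_star, x_bar):
    """Convert centered-model estimates to the original X scale."""
    b0 = b0_star - b1_star * x_bar + b11_star * x_bar ** 2
    b1 = b1_star - 2.0 * b11_star * x_bar
    b11 = b11_star
    return b0, b1, b11

# Centered estimates from the regression output; 50.637 is the mean
# inferred from the oxygen/oxcent table (34.6 - (-16.037)).
b0, b1, b11 = uncenter_quadratic(1632.20, 34.000, -0.5362, 50.637)
print(f"b0 = {b0:.1f}, b1 = {b1:.2f}, b11 = {b11:.4f}")
```

The results, roughly b0 = -1464.3 and b1 = 88.30, agree up to rounding with T times SE Coef in the uncentered output (-3.56 x 411.4 and 5.36 x 16.47).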
[Figures: residuals versus fitted values; normal probability plot of the residuals (normal score versus residual)]
oxcent = 39.4, oxcentsq = 1549
R-Sq = 64.0 %, R-Sq(adj) = 0.0 %
$$Y_i = \beta_0 + \beta_1 x_i + \beta_{11} x_i^2 + \beta_{111} x_i^3 + \varepsilon_i$$
$$H_0: \beta_{11} = \beta_{111} = 0$$
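A hypothesis such as H0: beta_11 = beta_111 = 0 can be tested with a general linear F test that compares the full cubic fit with the reduced first-order fit. A minimal sketch assuming NumPy; the data are synthetic and hypothetical (not the course data), and the function names are illustrative.

```python
import numpy as np

def sse(design, y):
    """Error sum of squares from a least-squares fit of y on design."""
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    resid = y - design @ coef
    return float(resid @ resid)

def f_drop_higher_order(x, y):
    """F statistic for H0: beta_11 = beta_111 = 0 in the cubic model."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    ones = np.ones_like(x)
    full = np.column_stack([ones, x, x ** 2, x ** 3])  # cubic model
    reduced = np.column_stack([ones, x])               # first-order model
    sse_full, sse_reduced = sse(full, y), sse(reduced, y)
    df_full = len(y) - full.shape[1]
    extra_df = full.shape[1] - reduced.shape[1]
    return ((sse_reduced - sse_full) / extra_df) / (sse_full / df_full)

# Synthetic data with real curvature, generated from a seeded generator.
rng = np.random.default_rng(0)
x_demo = np.linspace(-18.0, 18.0, 30)
y_demo = 1600.0 + 30.0 * x_demo - 0.5 * x_demo ** 2 + rng.normal(0.0, 20.0, 30)
f_stat = f_drop_higher_order(x_demo, y_demo)
```

The statistic is compared with an F distribution on (2, n - 4) degrees of freedom; a large value argues against dropping both higher-order terms.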
E Y x x
$$E\{Y_i\} = \beta_0 + \beta_{11} x_i^2$$
Example
Quality of a product (y): a score between 0 and 100
Temperature (x1): degrees Fahrenheit
Pressure (x2): pounds per square inch
[Figure: matrix plot of quality, temp, and pressure]
A two-predictor, second-order polynomial regression function
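$$Y_i = \beta_0 + \beta_1 X_{i1} + \beta_2 X_{i2} + \beta_{11} X_{i1}^2 + \beta_{22} X_{i2}^2 + \beta_{12} X_{i1} X_{i2} + \varepsilon_i$$
where:
$Y_i$ = quality
$X_{i1}$ = temperature
$X_{i2}$ = pressure
$\beta_{12}$ = interaction effect coefficient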
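Each observation contributes one row of six terms to the design matrix for this model. A small sketch (the function names are illustrative) that builds that row and evaluates the mean response for a given coefficient vector:

```python
def second_order_row(x1, x2):
    """Design-matrix row [1, X1, X2, X1^2, X2^2, X1*X2] for one observation."""
    return [1.0, x1, x2, x1 ** 2, x2 ** 2, x1 * x2]

def mean_response(beta, x1, x2):
    """Evaluate b0 + b1*X1 + b2*X2 + b11*X1^2 + b22*X2^2 + b12*X1*X2.

    beta must be ordered (b0, b1, b2, b11, b22, b12), matching the row above.
    """
    return sum(b * term for b, term in zip(beta, second_order_row(x1, x2)))

# Toy inputs chosen only to show the term order.
row = second_order_row(2.0, 3.0)
value = mean_response([1, 1, 1, 1, 1, 1], 2.0, 3.0)
```

Keeping the coefficient order identical to the column order is the only design decision here; the same row works for centered predictors.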
Predictor           SE Coef        T       P      VIF
Constant              110.3   -46.49   0.000
temp                  1.344    23.13   0.000   1154.5
pressure              3.140    44.50   0.000   1574.5
tempsq             0.006853   -19.46   0.000    973.0
presssq             0.02741   -41.74   0.000   1453.0
temp x pressure    0.009692   -15.01   0.000    304.0

R-Sq = 99.3%   R-Sq(adj) = 99.1%
Correlations
                    temp   pressure   tempsq   presssq
pressure           0.000
tempsq             0.999      0.000
presssq            0.000      1.000   -0.000
temp x pressure    0.773      0.632    0.772     0.632
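$$Y_i = \beta_0^* + \beta_1^* x_{i1} + \beta_2^* x_{i2} + \beta_{11}^* x_{i1}^2 + \beta_{22}^* x_{i2}^2 + \beta_{12}^* x_{i1} x_{i2} + \varepsilon_i$$
where:
$Y_i$ = quality
$x_{i1}$ = centered temperature
$x_{i2}$ = centered pressure
$\beta_{12}^*$ = interaction effect coefficient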
Reduced correlation
           quality    tcent    pcent   tpcent   tcentsq
tcent       -0.423
pcent        0.182    0.000
tpcent      -0.274    0.000    0.000
tcentsq     -0.355   -0.000    0.000    0.000
pcentsq     -0.762    0.000    0.000    0.000    -0.000
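Correlations of exactly zero after centering are what a balanced, symmetric design produces. A sketch on a hypothetical 3 x 3 grid of temperature and pressure settings (the grid values are an assumption; the design points are not listed here), with an illustrative `pearson` helper:

```python
import math
from itertools import product

def pearson(u, v):
    """Sample Pearson correlation between two equal-length sequences."""
    mean_u = sum(u) / len(u)
    mean_v = sum(v) / len(v)
    s_uv = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    s_uu = sum((a - mean_u) ** 2 for a in u)
    s_vv = sum((b - mean_v) ** 2 for b in v)
    return s_uv / math.sqrt(s_uu * s_vv)

# Hypothetical balanced design: every temperature paired with every pressure.
grid = list(product([80.0, 90.0, 100.0], [50.0, 55.0, 60.0]))
temp = [t for t, _ in grid]
pressure = [p for _, p in grid]

t_bar = sum(temp) / len(temp)
p_bar = sum(pressure) / len(pressure)
tcent = [t - t_bar for t in temp]
pcent = [p - p_bar for p in pressure]

r_raw = pearson(temp, [t ** 2 for t in temp])          # temp vs. temp squared
r_centered = pearson(tcent, [t ** 2 for t in tcent])   # after centering
r_interaction = pearson(tcent, [t * p for t, p in zip(tcent, pcent)])
```

On this grid the raw predictor is almost perfectly correlated with its square, while the centered predictor is uncorrelated with both its square and the interaction term.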
Predictor    SE Coef        T       P   VIF
Constant      0.7224   131.40   0.000
tcent        0.03957   -23.15   0.000   1.0
pcent        0.07913     9.95   0.000   1.0
tpcent      0.009692   -15.01   0.000   1.0
tcentsq     0.006853   -19.46   0.000   1.0
pcentsq      0.02741   -41.74   0.000   1.0

R-Sq = 99.3%   R-Sq(adj) = 99.1%
[Figures: residuals versus fitted values; normal probability plot of the residuals (normal score versus residual)]
SE Fit   95.0% CI            95.0% PI
0.722    (93.424, 96.428)    (91.125, 98.726)

Values of predictors for the new observation:
 tcent    pcent   tpcent   tcentsq   pcentsq
0.0000   0.0000   0.0000    0.0000    0.0000
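Because every centered predictor is 0 here, the fitted value is just the estimated intercept of the centered model. A sketch that backs out the fitted value and the t multiplier from the reported intervals, assuming both intervals are symmetric about the same fitted value (variable names are illustrative):

```python
ci_low, ci_high = 93.424, 96.428   # 95.0% CI
pi_low, pi_high = 91.125, 98.726   # 95.0% PI
se_fit = 0.722                     # SE Fit

# Both intervals should be centered on the fitted value.
fit_from_ci = (ci_low + ci_high) / 2
fit_from_pi = (pi_low + pi_high) / 2

# CI half-width divided by SE Fit gives the t multiplier that was used.
t_mult = (ci_high - ci_low) / 2 / se_fit

# PI half-width = t * sqrt(MSE + SE_fit^2), so sqrt(MSE) can be backed out.
se_pred = (pi_high - pi_low) / 2 / t_mult
s_implied = (se_pred ** 2 - se_fit ** 2) ** 0.5

print(f"fit = {fit_from_ci:.3f}, t multiplier = {t_mult:.2f}")
```

The fitted value of about 94.93 agrees with T times SE Coef for the constant term (131.40 x 0.7224), and the prediction interval is wider because it also carries the error variance of a single new observation.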