Correlation & Regression
Ex.
(i) The yield of a crop varies with the amount of rainfall.
(ii) Price varies with demand.
(iii) The cost of production varies with the cost of raw materials.
Coefficient of correlation
$r_{xy} = r = \frac{\sum (x - \bar{x})(y - \bar{y})}{n \, \sigma_x \, \sigma_y}$
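Covariance –
$\operatorname{cov}(x, y) = \frac{\sum (x - \bar{x})(y - \bar{y})}{n}$
or $\operatorname{cov}(x, y) = \frac{\sum xy}{n} - \bar{x}\,\bar{y}$
or $\operatorname{cov}(x, y) = \frac{1}{n}\left[\sum x_i y_i - \frac{(\sum x_i)(\sum y_i)}{n}\right]$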
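The three covariance forms above are algebraically equivalent. A minimal Python sketch can confirm this numerically; the function names and the sample data are illustrative assumptions, not part of the original notes.

```python
def cov_definition(xs, ys):
    """cov = sum((x - x_bar)(y - y_bar)) / n"""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    return sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / n


def cov_mean_of_products(xs, ys):
    """cov = sum(xy) / n - x_bar * y_bar"""
    n = len(xs)
    return sum(x * y for x, y in zip(xs, ys)) / n - (sum(xs) / n) * (sum(ys) / n)


def cov_from_sums(xs, ys):
    """cov = (1/n) * [sum(x_i y_i) - sum(x_i) * sum(y_i) / n]"""
    n = len(xs)
    return (sum(x * y for x, y in zip(xs, ys)) - sum(xs) * sum(ys) / n) / n


# Illustrative data (assumed for this sketch only)
xs = [1, 2, 3, 4]
ys = [2, 4, 5, 9]

results = [f(xs, ys) for f in (cov_definition, cov_mean_of_products, cov_from_sums)]
print(results)  # all three values agree (2.75 for this data)
```

The third form is usually the most convenient in exercises, because it needs only the totals of x, y and xy rather than the individual deviations.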
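$r = r(x, y) = \frac{\sum (x - \bar{x})(y - \bar{y})}{n \, \sigma_x \, \sigma_y}$
Where –
$\sigma$ = Standard deviation
$\sigma^2$ = Variance
$\operatorname{Var}(x) = \sigma_x^2 \implies \sigma_x = \sqrt{\operatorname{Var}(x)}$
$\operatorname{Var}(y) = \sigma_y^2 \implies \sigma_y = \sqrt{\operatorname{Var}(y)}$
$r_{xy} = \frac{\operatorname{cov}(x, y)}{\sqrt{\operatorname{Var}(x)} \cdot \sqrt{\operatorname{Var}(y)}}$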
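As a rough illustration of the last formula, the sketch below computes r as the covariance divided by the product of the two standard deviations, all with divisor n as in the notes. The helper name `correlation` and both data sets are assumptions made for this example.

```python
from math import sqrt


def correlation(xs, ys):
    """r = cov(x, y) / (sqrt(var(x)) * sqrt(var(y))), using divisor n throughout."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    cov = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / n
    var_x = sum((x - x_bar) ** 2 for x in xs) / n
    var_y = sum((y - y_bar) ** 2 for y in ys) / n
    return cov / (sqrt(var_x) * sqrt(var_y))


# Illustrative data (assumed): a noisy increasing relationship
r_noisy = correlation([1, 2, 3, 4], [2, 4, 5, 9])

# Illustrative data (assumed): an exact straight line y = 2x + 1
r_exact = correlation([1, 2, 3, 4], [3, 5, 7, 9])

print(r_noisy, r_exact)  # r_noisy lies strictly between 0 and 1; r_exact is 1 up to rounding
```

A perfectly linear increasing relationship gives r = 1, and any scatter around the line pulls r towards 0.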
Characteristics of $r_{xy}$
(i) $-1 \le r_{xy} \le 1$
I. Lines of Regression
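(i) $y$ on $x$ –
$y - \bar{y} = b_{yx}\,(x - \bar{x})$
$y - \bar{y} = \frac{\operatorname{cov}(x, y)}{\sigma_x^2}\,(x - \bar{x})$
$y - \bar{y} = \frac{r\,\sigma_y}{\sigma_x}\,(x - \bar{x})$
Where $b_{yx} = \frac{r\,\sigma_y}{\sigma_x}$
(ii) $x$ on $y$ –
$x - \bar{x} = b_{xy}\,(y - \bar{y})$
$x - \bar{x} = \frac{\operatorname{cov}(x, y)}{\sigma_y^2}\,(y - \bar{y})$
$x - \bar{x} = \frac{r\,\sigma_x}{\sigma_y}\,(y - \bar{y})$
Where $b_{xy} = \frac{r\,\sigma_x}{\sigma_y}$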
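The two regression lines can be sketched in Python as follows. This is only an illustration under assumed data; the function name `regression_lines` and the returned dictionary keys are hypothetical choices for this example.

```python
def regression_lines(xs, ys):
    """Return the means and both regression coefficients (divisor n throughout).

    y on x:  y - y_bar = b_yx * (x - x_bar),  b_yx = cov(x, y) / var(x)
    x on y:  x - x_bar = b_xy * (y - y_bar),  b_xy = cov(x, y) / var(y)
    """
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    cov = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / n
    var_x = sum((x - x_bar) ** 2 for x in xs) / n
    var_y = sum((y - y_bar) ** 2 for y in ys) / n
    return {
        "x_bar": x_bar,
        "y_bar": y_bar,
        "b_yx": cov / var_x,
        "b_xy": cov / var_y,
        "r_squared": cov * cov / (var_x * var_y),
    }


# Illustrative data (assumed for this sketch only)
lines = regression_lines([1, 2, 3, 4], [2, 4, 5, 9])

# Predict y from x = 3 with the line of y on x
y_at_3 = lines["y_bar"] + lines["b_yx"] * (3 - lines["x_bar"])
print(lines["b_yx"], lines["b_xy"], y_at_3)
```

Both lines pass through the point of means, and the product of the two coefficients equals r squared, which follows directly from the two "Where" definitions above.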
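$y$ on $x$:
(i) $b_{yx} = \frac{r\,\sigma_y}{\sigma_x} = \frac{\operatorname{cov}(x, y)}{\sigma_x^2}$
(ii) $b_{yx} = \frac{\sum x_i y_i - \frac{(\sum x_i)(\sum y_i)}{n}}{\sum x_i^2 - \frac{(\sum x_i)^2}{n}}$
(iii) $b_{yx} = \frac{n \sum x_i y_i - (\sum x_i)(\sum y_i)}{n \sum x_i^2 - (\sum x_i)^2}$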
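$x$ on $y$:
(ii) $b_{xy} = \frac{\sum x_i y_i - \frac{(\sum x_i)(\sum y_i)}{n}}{\sum y_i^2 - \frac{(\sum y_i)^2}{n}}$
(iii) $b_{xy} = \frac{n \sum x_i y_i - (\sum x_i)(\sum y_i)}{n \sum y_i^2 - (\sum y_i)^2}$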
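The raw-sum forms (iii) need only five totals and n. The sketch below applies them to illustrative totals; the function names and the numbers are assumptions for this example, with b_xy denoting the coefficient of x on y (denominator built from the y totals).

```python
def b_yx_from_sums(n, sum_x, sum_y, sum_xy, sum_x2):
    """b_yx = (n*sum(xy) - sum(x)*sum(y)) / (n*sum(x^2) - sum(x)^2)"""
    return (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)


def b_xy_from_sums(n, sum_x, sum_y, sum_xy, sum_y2):
    """b_xy = (n*sum(xy) - sum(x)*sum(y)) / (n*sum(y^2) - sum(y)^2)"""
    return (n * sum_xy - sum_x * sum_y) / (n * sum_y2 - sum_y ** 2)


# Totals for the illustrative data x = 1, 2, 3, 4 and y = 2, 4, 5, 9 (assumed)
n, sum_x, sum_y = 4, 10, 20
sum_xy, sum_x2, sum_y2 = 61, 30, 126

b_yx = b_yx_from_sums(n, sum_x, sum_y, sum_xy, sum_x2)  # 44 / 20
b_xy = b_xy_from_sums(n, sum_x, sum_y, sum_xy, sum_y2)  # 44 / 104
print(b_yx, b_xy)
```

The numerator is the same for both coefficients; only the denominator switches between the x totals and the y totals.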
and Coefficient of Regression $x$ on $y$ is $b_{xy} = \frac{r\,\sigma_x}{\sigma_y}$.
Then
(i) If $r = 0$ then $\theta = \frac{\pi}{2}$, i.e. the lines are perpendicular.
Note:
a. $0 < r \le 1$
b. $-1 \le r \le 1$
c. $r \ge 1$
d. $-1 < r < 1$
2. If $\sum x_i = 44$, $\sum y_i = 55$, $\sum x_i y_i = 256$ and $n = 8$, then $\operatorname{cov}(x, y) = {}$?
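One way to work this exercise is to substitute the given totals into the raw-sum form of the covariance, cov(x, y) = (1/n)[Σxᵢyᵢ − (Σxᵢ)(Σyᵢ)/n]. The short Python check below is a sketch of that substitution; the variable names are illustrative.

```python
# Totals given in the exercise
n = 8
sum_x = 44
sum_y = 55
sum_xy = 256

# cov(x, y) = (1/n) * [sum(x_i y_i) - sum(x_i) * sum(y_i) / n]
cov_xy = (sum_xy - sum_x * sum_y / n) / n
print(cov_xy)  # -5.8125
```

Step by step: 44 × 55 / 8 = 302.5, then 256 − 302.5 = −46.5, and −46.5 / 8 = −5.8125.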
X    4    5    6    8   10
Y   12   10    8    7    5
The coefficient of correlation is
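For the X and Y values tabulated above, the coefficient can be checked numerically with the formula r = cov(x, y) / (σx σy). The sketch below uses divisor n throughout, as in the notes; the variable names are illustrative.

```python
from math import sqrt

# Data from the table above
xs = [4, 5, 6, 8, 10]
ys = [12, 10, 8, 7, 5]

n = len(xs)
x_bar = sum(xs) / n  # 6.6
y_bar = sum(ys) / n  # 8.4

cov = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / n
sigma_x = sqrt(sum((x - x_bar) ** 2 for x in xs) / n)
sigma_y = sqrt(sum((y - y_bar) ** 2 for y in ys) / n)

r = cov / (sigma_x * sigma_y)
print(round(r, 3))  # -0.968
```

The value is close to −1, which matches the table: Y falls steadily as X rises.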
Rank in Hindi      1    2    3    4    5    6    7    8    9   10
Rank in English    3   10    5    1    2    9    4    8    7    6
a. Arithmetic mean
b. Harmonic mean
c. Geometric mean
d. none of these
23. The arithmetic mean of the regression coefficients is always …….. the coefficient of correlation:
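a. $\frac{1 - r^2}{r} \cdot \frac{\sigma_x \, \sigma_y}{\sigma_x^2 + \sigma_y^2}$
b. $\frac{r^2 - 1}{r} \cdot \frac{\sigma_x \, \sigma_y}{\sigma_x^2 + \sigma_y^2}$
c. $\frac{r^2 - 1}{r} \cdot \frac{\sigma_x \, \sigma_y}{\sigma_x^2 + \sigma_y^2}$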
d. none of these
26. If $b_{yx} = -\frac{1}{4}$ and $b_{xy} = -\frac{1}{4}$, then the coefficient of correlation will be:
a. $\frac{1}{2}$    b. $\pm\frac{1}{4}$    c. $\frac{1}{4}$    d. $-\frac{1}{4}$
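Since $b_{yx} = r\sigma_y/\sigma_x$ and $b_{xy} = r\sigma_x/\sigma_y$, their product is $r^2$, and because standard deviations are positive, r carries the same sign as the two coefficients. A minimal sketch of that reasoning follows; the helper name is hypothetical.

```python
from math import copysign, sqrt


def r_from_regression_coefficients(b_yx, b_xy):
    """r = ±sqrt(b_yx * b_xy), taking the common sign of the two coefficients."""
    if b_yx * b_xy < 0:
        # Both coefficients share the sign of r, so opposite signs are inconsistent.
        raise ValueError("regression coefficients must have the same sign")
    return copysign(sqrt(b_yx * b_xy), b_yx)


r = r_from_regression_coefficients(-0.25, -0.25)
print(r)  # -0.25
```

Here the product is 1/16, its square root is 1/4, and both coefficients are negative, so r = −1/4.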
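27. The regression line of $x$ on $y$ is
a. $x - \bar{x} = r\,\frac{\sigma_x}{\sigma_y}\,(y - \bar{y})$
b. $y - \bar{y} = r\,\frac{\sigma_y}{\sigma_x}\,(x - \bar{x})$
c. $x - \bar{x} = \frac{\sigma_x}{r\,\sigma_y}\,(y - \bar{y})$
d. none of these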
28. If the regression lines of $x$ on $y$ and $y$ on $x$ are $x = 4y + 5$ and $y = kx + 4$, then
a. $1 \le 4k \le 0$
b. $0 \le 4k \le 1$
c. $0 < 4k < 1$
d. $1 \le 4k < 0$
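29. If $\bar{x}$, $\bar{y}$, $\sum x^2$, $\sum y^2$ and $\sum xy$ and the number of observations $n$ are given, then the coefficient $b_{xy}$ will be:
a. $\frac{\sum xy - n\,\bar{x}\,\bar{y}}{\sum x^2 - n\,\bar{x}^2}$
b. $\frac{n \sum xy - n\,\bar{x}\,\bar{y}}{n \sum x^2 - \bar{x}^2}$
c. $\frac{n \sum xy - \bar{x}\,\bar{y}}{n\left(\sum x^2 - \bar{x}^2\right)}$
d. $\frac{\sum xy - n\,\bar{x}\,\bar{y}}{\sum x^2 - n\,\bar{x}^2}$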
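30. If $\sum x$, $\sum y$, $\sum x^2$, $\sum y^2$ and $\sum xy$ and the number of observations $n$ are given, then the coefficient $b_{yx}$ will be
a. $\frac{n \sum xy - \sum x \cdot \sum y}{n \sum x^2 - (\sum x)^2}$
b. $\frac{n \sum xy - \sum x \cdot \sum y}{n \sum y^2 - (\sum y)^2}$
c. $\frac{n \sum xy - \sum x \cdot \sum y}{n \sum x^2 - (\sum x)^2}$
d. none of these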
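31. If $\operatorname{cov}(x, y)$, $\sigma_x^2$ and $\sigma_y^2$ are given, then the coefficient of regression $b_{xy}$ will be
a. $r\,\frac{\sigma_y}{\sigma_x}$    b. $r\,\frac{\sigma_x}{\sigma_y}$    c. $\frac{\sigma_y}{r\,\sigma_x}$    d. $\frac{\sigma_x}{r\,\sigma_y}$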
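34. The value of the coefficient of regression $b_{xy}$ is:
a. $r\,\frac{\sigma_y}{\sigma_x}$    b. $r\,\frac{\sigma_x}{\sigma_y}$    c. $\frac{\sigma_y}{r\,\sigma_x}$    d. $\frac{\sigma_x}{r\,\sigma_y}$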