Econometrics For MGT ppt-2

Chapter Two discusses regression analysis, focusing on the differences between sample and population regression functions, the nature of error terms, and the assumptions underlying linear regression models. It explains the estimation of parameters using methods like Ordinary Least Squares and introduces concepts such as covariance, correlation coefficients, and the coefficient of determination (R²). Additionally, it covers hypothesis testing related to regression coefficients and the significance of explanatory variables.
Chapter Two: Regression analysis
What is the difference between the sample regression function and the population regression function?
Regression analysis is used to model the relationship between a response variable and one or more predictor variables, and estimation is a key element of it. The population regression function (PRF) is a description of the model that is thought to be generating the actual data; it represents the true relationship between the variables and is also known as the data-generating process. The sample regression function (SRF) is the relation that has been estimated using the sample observations.
• The population regression function is a hypothetical conjecture about the form of the relationship between the response variable and the set of explanatory variables.
• The parameters in the model are the regression coefficients, which represent the weights given to each predictor in the linear combination of them.
• The sample regression equation contains estimated numerical values of those coefficients, chosen to best fit the data from a particular sample (best fit = minimizes the sum of squared residuals).
• The concise version: the population model specifies the form of the model; the sample regression equation comes from fitting that form to observed data.
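• In standard notation (added here for clarity), the two functions for a simple linear model can be written as:
PRF: Yi = β0 + β1Xi + Ui (unknown population parameters and the error term)
SRF: Ŷi = b0 + b1Xi (where b0 and b1 are the sample estimates of β0 and β1, and Ŷi are the fitted values)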
• Before you begin with regression analysis, you need to identify the population regression function (PRF).
• The PRF defines reality (or your perception of it) as it relates to your topic of interest.
• To identify it, you need to determine your dependent and independent variables (and how they will be measured) as well as the mathematical function describing how the variables are related.
Simple Linear Regression
Concept of Regression Analysis
Regression analysis is the process of estimating the relationship between two or more variables. In any regression there are dependent variables and explanatory (independent) variables; hence, regression is used to study the dependence of one variable (the dependent variable) on one or more explanatory (independent) variables, i.e., how the average value of the dependent variable (regressand) varies with the values of the explanatory variables (regressors). In regression analysis the primary objectives are to estimate the parameters of the population based on empirical data and to predict the average value of the dependent variable on the basis of the values of the explanatory variables.
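• In standard notation (added for clarity), this conditional average can be written E(Y | Xi) = β0 + β1Xi for the simple two-variable case.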
The nature of the error term
• An error term represents the margin of error within a statistical model; it refers to the sum of the deviations within the regression line, which provides an explanation for the difference between the theoretical value of the model and the actual observed results.
• An error term is a residual variable produced by a statistical or mathematical model, created when the model does not fully represent the actual relationship between the independent variables and the dependent variable.
• As a result of this incomplete relationship, the error term is the amount by which the equation may differ during empirical analysis.
• The error term is also known as the residual, disturbance, or remainder term, and is variously represented in models by the letters e, ε, or u.
Assumptions of the Classical Simple Linear Regression Model
1. The model is linear in parameters.
• The classical theory assumes that the model should be linear in the parameters, regardless of whether the explanatory and dependent variables themselves enter linearly or not.
• This is because if the parameters enter non-linearly it is difficult to estimate them, since their values are not known and you are only given data on the dependent and independent variables.
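• For example (an illustration added here): Y = β0 + β1 ln(X) + U is linear in the parameters even though it is non-linear in X, whereas Y = β0 + X^β1 + U is not linear in the parameters and cannot be estimated directly by the classical methods below.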
2. Ui is a random real variable.
• This means that the value which u may assume in any one period depends on chance; it may be positive, negative, or zero. Every value has a certain probability of being assumed by u in any particular instance.
3. The mean value of the random variable (U) in any particular period is zero.
• This means that for each value of X, the random variable (u) may assume various values, some greater than zero and some smaller than zero, but if we considered all the possible positive and negative values of u for any given value of X, they would have an average value equal to zero.
• In other words, the positive and negative values of u cancel each other out. Mathematically,
E(Ui) = 0 .............................. (2.3)
4. The variance of the random variable (U) is constant in each period (the assumption of homoscedasticity).
• For all values of X, the u's will show the same dispersion around their mean.
• In Fig. 2.c this assumption is denoted by the fact that the values that u can assume lie within the same limits, irrespective of the value of X.
• For X = X1, u can assume any value within the range AB; for X = X2, u can assume any value within the range CD, which is equal to AB, and so on.
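• In symbols (standard notation, added for clarity): Var(Ui) = E(Ui²) = σ² for all i.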
7. The X values are a set of fixed values in the hypothetical process of repeated sampling which underlies the linear regression model.
• This means that, in taking a large number of samples on Y and X, the X values are the same in all samples, but the u values do differ from sample to sample, and so of course do the values of Y.
8. The random variable (U) is independent of the explanatory variables.
• This means there is no correlation between the random variable and the explanatory variable.
• If two variables are unrelated, their covariance is zero.
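• In symbols (standard notation, added for clarity): Cov(Xi, Ui) = E(Xi Ui) = 0.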
9. The explanatory variables are measured without error.
• U absorbs the influence of omitted variables and possibly errors of measurement in the y's, i.e., we will assume that the regressors are error-free, while the y values may or may not include errors of measurement.
The Multiple Linear Regression Model
• In the multiple linear regression model, a dependent variable Y can depend on a whole series of explanatory variables or regressors.
• For instance, in demand studies we study the relationship between the quantity demanded of a good and the price of the good, the price of substitute goods, and the consumer's income.
• The model we assume is of the general form:
Y = β0 + β1X1 + β2X2 + … + βkXk + U
Assumptions of Multiple Regression Model
Parameter Estimation: Least Squares
Methods of estimation
• Specifying the model and stating its underlying assumptions are
the first stage of any econometric application.
• The next step is the estimation of the numerical values of the parameters of economic relationships.
• The parameters of the simple linear regression model can be estimated by various methods. Three of the most commonly used methods are:
– Ordinary least squares method (OLS)
– Maximum likelihood method (MLM)
– Method of moments (MM)
Ordinary Least Squares (OLS)
Assumptions of Classical Simple Regression Model (CLRM)
Deriving OLS Estimators
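For the simple model, the OLS estimators that result from this derivation are b1 = Σ(Xi − x̅)(Yi − ȳ) / Σ(Xi − x̅)² and b0 = ȳ − b1x̅. A minimal Python sketch of these formulas (data values are made up for illustration):

# Minimal OLS sketch for the simple model Y = b0 + b1*X + u (illustrative data).
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Slope: b1 = sum((xi - x_bar) * (yi - y_bar)) / sum((xi - x_bar) ** 2)
b1 = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / sum((xi - x_bar) ** 2 for xi in x)

# Intercept: b0 = y_bar - b1 * x_bar
b0 = y_bar - b1 * x_bar

print(f"b0 = {b0:.4f}, b1 = {b1:.4f}")

Applied to real data, b0 and b1 give the intercept and slope of the fitted sample regression line.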
Covariance
What is covariance? In mathematics and statistics, covariance is a measure of the relationship between two random variables. The metric evaluates how much, and to what extent, the variables change together. In other words, it is essentially a measure of the variance between two variables. The covariance can take any positive or negative value. The values are interpreted as follows:
• Positive covariance: indicates that two variables tend to move in the same direction.
• Negative covariance: reveals that two variables tend to move in inverse directions.
• Covariance measures the total variation of two random variables from their expected values.
• Using covariance, we can only gauge the direction of the relationship (whether the variables tend to move in tandem or show an inverse relationship).
• However, it does not indicate the strength of the relationship, nor the dependency between the variables.
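A minimal Python sketch of the sample covariance (made-up values; added here for illustration):

# Sample covariance: cov(x, y) = sum((xi - x_bar) * (yi - y_bar)) / (n - 1).
x = [2.0, 4.0, 6.0, 8.0]
y = [1.0, 3.0, 7.0, 9.0]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n
cov_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / (n - 1)
print(f"cov(x, y) = {cov_xy:.4f}")  # positive here: x and y move together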
Correlation coefficient
• The degree of association is measured by a correlation coefficient, denoted by r.
• It is sometimes called Pearson's correlation coefficient after its originator and is a measure of linear association.
• If a curved line is needed to express the relationship, other and more complicated measures of correlation must be used.
• Correlation measures the strength of the relationship between variables.
• Correlation is the scaled measure of covariance.
• It is dimensionless. In other words, the correlation coefficient is always a pure value and not measured in any units.
• The correlation coefficient is measured on a scale that varies from +1 through 0 to −1.
• Complete correlation between two variables is expressed by either +1 or −1.
• When one variable increases as the other increases, the correlation is positive; when one decreases as the other increases, it is negative.
• Complete absence of correlation is represented by 0.
• The relationship between the two concepts can be expressed using the formula below:
rxy = Σ(xi − x̅)(yi − ȳ) / √[ Σ(xi − x̅)² · Σ(yi − ȳ)² ]
• Where:
• rxy – the correlation coefficient of the linear relationship between the variables x and y
• xi – the values of the x-variable in a sample
• x̅ – the mean of the values of the x-variable
• yi – the values of the y-variable in a sample
• ȳ – the mean of the values of the y-variable
In order to calculate the correlation coefficient using the formula above, you must undertake the following steps (a worked sketch follows the list):
1. Obtain a data sample with the values of the x-variable and y-variable.
2. Calculate the means (averages) x̅ for the x-variable and ȳ for the y-variable.
3. For the x-variable, subtract the mean from each value of the x-variable (let's call this new variable "a"). Do the same for the y-variable (let's call this variable "b").
4. Multiply each a-value by the corresponding b-value and find the sum of these multiplications (the final value is the numerator in the formula).
5. Square each a-value and calculate the sum of the results; do the same for the b-values; then multiply the two sums.
6. Find the square root of the value obtained in step 5 (this is the denominator in the formula).
7. Divide the value obtained in step 4 by the value obtained in step 6.
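A minimal Python sketch of these seven steps (made-up sample values; added here for illustration):

import math

# Step 1: sample data (illustrative values).
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 6.0]

# Step 2: means.
n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Step 3: deviations from the means ("a" and "b").
a = [xi - x_bar for xi in x]
b = [yi - y_bar for yi in y]

# Step 4: numerator = sum of a*b.
numerator = sum(ai * bi for ai, bi in zip(a, b))

# Steps 5-6: denominator = sqrt(sum(a^2) * sum(b^2)).
denominator = math.sqrt(sum(ai ** 2 for ai in a) * sum(bi ** 2 for bi in b))

# Step 7: correlation coefficient.
r_xy = numerator / denominator
print(f"r_xy = {r_xy:.4f}")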
Coefficient of determination (R²)
What is the coefficient of determination?
• The coefficient of determination (R²) measures how well a statistical model predicts an outcome.
• The outcome is represented by the model's dependent variable.
• The lowest possible value of R² is 0 and the highest possible value is 1.
• Put simply, the better a model is at making predictions, the closer its R² will be to 1.
• Example: imagine that you perform a simple linear regression that predicts students' exam scores (dependent variable) from their time spent studying (independent variable).
• If the R² is 0, the linear regression model doesn't allow you to predict exam scores any better than simply estimating that everyone has an average exam score.
• If the R² is between 0 and 1, the model allows you to partially predict exam scores. The model's estimates are not perfect, but they're better than simply using the average exam score.
• If the R² is 1, the model allows you to perfectly predict anyone's exam score.
• More technically, R² is a measure of goodness of fit.
• It is the proportion of variance in the dependent variable that is explained by the model.
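• In symbols (standard definition, added for clarity): R² = 1 − Σ(yi − ŷi)² / Σ(yi − ȳ)², where ŷi are the model's fitted values.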
• You can also say that the R² is the proportion of variance "explained" or "accounted for" by the model. The proportion that remains (1 − R²) is the variance that is not predicted by the model.
• If you prefer, you can write the R² as a percentage instead of a proportion. Simply multiply the proportion by 100.
• Example: interpreting R². A simple linear regression that predicts students' exam scores (dependent variable) from their study time (independent variable) has an R² of .71. From this R² value, we know that:
• 71% of the variance in students' exam scores is predicted by their study time
• 29% of the variance in students' exam scores is unexplained by the model
• The students' study time has a large effect on their exam scores
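A minimal sketch of this computation (observed and fitted values are made up for illustration):

# R^2 = 1 - SS_residual / SS_total, computed from observed y and fitted y_hat.
y = [55.0, 60.0, 68.0, 74.0, 83.0]       # observed exam scores (illustrative)
y_hat = [57.0, 61.0, 66.0, 75.0, 81.0]   # fitted values from some regression (illustrative)

y_bar = sum(y) / len(y)
ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, y_hat))   # unexplained variation
ss_tot = sum((yi - y_bar) ** 2 for yi in y)                # total variation
r_squared = 1 - ss_res / ss_tot
print(f"R^2 = {r_squared:.3f}")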
• Studying longer may or may not cause an improvement in the students' scores.
• Although this causal relationship is very plausible, the R² alone can't tell us why there's a relationship between students' study time and exam scores.
• For example, students might find studying less frustrating when they understand the course material well, so they study longer.
Hypothesis testing
• Computing p-values for t tests. So far, we have talked about how to test hypotheses using a classical approach: after stating the alternative hypothesis, we choose a significance level, which then determines a critical value.
• Once the critical value has been identified, the value of the t statistic is compared with the critical value, and the null is either rejected or not rejected at the given significance level.
• Even after deciding on the appropriate alternative, there is a component of arbitrariness to the classical approach, which results from having to choose a significance level ahead of time.
• Different researchers prefer different significance levels, depending on the particular application.
• There is no "correct" significance level.
• Committing to a significance level ahead of time can hide useful information about the outcome of a hypothesis test.
• For example, suppose that we wish to test the null hypothesis that a parameter is zero against a two-sided alternative, and with 40 degrees of freedom we obtain a t statistic equal to 1.85.
• The null hypothesis is not rejected at the 5% level, since the t statistic is less than the two-tailed critical value.
• A researcher whose agenda is not to reject the null could simply report this outcome along with the estimate: the null hypothesis is not rejected at the 5% level.
• Given the observed value of the t statistic, what is the smallest significance level at which the null hypothesis would be rejected? This level is known as the p-value for the test.
• In the example above, we know the p-value is greater than .05, since the null is not rejected at the 5% level, and we know that the p-value is less than .10, since the null is rejected at the 10% level.
• We obtain the actual p-value by computing the probability that a t random variable, with 40 df, is larger than 1.85 in absolute value.
• That is, the p-value is the significance level of the test when we use the value of the test statistic, 1.85 in the above example, as the critical value for the test.
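A quick sketch of this p-value computation in Python (assuming the scipy library is available):

from scipy import stats

# Two-sided p-value for t = 1.85 with 40 degrees of freedom:
# P(|T| > 1.85) = 2 * P(T > 1.85).
t_stat = 1.85
df = 40
p_value = 2 * stats.t.sf(abs(t_stat), df)
print(f"p-value = {p_value:.4f}")  # roughly 0.07, between .05 and .10 as stated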
• Significance tests of the individual regression coefficients.
• If the multiple regression model conforms with the underlying economic theory, one would expect the exogenous variables xj to influence the endogenous variable y in particular directions.
• In the econometric model, the estimated regression coefficients should therefore display the theoretically expected signs.
• In addition, it needs to be examined whether the influencing factors do in fact matter for the explanation of the endogenous variable, because if a regression coefficient has the expected sign but only randomly deviates from 0, the explanatory variable has no systematic influence on the endogenous variable.
• Whether an independent variable exhibits a systematic influence on the dependent variable can be checked with a significance test:
• If the null hypothesis H0 : βj = 0 is contrasted with the alternative hypothesis H1 : βj ≠ 0, one speaks of a two-sided significance test.
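In practice this test uses the t ratio t = bj / se(bj); a minimal sketch (coefficient, standard error, and degrees of freedom are made-up values):

from scipy import stats

# t ratio for H0: beta_j = 0 versus H1: beta_j != 0.
beta_j_hat = 0.42   # estimated coefficient (illustrative value)
se_beta_j = 0.15    # its estimated standard error (illustrative value)
df = 40             # residual degrees of freedom (assumed)

t_stat = beta_j_hat / se_beta_j
p_value = 2 * stats.t.sf(abs(t_stat), df)
print(f"t = {t_stat:.2f}, p-value = {p_value:.4f}")
# Reject H0 at the 5% level if p_value < 0.05.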