Introduction to Linear
Regression and
Correlation Analysis
Scatter Diagrams
A scatter plot is a graph used to represent the relationship between two variables. It is also referred to as a scatter diagram.
Dependent and Independent
Variables
A dependent variable is the variable to be
predicted or explained in a regression
model. This variable is assumed to be
functionally related to the independent
variable.
Dependent and Independent
Variables
An independent variable is the variable
related to the dependent variable in a
regression equation. The independent
variable is used in a regression model to
estimate the value of the dependent
variable.
Best Fit
The best-fit line represents our model. It is the line that “best fits” our data points and gives the best estimate of the y value for every given input of x.
What is Simple Linear Regression?
• Simple Linear Regression is a method used to fit the best straight line to a set of data points.
• After a graph is properly scaled, the data points must “look” like they would fit a straight line, not a parabola or any other shape.
• The line is used as a model in order to predict a
variable y from another variable x. A regression
line must involve 2 variables, the dependent and
the independent variable.
• Finding the “best-fit” line is the goal of simple
linear regression.
Two Variable Relationships
[Figure: five scatter plots — (a) linear, (b) linear, (c) curvilinear, (d) curvilinear, (e) no relationship]
Correlation
The correlation coefficient is a quantitative
measure of the strength of the linear
relationship between two variables. The
correlation ranges from + 1.0 to - 1.0. A
correlation of 1.0 indicates a perfect linear
relationship, whereas a correlation of 0
indicates no linear relationship.
Correlation
SAMPLE CORRELATION COEFFICIENT
r = Σ(x − x̄)(y − ȳ) / √{ [Σ(x − x̄)²] [Σ(y − ȳ)²] }
where:
r = Sample correlation coefficient
n = Sample size
x = Value of the independent variable
y = Value of the dependent variable
Correlation
SAMPLE CORRELATION COEFFICIENT
or the algebraic equivalent:
r = [nΣxy − (Σx)(Σy)] / √{ [nΣx² − (Σx)²] [nΣy² − (Σy)²] }
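The algebraic form above can be checked numerically. A minimal sketch in Python, using the Midwest sales data tabulated later in these slides (variable names are my own):

```python
from math import sqrt

# Midwest Distribution sample (n = 12): years with the company (x)
# and sales in thousands (y), from the table in these slides.
x = [3, 5, 2, 8, 2, 6, 7, 1, 4, 2, 9, 6]
y = [487, 445, 272, 641, 187, 440, 346, 238, 312, 269, 655, 563]
n = len(x)

# r = [n*Sxy - Sx*Sy] / sqrt([n*Sx2 - (Sx)^2] * [n*Sy2 - (Sy)^2])
sx, sy = sum(x), sum(y)
sxy = sum(a * b for a, b in zip(x, y))
sx2 = sum(a * a for a in x)
sy2 = sum(b * b for b in y)

r = (n * sxy - sx * sy) / sqrt((n * sx2 - sx ** 2) * (n * sy2 - sy ** 2))
print(round(r, 4))  # 0.8325
```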
Correlation
Sales Years
y x xy y² x²
487 3 1,461 237,169 9
445 5 2,225 198,025 25
272 2 544 73,984 4
641 8 5,128 410,881 64
187 2 374 34,969 4
440 6 2,640 193,600 36
346 7 2,422 119,716 49
238 1 238 56,644 1
312 4 1,248 97,344 16
269 2 538 72,361 4
655 9 5,895 429,025 81
563 6 3,378 316,969 36
4,855 55 26,091 2,240,687 329
Correlation
r = [nΣxy − (Σx)(Σy)] / √{ [nΣx² − (Σx)²] [nΣy² − (Σy)²] }
  = [12(26,091) − 55(4,855)] / √{ [12(329) − (55)²] [12(2,240,687) − (4,855)²] }
  = 0.8325
Correlation
                     Sales         Years with Midwest
Sales                1
Years with Midwest   0.832534056   1
Excel correlation output: the correlation between Years and Sales is 0.8325.
Correlation
TEST STATISTIC FOR CORRELATION
t = r / √[ (1 − r²) / (n − 2) ],   df = n − 2
where:
t = Number of standard deviations r is from 0
r = Simple correlation coefficient
n = Sample size
Correlation Significance Test
H₀: ρ = 0.0 (no correlation)
Hₐ: ρ ≠ 0.0
α = 0.05
Rejection regions: α/2 = 0.025 in each tail; −t₀.₀₂₅ = −2.228 and t₀.₀₂₅ = 2.228 (df = n − 2 = 10)
t = r / √[(1 − r²)/(n − 2)] = 0.8325 / √[(1 − 0.6931)/10] = 4.752
Since t = 4.752 > 2.228, reject H₀: there is a significant linear relationship.
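A quick numeric check of this test statistic, assuming r = 0.832534 and n = 12 from the example above:

```python
from math import sqrt

# t statistic for H0: rho = 0, with df = n - 2.
r = 0.832534  # sample correlation between years and sales (from the slides)
n = 12

t = r / sqrt((1 - r ** 2) / (n - 2))
print(round(t, 3))  # 4.752, well beyond the 2.228 critical value
```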
Correlation
Spurious correlation occurs when
there is a correlation between two
otherwise unrelated variables.
Simple Linear Regression
Analysis
Simple linear regression analysis
analyzes the linear relationship that
exists between a dependent variable
and a single independent variable.
Simple Linear Regression
One Variable
• Problem: A waiter wants to predict his next tip, but he forgot to record the bill amounts for previous tips.
• Here is a graph of his tips. The tip amount is the only variable; let’s call it the y variable.
• Meal # is not a variable. It is simply used to identify a tip.
Can we come up with a model for this problem with only one variable? Yes: use the mean of the tips.
ŷ = 10
• Now, let’s talk about goodness of fit. This will tell us how well our data points fit the line.
• We need to calculate the residuals (errors) for each point.
[Figure: tips plotted with the line ŷ = 10; residuals −5, +7, +1, −2, +4, −5]
Residual
A residual is the difference between
the actual value of the dependent
variable and the value predicted by
the regression model.
Residual = y − ŷ
• The best fit line is the one that minimizes the sum of the squares of the
residuals (errors).
• The error is the difference between the actual data point and the point on
the line.
• SSE (sum of squared errors) = (−5)² + 7² + 1² + (−2)² + 4² + (−5)² = 120
[Figure: the same residuals (−5, +7, +1, −2, +4, −5) shown around the line ŷ = 10]
• SST (total sum of squares) = SSR (regression sum of squares) + SSE is the sum-of-squares identity.
• Since there is no regression line (we only have one variable), SSR = 0 and we cannot make the SSE any smaller than 120.
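The mean-only model above can be sketched as follows. The individual tip amounts are not given on the slide, so the values below are reconstructed from the residuals (−5, +7, +1, −2, +4, −5) around ŷ = 10 and are illustrative:

```python
# Hypothetical tip amounts consistent with the slide's residuals.
tips = [5, 17, 11, 8, 14, 5]

y_hat = sum(tips) / len(tips)            # mean-only model: y-hat = 10
residuals = [t - y_hat for t in tips]    # -5, +7, +1, -2, +4, -5
sse = sum(e ** 2 for e in residuals)     # with one variable, SSE equals SST

print(y_hat, sse)  # 10.0 120.0
```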
Two Variables
Simple Linear Regression
Analysis
SIMPLE LINEAR REGRESSION MODEL
(POPULATION MODEL)
y = β₀ + β₁x + ε
where:
y = Value of the dependent variable
x = Value of the independent variable
β₀ = Population’s y-intercept
β₁ = Slope of the population regression line
ε = Error term, or residual
Simple Linear Regression
Analysis
The simple linear regression model has four
assumptions:
Individual values of the error terms, εᵢ, are statistically independent of one another.
The distribution of all possible values of ε is normal.
The distributions of possible εᵢ values have equal variances for all values of x.
The means of the dependent variable, y, for all specified values of the independent variable, x, can be connected by a straight line called the population regression model.
Simple Linear Regression
Analysis
REGRESSION COEFFICIENTS
In the simple regression model, there
are two coefficients: the intercept and
the slope.
Simple Linear Regression
Analysis
The interpretation of the regression slope coefficient is that it gives the average change in the dependent variable for a unit increase in the independent variable. The slope coefficient may be positive or negative, depending on the relationship between the two variables.
Simple Linear Regression
Analysis
The least squares criterion is used
for determining a regression line
that minimizes the sum of squared
residuals.
Another Example…
Experience and Sales
Simple Linear Regression
Analysis
ŷ = 150 + 60x
[Figure: scatter of sales in thousands (Y) vs. years with the company (X); at x = 4 the line gives ŷ = 150 + 60(4) = 390, while the observed value is 312, so the residual = 312 − 390 = −78]
Simple Linear Regression
Analysis
ESTIMATED REGRESSION MODEL
(SAMPLE MODEL)
ŷ = b₀ + b₁x
where:
ŷ= Estimated, or predicted, y value
b0 = Unbiased estimate of the regression intercept
b1 = Unbiased estimate of the regression slope
x = Value of the independent variable
Simple Linear Regression
Analysis
LEAST SQUARES EQUATIONS
b₁ = Σ(x − x̄)(y − ȳ) / Σ(x − x̄)²

algebraic equivalent:

b₁ = [Σxy − (Σx)(Σy)/n] / [Σx² − (Σx)²/n]

and

b₀ = ȳ − b₁x̄
Simple Linear Regression
Analysis
SUM OF SQUARED ERRORS
SSE = Σy² − b₀Σy − b₁Σxy
Simple Linear Regression Analysis
Sales Years
y x xy y² x²
487 3 1,461 237,169 9
445 5 2,225 198,025 25
272 2 544 73,984 4
641 8 5,128 410,881 64
187 2 374 34,969 4
440 6 2,640 193,600 36
346 7 2,422 119,716 49
238 1 238 56,644 1
312 4 1,248 97,344 16
269 2 538 72,361 4
655 9 5,895 429,025 81
563 6 3,378 316,969 36
4,855 55 26,091 2,240,687 329
Simple Linear Regression
Analysis
b₁ = [Σxy − (Σx)(Σy)/n] / [Σx² − (Σx)²/n]
   = [26,091 − 55(4,855)/12] / [329 − (55)²/12]
   = 49.9101

b₀ = ȳ − b₁x̄ = 404.5833 − 49.9101(4.5833) = 175.8288

The least squares regression line is:
ŷ = 175.8288 + 49.9101(x)
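The slope and intercept can be reproduced directly from the least squares equations (a sketch; variable names are my own):

```python
# Midwest data: years with the company (x) and sales in thousands (y).
x = [3, 5, 2, 8, 2, 6, 7, 1, 4, 2, 9, 6]
y = [487, 445, 272, 641, 187, 440, 346, 238, 312, 269, 655, 563]
n = len(x)

sx, sy = sum(x), sum(y)
sxy = sum(a * b for a, b in zip(x, y))
sx2 = sum(a * a for a in x)

# b1 = [Sxy - Sx*Sy/n] / [Sx2 - (Sx)^2/n],  b0 = ybar - b1*xbar
b1 = (sxy - sx * sy / n) / (sx2 - sx ** 2 / n)
b0 = sy / n - b1 * sx / n

print(round(b1, 4), round(b0, 4))  # 49.9101 175.8288
```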
Simple Linear Regression
Analysis
SUMMARY OUTPUT
Regression Statistics
Multiple R 0.832534056
R Square 0.693112955
Adjusted R Square 0.662424251
Standard Error 92.10553441
Observations 12
ANOVA
df SS MS F Significance F
Regression 1 191600.622 191600.622 22.58527906 0.000777416
Residual 10 84834.29469 8483.429469
Total 11 276434.9167
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 175.8288191 54.98988674 3.197475563 0.00953244 53.30369475 298.3539434 53.30369475 298.3539434
Years with Midwest 49.91007584 10.50208428 4.752397191 0.000777416 26.50996978 73.3101819 26.50996978 73.3101819
Excel Midwest Distribution Results
Least Squares Regression
Properties
The sum of the residuals from the least
squares regression line is 0.
The sum of the squared residuals is a
minimum.
The simple regression line always passes
through the mean of the y variable and the
mean of the x variable.
The least squares coefficients are unbiased estimates of β₀ and β₁.
Simple Linear Regression
Analysis
SUM OF RESIDUALS
Σ(y − ŷ) = 0
SUM OF SQUARED RESIDUALS
Σ(y − ŷ)² is a minimum
Simple Linear Regression
Analysis
TOTAL SUM OF SQUARES
TSS = Σ(y − ȳ)²
where:
TSS = Total sum of squares
n = Sample size
y = Values of the dependent variable
ȳ = Average value of the dependent variable
Simple Linear Regression
Analysis
SUM OF SQUARES ERROR (RESIDUALS)
SSE = Σ(y − ŷ)²
where:
SSE = Sum of squares error
n = Sample size
y = Values of the dependent variable
ŷ= Estimated value for the average of y for the
given x value
Simple Linear Regression
Analysis
SUM OF SQUARES REGRESSION
SSR = Σ(ŷ − ȳ)²
where:
SSR = Sum of squares regression
ȳ = Average value of the dependent variable
y = Values of the dependent variable
ŷ = Estimated value for the average of y for the given x value
Simple Linear Regression
Analysis
SUMS OF SQUARES
TSS = SSE + SSR
Simple Linear Regression
Analysis
The coefficient of determination is the
portion of the total variation in the
dependent variable that is explained by its
relationship with the independent variable.
The coefficient of determination is also
called R-squared and is denoted as R2.
Simple Linear Regression
Analysis
COEFFICIENT OF DETERMINATION (R2)
R² = SSR / TSS
Simple Linear Regression
Analysis
COEFFICIENT OF DETERMINATION (R2)
R² = SSR / TSS = 191,600.62 / 276,434.92 = 0.6931
69.31% of the variation in the sales data for this
sample can be explained by the linear relationship
between sales and years of experience.
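These sums of squares and R² can be verified numerically by refitting the line from the raw data (a sketch; variable names are my own):

```python
# Midwest data: years (x) and sales in thousands (y).
x = [3, 5, 2, 8, 2, 6, 7, 1, 4, 2, 9, 6]
y = [487, 445, 272, 641, 187, 440, 346, 238, 312, 269, 655, 563]
n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n

b1 = sum((a - xbar) * (b - ybar) for a, b in zip(x, y)) / sum((a - xbar) ** 2 for a in x)
b0 = ybar - b1 * xbar
y_fit = [b0 + b1 * a for a in x]

tss = sum((b - ybar) ** 2 for b in y)               # total sum of squares
sse = sum((b - f) ** 2 for b, f in zip(y, y_fit))   # sum of squares error
ssr = tss - sse                                     # sum of squares regression

r2 = ssr / tss
print(round(r2, 4))  # 0.6931
```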
Simple Linear Regression
Analysis
COEFFICIENT OF DETERMINATION
SINGLE INDEPENDENT VARIABLE CASE
R² = r²
where:
R2 = Coefficient of determination
r = Simple correlation coefficient
Simple Linear Regression
Analysis
STANDARD DEVIATION OF THE
REGRESSION SLOPE COEFFICIENT
(POPULATION)
σ_b₁ = σ_ε / √Σ(x − x̄)²
where:
σ_b₁ = Standard deviation of the regression slope (called the standard error of the slope)
σ_ε = Population standard error of the estimate
Simple Linear Regression
Analysis
ESTIMATOR FOR THE STANDARD ERROR
OF THE ESTIMATE
s = √[ SSE / (n − k − 1) ]
where:
SSE = Sum of squares error
n = Sample size
k = number of independent variables in the model
Simple Linear Regression
Analysis
ESTIMATOR FOR THE STANDARD
DEVIATION OF THE REGRESSION SLOPE
s_b₁ = s / √Σ(x − x̄)² = s / √[ Σx² − (Σx)²/n ]
where:
s_b₁ = Estimate of the standard error of the least squares slope
s = √[ SSE / (n − 2) ] = Sample standard error of the estimate
Simple Linear Regression
Analysis
TEST STATISTIC FOR TEST OF
SIGNIFICANCE OF THE REGRESSION SLOPE
t = (b₁ − β₁) / s_b₁,   df = n − 2
where:
b₁ = Sample regression slope coefficient
β₁ = Hypothesized slope
s_b₁ = Estimator of the standard error of the slope
Significance Test of
Regression Slope
H₀: β₁ = 0.0
Hₐ: β₁ ≠ 0.0
α = 0.05
Rejection regions: α/2 = 0.025 in each tail; −t₀.₀₂₅ = −2.228 and t₀.₀₂₅ = 2.228 (df = n − 2 = 10)
t = (b₁ − β₁) / s_b₁ = (49.91 − 0) / 10.50 = 4.753
Since t = 4.753 > 2.228, reject H₀: conclude that the true slope is not zero.
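The slope test can be reproduced end to end from the raw data (a sketch; it matches the Excel output's t Stat of 4.752):

```python
from math import sqrt

x = [3, 5, 2, 8, 2, 6, 7, 1, 4, 2, 9, 6]
y = [487, 445, 272, 641, 187, 440, 346, 238, 312, 269, 655, 563]
n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n
sxx = sum((a - xbar) ** 2 for a in x)

b1 = sum((a - xbar) * (b - ybar) for a, b in zip(x, y)) / sxx
b0 = ybar - b1 * xbar

sse = sum((b - (b0 + b1 * a)) ** 2 for a, b in zip(x, y))
s = sqrt(sse / (n - 2))      # sample standard error of the estimate
sb1 = s / sqrt(sxx)          # standard error of the slope

t = (b1 - 0) / sb1           # test H0: beta1 = 0
print(round(s, 2), round(sb1, 2), round(t, 3))  # 92.11 10.5 4.752
```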
Simple Linear Regression
Analysis
MEAN SQUARE REGRESSION
MSR = SSR / k
where:
SSR = Sum of squares regression
k = Number of independent variables in the model
Simple Linear Regression
Analysis
MEAN SQUARE ERROR
MSE = SSE / (n − k − 1)
where:
SSE = Sum of squares error
n = Sample size
k = Number of independent variables in the model
Significance Test
H₀: β₁ = 0.0
Hₐ: β₁ ≠ 0.0
α = 0.05
F ratio: F = MSR / MSE = 191,600.6 / 8,483.43 = 22.59
Rejection region: α = 0.05, F₀.₀₅ = 4.96 (df = 1, 10)
Since F = 22.59 > 4.96, reject H₀: conclude that the regression model explains a significant amount of the variation in the dependent variable.
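With a single independent variable, the F test is equivalent to the slope t test (F = t²). A numeric check, assuming the SSR and SSE from the ANOVA table earlier:

```python
# Sums of squares from the Midwest example (k = 1 independent variable).
ssr = 191600.622
sse = 84834.29469
n, k = 12, 1

msr = ssr / k              # mean square regression
mse = sse / (n - k - 1)    # mean square error
f = msr / mse

print(round(f, 2))  # 22.59
```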
Simple Regression Steps
Develop a scatter plot of y and x. You are
looking for a linear relationship between
the two variables.
Calculate the least squares regression line
for the sample data.
Calculate the correlation coefficient and the
simple coefficient of determination, R2.
Conduct one of the significance tests.
Simple Linear Regression
Analysis
CONFIDENCE INTERVAL ESTIMATE FOR
THE REGRESSION SLOPE
b₁ ± t_α/2 s_b₁,   df = n − 2

or equivalently:

b₁ ± t_α/2 s / √Σ(x − x̄)²

where:
s_b₁ = Standard error of the regression slope coefficient
s = Standard error of the estimate
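Plugging in the Midwest numbers reproduces the 95% interval in the Excel output (about 26.51 to 73.31), assuming t₀.₀₂₅ ≈ 2.228 with 10 df:

```python
# 95% confidence interval for the regression slope.
b1 = 49.91008       # estimated slope
sb1 = 10.50208      # standard error of the slope
t_crit = 2.228      # t(0.025, df = 10)

lo = b1 - t_crit * sb1
hi = b1 + t_crit * sb1
print(round(lo, 2), round(hi, 2))  # 26.51 73.31
```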
Simple Linear Regression
Analysis
CONFIDENCE INTERVAL FOR y | xp
ŷ ± t_α/2 s √[ 1/n + (x_p − x̄)² / Σ(x − x̄)² ]
where:
ŷ = Point estimate of the dependent variable
t = Critical value with n − 2 d.f.
s = Standard error of the estimate
n = Sample size
x_p = Specific value of the independent variable
x̄ = Mean of the independent variable observations
Simple Linear Regression
Analysis
PREDICTION INTERVAL FOR Y | xp
ŷ ± t_α/2 s √[ 1 + 1/n + (x_p − x̄)² / Σ(x − x̄)² ]
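Both intervals can be sketched for a chosen x_p; the prediction interval for a single y value is always wider than the confidence interval for the mean of y. Assuming x_p = 4 (my choice for illustration) and the fitted Midwest model:

```python
from math import sqrt

x = [3, 5, 2, 8, 2, 6, 7, 1, 4, 2, 9, 6]
y = [487, 445, 272, 641, 187, 440, 346, 238, 312, 269, 655, 563]
n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n
sxx = sum((a - xbar) ** 2 for a in x)

b1 = sum((a - xbar) * (b - ybar) for a, b in zip(x, y)) / sxx
b0 = ybar - b1 * xbar
s = sqrt(sum((b - (b0 + b1 * a)) ** 2 for a, b in zip(x, y)) / (n - 2))

xp, t_crit = 4, 2.228                 # chosen x_p; t(0.025, df = 10)
y_hat = b0 + b1 * xp

ci_margin = t_crit * s * sqrt(1 / n + (xp - xbar) ** 2 / sxx)      # mean of y
pi_margin = t_crit * s * sqrt(1 + 1 / n + (xp - xbar) ** 2 / sxx)  # single y

print(round(y_hat, 2), round(ci_margin, 2), round(pi_margin, 2))
```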
Residual Analysis
Before using a regression model for description or prediction, you should check whether the assumptions concerning the normal distribution and constant variance of the error terms have been satisfied. One way to do this is through the use of residual plots.
Simple Linear Regression Model
- Assumptions
Assumptions of CLRM
CLRM – ASSUMPTION 1
The regression model is linear in the parameters
CLRM – ASSUMPTION 2
X values are fixed in repeated sampling
Values taken by the regressor, X, are
considered fixed in repeated samples. More
technically, X is assumed to be non-
stochastic (so that Xi and ui are also
uncorrelated)
CLRM – ASSUMPTION 3
Error term εᵢ has ZERO MEAN VALUE given the value of X
Thus, the conditional mean value of εᵢ is zero.
That is, E(εᵢ | Xᵢ) = 0
CLRM – ASSUMPTION 4
Homoscedasticity, or equal variance of εᵢ
Given the value of X, the variance of εᵢ is the same for all observations
Thus, Var(εᵢ | Xᵢ) = E[εᵢ − E(εᵢ | Xᵢ)]² = σ²
CLRM – ASSUMPTION 5
No autocorrelation between the error terms
Given any two X values, Xᵢ and Xⱼ, the correlation between any two εᵢ and εⱼ is zero
Cov(εᵢ, εⱼ | Xᵢ, Xⱼ) = E{[εᵢ − E(εᵢ)] | Xᵢ}{[εⱼ − E(εⱼ)] | Xⱼ}
                     = E(εᵢ | Xᵢ)(εⱼ | Xⱼ) = 0
CLRM – ASSUMPTION 6
Zero covariance between εi and Xi or
E(εiXi) = 0
CLRM – ASSUMPTION 7
The number of observations ‘n’ must be
greater than the number of parameters
to be estimated
That is, N > P
CLRM – ASSUMPTION 8
Variability in X values: The X values in a given sample must not all be the same
Thus, Var(X) must be a finite positive number
CLRM – ASSUMPTION 9
The regression model should be
correctly specified
Therefore, there is NO SPECIFICATION
BIAS or ERROR in the model used for
empirical analysis
CLRM – ASSUMPTION 10
There is NO PERFECT MULTI-COLLINEARITY
There are no perfect linear relationships among the explanatory variables
For example, X₂ = A + B·X₁ would be perfect collinearity