2.1 Linear Regression
REGRESSION
Linear regression is one of the easiest and most
popular Machine Learning algorithms.
WHAT IS LINEAR REGRESSION?
Linear regression makes predictions for continuous/real or numeric variables.
Common uses of regression analysis are:
1. Predicting share price
2. Analyzing the impact of price changes
LINEAR REGRESSION
Linear regression shows the linear relationship, which means it finds how the value of the dependent variable (y) changes according to the value of the independent variable (x).
The linear regression model provides a sloped
straight line representing the relationship
between the variables.
Mathematically, we can represent a linear regression as:

y = a0 + a1·x + ε

Here,
y = dependent variable (target variable)
x = independent variable (predictor variable)
a0 = intercept of the line (gives an additional degree of freedom)
a1 = linear regression coefficient (scale factor applied to each input value)
ε = random error
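As a quick sketch (not from the original slides) of how a0 and a1 can be estimated from data, the sample values below are made up, and np.polyfit is just one convenient way to fit a degree-1 least-squares line:

import numpy as np

# Made-up sample data: x = independent variable, y = dependent variable
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 4.3, 6.2, 8.1, 9.9])

# np.polyfit with degree 1 returns [a1, a0] for the least-squares line y = a1*x + a0
a1, a0 = np.polyfit(x, y, 1)
print(f"a0 (intercept) = {a0:.3f}, a1 (coefficient) = {a1:.3f}")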
TYPES OF LINEAR REGRESSION
Simple Linear Regression:
If a single independent variable is used to predict the value of a numerical
dependent variable, then such a Linear Regression algorithm is called Simple Linear
Regression.
Multiple Linear Regression:
If more than one independent variable is used to predict the value of a numerical
dependent variable, then such a Linear Regression algorithm is called Multiple Linear
Regression.
LINEAR REGRESSION LINE
Positive Linear Relationship:
If the dependent variable increases on the Y-axis as the independent variable increases on the X-axis, then such a relationship is termed a positive linear relationship.

Negative Linear Relationship:
If the dependent variable decreases on the Y-axis as the independent variable increases on the X-axis, then such a relationship is called a negative linear relationship.
COST FUNCTION
Goal: find the best-fit line, meaning the error between predicted values and actual values should be minimized. The best-fit line will have the least error.
Different values for the weights or coefficients of the line (a0, a1) give different regression lines, so we need to calculate the best values for a0 and a1 to find the best-fit line. To calculate this we use a cost function.
For Linear Regression, we use the Mean Squared Error (MSE) cost function, which is the average of the squared errors between the predicted values and actual values. It can be written as:

MSE = (1/N) Σ (yi - (a1·xi + a0))²

Where,
N = total number of observations
yi = actual value
(a1·xi + a0) = predicted value
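A minimal Python version of this cost function (the function and variable names are ours, not from the slides):

import numpy as np

def mse(x, y, a0, a1):
    # Average squared difference between actual values y and
    # predicted values (a1*xi + a0)
    y_pred = a1 * x + a0
    return np.mean((y - y_pred) ** 2)

print(mse(np.array([1., 2., 3.]), np.array([2., 4., 6.]), 0.0, 2.0))  # 0.0 for a perfect fit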
GRADIENT DESCENT
Gradient descent is a method of updating a0 and a1 to reduce the cost function (MSE). The idea is that we start with some initial values for a0 and a1 and then change these values iteratively to reduce the cost: at each step, each parameter is moved a small amount (set by the learning rate) in the direction opposite to the gradient of the cost function.
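A minimal sketch of these updates in Python (the learning rate lr and epoch count are illustrative choices, not values from the slides):

import numpy as np

def gradient_descent(x, y, lr=0.01, epochs=1000):
    # Start from some initial values and update them iteratively
    a0, a1 = 0.0, 0.0
    n = len(x)
    for _ in range(epochs):
        error = y - (a1 * x + a0)                 # residuals: actual - predicted
        a0 += lr * (2.0 / n) * error.sum()        # a0 := a0 - lr * dMSE/da0
        a1 += lr * (2.0 / n) * (x * error).sum()  # a1 := a1 - lr * dMSE/da1
    return a0, a1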
GOODNESS OF FIT
A common measure of goodness of fit for a linear regression model is the coefficient of determination, R², which gives the proportion of the variance in the dependent variable that is explained by the model. An R² close to 1 indicates a good fit.
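A sketch of computing R² directly from its definition (our own illustration, not from the slides):

import numpy as np

def r_squared(y, y_pred):
    # 1 - (residual sum of squares) / (total sum of squares)
    ss_res = np.sum((y - y_pred) ** 2)
    ss_tot = np.sum((y - np.mean(y)) ** 2)
    return 1.0 - ss_res / ss_tot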
MULTIVARIATE LINEAR REGRESSION
More than one independent variable is used to predict the value of a numerical dependent variable.
y = f(x, z)
y = m1·x + m2·z + c

y is the dependent variable, i.e. the variable that needs to be estimated and predicted.
x is the first independent variable, i.e. a variable that is controllable. It is the first input.
m1 is the slope of x. It determines the angle of the line with respect to x.
z is the second independent variable, i.e. a variable that is controllable. It is the second input.
m2 is the slope of z. It determines the angle of the line with respect to z.
c is the intercept: a constant that determines the value of y when x and z are 0.
MULTIVARIATE LINEAR REGRESSION
A model with two input variables can be expressed as:
y = β0 + β1·x1 + β2·x2
In the machine learning world, there can be many dimensions. A model with three input variables can be expressed as:
y = β0 + β1·x1 + β2·x2 + β3·x3
A generalized equation for the multivariate regression model is:
y = β0 + β1·x1 + β2·x2 + … + βn·xn
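One standard way to estimate β0 … βn from data is ordinary least squares; a minimal NumPy sketch with made-up data:

import numpy as np

# Made-up data: 5 observations of two features (x1, x2) and a target y
X = np.array([[1.0, 2.0], [2.0, 1.0], [3.0, 4.0], [4.0, 3.0], [5.0, 5.0]])
y = np.array([8.0, 7.0, 15.0, 14.0, 20.0])

# Prepend a column of ones so the first fitted coefficient is the intercept β0
X_design = np.column_stack([np.ones(len(X)), X])
beta, *_ = np.linalg.lstsq(X_design, y, rcond=None)
print("β0, β1, β2 =", beta)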
MULTIVARIATE LINEAR REGRESSION
When we have multiple features and we want to train a model that can predict the price given those features, we can use multivariate linear regression. The model has to learn the parameters (θ0 to θn) on the training dataset such that, if we want to predict the price for a house that has not been sold yet, it gives a prediction close to what the house will actually sell for.
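As an illustration of this workflow, here is a sketch using scikit-learn's LinearRegression; the house features and prices below are made up, not the training dataset from the slides:

import numpy as np
from sklearn.linear_model import LinearRegression

# Made-up training set: [size_sqft, bedrooms, age_years] -> sale price
X_train = np.array([[2100, 3, 10], [1600, 2, 25], [2400, 4, 5], [1400, 2, 30]])
y_train = np.array([400_000, 250_000, 480_000, 210_000])

model = LinearRegression().fit(X_train, y_train)

# Predict the price of a house that has not been sold yet
print(model.predict([[2000, 3, 12]]))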
COST FUNCTION AND GRADIENT DESCENT FOR MULTIVARIATE LINEAR REGRESSION
For m training examples and parameters θ0 … θn, the hypothesis is hθ(x) = θ0 + θ1·x1 + … + θn·xn, and the MSE cost function is:

J(θ) = (1/2m) Σ (hθ(x^(i)) - y^(i))²

Gradient descent updates all parameters simultaneously, with learning rate α:

θj := θj - α · ∂J(θ)/∂θj  (for all j at once)
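A vectorised sketch of the same idea in Python (function names and hyperparameters are our own choices):

import numpy as np

def cost(X, y, theta):
    # J(theta) = (1/2m) * sum((X @ theta - y)^2); X includes a column of ones
    m = len(y)
    residual = X @ theta - y
    return (residual @ residual) / (2 * m)

def gradient_descent(X, y, alpha=0.01, epochs=1000):
    # Simultaneous update of all theta_j; alpha and epochs are illustrative
    m, n = X.shape
    theta = np.zeros(n)
    for _ in range(epochs):
        theta -= (alpha / m) * (X.T @ (X @ theta - y))
    return theta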
PRACTICAL IDEAS FOR MAKING GRADIENT DESCENT WORK WELL
Use feature scaling to help gradient descent converge faster. Get every feature into roughly the -1 to +1 range. It doesn't have to be exactly the -1 to +1 range, but it should be close, as in the sketch below.
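A sketch of mean normalisation, one simple way to get features into roughly that range (the function name is ours):

import numpy as np

def scale_features(X):
    # Mean normalisation: (x - mean) / (max - min) puts every feature
    # roughly into the -1 to +1 range
    return (X - X.mean(axis=0)) / (X.max(axis=0) - X.min(axis=0))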
MODEL INTERPRETATION
y = -85090 + 102.85·x1 + 43.79·x2 + 1.52·x3 - 37.91·x4 + 908.12·x5 + 364.33·x6

x1: With all other predictors held constant, if x1 is increased by one unit, the average price increases by $102.85.
x2: With all other predictors held constant, if x2 is increased by one unit, the average price increases by $43.79.
x3: With all other predictors held constant, if x3 is increased by one unit, the average price increases by $1.52.
x4: With all other predictors held constant, if x4 is increased by one unit, the average price decreases by $37.91 (length has a negative coefficient).
x5: With all other predictors held constant, if x5 is increased by one unit, the average price increases by $908.12.
x6: With all other predictors held constant, if x6 is increased by one unit, the average price increases by $364.33.
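The "one unit, all else held constant" reading can be checked numerically: bumping one predictor by 1 changes the prediction by exactly that predictor's coefficient. A small sketch using the coefficients above (the predictor values are made up):

import numpy as np

# Coefficients from the fitted model above: intercept, then x1..x6
intercept = -85090
coefs = np.array([102.85, 43.79, 1.52, -37.91, 908.12, 364.33])

def predict(x):
    return intercept + coefs @ x

x = np.array([10.0, 5.0, 100.0, 20.0, 3.0, 2.0])  # made-up predictor values
x_bumped = x.copy()
x_bumped[0] += 1.0                                 # increase x1 by one unit

print(predict(x_bumped) - predict(x))              # prints ~102.85, the x1 coefficient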
REGULARISATION
PROBLEM OF OVERFITTING
Regularisation is a technique used to reduce errors by fitting the function appropriately on the given training set and avoiding overfitting.
REGULARISATION IN ML
L1 regularisation
L2 regularisation
A regression model which uses the L1 regularisation technique is called LASSO (Least Absolute Shrinkage and Selection Operator) regression; a model which uses L2 regularisation is called Ridge regression.
REGULARISATION
Lasso Regression adds the "absolute value of magnitude" of the coefficients as a penalty term to the loss function (L).
During regularisation the output function (ŷ) does not change; the change is only in the loss function.
LOSS FUNCTION
With λ as the regularisation strength, the regularised loss functions are:

Lasso (L1): L = Σ (yi - ŷi)² + λ Σ |βj|
Ridge (L2): L = Σ (yi - ŷi)² + λ Σ βj²
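A sketch of both penalties using scikit-learn (the data is made up, and alpha plays the role of λ above):

import numpy as np
from sklearn.linear_model import Lasso, Ridge

# Made-up data
X = np.array([[1.0, 2.0], [2.0, 1.0], [3.0, 4.0], [4.0, 3.0], [5.0, 5.0]])
y = np.array([8.0, 7.0, 15.0, 14.0, 20.0])

lasso = Lasso(alpha=0.1).fit(X, y)  # L1 penalty: can shrink coefficients exactly to zero
ridge = Ridge(alpha=0.1).fit(X, y)  # L2 penalty: shrinks coefficients toward zero

print("Lasso coefficients:", lasso.coef_)
print("Ridge coefficients:", ridge.coef_)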
PROBLEM
A clinical trial gave the following data for BMI and cholesterol level for 10 patients. Predict the likely value of cholesterol level for someone who has a BMI of 27.

BMI   Cholesterol
17    140
21    189
24    210
28    240
14    130
16    100
19    135
22    166
15    130
18    170
SOLUTION:
Mean BMI = 19.4, mean cholesterol = 161
Σ(x - mean(x))(y - mean(y)) = 1522, Σ(x - mean(x))² = 172.4
Slope: a1 = 1522 / 172.4 ≈ 8.83
Intercept: a0 = 161 - 8.83 × 19.4 ≈ -10.27
Predicted cholesterol for BMI = 27: y = -10.27 + 8.83 × 27 ≈ 228
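The hand calculation can be checked in Python with NumPy's least-squares fit:

import numpy as np

bmi = np.array([17, 21, 24, 28, 14, 16, 19, 22, 15, 18], dtype=float)
chol = np.array([140, 189, 210, 240, 130, 100, 135, 166, 130, 170], dtype=float)

a1, a0 = np.polyfit(bmi, chol, 1)   # least-squares slope and intercept
print(f"a1 = {a1:.2f}, a0 = {a0:.2f}")               # ~8.83 and ~-10.27
print(f"prediction for BMI 27: {a0 + a1 * 27:.1f}")  # ~228.1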