UE21CS352A - Machine Intelligence
Dr. Uma D
Professor
Department of Computer Science & Engineering
Teaching Assistant: K S Ramalakshmi (Semester VII)
What is Regression in Machine Learning?
Regression:
• Regression is a statistical method for understanding the relationship between
independent variables (X), or features, and a dependent variable (Y), or outcome.
Once the relationship between the independent and dependent variables has been
estimated, outcomes can be predicted.
• It forecasts the value of a dependent variable (Y) from the values of independent
variables (X1, X2, …).
• It analyses the specific relationships between two or more variables.
• This is done to gain information about one variable by knowing the values of the
others.
Examples of Regression
Some examples of regression:
• Predicting rainfall using temperature and other factors.
• Predicting road accidents due to rash driving.
• Forecasting continuous outcomes such as house prices, stock prices, or sales.
• Predicting the success of future retail sales or marketing campaigns to ensure resources are
used effectively.
• Analyzing datasets to establish the relationships between variables and the output.
• Creating time-series visualizations.
Types of Regression
Scatter Plots
Choosing the type of regression using a scatter plot:
• A scatter plot is a useful visualization tool that can help
you decide which type of regression to use when
analyzing the relationship between two variables.
• Scatter plots show individual data points as dots on a
two-dimensional plane, with one variable on the x-axis
and the other variable on the y-axis.
• The pattern of the data points in a scatter plot can
provide insights into the nature of the relationship
between the variables, which can guide you in
choosing the appropriate type of regression analysis.
Linear Regression
Linear Regression:
• Linear regression is a statistical method used for predictive analysis
that models a linear relationship between a dependent variable (y) and
one or more independent variables (x).
• It is one of the simplest and easiest algorithms.
• It is used when you have continuous numerical data.
• Linear regression is best used when the relationship between the
variables can be approximated by a straight line.
Types of Linear Regression
Types of Linear Regression:
Simple Linear Regression:
If there is only one input variable (x), then such linear regression is called simple linear
regression.
Multiple Linear Regression:
If there is more than one input variable, then such linear regression is called multiple
linear regression.
NOTE: In this course, we will be focusing on Simple Linear Regression only.
Simple Linear Regression
Simple Linear Regression:
Simple Linear Regression is a regression algorithm that models
the relationship between two continuous variables: a dependent variable
and a single independent variable.
The equation of a simple linear regression model is represented as:
y = mx + b
where:
y is the dependent variable.
x is the independent variable.
m is the slope of the line (coefficient), representing how much y changes
for a unit change in x.
b is the y-intercept, which is the value of y when x is 0.
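As a concrete illustration, the slope m and intercept b of a simple linear model can be obtained in closed form from the data (m = cov(x, y)/var(x), b = ȳ − m·x̄). A minimal sketch with hypothetical data (hours studied vs. exam score, chosen here only for illustration):

```python
import numpy as np

# Hypothetical sample data: hours studied (x) vs. exam score (y)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([52.0, 55.0, 59.0, 61.0, 66.0])

# Closed-form least-squares estimates for simple linear regression:
# m = cov(x, y) / var(x),  b = mean(y) - m * mean(x)
m = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b = y.mean() - m * x.mean()

print(f"fitted line: y = {m:.2f}x + {b:.2f}")
print(f"prediction at x = 6: {m * 6 + b:.2f}")
```

The later slides find the same m and b iteratively with gradient descent; the closed form above exists only for this simple setting.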
Best Fitted Line
Best Fitted Line:
• Regression finds a line or curve through the data points on the
target-predictor graph such that the vertical distance between the
data points and the regression line is minimized.
• The distance between the data points and the line tells whether the model
has captured a strong relationship or not.
• The goal is to find the best-fitted line, which requires finding the values
of m and b for the given dataset.
Gradient Descent Algorithm
GRADIENT DESCENT ALGORITHM:
The parameters m and b can be found using
gradient descent.
Gradient descent is an iterative optimization
algorithm used to update the values of m and b
so as to minimize a loss function, which
measures the difference between the predicted
and actual y values.
Note: In the figure, theta 0 and theta 1
correspond to the y-intercept (b) and the
slope (m), respectively.
Gradient Descent Algorithm
GRADIENT DESCENT ALGORITHM:
Fig.(a) and Fig.(b) illustrate gradient descent on the loss surface.
Note: Fig.(a) shows the cost function J over the parameters b and m as a 3D surface,
with gradient descent starting from an initial random guess for the parameters.
Fig.(b) shows gradient descent approaching the minimum of the cost/loss function
after several iterations.
Gradient Descent Algorithm
Visualization of the Cost Function in 2D:
This figure shows the cost function plotted against one parameter (a 2D graph)
and how the gradient descent algorithm takes steps to reach the minimum of the
cost function with learning rate alpha.
Learning Rate Alpha: a positive constant that determines how large the steps are.
Note:
If alpha is too large, the algorithm may overshoot the minimum of the cost function (J) and
might diverge.
If alpha is small, smaller steps are taken to reach the minimum of the cost function.
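The overshoot-vs.-slow-convergence trade-off can be seen on a toy one-parameter cost. The quadratic J(theta) = theta² below is an assumed stand-in for the cost (not the regression cost from the slides), chosen because its gradient 2·theta is trivial:

```python
def gradient_descent_1d(alpha, theta=5.0, steps=20):
    """Minimize J(theta) = theta**2, whose gradient is dJ/dtheta = 2*theta."""
    for _ in range(steps):
        theta = theta - alpha * 2 * theta  # standard update: theta <- theta - alpha * gradient
    return theta

small = gradient_descent_1d(alpha=0.1)   # small alpha: steady steps toward the minimum at 0
large = gradient_descent_1d(alpha=1.1)   # large alpha: each step overshoots and the iterate diverges

print("alpha = 0.1 ->", small)
print("alpha = 1.1 ->", large)
```

With alpha = 0.1 the iterate shrinks by a factor of 0.8 each step; with alpha = 1.1 it is multiplied by −1.2 each step, so its magnitude grows without bound.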
Gradient Descent Algorithm – Working
• Gradient descent works by moving downward toward the
pits or valleys in the graph to find the minimum value.
• This is achieved by taking the derivative of the loss function,
as illustrated in the coming slides.
• During each iteration, gradient descent steps down the cost
function in the direction of the steepest descent.
• By adjusting the parameters in this direction, it seeks to
reach the minimum of the loss function and find the
best-fit values for the parameters.
• The size of each step is determined by the parameter α,
known as the Learning Rate.
Gradient Descent
Steps to Find Optimum Values of m and b:
1. Initialize m and b with random values.
2. Define the loss function: The loss function measures how far off the predictions
are from the actual values. A common loss function for linear regression is the
Mean Squared Error (MSE):

J(m, b) = (1/n) Σi (yi − (m·xi + b))²

where n is the number of data points, yi is the actual y value for the i-th data point,
and xi is the corresponding x value.
3. Calculate the gradient: Calculate the partial derivatives of the loss function with
respect to m and b. These tell us how much the loss will change if we make small
adjustments to m and b:

∂J/∂m = −(2/n) Σi xi (yi − (m·xi + b))
∂J/∂b = −(2/n) Σi (yi − (m·xi + b))
Gradient Descent Algorithm
Steps to Find Optimum Values of m and b:
4. Update parameters: Update m and b using the gradients and a learning rate (α).
The learning rate determines the step size in each iteration and should be chosen
carefully:

m = m − α · ∂J/∂m
b = b − α · ∂J/∂b

5. Iterate: Repeat steps 3 and 4 for a certain number of iterations or until the
parameters converge.
Note: To get rid of the 2 in the partial derivatives of the MSE (which is our cost
function J in linear regression) w.r.t. m and b, we can also take the MSE as

J(m, b) = (1/(2n)) Σi (yi − (m·xi + b))²

Note: In the above MSE formula, the predicted line (m·xi + b) has only two
parameters. The same idea can be extended to any number of parameters; in
that case, the error space has a higher dimension and cannot be visualized.
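The five steps above can be sketched as a short function. The toy data here (points lying exactly on y = 3x + 2) and the starting values of zero instead of random values are assumptions for illustration:

```python
import numpy as np

def fit_linear_gd(x, y, alpha, iterations):
    """Fit y = m*x + b by gradient descent on MSE = (1/n) * sum((y - (m*x + b))**2)."""
    m, b = 0.0, 0.0                           # step 1: initialize (zeros here for reproducibility)
    n = len(x)
    for _ in range(iterations):               # step 5: iterate
        error = y - (m * x + b)               # step 2: residuals used by the MSE loss
        dm = -(2.0 / n) * np.sum(x * error)   # step 3: dJ/dm
        db = -(2.0 / n) * np.sum(error)       #         dJ/db
        m -= alpha * dm                       # step 4: update with learning rate alpha
        b -= alpha * db
    return m, b

# Toy data lying exactly on y = 3x + 2 (assumed for illustration)
x = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
y = 3 * x + 2
m, b = fit_linear_gd(x, y, alpha=0.05, iterations=5000)
print(m, b)  # approaches m = 3, b = 2
```

With enough iterations and a suitably small alpha, the estimates converge to the line that generated the data.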
Numerical
Using y = mx + b, assume m = 10, b = 300, and learning rate α = 0.0001. Perform five
iterations of gradient descent on this linear regression model to find the new
parameters and observe the reduction in error through each iteration.
First iteration:
y = 10x + 300
MSE = (1/7) Σi (yi − ŷi)²
= (1/7) [(800−600)² + (950−670)² + (600−550)² + (1050−730)² + (1200−800)² + (740−590)² + (1100−760)²]
= 74485.7143
Numerical
Doing a partial derivative w.r.t. m of the MSE:
∂J/∂m = −(2/7) Σi xi (yi − ŷi) = −20388.5714
m ← m − α · (∂J/∂m) = 10 − 0.0001 × (−20388.5714) ≈ 12.0388
Doing a partial derivative w.r.t. b of the MSE:
∂J/∂b = −(2/7) Σi (yi − ŷi) = −497.1429
b ← b − α · (∂J/∂b) = 300 − 0.0001 × (−497.1429) ≈ 300.0497
Numerical
Now, with the updated parameters:
y = 12.0388x + 300.0497
MSE = (1/7) [(800−660.94)² + (950−745.1597)² + (600−600.7997)² + (1050−817.3397)² + (1200−901.5497)² + (740−648.9197)² + (1100−853.4297)²]
= 38957.23
Note that, in comparison to the start, the error value after one
iteration has been reduced.
Let's now predict new y values.
Numerical
Second iteration:
y = 12.0388x + 300.0497
MSE = (1/7) [(800 − 660.9)² + (950 − 745.15)² + (600 − 600.79)² + (1050 − 817.34)² + (1200 − 901.5)² + (740 − 648.9)² + (1100 − 853.4)²]
= 38957.23
Numerical
Doing a partial derivative w.r.t. m of the MSE:
∂J/∂m = −14443.2337
m ← 12.0388 − 0.0001 × (−14443.2337) = 13.4831
Doing a partial derivative w.r.t. b of the MSE:
∂J/∂b = −345.59
b ← 300.0497 − 0.0001 × (−345.59) = 300.0843
Therefore, after two iterations, the value of m is 13.4831 and the
value of b is 300.0843.
Numerical
Table for iteration 1: initial m = 10, b = 300
Table for iteration 2: new m = 12.0388, b = 300.0497
Numerical
Table for iteration 3: new m = 13.4831, b = 300.0843
Table for iteration 4: new m = 14.5063, b = 300.1081
Numerical
Table for iteration 5: new m = 15.231, b = 300.1243
Note that, in comparison to the start,
the error value has been reduced
significantly.
Measuring Model Performance
Measuring Model Performance:
• Goodness of fit determines how well the regression line fits the set of
observations. The process of finding the best model out of various models is
called optimization.
R-squared method:
• R-squared is a statistical measure that determines the goodness of fit.
• It measures the strength of the relationship between the dependent and
independent variables on a scale of 0-100%.
Measuring Model Performance
• A high value of R-squared indicates a small difference between the predicted
values and the actual values, and hence represents a good model.
• It is also called the coefficient of determination, or the coefficient of multiple
determination for multiple regression.
• It can be calculated from the formula:

R² = 1 − SSres / SStot = 1 − Σi (yi − ŷi)² / Σi (yi − ȳ)²
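The ratio of the residual sum of squares to the total sum of squares is straightforward to compute directly. A minimal sketch with hypothetical actual values and model predictions:

```python
import numpy as np

def r_squared(y_actual, y_pred):
    """Coefficient of determination: R^2 = 1 - SS_res / SS_tot."""
    ss_res = np.sum((y_actual - y_pred) ** 2)            # residual sum of squares
    ss_tot = np.sum((y_actual - y_actual.mean()) ** 2)   # total sum of squares
    return 1 - ss_res / ss_tot

y = np.array([3.0, 5.0, 7.0, 9.0])        # hypothetical actual values
y_pred = np.array([2.8, 5.2, 7.1, 8.9])   # hypothetical model predictions
print(r_squared(y, y_pred))               # close to 1, i.e. a good fit
```

A value near 1 (100%) means the model explains almost all of the variance in y; a value near 0 means it explains almost none.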
Outlier – Graphical Representation
Effect of Outliers on Model Prediction:
An outlier can be higher or lower than expected,
or displaced further to the right or left than
expected. Outliers can affect regression
lines, making the regression lines less accurate in
predicting other data.
Advantages vs Disadvantages of Linear Regression
Advantages:
• Simple to implement and easy to interpret the output coefficients.
• When you know the relationship between the independent and dependent
variables is linear, this algorithm is the best to use because of its lower
complexity compared to other algorithms.
Disadvantages:
• Outliers can have huge effects on the regression, and boundaries are linear in
this technique.
• Linear regression assumes a linear relationship between the dependent and
independent variables, i.e. a straight-line relationship between them. It also
assumes independence between attributes.
Advantages vs Disadvantages of Linear Regression
Advantages:
• Linear regression is susceptible to over-fitting, but this can be avoided using
dimensionality reduction techniques, regularization (L1 and L2), and
cross-validation.
Disadvantages:
• It looks at the relationship between the mean of the dependent variable and
the independent variables. Just as the mean is not a complete description of a
single variable, linear regression is not a complete description of the
relationships among variables.
THANK YOU
Dr. Uma D
Professor
Department of Computer Science & Engineering
[email protected]