Ridge, Lasso & Regression Evaluation

Machine learning chapter


Overfitting and Underfitting
• Overfitting occurs when a machine learning model learns the training data
too well, capturing noise and fluctuations in the data rather than the
underlying patterns. As a result, the model performs very well on the
training set but fails to generalize effectively to new, unseen data.
(Figure: overfitting illustration, Y plotted against X.)
Overfitting and Underfitting
• Underfitting occurs when a machine learning model is too simple to
capture the underlying patterns in the training data. The model fails to
learn the relevant relationships and performs poorly on both the training
set and new data.
(Figure: underfitting illustration, Y plotted against X.)
Overfitting and Underfitting
(Figure: fitted curves plotted against X.)
Bias and Variance
• If we have a model that is very accurate, the error of our model will be low, meaning low bias and low variance: all the data points fit within the bulls-eye. Similarly, if the variance increases, the spread of our data points increases, which results in less accurate predictions. And as the bias increases, the error between our predicted values and the observed values increases.
Bias and Variance
• As we add more and more parameters to our model, its complexity increases, which results in increasing variance and decreasing bias, i.e., overfitting. So we need to find the optimum point in our model where the decrease in bias is balanced by the increase in variance. In practice, there is no analytical way to find this point. So how do we deal with high variance or high bias? The sketch below illustrates the trade-off.
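A minimal sketch of this trade-off on synthetic data (the dataset, degrees, and values are all illustrative): a low-degree polynomial underfits (high bias), while a high-degree one overfits (high variance), visible as a widening gap between training and test error.

```python
# Complexity trade-off on synthetic data (all values illustrative):
# low degree underfits (high bias), high degree overfits (high variance).
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = np.sin(X).ravel() + rng.normal(scale=0.3, size=200)  # noisy signal

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

for degree in (1, 4, 15):  # underfit, reasonable, overfit
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X_train, y_train)
    train_mse = mean_squared_error(y_train, model.predict(X_train))
    test_mse = mean_squared_error(y_test, model.predict(X_test))
    print(f"degree={degree:2d}  train MSE={train_mse:.3f}  test MSE={test_mse:.3f}")
```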
Multicollinearity
• Multicollinearity is the occurrence of high intercorrelations among two or more independent variables in a multiple regression model. Multicollinearity can lead to skewed or misleading results when a researcher or analyst attempts to determine how well each independent variable predicts or explains the dependent variable in a statistical model.
(Figure: scatter of two highly correlated predictors, $x_1$ vs. $x_2$.)
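One common diagnostic for multicollinearity is the variance inflation factor (VIF). Below is a minimal sketch using statsmodels; the data and feature names are hypothetical, and the VIF > 10 threshold is only a rule of thumb.

```python
# Detecting multicollinearity with variance inflation factors (VIF);
# the feature names and data are hypothetical.
import numpy as np
import pandas as pd
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(0)
x1 = rng.normal(size=100)
x2 = 0.95 * x1 + rng.normal(scale=0.1, size=100)  # nearly collinear with x1
x3 = rng.normal(size=100)                         # independent
X = pd.DataFrame({"x1": x1, "x2": x2, "x3": x3})

# A common rule of thumb flags VIF > 10 as problematic collinearity.
for i, col in enumerate(X.columns):
    print(col, variance_inflation_factor(X.values, i))
```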
Regularization?
• Regularization is a technique used in machine learning to prevent overfitting by adding a penalty term to the objective function that the model is trying to optimize. The goal of regularization is to discourage the model from fitting the training data too closely and, instead, to encourage it to learn the underlying patterns that generalize well to new, unseen data.
• There are two types of regularization:
✓ L1 Regularization (Lasso)
✓ L2 Regularization (Ridge)
Ridge Regression
• Ridge regression is a model-tuning method used to analyse data that suffers from multicollinearity. This method performs L2 regularization. When multicollinearity occurs, least-squares estimates are unbiased but their variances are large, which results in predicted values being far from the actual values.

$$\text{Ridge Objective Function} = \sum_{i=1}^{n} (y_i - \hat{y}_i)^2 + \lambda \sum_{j=1}^{p} \beta_j^2$$
• λ (lambda) is the regularization parameter, a non-negative hyperparameter
that controls the strength of the regularization. As λ increases, the impact
of the regularization term on the objective function increases.
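A minimal sketch of ridge regression with scikit-learn on deliberately collinear synthetic data; note that `alpha` is scikit-learn's name for the λ above, and all values here are illustrative.

```python
# Ridge vs. ordinary least squares on deliberately collinear synthetic
# data; alpha is scikit-learn's name for the λ in the objective above.
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
X[:, 1] = X[:, 0] + rng.normal(scale=0.01, size=100)  # x1 ≈ x0
y = 3.0 * X[:, 0] + rng.normal(size=100)

ols = LinearRegression().fit(X, y)
ridge = Ridge(alpha=10.0).fit(X, y)  # larger alpha => stronger shrinkage

print("OLS coefficients:  ", ols.coef_)    # unstable under collinearity
print("Ridge coefficients:", ridge.coef_)  # shrunk, far more stable
```

With collinear predictors, OLS splits the shared signal between the two near-duplicate columns almost arbitrarily, while ridge shrinks both toward a stable compromise.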
Lasso Regression
• LASSO regression, also known as L1 regularization, is a popular technique
used in statistical modeling and machine learning to estimate the
relationships between variables and make predictions. LASSO stands for
Least Absolute Shrinkage and Selection Operator.

$$\text{Lasso Objective Function} = \sum_{i=1}^{n} (y_i - \hat{y}_i)^2 + \lambda \sum_{j=1}^{p} |\beta_j|$$
• λ (lambda) is the regularization parameter, a non-negative hyperparameter that controls the strength of the regularization.
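A minimal sketch of lasso's feature-selection effect with scikit-learn: with an L1 penalty, coefficients of irrelevant features are driven exactly to zero (synthetic data, illustrative `alpha`).

```python
# Lasso driving irrelevant coefficients exactly to zero on synthetic
# data; alpha again corresponds to λ in the objective above.
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))
y = 4.0 * X[:, 0] - 2.0 * X[:, 3] + rng.normal(size=200)  # 2 real features

lasso = Lasso(alpha=0.1).fit(X, y)
print(lasso.coef_)  # most entries come out exactly 0.0
```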
Regression Evaluation
• Model evaluation in machine learning is a crucial step that serves various purposes. It enables the assessment of a model's performance on a specific task using appropriate metrics; for regression, these include MAE, MSE, RMSE, and R².
• Evaluating on a separate test set ensures the
model's ability to make accurate predictions
on new, unseen data, indicating its
generalization capabilities. Additionally,
model evaluation facilitates the comparison
of different models, aiding in the selection of
the most effective one for a given task. It
plays a key role in hyperparameter tuning,
helping to find optimal settings and improve
overall performance.
Mean Absolute Error (MAE)
• MAE is a metric used to measure the average magnitude of errors between
predicted and actual values in a regression task. It is calculated by taking the
average of the absolute differences between the predicted and true values.
MAE provides a straightforward way to assess the accuracy of a predictive
model, with lower MAE values indicating better performance.

• MAE formula:
$$MAE = \frac{1}{N} \sum_{i=1}^{N} |y_i - \hat{y}_i|$$

• where $N$ is the number of data points, $y_i$ is the true value for the $i$-th data point, and $\hat{y}_i$ is the predicted value for the $i$-th data point.
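As a quick check, here is a minimal sketch computing MAE both directly from the formula and with scikit-learn, on illustrative arrays:

```python
# MAE computed directly from the formula and via scikit-learn;
# the arrays are illustrative.
import numpy as np
from sklearn.metrics import mean_absolute_error

y_true = np.array([3.0, 5.0, 2.5, 7.0])
y_pred = np.array([2.5, 5.0, 4.0, 8.0])

mae = np.mean(np.abs(y_true - y_pred))           # formula above
print(mae, mean_absolute_error(y_true, y_pred))  # both 0.75
```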
Mean Squared Error (MSE)
• MSE is one of the most widely used metrics and is a small variation on mean absolute error: it measures the squared difference between actual and predicted values.
• It represents the squared distance between actual and predicted values. We square the differences to avoid the cancellation of negative terms, which is a benefit of MSE.
• MSE formula:
$$MSE = \frac{1}{N} \sum_{i=1}^{N} (y_i - \hat{y}_i)^2$$

• where $N$ is the number of data points, $y_i$ is the true value for the $i$-th data point, and $\hat{y}_i$ is the predicted value for the $i$-th data point.
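A minimal sketch computing MSE the same two ways, on the same illustrative arrays:

```python
# MSE computed directly from the formula and via scikit-learn,
# on the same illustrative arrays.
import numpy as np
from sklearn.metrics import mean_squared_error

y_true = np.array([3.0, 5.0, 2.5, 7.0])
y_pred = np.array([2.5, 5.0, 4.0, 8.0])

mse = np.mean((y_true - y_pred) ** 2)           # formula above
print(mse, mean_squared_error(y_true, y_pred))  # both 0.875
```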
Root Mean Squared Error (RMSE)
• As the name itself makes clear, RMSE is simply the square root of the mean squared error.

• RMSE formula:
$$RMSE = \sqrt{\frac{1}{N} \sum_{i=1}^{N} (y_i - \hat{y}_i)^2}$$

• where $N$ is the number of data points, $y_i$ is the true value for the $i$-th data point, and $\hat{y}_i$ is the predicted value for the $i$-th data point.
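A minimal sketch of RMSE as the square root of MSE, on the same illustrative arrays:

```python
# RMSE as the square root of MSE, on the same illustrative arrays.
import numpy as np
from sklearn.metrics import mean_squared_error

y_true = np.array([3.0, 5.0, 2.5, 7.0])
y_pred = np.array([2.5, 5.0, 4.0, 8.0])

rmse = np.sqrt(mean_squared_error(y_true, y_pred))
print(rmse)  # sqrt(0.875) ≈ 0.935
```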
R Squared ($R^2$)
• R-squared (R²) is a statistical measure that represents the proportion of the
variance in the dependent variable that is explained by the independent
variables in a regression model. It is also known as the coefficient of
determination. R-squared values range from 0 to 1, where:
• $R^2 = 0$: the model does not explain any of the variability in the dependent variable.
• $R^2 = 1$: the model explains all the variability in the dependent variable.

• $R^2$ formula:
$$R^2 = 1 - \frac{\text{sum of squares of residuals } (SSR)}{\text{total sum of squares } (SST)}$$
• where $SSR = \sum_{i=1}^{N} (y_i - \hat{y}_i)^2$, $SST = \sum_{i=1}^{N} (y_i - \bar{y})^2$, and $\bar{y}$ is the mean of the dependent variable.
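A minimal sketch computing $R^2$ from SSR and SST and checking it against scikit-learn's `r2_score`, on the same illustrative arrays:

```python
# R² from the SSR/SST formula above, checked against scikit-learn;
# the arrays are illustrative.
import numpy as np
from sklearn.metrics import r2_score

y_true = np.array([3.0, 5.0, 2.5, 7.0])
y_pred = np.array([2.5, 5.0, 4.0, 8.0])

ssr = np.sum((y_true - y_pred) ** 2)         # sum of squared residuals
sst = np.sum((y_true - y_true.mean()) ** 2)  # total sum of squares
print(1 - ssr / sst, r2_score(y_true, y_pred))  # both ≈ 0.724
```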


Session Finished

Thank You!
MACHINFY EDUCATION TEAM
