Bias and Variance
When we talk about a machine learning model, we are really talking about how well it performs, which we measure through its prediction error. Suppose we are designing a machine learning model. A model is considered good if it generalizes properly to new input data from the problem domain, so that it can make accurate predictions on future data it has never seen. To judge how well a model learns and generalizes to new data, we look at two failure modes, overfitting and underfitting, which are the main causes of poor performance in machine learning algorithms.
Bias: Bias refers to the error due to overly simplistic assumptions in the learning algorithm. These assumptions make the model easier to comprehend and learn but may fail to capture the underlying complexities of the data. It is the error arising from the model's inability to accurately represent the true relationship between input and output. When a model performs poorly on both the training and testing data, this indicates high bias caused by an overly simple model, i.e. underfitting.
Variance: Variance, on the other hand, is the error due to the model's sensitivity to fluctuations in the training data: the variability of the model's predictions across different instances of the training data. High variance occurs when a model learns the training data's noise and random fluctuations rather than the underlying pattern. As a result, the model performs well on the training data but poorly on the testing data, indicating overfitting.
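To make the two failure modes concrete, here is a minimal sketch (our own illustration, not part of the original discussion) that fits polynomials of different degrees to the same noisy data with scikit-learn. The dataset and the specific degrees are arbitrary choices for demonstration: the degree-1 model shows high bias (large error on both splits), while the degree-15 model shows high variance (small training error, much larger test error).

```python
# Sketch: high bias vs. high variance on the same noisy dataset (assumed setup).
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error

rng = np.random.RandomState(0)
X = np.sort(rng.uniform(0, 1, 60)).reshape(-1, 1)
y = np.sin(2 * np.pi * X).ravel() + rng.normal(scale=0.2, size=60)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

for degree in (1, 4, 15):          # too simple, about right, too flexible
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X_train, y_train)
    train_err = mean_squared_error(y_train, model.predict(X_train))
    test_err = mean_squared_error(y_test, model.predict(X_test))
    print(f"degree={degree:2d}  train MSE={train_err:.3f}  test MSE={test_err:.3f}")
```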
A statistical model or a machine learning algorithm is said to underfit when it is too simple to capture the complexities of the data. Underfitting reflects the model's inability to learn the training data effectively, resulting in poor performance on both the training and testing data. In simple terms, an underfit model's predictions are inaccurate, especially on new, unseen examples. It mainly happens when we use a very simple model with overly simplified assumptions. To address underfitting, we need to use more complex models, with enhanced feature representation and less regularization.
Note: An underfitting model has high bias and low variance. Common reasons and fixes include:
1. The model is too simple, so it may not be capable of representing the complexities in the data.
2. The input features used to train the model are not adequate representations of the underlying factors influencing the target variable.
3. Excessive regularization is used to prevent overfitting, which constrains the model too much to capture the data well (see the sketch after this list).
4. The model is not trained long enough; increasing the number of epochs or the duration of training can give better results.
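The following sketch (our own example, not from the text) illustrates fixing an underfit model in the way described above: moving from a heavily regularized model with only linear features to one with a richer feature representation and weaker regularization. The data, feature degrees, and regularization strengths are assumptions made for the demonstration.

```python
# Sketch: reducing underfitting via richer features and less regularization.
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import Ridge
from sklearn.model_selection import train_test_split

rng = np.random.RandomState(1)
X = rng.uniform(-3, 3, (200, 1))
y = X.ravel() ** 2 + rng.normal(scale=0.5, size=200)   # quadratic relationship

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=1)

# Underfit: linear features only, plus excessive regularization (high bias).
underfit = make_pipeline(PolynomialFeatures(degree=1), Ridge(alpha=100.0))
# Better fit: richer feature representation and weaker regularization.
better = make_pipeline(PolynomialFeatures(degree=2), Ridge(alpha=1.0))

for name, model in [("underfit", underfit), ("better fit", better)]:
    model.fit(X_train, y_train)
    print(f"{name:10s} train R^2={model.score(X_train, y_train):.2f}  "
          f"test R^2={model.score(X_test, y_test):.2f}")
```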
A statistical model is said to be overfitted when it does not make accurate predictions on testing data. When a model is fitted too closely to the training set, it starts learning the noise and inaccurate entries in the data set, and testing on new data then reveals high variance. The model fails to categorize the data correctly because it has absorbed too many details and too much noise. Overfitting is often caused by non-parametric and non-linear methods, because these types of machine learning algorithms have more freedom in building the model from the dataset and can therefore build unrealistic models. A solution to avoid overfitting is to use a linear algorithm if we have linear data, or to constrain parameters such as the maximal depth if we are using decision trees.
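As a concrete illustration of that last suggestion, here is a minimal sketch (our own, with an assumed synthetic dataset) comparing an unconstrained decision tree with one whose maximum depth is limited. The unconstrained tree typically memorizes the noisy training labels, while the depth-limited tree generalizes better.

```python
# Sketch: limiting a decision tree's depth to curb overfitting on noisy data.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=20, n_informative=5,
                           flip_y=0.1, random_state=0)   # flip_y adds label noise
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

unconstrained = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)
constrained = DecisionTreeClassifier(max_depth=4, random_state=0).fit(X_train, y_train)

for name, tree in [("no depth limit", unconstrained), ("max_depth=4", constrained)]:
    print(f"{name:14s} train acc={tree.score(X_train, y_train):.2f}  "
          f"test acc={tree.score(X_test, y_test):.2f}")
```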
In a nutshell, overfitting is a problem where a machine learning algorithm's performance on the training data differs sharply from its performance on unseen data. Common ways to reduce overfitting include:
1. Improving the quality of the training data, which reduces overfitting by focusing the model on meaningful patterns and mitigating the risk of fitting noise or irrelevant features.
2. Increasing the amount of training data, which can improve the model's ability to generalize to unseen data and reduce the likelihood of overfitting.
3. Early stopping during the training phase: keep an eye on the loss over the training period and stop training as soon as the validation loss begins to increase (see the sketch after this list).
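The sketch below (our own illustration, not code from the text) shows one simple way to implement early stopping: train one pass over the data at a time, watch the validation loss, and stop once it has failed to improve for a few epochs. The model, dataset, and patience value are arbitrary choices for demonstration.

```python
# Sketch: early stopping by monitoring validation loss epoch by epoch.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import SGDClassifier
from sklearn.metrics import log_loss
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2, random_state=0)

# loss="log_loss" in recent scikit-learn (older releases named it "log").
model = SGDClassifier(loss="log_loss", random_state=0)
classes = np.unique(y_train)
best_loss, patience, bad_epochs = np.inf, 5, 0

for epoch in range(200):
    model.partial_fit(X_train, y_train, classes=classes)   # one epoch of training
    val_loss = log_loss(y_val, model.predict_proba(X_val))
    if val_loss < best_loss - 1e-4:
        best_loss, bad_epochs = val_loss, 0                 # still improving
    else:
        bad_epochs += 1                                     # loss plateaued or rose
    if bad_epochs >= patience:
        print(f"early stop at epoch {epoch}, best validation loss {best_loss:.4f}")
        break
```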
Ideally, a model that makes predictions with zero error is said to have a good fit on the data. This situation is achievable at a spot between overfitting and underfitting. To understand it, we have to look at the performance of our model over time as it learns from the training dataset.
As training progresses, the model keeps learning, and its error on the training and testing data keeps decreasing. If it learns for too long, however, the model becomes more prone to overfitting because it starts to pick up noise and less useful details, and its performance on unseen data declines. To get a good fit, we stop at the point just before the error on the testing data starts increasing. At this point, the model is said to perform well on the training dataset as well as on our unseen testing dataset.
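To visualize that sweet spot, here is one more sketch (our own, on an assumed synthetic dataset): a gradient-boosting model is trained in stages, the training and testing error are recorded after each stage, and we locate the stage where the test error is lowest, i.e. the point just before it starts to rise again while the training error keeps falling.

```python
# Sketch: tracking train/test error per training stage to find the good-fit point.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1500, n_features=20, flip_y=0.1, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = GradientBoostingClassifier(n_estimators=300, random_state=0)
model.fit(X_train, y_train)

# Misclassification error after each successive boosting stage.
train_err = [np.mean(pred != y_train) for pred in model.staged_predict(X_train)]
test_err = [np.mean(pred != y_test) for pred in model.staged_predict(X_test)]

best_stage = int(np.argmin(test_err))
print(f"training error keeps falling: {train_err[0]:.3f} -> {train_err[-1]:.3f}")
print(f"test error is lowest at stage {best_stage} ({test_err[best_stage]:.3f}), "
      f"and ends at {test_err[-1]:.3f}")
```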