
UNIT 5
DESIGN AND ANALYSIS OF MACHINE LEARNING EXPERIMENTS
CROSS-VALIDATION
Cross-validation is a statistical method used in machine learning to evaluate how well a model performs on an independent data set. It involves dividing the available data into multiple folds or subsets, using one of these folds as a validation set and training the model on the remaining folds. This process is repeated multiple times, each time using a different fold as the validation set. Finally, the results from each validation step are averaged to produce a more robust estimate of the model’s performance.
Types of Cross-Validation
• There are several types of cross-validation techniques, including k-fold cross-validation, leave-one-out cross-validation (LOOCV), holdout validation, and stratified cross-validation. The choice of technique depends on the size and nature of the data, as well as the specific requirements of the modeling problem.
1. Holdout Validation
In holdout validation we train on 50% of the given dataset and use the remaining 50% for testing. It’s a simple and quick way to evaluate a model.
The major drawback of this method is that, because we train on only 50% of the dataset, the remaining 50% may contain important information that the model never sees during training, i.e. higher bias.
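A minimal holdout-validation sketch in Python follows, assuming scikit-learn is available; the dataset, model, and random seed are illustrative choices, not part of the original slides.

# Holdout validation: train on 50% of the data, test on the other 50%.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)

# test_size=0.5 reproduces the 50/50 split described above.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.5, random_state=42
)

model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)
print("Holdout accuracy:", model.score(X_test, y_test))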
LOOCV (Leave One Out Cross Validation)
In this method we train on the whole dataset while leaving out a single data point, and iterate over every data point. In LOOCV the model is trained on n−1 samples and tested on the one omitted sample, repeating this process for each data point in the dataset. It has advantages as well as disadvantages.
An advantage of this method is that we make use of all data points, and hence it has low bias.
• The major drawback of this method is that it leads to higher variance in the testing estimate, because each test set contains only one data point. If that data point is an outlier, it can lead to higher variation. Another drawback is execution time: the model must be trained as many times as there are data points.
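A minimal LOOCV sketch, again assuming scikit-learn; note that cross_val_score refits the model once per data point, which illustrates the execution-time drawback mentioned above.

# LOOCV: n fits, each trained on n-1 samples and tested on the 1 left out.
import numpy as np
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import LeaveOneOut, cross_val_score

X, y = load_iris(return_X_y=True)

loo = LeaveOneOut()
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=loo)
print("Number of fits:", len(scores))   # one per data point (150 for iris)
print("LOOCV accuracy:", np.mean(scores))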
K-Fold Cross Validation
• In k-fold cross-validation we split the dataset into k subsets (known as folds), then train on k−1 of the subsets and leave the remaining one out for evaluating the trained model. We iterate k times, with a different subset reserved for testing each time.
Example of K Fold Cross Validation
The example below shows the training subsets and evaluation subsets generated in k-fold cross-validation. Here we have 25 instances in total. In the first iteration we use the first 20 percent of the data for evaluation and the remaining 80 percent for training (instances [1-5] for testing and [6-25] for training), while in the second iteration we use the second subset of 20 percent for evaluation and the remaining four subsets for training (instances [6-10] for testing and [1-5] plus [11-25] for training), and so on.
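This fold layout can be reproduced with scikit-learn's KFold splitter; the sketch below numbers 25 instances and prints which ones land in each test fold (the unshuffled split matches the [1-5], [6-10], ... pattern described above).

# 5-fold CV over 25 instances: each iteration tests on one 5-instance fold.
import numpy as np
from sklearn.model_selection import KFold

X = np.arange(1, 26).reshape(-1, 1)   # instances numbered 1..25
kf = KFold(n_splits=5)                # no shuffling: folds are consecutive

for i, (train_idx, test_idx) in enumerate(kf.split(X), start=1):
    print(f"Iteration {i}: test instances {X[test_idx].ravel().tolist()}, "
          f"train on the remaining {len(train_idx)} instances")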
Classification Metrics
In a classification task the goal is to predict a target variable that takes discrete values. The following evaluation metrics are commonly used to assess the performance of such a model:
• Accuracy
• Logarithmic Loss
• Area Under Curve
• Precision
• Recall
• F1 Score
• Confusion Matrix
Accuracy
Accuracy is a fundamental metric for evaluating the performance of a classification model, providing a quick snapshot of how well the model is performing in terms of correct predictions. It is calculated as the ratio of correct predictions to the total number of input samples:

Accuracy = (number of correct predictions) / (total number of input samples)

• It works well when there is a roughly equal number of samples in each class, but it is misleading on imbalanced data. For example, suppose 90% of the samples in our training set belong to class A and 10% to class B. A model that predicts class A for every sample achieves 90% training accuracy. If we test the same model on a test set with 60% class A and 40% class B, the accuracy falls to 60%, as the sketch below demonstrates.
• Accuracy can therefore give a false sense of high performance: the probability of misclassifying minority-class samples is very high, yet the overall score still looks good.
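The following sketch reproduces the imbalance pitfall with made-up label counts matching the example above; only the accuracy_score call is real scikit-learn API.

# A "model" that always predicts the majority class A (encoded as 0).
import numpy as np
from sklearn.metrics import accuracy_score

y_train = np.array([0] * 90 + [1] * 10)   # 90% class A, 10% class B
y_test  = np.array([0] * 60 + [1] * 40)   # 60% class A, 40% class B

always_a = lambda y: np.zeros_like(y)     # ignores the inputs entirely

print("Train accuracy:", accuracy_score(y_train, always_a(y_train)))  # 0.90
print("Test accuracy:",  accuracy_score(y_test,  always_a(y_test)))   # 0.60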
Area Under Curve (AUC)
The AUC is the area under the ROC curve, which plots the true positive rate against the false positive rate as the classification threshold varies; a value closer to 1 indicates that the model more reliably ranks positive samples above negative ones.
True Positive Rate
• Also termed sensitivity. The true positive rate is the proportion of positive data points that are correctly classified as positive, out of all data points that are actually positive: TPR = TP / (TP + FN).
True Negative Rate
• Also termed specificity. The true negative rate is the proportion of negative data points that are correctly classified as negative, out of all data points that are actually negative: TNR = TN / (TN + FP).
False Positive Rate
• The false positive rate is the proportion of negative data points that are incorrectly classified as positive, out of all data points that are actually negative: FPR = FP / (FP + TN).
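As a sketch, the three rates and the AUC can be computed from a confusion matrix and predicted scores with scikit-learn; the toy labels and scores below are invented for illustration.

# Rates from a confusion matrix, plus AUC from predicted scores.
from sklearn.metrics import confusion_matrix, roc_auc_score

y_true   = [0, 0, 0, 0, 1, 1, 1, 1]                   # actual labels
y_pred   = [0, 0, 1, 0, 1, 1, 0, 1]                   # hard predictions
y_scores = [0.1, 0.2, 0.6, 0.3, 0.8, 0.9, 0.4, 0.7]   # predicted probabilities

tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print("TPR (sensitivity):", tp / (tp + fn))   # 0.75
print("TNR (specificity):", tn / (tn + fp))   # 0.75
print("FPR:", fp / (fp + tn))                 # 0.25
print("AUC:", roc_auc_score(y_true, y_scores))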
Precision
• Another important metric is precision. Precision is a measure of a model’s performance that tells you how many of the positive predictions made by the model are actually correct:

Precision = TP / (TP + FP)
Recall
• Recall is the ratio of correctly predicted positive instances to the total number of actual positive instances. It measures how well the model captures all relevant positive cases:

Recall = TP / (TP + FN)
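A short sketch computing both metrics with scikit-learn, reusing the toy predictions from the AUC example above.

# Precision = TP / (TP + FP); Recall = TP / (TP + FN).
from sklearn.metrics import precision_score, recall_score

y_true = [0, 0, 0, 0, 1, 1, 1, 1]
y_pred = [0, 0, 1, 0, 1, 1, 0, 1]

print("Precision:", precision_score(y_true, y_pred))  # 3 / (3 + 1) = 0.75
print("Recall:",    recall_score(y_true, y_pred))     # 3 / (3 + 1) = 0.75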
