Machine Learning
Unit 1
Performance Measures
Course: Machine Learning, TY B. Tech (CSIT)
Faculty: Mrs. S. P. Patil, IT Dept., RIT
Performance metrics
■ We must carefully choose the metrics for
evaluating ML performance because:
– How the performance of ML algorithms is
measured and compared depends entirely
on the metric you choose.
– How you weight the importance of various
characteristics in the result is determined
completely by the metric you choose.
Classification
Need for a Confusion Matrix
What is a Confusion Matrix?
Creating a Confusion Matrix
Type-I and Type-II Error
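As an illustration of the preceding slides, here is a minimal Python sketch (using scikit-learn) that builds a 2×2 confusion matrix and reads off its four cells; the label vectors are made up purely for illustration. Type-I errors correspond to the false positives and Type-II errors to the false negatives.

```python
# Minimal sketch: building a confusion matrix with scikit-learn.
# The label vectors below are made up purely for illustration.
from sklearn.metrics import confusion_matrix

y_true = [1, 0, 1, 1, 0, 0, 1, 0, 0, 1]  # actual classes (1 = positive)
y_pred = [1, 0, 0, 1, 0, 1, 1, 0, 0, 0]  # classes predicted by the model

# With labels=[0, 1], the matrix is laid out as:
# [[TN, FP],
#  [FN, TP]]
tn, fp, fn, tp = confusion_matrix(y_true, y_pred, labels=[0, 1]).ravel()

print("TP =", tp, "FP =", fp, "FN =", fn, "TN =", tn)
print("Type-I errors (false positives):", fp)
print("Type-II errors (false negatives):", fn)
```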
Confusion Matrix Metrics
■ Accuracy
■ Precision
■ Recall
■ F1-Score
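The four metrics listed above are defined in terms of the confusion-matrix counts TP, TN, FP, and FN. The standard formulas, used on the following slides, are:

```latex
\begin{align}
\text{Accuracy}  &= \frac{TP + TN}{TP + TN + FP + FN} \\
\text{Precision} &= \frac{TP}{TP + FP} \\
\text{Recall}    &= \frac{TP}{TP + FN} \\
\text{F1-Score}  &= 2 \cdot \frac{\text{Precision} \cdot \text{Recall}}{\text{Precision} + \text{Recall}}
\end{align}
```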
Accuracy
Precision
Recall
F1-Score
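Continuing the toy example from the confusion-matrix sketch above, the four metrics can be computed directly with scikit-learn; the label vectors are again made up for illustration.

```python
# Minimal sketch: computing accuracy, precision, recall and F1
# on the same made-up labels as in the confusion-matrix example.
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

y_true = [1, 0, 1, 1, 0, 0, 1, 0, 0, 1]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0, 0, 0]

print("Accuracy :", accuracy_score(y_true, y_pred))   # (TP + TN) / total
print("Precision:", precision_score(y_true, y_pred))  # TP / (TP + FP)
print("Recall   :", recall_score(y_true, y_pred))     # TP / (TP + FN)
print("F1-Score :", f1_score(y_true, y_pred))         # harmonic mean of precision and recall
```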
Importance of Precision vs. Recall
■ The importance of precision vs. recall depends on the
specific application.
– In a medical diagnosis system, high recall might be
crucial: catching as many positive cases (diseases) as
possible, even if it leads to some false positives
(unnecessary tests).
– A financial fraud detection system might prioritize high
precision: minimizing false positives (wrongly declined
transactions) to avoid inconveniencing customers.
Confusion Matrix: MNIST Data Example
Precision/Recall Trade-off
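One way to see the trade-off is to sweep the decision threshold over a model's scores and watch precision and recall move in opposite directions. The sketch below uses scikit-learn's precision_recall_curve; the synthetic dataset and the logistic-regression model are only illustrative assumptions, not part of the slides.

```python
# Sketch of the precision/recall trade-off: as the decision threshold rises,
# precision typically increases while recall decreases.
# Data and model are synthetic/illustrative.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import precision_recall_curve

X, y = make_classification(n_samples=2000, weights=[0.9, 0.1], random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
scores = clf.predict_proba(X_test)[:, 1]  # predicted probability of the positive class

precisions, recalls, thresholds = precision_recall_curve(y_test, scores)
for t, p, r in zip(thresholds[::10], precisions[::10], recalls[::10]):
    print(f"threshold={t:.2f}  precision={p:.2f}  recall={r:.2f}")
```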
Choice of metric and tradeoffs
• Accuracy: Use as a rough indicator of model training progress/convergence for balanced datasets. For model performance, use only in combination with other metrics. Avoid for imbalanced datasets; consider using another metric.
• Recall (true positive rate): Use when false negatives are more expensive than false positives.
• False positive rate: Use when false positives are more expensive than false negatives.
• Precision: Use when it is very important for positive predictions to be accurate.
ROC Curve
■ The ROC curve is a visual representation of model performance across all
thresholds.
■ The ROC curve is drawn by calculating the true positive rate (TPR) and false
positive rate (FPR) at every possible threshold (in practice, at selected
intervals), then graphing TPR over FPR.
■ A perfect model, which at some threshold has a TPR of 1.0 and an FPR of 0.0,
can be represented either by a point at (0, 1), if all other thresholds are
ignored, or by the following:
ROC and AUC of a hypothetical perfect model.
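The TPR/FPR calculation described above can be reproduced with scikit-learn's roc_curve, which returns the two rates at each candidate threshold. The true labels and scores below are made up for illustration.

```python
# Sketch: computing the ROC curve points (FPR, TPR) at each threshold.
# Labels and scores are made up for illustration.
from sklearn.metrics import roc_curve

y_true   = [0, 0, 0, 0, 1, 1, 1, 1, 0, 1]
y_scores = [0.1, 0.3, 0.35, 0.8, 0.4, 0.55, 0.7, 0.9, 0.2, 0.65]

fpr, tpr, thresholds = roc_curve(y_true, y_scores)
for t, f, r in zip(thresholds, fpr, tpr):
    print(f"threshold={t:.2f}  FPR={f:.2f}  TPR={r:.2f}")
# Plotting TPR (y-axis) over FPR (x-axis) gives the ROC curve;
# a perfect model passes through the point (FPR = 0, TPR = 1).
```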
Area under the curve (AUC)
■ The area under the ROC curve (AUC) represents the probability that the
model, if given a randomly chosen positive and negative example, will
rank the positive higher than the negative.
■ The ROC curve of the perfect model above encloses a square with sides of
length 1, so its area under the curve (AUC) is 1.0. This means there is a 100%
probability that the model will correctly rank a randomly chosen positive
example higher than a randomly chosen negative example.
■ In the spam classifier example, a classifier with an AUC of 0.5 assigns
a random spam email a higher probability of being spam than a random
legitimate email only half the time, i.e., no better than chance.
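The ranking interpretation of AUC can be checked numerically: roc_auc_score should match the fraction of (positive, negative) pairs in which the positive example receives the higher score. The labels and scores below are the same illustrative values as in the ROC sketch.

```python
# Sketch: AUC as the probability that a random positive example is ranked
# above a random negative one. Labels/scores are illustrative.
from itertools import product
from sklearn.metrics import roc_auc_score

y_true   = [0, 0, 0, 0, 1, 1, 1, 1, 0, 1]
y_scores = [0.1, 0.3, 0.35, 0.8, 0.4, 0.55, 0.7, 0.9, 0.2, 0.65]

pos = [s for y, s in zip(y_true, y_scores) if y == 1]
neg = [s for y, s in zip(y_true, y_scores) if y == 0]

# Fraction of (positive, negative) pairs where the positive scores higher
# (ties count as half, matching the usual AUC definition).
pairs = list(product(pos, neg))
ranked_correctly = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p, n in pairs)
print("Pairwise estimate:", ranked_correctly / len(pairs))
print("roc_auc_score    :", roc_auc_score(y_true, y_scores))
```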
ROC Curve
This ROC curve plots the true positive rate against the false positive rate for all
possible thresholds; the red circle highlights the chosen point (at 43.68% recall).
ROC-AUC
Today’s Task
■ Study the performance metrics.
■ Solve one example in your notebook (calculate the performance
measure values).
■ Perform the experiment for one of the applications (using
Python programming).
References
■ https://www.youtube.com/watch?v=prWyZhcktn4&t=1079s
■ https://youtu.be/aWAnNHXIKww