W6 CSE 4781 Classification Metrics

The document discusses classification metrics in machine learning, including confusion matrix, accuracy, precision, recall, F1 score, and ROC-AUC. It explains how these metrics evaluate the performance of classification algorithms, using examples like fraud detection and email spam classification. The ROC AUC score is highlighted as a comprehensive measure of model quality across various classification thresholds.


S M Sadakatul Bari
Asst. Prof., AE (Avionics), AAUB
CLASSIFICATION METRICS
IN MACHINE LEARNING
• Confusion Matrix
• Accuracy
• Precision
• Recall
• F1 score
• ROC-AUC
CONFUSION MATRIX
• A confusion matrix is a table that evaluates the performance of a classification algorithm.
• The matrix compares the actual (target) values with the values predicted by the machine learning (ML) model.
Example:

Fraud detection: predicting whether a payment transaction is fraudulent.

The words “positive” and “negative” refer to the target and non-target classes.
In this example, fraud is our target, so transactions flagged as fraudulent are the “positives.”

Now let’s say we have an email spam classification model. It is a binary classification problem: the two possible classes are “spam” and “not spam.”

After training the model, we generated predictions for 10000 emails in the validation dataset.
We already know the actual labels, so we can evaluate the quality of the model predictions.
The calculations below imply the following confusion matrix (spam is the positive class):

                   Predicted spam   Predicted not spam
Actual spam        TP = 600         FN = 300
Actual not spam    FP = 100         TN = 9000
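As a minimal sketch (not from the original slides), the same matrix can be computed with scikit-learn; the label arrays below are synthetic stand-ins arranged to reproduce the counts above:

import numpy as np
from sklearn.metrics import confusion_matrix

# Synthetic validation labels: 900 actual spam (1), 9100 actual not spam (0).
y_true = np.array([1] * 900 + [0] * 9100)
# Predictions arranged to give 600 TP, 300 FN, 100 FP, 9000 TN.
y_pred = np.array([1] * 600 + [0] * 300 + [1] * 100 + [0] * 9000)

# scikit-learn returns [[TN, FP], [FN, TP]] for labels sorted as [0, 1].
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print(tp, fp, fn, tn)  # 600 100 300 9000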
Accuracy: How often is the model correct overall?

Accuracy = (TP + TN) / (TP + TN + FP + FN)

In our example above, accuracy is (600 + 9000) / 10000 = 0.96. The model was correct in 96% of cases.
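The same calculation as a short sketch, with the counts taken from the example confusion matrix:

# TP = 600, TN = 9000, FP = 100, FN = 300 in the example.
tp, tn, fp, fn = 600, 9000, 100, 300
accuracy = (tp + tn) / (tp + tn + fp + fn)
print(accuracy)  # 0.96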
Precision: How often does the model correctly predict the positive class?

Precision = TP / (TP + FP)

In our example above, precision is 600 / (600 + 100) ≈ 0.86. When predicting “spam,” the model was correct in 86% of cases.
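The precision calculation as a sketch, using the same example counts:

tp, fp = 600, 100           # true positives, false positives
precision = tp / (tp + fp)
print(round(precision, 2))  # 0.86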
Recall: How many of the actual positive samples in the dataset does the model correctly identify (true positives)?

Recall = TP / (TP + FN)

Recall can also be called sensitivity or true positive rate (TPR).

In our example above, recall is 600 / (600 + 300) ≈ 0.67. The model correctly found 67% of spam emails; the other 33% made their way to the inbox unlabeled.
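The recall calculation as a sketch, again using the example counts:

tp, fn = 600, 300        # true positives, false negatives
recall = tp / (tp + fn)
print(round(recall, 2))  # 0.67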
The F1 score is the harmonic mean (a kind of average) of precision and recall.

F1 = 2 × Precision × Recall / (Precision + Recall)

• Preferable to plain accuracy for class-imbalanced datasets, since F1 ignores true negatives and a model cannot score well simply by predicting the majority class.
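A sketch of the F1 calculation for the example (precision ≈ 0.86, recall ≈ 0.67):

tp, fp, fn = 600, 100, 300
precision = tp / (tp + fp)
recall = tp / (tp + fn)
f1 = 2 * precision * recall / (precision + recall)
print(round(f1, 2))  # 0.75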


What is a ROC curve?

ROC stands for Receiver Operating Characteristic. The ROC curve is a graphical representation of the performance of a binary classifier at different classification thresholds.

The curve plots the True Positive Rate (TPR = TP / (TP + FN)) against the False Positive Rate (FPR = FP / (FP + TN)) as the decision threshold varies.
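A minimal sketch of tracing a ROC curve with scikit-learn's roc_curve; the labels and scores below are toy values invented for illustration:

import numpy as np
from sklearn.metrics import roc_curve

y_true  = np.array([0, 0, 1, 0, 1, 0, 1, 1])                   # actual labels
y_score = np.array([0.1, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9])   # predicted P(positive)

# Each (FPR, TPR) pair is one point on the ROC curve, at one threshold.
fpr, tpr, thresholds = roc_curve(y_true, y_score)
for f, t, th in zip(fpr, tpr, thresholds):
    print(f"threshold={th:.2f}  FPR={f:.2f}  TPR={t:.2f}")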
What is a ROC AUC score?

ROC AUC stands for Receiver Operating Characteristic Area Under the Curve.

The ROC AUC score is a single number that summarizes the classifier's performance across all possible classification thresholds: it is the area under the ROC curve.

• ROC AUC reflects the model quality in one number. A single metric is convenient, especially when comparing multiple models.

• It sums up the performance across the different classification thresholds. It is a valuable “overall” quality measure, whereas precision and recall provide a quality “snapshot” at a given decision threshold.

• ROC AUC measures the model's ability to discriminate between the positive and negative classes, regardless of their relative frequencies in the dataset.

• During model training, it helps compare multiple ML models against each other.
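A minimal sketch of computing the score with scikit-learn's roc_auc_score, on the same toy data as above:

import numpy as np
from sklearn.metrics import roc_auc_score

y_true  = np.array([0, 0, 1, 0, 1, 0, 1, 1])
y_score = np.array([0.1, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9])

# Area under the ROC curve: one number across all thresholds.
auc = roc_auc_score(y_true, y_score)
print(f"ROC AUC = {auc:.2f}")  # ROC AUC = 0.81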
REFERENCES

• https://www.evidentlyai.com/classification-metrics
• https://www.youtube.com/watch?v=LxcRFNRgLCs
• https://www.youtube.com/watch?v=Joh3LOaG8Q0
