Data Science

The document outlines key evaluation metrics used in data science, including confusion matrix, precision, recall, F1-score, accuracy, true positive rate, and false positive rate. It provides definitions and formulas for each metric, along with examples to illustrate their application. Additionally, it discusses advanced metrics like the area under the ROC curve, Dice score, and Intersection over Union (IoU) for assessing model performance.


ED5340 - Data Science: Theory and Practise

L24 - Evaluation Metrics

Ramanathan Muthuganapathy (https://ed.iitm.ac.in/~raman)


Course web page: https://ed.iitm.ac.in/~raman/datascience.html
Moodle page: Available at https://courses.iitm.ac.in/
Classification

• Confusion Matrix

• Precision

• Recall

• F1-Score

• True positive rate

• False positive rate

• Accuracy

• AUC
Confusion Matrix

                        Actual Class
                          1      0
Predicted Class    1     TP     FP
                   0     FN     TN

TP - True Positive
FP - False Positive
FN - False Negative
TN - True Negative


Details

• TP (True Positive) - Actual and prediction are both positive.

• FP (False Positive) - Actual is negative but the prediction is positive (predicting cancer when there is no such case).

• FN (False Negative) - Actual is positive but the prediction is negative (predicting no cancer when there is one).

• TN (True Negative) - Actual and prediction are both negative.
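As a minimal sketch (not from the original slides), the four counts can be tallied directly from paired actual/predicted labels; the function name confusion_counts and the toy labels are illustrative assumptions.

# Minimal sketch: counting TP, FP, FN, TN for binary labels (1 = positive, 0 = negative).
def confusion_counts(actual, predicted):
    """Return (TP, FP, FN, TN) for two equal-length sequences of 0/1 labels."""
    tp = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 1)
    fp = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 1)
    fn = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 0)
    tn = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 0)
    return tp, fp, fn, tn

# Example with made-up labels:
actual    = [1, 1, 0, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 1, 0, 1, 0]
print(confusion_counts(actual, predicted))  # (3, 1, 1, 3)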


Precision and recall

• Precision - Of all the positive predicted cases, what is the fraction that is actually positive?

• P = TP / (TP + FP)

                        Actual Class
                          1      0
Predicted Class    1     TP     FP
                   0     FN     TN


Precision and recall

• Recall - Of all the actual positive cases, what is the fraction that has been correctly predicted?

• R = TP / (TP + FN)

                        Actual Class
                          1      0
Predicted Class    1     TP     FP
                   0     FN     TN
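A short illustrative sketch (helper names assumed, not from the slides) computing precision and recall from the four counts, with simple guards for empty denominators:

# Precision and recall from confusion-matrix counts.
def precision(tp, fp):
    return tp / (tp + fp) if (tp + fp) > 0 else 0.0  # guard: no positive predictions

def recall(tp, fn):
    return tp / (tp + fn) if (tp + fn) > 0 else 0.0  # guard: no actual positives

# Using the toy counts from the earlier confusion_counts example (TP=3, FP=1, FN=1):
print(precision(tp=3, fp=1))  # 0.75
print(recall(tp=3, fn=1))     # 0.75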


Example

• Dataset - 50 cases, 40 true and 10 false

• P = TP / (TP + FP)

• R = TP / (TP + FN)

                        Actual Class
                          1      0
Predicted Class    1     30     FP
                   0     FN      3

(Reading the 40 true cases as actual positives and the 10 false cases as actual negatives: TP = 30 gives FN = 40 - 30 = 10, and TN = 3 gives FP = 10 - 3 = 7.)
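Working the visible numbers through the formulas (a sketch assuming the reconstructed counts TP = 30, FP = 7, FN = 10, TN = 3):

# Worked precision and recall for the example above.
tp, fp, fn, tn = 30, 7, 10, 3   # assumed counts reconstructed from the slide
P = tp / (tp + fp)              # 30 / 37 ≈ 0.811
R = tp / (tp + fn)              # 30 / 40 = 0.75
print(P, R)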


Precision or Recall

• E.g. - Email spam filter

• High precision or high recall?

• FP - Genuine email getting classified as spam

• FN - Spam coming to your inbox

                        Actual Spam
                          1      0
Predicted Spam     1     TP     FP
                   0     FN     TN


F1 - Score (Harmonic mean)

• Dataset - 50 cases, 40 true and 10 false

• F1 = 2 * (P * R) / (P + R)

                        Actual Class
                          1      0
Predicted Class    1     30     FP
                   0     FN      3
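Continuing with the same assumed counts (TP = 30, FP = 7, FN = 10), a quick check of the harmonic mean:

# Worked F1 for the running example.
P = 30 / 37
R = 30 / 40
F1 = 2 * P * R / (P + R)
print(round(F1, 3))  # ≈ 0.779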


Accuracy

• 50 cases, 40 true and 10 false

• Acc = (TP + TN) / (TP + FP + FN + TN)

                        Actual Class
                          1      0
Predicted Class    1     30     FP
                   0     FN      3
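Again with the assumed counts (TP = 30, FP = 7, FN = 10, TN = 3), accuracy works out as:

# Worked accuracy for the running example.
acc = (30 + 3) / (30 + 7 + 10 + 3)
print(acc)  # 0.66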


TPR and FPR

• Dataset - 50 cases, 40 true and 10 false

• TPR = TP / (TP + FN)

• FPR = FP / (FP + TN)  (fraction of negative cases being predicted incorrectly)

                        Actual Class
                          1      0
Predicted Class    1     TP     FP
                   0     FN     TN
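With the same assumed counts (TP = 30, FP = 7, FN = 10, TN = 3):

# Worked TPR and FPR for the running example.
tpr = 30 / (30 + 10)   # 0.75  (same as recall)
fpr = 7 / (7 + 3)      # 0.70
print(tpr, fpr)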


Area under ROC curve (AUC)

• The ROC curve plots TPR against FPR.

• Higher the area, the better.

• Qn: How to get this curve?

[Figure: ROC curve - TPR (y-axis) vs FPR (x-axis)]
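One standard way to obtain the curve (a sketch, not the method shown on the slide): sweep a decision threshold over the model's predicted scores and record (FPR, TPR) at each threshold. The function roc_points and the toy scores below are illustrative assumptions.

# Sketch: ROC points by sweeping a threshold over predicted scores.
def roc_points(actual, scores):
    points = []
    for t in sorted(set(scores), reverse=True):
        pred = [1 if s >= t else 0 for s in scores]
        tp = sum(1 for a, p in zip(actual, pred) if a == 1 and p == 1)
        fp = sum(1 for a, p in zip(actual, pred) if a == 0 and p == 1)
        fn = sum(1 for a, p in zip(actual, pred) if a == 1 and p == 0)
        tn = sum(1 for a, p in zip(actual, pred) if a == 0 and p == 0)
        points.append((fp / (fp + tn), tp / (tp + fn)))  # (FPR, TPR) at this threshold
    return points

print(roc_points([1, 0, 1, 1, 0], [0.9, 0.8, 0.6, 0.4, 0.2]))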


Dice score / coefficient (pixel data)

• DC = 2 * |A ∩ B| / (|A| + |B|)

• DC = 2 * (Area of intersection) / (Sum of the two areas)
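A minimal sketch (illustrative names; masks assumed to be flattened lists of 0/1 pixel values) of the Dice coefficient for two binary masks:

# Dice coefficient for two equal-length binary pixel masks.
def dice(mask_a, mask_b):
    intersection = sum(a * b for a, b in zip(mask_a, mask_b))
    total = sum(mask_a) + sum(mask_b)
    return 2 * intersection / total if total > 0 else 1.0  # both empty -> perfect match by convention

print(dice([1, 1, 0, 1], [1, 0, 0, 1]))  # 2*2 / (3+2) = 0.8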


IoU (Intersection over union)

• IoU = |A ∩ B| / |A ∪ B|

• IoU = (Area of intersection) / (Area of the union)
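A matching sketch for IoU on the same assumed masks; the standard identity DC = 2 * IoU / (1 + IoU) relates the two overlap scores:

# IoU for two equal-length binary pixel masks.
def iou(mask_a, mask_b):
    intersection = sum(a * b for a, b in zip(mask_a, mask_b))
    union = sum(1 for a, b in zip(mask_a, mask_b) if a == 1 or b == 1)
    return intersection / union if union > 0 else 1.0  # both empty -> perfect match by convention

print(iou([1, 1, 0, 1], [1, 0, 0, 1]))  # 2 / 3 ≈ 0.667; and 2*(2/3)/(1+2/3) = 0.8, matching the Dice value above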
