Model Evaluation & Selection
Evaluation metrics: How can we measure accuracy? Other metrics to
consider?
Use a separate test set of class-labeled tuples, rather than the training set, when assessing accuracy
Methods for estimating a classifier’s accuracy:
Holdout method, random subsampling
Cross-validation
Bootstrap
Comparing classifiers:
Confidence intervals
Cost-benefit analysis and ROC Curves
Classifier Evaluation Metrics: Confusion Matrix
Confusion Matrix:
Actual class \ Predicted class   C1                     ¬C1
C1                               True Positives (TP)    False Negatives (FN)
¬C1                              False Positives (FP)   True Negatives (TN)
Example of Confusion Matrix:
Actual class \ Predicted class   buy_computer = yes   buy_computer = no   Total
buy_computer = yes               6954                 46                  7000
buy_computer = no                412                  2588                3000
Total                            7366                 2634                10000
Given m classes, an entry CM(i, j) in a confusion matrix indicates the # of tuples in class i that were labeled by the classifier as class j
May have extra rows/columns to provide totals
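As an illustration of this bookkeeping, here is a minimal Python sketch (not from the original slides; the labels are made up) that tallies a confusion matrix CM(i, j) from actual and predicted class labels:

```python
def confusion_matrix(actual, predicted, classes):
    """CM[i][j] = # of tuples of actual class i labeled by the classifier as class j."""
    cm = {i: {j: 0 for j in classes} for i in classes}
    for a, p in zip(actual, predicted):
        cm[a][p] += 1
    return cm

# Made-up labels, just to show the tallying.
actual    = ["yes", "yes", "no", "no", "yes", "no"]
predicted = ["yes", "no",  "no", "yes", "yes", "no"]
cm = confusion_matrix(actual, predicted, classes=["yes", "no"])

for i in ["yes", "no"]:
    print(i, [cm[i][j] for j in ["yes", "no"]])
# With "yes" as the positive class: row "yes" holds TP and FN, row "no" holds FP and TN.
```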
Classifier Evaluation Metrics: Accuracy, Error Rate, Sensitivity and Specificity
A \ P    C     ¬C
C        TP    FN     P
¬C       FP    TN     N
         P'    N'     All

Classifier accuracy, or recognition rate: percentage of test set tuples that are correctly classified
Accuracy = (TP + TN)/All
Error rate: 1 – accuracy, or
Error rate = (FP + FN)/All

Class Imbalance Problem:
One class may be rare, e.g., fraud, or HIV-positive
Significant majority of the negative class and minority of the positive class
Sensitivity: True Positive recognition rate
Sensitivity = TP/P
Specificity: True Negative recognition rate
Specificity = TN/N
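A minimal Python sketch of these four formulas (my own illustration, applied to the buy_computer counts from the earlier table: TP = 6954, FN = 46, FP = 412, TN = 2588):

```python
def basic_rates(TP, FN, FP, TN):
    P, N = TP + FN, FP + TN            # actual positives and negatives
    All = P + N
    accuracy    = (TP + TN) / All      # recognition rate
    error_rate  = (FP + FN) / All      # = 1 - accuracy
    sensitivity = TP / P               # true positive recognition rate
    specificity = TN / N               # true negative recognition rate
    return accuracy, error_rate, sensitivity, specificity

# buy_computer confusion matrix from the earlier slide.
acc, err, sens, spec = basic_rates(TP=6954, FN=46, FP=412, TN=2588)
print(f"accuracy={acc:.4f}  error={err:.4f}  sensitivity={sens:.4f}  specificity={spec:.4f}")
```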
Classifier Evaluation Metrics: Precision and Recall, and F-measures
Precision: exactness – what % of tuples that the classifier labeled as positive are actually positive
Precision = TP/(TP + FP)
Recall: completeness – what % of positive tuples did the classifier label as positive?
Recall = TP/(TP + FN) = TP/P
A perfect score is 1.0
There is an inverse relationship between precision and recall
F measure (F1 or F-score): harmonic mean of precision and recall
F1 = 2 × Precision × Recall / (Precision + Recall)
Fβ: weighted measure of precision and recall; assigns β times as much weight to recall as to precision
Fβ = (1 + β²) × Precision × Recall / (β² × Precision + Recall)
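The same formulas in a short Python sketch (my own illustration; the counts TP = 90, FP = 140, FN = 210 come from the cancer example below):

```python
def precision_recall_f(TP, FP, FN, beta=1.0):
    precision = TP / (TP + FP)
    recall    = TP / (TP + FN)
    # F_beta = (1 + beta^2) * P * R / (beta^2 * P + R); beta = 1 gives the harmonic mean (F1).
    f_beta = (1 + beta**2) * precision * recall / (beta**2 * precision + recall)
    return precision, recall, f_beta

p, r, f1 = precision_recall_f(TP=90, FP=140, FN=210)
print(f"precision={p:.4f}  recall={r:.4f}  F1={f1:.4f}")  # 0.3913, 0.3000, ~0.34
```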
Example
Actual class \ Predicted class   cancer = yes   cancer = no   Total   Recognition (%)
cancer = yes                     90             210           300     30.00 (sensitivity)
cancer = no                      140            9560          9700    98.56 (specificity)
Total                            230            9770          10000   96.50 (accuracy)
A \ P    C     ¬C
C        TP    FN    P
¬C       FP    TN    N
         P'    N'    All

Accuracy = (TP + TN)/All
Sensitivity = TP/P
Specificity = TN/N
Precision = 90/230 = 39.13%
Recall = 90/300 = 30.00%
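These figures can also be cross-checked with scikit-learn (assuming it is available) by expanding the table counts into label arrays; a sketch:

```python
import numpy as np
from sklearn.metrics import accuracy_score, precision_score, recall_score, confusion_matrix

# Expand the table counts into 10,000 labels; 1 = "cancer = yes" (positive class).
y_true = np.array([1] * 300 + [0] * 9700)
y_pred = np.array([1] * 90 + [0] * 210       # actual yes: 90 TP, 210 FN
                  + [1] * 140 + [0] * 9560)  # actual no: 140 FP, 9560 TN

print(confusion_matrix(y_true, y_pred))   # rows = actual class, columns = predicted class
print(accuracy_score(y_true, y_pred))     # 0.9650
print(precision_score(y_true, y_pred))    # 0.3913
print(recall_score(y_true, y_pred))       # 0.3000 (= sensitivity)
```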
Evaluating Classifier Accuracy: Holdout & Cross-Validation Methods
Holdout method
Given data is randomly partitioned into two independent sets
Training set (e.g., 2/3 of the data) for model construction
Test set (e.g., 1/3 of the data) for accuracy estimation
Random subsampling: a variation of holdout
Repeat holdout k times; accuracy = avg. of the accuracies obtained
Cross-validation (k-fold, where k = 10 is most popular)
Randomly partition the data into k mutually exclusive subsets D1, ..., Dk, each of approximately equal size
At the i-th iteration, use Di as the test set and the remaining subsets as the training set
Leave-one-out: k folds where k = # of tuples, for small-sized data (see the sketch below)
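A hedged sketch of the holdout split and 10-fold cross-validation with scikit-learn; the dataset (load_breast_cancer) and classifier (DecisionTreeClassifier) are placeholders of my choosing, not part of the slides:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split, cross_val_score, LeaveOneOut
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
clf = DecisionTreeClassifier(random_state=0)

# Holdout: 2/3 of the data for training, 1/3 for testing.
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=1/3, random_state=0)
print("holdout accuracy:", clf.fit(X_tr, y_tr).score(X_te, y_te))

# 10-fold cross-validation: average accuracy over the k test folds.
scores = cross_val_score(clf, X, y, cv=10)
print("10-fold CV accuracy:", scores.mean())

# Leave-one-out (k = number of tuples) is the same call with cv=LeaveOneOut().
```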
Model Selection: ROC Curves
ROC (Receiver Operating Characteristic) curves: for visual comparison of classification models
Show the trade-off between the true positive rate and the false positive rate
The vertical axis represents the true positive rate; the horizontal axis represents the false positive rate
To plot a model's curve, rank the test tuples in decreasing order of estimated probability of belonging to the positive class: the tuple most likely to be positive appears at the top of the list
The plot also shows a diagonal line (the expected curve of random guessing)
The area under the ROC curve is a measure of the accuracy of the model
A model with perfect accuracy will have an area of 1.0
The closer the curve is to the diagonal line (i.e., the closer the area is to 0.5), the less accurate the model
Model Selection: ROC Curves
An ROC curve for a given model shows the trade-off between the true positive rate (TPR) and the false positive rate (FPR)
Given a test set and a model, TPR is the proportion of positive (or “yes”) tuples that are correctly labeled by the model
FPR is the proportion of negative (or “no”) tuples that are mislabeled as positive
Given that TP, FP, P, and N are the numbers of true positive, false positive, positive, and negative tuples, respectively, we have:
TPR = TP/P
FPR = FP/N
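To make the construction concrete, here is a small Python sketch (made-up scores, not from the slides) that walks down the ranked list, stepping up by 1/P at each true positive and right by 1/N at each false positive, and then estimates the area under the curve; scikit-learn's roc_curve and roc_auc_score compute the same quantities.

```python
import numpy as np

# Made-up test tuples: true class (1 = positive) and the model's estimated
# probability of belonging to the positive class.
y_true  = np.array([1, 1, 0, 1, 0, 0, 1, 0])
y_score = np.array([0.95, 0.85, 0.78, 0.66, 0.60, 0.55, 0.53, 0.40])

order = np.argsort(-y_score)                  # rank tuples, most likely positive first
P, N = (y_true == 1).sum(), (y_true == 0).sum()

fpr, tpr, tp, fp = [0.0], [0.0], 0, 0
for i in order:
    if y_true[i] == 1:
        tp += 1                               # step up
    else:
        fp += 1                               # step right
    tpr.append(tp / P)                        # TPR = TP/P
    fpr.append(fp / N)                        # FPR = FP/N

# Area under the curve via the trapezoidal rule.
auc = sum((fpr[k + 1] - fpr[k]) * (tpr[k + 1] + tpr[k]) / 2 for k in range(len(fpr) - 1))
print("ROC points:", list(zip(fpr, tpr)))
print("AUC =", auc)                           # 1.0 = perfect, 0.5 = diagonal (random guessing)
```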
Issues Affecting Model Selection
Accuracy
classifier accuracy: predicting class label
Speed
time to construct the model (training time)
time to use the model (classification/prediction time)
Robustness: handling noise and missing values
Scalability: efficiency in disk-resident databases
Interpretability
understanding and insight provided by the model
Other measures, e.g., goodness of rules, such as decision tree size or compactness
of classification rules
Another Example
Consider a classifier with the following confusion matrix (TP = 15, FN = 47, FP = 12, TN = 118):

A \ P    C     ¬C
C        15    47     62
¬C       12    118    130
         27    165    192

Solution
In the above example, there are 192 cases in total (15 + 47 + 12 + 118).
Accuracy = (15 + 118)/192 = 69.27%
Precision = 15/(15 + 12) = 55.56%
Recall = 15/(15 + 47) = 24.19%
Specificity = 118/(118 + 12) = 90.77%
F1-Score = 2*15/((2*15) + 12 + 47) = 33.71%
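A quick plain-Python check of the arithmetic above:

```python
TP, FN, FP, TN = 15, 47, 12, 118
All = TP + FN + FP + TN                            # 192 cases

print("accuracy   ", (TP + TN) / All)              # 0.6927
print("precision  ", TP / (TP + FP))               # 0.5556
print("recall     ", TP / (TP + FN))               # 0.2419
print("specificity", TN / (TN + FP))               # 0.9077
print("F1         ", 2 * TP / (2 * TP + FP + FN))  # 0.3371
```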
References
Jiawei Han, Micheline Kamber and Jian Pei, “Data Mining: Concepts and Techniques”,
3rd ed., The Morgan Kaufmann Series in Data Management Systems, Morgan
Kaufmann Publishers, July 2011. ISBN 978-0123814791
https://hanj.cs.illinois.edu/bk3/bk3_slidesindex.htm