Classification Summary

This document covers a module on classification techniques in machine learning, including an introduction to classification, logistic regression, and decision trees. It explains key concepts such as binary and multiclass classification, confusion matrices, and performance metrics like accuracy, precision, recall, and F1-score. Additionally, it discusses ROC curves and decision tree algorithms for evaluating feature importance.


This module has the following sessions:

● Introduction to Classification
● Logistic Regression
● Decision Trees

Introduction to Classification
In this session, you learnt about classification, which is another form of a supervised learning
algorithm. Classification is the task of predicting or detecting which category (or categories) an
observation (or data point) belongs to. In classification problems, the output variables are always
discrete values (such as 'yes' or 'no'). The main difference between classification and regression is
that classification predicts discrete categories or groups, whereas regression predicts real-valued
and continuous quantities.

There are two types of classification problems:

● Binary classification: The target variable has two classes. This is the most common type of
classification problem.

● Multiclass classification: The target variable has more than two classes.

A confusion matrix is used to assess a model's performance by tabulating the correct and
incorrect predictions made on a given data sample. The following table shows an example.

Actual \ Predicted    True (1)          False (0)

True (1)              True positive     False negative

False (0)             False positive    True negative

© upGrad Campus Private Limited. All rights reserved.


A confusion matrix can also be used to estimate the following:

● Accuracy: It is the ratio of correct predictions made by the model to the total number of
predictions. It is mathematically represented as follows:
Accuracy = Correct predictions / Total no. of predictions = (TN + TP) / (TN + FP + FN + TP)

You learnt that accuracy might not be enough to evaluate most classification models, especially
those with imbalanced classes. This is where you were introduced to the following new metrics:

● Precision: It is the probability that a predicted ‘True’ case is actually a ‘True’ case. It can be
represented as follows:
Precision = Correct positive predictions / Total positive predictions = TP / (TP + FP)

● Recall: It is the probability that an actual ‘True’ case is predicted correctly. It can be
represented as follows:
Recall = Correct positive predictions / Actual positive observations = TP / (TP + FN)

● F1-score: It is the harmonic mean of precision and recall, used to check the model's overall
hygiene and ensure that neither its precision nor its recall is too far off. It can be represented as
follows:
F1-score = (2 × Precision × Recall) / (Precision + Recall)
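The four metrics above can be sketched in a few lines of Python. The labels and predictions below are invented purely for illustration:

```python
# A minimal sketch of the confusion-matrix metrics, computed from
# hypothetical true labels and predictions (the data is made up).

def confusion_counts(actual, predicted):
    """Count TP, TN, FP, FN, treating 1 as 'True' and 0 as 'False'."""
    tp = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 1)
    tn = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 0)
    fp = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 1)
    fn = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 0)
    return tp, tn, fp, fn

def classification_metrics(actual, predicted):
    tp, tn, fp, fn = confusion_counts(actual, predicted)
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    precision = tp / (tp + fp)      # correct positives / predicted positives
    recall = tp / (tp + fn)         # correct positives / actual positives
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f1

actual    = [1, 0, 1, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 0, 1, 1, 0]
acc, prec, rec, f1 = classification_metrics(actual, predicted)
```

With this sample there are 3 true positives, 3 true negatives, 1 false positive and 1 false negative, so all four metrics happen to come out to 0.75.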



Logistic Regression
A sigmoid curve is used in logistic regression to assign probabilities to each observation. The
sigmoid curve equation (for one independent variable) is as follows:
y = P(True) = 1 / (1 + e^(−(β0 + β1x)))

Maximum Likelihood Estimation (MLE) is then used to find the values of β0 and β1 that maximise
the likelihood function. You performed this operation using the 'Real Statistics' add-in in Excel.

You also learnt that the relationship between x (the input value or feature) and probability is not
linear, so we transformed the equation such that the relationship between x and log odds is linear.
Hence, we got the following:
Log(odds) = ln(P / (1 − P)) = β0 + β1x

For multiple independent variables, the equation for logistic regression is as follows:
y = P(True) = 1 / (1 + e^(−(β0 + β1x1 + β2x2 + β3x3 + … + βnxn)))

The ‘Real Statistics’ add-in also calculates the probability of each observation using the optimal
beta values and given values of independent variables. The final predictions are then made based
on a particular cut-off.
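This probability-then-cut-off pipeline can be sketched in Python. The beta values and cut-off below are hypothetical stand-ins for whatever MLE (for example, via the 'Real Statistics' add-in) would produce on real data:

```python
import math

# A sketch of how a fitted sigmoid turns feature values into a probability
# and then into a class prediction. The betas here are hypothetical.

def predict_probability(x_values, betas):
    """P(True) = 1 / (1 + e^-(b0 + b1*x1 + ... + bn*xn))."""
    z = betas[0] + sum(b * x for b, x in zip(betas[1:], x_values))
    return 1 / (1 + math.exp(-z))

def predict_class(x_values, betas, cutoff=0.5):
    """Final prediction: 1 if the probability clears the cut-off, else 0."""
    return 1 if predict_probability(x_values, betas) >= cutoff else 0

betas = [-1.0, 0.8, 0.3]                     # hypothetical b0, b1, b2 from MLE
p = predict_probability([2.0, 1.0], betas)   # z = -1.0 + 1.6 + 0.3 = 0.9
```

For this observation, z = 0.9 gives a probability of about 0.71, which clears a 0.5 cut-off, so the predicted class is 1.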

You were also introduced to some new metrics to help you decide the optimal cut-off for
prediction. They are as follows:

● Sensitivity = Number of actual yeses correctly predicted / Total number of actual yeses
= TP / (TP + FN) = True positive rate

● Specificity = Number of actual nos correctly predicted / Total number of actual nos
= TN / (TN + FP) = 1 − FP / (TN + FP) = 1 − False positive rate



The Receiver Operating Characteristic (ROC) curve is used to assess the diagnostic capability of
any binary classifier system as the cut-off is varied. [Figure: a typical ROC curve, plotting the
true positive rate against the false positive rate.]

The following are the steps for plotting an ROC curve:

1. Choose a cut-off probability value for a given model.

2. Calculate the true positive rate, also known as recall or sensitivity.

3. Calculate the false positive rate, which is defined as (1-Specificity).

4. Plot the point on the graph.

5. Repeat steps 1 to 4 for different cut-off probability values to arrive at
different points on the graph, which you can then connect to plot the curve.
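The steps above can be sketched in Python. The labels, predicted probabilities and cut-offs below are invented for illustration:

```python
# A sketch of the ROC procedure: for each cut-off, threshold the model's
# predicted probabilities and compute one (FPR, TPR) point.

def roc_points(actual, probabilities, cutoffs):
    points = []
    for c in cutoffs:
        predicted = [1 if p >= c else 0 for p in probabilities]
        tp = sum(1 for a, q in zip(actual, predicted) if a == 1 and q == 1)
        fn = sum(1 for a, q in zip(actual, predicted) if a == 1 and q == 0)
        fp = sum(1 for a, q in zip(actual, predicted) if a == 0 and q == 1)
        tn = sum(1 for a, q in zip(actual, predicted) if a == 0 and q == 0)
        tpr = tp / (tp + fn)   # sensitivity / recall
        fpr = fp / (fp + tn)   # 1 - specificity
        points.append((fpr, tpr))
    return points

actual = [1, 1, 0, 1, 0, 0]                  # hypothetical true labels
probs  = [0.9, 0.7, 0.6, 0.4, 0.3, 0.1]      # hypothetical model outputs
points = roc_points(actual, probs, cutoffs=[0.2, 0.5, 0.8])
```

Connecting the resulting (FPR, TPR) points traces the curve; note how raising the cut-off lowers both rates together, which is the positive TPR–FPR relationship described below.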

Similarly, different ROC curves can be plotted for different models (different beta values), and the
best model will have the highest area under the ROC curve. You also observed from the ROC curve
that TPR and FPR have a positive relationship; this implies that sensitivity and specificity have a
negative relationship.

You learnt the following about determining the optimal cut-off:

● The optimal cut-off depends on the business context, and one may need to trade off
between sensitivity and specificity to determine it.

● One method to select a cut-off is to ensure that the values of all metrics, i.e., accuracy,
specificity and sensitivity, are almost equal.



● The optimal cut-off probability of an ROC curve is the one which maximises the TPR and
minimises the FPR.

Decision Trees
A decision tree is a kind of classification algorithm. A classification problem can be approached in
two ways: descriptive way and discriminative way. Decision trees use an algorithm that follows the
discriminative way of classification. Unlike logistic regression, they do not need pre-defined
classification rules, so you do not need to define a cut-off score to divide the data set into
two classes. Instead, decision trees use an algorithm that selects the most important feature
for classification, then the second most important, and so on.

To determine which features are more important than others, you need to evaluate the purity of a
feature. Purity must be calculated at a block level before calculating the feature’s purity. The more
skewed the ratio of positive and negative outputs for any block is, the purer a block is and the
easier it is to predict (0 or 1) for that block.

The following are three metrics that can be used to calculate purity:

● Accuracy: It is defined as max(P1, P2), where P1 and P2 are the probabilities of the two
classes that occur in any particular region. For example, in the case discussed in the
module’s fourth segment, The Decision Tree Algorithm, P1 and P2 were the probabilities of
liking or not liking a particular block.

● Gini score: It is generally considered a better metric for purity than accuracy and can be
calculated as P1 × accuracy(P1) + P2 × accuracy(P2).

● Information gain: It is given by (1 − entropy), where entropy is defined as Σ pi × log2(1/pi). In the
case discussed in the module's fourth segment, The Decision Tree Algorithm, since there
were two classes, i.e., a block can either be liked or disliked, the entropy formula becomes
P1 × log2(1/P1) + P2 × log2(1/P2).
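The three purity metrics can be sketched for a two-class block as below. Note one assumption: the module's Gini formula is read here as P1 × P1 + P2 × P2, interpreting "accuracy(Pi)" as Pi; that reading is an inference, not something the source states:

```python
import math

# Block-level purity metrics for a two-class block, where p1 and p2 are
# the class proportions in the block (p1 + p2 = 1).

def accuracy_purity(p1, p2):
    return max(p1, p2)

def gini_score(p1, p2):
    # Assumption: "accuracy(Pi)" is read as Pi, giving p1^2 + p2^2.
    return p1 * p1 + p2 * p2

def entropy(p1, p2):
    # Sum of pi * log2(1/pi); a class with proportion 0 contributes nothing.
    return sum(p * math.log2(1 / p) for p in (p1, p2) if p > 0)

def information_gain(p1, p2):
    # The module's definition: 1 - entropy.
    return 1 - entropy(p1, p2)

# A perfectly pure block vs. a maximally mixed one:
pure  = (accuracy_purity(1.0, 0.0), gini_score(1.0, 0.0), entropy(1.0, 0.0))
mixed = (accuracy_purity(0.5, 0.5), gini_score(0.5, 0.5), entropy(0.5, 0.5))
```

A pure block scores 1.0 on accuracy and Gini with entropy 0, while a 50/50 block scores 0.5 on both with entropy 1 — the most skewed block is the purest, as described above.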

After calculating the purity of each block, take the sum-product of the block purities (within a
feature's subcategory) and their corresponding sizes; this gives the purity of that subcategory.
Repeat this for every subcategory of the feature, then multiply each subcategory's purity by its
respective size to obtain the purity of the feature. Repeat the process for each feature. Once the
purity of each feature has been calculated (with respect to one of the three metrics listed above),
the most important feature is simply the one with the highest score.
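The size-weighted aggregation described above can be sketched as follows; the block purities and counts are invented for illustration:

```python
# A sketch of the aggregation: block purities roll up into subcategory
# purity, and subcategory purities roll up into feature purity, each time
# weighted by size (number of observations).

def weighted_purity(purities_and_sizes):
    """Sum-product of purities and sizes, normalised by total size."""
    total = sum(size for _, size in purities_and_sizes)
    return sum(purity * size for purity, size in purities_and_sizes) / total

# Purity (here: max class proportion) and size of each block in two
# hypothetical subcategories of one feature:
subcat_a = weighted_purity([(0.9, 40), (0.7, 10)])   # purity of subcategory A
subcat_b = weighted_purity([(0.6, 50)])              # purity of subcategory B

# Feature purity: subcategory purities weighted by subcategory sizes.
feature_purity = weighted_purity([(subcat_a, 50), (subcat_b, 50)])
```

Computing this for every feature and comparing the resulting scores identifies the most important feature, which the algorithm splits on first.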



Disclaimer: All content and material on the upGrad Campus website is copyrighted material, either
belonging to upGrad Campus or its bona fide contributors and is purely for the dissemination of
education. You are permitted to access, print, and download extracts from this site purely for your own
education only and on the following basis:

● You can download this document from the website for self-use only.

● Any copies of this document, in part or full, saved to disk or to any other storage medium, may
only be used for subsequent, self-viewing purposes or to print an individual extract or copy for
non-commercial personal use only.

● Any further dissemination, distribution, reproduction or copying of the content of the document
herein or the uploading thereof on other websites, or use of the content for any other
commercial/unauthorized purposes in any way which could infringe the intellectual property
rights of upGrad Campus or its contributors, is strictly prohibited.

● No graphics, images, or photographs from any accompanying text in this document will be used
separately for unauthorized purposes.

● No material in this document will be modified, adapted, or altered in any way.

● No part of this document or upGrad Campus content may be reproduced or stored on any other
website or included in any public or private electronic retrieval system or service without prior
written permission from upGrad Campus.

● Any rights not expressly granted in these terms are reserved.

