
Decision Tree Classifiers

Tanmay Basu

Department of Data Science and Engineering


IISER Bhopal, India



Overview

▶ A decision tree builds classification models in the form of a tree structure
▶ A decision tree classifier is expressed as a recursive partition of the instance space
▶ The decision tree consists of nodes that form a rooted tree, i.e., a directed tree with a node called the 'root' that has no incoming edges
▶ Every other node in the tree has exactly one incoming edge and can have many outgoing edges
▶ Nodes with no outgoing edges are called leaf nodes, terminal nodes or decision nodes (see the sketch below)
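A minimal sketch of how such a rooted tree could be represented in code (the Node class and its field names are illustrative, not part of the slides):

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class Node:
    """One node of a decision tree.

    An internal node tests a feature and has one child per feature value;
    a leaf (terminal/decision) node stores the predicted class label.
    """
    feature: Optional[str] = None                               # feature tested here (None for leaves)
    children: Dict[str, "Node"] = field(default_factory=dict)   # one outgoing edge per feature value
    label: Optional[str] = None                                 # class label if this is a leaf

    def is_leaf(self) -> bool:
        return not self.children


# The root has no incoming edge; every other node is reachable
# through exactly one edge from its parent.
root = Node(feature="Outlook", children={"Overcast": Node(label="Yes")})
```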



Example of a Decision Tree



Weather Dataset Developed by Quinlan

SL Outlook Temperature Humidity Wind Play Tennis?


1 Sunny Hot High False No
2 Sunny Hot High True No
3 Overcast Hot High False Yes
4 Rainy Mild High False Yes
5 Rainy Cool Normal False Yes
6 Rainy Cool Normal True No
7 Overcast Cool Normal True Yes
8 Sunny Mild High False No
9 Sunny Cool Normal False Yes
10 Rainy Mild Normal False Yes
11 Sunny Mild Normal True Yes
12 Overcast Mild High True Yes
13 Overcast Hot Normal False Yes
14 Rainy Mild High True No
How the Algorithm Works

Figure 1: Transformation of the weather dataset



How the Algorithm Works

Figure 2: Decision Subtree for "Outlook"



How the Algorithm Works

Figure 3: Decision Subtree for "Outlook" and "Temperature"



Objectives and Nature of Decision Tree Algorithms

▶ The goal of a decision tree classification algorithm is to find the optimal decision tree by minimizing the generalization error.

▶ The optimal decision tree can be obtained by minimizing the number of nodes or the average depth of the tree.

▶ Induction of an optimal decision tree from given data is considered to be a hard task.

▶ There are various decision tree algorithms, such as ID3, C4.5 and CART, which are greedy in nature and construct the decision tree in a top-down recursive manner.



Basic Idea of Decision Tree Algorithms

▶ In each iteration, these algorithms partition the training set using the outcome of a discrete function of the features.

▶ The most appropriate function is selected according to some splitting criterion.

▶ After an appropriate split is selected, each node further subdivides the training set into smaller subsets.

▶ This process continues until no further split is possible or a stopping criterion is satisfied, as sketched below.
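A compact sketch of this greedy top-down procedure (the names build_tree and majority_label and the simplified stopping rules are illustrative placeholders; the score argument stands for any splitting criterion, e.g. information gain or Gini gain):

```python
from collections import Counter


def majority_label(rows):
    # Most frequent class label among the remaining training samples.
    return Counter(r["label"] for r in rows).most_common(1)[0][0]


def build_tree(rows, features, score):
    """Recursively grow a decision tree.

    rows     : list of dicts, each with feature values and a "label" key
    features : features still available for splitting
    score    : splitting criterion taking (rows, feature) and returning a number
    """
    labels = {r["label"] for r in rows}
    if len(labels) == 1 or not features:              # pure node, or nothing left to split on
        return {"leaf": majority_label(rows)}

    best = max(features, key=lambda f: score(rows, f))    # greedy choice of split
    tree = {"feature": best, "children": {}}
    for value in {r[best] for r in rows}:
        subset = [r for r in rows if r[best] == value]     # partition on the chosen feature
        rest = [f for f in features if f != best]
        tree["children"][value] = build_tree(subset, rest, score)
    return tree
```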



Splitting Measure: Information Gain
▶ The information gain for a feature over a set of training samples is the reduction in entropy caused by partitioning the training samples according to that feature.

▶ It is used as the splitting measure of the ID3 and C4.5 decision tree algorithms.

▶ The information gain of a particular feature (say f) over a set of training samples X can be defined as

$$IG(X, f) = E(X) - \sum_{v \in values(f)} \frac{|X : f = v|}{|X|}\, E(X : f = v) \qquad (1)$$

where values(f) refers to the set of all possible values of the feature f.
Splitting Measure: Information Gain

• E(X) denotes the entropy of X (sketched in code below) and is defined as

$$E(X) = -\sum_{i=1}^{c} p_i \log_2 p_i \qquad (2)$$

where c is the number of classes and p_i is the proportion of X belonging to the i-th class.
• E(X) ∈ [0, 1] when c = 2
• Entropy = 0 =⇒ all members of X belong to a single class
• Entropy = 1 =⇒ X contains an equal number of samples from the two different classes

Figure 4: Entropy Function for Binary Classification Problem
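A small sketch of the entropy computation in equation 2, assuming the class labels are given as a plain Python list:

```python
from collections import Counter
from math import log2


def entropy(labels):
    """E(X) = -sum_i p_i * log2(p_i) over the classes present in the labels."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


# The weather dataset has 9 "Yes" and 5 "No" examples:
play = ["Yes"] * 9 + ["No"] * 5
print(round(entropy(play), 3))   # 0.94
```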



How Information Gain Works

SL Outlook Temperature Humidity Wind Play Tennis?


1 Sunny Hot High False No
2 Sunny Hot High True No
3 Overcast Hot High False Yes
4 Rainy Mild High False Yes
5 Rainy Cool Normal False Yes
6 Rainy Cool Normal True No
7 Overcast Cool Normal True Yes
8 Sunny Mild High False No
9 Sunny Cool Normal False Yes
10 Rainy Mild Normal False Yes
11 Sunny Mild Normal True Yes
12 Overcast Mild High True Yes
13 Overcast Hot Normal False Yes
14 Rainy Mild High True No

$$E(X) = \sum_{i=1}^{2} -p_i \log_2 p_i = -\frac{5}{14}\log_2\frac{5}{14} - \frac{9}{14}\log_2\frac{9}{14} = 0.94$$



How Information Gain Works

$$IG(X, outlook) = E(X) - \sum_{v \in values(outlook)} \frac{|X : outlook = v|}{|X|}\, E(X : outlook = v)$$

$$E(X \mid outlook = sunny) = -\frac{2}{5}\log_2\frac{2}{5} - \frac{3}{5}\log_2\frac{3}{5} = 0.970$$

$$E(X \mid outlook = overcast) = -1\log_2 1 - 0\log_2 0 = 0$$

$$E(X \mid outlook = rainy) = -\frac{3}{5}\log_2\frac{3}{5} - \frac{2}{5}\log_2\frac{2}{5} = 0.970$$

$$IG(X, outlook) = 0.94 - \left(\frac{5}{14} \times 0.970 + \frac{4}{14} \times 0 + \frac{5}{14} \times 0.970\right) = 0.94 - 0.692 = 0.248$$
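The same calculation can be reproduced with a short sketch (the helper functions are illustrative; the two lists encode the Outlook and Play Tennis columns of the table above):

```python
from collections import Counter
from math import log2


def entropy(labels):
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def information_gain(feature_values, labels):
    """IG(X, f) = E(X) - sum over values v of |X:f=v|/|X| * E(X:f=v)."""
    n = len(labels)
    remainder = 0.0
    for v in set(feature_values):
        subset = [lab for f, lab in zip(feature_values, labels) if f == v]
        remainder += len(subset) / n * entropy(subset)
    return entropy(labels) - remainder


outlook = ["Sunny", "Sunny", "Overcast", "Rainy", "Rainy", "Rainy", "Overcast",
           "Sunny", "Sunny", "Rainy", "Sunny", "Overcast", "Overcast", "Rainy"]
play = ["No", "No", "Yes", "Yes", "Yes", "No", "Yes",
        "No", "Yes", "Yes", "Yes", "Yes", "Yes", "No"]

# Exact arithmetic gives about 0.247; the slide's 0.248 comes from
# rounding the intermediate entropies before the final subtraction.
print(round(information_gain(outlook, play), 3))   # 0.247
```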
How Information Gain Works

▶ Similarly, IG(X, temperature) = 0.029, IG(X, humidity) = 0.152 and IG(X, wind) = 0.048
▶ Hence Outlook is selected as the root node, since it has the highest information gain
▶ The next step is to grow the subtrees further for outlook = sunny and for outlook = rainy
▶ For outlook = overcast, all entries belong to a single class, hence no further split is necessary



How Information Gain Works


$$IG(X, wind : outlook = sunny) = E(X : outlook = sunny) - \sum_{v \in values(wind)} \frac{|X : wind = v \,\&\, outlook = sunny|}{|X : outlook = sunny|}\, E(X : wind = v \,\&\, outlook = sunny)$$

$$E(X : outlook = sunny) = 0.970$$

$$E(X : wind = true \,\&\, outlook = sunny) = 1 \quad \text{(since there is one yes and one no)}$$

$$E(X : wind = false \,\&\, outlook = sunny) = -\frac{1}{3}\log_2\frac{1}{3} - \frac{2}{3}\log_2\frac{2}{3} = 0.918$$
Final Decision Tree using Information Gain

$$\therefore\ IG(X, wind : outlook = sunny) = 0.97 - \left(\frac{2}{5} \times 1 + \frac{3}{5} \times 0.918\right) = 0.97 - 0.950 = 0.020$$

Similarly, IG(X, temperature : outlook = sunny) = 0.571 and IG(X, humidity : outlook = sunny) = 0.971.

Thus the left subtree of Outlook (the outlook = sunny branch) is split on Humidity, as it has the highest information gain. Eventually, the decision tree will look as follows:

Figure 5: Final Decision Tree
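For comparison, a sketch of fitting the same data with pandas and scikit-learn. Note that DecisionTreeClassifier implements CART-style binary splits, so the learned tree is not literally the multi-way ID3 tree of Figure 5, although criterion="entropy" uses the same information-gain idea; the column names simply mirror the table:

```python
import pandas as pd
from sklearn.tree import DecisionTreeClassifier, export_text

# The 14 training examples of the weather dataset.
data = pd.DataFrame({
    "Outlook":     ["Sunny", "Sunny", "Overcast", "Rainy", "Rainy", "Rainy", "Overcast",
                    "Sunny", "Sunny", "Rainy", "Sunny", "Overcast", "Overcast", "Rainy"],
    "Temperature": ["Hot", "Hot", "Hot", "Mild", "Cool", "Cool", "Cool",
                    "Mild", "Cool", "Mild", "Mild", "Mild", "Hot", "Mild"],
    "Humidity":    ["High", "High", "High", "High", "Normal", "Normal", "Normal",
                    "High", "Normal", "Normal", "Normal", "High", "Normal", "High"],
    "Wind":        [False, True, False, False, False, True, True,
                    False, False, False, True, True, False, True],
    "Play":        ["No", "No", "Yes", "Yes", "Yes", "No", "Yes",
                    "No", "Yes", "Yes", "Yes", "Yes", "Yes", "No"],
})

# One-hot encode the categorical features; sklearn trees need numeric input.
X = pd.get_dummies(data.drop(columns="Play"))
y = data["Play"]

clf = DecisionTreeClassifier(criterion="entropy", random_state=0).fit(X, y)
print(export_text(clf, feature_names=list(X.columns)))
```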



Splitting Measure: Gini Index

• It is used as the splitting measure of the CART decision tree algorithm.
• The Gini index of a set of training samples X for a particular feature f is

$$Gini(X, f) = \sum_{j=1}^{k} \frac{|X_j|}{|X|}\, Gini(X_j) \qquad (3)$$

where k denotes the number of splits for the feature f, X_j denotes the set of training samples reaching the j-th split, and

$$Gini(X_j) = 1 - \sum_{i=1}^{c} p_{i,j}^{2} \qquad (4)$$

Here c is the number of classes in the data set and p_{i,j} is the proportion of X_j belonging to the i-th class (see the sketch below).
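A sketch of equations 3 and 4 in code, again using the Outlook column of the weather dataset (helper names are illustrative):

```python
from collections import Counter


def gini(labels):
    """Gini(Xj) = 1 - sum_i p_{i,j}^2 over the classes present in Xj."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def gini_index(feature_values, labels):
    """Gini(X, f): weighted Gini impurity of the partition induced by feature f."""
    n = len(labels)
    total = 0.0
    for v in set(feature_values):
        subset = [lab for f, lab in zip(feature_values, labels) if f == v]
        total += len(subset) / n * gini(subset)
    return total


outlook = ["Sunny", "Sunny", "Overcast", "Rainy", "Rainy", "Rainy", "Overcast",
           "Sunny", "Sunny", "Rainy", "Sunny", "Overcast", "Overcast", "Rainy"]
play = ["No", "No", "Yes", "Yes", "Yes", "No", "Yes",
        "No", "Yes", "Yes", "Yes", "Yes", "Yes", "No"]

print(round(gini(play), 3))                 # 0.459 -> Gini(X)
print(round(gini_index(outlook, play), 3))  # 0.343 -> Gini(X, Outlook)
```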



Splitting Measure: Gini Index

▶ The minimum value Gini(X_j) = 0 is attained when all samples of X_j belong to one particular class, which indicates the most interesting information.
▶ The Gini index is maximum when the training samples are equally distributed among all classes, implying the least interesting information.
▶ The maximum value of Gini(X_j) is (1 − 1/c), attained when each class makes up an equal proportion of X_j.
▶ The feature that produces the smallest Gini index following equation 3, equivalently the largest Gini gain as defined below, is chosen to split a node

$$Gini\,Gain(X, f) = Gini(X) - Gini(X, f) \qquad (5)$$

The Gini index tends to isolate the largest class from the rest of the data.



Splitting Measure: Gain Ratio

• It has been mentioned that the information gain measure tends to prefer features with a large number of values.
• The gain ratio is an extension of the information gain measure that reduces this bias towards features with many branches.
• The gain ratio is defined as follows for a set of training samples X and a particular feature f:

$$GainRatio(X, f) = \frac{IG(X, f)}{E(X, f)} \qquad (6)$$

where E(X, f) denotes the entropy of the distribution of the samples in X across the values of f (the split information).
• Note that the gain ratio normalizes the information gain; it is not defined when E(X, f) = 0.
• The feature with the maximum gain ratio is selected for splitting (see the sketch below).
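A sketch of equation 6, interpreting E(X, f) as the split information, i.e., the entropy of the distribution of samples across the values of f (the function names are illustrative):

```python
from collections import Counter
from math import log2


def entropy(labels):
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def gain_ratio(feature_values, labels):
    """GainRatio(X, f) = IG(X, f) / SplitInfo(X, f)."""
    n = len(labels)
    remainder, split_info = 0.0, 0.0
    for v in set(feature_values):
        subset = [lab for f, lab in zip(feature_values, labels) if f == v]
        p = len(subset) / n
        remainder += p * entropy(subset)   # weighted entropy of the partition
        split_info += -p * log2(p)         # entropy of the feature's own value distribution
    info_gain = entropy(labels) - remainder
    return info_gain / split_info if split_info > 0 else float("nan")  # undefined when SplitInfo = 0


outlook = ["Sunny", "Sunny", "Overcast", "Rainy", "Rainy", "Rainy", "Overcast",
           "Sunny", "Sunny", "Rainy", "Sunny", "Overcast", "Overcast", "Rainy"]
play = ["No", "No", "Yes", "Yes", "Yes", "No", "Yes",
        "No", "Yes", "Yes", "Yes", "Yes", "Yes", "No"]

print(round(gain_ratio(outlook, play), 3))   # about 0.156
```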



References
▶ Lior Rokach and Oded Z. Maimon. Data Mining with Decision Trees: Theory and Applications, vol. 69. World Scientific, 2007.

▶ Tom Mitchell. Machine Learning. McGraw Hill, ISBN 0070428077, 1997.

▶ L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone. Classification and Regression Trees. Metrika, 33:128–128, 1986.

▶ J. Ross Quinlan. C4.5: Programs for Machine Learning. Elsevier, 2014.

▶ K. Kowsari, K. Jafari Meimandi, M. Heidarysafa, S. Mendu, L. Barnes, and D. Brown. Text Classification Algorithms: A Survey. Information, 10(4):150, 2019. https://github.com/kk7nc/Text_Classification
