MLS+1+-+Decision+Trees+and+Random+Forests
MLS+1+-+Decision+Trees+and+Random+Forests
[email protected]
R8L0PN473F
R8L0PN473Fe. Overfitting
[email protected]
2. Random Forest
a. Bias-Variance Tradeoff
b. Pruning
c. Bagging
d. Random Forest
[email protected]
R8L0PN473F
When to use Classification Tree Classification Tree Classification Tree Regression Tree
[email protected]
R8L0PN473F
0 to 0.5 0 to 1 0 to 1
Range 0 = most pure 0 = most pure 0 = less gain -
0.5 = most impure 1 = most impure 1 = more gain
[email protected]
R8L0PN473F
● It is one of the most accurate algorithms because of the number of decision trees taking part in the process.
Disadvantages:
● The Random Forest algorithm is very slow compared to others because it calculates predictions for each
decision tree for every sample and then votes on them to select the best one, which is time-consuming.
● It is difficult to interpret the model in comparison to decision tree where you can easily make the decision
following the path of the tree.
This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited. 10
Case Study
[email protected]
R8L0PN473F
where |T| is the number of terminal nodes and R(T) is the total misclassification rate of the terminal nodes.