0% found this document useful (0 votes)
17 views9 pages

Random Forest Algorithm Overview

Random Forest is a supervised machine learning algorithm that combines multiple decision trees to improve predictive accuracy for classification and regression tasks. It enhances model accuracy and prevents overfitting by averaging predictions from various trees, with a greater number of trees leading to better results. The algorithm is widely used in sectors like banking for loan risk assessment and in medicine for identifying disease trends.

Uploaded by

networkessencial
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
17 views9 pages

Random Forest Algorithm Overview

Random Forest is a supervised machine learning algorithm that combines multiple decision trees to improve predictive accuracy for classification and regression tasks. It enhances model accuracy and prevents overfitting by averaging predictions from various trees, with a greater number of trees leading to better results. The algorithm is widely used in sectors like banking for loan risk assessment and in medicine for identifying disease trends.

Uploaded by

networkessencial
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

RANDOM FOREST

" Machine learning algorithm


" Belongs to the supervised learning technique.
" It caused for both Classification and Regression problems in ML.
Training Traning Traimin
Data
1 Data

Training
Set Decision Decision Decision
Tree Tree Tree
1 2

Voting
Test Set (averaging)

ACADEMY Prediction
"Random Forest is a classifier that contains a number of decision trees
on various subsets of the given dataset and takes the average to improve
the predictive accuracy of that dataset".

" Instead of relying on one decision tree, the random forest takes the
prediction from each tree and based on the majority votes of
predictions, and it predicts the final output.

"Greater number of trees in forest leads to higher accuracy and


prevents the problem of over fitting.
Assumptions for Random Forest,

(Ö) Since the random forest combines multiple trees to predict the
class of the dataset, it is possible that some decision trees may
predict the correct output, while others may not.

(iü) But together, all the trees predict the correct output.
ILVER
CADEMY Twoassumptions for a better Random Forest classifier:
There should be some actual values in the feature variable
of the dataset so that the classifier can predict accurate
results rather than a guessed result.
" Predictions from each tree must have very low correlations.

Advantages of Random Forest

[Link] is capable of handling large datasets with high dimensionality.


2. Itenhances the accuracy of the model and prevents the overfitting
issue.

Disadvantages of Random Forest

TSILVER
ACADEMY
Advantages of Random Forest

[Link] is capable of handling large datasets with high dimensionality.


overfitting
2. It enhances the accuracy of the model and prevents the
issue.

Disadvantages of Random Forest

Although random forest can be used for both classification and


regression tasks, it is not more suitable for Regression
tasks.
SILVER
ACADEMY
Working of Algorithm
Random Forest works in two-phase,
First is to create the random forest by combining N decisiontree.
Secondis to make predictions for each tree created in the first phase.

Working Process
Step 1:Select random Kdata points from the training set.
Step 2: Build the decision trees associated with the selected data points
Step 3: Choose the number N for decision trees that want to build.
Step 4: Repeat Step 1 &2.
SILVER
ACADEMY
Step 5: For newdata points,find the predictions of each decision tree,
and assignthe newdata points to the category that wins the majority
votes.

Example
Suppose there is a dataset that contains multiple fruit images is given
to Random Forest classifier.

Dataset is divided into subsets and given to each decision tree.

During training phase,


Each decision tree produces a prediction result and when a new data
point occurs.

SILVER
ACADEMY
Then based on majority of results
Random Forest classifier predicts final decision.

Iree-1
Itee-2

Class-A Class A Class-a

Majority -Vong

Final Class

Applications of Random Forest


Banking: Banking sector mostly uses this algorithm for the identification
of loan risk.
SILVER
Medicine: With the help of this algorithm, disease trends and risksof the
ACADEMY

disease can be identified.


Appllcations of Random Forest
Banking: Banking sector mostly uses this algorithm for the identification
of loan risk.

Medicine: With the help of this algorithm,disease trends and risks of the
disease can be identifled.

You might also like