Session 2 Evaluation Boosting Bagging Contemporary Business Anaytics
Session 2 Evaluation Boosting Bagging Contemporary Business Anaytics
Evaluation of Algorithms
Bagging and Boosting
What is Machine Learning?
“A computer program is said to learn from experience E with respect to
some class of tasks T and performance measure P, if its performance at
tasks in T, as measured by P, improves with experience E.”
Tasks
Analysing medical reports for identifying patients who might be
suffering from cancer
Predicting possibility of fraud by looking at profiles of loan applicants
Identifying objects in a picture
Examples of tasks are:
Classification, Regression, Machine translation, anomaly detection…
…
2
Experience
Consists of datasets from the past which may or may not be labelled
3
Performance Measure
Predicted Values
Negative Positive
Model Prediction
Evaluation – Test Data
Predicted Actual
0.673536 0
Test Data
0.407434 0
0.582201 1
0.601923 0
… …
0.99839 0
0.950054 1
Evaluation – Test Data
Threshold
Predicted Predicted Actual
0.673536 1 0
0.407434 if (predicted_value ≤ 0.5) 0 0
0.582201 predicted = 0 1 1
0.601923 else 1 0
… predicted = 1 … …
0.99839 1 0
0.950054 1 1
The Four Values
True Positive: Model predicts Positive and it is True
Predicted Values
Negative Positive
If Recall is more important than precision, then we use the F 2 score – Recall has twice
the weightage of Precision