Welcome back, ML peeps!
Hugo Bowne-Anderson
Data Scientist at DataCamp
Big ideas from last session?
How good is your model?
SUPERVISED LEARNING WITH SCIKIT-LEARN
George Boorman
Core Curriculum Manager, DataCamp
Classification metrics
• Measuring model performance with accuracy:
o Fraction of correctly classified samples
o Not always a useful metric
Class imbalance
[Figure: a classifier that predicts "legitimate" for every transaction scores 99% accuracy on data that is 99% legitimate and 1% fraudulent, but never catches any fraud]
Class imbalance
• Classification for predicting fraudulent bank transactions
o 99% of transactions are legitimate; 1% are fraudulent
• Could build a classifier that predicts NONE of the transactions are fraudulent
o 99% accurate!
o But terrible at actually predicting fraudulent transactions
o Fails at its original purpose (see the sketch below)
• Class imbalance: uneven frequency of classes
• Need a different way to assess performance
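As an illustration, here is a minimal sketch on synthetic data (not the course dataset, so the counts are made up): a classifier that always predicts the majority class reaches 99% accuracy while detecting no fraud at all.

import numpy as np
from sklearn.dummy import DummyClassifier
from sklearn.metrics import accuracy_score

# 990 legitimate (0) and 10 fraudulent (1) transactions
y = np.array([0] * 990 + [1] * 10)
X = np.zeros((1000, 1))  # features are irrelevant for this illustration

clf = DummyClassifier(strategy="most_frequent")  # always predicts the majority class
clf.fit(X, y)
y_pred = clf.predict(X)

print(accuracy_score(y, y_pred))  # 0.99, yet no fraudulent transaction is ever flagged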
Confusion matrix for assessing classification performance
Confusion matrix

                      Predicted: Legitimate   Predicted: Fraudulent
Actual: Legitimate    True Negative           False Positive
Actual: Fraudulent    False Negative          True Positive

NOTE:
Fraud – positive event
Legitimate – negative event
Assessing classification performance
Accuracy = (TP + TN) / (TP + TN + FP + FN)
Precision
• Precision = TP / (TP + FP)
• High precision = lower false positive rate
• High precision: not many legitimate transactions are predicted to be fraudulent
Recall
• Recall = TP / (TP + FN)
• High recall = lower false negative rate
• High recall: predicted most fraudulent transactions correctly
F1 score
• F1 score = 2 * (precision * recall) / (precision + recall)
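As a reference, a minimal sketch computing these three metrics by hand, using the true positive, false positive, and false negative counts from the confusion matrix shown later in this lesson:

# TP, FP, FN taken from the KNN confusion matrix [[1106, 11], [183, 34]]
tp, fp, fn = 34, 11, 183

precision = tp / (tp + fp)                          # 0.76
recall = tp / (tp + fn)                             # 0.16
f1 = 2 * precision * recall / (precision + recall)  # 0.26

print(precision, recall, f1)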
Confusion matrix in scikit-learn
from sklearn.metrics import classification_report, confusion_matrix
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import train_test_split

knn = KNeighborsClassifier(n_neighbors=7)
X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                    test_size=0.4,
                                                    random_state=42)
knn.fit(X_train, y_train)
y_pred = knn.predict(X_test)
Confusion matrix in scikit-learn
print(confusion_matrix(y_test, y_pred))
[[1106 11]
[ 183 34]]
Classification report in scikit-learn
print(classification_report(y_test, y_pred))
              precision    recall  f1-score   support

           0       0.86      0.99      0.92      1117
           1       0.76      0.16      0.26       217

    accuracy                           0.85      1334
   macro avg       0.81      0.57      0.59      1334
weighted avg       0.84      0.85      0.81      1334

Note: 0 – negative class, 1 – positive class
Questions?
Logistic regression and the ROC curve
SUPERVISED LEARNING WITH SCIKIT-LEARN
George Boorman
Core Curriculum Manager, DataCamp
Logistic regression for binary classification
• Logistic regression is used for classification problems
• Logistic regression outputs probabilities (0 to 1)
• If the probability, p>=0.5:
o The data is labeled 1
• If the probability, p<0.5:
o The data is labeled 0
Linear decision boundary
Logistic regression in scikit-learn
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

logreg = LogisticRegression()
X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                    test_size=0.3,
                                                    random_state=42)
logreg.fit(X_train, y_train)
y_pred = logreg.predict(X_test)
Predicting probabilities
y_pred_probs = logreg.predict_proba(X_test)[:, 1]
print(y_pred_probs[0])
[0.08961376]
Probability thresholds
• By default, logistic regression threshold = 0.5
• What happens if we vary the threshold?
• Not specific to logistic regression
o KNN classifiers also have thresholds
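As a sketch (assuming y_pred_probs from the predict_proba slide above), varying the threshold just means comparing the predicted probabilities against a different cut-off:

# Label as fraudulent only when the predicted probability exceeds a custom threshold
threshold = 0.3  # example value; 0.5 is the default behaviour of .predict()
y_pred_custom = (y_pred_probs >= threshold).astype(int)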
The ROC curve (Receiver Operating Characteristic)
• TPR = TP / (TP + FN)   (Sensitivity / Recall)
• FPR = FP / (FP + TN)   (Fall-out)
• In most contexts, we want a low false positive rate (FPR) and a high true positive rate (TPR).
• These goals are in tension: increasing the TPR usually comes at the cost of increasing the FPR, and the trade-off is governed by the classification threshold.
• Choosing and managing this threshold is therefore up to you.
• This trade-off is precisely what the ROC curve illustrates: as you move along the curve, increasing sensitivity (TPR) usually comes at the expense of a higher FPR.
The ROC curve
• A threshold of 0 predicts every observation as the positive class: TPR = 1 and FPR = 1.
• A threshold of 1 predicts every observation as the negative class: TPR = 0 and FPR = 0.
If we vary the threshold, we get a series of different false positive and true positive rates.
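A minimal sketch (reusing y_test and y_pred_probs from the earlier slides) of how different thresholds produce different (FPR, TPR) pairs:

from sklearn.metrics import confusion_matrix

for threshold in [0.0, 0.25, 0.5, 0.75, 1.0]:
    y_pred_t = (y_pred_probs >= threshold).astype(int)
    # labels=[0, 1] keeps the matrix 2x2 even if one class is never predicted
    tn, fp, fn, tp = confusion_matrix(y_test, y_pred_t, labels=[0, 1]).ravel()
    tpr = tp / (tp + fn)
    fpr = fp / (fp + tn)
    print(threshold, fpr, tpr)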
Plotting the ROC curve
import matplotlib.pyplot as plt
from sklearn.metrics import roc_curve

fpr, tpr, thresholds = roc_curve(y_test, y_pred_probs)
plt.plot([0, 1], [0, 1], 'k--')
plt.plot(fpr, tpr)
plt.xlabel('False Positive Rate')
plt.ylabel('True Positive Rate')
plt.title('Logistic Regression ROC Curve')
plt.show()
Plotting the ROC curve
[ROC curve plot: TPR vs. FPR, with the dashed diagonal representing a chance-level model]
ROC AUC
• ROC AUC: the area under the ROC curve
• 1.0 = perfect model; 0.5 = no better than random guessing
ROC AUC in scikit-learn
from sklearn.metrics import roc_auc_score
print(roc_auc_score(y_test, y_pred_probs))

0.6700964152663693
The precision-recall (PR) curve
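A minimal sketch (reusing y_test and y_pred_probs) of plotting a precision-recall curve with scikit-learn's precision_recall_curve:

import matplotlib.pyplot as plt
from sklearn.metrics import precision_recall_curve

# precision and recall at every possible threshold
precision, recall, thresholds = precision_recall_curve(y_test, y_pred_probs)
plt.plot(recall, precision)
plt.xlabel('Recall')
plt.ylabel('Precision')
plt.title('Precision-Recall Curve')
plt.show()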
Other methods
Note: there are various methods other than the ROC and PR curves for evaluating the performance of classification models (see the sketch after this list):
▪ AUC-PR (Area Under the Precision-Recall Curve)
▪ Cost-Benefit Analysis
▪ Balanced Accuracy
▪ Log-Loss
▪ Gini Coefficient
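A minimal sketch (reusing y_test, y_pred, and y_pred_probs) of a few of these alternatives as implemented in scikit-learn; note that average_precision_score is a common summary of the area under the PR curve, and one common definition of the Gini coefficient for classifiers is 2 * AUC - 1:

from sklearn.metrics import (average_precision_score, balanced_accuracy_score,
                             log_loss, roc_auc_score)

print(average_precision_score(y_test, y_pred_probs))  # average precision (AUC-PR summary)
print(balanced_accuracy_score(y_test, y_pred))        # balanced accuracy
print(log_loss(y_test, y_pred_probs))                 # log-loss
print(2 * roc_auc_score(y_test, y_pred_probs) - 1)    # Gini coefficient = 2 * AUC - 1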
Questions?
Hyperparameter tuning
SUPERVISED LEARNING WITH SCIKIT-LEARN
George Boorman
Core Curriculum Manager
Hyperparameter tuning
• Ridge/lasso regression: Choosing alpha
• KNN: Choosing n_neighbors
• Hyperparameters: Parameters we specify before training the model
o Like alpha and n_neighbors
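For example (a minimal sketch), hyperparameters are passed to the estimator's constructor before fitting:

from sklearn.linear_model import Ridge
from sklearn.neighbors import KNeighborsClassifier

ridge = Ridge(alpha=0.1)                    # alpha chosen before fitting
knn = KNeighborsClassifier(n_neighbors=15)  # n_neighbors chosen before fitting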
Choosing the correct hyperparameters
1. Try lots of different hyperparameter values
2. Fit all of them separately
3. See how well they perform
4. Choose the best-performing values
• This is called hyperparameter tuning
• It is essential to use cross-validation to avoid overfitting to the test set
• We can still split the data and perform cross-validation on the training set
• We withhold the test set for final evaluation
Choosing the correct hyperparameters
Note: we can also perform cross-validation using separate training and validation sets (see the sketch below)
https://medium.com/@rahulchavan4894/understanding-train-test-and-validation-dataset-split-in-simple-quick-terms-5a8630fe58c8
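A minimal sketch (with hypothetical variable names) of carving a validation set out of the data in addition to the held-out test set:

from sklearn.model_selection import train_test_split

# First split off the final test set, then split the remainder into train/validation
X_temp, X_test, y_temp, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
X_train, X_val, y_train, y_val = train_test_split(X_temp, y_temp, test_size=0.25,
                                                  random_state=42)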
Grid search cross-validation
GridSearchCV in scikit-learn
import numpy as np
from sklearn.model_selection import GridSearchCV, KFold
from sklearn.linear_model import Ridge

kf = KFold(n_splits=5, shuffle=True, random_state=42)
param_grid = {"alpha": np.linspace(0.0001, 1, 10),
              "solver": ["sag", "lsqr"]}
ridge = Ridge()
ridge_cv = GridSearchCV(ridge, param_grid, cv=kf)
ridge_cv.fit(X_train, y_train)
print(ridge_cv.best_params_, ridge_cv.best_score_)

{'alpha': 0.0001, 'solver': 'sag'}
0.7529912278705785
GridSearchCV in scikit-learn
How to Know the Parameters You Can Tune for Every Model?
• The get_params() method: most scikit-learn estimators have a get_params() method that returns a dictionary of all the parameters for the estimator, along with their current values.
• You can use this method not only to see what parameters are available but also to check their current settings.
• For example:
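(A minimal sketch; the exact dictionary keys depend on your scikit-learn version.)

from sklearn.linear_model import Ridge

ridge = Ridge()
# Returns a dict of every tunable parameter and its current value,
# e.g. 'alpha', 'fit_intercept', 'solver', ...
print(ridge.get_params())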
Limitations and an alternative approach
• 3-fold cross-validation, 1 hyperparameter, 10 total values = 30 fits
• 10-fold cross-validation, 3 hyperparameters, 30 total values = 900 fits
RandomizedSearchCV
from sklearn.model_selection import RandomizedSearchCV

kf = KFold(n_splits=5, shuffle=True, random_state=42)
param_grid = {'alpha': np.linspace(0.0001, 1, 10),
              "solver": ['sag', 'lsqr']}
ridge = Ridge()
ridge_cv = RandomizedSearchCV(ridge, param_grid, cv=kf, n_iter=2)
ridge_cv.fit(X_train, y_train)
print(ridge_cv.best_params_, ridge_cv.best_score_)

{'solver': 'sag', 'alpha': 0.0001}
0.7529912278705785
Evaluating on the test set
test_score = ridge_cv.score(X_test, y_test)
print(test_score)
0.7564731534089224
Questions?
Let's practice!
SUPERVISED LEARNING WITH SCIKIT-LEARN