This Study Resource Was: Answer

This solution works through a loan acceptance dataset using the naive rule and the k-nearest neighbors (k-NN) algorithm. Using k=1, a customer with the specified characteristics is classified as belonging to the "loan not accepted" group. The best k is chosen from a log of training and validation errors, and the validation confusion matrix for that k is reported. When the data is repartitioned into training, validation, and test sets (50%:30%:20%), the training set shows no errors at k=1, while the validation (8.33%) and test (10%) error rates are close to each other.

Uploaded by

Saurabh Sharma


Question 7.

a. Using the naive rule on the training set, classify a customer with the following characteristics: Age=40, Experience=10, Income=84, Family=2, CCAvg=2, Education_2=1, Education_3=0, Mortgage=0, Securities Account=0, CD Account=0, Online=1 and Credit card=1.

Compute the confusion matrix for the validation set based on the naive rule.

Perform a k-nearest neighbor classification with all predictors except zipcode using k = 1. Remember to transform categorical predictors with more than 2 categories into dummy variables first. Specify the "success" class as 1 (loan acceptance), and use the default cutoff value of 0.5. How would the above customer be classified?

Answer:

The "Education" variable is converted to dummy variables.

The success class is set to 1 and the default cutoff value of 0.5 is used.

The customer characteristics are:

Age=40, Experience=10, Income=84, Family=2, CCAvg=2, Education_2=1, Education_3=0, Mortgage=0, Securities Account=0, CD Account=0, Online=1 and Credit card=1.

Predicted Class   Prob. for 1 (success)   Actual #Nearest Neighbors   Age   Experience   Income   Family   CCAvg
0                 0                       1                           40    10           84       2        2

Education_2   Education_3   Mortgage   Securities Account   CD Account   Online   Credit Card
1             0             0          0                    0            1        1

This study source was downloaded by 100000761058697 from CourseHero.com on 09-16-2021 23:48:58 GMT -05:00

https://www.coursehero.com/file/12444953/Chapter-7-Problems-VSINGI4452/
From the output we conclude that the above customer is classified as belonging to the loan not accepted group.
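
The naive rule asked for in part a ignores all predictors and assigns every record to the majority class of the training set. The sketch below illustrates that idea. The training labels are hypothetical (the source does not list the training class counts); the only thing assumed about them is that class 0 is the majority. The validation class counts (7 acceptors, 73 non-acceptors) are taken from the validation error report in part c.

```python
from collections import Counter

def naive_rule(train_labels):
    """Return the majority class of the training labels."""
    return Counter(train_labels).most_common(1)[0][0]

def confusion_matrix(actual, predicted, classes=(1, 0)):
    """Rows are actual classes, columns are predicted classes."""
    counts = Counter(zip(actual, predicted))
    return [[counts[(a, p)] for p in classes] for a in classes]

# Hypothetical training labels: only "class 0 is the majority" is assumed.
train_labels = [0] * 9 + [1]
majority = naive_rule(train_labels)

# Validation class counts as reported in part c: 7 of class 1, 73 of class 0.
valid_actual = [1] * 7 + [0] * 73
valid_pred = [majority] * len(valid_actual)

matrix = confusion_matrix(valid_actual, valid_pred)
n_errors = sum(a != p for a, p in zip(valid_actual, valid_pred))
error_pct = round(100 * n_errors / len(valid_actual), 2)
print(majority, matrix, error_pct)
```

Under that assumption the customer is assigned to class 0 (loan not accepted), and every acceptor in the validation set is misclassified.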

b. What is a choice of k that balances between overfitting and ignoring the predictor information?

Answer:

Validation error log for different k:

Value of k   % Error Training   % Error Validation
1            0                  10                   <--- Best k
2            5.83               13.75
3            6.67               11.25
4            7.5                18.75
5            6.67               12.5
6            7.5                16.25
7            10                 12.5
8            9.17               12.5
9            8.33               11.25


The value of k that balances between overfitting and ignoring the predictor information is k = 1: it has the lowest validation error in the log (10%) and is the row marked as best k.
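
One common way to pick k is to take the value with the lowest validation error. The sketch below applies that rule to the error log above; the numbers are copied from the table, while the function name `best_k` and the tie-break toward the smaller k are assumptions made for illustration.

```python
# Error log from the table above: k -> (% training error, % validation error)
error_log = {
    1: (0.00, 10.00),
    2: (5.83, 13.75),
    3: (6.67, 11.25),
    4: (7.50, 18.75),
    5: (6.67, 12.50),
    6: (7.50, 16.25),
    7: (10.00, 12.50),
    8: (9.17, 12.50),
    9: (8.33, 11.25),
}

def best_k(log):
    """Return the k with the lowest validation error (ties go to the smaller k)."""
    return min(log, key=lambda k: (log[k][1], k))

k = best_k(error_log)
print(k, error_log[k])
```

Note that the training error is not used for the choice: at k = 1 it is 0% by construction and says little about performance on new records.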

c. Show the classification matrix for the validation data that results from using the best k.

Answer:


Validation Data scoring - Summary Report (for k=1)

Cut off Prob.Val. for Success (Updatable): 0.5

Classification Confusion Matrix

               Predicted Class
Actual Class   1     0
1              3     4
0              4     69

Error Report

Class     # Cases   # Errors   % Error
1         7         4          57.14
0         73        4          5.48
Overall   80        8          10
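
The error report follows directly from the confusion matrix: for each class, the errors are the off-diagonal counts in that row. As a check, the sketch below recomputes the report from the validation matrix above; the function name `error_report` is illustrative only.

```python
def error_report(matrix, classes=(1, 0)):
    """Build per-class and overall error rows from a confusion matrix.

    matrix[i][j] is the number of records whose actual class is classes[i]
    and whose predicted class is classes[j].
    """
    rows = []
    total_cases = total_errors = 0
    for i, cls in enumerate(classes):
        cases = sum(matrix[i])
        errors = cases - matrix[i][i]
        rows.append((str(cls), cases, errors, round(100 * errors / cases, 2)))
        total_cases += cases
        total_errors += errors
    rows.append(("Overall", total_cases, total_errors,
                 round(100 * total_errors / total_cases, 2)))
    return rows

# Validation confusion matrix for k=1, rows = actual (1, 0), columns = predicted (1, 0)
validation_matrix = [[3, 4], [4, 69]]
for row in error_report(validation_matrix):
    print(row)
```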

d. Classify the customer using the best k.

Answer:

Predicted Class   Prob. for 1 (success)   Actual #Nearest Neighbors   Age   Experience   Income   Family   CCAvg
0                 0                       1                           40    10           84       2        2

Education_1   Education_2   Education_3   Mortgage   Securities Account   CD Account   Online   CreditCard
0             1             0             0          0                    0            1        1


From the output we conclude that the above customer is classified as belonging to the loan not accepted group.

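
The classification step itself can be sketched in a few lines. This is a minimal illustration, not the tool output shown above: the four training records are hypothetical, only a subset of the predictors is used (Income, Family, CCAvg and the Education dummies), the predictors are not normalized, and a probability equal to the cutoff is assumed to count as class 1.

```python
import math

def education_dummies(level):
    """Turn a 3-level Education code (1, 2, 3) into [Education_2, Education_3]."""
    return [1 if level == 2 else 0, 1 if level == 3 else 0]

def knn_predict(train_X, train_y, x, k=1, cutoff=0.5):
    """Return (predicted class, share of class-1 records among the k nearest)."""
    order = sorted(range(len(train_X)), key=lambda i: math.dist(train_X[i], x))
    nearest = order[:k]
    prob_success = sum(train_y[i] for i in nearest) / k
    return (1 if prob_success >= cutoff else 0), prob_success

# Hypothetical training records: ([Income, Family, CCAvg], Education level, loan accepted)
raw_train = [
    ([80, 2, 1.8], 2, 0),
    ([180, 3, 6.5], 3, 1),
    ([45, 4, 1.0], 1, 0),
    ([160, 1, 5.0], 2, 1),
]
train_X = [features + education_dummies(edu) for features, edu, _ in raw_train]
train_y = [label for _, _, label in raw_train]

# The customer from the question: Income=84, Family=2, CCAvg=2, Education_2=1
customer = [84, 2, 2.0] + education_dummies(2)
print(knn_predict(train_X, train_y, customer, k=1))
```

With k = 1 the estimated probability can only be 0 or 1, so the 0.5 cutoff simply passes on the class of the single nearest neighbor. In practice the predictors would usually be put on a common scale first, since Income would otherwise dominate the distance.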

e. Repartition the data, this time into training, validation, and test sets (50%: 30%: 20%). Apply the k-NN method with the k chosen above. Compare the classification matrix of the test set with that of the training and validation sets. Comment on the differences and their reason.

Answer:

Training Data scoring - Summary Report (for k=1)

Cut off Prob.Val. for Success (Updatable): 0.5

Classification Confusion Matrix

               Predicted Class
Actual Class   1     0
1              11    0
0              0     89

Error Report

Class     # Cases   # Errors   % Error
1         11        0          0.00
0         89        0          0.00
Overall   100       0          0.00

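
A training error of exactly zero is expected for k = 1: when a training record is scored against the training set, its nearest neighbor is the record itself, at distance 0. The sketch below demonstrates this on four hypothetical records (two made-up predictors each); it assumes no two records share identical predictor values with different labels.

```python
import math

def one_nn_label(train_X, train_y, x):
    """Return the label of the single training record closest to x."""
    i = min(range(len(train_X)), key=lambda j: math.dist(train_X[j], x))
    return train_y[i]

# Hypothetical records: [Age, Income] with a loan-acceptance label
train_X = [[40, 84], [35, 120], [52, 60], [29, 150]]
train_y = [0, 1, 0, 1]

# Score every training record against the training set itself
training_errors = sum(
    one_nn_label(train_X, train_y, x) != y for x, y in zip(train_X, train_y)
)
print(training_errors)
```

This is why the training matrix is not informative about k = 1 and the validation and test matrices are the ones to compare.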
Validation Data scoring - Summary Report (for k=1)

Cut off Prob.Val. for Success (Updatable): 0.5

Classification Confusion Matrix

               Predicted Class
Actual Class   1     0
1              1     3
0              2     54

Error Report

Class     # Cases   # Errors   % Error
1         4         3          75.00
0         56        2          3.57
Overall   60        5          8.33

Test Data scoring - Summary Report (for k=1)

Cut off Prob.Val. for Success (Updatable): 0.5

Classification Confusion Matrix

               Predicted Class
Actual Class   1     0
1              2     2
0              2     34

Error Report

Class     # Cases   # Errors   % Error
1         4         2          50.00
0         36        2          5.56
Overall   40        4          10.00

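
The three reports cover 100, 60 and 40 records, which is a 50%:30%:20% split of 200 records. A random partition of that kind could be sketched as follows; the function name `partition` and the seed are illustrative assumptions, and the actual records assigned to each set in the reports above are not known.

```python
import random

def partition(n_records, fractions=(0.5, 0.3, 0.2), seed=1):
    """Shuffle record indices and cut them into training, validation and test sets."""
    indices = list(range(n_records))
    random.Random(seed).shuffle(indices)
    n_train = round(fractions[0] * n_records)
    n_valid = round(fractions[1] * n_records)
    train = indices[:n_train]
    valid = indices[n_train:n_train + n_valid]
    test = indices[n_train + n_valid:]  # whatever remains goes to the test set
    return train, valid, test

train_idx, valid_idx, test_idx = partition(200)
print(len(train_idx), len(valid_idx), len(test_idx))
```
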
We choose the best k as the one that minimizes the misclassification rate in the validation set; our best k is 1. The training set shows 0% error because, with k = 1, each training record is its own nearest neighbor, so the training matrix says little about performance on new data. The classification error is 8.33% in the validation set and 10% in the test set, which are nearly the same.
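
To make the comparison concrete, the sketch below recomputes the overall error and the class 1 error from the three confusion matrices reported above. The matrices are copied from the reports; the helper names are illustrative.

```python
def overall_error(matrix):
    """Percent of records off the main diagonal of a square confusion matrix."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return round(100 * (total - correct) / total, 2)

def class1_error(matrix):
    """Percent of actual class-1 records (first row) predicted as class 0."""
    return round(100 * matrix[0][1] / sum(matrix[0]), 2)

# Rows = actual (1, 0), columns = predicted (1, 0), as in the reports for k=1
reports = {
    "training": [[11, 0], [0, 89]],
    "validation": [[1, 3], [2, 54]],
    "test": [[2, 2], [2, 34]],
}
summary = {name: (overall_error(m), class1_error(m)) for name, m in reports.items()}
for name, (overall, class1) in summary.items():
    print(name, overall, class1)
```

The overall rates for validation and test are close, but each of those sets contains only 4 acceptors, so the class 1 error rates (75% and 50%) rest on very few records and should be read with caution.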
