07 - KNN & Naive Bayes

The document explains two machine learning models: K-Nearest Neighbors (K-NN) and Naive Bayes. K-NN classifies new data points based on the majority label of their nearest neighbors, while Naive Bayes uses Bayes' Theorem to predict class probabilities under an assumption of feature independence. Both models are simple yet effective for classification tasks, with K-NN relying on distance metrics and Naive Bayes on prior probabilities and likelihoods.


K-Nearest Neighbor Model
&
Naive Bayes Model
Nearest Neighbor
➢ One of the simplest of all machine learning classifiers
➢ Label a new point the same as its closest known point

(Figure: a new point whose nearest known point is red, so we label it red.)
Nearest Neighbor
How Does the K-Nearest Neighbors Algorithm Work?

The K-NN algorithm compares a new data entry to the values in a given data
set (with different classes or categories). Based on its closeness or similarity
to a chosen number (K) of neighbors, the algorithm assigns the new data entry
to a class or category in the data set (training data).
Nearest Neighbor
Step #1 - Assign a value to K.

Step #2 - Calculate the distance between the new data entry and all other
existing data entries (you'll learn how to do this shortly). Arrange them in
ascending order.

Step #3 - Find the K nearest neighbors to the new entry based on the
calculated distances.

Step #4 - Assign the new data entry to the majority class among those
nearest neighbors.
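To make these four steps concrete, here is a minimal from-scratch sketch in Python. The toy points and the `euclidean` / `knn_classify` names are illustrative assumptions, not code from the slides.

from collections import Counter
import math

def euclidean(a, b):
    # straight-line distance between two feature vectors
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))

def knn_classify(new_point, training_data, k=3):
    # Step 1: K is supplied as the parameter k.
    # Step 2: distance from the new entry to every existing entry, ascending order.
    distances = sorted((euclidean(new_point, x), label) for x, label in training_data)
    # Step 3: keep the K nearest neighbors.
    nearest = distances[:k]
    # Step 4: assign the majority class among those neighbors.
    return Counter(label for _, label in nearest).most_common(1)[0][0]

training_data = [((1, 2), "red"), ((2, 1), "red"), ((6, 7), "blue"), ((7, 8), "blue")]
print(knn_classify((2, 2), training_data, k=3))   # -> "red"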
Nearest Neighbor

K-nearest neighbor: predict based on the K closest training samples.

(Figures: scatter plots of two classes, x and o, over features x1 and x2, showing how a new point + is classified by its 1 nearest neighbor, 3 nearest neighbors, and 5 nearest neighbors.)
k – Nearest Neighbor
◼ Generalizes 1-NN to smooth away noise in the labels
◼ A new point is now assigned the most frequent label of its k nearest neighbors

Label it red when k = 3.
Label it blue when k = 7.
Distance Metrics
◼ Different metrics can change the decision surface

Dist(a,b) = (a1 – b1)^2 + (a2 – b2)^2        Dist(a,b) = (a1 – b1)^2 + (3a2 – 3b2)^2

◼ Standard Euclidean distance metric:
◼ Two-dimensional: Dist(a,b) = sqrt((a1 – b1)^2 + (a2 – b2)^2)
◼ Multivariate: Dist(a,b) = sqrt(∑i (ai – bi)^2)
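A tiny Python sketch of the point above: re-weighting one coordinate (as in the 3·a2 metric shown) can change which stored point is nearest. The sample points and the `dist` helper are made up for illustration.

import math

def dist(a, b, w2=1.0):
    # weighted Euclidean distance; w2 rescales the second coordinate
    return math.sqrt((a[0] - b[0]) ** 2 + (w2 * (a[1] - b[1])) ** 2)

query, p, q = (0.0, 0.0), (2.0, 0.0), (0.0, 1.5)
print(dist(query, p), dist(query, q))              # 2.0 vs 1.5 -> q is nearer
print(dist(query, p, w2=3), dist(query, q, w2=3))  # 2.0 vs 4.5 -> p is nearer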
Three Aspects of an
Instance-Based Learner:

❑ A distance metric
❑ How many nearby neighbors to look at?
❑ How to fit with the local points?

1-NN’s Three Aspects of an
Instance-Based Learner:
❑ A distance metric
  ❑ Euclidean
❑ How many nearby neighbors to look at?
  ❑ One
❑ How to fit with the local points?
  ❑ Just predict the same output as the nearest neighbor.
Example on classification
◼ First, we calculate the distance for each point.

◼ Then, we assume k = 4.

◼ Then, to predict E, we find the least distance among the 4 points, which gives A.

◼ So the prediction for E will be Bad.
Another example on classification
(Figure: Euclidean distance formula and the 5 nearest neighbors of the new entry.)

As shown above, the majority class within the 5 nearest neighbors to the new
entry is Red. Therefore, we'll classify the new entry as Red.
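A minimal scikit-learn version of this 5-NN majority vote might look like the following sketch; the coordinates and labels are invented for illustration and are not the data from the figure.

from sklearn.neighbors import KNeighborsClassifier

# four Red points near the origin and four Blue points far away (hypothetical data)
X = [[1, 1], [2, 1], [1, 2], [3, 3], [8, 8], [9, 8], [8, 9], [9, 9]]
y = ["Red", "Red", "Red", "Red", "Blue", "Blue", "Blue", "Blue"]

clf = KNeighborsClassifier(n_neighbors=5, metric="euclidean")
clf.fit(X, y)
print(clf.predict([[2, 2]]))   # majority of the 5 nearest neighbors -> "Red"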
Example on Regression
◼ First, we calculate the distance for each point.

◼ Then, we assume k = 3.

◼ Then, to predict the value at 48, take the average HPI of the 3 points with
the least distance.
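As a rough sketch of this averaging step in Python (the age/HPI pairs below are placeholders, not the values from the slide's table):

def knn_regress(query, data, k=3):
    # data: list of (feature_value, target_value) pairs
    nearest = sorted(data, key=lambda pair: abs(pair[0] - query))[:k]
    return sum(target for _, target in nearest) / k

data = [(25, 135), (35, 256), (45, 231), (52, 255), (60, 395)]  # (age, HPI), hypothetical
print(knn_regress(48, data, k=3))   # average HPI of the 3 nearest ages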
Bayesian Algorithm
Naive Bayes Model
What is a Bayesian Algorithm?
➢ A Bayesian algorithm is a classification technique based on Bayes’
Theorem with an assumption of independence among the predictors.

In simple terms, a Naïve Bayes classifier assumes that the presence of a
particular feature in a class is unrelated to the presence of any other
feature.
What is Naive Bayes Classifier?
➢ Naive Bayes is a statistical classification technique based on
Bayes’ Theorem.
➢ It is one of the simplest supervised learning algorithms.
➢ The Naive Bayes classifier is a fast, accurate and reliable algorithm.
➢ Naïve Bayes can be used for both classification and regression.
➢ Naive Bayes classifiers achieve high accuracy and speed on large
datasets.
➢ The Naive Bayes classifier assumes that the effect of a particular
feature in a class is independent of the other features.
Naive Bayes Classifier (Bayes theorem)

P(class | features) = P(features | class) × P(class) / P(features)

i.e., posterior = (likelihood × prior) / evidence
How Does the Naive Bayes Classifier Work? (Multinomial Naive Bayes classifier)

The Naive Bayes classifier calculates the probability of an event in the
following steps:
• Step 1: Calculate the prior probability for the given class labels.
• Step 2: Find the likelihood probability of each attribute for each class.
• Step 3: Put these values into the Bayes formula and calculate the posterior
probability.
• Step 4: See which class has the higher posterior probability; the input is
assigned to that class.
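As a sketch of how these four steps look with a library, here is a tiny example using scikit-learn's MultinomialNB; the word-count matrix and labels are made up for illustration and are not from the slides.

import numpy as np
from sklearn.naive_bayes import MultinomialNB

# hypothetical word counts per document, for three tracked words
X = np.array([[3, 0, 1],
              [2, 1, 0],
              [0, 4, 2],
              [1, 3, 3]])
y = np.array(["normal", "normal", "spam", "spam"])

clf = MultinomialNB()                  # Steps 1-2: priors and likelihoods are learned in fit()
clf.fit(X, y)
print(clf.predict_proba([[0, 2, 2]]))  # Step 3: posterior probabilities for a new document
print(clf.predict([[0, 2, 2]]))        # Step 4: the class with the higher posterior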
Example

▶ In the learning phase, calculate the probabilities based on the training
table.

▶ In the test phase, try all cases, then decide in favour of the class with the
higher probability.
Naive Bayes Classifier

Example: Filtering Spam Emails

Initially, we have 15 normal emails and 5 spam emails.

Category     | Mails | Probability | Word  | Occurrences | Probability
-------------|-------|-------------|-------|-------------|------------
Normal mails | 15    |             | See   |             |
             |       |             | Meet  |             |
             |       |             | Free  |             |
             |       |             | Cash  |             |
             |       |             | Total |             |
Spam mails   | 5     |             | See   |             |
             |       |             | Meet  |             |
             |       |             | Free  |             |
             |       |             | Cash  |             |
             |       |             | Total |             |
Total        | 20    |             |       |             |
Naive Bayes Classifier
Category     | Mails | Probability | Word  | Occurrences | Probability
-------------|-------|-------------|-------|-------------|------------
Normal mails | 15    | 0.75        | See   |             |
             |       |             | Meet  |             |
             |       |             | Free  |             |
             |       |             | Cash  |             |
             |       |             | Total |             |
Spam mails   | 5     | 0.25        | See   |             |
             |       |             | Meet  |             |
             |       |             | Free  |             |
             |       |             | Cash  |             |
             |       |             | Total |             |
Total        | 20    |             |       |             |

Then compute the proportion of normal and spam emails in the sample.
For example, the proportion of normal emails in the sample is 15/20 = 0.75.
Naive Bayes Classifier

Category     | Mails | Probability | Word  | Occurrences | Probability
-------------|-------|-------------|-------|-------------|------------
Normal mails | 15    | 0.75        | See   | 9           |
             |       |             | Meet  | 8           |
             |       |             | Free  | 2           |
             |       |             | Cash  | 6           |
             |       |             | Total | 25          |
Spam mails   | 5     | 0.25        | See   | 4           |
             |       |             | Meet  | 1           |
             |       |             | Free  | 10          |
             |       |             | Cash  | 5           |
             |       |             | Total | 20          |
Total        | 20    |             |       |             |

In reality, we could analyse every single word in the emails, but for
simplicity we analyse only these 4 words in this activity.

Count the occurrences of the different words and record them.


Naive Bayes Classifier

Category     | Mails | Probability | Word  | Occurrences | Probability
-------------|-------|-------------|-------|-------------|------------
Normal mails | 15    | 0.75        | See   | 9           | 0.36
             |       |             | Meet  | 8           | 0.32
             |       |             | Free  | 2           | 0.08
             |       |             | Cash  | 6           | 0.24
             |       |             | Total | 25          | 1
Spam mails   | 5     | 0.25        | See   | 4           | 0.20
             |       |             | Meet  | 1           | 0.05
             |       |             | Free  | 10          | 0.50
             |       |             | Cash  | 5           | 0.25
             |       |             | Total | 20          | 1
Total        | 20    |             |       |             |

Afterwards, compute the probability of occurrence for each word.
We now have the probability distribution tables for normal and spam emails.
Naive Bayes Classifier

(Same table as above.)

Given an email containing the words “Free” and “Cash”, the probability of this
email being normal = (0.75)(0.08)(0.24) = 0.0144.
Naive Bayes Classifier
(Same table as above.)

Given an email containing the words “Free” and “Cash”, the probability of this
email being spam = (0.25)(0.5)(0.25) = 0.03125 > 0.0144.


Naive Bayes Classifier

(Same table as above.)

Conclusion:

Given an email containing the words “Free” and “Cash”, it is most probably
a spam email.
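The arithmetic above can be reproduced in a few lines of Python. This is only an illustrative sketch of the slides' calculation (no smoothing, only the four tracked words); the names below are my own.

# word-occurrence counts taken from the table above
normal_counts = {"See": 9, "Meet": 8, "Free": 2, "Cash": 6}   # total 25
spam_counts   = {"See": 4, "Meet": 1, "Free": 10, "Cash": 5}  # total 20
p_normal, p_spam = 15 / 20, 5 / 20                            # priors 0.75 and 0.25

def score(words, prior, counts):
    # prior times the product of per-word likelihoods
    total = sum(counts.values())
    s = prior
    for w in words:
        s *= counts[w] / total
    return s

email = ["Free", "Cash"]
print(score(email, p_normal, normal_counts))  # 0.75 * 0.08 * 0.24 = 0.0144
print(score(email, p_spam, spam_counts))      # 0.25 * 0.5  * 0.25 = 0.03125 -> spam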
Naive Bayes Classifier

Advantages
1. This algorithm works very fast and can easily predict the class of a test dataset.
2. You can use it to solve multi-class prediction problems, for which it works quite well.
3. The Naive Bayes classifier performs better than other models with less training data if the
assumption of feature independence holds.

Disadvantages
1. If your test data set has a categorical variable with a category that wasn’t present in the
training data set, the Naive Bayes model will assign it zero probability and won’t be able to
make a prediction for it (this “zero-frequency problem” is commonly mitigated with Laplace
smoothing).
2. It assumes that all the features are independent. While that might sound great in theory, in
real life you’ll hardly find a set of truly independent features.
Example
We will generate synthetic data using scikit-learn, then train and evaluate the
Gaussian Naive Bayes algorithm.

Generating the Dataset

Scikit-learn provides a machine learning ecosystem in which you can generate
datasets and evaluate various machine learning algorithms.

In our case, we create a dataset with six features, three classes, and
800 samples using the `make_classification` function.
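The code for this example appears on the slides only as screenshots. A minimal sketch of the setup described above (800 samples, six features, three classes, Gaussian Naive Bayes) might look like the following; the n_informative, test_size, and random_state values are assumptions, not taken from the slides.

from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import accuracy_score

# 800 samples, six features, three classes (n_informative chosen so 3 classes fit)
X, y = make_classification(n_samples=800, n_features=6, n_informative=4,
                           n_classes=3, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25,
                                                    random_state=42)

model = GaussianNB()
model.fit(X_train, y_train)
y_pred = model.predict(X_test)
print("Accuracy:", accuracy_score(y_test, y_pred))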
Decision tree example with 3 output classes
▶ When building a decision tree with three classes using entropy, the base of
the logarithm does not depend on the number of classes; you can use any
base (e.g., base 2, base 3, or base 10). However, base 2 (log2) is commonly
used for entropy calculations in decision trees.
▶ Using log3 instead of log2 would scale the entropy values, but it does not
affect the relative comparisons of information gain, so the decision tree
structure remains the same (see the short sketch below).
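A quick numeric check of this point, using an assumed 3-class label distribution (4/3/3); the entropy helper below is illustrative only.

from math import log
from collections import Counter

def entropy(labels, base=2.0):
    # H = -sum over classes of p * log_base(p)
    counts = Counter(labels)
    n = len(labels)
    return -sum((c / n) * log(c / n, base) for c in counts.values())

labels = ["A"] * 4 + ["B"] * 3 + ["C"] * 3   # hypothetical node with three classes
print(entropy(labels, base=2))   # ≈ 1.571 (bits)
print(entropy(labels, base=3))   # ≈ 0.991 (trits) = the value above / log2(3)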
Decision tree example with 3 output classes
Example: Decision Tree with Three Classes Using Entropy (Base 2 and Base 3)
Let’s consider a dataset:
(The dataset and the entropy / information-gain calculations in base 2 and
base 3 are shown as figures on the remaining slides.)
