Why is KNN a poor choice for a spam filter?
What is KNN?
KNN stands for K-Nearest Neighbors. It is a very simple algorithm used to solve classification problems: a new point is assigned the majority class among its K nearest training points, where K is the number of neighbors considered.
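As a minimal sketch of the idea (using scikit-learn; the toy data below is invented purely for illustration):

# Minimal KNN sketch (toy data invented for illustration).
from sklearn.neighbors import KNeighborsClassifier

# Two features per point; labels 0 and 1.
X_train = [[0.0, 0.1], [0.2, 0.0], [0.9, 1.0], [1.0, 0.8]]
y_train = [0, 0, 1, 1]

# K = 3 means each prediction is a majority vote over the 3 closest points.
knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(X_train, y_train)
print(knn.predict([[0.1, 0.2]]))  # near the class-0 points, so it prints [0]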
Why KNN is a poor choice as a spam filter
KNN classifiers work well whenever there is a really meaningful distance metric. In the spam case, a KNN classifier will label as spam the messages that are “close” to known spam, where “close” is defined by your distance metric (which, for e-mail text, will likely be poor).
Therefore, a KNN classifier will only filter spam that is really similar to spam you already know about; it won’t generalize properly.
Also, you have to train on non-spam examples too, and KNN suffers from the same problem there: it will only confidently say something is non-spam if it is written very similarly to a non-spam e-mail it was trained on.
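To make the “distance between e-mails” problem concrete, here is a hedged sketch of a KNN spam filter over bag-of-words counts (the tiny corpus and the scikit-learn pipeline are illustrative assumptions, not a production design):

from sklearn.feature_extraction.text import CountVectorizer
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline

# Tiny invented corpus: 1 = spam, 0 = non-spam.
emails = [
    "win free money now",           # spam
    "claim your free prize today",  # spam
    "meeting moved to friday",      # non-spam
    "please review the report",     # non-spam
]
labels = [1, 1, 0, 0]

# Bag-of-words counts + Euclidean KNN: "close" just means sharing
# many of the same word counts, nothing deeper.
clf = make_pipeline(CountVectorizer(), KNeighborsClassifier(n_neighbors=3))
clf.fit(emails, labels)

# Spam phrased with unseen words lands "far" from the training spam
# under this metric, so it can easily be labeled non-spam.
print(clf.predict(["urgent unclaimed reward waiting"]))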
Limitations of KNN as a spam filter
1. Doesn’t work well with a large dataset:
Since KNN is a distance-based algorithm, the cost of calculating the distance between a new point and every existing point is very high, which degrades the performance of the algorithm (see the brute-force sketch after this list).
2. Doesn’t work well with a high number of dimensions:
For the same reason as above: in a higher-dimensional space, calculating distances becomes even more expensive, and the distances themselves become less meaningful, which hurts performance.
[Figure: distribution of the e-mails data set]
3. Sensitive to outliers and missing values:
KNN is sensitive to outliers and missing values, so we first need to impute the missing values and get rid of the outliers before applying the KNN algorithm.
4. Needs feature scaling:
We need to do feature scaling (standardization or normalization) before applying the KNN algorithm to any dataset; if we don’t, features on larger scales dominate the distance and KNN may generate wrong predictions (see the second sketch after this list).
5. Predictions vary with the value of ‘k’, so accuracy may be poor:
For example, with respect to the given data, if k = 3 the query point belongs to class B, but if k = 7 it belongs to class A. So, for different values of k, the prediction may vary (the second sketch after this list shows such a flip).
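On points 1 and 2, a rough brute-force sketch (pure NumPy; the sizes are invented for illustration) shows why the cost grows with both the number of training points n and the number of dimensions d:

import numpy as np

rng = np.random.default_rng(0)
n, d = 100_000, 100  # invented sizes; real mail corpora are far larger
X_train = rng.normal(size=(n, d))
query = rng.normal(size=d)

# Brute-force KNN computes a distance to every stored point for every
# query: O(n * d) work, with the whole training set kept in memory.
dists = np.linalg.norm(X_train - query, axis=1)
k = 5
nearest = np.argpartition(dists, k)[:k]  # indices of the 5 closest points
print(nearest, dists[nearest])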
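On points 4 and 5, a small scikit-learn sketch (toy data invented) shows both effects at once: without scaling, the large-valued feature dominates the distance, and the predicted class can flip as k changes:

import numpy as np
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import StandardScaler

# Toy data: feature 0 is small-valued, feature 1 is large-valued.
X = np.array([[25, 40_000], [27, 42_000], [30, 95_000],
              [45, 44_000], [50, 46_000], [52, 98_000], [48, 99_000]])
y = np.array(["B", "B", "A", "B", "B", "A", "A"])
query = np.array([[26, 90_000]])

scaler = StandardScaler().fit(X)
for k in (3, 7):
    # Without scaling, feature 1 dominates the Euclidean distance.
    raw = KNeighborsClassifier(n_neighbors=k).fit(X, y).predict(query)
    # With standardization, both features contribute comparably.
    scaled = (KNeighborsClassifier(n_neighbors=k)
              .fit(scaler.transform(X), y)
              .predict(scaler.transform(query)))
    print(f"k={k}: raw -> {raw[0]}, scaled -> {scaled[0]}")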
Failure cases of KNN
Case 1
In this case, the data is grouped in clusters, but the query point lies far away from the actual grouping. We can still use the K nearest neighbors to assign a class, but it doesn’t make much sense, because the query point (the yellow point in the original figure) is really far from every data point, and hence we can’t be very confident in the prediction.
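One hedged way to detect this situation is to check the distance to the nearest neighbors and refuse to predict when the query is far from all training data (the threshold below is an invented illustration, not a standard rule):

import numpy as np
from sklearn.neighbors import NearestNeighbors

rng = np.random.default_rng(1)
# Two tight, invented clusters of training data.
X_train = np.vstack([rng.normal(0, 0.3, size=(50, 2)),
                     rng.normal(5, 0.3, size=(50, 2))])
nn = NearestNeighbors(n_neighbors=5).fit(X_train)

def confident(query, threshold=1.0):
    # Return False when the query is far from all of its neighbors.
    dists, _ = nn.kneighbors([query])
    return float(dists.mean()) < threshold  # invented threshold

print(confident([0.1, 0.2]))   # inside a cluster -> True
print(confident([2.5, 10.0]))  # far from both clusters -> False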
Case 2
In this case, the data is randomly spread, so no useful information can be obtained from it. Given a query point (again, the yellow point), the KNN algorithm will still find the k nearest neighbors, but since the data points are jumbled, the accuracy is questionable.