WELCOME
to the presentation of “Group-2”

PRESENTATION TOPIC : Clustering
COURSE CODE : STAT-309
COURSE TITLE : Data Mining
COURSE TEACHER : AHSANUL HAQUE, LECTURER, DEPARTMENT OF STATISTICS, UNIVERSITY OF BARISHAL
Our Team
SMINA AHMED SHORMY
FARZANA TABASU
SHARMISTHA BISWAS SWARNA
ESHITA AKTER MIM
MONIRUL ISLAM RONI
Clustering
Grouping of a particular set of objects based on their characteristics, aggregating them according to their similarities.

Huge Dataset → Clustering (examine the data to form clusters) → groups with common attributes: all data in the same group have similar attributes.
Entities in the real world are very complex:
• Products sold on an e-commerce site
• Users of a social media platform
• Readers of an online newspaper
Defining Characteristics Using Numbers

Products:
• Ratings
• Review sentiment (1 = positive, 0 = negative)
• Category (1 = electronics, 2 = fashion, ...)
• Dimensions (size, height, weight)
• Color

Users:
• Rate/score posts, comments, likes, shares
• Score every post by topic (music lovers, sports lovers)
• Activity score (100 = most active, 0 = not active at all)
• Number of connections
• % profile complete
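The numeric encodings above turn each entity into a feature vector, so similarity can be measured mathematically. A minimal sketch (the specific products and feature values below are illustrative assumptions, not data from the slides):

```python
import numpy as np

# Illustrative feature vectors for three products:
# [rating, review sentiment (1 = positive, 0 = negative),
#  category (1 = electronics, 2 = fashion), weight in kg]
products = np.array([
    [4.5, 1, 1, 0.3],   # a phone (hypothetical)
    [3.8, 0, 2, 0.2],   # a shirt (hypothetical)
    [4.7, 1, 1, 0.4],   # a tablet (hypothetical)
])

# Once entities are numbers, similarity between them can be
# computed, e.g. as the Euclidean distance between vectors.
dist = np.linalg.norm(products[0] - products[2])
print(round(dist, 3))
```

Entities with smaller distances between their feature vectors are more similar, which is exactly what a clustering algorithm exploits.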
REAL LIFE EXAMPLES

Sports Science: professional basketball teams may collect the following information about players:
• Points per game
• Assists per game
• Steals per game

Health Insurance: an actuary may collect the following information about households:
• Total number of doctor visits per year
• Total household size
• Total number of chronic conditions per household
• Average age of household members

Email Marketing: a business may collect the following information about consumers:
• Percentage of emails opened
• Number of clicks per email
• Time spent viewing email
Basic Features:
• The number of clusters is not known.
• There may not be any a priori knowledge concerning the clusters.
• Cluster results are dynamic.
HIERARCHICAL AGGLOMERATIVE CLUSTERING (HAC)

HAC can be represented using three techniques:
• Single: nearest distance or single linkage.
• Complete: farthest distance or complete linkage.
• Average: average distance or average linkage.
Linkage Method | Merits | Demerits
Single | Can separate non-elliptical shapes as long as the gap between two clusters is not small. | Cannot separate the clusters properly if there is noise between clusters.
Complete | Does well in separating clusters if there is noise between clusters. | Biased towards equal-variance clusters.
Average | Balances compactness and connectivity. | Computationally intensive.
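The three linkage techniques above differ only in how inter-cluster distance is defined, so in practice they are a single parameter. A minimal sketch using SciPy (the toy data points are my own; for well-separated groups like these, all three methods recover the same clusters):

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

# Two well-separated groups of 2-D points (toy data).
X = np.array([[0.0, 0.0], [0.1, 0.2], [0.2, 0.1],
              [5.0, 5.0], [5.1, 5.2], [5.2, 5.1]])

# The three HAC techniques differ only in the "method" argument:
# 'single' (nearest), 'complete' (farthest), 'average' (average).
for method in ("single", "complete", "average"):
    Z = linkage(X, method=method)                    # merge history
    labels = fcluster(Z, t=2, criterion="maxclust")  # cut into 2 clusters
    print(method, labels)
```

On noisier or oddly shaped data the methods disagree, which is where the merits and demerits in the table above come into play.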
K-Means Clustering
• K-Means clustering is an unsupervised iterative clustering technique.
• It partitions the given data set into k predefined distinct clusters.
• A cluster is defined as a collection of data points exhibiting certain similarities.

IT PARTITIONS THE DATA SET SUCH THAT:
• Each data point belongs to the cluster with the nearest mean.
• Data points belonging to one cluster have a high degree of similarity.
• Data points belonging to different clusters have a high degree of dissimilarity.
K-Means Clustering Algorithm

The K-Means clustering algorithm involves the following steps:

Step-01: Choose the number of clusters K.

Step-02: Randomly select any K data points as cluster centers. Select cluster centers in such a way that they are as far as possible from each other.

Step-03: Calculate the distance between each data point and each cluster center. The distance may be calculated either by using a given distance function or by using the Euclidean distance formula.

Step-04: Assign each data point to some cluster. A data point is assigned to the cluster whose center is nearest to that data point.

Step-05: Re-compute the centers of the newly formed clusters. The center of a cluster is computed by taking the mean of all the data points contained in that cluster.

Step-06: Keep repeating the procedure from Step-03 to Step-05 until any of the following stopping criteria is met:
• Centers of newly formed clusters do not change
• Data points remain present in the same cluster
• Maximum number of iterations is reached
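The six steps above can be sketched as a short NumPy implementation (a toy sketch for illustration, not a production routine; the function and variable names, the toy data, and the random initialization are my own assumptions):

```python
import numpy as np

def k_means(X, k, max_iters=100, seed=0):
    rng = np.random.default_rng(seed)
    # Step-01/02: choose K and randomly pick K data points as centers.
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(max_iters):                  # Step-06: repeat...
        # Step-03: Euclidean distance from every point to every center.
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        # Step-04: assign each point to the nearest center.
        labels = dists.argmin(axis=1)
        # Step-05: re-compute each center as the mean of its points.
        new_centers = np.array([X[labels == j].mean(axis=0)
                                for j in range(k)])
        if np.allclose(new_centers, centers):   # centers stopped changing
            break
        centers = new_centers
    return labels, centers

# Two obvious groups of 1-D points.
X = np.array([[1.0], [1.2], [0.9], [8.0], [8.3], [7.9]])
labels, centers = k_means(X, k=2)
print(labels)
```

Note the sketch uses plain random initialization (Step-02's first sentence); choosing centers far apart, as the second sentence of Step-02 suggests, is the idea behind the k-means++ refinement.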
Advantages of k-means:
• Relatively simple to implement.
• Scales to large data sets.
• Guarantees convergence.
• Can warm-start the positions of centroids.
• Easily adapts to new examples.
• Generalizes to clusters of different shapes and sizes, such as elliptical clusters.

Disadvantages:
• It requires specifying the number of clusters (k) in advance.
• It cannot handle noisy data and outliers.
• It is not suitable for identifying clusters with non-convex shapes.