
Clustering in ML

 Clustering, or cluster analysis, is a machine learning technique that groups an unlabeled dataset.
 It is an unsupervised learning method, since it works with unlabeled data.
 It can be defined as "a way of grouping the data points into different clusters, each consisting of similar data points. Objects with possible similarities remain in a group that has few or no similarities with any other group."
 It works by finding similar patterns in the unlabeled dataset, such as shape, size, color, or behavior, and divides the data according to the presence or absence of those patterns.
 After applying a clustering technique, each cluster or group is assigned a cluster ID, which an ML system can use to simplify the processing of large and complex datasets.
Hard clustering - each data point belongs to exactly one cluster.
Soft clustering - each data point can belong to more than one cluster.
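The distinction can be seen side by side in a short sketch (assuming scikit-learn is available; any library with both hard and soft clusterers would do): K-Means assigns each point a single label, while a Gaussian mixture returns a membership probability for every cluster.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.mixture import GaussianMixture

# Two well-separated blobs of points.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 0.5, (50, 2)),
               rng.normal(5, 0.5, (50, 2))])

# Hard clustering: each point gets exactly one cluster label.
hard_labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)

# Soft clustering: each point gets a probability for every cluster.
soft_probs = GaussianMixture(n_components=2, random_state=0).fit(X).predict_proba(X)

print(hard_labels[:3])          # one label per point
print(soft_probs[:3].round(3))  # per-point probabilities that sum to 1
```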
• Partitioning Clustering : It is a type of clustering that divides the data into non-hierarchical groups. It is also known as the
centroid-based method. The most common example of partitioning clustering is the K-Means Clustering algorithm.

• The density-based clustering method connects highly dense areas into clusters, so arbitrarily shaped clusters can form as long as the dense regions are connected. The algorithm identifies dense regions in the dataset and joins areas of high density into clusters; points in sparse regions are treated as noise. DBSCAN is the most common example.
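A minimal sketch with scikit-learn's DBSCAN (an assumed library choice): dense neighborhoods become clusters, and an isolated point is labeled -1 (noise).

```python
import numpy as np
from sklearn.cluster import DBSCAN

rng = np.random.default_rng(1)
# Two dense blobs plus one isolated outlier.
X = np.vstack([rng.normal(0, 0.3, (40, 2)),
               rng.normal(6, 0.3, (40, 2)),
               [[3.0, 3.0]]])

# eps: neighborhood radius; min_samples: density threshold for a core point.
labels = DBSCAN(eps=1.0, min_samples=5).fit_predict(X)
print(sorted(set(labels)))  # noise points are labeled -1
```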

• In the distribution model-based clustering method, the data is divided based on the probability that a data point belongs to a particular distribution. The grouping is done by assuming some distribution, most commonly the Gaussian distribution.

• An example of this type is the Expectation-Maximization clustering algorithm, which uses Gaussian Mixture Models (GMM).
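As a sketch (assuming scikit-learn), `GaussianMixture` fits a GMM by Expectation-Maximization and recovers the component distributions from the data:

```python
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(2)
# Data drawn from two 1-D Gaussians with different means.
X = np.vstack([rng.normal(-3, 1.0, (100, 1)),
               rng.normal(3, 1.0, (100, 1))])

# EM alternates between assigning soft responsibilities (E-step)
# and re-estimating component parameters (M-step).
gmm = GaussianMixture(n_components=2, random_state=0).fit(X)
print(sorted(gmm.means_.ravel().round(1)))  # recovered means near -3 and 3
probs = gmm.predict_proba(X)                # soft cluster memberships
```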

• In hierarchical clustering, the dataset is divided into clusters to create a tree-like structure, also called a dendrogram. Any number of clusters can be selected by cutting the tree at the appropriate level. The most common example of this method is the agglomerative hierarchical algorithm.
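A sketch using SciPy's hierarchical-clustering routines (an assumption about the toolchain): `linkage` builds the agglomerative merge tree, and `fcluster` "cuts" it at a chosen number of clusters.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

rng = np.random.default_rng(3)
X = np.vstack([rng.normal(0, 0.4, (30, 2)),
               rng.normal(5, 0.4, (30, 2))])

# Bottom-up (agglomerative) merge tree; 'ward' minimizes within-cluster variance.
Z = linkage(X, method="ward")

# Cut the dendrogram so that at most 2 flat clusters remain.
labels = fcluster(Z, t=2, criterion="maxclust")
print(sorted(set(labels)))  # e.g. [1, 2]
```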

• Fuzzy clustering is a soft method in which a data object may belong to more than one group or cluster. Each data point has a set of membership coefficients that reflect its degree of membership in each cluster. The fuzzy c-means algorithm is an example of this type of clustering; it is sometimes also known as the fuzzy k-means algorithm.
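Fuzzy c-means is not in scikit-learn, so the update rules can be sketched directly in NumPy (a minimal illustration; the function name and defaults are my own, with the usual fuzzifier m = 2):

```python
import numpy as np

def fuzzy_c_means(X, c=2, m=2.0, n_iter=100, seed=0):
    """Minimal fuzzy c-means sketch: returns (centroids, membership matrix U)."""
    rng = np.random.default_rng(seed)
    U = rng.random((len(X), c))
    U /= U.sum(axis=1, keepdims=True)         # memberships sum to 1 per point
    for _ in range(n_iter):
        W = U ** m                             # fuzzified memberships
        centroids = (W.T @ X) / W.sum(axis=0)[:, None]
        d = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        d = np.fmax(d, 1e-10)                  # avoid division by zero
        # u_ij = 1 / sum_k (d_ij / d_ik)^(2/(m-1))
        ratio = (d[:, :, None] / d[:, None, :]) ** (2 / (m - 1))
        U = 1.0 / ratio.sum(axis=2)
    return centroids, U

rng = np.random.default_rng(4)
X = np.vstack([rng.normal(0, 0.5, (40, 2)),
               rng.normal(5, 0.5, (40, 2))])
centroids, U = fuzzy_c_means(X, c=2)
print(U[0].round(3))  # memberships of the first point across both clusters
```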
K-Means clustering

• K-Means is the clustering technique that tries to minimize the distance between the points in a cluster and their centroid.
• K-Means clustering is an unsupervised, iterative clustering technique.
• It partitions the given dataset into k predefined, distinct clusters.
• A cluster is defined as a collection of data points exhibiting certain similarities.
• K-Means is a centroid-based (distance-based) algorithm: distances are calculated to assign each point to a cluster, and each cluster is associated with a centroid.
• The main objective of the K-Means algorithm is to minimize the sum of distances between the points and their respective cluster centroids.
K-Means ….
It partitions the data set such that:
• Each data point belongs to the cluster with the nearest mean.
• Data points belonging to the same cluster have a high degree of similarity.
• Data points belonging to different clusters have a high degree of dissimilarity.
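These properties can be seen in a short scikit-learn sketch (an assumed library choice); `inertia_` is the objective above, the sum of squared distances from each point to its cluster centroid.

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(5)
X = np.vstack([rng.normal(0, 0.5, (50, 2)),
               rng.normal(5, 0.5, (50, 2))])

km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(km.cluster_centers_.round(1))  # one centroid per cluster
print(km.inertia_)                   # sum of squared distances to centroids
```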
Stopping Criteria for K-Means Clustering

• There are essentially three stopping criteria that can be adopted to stop the K-means algorithm:
1. Centroids of newly formed clusters do not change

2. Points remain in the same cluster

3. Maximum number of iterations is reached
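All three criteria appear in a from-scratch NumPy sketch (illustrative only, not a production implementation): the loop ends when assignments stop changing, when centroids stop moving, or when the iteration budget runs out.

```python
import numpy as np

def kmeans(X, k=2, max_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    centroids = X[rng.choice(len(X), k, replace=False)]  # random initial centroids
    labels = np.full(len(X), -1)
    for _ in range(max_iter):                            # criterion 3: max iterations
        d = np.linalg.norm(X[:, None] - centroids[None], axis=2)
        new_labels = d.argmin(axis=1)                    # assign to nearest centroid
        new_centroids = np.array([X[new_labels == j].mean(axis=0)
                                  if np.any(new_labels == j) else centroids[j]
                                  for j in range(k)])
        if np.array_equal(new_labels, labels):           # criterion 2: points stay put
            break
        if np.allclose(new_centroids, centroids):        # criterion 1: centroids fixed
            break
        labels, centroids = new_labels, new_centroids
    return centroids, labels

rng = np.random.default_rng(6)
X = np.vstack([rng.normal(0, 0.5, (50, 2)),
               rng.normal(5, 0.5, (50, 2))])
centroids, labels = kmeans(X)
```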
