clustering
clustering
• The density-based clustering method connects the highly-dense areas into clusters, and the arbitrarily shaped distributions
are formed as long as the dense region can be connected. This algorithm does it by identifying different clusters in the
dataset and connects the areas of high densities into clusters.
• The distribution model-based clustering method, the data is divided based on the probability of how a dataset belongs to
a particular distribution. The grouping is done by assuming some distributions commonly Gaussian Distribution.
• The example of this type is the Expectation-Maximization Clustering algorithm that uses Gaussian Mixture Models
(GMM).
• Hierarchical clustering, the dataset is divided into clusters to create a tree-like structure, which is also called a
dendrogram. The observations or any number of clusters can be selected by cutting the tree at the correct level. The most
common example of this method is the Agglomerative Hierarchical algorithm.
• Fuzzy clustering is a type of soft method in which a data object may belong to more than one group or cluster. Each
dataset has a set of membership coefficients, which depend on the degree of membership to be in a cluster. Fuzzy C-means
algorithm is the example of this type of clustering; it is sometimes also known as the Fuzzy k-means algorithm.
K-Means clustering
• There are essentially three stopping criteria that can be adopted to stop the K-means algorithm:
1. Centroids of newly formed clusters do not change