AI20001
Essentials of Machine
Learning
Unsupervised Learning
Koustav Rudra
25/08/2025
Example: Face Clustering
Example: Search result clustering
Example: Google News
A data set with clear cluster structure
What are some of the issues for clustering?
What clustering algorithms can we use?
Issues for clustering
• Representation for clustering
• How do we represent an example? (features, etc.)
• Similarity/distance between examples
• Flat clustering or hierarchical?
• How many clusters do we want to create?
• Fixed a priori
• Data driven
Major Types of Clustering Algorithms
• Flat algorithms
• Usually start with a random partitioning
• Refine it iteratively
• Example: K-means clustering
• Produces a disjoint set of groups
• Hierarchical algorithms
• Bottom-up, agglomerative
• Top-down, divisive
Hard vs. soft clustering
• Hard clustering:
• Each example belongs to exactly one cluster
• Soft clustering:
• An example can belong to more than one cluster (probabilistic)
• A pair of sneakers may belong to two groups
• sports apparel and shoes
Flat Clustering: K-means Clustering Algorithm
K-means
• K-means is simple, efficient and widely used
• Main steps of k-means:
STEP 1: Start with k initial cluster centers (that is why it is called k-means)
STEP 2: Assign/cluster each member to the closest center
STEP 3: Recalculate centers as the mean of the points in a cluster, then go back to STEP 2
Steps 2 and 3 are iterative. When to stop? Some possibilities:
1. after a fixed # of iterations
2. when the centers do not change
K-means: an example
K-means: Initialize centers randomly
K-means: assign points to nearest center
K-means: readjust centers
K-means: assign points to nearest center
K-means: readjust centers
K-means: assign points to nearest center
K-means: readjust centers
K-means: assign points to nearest center
No changes: Done
K-means
Iterate:
• Assign/cluster each example to closest center
• Recalculate centers as the mean of the points in a cluster
How do we do this?
K-means
Iterate:
• Assign/cluster each example to closest center
• Iterate over each point:
• - get distance to each cluster center
• - assign to closest center (hard cluster)
• Recalculate centers as the mean of the points in a cluster
K-means
Iterate:
• Assign/cluster each example to closest center
• Iterate over each point:
• get distance to each cluster center
• assign to closest center (hard cluster)
• Recalculate centers as the mean of the points in a cluster
What distance measure should we use?
Distance measure
Euclidean:
$d(x, y) = \sqrt{\sum_{i=1}^{n} (x_i - y_i)^2}$
x and y are n-dimensional vectors:
$x = (x_1, x_2, \ldots, x_n)$
$y = (y_1, y_2, \ldots, y_n)$
K-means
Iterate:
• Assign/cluster each example to closest center
• Recalculate centers as the mean of the points in a cluster
Where are the cluster centers?
K-means
Iterate:
• Assign/cluster each example to closest center
• Recalculate centers as the mean of the points in a cluster
How do we calculate these?
K-means
Iterate:
• Assign/cluster each example to closest center
• Recalculate centers as the mean of the points in a cluster
Mean of the points in the cluster:
$\mu(C) = \frac{1}{|C|} \sum_{x \in C} x$
where the sum and the division by |C| are applied componentwise to the n-dimensional vectors.
K-means loss function
K-means tries to minimize what is called the “k-means” loss function:
$\text{loss} = \sum_{i=1}^{n} d(x_i, \mu_k)^2$, where $\mu_k$ is the cluster center for $x_i$
That is, the sum of the squared distances from each point to the
associated cluster center.
K-means algorithm
Randomly initialize K cluster centroids $\mu_1, \mu_2, \ldots, \mu_K$
Repeat {
  Cluster assignment: for i = 1 to n
    $c^{(i)}$ := index (from 1 to K) of the cluster centroid closest to $x^{(i)}$
  Move centroid: for k = 1 to K
    $\mu_k$ := average (mean) of the points assigned to cluster k
}
Running time of Kmeans
• In every iteration
• Assign data points to closest cluster center
• O(kn) time (k = # clusters, n = # data points)
• Change the cluster center to the average of its assigned points
• O(n)
K-means: Big Issues
• Value of k (# clusters)
• Convergence
• A fixed number of iterations
• partitions unchanged
• Cluster centers do not change
• Initial (seed) cluster centers
K-MEANS: VALUE OF K
Elbow method
• Run k-means with different values of k
• Plot the k-means loss vs. k
• Choose k where the curve shows an elbow shape
[Plot: k-means loss vs. k; the elbow of the curve marks a good value of k]
Silhouette Measure
• Key Idea for good clustering
• Small within cluster variance
• Large between cluster variance
Silhouette Measure
$C_i$ is the cluster containing data point i; d(i, j) = distance between points i and j
Within-cluster measure: $a(i)$ = average distance from i to the other points in $C_i$
Between-cluster measure: $b(i)$ = smallest average distance from i to the points of any other cluster
Silhouette measure: $s(i) = \frac{b(i) - a(i)}{\max(a(i), b(i))}$, and s(i) = 0 if $|C_i| = 1$
Silhouette Plot
• Property of the Silhouette measure:
• A high score is better
Average Silhouette score: $\frac{1}{n} \sum_{i=1}^{n} S(i)$
The desirable value of k is the one that maximizes this average score.
Initial (seed) cluster centers
K-means: Initialize centers randomly
What would happen here?
K-means: Initialize centers randomly
Bad clustering
Choice of Initial Centroids
• Results can vary drastically based on random seed selection
• Slow convergence
• converges to sub-optimal clustering
• Common heuristics
• Random centers in the space
• Randomly pick from feature vectors
• Points least similar to any existing center (furthest centers heuristic)
• Try out multiple starting points
• Initialize with the results of another clustering method
Furthest centers heuristic
• $\mu_1$ = pick a random point (the first center)
• for i = 2 to k (# clusters):
• $\mu_i$ = the point that is furthest from all previously chosen centers
K-means: Initialize furthest from centers
Say, k = 3
• Pick a random point for the first center
• Which point will be chosen next?
• Next point?
K-means: Initialize furthest from centers
Furthest point from center
Any issues/concerns with this approach?
Furthest points concerns
If k = 4, which points will get chosen?
Doesn’t deal well with outliers
A Better Approach
• K-means++
• Centers are initialized using a probabilistic approach
• Other steps are exactly the same as the standard k-means algorithm
• Cluster centers initialization:
1. Choose one center $c_1$ randomly from the data X
2. For each $x \in X$, compute D(x), the distance of x from the closest center already chosen
3. Select a point $x \in X$ as a new center with probability $\frac{D(x)^2}{\sum_{x \in X} D(x)^2}$
4. Repeat steps 2 and 3 until k centers are chosen
Illustration: say we want to create 3 clusters
[Figure: the first center is marked, along with the most likely second center and the most likely third center]
Quiz: Given the two centers, what will D(x) be for the marked point x: the length of the black line or the red line?
Clustering Graph/Network Data
Graph/Network Data
• So far, we talked about data as n-dimensional points
• What about the following?
• Facebook friendship network
• Communities in question-answering website like Quora
• Protein interaction network
Facebook Friendship Graph/Network
Problem You Want to Solve
• Find coherent groups
• friend circles in Facebook
• Group similar objects
• So, essentially, it is again a grouping problem with a different type of data
Example: Protein Interaction Network
What is a Graph?
• A graph is composed of two things
1. Set of objects (called nodes of the graph)
2. Set of connections (called edges of the graph)
[Figure: a small example graph with one node and one edge labeled]
Types of Graph
• Un-weighted
• Edges do not have weight
• Edges simply say whether two objects are connected or not
• Weighted
• Edges have weight
• Examples:
• Email communication network
• How often do two people exchange emails?
• City and Road network
• Edge weight: the distance between two cities
Graph Data
• Graph data shows relationship between object pairs
• Therefore, a centroid is often not meaningful in a graph
• What does centroid mean in your Facebook friend network?
• How do you define the center in a graph?
Clustering Un-weighted Graph
Graph Clustering: Minimum Cut (or Mincut)
Min cut of a graph
• Partition the nodes into two groups $S_1$ and $S_2$ so that the # of edges between $S_1$ and $S_2$ is minimized
Example
For the above graph, the min cut partition is {a,b,e,f} and {c,d,g,h}
Graph Clustering using Mincut
• Use a min cut algorithm to break a graph into two sets
• Use the min cut algorithm to further break the smaller graphs
• Continue until the stopping condition is satisfied
Karger’s Min Cut Algorithm
The algorithm is based on edge contraction
• Repeat until just two nodes remain
1. Pick an edge at random
2. Collapse its two endpoints into a single node
Karger’s Min Cut Algorithm: Example
14 edges to choose from: pick b–f with prob 1/14
13 edges to choose from: pick g–h with prob 1/13
12 edges to choose from: pick d–gh with prob 2/12
10 edges to choose from: pick a–e with prob 1/10
9 edges to choose from: pick ae–bf with prob 4/9
5 edges to choose from: pick c–dgh with prob 3/5
DONE: just two nodes remain
Min cut value: # of parallel edges in the final two-node graph
For this example: the min cut is 2
Karger’s Min Cut Algorithm
• An Important Note
• It is a randomized algorithm
• Therefore, for a good result, do the following
1. Run Karger’s algorithm multiple times
(it will produce multiple cuts)
2. Take the cut with minimum value
Clustering Weighted Graph
Hierarchical Clustering algorithms
• Agglomerative (bottom-up)
• Start with each object being a single cluster
• Gradually merge two most similar clusters
• Divisive (top-down)
• Start out with all objects in the same cluster.
• Then in each step of the algorithm do the following
• Partition a cluster into two smaller clusters, maximizing the distance between them
Hierarchical Clustering: Important Notes
1. Does not require the number of clusters in advance
2. Needs a termination/readout condition
• Could be distance threshold
Rest of the Lecture
• Hierarchical Agglomerative Clustering (HAC)
• Chosen for simple reasons:
• It is simpler
• It is widely used
Hierarchical Agglomerative Clustering (HAC)
• Define a similarity function for determining the similarity of two instances.
• Start with each instance in its own cluster
• Then repeatedly join the two clusters that are most similar
• The history of merging forms a hierarchy (called a dendrogram).
Hierarchical Clustering
• The important question
How do you determine the “nearness” of clusters?
Closest pair of clusters
Many variants to defining closest pair of clusters
• Single-link
• Distance of the “closest” points (single-link)
• Complete-link
• Distance of the “furthest” points
• Average-link
• Average distance between pairs of elements
Single Link Agglomerative Clustering
• Use the maximum similarity of pairs:
$\mathrm{sim}(c_i, c_j) = \max_{x \in c_i,\, y \in c_j} \mathrm{sim}(x, y)$
[Figure: two clusters with pairwise point distances, including 4, 6, and 10]
Distance between Cluster 1 and 2: 3
Single Link Example
Problem with Single Link Clustering
Chain or elongated clusters
They are far apart, yet they are in the same cluster
Complete Link Agglomerative Clustering
• Use the minimum similarity of pairs:
$\mathrm{sim}(c_i, c_j) = \min_{x \in c_i,\, y \in c_j} \mathrm{sim}(x, y)$
[Figure: the same two clusters with pairwise point distances, including 4, 6, and 10]
Distance between Cluster 1 and 2: 10
• Makes “tighter,” spherical clusters that are typically preferable.
Complete Link Example
Example in Detail: Single Link Clustering
Spectral Clustering
Spectral Clustering
Spectral Clustering: Examples
Spectral Clustering
• Group points based on links in the graph
Creating Graph from n-dimensional data
• Use a Gaussian kernel to compute the similarity between objects i and j, e.g. $w_{ij} = \exp\left(-\frac{\lVert x_i - x_j \rVert^2}{2\sigma^2}\right)$
• Possible graphs:
• Fully connected
• Connect only the k nearest neighbours
Image Segmentation
Why not min cut?
Graph Partitioning
Useful Terminologies
Graph Cut
Normalized cut
Solving Normalized Cut
NP Hard!
Solving Normalized Cut: Approximate Solution
The second smallest eigenvector is the real valued solution to this problem!!
Two-way Normalized Cut
Partitioning using the 2nd Eigenvector
• Second eigenvector takes continuous values
• Difficult to find a clear threshold to split
• How to choose splitting threshold?
• Pick the median value as splitting point
• Look for the splitting point that has the minimum Normalized cut value:
1. Choose n possible splitting points.
2. Compute the Normalized cut value for each.
3. Pick the splitting point with the minimum value.
THANK YOU