Unit – III
Unsupervised Learning, Clustering, Support Vector Machines
• Parametric methods are used when the data is known to follow a distribution
• Use the data to estimate the parameters of the distribution
– Typically few in number
• May be too rigid in some cases – i.e., always assumes the same distribution
Clustering
• (Q) Supervised or unsupervised?
K-Means Clustering
• A method to assign each sample to a cluster
• Minimize the overall cost = reconstruction error = total distance between samples and their cluster “centers”
• Choosing a cluster for each sample: assign it to the nearest center (see the rule below)
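In the standard K-means notation (a conventional formulation; the symbols b_i^t, x^t, m_i are assumed here, not taken from the slide), the assignment rule is

b_i^t = 1 \text{ if } \lVert x^t - m_i \rVert = \min_j \lVert x^t - m_j \rVert, \text{ and } b_i^t = 0 \text{ otherwise}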
K-Means Clustering
• Once new centers are obtained, recalculate the assignments
• Repeat with the new means and re-assign samples
• Continue till there is no change in the centers (a sketch of the full loop follows)
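A minimal sketch of the loop described above, assuming 1-D data and Euclidean distance (the samples and k below are illustrative):

import random

data = [1.0, 1.2, 0.8, 5.0, 5.3, 4.9, 9.1, 9.0, 8.8]   # illustrative samples
k = 3
centers = random.sample(data, k)              # choose a random set of cluster centers

while True:
    # assignment step: each sample goes to its nearest center
    clusters = [[] for _ in range(k)]
    for x in data:
        i = min(range(k), key=lambda j: abs(x - centers[j]))
        clusters[i].append(x)
    # update step: each center becomes the mean of its assigned samples
    new_centers = [sum(c) / len(c) if c else centers[i]
                   for i, c in enumerate(clusters)]
    if new_centers == centers:                # stop when the centers no longer change
        break
    centers = new_centers

print(centers)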
K-Means Clustering
• Choose a random set of cluster centers
• The reconstruction error should be minimized – differentiate it with respect to each center and equate to zero
• This gives the new cluster centers – nothing but the mean of the samples assigned to each cluster (see the derivation below)
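In the same assumed notation as above, the reconstruction error and the resulting update are (a standard derivation, not copied from the slide):

E(\{m_i\}) = \sum_t \sum_i b_i^t \, \lVert x^t - m_i \rVert^2, \qquad \frac{\partial E}{\partial m_i} = 0 \;\Rightarrow\; m_i = \frac{\sum_t b_i^t \, x^t}{\sum_t b_i^t}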
Leader Clustering
• Some outliers may skew the means
• Add a parameter “t” – a maximum distance
– If a sample is not within t of any cluster, it becomes a new cluster head (a sketch follows this list)
• Recalculate with the new means and continue
• (Q) What is the value of t for which leader clustering becomes K-means?
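A minimal sketch of the leader step, assuming 1-D data (the samples and the threshold t are illustrative):

data = [1.0, 1.2, 0.8, 5.0, 5.3, 4.9, 9.1, 9.0, 8.8, 25.0]   # last value is an outlier
t = 2.0                        # maximum allowed distance to an existing leader

leaders = []                   # cluster heads
clusters = []                  # samples assigned to each head
for x in data:
    # index of the nearest existing leader, if any
    nearest = min(range(len(leaders)), key=lambda j: abs(x - leaders[j]), default=None)
    if nearest is None or abs(x - leaders[nearest]) > t:
        leaders.append(x)      # not within t of any cluster: the sample starts a new cluster
        clusters.append([x])
    else:
        clusters[nearest].append(x)

# recalculate each leader as the mean of its cluster, and one could then continue iterating
leaders = [sum(c) / len(c) for c in clusters]
print(leaders)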
EM Algorithm
Expectation Maximization (EM) Algorithm
• Unsupervised data – only data, no class labels
• Hence, the data consists of two parts – observables X and unknowns Z
• E-step: estimate the unknowns Z given our current knowledge of the components
• M-step: update the component parameters given the estimates from the E-step
– (Q) Is K-Means also a type of EM algorithm?
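In the usual EM formulation (standard notation, assumed here rather than taken from the slide), with parameters \theta:

\mathcal{Q}(\theta \mid \theta^{(k)}) = \mathbb{E}_{Z \mid X, \theta^{(k)}}\left[\log p(X, Z \mid \theta)\right] \quad \text{(E-step)}, \qquad \theta^{(k+1)} = \arg\max_{\theta} \mathcal{Q}(\theta \mid \theta^{(k)}) \quad \text{(M-step)}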
EM Example
• Example: missing normally distributed data
• Assume a Gaussian variable
• {5, 11, x, x}
• Start: choose a random value for x
• E-step: calculate the mean of the data using the current value of x
• M-step: replace x with the new mean from the previous step
• Continue till convergence (will it converge?? – see the sketch below)
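A minimal sketch of this iteration (assuming both missing entries are imputed with the current mean; the starting guess is arbitrary):

observed = [5.0, 11.0]
x = 100.0                            # arbitrary starting value for the missing entries
for step in range(20):
    data = observed + [x, x]
    mean = sum(data) / len(data)     # E-step: mean under the current imputation
    x = mean                         # M-step: re-impute the missing value with that mean
    print(step, round(x, 4))
# x converges to 8.0, which is the mean of the observed values {5, 11}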
Coin Example
Sample EM Algorithm
• Consider the six samples and the true likelihoods of the two classes
• Problem setting in EM
– What we have: K (i.e., the number of classes), the form of the distribution, and the data points
– What we don’t have: the parameters of the distributions and the class labels
• How to get the two class means?
Sample EM Algorithm
• (Q) Outline the application of the EM algorithm for
Gaussian Mixtures.
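A hedged sketch of EM for a two-component 1-D Gaussian mixture, in answer to the outline question above (the six sample values are illustrative assumptions, not the values from the slide):

import math

samples = [1.0, 1.5, 2.0, 8.0, 8.5, 9.0]    # hypothetical data
K = 2
weights = [0.5, 0.5]                         # initial mixing weights
mu = [1.0, 9.0]                              # initial means
var = [1.0, 1.0]                             # initial variances

def gauss(x, m, v):
    return math.exp(-(x - m) ** 2 / (2 * v)) / math.sqrt(2 * math.pi * v)

for _ in range(50):
    # E-step: responsibility of each component for each sample
    resp = []
    for x in samples:
        p = [weights[k] * gauss(x, mu[k], var[k]) for k in range(K)]
        s = sum(p)
        resp.append([pk / s for pk in p])
    # M-step: re-estimate weights, means and variances from the responsibilities
    for k in range(K):
        nk = sum(r[k] for r in resp)
        weights[k] = nk / len(samples)
        mu[k] = sum(r[k] * x for r, x in zip(resp, samples)) / nk
        var[k] = sum(r[k] * (x - mu[k]) ** 2 for r, x in zip(resp, samples)) / nk

print(mu, var, weights)                      # the two class means, variances and weights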
Hierarchical Clustering
Agglomerative Clustering
• Agglomerate – collect into a group
• Idea: start with each sample as a separate cluster
– Bottom-up approach
• Start by combining samples that are close to each other
– Needs a distance, e.g., for documents, map words to vectors and use Euclidean distance
• Each iteration – combine the two closest clusters (i.e., agglomerate them)
• Continue until only one group is left
Agglomerative Clustering
• Consider the second iteration of agglomerative clustering – i.e., suppose all groups have two elements
• How to combine two groups – what is the distance between two groups with two elements each?
– Single link clustering – calculate all four (how?) pairwise distances; the smallest is the distance between the groups
– Complete link clustering – the distance between clusters is the largest distance over all pairs (see the formulas below)
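In standard linkage notation (a conventional formulation, not copied from the slide), for clusters A and B:

d_{\text{single}}(A, B) = \min_{a \in A,\; b \in B} d(a, b), \qquad d_{\text{complete}}(A, B) = \max_{a \in A,\; b \in B} d(a, b)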
Agglomerative Clustering
• Perform agglomerative clustering and show the intermediate steps on the following pairwise distance matrix (a sketch for checking the steps follows the table):

      1   2   3   4
  1   0   7   4   8
  2       0   2   1
  3           0   8
  4               0
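A hedged SciPy sketch for checking the intermediate steps (assuming the table above is a symmetric distance matrix over samples 1–4):

import numpy as np
from scipy.cluster.hierarchy import linkage
from scipy.spatial.distance import squareform

# full symmetric distance matrix built from the exercise above
D = np.array([[0, 7, 4, 8],
              [7, 0, 2, 1],
              [4, 2, 0, 8],
              [8, 1, 8, 0]], dtype=float)

condensed = squareform(D)                  # condensed (upper-triangular) form
Z = linkage(condensed, method='single')    # single-link agglomerative clustering
print(Z)                                   # each row: the two clusters merged, merge distance, new size

With single link, the merges should come out as {2, 4} at distance 1, then {2, 3, 4} at distance 2, then all four samples at distance 4.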
Dendrogram
• Used to visualize the clusters
• Clusters are joined at a “height” – based on the linkage type
• Two clusters combined into one – an agglomeration
• Can be “cut” at a height h so that samples are not grouped beyond that distance
• (Q) Draw the dendrogram using single link clustering and cut it at height 0.55. What are the resulting clusters? (A sketch for checking this follows.)
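A hedged SciPy sketch of drawing and cutting a dendrogram (the data for this question is not shown, so the distance matrix from the earlier exercise is reused; with that matrix, a cut at 0.55 leaves every sample in its own cluster, since the smallest merge distance is 1):

import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import linkage, dendrogram, fcluster

# condensed distances d(1,2), d(1,3), d(1,4), d(2,3), d(2,4), d(3,4) from the exercise above
condensed = [7.0, 4.0, 8.0, 2.0, 1.0, 8.0]
Z = linkage(condensed, method='single')    # single-link merges

dendrogram(Z, labels=['1', '2', '3', '4']) # merge distances become the joining heights in the tree
plt.axhline(y=0.55, linestyle='--')        # the cut height from the question
plt.show()

clusters = fcluster(Z, t=0.55, criterion='distance')   # flat clusters below the cut height
print(clusters)                            # one cluster label per sample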
Kernelised SVM
Kernels
• Example: image kernels for feature extraction (a small sketch follows)
– Types for feature extraction: Identity, Blur, Edge Detection, Sharpening
– Operations: Convolution, Pooling, Dilation, Erosion, Cross-correlation
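A hedged sketch applying two of these kernel types by convolution (the 3×3 kernel values are the commonly used ones, assumed here rather than taken from the slide):

import numpy as np
from scipy.signal import convolve2d

image = np.random.rand(8, 8)                       # toy grayscale image

identity = np.array([[0, 0, 0],
                     [0, 1, 0],
                     [0, 0, 0]], dtype=float)      # identity kernel: leaves the image unchanged
sharpen = np.array([[ 0, -1,  0],
                    [-1,  5, -1],
                    [ 0, -1,  0]], dtype=float)    # common sharpening kernel

out_identity = convolve2d(image, identity, mode='same')
out_sharpen = convolve2d(image, sharpen, mode='same')   # emphasizes local contrast

print(np.allclose(out_identity, image))            # True: convolving with the identity is a no-op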