Expectation Maximization
Expectation Maximization
Maximization
(Data Mining and Warehousing)
Expectation Maximization
How EM works
Applications of EM
Example – Probabilistic Clustering
Both Maximum Likelihood Estimation (MLE) and EM can find the "best-fit"
parameters, but with different methodologies.
MLE accumulates all the data object to estimate the parameters; but EM
takes a guess at the parameter first, and then tweaks the model to fit the
guesses and the observed data.
wij =
where j=1,2.
According to this formula, after 1st iteration, we get the updated
centers c1(8.47,5.12) and c2(10.42,8.99).
Fuzzy Clustering Using EM (Contd.)
Fuzzy Clustering Using EM (Contd.)
Mixture Models
Probabilistically-grounded way of doing soft clustering.
Each cluster is a generative model (Gaussian, Multinomial).
The parameters are latent (Mean, covariance etc.).
Advantages and Limitations of EM
Advantages:
It is always guaranteed that likelihood will increase with each iteration.
The E-step and M-step are often pretty easy for many problems in terms of
implementation.
Solutions to the M-steps often exist in the closed form.
Limitations:
Slow convergence.
It makes convergence to the local optima only.
It requires both the probabilities, forward and backward (numerical
optimization requires only forward probability).
EM – Dealing with The Local Maxima
Problem
1. Jung YG, Kang MS, Heo J. Clustering performance comparison using K-means and
expectation maximization
algorithms. Biotechnol Biotechnol Equip. 2014;28(sup1):S44-S48.
doi:10.1080/13102818.2014.949045
2. Gupta, Ujjwal Das, Vinay Menon, and Uday Babbar. "Detecting the number of
clusters during expectation-maximization clustering using information criterion."
In 2010 Second International Conference on Machine Learning and Computing, pp.
169-173. IEEE, 2010.
3. https://towardsdatascience.com/a-comparison-between-k-means-clustering-and-
expectation-maximization-estimation-for-clustering-8c75a1193eb7
4. https://www.geeksforgeeks.org/ml-expectation-maximization-algorithm/
5. http://www.inf.ed.ac.uk/teaching/courses/iaml/2011/slides/em.pdf
6. https://machinelearningmastery.com/expectation-maximization-em-algorithm/
7. https://www.statisticshowto.com/em-algorithm-expectation-maximization/
Thank you!!
Univariate Gaussian Mixture Model with EM
Does it look
more like a
sample from
yellow
gaussian, or
blue?
Bayesian Posterior
Each cluster
follows 1-D
Gaussian
distribution.
Optimal Number of k for Gaussians