Poisson Mixture Models
Brandon Malone
Much of this material is adapted from Bilmes 1998 and Tomasi 2004.
Many of the images were taken from the Internet.
February 20, 2014
Poisson Mixture Models
Suppose we have a dataset D which consists of DNA sequences observed
from a mixture of K bacterial species. We do not know which sequence
belongs to which species.
Sequence Species Count
CAGAGGAT ? 5
TCAGTGTC ? 13
CTCTGTGA ? 2
AACTGTCG ? 7
CGCGTGGA ? 15
GGATGAGA ? 1
Which DNA sequences belong to the same species?
This can be described by a Poisson mixture model.
1 The Poisson Distribution
2 Mixture Models
3 Expectation-Maximization
4 Wrap-up
Multiple Bernoulli trials
Suppose we have a Bernoulli-distributed variable (a weighted coin
flip with parameter θ).
If we flip two coins, what is our probability of seeing exactly one H?
C1  C2  P(C1, C2)
H   H   θ · θ
H   T   θ · (1 − θ)
T   H   (1 − θ) · θ
T   T   (1 − θ) · (1 − θ)

So, P(exactly one H) = 2 · θ · (1 − θ).

In general,
$$P(\text{exactly } m \text{ successes in } n \text{ trials}) = \binom{n}{m}\,\theta^{m}\,(1-\theta)^{n-m}.$$
Take it, to the limit, one more time
What if we let the number of trials n grow without bound while keeping the
expected number of successes fixed at λ (that is, θ = λ/n)?
$$\lim_{n \to \infty} P(\text{exactly } m \text{ successes in } n \text{ trials}) = \frac{\lambda^{m}}{m!} \exp\{-\lambda\}$$

This is called the Poisson distribution.
We will write g(m : λ) to mean P(exactly m successes given λ).
(See the videos for a detailed derivation.)
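As a quick numerical sanity check (not from the original slides), the sketch below compares the binomial probability with θ = λ/n against the Poisson probability as n grows; the values of lam, m, and n are illustrative.

from math import comb, exp, factorial

# Poisson probability of m successes with rate lam
lam, m = 3.0, 2
poisson_pmf = lam**m / factorial(m) * exp(-lam)

# Binomial probability with theta = lam / n approaches the Poisson value as n grows
for n in (10, 100, 10_000):
    theta = lam / n
    binom_pmf = comb(n, m) * theta**m * (1 - theta)**(n - m)
    print(f"n = {n:>6}: binomial = {binom_pmf:.6f}, poisson = {poisson_pmf:.6f}")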
Mixtures of distributions
Suppose we have K Poisson distributions (components) with parameters
λ_1, …, λ_K mixed together with proportions p_1, …, p_K.
We often write P = {p_1, …, p_K} and θ = {λ_1, …, λ_K, P}.
procedure GenerateDataset(Poisson parameters λ_1 … λ_K, mixing proportions p_1 … p_K, number of samples N)
    D ← ∅
    for l = 1 to N do
        component z_l ← sample(Mult(p_1 … p_K))
        observation D_l ← sample(Poisson(λ_{z_l}))
        D ← D ∪ {D_l}
    end for
    return D
end procedure
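A minimal Python sketch of this generative procedure, assuming NumPy is available; the function name generate_dataset and the example parameter values are illustrative, not from the slides.

import numpy as np

def generate_dataset(lambdas, proportions, n_samples, seed=None):
    """Sample counts from a Poisson mixture: draw a component z_l, then D_l ~ Poisson(lambda_{z_l})."""
    rng = np.random.default_rng(seed)
    z = rng.choice(len(lambdas), size=n_samples, p=proportions)   # component assignments z_l
    D = rng.poisson(lam=np.asarray(lambdas)[z])                   # observed counts D_l
    return D, z

D, z = generate_dataset(lambdas=[2.0, 12.0], proportions=[0.6, 0.4], n_samples=100, seed=0)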
Figure: Generative model for a Poisson mixture model (PMM). (Plate diagram: mixing proportions P and component parameters λ_k, with a plate over the K components, generate an assignment z_l and an observed count D_l for each observation in the data plate.)
Likelihood of data
We can write the (log) probability of any mixture model as follows.
For a single observation D_l:
$$P(D_l : \theta) = \sum_{k=1}^{K} p_k\, g(D_l : \lambda_k)$$

For the full dataset D = {D_1, …, D_N}:
$$P(D : \theta) = \prod_{l=1}^{N} \sum_{k=1}^{K} p_k\, g(D_l : \lambda_k)$$
$$\ell(D : \theta) = \log \prod_{l=1}^{N} \sum_{k=1}^{K} p_k\, g(D_l : \lambda_k) = \sum_{l=1}^{N} \log \sum_{k=1}^{K} p_k\, g(D_l : \lambda_k)$$
The learning problem can be formulated as follows.
$$\theta^{*} = \arg\max_{\theta}\, \ell(D : \theta)$$
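A small sketch of this log-likelihood in Python, assuming NumPy and SciPy are available; pmm_log_likelihood is an illustrative name, not from the slides.

import numpy as np
from scipy.stats import poisson

def pmm_log_likelihood(D, lambdas, proportions):
    """ell(D : theta) = sum_l log sum_k p_k * g(D_l : lambda_k)."""
    D = np.asarray(D)
    # q[l, k] = p_k * g(D_l : lambda_k)
    q = np.asarray(proportions) * poisson.pmf(D[:, None], np.asarray(lambdas))
    return np.log(q.sum(axis=1)).sum()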
Membership probabilities
Notation
q(k, l) := p_k g(D_l : λ_k), the joint probability of D_l and component k
P(k|l) := P(z_l = k | D_l), the conditional probability of component k given D_l

The probability that D_l came from component k is expressed as follows.
$$P(k|l) = \frac{q(k, l)}{\sum_{m=1}^{K} q(m, l)}$$

Also, we know each observation came from some component.
$$\sum_{k=1}^{K} P(k|l) = 1$$
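The membership probabilities translate directly into NumPy; this sketch (with the illustrative name responsibilities) normalizes q(k, l) over the components for each observation, assuming NumPy and SciPy.

import numpy as np
from scipy.stats import poisson

def responsibilities(D, lambdas, proportions):
    """P(k | l) = q(k, l) / sum_m q(m, l), with q(k, l) = p_k * g(D_l : lambda_k)."""
    q = np.asarray(proportions) * poisson.pmf(np.asarray(D)[:, None], np.asarray(lambdas))
    return q / q.sum(axis=1, keepdims=True)   # rows sum to 1: sum_k P(k | l) = 1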
Jensen’s Inequality
Recall the likelihood of the mixture model.
$$\ell(D : \theta) = \sum_{l=1}^{N} \log \sum_{k=1}^{K} q(k, l)$$

Jensen's inequality shows the following.
$$\log \sum_{k=1}^{K} \pi_k \alpha_k \;\geq\; \sum_{k=1}^{K} \pi_k \log \alpha_k \qquad \text{when } \pi \text{ is a distribution}$$

We can make this work for arbitrary positive values c_k by multiplying and dividing each term by π_k.
$$\log \sum_{k=1}^{K} c_k = \log \sum_{k=1}^{K} \frac{\pi_k}{\pi_k}\, c_k = \log \sum_{k=1}^{K} \pi_k\, \frac{c_k}{\pi_k} \;\geq\; \sum_{k=1}^{K} \pi_k \log \frac{c_k}{\pi_k}$$
Expectation-Maximization (EM)
Our learning problem is formulated as follows.
$$\theta^{*} = \arg\max_{\theta}\, \ell(D : \theta)$$
EM begins with an initial (typically poor) set of estimates for θ, then repeats two steps:
1. Use Jensen's inequality to compute a lower bound b on ℓ, called the expectation of ℓ.
2. Find values of θ which maximize b.
EM is guaranteed to find values of θ which do not decrease b, and hence do not decrease ℓ.
Expectation and the Q function
Recall the definition of ` and Jensen’s inequality.
$$\ell(D : \theta) = \sum_{l=1}^{N} \log \sum_{k=1}^{K} q(k, l) \;\geq\; \sum_{l=1}^{N} \sum_{k=1}^{K} P(k|l) \log \frac{q(k, l)}{P(k|l)}$$
This gives the expectation of ` with our current parameters θ.
Based on this equation, we define Q(θ), which we want to maximize.
$$Q(\theta) = \sum_{l=1}^{N} \sum_{k=1}^{K} P(k|l) \log q(k, l)$$
(The remaining term, −Σ_l Σ_k P(k|l) log P(k|l), does not depend on θ, so maximizing Q over θ is equivalent to maximizing the bound.)
(See the handout for a detailed derivation of Q.)
Maximization and the Q function
We use the following process to maximize Q for a particular
parameter θi .
1 Differentiate Q w.r.t θi
2 Set the derivative equal to 0
3 Solve for θi
(See the handout for detailed derivations.)
$$\lambda_k = \frac{\sum_{l=1}^{N} P(k|l)\, D_l}{Z(k)} \qquad\qquad p_k = \frac{Z(k)}{N}$$
where Z(k) = Σ_l P(k|l) is the expected number of observations assigned to component k.
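A sketch of the derivation for λ_k (see the handout for the full details; the update for p_k additionally requires a Lagrange multiplier for the constraint Σ_k p_k = 1). Substituting the Poisson pmf into Q gives
$$Q(\theta) = \sum_{l=1}^{N} \sum_{k=1}^{K} P(k|l)\left[\log p_k + D_l \log \lambda_k - \lambda_k - \log D_l!\right]$$
Differentiating with respect to λ_k and setting the derivative to zero:
$$\frac{\partial Q}{\partial \lambda_k} = \sum_{l=1}^{N} P(k|l)\left(\frac{D_l}{\lambda_k} - 1\right) = 0 \;\Longrightarrow\; \lambda_k = \frac{\sum_{l=1}^{N} P(k|l)\, D_l}{\sum_{l=1}^{N} P(k|l)} = \frac{\sum_{l=1}^{N} P(k|l)\, D_l}{Z(k)}$$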
The EM algorithm for PMMs
procedure pmmEM(data D, initial p_1 … p_K, λ_1 … λ_K, convergence criteria C)
    while C has not been met do
        . Update the expectations (E-step)
        q(k, l) ← p_k · g(D_l : λ_k)
        P(k|l) ← q(k, l) / Σ_m q(m, l)
        . Maximize the parameters (M-step)
        λ_k ← Σ_l P(k|l) D_l / Z(k)
        p_k ← Z(k) / N
    end while
end procedure
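A minimal runnable sketch of pmmEM in Python, assuming NumPy and SciPy; a fixed iteration count stands in for the convergence criteria C, and the initial parameter values in the example are illustrative choices.

import numpy as np
from scipy.stats import poisson

def pmm_em(D, init_lambdas, init_proportions, n_iters=100):
    D = np.asarray(D)
    lambdas = np.asarray(init_lambdas, dtype=float)
    p = np.asarray(init_proportions, dtype=float)
    for _ in range(n_iters):
        # E-step: q(k, l) = p_k * g(D_l : lambda_k), normalized to P(k | l)
        q = p * poisson.pmf(D[:, None], lambdas)
        P = q / q.sum(axis=1, keepdims=True)
        # M-step: Z(k) = sum_l P(k|l); update lambda_k and p_k
        Z = P.sum(axis=0)
        lambdas = (P * D[:, None]).sum(axis=0) / Z
        p = Z / len(D)
    return lambdas, p, P

# Example: the counts from the sequence table, clustered with K = 2 components
counts = [5, 13, 2, 7, 15, 1]
lambdas, p, P = pmm_em(counts, init_lambdas=[3.0, 10.0], init_proportions=[0.5, 0.5])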
Grouping the DNA sequences into clusters
After running EM, we have several useful pieces of information
about our metagenomics sample.
P(k|l): the distribution over species for each sequence.
p_k: the relative genome sizes of the species.
λ_k: the abundances of the species.
Other questions...
Do we really know how many species there are?
Can we differentiate species with similar abundances?
How do we pick “good” initial parameters?
When have we converged?
More on EM
EM is a general framework that is useful whenever data is missing.
If used to estimate class probabilities in naive Bayes models, it
is called Bayesian clustering
If used in HMMs, it is called the Baum-Welch algorithm
Can be used in general Bayesian networks to calculate
parameters when some data is missing
If used with structure learning algorithms, it is called
Structural EM
Many, many others...
We maximize likelihood with EM. What if we want MAP
parameters?
Recap
During this part of the course, we have discussed:
Mixture models as a probabilistic clustering method
Expectation-maximization as a framework for estimating
parameters when variables are hidden
Next in probabilistic models
We will see a Bayesian version of EM.
Estimating parameters in topic models