Day & Time: Monday (10am-11am & 3pm-4pm)
Tuesday (10am-11am)
Wednesday (10am-11am & 3pm-4pm)
Friday (9am-10am, 11am-12pm, 2pm-3pm)
Dr. Srinivasa L. Chakravarthy
&
Smt. Jyotsna Rani Thota
Department of CSE
GITAM Institute of Technology (GIT)
Visakhapatnam – 530045
Email: [email protected] & [email protected]
Department of CSE, GIT
2 Nov 2020
EID 403: Machine Learning
Course objectives
● Explore the various disciplines connected with ML.
● Explore the efficiency of learning with inductive bias.
● Explore ML algorithms such as decision tree learning.
● Explore algorithms such as artificial neural networks,
genetic programming, Bayesian algorithms, the nearest-neighbor
algorithm, and hidden Markov models.
Learning Outcomes
● Identify the various applications connected with ML.
● Classify the efficiency of ML algorithms with the inductive bias
technique.
● Distinguish the purposes of the various ML algorithms.
● Analyze an application and correlate it with the available ML
algorithms.
● Choose an appropriate ML algorithm to develop a project.
Syllabus
20 August 2020
Reference book 1. Title: Machine Learning
Author: Tom M. Mitchell
Reference book 2. Title: Introduction to Machine Learning
Author: Ethem Alpaydin
Module 5
(Chapter 15 from the prescribed book by Ethem Alpaydin)
It includes:
Discrete Markov processes
Hidden Markov models
Three problems of HMMs
Evaluation problem
Finding the state sequence
Learning model parameters & continuous observations
HMM with output & model selection in HMMs
Introduction
So far, we assumed that the instances that form a sample are
independent and identically distributed, i.e., each random variable has the
same probability distribution as the others and all are mutually independent.
This assumption is not valid for applications where successive instances
are dependent.
For example, processes where the sequence of observations cannot be
modeled as independent draws from a probability distribution include:
1. Successive letters in a word are dependent.
2. Base pairs in a DNA sequence are dependent.
3. In speech recognition, phonemes in a word (dictionary) and
words in a sentence (syntax, semantics of the language) are dependent.
Introduction
Such a sequence is characterized by a parametric random process.
This chapter covers:
● How the modeling is done.
● How the parameters of such a model can be learned from a training sample of
example sequences.
Discrete Markov Processes
Consider a system that, at any time, is in one of a set of N distinct
states S1, S2, ..., SN.
The state at time t is denoted qt, t = 1, 2, ...
For example, qt = Si means that at time t the system is in state Si.
At regularly spaced discrete times, the system moves to a new state with a
probability that depends on the previous states:
P(qt+1 = Sj | qt = Si, qt-1 = Sk, ...)
Discrete Markov Processes(cont.)
In a first-order Markov model, the state at time t+1 depends only on the state at time t:
P(qt+1 = Sj | qt = Si, qt-1 = Sk, ...) = P(qt+1 = Sj | qt = Si)
This corresponds to saying that, given the present state, the future is
independent of the past.
Let us further assume that these probabilities, called the transition
probabilities, are independent of time:
aij ≡ P(qt+1 = Sj | qt = Si), with aij ≥ 0 and Σ_{j=1..N} aij = 1
So going from Si to Sj has the same probability aij at any time. The only special
case is the first state, which has an initial probability πi:
πi ≡ P(q1 = Si), with Σ_{i=1..N} πi = 1
Discrete Markov Processes(cont.)
Example of a Markov model with three states (a stochastic automaton).
In an observable Markov model, the states are observable. At any time, as the
system moves from one state to another, we get an observation sequence, i.e., a
sequence of states.
The output of the process is the set of states at each instant of time, where each
state corresponds to a physically observable event.
Discrete Markov Processes(cont.)
We have an observation sequence that is the state sequence O = Q = {q1, q2, ..., qT},
whose probability is
P(O = Q | Π, A) = P(q1) ∏_{t=2..T} P(qt | qt-1) = π_{q1} a_{q1 q2} ··· a_{qT-1 qT}
where π_{q1} is the probability of starting in q1 and a_{q1 q2} is the probability of
going from q1 to q2. We multiply these probabilities to get the probability of
the whole sequence.
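The product above can be sketched in a few lines of Python. The initial probabilities and transition matrix below are assumed toy values, not taken from the slides:

```python
import numpy as np

# Assumed (hypothetical) parameters of an observable Markov model with N = 3 states.
pi = np.array([0.5, 0.2, 0.3])            # pi_i = P(q1 = S_i)
A = np.array([[0.4, 0.3, 0.3],            # a_ij = P(q_{t+1} = S_j | q_t = S_i)
              [0.2, 0.6, 0.2],
              [0.1, 0.1, 0.8]])

def sequence_probability(seq, pi, A):
    """P(O) = pi_{q1} * a_{q1 q2} * ... * a_{q_{T-1} q_T}, states as 0-based indices."""
    p = pi[seq[0]]
    for prev, cur in zip(seq, seq[1:]):
        p *= A[prev, cur]
    return p

# O = {S1, S1, S3, S3} -> 0-based indices 0, 0, 2, 2
print(sequence_probability([0, 0, 2, 2], pi, A))  # 0.5 * 0.4 * 0.3 * 0.8 = 0.048
```

Note how each factor is read straight off Π (first state) and A (each transition).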
Discrete Markov Processes(cont.)
Let us assume we have N urns/baskets, where each urn contains balls of only
one color.
So there is an urn of red balls, another of blue balls, and so on.
Let us say we have 3 states: S1 = red, S2 = blue, S3 = green,
with initial probabilities Π = [π1, π2, π3].
Let A = [aij] be an N × N matrix whose rows sum to 1.
aij is the probability of drawing from urn j (a ball of color j) after drawing a ball
of color i from urn i; A is the transition matrix.
Discrete Markov Processes(cont.)
Given 𝚷 and A, it is easy to generate K random sequences, each of length T.
Let us see how to calculate the probability of a sequence.
Assume that the first 4 balls drawn are "red, red, green, green".
This corresponds to the observation sequence O = {S1, S1, S3, S3}, whose
probability is
P(O | Π, A) = P(S1) · P(S1 | S1) · P(S3 | S1) · P(S3 | S3) = π1 · a11 · a13 · a33
Discrete Markov Processes(cont.)
Now, let us see how we can learn the parameters 𝚷 and A.
Given K example sequences of length T, where qt^k is the state at time t of
sequence k, the initial probability estimate is the fraction of sequences that
start in Si:
π̂i = ( Σ_k 1(q1^k = Si) ) / K
where 1(b) is 1 if b is true and 0 otherwise. The transition probability estimate
âij is the fraction of transitions out of Si that go to Sj:
âij = ( Σ_k Σ_{t=1..T-1} 1(qt^k = Si and qt+1^k = Sj) ) / ( Σ_k Σ_{t=1..T-1} 1(qt^k = Si) )
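These counting estimators can be sketched as follows; the K example state sequences are assumed (hypothetical) data over N = 2 states:

```python
import numpy as np

# Assumed toy data: K = 3 state sequences of length T = 4 over N = 2 states.
sequences = [[0, 0, 1, 1], [0, 1, 1, 1], [1, 0, 0, 1]]  # q_t^k as 0-based indices
N = 2

K = len(sequences)
pi_hat = np.zeros(N)
counts = np.zeros((N, N))
for seq in sequences:
    pi_hat[seq[0]] += 1.0 / K              # fraction of sequences starting in S_i
    for prev, cur in zip(seq, seq[1:]):
        counts[prev, cur] += 1             # count S_i -> S_j transitions
A_hat = counts / counts.sum(axis=1, keepdims=True)  # normalize each row

print(pi_hat)   # [2/3, 1/3]
print(A_hat)
```

Each row of A_hat sums to 1 by construction, matching the constraint on A.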
Hidden Markov Models
In an HMM:
1. The states are not observable,
but when we visit a state, an observation is recorded that is a probabilistic
function of the state.
2. There are M discrete observation symbols {v1, v2, ..., vM} possible in each state.
3. The observation (emission) probability bj(m) is the probability that we observe
vm, m = 1...M, in state Sj:
bj(m) ≡ P(Ot = vm | qt = Sj)
We assume that these probabilities do not depend on t.
The values observed form the observation sequence O.
4. The state sequence Q is not observed, which is what makes the model "hidden",
but it can be inferred from the observation sequence O.
Hidden Markov Models(cont.)
Elements of an HMM
N: Number of states
M: Number of observation symbols
A = [aij]: N X N state transition probability matrix
B = bj(m): N X M observation probability matrix
Π = [πi]: N X 1 initial state probability vector
λ = (A, B, Π) is the parameter set of the HMM.
Given λ, the model can be used to generate an arbitrary number of
observation sequences of arbitrary length.
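Generating a sequence from λ can be sketched as follows; the parameter values (N = 2 states, M = 3 symbols) are assumed for illustration:

```python
import numpy as np

# Assumed toy parameter set lambda = (A, B, Pi).
rng = np.random.default_rng(0)
Pi = np.array([0.6, 0.4])
A  = np.array([[0.7, 0.3],
               [0.4, 0.6]])
B  = np.array([[0.5, 0.4, 0.1],   # b_j(m) = P(O_t = v_m | q_t = S_j)
               [0.1, 0.3, 0.6]])

def generate(T, Pi, A, B, rng):
    """Walk the hidden chain from Pi/A; at each step emit a symbol from B."""
    states, obs = [], []
    q = rng.choice(len(Pi), p=Pi)                   # initial state ~ Pi
    for _ in range(T):
        states.append(int(q))
        obs.append(int(rng.choice(B.shape[1], p=B[q])))  # symbol ~ b_q(.)
        q = rng.choice(len(Pi), p=A[q])             # next state ~ row q of A
    return states, obs

states, obs = generate(5, Pi, A, B, rng)
print(states, obs)
```

Only obs would be visible to an observer; states is the hidden sequence Q.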
Three Basic Problems of HMMs
Given a number of sequences of observations, we are interested in three problems:
1. Evaluation: Given a model λ and an observation sequence O, evaluate the
probability P(O | λ).
2. State sequence: Given λ and O, find the state sequence Q* = {q1, q2, ..., qT}
that has the highest probability,
such that P(Q* | O, λ) = maxQ P(Q | O, λ).
3. Learning: Given a training set of observation sequences X = {Ok}k, learn the
model that maximizes the probability of X, i.e., find λ* = arg maxλ P(X | λ).
Hidden Markov Models(cont.)
1. Evaluation Problem
Given an observation sequence O = {O1, O2, ..., OT}, with hidden state sequence
Q = {q1, ..., qT} and HMM parameter set λ, we want P(O | λ).
To calculate P(O | λ) there is an efficient procedure called the forward-backward
procedure.
It is based on the idea of dividing the observation sequence into two parts:
1. from time 1 until time t, and
2. from time t+1 until time T.
Hidden Markov Models(cont.)
1. Evaluation Problem (cont.)
We define the forward variable αt(i) as the probability of observing the partial
sequence {O1 ... Ot} up to time t and being in Si at time t, given the model λ:
αt(i) ≡ P(O1 ··· Ot, qt = Si | λ)
The nice thing about it is that it can be calculated recursively by accumulating
results:
Initialization: α1(i) = πi bi(O1)
Recursion: αt+1(j) = [ Σ_{i=1..N} αt(i) aij ] bj(Ot+1)
Then P(O | λ) = Σ_{i=1..N} αT(i).
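A minimal sketch of the forward pass, using assumed toy values for Pi, A, B and the observation sequence:

```python
import numpy as np

# Assumed toy HMM (N = 2 states, M = 2 symbols) and observation sequence.
Pi = np.array([0.6, 0.4])
A  = np.array([[0.7, 0.3],
               [0.4, 0.6]])
B  = np.array([[0.9, 0.1],
               [0.2, 0.8]])
O  = [0, 1, 0]                               # observed symbol indices

def forward(O, Pi, A, B):
    """alpha[t, i] = P(O_1..O_t, q_t = S_i | lambda)."""
    T, N = len(O), len(Pi)
    alpha = np.zeros((T, N))
    alpha[0] = Pi * B[:, O[0]]               # initialization: pi_i * b_i(O_1)
    for t in range(1, T):
        alpha[t] = (alpha[t - 1] @ A) * B[:, O[t]]  # recursion step
    return alpha

alpha = forward(O, Pi, A, B)
print(alpha[-1].sum())                       # P(O | lambda) = sum_i alpha_T(i)
```

The recursion costs O(N²T), versus O(N^T) for summing over all state sequences directly.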
Hidden Markov Models(cont.)
1. Evaluation Problem (cont.)
We define the backward variable βt(i) as the probability of observing the partial
sequence Ot+1 ... OT, given that we are in Si at time t and the model λ:
βt(i) ≡ P(Ot+1 ··· OT | qt = Si, λ)
It can be calculated recursively, with time going in the backward direction:
Initialization: βT(i) = 1
Recursion: βt(i) = Σ_{j=1..N} aij bj(Ot+1) βt+1(j)
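The backward pass can be sketched on the same assumed toy model; as a consistency check, P(O | λ) computed from β matches the forward value:

```python
import numpy as np

# Same assumed toy HMM as in the forward sketch.
Pi = np.array([0.6, 0.4])
A  = np.array([[0.7, 0.3],
               [0.4, 0.6]])
B  = np.array([[0.9, 0.1],
               [0.2, 0.8]])
O  = [0, 1, 0]

def backward(O, Pi, A, B):
    """beta[t, i] = P(O_{t+1}..O_T | q_t = S_i, lambda)."""
    T, N = len(O), len(Pi)
    beta = np.zeros((T, N))
    beta[-1] = 1.0                                    # initialization: beta_T(i) = 1
    for t in range(T - 2, -1, -1):
        beta[t] = A @ (B[:, O[t + 1]] * beta[t + 1])  # recursion step
    return beta

beta = backward(O, Pi, A, B)
# Consistency check: P(O | lambda) = sum_i pi_i * b_i(O_1) * beta_1(i)
print((Pi * B[:, O[0]] * beta[0]).sum())
```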
Hidden Markov Models(cont.)
2. Finding the State Sequence
Let us define γt(i) as the probability of being in state Si at time t, given O
and λ, which can be computed as follows:
γt(i) ≡ P(qt = Si | O, λ) = αt(i) βt(i) / Σ_{j=1..N} αt(j) βt(j)
To find a state sequence, choose the state that has the highest probability,
for each time step t:
qt* = arg maxi γt(i)
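Putting the pieces together, the posterior γ and the per-step choice qt* can be sketched like this (same assumed toy model, forward/backward recomputed inline):

```python
import numpy as np

# Same assumed toy HMM as before.
Pi = np.array([0.6, 0.4])
A  = np.array([[0.7, 0.3],
               [0.4, 0.6]])
B  = np.array([[0.9, 0.1],
               [0.2, 0.8]])
O  = [0, 1, 0]
T, N = len(O), len(Pi)

alpha = np.zeros((T, N)); beta = np.zeros((T, N))
alpha[0] = Pi * B[:, O[0]]                   # forward pass
for t in range(1, T):
    alpha[t] = (alpha[t - 1] @ A) * B[:, O[t]]
beta[-1] = 1.0                               # backward pass
for t in range(T - 2, -1, -1):
    beta[t] = A @ (B[:, O[t + 1]] * beta[t + 1])

gamma = alpha * beta
gamma /= gamma.sum(axis=1, keepdims=True)    # gamma_t(i) = P(q_t = S_i | O, lambda)
q_star = gamma.argmax(axis=1)                # individually most likely state per t
print(q_star)
```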
Hidden Markov Models(cont.)
2. Finding the State Sequence (cont.)
Choosing the most likely state at each step separately may give an infeasible
sequence, e.g., one containing a zero-probability transition. To find the single
best state sequence, we use the Viterbi algorithm, based on dynamic programming,
which takes the transition probabilities into account.
Given the state sequence Q and observation sequence O, we define
δt(i) ≡ max_{q1 q2 ··· qt-1} P(q1 q2 ··· qt-1, qt = Si, O1 ··· Ot | λ)
where δt(i) is the probability of the highest-probability path at time t that
accounts for the first t observations and ends in Si. It is computed recursively:
Initialization: δ1(i) = πi bi(O1)
Recursion: δt+1(j) = [ maxi δt(i) aij ] bj(Ot+1)
The best path is then recovered by backtracking from arg maxi δT(i).
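The Viterbi recursion with backtracking can be sketched on the same assumed toy model:

```python
import numpy as np

# Same assumed toy HMM as before.
Pi = np.array([0.6, 0.4])
A  = np.array([[0.7, 0.3],
               [0.4, 0.6]])
B  = np.array([[0.9, 0.1],
               [0.2, 0.8]])
O  = [0, 1, 0]

def viterbi(O, Pi, A, B):
    T, N = len(O), len(Pi)
    delta = np.zeros((T, N))
    psi = np.zeros((T, N), dtype=int)             # best predecessor of each state
    delta[0] = Pi * B[:, O[0]]                    # delta_1(i) = pi_i b_i(O_1)
    for t in range(1, T):
        trans = delta[t - 1][:, None] * A         # delta_{t-1}(i) * a_ij
        psi[t] = trans.argmax(axis=0)             # best i for each j
        delta[t] = trans.max(axis=0) * B[:, O[t]]
    q = [int(delta[-1].argmax())]                 # best final state
    for t in range(T - 1, 0, -1):                 # backtrack through psi
        q.append(int(psi[t][q[-1]]))
    return q[::-1], delta[-1].max()

path, p = viterbi(O, Pi, A, B)
print(path, p)
```

On this tiny example the Viterbi path agrees with the per-step γ choice; with stronger transition constraints the two can differ.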
Hidden Markov Models(cont.)
3. Learning Model Parameters
We want to calculate the λ* that maximizes the likelihood of the sample X,
i.e., P(X | λ); this is done with the Baum-Welch algorithm, an instance of
expectation-maximization.
We define ξt(i, j) as the probability of being in Si at time t and in Sj at
time t+1, given the whole observation sequence O and λ:
ξt(i, j) ≡ P(qt = Si, qt+1 = Sj | O, λ) = αt(i) aij bj(Ot+1) βt+1(j) / P(O | λ)
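As a sketch, here is the computation of ξ and one Baum-Welch re-estimation step for A, on the same assumed toy model (a full fit would iterate and also re-estimate Π and B):

```python
import numpy as np

# Same assumed toy HMM as before.
Pi = np.array([0.6, 0.4])
A  = np.array([[0.7, 0.3],
               [0.4, 0.6]])
B  = np.array([[0.9, 0.1],
               [0.2, 0.8]])
O  = [0, 1, 0]
T, N = len(O), len(Pi)

alpha = np.zeros((T, N)); beta = np.zeros((T, N))
alpha[0] = Pi * B[:, O[0]]                     # forward pass
for t in range(1, T):
    alpha[t] = (alpha[t - 1] @ A) * B[:, O[t]]
beta[-1] = 1.0                                 # backward pass
for t in range(T - 2, -1, -1):
    beta[t] = A @ (B[:, O[t + 1]] * beta[t + 1])

p_O = alpha[-1].sum()                          # P(O | lambda)
xi = np.zeros((T - 1, N, N))
for t in range(T - 1):
    # xi_t(i,j) = alpha_t(i) a_ij b_j(O_{t+1}) beta_{t+1}(j) / P(O | lambda)
    xi[t] = alpha[t][:, None] * A * B[:, O[t + 1]] * beta[t + 1] / p_O

gamma = xi.sum(axis=2)                         # gamma_t(i), t = 1..T-1
A_new = xi.sum(axis=0) / gamma.sum(axis=0)[:, None]  # expected transitions, normalized
print(A_new)
```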
Hidden Markov Models(cont.)
Continuous Observations
So far we assumed discrete observations, modeled as a multinomial:
P(Ot | qt = Sj, λ) = ∏_{m=1..M} bj(m)^{r_t^m}, where r_t^m = 1 if Ot = vm and 0 otherwise.
If the inputs are continuous, one possibility is to discretize them by vector
quantization; the k-means used for vector quantization is the hard version of a
Gaussian mixture model.
For a scalar continuous observation, the easiest option is to assume a normal
distribution:
p(Ot | qt = Sj, λ) ~ N(μj, σj²)
Hidden Markov Models(cont.)
Model Selection in HMMs
Example of a left-right HMM, in which transitions go only from a state to itself
or to states with higher indices.
In classification, we estimate P(O | λi) with a separate HMM per class and
use Bayes' rule to get the class posterior:
P(λi | O) = P(O | λi) P(λi) / Σj P(O | λj) P(λj)
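The classification scheme can be sketched as follows; both class models and the priors are assumed toy values, and the likelihood is computed with the forward algorithm:

```python
import numpy as np

def likelihood(O, Pi, A, B):
    """P(O | lambda) via the forward algorithm."""
    alpha = Pi * B[:, O[0]]
    for o in O[1:]:
        alpha = (alpha @ A) * B[:, o]
    return alpha.sum()

# Two hypothetical class models lambda_1, lambda_2 over the same 2 symbols:
# class 0 tends to emit symbol 0, class 1 tends to emit symbol 1.
models = [
    (np.array([0.9, 0.1]), np.array([[0.9, 0.1], [0.1, 0.9]]),
     np.array([[0.8, 0.2], [0.3, 0.7]])),
    (np.array([0.2, 0.8]), np.array([[0.5, 0.5], [0.5, 0.5]]),
     np.array([[0.1, 0.9], [0.4, 0.6]])),
]
priors = np.array([0.5, 0.5])                  # P(lambda_i)

O = [0, 0, 1, 0]
lik = np.array([likelihood(O, *m) for m in models])
post = lik * priors / (lik * priors).sum()     # Bayes' rule
print(post.argmax())                           # predicted class
```

Each HMM is trained only on its own class's sequences; at test time the posterior combines the per-class likelihoods with the priors.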
END OF MODULE-5 (Chapter 15)