0% found this document useful (0 votes)

175 views69 pages

Automata Module 2

Automata module 2

Uploaded by

nayakbhuvangame

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

175 views69 pages

Automata Module 2

Automata module 2

Uploaded by

nayakbhuvangame

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 69

AUTOMATA THEORY AND COMPUTABILITY

[As per Choice Based Credit System (CBCS) scheme]

SEMESTER – V

Module-2

NOTES

Page 1
Regular Expressions (RE): what is a RE? Kleene’s theorem, Applications of REs, Manipulating

and Simplifying REs. Regular Grammars: Definition, Regular Grammars and Regular languages.

Regular Languages (RL) and Non-regular Languages: How many RLs, To show that a language is

regular, Closure properties of RLs, to show some languages are not RLs.

Page 2
Module 2
CONTENTS

Title Page
Chapter No: 2 No.
2.1 REGULAR EXPRESSION 1 - 46

2.1.1 What is Regular Expression 1

2.1.2 Kleen’s Theorem 9
2.1.2.1 Building FSM from Regular Expression 10
2.1.2.2 Building Regular Expression from FSM – State Elimination Technique 20
2.1.2.3 Building Regular Expression from FSM – Kleen’s Theorem 38
2.1.3 Application of Regular Expression 40
2.1.4 Manipulating and Simplifying Regular Expressions. 40
2.2 REGULAR GRAMMARS 47- 66

2.2.1 Definition of Regular Grammars and Regular Languages 47

2.2.2 Regular and Non-Regular Languages, How many Regular languages? 54

2.2.3 To show that a language is regular, 54

2.2.4 Closure properties of RLs 55

2.2.5 To show some languages are not RLs.( Pumping Theorem for RLs) 59

Page 3
Automata Theory and Computability Regular Expressions

Regular Expressions
Introduction
Instead of focusing on the power of a computing device, let's look at the task that we need to
perform. Let's consider problems in which our goal is to match finite or repeating patterns.
For example regular expressions are used as pattern description language in
• Lexical analysis.-- compiler
• Filtering email for spam.
• Sorting email into appropriate mailboxes based on sender and/or content words and
phrases.
• Searching a complex directory structure by specifying patterns that are known to occur in
the file we want.
A regular expression is a pattern description language, which is used to describe particular
patterns of interest. A regular expression provides a concise and flexible means for
"matching" strings of text, such as particular characters, words, or patterns of characters.
Example: [ ] : A character class which matches any character within the brackets

[^ \t\n] matches any character except space, tab and newline character.
Regular expression:
A language accepted by a finite- state machine is called as regular language. A regular
language can be described using regular expressions, in the form of algebraic notations
consisting of the symbols such as alphabets or symbols in Σ and a set of special symbols to
which we will attach particular meanings when they occur in a regular expression. These
symbols are Ø, U, ε, (, ), *, and .

Define a Regular expression.

A regular expression is a string that can be formed according to the following rules:
1. Ø is a regular expression.
2. ε is a regular expression.
3. Every element in ∑ is a regular expression.
4. Given two regular expressions α and β, αβ is a regular expression.

Page 1
Automata Theory and Computability Regular Expressions

5. Given two regular expressions α and β, α U β is a regular expression.

6. Given a regular expression α, α* is a regular expression.
7. Given a regular expression α, α+ is a regular expression.
8. Given a regular expression α, (α) is a regular expression.
Example:
Let ∑ = {a, b}, The following strings are regular expressions:
Ø , ε, a, b, (aUb)* etc……….
NOTE:
1. L (Ø) = Ø, the language that contains no strings.
2. L(ε) = {ε}, the language that contains just the empty string.
3. For any c ϵ ∑, L (c) = {c}, the language that contains the single one character string c
4. For any regular expressions α and β. L (αβ) = L(α)L(β). That is concatenation of two
regular expressions.
5. If either L(α) or L(β) is equal to Ø, then the concatenation will also be equal to Ø
6. For any regular expressions α and β. L (αUβ) = L(α) U L(β). That is union of two
regular expressions.
7. For any regular expressions α, L (α*) = (L (α))* where * is the Kleen start
operator. 8. L (Ø*) = { ε }

What is L((aU b)* b) = ?

L((aU b)* b) = L((a U b)*) L(b)

= (L(a) U L(b))* L(b)
= ({a} U {b})* {b}
= {a, b}* {b}. That is the set of all strings over the alphabet {a, b} that ends in b.

Page 2
Automata Theory and Computability Regular Expressions

Regular expression Meaning

a* String consisting of any number of a’s. (zero or more a’s)
a+ String consisting of at least one a. (one or more a’s)
(a , b) String consisting of either a or b
(a, b)* String consisting of any nuber of a’s and b’s including ε
(a, b)* ab Strings of a’s and b’s ending with ab.
ab(a, b)* Strings of a’s and b’s starting with ab.
(a , b)* ab (a,b)* Strings of a’s and b’s with substring ab.

Write the regular expressions for the following languages:

a. Strings of a’s and b’s having length 2:
Regular expression = ( a + b) ( a+ b). OR (a, b) (a, b) OR (a U b) (a U b)
b. Strings of a’s and b’s of length  10:
Regular expression = ( ε + a + b)10.
c. Strings of a’s and b’s of even length.
Regular expression = ((a + b) ( a + b) )*
d. Strings of a’s and b’s of odd length
Regular expression = ( a + b) ((a + b) ( a + b))*
e. Strings of a’s of even length
Regular expression = (aa)*
f. Strings of a’s of odd length
Regular expression = a(aa)*
Strings of a’s and b’s with alternate a’s and b’s.
Alternate a’s and b’s can be obtained by concatenating the string (ab) zero or more times. ie (ab)*
and adding an optional b to the front ie: (ε +b) and adding an optional a at the end, ie: (ε +a)
The regular expression = (ε +b) (ab)*(ε +a)

Page 3
Automata Theory and Computability Regular Expressions

Obtain regular expression to accept the language containing at least one a and one b over Σ = { a,
b, c}. OR
Obtain regular expression to accept the language containing at least one 0 and one 1 over Σ = {
0, 1, 2}.
String should contain at least one a and one b, so the regular expression corresponding to this is
given by = ab + ba
There is no restriction on c’s. Insert any number of a’s, b’s and c;s ie: (a+b+c)* in between the
above regular expression.
So the regular expression = (a+b+c)* a (a+b+c)* b(a+b+c)* + (a+b+c)* b(a+b+c)*a(a+b+c)*
Obtain regular expression to accept the language containing at least 3 consecutive zeros.
Regular expression for string containing 3 consecutive 0’s = 000
The above regular expression can be preceded or followed by any number of 0’s and 1’s, ie:
(0+1)*
Regular expression = (0+1)*000(0+1)*
Obtain regular expression to accept the language containing strings of a’s and b’s ending with b
and has no substring aa.
Regular expression for strings of a’s and b’s ending with b and has no substring aa is nothing but
the string containing any combinations of either b or ab without ε.
Regular expression = ( b + ab) (b +ab)*
Obtain regular expression to accept the language containing strings of a’s and b’s such that
L = { a2n b2m | n, m  0 }
a2n means even number of a’s, regular expression = (aa)*
b2m means even number of b’s, regular expression = (bb)*.
The regular expression for the given language = (aa)* (bb)*
Obtain regular expression to accept the language containing strings of a’s and b’s such that L = {
a2n+1 b2m | n, m  0 }.
a2n+1 means odd number of a’s, regular expression = a(aa)*
b2m means even number of b’s, regular expression = (bb)*
The regular expression for the given language = a(aa)* (bb)*

Page 4
Automata Theory and Computability Regular Expressions

Obtain regular expression to accept the language containing strings of a’s and b’s such that L = {
a2n+1 b2m+1 | n, m  0 }.
a2n+1 means odd number of a’s, regular expression = a(aa)*
b2m+1 means odd number of b’s, regular expression = b(bb)*
The regular expression for the given language = a(aa)*b(bb)*
Obtain regular expression to accept the language containing strings of 0’s and 1’s with exactly
one 1 and an even number of 0’s.
Regular expression for exactly one 1 = 1
Even number of 0’s = (00)*
So here 1 can be preceded or followed by even number of 0’s or 1 can be preceded and followed
by odd number of 0’s.
The regular expression for the given language = (00)* 1 (00)* + 0(00)* 1 0(00)*
Obtain regular expression to accept the language containing strings of 0’s and 1’s having no two
consecutive 0’s. OR
Obtain regular expression to accept the language containing strings of 0’s and 1’s with no pair of
consecutive 0’s.
Whenever a 0 occurs it should be followed by 1. But there is no restriction on number of 1’s. So
it is a string consisting of any combinations of 1’s and 01’s, ie regular expression = (1+01)*
Suppose string ends with 0, the above regular expression can be modified by inserting (0 + ε ) at
the end.
Regular expression for the given language = (1+01)* (0 + ε )
Obtain regular expression to accept the language containing strings of 0’s and 1’s having no two
consecutive 1’s. OR
Obtain regular expression to accept the language containing strings of 0’s and 1’s with no pair of
consecutive 1’s.
Whenever a 1 occurs it should be followed by 0. But there is no restriction on number of 0’s. So
it is a string consisting of any combinations of 0’s and 10’s, ie regular expression = (0+10)*
Suppose string ends with 1, the above regular expression can be modified by inserting (1 + ε ) at
the end.
Regular expression for the given language = (0+10)* (1 + ε )
Page 5
Automata Theory and Computability Regular Expressions

Obtain regular expression to accept the following languages over Σ = { a, b}

i. Strings of a’s and b’s with substring aab.
Regular expression = (a+b)* aab(a+b)*
ii. Strings of a’s and b’s such that 4th symbol from right end is b and the 5th symbol from
right end is a.
Here the 4th symbol from right end is b and the 5th symbol from right end is a the
corresponding regu;lar expression = ab(a+b)(a+b)(a+b).
But the above regular expression can be preceded with any number of a’s and b’s.
Therefore the regular expression for the given language = (a+b)*ab(a+b)(a+b)(a+b).
iii. Strings of a’s and b’s such that 10th symbol from right end is b.
The regular expression for the given language = (a+b)*b(a+b)9.
iv. Strings of a’s and b’s whose lengths are multiple of 3.
OR
L = { |w| mod 3 = 0, where w is in Σ = { a, b}
Length of string w is multiple of 3, the regular expression = ((a+b) (a+b) (a+b))*
v. Strings of a’s and b’s whose lengths are multiple of 5.
OR
L = { |w| mod 5 = 0, where w is in Σ = { a, b}
Length of string w is multiple of 5,
The regular expression = ((a+b) (a+b) (a+b) (a+b)(a+b))*
vi. Strings of a’s and b’s not more than 3 a’s:
Not more than 3 a’s, regular expression= (ε+a) (ε+a) (ε+a).
But there is no restriction on b’s, so we can include b* in between the above regular
expression.
The regular expression for the given language = b*(ε+a) b*(ε+a) b* (ε+a) b*

Page 6
Automata Theory and Computability Regular Expressions

vii. Obtain the regular expression to accept the words with two or more letters but
beginning and ending with the same letter. Σ = { a, b}
Regular expression beginning and ending with same letter is = a a + b b. In between
include any number of a’s and b’s.
Therefore the regular expression = a (a+b)* a + b (a+b)* b
viii. Strings of a’s and b’s of length is either even or multiple of 3.
Multiple of regular expression = [(a+b) (a+b) (a+b)]*
Length is of even, regular expression = [(a+b) (a+b)]*
So the regular expression for the given language = ((a+b) (a+b) (a+b)]* + [(a+b)
(a+b))*
ix. Obtain the regular expression to accept the language L = { anbm | m+n is even }
Here n represents number of a’s and m represents number of b’s.
m+n is even results in two possible cases;
case i. when even number of a’s followed by even number of b’s.
regular expression : (aa)*(bb)*
case ii. Odd number of a’s followed by odd number of b’s.
regular expression = a(aa)* b(bb)*.
So the regular expression for the given language = (aa)*(bb)* + a(aa)* b(bb)*

x. Obtain the regular expression to accept the language L = { anbm | n  4 and m  3 }.

Here n  4 means at least 4 a’s, the regular expression for this = aaaa(a)*
m  3 means at most 3 b’s, regular expression for this = (ε+b) (ε+b) (ε+b).
So the regular expression for the given language = aaaa(a)* (ε+b) (ε+b) (ε+b).

xi. Obtain the regular expression to accept the language L = { anbm cp | n  4 and m  3 p 
2}.
Here n  4 means at least 4 a’s, the regular expression for this = aaaa(a)*
m  3 means at most 3 b’s, regular expression for this = (ε+b) (ε+b) (ε+b).
p  2 means at most 2 c’s, regular expression for this = (ε+c) (ε+c)

Page 7
Automata Theory and Computability Regular Expressions

So the regular expression for the given language = aaaa(a)* (ε+b) (ε+b) (ε+b) (ε+c)
(ε+c).
xii. All strings of a’s and b’s that do not end with ab.
Strings of length 2 and that do not end with ab are ba, aa and bb.
So the regular expression = (a+b)*(aa + ba +bb)
xiii. All strings of a’s, b’s and c’s with exactly one a.
The regular expression = (b+c)* a (b+c)*
xiv. All strings of a’s and b’s with at least one occurrence of each symbol in Σ = {a, b}.
At least one occurrence of a’s and b’s means ab + ba, in between we have n number
of a’s and b’s.
So the regular expression = (a+b)* a (a+b)* b(a+b)* +(a+b)* b(a+b)* a(a+b)*

Obtain the regular expression for the language L = { anbm | m  1, n  1, nm  3 }

Solution:
Case i. Since nm  3, if n = 1 then m should be  3. The equivalent regular expression is given
by: RE = a bbb(b)*

Case ii. Since nm  3, if m = 1 then n should be  3. The equivalent regular expression is given
by: RE = aaa(a)* b
Case iii. Since nm  3, if m  2 and n  2 then the equivalent regular expression is given by:
RE = aa(a)* bb(b)*
So the final regular expression is obtained by adding all the above regular expression.
Regular expression = abbb(b)* + aaa(a)*b + aa(a)*bb(b)*

Page 8
Automata Theory and Computability Kleen’s Theorem

The regular expression language provides three operators (precedence order from highest to
lowest)
1. Kleene star
2. Concatenation, and
3. Union
NOTE:
(α U ε) : optional α and expression can be satisfied either by matching α or the empty string.
(a U b)* : Describes the set of all strings composed of the characters a and b.
a* U b* ≠ (a U b)* : Every string in the language on the left contains only a’s or b’s whereas
right side it contains combination of a’s and b’s.
(ab)* ≠ a*b*: The language on the left contains the string abab….. while the
language on the right does not. The language on the right contains the string aaabbbb, while
the language on the left does not.
The regular expression a* is simply a string. It is different from Language L(a*) = {w: w is
composed of zero or more a's}.

Kleene's Theorem
• The regular expression language is a useful way to define patterns..
• Any language that can be defined by a regular expression can be accepted by some finite
state machine.
• Any language that can be accepted by a finite state machine can be defined by some
regular expression.

Page 9
Automata Theory and Computability FSM to Regular Expressions

*****Building an FSM from a Regular Expression

Theorem: Any language that can be defined with a regular expression can be accepted by some
FSM and so is regular.
OR
Prove that if R is a regular expression then there exists some ε-NFA that accepts L(R)
Proof: The proof is given by constructing FSM:
For a given regular expression α, we can construct an FSM M such that L (α) = L (M).
If α is any c ϵ ∑, we construct for it the simple FSM as:

If α is Ø, we construct for it the simple FSM as:

If α is ε we construct for it the simple FSM as:

Let β and γ be regular expressions that define languages over the alphabet ∑
If L (β) is regular, then it is accepted by some FSM M1 = (K1, ∑, δ1, s1, A1).
If L (γ) is regular, then it is accepted by some FSM M2 = (K2, ∑, δ2, s2, A2).
If regular expression α = β U γ and if both L(β) and L(γ) are regular, then we construct M3 =(K3,
∑, δ3, s3, A3), such that L(M3) = L(α ) = L(β ) U L(γ).
Construct a new machine M3, by creating a new start state s3, and connect it to the start states of
M1 and M2 via ε-transitions. M3 accepts if either M1 or M2 accepts.
So M3 = ({ s3} U K1 U K2, ∑, δ3, s3, A1 U A2 ) where δ3 = δ1 U δ2{((s3, ε), s1), (s3, ε ), s2)}

Page 10
Automata Theory and Computability FSM to Regular Expressions

If regular expression α = βγ and if both L(β) and L(γ) are regular, then we construct M 3 =(K3, ∑,
δ3, s3, A3), such that L(M3) = L(α ) = L(β )L(γ).
Construct a new machine M3, by connecting every accepting state of M1 to the start state of M2
via an ε-transition. M3 will start in the start state of M1 and will accept if M2 does. So M3 = ( K1
U K2, ∑, δ3, s1, A2) where δ3 = δ1 U δ2{((q, ε), s2) : q ϵ A1)}

If regular expression α = β*and L(β) is regular, then we construct M2 =(K2, ∑, δ2, s2, A2), such
that L(M2) = L(α ) = L(β )*
M2 is constructed by creating a new start state s2 and make it accepting state, thus assuming that
M2 accepts ε. We link the new s2 to s1 via an ε –transitions. Finally, we create ε -transitions from
each of M1's accepting states back to s1. So M2 = ({s2} U K1, ∑, δ2, s2, {s2 } U A1) where δ2 = δ1
U {((s2, ε), s1) } U {((q, ε), s1) : q ϵ A1}

Page 11
Automata Theory and Computability FSM to Regular Expressions

NOTE:
Finite state Machines constructed from Regular expression are typically highly non-eterministic
because of their use of ε-transitions. These FSM’s have a large number of unnecessary states. As
a practical matter, it is not a problem, since, given an arbitrary NDFSM M, we have an algorithm
that can construct an equivalent DFSM M’. We also have an algorithm that can minimize M’
Construct a FSM for the regular expression (b U ab)*
OR
Convert the regular expression (b + ab)* to an ε- NFA
OR
Convert the regular expression (b, ab)* to a FSM.
FSM for b :

FSM for a :

FSM for ab :

FSM for (b U ab) :

Page 12
Automata Theory and Computability FSM to Regular Expressions

FSM for (b U ab)*

Convert the regular expression (0 + 1)* 1 ( 0 + 1) to an ε- NFA or a FSM.

Convert the regular expression (0 U 1)* 1 ( 0 U 1) to an ε- NFA or a FSM
FSM for 0:

FSM for 1:

Page 13
Automata Theory and Computability FSM to Regular Expressions

FSM for (0+1):

FSM for (0+1)*:

FSM for (0+1)* 1 :

Page 14
Automata Theory and Computability FSM to Regular Expressions

FSM for (0+1)* 1 (0+1) :

Convert the regular expression (01+ 1)* to an ε- NFA or a FSM

FSM for 01:

FSM for 1:

FSM for (01+1):

Page 15
Automata Theory and Computability FSM to Regular Expressions

FSM for (01+ 1)*:

Convert the regular expression (0 U 1)*01 to an ε- NFA or a FSM

FSM for 0:

FSM for 1:

FSM for (0 U 1) or (0+1):

Page 16
Automata Theory and Computability FSM to Regular Expressions

FSM for (0 U 1)* or (0+1)*:

FSM for 01:

FSM for (0 U 1)01 or (0+1)01:

Page 17
Automata Theory and Computability FSM to Regular Expressions

Convert the regular expression 0* + 1* + 2* to an ε- NFA or a FSM

FSM for 0:

FSM for 1:

FSM for 2:

FSM for 0*:

FSM for 1*:

FSM for 2*:

Page 18
Automata Theory and Computability FSM to Regular Expressions

FSM for 0 * + 1* + 2*:

Page 19
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

Building a Regular Expression from an FSM (State Elimination Technique)

How to build a regular expression for a FSM. Instead of limiting the labels on the transitions of
an FSM to a single character or ε, we will allow entire regular expressions as labels.
• For a given input FSM M, we will construct a machine M’ such that M and M’ are
equivalent and M’ has only two states, start state and a single accepting state.
• M’ will also have just one transition, which will go from its start state to its accepting
state. The label on that transition will be a regular expression that describes all the strings
that could have driven the original machine M from its start state to some accepting state
Algorithm to create a regular expression from FSM: (State elimination)
1. Remove any states from given FSM M that are unreachable from the start state
2. If FSM M has no accepting states then halt and return the simple regular expression Ø.
3. If the start state of FSM M is part of a loop (i.e: it has any transitions coming into it), then
create a new start state s and connects to M ‘s start state via an ε-transition. This new
start state s will have no transitions into it.
4. If a FSM M has more than one accepting state or if there is just one but there are any
transitions out of it, create a new accepting state and connect each of M’s accepting states
to it via an ε-transition. Remove the old accepting states from the set of accepting states.
Note that the new accepting state will have no transitions out from it.
5. At this point, if M has only one state, then that state is both the start state and the
accepting state and M has no transitions. So L (M} = {ε}. Halt and return the simple
regular expression as ε.
6. Until only the start state and the accepting state remain do:
Select some state rip of M. Any state except the start state or the accepting state may be
chosen.
Remove rip from M.
Modify the transitions among the remaining states so that M accepts the same
strings The labels on the rewritten transitions may be any regular expression.
7. Return the regular expression that labels the one remaining transition from the start state
to the accepting state

Page 20
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

Consider the following FSM M: Show a regular expression for L(M).

OR
Obtain the regular expression for the following finite automata using state elimination method

We can build an equivalent machine M' by eliminating state q2 and replacing it by a transition
from q1 to q3 labeled with the regular expression ab*a.
So M' is:

Regular Expression = ab*a

Obtain the regular expression for the following finite automata using state elimination method

There is no incoming edge into the initial state as well as no outgoing edge from final state. So
there is only two states, initial and final.

Regular expression = (a+b+c) or (a U b U c)

Page 21
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

Obtain the regular expression for the following finite automata using state elimination method

There is no incoming edge into the initial state as well as no outgoing edge from final state.
After eliminating the state B:

Regular expression = ab
Obtain the regular expression for the following finite automata using state elimination method

There is no incoming edge into the initial state as well as no outgoing edge from final state.
After eliminating the state B:

Regular expression = ab*c

Obtain the regular expression for the following finite automata using state elimination method

Page 22
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

Since initial state has incoming edge, and final sate has outgoing edge, we have to create a new
iniatial and final state by connecting new initial state to old initial state through ε and old final
state to new final state through ε. Make old final state has non-final state.

After removing state A:

After removing state B:

Regular expression: 0(10)*

Obtain the regular expression for the following finite automata using state elimination method

Page 23
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

Since there are multiple final states, we have to create a new final state.

After removing states C, D and E:

After removing state B:

Regular Expression: a(b+c+d)

Obtain the regular expression for the following finite automata using state elimination method

Page 24
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

After inserting new start state:

After removing state A:

After removing state B:

Regular expression: b(c +ab)*d

Obtain the regular expression for the following finite automata using state elimination method

Page 25
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

By creating new start and final states:

After removing state B:

After removing state A:

Regular expression: (0+101)

Obtain the regular expression for the following finite automata using state elimination method

Page 26
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

By creating new start and final states:

After removing state q1:

After removing state q2:

After removing state q3:

Regular expression: 1001(0+101)

Page 27
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

Obtain the regular expression for the following finite automata using state elimination method

By creating new start state and final state:

After removing q1 state:

Page 28
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

After removing q2 state:

After removing q3 state:

After removing q0 state:

Regular expression: (01+10)*

Page 29
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

Consider the following FSM M: Show a regular expression for L(M).

OR
Obtain the regular expression for the following finite automata using state elimination method.

Since start state 1 has incoming transitions, we create a new start state and link that state to state
1 through ε.

Since accepting state 1 and 2 has outgoing transitions, we create a new accepting state and link
that state to state 1 and state 2 through ε. Remove the old accepting states from the set of
accepting states. (ie: consider 1 and 2 has non final states)

Page 30
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

Consider the state 3 has rip and remove that state:

Consider the state 2 has rip and remove that state:

Consider the state 1 has rip and remove that state:

Finally we have only start and final states with one transition from start state 1 to final state 2,
The labels on transition path indicates the regular edpression.

Page 31
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

Regular Expression = (ab U aaa* b)* (a U ε )

Consider the following FSM M: Show a regular expression for L(M).

After creating new start and final states:

After removing q2 state:

After removing q1 state:

Page 32
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

After removing q0 state:

Regular expression: 0* (ε + 1+) = 0* 1*

Consider the following FSM M: Show a regular expression for L(M). OR
Construct regular expression for the following FSM using state elimination method

By creating new state and final states.

Page 33
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

After removing D state:

After removing E state:

After removing A state:

After removing B state:

After removing C state:

Page 34
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

Regular expression = (00)11(11)

Consider the following FSM M: Show a regular expression for L(M). OR
Construct regular expression for the following FSM using state elimination method

By creating final state.

After removing q1state:

Page 35
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

After removing q2state:

After removing q3state:

Regular expression= 0101

Consider the following FSM M: Show a regular expression for L(M). OR
Construct regular expression for the following FSM using state elimination method

By creating new start and final states:

Page 36
Automata Theory and Computability Regular Expressions to FSM (State Elimination)

After removing q0 state:

After removing q1 state:

After removing q2 state:

After removing q3 state:

Regular expression: (0+1)1(0+1) +(0+1)1(0+1)(0+1)

Page 37
Automata Theory and Computability FSM to Regular Expressions ( Kleen’s Theorem)

Kleen’s Thereom
Theorem: Every regular language (ie: every language that can be accepted by some FSM) can
be defined with a regular expression.
This proof is by construction of FSM (construct a new FSM by using the following steps)
1. Remove any states from given FSM M that are unreachable from the start state.
2. If the start state of M is part of a loop (i.e: it has any transitions coming into it), then
create a new start state s and connects to M ‘s start state via an ε-transition. This new
start state s will have no transitions into it.
3. If there is more than one accepting state of M or if there is just one but there are any
transitions out of it, create a new accepting state and connect each of M’s accepting states
to it via an ε-transition. Remove the old accepting states from the set of accepting states.
Note that the new accepting state will have no transitions out from it.
4. If there is more than one transition between states p and q, collapse them into a single
transition.
5. If there is a pair of states p, q and there is no transition between them and p is not the
accepting state and q is not the start state, then create a transition from p to q labeled Ø.
6. At this point, if M has only one state, then that state is both the start state and the
accepting state and M has no transitions. So L (M} = {ε}. Halt and return the simple
regular expression as ε.
7. If M has no accepting states then halt and return the simple regular expression Ø.
8. Until only the start state and the accepting state remain do:
i. Select some state rip of M. Any state except the start
state or the accepting state may be chosen.
ii. For every transition from some state p to some state ,
if both p and q are not rip then, using the current
labels given by the expressions R, compute the new
label R ' for the transition from p to q using the formula:
R'(p, q) = R(p, q) U R(p, rip)R(rip, rip)* R(rip, q)
9. Remove rip and all transitions into and out of it.

Page 38
Automata Theory and Computability FSM to Regular Expressions ( Kleen’s Theorem)

10. Return the regular expression that labels the one remaining transition from the start state
to the accepting state.
Construct the regular expression for the following FSM using Kleen’s Theorem

By Adding all the required transitions, the FSM becomes:

Ripping States Out One at a Time:

Let rip be state 2. Then:
R'(1, 3) = R(1, 3) U R(1, rip)R(rip, rip)*R(rip, 3).
= R(1, 3) U R(1, 2) R(2, 2)* R(2, 3)
R(1, 3)3 = Ø U a b* a = ab*a
R’(p, q)k = R(p, q)k-1 + R(p, k)k-1 (R(k, k)k-1)*R(k,q)k-1
R(1, 3)4 = R(1, 3)3 + R(1, 4)3 (R(4, 4)3)*R(4,3)3
R(1, 4)3 = b
R(4, 4)3 = Ø
R(4, 3)3 = bb*a we know that (Ø)* = ε
Regular expression = ab*a + bbb*a

Page 39
Automata Theory and Computability Application of Regular expressions

Applications of Regular Expressions

• Text editors: which are some programs used for processing the text. Example: UNIX text
editor uses the RE for substituting the strings.
• Lexical Analysers Regular expressions are extensively used in the design of Lexical
analyzer phase, for example to recognize identifier as : (letter) (letter+digit)*
• Regular expressions are used to search patterns in text.
Manipulating and simplifying regular expressions:

Identity rule Example

ɛR=Rɛ=R 1ɛ=ɛ1=1

ØR = RØ = Ø 1Ø = Ø1 = Ø

ɛ*=ɛ

(Ø)* = ɛ

Ø + R = R+Ø = R Ø +1 =1

R +R = R 1U1=1

RR* =RR = R+ 00 = 0+

(R*)* = R* (1*)* = 1*

R* R* = R*
ɛ + RR* = R* ɛ + 1+ = 1*
(P+Q)R = PR +QR
(P+Q)* =(P*Q*) =
R*(ɛ + R) = (ɛ + R) R* =

(ɛ + R)* = R*
ɛ + R* = R*
(PQ)* P = P(QP)*
R*R + R = R*R =R+

Page 40
Automata Theory and Computability Simplification of Regular expressions

Prove that (1 + 001) + (1 + 001) (0 + 101) (0 + 101) = 01(0 + 101)

Let us consider LHS : (1 + 00*1) + (1 + 00*1) (0 + 10*1)* (0 + 10*1)
By taking a common factor as (1 + 00*1)

(1 + 001) (ɛ + (0 + 101)* (0 + 10*1))

We know that (ɛ + R*R ) = R* where R = (0 + 10*1)
The above expression reduces to:
(1 + 00*1) (0 + 10*1)*
Again taking 1 as common factor in above expression:
(ɛ +00*)1 (0 + 10*1)*
By applying (ɛ +00*) =0*
0*1 (0 + 10*1)* = RHS
Hence the two regular expressions are equivalent.

Prove that ɛ +1* (011)* (1* (011)) = (1 + 011)*

Let LHS is
ɛ +1* (011)* (1* (011)*)*
By considering 1* (011)* as R and applying (ɛ + R*R ) = R*
*
LHS = (1* (011)*)
Put P = 1 and Q = 011* and by applying
(P* Q*)* = (P + Q)*
= (1 + 011)*
= RHS
LHS = RHS
Hence the proof
State True or False : Every subset of regular language is regular
Let us consider one regular language as L1 = { 0*}
The subset of L1 say L2 = { 02n | n ≥ 1 } is not a regular language. (Its not possible to construct
FSM)
Hence the given statement is False.

Page 41
Automata Theory and Computability Simplification of Regular expressions

Check the equivalence of the regular expressions: (abbb)a* and a(bbba)*

(ab)*a = a(ba)*
By identity rule (PQ)* P = P(QP)*
Let us map this rule with the given expression:
P = a* Q = bbb

Hence the given expressions are equivalent.

(ab)a = a(ba) : True, by using the above identity rule.( P = a and Q= b)

Exercises

Write a regular expressions to describe each of the following languages

1 {w € {a. b}* : every a in w is immediately preceded and followed by b}.

Regular expression = (b + bab)*
2 { w € {a. b}* : w does not end in ba}.
Regular expression = ε + a + (a + b)* (b + aa)
3 { w € {0,1}* : w has 001 as a substring}
Regular expression = (0 + 1)* 001 (0 + 1)*
4 { w € {0, 1}* : w does not have 001 as a substring}
Regular expression = (1 + 01)* 0*
5 {w € {a. b}* : w has both aa and bb as substrings}
Regular expression = (a + b)* aa (a + b)* bb (a + b)* + (a + b)* bb (a + b)* aa (a + b)*
6 {w € {a. b}* : w has both aa and aba as substrings}
Regular expression = (a + b)* aa (a + b)*aba (a + b)* + (a + b)* aba (a + b)* aa (a +
b)* + (a + b)* aaba (a + b)* + (a + b)*abaa (a + b)*
7 {w € {0 ,1}* : none of the prefixes of w ends in 0}
Regular expression = 1*
8

Regular expression = (ba ba ba)b*

Page 42
Automata Theory and Computability Simplification of Regular expressions

9
( Number of a’s in w is at most 3)
Regular expression = b* (a + ε) b* (a + ε) b* (a + ε) b*
10 {w € {a. b}* : w contains exactly two occurrences of the substring aa}
Regular expression = (b + ab)*aaa (b + ba)* + (b + ab)*aab (b + ab)*aa(b + ba)*
Simplify each of the following regular expressions
a (a U b)* (a U ε) b* = (a U b)*
b (Ø* + b) b*.
We know that Ø* = ε
(ε +b) b* = b* + bb* = b*
c (a U b)*a* U b = (a U b)*
d ((a U b)*)* = (a U b)*
e ((a U b)+ )* = (a U b)*
Let L = {an bn : 0 ≤ n ≤ 4}.
Regular expression = (ε + ab + aabb + aaabbb + aaaabbbb)
Write the regular expression for the following FSM M using state elimination method.

By creating a new start and final state.

Page 43
Automata Theory and Computability Simplification of Regular expressions

After removing state q3:

After removing state q1:

Page 44
Automata Theory and Computability Simplification of Regular expressions

After removing state q2:

After removing state q0:

Regular expression = ( a + bbaa) (ε + bb*(ε +a))

Write the regular expression for the following FSM M using state elimination method

By creating a new Final state:

Page 45
Automata Theory and Computability Simplification of Regular expressions

After removing q2 state:

After removing q1 state:

After removing q3 state:

Regular expression = ε + ((a + ba) (ba)* b)

Page 46
Automata Theory and Computability Regular grammars

Regular Grammars
So far we have considered two equivalent ways to describe exactly the class of regular
languages:
 Finite state machines.
 Regular expressions.
We now introduce a third:
• Regular grammars (sometimes also called right linear grammars).
Define regular Grammar
A regular grammar G is a quadruple (V, ∑, R,S)
where:
• V is the rule alphabet, which contains Non-terminal symbols and Terminal symbols.
• ∑ is the set of terminal symbols ( Subset of V)
• R is finite set of rules of the form X → Y
• S is the start symbol, which is a non-terminal symbol.
Example: G = ({ A, C, a, b}, {a, b}, R, A) where rule R is: A → ε, A → b, A → aC and C →a
Non-Terminal and Terminal Symbols:
Non-terminal Symbols: symbols that are used in the grammar but that do not appear in strings
in the language. In above example A, C are non-terminal (Variable) symbols.
Terminal Symbols: symbols that can appear in strings generated by G. In above example a, b
are terminal symbols.
Rules R of any Regular Grammar:
Rule R is of the form X → Y must satisfy the following 2 conditions:
1. Left-hand side contains only one symbol that must be a non- terminal.
2. RHS contains ε or a single character (terminal) or a single character (terminal) followed
by a single non-terminal
Example: A → ε or A → b or A → aA are legal Rules
BA → ε , A → aSa are not legal rules.
Language generated by a Grammar:
The language generated by a grammar G =( V, ∑, R, S ), denoted L( G), is the set of all strings w
in ∑* such that it is possible to start with S, apply some finite set of rules in R, and derive w.
Page 47
Automata Theory and Computability Regular grammars

To generate any string by using regular grammar G; to start with S, apply derivation step, by
replacing non-terminal symbol in each derivation step until the required string is generated.
Regular Grammars and Regular Languages:

Theorem : The Regular grammar defines exactly the regular languages or The class of languages
that can be defined with regular grammars is exactly the regular languages.
To prove this theorem one must prove that for given regular grammar it is possible to construct
equivalent FSM or from FSM it is possible to get the regular grammar.

Method for conversion of FSM to Regular Grammar G:

Conversion of FSM to regular Grammar G = ( { A0, A1,………An}, ∑, R, A0 ) is as follows:
Where R is the set of production rules can be defined by following rules:
1. Ai →aAj is a production rule if δ(Ai , a) = Aj where Aj can be any final or non-final state
but not the final state #

2. Ai →a is a rule if δ(Ai , a) = Aj where Aj is final state #

3. Ai →ε is a rule if Ai is any final state other than final state #
Note: Final state # of FSM is not the part of regular grammar rule.
Let us consider one regular language L = { strings of a’s and b’s with at least one a}
The regular expression which defines the regular language L is = (a + b) * a (a + b)*
The FSM which accepts the regular language L:

By applying the above methods (Above FSM doesnot contain final state as #)
The regular Grammar which defines the regular language L is:
Regular grammar G = ( { S, A, a, b}, { a, b}, R, S ) where S is the start Non-terminal symbol of
grammar G and R is the rules defined as:
S→ aA
S → bS
A → aA
Page 48
Automata Theory and Computability Regular grammars

A → bA
A→ε
Method for conversion of Regular Grammar G to FSM:
1. Create in FSM M a separate state for each non terminal in V.
2. Make the state corresponding to S the start state.
3. If there are any rules in R of the form X → w, for some w € ∑ then create an additional
state labelled #.
4. For each rule of the form X → wY, add a transition from X to Y labelled w.
5. For each rule of the form X → w, add a transition from X to # labelled w.
6. For each rule of the form X → ɛ, mark state X as accepting.
7. Mark state # as accepting.
Consider the following regular grammar G:
S → aT
T → bT
T→a
T → aW
W→ε
W → aT
The equivalent FSM for the given regular grammar G:

Page 49
Automata Theory and Computability Regular grammars

Write the regular grammar for the language L = {w € {a, b }*: |w| is even}.
The following DFSM M accepts L:

By converting FSM to regular grammar G:

G = ({S, T, a, b}, {a, b}, R, S) where R is the rule defined as follows:
S → aT
S → bT
T → aT
T → bT
S→ ε
Write the regular grammar for the language L = {w € {a, b }* : w ends with pattern aaaa}.
The following FSM (NDFSM) M accepts L:

By converting FSM to regular grammar G:

G = ({S, B, C, D, a, b}, {a, b}, R, S) where R is the rule defined as follows:
S → aS
S → bS
S → aB
B → aC
C→ aD
D→ a

Page 50
Automata Theory and Computability Regular grammars

Write the regular grammar for L = {w € {a, b}*: w contains an odd number of a's and w ends in
a}. Also generate the string baaba by using this regular grammar.
DFSM which accepts L is:

Regular grammar G = ({S, T, X, a, b}, {a, b}, R, S) where R is the rule defined as follows:
S → bS
S → aT
T → aS
T→ bX
T→ ε
X→ aS
X→ bX
To generate the string baaba, start with S apply derivation process, by replacing non terminal
symbol in each step, until the required string is generated.

Page 51
Automata Theory and Computability Regular grammars

Show a regular grammar for the language: L ={ w € {a, b }*: w contains an even number of a's
and an odd number of b's }
DFSM which accepts L is:

Regular grammar G = ({S, A, B, C, a, b}, {a, b}, R, S) where R is the rule defined as follows:
S → aA
S → bB
A → aS
A→ bC
B→ aC
B→ bS
B→ ε
C→ aB
C→ bA
Show a regular grammar for the language: L ={ w € {a, b }*: w does not end in aa }
DFSM which accepts L:

Page 52
Automata Theory and Computability Regular grammars

Regular grammar G = ({S, A, B, a, b}, {a, b}, R, S) where R is the rule defined as follows:
S → aA | bS | ε
A → aB | bS | ε
B→ aB | bS
Show a regular grammar for the language: L ={ w € {a, b }*: w does not contain the substring
aabb }

DFSM which accepts L:

Regular grammar G = ({S, A, B, C, a, b}, {a, b}, R, S) where R is the rule defined as follows:
S → aA | bS | ε
A → aB | bS | ε
B→ aB | bC | ε
C→ aA | ε

Page 53
Automata Theory and Computability Regular and Non regular Languages

Regular and Non regular Languages.

The language L = { a*b*} is a regular language because it can be modeled by a FSM.
The language L = { an bn| n≥ 0 } is not a regular language because it is not possible, given some
finite number of states, to count an arbitrary number of a’s and then compare that count to the
number of b's.
Regular languages are normally denoted by some regular expression.
How Many Regular Languages Are There?
There is a countably infinite number of regular languages. There cannot be more regular
languages than there are DFSMs. There are at most a countably infinite number of regular
languages. There is no one to one relationship between RLs and DFSM’s, since there is an
infinite number of machines that accept any given language.
Example: {a}, { aa} , { aaa}, { aaaa}. { aaaaa}, { aaaaaa } ….. .. .
Showing That a Language is Regular.
Theorem: Every finite language is regular.
Proof:
• If L is the empty language ( no strings); L = { } Then the regular expression
corresponding to L is = Ø , so L is regular.
• If any finite language L composed of the strings s1, s2, ... sn for some positive integer n,
then it is defined by the regular expression: s1 U s2 U……………..U sn ; So L is regular.
• Intersection of two infinite languages is finite.
Example: L1 = { anbn |n ≥ 0 } and L2 = { bnan |n ≥ 0 }. Here both L1 and L2 are non-regular. But
L1∩L2 = { ε } which is finite, is a regular language.
Prove that the following language L is regular or not?
i. L = { ai bj |i, j ≥ } 0 and i + j = 5
L is Regular. A simple FSM with five states just counts the total number of characters.
ii. L = { ai bj |i, j ≥ } 0 and i - j = 5
L is Not Regular. L consists of all strings of the form a*b* where the number of a’s is five more
than the number of b’s. So we need infinite number of states to model this.
iii. L = { ai bj | 0 ≤ i < j< 2000 }
L is Regular, because language is finite.
iv. {w ∈ {Y, N}* : w contains at least two Y’s and at most two N’s}.
L is Regular. L can be accepted by an FSM that keeps track of the count (up to 2) of Y’s and
N’s.
Page 54
Automata Theory and Closure properties of
Computability RLs

Closure Properties of Regular Languages.

If certain languages are regular and language L is formed from them by certain operations such
as union, concatenation, difference, star closure etc. then L is also regular. These properties are
called closure properties. Closure property is a useful tool for building many complex automata.
The various closure properties of regular languages are:
1. Regular languages are closed under Union, concatenation and star (closure) operation.
2. The complement of a regular language is regular.
3. The intersection of two regular languages is regular.
4. The difference of two regular languages is regular.
5. The reversal of a regular language is regular.
6. A homomorphism of regular languages is regular.
7. The inverse homomorphism of regular language is regular.
1. Regular languages are closed under Union, concatenation and star (closure) operation.:
If L1 and L2 are regular languages, then prove that L1U L2, L1.L2 and L1* is also regular
language.
Proof: Since L1 and L2 are regular languages, they have regular expressions, say R1 and R2 such
that L1 = L(R1) and L2 = L(R2).
By definition of regular expressions, we have
R1 U R2 is a regular expression denoting the language L1U L2. ( ie: L(R1) U L(R2) )
R1.R2 is a regular expression denoting the language L1.L2 ( ie: L(R1) L(R2) )
R1* is a regular expression denoting the language L1*( ie: L(R1)* )
So L1U L2, L1.L2 and L1* is also regular language.
2. Closure under complementation:
The complement of a regular language is regular.
If L is regular language over alphabet Σ, then is also regular language.
Proof:
Let L = L(M1) for some DFSM M1 = ( Q, Σ, δ, s, A). Then = L(M2), where M2 is the DFSM =
( K, Σ, δ, s, K - A). That is M2 is exactly like M1, but the accepting states of M1 have become
non-accepting states of M2, and vice versa.

Page 55
Automata Theory and Closure properties of
Computability RLs

So the language which is rejected by M1 is accepted by M2 and vice versa. Thus we have a
machine M2 which accepts all those strings ‘w’ in that are rejected by machine M1. So the
complement of regular language L is is also regular.
3. Closure under intersection:
The intersection of two regular languages is regular.
If L and M are regular languages then show that L∩M is also regular.
Proof::
Let L and M are regular languages. We know that complement of a regular language is regular.
So complement of L, ie: and complement of M, ie: is also regular language.
Also union of two regular languages is a regular language.
So union of and , ie: U is also regular.

But we know that complement of U ie: = is also regular.

According to De-morgan’s law, L∩M = which is a regular language.

Therefore L∩M is also regular.

4. The difference of two regular languages is regular.

If L and M are regular languages, then so is L – M.
Proof: L and M are regular languages,
We know that complement of a regular language is again a regular language. So the complement
of M , ie: is regular.
Also the intersection of two regular languages is regular, ie: L ∩ is also regular.
According De-morgan’s law L – M = L ∩ is regular language.
Therefore L – M is a regular language.
5. The reversal of a regular language is regular.
If L is regular language, so is LR
Proof: Assume that L is defined by a regular expression E.
We have to show that there is another regular expression ER such that L(ER) = (L(E) )R
By definition of a regular expression, ε, φ, or a for some symbol a, is a regular expression.
If, E = ε , ER = ε , E = φ, ER = φ and E = a, ER = a then ER is same as E.

Page 56
Automata Theory and Closure properties of
Computability RLs

Also by definition of regular expression,

1. E = E1 + E2 then ER = E1R + E2R ie: the reversal of the union of two languages is obtained
by computing the reversal of the two languages and taking the union of those languages.
2. E = E1E2. Then ER = E2R E1R.
3. E = E1*. Then ER = (E1R)* ie: string is in L(E) if and only if its reversal is in L(E1R)*
Homomorphism (Letter substitution)
The term homomorphism means substitution of strings by some other symbols.
Example: string aabb can be written as 0011 under Homomorphism by replacing a with 0 and b
with 1.
Let ∑ is the set of input alphabets and Г be the set of substitution symbols, then ∑* → Г* is
homomorphism.
Let w = a1, a2, a3, .............................. an
Then h(w) = h(a1) h(a2) h(a3) ................ h(an)
If L is a language belongs to set ∑, then the homomorphic image of L can be defined as:
h(L) = { h(w) : w € L }
6. A homomorphism of a regular language is regular.
Proof: by counter example
Let L is a regular language, we have to prove that h(L) is also regular.
∑ = { a, b} and string w = abab
h(a) = 00
h(b) = 11
By definition h(w) = h(a1) h(a2) h(a3) ................ h(an)
h(w) = h(a) h(b) h(a) h(b) = 00110011
The homomorphism to a language is applied by applying homomorphism on each string of
language. Let L = ab*b ( regular language) ie: L = { ab, abb, abbb, abbbb, ..................... }
h(L) = {0011, 001111, 00111111, 0011111111, ........ }
h(L) = 00(11)+ ; h(L) can be represented by RE, so it is a regular language.
7. The inverse homomorphism of regular language is regular.
Proof: Let ∑* → Г* is homomorphism, where ∑ is the set of input alphabets and Г be the set of
substitution symbols, then h(L) be homomorphic language.

Page 57
Automata Theory and Closure properties of
Computability RLs

The inverse of homomorphic language can be represented as h-1(L)

Let h-1(L) = { w : w € L }
If L is regular then we know that h(L) is also regular. Since L is regular there exists some FSM
M = ( K, ∑, δ, s, A) which accepts L. Then h(L) must also accepted by M.
For complement of L ie: is also regular which is accepted by FSM M’ where Final states of M
becomes non final states and all the non final states of M become the final states. Clearly is
accepted’ by M. The inverse homomorphic language of is h-1 (L) is also accepted by M’. So , h-
1
(L) is also regular.

Page 58
Automata theory and Computability Showing that language is not RL

Showing That a Language, is Not Regular.

We can show that a language is regular by exhibiting a regular expression or an FSM or a finite
list of the equivalence classes of ~ L or a regular grammar or by using the closure properties
that we have proved hold for the regular languages. But how shall we show that a language is not
regular. In other words, how can we show that none of those descriptions exists for it'?
We need a technique that does not rely on our cleverness.
What we can do is to make use of the following observation about the regular languages:
Every regular language L can be accepted by an FSM M with a finite number of states. If L is
infinite, then there must be at least one loop in M. All sufficiently long strings in L must be
characterized by one or more repeating patterns corresponding to the substrings that drive M
through its loops.
Let us consider the following 5 state FSM:

 It can accept infinite number of strings.

 The longest string that can be accepted by the FSM without going through any loops has
length 4.
 The total number of states in FSM is 5 ie: |k| = 5.
 Take any longest string w, such that length of longest string in L ≥ number of states in
FSM
 If we take string w = babbab, then |w| = 6 which is greater than the number of states in
FSM. Here the second b in w drove the FSM through its loop, we call it as y
 Suppose if we remove this y (pump y out), resulting in a new string babab which is also
accepted by FSM.
 Also we can pump in (adding one or more) as many copies of b as we like, generating
such strings as babbbab, babbbbab and so forth. FSM also accepts all of them.
 In the original string w = babbab, the third b also drove FSM through its loop. We could
also pump it in or out and we get a similar result.

Page 59
Automata theory and Computability Showing that language is not RL

Let us consider one more FSM with 5 states:

This FSM accepts only one string, aab. The only string that can drive FSM through its loop is ɛ
No matter how many times FSM goes through the loop, it cannot accept any longer strings.
Therefore the length of pumping string y must be greater 0. It should not be empty.
 This property of FSMs and the languages that they can accept is the basis for a powerful
tool for showing that a language is not regular.
 If a language contains even one long string that cannot be pumped in the fashion that we
have just described, then it is not accepted by any FSM and so is not regular.
We formalize this idea, in Pumping Theorem.
The Pumping Theorem for Regular languages (Pumping Lemma for Regular Languages)
**********State and prove pumping theorem for regular languages
Theorem:
Let L be a regular language. Then there exists a constant ‘k’ (number of states in FSM which
depends on L) such that for every string ‘w’ in L such that |w| ≥ k, we can break w into three
strings, w = xyz, such that:

|y| > 0 ie: y ≠ ε

|xy| ≤ k
For all q ≥ 0, the string xyqz is also in L
Proof: Suppose L = L(M) for some DFSM ‘M’ and L is regular language. Suppose ‘M’ has ’k’
number of states. Consider any string w = a1a2a3………………..am of length ’m’ where m ≥ k and
each ai is an input symbol. Since we have ‘m’ input symbols, naturally we should have ‘m+1’
states, in sequence q0, q1, q2……………….qm where q0 is the start state and qm is the final state.

Page 60
Automata theory and Computability Showing that language is not RL

Since |w| ≥ k, by the pigeonhole principle it is not possible to have distinct transitions, since
there are only ‘k’ different states. So one of the state can have a loop. Thus we can find two
different integers i and j with 0 ≤ i < j ≤ k, such that qi = qj. Now we can break the string w = xyz
as follows:

x = a1a2a3 .....................ai.

y = ai+1, ai+2, aj ( loop string where i = j)

z = aj+1, aj+2, ................. am.

The relationships among the strings and states are given in figure below:

‘x’ may be empty in the case that i= 0. Also ‘z’ may be empty if j = k = m. However, y cannot be
empty, since ‘i’ is strictly less than ‘j’.

Thus for any integer q ≥ 0, xyqz is also accepted by DFSM ‘M’; that is for a language L to be a
regular, xyqz is also in L for all q ≥ 0.

Show that L = { anbn | n ≥ 0 } is not regular.

Assume that given language L is regular language and there exist some ‘k’ number of states,
such that any string w, where |w| ≥ k must satisfy the conditions of the theorem.
Let w = ak bk
Since |w| = k + k = 2k ≥ k, we can split ‘w’ into xyz such that |xy| ≤ k and |y | ≥ 1 as
w = xyz
Since |xy| <= k, y must occur within the first k characters and so y = ap for some p. Also y ≠ε,
p must be greater than 0.
x = ak – p y = ap z = bk
According to pumping lemma, language to be regular, xyqz € L for all q ≥ 0.

Page 61
Automata theory and Computability Showing that language is not RL

Let q =2 and the resulting string w = ak – p (ap)2 bk where p ≥ 1 = ak + p bk must be in L.

But it is not since it has more a’s than more b’s.
Thus there exists at least one long string w in L that fails to satisfy the conditions of the Pumping
Theorem. So anbn | n ≥ 0 is not regular.

NOTE: Show that L= {w €( a, b)* | na(w) = nb(w) } is not regular.

We can prove that L is not regular by taking string w= anbn | n>=0. Refer previous problem.

Show that L = {w € { ), ( }*: the parentheses are balanced} are not regular
Assume that given language L is regular language and there exist some ‘k’ number of states,
such that any string w, where |w| ≥ k must satisfy the conditions of the theorem.
Let w = (k )k
Since |w| = k + k = 2k ≥ k, we can split ‘w’ into xyz such that |xy| ≤ k and |y | ≥ 1 as
w = xyz
Since |xy| <= k, y must occur within the first k characters and so y = (p for some p. Also y ≠ε,
p must be greater than 0.
x = (k – p y = (p z = )k
According to pumping lemma, language to be regular, xyqz € L for all q ≥ 0.

Let q =2 and the resulting string w = (k – p (2p )k where p ≥ 1 = (k + p )k must be in L.

But it is not since it has more (’s than more )’s.
Thus there exists at least one long string w in L that fails to satisfy the conditions of the Pumping
Theorem. So {w € { ), ( }*: the balanced parentheses}is not regular.
Show that L = { wwR | w € (a, b)* } is not regular.
Assume that given language L is regular language and there exist some ‘k’ number of states,
such that any string w, where |w| ≥ k must satisfy the conditions of the theorem.
Let us consider one string defined in L; w = ak bk bk ak
Since |w| = k + k+ k + k = 4k ≥ k, we can split ‘w’ into xyz such that |xy| ≤ k and |y | ≥ 1 as
w = xyz
Since |xy| <= k, y must occur within the first k characters and so y = ap for some p. Also y ≠ε,

Page 62
Automata theory and Computability Showing that language is not RL

p must be greater than 0.

x = ak – p y = ap z = bk bk ak
According to pumping lemma, language to be regular, xyqz € L for all q ≥ 0.

Let q =2 and the resulting string w = ak – p (ap)2 bk bk ak where p ≥ 1 = ak + p bk bk ak must be in L

But it is not since the first half of the string has more a's than the second half does. So it is not in L.
Thus there exists at least one long string w in L that fails to satisfy the conditions of the Pumping
Theorem. So L = { wwR | w € (a, b)* is not regular.}
Show that L = { anbm | n > m } is not regular.
Assume that given language L is regular language and there exist some ‘k’ number of states,
such that any string w, where |w| ≥ k must satisfy the conditions of the theorem.
Let w = ak+1 bk (Language contains strings of more a’s than b’s)
Since |w| = k +1+ k = 2k +1 ≥ k, we can split ‘w’ into xyz such that |xy| ≤ k and |y | ≥ 1 as
w = xyz
Since |xy| <= k, y must occur within the first k characters and so y = ap for some p. Also y ≠ε,
p must be greater than 0.
x = ak – p y = ap z = abk
According to pumping lemma, language to be regular, xyqz € L for all q ≥ 0.

Let q = 0 and the resulting string w = ak – p (ap)0 abk where p ≥ 1 = ak+1- p bk must be in L
But it is not since p > 0 and k + 1 - p <= k, so the resulting string no longer has more a's than b's
and so is not in L.
Thus there exists at least one long string w in L that fails to satisfy the conditions of the Pumping
Theorem. So L = { anbm | n > m } is not regular.

NOTE: Show that L= {ai bj | i ≠ j} is not regular.

ie: i ≠ j means i > j or i < j; so we can take string ‘w’ = ak+1bk or w= akbk+1.
Solution is similar to the previous problems

Page 63
Automata theory and Computability Showing that language is not RL

Show that L = {0m | m is prime} is not regular.

Here the language L contains stings of 0’s of length prime.
L = {0, 00, 000, 00000, 0000000, ................................ }
Assume that given language L is regular language and there exist some ‘k’ number of states,
such that any string w, where |w| ≥ k must satisfy the conditions of the theorem.
Let w = 0k (k is prime)
Since |w| = k ≥ k, we can split ‘w’ into xyz such that |xy| ≤ k and |y | ≥ 1 as
w = xyz
Since |xy| <= k, y must occur within the k characters and so y = 0p for some p. Also y ≠ε,
p must be greater than 0.
x = 0i y = 0p z = 0k – i – p where i + p <= k and |p| > 0
According to pumping lemma, language to be regular, xyqz € L for all q ≥ 0.

0i (0p)q 0k – i – p = 0k + p(q – 1) € L

ie: k + p(q – 1) is also prime for all q ≥ 0

If we choose q = k + 1 then k + p(q – 1) becomes k + kp = k (p + 1) is also prime

We know that p > 0; suppose p = 1 then 2k is also prime, but it is not true, which is a
contradiction to our assumption. So the language L = {0m | m is prime} is not regular.

Show that L = {an! | n ≥ 0} is not regular.

Assume that given language L is regular language and there exist some ‘k’ number of states,
such that any string w, where |w| ≥ k must satisfy the conditions of the theorem.
Let w = ak! ( the length of string is k!)
Since |w| = k! ≥ k, we can split ‘w’ into xyz such that |xy| ≤ k and |y | ≥ 1 as
w = xyz
Since |xy| <= k, y must occur within the k characters and so y = ap for some p. Also y ≠ε,
p must be greater than 0.
x = ai y = ap z = ak! – i – p where i + p <= k and |p| > 0

Page 64
Automata theory and Computability Showing that language is not RL

According to pumping lemma, language to be regular, xyqz € L for all q ≥ 0.

ai (ap)q ak! – i – p = ak! + p(q – 1) € L for all q ≥ 0

If we choose q = 0 then ak! – p € L for all p ≥ 1

If p = 1, then k! – 1 > k!

ak! – 1 does not belongs to L, which is a contradiction to our assumption. So the language L = {an!
| n ≥ 0} is not regular.

Show that L = { ww | w € (a, b)* } is not regular.

Assume that given language L is regular language and there exist some ‘k’ number of states,
such that any string w, where |w| ≥ k must satisfy the conditions of the theorem.
Let us consider one string defined in L; w = ak bk ak bk
Since |w| = k + k+ k + k = 4k ≥ k, we can split ‘w’ into xyz such that |xy| ≤ k and |y | ≥ 1 as
w = xyz
Since |xy| <= k, y must occur within the first k characters and so y = ap for some p. Also y ≠ε,
p must be greater than 0.
x = ak – p y = ap z = bk ak bk
According to pumping lemma, language to be regular, xyqz € L for all q ≥ 0.

Let q =2 and the resulting string w = ak – p (ap)2 bk ak bk where p ≥ 1 = ak + p bk ak bk must be in L

But it is not since the first half of the string has more a's than the second half does. So it is not in L.
Thus there exists at least one long string w in L that fails to satisfy the conditions of the Pumping
Theorem. So L = { ww | w € (a, b)* is not regular.}
Show that L= {w €(a, b)* | na(w) < nb(w) } is not regular
Assume that given language L is regular language and there exist some ‘k’ number of states,
such that any string w, where |w| ≥ k must satisfy the conditions of the theorem.
Let w = ak bk+1 (Language contains strings of more b’s than a’s)
Since |w| = k + k+1 = 2k +1 ≥ k, we can split ‘w’ into xyz such that |xy| ≤ k and |y | ≥ 1 as
w = xyz
Since |xy| <= k, y must occur within the first k characters and so y = ap for some p. Also y ≠ε,

Page 65
Automata theory and Computability Showing that language is not RL

p must be greater than 0.

x = ak – p y = ap z = bk+1
According to pumping lemma, language to be regular, xyqz € L for all q ≥ 0.
Let q = 2 and the resulting string w = ak – p (ap)2 bk+1 where p ≥ 1; w = ak+1+ p bk+1 must be in L
But it is not since p > 0 and k + 1 + p > k+1, so the resulting string no longer has more b's than
a's and so is not in L.
Thus there exists at least one long string w in L that fails to satisfy the conditions of the Pumping
Theorem. So L= {w €(a, b)* | na(w) < nb(w) } is not regular.
Show that L= {0n | n is a perfect square} is not regular. OR
Show that L= {0n | n = k 2 where k ≥ 1} is not regular
Here the language L contains stings of 0’s of perfect square length.
ie: L = {0, 0000, 000000000, ............................ }
Assume that given language L is regular language and there exist some ‘k’ number of states,
such that any string w, where |w| ≥ k must satisfy the conditions of the theorem.

Let w = 0k
Since |w| = k2 ≥ k, we can split ‘w’ into xyz such that |xy| ≤ k and |y | ≥ 1 as
w = xyz
Since |xy| ≤ k, y must occur within the first k characters and so y = ap for some p. Also y ≠ε,
p must be greater than 0.
–i-p
x = 0i y = 0p z = 0k
According to pumping lemma, language to be regular, xyqz € L for all q ≥ 0.
–i-p
Let q = 2 and the resulting string w = 0i (0p)2 0k where p ≥ 1; w = 0k +p
must be in L.
But it is not since p > 0 and when p= 1; k2 < k2+1 < (k+1)2, so the resulting string no longer
appears in L
Thus there exists at least one long string w in L that fails to satisfy the conditions of the Pumping
Theorem. So L= {0n | n is a perfect square} is not regular.

Page 66

Regular Expressions and Their Applications
No ratings yet
Regular Expressions and Their Applications
68 pages
Atcd Module 2 2021 Scheme
No ratings yet
Atcd Module 2 2021 Scheme
56 pages
Regular Expressions in Automata Theory
No ratings yet
Regular Expressions in Automata Theory
28 pages
Unit 3 - Regular Expression
No ratings yet
Unit 3 - Regular Expression
45 pages
TOC 2nd Chapter
No ratings yet
TOC 2nd Chapter
64 pages
Bcs503 Module 2
No ratings yet
Bcs503 Module 2
46 pages
Atc-21cs51 Module 2
No ratings yet
Atc-21cs51 Module 2
56 pages
Regular Expression
No ratings yet
Regular Expression
89 pages
Chapter 2 RegularExpressions
No ratings yet
Chapter 2 RegularExpressions
95 pages
Chapter Two
No ratings yet
Chapter Two
59 pages
Regular Expressions and Finite Automata
No ratings yet
Regular Expressions and Finite Automata
95 pages
Vision 2023 Toc Chapter 3 Regular Expression 59
No ratings yet
Vision 2023 Toc Chapter 3 Regular Expression 59
8 pages
Unit Ii
No ratings yet
Unit Ii
25 pages
2.0+regular Expression Part 1 MKN
No ratings yet
2.0+regular Expression Part 1 MKN
33 pages
Regular Expressions for Language Patterns
No ratings yet
Regular Expressions for Language Patterns
46 pages
Class 10 Regular Expression
No ratings yet
Class 10 Regular Expression
26 pages
Regular Expressions in Computation Theory
No ratings yet
Regular Expressions in Computation Theory
2 pages
Regular Expression
No ratings yet
Regular Expression
17 pages
Regular Expression
No ratings yet
Regular Expression
21 pages
Reguler Language and Reguler Expression
No ratings yet
Reguler Language and Reguler Expression
4 pages
Toc Unit 2
No ratings yet
Toc Unit 2
29 pages
Automata Chapter 2
No ratings yet
Automata Chapter 2
15 pages
Regular Expressions and Finite Automata
No ratings yet
Regular Expressions and Finite Automata
39 pages
Regular Expressions and Identities Explained
No ratings yet
Regular Expressions and Identities Explained
70 pages
TOC Unit2
No ratings yet
TOC Unit2
87 pages
Regular Expression
No ratings yet
Regular Expression
18 pages
Unit-2 Regular Expression and Languages
No ratings yet
Unit-2 Regular Expression and Languages
42 pages
Regular Expressions Full Notes Cse
No ratings yet
Regular Expressions Full Notes Cse
16 pages
Chapter 2 REGULAR EXPRESSION
No ratings yet
Chapter 2 REGULAR EXPRESSION
26 pages
File 16
No ratings yet
File 16
46 pages
IOT Mod1@AzDOCUMENTS - in
No ratings yet
IOT Mod1@AzDOCUMENTS - in
13 pages
BIS613D Important Question
No ratings yet
BIS613D Important Question
2 pages
GI Lab
No ratings yet
GI Lab
27 pages
Automata Module 1
No ratings yet
Automata Module 1
86 pages
Microcontroller
No ratings yet
Microcontroller
28 pages
CNS Module 5
No ratings yet
CNS Module 5
30 pages
Rancangan Pengajaran Tahunan KSSR Pend. Khas (BP) : Tahun 4: Bahasa Inggeris
No ratings yet
Rancangan Pengajaran Tahunan KSSR Pend. Khas (BP) : Tahun 4: Bahasa Inggeris
8 pages
Exploring Idioms A Kid's Guide
No ratings yet
Exploring Idioms A Kid's Guide
2 pages
Impact of Translation Memory on Quality
No ratings yet
Impact of Translation Memory on Quality
8 pages
Task Type 2
No ratings yet
Task Type 2
12 pages
Fun in The Park
100% (1)
Fun in The Park
6 pages
Gateway b1
No ratings yet
Gateway b1
3 pages
Unit 3, Lesson A (Demonstrative, Articles and Plural Nouns) - 2
No ratings yet
Unit 3, Lesson A (Demonstrative, Articles and Plural Nouns) - 2
2 pages
English Presentation Final
No ratings yet
English Presentation Final
30 pages
Khai Thác Tranh Ảnh Để Dạy Từ Vựng Theo Hướng Giao Tiếp Cho Sinh V
No ratings yet
Khai Thác Tranh Ảnh Để Dạy Từ Vựng Theo Hướng Giao Tiếp Cho Sinh V
14 pages
3 IELTS WT1 AC Prepositions
No ratings yet
3 IELTS WT1 AC Prepositions
10 pages
All Modul Fun English 2022
No ratings yet
All Modul Fun English 2022
22 pages
Unit 4 Art Show and Holidaying Vocabulary Review
100% (1)
Unit 4 Art Show and Holidaying Vocabulary Review
12 pages
The Lexical Approach
No ratings yet
The Lexical Approach
8 pages
Ngữ Pháp Chức Năng (Tổng Hợp Bài Giảng)
No ratings yet
Ngữ Pháp Chức Năng (Tổng Hợp Bài Giảng)
106 pages
Final Test Superkids I d4
100% (5)
Final Test Superkids I d4
3 pages
Sound Devices and Figurative Language Review
No ratings yet
Sound Devices and Figurative Language Review
2 pages
Adverb Usage in Young Learners' Writing
No ratings yet
Adverb Usage in Young Learners' Writing
24 pages
Mega Flash PDF
No ratings yet
Mega Flash PDF
1 page
Understanding 21st Century Communication
No ratings yet
Understanding 21st Century Communication
4 pages
Emphasizing Auxiliary Verbs
No ratings yet
Emphasizing Auxiliary Verbs
6 pages
History and Origin of ESP
No ratings yet
History and Origin of ESP
8 pages
Of Mice and Men Exam Study Guide
No ratings yet
Of Mice and Men Exam Study Guide
51 pages
Lesson Plan in English For Classroom Observation
No ratings yet
Lesson Plan in English For Classroom Observation
4 pages
If Clause
No ratings yet
If Clause
6 pages
Tips For SPM Bahasa Melayu and English 1119
No ratings yet
Tips For SPM Bahasa Melayu and English 1119
6 pages
T S Eliots Influence Upon Salah Abdel-Sabour Final
100% (4)
T S Eliots Influence Upon Salah Abdel-Sabour Final
12 pages
PBN Class Vi-Viii MDP 2022-23
100% (1)
PBN Class Vi-Viii MDP 2022-23
23 pages
Formula B1 Unit 2 Test DF
No ratings yet
Formula B1 Unit 2 Test DF
6 pages
TOEFL Listening Comprehension
No ratings yet
TOEFL Listening Comprehension
28 pages
Anglicisme in Limba Romana
No ratings yet
Anglicisme in Limba Romana
12 pages

Automata Module 2

Uploaded by

Automata Module 2

Uploaded by

AUTOMATA THEORY AND COMPUTABILITY

[As per Choice Based Credit System (CBCS) scheme]

2.1.1 What is Regular Expression 1

2.2.1 Definition of Regular Grammars and Regular Languages 47

2.2.3 To show that a language is regular, 54

2.2.4 Closure properties of RLs 55

Define a Regular expression.

5. Given two regular expressions α and β, α U β is a regular expression.

What is L((aU b)* b) = ?

L((aU b)* b) = L((a U b)*) L(b)

Regular expression Meaning

Write the regular expressions for the following languages:

Obtain regular expression to accept the following languages over Σ = { a, b}

x. Obtain the regular expression to accept the language L = { anbm | n  4 and m  3 }.

Obtain the regular expression for the language L = { anbm | m  1, n  1, nm  3 }

*****Building an FSM from a Regular Expression

If α is Ø, we construct for it the simple FSM as:

If α is ε we construct for it the simple FSM as:

FSM for (b U ab) :

FSM for (b U ab)*

Convert the regular expression (0 + 1)* 1 ( 0 + 1) to an ε- NFA or a FSM.

FSM for (0+1):

FSM for (0+1)*:

FSM for (0+1)* 1 :

FSM for (0+1)* 1 (0+1) :

Convert the regular expression (01+ 1)* to an ε- NFA or a FSM

FSM for (01+1):

FSM for (01+ 1)*:

Convert the regular expression (0 U 1)*01 to an ε- NFA or a FSM

FSM for (0 U 1) or (0+1):

FSM for (0 U 1)* or (0+1)*:

FSM for 01:

FSM for (0 U 1)*01 or (0+1)*01:

Convert the regular expression 0* + 1* + 2* to an ε- NFA or a FSM

FSM for 0*:

FSM for 1*:

FSM for 2*:

FSM for 0 * + 1* + 2*:

Building a Regular Expression from an FSM (State Elimination Technique)

Consider the following FSM M: Show a regular expression for L(M).

Regular Expression = ab*a

Regular expression = (a+b+c) or (a U b U c)

Regular expression = ab*c

After removing state A:

After removing state B:

Regular expression: 0(10)*

After removing states C, D and E:

After removing state B:

Regular Expression: a(b+c+d)

After inserting new start state:

After removing state A:

After removing state B:

Regular expression: b(c +ab)*d

By creating new start and final states:

After removing state B:

After removing state A:

Regular expression: (0+10*1)*

By creating new start and final states:

After removing state q1:

After removing state q2:

After removing state q3:

Regular expression: 1*00*1(0+10*1)*

By creating new start state and final state:

After removing q1 state:

After removing q2 state:

After removing q3 state:

After removing q0 state:

Regular expression: (01+10)*

Consider the following FSM M: Show a regular expression for L(M).

Consider the state 3 has rip and remove that state:

Consider the state 2 has rip and remove that state:

Consider the state 1 has rip and remove that state:

Regular Expression = (ab U aaa* b)* (a U ε )

Consider the following FSM M: Show a regular expression for L(M).

After creating new start and final states:

After removing q2 state:

FSM for (0 U 1)01 or (0+1)01:

Regular expression: (0+101)

Regular expression: 1001(0+101)

Regular expression = (00)11(11)

Regular expression= 0101

Regular expression: (0+1)1(0+1) +(0+1)1(0+1)(0+1)

RR* =RR = R+ 00 = 0+

Prove that (1 + 001) + (1 + 001) (0 + 101) (0 + 101) = 01(0 + 101)

(1 + 001) (ɛ + (0 + 101)* (0 + 10*1))

Prove that ɛ +1* (011)* (1* (011)) = (1 + 011)*

Check the equivalence of the regular expressions: (abbb)a* and a(bbba)*

(ab)a = a(ba) : True, by using the above identity rule.( P = a and Q= b)

Regular expression = (ba ba ba)b*

Regular expression = ( a + bbaa) (ε + bb*(ε +a))