0% found this document useful (0 votes)

47 views39 pages

Theory of Computation: Automata Theory (CFG, CFL, CNF)

The document discusses context-free grammars (CFG) and how they are used to generate formal languages. Some key points: - A CFG consists of variables, terminals, substitution rules, and a start variable. Strings are generated by repeatedly substituting variables according to the rules. - The language of a grammar consists of all strings that can be generated. Regular languages can be described by CFGs by constructing rules from a deterministic finite automaton. - A derivation shows the sequence of substitutions used to generate a string. Ambiguity occurs when a string has multiple derivations. - Chomsky normal form restricts rules to be of the form A→BC or A→a, which any CFG can

Uploaded by

Aditya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

47 views39 pages

Theory of Computation: Automata Theory (CFG, CFL, CNF)

Uploaded by

Aditya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 39

Theory of Computation

Automata Theory
(CFG, CFL, CNF)
Objectives
• Introduce Context-free Grammar
(CFG) and Context-free Language
(CFL)
• Show that Regular Language can be
described by CFG
• Terminology related to CFG
– Leftmost Derivation, Ambiguity,
Chomsky Normal Form (CNF)
• Converting a CFG into CNF
Context-free Grammar
(Example)

A  0A1
Substitution
Rules AB
B#
Variables A, B
Terminals 0,1,#
Start Variable A

Important: Substitution Rule in CFG has a special form:

Exactly one variable (and nothing else) on the left
side of the arrow
How does CFG generate strings?
A  0A1
AB
B#
• Write down the start symbol
• Find a variable that is written down, and a
rule that starts with that variable; Then,
replace the variable with the rule
• Repeat the above step until no variable is
left
How does CFG generate strings?
A  0A1
AB
B#
Step 1. A (write down the start variable)
Step 2. 0A1 (find a rule and replace)
Step 3. 00A11 (find a rule and replace)
Step 4. 00B11 (find a rule and replace)
Step 5. 00#11 (find a rule and replace)
Now, the string 00#11 does not have any variable.
We can stop.
How does CFG generate strings?
• The sequence of substitutions to
generate a string is called a derivation
• E.g., A derivation of the string
000#111 in the previous grammar is
A  0A1  00A11  000A111
 000B111  000#111
• The same information can be
represented pictorially by a parse tree
(next slide)
Parse Tree
A

A
0 1
A
0 1
0 B 1

#
Language of the Grammar
• In fact, the previous grammar can
generate strings #, 0#1, 00#11,
000#111, …
• The set of all strings that can be
generated by a grammar G is called
the language of G, denoted by L(G)
• The language of the previous
grammar is {0n#1n | n  0 }
CFG (Formal Definition)
• A CFG is a 4-tuple (V,T, R, S), where
– V is a finite set of variables
– T is a finite set of terminals
– R is a set of substitution rules, where
each rule consists of a variable (left side
of the arrow) and a string of variables
and terminals (right side of the arrow)
– S 2 V called the start variable
CFG (terminology)
• Let u and v be strings of variables and
terminals
• We say u derives v, denoted by u  * v, if
– u = v, or
– there exists u1, u2, …, uk, k  0 such that u  u1
 u2  …  uk  v

• In other words, for a grammar G = (V,T,R,S),

*
L(G) = { w 2 T*| S  w }
CFG (more examples)
• Let G = ( {S}, {a,b}, R, S ), and the set
of rules, R, is
– S  aSb | SS |  This notation is
an abbreviation for
S  aSb
S  SS
S
• What will this grammar generate?
• If we think of a as “(” and b as “)”, G generates
all strings of properly nested parentheses
• Is the following a CFG?

G = { {A,B}, {0,1}, R, A }
A  0B1 | A | 0
B  1B0 | 1
0B  A
Designing CFG
• Can we design CFG for
{0n1n | n  0} [ {1n0n | n  0} ?

• Do we know CFG for {0n1n | n  0}?

• Do we know CFG for {1n0n | n  0}?
Designing CFG
• CFG for the language L1 = {0n1n | n  0}
S  0S1 | 
• CFG for the language L2 = {1n0n | n  0}
S  1S0 | 
• CFG for L1 [ L2
S  S1 | S 2
S1  0S11 | 
S2  1S20 | 
Designing CFG
• Can we design CFG for {02n13n | n  0}?

• Yes, by “linking” the occurrence of 0’s

with the occurrence of 1’s
• The desired CFG is:
S  00S111 | 
• Can we construct the CFG for the
language { w | w is a palindrome } ?
Assume that the alphabet of w is {0,1}

• Examples for palindrome: 010, 0110,

001100, 01010, 1101011, …
Regular Language & CFG
Theorem: Any regular language can be
described by a CFG.

How to prove? (By construction)

Regular Language & CFG
Proof: Let D be the DFA recognizing the
language. Create a distinct variable Vi for
each state qi in D.
• Make V0 the start variable of CFG
Assume that q0 is the start state of D

• Add a rule Vi  aVj if (qi,a) = qj

• Add a rule Vi   if qi is an accept state
Then, we can show that the above CFG generates
exactly the same language as D (how to show?)
Regular Language & CFG
(Example)
DFA 0 1
1

start q0 q1

0
CFG G = ( {V0, V1}, {0,1}, R, V0 ), where R is
V0  0V0 | 1V1 | 
V1  1V1 | 0V0
Leftmost Derivation
• A derivation which always replace the
leftmost variable in each step is called a
leftmost derivation
– E.g., Consider the CFG for the properly nested
parentheses ( {S}, {(,)}, R, S ) with rule R: S
 ( S ) | SS | 
– Then, S  SS  (S)S  ( )S  ( ) ( S )
 ( ) ( ) is a leftmost derivation
– But, S  SS  S(S)  (S)(S)  ( ) ( S )
 ( ) ( ) is not a leftmost derivation
• However, we note that both derivations
correspond to the same parse tree
Ambiguity
• Sometimes, a string can have two or more
leftmost derivations!!
• E.g., Consider CFG ( {S}, {+,x,a}, R, S) with
rules R:
SS+S|SxS|a
– The string a + a x a has two leftmost
derivations as follows:
– S  S + S  a + S  a +S x S  a + a x S
a+axa
– S  S x S  S + S x S  a +S x S  a + a x S
a+axa
Ambiguity
• If a string has two or more leftmost
derivations in a CFG G, we say the string is
derived ambiguously in G
• A grammar is ambiguous if some strings is
derived ambiguously
• Note that the two leftmost derivations in
the previous example correspond to
different parse trees (see next slide)
– In fact, each leftmost derivation corresponds
to a unique parse tree
Two parse trees for a + a x a
S S

S + S S x S

a S x S S + S a

a a a a
Fun Fact:
Inherently Ambiguous
• Sometimes when we have an ambiguous
grammar, we can find an unambiguous grammar
that generates the same language
• However, some language can only be generated
by ambiguous grammar
E.g., { anbncm | n, m  0} [ {anbmcm | n, m  0}

• Such language is called inherently ambiguous

Chomsky Normal Form (CNF)
• A CFG is in Chomsky Normal Form if each
rule is of the form
A  BC
Aa
where
– a is any terminal
– A,B,C are variables
– B, C cannot be start variable
• However, S   is allowed
Converting a CFG to CNF
Theorem: Any context-free language
can be generated by a context-free
grammar in Chomsky Normal Form.

Hint: When is a general CFG not in

Chomsky Normal Form?
Proof Idea
The only reasons for a CFG not in CNF:
1. Start variable appears on right side
2. It has  rules, such as A  
3. It has unit rules, such as A  A, or B  C
4. Some rules does not have exactly two
variables or one terminal on right side

Prove idea: Convert a grammar into CNF

by handling the above cases
The Conversion (step 1)
• Proof: Let G be the context-free
grammar generating the context-free
language. We want to convert G into
CNF.
• Step 1: Add a new start variable S0
and the rule S0  S, where S is the
start variable of G
This ensures that start variable of the new grammar
does not appear on right side
The Conversion (step 2)
• Step 2: We take care of all  rules. To
remove the rule A  , for each
occurrence of A on the right side of a rule,
we add a new rule with that occurrence
deleted.
– E.g., R  uAvAw causes us to add the rules:
R  uAvw, R  uvAw, R  uvw
• If we have the rule R  A, we add R  
unless we had previously removed R  
After removing A  , the new grammar still
generates the same language as G.
The Conversion (step 3)
• Step 3: We remove the unit rule A 
B. To do so, for each rule B  u
(where u is a string of variables and
terminals), we add the rule A  u.
– E.g., if we have A  B, B  aC, B  CC,
we add: A  aC, A  CC
After removing A  B, the new grammar still
generates the same language as G.
The Conversion (step 4)
• Step 4: Suppose we have a rule
A  u1 u2 …uk, where k > 2 and each ui
is a variable or a terminal. We replace
this rule by
– A  u1A1, A1  u2A2, A2  u3A3, …,
Ak-2  uk-1uk
After the change, the string on the right side of any
rule is either of length 1 (a terminal) or length 2 (two
variables, or 1 variable + 1 terminal, or two terminals)
The Conversion (step 4 cont.)
• To remove a rule A  u1u2 with some
terminals on the right side, we replace
the terminal ui by a new variable Ui and
add the rule Ui  ui
After the change, the string on the right side of any
rule is exactly a terminal or two variables
The Conversion (example)
• Let G be the grammar on the left
side. We get the new grammar on the
right side after the first step.

S0  S S0  S
S  ASA | aB S  ASA | aB | a
AB|S AB|S|
Bb| Bb

Before removing After removing

B B
The Conversion (example)
• After that, we remove A  
S0  S S0  S
S  ASA | aB | a S  ASA | aB | a |
AB|S|
Bb SA | AS | S
AB|S
Before removing B After
 b removing
A A
The Conversion (example)
• Then, we remove S  S and S0  S

S0  S S0  ASA | aB | a |
S  ASA | aB | a |
SA | AS
SA | AS S  ASA | aB | a |
AB|S SA | AS
Bb AB|S
After removing B b removing
After
SS S0  S
The Conversion (example)
• Then, we remove A  B
S0  ASA | aB | a | S0  ASA | aB | a |

SA | AS SA | AS
S  ASA | aB | a | S  ASA | aB | a |
SA | AS SA | AS
AB|S Ab|S
B Before
 b removing B b removing
After
AB AB
The Conversion (example)
• Then, we remove A  S
S0  ASA | aB | a | S0  ASA | aB | a |

SA | AS SA | AS
S  ASA | aB | a | S  ASA | aB | a |
SA | AS SA | AS
Ab|S A  b | ASA | aB |
Bb a | SA | AS
Before removing B  After
b removing
AS AS
The Conversion (example)
• Then, we apply Step 4
S0  AA1 | UB | a | SA |
S0  ASA | aB | a |
AS
SA | AS S  AA1 | UB | a | SA | AS
S  ASA | aB | a |
A  b | AA1 | UB | a | SA |
SA | AS
A  b | ASA | aB | AS
a | SA | AS Bb
After Step 4
B b Step 4
Before A1  SA Grammar is in CNF
Ua

Lecture7 PDF
No ratings yet
Lecture7 PDF
40 pages
Automata Lectuee5
No ratings yet
Automata Lectuee5
33 pages
Unit 3 CFG
No ratings yet
Unit 3 CFG
65 pages
Pda Annotated 10 12 2021
No ratings yet
Pda Annotated 10 12 2021
37 pages
Toc Ii
No ratings yet
Toc Ii
40 pages
Context Free Grammars
No ratings yet
Context Free Grammars
40 pages
ACT Chapter 3
No ratings yet
ACT Chapter 3
28 pages
Chapter 4 and 5
No ratings yet
Chapter 4 and 5
71 pages
Chapter 4 and 5
100% (1)
Chapter 4 and 5
71 pages
Context Free Grammars
No ratings yet
Context Free Grammars
36 pages
Context-Free Languages & Grammars (Cfls & CFGS) : Reading: Chapter 5
No ratings yet
Context-Free Languages & Grammars (Cfls & CFGS) : Reading: Chapter 5
38 pages
Context Free Grammars
No ratings yet
Context Free Grammars
40 pages
Context-Free Languages & Grammars Explained
No ratings yet
Context-Free Languages & Grammars Explained
40 pages
TOC II Updated
No ratings yet
TOC II Updated
41 pages
Context Free Grammars
No ratings yet
Context Free Grammars
39 pages
Context Free Languages
No ratings yet
Context Free Languages
36 pages
ContextFreeGrammars Myppt
No ratings yet
ContextFreeGrammars Myppt
41 pages
CH 3
No ratings yet
CH 3
16 pages
ContextFreeGrammars
No ratings yet
ContextFreeGrammars
28 pages
Context Free Language
No ratings yet
Context Free Language
31 pages
CS242 - Module 5
No ratings yet
CS242 - Module 5
42 pages
Lecture 06 Context-Free Grammars
No ratings yet
Lecture 06 Context-Free Grammars
22 pages
08 CFG
No ratings yet
08 CFG
41 pages
Context-Free Grammars & CNF Conversion
No ratings yet
Context-Free Grammars & CNF Conversion
23 pages
Context Free Grammars
No ratings yet
Context Free Grammars
24 pages
Context-Free Grammar Basics
100% (1)
Context-Free Grammar Basics
68 pages
Context-Free Grammar and Languages Guide
No ratings yet
Context-Free Grammar and Languages Guide
25 pages
Act CH 3
No ratings yet
Act CH 3
36 pages
Context Free Grammars
No ratings yet
Context Free Grammars
25 pages
5c-partB-CFG and PDA
No ratings yet
5c-partB-CFG and PDA
57 pages
CS372 Formal Languages & The Theory of Computation
No ratings yet
CS372 Formal Languages & The Theory of Computation
33 pages
Toc-L06 N Summer
No ratings yet
Toc-L06 N Summer
16 pages
Context-Free Languages & Grammars (Cfls & CFGS) : 1 10/10/2022 C.P.Shabariram Ap (Sr. GR.) /cse
No ratings yet
Context-Free Languages & Grammars (Cfls & CFGS) : 1 10/10/2022 C.P.Shabariram Ap (Sr. GR.) /cse
36 pages
2 Contex Free Language
No ratings yet
2 Contex Free Language
13 pages
Unit-3 Context Free Grammar
No ratings yet
Unit-3 Context Free Grammar
57 pages
Theory of Automata
No ratings yet
Theory of Automata
202 pages
WINSEM2024-25 BCSE304L TH VL2024250501632 2025-02-15 Reference-Material-I
No ratings yet
WINSEM2024-25 BCSE304L TH VL2024250501632 2025-02-15 Reference-Material-I
29 pages
Formal Languages and Automata Theory: CH 4: Context Free Languages
No ratings yet
Formal Languages and Automata Theory: CH 4: Context Free Languages
59 pages
Chapter 3 - Context Free Languages
No ratings yet
Chapter 3 - Context Free Languages
59 pages
Context Free Grammar181007800
No ratings yet
Context Free Grammar181007800
42 pages
Flat Module 3
No ratings yet
Flat Module 3
18 pages
08 CFG
No ratings yet
08 CFG
27 pages
WINSEM2024-25 BCSE304L TH VL2024250501647 2025-02-15 Reference-Material-I
No ratings yet
WINSEM2024-25 BCSE304L TH VL2024250501647 2025-02-15 Reference-Material-I
29 pages
Lectures Examples and Solutions of CFG&RE
No ratings yet
Lectures Examples and Solutions of CFG&RE
290 pages
Toc Unit III
No ratings yet
Toc Unit III
36 pages
Session 07 - Context Free Grammar
No ratings yet
Session 07 - Context Free Grammar
34 pages
Unit IV Context Free Languages
No ratings yet
Unit IV Context Free Languages
89 pages
Jan-June 2025 Btcs 4 Sem v10 Btcs404 Btcs404 Unit3 Notes
No ratings yet
Jan-June 2025 Btcs 4 Sem v10 Btcs404 Btcs404 Unit3 Notes
14 pages
CSE322 #Automata Full Unit - 4 Context Free Languages (@rajkumar)
No ratings yet
CSE322 #Automata Full Unit - 4 Context Free Languages (@rajkumar)
74 pages
Ambiguity in Context Free Languages
No ratings yet
Ambiguity in Context Free Languages
32 pages
CS351 Context Free Grammars
No ratings yet
CS351 Context Free Grammars
9 pages
Lecture 7 - 8 & 9 - Chapter 4
No ratings yet
Lecture 7 - 8 & 9 - Chapter 4
50 pages
Chap 4 Formal
No ratings yet
Chap 4 Formal
20 pages
Context-Free Grammars for Regular Languages
No ratings yet
Context-Free Grammars for Regular Languages
65 pages
Context Free Grammar
No ratings yet
Context Free Grammar
113 pages
Chapter 3
No ratings yet
Chapter 3
32 pages
Context
No ratings yet
Context
57 pages
Unit Iv Context Free Languages
No ratings yet
Unit Iv Context Free Languages
74 pages
Chapter - 2 - Finite State Automata - Part - 3
No ratings yet
Chapter - 2 - Finite State Automata - Part - 3
50 pages
Information Security Management and Metrics
No ratings yet
Information Security Management and Metrics
30 pages
Theory of Computation and Compiler Design: Module - 4
No ratings yet
Theory of Computation and Compiler Design: Module - 4
31 pages
Information Security Audit and Features
100% (1)
Information Security Audit and Features
45 pages
Scanned Document Pages
No ratings yet
Scanned Document Pages
14 pages
Theory of Computation and Compiler Design: Module - 4
No ratings yet
Theory of Computation and Compiler Design: Module - 4
69 pages
VL2020210106690 Ast02 PDF
No ratings yet
VL2020210106690 Ast02 PDF
1 page
Theory of Computation and Compiler Design: Module - 4
No ratings yet
Theory of Computation and Compiler Design: Module - 4
9 pages
Theory of Computation and Compiler Design: Module - 4
No ratings yet
Theory of Computation and Compiler Design: Module - 4
27 pages
Theory of Computation and Compiler Design: Module - 4
No ratings yet
Theory of Computation and Compiler Design: Module - 4
11 pages
Theory of Computation and Compiler Design: Module - 4
No ratings yet
Theory of Computation and Compiler Design: Module - 4
14 pages
Theory of Computation and Compiler Design: Module - 4
No ratings yet
Theory of Computation and Compiler Design: Module - 4
29 pages
Theory of Computation and Compiler Design: Module - 5
No ratings yet
Theory of Computation and Compiler Design: Module - 5
12 pages
Pushdown Automata (PDA) : Reading: Chapter 6
No ratings yet
Pushdown Automata (PDA) : Reading: Chapter 6
34 pages
FALLSEM2020-21 CSE2002 TH VL2020210106983 Reference Material III 27-Aug-2020 CFGPDANOTES PDF
100% (1)
FALLSEM2020-21 CSE2002 TH VL2020210106983 Reference Material III 27-Aug-2020 CFGPDANOTES PDF
79 pages
FALLSEM2020-21 CSE2002 TH VL2020210106983 Reference Material I 26-Aug-2020 NPDA
No ratings yet
FALLSEM2020-21 CSE2002 TH VL2020210106983 Reference Material I 26-Aug-2020 NPDA
80 pages
FALLSEM2020-21 CSE2002 TH VL2020210106983 Reference Material I 21-Aug-2020 Ambiguity
No ratings yet
FALLSEM2020-21 CSE2002 TH VL2020210106983 Reference Material I 21-Aug-2020 Ambiguity
5 pages
Pushdown Automata: Introduction To Formal Languages and Automata
No ratings yet
Pushdown Automata: Introduction To Formal Languages and Automata
102 pages
Pumping Lemma For Regular Languages
No ratings yet
Pumping Lemma For Regular Languages
45 pages
Book American Big Picture
No ratings yet
Book American Big Picture
74 pages
English Grammar Step by Step 1
100% (3)
English Grammar Step by Step 1
30 pages
Unit VI
No ratings yet
Unit VI
45 pages
Teaching Idioms & Phrasal Verbs
No ratings yet
Teaching Idioms & Phrasal Verbs
2 pages
The Infinitive
No ratings yet
The Infinitive
3 pages
The Indo European Language
No ratings yet
The Indo European Language
12 pages
Reserva - A - Excessive Tourism Exam Ugr
No ratings yet
Reserva - A - Excessive Tourism Exam Ugr
1 page
Cls.a 9-A A - English My Love L1
No ratings yet
Cls.a 9-A A - English My Love L1
2 pages
Active and Passive Voice
No ratings yet
Active and Passive Voice
26 pages
Winter Holidays. Speaking+Writing
No ratings yet
Winter Holidays. Speaking+Writing
1 page
Makalah B. Inggris
No ratings yet
Makalah B. Inggris
13 pages
Handout Week 1 - Syntax
No ratings yet
Handout Week 1 - Syntax
18 pages
English Grammar for Beginners
No ratings yet
English Grammar for Beginners
26 pages
English 12 Unit 10 Endangered Species
No ratings yet
English 12 Unit 10 Endangered Species
3 pages
A Brief Swedish Grammar
100% (7)
A Brief Swedish Grammar
344 pages
Assignment Syntax: Name: Melisa NPM: 201912500363 Class: Y4C
100% (3)
Assignment Syntax: Name: Melisa NPM: 201912500363 Class: Y4C
5 pages
A Descriptive Study of The Bhaktapur Dialect of Newari (Sunder Krishna Joshi)
100% (1)
A Descriptive Study of The Bhaktapur Dialect of Newari (Sunder Krishna Joshi)
371 pages
How To Write Japanese
100% (1)
How To Write Japanese
170 pages
Biblical Hebrew Grammar Presentation
83% (6)
Biblical Hebrew Grammar Presentation
245 pages
Eng Grammar
No ratings yet
Eng Grammar
14 pages
Grammar Lesson 1 Interjection
No ratings yet
Grammar Lesson 1 Interjection
6 pages
100 Essential Vocab Giveaway
100% (1)
100 Essential Vocab Giveaway
32 pages
UNIT 5 Passive Voice SIMPLE PRESENT
No ratings yet
UNIT 5 Passive Voice SIMPLE PRESENT
10 pages
Scheme of Work English Form 4 2017
100% (2)
Scheme of Work English Form 4 2017
7 pages
Indirect Speech Transformations
No ratings yet
Indirect Speech Transformations
3 pages
Zero Conditional Grammar Guide
No ratings yet
Zero Conditional Grammar Guide
3 pages
Automata Notes
No ratings yet
Automata Notes
75 pages
Myp 4 - Test Review
No ratings yet
Myp 4 - Test Review
3 pages
Early Morphological Development Guide
No ratings yet
Early Morphological Development Guide
5 pages
Verb Tense Exercises and Corrections
100% (2)
Verb Tense Exercises and Corrections
20 pages

Theory of Computation: Automata Theory (CFG, CFL, CNF)

Uploaded by

Theory of Computation: Automata Theory (CFG, CFL, CNF)

Uploaded by

Theory of Computation

Important: Substitution Rule in CFG has a special form:

• In other words, for a grammar G = (V,T,R,S),

• Do we know CFG for {0n1n | n  0}?

• Yes, by “linking” the occurrence of 0’s

• Examples for palindrome: 010, 0110,

How to prove? (By construction)

• Add a rule Vi  aVj if (qi,a) = qj

• Such language is called inherently ambiguous

Hint: When is a general CFG not in

Prove idea: Convert a grammar into CNF

Before removing After removing

You might also like