0% found this document useful (0 votes)

88 views4 pages

Algebraic Laws for Regular Expressions

This document discusses finite automata and regular expressions. Finite automata are abstract machines that recognize patterns using a finite set of states, an alphabet, and transitions between states labeled with alphabet symbols. Regular expressions formally define languages using operators like union, concatenation, and Kleene closure. Regular expressions can be converted to finite automata models like non-deterministic finite automata (NFAs) and deterministic finite automata (DFAs) using constructions like Thompson's algorithm. NFAs and DFAs are also interconvertible using the subset construction algorithm.

Uploaded by

Manoj Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

88 views4 pages

Algebraic Laws for Regular Expressions

Uploaded by

Manoj Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

• Finite Automata – formal or abstract

machine to recognize patterns

Patterns, Automata, and
• Regular expressions – formal notation to
Regular Expressions describe/generate patterns

Finite Automata Regular Expressions

• A finite collection of states • Defines a set of strings over the characters contained in
• An alphabet some alphabet, defining a language
• A set of transitions between those states labeled • Atomic operand can be
– a character,
with symbols in the alphabet
– the symbol ε,
• A start state, S0 – the symbol Φ, or
• One or more final states – a variable whose value can be any pattern defined by a regular
expression
• Three basic operations/operators
Deterministic – single-valued transition, no epsilon
– Union – e.g., a|b
transitions – Concatenation – e.g., ab
Non-deterministic – multi-valued transitions – Closure – (Kleene closure) – e.g., a* - where a can be a set
concatenated with itself zero or more times

Precedence of Regular Expression Algebraic Laws for Regular

Operators Expressions
• Closure (highest) • Identity for Union Φ|R = R|Φ= R
• Identity for concatenation εR=Rε=R
• Concatenation • Associativity and commutativity of union
• Union R|S=S|R, ((R|S)|T) = (R|(S|T))
• Associativity of concatenation (RS)T = R(ST)
• Non-commutativity of concatenation
• Left distributivity of concatenation over union
(R(S|T)) = (RS|RT)
• Right distributivity of concatenation over union
((S|T)R) = (SR|TR)
• Idempotence of union (R | R) = R

1
RE <-> DFA Automated RE->NFA

State-Elimination
• Build NFA for each term, connect them
a b
Construction
DFA
with moves s0 s1 s2 s3
RE
– Concatenate - ab
a b
s0 s1 s2
Subset s3
Thompson’s
Construction
Construction
– Union – a|b
s0
a

s1
NFA s4
s2
b s5

– Kleene Closure – a* s3

a

s2 s0 s1 s3

Thompson’s Construction NFA->DFA

• Each NFA has a single start state and a single • Subset construction algorithm
– Each state in DFA corresponds to a set of states in NFA
final state, with no transitions leaving the final
state and only the initial transition entering the q0 ε-closure(n0)
start state initialize Q with {q0}

• An -move always connects two states that were

while (Q is still changing)

for each qi Q
start or final states for each character
• A state has at most 2 entering and 2 exiting - t ε-closure(move(qi, ))
T[qi, ] t
moves, and at most 1 entering and 1 exiting
if t Q then
move on a symbol in the alphabet add t to Q

Example DFA Minimization

NFA P P {SF, {S - SF}
while (P is still changing)
m a i n
S0 Sm Sa Si Sn T 0
for each set p P
-m -a,m -i,m -n,m for each

Corresponding DFA partition p by

m a i
S0 S0,Sm S0,Sa S0,Si into p1, p2, p3, … pk
n
-m m m m T T U p1, p2, p3, … pk
m m m
if T P then
m
a So,Sn,
i
S0,Sn, P T
S0,Sn S0,Sn,
Sa Si -m
-a,m
Sm
-i,m

2
Automated DFA->RE Regular Expression and DFA
Identifier
for i = 1 to N
for j = 1 to N letter (a|b|c| … |z|A|B|C| … |Z)
Rij0 = {a| (si,a) = sj} digit (0|1|2|3|4|5|6|7|8|9)
if (i == j)
Rij0 = Rij0 U
id letter (letter|digit)*
letter
for k = 1 to N digit
for i = 1 to N
letter
for j = 1 to N S0 S1

Rijk = Rikk-1 Rkkk-1 Rkjk-1 URijk-1

digit
L = U R1jN accept
s S
j

F S2 error
letter
digit

Implementing Scanners Code for Semi-Mechanical Pure

(Recognizer) DFA
state = S0; /* code for S0 */
• Ad-hoc done = false;
token_value = “” /* empty string */
token_type = error;

• Semi-mechanical pure DFA char = next_char();

while (not done) {
class = char_class[char];

• Table-driven DFA switch(state) {

case S0:
switch (class)
case ‘letter’: token_type = identifier; token_value = token_value+char;
state = S1; char = next_char(); break;
case ‘digit’: done = true; break;
case ‘other’: done = true; break;
break;
case S1:
switch(class)
case ‘letter’:
case ‘digit’: token_value = token_value+char; char = next_char(); break;
case ‘other’: done = true; break;
break;
}
}
return(token_type);

Table-Driven Recognizer Tables Driving the Recognizer

a-z A-Z 0-9 other
letter char_class
digit value letter letter digit other
letter other
S0 S1 S2

digit
other
class S0 S1 S2 S3
accept

S3 error letter S1 S1 -- --
next_state
digit S3 S1 -- --
other S3 S2 -- --
To change language, we can just change tables

3
Table-Driven Recognizer/Scanner Error Recovery
char = next_char();
state = S0; /* code for S0 */
token_value = “” /* empty string */
• E.g., illegal character
while (not done) {
class = char_class[char]; • What should the scanner do?
state = next_state[class,state];
switch(state) {
case S2: /* accept state */
– Report the error
token_type = identifier;
done = true; break; – Try to correct it?
case S3: /* error */
token_type = error;
done = true; break;
• Error correction techniques
default: /* building an id */
token_value = token_value + char; – Minimum distance corrections
char = next_char; break;

}
} – Hard token recovery
return(token_type);
– Skip until match

Scanner Summary
• Break up input into tokens
• Catch lexical errors
• Difficulty affected by language design
• Issues
– Input buffering
– Lookahead
– Error recovery
• Scanner generators
– Tokens specified by regular expressions
– Regular expression -> DFA
– Highly efficient in practice

CS 346: Compilers: Lexical Analyzer Lexical Analyzer
No ratings yet
CS 346: Compilers: Lexical Analyzer Lexical Analyzer
52 pages
Lecture 3 Lexical Analyzer
No ratings yet
Lecture 3 Lexical Analyzer
44 pages
Lec02 Lexicalanalyzer
100% (1)
Lec02 Lexicalanalyzer
50 pages
Compiler Design Lab Manual
No ratings yet
Compiler Design Lab Manual
32 pages
04 Regular Expressions & FAs
No ratings yet
04 Regular Expressions & FAs
46 pages
CompilerD L3
No ratings yet
CompilerD L3
36 pages
Lec 4 CH 2
No ratings yet
Lec 4 CH 2
39 pages
Automata Theory Basics
No ratings yet
Automata Theory Basics
42 pages
Scanner and Token Recognition Basics
No ratings yet
Scanner and Token Recognition Basics
26 pages
2 - 4 Finite Automata
No ratings yet
2 - 4 Finite Automata
23 pages
TAFL Unit 1 - Basic Concepts and Automata Theory - Detailed Notes
No ratings yet
TAFL Unit 1 - Basic Concepts and Automata Theory - Detailed Notes
13 pages
Compiler 5
No ratings yet
Compiler 5
42 pages
Reg Exp 2 DFA
No ratings yet
Reg Exp 2 DFA
11 pages
Compilation Techniques
No ratings yet
Compilation Techniques
21 pages
Lexical Analysis
No ratings yet
Lexical Analysis
16 pages
Regular Expression
No ratings yet
Regular Expression
46 pages
Compiler Design: Lexical Analysis
No ratings yet
Compiler Design: Lexical Analysis
25 pages
Regular Expressions to DFAs Explained
No ratings yet
Regular Expressions to DFAs Explained
37 pages
Automata Theory for CS Students
No ratings yet
Automata Theory for CS Students
33 pages
Reg Ex
No ratings yet
Reg Ex
12 pages
CS-352 - Spring 2024 - Lec4
No ratings yet
CS-352 - Spring 2024 - Lec4
38 pages
Unit 1 Part 2 - Compiler
No ratings yet
Unit 1 Part 2 - Compiler
32 pages
CD - Unit1 - Lecture4 5 6 7
No ratings yet
CD - Unit1 - Lecture4 5 6 7
50 pages
Lecture 04
No ratings yet
Lecture 04
37 pages
Chapter 3 Implementation - of - Lexical - Analysis
No ratings yet
Chapter 3 Implementation - of - Lexical - Analysis
63 pages
02 Automata
No ratings yet
02 Automata
78 pages
Two Issues in Lexical Analysis
No ratings yet
Two Issues in Lexical Analysis
11 pages
Compiler Design: RE to DFA
No ratings yet
Compiler Design: RE to DFA
23 pages
COMP 330 - How Tos
No ratings yet
COMP 330 - How Tos
5 pages
Tafl Last Min Notes
No ratings yet
Tafl Last Min Notes
19 pages
Formal Language and Automata Theory: Prof. Sachin Jain, Prof - Atul Kumar, Prof. Vaibhavi Patel
No ratings yet
Formal Language and Automata Theory: Prof. Sachin Jain, Prof - Atul Kumar, Prof. Vaibhavi Patel
86 pages
Understanding Non-Deterministic Finite Automata
No ratings yet
Understanding Non-Deterministic Finite Automata
26 pages
Lexical Analysis All Token List and Diffence
No ratings yet
Lexical Analysis All Token List and Diffence
4 pages
19CSE401 CD 02 Scanners
No ratings yet
19CSE401 CD 02 Scanners
82 pages
Formal Languages for CS Students
No ratings yet
Formal Languages for CS Students
31 pages
Formal Language & Automata Basics
No ratings yet
Formal Language & Automata Basics
24 pages
Understanding Regular Expressions and DFA
No ratings yet
Understanding Regular Expressions and DFA
16 pages
Rahul Kumar Shaw
No ratings yet
Rahul Kumar Shaw
10 pages
548445041
No ratings yet
548445041
17 pages
2 - Compilers (Lexical Analysis)
No ratings yet
2 - Compilers (Lexical Analysis)
60 pages
Re To DFA
No ratings yet
Re To DFA
6 pages
QuickBooks Error 3008 Solutions
No ratings yet
QuickBooks Error 3008 Solutions
43 pages
Lec 4
No ratings yet
Lec 4
17 pages
TOC DFA Regex NFA Explained
No ratings yet
TOC DFA Regex NFA Explained
9 pages
Compiler Construction Basics
No ratings yet
Compiler Construction Basics
79 pages
Compiler Design and Construction6
No ratings yet
Compiler Design and Construction6
23 pages
Lexical Analysis for Programmers
No ratings yet
Lexical Analysis for Programmers
67 pages
Lect 04
No ratings yet
Lect 04
12 pages
Regular Expressions: Reading: Chapter 3
No ratings yet
Regular Expressions: Reading: Chapter 3
39 pages
Understanding Regular Expressions and DFAs
No ratings yet
Understanding Regular Expressions and DFAs
11 pages
FLAT - Ch.2
No ratings yet
FLAT - Ch.2
86 pages
3 Automata Gate
No ratings yet
3 Automata Gate
225 pages
Regular Expression, DFA and NFA: Prepared By: Prof. J. S. Dhobi Prof. M. D. Mehta
No ratings yet
Regular Expression, DFA and NFA: Prepared By: Prof. J. S. Dhobi Prof. M. D. Mehta
82 pages
TCS Notes
No ratings yet
TCS Notes
14 pages
Flat CH 2
No ratings yet
Flat CH 2
86 pages
Recognition of Tokens
No ratings yet
Recognition of Tokens
34 pages
Language Operations and Finite Automata
No ratings yet
Language Operations and Finite Automata
16 pages
NFA to DFA Construction Guide
No ratings yet
NFA to DFA Construction Guide
66 pages
Lemon Parser: Token Processing Guide
100% (3)
Lemon Parser: Token Processing Guide
8 pages
Compiler Semantic Analysis
No ratings yet
Compiler Semantic Analysis
108 pages
CSE/IT 213 - Eclipse, Collections, and Exceptions: New Mexico Tech
No ratings yet
CSE/IT 213 - Eclipse, Collections, and Exceptions: New Mexico Tech
58 pages
PPL 2
No ratings yet
PPL 2
144 pages
CS382
No ratings yet
CS382
3 pages
Compiler Design CS8602 Part A & Part B Answers
No ratings yet
Compiler Design CS8602 Part A & Part B Answers
149 pages
Query Processing & Optimization Guide
100% (1)
Query Processing & Optimization Guide
45 pages
Unit 1
No ratings yet
Unit 1
16 pages
REXX400 Programmer's Guide
No ratings yet
REXX400 Programmer's Guide
251 pages
Java Microproject
No ratings yet
Java Microproject
7 pages
Error Recovery
No ratings yet
Error Recovery
30 pages
TY CSE Curriculum Overview 2016-17
No ratings yet
TY CSE Curriculum Overview 2016-17
14 pages
Computer Science and Engineering PDF
No ratings yet
Computer Science and Engineering PDF
17 pages
2024 CD-Ch02 Lexical Analysis
No ratings yet
2024 CD-Ch02 Lexical Analysis
25 pages
Compiler Design Introduction
No ratings yet
Compiler Design Introduction
22 pages
Formal Method Nusmv Lab Manual 1
No ratings yet
Formal Method Nusmv Lab Manual 1
145 pages
spaCy 101: NLP Basics & Features Guide
No ratings yet
spaCy 101: NLP Basics & Features Guide
10 pages
04 Novikov
No ratings yet
04 Novikov
25 pages
The Ring Programming Language Version 1.4.1 Book - Part 2 of 31
No ratings yet
The Ring Programming Language Version 1.4.1 Book - Part 2 of 31
30 pages
Evoluation of Programming Languages DR Jivtode
No ratings yet
Evoluation of Programming Languages DR Jivtode
31 pages
Chap 1306 - Ak
No ratings yet
Chap 1306 - Ak
52 pages
Department of Computer Science and Engineering: Lab Manual
100% (1)
Department of Computer Science and Engineering: Lab Manual
61 pages
SPPU IT Sem 6 Data Science Syllabus
No ratings yet
SPPU IT Sem 6 Data Science Syllabus
13 pages
Coq 8.20.1 Reference Manual
No ratings yet
Coq 8.20.1 Reference Manual
936 pages
Compiler Construction
No ratings yet
Compiler Construction
3 pages
NIELIT Notes
No ratings yet
NIELIT Notes
15 pages
Syllabus For Bachelor of Engineering (Computer Sc. & Engg.) Seventh Semester Paper Title: Compiler Design
No ratings yet
Syllabus For Bachelor of Engineering (Computer Sc. & Engg.) Seventh Semester Paper Title: Compiler Design
15 pages
Python Programming Basics Guide
No ratings yet
Python Programming Basics Guide
53 pages
Project 1 Requirements
No ratings yet
Project 1 Requirements
2 pages
Compiler Design Module 1 Notes 2023-24 17-03-2024
No ratings yet
Compiler Design Module 1 Notes 2023-24 17-03-2024
24 pages

Algebraic Laws for Regular Expressions

Uploaded by

Algebraic Laws for Regular Expressions

Uploaded by

• Finite Automata – formal or abstract

machine to recognize patterns

Finite Automata Regular Expressions

Precedence of Regular Expression Algebraic Laws for Regular

Thompson’s Construction NFA->DFA

• An -move always connects two states that were

Example DFA Minimization

Corresponding DFA partition p by 

Rijk = Rikk-1 Rkkk-1 Rkjk-1 URijk-1

Implementing Scanners Code for Semi-Mechanical Pure

• Semi-mechanical pure DFA char = next_char();

• Table-driven DFA switch(state) {

Table-Driven Recognizer Tables Driving the Recognizer

You might also like

Corresponding DFA partition p by