Regular Expressions

The document discusses the concepts of regular expressions and regular languages, emphasizing their role in defining patterns for finite strings and programming language tokens. It explains operations on languages such as union, concatenation, and Kleene closure, along with the precedence and associativity of these operations. Additionally, it covers finite automata as a mathematical model for recognizing regular expressions and details the construction of finite automata, including states, transitions, and final states.

Uploaded by

Pradnya Vikhar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

50 views21 pages

Regular Expressions

Uploaded by

Pradnya Vikhar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 21

Compiler Design

Count no of token in following C code

Regular Expression and Regular
Languages
• The lexical analyzer needs to scan and identify only a finite set of
valid string/token/lexeme that belong to the language in hand.
• It searches for the pattern defined by the language rules.
• Regular expressions have the capability to express finite
languages by defining a pattern for finite strings of symbols.
• The grammar defined by regular expressions is known
as regular grammar. The language defined by regular
grammar is known as regular language.
Regular Expression and Regular
Languages
• Programming language tokens can be described by regular
languages.
• There are a number of algebraic laws that are obeyed by
regular expressions, which can be used to manipulate regular
expressions into equivalent forms.
Regular Expression and Regular
Languages
The various operations on languages are:
• Union of two languages L and M is written as
L U M = {s | s is in L or s is in M}

• Concatenation of two languages L and M is written as

LM = {st | s is in L and t is in M}

• The Kleene Closure of a language L is written as

L* = Zero or more occurrence of language L.
Notations
If r and s are regular expressions denoting the languages
L(r) and L(s), then
• Union : (r)|(s) is a regular expression denoting L(r) U
L(s)
• Concatenation : (r)(s) is a regular expression denoting
L(r)L(s)
• Kleene closure : (r)* is a regular expression denoting
(L(r))* (r) is a regular expression denoting L(r)
Precedence and Associativity
• *, concatenation (.), and | (pipe sign) are left associative
• * has the highest precedence
• Concatenation (.) has the second highest precedence.
• | (pipe sign) has the lowest precedence of all.
Representing valid tokens of a language in regular
expression
If x is a regular expression, then:
• x* means zero or more occurrence of x.
i.e., it can generate { e, x, xx, xxx, xxxx, … }
• x+ means one or more occurrence of x.
i.e., it can generate { x, xx, xxx, xxxx … } or x.x*
• x? means at most one occurrence of x
i.e., it can generate either {x} or {e}.
• [a-z] is all lower-case alphabets of English language.
• [A-Z] is all upper-case alphabets of English language.
• [0-9] is all natural digits used in mathematics.
Precedence and Associativity
• *, concatenation (.), and | (pipe sign) are left associative
• * has the highest precedence
• Concatenation (.) has the second highest precedence.
• | (pipe sign) has the lowest precedence of all.
Write the regular expression for the language accepting all the string
containing any number of a's and b’s.
Solution: The regular expression will be: 1. r.e. = (a + b)*
This will give the set as L = {ε, a, aa, b, bb, ab, ba, aba, bab, .....}, any
combination of a and b.
The (a + b)* shows any combination with a and b even a null string.
Write the regular expression for the language accepting all
combinations of a's except the null string, over the set ∑ = {a}
Solution: The regular expression has to be built for the language
L = {a, aa, aaa, ....} This set indicates that there is no null string.
So we can denote regular expression as: R = a+
Write the regular expression for the language accepting all
combinations of a's, over the set ∑ = {a}
Solution: All combinations of a's means a may be zero, single, double
and so on. If a is appearing zero times, that means a null string. That is
we expect the set of {ε, a, aa, aaa, ....}.
So we give a regular expression for this as: R = a*
That is Kleen closure of a
Write the regular expression for the language accepting all the string
which are starting with 1 and ending with 0, over ∑ = {0, 1}.
Solution: In a regular expression, the first symbol should be 1, and the
last symbol should be 0.
There is as follows: 1. R = 1 (0+1)* 0
Write the regular expression for the language starting and ending with
a and having any having any combination of b's in between.
Solution: The regular expression will be:
R = a b* a
Write the regular expression for the language starting with a but not
having consecutive b’s.
Solution: The regular expression has to be built for the language:
L = {a, aba, aab, aba, aaa, abab, .....}
The regular expression for the above language is: R = {a + ab}*
Regular Expression and Regular
Languages
• The language accepted by finite automata can be easily described by simple
expressions called Regular Expressions.
• It is the most effective way to represent any language.
• The languages accepted by some regular expression are referred to as Regular
languages.
• A regular expression can also be described as a sequence of pattern that defines
a string.
• Regular expressions are used to match character combinations in strings.
• String searching algorithm used this pattern to find the operations on a string.
Finite automata
• Finite automata is a state machine that takes a string of symbols as
input and changes its state accordingly.
• Finite automata is a recognizer for regular expressions.
• When a regular expression string is fed into finite automata, it
changes its state for each literal.
• If the input string is successfully processed and the automata reaches
its final state, it is accepted, i.e., the string just fed was said to be a
valid token of the language in hand.
Finite automata
The mathematical model of finite automata consists of:
• Finite set of states (Q)
• Finite set of input symbols (Σ)
• One Start state (q0)
• Set of final states (qf)
• Transition function (δ)
Finite automata Construction
Let L(r) be a regular language recognized by some finite automata (FA).
• States : States of FA are represented by circles. State names are of the state is written
inside the circle.
• Start state : The state from where the automata starts, is known as start state. Start state
has an arrow pointed towards it.
• Intermediate states : All intermediate states has at least two arrows; one pointing to and
another pointing out from them.
• Final state : If the input string is successfully parsed, the automata is expected to be in this
state. Final state is represented by double circles. It may have any number of arrows
pointing to it and any number of arrows pointing out from it.
• Transition : The transition from one state to another state happens when a desired symbol
in the input is found. Upon transition, automata can either move to next state or stay in
the same state.
Movement from one state to another is shown as a directed arrow, where the arrows
points to the destination state. If automata stays on the same state, an arrow pointing
from a state to itself is drawn.
Finite automata Construction
Regular expressions
a
b
a*
ab
a|b
a?
ab*
(ab|cd)*
a(b|c)*d
a(b|a)*b

Unit Ii
No ratings yet
Unit Ii
25 pages
TCS Lect 4 Regular Expression Part 1 PDF
No ratings yet
TCS Lect 4 Regular Expression Part 1 PDF
23 pages
Theory of Automata RE 3
No ratings yet
Theory of Automata RE 3
13 pages
Vision 2023 Toc Chapter 3 Regular Expression 59
No ratings yet
Vision 2023 Toc Chapter 3 Regular Expression 59
8 pages
Regular Expression
No ratings yet
Regular Expression
89 pages
TPL Lect 15 - 16
No ratings yet
TPL Lect 15 - 16
5 pages
Unit 2
No ratings yet
Unit 2
53 pages
Chapter 3 RE
No ratings yet
Chapter 3 RE
19 pages
Theory of Automata Lecture#3: by Riaz Ahmad Ziar R.ziar@kardan - Edu.af
No ratings yet
Theory of Automata Lecture#3: by Riaz Ahmad Ziar R.ziar@kardan - Edu.af
19 pages
Regular Expressions
No ratings yet
Regular Expressions
4 pages
Regular Expression
No ratings yet
Regular Expression
18 pages
Understanding Regular Expressions in Languages
No ratings yet
Understanding Regular Expressions in Languages
23 pages
Regular Expressions and Their Applications
No ratings yet
Regular Expressions and Their Applications
68 pages
Unit 2
No ratings yet
Unit 2
135 pages
Lecture 3, 4
No ratings yet
Lecture 3, 4
33 pages
RE - Basics
No ratings yet
RE - Basics
26 pages
Bcs503 Module 2
No ratings yet
Bcs503 Module 2
46 pages
Theory of Automata and Formal Languages
No ratings yet
Theory of Automata and Formal Languages
24 pages
Regular Languages and Regular Grammars
No ratings yet
Regular Languages and Regular Grammars
20 pages
Absent Tha
No ratings yet
Absent Tha
33 pages
1 Regular Expression
No ratings yet
1 Regular Expression
6 pages
Chapter 2 RegularExpressions
No ratings yet
Chapter 2 RegularExpressions
95 pages
Chapter Two
No ratings yet
Chapter Two
59 pages
Regular Expressions in Automata Theory
No ratings yet
Regular Expressions in Automata Theory
28 pages
Atcd Module 2 2021 Scheme
No ratings yet
Atcd Module 2 2021 Scheme
56 pages
AT&CD Unit 1
No ratings yet
AT&CD Unit 1
19 pages
Regular Expressions and Identities Explained
No ratings yet
Regular Expressions and Identities Explained
70 pages
Regular Expressions and Finite Automata
No ratings yet
Regular Expressions and Finite Automata
95 pages
Regular Expressions and Regular Languages
No ratings yet
Regular Expressions and Regular Languages
5 pages
Chapter 3
No ratings yet
Chapter 3
10 pages
Automata Lectuee3
No ratings yet
Automata Lectuee3
27 pages
Regular Expression
No ratings yet
Regular Expression
17 pages
Unit I
No ratings yet
Unit I
37 pages
Reguler Language and Reguler Expression
No ratings yet
Reguler Language and Reguler Expression
4 pages
2.0+regular Expression Part 1 MKN
No ratings yet
2.0+regular Expression Part 1 MKN
33 pages
Chap-2 2 (RegularExpression)
No ratings yet
Chap-2 2 (RegularExpression)
46 pages
Toc Unit 2
No ratings yet
Toc Unit 2
29 pages
Automata & Regular Expressions Guide
No ratings yet
Automata & Regular Expressions Guide
52 pages
Theory of Automata Lecture#2: by Riaz Ahmad Ziar R.ziar@kardan - Edu.af
No ratings yet
Theory of Automata Lecture#2: by Riaz Ahmad Ziar R.ziar@kardan - Edu.af
22 pages
TOC Unit2
No ratings yet
TOC Unit2
87 pages
TOC SY Unit-3
No ratings yet
TOC SY Unit-3
80 pages
Compiler - Chap.2.part 3
No ratings yet
Compiler - Chap.2.part 3
85 pages
Automata Module 2
No ratings yet
Automata Module 2
69 pages
Regular Expressions in Compiler Design
No ratings yet
Regular Expressions in Compiler Design
25 pages
Unit 3 - Regular Expression
No ratings yet
Unit 3 - Regular Expression
45 pages
Regular Expression
No ratings yet
Regular Expression
4 pages
Atcd Unit-2 PDF
No ratings yet
Atcd Unit-2 PDF
21 pages
Regular Expression Notes
No ratings yet
Regular Expression Notes
46 pages
Introduction to Alphabets and Regular Expressions
No ratings yet
Introduction to Alphabets and Regular Expressions
21 pages
Lec 4
No ratings yet
Lec 4
16 pages
Finite State Machines & Regular Expressions
No ratings yet
Finite State Machines & Regular Expressions
14 pages
Chapter 4
No ratings yet
Chapter 4
31 pages
Regular Expressions
No ratings yet
Regular Expressions
31 pages
21CS51 ATCD MODULE 2 - 1 Regular Expressions
No ratings yet
21CS51 ATCD MODULE 2 - 1 Regular Expressions
148 pages
Specification of Tokens
No ratings yet
Specification of Tokens
21 pages
WINSEM2024-25 BCSE304L TH VL2024250501647 2025-01-24 Reference-Material-I
No ratings yet
WINSEM2024-25 BCSE304L TH VL2024250501647 2025-01-24 Reference-Material-I
42 pages
Theory of Computation: Dr. Krishnendu Rarhi E: Krishnendu.e9621@cumail - in
No ratings yet
Theory of Computation: Dr. Krishnendu Rarhi E: Krishnendu.e9621@cumail - in
44 pages
Unit 6 Code Generation - 1 - 1708946443942
No ratings yet
Unit 6 Code Generation - 1 - 1708946443942
63 pages
Parser First and Follow
No ratings yet
Parser First and Follow
28 pages
Finite Automata
No ratings yet
Finite Automata
16 pages
LEX and YACC
No ratings yet
LEX and YACC
32 pages
Bottom Up Parser
No ratings yet
Bottom Up Parser
15 pages
Computer Science Resume Overview
No ratings yet
Computer Science Resume Overview
1 page
PCC-CS402: Computer Architecture Code: Contacts: 3L Computer Architecture
No ratings yet
PCC-CS402: Computer Architecture Code: Contacts: 3L Computer Architecture
2 pages
Difference Between JVM and DVM: DVM (Dalvik Virtual Machine)
No ratings yet
Difference Between JVM and DVM: DVM (Dalvik Virtual Machine)
7 pages
Syllabus OOPJ
No ratings yet
Syllabus OOPJ
2 pages
Data Structures: Searching & Sorting Algorithms
No ratings yet
Data Structures: Searching & Sorting Algorithms
3 pages
Form 2 Computer Science
No ratings yet
Form 2 Computer Science
9 pages
RSA 32-Bit Encryption Implementation
No ratings yet
RSA 32-Bit Encryption Implementation
6 pages
12
No ratings yet
12
31 pages
2019 PLE Mathematics Exam Paper
No ratings yet
2019 PLE Mathematics Exam Paper
16 pages
Encryption & Decryption (Source Code)
No ratings yet
Encryption & Decryption (Source Code)
3 pages
B.Tech IT: OOP Assignments
No ratings yet
B.Tech IT: OOP Assignments
6 pages
Project Theseas
No ratings yet
Project Theseas
44 pages
Section 15 Plomberie 2 Mai 2016
No ratings yet
Section 15 Plomberie 2 Mai 2016
17 pages
AI Developer Profile: Georges Bejjani
No ratings yet
AI Developer Profile: Georges Bejjani
2 pages
Ram - Random Access Memory:: What Is RAM Structure, Explain With Block Diagram?
No ratings yet
Ram - Random Access Memory:: What Is RAM Structure, Explain With Block Diagram?
2 pages
SAC - Logical and Mathematical API Functions
No ratings yet
SAC - Logical and Mathematical API Functions
3 pages
Discrete and Combinatorics - Math 2052-Hand Out
No ratings yet
Discrete and Combinatorics - Math 2052-Hand Out
158 pages
Trees
No ratings yet
Trees
35 pages
Addressing Modes of 8051 Microcontroller
No ratings yet
Addressing Modes of 8051 Microcontroller
5 pages
Building Data-Driven Apps with Danfo.js
No ratings yet
Building Data-Driven Apps with Danfo.js
62 pages
Unit-3. Context Free Grammar
No ratings yet
Unit-3. Context Free Grammar
68 pages
Return Values Take A Closure Walk - Martin Troxler
No ratings yet
Return Values Take A Closure Walk - Martin Troxler
18 pages
Data Compression and Data Retrieval 2161603: Department of CE / IT - 07 / 16
No ratings yet
Data Compression and Data Retrieval 2161603: Department of CE / IT - 07 / 16
18 pages
Computer Science Core 2
No ratings yet
Computer Science Core 2
2 pages
Assignment-1: Name of Student:Patekar Umesh B. Batch:D3 Branch:Computer Roll No77 Problem Statement
No ratings yet
Assignment-1: Name of Student:Patekar Umesh B. Batch:D3 Branch:Computer Roll No77 Problem Statement
3 pages
Allama Iqbal Open University, Islamabad Warning: (Department of Computer Science)
No ratings yet
Allama Iqbal Open University, Islamabad Warning: (Department of Computer Science)
5 pages
DAy 2 DSA Vivek
No ratings yet
DAy 2 DSA Vivek
14 pages
Class XI CS Work Sheet
No ratings yet
Class XI CS Work Sheet
4 pages
C++ Operators: Types and Examples
No ratings yet
C++ Operators: Types and Examples
38 pages

Regular Expressions

Uploaded by

Regular Expressions

Uploaded by

Compiler Design

Count no of token in following C code

• Concatenation of two languages L and M is written as

• The Kleene Closure of a language L is written as

You might also like