Role of Lexical Analyzer_Input Buffering

The document provides an overview of the role of the lexical analyzer in a compiler, which includes reading source program characters, grouping them into lexemes, and producing tokens for syntax analysis. It also discusses input buffering techniques to enhance processing efficiency and outlines error recovery methods for lexical errors. Key concepts such as tokens, patterns, and lexemes are defined, along with the use of buffer pairs and sentinel characters in input handling.

Uploaded by

Subashini Hari Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views11 pages

Role of Lexical Analyzer_Input Buffering

Uploaded by

Subashini Hari Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 11

UNIT I

INTRODUCTION TO COMPILERS

Lexical Analysis
Role of Lexical Analyzer
Input Buffering
The Role of the Lexical Analyzer
• First phase of a compiler
• Read the input characters of the source
program, group them into lexemes, and
produce as output a sequence of tokens for
each lexeme in the source program
• The stream of tokens is sent to the parser for
syntax analysis
• When discovers a lexeme constituting an
identifier, it enters that lexeme into the
symbol table
The Role of the Lexical Analyzer

• getNextToken command
– Causes the lexical analyzer to read characters from its
input until it can identify the next lexeme and produce for
it the next token, which it returns to the parser
The Role of the Lexical Analyzer
• Other tasks
– Stripping out comments and whitespace
– Correlating error messages generated by the
compiler with the source program
• Associate a line number with each error message
– Expansion of macros
The Role of the Lexical Analyzer
Tokens, Patterns and Lexemes
• Token
– A pair consisting of a token name and an optional attribute value
• The token name is an abstract symbol representing a kind of lexical unit
– E.g.: keyword, identifier.
• The token names are the input symbols that the parser processes
• Pattern
– A description of the form that the lexemes of a token may take
• Keyword - the pattern is sequence of characters that form the keyword
• Identifiers - the pattern is matched by many strings
• Lexeme
– A sequence of characters in the source program that matches the
pattern for a token and is identified by the lexical analyzer as an
instance of that token
The Role of the Lexical Analyzer
Lexical Errors
• Panic mode recovery
– Delete successive characters from the remaining
input, until the lexical analyzer can find a well-
formed token at the beginning of what input is left
• Other possible error-recovery actions are:
– Delete one character from the remaining input
– Insert a missing character into the remaining input
– Replace a character by another character
– Transpose two adjacent characters
Input Buffering
• Speed reading the source program
• Two-buffer scheme to handle large
lookaheads
Input Buffering
Buffer Pairs
• Involves two buffers that are alternately
reloaded
– To reduce the amount of overhead required to
process a single input character
• Each buffer is of the same size N
– N is usually the size of a disk block
• Two pointers to the input are maintained:
– lexemeBegin
• Marks the beginning of the current lexeme
– forward
•
Input Buffering
Buffer Pairs

• Once the next lexeme is determined, forward is set

to the character at its right end
• After the lexeme is recorded, lexemeBegin is set to
the character immediately after the lexeme just
found
• If end of one buffer is reached the other buffer is
reloaded from the input, and forward is moved to
the beginning of the newly loaded buffer
Input Buffering
Sentinels
• To combine the buffer-end test with the test for
the current character, each buffer to hold a
sentinel character at the end
– A special character that cannot be part of the source
program, and a natural choice is the character eof
• eof that appears other than at the end of a
buffer means that the end of input

NEB-2000C (EPIRB) User's Manual - 20170921 V3 0
100% (2)
NEB-2000C (EPIRB) User's Manual - 20170921 V3 0
31 pages
Pablo Vazquez Charging Documents
No ratings yet
Pablo Vazquez Charging Documents
3 pages
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
Primanila Plans VS Sec GR 193791 August 6, 2014 CS
100% (1)
Primanila Plans VS Sec GR 193791 August 6, 2014 CS
3 pages
Durian Production Guide
50% (2)
Durian Production Guide
26 pages
8 Business Intelligence Tools and Techniques
No ratings yet
8 Business Intelligence Tools and Techniques
21 pages
CD Unit I Part II Lexical Analysis
No ratings yet
CD Unit I Part II Lexical Analysis
58 pages
2.1 - Lexical Analysis
No ratings yet
2.1 - Lexical Analysis
102 pages
Lecture 04 05 PDF
No ratings yet
Lecture 04 05 PDF
8 pages
compiler_design- Module2-print
No ratings yet
compiler_design- Module2-print
16 pages
Compiler Design: Lexical Analysis
No ratings yet
Compiler Design: Lexical Analysis
68 pages
Compiler - 2
No ratings yet
Compiler - 2
15 pages
Unit 2 Lexical Analyzer
No ratings yet
Unit 2 Lexical Analyzer
30 pages
Unit 01 - PART 2
No ratings yet
Unit 01 - PART 2
25 pages
cd1
No ratings yet
cd1
92 pages
@CD_ch2 compiler design
No ratings yet
@CD_ch2 compiler design
26 pages
lec 02
No ratings yet
lec 02
17 pages
Ch2_Lexical Analysis (2)
No ratings yet
Ch2_Lexical Analysis (2)
71 pages
Lexical Analysis
No ratings yet
Lexical Analysis
5 pages
Lexical Analysis: Deterministic Finite Automata
No ratings yet
Lexical Analysis: Deterministic Finite Automata
37 pages
UNIT 2 Compiler Design
No ratings yet
UNIT 2 Compiler Design
23 pages
cd UNIT-1
No ratings yet
cd UNIT-1
60 pages
CD - Module 2
No ratings yet
CD - Module 2
12 pages
Comp Chap2
No ratings yet
Comp Chap2
36 pages
Compiler Construction: Chapter # 2 - Lexical Analysis Instructor: Ms. Raazia Sosan
No ratings yet
Compiler Construction: Chapter # 2 - Lexical Analysis Instructor: Ms. Raazia Sosan
53 pages
Lexical Analysis
No ratings yet
Lexical Analysis
14 pages
Unit2
No ratings yet
Unit2
61 pages
Ch2_Lexical Analysis
No ratings yet
Ch2_Lexical Analysis
71 pages
Chapter 2
No ratings yet
Chapter 2
6 pages
SPCC Module 5 Lect 2 Lexical Analysis Part 1
No ratings yet
SPCC Module 5 Lect 2 Lexical Analysis Part 1
16 pages
Lexical Analysis: Risul Islam Rasel
No ratings yet
Lexical Analysis: Risul Islam Rasel
148 pages
CD Aii Partb Ans
No ratings yet
CD Aii Partb Ans
8 pages
CD - CH2 - Lexical Analysis
No ratings yet
CD - CH2 - Lexical Analysis
67 pages
HW_31712
No ratings yet
HW_31712
22 pages
Bunk Class
No ratings yet
Bunk Class
21 pages
EXP5SPCC
No ratings yet
EXP5SPCC
6 pages
Compiler Design Chapter 2
No ratings yet
Compiler Design Chapter 2
14 pages
Chapter Two-Lexical Analysis
No ratings yet
Chapter Two-Lexical Analysis
4 pages
Compiler
No ratings yet
Compiler
4 pages
Lexical Analyzer
No ratings yet
Lexical Analyzer
17 pages
Chapter 2 - Lexical Analyser
No ratings yet
Chapter 2 - Lexical Analyser
40 pages
Lecture 3
No ratings yet
Lecture 3
4 pages
Lexical Analysis
No ratings yet
Lexical Analysis
45 pages
CD - CH2 - Lexical Analysis
No ratings yet
CD - CH2 - Lexical Analysis
59 pages
Lexical Analysis
No ratings yet
Lexical Analysis
6 pages
Lexical Analysis - Compiler Design: Token, Pattern and Lexeme
No ratings yet
Lexical Analysis - Compiler Design: Token, Pattern and Lexeme
5 pages
Chapter 2 - Lexical Analyser
No ratings yet
Chapter 2 - Lexical Analyser
39 pages
Lexical Analyzer
No ratings yet
Lexical Analyzer
16 pages
Chapter 2 - Lexical Analyser
No ratings yet
Chapter 2 - Lexical Analyser
38 pages
Chapter 3 Lexical Analysis
No ratings yet
Chapter 3 Lexical Analysis
5 pages
Compiler - Lexical Analysis
No ratings yet
Compiler - Lexical Analysis
17 pages
Chapter 2 - Lexical Analysis (1) (1)
No ratings yet
Chapter 2 - Lexical Analysis (1) (1)
48 pages
Chapter 2 Lexical Analysis
No ratings yet
Chapter 2 Lexical Analysis
14 pages
Chapter 2 Lexical Analysis
No ratings yet
Chapter 2 Lexical Analysis
26 pages
Week 4 Lec 8 CC p1-1
No ratings yet
Week 4 Lec 8 CC p1-1
23 pages
Lexical Analysis in Compiler Design With Example
No ratings yet
Lexical Analysis in Compiler Design With Example
8 pages
21CS51 ATCD MODULE 2 - 2 Lexical Analyser Part1
No ratings yet
21CS51 ATCD MODULE 2 - 2 Lexical Analyser Part1
63 pages
Lexical Analyzer
No ratings yet
Lexical Analyzer
84 pages
Unit 2 Lexical Analysis
No ratings yet
Unit 2 Lexical Analysis
94 pages
Ch2 - Lexical Analysis
No ratings yet
Ch2 - Lexical Analysis
76 pages
Lexical Analyzer
No ratings yet
Lexical Analyzer
56 pages
Ch2 Lexical Analysis
No ratings yet
Ch2 Lexical Analysis
11 pages
ch-2 Compiler Design
No ratings yet
ch-2 Compiler Design
9 pages
Automata Theory and Compiler Design: Name: Smitha.A Usn: 1Vj21Cs042 Branch: Cse
No ratings yet
Automata Theory and Compiler Design: Name: Smitha.A Usn: 1Vj21Cs042 Branch: Cse
9 pages
Lecture 02
No ratings yet
Lecture 02
150 pages
HRM 340 Employees Recruitment and Selection
No ratings yet
HRM 340 Employees Recruitment and Selection
22 pages
S 1 Entrepreneurship Notes 2020
No ratings yet
S 1 Entrepreneurship Notes 2020
12 pages
SDS MERICAN_9505, 9505U, 9505P-LV
No ratings yet
SDS MERICAN_9505, 9505U, 9505P-LV
8 pages
DP-203 Updated Dumps - Data Engineering On Microsoft Azure
No ratings yet
DP-203 Updated Dumps - Data Engineering On Microsoft Azure
60 pages
Booster Pumps Multistage Vertical 60Hz
No ratings yet
Booster Pumps Multistage Vertical 60Hz
28 pages
Calacala v. Republic
No ratings yet
Calacala v. Republic
2 pages
lecture task
No ratings yet
lecture task
4 pages
Solving Word Problem Involving Multiplication and Addition O Subtraction of Decimals
No ratings yet
Solving Word Problem Involving Multiplication and Addition O Subtraction of Decimals
14 pages
Presentation On ERP Implementation
No ratings yet
Presentation On ERP Implementation
11 pages
Divisibility Rules For 3, 6 and 9 (3 Digit Numbers) (A)
No ratings yet
Divisibility Rules For 3, 6 and 9 (3 Digit Numbers) (A)
1 page
Cohen Sutherland Clipping Algorithm
No ratings yet
Cohen Sutherland Clipping Algorithm
6 pages
Banzai PDF
No ratings yet
Banzai PDF
4 pages
Impacts of IT On The Banking Sector of Bangladesh PDF
100% (1)
Impacts of IT On The Banking Sector of Bangladesh PDF
8 pages
SASB Investment - Banking - Brokerage - Standard - 2018
No ratings yet
SASB Investment - Banking - Brokerage - Standard - 2018
31 pages
GLICO LIFE ANNUAL REPORT - Web
No ratings yet
GLICO LIFE ANNUAL REPORT - Web
57 pages
Unraveling of A Fraud: © 2015 Pearson Education, Inc
No ratings yet
Unraveling of A Fraud: © 2015 Pearson Education, Inc
1 page
The Alternative To PWHT Temper Bead Welding
No ratings yet
The Alternative To PWHT Temper Bead Welding
64 pages
SIST-EN-15085-3-2023
No ratings yet
SIST-EN-15085-3-2023
15 pages
Acculturation Strategy, Accult
No ratings yet
Acculturation Strategy, Accult
198 pages
Lecture - 3 To 5 - Permeability-Rev
No ratings yet
Lecture - 3 To 5 - Permeability-Rev
79 pages
Paper 4-Analysis of the Impact of Different Parameters Setting
No ratings yet
Paper 4-Analysis of the Impact of Different Parameters Setting
6 pages
Colegio de Muntinlupa: The Use of Water Hyacinth As A Material in Making Biodegrabable Plastics
No ratings yet
Colegio de Muntinlupa: The Use of Water Hyacinth As A Material in Making Biodegrabable Plastics
11 pages
JANUARY,2025 AUTOMATION
No ratings yet
JANUARY,2025 AUTOMATION
22 pages
555 Timer Pinout
No ratings yet
555 Timer Pinout
1 page
Module 4 - Study Material - Overview of Predictive Analytics
No ratings yet
Module 4 - Study Material - Overview of Predictive Analytics
15 pages

Role of Lexical Analyzer_Input Buffering

Uploaded by

Role of Lexical Analyzer_Input Buffering

Uploaded by

UNIT I

• Once the next lexeme is determined, forward is set

You might also like