Structure of a Lex Program

Lex code

Uploaded by

Athul murali T

STRUCTURE OF LEX PROGRAM

A Lex program consists of three sections, separated by a line consisting of two
percent signs (%%):

1. Definition Section
2. Rules Section
3. Auxiliary Section

Definition section
%%
Rules section
%%
Auxiliary section

The first two sections are necessary, even if they are empty. The third part and the
preceding %% line may be omitted.
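
As an illustration of the layout above, here is a minimal sketch of a complete three-section program. The rule that prints numbers is only an assumption chosen to make the skeleton do something visible; it is not taken from the original document.

```lex
%{
/* Definition section: C code copied verbatim into the generated scanner. */
#include <stdio.h>
%}

%%
[0-9]+    { /* Rules section: a pattern followed by its action. */
            printf("NUMBER(%s)\n", yytext); }
.|\n      { /* ignore every other character */ }
%%

/* Auxiliary section: ordinary C functions. */
int yywrap(void) { return 1; }   /* report that there is no further input */

int main(void)
{
    yylex();                     /* scan standard input until end of file */
    return 0;
}
```

Assuming Flex and a C compiler are installed, this can be built with `flex skeleton.l` followed by `cc lex.yy.c -o skeleton` (the file name skeleton.l is hypothetical).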

Definition Section

The Definition Section sets up the environment for the generated lexer and for
the Flex tool itself. It may be empty. This section helps in two ways:

1. Environment for the Lexer:

○ Contains C code such as #include directives and global declarations.
○ This code is enclosed by %{ and %} and is copied as-is into the generated
C file.
2. Environment for Flex Tool:
○ Provides declarations of simple name definitions to simplify scanner
specifications.
○ Declares start conditions.
○ Sets tool options (for example, %option lines) that control how Flex
converts the Lex specification into the lexical analyzer.
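
A minimal sketch of a definition section that uses each of these elements is shown below. The names DIGIT, WS, COMMENT and comment_lines are hypothetical, and the comment-skipping rules are an assumption added only so that the start condition has a use.

```lex
%{
/* C declarations: copied verbatim to the top of the generated C file. */
#include <stdio.h>
int comment_lines = 0;   /* hypothetical global counter */
%}

/* Name definitions: a name, whitespace, then a regular expression. */
DIGIT    [0-9]
WS       [ \t]+

/* An exclusive start condition, used by the rules below. */
%x COMMENT

%%
"/*"             { BEGIN(COMMENT); }
<COMMENT>"*/"    { BEGIN(INITIAL); }
<COMMENT>\n      { comment_lines++; }
<COMMENT>.       { /* skip comment text */ }
{DIGIT}+         { printf("number: %s\n", yytext); }
{WS}|.|\n        { /* ignore everything else */ }
%%
int yywrap(void) { return 1; }

int main(void)
{
    yylex();
    printf("comment lines: %d\n", comment_lines);
    return 0;
}
```

Because COMMENT is declared with %x, only the rules prefixed with <COMMENT> are active while the scanner is inside a comment.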

Rules Section

The Rules Section contains the patterns and actions that define the Lex
specifications:

● Patterns:
○ Written as regular expressions.
● Actions:
○ Enclosed in braces {}.
○ Contain normal C language statements.
○ When a pattern is matched, the corresponding action is invoked.
● Matching:
○ The lexer tries to match the largest possible string. If two rules match
the same length, the lexer uses the first rule to invoke its corresponding
action.
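
The two matching rules (longest match, then first rule on a tie) can be seen in a small sketch. The keyword "if" and the token labels are assumptions chosen for illustration.

```lex
%{
#include <stdio.h>
%}
%%
"if"        { printf("KEYWORD: %s\n", yytext); }
[a-z]+      { printf("IDENT: %s\n", yytext); }
[ \t\n]+    { /* skip whitespace */ }
.           { printf("OTHER: %s\n", yytext); }
%%
int yywrap(void) { return 1; }

int main(void)
{
    yylex();
    return 0;
}
```

For the input `if iffy`, the text `if` matches both of the first two rules with the same length, so the first rule wins and KEYWORD is reported. The text `iffy` is reported as IDENT, because `[a-z]+` matches four characters while `"if"` matches only two.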

Auxiliary Section

This section contains user-defined C functions (subroutines), including the main()
function from where execution begins. These functions are copied as-is to the lexical
analyzer C file by Flex.
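
A minimal sketch of an auxiliary section follows. The helper report() is hypothetical; it is declared in the definition section so that the actions can call it, and defined after the second %% together with yywrap() and main().

```lex
%{
#include <stdio.h>
/* Hypothetical helper, defined in the auxiliary section below. */
static void report(const char *kind, const char *text);
%}
%%
[0-9]+      { report("number", yytext); }
[a-zA-Z]+   { report("word", yytext); }
.|\n        { /* ignore everything else */ }
%%
/* Everything from here on is copied unchanged into lex.yy.c. */
static void report(const char *kind, const char *text)
{
    printf("%s: %s\n", kind, text);
}

int yywrap(void) { return 1; }

int main(void)
{
    yylex();   /* execution starts here and hands control to the scanner */
    return 0;
}
```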

Lex Variables

● yyin:
○ Type: FILE*
○ Points to the current input file.
● yyout:
○ Type: FILE*
○ Points to the output location.
● yytext:
○ Holds the text of the most recently matched pattern.
● yyleng:
○ Gives the length of the matched text in yytext.
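
The four variables can be seen together in a short sketch. It assumes that an optional file name is passed as the first command-line argument; that convention is not from the original document.

```lex
%{
#include <stdio.h>
%}
%%
[a-zA-Z]+   { /* yytext is the matched word, yyleng is its length */
              fprintf(yyout, "%s (%d)\n", yytext, (int)yyleng); }
.|\n        { /* ignore everything else */ }
%%
int yywrap(void) { return 1; }

int main(int argc, char *argv[])
{
    if (argc > 1) {
        /* Redirect the scanner from standard input to the named file. */
        yyin = fopen(argv[1], "r");
        if (yyin == NULL) {
            perror(argv[1]);
            return 1;
        }
    }
    yylex();
    return 0;
}
```

The cast of yyleng to int is there because its exact integer type differs between Lex implementations.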

Lex Functions

● yywrap():
○ Called when the end of the input file is encountered.
○ Can be used to continue scanning with further input files.
● yyless(int n):
○ Pushes back all but the first n characters of the current token so that
they are scanned again.
● yymore():
○ Tells the lexer to append the next matched text to the current contents
of yytext instead of replacing them.
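
A small sketch of push-back and accumulation is given below. It uses yyless(n), the Lex routine that returns all but the first n characters of a token to the input, together with yymore(). The literal strings "mega-", "kludge" and "foobar" are arbitrary examples.

```lex
%{
#include <stdio.h>
%}
%%
"mega-"     { yymore();   /* keep this text; the next match is appended */ }
"kludge"    { printf("token: %s\n", yytext); }
"foobar"    { printf("matched: %s\n", yytext);
              yyless(3);  /* keep "foo", push "bar" back to the input */
              printf("kept: %s\n", yytext); }
[a-z]+      { printf("word: %s\n", yytext); }
.|\n        { /* ignore everything else */ }
%%
int yywrap(void) { return 1; }

int main(void)
{
    yylex();
    return 0;
}
```

With the input `mega-kludge`, the second rule should see the accumulated text `mega-kludge` in yytext. With the input `foobar`, the pushed-back `bar` is scanned again and reported by the `[a-z]+` rule.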

Lex Macros

● letter: [a-zA-Z]
● digit: [0-9]
● identifier: {letter}({letter}|{digit})*
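
In a Lex file these macros are written as name definitions in the definition section and referenced in braces. The sketch below assumes the identifier definition ends with the closure operator *, so that identifiers of any length are matched; the number rule is an added assumption for contrast.

```lex
%{
#include <stdio.h>
%}
letter      [a-zA-Z]
digit       [0-9]
identifier  {letter}({letter}|{digit})*
%%
{identifier}   { printf("identifier: %s\n", yytext); }
{digit}+       { printf("number: %s\n", yytext); }
.|\n           { /* ignore everything else */ }
%%
int yywrap(void) { return 1; }

int main(void)
{
    yylex();
    return 0;
}
```

For the input `x1 42`, this reports `x1` as an identifier and `42` as a number.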
INTRODUCTION

The function of Lex is as follows:

❖ First, a specification of the lexical analyzer is written as a program lex.l in
the Lex language. The Lex compiler then translates lex.l into a C program,
lex.yy.c.
❖ Finally, the C compiler compiles lex.yy.c and produces an object program
a.out.
❖ a.out is the lexical analyzer that transforms an input stream into a sequence of
tokens.

yylex(): The main Lex function that performs lexical analysis and matches patterns in the
input.
yytext: A pointer to the matched text (a string) for the current pattern.
yyleng: The length of the matched text in yytext.
yyin: A file pointer that indicates the input stream (defaults to stdin).
yyout: A file pointer for output (defaults to stdout).
yywrap(): A function called when the end of input is reached; by default, returns 1 to
indicate end of input.
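
As a sketch of how a user-supplied yywrap() can replace the default behavior, the program below counts lines across several files named on the command line. The variables remaining and lines are hypothetical, and the whole program is an illustration rather than part of the original handout.

```lex
%{
#include <stdio.h>
static char **remaining;   /* hypothetical: file names not yet scanned */
static int lines = 0;
%}
%%
\n      { lines++; }
.       { /* ignore all other characters */ }
%%
/* Open the next readable file; return 0 to continue scanning, 1 to stop. */
int yywrap(void)
{
    if (yyin != NULL && yyin != stdin)
        fclose(yyin);
    while (*remaining != NULL) {
        const char *name = *remaining++;
        yyin = fopen(name, "r");
        if (yyin != NULL)
            return 0;      /* yyin now points at the next file */
        perror(name);
    }
    return 1;              /* no more input */
}

int main(int argc, char *argv[])
{
    (void)argc;
    remaining = argv + 1;  /* argv is terminated by a NULL pointer */
    if (yywrap() == 0)     /* open the first file, if there is one */
        yylex();
    printf("total lines: %d\n", lines);
    return 0;
}
```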
Experiment-1a: LEX program to count the number of lines, words
and characters in its input.

Countchlw.l
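
The original listing for Countchlw.l is not reproduced in this copy. The following is a minimal sketch of one way to write such a program, not the author's code. It assumes input comes from standard input and that a word is any run of characters other than space, tab and newline.

```lex
%{
#include <stdio.h>
int lines = 0, words = 0, chars = 0;
%}
%%
\n          { lines++; chars++; }
[^ \t\n]+   { words++; chars += (int)yyleng; }
.           { chars++;   /* a single space or tab */ }
%%
int yywrap(void) { return 1; }

int main(void)
{
    yylex();
    printf("lines: %d, words: %d, characters: %d\n", lines, words, chars);
    return 0;
}
```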

Experiment-1b: LEX program to count number of words, lines
and characters from file
Countfilech.l
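
The original listing for Countfilech.l is not reproduced in this copy. A minimal sketch, assuming the file name is passed as the only command-line argument, could look like this. It is not the author's code.

```lex
%{
#include <stdio.h>
int lines = 0, words = 0, chars = 0;
%}
%%
\n          { lines++; chars++; }
[^ \t\n]+   { words++; chars += (int)yyleng; }
.           { chars++;   /* a single space or tab */ }
%%
int yywrap(void) { return 1; }

int main(int argc, char *argv[])
{
    if (argc != 2) {
        fprintf(stderr, "usage: %s file\n", argv[0]);
        return 1;
    }
    yyin = fopen(argv[1], "r");   /* scan the file instead of stdin */
    if (yyin == NULL) {
        perror(argv[1]);
        return 1;
    }
    yylex();
    fclose(yyin);
    printf("lines: %d, words: %d, characters: %d\n", lines, words, chars);
    return 0;
}
```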

Experiment-2: LEX program to identify and Count Positive and
Negative Numbers.

Countnp.l
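
The original listing for Countnp.l is not reproduced in this copy. The sketch below is one possible version, not the author's code. It assumes numbers are integers or simple decimals, that a leading "-" marks a negative number, and, as a simplification, that zero is counted as positive.

```lex
%{
#include <stdio.h>
int positive = 0, negative = 0;
%}
%%
"-"[0-9]+(\.[0-9]+)?     { negative++; }
"+"?[0-9]+(\.[0-9]+)?    { positive++; }
.|\n                     { /* ignore everything else */ }
%%
int yywrap(void) { return 1; }

int main(void)
{
    yylex();
    printf("positive: %d, negative: %d\n", positive, negative);
    return 0;
}
```

Note that under these rules an expression such as `3-4` is read as the positive number 3 followed by the negative number -4.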

Experiment-3: LEX program to count the number of vowels and
consonants.
Countvc.l
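
The original listing for Countvc.l is not reproduced in this copy. A minimal sketch, not the author's code, can rely on rule order: a vowel matches both letter rules with the same length, so the first rule handles it and only the remaining letters reach the consonant rule.

```lex
%{
#include <stdio.h>
int vowels = 0, consonants = 0;
%}
%%
[aeiouAEIOU]   { vowels++; }
[a-zA-Z]       { consonants++; }
.|\n           { /* ignore non-letters */ }
%%
int yywrap(void) { return 1; }

int main(void)
{
    yylex();
    printf("vowels: %d, consonants: %d\n", vowels, consonants);
    return 0;
}
```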

Experiment-4: LEX program to remove space, tab or newline.

rmstn.l
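
The original listing for rmstn.l is not reproduced in this copy. One possible sketch, not the author's code, discards every run of spaces, tabs and newlines and copies all other characters to the output with the ECHO macro.

```lex
%{
#include <stdio.h>
%}
%%
[ \t\n]+    { /* discard spaces, tabs and newlines */ }
.           { ECHO;   /* copy every other character to yyout */ }
%%
int yywrap(void) { return 1; }

int main(void)
{
    yylex();
    return 0;
}
```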

Experiment-5: LEX program to find the length of a string.

strlen.l
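
The original listing for strlen.l is not reproduced in this copy. The sketch below, which is not the author's code, assumes that each input line is one string and reports its length from yyleng.

```lex
%{
#include <stdio.h>
%}
%%
[^\n]+      { printf("length of \"%s\" = %d\n", yytext, (int)yyleng); }
\n          { /* the end of a line separates strings */ }
%%
int yywrap(void) { return 1; }

int main(void)
{
    yylex();
    return 0;
}
```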


Common questions

What is the role of yytext, and how is it used in a Lex program?

yytext holds the text of the currently matched pattern (the lexeme). It is how a Lex program captures the matched input string during lexical analysis: the action that follows a pattern match can read or manipulate the text in yytext.

In what way does the yylex function serve as the main driver of a Lex program?

The yylex function is the central component of a Lex program and drives the lexical analysis. It reads the input, finds the longest match among the patterns defined in the rules section, and executes the associated action. yylex keeps scanning in a loop until the input is exhausted or an action returns, so the input stream is turned into a sequence of tokens as specified by the Lex rules.

How does a Lex program handle counting tasks, such as counts of lines, words, and characters, in an input file?

A Lex program handles counting tasks by defining patterns and corresponding actions in the Rules Section. Separate patterns match lines, words, and characters, and each action increments a counter on every match. Typically a newline pattern counts lines, a pattern for a run of non-space characters counts words, and a pattern for any single character counts the remaining characters. The accumulated counts are printed once scanning is finished.

Describe how a Lex program transforms an input stream into a sequence of tokens.

The transformation happens in stages. First, a lexical analyzer specification, typically named lex.l, is written in the Lex language. The Lex compiler processes this file and produces a C program, lex.yy.c. The C compiler then compiles lex.yy.c into an object program, usually named a.out. This object program is the lexical analyzer: it reads the input stream and uses the defined patterns and actions to identify and process tokens.

What is the purpose of the Definition Section in a Lex program and how is it structured?

The Definition Section sets up the environment needed to run the Lex program. It contains C code such as global declarations, enclosed by %{ and %}. It also holds name definitions, start condition declarations, and tool options that help Flex convert the Lex specification into a lexical analyzer. The section can be empty, but when present it sets up the environment for both the lexer and the Flex tool.

What is the significance of Lex macros, such as the "Letter" and "Digit" macros, in the pattern definitions of a Lex program?

Lex macros such as "letter" (defined as [a-zA-Z]) and "digit" (defined as [0-9]) name frequently used regular expressions. They can be reused across multiple patterns, which improves readability and maintainability by reducing repetition in the rules section. For example, the "identifier" macro uses "letter" and "digit" to express the pattern for variable names in programming languages.

In what scenarios would the auxiliary section of a Lex program be utilized?

The auxiliary section holds user-defined C functions or subroutines that the lexical analyzer needs. For instance, it includes the main() function from which program execution begins. It is useful whenever logic or computation beyond pattern matching is required. The contents of this section are copied directly into the lexical analyzer C file generated by Flex.

How do patterns and actions function in the Rules Section of a Lex program?

In the Rules Section, patterns and actions define the Lex specification. Patterns are regular expressions, and the lexer matches the largest possible string in the input. When a pattern is matched, the corresponding action, enclosed in braces {}, is executed. The action consists of normal C language statements. If two patterns match strings of the same length, the lexer uses the rule that appears first.

Why is the yywrap function important in a Lex program, and what default behavior does it exhibit?

The yywrap function is invoked when the end of an input file is reached. By default, yywrap returns 1, indicating that the input has ended, which tells the lexer to stop reading. yywrap can also be customized to handle multiple input files: it points yyin at the next file and returns 0 so that scanning continues.

Explain how the yymore function is used in a Lex program and provide a potential use case scenario.

The yymore function accumulates text from multiple pattern matches into a single lexeme. When an action calls yymore, the text of the next match is appended to the current contents of yytext rather than replacing them. This is useful for multi-stage matching, such as assembling compound tokens or concatenating lines of input before processing them together.
