0% found this document useful (0 votes)

52 views

Lab Session 2 Arithmetic Expresion Grammar

The document discusses arithmetic expression grammars in ANTLRWorks. It begins with the objectives of handling arithmetic expressions and writing grammars. Various grammars are presented and modified to address issues like left recursion, operator precedence, and whitespace. Exercises are given to modify existing grammars to handle multiple expressions, precedence, ignore whitespace, and use parentheses. Students are required to submit the grammars for the exercises and parse trees for sample inputs.

Uploaded by

Gudako Chaotic-Evil

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

52 views

Lab Session 2 Arithmetic Expresion Grammar

Uploaded by

Gudako Chaotic-Evil

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 6

LAB SESSION 2

ARITHMETIC EXPRESION GRAMMAR

1. OBJECTIVE
The objectives of Lab 2 are (1) to write a grammar with ANTLRWorks and (2) to handle
arithmetic expression grammar.

2. EXPERIMENT

2.1 Your first grammar

In the first lab, we have tried to make ANTLRWorks recognize some patterns by the
means of regular expressions (RE). In fact, it is not very convenient, because
ANTLRWorks is not the tool intended for RE merely. ANTLRWorks only shows its true
power when working with actual grammar.

Let’s play with some grammars. To begin, define your first grammar in ANTLRWorks as
follows:

grammar G1;

e : INT '+' e| INT;

INT :'0'..'9'+;

So, what are the non-terminal set and the token set of the grammar?

In ANTLRWorks, if a rule has its left-hand side (LHS) beginning with an uppercase
letter, this rule defines a token. Otherwise, this rule is a production of the grammar, thus
its LHS is a non-terminal symbol.

Hence, in the above grammar, the token set is {INT} and the non-terminal set is {e}, the
production set is {e  INT ‘+’ e | INT}.

The language generated by this grammar is very simple: it is an addition of many

integers, for example: 25+3, 12+5+7, etc.
Since everything now has been indicated clearly in this grammar, ANTLRWorks can
generate the corresponding lexer and parser without any problem. So that we do not need
to bother about the lexer::header directive as in Lab 1. Just play with this grammar.

2.2 Revising your grammar

Unfortunately, even though now your grammar is defined precisely, it still cannot work.
Test your grammar with some inputs like 2+3 or 23+5, what you get are only some
erroneous trees.

ANTLRWork is not intended for all arbitrary grammars, but those of programming
languages. Remember that in most of programming languages, the statements must be
ended by a special character (like ‘;’ in C++ or Java). If you have chances to learn some
parsing algorithms in some appropriate courses like Compiler, you might know the
reason behind this secret. For now, just follow the tradition by revising your grammar as
follows.

grammar G2;

e : INT '+' e SEMI| INT SEMI;

INT :'0'..'9'+;

SEMI : ';';

Things are nice now. Your grammar should work perfectly with valid inputs like 2+3;
(do not forget the semicolon).

Or, you can make your grammar more concise as follows:

grammar G3;

p : e SEMI;

e : INT '+' e| INT;

INT :'0'..'9'+;

SEMI : ';';

2.3 Deal with whitespace

When you test grammar G3 above with the input of 2 + 3; (see, there are some
spaces occurring inside the expression), you still get the parse tree, but it does not look
nice. The spaces are inserted into the generate nodes, and they do not make sense.

It urges us to modify the grammar as follows:

grammar G4;

p : e SEMI;

e : INT '+' e| INT;

INT :'0'..'9'+;

SEMI : ';';

WS : (' ')+ {$channel=HIDDEN;};

In grammar G4, we define token WS to deal with spaces. In particular, this token is
associated with some actions. The actions, put in the curly brackets, are
$channel=HIDDEN. It means that token WS will be put into a hidden channel. In other
words, this token will always be ignored by the parser.

Test your grammar again with the input of 2 + 3; and observe the difference in the
generated tree.

2.4 Left-recursion elimination

In grammar G4, the operator ‘+’ is of left-associativity. One may want to make it of right-
associativity. In the first place, things seem quite simple. Just modify your grammar as
follows:

grammar G5;

p : e SEMI;

e : e '+' INT| INT;

INT :'0'..'9'+;

SEMI : ';';
WS : (' ')+ {$channel=HIDDEN;};

That is right, theoretically. However, it does not work on ANTLRWorks. You can try and
see.

The reason is due to the production: e e ‘+’ INT. Here we have something called left-
recursion: the symbol on the LHS is also the first symbol on the RHS.

Some parsing algorithms can work well with left-recursion, but it is not the case of
ANTLRWorks. We must eliminate this left-recursion using the following transformation:

If we have left-recursion productions as follows:

X X| 

The left-recursion can be eliminated by applying transformation on the productions as

follows:

X Y

Y Y | ε

Applying the above transformation on the case of e e ‘+’ INT | INT (where X is e,  is
‘+’ INT and  is INT, we have a new grammar as follows;

grammar G6;

p : e SEMI;

e : INT t;

t : '+' INT t|;

INT :'0'..'9'+;

SEMI : ';';

WS : (' ')+ {$channel=HIDDEN;};

Note that in grammar G6, the associativity of ‘+’is still left. It is quite hard to obverse this
on the tree, but in the upcoming Lab 3 we will verify it.
3. CLASS EXERCISES

3.1.

a) Modify grammar G3 such that it can generate multiple expressions separated by

semicolons; the operators accepted are ‘+’ and ‘-‘.

For example:

3+4;4+5-6;23-12+47;

b) Modify grammar in written 3.1 such that the operator ‘+’ takes higher precedence than
that of ‘-‘. The grammar can also now be able to ignore some whitespace characters like
space, tab and new line1. In addition, the grammar also allows users to use parentheses in
the expression.

3.2 Eliminate left-recursion in the following grammars and test the transformed
grammars in ANTLRWorks

e  t ‘+’ e |t

t  t ‘*’ A | A

A  ‘0’..’9’+

b2)

e  e ‘+’ t | e ‘-’ t | t

t  t ‘*’ A | A

A  ‘0’..’9’+

4. SUBMISSIONS

Students are required to submit in writing to the tutor-in-charge the following materials:

1
One can use ‘\t’ and ‘\n’ to represent tab and new line character in ANTLRWorks.
2
For Honor program (KSTN program) only
4.1 The grammars for Exercise 3.1 and 3.2

4.2 The trees generated by ANTLRWorks with the following (transformed) grammars
and inputs

Grammar input

3.2a 3+4*5*6

3.2b 3+4*5-6

Chapter2 2015
No ratings yet
Chapter2 2015
47 pages
Lab2 PPL
No ratings yet
Lab2 PPL
8 pages
Lab 3
No ratings yet
Lab 3
8 pages
Nguyễn Đức Phi Hồng ITITIU 17022- Principles of Programming Languages- Lab3
No ratings yet
Nguyễn Đức Phi Hồng ITITIU 17022- Principles of Programming Languages- Lab3
12 pages
Parsing, Lexical Analysis, and Tools: William Cook
No ratings yet
Parsing, Lexical Analysis, and Tools: William Cook
16 pages
Antlr 4 Guide To Help U Learn
No ratings yet
Antlr 4 Guide To Help U Learn
37 pages
Antlr PDF
0% (1)
Antlr PDF
37 pages
Unit-2 2.1. Review of CFG Ambiguity of Grammars 2.1.1. Limitations of Regular Language
No ratings yet
Unit-2 2.1. Review of CFG Ambiguity of Grammars 2.1.1. Limitations of Regular Language
44 pages
Lab1 PPL
No ratings yet
Lab1 PPL
5 pages
4 - Top-Down
No ratings yet
4 - Top-Down
67 pages
Chapter 3 - Syntax Analyzer
No ratings yet
Chapter 3 - Syntax Analyzer
28 pages
Chapter-3-Syntax Analysis
No ratings yet
Chapter-3-Syntax Analysis
126 pages
Chapter 2 - Simple Syntax Directed Translator
No ratings yet
Chapter 2 - Simple Syntax Directed Translator
39 pages
Syntax Analyser
No ratings yet
Syntax Analyser
30 pages
Parsing - 1
No ratings yet
Parsing - 1
59 pages
CD - Ch.2
No ratings yet
CD - Ch.2
39 pages
Compiler Lab Manual
No ratings yet
Compiler Lab Manual
19 pages
Compiler Lab Manual
No ratings yet
Compiler Lab Manual
19 pages
Parsernotes in C
No ratings yet
Parsernotes in C
45 pages
Chapter 3 - Syntax Analyzer
No ratings yet
Chapter 3 - Syntax Analyzer
28 pages
Lecture 7-8 - Context-Free Grammars and Bottom-Up Parsing
No ratings yet
Lecture 7-8 - Context-Free Grammars and Bottom-Up Parsing
39 pages
Unit 2
No ratings yet
Unit 2
67 pages
cs212 Lect05 63 Inter
No ratings yet
cs212 Lect05 63 Inter
48 pages
COMP322: Assignment 1 - Winter 2010: Due at 11:59pm EST, 10 Feb 2010
No ratings yet
COMP322: Assignment 1 - Winter 2010: Due at 11:59pm EST, 10 Feb 2010
3 pages
4.parsing
No ratings yet
4.parsing
32 pages
Ch4a Modified
No ratings yet
Ch4a Modified
53 pages
Lecture 05 Parsing
No ratings yet
Lecture 05 Parsing
3 pages
Compiler Lab Antlr
No ratings yet
Compiler Lab Antlr
18 pages
Compilers - Week 3
No ratings yet
Compilers - Week 3
17 pages
CD Chapter 2
No ratings yet
CD Chapter 2
39 pages
Top Down Parsing
No ratings yet
Top Down Parsing
37 pages
CD Chapter-3
No ratings yet
CD Chapter-3
105 pages
CC 3
No ratings yet
CC 3
29 pages
03 Lexing Parsing
No ratings yet
03 Lexing Parsing
78 pages
Lecture 08 09 PDF
No ratings yet
Lecture 08 09 PDF
10 pages
Chapter 3
No ratings yet
Chapter 3
180 pages
Cs1622 Parsing Part2 Bun
No ratings yet
Cs1622 Parsing Part2 Bun
5 pages
4th - Syntax Analysis
No ratings yet
4th - Syntax Analysis
29 pages
Chapter4 ND - Edu Dthain
No ratings yet
Chapter4 ND - Edu Dthain
14 pages
Top-Down Parsing
No ratings yet
Top-Down Parsing
73 pages
LL 1
No ratings yet
LL 1
73 pages
Unit - Ii Topdown Parsing 1. Context-Free Grammars: Definition
No ratings yet
Unit - Ii Topdown Parsing 1. Context-Free Grammars: Definition
26 pages
CSE 4102 Syntax Analysis or Parsing
No ratings yet
CSE 4102 Syntax Analysis or Parsing
73 pages
KCA015 Unit2
No ratings yet
KCA015 Unit2
29 pages
2.2 - Syntax Analysis (Upto Top-down Parsing)
No ratings yet
2.2 - Syntax Analysis (Upto Top-down Parsing)
91 pages
Unit-II CD
No ratings yet
Unit-II CD
81 pages
Parsing ME Modified
No ratings yet
Parsing ME Modified
168 pages
04 Syntax Analysis
No ratings yet
04 Syntax Analysis
66 pages
Context Free Grammars
No ratings yet
Context Free Grammars
10 pages
Tekkom M4,5
No ratings yet
Tekkom M4,5
29 pages
Unit 2-Part A
No ratings yet
Unit 2-Part A
75 pages
Chapter4-1
No ratings yet
Chapter4-1
61 pages
CD Unit 2
No ratings yet
CD Unit 2
19 pages
Lexical and syntax analysis
No ratings yet
Lexical and syntax analysis
63 pages
[Week 4] Syntax Analysis (CFG)
No ratings yet
[Week 4] Syntax Analysis (CFG)
50 pages
Chiradza Lawine H190638e Assignment 2
No ratings yet
Chiradza Lawine H190638e Assignment 2
5 pages
Lec03 parserCFG
No ratings yet
Lec03 parserCFG
27 pages
Grammar and Parse Trees (Syntax) : What Makes A Good Programming Language?
100% (2)
Grammar and Parse Trees (Syntax) : What Makes A Good Programming Language?
50 pages
Syntax Analysis
No ratings yet
Syntax Analysis
73 pages
03 Syntaxanalysis 2 2012 2013
No ratings yet
03 Syntaxanalysis 2 2012 2013
83 pages
Learn Programming Using C#
From Everand
Learn Programming Using C#
Taurius Litvinavicius
No ratings yet
Grade 12 Data Management Notes
0% (1)
Grade 12 Data Management Notes
2 pages
A Simple Nonlinear Pendulum
No ratings yet
A Simple Nonlinear Pendulum
20 pages
Least Square Solution-Hsmallik
No ratings yet
Least Square Solution-Hsmallik
2 pages
CS 212 Embedded Systems: Data Representation
No ratings yet
CS 212 Embedded Systems: Data Representation
28 pages
Full Download Calculus and Linear Algebra: Fundamentals and Applications 1st Edition Aldo G. S. Ventre PDF DOCX
100% (4)
Full Download Calculus and Linear Algebra: Fundamentals and Applications 1st Edition Aldo G. S. Ventre PDF DOCX
37 pages
Feasible Generalized Least Squares For Panel Data With Cross-Sectional and Serial Correlations
No ratings yet
Feasible Generalized Least Squares For Panel Data With Cross-Sectional and Serial Correlations
18 pages
Math 4 April 29 30 2024
No ratings yet
Math 4 April 29 30 2024
9 pages
Maths Xth(2025)
No ratings yet
Maths Xth(2025)
7 pages
Quantum Fourier Analysis: Zhengwei Liu
No ratings yet
Quantum Fourier Analysis: Zhengwei Liu
38 pages
Sheet Conic Section B
No ratings yet
Sheet Conic Section B
117 pages
JC003ALP000EV
No ratings yet
JC003ALP000EV
24 pages
Percentages Answers
No ratings yet
Percentages Answers
1 page
Geometric Numbers
100% (1)
Geometric Numbers
4 pages
SeniorDegree Colleges (Undergraduates) Set A - 2020
No ratings yet
SeniorDegree Colleges (Undergraduates) Set A - 2020
19 pages
MITx SCX KeyConcept SC0x FV PDF
No ratings yet
MITx SCX KeyConcept SC0x FV PDF
61 pages
formulae of pns iit kgp
No ratings yet
formulae of pns iit kgp
2 pages
Math G10 1.13 Wk13-16
No ratings yet
Math G10 1.13 Wk13-16
5 pages
Adjustment Examples
No ratings yet
Adjustment Examples
254 pages
Complex Numbers Worksheet: Answers
No ratings yet
Complex Numbers Worksheet: Answers
2 pages
Identity of Trigonometry: Critical Book Report
No ratings yet
Identity of Trigonometry: Critical Book Report
12 pages
Mathex Year 8 2014 Plus Answers
No ratings yet
Mathex Year 8 2014 Plus Answers
3 pages
Mathematics s2 Unit 01
No ratings yet
Mathematics s2 Unit 01
98 pages
Legendrian Skein Hall Algebras and Hall Algebras
No ratings yet
Legendrian Skein Hall Algebras and Hall Algebras
54 pages
A Study of A Backorder EOQ Model For Cloud-Type Intuitionistic Dense Fuzzy Demand Rate
No ratings yet
A Study of A Backorder EOQ Model For Cloud-Type Intuitionistic Dense Fuzzy Demand Rate
12 pages
Multiplying Decimals
No ratings yet
Multiplying Decimals
2 pages
Adding Integers
No ratings yet
Adding Integers
19 pages
Bab Iv Hasil Dan Pembahasan 4.1 HASIL 4.1.1 Perhitungan Konsentrasi Larutan Baku
No ratings yet
Bab Iv Hasil Dan Pembahasan 4.1 HASIL 4.1.1 Perhitungan Konsentrasi Larutan Baku
8 pages
Math Tos Fourth Quarter
No ratings yet
Math Tos Fourth Quarter
2 pages

Lab Session 2 Arithmetic Expresion Grammar

Uploaded by

Lab Session 2 Arithmetic Expresion Grammar

Uploaded by

LAB SESSION 2

ARITHMETIC EXPRESION GRAMMAR

2.1 Your first grammar

e : INT '+' e| INT;

The language generated by this grammar is very simple: it is an addition of many

2.2 Revising your grammar

e : INT '+' e SEMI| INT SEMI;

Or, you can make your grammar more concise as follows:

e : INT '+' e| INT;

2.3 Deal with whitespace

It urges us to modify the grammar as follows:

e : INT '+' e| INT;

WS : (' ')+ {$channel=HIDDEN;};

2.4 Left-recursion elimination

e : e '+' INT| INT;

If we have left-recursion productions as follows:

The left-recursion can be eliminated by applying transformation on the productions as

t : '+' INT t|;

WS : (' ')+ {$channel=HIDDEN;};

a) Modify grammar G3 such that it can generate multiple expressions separated by

You might also like