This document discusses intermediate code generation and optimization in compilers. It describes how producing an intermediate representation facilitates retargeting a compiler to different machines and allows for machine-independent optimizations. Common intermediate representations include graphs, postfix notation, and three-address code. The document outlines various machine-independent optimizations that can improve the intermediate code, such as peephole, local, global, loop, and inter-procedural optimizations. It also discusses basic blocks and how they are constructed from three-address instructions.

Uploaded by

zemike
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
57 views

CH06

This document discusses intermediate code generation and optimization in compilers. It describes how producing an intermediate representation facilitates retargeting a compiler to different machines and allows for machine-independent optimizations. Common intermediate representations include graphs, postfix notation, and three-address code. The document outlines various machine-independent optimizations that can improve the intermediate code, such as peephole, local, global, loop, and inter-procedural optimizations. It also discusses basic blocks and how they are constructed from three-address instructions.

Uploaded by

zemike
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 28

CHAPTER SIX
Intermediate Code Generation and Optimization

Outline
 Introduction
 Intermediate-Code Generation
 Machine-Independent Optimizations
6.1 Introduction: Structure of a Compiler
6.2 Intermediate Code Generation

 Although a compiler can directly produce a target language (i.e. machine code or assembly of the target machine), producing a machine-independent intermediate representation has the following benefits:
 Retargeting to another machine is facilitated.
 The intermediate representation is neutral with respect to the target machine, so the same intermediate code generator can be shared for all target machines.
 A compiler for a new machine can be built by attaching a new code generator to an existing front end.
 Machine-independent code optimization can be applied to the intermediate code.
Compiling Process without Intermediate Representation

[Figure: each source language (C, Pascal, FORTRAN, C++) needs its own translator for each target machine (SPARC, HP PA, x86, IBM PPC), i.e. one compiler per language/machine pair]

Compiling Process with Intermediate Representation

[Figure: each source language is translated to a common IR, and the IR is translated to each target machine, so front ends and back ends can be combined freely]
Methods of Intermediate Code (IC) Generation

 The intermediate language can be one of many different languages; the designer of the compiler decides which. Common IRs:
 Graphical representation: such as syntax trees, ASTs (Abstract Syntax Trees), and DAGs
 Postfix notation: the abstract syntax tree is linearized as a sequence of data references and operations.
 For instance, the tree for a * (9 + d) can be mapped to the equivalent postfix notation: a9d+*
 Three-address code: all operations are represented as a 4-part list, a quadruple:
 (op, arg1, arg2, result). E.g., x := y + z -> (+, y, z, x)
Directed Acyclic Graph (DAG) Representation

 Example: F = ((A+B*C) * (A*B*C)) + C

[Figure: the syntax tree and the DAG for this expression; in the DAG the common subexpression B*C appears as a single shared node]

A syntax tree depicts the natural hierarchical structure of a source program. A DAG gives the same information but in a more compact way, because common subexpressions are identified.
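The sharing idea can be sketched with a small value-numbering table: a node is reused whenever an identical (op, left, right) triple has already been built. A minimal illustrative sketch (class and method names are my own, not from the text; following the slide's DAG, B*C is treated as the shared subterm):

```python
class DAG:
    """Build a DAG by reusing nodes with identical (op, left, right) keys."""
    def __init__(self):
        self.ids = {}      # (op, left_id, right_id) -> node id
        self.count = 0     # number of distinct nodes created

    def node(self, op, left=None, right=None):
        key = (op, left, right)
        if key not in self.ids:        # create only if not seen before
            self.ids[key] = self.count
            self.count += 1
        return self.ids[key]

# F = ((A + B*C) * (A * (B*C))) + C
d = DAG()
A, B, C = d.node("A"), d.node("B"), d.node("C")
bc = d.node("*", B, C)
assert d.node("*", B, C) == bc        # the second B*C is shared, not rebuilt
F = d.node("+", d.node("*", d.node("+", A, bc), d.node("*", A, bc)), C)
print(d.count)                        # 8 distinct nodes for the whole expression
```

The syntax tree for the same expression would have ten nodes; the DAG has eight because B*C (and its leaves) exist only once.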
Postfix Notation: PN

 A mathematical notation in which every operator follows all of its operands.
 Equivalently, a listing of the nodes of a tree in which each node appears immediately after its children.
Example: the PN of the expression a * (b+c) is abc+*
How about (a+b)/(c-d)? (It is ab+cd-/.)
 Formation rules:
 If E is a variable/constant, the PN of E is E itself.
 If E is an expression of the form E1 op E2, the PN of E is E1′ E2′ op, where E1′ and E2′ are the PN of E1 and E2, respectively.
 If E is a parenthesized expression of the form (E1), the PN of E is the same as the PN of E1.
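The three formation rules translate directly into a short recursive function. A minimal sketch, assuming an AST encoded as nested tuples where a leaf is a string and an operation is (op, left, right):

```python
def postfix(e):
    if isinstance(e, str):            # rule 1: a variable/constant is its own PN
        return e
    op, e1, e2 = e                    # rule 2: PN(E1 op E2) = PN(E1) PN(E2) op
    return postfix(e1) + postfix(e2) + op

# a * (b + c)  ->  abc+*   (the parentheses disappear: rule 3)
print(postfix(("*", "a", ("+", "b", "c"))))        # abc+*
# (a + b) / (c - d)  ->  ab+cd-/
print(postfix(("/", ("+", "a", "b"), ("-", "c", "d"))))   # ab+cd-/
```

Rule 3 needs no code of its own: parentheses only shape the tree during parsing and leave no node behind.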
Three Address Code
 The general form: x = y op z
 x, y, and z are names, constants, or compiler-generated temporaries
 op stands for any operator, such as +, -, ...
 We use the term "three-address code" because each statement usually contains three addresses (two for the operands, one for the result).
 A popular form of intermediate code used in optimizing compilers is three-address statements.
 It is a linearized representation of a syntax tree with explicit names given to interior nodes.
 There is only one operator on the right-hand side. Thus a source-language expression like a + b * c might be translated into a sequence with temporaries t1 and t2:
t1 = b * c
t2 = a + t1
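Generating such a sequence is a post-order walk of the expression tree: emit a statement for each interior node, naming it with a fresh temporary. An illustrative sketch (my own naming, same tuple-encoded AST as before):

```python
import itertools

def gen_tac(expr):
    """Emit one three-address statement per interior node of the AST."""
    code, counter = [], itertools.count(1)
    def walk(e):
        if isinstance(e, str):              # leaf: already an address
            return e
        op, l, r = e
        x, y = walk(l), walk(r)             # children first (post-order)
        t = f"t{next(counter)}"             # fresh temporary for this node
        code.append(f"{t} = {x} {op} {y}")
        return t
    walk(expr)
    return code

print(gen_tac(("+", "a", ("*", "b", "c"))))   # ['t1 = b * c', 't2 = a + t1']
```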
DAG vs. Three Address Code
 Three address code is a linearized representation of
a syntax tree (or a DAG) in which explicit names
(temporaries) correspond to the interior nodes of the
graph.
Expression: F = ((A+B*C) * (A*B*C)) + C

From the syntax tree:      From the DAG:
T1 := A                    T1 := B * C
T2 := C                    T2 := A + T1
T3 := B * T2               T3 := A * T1
T4 := T1 + T3              T4 := T2 * T3
T5 := T1 * T3              T5 := C
T6 := T4 * T5              T6 := T4 + T5
T7 := T6 + T2              F := T6
F := T7

Question: which IR code sequence is better? (The DAG sequence is one instruction shorter, because the shared subexpression is computed only once.)


Implementation of Three Address Code

• Quadruples
 Four fields: op, arg1, arg2, result
 Array of struct {op, *arg1, *arg2, *result}
 x := y op z is represented as (op, y, z, x)
 arg1, arg2, and result are usually pointers to symbol-table entries.
 May need to use many temporary names.
 Many assembly instructions are like quadruples, but arg1, arg2, and result are real registers.
• Triples
 Three fields: op, arg1, and arg2; the result becomes implicit, referred to by the triple's position.
 arg1 and arg2 can be pointers to the symbol table or references to earlier triples.
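A hedged sketch of the two encodings for the sequence t1 = b * c; t2 = a + t1 (field names are illustrative, not from the text):

```python
from collections import namedtuple

Quad = namedtuple("Quad", "op arg1 arg2 result")

# Quadruples: the result of each instruction has an explicit name.
quads = [
    Quad("*", "b", "c", "t1"),
    Quad("+", "a", "t1", "t2"),   # arg2 refers to the named temporary t1
]

# Triples: the result field is dropped; an instruction's value is
# referred to by its index, written here as a one-element tuple (i,).
triples = [
    ("*", "b", "c"),              # (0)
    ("+", "a", (0,)),             # (1): arg2 refers to triple (0)
]

assert quads[1].arg2 == quads[0].result   # quads link through temporaries
assert triples[1][2] == (0,)              # triples link through positions
```

The positional linking is why triples are harder to reorder during optimization: moving a triple changes the indices that other triples refer to, whereas quadruples can be moved freely because the temporaries keep their names.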
Types of Three-Address Statements

 Assignment statements:
 x := y op z, where op is a binary operator (e.g., add a,b,c)
 x := op y, where op is a unary operator (e.g., not a,,c or inttoreal a,,c)
 Copy statements:
 x := y (e.g., mov a,,c)
 Unconditional jumps:
 goto L (e.g., jump ,,L1)
 Conditional jumps:
 if x relop y goto L (e.g., jmprelop x,y,L)
 param x, call p,n, and return y, relating to procedure calls
Eg: f(x+1, y)  add x,1,t1
 param t1,,
 param y,,
 call f,2,
 Indexed assignments:
 x := y[i]
 x[i] := y
 Address and pointer assignments:
 x := &y, x := *y, and *x := y
6.3 Code Optimization:
Summary of Front End

[Figure: front-end pipeline — Lexical Analyzer (Scanner) + Syntax Analyzer (Parser) + Semantic Analyzer produce an Abstract Syntax Tree with attributes; the Intermediate-code Generator then emits non-optimized intermediate code; error messages may be produced at each phase]
Code Optimization

• The machine-independent code-optimization phase attempts to improve the intermediate code so that better target code will result.
• Usually "better" means faster, but other objectives may be desired, such as shorter code or target code that consumes less power.
• A simple intermediate-code generation algorithm followed by code optimization is a reasonable way to generate good target code.
How the Compiler Improves Performance
• Execution time = operation count × machine cycles per operation
• Minimize the number of operations
• Arithmetic operations, memory accesses
• Replace expensive operations with simpler ones
• E.g., replace a 4-cycle multiplication with a 1-cycle shift
• Minimize cache misses
• Both data and instruction accesses
• Perform work in parallel
• Instruction scheduling within a thread
• Parallel execution across multiple threads
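The "expensive op → simpler op" point (strength reduction) can be shown directly: multiplying by a power of two is equivalent to a left shift. A minimal sketch:

```python
def strength_reduce_mul(x, k):
    """Compute x * 2**k with a shift instead of a multiply."""
    return x << k

# x * 4 is the same as x << 2 for any integer x >= 0
for x in (0, 1, 7, 123):
    assert strength_reduce_mul(x, 2) == x * 4
```

A compiler applies this when one operand is a known constant power of two; the shift typically costs fewer machine cycles than a general multiply.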
Code Optimization

• There is great variation in the amount of code optimization different compilers perform.
• Those that do the most, the so-called "optimizing compilers," spend significant time in this phase.
• There is a trade-off between compilation time and degree of optimization.
Why use optimization:
• There are simple optimizations that significantly improve the running time of the target program without slowing down compilation too much.
Types of Optimization

• Peephole
• Local
• Global
• Loop
• Inter-procedural, whole-program or link-time
• Machine code
• ….
Basic Blocks

 Basic blocks are maximal sequences of consecutive three-address instructions such that:
 The flow of control can only enter the basic block through the first instruction in the block (no jumps into the middle of the block).
 Control leaves the block without halting or branching, except possibly at the last instruction in the block.
 The basic blocks become the nodes of a flow graph, whose edges indicate which blocks can follow which other blocks.
Construction of Basic Blocks
 Input: A sequence of three-address instructions
 Output: A list of the basic blocks for that sequence in
which each instruction is assigned to exactly one basic
block
 Method: Determine the instructions in the intermediate code that are leaders.
 The rules for finding leaders are:
 (1) The first three-address instruction in the intermediate code is a leader.
 (2) Any instruction that is the target of a conditional or unconditional jump is a leader.
 (3) Any instruction that immediately follows a conditional or unconditional jump is a leader.
Construction: Partitioning Three-Address Instructions into Basic Blocks
Example(1):
1. i = 1
2. j = 1
3. t1 = 10 * i
4. t2 = t1 + j
5. j = j + 1
6. if j <= 10 goto (3)
7. i = i + 1
8. if i <= 10 goto (2)
9. i = 1
10. t3 = i - 1
11. if i <= 10 goto (10)
 First, instruction 1 is a leader by rule (1).
 Jumps occur at instructions 6, 8, and 11. By rule (2), the targets of these jumps (instructions 3, 2, and 10, respectively) are leaders.
 By rule (3), each instruction following a jump is a leader: instructions 7 and 9.
 The leaders are therefore instructions 1, 2, 3, 7, 9, and 10. The basic block of each leader contains all the instructions from the leader until just before the next leader.
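The three leader rules, applied to the 11-instruction example, can be sketched as a small Python pass (the (text, jump_target) encoding is my own; targets are 1-based, matching the slide):

```python
def leaders(instrs):
    """instrs: list of (text, jump_target) pairs; target is 1-based or None."""
    lead = {1}                                # rule (1): first instruction
    for i, (_, target) in enumerate(instrs, start=1):
        if target is not None:
            lead.add(target)                  # rule (2): jump target
            if i < len(instrs):
                lead.add(i + 1)               # rule (3): instruction after a jump
    return sorted(lead)

program = [
    ("i = 1", None), ("j = 1", None), ("t1 = 10 * i", None),
    ("t2 = t1 + j", None), ("j = j + 1", None), ("if j <= 10 goto (3)", 3),
    ("i = i + 1", None), ("if i <= 10 goto (2)", 2), ("i = 1", None),
    ("t3 = i - 1", None), ("if i <= 10 goto (10)", 10),
]
# Each block runs from a leader to just before the next leader.
cuts = leaders(program) + [len(program) + 1]
blocks = [list(range(a, b)) for a, b in zip(cuts, cuts[1:])]
print(leaders(program))   # [1, 2, 3, 7, 9, 10]
print(blocks)             # [[1], [2], [3, 4, 5, 6], [7, 8], [9], [10, 11]]
```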
Flow Graphs
 Flow Graph is the representation of control flow between
basic blocks. The nodes of the flow graph are the basic blocks.
 There is an edge from block B to block C if and only if it is possible for the first instruction in block C to immediately follow the last instruction in block B. There are two ways such an edge can be justified:
 1. There is a conditional or unconditional jump from the end of B to the beginning of C.
 2. C immediately follows B in the original order of the three-address instructions, and B does not end in an unconditional jump.
 B is a predecessor of C, and C is a successor of B.
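The two edge rules can be sketched over the partitioned blocks of Example(1). The (name, first, last) encoding and `jumps` map are my own illustrative choices:

```python
def flow_edges(blocks, jumps, unconditional=()):
    """blocks: list of (name, first_instr, last_instr);
    jumps: {last_instr: target_instr} for blocks ending in a jump;
    unconditional: names of blocks ending in an unconditional jump."""
    first_to_block = {first: name for name, first, _ in blocks}
    edges = set()
    for i, (name, _, last) in enumerate(blocks):
        if last in jumps:                               # rule 1: jump edge
            edges.add((name, first_to_block[jumps[last]]))
        if name not in unconditional and i + 1 < len(blocks):
            edges.add((name, blocks[i + 1][0]))         # rule 2: fall-through
    return edges

blocks = [("B1", 1, 1), ("B2", 2, 2), ("B3", 3, 6),
          ("B4", 7, 8), ("B5", 9, 9), ("B6", 10, 11)]
jumps = {6: 3, 8: 2, 11: 10}      # the conditional jumps of Example(1)
edges = flow_edges(blocks, jumps)
print(sorted(edges))
```

All three jumps here are conditional, so every block (except the last) also gets a fall-through edge; B3 and B6 acquire self-loops because each jumps back to its own leader.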
Flow Graphs: Example
Flow graph of the program in Example(1). The block led by the first statement of the program is the start, or entry, node.

B1: i = 1
B2: j = 1
B3: t1 = 10 * i
    t2 = t1 + j
    j = j + 1
    if j <= 10 goto B3
B4: i = i + 1
    if i <= 10 goto B2
B5: i = 1
B6: t3 = i - 1
    if i <= 10 goto B6

[Figure: Entry → B1 → B2 → B3; B3 → B3 and B3 → B4; B4 → B2 and B4 → B5; B5 → B6; B6 → B6 and B6 → Exit]
Representation of Basic Blocks

• Each basic block is represented by a record consisting of
 – a count of the number of statements
 – a pointer to the leader
 – a list of predecessors
 – a list of successors
Peephole Optimization
• Improve the performance of the target program by examining and transforming a short sequence of target instructions
• Effectiveness depends on the window size
• May need repeated passes over the code
Examples:
• Redundant loads and stores
 MOV R0, a
 MOV a, R0 (the second instruction is redundant and can be deleted)
• Algebraic simplification
 x := x + 0
 x := x * 1 (both statements can be eliminated)
• Constant folding
 x := 2 + 3  x := 5
 y := x + 3  y := 8
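The constant-folding example above can be sketched as a tiny pass over (dest, left, op, right) statements; the tuple encoding is my own, and only integer + and * are handled:

```python
def fold(stmts):
    """Fold statements whose operands are (or have become) known constants."""
    consts, out = {}, []
    for dest, l, op, r in stmts:
        l = consts.get(l, l)                  # propagate known constant values
        r = consts.get(r, r)
        if isinstance(l, int) and isinstance(r, int):
            val = l + r if op == "+" else l * r
            consts[dest] = val                # remember dest's constant value
            out.append((dest, val))           # emit: dest := val
        else:
            out.append((dest, l, op, r))
    return out

print(fold([("x", 2, "+", 3), ("y", "x", "+", 3)]))
# [('x', 5), ('y', 8)]  -- matches the slide: x := 5, y := 8
```

Note that folding y required propagating x's value first; a real peephole pass would also invalidate `consts[x]` if x were reassigned a non-constant.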
Local Optimizations
 Analysis and transformation performed within a basic block
 No control-flow information is considered
 Examples of local optimizations:
 Local common-subexpression elimination
  analysis: the same expression is evaluated more than once
  transformation: replace with a single calculation
 Local constant folding or elimination
  analysis: the expression can be evaluated at compile time
  transformation: replace by the constant, compile-time value
 Dead code elimination
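Local common-subexpression elimination can be sketched with the same tuple encoding used earlier: remember each (op, arg1, arg2) seen in the block and replace a repeat with a copy of the earlier result. This sketch assumes the operands are not redefined between the two occurrences:

```python
def local_cse(stmts):
    """Within one basic block, reuse the result of an identical earlier expression."""
    seen, out = {}, []
    for dest, l, op, r in stmts:
        key = (op, l, r)
        if key in seen:
            out.append((dest, seen[key]))     # copy statement: dest := earlier temp
        else:
            seen[key] = dest
            out.append((dest, l, op, r))
    return out

block = [("t1", "b", "*", "c"),
         ("t2", "a", "+", "t1"),
         ("t3", "b", "*", "c"),      # same expression as t1
         ("t4", "t2", "*", "t3")]
print(local_cse(block))   # t3 becomes a copy: ('t3', 't1')
```

A follow-up copy-propagation pass would then rewrite t4's use of t3 as t1, often leaving the copy dead and removable.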

Global Optimizations: Intraprocedural
 Global versions of local optimizations
 Global common-subexpression elimination
 Global constant propagation
 Dead code elimination
 Loop optimizations
 Reduce the code to be executed in each iteration
Examples
• Unreachable code
 #define debug 0
 if (debug) (print debugging information)
 In the intermediate code, with debug replaced by the constant 0:
 if 0 <> 1 goto L1
 print debugging information
 L1:
 Since 0 <> 1 is always true, this simplifies to:
 if 1 goto L1
 print debugging information
 L1:
 The print statement is now unreachable and can be eliminated.
Examples
• Flow-of-control optimization: eliminate jumps to jumps.

 Before:                After:
 goto L1                goto L2
 …                      …
 L1: goto L2            L1: goto L2

 Before:                After:
 goto L1                if a < b goto L2
 …                      …
 L1: if a < b goto L2

 (If nothing else jumps to L1, its statement becomes dead code and can be removed.)
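The first transformation amounts to following each chain of gotos to its final label. A minimal sketch (the `goto_of` map is my own encoding of "label L's block consists only of goto ..."):

```python
def final_target(label, goto_of):
    """Follow a chain of unconditional jumps to its last label.

    goto_of maps a label to the label its statement immediately jumps to;
    the seen-set guards against goto cycles in malformed code."""
    seen = set()
    while label in goto_of and label not in seen:
        seen.add(label)
        label = goto_of[label]
    return label

goto_of = {"L1": "L2"}                        # L1: goto L2
print(final_target("L1", goto_of))            # goto L1 is rewritten as goto L2
print(final_target("L3", goto_of))            # labels not in a chain are unchanged
```

Each goto (and each conditional jump's target) is rewritten to its final target; the intermediate `L1: goto L2` statements then often become unreachable.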