Computing Inside The Parser Syntax-Directed Translation, II. Comp 412 COMP 412 FALL Chapter 4 in EaC2e. source code. IR IR target.

Similar documents
Computing Inside The Parser Syntax-Directed Translation, II. Comp 412

Computing Inside The Parser Syntax-Directed Translation. Comp 412 COMP 412 FALL Chapter 4 in EaC2e. source code. IR IR target.

Computing Inside The Parser Syntax-Directed Translation. Comp 412

Generating Code for Assignment Statements back to work. Comp 412 COMP 412 FALL Chapters 4, 6 & 7 in EaC2e. source code. IR IR target.

Handling Assignment Comp 412

Implementing Control Flow Constructs Comp 412

Building a Parser Part III

Syntax Analysis, VI Examples from LR Parsing. Comp 412

Intermediate Representations

Context-sensitive Analysis Part IV Ad-hoc syntax-directed translation, Symbol Tables, andtypes

Parsing III. CS434 Lecture 8 Spring 2005 Department of Computer Science University of Alabama Joel Jones

Introduction to Parsing. Comp 412

Instruction Selection: Preliminaries. Comp 412

Syntax Analysis, V Bottom-up Parsing & The Magic of Handles Comp 412

Syntax Analysis, III Comp 412

Syntax Analysis, III Comp 412

Code Shape Comp 412 COMP 412 FALL Chapters 4, 5, 6 & 7 in EaC2e. source code. IR IR target. code. Front End Optimizer Back End

CS415 Compilers. Intermediate Represeation & Code Generation

Runtime Support for Algol-Like Languages Comp 412

Parsing II Top-down parsing. Comp 412

The Software Stack: From Assembly Language to Machine Code

Code Shape II Expressions & Assignment

Parsing. Note by Baris Aktemur: Our slides are adapted from Cooper and Torczon s slides that they prepared for COMP 412 at Rice.

Class Information ANNOUCEMENTS

Alternatives for semantic processing

Instruction Selection, II Tree-pattern matching

Local Register Allocation (critical content for Lab 2) Comp 412

CSCI312 Principles of Programming Languages

Procedure and Function Calls, Part II. Comp 412 COMP 412 FALL Chapter 6 in EaC2e. target code. source code Front End Optimizer Back End

CS 406/534 Compiler Construction Putting It All Together

Bottom-up Parser. Jungsik Choi

Lab 3, Tutorial 1 Comp 412

Local Optimization: Value Numbering The Desert Island Optimization. Comp 412 COMP 412 FALL Chapter 8 in EaC2e. target code

Syntax Analysis, VII One more LR(1) example, plus some more stuff. Comp 412 COMP 412 FALL Chapter 3 in EaC2e. target code.

Parsing II Top-down parsing. Comp 412

Compiler Design. Code Shaping. Hwansoo Han

Intermediate Representations

Arrays and Functions

Instruction Selection: Peephole Matching. Copyright 2003, Keith D. Cooper, Ken Kennedy & Linda Torczon, all rights reserved.

Copyright 2003, Keith D. Cooper, Ken Kennedy & Linda Torczon, all rights reserved. Students enrolled in Comp 412 at Rice University have explicit

Parsing III. (Top-down parsing: recursive descent & LL(1) )

Lecture 12: Compiler Backend

Lexical Analysis - An Introduction. Lecture 4 Spring 2005 Department of Computer Science University of Alabama Joel Jones

Register Allocation. Note by Baris Aktemur: Our slides are adapted from Cooper and Torczon s slides that they prepared for COMP 412 at Rice.

Lexical Analysis. Note by Baris Aktemur: Our slides are adapted from Cooper and Torczon s slides that they prepared for COMP 412 at Rice.

Runtime Support for OOLs Object Records, Code Vectors, Inheritance Comp 412

CS415 Compilers. Lexical Analysis

CS 406/534 Compiler Construction Parsing Part I

Code Shape Comp 412 COMP 412 FALL Chapters 4, 5, 6 & 7 in EaC2e. source code. IR IR target. code. Front End Optimizer Back End

Syntax-Directed Translation. Lecture 14

Compiler Optimisation

Naming in OOLs and Storage Layout Comp 412

Syntactic Analysis. Top-Down Parsing

Combining Optimizations: Sparse Conditional Constant Propagation

CS 406/534 Compiler Construction Parsing Part II LL(1) and LR(1) Parsing

Syntactic Directed Translation

Intermediate Representations

Introduction to Optimization, Instruction Selection and Scheduling, and Register Allocation

Parsing Part II (Top-down parsing, left-recursion removal)

CS415 Compilers Overview of the Course. These slides are based on slides copyrighted by Keith Cooper, Ken Kennedy & Linda Torczon at Rice University

opt. front end produce an intermediate representation optimizer transforms the code in IR form into an back end transforms the code in IR form into

CS415 Compilers. Syntax Analysis. These slides are based on slides copyrighted by Keith Cooper, Ken Kennedy & Linda Torczon at Rice University

A clarification on terminology: Recognizer: accepts or rejects strings in a language. Parser: recognizes and generates parse trees (imminent topic)

Introduction to Compiler

Review main idea syntax-directed evaluation and translation. Recall syntax-directed interpretation in recursive descent parsers

CS 432 Fall Mike Lam, Professor. Code Generation

CS 406/534 Compiler Construction Instruction Scheduling beyond Basic Block and Code Generation

COMP 181. Prelude. Prelude. Summary of parsing. A Hierarchy of Grammar Classes. More power? Syntax-directed translation. Analysis

Optimizer. Defining and preserving the meaning of the program

CS 314 Principles of Programming Languages. Lecture 9

CS 314 Principles of Programming Languages Fall 2017 A Compiler and Optimizer for tinyl Due date: Monday, October 23, 11:59pm

Topic 6 Basic Back-End Optimization

Compilers. Intermediate representations and code generation. Yannis Smaragdakis, U. Athens (original slides by Sam

Lexical Analysis. Introduction

CS 406/534 Compiler Construction Instruction Selection and Global Register Allocation

CS 11 Ocaml track: lecture 6

RYERSON POLYTECHNIC UNIVERSITY DEPARTMENT OF MATH, PHYSICS, AND COMPUTER SCIENCE CPS 710 FINAL EXAM FALL 96 INSTRUCTIONS

Intermediate Code Generation

CS 406/534 Compiler Construction Intermediate Representation and Procedure Abstraction

CS415 Compilers Register Allocation. These slides are based on slides copyrighted by Keith Cooper, Ken Kennedy & Linda Torczon at Rice University

IR Generation. May 13, Monday, May 13, 13

COMP 181. Prelude. Intermediate representations. Today. High-level IR. Types of IRs. Intermediate representations and code generation

The View from 35,000 Feet

Agenda. CSE P 501 Compilers. Big Picture. Compiler Organization. Intermediate Representations. IR for Code Generation. CSE P 501 Au05 N-1

Introduction to Syntax Directed Translation and Top-Down Parsers

Semantic actions for declarations and expressions

DEMO A Language for Practice Implementation Comp 506, Spring 2018

APT Session 7: Compilers

Semantic actions for declarations and expressions. Monday, September 28, 15

Syntax-directed translation. Context-sensitive analysis. What context-sensitive questions might the compiler ask?

Lecture Compiler Middle-End

Peace cannot be kept by force; it can only be achieved by understanding. Albert Einstein

Three-Address Code IR

Intermediate Code Generation (ICG)

Semantic actions for declarations and expressions

Parsing Part II. (Ambiguity, Top-down parsing, Left-recursion Removal)

Introduction to Parsing

10/4/18. Lexical and Syntactic Analysis. Lexical and Syntax Analysis. Tokenizing Source. Scanner. Reasons to Separate Lexical and Syntactic Analysis

CSE P 501 Compilers. Intermediate Representations Hal Perkins Spring UW CSE P 501 Spring 2018 G-1

Question Points Score

Transcription:

COMP 412 FALL 20167 Computing Inside The Parser Syntax-Directed Translation, II Comp 412 source code IR IR target Front End Optimizer Back End code Copyright 2017, Keith D. Cooper & Linda Torczon, all rights reserved. Students enrolled in Comp 412 at Rice University have explicit permission to make copies of these materials for their personal use. Faculty from other educational institutions may use these materials for nonprofit educational purposes, provided this copyright notice is preserved. Chapter 4 in EaC2e

Midterm Exam When? Where? Wednesday, October 18, 2017 in Keck 100 Two hour, closed book, closed notes, closed devices exam I had said 7PM. Will 7:30PM work? You are responsible for material from: Lectures from start of classes through today Chapters 1 through 5 in EaC2e Excluding attribute grammar material ( 4.3) COMP 412, Fall 2017 1

Review Example Computing the value of an unsigned integer Consider the simple grammar 1 Number digit DigitList 2 DigitList digit DigitList 3 epsilon One obvious use of the grammar is to convert an ASCII string of the number to its integer value Build computation into parser An easy intro to syntax-directed translation pair (boolean, int) Number( value ) { if (word = digit) then { value = ValueOf( digit ); word = NextWord( ); return DigitList( value ); } else return (false, invalid value); } pair (boolean, int) DigitList( value ) { if (word = digit) then { value = value * 10 + ValueOf( digit ); word = NextWord( ); return DigitList( value ); } else return (true, value); } SDT in a top-down, recursive-descent parser COMP 412, Fall 2017 2

Review SDT in a Bottom-Up Parser (e.g., LR(1)) We specify SDT actions relative to the syntax 1 Number DigitList { value(lhs) = value(dl) ; } 2 DigitList DigitList digit { value(lhs) = value(dl) * 10 + value(digit) } 3 digit { value(lhs) = value(digit); } The compiler writer can attach actions to productions and have those actions execute each time the parser reduces by that production Simple mechanism that ties computation to syntax Needs storage associated with each symbol in each production Needs a scheme to name that storage COMP 412, Fall 2017 3

Review SDT in Bison or Yacc In Bison or Yacc, we specify SDT actions using a simple notation 1 Number DigitList { $$ = $1; } 2 DigitList DigitList digit { $$ = $1 * 10 + $2; } 3 digit { $$ = $1; } Scanner provides the value of digit. The compiler writer provides production-specific code snippets that execute when the parser reduces by that production Positional notation for the value associated with a symbol $$ is the LHS; $1 is the first symbol in the RHS; $2 the second, Compiler writer can put arbitrary code in these snippets Solve a travelling salesman problem, compute PI to 100 digits, More importantly, they can compute on the lexemes of grammar symbols and on information derived and stored earlier in translation How does this fit into the LR(1) skeleton parser? COMP 412, Fall 2017 4

Fitting AHSDT into the LR(1) Skeleton Parser stack.push(invalid); stack.push(s 0 ); // initial state word = scanner.next_word(); loop forever { s = stack.top(); if ( ACTION[s,word] == reduce A b ) then { stack.popnum(2* b ); // pop 2* b symbols s = stack.top(); stack.push(a); // push LHS, A stack.push(goto[s,a]); // push next state } else if ( ACTION[s,word] == shift s i ) then { stack.push(word); stack.push(s i ); word scanner.next_word(); } else if ( ACTION[s,word] == accept & word == EOF ) then break; else throw a syntax error; } report success; Actions are taken on reductions Insert a call to Work() before the call to stack.popnum() Work() contains a case statement that switches on the production number Code in Work() can read items from the stack That is why it calls Work() before stack.popnum() COMP 412, Fall 2017 5

Fitting AHSDT into the LR(1) Skeleton Parser stack.push(invalid); stack.push(s 0 ); // initial state word = scanner.next_word(); loop forever { s = stack.top(); if ( ACTION[s,word] == reduce A b ) then { stack.popnum(2* b ); // pop 2* b symbols s = stack.top(); stack.push(a); // push LHS, A stack.push(goto[s,a]); // push next state } else if ( ACTION[s,word] == shift s i ) then { stack.push(word); stack.push(s i ); word scanner.next_word(); } else if ( ACTION[s,word] == accept & word == EOF ) then break; else throw a syntax error; } report success; Passing values between actions Tie values to instances of grammar symbols Equivalent to parse tree nodes We can pass values on the stack Push / pop 3 rather than 2 Work() takes the stack as input (conceptually) and returns the value for the reduction it processes Shift creates initial values COMP 412, Fall 2017 6

stack.push(invalid); stack.push(s 0 ); // initial state word = scanner.next_word(); loop forever { s = stack.top(); if ( ACTION[s,word] == reduce A b ) then { r = Work(stack, A b ) stack.popnum(3* b ); // pop 3* b symbols s = stack.top(); // save exposed state stack.push(a); // push A stack.push (r); // push result of WORK() stack.push(goto[s,a]); // push next state } else if ( ACTION[s,word] == shift s i ) then { stack.push(word); stack.push(initial Value); stack.push(s i ); word scanner.next_word(); } else if ( ACTION[s,word] == accept & word == EOF ) then break; else throw a syntax error; } report success; Fitting AHSDT into the LR(1) Skeleton Parser Modifications are minor - Insert call to Work() - Change the push() & pop() behavior Same asymptotic behavior as the original algorithm. - 50% more stack space Last obstacle is making it easy to write the code for Work() Note that, in C, the stack has some odd union type. COMP 412, Fall 2017 7

Translating Code Snippets Into Work() For each production, the compiler writer can provide a code snippet { value = value * 10 + digit; } We need a scheme to name stack locations. Yacc introduced a simple one that has been widely adopted. $$ refers to the result, which will be pushed on the stack $1 is the first item on the productions right hand side $2 is the second item $3 is the third item, and so on The digits example above becomes { $$ = $1 * 10 + $2; } COMP 412, Fall 2017 8

Translating Code Snippets Into Work() How do we implement Work()? Work() takes 2 arguments: the stack and a production number Work() contains a case statement that switches on production number Each case contains the code snippet for a reduction by that production The $1, $2, $3 macros translate into references into the stack The $$ macro translates into the return value if ( ACTION[s,word] == reduce A b ) then { r = Work(stack, A b ) stack.popnum(3* b ); // pop 3* b symbols s = stack.top(); // save exposed state stack.push(a); // push A stack.push (r); // push result of WORK() stack.push(goto[s,a]); // push next state } $$ translates to r $i translates to the stack location 3 *( b - i + 1) units down from stacktop Note that b, i, 3, and 1 are all constants so $i can be evaluated to a compiletime constant COMP 412, Fall 2017 9

SDT is a Mechanism AHSDT allows the compiler writer to specify syntax-driven computations Build an IR representation Critical for later phases of the compiler Might execute it directly to create an interpreter Calculators, accounting systems, command-line shells, Might analyze it to derive knowledge Optimizing compilers, theorem provers, Might transform it and re-generate source code Automatic parallelization systems, code refactoring tools, Measure and report on program properties Correct syntax errors Key point: the mechanism does not determine how you use it Corollary: the syntax does not dictate the form or content of the IR COMP 412, Fall 2017 10

Example Building an Abstract Syntax Tree 1 Goal Expr $$ = $1; 2 Expr Expr + Term $$ = MakeAddNode($1,$3); 3 Expr - Term $$ = MakeSubNode($1,$3); 4 Term $$ = $1; 5 Term Term * Factor $$ = MakeMulNode($1,$3); 6 Term / Factor $$ = MakeDivNode($1,$3); 7 Factor $$ = $1; 8 Factor ( Expr ) $$ = $2; 9 number $$ = MakeNumNode(token); 10 ident $$ = MakeIdNode(token); Assumptions: constructors for each node stack holds pointers to nodes COMP We call 412, $$ Fall = 2017 $1; a copy rule. 11

Example Building an Abstract Syntax Tree Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept Consider x 2 * y Trace of the LR(1) parse Detail of states abstracted Expression Grammar 1 Goal Expr 2 Expr Expr + Term 3 Expr - Term 4 Term 5 Term Term * Factor 6 I Term / Factor 7 I Factor 8 Factor ( Expr ) 9 num COMP 412, Fall 2017 10 id 12

Example Building an Abstract Syntax Tree Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept id x COMP 412, Fall 2017 13

Example Building an Abstract Syntax Tree Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept Action is a copy rule id x COMP 412, Fall 2017 14

Example Building an Abstract Syntax Tree Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept id num x 2 COMP 412, Fall 2017 15

Example Building an Abstract Syntax Tree Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept Action is a copy rule id num x 2 COMP 412, Fall 2017 16

Example Building an Abstract Syntax Tree Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept id num x 2 id y COMP 412, Fall 2017 17

Example Building an Abstract Syntax Tree Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept id num x 2 * id y COMP 412, Fall 2017 18

Example Building an Abstract Syntax Tree Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept - id num x 2 * id y COMP 412, Fall 2017 19

Example Building an Abstract Syntax Tree Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept AHSDT Works! Built the AST Some reduce actions just copied values; others called constructors. Same tree as earlier slide - id num x 2 * id y COMP 412, Fall 2017 20

Example Emitting ILOC 1 Goal Expr 2 Expr Expr +Term $$ = NextRegister(); Emit(add, $1, $3, $$); 3 Expr - Term $$ = NextRegister(); Emit(sub, $1, $3, $$); 4 Term $$ = $1; 5 Term Term * Factor $$ = NextRegister(); Emit(mult, $1, $3, $$) 6 Term / Factor $$ = NextRegister(); Emit(div, $1, $3, $$); 7 Factor $$ = $1; 8 Factor ( Expr ) $$ = $2; 9 number $$ = NextRegister(); Emit(loadI,Value(lexeme),$$); 10 ident $$ = NextRegister(); EmitLoad(ident,$$); Assumptions NextRegister() returns a virtual register name Emit() can format assembly code EmitLoad() handles addressability & gets a value into a register COMP Copy 412, rules Fall on 2017 the same productions as in the last example 21

Same parse, different rule set Example Emitting ILOC Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept Emitting ILOC The actions that generated AST leaves and the actions that generated AST interior nodes emit code. pr0 is a base address NextRegister() returned the next virtual register loadai pr0, @x vr1 loadi 2 vr2 loadai pr0, @y vr3 mult vr2, vr3 vr4 sub vr1,vr4 vr5 COMP 412, Fall 2017 22

Example Emitting ILOC Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept Action is a copy rule loadai pr0, @x vr1 loadi 2 vr2 loadai pr0, @y vr3 mult vr2, vr3 vr4 sub vr1,vr4 vr5 COMP 412, Fall 2017 23

Example Emitting ILOC Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept loadai pr0, @x vr1 loadi 2 vr2 loadai pr0, @y vr3 mult vr2, vr3 vr4 sub vr1,vr4 vr5 COMP 412, Fall 2017 24

Example Emitting ILOC Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept Action is a copy rule loadai pr0, @x vr1 loadi 2 vr2 loadai pr0, @y vr3 mult vr2, vr3 vr4 sub vr1,vr4 vr5 COMP 412, Fall 2017 25

Example Emitting ILOC Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept loadai pr0, @x vr1 loadi 2 vr2 loadai pr0, @y vr3 mult vr2, vr3 vr4 sub vr1,vr4 vr5 COMP 412, Fall 2017 26

Example Emitting ILOC Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept loadai pr0, @x vr1 loadi 2 vr2 loadai pr0, @y vr3 mult vr2, vr3 vr4 sub vr1,vr4 vr5 COMP 412, Fall 2017 27

Example Emitting ILOC Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept loadai pr0, @x vr1 loadi 2 vr2 loadai pr0, @y vr3 mult vr2, vr3 vr4 sub vr1,vr4 vr5 COMP 412, Fall 2017 28

Example Emitting ILOC Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept Emitting ILOC Simple, but clean, code EmitLoad() hides all of the messy details of naming, addressability, and address modes loadai pr0, @x vr1 loadi 2 vr2 loadai pr0, @y vr3 mult vr2, vr3 vr4 sub vr1,vr4 vr5 COMP 412, Fall 2017 29

What About EmitLoad() EmitLoad() hides lots of details Needs to map an ident to a location Register, symbolic address, or formula to compute a virtual address Implies that the ident has allocated storage Someone or something needs to lay out the contents of data memory Needs to generate code to compute the address Register becomes direct reference Symbolic address becomes a loadi/load sequence (as in lab 1) Formula becomes a more complex expression Formula may be an offset from some known point, a chain of one or more pointers, an indirect reference through a class definition, We need to deal with the issue of storage layout, but not today COMP 412, Fall 2017 30

Example Emitting ILOC Stack Input Action $ id num * id shift $ id num * id reduce 10 $ Factor num * id reduce 7 $ Term num * id reduce 4 $ Expr num * id reduce 9 $ Expr Factor * id reduce 7 $ Expr Term * id reduce 10 $ Expr Term * Factor reduce 5 $ Expr Term reduce 3 $ Expr accept What about using multi? Could fold the 2 into the multiply by using multi Requires a non-local computation Reach across productions to expose the difference between num & ident loadai pr0, @x vr1 loadi 2 vr2 loadai pr0, @y vr3 mult vr2, vr3 vr4 sub vr1,vr4 vr5 COMP 412, Fall 2017 31

Example Emitting ILOC Expr Term Fact. Goal Expr Term Fact. Term * Fact. <id,y> What about using multi? Could fold the 2 into the multiply by using multi Requires a non-local computation Reach across productions to expose the difference between num & ident Difficult to describe in the limited vocabulary of AHSDT (see EaC2e 8.4.1) Why is this hard to do with AHSDT? <id,x> <num,2> If the Factor is an id, then the rules Syntax tree need to pass a register upward, loadai pr0, @x vr1 If the Factor is a num, then the rules need to return either a register or an loadi 2 vr2 immediate value, depending on loadai pr0, @y vr3 whether it is the right or left child of mult vr2, vr3 vr4 the mult. The rules rapidly become context sub vr1,vr4 vr5 COMP 412, Fall 2017 dependent in complex ways. 32

Code Generation In a modern, multi-ir compiler, code generation happens several times SDT Lowering Passes source code Front End IR 1 IR 2 IR 3 target ST Optimizer ST Optimizer ST Back End code Generate an IR straight out of the parser Might be an AST, might be some high-level (abstract) linear form Almost always accompanied by a symbol table of some form Code passes through one or more lowering pass Takes the code and decreases ( lowers ) the level of abstraction Expand complex operations (e.g., call or mvcl), make control-flow explicit The problems are, essentially, the same The mechanisms are likely quite different COMP 412, Fall 2016 33

Mechanisms for Code Generation Might STOP In different contexts, the compiler might use different techniques In an LR(1) parser, SDT might be the right tool Generate a tree, a graph, or linear code Actions driven by the syntax of the input code In a lowering pass, traversing the IR might be necessary For a graphical IR: A treewalk, or graph-walk, that produces the new IR For a linear IR: Walk through the IR in one direction or the other In instruction selection, pattern matching is the tool of choice The set of choices, particularly address modes, make this problem complex Code is not subject to subsequent optimization The context makes these tasks subtly different Early in compilation, generate code that will optimize well Late in compilation, generate code that will run well COMP 412, Fall 2016 34

A More Complex Example Generating code for an if-then-else construct Stmt IF LPAREN Expr RPAREN THEN WithElse ELSE WithElse Expr WithElse Control-flow constructs require branches and labels Need a schema for how to implement an if-then-else Evaluate the expression Based on its value, branch to the then part or the else part After evaluating the appropriate part, branch to the next statement We will call that the point in the code the exit to simplify talking about it We will assume that the grammar has Boolean & relational expressions (Fig. 7.7, p 351 in EaC2e) COMP 412, Fall 2017 35

Boolean & Relational Expressions First, we need to add boolean & relational expressions to the grammar Boolean Boolean AndTerm AndTerm AndTerm AndTerm RelExpr RelExpr RelExpr RelExpr < Expr This allows w < x < y < z RelExpr Expr RelExpr = Expr RelExpr Expr RelExpr Expr RelExpr > Expr Expr Expr Expr + Term Expr - Term Term Term Term Value Term Value Value Value! Factor Factor Factor ( Expr ) number Reference where Reference derives a name, a subscripted name, a COMP 412, Fall 2017 structure reference, a string reference, a function call, 36

A More Complex Example Generating code for an if-then-else construct Stmt IF LPAREN Expr RPAREN THEN WithElse ELSE WithElse Expr WithElse Control-flow constructs require branches and labels To generate code with SDT, need to create & track the labels 1. Create labels for then part, else part, and the exit 2. Emit branch to appropriate part (then or else) after RPAREN 3. Emit label for then part before first WithElse 4. At end of WithElse, emit branch to the exit 5. Emit label for else part before second WithElse 6. At end of second WithElse, emit branch to exit 7. Emit label for the exit on a nop after the second WithElse COMP 412, Fall 2017 37

A More Complex Example Generating code for an if-then-else construct We need a way to hang code snippets in the middle of the RHS Use the trick from GramSymbol an epsilon production Stmt IF LPAREN Expr RPAREN CreateAndBranch CreateAndBranch ε EmitThenLabel ε EmitExitJump ε EmitElseLabel ε EmitExitLabel ε THEN EmitThenLabel WithElse EmitExitJump ELSE EmitElseLabel WithElse EmitExitJump EmitExitLabel COMP 412, Fall 2017 38

A More Complex Example Generating code for an if-then-else construct To generate code with SDT, need to create & track the labels 1. Create labels for then part, else part, and exit Need a structure to hold the three labels Generate three labels and push them onto the stack in CreateAndBranch action 2. Emit branch to appropriate part (then or else) after RPAREN Emit the branch in CreateAndBranch action 3. Emit label for then part before first WithElse Emit a labelled nop in EmitThenLabel action 4. At end of WithElse, emit branch to the exit Emit a jump to the exit label in EmitExitLabel action 5. Emit label for else part before second WithElse Emit a labelled nop in EmitElseLabel action 6. At end of second WithElse, emit branch to the exit Handled by 4. above 7. Emit label for the exit on a nop after the second WithElse Emit a labelled nop in EmitExitLabel action COMP 412, Fall 2017 39

A More Complex Example Bison supports this idea Allows code snippets between any two grammar symbols on the RHS of a production Generates the appropriate epsilon production, its reduction, and ties the action to this new reduction Actions work in the name space of the production where they are written Allows the notation to handle these code snippets in a natural way Sample from the compiler for DEMO ITE: IF LP Bool RP { $$ = new ITEHEAD ; $$->then_label = NextLabel; $$->else_label = NextLabel; $$->exit_label = NextLabel; emit_branch($3->reg,$$->then_label, $$->else_label); emit_nop($$->then_label,"then part"); } THEN WithElse { emit_jump($$->exit_label); emit_nop($$->else_label); } ELSE WithElse { emit_jump($$->exit_label); emit_nop($$->exit_label); } Combines several items into one action This does get tricky. Bison introduces the ε-productions, which can introduce new shift-reduce COMP conflicts 412, and Fall changes 2017 the numbering of the $i macros. (See the Bison manual.) 40