1 Intermediate Code Generation Abstraction at the source level identifiers, operators, expressions, statements, conditionals, iteration, functions (user.

Slides:



Advertisements
Similar presentations
Intermediate Code Generation. 2 Intermediate languages Runtime environments Declarations Expressions Statements.
Advertisements

Chapter 6 Intermediate Code Generation
Code Generation.
Intermediate Code Generation
Intermediate Code Generation. 2 Intermediate languages Declarations Expressions Statements.
Backpatching: The syntax directed definition we discussed before can be implemented in two or more passes (we have both synthesized attributes and inheritent.
Lecture 08a – Backpatching & Recap Eran Yahav 1 Reference: Dragon 6.2,6.3,6.4,6.6.
Short circuit code for boolean expressions: Boolean expressions are typically used in the flow of control statements, such as if, while and for statements,
8 Intermediate code generation
Chapter 8 Intermediate Code Generation. Intermediate languages: Syntax trees, three-address code, quadruples. Types of Three – Address Statements: x :=
1 Compiler Construction Intermediate Code Generation.
Generation of Intermediate Code Compiler Design Lecture (03/30//98) Computer Science Rensselaer Polytechnic.
Compiler Construction Sohail Aslam Lecture Boolean Expressions E → E 1 and M E 2 {backpatch(E 1.truelist, M.quad); E.truelist = E 2.truelist; E.falselist.
Compiler Designs and Constructions
Three Address Code Generation Backpatching-I Prepared By: Siddharth Tiwary 04CS3010.
Lecture 09 – IR (Backpatching) Eran Yahav 1 Reference: Dragon 6.2,6.3,6.4,6.6
Theory of Compilation Erez Petrank Lecture 7: Intermediate Representation Part 2: Backpatching 1.
Theory of Compilation Erez Petrank Lecture 6: Intermediate Representation 1.
1 CMPSC 160 Translation of Programming Languages Fall 2002 Lecture-Modules 17 and 18 slides derived from Tevfik Bultan, Keith Cooper, and Linda Torczon.
1 Chapter 7: Runtime Environments. int * larger (int a, int b) { if (a > b) return &a; //wrong else return &b; //wrong } int * larger (int *a, int *b)
1 Intermediate Code generation. 2 Intermediate Code Generation l Intermediate languages l Declarations l Expressions l Statements l Reference: »Chapter.
CH4.1 CSE244 Intermediate Code Generation Aggelos Kiayias Computer Science & Engineering Department The University of Connecticut 371 Fairfield Road, Unit.
Test Yourself #2 Test Yourself #3 Initial code: a := x ** 2 b := 3 c := x d := c * c e := b * 2 f := a + d g := e * f.
Static checking and symbol table Chapter 6, Chapter 7.6 and Chapter 8.2 Static checking: check whether the program follows both the syntactic and semantic.
1 Structure of a Compiler Front end of a compiler is efficient and can be automated Back end is generally hard to automate and finding the optimum solution.
1 Intermediate Code Generation Part I Chapter 8 COP5621 Compiler Construction Copyright Robert van Engelen, Florida State University, 2007.
Chapter 8: Intermediate Code Generation
Review: –What is an activation record? –What are the typical fields in an activation record? –What are the storage allocation strategies? Which program.
1 October 25, October 25, 2015October 25, 2015October 25, 2015 Azusa, CA Sheldon X. Liang Ph. D. Computer Science at Azusa Pacific University Azusa.
國立台灣大學 資訊工程學系 薛智文 98 Spring Intermediate Code Generation (textbook ch# 6.1–6.4, 6.5.1–6.5.3,
Intermediate Code Generation
Intermediate Code Generation
1 June 3, June 3, 2016June 3, 2016June 3, 2016 Azusa, CA Sheldon X. Liang Ph. D. Computer Science at Azusa Pacific University Azusa Pacific University,
Topic #7: Intermediate Code EE 456 – Compiling Techniques Prof. Carl Sable Fall 2003.
Intermediate Code Generation CS308 Compiler Theory1.
Review: Syntax directed translation. –Translation is done according to the parse tree. Each production (when used in the parsing) is a sub- structure of.
Boolean expressions 1 productionsemantic action E  E1 or E2E1.trueLabel = E.trueLabel; E1.falseLabel = freshLabel(); E2.trueLabel = E.trueLabel; E2.falseLabel.
Chap. 4, Intermediate Code Generation
Theory of Compilation Erez Petrank Lecture 6: Intermediate Representation and Attribute Grammars 1.
1 February 28, February 28, 2016February 28, 2016February 28, 2016 Azusa, CA Sheldon X. Liang Ph. D. Computer Science at Azusa Pacific University.
Three Address Code Generation of Control Statements continued..
Compilers Principles, Techniques, & Tools Taught by Jing Zhang
Intermediate code generation Jakub Yaghob
Compiler Construction
Compiler Construction
Subject Name:COMPILER DESIGN Subject Code:10CS63
Compiler Optimization and Code Generation
Intermediate Code Generation Part I
Intermediate Code Generation
Intermediate Code Generation Part II
Three Address Code Generation Control Statement
Chapter 6 Intermediate-Code Generation
Intermediate Code Generation Part II
Three Address Code Generation - Control Statements
Intermediate Code Generation Part I
Intermediate Code Generation Part II
Intermediate code generation
Three-address code A more common representation is THREE-ADDRESS CODE . Three address code is close to assembly language, making machine code generation.
7.4 Boolean Expression and Control Flow Statements
Compiler Design 21. Intermediate Code Generation
Intermediate Code Generation Part I
Three Address Code Generation – Backpatching
Intermediate Code Generation
Intermediate Code Generation Part II
Intermediate Code Generation
Compiler Construction
Intermediate Code Generation Part II
Review: For array a[2,5], how would the memory for this array looks like using row major layout? What about column major layout? How to compute the address.
Intermediate Code Generation Part I
Compiler Design 21. Intermediate Code Generation
Presentation transcript:

1 Intermediate Code Generation Abstraction at the source level identifiers, operators, expressions, statements, conditionals, iteration, functions (user defined, system defined or libraries) Abstraction at the target level memory locations, registers, stack, opcodes, addressing modes, system libraries, interface to the operating systems Code generation is mapping from source level abstractions to target machine abstractions

2 Intermediate Code Generation Front end translates a source program into an intermediate representation Back end generates target code from intermediate representation Benefits –Retargeting is possible –Machine independent code optimization is possible Front end Intermediate Code generator Machine Code generato r

3 Syntax directed translation of expression into 3-address code S → id := ES.code = E.code || gen(id.place:= E.place) E → E 1 + E 2 E.place:= newtmp E.code:= E 1.code || E 2.code || gen(E.place := E 1.place + E 2.place) E → E 1 * E 2 E.place:= newtmp E.code := E 1.code || E 2.code || gen(E.place := E 1.place * E 2.place)

4 Syntax directed translation of expression … E → -E 1 E.place := newtmp E.code := E 1.code || gen(E.place := - E 1.place) E → (E 1 )E.place := E 1.place E.code := E 1.code E → id E.place := id.place E.code := ‘ ‘

5 Example For a = b * -c + b * -c following code is generated t 1 = -c t 2 = b * t 1 t 3 = -c t 4 = b * t 3 t 5 = t 2 + t 4 a = t 5

6 Flow of Control S → while E do S 1 S. begin : E.code if E.place = 0 goto S.after S 1.code goto S.begin S.after : S.begin := newlabel S.after := newlabel S.code := gen(S.begin:) || E.code || gen(if E.place = 0 goto S.after) || S 1.code || gen(goto S.begin) || gen(S.after:)

7 Flow of Control … S → if E then S 1 else S 2 E.code if E.place = 0 goto S.else S 1.code goto S.after S.else: S 2.code S.after: S.else := newlabel S.after := newlabel S.code = E.code || gen(if E.place = 0 goto S.else) || S 1.code || gen(goto S.after) || gen(S.else :) || S 2.code || gen(S.after :)

8 Declarations For each name create symbol table entry with information like type and relative address P → {offset=0} D D → D ; D D → id : T enter(id.name, T.type, offset); offset = offset + T.width T → integer T.type = integer; T.width = 4 T → real T.type = real; T.width = 8

9 Declarations … T → array [ num ] of T 1 T.type = array(num.val, T 1.type) T.width = num.val x T 1.width T → ↑ T 1 T.type = pointer(T 1.type) T.width = 4

10 Keeping track of local information when a nested procedure is seen, processing of declaration in enclosing procedure is temporarily suspended assume following language P → D D → D ;D | id : T | proc id ;D ; S a new symbol table is created when procedure declaration D → proc id ; D 1 ; S is seen entries for D 1 are created in the new symbol table the name represented by id is local to the enclosing procedure

11 Example program sort; var a : array[1..n] of integer; x : integer; procedure readarray; var i : integer; …… procedure exchange(i,j:integers); …… procedure quicksort(m,n : integer); var k,v : integer; function partition(x,y:integer):integer; var i,j: integer; …… begin{main} …… end.

12 nil header a x readarray exchange quicksort readarrayexchange quicksort header i k v i j partition to readarray to exchange partition sort

13 Creating symbol table mktable (previous) create a new symbol table and return a pointer to the new table. The argument previous points to the enclosing procedure enter (table, name, type, offset) creates a new entry addwidth (table, width) records cumulative width of all the entries in a table enterproc (table, name, newtable) creates a new entry for procedure name. newtable points to the symbol table of the new procedure

14 Creating symbol table … P → {t=mktable(nil); push(t,tblptr); push(0,offset)} D {addwidth(top(tblptr),top(offset )); pop(tblptr); pop(offset)} D → D ; D

15 Creating symbol table … D → proc id; {t = mktable(top(tblptr)); push(t, tblptr); push(0, offset)} D 1 ; S {t = top(tblptr); addwidth(t, top(offset)); pop(tblptr); pop(offset);; enterproc(top(tblptr), id.name, t)} D → id: T {enter(top(tblptr), id.name, T.type, top(offset)); top(offset) = top (offset) + T.width}

16 Field names in records T → record {t = mktable(nil); push(t, tblptr); push(0, offset)} D end {T.type = record(top(tblptr)); T.width = top(offset); pop(tblptr); pop(offset)}

17 Names in the Symbol table S → id := E {p = lookup(id.place); if p <> nil then emit(p := E.place) else error} E → id {p = lookup(id.name); if p <> nil then E.place = p else error}

18 Addressing Array Elements Arrays are stored in a block of consecutive locations assume width of each element is w ith element of array A begins in location base + (i - low) x w where base is relative address of A[low] the expression is equivalent to i x w + (base-low x w)  i x w + const

19 2-dimensional array storage can be either row major or column major in case of 2-D array stored in row major form address of A[i 1, i 2 ] can be calculated as base + ((i 1 – low 1 ) x n 2 + i 2 - low 2 ) x w where n 2 = high 2 - low rewriting the expression gives ((i 1 x n 2 ) + i 2 ) x w + (base - ((low 1 x n 2 ) + low 2 ) x w)  ((i 1 x n 2 ) + i 2 ) x w + constant this can be generalized for A[i 1, i 2,…, i k ]

20 Example Let A be a 10x20 array therefore, n 1 = 10 and n 2 = 20 and assume w = 4 code to access A[y,z] is t 1 = y * 20 t 1 = t 1 + z t 2 = 4 * t 1 t 3 =A-84 {((low 1 Xn 2 )+low 2 )Xw)=(1*20+1)*4=84} t 4 = t 2 + t 3 x = t 4

21 Type conversion within assignments E → E 1 + E 2 E.place= newtmp; if E 1.type = integer and E 2.type = integer then emit(E.place ':=' E 1.place 'int+' E 2.place); E.type = integer; … similar code if both E 1.type and E 2.type are real … else if E 1.type = int and E 2.type = real then u = newtmp; emit(u ':=' inttoreal E 1.place); emit(E.place ':=' u 'real+' E 2.place); E.type = real; … similar code if E 1.type is real and E 2.type is integer

22 Example real x, y; int i, j; x = y + i * j generates code t 1 = i int* j t 2 = inttoreal t 1 t 3 = y real+ t 2 x = t 3

23 Boolean Expressions compute logical values change the flow of control boolean operators are: and or not E → E or E | E and E | not E | (E) | id relop id | true | false

24 Methods of translation Evaluate similar to arithmetic expressions –Normally use 1 for true and 0 for false implement by flow of control – given expression E 1 or E 2 if E 1 evaluates to true then E 1 or E 2 evaluates to true without evaluating E 2

25 Numerical representation a or b and not c t 1 = not c t 2 = b and t 1 t 3 = a or t 2 relational expression a < b is equivalent to if a < b then 1 else 0 1. if a < b goto t = 0 3. goto 5 4. t = 1 5.

26 Syntax directed translation of boolean expressions E → E 1 or E 2 E.place := newtmp emit(E.place ':=' E 1.place 'or' E 2.place) E → E 1 and E 2 E.place:= newtmp emit(E.place ':=' E 1.place 'and' E 2.place) E → not E 1 E.place := newtmp emit(E.place ':=' 'not' E 1.place) E → (E 1 ) E.place = E 1.place

27 Syntax directed translation of boolean expressions E → id1 relop id2 E.place := newtmp emit(if id1.place relop id2.place goto nextstat+3) emit(E.place = 0) emit(goto nextstat+2) emit(E.place = 1) E → true E.place := newtmp emit(E.place = '1') E → false E.place := newtmp emit(E.place = '0')

28 Example: Code for a < b or c < d and e < f 100: if a < b goto : t l = 0 102: goto : t l = 1 104: if c < d goto : t 2 = 0 106: goto : t 2 = 1 108: if e < f goto : t 3 = 0 110: goto : t 3 = 1 112: t 4 = t 2 and t 3 113: t 5 = t l or t 4

29 Short Circuit Evaluation of boolean expressions Translate boolean expressions without: –generating code for boolean operators –evaluating the entire expression Flow of control statements S → if E then S 1 | if E then S 1 else S 2 | while E do S 1

30 S → if E then S 1 E.true = newlabel E.false = S.next S 1.next = S.next S.code = E.code || gen(E.true ':') || S 1.code E.true E.false E.code S 1.code

31 S → if E then S 1 else S 2 E.true = newlabel E.false = newlabel S 1.next = S.next S 2.next = S.next S.code = E.code || gen(E.true ':') || S 1.code || gen(goto S.next) || gen(E.false ':') || S 2.code E.true E.false S.next E.code S 1.code goto S.next

32 S → while E do S 1 S.begin = newlabel E.true = newlabel E.false = S.next S 1.next = S.begin S.ocde = gen(S.begin ':') || E.code || gen(E.true ':') || S 1.code || gen(goto S.begin) E.true E.false S.begin E.code S 1.code goto S.begin

33 Control flow translation of boolean expression E → E 1 or E 2 E 1.true := E.true E 1.false := newlabel E 2.true := E.true E 2.false := E.false E.code := E 1.code || gen(E 1.false) || E 2.code E → E 1 and E 2 E 1.true := new label E 1 false := E.false E 2.true := E.true E 2 false := E.false E.code := E 1.code || gen(E 1.true) || E 2.code

34 Control flow translation of boolean expression … E → not E 1 E 1.true := E.false E 1.false := E.true E.code := E 1.code E → (E 1 ) E 1.true := E.true E 1.false := E.false E.code := E 1.code

35 Control flow translation of boolean expression … if E is of the form a < b then code is of the form if a < b goto E.true goto E.false E → id 1 relop id 2 E.code = gen( if id 1 relop id 2 goto E.true) || gen(goto E.false) E → true E.code = gen(goto E.true) E → false E.code = gen(goto E.false)

36 Example Code for a < b or c < d and e < f if a < b goto Ltrue goto L1 L1: if c < d goto L2 goto Lfalse L2: if e < f goto Ltrue goto Lfalse Ltrue: Lfalse:

37 Example … Code for while a < b do if c<d then x=y+z else x=y-z L1: if a < b goto L2 goto Lnext L2: if c < d goto L3 goto L4 L3: t 1 = Y + Z X= t 1 goto L1 L4: t 1 = Y - Z X= t 1 goto Ll Lnext:

38 Case Statement switch expression begin case value: statement …. case value: statement default: statement end evaluate the expression find which value in the list of cases is the same as the value of the expression. –Default value matches the expression if none of the values explicitly mentioned in the cases matches the expression execute the statement associated with the value found

39 Translation code to evaluate E into t if t <> V1 goto L1 code for S1 goto next L1 if t <> V2 goto L2 code for S2 goto next L2: …… Ln-2 if t <> Vn-l goto Ln-l code for Sn-l goto next Ln-1:code for Sn next: code to evaluate E into t goto test L1: code for S1 goto next L2: code for S2 goto next …… Ln: code for Sn goto next test: if t = V1 goto L1 if t = V2 goto L2 …. if t = Vn-1 goto Ln-1 goto Ln next: Efficient for n-way branch

40 Back Patching way to implement boolean expressions and flow of control statements in one pass code is generated as quadruples into an array labels are indices into this array makelist(i): create a newlist containing only i, return a pointer to the list. merge(p1,p2): merge lists pointed to by p1 and p2 and return a pointer to the concatenated list backpatch(p,i): insert i as the target label for the statements in the list pointed to by p

41 Boolean Expressions E → E 1 or M E 2 | E 1 and M E 2 | not E 1 | (E 1 ) | id 1 relop id 2 | true | false M → Є Insert a marker non terminal M into the grammar to pick up index of next quadruple. attributes truelist and falselist are used to generate jump code for boolean expressions incomplete jumps are placed on lists pointed to by E.truelist and E.falselist

42 Boolean expressions … Consider E → E 1 and M E 2 –if E 1 is false then E is also false so statements in E 1.falselist become part of E.falselist –if E 1 is true then E 2 must be tested so target of E 1.truelist is beginning of E 2 –target is obtained by marker M –attribute M.quad records the number of the first statement of E 2.code

43 E → E 1 or M E 2 backpatch(E 1.falselist, M.quad) E.truelist = merge(E 1.truelist, E 2.truelist) E.falselist = E 2.falselist E → E 1 and M E 2 backpatch(E 1.truelist, M.quad) E.truelist = E 2.truelist E.falselist = merge(E 1.falselist, E 2.falselist) E → not E 1 E.truelist = E 1 falselist E.falselist = E 1.truelist E → ( E 1 ) E.truelist = E 1.truelist E.falselist = E 1.falselist

44 E → id 1 relop id 2 E.truelist = makelist(nextquad) E.falselist = makelist(nextquad+ 1) emit(if id 1 relop id 2 goto --- ) emit(goto ---) E → true E.truelist = makelist(nextquad) emit(goto ---) E → false E.falselist = makelist(nextquad) emit(goto ---) M → Є M.quad = nextquad

45 Generate code for a < b or c < d and e < f E.t={100,104} E.f={103,105} E.t={100} E.f={101} E.t={104} E.f={103,105 } or M.q=102 Є E.t={102} E.f={103} and M.q=104 E.t ={104} E.f={105} cd < a <b Є e < f Initialize nextquad to : if a < b goto - 101: goto - 102: if c < d goto - 103: goto - 104: if e < f goto goto – backpatch(102,104) 104 backpatch(101,102) 102

46 Flow of Control Statements S  if E then S 1 | if E then S 1 else S 2 | while E do S 1 | begin L end | A L  L ; S | S S : Statement A : Assignment L : Statement list

47 Scheme to implement translation E has attributes truelist and falselist L and S have a list of unfilled quadruples to be filled by backpatching S  while E do S 1 requires labels S.begin and E.true –markers M 1 and M 2 record these labels S  while M 1 E do M 2 S 1 –when while... is reduced to S backpatch S 1.nextlist to make target of all the statements to M 1.quad –E.truelist is backpatched to go to the beginning of S 1 (M 2.quad)

48 Scheme to implement translation … S  if E then M S 1 backpatch(E.truelist, M.quad) S.nextlist = merge(E.falselist, S 1.nextlist) S  if E them M 1 S 1 N else M 2 S 2 backpatch(E.truelist, M 1.quad) backpatch(E.falselist, M 2.quad ) S.next = merge(S 1.nextlist, N.nextlist, S 2.nextlist) S  while M 1 E do M 2 S 1 backpatch(S 1.nextlist, M 1.quad) backpatch(E.truelist, M 2.quad) S.nextlist = E.falselist emit(goto M 1.quad)

49 Scheme to implement translation … S  begin L endS.nextlist = L.nextlist S  AS.nextlist = makelist() L  L 1 ; M Sbackpatch(L 1.nextlist, M.quad) L.nextlist = S.nextlist L  SL.nextlist = S.nextlist N   N.nextlist = makelist(nextquad) emit(goto ---) M   M.quad = nextquad

50 Procedure Calls S  call id ( Elist ) Elist  Elist, E Elist  E Calling sequence –allocate space for activation record –evaluate arguments –establish environment pointers –save status and return address –jump to the beginning of the procedure

51 Procedure Calls … Example parameters are passed by reference storage is statically allocated use param statement as place holder for the arguments called procedure is passed a pointer to the first parameter pointers to any argument can be obtained by using proper offsets

52 Code Generation Generate three address code needed to evaluate arguments which are expressions Generate a list of param three address statements Store arguments in a list S  call id ( Elist ) for each item p on queue do emit('param' p) emit('call' id.place) Elist  Elist, E append E.place to the end of queue Elist  E initialize queue to contain E.place