Presentation is loading. Please wait.

Presentation is loading. Please wait.

Direction of analysis Although constraints are not directional, flow functions are All flow functions we have seen so far are in the forward direction.

Similar presentations


Presentation on theme: "Direction of analysis Although constraints are not directional, flow functions are All flow functions we have seen so far are in the forward direction."— Presentation transcript:

1 Direction of analysis Although constraints are not directional, flow functions are All flow functions we have seen so far are in the forward direction In some cases, the constraints are of the form in = F(out) These are called backward problems. Example: live variables –compute the set of variables that may be live

2 Example: live variables Set D = Lattice: (D, v, ?, >, t, u ) =

3 Example: live variables Set D = 2 Vars Lattice: (D, v, ?, >, t, u ) = (2 Vars, µ, ;,Vars, [, Å ) x := y op z in out F x := y op z (out) =

4 Example: live variables Set D = 2 Vars Lattice: (D, v, ?, >, t, u ) = (2 Vars, µ, ;,Vars, [, Å ) x := y op z in out F x := y op z (out) = out – { x } [ { y, z}

5 Example: live variables x := 5 y := x + 2 x := x + 1 y := x + 10... y...

6 Example: live variables x := 5 y := x + 2 x := x + 1 y := x + 10... y... How can we remove the x := x + 1 stmt?

7 Revisiting assignment x := y op z in out F x := y op z (out) = out – { x } [ { y, z}

8 Revisiting assignment x := y op z in out F x := y op z (out) = out – { x } [ { y, z}

9 Theory of backward analyses Can formalize backward analyses in two ways Option 1: reverse flow graph, and then run forward problem Option 2: re-develop the theory, but in the backward direction

10 Precision Going back to constant prop, in what cases would we lose precision?

11 Precision Going back to constant prop, in what cases would we lose precision? if (p) { x := 5; } else x := 4; }... if (p) { y := x + 1 } else { y := x + 2 }... y... if (...) { x := -1; } else x := 1; } y := x * x;... y... x := 5 if ( ) { x := 6 }... x... where is equiv to false

12 Precision The first problem: Unreachable code –solution: run unreachable code removal before –the unreachable code removal analysis will do its best, but may not remove all unreachable code The other two problems are path-sensitivity issues –Branch correlations: some paths are infeasible –Path merging: can lead to loss of precision

13 MOP: meet over all paths Information computed at a given point is the meet of the information computed by each path to the program point if (...) { x := -1; } else x := 1; } y := x * x;... y...

14 MOP For a path p, which is a sequence of statements [s 1,..., s n ], define: F p (in) = F s n (...F s 1 (in)... ) In other words: F p = Given an edge e, let paths-to(e) be the (possibly infinite) set of paths that lead to e Given an edge e, MOP(e) = For us, should be called JOP (ie: join, not meet)

15 MOP vs. dataflow MOP is the “best” possible answer, given a fixed set of flow functions –This means that MOP v dataflow at edge in the CFG In general, MOP is not computable (because there can be infinitely many paths) –vs dataflow which is generally computable (if flow fns are monotonic and height of lattice is finite) And we saw in our example, in general, MOP  dataflow

16 MOP vs. dataflow However, it would be great if by imposing some restrictions on the flow functions, we could guarantee that dataflow is the same as MOP. What would this restriction be? x := -1; y := x * x;... y... x := 1; y := x * x;... y... Merge x := -1;x := 1; Merge y := x * x;... y... DataflowMOP

17 MOP vs. dataflow However, it would be great if by imposing some restrictions on the flow functions, we could guarantee that dataflow is the same as MOP. What would this restriction be? Distributive problems. A problem is distributive if: 8 a, b. F(a t b) = F(a) t F(b) If flow function is distributive, then MOP = dataflow

18 Summary of precision Dataflow is the basic algorithm To basic dataflow, we can add path-separation –Get MOP, which is same as dataflow for distributive problems –Variety of research efforts to get closer to MOP for non-distributive problems To basic dataflow, we can add path-pruning –Get branch correlation To basic dataflow, can add both: –meet over all feasible paths

19 Program Representations

20 Representing programs Goals

21 Representing programs Primary goals –analysis is easy and effective just a few cases to handle directly link related things –transformations are easy to perform –general, across input languages and target machines Additional goals –compact in memory –easy to translate to and from –tracks info from source through to binary, for source-level debugging, profilling, typed binaries –extensible (new opts, targets, language features) –displayable

22 Option 1: high-level syntax based IR Represent source-level structures and expressions directly Example: Abstract Syntax Tree

23 Option 2: low-level IR Translate input programs into low-level primitive chunks, often close to the target machine Examples: assembly code, virtual machine code (e.g. stack machines), three-address code, register-transfer language (RTL) Standard RTL instrs:

24 Option 2: low-level IR

25 Comparison

26 Advantages of high-level rep –analysis can exploit high-level knowledge of constructs –easy to map to source code (debugging, profiling) Advantages of low-level rep –can do low-level, machine specific reasoning –can be language-independent Can mix multiple reps in the same compiler

27 Components of representation Control dependencies: sequencing of operations –evaluation of if & then –side-effects of statements occur in right order Data dependencies: flow of definitions from defs to uses –operands computed before operations Ideal: represent just those dependencies that matter –dependencies constrain transformations –fewest dependences ) flexibility in implementation

28 Control dependencies Option 1: high-level representation –control implicit in semantics of AST nodes Option 2: control flow graph (CFG) –nodes are individual instructions –edges represent control flow between instructions Options 2b: CFG with basic blocks –basic block: sequence of instructions that don’t have any branches, and that have a single entry point –BB can make analysis more efficient: compute flow functions for an entire BB before start of analysis

29 Control dependencies CFG does not capture loops very well Some fancier options include: –the Control Dependence Graph –the Program Dependence Graph More on this later. Let’s first look at data dependencies

30 Data dependencies Simplest way to represent data dependencies: def/use chains

31 Def/use chains Directly captures dataflow –works well for things like constant prop But... Ignores control flow –misses some opt opportunities since conservatively considers all paths –not executable by itself (for example, need to keep CFG around) –not appropriate for code motion transformations Must update after each transformation Space consuming

32 SSA Static Single Assignment –invariant: each use of a variable has only one def

33

34 SSA Create a new variable for each def Insert  pseudo-assignments at merge points Adjust uses to refer to appropriate new names Question: how can one figure out where to insert  nodes using a liveness analysis and a reaching defns analysis.

35 Converting back from SSA Semantics of x 3 :=  (x 1, x 2 ) –set x 3 to x i if execution came from ith predecessor How to implement  nodes?

36 Converting back from SSA Semantics of x 3 :=  (x 1, x 2 ) –set x 3 to x i if execution came from ith predecessor How to implement  nodes? –Insert assignment x 3 := x 1 along 1 st predecessor –Insert assignment x 3 := x 2 along 2 nd predecessor If register allocator assigns x 1, x 2 and x 3 to the same register, these moves can be removed –x 1.. x n usually have non-overlapping lifetimes, so this kind of register assignment is legal


Download ppt "Direction of analysis Although constraints are not directional, flow functions are All flow functions we have seen so far are in the forward direction."

Similar presentations


Ads by Google