Compiler 薛智文 TH 6 7 8, DTH 102 cwhsueh@csie.ntu.edu.tw http://www.csie.ntu.edu.tw/~cwhsueh/ 96 Spring.

Slides:



Advertisements
Similar presentations
Introduction to Compiler Construction
Advertisements

Compiler Chang Chi-Chung Textbook Compilers: Principles, Techniques, and Tools, 2/E.  Alfred V. Aho, Columbia University  Monica S. Lam,
CPSC Compiler Tutorial 9 Review of Compiler.
Yu-Chen Kuo1 Chapter 1 Introduction to Compiling.
Chapter 1: Introduction to Compiling
Compiler Construction1 A Compulsory Module for Students in Computer Science Department Faculty of IT / Al – Al Bayt University First Semester 2009/2010.
1.3 Executing Programs. How is Computer Code Transformed into an Executable? Interpreters Compilers Hybrid systems.
Lecture 2 Phases of Compiler. Preprocessors, Compilers, Assemblers, and Linkers Preprocessor Compiler Assembler Linker Skeletal Source Program Source.
CPSC 388 – Compiler Design and Construction Lecture: MWF 11:00am-12:20pm, Room 106 Colton.
September 7, September 7, 2015September 7, 2015September 7, 2015 Azusa, CA Sheldon X. Liang Ph. D. Computer Science at Azusa Pacific University.
Course Revision Contents  Compilers  Compilers Vs Interpreters  Structure of Compiler  Compilation Phases  Compiler Construction Tools  A Simple.
COP4020 Programming Languages
Chapter 1. Introduction.
Chapter 1 Introduction Dr. Frank Lee. 1.1 Why Study Compiler? To write more efficient code in a high-level language To provide solid foundation in parsing.
Introduction to Compiler Construction Robert van Engelen COP5621 Compiler Construction Copyright Robert.
CSC 338: Compiler design and implementation
CST320 - Lec 11 Why study compilers? n n Ties lots of things you know together: –Theory (finite automata, grammars) –Data structures –Modularization –Utilization.
1 COMP 3438 – Part II-Lecture 1: Overview of Compiler Design Dr. Zili Shao Department of Computing The Hong Kong Polytechnic Univ.
Unit-1 Introduction Prepared by: Prof. Harish I Rathod
1.  10% Assignments/ class participation  10% Pop Quizzes  05% Attendance  25% Mid Term  50% Final Term 2.
Chapter 1 Introduction. Chapter 1 - Introduction 2 The Goal of Chapter 1 Introduce different forms of language translators Give a high level overview.
1. 2 Preface In the time since the 1986 edition of this book, the world of compiler design has changed significantly 3.
CS 460/660 Compiler Construction. Class 01 2 Why Study Compilers? Compilers are important – –Responsible for many aspects of system performance Compilers.
Introduction to Compilers. Related Area Programming languages Machine architecture Language theory Algorithms Data structures Operating systems Software.
Topic #1: Introduction EE 456 – Compiling Techniques Prof. Carl Sable Fall 2003.
Overview of Previous Lesson(s) Over View  A program must be translated into a form in which it can be executed by a computer.  The software systems.
1 Compiler Design (40-414)  Main Text Book: Compilers: Principles, Techniques & Tools, 2 nd ed., Aho, Lam, Sethi, and Ullman, 2007  Evaluation:  Midterm.
Chapter 1 Introduction Study Goals: Master: the phases of a compiler Understand: what is a compiler Know: interpreter,compiler structure.
國立台灣大學 資訊工程學系 薛智文 98 Spring Compiler TH 234, DTH 103.
Introduction to Compiling
Compiler Design Introduction 1. 2 Course Outline Introduction to Compiling Lexical Analysis Syntax Analysis –Context Free Grammars –Top-Down Parsing –Bottom-Up.
What is a compiler? –A program that reads a program written in one language (source language) and translates it into an equivalent program in another language.
CH1.1 CSE244 Chapter 1: Introduction to Compiling Prof. Steven A. Demurjian, Sr. Computer Science & Engineering Department The University of Connecticut.
Dr. Mohamed Ramadan Saady 314ALL CH1.1 Chapter 1: Introduction to Compiling.
©SoftMoore ConsultingSlide 1 Structure of Compilers.
1 Asstt. Prof Navjot Kaur Computer Dept PRESENTED BY.
Overview of Compilation Prepared by Manuel E. Bermúdez, Ph.D. Associate Professor University of Florida Programming Language Principles Lecture 2.
Presented by : A best website designer company. Chapter 1 Introduction Prof Chung. 1.
Chapter 1 Introduction Samuel College of Computer Science & Technology Harbin Engineering University.
Chapter 1. Introduction.
Compilers Principles, Techniques, & Tools Taught by Jing Zhang
Advanced Computer Systems
Compiler Design (40-414) Main Text Book:
PRINCIPLES OF COMPILER DESIGN
Chapter 1 Introduction.
Introduction to Compiler Construction
CS 3304 Comparative Languages
A Simple Syntax-Directed Translator
Compiler Construction (CS-636)
SOFTWARE DESIGN AND ARCHITECTURE
Chapter 1 Introduction.
-by Nisarg Vasavada (Compiled*)
课程名 编译原理 Compiling Techniques
Chapter 1: Introduction to Compiling (Cont.)
Compiler Lecture 1 CS510.
CS 536 / Fall 2017 Introduction to programming languages and compilers
Compiler Construction
CS416 Compiler Design lec00-outline September 19, 2018
Introduction to Compiler Construction
Course supervisor: Lubna Siddiqui
Introduction CI612 Compiler Design CI612 Compiler Design.
Compiler Construction
Compiler 薛智文 TH 6 7 8, DTH Spring.
Compiler design.
CS416 Compiler Design lec00-outline February 23, 2019
Introduction to Compiler Construction
Compilers Principles, Techniques, & Tools Taught by Jing Zhang
Lec00-outline May 18, 2019 Compiler Design CS416 Compiler Design.
Compiler 薛智文 M 2 3 4, DTH Spring.
Introduction to Compiler Construction
Presentation transcript:

Compiler 薛智文 TH 6 7 8, DTH 102 cwhsueh@csie.ntu.edu.tw http://www.csie.ntu.edu.tw/~cwhsueh/ 96 Spring

00:21 /27

Why Study Compilers? Excellent software-engineering example --- theory meets practice. Essential software tool. Influences hardware design, e.g., RISC, VLIW. Tools (mostly “optimization”) for enhancing software reliability and security. 00:21 /27

Compilers & Architecture Modern architectures have very complex structures, especially opportunities for parallel execution. Sequential programs can only make effective use of these features via an optimizing compiler. Hardware question: If we implemented this, could a compiler use it? 00:21 /27

Software Reliability Optimization technology (data-flow analysis) used in: Lock/unlock errors. Buffers not range-checked. Memory Leaks. SQL injection bugs.. 00:21 /27

What this Course Offers? Compiler methodology for both compiler implementation and related applications. Theoretical framework. Key algorithms. Hands-on experience. Nongoal: build a complete optimizing compiler. 00:21 /27

Course Outline Part 1 --- Introduction. Part 2 --- Scanner. Part 3 --- Parser. Part 4 --- Syntax-Directed Translation. Part 5 --- Symbol Table. Part 6 --- Intermediate Code Generation. Part 7 --- Run Time Storage Organization. Part 8 --- Optimization. Part 9 --- How to Write a Compiler. Part 10 --- A Simple Code Generation (PSEUDO) Example. 00:21 /27

Introduction Compiler is one of language processors. input source program target program output 00:21 /27

What is a Compiler? Definitions: Compiler writing spans: History: The software system translates description of computations into a program executable by a computer. Source and target must be equivalent! Compiler writing spans: programming languages; machine architecture; language theory; algorithms and data structures; software engineering. History: 1950: the first FORTRAN compiler took 18 man-years; now: using software tools, can be done in a few months as a student’s project. input Compiler source program target program output 00:21 /27

An Interpreter input Interpreter source program output 00:21 /27

A Hybrid Compiler source program intermediate program Virtual Machine Translator intermediate program Virtual Machine output input 00:21 /27

A Language-Processing System source program Preprocessor modified source program Compiler target assembly program Assembler relocatable machine code Linker/Loader target machine code library files relocatable object files 00:21 /27

Applications Computer language compilers. Translator: from one format to another. query interpreter text formatter silicon compiler infix notation  postfix notation: pretty printers · · · Software productivity tools. 3 + 5 – 6 * 6 3 5 + 6 6 * – 00:21 /27

Relations with Computational Theory a set of grammar rules ≡ the definition of a particular machine. also equivalent to a set of languages recognized by this machine. a type of machines: a family of machines with a given set of operations, or capabilities; power of a type of machines ≡ the set of languages that can be recognized by this type of machines. 00:21 /27

A Language-Processing System source program Preprocessor modified source program Compiler target assembly program Assembler relocatable machine code Linker/Loader target machine code library files relocatable object files 00:21 /27

Phases of a Compiler character stream Lexical Analyzer (scanner) token stream Machine-Independent Code Optimizer optimized intermediate representation Code Generator relocatable machine code Machine-Dependent Code Optimizer target-machine code Semantic Analyzer annotated abstract-syntax tree Syntax Analyzer (parser) abstract-syntax tree Intermediate Code Generator intermediate representation Error Handling Symbol Table 00:21 /27

Lexical Analyzer (Scanner) Actions: Reads characters from the source program; Groups characters into lexemes , i.e., sequences of characters that “go together”, following a given pattern ; Each lexeme corresponds to a token . the scanner returns the next token, plus maybe some additional information, to the parser; The scanner may also discover lexical errors, i.e., erroneous characters. The definitions of what a lexeme, token or bad character is depend on the definition of the source language. 00:21 /27

Scanner Example for C Lexeme: C statement Symbol Table Lexeme: C statement position = initial + rate * 60; (Lexeme) position = initial + rate * 60 ; < id,1> <=> <id,2> <+> <id,3> <*> <60> <;> (Token) ID ASSIGN ID PLUS ID TIME INT SEMI-COL Arbitrary number of blanks (white spaces) between lexemes. Erroneous sequence of characters, that are not parts of comments, for the C language: control characters @ 2abc 1 position … 2 initial 3 rate 00:21 /27

Syntax Analyzer (Parser) Actions: Group tokens into grammatical phrases , to discover the underlying structure of the source Find syntax errors , e.g., the following C source line: (Lexeme) index = 12 * ; (Token) ID ASSIGN INT TIMES SEMI-COL Every token is legal, but the sequence is erroneous! May find some static semantic errors , e.g., use of undeclared variables or multiple declared variables. May generate code, or build some intermediate representation of the source program, such as an abstract-syntax tree. 00:21 /27

Parser Example for C Source code: position = initial + rate * 60 < id,1> <=> <id,2> <+> <id,3> <*> <60> Abstract-syntax tree: interior nodes of the tree are OPERATORS; a node’s children are its OPERANDS; each subtree forms a logical unit . the subtree with * at its root shows that * has higher precedence than +, the operation “rate * 60” must be performed as a unit, not “initial + rate”. Where is ”;”? = + * <id,3> < id,1> <id,2> 60 Symbol Table 1 position … 2 initial 3 rate 00:21 /27

Semantic Analyzer Actions: Check for more static semantic errors, e.g., type errors . May annotate and/or change the abstract syntax tree. Symbol Table 1 position … 2 initial 3 rate = + * <id,3> < id,1> <id,2> 60 = + * <id,3> < id,1> <id,2> int_to_float 60 00:21 /27

Intermediate Code Generator Actions: translate from abstract-syntax trees to intermediate codes. One choice for intermediate code is 3-address code : Each statement contains at most 3 operands; in addition to “=”, i.e., assignment, at most one operator. An ”easy” and “universal” format that can be translated into most assembly languages. = + * <id,3> < id,1> <id,2> int_to_float 60 t1 = int_to_float(60) t2 = id3 * t1 t3 = id2 + t2 id1 = t3 00:21 /27

Optimizer Improve the efficiency of intermediate code. Goal may be to make code run faster , and/or to use the least number of registers · · · Current trends: to obtain smaller, but maybe slower, equivalent code for embedded systems; to reduce power consumption. t1 = int_to_float(60) t2 = id3 * t1 t3 = id2 + t2 id1 = t3 t1 = id3 * 60.0 id1 = id2 + t1 00:21 /27

Code Generation A compiler may generate Example: Advantages: pure machine codes (machine dependent assembly language) directly, which is rare now ; virtual machine code. Example: PASCAL  compiler  P-code  interpreter  execution Speed is roughly 4 times slower than running directly generated machine codes. Advantages: simplify the job of a compiler; decrease the size of the generated code: 1/3 for P-code ; can be run easily on a variety of platforms P-machine is an ideal general machine whose interpreter can be written easily; divide and conquer; recent example: JAVA and Byte-code. 00:21 /27

Code Generation Example LDF R2, id3 MULF R2, R2, #60.0 LDF R1, id2 ADDF R1, R1, R2 STF id1, R1 t1 = id3 * 60.0 id1 = id2 + t1 00:21 /27

Practical Considerations (1/2) Preprocessing phase: macro substitution: #define MAXC 10 rational preprocessing: add new features for old languages. BASIC C  C ++ compiler directives: #include <stdio.h> non-standard language extensions. adding parallel primitives 00:21 /27

Practical Considerations (2/2) Passes of compiling First pass reads the text file once. May need to read the text one more time for any forward addressed objects, i.e., anything that is used before its declaration. Example: C language goto error_handling; · · · error_handling: 00:21 /27

Reduce Number of Passes Each pass takes I/O time. Back-patching : leave a blank slot for missing information, and fill in the empty slot when the information becomes available. Example: C language when a label is used if it is not defined before, save a trace into the to-be-processed table label name corresponds to LABEL TABLE[i] code generated: GOTO LABEL TABLE[i] when a label is defined check known labels for redefined labels if it is not used before, save a trace into the to-be-processed table if it is used before, then find its trace and fill the current address into the trace Time and space trade-off ! 00:21 /27