2005-11-11ELEC6200-001 Fall 05 1 Very- Long Instruction Word (VLIW) Computer Architecture Fan Wang Department of Electrical and Computer Engineering Auburn.

Slides:



Advertisements
Similar presentations
Instruction Level Parallelism
Advertisements

Instruction Set Design
CSCI 4717/5717 Computer Architecture
VLIW Very Large Instruction Word. Introduction Very Long Instruction Word is a concept for processing technology that dates back to the early 1980s. The.
RISC and Pipelining Prof. Sin-Min Lee Department of Computer Science.
CSE 490/590, Spring 2011 CSE 490/590 Computer Architecture VLIW Steve Ko Computer Sciences and Engineering University at Buffalo.
POLITECNICO DI MILANO Parallelism in wonderland: are you ready to see how deep the rabbit hole goes? ILP: VLIW Architectures Marco D. Santambrogio:
CS252 Graduate Computer Architecture Spring 2014 Lecture 9: VLIW Architectures Krste Asanovic
Tuan Tran. What is CISC? CISC stands for Complex Instruction Set Computer. CISC are chips that are easy to program and which make efficient use of memory.
Microprocessors VLIW Very Long Instruction Word Computing April 18th, 2002.
Chapter 6 สถาปัตยกรรมไมโครโพรเซสเซอร์แบบต่างๆ Processor Architectures
Processor Technology and Architecture
Chapter XI Reduced Instruction Set Computing (RISC) CS 147 Li-Chuan Fang.
Microprocessors Introduction to ia64 Architecture Jan 31st, 2002 General Principles.
State Machines Timing Computer Bus Computer Performance Instruction Set Architectures RISC / CISC Machines.
RISC. Rational Behind RISC Few of the complex instructions were used –data movement – 45% –ALU ops – 25% –branching – 30% Cheaper memory VLSI technology.
Chapter 2: Impact of Machine Architectures What is the Relationship Between Programs, Programming Languages, and Computers.
11/11/05ELEC CISC (Complex Instruction Set Computer) Veeraraghavan Ramamurthy ELEC 6200 Computer Architecture and Design Fall 2005.
Seqeuential Logic State Machines Memory
Prince Sultan College For Woman
Reduced Instruction Set Computers (RISC) Computer Organization and Architecture.
Cisc Complex Instruction Set Computing By Christopher Wong 1.
COMPUTER ORGANIZATIONS CSNB123 May 2014Systems and Networking1.
Computer Organization and Architecture Reduced Instruction Set Computers (RISC) Chapter 13.
CH13 Reduced Instruction Set Computers {Make hardware Simpler, but quicker} Key features  Large number of general purpose registers  Use of compiler.
Basics and Architectures
Very Long Instruction Word (VLIW) Architecture. VLIW Machine It consists of many functional units connected to a large central register file Each functional.
TECH 6 VLIW Architectures {Very Long Instruction Word}
Previously Fetch execute cycle Pipelining and others forms of parallelism Basic architecture This week we going to consider further some of the principles.
What have mr aldred’s dirty clothes got to do with the cpu
RISC By Ryan Aldana. Agenda Brief Overview of RISC and CISC Features of RISC Instruction Pipeline Register Windowing and renaming Data Conflicts Branch.
Spring 2003CSE P5481 VLIW Processors VLIW (“very long instruction word”) processors instructions are scheduled by the compiler a fixed number of operations.
Ramesh.B ELEC 6200 Computer Architecture & Design Fall /29/20081Computer Architecture & Design.
Chapter 8 CPU and Memory: Design, Implementation, and Enhancement The Architecture of Computer Hardware and Systems Software: An Information Technology.
RISC architecture and instruction Level Parallelism (ILP) based on “Computer Architecture: a Quantitative Approach” by Hennessy and Patterson, Morgan Kaufmann.
Super computers Parallel Processing By Lecturer: Aisha Dawood.
Advanced Processor Technology Architectural families of modern computers are CISC RISC Superscalar VLIW Super pipelined Vector processors Symbolic processors.
CS5222 Advanced Computer Architecture Part 3: VLIW Architecture
Ted Pedersen – CS 3011 – Chapter 10 1 A brief history of computer architectures CISC – complex instruction set computing –Intel x86, VAX –Evolved from.
RISC and CISC. What is CISC? CISC is an acronym for Complex Instruction Set Computer and are chips that are easy to program and which make efficient use.
Operand Addressing And Instruction Representation Cs355-Chapter 6.
ECEG-3202 Computer Architecture and Organization Chapter 7 Reduced Instruction Set Computers.
Reduced Instruction Set Computers. Major Advances in Computers(1) The family concept —IBM System/ —DEC PDP-8 —Separates architecture from implementation.
E X C E E D I N G E X P E C T A T I O N S VLIW-RISC CSIS Parallel Architectures and Algorithms Dr. Hoganson Kennesaw State University Instruction.
Pipelining and Parallelism Mark Staveley
CISC and RISC 12/25/ What is CISC? acronym for Complex Instruction Set Computer Chips that are easy to program and which make efficient use of memory.
EKT303/4 Superscalar vs Super-pipelined.
COMPUTER ORGANIZATIONS CSNB123 NSMS2013 Ver.1Systems and Networking1.
EECS 322 March 18, 2000 RISC - Reduced Instruction Set Computer Reduced Instruction Set Computer  By reducing the number of instructions that a processor.
Reduced Instruction Set Computing Ammi Blankrot April 26, 2011 (RISC)
Spring 2016, Jan 13 ELEC / Lecture 1 1 ELEC / Computer Architecture and Design Spring 2016 Introduction Vishwani D. Agrawal.
3/12/2013Computer Engg, IIT(BHU)1 CONCEPTS-1. Pipelining Pipelining is used to increase the speed of processing It uses temporal parallelism In pipelining,
RISC / CISC Architecture by Derek Ng. Overview CISC Architecture RISC Architecture  Pipelining RISC vs CISC.
Jan. 5, 2000Systems Architecture II1 Machine Organization (CS 570) Lecture 1: Overview of High Performance Processors * Jeremy R. Johnson Wed. Sept. 27,
CISC. What is it?  CISC - Complex Instruction Set Computer  CISC is a design philosophy that:  1) uses microcode instruction sets  2) uses larger.
Chapter 11 System Performance Enhancement. Basic Operation of a Computer l Program is loaded into memory l Instruction is fetched from memory l Operands.
Topics to be covered Instruction Execution Characteristics
Advanced Architectures
Chapter 14 Instruction Level Parallelism and Superscalar Processors
Scalable Processor Design
CISC (Complex Instruction Set Computer)
Superscalar Processors & VLIW Processors
CISC AND RISC SYSTEM Based on instruction set, we broadly classify Computer/microprocessor/microcontroller into CISC and RISC. CISC SYSTEM: COMPLEX INSTRUCTION.
Chapter 12 Pipelining and RISC
Design of Digital Circuits Lecture 19a: VLIW
William Stallings Computer Organization and Architecture
Presentation transcript:

ELEC Fall 05 1 Very- Long Instruction Word (VLIW) Computer Architecture Fan Wang Department of Electrical and Computer Engineering Auburn University, USA

ELEC Fall 05 2 Background CISC (Complex Instruction Set Computing) instructions are quite complex and have variable length. a relatively small number of registers, and are capable of accessing memory locations directly. Complex instructions are sequenced in microcode in modern CISC processors.

ELEC Fall 05 3 Cont. RISC(Reduced Instruction Set Computing) instructions are of fixed length and of a regular format. Operations are performed on registers only, of which a larger number is available than on CISC processors. The only memory operations are load and store. The hardware in RISC processors is simpler because the RISC architecture relies more on the compiler for sequencing complex operations.

ELEC Fall 05 4 The method for exploiting parallelism The key to higher performance in microprocessors for a broad range of applications is the ability to exploit fine-grain, instruction-level parallelism: + pipelining + multiple processors + superscalar implementation + specifying multiple independent operations per instruction

ELEC Fall 05 5 Problems we meet it is not easy to exploit parallel execution in real programs, which are written in a serial fashion. Mainstream high-level languages (C and FORTRAN) allow a limited freedom to execute operations in parallel. Programs need to be compiled into machine code, but most conventional instruction sets do not allow for the indication of parallel execution.

ELEC Fall 05 6 VLIW was invented The idea of VLIW has been considered the work on trace scheduling, a method of compiling programs written in conventional languages for wide- word machines, done by Josh Fisher in 1979 at Yale laid down the foundation for VLIW technology. Now John Fisher leads HP’s VLIW compiler project. VLIW Pioneer: HP Senior Fellow Josh Fisher beside his MultiFlow Trace VLIW machine, on display at Computer History Museum.

ELEC Fall 05 7 Why VLIW ? To overcome the difficulty of finding parallelism in machine-level object code. In a VLIW processor, multiple instructions are packed together and issued in parallel to an equal number of execution units. The compiler (not the processor) checks that there are only independent instructions executed in parallel.

ELEC Fall 05 8 Comparison of VLIW, CISC,RISC

ELEC Fall 05 9 VLIW characteristics VLIW contains multiple primitive instructions that can be executed in parallel by functional units of a processor. The compiler packs a number of primitive, non-interdependent instructions into a very long instruction word Since multiple instructions are packed in one instruction word, the instruction words are much larger than CISC and RISC’s.

ELEC Fall The VLIW compiler The compiler specifies the primitive instructions per VLIW instruction word. The compiler must guarantee that the multiple primitive instructions which group together are independent so they can be executable in parallel. Only the sequence of different VLIW words affects the outputs (e.g., blue, red, green).

ELEC Fall VLIW principle

ELEC Fall VLIW principles 1.The compiler analyzes dependence of all instructions among sequential code, tries to extract as much parallelism as possible. 2.Based on the analysis, the compiler re-codes the piece of sequential code in VLIW instruction words. 3.Finally, the work left with VLIW hardware is only fetch the VLIWs from cache, decode them, and then dispatch the independent primitive instructions to corresponding function units and execute.

ELEC Fall Implementation To get commercial success, Itanium was invented instead of general purpose VLIW processor A hypothetical VLIW processor architecture was invented Instead of particular implementation

ELEC Fall Generating of VLIW instruction words A hypothetical VLIW processor architecture

ELEC Fall One VLIW instruction word contains maximum 8 primitive instructions. 2. Each time, one VLIW instruction word is fetched from cache and decoded. 3. After decoding, all primitive instructions in this VLIW word are issued to functional units in parallel for execution. 4. These primitive instructions are from the same VLIW word, so they are guaranteed to be independent.

ELEC Fall SOFTWARE INSTEAD OF HARDWARE: IMPLEMENTATION ADVANTAGES OF VLIW VLIW instructions explicitly specify several independent operations— decode the instruction and dispatch hardware that tries to reconstruct parallelism from a serial instruction stream. The processor does not need to consider whether or not the instructions are parallel.

ELEC Fall Conclusion 1. The highly parallel implementation is much simpler and cheaper than its counterparts. 2. The encoding of VLIW words implies parallelism among their primitive instructions, which results in reduced hardware complexity. 3. The complier must assemble multiple primitive instructions into a single VLIW, to make sure that multiple function units are kept busy.

ELEC Fall Conclusion( cont.) 4. The compiler optimizes software pipeline; by re-ordering tries to find the most parallelism in the sequential code. 5. The microprocessor performance is dependent on how the compiler produces VLIW words.

ELEC Fall Relevant areas: Trace Scheduling Algorithm, Dynamic Scheduling Explicitly Parallel Instruction Computing (EPIC) Dynamically Architected Instruction Set from Yorktown (DAISY) VLIW in Embedded Systems

ELEC Fall References at_download/other/vliw-wp.pdf at_download/other/vliw-wp.pdf VLIW.pdf VLIW.pdf 86_vliw/old/ 86_vliw/old/

ELEC Fall Thanks !