UQ: Explain in brief integer instruction pipeline stages of Pentium

Slides:



Advertisements
Similar presentations
Lecture 4: CPU Performance
Advertisements

1/1/ / faculty of Electrical Engineering eindhoven university of technology Speeding it up Part 3: Out-Of-Order and SuperScalar execution dr.ir. A.C. Verschueren.
EZ-COURSEWARE State-of-the-Art Teaching Tools From AMS Teaching Tomorrow’s Technology Today.
ISA Issues; Performance Considerations. Testing / System Verilog: ECE385.
COMP25212 Further Pipeline Issues. Cray 1 COMP25212 Designed in 1976 Cost $8,800,000 8MB Main Memory Max performance 160 MFLOPS Weight 5.5 Tons Power.
COMP 4211 Seminar Presentation Based On: Computer Architecture A Quantitative Approach by Hennessey and Patterson Presenter : Feri Danes.
OMSE 510: Computing Foundations 4: The CPU!
ELEN 468 Advanced Logic Design
CS252/Patterson Lec 1.1 1/17/01 Pipelining: Its Natural! Laundry Example Ann, Brian, Cathy, Dave each have one load of clothes to wash, dry, and fold Washer.
Instruction-Level Parallelism (ILP)
Computer Organization and Architecture
Chapter 12 Pipelining Strategies Performance Hazards.
Computer ArchitectureFall 2007 © October 24nd, 2007 Majd F. Sakr CS-447– Computer Architecture.
L17 – Pipeline Issues 1 Comp 411 – Fall /1308 CPU Pipelining Issues Finishing up Chapter 6 This pipe stuff makes my head hurt! What have you been.
Computer Architecture Pipelines Diagrams are from Computer Architecture: A Quantitative Approach, 2nd, Hennessy and Patterson.
Chapter 12 CPU Structure and Function. Example Register Organizations.
DLX Instruction Format
Midterm Thursday let the slides be your guide Topics: First Exam - definitely cache,.. Hamming Code External Memory & Buses - Interrupts, DMA & Channels,
7/2/ _23 1 Pipelining ECE-445 Computer Organization Dr. Ron Hayne Electrical and Computer Engineering.
Computer ArchitectureFall 2008 © October 6th, 2008 Majd F. Sakr CS-447– Computer Architecture.
ENGS 116 Lecture 51 Pipelining and Hazards Vincent H. Berk September 30, 2005 Reading for today: Chapter A.1 – A.3, article: Patterson&Ditzel Reading for.
Lect 13-1 Lect 13: and Pentium. Lect Microprocessor Family  Microprocessor  Introduced in 1989  High Integration  On-chip 8K.
SUPERSCALAR EXECUTION. two-way superscalar The DLW-2 has two ALUs, so it’s able to execute two arithmetic instructions in parallel (hence the term two-way.
COMPUTER ORGANIZATIONS CSNB123 May 2014Systems and Networking1.
Basic Microcomputer Design. Inside the CPU Registers – storage locations Control Unit (CU) – coordinates the sequencing of steps involved in executing.
Part 2. Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved A five-level memory.
© 2009, Renesas Technology America, Inc., All Rights Reserved 1 Course Introduction  Purpose:  This course provides an overview of the SH-2 32-bit RISC.
CSC 4250 Computer Architectures September 15, 2006 Appendix A. Pipelining.
Lecture 7: Pipelining Review Kai Bu
Computer Architecture Pipelines & Superscalars Sunset over the Pacific Ocean Taken from Iolanthe II about 100nm north of Cape Reanga.
CSE 340 Computer Architecture Summer 2014 Basic MIPS Pipelining Review.
CS.305 Computer Architecture Enhancing Performance with Pipelining Adapted from Computer Organization and Design, Patterson & Hennessy, © 2005, and from.
The Intel 86 Family of Processors
P5 Micro architecture : Intel’s Fifth generation
CSIE30300 Computer Architecture Unit 04: Basic MIPS Pipelining Hsin-Chou Chi [Adapted from material by and
Superscalar - summary Superscalar machines have multiple functional units (FUs) eg 2 x integer ALU, 1 x FPU, 1 x branch, 1 x load/store Requires complex.
Pentium Architecture Arithmetic/Logic Units (ALUs) : – There are two parallel integer instruction pipelines: u-pipeline and v-pipeline – The u-pipeline.
Introduction  The speed of execution of program is influenced by many factors. i) One way is to build faster circuit technology to build the processor.
Oct. 18, 2000Machine Organization1 Machine Organization (CS 570) Lecture 4: Pipelining * Jeremy R. Johnson Wed. Oct. 18, 2000 *This lecture was derived.
COMPUTER ORGANIZATIONS CSNB123 NSMS2013 Ver.1Systems and Networking1.
System Hardware FPU – Floating Point Unit –Handles floating point and extended integer calculations 8284/82C284 Clock Generator (clock) –Synchronizes the.
3/12/2013Computer Engg, IIT(BHU)1 CONCEPTS-1. Pipelining Pipelining is used to increase the speed of processing It uses temporal parallelism In pipelining,
COMPSYS 304 Computer Architecture Speculation & Branching Morning visitors - Paradise Bay, Bay of Islands.
Lecture 9. MIPS Processor Design – Pipelined Processor Design #1 Prof. Taeweon Suh Computer Science Education Korea University 2010 R&E Computer System.
RISC / CISC Architecture by Derek Ng. Overview CISC Architecture RISC Architecture  Pipelining RISC vs CISC.
LECTURE 10 Pipelining: Advanced ILP. EXCEPTIONS An exception, or interrupt, is an event other than regular transfers of control (branches, jumps, calls,
L17 – Pipeline Issues 1 Comp 411 – Fall /23/09 CPU Pipelining Issues Read Chapter This pipe stuff makes my head hurt! What have you been.
RISC Pipelining CS 147 Spring 2011 Kui Cheung
Protection in Virtual Mode
Computer Organization
CS 286 Computer Architecture & Organization
RISC Pipelining RISC Pipelining CS 147 Spring 2011 Kui Cheung.
William Stallings Computer Organization and Architecture 8th Edition
CSCI206 - Computer Organization & Programming
ELEN 468 Advanced Logic Design
PowerPC 604 Superscalar Microprocessor
Pipelining.
Introduction to Pentium Processor
Pipelining: Advanced ILP
CS 704 Advanced Computer Architecture
CSCI206 - Computer Organization & Programming
\course\cpeg323-05F\Topic6b-323
The Processor Lecture 3.6: Control Hazards
An Introduction to pipelining
Data Dependence Distances
Pipelining.
Pipelining.
Presentation transcript:

UQ: Explain in brief integer instruction pipeline stages of Pentium Integer Pipeline UQ: Explain in brief integer instruction pipeline stages of Pentium

Integer Pipeline The pipelines are called “u” and “v” pipes. The u-pipe can execute any instruction, while the v-pipe can execute “simple” instructions as defined in the “Instruction Pairing Rules”. When instructions are paired, the instruction issued to the v-pipe is always the next sequential instruction after the one issued to u-pipe.

Integer Pipeline The integer pipeline stages are as follows: Prefetch(PF) : Instructions are prefetched from the on-chip instruction cache Decode1(D1): Two parallel decoders attempt to decode and issue the next two sequential instructions It decodes the instruction to generate a control word

Integer Pipeline 3. Decode2(D2): A single control word causes direct execution of an instruction Complex instructions require microcoded control sequencing 3. Decode2(D2): Decodes the control word Address of memory resident operands are calculated

Integer Pipeline Execute (EX): Writeback(WB): The instruction is executed in ALU Data cache is accessed at this stage For both ALU and data cache access requires more than one clock. Writeback(WB): The CPU stores the result and updates the flags

Integer Instruction Pairing Rules UQ: Explain the instruction pairing rules for Pentium

Integer Instruction Pairing Rules To issue two instructions simultaneously they must satisfy the following conditions: Both instructions in the pair must be “simple”. There must be no read-after-write(RAW) or write-after-write register(WAW) dependencies RAW: i1. R2  R1 + R3 i2. R4  R2 + R3 WAW: i1. R2  R4 + R7 i2. R2  R1 + R3

Integer Instruction Pairing Rules Neither instruction may contain both a displacement and an immediate Instruction with prefixes (lock,repne) can only occur in the u-pipe

Simple Instructions They are entirely hardwired They do not require any microcode control Executes in one clock cycle Exception: ALU mem,reg and ALU reg,mem are 3 and 2 clock operations respectively

The following integer instructions are considered simple and may be paired: 1. mov reg, reg/mem/imm 2. mov mem, reg/imm 3. alu reg, reg/mem/imm 4. alu mem, reg/imm 5. inc reg/mem 6. dec reg/mem 7. push reg/mem 8. pop reg 9. lea reg,mem 10. jmp/call/jcc near 11. nop 12. test reg, reg/mem 13. test acc, imm

UQ: List the steps in instruction issue algorithm

Instruction Issue Algorithm Decode the two consecutive instructions I1 and I2 If the following are all true I1 and I2 are simple instructions I1 is not a jump instruction Destination of I1 is not a source of I2 Destination of I1 is not a destination of I2 Then issue I1 to u pipeline and I2 to v pipeline Else issue I1 to u pipeline

Floating Point Pipeline UQ: Explain the floating point pipeline stages.

Floating-Point Pipeline The floating point pipeline has 8 stages as follows: Prefetch(PF) : Instructions are prefetched from the on-chip instruction cache Instruction Decode(D1): Two parallel decoders attempt to decode and issue the next two sequential instructions It decodes the instruction to generate a control word

Floating-Point Pipeline A single control word causes direct execution of an instruction Complex instructions require microcoded control sequencing 3. Address Generate (D2): Decodes the control word Address of memory resident operands are calculated

Floating-Point Pipeline Memory and Register Read (Execution Stage) (EX): Register read, memory read or memory write performed as required by the instruction to access an operand. Floating Point Execution Stage 1(X1): Information from register or memory is written into FP register. Data is converted to floating point format before being loaded into the floating point unit

Floating-Point Pipeline Floating Point Execution Stage 2(X2): Floating point operation performed within floating point unit. Write FP Result (WF): Floating point results are rounded and the result is written to the target floating point register. Error Reporting(ER) If an error is detected, an error reporting stage is entered where the error is reported and FPU status word is updated

Instruction Issue for Floating Point Unit The rules of how floating-point (FP) instructions get issued on the Pentium processor are : FP instructions do not get paired with integer instructions. When a pair of FP instructions is issued to the FPU, only the FXCH instruction can be the second instruction of the pair. The first instruction of the pair must be one of a set F where F = [ FLD,FADD, FSUB, FMUL, FDIV, FCOM, FUCOM, FTST, FABS, FCHS]. FP instructions other than FXCH and instructions belonging to set F, always get issued singly to the FPU. FP instructions that are not directly followed by an FXCH instruction are issued singly to the FPU.