1 Acknowledgements Class notes based upon Patterson & Hennessy: Book & Lecture Notes Patterson’s 1997 course notes (U.C. Berkeley CS 152, 1997) Tom Fountain.

Slides:



Advertisements
Similar presentations
TU/e Processor Design 5Z032 1 Processor Design 5Z032 The role of Performance Henk Corporaal Eindhoven University of Technology 2009.
Advertisements

100 Performance ENGR 3410 – Computer Architecture Mark L. Chang Fall 2006.
CSCE 212 Chapter 4: Assessing and Understanding Performance Instructor: Jason D. Bakos.
1 Introduction Rapidly changing field: –vacuum tube -> transistor -> IC -> VLSI (see section 1.4) –doubling every 1.5 years: memory capacity processor.
Chapter 4 Assessing and Understanding Performance Bo Cheng.
1 CSE SUNY New Paltz Chapter 2 Performance and Its Measurement.
Performance D. A. Patterson and J. L. Hennessey, Computer Organization & Design: The Hardware Software Interface, Morgan Kauffman, second edition 1998.
Computer ArchitectureFall 2007 © September 17, 2007 Karem Sakallah CS-447– Computer Architecture.
1  1998 Morgan Kaufmann Publishers and UCB Performance CEG3420 Computer Design Lecture 3.
Assessing and Understanding Performance B. Ramamurthy Chapter 4.
EET 4250: Chapter 1 Performance Measurement, Instruction Count & CPI Acknowledgements: Some slides and lecture notes for this course adapted from Prof.
Chapter 4 Assessing and Understanding Performance
Fall 2001CS 4471 Chapter 2: Performance CS 447 Jason Bakos.
Lecture 3: Computer Performance
1 Chapter 4. 2 Measure, Report, and Summarize Make intelligent choices See through the marketing hype Key to understanding underlying organizational motivation.
1 ECE3055 Computer Architecture and Operating Systems Lecture 2 Performance Prof. Hsien-Hsin Sean Lee School of Electrical and Computer Engineering Georgia.
Computer Organization and Design Performance Montek Singh Mon, April 4, 2011 Lecture 13.
1 Computer Performance: Metrics, Measurement, & Evaluation.
Lecture 2: Computer Performance
Memory/Storage Architecture Lab Computer Architecture Performance.
B0111 Performance Anxiety ENGR xD52 Eric VanWyk Fall 2012.
10/19/2015Erkay Savas1 Performance Computer Architecture – CS401 Erkay Savas Sabanci University.
1 CS/EE 362 Hardware Fundamentals Lecture 9 (Chapter 2: Hennessy and Patterson) Winter Quarter 1998 Chris Myers.
Performance.
Computer Organization and Architecture Tutorial 1 Kenneth Lee.
Computer Performance Computer Engineering Department.
1 CS465 Performance Revisited (Chapter 1) Be able to compare performance of simple system configurations and understand the performance implications of.
1 Seoul National University Performance. 2 Performance Example Seoul National University Sonata Boeing 727 Speed 100 km/h 1000km/h Seoul to Pusan 10 hours.
CEN 316 Computer Organization and Design Assessing and Understanding Performance Mansour AL Zuair.
Morgan Kaufmann Publishers
CSE2021: Computer Organization Instructor: Dr. Amir Asif Department of Computer Science York University Handout # 2: Measuring Performance Topics: 1. Performance:
1  1998 Morgan Kaufmann Publishers How to measure, report, and summarize performance (suorituskyky, tehokkuus)? What factors determine the performance.
Performance Performance
TEST 1 – Tuesday March 3 Lectures 1 - 8, Ch 1,2 HW Due Feb 24 –1.4.1 p.60 –1.4.4 p.60 –1.4.6 p.60 –1.5.2 p –1.5.4 p.61 –1.5.5 p.61.
September 10 Performance Read 3.1 through 3.4 for Wednesday Only 3 classes before 1 st Exam!
Performance – Last Lecture Bottom line performance measure is time Performance A = 1/Execution Time A Comparing Performance N = Performance A / Performance.
4. Performance 4.1 Introduction 4.2 CPU Performance and Its Factors
Lecture 5: 9/10/2002CS170 Fall CS170 Computer Organization and Architecture I Ayman Abdel-Hamid Department of Computer Science Old Dominion University.
Chapter 4. Measure, Report, and Summarize Make intelligent choices See through the marketing hype Understanding underlying organizational aspects Why.
Lec2.1 Computer Architecture Chapter 2 The Role of Performance.
L12 – Performance 1 Comp 411 Computer Performance He said, to speed things up we need to squeeze the clock Study
EGRE 426 Computer Organization and Design Chapter 4.
Performance Computer Organization II 1 Computer Science Dept Va Tech January 2009 © McQuain & Ribbens Defining Performance Which airplane has.
Jan. 5, 2000Systems Architecture II1 Machine Organization (CS 570) Lecture 2: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed. Oct. 4,
COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Yaohang Li.
Computer Architecture CSE 3322 Web Site crystal.uta.edu/~jpatters/cse3322 Send to Pramod Kumar, with the names and s.
B0111 Performance Anxiety ENGR xD52 Eric VanWyk Fall 2012.
B0111 Growth and Performance ENGR xD52 Eric VanWyk Fall 2014.
COD Ch. 1 Introduction + The Role of Performance.
June 20, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 1: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed.
BITS Pilani, Pilani Campus Today’s Agenda Role of Performance.
CPEN Digital System Design Assessing and Understanding CPU Performance © Logic and Computer Design Fundamentals, 4 rd Ed., Mano Prentice Hall © Computer.
Performance. Moore's Law Moore's Law Related Curves.
Two notions of performance
Computer Organization
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
September 2 Performance Read 3.1 through 3.4 for Tuesday
How do we evaluate computer architectures?
Defining Performance Which airplane has the best performance?
Prof. Hsien-Hsin Sean Lee
CSCE 212 Chapter 4: Assessing and Understanding Performance
CS2100 Computer Organisation
Computer Performance He said, to speed things up we need to squeeze the clock.
CMSC 611: Advanced Computer Architecture
CMSC 611: Advanced Computer Architecture
Computer Performance Read Chapter 4
Performance.
Chapter 2: Performance CS 447 Jason Bakos Fall 2001 CS 447.
Computer Organization and Design Chapter 4
CS2100 Computer Organisation
Presentation transcript:

1 Acknowledgements Class notes based upon Patterson & Hennessy: Book & Lecture Notes Patterson’s 1997 course notes (U.C. Berkeley CS 152, 1997) Tom Fountain 2000 course notes (Stanford EE182) Michael Wahl 2000 lecture notes (U. of Siegen CS 3339) Ben Dugan 2001 lecture notes (UW-CSE 378) Professor Scott Hauck lecture notes (UW EE 471)

2 Why are you here?

3 What is Computer Architecture?

4 What things are important when buying a computer? (What features do you look for when buying one?)

5 Computer “Performance” MIPS (Million Instructions Per Second) vs. MHz (Million Cycles Per Second) Throughput (jobs/seconds) vs. Latency (time to complete a job) Measuring, Metrics, Evaluation – what is “best”? The PowerBook G4 outguns Pentium III-based notebooks by up to 30 percent.* * Based on Adobe Photoshop tests comparing a 500MHz PowerBook G4 to 850MHz Pentium III-based portable computers 3.09 GHz Pentium 4 Hyper Pipelined Technology

6 Performance Example: Planes Which is the “best” plane? Which gets one passenger to the destination first? Which moves the most passengers? Which goes the furthest? Which is the “speediest” plane (between Seattle and NY for example)? Latency: how fast is one person moved? Throughput: number of people per time moved? AirplanePassenger Capacity Cruising Range (miles) Cruising Speed (mph) Passenger Throughput (passengermile/hour) Boeing ,750 Boeing ,700 Concorde ,200 Douglas DC ,424

7 Computer Performance Primary goal: execution time (time from program start to program completion) To compare machines, we say “X is n times faster than Y” Example: Machine Orange and Grape run a program Orange takes 5 seconds, Grape takes 10 seconds Orange is _____ times faster than Grape

8 Execution Time Elapsed Time counts everything (disk and memory accesses, I/O, etc.) a useful number, but often not good for comparison purposes CPU time doesn't count I/O or time spent running other programs can be broken up into system time, and user time Example: Unix “time” command fpga.olin.edu> time javac CircuitViewer.java 3.370u 0.570s 0: % Our focus: user CPU time time spent executing the lines of code that are "in" our program

9 CPU Time Application example: A program takes 10 seconds on computer Orange, with a 400MHz clock. Our design team is developing a machine Grape with a much higher clock rate, but it will require 1.2 times as many clock cycles. If we want to be able to run the program in 6 second, how fast must the clock rate be? CPU execution time for a program CPU clock cycles for a program Clock period=* CPU execution time for a program CPU clock cycles for a program 1 Clock rate =*

10 CPI How do the # of instructions in a program relate to the execution time? CPU clock cycles for a program Instructions for a program Average Clock Cycles per Instruction (CPI) =* CPU execution time for a program Instructions for a program =* 1 Clock rate *CPI

11 CPI Example Suppose we have two implementations of the same instruction set (ISA). For some program Machine A has a clock cycle time of 10 ns. and a CPI of 2.0 Machine B has a clock cycle time of 20 ns. and a CPI of 1.2 What machine is faster for this program, and by how much?

12 Computing CPI Different types of instructions can take very different amounts of cycles Memory accesses, integer math, floating point, control flow Instruction TypeType CyclesType FrequencyCycles * Freq ALU150% Load520% Store310% Branch220% CPI:

13 CPI & Processor Tradeoffs How much faster would the machine be if: 1.A data cache reduced the average load time to 2 cycles? 2.Branch prediction shaved a cycle off the branch time? 3.Two ALU instructions could be executed at once? Instruction TypeType CyclesType Frequency ALU150% Load520% Store310% Branch220%

14 Warning 1: Amdahl’s Law The impact of a performance improvement is limited by what is NOT improved: Example: Assume a program runs in 100 seconds on a machine, with multiply responsible for 80 seconds of this time. How much do we have to speed up multiply to make the program run 4 times faster? 5 times faster? Execution time after improvement Execution time of unaffected =* 1 Amount of improvement + Execution time affected

15 Warning 2: MIPs, MHz  Performance Higher MHz (clock rate) doesn’t always mean better CPU Orange computer: 1000 MHz, CPI: 2.5, 1 billion instruction program Grape computer: 500MHz, CPI: 1.1, 1 billion instruction program Higher MIPs (million instructions per second) doesn’t always mean better CPU 1 MHz machine, with two different compilers Compiler A on program X: 10M ALU, 1M Load Compiler B on program X: 5M ALU, 1M Load Execution Time: A ____ B ____ MIPS: A ____ B ____ Instruction TypeType Cycles ALU1 Load5 Store3 Branch2

16 Processor Performance Summary Machine performance: Better performance: _____ number of instructions to implement computations _____ CPI _____ Clock rate Improving performance must balance each constraint Example: RISC vs. CISC CPU execution time for a program Instructions for a program =* 1 Clock rate *CPI