Performance. Moore's Law Moore's Law Related Curves.

Slides:



Advertisements
Similar presentations
CDA 3101 Fall 2010 Discussion Section 08 CPU Performance
Advertisements

Computer Organization Lab 1 Soufiane berouel. Formulas to Remember CPU Time = CPU Clock Cycles x Clock Cycle Time CPU Clock Cycles = Instruction Count.
Performance Evaluation of Architectures Vittorio Zaccaria.
TU/e Processor Design 5Z032 1 Processor Design 5Z032 The role of Performance Henk Corporaal Eindhoven University of Technology 2009.
Measuring Performance
100 Performance ENGR 3410 – Computer Architecture Mark L. Chang Fall 2006.
Computer Organization and Architecture 18 th March, 2008.
Chapter 1 CSF 2009 Computer Performance. Defining Performance Which airplane has the best performance? Chapter 1 — Computer Abstractions and Technology.
CSCE 212 Chapter 4: Assessing and Understanding Performance Instructor: Jason D. Bakos.
Performance D. A. Patterson and J. L. Hennessey, Computer Organization & Design: The Hardware Software Interface, Morgan Kauffman, second edition 1998.
9/16/2004Comp 120 Fall September 16 Assignment 4 due date pushed back to 23 rd, better start anywayAssignment 4 due date pushed back to 23 rd, better.
Computer Architecture Lecture 2 Instruction Set Principles.
Chapter 4 Assessing and Understanding Performance
Fall 2001CS 4471 Chapter 2: Performance CS 447 Jason Bakos.
Amdahl's Law.
1 Chapter 4. 2 Measure, Report, and Summarize Make intelligent choices See through the marketing hype Key to understanding underlying organizational motivation.
1 ECE3055 Computer Architecture and Operating Systems Lecture 2 Performance Prof. Hsien-Hsin Sean Lee School of Electrical and Computer Engineering Georgia.
Computer Organization and Design Performance Montek Singh Mon, April 4, 2011 Lecture 13.
1 Computer Performance: Metrics, Measurement, & Evaluation.
Lecture 2: Computer Performance
Lecture 1: Performance EEN 312: Processors: Hardware, Software, and Interfacing Department of Electrical and Computer Engineering Spring 2013, Dr. Rozier.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
B0111 Performance Anxiety ENGR xD52 Eric VanWyk Fall 2012.
CDA 3101 Fall 2013 Introduction to Computer Organization Computer Performance 28 August 2013.
10/19/2015Erkay Savas1 Performance Computer Architecture – CS401 Erkay Savas Sabanci University.
1 CS/EE 362 Hardware Fundamentals Lecture 9 (Chapter 2: Hennessy and Patterson) Winter Quarter 1998 Chris Myers.
1 Acknowledgements Class notes based upon Patterson & Hennessy: Book & Lecture Notes Patterson’s 1997 course notes (U.C. Berkeley CS 152, 1997) Tom Fountain.
Performance.
Computer Organization and Architecture Tutorial 1 Kenneth Lee.
Chapter 1 Performance & Technology Trends Read Sections 1.5, 1.6, and 1.8.
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
Chapter 1 Technology Trends and Performance. Chapter 1 — Computer Abstractions and Technology — 2 Technology Trends Electronics technology continues to.
CPU Performance using Different Parameters CS 250: Andrei D. Coronel, MS,CEH,PhD Cand.
Lecture2: Performance Metrics Computer Architecture By Dr.Hadi Hassan 1/3/2016Dr. Hadi Hassan Computer Architecture 1.
1  1998 Morgan Kaufmann Publishers How to measure, report, and summarize performance (suorituskyky, tehokkuus)? What factors determine the performance.
Performance Performance
TEST 1 – Tuesday March 3 Lectures 1 - 8, Ch 1,2 HW Due Feb 24 –1.4.1 p.60 –1.4.4 p.60 –1.4.6 p.60 –1.5.2 p –1.5.4 p.61 –1.5.5 p.61.
September 10 Performance Read 3.1 through 3.4 for Wednesday Only 3 classes before 1 st Exam!
Performance – Last Lecture Bottom line performance measure is time Performance A = 1/Execution Time A Comparing Performance N = Performance A / Performance.
Chapter 4. Measure, Report, and Summarize Make intelligent choices See through the marketing hype Understanding underlying organizational aspects Why.
Lec2.1 Computer Architecture Chapter 2 The Role of Performance.
L12 – Performance 1 Comp 411 Computer Performance He said, to speed things up we need to squeeze the clock Study
Performance COE 301 Computer Organization Dr. Muhamed Mudawar College of Computer Sciences and Engineering King Fahd University of Petroleum and Minerals.
EGRE 426 Computer Organization and Design Chapter 4.
Performance Computer Organization II 1 Computer Science Dept Va Tech January 2009 © McQuain & Ribbens Defining Performance Which airplane has.
Performance COE 301 / ICS 233 Computer Organization Prof. Muhamed Mudawar College of Computer Sciences and Engineering King Fahd University of Petroleum.
Chapter 1 Performance & Technology Trends. Outline What is computer architecture? Performance What is performance: latency (response time), throughput.
BITS Pilani, Pilani Campus Today’s Agenda Role of Performance.
Computer Organization CS345 David Monismith Based upon notes by Dr. Bill Siever and from the Patterson and Hennessy Text.
Measuring Performance II and Logic Design
Lecture 2: Performance Today’s topics:
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
September 2 Performance Read 3.1 through 3.4 for Tuesday
How do we evaluate computer architectures?
Defining Performance Which airplane has the best performance?
CSCE 212 Chapter 4: Assessing and Understanding Performance
Chapter 1 Fundamentals of Computer Design
CS2100 Computer Organisation
Defining Performance Section /14/2018 9:52 PM.
Computer Performance He said, to speed things up we need to squeeze the clock.
Performance Cycle time of a computer CPU speed speed = 1 / cycle time
Performance of computer systems
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
Parameters that affect it How to improve it and by how much
Computer Performance Read Chapter 4
Performance.
Chapter 2: Performance CS 447 Jason Bakos Fall 2001 CS 447.
Computer Organization and Design Chapter 4
CS2100 Computer Organisation
Presentation transcript:

Performance

Moore's Law

Moore's Law Related Curves

The Complete System Performance depends on much besides CPU:

Terminology

CPU Time Execution Time:

CPU Time Execution Time:

Clock GHz = billions of cycles / second

Clock Different subsystems, different clocks Pentium IIi7

Limiting Circuits Increasing clock may outpace time required by some circuits:

Limiting Circuits Processor designers can compensate by making instruction step take multiple cycles:

Clock Speedup Potentially counter productive results:

Clock Speedup Potentially counter productive results:

GHz Myth Different processors = different work/clock Ramping up clock in same processor may not help

MIPS MIPS = Millions of Instructions Per Second

CPI Different instructions = different #clock cycles

CPI Finding average cycles per instruction: – Clock cycles/instruction weighted by frequency

CPI Determining time spent doing different types using frequency & CPI: Average: 1.75

MIPS Flaws Compiler A: 2 million 1-cycle + 1 million 2-cycle = 3 million instruction over 4 million cycles Compiler B: 1.5 million 1-cycle million 2-cycle = 2.7 million instructions over 3.9 million cycles Faster program

MIPS Flaws Compiler A: = 3 million instruction & 4 million cycles 3 / 4 =.75 instruction per cycle Compiler B: = 2.7 million instructions & 3.9 million cycles 2.7 / 3.9 =.69 instructions per cycle Faster program Worse IPS

GHz Myth What work does an instruction do? Assume same execution time for A & B – A's MIPS score is 50% higher

MIPS MIPS can't compare different architectures

Measurement Reliable performance measurement must measure all three factors

Amdahl's Law

Gene Amdahl

Amdahl's Law Applied to parallel processing: Applied to computer systems: – Which chunk of work should we optimize?

Speed up A program used to take 20 minutes to run. Now it takes 15. What is the speedup? 1.33 times or 33%

Amdahl's Law Describes overall speedup of a system when we speed up one part of a system

Amdahl's Law Describes overall speedup of a system when we speed up one part of a system f : fraction of time part is limiting factor

Amdahl's Law Describes overall speedup of a system when we speed up one part of a system f : fraction of time part is limiting factor k : speedup of that part

Amdahl's Law Describes overall speedup of a system when we speed up one part of a system f : fraction of time part is limiting factor k : speedup of that part 1 – f : fraction of time doing other stuff

Amdahl's Law Describes overall speedup of a system when we speed up one part of a system f : fraction of time part is limiting factor k : speedup of that part 1 – f : fraction of time doing other stuff S : speed up

Example On a large system, suppose we can upgrade its disk drives for $7,000 to make them 150% faster. Processes spend 70% of their time running in the CPU and 30% of their time waiting for disk service. What would the expected speedup be?

…upgrade its disk drives for $7,000 to make them 150% faster. Processes spend 70% of their time running in the CPU and 30% of their time waiting for disk service f : fraction of time part is limiting factor k : speedup of that part 1 – f : fraction of time doing other stuff S : speed up Example 150% increase = 250%

…upgrade its disk drives for $7,000 to make them 150% faster. Processes spend 70% of their time running in the CPU and 30% of their time waiting for disk service Speed up is ~ 22% Example

On a large system, suppose we can upgrade its disk drives for $7,000 to make them 150% faster. Processes spend 70% of their time running in the CPU and 30% of their time waiting for disk service. Or we could spend $10,000 to upgrade the CPU to be 50% faster…

…upgrade its cpu to run 50% faster. Processes spend 70% of their time running in the CPU and 30% of their time waiting for disk service f : fraction of time part is limiting factor k : speedup of that part 1 – f : fraction of time doing other stuff S : speed up Example

…upgrade its cpu to run 50% faster. Processes spend 70% of their time running in the CPU and 30% of their time waiting for disk service Speed up is ~ 30% Example

Choices Disks : 22% speedup for $7,000 ~ $318 per % CPU : 30% speedup for $10,000 ~ $333 per %

Non IO Protein String Matching Code 4 days execution time on current machine 20% of time doing integer instructions 35% percent of time doing I/O – Which is the better tradeoff? Compiler optimization that reduces number of integer instructions by 25% Hardware optimization that reduces the latency of each IO operations from 6us to 5us.

Non Hardware Reduce instructions by 25% = (1/.75) = 1.33 speedup 20% of time doing integer instructions 35% percent of time doing I/O

Non Hardware Reduce IO from 6us to 5us = (6/5) = 1.2 speedup 20% of time doing integer instructions 35% percent of time doing I/O

Non Hardware Reduce instructions by 25% Reduce IO from 6us to 5us = 20% speedup (6/5)

Limitations of Amdahl's Law Amdahl's Law doesn't take into account interactions – Speeding up one part of process may change the % of time it is limiting factor