Lecture 2: Performance Evaluation

Slides:



Advertisements
Similar presentations
1 Lecture 2: Metrics to Evaluate Performance Topics: Benchmark suites, Performance equation, Summarizing performance with AM, GM, HM Video 1: Using AM.
Advertisements

11 Measuring performance Kosarev Nikolay MIPT Feb, 2010.
CS2100 Computer Organisation Performance (AY2014/2015) Semester 2.
Computer Abstractions and Technology
TU/e Processor Design 5Z032 1 Processor Design 5Z032 The role of Performance Henk Corporaal Eindhoven University of Technology 2009.
Evaluating Performance
Performance Analysis of Multiprocessor Architectures
CS 6290 Evaluation & Metrics. Performance Two common measures –Latency (how long to do X) Also called response time and execution time –Throughput (how.
Lecture 2c: Benchmarks. Benchmarking Benchmark is a program that is run on a computer to measure its performance and compare it with other machines Best.
Computer Organization and Architecture 18 th March, 2008.
1 COMP 206: Computer Architecture and Implementation Montek Singh Mon., Sep 5, 2005 Lecture 2.
CIS629 Fall Lecture Performance Overview Execution time is the best measure of performance: simple, intuitive, straightforward. Two important.
Computer ArchitectureFall 2007 © September 17, 2007 Karem Sakallah CS-447– Computer Architecture.
Computer Architecture Lecture 2 Instruction Set Principles.
Chapter 4 Assessing and Understanding Performance
1 COMP 206: Computer Architecture and Implementation Montek Singh Wed., Sep 3, 2003 Lecture 2.
1 Lecture 10: FP, Performance Metrics Today’s topics:  IEEE 754 representations  FP arithmetic  Evaluating a system Reminder: assignment 4 due in a.
CIS429/529 Winter 07 - Performance - 1 Performance Overview Execution time is the best measure of performance: simple, intuitive, straightforward. Two.
1 Chapter 4. 2 Measure, Report, and Summarize Make intelligent choices See through the marketing hype Key to understanding underlying organizational motivation.
CPU Performance Assessment As-Bahiya Abu-Samra *Moore’s Law *Clock Speed *Instruction Execution Rate - MIPS - MFLOPS *SPEC Speed Metric *Amdahl’s.
CMSC 611: Advanced Computer Architecture Benchmarking Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted.
Lecture 2: Technology Trends and Performance Evaluation Performance definition, benchmark, summarizing performance, Amdahl’s law, and CPI.
Performance & Benchmarking. What Matters? Which airplane has best performance:
1 Computer Performance: Metrics, Measurement, & Evaluation.
Lecture 2: Computer Performance
BİL 221 Bilgisayar Yapısı Lab. – 1: Benchmarking.
Memory/Storage Architecture Lab Computer Architecture Performance.
Recap Technology trends Cost/performance Measuring and Reporting Performance What does it mean to say “computer X is faster than computer Y”? E.g. Machine.
1 Introduction Background: CS 3810 or equivalent, based on Hennessy and Patterson’s Computer Organization and Design Text for CS/EE 5810/6810: Hennessy.
CDA 3101 Fall 2013 Introduction to Computer Organization Computer Performance 28 August 2013.
10/19/2015Erkay Savas1 Performance Computer Architecture – CS401 Erkay Savas Sabanci University.
1 CS/EE 362 Hardware Fundamentals Lecture 9 (Chapter 2: Hennessy and Patterson) Winter Quarter 1998 Chris Myers.
1. 2 Table 4.1 Key characteristics of six passenger aircraft: all figures are approximate; some relate to a specific model/configuration of the aircraft.
1 CS/COE0447 Computer Organization & Assembly Language CHAPTER 4 Assessing and Understanding Performance.
Computer Architecture
1 Seoul National University Performance. 2 Performance Example Seoul National University Sonata Boeing 727 Speed 100 km/h 1000km/h Seoul to Pusan 10 hours.
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
CEN 316 Computer Organization and Design Assessing and Understanding Performance Mansour AL Zuair.
1  1998 Morgan Kaufmann Publishers How to measure, report, and summarize performance (suorituskyky, tehokkuus)? What factors determine the performance.
Performance Performance
TEST 1 – Tuesday March 3 Lectures 1 - 8, Ch 1,2 HW Due Feb 24 –1.4.1 p.60 –1.4.4 p.60 –1.4.6 p.60 –1.5.2 p –1.5.4 p.61 –1.5.5 p.61.
1 Lecture 2: Performance, MIPS ISA Today’s topics:  Performance equations  MIPS instructions Reminder: canvas and class webpage:
September 10 Performance Read 3.1 through 3.4 for Wednesday Only 3 classes before 1 st Exam!
Performance – Last Lecture Bottom line performance measure is time Performance A = 1/Execution Time A Comparing Performance N = Performance A / Performance.
Lec2.1 Computer Architecture Chapter 2 The Role of Performance.
Performance Analysis Topics Measuring performance of systems Reasoning about performance Amdahl’s law Systems I.
EGRE 426 Computer Organization and Design Chapter 4.
CMSC 611: Advanced Computer Architecture Performance & Benchmarks Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some.
Performance Computer Organization II 1 Computer Science Dept Va Tech January 2009 © McQuain & Ribbens Defining Performance Which airplane has.
Jan. 5, 2000Systems Architecture II1 Machine Organization (CS 570) Lecture 2: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed. Oct. 4,
CSE 340 Computer Architecture Summer 2016 Understanding Performance.
June 20, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 1: Performance Evaluation and Benchmarking * Jeremy R. Johnson Wed.
BITS Pilani, Pilani Campus Today’s Agenda Role of Performance.
Measuring Performance II and Logic Design
CS203 – Advanced Computer Architecture
4- Performance Analysis of Parallel Programs
Lecture 3: MIPS Instruction Set
September 2 Performance Read 3.1 through 3.4 for Tuesday
ECE 4100/6100 Advanced Computer Architecture Lecture 1 Performance
Defining Performance Which airplane has the best performance?
CS2100 Computer Organisation
Performance of computer systems
Performance of computer systems
Computer Evolution and Performance
Lecture 3: MIPS Instruction Set
Performance of computer systems
Parameters that affect it How to improve it and by how much
Performance Measurement and Analysis
Computer Organization and Design Chapter 4
CS2100 Computer Organisation
Presentation transcript:

Lecture 2: Performance Evaluation Performance definition, benchmark, summarizing performance, Amdahl’s law, and CPI

What Does Performance Mean? Response time A simulation program finishes in 5 minutes Throughput A web server serves 5 million request per second Other metrics MIPS (million instruction per second) MFLOPS Clock frequency Meaning: how fast the computer respond to user input or finish a program; the sooner the better Examples: A user sees the result in 5 second after inputting the keyword to a library database system; A simulation program finishes in one hour how much work a computer can finish for a given time; the higher the better MFLOPS: A clustered-base web server serves up to 5 million requests per second Why throughput: a system runs multiple jobs simultaneously. Other measures MIPS, MFLOPS, clock frequency Meaningful when directly related to response time or throughput

Execution Time Processor design is concerned with processor consumed by program execution. Shorter execution time=> Shorter response time Higher throughput Execution time = #inst×CPI×Cycletime What affects #inst, CPI, and cycle time? Almost all designs can be interpreted Any other metrics is meaningful only if consistent with execution time

Performance of Computers Performance is defined for a program and a machine. How to compare computers? Need benchmark programs: Real applications: scientific programs, compilers, text-processing software, image processing Modified applications: providing portability and focus Kernels: good to isolate performance of individual features Lmbench: measure latency and bandwidth of memory, file system, networking, etc. Toy benchmarks Synthetic benchmarks: matching average execution profile

Performance Comparison “X is n times faster than Y”: n: speedup if we are considering an enhancement, optimization, etc. What does “improving” mean? Improve performance: decrease execution time, increase throughput Improve execution time: decrease execution time Degrade performance: the reverse of the above; brings negative speedup Give examples: Given machine M1, program A, Program B Execution Time Performance A 5 B 10

Benchmark Suite Benchmark suite is a collection of benchmarks with a variety of applications Alleviating weakness of a single benchmark More representative for computer designers to evaluate their design Benchmarks test both computer and compilers, and OS in many cases Desktop benchmarks: CPU, memory, and graphics performance Sever benchmarks: throughput-oriented, I/O and OS intensive Embedded benchmarks: measuring the ability to meet deadline and save power

Summarizing Performance Given the performance of a set of programs, how to evaluate the performance of machines? A B C P1 (secs) 1 10 20 P2 (secs) 1000 100 20 Total (secs) 1001 110 40 Which computer is the “best” one?

Arithmetic Mean Total execution time / (number of programs) Simple and intuitive Representative if the user run the programs an equal number of times

Weighted Arithmetic Mean Give (different) weights to different programs Considering the frequencies of programs in the workload

Geometric Means Based on relative performance to a reference machine Relative performance is consistent with different reference machines If C is 2x faster than B (using B as the reference), B is 2x faster than A (A as the reference), then C is 4x faster than A (A as the reference)

Harmonic Mean Given speedups s1, s2, …, s_n, the average speedup by harmonic mean is n / (1/s1 + 1/s2 + … + 1/s_n) Why not arithmetic mean?

Amdahl’s Law We know about performance: defining, measuring, and summarizing How to maximize performance gains from the beginning in our design? Principle: Make the Common Case Fast!

Amdahl’s Law Predict overall speedup from “local speedup” by an enhancement, provided the frequency to use the enhancement is know. “Local speedup” is related to design and optimization objectives, like to double CPU frequency, to reduce cache latency by half

Amdahl’s Law

Equation Based on Instruction Types

Make Design Choice Using CPU Time Equation Assume we need to improve the performance of a graphics engine: FP FPSQR Other Frequency 25% 2% 75% CPI 4.0 20 1.33 Alternative 1: CPIFPSQR 20  2 Alternative 2: CPIFP 4  2.5 Which one is better? Calculate speedups.

Amdahl’s Law Choice one: Speed up FP Square root by 10x Choice two: Speed up all FP instruction by 1.6x 20% time are used by FP Square root, 50% for all FP inst Which choice is better? Implication: Optimizing for the common case first

SPEC CPU Benchmark SPEC: Standard Performance Evaluation Corporation CPU-intensive benchmark for evaluating processor performance of workstation Four generations: SPEC89, SPEC92, SPEC95, and SPEC2000 Two types of programs: INT and FP Emphasizing memory system performance in SPEC2000

SPEC CPU2000 Profiling Dynamic instruction mix Instruction Int avg FP avg Load int 26% 15% Store int 10% 2% Load fp - 15% Store fp - 7% Add 19% 23% All fp inst - 41% Cond br. 12% 4% All ctrl inst 16% 4%

Other SPEC Benchmarks SPECviewperf and SPEapc: 3D graphics performance SPEC JVM98: performance of client-side Java virtual machine SPEC JBB2000: Server-cline Java application SPEC WEB99: evaluating WWW servers SPEC HPC96: parallel and distributed computing

Server Benchmarks SPEC CPU2000, WBB99, SFS97 TPC Measuring the ability of a system to handle transactions TPC-C: online transaction processing (OLTP) benchmark (for bank systems) TPC-H: ad hoc decision make support TPC-R: decision make support with standard queries TPC-W: simulating business-oriented transactional web server

Embedded Benchmark EEMBC (Embedded Microprocessor Benchmark Consortium) benchmarks Based on kernel performance Five classes: automotive/industrial, consumer networking, office automation, and telecommunications Embedded benchmarks are not mature