Computer Architecture & Operations I

Slides:



Advertisements
Similar presentations
COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Yaohang Li.
Advertisements

Computer Abstractions and Technology
Power calculation for transistor operation What will cause power consumption to increase? CS2710 Computer Organization1.
Chapter 1 CSF 2009 Computer Performance. Defining Performance Which airplane has the best performance? Chapter 1 — Computer Abstractions and Technology.
Lec 2 Aug 31 review of lec 1 continue Ch 1 course overview performance measures Ch 1 exercises quiz 1.
CS/ECE 3330 Computer Architecture Chapter 1 Performance / Power.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
EET 4250: Chapter 1 Performance Measurement, Instruction Count & CPI Acknowledgements: Some slides and lecture notes for this course adapted from Prof.
Chapter 1 Section 1.4 Dr. Iyad F. Jafar Evaluating Performance.
Morgan Kaufmann Publishers Computer Abstractions and Technology
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
CS 161 Spring 2009Course administration Instructor: Laxmi N. Bhuyan Office: 351 Engg. 2 Website:
Computer Organization and Design Performance Montek Singh Mon, Aug 26, 2013 Lecture 2 1.
COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Hao Ji.
CS3350B Computer Architecture Winter 2015 Performance Metrics I Marc Moreno Maza
Chapter 1 Computer Abstractions and Technology Part II.
Last Time Performance Analysis It’s all relative
EET 4250: Chapter 1 Computer Abstractions and Technology Acknowledgements: Some slides and lecture notes for this course adapted from Prof. Mary Jane Irwin.
Chapter 1 - The Computer Revolution Chapter 1 — Computer Abstractions and Technology — 1  Progress in computer technology  Underpinned by Moore’s Law.
Lecture 1: Performance EEN 312: Processors: Hardware, Software, and Interfacing Department of Electrical and Computer Engineering Spring 2013, Dr. Rozier.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
Chapter 1 — Computer Abstractions and Technology — 1 Understanding Performance Algorithm Determines number of operations executed Programming language,
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Chapter 1 Performance & Technology Trends Read Sections 1.5, 1.6, and 1.8.
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
Chapter 1 Technology Trends and Performance. Chapter 1 — Computer Abstractions and Technology — 2 Technology Trends Electronics technology continues to.
Morgan Kaufmann Publishers
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
CSE2021: Computer Organization Instructor: Dr. Amir Asif Department of Computer Science York University Handout # 2: Measuring Performance Topics: 1. Performance:
CPS3340 COMPUTER ARCHITECTURE Fall Semester, /03/2013 Lecture 3: Computer Performance Instructor: Ashraf Yaseen DEPARTMENT OF MATH & COMPUTER SCIENCE.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology.
Chapter 1 — Computer Abstractions and Technology — 1 Uniprocessor Performance Constrained by power, instruction-level parallelism, memory latency.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Modified by S. J. Fritz Spring 2009 (1) Based on slides from D. Patterson and www-inst.eecs.berkeley.edu/~cs152/ COM 249 – Computer Organization and Assembly.
Chapter 1 Performance & Technology Trends. Outline What is computer architecture? Performance What is performance: latency (response time), throughput.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface ARM Edition Chapter 1 Computer Abstractions and Technology.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology.
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers Technology Trends and Performance
Measuring Performance II and Logic Design
Morgan Kaufmann Publishers Computer Abstractions and Technology
Computer Architecture & Operations I
CS161 – Design and Architecture of Computer Systems
CS161 – Design and Architecture of Computer Systems
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers Computer Architecture
Xiang Lian The University of Texas-Pan American
Morgan Kaufmann Publishers Computer Abstractions and Technology
Uniprocessor Performance
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers
Morgan Kaufmann Publishers Computer Abstractions and Technology
COSC 3406: Computer Organization
CSCE 212 Chapter 4: Assessing and Understanding Performance
EENG 3710 Computer Organization
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers Computer Abstractions and Technology
Chapter 1 Computer Abstractions & Technology Performance Evaluation
Morgan Kaufmann Publishers
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers
Performance Lecture notes from MKP, H. H. Lee and S. Yalamanchili.
CS161 – Design and Architecture of Computer Systems
Computer Abstractions and Technology
Presentation transcript:

Computer Architecture & Operations I Instructor: Ryan Florin

Morgan Kaufmann Publishers October 8, 2017 Performance Summary The BIG Picture Performance depends on Algorithm: affects IC, possibly CPI Programming language: affects IC, CPI Compiler: affects IC, CPI Instruction set architecture: affects IC, CPI, Tc Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers October 8, 2017 Power Trends §1.5 The Power Wall In CMOS IC technology Complementary Metal Oxide Semiconductor ×30 5V → 1V ×1000 Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers October 8, 2017 Reducing Power Suppose a new CPU has 85% of capacitive load of old CPU 15% voltage and 15% frequency reduction Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers October 8, 2017 The Power Wall Reduce Voltage With lower voltage, transistor “leak”. Currently responsible for 40% of the power consumption Reduce Heat More powerful cooling How else can we improve performance? Chapter 1 — Computer Abstractions and Technology

Uniprocessor Performance Morgan Kaufmann Publishers October 8, 2017 Uniprocessor Performance §1.6 The Sea Change: The Switch to Multiprocessors Constrained by power, instruction-level parallelism, memory latency Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers October 8, 2017 Multiprocessors Multicore microprocessors More than one processor per chip Requires explicitly parallel programming Compare with instruction level parallelism Hardware executes multiple instructions at once Hidden from the programmer Hard to do Programming for performance Load balancing Optimizing communication and synchronization Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers October 8, 2017 Manufacturing ICs §1.7 Real Stuff: The AMD Opteron X4 Yield: proportion of working dies per wafer Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers October 8, 2017 AMD Athlon X2 Wafer X2: 300mm wafer, 117 chips, 90nm technology (2007) X4: 28nm technology (2013) Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers October 8, 2017 Intel Core i7 300mm wafer, 280 chips, 32nm technology (2011) Kaby Lake: 14nm technology (Aug 2016) Chapter 1 — Computer Abstractions and Technology

Integrated Circuit Cost Morgan Kaufmann Publishers October 8, 2017 Integrated Circuit Cost Nonlinear relation to area and defect rate Wafer cost and area are fixed Defect rate determined by manufacturing process Die area determined by architecture and circuit design Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers October 8, 2017 SPEC CPU Benchmark Programs used to measure performance Supposedly typical of actual workload System Performance Evaluation Coop (SPEC) Develops benchmarks for CPU, I/O, Web, … SPEC CPU2006 Elapsed time to execute a selection of programs Negligible I/O, so focuses on CPU performance Normalize relative to reference machine Summarize as geometric mean of performance ratios CINT2006 (integer) and CFP2006 (floating-point) Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers October 8, 2017 CINT2006 for Opteron X4 2356 Name Description IC×109 CPI Tc (ns) Exec time Ref time SPECratio perl Interpreted string processing 2,118 0.75 0.40 637 9,777 15.3 bzip2 Block-sorting compression 2,389 0.85 817 9,650 11.8 gcc GNU C Compiler 1,050 1.72 0.47 24 8,050 11.1 mcf Combinatorial optimization 336 10.00 1,345 9,120 6.8 go Go game (AI) 1,658 1.09 721 10,490 14.6 hmmer Search gene sequence 2,783 0.80 890 9,330 10.5 sjeng Chess game (AI) 2,176 0.96 0.48 37 12,100 14.5 libquantum Quantum computer simulation 1,623 1.61 1,047 20,720 19.8 h264avc Video compression 3,102 993 22,130 22.3 omnetpp Discrete event simulation 587 2.94 690 6,250 9.1 astar Games/path finding 1,082 1.79 773 7,020 xalancbmk XML parsing 1,058 2.70 1,143 6,900 6.0 Geometric mean 11.7 High cache miss rates Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers October 8, 2017 SPEC Power Benchmark Power consumption of server at different workload levels Performance: ssj_ops/sec Power: Watts (Joules/sec) Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers October 8, 2017 SPECpower_ssj2008 for X4 Target Load % Performance (ssj_ops/sec) Average Power (Watts) 100% 231,867 295 90% 211,282 286 80% 185,803 275 70% 163,427 265 60% 140,160 256 50% 118,324 246 40% 920,35 233 30% 70,500 222 20% 47,126 206 10% 23,066 180 0% 141 Overall sum 1,283,590 2,605 ∑ssj_ops/ ∑power 493 Chapter 1 — Computer Abstractions and Technology

Fallacy: Low Power at Idle Morgan Kaufmann Publishers October 8, 2017 Fallacy: Low Power at Idle Look back at X4 power benchmark At 100% load: 295W At 50% load: 246W (83%) At 10% load: 180W (61%) Google data center Mostly operates at 10% – 50% load At 100% load less than 1% of the time Consider designing processors to make power proportional to load Chapter 1 — Computer Abstractions and Technology

SPECpower_ssj2008 Intel XEON X5650

Pitfall: MIPS as a Performance Metric Morgan Kaufmann Publishers October 8, 2017 Pitfall: MIPS as a Performance Metric MIPS: Millions of Instructions Per Second Doesn’t account for Differences in ISAs between computers Differences in complexity between instructions CPI varies between programs on a given CPU Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers October 8, 2017 Concluding Remarks Cost/performance is improving Due to underlying technology development Instruction set architecture The hardware/software interface Execution time: the best performance measure Power is a limiting factor Use parallelism to improve performance §1.9 Concluding Remarks Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers October 8, 2017 Amdahl’s Law Improving an aspect of a computer and expecting a proportional improvement in overall performance §1.8 Fallacies and Pitfalls Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers October 8, 2017 Amdahl’s Law §1.8 Fallacies and Pitfalls Example: A program runs for 100 seconds. Of the 100 seconds, 80 seconds are due to multiplies. How much do we need to improve the multiply to achieve 2x performance? Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers October 8, 2017 Amdahl’s Law Example: A program runs for 100 seconds. Of the 100 seconds, 80 seconds are due to multiplies. 2x improvement: n = 2.66 3x improvement: n = 6 4x improvement: n = 16 5x improvement: n = ? §1.8 Fallacies and Pitfalls Corollary: make the common case fast Chapter 1 — Computer Abstractions and Technology

Summary Performance Definition Power Trend Amdahl’s Law

What I want you to do Review Chapter 1 Work on assignment 1