Lecturer: Simon Winberg Lecture 18 Amdahl’s Law (+- 25 min)

Slides:



Advertisements
Similar presentations
Analyzing Parallel Performance Intel Software College Introduction to Parallel Programming – Part 6.
Advertisements

Performance Measurement n Assignment? n Timing #include double When() { struct timeval tp; gettimeofday(&tp, NULL); return((double)tp.tv_sec + (double)tp.tv_usec.
“Amdahl's Law in the Multicore Era” Mark Hill and Mike Marty University of Wisconsin IEEE Computer, July 2008 Presented by Dan Sorin.
Prepared 7/28/2011 by T. O’Neil for 3460:677, Fall 2011, The University of Akron.
Distributed Systems CS
SE-292 High Performance Computing
PradeepKumar S K Asst. Professor Dept. of ECE, KIT, TIPTUR. PradeepKumar S K, Asst.
La microarchitecture est morte. Longue vie à la microarchitecture! ISCA 2010 Panel, St. Malo, France (Microarchitecture is dead. Long live microarchitecture!)
Lecturer: Simon Winberg Lecture 18 Amdahl’s Law & YODA Blog & Design Review.
11Sahalu JunaiduICS 573: High Performance Computing5.1 Analytical Modeling of Parallel Programs Sources of Overhead in Parallel Programs Performance Metrics.
Example (1) Two computer systems have been tested using three benchmarks. Using the normalized ratio formula and the following tables below, find which.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Parallel Programming with MPI and OpenMP Michael J. Quinn.
Arquitectura de Sistemas Paralelos e Distribuídos Paulo Marques Dep. Eng. Informática – Universidade de Coimbra Ago/ Quantitative.
Performance Metrics Parallel Computing - Theory and Practice (2/e) Section 3.6 Michael J. Quinn mcGraw-Hill, Inc., 1994.
Recap.
CS 584 Lecture 11 l Assignment? l Paper Schedule –10 Students –5 Days –Look at the schedule and me your preference. Quickly.
Lecture 5 Today’s Topics and Learning Objectives Quinn Chapter 7 Predict performance of parallel programs Understand barriers to higher performance.
Steve Lantz Computing and Information Science Parallel Performance Week 7 Lecture Notes.
Amdahl's Law.
CPU Performance Assessment As-Bahiya Abu-Samra *Moore’s Law *Clock Speed *Instruction Execution Rate - MIPS - MFLOPS *SPEC Speed Metric *Amdahl’s.
Rechen- und Kommunikationszentrum (RZ) Parallelization at a Glance Christian Terboven / Aachen, Germany Stand: Version 2.3.
Computer Science 320 Measuring Speedup. What Is Running Time? T(N, K) says that the running time T is a function of the problem size N and the number.
18-447: Computer Architecture Lecture 30B: Multiprocessors Prof. Onur Mutlu Carnegie Mellon University Spring 2013, 4/22/2013.
Lecture 3 – Parallel Performance Theory - 1 Parallel Performance Theory - 1 Parallel Computing CIS 410/510 Department of Computer and Information Science.
Performance Evaluation of Parallel Processing. Why Performance?
Extending Amdahl’s Law in the Multicore Era Erlin Yao, Yungang Bao, Guangming Tan and Mingyu Chen Institute of Computing Technology, Chinese Academy of.
INTEL CONFIDENTIAL Predicting Parallel Performance Introduction to Parallel Programming – Part 10.
Amdahl’s Law in the Multicore Era Mark D.Hill & Michael R.Marty 2008 ECE 259 / CPS 221 Advanced Computer Architecture II Presenter : Tae Jun Ham 2012.
Agenda Project discussion Modeling Critical Sections in Amdahl's Law and its Implications for Multicore Design, S. Eyerman, L. Eeckhout, ISCA'10 [pdf]pdf.
Flynn’s Taxonomy SISD: Although instruction execution may be pipelined, computers in this category can decode only a single instruction in unit time SIMD:
Amdahl's Law Validity of the single processor approach to achieving large scale computing capabilities Presented By: Mohinderpartap Salooja.
Performance Measurement n Assignment? n Timing #include double When() { struct timeval tp; gettimeofday(&tp, NULL); return((double)tp.tv_sec + (double)tp.tv_usec.
Parallel Processing - introduction  Traditionally, the computer has been viewed as a sequential machine. This view of the computer has never been entirely.
Performance Measurement. A Quantitative Basis for Design n Parallel programming is an optimization problem. n Must take into account several factors:
April 26, CSE8380 Parallel and Distributed Processing Presentation Hong Yue Department of Computer Science & Engineering Southern Methodist University.
Compiled by Maria Ramila Jimenez
Lecture 9 TTH 03:30AM-04:45PM Dr. Jianjun Hu CSCE569 Parallel Computing University of South Carolina Department of.
From lecture slides for Computer Organization and Architecture: Designing for Performance, Eighth Edition, Prentice Hall, 2010 CS 211: Computer Architecture.
Chapter 1 Performance & Technology Trends Read Sections 1.5, 1.6, and 1.8.
Parallel Programming with MPI and OpenMP
Advanced Computer Networks Lecture 1 - Parallelization 1.
Lecture 27 Multiprocessor Scheduling. Last lecture: VMM Two old problems: CPU virtualization and memory virtualization I/O virtualization Today Issues.
Scaling Conway’s Game of Life. Why do parallelism? Speedup – solve a problem faster. Accuracy – solve a problem better. Scaling – solve a bigger problem.
Computer Science 320 Measuring Sizeup. Speedup vs Sizeup If we add more processors, we should be able to solve a problem of a given size faster If we.
LECTURE #1 INTRODUCTON TO PARALLEL COMPUTING. 1.What is parallel computing? 2.Why we need parallel computing? 3.Why parallel computing is more difficult?
SSU 1 Dr.A.Srinivas PES Institute of Technology Bangalore, India 9 – 20 July 2012.
Distributed and Parallel Processing George Wells.
Computer Organization CS345 David Monismith Based upon notes by Dr. Bill Siever and from the Patterson and Hennessy Text.
Supercomputing in Plain English Tuning Blue Waters Undergraduate Petascale Education Program May 29 – June
Software Architecture in Practice
18-447: Computer Architecture Lecture 30B: Multiprocessors
Computer Architecture: Parallel Processing Basics
Parallel Processing - introduction
What Exactly is Parallel Processing?
Parallel Computing Lecture
Introduction to Parallelism.
EE 193: Parallel Computing
Lecture 4: Parallel Computing Fundamentals, Base Core Equivalents
Parallel Processing Sharing the load.
Closing Remarks Cyrus M. Vahid, Principal Solutions Architect,
CSE8380 Parallel and Distributed Processing Presentation
Quiz Questions Parallel Programming Parallel Computing Potential
PERFORMANCE MEASURES. COMPUTATIONAL MODELS Equal Duration Model:  It is assumed that a given task can be divided into n equal subtasks, each of which.
Mattan Erez The University of Texas at Austin
Complexity Measures for Parallel Computation
Quiz Questions Parallel Programming Parallel Computing Potential
Quiz Questions Parallel Programming Parallel Computing Potential
Quiz Questions Parallel Programming Parallel Computing Potential
Dealing with reading assignments
Presentation transcript:

Lecturer: Simon Winberg Lecture 18 Amdahl’s Law (+- 25 min)

 Amdahl’s Law  Using Base Core Equivalent with Amdahl’s Law

Amdahl’s Law EEE4084F Speed-up Number of processors

 The guy: Gene Amdahl  Was chief architect for IBM's first mainframe series of computers  Founder of Amdahl Corporation  Amdahl found stringent restrictions on the speedup possible for given parallelized tasks.  Thee observations packaged as: Amdahl's Law

Linux Magazine Video: Understanding Parallel Computing: Amdahl's Law

 Define f as: fraction of computation that can be parallelized (ignoring scheduling overhead)  Then (1 - f ) is the fraction that is sequential  Define n = no. processors for parallel case  The maximum speed-up achievable is: 1 f ( 1 – f ) + n Speedup parallel = Classic version Should be able to remember this formula for exams

Amdahl’s Law: Alternate Representation P = expected performance improvement E u = Execution time on a uniprocessor (serial) E p = Execution time on a number of processors (parallel) n = number of processors S = fraction of time spent in the sequential time

 A great many papers on the subject of Amdahl’s law  Many mention the term BCE  BCE = Base Core Equivalent  A single processing core in a multicore processor design

Example proc. structures Symmetric multicore 16 one-BCE cores Symmetric multicore 4 x 4-BCE cores Asymmetric multicore comprising 2 x 4-BCE cores + 8 x 1-BCE cores

Amdahl’s Law Calculator Check out Hill and Marty’s Amdahl’s interactive grapher at: multifacet/amdahl/ multifacet/amdahl/ You can also download the OCTAVE / Matlab code to do these graphs! Quiz example of resultant graph…

Example run of Hill and Marty’s Amdahl’s interactive grapher:

 Hill and Marty 2008: “Amdahl’s Law in the Multicore Era” Available: amdahl_multicore.pdf (see Vula) amdahl_multicore.pdf  Good article on Wikipedia look over: