Welcome to Computer Architecture

Slides:



Advertisements
Similar presentations
® 1 Exponential Challenges, Exponential Rewards The Future of Moores Law Shekhar Borkar Intel Fellow Circuit Research, Intel Labs Fall, 2004.
Advertisements

Computer Structure Power Management Lihu Rappoport and Adi Yoaz Thanks to Efi Rotem for many of the foils.
VARIUS: A Model of Process Variation and Resulting Timing Errors for Microarchitects Sarangi et al Prateeksha Satyamoorthy CS
MULTICORE PROCESSOR TECHNOLOGY.  Introduction  history  Why multi-core ?  What do you mean by multicore?  Multi core architecture  Comparison of.
1 xkcd/619. Multicore & Parallel Processing Guest Lecture: Kevin Walsh CS 3410, Spring 2011 Computer Science Cornell University.
Introduction to CMOS VLSI Design Lecture 21: Scaling and Economics
Lecture 2: Modern Trends 1. 2 Microprocessor Performance Only 7% improvement in memory performance every year! 50% improvement in microprocessor performance.
VLSI Trends. A Brief History  1958: First integrated circuit  Flip-flop using two transistors  From Texas Instruments  2011  Intel 10 Core Xeon Westmere-EX.
Low-power computer architecture
Temperature-Aware Design Presented by Mehul Shah 4/29/04.
EE141 © Digital Integrated Circuits 2nd Introduction 1 The First Computer.
ASIC Design Introduction - 1 The history of Integrated Circuit (IC) The base for such a significant progress –Well understanding of semiconductor physics.
1 VLSI and Computer Architecture Trends ECE 25 Fall 2012.
EE141 © Digital Integrated Circuits 2nd Introduction 1 EE4271 VLSI Design Dr. Shiyan Hu Office: EERC 518 Adapted and modified from Digital.
2/6: CPUs & Memory CPUs –Parts of a sample CPU –Types of CPUs available ROM RAM –different kinds & uses inc. VRAM, SRAM image courtesy of How Computers.
CSE477 L01 Introduction.1Irwin&Vijay, PSU, 2002 ECE484 VLSI Digital Circuits Fall 2014 Lecture 01: Introduction Adapted from slides provided by Mary Jane.
Sub-threshold Design of Ultra Low Power CMOS Circuits Students: Dmitry Vaysman Alexander Gertsman Supervisors: Prof. Natan Kopeika Prof. Orly Yadid-Pecht.
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
® 1 VLSI Design Challenges for Gigascale Integration Shekhar Borkar Intel Corp. October 25, 2005.
1 CS/EE 6810: Computer Architecture Class format:  Most lectures on YouTube *BEFORE* class  Use class time for discussions, clarifications, problem-solving,
1 Some Limits of Power Delivery in the Multicore Era Runjie Zhang, Brett H. Meyer, Wei Huang, Kevin Skadron and Mircea R. Stan University of Virginia,
Present – Past -- Future
MULTICORE PROCESSOR TECHNOLOGY.  Introduction  history  Why multi-core ?  What do you mean by multicore?  Multi core architecture  Comparison of.
Transistor Counts 1,000, ,000 10,000 1, i386 i486 Pentium ® Pentium ® Pro K 1 Billion Transistors.
EE141 © Digital Integrated Circuits 2nd Introduction 1 Principle of CMOS VLSI Design Introduction Adapted from Digital Integrated, Copyright 2003 Prentice.
0 1 Thousand Core Chips A Technology Perspective Shekhar Borkar Intel Corp. June 7, 2007.
EE5780 Advanced VLSI Computer-Aided Design
Computer Science and Engineering Power-Performance Considerations of Parallel Computing on Chip Multiprocessors Jian Li and Jose F. Martinez ACM Transactions.
Scaling I Mohammad Sharifkhani. Reading Text book II (pp. 123)
EE141 © Digital Integrated Circuits 2nd Introduction 1 EE5900 Advanced Algorithms for Robust VLSI CAD Dr. Shiyan Hu Office: EERC 731 Adapted.
CS203 – Advanced Computer Architecture
Hardware Architecture
EE141 © Digital Integrated Circuits 2nd Introduction 1 EE4271 VLSI Design Dr. Shiyan Hu Office: EERC 731 Adapted and modified from Digital.
Memory and Storage Chapter 9. Memory Memory ( from programmer’s point of view): Temporary storage to hold data as well as programs. An Architect views.
SPRING 2012 Assembly Language. Definition 2 A microprocessor is a silicon chip which forms the core of a microcomputer the concept of what goes into a.
Modern Processors.  Desktop processors  Notebook processors  Server and workstation processors  Embedded and communications processors  Internet.
Chapter 2 content Basic organization of computer What is motherboard
ECE484 VLSI Digital Circuits Fall 2016 Lecture 01: Introduction
CS203 – Advanced Computer Architecture
Lecture 2: Performance Today’s topics:
Lynn Choi School of Electrical Engineering
Temperature and Power Management
ECE 3055: Computer Architecture and Operating Systems
Lecture 1: CS/ECE 3810 Introduction
CIT 668: System Architecture
At a recent fireside chat…
Lynn Choi School of Electrical Engineering
Day 12: October 1, 2012 Variation
Basics of Energy & Power Dissipation
TECHNOLOGY TRENDS.
The Beauty and Joy of Computing Lecture #8 : Concurrency
Temperature aware architecture of 14nm Broadwell chip-sets
Circuits and Interconnects In Aggressively Scaled CMOS
Phnom Penh International University (PPIU)
Basic Computer Organization
CS2100 Computer Organisation
Lecture 2: Performance Today’s topics: Technology wrap-up
Parallel Processing Sharing the load.
Transistors on lead microprocessors double every 2 years Moore’s Law in Microprocessors Transistors on lead microprocessors double every 2 years.
Performance of computer systems
Impact of Parameter Variations on Multi-core chips
Motherboard External Hard disk USB 1 DVD Drive RAM CPU (Main Memory)
Computer Evolution and Performance
Processor Design Challenges
Introduction EE4271 VLSI Design Professor Shiyan Hu Office: EERC 518
Chapter 4 Multiprocessors
Lecture 15: Scaling & Economics
Lecture 15: Scaling & Economics
CompArch: the 4th Epoch Looms
CS2100 Computer Organisation
Presentation transcript:

Welcome to Computer Architecture Kevin Skadron

What is Computer Architecture? http://hem.passagen.se/religion/bild.html

Talk about Power Hungry…

Hot Chips are No Longer Cool! Sun's Surface 1000 Rocket Nozzle Nuclear Reactor 100 Watts/cm 2 Pentium® 4 Hot plate Pentium® III Pentium® II 10 Pentium® Pro Pentium® i386 i486 1 1.5m 1m 0.7m 0.5m 0.35m 0.25m 0.18m 0.13m 0.1m 0.07m * “New Microarchitecture Challenges in the Coming Generations of CMOS Process Technologies” – Fred Pollack, Intel Corp. Micro32 conference key note - 1999.

Ouch Operating temperatures > 100°!

Graphics Cards Nvidia GeForce 5900 card, a.k.a. the “Dustbuster” Source: Tech-Report.com

Prognostications Linear extrapolation: 2012, Pentium 8 12GHz, 8-way CMP, 4-way MT, 100MB L4 cache on die, 20GB/s optical I/O, 250W, 2TB hard drive What will users do with this? Sequential prophecy alternative A What they do now. Somewhat faster. Commoditized. Sequential prophecy alternative B They won’t want them. They’ll want affordable, mobile, ubiquitous, secure computing based on huge local non-volatile storage with auto-backup Comp Arch must anticipate both alternatives! Courtesy of Bob Colwell

Physically-Aware Architecture

ITRS Projections Zero-Sum Architecture? 2001 – was 0.4 These are targets, doubtful that they are feasible Growth in power density means cooling costs continue to grow High-performance designs seem to be shifting away from clock frequency toward # cores Supply voltage scaling seems to be stopping ITRS 2004 2001 – was 288

A New Era of Multi-Core Architectures vs. Source: Chrostopher Reeve Homepage, http://www.chrisreevehomepage.com/ Cores may also be heterogeneous, with a few powerful cores and very many small cores

Gate L variation due to Optical High freq, high leakage Low freq, low leakage Note random var’s

Impact of Static Variations 1.4 Frequency ~30% Leakage Power ~5-10X 1.3 30% 1.2 130nm Normalized Frequency 1.1 1.0 5X 0.9 1 2 3 4 5 Normalized Leakage (Isb) Source: Shekhar Borkar, keynote presentation, MICRO-37, 2004

Sources, types of variations Device variations L (systematic) Tox, Nd, W (random) Interconnect Width, thickness, height, crosstalk (mostly random) Environmental T, Vdd (heavy arch. dependence) Package variations TIM thickness, chip roughness, etc. (random) Device variations Primarily affect Vth Vth Leakage, frequency Leakage  T Temperature Leakage, frequency, Vdd Vcc Package variations Within-die T effects Chip to chip too

Some Architecture Implications Core-to-core variation: 10-30% freq. Limitations on # cores, placement Symmetry is expensive Leakage increases by 50-100% if you Vth-compensate all cores to nominal This would exacerbate thermal limitations, and probably require voltage scaling or throttling Or you compensate all cores to slowest core, and leave performance on the table SRAM variations will also lead to non-uniform caches

Classic Equations

Important Performance Eqns. CPU performance equation: insts cycles time CPU time = ------ --------- ------ prog inst cycle Speedup = told / tnew Amdahl’s Law: 1 Speedup = ------------------------------------------------ (1 – Fracenh) + Fracenh/Speedupenh

Implications of Amdahl’s Law http://nl.ijs.si/et/talks/esslli02/metadata_files/Haystack-FINALb.jpg

An Unbalanced System CPU I/O, Memory, Cache Courtesy of Bob Colwell

Important Performance Eqns. CPU performance equation: insts cycles time CPU time = ------ --------- ------ prog inst cycle Speedup = told / tnew Amdahl’s Law: 1 Speedup = ------------------------------------------------ (1 – Fracenh) + Fracenh/Speedupenh

Amdahl’s Law example