1 VLSI and Computer Architecture Trends ECE 25 Fall 2012.

Slides:



Advertisements
Similar presentations
VADA Lab.SungKyunKwan Univ. 1 L3: Lower Power Design Overview (2) 성균관대학교 조 준 동 교수
Advertisements

Evolution of Chip Design ECE 111 Spring A Brief History 1958: First integrated circuit – Flip-flop using two transistors – Built by Jack Kilby at.
Introduction to CMOS VLSI Design Lecture 21: Scaling and Economics
Lecture 2: Modern Trends 1. 2 Microprocessor Performance Only 7% improvement in memory performance every year! 50% improvement in microprocessor performance.
VLSI Trends. A Brief History  1958: First integrated circuit  Flip-flop using two transistors  From Texas Instruments  2011  Intel 10 Core Xeon Westmere-EX.
Chapter 1. Introduction This course is all about how computers work But what do we mean by a computer? –Different types: desktop, servers, embedded devices.
Design and Implementation of VLSI Systems (EN0160)
8/18/05ELEC / Lecture 11 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic Circuits.
S. Reda EN160 SP’08 Design and Implementation of VLSI Systems (EN1600) Lecture 18: Scaling Theory Prof. Sherief Reda Division of Engineering, Brown University.
Prof. John Nestor ECE Department Lafayette College Easton, Pennsylvania ECE Computer Organization Lecture 2 - Technology.
EE314 Basic EE II Silicon Technology [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
ECE 510 Brendan Crowley Paper Review October 31, 2006.
Introduction to CMOS VLSI Design Lecture 21: Scaling and Economics Credits: David Harris Harvey Mudd College (Material taken/adapted from Harris’ lecture.
EE141 © Digital Integrated Circuits 2nd Introduction 1 The First Computer.
1 Copyright © 2012, Elsevier Inc. All rights reserved. Chapter 1 Fundamentals of Quantitative Design and Analysis Computer Architecture A Quantitative.
ASIC Design Introduction - 1 The history of Integrated Circuit (IC) The base for such a significant progress –Well understanding of semiconductor physics.
EE141 © Digital Integrated Circuits 2nd Introduction 1 EE4271 VLSI Design Dr. Shiyan Hu Office: EERC 518 Adapted and modified from Digital.
Multi Core Processor Submitted by: Lizolen Pradhan
Lecture 03: Fundamentals of Computer Design - Trends and Performance Kai Bu
CSE477 L01 Introduction.1Irwin&Vijay, PSU, 2002 ECE484 VLSI Digital Circuits Fall 2014 Lecture 01: Introduction Adapted from slides provided by Mary Jane.
Multi-core Programming Introduction Topics. Topics General Ideas Moore’s Law Amdahl's Law Processes and Threads Concurrency vs. Parallelism.
Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.
® 1 VLSI Design Challenges for Gigascale Integration Shekhar Borkar Intel Corp. October 25, 2005.
VLSI & ECAD LAB Introduction.
MS108 Computer System I Lecture 2 Metrics Prof. Xiaoyao Liang 2014/2/28 1.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
[Tim Shattuck, 2006][1] Performance / Watt: The New Server Focus Improving Performance / Watt For Modern Processors Tim Shattuck April 19, 2006 From the.
Outline  Over view  Design  Performance  Advantages and disadvantages  Examples  Conclusion  Bibliography.
Introduction to CMOS VLSI Design Lecture 1: Circuits & Layout.
Present – Past -- Future
Morgan Kaufmann Publishers
Semiconductor Industry Milestones
© Digital Integrated Circuits 2nd Inverter Digital Integrated Circuits A Design Perspective The Inverter Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic.
EE586 VLSI Design Partha Pande School of EECS Washington State University
Transistor Counts 1,000, ,000 10,000 1, i386 i486 Pentium ® Pentium ® Pro K 1 Billion Transistors.
EE141 © Digital Integrated Circuits 2nd Introduction 1 Principle of CMOS VLSI Design Introduction Adapted from Digital Integrated, Copyright 2003 Prentice.
EE5780 Advanced VLSI Computer-Aided Design
Overview of VLSI 魏凱城 彰化師範大學資工系. VLSI  Very-Large-Scale Integration Today’s complex VLSI chips  The number of transistors has exceeded 120 million 
DR. SIMING LIU SPRING 2016 COMPUTER SCIENCE AND ENGINEERING UNIVERSITY OF NEVADA, RENO Session 3 Computer Evolution.
EE141 © Digital Integrated Circuits 2nd Introduction 1 EE5900 Advanced Algorithms for Robust VLSI CAD Dr. Shiyan Hu Office: EERC 731 Adapted.
ECE 3110: Introduction to Digital Systems Introduction (Contd.)
CS203 – Advanced Computer Architecture
Computer Organization IS F242. Course Objective It aims at understanding and appreciating the computing system’s functional components, their characteristics,
1 Week 1: The History of Computing (PART II) READING: Chapter 1.
EE141 © Digital Integrated Circuits 2nd Introduction 1 Digital Integrated Circuits A Design Perspective Introduction Jan M. Rabaey Anantha Chandrakasan.
EE141 © Digital Integrated Circuits 2nd Introduction 1 EE4271 VLSI Design Dr. Shiyan Hu Office: EERC 731 Adapted and modified from Digital.
SPRING 2012 Assembly Language. Definition 2 A microprocessor is a silicon chip which forms the core of a microcomputer the concept of what goes into a.
Integration Lower sums Upper sums
ECE484 VLSI Digital Circuits Fall 2016 Lecture 01: Introduction
William Stallings Computer Organization and Architecture 6th Edition
CS203 – Advanced Computer Architecture
Lynn Choi School of Electrical Engineering
Introduction to VLSI ASIC Design and Technology
CIT 668: System Architecture
Morgan Kaufmann Publishers
ECE 154A Introduction to Computer Architecture
Architecture & Organization 1
EE 4611 INTRODUCTION 21 January 2015 Semiconductor Industry Milestones
Lecture 2: Performance Today’s topics: Technology wrap-up
Architecture & Organization 1
BIC 10503: COMPUTER ARCHITECTURE
Parallel Processing Sharing the load.
Overview of VLSI 魏凱城 彰化師範大學資工系.
Transistors on lead microprocessors double every 2 years Moore’s Law in Microprocessors Transistors on lead microprocessors double every 2 years.
Chapter 1 Introduction.
HIGH LEVEL SYNTHESIS.
Energy Efficient Power Distribution on Many-Core SoC
Introduction EE4271 VLSI Design Professor Shiyan Hu Office: EERC 518
The University of Adelaide, School of Computer Science
Presentation transcript:

1 VLSI and Computer Architecture Trends ECE 25 Fall 2012

2 A Brief History 1958: First integrated circuit – Flip-flop using two transistors – From Texas Instruments 2011 – Intel 10 Core Xeon Westmere-EX 2.6 billion transistors 32 nm process Courtesy Texas Instruments Courtesy Intel

3 Moore’s Law Historical growth rate – 2x transistors & clock speeds every 2 years over 50 years – 10x every 6-7 years Dramatically more complex algorithms previously not feasible – Dramatically more realistic video games and graphics animation (e.g. Playstation 4, Xbox 360 Kinect, Nintendo Wii) – 1 Mb/s DSL to 10 Mb/s Cable to 2.4 Gb/s Fiber to Homes – 2G to 3G to 4G wireless communications – MPEG-1 to MPEG-2 to MPEG-4 to H.264 video compression – 480 x 270 (0.13 million pixels) NTSC to 1920x1080 (2 million pixels) HDTV resolution to 2880x1800 (5 million pixels) Retina Display

4 Standard Cells NOR-3XOR-2

5 Standard Cell Layout

6 NVIDIA GeForce 8800 (600+ million transistors, about 60+ million gates)

7 NVIDIA Kepler2 GK104 GPU (1536 cores, 3.54 billion transistors)

8 Subwavelength Lithography Challenges Source: Raul Camposano, 2003

9 NRE Mask Costs Source: MIT Lincoln Labs, M. Fritze, October 2002

10 ASIC NRE Costs Not Justified for Many Applications A complex ASIC will have an NRE Cost of over $40M = $28M (NRE Design Cost) + $12M (NRE Mask Cost) Many “ASIC” applications will not have the volume to justify a $40M NRE cost e.g. a $30 IC with a 33% margin would require sales of 4M units (x $10 profit/IC) just to recoup $40M NRE Cost

11 Power Density a Key Issue Motivated mainly by power limits P total = P dynamic + P leakage P dynamic = ½  C V DD 2 f Problem: power (heat dissipation) density has been growing exponentially because clock frequency (f) and transistor count have been doubling every 2 years

12 Power Density a Key Issue Had scaling continued at previous pace, by 2005, high speed processors would have power density of nuclear reactor by 2005, a rocket nozzle by 2010, and would become the power density of the sun’s by Courtesy: Intel

13 Before Multicore Processors e.g. Intel Itanium II – 6-Way Integer Unit < 2% die area – Cache logic > 50% die area Most of chip there to keep these 6 Integer Units at “peak” rate Main issue is external DRAM latency (50ns) to internal clock (0.25ns) is 200:1 Increase performance by higher clock frequency and more complex pipelining & speculative execution INT6 Cache logic

14 Multicore Era Multicore era – Operate at lower voltage and lower clock frequency – Simpler processor cores – Increase performance by more cores per chip e.g. Intel 10 Core Xeon Westmere-EX – GHz (vs. previous Xeons at 4 Ghz) 1 core

15 Embedded Multicore Processors Embedded multicore processors replacing ASICs – Much simpler processor cores, much smaller caches e.g. Tilera-GX: 100 processors

16 What Does the Future Look Like? Corollary of Moore’s law: Number of cores will double every 18 months ‘05‘08‘11‘ ‘02 16 Research Industry Source: MIT, A. Agrawal, 2009

17 ITRS Roadmap Semiconductor Industry Association forecast – Intl. Technology Roadmap for Semiconductors

18 Power Revisited P total = P dynamic + P leakage P dynamic = ½  C V DD 2 f Historically, P leakage was negligible, P dynamic dominated. Power could be controlled by reducing V DD (which used to be 5V, now about 1V). Lowering V DD requires lowering threshold voltage, but P leakage becomes more dominant, which is why we are not reducing V DD much any more, or very slowly.

19 Classical scaling Device count S 2 Device frequency S Device power (cap) 1/S Device power (V DD ) 1/S 2 Utilization1 Leakage limited scaling Device count S 2 Device frequency S Device power (cap)1/S Device power (V DD )~1 Utilization1/S 2 The Utilization Wall (Source: S. Swanson, M. Taylor 2010) Scaling theory – S = – Exponentially increasing problem!

20 Age of “Dark Silicon” (Source: S. Swanson, M. Taylor 2010) 4 3 GHz 4 2x3 GHz (12 cores dark) 2x4 3 GHz (8 cores dark) (Industry’s Choice).… 65 nm 32 nm.… Spectrum of tradeoffs between # cores and frequency. e.g.; take 65 nm  32 nm; i.e. (s =2)

21 Moving Back to Specialized Silicon? “Traditional” Operating System – Provide many “modules” in software that “applications” call at run-time. – OS modules only “loaded” in memory when used. – Most OS modules unused at any moment. Moving OS into silicon? – Power more expensive than area. – Specialized logic can improve energy efficiency x. – Possible idea: move power-intensive OS modules into silicon or build specialized hardware accelerators. – Most of these “silicon modules” would be “dark”: only “lite-up” when called. – In another words, “waste” silicon to save power. Dark Silicon Source: S. Swanson, M. Taylor, GreenDroid Project, 2010

22 Questions?