Caltech CS184 Winter2005 -- DeHon 1 CS184: Computer Architecture (Structure and Organization) Day 7: January 21, 2005 Energy and Power.

Slides:



Advertisements
Similar presentations
Topics Electrical properties of static combinational gates:
Advertisements

Keeping Hot Chips Cool Ruchir Puri, Leon Stok, Subhrajit Bhattacharya IBM T.J. Watson Research Center Yorktown Heights, NY Circuits R-US.
Power Reduction Techniques For Microprocessor Systems
Elettronica T A.A Digital Integrated Circuits © Prentice Hall 2003 Inverter CMOS INVERTER.
Penn ESE534 Spring Mehta & DeHon 1 ESE534 Computer Organization Day 6: February 12, 2014 Energy, Power, Reliability.
1 A Variation-tolerant Sub- threshold Design Approach Nikhil Jayakumar Sunil P. Khatri. Texas A&M University, College Station, TX.
11/5/2004EE 42 fall 2004 lecture 281 Lecture #28 PMOS LAST TIME: NMOS Electrical Model – NMOS physical structure: W and L and d ox, TODAY: PMOS –Physical.
Lecture #25 Timing issues
Towards An Efficient Low Frequency Energy Recovery Dynamic Logic Sujay Phadke Advanced Computer Architecture Lab Department of Electrical Engineering and.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 7: January 31, 2007 Energy and Power.
Lecture 5 – Power Prof. Luke Theogarajan
Spring 2007EE130 Lecture 39, Slide 1 Lecture #39 ANNOUNCEMENTS Late projects will be accepted –by 1:10PM Monday 4/30: 20 pt penalty –by 1:10PM Friday 5/4:
Lecture 7: Power.
Power-Aware Computing 101 CS 771 – Optimizing Compilers Fall 2005 – Lecture 22.
S. RossEECS 40 Spring 2003 Lecture 24 Today we will Review charging of output capacitance (origin of gate delay) Calculate output capacitance Discuss fan-out.
Low Power Design and Adiabatic Circuits P.Ranjith M.Tech(ICT)
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
6.893: Advanced VLSI Computer Architecture, September 28, 2000, Lecture 4, Slide 1. © Krste Asanovic Krste Asanovic
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 32: November 24, 2010 Uncorrelated Noise.
Low-Power CMOS Logic Circuit Topic Review 1 Part I: Overview (Shaw) Part II: (Vincent) Low-Power Design Through Voltage Scaling Estimation and Optimization.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 19: October 16, 2013 Energy and Power.
1 IN THE NAME GOD Advanced VLSI Class Presentation A 1.1GHz Charge Recovery Logic Insructor : Dr. Fakhrayi Presented by : Mahdiyeh Mehran.
EE415 VLSI Design DYNAMIC LOGIC [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
A Class Presentation for VLSI Course by : Fatemeh Refan Based on the work Leakage Power Analysis and Comparison of Deep Submicron Logic Gates Geoff Merrett.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 19: October 15, 2014 Energy and Power.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 28: November 16, 2012 Memory Periphery.
Basics of Energy & Power Dissipation Lecture notes S. Yalamanchili, S. Mukhopadhyay. A. Chowdhary.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 18: October 13, 2014 Energy and Power.
Adiabatic Logic as Low-Power Design Technique Presented by: Muaayad Al-Mosawy Presented to: Dr. Maitham Shams Mar. 02, 2005.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 37: December 8, 2010 Adiabatic Amplification.
Chapter 1 Combinational CMOS Logic Circuits Lecture # 4 Pass Transistors and Transmission Gates.
VLSI Design Lecture 5: Logic Gates Mohammad Arjomand CE Department Sharif Univ. of Tech. Adapted with modifications from Wayne Wolf’s lecture notes.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 18: October 14, 2013 Energy and Power.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 17: October 19, 2011 Energy and Power.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 28: November 16, 2011 Memory Periphery.
Basics of Energy & Power Dissipation
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 6: January 19, 2005 VLSI Scaling.
Day 16: October 6, 2014 Inverter Performance
© Digital Integrated Circuits 2nd Inverter Digital Integrated Circuits A Design Perspective The Inverter Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 38: December 10, 2010 Energy and Computation.
Adiabatic Circuits Mohammad Sharifkhani. Introduction Applying slow input slopes reduces E below CV2 Useful for driving large capacitors (Buffers) Power.
FPGA-Based System Design: Chapter 2 Copyright  2004 Prentice Hall PTR Topics n Logic gate delay. n Logic gate power consumption. n Driving large loads.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 6: January 22, 2003 VLSI Scaling.
EE415 VLSI Design THE INVERTER [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 6.1 EE4800 CMOS Digital IC Design & Analysis Lecture 6 Power Zhuo Feng.
Penn ESE534 Spring DeHon 1 ESE534 Computer Organization Day 19: March 28, 2012 Minimizing Energy.
Dynamic Logic.
EE141 Combinational Circuits 1 Chapter 6 (I) Designing Combinational Logic Circuits Dynamic CMOS LogicDynamic CMOS Logic V1.0 5/4/2003.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 19: October 19, 2012 Ratioed Logic.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 4: September 12, 2012 Transistor Introduction.
CS203 – Advanced Computer Architecture
Adiabatic Technique for Energy Efficient Logic Circuits Design
Lecture 08: Pass Transistor Logic
Penn ESE534 Spring DeHon 1 ESE534 Computer Organization Day 8: February 10, 2010 Energy, Power, Reliability.
Smruti R. Sarangi IIT Delhi
Day 29: November 18, 2011 Dynamic Logic
Reading: Hambley Ch. 7; Rabaey et al. Sec. 5.2
Day 16: September 15, 2010 Energy and Power
Day 19: October 24, 2011 Ratioed Logic
Notes on Diodes 1. Diode saturation current:  
Day 17: October 18, 2010 (Energy) Ratioed Logic
Day 17: October 15, 2012 Energy and Power Basics
Circuit Design Techniques for Low Power DSPs
Day 25: November 7, 2011 Registers
Day 21: October 29, 2010 Registers Dynamic Logic
Day 2: September 10, 2010 Transistor Introduction
ESE534 Computer Organization
Day 18: October 20, 2010 Ratioed Logic Pass Transistor Logic
Lecture 7: Power.
Power and Heat Power Power dissipation in CMOS logic arises from the following sources: Dynamic power due to switching current from charging and discharging.
Presentation transcript:

Caltech CS184 Winter DeHon 1 CS184: Computer Architecture (Structure and Organization) Day 7: January 21, 2005 Energy and Power

Caltech CS184 Winter DeHon 2 Today Energy Tradeoffs? Voltage limits and leakage? Thermodynamics meets Information Theory Adiabatic Switching

Caltech CS184 Winter DeHon 3 At Issue Many now argue power will be the ultimate scaling limit –(not lithography, costs, …) Proliferation of portable and handheld devices –…battery size and life biggest issues Cooling, energy costs may dominate cost of electronics

Caltech CS184 Winter DeHon 4 What can we do about it?  gd =Q/I=(CV)/I I d =(  C OX /2)(W/L)(V gs -V TH ) 2

Caltech CS184 Winter DeHon 5 Tradeoff E  V 2  gd  1/V Can trade speed for energy E×(  gd ) 2  constant Martin et al. Power-Aware Computing, Kluwer

Caltech CS184 Winter DeHon 6 Questions How far can this go? –(return to later in lecture) What do we do about slowdown?

Caltech CS184 Winter DeHon 7 Parallelism We have Area-Time tradeoffs Compensate slowdown with additional parallelism …trade Area for Energy  Architectural Option

Caltech CS184 Winter DeHon 8 Ideal Example Perhaps: 1nJ/32b Op, 10ns cycle Cut voltage in half 0.25nJ/32b Op, 20ns cycle Two in parallel to complete 2ops/20ns 75% energy reduction

Caltech CS184 Winter DeHon 9 Power Density Constrained Example Logic Density: 1 foo-op/mm 2 Energy cost: 10GHz Cooling limit: 100W/cm 2 How many foo-ops/cm 2 /s? –10nJ/mm 2 x 100mm 2 /cm 2 =1000nJ/cm 2 –  top speed 100MHz –100M x 100 foo-ops = foo-ops/cm 2 /s

Caltech CS184 Winter DeHon 10 What can we support?

Caltech CS184 Winter DeHon 11 (Pushing through the Math)

Caltech CS184 Winter DeHon 12 Improved Power How many foo-ops/cm 2 /s? –2GHz x 100 foo-ops = 2 ×10 11 foo-ops/cm 2 /s –[vs. 100M x 100 foo-ops = foo-ops/cm 2 /s]

Caltech CS184 Winter DeHon 13 How far?

Caltech CS184 Winter DeHon 14 Limits Ability to turn off the transistor Noise Parameter Variations

Caltech CS184 Winter DeHon 15 Sub Threshold Conduction To avoid leakage want I off very small Use I on for logic – determines speed Want I on /I off large [Frank, IBM J. R&D v46n2/3p235]

Caltech CS184 Winter DeHon 16 Sub Threshold Conduction S  90mV for single gate S  70mV for double gate 4 orders of magnitude I VT /I off  V T >280mV [Frank, IBM J. R&D v46n2/3p235]

Caltech CS184 Winter DeHon 17 Thermodynamics

Caltech CS184 Winter DeHon 18 Lower Bound? Reducing entropy costs energy Single bit gate output –Set from previous value to 0 or 1 –Reduce state space by factor of 2 –Entropy:  S= k×ln(before/after)=k×ln2 –Energy=T  S=kT×ln(2) Naively setting a bit costs at least kT×ln(2)

Caltech CS184 Winter DeHon 19 Numbers (ITRS 2001) kT×ln(2) = 2.87× J (at R.T K=300) 0.002fJ = 2× J

Caltech CS184 Winter DeHon 20 Sanity Check CV 2 =2× J V=0.4V Q=CV=5× columbs e=1.6× columbs Q=30 electrons? Energy in  particle? –10 5 —10 6 electrons?

Caltech CS184 Winter DeHon 21 Hmm… CV 2 =2× J 300,000 nm 2 /bit (22nm)  3×10 6 gates/mm 2 6× J/mm 2 6× J/cm 2

Caltech CS184 Winter DeHon 22 Recycling… Thermodynamics only says we have to dissipate energy if we discard information Can we compute without discarding information? Can we use this?

Caltech CS184 Winter DeHon 23 Three Reversible Primitives

Caltech CS184 Winter DeHon 24 Universal Primitives These primitives –Are universal –Are all reversible If keep all the intermediates they produce –Discard no information –Can run computation in reverse

Caltech CS184 Winter DeHon 25 Cleaning Up Can keep “erase” unwanted intermediates with reverse circuit

Caltech CS184 Winter DeHon 26 Thermodynamics In theory, at least, thermodynamics does not demand that we dissipate any energy (power) in order to compute

Caltech CS184 Winter DeHon 27 Adiabatic Switching

Caltech CS184 Winter DeHon 28 Two Observations 1.Dissipate power through on-transistor charging capacitance 2.Discard capacitor charge at end of cycle

Caltech CS184 Winter DeHon 29 Charge Cycle Charging capacitor  Q=CV  E=QV  E=CV 2  Half in capacitor, half dissipated in pullup [Athas/Koller/Svensoon, USC/ISI ACMOS-TR ]

Caltech CS184 Winter DeHon 30 Adiabatic Switching Current source charging: –Ramp supplies slowly so supply constant curret  P=I 2 R  E total =P*T  Q=IT=CV  I=CV/T  E total =I 2 R*T=(CV/T) 2 R*T  E total =I 2 R*T=(RC/T) CV 2

Caltech CS184 Winter DeHon 31 Impact of Adiabatic Switching  E total =I 2 R*T=(RC/T) CV 2  RC=  gd  E total  (  gd /T)  Without reducing V  Can trade energy and time  E×T=constant

Caltech CS184 Winter DeHon 32 Adiabatic Discipline Never turn on a device with a large voltage differential across it. P=  V 2 /R

Caltech CS184 Winter DeHon 33 SCRL Inverter  ’s, nodes, at Vdd/2 P1 at ground Slowly turn on P1 Slow split  ’s Slow turn off P1’s Slow return  ’s to Vdd/2 [Younis/Knight ISLPED(?) 1994]

Caltech CS184 Winter DeHon 34 SCRL Inverter Basic operation –Set inputs –Split rails to compute output adiabatically –Isolate output –Bring rails back together Have transferred logic to output Still need to worry about resetting output adiabatically

Caltech CS184 Winter DeHon 35 SCRL NAND Same basic idea works for nand gate –Set inputs –Adiabatically switch output –Isolate output –Reset power rails

Caltech CS184 Winter DeHon 36 SCRL Cascade Cascade like domino logic –Compute phase 1 –Compute phase 2 from phase 1… How do we restore the output?

Caltech CS184 Winter DeHon 37 SCRL Pipeline We must uncompute the logic –Forward gates compute output –Reverse gate restore to Vdd/2

Caltech CS184 Winter DeHon 38 SCRL Pipeline P1 high (F1 on; F1 inverse off)  1 split: a=F1(a0)  2 split: b=F2(F1(a0)) F2 -1 (F2(F1(a0))=a P1 low – now F2 -1 drives a F1 restore by  1 converge …restore F2 Use F2 -1 to restore a to Vdd/2 adiabatically

Caltech CS184 Winter DeHon 39 SCRL Rail Timing

Caltech CS184 Winter DeHon 40 SCRL Requires Reversible Gates to uncompute each intermediate All switching (except IO) is adiabatic Can, in principle, compute at any energy

Caltech CS184 Winter DeHon 41 Trickiness Generating the ramped clock rails Use LC circuits Need high-Q resonators Making this efficient is key to practical implementation

Caltech CS184 Winter DeHon 42 Big Ideas Can trade time for energy –…area for energy Noise and subthreshold conduction limit voltage scaling Thermodynamically admissible to compute without dissipating energy Adiabatic switching alternative to voltage scaling Can base CMOS logic on these observations