 C. H. Ziesler et al., 2003 Energy Recovering Computers 1 Advanced Computer Architecture Laboratory University of Michigan Conrad H. Ziesler 1 Joohee.

Slides:



Advertisements
Similar presentations
Topics Electrical properties of static combinational gates:
Advertisements

A Resonant Clock Generator for Single-Phase Adiabatic Systems Conrad H. Ziesler Marios C. Papaefthymiou University of Michigan, Ann Arbor, MI Suhwan Kim.
Keeping Hot Chips Cool Ruchir Puri, Leon Stok, Subhrajit Bhattacharya IBM T.J. Watson Research Center Yorktown Heights, NY Circuits R-US.
CMOS Family.
Introduction to Digital Systems By Dr. John Abraham UT-Panam.
Chapter 09 Advanced Techniques in CMOS Logic Circuits
A Look at Chapter 4: Circuit Characterization and Performance Estimation Knowing the source of delays in CMOS gates and being able to estimate them efficiently.
from High-frequency Clocks using DC-DC Converters
 C. H. Ziesler etal., 2003 Energy Recovering ASIC Design Advanced Computer Architecture Laboratory Department of Electrical Engineering and Computer Science.
Design, Verification, and Test of True Single-Phase Adiabatic Multiplier Suhwan Kim IBM Research Division T. J. Watson Research Center, Yorktown Heights.
S. Reda EN160 SP’08 Design and Implementation of VLSI Systems (EN1600) Lecture 14: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
Introduction to CMOS VLSI Design SRAM/DRAM
Die-Hard SRAM Design Using Per-Column Timing Tracking
Institute of Digital and Computer Systems 1 Fabio Garzia / Finding Peak Performance in a Process23/06/2015 Chapter 5 Finding Peak Performance in a Process.
Memory and Advanced Digital Circuits 1.
Towards An Efficient Low Frequency Energy Recovery Dynamic Logic Sujay Phadke Advanced Computer Architecture Lab Department of Electrical Engineering and.
1 High Speed Fully Integrated On-Chip DC/DC Power Converter By Prabal Upadhyaya Sponsor: National Aeronautics and Space Administration.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 13: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
Microwave Interference Effects on Device,
Lecture 13, Slide 1EECS40, Fall 2004Prof. White Lecture #13 Announcements You should now purchase the reader EECS 40: Introduction to Microelectronics,
 C. H. Ziesler et al., 2003 Energy Recovery Design for Low-Power ASICs Conrad H. Ziesler 1 Joohee Kim 1 Suhwan Kim 2 Marios C. Papaefthymiou 1 1 Advanced.
Lecture 5 – Power Prof. Luke Theogarajan
Lecture 7: Power.
Lecture 21, Slide 1EECS40, Fall 2004Prof. White Lecture #21 OUTLINE –Sequential logic circuits –Fan-out –Propagation delay –CMOS power consumption Reading:
Pulse Width Modulation (PWM) LED Dimmer Circuit
Low Power Design and Adiabatic Circuits P.Ranjith M.Tech(ICT)
The CMOS Inverter Slides adapted from:
Digital Integrated Circuits for Communication
Digital Integrated Circuits© Prentice Hall 1995 Inverter THE INVERTERS.
ACES Workshop 3-4 March, 2009 W. Dabrowski Serial power circuitry in the ABC-Next and FE-I4 chips W. Dabrowski Faculty of Physics and Applied Computer.
Chapter 4 AC to AC Converters
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
ENGG 6090 Topic Review1 How to reduce the power dissipation? Switching Activity Switched Capacitance Voltage Scaling.
Modern VLSI Design 4e: Chapter 7 Copyright  2008 Wayne Wolf Topics Global interconnect. Power/ground routing. Clock routing. Floorplanning tips. Off-chip.
Low-Power CMOS Logic Circuit Topic Review 1 Part I: Overview (Shaw) Part II: (Vincent) Low-Power Design Through Voltage Scaling Estimation and Optimization.
Electrical Characteristics of Logic Gates Dr. Ashraf Armoush © 2010 Dr. Ashraf Armoush.
1 IN THE NAME GOD Advanced VLSI Class Presentation A 1.1GHz Charge Recovery Logic Insructor : Dr. Fakhrayi Presented by : Mahdiyeh Mehran.
Lecture 2 1 Computer Elements Transistors (computing) –How can they be connected to do something useful? –How do we evaluate how fast a logic block is?
Lecture # 12&13 SWITCHING-MODE POWER SUPPLIES
THE INVERTERS. DIGITAL GATES Fundamental Parameters l Functionality l Reliability, Robustness l Area l Performance »Speed (delay) »Power Consumption »Energy.
Logic Synthesis for Low Power(CHAPTER 6) 6.1 Introduction 6.2 Power Estimation Techniques 6.3 Power Minimization Techniques 6.4 Summary.
Chapter 07 Electronic Analysis of CMOS Logic Gates
Adiabatic Logic Circuit for Biomedical Applications Prepared by: Muhammad Arsalan Presented to: Dr. Maitham Shams.
Filip Tavernier Karolina Poltorak Sandro Bonacini Paulo Moreira
Chapter 4 Logic Families.
FPGA-Based System Design: Chapter 2 Copyright  2003 Prentice Hall PTR Topics n Off-chip connections.
Ratioed Circuits Ratioed circuits use weak pull-up and stronger pull-down networks. The input capacitance is reduced and hence logical effort. Correct.
Microelectronic Circuits - Fourth Edition Sedra/Smith 0 PowerPoint Overheads to Accompany Sedra/Smith Microelectronic Circuits 4/e ©1999 Oxford University.
Modern VLSI Design 2e: Chapter 3 Copyright  1998 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;
Adiabatic Logic as Low-Power Design Technique Presented by: Muaayad Al-Mosawy Presented to: Dr. Maitham Shams Mar. 02, 2005.
ECE 553: TESTING AND TESTABLE DESIGN OF DIGITAL SYSTEMS
Low Power – High Speed MCML Circuits (II)
Guy Lemieux, Mehdi Alimadadi, Samad Sheikhaei, Shahriar Mirabbasi University of British Columbia, Canada Patrick Palmer University of Cambridge, UK SoC.
Lecture 10: Circuit Families. CMOS VLSI DesignCMOS VLSI Design 4th Ed. 10: Circuit Families2 Outline  Pseudo-nMOS Logic  Dynamic Logic  Pass Transistor.
Modern VLSI Design 3e: Chapter 7 Copyright  1998, 2002 Prentice Hall PTR Topics n Power/ground routing. n Clock routing. n Floorplanning tips. n Off-chip.
Adiabatic Circuits Mohammad Sharifkhani. Introduction Applying slow input slopes reduces E below CV2 Useful for driving large capacitors (Buffers) Power.
Course: High-Speed and Low- Power VLSI (97.575) Professor: Maitham Shams Presentation: Presentation: True Single- Phase Adiabatic Circuitry By Ehssan.
EE141 © Digital Integrated Circuits 2nd Combinational Circuits 1 A few notes for your design  Finger and multiplier in schematic design  Parametric analysis.
Modern VLSI Design 3e: Chapter 3 Copyright  1998, 2002 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 6.1 EE4800 CMOS Digital IC Design & Analysis Lecture 6 Power Zhuo Feng.
Seok-jae, Lee VLSI Signal Processing Lab. Korea University
Click to edit Master title style Progress Update Energy-Performance Characterization of CMOS/MTJ Hybrid Circuits Fengbo Ren 05/28/2010.
Implementing Tile-based Chip Multiprocessors with GALS Clocking Styles Zhiyi Yu, Bevan Baas VLSI Computation Lab, ECE Department University of California,
1 Recap: Lecture 4 Logic Implementation Styles:  Static CMOS logic  Dynamic logic, or “domino” logic  Transmission gates, or “pass-transistor” logic.
1 Clockless Logic Montek Singh Thu, Mar 2, Review: Logic Gate Families  Static CMOS logic  Dynamic logic, or “domino” logic  Transmission gates,
Introduction to Linear Voltage Regulators Krishna Kishore Reddy K 2010H223084H.
High Speed Properties of Digital Gates, Copyright F. Canavero, R. Fantino Licensed to HDT - High Design Technology
Subject Name: Fundamentals Of CMOS VLSI Subject Code: 10EC56
POWER SEMICONDUCTOR DEVICES OVERVIEW
Presentation transcript:

 C. H. Ziesler et al., 2003 Energy Recovering Computers 1 Advanced Computer Architecture Laboratory University of Michigan Conrad H. Ziesler 1 Joohee Kim 1 Suhwan Kim 1 2 Marios C. Papaefthymiou 1 2 T. J. Watson Research Center IBM Research Division

 C. H. Ziesler et al., 2003 Conventional CMOS Operation _+_+ Vdd Vss R C R time Constant supply voltage One-way street to ground High voltage drops across conducting devices Energy dissipation grows with CV 2 Vdd IRIR VRVR PRPR VCVC ERER

 C. H. Ziesler et al., 2003 Charge Recovery Vpc R C time Time-varying voltage supply Vpc Charge recovered from load C Charge transfer through resistor is spread over available time Energy dissipation grows with (RC/T) CV 2 ~ Vpc IRIR VRVR PRPR ERER Vdd VCVC

 C. H. Ziesler et al., 2003 Quick Glance at History ● Physics (1970s) – Logical reversibility of computation – Connection to thermodynamics (“adiabatic” computing) – No absolute minimum to energy dissipation, if computing is arbitrarily slow. (Does not have to be slow, however.) ● Engineering (1990s) – Logic circuitry – Energy recycling circuitry – VLSI prototyping

 C. H. Ziesler et al., 2003 The Design Space Complex web of intertwined issues 1. Which capacitance to recover from? 2. How to store/reuse recovered energy? 3. What circuits to do the recovery?

 C. H. Ziesler et al., 2003 The case for reversible computation P. Solomon and D. Frank – SLPE'94 Asymptotically zero-energy split-level charge recovery logic S.G. Younis and T. F. Knight, Jr. – SPLE’94 Clock-powered CMOS: A hybrid adiabatic logic style for energy-efficient computing N. Tzartzanis and W. C. Athas – ARVLSI'99 And many, many others.... Few real working chips, however. Sample Design Points from’90s

 C. H. Ziesler et al., 2003 Our Contributions Multiplier chip (2001) Custom dynamic circuitry with fine-grain recovery from outputs of individual gates Discrete wavelet transform chip (2003) Standard-cell ASIC design with recovery of clock power Guiding principles – Simplicity – Minimal control and hardware overhead Key attributes – Single-phase sinusoidal waveform for power and synchronization – Integrated power-clock generators with inductive elements – Relatively high operating speeds (200MHz+ in silicon) – Higher energy efficiency than conventional counterparts

 C. H. Ziesler et al., 2003 Minimalist approach – Simple tools: magic and spice – Low-cost standard CMOS process: HP 0.5  m, 40-pin DIP package, through MOSIS. Operational chip demonstrates practicality of energy-recovering circuit design – Non-trivial size (8-bit operands, on-chip clock, self-test) – High throughput – Low energy dissipation Multiplier Chip Suhwan Kim (while still at UM) and Conrad Ziesler received First Prize in VLSI Design Contest, DAC 2001.

 C. H. Ziesler et al., 2003 Working 1 st silicon Simple yet effective experimental setup. Correct operation 130 MHz. Energy efficient operation validated by measurements. Low-Energy High-Speed Operation

 C. H. Ziesler et al., 2003 Correct Function 130MHz signature output

 C. H. Ziesler et al., 2003 Energy Comparison 2.7V 1.9V 2.2V 1.9V 1.6V 2.9V Frequency (MHz) Dissipation per Cycle (pJ) Energy recovery 4 stage static CMOS 2.3V 3.0V 2.0V 8 stage static CMOS 2 stage static CMOS 4x

 C. H. Ziesler et al., 2003 Simulations vs. Measurements Frequency (MHz) Discrepancy (%) Good correspondence up to 100MHz Simulations gave consistently higher dissipation

 C. H. Ziesler et al., 2003 Test Chip Overview Two multipliers with self-test per chip (minimum size die) Integrated power-clock generator Resonant LC oscillator

 C. H. Ziesler et al., 2003 Input BILBO (self-test) Product array Multiplicand buffers Result summation Result buffers Self-test Control Output BILBO (self-test) Multiplier and Self-Test 9,048 devices in multiplier array, 2,806 devices in self-test circuitry Implemented entirely in energy-recovering dynamic logic family.

 C. H. Ziesler et al., 2003 Energy-Recovering Dynamic Logic Sense amplifier Precharge diodes Current switches Evaluation tree Current tail Power clock Vss bias at bt afbf af ot NMOS NAND gate Power clock Single-phase sinusoidal power-clock Minimum-size low-swing evaluation tee Built-in state element Dual-rail noise-tolerant design

 C. H. Ziesler et al., 2003 Cascade Operation NMOS and PMOS gates are cascaded like domino logic. Animation shows tokens propagating through buffer chain.

 C. H. Ziesler et al., 2003 Zero-voltage switching LC- resonant clock generation External/bondwire inductor L Resistive/capacitive adiabatic load Compact: 170 x 115  m Single-Phase Power-Clock Generator Power Clock Pulse Gen Ring Osc Gate Drive Power Switches: S1, S2 25 tr. 19 tr. 10 tr. Ring Osc. Pulse Gen. Gate Drive _+_+ _+_+ load S2 S1 PC L Vdd Vss Vbp Vbn

 C. H. Ziesler et al., 2003 Switch Timings Inductor current builds linearly when switches are on. Peak switch current less than peak inductor current. Switch S1 turned on at positive voltage peak. Switch S2 turned on at negative voltage peak. Fixed “on-window” controlled by pulse generator. Inductor current Output voltage

 C. H. Ziesler et al., 2003 Single-phase sinusoidal ~60pF load, ~10nH external inductor One DC supply (Vdd, Vss), two DC biases (NMOS, PMOS) Power-Clock Waveform Vdd Vss PMOS Bias NMOS Bias Power-Clock

 C. H. Ziesler et al., 2003 The Automation Question

 C. H. Ziesler et al., 2003 Resonant Clock ASIC Compatible with ASIC flow Synthesized by Conrad Ziesler and Joohee Kim using in-house standard-cells library and commercial tools Energy recovering clock tree and SRAM word/bit lines Low-cost bulk CMOS process: TSMC 0.25  m, 108-pin PGA package, through MOSIS High frequency (300MHz) Low voltage ( V)

 C. H. Ziesler et al., 2003 Chip Microphotograph

 C. H. Ziesler et al., 2003 System Overview

 C. H. Ziesler et al., 2003 The Energy Recovering Flip-Flop Clock signal: Single-phase resonant sinusoid Probe activates state element only if next state differs from present state. Low voltage operation at high speeds Delay similar to conventional flip-flops Fully compatible with standard-cell ASIC flow 16 transistors 84  m 2 probestate element

 C. H. Ziesler et al., 2003 Flip-Flop Operation

 C. H. Ziesler et al., 2003 Flip-Flop Power Characterization Order of magnitude difference between idle (D, Q constant) and active (D, Q changing) dissipation. frequency (MHz) Flip-Flip Energy per Cycle energy (fJ)

 C. H. Ziesler et al., 2003 Resonant Clock Generator Resonate entire clock capacitance with small inductor Pump resonant system with NMOS switch at appropriate times NMOS switch only conducts incremental losses whenever “on” NMOS Switch Control Pre-driver Driver

 C. H. Ziesler et al., 2003 Clock Generator Operation

 C. H. Ziesler et al., 2003 Recovering vs. Conventional Hardware Synthesized dual-mode ASIC – conventional – energy recovery Dual-mode flip-flop cell Conventional clock tree with conventional flip-flop Resonant clock tree with energy-recovering flip-flop Direct comparison of dissipation at target throughput using identical hardware structures

 C. H. Ziesler et al., 2003 ASIC Statistics Discrete wavelet transform (DWT) 3,897 gates, 413 ffs 15,571 transistors 400  m x 900  m 13.6 pF, 21 nH 300 MHz, 1.5V 0.25  m logic process Dual-mode DWT Clock generator

 C. H. Ziesler et al., 2003 Simulation Results Total system dissipation Conventional mode includes clock tree Energy recovery mode includes on-chip clock generator 300MHz, 1.5VEnergy RecoveryConventional Idle6.74pJ 29.72pJ Active 68.47pJ 78.28pJ Per Cycle Energy Dissipation

 C. H. Ziesler et al., 2003 Lab Setup

 C. H. Ziesler et al., 2003 Correct Function 115MHz 5nH high-Q external inductor signature output (conventional) signature output (energy recovery)

 C. H. Ziesler et al., 2003 Self-Resonance at 225MHz No external inductor! Clock resonates using parasitic inductance of package Lossy parasitics limit power savings 200,000 intervals signature output (energy recovery) buffered power clock 8ns/div 4  s/div

 C. H. Ziesler et al., 2003 Preliminary Power Measurements 115MHz, 1.5VEnergy RecoveryConventional Idle2.6mW 4.3mW Active 8.9mW 11.7mW Total system measurements by monitoring DC supplies – Conventional mode includes clock tree – Energy recovery mode includes on-chip clock generator Not included: I/O supply, test circuitry, ring oscillator Energy recovery data is conservative (obtained when simultaneously driving two DWT cores, one always in idle)

 C. H. Ziesler et al., 2003 Working silicon that realizes the potential of energy recovery – power savings over conventional CMOS – operating speeds in the 100s of Mhz – low control and hardware overhead – integrated power-clock generators Multiplier chip in 0.5  m CMOS – Fine-grain recovery at the level of individual gate output – True singe-phase energy recovering dynamic logic – 2—4x energy savings over static CMOS in 50—200MHz DWT chip in 0.25  m CMOS – ASIC-compatible design flow – Correct operation all the way up to self-resonance at 225MHz – Direct hardware comparison with dual-mode chip: 60—75% of conventional at 115MHz (measurement) 1.2—4x savings at 300MHz (simulation) Summary

 C. H. Ziesler et al., 2003 Ongoing Research Continue chip testing –Break 300MHz barrier by bonding chip onto PCB –Test SRAM Experimentally assess EM noise –Clock EM power is centered around the resonant frequency of the sinusoid Scaling –How large a clock network with single-phase sinusoid? –Skew/jitter? Ultra high speed (multiple GHz) –How fast with current clocking scheme? –Recovery circuitry with self-resonating clock network.

 C. H. Ziesler et al., 2003 Links and Acknowledgments Web Site Energy recovery group Suhwan Kim (emeritus) Conrad Ziesler Joohee Kim Juang-Ying Chueh Visvesh Sathe Funded in part by U.S. Army Research Office and DARPA DAAG , DAAD , N C-8059