We think you have liked this presentation. If you wish to download it, please recommend it to your friends in any social system. Share buttons are a little bit lower. Thank you!
Presentation is loading. Please wait.
Published byLandon Filbin
Modified over 2 years ago
1 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.Copyright © 2011 SuVolta, Inc. All rights reserved. 1 Device Considerations for Low Power VLSI Circuits EDS Silicon Valley Chapter David Kidd Senior Director of Digital Design
2 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Transistor Variability Limits Chips Impact on Mobile System on Chip (SOC) Limited Low Power Design Techniques Where does Variability come from? New Transistor Alternatives to Reduce Variability Deeply Depleted Channel (DDC) technology Silicon Impact Outlook Taking advantage of Deeply Depleted Channel (DDC) technology in Mobile SOC Overview
3 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Multiple blocks with different performance requirements Integrated on the same die Different power modes – would like to run at different supplies Multiple V T transistors used to control leakage Single chip solution requires analog integration Need co-design of architecture, circuits and transistor technology for best solution What is needed in Mobile System on Chip?
4 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Limited benefit using voltage scaling (DVFS) Cannot overdrive much due to reliability and power restrictions Dynamically lowering voltage limited to 100-200mV Only lowering frequency leaves large leakage power “Run to hold” beats DVFS despite overhead Finicky SRAM memories High SRAM V MIN leaves no room for memory voltage scaling Many circuit tricks to improve V MIN and noise margins Design teams moved to dedicated power rail for SRAM Works for CPU – difficult in GPU Impacts power network integrity – more fluctuations Transistor variability limits chips Variability Limits Design & Architecture
5 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Global/Systematic/Manufacturing Variation Shifts all the transistors similarly Longer/shorter transistor lengths More (or less) implant energy and dose Will result in speed/power distribution Local/Random Variation Transistor next to each other vary widely Small number of dopants in transistor channel Random Dopant Fluctuation (RDF) Apparent in threshold voltage mismatch (σV T ) Impacts speed, leakage, SRAM & Analog Industry solution: Remove RDF using Undoped Channel What is the right silicon roadmap going forward? Transistor Variation Source of Chip Variation [#Transistors] Useable Yield Too slowToo hot
6 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. FinFET or TriGate Promises high drive current Manufacturing, cost, and IP challenge Doped channel to enable multi V T FDSOI Showing off undoped channel benefits Good body effect, but lack of multi V T capability Restricted supply chain DDC – Deeply Depleted Channel transistor Straight forward insertion into Bulk Planar CMOS Undoped channel to reduce random variability Good body effect and multi V T transistors Transistor Alternatives Source: IMEC Source: Fujitsu Textbook FinFET Source: GSS, Chipworks Intel TriGate
7 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Undoped or very lightly doped region Significantly reduced transistor random variability V T Lower leakage Better SRAM (I READ, lower V min & V ret ) Tighter corners Smaller area analog design Higher channel mobility (increased I eff, lower DIBL) Higher speed, improved voltage scaling V T setting offset region Enables multiple threshold voltages Screening region Strong body coefficient Bias bodies to tighten manufacturing distribution Body biasing to compensate for temperature and aging Deeply Depleted Channel™ (DDC) Transistor 123 3 2 1 Benefits similar to FinFET in planar bulk CMOS *Example implementation
8 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Wells and VT Poly Spacers and LDDs Salicide Gate Metal Layers Blanket Epi STI Poly Spacers and LDDs Salicide Gate Metal Layers Wells and VT Foundry Standard Flow SuVolta Flow (example) New No new materials / No new tools STI Bulk Planar Foundry-Compatible Process
9 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Presented at IEDM 2011 TEM of DDC Transistor and STI 43.1nm S D
10 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Lower Transistor Variability Reduces Leakage Transistor variability is reflected in threshold voltage (V T ) distribution Leakage current is exponentially dependent on V T Lower V T variability ( V T ) reduces number of leaky low V T devices Power dissipation is dominated by low V T edge of distribution Smaller V T Less leakage power for digital and memory/SRAM High leakage tail High leakage tail dominates power 2.7x higher power (Model using 85mV subV T slope) High V T tail Slows down ICs [#Transistors] [Leakage Power] 65nm Silicon SRAM V T
11 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Nominal (TT) ring oscillator speed expected to be 400ps (A) Equivalent to having many similar critical paths in a chip V T variation will randomly affect paths within the same die limiting speed to 470ps Undoped channel reduces variability and increases mobility (B) 25% faster mean, 30% faster tail due to tighter distribution To match performance lower V DD until tails have same speed (C) Large impact on power due square dependence P=CV 2 f +IV Lower Transistor Variability Improves Speed 65nm Silicon Measurement (A) (B) (C)
12 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. SRAM memories built using 6-T SRAM cell Smallest transistors on every chip, worst V T mismatch Higher V DD is required to avoid failures SRAM blocks limit V DD scaling V min – lowest operating voltage limited by transistor mismatch Demonstrated SRAM to V min of 0.425V Standard SRAM macros No circuit “tricks” for low voltage operation Demonstrates potential for 50% voltage scaling Sub 0.5V V Min is Possible 300 mV Tester limit Industry Norm DDC
13 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. 40-60% improved matching for SRAM transistors Improved V T Matching Key for Low V MIN Baseline DDC Baseline DDC Baseline DDC Baseline DDC Presented at IEDM 2011
14 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. In analog circuits, matching is key Large transistors used to improve relative variability in current mirrors, differential pairs, etc. Better transistor matching allows for Area savings Higher performance Lower power Undoped channel improves R OUT higher gain OpAmp stage Matched bandwidth, gain, slewrate Over 50% smaller area (84 vs 190μm 2 ) 45% better input noise (176 vs 327 μV) Bandgap reference circuit Same accuracy achieved at half the size Design Examples Analog 450um 420um 210um 450um Baseline DDC Baseline DDC 10.9x17.4um 6.9x12.2um SuVolta sample layout
15 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Body Bias to fix systematic variation Speed-up (forward bias - FBB) slow parts Cool down (reverse bias - RBB) hot parts Increase manufacturing yield Body bias enables multiple modes of operation Active minimize power at every performance Standby leakage reduction, power gating DDC transistor provides 2-4x larger body factor Better Chips with Body Biasing Useable Yield Too slowToo hot RBBFBB TCAD prediction
16 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Inverter ring-oscillators (RO) fabricated at process corners Baseline @ 1.2V V DD and DDC @ 0.9V V DD For each corner, DDC transistor RO is faster and lower power Using strong body coefficient to pull in corners Half the power (50% less power) while matching speed Half the Power at Matched Performance 65nm Silicon Measurement Baseline speed 50% power 100% power SS TT FF TT
17 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Better process control leads to tighter corners Manufacturing flow further reduces layout effects 1 sigma tighter wafer to wafer and within wafer variation for DDC Less overdesign as max paths and min (hold) paths are closer Faster design closure earlier tapeout shorter TTM Tighter Manufacturing Corners w/ DDC 11 22 33 V DD =1.2V V DD =0.9V POR 65nm Silicon Measurement
18 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Achieve half the speed at 1/6 the power @0.6V V DD Use body bias to compensate for temperature and aging Critical for low V DD operation Enable workable design window – avoid overdesign Voltage Scaling to 0.6V V DD 65nm Silicon Measurement Baseline DDC FF TT SS -83%
19 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Turbo Mode: DDC transistor achieves over 50% speedup @ 1.2V V DD All corners for DDC run at 580MHz vs 370MHz for baseline This is HotChips – Go Faster! DVFSBaselineDDC V DD 1.2V0.6V0.9V1.05V1.2V Speed10.511.281.56 Power10.170.5211.51 65nm Silicon Measurement
20 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Same performance at 0.75V V DD as baseline at 0.9V V DD 30% lower power Alternatively 25% faster at same voltage Even better when using body bias to pull in corners 28nm and Beyond (silicon calibrated SPICE simulations)
21 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. CPU: Single thread performance critical Push frequency by temporarily raising voltage in turbo mode DVFS with body biasing becomes DVBFS GPU: High number of cores using small transistors Less overdesign due to lower delay variability Increase parallelism, lower voltage, body bias dynamically for more pixels/Watt Lower frequency blocks In addition to high V T transistors also run at lower voltage and optimal body bias Whole chip: Use body bias to adjust for manufacturing variation Take advantage of improved memory and analog performance Lowering variability while compatible with existing bulk planar silicon IP Applying DDC to Lower Variability in Mobile SOC
22 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Variability limits chips DDC transistor reduces random variability through its undoped channel DDC transistor’s strong body factor can be used to fix systematic variation and compensate for temperature variation DDC technology provides performance kicker from 90nm to 20nm Straight forward integration into existing nodes Compatible with existing bulk planar CMOS silicon IP Use existing CAD flow DDC technology brings back low power tools Large range DVFS Body biasing Low voltage operation Taking advantage of reduced variability DDC transistor in design and architecture will lead to next level in mobile SOC Conclusions
23 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.Copyright © 2011 SuVolta, Inc. All rights reserved. 23
EE201C : Stochastic Modeling of FinFET LER and Circuits Optimization based on Stochastic Modeling Shaodi Wang
UTB SOI for LER/RDF EECS Min Hee Cho. Outline Introduction LER (Line Edge Roughness) RDF (Random Dopant Fluctuation) Variation Solution – UTB.
Introduction to FinFet
EE414 VLSI Design Design Metrics in Design Metrics in VLSI Design [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
Tunable Sensors for Process-Aware Voltage Scaling
1 adaptive body bias for reducing process variations nuno alves 19 / october / 2006.
Lecture 7: Power.
Leakage reduction techniques Three major leakage current components 1. Gate leakage ; ~ Vdd 4 2. Subthreshold ; ~ Vdd 3 3. P/N junction.
Institute of Digital and Computer Systems 1 Fabio Garzia / Finding Peak Performance in a Process23/06/2015 Chapter 5 Finding Peak Performance in a Process.
1 A Variation-tolerant Sub- threshold Design Approach Nikhil Jayakumar Sunil P. Khatri. Texas A&M University, College Station, TX.
EE466: VLSI Design Power Dissipation. Outline Motivation to estimate power dissipation Sources of power dissipation Dynamic power dissipation Static power.
Low Power – High Speed MCML Circuits (II)
Patricia Gonzalez Divya Akella VLSI Class Project.
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 6.1 EE4800 CMOS Digital IC Design & Analysis Lecture 6 Power Zhuo Feng.
0 1 Width-dependent Statistical Leakage Modeling for Random Dopant Induced Threshold Voltage Shift Jie Gu, Sachin Sapatnekar, Chris Kim Department of Electrical.
FPGA-Based System Design: Chapter 2 Copyright 2004 Prentice Hall PTR Topics n Logic gate delay. n Logic gate power consumption. n Driving large loads.
ITRS December 2003, Hsin-Chu Taiwan How Much Variability Can Designers Tolerate? Andrew B. Kahng ITRS Design ITWG December 1, 2003.
Topics Electrical properties of static combinational gates:
CS203 – Advanced Computer Architecture Technology Trends.
XIAOYU HU AANCHAL GUPTA Multi Threshold Technique for High Speed and Low Power Consumption CMOS Circuits.
Process Variation Mohammad Sharifkhani. Reading Textbook, Chapter 6 A paper in the reference.
Utilizing Reverse Short Channel Effect for Optimal Sub threshold Circuit Design By Tae- Hyoung Kim, Hanyong Eom, John Keane and Chris H. Kim Presented.
Trends in IC technology and design J. Christiansen CERN - EP/MIC
Lateral Asymmetric Channel (LAC) Transistors
EE/MAtE1671 Process Variability EE/MatE 167 David Wahlgren Parent.
Modern VLSI Design 3e: Chapter 3 Copyright 1998, 2002 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;
Introduction to the TRAMS project objectives and results in Y1 Antonio Rubio, Ramon Canal UPC, Project coordinator CASTNESS’11 WORKSHOP ON TERACOMP FET.
MICAS Department of Electrical Engineering (ESAT) Design-In for EMC on digital circuit October 27th, 2005 AID–EMC: Low Emission Digital Circuit Design.
1 Cleared for Open Publication July 30, S-2144 P148/MAPLD 2004 Rea MAPLD 148:"Is Scaling the Correct Approach for Radiation Hardened Conversions.
VLSI Signal Processing Laboratory A 180-mV Subthreshold FFT Processor Using a Minimum Energy Design Methodology -Alice Wang & Anantha Chandrakasan- Seok-jae,
A 30-GS/sec Track and Hold Amplifier in 0.13-µm CMOS Technology
© Digital Integrated Circuits 2nd Devices VLSI Devices Intuitive understanding of device operation Fundamental analytic models Manual Models Spice.
On-Line Adjustable Buffering for Runtime Power Reduction Andrew B. Kahng Ψ Sherief Reda † Puneet Sharma Ψ Ψ University of California, San Diego † Brown.
EE/MAtE1671 Front-End-Of-Line Variability Considerations EE/MatE 167 David Wahlgren Parent.
1 Dual-V cc SRAM Class presentation for Advanced VLSIPresenter:A.Sammak Adopted from: M. Khellah,A 4.2GHz 0.3mm 2 256kb Dual-V CC SRAM Building Block in.
Advanced Computing and Information Systems laboratory Device Variability Impact on Logic Gate Failure Rates Erin Taylor and José Fortes Department of Electrical.
Integrated Regulation for Energy- Efficient Digital Circuits Elad Alon 1 and Mark Horowitz 2 1 UC Berkeley 2 Stanford University.
Logic Optimization Mohammad Sharifkhani. Reading Textbook II, Chapters 5 and 6 (parts related to power and speed.) Following Papers: –Nose, Sakurai, 2000.
Slide 1 Bayesian Model Fusion: Large-Scale Performance Modeling of Analog and Mixed- Signal Circuits by Reusing Early-Stage Data Fa Wang*, Wangyang Zhang*,
1 Computing with Leakage Currents Nikhil Jayakumar, Kanupriya Gulati, Rajesh Garg and Sunil P. Khatri ECE Department Texas A&M University.
Introduction to CMOS VLSI Design Lecture 21: Scaling and Economics
Robust Low Power VLSI R obust L ow P ower VLSI Sub-threshold Sense Amplifier (SA) Compensation Using Auto-zeroing Circuitry 01/21/2014 Peter Beshay Department.
Chapter 10 Digital CMOS Logic Circuits
The new E-port interface circuits Filip Tavernier CERN.
Variation. 2 Sources of Variation 1.Process (manufacturing) (physical) variations: Uncertainty in the parameters of fabricated devices and interconnects.
FaridehShiran Department of Electronics Carleton University, Ottawa, ON, Canada SmartReflex Power and Performance Management Technologies.
An Improved “Soft” eFPGA Design and Implementation Strategy Victor Aken’Ova, Guy Lemieux, Resve Saleh SoC Research Lab, University of British Columbia.
Unified Adaptivity Optimization of Clock and Logic Signals Shiyan Hu and Jiang Hu Dept of Electrical and Computer Engineering Texas A&M University.
The Cost of Fixing Hold Time Violations in Sub-threshold Circuits Yanqing Zhang, Benton Calhoun University of Virginia Motivation and Background Power.
SRAM Mohammad Sharifkhani. Effect of Mismatch.
© 2017 SlidePlayer.com Inc. All rights reserved.