Mahapatra-Texas A&M-Spring'021 Power Issues with Embedded Systems Rabi Mahapatra Computer Science.

Slides:



Advertisements
Similar presentations
VADA Lab.SungKyunKwan Univ. 1 L3: Lower Power Design Overview (2) 성균관대학교 조 준 동 교수
Advertisements

Power Reduction Techniques For Microprocessor Systems
Elettronica T A.A Digital Integrated Circuits © Prentice Hall 2003 Inverter CMOS INVERTER.
Introduction to CMOS VLSI Design Lecture 18: Design for Low Power David Harris Harvey Mudd College Spring 2004.
Designing Combinational Logic Circuits: Part2 Alternative Logic Forms:
EE42/100, Spring 2006Week 14a, Prof. White1 Week 14a Propagation delay of logic gates CMOS (complementary MOS) logic gates Pull-down and pull-up The basic.
S. Reda EN160 SP’08 Design and Implementation of VLSI Systems (EN1600) Lecture 14: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
Copyright 2001, Regents of University of California Lecture 18: 04/0703 A.R. Neureuther Version Date 04/03/03 EECS 42 Intro. electronics for CS Spring.
© Digital Integrated Circuits 2nd Inverter CMOS Inverter: Digital Workhorse  Best Figures of Merit in CMOS Family  Noise Immunity  Performance  Power/Buffer.
8/18/05ELEC / Lecture 11 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic Circuits.
Lecture #24 Gates to circuits
Mehdi Amirijoo1 Power estimation n General power dissipation in CMOS n High-level power estimation metrics n Power estimation of the HW part.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 13: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
8/23-25/05ELEC / Lecture 21 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic Circuits.
Lecture 5 – Power Prof. Luke Theogarajan
Lecture 7: Power.
Power-Aware Computing 101 CS 771 – Optimizing Compilers Fall 2005 – Lecture 22.
Lecture 21, Slide 1EECS40, Fall 2004Prof. White Lecture #21 OUTLINE –Sequential logic circuits –Fan-out –Propagation delay –CMOS power consumption Reading:
Low Power Design of Integrated Systems Assoc. Prof. Dimitrios Soudris
Power, Energy and Delay Static CMOS is an attractive design style because of its good noise margins, ideal voltage transfer characteristics, full logic.
Low Power Design and Adiabatic Circuits P.Ranjith M.Tech(ICT)
The CMOS Inverter Slides adapted from:
Digital Integrated Circuits for Communication
Digital Integrated Circuits© Prentice Hall 1995 Inverter THE INVERTERS.
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
Free Powerpoint Templates Page 1 Free Powerpoint Templates Low Power VLSI Design Dr Elwin Chandra Monie RMK Engineering College.
6.893: Advanced VLSI Computer Architecture, September 28, 2000, Lecture 4, Slide 1. © Krste Asanovic Krste Asanovic
EE466: VLSI Design Power Dissipation. Outline Motivation to estimate power dissipation Sources of power dissipation Dynamic power dissipation Static power.
ENGG 6090 Topic Review1 How to reduce the power dissipation? Switching Activity Switched Capacitance Voltage Scaling.
Low-Power CMOS Logic Circuit Topic Review 1 Part I: Overview (Shaw) Part II: (Vincent) Low-Power Design Through Voltage Scaling Estimation and Optimization.
Low Power Techniques in Processor Design
1 VLSI Design SMD154 LOW-POWER DESIGN Magnus Eriksson & Simon Olsson.
Power Reduction for FPGA using Multiple Vdd/Vth
EE415 VLSI Design DYNAMIC LOGIC [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
Low-Power Wireless Sensor Networks
Mahapatra-Texas A&M-Spring'021 Power Issues with Embedded Systems Rabi Mahapatra Computer Science.
1 Power Dissipation in CMOS Two Components contribute to the power dissipation: »Static Power Dissipation –Leakage current –Sub-threshold current »Dynamic.
Basics of Energy & Power Dissipation Lecture notes S. Yalamanchili, S. Mukhopadhyay. A. Chowdhary.
Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
Ratioed Circuits Ratioed circuits use weak pull-up and stronger pull-down networks. The input capacitance is reduced and hence logical effort. Correct.
DCSL & LVDCSL: A High Fan-in, High Performance Differential Current Switch Logic Families Dinesh Somasekhaar, Kaushik Roy Presented by Hazem Awad.
경종민 Low-Power Design for Embedded Processor.
1 Power estimation in the algorithmic and register-transfer level September 25, 2006 Chong-Min Kyung.
Basics of Energy & Power Dissipation
© Digital Integrated Circuits 2nd Inverter Digital Integrated Circuits A Design Perspective The Inverter Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic.
FPGA-Based System Design: Chapter 6 Copyright  2004 Prentice Hall PTR Topics n Low power design. n Pipelining.
EE Electronics Circuit Design Digital Logic Gates 14.2nMOS Logic Families 14.3Dynamic MOS Logic Families 14.4CMOS Logic Families 14.5TTL Logic.
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 6.1 EE4800 CMOS Digital IC Design & Analysis Lecture 6 Power Zhuo Feng.
Seok-jae, Lee VLSI Signal Processing Lab. Korea University
Dynamic Logic.
1 Dynamic CMOS Chapter 9 of Textbook. 2 Dynamic CMOS  In static circuits at every point in time (except when switching) the output is connected to either.
EE141 Combinational Circuits 1 Chapter 6 (I) Designing Combinational Logic Circuits Dynamic CMOS LogicDynamic CMOS Logic V1.0 5/4/2003.
EE 466/586 VLSI Design Partha Pande School of EECS Washington State University
ELEC Digital Logic Circuits Fall 2015 Delay and Power Vishwani D. Agrawal James J. Danaher Professor Department of Electrical and Computer Engineering.
CS203 – Advanced Computer Architecture
LOW POWER DESIGN METHODS
M V Ganeswara Rao Associate Professor Dept. of ECE Shri Vishnu Engineering College for Women Bhimavaram Hardware Architecture of Low-Power ALU using Clock.
COE 360 Principles of VLSI Design Delay. 2 Definitions.
High Speed Properties of Digital Gates, Copyright F. Canavero, R. Fantino Licensed to HDT - High Design Technology
Power-Optimal Pipelining in Deep Submicron Technology
CS203 – Advanced Computer Architecture
LOW POWER DESIGN METHODS V.ANANDI ASST.PROF,E&C MSRIT,BANGALORE.
Hot Chips, Slow Wires, Leaky Transistors
Reading: Hambley Ch. 7; Rabaey et al. Sec. 5.2
Power Issues with Embedded Systems
Circuit Design Techniques for Low Power DSPs
Lecture 7: Power.
Lecture 7: Power.
Presentation transcript:

Mahapatra-Texas A&M-Spring'021 Power Issues with Embedded Systems Rabi Mahapatra Computer Science

Mahapatra-Texas A&M-Spring'022 Plan for today Some Power Models Familiar with technique to reduce power consumption Reading assignment: paper by Bill Moyer on “Low-Power Design for Embedded Processors” Proceedings of IEEE Nov. 2001

Mahapatra-Texas A&M-Spring'023 Next Generation Computing: Watts metrics? Router (.) | vvv| (.) | vvv| (.) | vvv| (.) | vvv| Server/Data Processing Mega Watts Wireless Networks Micro-watts Base Station Laptops,PDAs, Cellphones, GPS W (Watts)

Mahapatra-Texas A&M-Spring'024 Power Aware Increase in prominence of portable devices SoC complexity: heat generation Traditionally, speed (performance), & area (cost), Now, add power as the new axix

Mahapatra-Texas A&M-Spring'025 Physics Revisited Energy is in Joules Power: Rate of energy consumption (joules/sec), in Watt Vdd*Id: instantaneous power

Mahapatra-Texas A&M-Spring'026 Impact on embedded system Energy consumed per activity reduces battery life –Decreases battery capacity fast IR drops in a battery due to flow of current –Requires more Vdd & GND pins to reduce R, also, thick&wide wiring is necessary Inductive Power-supply voltage bounce due to current switching –Requires more & shorter pins to reduce inductance –Require on chip decoupling capacitance to help bypass pins Power dissipation produces heat and high temperature reduces speed and reliability

Mahapatra-Texas A&M-Spring'027 Opportunities for Low-Power Algorithm s Source Code Compiler Operating System ISA Microarchitecture Circuit Design Manufacturing Minimize Operation Optimized code Energy miser Scheduling Energy Exposed Clocked Gating Low voltage swing Low-k dielectric

Mahapatra-Texas A&M-Spring'028 Some Power Models Macro level –Arithmetic –Software –Memory Activity Based –Empirical –Information-theoretic –Signal modeling-based

Mahapatra-Texas A&M-Spring'029 Empirical Based on chip estimation system [Glaser ICCAD91]: P =  G(E r + C L* V dd 2 )f G = number of equivalent gates E r = energy consumed by an equivalent gate C L = average loading per gate including fanout  = activity factor Demerit: lacks consideration on different logic styles

Mahapatra-Texas A&M-Spring'0210 Information Theoretic Reference [Najm95] Based on activity estimation P = k (C L )(  ) = k(A)(h) A = area, h = entropy factor (a function of entropy H) Limited accuracy, does not include possibility of encoding

Mahapatra-Texas A&M-Spring'0211 Signal Model Based Reference [Landman TCAD96] –Properties of 2’s complement encoded data stream –Arithmetic blocks are regular Analytical Method: [Ramprasad TCAD97] –Word-level statistics –Auto-regressive Moving Average signal generation model –2’s complement & sign magnitude signal encoding

Mahapatra-Texas A&M-Spring'0212 Software Power Power consumed by a processor (P): Ref [TiwariTVLSI94] P = V dd * I Energy (E): E = P *T p, program execution time Program Execution Time(T p ) T p = N*T clk E = P *T p = V dd * I *N*T clk If V dd and T clk are assumed to be constant, Energy is measured by measuring current I. Low-power software: small value of N or fast execution time When V dd and T clk are varying? Current measurements?

Mahapatra-Texas A&M-Spring'0213 Instruction Level Power Modeling Reference: [Tiwari TVLSI97] Current consumption of a program with no loops but M instruction I =  i=0 B k *N k + O i,i+1modM /  i=0 N k B k = Base current of kth instruction in the program N k = Number of clocks required to complete kth instruction O i,j = overhead of executing successive instruction

Mahapatra-Texas A&M-Spring'0214 Power Dissipation in CMOS Three sources: P switching : Switching power (capacitive): dominant today Pl eakage : Leakage Power, will dominant in 0.13 micron and below. P shortcircuit : Schort circuit component CLCL

Mahapatra-Texas A&M-Spring'0215 Switching Power Dissipation Occurs when device changes state or switching of charge in and out of C L, capacitance Flow of current across the transistor’s impedence P switching = t * C L * V 2 dd * f –t= average number of transition per cycle –f = clock frequency –C L = effective capacitance Increases with clock frequency Decreases quadratically with supply voltage 85-90% of active power consumption

Mahapatra-Texas A&M-Spring'0216 Low-Power Techniques Low-power techniques reduces one or more of t, C L, V dd, and f –t: encoding – C L : fast algorithm, design layout – V dd : voltage scaling, variable voltage processor –f: low-frequency and clock gating All of these are useful for embedded system

Mahapatra-Texas A&M-Spring'0217 Short Circuit Power Dissipation Occurs due to the overlapped conductance of both PMOS and NMOS transistors forming a CMOS logic gate as the input signal transitions P shortcircuit = I mean * V dd 10-20% contribution to dynamic power Not important if all signals are guaranted to have steep slopes

Mahapatra-Texas A&M-Spring'0218 Leakage Power Dissipation Occurs regardless of state change Due to leakage currents from reversed biased PN junction (OFF switches are not really off) Proportional to device area and temperature Increases exponentially with reduction in Vt, voltage scaling Significant when system is idle (Embedded Systems?)

Mahapatra-Texas A&M-Spring'0219 Static Power Not a factor in pure CMOS designs Sense amplifier, voltage references and constant current sources contribute to the static power Regardless of device state change Total Power: P switching + P shortcircuit +P static +P leakage

Mahapatra-Texas A&M-Spring'0220 Power – Delay Leverage Power & Delay trade off Speed is proportional to C L * V dd / (V dd – V t ) 1.5 Trends: Reduce V dd & V t to improve speed Energy-delay product is minimized when V dd = 2 * V t Reducing V dd from 3 * V t to 2 * V t results in an approximately 50% decrease in performance while using only 44% of the power.

Mahapatra-Texas A&M-Spring'0221 Algorithmic Technique PR Focus on minimizing number of operation weighted by their cost: First order goal. –Underlying implementation: arithmatic or logical Recomputation of intermediate results may be cheaper than memory use Loop unrolling: reduces loop overhead Number representation: – fixed point or floating point –Sign-magnitude versus 2’s complement is preferred in certain DSP when input samples are uncorrelated and dynamic range minimum –Bit length (of course trade off accuracy) –Adaptive bit truncation in portable video encoder reduces 70% of the power over full bit width

Mahapatra-Texas A&M-Spring'0222 Architectural Technique PR Instruction set design and exploiting parallelism & pipelining are important Architecture driven voltage scaling method [Chandrakasan, IEEE J. Solid state Circuits 92] –Lower voltage for power but apply parallelism/pipeline to speedup –Possible if application has parallelism, trade-off with latency due to pipeline & data dependencies, and area –Speculative logic allowed if low overhead else determental Meeting required performance without overdesigning a solution is fundamental optimization –Extra logic power is not controllable and they still present even if parallelism is absent.

Mahapatra-Texas A&M-Spring'0223 Logic and Circuit Level PR Focus on reducing switched capacitance or/and signal swing Signal probabilities may favor either static or dynamic CMOS logic –Example: Two-input NAND gate with uniform distribution at inputs, probability of output being 0 (p0) is 0.25, p1 = 0.75 –For static gate, probability of a power consuming transition from 0 > 1 is p0*p1 = –For dynamic gate with the output precharged to logic 1, power is consumed whenever the output was previously 0. Thus it has higher (by 0.25) transition at output than static. –However, dynamic circuit has lower input capacitance by a factor of 2 to 3.

Mahapatra-Texas A&M-Spring'0224 Logic circuit PR For wider input static gate, say four input NAND, p0 = and p0 > 1 is For dynamic version as above, p0 = p0 > 1 = Static logic suffers from glitches: needs restructuring and that adds up power more than 20% Hazard X Y Restructured Logic X Y

Mahapatra-Texas A&M-Spring'0225 Logic circuit PR Mapping logic function to gates is tricky too P = 0.5 P = 0.25 P = P(total) = P = P = 0.25 P =0.5 P = P(total) =0.4375