ICCAD 2003 Algorithm for Achieving Minimum Energy Consumption in CMOS Circuits Using Multiple Supply and Threshold Voltages at the Module Level Yuvraj.

Slides:



Advertisements
Similar presentations
ECE 667 Synthesis and Verification of Digital Circuits
Advertisements

1 “Scheduling with Dynamic Voltage/Speed Adjustment Using Slack Reclamation In Multi-processor Real-Time Systems” Dakai Zhu, Rami Melhem, and Bruce Childers.
Dynamic and Leakage Power Reduction in MTCMOS Circuits Using an Automated Efficient Gate Clustering Technique Mohab Anis, Shawki Areibi *, Mohamed Mahmoud.
University of Michigan Electrical Engineering and Computer Science University of Michigan Electrical Engineering and Computer Science University of Michigan.
Minimum Energy CMOS Design with Dual Subthrehold Supply and Multiple Logic-Level Gates Kyungseok Kim and Vishwani D. Agrawal ECE Dept. Auburn University.
Leakage and Dynamic Glitch Power Minimization Using MIP for V th Assignment and Path Balancing Yuanlin Lu and Vishwani D. Agrawal Auburn University ECE.
Praveen Venkataramani Suraj Sindia Vishwani D. Agrawal FINDING BEST VOLTAGE AND FREQUENCY TO SHORTEN POWER CONSTRAINED TEST TIME 4/29/ ST IEEE VLSI.
Minimum Dynamic Power CMOS Circuit Design by a Reduced Constraint Set Linear Program Tezaswi Raja Vishwani Agrawal Michael L. Bushnell Rutgers University,
CMOS Circuit Design for Minimum Dynamic Power and Highest Speed Tezaswi Raja, Dept. of ECE, Rutgers University Vishwani D. Agrawal, Dept. of ECE, Auburn.
Polynomial-Time Algorithms for Designing Dual-Voltage Energy Efficient Circuits Master’s Thesis Defense Mridula Allani Advisor : Dr. Vishwani D. Agrawal.
Dual Voltage Design for Minimum Energy Using Gate Slack Kyungseok Kim and Vishwani D. Agrawal ECE Dept. Auburn University Auburn, AL 36849, USA IEEE ICIT-SSST.
Optimization in Engineering Design 1 Lagrange Multipliers.
Multiobjective VLSI Cell Placement Using Distributed Simulated Evolution Algorithm Sadiq M. Sait, Mustafa I. Ali, Ali Zaidi.
Aug 23, ‘021Low-Power Design Minimum Dynamic Power Design of CMOS Circuits by Linear Program Using Reduced Constraint Set Vishwani D. Agrawal Agere Systems,
Fuzzy Simulated Evolution for Power and Performance of VLSI Placement Sadiq M. Sait Habib Youssef Junaid A. KhanAimane El-Maleh Department of Computer.
May 14, ISVLSI 09 Algorithms for Estimating Number of Glitches and Dynamic Power in CMOS Circuits with Delay Variations Jins Davis Alexander Vishwani.
August 12, 2005Uppalapati et al.: VDAT'051 Glitch-Free Design of Low Power ASICs Using Customized Resistive Feedthrough Cells 9th VLSI Design & Test Symposium.
1 HW/SW Partitioning Embedded Systems Design. 2 Hardware/Software Codesign “Exploration of the system design space formed by combinations of hardware.
Dec. 6, 2005ELEC Glitch Power1 Low power design: Insert delays to eliminate glitches Yijing Chen Dec.6, 2005 Auburn university.
Input-Specific Dynamic Power Optimization for VLSI Circuits Fei Hu Intel Corp. Folsom, CA 95630, USA Vishwani D. Agrawal Department of ECE Auburn University,
Nov. 8, 001Low-Power Design Digital Circuit Design for Minimum Transient Energy Vishwani D. Agrawal Circuits and Systems Research Lab, Agere Systems (Bell.
Device Sizing Techniques for High Yield Minimum-Energy Subthreshold Circuits Dan Holcomb and Mervin John University of California, Berkeley EE241 Spring.
9/20/05ELEC / Lecture 81 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic Circuits.
Fuzzy Simulated Evolution for Power and Performance of VLSI Placement Sadiq M. SaitHabib Youssef Junaid A. KhanAimane El-Maleh Department of Computer Engineering.
May 28, 2003Minimum Dynamic Power CMOS1 Minimum Dynamic Power CMOS Circuits Vishwani D. Agrawal Rutgers University, Dept. of ECE Piscataway, NJ 08854
EE290A 1 Retiming of AND- INVERTER graphs with latches Juliet Holwill 290A Project 10 May 2005.
An Efficient Chiplevel Time Slack Allocation Algorithm for Dual-Vdd FPGA Power Reduction Yan Lin 1, Yu Hu 1, Lei He 1 and Vijay Raghunathan 2 1 EE Department,
Chung-Kuan Cheng†, Andrew B. Kahng†‡,
Jan 6-10th, 2007VLSI Design A Reduced Complexity Algorithm for Minimizing N-Detect Tests Kalyana R. Kantipudi Vishwani D. Agrawal Department of Electrical.
Jan. 2007VLSI Design '071 Statistical Leakage and Timing Optimization for Submicron Process Variation Yuanlin Lu and Vishwani D. Agrawal ECE Dept. Auburn.
Fuzzy Evolutionary Algorithm for VLSI Placement Sadiq M. SaitHabib YoussefJunaid A. Khan Department of Computer Engineering King Fahd University of Petroleum.
Changbo Long ECE Department, UW-Madison Lei He EDA Research Group EE Department, UCLA Distributed Sleep Transistor Network.
1 Enhancing Performance of Iterative Heuristics for VLSI Netlist Partitioning Dr. Sadiq M. Sait Dr. Aiman El-Maleh Mr. Raslan Al Abaji. Computer Engineering.
Flow Models and Optimal Routing. How can we evaluate the performance of a routing algorithm –quantify how well they do –use arrival rates at nodes and.
VOLTAGE SCHEDULING HEURISTIC for REAL-TIME TASK GRAPHS D. Roychowdhury, I. Koren, C. M. Krishna University of Massachusetts, Amherst Y.-H. Lee Arizona.
Embedded System Design Framework for Minimizing Code Size and Guaranteeing Real-Time Requirements Insik Shin, Insup Lee, & Sang Lyul Min CIS, Penn, USACSE,
Determining the Optimal Process Technology for Performance- Constrained Circuits Michael Boyer & Sudeep Ghosh ECE 563: Introduction to VLSI December 5.
POWER-DRIVEN MAPPING K-LUT-BASED FPGA CIRCUITS I. Bucur, N. Cupcea, C. Stefanescu, A. Surpateanu Computer Science and Engineering Department, University.
EE 5900 Advanced Algorithms for Robust VLSI CAD, Spring 2009 Static Timing Analysis and Gate Sizing.
Logic Synthesis For Low Power CMOS Digital Design.
An Efficient Algorithm for Dual-Voltage Design Without Need for Level-Conversion SSST 2012 Mridula Allani Intel Corporation, Austin, TX (Formerly.
Minimum Cost Flows. 2 The Minimum Cost Flow Problem u ij = capacity of arc (i,j). c ij = unit cost of shipping flow from node i to node j on (i,j). x.
De-Nian Young Ming-Syan Chen IEEE Transactions on Mobile Computing Slide content thanks in part to Yu-Hsun Chen, University of Taiwan.
Thermal-aware Steiner Routing for 3D Stacked ICs M. Pathak and S.K. Lim Georgia Institute of Technology ICCAD 07.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
1 Exploring Custom Instruction Synthesis for Application-Specific Instruction Set Processors with Multiple Design Objectives Lin, Hai Fei, Yunsi ACM/IEEE.
Energy/Reliability Trade-offs in Fault-Tolerant Event-Triggered Distributed Embedded Systems Junhe Gan, Flavius Gruian, Paul Pop, Jan Madsen.
A Power Grid Analysis and Verification Tool Based on a Statistical Prediction Engine M.K. Tsiampas, D. Bountas, P. Merakos, N.E. Evmorfopoulos, S. Bantas.
Robust Low Power VLSI ECE 7502 S2015 Minimum Supply Voltage and Very- Low-Voltage Testing ECE 7502 Class Discussion Elena Weinberg Thursday, April 16,
Resource Mapping and Scheduling for Heterogeneous Network Processor Systems Liang Yang, Tushar Gohad, Pavel Ghosh, Devesh Sinha, Arunabha Sen and Andrea.
1 ε -Optimal Minimum-Delay/Area Zero-Skew Clock Tree Wire-Sizing in Pseudo-Polynomial Time Jeng-Liang Tsai Tsung-Hao Chen Charlie Chung-Ping Chen (National.
Timing-Driven Routing for FPGAs Based on Lagrangian Relaxation
Inverter Chapter 5 The Inverter April 10, Inverter Objective of This Chapter  Use Inverter to know basic CMOS Circuits Operations  Watch for performance.
EE 4271 VLSI Design, Fall 2013 Static Timing Analysis and Gate Sizing Optimization.
L12 : Lower Power High Level Synthesis(3) 성균관대학교 조 준 동 교수
1  Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
Simultaneous Supply, Threshold and Width Optimization for Low-Power CMOS Circuits With an aside on System based shutdown. Gord Allan PhD Candidate ASIC.
Sp09 CMPEN 411 L14 S.1 CMPEN 411 VLSI Digital Circuits Spring 2009 Lecture 14: Designing for Low Power [Adapted from Rabaey’s Digital Integrated Circuits,
Multimedia Computing and Networking Jan Reduced Energy Decoding of MPEG Streams Malena Mesarina, HP Labs/UCLA CS Dept Yoshio Turner, HP Labs.
Efficient Resource Allocation for Wireless Multicast De-Nian Yang, Member, IEEE Ming-Syan Chen, Fellow, IEEE IEEE Transactions on Mobile Computing, April.
Seok-jae, Lee VLSI Signal Processing Lab. Korea University
DAOmap: A Depth-optimal Area Optimization Mapping Algorithm for FPGA Designs Deming Chen, Jason Cong , Computer Science Department , UCLA Presented.
Determining Optimal Processor Speeds for Periodic Real-Time Tasks with Different Power Characteristics H. Aydın, R. Melhem, D. Mossé, P.M. Alvarez University.
Unified Adaptivity Optimization of Clock and Logic Signals Shiyan Hu and Jiang Hu Dept of Electrical and Computer Engineering Texas A&M University.
Static Timing Analysis and Gate Sizing Optimization
Static Timing Analysis and Gate Sizing Optimization
Circuit Design Techniques for Low Power DSPs
University of Texas at Austin
Data Wordlength Reduction for Low-Power Signal Processing Software
Resource Allocation for Distributed Streaming Applications
Presentation transcript:

ICCAD 2003 Algorithm for Achieving Minimum Energy Consumption in CMOS Circuits Using Multiple Supply and Threshold Voltages at the Module Level Yuvraj Singh Dhillon Abdulkadir Utku Diril Abhijit Chatterjee Hsien-Hsin Sean Lee School of ECE, Georgia Institute of Technology, Atlanta, GA

Y.S. Dhillon et al. ICCAD Deadline, Problem Definition

Y.S. Dhillon et al. ICCAD Goal Find the supply and threshold voltages to be assigned to modules such that: Energy is minimized System delay remains unaffected

Y.S. Dhillon et al. ICCAD Contributions Obtained a minimum energy condition on supply and threshold voltages Applied Lagrange Multiplier Method Developed an iterative gradient search algorithm which rapidly converges to the optimum voltage values Developed a heuristic approach to cluster the optimum voltages into a limited number of supply and threshold voltages

Y.S. Dhillon et al. ICCAD Overview Module Level Delay/Energy Models Lagrange Multiplier Formulation Gradient Search Algorithm Clustering Heuristic Experimental Results Conclusion

Y.S. Dhillon et al. ICCAD Module Level Delay Model V DDi : Power supply voltage applied to the i th module  : Velocity saturation coefficient V thi : Threshold voltage k 0i : Delay constant To ↓ delay: ↑ V DD, ↓ V th

Y.S. Dhillon et al. ICCAD Dynamic Energy Model Model for dynamic energy dissipation V DDi : Power supply voltage applied to the i th module k 1i : Energy constant k 1i includes the effect of both switching and short-circuit energies To ↓ E d : ↓ V DD

Y.S. Dhillon et al. ICCAD Static Energy Model Model for static energy dissipation k 2, k 5 : circuit-dependent parameters k 3, k 4, k 6, k 7 : process-dependent parameters To ↓ E s : ↓ V DD, ↑ V th

Y.S. Dhillon et al. ICCAD Problem Formulation Deadline,

Y.S. Dhillon et al. ICCAD Minimize under the constraints for all paths P j E i = E di + E si T d is the time constraint V DDi and V thi are the variables for each module Problem Formulation

Y.S. Dhillon et al. ICCAD Lagrange Multiplier Formulation where j is the Lagrange Multiplier for the j th path For minimum energy consumption:

Y.S. Dhillon et al. ICCAD Minimum Energy Condition Given delay d i for module i, the energy consumed by the module is minimized when CTEG i =CSEG i Constant Threshold Energy Gradient Constant Supply Energy Gradient

Y.S. Dhillon et al. ICCAD Gradient Search Algorithm Step 1: Give initial delays to the modules trying to make all the path delays as close to T d as possible Use the Zero Slack Algorithm Step 2: For the given delay d i for the i th module, solve CTEG i =CSEG i to get V DDi and V thi for that module

Y.S. Dhillon et al. ICCAD Gradient Search Algorithm Step 3: Calculate the cost for the current iteration using V DD and V th values At the minimum energy point, cost will be zero Step 4: If cost is less than a predetermined value, done Else, continue to Step 5

Y.S. Dhillon et al. ICCAD Step 5: Assign new delays to the modules is the gradient of along the null space vectors of A Adding a delay vector in the null space of A to the current delay values guarantees that the path delays do not change Go to Step 2 Gradient Search Algorithm

Y.S. Dhillon et al. ICCAD Note about Cost Function At minimum energy, Designers can use Cost_fn to evaluate the energy efficiency of their designs

Y.S. Dhillon et al. ICCAD Clustering Heuristic pq Assume p supply voltages and q threshold voltages are available (p<N, q<N) Step 1: Obtain initial values for the p V DD_p s and q V th_q s from the N optimum V DD_opt s and V th_opt s Step 2: For every module i, find nearest pair [V DD_p (m),V th_q (n)] to [V DD_opt (i),V th_opt (i)] and assign to [V DDi,V thi ]

Y.S. Dhillon et al. ICCAD Clustering Heuristic Step 3: Calculate the critical path delay, T c If T c is close to the constraint, T d, done Else, continue to Step 4 Step 4: Obtain new values for the p V DD_p s and q V th_q s using gradient search Two different cost functions used: Go to Step 2

Y.S. Dhillon et al. ICCAD Experimental Results Algorithm applied to ISCAS’85 circuits and a Wallace tree multiplier Top level modules in the Verilog description were directly mapped to the modules used in the optimization The process-dependent parameters (k 3, k 4, k 6, k 7 ) were obtained from SPICE simulations of an inverter The circuit-dependent parameters (k 0, k 1, k 2, k 5 ) were obtained using Synopsys Design Compiler with TSMC 0.25µ library

Y.S. Dhillon et al. ICCAD Optimizing a Wallace Tree Multiplier

Y.S. Dhillon et al. ICCAD Baseline Circuits (2 Switching Activities)

Y.S. Dhillon et al. ICCAD Unlimited # of V dd and V th

Y.S. Dhillon et al. ICCAD Clustering to 2 V dd and 1 V th

Y.S. Dhillon et al. ICCAD Summary of Energy Savings

Y.S. Dhillon et al. ICCAD Conclusion Mathematical condition on the supply and threshold voltages of interconnected modules minimizes the total energy consumption under a delay constraint Iterative gradient search algorithm rapidly converges to the optimum voltage values Heuristic clusters the optimum voltages into a limited number of supply and threshold voltages Achieve energy savings of up to 58.4% with unlimited number of V dd and V th

Y.S. Dhillon et al. ICCAD

ICCAD 2003 Backup Slides

Y.S. Dhillon et al. ICCAD Motivation and Goal Usage of multiple supply voltage planes and multiple threshold voltages is becoming increasingly necessary in DSM VLSI design Lower power consumption without significant performance loss Voltage optimization at gate level is highly complex Large numbers of paths have to be optimized for power The search space is huge Assigning different supply voltages at gate level is not technologically feasible

Y.S. Dhillon et al. ICCAD Motivation Why optimize at module level ? Optimization at gate level is highly complex Large numbers of paths Search space is huge Assigning different supply voltages at gate level is not technologically feasible Number of paths is limited Different modules can be assigned different supply and threshold voltages

Y.S. Dhillon et al. ICCAD Summary of Delay/Energy Modeling For any module: To ↓ delay: ↑ V DD, ↓ V th To ↓ E d : ↓ V DD To ↓ E s : ↓ V DD, ↑ V th For given fixed module delay, d i, optimum V DDi and V thi values can be found that minimize E i =E di +E si