An Efficient Algorithm for Dual-Voltage Design Without Need for Level-Conversion SSST 2012 Mridula Allani Intel Corporation, Austin, TX (Formerly with Auburn University) Dr. Vishwani D. Agrawal Department of Electrical and Computer Engineering Auburn University, Auburn, AL March 12, 2012
Outline Motivation Problem statement Background Contributions Algorithm to find V DDL Algorithm to assign V DDL Results Future work References 3/12/2012 2
Motivation Current dual voltage designs use 0.7V DD as the lower supply voltage. Algorithms to assign low voltage have exponential or polynomial complexity. Require efficient algorithms that can increase energy savings in large circuits. 3/12/2012 3
Problem Statement Develop a linear time algorithm to find an optimal lower voltage V DDL, given a single voltage V DDH without affecting the critical path delay. Develop new algorithms for voltage assignment to gates in dual-V DD design. 3/12/2012 4
Background Basic idea: decrease energy consumption without any delay penalty. Done by assigning lower supply voltage to gates on non-critical paths. Different algorithms propose different ways of finding non-critical path gates for lower voltage. 3/12/2012 5
Background Authors Kuroda and Hamada say that power reduction ratio is minimum when 0.6V DD ≤ V DDL ≤ 0.7V DD. Authors Chen, et al., Kulkarni, et al., Srivatstava, et al., claim that the optimal value of V DDL for minimizing total power is 50% of V DD. Rule of thumb proposed by Hamada, et. al. says 3/12/2012 6
Background CVS Structure [Usami and Horowitz] ECVS Structure [Usami, et. al.] V DDL V DD Level Converter K. Usami and M. Horowitz, “Clustered Voltage Scaling Technique for Low-Power Design," Proceedings of the International Symposium on Low Power Design, pp , K. Usami, et. al.,“Automated Low-Power Technique Exploiting Multiple Supply Voltages Applied to a Media Processor," IEEE Journal of Solid-State Circuits, vol. 33, no. 3, pp , Mar /12/2012 7
Background Kulkarni, et al. Greedy heuristic based on gate slacks. Uses 0.7V DD or 0.5V DD as V DDL. Includes power and delay overhead of level converters. Sundararajan and Parhi Linear programming based model. Minimizes the power consumption. Includes level converter delay overheads. 3/12/2012 8
Background Recent work [Kim and Agrawal]: Assign V DDL to gates with S i ≥S u. Assign V DDL to gates with S l ≤ S i ≤ S u one by one without violating timing or topological constraints. Repeat last two steps across all voltages to find the best V DDL and the corresponding dual-voltage design with the least energy. Ref. K. Kim and V. D. Agrawal, “Dual Voltage Design for Minimum Energy Using Gate Slack,” Proceedings of the IEEE International Conference on Industrial Technology, pp , March, /12/2012 9
Grouping of gates 45 o line S u = ps P G ≥0 3/12/ ∑(dl i –dh i )≤min{S i }
Groups when V DDL = 1.2V 45 o line P G 3/12/ V DD = 1.2V V DDL = 1.2V T c = 510 ps S u = 0 ps
45 o line P G 3/12/ V DD = 1.2V V DDL = 1.19V T c = 510 ps S u = 14.6 ps Groups when V DDL = 1.19V
45 o line S u = ps P G 3/12/ T c = 510 ps Groups when V DDL = 0.49V
45 o line P G 3/12/ V DD = 1.2V V DDL = 0.39V S u = 469ps T c = 510 ps Groups when V DDL = 0.39V
Groups when V DDL = 0.1V G 3/12/ V DD = 1.2V V DDL = 0.1V S u = 510 ps = T c T c = 510 ps P 45 o line
Theorems 1. Gates above the 45 o line in the ‘Delay increment versus slack’ plot cannot be assigned lower supply voltage without violating the timing constraint. 2. where β i = dl i /dh i and dl i is the low voltage delay and dh i is the high voltage delay of gate i. The maximum value of β i ; β max, will give us the lower bound on the gate slacks. 3/12/
Theorems 3. Groups within P which satisfy can be assigned lower supply voltage without violating the timing constraint. (where, P’ is a sub-set of P y i = dl i – dh i, dl i = low voltage delay of gate i, dh i = high voltage delay of gate i and S i = slack of the gate i at V DD.) 4. Group with slacks greater than S u, G, can always be assigned the lower supply voltage without causing any topological violations. 3/12/
Algorithm to find V DDL Assume all gates are assigned V DDH initially. Calculate gate slacks. Group gates according to their slacks and delays. 3/12/
Algorithm to find V DDL V DDL = V DDL1, when using no level converter. V DDL = (V DDL1 V DDL2 ) 1/2, when using level converter. 3/12/
Results: V DDL selection algorithm ISCAS ’85 Total gates Without level converters V DDL = V DDL1 V DDL = V DDL2 V DDL = (V DDL1 +V DDL2 )/2 V DDL = (V DDL1 V DDL2 ) 1/2 V DDL (V) Gates in V DDL E sav (%) V DDL (V) Gates in V DDL E sav (%) V DDL (V) Gates in V DDL E sav (%) V DDL (V) Gates in V DDL E sav (%) C C C C C C C C C C /12/
Results: Comparison with reported data ISCAS’85 Total gates Without level converters V DDL =V DDL1 V DDL = 0.7V DD = 0.84V V DDL = 0.5V DD = 0.6V V DDL (V) Gates in V DDL E sav (%) Gates in V DDL E sav ( %) Gates in V DDL E sav (%) C C C C C C C C C C /12/
Algorithm to assign V DDL Assume all gates are at V DD initially. Calculate slacks of all gates. Assign V DDL to all gates i whose slacks, S i ≥S u Recalculate slacks. 3/12/
Algorithm to assign V DDL Assign V DDL to a group of gates in P satisfying the condition Recalculate slacks. Are there are any V DDL gates feeding into any V DDH gates or is there any gate with negative slack? 3/12/
Algorithm to assign V DDL If answer to any of the questions is yes, then put the corresponding gate back to V DDH. Recalculate slacks. Repeat previous five steps until we do not have any unprocessed V DDH gate in group P. 3/12/
c880 slack distribution 45 o line S u =336.9 ps P G 3/12/ V DD = 1.2V V DDL = 0.49V
Slack data after V DDL assignment 45 o line S u = 336.9ps P G V DD = 1.2V V DDL = 0.49V 3/12/
ISCAS’85 Total gates V DDL =V DDL1 Determination and assignment SPICE Results ** [Kim and Agrawal] V DDL (V) Gates in V DDL E sav (%) CPU* (s) E single VDD (fJ) E dual VDD ( fJ) E sav (%) CPU (s) C C C C C C C C N/R C C N/R Dual voltage design without level converter Intel Core i5 2.30GHz, 4GB RAM ** 90nm PTM model 3/12/
CPU Time Vs. Number of Gates 3/12/
Future work Accommodate level converter energy overheads. Consider leakage energy reduction. Dual threshold designs. Simultaneous dual supply voltage and dual threshold voltage designs. Include the effects of process variations. 3/12/
References 1. T. Kuroda and M. Hamada, “Low-Power CMOS Digital Design with Dual Embedded Adaptive Power Supplies," IEEE Journal of Solid-State Circuits, vol. 35, no. 4, pp , Apr M. Hamada, Y. Ootaguro, and T. Kuroda, “Utilizing Surplus Timing for Power Reduction,” in Proceedings of the IEEE Custom Integrated Circuits Conference, pp , C. Chen, A. Srivastava, and M. Sarrafzadeh, “On Gate Level Power Optimization Using Dual-Supply Voltages," IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 9, no. 5, pp , Oct S. H. Kulkarni, A. N. Srivastava, and D. Sylvester, “A New Algorithm for Improved VDD Assignment in Low Power Dual VDD Systems," in Proceedings of the International Symposium on Low Power Design, pp , A. Srivastava, D. Sylvester, and D. Blaauw, “Concurrent Sizing, Vdd and Vth Assignment for Low-Power Design," Proceedings of the Design, Automation and Test in Europe Conference, pp , K. Kim, Ultra Low Power CMOS Design. PhD thesis, Auburn University, ECE Dept., Auburn, AL, May /12/
References 7. K. Kim and V. D. Agrawal, “Dual Voltage Design for Minimum Energy Using Gate Slack,” in Proceedings of the IEEE International Conference on Industrial Technology, pp , Mar K. Usami and M. Horowitz, “Clustered Voltage Scaling Technique for Low- Power Design," in Proceedings of the International Symposium on Low Power Design, pp , K. Usami, M. Igarashi, F. Minami, T. Ishikawa, M. Kanzawa, M. Ichida, and K. Nogami, “Automated Low-Power Technique Exploiting Multiple Supply Voltages Applied to a Media Processor," IEEE Journal of Solid-State Circuits, vol. 33, no. 3, pp , Mar V. Sundararajan and K. K. Parhi, “Synthesis of Low Power CMOS VLSI Circuits Using Dual Supply Voltages," in Proceedings of the 36th Annual Design Automation Conference, pp , M. Allani and V. D. Agrawal, “Level-Converter Free Dual-Voltage Design of Energy Efficient Circuits Using Gate Slack,” Submitted to Design Automation and Test in Europe Conference, March 12-16, /12/
Thank you.