EE5970 Computer Engineering Seminar Spring 2012 Michigan Technological University Based on: A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating.

Slides:



Advertisements
Similar presentations
Field Programmable Gate Array
Advertisements

Verilog HDL -Introduction
FPGA (Field Programmable Gate Array)
Lecture 15 Finite State Machine Implementation
Digital Design: Combinational Logic Blocks
CSET 4650 Field Programmable Logic Devices
EGRE 427 Advanced Digital Design Figures from Application-Specific Integrated Circuits, Michael John Sebastian Smith, Addison Wesley, 1997 Chapter 5 Programmable.
Altera FLEX 10K technology in Real Time Application.
P. Keresztes, L.T. Kóczy, A. Nagy, G.Rózsa: Training Electrical Engineers on Asynchronous Logic Circuits on Constant Weight Codes 1 Training Electrical.
Synchronous Digital Design Methodology and Guidelines
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR SRAM-based FPGA n SRAM-based LE –Registers in logic elements –LUT-based logic element.
Architecture Design Methodology. 2 The effects of architecture design on metrics:  Area (cost)  Performance  Power Target market:  A set of application.
Zheming CSCE715.  A wireless sensor network (WSN) ◦ Spatially distributed sensors to monitor physical or environmental conditions, and to cooperatively.
ENGIN112 L38: Programmable Logic December 5, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 38 Programmable Logic.
1 Clockless Logic Montek Singh Tue, Mar 16, 2004.
Multiplexers, Decoders, and Programmable Logic Devices
February 4, 2002 John Wawrzynek
Physical Implementation 1)Manufactured Integrated Circuit (IC) Technologies 2)Programmable IC Technology 3)Other Technologies Manufactured IC Technologies.
CS 151 Digital Systems Design Lecture 38 Programmable Logic.
Introduction to FPGA’s FPGA (Field Programmable Gate Array) –ASIC chips provide the highest performance, but can only perform the function they were designed.
Chapter 6 Memory and Programmable Logic Devices
Low power architecture and HDL coding practices for on-board hardware applications Kaushal D. Buch ASIC Engineer, eInfochips Ltd., Ahmedabad, India
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
General FPGA Architecture Field Programmable Gate Array.
Dr. Konstantinos Tatas ACOE201 – Computer Architecture I – Laboratory Exercises Background and Introduction.
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
Juanjo Noguera Xilinx Research Labs Dublin, Ireland Ahmed Al-Wattar Irwin O. Irwin O. Kennedy Alcatel-Lucent Dublin, Ireland.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
Power Reduction for FPGA using Multiple Vdd/Vth
Unit 9 Multiplexers, Decoders, and Programmable Logic Devices
1 3-General Purpose Processors: Altera Nios II 2 Altera Nios II processor A 32-bit soft core processor from Altera Comes in three cores: Fast, Standard,
Low-Power Wireless Sensor Networks
LOPASS: A Low Power Architectural Synthesis for FPGAs with Interconnect Estimation and Optimization Harikrishnan K.C. University of Massachusetts Amherst.
Abdullah Aldahami ( ) Feb26, Introduction 2. Feedback Switch Logic 3. Arithmetic Logic Unit Architecture a.Ripple-Carry Adder b.Kogge-Stone.
High-Level Interconnect Architectures for FPGAs An investigation into network-based interconnect systems for existing and future FPGA architectures Nick.
PROGRAMMABLE LOGIC DEVICES (PLD)
CPLD (Complex Programmable Logic Device)
Memory and Programmable Logic Memory device: Device to which binary information is transferred for storage, and from which information is available for.
Modern VLSI Design 3e: Chapter 3Partly from 2002 Prentice Hall PTR week9-1 Lectures 21, 22 FPGA and Top-Down Design Flow Mar. 3 and 5, 2003.
CSE 494: Electronic Design Automation Lecture 2 VLSI Design, Physical Design Automation, Design Styles.
J. Christiansen, CERN - EP/MIC
FPGA (Field Programmable Gate Array): CLBs, Slices, and LUTs Each configurable logic block (CLB) in Spartan-6 FPGAs consists of two slices, arranged side-by-side.
Programmable Logic Devices
Field Programmable Gate Arrays (FPGAs) An Enabling Technology.
Basic Sequential Components CT101 – Computing Systems Organization.
A Reconfigurable Low-power High-Performance Matrix Multiplier Architecture With Borrow Parallel Counters Counters : Rong Lin SUNY at Geneseo
EE3A1 Computer Hardware and Digital Design
MAPLD 2005/254C. Papachristou 1 Reconfigurable and Evolvable Hardware Fabric Chris Papachristou, Frank Wolff Robert Ewing Electrical Engineering & Computer.
Programmable Logic Devices (PLDs)
1 Leakage Power Analysis of a 90nm FPGA Authors: Tim Tuan (Xilinx), Bocheng Lai (UCLA) Presenter: Sang-Kyo Han (ECE, University of Maryland) Published.
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA Project Guide: Smt. Latha Dept of E & C JSSATE, Bangalore. From: N GURURAJ M-Tech,
-BY KUSHAL KUNIGAL UNDER GUIDANCE OF DR. K.R.RAO. SPRING 2011, ELECTRICAL ENGINEERING DEPARTMENT, UNIVERSITY OF TEXAS AT ARLINGTON FPGA Implementation.
Lecture 11: FPGA-Based System Design October 18, 2004 ECE 697F Reconfigurable Computing Lecture 11 FPGA-Based System Design.
Priority encoder. Overview Priority encoder- theoretic view Other implementations The chosen implementation- simulations Calculations and comparisons.
M.Mohajjel. Why? TTM (Time-to-market) Prototyping Reconfigurable and Custom Computing 2Digital System Design.
FPGA-Based System Design: Chapter 1 Copyright  2004 Prentice Hall PTR Moore’s Law n Gordon Moore: co-founder of Intel. n Predicted that number of transistors.
Programmable Logic Device Architectures
Logic and Computer Design Fundamentals, Fifth Edition Mano | Kime | Martin Copyright ©2016, 2008, 2004 by Pearson Education, Inc. All rights reserved.
Delivered by.. Love Jain p08ec907. Design Styles  Full-custom  Cell-based  Gate array  Programmable logic Field programmable gate array (FPGA)
Defect-tolerant FPGA Switch Block and Connection Block with Fine-grain Redundancy for Yield Enhancement Anthony J. YuGuy G.F. Lemieux August 25, 2005.
Processor Organization and Architecture Module III.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n FPGA fabric architecture concepts.
Chapter 36 Combinational Logic Circuits. Objectives After completing this chapter, you will be able to: –Describe the functions of encoders, decoders,
1 Architecture of Datapath- oriented Coarse-grain Logic and Routing for FPGAs Andy Ye, Jonathan Rose, David Lewis Department of Electrical and Computer.
Sequential Logic Design
Topics SRAM-based FPGA fabrics: Xilinx. Altera..
Introduction to Programmable Logic
EE345: Introduction to Microcontrollers Memory
FPGA Glitch Power Analysis and Reduction
FIGURE 5-1 MOS Transistor, Symbols, and Switch Models
Presentation transcript:

EE5970 Computer Engineering Seminar Spring 2012 Michigan Technological University Based on: A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama Rohit A. Bhatia IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. VOL. 19, NO. 8, AUGUST 2011

Contents About FPGAs Fine-Grain Power Gating and Related Work – Asynchronous FPGAs – Wave-Pipelining for Bit-Serial FPGAs – Sleep Signal Generation Techniques for Power Gating Architecture of the proposed FPGA – Overview – Fundamental Principle of Autonomous Fine-Grain Power Gating Circuit Implementation Evaluation 2

Field Programmable Gate Array (FPGA) FPGAs offer greater flexibility than ASICs (Application Specific Integrated Circuits) in terms of usability Used in various products: – Programmable Logic Controllers (PLCs) – Medical Instruments – Gigabit Ethernet Controllers – Wireless devices End user can program as per the requirement, enabling incredible customization to the application XILINX FPGA in use Source 3

FPGAs work using “Look Up Tables” or “LUTs” There are several logic blocks / cells (that contain LUTs) and programmable interconnects that give FPGAs the feature of being “Field-Programmable” 4 FPGA Basics ALTERA FPGA Source

About FPGAs Fine-Grain Power Gating and Related Work – FPGA Power Considerations and LEDR – Wave-Pipelining for Bit-Serial FPGAs – Sleep Signal Generation Techniques for Power Gating Architecture of the proposed FPGA – Overview – Fundamental Principle of Autonomous Fine-Grain Power Gating Circuit Implementation Evaluation 5

Power Gating Certain logic blocks can be switched-off when not in use to minimize power consumption – Power consumed by sleep controller must be less than that saved by switching off the block Power Gating: – Fine-Grain: Each LUT has its own sleep transistor and sleep controller – Coarse-Grain: A large number of LUTs share a single sleep controller 6 Overall Architecture [1]

Fine-Grain: Lower standby power but greater dynamic power (sleep transistors) Coarse-Grain: Lower dynamic power but higher standby power (active blocks) 7 Overall Architecture [1] Power Considerations

LEDR (Level Encoded Dual-Rail) Dual-rail code provides scope for data transmission and its validity Two wires per bit are used LEDR or Level Encoded Dual-Rail based architecture achieves lowest dynamic power consumption among all dual-rail architectures considered Decision to shut down and wake up the power-gated domain is based on its activity No spacer required: less transitions, more throughput and less power consumption 8 Dual-Rail code Dual-rail codeInterpretation 00“reset” value 010 value 101 value 11unused [2] [3]

Wave-Pipelining for Bit-Serial FPGAs Allow circuit to process new data set before the previous data set reached the registers. Used to achieve small area without performance degradation – not to reduce standby power 9

Sleep Signal Generation Techniques for Power Gating Software Based – Can be used, but suffers from large power and delay overheads – Overheads make this technique not suitable for fine-grain power gating Hardware Based – A power-gated domain is shut down after it stays idle for a given threshold – Static Sleep Signal Generator (SSSG) – predefined threshold time – Dynamic Sleep Signal Generator (DSSG) – variable threshold time – Both, SSSG and DSSG use instruction level analysis of the activity of the power-gated domains – applicable only to block-level power gating 10

Sleep Signal Generation Techniques for Power Gating (contd.) Proposed Technique – Fine granularity sleep signal generation technique using two-input and one-output LUT – Power-Gated activity detected by comparing phases of input data with output data – Unlike coarse-grain approach, the fine-grain approach utilizes a sleep controller for each logic block – Each logic block can be turned OFF after operation completion with a small delay comparable to that of few small gates – SSSG employed to reduce power overheads – DSSG consumes more resources – “always-running” global counter! 11

12 About FPGAs Fine-Grain Power Gating and Related Work – FPGA Power Considerations and LEDR – Wave-Pipelining for Bit-Serial FPGAs – Sleep Signal Generation Techniques for Power Gating Architecture of the proposed FPGA – Overview – Fundamental Principle of Autonomous Fine-Grain Power Gating Circuit Implementation Evaluation

Architecture of the proposed FPGA Overview Proposed Architecture – Four wires used, two for a data, one for acknowledge (ACK), and one for wake-up – wake-up signal used to wake up the next Logical Block (LB) 13 Overall Architecture [1]

Architecture of the proposed FPGA Overview (contd.) Cell Structure – Four wires used, two for a data, one for acknowledge (ACK), and one for wake-up – Four pass switches and a memory bit make a pass-switch block 14 Cell Structure [4]

Architecture of the proposed FPGA Overview (contd.) Direct Allocation of a Control/Data Flow Graph (CDFG) mapping used Operations mapped onto the LBs Input of a logic block is directly connected to the output of another logic block Reduction in the complexity of the interconnection network between LBs! 15 Direct Allocation [5] Operation Data Dependency

Fundamental Principle of Autonomous Fine-Grain Power Gating On arrival of new data on an LB, the phase of the input data is different from that of the output data On completion of operation, the phase of the input data and output data are the same Proposed sleep controller extracts and compares the phases of input and output data. 16 Activity detection using the asynchronous architecture [6]

Fundamental Principle of Autonomous Fine-Grain Power Gating (contd.) On same phase of input and output data, comparator output is 0 and 1 otherwise Problems: – Sleep Transistor takes time to switch ON on the arrival of new data – Switching power could be more than the “saved” power 17 Phase extraction and wake-up signal generation [7]

Fundamental Principle of Autonomous Fine-Grain Power Gating (contd.) Problems can be solved by introducing a standby state: 1.wake up the LB before the data arrives; 2.power OFF the LB only when the data does not come for quite a while (threshold time) 18 Sleep control strategy [8]

Fundamental Principle of Autonomous Fine-Grain Power Gating (contd.) 19 Illustration of the proposed power gating method [9]

20 About FPGAs Fine-Grain Power Gating and Related Work – FPGA Power Considerations and LEDR – Wave-Pipelining for Bit-Serial FPGAs – Sleep Signal Generation Techniques for Power Gating Architecture of the proposed FPGA – Overview – Fundamental Principle of Autonomous Fine-Grain Power Gating Circuit Implementation Evaluation

Circuit Implementation 21 Block diagram of a LB [10] The wake-up signal is used to wake up the LB before new data arrives Latches retain the wake-up signal until all the input data arrive Programmable delay is used to delay the sleep signal based on the set threshold time

Circuit Implementation (contd.) 22 Phase Comparator [11] Phase comparator – detects data arrival If Phase-a and Phase-b are different from Phase-out, it means that new data has arrived The output (cascaded with the programmable delay) controls the sleep transistor for the LUT Sleep transistor

Circuit Implementation (contd.) 23 Multiplexer Based LUT for LEDR encoding [12] Problem comes with invalid inputs – large number of multiplexers is required to keep the previous output

Circuit Implementation (contd.) 24 Block diagram of the proposed LUT [13] Decoders exclude invalid data patterns (ones with different phases) Number of transistors is reduced since invalid inputs aren’t fed to the multiplexer stage

Circuit Implementation (contd.) 25 Behavior with invalid inputs [14]

Circuit Implementation (contd.) 26 Behavior with valid inputs [14]

27 About FPGAs Fine-Grain Power Gating and Related Work – FPGA Power Considerations and LEDR – Wave-Pipelining for Bit-Serial FPGAs – Sleep Signal Generation Techniques for Power Gating Architecture of the proposed FPGA – Overview – Fundamental Principle of Autonomous Fine-Grain Power Gating Circuit Implementation Evaluation

28 The following results come from HSPICE simulation at 85°C: – Proposed FPGA when compared to conventional LEDR-based FPGA: 1.Standby power of the cell is reduced by 69% in the sleep state 2.The area and dynamic power of the logic block are increased by 14% and 6%, respectively – Proposed FPGA when compared to synchronous FPGA: 1.Standby power of the cell is reduced by 38% in the sleep state and delay is increased by 34% 2.The area is increased by 170% causing a reduction in available logic by 63% for the same given area

Evaluation (contd.) 29 The concepts demonstrated for a simple data path for a small LUT size can be applied to the complex data path architectures of modern FPGAs The power and area of the sleep controller are much smaller than those of the data path components such as a logic block and a switch block. In a more complex data path architecture, the overheads of the sleep controller are smaller.

Thank you for your time! Questions? 30

Image sources [1] Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama. (2011, Aug.). “A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating” IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. [On-line]. 19(8), pp. 1397, Fig. 3 Available: [Feb. 12, 2012] [2] Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama. (2011, Aug.). “A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating” IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. [On-line]. 19(8), pp. 1395, Fig. 1 Available: [Feb. 12, 2012] [3] Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama. (2011, Aug.). “A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating” IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. [On-line]. 19(8), pp. 1395, Fig. 2 Available: [Feb. 13, 2012] [4] Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama. (2011, Aug.). “A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating” IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. [On-line]. 19(8), pp. 1397, Fig. 4 Available: [Feb. 13, 2012] [5] Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama. (2011, Aug.). “A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating” IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. [On-line]. 19(8), pp. 1397, Fig. 5 Available: [Feb. 14, 2012] [6] Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama. (2011, Aug.). “A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating” IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. [On-line]. 19(8), pp. 1397, Fig. 6 Available: [Feb. 14, 2012] [7] Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama. (2011, Aug.). “A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating” IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. [On-line]. 19(8), pp. 1398, Fig. 7 Available: [Feb. 14, 2012] [8] Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama. (2011, Aug.). “A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating” IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. [On-line]. 19(8), pp. 1398, Fig. 9 Available: [Feb. 14, 2012] 31

[9] Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama. (2011, Aug.). “A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating” IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. [On-line]. 19(8), pp. 1398, Fig. 10 Available: [Feb. 15, 2012] [10] Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama. (2011, Aug.). “A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating” IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. [On-line]. 19(8), pp. 1399, Fig. 12 Available: [Feb. 15, 2012] [11] Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama. (2011, Aug.). “A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating” IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. [On-line]. 19(8), pp. 1399, Fig. 13 Available: [Feb. 15, 2012] [12] Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama. (2011, Aug.). “A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating” IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. [On-line]. 19(8), pp. 1400, Fig. 15 Available: [Feb. 15, 2012] [13] Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama. (2011, Aug.). “A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating” IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. [On-line]. 19(8), pp. 1400, Fig. 16 Available: [Feb. 15, 2012] [14] Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama. (2011, Aug.). “A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating” IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. [On-line]. 19(8), pp. 1400, Fig. 17 Available: [Feb. 15, 2012] [15] Shota Ishihara, Masanori Hariyama, and Michitaka Kameyama. (2011, Aug.). “A Low-Power FPGA Based on Autonomous Fine-Grain Power Gating” IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS. [On-line]. 19(8), pp. 1400, Fig. 18 Available: [Feb. 15, 2012] 32