Paper review: High Speed Dynamic Asynchronous Pipeline: Self Precharging Style Name : Chi-Chuan Chuang Date : 2013/03/20.

Slides:



Advertisements
Similar presentations
Self-Timed Logic Timing complexity growing in digital design -Wiring delays can dominate timing analysis (increasing interdependence between logical and.
Advertisements

Andrey Mokhov, Victor Khomenko Danil Sokolov, Alex Yakovlev Dual-Rail Control Logic for Enhanced Circuit Robustness.
A Robust, Fast Pulsed Flip- Flop Design By: Arunprasad Venkatraman Rajesh Garg Sunil Khatri Department of Electrical and Computer Engineering, Texas A.
MICROELETTRONICA Sequential circuits Lection 7.
(Neil west - p: ). Finite-state machine (FSM) which is composed of a set of logic input feeding a block of combinational logic resulting in a set.
Design and Application of Power Optimized High-Speed CMOS Frequency Dividers.
1 Clockless Logic  Recap: Lookahead Pipelines  High-Capacity Pipelines.
Introduction to CMOS VLSI Design Lecture 19: Design for Skew David Harris Harvey Mudd College Spring 2004.
Chapter 09 Advanced Techniques in CMOS Logic Circuits
Introduction to CMOS VLSI Design Clock Skew-tolerant circuits.
Synchronous Digital Design Methodology and Guidelines
CSE477 L19 Timing Issues; Datapaths.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 19: Timing Issues; Introduction to Datapath.
Clock Design Adopted from David Harris of Harvey Mudd College.
Embedding of Asynchronous Wave Pipelines into Synchronous Data Processing Stephan Hermanns, Sorin Alexander Huss University of Technology Darmstadt, Germany.
A 16-Bit Kogge Stone PS-CMOS adder with Signal Completion Seng-Oon Toh, Daniel Huang, Jan Rabaey May 9, 2005 EE241 Final Project.
© Ran Ginosar Lecture 3: Handshake Ckt Implementations 1 VLSI Architectures Lecture 3 S&F Ch. 5: Handshake Ckt Implementations.
Digital Integrated Circuits© Prentice Hall 1995 Timing ISSUES IN TIMING.
1 A Modular Synchronizing FIFO for NoCs Vainbaum Yuri.
1 Clockless Logic Montek Singh Thu, Jan 13, 2004.
1 Clockless Logic Montek Singh Tue, Mar 23, 2004.
1 Clockless Logic Montek Singh Tue, Mar 16, 2004.
Low Power Design for Wireless Sensor Networks Aki Happonen.
COMP Clockless Logic and Silicon Compilers Lecture 3
Lab for Reliable Computing Generalized Latency-Insensitive Systems for Single-Clock and Multi-Clock Architectures Singh, M.; Theobald, M.; Design, Automation.
1 Clockless Logic Montek Singh Tue, Mar 21, 2006.
High-Throughput Asynchronous Pipelines for Fine-Grain Dynamic Datapaths Montek Singh and Steven Nowick Columbia University New York, USA
1 Clockless Computing Montek Singh Thu, Sep 13, 2007.
Lecture 11 MOUSETRAP: Ultra-High-Speed Transition-Signaling Asynchronous Pipelines.
1 Recap: Lectures 5 & 6 Classic Pipeline Styles 1. Williams and Horowitz’s PS0 pipeline 2. Sutherland’s micropipelines.
1 Clockless Logic: Dynamic Logic Pipelines (contd.)  Drawbacks of Williams’ PS0 Pipelines  Lookahead Pipelines.
Digital Integrated Circuits for Communication
Subthreshold Dual Mode Logic
Micropipeline design in asynchronous circuit Wilson Kwan M.A.Sc. Candidate Ottawa-Carleton Institute for Electrical & Computer Engineering (OCIECE) Carleton.
EE415 VLSI Design DYNAMIC LOGIC [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
Clockless Chips Date: October 26, Presented by:
UK Asynchronous Forum, September Synthesis of multiple rail phase encoding circuits Andrey Mokhov, Crescenzo D’Alessandro, Alex Yakovlev Microelectronics.
Amitava Mitra Intel Corp., Bangalore, India William F. McLaughlin
MOUSETRAP Ultra-High-Speed Transition-Signaling Asynchronous Pipelines Montek Singh & Steven M. Nowick Department of Computer Science Columbia University,
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
MICAS Department of Electrical Engineering (ESAT) Design-In for EMC on digital circuit October 27th, 2005 AID–EMC: Low Emission Digital Circuit Design.
Ratioed Circuits Ratioed circuits use weak pull-up and stronger pull-down networks. The input capacitance is reduced and hence logical effort. Correct.
DCSL & LVDCSL: A High Fan-in, High Performance Differential Current Switch Logic Families Dinesh Somasekhaar, Kaushik Roy Presented by Hazem Awad.
Optimal digital circuit design Mohammad Sharifkhani.
1 Presented By Şahin DELİPINAR Simon Moore,Peter Robinson,Steve Wilcox Computer Labaratory,University Of Cambridge December 15, 1995 Rotary Pipeline Processors.
NTU Confidential Test Asynchronous FIR Filter Design Presenter: Po-Chun Hsieh Advisor:Tzi-Dar Chiueh Date: 2003/12/1.
A Robust Pulse-triggered Flip-Flop and Enhanced Scan Cell Design
Asynchronous circuit design in control driven approach Name: Chi-Chuan Chuang Date:
1 Clockless Computing Montek Singh Thu, Sep 6, 2007  Review: Logic Gate Families  A classic asynchronous pipeline by Williams.
UNIVERSITY OF ROSTOCK Institute of Applied Microelectronics and Computer Science Single-Rail Self-timed Logic Circuits in Synchronous Designs Frank Grassert,
12004 MAPLD: 153Brej Early output logic and Anti-Tokens Charlie Brej APT Group Manchester University.
Reader: Pushpinder Kaur Chouhan
Reading Assignment: Rabaey: Chapter 9
Dynamic Logic Dynamic Circuits will be introduced and their performance in terms of power, area, delay, energy and AT2 will be reviewed. We will review.
Sept. 2005EE37E Adv. Digital Electronics Lesson 2 Advanced Aspects of Digital Logic.
NTU Confidential Introduction to the Applications of Asynchronous Circuits Presenter: Po-Chun Hsieh Advisor:Tzi-Dar Chiueh Date: 2003/09/22.
1 Practical Design and Performance Evaluation of Completion Detection Circuits Fu-Chiung Cheng Department of Computer Science Columbia University.
1 Bridging the gap between asynchronous design and designers Peter A. BeerelFulcrum Microsystems, Calabasas Hills, CA, USA Jordi CortadellaUniversitat.
ECEN 248: INTRODUCTION TO DIGITAL SYSTEMS DESIGN Dr. Shi Dept. of Electrical and Computer Engineering.
1 Recap: Lecture 4 Logic Implementation Styles:  Static CMOS logic  Dynamic logic, or “domino” logic  Transmission gates, or “pass-transistor” logic.
Clockless Chips Under the esteemed guidance of Romy Sinha Lecturer, REC Bhalki Presented by: Lokesh S. Woldoddy 3RB05CS122 Date:11 April 2009.
TOPIC : Introduction to Sequential Circuits UNIT 1: Modeling and Simulation Module 4 : Modeling Sequential Circuits.
1 Clockless Logic Montek Singh Thu, Mar 2, Review: Logic Gate Families  Static CMOS logic  Dynamic logic, or “domino” logic  Transmission gates,
Recap: Lecture 1 What is asynchronous design? Why do we want to study it? What is pipelining? How can it be used to design really fast hardware?
Clocking in High-Performance and Low-Power Systems Presentation given at: EPFL Lausanne, Switzerland June 23th, 2003 Vojin G. Oklobdzija Advanced.
332:578 Deep Submicron VLSI Design Lecture 14 Design for Clock Skew
Clockless Logic: Asynchronous Pipelines
Synthesis of multiple rail phase encoding circuits
Synchronous, Wave and Asynchronous pipeling
Wagging Logic: Moore's Law will eventually fix it
A Quasi-Delay-Insensitive Method to Overcome Transistor Variation
Presentation transcript:

Paper review: High Speed Dynamic Asynchronous Pipeline: Self Precharging Style Name : Chi-Chuan Chuang Date : 2013/03/20

Outline Introduction Dual-rail asynchronous pipelines – PS0 pipeline – Lookahead pipelines Self precharging pipeline Timing constraints Simulation results Conclusion Advices

Introduction Asynchronous circuit’s functional blocks communicate with handshake protocol Asynchronous pipeline’s advantages – No global clock distribution problem – No clock skew – Lower power consumption – Automatically adapt to the environments

Introduction (cont.) Asynchronous pipelines have two types – Single-rail topology – Dual-rail topology Single-rail topology – Less area and wiring load – But always takes the worst case delay and additional timing margins Dual-rail topology – More robust data depended completion detection – Low throughput

Dual-rail asynchronous pipelines Williams’ PS0 pipeline Lookahead pipelines – LP3/1 pipeline – LP2/2 pipeline – LP2/1 pipeline Enhanced lookahead pipelines – Enhanced LP3/1 pipeline – Enhanced LP2/1 pipeline

Dual-rail asynchronous pipelines (cont.)

PS0 pipeline precharge

LP3/1 pipeline precharge

LP2/2 pipeline precharge

Asymmetric C element (aC)

LP2/1 pipeline precharge

Enhanced lookahead pipelines LP3/1 and LP2/1 pipelines have the problem of higher wiring load and larger number of inter-stage control signals It is difficult to communicate with the environments Enhanced lookahead pipelines reduce wire load but increase the cycle time

Enhanced LP3/1 pipeline precharge

Enhanced LP2/1 pipeline precharge

Comparison Reduce interstage control signal Communicate with environment

Self precharging pipeline Has all properties of a dual rail pipeline Each pipeline stage consists of – A functional block with domino gates – A completion detector – An special asymmetric C-element The completion detectors are moved just after the previous functional block – interstage wiring load is reduced

Self precharging pipeline (cont.) Completion detector’s done signal is used to precharge both the special aC and the functional block, called self precharging

Special asymmetric C element This completion detector has lesser area, delay and power consumption

Special asymmetric C element (cont.) Has two inputs coming from the CDs of current stage N and next stage (N+1) It’s functionality – When N+1 is 1 => output is 1 – When N+1 is 0 and N is 1 => output is 0 – Hold the previous value otherwise

Self precharging pipeline (cont.) precharge

Timing constraints

Simulation results Layout in 90nm UMC process, at 1.2V supply, temperature is 300K, normal process corner

Simulation results (cont.) Power and area – Enhanced LP3/1 has the highest power consumption and area, followed by PS0 – LP2/2 has the lowest power consumption and area – Enhanced LP2/1 and SP have almost same area and power consumption (but SP slightly higher)

Conclusion Self precharging protocol CDs are just placed after the previous stage aC removes the self takeover timing constraint of the LP family, makes it simpler to design High throughput (2.227G data items/s) Area and power consumption are comparable with LP2/1 pipeline low latency, high robustness, low power, avoidance of explicit latches etc. compared with synchronous counter parts

Comments

Thanks for your attention