ICCD 2008 1 Conversion Driven Design of Binary to Mixed Radix Circuits Ashur Rafiev, Julian Murphy, Danil Sokolov, Alex Yakovlev School of EECE, Newcastle.

Slides:



Advertisements
Similar presentations
Logic Circuits Design presented by Amr Al-Awamry
Advertisements

Andrey Mokhov, Victor Khomenko Danil Sokolov, Alex Yakovlev Dual-Rail Control Logic for Enhanced Circuit Robustness.
Functions and Functional Blocks
1 KU College of Engineering Elec 204: Digital Systems Design Lecture 9 Programmable Configurations Read Only Memory (ROM) – –a fixed array of AND gates.
06/05/08 Biscotti: a Framework for Token-Flow based Asynchronous Systems Charlie Brej.
Logic Synthesis – 3 Optimization Ahmed Hemani Sources: Synopsys Documentation.
Using Carry-Save Adders For Radix- 4, Can Be Used to Generate 3a – No Booth’s Slight Delay Penalty from CSA – 3 Gates.
CPEN Digital System Design
Copyright 2001, Agrawal & BushnellDay-1 PM Lecture 4a1 Design for Testability Theory and Practice Lecture 4a: Simulation n What is simulation? n Design.
Kazi Spring 2008CSCI 6601 CSCI-660 Introduction to VLSI Design Khurram Kazi.
1 Clockless Logic Montek Singh Thu, Jan 13, 2004.
Optimizing high speed arithmetic circuits using three-term extraction Anup Hosangadi Ryan Kastner Farzan Fallah ECE Department Fujitsu Laboratories University.
Mahapatra-Texas A&M-Fall'001 cosynthesis Introduction to cosynthesis Rabi Mahapatra CPSC498.
Computer Arithmetic, Multiplexers Prof. Sin-Min Lee Department of Computer Science.
Automated Generation of Layout and Control for Quantum Circuits Mark Whitney, Nemanja Isailovic, Yatish Patel, John Kubiatowicz University of California,
Computer ArchitectureFall 2008 © August 25, CS 447 – Computer Architecture Lecture 3 Computer Arithmetic (1)
Charles Kime & Thomas Kaminski © 2008 Pearson Education, Inc. (Hyperlinks are active in View Show mode) Chapter 3 – Combinational Logic Design Part 1 –
Chapter 5 Arithmetic Logic Functions. Page 2 This Chapter..  We will be looking at multi-valued arithmetic and logic functions  Bitwise AND, OR, EXOR,
Charles Kime & Thomas Kaminski © 2008 Pearson Education, Inc. (Hyperlinks are active in View Show mode) Chapter 3 – Combinational Logic Design Part 1 –
Overview Part 1 – Design Procedure 3-1 Design Procedure
Petros OikonomakosBashir M. Al-Hashimi Mark Zwolinski Versatile High-Level Synthesis of Self-Checking Datapaths Using an On-line Testability Metric Electronics.
Chapter 6-2 Multiplier Multiplier Next Lecture Divider
March 8, 2006Spectral RTL ATPG1 High-Level Spectral ATPG for Gate-level Circuits Nitin Yogi and Vishwani D. Agrawal Auburn University Department of ECE.
Philip Brisk 2 Paolo Ienne 2 Hadi Parandeh-Afshar 1,2 1: University of Tehran, ECE Department 2: EPFL, School of Computer and Communication Sciences Efficient.
Charles Kime & Thomas Kaminski © 2004 Pearson Education, Inc. Terms of Use (Hyperlinks are active in View Show mode) Terms of Use Lecture 12 – Design Procedure.
Fundamentals of IT UNIT-I OnlyforIPMCA. DIGITAL SIGNALS & LOGIC GATES Signals and data are classified as analog or digital. Analog refers to something.
UK Asynchronous Forum, September Synthesis of multiple rail phase encoding circuits Andrey Mokhov, Crescenzo D’Alessandro, Alex Yakovlev Microelectronics.
CS1Q Computer Systems Lecture 8
Combinational Logic Design BIL- 223 Logic Circuit Design Ege University Department of Computer Engineering.
Reconfigurable Computing - Multipliers: Options in Circuit Design John Morris Chung-Ang University The University of Auckland ‘Iolanthe’ at 13 knots on.
Digital Logic Design Review Dr. Ahmad Almulhem ahmadsm AT kfupm Phone: Office: Ahmad Almulhem, KFUPM 2010.
FORMAL VERIFICATION OF ADVANCED SYNTHESIS OPTIMIZATIONS Anant Kumar Jain Pradish Mathews Mike Mahar.
Charles Kime & Thomas Kaminski © 2008 Pearson Education, Inc. (Hyperlinks are active in View Show mode) Chapter 3 – Combinational Logic Design Part 1 –
Logic Design CS 270: Mathematical Foundations of Computer Science Jeremy Johnson.
TOPIC : SYNTHESIS INTRODUCTION Module 4.3 : Synthesis.
EE 466/586 VLSI Design Partha Pande School of EECS Washington State University
Anurag Dwivedi. Basic Block - Gates Gates -> Flip Flops.
1 Synthesizing Datapath Circuits for FPGAs With Emphasis on Area Minimization Andy Ye, David Lewis, Jonathan Rose Department of Electrical and Computer.
Optimality Study of Logic Synthesis for LUT-Based FPGAs Jason Cong and Kirill Minkovich.
CS1Q Computer Systems Lecture 8
CPEN Digital System Design
1 Fundamentals of Computer Science Combinational Circuits.
Routing Wire Optimization through Generic Synthesis on FPGA Carry Hadi P. Afshar Joint work with: Grace Zgheib, Philip Brisk and Paolo Ienne.
Number Representation (Part 2) Computer Architecture (Fall 2006)
CS151 Introduction to Digital Design Chapter 4: Arithmetic Functions and HDLs 4-1: Iterative Combinational Circuits 4-2: Binary Adders 1Created by: Ms.Amany.
RTL Hardware Design by P. Chu Chapter 9 – ECE420 (CSUN) Mirzaei 1 Sequential Circuit Design: Practice Shahnam Mirzaei, PhD Spring 2016 California State.
EECE 320 L8: Combinational Logic design Principles 1Chehab, AUB, 2003 EECE 320 Digital Systems Design Lecture 8: Combinational Logic Design Principles.
ASIC Design Methodology
VLSI Testing Lecture 5: Logic Simulation
VLSI Testing Lecture 5: Logic Simulation
CSCI-100 Introduction to Computing
Vishwani D. Agrawal Department of ECE, Auburn University
Summary Half-Adder Basic rules of binary addition are performed by a half adder, which has two binary inputs (A and B) and two binary outputs (Carry out.
Digital Systems Section 14 Registers. Digital Systems Section 14 Registers.
Two-phase Latch based design
ECE 551: Digital System Design & Synthesis
FIGURE 4.1 Block diagram of combinational circuit
Chapter 3 – Combinational Logic Design
VHDL Introduction.
FPGA Glitch Power Analysis and Reduction
UNIVERSITY OF MASSACHUSETTS Dept
UNIVERSITY OF MASSACHUSETTS Dept
UNIVERSITY OF MASSACHUSETTS Dept
Low Power Digital Design
Synthesis of multiple rail phase encoding circuits
Arithmetic Circuits.
UNIVERSITY OF MASSACHUSETTS Dept
Presentation transcript:

ICCD Conversion Driven Design of Binary to Mixed Radix Circuits Ashur Rafiev, Julian Murphy, Danil Sokolov, Alex Yakovlev School of EECE, Newcastle University, UK {ashur.rafiev, j.p.murphy, danil.sokolov, ncl.ac.uk

ICCD Outline  Switching Balanced Codes Conversion Driven Design (CDD) Motivation Conversion Basics Bitwise Approach Bitwise Gate Grouping Algorithm Artificial Combinational Loops Problem Operandwise Approach Operandwise Gate Grouping Algorithm Benchmark Results Conclusions Outline

ICCD Switching Balanced Codes M-of-N data encoding: data signal is represented with N wires M of them are active (high) Return-to-zero (RTZ) protocol: data signals are separated with dummy signals (spacers) Application areas: Security Asynchronous system design Network-on-chip communication Switching Balanced Codes

ICCD of-2 (Dual-Rail) and 1-of-4 Encodings multi-valuedsingle-rail (binary) dual-rail1-of NULLspacer

ICCD Outline  Switching Balanced Codes  Conversion Driven Design Motivation Conversion Basics Bitwise Approach Bitwise Gate Grouping Algorithm Artificial Combinational Loops Problem Operandwise Approach Operandwise Gate Grouping Algorithm Benchmark Results Conclusions Outline

ICCD Conversion Driven Design: Motivation + Higher radix signals consume less power and reduce cross- talk effect – Require multi-valued logic synthesis + Some tools and techniques already exist (e.g. MV-SIS) – Moving away from the RTL design flow is frequently frowned upon by industry  Reuse existing popular design tools for multi- valued system design

ICCD Conversion Driven Design Flow EDA Tool Synthesize binary circuit from specification HDL specification Conversion Tool fast fully automated Convert to mixed radix component level design Flatten components to the gate level netlist Component library Place and route Final design AREA OF THE RESEARCH

ICCD …… mixerssplitters Conversion Basics …… binary inputs binary outputs binary …… quaternary inputs quaternary outputs quaternary Original binary datapath is given as a structural HDL netlist.Pairs of binary signals can be grouped into quaternary.Certain part of the circuit may remain binary.

ICCD Conversion Basics Signal converters:  A splitter converts one quaternary signal into two binary.  A mixer converts two binary signals into one quaternary.  The way the signals (gates) are grouped determines the efficiency of the conversion, therefore the conversion problem corresponds directly to the gate grouping problem.

ICCD Outline  Switching Balanced Codes  Conversion Driven Design Motivation Conversion Basics  Bitwise Approach Bitwise Gate Grouping Algorithm Artificial Combinational Loops Problem Operandwise Approach Operandwise Gate Grouping Algorithm Benchmark Results Conclusions Outline

ICCD Bitwise Gate Grouping A={A0, A1} B={B0, B1} Q = {Q0, Q1} Q = A·B Q0 = A0·B0 Q1 = A1·B1 A0A1 B0B1 Q0Q a bitwise quaternary component

ICCD Bitwise Gate Grouping Algorithm  Uses heuristics to extract bitwise meaning of signals from the flat netlist. Input and output port grouping is given Algorithm is iterative, based on breadth-first search Bitwise Regularity Ratio is used as an estimation criteria. It is calculated for each gate pair on each iteration. Bitwise Regularity Ratio (BRR) depends on how many quaternary links a pair of gates can form with respect to the state of the conversion on the given iteration.

ICCD Bitwise Gate Grouping Algorithm: Example BRR=2 original circuitconverted circuit * mixed radix components are shown as black boxes

ICCD Artificial Combinational Loops  Combinational loops can appear during the bitwise conversion while the original circuit is free of combinational loops.  Need special methodology to handle. Problems: If mixers wait for valid data from both inputs – deadlock. If mixers produce output regardless to spacers – invalid output. A signal should pass artificial combinational loop exactly 2 times before it produce a valid output.

ICCD Bitwise Gate Grouping Disadvantages of the algorithm:  Computational cost O(N) = 2N 2 log 2 2 N, N is a number of gates in the original circuit. Disadvantages of the approach:  Inefficient for circuits without bitwise nature of signals, e.g. S-boxes.  The algorithm can produce combinational loops. Bitwise (naive) approach is inefficient for CDD.

ICCD Outline  Switching Balanced Codes  Conversion Driven Design Motivation Conversion Basics  Bitwise Approach Bitwise Gate Grouping Algorithm Artificial Combinational Loops Problem  Operandwise Approach Operandwise Gate Grouping Algorithm Benchmark Results Conclusions Outline

ICCD Operandwise Gate Grouping  Bitwise grouping is derived from the functional meaning of signals.  Operandwise grouping is derived from the structural positioning of gates – works with 2-input gates only. A0A1 B0B1 Q0Q A0B0 A1B1 Q0Q a bitwise quaternary componentan operandwise quaternary component A0 B0 A1 B1 Q0 Q1

ICCD Binary Trees Approach For binary tree structures within a netlist we can group inputs and outputs of gates to perform an operandwise grouping.

ICCD Binary Trees Approach Signals cannot be shared between groups, because it leads to duplication of gates. Gates with multiple fanout block operandwise grouping. – Remain binary

ICCD Quaternary-to-Binary Gates (Q/B Gates) Q/B gate – a gate with one quaternary input and one binary output.

ICCD Quaternary-to-Binary Gates (Q/B Gates)  A Q/B gate is a mixed radix component.  A Q/B gate is an incomplete operandwise group.  It can replace a splitter followed by a binary gate.

ICCD Operandwise Gate Grouping Algorithm  Phase I: group all signals regardless of gate fanouts (some gates will be duplicated).  Output ports can be grouped arbitrarily.  Phase II: analyse duplicates and discard groups leading to duplication.  Phase III: insert signal converters and Q/B gates. Computational cost of the algorithm is O(N) = 3N, where N is a number of gates in the original circuit.

ICCD Operandwise Gate Grouping Algorithm: Example original circuitconverted circuit Phase IPhase IIPhase III

ICCD Operandwise Gate Grouping Advantages of the algorithm:  Low computational cost.  It is highly modular: one can add more passes to the algorithm to increase efficiency of the conversion. Disadvantages of the algorithm:  Can produce significant “fractioning” of binary and quaternary parts of the circuit increasing the number of signal converters required.

ICCD Benchmark Results circuit dual-railmixed radix: dual-rail, 1-of-4 switching wires switching energyswitching wires switching energy averagestd. devaveragestd. dev 2-bit adder bit ripple carry adder bit multiplier* Kasumi S-box Kasumi S-box Kasumi S-box9* AES S-box* * original single-rail circuits were optimised in Synopsys.

ICCD Outline  Switching Balanced Codes  Conversion Driven Design Motivation Conversion Basics  Bitwise Approach Bitwise Gate Grouping Algorithm Artificial Combinational Loops Problem  Operandwise Approach Operandwise Gate Grouping Algorithm Benchmark Results  Conclusions Outline

ICCD Conclusions  Conversion driven design technique was suggested in order to reuse popular EDA tools for MVL synthesis.  Binary and quaternary mixed radix was selected to improve the efficiency of the conversion.  Two conversion (gate grouping) algorithms were implemented and analysed. Bitwise approach is not efficient for CDD Operandwise approach is fast and flexible but not efficient enough in terms of saving switching energy. Future work:  Improve operandwise component implementations.  Add more heuristics to the operandwise algorithm to increase the efficiency of the conversion.

ICCD The End Thank you! Questions?