ELEC 6200, Fall 07, Oct 24 Jiang: Async. Processor 1 Asynchronous Processor Design for ELEC 6200 by Wei Jiang.

Slides:



Advertisements
Similar presentations
Accessing I/O Devices Processor Memory BUS I/O Device 1 I/O Device 2.
Advertisements

Registers Computer Organization I 1 September 2009 © McQuain, Feng & Ribbens A clock is a free-running signal with a cycle time. A clock may.
INPUT-OUTPUT ORGANIZATION
CS370 – Spring 2003 Hazards/Glitches. Time Response in Combinational Networks Gate Delays and Timing Waveforms Hazards/Glitches and How To Avoid Them.
Reading1: An Introduction to Asynchronous Circuit Design Al Davis Steve Nowick University of Utah Columbia University.
ELEC 256 / Saif Zahir UBC / 2000 Timing Methodology Overview Set of rules for interconnecting components and clocks When followed, guarantee proper operation.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 24: November 4, 2011 Synchronous Circuits.
Slide 1/20IWLS 2003, May 30Early Output Logic with Anti-Tokens Charlie Brej, Jim Garside APT Group Manchester University.
COMP3221: Microprocessors and Embedded Systems Lecture 17: Computer Buses and Parallel Input/Output (I) Lecturer: Hui.
Synchronous Digital Design Methodology and Guidelines
CSE477 L19 Timing Issues; Datapaths.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 19: Timing Issues; Introduction to Datapath.
Clock Design Adopted from David Harris of Harvey Mudd College.
1 Clockless Logic Montek Singh Thu, Jan 13, 2004.
Chapter XI Reduced Instruction Set Computing (RISC) CS 147 Li-Chuan Fang.
COMP Clockless Logic and Silicon Compilers Lecture 3
ENGIN112 L20: Sequential Circuits: Flip flops October 20, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 20 Sequential Circuits: Flip.
Modern VLSI Design 2e: Chapter 5 Copyright  1998 Prentice Hall PTR Topics n Memory elements. n Basics of sequential machines.
CS 151 Digital Systems Design Lecture 20 Sequential Circuits: Flip flops.
TECH CH03 System Buses Computer Components Computer Function
68000 Interface Timing Diagrams Outline –68000 Read Cycle –68000 Write Cycle Goal –Understand bus cycles –Learn how to attach memory, peripherals.
Spring 07, Feb 22 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Power Aware Microprocessors Vishwani D. Agrawal.
Fall 2009 / Winter 2010 Ran Ginosar (
Avshalom Elyada, Ran GinosarPipeline Synchronization 1 Pipeline Synchronization Continued This second part is based on the recent article Bridging Clock.
1 Combining the strengths of UMIST and The Victoria University of Manchester Asynchronous Signal Processing Systems Linda Brackenbury APT GROUP, Computer.
Pipelining By Toan Nguyen.
Lecture 24: CPU Design Today’s topic –Multi-Cycle ALU –Introduction to Pipelining 1.
INPUT-OUTPUT ORGANIZATION
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
CS-334: Computer Architecture
Clockless Chips Date: October 26, Presented by:
MOUSETRAP Ultra-High-Speed Transition-Signaling Asynchronous Pipelines Montek Singh & Steven M. Nowick Department of Computer Science Columbia University,
Top Level View of Computer Function and Interconnection.
1 Presented By Şahin DELİPINAR Simon Moore,Peter Robinson,Steve Wilcox Computer Labaratory,University Of Cambridge December 15, 1995 Rotary Pipeline Processors.
SEQUENTIAL CIRCUITS Component Design and Use. Register with Parallel Load  Register: Group of Flip-Flops  Ex: D Flip-Flops  Holds a Word of Data 
EEE440 Computer Architecture
Reading1: An Introduction to Asynchronous Circuit Design Al Davis Steve Nowick University of Utah Columbia University.
12004 MAPLD: 153Brej Early output logic and Anti-Tokens Charlie Brej APT Group Manchester University.
Reading Assignment: Rabaey: Chapter 9
12/16/  List the elements of 8255A Programmable Peripheral Interface (PPI)  Explain its various operating modes  Develop a simple program to.
Basic LED Interface.
EE 466/586 VLSI Design Partha Pande School of EECS Washington State University
11/15/05ELEC / Lecture 191 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic Circuits.
Chapter 3 System Buses.  Hardwired systems are inflexible  General purpose hardware can do different tasks, given correct control signals  Instead.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 20: October 25, 2010 Pass Transistors.
1 Recap: Lecture 4 Logic Implementation Styles:  Static CMOS logic  Dynamic logic, or “domino” logic  Transmission gates, or “pass-transistor” logic.
Dept. of Electrical Engineering
1 Clockless Logic Montek Singh Thu, Mar 2, Review: Logic Gate Families  Static CMOS logic  Dynamic logic, or “domino” logic  Transmission gates,
Advanced Digital Design
Interconnection Structures
Welcome To Seminar Presentation Seminar Report On Clockless Chips
16.317: Microprocessor System Design I
Other Approaches.
Roadmap History Synchronized vs. Asynchronous overview How it works
Recap: Lecture 1 What is asynchronous design? Why do we want to study it? What is pipelining? How can it be used to design really fast hardware?
Assembly Language for Intel-Based Computers, 5th Edition
Chapter 3 Top Level View of Computer Function and Interconnection
COMP2121: Microprocessors and Interfacing
Clockless Logic: Asynchronous Pipelines
William Stallings Computer Organization and Architecture 7th Edition
The Programmable Peripheral Interface (8255A)
A Quasi-Delay-Insensitive Method to Overcome Transistor Variation
Early output logic and Anti-Tokens
Pipelining and Superscalar Techniques
Presentation transcript:

ELEC 6200, Fall 07, Oct 24 Jiang: Async. Processor 1 Asynchronous Processor Design for ELEC 6200 by Wei Jiang

ELEC 6200, Fall 07, Oct 24 2Jiang: Async. Processor Why Asynchronous Design Higher Performance Higher Performance No global clock No global clock proceed data at appropriate rate of environment proceed data at appropriate rate of environment Do not propagate local delay globally Do not propagate local delay globally Better Power Efficiency Better Power Efficiency Only activating functional units consume power Only activating functional units consume power Inactivated parts remain in “stand-by” state Inactivated parts remain in “stand-by” state No wasteful power dissipation by glitches No wasteful power dissipation by glitches Smaller Chip Size Smaller Chip Size Less high-frequency EMI components due to small amplitude and wide current peaks Less high-frequency EMI components due to small amplitude and wide current peaks

ELEC 6200, Fall 07, Oct 24 3Jiang: Async. Processor Sync vs. Async Pipeline Asynchronous Pipeline: No global clock No delay for clock transition

ELEC 6200, Fall 07, Oct 24 4Jiang: Async. Processor Dynamic Voltage Control Power supply is controlled by measuring the occupancy of input FIFO As FIFO fills the supply voltage will increase and the processor will operate faster to empty the FIFO

ELEC 6200, Fall 07, Oct 24 5Jiang: Async. Processor Asynchronous Circuit Design Level-sensitive/Four- phase signaling protocol simpler design Transition-sensitive/two- phase protocol: no “nonactive” state fewer transitions higher performance lower power dissipation

ELEC 6200, Fall 07, Oct 24 6Jiang: Async. Processor Asynchronous Interface Two-phase Pipeline handshake protocol Sender to ensure that all bits of the data bundle are valid prior to sending the Request event (local delay management); Receiver to accept the data bundle prior to sending its Acknowledge which will permit the Sender to remove the old data and place a new value on the data lines

ELEC 6200, Fall 07, Oct 24 7Jiang: Async. Processor Event Driven Pipeline Components must respond to input transitions (events) rather than logic levels Components must respond to input transitions (events) rather than logic levels Muller C-gate: o Muller C-gate: outputs a transition only when all inputs have experienced transitions changing their input levels Storage elements must respond identically to rising or falling transitions Storage elements must respond identically to rising or falling transitions

ELEC 6200, Fall 07, Oct 24 8Jiang: Async. Processor Asynchronous Storage Capture-Pass latch Capture-Pass latch Event on Capture line causes latch to hold input data Event on Capture line causes latch to hold input data Capture-done event indicates completion of capture operation Capture-done event indicates completion of capture operation Event on Pass line cause latch to return transparent state Event on Pass line cause latch to return transparent state Pass-done event indicates completion of pass operation Pass-done event indicates completion of pass operation Interconnect Capture-Pass latches using Muller C-gates to form Pipeline Interconnect Capture-Pass latches using Muller C-gates to form Pipeline

ELEC 6200, Fall 07, Oct 24 9Jiang: Async. Processor Prevention of Hazards Data Hazards Data Hazards An instruction depends on the results of a previous instruction still in the pipeline An instruction depends on the results of a previous instruction still in the pipeline Prevented by Register Locking mechanism: stall the instruction until write-back of register takes place Prevented by Register Locking mechanism: stall the instruction until write-back of register takes place Pipeline must support multiple asynchronous read/write operations Pipeline must support multiple asynchronous read/write operations Control Hazards Control Hazards The instructions after branch instruction are loaded into pipeline but may not be executed The instructions after branch instruction are loaded into pipeline but may not be executed Prevented by Delayed Branch mechanism: the next instructions is always executed Prevented by Delayed Branch mechanism: the next instructions is always executed

ELEC 6200, Fall 07, Oct 24 10Jiang: Async. Processor Example: AMULET1

ELEC 6200, Fall 07, Oct 24 11Jiang: Async. Processor Example: Asynchronous DLX

ELEC 6200, Fall 07, Oct 24 12Jiang: Async. Processor Conclusion Asynchronous design is Asynchronous design is competitive with the best synchronous design in power efficiency and is close in performance and silicon area References References SCALP: A Superscalar Asynchronous Low-Power Processor, Philip Brian Endecott, Ph.D thesis of University of Manchester, UK, 1996 SCALP: A Superscalar Asynchronous Low-Power Processor, Philip Brian Endecott, Ph.D thesis of University of Manchester, UK, 1996 AMULET1: An Asynchronous ARM Microprocessor, J. V. Woods et al, IEEE Trans. on Computers, Vol.46.4, p , Apr 1997 AMULET1: An Asynchronous ARM Microprocessor, J. V. Woods et al, IEEE Trans. on Computers, Vol.46.4, p , Apr 1997 Automating the Design of an Asynchronous DLX Microprocessor, Manish Amde et al, DAC’03, p , June 2003 Automating the Design of an Asynchronous DLX Microprocessor, Manish Amde et al, DAC’03, p , June 2003