Course-Grained Reconfigurable Devices. 2 Dataflow Machines General Structure:  ALU-computing elements,  Programmable interconnections,  I/O components.

Slides:



Advertisements
Similar presentations
IO Interfaces and Bus Standards. Interface circuits Consists of the cktry required to connect an i/o device to a computer. On one side we have data bus.
Advertisements

HARDWARE Rashedul Hasan..
MEMORY popo.
Commercial FPGAs: Altera Stratix Family Dr. Philip Brisk Department of Computer Science and Engineering University of California, Riverside CS 223.
Reconfigurable Computing (EN2911X, Fall07) Lecture 04: Programmable Logic Technology (2/3) Prof. Sherief Reda Division of Engineering, Brown University.
TK2633 Introduction to Parallel Data Interfacing DR MASRI AYOB.
Processor System Architecture
DH2T 34 Computer Architecture 1 LO2 Lesson Two CPU and Buses.
Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 ECE 697F Reconfigurable Computing Lecture 9 Coarse Grained FPGA Architecture.
Computer Organization. This module surveys the physical resources of a computer system. –Basic components CPUMemoryBus I/O devices –CPU structure Registers.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR SRAM-based FPGA n SRAM-based LE –Registers in logic elements –LUT-based logic element.
Multithreading and Dataflow Architectures CPSC 321 Andreas Klappenecker.
University College Cork IRELAND Hardware Concepts An understanding of computer hardware is a vital prerequisite for the study of operating systems.
Recap – Our First Computer WR System Bus 8 ALU Carry output A B S C OUT F 8 8 To registers’ input/output and clock inputs Sequence of control signal combinations.
Dynamic NoC. 2 Limitations of Fixed NoC Communication NoC for reconfigurable devices:  NOC: a viable infrastructure for communication among task dynamically.
Basic Computer Organization CH-4 Richard Gomez 6/14/01 Computer Science Quote: John Von Neumann If people do not believe that mathematics is simple, it.
5.1 Chaper 4 Central Processing Unit Foundations of Computer Science  Cengage Learning.
Chapter 17 Microprocessor Fundamentals William Kleitz Digital Electronics with VHDL, Quartus® II Version Copyright ©2006 by Pearson Education, Inc. Upper.
CS-334: Computer Architecture
The Structure of the CPU
Computer Organization Computer Organization & Assembly Language: Module 2.
Computer Systems 1 Fundamentals of Computing The CPU & Von Neumann.
Paper Review I Coarse Grained Reconfigurable Arrays Presented By: Matthew Mayhew I.D.# ENG*6530 Tues, June, 10,
Input/OUTPUT [I/O Module structure].
Dr. Rabie A. Ramadan Al-Azhar University Lecture 6
Computer Architecture and Organization Introduction.
Top Level View of Computer Function and Interconnection.
(More) Interfacing concepts. Introduction Overview of I/O operations Programmed I/O – Standard I/O – Memory Mapped I/O Device synchronization Readings:
Advanced Computer Architecture 0 Lecture # 1 Introduction by Husnain Sherazi.
Memory and Storage Dr. Rebhi S. Baraka
Computer Organization - 1. INPUT PROCESS OUTPUT List different input devices Compare the use of voice recognition as opposed to the entry of data via.
Computer Architecture Lecture 2 System Buses. Program Concept Hardwired systems are inflexible General purpose hardware can do different tasks, given.
EEE440 Computer Architecture
Computer Organization & Assembly Language © by DR. M. Amer.
Introduction Computer System “An electronic device, operating under the control of instructions stored in its own memory unit, that can accept data (input),
Chapter 2 Data Manipulation. © 2005 Pearson Addison-Wesley. All rights reserved 2-2 Chapter 2: Data Manipulation 2.1 Computer Architecture 2.2 Machine.
Lecture 12: Reconfigurable Systems II October 20, 2004 ECE 697F Reconfigurable Computing Lecture 12 Reconfigurable Systems II: Exploring Programmable Systems.
Introduction to Microprocessors
Embedded Network Interface (ENI). What is ENI? Embedded Network Interface Originally called DPO (Digital Product Option) card Printer without network.
COARSE GRAINED RECONFIGURABLE ARCHITECTURES 04/18/2014 Aditi Sharma Dhiraj Chaudhary Pruthvi Gowda Rachana Raj Sunku DAY
Computer Architecture Lecture 32 Fasih ur Rehman.
Computer Organization. This module surveys the physical resources of a computer system.  Basic components  CPU  Memory  Bus  I/O devices  CPU structure.
Computer Hardware A computer is made of internal components Central Processor Unit Internal External and external components.
Programmable Logic Controllers LO1: Understand the design and operational characteristics of a PLC system.
By Fernan Naderzad.  Today we’ll go over: Von Neumann Architecture, Hardware and Software Approaches, Computer Functions, Interrupts, and Buses.
4 Linking the Components Linking The Components A computer is a system with data and instructions flowing between its components in response to processor.
Computer operation is of how the different parts of a computer system work together to perform a task.
Mapping of Regular Nested Loop Programs to Coarse-grained Reconfigurable Arrays – Constraints and Methodology Presented by: Luis Ortiz Department of Computer.
Architectural Considerations A Review of Some Architectural Concepts.
بسم الله الرحمن الرحيم MEMORY AND I/O.
1 Basic Processor Architecture. 2 Building Blocks of Processor Systems CPU.
The Processor & its components. The CPU The brain. Performs all major calculations. Controls and manages the operations of other components of the computer.
Chapter Microcontroller
Chapter 3 System Buses.  Hardwired systems are inflexible  General purpose hardware can do different tasks, given correct control signals  Instead.
BASIC COMPUTER ARCHITECTURE HOW COMPUTER SYSTEMS WORK.
Chapter 1 Introduction.   In this chapter we will learn about structure and function of computer and possibly nature and characteristics of computer.
System on a Programmable Chip (System on a Reprogrammable Chip)
Computer Organization and Architecture Lecture 1 : Introduction
PROGRAMMABLE LOGIC CONTROLLERS SINGLE CHIP COMPUTER
Topics SRAM-based FPGA fabrics: Xilinx. Altera..
The Xilinx Virtex Series FPGA
The Xilinx Virtex Series FPGA
Presentation transcript:

Course-Grained Reconfigurable Devices

2 Dataflow Machines General Structure:  ALU-computing elements,  Programmable interconnections,  I/O components. Most dominating coarse-grained systems:  PACT XPP  NEC-DRP  PicoChip  Morphosys  [RaPiD]  Chameleon

3

4 PACT XPP V. Baumgarte, G. Ehlers, F. May, A. Nueckel, M. Vorbach, and M. Weinhardt, “PACT XPP A self-reconfigurable data processing architecture,” J. Supercomput., vol. 26, no. 2, pp. 167–184, M. Petrov, T. Murgan, F. May, M. Vorbach, P. Zipf, and M. Glesner, “The XPP architecture and its co-simulation within the simulink environment.” in Proceedings of International Conference on Field-Programmable Logic and Applications (FPL), ser. Lecture Notes in Computer Science (LNCS), vol Antwep, Belgium: Springer, Aug. 2004, pp. 761–770.

5 PACT XPP Aim:  Efficiently compute streams of data provided from different sources (e.g. A/D converters) rather than single instructions (as in Von-Neumann computers). Characteristic:  Computation should be done while data are streaming through the processing elements  it is suitable to configure the PEs to adapt to the natural computation paradigm of a given application.

6 Course Grain Architectures

7 PACT XPP: Architecture XPP (Extreme Processing Platform)  A hierarchical structure consisting of PAEs PAEs  Course grain PEs  Adaptive  Clustered in PACs  PA = PAC + CM  A hierarchical configuration tree  Memory elements (aside PAs)  I/O elements (on each side of the chip) PA

8 PACT XPP Architecture: CM CM (Configuration Manager):  Powerful run-time reconfiguration: −Configuration control is distributed over several CMs −PAEs can be configured rapidly in parallel while neighboring PAEs are processing data.  Entire applications can be configured and run independently on different parts of the array.  Reconfiguration can be triggered: −externally or − internally (by special event signals originating within the array −  self-reconfiguring Local CM:  One configuration manager (CM) attached to a local memory is responsible for writing configuration onto a PA.  The CMs at a lower level are controlled by a CM at the next higher level. Root CM:  Attached to an external configuration memory.  Supervises the whole device configuration.

9 XPP Architecture Scalability:  Can cascade multiple devices in a multi-chip module  Root CMs act like ordinary, subordinate CMs CM:  consists of a state machine +  internal RAM for configuration caching

10 PACT XPP Architecture: PAE 1.ALU PAE has: 1.ALU: is configured to perform basic operations: −Common fixed-point arithmetical and logical operations −Special three-input opcodes (e.g. multiply-add, sort, counters) −Generate events (e.g. counting termination, ovf, …) 2.Back Register: provides routing channels for data and events from bottom to top 3.Forward Register: provides routing channels from top to bottom

11 PACT XPP Architecture: PAE  Dataflow-Registers: used at the object output for data buffering in case of a pipeline stall.  Input Registers : can be pre-loaded by configuration data and always provide single cycle stall.

12 PACT XPP Architecture: PAE 2.RAM PAE:  As ALU PAE but instead of ALU, it has a dual port RAM  Useful for data storage (intermediate results) −Can be used in FIFO or RAM mode  Useful for LUT-based functions  The RAM generates a data packet after an address was received at the input.  Writing to the RAM requires two data packets: 1.for the address 2.for the data to be written. RAM

13 PACT XPP Architecture: Communication PAE Objects communicate via a packet-oriented network:  Two types of packets: −Data packets: uniform bit width for a device (specific to the device type, e.g 32) −Event packets: one or a few bits wide  Self-synchronizing: −An operation is performed as soon as all necessary data input packets are available. −The results are forwarded as soon as they are available, provided the previous results have been consumed. −  Thus possible to map a DFG directly to ALU objects, and to pipeline input data streams through it.  Event signals: −can trigger a self-reconfiguration −Can control the merging of data-streams

14 PACT XPP: Routing Routing and Communication:  Two independent networks: 1.for data transmission 2.for event transmission

15 PACT XPP: Routing 1.Horizontal Channel to connect a PAE within a row. 2.Vertical Channel to connect objects to a given horizontal bus. 3.Configuration Bus Horizontal routing channels Vertical routing channels

16 PACT XPP: Interface  Number and type of interfaces vary from device to device XPP42-A1: 6 internal interfaces consisting of:  4 identical general purpose I/O on- chip interfaces (bottom left, upper left, upper right, and bottom right)  One configuration manager (not shown on the picture)  One JTAG (Join Test Action Group, "IEEE Standard ") Boundary scan interface or for testing purpose Interfaces

The PACT XPP - Interface  The I/O interfaces can operate independent from each other. Two operation modes  The RAM mode  The streaming mode RAM mode :  Each port can access external Static RAM (SRAM).  Control signals for the SRAM transaction are available.  No additional logic required

The PACT XPP - Interface Streaming mode:  For high speed streaming of data to and from the device  Each I/O element provides two bidirectional ports for data streaming  Handshake signals are used for synchronization of data packets to external port