Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 ECE 697F Reconfigurable Computing Lecture 9 Coarse Grained FPGA Architecture.

Slides:



Advertisements
Similar presentations
Computer Organization, Bus Structure
Advertisements

Field Programmable Gate Array
Architecture-Specific Packing for Virtex-5 FPGAs
Commercial FPGAs: Altera Stratix Family Dr. Philip Brisk Department of Computer Science and Engineering University of California, Riverside CS 223.
Reconfigurable Computing (EN2911X, Fall07) Lecture 04: Programmable Logic Technology (2/3) Prof. Sherief Reda Division of Engineering, Brown University.
TIE Extensions for Cryptographic Acceleration Charles-Henri Gros Alan Keefer Ankur Singla.
The Microprocessor is no more General Purpose. Design Gap.
Survey of Reconfigurable Logic Technologies
CGRA QUIZ. Quiz What is the fundamental drawback of fine-grained architecture that led to exploration of coarse grained reconfigurable architectures?
Course-Grained Reconfigurable Devices. 2 Dataflow Machines General Structure:  ALU-computing elements,  Programmable interconnections,  I/O components.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR SRAM-based FPGA n SRAM-based LE –Registers in logic elements –LUT-based logic element.
Zheming CSCE715.  A wireless sensor network (WSN) ◦ Spatially distributed sensors to monitor physical or environmental conditions, and to cooperatively.
PipeRench: A Coprocessor for Streaming Multimedia Acceleration Seth Goldstein, Herman Schmit et al. Carnegie Mellon University.
Lecture 2: Field Programmable Gate Arrays I September 5, 2013 ECE 636 Reconfigurable Computing Lecture 2 Field Programmable Gate Arrays I.
Lecture 26: Reconfigurable Computing May 11, 2004 ECE 669 Parallel Computer Architecture Reconfigurable Computing.
ECE669 L11: Static Routing Architectures March 4, 2004 ECE 669 Parallel Computer Architecture Lecture 11 Static Routing Architectures.
ENGIN112 L38: Programmable Logic December 5, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 38 Programmable Logic.
Programmable logic and FPGA
Trends toward Spatial Computing Architectures Dr. André DeHon BRASS Project University of California at Berkeley.
ECE669 L23: Parallel Compilation April 29, 2004 ECE 669 Parallel Computer Architecture Lecture 23 Parallel Compilation.
CS 151 Digital Systems Design Lecture 38 Programmable Logic.
Chapter 6 Memory and Programmable Logic Devices
Chapter 5 Array Processors. Introduction  Major characteristics of SIMD architectures –A single processor(CP) –Synchronous array processors(PEs) –Data-parallel.
February 12, 1998 Aman Sareen DPGA-Coupled Microprocessors Commodity IC’s for the Early 21st Century by Aman Sareen School of Electrical Engineering and.
Lecture 2: Field Programmable Gate Arrays September 13, 2004 ECE 697F Reconfigurable Computing Lecture 2 Field Programmable Gate Arrays.
Paper Review I Coarse Grained Reconfigurable Arrays Presented By: Matthew Mayhew I.D.# ENG*6530 Tues, June, 10,
A Reconfigurable Processor Architecture and Software Development Environment for Embedded Systems Andrea Cappelli F. Campi, R.Guerrieri, A.Lodi, M.Toma,
Coarse and Fine Grain Programmable Overlay Architectures for FPGAs
Paper Review: XiSystem - A Reconfigurable Processor and System
A RISC ARCHITECTURE EXTENDED BY AN EFFICIENT TIGHTLY COUPLED RECONFIGURABLE UNIT Nikolaos Vassiliadis N. Kavvadias, G. Theodoridis, S. Nikolaidis Section.
ECE 465 Introduction to CPLDs and FPGAs Shantanu Dutt ECE Dept. University of Illinois at Chicago Acknowledgement: Extracted from lecture notes of Dr.
Lecture 15: Multi-FPGA System Software I November 1, 2004 ECE 697F Reconfigurable Computing Lecture 15 Mid-term Review.
Modern VLSI Design 4e: Chapter 6 Copyright  2008 Wayne Wolf Topics Memories: –ROM; –SRAM; –DRAM; –Flash. Image sensors. FPGAs. PLAs.
J. Christiansen, CERN - EP/MIC
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR FPGA Fabric n Elements of an FPGA fabric –Logic element –Placement –Wiring –I/O.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n FPGA fabric architecture concepts.
A Reconfigurable Low-power High-Performance Matrix Multiplier Architecture With Borrow Parallel Counters Counters : Rong Lin SUNY at Geneseo
Lecture 16: Reconfigurable Computing Applications November 3, 2004 ECE 697F Reconfigurable Computing Lecture 16 Reconfigurable Computing Applications.
Lecture 10: Logic Emulation October 8, 2013 ECE 636 Reconfigurable Computing Lecture 13 Logic Emulation.
Lecture 13: Logic Emulation October 25, 2004 ECE 697F Reconfigurable Computing Lecture 13 Logic Emulation.
CprE / ComS 583 Reconfigurable Computing Prof. Joseph Zambreno Department of Electrical and Computer Engineering Iowa State University Lecture #24 – Reconfigurable.
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) Reconfigurable Architectures Forces that drive.
Lecture 12: Reconfigurable Systems II October 20, 2004 ECE 697F Reconfigurable Computing Lecture 12 Reconfigurable Systems II: Exploring Programmable Systems.
COARSE GRAINED RECONFIGURABLE ARCHITECTURES 04/18/2014 Aditi Sharma Dhiraj Chaudhary Pruthvi Gowda Rachana Raj Sunku DAY
M.Mohajjel. Why? TTM (Time-to-market) Prototyping Reconfigurable and Custom Computing 2Digital System Design.
ESS | FPGA for Dummies | | Maurizio Donna FPGA for Dummies Basic FPGA architecture.
CprE / ComS 583 Reconfigurable Computing Prof. Joseph Zambreno Department of Electrical and Computer Engineering Iowa State University Lecture #12 – Systolic.
Survey of multicore architectures Marko Bertogna Scuola Superiore S.Anna, ReTiS Lab, Pisa, Italy.
1 Advanced Digital Design Reconfigurable Logic by A. Steininger and M. Delvai Vienna University of Technology.
Mapping of Regular Nested Loop Programs to Coarse-grained Reconfigurable Arrays – Constraints and Methodology Presented by: Luis Ortiz Department of Computer.
Survey of Reconfigurable Logic Technologies
1 Field-programmable Gate Array Architectures and Algorithms Optimized for Implementing Datapath Circuits Andy Gean Ye University of Toronto.
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) CPRE 583 Reconfigurable Computing Lecture.
CprE / ComS 583 Reconfigurable Computing Prof. Joseph Zambreno Department of Electrical and Computer Engineering Iowa State University Lecture #23 – Function.
Lecture 17: Dynamic Reconfiguration I November 10, 2004 ECE 697F Reconfigurable Computing Lecture 17 Dynamic Reconfiguration I Acknowledgement: Andre DeHon.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n FPGA fabric architecture concepts.
Lecture 4: Contrasting Processors: Fixed and Configurable September 20, 2004 ECE 697F Reconfigurable Computing Lecture 4 Contrasting Processors: Fixed.
Reconfigurable Computing1 Reconfigurable Computing Part II.
1 Architecture of Datapath- oriented Coarse-grain Logic and Routing for FPGAs Andy Ye, Jonathan Rose, David Lewis Department of Electrical and Computer.
집적회로설계 1 Spring 2007 Prof. Sang Sik AHN Signal Processing LAB.
Field Programmable Gate Arrays
Sequential Logic Design
Topics Coarse-grained FPGAs. Reconfigurable systems.
Topics SRAM-based FPGA fabrics: Xilinx. Altera..
Give qualifications of instructors: DAP
Application-Specific Customization of Soft Processor Microarchitecture
SmartCell: A Coarse-Grained Reconfigurable Architecture for High Performance and Low Power Embedded Computing Xinming Huang Depart. Of Electrical and Computer.
CprE / ComS 583 Reconfigurable Computing
Application-Specific Customization of Soft Processor Microarchitecture
Reconfigurable Computing (EN2911X, Fall07)
Presentation transcript:

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 ECE 697F Reconfigurable Computing Lecture 9 Coarse Grained FPGA Architecture

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Overview Review 5 different coarse-grained architectures How do they vary? - Basic block -Inter-block communication -Communication protocol -Software summary -Results Device trends

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Coarse-grained Architectures DP-FPGA -LUT-based -LUTs share configuration bits Rapid -Specialized ALUs, mutlipliers -1D pipeline Matrix -2-D array of ALUs Chess -Augmented, pipelined matrix Raw -Full RISC core as basic block -Static scheduling used for communication

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 DP-FPGA Break FPGA into datapath and control sections Save storage for LUTs and connection transistors Key issue is grain size Cherepacha/Lewis – U. Toronto

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Configuration Sharing MC = LUT SRAM bits CE = connection block pass transistors Set MC = 2-3CE Y0Y0 Y1Y1 A0A0 B0B0 C0C0 A1A1 B1B1 C1C1

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Two-dimensional Layout Control network supports distributed signals. Data routed as four-bit values.

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 DP-FPGA Technology Mapping Ideal case would be if all datapath divisible by 4, no “irregularities” Area improvement includes logic values only. Shift logic included

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Rapid Reconfigurable Pipeline Datapath Ebeling –University of Washington Uses hard-coded functional units (ALU, Memory, multiply) Good for signal processing Linear array of processing elements. Cell

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Rapid Datapath Segmented linear architecture All RAMs and ALUs are pipelined Bus connectors also contain registers

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Rapid Control Path In addition to static control, control pipeline allows dynamic control. LUTs provide simple programmability. Cells can be chained together to form continuous pipe.

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 FIR Filter Example Measure system response to input impulse Coefficients used to scale input. Running sum determined total.

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Mapping a Tap to Rapid Chain multiple taps together (one multiplier per tap)

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Matrix Dehon and Mirsky -> MIT 2-dimensional array of ALUs Each Basic Functional Unit contains “processor” (ALU + SRAM) Ideal for systolic and VLIW computation. 8-bit computation Forerunner of SiliconSpice product.

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Basic Functional Unit Two inputs from adjacent blocks. Local memory for instructions, data.

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 BFU Interconnect Near-neighbor and quad connectivity Pipelined interconnect at ALU inputs Data transferred in 8-bit groups. Interconnect not pipelined

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 BFU Inputs Each ALU inputs come from several sources. Note that source is locally configurable based on data values.

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Filtering Example y i = w 1 * x i + w 2 * x i+1 + w 3 * x i+2 + … w k * x i+k-1 For k-weight filter 4K cells needed -One result every 2 cycles K/2 8x8 multiplies per cycle K=8

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Chess HP Labs – Bristol, England 2-D array – similar to Matrix Contains more “FPGA-like” routing resources. No reported software or application results Doesn’t support incremental compilation

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Chess Interconnect More like an FPGA Takes advantage of near- neighbor connectivity

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Chess Basic Block Switchbox memory can be used as storage ALU core for computation

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Chess Statistics Use metrics to evaluate computational power. Efficient multiplies due to embedded ALU Process independent.

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Reconfigurable Architecture Workstation MIT Computer Architecture Group Full RISC processor located as processing element. Routing decoupled into switch mode Parallelizing compiler used to distribute work load. Large amount of memory per tile.

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 RAW Tile Full functionality in each tile Static router located for near-neighbor communication

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 RAW Datapath

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Raw Compiler Parallelizes compilation across multiple tiles Orchestrates communication between tiles Some dynamic (data dependent) routing possible.

Lecture 9: Coarse Grained FPGA Architecture October 6, 2004 Summary Architectures moving in the direction of coarse- grained blocks. Latest trend is functional pipeline. Communication determined at compile time Software support still a major issue.