Research on Reconfigurable Computing Using Impulse C Carmen Li Shen Mentor: Dr. Russell Duren February 1, 2008.

Slides:



Advertisements
Similar presentations
Field Programmable Gate Array
Advertisements

FPGA (Field Programmable Gate Array)
Progress With iBOBs at Jodrell Bits & Bytes Meeting, JBO, th Dec 2007 Jonathan Hargreaves Electronic Engineer, Jodrell Bank Observatory.
StreamBlade SOE TM Initial StreamBlade TM Stream Offload Engine (SOE) Single Board Computer SOE-4-PCI Rev 1.2.
1 SECURE-PARTIAL RECONFIGURATION OF FPGAs MSc.Fisnik KRAJA Computer Engineering Department, Faculty Of Information Technology, Polytechnic University of.
Khaled A. Al-Utaibi  Computers are Every Where  What is Computer Engineering?  Design Levels  Computer Engineering Fields  What.
An Introduction to Reconfigurable Computing Mitch Sukalski and Craig Ulmer Dean R&D Seminar 11 December 2003.
Steven Koelmeyer BDS(hons)1 Reconfigurable Hardware for use in Ad Hoc Sensor Networks Supervisors Charles Greif Nandita Bhattacharjee.
Characterization Presentation Neural Network Implementation On FPGA Supervisor: Chen Koren Maria Nemets Maxim Zavodchik
1 Performed By: Khaskin Luba Einhorn Raziel Einhorn Raziel Instructor: Rivkin Ina Spring 2004 Spring 2004 Virtex II-Pro Dynamical Test Application Part.
Moving NN Triggers to Level-1 at LHC Rates Triggering Problem in HEP Adopted neural solutions Specifications for Level 1 Triggering Hardware Implementation.
© 2004 Xilinx, Inc. All Rights Reserved Implemented by : Alon Ben Shalom Yoni Landau Project supervised by: Mony Orbach High speed digital systems laboratory.
BEEKeeper Remote Management and Debugging of Large FPGA Clusters Terry Filiba Navtej Sadhal.
Department of Electrical and Computer Engineering Texas A&M University College Station, TX Abstract 4-Level Elevator Controller Lessons Learned.
Hardware accelerator for PPC microprocessor Final presentation By: Instructor: Kopitman Reem Fiksman Evgeny Stolberg Dmitri.
Performed by : Rivka Cohen and Sharon Solomon Instructor : Walter Isaschar המעבדה למערכות ספרתיות מהירות High Speed Digital Systems Laboratory הטכניון.
Configurable System-on-Chip: Xilinx EDK
The Xilinx EDK Toolset: Xilinx Platform Studio (XPS) Building a base system platform.
Technion Digital Lab Project Xilinx ML310 board based on VirtexII-PRO programmable device Students: Tsimerman Igor Firdman Leonid Firdman Leonid.
An FPGA Based Adaptive Viterbi Decoder Sriram Swaminathan Russell Tessier Department of ECE University of Massachusetts Amherst.
Presenting: Itai Avron Supervisor: Chen Koren Characterization Presentation Spring 2005 Implementation of Artificial Intelligence System on FPGA.
Hardware accelerator for PPC microprocessor by: Dimitry Stolberg Reem Kopitman Instructor: Evgeny Fiksman.
Final Presentation Neural Network Implementation On FPGA Supervisor: Chen Koren Maria Nemets Maxim Zavodchik
Characterization Presentation Neural Network Implementation On FPGA Supervisor: Chen Koren Maria Nemets Maxim Zavodchik
Reconfigurable Computing in the Undergraduate Curriculum Jason D. Bakos Dept. of Computer Science and Engineering University of South Carolina.
XUP Virtex-5 Development System January XUP Virtex52 Introducing XUPV5-LX110T A powerful and versatile platform packaged and priced for Academia!
Implementation of DSP Algorithm on SoC. Mid-Semester Presentation Student : Einat Tevel Supervisor : Isaschar Walter Accompaning engineer : Emilia Burlak.
Implementation of DSP Algorithm on SoC. Characterization presentation Student : Einat Tevel Supervisor : Isaschar Walter Accompany engineer : Emilia Burlak.
Introduction to FPGA and DSPs Joe College, Chris Doyle, Ann Marie Rynning.
Lecture 7 Lecture 7: Hardware/Software Systems on the XUP Board ECE 412: Microcomputer Laboratory.
Field Programmable Gate Array (FPGA) Layout An FPGA consists of a large array of Configurable Logic Blocks (CLBs) - typically 1,000 to 8,000 CLBs per chip.
System Architecture A Reconfigurable and Programmable Gigabit Network Interface Card Jeff Shafer, Hyong-Youb Kim, Paul Willmann, Dr. Scott Rixner Rice.
1 A survey on Reconfigurable Computing for Signal Processing Applications Anne Pratoomtong Spring2002.
FPGA Based Fuzzy Logic Controller for Semi- Active Suspensions Aws Abu-Khudhair.
© 2011 Xilinx, Inc. All Rights Reserved Intro to System Generator This material exempt per Department of Commerce license exception TSU.
Out-of-Order OpenRISC 2 semesters project Semester A: Implementation of OpenRISC on XUPV5 board Final A Presentation By: Vova Menis-Lurie Sonia Gershkovich.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
© Copyright Xilinx 2004 All Rights Reserved 9 November, 2004 XUP Virtex-II Pro Development System.
Ross Brennan On the Introduction of Reconfigurable Hardware into Computer Architecture Education Ross Brennan
Out-of-Order OpenRISC 2 semesters project Semester A: Implementation of OpenRISC on XUPV5 board Midterm Presentation By: Vova Menis-Lurie Sonia Gershkovich.
Experimental Performance Evaluation For Reconfigurable Computer Systems: The GRAM Benchmarks Chitalwala. E., El-Ghazawi. T., Gaj. K., The George Washington.
Performance and Overhead in a Hybrid Reconfigurable Computer O. D. Fidanci 1, D. Poznanovic 2, K. Gaj 3, T. El-Ghazawi 1, N. Alexandridis 1 1 George Washington.
1 3-General Purpose Processors: Altera Nios II 2 Altera Nios II processor A 32-bit soft core processor from Altera Comes in three cores: Fast, Standard,
Lecture 14 Reconfigurable Computing Basics Lecturer: Simon Winberg.
SLAAC SV2 Briefing SLAAC Retreat, May 2001 Heber, UT Brian Schott USC Information Sciences Institute.
COMPUTER SCIENCE &ENGINEERING Compiled code acceleration on FPGAs W. Najjar, B.Buyukkurt, Z.Guo, J. Villareal, J. Cortes, A. Mitra Computer Science & Engineering.
1 of 23 Fouts MAPLD 2005/C117 Synthesis of False Target Radar Images Using a Reconfigurable Computer Dr. Douglas J. Fouts LT Kendrick R. Macklin Daniel.
RiceNIC: A Reconfigurable and Programmable Gigabit Network Interface Card Jeff Shafer, Dr. Scott Rixner Rice Computer Architecture:
Automated Design of Custom Architecture Tulika Mitra
Advanced Computer Architecture, CSE 520 Generating FPGA-Accelerated DFT Libraries Chi-Li Yu Nov. 13, 2007.
Efficient Implementation of a String Matching Algorithm for SRC and Cray Reconfigurable Computers Esam El-Araby 1, Mohamed Taher 1, Tarek El-Ghazawi 1,
J. Christiansen, CERN - EP/MIC
FPGA (Field Programmable Gate Array): CLBs, Slices, and LUTs Each configurable logic block (CLB) in Spartan-6 FPGAs consists of two slices, arranged side-by-side.
© 2004 Xilinx, Inc. All Rights Reserved Implemented by : Alon Ben Shalom Yoni Landau Project supervised by: Mony Orbach High speed digital systems laboratory.
1 EDK 7.1 Tutorial -- SystemACE and EthernetMAC on Avnet Virtex II pro Development Boards Chia-Tien Dan Lo Department of Computer Science University of.
Part A Presentation Implementation of DSP Algorithm on SoC Student : Einat Tevel Supervisor : Isaschar Walter Accompanying engineer : Emilia Burlak The.
© 2004 Xilinx, Inc. All Rights Reserved Embedded Processor Design.
Algorithm and Programming Considerations for Embedded Reconfigurable Computers Russell Duren, Associate Professor Engineering And Computer Science Baylor.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
Lecture 12: Reconfigurable Systems II October 20, 2004 ECE 697F Reconfigurable Computing Lecture 12 Reconfigurable Systems II: Exploring Programmable Systems.
A Monte Carlo Simulation Accelerator using FPGA Devices Final Year project : LHW0304 Ng Kin Fung && Ng Kwok Tung Supervisor : Professor LEONG, Heng Wai.
Evaluating and Improving an OpenMP-based Circuit Design Tool Tim Beatty, Dr. Ken Kent, Dr. Eric Aubanel Faculty of Computer Science University of New Brunswick.
This material exempt per Department of Commerce license exception TSU Xilinx On-Chip Debug.
Connecting EPICS with Easily Reconfigurable I/O Hardware EPICS Collaboration Meeting Fall 2011.
UClinux console (HyperTerminal) Memec V2MB1000 prototyping board running uClinux on embedded Xilinx® MicroBlaze™ processor Development system with Xilinx.
Survey of Reconfigurable Logic Technologies
VHDL From Ch. 5 Hardware Description Languages. History 1980’s Schematics 1990’s Hardware Description Languages –Increased due to the use of Programming.
CoDeveloper Overview Updated February 19, Introducing CoDeveloper™  Targeting hardware/software programmable platforms  Target platforms feature.
Ming Liu, Wolfgang Kuehn, Zhonghai Lu, Axel Jantsch
RECONFIGURABLE PROCESSING AND AVIONICS SYSTEMS
Presentation transcript:

Research on Reconfigurable Computing Using Impulse C Carmen Li Shen Mentor: Dr. Russell Duren February 1, 2008

Presentation Overview Background Information Introduction Impulse C Current Work Conclusion & Future Research Questions

Background Information Reconfigurable computing Field Programmable Gate Arrays (FPGAs) Hardware Description Languages (HDLs): –Verilog –VHDL C++ and C-based software programming languages: –System C –Impulse C

Reconfigurable Computing Employing programmable logic devices where the hardware-based logic itself is being modified Reprogram hardware vs. modifying the program that use a fixed hardware configuration Programming FPGAs vs. Von Neumann Computers –Reconnecting internal gates to modify the hardware –The hw is optimized to perform one function –Vs. changing software running on a processor Image provided by:

Field Programmable Gate Array Custom Circuitry μProc RAM I/O Microprocessor User I/O TCP/IP Control & Test Benches Custom Circuitry Complex calculations (e.g. NN, DSP) FPGA Image provided by:

SRC-6e Hardware Architecture Features: 2 XC2V6000 FPGA 288 MACs, BRAMs 2 Pentium 3 24MB of SRAM 64-bit ports Cost ~ $300,000 Intel® μP L2 MIOC PCICommon Memory SNAPSNAP Controller On-Board Memory (24 MB) FPGA Intel® μP L2 μP Board FPGA 6x 800 MB/s MAP Chain Port 800 MB/s 315/195 MB/s Chain Port 800 MB/s

XUP Virtex II Pro Platform Features: XC2VP30 FPGA 136 MACs, BRAMs 2 PowerPC 256 MB DDR SDRAM 10/100 Ethernet SATA connectors Serial, JTAG, audio, video, USB, etc. ports Cost ~ $300 - $1,600

Research Our research: –Impulse C –Multiple FPGAs Methodology: –Implement a calculation- intensive program –Compare to previous work and the SRC-6e Image provided by: Willis Troy Dr. Eisenbarth Dr. Duren

Neural Network Trained network 27 inputs 3 Hidden Layers (with & 70 nodes) 1200 outputs Additions, multiplication, squashing

Impulse C C-language development tool FPGA-accelerated computing Function library for parallel programming fully compatible with ANSI C CoDeveloper Tools Mixed software/hardware Cost ~ 3,000 Image provided by:

Impulse C Data movement via streams and shared memory Shared memory tradeoff: large but slow –Memory accessed via OPB bus (opb2plb bridge) Floating point implementation supported Customized instructions –xil_printf (2,953 bytes) vs printf (51,788 bytes) –Does not support type real numbers (floating point) or long-long types (64 bit)

Impulse C to Bitstream Build Simulation Executable Launch ANSI-C Simulation Executable Generate HDL select a platform target Export Generated Hardware Export Generated Software Xilinx Platform Studio Project (EDK)

Image Filter DMA Example

Current Implementation Inputs & 3 Hidden Layers 600 Output Nodes Neural Network

Big_NeuralNet_sw.c Software Processes Memory Object

Big_NeuralNet_hw.c Hardware Process Configuration Function

Sigmoid function y(x) = -y0”*(x – x0)2 + y0’*(x – x0) + y0

Projects Comparison Similarities Reconfigurable Computing Neural Network and Weights FPGAs Differences Implementation using VHDL vs. C Fixed point vs. Floating point Platforms / Architectures

Timing Results for Neural Network Solutions ArchitectureLanguageExecution Time PC – Pentium 4C280 µs SRC-6E Carte C (parallel) µs VHDL (serial)1000 µs VHDL (parallel node)250 µs VHDL (parallel input)15 µs Baylor RC Cluster VHDL (1 board)15 µs VHDL (3 boards)6.7 µs Impulse C (3 boards)TBD Impulse C (16 boards)TBD 2x

Conclusion & Future Work Reconfigurable Computing SRC-6e vs. XUP boards architectures NN Calculations & Timing Results Explore different levels of parallelism across multiple FPGA boards using multiple communication schemes Ethernet, MPI, SATA Interfaces RC cluster of Virtex II PRO Willis Troy Dr. Eisenbarth Dr. Duren

Acknowledgements Dr. Russell Duren Dr. Steven Eisenbarth Willis Troy

Questions