CA Final Project – Multithreaded Processor with IPC Interface

Slides:



Advertisements
Similar presentations
RAMP Gold : An FPGA-based Architecture Simulator for Multiprocessors Zhangxi Tan, Andrew Waterman, David Patterson, Krste Asanovic Parallel Computing Lab,
Advertisements

Final Presentation Part-A
Reporter :LYWang We propose a multimedia SoC platform with a crossbar on-chip bus which can reduce the bottleneck of on-chip communication.
Lesson 15 – INSTALL AND SET UP NETWARE 5.1. Understanding NetWare 5.1 Preparing for installation Installing NetWare 5.1 Configuring NetWare 5.1 client.
Configurable System-on-Chip: Xilinx EDK
Modern trends in computer architecture and semiconductor scaling are leading towards the design of chips with more and more processor cores. Highly concurrent.
Implementation of DSP Algorithm on SoC. Mid-Semester Presentation Student : Einat Tevel Supervisor : Isaschar Walter Accompaning engineer : Emilia Burlak.
Implementation of DSP Algorithm on SoC. Characterization presentation Student : Einat Tevel Supervisor : Isaschar Walter Accompany engineer : Emilia Burlak.
Introduction to FPGA and DSPs Joe College, Chris Doyle, Ann Marie Rynning.
Lecture 7 Lecture 7: Hardware/Software Systems on the XUP Board ECE 412: Microcomputer Laboratory.
Using FPGAs with Embedded Processors for Complete Hardware and Software Systems Jonah Weber May 2, 2006.
Installing software on personal computer
GallagherP188/MAPLD20041 Accelerating DSP Algorithms Using FPGAs Sean Gallagher DSP Specialist Xilinx Inc.
WormCraft Tianyi Zhang tz2210 Ning Li nl2447 Yuxuan Zhang yz2580 Ziwei Zhang zz2282.
© 2011 Xilinx, Inc. All Rights Reserved This material exempt per Department of Commerce license exception TSU Xilinx Tool Flow.
2006 Chapter-1 L2: "Embedded Systems - Architecture, Programming and Design", Raj Kamal, Publs.: McGraw-Hill, Inc. 1 Introduction to Embedded Systems –
Prof. JunDong Cho VADA Lab. Project.
Content Project Goals. Term A Goals. Quick Overview of Term A Goals. Term B Goals. Gantt Chart. Requests.
SOC Consortium Course Material ASIC Logic National Taiwan University Adopted from National Chiao-Tung University IP Core Design.
SYSTEM-ON-CHIP (SoC) AND USE OF VLSI CIRCUIT DESIGN TECHNOLOGY.
SOC Consortium Course Material ASIC Logic Speaker: Lung-Hao Chang 張龍豪 Advisor: Prof. Andy Wu 吳安宇教授 May 21, 2003 National Taiwan University Adopted from.
© 2003 Xilinx, Inc. All Rights Reserved For Academic Use Only Xilinx Design Flow FPGA Design Flow Workshop.
집적회로 Spring 2007 Prof. Sang Sik AHN Signal Processing LAB.
IEEE ICECS 2010 SysPy: Using Python for processor-centric SoC design Evangelos Logaras Elias S. Manolakos {evlog, Department of Informatics.
ARM for Wireless Applications ARM11 Microarchitecture On the ARMv6 Connie Wang.
This material exempt per Department of Commerce license exception TSU Xilinx Tool Flow.
ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU FPGA Design with Xilinx ISE Presenter: Shu-yen Lin Advisor: Prof. An-Yeu Wu 2005/6/6.
An Architecture and Prototype Implementation for TCP/IP Hardware Support Mirko Benz Dresden University of Technology, Germany TERENA 2001.
Introductory project. Development systems Design Entry –Foundation ISE –Third party tools Mentor Graphics: FPGA Advantage Celoxica: DK Design Suite Design.
Sub-Nyquist Sampling Algorithm Implementation on Flex Rio
Lecture 12: Reconfigurable Systems II October 20, 2004 ECE 697F Reconfigurable Computing Lecture 12 Reconfigurable Systems II: Exploring Programmable Systems.
Fail-Safe Module for Unmanned Autonomous Vehicle
How to use ISE Dept. of Info & Comm. Eng. Prof. Jongbok Lee.
LAB 6 JTAG and Multi-ICE 第四組 R 蘇倉弘 R 莊銘罡 R 陳建志.
Implementing RISC Multi Core Processor Using HLS Language - BLUESPEC Liam Wigdor Instructor Mony Orbach Shirel Josef Semesterial Winter 2013.
Dynamic and On-Line Design Space Exploration for Reconfigurable Architecture Fakhreddine Ghaffari, Michael Auguin, Mohamed Abid Nice Sophia Antipolis University.
Sridhar Rajagopal Bryan A. Jones and Joseph R. Cavallaro
Software and Communication Driver, for Multimedia analyzing tools on the CEVA-X Platform. June 2007 Arik Caspi Eyal Gabay.
Maj Jeffrey Falkinburg Room 2E46E
M. Bellato INFN Padova and U. Marconi INFN Bologna
Use of FPGA for dataflow Filippo Costa ALICE O2 CERN
DE2-115 Control Panel - Part I
ARM Embedded Systems
Current Generation Hypervisor Type 1 Type 2.
ECE354 Embedded Systems Introduction C Andras Moritz.
Computer System Laboratory
Implementation of Embedded OS
Andes Technology Innovate SOC ProcessorsTM
ENG3050 Embedded Reconfigurable Computing Systems
Architecture & Organization 1
Performance Tuning Team Chia-heng Tu June 30, 2009
Overview of Embedded SoC Systems
JTAG and Multi-ICE National Taiwan University
Highly Efficient and Flexible Video Encoder on CPU+FPGA Platform
Figure 1 PC Emulation System Display Memory [Embedded SOC Software]
RECONFIGURABLE PROCESSING AND AVIONICS SYSTEMS
Emu: Rapid FPGA Prototyping of Network Services in C#
Course Agenda DSP Design Flow.
FPro Bus Protocol and MMIO Slot Specification
Architecture & Organization 1
Dynamically Reconfigurable Architectures: An Overview
ECEG-3202 Computer Architecture and Organization
Speaker: Tian-Sheuan Chang July, 2004
ECEG-3202 Computer Architecture and Organization
Five Key Computer Components
Multithreaded Programming
Programmable Logic- How do they do that?
Implementation of a GNSS Space Receiver on a Zynq
CS 286 Computer Organization and Architecture
JTAG, Multi-ICE and Angel
Presentation transcript:

CA Final Project – Multithreaded Processor with IPC Interface Part I – Project Review

Goal Multicore processors, e.g. the TI OMAP and the IBM Cell, are getting more and more popular either in the multimedia or the communication applications However, the processors are usually underutilized due to some latency IPC latency Pipeline latency Memory latency In this project, the students are requested to come up with some solutions to those latency and evaluate the effectiveness

Experiment Environment ARM Versatile platform Host processor (ARM926) DSP accelerator (MIPS compatible) on the FPGA Baseline (1 thread), 40MHz IMT (4 threads), 10MHz Applications JPEG decoder (VGA-sized)

Grading Evaluate the DSP utilization in the following cases (the same IPC mechanism) Multithreaded DSP MPU handles the IPC (software solution) DSP handles the IPC (hardware solution) MPU handles the IPC Single-threaded DSP Explain what is the most critical overhead in your dual-core processor experiment Explain whether the IMT-DSP is the good solution to reduce the overhead in your experiment

Reference S. H. Ou, T. J. Lin, X. S. Deng, Z. H. Zhuo, and C. W. Liu, “Multithreaded coprocessor interface for multicore multimedia SoC, ” in ASPDAC, 2008

Introduction to Versatile Programming Part II

Outline Environment setting Coding on the CodeWarrier Burn design on the FPGA

Environment Setting

Software Required Xilinx ISE (v9.1i) ARM Multi-ICE (v2.2) ARM Developer Suite (v1.2)

Xilinx ISE Install CD1 (main program) Install CD2 (FPGA model) Virtex 2 should be included

Multi-ICE Install Multi-ICE and its patch The patch contains ARM926EJ-S data Connect versatile and run “Multi-ICE Server”, 2 mode will be found Normal mode (config jumper open) Config mode (config jumper closed)

ARM Developer Suite Install Run Multi-ICE first (In normal mode) Run AXD Server and config as follows: Option->Configure Target->Add choose Multi-ICE\Multi-ICE.dll Configure->select ARM926EJ-S Load Image and start debug

Coding on the CodeWarrier

Example Create a “New Project” Include the source code and other files

Example Platform environment settings Select “Make” to generate *axf file and “Run” to run on the ARM926 1. Make Run choose ARM926EJ-S include DMA.scf 2.

Burn Design on the FPGA

AMBA Bus Protocol AMBA bus connection Address decode

RTL codes on Xilinx Project Navigator – Implementation Translate Merge multiple design files into a single netlist Map Group logical symbols from the netlist (gates) into physical components (slices and IOBs) Place & Route Place components onto the chip, connect them, and extract timing data into reports

Example Wrap your design with a wrapper Generate BIT file wrapper Click “Generate Programming File”

Example Multi-Ice mode selection Burn Boardfiles -> progcards_multiice.exe

Example ARM-FPGA co-process