A Flexible Architecture for Simulation and Testing (FAST) Multiprocessor Systems John D. Davis, Lance Hammond, Kunle Olukotun Computer Systems Lab Stanford.

Slides:



Advertisements
Similar presentations
Nios Multi Processor Ethernet Embedded Platform Final Presentation
Advertisements

RAMP Gold : An FPGA-based Architecture Simulator for Multiprocessors Zhangxi Tan, Andrew Waterman, David Patterson, Krste Asanovic Parallel Computing Lab,
Jared Casper, Ronny Krashinsky, Christopher Batten, Krste Asanović MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA, USA A Parameterizable.
System Simulation Of 1000-cores Heterogeneous SoCs Shivani Raghav Embedded System Laboratory (ESL) Ecole Polytechnique Federale de Lausanne (EPFL)
Maciej Gołaszewski Tutor: Tadeusz Sondej, PhD Design and implementation of softcore dual processor system on single chip FPGA Design and implementation.
CSC457 Seminar YongKang Zhu December 6 th, 2001 About Network Processor.
Hardwired networks on chip for FPGAs and their applications
The Stanford Smart Memories: A 90nm, 55M transistor, 61mm², 8-core chip multiprocessor VLSI technology scaling is driving changes Designs are getting complex.
Performance Characterization of the Tile Architecture Précis Presentation Dr. Matthew Clark, Dr. Eric Grobelny, Andrew White Honeywell Defense & Space,
Astronomical Array Control & Acquisition System at NAOC Zhaowang Zhao Binxun Ye Research Labs for Astronomy National Astronomical Observatories, Chinese.
1 Multi - Core fast Communication for SoPC Multi - Core fast Communication for SoPC Technion – Israel Institute of Technology Department of Electrical.
Define Embedded Systems Small (?) Application Specific Computer Systems.
Target Control Electronics Upgrade 08/01/2009 J. Leaver P. Smith.
Climate Machine Update David Donofrio RAMP Retreat 8/20/2008.
Murali Vijayaraghavan MIT Computer Science and Artificial Intelligence Laboratory RAMP Retreat, UC Berkeley, January 11, 2007 A Shared.
6/30/2015HY220: Ιάκωβος Μαυροειδής1 Moore’s Law Gordon Moore (co-founder of Intel) predicted in 1965 that the transistor density of semiconductor chips.
Martin Kruliš by Martin Kruliš (v1.0)1.
SSS 4/9/99CMU Reconfigurable Computing1 The CMU Reconfigurable Computing Project April 9, 1999 Mihai Budiu
Performed by: Yevgeny Kliteynik Ofir Cohen Instructor: Yevgeny Fixman המעבדה למערכות ספרתיות מהירות High speed digital systems laboratory הטכניון - מכון.
Kabuki 2800 “a real-time digital audio effects system for performance” team “Big Country” presents ECEN4610 Preliminary Design Review 14 September 2006.
1 Design of the Front End Readout Board for TORCH Detector 10, June 2010.
Lecture 7 Lecture 7: Hardware/Software Systems on the XUP Board ECE 412: Microcomputer Laboratory.
PlayStation 2 Architecture Irin Jose Farid Momin Quy Ngo Olivia Wong.
Hardware Overview Net+ARM – Well Suited for Embedded Ethernet
Out-of-Order OpenRISC 2 semesters project Semester A: Implementation of OpenRISC on XUPV5 board Final A Presentation By: Vova Menis-Lurie Sonia Gershkovich.
Emotion Engine A look at the microprocessor at the center of the PlayStation2 gaming console Charles Aldrich.
The 6713 DSP Starter Kit (DSK) is a low-cost platform which lets customers evaluate and develop applications for the Texas Instruments C67X DSP family.
RSC Williams MAPLD 2005/BOF-S1 A Linux-based Software Environment for the Reconfigurable Scalable Computing Project John A. Williams 1
© Copyright Xilinx 2004 All Rights Reserved 9 November, 2004 XUP Virtex-II Pro Development System.
Ross Brennan On the Introduction of Reconfigurable Hardware into Computer Architecture Education Ross Brennan
Prof. JunDong Cho VADA Lab. Project.
Out-of-Order OpenRISC 2 semesters project Semester A: Implementation of OpenRISC on XUPV5 board Midterm Presentation By: Vova Menis-Lurie Sonia Gershkovich.
ECE 526 – Network Processing Systems Design Network Processor Architecture and Scalability Chapter 13,14: D. E. Comer.
UNIX System Administration OS Kernal Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept Kernel or MicroKernel Concept: An OS architecture-design.
1 Hardware Support for Collective Memory Transfers in Stencil Computations George Michelogiannakis, John Shalf Computer Architecture Laboratory Lawrence.
Automated Design of Custom Architecture Tulika Mitra
Research on Reconfigurable Computing Using Impulse C Carmen Li Shen Mentor: Dr. Russell Duren February 1, 2008.
Caltech CS184 Spring DeHon 1 CS184b: Computer Architecture (Abstractions and Optimizations) Day 25: May 27, 2005 Transactional Computing.
Somervill RSC 1 125/MAPLD'05 Reconfigurable Processing Module (RPM) Kevin Somervill 1 Dr. Robert Hodson 1
COMPUTER ORGANIZATIONS CSNB123. COMPUTER ORGANIZATIONS CSNB123 Why do you need to study computer organization and architecture? Computer science and IT.
Copyright © 2007 Heathkit Company, Inc. All Rights Reserved PC Fundamentals Presentation 30 – PC Architecture.
Mr. Daniel Perkins Battelle Memorial Institute Mr. Rob Riley Air Force Research Laboratory Gateware Munitions Interface Processor (GMIP)
ATtiny23131 A SEMINAR ON AVR MICROCONTROLLER ATtiny2313.
An Architecture and Prototype Implementation for TCP/IP Hardware Support Mirko Benz Dresden University of Technology, Germany TERENA 2001.
Hardware Benchmark Results for An Ultra-High Performance Architecture for Embedded Defense Signal and Image Processing Applications September 29, 2004.
Simics: A Full System Simulation Platform Synopsis by Jen Miller 19 March 2004.
Motherboard A motherboard allows all the parts of your computer to receive power and communicate with one another.
Chapter 1 — Computer Abstractions and Technology — 1 Below Your Program Application software – Written in high-level language System software – Compiler:
Reconfigurable Computing: HPC Network Aspects Mitch Sukalski (8961) David Thompson (8963) Craig Ulmer (8963) Pete Dean R&D Seminar December.
Proposal for an Open Source Flash Failure Analysis Platform (FLAP) By Michael Tomer, Cory Shirts, SzeHsiang Harper, Jake Johns
Total Embedded Solutions SubSystems. Total Embedded Solutions SubSystems Agenda SSD Background Embedded Products –MIPS based processor products –I/O cards.
Somervill RSC 1 125/MAPLD'05 Reconfigurable Processing Module (RPM) Kevin Somervill 1 Dr. Robert Hodson 1
NetTM FPGA-based processors Martin Labrecque Connections 2010.
WorldScape Defense Company, L.L.C. Company Proprietary Slide 1 An Ultra-High Performance Scalable Processing Architecture for HPC and Embedded Applications.
The Standford Hydra CMP  Lance Hammond  Benedict A. Hubbert  Michael Siu  Manohar K. Prabhu  Michael Chen  Kunle Olukotun Presented by Jason Davis.
Raw Status Update Chips & Fabrics James Psota M.I.T. Computer Architecture Workshop 9/19/03.
CSC 360- Instructor: K. Wu Review of Computer Organization.
Real Time Systems Lab. 1 AV1394 Board : Now and Future Real Time Systems Lab. Myung Jin (Daniel), Lee.
Hardware Architecture
1 Scaling Soft Processor Systems Martin Labrecque Peter Yiannacouras and Gregory Steffan University of Toronto FCCM 4/14/2008.
Operating System Overview
Programmable Hardware: Hardware or Software?
Morgan Kaufmann Publishers
Erno DAVID, Tivadar KISS Wigner Research Center for Physics (HU)
THE PROCESS OF EMBEDDED SYSTEM DEVELOPMENT
Emu: Rapid FPGA Prototyping of Network Services in C#
Dynamically Reconfigurable Architectures: An Overview
A High Performance SoC: PkunityTM
Chip&Core Architecture
Command and Data Handling
Presentation transcript:

A Flexible Architecture for Simulation and Testing (FAST) Multiprocessor Systems John D. Davis, Lance Hammond, Kunle Olukotun Computer Systems Lab Stanford University

2 Glance Custom PCB design 4-way tightly-coupled multiprocessor Real R3000/R3010 cores, CPU/FPU Flexible memory simulation hierarchy, including L1 caches

3 FAST Capabilities Real Applications –Including OS & FP-intensive code Multiprocessor systems up to 4 processors Variety of Memory Structures –L1 and/or L2 caches, FIFO’s, DRAM Emulation of on-chip and off-chip memory latencies Expandable interface –Multiple PCBs, other digital interfaces

4 FAST Simulation Space Thread Level Speculation –Stanford’s HYDRA CMP Transactional Memory –Stanford’s TCC Large-Scale Networked CMP –Stanford’s Smart Memories Multithreaded HW Embedded SOC Architectures All at multi-MHz speeds

5 FAST Processor Tiles 4 MIPS-based Processor Tiles CPU and FPU –External Cache and Coprocessor interface Local Memory Controller –Cache or other memory structures Coprocessor –Additional Functionality –ISA Extensions L1 Local Memory –SRAMs: 256K X 36-bit 1 MB Local Mem MIPS R3000 CPU MIPS R3010 FPU XCV1000 Local Mem Controller XCV1000 Coprocessor

6 MP FAST Read/Write Controller: –Hub of the multi-core system –Handles the interprocessor interaction Shared Memory Controller: –Interfaces the RWC with the L2 memory SRAM array and the expansion connector –Expansion connector can connect multiple PCBs, main memory daughter card, or other digital interfaces.

7 FAST Support HW Host communication –Daughter card with embedded processor and Ethernet controller CPLD –FPGA Programming –Clock Management –PCB Management Flash Memory –Store FPGA bitstreams –Onboard SW, like Board OS

8 FAST Conclusions Flexible hardware emulation platform for tightly-coupled multiprocessor systems Emulate a variety of memory systems Full system or partial system simulation Expandable simulation fabric Integer and Floating-point applications

9 BACKUP

10 Complete FAST

11 FAST HW at a Glance 4 MIPS-based Processor Cores: –CPU and 25MHz – MHz –L1 Local Memory, each: 256K X XC2V6000 L2 Shared Memory 200 MHz –16M X 36-bit L2 Shared 200 MHz XC2V6000 Read/Write 200 MHz All on PCB 3.3V Three main clock domains: 25MHz, 100MHz, 200MHz Four voltage domains: 1.5V, 2.5V, 3.3V, 5.0V 128Mb Flash Memory to store FPGA configurations and PMON OS CPLD for additional control, FPGA programming, etc. RCM3200 Microcontroller with Embedded Ethernet Port for off- PCB Communication

12 FAST PCB Details 23 Layer Board 4200 nets vias 43 BGA packages 2500 testpoints 4300 parts

13 FAST Status PCB in fabrication and assembly phase Need to partition Verilog for initial Speculative Thread design Building Morphware infrastructure for Rapid prototyping