System Architecture A Reconfigurable and Programmable Gigabit Network Interface Card Jeff Shafer, Hyong-Youb Kim, Paul Willmann, Dr. Scott Rixner Rice.

Slides:



Advertisements
Similar presentations
Nios Multi Processor Ethernet Embedded Platform Final Presentation
Advertisements

Hao wang and Jyh-Charn (Steve) Liu
StreamBlade SOE TM Initial StreamBlade TM Stream Offload Engine (SOE) Single Board Computer SOE-4-PCI Rev 1.2.
Khaled A. Al-Utaibi  Computers are Every Where  What is Computer Engineering?  Design Levels  Computer Engineering Fields  What.
Copyright© 2000 OPNET Technologies, Inc. R.W. Dobinson, S. Haas, K. Korcyl, M.J. LeVine, J. Lokier, B. Martin, C. Meirosu, F. Saka, K. Vella Testing and.
Configurable System-on-Chip: Xilinx EDK
1 Network Packet Generator Characterization presentation Supervisor: Mony Orbach Presenting: Eugeney Ryzhyk, Igor Brevdo.
Technion Digital Lab Project Xilinx ML310 board based on VirtexII-PRO programmable device Students: Tsimerman Igor Firdman Leonid Firdman Leonid.
Application of NetFPGA in Network Security Hao Chen 2/25/2011.
Reliable Data Storage using Reed Solomon Code Supervised by: Isaschar (Zigi) Walter Performed by: Ilan Rosenfeld, Moshe Karl Spring 2004 Midterm Presentation.
Detector Array Controller Based on First Light First Light PICNIC Array Mux PICNIC Array Mux Image of ESO Messenger Front Page M.Meyer June 05 NGC High.
Embedded Transport Acceleration Intel Xeon Processor as a Packet Processing Engine Abhishek Mitra Professor: Dr. Bhuyan.
XUP Virtex-5 Development System January XUP Virtex52 Introducing XUPV5-LX110T A powerful and versatile platform packaged and priced for Academia!
Ethernet Bomber Ethernet Packet Generator for network analysis Oren Novitzky & Rony Setter Advisor: Mony Orbach Spring 2008 – Winter 2009 Midterm Presentation.
Implementation of DSP Algorithm on SoC. Mid-Semester Presentation Student : Einat Tevel Supervisor : Isaschar Walter Accompaning engineer : Emilia Burlak.
An Efficient Programmable 10 Gigabit Ethernet Network Interface Card Paul Willmann, Hyong-youb Kim, Scott Rixner, and Vijay S. Pai.
Lecture 7 Lecture 7: Hardware/Software Systems on the XUP Board ECE 412: Microcomputer Laboratory.
Router Architectures An overview of router architectures.
Router Architectures An overview of router architectures.
Sven Ubik, Petr Žejdl CESNET TNC2008, Brugges, 19 May 2008 Passive monitoring of 10 Gb/s lines with PC hardware.
Virtualized FPGA accelerators in Cloud Computing Systems
Networking Virtualization Using FPGAs Russell Tessier, Deepak Unnikrishnan, Dong Yin, and Lixin Gao Reconfigurable Computing Group Department of Electrical.
Paper Review Building a Robust Software-based Router Using Network Processors.
© Copyright Xilinx 2004 All Rights Reserved 9 November, 2004 XUP Virtex-II Pro Development System.
1.  Project Goals.  Project System Overview.  System Architecture.  Data Flow.  System Inputs.  System Outputs.  Rates.  Real Time Performance.
Out-of-Order OpenRISC 2 semesters project Semester A: Implementation of OpenRISC on XUPV5 board Midterm Presentation By: Vova Menis-Lurie Sonia Gershkovich.
Network Server Performance and Scalability June 9, 2005 Scott Rixner Rice Computer Architecture Group
© 2004 Xilinx, Inc. All Rights Reserved EDK Overview.
Design and Characterization of TMD-MPI Ethernet Bridge Kevin Lam Professor Paul Chow.
Silicon Building Blocks for Blade Server Designs accelerate your Innovation.
SLAAC SV2 Briefing SLAAC Retreat, May 2001 Heber, UT Brian Schott USC Information Sciences Institute.
RiceNIC: A Reconfigurable and Programmable Gigabit Network Interface Card Jeff Shafer, Dr. Scott Rixner Rice Computer Architecture:
Cluster Computers. Introduction Cluster computing –Standard PCs or workstations connected by a fast network –Good price/performance ratio –Exploit existing.
GBT Interface Card for a Linux Computer Carson Teale 1.
LECC2003 AmsterdamMatthias Müller A RobIn Prototype for a PCI-Bus based Atlas Readout-System B. Gorini, M. Joos, J. Petersen (CERN, Geneva) A. Kugel, R.
Research on Reconfigurable Computing Using Impulse C Carmen Li Shen Mentor: Dr. Russell Duren February 1, 2008.
J. Christiansen, CERN - EP/MIC
Micro-Research Finland Oy Components for Integrating Device Controllers for Fast Orbit Feedback Jukka Pietarinen EPICS Collaboration Meeting Knoxville.
Jump to first page One-gigabit Router Oskar E. Bruening and Cemal Akcaba Advisor: Prof. Agarwal.
StreamBlade TM StreamBlade TM Applications Rev 1.2.
R&D for First Level Farm Hardware Processors Joachim Gläß Computer Engineering, University of Mannheim Contents –Overview of Processing Architecture –Requirements.
Increasing Web Server Throughput with Network Interface Data Caching October 9, 2002 Hyong-youb Kim, Vijay S. Pai, and Scott Rixner Rice Computer Architecture.
By V. Koutsoumpos, C. Kachris, K. Manolopoulos, A. Belias NESTOR Institute – ICS FORTH Presented by: Kostas Manolopoulos.
1 EDK 7.1 Tutorial -- SystemACE and EthernetMAC on Avnet Virtex II pro Development Boards Chia-Tien Dan Lo Department of Computer Science University of.
Part A Presentation Implementation of DSP Algorithm on SoC Student : Einat Tevel Supervisor : Isaschar Walter Accompanying engineer : Emilia Burlak The.
Mr. Daniel Perkins Battelle Memorial Institute Mr. Rob Riley Air Force Research Laboratory Gateware Munitions Interface Processor (GMIP)
An Architecture and Prototype Implementation for TCP/IP Hardware Support Mirko Benz Dresden University of Technology, Germany TERENA 2001.
Hot Interconnects TCP-Splitter: A Reconfigurable Hardware Based TCP/IP Flow Monitor David V. Schuehler
Lecture 12: Reconfigurable Systems II October 20, 2004 ECE 697F Reconfigurable Computing Lecture 12 Reconfigurable Systems II: Exploring Programmable Systems.
TCP Offload Through Connection Handoff Hyong-youb Kim and Scott Rixner Rice University April 20, 2006.
A Monte Carlo Simulation Accelerator using FPGA Devices Final Year project : LHW0304 Ng Kin Fung && Ng Kwok Tung Supervisor : Professor LEONG, Heng Wai.
A record and replay mechanism using programmable network interface cards Laurent Lefèvre INRIA / LIP (UMR CNRS, INRIA, ENS, UCB)
Sep. 17, 2002BESIII Review Meeting BESIII DAQ System BESIII Review Meeting IHEP · Beijing · China Sep , 2002.
Reconfigurable Computing: HPC Network Aspects Mitch Sukalski (8961) David Thompson (8963) Craig Ulmer (8963) Pete Dean R&D Seminar December.
Ethernet Bomber Ethernet Packet Generator for network analysis
Hardened IDS using IXP Didier Contis, Dr. Wenke Lee, Dr. David Schimmel Chris Clark, Jun Li, Chengai Lu, Weidong Shi, Ashley Thomas, Yi Zhang  Current.
NetTM FPGA-based processors Martin Labrecque Connections 2010.
Exploiting Task-level Concurrency in a Programmable Network Interface June 11, 2003 Hyong-youb Kim, Vijay S. Pai, and Scott Rixner Rice Computer Architecture.
Cluster Computers. Introduction Cluster computing –Standard PCs or workstations connected by a fast network –Good price/performance ratio –Exploit existing.
H-RORC HLT-Meeting CERN 02/06/05 Torsten Alt KIP Heidelberg.
17/02/06H-RORCKIP HeidelbergTorsten Alt The new H-RORC H-RORC.
KM3NeT Offshore Readout System On Chip A highly integrated system using FPGA COTS S. Anvar, H. Le Provost, F. Louis, B.Vallage – CEA Saclay IRFU – Amsterdam/NIKHEF,
System on a Programmable Chip (System on a Reprogrammable Chip)
Operating System Overview
Ming Liu, Wolfgang Kuehn, Zhonghai Lu, Axel Jantsch
CS 286 Computer Organization and Architecture
Spartan FPGAs مرتضي صاحب الزماني.
RECONFIGURABLE PROCESSING AND AVIONICS SYSTEMS
Emu: Rapid FPGA Prototyping of Network Services in C#
Cluster Computers.
Presentation transcript:

System Architecture A Reconfigurable and Programmable Gigabit Network Interface Card Jeff Shafer, Hyong-Youb Kim, Paul Willmann, Dr. Scott Rixner Rice Computer Architecture: Hardware Research  What NIC architectures scale to support future high-performance offloading at multi-gigabit data rates?  Can reconfiguring the FPGA hardware provide a substantial performance improvement for different applications? Is this improvement greater than simply upgrading the NIC firmware?  What NIC architectures scale to support future high-performance offloading at multi-gigabit data rates?  Can reconfiguring the FPGA hardware provide a substantial performance improvement for different applications? Is this improvement greater than simply upgrading the NIC firmware? AbstractAbstract  In this project, we are designing and implementing a gigabit NIC on a FPGA. This NIC will support research in network server operating systems and network architecture design  Why are existing NICs insufficient for this research?  Current software programmable NICs (e.g. Alteon Tigon) are too slow for modern complex offloading tasks  Current ASIC-based NICs have limited functionality and leave most tasks to the host processor. This architecture does not scale well to high data rates  Commercial products often use locked or proprietary IP, limiting our design flexibility  Why do we need a FPGA-based NIC?  FPGA provides maximum flexibility for future research  For architecture research, we need a flexible NIC with hardware acceleration. This NIC is both reconfigurable (runs frequent / demanding tasks in hardware) and programmable (runs infrequent / flexible tasks in software)  In this project, we are designing and implementing a gigabit NIC on a FPGA. This NIC will support research in network server operating systems and network architecture design  Why are existing NICs insufficient for this research?  Current software programmable NICs (e.g. Alteon Tigon) are too slow for modern complex offloading tasks  Current ASIC-based NICs have limited functionality and leave most tasks to the host processor. This architecture does not scale well to high data rates  Commercial products often use locked or proprietary IP, limiting our design flexibility  Why do we need a FPGA-based NIC?  FPGA provides maximum flexibility for future research  For architecture research, we need a flexible NIC with hardware acceleration. This NIC is both reconfigurable (runs frequent / demanding tasks in hardware) and programmable (runs infrequent / flexible tasks in software) Hardware Design  Virtex-II Pro FPGA  MHz PowerPC processors  MAC controller - Complete  DDR memory controller – Complete  128MB - 1GB DDR Memory  Serial Port Debug – Complete  DMA engine – Pending  Checksum Offloading - Pending  Spartan-IIE FPGA  64-bit/66-MHz PCI interface – Complete  2MB SRAM - Complete  Event notification - Complete  Virtex-II Pro FPGA  MHz PowerPC processors  MAC controller - Complete  DDR memory controller – Complete  128MB - 1GB DDR Memory  Serial Port Debug – Complete  DMA engine – Pending  Checksum Offloading - Pending  Spartan-IIE FPGA  64-bit/66-MHz PCI interface – Complete  2MB SRAM - Complete  Event notification - Complete Software Design  PowerPC processor runs custom packet-handling firmware  Interfaces with MAC controller to send/receive packets  Interfaces with DDR controller to store bulk frame data  Interfaces with DMA controller to transfer data to/from host  Interfaces with SRAM over Spartan bridge to access PCI control registers  Custom device drivers written for Linux and FreeBSD  PowerPC processor runs custom packet-handling firmware  Interfaces with MAC controller to send/receive packets  Interfaces with DDR controller to store bulk frame data  Interfaces with DMA controller to transfer data to/from host  Interfaces with SRAM over Spartan bridge to access PCI control registers  Custom device drivers written for Linux and FreeBSD Device Utilization  Substantial headroom for future development ComponentUtilization Slice Registers8,257/ 27,39230% 4 input LUTs (Logic)8,357/ 27,39230% BRAMs100 / 13673% Occupied Slices8,573 / 13,69662% Global Clocks13 / 1681% Digital Clock Managers5 / 862% Total Gate Count:6,890,605 OS Research Development Platform Serial Port RJ-45 Port DDR Virtex FPGA Spartan FPGA PCI InterfaceEthernet PHY PCI / FPGA Development Board PCI Bridge (Spartan FPGA) PCI Bus (64b / 66 MHz) SRAM (2MB) 32b / 66 MHz PROM 64b / 66 MHz RS-232 (Debug) Ethernet PHY (10/100/1000) Full Duplex Ethernet 2 GBits/sec 8b / 125 MHz PowerPC 300MHz DDR Controller BRAM Block Spartan Bridge PLB – 64b / 100 MHz BRAM Block On-Chip Mem MAC Control PLB / OPB Bridge UART (OPB Bus) DDR-SDRAM (256 MB) Serial Port Memory / Control Ethernet PCI Debug 32b / 125 MHz 32b / 66 MHz Custom FPGA Design Custom FPGA Design  Connection Handoff to NIC  Offload network stack processing to NIC  Reconfigurable NIC allows exploration of hardware / software mechanisms to accelerate offloading process  Thread Parallelization of Network Stack  Parallel network stacks achieve high network throughput but have synchronization challenges  Control-synchronous parallel stacks use the OS scheduler to ensure connections are always managed by the same thread, eliminating some (but not all) locks  Exploring ways to use NIC to eliminate device queue lock to enable simultaneous communication among multiple threads in parallel stacks  Connection Handoff to NIC  Offload network stack processing to NIC  Reconfigurable NIC allows exploration of hardware / software mechanisms to accelerate offloading process  Thread Parallelization of Network Stack  Parallel network stacks achieve high network throughput but have synchronization challenges  Control-synchronous parallel stacks use the OS scheduler to ensure connections are always managed by the same thread, eliminating some (but not all) locks  Exploring ways to use NIC to eliminate device queue lock to enable simultaneous communication among multiple threads in parallel stacks