R OUTE P ACKETS, N OT W IRES : O N -C HIP I NTERCONNECTION N ETWORKS Veronica Eyo Sharvari Joshi.

Slides:



Advertisements
Similar presentations
A Novel 3D Layer-Multiplexed On-Chip Network
Advertisements

1 Lecture 17: On-Chip Networks Today: background wrap-up and innovations.
NETWORK ON CHIP ROUTER Students : Itzik Ben - shushan Jonathan Silber Instructor : Isaschar Walter Final presentation part A Winter 2006.
1 Lecture 12: Interconnection Networks Topics: dimension/arity, routing, deadlock, flow control.
1 Lecture 23: Interconnection Networks Paper: Express Virtual Channels: Towards the Ideal Interconnection Fabric, ISCA’07, Princeton.
IP I/O Memory Hard Disk Single Core IP I/O Memory Hard Disk IP Bus Multi-Core IP R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R Networks.
MINIMISING DYNAMIC POWER CONSUMPTION IN ON-CHIP NETWORKS Robert Mullins Computer Architecture Group Computer Laboratory University of Cambridge, UK.
Lei Wang, Yuho Jin, Hyungjun Kim and Eun Jung Kim
1 Lecture 13: Interconnection Networks Topics: flow control, router pipelines, case studies.
1 Lecture 24: Interconnection Networks Topics: topologies, routing, deadlocks, flow control Final exam reminders:  Plan well – attempt every question.
Chapter 10 Introduction to Wide Area Networks Data Communications and Computer Networks: A Business User’s Approach.
Network-on-Chip Examples System-on-Chip Group, CSE-IMM, DTU.
1 Lecture 24: Interconnection Networks Topics: communication latency, centralized and decentralized switches (Sections 8.1 – 8.5)
1 Evgeny Bolotin – ICECS 2004 Automatic Hardware-Efficient SoC Integration by QoS Network on Chip Electrical Engineering Department, Technion, Haifa, Israel.
Issues in System-Level Direct Networks Jason D. Bakos.
1 Lecture 24: Interconnection Networks Topics: topologies, routing, deadlocks, flow control.
Orion: A Power-Performance Simulator for Interconnection Networks Presented by: Ilya Tabakh RC Reading Group4/19/2006.
1 Lecture 25: Interconnection Networks Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix E) Review session,
Network-on-Chip: Communication Synthesis Department of Computer Science Texas A&M University.
Performance and Power Efficient On-Chip Communication Using Adaptive Virtual Point-to-Point Connections M. Modarressi, H. Sarbazi-Azad, and A. Tavakkol.
Gigabit Routing on a Software-exposed Tiled-Microprocessor
Blue Gene / C Cellular architecture 64-bit Cyclops64 chip: –500 Mhz –80 processors ( each has 2 thread units and a FP unit) Software –Cyclops64 exposes.
Communication Synthesis: Buses and Network-on-Chip (NOC) Dr. Eng. Amr T. Abdel-Hamid ELECT 1002 Spring 2008 System-On-a-Chip Design.
On-Chip Networks and Testing
SHAPES scalable Software Hardware Architecture Platform for Embedded Systems Hardware Architecture Atmel Roma, INFN Roma, ST Microelectronics Grenoble,
High-Performance Networks for Dataflow Architectures Pravin Bhat Andrew Putnam.
Three-Dimensional Layout of On-Chip Tree-Based Networks Hiroki Matsutani (Keio Univ, Japan) Michihiro Koibuchi (NII, Japan) D. Frank Hsu (Fordham Univ,
Elastic-Buffer Flow-Control for On-Chip Networks
High-Level Interconnect Architectures for FPGAs An investigation into network-based interconnect systems for existing and future FPGA architectures Nick.
DEVICES AND COMMUNICATION BUSES FOR DEVICES NETWORK
High-Level Interconnect Architectures for FPGAs Nick Barrow-Williams.
1 Dynamic Interconnection Networks Miodrag Bolic.
Chapter 6 – Connectivity Devices
Data and Computer Communications Circuit Switching and Packet Switching.
George Michelogiannakis William J. Dally Stanford University Router Designs for Elastic- Buffer On-Chip Networks.
A Lightweight Fault-Tolerant Mechanism for Network-on-Chip
Presenter: Min-Yu Lo 2015/10/19 Asit K. Mishra, N. Vijaykrishnan, Chita R. Das Computer Architecture (ISCA), th Annual International Symposium on.
Network-on-Chip Introduction Axel Jantsch / Ingo Sander
Field Programmable Gate Arrays (FPGAs) An Enabling Technology.
CS 8501 Networks-on-Chip (NoCs) Lukasz Szafaryn 15 FEB 10.
Computer Security Workshops Networking 101. Reasons To Know Networking In Regard to Computer Security To understand the flow of information on the Internet.
1 Lecture 15: Interconnection Routing Topics: deadlock, flow control.
Non-Minimal Routing Strategy for Application-Specific Networks-on-Chips Hiroki Matsutani Michihiro Koibuchi Yutaka Yamada Jouraku Akiya Hideharu Amano.
Lecture 13: Logic Emulation October 25, 2004 ECE 697F Reconfigurable Computing Lecture 13 Logic Emulation.
Anshul Kumar, CSE IITD ECE729 : Advanced Computer Architecture Lecture 27, 28: Interconnection Mechanisms In Multiprocessors 29 th, 31 st March, 2010.
10/03/2005: 1 Physical Synthesis of Latency Aware Low Power NoC Through Topology Exploration and Wire Style Optimization CK Cheng CSE Department UC San.
Field Programmable Port Extender (FPX) 1 Modular Design Techniques for the FPX.
Run-time Adaptive on-chip Communication Scheme 林孟諭 Dept. of Electrical Engineering National Cheng Kung University Tainan, Taiwan, R.O.C.
Performance, Cost, and Energy Evaluation of Fat H-Tree: A Cost-Efficient Tree-Based On-Chip Network Hiroki Matsutani (Keio Univ, JAPAN) Michihiro Koibuchi.
Networks-on-Chip (NoC) Suleyman TOSUN Computer Engineering Deptartment Hacettepe University, Turkey.
Interconnect Networks Basics. Generic parallel/distributed system architecture On-chip interconnects (manycore processor) Off-chip interconnects (clusters.
Team LDPC, SoC Lab. Graduate Institute of CSIE, NTU Implementing LDPC Decoding on Network-On-Chip T. Theocharides, G. Link, N. Vijaykrishnan, M. J. Irwin.
WAN Transmission Media
SCORES: A Scalable and Parametric Streams-Based Communication Architecture for Modular Reconfigurable Systems Abelardo Jara-Berrocal, Ann Gordon-Ross NSF.
Networks: Routing, Deadlock, Flow Control, Switch Design, Case Studies Alvin R. Lebeck CPS 220.
Virtual-Channel Flow Control William J. Dally
1 Lecture 24: Interconnection Networks Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix F)
Effective bandwidth with link pipelining Pipeline the flight and transmission of packets over the links Overlap the sending overhead with the transport.
Network On Chip Cache Coherency Final presentation – Part A Students: Zemer Tzach Kalifon Ethan Kalifon Ethan Instructor: Walter Isaschar Instructor: Walter.
1 Lecture 29: Interconnection Networks Papers: Express Virtual Channels: Towards the Ideal Interconnection Fabric, ISCA’07, Princeton Interconnect Design.
1 Lecture 22: Interconnection Networks Topics: Routing, deadlock, flow control, virtual channels.
Runtime Reconfigurable Network-on- chips for FPGA-based systems Mugdha Puranik Department of Electrical and Computer Engineering
Network-on-Chip Paradigm Erman Doğan. OUTLINE SoC Communication Basics  Bus Architecture  Pros, Cons and Alternatives NoC  Why NoC?  Components 
The network-on-chip protocol
Lecture 23: Interconnection Networks
Switching and High-Speed Networks
Azeddien M. Sllame, Amani Hasan Abdelkader
Israel Cidon, Ran Ginosar and Avinoam Kolodny
Overview of Computer Architecture and Organization
Lecture 25: Interconnection Networks
Presentation transcript:

R OUTE P ACKETS, N OT W IRES : O N -C HIP I NTERCONNECTION N ETWORKS Veronica Eyo Sharvari Joshi

O N - CHIP INTERCONNECT NETWORK / N O C The layered-stack approach to the design of the on-chip intercore communications is called the Network-on-Chip (NOC) methodology New approach to design the communication subsystem of a System-on-Chip. Asynchronous clocking Networking theories and systematic networking methods

T HIS IS WHAT A TYPICAL LAYOUT LOOKS LIKE

P ROPOSAL Replace design-specific global on-chip wiring with a general-purpose on-chip interconnection network. Connect top level modules by dedicated wires connect them to a network that routes packet between them.

A SIMPLE ON - CHIP NETWORK 12mm x 12mm chip in 0.1mm CMOS technology with an 0.5mm minimum wire pitch network uses a 2- dimensional folded torus topology with the nodes 0-3 in each row cyclically connected in the order 0,2,3,1

I NPUT PORT D ATAGRAM INTERFACE TO EACH TILE Start of new packet (head) Continuation of a packet (body) End of packet (tail) Idle cycle (idle) Logarithmically encodes the size of the data in the data field from 0 (1 bit) to 8 (256 bits) Specifies which of eight virtual channels this packet may be routed on A source route that specifies two bits for each hop (left, right, straight, or extract). A signal from the network back to the client indicating that the network is ready to accept the next flit on each virtual channel TYPE 2 Bits SIZE 4 Bits VIRTUAL CHANNEL 8 Bits ROUTE 16 Bits READY 8 Bits

O THER RESOURCES Registers: Used to reserve resources for particular virtual channels Provide time-slot reservations for certain classes of traffic Pre-scheduling provides guaranteed, predictable performance for latency-critical or jitter-critical applications

H IGHER LEVEL PROTOCOL ON S IMPLE INTERFACE Services that can be provided to network clients: Memory read/write service A flow-controlled data stream A logical wire to the client Provide a translation from a destination node to a route The logic to implement many higher-level services on top of the simple network will be made readily available so it won’t have to be independently redesigned with each module

R OUTER A RCHITECTURE Is needed at each tile Consists of 5 input and output controllers

P ROS AND C ONS Network takes up upto 6.6% of the total tile area The area of the router is dominated by buffer space The total buffer requirement is about 10^4 bits along each edge of the tile Enables the use of fault tolerant wiring and protocols A spare bit can be provided on each network link and in each network buffer Network handles both pre-scheduled and dynamic traffic Static traffic must share the network with dynamic traffic

T OPOLOGIES TO UTILIZE THE ABUNDANT WIRING RESOURCES Method 1: A wide (almost 300-bit) flit is sent broadside across router channels to use the maximum possible number of pins Method 2: A folded torus topology can be employed Has twice the wire demand and twice the bisection bandwidth of a mesh network Choice of a topology depends on many factors

F LOW CONTROL METHODS TO REDUCE B UFFER COUNT Buffer space in an on-chip router directly impacts the area overhead of the network Example network uses conventional virtual channel flow control and uses a large amount of buffer space Trade-off between buffer storage requirement and performance

N EED TO EXPLOIT STRUCTURED WIRING Well controlled electrical parameters of wiring enable the use of high-performance circuits Circuits can be used to boost the bandwidth of individual wires Circuits can ease the overhead of buffers

P REDICTABLE ELECTRICAL PARAMETERS ENABLE HIGH PERFORMANCE CIRCUITS Electrical characterization Potential late stage impact Parasitic capacitance Conservative circuits needed to drive these wires Synthesis tools oversize the drivers Long wires require repeaters at periodic intervals Structures the wiring Defined precisely at the beginning Paths are optimized for signal integrity Structured wiring can be isolated from intra-tile wiring Enable the use of aggressive circuits Unstructured Wires On –chip interconnection networks

F ACILITATING REUSE WITH A UNIVERSAL INTERFACE Compatibility and inter-opearbility Extending reuse to network components On-chip network is reusable Can dedicate more resources to design, validate and tune the network Flip side- fixing the die size will lead to wastage of on-chip area On-chip networks improve the duty factor of wires Shares the wires across many signals

C ONCLUSION Advantages of structure, performance and modularity Well controlled electrical parameters in turn enable the use of high-performance circuits Enhances modularity by providing standard interface C HALLENGES Topologies must balance power efficiency with wire utilization New flow control methods are required Network interface support a wide variety of protocols and data widths

Q UESTIONS ?