Soc 5.1 Chapter 5 Interconnect Computer System Design System-on-Chip by M. Flynn & W. Luk Pub. Wiley 2011 (copyright 2011)

Slides:



Advertisements
Similar presentations
1 Dynamic Interconnection Networks Buses CEG 4131 Computer Architecture III Miodrag Bolic.
Advertisements

3D Graphics Content Over OCP Martti Venell Sr. Verification Engineer Bitboys.
Presenter : Cheng-Ta Wu Kenichiro Anjo, Member, IEEE, Atsushi Okamura, and Masato Motomura IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 39,NO. 5, MAY 2004.
CSCI 465 D ata Communications and Networks Lecture 20 Martin van Bommel CSCI 465 Data Communications & Networks 1.
PradeepKumar S K Asst. Professor Dept. of ECE, KIT, TIPTUR. PradeepKumar S K, Asst.
Classification of Distributed Systems Properties of Distributed Systems n motivation: advantages of distributed systems n classification l architecture.
1 Evgeny Bolotin – ClubNet Nov 2003 Network on Chip (NoC) Evgeny Bolotin Supervisors: Israel Cidon, Ran Ginosar and Avinoam Kolodny ClubNet - November.
1 Lecture 24: Interconnection Networks Topics: communication latency, centralized and decentralized switches (Sections 8.1 – 8.5)
Review on Networking Technologies Linda Wu (CMPT )
Network-on-Chip: Communication Synthesis Department of Computer Science Texas A&M University.
Performance and Power Efficient On-Chip Communication Using Adaptive Virtual Point-to-Point Connections M. Modarressi, H. Sarbazi-Azad, and A. Tavakkol.
Module I Overview of Computer Architecture and Organization.
Computer Architecture Lecture 08 Fasih ur Rehman.
Chapter 13: WAN Technologies and Routing 1. LAN vs. WAN 2. Packet switch 3. Forming a WAN 4. Addressing in WAN 5. Routing in WAN 6. Modeling WAN using.
High Performance Embedded Computing © 2007 Elsevier Lecture 16: Interconnection Networks Embedded Computing Systems Mikko Lipasti, adapted from M. Schulte.
Networks-on-Chip. Seminar contents  The Premises  Homogenous and Heterogeneous Systems- on-Chip and their interconnection networks  The Network-on-Chip.
LAN Overview (part 2) CSE 3213 Fall April 2017.
Chapter 2 The Infrastructure. Copyright © 2003, Addison Wesley Understand the structure & elements As a business student, it is important that you understand.
Communication issues for NOC By Farhadur Arifin. Objective: Future system of NOC will have strong requirment on reusability and communication performance.
On-Chip Networks and Testing
Introduction to Interconnection Networks. Introduction to Interconnection network Digital systems(DS) are pervasive in modern society. Digital computers.
ECE 526 – Network Processing Systems Design Network Processor Architecture and Scalability Chapter 13,14: D. E. Comer.
1 Interconnects Shared address space and message passing computers can be constructed by connecting processors and memory unit using a variety of interconnection.
High-Level Interconnect Architectures for FPGAs An investigation into network-based interconnect systems for existing and future FPGA architectures Nick.
Dynamic Interconnect Lecture 5. COEN Multistage Network--Omega Network Motivation: simulate crossbar network but with fewer links Components: –N.
1 Next Few Classes Networking basics Protection & Security.
DEVICES AND COMMUNICATION BUSES FOR DEVICES NETWORK
High-Level Interconnect Architectures for FPGAs Nick Barrow-Williams.
Course Wrap-Up Miodrag Bolic CEG4136. What was covered Interconnection network topologies and performance Shared-memory architectures Message passing.
1 Dynamic Interconnection Networks Miodrag Bolic.
© 2007 Xilinx, Inc. All Rights Reserved This material exempt per Department of Commerce license exception TSU Hardware Design INF3430 MicroBlaze 7.1.
TELE202 Lecture 5 Packet switching in WAN 1 Lecturer Dr Z. Huang Overview ¥Last Lectures »C programming »Source: ¥This Lecture »Packet switching in Wide.
1 Optical Burst Switching (OBS). 2 Optical Internet IP runs over an all-optical WDM layer –OXCs interconnected by fiber links –IP routers attached to.
Data and Computer Communications Circuit Switching and Packet Switching.
Anshul Kumar, CSE IITD CSL718 : Multiprocessors Interconnection Mechanisms Performance Models 20 th April, 2006.
Computer Science and Engineering Parallel and Distributed Processing CSE 8380 January Session 4.
Architectural and Physical Design Optimization for Efficient Intra-Tile Communication Liza Rodriguez Aurelio Morales EEL Embedded Systems Dept.
Network on Chip - Architectures and Design Methodology Natt Thepayasuwan Rohit Pai.
Computer Architecture System Interface Units Iolanthe II approaches Coromandel Harbour.
CS 8501 Networks-on-Chip (NoCs) Lukasz Szafaryn 15 FEB 10.
CSS 372 Oct 4th - Lecture 3 Chapter 3 – Connecting Computer Components with Buses Bus Structures Synchronous, Asynchronous Typical Bus Signals Two level,
COARSE GRAINED RECONFIGURABLE ARCHITECTURE FOR VARIABLE BLOCK SIZE MOTION ESTIMATION 03/26/
August 1, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 9: I/O Devices and Communication Buses * Jeremy R. Johnson Wednesday,
Anshul Kumar, CSE IITD ECE729 : Advanced Computer Architecture Lecture 27, 28: Interconnection Mechanisms In Multiprocessors 29 th, 31 st March, 2010.
EEC4113 Data Communication & Multimedia System Chapter 7: Network Layer by Muhazam Mustapha, October 2011.
Networks-on-Chip (NoC) Suleyman TOSUN Computer Engineering Deptartment Hacettepe University, Turkey.
Chapter 4 Memory Design: SOC and Board-Based Systems
Interconnect Networks Basics. Generic parallel/distributed system architecture On-chip interconnects (manycore processor) Off-chip interconnects (clusters.
Spring 2007W. Rhett DavisNC State UniversityECE 747Slide 1 ECE 747 Digital Signal Processing Architecture SoC Lecture – Working with Buses & Interconnects.
Mohamed Younis CMCS 411, Computer Architecture 1 CMCS Computer Architecture Lecture 26 Bus Interconnect May 7,
Univ. of TehranIntroduction to Computer Network1 An Introduction to Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
Univ. of TehranIntroduction to Computer Network1 An Introduction to Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
COMP8330/7330/7336 Advanced Parallel and Distributed Computing Tree-Based Networks Cache Coherence Dr. Xiao Qin Auburn University
EE 382 Processor DesignWinter 98/99Michael Flynn 1 EE382 Processor Design Winter 1998 Chapter 8 Lectures Multiprocessors Part II.
WAN Technologies. 2 Large Spans and Wide Area Networks MAN networks: Have not been commercially successful.
Lecture 13 Parallel Processing. 2 What is Parallel Computing? Traditionally software has been written for serial computation. Parallel computing is the.
Network Processing Systems Design
Network-on-Chip Paradigm Erman Doğan. OUTLINE SoC Communication Basics  Bus Architecture  Pros, Cons and Alternatives NoC  Why NoC?  Components 
Auburn University COMP8330/7330/7336 Advanced Parallel and Distributed Computing Interconnection Networks (Part 2) Dr.
Dynamic connection system
Lecture 23: Interconnection Networks
Azeddien M. Sllame, Amani Hasan Abdelkader
BIC 10503: COMPUTER ARCHITECTURE
William Stallings Data and Computer Communications
Israel Cidon, Ran Ginosar and Avinoam Kolodny
Networks-on-Chip.
Overview of Computer Architecture and Organization
Overview of Computer Architecture and Organization
William Stallings Computer Organization and Architecture
Multiprocessors and Multi-computers
Presentation transcript:

soc 5.1 Chapter 5 Interconnect Computer System Design System-on-Chip by M. Flynn & W. Luk Pub. Wiley 2011 (copyright 2011)

soc 5.2 SOC interconnect design approach

soc 5.3 Interconnect design find the cost and performance of alternatives iterated to find the least expensive design that meets the requirements consider the larger issues: reliability, scalability, design costs, availability of IP

soc 5.4 SOC module with interconnect

soc 5.5 Many alternatives find requirements: number of nodes, performance requirements, marginal and development cost bus based: purchased IP or proprietary NOC based: static vs dynamic

soc 5.6 AMBA bus based system

soc 5.7 Bus terminology protocol master / slave; agents on the bus arbitration / arbitrator :assigns bus ownership bridge: communications between protocols physical configuration: wires, bidirectional synchronization: clock management bus wrapper: manages multiple protocols

soc 5.8 MUX connects 3 masters, 4 slaves

soc 5.9 Simple AHB transfer

soc 5.10 Core connect SOC

soc 5.11 PLB transfer protocol

soc 5.12 Bus types and ideal performance (a) Simple Bus (b) Bus with arbitration support (c) Tenured split bus: 4 bytes wide (d) Tenured split bus: 16 bytes wide bus transmission time: 1 cycle

soc 5.13 OCP and bus wrappers

soc 5.14 Sonics microNetwork

soc 5.15 Hardware gates for write buffer Performance of buffer; burst mode

soc 5.16 Analyzing bus performance find offered occupancy (  ) for each source (master); find the number of sources (n) –note a complex superscalar can have multiple sources as I, D caches can prefetch independently does the source immediately resubmit a request if it is denied? find achieved occupancy (  a ) overall the system’s performance is reduced by (  a /  )

soc 5.17 Without resubmissions Prob(processor does not access bus) = 1 –  Prob(n processors do not access bus) = (1 –  n Prob(bus is busy) = 1 – (1 –  n = bus bandwidth = bus B(  n) Bw = Bus B(  n) / T bw achieved bandwidth per processor  a is n  a = B(  n)  a = B(  n) / n

soc 5.18 Resubmissions: iterate to find  a let offered occupancy be a; initially set  a find new a =  / (  a  n  a =1-(1-a) n

soc 5.19 SOC interconnect switches (NOC) nodes are the units to be connected links are the connections –width, w bits –cycle time, T ch, determines bandwidth –they can be uni or bidirectional message consists of Header –target node address H and payload l –transmission: T ch (H/w + l /w) –h=H/w usually assumed to be 1 links can be –static: links between nodes fixed –dynamic: links vary, as in crossbar

soc 5.20 Static: nodes, links and fanout

soc 5.21 Static (k,d) networks networks with –k nodes per dimension –d dimensions (k,d) total nodes, N = k d –in hypercube k=2 most (k,d) have end around closure –fanout = 2d (k>2) diameter –(max internode distance with closure) =dk/2

soc 5.22 Static network

soc 5.23 Examples of static networks

soc 5.24 Static network analysis for a static (k,n) network –let k d be average number of network hops for message to transit a single dimension –for bidirectional network with closure k d = k/4, k even time to transmit message without contention T c –T c = n x k d + (l/w) in network cycles –for h = 1

soc 5.25 Dynamic network

soc 5.26 Switch based interconnect

soc 5.27 Dynamic, indirect network

soc 5.28 Crossbar 2x2, kxk

soc 5.29 Dynamic, Indirect Networks switches are separate from the nodes - centralized as a MIN (Multistage Interconnection Network) a switch - k x k crossbar with no storage an N-node (1 channel/node) network - has (N/k)w switches per stage. min. number of stages to connect N to N - [log k N]

soc 5.30 Baseline dynamic network address selects output

soc 5.31 Xfabric (direct network w 2D grid)

soc 5.32 Xfabric Junction

soc 5.33 Format of Nexus burst

soc 5.34 NOC layer architecture

soc 5.35 Typical layered NOC

soc 5.36 NOC layered architecture physical layer –how packets are transmitted over physical wires transport layer –packet routing transaction layer –NIU provides service to the IP each layer transparent to the other

soc 5.37 NOC layered advantages layers can be independently optimized scalable better Quality of Service control –more optimization points of control flexible throughput –can reallocate physical layer resources as required multiple clock domain operation

soc 5.38 Transaction, transport and physical layers of an NOC

soc 5.39 PivotPoint Architecture, 3x3 crossbar

soc 5.40 Dynamic vs Static Section 5.9 assumes –h=1 (header sent in 1 cycle) –wormhole routing (message can begin to leave node after h=1 cycles) spreadsheet can be used to compare configurations

soc 5.41 Message and header

soc 5.42 Bus pros (+) and cons (-) Every unit attached adds parasitic capacitance (-) Bus timing is difficult in deep sub-micron process (-) Bus testability is problematic and slow (-) Bus arbiter delay grows with the number of masters. The arbiter is also instance-specific (-) Bandwidth is limited and shared by all units attached (-) Bus latency is zero once arbiter has granted control (+) The silicon cost of a bus is low for small systems (+) Any bus is directly compatible with most IPs, including software running on CPUs (+) The concepts are simple and well understood (+)

soc 5.43 NOC pros (+) and cons (-) Only point-to-point one-way wires are used for all network sizes (+) Network wires can be pipelined because the network protocol is globally asynchronous (+) Dedicated BIST is fast and complete (+) Routing decisions are distributed and the same router is reinstanciated, for all network sizes (+) Aggregated bandwidth scales with the network size (+) Internal network contention causes a small latency (-) Network uses significant silicon area (-) Software needs clean synchronization in multiprocessor systems (-) System designers need re- education for new concepts (-)

soc 5.44 Summary SOC interconnect design –find the cost and performance of alternatives common choices include –buses, e.g. AMBA, CoreConnect –Network-on-Chip NOC, static/dynamic networks iterated to find the least expensive design that meets the requirements consider the larger issues: reliability, scalability, design costs, availability of IP