1 Lecture 24: Interconnection Networks Topics: communication latency, centralized and decentralized switches (Sections 8.1 – 8.5)

Slides:



Advertisements
Similar presentations
Ch. 12 Routing in Switched Networks
Advertisements

1 Lecture 22: I/O, Disk Systems Todays topics: I/O overview Disk basics RAID Reminder: Assignment 8 due Tue 11/21.
Ch. 12 Routing in Switched Networks Routing in Packet Switched Networks Routing Algorithm Requirements –Correctness –Simplicity –Robustness--the.
Lecture 36: Chapter 6 Today’s topic –RAID 1. RAID Redundant Array of Inexpensive (Independent) Disks –Use multiple smaller disks (c.f. one large disk)
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Emery Berger University of Massachusetts Amherst Operating Systems CMPSCI 377 Lecture.
What's inside a router? We have yet to consider the switching function of a router - the actual transfer of datagrams from a router's incoming links to.
1 Lecture 12: Interconnection Networks Topics: dimension/arity, routing, deadlock, flow control.
IP: The Internet Protocol
1 Lecture 23: Interconnection Networks Topics: communication latency, centralized and decentralized switches (Appendix E)
Shalini Bhavanam. Key words: Basic Definitions Classification of Networks Types of networks Network Topologies Network Models.
1 Chapter 9 Computer Networks. 2 Chapter Topics OSI network layers Network Topology Media access control Addressing and routing Network hardware Network.
1 CSE 591-S04 (lect 14) Interconnection Networks (notes by Ken Ryu of Arizona State) l Measure –How quickly it can deliver how much of what’s needed to.
Internetworking Fundamentals (Lecture #4) Andres Rengifo Copyright 2008.
NUMA Mult. CSE 471 Aut 011 Interconnection Networks for Multiprocessors Buses have limitations for scalability: –Physical (number of devices that can be.
Communication operations Efficient Parallel Algorithms COMP308.
1 Lecture 25: Interconnection Networks, Disks Topics: flow control, router microarchitecture, RAID.
1 Lecture 24: Interconnection Networks Topics: topologies, routing, deadlocks, flow control.
1 Lecture 25: Interconnection Networks Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix E) Review session,
ECE669 L16: Interconnection Topology March 30, 2004 ECE 669 Parallel Computer Architecture Lecture 16 Interconnection Topology.
Connecting LANs, Backbone Networks, and Virtual LANs
Switching, routing, and flow control in interconnection networks.
WAN Technologies & Topologies Lecture 8 October 4, 2000.
Interconnect Network Topologies
1 Lecture 23: Interconnection Networks Topics: Router microarchitecture, topologies Final exam next Tuesday: same rules as the first midterm Next semester:
LECTURE 9 CT1303 LAN. LAN DEVICES Network: Nodes: Service units: PC Interface processing Modules: it doesn’t generate data, but just it process it and.
Interconnect Networks
ICOM 6115©Manuel Rodriguez-Martinez ICOM 6115 – Computer Networks and the WWW Manuel Rodriguez-Martinez, Ph.D. Lecture 17.
Network Topologies Topology – how nodes are connected – where there is a wire between 2 nodes. Routing – the path a message takes to get from one node.
Introduction to Interconnection Networks. Introduction to Interconnection network Digital systems(DS) are pervasive in modern society. Digital computers.
High-Performance Networks for Dataflow Architectures Pravin Bhat Andrew Putnam.
1 Interconnects Shared address space and message passing computers can be constructed by connecting processors and memory unit using a variety of interconnection.
Dynamic Interconnect Lecture 5. COEN Multistage Network--Omega Network Motivation: simulate crossbar network but with fewer links Components: –N.
1 Next Few Classes Networking basics Protection & Security.
CPS 356: Introduction to Computer Networks Lecture 7: Switching technologies Ch 2.8.2, 3.1, 3.4 Xiaowei Yang
DEVICES AND COMMUNICATION BUSES FOR DEVICES NETWORK
Computer Networks Performance Metrics. Performance Metrics Outline Generic Performance Metrics Network performance Measures Components of Hop and End-to-End.
Anshul Kumar, CSE IITD CSL718 : Multiprocessors Interconnection Mechanisms Performance Models 20 th April, 2006.
COP 4930 Computer Network Projects Summer C 2004 Prof. Roy B. Levow Lecture 3.
Anshul Kumar, CSE IITD ECE729 : Advanced Computer Architecture Lecture 27, 28: Interconnection Mechanisms In Multiprocessors 29 th, 31 st March, 2010.
Lecture (Mar 23, 2000) H/W Assignment 3 posted on Web –Due Tuesday March 28, 2000 Review of Data packets LANS WANS.
Networks and Distributed Systems Mark Stanovich Operating Systems COP 4610.
Network Technologies Definitions –Network: physical connection that allows two computers to communicate –Packet: a unit of transfer »A sequence of bits.
TELE202 Lecture 6 Routing in WAN 1 Lecturer Dr Z. Huang Overview ¥Last Lecture »Packet switching in Wide Area Networks »Source: chapter 10 ¥This Lecture.
McGraw-Hill©The McGraw-Hill Companies, Inc., 2004 Connecting Devices CORPORATE INSTITUTE OF SCIENCE & TECHNOLOGY, BHOPAL Department of Electronics and.
Teknik Routing Pertemuan 10 Matakuliah: H0524/Jaringan Komputer Tahun: 2009.
Interconnect Networks Basics. Generic parallel/distributed system architecture On-chip interconnects (manycore processor) Off-chip interconnects (clusters.
Super computers Parallel Processing
Networks and Distributed Systems Sarah Diesburg Operating Systems COP 4610.
1 Lecture 23: Storage Systems Topics: disk access, bus design, evaluation metrics, RAID (Sections )
1 Lecture 24: Interconnection Networks Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix F)
Spring EE 437 Lillevik 437s06-l22 University of Portland School of Engineering Advanced Computer Architecture Lecture 22 Distributed computer Interconnection.
Computer Communication and Networking Lecture # 4 by Zainab Malik 1.
Effective bandwidth with link pipelining Pipeline the flight and transmission of packets over the links Overlap the sending overhead with the transport.
Univ. of TehranIntroduction to Computer Network1 An Introduction to Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
Univ. of TehranIntroduction to Computer Network1 An Introduction to Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
Day 13 Intro to MANs and WANs. MANs Cover a larger distance than LANs –Typically multiple buildings, office park Usually in the shape of a ring –Typically.
COMP8330/7330/7336 Advanced Parallel and Distributed Computing Tree-Based Networks Cache Coherence Dr. Xiao Qin Auburn University
1 Lecture 22: Interconnection Networks Topics: Routing, deadlock, flow control, virtual channels.
Youngstown State University Cisco Regional Academy
Lecture 23: Interconnection Networks
Interconnection Network Design Lecture 14
Lecture: Interconnection Networks
Lecture: Networks Topics: TM wrap-up, networks.
Lecture: Interconnection Networks
Delivery, Forwarding, and Routing of IP Packets
CS 6290 Many-core & Interconnect
A simple network connecting two machines
Switching, routing, and flow control in interconnection networks
Topology What are the consequences of using fewer than n2 connections?
Presentation transcript:

1 Lecture 24: Interconnection Networks Topics: communication latency, centralized and decentralized switches (Sections 8.1 – 8.5)

2 RAID Summary RAID 1-5 can tolerate a single fault – mirroring (RAID 1) has a 100% overhead, while parity (RAID 3, 4, 5) has modest overhead Can tolerate multiple faults by having multiple check functions – each additional check can cost an additional disk (RAID 6) RAID 6 and RAID 2 (memory-style ECC) are not commercially employed

3 I/O Performance Throughput (bandwidth) and response times (latency) are the key performance metrics for I/O The description of the hardware characterizes maximum throughput and average response time (usually with no queueing delays) The description of the workload characterizes the “real” throughput – corresponding to this throughput is an average response time

4 Throughput Vs. Response Time As load increases, throughput increases (as utilization is high) – simultaneously, response times also go up as the probability of having to wait for the service goes up: trade-off between throughput and response time In systems involving human interaction, there are three relevant delays: data entry time, system response time, and think time – studies have shown that improvements in response time result in improvements in think time  better response time and much better throughput Most benchmark suites try to determine throughput while placing a restriction on response times

5 Estimating Response Time Queueing theory provides results that can characterize some random processes Little’s Law: Mean number of tasks in system = Arrival rate x mean response time The following two results are true for workloads with interarrival times that follow a Poisson distribution P(k tasks arrive in time interval t) = e -a a k / k! Time queue = Time server x server utilization/(1-server utilization) Length queue = server utilization 2 / (1 – server utilization)

6 Interconnect Types Number of autonomous systems connected Bus SAN LAN WAN

7 Communication Applications typically send messages through the OS Steps to send a message:  copy data into an OS buffer  OS attaches header and trailer  OS sends transfer command to network interface hw Steps to receive a message:  copy data from network interface hw into OS buffer  make sure data is correct, send acknowledgment  copy data into user space and signal the application If sender does not see an ack in time, it resends the data

8 Latency Sender overhead: the processing delays to send the application data to the network interface hardware Time of flight: latency for the first bit to travel from sender to receiver: function of distance and speed of light Transmission time: time between arrival of first and last bit: function of message size and network bandwidth Transport latency: sum of time of flight and transmission time Receiver overhead: processing delays to send data from hw to application: usually longer than sender overhead

9 Topologies Internet topologies are not very regular – they grew incrementally Supercomputers have regular interconnect topologies and trade off cost for high bandwidth Nodes can be connected with  centralized switch: all nodes have input and output wires going to a centralized chip that internally handles all routing  decentralized switch: each node is connected to a switch that routes data to one of a few neighbors

10 Centralized Crossbar Switch P1 P2 P3 P4 P5 P6 P7 P0 Crossbar switch

11 Centralized Crossbar Switch P1 P2 P3 P4 P5 P6 P7 P0

12 Crossbar Properties Assuming each node has one input and one output, a crossbar can provide maximum bandwidth: N messages can be sent as long as there are N unique sources and N unique destinations Maximum overhead: WN 2 internal switches, where W is data width and N is number of nodes To reduce overhead, use smaller switches as building blocks – trade off overhead for lower effective bandwidth

13 Switch with Omega Network P1 P2 P3 P4 P5 P6 P7 P

14 Omega Network Properties The switch complexity is now O(N log N) Contention increases: P0  P5 and P1  P7 cannot happen concurrently (this was possible in a crossbar) To deal with contention, can increase the number of levels (redundant paths) – by mirroring the network, we can route from P0 to P5 via N intermediate nodes, while increasing complexity by a factor of 2

15 Tree Network Complexity is O(N) Can yield low latencies when communicating with neighbors Can build a fat tree by having multiple incoming and outgoing links P0P3P2P1P4P7P6P5

16 Bisection Bandwidth Split N nodes into two groups of N/2 nodes such that the bandwidth between these two groups is minimum: that is the bisection bandwidth Why is it relevant: if traffic is completely random, the probability of a message going across the two halves is ½ – if all nodes send a message, the bisection bandwidth will have to be N/2 The concept of bisection bandwidth confirms that the tree network is not suited for random traffic patterns, but for localized traffic patterns

17 Distributed Switches: Ring Each node is connected to a 3x3 switch that routes messages between the node and its two neighbors Effectively a repeated bus: multiple messages in transit Disadvantage: bisection bandwidth of 2 and N/2 hops on average

18 Distributed Switch Options Performance can be increased by throwing more hardware at the problem: fully-connected switches: every switch is connected to every other switch: N 2 wiring complexity, N 2 /4 bisection bandwidth Most commercial designs adopt a point between the two extremes (ring and fully-connected):  Grid: each node connects with its N, E, W, S neighbors  Torus: connections wrap around  Hypercube: links between nodes whose binary names differ in a single bit

19 Topology Examples Grid Hypercube Torus CriteriaBusRing2Dtorus6-cubeFully connected Performance Bisection bandwidth Cost Ports/switch Total links

20 Title Bullet