Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 The Turn Model for Adaptive Routing. 2 Summary Introduction to Direct Networks. Deadlocks in Wormhole Routing. System Model. Partially Adaptive Routing.

Similar presentations


Presentation on theme: "1 The Turn Model for Adaptive Routing. 2 Summary Introduction to Direct Networks. Deadlocks in Wormhole Routing. System Model. Partially Adaptive Routing."— Presentation transcript:

1 1 The Turn Model for Adaptive Routing

2 2 Summary Introduction to Direct Networks. Deadlocks in Wormhole Routing. System Model. Partially Adaptive Routing Algorithms in 2D Meshes. 2D Meshes. p-cube routing in Hypercube p-cube routing in Hypercube Analysis. Analysis. Conclusion and Future Work. Conclusion and Future Work.

3 3 What is Direct Network? Each node has a point-to-point, or direct, connection to some number of other nodes. Offers massive parallelism and scalability. Popular architecture for constructing massively parallel computers. Communicates by passing messages

4 4 Generic Architecture

5 5 Evaluation of Direct Network Communication latency: Start-up latency + Network latency + Blocking time Start-up latency + Network latency + Blocking time Start-up latency: Start-up latency: It is the time to handle the packet at source & destination nodes. Network latency: Network latency: It is the time between a packet leaving the source and arriving at the destination. the destination. Blocking time Blocking time All possible delays encountered during the life-time of packet. Communication latency depends on the type of switching technique → Wormhole routing

6 6 Switching Technique Circuit Switching Virtual Cut Through Packet Switching Wormhole Switching Switching Technique Virtual Cut Through

7 04/21/10 Packet Routing There are two basic approaches to routing packets, based on what a switch does with a packet as its flits begin to arrive. 1)Store-and-forward 2)Cut-through Virtual cut-through Virtual cut-through Wormhole Wormhole

8 8 Wormhole Switching A packet is divided into a number of flits The header flit of a packet governs the route The remaining flits following in a pipeline fashion fashion Thus a message resembles worm borrowing through the network through the network

9 9 Transmission Packet is transmitted from a node as flit( smallest unit on which flow control can be performed). Two kinds of Flits Header flit and Data flit. Header Flit tries to get the another channel while the data flits are transmitted through the already obtained channels. A channel is released only when the last flit of the message is passed though it. Flits of two messages cannot be interleaved.

10 10 Wormhole Routing Based on cut-through concept Relatively independent of path length Relatively independent of path lengthOvercome: Store-and-forward and Virtual cut-through ’s buffer size problem Store-and-forward and Virtual cut-through ’s buffer size problem Circuit switching ’s channel reservation and release problem Circuit switching ’s channel reservation and release problem Two positive effect: The absence of network contention makes the network latency relatively insensitive to path length The absence of network contention makes the network latency relatively insensitive to path length Large packet buffers at each intermediate node are obviated Large packet buffers at each intermediate node are obviated

11 11 Flit Wormhole Routing Advantages: reduced buffering Flit (routing info) Flit Low latency Wormhole Packet:

12 12 Characteristics of Routing Algorithms A good routing algorithm: Reduce the network latency Reduce the network latency Easily implemented in hardware Easily implemented in hardware High network throughput High network throughput

13 13 Deadlock Packets are allowed to hold a resource (channel) while waiting for another resource Deadlock free routing algorithm

14 14 Deadlocks in Wormhole Switching Header Flit contains all the routing information that is required to move data flits across the network. If the header flit cannot move any further then there will a congestion which causes a chained block in the network which leads to Deadlocks.

15 15 The Turn Model Basis: Analyze directions in which packets can turn in the network Analyze directions in which packets can turn in the network Determine the cycles that such turns can form Determine the cycles that such turns can form Prohibit just enough turns to break all cycle Prohibit just enough turns to break all cycle Resulting routing algorithms are: Deadlock and livelock free Deadlock and livelock free Minimal/Non-minimal Minimal/Non-minimal Highly Adaptive. Highly Adaptive.

16 16 The Turn model (contd..) Deadlock free Livelock free livelock occurs when the routing of a packet never lead livelock occurs when the routing of a packet never lead it to its destination. it to its destination.Adaptive from A to B determines the path based on the Network load. from A to B determines the path based on the Network load.Minimal restricts packets to shortest paths. restricts packets to shortest paths.Non-minimal Although minimal routing may initially sound more promising, non-minimal routing provides more choices. Although minimal routing may initially sound more promising, non-minimal routing provides more choices.

17 17 The Turn Model (contd…) Classify channels according to the direction in which they route packets. Identify the turns that occur between one direction and another, omitting 0-degree and 180-degree turns. Identify the simple cycles these turns can form. Prohibit one turn in each cycle. In the case of k-ary n-cubes, incorporate as many turns as possible that involve wraparound channels. Add 180-degree and 0-degree turns if there are multiple channels in the same direction.

18 18 The Turn Model (contd.) Simple Illustration: - The possible turns and simple cycles in a two-dimensional mesh. -The four turns allowed by the xy routing algorithm. -Six turns that complete the cycles and allows deadlock.

19 19 Turn Model (contd.) West-First Routing Algorithm Prohibited turns are the two to the west. two to the west. Route a packet first west if necessary, and then adaptively south, east, and north. Both minimal and non-minimal paths are shown. [Dally and Seitz] proof show that a routing algorithm is deadlock free if the channels in the interconnection network can be numbered so that the algorithm routes every packet along channels with strictly decreasing numbers.

20 20 S D Deadlock Free Routing West First Algorithm

21 21 S D West First Algorithm Deadlock Free Routing

22 22 D S Deadlock Free Routing West First Algorithm

23 23 Turn Model (contd.) North-Last Routing Algorithm

24 24 D S Deadlock Free Routing North Last Algorithm

25 25 Turn Model (contd.) Negative-First Routing Algorithm If the packet in +ve direction it will never turn –ve. it will never turn –ve. +y -x -y +x

26 26 Turn Model (contd.) n – Dimensional meshes Prohibit n(n-1) 90 degree turns to prevent deadlock. Prohibit n(n-1) 90 degree turns to prevent deadlock. One half of all possible 180 degree turns must be prohibited One half of all possible 180 degree turns must be prohibited Resulting algorithms: Resulting algorithms:All-but-one-negative-firstAll-but-one-positive-lastNegative-first k-ary n-cubes Allows to use the wraparound channels Assigns the wraparound channels a number. Negative first algorithm Classify wraparound channel according to the direction in which it routes packets Apply the algorithm

27 27 p-cube Routing in Hypercubes Hypercubea are special case of both n-dimensional meshes and k-ary n- cube. S =the binary address of the source node for a packet C=binary address of the node the header flits currently occupy D=binary address of the destination node. Two Phase: Phase 1: route packets along a dimension i for which ci=1 and di=0 in minimal Phase 2:when no such dimension then, route packet along i such that ci=0 and di=1 in minimal.

28 28 The minimal p-cube …. Example : 10-cube S=1011010100 D=0010111001 h0=3 and h1=3 so h=6 36 possible shortest paths. Phase 1: C=1011010100 = C9 C8 C7 C6 C5 C4 C3 C2 C1 C0 C=1011010100 ;( ci=1 and di=0) ‘D=1101000110 ^ ------------------------------ R = 1001000100 ( 3 choices) dimension taken d2 C= 1011010000 ‘D= 1101000110 ------------------------------ R= 1001000000 (2 Choices) dim. taken d9 C= 0011010000 ‘D=1101000110 ------------------------------ R= 0001000000 (1 choice) dim d6 C= 0010010000 ‘D=1101000110 ----------------------- R= 000000000

29 29 The minimal p-cube …. Example : 10-cube S=1011010100 D=0010111001 h0=3 and h1=3 so h=6 36 possible shortest paths. Phase 2: C= 0010010000 D=0010111001 ‘C=1101101111 D=0010111001 ------------------------ R=0000101001 Choices = 3 Dim=d5 C=0010110000 ( 0 to 1) ‘C=1101001111 D=0010111001 ------------------------ R= 0000001001 choices= 2 Dim=d0 C= 0010110001 ‘C= 1101001110 D= 0010111001 --------------------------- R= 0000001000 Choice=1 Dim =d3 C=0010111001 = D

30 30 The nonminimal p-cube algo. It is desired, because of its increased adaptive ness The first phase can route the packet along any dimension i for which ci=1 and di=1. p : last hop was in the +ve direction. +ve direction. Expect more choices

31 31 The nonminimal p-cube algo (0,0,0) (1,0,0) (0,0,1) (0,1,1) (1,1,1) (1,0,1) (1,1,0) The meaning of the algorithm's steps are with Dr. Pfeiffer’s help (example): 2- if last hop from +ve. Take the +ve only. If S=011, D=111 One choice 011→ 111 3- else if last hop was –ve and if destination not any more –ve go –ve or +ve. S=011, D=111 Choice 1 : 011→ 111 Choice 2: 011 → 001 Choice 3: 011 → 010 4- else going -ve (0,1,0)

32 32 Simulation Experiments Comparison of the partially adaptive with a non adaptive routing algorithms Simulation of a 16x16 mesh and a binary 8-cube for three different traffic patterns Each of these contains 256 nodes Bandwidth is equal to 20 flits/microsec For uniform traffic in the mesh and the hypercube the nonadaptive routing algorithms have lower latencies at high throughputs than the partially adaptive algorithms

33 33 Simulation experiments (contd.) At low throughputs,the algorithms performs about the same. For the non-uniform traffic patterns the partially adaptive routing algorithms have lower latencies at high throughputs For the negative-first algo. The degree of adaptive ness higher than the other. According to the hamming distance or if we measure S( algo.)/S( fully adaptive).

34 34 Conclusion The turn model produces deadlock free, livelock free, minimal or non-minimal and maximally adaptive algorithms. These algorithms perform better than nonadaptive algorithms for nonuniform patterns of message traffic.

35 35 A Peek into the Future To investigate the effects of different input and output selection policies on network performance. To illustrate the application of the turn model to networks that include extra physical or virtual channel. To apply the turn model to other topologies, such as, hexagonal, octagonal and cube connected cycle networks. Identification of realistic workload distributions.

36 36 References The Turn Model for adaptive routing Wormhole Routing in Parallel Computers A processor architecture for multiprocessing multiprocessor


Download ppt "1 The Turn Model for Adaptive Routing. 2 Summary Introduction to Direct Networks. Deadlocks in Wormhole Routing. System Model. Partially Adaptive Routing."

Similar presentations


Ads by Google