Presentation is loading. Please wait.

Presentation is loading. Please wait.

Soc 5.1 Chapter 5 Interconnect Computer System Design System-on-Chip by M. Flynn & W. Luk Pub. Wiley 2011 (copyright 2011)

Similar presentations


Presentation on theme: "Soc 5.1 Chapter 5 Interconnect Computer System Design System-on-Chip by M. Flynn & W. Luk Pub. Wiley 2011 (copyright 2011)"— Presentation transcript:

1 soc 5.1 Chapter 5 Interconnect Computer System Design System-on-Chip by M. Flynn & W. Luk Pub. Wiley 2011 (copyright 2011)

2 soc 5.2 SOC interconnect design approach

3 soc 5.3 Interconnect design find the cost and performance of alternatives iterated to find the least expensive design that meets the requirements consider the larger issues: reliability, scalability, design costs, availability of IP

4 soc 5.4 SOC module with interconnect

5 soc 5.5 Many alternatives find requirements: number of nodes, performance requirements, marginal and development cost bus based: purchased IP or proprietary NOC based: static vs dynamic

6 soc 5.6 AMBA bus based system

7 soc 5.7 Bus terminology protocol master / slave; agents on the bus arbitration / arbitrator :assigns bus ownership bridge: communications between protocols physical configuration: wires, bidirectional synchronization: clock management bus wrapper: manages multiple protocols

8 soc 5.8 MUX connects 3 masters, 4 slaves

9 soc 5.9 Simple AHB transfer

10 soc 5.10 Core connect SOC

11 soc 5.11 PLB transfer protocol

12 soc 5.12 Bus types and ideal performance (a) Simple Bus (b) Bus with arbitration support (c) Tenured split bus: 4 bytes wide (d) Tenured split bus: 16 bytes wide bus transmission time: 1 cycle

13 soc 5.13 OCP and bus wrappers

14 soc 5.14 Sonics microNetwork

15 soc 5.15 Hardware gates for write buffer Performance of buffer; burst mode

16 soc 5.16 Analyzing bus performance find offered occupancy (  ) for each source (master); find the number of sources (n) –note a complex superscalar can have multiple sources as I, D caches can prefetch independently does the source immediately resubmit a request if it is denied? find achieved occupancy (  a ) overall the system’s performance is reduced by (  a /  )

17 soc 5.17 Without resubmissions Prob(processor does not access bus) = 1 –  Prob(n processors do not access bus) = (1 –  n Prob(bus is busy) = 1 – (1 –  n = bus bandwidth = bus B(  n) Bw = Bus B(  n) / T bw achieved bandwidth per processor  a is n  a = B(  n)  a = B(  n) / n

18 soc 5.18 Resubmissions: iterate to find  a let offered occupancy be a; initially set  a find new a =  / (  a  n  a =1-(1-a) n

19 soc 5.19 SOC interconnect switches (NOC) nodes are the units to be connected links are the connections –width, w bits –cycle time, T ch, determines bandwidth –they can be uni or bidirectional message consists of Header –target node address H and payload l –transmission: T ch (H/w + l /w) –h=H/w usually assumed to be 1 links can be –static: links between nodes fixed –dynamic: links vary, as in crossbar

20 soc 5.20 Static: nodes, links and fanout

21 soc 5.21 Static (k,d) networks networks with –k nodes per dimension –d dimensions (k,d) total nodes, N = k d –in hypercube k=2 most (k,d) have end around closure –fanout = 2d (k>2) diameter –(max internode distance with closure) =dk/2

22 soc 5.22 Static network

23 soc 5.23 Examples of static networks

24 soc 5.24 Static network analysis for a static (k,n) network –let k d be average number of network hops for message to transit a single dimension –for bidirectional network with closure k d = k/4, k even time to transmit message without contention T c –T c = n x k d + (l/w) in network cycles –for h = 1

25 soc 5.25 Dynamic network

26 soc 5.26 Switch based interconnect

27 soc 5.27 Dynamic, indirect network

28 soc 5.28 Crossbar 2x2, kxk

29 soc 5.29 Dynamic, Indirect Networks switches are separate from the nodes - centralized as a MIN (Multistage Interconnection Network) a switch - k x k crossbar with no storage an N-node (1 channel/node) network - has (N/k)w switches per stage. min. number of stages to connect N to N - [log k N]

30 soc 5.30 Baseline dynamic network address selects output

31 soc 5.31 Xfabric (direct network w 2D grid)

32 soc 5.32 Xfabric Junction

33 soc 5.33 Format of Nexus burst

34 soc 5.34 NOC layer architecture

35 soc 5.35 Typical layered NOC

36 soc 5.36 NOC layered architecture physical layer –how packets are transmitted over physical wires transport layer –packet routing transaction layer –NIU provides service to the IP each layer transparent to the other

37 soc 5.37 NOC layered advantages layers can be independently optimized scalable better Quality of Service control –more optimization points of control flexible throughput –can reallocate physical layer resources as required multiple clock domain operation

38 soc 5.38 Transaction, transport and physical layers of an NOC

39 soc 5.39 PivotPoint Architecture, 3x3 crossbar

40 soc 5.40 Dynamic vs Static Section 5.9 assumes –h=1 (header sent in 1 cycle) –wormhole routing (message can begin to leave node after h=1 cycles) spreadsheet can be used to compare configurations

41 soc 5.41 Message and header

42 soc 5.42 Bus pros (+) and cons (-) Every unit attached adds parasitic capacitance (-) Bus timing is difficult in deep sub-micron process (-) Bus testability is problematic and slow (-) Bus arbiter delay grows with the number of masters. The arbiter is also instance-specific (-) Bandwidth is limited and shared by all units attached (-) Bus latency is zero once arbiter has granted control (+) The silicon cost of a bus is low for small systems (+) Any bus is directly compatible with most IPs, including software running on CPUs (+) The concepts are simple and well understood (+)

43 soc 5.43 NOC pros (+) and cons (-) Only point-to-point one-way wires are used for all network sizes (+) Network wires can be pipelined because the network protocol is globally asynchronous (+) Dedicated BIST is fast and complete (+) Routing decisions are distributed and the same router is reinstanciated, for all network sizes (+) Aggregated bandwidth scales with the network size (+) Internal network contention causes a small latency (-) Network uses significant silicon area (-) Software needs clean synchronization in multiprocessor systems (-) System designers need re- education for new concepts (-)

44 soc 5.44 Summary SOC interconnect design –find the cost and performance of alternatives common choices include –buses, e.g. AMBA, CoreConnect –Network-on-Chip NOC, static/dynamic networks iterated to find the least expensive design that meets the requirements consider the larger issues: reliability, scalability, design costs, availability of IP


Download ppt "Soc 5.1 Chapter 5 Interconnect Computer System Design System-on-Chip by M. Flynn & W. Luk Pub. Wiley 2011 (copyright 2011)"

Similar presentations


Ads by Google