CS184a: Computer Architecture (Structure and Organization)

Slides:



Advertisements
Similar presentations
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day6: October 11, 2000 Instruction Taxonomy VLSI Scaling.
Advertisements

Balancing Interconnect and Computation in a Reconfigurable Array Dr. André DeHon BRASS Project University of California at Berkeley Why you don’t really.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day17: November 20, 2000 Time Multiplexing.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 15: March 12, 2007 Interconnect 3: Richness.
CS294-6 Reconfigurable Computing Day 8 September 17, 1998 Interconnect Requirements.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day8: October 18, 2000 Computing Elements 1: LUTs.
DeHon March 2001 Rent’s Rule Based Switching Requirements Prof. André DeHon California Institute of Technology.
CS294-6 Reconfigurable Computing Day 10 September 24, 1998 Interconnect Richness.
Lecture 3: Field Programmable Gate Arrays II September 10, 2013 ECE 636 Reconfigurable Computing Lecture 3 Field Programmable Gate Arrays II.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 13: February 26, 2007 Interconnect 1: Requirements.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 11: February 14, 2007 Compute 1: LUTs.
CS294-6 Reconfigurable Computing Day 2 August 27, 1998 FPGA Introduction.
CS294-6 Reconfigurable Computing Day 12 October 1, 1998 Interconnect Population.
CS294-6 Reconfigurable Computing Day 14 October 7/8, 1998 Computing with Lookup Tables.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 18: March 21, 2007 Interconnect 6: MoT.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 16: March 14, 2007 Interconnect 4: Switching.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 14: February 28, 2007 Interconnect 2: Wiring Requirements and Implications.
HARP: Hard-Wired Routing Pattern FPGAs Cristinel Ababei , Satish Sivaswamy ,Gang Wang , Kia Bazargan , Ryan Kastner , Eli Bozorgzadeh   ECE Dept.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 13: February 4, 2005 Interconnect 1: Requirements.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 15: February 12, 2003 Interconnect 5: Meshes.
ESE Spring DeHon 1 ESE534: Computer Organization Day 19: April 7, 2014 Interconnect 5: Meshes.
CALTECH CS137 Winter DeHon 1 CS137: Electronic Design Automation Day 2: January 6, 2006 Spatial Routing.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day11: October 30, 2000 Interconnect Requirements.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 18: February 18, 2005 Interconnect 6: MoT.
Caltech CS184b Winter DeHon 1 CS184b: Computer Architecture [Single Threaded Architecture: abstractions, quantification, and optimizations] Day9:
CprE / ComS 583 Reconfigurable Computing Prof. Joseph Zambreno Department of Electrical and Computer Engineering Iowa State University Lecture #4 – FPGA.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 14: February 10, 2003 Interconnect 4: Switching.
Topics Architecture of FPGA: Logic elements. Interconnect. Pins.
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 13: February 20, 2002 Routing 1.
CBSSS 2002: DeHon Interconnect André DeHon Thursday, June 20, 2002.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 16: February 14, 2003 Interconnect 6: MoT.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 18: April 2, 2014 Interconnect 4: Switching.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 16: February 14, 2005 Interconnect 4: Switching.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 13: February 6, 2003 Interconnect 3: Richness.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 8: January 27, 2003 Empirical Cost Comparisons.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 6: January 22, 2003 VLSI Scaling.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 11: January 31, 2005 Compute 1: LUTs.
ESE Spring DeHon 1 ESE534: Computer Organization Day 18: March 26, 2012 Interconnect 5: Meshes (and MoT)
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day14: November 10, 2000 Switching.
ESE Spring DeHon 1 ESE534: Computer Organization Day 20: April 9, 2014 Interconnect 6: Direct Drive, MoT.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 12: February 5, 2003 Interconnect 2: Wiring Requirements.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 17: March 29, 2010 Interconnect 2: Wiring Requirements and Implications.
CALTECH CS137 Winter DeHon 1 CS137: Electronic Design Automation Day 8: January 27, 2006 Cellular Placement.
CALTECH CS137 Fall DeHon 1 CS137: Electronic Design Automation Day 21: November 28, 2005 Routing 1.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 10: January 28, 2005 Empirical Comparisons.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day12: November 1, 2000 Interconnect Requirements and Richness.
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
ESE532: System-on-a-Chip Architecture
ESE534: Computer Organization
CS184a: Computer Architecture (Structure and Organization)
CS137: Electronic Design Automation
ESE532: System-on-a-Chip Architecture
CS184a: Computer Architecture (Structure and Organization)
CS137: Electronic Design Automation
CS137: Electronic Design Automation
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
On-time Network On-chip
CS184a: Computer Architecture (Structures and Organization)
CS184a: Computer Architecture (Structures and Organization)
ESE534: Computer Organization
Give qualifications of instructors: DAP
CprE / ComS 583 Reconfigurable Computing
Reconfigurable Computing (EN2911X, Fall07)
CS137: Electronic Design Automation
Presentation transcript:

CS184a: Computer Architecture (Structure and Organization) Day 17: February 15, 2005 Interconnect 5: Meshes Caltech CS184 Winter2005 -- DeHon

Previous Saw we needed to exploit locality/structure in interconnect Saw a mesh might be useful Question: how does w grow? Saw Rent’s Rule as a way to characterize structure Caltech CS184 Winter2005 -- DeHon

Today Mesh: Channel width bounds Linear population Switch requirements Routability Segmentation Clusters Commercial Caltech CS184 Winter2005 -- DeHon

Mesh Caltech CS184 Winter2005 -- DeHon

Mesh Channels Lower Bound on w? Bisection Bandwidth BW  Np N0.5 channels in bisection Caltech CS184 Winter2005 -- DeHon

Straight-forward Switching Requirements Switching Delay? Total Switches? Caltech CS184 Winter2005 -- DeHon

Switch Delay Switching Delay: 2 (Nsubarray) worst case: Nsubarray = N Caltech CS184 Winter2005 -- DeHon

Total Switches Switches per switchbox: 4 3ww / 2 = 6w2 Bidirectional switches (NW same as WN) double count Caltech CS184 Winter2005 -- DeHon

Total Switches Switches per switchbox: Switches into network: 4 3ww / 2 = 6w2 Switches into network: (K+1) w Switches per PE: 6w2 +(K+1) w w = cNp-0.5 Total  N2p-1 Total Switches: N*(Sw/PE)  N2p Caltech CS184 Winter2005 -- DeHon

Routability? Asking if you can route in a given channel width is: NP-complete Caltech CS184 Winter2005 -- DeHon

Traditional Mesh Population Switchbox contains only a linear number of switches in channel width Caltech CS184 Winter2005 -- DeHon

Linear Mesh Switchbox Each entering channel connect to: One channel on each remaining side (3) 4 sides W wires Bidirectional switches (NW same as WN) double count 34W/2=6W switches vs. 6w2 for full population Caltech CS184 Winter2005 -- DeHon

Total Switches Switches per switchbox: Switches into network: (K+1) w Switches per PE: 6w +(K+1) w w = cNp-0.5 Total  Np-0.5 Total Switches: N*(Sw/PE)  Np+0.5 > N Caltech CS184 Winter2005 -- DeHon

Total Switches  Np+0.5 N < Np+0.5 < N2p Total Switches Switches grow faster than nodes Wires grow faster than switches Caltech CS184 Winter2005 -- DeHon

Checking Constants Wire pitch = 8l switch area = 2500 l2 wire area: (8w)2 switch area: 62500 w crossover w=234 ? (practice smaller) Caltech CS184 Winter2005 -- DeHon

Checking Constants: Full Population Wire pitch = 8l switch area = 2500 l2 wire area: (8w)2 switch area: 62500 w2 effective wire pitch: 120 l ~15 times pitch Caltech CS184 Winter2005 -- DeHon

Practical Just showed: Can afford to not use some wires perfectly would take 15 Mapping Ratio for linear population to take same area as full population (once crossover to wire dominated) Can afford to not use some wires perfectly to reduce switches Caltech CS184 Winter2005 -- DeHon

Diamond Switch Typical switchbox pattern: Used by Xilinx Many less switches, but cannot guarantee will be able to use all the wires may need more wires than implied by Rent, since cannot use all wires this was already true…now more so Caltech CS184 Winter2005 -- DeHon

Universal SwitchBox Same number of switches as diamond Locally: can guarantee to satisfy any set of requests request = direction through swbox as long as meet channel capacities and order on all channels irrelevant can satisfy Not a global property no guarantees between swboxes Caltech CS184 Winter2005 -- DeHon

Diamond vs. Universal? Universal routes strictly more configurations Caltech CS184 Winter2005 -- DeHon

Inter-Switchbox Constraints Channels connect switchboxes For valid route, must satisfy all adjacent switchboxes Caltech CS184 Winter2005 -- DeHon

Mapping Ratio? How bad is it? How much wider do channels have to be? detail channel width required / global ch width Caltech CS184 Winter2005 -- DeHon

Mapping Ratio Empirical: Theory/provable: Seems plausible, constant in practice Theory/provable: There is no Constant Mapping Ratio At least detail/global can be arbitrarily large! Caltech CS184 Winter2005 -- DeHon

Domain Structure Once enter network (choose color) can only switch within domain Caltech CS184 Winter2005 -- DeHon

Detail Routing as Coloring Caltech CS184 Winter2005 -- DeHon

Detail Routing as Coloring Global Route channel width = 2 Detail Route channel width = N Can make arbitrarily large difference Caltech CS184 Winter2005 -- DeHon

Detail Routing as Coloring Caltech CS184 Winter2005 -- DeHon

Routability Domain Routing is NP-Complete can reduce coloring problem to domain selection i.e. map adjacent nodes to same channel Previous example shows basic shape (another reason routers are slow) Caltech CS184 Winter2005 -- DeHon

Routing Lack of detail/global mapping ratio Says detail can be arbitrarily worse than global Say global not necessarily predict detail Argument against decomposing mesh routing into global phase and detail phase Modern FPGA routers do not Caltech CS184 Winter2005 -- DeHon

Segmentation To improve speed (decrease delay) Allow wires to bypass switchboxes Maybe save switches? Certainly cost more wire tracks Caltech CS184 Winter2005 -- DeHon

Buffered Delay Lseg  104 sq. 10 segments: Day 13 Buffered Delay Chip: 7mm side, 70nm sq. (45nm process) 105 squares across chip Lseg  104 sq. 10 segments: Each of delay 2 Tgate Tcross = 2030ps = 600ps Compare: 4ns Caltech CS184 Winter2005 -- DeHon

Delay through Switching Day 13 Delay through Switching 0.6 mm CMOS How far in GHz clock cycle? http://www.cs.caltech.edu/~andre/courses/CS294S97/notes/day14/day14.html Caltech CS184 Winter2005 -- DeHon

Segmentation Segment of Length Lseg 6 switches per switchbox visited Only enters a switchbox every Lseg SW/sbox/track of length Lseg = 6/Lseg Caltech CS184 Winter2005 -- DeHon

Segmentation Reduces switches on path N/Lseg May get fragmentation Another cause of unusable wires Caltech CS184 Winter2005 -- DeHon

Segmentation: Corner Turn Option Can you corner turn in the middle of a segment? If can, need one more switch SW/sbox/track = 5/Lseg + 1 Caltech CS184 Winter2005 -- DeHon

VPR Segment 4 Pix Caltech CS184 Winter2005 -- DeHon

VPR Segment 4 Route Caltech CS184 Winter2005 -- DeHon

C-Box Depopulation Not necessary for every input to connect to every channel Saw last time: K(N-K+1) switches Maybe use less? Caltech CS184 Winter2005 -- DeHon

IO Population Toronto Model IOs spread over 4 sides Fc fraction of tracks which an input connects to IOs spread over 4 sides Maybe show up on multiple Shown here: 2 Caltech CS184 Winter2005 -- DeHon

IO Population Caltech CS184 Winter2005 -- DeHon

Leaves Not LUTs Recall cascaded LUTs Often group collection of LUTs into a Logic Block Caltech CS184 Winter2005 -- DeHon

Logic Block [Betz+Rose/IEEE D&T 1998] Caltech CS184 Winter2005 -- DeHon

Cluster Size [Betz+Rose/IEEE D&T 1998] Caltech CS184 Winter2005 -- DeHon

Inputs Required per Cluster Should it be linear? [Betz+Rose/IEEE D&T 1998] Caltech CS184 Winter2005 -- DeHon

Review: Mesh Design Parameters Cluster Size Internal organization LB IO (Fc, sides) Switchbox Population and Topology Segment length distribution Switch rebuffering Caltech CS184 Winter2005 -- DeHon

Commercial Parts Caltech CS184 Winter2005 -- DeHon

XC4K Interconnect Caltech CS184 Winter2005 -- DeHon

XC4K Interconnect Details Caltech CS184 Winter2005 -- DeHon

Virtex II Caltech CS184 Winter2005 -- DeHon

Virtex II Interconnect Resources Caltech CS184 Winter2005 -- DeHon

Big Ideas [MSB Ideas] Mesh natural 2D topology Channels grow as W(Np-0.5) Wiring grows as W(N2p ) Linear Population: Switches grow as W(Np+0.5) Worse than shown for hierarchical Unbounded globaldetail mapping ratio Detail routing NP-complete Caltech CS184 Winter2005 -- DeHon

Big Ideas [MSB-1 Ideas] Segmented/bypass routes can reduce switching delay costs more wires (fragmentation of wires) Caltech CS184 Winter2005 -- DeHon