Caltech CS184 Winter2003 -- DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 15: February 12, 2003 Interconnect 5: Meshes.

Slides:



Advertisements
Similar presentations
Cadence Design Systems, Inc. Why Interconnect Prediction Doesn’t Work.
Advertisements

Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day6: October 11, 2000 Instruction Taxonomy VLSI Scaling.
Balancing Interconnect and Computation in a Reconfigurable Array Dr. André DeHon BRASS Project University of California at Berkeley Why you don’t really.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day17: November 20, 2000 Time Multiplexing.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 15: March 12, 2007 Interconnect 3: Richness.
CS294-6 Reconfigurable Computing Day 8 September 17, 1998 Interconnect Requirements.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day10: October 25, 2000 Computing Elements 2: Cascades, ALUs,
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day8: October 18, 2000 Computing Elements 1: LUTs.
EDA (CS286.5b) Day 5 Partitioning: Intro + KLFM. Today Partitioning –why important –practical attack –variations and issues.
DeHon March 2001 Rent’s Rule Based Switching Requirements Prof. André DeHon California Institute of Technology.
CS294-6 Reconfigurable Computing Day 10 September 24, 1998 Interconnect Richness.
Lecture 3: Field Programmable Gate Arrays II September 10, 2013 ECE 636 Reconfigurable Computing Lecture 3 Field Programmable Gate Arrays II.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 13: February 26, 2007 Interconnect 1: Requirements.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 11: February 14, 2007 Compute 1: LUTs.
CS294-6 Reconfigurable Computing Day 2 August 27, 1998 FPGA Introduction.
CS294-6 Reconfigurable Computing Day 12 October 1, 1998 Interconnect Population.
CS294-6 Reconfigurable Computing Day 14 October 7/8, 1998 Computing with Lookup Tables.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 18: March 21, 2007 Interconnect 6: MoT.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 16: March 14, 2007 Interconnect 4: Switching.
Balancing Interconnect and Computation in a Reconfigurable Array Dr. André DeHon BRASS Project University of California at Berkeley Why you don’t really.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 14: February 28, 2007 Interconnect 2: Wiring Requirements and Implications.
HARP: Hard-Wired Routing Pattern FPGAs Cristinel Ababei , Satish Sivaswamy ,Gang Wang , Kia Bazargan , Ryan Kastner , Eli Bozorgzadeh   ECE Dept.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 13: February 4, 2005 Interconnect 1: Requirements.
ESE Spring DeHon 1 ESE534: Computer Organization Day 19: April 7, 2014 Interconnect 5: Meshes.
FPGA Switch Block Design Dr. Philip Brisk Department of Computer Science and Engineering University of California, Riverside CS 223.
Un/DoPack: Re-Clustering of Large System-on-Chip Designs with Interconnect Variation for Low-Cost FPGAs Marvin Tom* Xilinx Inc.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day11: October 30, 2000 Interconnect Requirements.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 18: February 18, 2005 Interconnect 6: MoT.
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 6: January 23, 2002 Partitioning (Intro, KLFM)
Caltech CS184b Winter DeHon 1 CS184b: Computer Architecture [Single Threaded Architecture: abstractions, quantification, and optimizations] Day9:
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 7: February 3, 2002 Retiming.
CprE / ComS 583 Reconfigurable Computing Prof. Joseph Zambreno Department of Electrical and Computer Engineering Iowa State University Lecture #4 – FPGA.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 14: February 10, 2003 Interconnect 4: Switching.
Topics Architecture of FPGA: Logic elements. Interconnect. Pins.
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 13: February 20, 2002 Routing 1.
CBSSS 2002: DeHon Interconnect André DeHon Thursday, June 20, 2002.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 16: February 14, 2003 Interconnect 6: MoT.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 18: April 2, 2014 Interconnect 4: Switching.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 16: February 14, 2005 Interconnect 4: Switching.
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 9: February 9, 2004 Partitioning (Intro, KLFM)
Directional and Single-Driver Wires in FPGA Interconnect Guy Lemieux Edmund LeeMarvin TomAnthony Yu Dept. of ECE, University of British Columbia Vancouver,
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 13: February 6, 2003 Interconnect 3: Richness.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 8: January 27, 2003 Empirical Cost Comparisons.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 6: January 22, 2003 VLSI Scaling.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 10: January 31, 2003 Compute 2:
1 WireMap FPGA Technology Mapping for Improved Routability Stephen Jang, Xilinx Inc. Billy Chan, Xilinx Inc. Kevin Chung, Xilinx Inc. Alan Mishchenko,
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 11: January 31, 2005 Compute 1: LUTs.
ESE Spring DeHon 1 ESE534: Computer Organization Day 18: March 26, 2012 Interconnect 5: Meshes (and MoT)
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day14: November 10, 2000 Switching.
ESE Spring DeHon 1 ESE534: Computer Organization Day 20: April 9, 2014 Interconnect 6: Direct Drive, MoT.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 12: February 5, 2003 Interconnect 2: Wiring Requirements.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 17: March 29, 2010 Interconnect 2: Wiring Requirements and Implications.
CALTECH CS137 Winter DeHon 1 CS137: Electronic Design Automation Day 8: January 27, 2006 Cellular Placement.
CALTECH CS137 Fall DeHon 1 CS137: Electronic Design Automation Day 21: November 28, 2005 Routing 1.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 10: January 28, 2005 Empirical Comparisons.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day12: November 1, 2000 Interconnect Requirements and Richness.
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
CS184a: Computer Architecture (Structure and Organization)
CS184a: Computer Architecture (Structure and Organization)
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
CS184a: Computer Architecture (Structures and Organization)
ESE534: Computer Organization
CS184a: Computer Architecture (Structure and Organization)
CprE / ComS 583 Reconfigurable Computing
Reconfigurable Computing (EN2911X, Fall07)
Presentation transcript:

Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 15: February 12, 2003 Interconnect 5: Meshes

Caltech CS184 Winter DeHon 2 Previous Saw we needed to exploit locality/structure in interconnect Saw a mesh might be useful –Question: how does w grow? Saw Rent’s Rule as a way to characterize structure

Caltech CS184 Winter DeHon 3 Today Mesh: –Channel width bounds –Linear population –Switch requirements –Routability –Segmentation –Clusters –Commercial

Caltech CS184 Winter DeHon 4 Mesh

Caltech CS184 Winter DeHon 5 Mesh Channels Lower Bound on w? Bisection Bandwidth –BW  N p –N 0.5 channels in bisection

Caltech CS184 Winter DeHon 6 Straight-forward Switching Requirements Total Switches? Switching Delay?

Caltech CS184 Winter DeHon 7 Switch Delay Switching Delay: 2  (N subarray ) –worst case: N subarray = N

Caltech CS184 Winter DeHon 8 Total Switches Switches per switchbox: –4 3w  w / 2 = 6w 2 –Bidirectional switches (N  W same as W  N) double count

Caltech CS184 Winter DeHon 9 Total Switches Switches per switchbox: –4 3w  w / 2 = 6w 2 Switches into network: –(K+1) w Switches per PE: –6w 2 +(K+1) w –w = cN p-0.5 –Total  N 2p-1 Total Switches: N*(Sw/PE)  N 2p

Caltech CS184 Winter DeHon 10 Routability? Asking if you can route in a given channel width is: –NP-complete

Caltech CS184 Winter DeHon 11 Traditional Mesh Population Switchbox contains only a linear number of switches in channel width

Caltech CS184 Winter DeHon 12 Linear Mesh Switchbox Each entering channel connect to: –One channel on each remaining side (3) –4 sides –W wires –Bidirectional switches (N  W same as W  N) double count –3  4  W/2=6W switches vs. 6w 2 for full population

Caltech CS184 Winter DeHon 13 Total Switches Switches per switchbox: –6w Switches into network: –(K+1) w Switches per PE: –6w +(K+1) w –w = cN p-0.5 –Total  N p-0.5 Total Switches: N*(Sw/PE)  N p+0.5 > N

Caltech CS184 Winter DeHon 14 Total Switches  N p+0.5 > N  N p+0.5 < N 2p Switches grow faster than nodes Wires grow faster than switches

Caltech CS184 Winter DeHon 15 Checking Constants Wire pitch = 8 switch area = wire area: (8w) 2 switch area: 6  2500 w crossover –w=234 ? –(practice smaller)

Caltech CS184 Winter DeHon 16 Checking Constants: Full Population Wire pitch = 8 switch area = wire area: (8w) 2 switch area: 6  2500 w 2 effective wire pitch: 120  times pitch

Caltech CS184 Winter DeHon 17 Practical Just showed: –would take 15  Mapping Ratio for linear population to take same area as full population (once crossover to wire dominated) Can afford to not use some wires perfectly –to reduce switches

Caltech CS184 Winter DeHon 18 Diamond Switch Typical switchbox pattern: –Used by Xilinx Many less switches, but cannot guarantee will be able to use all the wires –may need more wires than implied by Rent, since cannot use all wires –this was already true…now more so

Caltech CS184 Winter DeHon 19 Domain Structure Once enter network (choose color) can only switch within domain

Caltech CS184 Winter DeHon 20 Universal SwitchBox Same number of switches as diamond Locally: can guarantee to satisfy any set of requests –request = direction through swbox –as long as meet channel capacities –and order on all channels irrelevant –can satisfy Not a global property –no guarantees between swboxes

Caltech CS184 Winter DeHon 21 Diamond vs. Universal? Universal routes strictly more configurations

Caltech CS184 Winter DeHon 22 Inter-Switchbox Constraints Channels connect switchboxes For valid route, must satisfy all adjacent switchboxes

Caltech CS184 Winter DeHon 23 Mapping Ratio? How bad is it? How much wider do channels have to be? Mapping Ratio: –detail channel width required / global ch width

Caltech CS184 Winter DeHon 24 Mapping Ratio Empirical: –Seems plausible, constant in practice Theory/provable: –There is no Constant Mapping Ratio At least detail/global –can be arbitrarily large!

Caltech CS184 Winter DeHon 25 Domain Structure Once enter network (choose color) can only switch within domain

Caltech CS184 Winter DeHon 26 Detail Routing as Coloring

Caltech CS184 Winter DeHon 27 Detail Routing as Coloring Global Route channel width = 2 Detail Route channel width = N –Can make arbitrarily large difference

Caltech CS184 Winter DeHon 28 Detail Routing as Coloring

Caltech CS184 Winter DeHon 29 Routability Domain Routing is NP-Complete –can reduce coloring problem to domain selection i.e. map adjacent nodes to same channel Previous example shows basic shape –(another reason routers are slow)

Caltech CS184 Winter DeHon 30 Routing Lack of detail/global mapping ratio –Says detail can be arbitrarily worse than global –Say global not necessarily predict detail –Argument against decomposing mesh routing into global phase and detail phase Modern FPGA routers do not

Caltech CS184 Winter DeHon 31 Segmentation To improve speed (decrease delay) Allow wires to bypass switchboxes Maybe save switches? Certainly cost more wire tracks

Caltech CS184 Winter DeHon 32 Segmentation Segment of Length L seg –6 switches per switchbox visited –Only enters a switchbox every L seg –SW/sbox/track of length Lseg = 6/L seg

Caltech CS184 Winter DeHon 33 Segmentation Reduces switches on path  N/L seg May get fragmentation Another cause of unusable wires

Caltech CS184 Winter DeHon 34 Segmentation: Corner Turn Option Can you corner turn in the middle of a segment? If can, need one more switch SW/sbox/track = 5/L seg + 1

Caltech CS184 Winter DeHon 35 VPR Segment 4 Pix

Caltech CS184 Winter DeHon 36 VPR Segment 4 Route

Caltech CS184 Winter DeHon 37 C-Box Depopulation Not necessary for every input to connect to every channel Saw last time: –K  (N-K+1) switches Maybe use less?

Caltech CS184 Winter DeHon 38 IO Population Toronto Model –Fc fraction of tracks which an input connects to IOs spread over 4 sides Maybe show up on multiple –Shown here: 2

Caltech CS184 Winter DeHon 39 IO Population

Caltech CS184 Winter DeHon 40 Leaves Not LUTs Recall cascaded LUTs Often group collection of LUTs into a Logic Block

Caltech CS184 Winter DeHon 41 Logic Block [Betz+Rose/IEEE D&T 1998]

Caltech CS184 Winter DeHon 42 Cluster Size [Betz+Rose/IEEE D&T 1998]

Caltech CS184 Winter DeHon 43 Inputs Required per Cluster [Betz+Rose/IEEE D&T 1998] Should it be linear?

Caltech CS184 Winter DeHon 44 Mesh Design Parameters Cluster Size –Internal organization LB IO (Fc, sides) Switchbox Population and Topology Segment length distribution Switch rebuffering

Caltech CS184 Winter DeHon 45 Commercial Parts

Caltech CS184 Winter DeHon 46 XC4K Interconnect

Caltech CS184 Winter DeHon 47 XC4K Interconnect Details

Caltech CS184 Winter DeHon 48 Virtex II

Caltech CS184 Winter DeHon 49 Virtex II Interconnect Resources

Caltech CS184 Winter DeHon 50 Big Ideas [MSB Ideas] Mesh natural 2D topology –Channels grow as  (N p-0.5 ) –Wiring grows as  (N 2p ) –Linear Population: Switches grow as  (N p+0.5 ) Unbounded global  detail mapping ratio Detail routing NP-complete

Caltech CS184 Winter DeHon 51 Big Ideas [MSB-1 Ideas] Segmented/bypass routes –can reduce switching delay –costs more wires (fragmentation of wires)