ESE534: Computer Organization

Slides:



Advertisements
Similar presentations
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 14: March 19, 2014 Compute 2: Cascades, ALUs, PLAs.
Advertisements

Balancing Interconnect and Computation in a Reconfigurable Array Dr. André DeHon BRASS Project University of California at Berkeley Why you don’t really.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 15: March 12, 2007 Interconnect 3: Richness.
CS294-6 Reconfigurable Computing Day 8 September 17, 1998 Interconnect Requirements.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day8: October 18, 2000 Computing Elements 1: LUTs.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 9: February 7, 2007 Instruction Space Modeling.
DeHon March 2001 Rent’s Rule Based Switching Requirements Prof. André DeHon California Institute of Technology.
CS294-6 Reconfigurable Computing Day 10 September 24, 1998 Interconnect Richness.
Penn ESE Fall DeHon 1 ESE (ESE534): Computer Organization Day 19: March 26, 2007 Retime 1: Transformations.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 13: February 26, 2007 Interconnect 1: Requirements.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 26: April 18, 2007 Et Cetera…
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 11: February 14, 2007 Compute 1: LUTs.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 3: January 27, 2008 Clustering (LUT Mapping, Delay) Please work preclass example.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 18: March 21, 2007 Interconnect 6: MoT.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 16: March 14, 2007 Interconnect 4: Switching.
Balancing Interconnect and Computation in a Reconfigurable Array Dr. André DeHon BRASS Project University of California at Berkeley Why you don’t really.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 14: February 28, 2007 Interconnect 2: Wiring Requirements and Implications.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 15: February 12, 2003 Interconnect 5: Meshes.
ESE Spring DeHon 1 ESE534: Computer Organization Day 19: April 7, 2014 Interconnect 5: Meshes.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 11: March 3, 2014 Instruction Space Modeling 1.
CBSSS 2002: DeHon Costs André DeHon Wednesday, June 19, 2002.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 18: February 18, 2005 Interconnect 6: MoT.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 14: February 10, 2003 Interconnect 4: Switching.
CBSSS 2002: DeHon Interconnect André DeHon Thursday, June 20, 2002.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 16: February 14, 2003 Interconnect 6: MoT.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 18: April 2, 2014 Interconnect 4: Switching.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 16: February 14, 2005 Interconnect 4: Switching.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 13: February 6, 2003 Interconnect 3: Richness.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 6: January 30, 2013 Partitioning (Intro, KLFM)
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 11: January 31, 2005 Compute 1: LUTs.
ESE Spring DeHon 1 ESE534: Computer Organization Day 18: March 26, 2012 Interconnect 5: Meshes (and MoT)
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day14: November 10, 2000 Switching.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 11: February 20, 2012 Instruction Space Modeling.
ESE Spring DeHon 1 ESE534: Computer Organization Day 20: April 9, 2014 Interconnect 6: Direct Drive, MoT.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 12: February 5, 2003 Interconnect 2: Wiring Requirements.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 17: March 29, 2010 Interconnect 2: Wiring Requirements and Implications.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day12: November 1, 2000 Interconnect Requirements and Richness.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 25: April 17, 2013 Covering and Retiming.
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
ESE532: System-on-a-Chip Architecture
ESE534: Computer Organization
ESE534: Computer Organization
COMPUTATIONAL MODELS.
ESE532: System-on-a-Chip Architecture
ESE534 Computer Organization
ESE532: System-on-a-Chip Architecture
CS184a: Computer Architecture (Structure and Organization)
CS137: Electronic Design Automation
ESE535: Electronic Design Automation
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
ESE535: Electronic Design Automation
Sungho Kang Yonsei University
ESE534: Computer Organization
Professor Ioana Banicescu CSE 8843
ESE535: Electronic Design Automation
CS184a: Computer Architecture (Structures and Organization)
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
ESE535: Electronic Design Automation
ESE535: Electronic Design Automation
CS137: Electronic Design Automation
CS184a: Computer Architecture (Structure and Organization)
ESE532: System-on-a-Chip Architecture
Presentation transcript:

ESE534: Computer Organization Day 18: March 31, 2010 Interconnect 3: Richness Penn ESE534 Spring2010 -- DeHon

Last Time Rent’s Rule Superlinear growth rate of interconnect And its implications Superlinear growth rate of interconnect p>0.5  Area growth W(N2p) Penn ESE534 Spring2010 -- DeHon

Today How rich should interconnect be? specifics of understanding interconnect methodology for attacking these kinds of questions Penn ESE534 Spring2010 -- DeHon

Now What? There is structure (locality) Rent characterizes locality How rich should interconnect be? Allow full utilization? Most area efficient? Need to model requirements and area impact Penn ESE534 Spring2010 -- DeHon

Preclass 1 and 2 Wire count? Penn ESE534 Spring2010 -- DeHon

Preclass 1 and 2 Wire count? Penn ESE534 Spring2010 -- DeHon

Preclass 1 and 2 Wire count? Penn ESE534 Spring2010 -- DeHon

Step 1: Build Architecture Model Assume geometric growth Pick parameters: Build architecture can tune C p Penn ESE534 Spring2010 -- DeHon

Tree of Meshes Natural model is hierarchical Restricted internal bandwidth Can match to model Penn ESE534 Spring2010 -- DeHon

Parameterize C Penn ESE534 Spring2010 -- DeHon

Parameterize p What is p for each network? Penn ESE534 Spring2010 -- DeHon

Parameterize Growth (2 1)* (2 2 1)* (2 2 2 1)* Penn ESE534 Spring2010 -- DeHon

Step 2: Area Model Need to know effect of architecture parameters on area (costs) focus on dominant components wires switches logic blocks(?) Penn ESE534 Spring2010 -- DeHon

Area Parameters Alogic = 40Kl2 Asw = 2.5Kl2 Wire Pitch = 8l Penn ESE534 Spring2010 -- DeHon

Switchbox Population Full population is excessive (next lecture) Hypothesis: linear population adequate still to be (dis)proven Penn ESE534 Spring2010 -- DeHon

“Cartoon” VLSI Area Model (Example artificially small for clarity) Penn ESE534 Spring2010 -- DeHon

Larger “Cartoon” P=0.67 LUT Area 3% 1024 LUT Network Penn ESE534 Spring2010 -- DeHon

Effects of P on Area P=0.5 P=0.67 P=0.75 1024 LUT Area Comparison Penn ESE534 Spring2010 -- DeHon

Effects of P on Capacity Penn ESE534 Spring2010 -- DeHon

Step 3: Characterize Application Requirements Identify representative applications. Today: IWLS93 logic benchmarks How much structure there? How much variation among applications? Penn ESE534 Spring2010 -- DeHon

Application Requirements Compare Problem 1 Max: C=7, P=0.68 Avg: C=5, P=0.72 Penn ESE534 Spring2010 -- DeHon

Benchmark Wide Penn ESE534 Spring2010 -- DeHon

Benchmark Parameters Penn ESE534 Spring2010 -- DeHon

Complication Interconnect requirements vary among applications Interconnect richness has large effect on area What is effect of architecture/application mismatch? Interconnect too rich? Interconnect too poor? Penn ESE534 Spring2010 -- DeHon

Interconnect Mismatch in Theory Penn ESE534 Spring2010 -- DeHon

Step 4: Assess Resource Impact Map designs to parameterized architecture Identify architectural resource required Compare: mapping to k-LUTs; LUT count vs. k. Penn ESE534 Spring2010 -- DeHon

Mapping to Fixed Wire Schedule Easy if need fewer wires than Net If need more wires than net, must depopulate to meet interconnect limitations. Penn ESE534 Spring2010 -- DeHon

Preclass 3 Smallest Network that Top graph fits on? Penn ESE534 Spring2010 -- DeHon

Mapping to Fixed-WS Better results if “reassociate” rather than keeping original subtrees. Penn ESE534 Spring2010 -- DeHon

Preclass 3 Smallest Network that Middle graph fits on? Penn ESE534 Spring2010 -- DeHon

Middle vs. Top Penn ESE534 Spring2010 -- DeHon

Observation Don’t really want a “bisection” of LUTs subtree filled to capacity by either of LUTs root bandwidth May be profitable to cut at some place other than midpoint not require “balance” condition “Bisection” should account for both LUT and wiring limitations Penn ESE534 Spring2010 -- DeHon

Preclass 3 Smallest Network that Bottom graph fits on? Penn ESE534 Spring2010 -- DeHon

Middle vs. Bottom Penn ESE534 Spring2010 -- DeHon

Challenge Not know where to cut design not knowing when wires will limit subtree capacity Penn ESE534 Spring2010 -- DeHon

Brute Force Solution Explore all cuts Viable? start with all LUTs in group consider “all” balances try cut Recurse Viable? Penn ESE534 Spring2010 -- DeHon

Brute Force Too expensive Exponential work …viable if solving same subproblems Penn ESE534 Spring2010 -- DeHon

Simplification Single linear ordering Partitions = pick split point on ordering Reduce to finding cost of [start,end] ranges (subtrees) within linear ordering Only n2 such subproblems Can solve with dynamic programming Penn ESE534 Spring2010 -- DeHon

Dynamic Programming Just one possible “heuristic” solution to this problem not optimal dependent on ordering sacrifices ability to reorder on splits to avoid exponential problem size Opportunity to find a better solution here... Penn ESE534 Spring2010 -- DeHon

Ordering LUTs Another problem lay out gates in 1D line minimize sum of squared wire length tend to cluster connected gates together Is solvable mathematically for optimal Eigenvector of connectivity matrix Use this 1D ordering for our linear ordering Penn ESE534 Spring2010 -- DeHon

Mapping Results Penn ESE534 Spring2010 -- DeHon

Step 5: Apply Area Model Assess impact of resource results Penn ESE534 Spring2010 -- DeHon

Resources  Area Model  Area Penn ESE534 Spring2010 -- DeHon

Net Area Penn ESE534 Spring2010 -- DeHon

Picking Network Design Point Don’t optimize for 100% compute util. (100% yield) …also don’t optimize for highest peak. Penn ESE534 Spring2010 -- DeHon

What about a single design? Penn ESE534 Spring2010 -- DeHon

LUT Utilization predict Area? Single design Penn ESE534 Spring2010 -- DeHon

Methodology Architecture model (parameterized) Cost model Important task characteristics Mapping Algorithm Map to determine resources Apply cost model Digest results find optimum (multiple?) understand conflicts (avoidable?) Penn ESE534 Spring2010 -- DeHon

Admin Reading for Monday online Encourage start looking at HW8 Can run VPR experiments Detail modeling of energy in architecture will build on material covered Monday and Wednesday Penn ESE534 Spring2010 -- DeHon

Big Ideas [MSB Ideas] Interconnect area dominates logic area Interconnect requirements vary among designs within a single design To minimize area focus on using dominant resource (interconnect) may underuse non-dominant resources (LUTs) Penn ESE534 Spring2010 -- DeHon

Big Ideas [MSB Ideas] Two different resources here compute, interconnect Balance of resources required varies among designs (even within designs) Cannot expect full utilization of every resource Most area-efficient designs may waste some compute resources (cheaper resource) Penn ESE534 Spring2010 -- DeHon