Mixed-Size Placement with Fixed Macrocells using Grid-Warping Zhong Xiu, Rob Rutenbar Advanced Micro Devices Inc., Department of Electrical and Computer.

Slides:

Advertisements

Similar presentations

Constraint Driven I/O Planning and Placement for Chip-package Co-design Jinjun Xiong, Yiuchung Wong, Egino Sarto, Lei He University of California, Los.

Advertisements

Optimization of Placement Solutions for Routability Wen-Hao Liu, Cheng-Kok Koh, and Yih-Lang Li DAC’13.

Natarajan Viswanathan Min Pan Chris Chu Iowa State University International Symposium on Physical Design April 6, 2005 FastPlace: An Analytical Placer.

Meng-Kai Hsu, Sheng Chou, Tzu-Hen Lin, and Yao-Wen Chang Electronics Engineering, National Taiwan University Routability Driven Analytical Placement for.

A Size Scaling Approach for Mixed-size Placement Kalliopi Tsota, Cheng-Kok Koh, Venkataramanan Balakrishnan School of Electrical and Computer Engineering.

Shuai Li and Cheng-Kok Koh School of Electrical and Computer Engineering, Purdue University West Lafayette, IN, Mixed Integer Programming Models.

Ripple: An Effective Routability-Driven Placer by Iterative Cell Movement Xu He, Tao Huang, Linfu Xiao, Haitong Tian, Guxin Cui and Evangeline F.Y. Young.

SimPL: An Effective Placement Algorithm Myung-Chul Kim, Dong-Jin Lee and Igor L. Markov Dept. of EECS, University of Michigan 1ICCAD 2010, Myung-Chul Kim,

Consistent Placement of Macro-Blocks Using Floorplanning and Standard-Cell Placement Saurabh Adya Igor Markov (University of Michigan)

FastPlace: Efficient Analytical Placement using Cell Shifting, Iterative Local Refinement and a Hybrid Net Model FastPlace: Efficient Analytical Placement.

Early Days of Circuit Placement Martin D. F. Wong Department of Electrical and Computer Engineering University of Illinois at Urbana-Champaign.

1 Understanding force-directed placement Andrew Kennings Electrical and Computer Engineering University of Waterloo.

APLACE: A General and Extensible Large-Scale Placer Andrew B. KahngSherief Reda Qinke Wang VLSICAD lab University of CA, San Diego.

International Conference on Computer-Aided Design San Jose, CA Nov. 2001ER UCLA UCLA 1 Congestion Reduction During Placement Based on Integer Programming.

38 th Design Automation Conference, Las Vegas, June 19, 2001 Creating and Exploiting Flexibility in Steiner Trees Elaheh Bozorgzadeh, Ryan Kastner, Majid.

Constructive Benchmarking for Placement David A. Papa EECS Department University of Michigan Ann Arbor, MI Igor L. Markov EECS.

Architecture and Details of a High Quality, Large-Scale Analytical Placer Andrew B. Kahng, Sherief Reda and Qinke Wang VLSI CAD Lab University of California,

An Analytic Placer for Mixed-Size Placement and Timing-Driven Placement Andrew B. Kahng and Qinke Wang UCSD CSE Department {abk, Work.

Supply Voltage Degradation Aware Analytical Placement Andrew B. Kahng, Bao Liu and Qinke Wang UCSD CSE Department {abk, bliu,

On Legalization of Row-Based Placements Andrew B. KahngSherief Reda CSE & ECE Departments University of CA, San Diego La Jolla, CA 92093

Can Recursive Bisection Alone Produce Routable Placements? Andrew E. Caldwell Andrew B. Kahng Igor L. Markov Supported by Cadence.

Active Set Support Vector Regression

POLAR 2.0: An Effective Routability-Driven Placer Chris Chu Tao Lin.

7/15/ VLSI Placement Prof. Shiyan Hu Office: EERC 731.

MGR: Multi-Level Global Router Yue Xu and Chris Chu Department of Electrical and Computer Engineering Iowa State University ICCAD

A Parallel Integer Programming Approach to Global Routing Tai-Hsuan Wu, Azadeh Davoodi Department of Electrical and Computer Engineering Jeffrey Linderoth.

CRISP: Congestion Reduction by Iterated Spreading during Placement Jarrod A. Roy†‡, Natarajan Viswanathan‡, Gi-Joon Nam‡, Charles J. Alpert‡ and Igor L.

Global Routing.

TSV-Aware Analytical Placement for 3D IC Designs Meng-Kai Hsu, Yao-Wen Chang, and Valerity Balabanov GIEE and EE department of NTU DAC 2011.

10/11/ VLSI Physical Design Automation Prof. David Pan Office: ACES Placement (2)

March 20, 2007 ISPD An Effective Clustering Algorithm for Mixed-size Placement Jianhua Li, Laleh Behjat, and Jie Huang Jianhua Li, Laleh Behjat,

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig 1 EECS 527 Paper Presentation High-Performance.

Seeing the Forest and the Trees: Steiner Wirelength Optimization in Placement Jarrod A. Roy, James F. Lu and Igor L. Markov University of Michigan Ann.

1 CS612 Algorithms for Electronic Design Automation CS 612 – Lecture 8 Lecture 8 Network Flow Based Modeling Mustafa Ozdal Computer Engineering Department,

Analytic Placement. Layout Project:  Sending the RTL file: −Thursday, 27 Farvardin  Final deadline: −Tuesday, 22 Ordibehesht  New Project: −Soon 2.

Quadratic and Linear WL Placement Using Quadratic Programming: Gordian & Gordian-L Shantanu Dutt ECE Dept., Univ. of Illinois at Chicago Acknowledgements:

Improved Cut Sequences for Partitioning Based Placement Mehmet Can YILDIZ and Patrick H. Madden State University of New York at BinghamtonComputer Science.

Regularity-Constrained Floorplanning for Multi-Core Processors Xi Chen and Jiang Hu (Department of ECE Texas A&M University), Ning Xu (College of CST Wuhan.

Multilevel Generalized Force-directed Method for Circuit Placement Tony Chan 1, Jason Cong 2, Kenton Sze 1 1 UCLA Mathematics Department 2 UCLA Computer.

1/24/20071 ECO-system: Embracing the Change in Placement Jarrod A. Roy and Igor L. Markov University of Michigan at Ann Arbor.

Placement. Physical Design Cycle Partitioning Placement/ Floorplanning Placement/ Floorplanning Routing Break the circuit up into smaller segments Place.

Jason Cong‡†, Guojie Luo*†, Kalliopi Tsota‡, and Bingjun Xiao‡ ‡Computer Science Department, University of California, Los Angeles, USA *School of Electrical.

Session 10: The ISPD2005 Placement Contest. 2 Outline  Benchmark & Contest Introduction  Individual placement presentation  FastPlace, Capo, mPL, FengShui,

Quadratic VLSI Placement Manolis Pantelias. General Various types of VLSI placement  Simulated-Annealing  Quadratic or Force-Directed  Min-Cut  Nonlinear.

I N V E N T I V EI N V E N T I V E A Morphing Approach To Address Placement Stability Philip Chong Christian Szegedy.

Physical Synthesis Comes of Age Chuck Alpert, IBM Corp. Chris Chu, Iowa State University Paul Villarrubia, IBM Corp.

An Effective Congestion Driven Placement Framework André Rohe University of Bonn, Germany joint work with Ulrich Brenner.

CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 13: February 20, 2002 Routing 1.

International Workshop on System-Level Interconnection Prediction, Sonoma County, CA March 2001ER UCLA UCLA 1 Wirelength Estimation based on Rent Exponents.

Parallel Solution of the Poisson Problem Using MPI

CS 484 Load Balancing. Goal: All processors working all the time Efficiency of 1 Distribute the load (work) to meet the goal Two types of load balancing.

1 NTUplace: A Partitioning Based Placement Algorithm for Large-Scale Designs Tung-Chieh Chen 1, Tien-Chang Hsu 1, Zhe-Wei Jiang 1, and Yao-Wen Chang 1,2.

1 CS612 Algorithms for Electronic Design Automation CS 612 – Lecture 8 Lecture 8 Network Flow Based Modeling Mustafa Ozdal Computer Engineering Department,

Unified Quadratic Programming Approach for Mixed Mode Placement Bo Yao, Hongyu Chen, Chung-Kuan Cheng, Nan-Chi Chou*, Lung-Tien Liu*, Peter Suaris* CSE.

High-Performance Global Routing with Fast Overflow Reduction Huang-Yu Chen, Chin-Hsiung Hsu, and Yao-Wen Chang National Taiwan University Taiwan.

International Symposium on Physical Design San Diego, CA April 2002ER UCLA UCLA 1 Routability Driven White Space Allocation for Fixed-Die Standard-Cell.

Design Automation Conference (DAC), June 6 th, Taming the Complexity of Coordinated Place and Route Jin Hu †, Myung-Chul Kim †† and Igor L. Markov.

An Exact Algorithm for Difficult Detailed Routing Problems Kolja Sulimma Wolfgang Kunz J. W.-Goethe Universität Frankfurt.

May Mike Drob Grant Furgiuele Ben Winters Advisor: Dr. Chris Chu Client: IBM IBM Contact – Karl Erickson.

Effective Linear Programming-Based Placement Techniques Sherief Reda UC San Diego Amit Chowdhary Intel Corporation.

6/19/ VLSI Physical Design Automation Prof. David Pan Office: ACES Placement (3)

RTL Design Flow RTL Synthesis HDL netlist logic optimization netlist Library/ module generators physical design layout manual design a b s q 0 1 d clk.

VLSI Physical Design Automation

HeAP: Heterogeneous Analytical Placement for FPGAs

VLSI Quadratic Placement

APLACE: A General and Extensible Large-Scale Placer

EE5780 Advanced VLSI Computer-Aided Design

EDA Lab., Tsinghua University

VLSI Physical Design Automation

Under a Concurrent and Hierarchical Scheme

Presentation transcript:

Mixed-Size Placement with Fixed Macrocells using Grid-Warping Zhong Xiu*, Rob Rutenbar * Advanced Micro Devices Inc., Department of Electrical and Computer Engineering, Carnegie Mellon University

Slide 2 Placement by Grid-Warping In [Zhong et al, DAC04], we showed first grid-warping placer In [Zhong et al, DAC05], we showed our timing-driven placer Fundamentally new idea for placement improvement  Imagine we place the gates on the surface of a flexible elastic sheet  We stretch the sheet to improve the placement Quadratic Initial placement Warp Placement surface Improved warped result Recurse & descend to continue

Slide 3 Grid Warping: Attractive Features Novel paradigm for placement: optimize the grid, not the gates  Think of “gravity” – we reshape curvature of space to move the mass Flexibly nonlinear  Free to warp anyway we like; not driven primarily by linear solves Low-dimensional optimization problem  We only need to control the sheet, we don’t move gates individually Early prototypes – WARP1, WARP2 – perform well  Competitive on wirelength with other published placers  As fast – or faster – than many other analytical placers

Slide 4 Organization of this Talk What’s missing? To handle Mixed-Size Placement  Wirelength/Timing optimization is necessary but not sufficient  We must be able to handle mixed-size placement with fixed macrocells First, we review basic mechanics of grid-warping Second, we show how to extend grid-warping for mixed-size placement with fixed macrocells Finally, we show our results and future directions

Slide 5 Review: Mechanics of Grid-Warping It’s conceptually useful to think of warping as distorting a regular mesh placed on the elastic placement surface…..but this is not actually how we implement warping Quadratic Initial placement Warp Placement surface Improved warped result Recurse & descend to continue

Slide 6 Warp grids and acquire gates We Formulate Warping in an “Inverse” Way We warp to “acquire” a new set of gates in each unit grid area… … then “pull” gates back to the undistorted grid, to move them Restore grids and pull gates back Initial placement mass and grids

Slide 7 And We Do Not Use a Regular Warping Grid 2x2 Warping grid4x4 Warping grid Instead, we use a grid defined by a set of slicing cuts  It turns out this allows a greater range of motion for the gates Yes—a lot like quadrisection or partitioning, but more general  The cuts need not be axis parallel  Because gates are fully placed in each region, we get real wirelength

Slide 8 Improvement Pre-Warping Decompose/Recurse Nonlinear Grid-Warping Loop Quadratic Placement Legalize (Domino) Complete Grid Warping Flow Complete flow has several steps We review them briefly here

Slide 9 Improvement Pre-Warping Decompose/Recurse Nonlinear Grid-Warping Loop Quadratic Placement Legalize (Domino) Complete Grid Warping Flow Quadratic place onto elastic sheet Note: pure quadratic wirelength No reweighting steps

Slide 10 Improvement Pre-Warping Decompose/Recurse Nonlinear Grid-Warping Loop Quadratic Placement Legalize (Domino) Complete Grid Warping Flow Geometric pre-conditioning step Spreads gates out quickly, uniformly, to improve final wirelen

Slide 11 Improvement Pre-Warping Decompose/Recurse Nonlinear Grid-Warping Loop Quadratic Placement Legalize (Domino) Complete Grid Warping Flow Nonlinear optimizer iteratively perturbs warping grid on sheet

Slide 12 Complete Grid Warping Flow Improvement Pre-Warping Decompose/Recurse Nonlinear Grid-Warping Loop Quadratic Placement Legalize (Domino) Nonlinear optimizer iteratively perturbs warping grid on sheet..each new warping is quickly “stretched” back to a full placement Use this to eval cost function, which tracks ‘rectilinear wirelen + capacity’ stretched

Slide 13 Improvement Pre-Warping Decompose/Recurse Nonlinear Grid-Warping Loop Quadratic Placement Legalize (Domino) Complete Grid Warping Flow Nonlinear optimizer delivers a final warped placement Standard improvement step runs hMetis to optimize location of gates placed near partition cuts

Slide 14 Improvement Pre-Warping Decompose/Recurse Nonlinear Grid-Warping Loop Quadratic Placement Legalize (Domino) Complete Grid Warping Flow Recurse: in this case, 4 new placements inside 4 regions Continue until ~few gates/region

Slide 15 Improvement Pre-Warping Decompose/Recurse Nonlinear Grid-Warping Loop Quadratic Placement Legalize (Domino) Complete Grid Warping Flow Warping flow delivers a final, but still slightly illegal, placement Use Domino (T.U. Munich) to legalize to final detailed placement

Slide 16 Problem: Warped Placement with Macrocells Assumptions  We focus on the fixed-macro case The core problem  Warping is intrinsically weak at separating large macros and small gates  All instances modeled as points; elastic “stretching” keeps nearby points close

Slide 17 Handling Fixed Macrocells Improvement Pre-Warping Decompose/Recurse Nonlinear Grid-Warping Loop Quadratic Placement Legalize (FastPlace) Re-Warping 4 new geometric solutions Inside warping, a geometric “hash” function that greedily re-locates gates that warp on top of macrocells …and a new net model (QP) During partition improvement, closer attention to size imbalances New backend (FastPlace)

Slide 18 (1) Geometric Hashing During Warping Problem: nonlinear warping drops gates on top of fixed macros Solution: “hash” them off, inside warping loop  Inside warping, inside each eval of global cost func, check if each cell overlaps macro  If so, we push it to the nearest boundary that has enough space for the cell  We chop chip up into small grids, store “nearest boundary” info in a hash table Note  No attempt to manage density or wirelength in this solution, just legality M M M M

Slide 19 (2) New Net Models 1 st QP & QPs in re-warping  For 2-pin nets, use the clique model, and set weight to 1.  For nets with 3 or more pins, use star model and introduce a new variable, each net has a weight: #Pins/(#Pins - 1). (FastPlace, ISPD’04) 2 nd QP and beyond  At 2nd layer of QP and beyond, use Jens Vygen’s net-split technique  Hybrid models, make QP faster and conserve quality If a column (row) contains only one cell, star model is used and no additional variables is introduced. If a column (row) contains two or more cells, a new cell is introduced and is connected to all the internal cells and all the propagated pins.

Slide 20 (3) Better Consideration of Capacity Improvement Pre-Warping Decompose/Recurse Nonlinear Grid-Warping Loop Quadratic Placement Legalize (FastPlace) Re-Warping Get rid of “Pre-Warping” Cost Function: use a single-sided cost function, under-filled regions receive no capacity penalty hMetis: the total area of all cells in each region does not exceed the capacity of that region Re-Warping: all of the above mentioned modifications are applied here

Slide 21 (4) New Backend Use ideas from FastPlace [ICCAD’05]  Swap based detail placement algorithm – greedy algorithm  Find the “optimal region” for a cell, if the cell is not in this region, try to swap it with a cell or space in that optimal region Detailed backend flow  First pass legalize  FengShui 5.1 from SUNY Binghm  Local repair for small macros/overlaps  Local greedy swaps  Wirelength minimization  Chu’s FastPlace legalizer ideas Our new overall flow  Run WARP as global placement  Sort all the cells and ~legalize them  Run the new detailed backend

Slide 22 Macrocell Results – ISPD’02 Wirelength  3-4% better than Feng Shui  Competitive with BonnPlace Run time  2X Fengshui  ~ Competitive with BonnPlace  Exact comparison difficult – BonnPlace is running on an 1.45GHz IBM 4- processor server and is explicitly parallelized software  We’re on a 2.0GHz Xeon. Feng Shui 5.1BonnPlaceWarp 3 WireCPUWireCPUWireCPU

Slide 23 ISPD’02 Layouts ibm01ibm04

Slide 24 More Macrocell Results – ISPD’05 Design WARP 3APlace GlobalLegalizationBackendWirelen adaptec adaptec bigblue bigblue bigblue bigblue Ratio Versus APlace  ~7% more wirelen  Total hours for all 6 designs for Warp on 2.8GHz Xeon Aside: ISPD’05 contest results:  APlace 1.00  mFAR 1.06  Dragon 1.08  mPL 1.09  FastPlace 1.16  Capo 1.17  NTUP 1.21  Fengshui 1.50  Kraftwerk+Domino 1.84

Slide 25 ISPD’05 Layouts adaptec2 adaptec4

Slide 26 ISPD’05 Layouts bigblue2bigblue3

Slide 27 Conclusion and Future Work Placement with macrocells – WARP3 – competitive  New techniques such as geometric hashing, improved hybrid net model  Can produce very good quality mixed-size placements reasonably quickly Future Work  Improve both quality and runtime  Better handle macrocells and routing congestion  “Hybrid” layout strategies (warping, but a flatter, analytical style)