VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig 1 EECS 527 Paper Presentation High-Performance.

Slides:



Advertisements
Similar presentations
Group: Wilber L. Duran Duo (Steve) Liu
Advertisements

Optimization of Placement Solutions for Routability Wen-Hao Liu, Cheng-Kok Koh, and Yih-Lang Li DAC’13.
Topology-Aware Buffer Insertion and GPU-Based Massively Parallel Rerouting for ECO Timing Optimization Yen-Hung Lin, Yun-Jian Lo, Hian-Syun Tong, Wen-Hao.
Wen-Hao Liu1, Yih-Lang Li, and Cheng-Kok Koh Department of Computer Science, National Chiao-Tung University School of Electrical and Computer Engineering,
Meng-Kai Hsu, Sheng Chou, Tzu-Hen Lin, and Yao-Wen Chang Electronics Engineering, National Taiwan University Routability Driven Analytical Placement for.
Ripple: An Effective Routability-Driven Placer by Iterative Cell Movement Xu He, Tao Huang, Linfu Xiao, Haitong Tian, Guxin Cui and Evangeline F.Y. Young.
EXPLORING HIGH THROUGHPUT COMPUTING PARADIGM FOR GLOBAL ROUTING Yiding Han, Dean Michael Ancajas, Koushik Chakraborty, and Sanghamitra Roy Electrical and.
© KLMH Lienig Multi-Threaded Collision Aware Global Routing Bounded Length Maze Routing.
Coupling-Aware Length-Ratio- Matching Routing for Capacitor Arrays in Analog Integrated Circuits Kuan-Hsien Ho, Hung-Chih Ou, Yao-Wen Chang and Hui-Fang.
VLSI Physical Design: From Graph Partitioning to Timing Closure Paper Presentation © KLMH Lienig 1 EECS 527 Paper Presentation Topological Design of Clock.
An ILP-based Automatic Bus Planner for Dense PCBs P. C. Wu, Q. Ma and M. D. F. Wong Department of Electrical and Computer Engineering, University of Illinois.
MCFRoute: A Detailed Router Based on Multi- Commodity Flow Method Xiaotao Jia, Yici Cai, Qiang Zhou, Gang Chen, Zhuoyuan Li, Zuowei Li.
International Conference on Computer-Aided Design San Jose, CA Nov. 2001ER UCLA UCLA 1 Congestion Reduction During Placement Based on Integer Programming.
38 th Design Automation Conference, Las Vegas, June 19, 2001 Creating and Exploiting Flexibility in Steiner Trees Elaheh Bozorgzadeh, Ryan Kastner, Majid.
1 BoxRouter: A New Global Router Based on Box Expansion and Progressive ILP Minsik Cho and David Z. Pan ECE Dept. Univ. of Texas at Austin DAC 2006, July.
Triple Patterning Aware Detailed Placement With Constrained Pattern Assignment Haitong Tian, Yuelin Du, Hongbo Zhang, Zigang Xiao, Martin D.F. Wong.
A Resource-level Parallel Approach for Global-routing-based Routing Congestion Estimation and a Method to Quantify Estimation Accuracy Wen-Hao Liu, Zhen-Yu.
Metal Layer Planning for Silicon Interposers with Consideration of Routability and Manufacturing Cost W. Liu, T. Chien and T. Wang Department of CS, NTHU,
POLAR 2.0: An Effective Routability-Driven Placer Chris Chu Tao Lin.
CDCTree: Novel Obstacle-Avoiding Routing Tree Construction based on Current Driven Circuit Model Speaker: Lei He.
General Routing Overview and Channel Routing
WISCAD – VLSI Design Automation GRIP: Scalable 3-D Global Routing using Integer Programming Tai-Hsuan Wu, Azadeh Davoodi Department of Electrical and Computer.
VLSI Physical Design: From Graph Partitioning to Timing Closure Paper Presentation © KLMH Lienig 1 EECS 527 Paper Presentation Accurate Estimation of Global.
VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig 1 FLUTE: Fast Lookup Table Based RSMT Algorithm.
Introduction to Routing. The Routing Problem Apply after placement Input: –Netlist –Timing budget for, typically, critical nets –Locations of blocks and.
MGR: Multi-Level Global Router Yue Xu and Chris Chu Department of Electrical and Computer Engineering Iowa State University ICCAD
A Topology-based ECO Routing Methodology for Mask Cost Minimization Po-Hsun Wu, Shang-Ya Bai, and Tsung-Yi Ho Department of Computer Science and Information.
Area-I/O Flip-Chip Routing for Chip-Package Co-Design Progress Report 方家偉、張耀文、何冠賢 The Electronic Design Automation Laboratory Graduate Institute of Electronics.
Authors: Jia-Wei Fang,Chin-Hsiung Hsu,and Yao-Wen Chang DAC 2007 speaker: sheng yi An Integer Linear Programming Based Routing Algorithm for Flip-Chip.
Escape Routing For Dense Pin Clusters In Integrated Circuits Mustafa Ozdal, Design Automation Conference, 2007 Mustafa Ozdal, IEEE Trans. on CAD, 2009.
Xin-Wei Shih and Yao-Wen Chang.  Introduction  Problem formulation  Algorithms  Experimental results  Conclusions.
CRISP: Congestion Reduction by Iterated Spreading during Placement Jarrod A. Roy†‡, Natarajan Viswanathan‡, Gi-Joon Nam‡, Charles J. Alpert‡ and Igor L.
Etron Project: Placement and Routing for Chip-Package-Board Co-Design
TSV-Aware Analytical Placement for 3D IC Designs Meng-Kai Hsu, Yao-Wen Chang, and Valerity Balabanov GIEE and EE department of NTU DAC 2011.
Solving Hard Instances of FPGA Routing with a Congestion-Optimal Restrained-Norm Path Search Space Keith So School of Computer Science and Engineering.
Wen-Hao Liu 1, Yih-Lang Li 1, and Kai-Yuan Chao 2 1 Department of Computer Science, National Chiao-Tung University, Hsin-Chu, Taiwan 2 Intel Architecture.
March 20, 2007 ISPD An Effective Clustering Algorithm for Mixed-size Placement Jianhua Li, Laleh Behjat, and Jie Huang Jianhua Li, Laleh Behjat,
Archer: A History-Driven Global Routing Algorithm Mustafa Ozdal Intel Corporation Martin D. F. Wong Univ. of Illinois at Urbana-Champaign Mustafa Ozdal.
UC San Diego / VLSI CAD Laboratory Incremental Multiple-Scan Chain Ordering for ECO Flip-Flop Insertion Andrew B. Kahng, Ilgweon Kang and Siddhartha Nath.
New Modeling Techniques for the Global Routing Problem Anthony Vannelli Department of Electrical and Computer Engineering University of Waterloo Waterloo,
Efficient Multi-Layer Obstacle- Avoiding Rectilinear Steiner Tree Construction Chung-Wei Lin, Shih-Lun Huang, Kai-Chi Hsu,Meng-Xiang Li, Yao-Wen Chang.
VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig 1 EECS 527 Paper Presentation Techniques for Fast.
AUTOMATIC BUS PLANNER FOR DENSE PCBS Hui Kong, Tan Yan and Martin D.F. Wong Department of Electrical and Computer Engineering, University of Illinois at.
Ping-Hung Yuh, Chia-Lin Yang, and Yao-Wen Chang
IO CONNECTION ASSIGNMENT AND RDL ROUTING FOR FLIP-CHIP DESIGNS Jin-Tai Yan, Zhi-Wei Chen 1 ASPDAC.2009.
VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 6: Detailed Routing © KLMH Lienig 1 What Makes a Design Difficult to Route Charles.
ARCHER:A HISTORY-DRIVEN GLOBAL ROUTING ALGORITHM Muhammet Mustafa Ozdal, Martin D. F. Wong ICCAD ’ 07.
Tao Lin Chris Chu TPL-Aware Displacement- driven Detailed Placement Refinement with Coloring Constraints ISPD ‘15.
Huang-Yu Chen †, Mei-Fang Chiang †, Yao-Wen Chang † Lumdo Chen ‡, and Brian Han ‡ Novel Full-Chip Gridless Routing Considering Double-Via Insertion † The.
Register Placement for High- Performance Circuits M. Chiang, T. Okamoto and T. Yoshimura Waseda University, Japan DATE 2009.
1 Efficient Obstacle-Avoiding Rectilinear Steiner Tree Construction Chung-Wei Lin, Szu-Yu Chen, Chi-Feng Li, Yao-Wen Chang, Chia-Lin Yang National Taiwan.
A SAT-Based Routing Algorithm for Cross-Referencing Biochips Ping-Hung Yuh 1, Cliff Chiung-Yu Lin 2, Tsung- Wei Huang 3, Tsung-Yi Ho 3, Chia-Lin Yang 4,
A Negotiated Congestion based Router for Simultaneous Escape Routing Q.Ma, T.Yan and Martin D.F. Wong Department of Electrical and Computer Engineering.
An Efficient Linear Time Triple Patterning Solver Haitong Tian Hongbo Zhang Zigang Xiao Martin D.F. Wong ASP-DAC’15.
Escape Routing of Mixed-Pattern Signals Based on Staggered-Pin- Array PCBs K. Wang, H. Wang and S. Dong Department of Computer Science & Technology, Tsinghua.
1 A Min-Cost Flow Based Detailed Router for FPGAs Seokjin Lee *, Yongseok Cheon *, D. F. Wong + * The University of Texas at Austin + University of Illinois.
Non-stitch Triple Patterning- Aware Routing Based on Conflict Graph Pre-coloring Po-Ya Hsu Yao-Wen Chang.
ILP-Based Inter-Die Routing for 3D ICs Chia-Jen Chang, Pao-Jen Huang, Tai-Chen Chen, and Chien-Nan Jimmy Liu Department of Electrical Engineering, National.
Maze Routing Algorithms with Exact Matching Constraints for Analog and Mixed Signal Designs M. M. Ozdal and R. F. Hentschke Intel Corporation ICCAD 2012.
1 NTUplace: A Partitioning Based Placement Algorithm for Large-Scale Designs Tung-Chieh Chen 1, Tien-Chang Hsu 1, Zhe-Wei Jiang 1, and Yao-Wen Chang 1,2.
System in Package and Chip-Package-Board Co-Design
High-Performance Global Routing with Fast Overflow Reduction Huang-Yu Chen, Chin-Hsiung Hsu, and Yao-Wen Chang National Taiwan University Taiwan.
Congestion Analysis for Global Routing via Integer Programming Hamid Shojaei, Azadeh Davoodi, and Jeffrey Linderoth* Department of Electrical and Computer.
FPGA Routing Pathfinder [Ebeling, et al., 1995] Introduced negotiated congestion During each routing iteration, route nets using shortest.
1 Chapter 5 Branch-and-bound Framework and Its Applications.
11 Yibo Lin 1, Xiaoqing Xu 1, Bei Yu 2, Ross Baldick 1, David Z. Pan 1 1 ECE Department, University of Texas at Austin 2 CSE Department, Chinese University.
VLSI Physical Design Automation
2 University of California, Los Angeles
Yiyu Shi*, Wei Yao*, Jinjun Xiong+ and Lei He*
Under a Concurrent and Hierarchical Scheme
Presentation transcript:

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig 1 EECS 527 Paper Presentation High-Performance Global Routing with Fast Overflow Reduction - by Huang-Yu Chen, Chin-Hsiung Hsu, and Yao-Wen Chang Presented by Yaoyu Tao Department Electrical Engineering and Computer Science University of Michigan, Ann Arbor 10/2011

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig EECS 527 Paper Presentation  Outlines  Introduction  Problem Formulation  Routing Methodology Pre-routing Initial Iterative Monotonic Routing Iterative Forbidden-Region Rip-up/Rerouting (IFR) Layer Assignment  Experimental Results  Q & A 2

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Introduction State-of-the-art routing techniques  maze routing A*-search routing pattern routing monotonic routing multi-commodity flow integer linear programming (ILP)  Not clear on their capability to handle the upcoming design challenges 3

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Introduction  State-of-the-art Routers  Based on INR (iterative negotiation-based rip-up/rerouting)  INR becomes the main stream due to its great ability to spread out congestion as well as to reduce the overflow  Lagrange Relaxation (IR) mathematical basis to improve the INR 4

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Introduction  Developments on this paper  New global router - NTUgr, that contains 3 major steps: pre-routing, initial routing and enhanced INR  New techniques in pre-routing Congestion-hotspot historical cost pre-increment Small bounding-box area routing  Enhanced INR Multiple forbidden regions expansion Critical subnets rerouting selection Look-ahead historical cost increment 5

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Problem Formulation  Routing & Goal  Routing region - partitioned into global cells and a 2D or 3D routing graph composed of nodes (global tile nodes) and edges (global edges)  Each global edge is associated with a capacity  Objectives of global routing – Minimize the total overflow  Prioritized order of ISPD’08 Total overflow Maximum overflow Weighted total wire length 6

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Some basics  Global tile node & global edge  Overflow: the amount of routing demand that exceeds the given capacity 7 Global tile node Global edge Tile Tile boundary

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Some basics  State-of-the-art INR  Proposed in PathFinder [McMurchie and Ebeling, FPGA’95]  Spreads the congested wires iteratively  At the (i)-th iteration, the cost of a global edge e:  b e : base cost of using e,  p e : # of nets passing e,  h e (i) : historical cost on e,  INR may get stuck as the number of iterations increases

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology – Flow Chart  Flow of the global router  Three new techniques  2 nd place of ISPD 2008 Global Routing Contest 9 3D Benchmark 3D  2D capacity mapping Enhanced 2D routing 2D  3D layer assignment 3D routing result Prerouting Initial Routing Iterative Forbidden-region Rip-up/rerouting (IFR)

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology - Prerouting 10 3D Benchmark 3D  2D capacity mapping Enhanced 2D routing 2D  3D layer assignment 3D routing result PreroutingPrerouting Initial Routing Iterative Forbidden-region Rip-up/rerouting (IFR)

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology - Prerouting  Pre-routing  Congestion-hotspot Historical Cost Pre-increment Mainly for difficult routing instances, e.g., ISPD’07 newblue3 circuit Handle with high pin density Pre-increment the historical cost h e, in cost function (b e + h e ) ・ p e For ith global edges lying around the high-pin-density tiles before going into the INR procedures Achieve the least overflows for newblue3 ever in the literature  Small Bounding-box Area Routing  First route the subnets with smaller bounding-box areas since they have less flexibility 11

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology – Initial Iterative Monotonic Routing 12 3D Benchmark 3D  2D capacity mapping Enhanced 2D routing 2D  3D layer assignment 3D routing result Prerouting Initial Routing Iterative Forbidden-region Rip-up/rerouting (IFR)

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology – Initial Iterative Monotonic Routing  Initial Iterative Monotonic Routing  First stage that completes all subnets in the whole chip  Monotonic paths: this stage stops when the overflow reduction at the (i+1)-th iteration is less than 5% from the i-th iteration 13

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology - IFR 14 3D Benchmark 3D  2D capacity mapping Enhanced 2D routing 2D  3D layer assignment 3D routing result Prerouting Initial Routing Iterative Forbidden-region Rip-up/rerouting (IFR)

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology - IFR 15 

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology - IFR  Modified p e in IFR  P e is usually set as d e /c e  In proposed IFR  Multiple Forbidden-Regions Expansion  Identify the congested regions, called multiple forbidden regions expansion  Forbidden Region: Introducing overflows in this region is almost forbidden, or it would incur huge cost penalty  Three phases for multiple forbidden regions construction in IFR 16

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology - IFR 17 

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology - IFR  Multiple Forbidden-region Construction at the first phase 18

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology - IFR  Second Phase  Invoked when the number of overflows in the first phase stops decreasing and gets stuck at local optimal solution  New technique: Region Propagation Leveling (RPL)  “Inherit” all forbidden regions at the previous iterations and then expands these forbidden regions simultaneously  Avoids the local optima solutions in the first phase 19

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology - IFR  Effects of RPL 20

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology - IFR  Third Phase  Starts when the current number of overflows is less than 0.5% of the total overflows after the initial iterative monotonic routing  Final expansion – expand the forbidden region to the entire routing graph to quickly reduce the remaining overflows  Why? Because INR becomes less effective in reducing the overflows when the total overflows become smaller 21

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology - IFR  Comparisons of Congested Regions 22 BoxRouterNTHU-Route 1.0NTUgr (Ours) TerminologyBox Congested region Forbidden region ShapeRectangular Rectilinear # of regionsSingle boxSingle regionMultiple regions Objective Performing progressive ILP Selecting rerouting nets Performing different cost functions Simultaneou s expansion No Yes

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology – Critical Subnets Rerouting Selection  New speed-up scheme into IFR – Critical Subnets rerouting selection  Reduce the number of rerouting subnets in each iteration  Iterative rip-up/rerouting takes the most run-time, thus the key for speed-up is to reduce the number of rerouting subnets in each iteration  Only critical subnets are ripped-up and rerouted in IFR  Criterion for a critical subnet n, S is a constant and e is a global edge passed by n  In this paper, S is set to be -1 and obtain about 1.21x speedup 23

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology – Look-ahead Historical Cost Increment  Recall: main advantage of INR is the great ability to spread out congestion (overflows)  h e Update scheme for most state-of-the-art routers, where K is a constant, and c e and d e represent the capacity and demand of e  Conventional algorithm gets stuck in local optima solutions  Why? Because it performs less effective as the number of iterations increases and cannot minimize the overflows but just exchange the overflow regions instead 24

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Routing Methodology – Look-ahead Historical Cost Increment & Layer Assignment  New updating scheme  N > o represents a positive integer  Not only increase the historical cost on the global edges with overflows, but also on those near-overflow global edges  In this paper, N = K = 1, improves the quality of the router  Layer Assignment: 2-D to 3-D in non-decreasing order of their wire length  In this paper, layer assignment prefers the wire and via sharing 25

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Experimental Results  NTUgr in C++ on 2.8GHz AMD Opteron Linux workstation with 12 GB memory  Comparisons with state-of-the-art global routers in literature  ISPD’07 and ISPD’08  Proposed NTUgr obtains the best routing solutions for the most difficult instance, newblue3 (with only overflows) and newblue4 (with only 142 overflows)  High-quality results for the ISPD’07 and ISPD’08 benchmarks for both overflow and runtime 26

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Experimental Results  Achieved 1.94X speed up and better overflow reduction with similar total wire length 27

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Experimental Results  3-D ISPD’07 Results 28

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig Experimental Results  3-D ISPD’08 Results 29 The best solution in the literature!

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig EECS 527 Paper Presentation Thanks! Q & A 30