A Performance Study of BDD-Based Model Checking Bwolen Yang Randal E. Bryant, David R. O’Hallaron, Armin Biere, Olivier Coudert, Geert Janssen Rajeev K.

Slides:

Advertisements

Similar presentations

Model Checking Lecture 4. Outline 1 Specifications: logic vs. automata, linear vs. branching, safety vs. liveness 2 Graph algorithms for model checking.

Advertisements

Representing Boolean Functions for Symbolic Model Checking Supratik Chakraborty IIT Bombay.

Hardware and Petri nets Symbolic methods for analysis and verification.

Planning based on Model Checking Dept. of Information Systems and Applied CS Bamberg University Seminar Paper Svetlana Balinova.

ECE Synthesis & Verification - L271 ECE 697B (667) Spring 2006 Synthesis and Verification of Digital Systems Model Checking basics.

Introduction to Algorithms Rabie A. Ramadan rabieramadan.org 2 Some of the sides are exported from different sources.

CS357 Lecture: BDD basics David Dill 1. 2 BDDs (Boolean/binary decision diagrams) BDDs are a very successful representation for Boolean functions. A BDD.

Efficient Reachability Analysis for Verification of Asynchronous Systems Nishant Sinha.

SYMBOLIC MODEL CHECKING: STATES AND BEYOND J.R. Burch E.M. Clarke K.L. McMillan D. L. Dill L. J. Hwang Presented by Rehana Begam.

Fundamentals of Python: From First Programs Through Data Structures

Class Presentation on Binary Moment Diagrams by Krishna Chillara Base Paper: “Verification of Arithmetic Circuits using Binary Moment Diagrams” by.

© 2011 Carnegie Mellon University Binary Decision Diagrams Part Bug Catching: Automated Program Verification and Testing Sagar Chaki September.

ECE 667 Synthesis & Verification - Boolean Functions 1 ECE 667 Spring 2013 ECE 667 Spring 2013 Synthesis and Verification of Digital Circuits Boolean Functions.

DATE-2002TED1 Taylor Expansion Diagrams: A Compact Canonical Representation for Symbolic Verification M. Ciesielski, P. Kalla, Z. Zeng B. Rouzeyre Electrical.

Chapter 3: The Efficiency of Algorithms Invitation to Computer Science, C++ Version, Third Edition Additions by Shannon Steinfadt SP’05.

Binary Decision Diagrams and Symbolic Model Checking Randy Bryant CMU Ed Clarke CMU Ken McMillan Cadence Allen Emerson U.

SAT-Based Decision Procedures for Subsets of First-Order Logic

A Performance Study of BDD-Based Model Checking Bwolen Yang Randal E. Bryant, David R. O’Hallaron, Armin Biere, Olivier Coudert, Geert Janssen Rajeev K.

ECE Synthesis & Verification - Lecture 18 1 ECE 697B (667) Spring 2006 ECE 697B (667) Spring 2006 Synthesis and Verification of Digital Systems Word-level.

High-Performance Simulation of Quantum Computation using QuIDDs George F. Viamontes, Manoj Rajagopalan, Igor L. Markov, and John P. Hayes Advanced Computer.

Boolean Functions and their Representations

Sanjit A. Seshia and Randal E. Bryant Computer Science Department

Computation Engines: BDDs and SAT (part 2) 290N: The Unknown Component Problem Lecture 8.

Chapter 3: The Efficiency of Algorithms Invitation to Computer Science, C++ Version, Fourth Edition.

Symbolic Boolean Manipulation with Ordered Binary Decision Diagrams Carnegie Mellon University Randal E. Bryant.

Taylor Expansion Diagrams (TED): Verification EC667: Synthesis and Verification of Digital Systems Spring 2011 Presented by: Sudhan.

Decision Structures Truth TableDecision Tree –Vertex represents decision –Follow green (dashed) line for value 0 –Follow red (solid) line for value 1 –Function.

01/27/2005 Combinationality of cyclic definitions EECS 290A – Spring 2005 UC Berkeley.

ECE Synthesis & Verification - Lecture 10 1 ECE 697B (667) Spring 2006 ECE 697B (667) Spring 2006 Synthesis and Verification of Digital Systems Binary.

 2001 CiesielskiBDD Tutorial1 Decision Diagrams Maciej Ciesielski Electrical & Computer Engineering University of Massachusetts, Amherst, USA

ECE 667 Synthesis & Verification - BDD 1 ECE 667 ECE 667 Synthesis and Verification of Digital Systems Binary Decision Diagrams (BDD)

 2000 M. CiesielskiPTL Synthesis1 Synthesis for Pass Transistor Logic Maciej Ciesielski Dept. of Electrical & Computer Engineering University of Massachusetts,

Computing with Finite Automata (part 2) 290N: The Unknown Component Problem Lecture 10.

Optimizing Symbolic Model Checking for Constraint-Rich Systems Randal E. Bryant Bwolen Yang, Reid Simmons, David R. O’Hallaron Carnegie Mellon University.

Digitaalsüsteemide verifitseerimise kursus1 Formal verification: BDD BDDs applied in equivalence checking.

Athens 2004 Symposium on Innovation of Computer Science Curriculum in Higher Education Athens 2004 Decision Diagrams: Principles of Programming Dragan.

1 Introduction to SMV and Model Checking Mostly by: Ken McMillan Cadence Berkeley Labs Small parts by: Brandon Eames ISIS/Vanderbilt.

Identifying Reversible Functions From an ROBDD Adam MacDonald.

Binary Decision Diagrams (BDDs)

November,2000University of Southern California1 Introduction to Binary Decision Diagrams - Shesha Shayee K. Raghunathan.

Model Checking Lecture 4 Tom Henzinger. Model-Checking Problem I |= S System modelSystem property.

Electrical and Computer Engineering Archana Rengaraj ABC Logic Synthesis basics ECE 667 Synthesis and Verification of Digital Systems Spring 2011.

CS 267: Automated Verification Lecture 6: Binary Decision Diagrams Instructor: Tevfik Bultan.

Week 10Complexity of Algorithms1 Hard Computational Problems Some computational problems are hard Despite a numerous attempts we do not know any efficient.

Algorithmic Software Verification V &VI. Binary decision diagrams.

Daniel Kroening and Ofer Strichman 1 Decision Procedures An Algorithmic Point of View BDDs.

CAS 721 Course Project Implementing Branch and Bound, and Tabu search for combinatorial computing problem By Ho Fai Ko ( )

Lecture 3: Uninformed Search

Agenda Review: –Planar Graphs Lecture Content:  Concepts of Trees  Spanning Trees  Binary Trees Exercise.

On the Relation between SAT and BDDs for Equivalence Checking Sherief Reda Rolf Drechsler Alex Orailoglu Computer Science & Engineering Dept. University.

Daniel Kroening and Ofer Strichman 1 Decision Procedures An Algorithmic Point of View BDDs.

Verification & Validation By: Amir Masoud Gharehbaghi

Verifying Programs with BDDs Topics Representing Boolean functions with Binary Decision Diagrams Application to program verification class-bdd.ppt

Optimization Problems

Ricochet Robots Mitch Powell Daniel Tilgner. Abstract Ricochet robots is a board game created in Germany in A player is given 30 seconds to find.

BDDs1 Binary Tree Representation The recursive Shannon expansion corresponds to a binary tree Example: Each path from the root to a leaf corresponds to.

To Split or to Conjoin: The Question in Image Computation 1 {mooni, University of Colorado at Boulder 2 Synopsys.

1 Alan Mishchenko Research Update June-September 2008.

Binary Decision Diagrams Prof. Shobha Vasudevan ECE, UIUC ECE 462.

Chapter 11. Chapter Summary  Introduction to trees (11.1)  Application of trees (11.2)  Tree traversal (11.3)  Spanning trees (11.4)

ECE 667 Synthesis and Verification of Digital Systems

Objective of This Course

Chapter 3: The Efficiency of Algorithms

Binary Decision Diagrams

Chapter 3: The Efficiency of Algorithms

Binary Decision Diagrams

Discrete Controller Synthesis

Binary Decision Diagrams

Example: Verification

Verifying Programs with BDDs Sept. 22, 2006

Presentation transcript:

A Performance Study of BDD-Based Model Checking Bwolen Yang Randal E. Bryant, David R. O’Hallaron, Armin Biere, Olivier Coudert, Geert Janssen Rajeev K. Ranjan, Fabio Somenzi CS740 Dec. 3, 1998 Special Presentation of

Outline BDD Background Data structure Algorithms Organization of this Study participants, benchmarks, evaluation process BDD Evaluation Methodology evaluation platform metrics Experimental Results performance improvements characterizations of MC computations

Boolean Manipulation with OBDDs Ordered Binary Decision Diagrams Data structure for representing Boolean functions Efficient for many functions found in digital designs Canonical representation Example: (x 1  x 2 ) &x 3 Nodes represent variable tests Branches represent variable values Dashed for value 0 Solid for value 1

Example OBDDs Constants Unique unsatisfiable function Unique tautology Variable Treat variable as function Odd Parity Linear representation Typical Function (x 1  x 2 ) &x 4 No vertex labeled x 3 independent of x 3 Many subgraphs shared

Symbolic Manipulation with OBDDs Strategy Represent data as set of OBDDs Identical variable orderings Express solution method as sequence of symbolic operations Implement each operation by OBDD manipulation Information always maintained in reduced, canonical form Algorithmic Properties Arguments are OBDDs with identical variable orderings. Result is OBDD with same ordering.  “Closure Property” Treat as Abstract Data Type User not concerned with underlying representation

If-Then-Else Operation Arguments I, T, E Functions over variables X Represented as OBDDs Result OBDD representing composite function I  T + ¬I  E Concept Apply Boolean choice operation to 3 argument functions Implementation Combination of depth-first traversal and dynamic programming. Maintain computed cache of previously encountered argument / result combinations Worst case complexity product of argument graph sizes.

And(F, G ) Or(F, G ) If-Then-Else(F, G, 0) If-Then-Else(F, 1, G ) Derived Algebraic Operations Other common operations can be expressed in terms of If-Then-Else

Generating OBDD from Network NetworkEvaluation Task: Represent output functions of gate network as OBDDs. Resulting Graphs A  new_var ("a"); B  new_var ("b"); C  new_var ("c"); T1  And (A, B); T2  And (B, C); O1  Or (T1, T2); A B C T1 T2 O1 ABC T1T2 O1 01 a 01 c 01 b 01 b a 01 c b c b 01 b a

Checking Network Equivalence Alternate NetworkEvaluation Resulting Graphs Determine: Do 2 networks compute same Boolean function? Method: Compute OBDDs for both networks and compare A B C T3 O2 ABC T3 O2 c b 01 b a 01 ac b 01 a c T3  Or (A, C); O2  And (T3, B); if (O2 == O1) then Equivalent else Different

Symbolic FSM Representation Nondeterministic FSM Symbolic Representation Represent set of transitions as function  (x, o, n) Yields 1 if input x can cause transition from state o to state n. Represent as Boolean function Over variables encoding states and inputs

Reachability Analysis Task Compute set of states reachable from initial state Q0 Represent as Boolean function R(s). Never enumerate states explicitly GivenCompute Initial

Iterative Computation R i +1 – set of states that can be reached i +1 transitions Either in R i or single transition away from some element of R i for some input Continue iterating until R i = R i +1

Restriction Operation Implementation Depth-first traversal. Complexity linear in argument graph size Concept Effect of setting function argument x i to constant k (0 or 1).

Variable Quantification Eliminate dependency on some argument through quantification Same as step used in resolution-based prover Combine with AND for universal quantification.

Multi-Variable Quantification Operation Compute:  X F (X, Y ) XVector of bound variables x 1, …, x n YVector of free variables y 1, …, y m Result: Function of free variables Y only yields 1 if F (X, Y ) would yield 1 for some assignment to variables X Methods Sequentially –  x 1 [  x 2 […  x n [F (X, Y )]…]] Simultaneously, by recursive algorithm over BDD for F Complexity Each quantification can at most square graph size Typically not so bad

Motivation for Studying Symbolic Model Checking (MC) MC is an important part of formal verification digital circuits and other finite state systems BDDs are an enabling technology for MC Not well studied Packages are tuned using combinational circuits (CC) Qualitative differences between CC and MC computations CC: build outputs, constant time equivalence checking MC: build model, many fixed-points to verify the specs CC: BDD algorithms are polynomial If-Then-Else algorithm MC: key BDD algorithms are exponential Multi-variable quantification

BDD Data Structures BDD Multi-rooted DAG Each root denotes different Boolean function Provide automatic memory management Garbage collection based on reference counting Unique Table Provides mapping [x, v 0, v 1 ]  v Required to make sure graph is canonical Computed Cache Provides memoization of argument / results Reduce manipulation algorithms from exponential to polynomial Periodically flush to avoid excessive growth x v0v0 v1v1 v

Node Referencing Maintain Reference Count for Each Node Includes From other BDD nodes From program references “Top-level pointers” Excludes unique table reference computed cache references v Other BDD Nodes      Top-Level Pointers Unique Table Computed Cache           x

Interactions Between Data Structures Dead Nodes Reference Count  0 No references by other nodes or by top-level pointers Decrement reference counts of children Could cause death of entire subgraph Retain invisible references from unique table & computed cache Garbage Collection Eliminate all dead nodes Remove entries from unique table Flush computed cache Rebirth Possible to resurrect node considered dead From hit in unique table Must increment child reference counts Could cause rebirth of subgraph

Organization of this Study: Participants Armin Biere: ABCD Carnegie Mellon / Universität Karlsruhe Olivier Coudert : TiGeR Synopsys / Monterey Design Systems Geert Janssen: EHV Eindhoven University of Technology Rajeev K. Ranjan: CAL Synopsys Fabio Somenzi: CUDD University of Colorado Bwolen Yang: PBF Carnegie Mellon

Organization of this Study: Setup Metrics: 17 statistics Benchmark: 16 SMV execution traces traces of BDD-calls from verification of cache coherence, Tomasulo, phone, reactor, TCAS… size 6 million - 10 billion sub-operations MB of memory Gives 6 * 16 = 96 different cases Evaluation platform: trace driver “drives” BDD packages based on execution trace

Organization of this Study: Evaluation Process Phase 1: no dynamic variable reordering Phase 2: with dynamic variable reordering hypothesize design experiments validate collect stats identify issues suggest improvements validate Iterative Process

BDD Evaluation Methodology Metrics: Time elapsed time (performance) CPU time page fault rate BDD Alg time GC time # of GCs memory usage # of ops (work) computed cache size Challenge Effect of given optimization uncertain E.g., decreasing #GCs would save CPU time, but increase page faults

memory usage # of GCs computed cache size max # of BDD nodes BDD Evaluation Methodology Metrics: Space Time/Space Trade-Offs Cache size GC frequency

Phase 1 Results: Initial / Final Conclusion: collaborative efforts have led to significant performance improvements speedups > 100: : : : 28

Phase 1: Before/After 6 packages * 16 traces = 96 cases Cumulative Speedup Histogram

Phase 1: Hypotheses / Experiments Computed Cache effects of computed cache size number of repeated sub-problems across time Garbage Collection reachable / unreachable Complement Edge Representation work space Memory Locality for Breadth-First Algorithms

Phase 1: Hypotheses / Experiments (Cont’d) For Comparison ISCAS85 combinational circuits (> 5 sec, < 1GB) c2670, c bit, 14-bit multipliers based on c6288 Metric depends only on the trace and BDD algorithms machine-independent implementation-independent

Computed Cache Size Dependency Hypothesis The computed cache is more important for MC than for CC. Experiment Vary the cache size and measure its effects on work. size as a percentage of BDD nodes normalize the result to minimum amount of work necessary; i.e., no GC and complete cache.

Effects of Computed Cache Size # of ops: normalized to the minimum number of operations cache size: % of BDD nodes Conclusion: large cache is important for MC

Computed Cache: Repeated Sub-problems Across Time Source of Speedup increase computed cache size Possible Cause many repeated sub-problems are far apart in time Validation study the number of repeated sub-problems across user issued operations (top-level operations).

Hypothesis: Top-Level Sharing Hypothesis MC computations have a large number of repeated sub-problems across the top-level operations. Experiment measure the minimum number of operations with GC disabled and complete cache. compare this with the same setup, but cache is flushed between top-level operations.

flush: cache flushed between top-level operations min: cache never flushed Conclusion: large cache is more important for MC Results on Top-Level Sharing

Garbage Collection: Rebirth Rate Source of Speedup reduce GC frequency Possible Cause many dead nodes become reachable again (rebirth) GC is delayed until the number of dead nodes reaches a threshold dead nodes are reborn when they are part of the result of new sub- problems

Hypothesis: Rebirth Rate Hypothesis MC computations have very high rebirth rate. Experiment measure the number of deaths and the number of rebirths

Results on Rebirth Rate Conclusions delay garbage collection triggering GC should not be based only on # of dead nodes Just because a lot of nodes are dead doesn’t mean they’re useless delay updating reference counts High cost to kill/resurrect subgraphs

BF BDD Construction On MC traces, breadth-first based BDD construction has no demonstrated advantage over traditional depth-first based techniques. Two packages (CAL and PBF) are BF based.

BF BDD Construction Overview Level-by-Level Access operations on same level (variable) are processed together one queue per level Locality group nodes of the same level together in memory Good memory locality due to BF  # of ops processed per queue visit must be high

Average BF Locality Conclusion: MC traces generally have less BF locality

Average BF Locality / Work Conclusion: For comparable BF locality, MC computations do much more work.

Phase 1: Some Issues / Open Questions Memory Management space-time tradeoff computed cache size / GC frequency resource awareness available physical memory, memory limit, page fault rate Top-Level Sharing possibly the main cause for strong cache dependency high rebirth rate better understanding may lead to better memory management higher level algorithms to exploit the pattern

Phase 2: Dynamic Variable Reordering BDD Packages Used CAL, CUDD, EHV, TiGeR improvements from phase 1 incorporated

Variable Ordering Sensitivity BDD unique for given variable order Ordering can have large effect on size Finding good ordering essential

Dynamic Variable Ordering Rudell, ICCAD ‘93 Concept Variable ordering changes as computation progresses Typical application involves long series of BDD operations Proceeds in background, invisible to user Implementation When approach memory limit, attempt to reduce Garbage collect unneeded nodes Attempt to find better order for variables Simple, greedy reordering heuristics Ongoing improvements

Reordering By Sifting Choose candidate variable Try all positions in variable ordering Repeatedly swap with adjacent variable Move to best position found Best Choices

Swapping Adjacent Variables Localized Effect Add / delete / alter only nodes labeled by swapping variables Do not change any incoming pointers

Dynamic Ordering Characteristics Added to Many BDD Packages Compatible with existing interfaces User need not be aware that it is happening Significant Improvement in Memory Requirement Limiting factor in many applications Reduces need to have user worry about ordering Main cost is in CPU time Acceptable trade-off May run 10X slower Compatible with Other Extensions Now part of “core technology”

Why is Variable Reordering Hard to Study Time-space tradeoff how much time to spent to reduce graph sizes Chaotic behavior e.g., small changes to triggering / termination criteria can have significant performance impact Resource intensive reordering is expensive space of possible orderings is combinatorial Different variable order  different computation e.g., many “don’t-care space” optimization algorithms

BDD Evaluation Methodology Metrics: Time elapsed time (performance) CPU time page fault rate BDD Alg time GC time reordering time # of GCs # of node swaps (reorder cost) memory usage # of ops (work) # of reorderings computed cache size

BDD Evaluation Methodology Metrics: Space memory usage # of GCs reordering time computed cache size max # of BDD nodes

Only CUDD is used Phase 2: Experiments Quality of Variable Order Generated Variable Grouping Heuristic keep strongly related variables adjacent Reorder Transition Relation BDDs for the transition relation are used repeatedly Effects of Initial Variable Order with and without variable reordering

Effects of Initial Variable Order: Perturbation Algorithm Perturbation Parameters (p, d) p: probability that a variable will be perturbed d: perturbation distance Properties in average, p fraction of variables is perturbed max distance moved is 2d (p = 1, d =  )  completely random variable order For each perturbation level (p, d) generate a number (sample size) of variable orders

Effects of Initial Variable Order: Parameters Parameter Values p: (0.1, 0.2, …, 1.0) d: (10, 20, …, 100,  ) sample size: 10 For each trace 1100 orderings 2200 runs (w/ and w/o dynamic reordering)

Effects of Initial Variable Order: Smallest Test Case Base Case (best ordering) time: 13 sec memory: 127 MB Resource Limits on Generated Orders time: 128x base case memory: 500 MB

At 128x/500MB limit, “no reorder” finished 33%, “reorder” finished 90%. Conclusion: dynamic reordering is effective Effects of Initial Variable Order: Result # of unfinished cases

> 4x or > 500Mb Conclusions: For very low perturbation, reordering does not work well. Overall, very few cases get finished.

> 32x or > 500Mb Conclusion: variable reordering worked rather well

Phase 2: Some Issues / Open Questions Computed Cache Flushing cost Effects of Initial Variable Order determine sample size Need New Better Experimental Design

Summary Collaboration + Evaluation Methodology significant performance improvements up to 2 orders of magnitude characterization of MC computation computed cache size garbage collection frequency effects of complement edge BF locality effects of reordering the transition relation effects of initial variable orderings other general results (not mentioned in this talk) issues and open questions for future research

For data and BDD traces used in this study, Conclusions Rigorous quantitative analysis can lead to: dramatic performance improvements better understanding of computational characteristics Adopt the evaluation methodology by: building more benchmark traces for IP issues, BDD-call traces are hard to understand using / improving the proposed metrics for future evaluation