Satisfiability modulo the Theory of Bit Vectors

Slides:

Advertisements

Similar presentations

TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST

Advertisements

A Randomized Satisfiability Procedure for Arithmetic and Uninterpreted Function Symbols Sumit Gulwani George Necula EECS Department University of California,

Path-Sensitive Analysis for Linear Arithmetic and Uninterpreted Functions SAS 2004 Sumit Gulwani George Necula EECS Department University of California,

- 1 - Using an SMT Solver and Craig Interpolation to Detect and Remove Redundant Linear Constraints in Representations of Non-Convex Polyhedra Christoph.

SMELS: Sat Modulo Equality with Lazy Superposition Christopher Lynch – Clarkson Duc-Khanh Tran - MPI.

Constraint Satisfaction Problems

Automated Theorem Proving Lecture 1. Program verification is undecidable! Given program P and specification S, does P satisfy S?

From Propositional SAT to SMT

Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.

ALGEBRAIC EXPRESSIONS

DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.

ADDING INTEGERS 1. POS. + POS. = POS. 2. NEG. + NEG. = NEG. 3. POS. + NEG. OR NEG. + POS. SUBTRACT TAKE SIGN OF BIGGER ABSOLUTE VALUE.

MULTIPLICATION EQUATIONS 1. SOLVE FOR X 3. WHAT EVER YOU DO TO ONE SIDE YOU HAVE TO DO TO THE OTHER 2. DIVIDE BY THE NUMBER IN FRONT OF THE VARIABLE.

SUBTRACTING INTEGERS 1. CHANGE THE SUBTRACTION SIGN TO ADDITION

MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.

Tintu David Joy. Agenda Motivation Better Verification Through Symmetry-basic idea Structural Symmetry and Multiprocessor Systems Mur ϕ verification system.

Syntax-Guided Synthesis Rajeev Alur Joint work with R.Bodik, G.Juniwal, M.Martin, M.Raghothaman, S.Seshia, R.Singh, A.Solar-Lezama, E.Torlak, A.Udupa 1.

© S Haughton more than 3?

1 Directed Depth First Search Adjacency Lists A: F G B: A H C: A D D: C F E: C D G F: E: G: : H: B: I: H: F A B C G D E H I.

Past Tense Probe. Past Tense Probe Past Tense Probe – Practice 1.

Satisfiability Modulo Theories and Network Verification Nikolaj Bjørner Microsoft Research Formal Methods and Networks Summer School Ithaca, June

Addition 1’s to 20.

25 seconds left…...

Test B, 100 Subtraction Facts

Complexity ©D.Moshkovits 1 Where Can We Draw The Line? On the Hardness of Satisfiability Problems.

Local Search Jim Little UBC CS 322 – CSP October 3, 2014 Textbook §4.8

Chapter 11 Limitations of Algorithm Power Copyright © 2007 Pearson Addison-Wesley. All rights reserved.

SAT Solver CS 680 Formal Methods Jeremy Johnson. 2 Disjunctive Normal Form  A Boolean expression is a Boolean function  Any Boolean function can be.

50.530: Software Engineering

Satisfiability Modulo Theories (An introduction)

SMT Solvers (an extension of SAT) Kenneth Roe. Slide thanks to C. Barrett & S. A. Seshia, ICCAD 2009 Tutorial 2 Boolean Satisfiability (SAT) ⋁ ⋀ ¬ ⋁ ⋀

Methods of Proof Chapter 7, second half.. Proof methods Proof methods divide into (roughly) two kinds: Application of inference rules: Legitimate (sound)

Effective Propositional Reasoning CSE 473 – Autumn 2003.

Panel on Decision Procedures Panel on Decision Procedures Randal E. Bryant Lintao Zhang Nils Klarlund Harald Ruess Sergey Berezin Rajeev Joshi.

Nikolaj Bjørner Microsoft Research Lecture 3. DayTopicsLab 1Overview of SMT and applications. SAT solving, Z3 Encoding combinatorial problems with Z3.

1 Satisfiability Modulo Theories Sinan Hanay. 2 Boolean Satisfiability (SAT) Is there an assignment to the p 1, p 2, …, p n variables such that  evaluates.

1 Predicate Abstraction of ANSI-C Programs using SAT Edmund Clarke Daniel Kroening Natalia Sharygina Karen Yorav (modified by Zaher Andraus for presentation.

1 Deciding separation formulas with SAT Ofer Strichman Sanjit A. Seshia Randal E. Bryant School of Computer Science, Carnegie Mellon University.

Presented by Ed Clarke Slides borrowed from P. Chauhan and C. Bartzis

1 Quantified Formulas Acknowledgement: QBF slides borrowed from S. Malik.

SAT-Based Decision Procedures for Subsets of First-Order Logic

GRASP SAT solver Presented by Constantinos Bartzis Slides borrowed from Pankaj Chauhan J. Marques-Silva and K. Sakallah.

1 A theory-based decision heuristic for DPLL(T) Dan Goldwasser Ofer Strichman Shai Fine Haifa university TechnionIBM-HRL.

GRASP: A Search Algorithm for Propositional Satisfiability EE878C Homework #2 2002/11/1 KAIST, EECS ICS Lab Lee, Dongsoo.

Daniel Kroening and Ofer Strichman Decision Procedure

272: Software Engineering Fall 2012 Instructor: Tevfik Bultan Lecture 4: SMT-based Bounded Model Checking of Concurrent Software.

Logics for Data and Knowledge Representation Propositional Logic: Reasoning Originally by Alessandro Agostini and Fausto Giunchiglia Modified by Fausto.

Boolean Satisfiability and SAT Solvers

CMU, Oct 4 DPLL-based Checkers for Satisfiability Modulo Theories Cesare Tinelli Department of Computer Science The University of Iowa Joint work with.

SAT and SMT solvers Ayrat Khalimov (based on Georg Hofferek‘s slides) AKDV 2014.

Explorations in Artificial Intelligence Prof. Carla P. Gomes Module 3 Logic Representations (Part 2)

Solvers for the Problem of Boolean Satisfiability (SAT) Will Klieber Aug 31, 2011 TexPoint fonts used in EMF. Read the TexPoint manual before you.

Introduction to Satisfiability Modulo Theories

Explorations in Artificial Intelligence Prof. Carla P. Gomes Module Logic Representations.

Nikolaj Bjørner Microsoft Research DTU Winter course January 2 nd 2012 Organized by Flemming Nielson & Hanne Riis Nielson.

1 A framework for eager encoding Daniel Kroening ETH, Switzerland Ofer Strichman Technion, Israel (Executive summary) (submitted to: Formal Aspects of.

1 Computing Abstractions by integrating BDDs and SMT Solvers Alessandro Cimatti Fondazione Bruno Kessler, Trento, Italy Joint work with R. Cavada, A. Franzen,

Satisfiability Modulo Theories and DPLL(T) Andrew Reynolds March 18, 2015.

SS 2017 Software Verification Bounded Model Checking, Outlook

EA C461 – Artificial Intelligence Logical Agent

Lazy Proofs for DPLL(T)-Based SMT Solvers

8/2/2018 4:21 AM Lifting SAT to richer theories: bit-vectors, finite bases, and theory combination Christoph M. Wintersteiger © 2013 Microsoft Corporation.

Solving Linear Arithmetic with SAT-based MC

Satisfiability Modulo Theories

Lifting Propositional Interpolants to the Word-Level

Logics for Data and Knowledge Representation

A Progressive Approach for Satisfiability Modulo Theories

Presentation transcript:

Satisfiability modulo the Theory of Bit Vectors Alessandro Cimatti IRST, Trento, Italy cimatti@irst.itc.it Joint work with R. Bruttomesso, A. Franzen, A. Griggio, R. Sebastiani We gratefully acknowledge support from the Academic Research Program of Intel

Index of the talk Satisfiability Modulo Theory The theory of Bit Vectors Satisfiability Modulo BV Bit blasting Eager encoding into Linear Integer Arithmetic A lazy approach Conclusions ( A preview of QF_UFBV32 at SMT-COMP )

SMT in a nutshell Satisfiability Modulo Theory or: beyond boolean SAT Decide the satisfiability of a first order formula with respect to a background theory Examples of relevant theories uninterpreted functions: x=y & f(x) != f(y) difference logic: x – y < 7 linear arithmetic: 3x + 2y < 12 arrays: read(write(M, a0, v0) a1) their combinations bit vectors

Why SMT From SAT-based to SMT-based verification Representation of interesting problems timed automata hybrid automata pipelines software Efficient solving leverage availability of structural information hopefully retaining efficiency of boolean SAT

Satisfiability Modulo Theory is there a truth-assignment to boolean variables and a valuation to individual variables such that formula evaluates to true? Standard semantics for FOL Assignment to individual variables Induces truth values to atoms Truth assignment to boolean atoms Induced value to whole formula

Propositional structure + - + + - - + - + + - - TA TA TA TA P P P x y z w x x y z w x

Two Main Approaches to SMT the eager approach the lazy approach theory independent view theory specific view

Eager Approach to SMT Main idea: compilation to SAT STEP1: Theory part compiled to equisatisfiable pure SAT problem STEP2: run propositional SAT solver

Propositional structure TA P P P x y z w x x y z w x

Propositional structure Lifted theory Propositional structure TA TA TA TA P P P

The Lazy approach Ingredients a boolean SAT solver a theory solver The boolean solver is modified to enumerate boolean (partial) models The theory solver is used to Check for theory consistency

Propositional structure TA TA TA TA TA TA P P P TA TA x y z w x x y z w x

MathSAT: intuitions Two ingredients: boolean search and theory reasoning find boolean model theory atoms treated as boolean atoms truth values to boolean and theory atoms model propositionally satisfies the formula check consistency wrt theory set of constraints induced by truth values to theory atoms existence of values to theory variables Example: (P v (x = 3)) & (Q v (x – y < 1)) & (y < 2) & (P xor Q) Boolean model !P, (x = 3), Q, (x – y < 1), (y < 2) Check (x = 3), (x – y < 1), (y < 2) Theory contradiction! Another boolean model P , !(x = 3) , !Q, (x – y < 1), (y < 2) Check !(x = 3), (x – y < 1), (y < 2) Consistent: e.g. x := 0, y := 0

Boolean SAT: search space Q Q R S S T S T R R     T SAT!  The DPLL procedure Incremental construction of satisfying assignment Backtrack/backjump on conflict Learn reason for conflict Splitting heuristics

MathSAT: approach DPLL-based enumeration of boolean models Retain all propositional optimizations Conflict-directed backjumping, learning No overhead if no theory reasoning Tight integration between boolean reasoning and theory reasoning

MathSAT: search space Many boolean models are not theory consistent! P Q Q R S S T S T R R Bool  Bool T Math  Bool T Math  Bool  Bool T Math T SAT! Bool  Many boolean models are not theory consistent!

Early pruning Check theory consistency of partial assignments P Q S T EP:Math  EP:Math T Q EP:Math T S Pruned away in the EP step EP:Math T T EP:Math T R Bool  Bool T Math T SAT!

THEORY OF FIXED-WIDTH BIT VECTORS

Bit Vectors: Example input a, b, c, d : reg[N]; LTmp0 = a; LTmp1 = 2 * b; LTmp2 = LTmp0 + LTmp1; LTmp3 = 4 * c; LTmp4 = LTmp2 + LTmp3; LTmp5 = 8 * d; LOut = LTmp4 + LTmp5; Are they equivalent? ((a + 2b) + 4c) + 8d RTmp0 = d; RTmp1 = RTmp0 << 1; RTmp2 = c + RTmp1; RTmp3 = RTmp2 << 1; RTmp4 = b + RTmp3; RTmp5 = RTmp4 << 1; ROut = a + RTmp5; a + ((b + ((c + (d<<1)) <<1)) <<1) I.e. LOut = ROut ?

Fixed Width Bit Vectors Constants 0b00001111, 0xFFFF, … Variables valued over BitVectors of corresponding width implicit restriction to finite domain Function symbols selection: x[15:0] concatenation: y :: z bitwise operators: x && y, z || w, … arithmetic operators: x + y, z * w, … shifting: x << 2, y >> 3 Predicate symbols comparators: =, ≠ , > , < , ≥ , ≤

Fragments of BV theory Core Bitwise operators Arithmetic operators selection concatenation Bitwise operators x && y, x || y, x ^ y Arithmetic operators x +y, x – y, c * x Core + Bitwise + Arithmetic Complexity of equality between BV terms Core is in P Core + B + A in NP Variable width bit vectors: not covered here core is in NP small additions yield undecidability

Decision procedures for BV Many approaches Cyrluk, Moeller, Ruess Moeller, Ruess Bjørner, Pichora Barrett, Dill, Levitt Focus on deciding conjunctions of literals Emphasis on proof obligations in ITP some emphasis on variable width, generic wrt N Shostak-style integration canonization solving

SATISFIABILITY MODULO THEORY OF BIT VECTORS

Satisfiability modulo Bit Vectors Applications of interest RTL hardware descriptions essentially bit vectors assembly-level programs software with finite precision arithmetic Key feature combination of control flow and data flow In principle, boolean logic can be encoded into BV control (boolean logic) encoded into width 1 BVs. Likely inefficient in comparison to SAT More natural to keep them separate at modeling structural info can be exploited for verification

Approaches to SMT(BV) Bit blasting Eager Encoding into LA Lazy approach

SMT(BV) via Bit Blasting

SMT(BV) via Bit Blasting Boolean variables: untouched Bit vector variables as collections of (unrelated) boolean variables [x0, x1, …, x63] Selection/concatenations are trivial static detection Equalities / Assignments: x = y (x0 <-> y0) & (x1 <-> y1) & … & (x63 <-> y63) Bitwise operators: x && y [x0 & y0, x1 & y1, …, x63 & y63] Arithmetic operators: x + y BVADD([x0, …, x63], [y0, …, y63])

Comparison of Data Paths input a, b, c, d : reg[N]; LTmp0 = a; LTmp1 = 2 * b; LTmp2 = LTmp0 + LTmp1; LTmp3 = 4 * c; LTmp4 = LTmp2 + LTmp3; LTmp5 = 8 * d; LOut = LTmp4 + LTmp5; Are they equivalent? ((a + 2b) + 4c) + 8d RTmp0 = d; RTmp1 = RTmp0 << 1; RTmp2 = c + RTmp1; RTmp3 = RTmp2 << 1; RTmp4 = b + RTmp3; RTmp5 = RTmp4 << 1; ROut = a + RTmp5; a + ((b + ((c + (d<<1)) <<1)) <<1) I.e. LOut = ROut ?

Scalability with respect to N??? Bit Blasting Words a,b,c,d,… blasted to [a1,…aN], [b1,…bN], [c1,…cN], [d1,…dN], … LTmp6 != RTmp6 (LOut.1 != ROut.1) or … or (LOut.N != ROut.N) LTmp1 = 2 * b formula in 2N vars, conjunction of N iffs LTmp2 = LTmp0 + LTmp1 formula relating 3N vars possibly additional vars required (e.g. carries) N = 16 bits? 13 secs N = 32 bits? 170 secs “But obviously N = 64 bits!” stopped after 2h CPU time Scalability with respect to N???

Bit-Blasting: Pros and Conses Bottlenecks dependency on word width “wrong” level of abstraction boolean synthesis of arithmetic circuits assignments are pervasive conflicts are very fine grained e.g. discover x < y Advantages let the SAT solver do all the work and nowadays SAT solvers are tough nuts to crack amalgamation of the decision process no distinction between control and data conflicts can be as fine grained as possible built-in capability to generate “new atoms”

Enhancements to BitBlasting Tuning SAT solver on structural information e.g. splitting heuristic for adders Preprocessing + SAT [GBD05] rewrite and normalize bit vector terms bit blasting to SAT

SMT(BV) via reduction to SMT(LA)

From BV to LIA RTL-Datapath Verification using Integer Linear Programming [BD01] BV constants as integers 0b32_1111 as 15 BV variables as integer valued variables, with range constraints reg x [31:0] as x in range [0, 2^32) Assignments treated as equality, e.g. x = y Arithmetic, e.g. z = x + y Linear arithmetic? not quite! BV Arithmetic is modulo 2^N z = x + y - 2^N s, with z in [0, 2^N) Concatenation: x :: y as 2^n x + y Selection: relational encoding (based on integrity) x[23:16] as xm, where x = 2^24 xh + 2^16 xm + xl, xl in [0, 2^16), xm in [0, 2^8), xl in [0, 2^8) Bitwise operators based on selection of individual bits SOLVER the omega test

From SMT(BV) into SMT(LIA) Generalizes [BD01] to deal with boolean structure Eager encoding into SMT(LIA) Unfortunately, not very efficient More precisely, a failure

Retrospective Analysis Crazy approach? Arithmetic Linear arithmetic? not quite! BV Arithmetic is modulo 2^N Selection and Concatenation an easy problem becomes expensive! Bitwise operators HARD!!! Available solvers not adequate integers with infinite precision reasoning with integers may be hard (e.g. BnB within real relaxation) Functional dependencies are lost! A clear culprit: static encoding depending on control flow, same signal is split in different parts z = if P then x[7:0] :: y[3:0] else x[5:2] :: y[10:3] x, y and also z are split more than needed the notion of “maximal chunk” depends on P !!!

SMT(BV) via online BV reasoning

A lazy approach Based on standard MathSAT schema DPLL-based model enumeation Dedicated Solver for Bit vectors The encoding leverages information resulting from decisions Given values to control variables, the data path is easier to deal with (e.g. maximal chunks are bigger) Layering in the theory solver equality reasoning limited simplification rules full blown bit vector solver only at the end

The architecture Boolean enumeration BV solver EUF reasoning BV rewriter LIA encoding

Rewriting rules evaluation of constant terms rules for equality 0b8_01010101[4:2] becomes 0b3_101 rules for equality x = y and Phi(x) becomes Phi(y) based on congruence closure splitting concatenations (x :: y) = z becomes x = z[h_n] && y == z[l_n]

Rewriting rules pushing selections “pigeon-hole” rules (x && y)[7:0] becomes (x[7:0] && y[7:0]) (x :: y)[23:8] becomes (x[7:0] :: y[15:8]) “pigeon-hole” rules from (x != 0 & x != 1 & x != 2 & x < 3) derive false

BV rewriter Rules are applied until fix point reached contradiction found Implementation based on EUF reasoner rules as merges between eq classes Open issues incrementality/backtrackability selective rule activation conflic set reconstruction When it fails …

LIA encoding (the last hope) idenfication of maximal slices “purification”: separating out arithmetic and BW by introduction of additional variables NB: on resulting problems LIA encoding always superior to bit blasting!!! cfr [DB01]

Status of Implementation Implementation still in prototypical state “Does a lot of stupid things” conflict minimization by deletion filtering checking that conflict are in fact minimal unnecessary calls to LA for SAT clusters calling LA solver implemented as dump on file, and run external MathSAT huge conflict sets

A very very preliminary evaluation

Competitors Run against MiniSAT 1.14 KEY REMARK: ~ winner of SAT competition in 2005 KEY REMARK: boolean methods are very mature A good reason for giving up?

Test benches 74 benchmarks from industrial partner Unfortunately would have been ideal for SMT-COMP QF_UFBV32 Unfortunately can not be disclosed “will have to be destroyed after the collaboration” hopefully our lives will be spared 

Conclusions A “market need” for SMT(BV) solvers Bit Blasting: tough competitors After a failure, … Preliminary results are encouraging Future challenges optimize BV solver better conflict sets tackle some RTL verification cases extension to memories

A small digression on QF_UFBV32 at SMT-COMP

QF_UFBV[32] at SMT-COMP the MathSAT you will see there IS NOT the one I described We currently have no results for QF_UFBV Easy benchmarks: QF_UFBV[32] not particularly “SMT” the boolean component is nearly missing the BV part is “easily” solvable by bit blasting We entered SMT-COMP QF_UFBV32 MathSAT based on BIT BLASTING to SAT NuSMV based on bit blasting to BDDs

QF_UFBV: Bit Blasting to SAT Preprocessing based on Ackerman’s elimination of function symbols rewriting simplification bit blasting Core: call SAT solver underlying MathSAT every SAT problem in < 0.3 secs most UNSAT within seconds a handful of hard ones between 300 and 500 secs

BDDs (???) on SMT-COMP tests Even NuSMV entered SMT-COMP Ackerman’s elimination of functional symbols Rewriting preprocessor Core solver based on BDDs conjunctively partitioned problem structural BDD-based ordering (bit interleaving) (almost) no dynamic reordering affinity-based clustering, threshold 100 nodes early quantification Seems to work well both on SAT and UNSAT instances

RESULTS first STP then YICES then NuSMV then CVC3 (but no results on two samples) then MathSAT BITBLASTING 3rd on SAT last on UNSAT

SAT instances

UNSAT instances