Extensible Semantics for XrML Vicky Weissman Joint work with Joe Halpern.

Slides:

Advertisements

Similar presentations

Completeness and Expressiveness

Advertisements

Working with Discourse Representation Theory Patrick Blackburn & Johan Bos Lecture 3 DRT and Inference.

Justification-based TMSs (JTMS) JTMS utilizes 3 types of nodes, where each node is associated with an assertion: 1.Premises. Their justifications (provided.

Situation Calculus for Action Descriptions We talked about STRIPS representations for actions. Another common representation is called the Situation Calculus.

Hoare’s Correctness Triplets Dijkstra’s Predicate Transformers

Inference and Reasoning. Basic Idea Given a set of statements, does a new statement logically follow from this. For example If an animal has wings and.

Authorization Policies Vicky Weissman

F22H1 Logic and Proof Week 7 Clausal Form and Resolution.

1 Introduction to Computability Theory Lecture15: Reductions Prof. Amos Israeli.

1 Introduction to Computability Theory Lecture12: Reductions Prof. Amos Israeli.

1 Basic abstract interpretation theory. 2 The general idea §a semantics l any definition style, from a denotational definition to a detailed interpreter.

A Formal Foundation for XrML Vicky Weissman Joint work with Joe Halpern.

Lecture 9 Recursive and r.e. language classes

Constraint Logic Programming Ryan Kinworthy. Overview Introduction Logic Programming LP as a constraint programming language Constraint Logic Programming.

Lecture 8 Recursively enumerable (r.e.) languages

A Formal Foundation for XrML Vicky Weissman Joint work with: Joseph Halpern.

Daniel Moran & Marina Yatsina. Access control through encryption.

1 Module 7 Halting Problem –Fundamental program behavior problem –A specific unsolvable problem –Diagonalization technique revisited Proof more complex.

CS 536 Spring Global Optimizations Lecture 23.

1 Lecture 7 Halting Problem –Fundamental program behavior problem –A specific unsolvable problem –Diagonalization technique revisited Proof more complex.

Winter 2004/5Pls – inductive – Catriel Beeri1 Inductive Definitions (our meta-language for specifications)  Examples  Syntax  Semantics  Proof Trees.

Using First-order Logic to Reason about Policies Vicky Weissman Joint work with: Joseph Halpern and Carl Lagoze.

1 Lecture 7 Halting Problem –Fundamental program behavior problem –A specific unsolvable problem –Diagonalization technique revisited Proof more complex.

Copyright © 2006 The McGraw-Hill Companies, Inc. Programming Languages 2nd edition Tucker and Noonan Chapter 18 Program Correctness To treat programming.

CS 4700: Foundations of Artificial Intelligence

A logic for reasoning about digital rights Riccardo Pucella, Vicky Weissman Cornell University.

A Formal Foundation for ODRL What’s ODRL?  An XML-based language for writing software licenses. Language specification includes:  syntax  English interpretation.

Towards a policy language for humans and computers Vicky Weissman Joint work with Carl Lagoze.

Rights management Vicky Weissman

Monadic Predicate Logic is Decidable Boolos et al, Computability and Logic (textbook, 4 th Ed.)

Chapter 2: Algorithm Discovery and Design

The Game of Algebra or The Other Side of Arithmetic The Game of Algebra or The Other Side of Arithmetic © 2007 Herbert I. Gross by Herbert I. Gross & Richard.

CAS LX 502 Semantics 3a. A formalism for meaning (cont ’ d) 3.2, 3.6.

Procedure specifications CSE 331. Outline Satisfying a specification; substitutability Stronger and weaker specifications - Comparing by hand - Comparing.

REFACTORING Lecture 4. Definition Refactoring is a process of changing the internal structure of the program, not affecting its external behavior and.

Chapter 4 Context-Free Languages Copyright © 2011 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1.

Inference is a process of building a proof of a sentence, or put it differently inference is an implementation of the entailment relation between sentences.

Invitation to Computer Science, Java Version, Second Edition.

CAS LX 502 8b. Formal semantics A fragment of English.

1 Inference Rules and Proofs (Z); Program Specification and Verification Inference Rules and Proofs (Z); Program Specification and Verification.

An Algebra for Composing Access Control Policies (2002) Author: PIERO BONATTI, SABRINA DE CAPITANI DI, PIERANGELA SAMARATI Presenter: Siqing Du Date:

Lesson 3 CDT301 – Compiler Theory, Spring 2011 Teacher: Linus Källberg.

Formal Models in AGI Research Pei Wang Temple University Philadelphia, USA.

Pattern-directed inference systems

LOGIC AND ONTOLOGY Both logic and ontology are important areas of philosophy covering large, diverse, and active research projects. These two areas overlap.

Overview Concept Learning Representation Inductive Learning Hypothesis

Programming Languages and Design Lecture 3 Semantic Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.

CSE Winter 2008 Introduction to Program Verification January 31 proofs through simplification.

Great Theoretical Ideas in Computer Science.

Naïve Set Theory. Basic Definitions Naïve set theory is the non-axiomatic treatment of set theory. In the axiomatic treatment, which we will only allude.

1 Reasoning with Infinite stable models Piero A. Bonatti presented by Axel Polleres (IJCAI 2001,

ece 627 intelligent web: ontology and beyond

Fundamentals of Informatics Lecture 13 Reduction Bas Luttik.

CSE 311 Foundations of Computing I Lecture 28 Computability: Other Undecidable Problems Autumn 2011 CSE 3111.

Daniel Kroening and Ofer Strichman 1 Decision Procedures An Algorithmic Point of View Basic Concepts and Background.

Overview of the theory of computation Episode 3 0 Turing machines The traditional concepts of computability, decidability and recursive enumerability.

Metalogic Soundness and Completeness. Two Notions of Logical Consequence Validity: If the premises are true, then the conclusion must be true. Provability:

1 Section 7.1 First-Order Predicate Calculus Predicate calculus studies the internal structure of sentences where subjects are applied to predicates existentially.

Logics for Data and Knowledge Representation ClassL (part 1): syntax and semantics.

COMP 412, FALL Type Systems C OMP 412 Rice University Houston, Texas Fall 2000 Copyright 2000, Robert Cartwright, all rights reserved. Students.

Valid and Invalid Arguments

Parsing & Context-Free Grammars

CS510 Compiler Lecture 4.

ARTIFICIAL INTELLIGENCE

Great Theoretical Ideas in Computer Science

(Slides copied liberally from Ruth Anderson, Hal Perkins and others)

Logics for Data and Knowledge Representation

Programming Languages 2nd edition Tucker and Noonan

CS 583 Fall 2006 Analysis of Algorithms

Programming Languages 2nd edition Tucker and Noonan

Presentation transcript:

Extensible Semantics for XrML Vicky Weissman Joint work with Joe Halpern

The big picture A policy says that under certain conditions an action, such as downloading a file, is permitted or forbidden. Digital content providers want to write policies about how their works may be accessed, and to have their policies enforced.

Diverse apps – same need Because we can’t regulate access to online content with precision: Digital libraries can’t put certain content online; it might violate IP laws. The Greek Orthodox Archdiocese of America is wary of defamation. Cultural traditions aren’t respected. (Australian Aboriginal communities often restrict access to a clan or gender.)

XrML to the rescue XrML is a language for writing policies. Syntax is XML-based. Semantics is given in 2 ways. 1.An English interpretation of the syntax. 2.An English description of an algorithm that says if a set of XrML policies imply a permission. Bottom line: write policies in XrML, enforce using the algorithm.

Industry likes XrML XrML endorsed by Adobe, Hewlett- Packard, Microsoft, Xerox, Barnesandnoble.com, MPEG International Standards Committee… Microsoft and others plan to make XrML-compliant products. Will tomorrow’s OS, DVD player, … enforce XrML policies?

Improving XrML MPEG International Standards Committee generalized some of the concepts and extended the language (slightly). Released their version in March We refer to the 2003 version as XrML. They released another version in 2004, which we discuss later in this talk.

XrML Shortcomings No formal semantics. Policies can be ambiguous. The interpretation of the syntax doesn’t quite match the algorithm. The algorithm’s behavior on some (realistic) input is unintuitive and unintended by language designers. E.g. If Alice is a student and every student may eat lunch, may Alice? Alg. says no.

Improving XrML (cont) Fix the algorithm to match developers’ intent. Translate XrML policies to formulas in modal first-order logic. Formal semantics given in a fairly standard way. Lets us compare XrML with languages in CS literature, borrow complexity results, extensions,… Prove our translation matches the algorithm. Algorithm says policies imply a permission iff translated policies imply translated permission.

XrML syntax XrML is an XML-based language. XrML policies are verbose. We present a syntax that is more concise and easy to map to XrML syntax.

Syntax Principals Agents (e.g., Alice, Bob) Resources Digital content (e.g., a movie, an article) Rights Actions (e.g., play, edit) Properties Describe a principal (e.g., adult, trusted)

Syntax (cont.) Statements Stmt ::= Pr(p) | Perm(p, r, s) | true Pr(p) means principal p has property Pr. Perm(p, r, s) means p is permitted to exercise right r over resource s.

Syntax (cont.) Statements Stmt ::= Pr(p) | Perm(p, r, s) | true Pr(p) means principal p has property Pr. Perm(p, r, s) means p is permitted to exercise right r over resource s.

Syntax (cont.) Statements Stmt ::= Pr(p) | Perm(p, r, s) | true Pr(p) means principal p has property Pr. Perm(p, r, s) means p is permitted to exercise right r over resource s.

Syntax (cont.) Statements Stmt ::= Pr(p) | Perm(p, r, s) | true Pr(p) means principal p has property Pr. Perm(p, r, s) means p is permitted to exercise right r over resource s.

Syntax (cont.) grant ::=  x 1 …  x n (Stmt  …  Stmt  Stmt) If condition holds, then conclusion holds. In our fragment, grants are closed (no free variables). license ::= (grant, principal) (g, p) means p issues/says g. Stmt ::= Pr(p) | Perm(p, r, s) | true conditionconclusion

Examples Can write: `Joe is a professor’ as true  Prof(Joe) and Vicky says `Every professor who gives a talk may have a cookie’ as (  x (Prof(x)  GivesTalk(x)  Perm(x, eat, cookie)), Vicky).

Examples Can write: `Joe is a professor’ as true  Prof(Joe) and Vicky says `Every professor who gives a talk may have a cookie’ as (  x (Prof(x)  GivesTalk(x)  Perm(x, eat, cookie)), Vicky).

Examples Can write: `Joe is a professor’ as true  Prof(Joe) and Vicky says `Every professor who gives a talk may have a cookie’ as (  x (Prof(x)  GivesTalk(x)  Perm(x, eat, cookie)), Vicky).

Principals – in some detail Set of principals is the set of primitive principals (e.g., Alice, Bob) closed under union. E.g., Alice  Bob is a principal. Often written as {Alice, Bob}. According to the XrML doc, {p 1,.., p n } represents p 1, …, p n “acting together as one holistic identified entity”. But what does this mean?

Groups/members relationship Suppose that Alice has property Pr A and the group {Alice, …} has property Pr g. What should we infer? Option 1: nothing. Option 2: {Alice, …} has property Pr A. Option 3: Alice has property Pr g.

Groups/members relationship Suppose that Alice has property Pr A and group {Alice, …} has property Pr g. What should we infer? Option 1: nothing. Option 2: {Alice, …} has property Pr A. Option 3: Alice has property Pr g. XrML chooses each of these options (at different points in the specification).

Groups/members relationship Suppose that Alice has property Pr A and group {Alice, …} has property Pr g. What should we infer? Option 1: nothing. Option 2: {Alice, …} has property Pr A. Option 3: Alice has property Pr g. XrML chooses each of these options (at different points in the specification). No formal semantics  language is inconsistent!

Our fix Since XrML is inconsistent… We do not assume that a group has the properties of its members or vice-versa. But can easily write policies to force either relationship (or both).

The syntax given here is a fragment of XrML. (See paper for details.)

XrML Algorithm Query(s,L,G) algorithm s is a closed statement. L is a set of licenses. G is a set of grants that implicitly hold. Returns true if s “follows” from L and G. Query calls Auth and Holds.

Auth Algorithm Recall s is a closed statement. L is a set of licenses. G is a set of grants that implicitly hold. A condition is a conjunction of statements. Auth(s, L, G) returns a set D of closed conditions; s “follows” from L and G if a condition in D “holds”.

Holds Algorithm Holds(d,L) algorithm d is a closed condition. L is a set of licenses. Returns true if d “follows” from L.

Query(s, L, G) overview Query(s, L, G) Set D to the output of Auth(s, L, G) Return  d  D Holds(d, L) s is a closed statement, L is a set of licenses, and G is a set of grants that hold implicitly. s “follows” from L and G if a condition in the output of Auth(s, L, G) “holds”. Holds(d, L) returns true if d “follows” from L.

Problem Let g = true  Student(Alice), g’ =  x (Student(x)  Perm(x, eat, lunch)) May Alice eat lunch? Query(Perm(Alice, eat, lunch), , {g, g’})

Problem Let g = true  Student(Alice), g’ =  x (Student(x)  Perm(x, eat, lunch)) May Alice eat lunch? Query(Perm(Alice, eat, lunch), , {g, g’}) Query calls Auth(Perm(Alice, eat, lunch), , {g, g’}). Auth returns {Student(Alice)}.

Problem Let g = true  Student(Alice), g’ =  x (Student(x)  Perm(x, eat, lunch)) May Alice eat lunch? Query(Perm(Alice, eat, lunch), , {g, g’}) Query calls Auth(Perm(Alice, eat, lunch), , {g, g’}). Auth returns {Student(Alice)}. Query calls Holds(Student(Alice),  ). lost g!

Problem Let g = true  Student(Alice), g’ =  x (Student(x)  Perm(x, eat, lunch)) May Alice eat lunch? Query(Perm(Alice, eat, lunch), , {g, g’}) Query calls Auth(Perm(Alice, eat, lunch), , {g, g’}). Auth returns {Student(Alice)}. Query calls Holds(Student(Alice),  ). Holds returns false; so, Query returns false. lost g!

The fix To correct the problem, pass G to Holds and modify Holds to use the new info. Bug is easy to find and easy to fix, but still made it into the released March 2003 version of the spec.

Another bug Query(s, , {  x (Perm(p, issue, x)  s)}) Query calls Auth on same input. Auth returns {Perm(p, issue, g) | g is a grant}. Recall: Auth output is a set of closed conditions. Query calls Holds on each returned condition.

Another bug Query(s, , {  x (Perm(p, issue, x)  s)}) Query calls Auth on same input. Auth returns {Perm(p, issue, g) | g is a grant}. Recall: Auth output is a set of closed conditions. Query calls Holds on each returned condition. The set of grants is infinite. g 0 = true  Student(Alice) g i = true  Perm(Bob, issue, g i-1 ), i = 1, … D is an infinite set; so, Query doesn’t terminate.

Our fix Restrict the grants in the language. If a grant g has a condition d, d mentions a resource variable x, and x is free in d, then x is free in g’s conclusion. Easy to prove that if the restriction is met, then Auth always returns a finite set. Can make an empirical argument for why this restriction is okay.

But that’s not all… In this small fragment of XrML, there are 2 other bugs. See paper for details.

The translation The translation is fairly straightforward. Two points worth noting: Query assumes that a grant g holds, if it’s issued by some trusted principal (i.e., a principal whose permitted to issue g). Holds(d, L, G) returns true iff d logically follows from L and G. So, a condition d holds iff d logically follows from L and G. The translation depends on L and G.

Correctness Thm: the fixed Query(s, L, G) returns true iff  l  L l L,G   g  G g L,G  s L,G is true in every model that satisfies the union properties (p 1  p 2 = p 2  p 1, …). t L,G is the translation of t wrt L and G.

Complexity The XrML alg. runs in exponential time. The XrML document says that the language implementers are responsible for optimizations. But using the translation, we can prove that…

Complexity Determining if a set of XrML grants imply a statement is NP-hard. This is because the language supports sets of primitive principals. If we remove  from the language… XrML translates (essentially) to Datalog, which is a well-known tractable language. Given the translation, finding a tractable, fairly expressive fragment is easy.

Accomplishments We have proposed the first formal semantics for XrML and, in the process, found significant problems with the spec. But the importance of this work hinges on whether the results actually lead to a better language.

Impact When we found bugs, we told Xin Wang and Thomas DeMartini from the MPEG standards committee.

Impact When we found bugs, we told Xin Wang and Thomas DeMartini from the MPEG standards committee. In March 2004, the committee released a new version, which is an ISO standard. They addressed all of our concerns!

Key Change The algorithm is replaced by a formal description of when a permission follows from L and G. A permission p follows from L and G iff there’s a tree of finite depth such that leaves are statements that implicitly hold, non-leaves are implied by a single grant and their children, and the root is p.

Example Alice is a good student and all good students may play. May Alice play? Yes. Alice is good Alice is a student Alice may play known facts follows from children and grant `all good students may play’

Observations The standard has formal semantics! That’s what the formal description is. Also, some corrections are no longer necessary. We no longer need to restrict the class of grants so that the algorithm terminates or modify the algorithm so that complete information is passed from one routine to the next.

But, wait a minute At the end of the day, we want to implement XrML. How do we do this without an algorithm? Tweak our translation to match the standard, then use Datalog techniques to solve the validity problem (assuming no union op). First-order semantics still useful, even though standard has its own.

Another change The standard assumes that a group has the properties of its members. Recall: 2003 version is inconsistent. Why build this relationship into the language?

Hidden assumption The XrML designers assume that (essentially) all facts about the world come from individuals presenting certificates. Examples: Alice is a student if she presents her student id. A principal is the group {Alice, Bob} if the principal presents both Alice and Bob’s IDs.

A consequence In certificate passing systems it is rare that an individual loses a right by presenting a certificate. {Alice, Bob} gets some rights by presenting the Alice ID and does not lose these rights by presenting Bob’s. The assumption guarantees this. Assuming a certificate passing system leads to other design decisions….

Negation is unnecessary Most policies for certificate passing systems are negation-free. Certificates usually give the properties that their holders have. E.g. students are given student ids, instead of everyone else being given `not a student’ ids. It’s silly to have a policy that forbids an action if certain credentials are presented. No one will present them when wanting to do the forbidden task.

Problem Certificate passing systems are not appropriate for all applications. Many policies need negation. E.g. `if a child is not grounded, then she may play outside’ and `smoking is not permitted on the airplane.’ These policies cannot be captured explicitly in the standard.

A partial solution Assume that an action is forbidden unless it’s explicitly permitted. Problem: Can’t distinguish forbidden actions from unregulated ones. As a result, policy sets can’t be merged. E.g. A university’s policies talk about who’s permitted to get tenure. The policies for Alice’s new outreach program don’t.  Alice’s policies contradict the university’s.

We may want functions too. Functions often occur naturally when translating policies from English to first-order logic. E.g. `Classified information may be copied from one secure server to another’:  x 1, x 2, x 3, x 4 (Classified(x 1 )  Secure(x 2 )  Secure(x 3 )  Permitted(x 4, copySrcDst(x 2, x 3 ), x 1 ))

We may want functions too. Functions often occur naturally when translating policies from English to first-order logic. E.g. `Classified information may be copied from one secure server to another’:  x 1, x 2, x 3, x 4 (Classified(x 1 )  Secure(x 2 )  Secure(x 3 )  Permitted(x 4, copySrcDst(x 2, x 3 ), x 1 ))

Extending the standard We want the standard to support negations and functions. Extending the syntax is easy. It’s not clear how to modify the semantics given in the standard. But, extending the translation is easy. Policies simply translate to a fragment of modal first-order logic that includes functions and negation.

Complexity The validity problem for first-order formulas with functions and negation is undecidable. Idea: Restrict functions and negation so that policies translate to a tractable fragment.

Datalog There are tractable fragments of Datalog that support some functions and negation, but the use of functions is severely restricted and negation cannot appear in the conclusion of Datalog rules, so no support for policies that forbid actions.

Lithium The Lithium language [Halpern/Weissman] is a tractable fragment of first-order logic that supports unlimited use of functions and allows a fair amount of negation in both the premise and conclusions of statements.

Expressivity Experimental results suggest that Lithium is sufficiently expressive for many applications. We collected a number of policies from libraries and government databases, and have written them in Lithium.

Summary We proposed the first formal semantics for XrML. This helped the language developers to find several bugs, which they fixed. They have since given XrML formal semantics; however, our approach seems better suited to extending and analyzing the language using known techniques.

Summary Industry wants to implement XrML but … XrML has no formal semantics and needs them! We give formal semantics to a representative fragment of XrML . Even a small fragment is intractable. We can leverage results in the CS literature to find fairly expressive, tractable options. Next step: Add negation to XrML. This is critical for merging policies.  Two minor bugs in paper (don’t effect results); corrected version online.

talk ends on preceding slide

Sample XrML policy Consider the policy `anyone may play the movie `Big Hit’ for $2 (per use)’. We could write this policy in XrML as…

Big Hit 2.00 </sx:paymentPerUse

The translation We now translate XrML licenses and grants to “equivalent” formulas in modal first-order logic. The translation relies on which licenses have been issued and which grants implicitly hold. Let s L,G be the translation of any string s wrt the input parameters L and G.

Translation (cont.) Except for licenses and grants, translation is easy. We assume a constant c g for each grant g Perm(p, issue, g) L,G = Perm(p, issue, c g ) (d 1  d 2 ) L,G = d 1 L,G  d 2 L,G, Pr(p) L,G = Pr(p), and true L,G = true

Translating licenses Recall: (g, p) means p said g. According to Query, if p may issue g, then (g,p) means that g holds otherwise, (g, p) is meaningless Option 1: (g, p) L,G = Said(p, c g ), restrict to models satisfying the axiom scheme Said(p, c g )  Perm(p, issue, c g )  g L,G Option 2: (g, p) L,G = Perm(p, issue, c g )  g L,G

Translating grants  x 1 …  x n (d  e) L,G =  x 1 …  x n (Holds(d, L, G)  e L,G ) Holds(d, L, G) returns true iff d is a logical consequence of L and G. Define a modal operator Val, where Val(  ) is true in a model m iff  is true in all models. Holds(d, L, G)=Val(  l  L l L,G   g  G g L,G  d L,G )