TAPP-09 23/02/2009Giorgos Flouris1 On Explicit Provenance Management in RDF/S Graphs Institute of Computer Science Foundation for Research and Technology.

Slides:



Advertisements
Similar presentations
1 ICS-FORTH EU-NSF Semantic Web Workshop 3-5 Oct Christophides Vassilis Database Technology for the Semantic Web Vassilis Christophides Dimitris Plexousakis.
Advertisements

TU e technische universiteit eindhoven / department of mathematics and computer science Modeling User Input and Hypermedia Dynamics in Hera Databases and.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
WP3: Provenance and Access Control Irini Fundulaki Giorgos Flouris Institute of Computer Science-FORTH 1st year review Luxembourg, December 2011.
Designing Indexing Structure for Discovering Relationships in RDF Graphs Stanislav Bartoň.
Dr. Jim Bowring Computer Science Department College of Charleston CSIS 690 (633) May Evening 2009 Semantic Web Principles and Practice Class 5: 27 May.
 Copyright 2008 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Context Dependent Reasoning.
Dr. Alexandra I. Cristea RDF.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
More RDF CS 431 – Carl Lagoze – Cornell University Acknowledgements: Eric Miller Dieter Fensel.
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
1 Optimizing Utility in Cloud Computing through Autonomic Workload Execution Reporter : Lin Kelly Date : 2010/11/24.
Cornell CS 502 Resource Description Framework Building the Semantic Web CS 502 – Carl Lagoze – Cornell University Acknowledgements: Eric Miller.
Nancy Ide Vassar College USA Resource Definition Framework A Tutorial EUROLAN 2003 July 28 - August 8 Bucharest - Romania.
RDF Semantics by Patrick Hayes W3C Recommendation Presented by Jie Bao RPI Sept 4, 2008 Part 1 of RDF/OWL Semantics Tutorial.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Publishing data on the Web (with.
BOEMIE Workshop 02/12/2008Giorgos Flouris1 Formalizing the Evolution Process Institute of Computer Science Foundation for Research and Technology – Hellas.
1 CENTRIA, Dept. Informática da Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, Caparica, Portugal. 2 Institute of Computer Science,
Okech Odhiambo Faculty of Information Technology Strathmore University
Applying Belief Change to Ontology Evolution PhD Student Computer Science Department University of Crete Giorgos Flouris Research Assistant.
FORTH Research Activities PlanetData WP1-3 Meeting (Frankfurt, Nov10) Giorgos Flouris, Irini Fundulaki – FORTH.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Logics for Data and Knowledge Representation
Database Support for Semantic Web Masoud Taghinezhad Omran Sharif University of Technology Computer Engineering Department Fall.
SQL Databases are a Moving Target Juan F. Sequeda – Syed Hamid Tirmizi –
Recent research : Temporal databases N. L. Sarda
IDB, SNU Dong-Hyuk Im Efficient Computing Deltas between RDF Models using RDFS Entailment Rules (working title)
Master Informatique 1 Semantic Technologies Part 11Direct Mapping Werner Nutt.
OWL 2 in use. OWL 2 OWL 2 is a knowledge representation language, designed to formulate, exchange and reason with knowledge about a domain of interest.
Pavan Reddiavri (Ebiquity Labs) “R ♫ P” RDF Access control Policies.
Part I: Set Constructs. RDF Schema (RDFS) RDF does not provide mechanisms to define domain classes and properties RDFS is a vocabulary that provides many.
Chapter 3 RDF and RDFS Semantics. Introduction RDF has a very simple data model But it is quite liberal in what you can say Semantics can be given using.
RQL: RDF Query language Jianguo Lu University of Windsor The following slides are from Grigoris Antoniou, Frank van Harmelen, “A Semantic Web Primer”
Turtle – RDF Graph. Turtle – Basic Syntax Triples are terminated with a full stop URLs are encoded in angle brackets ( ) Literals are enclosed by double.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
WP3: Provenance and Access Policies Giorgos Flouris (FORTH) - Irini Fundulaki (CWI & FORTH) -
1 Artificial Intelligence Applications Institute Centre for Intelligent Systems and their Applications Stuart Aitken Artificial Intelligence Applications.
E.Bertino, L.Matino Object-Oriented Database Systems 1 Chapter 5. Evolution Seoul National University Department of Computer Engineering OOPSLA Lab.
Ontology Architectural Support Options Group Name: MAS WG Source: Catalina Mladin, Lijun Dong, InterDigital Meeting Date: Agenda Item: TBD.
Of 35 lecture 5: rdf schema. of 35 RDF and RDF Schema basic ideas ece 627, winter ‘132 RDF is about graphs – it creates a graph structure to represent.
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
Ch 7: RDF schema 현근수, 김영욱, 백상윤, 이용현 Team C. Introduction Semantic web modeling In RDF: simply creates graph structure to represent data In RDFS: about.
ELIS – Multimedia Lab PREMIS OWL Sam Coppens Multimedia Lab Department of Electronics and Information Systems Faculty of Engineering Ghent University.
A Semantic Web Approach for the Third Provenance Challenge Tetherless World Rensselaer Polytechnic Institute James Michaelis, Li Ding,
Practical RDF Chapter 12. Ontologies: RDF Business Models Shelley Powers, O’Reilly SNU IDB Lab. Taikyoung Kim.
Of 38 lecture 6: rdf – axiomatic semantics and query.
Kumar Madurai October 21, 2013 Knowledge Engineering Using Linked Data in an Enterprise 1.
Knowledge Technologies Manolis Koubarakis 1 Some Other Useful Features of RDF.
An Optimization Technique for RDFS Inference using the Application Order of RDFS Entailment Rules Kisung Kim, Taewhi Lee
Sesame A generic architecture for storing and querying RDF and RDFs Written by Jeen Broekstra, Arjohn Kampman Summarized by Gihyun Gong.
WP3: Data Provenance and Access Control Irini Fundulaki, FORTH December 11-12, 2012, Luxembourg.
Linked Open Data Dataset from Related Documents Petya Osenova and Kiril Simov IICT-BAS LDL-2016, LREC, Portoroz.
LegalRuleML Metamodel Tara Athan, Harold Boley, Guido Governatori, Monica Palmirani, Adrian Paschke, Adam Wyner July 13, 2013 RuleML th International.
Linked Data & Semantic Web Technology The Semantic Web Part 7. RDF Semantics Dr. Myungjin Lee.
Semantic Web in Depth RDF Schema Dr Nicholas Gibbins –
Co-funded by the European Union under FP7-ICT Co-ordinated by aparsen.eu #APARSEN Provenance Interoperability and Reasoning Yannis Tzitzikas Assistant.
SDN Network Updates Minimum updates within a single switch
Service-Oriented Computing: Semantics, Processes, Agents
The Semantic Web Part 6. RDF Vocabularies: RDF Schema
Service-Oriented Computing: Semantics, Processes, Agents
Service-Oriented Computing: Semantics, Processes, Agents
Tutorial on Semantic Web
LegalRuleML Metamodel
Introduction to RDF and RDFS Editor: MR3
ece 720 intelligent web: ontology and beyond
Chapter 3 RDF and RDFS Semantics
Data Provenance.
Semantic Web Basics (cont.)
On Provenance of Queries on Linked Web Data
Presentation transcript:

TAPP-09 23/02/2009Giorgos Flouris1 On Explicit Provenance Management in RDF/S Graphs Institute of Computer Science Foundation for Research and Technology – Hellas Heraklion, Greece Panagiotis Pediaditis Giorgos Flouris Irini Fundulaki Vassilis Christophides {pped, fgeo, fundul,

TAPP-09 23/02/2009Giorgos Flouris2 Provenance Management in RDF/S  Provenance management problem  Mostly addressed in the database context  We are dealing with why provenance in RDF/S graphs — Why provenance: identifying the source data that had some influence on the existence of the target data  Three main characteristics (peculiarities of RDF/S)  Triple-based representation — Use quadruples to talk about triples’ provenance  Inference — Assign provenance information to implicit data  Coherence semantics (in updates) — Implicit data is a first-class citizen and should be retained during change, along with its provenance information

TAPP-09 23/02/2009Giorgos Flouris3 Characteristic #1 Triple-based Representation

TAPP-09 23/02/2009Giorgos Flouris4 RDF Graphs Paper10 PaperTAPP Paper instance rdf:type subclass rdfs:subClassOf Giorgos Author Person writes RDF graph = set of RDF triples Define classes [Paper rdf:type rdfs:Class] [PaperTAPP rdf:type rdfs:Class] [Person rdf:type rdfs:Class] [Author rdf:type rdfs:Class] Define properties [writes rdf:type rdf:Property [writes rdfs:domain Author] [writes rdfs:range Paper] Instantiate (and define) individuals [Paper10 rdf:type PaperTAPP] [Giorgos rdf:type Author] [Giorgos writes Paper10] Define hierarchies [PaperTAPP rdfs:subClassOf Paper] [Author rdfs:subClassOf Person] And other stuff…

TAPP-09 23/02/2009Giorgos Flouris5 Provenance in RDF Graphs Paper10 PaperTAPP Paper instance rdf:type subclass rdfs:subClassOf Giorgos Author Person writes Publications Graph (PUB) TAPP Graph (TAPP) PUB:[Paper rdf:type rdfs:Class] TAPP:[PaperTAPP rdf:type rdfs:Class] PUB:[Person rdf:type rdfs:Class] PUB:[Author rdf:type rdfs:Class] PUB:[writes rdf:type rdf:Property] PUB:[writes rdfs:domain Author] PUB:[writes rdfs:range Paper] TAPP:[Paper10 rdf:type PaperTAPP] TAPP:[Giorgos rdf:type Author] TAPP:[Giorgos writes Paper10] TAPP:[PaperTAPP rdfs:subClassOf Paper] PUB:[Author rdfs:subClassOf Person]

TAPP-09 23/02/2009Giorgos Flouris6 Named Graphs and Provenance  Create two named graphs and assign an ID (URI) to each  Publications graph (URI: PUB)  TAPP graph (URI: TAPP)  Each named graph corresponds to a different source  Need some method to associate named graphs with triples  Triples become quadruples  Fourth element is the URI of the named graph (origin) Paper10 PaperTAPP Paper instance rdf:type subclass rdfs:subClassOf Giorgos Author Person writes

TAPP-09 23/02/2009Giorgos Flouris7 Quadruples for Provenance Paper10 PaperTAPP Paper instance rdf:type subclass rdfs:subClassOf Giorgos Author Person writes [Paper rdf:type rdfs:Class PUB] [PaperTAPP rdf:type rdfs:Class TAPP] [Person rdf:type rdfs:Class PUB] [Author rdf:type rdfs:Class PUB] [writes rdf:type rdf:Property PUB] [writes rdfs:domain Author PUB] [writes rdfs:range Paper PUB] [Paper10 rdf:type PaperTAPP TAPP] [Giorgos rdf:type Author TAPP] [Giorgos writes Paper10 TAPP] [PaperTAPP rdfs:subClassOf Paper TAPP] [Author rdfs:subClassOf Person PUB] All quadruples of the form [s p o PUB] originate from named graph PUB (Publications graph) All quadruples of the form [s p o TAPP] originate from named graph TAPP (TAPP graph)

TAPP-09 23/02/2009Giorgos Flouris8 Properties of Named Graphs  The named graph URI can be used to refer to the named graph  Can be used for assignment of metadata [TAPP hasAuthor JamesCheney G]  Granularity of provenance  A triple is the smallest bit of information  The granularity of provenance achieved by named graphs is at the triple level  Flexible — A named graph can contain 0,1, or many triples — A triple can belong to 0,1, or many named graphs Paper10 PaperTAPP Paper instance rdf:type subclass rdfs:subClassOf Giorgos Author Person writes

TAPP-09 23/02/2009Giorgos Flouris9 Characteristic #2 Inference

TAPP-09 23/02/2009Giorgos Flouris10 RDF/S Graphs  RDF Schema: add-on to RDF  RDFS adds inference semantics  Transitivity of subclass/subproperty  Implicit instantiations  Example  [Giorgos rdf:type Author]  [Author rdfs:subClassOf Person]  Inference: [Giorgos rdf:type Person]  Inferred knowledge is implicit Paper10 PaperTAPP Paper instance rdf:type subclass rdfs:subClassOf Giorgos Author Person writes

TAPP-09 23/02/2009Giorgos Flouris11 Provenance and Inference  Quadruples:  [Giorgos rdf:type Author PUB]  [Author rdfs:subClassOf Person TAPP]  [Giorgos rdf:type Person ???]  Needs:  Shared ownership  A more sophisticated, compound structure  Keeping the connection with the components  Composition operator (PT=PUB●TAPP) — [Giorgos rdf:type Person PT] — Ok, but see characteristic #3 Paper10 PaperTAPP Paper instance rdf:type subclass rdfs:subClassOf Giorgos Author Person writes

TAPP-09 23/02/2009Giorgos Flouris12 Characteristic #3 Coherence Semantics (in Updates)

TAPP-09 23/02/2009Giorgos Flouris13 Foundational Semantics  Foundational viewpoint (pyramid):  Knowledge consists of the explicitly represented knowledge  Only explicit knowledge can be changed  Implicit knowledge is affected indirectly, through the changes in the explicit knowledge (so that the resulting “pyramid” is “stable”)  Explicit knowledge is more important than implicit knowledge Basic Knowledge Supported Knowledge Explicit Knowledge Implicit Knowledge

TAPP-09 23/02/2009Giorgos Flouris14 Coherence Semantics  Coherence viewpoint (raft):  No discrimination between explicit and implicit knowledge  Both explicit and implicit knowledge can be changed  Changes should be made coherently in order for the resulting knowledge to make sense (so that the “raft” is “stable”)  Explicit and implicit knowledge are of the same value { Knowledge (includes both implicit and explicit knowledge)

TAPP-09 23/02/2009Giorgos Flouris15 Deletes  Under coherence semantics  Inferred knowledge needs to be made explicit (when in danger of being lost)  Explicit assignment of shared origin to triples  Explicit shared origin assignment  Cannot use any composition operator  Must be a first-class construct (autonomous)  Retain the connection with its constituents  A need, but also a useful feature Paper10 PaperTAPP Paper instance rdf:type subclass rdfs:subClassOf Giorgos Author Person writes

TAPP-09 23/02/2009Giorgos Flouris16 RDF/S Graphsets  Graphsets are like named graphs  Have IDs (URIs)  Used in quadruples — Association of triples with graphsets [Giorgos rdf:type Person PT] — Can be referred to (metadata) [PT rdf:type Confidential G]  Encode origin or shared origin  [Giorgos rdf:type Person PT]  URI association (via skolem function) — PT is the URI of {PUB, TAPP} — PUB is the URI of {PUB}  A named graph is a graphset — PUB corresponds to {PUB} Paper10 PaperTAPP Paper instance rdf:type subclass rdfs:subClassOf Giorgos Author Person writes PT

TAPP-09 23/02/2009Giorgos Flouris17 Querying With RDF/S Graphsets  Standard queries (original RQL)  Give me the Persons [Giorgos]  Provenance queries (extended RQL)  Give me the Persons per {PUB} [ ]  Give me the Persons per {TAPP, PUB} [Giorgos]  Give me the sources per which Author is a subclass of Person [{PUB}]  Give me all the individual sources [{TAPP}, {PUB}] Paper10 PaperTAPP Paper instance rdf:type subclass rdfs:subClassOf Giorgos Author Person writes

TAPP-09 23/02/2009Giorgos Flouris18 Validity and Redundancy Elimination  Two invariants for RDF/S graphs  Valid (per some validity rules)  Redundant-free (space considerations)  The invariants allow optimized execution of queries  These invariants are imposed during change  Improve query speed, but make updates more difficult  Trade-off between having query overhead or update overhead

TAPP-09 23/02/2009Giorgos Flouris19 Updating With RDF/S Graphsets  Updates supported through an extended version of RUL  INSERT and DELETE  Only for data (class and property instances)  Implicit or explicit knowledge  Take into account and update graphset (provenance) information  Main considerations  Apply the change (INSERT or DELETE)  Respect invariants — Non-redundancy (INSERT) and validity (DELETE)  Make minimal changes (under coherence viewpoint) — No unnecessary loss of information  Take into account and preserve graphset (provenance) information — Applicable upon quadruples

TAPP-09 23/02/2009Giorgos Flouris20 Conclusion  Objective: assign provenance information to RDF/S graphs to capture why provenance  Triple-based representation — Turned triples into quadruples and used named graphs to record the origin  Inference (per RDFS) — Composed named graphs  Coherence semantics in updates (deletes) — Used graphsets for composed named graphs (cannot use an operator)  Proposed query and update languages for graphsets  Based on RQL, RUL  Can be used to query/update provenance information  Provided syntax and semantics, as well as an implementation — Demo at:

TAPP-09 23/02/2009Giorgos Flouris21

TAPP-09 23/02/2009Giorgos Flouris22 EXTRA SLIDES

TAPP-09 23/02/2009Giorgos Flouris23 RDF/S Graphset Properties  Three types of triples in a graphset:  Explicitly assigned triples  Implicitly assigned triples (from the constituent named graphs)  Implications of the above (per RDFS) Paper10 PaperTAPP Paper instance rdf:type subclass rdfs:subClassOf Giorgos Author Person writes PT

TAPP-09 23/02/2009Giorgos Flouris24 Inserts and Deletes: General Process  INSERT  Validity respected  Must verify non-redundancy  Process  If INSERT is redundant ignore it  Remove all redundant information (after insert)  DELETE  Must verify validity  Non-redundancy respected  Issues with inference and the coherence viewpoint  Process  If DELETE is void ignore it  Make explicit all originally redundant information that will be lost otherwise  Restore validity by removing property instances if necessary