1 Representing and Reasoning on XML Documents: A Description Logic Approach D. Calvanese, G. D. Giacomo, M. Lenzerini Presented by Daisy Yutao Guo University.

Slides:



Advertisements
Similar presentations
XML: Extensible Markup Language
Advertisements

OWL - DL. DL System A knowledge base (KB) comprises two components, the TBox and the ABox The TBox introduces the terminology, i.e., the vocabulary of.
Constraint Propagation Algorithms for Temporal Reasoning Marc Vilain, Henry Kautz (AAAI 1986) Presentation by Lin XU March 4, 2002.
Logic.
1 A Description Logic with Concrete Domains CS848 presentation Presenter: Yongjuan Zou.
Of 27 lecture 7: owl - introduction. of 27 ece 627, winter ‘132 OWL a glimpse OWL – Web Ontology Language describes classes, properties and relations.
DL-LITE: TRACTABLE DESCRIPTION LOGICS FOR ONTOLOGIES AUTHORS: DIEGO CALVANESE, GIUSEPPE DE GIACOMO, DOMENICO LEMBO, MAURIZIO LENZERINI, RICCARDO ROSATI.
INTRODUCTION TO ARTIFICIAL INTELLIGENCE Massimo Poesio LECTURE 4: Semantic Networks and Description Logics.
1 Conditional XPath, the first order complete XPath dialect Maarten Marx Presented by: Einav Bar-Ner.
Validating Streaming XML Documents Luc Segoufin & Victor Vianu Presented by Harel Paz.
Keys For XML Peter Buneman Susan Davidson Wenfei Fan Carmem Hara Wang Chiew Tan.
Description Logics. Outline Knowledge Representation Knowledge Representation Ontology Language Ontology Language Description Logics Description Logics.
XML –Query Languages, Extracting from Relational Databases ADVANCED DATABASES Khawaja Mohiuddin Assistant Professor Department of Computer Sciences Bahria.
Chapter 7 Reasoning about Knowledge by Neha Saxena Id: 13 CS 267.
DL systems DL and the Web Ilie Savga
Part 6: Description Logics. Languages for Ontologies In early days of Artificial Intelligence, ontologies were represented resorting to non-logic-based.
1 Advanced Topics XML and Databases. 2 XML u Overview u Structure of XML Data –XML Document Type Definition DTD –Namespaces –XML Schema u Query and Transformation.
4/20/2017.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Main challenges in XML/Relational mapping Juha Sallinen Hannes Tolvanen.
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Towards Translating between XML and WSML based on mappings between.
An Introduction to Description Logics. What Are Description Logics? A family of logic based Knowledge Representation formalisms –Descendants of semantic.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
A Z Approach in Validating ORA-SS Data Models Scott Uk-Jin Lee Jing Sun Gillian Dobbie Yuan Fang Li.
Ming Fang 6/12/2009. Outlines  Classical logics  Introduction to DL  Syntax of DL  Semantics of DL  KR in DL  Reasoning in DL  Applications.
Query Answering Based on the Modal Correspondence Theory Evgeny Zolin University of Manchester Manchester, UK
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
LDK R Logics for Data and Knowledge Representation ClassL (part 3): Reasoning with an ABox 1.
RDF and XML 인공지능 연구실 한기덕. 2 개요  1. Basic of RDF  2. Example of RDF  3. How XML Namespaces Work  4. The Abbreviated RDF Syntax  5. RDF Resource Collections.
Advanced topics in software engineering (Semantic web)
LDK R Logics for Data and Knowledge Representation PL of Classes.
LDK R Logics for Data and Knowledge Representation Description Logics (ALC)
More on Description Logic(s) Frederick Maier. Note Added 10/27/03 So, there are a few errors that will be obvious to some: So, there are a few errors.
Programming Languages and Design Lecture 3 Semantic Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
Logical Agents Chapter 7. Outline Knowledge-based agents Logic in general Propositional (Boolean) logic Equivalence, validity, satisfiability.
DL Overview Second Pass Ming Fang 06/19/2009. Outlines  Description Languages  Knowledge Representation in DL  Logical Inference in DL.
LDK R Logics for Data and Knowledge Representation ClassL (Propositional Description Logic with Individuals) 1.
ece 627 intelligent web: ontology and beyond
Reasoning Systems For Categories By Franklyn O. Reasoning Systems For Categories Categories are the primary building blocks of any large-scale knowledge.
1 Instance Store Database Support for Reasoning over Individuals S Bechhofer, I Horrocks, D Turi. Instance Store - Database Support for Reasoning over.
Knowledge Representation and Reasoning University "Politehnica" of Bucharest Department of Computer Science Fall 2011 Adina Magda Florea
Presented by Kyumars Sheykh Esmaili Description Logics for Data Bases (DLHB,Chapter 16) Semantic Web Seminar.
Of 29 lecture 15: description logic - introduction.
LDK R Logics for Data and Knowledge Representation Description Logics: family of languages.
Mathematical Service Matching Using Description Logic and OWL Kamelia Asadzadeh Manjili
Ontology Technology applied to Catalogues Paul Kopp.
CSE573 Autumn /09/98 Machine Learning Administrative –Last topic: Decision Tree Learning Reading: 5.1, 5.4 Last time –finished NLP sample system’s.
Logical Agents. Outline Knowledge-based agents Logic in general - models and entailment Propositional (Boolean) logic Equivalence, validity, satisfiability.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
COP Introduction to Database Structures
P & NP.
Logics for Data and Knowledge Representation
ece 720 intelligent web: ontology and beyond
Semantic Web Foundations
Knowledge and reasoning – second part
ece 627 intelligent web: ontology and beyond
Computational Complexity of Terminological Reasoning in BACK
EA C461 – Artificial Intelligence Logical Agent
ece 720 intelligent web: ontology and beyond
Logics for Data and Knowledge Representation
Logics for Data and Knowledge Representation
Logics for Data and Knowledge Representation
Logics for Data and Knowledge Representation
Knowledge and reasoning – second part
Knowledge Representation I (Propositional Logic)
Logics for Data and Knowledge Representation
Deniz Beser A Fundamental Tradeoff in Knowledge Representation and Reasoning Hector J. Levesque and Ronald J. Brachman.
ONTOMERGE Ontology translations by merging ontologies Paper: Ontology Translation on the Semantic Web by Dejing Dou, Drew McDermott and Peishen Qi 2003.
CIS Monthly Seminar – Software Engineering and Knowledge Management IS Enterprise Modeling Ontologies Presenter : Dr. S. Vasanthapriyan Senior Lecturer.
Presentation transcript:

1 Representing and Reasoning on XML Documents: A Description Logic Approach D. Calvanese, G. D. Giacomo, M. Lenzerini Presented by Daisy Yutao Guo University of Waterloo March 4, 2003

2 Outline Introduction XML documents, DTDs and basic reasoning tasks The Description Logic ( DL ) for representing DTDs Representing and reasoning over DTDs in DL Retrieving XML documents from a document base Conclusion and discussion

3 Introduction Recent proposals suggest viewing the Web as a huge semistructured database Drawbacks: they pay insufficient attention to the representation and reasoning on document structures Importance of such reasoning facility: help in several tasks related to query processing; improve both precision and efficiency

4 Introduction (cont ’ d) Documents in the Web are described by means of ad-hoc languages, whose structure is typically explicitly marked up with special tags. XML is one of the most prominent formalism for defining marked-up documents. The structure of XML documents is described by means of DTDs. Types of reasoning about DTDs: Conformance Equivalence (Inclusion, Disjointness)

5 Introduction (cont ’ d) Contributions of this paper: Present a formalization of XML DTDs in terms of an expressive Description Logic, called DL Present sound, complete and terminating inference procedures for DL, which provide a reasoning mechanism to enable reasoning tasks on DTDs Conformance: polynomial time R -equivalence (inclusion, disjointness): exponential time Propose a method for retrieving XML documents from a document base by taking advantage of DL

6 XML DTD and Document Instance Examples

7 XML Documents Each XML document can be considered as a pair: (D, d): D — DTD d — document instance Alphabets: T — terminals (e.g., #PCDATA) E — element types. For each E ∈ E, we associate a start tag and end tag

8 XML DTDs Symbol S ∈ T ∪ E, i.e., either a generic terminal or an element type Definition 2: A DTD D is a pair (P, R): R: root element type, which specifies the document type P: a set of element type definitions, each in the form E ->  E: the element type to be defined  : the content model, an expression over symbols in T ∪ E:

9 Basic Reasoning Tasks on XML DTDs — Conformance Definition of the set of document instances in terms of a DTD to which they conform to

10 Basic Reasoning Tasks on XML DTDs — Equivalence Strong Equivalence (Inclusion, Disjointness): the names of the tags are important. Structural Equivalence (Inclusion, Disjointness): replace the tags into unnamed <> or, and only consider the document structures.

11 Basic Reasoning Tasks on XML DTDs — Equivalence (cont ’ d) Example of structural inclusion:

12 Basic Reasoning Tasks on XML DTDs — Equivalence (cont ’ d) Natural generalization: R -equivalence (inclusion, disjointness): View strong/structural equivalence (inclusion, disjointness) as extremes R is an equivalence relation on E For each E ∈ E, denote [E] R the equivalence class of E respect to R e.g., in the previous example: [Mail] R ={Mail, Note}=[Note] R [Text] R ={Text, Body}=[Body] R ----in which case N ⊑ R M

13 Basic Reasoning Tasks on XML DTDs — Equivalence (cont ’ d) R -Conformance and R -equivalence (inclusion, disjointness):

14 The Description Logic ( DL ) for Representing DTDs Syntax and semantics of DL concept and role constructs

15 The Description Logic ( DL ) for Representing DTDs (cont ’ d) Knowledge bases in DL: A DL knowledge base K : a set of assertions of the form: C 1 ⊑ C 2 or C 1 ≡ C 2, where C 1 and C 2 are arbitrary concepts with no restrictions An interpretation I satisfies the C 1 ⊑ C 2 if C 1 I  C 2 I. I is a model of K if it satisfies all assertions in K If I is considered a first order structure, checking whether I is a model of K has polynomial complexity

16 The Description Logic ( DL ) for Representing DTDs (cont ’ d) Reasoning services in DL: Concept satisfiability: K ⊨ C ≢ ⊥ Knowledge base satifiability: K ⊨ ⊤ ≢ ⊥ Subsumption : K ⊨ C 1 ⊑ C 2 (can be reduced to K ⊨ C 1 ⊓﹁ C 2 ≡ ⊥ ) Theorem: It is sufficient to consider concept satisfiability only, which is an EXPTIME-complete problem

17 Representation of DTDs in DL Given a DTD D=(P, S 0 ), define a DL knowledge base K, named characteristic knowledge base of D The alphabet of K (i.e., the atomic concepts and roles): The atomic concepts Tag and Terminal For each F ∈ T, one atomic concept F For each element type E ∈ E, one atomic concept StartE and one atomic concept EndE The atomic roles f and r (standing for “ first ” and “ rest ” respectively) For each element type definition, one atomic concept E D ---All but atomic concepts E D are basic atomic concepts/roles The set of assertions of K is constituted by K T,E, K R and K D

18 Representation of DTDs in DL (cont’d) General structure: Example: Tree representing the document with h internal components: … Connection roles: For start tag : f-filler For end tag : r h+1 -filler For the k th component: (r k ◦ f)-filler (1 ≤ k ≤h)

19 Representation of DTDs in DL (cont’d) K T,E: Encoding of general structural properties. Assertions:

20 Representation of DTDs in DL (cont’d) K R : Encoding of equivalence relation R Let be the set of equivalence classes determined by R Assertions:

21 Representation of DTDs in DL (cont’d) K D : Encoding of the DTD Assertion: where, : inductively defined complex role: Atomic concept for the definition of a symbol:

22 Representation of DTDs in DL (cont’d) Example: K D for DTD M

23 Reasoning over DTDs in DL —R -Conformance There is a liner time construction of a one-to-one mapping β : from a R -document instance d to a model of K Example: Apply β to the mail document instance conforming to DTD M

24 Reasoning over DTDs in DL —R -Conformance (cont’d) Theorem: verifying R -conformance of a document instance d to a DTD D can be polynomially reduced to model checking in DL The construction of K takes polynomial time in the size of D The construction of a model based on a XML document instance d takes linear time in size of d Corollary: verifying R -conformance of a document instance d to a DTD D can be done in time polynomial in the size of d and D

25 Reasoning over DTDs in DL—R - Equivalence (Inclusion, Disjointness) The checking of R - equivalence between two DTDs D and D ’ can be done by checking the equivalence between the atomic concepts for root element definitions under a unique model of K, where K = K T,E ∪ K R ∪ K D ∪ K D’ Formally:

26 Retrieving XML Documents from a Document Base Document base B = D : a set of DTDs. The relationship of each pair of DTDs is known I : a set of document instances Query Q: a DTD query Q( B ): the set of document instances in B R - conforming to Q Idea of the retrieval algorithm: choose the best DTD D from D, which: Satisfy D ≡ R Q, or Satisfy D ⊑ R Q, and there not exists D ’ ⊑ R Q, and D ⊑ R D ’

27 Conclusion and Discussion What ’ s done: A view of DTDs as concepts of the expressive Description Logic DL is given Some questions regarding reasoning tasks on DTDs are answered (conformance, equivalence) Efficient algorithm for retrieving documents is proposed Some special aspects are taken into account: Attributes as properties of elements Documents containing links to other documents etc. ---Ideas: take more advantage of existing DL (e.g., qualified number restrictions) introduce new concepts and roles

28 Conclusion and Discussion What ’ s to be explored: Compare the expressive power between the tree model given in this paper and other proposed data models (e.g., OEM, BDFS) Fully capture further aspects of DTDs Study new types of reasoning on DTDs Parameterized equivalence Document classification