Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.

Slides:



Advertisements
Similar presentations
1 Ontolog OOR Use Case Review Todd Schneider 1 April 2010 (v 1.2)
Advertisements

Intelligent Technologies Module: Ontologies and their use in Information Systems Revision lecture Alex Poulovassilis November/December 2009.
SPARQL Dimitar Kazakov, with references to material by Noureddin Sadawi ARIN, 2014.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Information Integration Using Logical Views Jeffrey D. Ullman.
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
Of 27 lecture 7: owl - introduction. of 27 ece 627, winter ‘132 OWL a glimpse OWL – Web Ontology Language describes classes, properties and relations.
Data Intensive Techniques to Boost the Real-time Performance of Global Agricultural Data Infrastructures SEMAGROW U SING A POWDER T RIPLE S TORE FOR BOOSTING.
USC Graduate Student DayColumbia, SCMarch 2006 Presented by: Jingshan Huang Computer Science & Engineering Department University of South Carolina PhD.
An Extensible System for Merging Two Models Rachel Pottinger University of Washington Supervisors: Phil Bernstein and Alon Halevy.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Ontology Notes are from:
Ontologies and the Semantic Web by Ian Horrocks presented by Thomas Packer 1.
CS652 Spring 2004 Summary. Course Objectives  Learn how to extract, structure, and integrate Web information  Learn what the Semantic Web is  Learn.
Xyleme A Dynamic Warehouse for XML Data of the Web.
A Review of Ontology Mapping, Merging, and Integration Presenter: Yihong Ding.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
1 DCS861A-2007 Emerging IT II Rinaldo Di Giorgio Andres Nieto Chris Nwosisi Richard Washington March 17, 2007.
ANHAI DOAN ALON HALEVY ZACHARY IVES Chapter 6: General Schema Manipulation Operators PRINCIPLES OF DATA INTEGRATION.
A Really Brief Crash Course in Semantic Web Technologies Rocky Dunlap Spencer Rugaber Georgia Tech.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Ontology Matching Basics Ontology Matching by Jerome Euzenat and Pavel Shvaiko Parts I and II 11/6/2012Ontology Matching Basics - PL, CS 6521.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
ToolMatch: Discovering What Tools can be used to Access, Manipulate, Transform, and Visualize Data Patrick West 1 Nancy Hoebelheinrich.
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Metadata Agents and Semantic Mediation Mikhaila Burgess Cardiff University.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Towards Translating between XML and WSML based on mappings between.
Practical RDF Chapter 1. RDF: An Introduction
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
A Semi-automatic Ontology Acquisition Method for the Semantic Web Man Li, Xiaoyong Du, Shan Wang Renmin University of China, Beijing WAIM May 2012.
PART IV: REPRESENTING, EXPLAINING, AND PROCESSING ALIGNMENTS & PART V: CONCLUSIONS Ontology Matching Jerome Euzenat and Pavel Shvaiko.
RDF and OWL Developing Semantic Web Services by H. Peter Alesso and Craig F. Smith CMPT 455/826 - Week 6, Day Sept-Dec 2009 – w6d21.
Relational Databases to RDF (a.k.a RDB2RDF) Juan F. Sequeda Dept of Computer Science University of Texas at Austin.
Master Informatique 1 Semantic Technologies Part 11Direct Mapping Werner Nutt.
10/18/20151 Business Process Management and Semantic Technologies B. Ramamurthy.
Michael Eckert1CS590SW: Web Ontology Language (OWL) Web Ontology Language (OWL) CS590SW: Semantic Web (Winter Quarter 2003) Presentation: Michael Eckert.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Dimitrios Skoutas Alkis Simitsis
Coastal Atlas Interoperability - Ontologies (Advanced topics that we did not get to in detail) Luis Bermudez Stephanie Watson Marine Metadata Interoperability.
Semantic Web - an introduction By Daniel Wu (danielwujr)
XML Schema Integration Ray Dos Santos July 19, 2009.
Advanced topics in software engineering (Semantic web)
Ontology Mapping in Pervasive Computing Environment C.Y. Kong, C.L. Wang, F.C.M. Lau The University of Hong Kong.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
PHS / Department of General Practice Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Knowledge representation in TRANSFoRm AMIA.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
GEMET GEneral Multilingual Environmental Thesaurus leading the way to federated terminologies Stefan Jensen, Head of information services group with input.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
The Semantic Web Riccardo Rosati Dottorato in Ingegneria Informatica Sapienza Università di Roma a.a. 2006/07.
RE-ENGINEERING AND DOMAIN ANALYSIS BY- NISHANTH TIRUVAIPATI.
1 A Medical Information Management System Using the Semantic Web Technology Networked Computing and Advanced INFORMATION MANAGEMENT, NCM '08. Fourth.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Universität Innsbruck Leopold Franzens  Copyright 2007 DERI Innsbruck Second TTF Technical Fair 12 December 2007 Mediation Component Second.
Author: Akiyoshi Matonoy, Toshiyuki Amagasay, Masatoshi Yoshikawaz, Shunsuke Uemuray.
1 Integrating Databases into the Semantic Web through an Ontology-based Framework Dejing Dou, Paea LePendu, Shiwoong Kim Computer and Information Science,
Ontology Technology applied to Catalogues Paul Kopp.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Of 24 lecture 11: ontology – mediation, merging & aligning.
Cross-Ontological Relationships
Web Ontology Language for Service (OWL-S)
Ontology.
Information Networks: State of the Art
Presentation transcript:

Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller

Acknowledgement This work was supported by the project 1ET of the Program Information Society (of the Thematic Program II of the National Research Program of the Czech Republic)Intelligent Models, Algorithms, Methods and Tools for the Semantic Web Realization.

Outline of presentation Introduction Virtual data integration Ontology based system Matching in the system Mapping in the system Query rewriting Conclusion

Introduction Todays world is a world of information Web data use expansion Need of efficient information processing => Semantic web idea (XML, RDF, ontologies) Many data providers, working with distributed data Need of data integration => Semantic web data integration

Virtual data integration Data stays physically stored in original sources Data integration provides an integrated view over distributed data Virtual data integration: Schema matching Schema mapping Query processing

Ontology-based system Sources: Semantic web data (local and global)... RDF/XML Available ontologies for the sources... OWL Task input: sources S i and ontologies O j Use of ontologies: Source ontologies and global ontology for provided integrated data To do matching To describe mapping To query rewriting

Relationships in the system Schema matching – process of searching schema correspondences Schema mapping – description of found schema correspondences, i.e. definition of relation, rule, formula etc. (1-1 rules, use of views, LAV and GAV approaches...) Consider correspondences kinds: Is-a hierarchical relationship, Equivalence Disjointness

Matching and mapping in the system For description of found correspondences in mapping, OWL ontologies and its features are used: rdfs:subClassOf for and owl:equivalentClass for owl:disjointWith for => Ontology O I... ontology of the integration system... contains mapping in the system How is O I obtained?

Matching and mapping in the system Shared ontology case: All data are described in only one (shared) ontology – in that data relationships are described => no need to search somewhere else General case – shared ontology not available: Local ontologies describing data in the local sources Need to obtain shared ontology => Integration local sources ontologies The task is transformed to the ontology merging task Available tools developed when solving this task kind can be employed: Chimaera, PROMPT (Protégé), FCA-MERGE, HCONE (WordNet)

Related work on matching Various approaches searching schema correspondences at different levels: Instance – data processing, e.g. domain Terms – string processing, vocabularies use,... Structure – graphs methods applying,... Classical approaches in schema matching and mapping: Estimation from available information (data, structure, external informational sources, …) Candidates selection (meassures, uncertainty,...) Here, the task is solved by merging ontologies: However, in ontology merging, similar principles as mentioned above are used => similar principles are used at different level

Querying the integrated data Sources S j contains RDF/XML data Querying using SPARQL language Given guery in global environment... Q G However, data available only in local sources with local environments Task: to rewrite the query to the local environment of the local sources with use of mapping... Q L S i Use of mapping for rewriting

Using mapping described in ontology Passing the OWL ontology graph through equivalent or hierarchical relation Using the known OOP rule: a child can substitute its parent For term t: generating set of all possible term rewritings... R(t) End condition: difference in between two passing steps is zero

Using mapping described in ontology

Simple query processing Simple query – only simple condition on RDF triple For each term t in the query generate set of all possible term rewritings … R(t) Using all R(t) for each term in the query obtain all possible query rewritings … Q L Using local queries Q L on local sources obtain local answers Using reverse rewriting return answer placed in global environment … global answer

Simple query rewriting Optimalization: Querying all possible query rewritings in each local source is not effective => Using set of supported terms for each source Obtained from ontology, source schema, source preprocessing… Generating set of all relevant term/query rewritings for each source

Complex query processing Complex query – also complex condition on searched RDF triple Complex query is divided into simple queries by dividing complex condition into simple ones Obtained answers corresponding to simple queries must be composed to the answer corresponding to the original (complex) query

Conclusion Use of ontologies in virtual data integration: Transformes data integration task to ontology merging task Can bring use of formalism, methods and tools from the other task area Can help in task automatization effort Standardized structure instead of particular project oriented mapping rules bring possibility of reuse of mapping Possibility of expression various terms relations Future plans: experiments with real data

Thank for your attention