Tools For Resolving Heterogeneity Computer Science Department

Slides:



Advertisements
Similar presentations
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
Advertisements

Data Structures.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Schema Matching and Data Extraction over HTML Tables Cui Tao Data Extraction Research Group Department of Computer Science Brigham Young University supported.
Copyright Irwin/McGraw-Hill Data Modeling Prepared by Kevin C. Dittman for Systems Analysis & Design Methods 4ed by J. L. Whitten & L. D. Bentley.
Status of Mediation Technology Gio Wiederhold Stanford University Oct 1999 SNU -- KINS.
A Review of Ontology Mapping, Merging, and Integration Presenter: Yihong Ding.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya Fridman Noy and Mark A. Musen.
March 2000 Gio XIT 1 Increasing the Precision when Obtaining Information from the Web Gio Wiederhold Stanford University 4 April 2000 related report: www-db.stanford.edu/pub/gio/1999/miti.htm.
1 © Prentice Hall, 2002 Chapter 5: Logical Database Design and the Relational Model Modern Database Management 6 th Edition Jeffrey A. Hoffer, Mary B.
August 2000 Gio vdR 1 Increasing the Precision of Semantic Interoperation Gio Wiederhold Stanford University August 2000 Reind van de Riet celebration.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya F. Noy and Mark A. Musen.
April 2002 Gio ArtInt 1 Information Interoperation versus Integration Gio Wiederhold Stanford University April 2002 www-db.stanford.edu/people/gio.html.
ENTITY RELATIONSHIP DATA MODEL ELIZABETH GEORGE. INTRODUCTION Architect Aeronautical engineers Computer architects Traffic engineers.
Ontology Maintenance with an Algebraic Methodology: a Case Study Jan Jannink, Gio Wiederhold Presented by: Lei Lei.
School of something FACULTY OF OTHER School of Computing FACULTY OF ENGINEERING PROJECT VISTA: Integrating Heterogeneous Utility Data A very brief overview.
March 2000 Gio XIT 1 Increasing the Precision of Semantic Interoperation Gio Wiederhold Stanford University March 2000 report: www-db.stanford.edu/pub/gio/1999/miti.htm.
Generative Programming. Generic vs Generative Generic Programming focuses on representing families of domain concepts Generic Programming focuses on representing.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
Database Systems: Design, Implementation, and Management Ninth Edition
Chapter 1 Database Systems. Good decisions require good information derived from raw facts Data is managed most efficiently when stored in a database.
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
Systems Analysis and Design in a Changing World, Fifth Edition
Maintaining Information Systems Modern Systems Analysis and Design.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
Concepts and Terminology Introduction to Database.
Mobile Topic Maps for e-Learning John McDonald & Darina Dicheva Intelligent Information Systems Group Computer Science Department Winston-Salem State University,
Reviewing Recent ICSE Proceedings For:.  Defining and Continuous Checking of Structural Program Dependencies  Automatic Inference of Structural Changes.
Professor Michael J. Losacco CIS 1110 – Using Computers Database Management Chapter 9.
Generative Programming. Automated Assembly Lines.
Component Based SW Development and Domain Engineering 1 Component Based Software Development and Domain Engineering.
Database Systems: Enhanced Entity-Relationship Modeling Dr. Taysir Hassan Abdel Hamid.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
5 Systems Analysis and Design in a Changing World, Fifth Edition.
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
1 © Prentice Hall, 2002 Chapter 5: Logical Database Design and the Relational Model Modern Database Management 6 th Edition Jeffrey A. Hoffer, Mary B.
Team Members Dilip Narayanan Gaurav Jalan Nithya Janarthanan.
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
S calable K nowledge C omposition Ontology Interoperation January 19, 1999 Jan Jannink, Prasenjit Mitra, Srinivasan Pichai, Danladi Verheijen, Gio Wiederhold.
Domain-Expert Repository Management for Adaptive Hypermedia Learning System By Norazah Yusof & Paridah Samsuri Members of SPAtH Group Faculty of Comp.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Faculty Faculty Richard Fikes Edward Feigenbaum (Director) (Emeritus) (Director) (Emeritus) Knowledge Systems Laboratory Stanford University “In the knowledge.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Conceptualization Relational Model Incomplete Relations Indirect Concept Reflection Entity-Relationship Model Incomplete Relations Two Ways of Concept.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Logical Database Design and the Relational Model.
Department of Mathematics Computer and Information Science1 CS 351: Database Management Systems Christopher I. G. Lanclos Chapter 4.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Of 24 lecture 11: ontology – mediation, merging & aligning.
Technische Universität München © Prof. Dr. H. Krcmar An Ontology-based Platform to Collaboratively Manage Supply Chains Tobias Engel, Manoj Bhat, Vasudhara.
Chapter 4 Logical Database Design and the Relational Model
Chapter 4: Logical Database Design and the Relational Model
Achieving Semantic Interoperability of Cancer Registries
Chapter 5: Logical Database Design and the Relational Model
Component Based Software Engineering
February 2001 Gio Wiederhold Stanford University
Using Knowledge to Exploit Data
8. Educational Challenges
Research Issues in Electronic Commerce
Heterogeneous Information Management
Obtaining Precision when Integrating Information
Semantic Precision for Web-based Interoperation
Extracting Semantic Concept Relations
Stanford University, CSD
CHAPTER 4: LOGICAL DATABASE DESIGN AND THE RELATIONAL MODEL
Data Model.
CVE.
Block Matching for Ontologies
CS561- Advanced topics in database systems
Presentation transcript:

Tools For Resolving Heterogeneity Computer Science Department . Tools For Resolving Heterogeneity in Ontologies SKC Project Computer Science Department Stanford University Gio Wiederhold, PI 10/13/2018 Gio Wiederhold SKC RKF 1

Problem Addressed by SKC Ontologies come from many autonomous sources Differing viewpoints (by source) differing coverage vehicles (DMV, AIA) differing granularity trucks (shipper, manuf.) differing terms for similar items { lorry, truck } same terms for dissimilar items trunk(luggage, car) Created by focused groups high quality used in commerce ongoing maintenance annual models Poor precision when merged ok for web browsing , poor for business 10/13/2018 Gio Wiederhold SKC RKF 2

Domains and Consistency . a domain will contain many objects the object configuration is consistent within a domain all terms are consistent & relationships among objects are consistent context is implicit No committee is needed to forge compromises * within a domain Domain Ontology Compromises hide valuable details 10/13/2018 Gio Wiederhold SKC RKF 3

Heterogeneity among Domains If interoperation involves distinct domains mismatch ensues Autonomy conflicts with consistency, Local Needs have Priority, Outside uses are a Byproduct Heterogeneity must be addressed Platform and Operating Systems 4 4 Representation and Access Conventions 4 Naming and Ontology : 10/13/2018 Gio Wiederhold SKC RKF 4

SKC Objective Provide for Maintainable Ontologies devolve maintenance onto many domain-specific experts / authorities provide an algebra to compute composed ontologies that are limited to their articulation terms enable interpretation within the source contexts SKC 10/13/2018 Gio Wiederhold SKC RKF 5

An Ontology Algebra A knowledge-based algebra for ontologies The Articulation Ontology (AO) consists of matching rules that link domain ontologies Intersection create a subset ontology keep sharable entries Union create a joint ontology merge entries Difference create a distinct ontology remove shared entries 10/13/2018 Gio Wiederhold SKC RKF 6

Sample Operation: INTERSECTION Result contains shared terms, useful for purchasing Articulation Source Domain 1: Owned and maintained by Store Source Domain 2: Owned and maintained by Factory 10/13/2018 Gio Wiederhold SKC RKF 7

Tools to create articulations Graph matcher for Articulation- creating Expert Vehicle ontology Transport ontology Suggestions for articulations 10/13/2018 Gio Wiederhold SKC RKF 8

continue from initial point Also suggest similar terms for further articulation: by spelling similarity, by graph position by term match repository Expert response: 1. Okay 2. False 3. Irrelevant to this articulation All results are recorded Okay’s are converted into articulation rules 10/13/2018 Gio Wiederhold SKC RKF 9

Candidate Match Repository Term linkages automatically extracted from 1912 Webster’s dictionary * * free, other sources . being processed. Based on processing headwords ý definitions using algebra primitives Notice presence of 2 domains: chemistry, transport 10/13/2018 Gio Wiederhold SKC RKF 10

Using the Match Repository 10/13/2018 Gio Wiederhold SKC RKF 11

Using the Match Repository 10/13/2018 Gio Wiederhold SKC RKF 12

Rules for Real-Time Data if [base_station.receiving] = true then satellite_data = [base_station] satellite_data.timestamp = now if [satellite_data.age] < 24 hours or [radio_jamming.level] > 30% then recon_data = [satellite_data] except when [flight_data.age] < 1 hour or [rain_sensor.daytotal] > 1 inch then recon_data = [flight_data] assert [recon_data] 10/13/2018 Gio Wiederhold SKC RKF 13

INTERSECTION support Articulation ontology Matching rules that use terms from the 2 source domains Terms useful for purchasing Store Ontology Factory Ontology 10/13/2018 Gio Wiederhold SKC RKF 14

Other Basic Operations UNION: merging entire ontologies DIFFERENCE: material fully under local control Arti- culation ontology typically prior intersections 10/13/2018 Gio Wiederhold SKC RKF 15

Features of an algebra Operations can be composed Operations can be rearranged Alternate arrangements can be evaluated Optimization is enabled The record of past operations can be kept and reused when sources change 10/13/2018 Gio Wiederhold SKC RKF 16

Knowledge Composition Composed knowledge for applications using A,B,C,E Articulation knowledge for U (A B) (B C) (C E) Articulation knowledge Legend: U : union U : intersection U (C E) Articulation knowledge for Knowledge resource E Knowledge resource C (A B) U U (B C) U (C D) Knowledge resource A Knowledge resource B Knowledge resource D 10/13/2018 Gio Wiederhold SKC RKF 17

Primitive Operations Model and Instance Constructors create object create set Connectors match object match set Editors insert value edit value move value delete value Converters object - value object indirection reference indirection Unary Summarize -- abstract Glossarize - list terms Filter - reduce instances Extract - move into context Binary Match - data corrobaration Difference - distance measure Intersect - use of articulation Union - search broadening 10/13/2018 Gio Wiederhold SKC RKF 18

Exploiting the result . Processing & query evaluation is best Avoid n2 problem of interpreter mapping [Swartout HPKB year 1] Result has links to source Processing & query evaluation is best performed within Source Domains & by their engines 10/13/2018 Gio Wiederhold SKC RKF 19

Sample Processing in HPKB What is the most recent year an OPEC member nation was on the UN security council? Related to DARPA HPKB Challenge Problem SKC resolves 3 Sources CIA Factbook ‘96 (nation) OPEC (members, dates) UN (SC members, years) SKC obtains the Correct Answer 1996 (Indonesia) Problems resolved by SKC Factbook has out of date OPEC & UN SC lists Indonesia not listed Gabon (left OPEC 1994) different country names Gambia => The Gambia historical country names Yugoslavia UN lists future security council members Gabon 1999 intent of original question Temporal variants 10/13/2018 Gio Wiederhold SKC RKF 20

Domain Specialization . Knowledge Acquisition (20% effort) & Knowledge Maintenance (80% effort *) to be performed Domain specialists Professional organizations Field teams of modest size Empowerment automously maintainable * based on experience with software 10/13/2018 Gio Wiederhold SKC RKF 21