By Helen Howell. 2 Outline ] What is interoperation? ] Why? ] Problems ] Approach to Interoperations ] ONION ] Cyc.

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Semantic Interoperability & Semantic Models: Introduction
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
DAML Ontology Interoperation Using Cyc
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
Lukas Blunschi Claudio Jossen Donald Kossmann Magdalini Mori Kurt Stockinger.
INFO624 - Week 2 Models of Information Retrieval Dr. Xia Lin Associate Professor College of Information Science and Technology Drexel University.
Interoperation of Information Sources via Articulation of Ontologies Prasenjit Mitra, Gio Wiederhold Stanford University Supported by AFOSR- New World.
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
The Experience Factory May 2004 Leonardo Vaccaro.
WMES3103 : INFORMATION RETRIEVAL
1 Draft of a Matchmaking Service Chuang liu. 2 Matchmaking Service Matchmaking Service is a service to help service providers to advertising their service.
A Review of Ontology Mapping, Merging, and Integration Presenter: Yihong Ding.
How can Computer Science contribute to Research Publishing?
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
XML on Semantic Web. Outline The Semantic Web Ontology XML Probabilistic DTD References.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya F. Noy and Mark A. Musen.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Semi-Automatic Generation of Mini-Ontologies from Canonicalized Relational Tables Chris Hathaway Supported by NSF.
Semi-Automatic Generation of Mini-Ontologies from Canonicalized Relational Tables Chris Hathaway.
March 2000 Gio XIT 1 Increasing the Precision of Semantic Interoperation Gio Wiederhold Stanford University March 2000 report: www-db.stanford.edu/pub/gio/1999/miti.htm.
Ontology Mapping with Cyc doug foxvog 14 July 2004
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Metadata Agents and Semantic Mediation Mikhaila Burgess Cardiff University.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
1 The BT Digital Library A case study in intelligent content management Paul Warren
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
Peer-to-Peer Data Integration Using Distributed Bridges Neal Arthorne B. Eng. Computer Systems (2002) Supervisor: Babak Esfandiari April 12, 2005 Candidate.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
Dimitrios Skoutas Alkis Simitsis
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
Database Systems: Enhanced Entity-Relationship Modeling Dr. Taysir Hassan Abdel Hamid.
LOD for the Rest of Us Tim Finin, Anupam Joshi, Varish Mulwad and Lushan Han University of Maryland, Baltimore County 15 March 2012
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
10/24/09CK The Open Ontology Repository Initiative: Requirements and Research Challenges Ken Baclawski Todd Schneider.
S calable K nowledge C omposition Ontology Interoperation January 19, 1999 Jan Jannink, Prasenjit Mitra, Srinivasan Pichai, Danladi Verheijen, Gio Wiederhold.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Object storage and object interoperability
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
Enhanced hypertext categorization using hyperlinks Soumen Chakrabarti (IBM Almaden) Byron Dom (IBM Almaden) Piotr Indyk (Stanford)
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
Mechanisms for Requirements Driven Component Selection and Design Automation 최경석.
COP Introduction to Database Structures
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Associative Query Answering via Query Feature Similarity
Tools For Resolving Heterogeneity Computer Science Department
Presented by: Prof. Ali Jaoua
Stanford University, CSD
Magnet & /facet Zheng Liang
A Semantic Peer-to-Peer Overlay for Web Services Discovery
Tantan Liu, Fan Wang, Gagan Agrawal The Ohio State University
Presentation transcript:

By Helen Howell

2 Outline ] What is interoperation? ] Why? ] Problems ] Approach to Interoperations ] ONION ] Cyc

3 What is Interoperation? Interoperation may be defined as a loose coupling across information sources, semantic metadata descriptions and ontologies

4 Why? ] Share information

5 Domain Differences ] Naming attribute items differently ] payroll - EMP; personnel - PEOPLE ] Scope ] payroll may include suport for student benefits for employee’s children; personnel - employees only

6 Domain Differences (cont.) ] Encoding ] ssn with and without hypens ] Attribute scopes-Semantic hetergeneity ] hot has different meaning in weather domain than in truck-engine domain

7 Integration Problems ] Data that is available is subject to implicit assumptions which means that it cannot be reused out of context. ] Data is often duplicated and labeled under different terms. e.g., Address verses Addr

8 Approaches ] Direct Translation ] Single Shared Ontology ] Multiple Shared Ontology

9 Direct Translation For each ontology a set of translating functions is provided to allow mappings with the other ontologies.

10 Direct Translation ] Only feasible if a few ontologies ] OBSERVER - mappings between a term in a source ontology and its synonym in a destination ontology. ] Set up relationships e.g., Volcano effects Environment

11 Single Shared Ontology Often standards are not convenient to use since they have to be suitable for all potential uses. So a single subset is formed as a standard

12 Multiple Shared Ontologies Locates shared knowledge in multiple but smaller shared ontologies

13 Multiple Shared Ontologies ] Referred to as Resource Clustering or Ontology Clustering ] Approach more flexible and scalable ] No longer have to commit to a comprehensive ontology

14 Multiple Ontologies ] Normally organized into a hierarchy ] Top level provides general definitions share by all resources ] Lower levels provide definitions of concepts that extend and characterize upper levels

15 Mediation-Based Information Systems ] Mediators provide intelligent middleware services - matching resources to applications - resources are autonomous and heterogeneous

16 Mediation Based Information Systems ] Identify most relevant resources for user’s needs ] Retrieve the relevant data ] Translate it to a common representation e.g., Nevada, NV, NV USA

Interoperation of Information Sources via Articulation of Ontologies Prasenjit Mitra, Gio Wiederhold Stanford University Supported by AFOSR- New World Vistas Program

18 Interoperation of Information Sources ] Compose information ] Multiple independent, heterogeneous sources ] Reliability, scalability ] Semantic heterogeneity - same term different semantics - different term same semantics

19 Onion System

20 Articulation Generator Driver Ont1Ont2 Phrase Relator Thesaurus Structural Matcher Semantic Network Human Expert Context-based Word Relator OntA

21 Preliminaries: Ontology ] Ontology - hierarchy of terms and specification of their properties. ] Modeled as a directed labeled graph + set of rules. ] Ont = ( V, E, R) V - set of nodes(concepts) E - set of edges(properties) R - set of rules involving V,E

22 Articulation Rules ] Articulation rules - logic-based rules that relate concepts in two ontologies: -Binary Relationships (O1.Car SubClassOf O2.Vehicle) (O1.Buyer Equ O2.Owner) (O2.LuxuryCar SubClassOf O1.Car)

23 Example Car H123 Instance List_Price VID PMitra Name Unit $ 40kCA Address Amount Vehicle LuxuryCar PMitra Buyer Instance SubClass Instance VIN H123 OwnerRetail_ Price CA NameAddr $35k SubClass Buyer Owner Attribute Equ

24 Articulation Rules (contd.) ] Horn Clauses - (O1.Car O1.Instance X), (X O1.Price Y), (Y > $30000) => (O2.LuxuryCar Instance X) - (V O2.Retail_Price P), (C O1.List_Price L), (L Unit U), (L Amount A), (V Equ C) => (P Equ concat(U,A))

25 Contributions ] Articulation Generation Toolkit - produce translation rules semi-automatically - a library of reusable heuristic methods - a GUI to display ontologies and interact with the expert ] Ontology Algebra - query rewriting and planning

26 Articulation Generation Methods ] Non-iterative Methods - Lexical Matcher - Thesaurus-based Matcher - Corpus-based Matcher - Instance-based Matcher ] Iterative Methods - Structural Matcher - Inference-based Matcher

27 Lexical Methods ] Preprocessing rules. -Expert-generated seed rules. e.g., (O1.List_Price Equ O2.Retail_Price) -Context-based preprocessing directives. e.g., (O1.UK_Govt Equ O2.US_Govt) -Stop-word Removal & Stemming -Word match (full or partial) -Phrase match e.g., (O1.Ministry_Of_Defence Equ O2.Defense_Ministry) 0.6

28 Thesaurus-based methods ] Consult a dictionary/thesaurus to find synonyms, related words ] Generate a similarity measure or relatedness measure - words that have similar words in their definitions are similar ] Get more semantically meaningful relationships from WordNet (syn, hyper)

29 Candidate Match Repository Term linkages automatically extracted from 1912 Webster’s dictionary * * free, other sources. being processed. Based on processing headwords ý definitions Notice presence of 2 domains: chemistry, transport

30 Corpus-based ] Collect a set of text documents preferably from same domain - search using keywords in google ] Build a context vector (1000-character neighbourhood) for each word ] Compute word-pair similarity based on the cosine of the vectors ] Use word-pair similarity to find similarity among labels of nodes/edges

31 Structural Methods ] Uses results of lexical match ] If x% of parent nodes match & y% of children nodes match ] Special relations (AttributeOf) match

32 Tools to create articulations Graph matcher for Articulation- creating Expert Vehicle ontology Transport ontology Suggestions for articulations

33 continue from initial point Also suggest similar terms for further articulation: by spelling similarity, by graph position by term match repository Expert response: 1. Okay 2. False 3. Irrelevant to this articulation All results are recorded Okay ’s are converted into articulation rules

34 An Ontology Algebra Operations can be composed Operations can be rearranged Alternate arrangements can be evaluated Optimization is enabled The record of past operations can be kept and reused when sources change

35 Binary Operators A knowledge-based algebra for ontologies The Articulation Function (ArtGen), given two ontologies, supplies articulation rules between them. Intersection create a subset ontology keep sharable entries Union create a joint ontology merge entries Difference create a distinct ontology remove shared entries

36 Intersection: Definition ] O1 = (V1, E1, R1) O2 = (V2, E2, R2) ] OI = ( O1 Int ArtGen O2) = (VI, EI, RI) ] Arules = ArtGen( O1, O2 ) ] VI = Nodes( Arules ) ] EI = Edges( Arules ) + Edges(E1, VI.V1) + Edges( E2, VI.V2) ] RI = Arules + Rules( R1, VI.V1) + Rules( R2, VI.V2)

37 Intersection: Example ] ARules = { (O1.Car SubClass O2.LuxuryCar), (O1.MSRP Equ O2.Price)} ] NI = ( O1.Car, O1.MSRP, O2.LuxuryCar, O2.Price ) ] EI = Edges(ARules) + {(O1.Car Attribute O1.MSRP), (O2.LuxuryCar Attribute Price)} InexpCar Car LuxuryCar MSRPPrice LuxuryTax SubClass Equ O1 O2 OI

38 Intersection: Properties ] Commutative? OI 12 =(O1 Int ArtGen O2) = (O2 Int ArtGen O1)=OI 21 VI 12 = VI 21, EI 12 = EI 21, RI 12 =RI 21 ARules 12 = ARules 21 ArtGen(O1, O2) = ArtGen(O2, O1) ] Int ArtGen is commutative iff ArtGen is commutative

39 Semantically Commutative Example: ] ArtGen(O1,O2) : ( O1.Car SubClassOf O2.Vehicle) ] ArtGen(O2,O1) : ( O2.Vehicle SuperClassOf O1.Car) ] Defn: ArtGen is Semantically Commutative iff ArtGen(O1,O2) ArtGen(O2,O1) Operands to intersection can be rearranged if ArtGen is semantically commutative

40 Associativity Example ] Example: ] ArtRules(O1, O3) : (O1.Car SubClassOf O3.Vehicle) ] ArtRules(O2, O3) : (O2.Truck SubClassOf O3.Vehicle) ] ArtRules(O1, O2) : null ] Intersection is not associative!

41 Associativity ] (O1.O2).O3 = O1.(O2.O3) iff ArtGen is consistent and transitively connective ] consistent - given two nodes it generates the same relationship irrespective of relationships between other nodes

42 Associativity (contd.) ] ArtGen relates A and B: (A Rel B) = (A R1 B) or (B R2 A) ] Transitively connective: If ArtGen generates (A Rel B), (B Rel C) then it also generates (A Rel C’)  where A  O1, B  O2, C,C’  O3

43 Using the Articulation Rules ] Q(X) :- (LuxuryCar Instance X), (X Owner Y), (Y Addr `CA’) ] Translated using Articulation Rules Q(X) :- (Car Instance X),(X List_Price Y) (Y Unit ‘$’),(Y Amount A), (A > 30000), (X Buyer Z), (Z Address ‘CA’)

44 Conclusion ] Introduced an Ontology Interoperation System. ] Interoperation based on articulations that bridge the semantic gap. ] Graph-oriented data model, logical rules ] Founded on Ontology Algebra Thanks to Stefan Decker, Alex Carobus, Jan Jannink, Sergey Melnik, Shrish Agarwal

DAML Ontology Interoperation Using Cyc Steve Reed Cycorp

46 DAML Excitement at Cycorp Promoting DAML ontology interoperability through Cyc, as a reference ontology. Cyc now imports and exports DAML, paraphrases and reasons about it. Cyc lexical tools assist the mapping of imported DAML ontologies into Cyc. Both UDDI categorization schemes imported as DAML into Cyc: –UNSPSC (via KSL) and NAICS Cycorp is featuring DAML in it’s upcoming release of OpenCyc (this fall).

47 DAML Challenges at Cycorp How to automate mapping and translation of DAML ontologies. How to get the user community to help map numerous DAML terms -- interest growing at OpenCyc. Importation of large DAML schema is daunting but underway, WordNet, OpenDirectory, UNSPSC, NAICS Provide an inference engine for the Semantic Web - subsumption, classification, constraints, well-formed statement validation, diagnostics, justifications. What web services interface? SOAP likely.

DAML Ontology Interoperation Cyc Ontology UNSPSC Ontology NAICS Ontology Mapping 1: UNSPSC to Cyc Mapping 2: NAICS to Cyc User Search for: “waterproof plywood” Search for: UNSPSC/7311 Hardwood Int. … marine plywood … UDDI Web Services Return: marine plywood by Hardwood Int.

49 Ontology Mapping From Doerr, Semantic Problems of Thesaurus Mapping DAML Import

50 Ontology Mapping From Doerr, Semantic Problems of Thesaurus Mapping DAML Import Cyc reference

51 UNSPC/NAICS Mapping Universal Standard Products and Services Classification, North American Industrial Classification System Both business taxonomies are monohierarchies, entities should have one code assigned for them. Cyc: #$MonohierarchyClassificationSystem implies that each collection is #$partitionedInto its narrower terms. Mapping is targeted at facets of the reference Cyc terms via functional composition.

52 Mapping to Semantic Facets (#$isa (#$BusinessFacetFn #$HardwoodInternational #$Hardware) #$NAICS-Hardware-Store)) Hardwood International is an online store, selling mainly lumber but offering hardware as well, but NAICS is a monohierarchy. Hardwood International Hardware (#$isa (#$BusinessFacetFn #$HardwoodInternational #$Hardwood) #$NAICS-Other-Building-Materials-Dealers)) Hardwood Business facets

53 Conclusion ] Ontology Interoperation is not without it problems ] Hetergeneous data can work together

By Helen Howell