Enriching OWL with Instance Recognition Semantics for Automated Semantic Annotation Stephen W. Liddle Information Systems Department Yihong Ding & David.

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
1 ISWC-2003 Sanibel Island, FL IMG, University of Manchester Jeff Z. Pan 1 and Ian Horrocks 1,2 {pan | 1 Information Management.
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Presented by: Thabet Kacem Spring Outline Contributions Introduction Proposed Approach Related Work Reconception of ADLs XTEAM Tool Chain Discussion.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
OntoBlog: Linking Ontology and Blogs Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of Informatics, Japan 2 Asian.
CS652 Spring 2004 Summary. Course Objectives  Learn how to extract, structure, and integrate Web information  Learn what the Semantic Web is  Learn.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
OWL-AA: Enriching OWL with Instance Recognition Semantics for Automated Semantic Annotation 2006 Spring Research Conference Yihong Ding.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
A Review of Ontology Mapping, Merging, and Integration Presenter: Yihong Ding.
Two-Level Semantic Annotation Model BYU Spring Conference 2007 Yihong Ding Sponsored by NSF.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Query Rewriting for Extracting Data Behind HTML Forms Xueqi Chen, 1 David W. Embley 1 Stephen W. Liddle 2 1 Department of Computer Science 2 Rollins Center.
Augmenting Traditional Conceptual Models to Accommodate XML Structures Stephen W. Liddle Information Systems Department Reema Al-Kamha & David W. Embley.
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
A New Web Semantic Annotator Enabling A Machine Understandable Web BYU Spring Research Conference 2005 Yihong Ding Sponsored by NSF.
The Semantic Web Week 12 Term 1 Recap Lee McCluskey, room 2/07 Department of Computing And Mathematical Sciences Module Website:
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Ontology-Based Free-Form Query Processing for the Semantic Web Mark Vickers Brigham Young University MS Thesis Defense Supported by:
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
From SHIQ and RDF to OWL: The Making of a Web Ontology Language
A Context-Based Mediation Approach to Compose Semantic Web Services Michael Mrissa, Chirine Ghedira, Djamal Benslimane, Zakaria Maamar, Florian Rosenberg,
1 Cui Tao PhD Dissertation Defense Ontology Generation, Information Harvesting and Semantic Annotation For Machine-Generated Web Pages.
Automatic Creation and Simplified Querying of Semantic Web Content An Approach Based on Information-Extraction Ontologies Yihong Ding, David W. Embley,
BYU A Synergistic Semantic Annotation Model December 2007 Yihong Ding,
OIL: An Ontology Infrastructure for the Semantic Web D. Fensel, F. van Harmelen, I. Horrocks, D. L. McGuinness, P. F. Patel-Schneider Presenter: Cristina.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Erasmus University Rotterdam Introduction With the vast amount of information available on the Web, there is an increasing need to structure Web data in.
From a Single Ontologically Sound Conceptual Model to Multiple Physical Schema Languages Bruce T. Bauman, U.S. DoD 1.
Formalizing and Querying Heterogeneous Documents with Tables Krishnaprasad Thirunarayan and Trivikram Immaneni Department of Computer Science and Engineering.
Aidministrator nederland b.v. Adding formal semantics to the Web Jeen Broekstra, Michel Klein, Stefan Decker, Dieter Fensel,
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Towards Translating between XML and WSML based on mappings between.
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
Knowledge representation
Information Systems & Semantic Web University of Koblenz ▪ Landau, Germany Semantic Web - Multimedia Annotation – Steffen Staab
RDF and OWL Developing Semantic Web Services by H. Peter Alesso and Craig F. Smith CMPT 455/826 - Week 6, Day Sept-Dec 2009 – w6d21.
Semantic Web Applications GoodRelations BBC Artists BBC World Cup 2010 Website Emma Nherera.
Michael Eckert1CS590SW: Web Ontology Language (OWL) Web Ontology Language (OWL) CS590SW: Semantic Web (Winter Quarter 2003) Presentation: Michael Eckert.
1 The Modular Structure of Complex Systems Presented by: SeyedMasoud Sadjadi and Wei Zhu David L. Parnas, Paul C. Clement, and David M. Weiss ICSE 1984.
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Xml:tm XML Text Memory Using XML technology to reduce the cost of translating XML documents.
Towards the Semantic Web 6 Generating Ontologies for the Semantic Web: OntoBuilder R.H.P. Engles and T.Ch.Lech 이 은 정
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Working with Ontologies Introduction to DOGMA and related research.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
Extending the MDR for Semantic Web November 20, 2008 SC32/WG32 Interim Meeting Vilamoura, Portugal - Procedure for the Specification of Web Ontology -
ISO/IEC JTC 1/SC 32 Plenary and WGs Meetings Jeju, Korea, June 25, 2009 Jeong-Dong Kim, Doo-Kwon Baik, Dongwon Jeong {kjd4u,
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Personalized Recommendation of Related Content Based on Automatic Metadata Extraction Andreas Nauerz 1, Fedor Bakalov 2, Birgitta.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Ontology-Based Free-Form Query Processing for the Semantic Web Mark Vickers Brigham Young University MS Thesis Defense Supported by:
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
Knowledge Support for Modeling and Simulation Michal Ševčenko Czech Technical University in Prague.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
David W. Embley Brigham Young University Provo, Utah, USA.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST WP4: Ontology Engineering Heiner Stuckenschmidt, Michel Klein Vrije Universiteit.
1 Intelligent Information System Lab., Department of Computer and Information Science, Korea University Semantic Social Network Analysis Kyunglag Kwon.
1 Ontological Foundations For SysML Henson Graves September 2010.
David W. Embley Brigham Young University Provo, Utah, USA
Semantic Markup for Semantic Web Tools:
Grant Number: IIS Institution of PI: Brigham Young University PI’s: David W. Embley, Stephen W. Liddle, Deryle W. Lonsdale Title:
Presentation transcript:

Enriching OWL with Instance Recognition Semantics for Automated Semantic Annotation Stephen W. Liddle Information Systems Department Yihong Ding & David W. Embley Computer Science Department Brigham Young University, Provo, Utah

2 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand Outline Background Instance recognition semantics Automated semantic annotation OWL-AA: OWL for Automated Annotation Related work Conclusion

3 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand Motivation Big Hairy Audacious Goal: deliver a machine- processable semantic web that will let us build agents to work intelligently on our behalf Realistic Shorter-term Goal: provide automatic semantic annotation machinery that turns ordinary web pages into semantic web pages by adding appropriate metadata Goal of this Paper: Show that we can extend OWL with epistemological declarations that let us use our data extraction engine to provide a solid automatic annotator that works with standard ontologies

4 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand Typical Approach Typical approach to automatic annotation: Use existing data extraction engine to wrap and annotate pages BUT these approaches do not match extracted data with an ontology Main drawback: Post-processing step to map extracted data to the ontology

5 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand Our Approach Our data extraction engine is ontology based and does not need a post-processing alignment step We have demonstrated that our approach works (Ding/Embley/Liddle 2006, LNCS 4185) But our ontology language is not a W3C standard OWL is a standard (W3C recommendation) But lacks sufficient declarative instance- recognition semantics for automatic annotation

6 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand A Solution Extend OWL with declarative instance- recognition semantics Call this OWL for Automated Annotation (OWL-AA) OWL-AA extension is epistemological in nature Ontological definitions should be independent of the form of knowledge representation Epistemological definitions may be sensitive to the form of knowledge representation

7 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand Our Contributions OWL-AA extends OWL to provide for automated semantic annotation that Embeds instance-recognition semantics declarations in ontologies and data extraction tools Provides enhanced knowledge sharing and reuse via semantic web Separates domain knowledge creation from semantic annotation process Thus maintaining separation between concerns of domain experts and software developers

8 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand Instance Recognition Semantics Instance Semantics Recognizers (ISR’s) Formal specifications that identify instances of a concept in unstructured, semistructured, or structured text Example: Concept: PhoneNumber Text: “Call me at ” ISR should recognize that in the text has the intensional meaning of PhoneNumber.

9 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand ISR Declarations in Data Frames BedroomNr internal representation: Integer external representation: [1-9]|10 left context phrase: \b right context phrase:.*r(oo)?ms? exception phrase: \s.*ba(th)?s?\b.*r(oo)?ms? context keywords: b(r|d)s?|bdrms?|bed(rooms?)?... LessThan(p1: Price, p2: Price) returns (Boolean) context keywords: less than|<|or less|fewer| end Feature external representation: ApartmentFeature.lexicon... end

10 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand Resiliency There are no document layout dependencies in our ISR declarations Other approaches typically use page layout information for extraction Layout independence (resiliency) is a major benefit Page layouts change periodically on the web Different publishers in the same domain generally use different page layouts Rewriting wrappers is costly

11 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand Annotation (Apartment Rental) Feature External Representation BedroomNr External Representation Context Phrase BathNr External Representation Context Phrase MonthRate External Representation Context Phrase ContactPhone External Representation Context Keyword CAPITOL HILL Luxury 2 bdrm 2 bath, 2 grg, w/d, views, 1700 sq ft. $1250 mo. Call

12 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand ISR-Based Annotation Demo demos section

13 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand extractionPattern P RDFS Graph of OWL-AA ISR ExternalRepresentation ContextualRepresentation subClassOf domain &xsd;string range RegularExpression ContextPhrase LexiconList subClassOf ContextKeyword subClassOf LeftContextPhrase RightContextPhrase subClassOf Exception ISRvalue P subClassOf domain range domain owl:Class range domain hasException P extractionRightContext P extractionKeyword P extractionLeftContext P

14 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand OWL-AA Details (1) Add namespace xmlns:owlaa=“ Add properties to OWL classes associating ISR declarations

15 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand OWL-AA Details (2) Declare ISR value and contextual restriction [1-9]|10 <owlaa:extractionLeftContext rdf:resource=“#leftContext-1”/> <owlaa:extractionRightContext rdf:resource=“#rightContext-1”/> <owlaa:extractionKeyword rdf:resource=“#contextKeyword-1”/>

16 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand OWL-AA Details (3) Add context phrases, context keywords, and exceptions \b... (see our paper for the rest of the details)

17 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand OWL-AA Features Syntactically, OWL-AA is attachment- independent w.r.t. OWL No need to change a single line of existing OWL ontologies Semantically, OWL-AA is also attachment- independent w.r.t. OWL ISR declarations do not change meaning of existing ontology concepts at conceptual level Prototype implementation translates OWL-AA to our own internal format for processing

18 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand Related Work We aren’t the first to work on ISR declarations IBM’s Unstructured Information Management Architecture There are many proposals for extending OWL Two extensions related to ours Context OWL (C-OWL) Extension for ontology mapping that localizes ontology content for global visibility OWL-Eu Extension that enriches OWL with customized datatypes, a significant need Not attachment-independent, unfortunately But our approach is compatible with OWL-Eu too

19 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand Conclusion Automated semantic annotation is an important and fundamental problem for the semantic web Key to automated annotation is ISR declaration Our epistemological OWL-AA extension augments OWL to formalize ISR declarations

20 of 20 6 November 2007 ONISW 2007, Auckland, New Zealand Conclusion (cont.) OWL-AA is fully compatible with ordinary OWL Attachable/detachable OWL-AA does not introduce complexity or decidability issues into OWL Prototype implementation demonstrates that our approach works well Embedding epistemological declarations in OWL is a fruitful area meriting further research