Next Generation Semantic Web Applications Prof. Enrico Motta Director, Knowledge Media Institute The Open University Milton Keynes, UK.

Slides:



Advertisements
Similar presentations
…to Ontology Repositories Mathieu dAquin Knowledge Media Institute, The Open University From…
Advertisements

Opportunistic Reasoning for the Semantic Web: Adapting Reasoning to the Environment Carlos Pedrinaci Tim Smithers and Amaia Bernaras.
Controlled Vocabularies in TELPlus Antoine ISAAC Vrije Universiteit Amsterdam EDLProject Workshop November 2007.
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
WP8: User Centred Applications Enrico Motta, Marta Sabou, Vanessa Lopez, Laurian Gridinoc, Lucia Specia Knowledge Media Institute The Open University Milton.
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Using Watson for Building Intelligent Applications in E-learning Mathieu d’Aquin The Knowledge Media Institute, The Open University
OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
OntoBlog: Linking Ontology and Blogs Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of Informatics, Japan 2 Asian.
Web3.0 and Language Resources Marta Sabou Knowledge Media Institute (KMi) The Open University Exploiting Semantic Web Ontologies: An Experimental Report.
Exploiting the Semantic Web: Next Generation Semantic Web Applications in KMi Watson, PowerMagpie, PowerAqua, … Mathieu d’Aquin Laurian Gridinoc Vanessa.
Low-cost semantics-enhanced web browsing with Magpie Enrico Motta Knowledge Media Institute The Open University, UK.
Developing Semantic Web Sites: Results and Lessons Learnt Enrico Motta, Yuangui Lei, Martin Dzbor, Vanessa Lopez, John Domingue, Jianhan Zhu, Liliana Cabral,
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Ontology-Based Applications in the Age of the Semantic Web Prof Enrico Motta, PhD Knowledge Media Institute The Open University Milton Keynes, UK.
The Informative Role of WordNet in Open-Domain Question Answering Marius Paşca and Sanda M. Harabagiu (NAACL 2001) Presented by Shauna Eggers CS 620 February.
Watson Supporting Next Generation Semantic Web Applications Mathieu d’Aquin, Claudio Baldassarre, Laurian Gridinoc, Marta Sabou, Sofia Angeletou, Enrico.
By : Vanessa López, Enrico Motta Knowledge Media Institute. Open University Ontology-driven question answering in: AQUALog 9 th International Conference.
IST NeOn-project.org The Semantic Web is growing… #SW Pages Lee, J., Goodwin, R. (2004) The Semantic.
Web Logs and Question Answering Richard Sutcliffe 1, Udo Kruschwitz 2, Thomas Mandl University of Limerick, Ireland 2 - University of Essex, UK 3.
Exploiting Large-Scale Semantics on the Web Prof. Enrico Motta Director, Knowledge Media Institute The Open University Milton Keynes, UK.
The Semantic Web Prof. Enrico Motta Knowledge Media Institute The Open University.
Characterizing Semantic Web Applications Prof. Enrico Motta Director, Knowledge Media Institute The Open University Milton Keynes, UK.
Alternatives to Metadata IMT 589 February 25, 2006.
Towards a new generation of semantic web applications Prof. Enrico Motta, PhD Knowledge Media Institute The Open University Milton Keynes, UK.
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
On Roles of Models in Information Systems (Arne Sølvberg) Gustavo Carvalho 26 de Agosto de 2010.
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
GENERAL CONCEPTS OF OOPS INTRODUCTION With rapidly changing world and highly competitive and versatile nature of industry, the operations are becoming.
Search Engines and Information Retrieval Chapter 1.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
1 The BT Digital Library A case study in intelligent content management Paul Warren
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
LIS510 lecture 3 Thomas Krichel information storage & retrieval this area is now more know as information retrieval when I dealt with it I.
Semantic Web Browsing for the VISWE Learning Community Arthur Stutt (some slides from ISWC 2003 talk) Knowledge Media Institute, The Open University, UK.
SemSearch: A Search Engine for the Semantic Web Yuangui Lei, Victoria Uren, Enrico Motta Knowledge Media Institute The Open University EKAW 2006 Presented.
Towards an ecosystem of data and ontologies Mathieu d’Aquin and Enrico Motta Knowledge Media Institute The Open University.
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
Problems in Semantic Search Krishnamurthy Viswanathan and Varish Mulwad {krishna3, varish1} AT umbc DOT edu 1.
Evaluating Semantic Metadata without the Presence of a Gold Standard Yuangui Lei, Andriy Nikolov, Victoria Uren, Enrico Motta Knowledge Media Institute,
Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Service Service metadata what Service is who responsible for service constraints service creation service maintenance service deployment rules rules processing.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
And the Watson Plugin for the NeOn Toolkit. IST NeOn-project.org The Semantic Web is growing… #SW Pages.
From XML to DAML – giving meaning to the World Wide Web Katia Sycara The Robotics Institute
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Characterizing Knowledge on the Semantic Web with Watson Mathieu d’Aquin, Claudio Baldassarre, Laurian Gridinoc, Sofia Angeletou, Marta Sabou, Enrico Motta.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
NeOn Components for Ontology Sharing and Reuse Mathieu d’Aquin (and the NeOn Consortium) KMi, the Open Univeristy, UK
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
A Research Programme for the Semantic Web
Exploiting Large Scale Web Semantics
CCNT Lab of Zhejiang University
Ontology Evolution: A Methodological Overview
Extracting Semantic Concept Relations
ece 627 intelligent web: ontology and beyond
CS246: Information Retrieval
Presentation transcript:

Next Generation Semantic Web Applications Prof. Enrico Motta Director, Knowledge Media Institute The Open University Milton Keynes, UK

Structure of the Talk Quick Recap: What is the Semantic Web? State of the art: 1st Generation SW Applications –Emphasis on ontology-driven data aggregation –Limited with respect to their ability to exploit large scale, heterogeneous semantic markup Key research issues –What needs to be done to enable the effective development of the next generation of SW Applications –Need for a different approach to some key res. areas –How the SW itself can be exploited to address such key research issues

Quick Recap: What is the Semantic Web?

The Semantic Web A large scale, heterogenous collection of formal, machine processable, ontology-based statements (semantic metadata) about web resources and other entities in the world, expressed in a XML-based syntax

Ontology Metadata UoD

Person Organization String Organization-Unit partOf hasAffiliation worksInOrgUnit hasJobTitle Enrico Motta kmi director enrico motta Enrico Motta <akt:hasHomePage rdf:resource="

SW = A Conceptual Layer over the web

SW is Heterogeneous!

Generating semantic markup

Key aspects of the SW Size (= Huge) –Sem. markup (eventually to reach) the same order of magnitude as the web Conceptual Heterogeneity (= Big) –Sem. markup based on many different ontologies Rate of change (= Very High) –Data generated all the time from human and artificial agents… Provenance (= Very Heterogeneous) –….Hence provenance itself is extremely heterogeneous Trust (= very variable and subjective) –A side-effect of heterogeneous provenance Data Quality (= very variable) –No guarantee of correctness Intelligence (= by-product of size and heterogeneity) –Rather than a by-product of sophisticated problem solving

Compare with traditional KBS Size (= Small or Medium) –KBS normally small to medium size Conceptual Heterogeneity (= Not an issue) –KBS normally based on a single conceptual model Rate of change (= Very Low) –Change rate under developers' control (hence, low) Provenance (= Not an issue) –KBS are normally created ad hoc for an application by a centralised team of developers Trust (= not a major issue) –Centralisation of devpt. process implies no significant trust issues Data Quality (= not a major issue) –Again, centralisation guarantees data quality across the board Intelligence (= by-product of complex, task-centric reasoning) –E.g., sophisticated diagnostic, planning systems…

The Semantic Web today 1st Generation SW Applications

CS Dept Data AKT Reference Ontology RDF Data Bibliographic Data

Typically use a single ontology –Usually providing a homogeneous view over heterogeneous data sources Limited use of existing SW data Closed to semantic resources Limited interactivity –In contrast with typical web 2.0 applications Features of 1st generation SW Applications Hence: current SW applications are far more similar to traditional KBS (closed semantic systems) than to 'real' SW applications (open semantic systems)

It is still early days..

Next Generation SW Applications

Next generation SW applications NG SW Application Able to exploit the SW at large –Hence: Multi-Ontology Supporting interactivity –E.g., allowing users to add semantic data –Hence, open with respect to SW resources Ideally also able to exploit non-SW data –E.g., folksonomies –Hence, embedding powerful information extraction engines

Two systems we have built MagpieAquaLog

Magpie Components Enriched Web Page Semantic Log (found-item localhost #u" e/motta/" john-domingue john-domingue) (found-item localhost Jabber Server Magpie Hub Ontology cache (Lexicon) Problem Domain & Resources Ontology based Proxy Server Web Page

AquaLog: Ontology-Driven Question Answering Which is the capital of Spain? NL SENTENCE INPUT QUERY TRIPLES ANSWER (?, capital, Spain) Linguistic AnalysisMapping Engine RESULT TRIPLES NL Generation Madrid

Need for mechanisms for automatically identifying semantic markup relevant to the current page, user, browsing session, etc.. PowerMagpie: Semantic browsing on the 'open' SW

Need for mechanisms for automatically locating ontologies relevant to the current query, map user terminology to ontologies, integrate info from different ontologies, etc.. PowerAqua: QA on the 'open' semantic web

What needs to be done to facilitate the development of such 2nd generation SW applications?

Dynamic Ontology Selection First: powerful support for ontology selection Both PowerAqua and PowerMagpie heavily rely on ontology selection to locate possibly relevant knowledge in response to –User queries (PowerAqua) –Accessing web pages (PowerMagpie) Hence, ontology selection is a crucial task for both systems

Current support for ontology selection

Limitations of Swoogle Query/Search –Only keyword search, we need more powerful query methods (e.g., ability to pose formal queries) Repository structure –Very weak in Swoogle, not even duplicates are dealt with –Need for automatic derivation of relations between ontologies E.g., same-ontology-as, ontology-extends, ontology- incompatible-with, etc….. –We need these relations to structure the repository and to support more powerful ranking methods (see next bp) Ontology ranking –Swoogle only uses a 'popularity-based' one, we need other methods as well

We also need: Methods for fast extraction of ontology modules –Typically we only want the part of the ontology relevant to our current needs Methods for the integration of information derived from different ontologies –In the context of QA this problem typically reduces to that of deciding whether two instances denote the same entity

Even more importantly.. Need to look at a number of key research issues in the context provided by NG-SW applications –Example: Ontology Mapping Current work focuses on design-time mapping of complete ontologies –Example: Ontology Selection Current work focuses on user-mediated ontology selection –Example: Ontology Modularization Current work by and large assumes that the user is in the loop

A new application scenario NG-SW applications require algorithms able to perform tasks such as selecting, modularizing, and mapping ontologies at run time Moreover, in such a context, mapping is concerned with mapping ontology fragments, rather than complete ontologies

So What? Time to go beyond 1st generation applications 2nd generation SW applications will exploit much more fully the large scale semantic markup provided by the SW Many issues to be addressed: –Better ontology crawling, indexing, retrieving and ranking support –Mapping, selection, and modularization methods appropriate for NG-SW applications –Further acceleration needed in the generation of semantic markup

Exploiting the SW itself to tackle its heterogeneity Interestingly, a NG-SW-based approach can also be used also to tackle key SW tasks, such as Ontology Mapping –Based on the use of the SW itself as background knowledge

Exploiting Large-Scale Semantics Case Study: Using the Semantic Web as background knowledge in Ontology Mapping

Ontology Mapping: State of the Art State-of-the-art methods rely on a combination of: –Label similarity methods e.g., Full_Professor = FullProfessor –Structure similarity methods Using taxonomic information or information about domain and range of associated properties However, as pointed out by Aleksovski et al (EKAW, 2006): –In many cases there is no sufficient lexical overlap –In many cases source and target ontology have not sufficient structure to allow effective structure-based mapping

Use of bkg. knowledge for ontology mapping A B ? Background Knowledge

External Source = One Ontology Alekszovski et al. EKAW’06 Map candidate terms into concepts from a richly axiomatized domain ontology (anchors) Derive a mapping based on the relation of the anchor terms AB B’ A’ = = rel Advantages: Handles dissimilar ontologies Returns semantic mappings Disadvantages: Assumes that a suitable domain ontology is available. Approach only suitable for closed domains

External Source = Web van Hage et al. ISWC’05 rely on Google and an online dictionary in the food domain to extract semantic relations between candidate mappings using IR techniques AB rel + OnlineDictionary IR Methods Advantages: General purpose Disadvantages: IR Methods introduce noise

External Source = WordNet Lopez et al. ESWC ’05 use wordnet to map queries expressed in the user's terminology to a domain ontology to support question answering AB rel WordNet Advantages: General purpose Disadvantages: Knowledge sparseness Works best with concepts, not so useful with relations WordNet is not an ontology!!!

Knowledge-poor ontology mapping Actually isn’t a bit strange that such complex and knowledge-poor methods are devised, when the SW already provides so much background knowledge?….

Proposal: rely on online ontologies (Semantic Web) to derive mappings ontologies are dynamically discovered and combined AB rel Advantages: General purpose Does not introduce noise Works with any kind of domain entities (concepts, relations, instances) Semantic Web External Source = SW

Strategy 1 - Definition Find ontologies that contain equivalent classes to A and B and use their relationship in the ontologies to derive the mapping. AB rel Semantic Web A1’A1’ B1’B1’ A2’A2’ B2’B2’ An’An’ Bn’Bn’ O1O1 O2O2 OnOn For each ontology use these rules: … These rules can be extended to take into account indirect relations between A’ and B’, e.g., between parents of A’ and B’:

Strategy 1- Variants AB Quick variant: Stop as soon as a relation is found Semantic Web A1’A1’ B1’B1’ O1O1

Strategy 1- Variants Precise variant: Derive all possible mappings from all ontologies and combine them into a final mapping. AB Semantic Web A1’A1’ B1’B1’ O1O1 A2’A2’ B2’B2’ O2O2 Dealing with Contradictions: Return all mappings even if contradictory Return a mapping only when there is no contradiction Return the most frequent mapping (i.e., the mapping derived from most ontologies) Return the mappings with 'higher authority' (based on metrics of ontology evaluation or trust) Try to combine mappings

Strategy 1- Examples BeefFood Semantic Web Beef RedMeat Tap Food MeatOrPoultry SR-16FAO_Agrovoc ka2.rdf Researcher AcademicStaff Semantic Web Researcher AcademicStaff ISWC SWRC

Strategy 2 - Definition Principle: If no ontologies are found that contain the two terms then combine information from multiple ontologies to find a mapping. AB rel Semantic Web A’ BC C’ B’rel Details: (1) Select all ontologies containing A’ equiv. with A (2) For each ontology containing A’: (a) if find relation between C and B. (b) if find relation between C and B.

Strategy 2 - Examples Vs. (midlevel-onto) (Tap) Ex1: Vs.Ex2: (r1) (pizza-to-go) (SUMO) (Same results for Duck, Goose, Turkey) (r1) Vs.Ex3: (pizza-to-go) (wine.owl) (r3)

Conclusions Using the SW as background knowledge for ontology mapping has several benefits –Suitable for our NG-SW scenario as there is no need for design-time selection of a background knowledge –Even when design-time selection is feasible, it is suitable for those cases where a suitable domain ontology cannot be found –Reduces noise by exploiting only ontologies –Can be tailored to handle multiple solutions –Can be integrated with other approaches, based on lexical and structural analysis

If you would like to find out more.. 'Vision' papers –Motta, E., Sabou, M. (2006). "Next Generation Semantic Web Applications". 1st Asian Semantic Web Conference, Beijing. –Motta, E., Sabou, M. (2006). "Language Technologies and the Evolution of the Semantic Web". LREC 2006, Genoa, Italy. –Motta, E. (2006). "Knowledge Publishing and Access on the Semantic Web: A Socio-Technological Analysis". IEEE Intelligent Systems, Vol.21, 3, (88-90). Ontology Modularization –D' Aquin, M., Sabou, M., Motta, E. (2006). "Modularization: A key for the dynamic selection of relevant knowledge components". ISWC 2006 Workshop on Ontology Modularization

If you would like to find out more.. Ontology Mapping –Lopez, V., Sabou, M., Motta, E. (2006). "Mapping the real semantic web on the fly". International Semantic Web Conference, Georgia, Atlanta. –Sabou, M., D'Aquin, M., Motta, E. (2006). "Using the semantic web as background knowledge for ontology mapping". ISWC 2006 Workshop on Ontology Mapping. Ontology Selection –Sabou, M., Lopez, V., Motta, E. (2006). "Ontology Selection for the Real Semantic Web: How to Cover the Queen’s Birthday Dinner?". Proceedings of EKAW 2006, Podebrady, Czech Republic.