Text Understanding Agents and the Semantic Web Akshay Java, Tim Finin, Sergei Nirenburg 01/04/2005.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Mitsunori Ogihara Center for Computational Science
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Chronos: A Tool for Handling Temporal Ontologies in Protégé
CS570 Artificial Intelligence Semantic Web & Ontology 2
OntoSem2OWL. Plan of the talk ● OntoSem Overview ● Features of OntoSem Ontology ● Mapping OntoSem2OWL ● Motivation ● Possible Application Scenarios.
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
SIG2: Ontology Language Standards WebOnt Briefing Ian Horrocks University of Manchester, UK.
Of 27 lecture 7: owl - introduction. of 27 ece 627, winter ‘132 OWL a glimpse OWL – Web Ontology Language describes classes, properties and relations.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
OIL: An Ontology Infrastructure for the Semantic Web D. Fensel, F. van Harmelen, I. Horrocks, D. L. McGuinness, P. F. Patel-Schneider Presenter: Cristina.
1 Semantic Web Mining Presented by: Chittampally Vasanth Raja 10IT05F M.Tech (Information Technology)
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
An OWL based schema for personal data protection policies Giles Hogben Joint Research Centre, European Commission.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
1 Semantic Technologies: Diamond in the Rough? Unik Graduate Research Center Dr. Juan Miguel Gomez Universidad Carlos III de Madrid.
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Metadata Agents and Semantic Mediation Mikhaila Burgess Cardiff University.
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
Aidministrator nederland b.v. Adding formal semantics to the Web Jeen Broekstra, Michel Klein, Stefan Decker, Dieter Fensel,
Chapter 6 Understanding Each Other CSE 431 – Intelligent Agents.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
The Semantic Web William M Baker
Logics for Data and Knowledge Representation
CoGenTex, Inc. Ontology-based Multimodal User Interface in MOQA AQUAINT 18-Month Workshop San Diego, California Tanya Korelsky Benoit Lavoie Ted Caldwell.
RDF and OWL Developing Semantic Web Services by H. Peter Alesso and Craig F. Smith CMPT 455/826 - Week 6, Day Sept-Dec 2009 – w6d21.
Building an Ontology of Semantic Web Techniques Utilizing RDF Schema and OWL 2.0 in Protégé 4.0 Presented by: Naveed Javed Nimat Umar Syed.
Populating A Knowledge Base From Text Clay Fink, Tim Finin, Christine Piatko and Jim Mayfield.
OWL 2 in use. OWL 2 OWL 2 is a knowledge representation language, designed to formulate, exchange and reason with knowledge about a domain of interest.
Integrating Language Understanding agents into the Semantic Web Akshay Java, Tim Finin, Sergei Nirenburg 11/04/2005.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Semantic Web Ontology Design Pattern Li Ding Department of Computer Science Rensselaer Polytechnic Institute October 3, 2007 Class notes for CSCI-6962.
Semantic Web - an introduction By Daniel Wu (danielwujr)
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
OntoSem2OWL Integrating Language Understanding agents into the Semantic Web Ebiquity Presentation 05/17/2005 -Akshay Java.
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
MT with an Interlingua Lori Levin April 13, 2009.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
A Short Tutorial to Semantic Media Wiki (SMW) [[date:: July 21, 2009 ]] At [[part of:: Web Science Summer Research Week ]] By [[has speaker:: Jie Bao ]]
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Tool for Ontology Paraphrasing, Querying and Visualization on the Semantic Web Project By Senthil Kumar K III MCA (SS)‏
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Working with Ontologies Introduction to DOGMA and related research.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
Architecture for an Ontology and Web Service Modelling Studio Michael Felderer & Holger Lausen DERI Innsbruck Frankfurt,
Practical RDF Chapter 12. Ontologies: RDF Business Models Shelley Powers, O’Reilly SNU IDB Lab. Taikyoung Kim.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
COMM: Designing a Well-Founded Multimedia Ontology for the Web Wednesday 14 th of November, 2007 Richard Arndt Steffen Staab Rapha.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Human-Assisted Machine Annotation Sergei Nirenburg, Marjorie McShane, Stephen Beale Institute for Language and Information Technologies University of Maryland.
@ eBiquity Lab, CSEE, UMBC Swoogle Tutorial (Part I: Swoogle R & D) A brief introduction to Swoogle An overview of Swoogle research A summary of Swoogle.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST Project Review Meeting, 11 th March, WP2: Tools Raphael Volz Universität.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST WP4: Ontology Engineering Heiner Stuckenschmidt, Michel Klein Vrije Universiteit.
Ontology Technology applied to Catalogues Paul Kopp.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Ccs.  Ontologies are used to capture knowledge about some domain of interest. ◦ An ontology describes the concepts in the domain and also the relationships.
Linked Open Data Dataset from Related Documents Petya Osenova and Kiril Simov IICT-BAS LDL-2016, LREC, Portoroz.
26/02/ WSMO – UDDI Semantics Review Taxonomies and Value Sets Discussion Paper Max Voskob – February 2004 UDDI Spec TC V4 Requirements.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Analyzing and Securing Social Networks
ece 720 intelligent web: ontology and beyond
Linking Guide Michel Böhms.
Semantic Markup for Semantic Web Tools:
Information Extraction from Social Media
Presentation transcript:

Text Understanding Agents and the Semantic Web Akshay Java, Tim Finin, Sergei Nirenburg 01/04/2005

Outline Motivation: Language Understanding Agents Ontological Semantics Bridging the Knowledge Gap Preliminary Evaluation SemNews: An Application Testbed Conclusion Q&A

WWW Motivation Intelligent agents need knowledge and information. Most Web content is NL text. SW can benefit NLP tools in their language understanding tasks Web of documents Web of data Text Images Audio video Ontologies Instances triples Natural Language RDF/OWL Facts from NL structured information Semantic Web NLP Tools

Motivation Language Understanding Agents Provides RDF version of the news.

Ontological Semantics OntoSem is a Natural Language Processing System that processes the text and converts them into facts. Supported by a constructed world model encoded in a rich Ontology.

Ontological Semantics

Static Knowledge Sources Ontology 8000 concepts Avg 16 properties each Lexicons English: entries Spanish: entries Chinese: 3000 entries Fact repository facts Onomasticon NNNNN names

The OntoSem Ontology PROPERTY FILLER FACET ONTOLOGY ::= CONCEPT+ CONCEPT ::= ROOT | OBJECT-OR-EVENT | PROPERTY SLOT ::= PROPERTY | FACET | FILLER

Text Meaning Representation (TMR) Word sense addressed disambiguated A persistent fact stored in the FR Semantic dependency established

REQUEST-ACTION-69 AGENT HUMAN-72 THEME ACCEPT-70 BENEFICIARY ORGANIZATION-71 SOURCE-ROOT-WORD ask TIME (< (FIND-ANCHOR-TIME)) ACCEPT-70 THEME WAR-73 THEME-OF REQUEST-ACTION-69 SOURCE-ROOT-WORD authorize ORGANIZATION-71 HAS-NAME United-Nations BENEFICIARY-OF REQUEST-ACTION-69 SOURCE-ROOT-WORD UN HUMAN-72 HAS-NAMEColin Powell AGENT-OF REQUEST-ACTION-69 SOURCE-ROOT-WORD he; reference resolution has been carried out WAR-73 THEME-OF ACCEPT-70 SOURCE-ROOT-WORD war Text Meaning Representation (TMR) He asked the UN to authorize the war.

Mapping OntoSem to web based KR NL Text OntoSem OWL Ontology Lexicon OntoSem2OWL Fact Repository TMR Ontology TMRs In OWL

Mapping Rules for Classes OntoSem LISP version ( make-frame patent ( definition (value (common "the exclusive right to make, use or sell an invention, which is granted to the inventor"))) ( is-a (value (common intangible-asset legal-right)))) OWL Version: he exclusive right to make, use or sell an invention, which is granted to the inventor

Mapping Rules for Properties Properties can be ObjectProperty owl:ObjectProperty Datatype Property owl:DatatypeProperty Property hierarchy is defined by owl:subPropertyOf Domain maps to rdfs:domain Range maps to rdfs:range Restrictions are handled using owl:Restriction Numeric datatypes are handled using XSD

Mapping Rules for Properties… (make-frame controls (domain (sem (common physical-event physical-object social-event social-role))) (range (sem (common actualize artifact natural-object social-role))) (is-a (value (common relation))) (inverse (value (common controlled-by))) (definition (value (common "A relation which relates concepts to what they can control"))))

Mapping Rules for Properties… "A relation which relates concepts to what they can control" (make-frame (domain (range (is-a (inverse

Mapping Rules for Facets Facets are a way to restricting the fillers that can be used for a particular slot SEM and VALUE Maps them using owl:Restriction on a particular property. RELAXABLE-TO Add this to the classes present in owl:Restriction and add this information in the annotation. DEFAULT No clear way to represent non-monotonic reasoning and closed world assumptions in Semantic Web. DEFAULT-MEASURE similar to DEFAULT Facet, not handled. DEFAULT, DEFAULT-MEASURE used relatively less frequently NOT Not facet can be handled using owl:disjointOf INV need not be handled since is-a slot is already mapped to owl:inverseOf

Evaluation Swoop Pellet Wonderweb Built Ontology translation tool using Jena API Total Triples Generated ~ (including bnode) Time to build the Model ~ sec Time to do RDFS Inference ~ 10 sec Time to do OWL Micro ~ 40 sec Time to do OWL Full ~ ???? DL Expressivity: ELUIH EL - Conjunction and Full Existential Quantification U - Union H - Role Hierarchy I - Role Inverse Total Number of Classes: 7747 (Defined: 7747, Imported: 0) Total Number of Datatype Properties: 0 (Defined: 0, Imported: 0) Total Number of Object Properties: 604 (Defined: 604, Imported: 0) Total Number of Annotation Properties: 1 (Defined: 1, Imported: 0) Total Number of Individuals: 0 (Defined: 0, Imported: 0) NOTE: This is using no Restrictions After Translation OWL FULL

Evaluation Syntactic Correctness: was checked using OWL/RDF validators. Semantic Validation: Full semantic validation even for subsets of OWL is difficult. Meaning Preservation: some subset of the native representation features such as DEFAULTS, modality, case roles may be underrepresented or not handled. Feature Minimization: Complex features could be difficult for reasoners to handle hence we can perform the translations at each of the levels – OWL Lite, OWL DL, OWL Full. Translation Complexity: OntoSem is an extensive and large ontology (~8000 concepts). Translation itself is done syntactically but in general translation might require reasoning which could be an issue.

An Application Testbed: SemNews Semantically Search and Browse news Aggregators collect the RSS news descriptions form various sources. The sentences are processed by OntoSem and are converted into TMRs Provides intelligent agents with the latest news in a machine readable format

Semantic RSS Data Aggregators News Feeds OntoSem TMRs FR Language Processing OntoSem2OWL Dekade Editor Knowledge Editor Environment Semantic Web Tools OntoSem Ontology (OWL) TMR Inferred Triples Fact Repository Interface Ontology & Instance browser Text Search RDQL Query Swoogle Index RSS Aggregator

Agent understandable news Provides RDF version of the news.

Semantacizing RSS View structured representation of the RSS news story. Future versions would enable editing the facts and provide provenance information

News stories are ontologically linked Find news stories by browsing through the OntoSem ontology.

Tracking Named Entities Find stories on a specific named entity.

Browsing Facts Fact repository explorer for named entity ‘Mexico’ shows that it has a relation ‘nationality-of’ with CITIZEN-235 Fact repository explorer for instance CITIZEN-235 shows that the citizen is an agent of ESCAPE-EVENT

Querying the semanticized RSS RDQL Queries Provides structured querying over text repre- sented in RDF.

Semantic Alerts Alerts can be specified as ontological concepts/ keywords / RDQL queries. Subscribe to results of structured queries

Beyond keyword search Conceptually searching for content Find all news stories that have something to do with a place and a terrorist activity. Context based querying Find all events in which ‘George Bush’ was the ‘speaker’. Reporting facts Find all politicians who traveled to Asia. Knowledge sharing Populating instances by mapping FOAF and DC to OntoSem ontology.

Current work Enron corpus Profiles in terror

Conclusions Integrating language processing agents into the SW would publish SW annotations and documents that capture the text’s meaning. Migrating from native non-web based representation to SW representation may be loss-full but is still useful for many applications. SemNews application testbed demonstrates some scenarios that can benefit from language understanding agents.

For More Information Semnews application OntoSem NLP system UMBC ebiquity research group This presentation

References Software Used [1] OntoSem [2] RDF Validation service [3] Jena Toolkit [4] Swoop Ontology Viewer [5] Pellet OWL DL Reasoner [6] Wonder Web OWL Validator Papers [1] Sergei Nirenburg and Victor Raskin, Ontological Semantics, Formal Ontology and Ambiguity [2] Sergei Nirenburg and Victor Raskin, Ontological Semantics, MIT Press, Forthcoming [3] Sergei Nirenburg, Ontological Semantics: Overview, Presentation CLSP JHU, Spring 2003 [4] Marjorie McShane, Sergei Nirenburg, Stephen Beale, Margalit Zabludowski, The Cross Lingual Reuse and Extension of knowledge Resources in Ontological Semantics [5] P.J Beltran-Ferruz, P.A Gonzalez-Calero, P. Gervas Converting Mikrokosmos frames into Description Logics. [6] Sergei Nirenburg, Ontology Tutorial, ILIT UMBC Mailing Lists [1] Jena Developers [2] pellet users [3] Semantic web [4] W3c RDF Interest [5] W3c Semantic web

Backup slides

Reasoning Capabilities Buildfile: build.xml init: compile: dist: [jar] Building jar: /home/aks1/software/eclipse/workspace/ontojena/dist/lib/ontojena.jar run: [java] MODEL OK [java] Resource: [java] - ( rdfs:subClassOf [java] - ( rdfs:subClassOf [java] - ( rdfs:subClassOf [java] - ( rdfs:subClassOf [java] - ( rdfs:subClassOf [java] - ( rdfs:subClassOf [java] - ( rdfs:subClassOf [java] - ( rdfs:subClassOf [java] - ( rdfs:subClassOf [java] - ( rdfs:subClassOf [java] - ( rdfs:subClassOf [java] - ( rdfs:subClassOf [java] - ( rdfs:label ' "a truck with equipment for fighting fires"') [java] - ( rdf:type owl:Class) [java] fire-engine recognized as subclas of vehicle BUILD SUCCESSFUL Total time: 10 seconds real 0m11.144s user 0m9.530s sys 0m0.190s ontojena]$ Finding Transitive Closures (RDFS reasoning) Fire-engine Truck Wheeled-engine-vehicle Engine-propelled--vehicleWheeled--vehicle Land-vehicle vehicle Inferred Triples

Mapping Rules CaseFrequencyMapped Using 1domain617rdfs:domain 2domain with not facet16owl:disjointWith 3range406rdfs:range 4range with not facet5owl:disjointWith 5inverse260owl:inverseOf Property Related Constructs

Mapping Rules CaseFrequencyMapped Using 1value18217owl:Restriction 2sem5686owl:Restriction 3relaxable-to95annotation 4default350Not handled 5default-measure612Not handled 6not134owl:disjointWith 7inv1941Not required Facet related constructs