©2003 Paula Matuszek CSC 9010: AeroText, Ontologies, AeroDAML Dr. Paula Matuszek (610) 270-6851.

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
AeroDAML Applying Information Extraction to Generate DAML Annotations Dr. Paul Kogut Lockheed Martin Management & Data Systems.
1 1 HORUS The Egyptian All-Seeing God of Light A Joint IMO/DARPA Project DAML PI Meeting, Naushua, NH 17 Jul 2001 DAML PI Meeting, Naushua, NH 17 Jul 2001.
Semantically Grounded Briefings Bob Balzer, Neil Goldman, Marcelo Tallis Teknowledge
UML-Based Ontology Toolset Dr. Paul Kogut Lockheed Martin Management & Data Systems Valley Forge, PA.
UML-Based Ontology Toolset Dr. Paul Kogut Lockheed Martin Management & Data Systems Valley Forge, PA.
1 OOA-HR Workshop, 11 October 2006 Semantic Metadata Extraction using GATE Diana Maynard Natural Language Processing Group University of Sheffield, UK.
Using DAML format for representation and integration of complex gene networks: implications in novel drug discovery K. Baclawski Northeastern University.
Semantic Web Thanks to folks at LAIT lab Sources include :
CS570 Artificial Intelligence Semantic Web & Ontology 2
IPY and Semantics Siri Jodha S. Khalsa Paul Cooper Peter Pulsifer Paul Overduin Eugeny Vyazilov Heather lane.
Ontology… A domain ontology seeks to reduce or eliminate conceptual and terminological confusion among the members of a user community who need to share.
Ontology Notes are from:
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
DARPA Agent Markup Language Ashish Jain University of Colorado at Boulder.
Dr. Paul Kogut and William Holmes - Lockheed Martin M&DS AeroDAML: Applying Information Extraction to Generate DAML Annotations from Web Pages Dr. Paul.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
CSC 9010 Spring Paula Matuszek A Brief Overview of Watson.
Extending UML to Support Ontology Engineering Kenneth Baclawski and Mieczylaw K. Kokar Northeastern University Paul A. Kogut, William S. Holmes III and.
CSC 9010 Spring, Paula Matuszek, Lillian Cassel 1 CS 9010: Semantic Web Possible Topics for Discussion Paula Matuszek Spring, 2006.
Knowledge representation
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
1 Technologies for (semi-) automatic metadata creation Diana Maynard.
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
©2003 Paula Matuszek CSC 9010: Information Extraction Dr. Paula Matuszek (610) Fall, 2003.
©2003 Paula Matuszek CSC 9010: Text Mining Applications Document Summarization Dr. Paula Matuszek (610)
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Ontology Summit2007 Survey Response Analysis Ken Baclawski Northeastern University.
Translation Patterns to Specify Processes in the PSL Ontology Dr. A. Sánchez-Ruíz University of North Florida CIS Department Associate Professor and Coordinator.
Semantic Web - an introduction By Daniel Wu (danielwujr)
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Proposed NWI KIF/CG --> Common Logic Standard A working group was recently formed from the KIF working group. John Sowa is the only CG representative so.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
THE SUPPORTING ROLE OF ONTOLOGY IN A SIMULATION SYSTEM FOR COUNTERMEASURE EVALUATION Nelia Lombard DPSS, CSIR.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
ICCS 2008, CracowJune 23-25, Towards Large Scale Semantic Annotation Built on MapReduce Architecture Michal Laclavík, Martin Šeleng, Ladislav Hluchý.
Ontology Mapping in Pervasive Computing Environment C.Y. Kong, C.L. Wang, F.C.M. Lau The University of Hong Kong.
Artificial Intelligence 2004 Ontology
CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM ONTOLOGY-DRIVEN SEMANTIC MARK UP OF UNSTRUCTURED TEXTS EASTERN MEDITERRANEAN UNIVERSITY COMPUTER.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
©2003 Paula Matuszek CSC 9010: Text Mining Applications Dr. Paula Matuszek (610)
Digital libraries and web- based information systems Mohsen Kamyar.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Mining the Biomedical Research Literature Ken Baclawski.
CSC 9010 Spring, Paula Matuszek. 1 CS 9010: Semantic Web Applications and Ontology Engineering Paula Matuszek Spring, 2006.
Faculty Faculty Richard Fikes Edward Feigenbaum (Director) (Emeritus) (Director) (Emeritus) Knowledge Systems Laboratory Stanford University “In the knowledge.
CSC 9010 Spring, Paula Matuszek, Lillian Cassel 1 CS 9010: Semantic Web Protégé Lab Paula Matuszek Spring, 2006.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Personalized Recommendation of Related Content Based on Automatic Metadata Extraction Andreas Nauerz 1, Fedor Bakalov 2, Birgitta.
From XML to DAML – giving meaning to the World Wide Web Katia Sycara The Robotics Institute
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
©2012 Paula Matuszek CSC 9010: Information Extraction Overview Dr. Paula Matuszek (610) Spring, 2012.
Chapter 7 K NOWLEDGE R EPRESENTATION, O NTOLOGICAL E NGINEERING, AND T OPIC M APS L EO O BRST AND H OWARD L IU.
© 1990—2006 Visual Knowledge Software® | Private and Confidential | 2 Semantic Agent Wikis For Engineering.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
ece 627 intelligent web: ontology and beyond
Independent Study of Ontologies
Multi-agent system for web services
Semantic Web: Commercial Opportunities and Prospects
Zachary Cleaver Semantic Web.
Presentation transcript:

©2003 Paula Matuszek CSC 9010: AeroText, Ontologies, AeroDAML Dr. Paula Matuszek (610)

©2003 Paula Matuszek AeroText l Information Extraction tool marketed by Lockheed Martin l Capabilities similar to GATE l Much better developed IDE l Less open to extensions of the system itself. l Equally steep learning curve for effective use! Lockheed AeroText General Overview Lockheed AeroText White Paper

©2003 Paula Matuszek AeroText Demo

©2003 Paula Matuszek Ontologies l Information Extraction requires modeling extensive domain knowledge l Other applications of text mining, such as document categorization, can also use domain information l In modeling such knowledge we often create an ontology: An explicit formal specification of how to represent the objects, concepts, and other entities that are assumed to exist in some area of interest and the relationships that hold among them.

©2003 Paula Matuszek A Simple Ontology: Birthdates l Objects, concepts, entities: –Months, days, years –dates –first names –last names –persons –birthdates l Relationships between them –a date has exactly one month, day, year –a birthdate is a date –a person has at least 1 first name and exactly 1 last name –a person has a birthdate –a birthdate has a person

©2003 Paula Matuszek Who and Why? l Many groups are developing ontologies: –standardize terms and vocabulary –facilitate the semantic web –improve information integration –interested in the domain itself l Some ontologies under development –Cyc –GO (Gene ontology) –UMLS (Unified Medical Language System) –CIA World Factbook

©2003 Paula Matuszek DAML l DARPA Agent Markup Language l A language for describing ontologies l Example: an ontology for dates Example: an ontology for dates l Extensive information available at

©2003 Paula Matuszek UBOT l UML Based Ontology Toolkit l Part of a DARPA project to automatically mark up web pages to make them l The purpose of DAML is to annotate information on the web to make it machine-readable so that software agents can interpret it and reason with it: the semantic web l

©2003 Paula Matuszek AeroDAML l AeroDAML is a web service that takes a web page as an input and generates DAML markup. l Uses AeroText as the underlying extraction tool. l Works with various ontologies. l Paper describing system Paper describing system

©2003 Paula Matuszek Lab: try out AeroDAML l AeroDAML page AeroDAML page Choose a news page ( Google News,...) and tag it with the Cyc and CIA ontologies. How well did each ontology do at picking up content? Did they miss things they should have found? Was anything tagged incorrectly? Repeat for one of your domain-specific documents, or a web page in a specific area. Try a different ontology if you think one of the others might be more interesting. How was the annotation different? Are we enabling the semantic web?