Annotation for the Semantic Web Yihong Ding A PhD Research Area Background Study.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

28 March 2003e-MapScholar: content management system The e-MapScholar Content Management System (CMS) David Medyckyj-Scott Project Director.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
SemTag and Seeker: Bootstrapping the Semantic Web via Automated Semantic Annotation Presented by: Hussain Sattuwala Stephen Dill, Nadav Eiron, David Gibson,
Ontology-based Annotation Sergey Sosnovsky
Semiautomatic Generation of Data-Extraction Ontologies Master’s Thesis Proposal Yihong Ding.
Provenance in Open Distributed Information Systems Syed Imran Jami PhD Candidate FAST-NU.
OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.
OntoBlog: Linking Ontology and Blogs Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of Informatics, Japan 2 Asian.
CS652 Spring 2004 Summary. Course Objectives  Learn how to extract, structure, and integrate Web information  Learn what the Semantic Web is  Learn.
Xyleme A Dynamic Warehouse for XML Data of the Web.
OWL-AA: Enriching OWL with Instance Recognition Semantics for Automated Semantic Annotation 2006 Spring Research Conference Yihong Ding.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
DARPA Agent Markup Language Ashish Jain University of Colorado at Boulder.
Two-Level Semantic Annotation Model BYU Spring Conference 2007 Yihong Ding Sponsored by NSF.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
An Architecture for Creating Collaborative Semantically Capable Scientific Data Sharing Infrastructures Anuj R. Jaiswal, C. Lee Giles, Prasenjit Mitra,
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
A New Web Semantic Annotator Enabling A Machine Understandable Web BYU Spring Research Conference 2005 Yihong Ding Sponsored by NSF.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Samad Paydar Web Technology Laboratory Computer Engineering Department Ferdowsi University of Mashhad 1389/11/20 An Introduction to the Semantic Web.
8/28/97Information Organization and Retrieval Files and Databases University of California, Berkeley School of Information Management and Systems SIMS.
A Brief Survey of Web Data Extraction Tools (WDET) Laender et al.
BYU A Synergistic Semantic Annotation Model December 2007 Yihong Ding,
Overview of Search Engines
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
OIL: An Ontology Infrastructure for the Semantic Web D. Fensel, F. van Harmelen, I. Horrocks, D. L. McGuinness, P. F. Patel-Schneider Presenter: Cristina.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
UKOLUG - July Metadata for the Web RDF and the Dublin Core Andy Powell UKOLN, University of Bath UKOLN.
CSE 428 Semantic Web Topics Introduction Jeff Heflin Lehigh University.
Metadata and identifiers for e- journals Copenhagen Juha Hakala Helsinki University Library
An Intelligent Broker Architecture for Context-Aware Systems A PhD. Dissertation Proposal in Computer Science at the University of Maryland Baltimore County.
06/03/'07 upd 04/03/08CmpE 588 Spring 2008 EMU1 Tools for Semantic Annotation Atilla ELÇİ Dept. of Computer Engineering Eastern Mediterranean University.
Knowledge based Learning Experience Management on the Semantic Web Feng (Barry) TAO, Hugh Davis Learning Society Lab University of Southampton.
Survey of Semantic Annotation Platforms
OWL Capturing Semantic Information using a Standard Web Ontology Language Aditya Kalyanpur Jennifer Jay Banerjee James Hendler Presented By Rami Al-Ghanmi.
Information Systems & Semantic Web University of Koblenz ▪ Landau, Germany Semantic Web - Multimedia Annotation – Steffen Staab
SWETO: Large-Scale Semantic Web Test-bed Ontology In Action Workshop (Banff Alberta, Canada June 21 st 2004) Boanerges Aleman-MezaBoanerges Aleman-Meza,
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
© Copyright 2008 STI INNSBRUCK NLP Interchange Format José M. García.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
Query Processing In Multimedia Databases Dheeraj Kumar Mekala Devarasetty Bhanu Kiran.
1 Technologies for (semi-) automatic metadata creation Diana Maynard.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
Dimitrios Skoutas Alkis Simitsis
Semantic Technologies & GATE NSWI Jan Dědek.
© Copyright 2008 STI INNSBRUCK Semantic Annotation Semantic Web Lecture Dieter Fensel.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
A Semantic-Web based Framework for Developing Applications to Improve Accessibility in the WWW Michail Salampasis Dept. of Informatics TEI of Thessaloniki.
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
Semantic Visualization What do we mean when we talk about visualization? - Understanding data - Showing the relationships between elements of data Overviews.
Digital libraries and web- based information systems Mohsen Kamyar.
CREAM: Semantic annotation system May 24, 2013 Hee-gook Jun.
Working with Ontologies Introduction to DOGMA and related research.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Co-funded by the European Union Semantic CMS Community Reference Architecture for Semantic CMS Copyright IKS Consortium 1 Lecturer Organization Date of.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Introduction to the Semantic Web Jeff Heflin Lehigh University.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
The Semantic Web By: Maulik Parikh.
Cloud based linked data platform for Structural Engineering Experiment
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Presented by: Hassan Sayyadi
Presentation transcript:

Annotation for the Semantic Web Yihong Ding A PhD Research Area Background Study

2 Introduction Current web is designed for humans Semantic web (next-generation web) is designed for both humans and machines Semantic annotation –Disclose semantic meanings of web content –Convert current HTML web pages to machine- understandable semantic web pages

3 Outline Historical Review Current Status Related Research Fields Future Challenges

4 Semantic Annotation in Ancient Ages No evidence when humans started to annotate text about 350 BC history of semantic annotation ≈ history of ontologies

5 The First Dream of Modern Semantic Annotation July 1945, Vannevar Bush, As We May Think, The Atlantic Monthly Bush's dream device –humans could acquire information (World Wide Web) –humans could contribute their own ideas (Web Annotation) from/to the community

6 Web Annotation before 1999 [Heck et. al., 1999] Developing better user interfaces Improving storage structures Increasing annotation sharability Example systems: ComMentor, Annotator TM, Third Voice, CritLink, CoNote, and Futplex

7 Semantic Labeling before 1999 Dublin Core Metadata Standard [ –15 element sets encapsulate data Superimposed Information [Delcambre et. al., 2001] marks Superimposed Layer Base Layer Information Source 1 Information Source 2 Information Source n … –Title –Subject –Description –Creator –Publisher –Contributor –Date –Type –Format –Identifier –Source –Language –Relation –Coverage –Rights

8 Status of Current Web Semantic Annotation Studies Interactive annotation Automatic annotation

9 Interactive Annotation Systems Lets humans interact through machine interfaces to annotate documents Problems –Inconsistency –Error-proneness –Lack of scalability Values –Easy to implement –Suitable for small-scale tasks and experiments –Helpful to build corpora for evaluations

10 Interactive Annotation Systems Annotea [Kahan et. al., 2001] –W3C project –An open RDF infrastructure for shared web annotations SHOE (Simple HTML Ontology Extensions) [Heflin et. al., 2000] –University of Maryland, College Park –Manual annotator using SHOE ontologies

11 Automatic Annotation Systems Common feature: use of ontologies Typical approaches –Annotation with automatic ontology generation (1 system) –Annotation with automatic information extraction (6 systems)

12 Annotation with Ontology Generation SCORE (Semantic Content Organization and Retrieval Engine) [Sheth et. al., 2002] Voquette (now acquired by Semagix Co.), University of Georgia

13 Annotation with Automatic IE Ont-O-Mat [Handschuh et. al., 2002] –University of Karlsruhe at Germany MnM [Vargas-Vera et. al., 2002] –Open University of United Kingdom Common features –DAML+OIL ontologies –Supervised adaptive learning with Lazy-NLP (Amilcare) –Annotation stored inside web pages Differences –MnM allows multiple ontologies at one time –MnM also stores annotations in a knowledge base –Ont-O-Mat uses OntoBroker both as an annotation server and as a reasoning engine

14 Annotation with Automatic IE KIM Platform [Kiryakov et. al., 2004] –Ontotext Lab., Sirma Group, a Canadian-Bulgarian joint venture SemTag [Dill et. al., 2003] –IBM Almaden Research Center Similar features –Use one special designed upper-level ontology, KIM ontology vs. TAP ontology Specific features –KIM uses an NLP tool (GATE) to extract information –KIM stores annotations in a separate file –SemTag uses inductive learning to extract information –SemTag annotates 264 million Web pages and generate approximately 434 million semantic tags

15 Annotation with Automatic IE Stony Brook Annotator [Mukherjee et. al., 2003] –Stony Brook University –Structural analysis of DOM tree for HTML pages –Drawbacks Taxonomic relationships only No generic labeling algorithm disclosed RoadRunner Labeller [Arlotta et. al., 2003] –Università di Roma Tre and Università della Basilicata –Automatic assign label names based on image recognition –Drawbacks Semantic meaning of labels unknown Difficulty in associating labels with ontologies

16 Related Research Fields Semantic Web Information extraction Ontology related topics Conceptual modeling Logic languages Web services

17 Semantic Web Weaving the Web [Berners-Lee 1999], birth of the Semantic Web The Semantic Web [Berners-Lee et. al., 2001]

18 Information Extraction [Laender et. al., 2002] 1.Human-guided approaches Wrapper languages, Modeling-based tools No annotation examples Too heavily human involvement 2.Non-ontology-based approaches HTML-aware tools: StonyBrook tool [Mukherjee et. al., 2003], RoadRunner Labeller [Arlotta et. al., 2003] NLP-based tools: Ont-O-Mat [Handschuh et. al., 2002], MnM [Vargas-Vera et. al., 2002], KIM platform [Kiryakov et. al., 2004] ILP-based tools: SemTag [Dill et. al., 2003] Require extra alignment between extraction categories in wrappers and concepts in ontologies 3.Ontology-based Approaches Ontology-based tools: my proposal Not require alignment, resilient to web page layouts Slow in execution time

19 Ontology Related Topics Ontology languages [W3C, OWL] –Knowledge representation and reasoning Ontology generation [Ding et. al., 2002a] –Annotation domain specification Ontology enrichment [Parekh et. al., 2004 ] –Annotation domain specification expanding Ontology population [Alani et. al., 2003] –Annotation result output Ontology mapping and merging [Ding et. al., 2002b] –Large-scale annotation requires large-scale ontologies –Small-scale ontologies are less expensive to build –Ontology mapping creates the links among small-scale ontologies –Ontology merging fuses small-scale ontologies into a large-scale ontology

20 Conceptual Modeling Annotation requires knowledge modeling Ontology is a type of conceptual modeling ER Model [Chen 1976] –The most influential conceptual model –Influence OSM model, basis of data-extraction ontology

21 Logic Languages Logic foundation provides reasoning and inference power for modeling languages Examples –First-order logic [Smullyan 1995] –Description logics [Brachman et. al., 1984]

22 Web Services More and more, web services become the typical application in semantic web scenario. Two ways aligning web services with semantic annotation –Web service annotation [Brodie 2003] –Semantic annotation web service

23 Summary and Future Challenges Annotation for the semantic web –Enable machine-understandable web –Support semantic searching –Support global-wide web services –Still an unsolved problem Main technical challenges –Direct ontology-driven annotation mechanism –Concept disambiguation –Automatic domain ontology generation –Scalability