Helping people find content … preparing content to be found Enabling the Semantic Web Joseph Busch.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
IATI Technical Advisory Group Technical Proposals Simon Parrish IATI Technical Advisory Group, DIPR March 2010.
February Harvesting RDF metadata Building digital library portals with harvested metadata workshop EU-DL All Projects concertation meeting DELOS.
CS570 Artificial Intelligence Semantic Web & Ontology 2
Management Information Systems, Sixth Edition
An Introduction to MODS: The Metadata Object Description Schema Tech Talk By Daniel Gelaw Alemneh October 17, 2007 October 17, 2007.
StrategiesTaxonomy June 9, 2014Copyright 2014 Taxonomy Strategies. All rights reserved. The Search for Meaning and Semantics: Taxonomies Get It Done Joseph.
Information and Business Work
Information Retrieval in Practice
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Vocabulary Markup Language (Voc-ML) Project Joseph A. Busch Content Intelligence Evangelist Interwoven.
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
Ontology-based Access Ontology-based Access to Digital Libraries Sonia Bergamaschi University of Modena and Reggio Emilia Modena Italy Fausto Rabitti.
Overview of Search Engines
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
(C) 2013 Logrus International Practical Visualization of ITS 2.0 Categories for Real World Localization Process Part of the Multilingual Web-LT Program.
Databases & Data Warehouses Chapter 3 Database Processing.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Introduction to UDDI From: OASIS, Introduction to UDDI: Important Features and Functional Concepts.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
SC32 WG2 Metadata Standards Tutorial Metadata Registries and Big Data WG2 N1945 June 9, 2014 Beijing, China.
Using Taxonomies Effectively in the Organization v. 2.0 KnowledgeNets 2001 Vivian Bliss Microsoft Knowledge Network Group
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
XML DTDs and other Alternatives: Vocabulary Markup Language (Voc-ML) Project & Friends Joseph A. Busch Director, Solutions Architecture NetLab and Friends.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
Scalable Metadata Definition Frameworks Raymond Plante NCSA/NVO Toward an International Virtual Observatory How do we encourage a smooth evolution of metadata.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Dr. Bhavani Thuraisingham August 2006 Building Trustworthy Semantic Webs Unit #1: Introduction to The Semantic Web.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
Ontology-Centered Personalized Presentation of Knowledge Extracted from the Web Ralitsa Angelova.
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
1 Chapter 1 Introduction to Databases Transparencies.
Oreste Signore- Quality/1 Amman, December 2006 Standards for quality of cultural websites Ministerial NEtwoRk for Valorising Activities in digitisation.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Information Integration 15 th Meeting Course Name: Business Intelligence Year: 2009.
Working with XML. Markup Languages Text-based languages based on SGML Text-based languages based on SGML SGML = Standard Generalized Markup Language SGML.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
From XML to DAML – giving meaning to the World Wide Web Katia Sycara The Robotics Institute
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
1 Chapter 2 Database Environment Pearson Education © 2009.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
SEMI-STRUCTURED DATA (XML) 1. SEMI-STRUCTURED DATA ER, Relational, ODL data models are all based on schema Structure of data is rigid and known is advance.
Improvement of Semantic Interoperability based on Metadata Registry(MDR) Doo-Kwon Baik Dept. of CSE Korea University.
Welcome: To the fifth learning sequence “ Data Models “ Recap : In the previous learning sequence, we discussed The Database concepts. Present learning:
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Semantic and geographic information system for MCDA: review and user interface building Christophe PAOLI*, Pascal OBERTI**, Marie-Laure NIVET* University.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Information Retrieval in Practice
The Semantic Web By: Maulik Parikh.
DATA MODELS.
Lifecycle Metadata for Digital Objects
Part of the Multilingual Web-LT Program
Data Model.
The ultimate in data organization
DATA MODELS.
Presentation transcript:

Helping people find content … preparing content to be found Enabling the Semantic Web Joseph Busch

Outline  Why Semantics Matter  What is the Semantic Web  Semantic Content Management

Why Semantics Matter

When you own a Rembrandt you can spell his name any way you want.

But when you want to find a Rembrandt … you better spell his name correctly.

Vocabulary resources can help find the right artist even if their name is typed incorrectly.

Users cannot type in the complex queries needed to find all the relevant items... But this can be done automatically.

Complex queries are even more important when you search the entire web.

So you find Rembrandt the Dutch guy...

… And not Rembrandt the toothpaste.

Search Failure  19% Character errors. (Young, et al)  40% Vocabulary errors. (Seaman)  20% Index confusion.  21% Successful (Nielsen) 40% 20% 19% 21%

Search Solution  Generate more consistent content to search on.  Correct user errors.  Map the language of users to the language of the target content.

Search Alternatives PersonalizationContent needs to be tagged with attributes that map to user categories Analytics Users don ’ t follow predictable & consistent pathways TaxonomiesAutomatically generated taxonomies reflect ambiguities of natural language SyndicationRequires subscriber profiles, well-categorized content, & managed rules

Solution for Search Alternatives  Predictable standardized structures, and  Consistent semantics to work on … so machines can understand it.

What is the Semantic Web

Berners-Lee’s Semantic Web  Formatting content so that machines can understand it.  Use XML/RDF:  Infinitely flexible markup language.  Process content in many more ways than simply for viewing it.  Problem: Mostly syntax … not semantics (in the human sense of meaning, i.e., language)

XML is a Grail-like Object  XML is just a means for encoding information—an envelope standard. The real value is still in the information that you put in the envelope.  Filling XML placeholders such as,, and requires semantic information management.

Soergel’s SemWeb Proposal  System of integrated access to data on concepts and terminology.  Bring together variety of sources that exist largely in separate worlds, including dictionaries, thesauri, classification schemes, etc.  Federated system with multiple collaborators.  Common interface to all concept & terminology knowledge bases on the Internet.

The Real Semantic Web  Namespace for uniquely identifying a semantic scheme & each concept within each scheme.  Broad template or conceptual schema for holding all types of semantic information & specifying relationships among them.  Definitions of services for interacting with the System.

Vocabulary Markup Language (VocML)  XML schema for the Semantic Web.  Broad template for structured representation of semantic schemes.  Dublin Core metadata.  Tags and syntax for uniquely identifying each concept.  Typed relationships (hierarchical, associative, etc.)  Typed notes. Networked Knowledge Organization Systems nkos.slis.kent.edu nkos.slis.kent.edu

DFSIC-1998 Standard Industrial Classification (1987) Interwoven U.S. Department of Commerce … Field Crops, except Cash Grains, not elsewhere classified Establishments primarily engaged in the production of field crops, except cash grains, not elsewhere classified. This industry also includes establishments deriving 50 percent or more of their total value of sales of agricultural products from field crops, except cash grains (Industry Group 013), but less than 50 percent from products of any single industry … Dublin Core Unique ID Typed Relationships

Implementing the Semantic Web

The Holy Grail is...  Accurate information automatically processed so that it can easily be found and used for applications.  A rich web of linked information, with markup allowing machines to route relevant information to the audiences that value it most.

Metatagging  The hard work is mining content to extract key information:  Recognize the mentions of people, organizations, places, and things.  Infer the subject matter.  And putting it into formats with standard labels for effective exploitation.

Raw Content unstructured text untagged data Semantic Content Management Relevant Information found items granular text User Queries database search text search Structured Content metadata XML/RDF Tag It Exploit It Vocabularies

Exploiting the Semantic Web  Route content to audience segments that value it most.  Link mentions of people, organizations, places, and things to other information related to those entities.  Populate portal directories.  Precisely search heterogeneous content items.

Predictions

 VocabularyML.  Semantic standard for unique identifiers (a namespace) for people, organizations, places, and things and the relationships among them.  See: nkos.slis.kent.edu nkos.slis.kent.edu  Technologies that enable the persistent naming of the information inside XML envelopes.  Generation of enormous value through interoperability among web applications.

Joseph A. Busch Content Intelligence Evangelist ASIST President, fax Moving business to the Web