Week 11 Material: Introduction to the Semantic Web

Presentation transcript:

Week 11 Material: Introduction to the Semantic Web. This is just a generic slide set; it should be adapted and reviewed, possibly with slides removed, for a specific event. Rule of thumb: on average, a slide takes a minute…

Subtopics of the day: introduction to the Semantic Web; definition of the Semantic Web; RDF and OWL.

Semantic Web definition. The word “semantic” refers to meaning: the semantics of something is its meaning. The Semantic Web = a Web with meaning.

Towards a Semantic Web. Tasks often require combining data on the Web: hotel and travel information may come from different sites, searches may span different digital libraries, etc. Humans combine this information easily, even if different terminologies are used or the information is incomplete or buried in images, videos, …

Example: automatic airline reservation. Your automatic airline reservation system knows about your preferences, builds up a knowledge base using your past, and can combine the local knowledge with remote services: airline preferences, dietary requirements, calendaring, etc. It communicates with remote information (i.e., on the Web!). (M. Dertouzos: The Unfinished Revolution)

Example: data(base) integration. Databases are very different in structure and in content. Lots of applications require managing several databases: after company mergers; combination of administrative data for e-Government; biochemical, genetic, pharmaceutical research; combination of online library data; etc. Most of these data are accessible from the Web (though not necessarily public yet).

This problem you know very well…

Example: social networks. Social sites are everywhere these days (LinkedIn, Facebook, Dopplr, Digg, Plexo, Zyb, …). Data is not interchangeable: how many times did you have to add your contacts? Applications should be able to get to those data via standard means; there are, of course, privacy issues…

Example: digital libraries. A sort of catalogue on the Web: librarians have known how to do that for centuries; the goal is to have this on the Web, world-wide, and to extend it to multimedia data, too. But it is more: software agents should also be librarians, e.g., help you find the right publications!

What is needed? (Some) data should be available for machines for further processing. Data should possibly be combined and merged on a Web scale. Machines may also need to reason about that data. Create a Web of Data (beyond the Web of Documents).

Find the right experts at NASA. Expertise locater for nearly 70,000 NASA civil servants, integrating 6 or 7 geographically distributed databases, data sources, and web services… They use internal ontologies/vocabularies to describe the knowledge areas, and a combination of the RDF data and that ontology to search through the (integrated) databases for a specific area of expertise. The dump is from a faceted browser developed by the company to view result data. Michael Grove, Clark & Parsia, LLC, and Andrew Schain, NASA (SWEO Case Study)

So what is the Semantic Web?

It is, essentially, the Web of Data. “Semantic Web Technologies” is a collection of standard technologies to realize a Web of Data.

It is that simple… Of course, the devil is in the details: a common model has to be provided for machines to describe, query, etc., the data and their connections; the “classification” of the terms can become very complex for specific knowledge areas: this is where ontologies, thesauri, etc., enter the game…

In what follows… We will use a simplistic example to introduce the main technical concepts. The details will come later in the course.

The rough structure of data integration. Map the various data onto an abstract data representation, making the data independent of its internal representation… Merge the resulting representations. Start making queries on the whole: queries that could not have been made on the individual data sets!

A simplified bookstore data (dataset “A”)

1st: export your data as a set of relations

Some notes on exporting the data. Relations form a graph: the nodes refer to the “real” data or contain some literal; how the graph is represented in the machine is immaterial for now. Data export does not necessarily mean physical conversion of the data: relations can be generated on-the-fly at query time, via SQL “bridges”, scraping HTML pages, extracting data from Excel sheets, etc. One can export part of the data.
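The idea of an SQL “bridge” that generates relations at query time can be sketched roughly as follows. This is only an illustration: the table layout, the column names, the year value and the example.org base URI are assumptions, not part of the original bookstore data.

```python
import sqlite3

# Hypothetical base URI and table layout; the slides do not specify either.
BASE = "http://example.org/isbn/"


def export_relations(conn):
    """Yield (subject, property, object) relations on the fly.

    Nothing is converted or stored: the relations exist only while
    the caller iterates over the query result.
    """
    query = "SELECT isbn, title, year FROM books"
    for isbn, title, year in conn.execute(query):
        subject = BASE + isbn
        # Only two columns are exported: one can export part of the data.
        yield (subject, "a:title", title)
        yield (subject, "a:year", year)


# Illustrative in-memory database standing in for the bookstore.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE books (isbn TEXT, title TEXT, year INTEGER)")
conn.execute("INSERT INTO books VALUES ('000651409X', 'The Glass Palace', 2000)")

relations = list(export_relations(conn))
for relation in relations:
    print(relation)
```

Each relation is an edge of the graph: the subject is a node, the property labels the edge, and the object is either another node or a literal.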

Another bookstore data (dataset “F”)

2nd: export your second set of data

3rd: start merging your data

3rd: start merging your data (cont.)

3rd: merge identical resources

Start making queries… A user of data “F” can now ask queries like: “give me the title of the original” (well, … « donne-moi le titre de l’original »). This information is not in dataset “F”… but can be retrieved by merging with dataset “A”!
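A minimal sketch of this merge-then-query step, with each dataset held as a plain set of (subject, property, object) tuples. The example.org URIs and the English title are illustrative assumptions; the property names follow the slides.

```python
# Illustrative data; the example.org URIs and the English title are assumptions.
A_BOOK = "http://example.org/isbn/000651409X"
F_BOOK = "http://example.org/isbn/2020386682"

dataset_a = {(A_BOOK, "a:title", "The Glass Palace")}
dataset_f = {
    (F_BOOK, "f:titre", "Le palais des miroirs"),
    (F_BOOK, "f:original", A_BOOK),
}


def objects(graph, subject, prop):
    """Return every o such that (subject, prop, o) is in the graph."""
    return {o for (s, p, o) in graph if s == subject and p == prop}


def title_of_original(graph, book):
    """Follow f:original, then read a:title on the node that is reached."""
    return {
        title
        for original in objects(graph, book, "f:original")
        for title in objects(graph, original, "a:title")
    }


# Merging is a set union: nodes with identical URIs become one node.
merged = dataset_a | dataset_f

print(title_of_original(dataset_f, F_BOOK))  # nothing: "F" alone has no a:title
print(title_of_original(merged, F_BOOK))
```

The query needs no knowledge of where each relation came from; it only walks the merged graph.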

However, more can be achieved… We “feel” that a:author and f:auteur should be the same. But an automatic merge does not know that! Let us add some extra information to the merged data: a:author is the same as f:auteur; both identify a “Person”, a term that a community may have already defined: a “Person” is uniquely identified by his/her name and, say, homepage; it can be used as a “category” for certain types of resources.

3rd revisited: use the extra knowledge

Start making richer queries! A user of dataset “F” can now query: « donne-moi la page d’accueil de l’auteur de l’original » (well… “give me the home page of the original’s ‘auteur’”). The information is not in datasets “F” or “A”… but was made available by merging datasets “A” and “F” and adding three simple extra statements as an extra “glue”.
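The effect of the “glue” can be sketched as below. Only the “a:author same as f:auteur” statement is modelled; the two “Person” statements are left out for brevity. The `same-as` label, the example.org URIs and the homepage value are hypothetical.

```python
# Illustrative merged data; all example.org URIs are assumptions.
A_BOOK = "http://example.org/isbn/000651409X"
F_BOOK = "http://example.org/isbn/2020386682"
AUTHOR = "http://example.org/person/author-1"

merged = {
    (F_BOOK, "f:original", A_BOOK),
    (A_BOOK, "a:author", AUTHOR),
    (AUTHOR, "a:homepage", "http://example.org/home"),
}

# The extra "glue": a statement about the properties themselves.
glue = {("a:author", "same-as", "f:auteur")}


def equivalents(prop, glue):
    """Return prop plus every property declared the same as it."""
    names = {prop}
    changed = True
    while changed:
        changed = False
        for x, relation, y in glue:
            if relation != "same-as":
                continue
            for known, other in ((x, y), (y, x)):
                if known in names and other not in names:
                    names.add(other)
                    changed = True
    return names


def objects(graph, subject, prop, glue=frozenset()):
    """Objects reachable from subject via prop or any equivalent property."""
    props = equivalents(prop, glue)
    return {o for (s, p, o) in graph if s == subject and p in props}


def homepage_of_original_auteur(graph, book, glue=frozenset()):
    """Answer the query phrased with the French property f:auteur."""
    return {
        page
        for original in objects(graph, book, "f:original", glue)
        for person in objects(graph, original, "f:auteur", glue)
        for page in objects(graph, person, "a:homepage", glue)
    }


print(homepage_of_original_auteur(merged, F_BOOK))        # nothing without glue
print(homepage_of_original_auteur(merged, F_BOOK, glue))  # found with glue
```

Without the glue the query finds nothing, because the original book only carries a:author; with it, f:auteur and a:author are treated as one property.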

Combine with different datasets. Using, e.g., the “Person”, the dataset can be combined with other sources. For example, data in Wikipedia can be extracted using dedicated tools; e.g., the “dbpedia” project can already extract the “infobox” information from Wikipedia…

Merge with Wikipedia data

Merge with Wikipedia data

Merge with Wikipedia data

Is that surprising? It may look like it but, in fact, it should not be… What happened via automatic means is done every day by Web users! The difference: a bit of extra rigour so that machines could do this, too

What did we do? We combined different datasets that are somewhere on the web, are of different formats (MySQL, Excel sheet, XHTML, etc.), and have different names for relations. We could combine the data because some URI-s were identical (the ISBN-s in this case). We could add some simple additional information (the “glue”), possibly using common terminologies that a community has produced. As a result, new relations could be found and retrieved.

It could become even more powerful. We could add extra knowledge to the merged datasets, e.g., a full classification of various types of library data, geographical information, etc. This is where ontologies, extra rules, etc., come in; ontologies/rule sets can be relatively simple and small, or huge, or anything in between… Even more powerful queries can be asked as a result.

What did we do? (cont)

The Basis: RDF

RDF triples. Let us begin to formalize what we did! We “connected” the data… but a simple connection is not enough… data should be named somehow; hence the RDF Triples: a labelled connection between two resources.

RDF triples (cont.) An RDF Triple (s,p,o) is such that: “s” and “p” are URI-s, i.e., resources on the Web; “o” is a URI or a literal. “s”, “p”, and “o” stand for “subject”, “property”, and “object”. Here is the complete triple: (<http://…isbn…6682>, <http://…/original>, <http://…isbn…409X>). RDF is a general model for such triples (with machine readable formats like RDF/XML, Turtle, N3, RXR, …).
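The definition above can be written down as a small data structure. This is a sketch, not an RDF library: the `Literal` and `Triple` names are introduced here for illustration, the URI check is deliberately rough (it only looks for a scheme), and the example.org URIs are placeholders.

```python
from typing import NamedTuple, Union
from urllib.parse import urlparse


class Literal(NamedTuple):
    """A literal value with an optional language tag."""
    value: str
    lang: str = ""


class Triple(NamedTuple):
    """(s, p, o): subject, property, object."""
    subject: str
    prop: str
    obj: Union[str, Literal]


def is_uri(term):
    """Rough check: a string with a scheme such as 'http'. Not a full validator."""
    return isinstance(term, str) and bool(urlparse(term).scheme)


def is_valid(triple):
    """s and p must be URIs; o may be a URI or a literal."""
    s, p, o = triple
    return is_uri(s) and is_uri(p) and (is_uri(o) or isinstance(o, Literal))


# Placeholder URIs standing in for the elided ones on the slide.
BOOK = "http://example.org/isbn/2020386682"
ORIGINAL = "http://example.org/isbn/000651409X"

link = Triple(BOOK, "http://example.org/original", ORIGINAL)
title = Triple(BOOK, "http://example.org/titre", Literal("Le palais des miroirs", "fr"))
broken = Triple(Literal("not a resource"), "http://example.org/titre", BOOK)

print(is_valid(link), is_valid(title), is_valid(broken))
```

A literal is accepted only in the object position, which is why the last triple is rejected.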

RDF triples (cont.) Resources can use any URI, e.g.:
    http://www.example.org/file.xml#element(home)
    http://www.example.org/file.html#home
    http://www.example.org/file2.xml#xpath1(//q[@a=b])
URI-s can also denote non-Web entities: http://www.ivan-herman.net/me is me: not my home page, not my publication list, but me. RDF triples form a directed, labelled graph.

A simple RDF example (in RDF/XML):
    <rdf:Description rdf:about="http://…/isbn/2020386682">
      <f:titre xml:lang="fr">Le palais des miroirs</f:titre>
      <f:original rdf:resource="http://…/isbn/000651409X"/>
    </rdf:Description>
(Note: namespaces are used to simplify the URI-s.)

A simple RDF example (in Turtle):
    <http://…/isbn/2020386682>
        f:titre "Le palais des miroirs"@fr ;
        f:original <http://…/isbn/000651409X> .
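To show how the same triples map onto a concrete syntax, here is a rough sketch of a writer that groups triples by subject and emits Turtle with full URIs (no prefixes). The function names and the example.org URIs are assumptions; only basic string escaping is handled, and blank nodes and typed literals are not.

```python
from collections import defaultdict


def term(t):
    """Format a URI string as <...> and a (text, lang) pair as a quoted literal."""
    if isinstance(t, tuple):
        text, lang = t
        escaped = (
            text.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
        )
        return f'"{escaped}"@{lang}' if lang else f'"{escaped}"'
    return f"<{t}>"


def to_turtle(triples):
    """Serialize triples, sharing the subject with ';' as in the slide."""
    by_subject = defaultdict(list)
    for s, p, o in triples:
        by_subject[s].append((p, o))
    blocks = []
    for s, pairs in by_subject.items():
        body = " ;\n    ".join(f"{term(p)} {term(o)}" for p, o in pairs)
        blocks.append(f"{term(s)}\n    {body} .")
    return "\n".join(blocks)


# Placeholder URIs standing in for the elided ones on the slide.
BOOK = "http://example.org/isbn/2020386682"
triples = [
    (BOOK, "http://example.org/f/titre", ("Le palais des miroirs", "fr")),
    (BOOK, "http://example.org/f/original", "http://example.org/isbn/000651409X"),
]

turtle = to_turtle(triples)
print(turtle)
```

The model stays the same whichever syntax is written out; only the `term` and `to_turtle` helpers would change for another format.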

“Internal” nodes. Consider the following statement: “the publisher is a «thing» that has a name and an address”. Until now, nodes were identified with a URI. But… what is the URI of «thing»?

Internal identifier (“blank nodes”). In RDF/XML:
    <rdf:Description rdf:about="http://…/isbn/000651409X">
      <a:publisher rdf:nodeID="A234"/>
    </rdf:Description>
    <rdf:Description rdf:nodeID="A234">
      <a:p_name>HarpersCollins</a:p_name>
      <a:city>HarpersCollins</a:city>
    </rdf:Description>
In Turtle:
    <http://…/isbn/000651409X> a:publisher _:A234 .
    _:A234 a:p_name "HarpersCollins" .
The syntax is serialization dependent. A234 is invisible from outside (it is not a “real” URI!); it is an internal identifier for a resource.

Blank nodes: the system can also do it. Let the system create a “nodeID” internally (you do not really care about the name…):
    <rdf:Description rdf:about="http://…/isbn/000651409X">
      <a:publisher>
        <rdf:Description>
          <a:p_name>HarpersCollins</a:p_name>
          …
        </rdf:Description>
      </a:publisher>
    </rdf:Description>

Same in Turtle:
    <http://…/isbn/000651409X> a:publisher [
        a:p_name "HarpersCollins" ;
        …
    ] .

Blank nodes: some more remarks. Blank nodes require attention when merging: blank nodes with identical nodeID-s in different graphs are different, so implementations must be careful… Many applications prefer not to use blank nodes and define new URI-s “on-the-fly”.
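One way an implementation can be “careful” is to rename blank nodes per graph before taking the union. The sketch below assumes blank nodes are written as strings starting with `_:` (so a literal that happens to start with `_:` would be misread); the second book, its ISBN and its publisher name are made up for illustration.

```python
def is_blank(term):
    """Blank nodes are written here as strings starting with '_:' (an assumption)."""
    return isinstance(term, str) and term.startswith("_:")


def standardize_apart(graph, tag):
    """Rename every blank node so that its label is unique to this graph."""
    def rename(term):
        return f"_:{tag}_{term[2:]}" if is_blank(term) else term

    return {(rename(s), p, rename(o)) for (s, p, o) in graph}


def merge(*graphs):
    """Union of the graphs, keeping blank nodes of different graphs distinct."""
    merged = set()
    for index, graph in enumerate(graphs):
        merged |= standardize_apart(graph, f"g{index}")
    return merged


# Two graphs that both happen to use the internal identifier A234.
g1 = {
    ("http://example.org/isbn/000651409X", "a:publisher", "_:A234"),
    ("_:A234", "a:p_name", "HarpersCollins"),
}
g2 = {
    ("http://example.org/isbn/0000000000", "a:publisher", "_:A234"),  # made-up book
    ("_:A234", "a:p_name", "Another Publisher"),
}

naive = g1 | g2
safe = merge(g1, g2)

# In the naive union a single node wrongly carries both publisher names.
print({o for (s, p, o) in naive if s == "_:A234" and p == "a:p_name"})
print(sorted({s for (s, p, o) in safe if is_blank(s)}))
```

Minting a fresh URI for each publisher instead, as many applications do, avoids the renaming step altogether.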