Introduction to Semantic Web What? Why? How? So far? Next? Frank van Harmelen AI Department Vrije Universiteit Amsterdam Creative Commons License: allowed.

Presentation transcript:

Introduction to Semantic Web What? Why? How? So far? Next? Frank van Harmelen AI Department Vrije Universiteit Amsterdam Creative Commons License: allowed to share & remix, but must attribute & non-commercial

Who am I? Frank van Harmelen: Professor of AI at Vrije Universiteit Amsterdam; Knowledge Representation; early Semantic Web projects (since 1999); co-designed OWL; tech advisor of Aduna (Sesame); Scientific Director of LarKC (Large Knowledge Collider). I know nothing about image analysis…

Who are you? Who knows roughly what the Semantic Web is? Who has heard of RDF & OWL? Who has studied RDF & OWL? Who has used RDF & OWL? Who expects ever to use RDF & OWL? Who is a logician? Who is a KR researcher? Who is a Web researcher? Who is an image researcher?

General idea of the Semantic Web

General idea of Semantic Web: Make the current web more machine accessible (currently all the intelligence is in the user). Motivating use-cases: search, personalisation, semantic linking, data integration, web services, ...

General idea of Semantic Web. Do this by: 1. Making data and meta-data available on the Web in machine-understandable form (formalised). 2. Structuring the data and meta-data in ontologies. These are non-trivial design decisions. Alternative would be: Make current web more machine accessible (currently all the intelligence is in the user)

What's wrong with the Web? Linked web pages: written by people, written for people, used only by people... Many of these pages already come from data, usable by computers! But we can't link the data... Linked data: usable by computers, useful for people!

"Web of Data" (TBL) 1.expose data on the web (facts) in interoperable form (RDF) 2.expose knowledge on the web with interoperable semantics (ontologies, RDF Schema, OWL) 3.Apply lightweight inference for Interoperability Query answering Search Unexpected reuse … Semantic Web

Not just data, also knowledge. All of this: low-expressivity logic (RDF) that allows some inference: property inheritance, domain/range inference. Some of this: medium-expressivity logic (OWL) that allows more inference: (in)equality, number restrictions, datatypes.

Desideratum: On the Web of Data, anyone can say anything about anything. Need for total decoupling of data, vocabulary and meta-data: in a statement x [IsOfType] T, each part can have different owners & locations.

Two versions of the Semantic Web story. V1: Semantic Web = annotated Web; 1 & 2 are embedded in text & images on the Web. V2: Semantic Web = Web of Data; 1 & 2 live in dedicated repositories (triple stores). In a statement x [IsOfType] T, each part can have different owners & locations.

Why is this hard?

Machine-accessible meaning (what it's like to be a machine). [Figure labels: disease, symptoms, drug, administration; relations: IS-A, alleviates; META-DATA]

What is meta-data? It's just data. It's data describing other data. It's meant for machine consumption. [Figure labels: disease name, symptoms, drug, administration]

What is required?

Required are: 1. one or more standard vocabularies, so search engines, producers and consumers all speak the same language; 2. a standard syntax, so meta-data can be recognised as such; 3. lots of resources with meta-data attached.

Bluffers Guide to RDF & RDF Schema

Bluffers Guide to RDF. Express relations between things as Subject, Predicate, Object. Results in a labelled network (graph). All labels are actually web addresses (URIs). You can ping any label and find out more. Bits of the graph can live at physically different locations & have different owners. Example: Franky AuthorOf x; x publishedBy MIT.
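
The slide above can be sketched in a few lines of plain Python. This is an illustration only, not a real RDF library: the `http://example.org/` URIs, the two-graph split and the helper names `objects` and `publishers_of_works_by` are all assumptions made for the example.

```python
# Each statement is a (subject, predicate, object) triple; every label is a URI.
# All example.org URIs below are hypothetical placeholders.
EX = "http://example.org/"

# Two partial graphs, imagined as published by different owners at different locations.
graph_a = {(EX + "Franky", EX + "authorOf", EX + "x")}
graph_b = {(EX + "x", EX + "publishedBy", EX + "MIT")}

# Because both graphs use the same URI for x, merging them is a plain set union.
merged = graph_a | graph_b


def objects(graph, subject, predicate):
    """Return every o such that (subject, predicate, o) is in the graph."""
    return {o for (s, p, o) in graph if s == subject and p == predicate}


def publishers_of_works_by(graph, author):
    """Follow authorOf and then publishedBy: a two-step walk through the graph."""
    result = set()
    for work in objects(graph, author, EX + "authorOf"):
        result |= objects(graph, work, EX + "publishedBy")
    return result
```

Neither graph alone can say who publishes Franky's work; only the merged graph can, which is the point of letting bits of the graph live in different places.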

Bluffers Guide to RDF Schema. Types for subjects & objects & predicates. Types organised in a hierarchy. Inheritance of properties. Example: Franky AuthorOf x; x publishedBy MIT. Example types: author, book, publisher, person, artifact, man.

So what's special about RDF(S)? Statements about an identifier can be distributed. No unique name assumption. No closed world assumption. Remember web-style decoupling.

Remember: need for total decoupling of data, vocabulary and meta-data: in a statement x [IsOfType] T, each part can have different owners & locations.

RDF(S) has a (very small) formal semantics. Defines what other statements are implied by a given set of RDF(S) statements. Ensures mutual agreement on minimal content between parties without further contact. In the form of entailment rules. Very simple to compute (and not explosive in practice).

RDF(S) semantics: examples
Aspirin isOfType Painkiller + Painkiller subClassOf Drug => Aspirin isOfType Drug
aspirin alleviates headache + alleviates range symptom => headache isOfType symptom

RDF(S) semantics: examples [Figure: graph with edges labelled isOfType, subClassOf, isOfType, range, isOfType]

RDF(S) semantics
X R Y + R domain T => X IsOfType T
X R Y + R range T => Y IsOfType T
T1 SubClassOf T2 + T2 SubClassOf T3 => T1 SubClassOf T3
X IsOfType T1 + T1 SubClassOf T2 => X IsOfType T2
Semantics = predictable inference
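
The four entailment rules can be sketched as a naive forward-chaining loop. This is a minimal illustration in plain Python, assuming short strings in place of URIs and reading the last rule as concluding X IsOfType T2. The function name `rdfs_closure` is hypothetical, and real RDFS reasoners implement more rules than these four.

```python
def rdfs_closure(triples):
    """Apply the four rules repeatedly until no new triple is derived."""
    inferred = set(triples)
    while True:
        new = set()
        for (s, p, o) in inferred:
            for (s2, p2, o2) in inferred:
                # X R Y + R domain T  =>  X isOfType T
                if p2 == "domain" and s2 == p:
                    new.add((s, "isOfType", o2))
                # X R Y + R range T  =>  Y isOfType T
                if p2 == "range" and s2 == p:
                    new.add((o, "isOfType", o2))
                # T1 subClassOf T2 + T2 subClassOf T3  =>  T1 subClassOf T3
                if p == "subClassOf" and p2 == "subClassOf" and o == s2:
                    new.add((s, "subClassOf", o2))
                # X isOfType T1 + T1 subClassOf T2  =>  X isOfType T2
                if p == "isOfType" and p2 == "subClassOf" and o == s2:
                    new.add((s, "isOfType", o2))
        if new <= inferred:  # fixpoint reached: nothing new was derived
            return inferred
        inferred |= new


# The aspirin statements from the examples slide.
facts = {
    ("aspirin", "isOfType", "Painkiller"),
    ("Painkiller", "subClassOf", "Drug"),
    ("aspirin", "alleviates", "headache"),
    ("alleviates", "range", "Symptom"),
}
derived = rdfs_closure(facts) - facts
```

Here `derived` holds exactly the two conclusions from the examples slide: aspirin is a Drug, and headache is a Symptom. The nested loop is quadratic per pass, which is fine for a sketch but not how a triple store would do it.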

Bluffers Guide to OWL

OWL: things RDF Schema can't do: equality; enumeration; number restrictions (single-valued/multi-valued, optional/required values); inverse, symmetric, transitive; boolean algebra (union, complement); …

Layered language. OWL Lite: classification hierarchy, simple constraints. OWL DL: maximal expressiveness while maintaining tractability; standard formalisation. OWL Full: very high expressiveness, losing tractability; non-standard formalisation; all syntactic freedom of RDF (self-modifying). Syntactic layering and semantic layering: Lite within DL within Full.

Language Layers (Full, DL, Lite). OWL Full: allows meta-classes etc. OWL DL: negation, disjunction, full cardinality, enumerated types. OWL Lite: (sub)classes, individuals; (sub)properties, domain, range; conjunction; (in)equality; cardinality 0/1; datatypes; inverse, transitive, symmetric; hasValue, someValuesFrom, allValuesFrom. [Figure label: RDF Schema]

Backward compatibility with RDF: OWL agents understand everything… others still understand the most important aspects.

OWL also has a formal semantics. Defines what other statements are implied by a given set of statements. Ensures mutual agreement on content (both minimal and maximal) between parties without further contact. Can be used for integrity/consistency checking. Hard to compute (and rarely/sometimes/always explosive in practice).

OWL semantics: minimal
vanGogh isOfType Impressionist + Impressionist subClassOf Painter => vanGogh isOfType Painter
vanGogh painter-of sunflowers + painter-of domain painter => vanGogh isOfType painter

OWL semantics: maximal
vanGogh isOfType Impressionist + Impressionist disjointFrom Cubist => NOT: vanGogh isOfType Cubist
painted-by has-cardinality 1 + sun-flowers painted-by vanGogh + Picasso different-individual-from vanGogh => NOT: sun-flowers painted-by Picasso
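
The "maximal" side of the semantics, statements that are ruled out, can be sketched as a small consistency check. This is plain Python with strings in place of URIs, not an OWL reasoner. The function name `find_conflicts` and the tuple layout of its results are assumptions made for the example; the predicate names are taken from the slide.

```python
from itertools import combinations


def find_conflicts(triples):
    """Report violations of disjointFrom and has-cardinality 1 statements."""
    types, values = {}, {}
    disjoint, different, single_valued = [], set(), set()
    for s, p, o in triples:
        if p == "isOfType":
            types.setdefault(s, set()).add(o)
        elif p == "disjointFrom":
            disjoint.append((s, o))
        elif p == "different-individual-from":
            different.add(frozenset((s, o)))
        elif p == "has-cardinality" and o == 1:
            single_valued.add(s)
        else:
            values.setdefault((s, p), set()).add(o)

    found = []
    # An individual may not belong to two classes declared disjoint.
    for individual, classes in types.items():
        for a, b in disjoint:
            if a in classes and b in classes:
                found.append(("disjoint", individual, a, b))
    # A single-valued property may not have two values known to be different.
    for (s, p), objs in values.items():
        if p in single_valued:
            for x, y in combinations(sorted(objs), 2):
                if frozenset((x, y)) in different:
                    found.append(("cardinality", s, p, x, y))
    return found


# The statements from the slide; on their own they are consistent.
base = [
    ("vanGogh", "isOfType", "Impressionist"),
    ("Impressionist", "disjointFrom", "Cubist"),
    ("painted-by", "has-cardinality", 1),
    ("sun-flowers", "painted-by", "vanGogh"),
    ("Picasso", "different-individual-from", "vanGogh"),
]
```

Adding either of the slide's "NOT" statements to `base` makes `find_conflicts` return a non-empty list. Note the role of `different-individual-from`: since there is no unique name assumption, two values of a single-valued property are a conflict only when they are known to be different individuals; otherwise a reasoner would conclude they are the same.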

Remember: Required are: 1. standard vocabularies; 2. a standard syntax; 3. lots of resources with meta-data attached.

Ontologies: real life examples. Handcrafted. Music: CDnow (2410/5), MusicMoz (1073/7). Biomedical: SNOMED (200k), GO (15k), Emtree (45k+190k), Systems biology. Ranging from lightweight (Yahoo, UNSPSC, Open Directory (400k)) to heavyweight (Cyc (300k)). Ranging from small (METAR) to large (UNSPSC).

Biomedical ontologies (a few..). MeSH: Medical Subject Headings, National Library of Medicine; descriptions. EMTREE: commercial (Elsevier); drugs and diseases; terms, synonyms. UMLS: integrates 100 different vocabularies. SNOMED: concepts; College of American Pathologists. Gene Ontology: terms in molecular biology. NCBI Cancer Ontology: 17,000 classes (about 1M definitions).

Remember: Required are: 1. standard vocabularies; 2. a standard syntax; 3. lots of resources with meta-data attached.

Who makes the meta-data? Don't throw away what we already have: databases (Amazon.com); navigation structures; meta-data in documents (Office, Acrobat, MP3, jpg). As a spin-off of what we already do: MIT Media Lab photo annotator. Automated analysis: text, images, video.

Summary so far

Linked Data/Semantic Web. Identification: Uniform Resource Identifier (URI); global identifier (NB: persistent!); looks like a URL, is often an Internationalized Resource Identifier (IRI). Description: Resource Description Framework (RDF); RDF Schema (RDFS); Simple Knowledge Organization System (SKOS); Web Ontology Language (OWL). Querying RDF: triple stores; SPARQL Query Language.

What does RDF look like? The data model is a (directed) graph. Each data item is a resource with a URI as identifier. Each property is a binary relation (a triple), either between two resources or between a resource and a literal.

Why is this a Web of data? Globally unique identifiers. Reuse of identifiers in other datasets. For data: two sources say something about Amsterdam. For schema: two sources each use the same concept City. This reuse builds links between datasets.
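
How reuse of identifiers builds links can be sketched by intersecting the URIs that two datasets use. Everything below is illustrative: both datasets are made up, the `example.org` URIs are hypothetical, and the helper `identifiers` is an assumption. Only the `rdf:type` and `rdfs:label` URIs are the real W3C ones.

```python
# Real W3C vocabulary URIs.
TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
LABEL = "http://www.w3.org/2000/01/rdf-schema#label"

# Hypothetical identifiers, invented for this sketch.
AMSTERDAM = "http://example.org/id/Amsterdam"
UTRECHT = "http://example.org/id/Utrecht"
NETHERLANDS = "http://example.org/id/Netherlands"
CITY = "http://example.org/schema/City"
IN_COUNTRY = "http://example.org/geo/inCountry"

# Two made-up datasets from two imagined sources.
dataset_geo = {
    (AMSTERDAM, TYPE, CITY),
    (AMSTERDAM, IN_COUNTRY, NETHERLANDS),
}
dataset_names = {
    (AMSTERDAM, LABEL, "Amsterdam"),  # object is a literal, not a resource
    (UTRECHT, TYPE, CITY),
}


def identifiers(dataset):
    """Collect every URI used as subject, predicate or object; skip literals."""
    return {
        term
        for triple in dataset
        for term in triple
        if isinstance(term, str) and term.startswith("http://")
    }


# The shared URIs are the links between the two datasets.
shared = identifiers(dataset_geo) & identifiers(dataset_names)
```

In this toy case `shared` contains the Amsterdam URI (a link at the data level) and the City and `rdf:type` URIs (links at the schema level). Treating any string that starts with `http://` as a URI is a shortcut for the sketch; a real RDF library distinguishes resources from literals by type.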

Does this work in practice?

Linked Open Data cloud: already many billions of facts & rules. Encyclopedia; geographic names (millions); names of artists & art works (10,000s); scientific bibliographies; hierarchical dictionaries (UK, FR, NL); life-science databases; (almost) any CD ever recorded; basic facts on every country on the planet; common sense rules & facts. May 09 estimate: > 4.2 billion triples, … million interlinks. It gets bigger every month.

And remember: not just data. All of this: low-expressivity logic (RDF/RDFS) that allows some inference: property inheritance, domain/range inference. Some of this: medium-expressivity logic (OWL) that allows more inference: (in)equality, number restrictions, datatypes.

Nice in the lab, but are you getting anywhere in practice?

Semantic Web News Quiz: Google; Reuters; New York Times; Microsoft; Zemanta; Obama Government; BBC (music, worldcup, wildlife); BestBuy.com; Facebook.

Challenges

What to do when success is becoming a problem? Heterogeneity: ontology mapping, instance identification. Scale (10^10 statements). Dynamics, versioning (Flickr: 3000 pictures/minute, Wikipedia: 100 edits/minute). Trust, attribution, provenance. Multimedia: in both directions.