Adding Semantics to the Web Zachary G. Ives University of Pennsylvania CIS 650 – Database & Information Systems April 11, 2005.

Slides:



Advertisements
Similar presentations
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
Advertisements

CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
An Introduction to Description Logics
Semantic Web Thanks to folks at LAIT lab Sources include :
An Introduction to RDF(S) and a Quick Tour of OWL
CS570 Artificial Intelligence Semantic Web & Ontology 2
Ontological Logic Programming by Murat Sensoy, Geeth de Mel, Wamberto Vasconcelos and Timothy J. Norman Computing Science, University of Aberdeen, UK 1.
SIG2: Ontology Language Standards WebOnt Briefing Ian Horrocks University of Manchester, UK.
Of 27 lecture 7: owl - introduction. of 27 ece 627, winter ‘132 OWL a glimpse OWL – Web Ontology Language describes classes, properties and relations.
OWL TUTORIAL APT CSA 3003 OWL ANNOTATOR Charlie Abela CSAI Department.
Analyzing Minerva1 AUTORI: Antonello Ercoli Alessandro Pezzullo CORSO: Seminari di Ingegneria del SW DOCENTE: Prof. Giuseppe De Giacomo.
Ontology Notes are from:
Ontologies and the Semantic Web by Ian Horrocks presented by Thomas Packer 1.
Chapter 8: Web Ontology Language (OWL) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
Descriptions Robert Grimm New York University. The Final Assignment…  Your own application  Discussion board  Think: Paper summaries  Web cam proxy.
Web Semantics: KB vs. DB Zachary G. Ives University of Pennsylvania CIS 650 – Database & Information Systems April 13, 2005.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
USCISIUSCISI Loom: Basic Concepts Thomas A. Russ USC Information Sciences Institute.
Semantic Web Presented by: Edward Cheng Wayne Choi Tony Deng Peter Kuc-Pittet Anita Yong.
From SHIQ and RDF to OWL: The Making of a Web Ontology Language
Google and Scalable Query Services
ANHAI DOAN ALON HALEVY ZACHARY IVES Chapter 12: Ontologies and Knowledge Representation PRINCIPLES OF DATA INTEGRATION.
FiRE Fuzzy Reasoning Engine Nikolaos Simou National Technical University of Athens.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Chapter 6 Understanding Each Other CSE 431 – Intelligent Agents.
An Introduction to Description Logics. What Are Description Logics? A family of logic based Knowledge Representation formalisms –Descendants of semantic.
Knowledge representation
Protege OWL Plugin Short Tutorial. OWL Usage The world wide web is a natural application area of ontologies, because ontologies could be used to describe.
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
Ontologies for the Integration of Geospatial Data Michael Lutz Workshop: Semantics and Ontologies for GI Services, 2006 Paper: Lutz et al., Overcoming.
RDF and OWL Developing Semantic Web Services by H. Peter Alesso and Craig F. Smith CMPT 455/826 - Week 6, Day Sept-Dec 2009 – w6d21.
Ming Fang 6/12/2009. Outlines  Classical logics  Introduction to DL  Syntax of DL  Semantics of DL  KR in DL  Reasoning in DL  Applications.
Building an Ontology of Semantic Web Techniques Utilizing RDF Schema and OWL 2.0 in Protégé 4.0 Presented by: Naveed Javed Nimat Umar Syed.
Michael Eckert1CS590SW: Web Ontology Language (OWL) Web Ontology Language (OWL) CS590SW: Semantic Web (Winter Quarter 2003) Presentation: Michael Eckert.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
An Introduction to Description Logics (chapter 2 of DLHB)
Semantic Web - an introduction By Daniel Wu (danielwujr)
Advanced topics in software engineering (Semantic web)
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
More on Description Logic(s) Frederick Maier. Note Added 10/27/03 So, there are a few errors that will be obvious to some: So, there are a few errors.
Artificial Intelligence 2004 Ontology
DAML+OIL: an Ontology Language for the Semantic Web.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
OilEd An Introduction to OilEd Sean Bechhofer. Topics we will discuss Basic OilEd use –Defining Classes, Properties and Individuals in an Ontology –This.
Semantic Web BY: Josh Rachner and Julio Pena. What is the Semantic Web? The semantic web is a part of the world wide web that allows data to be better.
Description Logics Dr. Alexandra I. Cristea. Description Logics Description Logics allow formal concept definitions that can be reasoned about to be expressed.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Web Ontology Language (OWL). OWL The W3C Web Ontology Language (OWL) is a Semantic Web language designed to represent rich and complex knowledge about.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Presented by Kyumars Sheykh Esmaili Description Logics for Data Bases (DLHB,Chapter 16) Semantic Web Seminar.
Of 29 lecture 15: description logic - introduction.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST WP4: Ontology Engineering Heiner Stuckenschmidt, Michel Klein Vrije Universiteit.
LDK R Logics for Data and Knowledge Representation Description Logics: family of languages.
Department of Mathematics Computer and Information Science1 CS 351: Database Management Systems Christopher I. G. Lanclos Chapter 4.
Ontology Technology applied to Catalogues Paul Kopp.
Ccs.  Ontologies are used to capture knowledge about some domain of interest. ◦ An ontology describes the concepts in the domain and also the relationships.
The Anatomy of a Large-Scale Hypertextual Web Search Engine S. Brin and L. Page, Computer Networks and ISDN Systems, Vol. 30, No. 1-7, pages , April.
1 CS122A: Introduction to Data Management Lecture #4 (E-R  Relational Translation) Instructor: Chen Li.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
1 Representing and Reasoning on XML Documents: A Description Logic Approach D. Calvanese, G. D. Giacomo, M. Lenzerini Presented by Daisy Yutao Guo University.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
The Semantic Web By: Maulik Parikh.
Google and Scalable Query Services
Ontology.
ece 720 intelligent web: ontology and beyond
Ontology.
Presentation transcript:

Adding Semantics to the Web Zachary G. Ives University of Pennsylvania CIS 650 – Database & Information Systems April 11, 2005

2 Administrivia  Next readings and summaries:  Wednesday – First two sections of the Piazza paper  Summarize the goals, key ideas, and challenges  Reduced reading so you can work on the project!

3 Today’s Trivia Question

4 Last Time…  We were discussing Google  Main features:  Commodity hardware  Fast, transparent failover  Replication and partitioning  No requirement that all of the replicas be consistent with one another, as long as each user only sees a consistent image  Allows them to update one replica set, transition in others  The major components of computation:  PageRank (run offline)  Ranking of queries  Relatively parallelizable

5 Google’s Search Algorithm 1.Parse the query 2.Convert words into wordIDs 3.Seek to start of doclist in the short barrel for every word 4.Scan through the doclists until there is a document that matches all of the search terms 5.Compute the rank of that document 6.If we’re at the end of the short barrels, start at the doclists of the full barrel, unless we have enough 7.If not at the end of any doclist, goto step 4 8.Sort the documents by rank; return the top K

6 Ranking in Google  Considers many types of information:  Position, font size, capitalization  Anchor text  PageRank  Done offline, in a non-query-sensitive way  Count of occurrences (basically, TF) in a way that tapers off  Multi-word queries consider proximity also

7 Could We Build a DBMS for Google?  What would a DBMS for Google-like environments look like?  What would it be useful for, other than Google?

8 Beyond Google  What if we wanted to:  Add on-the-fly query capabilities to Google?  e.g., query over up-to-the-second stock market results  Use WordNet or some thesaurus to supplement Google?  Do PageRank in a topic-specific way?  Supplement Google with “ontology” info?  Do some sort of XML path matching along with keywords?  Allow for OLAP-style analysis?  Do a cooperative, e.g., P2P, Google?  Benefits of this?

9 Beyond the Web  The Web is mostly human-readable  … With some exceptions due to the adoption of XML, plus proprietary formats  Ideally, we’d like to be able to pose questions that go way beyond text matching, exploiting machine-readable data  What are the 5 tallest mountains?  How much has the stock market dropped since January?  What traits are known to be recessive in rats?  etc.  In a sense, the goal is to meet Vannevar Bush’s “Personal Memex” idea from 1945

10 The Semantic Web  The basic ideas:  Semantically annotated data (RDF)  Knowledge of concepts and relationships (ontologies, e.g., OWL)  Inferencing systems (based on KR tools)  Goal: allow very complex queries to be expressed; give best effort in answering them  “We make the language for the rules as expressive as needed to allow the Web to reason as widely as desired” (p. 38)  Berners-Lee, 2003: “The Semantic Web is data integration”  “The challenge of the Semantic Web … is to provide a language that expresses both data and rules for reasoning about the data and that allows rules from any existing knowledge-representation system to be exported onto the Web”

11 RDF: Resource Description Framework  Not too dissimilar in goal or style to XML  “Machine processable” data format  A ternary data model: everything is 3-ary relations  (Resource, Property, Value)  Resources are given unique URLs as global keys  Serialized in XML, in one of several formats

12 “It’s More Semantic” 5’10” 5’10” hasHeight plays hockey Joe 5’10” hasHeight plays hockey named

13 RDF vs. XML  What’s more semantic about RDF?  It requires us to specify entities and relationships, which we can omit in XML  Though someone who understands the XML can specify what the relationships are!  It encodes a number of concepts by default:  Universal identity  Reification  Basically, the specific class or statement becomes something that can be described at a meta-level  e.g., the name “Joe” is only true up to a particular point in time  How: we give the RDF description an ID  A number of default concepts (e.g., some types, descriptions, titles)

14 Ontologies: The Basis of the SW  Basically, a very fancy class hierarchy  “An explicit, shared, formal specification of the terms in the domain and relations among them”  Focus is on structural properties of a class, not methods  Elements of an ontology:  classes (aka concepts)  properties (aka slots, roles)  facets (aka role restrictions)

15 Classes, Properties, Facets, Reification  Classes are generally familiar  Properties may include:  intrinsic properties (of the object)  relationships to other entities (e.g., your parents)  parts (if structured)  Facets are basically the properties’ domains  value type, cardinality, …  “RDF Schema” describes these; think XML Schema in RDF, for RDF  Reification takes a class definition and makes it into an object:  (i,think,(mcintoshapple,has-color,red))

16 Description Logics (Borgida survey)  A class of languages based on FOL, like Datalog, Prolog  Key questions: subsumption of classes, recognition of members of classes  Prolog allows us to reason about instances:  ParentOf(liz,andy).Male(andy).  Child(_x) :- ParentOf(_z, _x)  Son(_y) :- Male(_y), ParentOf(_w, _y)  DLs allow us to make further inferences – that andy is a Child, i.e., they realize:  Child(x)  ( 9 z) ParentOf(z,x)  Son(y)  ( 9 w) Male(y) Æ ParentOf(w,y)

17 Syntax and Semantics  Build variable-free composite terms from atoms using term constructors (e.g., at-most, all)  COURSE and at-most(10, takers) and all (takers, GRADS)  (:and COURSE (:at-most 10 takers) (:all takers GRADS)  COURSE \ · 10 takers \ 8 takers:GRADS  Can be expressed in FOPC:  COURSE(a) Æ ( 9 x 1 … x 10 ) takers(a,x 1 ) Æ … Æ takers(a, x 10 ) Æ (x 1 ≠ x 2 Æ x 2 ≠ x 3 Æ … Æ x 9 ≠ x 10 ) Æ takers µ GRADS

18 Questions for DLs  Is a description D consistent and coherent?  Not if the instance is empty for every possible relational structure  Are D and D’ mutually disjoint?  Yes if D I [ D’ I = ; for every I  Are D and D’ equivalent?  Yes if D I = D’ I for every I  Does D subsume some other description D’?  Yes if for every relational structure I, D I subsumes D’ I  Inconsistency: and(C,D)  NOTHING  Equivalence: D subsumes D’, D’ subsumes D

19 DL Example  class STUDENT is-a PERSON with  studNumber: int, key; level: {1,2,3,4}  and(PERSON, all(studNumber, INTEGER), at-least(1,studNumber),at- most(1,studNumber), all(level, one-of(1,2,3,4)), at-least(1,level),at- most(1,level)  at-most(1, compose(studNumber, inverse(studNumber))  ENROLLMENT := and( all(st,STUDENT) at-least(1,st) at-most(1,st) all(crs,COURSE) at-least(1,crs) at-most(1,crs) all(when,DATE) at-least(1,when) at-most(1,when))  STUDENT := and( all(inverse(st), ENROLLMENT) at-least(1, inverse(st)) at-most(6, inverse(st))  COURSE := and( all(inverse(crs), ENROLLMENT) at-least(1, inverse(crs)) at-most(300,inverse(crs)))  INSERT-IN(Cs431, COURSE). FILL-WITH(Cs431,taughtBy,Einstein). FILL-WITH(Cs431,takers,Anna)

20 More on DLs  We can have both primitive classes (equivalent to extensional relations) and virtual ones  But we can make assertions over virtual classes that directly impact the primitive ones  Contrast with updates to views in databases  Many different levels of expressiveness in different DLs  Comparison with Datalog:  Both are subsets of FOL, with some limitations  DLs allow bidirectional inference; Datalog is unidirectional  DLs are equivalent to at most FOL with <= 3 variables; Datalog has an unbounded number of existential variables

21 Coming Back to the SW  Lots of work on OWL, the Web Ontology Language  Based on different levels of DLs:  OWL Lite – classification hierarchy, simple constraints (cardinalities 0 or 1)  OWL DL – maximum expressiveness, computational completeness (always decidable and terminating)  OWL Full – no computational guarantees, allows classes as instances of other classes  Goal: each community builds an ontology  But how to relate ontologies?  “equivalentClass”, “equivalentProperty”, “sameAs”  Is this enough???  (More on this next time…)