Presentation is loading. Please wait.

Presentation is loading. Please wait.

1cs236607. The Need “Most of the Web's content today is designed for humans to read, not for computer programs to manipulate meaningfully.” Berners-Lee,

Similar presentations


Presentation on theme: "1cs236607. The Need “Most of the Web's content today is designed for humans to read, not for computer programs to manipulate meaningfully.” Berners-Lee,"— Presentation transcript:

1 1cs236607

2 The Need “Most of the Web's content today is designed for humans to read, not for computer programs to manipulate meaningfully.” Berners-Lee, T, Hendler, J & Lassila, O ‘The semantic web’, Scientific American, May 2001 2cs236607

3 Semantic Processing We want to be able to pose complex search tasks that use the semantics of pieces of information, e.g., I want to purchase a DVD of “Dore the Explorer” at a price lower than 10$. Is such a CD available at amazon.com? 3cs236607

4 Current search agents are not suitable for such task 4cs236607

5 Current Solution Use “intelligent” agents The Semantic-Web Approach Content is machine-understandable by being bound to some formal description of itself (i.e. metadata) 5cs236607

6 Goals Web of data - provides common data representation framework to facilitate integrating multiple sources to draw new conclusions Increase the utility of information by connecting it to its definitions and to its context More efficient information access and analysis cs2366076

7 Applications Agents that search the Web and retrieve valuable information to the end user Web services that publish their information Programs that try to integrate data of different web services and to produce new results or draw new conclusions from the integrated data cs2366077

8 Ontologies & Inference Engines “For the semantic web to function, computers must have access to structured collections of information and sets of inference rules that they can use to conduct automated reasoning.” Berners-Lee, T, Hendler, J & Lassila, O ‘The semantic web’, Scientific American, May 2001 cs2366078

9 The Four Building Blocks 1. XML 2. RDF 3. Ontologies 4. Agents cs2366079

10 XML “XML allows users to add arbitrary structure to their documents but says nothing about what the structures mean” cs23660710

11 RDF –Resource Description Framework Meaning encoded in sets of ‘triples’: entities have properties which have values Entities, properties and values all have distinct URIs cs23660711 “imagine that we have access to a variety of databases with information about people, including their addresses. If we want to find people living in a specific zip code, we need to know which fields in each database represent names and which represent zip codes. RDF can specify that "(field 5 in database A) (is a field of type) (zip code)," using URIs rather than phrases for each term. ” Berners-Lee, T, Hendler, J & Lassila, O ‘The semantic web’, Scientific American, May 2001 a

12 Ontologies Database A and Database B may use different fields to contain ‘zip code’ Ontologies sort this out Ontology = ‘a document or file that formally defines the relations among terms’ Ontologies for the web normally have A taxonomy A set of inference rules cs23660712

13 Agents “Agent based computing appears to be the appropriate paradigm to work in a complex world with multiple ontologies, fragments and multiple inferencing engines.” Stork, Hans-Georg and Mastroddi, Franco, Semantic Web Technologies - a New Action Line in the European Commission’s IST Programme, 2001 cs23660713

14 The Power of Agents - Integration “The real power of the Semantic Web will be realized when people create many programs that collect Web content from diverse sources, process the information and exchange the results with other programs. The effectiveness of such software agents will increase exponentially as more machine-readable Web content and automated services (including other agents) become available.” Berners-Lee, T, Hendler, J & Lassila, O ‘The semantic web’, Scientific American, May 2001 cs23660714

15 ‘Ambient Intelligence’ “In the next step, the Semantic Web will break out of the virtual realm and extend into our physical world. URIs can point to anything, including physical entities, which means we can use the RDF language to describe devices such as cell phones and TVs.” Berners-Lee, T, Hendler, J & Lassila, O ‘The semantic web’, Scientific American, May 2001 cs23660715

16 Resource Description Framework cs23660716

17 17 What is RDF? A part of the semantic-Web activity RDF is a general-purpose language for representing information on the Web Specifically, objects and relationships Designed to allow computer applications to process data based on its semantics Rather than displaying data to humans (as opposed to RSS) An RDF document is actually a labeled graph that is represented in XML The specific language is called RDF/XML W3C recommendation (Feb. 2004)

18 RDF Data Consists of Triplets RDF data is a set of statements Each statement is a triplet (Resource, Property, Value) Sometimes we refer to a triplet using the terminology of (Subject, Predicate, Object) cs23660718 The author of http://www.cs.technion.ac.il/kanza/myPage.html is Yaron Kanza http://www.cs.technion.ac.il/kanza/myPage.html Resource (Subject): myPage.html Property (Predicate): author Value (Object): Yaron Kanza

19 19 RDF Data Subject Object predicate The basic element: Triple (labeled edge) Person#845 #1002 address postalCode 6941 Haifa city Herzel street RDF document: edge- labeled graph Statement

20 20 The XML Syntax of RDF page.html John Smith John’s Home Page DC:Creator DC:Title John Smith John’s Home Page

21 21 Structured Values page.html John Smith John’s Home Page js@corp.com dc:Title dc:Creator NameEmail... John Smith js@corp.com John’s Home Page

22 Containers Groups of things: : unordered list; duplicates allowed : ordered list; duplicates allowed : list of alternatives; one will be selected cs23660722

23 23 A set of fifteen basic properties for describing generalized Web resources The “obvious” mapping of Dublin Core properties into RDF properties has not yet been approved by the Dublin Core initiative, but is generally a good example Dublin Core

24 24 “Title”: the name given to the resource “Creator”: the person or organization primarily responsible for the resource “Subject”: what the resource is about “Description”: a description of the content “Publisher”: the person or organization responsible for making the resource available “Contributor”: someone who has provided content to the resource other than the creator “Date”: date of creation or publication Dublin Core

25 25 “Type”: type of resource, such as home page, technical report, novel, photograph… “Format”: data format of the resource “Identifier”: URL, ISBN number, … “Source”: another resource that this resource is derived from “Language”: the language of the content “Relation”: another resource and its relationship to this one “Coverage”: the portion of time or space described by this resource (atlases, histories, etc.) “Rights”: the intellectual property rights adhering to this resource, or a pointer to them Dublin Core

26 26 Containers: bags, sequences, alternatives aboutEach, aboutEachPrefix Reification (higher order statements) Namespaces and Vocabularies Advanced RDF

27 27 Manually from HTML or “user domain XML” With special assisting tools – like Protégé, Reggie, DC- dot, RDF for XML Ideally – with some automated procedure from HTML/XML documents Can we use XSLT there? Creating RDF documents

28 cs23660728

29 RDF Schema RDF Schema (RDFS) enriches the data model of RDF, adding vocabulary and associated semantics for Classes and subclasses Properties and sub-properties Typing of properties Support for describing simple ontologies Adds an object-oriented flavor But with a logic-oriented approach and using “open world” semantics cs23660729

30 30 Not an XML Schema! A “companion” specification for RDF spec Class, Type, subClassOf, domain, range Misc: label, comment, isDefinedBy,etc. RDF Schema

31 cs23660731 <rdf:RDF xmlns:rdf= "http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xml:base= "http://www.animals.fake/animals#"> <rdf:type rdf:resource="http://www.w3.org/2000/01/rdf-schema#Class"/> <rdf:type rdf:resource="http://www.w3.org/2000/01/rdf-schema#Class"/> Horse is defined as subclass of animal Example

32 <rdf:RDF xmlns:rdf= "http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xml:base= "http://www.animals.fake/animals#"> cs23660732 Abbreviated version. Works because an RDFS class is an RDF resource. Use rdfs:Class instead of rdfDescription and drop the rdf:type information

33 RDF and RDF Schema John Smith <rdfs:subclassOf rdf:resource= “http://schema.org/gen#Person”> u:Chair John Smith rdf:type g:name g:Person g:name rdfs:Classrdfs:Property rdf:type rdfs:subclassOf rdfs:domain

34 cs23660734

35 RDF Schema is Limited We cannot express facts such as Two classes are disjoint Build a class that is the union of two classes Cardinality restriction Scope of properties Provide relationships between properties, such as transitive, unique, inverse cs23660735

36 OWL A Web ontology language that is more expressive than RDF and RDF Schema Written in XML on top of RDF Using OWL we want to provide exact descriptions of items and the relationships between them Basically, built upon Description Logics cs23660736

37 SPARQL Protocol and RDF Query Language cs23660737

38 SPARQL SPARQL = Query Language + Protocol + XML Results Format Access and query RDF graphs Product of the RDF Data Access Working Group We will only provide some examples and will not go over the entire definition of the language cs23660738

39 39 SPARQL Query PREFIX dc: SELECT ?title2 WHERE { ?doc dc:title "SPARQL at speed". ?doc dc:creator ?c. ?docOther dc:creator ?c. ?docOther dc:title ?title2 } On an abstracts/papers database: “Find other papers by the authors of a given paper.”

40 40 SPARQL Query PREFIX dc: PREFIX foaf: PREFIX shop: SELECT ?title WHERE { ?doc dc:title ?title. FILTER regex(?title, "SPARQL"). ?doc dc:creator ?c. ?c foaf:name ?name. OPTIONAL { ?doc shop:price ?price } } “Find books with ‘SPARQL’ in the title. Get the authors’ name and the price (if available).” Multiple vocabularies

41 41 Inference An RDF graph may be backed by inference −OWL, RDFS, application, rules PREFIX rdf: SELECT ?type WHERE { ?x rdf:type ?type. } :x rdf:type :C. :C rdfs:subClassOf :D. -------- | type | ======== | :C | | :D | --------

42 42 Another Example PREFIX dc: PREFIX ldap: PREFIX foaf: SELECT ?name ?email { ?doc dc:title ?title. FILTER regex(?title, “SPARQL”). ?doc dc:creator ?reseacher. ?researcher ldap:email ?email. ?researcher ldap:name ?name } “Find the name and email addresses of authors of a paper about SPARQL”

43 Links http://www.w3.org/RDF/ http://www.w3.org/TR/2004/REC-rdf-primer- 20040210/ http://www.w3.org/TR/2004/REC-rdf-primer- 20040210/ http://www.w3.org/TR/rdf-schema/ http://www.w3.org/TR/2004/REC-owl-ref-20040210/ cs23660743

44 SPARQL Links Jena: Java and.Net Semantic Web Framework http://jena.sourceforge.net/ SPARQL Query http://jena.sourceforge.net/ARQ SPARQL Protocol http://www.joseki.org SquirrelRDF: Access legacy SQL: http://jena.sourceforge.net/SquirrelRDF cs23660744


Download ppt "1cs236607. The Need “Most of the Web's content today is designed for humans to read, not for computer programs to manipulate meaningfully.” Berners-Lee,"

Similar presentations


Ads by Google