Representing Data with XML February 26, 2004 Neal Arthorne.

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

1 ICS-FORTH & Univ. of Crete SeLene November 15, 2002 A View Definition Language for the Semantic Web Maganaraki Aimilia.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
RDF Schemata (with apologies to the W3C, the plural is not ‘schemas’) CSCI 7818 – Web Technologies 14 November 2001 Van Lepthien.
The Semantic Web – WEEK 4: RDF
Managing XML and Semistructured Data Lecture 12: XML Schema Prof. Dan Suciu Spring 2001.
An Introduction to RDF(S) and a Quick Tour of OWL
CS570 Artificial Intelligence Semantic Web & Ontology 2
4 XML Schema.
1 Web Data Management XML Schema. 2 In this lecture XML Schemas Elements v. Types Regular expressions Expressive power Resources W3C Draft:
1 Semantic Web Technologies: The foundation for future enterprise systems Okech Odhiambo Knowledge Systems Research Group Strathmore University.
Ontology Notes are from:
1 An Introduction To The Semantic Web. 2 Information Access on the Web Find an mp3 of a song that was on the Billboard Top Ten that features a cowbell.
CSE 636 Data Integration XML Schema. 2 XML Schemas W3C Recommendation: Generalizes DTDs Uses XML syntax Two documents: structure.
The Semantic Web – WEEK 5: RDF Schema + Ontologies The “Layer Cake” Model – [From Rector & Horrocks Semantic Web cuurse]
1 CIS607, Fall 2005 Semantic Information Integration Instructor/Organizer: Dejing Dou Week 1 (Sept. 28)
More XML namespaces, DTDs CS 431 – February 16, 2005 Carl Lagoze – Cornell University.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
RDF Kitty Turner. Current Situation there is hardly any metadata on the Web search engine sites do the equivalent of going through a library, reading.
The Semantic Web Week 12 Term 1 Recap Lee McCluskey, room 2/07 Department of Computing And Mathematical Sciences Module Website:
From SHIQ and RDF to OWL: The Making of a Web Ontology Language
XML(EXtensible Markup Language). XML XML stands for EXtensible Markup Language. XML is a markup language much like HTML. XML was designed to describe.
A Really Brief Crash Course in Semantic Web Technologies Rocky Dunlap Spencer Rugaber Georgia Tech.
Nancy Ide Vassar College USA Resource Definition Framework A Tutorial EUROLAN 2003 July 28 - August 8 Bucharest - Romania.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
17 Apr 2002 XML Schema Andy Clark. What is it? A grammar definition language – Like DTDs but better Uses XML syntax – Defined by W3C Primary features.
RDF: Concepts and Abstract Syntax W3C Recommendation 10 February Michael Felderer Digital Enterprise.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
Semantic Web Technologies ufiekg-20-2 | data, schemas & applications | lecture 21 original presentation by: Dr Rob Stephens
Why XML ? Problems with HTML HTML design - HTML is intended for presentation of information as Web pages. - HTML contains a fixed set of markup tags. This.
Practical RDF Chapter 1. RDF: An Introduction
Dr. Azeddine Chikh IS446: Internet Software Development.
McGraw-Hill/Irwin © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Schemas Ellen Pearlman Eileen Mullin Programming the Web Using XML.
Okech Odhiambo Faculty of Information Technology Strathmore University
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
1 Representing Data with XML September 27, 2005 Shawn Henry with slides from Neal Arthorne.
OWL and SDD Dave Thau University of Kansas
Logics for Data and Knowledge Representation
Cornell CS 502 More XML XHTML, namespaces, DTDs CS 502 – Carl Lagoze – Cornell University.
RDF and OWL Developing Semantic Web Services by H. Peter Alesso and Craig F. Smith CMPT 455/826 - Week 6, Day Sept-Dec 2009 – w6d21.
The LOM RDF binding – update Mikael Nilsson The Knowledge Management.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Semantic Web - an introduction By Daniel Wu (danielwujr)
An OO schema language for XML SOX W3C Note 30 July 1999.
More XML namespaces, DTDs CS 431 – Carl Lagoze – Cornell University.
Sheet 1XML Technology in E-Commerce 2001Lecture 2 XML Technology in E-Commerce Lecture 2 Logical and Physical Structure, Validity, DTD, XML Schema.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
1 Artificial Intelligence Applications Institute Centre for Intelligent Systems and their Applications Stuart Aitken Artificial Intelligence Applications.
Artificial Intelligence 2004 Ontology
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
OIL and DAML+OIL: Ontology Languages for the Semantic Web Sungshin Lim TOWARDS THE SEMANTIC WEB: Ontology-driven Knowledge.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
Primer on XML Schema CSE 544 April, XML Schemas Generalizes DTDs Uses XML syntax Two parts: structure and datatypes Very complex –criticized –alternative.
Introduction to XML Schema John Arnett, MSc Standards Modeller Information and Statistics Division NHSScotland Tel: (x2073)
OWL & Protege Introduction Dongfang Xu Ph.D student, School of Information, University of Arizona Sept 10, 2015.
The Semantic Web Riccardo Rosati Dottorato in Ingegneria Informatica Sapienza Università di Roma a.a. 2006/07.
ELIS – Multimedia Lab PREMIS OWL Sam Coppens Multimedia Lab Department of Electronics and Information Systems Faculty of Engineering Ghent University.
PRACTICAL KNOWLEDGE REPRESENTATION FOR THE WEB Frank van Harmelen Dieter Fensel AIFB Kim Kangil Structural Complexity Laboratory.
1cs The Need “Most of the Web's content today is designed for humans to read, not for computer programs to manipulate meaningfully.” Berners-Lee,
CS621 : Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 12 RDF, OWL, Minimax.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lotzi Bölöni.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
Motivation Dynamically identify and understand information sources Provide interoperability between agents in a semantic manner Enable distributed extensible.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
1 XML and XML in DLESE Katy Ginger November 2003.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
Presentation transcript:

Representing Data with XML February 26, 2004 Neal Arthorne

Data Representation Design goals for data representation:  Portable (platform independent)  Easy for machines to process  Human legible  Flexible and usable over the Internet and other networks  Concisely defined with formal rules

Extensible Markup Language World Wide Web Consortium (W3C) defines the Extensible Markup Language (XML)  W3C also defined HTML, CSS, HTTP, SVG and other markup languages  XML Working group formed in 1996  XML 1.0 (Third Edition) 4 February 2004 (original Recommendation in 1998)

XML Example Queen Victoria Jubilee issue - 5 cents Two portraits of Queen Victoria 1837 and The portrait on the left... Canada x 5¢ Lyndwode Charles Pereira, Peleg Franklin Brownell ,000 American Bank Note Company, Ottawa file://test/stamps/ /Jubilee _cents.jpg Prolog Element Attribute Entity reference

XML XML documents should be well-formed (syntax, closing tags etc) XML documents are “valid” if they are validated with a Document Type Definition (DTD) DTDs provide a grammar for the XML by defining elements, attributes and entities

XML Advantages XML provides:  Logical structure for data in a textual representation  Formal rules for validating documents  Flexibility to define your own markup language  Portability across networks and platforms  Becoming a widely accepted data interchange format  Processed with off-the-shelf tools

XML Disadvantages XML drawbacks:  Not a binary format so it requires a lot of overhead for a little bit of data  Very little support for binary or mixed media data formats (hex or base64 encoding)  Only for data and holds no semantics or reasoning DTDs do not provide:  Data types for each element or attribute  Complex structural rules for documents

XML Schema XML Schema defines a new schema language to replace DTD Standardized by W3C in 2001 Provides data typing and logical structure Written in XML (easy to process) Higher complexity than DTD

XML Schema Example Element nameData type Attribute nameData type An XML document is an ‘instance document’ of an XML Schema

Simple Types Simple Types are of three varieties:  Atomic: Built-in or derived, e.g.  List: multiple items of the same type  Union: Union or two or more Simple Types

Built-in Types XML Schema defines numerous built-in types:  integer, decimal, token, byte, boolean, date, time, short, long, float, anyURI, language Derived types use facets to restrict existing types:  min/maxInclusive, min/maxExclusive, pattern, enumeration, min/maxLength, length, totalDigits, fractionDigits

Complex Types Complex Types define logical structures with attributes and nested elements They use a sequence, choice or all containing elements that use Simple Types or other Complex Types May reference types defined elsewhere in the schema or imported using import statement

In the Schema of Things XML Schema supersedes DTD Defines a typed data format with no semantics or relations between data Next step: higher level of abstraction and the ability to define objects and relations

Resource Description Framework W3C standard for describing resources on the World Wide Web (1999, revised 2004) Generalized to identify objects that may not be retrievable on the Web Objects identified by Uniform Resource Identifiers (URIs) RDF represented by a directed graph and in XML syntax

RDF Example In English: has the full name Federico Diaz and has an employer called Fisher and Sons. Federico Diaz

RDF Parts Each RDF statement is a triple containing a subject (identifier by URI), a predicate (e.g. creator, title, full name) and an object An object can be either a literal value (e.g. Federico Diaz) or another RDF resource All three parts can be identified with an URI and fragment identifier #

RDF Semantics RDF attaches no specific meaning to RDF statements – just like the name of a database field is meaningless to an SQL engine RDF does provide a way to attach data types to literal values, but RDF does not define data types Generally RDF software uses the XML Schema data types  10 Arbitrary XML can also be used as a literal  10

RDF Schema RDF Schema is a ‘vocabulary description language’ that relates resources to each other using RDF RDFS uses ‘classes’ of objects like in Object-Oriented (OO) systems Class properties relate to other classes using OO concepts such as generalization

RDF Schema Use Differs from OO in that Properties are defined in terms of the resources to which they apply (their domain) – they are not restricted to the scope of a single class  domain: Classes to which a Property applies  range: The Class of a Property (i.e. type) Allows new Properties to be created that apply to the same domain without redefining the domain

RDFS Classes Classes introduced by RDFS:  Resource - top level class  Literal – all literal values like text strings  Class – the class of all classes  Datatype – top level RDF datatype Properties introduced by RDFS:  subClassOf  subPropertyOf  domain – domain of a Property  range – range of a Property  label, comment, seeAlso – human readable labels inheritance

RDF/RDFS Lets authors create vocabularies of Classes and Properties and show how the terms should be used to describe resources, e.g.  Property ‘author’ applies to class ‘Book’  Class ‘Employee’ is a subclass of ‘Person’ Does not define descriptive properties such as ‘dateOfIssue’ or ‘title’ but references them using URIs Like in XML/XML Schema, an RDF instance document can be validated against its RDF Schema

Machines Understanding the Web RDF/RDFS along with XML/XML Schema provide a means to describe resources on the web with basic generalization For a higher conceptual level, applications require semantic information Ontologies serve as a starting point for understanding

Ontologies on the Web “Ontologies define the terms used to represent an area of knowledge.” – OWL Use Cases & Requirements, 2004 Example use cases:  A web portal that needs to classify information  Multimedia archive that requires a taxonomy of media or content-specific properties  Corporate portal website that integrates vocabularies from different departments

Web Ontology Language (OWL) Supersedes DAML+OIL  DARPA Agent Markup Language (DAML) was based on RDF/RDFS and includes much of what is now OWL Adds terms used to better describe relations between classes of RDF resources With OWL ontologies can be integrated, extended and shared

OWL Features OWL improvements on RDF/RDFS:  Cardinality min/maxCardinality for Properties with respect to a Class  Equality, disjointness equivalentClass, equivalentProperty, sameAs, differentFrom, disjointWith  Transitive, Symmetric, Functional Properties labelling a Property allows for reasoning  A has B and B has C implies A has C (Transitive)  A has B implies B has A (Symmetric)

OWL Features con’t  Boolean expressions of Class relations unionOf, complementOf, intersectionOf  Property restrictions Limits how properties can be used by an instance of a class  Versioning priorVersion, versionInfo, incompatibleWith, backwardCompatibleWith

Conclusion XMLXML Schema RDFRDF Schema OWL Unicode/ISO byte streams Data formatting and data types Machine data representation Resource description and vocabulary Knowledge processing and reasoning ??? Conceptual level reasoning – ‘smart’ applications Knowledge Data

References World Wide Web Consortium XML XML Schema Part 0: Primer RDF Primer RDF Concepts RDF/XML Syntax RDF Schema OWL Use Cases & Requirements OWL Overview