ICS-FORTH May 25, 2001 1 The Utility of XML Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Heraklion, May.

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
BAH DAML Tools XML To DAML Query Relevance Assessor DAML XSLT Adapter.
1 ICS-FORTH & Univ. of Crete SeLene November 15, 2002 A View Definition Language for the Semantic Web Maganaraki Aimilia.
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
CS570 Artificial Intelligence Semantic Web & Ontology 2
The Semantic Web. The Web Today Designed for Human to read Cannot express meaning Architecture: URL –Decentralized: Link structure Language: html.
Melbourne, October 13, Electronic Communication on Diverse Data - The Role of the oo CIDOC Reference Model - Martin Doerr (ICS-FORTH, Crete, Greece)
ICS-FORTH 1 May 22, 2001 Christos Georgis The extensible markup language: An introduction to XML What is a XML document ? How do we check its validity.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
INF201 Fall2010 Intro. to Info. Technologies Department of Informatics University at Albany – SUNY Original Source: w3schools.com Prepared by Xiao Liang,
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
RDF Kitty Turner. Current Situation there is hardly any metadata on the Web search engine sites do the equivalent of going through a library, reading.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Samad Paydar Web Technology Laboratory Computer Engineering Department Ferdowsi University of Mashhad 1389/11/20 An Introduction to the Semantic Web.
1 Technologies and Modelling Frameworks XML ontology RDF taxonomy OWL thesaurus Semantic Web.
WWW and Internet The Internet Creation of the Web Languages for document description Active web pages.
HTML, XML, PDF Pros and Cons.
Introduction to XML This material is based heavily on the tutorial by the same name at
The views expressed in this presentation are those of the presenter, not necessarily those of the IASB or IFRS Foundation. International Financial Reporting.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
ICS – FORTH, August 31, 2000 Why do we need an “Object Oriented Model” ? Martin Doerr Atlanta, August 31, 2000 Foundation for Research and Technology -
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
Semantic Web Technologies ufiekg-20-2 | data, schemas & applications | lecture 21 original presentation by: Dr Rob Stephens
The GNM-DMS a Document Management System for the Germanische Nationalmuseum Martin Doerr, ICS-Forth Siegfried Krause, GNM April 2004.
16-1 The World Wide Web The Web An infrastructure of distributed information combined with software that uses networks as a vehicle to exchange that information.
National Institute of Standards and Technology 1 Testing and Validating OAGi NDRs Puja Goyal Salifou Sidi Presented to OAGi April 30 th, 2008.
Practical RDF Chapter 1. RDF: An Introduction
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
Introduction to XML. XML - Connectivity is Key Need for customized page layout – e.g. filter to display only recent data Downloadable product comparisons.
XML BIS4430 – unit 10. XML Origins Extensible Markup Language (XML) 1998 Inspired by Standard Generalized Markup Language (SGML) and HTML. SGML defines.
A Z Approach in Validating ORA-SS Data Models Scott Uk-Jin Lee Jing Sun Gillian Dobbie Yuan Fang Li.
XML - Why: The HTML-Dilemma HTML, SGML, XML - How: Syntax, Concept, Language Elements Basics Well-formed XML-Documents (without DTD) Valid XML-Documents.
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
Avoid using attributes? Some of the problems using attributes: Attributes cannot contain multiple values (child elements can) Attributes are not easily.
XML 2nd EDITION Tutorial 1 Creating An Xml Document.
The LOM RDF binding – update Mikael Nilsson The Knowledge Management.
Mining Structured vs. Unstructured Data Where is the structure and where did the semantics go? Rahim Yaseen SAP Labs LLC.
R. Addie & S. Dekeyser XML for M&C / USQ ? What ? Why ? How ? When ?
COD Common Record & XML Paul Hill Senior Technical Advisor, Title IV Delivery SFA Schools Channel.
What it is and how it works
XML Design Goals 1.XML must be easily usable over the Internet 2.XML must support a wide variety of applications 3.XML must be compatible with SGML 4.It.
XML Engr. Faisal ur Rehman CE-105T Spring Definition XML-EXTENSIBLE MARKUP LANGUAGE: provides a format for describing data. Facilitates the Precise.
Metadata Schema for CERIF Andrei Lopatenko Vienna University of Technology
Oreste Signore- Quality/1 Amman, December 2006 Standards for quality of cultural websites Ministerial NEtwoRk for Valorising Activities in digitisation.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Working with Ontologies Introduction to DOGMA and related research.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Managing Semi-Structured Data. Is the web a database?
Objective: To describe the evolution of the Internet and the Web. Explain the need for web standards. Describe universal design. Identify benefits of accessible.
JavaScript 101 Introduction to Programming. Topics What is programming? The common elements found in most programming languages Introduction to JavaScript.
From XML to DAML – giving meaning to the World Wide Web Katia Sycara The Robotics Institute
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Information Architecture & Design Week 9 Schedule - Web Research Papers Due Now - Questions about Metaphors and Icons with Labels - Design 2- the Web -
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Quality Assurance, Ontology Engineering, and Semantic Interoperability.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
XML QUESTIONS AND ANSWERS
CDF for Voting Systems: Human Factors Issues
Knowledge Management Systems
Presentation transcript:

ICS-FORTH May 25, The Utility of XML Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Heraklion, May 25, 2001 Center for Cultural Informatics

ICS-FORTH May 25, XML is XML is a compromise between databases and free texts It takes the better from both sides without being perfect on either side. It is readable. It allows to disambiguate meaning. It is simple. It is rich enough to open a new systems paradigm.

ICS-FORTH May 25, What is a Document ?  A composite statement : a unit relating known facts, items and categories with new knowledge - linguistic or by other media.  It has an inner logic: the pure rendered knowledge, independent from language and form.  It has a meaningful structure: The sequence, arrangement or linking used to render the inner logic.  It has a presentation: Structure and style to assist perception and impression

ICS-FORTH May 25, A document

ICS-FORTH May 25, The statements…. Diego Velasquez is Spanish. Diego Velasquez lived Diego Velasquez painted “Juan de Pareja”. “Juan de Pareja” is a painting. “Juan de Pareja” has dimension 81,3X69,9cm Juan de Pareja is Moorish. Juan de Pareja is a painter. Philipp IV sent Velazquez to Italy. …..

ICS-FORTH May 25, Another document

ICS-FORTH May 25, What’s Wrong with HTML MONET, Claude Haystacks at Chailly at Sunrise 1865 Oil on canvas 30 x 60 cm (11 7/8 x 23 3/4 in.) San Diego Museum of Art  If written properly, normal HTML may reflect document presentation, but it cannot adequately represent the semantics & structure of data Artist Name Date Artifact Title Dimensions Material Museum Image Reference

ICS-FORTH May 25, User Problems/ Design Reasons  Preserving info units: who said that / self-contained  Entering data:  what can I say,  what should I say,  how can I say it.  Rendering data: how to tell my child, the public…  Accessing data: querying, mediation  Reusing data: transmission to other environments, merging, evolution of local system, preservation for future use.

ICS-FORTH May 25, In Technical Terms  Transformation under preservation of meaning  Correct adaptation of presentation without knowing meaning  Packaging information for presentation – “1 document”  Sequencing categories for data input.  Interpretation of intended meaning - searching  Automatic relating of common meaning – merging of different statements

ICS-FORTH May 25, What’s wrong with  Free texts: Clear packaging, rendering for one target, not machine processable (poor querying, categories uncomprehensive), poorly reusable, no help to enter data, transform data..  HTML: Solves platform-independence of presentation, weak connection between meaning and presentation structure – not far better than free text.  Databases: Clear logical structure, categorization, machine processable, excellent querying, difficult presentation, transformation, merging, evolution, no information units  XML: Clear packaging, logical structure, machine processable if correctly used, clear separation and relation of meaningful structure and presentation. Helpful to enter data, easy to extend, transform, present. Can be queried, structure not independent from user view.

ICS-FORTH May 25, XML and databases  Databases:  Schema first: Prior to data, complete, inflexible analysis of all categories and their relations.  Table structures: indexes prepared, excellent consistency enforcement.  XML:  Data first; structure explanatory, can come second, need not be formalized, extensible, DTD’s can be combined  semi-structured: flexible, but reduced guarantee if a question can be answered, reduced consistency enforcement.  Embedded schema: each instance carries the schema it uses – querying by parsing without index structures – ideal transport format.

ICS-FORTH May 25, Data First, Embedded Schema  This document carries the interpretation with it. It is readable without knowledge of the schema. Claude Monet Haystacks at Chailly at Sunrise 1865 Oil on canvas /8 23 3/4 San Diego Museum of Art

ICS-FORTH May 25, What’s important  Data first: delayed analysis, preserves data.  Embedded schema: facilitates data transport, readable in the future.  Separation of semantics and presentation: enables information reuse.  Guides and controls data entry  Same meaning can be encoded in multiple formats:  DTD design depends on purpose: Transport, presentation, data entry…

ICS-FORTH May 25, Useful Applications  Prescription for documentation / input  Data transfer between systems (“middle ware”)  Document bases with full query access.  Combine database with XML documents: mission-critical data in tables and DTD, rich extensible structures in DTD only.  Create data for long-term use: even machine readable from paper!  Create information sets for multiple presentation

ICS-FORTH May 25, Final Remark  How to encode meaning without structure ambiguities: => use RDF/ RDFS  How to standardize meaning of element types (tags) ? => use ontologies – e.g. formulated in RDFS!