XML : a brief introduction Managing networks : understanding new technologies, Birmingham, 13 September 2001 Pete Johnston UKOLN, University of Bath Bath,

Presentation transcript:

XML : a brief introduction
Managing networks : understanding new technologies, Birmingham, 13 September 2001
Pete Johnston
UKOLN, University of Bath
Bath, BA2 7AY

XML: a brief introduction
  – Markup & markup languages
  – SGML & XML
  – Two perspectives on XML
  – Some features of XML
  – XML & HTML
  – Uses of XML

Markup & markup languages
Markup
  – text added to the data content of a document in order to convey information about data
  – markup pre-dates computers!
Marked-up document contains
  – data and
  – information about that data (markup)

Markup & markup languages
Markup language
  – formalised system for providing markup
Definition of markup language specifies
  – what markup is allowed
  – how markup is distinguished from data
  – what markup means

Exercise 1
From your own experience, can you suggest
  – some instances of where markup is used?
  – some examples of markup languages?

SGML and XML
Standard Generalized Markup Language
  – ISO 8879 : 1986
  – General, flexible, powerful
  – Used (mainly but not exclusively) in large publishing environments
Extensible Markup Language
  – Recommendation of W3C, 1998, 2000
  – Subset of SGML
  – Less flexible; easier to implement, use
  – Used (increasingly) everywhere…often invisibly…
Define means of describing tree-structured data in text format, using markup embedded in data

SGML and XML
  – not strictly markup languages!
  – “meta-languages” - languages for describing markup languages
  – can define unlimited number of markup languages
All conforming languages can be processed by single program (“parser”)
Rules made public so any programmer can write parser
Many parsers available for application developer
Data independent of platform, vendor

A document perspective (1)
Individual documents have structure
  – component parts
  – relationships between parts
Physical structure
  – depends on medium
Logical structure
  – hierarchical, tree structure
  – independent of physical rendition
Document types
  – set of documents sharing common logical structural model

A document perspective (2)
Logical structure communicated to human reader through presentational conventions
Presentation defined by “procedural” markup
  – instructs “agent” what to do with text
  – e.g. how to format it
Problems
  – markup specific to processing system
  – specific to delivery medium
  – human interprets logical structure but software can’t

A document perspective (3)
Descriptive markup
  – identifies the logical components of a document
  – does not specify what procedures are to be applied to text
  – so e.g. how to format it must be specified separately
Benefits
  – markup (potentially) independent of processing system
  – permits reuse and delivery to multiple media
  – makes logical structure available to software
N.B. exchange requires consensus on what markup means!

Exercise 2
HTML
  – conceived as describing the logical structure of hypertext document
  – acquired features which described presentation
  – extended by browser vendors
In the HTML examples, can you see
  – where markup describes presentation?
  – where markup describes logical structure?

A data perspective (1)
The structured document is just one type of structured data
Other types of structured data can be represented as tree-structures
A “serialization” syntax is useful for various sorts of structured data (relational, object etc.)
  – for exchange between application programs on different platforms, across networks etc.
SGML too complex, “heavyweight” - but XML ideal
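The idea of XML as a serialization syntax for non-document data can be sketched in Python with the standard-library `xml.etree.ElementTree` module. This is only an illustration under stated assumptions: the rows, the element names (`presentations`, `presentation`) and the helper `rows_to_xml` are hypothetical and do not come from the slides.

```python
import xml.etree.ElementTree as ET

# Hypothetical rows, as they might come back from a relational query.
rows = [
    {"id": "1", "title": "XML: a brief introduction", "year": "2001"},
    {"id": "2", "title": "RDF, XML and interoperability", "year": "2001"},
]


def rows_to_xml(rows, root_name="presentations", row_name="presentation"):
    """Serialize a list of flat dicts as a tree of XML elements (text format)."""
    root = ET.Element(root_name)
    for row in rows:
        item = ET.SubElement(root, row_name)
        for column, value in row.items():
            # One child element per column; the value becomes character data.
            ET.SubElement(item, column).text = value
    return ET.tostring(root, encoding="unicode")


xml_text = rows_to_xml(rows)

# A receiving application, possibly on another platform, can rebuild
# the same rows from the text alone.
received = [
    {field.tag: field.text for field in item}
    for item in ET.fromstring(xml_text)
]
```

Here `xml_text` is the only thing exchanged between sender and receiver, which is the sense in which the data becomes independent of the program that produced it.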

A data perspective (2)
A “document” might be any collection of information processed as a unit
  – a report
  – a patient record
  – a purchase order transaction
  – a configuration file for an operating system
  – some “structured information about a resource” (a metadata record)
  – …
  – etc!
Applications less concerned with publishing, formatting, presentation

XML : elements
XML uses embedded tags to delimit and label parts of document
  – tags
Elements
  – containers delimited by tags which include element type name
  – start tag
  – end tag
Elements may contain
  – character data
  – other elements
  – both of the above
  – nothing (empty elements)
Document element as root of element tree
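The four kinds of element content listed above can be seen in a small sketch using Python's standard-library `xml.etree.ElementTree`. The sample document and its element type names (`memo`, `to`, `body`, `emphasis`, `attachment`) are invented for illustration, as is the helper `describe`.

```python
import xml.etree.ElementTree as ET

# Hypothetical document: element type names are made up for this example.
document = """<memo>
  <to>All staff</to>
  <body>The meeting is <emphasis>cancelled</emphasis> today.</body>
  <attachment/>
</memo>"""

root = ET.fromstring(document)  # the document element, root of the element tree


def describe(element):
    """Classify what an element contains, ignoring whitespace-only text."""
    has_children = len(element) > 0
    has_text = bool((element.text or "").strip()) or any(
        (child.tail or "").strip() for child in element
    )
    if has_text and has_children:
        return "both"
    if has_children:
        return "other elements"
    if has_text:
        return "character data"
    return "empty"


summary = {child.tag: describe(child) for child in root}
```

In this sketch `memo` contains only other elements, `to` contains only character data, `body` contains both, and `attachment` is an empty element written with the `<attachment/>` shorthand.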

XML : attributes
Attributes
  – pairs of names and values
  – occur inside element start tag, after element type name
Element can contain only one occurrence of each attribute
Attribute values may contain
  – character data only
Attribute values must be surrounded by quotes
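A brief sketch of these attribute rules, again using Python's standard-library `xml.etree.ElementTree`. The element type `item` and the attribute names `code` and `lang` are hypothetical, and `parses` is a helper defined only for this example.

```python
import xml.etree.ElementTree as ET

# Attributes are name/value pairs inside the start tag, values in quotes.
ok = '<item code="A12" lang="en">Widget</item>'
item = ET.fromstring(ok)
attributes = item.attrib  # a mapping of attribute names to values


def parses(text):
    """Return True if the parser accepts text as well-formed XML."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


# The same attribute may not occur twice on one element ...
duplicate = '<item code="A12" code="B34">Widget</item>'
# ... and attribute values must be quoted.
unquoted = "<item code=A12>Widget</item>"
```

A parser rejects both `duplicate` and `unquoted` outright rather than guessing at the author's intent, which is a notable difference from the permissive behaviour of HTML browsers.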

XML : elements & attributes
Nouns and adjectives?
  – use character data for “content”
  – use attributes for “information about content”
Document-centric view?
No hard and fast rules
Design decisions tend to be based (wrongly!) on behaviour of tools
XML documents are human-readable…
… but ease of human-readability may not be the most important consideration in their design

XML : document types & vocabularies
“XML lets me make up names for element types! Great!”
But…
  – XML says nothing about what your names mean
  – will a human recipient of your document recognise your element?
  – will a software agent process your element correctly?
Communication requires consensus on
  – structural model of class of document/data
  – labelling of components
  – semantics of components
Shared use of common XML “vocabularies”

XML : DTDs, XML Schemas
Two methods to codify syntax rules of vocabulary used to describe document type
  – what markup is allowed
  – structural constraints on use of markup
  – say nothing about what markup means
Document Type Definition (DTD)
  – inherited from SGML
  – part of XML Recommendation
XML Schema
  – recent recommendation of W3C
  – support for data-typing i.e. tighter control on element content
  – support for combining vocabularies
  – use XML syntax

XML : Validation & well-formedness
Validation
  – parser can check markup of individual document against rules expressed in DTD or Schema
  – authoring tool can enforce rules of DTD/Schema while document is edited
Well-formed documents
  – not checked against DTD/Schema, but do follow basic syntax rules e.g.
      – all tags use proper delimiters
      – all elements have start and end tags
      – all elements properly nested
      – attribute values in quotes
      – appropriate use of special characters
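The well-formedness rules above can be checked mechanically. The following is a minimal sketch in Python using the standard-library `xml.etree.ElementTree` parser; the helper `well_formedness_error` and the sample documents are invented for illustration. It tests well-formedness only: this parser does not validate a document against a DTD or Schema, which would need a separate validating tool.

```python
import xml.etree.ElementTree as ET


def well_formedness_error(text):
    """Return None if text is well-formed XML, else the parser's error message."""
    try:
        ET.fromstring(text)
    except ET.ParseError as err:
        return str(err)
    return None


# Hypothetical samples, one per rule from the list above.
samples = {
    "well-formed": "<note><to>Ann</to><body>Fish &amp; chips</body></note>",
    "missing end tag": "<note><to>Ann</note>",
    "badly nested": "<note><b><i>text</b></i></note>",
    "unquoted attribute": "<note priority=high>text</note>",
    "bare ampersand": "<note>Fish & chips</note>",
}

# True where the parser accepted the sample, False where it reported an error.
results = {
    label: well_formedness_error(text) is None
    for label, text in samples.items()
}
```

Only the first sample is accepted. The special character `&` has to be written as `&amp;` in character data, which is what "appropriate use of special characters" amounts to in practice.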

Exercise 3
Well-formedness
  – Identify the errors which mean that the three examples are not well-formed XML
  – How would you correct the errors?

XML : namespaces (1)
Applications wish to use elements from multiple vocabularies (DTDs/Schemas)
  – particularly true of metadata applications
Problems of “name collisions”
  – in GPs Directory Schema
  – in MPs Appointments Schema
XML Namespaces
  – recommendation of W3C
  – provides universal naming mechanism
A Namespace is a collection of names
A Namespace is itself given a name, which has the form of a URI

XML : namespaces (2)
Element type names and attribute names can be qualified by a namespace name (a URI)
Association with namespace through use of a namespace prefix
Declaration of namespace
  – xmlns:health=“…”
  – xmlns:parl=“…”
Use of qualified name
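How qualified names resolve a name collision can be sketched with Python's standard-library `xml.etree.ElementTree`. The slide's own namespace URIs and element names are not preserved in this transcript, so everything here is an assumption made for illustration: the `http://example.org/...` URIs, the colliding element name `surgery`, the wrapper element `diary`, and the sample text.

```python
import xml.etree.ElementTree as ET

# Hypothetical namespace URIs and element names, chosen only to show a collision.
HEALTH = "http://example.org/ns/health"
PARL = "http://example.org/ns/parliament"

document = f"""<diary xmlns:health="{HEALTH}" xmlns:parl="{PARL}">
  <health:surgery>Mon 09:00, health centre</health:surgery>
  <parl:surgery>Fri 18:00, constituency office</parl:surgery>
</diary>"""

root = ET.fromstring(document)

# The prefixes used for lookup need not match those in the document:
# it is the namespace URI, not the prefix, that identifies the name.
prefixes = {"h": HEALTH, "p": PARL}
gp_surgery = root.find("h:surgery", prefixes)
mp_surgery = root.find("p:surgery", prefixes)

# ElementTree exposes each qualified name as "{namespace-URI}local-name".
expanded_names = [child.tag for child in root]
```

Both elements share the local name `surgery`, but their expanded names differ, so an application can tell the two vocabularies apart.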

XML and HTML
HyperText Markup Language (HTML)
  – recommendation of W3C (version 4.01)
  – designed as an application of SGML (not XML)
  – simple, easy to create
  – (partial?) support in browsers, editors
  – mixes description of structure and presentation
Browsers
  – permissive – will display invalid HTML
  – support proprietary extensions
Context
  – explosion of Web
  – new devices

XML and HTML (2)
XHTML 1.0
  – expression of HTML 4.01 as XML (not SGML)
  – same features but restrictions on syntax
  – case sensitivity, XML well-formedness rules
  – current W3C recommendation for creation of docs for Web
XHTML 1.1
  – modularisation of XHTML
  – separation of structural markup from presentational markup
  – support for managing extensions

Uses of XML (1)
Data (and metadata) exchange
  – e-commerce
  – e-government
  – rights management
  – bibliographic data
  – news syndication
  – scientific data
  – health - patient records
  – (… plus hundreds more…)
  – Web services
Within systems and between systems
Many standards/protocols built on XML

Uses of XML (2)
Storage
  – publishing
  – scholarly texts
  – archival finding aids
  – document management
  – …
  – preservation

XML : summary (1)
Means of describing structured data in text format
Independent of platform, vendor
  – reuse of data
  – exchange of data
Used
  – for many types of structured data
  – in many different applications
  – both for storage and exchange
  – data may be stored in database, exposed as XML

XML : summary (2)
Use of XML
  – usually invisible to end-user
  – increasingly invisible to information manager?
  – generated and consumed by software
  – requires consensus amongst communication partners

Acknowledgements
UKOLN is funded by Resource: the Council for Museums, Archives and Libraries, the Joint Information Systems Committee (JISC) of the UK higher and further education funding councils, as well as by project funding from the JISC and the European Union. UKOLN also receives support from the University of Bath where it is based.