Updated 2003.11.21 10:02 Hong Kong University of Science & Technology Library Workshop on XML-Based Library Applications 4. XML Standards and Tools.


Outline
  XML and Related Standards
  XSLT
  OAI-PMH
  ZING
  XML Tools and Software
  XML Database Systems

XML and Related Standards (1)
Categories of XML and related standards:
  Family of standards that facilitate the use of XML documents, e.g. DOM, XPath, XSL, XSLT, XML Schema, XML Query, XPointer, XLink, XML Encryption, XML Signature, etc.
  Protocols that allow interchange of data and messages in XML format, e.g. SOAP, OAI-PMH, ZING’s SRW, etc.

XML and Related Standards (2)
  Standards that specify data and metadata in XML format, e.g. EAD, RDF, MARC XML, METS, EAC XML, TEI, MathML, NewsML, etc.

XSLT, OAI and ZING
  XSLT (for Transformation): XSL (eXtensible Stylesheet Language) Transformations
  OAI-PMH (for Harvesting): Open Archives Initiative Protocol for Metadata Harvesting
  ZING (for Searching): Z39.50 International: Next Generation

What is XSLT?
  XSL (eXtensible Stylesheet Language) Transformations.
  Developed by the W3C for transforming XML documents into other XML documents, using one or more XSLT stylesheets.
  An XSLT stylesheet is itself an XML document.
  It instructs an XSLT processor on how to transform the XML document.

XSLT Example (1)
Given an XML document:
  <name> Smith John </name>

XSLT Example (2)
And you want to transform it to the following XML document:
  John Smith

XSLT Example (3)
Fragment of an XSLT stylesheet showing how this transformation can be achieved:
  <person> </person></xsl:template>
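The markup of the example did not survive in this transcript, so the following is only a sketch of the same name reordering, written with Python's standard `xml.etree.ElementTree` module rather than with an XSLT processor. The child element names `last` and `first` are assumptions made for illustration; only `name` and `person` appear in the slides.

```python
import xml.etree.ElementTree as ET

# Assumed input document: the child element names "last" and "first"
# are hypothetical, since the original markup is not recoverable.
SOURCE = "<name><last>Smith</last><first>John</first></name>"


def transform(xml_text: str) -> str:
    """Reorder 'last, first' into a single <person> element."""
    name = ET.fromstring(xml_text)
    first = name.findtext("first", default="").strip()
    last = name.findtext("last", default="").strip()

    person = ET.Element("person")
    person.text = f"{first} {last}"
    return ET.tostring(person, encoding="unicode")


result = transform(SOURCE)
```

An XSLT stylesheet would express the same mapping declaratively, as a template that matches `name` and emits `person`; the Python function above does it procedurally.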

XPath
  To address parts of an XML document
  Used heavily in XSLT
  Examples (assume you have a book marked up as an XML document):
    //chapter
    /book/chapter/section[title = 'Conclusion']
    following-sibling::chapter[position()=1]
    count(//figure)
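The expressions above can be tried out, approximately, with Python's standard `xml.etree.ElementTree`, which supports only a subset of XPath. The sample book below is invented for illustration. `count()` and the `following-sibling` axis are not available in that subset, so the sketch emulates them in plain Python.

```python
import xml.etree.ElementTree as ET

# Invented sample document, used only to exercise the expressions.
BOOK = """
<book>
  <chapter id="c1">
    <section><title>Introduction</title><figure/></section>
    <section><title>Conclusion</title></section>
  </chapter>
  <chapter id="c2">
    <section><title>Methods</title><figure/><figure/></section>
  </chapter>
</book>
"""

book = ET.fromstring(BOOK)

# //chapter : every chapter element in the document
chapters = book.findall(".//chapter")

# /book/chapter/section[title = 'Conclusion'], relative to the <book> root
conclusions = book.findall("./chapter/section[title='Conclusion']")

# count(//figure) : emulated by counting the matches
figure_count = len(book.findall(".//figure"))


def next_chapter(root, chapter):
    """Emulate following-sibling::chapter[position()=1]."""
    siblings = root.findall("./chapter")
    index = siblings.index(chapter)
    return siblings[index + 1] if index + 1 < len(siblings) else None
```

A full XPath 1.0 engine, such as the one inside an XSLT processor, evaluates all four expressions directly.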

XSLT and Web (1)
  Use XSLT to transform an XML document to HTML (Web page).
  Unlike HTML documents, which mix content and page layout in the same document, with XML and XSLT, you can separate content from layout (presentation). Different views of the same content can be presented through the Web via XSLT transformation.

XSLT and Web (2)
By applying different XSLT stylesheets to the same XML file, completely different output can be generated:
  Most Wanted Person
  My Dearest Father
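The idea of one content file and several presentations can be sketched without an XSLT processor. In the Python snippet below, two string templates stand in for the two stylesheets; the `person` and `name` elements and the page layouts are assumptions made for illustration, not the workshop's actual files.

```python
import xml.etree.ElementTree as ET
from html import escape
from string import Template

# Hypothetical content file: the same XML feeds both views.
PERSON_XML = "<person><name>John Smith</name></person>"

# Two templates standing in for two different XSLT stylesheets.
VIEWS = {
    "wanted": Template(
        "<html><body><h1>Most Wanted Person</h1><p>$name</p></body></html>"
    ),
    "father": Template(
        "<html><body><h1>My Dearest Father</h1><p>$name</p></body></html>"
    ),
}


def render(xml_text: str, view: str) -> str:
    """Render the same XML content through the chosen presentation."""
    name = ET.fromstring(xml_text).findtext("name", default="")
    return VIEWS[view].substitute(name=escape(name))


wanted_page = render(PERSON_XML, "wanted")
father_page = render(PERSON_XML, "father")
```

The content is stored once; only the template chosen at render time decides how it is presented.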

Client/Server Side XSLT
  Client-side Transformation
    Perform the transformation on the client, e.g. via the web browser, or through client-side scripts.
    Platform dependent.
  Server-side Transformation
    Perform the transformation on the server and transmit the resulting document to the client.
    No need to predict a heterogeneous client platform.

XSLT Processor
  Software program to perform transformation based on an XSLT stylesheet.
  Many such programs are available for free download from the Internet.
  HKUST Library uses xsltproc, the command-line tool of libxslt.

OAI-PMH
  Open Archives Initiative Protocol for Metadata Harvesting
  An interoperability standard for exposing and harvesting XML-formatted metadata.
  Why OAI?
    Once OAI-enabled, metadata can be readily harvested

OAI Data and Service Providers (1)
  Data Provider
    Exposes metadata about its content
    e.g. HKUST’s Institutional Repository, XML Name Access Control Repository, DOAJ, etc.
  Service Provider
    Harvests metadata from data providers
    e.g. OAIster, which collects (harvests) freely available, difficult-to-access, academically-oriented digital resources.

OAI Data and Service Providers (2)
[Diagram labels: Data Provider; OAI Framework; Service Provider (two); XML-Formatted Metadata]

OAI Data and Service Providers (3)
[Diagram labels: Data Providers (e.g. HKUST Institutional Repository, HKU Theses Online, LC American Memory); OAI Framework; Service Provider (e.g. OAIster); XML-Formatted Metadata]
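What a service provider does with the XML-formatted metadata it harvests can be sketched as follows. The response below is a made-up `ListRecords` reply (the identifier, title, and resumption token are invented), and the parser is a minimal illustration using Python's standard library, not the code of any harvester named in the slides. The namespace URIs are those of OAI-PMH 2.0 and unqualified Dublin Core.

```python
import xml.etree.ElementTree as ET

# Namespaces used by OAI-PMH 2.0 responses carrying oai_dc metadata.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "oai_dc": "http://www.openarchives.org/OAI/2.0/oai_dc/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Made-up response from a hypothetical data provider.
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <responseDate>2003-11-21T10:02:00Z</responseDate>
  <request verb="ListRecords" metadataPrefix="oai_dc">http://repository.example.org/oai</request>
  <ListRecords>
    <record>
      <header>
        <identifier>oai:repository.example.org:1</identifier>
        <datestamp>2003-11-01</datestamp>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>An Example Thesis</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>
"""


def parse_list_records(xml_text: str):
    """Return (records, resumption_token) from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for record in root.findall("oai:ListRecords/oai:record", NS):
        records.append({
            "identifier": record.findtext("oai:header/oai:identifier", namespaces=NS),
            "title": record.findtext("oai:metadata/oai_dc:dc/dc:title", namespaces=NS),
        })
    token = root.findtext("oai:ListRecords/oai:resumptionToken", namespaces=NS)
    # An empty or missing token means the list is complete.
    return records, (token or None)


records, token = parse_list_records(SAMPLE_RESPONSE)
```

A harvester would repeat the request with the returned resumption token until none is given, then merge the records from every data provider into its own index.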

OAI-PMH Protocol Requests
The six OAI-PMH protocol request commands (verbs):
  Identify
  ListMetadataFormats
  ListSets
  ListIdentifiers
  GetRecord
  ListRecords
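Each request is an ordinary HTTP request whose `verb` parameter names the command. As a minimal sketch, the Python helper below assembles such URLs; the base URL and record identifier are hypothetical placeholders, and the verb list is the one defined by OAI-PMH 2.0.

```python
from typing import Optional
from urllib.parse import urlencode

# Hypothetical repository endpoint, for illustration only.
BASE_URL = "http://repository.example.org/oai"

# Verbs defined by OAI-PMH 2.0.
VERBS = {
    "Identify",
    "ListMetadataFormats",
    "ListSets",
    "ListIdentifiers",
    "ListRecords",
    "GetRecord",
}


def build_request(verb: str, arguments: Optional[dict] = None) -> str:
    """Build an OAI-PMH GET request URL for the given verb."""
    if verb not in VERBS:
        raise ValueError(f"unknown OAI-PMH verb: {verb}")
    params = {"verb": verb}
    params.update(arguments or {})
    return f"{BASE_URL}?{urlencode(params)}"


identify_url = build_request("Identify")
get_record_url = build_request(
    "GetRecord",
    {"identifier": "oai:repository.example.org:1", "metadataPrefix": "oai_dc"},
)
```

Arguments are passed as a dictionary because one of the protocol's argument names, `from`, is a reserved word in Python.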

OAI-PMH Demonstrations
  HKUST’s Institutional Repository
    Identify, ListMetadataFormats, ListSets, ListRecords, ListIdentifiers, GetRecord
  XML Name Access Control Repository
    Identify, ListMetadataFormats, ListSets, ListRecords, ListIdentifiers, GetRecord-MARC21A, GetRecord-MARC21B, GetRecord-EACXML

ZING
  Z39.50 International: Next Generation
  Covers a collection of initiatives by Z39.50 implementers.

Why ZING?
  ZING is an effort to re-deploy Z39.50 in the modern computing environment using Web- and XML-based technologies.
  It retains the intellectual content of Z39.50 and the experience gained in the past 20 years of Z39.50 implementations.
  A thorough remaking (脫胎換骨)

What is Z39.50?
  Information retrieval protocol and service (NISO Z39.50 and ISO 23950)
  Allows searching of heterogeneous databases by a common search client
  Very popular in the library world, with a two-decade history of implementation
  But very little usage outside of the library world

Problems of Z39.50
  Uses uncommon syntaxes and protocols (OSI-based); incompatible with the modern Web- and XML-based environment
  Steep learning curve for implementers
  High implementation cost
  Not attractive to non-library applications
  Bath Profile

ZING’s SRW
  SRW - Search and Retrieval for the Web
  Retains the core functionality of Z39.50 in the form of a SOAP-based Web service
  Major concepts in Z39.50 are retained
  Communication mechanisms
    by SOAP messages
    by URL (also known as SRU, Search and Retrieval by URL)
  Search query by: CQL (Common Query Language)

SRW and SOAP
  Client, SRW SOAP request: "Give me all titles by John Smith"
  Server, SRW SOAP response: "No problem, here you are…"
  Request and response travel over the Web (HTTP)

SRU Demonstration
  Search the XML Name Access Control Repository by SRU.
  The following URL searches the Repository for exact name 'Ding Yi':
    ame+exact+ding+yi
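The demonstration URL is truncated in this transcript, so the sketch below does not reproduce it. It only illustrates how an SRU `searchRetrieve` request carrying a CQL query can be assembled in Python. The endpoint is a hypothetical placeholder, and the `name` index with the `exact` relation is inferred from the surviving tail of the slide's URL. The parameter names (`operation`, `version`, `query`, `maximumRecords`) are those of SRU 1.1.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Hypothetical SRU endpoint, not the repository's real address.
SRU_BASE = "http://sru.example.org/names"


def build_sru_url(cql_query: str, maximum_records: int = 10) -> str:
    """Build an SRU searchRetrieve URL for a CQL query."""
    params = {
        "operation": "searchRetrieve",
        "version": "1.1",
        "query": cql_query,
        "maximumRecords": str(maximum_records),
    }
    return f"{SRU_BASE}?{urlencode(params)}"


# CQL: index "name", relation "exact", quoted multi-word term.
url = build_sru_url('name exact "ding yi"')

# Decoding the query string recovers the original CQL expression.
decoded_query = parse_qs(urlsplit(url).query)["query"][0]
```

The multi-word term is quoted here, as CQL expects; the slide's URL appears to pass the two words unquoted.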

XML Tools and Databases
  Free XML Tools and Software
  XML and Databases, by Ronald Bourret
  XML Database Products

XML Database Systems
  Native XML Database Systems
    Designed to store XML documents
    Fundamental unit of storage is the XML document
    Examples: Tamino, Berkeley DB XML
  XML-Enabled Database Systems
    Any database system that has a gateway to accept XML data, conduct XML-based queries, and output XML-formatted data.
    Examples: Oracle, Microsoft SQL Server

Native or not Native?
  Native XML Databases are most suitable for applications that have:
    document-oriented data (e.g. full text of a book)
    loosely-structured data with deep nesting (hierarchical) (e.g. bibliographic data)
    data already in XML format (e.g. SOAP messages)
  Highly structured data fits best with Relational Database Systems (e.g. order record)

Database System Evaluation (1)
  HKUST Library conducted an evaluation of Native XML Database Systems in March 2001, with the objective of building an XML document storage and retrieval infrastructure for the Library.
  Tamino from Software AG was selected
    Most mature and complete at that time
    Supports a rich set of programming languages
    Query by URL

Database System Evaluation (2)
    Software AG is an active participant in XML standards
    Unicode-based, and supports major character encodings