Next Generation Z39.50 A Web Services Approach for Search and Retrieve 6 th Annual State GILS Conference, March 31 – April 3, 2004, Raleigh, NC William.

Slides:



Advertisements
Similar presentations
OAI from 50,000 Feet OAI develops and promotes interoperability solutions that aim to facilitate the efficient dissemination of content. Begun in 1999.
Advertisements

Searching Options and Result Sets Sara Randall Endeavor Information Systems October 30, 2003.
Z39.50 as a Web Service Ralph LeVan Research Scientist.
Theo van Veen, Koninklijke Bibliotheek The European Library: opportunities for new services.
SRW/U for DSpace Ralph LeVan Research Scientist. What is SRW/U A Pair of HTTP-based Text Query Protocols – SRW: Search and Retrieve Web Service – SRU:
Z39.50 as a Web Service Ralph LeVan Research Scientist.
A centre of expertise in digital information management UKOLN is supported by: SRU: An overview of the SRU protocol and how it can be used.
Delivering MARC/XML records from the Library of Congress catalogue using the open protocols SRW/U and Z39.50 Mike Taylor, Index Data
? CQL – a Common Query LanguageMike Taylor CQL – a Common Query Language 1. What CQL is 2. Motivation 3. Examples and explanation 4.
Distributed Service Registries Workshop, July 2005 Slide 1 NISO Metasearch Initiative Registries Robert Sanderson Dept. of Computer Science University.
February Harvesting RDF metadata Building digital library portals with harvested metadata workshop EU-DL All Projects concertation meeting DELOS.
Ray Denenberg Ralph LeVan Interoperability Standards & Searching Multiple Repositories Workshop 20 March 25, 2006; Washington.
Z39.50 and the Web ZIG July 2000 Poul Henrik Jørgensen, Danish Bibliographic Centre,
CS 430 / INFO 430 Information Retrieval
Searching Digital Content via SRU Ryan Scherle Randall Floyd October 25, 2006.
An Introduction to MODS: The Metadata Object Description Schema Tech Talk By Daniel Gelaw Alemneh October 17, 2007 October 17, 2007.
ZML  ZNG  ZING “Z39.50 over XML”  “Z39.50 Next Generation”  “Z39.50-international: Next Generation”
ELAG 2005 / Workshop 1: ZING Geneva, June 2005 Ole Husby, BIBSYS.
The KB on its way to Web 2.0 Lower the barrier for users to remix the output of services. Theo van Veen, ELAG 2006, April 26.
Presentation 7 part 2: SOAP & WSDL. Ingeniørhøjskolen i Århus Slide 2 Outline Building blocks in Web Services SOA SOAP WSDL (UDDI)
A Practical Introduction to XML in Libraries Marty Kurth NYLA October 22, 2004.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Chapter 16 The World Wide Web. 2 The Web An infrastructure of information combined and the network software used to access it Web page A document that.
Internet Basics Dr. Norm Friesen June 22, Questions What is the Internet? What is the Web? How are they different? How do they work? How do they.
T Network Application Frameworks and XML Web Services and WSDL Sasu Tarkoma Based on slides by Pekka Nikander.
Z39.50 & XML Poul Henrik Jørgensen, Danish Bibliographic Centre
Positioning Z39.50 in the Networked Library Standards for Building Sustainable Services William E. Moen School of Library and Information Sciences Texas.
Z39.50 for Finding It All William E. Moen School of Library and Information Sciences Texas Center for Digital Knowledge University of North Texas Denton,
Enabling Embedded Systems to access Internet Resources.
Lucas Mak and Dao Rong Gong Michigan State University Millennium and XML: Repurposing and Customizing Metadata May , 2009.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
ELAG 2004  Work Shop on ZING Bill Oldroyd, Animator British Library Janifer Gatenby, Scribe OCLC PICA, Leiden, Netherlands.
James Holladay, Mario Sweeney, Vu Tran. Web Services Presentation Web Services Theory James Holladay Tools – Visual Studio Vu Tran Tools – Net Beans Mario.
Lecture 15 Introduction to Web Services Web Service Applications.
ISO ILL, NISO NCIP or OpenURL? ELAG 2005 workshop 2 report Ed. Poul Henrik Jørgensen.
A Web Services Approach for Search and Retrieve The Next Generation Z39.50 Access 2004, October 13-16, 2004, Halifax, Nova Scotia William E. Moen School.
Testing and Improving Interoperability The Z39.50 Interoperability Testbed William E. Moen School of Library and Information Sciences Texas Center for.
Linking electronic documents and standardisation of URL’s What can libraries do to enhance dynamic linking and bring related information within a distance.
An Alternative Approach to Interoperability Testing The Use of Special Diagnostic Records in the Context of Z39.50 and Online Library Catalogs William.
XML Web Services Architecture Siddharth Ruchandani CS 6362 – SW Architecture & Design Summer /11/05.
Creating an Application Profile Tutorial 3 DC2004, Shanghai Library 13 October 2004 Thomas Baker, Fraunhofer Society Robina Clayphan, British Library Pete.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
Chris Kuruppu NWS Office of Science and Technology Systems Engineering Center (Skjei Telecom) 10/6/09.
Discovery Metadata for Special Collections Concepts, Considerations, Choices William E. Moen School of Library and Information Sciences Texas Center for.
Extending Access To Information Resource Discovery Service William E. Moen, Ph.D. Kathleen R. Murray, Ph.D. School of Library and Information Sciences.
OCLC Online Computer Library Center Interoperability Standards & Searching Multiple Repositories Ralph LeVan/OCLC Ray Denenberg/Library of Congress.
A Quick Introduction to Metadata Michael Day UKOLN: The UK Office for Library and Information Networking, University of Bath
Ray Denenberg Rob Sanderson “ Key Standards Updates ” SRU Project Briefing April 4, 2006; Washington.
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
CNI, 4th April 2006 Slide 1 Key Standards Update: SRU (“Technical” Details) Dr. Robert Sanderson Dept. of Computer Science University of Liverpool
Radioactive Metadata Records An Interoperability Testing Approach Based on Metadata Utilization William E. Moen School of Library and Information Sciences.
Z39.50 & The Z Texas Profile William E. Moen School of Library and Information Sciences University of North Texas Denton, TX.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
WSDL Specification WSDL = Web Service Description Language Similar to ASN.1 – describes the abstract on the wire data structures Multiple bindings to on.
SRW/U: Re-Introduction SRW is a Web Services based Information Retrieval Protocol Motivations: Create an easy to implement protocol with the power of Z39.50.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
Future of Cataloguing: how RDA positions us for the future for RDA Workshop June, 2010.
No Longer Under Our Control? The Nature and Role of Standards in the 21 st Century Library William E. Moen School of Library and Information Sciences Texas.
Research and Projects: Z, M, and Beyond! William E. Moen School of Library and Information Sciences Texas Center for Digital Knowledge University of North.
Z39.50 and the ZING Initiatives: MAVIS Users Conference, 2003 November 6, 2003 Larry E. Dixson Library of Congress.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
MARC Content Designation Use I mplications for indexing & interoperability William E. Moen School of Library and Information Sciences Texas Center for.
SOAP, Web Service, WSDL Week 14 Web site:
A Complex Standard and Its Use Results from an empirical analysis of MARC 2004 Texas Library Association Annual Conference, March 18, 2004, San Antonio,
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
Sabri Kızanlık Ural Emekçi
WEB SERVICES From Chapter 19 of Distributed Systems Concepts and Design,4th Edition, By G. Coulouris, J. Dollimore and T. Kindberg Published by Addison.
Chapter 9 Web Services: JAX-RPC, WSDL, XML Schema, and SOAP
AGENT FRAMEWORK By- Arpan Biswas Rahul Gupta.
WEB SERVICES From Chapter 19, Distributed Systems
Presentation transcript:

Next Generation Z39.50 A Web Services Approach for Search and Retrieve 6 th Annual State GILS Conference, March 31 – April 3, 2004, Raleigh, NC William E. Moen School of Library and Information Sciences Texas Center for Digital Knowledge University of North Texas Denton, TX 72603

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 2 Overview Quick description of SRW Brief background – historical, political, conceptual Non-technical (almost) introduction to SRW Common Query Language (CQL) briefly Concluding thoughts

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 3 What is SRW? Search and Retrieve Web Service (SRW) An XML-based protocol for searching, retrieving, and other information retrieval transactions Cast in the standards/technologies for web services XML SOAP HTTP Brings the concepts and experience of Z39.50 into the web environment using web technologies

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 4 Why SRW? Genesis: several years of soul searching by Z39.50 developers and implementors The “web” had become the common implementation environment Z39.50 was not perceived as web friendly Pivotal moments: December 2000 ZIG meeting July 2001 meeting

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 5 Turning point: December 2000 “Z39.50 Future” discussion Perceptions of Z39.50  broken  heavy-weight  difficult and complex  old technology  not web friendly Several options presented Rewrite the protocol from the ground up Rewrite as an XML protocol Separate the Z39.50 protocol from its use of BER as a wire protocol Simplify the protocol specifications to focus on core features Recognition of the intellectual contribution of Z39.50

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 6 Taking action: June 2001 Invitational meeting to discuss moving Z39.50 to an XML- based protocol Goal Lower the barriers to implementation while preserving the existing intellectual contributions of Z39.50, discarding those aspects no longer useful or meaningful. Objective Define specifications for a new web service definition based on Z39.50 together with web technologies Separate the Z39.50 abstract and associated semantic model from its specific encoding and wire protocol (i.e., ASN.1/BER and TCP/IP) Initially called Z39.50 Next Generation (ZNG) Intended as proof-of-concept Defining only those protocol specifications that would actually be implemented by participants

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 7 ZING – Z39.50 International Next Generation Make intellectual/semantic content of Z39.50 more broadly available Make Z39.50 more attractive by lowering barriers to implementation Use of XML – to represent and encode data Use of HTTP – for transport Use of SOAP – for interaction between client and server based on Remote Procedural Call (RPC) Several ZING initiatives: ZOOM, ez39.50, ZeeRex, SRW/U FOR MORE INFORMATION, VISIT THE ZING WEBSITE…

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 8 SRW/U, SRW, SRU SRW/U: Search and Retrieve for the Web General designation for this initiative SRW: Search and Retrieve Web Service HTTP Post Simple Object Access Protocol (SOAP) XML messages SRU: Search and Retrieve URL Service HTPP Get Request parameters included in URL syntax Development Version 1.0 November 2001 Version 1.1 February 2002 FOR MORE INFORMATION, VISIT THE SRW WEBSITE…

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 9 Networked information retrieval What’s needed: Identifying a target to search A vocabulary for expressing search requests, search criteria, retrieval requests, etc. Methods to encode the requests and responses from the target Methods to transport the requests and responses across a network In other words, a protocol and supporting specifications

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 10 Abstract Model of IR

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 11 Abstract model of Z39.50

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 12 Z39.50 classic & SRW

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 13 SRW Overview Builds on Z39.50 concepts and web technologies Web technologies: XML, SOAP, HTTP Uses new, human-readable query language Combines several Z39.50 features into several “operation types” searchRetrieve operation scan operation explain operation

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 14 searchRetrieve operation The core of the protocol Expresses the search and additional criteria Records are returned in XML Request parameters version query Optional parameters sortkeys recordPacking recordSchema recordXPath stylesheet Response parameters version numberOfRecords Optional parameters resultSetID resultSetIdleTime records diagnostics

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 15 SRW & XML XML as foundation for protocol Provides syntax for intelligent markup Defines or references XML schemas Example XML schema for SRW specifications searchRetrieveRequest searchRetrieveResponse

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 16 searchRetrieveRequest example Sent as a HTTP Post XML document is sent to the server Using SOAP to wrap the request 1.1 dc.title all "Squirrel Hungry" 1 dc

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 17 searchRetrieveResponse example info:srw/schema/1/dc- v1.1 Squirrel is Hungry

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 18 searchRetrieve response Records returned in response All records in XML syntax According to one or more XML schemas (semantics) Dublin Core Onix MODS MarcXml

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 19 searchRetrieve example Retrieval results XML view Screen shot 1.1 dc.title computer 1 10 xml dc>

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 20 SRW results

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 21 SRU briefly Protocol requests can be carried via HTTP Get searchRetrieveRequest parameters expressed in standard URL syntax baseURL and search part separated by question mark “?” Response is XML document containing records The searchRetrieveRequest in SRU: rchRetrieve&version=1.1&query=dc.title=%22compute r%22&recordSchema=DC&startRecord=1&maximumR ecords=10&recordPacking=xml

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 22 search/Retrieve query SRW query consists of one or more query statements linked by Boolean operators Five categories of query statements: 1.single search clause 2.two or more search clauses linked by Boolean 3.search clauses and result sets linked by Boolean 4.two or more result sets linked by Boolean 5.single result set Expressed in the Common Query Language (CQL)

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 23 Common Query Language (CQL) A formal language for representing queries to information retrieval systems Human-readable Search clause Always includes a term simple terms consist of one or more words May include index name To limit search to a particular field/element Index name includes base name and may include prefix title, subject dc.title, dc.subject Several index sets have been defined (called Context Sets in SRW) dc bath srw Context set defines the available indexes for a particular application

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 24 Other components of CQL Relation, =, =, <> exact used for string matching all when term is list of words to indicate all words must be found any when term is list of words to indicate any words must be found Boolean operators: and, or, not Proximity (prox operator) relation (, =, =, <>) distance (integer) unit (word, sentence, paragraph, element) ordering (ordered or unordered) Masking rules and special characters single asterisk (*) to mask zero or more characters single question mark (?) to mask a single character carat/hat (^) to indicate anchoring, left or right

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 25 CQL examples Simple queries: dinosaur "the complete dinosaur" Boolean dinosaur and bird or dinobird "feathered dinosaur" and (yixian or jehol) Proximity foo prox bar foo prox/>/4/word/ordered bar Indexes title = dinosaur bath.title="the complete dinosaur" srw.serverChoice=dinosaur Relations year > 1998 title all "complete dinosaur" title any "dinosaur bird reptile" title exact "the complete dinosaur"

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 26 SRW & classic Z39.50 SRW No explicit concept of connection, session, or state Results sets named by server Single record syntax (XML), multiple schemas String (i.e., human- readable) queries CQL Named indexes Classic Z39.50 Stateful Results sets named by client Multiple record syntaxes No human-readable query language Type 1 query using attribute sets Use attribute to identify access point Z39.50 Concepts Retained Result sets Abstract access points Abstract record schemas Explain Diagnostics

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 27 What problems does SRW solve Addresses need for standards-based searching in the networked environment Shows the vitality of the Z39.50 concepts and implements those in a web services & URL access context Offers database providers with a web-friendly method for offering standards-based searching of resources Provides low barrier to entry solution using commonly available technologies XML format of records provide for more reuse, and more interesting use of resources

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 28 Possible implementation venues Gateways to existing Z39.50 servers Lightweight SRW/U servers to specialized databases Cost-effective search access to commercial databases (e.g., citation, full- text) Metasearching Beyond libraries to many other information communities

Moen 6th Annual State GILS Conference -- March 31 – April 3, Raleigh, NC 29 References Z39.50 International Next Generation – ZING  Search and Retrieve for the Web – SRW/U  A Gentle Introduction to SRW  A Gentle Introduction to CQL  Search and Retrieval in The European Library: A New Approach by van Veen and Oldroyd in D-Lib (Feb04) 