Georges Arnaout Chaitanya Krishna


The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)
Website: http://www.openarchives.org/OAI/2.0/openarchivesprotocol.htm
Editors: Carl Lagoze (Cornell University), Herbert Van de Sompel (Los Alamos National Laboratory), Michael Nelson (NASA Langley Research Center), Simeon Warner (Cornell University)
Presented by: Georges Arnaout, Chaitanya Krishna
CS 791/891-WEB SYNDICATION FORMATS

OAI: Open Archives Initiative
Open: the protocol is openly documented, and metadata is "exposed" to at least some peer group.
Archive: defined as a "collection of stuff", or "repository".
Initiative: OAI is happening at break-neck speed...
Figure reference: http://www.cs.odu.edu/~mln/oaf-nelson.ppt

Definition
OAI-PMH: a protocol that provides an application-independent interoperability framework based on metadata harvesting.
But what is interoperability?

What is Interoperability?
It is the ability of two or more applications or systems to exchange information and to use the information that has been exchanged.

What is a Harvester?
It is a client application that issues OAI-PMH requests. It is operated in order to collect metadata from repositories.

What is a Repository?
It is a big database: a place where data is stored and maintained. It is a network-accessible server that can process OAI-PMH requests. The data it contains are the metadata that are exposed to harvesters.

Verbs Summary
Verb                 Function
Identify             description of repository
ListMetadataFormats  metadata formats supported by repository
ListSets             sets defined by repository
ListIdentifiers      OAI unique ids contained in repository
ListRecords          listing of N records
GetRecord            listing of a single record
Figure reference: http://www.cs.odu.edu/~mln/oaf-nelson.ppt

OAI-PMH Data Model
OAI-PMH distinguishes between 3 distinct entities related to the exposed metadata:
1- Resource: the object that the metadata is about.
2- Item: a constituent of a repository from which metadata about a resource can be disseminated. That metadata may be disseminated on the fly, cross-walked from some canonical form, or actually stored in the repository.
3- Record: metadata from an item, expressed in a specific metadata format.

Example (figure): a resource; an item = identifier, holding all available metadata about David; records of that item in Dublin Core, MARC and SPECTRUM; record = identifier + metadata format + datestamp.
Figure reference: http://www.cs.odu.edu/~mln/oaf-nelson.ppt
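The item/record relationship can be sketched in Python as below. This is only an illustration: the class names `Item` and `Record` and their methods are hypothetical and not part of the protocol, the identifier is borrowed from the arXiv example used elsewhere in these slides, and the metadata payloads are placeholders.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass(frozen=True)
class Record:
    """Metadata about a resource, expressed in one specific format."""
    identifier: str        # unique identifier of the item
    metadata_prefix: str   # metadata format, e.g. "oai_dc"
    datestamp: str         # date of creation, modification or deletion
    metadata: str = ""     # XML payload (placeholder in this sketch)


@dataclass
class Item:
    """A repository entry from which records in several formats can be disseminated."""
    identifier: str
    records: Dict[str, Record] = field(default_factory=dict)

    def add(self, metadata_prefix: str, datestamp: str, metadata: str = "") -> Record:
        record = Record(self.identifier, metadata_prefix, datestamp, metadata)
        self.records[metadata_prefix] = record
        return record

    def get_record(self, metadata_prefix: str) -> Record:
        # One item, many formats: the prefix selects which record is returned.
        return self.records[metadata_prefix]


item = Item("oai:arXiv.org:cs/0112017")
item.add("oai_dc", "2001-12-14", "<oai_dc:dc>...</oai_dc:dc>")
item.add("marc21", "2001-12-14", "<record>...</record>")
```

Asking the same item for `oai_dc` or `marc21` returns two different records that share one identifier, which is the distinction the data model draws.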

The XML Encoding of Records
A record has three parts: header, metadata, about.
The XML encoding of a record is shown at http://www.openarchives.org/OAI/openarchivesprotocol.html#Record

What happens if a record is deleted from the repository?
The repository declares its policy in the deletedRecord element of its Identify response.

What happens if a record is deleted from the repository?
Repositories must declare one of 3 levels of support:
1- no: the repository does not maintain information about deletions. It MUST NOT reveal a deleted status in any response.
2- persistent: (the opposite) the repository maintains information about deletions with no time limit. It MUST persistently keep track of deletions and reveal the status of a deleted record.
3- transient: like persistent, but only for a limited time. Such a repository MAY reveal a deleted status; not revealing the status is acceptable.
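On the harvester side, a deleted record shows up as a header carrying `status="deleted"`. The following is a minimal sketch of separating live and deleted identifiers; the sample response, the `oai:example.org` identifiers and the function name `split_by_status` are made up for illustration.

```python
import xml.etree.ElementTree as ET

OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}

# Hypothetical, abbreviated ListIdentifiers response.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListIdentifiers>
    <header>
      <identifier>oai:example.org:1</identifier>
      <datestamp>2002-06-01</datestamp>
    </header>
    <header status="deleted">
      <identifier>oai:example.org:2</identifier>
      <datestamp>2002-06-02</datestamp>
    </header>
  </ListIdentifiers>
</OAI-PMH>"""


def split_by_status(xml_text):
    """Return (live_ids, deleted_ids) found in the response headers."""
    root = ET.fromstring(xml_text)
    live, deleted = [], []
    for header in root.iterfind(".//oai:header", OAI_NS):
        identifier = header.findtext("oai:identifier", namespaces=OAI_NS)
        if header.get("status") == "deleted":
            deleted.append(identifier)
        else:
            live.append(identifier)
    return live, deleted
```

A harvester talking to a repository that declares "no" support would never see the deleted status and would have to detect removals by other means, such as a full re-harvest.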

Selective Harvesting (datestamps and sets)
Selective harvesting allows harvesters to limit harvest requests to portions of the metadata available from a repository.

Selective Harvesting via Datestamps
Request: http://www3.bth.se/servlet/Cupp?verb=ListRecords&metadataPrefix=oai_dc&from=2006-01-01&until=2007-01-01
The from and until arguments are accepted by ListRecords and ListIdentifiers; Identify takes no arguments.
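A sketch of how a harvester might assemble a datestamp-selective request. The helper name `selective_request` and its parameter names are assumptions for illustration; the base URL is the one from the slide. Only ListIdentifiers and ListRecords accept `from`, `until` and `set`, so the helper rejects other verbs.

```python
from urllib.parse import urlencode


def selective_request(base_url, verb, metadata_prefix,
                      from_=None, until=None, set_spec=None):
    """Build a selective-harvesting URL (hypothetical helper)."""
    if verb not in ("ListIdentifiers", "ListRecords"):
        raise ValueError("selective harvesting applies to ListIdentifiers and ListRecords")
    params = {"verb": verb, "metadataPrefix": metadata_prefix}
    if from_:
        params["from"] = from_      # lower bound on the datestamp
    if until:
        params["until"] = until     # upper bound on the datestamp
    if set_spec:
        params["set"] = set_spec    # optional set membership filter
    return base_url + "?" + urlencode(params)


url = selective_request("http://www3.bth.se/servlet/Cupp", "ListRecords",
                        "oai_dc", from_="2006-01-01", until="2007-01-01")
```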

Set Membership
A set is an optional construct for grouping items for the purpose of selective harvesting. Think of it as a fraternity: a student (item) may belong to a fraternity, but not all students do.

Selective Harvesting via Sets
<record>
  <header>
    <identifier>oai:arXiv:cs/0112017</identifier>
    <datestamp>2001-12-14</datestamp>
    <setSpec>cs</setSpec>
    <setSpec>math</setSpec>
  </header>
  <metadata> ….. </metadata>
</record>
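Set membership can be read straight from the header, as in the sketch below. The function names `set_specs` and `in_set` are hypothetical, and the header is the namespace-free snippet from the slide (a real response places these elements in the OAI-PMH namespace). The prefix check assumes the colon-separated hierarchy that setSpec values use, where `physics:hep` is a descendant of `physics`.

```python
import xml.etree.ElementTree as ET

HEADER = """<header>
  <identifier>oai:arXiv:cs/0112017</identifier>
  <datestamp>2001-12-14</datestamp>
  <setSpec>cs</setSpec>
  <setSpec>math</setSpec>
</header>"""


def set_specs(header_xml):
    """List the setSpec values of one record header."""
    return [element.text for element in ET.fromstring(header_xml).findall("setSpec")]


def in_set(specs, wanted):
    """True if any setSpec equals `wanted` or is one of its descendants."""
    return any(spec == wanted or spec.startswith(wanted + ":") for spec in specs)
```

The item above belongs to two sets at once (`cs` and `math`), so a harvest restricted to either set would return it.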

Date/time
Example: 1957-03-20T20:30:00Z is 8:30:00 PM UTC on March 20th, 1957.
Encoding: ISO 8601, UTC with Z notation.
Request arguments: YYYY-MM-DD or YYYY-MM-DDThh:mm:ssZ.
Response (responseDate): YYYY-MM-DDThh:mm:ssZ.
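Both granularities can be handled with the standard library, as in this sketch. The function names `parse_datestamp` and `format_datestamp` are illustrative; the only assumption is that every value is in UTC, as the protocol requires.

```python
from datetime import datetime, timezone

DAY = "%Y-%m-%d"                 # YYYY-MM-DD
SECONDS = "%Y-%m-%dT%H:%M:%SZ"   # YYYY-MM-DDThh:mm:ssZ


def parse_datestamp(value):
    """Parse either granularity into a timezone-aware UTC datetime."""
    fmt = SECONDS if "T" in value else DAY
    return datetime.strptime(value, fmt).replace(tzinfo=timezone.utc)


def format_datestamp(moment, granularity=SECONDS):
    """Format a datetime as a UTC datestamp in the chosen granularity."""
    return moment.astimezone(timezone.utc).strftime(granularity)
```

A harvester should send `from` and `until` in a granularity the repository supports; day granularity is always acceptable.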

The BIG PICTURE
Figure reference: http://www.oaforum.org/tutorial/english/page3.htm

Request/Response
Requests are encoded in HTTP; responses are encoded in XML.
Figure reference: http://www.cs.odu.edu/~mln/oai-cendi.ppt

GET Example
http://export.arxiv.org/oai2?verb=GetRecord&identifier=oai:arXiv.org:cs/0112017&metadataPrefix=oai_dc
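The same GET request can be assembled programmatically. In this sketch the helper name `get_record_url` is hypothetical; `urlencode` percent-encodes the `:` and `/` characters in the identifier, and `parse_qs` shows that the arguments decode back unchanged.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def get_record_url(base_url, identifier, metadata_prefix="oai_dc"):
    """Build a GetRecord request URL (hypothetical helper)."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


url = get_record_url("http://export.arxiv.org/oai2", "oai:arXiv.org:cs/0112017")
# Decode the query string again to inspect the individual arguments.
args = parse_qs(urlsplit(url).query)
```

Passing the resulting URL to `urllib.request.urlopen` would perform the actual request; that step is left out here so the sketch does not depend on network access.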

Flow Control
List requests: a number of OAI-PMH requests (ListIdentifiers, ListRecords, ListSets) return a list. The list could be very large, so the repository may partition it among a series of requests and responses.

Flow Control Example (harvester and a repository backed by an RDBMS)
harvester -> repository: ListRecords
repository -> harvester: Records 1-100, resumptionToken=AXad31
harvester -> repository: ListRecords, resumptionToken=AXad31
repository -> harvester: Records 101-200, resumptionToken=pQ22-x
harvester -> repository: ListRecords, resumptionToken=pQ22-x
repository -> harvester: Records 201-277
Figure reference: http://www.cs.odu.edu/~mln/oaf-nelson.ppt
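The exchange above amounts to a loop that keeps re-issuing the request with the latest resumptionToken until none is returned. The sketch below replaces the network with a stand-in function, `fake_list_records`, that serves the 277 records and the two tokens from the example; both function names are hypothetical.

```python
from typing import Callable, Iterator, List, Optional, Tuple

Page = Tuple[List[int], Optional[str]]


def fake_list_records(resumption_token: Optional[str] = None) -> Page:
    """Stand-in for a repository: 277 records served in pages of 100."""
    pages = {
        None: (list(range(1, 101)), "AXad31"),
        "AXad31": (list(range(101, 201)), "pQ22-x"),
        "pQ22-x": (list(range(201, 278)), None),
    }
    return pages[resumption_token]


def harvest(list_records: Callable[[Optional[str]], Page]) -> Iterator[int]:
    """Yield every record, following resumption tokens until the list is complete."""
    token = None
    while True:
        records, token = list_records(token)
        yield from records
        if not token:  # an absent or empty token marks the final page
            break


records = list(harvest(fake_list_records))
```

In a real harvester `list_records` would issue the HTTP request and parse the XML; the control flow stays the same.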

Response with no errors
<?xml version="1.0" encoding="UTF-8"?>
<OAI-PMH>
  <responseDate>2002-02-08T08:55:46Z</responseDate>
  <request verb="GetRecord"… …>http://arXiv.org/oai2</request>
  <GetRecord>
    <record>
      <header>
        <identifier>oai:arXiv:cs/0112017</identifier>
        <datestamp>2001-12-14</datestamp>
        <setSpec>cs</setSpec>
        <setSpec>math</setSpec>
      </header>
      <metadata> ….. </metadata>
    </record>
  </GetRecord>
</OAI-PMH>

Response with errors
In the event of an error or exception condition, repositories must indicate OAI-PMH errors by including one or more error elements in the response.
Request: http://arXiv.org/oai2?verb=nastyVerb
Response:
<?xml version="1.0" encoding="UTF-8"?>
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd">
  <responseDate>2002-06-01T19:20:30Z</responseDate>
  <request>http://arXiv.org/oai2</request>
  <error code="badVerb">Illegal OAI verb</error>
</OAI-PMH>
For badVerb and badArgument errors, the request element carries only the base URL, without attributes.
Figure reference: http://www.openarchives.org/OAI/openarchivesprotocol.html#Identify
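Because OAI-PMH errors arrive inside an ordinary XML response rather than as a special HTTP status, a harvester has to look for error elements itself. The sketch below does that; the exception class `OAIError`, the function `check_errors` and the abbreviated sample response are hypothetical illustrations.

```python
import xml.etree.ElementTree as ET

OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}

# Illustrative reply to a request that used an unknown verb.
RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <responseDate>2002-06-01T19:20:30Z</responseDate>
  <request>http://arXiv.org/oai2</request>
  <error code="badVerb">Illegal OAI verb</error>
</OAI-PMH>"""


class OAIError(Exception):
    """Raised when a response carries an OAI-PMH error element."""

    def __init__(self, code, message):
        super().__init__(f"{code}: {message}")
        self.code = code


def check_errors(xml_text):
    """Return the parsed root element, or raise OAIError for the first error found."""
    root = ET.fromstring(xml_text)
    errors = root.findall("oai:error", OAI_NS)
    if errors:
        first = errors[0]
        raise OAIError(first.get("code"), (first.text or "").strip())
    return root
```

A harvester might treat some codes, for example noRecordsMatch, as an empty result rather than a failure; that policy is left to the caller.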

Request Verbs
There are six different request types:
1) GetRecord
2) Identify
3) ListIdentifiers
4) ListMetadataFormats
5) ListRecords
6) ListSets

Argument Summary
R = required, O = optional, X = exclusive, - = not allowed
Verb                 metadataPrefix  from  until  set  resumptionToken  identifier
Identify             -               -     -      -    -                -
ListMetadataFormats  -               -     -      -    -                O
ListSets             -               -     -      -    X                -
ListIdentifiers      R               O     O      O    X                -
ListRecords          R               O     O      O    X                -
GetRecord            R               -     -      -    -                R
Figure reference: http://www.cs.odu.edu/~mln/jcdl03/
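The argument rules lend themselves to a lookup table. The sketch below encodes the required, optional and exclusive arguments per verb as defined by OAI-PMH 2.0 and returns the error code a repository would report; the names `ARGUMENT_RULES` and `validate` and the dictionary layout are assumptions made for illustration.

```python
LIST_ARGS = {"required": {"metadataPrefix"},
             "optional": {"from", "until", "set"},
             "resumable": True}

ARGUMENT_RULES = {
    "Identify":            {"required": set(), "optional": set(), "resumable": False},
    "ListMetadataFormats": {"required": set(), "optional": {"identifier"}, "resumable": False},
    "ListSets":            {"required": set(), "optional": set(), "resumable": True},
    "ListIdentifiers":     LIST_ARGS,
    "ListRecords":         LIST_ARGS,
    "GetRecord":           {"required": {"identifier", "metadataPrefix"},
                            "optional": set(), "resumable": False},
}


def validate(verb, args):
    """Return None if the request is well formed, else an OAI-PMH error code."""
    rules = ARGUMENT_RULES.get(verb)
    if rules is None:
        return "badVerb"
    names = set(args)
    if "resumptionToken" in names:
        # Exclusive: allowed only on list verbs, and only on its own.
        ok = rules["resumable"] and names == {"resumptionToken"}
        return None if ok else "badArgument"
    missing = rules["required"] - names
    unknown = names - rules["required"] - rules["optional"]
    return "badArgument" if missing or unknown else None
```

For example, `validate("Identify", {"from": "2006-01-01"})` yields `"badArgument"`, since Identify accepts no arguments.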

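Error Summary
BA = badArgument, NMF = noMetadataFormats, IDDNE = idDoesNotExist, BRT = badResumptionToken, NSH = noSetHierarchy, CDF = cannotDisseminateFormat, NRM = noRecordsMatch
Verb                 Errors
Identify             BA
ListMetadataFormats  BA, NMF, IDDNE
ListSets             BA, BRT, NSH
ListIdentifiers      BA, BRT, NSH, CDF, NRM
ListRecords          BA, BRT, NSH, CDF, NRM
GetRecord            BA, IDDNE, CDF
Figure reference: http://www.cs.odu.edu/~mln/jcdl03/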

Dublin Core
The Dublin Core metadata element set is a standard for cross-domain information resource description. It has been the mandated metadata format since the initial release of the protocol. The purpose of this requirement is to promote interoperability among data providers.
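A sketch of reading unqualified Dublin Core out of the metadata part of an `oai_dc` record. The two namespace URIs are the ones used by the oai_dc format; the sample values (title, creators, date) and the function name `dublin_core_fields` are invented for illustration. Every Dublin Core element is optional and repeatable, so values are collected into lists.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"

# Made-up metadata payload in the oai_dc container.
METADATA = """<oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
    xmlns:dc="http://purl.org/dc/elements/1.1/">
  <dc:title>An example title</dc:title>
  <dc:creator>Doe, Jane</dc:creator>
  <dc:creator>Roe, Richard</dc:creator>
  <dc:date>2001-12-14</dc:date>
</oai_dc:dc>"""


def dublin_core_fields(xml_text):
    """Collect every dc:* child element into {element name: [values]}."""
    fields = {}
    prefix = "{" + DC_NS + "}"
    for element in ET.fromstring(xml_text):
        if element.tag.startswith(prefix):
            name = element.tag[len(prefix):]
            fields.setdefault(name, []).append(element.text or "")
    return fields
```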

Example http://memory.loc.gov/cgi-bin/oai2_0?verb=Identify http://edoc.hu-berlin.de/OAI-2.0?verb=Identify

Repository Explorer and Example
http://re.cs.uct.ac.za/
We shall explore the following HU-Berlin repository in the above Repository Explorer: http://edoc.hu-berlin.de/OAI-2.0

OAI-PMH Service Provider
http://www.ncstrl.org/ is a service provider that uses OAI-PMH.

Conclusion
OAI-PMH allows for any metadata format, so long as it is encoded in XML with an XML schema.
All repositories must support oai_dc for a minimum level of interoperability.
OAI-PMH now defines a single XML Schema to validate responses to all OAI-PMH requests.
In a successful and trend-setting collaboration with the Dublin Core Metadata Initiative, an XML Schema for unqualified Dublin Core has been created. It is hosted by the DCMI and used to deliver metadata in the mandatory DC format in the OAI-PMH.

Questions?
What are the benefits of OAI-PMH?
Is the Open Archives Initiative only concerned with metadata?
Why was Dublin Core chosen as the standard metadata format for OAI-PMH?

References
http://www.openarchives.org/OAI/openarchivesprotocol.html
http://www.oaforum.org/tutorial/
http://dublincore.org/
http://www.rsp.ac.uk/usage/harvesters
http://www.cs.odu.edu/~mln/jcdl03/
http://www.cs.odu.edu/~mln/oai-cendi.ppt [CENDI Meeting, MD (4/3/02)]
http://www.cs.odu.edu/~mln/oaf-nelson.ppt [OA Forum Workshop, Pisa, Italy (5/13/02)]