Thomas G. Habing – University of Illinois at Urbana-Champaign Recap: SIGIR 2001 OAI Workshop 19 September 2001 -- OAI Provider Workshop, University of.

Slides:



Advertisements
Similar presentations
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Advertisements

OAI from 50,000 Feet OAI develops and promotes interoperability solutions that aim to facilitate the efficient dissemination of content. Begun in 1999.
A centre of expertise in digital information management The OAI Protocol for Metadata Harvesting Andy Powell UKOLN,
Registry breakout group DC-8, National Library of Canada 5 October 2000.
A centre of expertise in digital information management IMS Digital Repositories Interoperability Andy Powell UKOLN,
Interoperability Aspects in Europeana Antoine Isaac Workshop on Research Metadata in Context 7./8. September 2010, Nijmegen.
Developing a Metadata Exchange Format for Mathematical Literature David Ruddy Project Euclid Cornell University Library DML 2010 Paris 7 July 2010.
1. The Digital Library Challenge The Hybrid Library Today’s information resources collections are “hybrid” Combinations of - paper and digital format.
OAI in DigiTool DigiTool Version 3.0.
Depositing e-material to The National Library of Sweden.
OAI-PMH Dawn Petherick, University Web Services Team Manager, Information Services, University of Birmingham MIDESS Dissemination.
Dspace – Digital Repository Dawn Petherick, University Web Services Team Manager Information Services, University of Birmingham MIDESS Dissemination.
OAI Standards for Sheet Music Meeting March 28-29, 2002 Basic OAI Principals How They Apply to Sheet Music Presenter: Curtis Fornadley, Senior Programmer/Analyst.
OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.
Basic Concepts Architecture Topology Protocols Basic Concepts Open e-Print Archive Open Archive -- generalization of e-print Data Provider and Service.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
A Digital Library Repository Utilizing the Open Archives Initiative Developed to meet the needs of UTK Library Special Collections.
Dienst Distributed Networked Publishing Carl Lagoze Digital Library Scientist Cornell University.
Introduction to the OAI Metadata Harvesting Protocol Hussein Suleman, Digital Library Research Laboratory Virginia Tech.
Metadata Harvesting The Hague, 13 & 14 January 2009 Julie Verleyen Scientific Coordinator, Europeana Office EuropeanaLocal Knowledge Sharing Workshop.
University of Illinois at Urbana-Champaign OAI Alpha Experiences Timothy W. Cole Thomas G. Habing Grainger Engineering.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
Metadata Harvesting Interoperable digital collections.
Open Archives Initiative OAI openarchives.org “Opening Remarks & Historical Overview” - ACM SIGIR’2001 Ed Fox (w. Lagoze.
5-7 November 2014 DR Workflow Practical Digital Content Management from Digital Libraries & Archives Perspective.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
ALCME: OAI at OCLC Jeffrey A. Young OCLC Online Computer Library Center, Inc.
Dec 9-11, 2003ICADL Challenges in Building Federation Services over Harvested Metadata Hesham Anan, Jianfeng Tang, Kurt Maly, Michael Nelson, Mohammad.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
07/11/2002Thomas Baron - JACoW Workshop1 CERN Library Requirements T. Baron CERN ETT-DH-CDS.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Phil Barker, March © Heriot-Watt University. You may reproduce all or any part.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Bitter Harvest Metadata Harvesting Issues, Problems, and Possible Solutions Roy Tennant California Digital Library.
Search Interoperability, OAI, and Metadata Sarah Shreeves University of Illinois at Urbana-Champaign Basics and Beyond Grainger Engineering Library April.
Enforcing Interoperability with the Open Archives Initiative Repository Explorer Hussein Suleman, Digital Library Research.
Metadata and OAI DLESE OAI Workshop April 29-30, 2002 Katy Ginger Presentation available at:
Metadata and OAI DLESE OAI Workshop June 29 to July 2, 2002 Katy Ginger Presentation available at:
The OAI: technical overview OAI Open Meeting – Washington DC – January 23 rd 2001 Herbert Van de Sompel & Carl Lagoze Cornell University -- Computer Science.
Oct 12-14, 2003NSDL Challenges in Building Federation Services over Harvested Metadata Kurt Maly, Michael Nelson, Mohammad Zubair Digital Library.
Open Archives Initiative Protocol for Metadata Harvesting.
OAI Tools By Thomas G. Habing Grainger Engineering Library Information Center University.
Experiences Implementing OAI Provider Services 13 September ACM SIGIR, New Orleans Open Archives: Communities, Interoperability and Services Timothy.
NSDL & the Open Archives Initiative A Brief Introduction to OAI Timothy W. Cole Mathematics Librarian & Professor of Library Administration.
Collection Management Systems
Brian Matthews, euroCRIS, 18/09/03 CRIS architecture to support an ERA Brian Matthews.
Introduction to the OAI Protocol for Metadata Harvesting Version 2.0 Hussein Suleman Virginia Tech DLRL 25 March 2002.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
NDLTD Union Collection User Services Edward A. Fox Virginia Tech DLRL March 2001.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
The NSDL, OAI and Your Metadata Core Infrastructure Metadata Repository (“union catalog”) Naomi Dushay Cornell University.
Do Real Archivists Use OAI? Mid-Atlantic Regional Archives Conference Gettysburg, PA October 31, 2003 Chris Prom Assistant University Archivist University.
OAI metadata: why and how Jenn Riley Metadata Librarian Indiana University.
CONTENTdm A proven solution September A complete digital collection management software solution Stores, manages and provides access for all digital.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
OAI and ODL Building Digital Libraries from Components Hussein Suleman Virginia Tech DLRL 12 September 2002.
The Open Archives Initiative: Perspectives on Metadata Harvesting OAI Provider & Harvesting Services at the University of Illinois Timothy W. Cole Mathematics.
Harvesting and Exporting Metadata 714: Metadata Margaret E.I. Kipp -
Getting a Leg Up on OAI for the NSDL
University of Illinois at Urbana-Champaign OAI Alpha Experiences
Qualified Dublin Core Using RDF for Sci-Tech Journal Articles DC-2001 International Conference on Dublin Core and Metadata Applications, October 22-26,
An Architecture for Complex Objects and their Relationships
OAI and Metadata Harvesting
Health Ingenuity Exchange - HingX
Open Archive Initiative
IVOA Interoperability Meeting - Boston
Presentation transcript:

Thomas G. Habing – University of Illinois at Urbana-Champaign Recap: SIGIR 2001 OAI Workshop 19 September OAI Provider Workshop, University of Illinois at Urbana-Champaign Thomas G. Habing University of Illinois at Urbana-Champaign

Thomas G. Habing – University of Illinois at Urbana-Champaign Overview Eleven attendees (slightly over half of those originally scheduled) Broad interest in OAI from participants: –Only a few of the participants had actual experience implementing OAI, but most had potential OAI projects –Tech reports, NCSTRL, Physics E-Prints, West African Digital Library, National Gallery of Spoken Word, Bibliographies, Personal Archives (Kepler), Thesauri, etc.

Thomas G. Habing – University of Illinois at Urbana-Champaign What exactly is an Open Archive? –How is it related to Digital Libraries? How is it related to traditional paper archives? Is its function preservation or access, both or neither? –Is it metadata only, or can it be full-text? How about non-textual data? Can a thumbnail of an image be considered metadata? Can OAI support non-textual data? The OAI PMH seems to be fairly neutral on these issues. It can support any well structured data, including non-textual data if it is properly encoded and wrapped in XML. –The OAI definitions of terms such as Archive and Record may conflict with other usage. This needs to be made clear in the spec or FAQ.

Thomas G. Habing – University of Illinois at Urbana-Champaign Dublin Core –Is it useful as a least-common-denominator? –Can service providers build useful, value-added systems with only DC metadata? –What about objects for which DC makes little sense, such as people? The consensus seemed to be that DC should continue to be required even when the mappings were forced or contrived, as with people, but that some guidance or best practice for mapping these ‘oddball’ cases should be provided.

Thomas G. Habing – University of Illinois at Urbana-Champaign Access and Authority Control –Which is the authoritative record, especially if brokering or mirror sites are developed? –How can you prove an item existed at a certain time in a certain repository? –Does the protocol need to support its own access controls, or will the HTTP(S) access and user authentication protocols suffice?

Thomas G. Habing – University of Illinois at Urbana-Champaign Rights Management –We need machine readable policies for the section of a record. Enumerated list of values with pre-specified meanings Hyperlinks to external rights management statements or systems

Thomas G. Habing – University of Illinois at Urbana-Champaign Sets –Complex use of sets Used for submitting general queries to a repository. How should ListSets respond in these cases? Could ListSets point to an external list such as PACS? –How to request the number of records per set? –How to lists the sets for a given record? –How to request records not belonging to any set? –The syntax for the setSpec should be expanded to support arbitrary Unicode (not just ASCII) Not “([A-Za-z0-9])+(:[A-Za-z0-9]+)*” But maybe “([^:])+(:[^:]+)*” –Issues with moving records between sets?

Thomas G. Habing – University of Illinois at Urbana-Champaign XML Metadata –How to support multiple namespaces –How to handle schema versioning –What does “oai_dc” mean? Why not just “dc”? –How can some specific metadata fields, and not others, be requested? Maybe by defining different metadata formats –How can single, invalid XML records be handled in the middle of a much larger response, without invalidating the entire response? Possible treat the results as normal text (don’t try to parse as XML) until the entire response is complete. Then try to validate records in a batch. The resumptionTokens could be extracted using common (non-XML) text parsing techniques.

Thomas G. Habing – University of Illinois at Urbana-Champaign Datestamp –May want to add an optional time component to the datestamp to support finer granularity of harvesting, and more dynamic repositories. Consensus is that currently a two-day harvest overlap is required to accommodate the timezone and datestamp granularity issues. –There may be collections for which an OAI Datestamp may not be readily available. Could Datestamp be made optional? Could another date be reasonably substituted, such as today’s date or the creation or publishing date of the object itself, without harming the protocol? Need to educate metadata creators and maintainers that a datestamp for the metadata itself is important.

Thomas G. Habing – University of Illinois at Urbana-Champaign OAI Service Providers –Consensus is that harvesting records for local search is better/simpler than distributed search systems, such as Dienst. –Hybrid systems are possible, combining local search with distributed search. –Distributed architectures: OAI metadata brokering or mirror sites may be used to improve the performance of the overall system Deduping, handling duplicate records may become an issue –Workflow Systems –Citation Linking

Thomas G. Habing – University of Illinois at Urbana-Champaign Communities –Community-based OAI registries for providers. –Need best practice guidelines for utilizing OAI for different communities, such as for traditional archives (EAD), museums, publishers, etc. –Need tools to make support of OAI easy. –Community-based OAI working groups, possibly affiliated with the DublinCore.org, might be useful. –Develop application profiles for different communities: best practice guidelines, custom XML metadata schemas, RDF schemas, standard thesauri, XSLT stylesheets for transforming metadata, etc. - all collected together in one place.

Thomas G. Habing – University of Illinois at Urbana-Champaign Internationalization, multilingual metadata Best practices for handling deleted records Should protocol support an explicit expiration mechanism for resumptionTokens Errors –Does the protocol need standard or suggested ‘reason’ phrases to use for the different HTTP 400 errors? Possibly. –Should error handling be done in the XML body of the HTTP response, instead of via the HTTP status code? There was some sentiment that it should. –Should an effort be made to de-couple OAI from the HTTP protocol? Should SOAP be explored as a wrapper for OAI?