Caltech CODA CODA: Collection of Digital Archives Caltech Scholarly Communication.

Slides:



Advertisements
Similar presentations
28 April 2004Second Nordic Conference on Scholarly Communication 1 Citation Analysis for the Free, Online Literature Tim Brody Intelligence, Agents, Multimedia.
Advertisements

A busy persons introduction to OAI-PMH Christopher Gutteridge ALT, April 2003.
Gary Holton ANLC LSA Symposium: The Open Language Archives Community 4 January 2002 Creating an OLAC data provider at the Alaska Native Language Center.
Y.T. a brief history of the OAI 0 Kaynak: Herbert van de Sompel.
Lawrence Webley, Hussein Suleman, Tatenda Chipeperekwa University of Cape Town Department of Computer.
OAI in DigiTool DigiTool Version 3.0.
Harvesting Metadata Using OAI-PMH Roy Tennant California Digital Library.
OAI-PMH Dawn Petherick, University Web Services Team Manager, Information Services, University of Birmingham MIDESS Dissemination.
ETD’s at the University of Saskatchewan or… David Fox & Darryl Friesen University of Saskatchewan October 4, 2003.
OAI Standards for Sheet Music Meeting March 28-29, 2002 Basic OAI Principals How They Apply to Sheet Music Presenter: Curtis Fornadley, Senior Programmer/Analyst.
1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links.
The Open Archives Initiative Simeon Warner Cornell University, Ithaca, NY, USA CREPUQ 2002, Montréal, Canada 14:00, 24 October 2002.
Basic Concepts Architecture Topology Protocols Basic Concepts Open e-Print Archive Open Archive -- generalization of e-print Data Provider and Service.
NAL-Institutional Repository: A Case Study CSIR Metadata Harvester I.R.N. Goudar Head, ICAST, NAL National Symposium on Open Access and.
Introduction to the OAI Metadata Harvesting Protocol Hussein Suleman, Digital Library Research Laboratory Virginia Tech.
Metadata Harvesting The Hague, 13 & 14 January 2009 Julie Verleyen Scientific Coordinator, Europeana Office EuropeanaLocal Knowledge Sharing Workshop.
Metadata Repositories for Interoperable/Shareable Metadata.
Herbert van de sompel Workshop on OAI and peer review journals in Europe Geneva, Switserland – March 22nd to 24th 2001 Herbert Van de Sompel Cornell University.
Electronic Theses at Rhodes University presented by Irene Vermaak Rhodes University Library National ETD Project CHELSA Stakeholder Workshop 5 November.
OAI-PMH The Open Archives Initiative Protocol for Metadata Harvesting Presenter: Knud Möller Friday,
IESR Interfaces: Current Services and Future Plans Ann Apps MIMAS, The University of Manchester, UK.
1 OAI-PMH harvester for agricultural knowledge gathering (Development, testing and implementation) Francesco Castellani and Stefka Kaloyanova 4 February.
07/11/2002Thomas Baron - JACoW Workshop1 CERN Library Requirements T. Baron CERN ETT-DH-CDS.
SCIELO AS AN OPEN ARCHIVE: the development of SciELO / OpenArchives data provider interface Prof. Carlos H. Marcondes Federal Fluminense University/ Information.
A centre of expertise in digital information management RDN, e-Prints UK and NOF- Digitise: a (very) small sample of UK OAI activity Andy.
The OAI: overview and historical context OAI Open Meeting – Washington DC – January 23 rd 2001 Herbert Van de Sompel & Carl Lagoze Cornell University --
Introduction to Web Services Eric Lease Morgan University Libraries of Notre Dame June 24, 2005.
Research Library, Los Alamos National Laboratory RESEARCH OAI4 - Geneva, Switzerland Digital Library Research & Prototyping Team Multi-Graph.
OAI-PMH: Open Archives Initiative Protocol for Metadata Harvesting T.B. Rajashekar National Centre for Science Information (NCSI) Indian Institute of Science,
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
Digital Commons & Open Access Repositories Johanna Bristow, Strategic Marketing Manager APBSLG Libraries: September 2006.
Metadata harvesting in regional digital libraries in PIONIER Network Cezary Mazurek, Maciej Stroiński, Marcin Werla, Jan Węglarz.
Kurt Maly Department of Computer Science Old Dominion University Norfolk, Virginia 23529, USA Digital Libraries, OAI and Free Software.
IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Phil Barker, March © Heriot-Watt University. You may reproduce all or any part.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
Bitter Harvest Metadata Harvesting Issues, Problems, and Possible Solutions Roy Tennant California Digital Library.
Search Interoperability, OAI, and Metadata Sarah Shreeves University of Illinois at Urbana-Champaign Basics and Beyond Grainger Engineering Library April.
Digital Collections: Making it Happen Hema Ramachandran Ed Sponsler Jim O’Donnell, Caltech Library System SCELC, September , Caltech.
The OAI: technical overview OAI Open Meeting – Washington DC – January 23 rd 2001 Herbert Van de Sompel & Carl Lagoze Cornell University -- Computer Science.
The Open Archives Initiative Marshall Breeding Director for Innovative Technologies and Research Vanderbilt University
Open Archives Initiative Protocol for Metadata Harvesting.
OAI Tools By Thomas G. Habing Grainger Engineering Library Information Center University.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Open Archive Forum Rachel Heery UKOLN, University of Bath UKOLN is funded by Resource: The Council for Museums, Archives.
Standards OAI-Protocol Metadata: DC - Agris - MODS Marc Goovaerts Hasselt University Library ODIN-PI TRAINING OSTENDE, May 2008.
Distributed Service Registry Workshop, Warwick, U.K. 1 Distributed Functionality in the UIUC OAI Registry
NSDL STEM Exchange: Technical Overview and Implications for Active Dissemination of Federally Funded Resources Across Implementation Systems.
Designing Protocols in Support of Digital Library Componentization Hussein Suleman and Edward A. Fox Digital Library Research Laboratory Virginia Tech.
2/22/2016J Ammerman1 Open Archives Initiative What is it? What’s it good for?
NSDL & the Open Archives Initiative A Brief Introduction to OAI Timothy W. Cole Mathematics Librarian & Professor of Library Administration.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
Open Access Tools for Scholars Scholarly Communication Retreat Wednesday December 12, 2007 Presented by Marcia Salmon.
The NSDL, OAI and Your Metadata Core Infrastructure Metadata Repository (“union catalog”) Naomi Dushay Cornell University.
NDLTD Toward Universal Accessibility of ETDs: Building the NDLTD Union Archive Hussein Suleman, Edward A. Fox,
A centre of expertise in digital information management 10 minute practical guide to the JISC Information Environment (for publishers!)
Networked Information Resources Federated search, link server, e-books.
OAI and ODL Building Digital Libraries from Components Hussein Suleman Virginia Tech DLRL 12 September 2002.
Harvesting and Exporting Metadata 714: Metadata Margaret E.I. Kipp -
Repository Software Marc Goovaerts, Hasselt University Library
OAI and Metadata Harvesting
Digitometric Services for Open Archives Environments
OAI 11/20/07.
Open Archive Initiative
OpenDOAR and ROAR RSP Services Day, Nottingham, 23rd Apr.2008
Presentation transcript:

Caltech CODA CODA: Collection of Digital Archives Caltech Scholarly Communication 15 Production Archives 3102 Records Theses, technical reports, conference proceedings, oral histories, refereed articles

We Want Federation Search all archives at once (federated search) Browse all authors, and all records from a given author, in one place (electronic CV)

OAI-PMH Can Help Open Archives Initiative – Protocol for Metadata Harvesting Two Tier Model –Data Providers –Service Providers Service Providers harvest metadata from Data Providers via the OAI Protocol

Data Providers Expose Metadata All records must be described by a minimal set of metadata: –Author –Title –Abstract –Submission date –URL to Record –Unique Identifier

Service Providers Metadata is routinely harvested and stored in a central database The central database is the foundation for federated services DP9, Celestial, Google Scholar

Federation using OAI A collection of records must be described with a common, minimal set of metadata Data Provider tools expose the metdata over http using the OAI-PMH Service Providers use OAI-PMH to harvest Data Providers, index the content and produce a new service (such as searching, or act as a Data Provider themselves)

Data Provider Requirements Expose metadata by responding to simple commands. Respond using xml over http. –Identify –GetRecord –ListIdentifiers –ListMetadataFormats –ListRecords –ListSets

OAI Repository Explorer Helps evaluate and validate a Data Provider implementation Provide an OAI Base URL and send it queries. Example Base URL: /perl/oai2 /perl/oai2

Data Provider Tools ools.htmlhttp:// ools.html Currently 26 tools freely available to help implement OAI Most implementation burden placed on Service Providers, not Data Providers

Eprints at Caltech Eprints.org is a scholarly communication archiving software package It is also an OAI Data Provider All Caltech CODA archives are Data Providers Most run on eprints.org; Theses runs on VT ETDdb

The Problem Each Service Provider must harvest each of our 15 archives individually This discourages participation It is unnecessary, provided we can build a local Service Provider (union catalog of all of CODA)

The Solution Design Caltech CODA Union Catalog Locally harvest each archive into a central database using OAI-PMH Implement this database as an OAI Data Provider Instruct all outside harvesters to use this one Data Provider rather than the 15 individually

EPrints.org as SP Build a harvesting routine to feed metadata into another instance of eprints.org using OAI-PMH Eprints.org does the rest –browse screens –search interface –Data Provider

End Result The Caltech Union Catalog will contain all 3100 CODA records in one database The metadata describing the records will be only the oai_dc subset (author, title, abstract, unique id, URL to target) Each record in union catalog will contain a link back to the full record in the harvested archive

End Result There will be one place for all harvesters to obtain Caltech records, instead of 15 Use eprints to provide the local federated search interface across all our archives Author browse pages (like a CV) Centralized RSS (eprints.org supports this) Centralized access statistics

Challenges Centralized Browse by Author requires author name identifier (authority) Implement OAI harvester to feed the Union Catalog (based on eprints.org) Customize eprints.org to import records provided by this harvester

Summary Using OAI-PMH for federated searching requires three steps: –Define a minimal metadata set for all records –Wrap a Data Provider service around each collection of records to expose metadata –Harvest metadata centrally, then produce a service (such as search and browse) Skip step three if you’re satisfied with existing OAI Service Providers (DP9, Google, Celestial, etc.)