OCLC Online Computer Library Center Harvesting and Resolution Methods for Building OAI-based Services Jeffrey A. Young

Presentation transcript:

OCLC Online Computer Library Center
Harvesting and Resolution Methods for Building OAI-based Services
Jeffrey A. Young
CERN OAI3 Workshop #4
Geneva, Switzerland
14 February 2004

Introductions
Name
Affiliation
Plans
Needs
Technical experience

Review OAI-PMH Protocol
Identify
ListSets
ListMetadataFormats
ListRecords
ListIdentifiers
GetRecord
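
Each of the six verbs is sent as a `verb` argument on an ordinary HTTP request. The sketch below only builds request URLs; the base URL is a hypothetical placeholder, and the helper name `build_request` is ours, not part of any OAI toolkit.

```python
from urllib.parse import urlencode

# Hypothetical base URL; substitute the repository you actually harvest.
BASE_URL = "http://example.org/oai"

# The six OAI-PMH verbs listed on the slide.
VERBS = ("Identify", "ListSets", "ListMetadataFormats",
         "ListRecords", "ListIdentifiers", "GetRecord")


def build_request(verb, **args):
    """Return an OAI-PMH request URL for one of the six protocol verbs."""
    if verb not in VERBS:
        raise ValueError(f"unknown OAI-PMH verb: {verb}")
    params = {"verb": verb}
    for key, value in args.items():
        # 'from' is a Python keyword, so callers pass 'from_' instead.
        params[key.rstrip("_")] = value
    return BASE_URL + "?" + urlencode(params)


print(build_request("Identify"))
print(build_request("ListRecords", metadataPrefix="oai_dc", from_="2004-01-01"))
```

The sketch does not check which arguments each verb requires; a real harvester would also need to handle the error responses the protocol defines.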

Find Repositories to Harvest
BrowseSites.pl
bin/Explorer/oai2.0/testoai
Friends lists
Communities (e.g.

Exercise: Getting Started
What are your data sources?
How will you add value?
Who will design the system?
Who will create/operate the software?
Who will create/maintain the data?
Who will advocate for it politically?
Who will benefit?
Who will pay?

Metadata
Metadata is data about data
Metadata formats: two extremes
  – Dublin Core
  – MARC
Metadata can be relative
  – Who created this document?
  – Who created the metadata about this document?
Keep in mind, though, that OAI works just as well for sharing XML content

XML/DTD/XSD/XSL
XML - eXtensible Markup Language
DTD - Document Type Definition
XSD - XML Schema Definition
XSL - eXtensible Stylesheet Language

eXtensible Markup Language
Meta-markup language
HTML – Hypertext markup language
XHTML – eXtensible hypertext markup language

XML Overview
Well-formed XML
XML Namespaces
Valid XML
  – DTDs
  – XML Schemas
OAI Items vs. Records
  – Item identifiers
  – Multiple metadata record representations

XML Namespaces
Ambiguous XML Elements
  – NNE
  – Clockwise
Prefixes help identify and differentiate elements
  – SE
  – Widdershins
But prefixes are arbitrary and potentially ambiguous, so what we really need is a URI (i.e., prefixes are a local shorthand for the URI)
  – NW
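
The markup on this slide did not survive in the transcript, so the sketch below uses made-up element names and namespace URIs to illustrate the same point: a namespace-aware parser identifies an element by its URI plus local name, and the prefix is only a local shorthand.

```python
import xml.etree.ElementTree as ET

# Hypothetical documents: the element name 'direction' and both URIs are
# invented for illustration; only the values NNE and Clockwise come from the slide.

# Different prefixes bound to the same URI.
DOC_A = '<c:direction xmlns:c="http://example.org/compass">NNE</c:direction>'
DOC_B = '<nav:direction xmlns:nav="http://example.org/compass">NNE</nav:direction>'
# Same local name, but bound to a different URI.
DOC_C = '<r:direction xmlns:r="http://example.org/rotation">Clockwise</r:direction>'

# ElementTree expands each tag to "{namespace-uri}local-name".
tags = [ET.fromstring(doc).tag for doc in (DOC_A, DOC_B, DOC_C)]

for tag in tags:
    print(tag)

print(tags[0] == tags[1])  # prefixes differ, URI matches
print(tags[0] == tags[2])  # local names match, URI differs
```

The first two documents yield the same expanded name even though their prefixes differ, while the third is a different element despite sharing the local name.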

XML Schema Definition
Defines what an XML document contains
  – XHTML
  – oai_dc
  – MARC21 XML

What is our “item”?
Work – a distinct intellectual or artistic creation
  – J.S. Bach’s The art of the fugue
Expression – the intellectual or artistic realization of a work
  – The composer’s score for organ
  – An arrangement for chamber orchestra by Anthony Lewis
Manifestation – the physical embodiment of an expression of a work
  – CD, printed score, multimedia kit, etc.
Item – a single exemplar of a manifestation

Exercise: Data Definition
Design a metadata format for items in your project
  – List the elements you need
  – Consider the encoding rules
  – Consider using controlled vocabularies
Assign an XML namespace
Map a crosswalk to Dublin Core
Create a sample item with both formats
  – Consider assigning OAI sets
Report issues, problems, and concerns
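
As one possible starting point for the crosswalk step, here is a minimal sketch that maps a local record to an oai_dc element. The local field names (`composer`, `workTitle`, and so on) and the `CROSSWALK` table are hypothetical; only the two namespace URIs are the standard oai_dc and Dublin Core ones.

```python
import xml.etree.ElementTree as ET

OAI_DC_NS = "http://www.openarchives.org/OAI/2.0/oai_dc/"
DC_NS = "http://purl.org/dc/elements/1.1/"

# Hypothetical local element names mapped to unqualified Dublin Core elements.
CROSSWALK = {
    "composer": "creator",
    "workTitle": "title",
    "instrument": "subject",
    "published": "date",
}


def to_oai_dc(item):
    """Build an oai_dc:dc element from a dict of local field names to values."""
    ET.register_namespace("oai_dc", OAI_DC_NS)
    ET.register_namespace("dc", DC_NS)
    root = ET.Element(f"{{{OAI_DC_NS}}}dc")
    for local_name, value in item.items():
        dc_name = CROSSWALK.get(local_name)
        if dc_name is None:
            continue  # no Dublin Core equivalent in this crosswalk; dropped
        child = ET.SubElement(root, f"{{{DC_NS}}}{dc_name}")
        child.text = value
    return root


sample = {"composer": "J.S. Bach", "workTitle": "The art of the fugue",
          "shelfMark": "X1"}
print(ET.tostring(to_oai_dc(sample), encoding="unicode"))
```

Fields without a mapping are silently dropped here, which is exactly the kind of information loss worth reporting under "issues, problems, and concerns".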

Exercise: A Simple Harvester
XOAIHarvester – a simple harvester written in XSLT
c.org:xoai/xoaiharvester.xsl
The purpose of the Perl script is to manage incremental harvesting
Caveat! OAI is merely the first step. Once data is harvested, OAI provides absolutely no guidance for doing something useful with it.
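
The bookkeeping behind incremental harvesting can be sketched in a few lines. This is not the XOAIHarvester or its Perl script; it is an illustrative Python sketch, with a hand-written sample response, showing the two things a harvester has to remember: the `resumptionToken` for the next page, and the newest datestamp to use as the `from` argument on the next run.

```python
import xml.etree.ElementTree as ET

OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}

# Hand-written sample response for illustration; identifiers are hypothetical.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record><header><identifier>oai:example.org:1</identifier>
      <datestamp>2004-01-10</datestamp></header></record>
    <record><header><identifier>oai:example.org:2</identifier>
      <datestamp>2004-02-01</datestamp></header></record>
    <resumptionToken>abc123</resumptionToken>
  </ListRecords>
</OAI-PMH>"""


def summarize_response(xml_text):
    """Return (datestamps, resumption_token) from a ListRecords response."""
    root = ET.fromstring(xml_text)
    datestamps = [el.text for el in
                  root.findall(".//oai:header/oai:datestamp", OAI_NS)]
    token_el = root.find(".//oai:resumptionToken", OAI_NS)
    # An empty resumptionToken element marks the last page of a list.
    token = token_el.text if token_el is not None and token_el.text else None
    return datestamps, token


def next_from(previous_from, datestamps):
    """Pick the newest datestamp seen as the 'from' value for the next run.

    String comparison is enough only when all datestamps share one granularity.
    """
    candidates = [d for d in datestamps if d]
    if previous_from:
        candidates.append(previous_from)
    return max(candidates) if candidates else None


stamps, token = summarize_response(SAMPLE)
print(stamps, token, next_from("2004-01-01", stamps))
```

A real harvester would keep requesting pages while a token is returned, and persist the `from` value only after a run completes successfully.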

Concerns
Data quality
Duplicates
Intellectual Property Rights (IPR)
The appropriate copy problem
Persistence

Repository Variables
MetadataPrefix
  – oai_dc – the lowest common denominator
Set
  – Hierarchical
  – Allows selective harvesting
  – Works best with community agreement
  – Client warrant
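
In OAI-PMH 2.0 the set hierarchy is expressed in the `setSpec` itself, as a colon-separated path from the root of the hierarchy. The sketch below shows how a client might test whether a record's set falls under a requested set; the set names are invented for illustration.

```python
def in_set(record_set_spec, requested_set_spec):
    """Return True if the record's setSpec is the requested set or a descendant.

    A setSpec such as 'music:scores:organ' is a path; compare whole path
    segments so that 'musicology' is not mistaken for a child of 'music'.
    """
    record_path = record_set_spec.split(":")
    requested_path = requested_set_spec.split(":")
    return record_path[:len(requested_path)] == requested_path


# Hypothetical set names.
print(in_set("music:scores:organ", "music"))
print(in_set("musicology", "music"))
```

Comparing path segments rather than string prefixes is the design choice that matters here; a plain `startswith` check would give false matches.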

Exercise: Select/Create Tools
/list_software.php
s.html
n/software/utf8conditioner/
oldenburg.de/dc/index.html

An Alternative Service Model
ERRoLs are URLs to content and services related to repositories in the OAI Registry at UIUC

Discussion
Issues, Problems, Concerns?

Music Services
Organizational issues
Cultural issues
Collection policies
Best practices
Consensus-building
Controlled vocabularies
  – Do items represent digital and/or physical entities?
Authority control

Repository Descriptors
Repository-level “description” elements
  – oai-identifier description – identifier layout
  – eprints description – content & policies
  – friends description – discover repositories
  – branding description – branding information
  – olac-archive description – archive info
Record-level “about” elements
  – Rights statements
  – Provenance statements
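
As an illustration of the identifier layout that the oai-identifier description advertises, here is a small sketch that splits an identifier of the form `oai:namespace-identifier:local-identifier`. The function name and the example identifier are hypothetical, and the sketch does not validate the allowed character sets.

```python
def parse_oai_identifier(identifier):
    """Split 'oai:<namespace-identifier>:<local-identifier>' into its parts.

    Only the first two colons are treated as separators, since the local
    identifier may itself contain colons.
    """
    parts = identifier.split(":", 2)
    if len(parts) != 3 or parts[0] != "oai" or not parts[1] or not parts[2]:
        raise ValueError(f"not an oai-identifier: {identifier!r}")
    return parts[1], parts[2]


# Hypothetical identifier.
print(parse_oai_identifier("oai:example.org:item:42"))
```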

XSLT Overview
[Diagram labels: XSLT Stylesheet, XSLT Processor, XML Document, XML Document]

Validate Repositories
sterasprovider.html
bin/Explorer/oai2.0/testoai
sv

Example Service Providers
ARC - A Cross Archive Search Service (experimental research service)
Dokumenten- und Publikationsserver der Humboldt-Universität zu Berlin (search service, German language user interface)
iCite (citation index)
NCSTRL: Networked Computer Science Technical Reference Library (search engine)
my.OAI (value-added search interface to a selected list of metadata databases)
Physnet (simple search interface to an experimental OAI harvester) oldenburg.de/oai/query.php
ProPrint (printing-on-demand service, German and English language user interfaces offered)

Resources

Everything you need to know d23_technical2.pdf