The Open Archives Initiative Protocol for Metadata Harvesting and the IMLS Digital Collections & Content Project at the University of Illinois Timothy.

Slides:



Advertisements
Similar presentations
Adding OAI-ORE Support to Repository Platforms Alexey Maslov, Adam Mikeal, Scott Phillips, John Leggett, Mark McFarland Texas Digital Library TCDL09.
Advertisements

2008 EPA and Partners Metadata Training Program: 2008 CAP Project Geospatial Metadata: Intermediate Course Module 3: Metadata Catalogs and Geospatial One.
OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
OAI from 50,000 Feet OAI develops and promotes interoperability solutions that aim to facilitate the efficient dissemination of content. Begun in 1999.
A centre of expertise in digital information management The OAI Protocol for Metadata Harvesting Andy Powell UKOLN,
Digital libraries and culture portals Rossella Caffo - MiBAC Coordinator of the MICHAEL and MINERVA eC projects.
28 April 2004Second Nordic Conference on Scholarly Communication 1 Citation Analysis for the Free, Online Literature Tim Brody Intelligence, Agents, Multimedia.
A brief overview of the Open Archives Initiative Steve Hitchcock Open Citation Project (OpCit) Southampton University Prepared for Z39.50/OAI/OpenURL plenary.
IMLS NLG Collection Registry & Item-Level Metadata Repository at the University of Illinois Timothy W. Cole Mathematics Librarian &
National Diet Library Digital Archive Portal - PORTA - Gateway to digital information in Japan April 3, 2008 Hideki Takeuchi Planning.
OLAC Process and OLAC Protocol: A Guided Tour Gary F. Simons SIL International ___________________________ OLAC Workshop 10 Dec 2002, Philadelphia.
Programs and Research Public Private Agreements for Mass Digitisation Ricky Erway JISC Digitisation Conference July 2007.
DRIVER Building a worldwide scientific data repository infrastructure in support of scholarly communication 1 JISC/CNI Conference, Belfast, July.
DRIVER Long Term Preservation for Enhanced Publications in the DRIVER Infrastructure 1 WePreserve Workshop, October 2008 Dale Peters, Scientific Technical.
Treasury Board of Canada Secretariat Secrétariat du Conseil du Trésor du Canada IM Standards for E-government The Canadian Experience Managing Information.
A centre of expertise in digital information management IMS Digital Repositories Interoperability Andy Powell UKOLN,
Open Scholarship 2006 Bielefeld Academic Search Engine a Scientific Search Service for Institutional Repositories Open Scholarship 2006 New Challenges.
Pete Johnston UKOLN, University of Bath Bath, BA2 7AY
Collection-level description & collection management: tool for the trade or information trade-off? Collection Description Focus Workshop 4 Newcastle, 8.
Optimising metadata workflows in a distributed information environment R. John Robertson & Jane Barton Centre for Digital Library Research University of.
Distributed Service Registries Workshop, July 2005 Slide 1 NISO Metasearch Initiative Registries Robert Sanderson Dept. of Computer Science University.
An overview of collection-level metadata Applications of Metadata BCS Electronic Publishing Specialist Group, Ismaili Centre, London, 29 May 2002 Pete.
Pure Silver Reusing and Repurposing Bibliographic Data in a Current Research Information System and Institutional Repository 15 September.
OAI and Publishers metadata Using the static repositories approach to disclose small journals.
Can We Talk? MICHAEL Conference London May 23, 2008Joyce Ray.
Collection-level description & the Information Landscape: users evaluate strategies for resource discovery Collection Description Focus Workshop 5 Cambridge,
The Discovery Landscape in Crystallography UKOLN is supported by: Monica Duke UKOLN, University of Bath, UK – eBank UK project A centre.
A centre of expertise in data curation and preservation DigCCur2007 Symposium, Chapel Hill, N.C., April 18-20, 2007 Co-operation for digital preservation.
Preserving and Sharing Digital Data Greg Colati, Director, Archives and Special Collections May 11, 2012.
Findings from the Mellon Metadata Harvesting Initiative Martin Halbert, Joanne Kaczmarek, and Kat Hagedorn Monday 18-Aug-2003 ECDL 2003.
Configuration management
Collections and services in the information environment JISC Collection/Service Description Workshop, London, 11 July 2002 Pete Johnston UKOLN, University.
Collection-level description in practice Collection-Level Description & NOF-digitise projects NOF-digitise programme seminar, London, 22 February 2002.
An introduction to collections and collection-level description Collection-Level Description & NOF-digitise projects NOF-digitise programme seminar, London,
Information Professionals and Learning Object Repositories … more than just metadata quality … Sarah Currier Stòr Cùram Project Librarian JISC X4L Repository.
UCLA Digital Library UC Digital Library Forum August 5, 2002 UCLA Digital Library Presenter: Curtis Fornadley Senior Programmer/Analyst.
OAI Standards for Sheet Music Meeting March 28-29, 2002 Basic OAI Principals How They Apply to Sheet Music Presenter: Curtis Fornadley, Senior Programmer/Analyst.
IMLS NLG Collection Registry & Item-Level Metadata Repository at the University of Illinois Timothy W. Cole Mathematics Librarian &
Metadata Repositories for Interoperable/Shareable Metadata.
The Basics of OAI An Introduction to the Protocol for Metadata Harvesting Sarah Shreeves University of Illinois at Urbana-Champaign Basics and Beyond July.
Using OAI-PMH to Aggregate Metadata Describing Cultural Heritage Resources Timothy W. Cole University of Illinois at Urbana-Champaign.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
CONTENT DISCOVERY, SERVICES, AND SUSTAINED ACCESS Timothy Cole, William Mischo, Beth Sandore, Sarah Shreeves ~ University of Illinois Library
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Phil Barker, March © Heriot-Watt University. You may reproduce all or any part.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
From small beginnings: Developing collection level description Mapping the Information Landscape Showcase day British Library Conference Centre, London,25.
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
Search Interoperability, OAI, and Metadata Sarah Shreeves University of Illinois at Urbana-Champaign Basics and Beyond Grainger Engineering Library April.
Digitization – Basics and Beyond workshop Interoperability of cultural and academic resources New services for digitized collections Muriel Foulonneau.
IMLS DCC Project Briefing ( ) Jenny Benevento ( ) Timothy W. Cole.
The Open Archives Initiative Marshall Breeding Director for Innovative Technologies and Research Vanderbilt University
Digitization – Basics and Beyond workshop Interoperability of cultural and academic resources New services for digitized collections Muriel Foulonneau.
Search Interoperability, OAI, and Metadata An Introduction to the OAI Protocol for Metadata Harvesting Sarah Shreeves University of Illinois at Urbana-Champaign.
2/22/2016J Ammerman1 Open Archives Initiative What is it? What’s it good for?
NSDL & the Open Archives Initiative A Brief Introduction to OAI Timothy W. Cole Mathematics Librarian & Professor of Library Administration.
DLF Fall Forum The Distributed Library: OAI for Digital Library Aggregation UIUC’s Role: Registry of OAI Data Providers
OAI metadata: why and how Jenn Riley Metadata Librarian Indiana University.
Timothy W. Cole Jenny Benevento Muriel Foulonneau
Utility of an OAI Service Provider Search Portal
OAI Protocol for Metadata Harvesting & Its Usefulness to STM Publishers Timothy W. Cole Mathematics Librarian & Professor of Library.
IMLS NLG Collection Registry & Item-Level Metadata Repository at the University of Illinois Timothy W. Cole Mathematics Librarian.
Outline Pursue Interoperability: Digital Libraries
Integrating Access for Information Discovery and More
IVOA Interoperability Meeting - Boston
OAI & NSDL Research at Grainger Briefing to UIUC Library Faculty 15 April 2003 Timothy W. Cole William H. Mischo
Integrated Access and Shareable Metadata
Presentation transcript:

The Open Archives Initiative Protocol for Metadata Harvesting and the IMLS Digital Collections & Content Project at the University of Illinois Timothy W. Cole Mathematics Librarian & Professor of Library Administration University of Illinois at Urbana-Champaign Friday 12 November 2004 MCN 2004, Minneapolis, MN

2 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC The Digital Information Landscape The information landscape can be seen as a contour map in which there are mountains, hillocks, valleys, plains and plateaus…. A specialized collection of particular importance is like a sharp peak. Upon a plateau there might be undulations representing strengths and weaknesses…. The landscape is, however, multidimensional. Where one scholar may see a peak another may see a trough. The task is to devise mapping conventions which enable scholars to read the map of the landscape fruitfully, at the appropriate level of generality or specificity. Michael Heaney (2000), An Analytical Model of Collections and their Catalogues.An Analytical Model of Collections and their Catalogues

3 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC Users & Uses of Digital Libraries From Bibusages study (French National Library): Digital Libraries are used in conjunction with Web search engines, generalist portals, commercial sites Mix of intensive & casual users DL users skew somewhat older, higher degree level than average French Internet user population DL users seeking answer for specific information need; most time spent discovering, viewing, & downloading documents Digital Libraries … are now attracting a new type of public, bringing about new, unique and original ways for reading and understanding texts. Houssem Assadi, et al. Users & Uses of Online Digital Libraries in France, ECDL 2003

4 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC Managing Digital Collections & Content How do mandates translate & change in digital world? Content & collections as virtual information landscapes New users, uses, & metrics Increased emphasis on interoperability & sharing New models for sharing & resource discovery Harvesting – e.g., OAI-PMH Federated searching – e.g., Z39.50 / ZNG, DiGIR,... New Emphasis on Shareable metadata Reconciling different descriptive metadata practices New metrics for metadata quality (for interoperability)

5 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC IMLS Digital Library Forum (2001) Framework of Guidance for Building Good Digital Collections Stresses reusability, persistence, interoperability, verification, and documentation of digital collections & content Accompanying report included recommendations encouraging: Creation of an IMLS Collection Registry Implementation of the Open Archives Initiative Protocol for Metadata Harvesting by IMLS projects creating digital content Development of infrastructure to facilitate interoperability between IMLS projects and initiatives like NSDL

6 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC IMLS DCC Project Overview Collection description & prototype registry for IMLS National Leadership Grant projects with associated digital content Enhance discoverability of collections & content Provide alternative view of one output of IMLS NLG program Prototype item level metadata repository via OAI-PMH Demonstrate potential of metadata for interoperability Serve as testbed for IMLS projects interested in OAI-PMH Facilitate reuse of information resources paid for by IMLS Research question: How can resource developers best represent collections and items to meet the needs of service providers and end users?

7 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC IMLS Grantees – A Diverse Community Mix of library, museum, and archive traditions Wide variation in technical skills, technology infrastructure & information management policy Diverse perspectives on intellectual property; use and presentation of metadata & primary resources Diverse embedded knowledge structures Results in wide variability in: Metadata formats Content resource types Controlled vocabularies Descriptive metadata practices

Broad Categories of Institutions Represented in Collection Registry

Detailed Institution Types Represented in Collection Registry

Broad Categories of Institutions Represented in Metadata Repository

Detailed Institution Types Represented in Metadata Repository

Metadata Formats

Types of Resources

14 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC Controlled Vocabularies

15 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC Descriptive Practice Different traditions regarding Inclusion of interpretive information Granularity of description Presentation of information resources Shared problems / issues How to provide context & collection description What exactly to describe Which metadata scheme(s) to use

16 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC Illustration – Coverlets (1 of 2) Description: Digital image of a single-sized cotton coverlet for a bed with embroidered butterfly design. Handmade by Anna F. Ginsberg Hayutin. Source: Materials: cotton and embroidery floss. Dimensions: 71 in. x 86 in. Markings: top right hand corner has 1 1/2 in. x 1/2 in. label cut outs at upper left and right hand side for head board; fabric is woven in a variation of a rib weave; color each of yellow and gray; hand-embroidered cotton butterflies and flowers from two shades of each color of embroidery floss - blue, pink, green and purple and single top 20 in. bordered with blue and black cotton embroidery thread; stitches used for embroidery: running stitch, chain stitch, French knot and back stitches; selvage edges left unfinished; lower edges turned under and finished with large gray running stitches made with embroidery floss. Format: Epson Expression 836 XL Scanner with Adobe Photoshop version 5.5; 300 dpi; 21-53K bytes. Available via the World Wide Web. Coverage: Date Created: :45:18; Updated: ; Created: ; Created: ? Type: Image

17 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC Illustration – Coverlets (2 of 2) Description: Materials: Textile--Multi, PigmentDye; Manufacturing Process: Weaving-- Hand, Spinning, Dyeing, Hand-loomed blue wool and white linen coverlet, worked in overshot weave in plain geometric variant of a checkerboard pattern. Coverlet is constructed from finely spun, indigo-dyed wool and undyed linen, woven with considerable skill. Although the pattern is simpler, the overall craftsmanship is higher than A. - D. Schrishuhn, 11/19/99 This coverlet is an example of early "overshot" weaving construction, probably dating to the 1820's and is not attributable to any particular weaver. -- Georgette Meredith, 10/9/1973 Source: Format: 228 x 169 x 1.2 cm (1,629 g) Coverage: Euro-American; America, North; United States; Indiana? Illinois? Date: Early 19th c. CE Type: cultural; physical object; original

18 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC OAI Protocol for Metadata Harvesting Harvesting approach to interoperability at metadata level Divides world into Metadata Providers & Service Providers Builds on HTTP, XML, & Community Metadata Standards

Metadata Harvesting Model

20 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC How OAI-PMH Works OAI VERBS Identify ListMetadataFormats ListSets ListIdentifiers ListRecords GetRecord

21 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC Why OAI-PMH for IMLS DCC Project Offers low technical barrier options; primary cost is metadata e.g., OAI-PMH itself, OAI Static Repository, mod_oai Is a cross-domain, non-proprietary approach to interoperability Already used by NSDL, OAIster, etc. Seen as a way to bring content to attention of wider audience 37% of visits to State Library of New South Wales image collection via PictureAustralia (a OAI-PMH based portal) Facilitates metadata & metadata services research What makes for good shareable metadata? Contrast & compare metadata designs & workflows Explore normalization, enhancement, aggregated searching issues

22 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC OAI-PMH Issues Harvesting vs. federated Harvested metadata aggregation always out of date, but Federated real-time performance dependent on weakest link Sorting, ranking, & de-dupping easier with harvesting model Potential scale issues Largest OAI-PMH provider serves 4 million records Largest OAI-PMH service provider < 10 million records Integration into existing metadata workflow requires some investment – cost-to-benefit ratio still unclear Practical metadata sharing issues: Persistent identifiers, date stamps, proper application of protocol Metadata quality, consistency, context, cross-walking,...

Federated Searching Model

24 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC Alternative Approaches for Interoperability Federated search models Library: NISO Z39.50 Specimen / Natural History: DiGIR More homogeneous metadata schemes, query rules Collaborative, sometimes proprietary project portals RLG Cultural Materials ArtStor GBIF, MaNIS,... Generally higher technical threshold; rely on higher level of metadata homogeneity & compliance

25 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC OAI-PMH as Complement to Other Approaches OAI-PMH provides a lowest-common-denominator approach to sharing & interoperability Insufficient for some high-level, domain-specific applications, But useful for sharing across more heterogeneous communities & allowing participation with less technology Portals can exploit combination of approaches OAI-PMH metadata harvesters can normalize & augment metadata before sharing on with domain-specific federated search portals

26 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC IMLS DCC Collection Registry (alpha) Features: Searchable Browseable An entry point for item-level searching

27 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC IMLS DCC Metadata Repository (alpha) Currently Harvesting: 27 Collections 193,677 Records Ongoing analysis of metadata Documenting practices Potential for normalization Implications for interface & search engine design

28 OAI-PMH & The IMLS DCC Project MCN 2004, 12 November 2004 University of Illinois at UC More Information This presentation: Project Website: Project PI: Tim Cole, Project Coordinator: Sarah Shreeves, OAI-PMH resources: Online OAI-PMH tutorial: DLF OAI-PMH & shareable metadata best practices (under development):