OAIster: A “No Dead Ends” Digital Object Service
Kat Hagedorn
OAIster Librarian
University of Michigan Libraries
October 3, 2003



background
One-year Mellon grant project to test the feasibility of making OAI-enabled metadata for digital objects accessible to the public
Digital Library Production Service at University of Michigan Libraries began work in December 2001
Publicized as OAIster in February 2002
Launched in June 2002

highlights
Any audience
Any subject matter
Any format
Freely accessible
No dead ends
One-stop shopping
…retrieving the “hidden web”

the protocol
OAI = Open Archives Initiative
OAI-PMH = Open Archives Initiative Protocol for Metadata Harvesting
Designed to make it easy to exchange metadata among interested parties
Consists of 6 HTTP requests to identify repositories / metadata and perform “harvesting”
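
The six requests ("verbs") that OAI-PMH defines are Identify, ListMetadataFormats, ListSets, ListIdentifiers, ListRecords, and GetRecord. A minimal sketch of how a harvester might build them as HTTP GET URLs follows; the base URL and the record identifier are placeholders for illustration, not real endpoints.

```python
from urllib.parse import urlencode

# Hypothetical repository endpoint, used only for illustration.
BASE_URL = "http://repository.example.org/oai"

# The six OAI-PMH verbs.
VERBS = (
    "Identify",
    "ListMetadataFormats",
    "ListSets",
    "ListIdentifiers",
    "ListRecords",
    "GetRecord",
)


def build_request(verb, **params):
    """Return the GET URL for one OAI-PMH request."""
    if verb not in VERBS:
        raise ValueError(f"unknown OAI-PMH verb: {verb}")
    query = {"verb": verb, **params}
    return f"{BASE_URL}?{urlencode(query)}"


if __name__ == "__main__":
    # Describe the repository, then ask for its Dublin Core records.
    print(build_request("Identify"))
    print(build_request("ListRecords", metadataPrefix="oai_dc"))
    print(build_request("GetRecord",
                        identifier="oai:example.org:1",
                        metadataPrefix="oai_dc"))
```

Identify, ListMetadataFormats, and ListSets describe the repository; ListIdentifiers, ListRecords, and GetRecord do the actual harvesting.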

tool we borrowed
University of Illinois Urbana-Champaign open-source OAI protocol harvester, Java edition, for our Unix environment
Worked collaboratively to iron out kinks
  – resumptionToken / retryAfter
  – inexplicable kill
  – bogus records in MySQL table
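
To show why resumptionToken and retry handling are easy places for kinks, here is a small sketch of the two checks a harvester makes between requests. It is not the UIUC harvester's code: the sample response, the helper names, and the 60-second default are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Namespace used by OAI-PMH 2.0 responses.
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"

# Made-up partial ListRecords response, for illustration only.
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record><header><identifier>oai:example.org:1</identifier></header></record>
    <resumptionToken>token-page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>
""".strip()


def next_token(response_xml):
    """Return the resumptionToken to send next, or None when the list is complete."""
    root = ET.fromstring(response_xml)
    element = root.find(f".//{OAI_NS}resumptionToken")
    # A missing or empty token element means there are no more pages.
    if element is None or not (element.text or "").strip():
        return None
    return element.text.strip()


def retry_delay(headers, default=60):
    """Seconds to wait when a repository answers with a Retry-After header.

    Only the integer-seconds form is handled; anything else falls back
    to the (assumed) default.
    """
    value = headers.get("Retry-After", "")
    return int(value) if value.isdigit() else default


if __name__ == "__main__":
    print(next_token(SAMPLE_RESPONSE))
    print(retry_delay({"Retry-After": "120"}))
```

A harvesting loop would keep requesting ListRecords with the returned token until `next_token` yields None, sleeping for `retry_delay` whenever the repository asks it to back off.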

development environment
Digital Library Extension Service (DLXS)
Develop open-source middleware and license XPAT search engine for building and mounting digital libraries
Middleware consists of document classes, i.e., Text, Image, Bib, FindAid
Originally designed to make SGML encoded texts available online

tool we developed
Runs in DLXS environment using BibClass
Current BibClass web templates modified
Additional Java-based transformation tool to:
  – Concatenate DC metadata records
  – Filter out records with no digital object
  – Count records
  – Convert from UTF-8 to ISO
  – Transform DC records into BibClass records using XSLT
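
The filtering, counting, and conversion steps can be sketched roughly as below. This is not the Java tool itself. It assumes that a record "has a digital object" when one of its identifier values is an http(s) URL, and that "ISO" means ISO-8859-1; both are guesses made for the example, as are the function names and the dict-based record shape.

```python
import re

# Assumption: a record points to a digital object if an identifier is a URL.
URL_PATTERN = re.compile(r"^https?://", re.IGNORECASE)


def has_digital_object(record):
    """True if any dc:identifier value looks like a URL."""
    return any(URL_PATTERN.match(value.strip())
               for value in record.get("identifier", []))


def filter_and_count(records):
    """Drop records without a digital object; return the kept records and their count."""
    kept = [record for record in records if has_digital_object(record)]
    return kept, len(kept)


def to_latin1(text):
    """Convert text to ISO-8859-1 bytes (assumed target encoding).

    Characters outside Latin-1 become XML character references
    instead of being dropped.
    """
    return text.encode("iso-8859-1", errors="xmlcharrefreplace")


if __name__ == "__main__":
    records = [
        {"title": ["Letter"], "identifier": ["http://example.org/obj/1"]},
        {"title": ["Catalog card"], "identifier": ["MS-1234"]},
    ]
    kept, count = filter_and_count(records)
    print(count, [record["title"][0] for record in kept])
    print(to_latin1("café"))
```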

system design
[Diagram components: OAI-enabled DC records, non-OAI-enabled DC records, UIUC harvester, record storage, XSLT transformation tool, XSL stylesheets (per source type), BibClass indexes, search interface (XPAT)]

result
One place to look for digital objects
Big
  – 1,723,003 metadata records
  – 203 institutions (as of September ‘03)
Popular
  – Averages 3300 search sessions / month
  – Picked up in March ‘03: average 3500 now
  – 43,894 searches total (through July ‘03)

search

limiters

sort

results

repositories

repositories: e.g.,
  – Online Archive of California: manuscripts, photographs, and works of art held in institutions across California
  – arXiv Eprint Archive: math and physics pre- and post-prints
  – Sammelpunkt, Elektronisch Archivierte Theorie: archive of philosophical publications
  – British Women Romantic Poets Project: collection of poems written by British women between 1789 and 1832

repositories: stats
As of July ‘03, out of 191 repositories…
U.S. and foreign
  – U.S.: 49% (94)
  – Foreign: 51% (97)
By subject
  – Humanities: 26% (50)
  – Science: 30% (58)
  – Mixed: 43% (83)
E-prints and pre-prints
  – Using eprints.org software: 41% (78)
  – Not using eprints.org software: 58% (110)

major issues encountered
Metadata variation
Records not leading to digital objects
Access restrictions on digital objects described in records
Duplicate records for a single digital object

issue: metadata variation
With more records, users need more restrictions
Consistent metadata needed to facilitate these restrictions
One option: normalization of data

issue: metadata variation
Type: the obvious quick win
  – 240 metadata values mapped to four generic values (text, image, audio, video)
  – e.g.,
      audio, sound = audio
      motion, animation, newsreels, etc. = video
      watercolour, watercolor, slides, etc. = image
      article, articles, booklet, diss, story, etc. = text
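
The mapping described on this slide is essentially a lookup table. A minimal sketch using only the example values listed above; the real table covered 240 values, and the function name and the choice to return None for unmapped values are assumptions.

```python
# Example values taken from the slide; the full mapping had 240 entries.
TYPE_MAP = {
    "audio": "audio", "sound": "audio",
    "motion": "video", "animation": "video", "newsreels": "video",
    "watercolour": "image", "watercolor": "image", "slides": "image",
    "article": "text", "articles": "text", "booklet": "text",
    "diss": "text", "story": "text",
}


def normalize_type(raw):
    """Map a harvested Type value to text, image, audio, or video.

    Returns None when the value is not in the table, so unmapped
    values can be reviewed rather than silently guessed.
    """
    return TYPE_MAP.get(raw.strip().lower())


if __name__ == "__main__":
    for value in ("Sound", "Watercolour", "diss", "hologram"):
        print(value, "->", normalize_type(value))
```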

issue: metadata variation
Date: where to begin?
  – Most records with at least one date
  – Some records include up to seven dates
  – No consistent style of date
Subject: out of context, what meaning?
  – Many records with at least one subject element
  – But over 100 records with more than 50 subjects
  – And one record with 1000!

issue: metadata variation Sample date values between 1827 and ? November 13, 1947 SEP bce Summer, 1948
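
One possible first pass at date values like these, offered only as an illustration and not as something OAIster did, is to pull out the first four-digit year and leave everything else unparsed. The pattern and function name are assumptions.

```python
import re

# Assumption: a usable year is a standalone four-digit number from 1000 to 2099.
YEAR = re.compile(r"\b(1[0-9]{3}|20[0-9]{2})\b")


def first_year(value):
    """Return the first four-digit year found in a free-text date, or None."""
    match = YEAR.search(value)
    return int(match.group(1)) if match else None


if __name__ == "__main__":
    for sample in ("November 13, 1947", "Summer, 1948", "SEP"):
        print(repr(sample), "->", first_year(sample))
```

Values with no recoverable year come back as None, which keeps them visible for manual review instead of forcing a guess.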

issue: metadata variation Sample subject values 30,51, , Apr. 22. E[veritt] Judson, letter to Philuta [Judson]. Slavery--United States--Controversial literature view of interior with John Henry sculpture Particles (Nuclear physics) -- Research.

issue: no digital objects
Some records contain links to further description of digital object
But not the digital object itself
Culling difficult
One option: add explanatory text to site

issue: access restrictions
No records where metadata itself is restricted in use (as far as we know!)
Definitely some records where objects are restricted to licensed users
One option: add explanatory text to site

issue: access restrictions
DC Rights element: often not enough info about viewing restrictions
Currently no protocol method for indicating restricted digital objects (i.e., “yes/no” toggle element)
Need to assess whether users feel informed or frustrated when encountering restricted objects

issue: duplicate records
Two records harvested, different identifiers, same object described and pointed to
Acquired in two ways:
  – Harvesting of original repository and aggregator
  – Receiving “static” DC records provided by content creator and harvesting aggregator

issue: duplicate records
Aggregators can contain records not currently available through OAI channels
Aggregators do not always contain all the records of a particular original repository
So, need to harvest both aggregator and original repositories

issue: duplicate records
Harvest records from aggregator
Also receive from original content creator, but as snapshot
  – e.g., MEO and cogprints
  – Snapshot before aggregator
  – Creator unsure all records would be aggregated

issue: duplicate records
Were duplicates to be identified, how to deal with the issue?
  – Suppress?
  – Group?
  – Flag?
So far, not addressed in OAIster
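
Since duplicates here are records with different identifiers that point to the same object, one way to find them (a sketch only, since OAIster had not addressed this) is to group records by the URL they point to. The record shape, the field names `id` and `url`, and the light URL clean-up are all assumptions.

```python
from collections import defaultdict


def group_by_target(records):
    """Group record identifiers by the object URL they point to.

    Returns only the URLs that more than one record points to. The URL
    is compared after trimming whitespace and a trailing slash, which is
    a deliberately crude notion of "same object".
    """
    groups = defaultdict(list)
    for record in records:
        key = record["url"].strip().rstrip("/")
        groups[key].append(record["id"])
    return {url: ids for url, ids in groups.items() if len(ids) > 1}


if __name__ == "__main__":
    harvested = [
        {"id": "oai:original:1", "url": "http://example.org/obj/1"},
        {"id": "oai:aggregator:77", "url": "http://example.org/obj/1/"},
        {"id": "oai:original:2", "url": "http://example.org/obj/2"},
    ]
    print(group_by_target(harvested))
```

The resulting groups could then feed any of the three options on the slide: suppress all but one record, display the group together, or flag each member.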

assessment
Large survey (over 400 respondents)
2 rounds of face-to-face and remote user testing
Conducted before design and after phase one rollout

assessment: survey
Online journals and reference materials wanted over other digital objects
Difficult to search for information; every service different; where to start
Number of respondents (5%) indicated they were generally successful in finding resources online

assessment: user testing
No short and long record formats: one size fits all
Want clearly defined and labeled AND/OR searching options
Results clear and easy to understand
Want to sort by title, date, institution, resource format…you name it!
Use OAIster for academic, trustworthy, authentic materials

service providers: comparison
Focus on high usability
Focus on all content available
Some service providers have increased functionality (e.g., de-duplication, integration of thesauri)
[Chart labels: DP-9, OAIster, Ad hoc, UIUC, Emory, etc.; axes: Usability (low to high), Content (some to all)]

future of OAIster
Make it faster
Advanced searching
Grouping to aid browsing
Saving/emailing/downloading records
Further normalization of data
Handling duplicate records
Collaboration with other services: search, instructional…

current state of protocol
Popular
As Peter Suber says:
  – “…no other single idea or technology in the [open-source] movement has enjoyed this density of endorsement and adoption in a six month period.”
Data providers over one year:
  – June ‘02: 56 repositories / 274,062 records
  – June ‘03: 187 repositories / 1,246,953 records
  – Over three-fold increase for repositories
  – Over four-fold increase for records
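
The two growth claims follow directly from the June ‘02 and June ‘03 figures on this slide; a short check of the arithmetic (variable and function names are illustrative):

```python
# Figures as given on the slide.
repositories = {"June 2002": 56, "June 2003": 187}
records = {"June 2002": 274_062, "June 2003": 1_246_953}


def fold_increase(counts):
    """Ratio of the June 2003 count to the June 2002 count."""
    return counts["June 2003"] / counts["June 2002"]


if __name__ == "__main__":
    print(f"repositories: {fold_increase(repositories):.2f}-fold")
    print(f"records: {fold_increase(records):.2f}-fold")
```

Both ratios land above the thresholds quoted: the repository count more than tripled and the record count more than quadrupled.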

future of protocol
Branching out
  – HTTP vs. SOAP
  – DC required vs. highly recommended
  – Use of OAI in closed environments
  – Static repository protocol
  – OAI-rights committee
Need for add-on applications
OAI evangelism

how can you be in OAIster?
OAI-enable your data
  – DLXS customer: easiest
  – Make sure data is UTF-8 / Unicode compliant
  – Provide as much metadata as you can
  – Use standard element tags
  – Develop “sets” for service providers
Let us know you’re ready to be harvested
Keep us informed about changes to the harvesting URL, new data and deleted data, change in contact info
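
For the UTF-8 requirement, a data provider can run a quick check before exposing records. A minimal sketch, assuming the metadata is available as raw bytes; the function name is illustrative.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


if __name__ == "__main__":
    good = "Sammelpunkt für Theorie".encode("utf-8")
    bad = "Sammelpunkt für Theorie".encode("iso-8859-1")  # Latin-1 bytes, not UTF-8
    print(is_valid_utf8(good), is_valid_utf8(bad))
```

Records that fail the check are usually in a legacy single-byte encoding and need converting before they are served over OAI.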

how can you use OAIster?
Just about anywhere…
Reference desks
Tool for researchers and faculty
Inclusion into list of electronic resources and/or subject guides
It is:
  – freely available
  – regularly updated
  – simple to use

contact info
Kat Hagedorn
University of Michigan Libraries, Digital Library Production Service