OAIster: Metadata Pointing to Digital Objects Kat Hagedorn Metadata Harvesting/DLXS Librarian University of Michigan Libraries February 18, 2004.

Slides:



Advertisements
Similar presentations
Digital Collections: Storage and Access Jon Dunn Assistant Director for Technology IU Digital Library Program
Advertisements

Online sheet music Jenn Riley Metadata Librarian Indiana University.
Furthering Collaboration Among OAI Data Providers and Service Providers Kat Hagedorn University of Michigan Libraries Digital Library Production Service.
DSpace Devika P. Madalli DRTC, ISI Bangalore.
Providing Online Access to the HKUST University Archives: EAD to INNOPAC Sintra Tsang and K.T. Lam The Hong Kong University of Science and Technology 7th.
StatCat Building a Statistical Data Finder ssrs.yale.edu/statcat Steven Citron-Pousty Ann Green Julie Linden Yale University.
Steve Yip Head of Reference and Research Services HKUST Library Research Support Provided by HKUST Library and other JULAC Libraries in HK 1 Date : March.
OAIster != Google Kat Hagedorn University of Michigan Libraries October 26, 2007.
University of Michigan’s OAIster Service Provider Kat Hagedorn OAIster/Metadata Harvesting Librarian University of Michigan, DLPS November 5, 2002.
And now for something completely different… informal collaboration Kat Hagedorn University of Michigan University of Michigan11/8/2005.
University of Michigan’s OAI Metadata Harvesting Project Kat Hagedorn OAIster Librarian, UM April 16, 2002.
Rights, Restrictions and Access Kat Hagedorn OAIster / Metadata Harvesting Librarian University of Michigan, DLPS May 31, 2003.
University of Michigan’s OAI Metadata Harvesting Project Kat Hagedorn OAIster Librarian, UM May 12, 2002.
OAIster: A “No Dead Ends” Digital Object Service Kat Hagedorn OAIster Librarian University of Michigan Libraries October 3, 2003.
UCLA Digital Library UC Digital Library Forum August 5, 2002 UCLA Digital Library Presenter: Curtis Fornadley Senior Programmer/Analyst.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
OAI and OAIster Kat Hagedorn University of Michigan Libraries October 30, 2006.
IMLS Grant: University of Michigan’s Role Kat Hagedorn
University of Michigan’s OAIster Lessons Learned Kat Hagedorn OAIster/Metadata Harvesting Librarian University of Michigan, DLPS October 7, 2002.
OAIster Kat Hagedorn University of Michigan Libraries September 12, 2007.
The Open Archives Initiative and OAIster: Past, Present and Future Kat Hagedorn University of Michigan Libraries April 6, 2006.
OAIster: What’s with the Weird Name? Kat Hagedorn UM Library Information Technology November 28, 2005.
I:\Share\Bestuursinligting\OUDITfinaal\Portfolio\Statistics\BI UPSpace An institutional repository for the University of.
Basic Concepts Architecture Topology Protocols Basic Concepts Open e-Print Archive Open Archive -- generalization of e-print Data Provider and Service.
EMu and Archives NA EMu Users Conference – Oct Slide 1 EMu and Archives Experiences from the Canada Science and Technology Museum Corporation.
Digital Library Architecture and Technology
Dspace 1 Introduction to DSpace Mukesh Pund Scientist NISCAIR, New Delhi.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Mark Sullivan University of Florida Libraries Digital Library of the Caribbean.
Interoperable Digitised Content “Discover, search, extract, link, associate, and view digitised content” Les Carr.
Creating rich shareable metadata: The DLF Aquifer MODS implementation guidelines Sarah L. Shreeves University of Illinois at Urbana-Champaign ALA Annual.
The Western Waters Digital Library: Building a Resource Through Multi- State Collaboration and Technology Dawn Paschal Assistant Dean, Digital Library.
Supporting further and higher education The UK FAIR Programme: OAI in context Chris Awre OAI3, CERN, February 2004.
ALCME: OAI at OCLC Jeffrey A. Young OCLC Online Computer Library Center, Inc.
Metadata: Essential Standards for Management of Digital Libraries ALI Digital Library Workshop Linda Cantara, Metadata Librarian Indiana University, Bloomington.
UVa's Digital Library CSG - September 2005 Slides courtesy of: Leslie Johnston Director, Digital Access Services, UVA Library Tim Sigmon University of.
Library Repositories and the Documentation of Rights Leslie Johnston, University of Virginia Library NISO Workshop on Rights Expression May 19, 2005.
OAI-PMH: Open Archives Initiative Protocol for Metadata Harvesting T.B. Rajashekar National Centre for Science Information (NCSI) Indian Institute of Science,
CONTENT DISCOVERY, SERVICES, AND SUSTAINED ACCESS Timothy Cole, William Mischo, Beth Sandore, Sarah Shreeves ~ University of Illinois Library
Imaging Pittsburgh: Creating a Shared Gateway to Digital Image Collections of the Pittsburgh Region IMLS 2002 National Leadership Grant Library & Museum.
Implementing PTFS ArchivalWare at York St John University: a project under the JISC Repositories Start-up and Enhancement (SUE) strand Helen Westmancoat.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.
OAI User Services Kat Hagedorn, UM University of Michigan 11/10/2005.
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Metadata and OAI DLESE OAI Workshop April 29-30, 2002 Katy Ginger Presentation available at:
Metadata and OAI DLESE OAI Workshop June 29 to July 2, 2002 Katy Ginger Presentation available at:
IMLS DCC Project Briefing ( ) Jenny Benevento ( ) Timothy W. Cole.
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
Ask a Librarian: The Role of Librarians in the Music Information Retrieval Community Jenn Riley, Indiana University Constance A. Mayer, University of Maryland.
OAIster and the WorldCat Digital Collection Gateway Casey A. Mullin Discovery Metadata Librarian Stanford University Music OCLC Users Group Annual Meeting.
DAEDALUS: ePrints Overview Web Meeting, 4th December 2004 William J Nixon Project Manager (DAEDALUS)
OAIster: A One-Stop-Shop Service for Digital Objects Kat Hagedorn OAIster Librarian University of Michigan Libraries September 18, 2003.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
A Training Program for Shareable Metadata Metadata for You & Me is a collaboration between the University of Illinois Library and Indiana University. This.
Institutional Repositories and Licensing of Research Output advanced information management laboratory university of cape town department of computer science.
Do Real Archivists Use OAI? Mid-Atlantic Regional Archives Conference Gettysburg, PA October 31, 2003 Chris Prom Assistant University Archivist University.
CERN Document Server 19 tth January 2006 CERN Document Server Jean-Yves Le Meur 19 th January 2006.
Definition, purposes/functions, elements of IR systems Lesson 1.
The Open Archives Initiative: Perspectives on Metadata Harvesting OAI Provider & Harvesting Services at the University of Illinois Timothy W. Cole Mathematics.
7th Annual Hong Kong Innovative Users Group Meeting
Utility of an OAI Service Provider Search Portal
University of Michigan’s OAIster Progress Report
Lifecycle …of OAI …of DPs and SPs
Introduction to DSpace
OAI 11/20/07.
IDEALS at the University Of Illinois: A Case Study of Integration Between an IR and Library Discovery Systems Sarah L. Shreeves University of Illinois.
Institutional Repositories
IMLS Grant: University of Michigan’s Role
Presentation transcript:

OAIster: Metadata Pointing to Digital Objects Kat Hagedorn Metadata Harvesting/DLXS Librarian University of Michigan Libraries February 18, 2004

background One-year Mellon grant project to test the feasibility of making OAI-enabled metadata for digital objects accessible to the public Digital Library Production Service at University of Michigan Libraries began work in December 2001 Launched in June 2002

highlights Any audience Any subject matter Any format Freely accessible No dead ends One-stop shopping …retrieving the “hidden web”

tool we borrowed University of Illinois Urbana-Champaign open-source OAI protocol harvester java edition for our unix environment Worked collaboratively to iron out kinks –resumptionToken / retryAfter –inexplicable kill –bogus records in MySQL table

development environment Digital Library Extension Service (DLXS) Develop open-source middleware and license XPAT search engine for building and mounting digital libraries Middleware consists of document classes, i.e., Text, Image, Bib, FindAid Originally designed to make SGML encoded texts available online

tool we developed Runs in DLXS environment using BibClass Current BibClass web templates modified Additional java-based transformation tool to: –DC metadata records concatenated –No-digital-object records filtered out –Records counted –Conversion from UTF-8 to ISO –XSLT used to transform DC records into BibClass records

system design UIUC harvester Record storage XSLT transformation tool BibClass indexes OAI-enabled DC records Non-OAI- enabled DC records XSL stylesheets (per source type) Search interface (XPAT)

result One place to look for digital objects Big –3,016,251 metadata records –267 institutions (as of last week…) Popular –Averages 3300 search sessions / month –Picked up in March ‘03: average 3500 now –43,894 searches in one year (June 2002 – July 2003)

search

limiters

sort

results

repositories

repositories: e.g., arXiv Eprint Archive: math and physics pre- and post-prints Online Archive of California: manuscripts, photographs, and works of art held in institutions across California Sammelpunkt, Elektronisch Archivierte Theorie: archive of philosophical publications British Women Romantic Poets Project: collection of poems written by British women between 1789 and 1832

repositories: stats As of February ‘04, out of 267 repositories… International and U.S. –U.S.: 50.5% (135) –Intl: 49.5% (132) By subject –Humanities: 24% (65) –Science: 30% (81) –Mixed: 46% (121) E-prints and pre-prints –Using eprints.org software: 39% (104) –Not using eprints.org software: 61% (163)

major issues encountered Metadata variation Records not leading to digital objects Access restrictions on digital objects described in records Duplicate records for a single digital object

issue: metadata variation With more records, users need more restrictions Consistent metadata needed to facilitate these restrictions One option: normalization of data

issue: metadata variation Type: the obvious quick win –240 metadata values mapped to four generic values (text, image, audio, video) –e.g., audio, sound = audio motion, animation, newsreels, etc. = video watercolour, watercolor, slides, etc. = image article, articles, booklet, diss, story, etc. = text

issue: metadata variation Date: where to begin? –Most records with at least one date –Some records include up to seven dates –No consistent style of date Subject: out of context, what meaning? –Many records with at least one subject element –But over 100 records with more than 50 subjects –And one record with 1000!

issue: metadata variation Sample date values between 1827 and ? November 13, 1947 SEP bce Summer, 1948

issue: metadata variation Sample subject values 30,51, , Apr. 22. E[veritt] Judson, letter to Philuta [Judson]. Slavery--United States--Controversial literature view of interior with John Henry sculpture Particles (Nuclear physics) -- Research.

issue: no digital objects Some records contain links to further description of digital object But not the digital object itself Culling difficult One option: add explanatory text to site Or, unfortunately, spot-check and remove repositories with this issue

issue: access restrictions No records where metadata itself is restricted in use (as far as we know!) Definitely some records where objects are restricted to licensed users One option: add explanatory text to site Or sub-set OAIster into free and “partially” free repositories

issue: duplicate records Two records harvested, different identifiers, same object described and pointed to Two records harvested inadvertently through aggregators and original repositories

issue: duplicate records Need algorithm to automate de- duplication Were duplicates to be identified, how to deal with the issue? –Suppress? –Group? –Flag? So far, not addressed in OAIster

future of OAIster Advanced searching Grouping to aid browsing Further normalization of data Handling duplicate records Saving/ ing/downloading records Collaboration with other services: search, instructional… More user testing…

current state of protocol Popular As Peter Suber says: –“…no other single idea or technology in the [open- source movement] has enjoyed this density of endorsement and adoption in a six month period.” Data providers over one year: –June ‘02: 56 repositories / 274,062 records –June ‘03: 187 repositories / 1,246,953 records –Over three-fold increase for repositories –Over four-fold increase for records

future of protocol Branching out –DC required vs. highly recommended –Use of OAI in closed environments –Static repository protocol –OAI-rights committee OAI evangelism

contact info Kat Hagedorn University of Michigan Libraries, Digital Library Production Service