Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra.

Slides:



Advertisements
Similar presentations
OCLC/RLG Working Group on Metadata for Digital Preservation Brian Lavoie Office of Research OCLC Online Computer Library Center, Inc. Information Infrastructures.
Advertisements

The PREMIS Data Dictionary Michael Day Digital Curation Centre UKOLN, University of Bath JORUM, JISC and DCC.
Metadata and the description of digital images Michael Day UKOLN, University of Bath International Digital Image Symposium London,
Preservation Metadata Initiatives: Practicality, Sustainability, and Interoperability Michael Day UKOLN, University of Bath ERPANET Training.
Joint Information Systems Committee 11/03/07 | | Slide 1 Joint Information Systems CommitteeSupporting education and research JISC Conference 2007 Managing.
Digital Preservation and Trusted Digital Repositories Priscilla Caplan Florida Center for Library Automation ALA 2005 Chicago IL.
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
An Leabharlann UCD Órna Roche UCD James Joyce Library Metadata Documenting your data
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
Funded by: © AHDS Sherpa DP – a Technical Architecture for a Disaggregated Preservation Service Mark Hedges Arts and Humanities Data Service King’s College.
Mark Evans, Tessella Digital Preservation Boot Camp – PASIG meeting, Washington DC, 22 nd May 2013 PREMIS Practical Strategies For Preservation Metadata.
Common Use Cases for Preservation Metadata Deborah Woodyard-Robinson Digital Preservation Consultant Long-term Repositories:
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
3. Technical and administrative metadata standards Metadata Standards and Applications.
Current Thinking on Digital Preservation: Role of Metadata Oya Y. Rieger Coordinator, Library Office of Distributed Learning Cornell University Library.
Perspectives from The Alberta Library Learn, think, CHANGE 2004 Online Learning Symposium November 3, 2004 Zahina Iqbal.
PREMIS What is PREMIS? o Preservation Metadata Implementation Strategies When is PREMIS use? o PREMIS is used for “repository design, evaluation, and archived.
AIP Archival Information Package – Defines how digital objects and its associated metadata are packaged using XML based files. METS (binding file) MODS.
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
Metadata : Setting the Scene or a Basic Introduction Wendy Duff University of Toronto, Faculty of Information Studies.
Preserving Digital Collections Andrea Goethals Florida Center for Library Automation (FCLA)
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
Metadata for preservation Michael Day, UKOLN, University of Bath Chinese-European Workshop on Digital Preservation,
Documenting to preserve your data: metadata in support of digital preservation Michael Day, UKOLN, University of Bath
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
3. Technical and administrative metadata standards Metadata Standards and Applications Workshop.
Integrating Digital Curation in a Digital Library curriculum: the International Master DILL case study Anna Maria Tammaro University of Parma Florence,
Ensuring Enduring Access: A Forum on Digital Preservation, July 21, 2009.
Metadata in support of digital preservation Michael Day, UKOLN, University of Bath Beginners Guide to Metadata:
Jenn Riley Metadata Librarian Indiana University Digital Library Program.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
OCLC Research: an update Lorcan Dempsey
A disaggregated model for preservation of E-Prints Gareth Knight SHERPA DP Project Arts and Humanities Data Service.
Preservation – Why the Urgency? “A National Library is a place where a nation nourishes its memory and exerts its imagination – where it connects with.
An Introduction to METS Morgan Cundiff Network Development and MARC Standards Office Library of Congress Metadata Encoding and Transmission Standard.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
File format registries - a global infrastructure for local persistence Andreas Aschenbrenner, ERPANET.
Archival Information Packages for NASA HDF-EOS Data R. Duerr, Kent Yang, Azhar Sikander.
PREMIS Rathachai Chawuthai Information Management CSIM / AIT.
Creating Archive Information Packages for Data Sets: Early Experiments with Digital Library Standards Ruth Duerr, NSIDC MiQun Yang, THG Azhar Sikander,
METADATA STANDARDS Andrew Wilson Project Manager Digital Preservation Project.
Digital Preservation: Current Thinking Anne Gilliland-Swetland Department of Information Studies.
1 Strategic Plan for Digital Archives Programme DAP PROJECT SCOPE OVERVIEW STATUS.
Cataloging Compound Digital Objects: Using METS for Digitized Sanborn Maps Christopher Cronin Head of Digital Resources Cataloging University of Colorado.
OCLC Online Computer Library Center Preservation Metadata Standards PREMIS & METS Taylor Surface, OCLC.
Small steps and lasting impact: making a start with preservation or It’s not all NASA Patricia Sleeman Digital Archives and Repositories University of.
Conceptual Data Modelling for Digital Preservation Planets and PREMIS Angela Dappert.
The State of PREMIS Brian Lavoie Research Scientist OCLC PREMIS Implementation Fair San Francisco, CA October 7, 2009.
PREMIS Implementation Fair, San Francisco, CA October 7, Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.
Metadata for digital preservation: a review of recent developments Michael Day UKOLN, University of Bath ECDL2001, 5th European Conference.
Symposium on Global Scientific Data Infrastructures Panel Two: Stakeholder Communities in the DWF Ann Wolpert, Massachusetts Institute of Technology Board.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
PREMIS Data Dictionary and the Future of Preservation Metadata Brian Lavoie Research Scientist OCLC Research Society of American Archivists.
The OAIS Reference Model Michael Day, Digital Curation Centre UKOLN, University of Bath Reference Models meeting,
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Open Access & Institutional Repositories, Accra June 2007 Metadata and e-preservation Dr D Peters DISA: Digital Innovation South Africa.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
Cedars work on metadata Michael Day UKOLN, University of Bath Cedars Workshop Manchester, February 2002.
An Introduction to PREMIS Jenn Riley Metadata Librarian IU Digital Library Program.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
13 July 2005 Archives Hub day conference The Paradigm Project: The University of Oxford & The University of Manchester
Preserving Digital Collections
Building A Repository for Digital Objects
Introduction to Implementing an Institutional Repository
Metadata for preservation
Metadata in Digital Preservation: Setting the Scene
Oya Y. Rieger Cornell University Library May 2004
Presentation transcript:

Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra November 10, 2004

Metadata and Preservation Metadata “Structured information that describes, explains, locates, or otherwise makes it easier to retrieve, use, or manage an information resource” METADATA Descriptive Structural Administrative PRESERVATION METADATA “Information that supports and documents the digital preservation process” Administrative Structural Descriptive

Preservation Metadata: Examples  Provenance: Who has had custody/ownership of the digital object?  Authenticity: Is the digital object what it purports to be?  Preservation Activity: What has been done to preserve the digital object?  Technical Environment: What is needed to render and use the digital object?  Rights Management: What IPR must be observed?

Why Is Preservation Metadata Important?  Digital objects are technology-dependent … Means to access and use archived object must be documented Complex technological environment between content and user Technical metadata especially important  Digital objects are mutable … Can be easily altered, impacting look, feel, functionality Changes to object must be documented/validated Provenance metadata especially important  Digital objects are bound by intellectual property rights … Preservation must proceed while copyright still in effect May constrain preservation activities and access policies Rights management metadata especially important  Makes digital objects self-documenting across time

Preservation Metadata Initiatives … Around the World CEDARS OCLC NLA NEDLIB NLNZ U. of E.

Towards Consensus: OCLC/RLG Preservation Metadata Framework Working Group  March 2000: OCLC, RLG jointly sponsored international working group on preservation metadata Identify key issues, seek consensus  White paper (January 2001) Defined preservation metadata; role in preservation process Reviewed/synthesized existing preservation metadata initiatives  Preservation metadata framework (June 2002) Comprehensive description of types of information constituting preservation metadata Based on OAIS information model Set of “prototype” preservation metadata elements

Next Steps: PREMIS  Preservation Metadata Framework Consolidated expertise Foundation for formal schemas Shared departure point for different schema implementations Interest in moving framework closer to an implementable status  June 2003: OCLC, RLG sponsored new working group: PREMIS: Preservation Metadata: Implementation Strategies  Objectives Define core set of preservation metadata elements, with supporting data dictionary, applicable to broad range of digital preservation activities Identify and evaluate alternative strategies for encoding, storing, managing, and exchanging preservation metadata

PREMIS: Current Status  Core Elements: element-by-element review of prototype elements from metadata framework: Is the element “core”? Data dictionary: definition, rationale, examples, usage rules (data constraints, obligation, repeatability)  Implementation Strategies: survey to identify key characteristics of digital preservation repositories: Mission, content, funding, policies, etc. Focus on how metadata is used to support repository processes, functions, and policies  Expected completion: December 2004

Preservation Metadata Schemas: Perspectives  Prospects for consensus, standards … Foundation starting to coalesce, informing current work But must overcome differences across systems and policies  Involve all stakeholders: creators, publishers, all cultural heritage communities  Focus on internal guidance AND interoperability  Avoid re-invention of wheels: potential overlap between other metadata initiatives

Implementation Issues: Tools  General consensus that: 1) Metadata is key component of digital preservation process 2) Preservation metadata is expensive to create and maintain 3) Need to minimize human mediation  JSTOR/Harvard Object Validation Environment (JHOVE): Identify, validate, and characterize digital object formats Modules for: TIFF (various versions), PDF, XML, and others  NLNZ Preservation Metadata Extract Tool: Extracts information from digital file headers (e.g., MS Word, TIFF, WAV, bitmaps); outputs metadata in XML format  Surface preservation metadata tools in variety of digital repository environments (Dspace, Fedora, DAITSS)

Implementation Issues: Economics  Develop economical ways of acquiring and maintaining preservation metadata  PRONOM File Format Registry (UK National Archives) Technical metadata about specific file formats Description of software needed to create, render, migrate formats Metadata created once, re-used many times  Automatic Exposure (RLG) Facilitate capture of metadata specified in NISO Z39.87 (Technical Metadata for Digital Still Images) Dialog with digital scanner/camera manufacturers Technical metadata automatically captured when object created  Reduce cost of metadata creation by leveraging opportunities for sharing and re-use, and diffusing metadata capture throughout information lifecycle

Implementation Issues: Packaging  Link (physically or logically) archived digital object and all associated metadata  OAIS Information Package Conceptual structure for information moving into, through, and out of archival system Digital object and its metadata, bound into single logical package  Metadata Encoding and Transmission Standard (METS) XML schema for encoding descriptive, administrative, and structural metadata associated with digital object PREMIS elements to be implemented as METS extension schema  Sharing and re-use of preservation metadata in a networked repository environment requires standard mechanisms for encoding and exchange

Implementation Issues: Perspectives  Current focus on format validation and technical metadata. Need work on tools that: Address other forms of preservation metadata Support formal preservation metadata schemas (PREMIS core)  Division of labor: Map preservation metadata requirements to appropriate stages of information lifecycle Allocate responsibility for collecting metadata  “Quality assurance”

Looking ahead …  Questions of “what type”, “how much” preservation metadata still unsettled … Digital preservation processes still not fully tested/understood Metadata requirements shaped by local repository characteristics  Collaboration essential: Pooling expertise from variety of institutional perspectives mitigates uncertainty Highlights points of convergence/divergence; helps distinguish metadata that is widely applicable vs. domain-specific Helps identify best practices and encourages standards-building  In the meantime … “Good judgment is based on experience, and experience is based on bad judgment”

More information… PADI Preservation Metadata Bibliography:  PREMIS:  JHOVE:  PRONOM:  OAIS: pdf/CCSDS B-1.pdf  METS: 