Preservation Metadata Initiatives: Practicality, Sustainability, and Interoperability Michael Day UKOLN, University of Bath ERPANET Training.

Slides:



Advertisements
Similar presentations
Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.
Advertisements

Open Archives Forum Workshop University of Bath, September 2003 Workshop summing up Rachel Heery UKOLN, University of Bath
Open Archives Forum Workshop University of Bath, September 2003 Welcome Rachel Heery UKOLN, University of Bath
Open Archives Forum a place for you to meet Leona Carpenter UKOLN, University of Bath UKOLN is funded by Resource:
The Reference Model for an Open Archival Information System (OAIS) Michael Day Digital Curation Centre UKOLN, University of Bath
Metadata for preservation: the Cedars perspective
Metadata for images Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
Why metadata matters for libraries... Rachel Heery UKOLN: The UK Office for Library and Information Networking, University of Bath
PwC SCHEMAS Forum for metadata schema implementers The SCHEMAS project and metadata ETB Workshop, London, 9-10 January 2001 Michael Day,
Issues and approaches to preservation metadata Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
Collection-level description & collection management: tool for the trade or information trade-off? Collection Description Focus Workshop 4 Newcastle, 8.
The OAIS Reference Model: current implementations Michael Day, UKOLN, University of Bath Chinese-European Workshop.
The PREMIS Data Dictionary Michael Day Digital Curation Centre UKOLN, University of Bath JORUM, JISC and DCC.
The metadata challenge for libraries: a view from Europe Michael Day UKOLN: The UK Office for Library and Information Networking, University of Bath
UKOLN, University of Bath
An overview of collection-level metadata Applications of Metadata BCS Electronic Publishing Specialist Group, Ismaili Centre, London, 29 May 2002 Pete.
Metadata and the description of digital images Michael Day UKOLN, University of Bath International Digital Image Symposium London,
Preservation Metadata Initiatives: Practicality, Sustainability, and Interoperability Michael Day UKOLN, University of Bath ERPANET Training.
Digital disaster: are you prepared?, University College London, 23 June 2000 Michael Day, UKOLN Overview UKOLN is funded by Resource:
Image metadata: interoperability and exchange
UKOLN is supported by: The JISC Information Environment Metadata Schema Registry (IEMSR): Update DC-2006, Manzanillo, Mexico October 3-6, 2006 Rachel Heery.
UKOLN is supported by: JISC Information Environment update Repositories and Preservation Programme meeting, October 24-25, 2006 Rachel Heery UKOLN
Andy Powell, Eduserv Foundation July 2006 Repository Roadmap – technical issues.
Collection-level description & the Information Landscape: users evaluate strategies for resource discovery Collection Description Focus Workshop 5 Cambridge,
February Harvesting RDF metadata Building digital library portals with harvested metadata workshop EU-DL All Projects concertation meeting DELOS.
A centre of expertise in data curation and preservation DigCCur2007 Symposium, Chapel Hill, N.C., April 18-20, 2007 Co-operation for digital preservation.
Preserving and Sharing Digital Data Greg Colati, Director, Archives and Special Collections May 11, 2012.
Collections and services in the information environment JISC Collection/Service Description Workshop, London, 11 July 2002 Pete Johnston UKOLN, University.
Digital Preservation and Trusted Digital Repositories Priscilla Caplan Florida Center for Library Automation ALA 2005 Chicago IL.
Collection description & Collection Description Focus JISC/DNER Moving Image & Sound Cluster Steering Group meeting, HEFCE Office, London, 24 September.
The Dublin Core Collection Description Application Profile (DC CD AP) Pete Johnston, UKOLN, University of Bath Chair, DC Collection Description Working.
Integrating metadata schema registries with digital preservation systems to support interoperability Michael Day UKOLN, University.
Multi-purpose metadata for collections: Creating reusable CLDs Collection Description Focus Workshop 2 Aston Business School, Birmingham 8 February 2002.
Towards consensus on collection-level description Collection Description Focus Briefing Day 1 British Library, St Pancras, London 22 October 2001 Bridget.
An introduction to collections and collection-level description Collection-Level Description & NOF-digitise projects NOF-digitise programme seminar, London,
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
UKOLUG - July Metadata for the Web RDF and the Dublin Core Andy Powell UKOLN, University of Bath UKOLN.
Metadata for preservation Michael Day, UKOLN, University of Bath Chinese-European Workshop on Digital Preservation,
Documenting to preserve your data: metadata in support of digital preservation Michael Day, UKOLN, University of Bath
International Council on Archives Section on University and Research Institution Archives Michigan State University September 7, 2005 Preserving Electronic.
Metadata in support of digital preservation Michael Day, UKOLN, University of Bath Beginners Guide to Metadata:
PwC SCHEMAS Forum for metadata schema implementers Metadata: SCHEMAS and other European projects First Austrian Metadata Seminar, 18 May 2001 Michael Day,
A centre of expertise in digital information management The MEG Metadata Schemas Registry Pete Johnston, Research Officer (Interoperability),
OAIS Open Archival Information System. “Content creators, systems developers, custodians, and future users are all potential stakeholders in the preservation.
File format registries - a global infrastructure for local persistence Andreas Aschenbrenner, ERPANET.
METADATA STANDARDS Andrew Wilson Project Manager Digital Preservation Project.
Digital preservation Michael Day UKOLN, University of Bath, UK University of Bristol, MSc in Library and Information.
Digital Preservation: Current Thinking Anne Gilliland-Swetland Department of Information Studies.
Digital preservation Michael Day UKOLN, University of Bath, UK University of Bristol, MSc in Library and Information.
A Quick Introduction to Metadata Michael Day UKOLN: The UK Office for Library and Information Networking, University of Bath
Application Profiles: interoperable friend or foe? Rachel Heery Michael Day TEL Milestone Conference Frankfurt am Main, April 2002.
Integrating metadata schema registries with digital preservation systems to support interoperability Michael Day UKOLN, University of Bath, UK
Metadata for digital preservation: a review of recent developments Michael Day UKOLN, University of Bath ECDL2001, 5th European Conference.
A centre of expertise in digital information managementwww.ukoln.ac.uk DCMI Affiliates: Implications for Institutions Rosemary Russell UKOLN University.
The OAIS Reference Model Michael Day, Digital Curation Centre UKOLN, University of Bath Reference Models meeting,
Preservation metadata and the Cedars project Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
Open Archive Forum Rachel Heery UKOLN, University of Bath UKOLN is funded by Resource: The Council for Museums, Archives.
Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
Cedars work on metadata Michael Day UKOLN, University of Bath Cedars Workshop Manchester, February 2002.
Collection-level description: from theory to practice Minerva project meeting Paris, 24 January 2003 Pete Johnston UKOLN, University of Bath Bath, BA2.
UKOLN Metadata Projects: Subject Gateways and Preservation Metadata Michael Day UKOLN, University of Bath Electronic Resources: Definition,
Metadata Schema Registries: background and context MEG Registry Workshop, Bath, 21 January 2003 Rachel Heery UKOLN, University of Bath Bath, BA2 7AY UKOLN.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Metadata Issues in Long-term Management of Data and Metadata
Metadata for preservation
Metadata in Digital Preservation: Setting the Scene
Presentation transcript:

Preservation Metadata Initiatives: Practicality, Sustainability, and Interoperability Michael Day UKOLN, University of Bath ERPANET Training Seminar: Metadata in Digital Preservation, Archivschule Marburg, Germany 3-5 September 2003

ERPANET Training Seminar, Marburg, 3-5 September 2003 Presentation outline Categorisation of standards Practical issues –Implementation –Sustainability Interoperability –Registries of formats and metadata

ERPANET Training Seminar, Marburg, 3-5 September 2003 Basics Digital preservation strategies depend - to some extent - on the creation, capture and maintenance of suitable metadata: –"Preserving the right metadata is key to preserving digital objects" (ERPANET Briefing Paper) –"It's all about metadata" (Kelly Russell, ca. 2000) Metadata fulfil various roles, e.g.: –Within a digital repository, metadata accompanies and makes reference to each digital object and provides associated descriptive, structural, administrative, rights management, and other kinds of information (Clifford Lynch, 1999)

ERPANET Training Seminar, Marburg, 3-5 September 2003 The OAIS model The Reference Model for an Open Archival Information System (OAIS): –ISO 14721:2003 –Establishes a common framework of terms and concepts –Identifies basic functions: Ingest, Data Management, Archival Storage, Administration, Access, Preservation Planning –Defines an information model, e.g.: Information Packages Types of metadata required (but not a schema)

ERPANET Training Seminar, Marburg, 3-5 September 2003 Existing standards Developed from many different perspectives: –Generic –Applications of DCMES –Digital libraries: –OCLC/RLG Framework, Cedars, NEDLIB, NLA, NLNZ, METS, NISO Z39.87 … –OAIS influence has been greatest in this area –Recordkeeping: –Pittsburgh, RKMS, NAA, VERS, EAD … –Multimedia: –MPEG-7, SMPTE … –Rights management: –, MPEG-21 …

ERPANET Training Seminar, Marburg, 3-5 September 2003 Draft categorisation (1) PracticalConceptual PRO NEDLIB DCMI METS RKMS PITT VERS NLNZ NLA CEDARS OCLC/RLG MPEG-7 Z39.87

ERPANET Training Seminar, Marburg, 3-5 September 2003 Draft categorisation (2) –Earliest schemas were largely conceptual in nature: –e.g. Pittsburgh BAC model, Cedars outline specification, OCLC/RLG WG I –Gradually moving towards a more practical focus: –e.g., VERS, NLNZ, METS, OCLC/RLG WG II (Implementation Strategies) –Based on XML (DTDs and Schemas) –But there is an urgent need for this experience to be shared –e.g., briefing papers, advice to implementers

ERPANET Training Seminar, Marburg, 3-5 September 2003 Implementation Focus on implementation issues is increasingly important: –We need to prove the practical value of metadata frameworks and 'outline specifications' –It can be difficult for implementers to use these as a guide to the design of real systems? –We need to move from the conceptual to the practical, need to move beyond proof-of-concept –Positive signs: –METS/NISO Z39.87 –OCLC/RLG PREMIS WG looking at implementation strategies for preservation metadata

ERPANET Training Seminar, Marburg, 3-5 September 2003 Creation and capture Metadata creation/capture: –Who? –Human agency vs. automatic capture –How? –Much metadata already exists –The need for automatic (or semi-automatic) capture or conversion of metadata –When? –Need for metadata to be captured at creation, ingest, migration, and at other appropriate points in object life-cycle

ERPANET Training Seminar, Marburg, 3-5 September 2003 Sustainability (1) Balance risks with costs: –There is a perception that metadata creation and maintenance will be expensive –But costs associated with data recovery are not trivial –Need to balance the risks of data loss with the cost of creating metadata –Robust selection criteria –Co-operation between repositories –Re-use of existing metadata

ERPANET Training Seminar, Marburg, 3-5 September 2003 Sustainability (2) Avoid imposing unnecessary costs: –Avoid large schemas –Need to identify the right metadata ('core metadata'?) Who pays? –A more generic issue …

ERPANET Training Seminar, Marburg, 3-5 September 2003 Interoperability (1) Interoperability is important, e.g.: –To support the reuse of existing metadata, e.g., on Ingest –To support the exchange of digital objects between repositories –"… there is a critical need to develop tools that automatically supply core metadata, extract metadata from resources at ingest, and restructure and manage metadata over time" - (Hedstrom, 2003)

ERPANET Training Seminar, Marburg, 3-5 September 2003 Interoperability (2) Some problems: –The need to cope with a wide (and growing) range of metadata standards, object types, formats, etc. –Heterogeneity –No prospect of a single standard –Practical interoperability not within easy scope of the OAIS model

ERPANET Training Seminar, Marburg, 3-5 September 2003 Registries (1) A potential role for registries? –Format Registries –There is "… a pressing need to establish reliable, sustained repositories of file format specifications, documentation, and related software" (Lawrence, et al., 2000) –DSpace 'bitstream format registry' –Typed Object Model (TOM) project –IFLA Conference paper (Abrams & Seaman, 2003)

ERPANET Training Seminar, Marburg, 3-5 September 2003 Registries (2) –Metadata registries –"… formal systems that can disclose authoritative information about the semantics and structure of the data elements that are included within a particular metadata scheme" (Heery, et al., 2000) –Existing registries include the XML.org Registry and Repository (OASIS), and metadata registries set up by DCMI and SMPTE –There has been some experimentation with RDF registries as part of Semantic Web development

ERPANET Training Seminar, Marburg, 3-5 September 2003 Registries (2) –Registry Functions: –Provides support for the ingest process –May also provide support for the access function The export of Dissemination Information Packages The exchange of data objects (AIPs?) with other repositories; conversion to exchange standards –Can link metadata where there are multiple instances –Can help to manage schema evolution

ERPANET Training Seminar, Marburg, 3-5 September 2003 Registry functions Administration Ingest Archival Storage Access Data Management Descriptive info. PRODUCERPRODUCER CONSUMERCONSUMER MANAGEMENT queries result sets Descriptive info. Preservation Planning orders OAIS Functional Entities (Figure 4-1) SIP DIP AIP

ERPANET Training Seminar, Marburg, 3-5 September 2003 Registry functions Administration Ingest Archival Storage Access Data Management Descriptive info. PRODUCERPRODUCER CONSUMERCONSUMER MANAGEMENT queries result sets Descriptive info. Preservation Planning orders OAIS Functional Entities (Figure 4-1) Registry SIP DIP AIP Schemas

ERPANET Training Seminar, Marburg, 3-5 September 2003 Registry functions Administration Ingest Archival Storage Access Data Management Descriptive info. PRODUCERPRODUCER CONSUMERCONSUMER MANAGEMENT queries result sets Descriptive info. Preservation Planning orders OAIS Functional Entities (Figure 4-1) Registry SIP DIP AIP Schemas

ERPANET Training Seminar, Marburg, 3-5 September 2003 Registry functions Administration Ingest Archival Storage Access Data Management Descriptive info. PRODUCERPRODUCER CONSUMERCONSUMER MANAGEMENT queries result sets Descriptive info. Preservation Planning orders OAIS Functional Entities (Figure 4-1) Registry METS VERS MPEG-7 METS MPEG-7 AIP Schemas

ERPANET Training Seminar, Marburg, 3-5 September 2003 Open issues (1) Organisation of registries: –Registries are part of infrastructure –Distributed vs. centralised approaches: –Hedstrom (2003) suggests that format and metadata registries could/should be 'shared services' –Experimental distributed registries are based on Resource Description Framework (RDF) CORES Registry Encourage re-use of metadata (the 'application profile' concept) Are other technologies more suitable? –Who should be responsible?

ERPANET Training Seminar, Marburg, 3-5 September 2003 Open issues (2) Metadata quality: –Standards often deal with metadata semantics, but not always with 'content rules' –Recent experience with use of unqualified Dublin Core by OAI data providers suggests that metadata quality varies widely, e.g.: –DC underutilized, e.g. 5 of 15 elements used 71% of the time, many records have just 'title' and 'creator' elements (Ward, 2003) –Some quality problems with records being imported before their refinement by libraries (Halbert, 2003) –Authority control, de-duplication, etc.

ERPANET Training Seminar, Marburg, 3-5 September 2003 Open issues (3) Different data models: –How does the OAIS model fit into other data models being developed for (digital) objects? –Examples: Functional Requirements for Bibliographic Records (FRBR) - IFLA ABC Ontology and Model - Harmony project CIDOC Conceptual Reference Model (CRM) …

ERPANET Training Seminar, Marburg, 3-5 September 2003 Summing up –Implementation issues: –A need to focus on the practical issues of implementing preservation metadata standards within real systems –Then feed what is learnt through this back into the schema design (iterative process) –If it doesn't work, start again … –Interoperability: –For reuse and exchange of metadata –Possible role for format and metadata registries - but the concept needs extensive testing (and registries are not a panacea)

ERPANET Training Seminar, Marburg, 3-5 September 2003 Acknowledgements UKOLN is funded by Resource: the Council for Museums, Archives and Libraries, the Joint Information Systems Committee (JISC) of the UK higher and further education funding councils, as well as by project funding from the JISC and the European Union. UKOLN also receives support from the University of Bath, where it is based.