2/26/2004 Dan Swaney 1 Preservation Metadata and the OAIS Information Model A Metadata Framework to Support the Preservation of Digital Objects A review.

Presentation transcript:

Slide 1: Preservation Metadata and the OAIS Information Model
A Metadata Framework to Support the Preservation of Digital Objects
A review of the report by the OCLC/RLG Working Group on Preservation Metadata, June (Report:
Presented by Dan Swaney, 2/26/2004

Slide 2: The OCLC/RLG Working Group
- March 2000: Working Group was formed by
  - OCLC: Online Computer Library Center, Inc.
  - RLG: Research Libraries Group, Inc.
- Started with a white paper entitled “Preservation Metadata for Digital Objects: A Review of the State of the Art”
  - Introduced concepts that were followed by the development of the actual framework discussed later.

Slide 3: What is OAIS?
- Open Archival Information System
  - May 1999 (original model)
    - Supported the space community
  - June 2001 (revised model)
    - Extended to support libraries/cultural heritage institutions, government agencies, and the private sector
  - Information Model embedded in OAIS
    - Direct relevance to preservation metadata

Slide 4: OAIS Information Model: The Bottom -- From Data to Information
- Information Object
  - Data Object: a Digital Object OR a Physical Object
  - Representation Information: describes the Data Object’s bits (e.g. a sound file, a paragraph of text, an image)
- Knowledge Base (external to the archival system)
  - Example: programmers must have the knowledge base to understand Java source

Slide 5: OAIS Information Model: Moving from the Bottom to the Top
- Information Object
  - Data Object: a Digital Object OR a Physical Object
  - Representation Information: describes the Data Object’s bits (e.g. a sound file, a paragraph of text, an image)
- Knowledge Base (external to the archival system)
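
The relationship between a Data Object and its Representation Information can be sketched in code. This is only an illustrative sketch, not part of the report: the class and field names (`DataObject`, `RepresentationInformation`, `encoding`, `interpret`) are assumptions, and a character encoding stands in for the much richer Representation Information the model describes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataObject:
    """The raw bits held by the archive (hypothetical class)."""
    bits: bytes


@dataclass(frozen=True)
class RepresentationInformation:
    """What is needed to interpret the bits (hypothetical, simplified to an encoding)."""
    encoding: str
    description: str


@dataclass(frozen=True)
class InformationObject:
    """A Data Object paired with its Representation Information."""
    data: DataObject
    representation: RepresentationInformation

    def interpret(self) -> str:
        # The same bits only become meaningful text once the encoding is known.
        return self.data.bits.decode(self.representation.encoding)


bits = "café".encode("utf-8")
right = InformationObject(DataObject(bits), RepresentationInformation("utf-8", "paragraph of text"))
wrong = InformationObject(DataObject(bits), RepresentationInformation("latin-1", "paragraph of text"))

text_right = right.interpret()
text_wrong = wrong.interpret()
```

With the correct Representation Information the bits read back as the original word; with the wrong one the same bits decode without error but yield garbled text, which is why the model stores the two together.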

Slide 6: OAIS Information Model: The Top -- From Object to Package
- Information Object
- Information Package
  - Types: Archival (AIP), Submission (SIP), Dissemination (DIP)
  - Components: Content Information, Preservation Description Information, Packaging Information, Descriptive Information

Slide 7: Three Types of Information Packages
- Submission Information Package (SIP): sent by the Information Producer to the Archive when submitting an Information Object
- Archival Information Package (AIP): held within the Archive
- Dissemination Information Package (DIP): sent out by the Archive when responding to a query request

Slide 8: Inside the Information Package
- Content Information (CI)
  - ‘Content’ Data Object
  - Representation Info
- Preservation Description Information (PDI)
  - Info to manage preservation of Content Info
  - Reference Info
  - Provenance Info
  - Context Info
  - Fixity Info
- Packaging Information
  - Header block of info that binds together an Archival Information Package
  - Binds together: digital object + assoc. metadata
- Descriptive Information
  - Metadata for resource discovery
  - Assists finding aids
  - An abstract?
  - Derived from: CI & PDI
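
The four components of an Information Package can be modeled as a simple container. The sketch below is a hedged illustration under stated assumptions: every class, field, and helper name (`InformationPackage`, `derive_descriptive_info`, the dictionary keys) is hypothetical, and plain dictionaries stand in for the real metadata elements.

```python
from dataclasses import dataclass, field
from enum import Enum


class PackageType(Enum):
    SIP = "Submission Information Package"
    AIP = "Archival Information Package"
    DIP = "Dissemination Information Package"


@dataclass
class ContentInformation:
    content_data_object: bytes   # the raw bits
    representation_info: dict    # how to interpret them


@dataclass
class PreservationDescriptionInformation:
    reference: dict = field(default_factory=dict)
    provenance: list = field(default_factory=list)
    context: dict = field(default_factory=dict)
    fixity: dict = field(default_factory=dict)


@dataclass
class InformationPackage:
    package_type: PackageType
    content: ContentInformation
    pdi: PreservationDescriptionInformation
    packaging_info: dict     # binds the digital object to its metadata
    descriptive_info: dict   # supports resource discovery


def derive_descriptive_info(ci: ContentInformation,
                            pdi: PreservationDescriptionInformation) -> dict:
    """Hypothetical helper: build discovery metadata from CI and PDI."""
    return {
        "format": ci.representation_info.get("format", "unknown"),
        "identifier": pdi.reference.get("archival_system_id", "unassigned"),
    }


ci = ContentInformation(b"...", {"format": "image/tiff"})
pdi = PreservationDescriptionInformation(reference={"archival_system_id": "local-0001"})
aip = InformationPackage(
    package_type=PackageType.AIP,
    content=ci,
    pdi=pdi,
    packaging_info={"members": ["content", "pdi"]},
    descriptive_info=derive_descriptive_info(ci, pdi),
)
```

Deriving `descriptive_info` from the other two components mirrors the slide's note that Descriptive Information is derived from CI and PDI rather than authored separately.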

Slide 9: Implementing Two of the Components of the OAIS Model
- First: Content Information (CI)
  - ‘Content’ Data Object (CDO)
    - Raw data bits
  - Representation Info (2 components)
    - Structure Info: technical description/specification
      - Example: format, data structures, encoding
      - Makes the CDO understandable by machines/systems
    - Semantic Info: explains the data
      - Example: interpret as English, or as temperatures delimited by tabs
      - Makes the CDO understandable by humans
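
The split between Structure Info and Semantic Info can be shown with the slide's own example of tab-delimited temperatures. This is a minimal sketch: the sample values, the dictionary keys, and the helper names `parse` and `describe` are assumptions made for illustration.

```python
# Raw data bits of a hypothetical 'Content' Data Object.
raw_bits = b"21.5\t22.0\t19.8\n"

# Structure Info: enough for a machine to split the bits into values.
structure_info = {"encoding": "ascii", "delimiter": "\t", "record_terminator": "\n"}

# Semantic Info: enough for a human to know what the values mean.
semantic_info = {"meaning": "air temperature", "unit": "degrees Celsius"}


def parse(raw: bytes, structure: dict) -> list:
    """Apply Structure Info: decode the bits and split them into numeric records."""
    text = raw.decode(structure["encoding"])
    terminator = structure["record_terminator"]
    records = text.rstrip(terminator).split(terminator)
    return [[float(v) for v in rec.split(structure["delimiter"])] for rec in records]


def describe(values: list, semantics: dict) -> list:
    """Apply Semantic Info: attach meaning and units to the parsed numbers."""
    return [f"{v} {semantics['unit']} ({semantics['meaning']})" for v in values]


rows = parse(raw_bits, structure_info)
labels = describe(rows[0], semantic_info)
```

Without `structure_info` the bytes cannot be split into numbers at all; without `semantic_info` the numbers parse correctly but nothing says whether they are temperatures, prices, or page counts.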

Slide 10: Content Information (CI) Attributes
- Content Information (CI) Package
  - ‘Content’ Data Object
  - Representation Information
    - ‘Content’ Data Object Description
    - Environment Description
- ‘Content’ Data Object Description
  - Details for rendering/viewing in human-readable form
  - Defines attributes:
    1. Abstract of Steps
       - Steps to restore a ZIP file back to files/folders
       - Steps to restore into a DBMS
    2. Structural Type
    3. Technical Infrastructure (Web page and all its required files)
    4. File Description
    5. Installation Requirements
    6. Size
    7. Access Inhibitors
    8. Access Facilitators
    9. Significant Properties (whether to enable special features)
    10. Functionality (Web page requires JavaScript)
    11. Description of Rendered Content
    12. Quirks (lost features)
    13. Documentation

Slide 11: Content Information (CI) Attributes
- Content Information (CI) Package
  - ‘Content’ Data Object
  - Representation Information
    - ‘Content’ Data Object Description
    - Environment Description
      - Software Environment
        - Rendering Programs
        - Operating System
      - Hardware Environment
- Rendering Programs: rendering is a two-step process
  1. Transform
  2. Display/Access
- Defines attributes:
  1. Transform Process
     + Transformer Engine
     + Params
     + Input Format
     + Output Format
     + Location
     + Documentation
  2. Display/Access App
     + Input Format
     + Output Format
     + Location
     + Documentation
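
The two-step rendering process (transform, then display/access) can be sketched as a small pipeline in which each step declares its input and output format. This is an assumption-laden illustration: `RenderStep`, `render`, and the format labels are hypothetical, gzip decompression stands in for a transformer engine, and decoding to text stands in for a display application.

```python
import gzip
from dataclasses import dataclass
from typing import Any, Callable


@dataclass
class RenderStep:
    """One rendering step with its declared formats (hypothetical structure)."""
    name: str
    input_format: str
    output_format: str
    run: Callable[[Any], Any]


def render(data: Any, source_format: str, steps: list) -> tuple:
    """Run the steps in order, checking that each step accepts the current format."""
    current_format = source_format
    for step in steps:
        if step.input_format != current_format:
            raise ValueError(
                f"{step.name} expects {step.input_format}, got {current_format}"
            )
        data = step.run(data)
        current_format = step.output_format
    return data, current_format


steps = [
    # Step 1: Transform (stand-in transformer engine: gzip decompression).
    RenderStep("transform", "application/gzip", "text/plain", gzip.decompress),
    # Step 2: Display/Access (stand-in display app: decode bytes to readable text).
    RenderStep("display", "text/plain", "rendered-text", lambda b: b.decode("utf-8")),
]

stored = gzip.compress(b"hello archive")
rendered, final_format = render(stored, "application/gzip", steps)
```

Recording the input and output format of each step is what lets the archive detect, before running anything, that a stored object no longer matches the rendering chain documented for it.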

Slide 12: Content Information (CI) Attributes
- Content Information (CI) Package
  - ‘Content’ Data Object
  - Representation Information
    - ‘Content’ Data Object Description
    - Environment Description
      - Software Environment
        - Rendering Programs
        - Operating System
      - Hardware Environment
- Operating System defines attributes:
  + OS Name
  + OS Version
  + Location
  + Documentation
- Lacks/Needs:
  - Recommended Env. or
  - Minimum Env.
  - It’s easier to define the environment in terms of recommended or minimum.

Slide 13: Content Information (CI) Attributes
- Content Information (CI) Package
  - ‘Content’ Data Object
  - Representation Information
    - ‘Content’ Data Object Description
    - Environment Description
      - Software Environment
      - Hardware Environment
        - Computational Resources
        - Storage
        - Peripherals
- Hardware Environment defines attributes:
  1. Computational Resources
     + Microprocessor Required (e.g. Pentium 4, 1 GHz)
     + Memory Required
     + Documentation
     + Location (URL)
  2. Storage
     + Storage Information (requires 10 GB disk space)
     + Documentation
     + Location (URL)
  3. Peripherals
     + Peripheral Requirements (sound card, monitor resolution)
     + Documentation
     + Location (URL)

Slide 14: Content Information (CI) Attributes
- Content Information (CI) Package
  - ‘Content’ Data Object
  - Representation Information
    - ‘Content’ Data Object Description
    - Environment Description
      - Software Environment
      - Hardware Environment
        - Computational Resources
        - Storage
        - Peripherals
- Hardware Environment defines attributes (continued):
  4. Hardware Environment as a Whole
     + Location (e.g. the machine is in a ‘technology museum’ or available through an emulation program like VMware)

Slide 15: Implementing Two of the Components of the OAIS Model
- Second: Preservation Description Information (PDI)
  - Focuses on the information needed to track the history of the ‘Content’ Data Object
    - How it was added/scanned into digital form
    - Who did it
    - Who took care of it at some point in time
    - Like a library index card in the back of a book tracking who checked it out

Slide 16: PDI’s Four Categories
- Preservation Description Information (PDI)
  - Reference Info
  - Context Info
  - Provenance Info
  - Fixity Info
- Reference Info describes mechanisms for assigning an ID to represent the Data Object both:
  - Locally (within the archive), and
  - Globally (referenced by an external system)
- Defines attributes:
  1. Archival System ID
     + Value
     + Constr. Method
     + Resp. Agency
  2. Global ID (ISBN, URL)
     + Value
     + Constr. Method
     + Resp. Agency
  3. Resource Description
     + Existing Metadata (MARC bibl. record)
     + Existing Records (bibliographic record in WorldCat)

Slide 17: PDI: 3 Types of Reference Info
- Preservation Description Information (PDI)
  - Reference Information
    - Archival System Identification
    - Global Identification
    - Resource Description
  - Context Information
  - Provenance Information
  - Fixity Information
- Defines attributes:
  1. Archival System ID
     + Value
     + Constr. Method
     + Resp. Agency
  2. Global ID (ISBN, URL)
     + Value
     + Constr. Method
     + Resp. Agency
  3. Resource Description
     + Existing Metadata (MARC bibl. record)
     + Existing Records (bibliographic record in WorldCat)
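
The three kinds of Reference Information can be captured in a small record type. The sketch below is illustrative only: the class names, the `assign_local_id` helper, the use of a random UUID as the construction method, and the placeholder ISBN and agency names are all assumptions, not something the report prescribes.

```python
import uuid
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Identifier:
    """Value, construction method, and responsible agency, as listed on the slide."""
    value: str
    construction_method: str
    responsible_agency: str


@dataclass
class ReferenceInformation:
    archival_system_id: Identifier                   # local: within the archive
    global_ids: list = field(default_factory=list)   # global: ISBN, URL, ...
    existing_metadata: list = field(default_factory=list)
    existing_records: list = field(default_factory=list)


def assign_local_id(agency: str) -> Identifier:
    """Hypothetical construction method: a random (version 4) UUID as a URN."""
    return Identifier(
        value=f"urn:uuid:{uuid.uuid4()}",
        construction_method="random UUID (version 4)",
        responsible_agency=agency,
    )


local_id = assign_local_id("Example Archive")
reference = ReferenceInformation(
    archival_system_id=local_id,
    # Placeholder ISBN, not a real book.
    global_ids=[Identifier("ISBN 0-000-00000-0", "assigned by publisher", "Example Publisher")],
    existing_metadata=["MARC bibliographic record"],
)
```

Storing the construction method and responsible agency next to each value makes it possible to tell later who issued an identifier and how it could be regenerated or validated.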

Slide 18: PDI: Types of Context Information
- Preservation Description Information (PDI)
  - Reference Information
  - Context Information
    - Reason for Creation
    - Relationships
      - Manifestation
      - Intellectual Content
  - Provenance Information
  - Fixity Information
- Defines attributes:
  1. Reason for Creation (TIFF file created to save a rare book)
  2. Relationships (part of a collection, chapters in a book)
     + Manifestation (change history, recording outcome of a migration)
       + Relationship Type (translated to HTML)
       + Identification (ID/link to description of object)
     + Intellectual Content (relates a chapter to a book)
       + Relationship Type (Web page, collection)
       + Identification (ID/link to description of ‘related’ object)

Slide 19: PDI: Types of Provenance Information
- Preservation Description Information (PDI)
  - Reference Information
  - Context Information
  - Provenance Information
  - Fixity Information
- There are 5 Event Types defined as attributes:
  1. Origin (Event)
     - Describes the process by which the object was created.
  2. Pre-Ingest (Event)
     - Chain of custody or audit trail.
     - Tracks history of content before it was digitized or added to the archive.
  3. Ingest (Event)
     - Tracks how the object was added to the archive.
  4. Archival Retention
     - Tracks migration history of what happened since its original ingest/add into the archive. If transformed, records what was lost.
  5. Rights Management (Event)
     - Access permissions
     - Legal deposit responsibilities (if sensitive)
- Each Event records:
  + Designation (e.g. Change in Custody, Migration)
  + Procedure
  + Date
  + Resp. Agency
  + Outcome
  + Note
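
The event attributes above map naturally onto an append-only history. The following is a minimal sketch under stated assumptions: `Event`, `ProvenanceLog`, and their field names are hypothetical, and the agencies, dates, and outcomes are placeholder values invented for the example.

```python
import datetime
from dataclasses import dataclass, field

# The five event types named on the slide.
EVENT_TYPES = ("Origin", "Pre-Ingest", "Ingest", "Archival Retention", "Rights Management")


@dataclass(frozen=True)
class Event:
    event_type: str
    designation: str            # e.g. "Change in Custody", "Migration"
    procedure: str
    event_date: datetime.date
    responsible_agency: str
    outcome: str
    note: str = ""


@dataclass
class ProvenanceLog:
    """Append-only list of events for one Content Data Object (hypothetical)."""
    events: list = field(default_factory=list)

    def record(self, event: Event) -> None:
        if event.event_type not in EVENT_TYPES:
            raise ValueError(f"unknown event type: {event.event_type}")
        self.events.append(event)

    def history(self) -> list:
        # Present events chronologically, regardless of the order they were recorded in.
        return sorted(self.events, key=lambda e: e.event_date)


log = ProvenanceLog()
log.record(Event("Ingest", "Ingest", "Deposited via submission package",
                 datetime.date(2004, 2, 26), "Example Archive", "accepted"))
log.record(Event("Origin", "Digitization", "Scanned from print to TIFF",
                 datetime.date(2003, 11, 3), "Example Scanning Unit", "success"))
log.record(Event("Archival Retention", "Migration", "TIFF converted to PNG",
                 datetime.date(2010, 5, 1), "Example Archive", "success",
                 note="embedded color profile was lost"))

timeline = [e.event_type for e in log.history()]
```

The `note` on the migration event shows where a record of what was lost in a transformation would live.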

Slide 20: PDI: Types of Fixity Information
- Preservation Description Information (PDI)
  - Reference Information
  - Context Information
  - Provenance Information
  - Fixity Information
- Goal: To not have something altered and not know when, how, or why.
- Defined attributes:
  1. Object Authentication
     - Digital Signature
     - Watermark
     - Checksum
     + Auth Type (signed using a 160-bit one-way SHA-1 hash)
     + Auth Procedure (pointer to software capable of generating a new SHA-1 hash for comparison)
     + Auth Date (last time this procedure was run)
     + Auth Result (latest result of running this procedure)
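
A checksum-based fixity check can be sketched with Python's standard `hashlib` module. This is an illustration, not the report's procedure: the record layout and the function names `make_fixity_record` and `verify` are assumptions. SHA-1 is used only because the slide names it (its digest is 160 bits, i.e. 40 hex characters); a stronger hash such as `hashlib.sha256` can be swapped in the same way.

```python
import hashlib
from datetime import datetime, timezone


def sha1_hex(data: bytes) -> str:
    """Return the SHA-1 digest of the data as 40 hexadecimal characters."""
    return hashlib.sha1(data).hexdigest()


def make_fixity_record(data: bytes) -> dict:
    """Create the fixity record at ingest time (hypothetical layout)."""
    return {
        "auth_type": "SHA-1 checksum",
        "auth_procedure": "hashlib.sha1 over the full byte stream",
        "expected_digest": sha1_hex(data),
        "auth_date": None,     # filled in each time verification runs
        "auth_result": None,
    }


def verify(data: bytes, record: dict, now: datetime = None) -> dict:
    """Recompute the digest and return an updated copy of the record."""
    now = now or datetime.now(timezone.utc)
    matches = sha1_hex(data) == record["expected_digest"]
    return {
        **record,
        "auth_date": now.isoformat(),
        "auth_result": "pass" if matches else "fail",
    }


original = b"rare book scan"
record = make_fixity_record(original)

check_ok = verify(original, record)
check_altered = verify(b"rare book scan!", record)
```

Each call to `verify` refreshes the Auth Date and Auth Result while leaving the expected digest untouched, so a silent change to the stored bits shows up as a `fail` on the next run.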

Slide 21: Review of the PDI
- Content Information Package
  - Preservation Description Information (PDI): info to manage preservation of Content Info
    - Reference Info
      - Identifiers both internal and external to the archive (e.g. ISBN, URN)
    - Provenance Info
      - Documents history of the CI (simulates a library checkout card that shows who checked out the book)
    - Context Info
      - Relates CI to why it was created, relations to other objects
    - Fixity Info
      - Data integrity (checksum, hash, signature)
      - History of changes
      - Keeps content from being altered without knowing when or why

Slide 22: Inside the Information Package
- Content Information (CI)
  - ‘Content’ Data Object
  - Representation Info
- Preservation Description Information (PDI)
  - Info to manage preservation of Content Info
  - Reference Info
  - Provenance Info
  - Context Info
  - Fixity Info
- Packaging Information
  - Header block of info that binds together an Archival Information Package
  - Binds together: digital object + assoc. metadata
- Descriptive Information
  - Metadata for resource discovery
  - Assists finding aids
  - An abstract?
  - Derived from: CI & PDI

Slide 23: Conclusion
- The report extended the OAIS Information Model to define a framework of metadata elements that implement the concept.
- It focused on only 2 areas critical to preserving a Data Object: Content Information (CI) and Preservation Description Information (PDI).

Slide 24: What’s Next to Do?
- Develop ‘best practices’ toward populating a database archive.
  - Assess degree of technical richness
  - Develop automated algorithms
  - Determine scope of sharing
- Later move from ‘best practices’ to a formalized standard of processes.