Besser--Planning (Brazil) 31/5/01 1 Planning to Maximize Longevity of Digital Information Howard Besser UCLA School of Education & Information

Slides:



Advertisements
Similar presentations
Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
Advertisements

Issues and approaches to preservation metadata Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
Persistent identifiers – an Overview Juha Hakala The National Library of Finland
1 Metadata for Asset Management Peter B. Hirtle Co-Director Cornell Institute for Digital Collections.
1 CS 502: Computing Methods for Digital Libraries Lecture 9 Conversion to Digital Formats Anne Kenney, Cornell University Library.
Fedora 3.0 and METS: A Partnership for the Organization, Presentation and Preservation of Digital Objects Open Repositories Georgia Tech, Atlanta,
Besser--UCLA/Getty Summer Instit-intro 8/6/01 1 UCLA/Getty Summer Institute for Knowledge Sharing (opening) Howard Besser UCLA School of Education & Information.
Besser--NINCH-recent Preservation 12/8/01 1 Recent Digital Preservation Activities Howard Besser UCLA School of Education & Information
Metadata: An Introduction By Wendy Duff October 13, 2001 ECURE.
WMES3103 : INFORMATION RETRIEVAL
Besser--Dublin Core Metadata 2/14/02 1 Dublin Core Metadata Howard Besser UCLA School of Education & Information
All Things to All People Combining Resources to Build an Integrated Digital Repository Preservation and Access for Electronic College and University Records.
Besser--CNI/JISC 6/16/00 1 Projected Changes: Prospect of digitized movies already has some mourning loss of film (SF Chronicle, 3/5/00)
Besser--Digital Longevity 9/2/00 (12/12/99) 1 Planning to Maximize Longevity of Digital Information Howard Besser UCLA School of Education & Information.
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
OLC Spring Chapter Conferences Metadata, Schmetadata … Tell Me Why I Should Care? OLC Spring Chapter Conferences, 2004 Margaret.
1 CS 502: Computing Methods for Digital Libraries Lecture 27 Preservation.
Besser--UC Librarians Metadata 5/19/00 1 Introduction to Metadata for Digital Libraries Howard Besser UCLA School of Education & Information
DAM, 3/30/04 Long Term Preservation in a Digital World Howard Besser NYU Moving Image Archiving & Preservation Program
Besser, frm OAC, 4/17/00 1 OAC as an example of Special Collections Digitization: the Collection, the Institution, Scholarship, Interoperability, Longevity.
Image Metadata Summary of 4/18/99 NISO/DLF Image Metadata Meeting ( Howard Besser UCLA School of Education & Information.
Incompatible or Interoperable? A METS bridge for a small gap between two digital preservation software packages Lucas Mak Metadata & CatalogLibrarian
WORKING WITH DATA ABOUT DATA: Introduction to Metadata | Ms Dot Porter| slide 1 A project of the Introduction to Metadata Working with Data.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
OCLC Online Computer Library Center OCLC’s Digital Archive – Disseminating with METS Jay Goodkin Software Engineer Digital Collection and Preservation.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
WORKFLOWS AND OTHER CONSIDERATIONS FOR DIGITIZATION  Steve Bingo  Processing Archivist Washington State University Libraries  Alex Merrill  Assistant.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
Metadata Considerations Implementing Administrative and Descriptive Metadata for your digital images 1.
Metadata: Essential Standards for Management of Digital Libraries ALI Digital Library Workshop Linda Cantara, Metadata Librarian Indiana University, Bloomington.
In Dublin’s fair city, where the metadata are so pretty… John Roberts Archives New Zealand.
Besser--TextOneZero 5/22/01 1 The New Information Environments: Helping content persist over time Howard Besser UCLA School of Education & Information.
Besser--Colorado Longevity & Policy 6/29/01 1 Digital Longevity: Problems & Policy Colorado Cultural Heritage Collaboration in the Digital Age Howard Besser.
Besser--VALA 2/8/02 1 Moving from Isolated Digital Collections to Interoperable Digital Libraries VALA 2002 Conference Howard Besser UCLA School of Education.
Besser--LITA Dig Imaging Preconference 7/7/00 1 Creating Working Digital Libraries Howard Besser UCLA School of Education & Information
Digital Preservation: Current Thinking Anne Gilliland-Swetland Department of Information Studies.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Besser--ICHIM Milan 9/5/01 1 Preserving Electronic Art: What’s the problem & What can we do about it? Howard Besser UCLA School of Education & Information.
Metadata and Documentation Iain Wallace Performing Arts Data Service.
Introduction to metadata
Best Practices for Digital Imaging and Metadata Roy Tennant The Library, University of California, Berkeley
Storage of digital objects Adolf Knoll National Library of the Czech Republic
Besser--Seybold--Digital Asset Mgmt 8/29/00 1 Digital Asset Management: An Academic View Howard Besser UCLA School of Education & Information
Selene Dalecky March 20, 2007 FDsys: GPO’s Digital Content System.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Besser--CNI/JISC 6/16/00 1 Emerging Standards for Complex Works Howard Besser UCLA School of Education & Information
Digital Preservation across the technologies, strategies, open standards & interoperability aspects including the legal issues Pratik Shrivastava Scientist.
Collecting History: Profiles in Science Alexa T. McCray National Library of Medicine Bethesda, MD Stanford University August 21, 1999.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Corporation For National Research Initiatives Technical Issues in Electronic Publishing Corporation for National Research Initiatives William Y. Arms.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Besser--Moving Image Longevity 3/16/01 1 Moving Image Longevity Howard Besser UCLA School of Education & Information
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Besser--AIC-Electronic Art 6/9/00 1 What’s so Special about Electronic Art?: Issues in Conservation of Digital Works Howard Besser UCLA School of Education.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Digital Stewardship Curriculum
Building A Repository for Digital Objects
Implementing an Institutional Repository: Part II
UCLA School of Education & Information
Planning Digital Projects Smithsonian Archivists Oct 29, 2001
Implementing an Institutional Repository: Part II
How to Implement an Institutional Repository: Part II
Image Metadata Summary of 4/18/99 NISO/DLF Image Metadata Meeting
Presentation transcript:

Besser--Planning (Brazil) 31/5/01 1 Planning to Maximize Longevity of Digital Information Howard Besser UCLA School of Education & Information

Besser--Planning (Brazil) 31/5/01 2 Planning to Maximize Longevity of Digital Info-  Access and Preservation  Why are you Managing this Information?  Key Considerations for Imaging Projects  Important Planning Considerations  Models for Digital Collections  Importance of Metadata Standards  Digital Longevity Issues  More Planning Issues

Besser--Planning (Brazil) 31/5/01 3 Access and Preservation _ Digitizing can serve both Access and Preservation –E.g. Access to digital surrogates saves wear & tear on originals _ But Digitization for Access can be quite different than Digitization for Preservation –Level of detail, scanning quality, extensiveness of resources –And long-term retention of digital works is still an open issue

Besser--Planning (Brazil) 31/5/01 4 Why are you Managing this Information?  Organizational mission & type  Users  Uses

Besser--Planning (Brazil) 31/5/01 5 Key Considerations for Imaging Projects-  Users' Needs  Image Quality  Intellectual Property  Standards  Topology  Tools & Processes

Besser--Planning (Brazil) 31/5/01 6 Key Considerations for Imaging Projects (1 of 3)  Users' Needs – Quality of Digital Surrogate – Interoperable desktop applications  Image Quality – Archival – Current online delivery

Besser--Planning (Brazil) 31/5/01 7 Key Considerations for Imaging Projects (2 of 3)  Intellectual Property  Standards – Modular and Layered Architecture – Terminology – Technical imaging information  Topology

Besser--Planning (Brazil) 31/5/01 8 Key Considerations for Imaging Projects (3 of 3)  Tools & Processes – Scanners – Compression techniques – Linking files – Workflow – Interoperable desktop applications

Besser--Planning (Brazil) 31/5/01 9 Some nuts-and-bolts Planning Considerations  Think about users (and potential users), uses, and type of material/collection  Scan at the highest quality that does not exceed the likely potential users/uses/material  Do not let today’s delivery limitations influence your scanning file sizes; understand the difference between digital masters and derivative files used for delivery  Many documents which appear to be bitonal actually are better represented with greyscale scans  Include color bar and ruler in the scan  Use objective measurements to determine scanner settings (do NOT attempt to make the image good on your particular monitor or use image processing to color correct)  Don’t use lossy compression  Store in a common (standardized) file format  Capture as much metadata as is reasonably possiple (including metadata about the scanning process itself)

Besser--Planning (Brazil) 31/5/01 10 Why Scale is important

Besser--Planning (Brazil) 31/5/01 11 Important Planning Considerations  File Formats  Choosing Interoperable Systems  Adhere to standards  Vendors with large installed base  Refreshing and/or Migration

Besser--Planning (Brazil) 31/5/01 12 Key problems we’re facing  Discovery  Longevity-  Interoperability-

Besser--Planning (Brazil) 31/5/01 13 Serious Longevity Problems  What we know from prior widespread digital file formats  Images separating from their metadata  Inaccessibility of software needed to view an image  Inability to even decode the file format of an image  …return to Longevity problem later-

Besser--Planning (Brazil) 31/5/01 14 Traditional Digital Library Model DL user search & presentation

Besser--Planning (Brazil) 31/5/01 15 Ideal Digital Library Model DL user search & presentation

Besser--Planning (Brazil) 31/5/01 16 For Interoperability Digital Libraries Need Standards  Descriptive Metadata for consistent description  Discovery Metadata for finding  Administrative Metadata for viewing and maintaining  Structural Metadata for navigation ... Terms & Conditions Metadata for controlling access...

Besser--Planning (Brazil) 31/5/01 17 Why are Standards and Metadata consensus important?  Managing digital files over time  Longevity  Interoperability  Veracity  Recording in a consistent manner  Will give vendors incentive to create applications that support this

Besser--Planning (Brazil) 31/5/01 18 Why Standards?  Why do we need standards? – To make information universally available to users – facilitate sharing and interchange of information – To preserve information (make it safe from changes in hardware and software)  Standards only work if communities widely accept them, but they’re necessary for communities to work together

Besser--Planning (Brazil) 31/5/01 19 Questions to Ask  What communities is this standard designed for?  What type of information is this standard designed to handle?  What functions is this standard designed to serve?  What previous standards is it built upon?  Does the standard prescribe how to create new records (or parts of records), or how to map from existing records?  How far does the standard go? Semantics: Does it define element sets? Rules? Syntax?-

Besser--Planning (Brazil) 31/5/01 20 Semantics/Syntax/Structure _ Semantics – meaning, as defined by a community to meet their particular needs (DC) _ Syntax – a systematic arrangement of data elements for machine processing – facilitates the exchange and use of metadata among various applications (HTML, XML, RDF) _ Structure – a formal arrangement of the syntax with the goal of consistent representation of the semantics (rules defining field contents like 1/11/99)

Besser--Planning (Brazil) 31/5/01 21 The Short Life of Digital Info: Digital Longevity Problems-  Disappearing Information  The Viewing Problem  The Scrambling Problem  The Inter-relation Problem  The Custodial Problem  The Translation Problem

Besser--Planning (Brazil) 31/5/01 22 The Viewing Problem  Digital Info requires a whole infrastructure to view it  Each piece of that infrastructure is changing at an incredibly rapid rate  How can we ever hope to deal with all the permutations and combinations

Besser--Planning (Brazil) 31/5/01 23 The Scrambling Problem Dangers from:  Compression to ease storage & delivery  Container Architecture to enhance digital commerce

Besser--Planning (Brazil) 31/5/01 24 The Inter-relation Problem  -Info is increasingly inter-related to other info  -How do we make our own Info persist when it points to and integrates with Info owned by others?  -What is the boundary of a set of information (or even of a digital object)?

Besser--Planning (Brazil) 31/5/01 25 The Custodial Problem  How do we decide what to save?  Who should save it?  How should they save it? – -methods for later access: emulation, migration, etc. – -issues of authenticity and evidence

Besser--Planning (Brazil) 31/5/01 26 The Translation Problem  Content translated into new delivery devices changes meaning – -A photo vs. a painting – -If Info is produced originally in digital form in one encoded format, will it be the same when translated into another format? – Behaviors

Besser--Planning (Brazil) 31/5/01 27 Pieces of the Solution (1/2)  -We need to insist upon clearly readable standardized ways for digital objects to self- identify their formats  -We should discourage scrambling  -We need to better understand information inter-relates to other Info, and what constitutes “boundaries” of Info objects

Besser--Planning (Brazil) 31/5/01 28 Pieces of the Solution (2/2)  -People and organizations wishing to make information persist need guidelines of how to go about doing it  -We need to better understand how translating from one storage or display format to another affects the meaning of a work  -We need to save the “behaviors” of a digital object, not just it’s “contents”

Besser--Planning (Brazil) 31/5/01 29 Conceptual Approaches to Digital Preservation _ Refreshing always necessary due to volatility of physical strata –Impact on evidential value _ Migration -- advantages & disadvantages _ Emulation -- advantages & disadvantages

Besser--Planning (Brazil) 31/5/01 30 Metadata can be the first line of defense  Can tell you – where the file is (if you can’t find the file) – where more info about the file is (if you have the file but most other metadata has become separated) – what the file format is – what the compression scheme is – what application program and version is needed for the file

Besser--Planning (Brazil) 31/5/01 31 Groups Working on the Big Problem  CPA Task Force  Getty “Time & Bits” Conference & Follow-ups-  Emulation experiments in US and Europe  NEDLIB, CURL, Michigan  Mellon-funded E-Journal Archive experiments  Internet Archive  Long Now

Besser--Planning (Brazil) 31/5/01 32 Time & Bits

Besser--Planning (Brazil) 31/5/01 33 Time & Bits Participants  Steward Brand  Howard Besser  Brian Eno  Danny Hillis  Peter Lyman  Brewster Kahle  Kevin Kelly  Jaron Lanier  Doug Carlston  John Heilemann  Ben Davis  Margaret MacLean  Bruce Sterling  Paul Saffo

Besser--Planning (Brazil) 31/5/01 34 Groups Working on Pieces of the Big Problem  Internet Archive  Long Now  Emulation experiments in US and Europe  NEDLIB, CURL, Michigan

Besser--Planning (Brazil) 31/5/01 35 Journal Archiving _ License, don’t own; may not be even able to obtain right to make archival copy _ Increasingly no paper back-up at all _ Usually we don’t have the important redundancy factor _ Stanford’s LOCKSS Project (Lots of Copies Keeps Stuff Safe) and its problems (

Besser--Planning (Brazil) 31/5/01 36 Migration/Refreshing  Impact on evidential value

Besser--Planning (Brazil) 31/5/01 37 More Planning Issues _ Image Families _ Behaviors _ Persistent Identification

Besser--Planning (Brazil) 31/5/01 38 Identification/Provenance (Images)-  The number of variant forms of a work can be enormous  Image Families  A digital image frequently has many layers of parentage  Information about the parentage that can indicate the quality and veracity of the image (Dublin Core "Source" and "Relation")  how to deal with different versions derived from the same scan or different encoding schemes  Vocabulary Standards to express this

Besser--Planning (Brazil) 31/5/01 39 The number of variant forms of a work can be enormous  different views of the same object  different scans of the same photo  different resolutions  different compression schemes  different compression ratios  different file storage formats  different details of the same image ...

Image Families

Besser--Planning (Brazil) 31/5/01 41 Identification/Provenance  how to deal with different versions (browse, hi-res, medium res) derived from the same scan or different encoding schemes (TIFF, PICT, JFIF)  Vocabulary Standards to express this – VRA Surrogate Categories – CIMI's "Image Elements”

Besser--Planning (Brazil) 31/5/01 42 MOA II Behaviors  Navigation  Display/Print

Besser--Planning (Brazil) 31/5/01 43 MOA II Best practices  Use/Users/Collection:  Benchmarking  Masters vs. Derivatives  Scanning-  Administrative Metadata-  Structural Metadata-

Besser--Planning (Brazil) 31/5/01 44 To deal with Immediately _ Persistent IDs _ Metadata

Besser--Planning (Brazil) 31/5/01 45 Persistent IDs--the Problem _ Need to separate work ID from work location _ URNs probably won’t be ready until 2003 _ Becomes a business process issue when one organization maintains the resource and another organization references it (ie. licensed from vendors or managed by separate administrative structures)

Besser--Planning (Brazil) 31/5/01 46 More Persistent IDs --the Approach for today _ PURLs _ Handles _ HTTP redirects _ And worry about costs now and conversion costs when URNs become feasible

Besser--Planning (Brazil) 31/5/01 47 Data Set Management More issues with referencing IDs _ References for mirror sites _ References for back-up sites when main site is down or bottle-necked _ References for off-site copies and archival copies

Besser--Planning (Brazil) 31/5/01 48 One Final Question: Who will collect the digital works of today that should become the Special Collections of tomorrow? _ web sites _ zines _ electronic journals _ listserve and discussions _ drafts of works that later become famous

Besser--Planning (Brazil) 31/5/01 49 Howard Besser UCLA School of Education & Information       Planning to Maximize Longevity of Digital Information