The Century Archive Project “CAP” Technology-Independent Information Storage Steven H. McCown & Michael Leonhardt Storage Technology Corporation 4 April 2002


The Century Archive Project “CAP” Technology-Independent Information Storage Steven H. McCown & Michael Leonhardt Storage Technology Corporation 4 April 2002

McCown & Leonhardt – 4/4/2002
What is a “Document”?
A document is:
  – Letter, check, picture, plot, report, birth certificate, deed...
A document is NOT:
  – Database element
  – Encoded record
  – Encoded object
Perhaps:
  – ASCII record of transaction
  – Image of database table
  – etc.

Documents in a “Paperless” Environment
4.4 M tons of paper printed in 1995 … to 5.9 M in [...]
[...] B sheets from laser printers in 1996 … to 1.2 T sheets in [...]
[...] B sheets from office copiers in [...] to 1.1 T sheets in [...]
[...] billion letters sent
170 billion pages of fiche
60 billion checks processed each year
[...] has created 40% more (personal) printing
+$100 M in corporate revenues adds 8.8 M sheets printed

“To ensure that the media will be readable far into the future, it may be necessary to archive the system along with the media. For a 100-year life, recording systems and sufficient spare parts will need to be archived along with the data storage media. Media with life expectancies greater than 20 years are capable of out-surviving existing recording system technologies.”
-- John Van Bogart, NARA 11th Annual Preservation Conference, “Magnetic Tape Storage”, 1996

Information Management
Long-term storage
  – Defined: in excess of 100 years
  – Inherent to many domains such as genealogy
Information Management strategies
  – Usually based on frequent data migration
  – Poor incorporation of long-term storage
Problem:
  – How to access today’s archives in 100 years or more

Long-Term Storage Wish List
Easy integration with data processing environments
Easy data access
Migration free
Long-life media
  – “no maintenance”
Reader technology independence
Human readable data
Low cost

Current Options
Encode the data and record digitally
  – Magnetic media
  – Optical media
Store unencoded, human readable images
  – Microfilm
Something new - “CAP”

Century Archive Project
Features
  – High density storage of human-readable document images
  – Storage of digitally encoded documents
  – Metadata ascription to aid retrieval
  – Industry standard physical media form factor
  – Patent on concepts and format filed

CAP Operations
Scan document to create electronic (e.g. TIFF) file
Write de-magnified image on optical tape with scanning laser
  – Use WORM optical media
  – Create analog record (human readable)
  – Append new documents as needed
Write digitally encoded document file
Read
  – View magnified image
      – Direct or CCD camera & monitor
  – Recover digital file

Tape Record Layout

Features
Record header with document index and metadata
Updatable Table of Contents
Digital record in addition to analog record
  – Retrieve digital version if compatible reader available
  – Include digital header and TOC
Gray scale Documents
  – Use half-toning technique
Color Documents
  – Store separate images for red, green and blue breakdown
  – Requires three-beam optics for direct color viewing
Stereoscopic images
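The gray-scale and color handling on this slide can be sketched in a few lines. The slide does not say which half-toning technique CAP uses, so the 2x2 ordered (Bayer) dither below is an assumption chosen for brevity; the function names `halftone` and `split_rgb` are hypothetical and not part of any CAP software.

```python
# Illustrative sketch only: ordered dithering is ONE half-toning technique;
# the slides do not state which technique CAP actually uses.

# 2x2 Bayer index matrix (assumed pattern for this example).
BAYER_2X2 = [[0, 2],
             [3, 1]]


def halftone(gray):
    """Convert rows of 0..255 gray values to rows of 0/1 marks.

    1 means a bright mark, 0 means dark. Each pixel is compared with a
    position-dependent threshold, so mid-grays become a dot pattern.
    """
    out = []
    for y, row in enumerate(gray):
        out_row = []
        for x, value in enumerate(row):
            threshold = (BAYER_2X2[y % 2][x % 2] + 0.5) * 255 / 4
            out_row.append(1 if value > threshold else 0)
        out.append(out_row)
    return out


def split_rgb(pixels):
    """Split rows of (r, g, b) tuples into three gray-scale planes."""
    red = [[p[0] for p in row] for row in pixels]
    green = [[p[1] for p in row] for row in pixels]
    blue = [[p[2] for p in row] for row in pixels]
    return red, green, blue


if __name__ == "__main__":
    mid_gray = [[128, 128], [128, 128]]
    print(halftone(mid_gray))  # half of the marks are set for a mid-gray

    color = [[(255, 0, 128), (255, 0, 128)],
             [(255, 0, 128), (255, 0, 128)]]
    for name, plane in zip("RGB", split_rgb(color)):
        print(name, halftone(plane))
```

Under these assumptions a color document becomes three binary images, one per channel, each of which could be written as its own human-readable record.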

Adjacent Digital File
TIFF file is reformatted with ECC and bit encoding
  – Image file compressed using lossless compression
Digital record format on tape:
  – Width same as analog record
  – Track spacing doubled to reduce crosstalk on read-out
  – Length 1.5x to 2x analog record
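As a rough illustration of "lossless compression plus ECC", the sketch below compresses a byte string and protects each 4-bit nibble with a Hamming(7,4) code, which corrects any single flipped bit per 7-bit codeword. Both choices are stand-ins: the slide names neither the compression scheme nor the error-correcting code, and a production tape format would likely use a stronger code. All function names are hypothetical.

```python
import zlib

# Illustrative sketch only: zlib and Hamming(7,4) are assumed stand-ins for
# the unspecified lossless compression and ECC mentioned on the slide.


def hamming74_encode(nibble):
    """Encode 4 data bits as a 7-bit codeword [p1, p2, d1, p3, d2, d3, d4]."""
    d1, d2, d3, d4 = [(nibble >> shift) & 1 for shift in (3, 2, 1, 0)]
    p1 = d1 ^ d2 ^ d4
    p2 = d1 ^ d3 ^ d4
    p3 = d2 ^ d3 ^ d4
    return [p1, p2, d1, p3, d2, d3, d4]


def hamming74_decode(codeword):
    """Correct up to one flipped bit and return the 4 data bits as an int."""
    c = list(codeword)
    s1 = c[0] ^ c[2] ^ c[4] ^ c[6]
    s2 = c[1] ^ c[2] ^ c[5] ^ c[6]
    s3 = c[3] ^ c[4] ^ c[5] ^ c[6]
    error_pos = s1 + 2 * s2 + 4 * s3  # 1-based position, 0 means no error
    if error_pos:
        c[error_pos - 1] ^= 1
    return (c[2] << 3) | (c[4] << 2) | (c[5] << 1) | c[6]


def encode_record(data):
    """Compress losslessly, then emit two Hamming codewords per byte."""
    codewords = []
    for byte in zlib.compress(data):
        codewords.append(hamming74_encode(byte >> 4))
        codewords.append(hamming74_encode(byte & 0x0F))
    return codewords


def decode_record(codewords):
    """Correct single-bit errors, reassemble bytes, then decompress."""
    nibbles = [hamming74_decode(cw) for cw in codewords]
    packed = bytes((hi << 4) | lo for hi, lo in zip(nibbles[0::2], nibbles[1::2]))
    return zlib.decompress(packed)


if __name__ == "__main__":
    image_bytes = bytes([0, 255] * 64)  # placeholder for TIFF file contents
    record = encode_record(image_bytes)
    record[0][2] ^= 1                   # simulate one read-out bit error
    assert decode_record(record) == image_bytes
```

The redundancy added by an error-correcting code, together with the doubled track spacing, is one plausible reason a digital record occupies more tape than the image it encodes.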

Retrieved Image

Tape Format Example
For 8 1/2” wide documents
  – Scan images at 300 dpi
  – Write images on tape at 25,400 dpi
      – 85x reduction in size
  – Document length parallel to tape
      – Accommodates different lengths
  – 5 document tracks across tape
1/2” tape in 3480 cartridge
  – 200m of tape
  – 220,000 image documents
  – 80,000 documents with both image and digital records
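The figures on this slide can be cross-checked with a short calculation. The sketch below assumes 11-inch-long (US Letter) pages, which the slide does not state, and it ignores record headers, the table of contents and inter-record gaps, so its page count is an upper bound and not the slide's 220,000 figure.

```python
import math

# Values taken from the slide.
SCAN_DPI = 300
TAPE_DPI = 25_400
TRACKS = 5
TAPE_LENGTH_M = 200
IMAGE_ONLY_DOCS = 220_000
IMAGE_PLUS_DIGITAL_DOCS = 80_000

# Assumption (not on the slide): every document is 11 inches long.
DOC_LENGTH_IN = 11.0

UM_PER_INCH = 25_400

# Linear reduction from scanned page to image on tape (about 84.7, i.e. ~85x).
reduction = TAPE_DPI / SCAN_DPI

# One written pixel is UM_PER_INCH / TAPE_DPI micrometres wide (1 um here).
pixel_pitch_um = UM_PER_INCH / TAPE_DPI

# Length of one page image along the tape, in micrometres.
image_length_um = DOC_LENGTH_IN * SCAN_DPI * pixel_pitch_um

# Upper bound on page images if the tape held nothing but images.
images_per_track = math.floor(TAPE_LENGTH_M * 1_000_000 / image_length_um)
upper_bound = TRACKS * images_per_track

# Tape used per document with both records, relative to image-only documents.
tape_per_dual_doc = IMAGE_ONLY_DOCS / IMAGE_PLUS_DIGITAL_DOCS
implied_digital_ratio = tape_per_dual_doc - 1  # digital length / analog length

if __name__ == "__main__":
    print(f"reduction: {reduction:.1f}x")
    print(f"image length on tape: {image_length_um / 1000:.1f} mm")
    print(f"upper bound on image documents: {upper_bound:,}")
    print(f"slide figure as share of bound: {IMAGE_ONLY_DOCS / upper_bound:.0%}")
    print(f"implied digital/analog length: {implied_digital_ratio:.2f}")
```

Under these assumptions the bound is roughly 303,000 images, so the slide's 220,000 leaves room for headers and gaps. The implied digital record length of 1.75x the analog record falls inside the 1.5x to 2x range given on the "Adjacent Digital File" slide.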

Storage Costs ($/MB)
Manually intensive
  – Paper - $10.00
  – Microfiche (volumetric improvement only) - $1.20
“Semi-automated” (manually mounted media)
  – Non-automated magnetic tape
  – Microfilm - $.005 (media only, 16 mm)
Full automation
  – Magnetic tape - $.004
  – Optical disk - $.03
  – CAP - $.002

Century Archive Summary
Provides alternative storage method for valued documents, images
  – Direct optical viewing
  – Eliminates drive, media technology migration
  – Robust media options for relaxed environmental storage conditions
Provides digital storage
  – Faster availability
  – Data integration
Complements magnetic tape storage of bulk records

Questions?