1 CS 502: Computing Methods for Digital Libraries Lecture 27 Preservation.

Presentation transcript:

1 CS 502: Computing Methods for Digital Libraries Lecture 27 Preservation

2 Administration
• Online survey
• Course evaluations at end of class today

3 Long-term preservation
Objective
Retain digital library materials over centuries.
Longer than...
• computer architectures (Wintel, Linux, 390, ...)
• magnetic storage (disks, tapes, ...)
• formats, protocols, applications (Unicode, Java, XML, ...)
• the Internet or the web
For purposes that we have not yet considered.

9 Levels of preservation
• Preserve the full look and feel of digital material in its context
  e.g., a video game with its hardware
• Preserve content with an access system but migrate the look and feel to new environments
  e.g., successive versions of MS Windows
• Preserve raw content but no software system
  e.g., UTF-8 text with XML/XSL mark-up, but no XML/XSL software
The complexity of preservation varies greatly with the level.

10 Challenges: user needs
Digital information differs from print
• May be useless without its environment.
• Creator and subscriber may not have copies.
• Numerous versions.
Example: a scientific journal on-line
• If the author does not subscribe - no access to own article.
• If the library does not renew subscription - no access to anything.

11 Challenges: technical problems
Technical issues
• Storage media have short life-span.
• Formats and specifications change continually.
• Computing environments are very complex.
Example: personal files
I have retained all my personal computer files since 1984, but have great difficulty in reading some of them.

12 Challenges: economic and legal
Legal
• Archives require permission to save information.
Institutions
• Library of Congress, National Archives, etc. do not provide the same services for electronic information that they provide for physical artifacts.
Example: discontinued serials
What happens if a journal publisher goes bankrupt, or a scientific archive does not get its grant renewed?

13 Technical approaches: 1. Persistent storage

Material                  Approximate life (years)
Acid-free paper           500+
Microfilm                 300
Optical disks             100?
Color film                25-50
CDs                       20?
Magnetic disk and tape    5

• Persistent storage preserves raw content only.
• Research in high-volume, long-term digital media is lacking.

14 Technical approaches: 2. Copying bits (refreshing)
Refreshing bits
• Repeatedly copy bits from one storage medium to the next.
• A standard technique in data processing.
• Benefits from the rapid fall in prices of storage devices.
• Preserves raw content only.
• Requires active management.
Mirrors
• Have many copies of the same information with independent management.
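The refreshing and mirroring ideas on this slide can be illustrated with a minimal Python sketch. It is not part of the lecture: the function names `sha256_of`, `refresh`, and `mirrors_agree` are hypothetical, and the use of SHA-256 to confirm that a copy is bit-for-bit identical is an assumption chosen for illustration.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def refresh(source: Path, target: Path) -> str:
    """Copy the bits from an old medium to a new one and verify the copy.

    Hypothetical helper: raises OSError if the new copy differs from the old.
    """
    before = sha256_of(source)
    target.parent.mkdir(parents=True, exist_ok=True)
    shutil.copyfile(source, target)
    after = sha256_of(target)
    if before != after:
        raise OSError(f"refresh of {source} failed: digests differ")
    return after


def mirrors_agree(copies: list) -> bool:
    """Return True if every mirrored copy has the same digest."""
    return len({sha256_of(Path(copy)) for copy in copies}) == 1
```

Run on a schedule, a check like `mirrors_agree` is one way to provide the active management the slide calls for: a disagreement between mirrors signals that at least one copy has decayed or been altered.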

15 Technical approaches: 3. Migration of content
Migration
• Retain content but change formats and representations to keep current with technology.
• Used by journal publishers.
• Preserves content and an access system.
Example: pension funds
The Social Security Administration has records of every FICA payment, which migrate between systems over many years.
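As a small illustration of migration, the following Python sketch re-encodes text from an older character encoding into UTF-8 and records the change. It is an assumption-laden example, not the lecture's method: the function name `migrate_text` and the shape of the event record are hypothetical, and real migrations usually involve far more complex formats than plain text.

```python
from datetime import date


def migrate_text(raw: bytes, old_encoding: str, new_encoding: str = "utf-8"):
    """Re-encode text and return the migrated bytes plus a record of the change.

    Hypothetical helper: the content (the characters) is retained,
    only its representation (the byte encoding) changes.
    """
    text = raw.decode(old_encoding)        # interpret the old representation
    migrated = text.encode(new_encoding)   # write the new representation
    event = {
        "event": "migration",
        "from": old_encoding,
        "to": new_encoding,
        "date": date.today().isoformat(),
    }
    return migrated, event
```

For instance, `migrate_text("café".encode("latin-1"), "latin-1")` returns bytes that decode as UTF-8 to the same four characters, together with an event that could be appended to the item's change history.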

16 Technical approaches: 4. Emulation
Concept
• Record a full specification of the computing environment in which the digital information was created.
• At some time in the future, emulate the original computing environment.
Would preserve full look and feel.
Clearly not practical for complex computing systems
• Emulation is never perfect.
• Computing environments are remarkably complex.
But may be useful for parts of systems, e.g., the Java virtual machine.

17 Technical approaches: 5. Digital archeology
After periods of neglect, archeologists are needed
• Recover data from old media.
• Reverse engineer lost formats and specifications.
• Experts in digital paleography (reading archaic scripts and formats).
Example: East Germany
German archivists are reconstructing the records of the East German state from worn-out tapes, broken computer systems, undocumented databases, and the recollections of staff.

18 Preservation at publication
This is a period of experimentation and change in formats, protocols, object models, etc.
Some information is easier to preserve than other information. Longevity is more likely if:
• Formats are widely used, in important applications.
• Methods are simple, without using obscure options.
• Coding schemes are easy to interpret.
Example: Internet RFC Series
The Internet RFC Series uses text/ascii. The RFCs go back to 1969 and have no preservation problems. A few RFCs are in PostScript and are already hard to decipher.

19 Metadata
Digital information needs interpretation
• Self-documentation is always good.
• Persistent identification is vital.
• Simple, standard metadata has a chance of long life.
• Authentication of material need not be complex (e.g., hash).
• History of changes (e.g., migration to different format).
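A minimal Python sketch of a metadata record along these lines is shown below. It is illustrative only: the class `PreservationRecord`, its field names, and the helpers `describe` and `is_authentic` are hypothetical, and SHA-256 is an assumed choice of hash, since the slide says only "hash".

```python
import hashlib
from dataclasses import dataclass, field
from typing import List


@dataclass
class PreservationRecord:
    """Simple metadata kept alongside a stored object (hypothetical schema)."""
    identifier: str                  # persistent identifier for the object
    media_type: str                  # how the bits should be interpreted
    sha256: str                      # digest used to authenticate the content
    history: List[str] = field(default_factory=list)  # record of changes


def describe(identifier: str, media_type: str, content: bytes) -> PreservationRecord:
    """Create a record for the given content, computing its digest."""
    return PreservationRecord(identifier, media_type, hashlib.sha256(content).hexdigest())


def is_authentic(record: PreservationRecord, content: bytes) -> bool:
    """Return True if the content still matches the digest in the record."""
    return hashlib.sha256(content).hexdigest() == record.sha256
```

Each migration would append a line to `history` and store a fresh digest for the new representation, so that later readers can both verify the bits and trace how they came to be in their current format.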

20 Preservation of specifications
Digital information needs a context
Therefore store the specifications of:
• Formats
• Database designs
• Technical documentation
• User manuals
...on high-quality archival materials, e.g., paper.

21 Final word
Long-term preservation needs people and organizations who want it!