Access Across Time: How the NAA Preserves Digital Records Andrew Wilson Assistant Director, Preservation.

Slides:



Advertisements
Similar presentations
Gretchen Gueguen Digital Archivist April 12, 2012.
Advertisements

Australasian Digital Recordkeeping Initiative – Adrian Cunningham
Database System Concepts and Architecture
Policy on digital records preservation in the NSW public sector Cassie Findlay Senior Project Officer, Government Recordkeeping.
The future’s so bright…. DAITSS DIGITAL PRESERVATION SYSTEM: RE-ARCHITECTED, RE- WRITTEN, AND OPEN SOURCE Priscilla Caplan Florida Center for Library Automation.
Data Storage and Security Best Practices for storing and securing your data The goal of data storage is to ensure that your research data are in a safe.
Funded by: © AHDS Sherpa DP – a Technical Architecture for a Disaggregated Preservation Service Mark Hedges Arts and Humanities Data Service King’s College.
Depositing e-material to The National Library of Sweden.
ISO & OAI-PMH By Neal Harmeyer, Amy Hatfield, and Brandon Beatty PURDUE UNIVERSITY RESEARCH REPOSITORY.
Common Use Cases for Preservation Metadata Deborah Woodyard-Robinson Digital Preservation Consultant Long-term Repositories:
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
SOAPI: a flexible toolkit for implementing ingest and preservation workflows Mark Hedges Centre for e-Research, King’s College London Arts and Humanities.
02/12/00 E-Business Architecture
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
CS CS 5150 Software Engineering Lecture 13 System Architecture and Design 1.
Web Development Using ASP.NET CA – 240 Kashif Jalal Welcome to week – 1 of…
A Framework for Distributed Preservation Workflows Rainer Schmidt AIT Austrian Institute of Technology iPres 2009, Oct. 5, San.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Overview of Database Languages and Architectures.
Course Instructor: Aisha Azeem
Data Preservation Best Practices for preserving your research data for future reuse The goal of data preservation is to ensure that your data is in a sustainable.
Different approaches to digital preservation Hilde van Wijngaarden Digital Preservation Officer Koninklijke Bibliotheek/ National Library of the Netherlands.
National Archives of Australia Digital Preservation Update
6/1/2001 Supplementing Aleph Reports Using The Crystal Reports Web Component Server Presented by Bob Gerrity Head.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
“Filling the digital preservation gap” an update from the Jisc Research Data Spring project at York and Hull Jenny Mitcham Digital Archivist Borthwick.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
Components of Database Management System
Nicholas LoulloudesMarch 3 rd, 2009 g-Eclipse Testing and Benchmarking Grid Infrastructures using the g-Eclipse Framework Nicholas Loulloudes On behalf.
Challenges of Digital Media Preservation Karen Cariani, Director Media Library and Archives Dave MacCarn, Chief Technologist.
Configuration Management (CM)
E.Soundararajan R.Baskaran & M.Sai Baba Indira Gandhi Centre for Atomic Research, Kalpakkam.
6/1/2001 Supplementing Aleph Reports Using The Crystal Reports Web Component Server Presented by Bob Gerrity Head.
World Wide Web “WWW”, "Web" or "W3". World Wide Web “WWW”, "Web" or "W3"
Persistent Digital Archives and Library System (PeDALS)
OAIS: From Requirements to Reality at OCLC FLICC / CENDI Symposium, Dec Pam Kircher Product Manager, Digital Archive OCLC Digital & Preservation.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
Digital Preservation across the technologies, strategies, open standards & interoperability aspects including the legal issues Pratik Shrivastava Scientist.
Submitted To: Rutvi sarang Submitted By: Kushal Bhagat.
1 BCS, Oxfordshire, 19 February, 2004 WEB ARCHIVING issues and challenges Deborah Woodyard Digital Preservation Coordinator.
Fedora and the Preservation of University Electronic Records Project NHPRC Electronic Records Research Grant Kevin L. Glick Manuscripts and Archives, Yale.
Archiving and Preservation Michele Kimpton CEO, DuraSpace Bryan Beecher Director, ICPSR DuraSpace Webinar November 2, 2011.
Adrian Janson, Melbourne High School Information Systems, Data and Information, The IPC and Organisations For VCE Software Development ¾, 2007.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
 ReadSoft 2004 Processing census forms.  ReadSoft 2004 ReadSoft Corporate Profile n Swedish company - founded1991 n Listed in Stockholm stock exchange.
Modern Programming Language. Web Container & Web Applications Web applications are server side applications The most essential requirement.
DAITSS and the Florida Digital Archive Priscilla Caplan Florida Center for Library Automation iPRES 2006.
ARIADNE is funded by the European Commission's Seventh Framework Programme Archiving and Repositories Holly Wright.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Types of Software Chapter 2.
Verification & Validation
A computer contains two major sets of tools, software and hardware. Software is generally divided into Systems software and Applications software. Systems.
Hands-On Microsoft Windows Server 2008 Chapter 7 Configuring and Managing Data Storage.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Chang, Wen-Hsi Division Director National Archives Administration, 2011/3/18/16:15-17: TELDAP International Conference.
The information systems lifecycle Far more boring than you ever dreamed possible!
IT 5433 LM1. Learning Objectives Understand key terms in database Explain file processing systems List parts of a database environment Explain types of.
Breeda Herlihy, IR Manager, UCC Library. UCC selected DSpace in 2008 Software selection group Staff from Library IT, Computer Centre, Special Collections,
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Identify internal hardware devices (e. g
An Approach to Software Preservation
Database System Concepts and Architecture
DAITSS and the Florida Digital Archive
ICT meeting Business needs
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Implementing an Institutional Repository: Part II
Robin Dale RLG OAIS Functionality Robin Dale RLG
Implementing an Institutional Repository: Part II
How to Implement an Institutional Repository: Part II
Executive Sponsor: Tom Church, Cabinet Secretary
Presentation transcript:

Access Across Time: How the NAA Preserves Digital Records Andrew Wilson Assistant Director, Preservation

What I will talk about NAA Context Some Concepts NAA Implementation NAA Process flow Preservation Software Platform (Xena)

National Archives of Australia Established 1946 as part of National Library Independent since 1960 Legislation: Archives Act 1983 Approx. 420 staff in 12 locations Budget ca. A$65 million. 350 shelf kilometres of records Separate preservation funding since 2001

Digital Preservation Project Started in 2001/2 FY Cost to date approx. A$2 million (30 months) Aim: to develop viable approach to the preservation of ‘born digital’ records for long term accessibility and use Deadline: July 2003

Note NOT a digital archive but an approach to digital preservation Well developed archival processes that can be applied to records irrespective of format: - appraisal/selection - transfer - description - retrieval and access Project purely about preservation

Some Definitions Records Recorded information created or received and maintained by an organisation in the transaction of business Digital Records Records in digital form processed by computers Not: Systems or working applications

The preservation problem Technological obsolescence –Hardware –Software Restrictions on the use of technology

Traditionally Researcher directly experiences the record through its source object Preserve the object and you preserve the record ObjectResearcher

Researcher experiences the record through a performance Preserve the performance and you preserve the record But…digital records are performances SourceProcessPerformanceResearcher

A Two Part Solution 1.Keep a master copy of every source we accept into custody -Passive Access -Researcher gets the 'Zeros and Ones', not the performance 2.Active Intervention to recreate the performance -Replace the source and process -Active Access to the 'essence' of the performance -Based on experience with Audiovisual material

The essence of the record What we want to preserve out of the performance -What aspects are essential to the record's value? -What aspects are incidental to the record's value?

Our preservation approach Select open and well documented data formats Migrate records into these formats (‘normalisation’) Support open source software tools that can read these formats

Preservation System 3 separate components 1. Quarantine 2. Preservation 3. Storage All components physically separated from each other and all other NAA networks Access to hardware restricted to digital preservation staff

Quarantine server Records Transfer written to server Digital Preservation recorder captures information about actions on each digital object Transport medium stored on repository shelf for at least 4 weeks Objects then re-checked for viruses using new virus definitions Checked objects written to transport medium DPr Checksums verified Objects undergo virus check For the technically minded: - Dell PowerEdge 2600 server - 2 x 2GHz processors -.7Tb disk store - independent UPS

Preservation server Transport medium is attached to preservation server Digital Preservation recorder captures information about actions on each digital object Output sets written to new transport media DPr Preservation software platform (Xena) processes digital objects Xena outputs two new objects and calculates new checksums for each: 1.Wrapped bitstream 2.‘Normalised version’ For the technically minded: - Dell PowerEdge 2600 server - 2 x 2GHz processors -.7Tb disk store - independent UPS

Digital Repository Transport media are attached to repository server Digital Preservation recorder captures information about actions on each digital object Third copy on digital tape which is stored offsite DPr RAID Storage 2 copies on RAID storage - Configured as RAID 10 - Automated, regular, frequent verification of checksums Simple management application to allow access to digital objects (eg. DSpace) For the technically minded: - Dell PowerEdge 2600 server - 2 x 2GHz processors -.7Tb disk store - fibre channel between server and RAID - independent UPS Copies written to new media for access To Access

NAA Implementation 1.Follows Open Archival Information System framework 2.Non-proprietary, open source solution 3.Based on the extensible markup language (xml)

xena = xml electronic normalising of archives

xena File-based Java/Swing application Runs in Java Packaged as an executable.jar file Modular Multiple document interface

xena functionality: File format guessing File ‘normalisation’ XML encapsulation Process and data verification File viewing A core module plus ‘plug-in’ modules which do:

Core module The core consists of: I. Graphical User Interface components II. Plug-in management components III. Generic validation components

Plug-in modules Plug-ins are created for identified data types that are to be processed. Each plug-in consists of: –A guesser component –One or more input format type components –A normalised format type –One or more normalisation modules –One or more view components –Sorting functionality –Validation functionality –Printing functionality –GUI interaction methods

DEMONSTRATION OF XENA

Contacts Andrew Wilson Project Manager AtoR Digital Preservation Project Web: /preservation/summary.html

THANK YOU ANY QUESTIONS?