Archiving and Preservation Prepared by: Christopher Eaker, University of Tennessee, Knoxville CC BY-NC
Learning Objectives 1.Explain options for a long-term sustainable preservation strategy for your data 2.Identify types of repositories and archives (for different disciplines, institutional, etc.) 3.Choose appropriate subject repository for long term storage of data 4.Understand process issues for depositing data in a repository Module 7: Archiving and Preservation
Preservation Levels Bit stream copying Refreshing Migration Emulation Standardization Digital Archaeology
Bitstream Copying 1.Definition 2.Relationship to other preservation methods 3.Where it is applied 4.Important considerations 5.Pros 6.Cons
Refreshing 1.Definition 2.Relationship to other preservation methods 3.Where it is applied 4.Important considerations 5.Pros 6.Cons OldNew
Migration 1.Definition 2.Relationship to other preservation methods 3.Where it is applied 4.Important considerations 5.Pros 6.Cons
Emulation 1.Definition 2.Relationship to other preservation methods 3.Where it is applied 4.Important considerations 5.Pros 6.Cons
Standardization 1.Definition 2.Relationship to other preservation methods 3.Where it is applied 4.Important considerations 5.Pros 6.Cons
Digital Archaeology 1.Definition 2.Relationship to other preservation methods 3.Where it is applied 4.Important considerations 5.Pros 6.Cons Forensic Recovery of Evidence Device (F.R.E.D.)
Data Repositories Discipline-specific repositories – NSIDC Data Center (cryospheric data) – GenBank (genetic sequencing data) Institutional Repositories – Trace Databib.org (registry) ONEMercury (federated search engine)
Module 7: Archiving and Preservation
Exercise Review UT’s Research Data Policy again with respect to preservation of research data. Does it have limitations? What are some ramifications of following this policy?
Exercise Search Databib.org for a repository relevant to your discipline and attempt to download a dataset from the repository. (FYI, you may run into a lot of repositories that are down because of the government shutdown.)
Exercise What descriptive information would you need to provide with a data set for someone 20 years in the future to understand it and be able to use it? Think back to the metadata module this morning.