SCIDIP-ES Components Oct 22-23 2014,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation.

Slides:



Advertisements
Similar presentations
Criteria for the trustworthiness of data centres Jens Klump Helmholtz Centre Potsdam German Research Centre for Geosciences (GFZ) DataCite Summer Meeting.
Advertisements

CASPAR Validation. Metrics CASPAR Approach Representation Information (RepInfo) RepInfo Networks and their maintenance.
Curating Research: problems and policy Dale Peters Scientific Technical Manager DRIVER II.
CASPAR Preservable Infrastructure Addressing Preservation with an OAIS based Infrastructure Luigi Briguglio Engineering R&D Laboratory – Rome (Italy) 3rd.
Pulling it all together… with thanks to Sheila Anderson.
Co-funded by the European Union under FP7-ICT Co-ordinated by aparsen.eu #APARSEN Sustainability and the APARSEN Network of Excellence: Preservation.
DigCCurr 2007: What digital curators do and what they need to know The CASPAR view on: What digital curators do and what they need to know : Research Perspectives.
Co-funded by the European Union under FP7-ICT Alliance Permanent Access to the Records of Science in Europe Network Co-ordinated by aparsen.eu #APARSEN.
Co-funded by the European Union under FP7-ICT Alliance Permanent Access to the Records of Science in Europe Network Co-ordinated by aparsen.eu #APARSEN.
Digital Preservation Sustainability on the EU Policy Level Elevator Pitches.
Project Overview APA Conference 2012 ESA/ESRIN (Frascati), 6-7 November 2012 D. Giaretta (APA)
Institutional Repositories It’s not Just the Technology New England Archivists Boston College March 11, 2006 Eliot Wilczek University Records Manager Tufts.
CODATA 2006, Beijing, China Oct CASPAR: Early results and future goals David Giaretta.
Digital Preservation DAVID GIARETTA (APA) FIRST PRELIDA WORKSHOP, TIRRENIA, JUNE 25TH-- ‐ 27TH,2013.
SCIDIP-ES services and toolkits David Giaretta. Preserving digitally encoded information Ensure that digitally encoded information are understandable.
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
PARSE.Insight Framework and Lesson Learned David Giaretta (STFC)
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
ISO & OAI-PMH By Neal Harmeyer, Amy Hatfield, and Brandon Beatty PURDUE UNIVERSITY RESEARCH REPOSITORY.
Project Overview APA Conference 2012 ESA/ESRIN (Frascati), 6-7 November 2012 M. Albani (European Space Agency), Project Coordinator.
Current Thinking on Digital Preservation: Role of Metadata Oya Y. Rieger Coordinator, Library Office of Distributed Learning Cornell University Library.
Future Access to the Scientific and Cultural Heritage – A shared Responsibility Birte Christensen-Dalsgaard State and University Library.
E-IRG Open Workshop on e-Infrastructures 4-5 Oct 2006 CASPAR Project Digital Preservation and Digital interoperability.
Co-funded by the European Union under FP7-ICT Alliance Permanent Access to the Records of Science in Europe Network Co-ordinated by aparsen.eu #APARSEN.
Who is doing a good job in digital preservation? Audit and Certification of Digital Repositories: ISO and the European Framework.
An Overview of Selected ISO Standards Applicable to Digital Archives Science Archives in the 21st Century 25 April 2007 Donald Sawyer - NASA/GSFC/NSSDC.
David Giaretta Associate Director (Development) Funders: DCC Development Digital Curation Centre a centre of expertise in data curation and preservation.
Project Overview APA Conference 2012 ESA/ESRIN (Frascati), 6-7 November 2012 M. Albani (European Space Agency), U.Di Giammatteo (ACS), D. Giaretta (APA)
Science Archives in the 21st Century 25/26 April Towards an International standard for Audit and Certification of Digital Repositories David Giaretta.
Recordkeeping for Good Governance Toolkit Digital Recordkeeping Guidance Funafuti, Tuvalu – June 2013.
APARSEN Metadata for preservation, curation and interoperability Workshop on Research Metadata in Context 7-8 Sept 2010, Nijmegen David Giaretta APA and.
Reference Model for an Open Archival Information System (OAIS) ESIP Summer Meeting John Garrett – ADNET Systems at NASA/GSFC ESIP Summer Meeting.
Archival Information Packages for NASA HDF-EOS Data R. Duerr, Kent Yang, Azhar Sikander.
CASPAR Framework and Lessons Learned David Giaretta.
Digital Preservation MetaArchive Cooperative.  9:00-9:45 - Session 1: Digital Preservation Overview  9:45-11:00 - Session 2: Policy & Planning Overview.
Digital Preservation: Current Thinking Anne Gilliland-Swetland Department of Information Studies.
Automation in Digital Preservation: Three Scenarios Milena Dobreva 1, Yunhyong Kim 2, Gillian Oliver 3, Seamus Ross 2, Raivo Ruusalepp 4 1 Centre for Digital.
Small steps and lasting impact: making a start with preservation or It’s not all NASA Patricia Sleeman Digital Archives and Repositories University of.
Co-ordinated by aparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT The importance of interoperability and intelligibility in digital.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
M-1 INGEST OVERVIEW Don Sawyer National Space Science Data Center NASA/GSFC October 13, 1999.
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
The OAIS Reference Model Michael Day, Digital Curation Centre UKOLN, University of Bath Reference Models meeting,
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
The OAIS Reference Model and Trustworthy Repositories Josh Lubell Manufacturing Engineering Laboratory NIST
Data Preservation at Rutherford Lab David Corney 9 th July 2010 KEK.
An overview of the Reference Model for an Open Archival Information System (OAIS) Michael Day, Digital Curation Centre UKOLN, University.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Fulvio Marelli - ESA and future An example of data lifecycle: sensed data need to be acquired…
BNSC Agency Report David Giaretta Colorado Springs 16 Jan 2007.
Store and exchange data with colleagues and team Synchronize multiple versions of data Ensure automatic desktop synchronization of large files B2DROP is.
DP Knowhow: Introduction to Audit and Certification in ISO APARSEN-EGI Community Workshop on Managing, Computing and Preserving Big Data for Research.
Co-funded by the European Union under FP7-ICT Alliance Permanent Access to the Records of Science in Europe Network Co-ordinated by aparsen.eu #APARSEN.
PV 2009, ESAC, Spain, 1-3 Dec Long term data and knowledge preservation for the Earth Sciences Archive S. ALBANI (ESA) D. Giaretta (STFC) PV 2009.
Co-funded by the European Union under FP7-ICT Alliance Permanent Access to the Records of Science in Europe Network aparsen.eu #APARSEN Options.
Co-funded by the European Union under FP7-ICT Co-ordinated by aparsen.eu #APARSEN CoE offerings Simon Lambert STFC All Hands Meeting, Amsterdam,
Co-ordinated by aparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT Services and Sustainability David Giaretta,
DP Knowhow: Open Archival Information Systems (OAIS) in ISO APA/C-DAC International Conference on Digital Preservation and the Development of Trusted.
Digital Sustainability on the EU Policy Level
Digital Sustainability on the EU Policy Level
WP14 Common Testing Environments
Dependency Management
D33.1B PEER REVIEW OF DIGITAL REPOSITORIES
CASPAR Cultural, Artistic and Scientific knowledge for Preservation Access and Retrieval.
Active Data Management in Space 20m DG
Implementing an Institutional Repository: Part II
Open Archival Information System
Digital Preservation and Trusted Digital Repositories
Implementing an Institutional Repository: Part II
How to Implement an Institutional Repository: Part II
Presentation transcript:

SCIDIP-ES Components Oct ,Brussels

Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation Information includes emulation Transform more specific than “migrate” Hand over to another repository

When things change We need to: Know something has changed Identify the implications of that change Decide on the best course of action for preservation What RepInfo we need to fill the gaps Created by someone else or creating a new one If transformed: how to maintain data authenticity Alternatively: hand it over to another repository Make sure data continues to be usable Orchestration Service Gap Identification Service Preservation Strategy Tk RepInfo Registry Service Authenticity Toolkit Storage Service Data Virtualisa tion Toolkit Process Virtualisa tion Toolkit RepInf o Toolkit

ThreatRequirement for solution Users may be unable to understand or use the data e.g. the semantics, format, processes or algorithms involved Ability to create and maintain adequate Representation Information Non-maintainability of essential hardware, software or support environment may make the information inaccessible Ability to share information about the availability of hardware and software and their replacements/substitutes The chain of evidence may be lost and there may be lack of certainty of provenance or authenticity Ability to bring together evidence from diverse sources about the Authenticity of a digital object Access and use restrictions may make it difficult to reuse data, or alternatively may not be respected in future Ability to deal with Digital Rights correctly in a changing and evolving environment Loss of ability to identify the location of data An ID resolver which is really persistent The current custodian of the data, whether an organisation or project, may cease to exist at some point in the future Brokering of organisations to hold data and the ability to package together the information needed to transfer information between organisations ready for long term preservation The ones we trust to look after the digital holdings may let us down Certification process so that one can have confidence about whom to trust to preserve data holdings over the long term RepInfo toolkit, Packager and Registry – to create and store Representation Information. In addition the Orchestration Manager and Knowledge Gap Manager help to ensure that the RepInfo is adequate. Registry and Orchestration Manager to exchange information about the obsolescence of hardware and software, amongst other changes. The Representation Information will include such things as software source code and emulators. Authenticity toolkit will allow one to capture evidence from many sources which may be used to judge Authenticity. Packaging toolkit to package access rights policy into AIP Persistent Identifier system: such a system will allow objects to be located over time. Orchestration Manager will, amongst other things, allow the exchange of information about datasets which need to be passed from one curator to another. Certification toolkit to help repository manager capture evidence for ISO Audit and Certification

APARSEN test audit findings Lack of definition of Designated Community Lack of adequate Representation Information Inadequate Archival Information Packages Lack of hand-over plans

SCIDIP-ES in brief Upgrade CASPAR prototype components into scalable, robust e- infrastructure components to support digital preservation of all types of digital objects decentralised, heterogeneous, asynchronous, no single point of failure Persistent, simple re- implementable interfaces critical mass of users: Earth science as initial focus Other disciplines via APA DIGITAL PRESERVATION RESEARCH needed to create the tools needed to create the “metadata” used by the e-infrastructure and user applications. Tools may be domain dependent. Must include Rep. Info. Network of the metadata SCIence Data Infrastructure for Preservation – with focus on Earth Science Storage Service Gap Identification Service Orchestration Service RepInfo Registry Service Preservation Strategy Toolkit Process Virtualisation Toolkit Finding Aid Toolkit Cloud Storage Persistent ID i/f Service External PI services ISO Certification Organisation Certification Toolkit External Access/Use Services E- INFRASTRUC TURE TOOLKIT S Archives User applications Domain independent Infrastructure counters threats identified by PARSE.Insight based on CASPAR prototypes Consistent with APARSEN integrated view Will help archives with certification

Conclusions: Services and toolkits help repositories to… share the effort of preservation address major threats to digital preservation by supplementing what they currently do proof from CASPAR and PARSE.Insight applicable to all types of digital objects become trustworthy add value to digital holdings

END

Add Representation Information OAIS introduces the concept of Representation Information Information to help understand the digitally encoded object - includes emulators bit-level descriptions dictionaries Ideally description allows automated extraction of information In general if a digital object is no longer usable/understandable adding Representation Information digital can often solve the problem

Migration OAIS defines various types of Migration: Do not change the bits Refresh Replicate Change the packaging but not the content Repackage Change the content Transform (usually non-reversible) Need to consider “Transformational Information Properties” – important for AUTHENTICITY Related to “Significant properties” Add appropriate Representation Information for the new format

AND – be prepared to Hand-over Preservation requires funding Funding for a dataset (or a repository) may stop Need to be ready to hand over everything needed for preservation OAIS (ISO 14721) defines “Archival Information Package (AIP) which brings together everything needed for long term preservation With information which covers Understandability Authenticity How things are packaged together Not a one-off Need to ensure that Understandability (for the Designated Community) is maintained Needs a support system

Preservation Planning Processes Scoping Formulation Impl ESA, Rome14/11/2013

Design Preservation Network Model (PNM) Capture PNM properties cost, risks, objectives, decisions, actions links to metric evidence… Evaluate and select preservation solution/s ESA, Rome14/11/2013 Formulation Preservation Strategies Toolkit

ESA, Rome14/11/2013 Implementation Design RepInfo Network Create RepInfo objects Capture RepInfo properties façade to various tools Search, re-use and share Registry objects Maintain registry objects Repinfo Toolkit