Automation and Scalability in Digital Preservation


Automation and Scalability in Digital Preservation
William Kilbride
william@dpconline.org

What’s the problem?
Digital data is growing along (at least) three axes:
  Volume …
  Complexity …
  Importance …
15/01/2019 timbusproject.net © 2011

What’s the problem?
Urgency to act – digital preservation is intolerant of gaps
Budget and capacity are not keeping pace with this growth:
  Repetitious processing
  Bottlenecks, especially at ingest
  Manual intervention
  Staff costs
  Rapid development of tools

Automation 1: tools for characterisation
Making sense of what you’ve got at ingest:
  PRONOM
  DROID
  JHOVE
  FITS
  UDFR (?)
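
As a rough illustration of what identification at ingest involves, the sketch below matches the first bytes of a file against a handful of well-known signatures. It is not how DROID, JHOVE or FITS are implemented; the signature table is a tiny hand-picked sample, and the names `SIGNATURES`, `identify` and `identify_file` are hypothetical.

```python
# Minimal signature-based format identification, assuming that a short
# list of leading "magic bytes" is enough for the formats of interest.
# Registry-driven tools use far richer signatures than this.
SIGNATURES = [
    (b"%PDF-", "PDF"),
    (b"\x89PNG\r\n\x1a\n", "PNG"),
    (b"GIF87a", "GIF"),
    (b"GIF89a", "GIF"),
    (b"\xff\xd8\xff", "JPEG"),
    (b"PK\x03\x04", "ZIP container"),
]


def identify(header: bytes) -> str:
    """Return a format name for the given leading bytes, or 'unknown'."""
    for magic, name in SIGNATURES:
        if header.startswith(magic):
            return name
    return "unknown"


def identify_file(path: str, probe: int = 16) -> str:
    """Read the first `probe` bytes of a file and identify them."""
    with open(path, "rb") as fh:
        return identify(fh.read(probe))
```

Anything that comes back as "unknown" is a candidate for manual inspection, which is where the staff cost at ingest tends to concentrate.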

Automation 2: tools for testing and management
Making one rational decision for many objects:
  DELOS Preservation Testbed
  LIFE 3
  PLATO 3
  PLATO 4
  APARSEN
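
The idea of one decision for many objects can be sketched as grouping objects by identified format and looking up a single action per group. This is only an illustration of the principle, not the method of any of the tools listed; the function `plan_actions`, the policy table and the action names are all hypothetical.

```python
from collections import defaultdict


def plan_actions(objects, policy, default="review"):
    """Group (name, format) pairs by format and attach one action per group.

    `policy` maps a format name to an action; formats without an entry
    fall back to `default`, meaning a person has to decide.
    """
    groups = defaultdict(list)
    for name, fmt in objects:
        groups[fmt].append(name)
    return {
        fmt: {"action": policy.get(fmt, default), "objects": sorted(names)}
        for fmt, names in groups.items()
    }


# Hypothetical collection and policy, for illustration only.
collection = [
    ("report.pdf", "PDF"),
    ("scan-001.tif", "TIFF"),
    ("minutes.pdf", "PDF"),
    ("notes.wpd", "WordPerfect"),
]
policy = {"PDF": "keep", "TIFF": "keep", "WordPerfect": "migrate"}
plan = plan_actions(collection, policy)
```

The decision is made once per format rather than once per file, so the human effort scales with the variety of the collection rather than its size.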

Automation 3: tools for e-discovery
Forensics tools are becoming more popular for ingest and management of large heterogeneous collections:
  FTK
  Sleuth Kit
  FIDO
  BitCurator

Automation 4: automated workflows
Where possible, entire workflows from ingest to dissemination are being automated:
  ADS OASIS project
  UK Web Archive
  Digital continuity
Quality assurance is the key: getting the right kind of human intervention
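
One way to picture "the right kind of human intervention" is a workflow that runs every step automatically and sets aside only the items that fail a check. The sketch below assumes each step is a plain function that raises an exception on failure; `run_workflow` and the two example steps are hypothetical and do not describe any of the projects named above.

```python
def run_workflow(items, steps):
    """Run each item through all steps; collect failures for human review."""
    completed, needs_review = [], []
    for item in items:
        try:
            result = item
            for step in steps:
                result = step(result)
            completed.append(result)
        except Exception as exc:
            # Keep the original item and the reason, so a person can
            # decide what to do without re-running the whole batch.
            needs_review.append((item, str(exc)))
    return completed, needs_review


# Hypothetical steps, for illustration only.
def check_not_empty(record):
    if record["size"] == 0:
        raise ValueError("zero-length file")
    return record


def assign_identifier(record):
    return {**record, "id": f"obj-{record['name']}"}


batch = [{"name": "a.pdf", "size": 10}, {"name": "b.pdf", "size": 0}]
done, review = run_workflow(batch, [check_not_empty, assign_identifier])
```

Here `done` holds the records that passed every step and `review` holds the exceptions, so staff time goes to the problem cases only.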

Automation 5: automating access
Making digital preservation invisible to end users:
  Memento – time travel for the web
  Digital continuity
Making stuff easy to find, and therefore making its value self-evident
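
In Memento, a client asks for a past version of a web resource by sending an Accept-Datetime request header carrying an HTTP date. The sketch below only builds that header and sends nothing over the network; the function name `accept_datetime_header` is hypothetical, and which service the request would go to is left open.

```python
from datetime import datetime, timezone
from email.utils import format_datetime


def accept_datetime_header(when: datetime) -> dict:
    """Build the Accept-Datetime header for the requested moment.

    `when` should be timezone-aware; a naive datetime is interpreted
    as local time by astimezone().
    """
    utc = when.astimezone(timezone.utc)
    # usegmt=True yields the HTTP date form, ending in "GMT".
    return {"Accept-Datetime": format_datetime(utc, usegmt=True)}


headers = accept_datetime_header(datetime(2011, 1, 1, 12, 0, tzinfo=timezone.utc))
```

The end user never needs to see any of this: a browser plug-in or service can add the header on their behalf, which is what makes the preservation layer invisible.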

Automation 6: interoperability
Interoperability of preservation tools and services:
  PLANETS preservation interop framework
  CDL Micro-services framework
Currently harder to integrate interoperable services than to run them serially

Scalability and Automation: thoughts
Automated metadata extraction
  Natural language processing and sense-making is hard
  Cultural versus technical preservation
  Significant properties poorly defined
Automated migration
  Too few migration tools and not enough tests
  Migration ‘on demand’
End to end preservation
  Preservation tools not aligned
  ‘Good enough is not always good enough’