"Keeping alert: issues to know today for long-term digital preservation with repositories" Neil Beagrie Fedora Users Group Open Repositories Southampton.

Slides:



Advertisements
Similar presentations
THE BRITISH LIBRARY Open Access Institutional Repositories – Leadership, Direction & Launch January 26, 2005.
Advertisements

Partnering with Faculty / researchers to Enhance Scholarly Communication Caroline Mutwiri.
Usage statistics in context - panel discussion on understanding usage, measuring success Peter Shepherd Project Director COUNTER AAP/PSP 9 February 2005.
Preserv Preservation Eprint Services Simple Preservation Services – towards Proactive Support for the Institutional Repository.
PRESERV Repositories and stakeholders Jessie Hey PRESERV Partners Meeting 18 Nov 2005.
Panel 2 – Promoting Re-Use of Scientific Collections John Harrison SHAMAN Project University of Liverpool
Joint Information Systems CommitteeSupporting education and research JISC Conference 2006 Supporting digital curation to safeguard research data Sponsored.
Keeping Research Data Safe JISC Research Data Digital Preservation Costs Study LIFE Conference London June 2008.
S.J. Coles a*, M.B. Hursthouse a, R.A. Stephenson a, P. Cliff b, E. Lyon b, M. Patel b J. Downing c & P. Murray-Rust.
Costs, Policy, and Benefits in Long-term Digital Preservation Neil Beagrie Keepit Training Course Southampton Feb 2010.
Keeping Research Data Safe JISC Research Data Digital Preservation Costs Study JISC-CNI Belfast July 2008.
A centre of expertise in digital information management UKOLN is supported by: Curating the Scientific Record: The Challenges Ahead Dr.
A centre of expertise in digital information management UKOLN is supported by: UK Perspectives on the Curation and Preservation of Scientific.
UKRDS: the policy context 26 February 2009 Paul Hubbard Head of Research Policy, HEFCE.
Costs and Benefits in KRDS and I2S2 Neil Beagrie RAL Feb 2010.
A centre of expertise in data curation and preservation DigCCur2007 Symposium, Chapel Hill, N.C., April 18-20, 2007 Co-operation for digital preservation.
OVERVIEW & LIBRARY SUPPORT FOR DATA MANAGEMENT/SHARING Jim Van Loon, MSME/MLIS Science Librarian.
The JISC vision of research information management Dr Malcolm Read Executive Secretary, JISC.
INFSO-RI Enabling Grids for E-sciencE Grid & Data Preservation Boon Low System Development, EGEE Training National.
Sustainable Preservation Services for Archivists through Distributed Custody Caryn Wojcik State of Michigan Records Management Services.
Digital Preservation: Setting the Course for a Decade of Change Neil Beagrie British Library Bibliotheque royale de Belgique November 2007.
Data Requirements and Digital Repositories IASSIST Workshop Tampere, Finland 26 May, 2009.
Supplementary Data and Publishers Neil Beagrie, Julia Chruszcz, and Peter Williams Charles Beagrie Ltd Dryad UK April 2010.
1. UKPMC ‘We exist for everyone who wants to do research – for academic, personal, or commercial purposes.’ - BL Strategy 2005/8.
The Data Curation Profile IASSIST 2010 Jake Carlson Data Research Scientist Purdue University Libraries.
SCOPING DIGITAL REPOSITORIES SERVICES FOR RESEARCH DATA MANAGEMENT A Project of the Office of the Director of IT 1 The management of research data in digital.
New DFG Information Infrastructure Projects Dr. Stefan Winkler-Nees; Birmingham, 28. March 2011 New DFG Information Infrastructure Projects.
Bielefeld Conference 2006: Academic Library and Information Services: New Paradigms for the Digital Age Hans Geleijnse Director of Library and IT Services.
Keeping Research Data Safe JISC Research Data Digital Preservation Costs Study Oxford Workshop June 2008.
The Tower Hotel, November 26, 2009 Research Data Management Infrastructure Programme Launch Event SUpporting Data Management Infrastructure for the Humanities.
Copyright 2006 M.R.Thorley/NERC Mark Thorley, Natural Environment Research Council Research Outputs: Their Access & Preservation A perspective.
Good practice in Research Data Management Module 6: Tools, training and support.
David Giaretta Associate Director (Development) Funders: DCC Development Digital Curation Centre a centre of expertise in data curation and preservation.
LIFE 3 LIFE 3 : Predicting Long Term Preservation Costs Brian Hole LIFE 3 Project Manager The British Library KeepIt training course 05/02/10.
Hydra and Research Data Management Neil Stewart, Digital Library Manager, London School of Economics and Political Science Presentation for Hydra Europe.
Making Grey Literature Available through Institutional Repositories LeRoy J. LaFleur, Social Sciences Bibliographer Nathan A. Rupp, Metadata Librarian.
Science Archives in the 21st Century 25/26 April Towards an International standard for Audit and Certification of Digital Repositories David Giaretta.
Supporting further and higher education The UK FAIR Programme: OAI in context Chris Awre OAI3, CERN, February 2004.
24 March 2010Atlanta, Georgia Passing it on: Notes on digital initiative sustainability Marty Kurth HBCU Library Alliance – Cornell University Library.
The repositories Landscape: where are Repositories now and what’s around the corner? UKDA-store Louise Corti UKDA, University of Essex MIMAS OPEN FORUM.
Richard MarcianoChien-Yi Hou Caryn Wojcik University of University of State of Michigan North Carolina North Carolina Records Management ServicesSALT DCAPE.
September 17, 2015 The Evolving Scholarly Record: Scope, Stakeholders, and Stewardship Brian Lavoie Constance Malpas OCLC Research.
Models for a sustainable National Grid Service Introduction Neil Geddes Director, e-Science.
Life Cycle Models & Principles Jake Carlson Associate Professor of Library Science Data Services Specialist Purdue University Libraries.
Preservation Strategies: Framing The Approach Nancy Hoebelheinrich Knowledge Motifs LLC Data Management Workshop American Geophysical.
Digital Preservation MetaArchive Cooperative.  9:00-9:45 - Session 1: Digital Preservation Overview  9:45-11:00 - Session 2: Policy & Planning Overview.
Author(s): Brian Lavoie, 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Noncommercial.
David Carr The Wellcome Trust Data management and sharing: the Wellcome Trust’s approach Economic & Social Data Service conference.
UKOLN is supported by: Digital Preservation Benefits Tools Project Dissemination Workshop Dr Liz Lyon, Associate Director, UK Digital Curation Centre Director,
The KB e-Depot long-term preservation of scientific publications in practice Marcel Ras, National library of The Netherlands.
Metadata for digital preservation: a review of recent developments Michael Day UKOLN, University of Bath ECDL2001, 5th European Conference.
Benefits of data sharing: Towards an economic model Jenny Fry and Charles Oppenheim Information Science, Loughborough University John Houghton Centre for.
UKOLN is supported by: Introduction to UKOLN Dr Liz Lyon, Director UKOLN, University of Bath, UK Grand Challenge Meeting, June a centre.
KNOWLEDGE ECONOMY FORUM VI Technology Acquisition and Knowledge Networks Cambridge, England. April 17-19, 2007 Panel 2A April 19, 2007 Standards and Quality.
From ePrints to eSPIDA: Digital Preservation at the University of Glasgow William J Nixon, Service Development DAEDALUS, University of Glasgow DPC: Digital.
ARL Workshop on New Collaborative Relationships: The Role of Academic Libraries in the Digital Data Universe September 26-27, 2006 ARL Prue.
11 Researcher practice in data management Margaret Henty.
Digital Preservation through Cooperation: LOCKSS Gail McMillan Digital Library and Archives, University Libraries Virginia Polytechnic Institute and State.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
Long-term preservation and access: the UK context Michael Day, UKOLN, University of Bath RCUK Workshop on Publication.
Joint Information Systems Committee Supporting Higher and Further Education Continuing Access and Digital Preservation: the JISC Strategy Neil Beagrie.
Introduction to Research Data Management Joy Davidson and Sarah Jones Digital Curation Centre
Data Management and Digital Preservation Carly Dearborn, MSIS Digital Preservation & Electronic Records Archivist
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
Data Stewardship Lifecycle A framework for data service professionals Protectors of data.
A. D. SMITH – SEPTEMBER 29, 2011 RESOURCE PLANNING.
Working with personal digital archives Susan Thomas Project Manager & Digital Archivist project Manuscripts Matter, Electronica panel London, October.
PRESERV PReservation Eprint SERVices
Funding the full economic costs of research
Presentation transcript:

"Keeping alert: issues to know today for long-term digital preservation with repositories" Neil Beagrie Fedora Users Group Open Repositories Southampton April 2008

Focus of this lecture Research Data JISC Research Data Digital Preservation Costs Study Long-term costs and sustainability of Repositories

Trends

Computer Processing Power and Storage

Growth of Scientific Data and Data Curation In next 5 years e-Science will produce more data than has been collected in the whole of human history Data growth – Protein Data Bank ( /2005)

e-Research and preservation (UK Science and Innovation Investment Framework 2004 – 2014)

Information Infrastructure 2.23 The growing UK research base must have ready and efficient access to information of all kinds – such as experimental data sets, journals, theses, conference proceedings and patents… It is clear that the research community needs access to information mechanisms which: systematically collect, preserve and make available digital information;… The Government [via DTI] will therefore work with interested funders and stakeholders to consider the national e-infrastructure (hardware, networks, communications technology) necessary to deliver an effective system.

e-research data and repositories EU Studies: Driver2 E-SCIDR (e-science repositories) Current UK Studies: Data Scientist careers/skills Data Audit Framework and institutional pilots UK Research Data (shared)Service Feasibility Study Costs for long-term preservation of research data

Keeping Research Data Safe JISC Research Data Digital Preservation Costs Study

Overview Aim – investigate costs, develop model and recommendations Project team – Me, Julia Chruszcz, Brian Lavoie (OCLC), Cambridge, KCL, Southampton Method – detailed analysis of 2 cost models (LIFE & NASA CET) in combination with OAIS and TRAC; literature review;12 interviews; 4 case studies. 4 month study Draft final report in peer review

What have we Produced? A cost framework consisting of: – activity model in 3 parts: pre-archive, archive, support services –Key cost variables divided into economic adjustments and service adjustments –Resources template for TRAC –Used in combination to generate cost/charging models 4 detailed case studies (ADS, Cambridge, KCl, Southampton) Spreadsheet supplement Data from other services.

Some Tentative Findings

Findings Institutional Repository (e- publications): Staff Equipment (capital depreciated over 3 years) Annual recurrent costs 1 FTE£1,300 pa Federated Institutional Repository (data): Annual recurrent costs Staff Equipment (capital depreciated over 3 years) Cambridge4 FTE£58,764 pa KCL2.5 FTE£27,546 pa

Findings Timing. costs c. 333 euros for the creation of a batch of 1000 records. Once 10 years have passed since creation it may cost 10,000 euros to ‘repair’ a batch of 1000 records with badly created metadata (Digitale Bewaring Project) Efficiency Curve effects – start-up to operational Economy of scale effects – Accession rates of 10 or 60 collections - 600% increase in accessions will only increase costs by 325% (ULCC)

Findings National subject repositories costs Acquisition and Ingest Archival Storage and Preservation Access c. 42%c. 23%c. 35%

Findings ADS project of long-term preservation costs Implications for sustainability via project charges Preservation interventions (file format migrations) Long-term storage costs Assumptions of archive growth (economies of scale) Assumptions on “first mover innovation”

What’s New? FEC based – not in or partial in other models but –Requirement for HEIs –Absence of FEC (a) distorts business cases eg for automation (b) cannot accurately compare in-house or out- source costs Not just DIY – application neutral – can cost for in- house archive, full or partial shared service(s), national/subject data centre archive charges Preservation: archival storage, preservation planning, data management, “first mover innovation” Tailored for research data: different collection levels, documentation+ metadata, products from data, etc

Conclusions

Cost Observations for Repositories Not just formula of function costs Can illustrate effect of some choices on costs Sustainable project archive funding model? Start-up v running costs bleeding-edge costs – “first mover innovation” Audit/capacity planning Not last word on costs....

Questions?