UK LOCKSS Alliance Today’s scholarly content, secured for tomorrow Adam Rusbridge UK LOCKSS Alliance Coordinator EDINA, University of Edinburgh 8 th March.

Slides:



Advertisements
Similar presentations
Open Access - Where are we so far? Bill Hubbard SHERPA Project Manager University of Nottingham.
Advertisements

Institutional Repositories and the SHERPA Project Bill Hubbard SHERPA Project Manager University of Nottingham.
SHERPA Jackie Wickham RSP Project Coordinator
Practical Issues for Institutional Repositories Bill Hubbard SHERPA Project Manager University of Nottingham.
Whats Different about the Digital: Community Action via UK LOCKSS Alliance Adam Rusbridge UK LOCKSS Alliance Coordinator EDINA, University of Edinburgh.
Joint Information Systems Committee Digital Library Services BL/JISC Workshop Rachel Bruce JISC Programme Director The Digital Library and its Services,
A centre of expertise in data curation and preservation DCC Workshop: Curating sApril 24 – 25, 2006 Funded by: This work is licensed under the Creative.
Supporting further and higher education Supporting Digital Preservation and Asset Management in Institutions eSPIDA event University of Glasgow 11 February.
SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA, SAN DIEGO Disk and Tape Storage Cost Models Richard Moore & David Minor San Diego Supercomputer.
our digital memory accessible tomorrow...to make our digital memory accessible tomorrow... Enabling Agenda-setting The Digital Preservation.
SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA, SAN DIEGO Disk and Tape Storage Cost Models Richard Moore & David Minor San Diego Supercomputer.
Supporting Further and Higher Education Building the UK National Information Environment - Lessons from the Past and Pointers To the Future Norman Wiseman.
National Activities and the UK LOCKSS Alliance Adam Rusbridge EDINA, University of Edinburgh 10 th May 2011.
Implementing Open Acces in Denmark UNESCO Regional Consultations on Open Access, Berlin, November 2013 forfatter.
DCAPE Project Update Richard MarcianoChien-Yi Hou Caryn Wojcik University of University of State of Michigan North Carolina North Carolina Records Management.
Collaborative Preservation of ETDs: The MetaArchive Cooperative and LOCKSS Gail McMillan Digital Library and Archives, Virginia Tech 1 st Canadian ETD.
E-journal preservation: economics and practicalities at the LSE Presented by Bill Barker and Lisa Cardy.
FIL Interlend, 05 July SUNCAT: the Central Source of Serials Information in the UK SUNCAT: the Serials Union Catalogue for the UK
The Alabama Digital Preservation Network (ADPNet) A statewide private LOCKSS network Aaron Trehub, Auburn University Libraries NDIIPP Partners Meeting.
MetaArchive Distributed Digital Preservation Workshop Session 3: Costs and Operational Considerations Wednesday, May 30, 2007 Robert W. Woodruff Library.
Mid-Michigan Digital Practitioners, March 14, 2014 The National Digital Stewardship Alliance Agenda Mid-Michigan Digital Practitioners Meeting Abigail.
Aims and Objectives “ The Archaeology Data Service (ADS) supports research, learning and teaching with high quality and dependable digital resources.
UK LOCKSS Alliance: Content Development Adam Rusbridge EDINA, University of Edinburgh 10 th May 2011.
CLOCKSS: Time and Places for Community-Based Archiving Peter Burnhill University of Edinburgh IFLA 2010.
UK LOCKSS Alliance Today’s scholarly content, secured for tomorrow Adam Rusbridge UK LOCKSS Alliance Coordinator EDINA, University of Edinburgh 19 th October.
“Storage is Cheap” and Other Lies Lance Stuchell, University of Michigan Library Curating and Managing Research Data for Re-Use ICPSR, July 31, 2013 “Hard.
Managing Research Data – The Organisational Challenge at Oxford James A J Wilson Friday 6 th December,
Digital Preservation through Cooperation: LOCKSS Gail McMillan Digital Library and Archives, University Libraries Virginia Polytechnic Institute and State.
Scholarly Communications at Oxford Brookes ‘Supporting researchers at all levels with managing, sharing, communicating, disseminating and curating their.
Growing the MetaArchive Cooperative: ETDs (electronic theses and dissertations) Gail McMillan Digital Library and Archives, Virginia Tech July 2008 NDIIPP.
Digital Preservation: Lessons learned through national action Digital Preservation Interoperability Framework Workshop April 2010.
Katherine Skinner Educopia Institute and MetaArchive Cooperative Matt Schultz Educopia Institute and MetaArchive Cooperative NDIIPP Partners Meeting Arlington,
Preserving ETDs: NDLTD & MetaArchive Collaboration Gail McMillan Digital Library and Archives, Virginia Tech Newcomers’ USETDA 2012.
The International e-Depot to Guarantee Permanent Access to Scholarly Publications Marcel Ras Tartu, June 2012.
Session 2.  Wake Up Call, LSTA Digitization Grant  Digital Preservation Summit, May 2008  ISU Digital Preservation Group, September 2009.
Libraries, Archives, and Digital Preservation: The Reality of What We Must Do Leslie Johnston Acting Director, National Digital Information Infrastructure.
1 Designing Storage Architecture for Digital Collections 2012.
Preserving eScholarship and Digitized Special Collections Distributed Digital Preservation Bill Donovan
T HE M ETA A RCHIVE M ODEL : D ISTRIBUTED D IGITAL P RESERVATION N ETWORKS Dr. Martin Halbert VIVA/SCHEV LAC Meeting Christopher Newport University Trible.
HATHITRUST A Shared Digital Repository The HathiTrust Print Monograph Archive Planning Task Force Print Archive Network Forum ALA 2015 Annual Meeting June.
Katherine Skinner, Executive Director, Educopia Institute ESOPI 2013 Chapel Hill, NC April 19, 2013.
Growing the MetaArchive Cooperative ETDs Gail McMillan Digital Library and Archives, Virginia Tech July 2008 NDIIPP Partners Meeting.
Martin Halbert President, MetaArchive Cooperative DigCCurr 2009 Meeting Chapel Hill, NC Friday, April 3, 2009.
The KB e-Depot long-term preservation of scientific publications in practice Marcel Ras, National library of The Netherlands.
Dr. Martin Halbert Dr. Katherine Skinner Digital Preservation: What’s Now, What’s Next. Amigos Online Conference, August 12, 2011.
The Alabama Digital Preservation Network (ADPNet) Aaron Trehub Director of Library Technology Auburn University State Council of Higher Education for Virginia.
The Alabama Digital Preservation Network (ADPNet) A statewide Private LOCKSS Network Aaron Trehub, Auburn University Libraries SAA/CoSA Joint Annual Meeting.
UK LOCKSS Alliance: Investigation into Private LOCKSS Networks Adam Rusbridge EDINA, University of Edinburgh.
1 Strategic Developments at the British Library Lynne Brindley, Chief Executive UK Serials Group, 7 April 2003.
The Story of at the Alaska State Library Presented by Sheri Somerville Alaska State Library March 14, 2009.
Collaborative Preservation of ETDs: The MetaArchive Cooperative and LOCKSS Gail McMillan Digital Library and Archives, Virginia Tech Canadian.
Top Priorities in IT and Digital Projects at Georgia Tech Tyler Walters Georgia Tech Library and Information Center For ASERL ITDIIG – September 24, 2009.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
Digital Preservation across the technologies, strategies, open standards & interoperability aspects including the legal issues Pratik Shrivastava Scientist.
Supporting further and higher education UK LOCKSS Pilot Programme Hazel Woodward Cranfield University & JISC.
Katherine Skinner, Educopia Institute Emily Gore, Clemson University U.S. Workshop on Roadmap for Digital Preservation Interoperability Framework NIST,
JISC/CNI Conference Edinburgh, 26th June 2002 Challenges of Digital Preservation – do we have a road map? Maggie Jones.
Digital Preservation through Cooperation: LOCKSS Gail McMillan Digital Library and Archives, University Libraries Virginia Polytechnic Institute and State.
MRC, 14 June SUNCAT: an introduction for the MRC libraries SUNCAT: the Serials Union Catalogue for the UK
New Opportunities Fund Preservation Workshop March 15th 2002 Maggie Jones Cedars Project Manager.
Joint Information Systems Committee Supporting Higher and Further Education Continuing Access and Digital Preservation: the JISC Strategy Neil Beagrie.
Libraries in the digital age Collection & preservation for generational access part two The LOCKSS Program.
Research Data Management 26 th April 2016 Federica Fina, Data Scientist, University of St Andrews Library.
Katherine Skinner, Martin Halbert & Matt Schultz Educopia Institute and MetaArchive Cooperative NDSA Infrastructure Committee
A Shared Commitment to Digital Preservation and Access.
IPR and the EThOS Project 28 th October 2008 Dr. Susan Copeland Senior Information Adviser (Research)
UK DP Needs Assessment Project overview 2 November 2005 Martin Waller.
The Alabama Digital Preservation Network (ADPNet)
Legal Deposit & UK Publishing
The MetaArchive Model: Distributed Digital Preservation Networks
Presentation transcript:

UK LOCKSS Alliance Today’s scholarly content, secured for tomorrow Adam Rusbridge UK LOCKSS Alliance Coordinator EDINA, University of Edinburgh 8 th March 2012 Trust and eJournals Workshop, London

Summary LOCKSS: Digital equivalent of the physical shelf Sufficient rights to access content as needed Financial control and governance over systems Automate preservation functions where possible LOCKSS provides generic preservation capacity Customise the distributed architecture according to community needs Modeling the total cost of long-term storage

Community Action for Assured Access A co-operative organization to ensure continuing sustainable access to scholarly work over the long term. UK libraries are collaborating to build national ‘network level’ infrastructure and to coordinate the preservation of electronic material of local and UK interest. (since 2008) Support Service at EDINA provides underlying coordination, support and development JISC Collections organises membership subscriptions and gives guidance and support JISC prompted the initial project led by the Digital Curation Centre ( ) 17 member institutions De Montfort University King’s College London London School of Economics Natural History Museum Open University Royal Holloway, University of London University of Birmingham University of Edinburgh University of Glasgow University of Hertfordshire University of Huddersfield University of Newcastle Upon Tyne University of Oxford University of Salford University of St. Andrews University of Warwick University of York Steering Committee directs activity (next meeting May 2012) Phil Adams (De Montfort University) Lisa Cardy (London School of Economics) Geoff Gilbert (University of Birmingham) Tony Kidd (University of Glasgow) Liz Stevenson (University of Edinburgh) Lorraine Estelle (JISC Collections) Peter Burnhill (EDINA) Adam Rusbridge (EDINA)

Technical Infrastructure -Preserves content as published -Preserve the record: web archiving -Fetches content from a server -Preserves integrity -Audit protocol to prevent damage -Tamper resistant -Avoids single point of failure -Distributed network to avoid points of failure -Model on success of print collections (and operation of the library)

Technical Infrastructure -Preserves content as published -Preserve the record: web archiving -Fetches content from a server -Preserves integrity -Audit protocol to prevent damage -Tamper resistant -Avoids single point of failure -Distributed network to avoid points of failure -Model on success of print collections (and operation of the library)

Technical Infrastructure -Preserves content as published -Preserve the record: web archiving -Fetches content from a server -Preserves integrity -Audit protocol to prevent damage -Tamper resistant -Avoids single point of failure -Distributed network to avoid points of failure -Model on success of print collections (and operation of the library)

MetaArchive A distributed digital preservation solution depends on a collaborating set of institutions agreeing to preserve each other’s content. Requires central coordination; shared enthusiasm, resources and benefit Successful models initiated where community / shared need already in place. MetaArchive is a cooperative not a vendor (conceived 2004) Goal is not to make profits, but to improve each member's situation. Distribute across geography: diversify funding, politics, economy Replicate content, lower barriers of entry Educopia Institute - non-profit administrative organisation Coordination role; arrange legal agreements and commitments to preserve member content Sustained by affordable cooperative fee memberships set by members Supplemented by grants and contracts

Costs Equipment Each institution required to contribute a server to the network As of June 2011: $4,600 for a 16TB machine Staffing 2% of a systems administrator’s time Administrator/point of contact Software engineer who preps content for ingest (latter two roles needed for outsourced solution). Storage $1.00/GB/year for content stored in the network. ‘Conspectus’ to organise where content is stored

Tiered Membership Sustaining Members: $5,500/year Leadership, development, governance Preservation Members: $3,000/year Benefit from shared preservation model Collaborative Members: (varies, but e.g. $4,000/year for 20) Consortia that share a server, and so look like one organisation. Allows existing consortia to preserve co-hosted content for a fraction of what it would cost to do so as individual members.

PLNs in the UK: Member Survey Share resources & responsibility, build community, keep costs low Preservation policy Content and Collections Organisational architecture Costs and Resources

Initial Conclusions Survey response rate: 50% of members Institutions seeking affordable solutions to digital preservation. e-preservation strategies have yet to be developed. Extent of digital assets requiring preservation unknown Systematic audits have not yet been carried out. Prefer architecture where content is stored at more than one location However a fully distributed approach was not favoured. Mixed enthusiasm for a PLN Need to demonstrate PLN is low-cost and sustainable Need clear and demonstrable financial benefits Need a shared interest in preserving a particular body or type of content. Difficult to gain acceptance and commitment without these benefits Moving forward: establish a UK PLN, or join the MetaArchive as a Collaborative Member?

Pricing of Long-term Cloud Storage David Rosenthal has been looking at cost models for long-term storage: Does it make economic sense to store data in the cloud, in the long-term. Kryder's Law, 30yr history of exponential increase in disk capacity at roughly constant cost. The cost of storing bits for the long term depends on current price and how fast it is dropping. How long can we expect Kryder's Law to continue? Indications that Kryder's Law is slowing down 4TB disks now available, but slower than expected. Driver for 3.5" disks has been desktop PCs. Volume market is now 2.5" disks: same curve but higher price/ byte. By 2020, ought to have 14TB 2.5" $40 Consumers may prefer a 2TB 1" drive for $15 and less power draw YearCost per GB 2002$ $ $ $ $0.08? 2012$0.06?

Cloud Storage Price History Price of cloud storage is dropping around an order of magnitude more slowly than raw disk prices. There is a recurrent cost for storage in the cloud. As collections grow, will the cost of cloud storage grow more than if performed locally? Research to model total costs over time – local hardware, maintenance, location, power, bandwidth, staffing. ProviderPrice (Year of Launch) Current Price% decrease Amazon S3$0.15/GB/mo (2006) $0.125/GB/mo3%/yr RackSpace$0.15/GB/mo (2008) $0.15/GB/mo0%/yr Windows Azure$0.15/GB/mo (2009) $0.14/GB/mo3%/yr

Cost to preserve 8TB Starting with S3's current pricing and assuming that it continues to drop at 3%/yr, the total cost over 4 years would be $41,065. DIY: 3 geographically separate complete copies each protected against double disk failures. Three Drobo FS network file servers ($600 each at Amazon) populated with 5 3TB Hitachi 5400RPM drives ($210 each at Newegg). Add one spare for each Drobo to cover while failed drives are returned under warranty. Capital cost of $5580. Each Drobo consumes ~70W with all drives active. So we'd consume 1840 KWh over 4 years. Palo Alto Green rates, cost of $250. Stanford experience with Drobos is that almost no attention needed, but assume staff costs at $50/hr for 1hr/mo/box = $7200. The total cost over 4 years would be $13,030: around a third of the total cost of S3.

Principles of LOCKSS: Building Trusted Archives LOCKSS software can be used to provide general, shared preservation capacity Responsibility spread across the community Shepherded by strong universities with strong collection policies Further assessment of UK Private LOCKSS Networks Model selected depends on scale of content & community enthusiasm Further assessment to understand the total cost of storage

Find out more…