A National Agenda for Digital Stewardship. Prepared for CNI Fall Membership Meeting, December 2013.

Presentation transcript:

A National Agenda for Digital Stewardship. Prepared for CNI Fall Membership Meeting, December 2013. Presenters: Micah Altman; Michelle Gallinger, Library of Congress; Abigail Grotke, Library of Congress; Trevor Owens, Library of Congress

This Talk
– Who are the NDSA?
– Why develop an agenda for digital stewardship?
– What should national priorities be? … digital content … technical infrastructure … organizational roles … research areas
– What’s next?

Collaborators & Co-Conspirators
– The 150+ institutional members of NDSA, and the hours contributed by their representatives to NDSA working groups, meetings and reports
– National Agenda Authors: Micah Altman, Jefferson Bailey, Karen Cariani, Jim Corridan, Jonathan Crabtree, Blaine Dessy, Michelle Gallinger, Andrea Goethals, Abigail Grotke, Cathy Hartman, Butch Lazorchak, Jane Mandelbaum, Carol Minton Morris, Trevor Owens, Meg Phillips, John Spencer, Helen Tibbo, Tyler Walters, Kate Wittenberg, Kate Zwaard

Who are the NDSA?

About the NDSA: Founded in 2010, the National Digital Stewardship Alliance (NDSA) is a consortium of institutions that are committed to the long-term preservation of digital information. Our mission is to establish, maintain, and advance the capacity to preserve our nation's digital resources for the benefit of present and future generations. NDSA member institutions represent all sectors, and include universities, consortia, professional associations, commercial enterprises, and government agencies at the federal, state, and local levels. The Library of Congress provides organizational support and substantive collaboration as Secretariat. NDSA is based on collaborative community effort; there are no fees for NDSA membership. Each member institution commits to NDSA principles, and contributes effort to working groups, reports, surveys, meetings and other NDSA initiatives.

NDSA Initiatives: Working Groups and Recent Outputs
– Extending Knowledge: Preservation Storage Survey; Web Harvesting Survey; Preservation Staffing Survey; Geospatial Selection & Appraisal report; Content case studies; NDSA Interview Series
– Tools for Practice: Levels of Preservation; Digital Preservation in a Box; Digital Preservation on Wikipedia
– Dissemination: National agenda for digital stewardship; NDSA Innovation Awards; NDSA Social Media

Why develop an agenda for digital stewardship?

Why a national agenda for digital stewardship?
– Effective digital stewardship is vital for:
  – maintaining authentic public records
  – growing a reliable scientific evidence base
  – providing durable access to our cultural heritage
– Knowledge of ongoing research, practice, and organizational collaborations is distributed widely across disciplines, sectors, and communities of practice

Why now?
– Climate: strong trends towards more production of digital content; more publishing, filtering and access; more learners and collaborators; more attention to public information
– Weather

Isn’t digital preservation a solved problem? Why not put everything in Amazon? Amazon claims reliability of % (longer odds than winning Powerball, being struck by lightning, and finding alien life, combined)

What Was Accomplished? The National Agenda for Digital Stewardship identifies high-impact opportunities to advance:
– the state of the art
– the state of practice
– the state of collaboration

How was this accomplished?
– Contributed community effort
  – Development: contributions from the (now 150+) institutional members through working group participation, workshop discussion, and commentary
  – Writing: LC staff, chairs of NDSA working groups, coordination committee
  – Reviewing: expert reviewers in the preservation community
– Integrating diverse perspectives from multiple disciplines & sectors
– The persistence, organization, and commitment of the Library of Congress in its role as Secretariat

National priorities for… Digital Content

Digital Content Areas
– Web and Social Media
– Electronic Records
– Moving Image and Recorded Sound
– Research Data

Digital Content Areas: Across all areas, content size, value and selection represent a core challenge. It is important to develop theoretically grounded and empirically tested models of information valuation.

Raising Awareness and Articulating Value
– Content area webinars
– Follow-up blog posts, interviews
– Content Case Studies
– Content Matters blog posts

National priorities for… Technical Infrastructure

2014 Technical Infrastructure Priorities
– File Format Action Plan Development
– Interoperability and Portability in Storage Architectures
– Integration of Digital Forensics Tools
– Ensuring Content Integrity

Technical Infrastructure: File Format Action Plan Development
– Stewardship organizations are amassing large collections of digital materials, which suggests a need to monitor the heterogeneous digital files those organizations manage.
– Tools and services for creating file format action plans are needed to make timely execution of those plans a reality for data stewards.
– The digital preservation community would further benefit from organizations sharing their assessments of institutional risk and their specific plans for mitigating that risk and addressing file format problems.
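
A first step toward monitoring heterogeneous files can be sketched as a simple inventory. The function name `format_inventory` is hypothetical, and the sketch assumes that a file extension is an acceptable rough proxy for format; a production workflow would more likely rely on signature-based format identification.

```python
from collections import Counter
from pathlib import Path


def format_inventory(root):
    """Count files under `root` by lowercase extension.

    Assumption: the extension is only a rough proxy for the real format.
    Files without an extension are grouped under "(none)".
    """
    counts = Counter()
    for path in Path(root).rglob("*"):
        if path.is_file():
            counts[path.suffix.lower() or "(none)"] += 1
    return counts


if __name__ == "__main__":
    # List the formats found under the current directory, most common first.
    for extension, count in format_inventory(".").most_common():
        print(f"{extension}\t{count}")
```

A count like this only shows which formats dominate a collection; deciding what to do about each one is the work of the action plan itself.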

Interoperability and Portability in Storage Architectures
– As stewardship organizations manage increasingly large and complex data sets, interoperability at various levels within the hardware and software stacks that make digital preservation possible becomes increasingly important.
– Interoperability of storage devices, hardware, data tape, and file system software would help alleviate bottlenecks between distinct functions in workflows.
– There is a need to establish and promote technical means by which lower levels of the technology stack can integrate directly, without requiring extensive computation and processing at higher levels.

Integration of Digital Forensics Tools
– Digital forensics tools are essential for working across the range of heterogeneous digital materials coming under stewardship.
– Projects like BitCurator are pulling together the suite of tools to do this work and developing processes and workflows.
– We are now at the point of implementation: it’s time for organizations to start implementing and sharing information about their work.
– The result of this work will be large sets of heterogeneous digital files, which will then push for the development of tools to work with these kinds of data at scale.

Ensuring Content Integrity
– Digital preservation is possible through a chain of migrations from current hardware and software systems to yet-to-be-established future infrastructures. It is essential to develop guidance on how to plan for and manage these changes.
– Abstract requirements for fixity are useful as principles, but when applied universally they can actually be detrimental in some digital preservation system architectures.
– Best practices are needed for fixity in particular system designs and configurations.
– Standards, practices and strategies are needed that directly address migration, in particular around end-to-end fixity checking.
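
End-to-end fixity checking across a migration can be illustrated with a minimal sketch. It assumes SHA-256 digests and a simple in-memory manifest; the function names (`sha256_of`, `build_manifest`, `verify_after_migration`) are hypothetical and do not come from any NDSA tool or recommendation.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root):
    """Map each file's path (relative to `root`, POSIX style) to its digest."""
    root = Path(root)
    return {
        path.relative_to(root).as_posix(): sha256_of(path)
        for path in sorted(root.rglob("*"))
        if path.is_file()
    }


def verify_after_migration(manifest, new_root):
    """Return the relative paths that are missing or changed under `new_root`."""
    new_root = Path(new_root)
    problems = []
    for relative_path, expected in manifest.items():
        target = new_root / relative_path
        if not target.is_file() or sha256_of(target) != expected:
            problems.append(relative_path)
    return problems
```

The manifest would be built on the source system before migration and checked on the destination afterwards; an empty result means every file arrived with its digest unchanged.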

National priorities for… Organizational Development

Organizational Roles, Policies, and Practices: The Agenda identifies a need to increase cross-organizational cooperation in order to increase the impact of, and leverage, investments made by individual institutions.

“People who work together will win, whether it be against complex football defenses, or the problems of modern society.” - NFL coach, Vince Lombardi

2014 Priorities for Cross-organizational Cooperation
1) Provision networked preservation services – a network of preservation service providers with specialized services, rather than every organization performing all aspects of digital preservation
2) Collaborate on shepherding and promotion of standards – digital preservation community representation on the relevant standards bodies, rather than each organization needing to participate in every body
3) Share digital preservation training and staffing resources

National priorities for… Research

Research Priorities
– Applied Research for Cost Modeling and Audit Modeling
– Understanding Information Equivalence & Significance
– Policy Research on Trust Frameworks
– Preservation at Scale
– The Evidence Base for Digital Preservation

What does the discipline believe?
– Our digital evidence base erodes
– There are multiple threats to information; diversifying against them is crucial
– Lifecycle analysis is critical for better long-term management of information
– Better practices are needed

How have we learned… The Limits of Case Studies
– Most current evidence for digital preservation practices and outcomes is based on local case studies and convenience samples
– Case studies are useful for:
  – existence proofs
  – raising awareness of problems
  – process tracing
  – hypothesis generation
– Case studies are not enough to:
  – advance our scientific knowledge
  – create robust predictive models
  – test causal hypotheses
  – strongly guide decision making
– Systematic evidence is needed to support both:
  – general selection of digital preservation practices and methods
  – application of selected digital preservation methods in a specific operational context

Simple question? If you have 1000 files (bitstreams) and you’d like a 99.99% chance of accessing them in 20 years, how do you store them?
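
One way to see why the question is not simple is to work out what the target implies for each file. The sketch below rests on two strong assumptions that the slide does not state: that "accessing them" means all 1000 files survive, and that files fail independently at a constant annual rate. The function name `required_annual_loss_rate` is hypothetical.

```python
import math


def required_annual_loss_rate(n_files, collection_target, years):
    """Largest per-file annual loss probability compatible with the target.

    Assumptions (not from the slides): every file must survive, and losses
    are independent across files and across years. Under these assumptions
    (1 - q) ** (n_files * years) = collection_target, solved here for q
    with log/expm1 to keep precision when q is tiny.
    """
    log_survival_per_file_year = math.log(collection_target) / (n_files * years)
    return -math.expm1(log_survival_per_file_year)


if __name__ == "__main__":
    q = required_annual_loss_rate(n_files=1000, collection_target=0.9999, years=20)
    print(f"Allowed annual loss probability per file: {q:.3e}")
```

Under these assumptions each file can tolerate an annual loss probability of only about 5 in a billion, which is why the choice of storage strategy matters.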

What are some threats?
– Physical & Hardware
– Software
– Insider & External Attacks
– Curatorial Error
– Organizational Failure

Amazon’s Unrealistic Nine Nines
– What are the units? Collection? Object? Bit? How many of these do you have?
– The claim seems to be entirely theoretical:
  – MTBF + independence * enough replicas
  – No details for the estimate provided
  – No historical reliability statistics provided
  – No service reliability auditing provided
– Reasons to doubt theoretical calculations:
  – Storage manufacturers’ hardware MTBF (mean time between failures) is inaccurate…
  – Failures across hardware replicas are not independent
  – Many potential correlated failure modes are not addressed: software failure (e.g. a bug in the AWS software for its control backplane); legal threats (leading to account lock-out, deletion, or content removal); institutional threats (such as a change in Amazon’s business model); process threats (someone hits the delete button by mistake, forgets to pay the bill, or AWS rejects the payment)
– Amazon SLAs do not incorporate or reflect “design” reliability claims even slightly:
  – No claim to reliability in the SLAs (or to uptime, availability, response time…)
  – Sole recovery for breach is limited to a refund of fees for periods the service was unavailable
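
The effect of correlated failure modes on a replica-based estimate can be sketched with a toy model. It assumes a single common-mode event (for example an account lock-out) that destroys every copy at once, with copies otherwise failing independently; all probabilities are illustrative placeholders, not Amazon figures, and `loss_probability` is a hypothetical name.

```python
def loss_probability(p_copy, n_copies, p_common=0.0):
    """Probability that every copy of an object is lost.

    Toy model (an assumption, not a vendor's calculation): with probability
    `p_common` a common-mode event destroys all copies together; otherwise
    each copy is lost independently with probability `p_copy`.
    """
    return p_common + (1.0 - p_common) * p_copy ** n_copies


if __name__ == "__main__":
    independent = loss_probability(p_copy=0.01, n_copies=3)
    correlated = loss_probability(p_copy=0.01, n_copies=3, p_common=1e-4)
    print(f"independent copies only: {independent:.3e}")
    print(f"with a common-mode risk: {correlated:.3e}")
```

In this model, adding replicas shrinks only the independent term; the common-mode term sets a floor on the loss probability that no number of replicas within the same service can lower.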

The Problem Restated
– Keeping risk of object loss fixed, what choices minimize $?
– “Dual problem”: keeping $ fixed, what choices minimize risk?
– Extension: for specific cost functions for the loss of each object, Loss(object_i), what choices minimize: Total cost = preservation cost + sum(E(Loss)), where the sum over all lost objects is the risk cost?
– Are we there yet?
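
The three formulations can be made concrete with a small sketch in which the only choice is the number of copies. The cost model is an assumption for illustration: a flat cost per copy per object, independent copy failures, and placeholder loss values; the function names are hypothetical.

```python
def expected_total_cost(n_copies, cost_per_copy, p_copy_loss, loss_values):
    """Total cost = preservation cost + sum of expected losses (risk cost).

    Assumptions: each object is stored `n_copies` times at `cost_per_copy`
    each, and an object is lost only if all of its copies fail, which
    happens independently with probability `p_copy_loss` per copy.
    """
    p_object_loss = p_copy_loss ** n_copies
    preservation_cost = n_copies * cost_per_copy * len(loss_values)
    risk_cost = sum(p_object_loss * loss for loss in loss_values)
    return preservation_cost + risk_cost


def best_copy_count(max_copies, cost_per_copy, p_copy_loss, loss_values):
    """Extension: the copy count that minimizes expected total cost."""
    return min(
        range(1, max_copies + 1),
        key=lambda n: expected_total_cost(n, cost_per_copy, p_copy_loss, loss_values),
    )


def min_copies_for_risk(risk_target, p_copy_loss, max_copies=50):
    """Risk held fixed: the fewest (cheapest) copies that meet the target."""
    for n in range(1, max_copies + 1):
        if p_copy_loss ** n <= risk_target:
            return n
    raise ValueError("risk target not reachable within max_copies")
```

With 100 objects each valued at 1000, a copy cost of 1, and a 5% chance of losing any one copy, the expected total cost is lowest at three copies, while a fixed per-object risk target of 1e-4 requires four. The two formulations need not agree on the answer.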

Methods for Mitigating Bit-Level Risk
– Physical: Media, Hardware, Environment
– Number of copies
– Diversification of copies
– Formats
– File Transforms: compression, encoding, encryption
– Fixity
– Repair
– Local Storage
– File Systems: transforms, deduplication, redundancy
– Replication
– Verification
– Audit

Modeling Bit Corruption
– Stages: Corruption, Detection, Repair
– Factors: Media characteristics; Threat characteristics; Correlations; Logical Scope of Corruption; Format Characteristics; File/encoding Characteristics; Filesystem Characteristics; Probability of Successful Repair; Auditing Frequency; Auditing Algorithm; Repair Algorithm; Repair Frequency; Repair Duration
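
The corruption, detection, and repair stages can be sketched as a small seeded simulation. This is a deliberately simplified model, not the one behind the slides: time is discrete, copies are corrupted independently, an audit detects every corrupted copy, and repair copies from any intact replica. The function and parameter names are hypothetical.

```python
import random


def simulate_loss_rate(n_copies, p_corrupt, periods, audit_interval,
                       p_repair=1.0, trials=2000, seed=0):
    """Estimate the fraction of trials in which all copies end up corrupted.

    Simplifying assumptions: independent corruption per copy and period,
    perfect detection at each audit, and repair of each damaged copy from
    an intact one with probability `p_repair`.
    """
    rng = random.Random(seed)  # fixed seed keeps the experiment replicable
    losses = 0
    for _ in range(trials):
        intact = [True] * n_copies
        for period in range(1, periods + 1):
            # Corruption: each intact copy may be damaged this period.
            intact = [ok and rng.random() >= p_corrupt for ok in intact]
            if not any(intact):
                losses += 1  # no intact copy is left to repair from
                break
            # Detection and repair: only on audit periods.
            if period % audit_interval == 0:
                intact = [ok or rng.random() < p_repair for ok in intact]
    return losses / trials
```

Comparing runs with different `audit_interval` values shows how auditing frequency trades off against the number of copies, which is the kind of question the factors listed above are meant to inform.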

What Else Do We Need To Know?
– What is the expected future value of a specified collection of digital content?
– What content is already being effectively stewarded by other organizations?
– What is the expected future cost of preserving that content?
– How often do different threats to information manifest?
  – storage hardware or media failures
  – software errors that cause information loss
  – stored information becoming inaccessible because of obsolete formats or loss of other contextual knowledge
  – human error or maliciousness that causes loss of content in an information system
– What is the reliability of current digital preservation networks and services?
– How successful are other proposed strategies for replication, monitoring, certification, and auditing at preventing loss due to these threats?

How do we learn?
– Apply existing research methodologies from other fields, especially fields involving observational research on humans and human systems
– Some useful methodologies:
  – probability-based surveys (e.g. of information management practice and outcomes)
  – replicable simulation experiments tied to theoretically grounded models of information management and risk
  – creation of testbeds and test corpora that can be used to systematically compare new practices, tools, and methods
  – field experiments, in which randomized interventions are applied and evaluated in real operational environments

What’s next?

A National Preservation Agenda for 2015 and Beyond
– Drafts and update process starts this winter
– Community review process in late spring
– An update will be presented in July at Digital Preservation 2014

Moving Digital Preservation Forward: NDSA has a commitment to:
– Facilitating broad collaboration
– Promoting dissemination and engagement
– Regular updates and revisions of the National Agenda and core NDSA surveys

Want more information? Contact NDSA for…
– Briefings, webinars, and consultations on the Agenda or other NDSA work
– Assistance in gathering comments on national policies and programs
– Assistance in recruiting experts for review and discussion panels and for grant review
– Referrals to content stewards in specific areas

More Information: digitalpreservation.gov/ndsa/nationalagenda