BUILDING A COLLABORATIVE DIGITAL PRESERVATION NETWORK Caroline Arms Office of Strategic Initiatives, Library of Congress Robert H. McDonald Associate Director.

Slides:



Advertisements
Similar presentations
Dublin Core for Digital Video: Overview of the ViDe Application Profile.
Advertisements

A Community Approach to Preservation: Experiences with Social Science Data ASIST Summit 2010 Jonathan Crabtree April 9, 2010.
ETD Preservation Workshop Session Four: Collection Management for Preservation Gail McMillan, Virginia Tech.
The LIBRARY of CONGRESS JISC-CNI Envisioning Future Challenges in Networked Information York, England July 6, 2006.
Moving Forward With Digital Preservation at the Library of Congress Laura Campbell Associate Librarian for Strategic Initiatives Library of Congress.
Katherine Skinner Executive Director, Educopia Institute Program Manager, MetaArchive Cooperative An Age of Discovery, ARL-CNI Washington D.C. Friday,
The National Digital Stewardship Alliance: Community, Content, Commitment.
National Digital Information Infrastructure and Preservation Program (NDIIPP) Data-PASS/NDIIPP: A new effort to harvest our history A funder view May 25,
MICHAEL and the Italian Culture Portal: a cooperation model among national, regional, and local institutions The MICHAEL Project is funded under the European.
DCAPE Project Update Richard MarcianoChien-Yi Hou Caryn Wojcik University of University of State of Michigan North Carolina North Carolina Records Management.
Chronopolis: Preserving Our Digital Heritage David Minor UC San Diego San Diego Supercomputer Center.
MetaArchive of Southern Digital Cultural Partners in the dispersed redundant dark archive University Libraries at Emory Auburn Florida State Georgia Tech.
Collaborative Digital Preservation with LOCKSS Gail McMillan Digital Library and Archives, University Libraries Virginia Polytechnic Institute and State.
A Practical, Working and Replicable Approach to ETD Preservation Catherine M. Jannik, Georgia Institute of Technology Robert H. McDonald, Florida State.
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
Collaborative Preservation of ETDs: The MetaArchive Cooperative and LOCKSS Gail McMillan Digital Library and Archives, Virginia Tech 1 st Canadian ETD.
Persistent Digital Archives and Library System (PeDALS) A Guide for Wisconsin State Agencies.
National Digital Information Infrastructure and Preservation Program (NDIIPP) Building a Network of Preservation Partners CNI Spring Task Force Meeting.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
Promoting Digital Preservation Partnerships at the U.S. Library of Congress April 2004.
The National Digital Newspaper Program (NDNP) An NEH/LC Collaborative Program Enhancing access to historical newspapers Release: September 2006.
The Alabama Digital Preservation Network (ADPNet) A statewide private LOCKSS network Aaron Trehub, Auburn University Libraries NDIIPP Partners Meeting.
MetaArchive Distributed Digital Preservation Workshop Session 3: Costs and Operational Considerations Wednesday, May 30, 2007 Robert W. Woodruff Library.
MetaArchive of Southern Digital Cultural Partners in a dispersed redundant dark archive University Libraries at Emory Auburn Florida State Georgia Tech.
1.2 Content Management Catherine M. Jannik Georgia Institute of Technology MetaArchive Distributed Digital Preservation Workshop Emory University – Atlanta,
Mid-Michigan Digital Practitioners, March 14, 2014 The National Digital Stewardship Alliance Agenda Mid-Michigan Digital Practitioners Meeting Abigail.
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
Copyright © 2008, Open Geospatial Consortium, Inc., All Rights Reserved. NDIIPP Partnership Update: North Carolina and Multi-state Demonstration Projects.
Digital Preservation through Cooperation: LOCKSS Gail McMillan Digital Library and Archives, University Libraries Virginia Polytechnic Institute and State.
Growing the MetaArchive Cooperative: ETDs (electronic theses and dissertations) Gail McMillan Digital Library and Archives, Virginia Tech July 2008 NDIIPP.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
Digital Preservation: Lessons learned through national action Digital Preservation Interoperability Framework Workshop April 2010.
Katherine Skinner Educopia Institute and MetaArchive Cooperative Matt Schultz Educopia Institute and MetaArchive Cooperative NDIIPP Partners Meeting Arlington,
Why Archiving and Preserving GIS Data Is Important Maps tell a compelling story of change over time. They document movement, progress, and change to the.
Metadata Handling in the North Carolina Geospatial Data Project (NCGDAP) NCSU Libraries Steve Morris Head of Digital Library Initiatives Rob Farrell Geospatial.
Session 2.  Wake Up Call, LSTA Digitization Grant  Digital Preservation Summit, May 2008  ISU Digital Preservation Group, September 2009.
The Library of Congress Martha Anderson Program Officer, NDIIPP Office of Strategic Initiatives Library of Congress April 2005 LC Perspective : Preservation.
National Digital Information Infrastructure and Preservation Program (NDIIPP) CNI Project Briefing December 5, 2005.
November 2004 NDIIPP: Future Directions and Relevance to Other Countries Beth Dulabahn Office of Strategic Initiatives Library of Congress November 7,
Preserving eScholarship and Digitized Special Collections Distributed Digital Preservation Bill Donovan
Long-Term Preservation of At- Risk Digital Geospatial Data: A Cooperative Agreement with Library of Congress Steve Morris NCSU Libraries Zsolt Nagy NC.
T HE M ETA A RCHIVE M ODEL : D ISTRIBUTED D IGITAL P RESERVATION N ETWORKS Dr. Martin Halbert VIVA/SCHEV LAC Meeting Christopher Newport University Trible.
Report on Preservation of ETDs: The LOCKSS Prototype The work of Kamini Santhanagopalan Virginia Tech Graduate Student in Computer Science Reported at.
Martin Halbert President, MetaArchive Cooperative DigCCurr 2009 Meeting Chapel Hill, NC Friday, April 3, 2009.
The Alabama Digital Preservation Network (ADPNet) Aaron Trehub Director of Library Technology Auburn University State Council of Higher Education for Virginia.
The Alabama Digital Preservation Network (ADPNet) A statewide Private LOCKSS Network Aaron Trehub, Auburn University Libraries SAA/CoSA Joint Annual Meeting.
UK LOCKSS Alliance: Investigation into Private LOCKSS Networks Adam Rusbridge EDINA, University of Edinburgh.
Preservation Program Digital Preservation Program Digital Preservation Services: Extending tools to meet campus needs Patricia Cruse, Director, Digital.
Katherine Skinner, Educopia Institute Emily Gore, Clemson University U.S. Workshop on Roadmap for Digital Preservation Interoperability Framework NIST,
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Distributed Digital Preservation Networks Across a Region, Across a State: Stretching LOCKSS Gail McMillan, Virginia Tech Martin Halbert, Emory Aaron Trehub,
Digital Preservation through Cooperation: LOCKSS Gail McMillan Digital Library and Archives, University Libraries Virginia Polytechnic Institute and State.
Custodians of Culture, Architects of Archives  Martin Halbert (Emory Univ., MetaArchive Cooperative) - Facilitator  Thib Guicherd ‐ Callin (Stanford.
EDLproject WP3 “Developing the European Digital Library” LIBER – EBLIDA workshop Digitisation of Library Material in Europe Copenhagen, October.
Library of Congress Partnerships for Managing Geospatial Data North Carolina Geographic Information Coordinating Council Raleigh, NC November 7, 2007 William.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
LOCKSS at Georgia Tech Patricia E. Kenly April 2007.
The National Digital Information Infrastructure and Preservation Program (NDIIPP) Challenges and Solutions Laura E. Campbell Associate Librarian for Strategic.
The National Digital Stewardship Alliance: Stewardship, Collaboration, Inclusiveness, Exchange.
Katherine Skinner, Martin Halbert & Matt Schultz Educopia Institute and MetaArchive Cooperative NDSA Infrastructure Committee
Digital Preservation MetaArchive Cooperative, Digital Preservation Policy Planning Workshop Boston College, Boston, MA October 26, 2010.
The National Digital Stewardship Alliance: Community, Content, Commitment.
Beyond Technology: Creating and Sustaining the MetaArchive Cooperative Joint Annual Meeting, Society of American Archivists & the Council of State Archivists.
The Alabama Digital Preservation Network (ADPNet)
Implementing Metaarchive At Robert E. Kennedy Library
DataNet Collaboration
Introduction to Implementing an Institutional Repository
Gail McMillan Digital Library and Archives, Virginia Tech
CNI Project Briefing December 5, 2005
The MetaArchive Model: Distributed Digital Preservation Networks
Presentation transcript:

BUILDING A COLLABORATIVE DIGITAL PRESERVATION NETWORK Caroline Arms Office of Strategic Initiatives, Library of Congress Robert H. McDonald Associate Director of Libraries for Technology & Research Florida State University Lizabeth B. Nicol Digital Library Coordinator Auburn University Tyler O. Walters Associate Director for Technology and Resource Services Georgia Institute of Technology

OVERVIEW Introduction to the NDIIPP Partnerships The MetaArchive Partnership Auburn University Emory University Florida State University Georgia Tech University of Louisville Virginia Tech The MetaArchive Metadata Strategy The MetaArchive Technical Architecture

INTRODUCTION TO NDIIPP Federal legislation in December 2000 LC to work with public and private sector to support preservation of significant “born- digital” content that is at risk $25 million + another $75 million if matched, for potential total of $175 million. Started with planning period –consultation with stakeholder groups –commissioned surveys and reports –plan approved December 2002

OVERALL NDIIPP GOALS Help identify and preserve at-risk digital content Support development of improved tools, models, and methods for digital preservation Work with industry, concerned federal agencies, libraries, research institutions and not-for-profit entities Develop a national digital collection and preservation strategy

NDIIPP PORTFOLIO Plan calls for LC’s actions to be: –Catalytic, collaborative, iterative, strategic General approach –Find smart, willing collaborators. Learn by doing. Three areas of focus –Network of preservation partners Digital Preservation Partnerships Working with state libraries, archives, CTOs, etc. –Architectural framework for preservation –Digital preservation research Funding DIGARCH program through NSF

DIGITAL PRESERVATION PARTNERSHIPS Competition, cooperative agreements –8 awards announced in September 2004 –Partners collect/preserve content, collaborate with LC and each other –3 year term, LC to report to Congress Primary outcomes for partnerships: –Identify and preserve significant at-risk content –Leverage resources & experience via collaboration –Promote standards and best practices –Learn how to build and sustain partnerships

PARTNERSHIPS DIFFER In content scope –Public television programs (high-definition digital) –Dot-com era business records –Social science datasets –Geospatial information (2 projects) –Heterogeneous content harvested from web for which partners are already responsible In nature of partnership –Partners playing different roles –Group of peers

ACTIVITY ACROSS PROJECTS LC is providing resources and leadership –Individual LC staff as liaison to each project –Meetings twice a year ‘Affinity groups’ on cross-cutting issues –Selection and Collection – appraisal & tools –Rights – copyright and privacy –Technical Architecture –Economic Sustainability - costs and incentives Connections to other NDIIPP activities

METAARCHIVE PARTNERSHIP Project Summary: Six partner institutions: –Emory - Georgia Tech - Florida State –Virginia Tech – Auburn - Louisville Collaborate with LoC – 3-yr $1.4M effort to develop a cooperative for preservation of digital content. Content focus is southern culture and history.

MetaArchive Project Goals Create a conspectus of digital content within the subject domain held by the partner sites Harvest a body of most critical content to be preserved (3 terabytes, w/ capability to expand) Develop a model cooperative agreement for ongoing collaboration and sustainability Distributed preservation network infrastructure based on the LOCKSS software

Governance & Structure Committees: –Steering: coordination, communication, reporting (original six univs.) –Content: organize, develop, select content –Preservation: content retention/transfer, acquisition practices, metadata maintenance, text/image structures, migratability –Technical: server architecture, software development

Governance & Structure Membership Type: –Development partner: Testing and development of hardware, software, networking, and design of Network features. Carry out activities of preservation partner sites as well. –Preservation partner: Network participation -- maintain a node, ingest collections from partners or content contributors. Network development is optional.

Cooperative Agreement Develop a simple, flexible agreement as a model for other institutions seeking to cooperate in digital preservation –Membership criteria (and member withdrawal) –Roles and responsibilities – joint and equal custodians of content harvested –Sustainability plan (over time) –Ensure broad applicability

Cooperative Agreement Issues to Address: –New members: by invite only? by application? –3 rd member type: content contributor? –LOCKSS Alliance membership and fees –Central administration vs. decentralized –Financial sustainability (need central funds?) –Memo of agreement between institutions – detailing what members will do

METADATA OVERVIEW The MetaArchive Conspectus Database contains metadata elements that not only describe the collections that are to be collected, but also provide information that will be necessary for storage estimates, format migration, accrual rules, location, ownership and LOCKSS specific elements. The Conspectus Database is archived along with the collections.

GENESIS OF MD SPECIFICATION Metaarchive MD Specification Dublin Core Elements & Refinements Dublin Core Collection Level Description RSLP Collection Level Description MODS Physical Description MetaArchive Specific Elements

METADATA SCOPE Intellectual content of the collection(s) including subjects, spatial and temporal coverage Format of contained items - and extent of file sizes and formats Relation to other collections –Accrual rules (periodicity, open/closed) –Rights management rules –LOCKSS manifest pages and plugin information –Risk assessment

METADATA ELEMENTS Multiple name spaces utilized: –Dublin Core Elements –Dublin Core Refinements –Collection Level Description RSLP (Research Support Libraries Programme) MODS (Metadata Object Description Schema) MetaArchive defined terms MetaArchive Metadata Specification –

COLLECTION LEVEL DESCRIPTION DC Collection Description Application Profile –Accrual Periodicity [cld:accrualPeriodicity] –Accrual Policy [cld:accrualPolicy] –Contents Date Range [cld:dateContentsCreated] –Is Available Via [cld_gen:isAvailableAt] –Spatial Coverage [cld:spatial] –Temporal Coverage [cld:temporal] MODS –Manifestation [mods:physicalDescription] (1/3 of element definition) RSLP –Accumulation Date Range [rslp:created]

METAARCHIVE SPECIFIC Cataloged Status [ma:catalogedstatus] LOCKSS Manifest page [ma:manifest] MetaArchive Collect. Identifier [ma:collectionid] OAI Provider [ma:oaiprovider] Recommended Harvest Proc. [ma:harvestproc] Risk Rank [ma:riskrank]

TECHNICAL ARCHITECTURE Off-the-Shelf Strategy –Dell/Intel Based Hardware Could easily be HP or SUN Intel Based Hardware etc. Could be old desktops w/large hard drives. –New Low Cost SATA SAN EMC AX100 –$4.00 per GB (already dropping in price)

TECHNICAL ARCHITECTURE Operating System –RedHat Linux Enterprise AS v. 3/4 Ease of update management and experience w/OS –Could easily work on other versions of Linux JAVA SDK LOCKSS Content Ingestion/Replication –LOCKSS Daemon – 6-8 week updates w/RPM Conspectus Database –MySQL/PHP Interface – Integrated w/LOCKSS Plugin Directory MetaArchive Collection Description Metadata Schema

TECHNICAL ARCHITECTURE GaTech Node FSU Node Emory Node Auburn Node Auburn Yearbooks Emory Southern Spaces FSU ETD Collection Online Digital Collections Admin Interface LOCKSS for MetaArchive

LOCKSS ADMIN INTERFACE

TECHNICAL ARCHITECTURE STANDARDS –OAIS Reference Model LOCKSS Compliance –See –OAI-PMH 2.0 (Submission Information Package) Using as alternative to current LOCKSS AU strategy w/ETDs – VaTech, GaTech, FSU –MetaData Based on Known Collection Level Namespaces –

TECHNICAL ARCHITECTURE COLLABORATION –Kickstart Installations for Linux Servers Easy to setup all hardware exactly the same. –Efficiency of Replication Kickstart can be used with production system as well as with any Intel based machine. Currently running several test machines (old desktops) to trigger test LOCKSS quorums. –Communication Strategies Phone Conference, Video Conference I2 Commons, Wiki (MoinMoin), PhpCollab, iVocalize Chat/VOIP Room

MetaArchive NDIIPP Network via I2 Auburn University Emory University Ga Tech Va Tech University of Louisville Florida State University DC NYC CH IN ATL FL Lambda Rail Abilene NetworkSOX Network MAX Network MAX Connection to Va Tech

QUESTIONS MetaArchive Web – NDIIPP Web – Contacts –Caroline Arms – –Robert H. McDonald – –Lizabeth B. Nicol – –Tyler O. Walters -