The Library of Congress Martha Anderson Program Officer, NDIIPP Office of Strategic Initiatives Library of Congress April 2005 LC Perspective: Preservation Partnerships

Presentation transcript:

The Library of Congress Martha Anderson Program Officer, NDIIPP Office of Strategic Initiatives Library of Congress April 2005 LC Perspective : Preservation Partnerships

The Library of Congress 2 Born Digital “At-Risk” Web Sites

The Library of Congress 3 NDIIPP Strategic Direction
Take actions that are:
Catalytic
  – Invest in existing strengths
Collaborative
  – Engage partners in areas of mutual interest and expertise
Iterative
  – Learn by doing
Strategic
  – Broad spectrum of balanced short-term and long-term investments

The Library of Congress 4 Web of projects
Diagram labels: UIUC, NARA, GPO, LC Web Projects, IIPC, NDIIPP, CDL, IA, AIHT, Preservation Partners, States Initiative

The Library of Congress 5 Library of Congress Web Archiving: Strategy
Collaborate with partners working on the same preservation issues
Develop collection strategies to leverage available resources
Learn by doing

The Library of Congress 6 Collaborate with partners working on the same preservation issues
Membership in the International Internet Preservation Consortium (IIPC)
Cooperative projects with NDIIPP Preservation Partners
  – California Digital Library
  – University of Illinois at Urbana-Champaign
Technical information sharing with other US government agencies
  – Government Printing Office
  – National Archives and Records Administration

The Library of Congress 7
Develop collection strategies to leverage available resources
  – Collect thematically both by crawling and by acquiring collections gathered by others
Learn by doing
  – Case studies and regular collection of theme-based collections
  – Participate in tools development with IIPC
  – Archive Ingest & Handling Project

The Library of Congress 8 Challenges of collecting from the Web
Characteristics of the resource: dynamic, deep, linked
Intellectual property laws and regulations
Tension of preservation vs access goals
Degree of alignment with current collection policies for other media
Curation strategy
Tools for identification and selection
Tools for collection, curation, and archiving of large web collections

The Library of Congress 9 Average Web Collection
Begins with a theme or event
Usually does not include commercial sites
Starts with a list of about 200 URLs
Is crawled by a vendor
Yields about 1 TB of data per month
Has a crawl frequency of once a week

The Library of Congress 10 Web Collections to date at LC
Event-based
  – US National Elections: 2000, 2002, 2004
  – War in Iraq
  – September 11
Public Policy Topics
  – Health Care
  – Legislative Branch
  – Terrorism
26 TB

The Library of Congress 11 Archive Ingest & Handling Test
AIHT is a first test of the proposed NDIIPP preservation architecture.
The test is conducted with a common data set.
  – George Mason University 9/11 Archive
Phase I tests ingest and data handling in local systems.
Phase II tests export and import between institutions.
Phase III explores format migration.

The Library of Congress 12 GMU 9/11 Archive
Participants demonstrate capabilities
Participants exchange archive

The Library of Congress 13 Participants
Old Dominion University, Department of Computer Science
Stanford University Libraries & Academic Information Resources
The Johns Hopkins University, Sheridan Libraries
Harvard University Library

The Library of Congress 14 George Mason University 9/11 Archive: Breakdown by File Types
57,450+ files
12 GB
Originally stored in a Linux environment

The Library of Congress 15 Goals of AIHT
Gain practical experience with multiple institutions
Document transfer and ingest processes for multiple systems
Determine next set of tasks for developing interfaces between layers and institutions

The Library of Congress 16 Status of AIHT
All phases completed.
  – Imports focused on technical assessment of the archive and on developing tools to examine it
  – Exports included METS and MPEG-21 DID objects
  – Migrations included transforms to JPEG 2000 and TIFF, and some exploration of HTML to XML and AVI to MPG
Full report expected by early summer.

The Library of Congress 17 For more information…
NDIIPP Technical Architecture version
International Internet Preservation Consortium
MINERVA: Mapping the INternet Electronic Resources Virtual Archive

The Library of Congress 18
Martha Anderson
NDIIPP Program Officer
Office of Strategic Initiatives
The Library of Congress
Washington, DC