Evolution of a Prototype Archival System for Preserving & Reviewing Electronic Records 2008 SAA Annual Meeting August 30, 2008.

Slides:



Advertisements
Similar presentations
Advanced Decision Support for Archival Processing of Presidential E-Records: Results and Demonstration William Underwood, P.I. Georgia Tech Research Institute.
Advertisements

File Format Identification and Archival Processing
William Underwood Georgia Tech Research Institute Atlanta, Georgia
Going Almost Paperless in 2009 Three Offices Leading the Way.
OVERVIEW & LIBRARY SUPPORT FOR DATA MANAGEMENT/SHARING Jim Van Loon, MSME/MLIS Science Librarian.
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
George W. Bush Presidential Library Electronic Records Alan Lowe April 24, 2012.
Copyright 2012 by Arthur Fricke Memos? What’s that? Look at textbook index under “memos” to see all the very detailed info that this slideshow briefly.
Information Technology IBM DB2 Content Manager “Lunch N Learn” 03/14/2007.
The National Declassification Center Releasing All We Can, Protecting What We Must Public Interest Declassification Board NDC Project Update April 22,
Dr Gordon Russell, Napier University Unit Data Dictionary 1 Data Dictionary Unit 5.3.
Developing Assessment Criteria One Archivist’s View Mark Conrad National Archives and Records Administration Center for Advanced Systems and Technologies.
The Process of Accessioning Materials South Dakota State Archives 900 Governors Drive Pierre, SD (605)
Selecting Preservation Strategies for Web Archives Stephan Strodl, Andreas Rauber Department of Software.
Providing Online Access to the HKUST University Archives: EAD to INNOPAC Sintra Tsang and K.T. Lam The Hong Kong University of Science and Technology 7th.
Humboldt University: A workflow model for digital theses and dissertations ETD A workflow model for digital theses and dissertations Developments.
NARA – Roper Center Collaboration: USIA Office of Research Surveys Michael Carlson National Archives and Records Administration Marc Maynard.
Lecture Nine Database Planning, Design, and Administration
Configuration Management
Business Communication Report Writing
Created May 2, Division of Public Health Managing Records What is a Record? What is a Records Retention & Disposition Schedule? Why is this Important?
Software Configuration Management (SCM)
WuArchivalContr.ppt-1 Information Technology & Telecommunications Laboratory Presidential Electronic Records Pilot Operating System (PERPOS) William Underwood.
A Dynamic Solution for Electronic Records: The National Archives & Records Administration’s Electronic Records Archives Kenneth Thibodeau, Director Electronic.
Records Management Overview. Why? It’s the Law It’s the Law It’s University Policy It’s University Policy Fiscal and Legal Compliance Fiscal and Legal.
1 Public Outreach October 2008 By Adelina Murtezaj – Public Relation Officer For Inaugural Partnership Activity between ICC and ERO.
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Project Overview Piloting an Enterprise Approach to Electronic Records Management Dawn Bluma DWD Records Officer.
DSpace, CyberCemeteries and Other Active Sites for Community Networking Records Maria Esteva and Sue Soy School of Information, UT Austin Austin History.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
MAHI Research Database Data Validation System Software Prototype Demonstration September 18, 2001
Resume and Cover Letter Development Chapter 5. 5 | 2 Copyright 2012 Wadsworth © Cengage Learning. All rights reserved. The Big Picture Chapter 5 provides.
Data Analysis and Security 11 Session Version 1.0 © 2011 Aptech Limited.
ITTL.ppt-1 Information Technology & Telecommunications Laboratory Document Type Recognition and Content Summarization William Underwood Persistent Archives.
Presidential Memorandum on Managing Government Records Paul Wester Chief Records Officer for the U.S. Government National Archives and Records Administration.
ARCHIVISTS’ TOOLKIT WORKSHOP March 13, 2008 Christine de Catanzaro Jody Thompson.
ITTL.ppt-1 Information Technology & Telecommunications Laboratory Semantic Technologies Applied to FOIA Review William Underwood Partnerships in Innovation:
It’s Up and Running, Now What? Strategies for Building Content in an Institutional Repository LITA National Forum ♦ Denver, Colorado October 6, 2007 Catherine.
Meet and Confer Rule 26(f) of the Federal Rules of Civil Procedure states that “parties must confer as soon as practicable - and in any event at least.
Archive Engine West Contextualizing Digital Objects with EAD Metadata Jodi Allison-Bunnell, Orbis Cascade Alliance Worthy Martin, Institute for Advanced.
Records Management 101 The Basics Archival and Records Management Services Division.
Archives, Records Management and SMARTech: Your guide to managing and preserving campus records April 27, 2006.
Safeguarding the Freedom of Information: Digital Archive Initiatives in the United States Federal Government Michael Paul Huff Information Resource Officer.
Washington State Archives October 2010 Presented by: Russell Wood - State Records Manager Julie Woods – Local Government Records Retention Specialist Basics.
GTRI.ppt-1 NLP Technology Applied to e-discovery Bill Underwood Principal Research Scientist “The Current Status and.
Archival Workshop on Ingest, Identification, and Certification Standards Certification (Best Practices) Checklist Does the archive have a written plan.
Tsinghua University Library Yang Zhao & Airong Jiang Tsinghua University Library, Beijing China 4 June, 2004 Electronic Thesis and Dissertation System.
NIEM 3.0 Data Analytics App Dr. Brand Niemann Director and Senior Data Scientist Semantic Community AOL Government Blogger.
BSBPMG507A Apply Communication Management Techniques 10.3 Distribute Information The process of making relevant information available to project stakeholders.
E VALUATING YOUR E - LEARNING COURSE LTU Workshop 11 March 2008.
National Archives and Records Administration Status of the ERA Project RACO Chicago Meg Phillips August 24, 2010.
April 14, 2005MIT Libraries Visiting Committee Libraries Strategic Plan Theme III Work to shape the future MacKenzie Smith Associate Director for Technology.
DAEDALUS - An ePrints Case Study William J Nixon Service Development Susan Ashworth Advocacy.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
EAD 101: An Introduction to Encoded Archival Description XML and the Encoded Archival Description: Providing Access to Collections Oregon Library Association.
Launching E-Records with a PERPOS: The Presidential Electronic Records PilOt System 2005 NAGARA Annual Meeting.
Donald G. Davis Collection 392K Amy Baker, Megan Peck, Zach Vowell.
Chang, Wen-Hsi Division Director National Archives Administration, 2011/3/18/16:15-17: TELDAP International Conference.
Digitalcommons.unl.edu Archiving Department Records.
DAEDALUS Project William J Nixon Service Development Susan Ashworth Advocacy.
E-Discovery Copyright 2008 Thomas F. Goldman. WHAT HAVE THEY DONE TO US NOW? OH NO, NOT AGAIN!!!!!!!!!! Overview.
7th Annual Hong Kong Innovative Users Group Meeting
Grammar-based Specification and Parsing for Binary File Formats
Chapter 11: Software Configuration Management
Content-level intellectual control for digital archives
Joseph JaJa, Mike Smorul, and Sangchul Song
Chapter 11: Software Configuration Management
APE EAD3 introduction - DARIAH - Brussels
Presentation transcript:

Evolution of a Prototype Archival System for Preserving & Reviewing Electronic Records 2008 SAA Annual Meeting August 30, 2008

Presented by: Chair: Stephannie Oriabure, Archivist, NARA Brooke Clement, Archivist, NARA, and Dr. William Underwood, Georgia Tech Research Institute

Overview What were the Issues? Our Approach Archival Processing Preservation New Technologies Conclusion

Electronic Records at the George H.W. Bush Pres. Library One of the first presidential libraries to have a significant amount of e-records ◦Word Processing Files ◦Databases ◦Spreadsheets ◦Presentations ◦ ◦Computer Programs ◦Scanned Paper Records

Where We Began The archival functions needed to process paper records are well understood. We had few tools to identify, view or review electronic records in response to FOIA requests Tools Initially Needed: ◦File Format Identification Tool ◦Viewers for Records in Legacy File Formats ◦Tools Redacting E-records ◦Tools for Converting Legacy to Current Formats

Approach: Evolutionary Prototyping Computer Scientists Build Tools Archival Tools Archivists Test Tools Experience Archivists Formulate New Require- ments New Requirements Result: Integrated set of tools called PERPOS

Archival Activities Supported by PERPOS PERPOS Repository AccessionArrangePreserveSearchReviewDescribe

Accessioning

Intellectual Arrangement/Description

FOIA Processing: Create a Case

Search

Results Set

Review Checkout Container in ART, then… …open Container in the APT and Change the Activity to “Review.”

Review: Closing a Record

Review: Withdrawal Sheets

Review: Closed Record

Review: Redaction

Create FOIA Collection and Finding Aid

FOIA Collection

FOIA Finding Aid

Preservation Recover Passwords/ Decrypt Encrypted, or password protected files Repair Files corrupted by media deterioration or file transmission errors Conversion For some legacy file formats, there is not a viewer available

Resources for Preserving Records

Preservation: Conversion to a Viewable Format

Preservation: Record Converted to a Viewable Format

Research in Assisting Archivists in Processing E-Records Automatically filling in withdrawal information Automatic description of items, file units (folders), and record series

Documentary Forms of Presidential E-Records Agenda Bar Chart Biography Briefing Memo Decision Memo Correspondence Diary Executive Order Information Memo Job Application Lists Mailing List Memo Minutes of Meeting National Security Directive Newsletter Nomination to Federal Office Notes Presidential Statement Press Pool Report Press Release Recommended Telephone Call Referral Memo Resume Schedule Signature Memo Situation Report Summary Transcript of Speech Transcript of News Conference

Documentary Form Documentary form is “the rules of representation used to convey a message – that is, the characteristics of a document which can be separated from the determination of the particular subjects, or places it concerns. Documentary form is both physical and intellectual. The intellectual form of a document is "the sum of a record's formal attributes that represent and communicate the elements of the action in which the record is involved and of its immediate context, both documentary and administrative." The physical form of a document is “the overall appearance, configuration, or shape, derived from its material characteristics and independent of its intellectual content.” ( L. Duranti, Diplomatics: New Uses for an Old Science)

Grammar for the Documentary Form of a Memorandum

Document Type Recognition and Metadata Extraction Tokenizer Wordlist Lookup Sentence Splitter Hepple POS Tagger Named entity Transducer Intellectual Element Transducer + Rules for Intellectual Elements SUPPLE Parser + Document Type Grammars and semantics Extract Record Metadata

Parse Tree and Metadata Extracted from Record

Extracted Metadata Inserted in Withdrawal Form & Automatic Item Description Item Description: A memorandum, dated April 27, 1992 from EDE Holiday to Sam Skinner regarding California Earthquake.

PERPOS is Still Evolving PERPOS has evolved into a Prototype E-Record Repository and Archival Processing System. However, archivists have identified additional needs, for example, ◦Need for more precise search criteria such as search by:  Office, Series, Date, and Type of Document ◦Need to explore alternatives for providing E-FOIA Collections to Library Researchers. ◦Need for experience in processing

Summary: Research Results and Benefits Evolutionary Prototyping is a good strategy of system development when there is a need to learn more about the problem. The system evolves until the prototype meets all the needs and has thus evolved into a system. PERPOS ◦Has been demonstrated to support to a high degree both systematic and FOIA processing of e-records. ◦Environment for learning new requirements for processing electronic records and discovering new opportunities for improving the process. ◦Environment for exploring preservation strategies. ◦Environment for experimental application of advanced information technologies to support archival tasks.

Additional Information Publications: ◦D. Carter, B. Clement, S. Laib, and W. Underwood, “Results of Pilot Testing of FOIA Processing Using PERPOS.” ◦S. Oriabure, L. Spencer, and W. Underwood, “Launching E- Records with a PERPOS,” 2005 NAGARA Meeting. ◦S. Laib and W. Underwood, “FOIA Processing in the Presidential Electronic Records PilOt System.” ◦Underwood, et al. “Reference Manual for PERPOS: An Electronic Records Repository and Archival Processing System, Version 3.1.” These and other publications are available at:

Questions from the Audience Thank you!