DArcMail Demonstration D igital Arc hive e Mail System Riccardo Smithsonian Institution Archiving.

Slides:



Advertisements
Similar presentations
Research Data Access and Preservation Summit Panel 2 - Promoting Re-Use of Scientific Collections Some responses to the questions posed... John Harrison.
Advertisements

Exchange… …and the Vault Anton Lawrence IT Services 6 th March 2007.
OCLC Digital Archive Overview Judith Cobb LIPA Meeting July 2006.
Fedora Users’ Conference Rutgers University May 14, 2005 Researching Fedora's Ability to Serve as a Preservation System for Electronic University Records.
Office 365 Education packages
Transferred 89,000+ messages XML preservation formats Account-centricMessage-centric.
The PeDALS approach.  Pete Watters Arizona State Library, project coordinator  Richard Pearce-Moses Clayton State University, Georgia,
Depositing e-material to The National Library of Sweden.
Preserving a Born-Digital Archive: The H-Net Lists Lisa M. Schmidt MATRIX: The Center.
JATS for Ejournals and BITS for Ebooks-- Adopting BITS for Scholars Portal Ebook Repository JATS conference April 22, 2015.
Brief Overview of Major Enhancements to PAWN. Producer – Archive Workflow Network (PAWN) Distributed and secure ingestion of digital objects into the.
1 CS 502: Computing Methods for Digital Libraries Lecture 22 Repositories.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
Extending the Lifecycle of Scientific Field Notes: Making Hidden Collections Reusable Riccardo Ferrante Smithsonian Institution Rusty.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
Archival Prototypes and Lessons Learned Mike Smorul UMIACS.
Developing PANDORA Mark Corbould Director, IT Business Systems.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Data Preservation Best Practices for preserving your research data for future reuse The goal of data preservation is to ensure that your data is in a sustainable.
Mailbox Cleanup. Preventive Measures ●Security: Unlimited mailbox sizes opens RCG to a potential denial- of-service.
Archiving Unit 13 – Use of s. archiving archiving is a process for downloading, keeping and protecting all inbound and outbound.
RECORDS MANAGEMENT AND THE WEB Presented by Jennifer Wright, Archives and Information Management Team and Lynda Schmitz Fuhrig, Electronic Records Division.
E-Domec Electronic archving and document management in the Commission.
Section 6.1 Explain the development of operating systems Differentiate between operating systems Section 6.2 Demonstrate knowledge of basic GUI components.
World Bank, Africa Region, Africa Household Survey Databank - The World Bank - Africa.
October 16-18, Research Data Set Archives Steven Worley Scientific Computing Division Data Support Section.
A Dynamic Solution for Electronic Records: The National Archives & Records Administration’s Electronic Records Archives Kenneth Thibodeau, Director Electronic.
ORGANIZING AND STRUCTURING DATA FOR DIGITAL PROJECTS Suzanne Huffman Digital Resources Librarian Simpson Library.
Society of American Archivists 2008 Annual Meeting Society of American Archivists 2008 Annual Meeting Capturing the E-Tiger: New Tools for Preservation.
Electronic Mail List Preservation Takes Off: The H-Net Archive Lisa M. Schmidt MATRIX: The Center.
Preserving Electronic Mailing Lists: The H-Net Archive H-Net Mapped to the OAIS Model Preservation AssessmentPreservation improvementsOverview How H-Net.
Finding a New Way Richard Pearce-Moses Deputy Director for Technology & Information Resources Arizona State Library, Archives and Public Records Using.
ARCHIVING S. WHAT IS ARCHIVING S Archiving is the act of preserving and making searchable all to/from an individual. archiving.
1 A journey of a thousand miles begins with a single step. Chinese Proverb.
Johannes Spitzbart Phonogrammarchiv, Austrian Academy of Sciences Österreichische Tage der Digitalen Geisteswissenschaften save the data - workshop on.
Archiving Unit 13 – Use of s. archiving archiving is the act of storing something instead of deleting it so that you can view it.
Archiving Where did I put that mail?. Business criticity Importance to manage : –Authenticity –Integrity –Perennity –Compliance High TCO of mail.
WLAP: Improving acquisition Workshop on digital video archiving 22 June 2001, CERN Hector Sanchez San Martin Universitat Jaume I Ing. Informatica CERN.
Module 9 Configuring Messaging Policy and Compliance.
Access Across Time: How the NAA Preserves Digital Records Andrew Wilson Assistant Director, Preservation.
Web Archiving and Access Mike Smorul Joseph JaJa ADAPT Group University of Maryland, College Park.
Records Management and the Center for Folklife and Cultural Heritage Presented by Jennifer Wright Smithsonian Institution Archives Records Management Team.
Informational Objects TypeExamples 1. Structured Items Vouchers, Travel Orders, Invoices, Purchase Orders 2. Semi-Structured Items Letters, Memoranda,
Archiving Kealie Cox.. File sizes. If you have an that is too big to send, there are a couple of things that you could try: Copy & paste the text.
Services for Object Storage and Preservation March 2008 All content in these slides is considered work in progress. In no way does it represent an absolute.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
NOAA/NESDIS/National Oceanographic Data Center Following the Flow of Two Underway Data Streams Within the U. S. National Oceanographic Data Center Steven.
ViciDocs Safe Creating Info repositories from documents.
Carcanet Case Study Fran Baker, John Rylands University Library University of Manchester SPRUCE event 19 January 2012.
DAITSS: Dark Archive in the Sunshine State Priscilla Caplan Florida Center for Library Automation (FCLA)
Hussein Suleman University of Cape Town Department of Computer Science Digital Libraries Laboratory February 2008 Data Curation Repositories:
 ePADD Demo SAA 2015 Cleveland, OH. Schedule 4:00-4:10 Introduction and Overview 4:10-4:30 Demo 4:30-4:35 Community Tools 4:35-4:40 Testing foreign scripts.
29 March 2004 Steven Worley, NSF/NCAR/SCD 1 Research Data Stewardship and Access Steven Worley, CISL/SCD Cyberinfrastructure meeting with Priscilla Nelson.
Softwaretechnologie für Fortgeschrittene Teil Eide Stunde III: Introducing the media server (with contributions from Christian-Emil Ore, Jon Holmen, and.
EVLA Data Processing PDR E2E Data Archive System John Benson, NRAO July 18, 2002.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
Surveying and Scheduling Records of OCIO Presented by Jennifer Wright Smithsonian Institution Archives Records Management Team February 16, 2005.
Surveying and Scheduling Records of SCEMS Presented by Ginger Yowell & Mitch Toda Smithsonian Institution Archives Records Management Team October 2, 2007.
Repository-specific Spoke Scripts Content Repository JSR-170/283 Content Repository for Java Technology API Normalized H&S METS Files METS Import/ExportMETS.
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
How to Convert OLM to Mac Mail Immediately?
Digital Preservation What, Why, and How? Dan Albertson’s Digital Libraries Class April 13, 2016 Jody DeRidder Head, Metadata & Digital Services University.
Lessons we’ve learned – pragmatic digital records.
Fitting into an Appraisal, Accessioning, Processing, Discovery, and Delivery Workflow Chris Prom, University of Illinois at Urbana Champaign.
DXL to PST Converter presents
Topics in Born Digital Archiving
AMRDEC Test Facility Improvement Project
MBOX Converter Convert MBOX to EML, MSG, RTF & HTML
Outlook Recovery Freeware is the professional tool to fix Outlook Error and PST corruption.
Preservation at the Bentley Historical Library
Presentation transcript:

DArcMail Demonstration D igital Arc hive e Mail System Riccardo Smithsonian Institution Archiving Stewardship Tools Workshop Harvard University

data points Earliest dated in the late 1980’s First preserved digitally in 2005 Largest account preserved during CERP 80K s Favorite example of large account 250,000+ s Largest account to date = 30 Gb ???,??? s Most recent account acquired last week 20 GB Primary processing and preservation tool DArcMail Some of the Smithsonian’s platforms over the past 35 years.

Introducing a successor to the CERP Parser DArcMail

CERP Parser Works on one message or a whole account Does preservation: MBOX to XML Generates metadata files and attachments directory, etc. (i.e., the “package”) All components are open source, but – Squeak is not a popular platform – Raw XML is ugly – GUI is the order of the day

DArcMail in the SI Archives Context Appraisal is a precondition to acquisition. Documentation of accessions, their accessions, etc. happens in SIA’s collection management system (CMS). Digital preservation is as preemptive as possible; it begins as soon as an accession is finalized. Storage packages manually transferred to separate server and LTOs.

DArcMail CERP Parser functions plus searching, exporting Simple GUI 4x faster processing Runs on Python and MySQL Puts understanding the account first, preservation second

DArcMail Lifecycle stages outside DArcMail’s scope Appraisal, Capture and preliminary normalization if needed – MS Outlook for PSTs; MBOX client for other formats; Aid4Mail, MessageSave for preliminary normalization Sensitive Data Processing – MS Outlook for PSTs; MBOX client for other formats Repository – Transfer to spinning disk, tape Access – Online Discovery