Presentation on theme: "21 st Century Archives: Architecting the Future Adam Jansen, CRM, MIT, MCP, CDIA DKives Consulting NDPP -- Mar 2009."— Presentation transcript:
21 st Century Archives: Architecting the Future Adam Jansen, CRM, MIT, MCP, CDIA DKives Consulting NDPP -- Mar 2009
Washington State Digital Archives 80+ million public records in the system 950 Researchers per day 130+ foreign countries in one month 25 categories of records (land, court, photos, web pages, , audio recordings) 550 state web pages spidered – 2.5 Million web files % Uptime 3300 state and local agencies as potential customers
Standards Driven Open Archival Information System – ISO Standard for electronic records archiving DOD – ISO Standard for Records Management Applications InterPARES – International effort to define requirements for e-archiving INCITS/V1 committee for OpenXML
DA Responsibilities Negotiate and accept records from producers Control to level needed for long term Understand target audience Ensure independence, understandable FOLLOW DOCUMENTED PROCESS Make records accessible
What is Archiving in the Electronic Age? Protecting machine readable records of enduring legal, historical or fiscal value from loss, alteration, deterioration and technological obsolescence in a environment independent from that which produced the record.
Shifting Media Historically records were stored on paper, kept in filing cabinets – When the cabinet was full, records sent to file room Now records stored electronically on computers – When the computer is full – add more hard drives Basic skills to manage and maintain records have been lost, replaced by infinite storage
So the question becomes… who takes care of the records, and do they have the knowledge?
IMPLEMENTING A DIGITAL ARCHIVES
Brick and Mortar Delivery Loading Dock Processing Area The Stacks Research Room
Records Transportation Preferred method is Internet delivery – Can be VERY slow Also accept on optical, hard drive, tape Have multiple layers of politics to satisfy Large number of small files creates problems Provide removable media devices to partner/clients
10 TB transfer example 100MBS = 27.8 Hours 50MBS = 55.6 Hours USB 1.5 MBS = Hours USB 60 MBS = 46.3 Hours SATA 150 MBS = 18.5 Hours SATA 300 MBS = 9.3 Hours 30 MBS = 92.6 Hours SDLT 16 MBS = Hours Magneto MBS = Hours
PDI Example Ethernet adapter Local Area Connection: Connection-specific DNS Suffix. : annex.co.franklin.wa.us Description : Intel(R) PRO/100 VE Physical Address : 00-0D-60-3C DHCP Enabled.... : Yes Autoconfiguration Enabled.... : Yes IP Address : Subnet Mask : Default Gateway : DHCP Server : DNS Servers : , Primary WINS Server : Secondary WINS Server : Lease Obtained : Tuesday, August 23, :34:25 AM Lease Expires : Thursday, September 22, :34:25 AM
Landing Pad Data must be clean Quarantine and recheck Outside of permanent repository Ease to send, not remove Provide secure storage until processing
Clean and Prep Records Verify records for completeness Send rejects back to agency Assemble records for ingestion Create/note missing metadata Workflow for file conversion – Preservation file format – Presentation file format Move into database, backup
Archival Storage Receive Data – Move to permanent storage Expect frequency – Confirmation and location Manage Storage – Based on policy, statistics – Monitor error logs – Ease of expanse, growth on demand Replace Media – Sampling – Life expectancy, technology changes
Management of Storage Store records in open, non-proprietary method The only constant is change Maintain authenticity, integrity Allow for future migration, conversion MUST be easy to expand and change Flexibility is the key!!!
17 th Century So long as men can breath, and eyes can see, So long lives this, and this gives life to thee. -William Shakespeare, Sonnet XVIII (1609) 21 st Century As long as the media is still readable, a drive available, the software can be loaded, the hardware still runs and file can be found!