National Archives and Records Administration1 Integrated Rules Ordered Data System (“IRODS”) Technology Research: Digital Preservation Technology in a.


1 Integrated Rules Ordered Data System (“IRODS”) Technology Research: Digital Preservation Technology in a SOA Technical Context. Robert Chadduck, Principal Technologist, Electronic Records Archives Program, The National Archives and Records Administration

2 Synopsis of the 18 April 2007 invited presentation by Dr. Reagan Moore, Distinguished Scientist, San Diego Supercomputer Center, to the NITRD HCI&IM Coordinating Group

3 Open source, university-based technology research, collaboratively supported by NSF/Office of CyberInfrastructure and NARA

4 Scientific Data Collections. Reagan W. Moore, Wayne Schroeder, Mike Wan, Arcot Rajasekar, Richard Marciano. {moore, schroede, mwan, sekar, marciano}@sdsc.edu http://www.sdsc.edu/srb http://irods.sdsc.edu/

5 Data Collections. NSF Cyberinfrastructure projects: digital holdings for a scientific discipline. Simulation applications: output from supercomputers. Real-time sensor systems: observational data. Scientific laboratories: experimental data.

6 Scientific Data Management. Data collections; data organization; data grids; data sharing; data publication; digital libraries; data preservation; persistent archives. SDSC uses generic data grid technology to support all data management applications.

7

8 Data Management Challenges. Authenticity: manage descriptive metadata for each file; manage access controls; manage consistent updates to administrative metadata. Integrity: manage checksums; replicate files; synchronize replicas; federate data grids. Infrastructure independence: manage collection properties; manage interactions with storage systems; manage distributed data.
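The integrity items on this slide (checksums, replicas, synchronization) can be illustrated with a minimal sketch. It is not SRB or iRODS code: the function names, the in-memory `replicas` dictionary, the resource names, and the choice of SHA-256 are all assumptions made for illustration.

```python
import hashlib


def checksum(data: bytes) -> str:
    """Return the SHA-256 hex digest of a replica's bytes (hash choice is an assumption)."""
    return hashlib.sha256(data).hexdigest()


def find_stale_replicas(replicas: dict[str, bytes], registered: str) -> list[str]:
    """Return the names of replicas whose checksum differs from the registered one."""
    return sorted(name for name, data in replicas.items()
                  if checksum(data) != registered)


def synchronize(replicas: dict[str, bytes], registered: str) -> list[str]:
    """Overwrite stale replicas with a verified copy and return the repaired names."""
    good = next((d for d in replicas.values() if checksum(d) == registered), None)
    if good is None:
        raise ValueError("no replica matches the registered checksum")
    stale = find_stale_replicas(replicas, registered)
    for name in stale:
        replicas[name] = good
    return stale


# Hypothetical storage resources, each holding a copy of one file.
replicas = {"disk": b"record-0001", "tape": b"record-0001", "remote": b"record-00X1"}
registered = checksum(b"record-0001")
repaired = synchronize(replicas, registered)
```

Here the checksum recorded at ingest acts as the reference: any copy that no longer matches it is treated as damaged and replaced from a copy that still verifies.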

9 Generic Infrastructure. Data grids manage data distributed across multiple types of storage systems: file systems, tape archives, object ring buffers. Data grids manage collection attributes: provenance, descriptive, and system metadata. Data grids manage technology evolution: at the point in time when new technology becomes available, both the old and new systems can be integrated.

10 Data Grids. SRB (Storage Resource Broker): persistent naming of distributed data; management of data stored in multiple types of storage systems; organization of data as a shared collection with descriptive metadata, access controls, and audit trails. iRODS (integrated Rule-Oriented Data System): rules control execution of remote micro-services; manage persistent state information; validate assertions about the collection; automate execution of management policies.
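Persistent naming across multiple storage systems can be pictured as a catalog that maps one stable logical name to replicas whose physical locations may change. The sketch below is only an illustration of that idea: `LogicalCatalog`, `Replica`, the resource names, and the physical paths are hypothetical and do not reflect the actual SRB or iRODS catalog schema.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Replica:
    resource: str        # hypothetical name of a storage system
    physical_path: str   # where that system keeps the bytes


@dataclass
class LogicalCatalog:
    entries: dict[str, list[Replica]] = field(default_factory=dict)

    def register(self, logical_name: str, replica: Replica) -> None:
        """Record one more physical copy under a logical name."""
        self.entries.setdefault(logical_name, []).append(replica)

    def migrate(self, logical_name: str, old_resource: str, new: Replica) -> None:
        """Swap the copy on old_resource for a copy on a new storage system."""
        self.entries[logical_name] = [
            new if r.resource == old_resource else r
            for r in self.entries[logical_name]
        ]

    def resolve(self, logical_name: str) -> list[Replica]:
        """Return the current physical copies for a logical name."""
        return list(self.entries[logical_name])


catalog = LogicalCatalog()
name = "/tempZone/home/rods/nvo/image.fits"  # illustrative logical name
catalog.register(name, Replica("unixDisk", "/vault/a/image.fits"))
catalog.register(name, Replica("tapeArchive", "vol17:000042"))
# Retire the tape copy in favor of a newer storage system; the logical name is unchanged.
catalog.migrate(name, "tapeArchive", Replica("objectStore", "bucket/nvo/image.fits"))
```

Users keep referring to the same logical name before and after the migration, which is the sense in which the naming is persistent.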

11 Preservation Management iRODS - integrated Rule-Oriented Data System

12 Rule-based Data Management. Map from management policies to rules controlling execution of remote micro-services. Manage persistent state information for results of each micro-service execution. Support an additional three logical name spaces: rules, micro-services, persistent state information. Constitutes representation information for preservation environments.

13 Example Rules. A rule is composed of four parts: name | condition | micro-service set | recovery. Rule to automate replication of data for a specific collection:
  acPostProcForPut | $objPath like /tempZone/home/rods/nvo/* | msiSysReplDataObj(nvoReplResc,null) | nop
Rule types: internal, administrative, user-defined; atomic, deferred, periodic.
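To make the four-part layout concrete, the following sketch splits the slide's example rule on `|` and checks its `like` condition against a session variable. It is not the iRODS rule engine: `Rule`, `parse_rule`, `condition_holds`, and the `session` dictionary are hypothetical names, only the `like` operator is handled, and shell-style wildcard matching is assumed for the pattern.

```python
from dataclasses import dataclass
from fnmatch import fnmatchcase


@dataclass(frozen=True)
class Rule:
    name: str            # event the rule is attached to
    condition: str       # guard that must hold for the rule to fire
    micro_services: str  # micro-service set to execute
    recovery: str        # what to run if execution fails


def parse_rule(text: str) -> Rule:
    """Split 'name | condition | micro-service set | recovery' into its parts."""
    parts = [p.strip() for p in text.split("|")]
    if len(parts) != 4:
        raise ValueError(f"expected 4 parts, got {len(parts)}")
    return Rule(*parts)


def condition_holds(condition: str, session: dict[str, str]) -> bool:
    """Evaluate a '$variable like pattern' condition (the only form handled here)."""
    if not condition:
        return True
    variable, operator, pattern = condition.split(None, 2)
    if operator != "like":
        raise ValueError(f"unsupported operator: {operator}")
    return fnmatchcase(session.get(variable.lstrip("$"), ""), pattern)


EXAMPLE = ("acPostProcForPut | $objPath like /tempZone/home/rods/nvo/* "
           "| msiSysReplDataObj(nvoReplResc,null) | nop")
rule = parse_rule(EXAMPLE)
# Hypothetical session state for two uploads; only the first is in the nvo collection.
in_collection = condition_holds(rule.condition, {"objPath": "/tempZone/home/rods/nvo/image.fits"})
elsewhere = condition_holds(rule.condition, {"objPath": "/tempZone/home/rods/other/image.fits"})
```

Under these assumptions, a file put into the nvo collection satisfies the condition, so the replication micro-service would be selected, while a file put elsewhere would not trigger it.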

14 Management Virtualization. Standard policies expressed as rules. Integrity: validation of checksums; synchronization of replicas; data distribution; data retention; access controls. Authenticity: chain of custody (audit trails); required preservation metadata (templates); generation of AIPs and DIPs.

15 New Capabilities. Management capabilities: rules to validate assessment criteria; access controls on rules; time-dependent access controls; access controls on each micro-service; redaction, access controls on structures in a file; rule to parse audit trails and verify consistency of the system. Data grid evolution: dynamic addition of new rules, micro-services, and persistent state information; rules to control migration from old management policies to new management policies. Federation: migration of rules and micro-services with data.
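One way to picture a time-dependent access control is a grant that only counts inside a validity window. The sketch below assumes that reading; `TimedGrant`, `is_allowed`, the user name, and the dates are hypothetical and are not taken from iRODS.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class TimedGrant:
    user: str
    permission: str
    not_before: datetime  # grant is ignored before this instant
    not_after: datetime   # grant is ignored after this instant


def is_allowed(grants: list[TimedGrant], user: str, permission: str,
               when: datetime) -> bool:
    """True if some grant for this user and permission covers the given time."""
    return any(
        g.user == user and g.permission == permission
        and g.not_before <= when <= g.not_after
        for g in grants
    )


# Hypothetical grant: read access for one reviewer during calendar year 2007.
grants = [TimedGrant("reviewer", "read",
                     datetime(2007, 1, 1), datetime(2007, 12, 31, 23, 59, 59))]
during = is_allowed(grants, "reviewer", "read", datetime(2007, 4, 18))
after = is_allowed(grants, "reviewer", "read", datetime(2008, 1, 1))
```

The same check could in principle guard a rule or a micro-service rather than a file, which is how the slide's other access-control items might be read.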

16 Federation Between Data Grids. [Diagram: two federated data grids, one holding Data Collection A and one holding Data Collection B, reached through data access methods (Web Browser, DSpace, OAI-PMH). Each data grid maintains its own logical name spaces: resource, user, file, rule, micro-service name, and persistent state.]

17 Digital Preservation. Preservation is communication with the future: how do we migrate records onto new technology (information syntax, encoding format, storage infrastructure, access protocols)? The SRB (Storage Resource Broker) data grid provides the interoperability mechanisms needed to manage multiple versions of technology. Preservation manages communication from the past: what information do we need from the past to make assertions about preservation assessment criteria (authenticity, integrity, chain of custody)? iRODS - integrated Rule-Oriented Data System.

18 For More Information. Reagan W. Moore, San Diego Supercomputer Center. moore@sdsc.edu http://www.sdsc.edu/srb/ http://irods.sdsc.edu/

19 For Additional Information and Developments: http://irods.sdsc.edu/index.php/Main_Page

20 Robert Chadduck, Principal Technologist, Electronic Records Archives Program, The National Archives and Records Administration. Telephone: 301-827-1585. robert.chadduck at nara.gov

