
1 NCSU Libraries 27 March 2006 Digital Preservation in State Government – Wilmington, NC
North Carolina Geospatial Data Archiving Project: Workflow, Tools, and Resources
Jim Tuttle, Geospatial Data Librarian

2 Project Overview
Partnership between university library (NCSU) and state agency (NCCGIA)
One of eight projects in the first NDIIPP funding round: "Building a Network of Partners"
Focus on state and local geospatial content in North Carolina
Objective: engage existing state/federal geospatial data infrastructures in preservation

3 Content Complexity
Multi-file objects
Spatial databases
Ancillary data files
Time-versioning
Diverse data sources/metadata practices

4 Workflow Overview
Acquisition
Format Migration
Submission Information Package (SIP) Creation
Ingest Metadata

5 Acquisition: Workflow
Collection creation/declaration
File Manifest
Metadata Seed File
Transfer data to processing machine
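
The file manifest step might look something like the sketch below. The slides do not describe the manifest format, so the function names (`build_manifest`, `format_manifest`) and the tab-separated path/size layout are assumptions made purely for illustration.

```python
import os


def build_manifest(root):
    """Return a sorted list of (relative_path, size_in_bytes) for every file under root."""
    entries = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            full_path = os.path.join(dirpath, name)
            # Store paths relative to the collection root, with forward slashes,
            # so the manifest reads the same on any platform.
            rel_path = os.path.relpath(full_path, root).replace(os.sep, "/")
            entries.append((rel_path, os.path.getsize(full_path)))
    return sorted(entries)


def format_manifest(entries):
    """Render manifest entries as tab-separated lines, one file per line (assumed layout)."""
    return "\n".join(f"{path}\t{size}" for path, size in entries)
```

Recording the manifest before transfer gives the processing machine something to compare against once the data arrives.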

6 Acquisition: Tools and Resources
PHP/PostgreSQL form
Python automation scripting
Threat analysis
–ClamAV
–Unix ‘file’ utility
jjtuttle@dli:~/$ file putty
putty: MS-DOS executable (EXE), OS/2 or MS Windows
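
The Unix 'file' utility identifies content by its leading "magic" bytes rather than by file name. The sketch below imitates a very small part of that idea in Python; it is not how the project's scripts worked, and the signature table and function names are hypothetical and far smaller than the real magic database.

```python
# Hypothetical, deliberately tiny signature table. The real `file` utility
# consults a much larger magic database.
SIGNATURES = [
    (b"MZ", "DOS/Windows executable"),
    (b"\x7fELF", "ELF executable"),
    (b"PK\x03\x04", "ZIP archive"),
    (b"%PDF-", "PDF document"),
]


def sniff(header):
    """Return a label for the first signature that matches the leading bytes."""
    for magic, label in SIGNATURES:
        if header.startswith(magic):
            return label
    return "unknown"


def sniff_file(path, length=8):
    """Read the first few bytes of a file and classify them with sniff()."""
    with open(path, "rb") as handle:
        return sniff(handle.read(length))
```

A check like this could flag an executable arriving inside a geospatial data delivery regardless of what extension it carries.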

7 Acquisition: Tools and Resources
MD5 checksum
–jjtuttle@dli:~/$ md5sum O-view.vsd
–69b3e2f6cff1537bd607f5522d0c5c4d O-view.vsd
JHOVE
Format registries
–PRONOM (UK National Archives), GDFR (Harvard/Mellon), Fred (LC)
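
The same digest that md5sum prints can be computed from a Python automation script with the standard hashlib module. This is a minimal sketch; the function names and the chunk size are illustrative choices, not taken from the project.

```python
import hashlib


def md5_of_file(path, chunk_size=65536):
    """Return the hexadecimal MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Chunked reads keep memory use flat for large geospatial files.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected_hex):
    """Return True if the file's MD5 digest matches a previously recorded value."""
    return md5_of_file(path) == expected_hex.strip().lower()
```

Comparing the digest recorded at acquisition with one computed after transfer shows whether a file changed in transit.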

8 Format Migration: Workflow
On-receipt migration of selected formats
Object-level metadata creation/augmentation

9 Format Migration: Tools and Resources
Python batch process wrappers
ArcCatalog metadata templates

10 SIP Item Creation: Workflow
Submission Information Package grouping
–Ontology logic based on defined multi-file complex format components and directory structure
Repository-agnostic item grouping

11 SIP Item Creation: Tools and Resources
Python scripts highly dependent on:
–Explicit understanding of ontological relationships of complex format components
–Logical directory structure as dictated by data-producer software
Spreadsheet illustrating item assignment for manual review
Automated revision of assignment based on spreadsheet modifications
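
One way to picture the grouping logic is with a multi-file format such as the ESRI shapefile, whose components share a base name within a directory. The slides do not name the formats or the rules the project used, so the extension list and the functions `item_key` and `group_items` below are assumptions for illustration only.

```python
import posixpath
from collections import defaultdict

# Assumed shapefile component extensions. ".shp.xml" comes first so the
# longer suffix is matched before ".shp" would be considered.
SHAPEFILE_EXTENSIONS = (".shp.xml", ".shp", ".shx", ".dbf", ".prj", ".sbn", ".sbx")


def item_key(path):
    """Map a forward-slash path to (directory, item name) for grouping."""
    directory, name = posixpath.split(path)
    lowered = name.lower()
    for ext in SHAPEFILE_EXTENSIONS:
        if lowered.endswith(ext):
            # Shapefile components collapse onto their shared base name.
            return (directory, name[: -len(ext)])
    # Anything else is treated as a standalone item (a simplification).
    return (directory, name)


def group_items(paths):
    """Group file paths into candidate SIP items keyed by (directory, item name)."""
    items = defaultdict(list)
    for path in sorted(paths):
        items[item_key(path)].append(path)
    return dict(items)
```

The resulting mapping is the kind of assignment that could be written to a spreadsheet for manual review. A rule this simple would misfile a standalone .dbf table, which is one reason a human check on the grouping is useful.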

12 Ingest Metadata: Workflow
Extraction of elements from multiple sources
Crosswalk metadata to archive ingest record (DSpace Qualified Dublin Core), METS, and external Workflow Management Database
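
A crosswalk of this kind can be sketched with Python's standard XML library. The example assumes the source record is FGDC CSDGM metadata, which the slides do not state, and the mapping table is a hypothetical subset chosen for illustration rather than the project's actual crosswalk.

```python
import xml.etree.ElementTree as ET

# Hypothetical mapping from FGDC element paths (relative to the <metadata>
# root) to DSpace qualified Dublin Core field names.
CROSSWALK = [
    ("idinfo/citation/citeinfo/title", "dc.title"),
    ("idinfo/citation/citeinfo/origin", "dc.contributor.author"),
    ("idinfo/descript/abstract", "dc.description.abstract"),
    ("idinfo/keywords/theme/themekey", "dc.subject"),
]


def crosswalk(fgdc_xml):
    """Return {dc_field: [values]} extracted from an FGDC metadata string."""
    root = ET.fromstring(fgdc_xml)
    record = {}
    for source_path, dc_field in CROSSWALK:
        values = [
            element.text.strip()
            for element in root.findall(source_path)
            if element.text and element.text.strip()
        ]
        if values:
            # Repeatable elements, such as keywords, keep all of their values.
            record[dc_field] = values
    return record
```

Keeping the mapping in a table separate from the extraction code makes it easier to add further targets, such as a METS record, without touching the parsing logic.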

13 Ingest Metadata: Tools and Resources
Python XML libraries
XSL/XSLT
NOID (Nice Opaque Identifier) Persistent Identifier

14 Conclusion
Plenty of free, open-source tools
The robustness of an ingest process must be inversely proportional to the demands placed on data producers in preparation for ingest
Finding the balance between cost-saving automation and the accuracy and flexibility of human intervention is difficult

15 For More Information
NCGDAP – North Carolina Geospatial Data Archiving Project http://www.lib.ncsu.edu/ncgdap/
NDIIPP – National Digital Information Infrastructure and Preservation Program http://www.digitalpreservation.gov/
ClamAV http://www.clamav.net/
Unix File utility: ‘man file’
JHOVE – JSTOR/Harvard Object Validation Environment http://hul.harvard.edu/jhove/
PRONOM Format Registry http://www.nationalarchives.gov.uk/pronom/
GDFR – Global Digital Format Registry (in planning) http://hul.harvard.edu/gdfr/
Fred Format Registry (proof-of-concept) http://tom.library.upenn.edu/cgi-bin/fred?cmd=Default&sid=ca21d10e67b269a75a98fe369d2ab670
XSLT – eXtensible Stylesheet Language Transformations http://www.w3.org/TR/xslt
NOID – Nice Opaque IDentifier http://www.cdlib.org/inside/diglib/ark/
Jim Tuttle, Geospatial Data Librarian jim_tuttle at ncsu dot edu

