NCSU Libraries Ingest Workflow Issues: Metadata North Carolina Geospatial Data Archiving Project Steve Morris North Carolina State University Libraries.

Slides:



Advertisements
Similar presentations
Training Structure Agenda Metadata Creation Considerations
Advertisements

The future’s so bright…. DAITSS DIGITAL PRESERVATION SYSTEM: RE-ARCHITECTED, RE- WRITTEN, AND OPEN SOURCE Priscilla Caplan Florida Center for Library Automation.
Spatial Data Infrastructure: Concepts and Components Geog 458: Map Sources and Errors March 6, 2006.
NDIIPP Project Update NC Geospatial Data Archiving Project (NCGDAP) North Carolina State University Libraries North Carolina Center for Geographic Information.
Collecting Digital Content Going Forward: Lessons Learned and New Initiatives NC Geospatial Data Archiving Project (NCGDAP) North Carolina State University.
Map Portals and Geoarchiving: New Opportunities in Geospatial Information Services Steve Morris Head of Digital Library Initiatives NCSU Libraries GIS.
Identification, Selection, and Appraisal within the North Carolina Geospatial Data Archiving Project (NCGDAP) NCSU Libraries Steve Morris Head of Digital.
StatCat Building a Statistical Data Finder ssrs.yale.edu/statcat Steven Citron-Pousty Ann Green Julie Linden Yale University.
Geospatial standards Beyond FGDC Geog 458: Map Sources and Errors March 3, 2006.
Archiving State and Local Agency Digital Geospatial Data: An Overview of the Problem Area Steven P. Morris Head of Digital Library Initiatives North Carolina.
Esri UC 2014 | Technical Workshop | Leveraging Metadata Standards for Supporting Interoperability in ArcGIS Aleta Vienneau, David Danko.
2006 ESRI International Users ConferenceAugust 8, 2006 Spatial Data Infrastructure and Data Preservation in North Carolina Jefferson F. Essic, Robert Farrell,
North Carolina Geospatial Data Archiving Project (NCGDAP) Project Overview Partnership –University library (NCSU) and state agency (NCCGIA) –$520,000 funding,
Metadata (for the data users downstream) RFC GIS Workshop July 2007 NOAA/NESDIS/NGDC Documentation.
Collection and Preservation of At-Risk Digital Geospatial Data: NDIIPP Project Update on the NC Geospatial Data Archiving Project (NCGDAP) Steven P. Morris.
Copyright © 2008, Open Geospatial Consortium, Inc., All Rights Reserved. NDIIPP Partnership Update: North Carolina and Multi-state Demonstration Projects.
State and Local Agency Digital Geospatial Data Preservation The North Carolina Experience Steve Morris NCSU Libraries Earth Sciences Information Partners.
NCSU Libraries Digital Repository Projects at the North Carolina State University Libraries James Jackson Sanborn Jim Tuttle Open Repositories/DSpace User.
North Carolina Geospatial Data Archiving Project (NCGDAP) JISC/NDIIPP Joint Digital Preservation Workshop – May 2006 Presented by: Rob Farrell, Steve Morris,
2008 EPA and Partners Metadata Training Program: 2008 CAP Project Geospatial Metadata: Introduction Module 2: FGDC CSDGM Metadata Compliancy.
1 Integrated Services Program The Virginia Metadata Training Workshop Summer, 2006 Lyle Hornbaker Integrated Services Program
Preservation of Digital Geospatial Data: Challenges and Opportunities Steve Morris Head of Digital Library Initaitives North Carolina State University.
The North Carolina Geospatial Data Archiving Project Steven P. Morris North Carolina State University Libraries Maintaining Long-Term Access to Geospatial.
Why Archiving and Preserving GIS Data Is Important Maps tell a compelling story of change over time. They document movement, progress, and change to the.
Collection Building Processes within the North Carolina Geospatial Data Archiving Project (NCGDAP) NCSU Libraries Steve Morris Head of Digital Library.
OGC ® © 2006 Open Geospatial Consortium, Inc.1 Introduction to Archives and Geospatial Issues ( Continued ) Steve Morris Head, Digital Library Initiatives.
Metadata Handling in the North Carolina Geospatial Data Project (NCGDAP) NCSU Libraries Steve Morris Head of Digital Library Initiatives Rob Farrell Geospatial.
GeoMAPP Project Overview and Conclusions Alec Bethune- NC Center for Geographic Information and Analysis Matt Peters- Utah Automated Geographic Reference.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
NCSU Libraries 27 March 2006 Digital Preservation in State Government – Wilmington, NC North Carolina Geospatial Data Archiving Project Workflow, Tools,
LTER Information Management Training Materials LTER Information Managers Committee Documenting Spatial Data Theresa Valentine Andrews LTER.
Archival Information Packages for NASA HDF-EOS Data R. Duerr, Kent Yang, Azhar Sikander.
Cooperative Project with Library of Congress on Preservation of Digital Geospatial Data Steve Morris Head of Digital Library Initiatives NCSU Libraries.
Preserving State and Local Government Digital Geospatial Data Steve Morris Head of Digital Library Initiatives North Carolina State University Libraries.
Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head.
The FCLA Digital Archive Joint Meeting of CSUL Committees, 2005.
Long-Term Preservation of At- Risk Digital Geospatial Data: A Cooperative Agreement with Library of Congress Steve Morris NCSU Libraries Zsolt Nagy NC.
GeoMAPP: Using Metadata to Help Preserve Geospatial Content Matt Peters, Utah’s Automated Geographic Reference Center Glen McAninch, Kentucky Department.
North Carolina Geospatial Data Archiving Project : Cooperative Project with Library of Congress on Preservation of Digital Geospatial Data Partners: NCSU.
Collection and Preservation of At- Risk Digital Geospatial Data: the North Carolina NDIIPP Project Partners: NCSU Libraries Project Lead: Steve Morris.
NCPMA Fall MeetingOctober 11, 2006 GIS Data Preservation: Partnership with Library of Congress Steve Morris North Carolina State University Libraries.
NCSU Libraries 9 October 2006 EPA Meeting Preservation Partnership with Library of Congress: NDIIPP and the North Carolina Geospatial Data Archiving Project.
Long-term preservation of digital geospatial data: challenges for ensuring access and encouraging reuse Anne Robertson, EDINA & Steve Morris, NCSU Libraries.
PREMIS Implementation Fair, San Francisco, CA October 7, Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.
Archiving Geospatial Data: Background to the Problem Area State Government Users Committee October 16, 2008 Steve Morris, NCSU Libraries.
ESRI International Users ConferenceJune 20, 2007 Data Snapshot Archiving: A Frequency of Capture Survey Steve Morris Jeff Essic North Carolina State University.
Preserving Geospatial Data: Challenges and Opportunities Steve Morris NCSU Libraries Indo-US Workshop on Trends in Digital Preservation March 24, 2009.
Preserving Digital Geospatial Data: The NC Geospatial Data Archiving Project (NCGDAP) Steven P. Morris North Carolina State University Libraries CRADLE.
Geospatial Data Preservation Challenges at the Sub-National Level: The North Carolina Experience Steve Morris Head of Digital Library Initiatives North.
NCSU Libraries 13 June 2006 JCDL 2006 NDIIPP Preservation Network: Progress, Problems, and Promise Jim Tuttle, Geospatial Data Librarian.
NDIIPP Project: North Carolina Geospatial Data Archiving Project Partners: NCSU Libraries Project Lead: Steve Morris NC Center for Geographic Information.
North Carolina Geospatial Data Archiving Project/NDIIPP: Collection and preservation of at- risk digital geospatial data Partners: NCSU Libraries Project.
GISC Seminar: Towards Uncharted GroundSeptember 29, 2006 North Carolina Partnership with Library of Congress on Long-term Preservation of Digital Geospatial.
NDIIPP Project: Collection and Preservation of At-Risk Digital Geospatial Data Partners: NCSU Libraries Project Lead: Steve Morris NC Center for Geographic.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Models for Shared Responsibility: Collaboration and Engagement with the NCGDAP and GeoMAPP Partnerships Steve Morris North Carolina State Libraries Zsolt.
Mountain Region GIS Advisory Council Meeting September 15, 2006 Long-Term Preservation of Digital Geospatial Data: A Cooperative Project with Library of.
Preservation Strategies in the North Carolina Geospatial Data Archiving Project (NCGDAP) NCSU Libraries Steve Morris Head of Digital Library Initiatives.
Repository-specific Spoke Scripts Content Repository JSR-170/283 Content Repository for Java Technology API Normalized H&S METS Files METS Import/ExportMETS.
North Carolina Geospatial Data Archiving Project/NDIIPP: Collection and preservation of at-risk digital geospatial data Partners: NCSU Libraries NC Center.
Joint Meeting of CSUL Committees,
Jim Tuttle North Carolina State University Libraries
Preservation of State and Local Government Digital Geospatial Data: The North Carolina Geospatial Data Archiving Project Steven P. Morris, James Tuttle,
Preserving Digital Geospatial Data: The NC Geospatial Data Archiving Project (NCGDAP) Steven P. Morris North Carolina State University Libraries CRADLE.
Long-Term Preservation of At-Risk Digital Geospatial Data: The North Carolina Geospatial Data Archiving Project Steve Morris NCSU Libraries.
Collecting Digital Content Going Forward: Lessons Learned and New Initiatives NC Geospatial Data Archiving Project (NCGDAP) North Carolina State University.
Preserved Digital Content: Collections, Value, and Stewardship NC Geospatial Data Archiving Project (NCGDAP) North Carolina State University Libraries.
NORTH CAROLINA state and local government METADATA PROFILE
Robin Dale RLG OAIS Functionality Robin Dale RLG
ESRM 250/CFR 520 Autumn 2009 Phil Hurvitz
Presentation transcript:

NCSU Libraries Ingest Workflow Issues: Metadata North Carolina Geospatial Data Archiving Project Steve Morris North Carolina State University Libraries

NCSU Libraries How the Data is Received Data is delivered as is – no control over organization of received data Contributing organizations –County and municipal agencies –State agencies –Regional councils of government Data transfer modes –CD/DVD, External Drive –FTP or Web Download

NCSU Libraries Ingest Challenges: General Data consists of multi-file, multi-format objects Ancillary data files can be shared by datasets Some formats require conversion now Some format conversions involve one-to-many relationships Compressed archive files are common and behave unpredictably And all the usual challenges: format validation, validity checking, threat scanning,…

NCSU Libraries Ingest Challenges: Metadata Metadata is encoded in a variety or ways –The FGDC content standard for metadata lacked an encoding standard (arrived pre-XML), will soon be addressed in ISO 19115/19139 FGDC implementation –XML (varied schemas), TXT, HTML Metadata is missing –Only about 25% of local agencies use FGDC Metadata is wrong –Metadata is commonly asynchronous with the data Inconsistent use of dataset naming, etc.

NCSU Libraries Some Key Decisions Capture “transfer set” metadata Normalize, synchronize, and remediate existing metadata, and retain original metadata record Treat contact information as archival Update metadata with format conversions Use ESRI Profile of FGDC –added technical and administrative elements –Has an XML schema –ArcCatalog tool support Use simple rights encoding scheme Record metadata in a workflow management database

NCSU Libraries What is Transfer Set Metadata? Administrative and technical metadata associated with a transfer device or download Propagates to individual data objects PHP Application Interface for Transfer Set Metadata Capture

NCSU Libraries If No Metadata, What Then? Autoextract a subset of technical and descriptive metadata through ArcCatalog Apply an agency-specific metadata template (many elements are static within the context of the agency) Acquire information from the NC OneMap Inventory –Data Source –Contact Info –Datum, Coordinate System Acquire information from agency web site Avoid direct inquiries to local agencies (“contact fatigue”)

NCSU Libraries What Gets Remediated and Why? Key technical elements that are wrong –Datum, coordinate system, format, … Title –Qualify to the agency (e.g. “Streets” becomes “Henderson County Streets”) Keywords –Add ISO keywords –NCSU GIS Lookup terms added later if needed for access These are basic requirements for access and use

NCSU Libraries Metadata Tools ArcCatalog –Automated metadata extraction ArcGIS Toolbar –Metadata synchronization, normalization, templating cns and mp –Raw text handling Python classes –Ingest workflow

NCSU Libraries Source Metadata Translation Hub-and-spoke model a la Echo DEPository –repository agnostic –modular conversion hub –facilitate repository software migration & inter-archive exchange

NCSU Libraries What is the Rights Encoding? Purpose: Define a basic set of codes to hold dataset rights information in a script-actionable form. To assign related text for use in constructing brief rights statements. Propagates to individual data objects Structure: Codes are assigned on a fixed string position basis. Rights assigned to particular user types are grouped after a flag character for that user group. Initial User Groups: –NCSU Faculty/Staff/Students (Code “N”) –General Public (Code “P”) –Library of Congress (Code “L”) Initial Rights Types: –Use –Redistribute –Commercial Use

NCSU Libraries Sample Rights Record M01N110P110L110 Interpretation: This dataset was acquired in a mediated transaction directly from the data producer (acquired on media or via arranged download). There is no data agreement but there is a data disclaimer. NCSU, General Public, and LC all can use and redistribute the data but commercial use is not allowed.

NCSU Libraries Deferred Activities Implementing METS and PreMIS Developing a serial object metadata scheme

NCSU Libraries Ongoing Challenges When to automate and when not to –Learn first from human intervention –Minimizing risk of error related to human intervention Accepting that ingest packages used will evolve over time (implications for archive?) Handling post-ingest migrations

NCSU Libraries Engagement Opportunities NCGDAP partner NCCGIA runs the NC OneMap Metadata Outreach Program Provide feedback to spatial data infrastructure about metadata inconsistencies, lack of adherence to best practices Partner with industry and standards organizations on addressing metadata issues such as poor standards support for versioned data (e.g., through OGC Data Preservation Working Group)