Preserving Digital Geospatial Data: The NC Geospatial Data Archiving Project (NCGDAP) Steven P. Morris North Carolina State University Libraries CRADLE.

Slides:



Advertisements
Similar presentations
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
Advertisements

GeoMAPP Business Planning: Developing Materials to Get Stakeholder Buy-in Alec Bethune, North Carolinas Center for Geographic Information and Analysis.
Data Management: Metadata, Repositories and Curation Tony Mathys, Anne Robertson Eddie Boyle, Guy McGarva GeoForum, 4 th November, York.
GeoSpatial MultiState Archive and Preservation Partnership State and Local Agency Geospatial Resources Content Transfer, Demonstration, and Learning Project.
NDIIPP Project Update NC Geospatial Data Archiving Project (NCGDAP) North Carolina State University Libraries North Carolina Center for Geographic Information.
The Disappearing Data Problem: Preserving Today's Geospatial Data to Meet Tomorrow's Temporal Analysis Needs Steve Morris Head of Digital Library Initiatives.
Collecting Digital Content Going Forward: Lessons Learned and New Initiatives NC Geospatial Data Archiving Project (NCGDAP) North Carolina State University.
Map Portals and Geoarchiving: New Opportunities in Geospatial Information Services Steve Morris Head of Digital Library Initiatives NCSU Libraries GIS.
Identification, Selection, and Appraisal within the North Carolina Geospatial Data Archiving Project (NCGDAP) NCSU Libraries Steve Morris Head of Digital.
NATIONAL STATES GEOGRAPHIC INFORMATION COUNCIL 2105 Laurel Bush Rd. Suite 200 Bel Air, MD GIS Inventory powered by Ramona.
Geospatial standards Beyond FGDC Geog 458: Map Sources and Errors March 3, 2006.
Archiving State and Local Agency Digital Geospatial Data: An Overview of the Problem Area Steven P. Morris Head of Digital Library Initiatives North Carolina.
2006 ESRI International Users ConferenceAugust 8, 2006 Spatial Data Infrastructure and Data Preservation in North Carolina Jefferson F. Essic, Robert Farrell,
North Carolina Geospatial Data Archiving Project (NCGDAP) Project Overview Partnership –University library (NCSU) and state agency (NCCGIA) –$520,000 funding,
NCSU Libraries Ingest Workflow Issues: Metadata North Carolina Geospatial Data Archiving Project Steve Morris North Carolina State University Libraries.
Content and Practice: Background to the NC Geospatial Data Archiving Project Steve Morris NCSU Libraries.
Twenty Years of Spatial Vision, But What Does 1987 Look Like in Your GIS? – Emerging Issues, Hindsight and Insights from the NC Preservation Partnership.
Collection and Preservation of At-Risk Digital Geospatial Data: NDIIPP Project Update on the NC Geospatial Data Archiving Project (NCGDAP) Steven P. Morris.
State Presentation Multi-State Geospatial Partnership Kick-off Meeting Salt Lake City, Utah January 23, 2008.
Copyright © 2008, Open Geospatial Consortium, Inc., All Rights Reserved. NDIIPP Partnership Update: North Carolina and Multi-state Demonstration Projects.
State and Local Agency Digital Geospatial Data Preservation The North Carolina Experience Steve Morris NCSU Libraries Earth Sciences Information Partners.
North Carolina Geospatial Data Archiving Project (NCGDAP) JISC/NDIIPP Joint Digital Preservation Workshop – May 2006 Presented by: Rob Farrell, Steve Morris,
Putting time into the GeoWeb: Data persistence in a web services environment Steve Morris NCSU Libraries July 23, 2008.
Preservation of Digital Geospatial Data: Challenges and Opportunities Steve Morris Head of Digital Library Initaitives North Carolina State University.
The North Carolina Geospatial Data Archiving Project Steven P. Morris North Carolina State University Libraries Maintaining Long-Term Access to Geospatial.
Why Archiving and Preserving GIS Data Is Important Maps tell a compelling story of change over time. They document movement, progress, and change to the.
Are Geodatabases a Suitable Long-Term Archival Format? Jeff Essic, Matt Sumner North Carolina State University Libraries 2009 ESRI International Users.
Collection Building Processes within the North Carolina Geospatial Data Archiving Project (NCGDAP) NCSU Libraries Steve Morris Head of Digital Library.
OGC ® © 2006 Open Geospatial Consortium, Inc.1 Introduction to Archives and Geospatial Issues ( Continued ) Steve Morris Head, Digital Library Initiatives.
Metadata Handling in the North Carolina Geospatial Data Project (NCGDAP) NCSU Libraries Steve Morris Head of Digital Library Initiatives Rob Farrell Geospatial.
GeoMAPP Project Overview and Conclusions Alec Bethune- NC Center for Geographic Information and Analysis Matt Peters- Utah Automated Geographic Reference.
National Digital Information Infrastructure and Preservation Program (NDIIPP) CNI Project Briefing December 5, 2005.
Next Generation Archives: The NC Geospatial Data Archiving Project Jeff Essic Geospatial Data Services Librarian North Carolina State University Libraries.
NCSU Libraries 27 March 2006 Digital Preservation in State Government – Wilmington, NC North Carolina Geospatial Data Archiving Project Workflow, Tools,
Cooperative Project with Library of Congress on Preservation of Digital Geospatial Data Steve Morris Head of Digital Library Initiatives NCSU Libraries.
Preserving State and Local Government Digital Geospatial Data Steve Morris Head of Digital Library Initiatives North Carolina State University Libraries.
Collection and Preservation of At- Risk Digital Geospatial Data: North Carolina Geospatial Data Archiving Project (NDIIPP Partnership) Steve Morris Head.
Long-Term Preservation of At- Risk Digital Geospatial Data: A Cooperative Agreement with Library of Congress Steve Morris NCSU Libraries Zsolt Nagy NC.
GeoMAPP: Using Metadata to Help Preserve Geospatial Content Matt Peters, Utah’s Automated Geographic Reference Center Glen McAninch, Kentucky Department.
Preserved Digital Content: Value to Public Policy Decision Making Now and in the Future NC Geospatial Data Archiving Project (NCGDAP) North Carolina State.
Preservation of Coastal Community Geospatial Content: What's Your Long Term Care Plan For Aging Data? Jeff Essic North Carolina State University Libraries.
North Carolina Geospatial Data Archiving Project : Cooperative Project with Library of Congress on Preservation of Digital Geospatial Data Partners: NCSU.
Collection and Preservation of At- Risk Digital Geospatial Data: the North Carolina NDIIPP Project Partners: NCSU Libraries Project Lead: Steve Morris.
NCPMA Fall MeetingOctober 11, 2006 GIS Data Preservation: Partnership with Library of Congress Steve Morris North Carolina State University Libraries.
NCSU Libraries 9 October 2006 EPA Meeting Preservation Partnership with Library of Congress: NDIIPP and the North Carolina Geospatial Data Archiving Project.
Long-term preservation of digital geospatial data: challenges for ensuring access and encouraging reuse Anne Robertson, EDINA & Steve Morris, NCSU Libraries.
Archiving Geospatial Data: Background to the Problem Area State Government Users Committee October 16, 2008 Steve Morris, NCSU Libraries.
ESRI International Users ConferenceJune 20, 2007 Data Snapshot Archiving: A Frequency of Capture Survey Steve Morris Jeff Essic North Carolina State University.
Preserving Geospatial Data: Challenges and Opportunities Steve Morris NCSU Libraries Indo-US Workshop on Trends in Digital Preservation March 24, 2009.
Geospatial Data Preservation Challenges at the Sub-National Level: The North Carolina Experience Steve Morris Head of Digital Library Initiatives North.
NCSU Libraries 13 June 2006 JCDL 2006 NDIIPP Preservation Network: Progress, Problems, and Promise Jim Tuttle, Geospatial Data Librarian.
NDIIPP Project: North Carolina Geospatial Data Archiving Project Partners: NCSU Libraries Project Lead: Steve Morris NC Center for Geographic Information.
North Carolina Geospatial Data Archiving Project/NDIIPP: Collection and preservation of at- risk digital geospatial data Partners: NCSU Libraries Project.
GISC Seminar: Towards Uncharted GroundSeptember 29, 2006 North Carolina Partnership with Library of Congress on Long-term Preservation of Digital Geospatial.
NDIIPP Project: Collection and Preservation of At-Risk Digital Geospatial Data Partners: NCSU Libraries Project Lead: Steve Morris NC Center for Geographic.
The Disappearing Data Problem Steve Morris Head of Digital Library Initiatives North Carolina State University Libraries.
Models for Shared Responsibility: Collaboration and Engagement with the NCGDAP and GeoMAPP Partnerships Steve Morris North Carolina State Libraries Zsolt.
Mountain Region GIS Advisory Council Meeting September 15, 2006 Long-Term Preservation of Digital Geospatial Data: A Cooperative Project with Library of.
Preservation Strategies in the North Carolina Geospatial Data Archiving Project (NCGDAP) NCSU Libraries Steve Morris Head of Digital Library Initiatives.
North Carolina Geospatial Data Archiving Project/NDIIPP: Collection and preservation of at-risk digital geospatial data Partners: NCSU Libraries NC Center.
The National Digital Stewardship Alliance: Stewardship, Collaboration, Inclusiveness, Exchange.
Overview: GeoMAPP Appraisal Efforts NDSA Geospatial Working Group| 27 June 2012 |
The National Digital Stewardship Alliance: Community, Content, Commitment.
Preservation of State and Local Government Digital Geospatial Data: The North Carolina Geospatial Data Archiving Project Steven P. Morris, James Tuttle,
Preserving Digital Geospatial Data: The NC Geospatial Data Archiving Project (NCGDAP) Steven P. Morris North Carolina State University Libraries CRADLE.
Long-Term Preservation of At-Risk Digital Geospatial Data: The North Carolina Geospatial Data Archiving Project Steve Morris NCSU Libraries.
Update on Geospatial Data Preservation Efforts
Collecting Digital Content Going Forward: Lessons Learned and New Initiatives NC Geospatial Data Archiving Project (NCGDAP) North Carolina State University.
Preserved Digital Content: Collections, Value, and Stewardship NC Geospatial Data Archiving Project (NCGDAP) North Carolina State University Libraries.
CNI Project Briefing December 5, 2005
Presentation transcript:

Preserving Digital Geospatial Data: The NC Geospatial Data Archiving Project (NCGDAP) Steven P. Morris North Carolina State University Libraries CRADLE SeminarNovember 17, 2006

Note: Percentages based on the actual number of respondents to each question 2 NC Geospatial Data Archiving Project Partnership between university library (NCSU) and state agency (NCCGIA) Focus on state and local geospatial data in North Carolina (state demonstration) Tied to NC OneMap initiative, which provides for seamless access to data, metadata, and inventories Objective: engage existing state/federal geospatial data infrastructures in preservation Project approaches: Technical and Social Serve as catalyst for discussion within industry

Note: Percentages based on the actual number of respondents to each question 3 Targeted data: Digital orthophotography 85+ NC counties with orthophotos 1-5 flights per county gb per flight

Note: Percentages based on the actual number of respondents to each question 4 Targeted data: Vector data (w/tabular) Economic, infrastructure, and ethnographic data

Note: Percentages based on the actual number of respondents to each question 5 Today’s geospatial data as tomorrow’s cultural heritage Future uses of data are difficult to anticipate (as with Sanborn Maps).

Note: Percentages based on the actual number of respondents to each question 6 Risks to State/Local Geospatial Data Producer focus on current data Data overwrite as common practice Future support of data formats in question No open, supported format for vector data Shift to web services-based access Data becoming more ephemeral Inadequate or nonexistent metadata Impedes discovery and use Increasing use of spatial databases for data management The whole is greater than the sum of the parts

Note: Percentages based on the actual number of respondents to each question 7 Challenge: Vector Data Formats No widely-supported, open vector formats for geospatial data Spatial Data Transfer Standard (SDTS) not widely supported Geography Markup Language (GML) – diversity of application schemas and profiles threatens permanent access Spatial Databases The sum is more than the whole of the parts, and the sum is very difficult to preserve Can export individual data layers for curation Some thinking of using the spatial database as the primary archival platform

Note: Percentages based on the actual number of respondents to each question 8 Challenge: Cartographic Representation Counterpart to the map is not just the dataset but also models, symbolization, classification, annotation, etc.

Note: Percentages based on the actual number of respondents to each question 9 Challenge: Geospatial Web Services How to capture records from decision- making processes? Possible: Atlas collections from automated image capture Web 2.0 impact: Emerging tiling and caching schemes (archive target?)

Note: Percentages based on the actual number of respondents to each question 10 Different Ways to Approach Preservation Technical solutions: How do we archive acquired content over the long term? Build a data repository: not as an end in itself but as a catalyst for discussion within the data community Develop a repository ingest workflow: create technical points of engagement with the digital preservation community

Note: Percentages based on the actual number of respondents to each question 11 Different Ways to Approach Preservation Cultural/Organizational solutions: How do we make the data more preservable—and more prone to be archived—from point of production? Engage data producer community and spatial data infrastructure through outreach and engagement; influence practice Sell the problem to software vendors and standards development Find overlap with more compelling business problems: disaster preparedness, business continuity, road building, etc. Start a discussion about roles at the local, state, and federal level

Note: Percentages based on the actual number of respondents to each question 12 NCGDAP Technical Approach Receive data as is – variety of distribution methods Migration of some at-risk formats Metadata remediation, normalization, and synchronization Distilling complex objects into repository ingest items (not easy) Using DSpace for demonstration purposes (keeping repository platform at arms length) In the development: use METS record as dormant item “brain” within the repository Some unsustainable activities – for learning experience

Note: Percentages based on the actual number of respondents to each question 13 Building Data Bundles: The Zip Codes Example

Note: Percentages based on the actual number of respondents to each question 14 Where is the Dataset?

Note: Percentages based on the actual number of respondents to each question 15 Here’s One! Files Multi-file dataset Georeferencing Metadata file Symbolization file Additional documentation License Disclaimer More Metadata FGDC Acquisition metadata Transfer metadata Ingest metadata Archive rights Archive processes Collection metadata Series metadata

Note: Percentages based on the actual number of respondents to each question 16 Hub-and-Spoke Metadata Workflow

Note: Percentages based on the actual number of respondents to each question 17 Hub-and-Spoke Metadata Workflow

Note: Percentages based on the actual number of respondents to each question 18 Hub-and-Spoke Metadata Workflow Issues: Ingest process needs access to repository specifics (e.g., what collections exist) Understanding of what the core elements should be is refined as spokes are added Need to consider repository response to SIP or AIP evolution

Note: Percentages based on the actual number of respondents to each question 19 Metadata: Going Beyond a Passive Role Feedback to the NC OneMap Metadata Outreach Program vis-à-vis metadata quality problems encountered in repository ingest Engage standards body (Open Geospatial Consortium -- OGC) in discussions about: content packaging standards for geospatial better practices for time-versioned data persistent identifier schemes contributing archive use cases to GeoDRM Meetings with major software vendor development teams

Note: Percentages based on the actual number of respondents to each question 20 Social Issues: Changing Industry Thinking Is the geospatial industry “temporally-impaired?” Lack of access to older data Lack for tool/model support for temporal analysis Metadata: poor support for changing data Education: building class projects around available data (i.e., not temporal) Increased interest now in temporal applications? Increased demand for temporal data? Improved tool support: ArcGIS 9.2 animation tools; Geodatabase History, etc. IMPORTANT: Gathering business cases for using older data

Note: Percentages based on the actual number of respondents to each question 21 Social Issues: Content Exchange Networks Solving the present-day problems of data sharing is a pre-requisite to solving the problem of long-term access Leveraging more compelling business problems: disaster preparedness and business continuity needs can put the data in motion (siphon off to the archive) Geospatial data: large data volumes, frequent data update, complex datasets, ambiguous rights Content exchange network technical challenges: Rights management Large-scale transfers on network Content packaging (MPEG 21 DIDL, XFDU, METS, …)

Note: Percentages based on the actual number of respondents to each question 22 Content Issues: Frequency of Capture Survey Survey objective: Document current practices for obtaining archival snapshots of county/municipal geospatial vector data layers Seek guidance about frequency of capture Survey topics: General questions about data archiving practice Specific questions about parcels, street centerlines, jurisdictional boundaries, and zoning Survey subjects: All 100 counties and 25 municipalities -- 58% response rate Survey conducted September 2006 Added benefit: Survey socialized the preservation issue

Note: Percentages based on the actual number of respondents to each question 23 NC County/Municipal Agency Frequency of Capture: Parcel Data Based on a percentage of the respondents that indicate they actually archive some data

Note: Percentages based on the actual number of respondents to each question 24 Project Status Cultivating a commercial market for older data. Part of “permanent access” is marketing, advertising, and putting older data into the path of the user Content Issues: What About Commercial Data?

Note: Percentages based on the actual number of respondents to each question 25 Mobile, LBS and, social networking applications drive demand for placed-based data Example sources: Oblique Imagery Street-view Imagery (e.g., A9.com) Transportation Dept. Videologs Long-term cultural heritage value in non-overhead imagery: more descriptive of place and function New Challenges: “Platial” vs. Spatial Imagery Emerging: “Tricorder” applications

Note: Percentages based on the actual number of respondents to each question 26 Emerging online environments are increasingly used to make decisions, how are these decisions documented? Web mashup/AJAX interactions with existing systems spur creation of intermediate content layers: e.g., tiling and caching of WMS services Formulation of a standard tiling scheme may create a new preservation opportunity (temporal axis on caches?) New Challenges: Ajax Applications, Google Earth and All That

Note: Percentages based on the actual number of respondents to each question 27 Web mashup/AJAX interactions with existing systems spur creation of intermediate content layers: e.g., tiling and caching of WMS services Identification of a standard tiling scheme may create a new preservation opportunity (temporal axis on caches?)

Note: Percentages based on the actual number of respondents to each question 28 Working with New Partners State Archives now an informal member of the NCGDAP project Collaboration with NARA Working with the Open Geospatial Consortium on standards issues Associate Partnership with JISC-funded UK-wide project Site visits with ESRI (major software vendor) development groups Participation in a variety of content exchange network activities More …

Note: Percentages based on the actual number of respondents to each question 29 Next Steps Working with NARA and the OGC Interoperability Institute to develop an OGC Data Preservation Working Group charter Evaluating results for the frequency of capture survey Stepping up data acquisition and repository ingest Evaluating initial data acquisition efforts (time factors, content variety, technical/legal barriers) Partnership with content exchange network activities Ramping up partnerships with broader (non- geospatial) data repository efforts

Note: Percentages based on the actual number of respondents to each question 30 Questions? Contact: Steve Morris Head, Digital Library Initiatives NCSU Libraries ph: (919)