Implementing the Data Management Principles Opportunities and Advantages Robert R. Downs, PhD Sr. Digital Archivist, CIESIN, Columbia University.


1 Implementing the Data Management Principles: Opportunities and Advantages
Robert R. Downs, PhD, Sr. Digital Archivist, CIESIN, Columbia University
7 November 2016, GEO XIII Plenary Data Providers Side Event, St. Petersburg, Russia

2 GEO Data Management Principles
Discovery
- DMP-1: Metadata for Discovery
Accessibility
- DMP-2: Online Access
Usability
- DMP-3: Data Encoding
- DMP-4: Data Documentation
- DMP-5: Data Traceability
- DMP-6: Data Quality-Control
Preservation
- DMP-7: Data Preservation
- DMP-8: Data and Metadata Verification
Curation
- DMP-9: Data Review and Reprocessing
- DMP-10: Persistent and Resolvable Identifiers
Derived from: Group on Earth Observations Data Management Principles Implementation Guidelines.

3 Discovery Implementing DMP-1: Metadata for Discovery
Data and all associated metadata will be discoverable through catalogues and search engines, and data access and use conditions, including licenses, will be clearly indicated.
- Descriptive metadata enables discoverability and exploration
  - Comply with metadata standards of user community
- Metadata in catalogs and harvesters promotes data holdings
  - Enable harvesting via standard protocols (OAI-PMH, etc.)
  - Enable indexing by search engines and aggregators
- Providing a recommended data citation encourages citations
  - Include a persistent identifier to the location of the data
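
As an illustration of harvesting via OAI-PMH, the sketch below builds a ListRecords request URL of the kind a harvester would send to a repository. The base URL and the function name are hypothetical; only the protocol arguments (verb, metadataPrefix, from, set) come from the OAI-PMH specification. No request is sent.

```python
from urllib.parse import urlencode

# Hypothetical repository endpoint; substitute a real OAI-PMH base URL.
BASE_URL = "https://repository.example.org/oai"


def build_list_records_url(base_url, metadata_prefix="oai_dc", from_date=None, set_spec=None):
    """Build an OAI-PMH ListRecords request URL (illustrative helper)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if from_date:
        params["from"] = from_date  # selective harvesting: records changed since this date
    if set_spec:
        params["set"] = set_spec    # selective harvesting: restrict to one set
    return f"{base_url}?{urlencode(params)}"


url = build_list_records_url(BASE_URL, from_date="2016-01-01")
print(url)
```

A harvester would fetch this URL and follow any resumptionToken in the response to page through the remaining records.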

4 Accessibility Implementing DMP-2: Online Access
Data will be accessible via online services, including, at a minimum, direct download but preferably user-customizable services for access, visualization and analysis.
- Metadata with URL to data affords immediate access
  - Persistent identifier linking to data landing page
- Data in standard formats & interfaces fosters easy access
  - HTML, OGC WxS, NetCDF, HDF5, OPeNDAP, CSV, etc.

5 Usability Implementing DMP-3: Data Encoding
Data should be structured using encodings that are widely accepted in the target user community and aligned with organizational needs and observing methods, with preference given to non-proprietary international standards.
- Adopt standardized encodings to meet user expectations
  - Validate that encodings work with community-adopted tools
- Facilitate schematic and syntactic interoperability
  - Enable use with mature services, such as OGC WMS, OPeNDAP, and NetCDF

6 Usability Implementing DMP-4: Data Documentation
Data will be comprehensively documented, including all elements necessary to access, use, understand, and process, preferably via formal structured metadata, based on international or community-approved standards. To the extent possible, data will also be described in peer-reviewed publications referenced in the metadata record.
- Adopt metadata standards & community conventions
  - ISO :2014 Metadata, Dublin Core, Darwin Core, etc.
- Correctly populate all required elements to enable new uses
  - Work with data producers to identify and verify values, including links to other resources
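
One way to check that required elements are populated is sketched below, using Dublin Core as the example standard. The list of required elements is an assumed local policy (Dublin Core itself makes every element optional), and the function names and sample values are hypothetical.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"  # Dublin Core element set namespace

# Assumed local policy, not part of Dublin Core: elements every record must fill in.
REQUIRED = ["title", "creator", "date", "identifier", "rights"]


def build_dc_record(fields):
    """Build a simple XML record with one Dublin Core element per field."""
    ET.register_namespace("dc", DC_NS)
    root = ET.Element("metadata")
    for name, value in fields.items():
        child = ET.SubElement(root, f"{{{DC_NS}}}{name}")
        child.text = value
    return root


def missing_elements(record, required=REQUIRED):
    """Return the required element names that are absent or empty."""
    missing = []
    for name in required:
        element = record.find(f"{{{DC_NS}}}{name}")
        if element is None or not (element.text or "").strip():
            missing.append(name)
    return missing


record = build_dc_record({
    "title": "Example Gridded Dataset",   # made-up sample values
    "creator": "Example Data Center",
    "date": "2016-11-07",
    "identifier": "",                      # present but not yet populated
})
print(missing_elements(record))
```

The list of missing elements gives the archive a concrete set of values to verify with the data producer before release.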

7 Usability Implementing DMP-5: Data Traceability
Data will include provenance metadata indicating the origin and processing history of raw observations and derived products, to ensure full traceability of the product chain.
- Capture provenance throughout data lifecycle so users can understand potential uses of the data
  - Document events during data collection, processing, distribution
- Describe provenance within metadata for interoperability
  - Employ discovery and provenance metadata schemas, such as PROV (W3C) and PREMIS (US LoC), etc.
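
To make traceability concrete, here is a minimal sketch that records processing steps and walks back from a derived product to its raw observations. The field names loosely borrow PROV vocabulary (activity, used, generated, agent), but this is not a PROV serialization; the class, function, and identifiers are all hypothetical, and the sketch assumes each product is generated by at most one step.

```python
from dataclasses import dataclass


@dataclass
class ProcessingStep:
    activity: str    # what was done, e.g. "calibrate"
    used: list       # identifiers of the input entities
    generated: str   # identifier of the output entity
    agent: str       # who or what performed the step


def trace_lineage(steps, product_id):
    """Return (activities from earliest to latest, raw input identifiers)."""
    by_output = {step.generated: step for step in steps}
    chain, raw_inputs, seen = [], [], set()
    pending = [product_id]
    while pending:
        entity = pending.pop()
        if entity in seen:          # guard against revisiting shared inputs
            continue
        seen.add(entity)
        step = by_output.get(entity)
        if step is None:
            raw_inputs.append(entity)   # nothing generated it: an original observation
        else:
            chain.append(step.activity)
            pending.extend(step.used)
    return list(reversed(chain)), raw_inputs


steps = [
    ProcessingStep("calibrate", ["raw-obs-001"], "calibrated-001", "ingest-pipeline"),
    ProcessingStep("regrid", ["calibrated-001"], "gridded-001", "analyst"),
]
print(trace_lineage(steps, "gridded-001"))
```

In practice the same events would be expressed in a community schema such as PROV or PREMIS so that other systems can interpret them.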

8 Usability Implementing DMP-6: Data Quality-Control
Data will be quality-controlled and the results of quality control shall be indicated in metadata; data made available in advance of quality control will be flagged in metadata as unchecked.
- Leveraging expertise of users to conduct reviews
  - User community, potential users, automated parsers
- Document reviews to improve data transparency
  - Review criteria, values, value definitions, date & time
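
The flagging rule in this principle can be sketched as a small function that writes quality-control results into a metadata record and marks data released before any check as unchecked. The field names (qc_status, qc_results) and the check names are hypothetical, not drawn from any particular metadata standard.

```python
def apply_qc_status(metadata, checks=None):
    """Return a copy of the metadata record with QC results recorded.

    `checks` maps a check name to True (passed) or False (failed).
    With no checks, the record is flagged as unchecked.
    """
    record = dict(metadata)
    if not checks:
        record["qc_status"] = "unchecked"
        record["qc_results"] = []
        return record
    record["qc_results"] = [
        {"check": name, "passed": passed} for name, passed in checks.items()
    ]
    record["qc_status"] = "passed" if all(checks.values()) else "failed"
    return record


early_release = apply_qc_status({"title": "Example Dataset"})
reviewed = apply_qc_status(
    {"title": "Example Dataset"},
    {"range_check": True, "gap_check": False},
)
print(early_release["qc_status"], reviewed["qc_status"])
```

A fuller record would also carry the review criteria, value definitions, and the date and time of each check, as the slide suggests.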

9 Preservation Implementing DMP-7: Data Preservation
Data will be protected from loss and preserved for future use; preservation planning will be for the long term and include guidelines for loss prevention, retention schedules, and disposal or transfer procedures.
- Plan and prepare for preservation to identify needs early
  - Package data to facilitate future uses and users
- Manage data and identify dependencies for future use
  - Determine software needs and migration paths
- Submit data to a sustainable trustworthy repository
  - Certified with Data Seal of Approval / World Data System, NESTOR, or ISO 16363, etc.

10 Preservation Implementing DMP-8: Data and Metadata Verification
Data and associated metadata held in data management systems will be periodically verified to ensure integrity, authenticity and readability.
- Verify readability of data and metadata over time to ensure continuity of compatibility
  - Ensure that users can read data and metadata
- Verify integrity periodically and across platforms
  - Inspect the integrity of files, including software, transformations
- Document transformations of data and environment
  - Describe revisions to content and media
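
Periodic integrity verification is commonly done by comparing file checksums against a stored manifest. The following is a minimal sketch under that assumption; the manifest layout (relative path mapped to a SHA-256 hex digest) and the function names are illustrative choices, not something the guidelines prescribe.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=65536):
    """Compute the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_manifest(manifest, root):
    """Compare files under `root` with their expected digests.

    Returns a dict mapping each relative path to "ok", "missing", or "changed".
    """
    report = {}
    for relative_path, expected in manifest.items():
        path = Path(root) / relative_path
        if not path.is_file():
            report[relative_path] = "missing"
        elif sha256_of(path) != expected:
            report[relative_path] = "changed"
        else:
            report[relative_path] = "ok"
    return report
```

Running verify_manifest on a schedule, and after every migration between platforms, yields a report that can itself be kept as documentation of each verification.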

11 Curation Implementing DMP-9: Data Review and Reprocessing
Data will be managed to perform corrections and updates in accordance with reviews, and to enable reprocessing as appropriate; where applicable this shall follow established and agreed procedures.
- Correct and update data to improve quality, based on reviews
  - Document all revisions to facilitate use
- Reprocess data to improve usability for diverse uses
  - Apply new knowledge, techniques, or algorithms

12 Curation Implementing DMP-10: Persistent and Resolvable Identifiers
Data will be assigned appropriate persistent, unique and resolvable identifiers to enable documents to cite the data on which they are based and to enable data providers to receive acknowledgement for use of their data.
- Assign persistent identifiers to enable data citation and use across platforms and service upgrades
  - Include unique identifiers in recommended citations
- Maintain persistent identifiers over time to avoid link rot
  - Verify continuous resolution to current data locations
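
As a small example of including an identifier in a recommended citation, the sketch below assumes the persistent identifier is a DOI, normalizes it to a resolvable https://doi.org/ URL, and places it in a citation string. The citation layout, the function names, and the sample DOI are hypothetical; no resolution request is made.

```python
from urllib.parse import quote

DOI_RESOLVER = "https://doi.org/"
KNOWN_PREFIXES = (
    "https://doi.org/", "http://doi.org/",
    "https://dx.doi.org/", "http://dx.doi.org/", "doi:",
)


def doi_to_url(doi):
    """Normalize a DOI written in any common form to a resolver URL."""
    doi = doi.strip()
    for prefix in KNOWN_PREFIXES:
        if doi.lower().startswith(prefix):
            doi = doi[len(prefix):]
            break
    if not doi.startswith("10."):
        raise ValueError(f"not a DOI: {doi!r}")
    return DOI_RESOLVER + quote(doi, safe="/")


def recommended_citation(creator, year, title, publisher, doi):
    """Format a citation in an assumed, illustrative layout."""
    return f"{creator}. {year}. {title}. {publisher}. {doi_to_url(doi)}"


# Made-up DOI and values, for illustration only.
print(recommended_citation(
    "Example Data Center", 2016, "Example Gridded Dataset",
    "Example Archive", "doi:10.1234/EXAMPLE-DATA",
))
```

Verifying continuous resolution would additionally mean requesting each URL on a schedule and confirming that it still reaches the current landing page.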

13 GEOSS Standards and Interoperability Forum (SIF) Workshop Outputs – Oct 20, 2016
- Introduction and GEOSS Overview: Steven F. Browdy
- Interoperability and Data Access Overview: Tom Kralidis, Paul Eglitis, Siri Jodha S. Khalsa, Arne J. Berre
- Data Management Principle 1 (Metadata for Discovery): Siri Jodha S. Khalsa
- Data Management Principle 5 (Traceability): Robert R. Downs
- Data Management Principle 6 (Quality Control): Joan Masó
- The Future of GEOSS and the GCI: Ken McDonald, Steven F. Browdy, Lucia Lovison
Agenda, Slides, and Recordings

14 Thank you!

