NASA Earth Science Data Stewardship

Slides:



Advertisements
Similar presentations
Product Quality and Documentation – Recent Developments H. K. Ramapriyan Assistant Project Manager ESDIS Project, Code 423, NASA GFSC
Advertisements

NASA Earth Science Data Preservation Content Specification H. K. (Rama) Ramapriyan John Moses 10 th ESDSWG Meeting – November 2, 2011 Newport News, VA.
Metrics Planning Group (MPG) Report to Plenary Clyde Brown ESDSWG Nov 3, 2011.
Preserving Cloud Information Bruce R. Barkstrom & John J. Bates NCDC.
Provenance and Context Content Standard (Emerging) – Status of Activities H. K. Ramapriyan Assistant Project Manager ESDIS Project, Code 423, NASA GFSC.
ISO & OAI-PMH By Neal Harmeyer, Amy Hatfield, and Brandon Beatty PURDUE UNIVERSITY RESEARCH REPOSITORY.
PV2013 Summary Results Data Stewardship Interest Group WGISS-37 Meeting Cocoa Beach (Florida-US) - April 14-18, 2014.
NOAA Metadata Update Ted Habermann. NOAA EDMC Documentation Directive This Procedural Directive establishes 1) a metadata content standard (International.
Inter-American Workshop on Environmental Data Access Panel discussion on scientific and technical issues Merilyn Gentry, LBA-ECO Data Coordinator NASA.
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
NASA/ESA Interoperability Efforts CEOS Subgroup - CINTEX Alexandria, Sept 12, 2002 Ananth Rao Yonsook Enloe SGT, Inc.
ESA UNCLASSIFIED – For Official Use Data Stewardship Interest Group WGISS-39 Meeting Data Purge Alert Procedure Tsukuba, Japan – May, 2015 Mirko.
AON Data Questionnaire Results 21 Respondents Last Updated 27 March 2007 First AON PI Meeting Scot Loehrer, Jim Moore.
Updates from EOSDIS -- as they relate to LANCE Kevin Murphy LANCE UWG, 23rd September
Emerging Provenance/Context Content Standard Discussion at Data Stewardship Committee Session at ESIP Federation Meeting January 5, 2012 H. K. “Rama” Ramapriyan.
Elements of a Data Management Plan Bill Michener University Libraries University of New Mexico Data Management Practices for.
Why do I want to know about HDF and HDF- EOS? Hierarchical Data Format for the Earth Observing System (HDF-EOS) is NASA's primary format for standard data.
EARTH SCIENCE MARKUP LANGUAGE Why do you need it? How can it help you? INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
Planetary Science Archive PSA User Group Meeting #1 PSA UG #1  July 2 - 3, 2013  ESAC PSA Archiving Standards.
Archival Information Packages for NASA HDF-EOS Data R. Duerr, Kent Yang, Azhar Sikander.
Creating Archive Information Packages for Data Sets: Early Experiments with Digital Library Standards Ruth Duerr, NSIDC MiQun Yang, THG Azhar Sikander,
AMSR-E SIPS Processing Status Presented by Helen Conover Information Technology and Systems Center at the University of Alabama in Huntsville AMSR-E Joint.
Preservation Strategies: Emerging standards for preservation Ronald Weaver National Snow and Ice Data Center Version 1.0 Review Date.
Preservation Strategies: Framing The Approach Nancy Hoebelheinrich Knowledge Motifs LLC Data Management Workshop American Geophysical.
Archival Workshop on Ingest, Identification, and Certification Standards Certification (Best Practices) Checklist Does the archive have a written plan.
ESDIS Project Status 11/29/2006 Dan Marinelli, Science Systems Development Office.
NASA Earth Science Data and Information System (ESDIS) Project Data Preservation Activities – Update Andrew Mitchell (NASA Goddard Space Flight Center)
NASA Earth Science Data and Information System (ESDIS) Project Preservation Activities – Software & Documentation H. K. “Rama” Ramapriyan Science Systems.
3/30/04 16:14 1 Lessons Learned CERES Data Management Presented to GIST 21 “If the 3 laws of climate are calibrate, calibrate, calibrate, then the 3 laws.
NASA Perspectives on Data Quality July Overall Goal To answer the common user question, “Which product is better for me?”
Thoughts on Stewardship, Archive, and Access to the National Multi- Model Ensemble (NMME) Prediction System Data Sets John Bates, Chief Remote Sensing.
1 U.S. Department of the Interior U.S. Geological Survey LP DAAC Stacie Doman Bennett, LP DAAC Scientist Dave Meyer, LP DAAC Project Scientist.
Science Data in the Science Mission Directorate (SMD) Jeffrey J.E. Hayes Program Executive for MO & DA, Heliophysics Division August 17, 2011.
1 NSIDC DAAC Product Workshop Overview Martha Maiden Program Executive for Data Systems NASA Headquarters NSIDC DAAC Product Workshop January 11-12, 2006.
ESA UNCLASSIFIED – For Official Use Data Stewardship Interest Group WGISS-40 Meeting Preservation of SW & Documents at CEOS Agencies Approaches and Lessons.
Legacy Data. From the 2011 EOL Science Review Team Report “The EOL effort to rescue and digitally archive historical field data that are presently stored.
KEY PERSONNEL Dr. Bob Schutz, GLAS Science Team Leader Dr. Jay Zwally, ICESat Project Scientist, GLAS Team Member Mr. David Hancock, Science Software Development.
ECS Metadata Considerations for Preservation SiriJodha S. Khalsa National Snow and Ice Data Center.
EO Dataset Preservation Workflow Data Stewardship Interest Group WGISS-37 Meeting Cocoa Beach (Florida-US) - April 14-18, 2014.
HDF-EOS Workshop IV September 19-21, 2000 Richard E. Ullman ESDIS Information Architect NASA/ GSFC, Code 423.
Ed Kearns National Climatic Data Center Asheville, NC.
SEDAC Long-Term Archive Development Robert R. Downs Socioeconomic Data and Applications Center Center for International Earth Science Information Network.
1 U.S. Department of the Interior U.S. Geological Survey LP DAAC Stacie Doman Bennett, LP DAAC Scientist.
SOFTWARE ARCHIVE WORKING GROUP (SAWG) REPORT TODD KING PDS MANAGEMENT COUNCIL MEETING FEB. 4-5, 2016.
ESO and the CMR Life Cycle Process Winter ESIP, Jan 2015 ESDIS Standards Office (ESO) Yonsook Enloe Allan Doyle Helen Conover.
CLASS Metadata and Remote Sensing Extensions CLASS Data Provider’s Conference September 2005 Anna Milan, Ted.Habermann,
1 Current Plans for Long Term Archiving of MODIS Data Martha Maiden Program Executive Earth Science Data Systems NASA Headquarters MODIS Meeting November.
1 Digital Object Identifiers Update ESIP Data Stewardship Committee Meeting May 16, 2016 Presenters: Nate James, ESDIS Lalit Wanchoo, ADNET Systems Inc.
Instrument Landing Pages Challenges and Proposals.
Marianne König, Tim Hewison, Peter Miu
NASA HDF and HDF-EOS Status Use in EOSDIS
Robert R. Downs1and Robert S. Chen2
Synthetic Data and Data Formats for the GPM GMI Radiometer
Ensuring and Improving Information Quality for Earth Science Data and Products – Role of the ESIP Information Quality Cluster H. K. (Rama) Ramapriyan,
NSIDC DAAC Accessioning and “De-commissioning” Plans
Persistent Identifiers Implementation in EOSDIS
EOSDIS Data Preservation Archive (EDPA)
NASA’s EOSDIS – Long Term Archive Infrastructure and Processes
Active Data Management in Space 20m DG
WGISS-WGCV Joint Session
Data Management: Documentation & Metadata
Data Stewardship Interest Group WGISS-45 Meeting
From Observational Data to Information (OD2I IG )
Presented to the CEOS WGISS October 22, 2018
Technical aspects of the GIRO work: inputs from GRWG
A Case Study for Synergistically Implementing the Management of Open Data Robert R. Downs NASA Socioeconomic Data and Applications.
Research data lifecycle²
Australian and New Zealand Metadata Working Group
Presented to the CEOS WGISS October 10, 2019
CEOS WGISS Carbon Data Portal: Progress and Demo CEOS WGISS Carbon Portal Team Reported at WGISS’48 Vietnam Academy of Science and Technology, Hanoi,
Presentation transcript:

NASA Earth Science Data Stewardship Dawn R. Lowe, Project Manager Earth Science Data & Information Systems (ESDIS) Project NASA/Goddard Space Flight Center

Data Stewardship Objective: Ensure that our Earth science data and information content are reliable, of high quality, easily accessible, and usable for as long as they are considered to be of value. Involves communications with data producing projects throughout their lifecycles Essential to plan for long term preservation

Project Lifecycle Events

General requirements: Data Preservation General requirements: Preserve bits Ensure: Discoverability and accessibility Readability Understandability Usability Reproducibility of results

Bits Checksums while transferring between subsystems Regular media migration Raw (Level 0) data from satellites held at back-up archive physically distant from DAACs Product generation software held at the DAACs and SIPSs Guards against catastrophic loss Raw data and higher level products are backed-up at the DAACs as well – for efficiency Periodic reassessment of risk of data loss and impact on users given the back-up approach being used at the DAACs

Discoverability and accessibility Standard metadata are critical for discoverability of data Processing software automatically generates metadata at individual file level Metadata repository is constantly populated Common Metadata Repository (CMR) Unites collection level and file level metadata Provides a source of unified, high-quality and reliable Earth Science metadata across NASA’s Earth science data holdings Improvements to Accessibility Migration from the near-line robotic tape libraries into on-line disk archives during 2005-2008. Assignment of Digital Object Identifiers (in progress)

Readability Hierarchical Data Format (HDF) Primary format for most EOSDIS datasets Translations into other formats such as NetCDF, GeoTIFF and binary, upon request by users HDF is a self-documenting formatting system Flexible structure for data producer to define “profile” HDF library facilitates writing and reading Need to maintain library for future users Alternative - HDF Archive Mapper Promoting structural consistency among datasets Cross-instrument team agreement (EOS Aura – 2004) HDF Product Designer to enable design of interoperable products compliant with community conventions, and to share designs across teams

Understandability, Usability & Reproducibility Algorithm Theoretical Basis Documents (ATBDs) Product information pages, guides, answers to frequently asked questions (FAQs), forums Usability Information on fitness for purpose Accuracy assessments, validation and data quality documentation Reproducibility Source code and/or software specification documents Versions of datasets or the means of regenerating them when they result in peer-reviewed publications

Preservation Content Specification (PCS) Has been in effect since November 2011; latest version dated January 2013 Covers eight categories of content plus a checklist (see next page) Rigor of application varies among completed, on-going and future missions Completed missions – requirements had not been in place; some items may no longer be available for preservation; responsible individuals may not be accessible On-going missions - requirements had not been in place; some of the relevant data and documentation generated early in mission may not be easily available; need additional work to reach responsible individuals Future missions – requirements are in place; included as part of mission planning

Preservation Content Categories Preflight/Pre-Operations: Instrument/Sensor characteristics including pre-flight/pre-operations performance measurements; calibration method; radiometric and spectral response; noise characteristics; detector offsets Science Data Products: Raw instrument data, Level 0 through Level 4 data products and associated metadata Science Data Product Documentation: Structure and format with definitions of all parameters and metadata fields; algorithm theoretical basis; processing history and product version history; quality assessment information Mission Data Calibration: Instrument/sensor calibration method (in operation) and data; calibration software used to generate lookup tables; instrument and platform events and maneuvers Science Data Product Software: Product generation software and software documentation Science Data Product Algorithm Input: Any ancillary data or other data sets used in generation or calibration of the data or derived product; ancillary data description and documentation Science Data Product Validation: Records, publications and data sets Science Data Software Tools: product access (reader) tools. Checklist: “metadata” about the above 8 categories showing how and where items in each category are preserved

Sources of Preservation Content Calibration Team Mission logs Science Data Product Documentation Mission Data Calibration Mission Operations Product Generation Support Teams (SIPSs) Instrument Teams / PI’s Science Data Software Tools Science Data Product Software Science Data Products Level 0 Data Science Data Product Algorithm Input Ancillary data sources (e.g., NOAA) Different entities hold preservation content during the life of a project, but they need to be gathered for long-term preservation. Some items are part of regular flow during active operational phase. Others need extra effort to collect. DAACs Science Data Product Validation Preflight/ Pre-Operations Instrument Developer/ Manufacturer Data gathering project (e.g., flight project) Validation Team

Examples for Scale of Effort Thousands of items had to be reviewed for deciding what had to be preserved Category Number of Items (HIRDLS) Number of Items (GLAS/ ICESat) Preflight/Pre-Operations Calibration 168 23 Product Documentation 18 34 Mission Calibration 10 12 Science Data Product Software 26* 5 Science Data Product Algorithm Inputs 1 56 Science Data Product Validation 1** 3 Science Data Software Tools 20 Total 225 153 *Includes source code and documentation **List of published papers

Standard for Preservation Content NASA would like to see a broad international standard identifying preservation content – NASA’s PCS is a good starting point, as are ESA’s Long Term Data Preservation documents ISO Technical Committee on Geographic Information/ Geomatics (TC 211) is working on a standard (ISO 19165)

Attention to preservation is needed throughout lifecycle Summary NASA has been collecting Earth observation data from many sources for over 50 years Data and derived scientific products are a valuable asset requiring stewardship and preservation Attention to preservation is needed throughout lifecycle Waiting for closeout phase of projects is too late Preservation Content Specification helps with planning ahead We would like to see an international standard on preservation content – looking for collaboration