NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, 2014 1 Climate Archives in NOAA: Challenges and Opportunities March 4, 2014.

Slides:



Advertisements
Similar presentations
NOAAs Comprehensive Large-data Array Stewardship System (CLASS) Robert Rank NOAA CLASS Program Chris Elvidge – NOAA-NGDC January 22, 2006.
Advertisements

Data and Information Framework: Principles Sue Barrell Bureau of Meteorology, Australia CBS-Ext.(14), Asuncion, September 2014.
3rd GOES User’s Conference Report on the NESDIS Data User’s Conference Ken Knapp GOES User’s Conference May 2004.
The Integrated Surface Hourly (ISH) Global Database ESDIM/OGP funded, FCC effort 2 QC phases thus far Full POR online via FTP, partial POR via CDO ISH.
Symposium on Digital Curation in the Era of Big Data: Career Opportunities and Educational Requirements Workforce Demand and Career Opportunities From.
Challenges for NOAA in Integrating Earth Observations Vice Admiral Conrad C. Lautenbacher, Jr., U.S. Navy (Ret.) Under Secretary of Commerce for Oceans.
USGS Core Data Stream Report Session 7: Baseline Global Observation Scenario SDCG-7 Sydney, Australia March 4 th – 6 th 2015.
Report from the GCOS Archive/Analysis Center Matthew Menne NOAA/National Centers for Environmental Information Center for Weather and Climate (NCEI-Asheville)
1 Developing an Upper-Air Climate Monitoring Capability Richard D. Rosen, Ph.D. Assistant Administrator NOAA/Oceanic and Atmospheric Research.
NOAA’s National Climatic Data Center 1 NOAA’s National Climatic Data Center – Resources Regarding Climate and Weather Extremes J. Neal Lott, Richard R.
Topics Problem Statement Define the problem Significance in context of the course Key Concepts Cloud Computing Spatial Cloud Computing Major Contributions.
Observing Climate Variability and Change Thomas R. Karl National Oceanic and Atmospheric Administration National Environmental Satellite Data and Information.
Global Climate Change Monitoring Ron Birk Director, Mission Integration, Northrop Grumman Member, Alliance for Earth Observations Responding to Emerging.
NOAA CREST Technical Meeting – May 7, 2015 Science, Service, and You Dr. Stephen Volz Assistant Administrator for NOAA Satellites and Information Service.
NOAA Metadata Update Ted Habermann. NOAA EDMC Documentation Directive This Procedural Directive establishes 1) a metadata content standard (International.
Data Interoperability and Access Activities Prepared for the Data Archiving and Access Requirements Working Group (DAARWG) Ken McDonald, TPIO/GEO-IDE Jeff.
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
The Case for Data Stewardship: Preserving the Scientific Record Matthew Mayernik National Center for Atmospheric Research Version 2.0 [Review Date]
The Fundamentals of Preserving Knowledge Assets Pacific Neighborhood Consortium 2010 Catherine Quinlan, Dean of the USC Libraries USC's Dual Approach.
Research Data at NCAR 1 August, 2002 Steven Worley Scientific Computing Division Data Support Section.
, Increasing Discoverability and Accessibility of NASA Atmospheric Science Data Center (ASDC) Data Products with GIS Technology ASDC Introduction The Atmospheric.
, Implementing GIS for Expanded Data Accessibility and Discoverability ASDC Introduction The Atmospheric Science Data Center (ASDC) at NASA Langley Research.
Preserving the Scientific Record: Case Study 1 – National Snow & Ice Data Center (NSIDC) Glacier Photos Matthew Mayernik National Center for Atmospheric.
ATMOSPHERIC SCIENCE DATA CENTER ‘Best’ Practices for Aggregating Subset Results from Archived Datasets Walter E. Baskin 1, Jennifer Perez 2 (1) Science.
The Merton Report an AIMES/IGBP-ESA partnership As Earth System science advances and matures, it must be supported by robust and integrated observation.
1 Winter 2009 ESIP Federation Meeting NOAA Data Users Workshop Data Stewardship at the NOAA Data Centers Data Stewardship at the NOAA Data Centers.
M u l t I b e a m III W o r k s h o p M u l t I b e a m III W o r k s h o p National Geophysical Data Center / World Data Centers NOAA Slide 1 End-to-End.
1 NOAA Use of the Open Archival Information System Reference Model (OAIS-RM) Ken McDonald NOAA NESDIS ESIP Federation Meeting July 9, 2009.
June 20-22, nomads.ncdc.noaa.gov Being developed and integrated to provide one-stop.
High Data Volume Transfer Issues at NOAA Christopher D. Elvidge Earth Observation Group National Oceanic and Atmospheric Administration National Geophysical.
Operational Issues from NCDC Perspective Steve Del Greco, Brian Nelson, Dongsoo Kim NOAA/NESDIS/NCDC Dongjun Seo – NOAA/NWS/OHD 1 st Q2 Workshop Archive,
Experts in numerical algorithms and High Performance Computing services Challenges of the exponential increase in data Andrew Jones March 2010 SOS14.
1 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION NCEI-IOOS Project Updates Mathew Biddle May 28th, 2015 IOOS DMAC Meeting, IOOS.
Center for Satellite Applications and Research (STAR) Review 09 – 11 March 2010 Image: MODIS Land Group, NASA GSFC March 2000 The Influences of Changes.
NOAA Report WGISS 19 Climate and Meteorology Status Glenn K. Rutledge NOAA Cordoba, Argentina March 7,2005.
Review of Meteorological Data Sharing Project in China Anyuan XIONG (National Meteorological Information Center, CMA, CHINA)
WGISS and GEO Activities Kathy Fontaine NASA March 13, 2007 eGY Boulder, CO.
MADIS Airlines for America Briefing Meteorological Assimilated Data Ingest System (MADIS) FPAW Briefing Steve Pritchett NWS Aircraft Based Observations.
The Unified Access Framework for Gridded Data … the 1 st year focus of NOAA’s Global Earth Observation Integrated Data Environment (GEO-IDE) Steve Hankin,
ORNL DAAC: Introduction Bob Cook ORNL DAAC Environmental Sciences Division Oak Ridge National Laboratory.
NOAA’s National Climatic Data Center Climate Service Partnership Activities At NOAA’s National Climatic Data Center Tim Owen Climate Prediction Applications.
Space Observations Ocean Observations Land Surface Observations Atmospheric Observations Environmental Data at NOAA.
A Proposed Short Course on Data Stewardship Scott Hausman Deputy Director NOAA’s National Climatic Data Center Preparing Scientists to Steward Their Data.
DISCUSSION DRAFT ONLY Data Management METRICS for NNDC and CLASS David Hermreck.
GES DISC Experience with HDF Formats for MeaSUREs Projects by James Johnson NASA/GES DISC (Wyle Inc.) April 17, 2012.
Vision of an Integrated Global Observing System Gregory W. Withee Assistant Administrator for Satellite and Information Services National Oceanic and Atmospheric.
Trials and Tribulations of a Small Archive Presented at the THIC Conference, NCAR, Boulder CO June 30, 2004 Presented at the THIC Meeting at the National.
The Research Data Archive at NCAR: A System Designed to Handle Diverse Datasets Bob Dattore and Steven Worley National Center for Atmospheric Research.
Next Generation Data Access at NCDC Achieving world class data access.
Ed Kearns National Climatic Data Center Asheville, NC.
Smart Grid Big Data: Automating Analysis of Distribution Systems Steve Pascoe Manager Business Development E&O - NISC.
1. 2 NOAA’s Mission To describe and predict changes in the Earth’s environment. To conserve and manage the Nation’s coastal and marine resources to ensure.
NOAA Science Advisory Board’s Data Archiving and Access Requirements Working Group (DAARWG) May 24-25, 2007 Chicago, IL Tom Karl & Larry Tyminski Co-Chairs:
CEOS Working Group on Information System and Services (WGISS) Data Access Infrastructure and Interoperability Standards Andrew Mitchell - NASA Goddard.
BIG Geospatial Data. WHAT IS SPATIAL BIG DATA?  Defined in part by the context, use-case  Data too big, complex for traditional desktop GIS  Often.
Illustrating NOAA’s Geospatial Role in Resilient Coastal Zones Joseph Klimavicz, NOAA CIO and Director of High Performance Computing and Communications.
88 th Annual American Meteorological Society Meeting New Orleans, LA January 20-25, The Integrated Surface Database: Partnerships and Progress Neal.
Center for Satellite Applications and Research (STAR) Review 09 – 11 March 2010 Image: MODIS Land Group, NASA GSFC March 2000 STAR Enterprise Synthesis.
April 7, 2016 NOAA Satellite and Information Service | National Centers for Environmental Information Mike Tanner Director, Center for Weather and Climate.
OneStop Project Update for WGISS
In-situ Data and obs4MIPs
NCEI’s Long Term Archive: Infrastructure, Processes, Volume and Trend
Copernicus Programme European Commission CEOS Plenary 2017
Observing Climate Variability and Change
Prepared by: Jennifer Saleem Arrigo, Program Manager
WGISS Connected Data Assets Oct 24, 2018 Yonsook Enloe
CMIP6 use case and adoption of RDA outputs
School of Information Studies, Syracuse University, Syracuse, NY, USA
NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION
SEO Report to WGISS-48 Brian Killough
Presentation transcript:

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, Climate Archives in NOAA: Challenges and Opportunities March 4, 2014 Thomas R. Karl, L.H.D. Director, NOAA’s National Climatic Data Center Chair, U.S. Subcommittee on Global Change Research NOAA’s National Climatic Data Center

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, Challenges and Opportunities Reprocessing Radar Data: a classic big data challenge Computing the Global Temperature: quality analysis and robust science on big data Climate Normals: building useful products with big data Climate Data Records: ensuring long-term consistency and continuity – Merging data: satellite, in-situ, radar Cloud computing; international challenges Naming conventions Discovery, metadata, and access

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, NCDC’s Mission NCDC is responsible for preserving, monitoring, assessing, and providing public access to the Nation’s treasure of climate and historical weather data and information. To Steward the Nation’s Climate Information

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, Data Received from Many Sources

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, Archive and Access Amazon Kindle Fire Total Archive Volume: 13.4 Petabytes Users download ~5 PB per year = 5 Billion Kindle Fire e-Books/year = 2,300 Kindle Fire e-Readers/day Users download ~5 PB per year = 5 Billion Kindle Fire e-Books/year = 2,300 Kindle Fire e-Readers/day NCDC provides safe storage of 13.4 PB = 13.4 Billion Kindle Fire e-Books = 2.2 Million Kindle Fire e-Readers NCDC provides safe storage of 13.4 PB = 13.4 Billion Kindle Fire e-Books = 2.2 Million Kindle Fire e-Readers *Huge increase in outgoing NCDC satellite data from Boulder Total Data Delivered * Note: all data types are vital for climate science, e.g. Paleoclimate data

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, By 2019, NOAA will hold over 40 Petabytes of environmental data. Future Projections Cumulative Satellite Model Ocean Geophysical Radar

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, Reprocessing Radar Data Higher spatial resolution, reduction of artifacts and problems introduced in operational processing Much higher volume of data. Limited by I/O versus computational speed. Thunderstorm from 5/24/2008 over Northwest Kansas OriginalReprocessed

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, Computing the Global Temperature The challenge of robust science on big data, e.g., – Quality control – Integrating multiple data sources – Homogeneity adjustments – Spatial interpolation – Operational monitoring requirements – Configuration management and version control

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4,

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4,

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4,

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, Climate Data Records Reliable records of Earth’s weather and climate Merge data from surface, atmosphere, and space-based systems across decades Reveal Earth’s short- and longer-term environmental changes and variations Uncorrected versus corrected reflected sunlight satellite measurements

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, Cloud not appropriate for all activities – Need cost effective solution – Cost is dependent on utilization and capacity – Need adequate performance Need a mix of private and public cloud services Migration requires significant changes in systems and organization Significant security questions remain Cloud Computing

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, Some countries are charging for data or restricting the flow – But more data yields better model output & outlook products that benefit all countries Analog climate data worldwide – Needs rescue and digitization before they’re lost Global climate observing systems – If systems are allowed to continue to degrade, the data coverage and quality will degrade also International Challenges

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, Opportunity: adopt common metadata standards – Enable discovery and access in a platform-independent fashion – Support the Administration’s Open Data Initiatives (e.g., data.gov and climate.gov) – Modern, self-describing format standards (e.g. netCDF) bundle metadata with the data Examples of key pieces of information in the metadata: – Observation details (what, how) and Geospatial information (where, when) – Dataset provenance, including quality control information – Data discovery end points, such as websites, FTP sites, GIS services, etc Challenge: implementation is a very large effort – NCDC alone has over 700 different data collections – Each collection may have hundreds to millions of granules, all of which have to be properly described with appropriate metadata. Metadata Challenges

NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, NOAA’s National Climatic Data Center