Unidata's Involvement in Developing and Supporting Climate Science Infrastructure Russ Rew UCAR Unidata April 2010.

Slides:



Advertisements
Similar presentations
1 NASA CEOP Status & Demo CEOS WGISS-25 Sanya, China February 27, 2008 Yonsook Enloe.
Advertisements

Expanding Regridding Capabilities of the Earth System Modeling Framework Andrew Scholbrock University of Colorado – Boulder Robert Oehmke NOAA/CIRES 1.
Earth System Curator Spanning the Gap Between Models and Datasets.
Reading HDF family of formats via NetCDF-Java / CDM
Recent Work in Progress
The Model Output Interoperability Experiment in the Gulf of Maine: A Success Story Made Possible By CF, NcML, NetCDF-Java and THREDDS Rich Signell (USGS,
Integrating NOAA’s Unified Access Framework in GEOSS: Making Earth Observation data easier to access and use Matt Austin NOAA Technology Planning and Integration.
A Common Data Model In the Middle Tier Enabling Data Access in Workflows … HDF/HDF-EOS Workshop XIV September 29, 2010 Doug Lindholm Laboratory for Atmospheric.
Streaming NetCDF John Caron July What does NetCDF do for you? Data Storage: machine-, OS-, compiler-independent Standard API (Application Programming.
THREDDS, CDM, OPeNDAP, netCDF and Related Conventions John Caron Unidata/UCAR Sep 2007.
7 +/- 2 Maybe Good Ideas John Caron June (1) NetCDF-Java (aka CDM) has lots of functionality, but only available in Java – NcML Aggregation – Access.
The Future of NetCDF Russ Rew UCAR Unidata Program Center Acknowledgments: John Caron, Ed Hartnett, NASA’s Earth Science Technology Office, National Science.
A New Generation of Data Services for Earth System Science Education and Research: Unidata’s Plans and Directions AGU Fall Meeting San Francisco, CA 6.
TPAC Digital Library Talk Overview Presenter:Glenn Hyland Tasmanian Partnership for Advanced Computing & Australian Antarctic Division Outline: TPAC Overview.
Unidata TDS Workshop THREDDS Data Server Overview October 2014.
Quick Unidata Overview NetCDF Workshop 25 October 2012 Russ Rew.
Implementation of Model Data Interoperability for IOOS: Successes and Lessons Learned Rich Signell USGS Woods Hole, MA / NOAA Silver Spring USA Model Data.
NE II NOAA Environmental Software Infrastructure and Interoperability Program Cecelia DeLuca Sylvia Murphy V. Balaji GO-ESSP August 13, 2009 Germany NE.
Unidata’s TDS Workshop TDS Overview – Part II October 2012.
2 3 ROMS/COAWST NcML file 4 5 Exploiting IOOS: A Distributed, Standards-Based Framework and Software Stack for Searching, Accessing, Analyzing and.
Unidata TDS Workshop TDS Overview – Part I XX-XX October 2014.
Unidata’s Common Data Model John Caron Unidata/UCAR Nov 2006.
THREDDS Data Server Ethan Davis GEOSS Climate Workshop 23 September 2011.
NcML Aggregation vs Feature Collections. NcML functionality 1.Modify the objects found in CDM files – Especially Attributes – Don’t have to rewrite the.
Unidata’s TDS Workshop TDS Overview – Part II Unidata July 2011.
N-Wave Stakeholder Users Conference Wednesday, May 11, Marine St, Rm 123 Boulder, CO Linda Miller and Mike Schmidt Unidata Program Center (UPC)-Boulder,
Mid-Course Review: NetCDF in the Current Proposal Period Russ Rew
Accomplishments and Remaining Challenges: THREDDS Data Server and Common Data Model Ethan Davis Unidata Policy Committee Meeting May 2011.
The netCDF-4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012.
Integrated Model Data Management S.Hankin ESMF July ‘04 Integrated data management in the ESMF (ESME) Steve Hankin (NOAA/PMEL & IOOS/DMAC) ESMF Team meeting.
THREDDS Data Server Unidata’s Common Data Model Background / Summary John Caron Unidata/UCAR Mar 2007.
Integrated Grid workflow for mesoscale weather modeling and visualization Zhizhin, M., A. Polyakov, D. Medvedev, A. Poyda, S. Berezin Space Research Institute.
The Arctic Observing Network (AON) Cooperative Arctic Data and Information Service (CADIS) Florence Fetterer,
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative External Observatory Integration Christopher Mueller, Matt Arrott, John Graybeal Life Cycle.
IOOS Modeling Testbed Cyberinfrastructure Rich Signell, USGS, Woods Hole, MA IOOS-RA-Briefing, Feb 14, 2012.
Unidata TDS Workshop THREDDS Data Server Overview
1 NASA CEOP Status & Demo CEOS WGISS-24 Oberpfaffenhofen, Germany October 15, 2007 Yonsook Enloe.
Quick Unidata Overview NetCDF Workshop 2 August 2009 Russ Rew Data Services Group.
Recent developments with the THREDDS Data Server (TDS) and related Tools: covering TDS, NCML, WCS, forecast aggregation and not including stuff covered.
Unidata’s Common Data Model and the THREDDS Data Server John Caron Unidata/UCAR, Boulder CO Jan 6, 2006 ESIP Winter 2006.
IOOS Data Services with the THREDDS Data Server Rich Signell USGS, Woods Hole IOOS DMAC Workshop Silver Spring Sep 10, 2013 Rich Signell USGS, Woods Hole.
THREDDS Catalogs Ethan Davis UCAR/Unidata NASA ESDSWG Standards Process Group meeting, 17 July 2007.
Unidata’s TDS Workshop TDS Overview – Part I July 2011.
The CF Conventions: Options for Sustained Support Involving Unidata Russ Rew Unidata Policy Committee May 12, 2008.
May 6, 2002Earth System Grid - Williams The Earth System Grid Presented by Dean N. Williams PI’s: Ian Foster (ANL); Don Middleton (NCAR); and Dean Williams.
The HDF Group Introduction to netCDF-4 Elena Pourmal The HDF Group 110/17/2015.
NetCDF-4: Software Implementing an Enhanced Data Model for the Geosciences Russ Rew, Ed Hartnett, and John Caron UCAR Unidata Program, Boulder
NetCDF and Scientific Data Durability Russ Rew, UCAR Unidata ESIP Federation Summer Meeting
Data File Formats: netCDF by Tom Whittaker University of Wisconsin-Madison SSEC/CIMSS 2009 MUG Meeting June, 2009.
Advances in the NetCDF Data Model, Format, and Software Russ Rew Coauthors: John Caron, Ed Hartnett, Dennis Heimbigner UCAR Unidata December 2010.
Earth System Curator and Model Metadata Discovery and Display for CMIP5 Sylvia Murphy and Cecelia Deluca (NOAA/CIRES) Hannah Wilcox (NCAR/CISL) Metafor.
GIS for Atmospheric Sciences and Hydrology By David R. Maidment University of Texas at Austin National Center for Atmospheric Research, 6 July 2005.
Weathertop Consulting, LLC Server-side OPeNDAP Analysis – Concrete steps toward a generalized framework via a reference implementation using F-TDS Roland.
Convergence And Trust in Earth and Space Science Data Systems Ted Habermann, NOAA National Geophysical Data Center Documentation: It’s not just discovery...
Unidata Technologies Relevant to GO-ESSP: An Update Russ Rew
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative External Observatory Integration: Christopher Mueller Life Cycle Objectives Review La Jolla,
1 2.5 DISTRIBUTED DATA INTEGRATION WTF-CEOP (WGISS Test Facility for CEOP) May 2007 Yonsook Enloe (NASA/SGT) Chris Lynnes (NASA)
Development of a CF Conventions API Russ Rew GO-ESSP Workshop, LLNL
NetCDF: Data Model, Programming Interfaces, Conventions and Format Adapted from Presentations by Russ Rew Unidata Program Center University Corporation.
The NOAA Operational Model Archive and Distribution System NOMADS CEOS-Grid Application Status Report Glenn K. Rutledge NOAA NCDC CEOS WGISS-19 Cordoba,
Interoperability Day Introduction Standards-based Web Services Interfaces to Existing Atmospheric/Oceanographic Data Systems Ben Domenico Unidata Program.
Update on Unidata Technologies for Data Access Russ Rew
The CF Conventions: Governance and Community Issues in Establishing Standards for Representing Climate, Forecast, and Observational Data Russ Rew 1, Bob.
Unidata Infrastructure for Data Services Russ Rew GO-ESSP Workshop, LLNL
The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team Photo.
NetCDF-Java version 2.2 Common Data Model John Caron Unidata/UCAR Dec 10, 2004.
A Quick Tour of the NOAA Environmental Software Infrastructure and Interoperability Group Cecelia DeLuca Dr. Robert Detrick visit March 28, 2012
Data Browsing/Mining/Metadata
Plans for an Enhanced NetCDF-4 Interface to HDF5 Data
Status for Endeavor 6: Improved Scientific Data Access Infrastructure
Presentation transcript:

Unidata's Involvement in Developing and Supporting Climate Science Infrastructure Russ Rew UCAR Unidata April 2010

Overview Background – NetCDF – Climate and Forecast Metadata Conventions Current involvement – CF conventions and governance – TDS, CDM, NcML, Libcf, GridSpec Upcoming efforts – NOAA projects: GIP, NCDC climate portal, CF satellite conventions – Proposals under review Concluding remarks

NetCDF: Unidata’s first cyberinfrastructure Portable, self-describing data format, data model, and software libraries supporting creation, access, and sharing of scientific data 1990's: achieves widespread use in ocean and climate modeling 2002: Java version with OPeNDAP client support 2003: NASA funds netCDF/HDF project, Argonne/Northwestern parallel netCDF released 2007: netCDF/CF format mandated for CMIP model archive for IPCC AR4 2008: netCDF-4 with HDF5 integration, enhanced data mode, parallel I/O 2009: netCDF format standard endorsed by NASA 2010: OPeNDAP client support for C/Fortran libraries

cfconventions.org Climate and Forecast (CF) Conventions Community agreements for earth science metadata interoperability Conventional ways to specify –Coordinate information needed to locate data in space and time –Standard names for quantities to determine whether data from different sources are comparable –Additional grid information (e.g., grid cell bounds, cell averaging methods) Infrastructure widely used in tools and climate models: AMIP, CCMVal, CEOP, CFMIP, C- LAMP, CMIP3, CMIP5, ENSEMBLES, IOOS, MERSEA, NARCCAP, PMIP, …

Unidata's Roles in the CF Conventions Development – Active in resolving CF issues and developing conventions – Completing new conventions for observational data Software – NetCDF-Java generates CF metadata when reading data in other formats – Libcf helps data providers create CF-compliant data Standards – Unidata-written draft CF standard for NASA under review – Comprehensive specification of meaning of CF- compliance Governance – Serving on CF Governance and Conventions committees

Goals: – Make use of CF Conventions easy for data writers and readers – Help ensure CF-conforming files, CF-compliant applications – Provide geo coordinate systems for netCDF data – Provide advanced features of netCDF-Java for C and Fortran developers Status: – Available in alpha-release form – Includes early release of Gridspec library Unidata's Development of a CF Library

Gridspec A proposed CF standard extension for the description of grids used in Earth System models Developed at Princeton GFDL (Balaji, Liang) Library API included in Unidata's libcf Allows AR5 model output to use model native grids: staggered, nested, cube-sphere, tripolar, yin-yang Supports conservative regridding by users Unidata’s primary role in NOAA Global Interoperability Project – Inclusion of Gridspec C in libcf – Development of Fortran 2003 API – Automated build system and testing – Documentation of APIs

Unidata's Common Data Model A useful merger of netCDF, OPeNDAP, and HDF5 technologies Implemented in netCDF-Java Reads over 20 data formats through netCDF interface Adds coordinate systems layer Adds scientific features layer (grids, trajectories, swaths, discrete samplings, …) NetcdfDataset Application Scientific Feature Types THREDDS Catalog.xml OPeNDAP NetCDF-3 HDF4 I/O service provider GRIB GINI NIDS NetcdfFile NetCDF-4 … Nexrad DMSP CoordSystem Builder Datatype Adapter NcM L

What is NcML? Server-side XML representation of netCDF metadata Uses include: – creating new netCDF files – modifying (“fixing”) existing datasets without rewriting them – creating virtual datasets as aggregations of multiple existing files Integrated with the TDS NcM L + NcML + NcML * NcML * Data

Web server for scientific data Bulk file transfer via HTTP Remote access, subsetting CDM files – OPeNDAP (any CDM files) – Open Geospatial Consortium (OGC) Web Coverage Service (grids) – OGC Web Map Service (grids) – NetCDF subset service (grids) – Experimental data access protocols (any) Clients include IDL, MATLAB, ArcGIS, GoogleEarth, IDV, McIDAS-V, NASA World Wind, netCDF-Java applications such as Panoply, ncBrowse, ERDDAP, … What is the THREDDS Data Server?

The Present Situation Climate researchers make use of software and services from Unidata: netCDF, TDS, CDM, NcML, libcf, Gridspec Unidata is participating in various climate data and related projects: CF conventions development, OGC standards, preparations for CMIP5 for IPCC AR5 Unidata is being pulled into new climate initiatives: NOAA National Climate Model Portal, NOAA Global Interoperability Program, CF satellite product conventions

NCDC March 2010 Workshop on Ensuring Access and Trustworthiness of Climate Observations and Models From final report: Throughout discussions at the workshop it was clear that both the research and operational Agencies that are involved with the emerging National Climate Service rely on foundational IT infrastructure that has been developed and supported by efforts such as the NSF UCAR Unidata group (e.g., the NetCDF format, CF names and common data model) …

Recent Developments Unidata has developed and mostly implemented CF conventions for discrete sampling (formerly “point observations”) Unidata has proposed further involvement with related research initiatives, including NSF proposals to SDCI solicitation With recent NCDC climate observation meeting, a larger role for Unidata technologies in the National Climate Service is envisioned – Funded collaboration to improve model-to-observational data intercomparisons in support of NOAA’s National Climate Model Portal – Visit from NCDC developer to learn TDS technologies (IOSPs, NcML, …) and coordinate developing CF conventions for satellite data products – Participation in new Unidata cf-satellite mailing list, with NCDC, SSEC, EUMETSAT, and other collaborators

Proposal to NSF OCI SDCI: Enhancements for Developers of netCDF Tools Proposed collaboration with C. Zender, UCI, netCDF operator utilities, widely used for analysis of climate model outputs Adds new high-level interfaces for tool developers, to be used in NCO for quick feedback loop Breaks a chicken-and-egg logjam delaying effective use of recent advances in scientific data models for large and complex collections

Second Proposal to NSF OCI SDCI: Extensions to Unidata’s CDM, IDD, and THREDDS Data Server for Streaming Real-time Data and Large Archived Data Sets Provides ability to stream packets of earth science data based on filters – Add user-defined content-based filters – Query language based on CDM, specialized for scientific data Scalable to very large archives – Parallel techniques (Google map-reduce) – Asynchronous and synchronous communication – Commodity message queues to decouple client and server

Concluding Remarks Unidata has had significant involvement in climate research infrastructure since the early 1990's – netCDF and udunits software – early conventions Unidata's contributions have been important and sustained – CF Conventions, CDM, TDS, OPeNDAP clients and servers Unidata is currently being pulled into more climate- related activities and projects – Libcf, Gridspec, NCDC climate project, proposals, CMIP5

Questions and Discussion

Extra slides

Unidata and NSF Science, Engineering, and Education for Sustainability (SEES) Educational activities Fundamental discovery Research infrastructure Climate change research infrastructure –Software –Services –Standards –Support Educational activities –Engaged multidisciplinary community –Experience bringing data to classrooms

Desirable Outcomes of NCDC/Unidata Project Directly support the improvement of some of the required scientific data software infrastructure Facilitate model-to-observational inter-comparisons in support of the IPCC's Climate Model Intercomparison Project(s) (CMIP) and their associated Assessment Reports (AR) Provide improved scientific data manipulation capabilities using Unidata software and applications (the THREDDS Data Server, TDS) to enable aggregation of long time-series model data

Primary Goals of NCDC/Unidata Project Optimize aggregation capabilities in netCDF-Java library and TDS. Work to ensure that file aggregation works for the very large, very long timescales for ensemble and reanalysis data Develop IOSP’s or NcML to make specific datasets CF- compliant: Climate Forecast System Reanalyis, Reynolds O/I ST’s, NCEP Global Forecast System, NCEP Global Ensemble, Smith-Reynolds ERSST Develop and maintain new IOSPs for five ASCII datasets: Global Historical Climatology Network (GHCN) in-situ data, Interactive Global Rawinsonde Archive (IGRA) upper-air in-situ data, USHCN v2, ISD/ISH, ICOADS Serve data through TDS using CLASS as backend data store Assist in training NCDC software engineers to maintain and extend the work.

A Long Term Vision Leverage NSF's investment in Unidata's proven expertise in developing software and services for the climate change research community Balance Unidata's efforts between weather and climate Provide cyberinfrastructure for climate change research Focus on enabling educators to bring climate change science to the classroom

Unidata GO-ESSP Participation Global Organization for Earth System Science Portals Collaboration to develop software infrastructure for distributed access to observed and simulated data from climate and weather communities – by developing individual software components – by building a federation of frameworks that can work together using agreed-upon standards Crosses institutional, agency and international boundaries: Bryan Lawrence (BADC), V. Balaji (GFDL), Michael Lautenschlager (German Climate Computing Centre), Dean Williams (LLNL), Don Middleton (NCAR), Steve Hankin (PMEL) Annual meetings include CF Conventions discussions, decisions Unidata an invited participant since 2004