NetCDF: Data Model, Programming Interfaces, Conventions and Format Adapted from Presentations by Russ Rew Unidata Program Center University Corporation.

Slides:



Advertisements
Similar presentations
1 NASA CEOP Status & Demo CEOS WGISS-25 Sanya, China February 27, 2008 Yonsook Enloe.
Advertisements

EURO4M Project Kick-Off, April 2010 OGC Web Services Data visualization using OGC web services Maarten Plieger Wim Som de Cerff Royal Netherlands Meteorological.
Complex Scientific Analytics in Earth Science at Extreme Scale John Caron University Corporation for Atmospheric Research Boulder, CO Oct 6, 2010.
Recent Work in Progress
The Model Output Interoperability Experiment in the Gulf of Maine: A Success Story Made Possible By CF, NcML, NetCDF-Java and THREDDS Rich Signell (USGS,
Streaming NetCDF John Caron July What does NetCDF do for you? Data Storage: machine-, OS-, compiler-independent Standard API (Application Programming.
® OGC Web Services Initiative, Phase 9 (OWS-9): Innovations Thread - OPeNDAP James Gallagher and Nathan Potter, OPeNDAP © 2012 Open Geospatial Consortium.
The Future of NetCDF Russ Rew UCAR Unidata Program Center Acknowledgments: John Caron, Ed Hartnett, NASA’s Earth Science Technology Office, National Science.
NetCDF An Effective Way to Store and Retrieve Scientific Datasets Jianwei Li 02/11/2002.
NetCDF 3.6: What’s New Russ Rew Unidata Program Center University Corporation for Atmospheric Research
Introduction to NetCDF Russ Rew, UCAR Unidata ICTP Advanced School on High Performance and Grid Computing 13 April 2011.
Unidata TDS Workshop THREDDS Data Server Overview October 2014.
1 CF Unleashed: Introduction to Cf/Radial Joe VanAndel National Center for Atmospheric Research 2013/1/8 The National Center for Atmospheric.
Status of netCDF-3, netCDF-4, and CF Conventions Russ Rew Community Standards for Unstructured Grids Workshop, Boulder
Developing a NetCDF-4 Interface to HDF5 Data
OPeNDAP and the Data Access Protocol (DAP) Original version by Dave Fulker.
Quick Unidata Overview NetCDF Workshop 25 October 2012 Russ Rew.
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
NetCDF-4 The Marriage of Two Data Formats Ed Hartnett, Unidata June, 2004.
Developing a NetCDF-4 Interface to HDF5 Data Russ Rew (PI), UCAR Unidata Mike Folk (Co-PI), NCSA/UIUC Ed Hartnett, UCAR Unidata Quincey Kozial, NCSA/UIUC.
Implementation of Model Data Interoperability for IOOS: Successes and Lessons Learned Rich Signell USGS Woods Hole, MA / NOAA Silver Spring USA Model Data.
Using python client libraries to access remote servers via Web Coverage Services Dominic Lowe, BADC Ben Domenico, Unidata/UCAR* and the GALEON Team * Work.
, Implementing GIS for Expanded Data Accessibility and Discoverability ASDC Introduction The Atmospheric Science Data Center (ASDC) at NASA Langley Research.
Unidata’s TDS Workshop TDS Overview – Part II October 2012.
DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.
Unidata TDS Workshop TDS Overview – Part I XX-XX October 2014.
THREDDS Data Server Ethan Davis GEOSS Climate Workshop 23 September 2011.
N-Wave Stakeholder Users Conference Wednesday, May 11, Marine St, Rm 123 Boulder, CO Linda Miller and Mike Schmidt Unidata Program Center (UPC)-Boulder,
Mid-Course Review: NetCDF in the Current Proposal Period Russ Rew
EARTH SCIENCE MARKUP LANGUAGE Why do you need it? How can it help you? INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
Accomplishments and Remaining Challenges: THREDDS Data Server and Common Data Model Ethan Davis Unidata Policy Committee Meeting May 2011.
The netCDF-4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012.
1PeopleDocumentsData Catalog Generation Tools Analysis and Visualization Tools Data Services Discovery and Publication Tools Discovery and Publication.
1 International Standards for Data Interoperability GALEON Geo-interface for Air, Environment, Land, Ocean NetCDF Ben Domenico Unidata Program Center*
1 NOAA IOOS Program Data Integration Framework (DIF) Project Overview Adapted from a brief to the NOAA Data Management Committee August 6, 2008 by Jeff.
DAP4 James Gallagher & Ethan Davis OPeNDAP and Unidata.
Unidata TDS Workshop THREDDS Data Server Overview
1 NASA CEOP Status & Demo CEOS WGISS-24 Oberpfaffenhofen, Germany October 15, 2007 Yonsook Enloe.
Quick Unidata Overview NetCDF Workshop 2 August 2009 Russ Rew Data Services Group.
Recent developments with the THREDDS Data Server (TDS) and related Tools: covering TDS, NCML, WCS, forecast aggregation and not including stuff covered.
Unidata’s Common Data Model and the THREDDS Data Server John Caron Unidata/UCAR, Boulder CO Jan 6, 2006 ESIP Winter 2006.
IOOS Data Services with the THREDDS Data Server Rich Signell USGS, Woods Hole IOOS DMAC Workshop Silver Spring Sep 10, 2013 Rich Signell USGS, Woods Hole.
THREDDS Catalogs Ethan Davis UCAR/Unidata NASA ESDSWG Standards Process Group meeting, 17 July 2007.
Unidata’s TDS Workshop TDS Overview – Part I July 2011.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
The CF Conventions: Options for Sustained Support Involving Unidata Russ Rew Unidata Policy Committee May 12, 2008.
1 NASA CEOP Final Summary CEOS WGISS-26 Boulder, Colorado September 23, 2008 Yonsook Enloe
The HDF Group Introduction to netCDF-4 Elena Pourmal The HDF Group 110/17/2015.
NetCDF-4: Software Implementing an Enhanced Data Model for the Geosciences Russ Rew, Ed Hartnett, and John Caron UCAR Unidata Program, Boulder
NetCDF and Scientific Data Durability Russ Rew, UCAR Unidata ESIP Federation Summer Meeting
Data File Formats: netCDF by Tom Whittaker University of Wisconsin-Madison SSEC/CIMSS 2009 MUG Meeting June, 2009.
Advances in the NetCDF Data Model, Format, and Software Russ Rew Coauthors: John Caron, Ed Hartnett, Dennis Heimbigner UCAR Unidata December 2010.
SPDF Science Advisory Group - September 29-30, 2005 Page 12/24/2016 9:09:48 PM Services of the Space Physics Data Facility (SPDF) / Sun-Earth Connection.
Unidata Technologies Relevant to GO-ESSP: An Update Russ Rew
CF 2.0 Coming Soon? (Climate and Forecast Conventions for netCDF) Ethan Davis ESO Developing Standards - ESIP Summer Mtg 14 July 2015.
OGC Web Services with complex data Stephen Pascoe How OGC Web Services relate to GML Application Schema.
Rich Signell Roland Viger Curtis Price USGS Community for Data Integration Feb 15, 2012.
1 2.5 DISTRIBUTED DATA INTEGRATION WTF-CEOP (WGISS Test Facility for CEOP) May 2007 Yonsook Enloe (NASA/SGT) Chris Lynnes (NASA)
Interoperability Day Introduction Standards-based Web Services Interfaces to Existing Atmospheric/Oceanographic Data Systems Ben Domenico Unidata Program.
Update on Unidata Technologies for Data Access Russ Rew
NcBrowse: A Graphical netCDF File Browser Donald Denbo NOAA-PMEL/UW-JISAO
Unidata Infrastructure for Data Services Russ Rew GO-ESSP Workshop, LLNL
NetCDF Data Model Details Russ Rew, UCAR Unidata NetCDF 2009 Workshop
Other Projects Relevant (and Not So Relevant) to the SODA Ideal: NetCDF, HDF, OLE/COM/DCOM, OpenDoc, Zope Sheila Denn INLS April 16, 2001.
NetCDF-Java version 2.2 Common Data Model John Caron Unidata/UCAR Dec 10, 2004.
SRNWP Interoperability Workshop
Plans for an Enhanced NetCDF-4 Interface to HDF5 Data
Unidata & NetCDF BoF Scientific File Formats
Recent Work in Progress
Status for Endeavor 6: Improved Scientific Data Access Infrastructure
Presentation transcript:

NetCDF: Data Model, Programming Interfaces, Conventions and Format Adapted from Presentations by Russ Rew Unidata Program Center University Corporation for Atmospheric Research Sponsored by NSF and NASA

2 Overview l What is Unidata? l What is netCDF? l Data model l Programming interfaces l Format l Limitations l Standardization Possibilities l Summary

3 Unidata: A Community Endeavor l Community of educators and researchers at 120 universities, 30 other institutions, international in scope l Mission: providing data, tools, support, and community leadership for enhanced earth-system education and research l Funded primarily by the National Science Foundation l Atmospheric science community, expanding to oceanography, hydrology, other geosciences l Unidata Program Center: 25 staff, 15 developers

4 Unidata Provides... l User support, software development, maintenance, and community advocacy l Near real time “push” data dissemination: Internet Data Distribution among 170 institutions for use in research and education (IDD, LDM) l Tools for analysis and visualization (Gempak, IDV, McIDAS) l Data access and collections infrastructure (netCDF, udunits, THREDDS,...)

5 NetCDF Overview l Model for scientific data: variables, dimensions, attributes, coordinates l Libraries for data access: C, Fortran, C++, Java, Perl, Python, Ruby, Matlab, IDL,... l Associated Conventions convey semantics l Format for portable binary data that supports direct access, metadata, appending new data, shared access (note that users need not know anything about format)

6 Some Applications for netCDF l Gridded output from models: forecast, climate, ocean, atmospheric chemistry,... l Observational data: surface, soundings, satellite, lightning, aircraft, radar,... l Various scientific data archives l Chromatography, mass spectrometry, medical imaging,... l In general: platform-independent, language- independent interface to insulate data providers from data consumers

NetCDF’s Niche l Simple data model for scientific datasets l Portable, self-describing data l Appendable, sharable, archivable l Direct access for efficient subsetting l Metadata via attribute conventions such as CF l Flexible remote access via OPeNDAP, HTTP, WCS l Lots of applications: NCO, ncbrowse, ncview, IDV, IDL, MATLAB, ArcGIS,... l Language interfaces include C, Java, Fortran, C++, Perl, Python, Ruby,...

8 NetCDF Data Model l Variables: name, shape (list of Dimensions), type, Attributes, values l Dimensions: name, length l Attributes: name, type, value(s) l Coordinate Variables: values associated with Dimensions

9 NetCDF Data Model Variable name: String shape: Dimension[ ] type: DataType Dataset location: String Dimension name: String length: int isUnlimited: boolean Attribute name: String value: Vector type: DataType CoordinateVariable name: String shape: Dimension[ 1 ] type: DataType same name ValuesArray type: primitive shape: int[ ]

10 Availability of netCDF l Freely available source, binaries: u l Unidata-supported interfaces: u C, C++, Fortran-77, Fortran-90, Java, Perl l Platforms: u Unix, Windows, Mac OS X,... l License: u X Windows/MIT style open source license u Used in several commercial applications l Status: u version released February 2004 u netCDF-4 under development

Widespread netCDF Use l 500 entries representing 32 in netcdfgroup mailing list: academic institutions, government agencies, commercial firms l Downloaded by more than 2000 distinct hosts in 55 countries in last 10 years l Software for manipulating/displaying netCDF: more than u 15 commercial packages, u 56 freely available package l Adopted by Analytical Instrument Association [AIA] to implement the Analytical Data Interchange Protocols l Neuro-Imaging Laboratory of the Montreal Neurological Institute

12 netCDF Language Interfaces l Conventions of each language supported by interfaces l C: core implementation l C++, Fortran-77, Fortran-90, Perl: call C library l Java: independent pure-Java implementation l Python, Ruby, IDL, Matlab, Ada: third-party layers over C library l Data is language-neutral

13 Java Interface l Currently most advanced netCDF interface, used to test new ideas l Client access to data servers: u HTTPD u OPeNDAP/DODS l Advanced coordinate system support for general and georeferenced coordinates l Virtual datasets to aggregate, subset, redefine, using NcML (XML representation for netCDF)

14 NetCDF Format l Purpose: insulate users, applications, and data from machine architectures, format details, additions to data l Provides data that is u self-describing u portable u directly accessible u appendable l Still support original XDR-based format (1988) l 32-bit file offset limits file size l New format with 64-bit offsets for huge files

NetCDF File Format Records grow in the UNLIMITED dimension l Header u Dimensions u Attributes u Variable metadata l Fixed-size variables l Record variables u Use UNLIMITED dimension l XDR-like represenations for all data and metadata netCDF Header 1st non-record variable 2nd non-record variable... nth non-record variable 1st record for 1st record variable 1st record for 2nd record variable... 1st record for rth record variable 2nd record for 1st record variable 2nd record for 2nd record variable... 2nd record for rth record variable

16 Some Current netCDF Limitations l Not a database system: no keyed access, transactions, multiple concurrent writers l Strings are just character arrays l No nested data structures l No “ragged arrays”, variable length records l Only one growable dimension per dataset l No built-in compression or packing l Georeferencing requires agreement on conventions l...

17 NetCDF CF Conventions l Climate and Forecast metadata conventions u Metadata for use, not discovery  u Adopted by: PCMDI, PRISM, ESMF, NCAR, Hadley Centre, GFDL, various EU projects u Developed by volunteers, active list l Includes georeferencing conventions for multidimensional coordinate systems l Includes a “Standard Name Table” for quantities u Google for “CF standard name table” u 76 of 416 entries (as of June 3, 2004) relate to hydrology

Taking Advantage of Web Services for Data System Interoperability GIS Client Applications FES Client Applications OpenGIS Protocols: WMS, WFS, WCS OGC or proprietary GIS protocols OGC or OPeNDAP ADDE. FTP… protocols GIS Server GIS Servers Demographic, infrastructure, societal impacts, … datasets THREDDS Server FES Servers Satellite, radar, forecast model output, … datasets

19 Obstacles to GIS-netCDF Connection l netCDF data model (finite sampling of real functions) differs from relational, GIS datamodels l Some netCDF data not spatial l Much netCDF data multidimensional, > 2D l Georeferencing for netCDF data uses multiple conventions l Typical netCDF datasets may be very large l Large netCDF datasets may be on a remote server

20 Unidata GIS-netCDF Efforts l Prototype netCDF to geoTIFF converters l Parser for georeferencing conventions implemented in Java l Unidata GeoGrid from georeferenced netCDF variable supported in Java l GridDataset: collection of GeoGrids in a netCDF dataset with methods for getting volumes and slices l NcML for wrapping any new conventions around existing data

netCDF-related Changes Proposed in WCS 1.1 l WCS encoding “profiles” instead of fixed list of encoding formats l CF-netCDF is first proposed WCS encoding profile l Multiple “variables” or “parameters” in a coverage (e.g., pressure, temperature, etc.) l Coverages with 3 spatial dimensions l Coverages with multiple time dimensions (e.g. forecast time in model output) l Non-spatial “height” dimension, (e.g., atmospheric pressure, ocean density) l Irregularly-spaced grids l Technical committee votes on entirety of WCS 1.1 this summer.

Possible netCDF Standardization l File format l Applications programming interfaces l Conventions (e.g., CF, Climate and Forecast) l Release level: u netCDF 3 u netCDF 4 later this year (will include access to HDF 5 files) l Common Data Access Model (with elements of netCDF, HDF, and OPeNDAP data models) l (Relationship to WCS encoding profile specification?)

23 Summary l NetCDF is a simple data model, different from Relational or GIS data models l NetCDF format and interfaces provide a portable way to store and access multidimensional array-oriented data l Conventions exist for georeferencing netCDF data l May be a bridge to link GIS and scientific users, applications, and data

More Information l NetCDF documentation and code l CF Conventions metadata/index.html metadata/index.html l Russ Rew