The netCDF-4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012.

Presentation transcript:

The netCDF-4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012

2 NetCDF data models, formats, APIs (slide footer: April 14, 2011)
Data models for scientific data and metadata
  - classic: simplest model, with dimensions, variables, attributes
  - enhanced: more powerful model that adds groups, types, nesting
File formats for portable data
  - Array-oriented scientific data and metadata
  - Formats: classic, 64-bit offset, netCDF-4, netCDF-4 classic model
  - Formats make data self-describing, portable, direct access, appendable, extensible, sharable, archivable
Application programming interfaces (APIs)
  - C, Java, Fortran, C++
  - Python, Ruby, Perl, MATLAB, IDL, … (3rd-party APIs)
Together, the data models, file formats, and APIs support the creation, access, and sharing of scientific data.

3 The netCDF "classic" data model
A netCDF file has named variables, attributes, and dimensions.
  - Variables are for data; attributes are for metadata (data about data).
  - Dimensions are for specifying shapes of variables.
  - Attributes may apply to a whole file or to a single variable.
  - Variables may share dimensions, indicating a common grid.
  - One dimension may be of unlimited length.
  - Each variable or attribute has one of six types: char, byte, short, int, float, double.
Figure labels: Variables, Dimensions, Attributes

4 The netCDF classic data model, in UML (Unified Modeling Language)
NetCDF Data has
  - Variables (e.g. temperature, pressure)
  - Attributes (e.g. units)
  - Dimensions (e.g. lat, lon, level, time)
Each variable has
  - Name, shape, type, attributes
  - N-dimensional array of values
Each attribute has
  - Name, type, value(s)
Each dimension has
  - Name, length
Variables may share dimensions
  - Represents shared coordinates, grids
Variable and attribute values are of type
  - Numeric: 8-bit byte, 16-bit short, 32-bit int, 32-bit float, 64-bit double
  - Character: arrays of char for text
UML diagram classes:
  - NetCDF Data
  - Attribute: name: String; type: primitive; value: type[ ]
  - Variable: name: String; shape: Dimension[ ]; type: primitive; values: type[ … ]
  - Dimension: name: String; length: int
  - Multiplicity label: 0..*
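The UML classes on this slide map fairly directly onto plain data classes. The following is a minimal Python sketch of that mapping, not the netCDF library's API: the class names follow the diagram, while `lengths`, the `records` parameter, and the grid sizes (73 by 144) are made up for illustration. An unlimited dimension is represented here by a length of `None`.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Dimension:
    name: str
    length: Optional[int]  # None stands for the unlimited dimension (assumption of this sketch)


@dataclass
class Attribute:
    name: str
    type: str
    values: list


@dataclass
class Variable:
    name: str
    shape: list  # list of Dimension objects, shared by reference between variables
    type: str
    attributes: list = field(default_factory=list)


def lengths(variable, records=0):
    """Return the variable's shape as integers, using `records` for an unlimited dimension."""
    return tuple(records if d.length is None else d.length for d in variable.shape)


# Illustrative dimensions; the sizes are invented for this example.
time = Dimension("time", None)
lat = Dimension("lat", 73)
lon = Dimension("lon", 144)

# Two variables on the same grid share the same Dimension objects.
temperature = Variable("temperature", [time, lat, lon], "float",
                       [Attribute("units", "char", ["K"])])
pressure = Variable("pressure", [time, lat, lon], "float",
                    [Attribute("units", "char", ["hPa"])])

print(lengths(temperature, records=12))
```

Because both variables hold references to the same `Dimension` objects, a generic tool can tell that they lie on a common grid simply by comparing their shapes.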

5 The netCDF-4 enhanced data model
  - A file has a top-level unnamed group.
  - Each group may contain one or more named subgroups, user-defined types, variables, dimensions, and attributes.
  - Variables also have attributes.
  - Variables may share dimensions, indicating a common grid.
  - One or more dimensions may be of unlimited length.
  - Variables and attributes have one of twelve primitive data types or one of four user-defined types.
UML diagram classes:
  - File
  - Group: name: String
  - Dimension: name: String; length: int
  - Attribute: name: String; type: DataType; value: type[ ]
  - Variable: name: String; shape: Dimension[ ]; type: DataType; values: type[ … ]
  - DataType: PrimitiveType or UserDefinedType
  - PrimitiveType: char, byte, short, int, float, double, unsigned byte, unsigned short, unsigned int, int64, unsigned int64, string
  - UserDefinedType: typename: String; one of Compound, VariableLength, Enum, Opaque
  - Multiplicity labels: 0..*, 1..*, 0..*
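The group hierarchy can be pictured as a tree in which each group carries its own dimensions and variables. Below is a rough Python sketch of that idea, not the netCDF-4 API: the `Group` class, its method names, and the example group and dimension names are all hypothetical. It assumes that a dimension defined in a group is visible to that group's descendants, and it uses `None` to mark an unlimited dimension.

```python
class Group:
    """A hypothetical in-memory stand-in for a netCDF-4 group."""

    def __init__(self, name, parent=None):
        self.name = name
        self.parent = parent
        self.dimensions = {}  # name -> length (None marks an unlimited dimension)
        self.variables = {}   # name -> (type name, tuple of dimension names)
        self.groups = {}      # name -> child Group

    def add_group(self, name):
        child = Group(name, parent=self)
        self.groups[name] = child
        return child

    def find_dimension(self, name):
        """Look for a dimension in this group, then in each enclosing group."""
        group = self
        while group is not None:
            if name in group.dimensions:
                return group.dimensions[name]
            group = group.parent
        raise KeyError(name)

    def path(self):
        if self.parent is None:
            return "/"
        return self.parent.path().rstrip("/") + "/" + self.name


root = Group("")                 # the top-level group is unnamed
root.dimensions["time"] = None   # unlimited
root.dimensions["obs"] = None    # a second unlimited dimension is allowed here
root.dimensions["station"] = 10

run = root.add_group("model").add_group("run1")
run.variables["temperature"] = ("float", ("time", "station"))

print(run.path(), run.find_dimension("station"))
```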

6 NetCDF classic data model
Strengths
  - Data model simple to understand and explain
  - Can be efficiently implemented
  - Representation good for gridded multidimensional data
  - Shared dimensions useful for coordinate systems
  - Generic applications easy to develop
Limitations
  - Small set of primitive types
  - Data model is flat, limited to multidimensional arrays and (name, value) pairs
  - Flat name space not ideal for organizing many data objects
  - Lacks nested structures, variable-length types, enumerations

7 NetCDF enhanced data model
Strengths
  - Simpler than HDF5, with similar representational power
  - Adds shared dimensions to HDF5 data model
  - Continues support for existing data, software, and conventions
  - Adds real Strings and unsigned integer types
  - Provides nested structures: hierarchical groups, recursive data types
  - Independent features permit incremental adaptation, adoption
On the other hand...
  - More complex than classic data model
  - More effort required to develop general tools and applications
  - Adoption proceeding slowly
  - Hence, no comprehensive experience-based best practices or conventions yet

8 NetCDF classic formats
Strengths
  - Simple to understand and explain
  - Supported by many applications
  - Standard used in many archives, data projects
  - Mature conventions and best practices have evolved
Limitations
  - No support for efficient compression
  - Only one growable dimension
  - Schema changes can be costly
  - Portable representation favors big-endian platforms
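The last limitation comes down to byte order. The short sketch below uses Python's standard `struct` module to show the difference between big-endian and little-endian encodings of the same integer. It illustrates byte order in general, on the assumption that the classic formats store numbers most significant byte first; it does not read or write a netCDF file.

```python
import struct
import sys

value = 1

big = struct.pack(">i", value)     # big-endian: most significant byte first
little = struct.pack("<i", value)  # little-endian: least significant byte first

# Reversing the bytes of one encoding yields the other. A little-endian host
# has to do this swap for every value it reads from big-endian storage.
swapped = little[::-1]

print(big, little, "host byte order:", sys.byteorder)
```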

9 New NetCDF Binary Format
  - Before the netCDF-4 project, there were two binary formats: classic and 64-bit offset.
  - NetCDF-4.0 introduced a new binary format: netCDF-4/HDF5.
    - It is an HDF5 file, with some additional metadata.
    - It is read by netCDF code just like any other netCDF file.
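The three binary formats can be told apart by the first bytes of the file. The sketch below checks those leading bytes with a hypothetical helper, `detect_netcdf_format`; the netCDF library does its own detection internally. Two caveats apply: an HDF5 signature alone does not prove the file was written by netCDF-4, and an HDF5 file with a user block places its signature at a later offset, which this sketch does not search.

```python
HDF5_SIGNATURE = b"\x89HDF\r\n\x1a\n"


def detect_netcdf_format(header: bytes) -> str:
    """Guess the binary format from the first bytes of a file (illustrative helper)."""
    if header[:4] == b"CDF\x01":
        return "classic"
    if header[:4] == b"CDF\x02":
        return "64-bit offset"
    if header[:8] == HDF5_SIGNATURE:
        return "HDF5 (possibly netCDF-4)"
    return "unknown"


print(detect_netcdf_format(b"CDF\x01" + bytes(4)))
print(detect_netcdf_format(HDF5_SIGNATURE))
```

To apply it to a real file, read the first eight bytes in binary mode and pass them to the function.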

10 NetCDF-4 formats
  - Use HDF5 as a storage layer
  - Provide performance advantages of HDF5
    - Compression
    - Chunking
    - Efficient schema changes
  - Useful for very large or complex data
  - Suitable for high-performance computing
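Chunking and compression work together: data is split into fixed-size chunks, each compressed on its own, so a reader only has to decompress the chunk that holds the value it wants. The sketch below illustrates that idea with Python's standard `zlib` and `struct` modules. It is a toy model under stated assumptions (a one-dimensional array of 32-bit floats, hypothetical helper names `write_chunks` and `read_value`), not how HDF5 lays out chunks on disk.

```python
import struct
import zlib


def write_chunks(values, chunk_len):
    """Compress a list of floats as independent chunks of `chunk_len` values."""
    chunks = []
    for start in range(0, len(values), chunk_len):
        block = values[start:start + chunk_len]
        raw = struct.pack(f"<{len(block)}f", *block)  # 4 bytes per value
        chunks.append(zlib.compress(raw))
    return chunks


def read_value(chunks, chunk_len, index):
    """Read one value by decompressing only the chunk that contains it."""
    chunk_no, offset = divmod(index, chunk_len)
    raw = zlib.decompress(chunks[chunk_no])
    return struct.unpack_from("<f", raw, offset * 4)[0]


# Invented, highly repetitive data so that compression has something to gain.
values = [float(i % 10) for i in range(10_000)]
chunks = write_chunks(values, chunk_len=1_000)

stored = sum(len(c) for c in chunks)
print(f"{stored} bytes stored for {4 * len(values)} bytes of data")
print(read_value(chunks, 1_000, 4_321))
```

The chunk size is a trade-off: larger chunks usually compress better, while smaller chunks mean less data to decompress for a small read.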

11 NetCDF-4 classic-model format
netCDF-4
  - Not compatible with some existing applications
  - Enhanced data model and API, more complex, powerful
netCDF-4 classic model
  - Uses classic API for compatibility
  - Uses netCDF-4/HDF5 storage for compression, chunking, performance
  - To use, just recompile, relink
netCDF-3
  - Compatible with existing applications
  - Simplest data model and API
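Whether data can use the netCDF-4 classic model depends on whether it stays within the classic data model: no subgroups, no user-defined types, only the six classic types, and at most one unlimited dimension. The sketch below encodes those rules, taken from the data model slides, in a hypothetical helper named `fits_classic_model`. The dictionary layout used to describe a dataset is an assumption made for this example and is not a netCDF structure.

```python
CLASSIC_TYPES = {"char", "byte", "short", "int", "float", "double"}


def fits_classic_model(dataset):
    """Return (ok, reasons) for a dataset description given as a plain dict.

    Expected keys (all optional): "dimensions" maps name -> length, with None
    for unlimited; "variables" maps name -> type name; "groups" and
    "user_types" are lists of names.
    """
    reasons = []
    if dataset.get("groups"):
        reasons.append("uses subgroups")
    if dataset.get("user_types"):
        reasons.append("uses user-defined types")
    unlimited = [name for name, length in dataset.get("dimensions", {}).items()
                 if length is None]
    if len(unlimited) > 1:
        reasons.append("has more than one unlimited dimension")
    for name, type_name in dataset.get("variables", {}).items():
        if type_name not in CLASSIC_TYPES:
            reasons.append(f"variable {name} has non-classic type {type_name}")
    return (not reasons, reasons)


# Invented example descriptions.
gridded = {
    "dimensions": {"time": None, "lat": 73, "lon": 144},
    "variables": {"temperature": "float"},
}
observations = {
    "dimensions": {"time": None, "obs": None},
    "variables": {"station_name": "string"},
    "groups": ["model"],
}

print(fits_classic_model(gridded))
print(fits_classic_model(observations))
```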

12 Commitment to Compatibility
To ensure future access to existing data archives, Unidata is committed to compatibility of:
  - Data access: new versions of netCDF software will provide read and write access to previously stored netCDF data.
  - Programming interfaces: C and Fortran programs using documented netCDF interfaces from previous versions will work without change with new versions of netCDF software.
  - Future versions: Unidata will continue to support both data access compatibility and program compatibility in future netCDF releases.

13 NetCDF standards endorsements
  - NASA Earth Science Data Systems (ESDS) Standards Process Group endorsed netCDF classic and 64-bit offset formats as appropriate for NASA Earth Science data.
  - Integrated Ocean Observing System (IOOS) Data Management and Communications (DMAC) Subsystem endorsed netCDF with Climate and Forecast (CF) conventions as a preferred data format.
  - Steering Committee of the US Federal Geographic Data Committee (FGDC) officially endorsed netCDF as a Common Encoding Standard.
  - Open Geospatial Consortium (OGC) endorsed "OGC Network Common Data Form (NetCDF) Core Encoding Standard version 1.0" as an OGC standard.
  - NASA ESDS Standards Process Group endorsed the NetCDF-4/HDF-5 File Format as a NASA Recommended Standard.
  - The Open Geospatial Consortium (OGC) approved the NetCDF Enhanced Data Model Extension Encoding Standard, making netCDF-4 an OGC standard.