Www.hdfgroup.org The HDF Group Introduction to netCDF-4 Elena Pourmal The HDF Group 110/17/2015.

Slides:



Advertisements
Similar presentations
Data Formats: Using self-describing data formats Curt Tilmes NASA Version 1.0 Review Date.
Advertisements

A PLFS Plugin for HDF5 for Improved I/O Performance and Analysis Kshitij Mehta 1, John Bent 2, Aaron Torres 3, Gary Grider 3, Edgar Gabriel 1 1 University.
Recent Work in Progress
The Future of NetCDF Russ Rew UCAR Unidata Program Center Acknowledgments: John Caron, Ed Hartnett, NASA’s Earth Science Technology Office, National Science.
The HDF Group Apr , 2012HDF/HDF-EOS Workshop XV1 Interoperability with netCDF-4 Kent Yang, Larry Knox, Elena Pourmal The HDF Group.
NetCDF An Effective Way to Store and Retrieve Scientific Datasets Jianwei Li 02/11/2002.
Introduction to NetCDF Ernesto Munoz. Outline Overview of NetCDF Overview of NetCDF NetCDF file information NetCDF file information CDL utilities: ncdump,
DESIGN OF LARGE SCALE DATA ARCHIVAL AND RETRIEVAL SYSTEM FOR TRANSPORTATION SENSOR (WRITE-ONCE-READ-MANY TYPE) DATA. by Nirish Dhruv Department of Computer.
NetCDF Ed Hartnett Unidata/UCAR
Introduction to NetCDF Russ Rew, UCAR Unidata ICTP Advanced School on High Performance and Grid Computing 13 April 2011.
1 CF Unleashed: Introduction to Cf/Radial Joe VanAndel National Center for Atmospheric Research 2013/1/8 The National Center for Atmospheric.
Status of netCDF-3, netCDF-4, and CF Conventions Russ Rew Community Standards for Unstructured Grids Workshop, Boulder
The HDF Group July 8, 2014HDF 2014 ESIP Summer Meeting HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann The.
Developing a NetCDF-4 Interface to HDF5 Data
1 of 14 Substituting HDF5 tools with Python/H5py scripts Daniel Kahn Science Systems and Applications Inc. HDF HDF-EOS Workshop XIV, 28 Sep
Data Formats: Using Self-describing Data Formats Curt Tilmes NASA Version 1.0 February 2013 Section: Local Data Management Copyright 2013 Curt Tilmes.
The HDF Group April 17-19, 2012HDF/HDF-EOS Workshop XV1 Introduction to HDF5 Barbara Jones The HDF Group The 15 th HDF and HDF-EOS Workshop.
NetCDF-4 The Marriage of Two Data Formats Ed Hartnett, Unidata June, 2004.
Developing a NetCDF-4 Interface to HDF5 Data Russ Rew (PI), UCAR Unidata Mike Folk (Co-PI), NCSA/UIUC Ed Hartnett, UCAR Unidata Quincey Kozial, NCSA/UIUC.
1 Russ Rew, Ed Hartnett, John Caron UCAR Unidata Program Center Mike Folk, Robert McGrath, Quincey Kozial NCSA and The HDF Group, Inc. Final Project Review,
1 High level view of HDF5 Data structures and library HDF Summit Boeing Seattle September 19, 2006.
HDF5 A new file format & software for high performance scientific data management.
DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.
NPP/ NPOESS Product Data Format Richard E. Ullman NASA/GSFC/NPP NOAA/NESDIS/IPOAlgorithm / System EngineeringData / Information Architecture
February 2-3, 2006SRB Workshop, San Diego P eter Cao, NCSA Mike Wan, SDSC Sponsored by NLADR, NFS PACI Project in Support of NCSA-SDSC Collaboration Object-level.
Mid-Course Review: NetCDF in the Current Proposal Period Russ Rew
CCGrid 2014 Improving I/O Throughput of Scientific Applications using Transparent Parallel Compression Tekin Bicer, Jian Yin and Gagan Agrawal Ohio State.
The HDF Group HDF5 Datasets and I/O Dataset storage and its effect on performance May 30-31, 2012HDF5 Workshop at PSI 1.
N P O E S S I N T E G R A T E D P R O G R A M O F F I C E NPP/ NPOESS Product Data Format Richard E. Ullman NOAA/NESDIS/IPO NASA/GSFC/NPP Algorithm Division.
The netCDF-4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012.
Deutscher Wetterdienst
Integrating netCDF and OPeNDAP (The DrNO Project) Dr. Dennis Heimbigner Unidata Go-ESSP Workshop Seattle, WA, Sept
_______________________________________________________________CMAQ Libraries and Utilities ___________________________________________________Community.
1 HDF5 Life cycle of data Boeing September 19, 2006.
NetCDF Data Model Issues Russ Rew, UCAR Unidata NetCDF 2010 Workshop
CCGrid 2014 Improving I/O Throughput of Scientific Applications using Transparent Parallel Compression Tekin Bicer, Jian Yin and Gagan Agrawal Ohio State.
Unidata’s Common Data Model and the THREDDS Data Server John Caron Unidata/UCAR, Boulder CO Jan 6, 2006 ESIP Winter 2006.
HDF Hierarchical Data Format Nancy Yeager Mike Folk NCSA University of Illinois at Urbana-Champaign, USA
HDF5.
CCGrid, 2012 Supporting User Defined Subsetting and Aggregation over Parallel NetCDF Datasets Yu Su and Gagan Agrawal Department of Computer Science and.
The HDF Group Data Interoperability The HDF Group Staff Sep , 2010HDF/HDF-EOS Workshop XIV1.
Parallel I/O Performance Study and Optimizations with HDF5, A Scientific Data Package MuQun Yang, Christian Chilan, Albert Cheng, Quincey Koziol, Mike.
The HDF Group HDF Group Support for NPP/JPSS Mike Folk, Elena Pourmal, Larry Knox, Albert Cheng The HDF Group DEWG Meeting June 19, 2012.
NetCDF-4: Software Implementing an Enhanced Data Model for the Geosciences Russ Rew, Ed Hartnett, and John Caron UCAR Unidata Program, Boulder
NetCDF and Scientific Data Durability Russ Rew, UCAR Unidata ESIP Federation Summer Meeting
Data File Formats: netCDF by Tom Whittaker University of Wisconsin-Madison SSEC/CIMSS 2009 MUG Meeting June, 2009.
Advances in the NetCDF Data Model, Format, and Software Russ Rew Coauthors: John Caron, Ed Hartnett, Dennis Heimbigner UCAR Unidata December 2010.
SDM Center Parallel I/O Storage Efficient Access Team.
Unidata Technologies Relevant to GO-ESSP: An Update Russ Rew
CF 2.0 Coming Soon? (Climate and Forecast Conventions for netCDF) Ethan Davis ESO Developing Standards - ESIP Summer Mtg 14 July 2015.
Development of a CF Conventions API Russ Rew GO-ESSP Workshop, LLNL
NetCDF: Data Model, Programming Interfaces, Conventions and Format Adapted from Presentations by Russ Rew Unidata Program Center University Corporation.
Update on Unidata Technologies for Data Access Russ Rew
The HDF Group Introduction to HDF5 Session 7 Datatypes 1 Copyright © 2010 The HDF Group. All Rights Reserved.
Unidata Infrastructure for Data Services Russ Rew GO-ESSP Workshop, LLNL
NetCDF Data Model Details Russ Rew, UCAR Unidata NetCDF 2009 Workshop
Copyright © 2010 The HDF Group. All Rights Reserved1 Data Storage and I/O in HDF5.
NetCDF-Java version 2.2 Common Data Model John Caron Unidata/UCAR Dec 10, 2004.
The HDF Group Introduction to HDF5 Session Three HDF5 Software Overview 1 Copyright © 2010 The HDF Group. All Rights Reserved.
Moving from HDF4 to HDF5/netCDF-4
SRNWP Interoperability Workshop
Plans for an Enhanced NetCDF-4 Interface to HDF5 Data
Efficiently serving HDF5 via OPeNDAP
Tad Scheiblich RSI December 2, 2005
What NetCDF users should know about HDF5?
NetCDF and Scientific Data Standard
Status for Endeavor 6: Improved Scientific Data Access Infrastructure
Hierarchical Data Format (HDF) Status Update
ExPLORE Complex Oceanographic Data
NCL variable based on a netCDF variable model
Presentation transcript:

The HDF Group Introduction to netCDF-4 Elena Pourmal The HDF Group 110/17/2015

Overview 10/17/20152 What is netCDF? Data model and formats Ecosystem

WHAT IS NETCDF? 310/17/2015

Background 10/17/20154 Developed and maintained by Unidata Mission: provide data services, tools, and community leadership to advance Earth system science, enhance educational opportunities, and broaden participation Funded by NSF through UCAR Open source software

What is netCDF? 10/17/20155 netCDF = network Common Data Form Data model Scientific data (especially suitable for gridded data) Widely used in ocean and atmospheric science, climate and weather modeling Other disciplines: molecular dynamics, fusion research, medical imaging File FORMAT Portable, self-describing, etc. Not just one file format (netcdf, HDF4, HDF5, OPeNDAP, etc.) Application programming interfaces (APIs) C, Java, C++, Fortran Python, Ruby, Perl, MATLAB, IDL

History of netCDF 10/17/ – first release of netCDF 1991 – netCDF – HDFv 3.3 reads netCDF files 2004 – netCDF 3.6 supports 64-bit offsets 2008 – netCDF-4 based on HDF and later – parallel, OPeNDAP, HDF4

NETCDF DATA MODEL AND FORMATS 710/17/2015

netCDF classic data model 10/17/20158 Variables Name, shape, type N-dim arrays Dimension Name, length Attributes Name, type, value

netCDF classic data model Data model elements: Variables Name, shape, type N-dim arrays Dimensions Name, length Can be shared Attributes Name, type, value Model limitations: Atomic types only One extendible dimension Flat structure Model advantages: Widely adopted Best practices and conventions 10/17/20159

netCDF classic format Pros: Simple (header with metadata, raw data) Suitable for parallel access (pnetcdf from Argonne) Aside: could easily support SWMR Cons: No support for compression No extensibility in multiple dimension XDR-based (big-endian) Costly to add more variables 10/17/201510

netCDF enhanced data model Data model elements: Variables Dimensions Attributes Groups Model limitations: Complex Harder to use and not widely adopted Emerging best practices and conventions Model advantages: Rich collection of datatypes Hierarchical data organization 10/17/201511

Example of ncdump output netcdf OMI-Aura_L2-example { dimensions: PRESSURE = 18 ; DATETIME = 2 ; independent_465 = 465 ; independent_22 = 22 ; independent_7 = 7 ; independent_2 = 2 ; variables: float PRESSURE(PRESSURE) ; PRESSURE:MissingValue = -999.f ; PRESSURE:standard_name = "air_pressure" ; PRESSURE:VAR_NAME = "PRESSURE" ; PRESSURE:VAR_DESCRIPTION = "Pressure at retrieval layer" ; PRESSURE:units = "hPa" ; PRESSURE:valid_range = -Infinityf, Infinityf ; PRESSURE:VAR_DEPEND = "PRESSURE" ; short APrioriCovarianceMatrix(DATETIME, independent_465) ; APrioriCovarianceMatrix:MissingValue = s ; APrioriCovarianceMatrix:UniqueFieldDefinition = "OMI-Specific" ; APrioriCovarianceMatrix:Offset = 0. ; 10/17/201512

Example of h5dump output /PRESSURE Dataset {18/18} Attribute: CLASS scalar Type: 16-byte null-terminated ASCII string Data: "DIMENSION_SCALE" Attribute: MissingValue {1} Type: native float Data: -999 Attribute: NAME scalar Type: 9-byte null-terminated ASCII string Data: "PRESSURE" Attribute: REFERENCE_LIST {9} Type: struct { "dataset" +0 object reference "dimension" +8 native int } 16 bytes Data: (0) {DATASET-1:14192, 1}, {DATASET-1:14192, 2}, {DATASET-1:30064, 1}, {DATASET-1:32126, 1}, {DATASET-1:32472, 1}, {DATASET-1:34237, 1}, (6) {DATASET-1:49807, 1}, {DATASET-1:55542, 1}, {DATASET-1:62105, 1} 10/17/201513

netCDF-4 - netCDF extended format and library netCDF-4 uses HDF5 as a storage layer Pros: Compression Chunking storage Parallel I/O Extensibility (more objects can be added easily) No limitation on number objects Cons: Complex as HDF5 Clunky support for dimensions Not easy to tune for performance since some HDF5 parameters are hidden 10/17/201514

netCDF-4 Architecture 10/17/ HDF5 Library netCDF-4Library netCDF-3 Interface netCDF-3 applications netCDF-3 applications netCDF-4 applications netCDF-4 applications HDF5 applications HDF5 applications netCDF files netCDF files netCDF-4 HDF5 files HDF5 files

NETCDF ECOSYSTEM Real Strength 1610/17/2015

Ecosystem 10/17/201517

Power of CF conventions 10/17/ NPP data visualized with IDV Without CF metadata With CF metadata and dimensions

19

Questions 10/17/ ? Thank you!