Update on Unidata Technologies for Data Access Russ Rew 2011-05-10.

Slides:



Advertisements
Similar presentations
1 NASA CEOP Status & Demo CEOS WGISS-25 Sanya, China February 27, 2008 Yonsook Enloe.
Advertisements

A Draft Standard for the CF Metadata Conventions Cheryl Craig and Russ Rew UCAR.
Recent Work in Progress
The Model Output Interoperability Experiment in the Gulf of Maine: A Success Story Made Possible By CF, NcML, NetCDF-Java and THREDDS Rich Signell (USGS,
DMAC ST and the Activities of the IOOS PO Derrick Snowden DMAC Steering Team
Integrating NOAA’s Unified Access Framework in GEOSS: Making Earth Observation data easier to access and use Matt Austin NOAA Technology Planning and Integration.
Streaming NetCDF John Caron July What does NetCDF do for you? Data Storage: machine-, OS-, compiler-independent Standard API (Application Programming.
® OGC Web Services Initiative, Phase 9 (OWS-9): Innovations Thread - OPeNDAP James Gallagher and Nathan Potter, OPeNDAP © 2012 Open Geospatial Consortium.
THREDDS, CDM, OPeNDAP, netCDF and Related Conventions John Caron Unidata/UCAR Sep 2007.
7 +/- 2 Maybe Good Ideas John Caron June (1) NetCDF-Java (aka CDM) has lots of functionality, but only available in Java – NcML Aggregation – Access.
The Future of NetCDF Russ Rew UCAR Unidata Program Center Acknowledgments: John Caron, Ed Hartnett, NASA’s Earth Science Technology Office, National Science.
Introduction to NetCDF Russ Rew, UCAR Unidata ICTP Advanced School on High Performance and Grid Computing 13 April 2011.
Unidata TDS Workshop THREDDS Data Server Overview October 2014.
Status of netCDF-3, netCDF-4, and CF Conventions Russ Rew Community Standards for Unstructured Grids Workshop, Boulder
Session 2: Using OPeNDAP-enabled Applications to Access Australian Data Services and Repositories eResearch Australasia 2011, ½ Day Morning Workshop, Thursday.
Quick Unidata Overview NetCDF Workshop 25 October 2012 Russ Rew.
NetCDF and HDF5 Ed Hartnett, Unidata/UCAR, Unidata Mission: To provide the data services, tools, and cyberinfrastructure leadership that advance.
Implementation of Model Data Interoperability for IOOS: Successes and Lessons Learned Rich Signell USGS Woods Hole, MA / NOAA Silver Spring USA Model Data.
NetCDF for Developers and Data Providers Russ Rew, UCAR Unidata ICTP Advanced School on High Performance and Grid Computing 14 April 2011.
Unidata’s TDS Workshop TDS Overview – Part II October 2012.
DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.
2 3 ROMS/COAWST NcML file 4 5 Exploiting IOOS: A Distributed, Standards-Based Framework and Software Stack for Searching, Accessing, Analyzing and.
Unidata TDS Workshop TDS Overview – Part I XX-XX October 2014.
Unidata’s Common Data Model John Caron Unidata/UCAR Nov 2006.
IOOS Data Management Integration Standards Plans in the Northeast Eric Bridger (GMRI) Sep
THREDDS Data Server Ethan Davis GEOSS Climate Workshop 23 September 2011.
Weathertop Consulting, LLC Wednesday, January 14, 2009 IIPS 11A.2 1 A General Purpose System for Server-side Analysis of Earth Science Data Roland Schweitzer.
U.S. Department of the Interior U.S. Geological Survey Management of Oceanographic time-series data at the Woods Hole Coastal and Marine Science Center.
NcML Aggregation vs Feature Collections. NcML functionality 1.Modify the objects found in CDM files – Especially Attributes – Don’t have to rewrite the.
Unidata’s TDS Workshop TDS Overview – Part II Unidata July 2011.
Mid-Course Review: NetCDF in the Current Proposal Period Russ Rew
Accomplishments and Remaining Challenges: THREDDS Data Server and Common Data Model Ethan Davis Unidata Policy Committee Meeting May 2011.
The netCDF-4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012.
THREDDS Data Server Unidata’s Common Data Model Background / Summary John Caron Unidata/UCAR Mar 2007.
DAP4 James Gallagher & Ethan Davis OPeNDAP and Unidata.
Unidata TDS Workshop THREDDS Data Server Overview
1 NASA CEOP Status & Demo CEOS WGISS-24 Oberpfaffenhofen, Germany October 15, 2007 Yonsook Enloe.
Recent developments with the THREDDS Data Server (TDS) and related Tools: covering TDS, NCML, WCS, forecast aggregation and not including stuff covered.
NetCDF Data Model Issues Russ Rew, UCAR Unidata NetCDF 2010 Workshop
Unidata’s Common Data Model and the THREDDS Data Server John Caron Unidata/UCAR, Boulder CO Jan 6, 2006 ESIP Winter 2006.
IOOS Data Services with the THREDDS Data Server Rich Signell USGS, Woods Hole IOOS DMAC Workshop Silver Spring Sep 10, 2013 Rich Signell USGS, Woods Hole.
THREDDS Catalogs Ethan Davis UCAR/Unidata NASA ESDSWG Standards Process Group meeting, 17 July 2007.
Unidata’s TDS Workshop TDS Overview – Part I July 2011.
Remote Data Access with OPeNDAP Dr. Dennis Heimbigner Unidata netCDF Workshop October 25, 2012.
The Unified Access Framework for Gridded Data … the 1 st year focus of NOAA’s Global Earth Observation Integrated Data Environment (GEO-IDE) Steve Hankin,
The HDF Group Data Interoperability The HDF Group Staff Sep , 2010HDF/HDF-EOS Workshop XIV1.
The HDF Group Introduction to netCDF-4 Elena Pourmal The HDF Group 110/17/2015.
Unidata's Involvement in Developing and Supporting Climate Science Infrastructure Russ Rew UCAR Unidata April 2010.
Information Technology: GrADS INTEGRATED USER INTERFACE Maps, Charts, Animations Expressions, Functions of Original Variables General slices of { 4D Grids.
NetCDF-4: Software Implementing an Enhanced Data Model for the Geosciences Russ Rew, Ed Hartnett, and John Caron UCAR Unidata Program, Boulder
NetCDF and Scientific Data Durability Russ Rew, UCAR Unidata ESIP Federation Summer Meeting
GrADS-DODS Server An open-source tool for distributed data access and analysis Joe Wielgosz, Brian Doty, Jennifer Adams COLA/IGES - Calverton, MD
Advances in the NetCDF Data Model, Format, and Software Russ Rew Coauthors: John Caron, Ed Hartnett, Dennis Heimbigner UCAR Unidata December 2010.
Convergence And Trust in Earth and Space Science Data Systems Ted Habermann, NOAA National Geophysical Data Center Documentation: It’s not just discovery...
A Draft Standard for the CF Metadata Conventions Russ Rew, Unidata GO-ESSP 2009 Workshop
Unidata Technologies Relevant to GO-ESSP: An Update Russ Rew
OGC Web Services with complex data Stephen Pascoe How OGC Web Services relate to GML Application Schema.
1 Earth System Grid Center for Enabling Technologies OPeNDAP Services for ESG March 9, 2016 Peter Fox, Patrick West, Stephan Zednik RPI Performance Measures.
Rich Signell Roland Viger Curtis Price USGS Community for Data Integration Feb 15, 2012.
1 2.5 DISTRIBUTED DATA INTEGRATION WTF-CEOP (WGISS Test Facility for CEOP) May 2007 Yonsook Enloe (NASA/SGT) Chris Lynnes (NASA)
NetCDF: Data Model, Programming Interfaces, Conventions and Format Adapted from Presentations by Russ Rew Unidata Program Center University Corporation.
Unidata Infrastructure for Data Services Russ Rew GO-ESSP Workshop, LLNL
NetCDF Data Model Details Russ Rew, UCAR Unidata NetCDF 2009 Workshop
NetCDF-Java version 2.2 Common Data Model John Caron Unidata/UCAR Dec 10, 2004.
NetCDF 3.6: What’s New Russ Rew
Plans for an Enhanced NetCDF-4 Interface to HDF5 Data
Efficiently serving HDF5 via OPeNDAP
Status for Endeavor 6: Improved Scientific Data Access Infrastructure
Accessing Remote Datasets through the netCDF interface.
OPeNDAP/Hyrax Interfaces
Presentation transcript:

Update on Unidata Technologies for Data Access Russ Rew

2 Overview Standards endorsements C-based netCDF NetCDF/OPeNDAP integration NetCDF-Java and Unidata Common Data Model TDS: THREDDS Data Server Libcf: Gridspec Enhanced transparency of Unidata open- source development

Standards endorsements Standards-based data interoperability has typically meant compliance to ISO and OGC standards Now, netCDF also qualifies:  2009: NASA ESDS standard for earth science data  2010: US FGDC endorsed netCDF-3 and netCDF-4 as “Common Encoding Standards”  2010: NASA ESDS endorsed CF Metadata Conventions as a standard for earth science data  2011: TDS includes ncISO services providing ISO metadata (from NOAA/NGDC)  2011: OGC approved netCDF as a core binary encoding standard Additional standards work is underway for netCDF-4, NcML, CF conventions

C-based netCDF libraries OPeNDAP client integration Refactored library architecture for maintainability, other formats, protocols Improved performance Generic tools

NetCDF/OPeNDAP integration a success OPeNDAP access from C-based clients  Supports subset access of selected variables, subarrays  Subset access using DAP can be much faster than whole file access, such as with FTP Benefits from remote access  NcML aggregation  accessing data in other formats  netCDF subset service from remote servers DAP security authorization  Earth System Grid  HTTP

C library refactored for interoperability ApplicationnetCDFlibdispatch libsrc (classic) libsrc4 (netCDF-4) libncdap3 (OPeNDAP ) libncdap4 (OPeNDAP) other handlers … netCDF classic netCDF-4netCDF-4 HDF4, HDF5 (read only) remote subsets (read only) remote remote remote other “formats” Common interfaces and code factored into a new “dispatch layer” Simpler code Easier maintenance Easier handling of additional formats & protocols (I/O plugins)

Performance improvements Refactored read code for large speedup on opening netCDF-4 files with compressed or chunked variables Speedup variable and dimension lookup by name Improved memory allocation to reduce memory footprint Reduced memory when parallel I/O used Improved read code for handling a large number of netCDF-4 attributes and variables Applied intelligent caching to remote access for OPeNDAP client

Generic tools Adapted generic tools to netCDF-4 enhanced data model ncdump: converts netCDF data to CDL text form ncgen: converts CDL text to netCDF data or generates program nccopy: copies, converts, compresses, chunks netCDF data Proved practicality of more complex data model

Practical experience with netCDF-4 Most experience to date is with netCDF-4 classic model format  uses netCDF-3 classic data model, APIs  uses netCDF-4 HDF5-based format  provides backward compatibility  Enables performance features: compression, multidimensional tiling (chunking), efficient schema changes, parallel I/O, … Adoption proceeding in a 3-step process 1. Relink applications with netCDF-4 library 2. Continue use of classic model, netCDF-3 APIs but with netCDF- 4 classic model format to get performance benefits 3. Make use of features of enhanced model, as needed/supported

Status of netCDF-4 adoption: Logjam appears to be broken NetCDF-4 classic model support in analysis and visualization apps: IDL, GrADS, CDAT, MATLAB, IDV, NCO, NCL, CDO, PyNGL, ncview, Panoply, Ferret, OGC WMS and WCS clients Data providers using netCDF-4 classic model format for transparent compression and chunking: within NASA, NOAA, GFDL, COLA, … NetCDF-4 enhanced model support in language APIs: C, Java (read only), C++ (beta), Fortran Partial support for netCDF-4 enhanced model also in NCO, NCL, Panoply, Python API, …

NetCDF-Java and Common Data Model NetCDF-Java provides one of the few general- purpose CF metadata implementations Observational data implementation supporting CF Discrete Sampling Geometry conventions now complete for CF-1.6 addition Recent improvements include improved GRIB collection as alternative to FMRC aggregations Caching of Grid dataset info and File System info GRIB processing correctly handles time interval variables

THREDDS Data Server (TDS) Catalog and data access services  OPeNDAP  OGC web services: WCS and WMS  HTTP  NetCDF subset service in coordinate space  NetCDF API access to other formats (GRIB, HDF, …) Recent improvements  FMRC aggregation configuration simplified and speeded up  ncWMS integration improved, Godiva2 browser supported as client  ncISO metadata services  OPeNDAP handling of ESG, HTTP access control  Subsetting services for Point Observations extended to Station data Plans  GRIB-2 refactor for efficiency  Improve performance of large collections and aggregations  Experimental asynchronous / streaming / remote access services (cdmRemote)

Libcf: Gridspec additions Subject of Alex Pletzer’s talk on Thursday morning Proposed CF extension for complex grids, such as cubed-sphere grid Specifies multi-file implementation of host file, grid files, data files, and mosaic files Supports regridding

Transparency in development process Source code repository open: svn.unidata.ucar.edu/ Bug and issue tracking available: Development plans and schedules more visible Unidata developers posting semi-regularly to blog: These services are experimental, subject to change Goals  Make participation in open-source development practical  Leverage community efforts