THREDDS, CDM, OPeNDAP, netCDF and Related Conventions John Caron Unidata/UCAR Sep 2007.

Slides:



Advertisements
Similar presentations
1 NASA CEOP Status & Demo CEOS WGISS-25 Sanya, China February 27, 2008 Yonsook Enloe.
Advertisements

Complex Scientific Analytics in Earth Science at Extreme Scale John Caron University Corporation for Atmospheric Research Boulder, CO Oct 6, 2010.
Reading HDF family of formats via NetCDF-Java / CDM
Recent Work in Progress
The Model Output Interoperability Experiment in the Gulf of Maine: A Success Story Made Possible By CF, NcML, NetCDF-Java and THREDDS Rich Signell (USGS,
1 OGC Web Services Kai Lin San Diego Supercomputer Center
Integrating NOAA’s Unified Access Framework in GEOSS: Making Earth Observation data easier to access and use Matt Austin NOAA Technology Planning and Integration.
A Common Data Model In the Middle Tier Enabling Data Access in Workflows … HDF/HDF-EOS Workshop XIV September 29, 2010 Doug Lindholm Laboratory for Atmospheric.
Streaming NetCDF John Caron July What does NetCDF do for you? Data Storage: machine-, OS-, compiler-independent Standard API (Application Programming.
® OGC Web Services Initiative, Phase 9 (OWS-9): Innovations Thread - OPeNDAP James Gallagher and Nathan Potter, OPeNDAP © 2012 Open Geospatial Consortium.
7 +/- 2 Maybe Good Ideas John Caron June (1) NetCDF-Java (aka CDM) has lots of functionality, but only available in Java – NcML Aggregation – Access.
The Future of NetCDF Russ Rew UCAR Unidata Program Center Acknowledgments: John Caron, Ed Hartnett, NASA’s Earth Science Technology Office, National Science.
Best Practices to Promote Data Interoperability Chris Lynnes Joe Glassy Technology Infusion Working Group.
NextGen Network-Enabled Weather (NNEW) Concepts Aaron Braeckel.
Unidata TDS Workshop THREDDS Data Server Overview October 2014.
THREDDS Data Server, OGC WCS, CRS, and CF Ethan Davis UCAR Unidata 2008 GO-ESSP, Seattle.
John Caron Unidata October 2012
OPeNDAP and the Data Access Protocol (DAP) Original version by Dave Fulker.
Quick Unidata Overview NetCDF Workshop 25 October 2012 Russ Rew.
GADS: A Web Service for accessing large environmental data sets Jon Blower, Keith Haines, Adit Santokhee Reading e-Science Centre University of Reading.
Implementation of Model Data Interoperability for IOOS: Successes and Lessons Learned Rich Signell USGS Woods Hole, MA / NOAA Silver Spring USA Model Data.
Unidata’s TDS Workshop TDS Overview – Part II October 2012.
Feature Collections Subsetting 1. Overview 2. NCSS 2.1. Dataset description 2.2. Grid requests 2.3. Grid as point requests 3. CdmrFeature.
Unidata TDS Workshop TDS Overview – Part I XX-XX October 2014.
Unidata’s Common Data Model John Caron Unidata/UCAR Nov 2006.
THREDDS Data Server Ethan Davis GEOSS Climate Workshop 23 September 2011.
NetCDF-Java Overview John Caron Oct 29, Contents Data Models / Shared Dimensions Coordinate Systems Feature Types NetCDF Markup Language (NcML)
NcML Aggregation vs Feature Collections. NcML functionality 1.Modify the objects found in CDM files – Especially Attributes – Don’t have to rewrite the.
Unidata’s TDS Workshop TDS Overview – Part II Unidata July 2011.
Mid-Course Review: NetCDF in the Current Proposal Period Russ Rew
Tools in Support of a National DMAC Derrick Snowden NERACOOS/ODP Annual Meeting 26 Sep 2012.
DELIVERING ENVIRONMENTAL WEB SERVICES (DEWS) Partners: UK Met Office (Lead Partner), British Atmospheric Data Centre (BADC), British Maritime Technology.
Accomplishments and Remaining Challenges: THREDDS Data Server and Common Data Model Ethan Davis Unidata Policy Committee Meeting May 2011.
The netCDF-4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012.
THREDDS Data Server Unidata’s Common Data Model Background / Summary John Caron Unidata/UCAR Mar 2007.
1 International Standards for Data Interoperability GALEON Geo-interface for Air, Environment, Land, Ocean NetCDF Ben Domenico Unidata Program Center*
1 NOAA IOOS Program Data Integration Framework (DIF) Project Overview Adapted from a brief to the NOAA Data Management Committee August 6, 2008 by Jeff.
DAP4 James Gallagher & Ethan Davis OPeNDAP and Unidata.
Opendap dev - meeting, Boulder, Feb 2007 OPeNDAP infrastructure in European Operational Oceanography T Loubrieu (IFREMER) T Jolibois (CLS)
Unidata TDS Workshop THREDDS Data Server Overview
Recent developments with the THREDDS Data Server (TDS) and related Tools: covering TDS, NCML, WCS, forecast aggregation and not including stuff covered.
Unidata’s Common Data Model and the THREDDS Data Server John Caron Unidata/UCAR, Boulder CO Jan 6, 2006 ESIP Winter 2006.
IOOS Data Services with the THREDDS Data Server Rich Signell USGS, Woods Hole IOOS DMAC Workshop Silver Spring Sep 10, 2013 Rich Signell USGS, Woods Hole.
THREDDS Catalogs Ethan Davis UCAR/Unidata NASA ESDSWG Standards Process Group meeting, 17 July 2007.
Unidata’s TDS Workshop TDS Overview – Part I July 2011.
DAP Servers and Services Section 2 APAC ‘07 OPeNDAP Workshop 12 Oct 2007 James Gallagher Thanks to Jennifer Adams, John Caron, Roberto De Almeida, Nathan.
The HDF Group Data Interoperability The HDF Group Staff Sep , 2010HDF/HDF-EOS Workshop XIV1.
Unidata’s Common Data Model and NetCDF Java Library API Overview John Caron Unidata/UCAR Nov 2008.
Unidata's Involvement in Developing and Supporting Climate Science Infrastructure Russ Rew UCAR Unidata April 2010.
NetCDF-4: Software Implementing an Enhanced Data Model for the Geosciences Russ Rew, Ed Hartnett, and John Caron UCAR Unidata Program, Boulder
NetCDF and Scientific Data Durability Russ Rew, UCAR Unidata ESIP Federation Summer Meeting
GIS for Atmospheric Sciences and Hydrology By David R. Maidment University of Texas at Austin National Center for Atmospheric Research, 6 July 2005.
Weathertop Consulting, LLC Server-side OPeNDAP Analysis – Concrete steps toward a generalized framework via a reference implementation using F-TDS Roland.
Grids and Beyond: netCDF-CF and ISO/OGC Features and Coverages Ethan Davis, John Caron, Ben Domenico UCAR/Unidata AMS IIPS, 23 January 2008.
Data Stewardship at the NOAA Data Centers Sub Topic - Value Added Products ESIP Federation Meeting, Washington, DC January 6-8, 2009.
Convergence And Trust in Earth and Space Science Data Systems Ted Habermann, NOAA National Geophysical Data Center Documentation: It’s not just discovery...
Common Data Model Scientific Feature Types John Caron UCAR/Unidata July 8, 2008.
Unidata Technologies Relevant to GO-ESSP: An Update Russ Rew
1 2.5 DISTRIBUTED DATA INTEGRATION WTF-CEOP (WGISS Test Facility for CEOP) May 2007 Yonsook Enloe (NASA/SGT) Chris Lynnes (NASA)
NetCDF: Data Model, Programming Interfaces, Conventions and Format Adapted from Presentations by Russ Rew Unidata Program Center University Corporation.
Interoperability Day Introduction Standards-based Web Services Interfaces to Existing Atmospheric/Oceanographic Data Systems Ben Domenico Unidata Program.
Update on Unidata Technologies for Data Access Russ Rew
THREDDS Data Server (TDS) and Data Discovery John Caron Unidata/UCAR May 15, 2006.
Unidata Infrastructure for Data Services Russ Rew GO-ESSP Workshop, LLNL
NetCDF-Java version 2.2 Common Data Model John Caron Unidata/UCAR Dec 10, 2004.
Data Are from Mars, Tools Are from Venus
What is NetCDF ? And what are its plans for world domination?
Recent Work in Progress
Remote Data Access Update
OPeNDAP/Hyrax Interfaces
Presentation transcript:

THREDDS, CDM, OPeNDAP, netCDF and Related Conventions John Caron Unidata/UCAR Sep 2007

Contents Overview THREDDS Data Server Unidata’s Common Data Model

2) Server 0) Client 1) Request 3) Response

0) The Client What functionality is needed? 1.Scientific User –Raw data –Drill down to arbitrary detail 2.Decision Support –“best effort” Visualization –operational

1) The Request What functionality is possible? Analogous to SQL language for RDBMS Implies a Data Model OGC vs File access APIs –NetCDF/OPeNDAP/HDF5 : index space –WXS : coordinate space Higher semantic level trumps if no significant extra cost. –File APIs become implementation, not interface

1) WCS Request Functionality –Subsetting (bounding box, time range, variable) –Optional reprojection/resample Variants: KML/XML/SOAP+XML/REST Optional Functionality : 42 flavors Bad news for interoperability Is there an elephant to dictate a standards? –Eg IBM chose SQL/Relational model (1984)

2) Server How do I serve my data? Do I need specialized personnel? –$$, resource consumption, core competency What are the common requests? –(that I should optimize for)?

3) Response What comes back? Has to be a representation of the “answer” in the Data Model WCS allows anything –Cant write a generic client Communities will form around a small number of variants –No elephants in sight

3) Response : XML vs. binary Extensibility vs. Efficiency Binary: netCDF/GeoTIFF/HDF/etc –reflect favorite formats of committee members –Different data models : ideally need a formal mapping (but there arent any yet) –Domain experts can make use GML closely follows the OGC/ISO data model (WFS requires GML)

3) Response : XML vs. binary GML is waaaay too complex –Ambitious –OGC/ISO models are complex –Reality is complex –XML Schema is a disaster Google KML –“visualization format” not “data storage”

HTTP Tomcat Server THREDDS Data Server Datasets catalog.xml motherlode.ucar.edu THREDDS Server NetCDF-Java library Application IDD Data HTTPServer NetcdfSubset WCS OPeNDAP configCatalog.xml

THREDDS Catalogs XML over HTTP Hierarchical listing of online resources (datasets) Container for arbitrary search metadata –Standard set maps to DC, GCMD, ADN –Unidata/NCAR-CDP Metadata can be inherited Design goal: Make it easy for data providers TDS uses extended version for configuration Data Access URLS –“Crossing the protocol boundary”

THREDDS OPeNDAP Server OPenDAP is protocol for remote access to CDM Current version 2.0; NASA ESE standard –Working on new 4.0 protocol spec Based on Java-OPeNDAP library –shared development by Unidata/opendap.org Any CDM dataset can be served Server4 (Hyrax): –latest version of opendap.org C++ library –THREDDS Catalogs replace dods_dir

THREDDS WCS service CDM files that have Grid coordinate system –evenly spaced x,y Allow to subset the dataset by: –Lat/lon or projection bounding box –time and vertical coordinate range –list of Variables Return formats –GeoTIFF floating point, grayscale –NetCDF/CF-1.0 No reprojections, resamplings Uses WCS 1.0, work on WCS 1.1 in progress

NetCDF Subset Service Experiment with REST style web service Allow to subset the dataset by: –Lat/lon bounding box –time and vertical coordinate range –list of Variables NetCDF/CF, XML, CSV (spreadsheet) Gridded Data –Output is a CF-1.0 netCDF file –Variation of WCS (simplified request protocol) Grid as Point Datasets (experimental) –Extract vertical profile, time series from one point in model data Station Data: metars (7 day rolling archive)

HTTP Tomcat Server Common Data Model catalog.xml hostname.edu THREDDS Server Application NetCDF-Java library IDD Data HTTPServer NetcdfSubset WCS OPeNDAP Then a miracle happens Datasets

NetcdfDataset Application Scientific Datatypes NetCDF-Java architecture OPeNDAP THREDDS Catalog.xml NetCDF-3 HDF5 I/O service provider GRIB GINI NIDS NetcdfFile NetCDF-4 … Nexrad DMSP CoordSystem Builder Datatype Adapter NcML

Common Data Model File Formats General: NetCDF, HDF5, OPeNDAP Gridded: GRIB-1, GRIB-2 Radar: NEXRAD level II and level III, DORADE, Chinese NEXRAD Point: BUFR Satellite: DMSP, GINI In Progress: NetCDF4, McIdas AREA, NPOESS, NOAA CLASS legacy files, Barrowdale DataBlade, others

Coordinate Systems Common Data Model Layers Data Access Scientific Datatypes Grid Point Radial Trajectory Swath StationProfile

Common Data Model (Data Access Layer)

Coordinate Systems UML

NetCDF-4 file format NetCDF-4 C library –4.0 Beta implements CDM access layer Persistence format for complete CDM –4.1: adding Coordinate Systems Optional layer, focus on CF-1 (libcf) –4.?: merge OPeNDAP access NetCDF-Java library will read, maybe write

TDS / NcML aggregation

Forecast Model Run Collection (FMRC)

Scientific DataTypes Based on datasets Unidata is familiar with –APIs are evolving How are data points connected? Intended to scale to large, multifile collections Intended to support “specialized queries” –Space, Time Intend to create “standard” NetCDF file encoding conventions

Scientific DataTypes Grids –Structured –Swath –Unstructured Point Observation –Unconnected –Station / Time Series –Trajectory –Profile Radial

Climate and Forecast (CF) Conventions Conventions for encoding coordinate systems, other semantics in netCDF Working for 10 years –Version 1.0 in 2003 –Good for gridded data Current working goups –Point/Station/Trajectory/Profile observations –CRS (map to OGC) Governance in place Volunteer: motivated, practical, real

Summary: Unidata’s directions Client: both Scientific User and Decision Support Request in coordinate space –WCS is fine, not a big architectural decision Server: TDS –Files in native format, augmented by indexing/DB Response: netCDF/CF and GeoTIFF/KML or WMS/JPEG