The Unified Access Framework (UAF) Philosophy, progress, and plans DAARWG Meeting, Seattle, Nov. 2011 Steve Hankin (PMEL), Kevin O’Brien (PMEL/JISAO),

Presentation transcript:

The Unified Access Framework (UAF): Philosophy, progress, and plans. DAARWG Meeting, Seattle, Nov. 2011. Steve Hankin (PMEL), Kevin O’Brien (PMEL/JISAO), and the NOAA UAF team. Rich Signell

Review … continuing into new material Nov NOAA/UAF 2

GEO-IDE: a strategy for enterprise-wide integration of NOAA data (i.e. adopting standards and practices to achieve data interoperability). pgs (by “DMIT”). Why is this a hard problem?

NOAA-world: weather forecast (time critical); fisheries management (regulatory concerns); nautical charting; climate, ocean, atmosphere research; response and restoration … the list goes on …

Different disciplines have different concepts of ‘data’. Each develops solutions that make sense to them. → Getting people (and organizations) to change habits is difficult! (And data management has often been an afterthought.)

The accepted approach: build a “system of systems”. Wrap existing systems with loosely coupled, standardized services → a Service Oriented Architecture. The GEO-IDE Con-ops outlines such a plan. How to build it with a largely volunteer team?

Tried and true approach … 1. Generate use cases 2. Define requirements 3. Write a Concept of Operations 4. … and an Implementation Plan 5. Assemble (volunteer) teams to implement. NOT!

An alternative (‘agile’) approach: Don't Solve Problems -- Copy Success --

Why ‘agile’ is attractive. Because inevitably: funding is much smaller than needed; collaborations are more difficult than anticipated; infrastructure is being built on a background of rapidly evolving technology.

Why ‘agile’ is attractive: infrastructure is being built on a background of rapidly evolving technology. Change equals risk. Mitigate risk by following a strategy of incremental change that serves users (increasingly) well at every stage of evolution. “Visualize a decade away. Build things that work today.”

Agile Principles (condensed from the ‘Agile Manifesto’): Working software is the meaningful measure of progress: ensure early, continuous, frequent releases. Emphasize simplicity: maximize the work not done. Build projects around motivated individuals; give them the support they need and entrust them. The best architectures, requirements, and designs emerge from self-organizing teams. Welcome changing requirements. Scientists (‘business people’) and developers must work together on a daily basis.

Lemma: “Don’t let perfect be the enemy of good”

What “success” did UAF choose to copy? Year 1 focused on gridded datasets. Projects: (too many to name). Data formats: netCDF, GRIB, HDF. Service stack: netCDF-CF-DAP-THREDDS (WMS). Applications: Matlab, ArcGIS, Ferret, GrADS, Google Earth, IDV, LAS, ERDDAP, … Users: (too many to name) …

Who is providing data this way? Modelers (AR4&5, GFDL, NCAR, …); satellite programs (GHRSST, PathFinder, CoastWatch, …); NCEP weather and ocean forecasts (GRIB files served via NOMADS); coastal (“HF”) radar; a growing list of observation programs (Argo, OceanSites, tide gauges, …). Adoption by OGC is well underway.

How to reach users (without downloading files)? Through their preferred tools.

Desktop access in Matlab. Model 1: UMASS-ECOM; Model 2: UMAINE-POM; Data: SST 2008-Sep-08 07:32

Access in ArcGIS using the Environmental Data Connector (EDC)

Desktop access in Ferret

Desktop access in GrADS

UAF home page. Instructions for end users: how to access data through their favorite applications. UAF is experimenting to see how documentation may be shared by other projects …

Under the hood: projects with data to provide make their data available as netCDF-CF (or other TDS-compatible format) and host a THREDDS / OPeNDAP server. Let's look at the tools we have to link projects together …

UAF “network topology”: a tree defined in THREDDS (XML). Nodes shown: GEO-IDE/UAF, NOMADS, NOAA, NOAA Affiliated, NMFS, OAR, NWS, NESDIS, NODC, NGDC, GFDL, PMEL, AOML, OCO, PFEG, NDBC, ESRL, Coastwatch, IOOS National Partners, IOOS Regional Partners, NAVO, AOOS, NANOOS, CENCOOS, SCCOOS, PACIOOS, GLOS, NERACOOS, MACOORA, SECOORA, CARICOOS, GCOOS.

Not so fast, kiddo! Compliance with CF conventions is inconsistent; files commonly are not aggregated into logical datasets; metadata are often in need of enhancement. Perspective: this is not an unusual situation. Standards compliance problems are *the norm*. Divergent dialects often pile up (e.g. GRIB, BUFR). UAF tools offer a solution …

‘NcML’ can be used to repair problems (*), e.g. improve CF compliance by adding a “standard_name” attribute to GRIB data. The file itself is untouched. The ‘virtual file’ seen through the services conforms to standards. (*) IOServiceProvider modules are also important … not discussed here.
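As a rough illustration of the NcML repair described on this slide, the sketch below uses Python's standard library to generate a small NcML wrapper that adds a `standard_name` attribute to one variable. The file name `sst_forecast.grib2` and the variable name `Temperature_surface` are hypothetical placeholders, not taken from the presentation. The original file is never modified, only wrapped.

```python
import xml.etree.ElementTree as ET

# NcML 2.2 namespace used by the THREDDS Data Server / netCDF-Java.
NCML_NS = "http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2"


def ncml_add_standard_name(location, variable, standard_name):
    """Return NcML text that wraps `location` and adds a CF standard_name
    attribute to `variable`. The underlying file is left untouched."""
    ET.register_namespace("", NCML_NS)
    root = ET.Element(f"{{{NCML_NS}}}netcdf", {"location": location})
    var = ET.SubElement(root, f"{{{NCML_NS}}}variable", {"name": variable})
    ET.SubElement(
        var,
        f"{{{NCML_NS}}}attribute",
        {"name": "standard_name", "value": standard_name},
    )
    return ET.tostring(root, encoding="unicode")


# Hypothetical file and variable names, for illustration only.
ncml = ncml_add_standard_name(
    "sst_forecast.grib2", "Temperature_surface", "sea_surface_temperature"
)
print(ncml)
```

A server configured with such a wrapper would present the ‘virtual file’ with the added attribute, which is the behaviour the slide describes.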

NcML for aggregation, e.g. aggregate three 1-year files of the same (say) model run with an <aggregation> element. A long time series ‘virtual file’ is seen through the services.
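The NcML snippet from this slide did not survive in the transcript. As a hedged stand-in, the sketch below generates a `joinExisting` aggregation along a `time` dimension for three yearly files. The file names and the dimension name are assumptions chosen for illustration.

```python
import xml.etree.ElementTree as ET

NCML_NS = "http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2"


def ncml_join_existing(locations, dim_name="time"):
    """Return NcML text that joins several files along an existing
    dimension, so that they appear as one long 'virtual file'."""
    ET.register_namespace("", NCML_NS)
    root = ET.Element(f"{{{NCML_NS}}}netcdf")
    agg = ET.SubElement(
        root,
        f"{{{NCML_NS}}}aggregation",
        {"dimName": dim_name, "type": "joinExisting"},
    )
    for loc in locations:
        ET.SubElement(agg, f"{{{NCML_NS}}}netcdf", {"location": loc})
    return ET.tostring(root, encoding="unicode")


# Three hypothetical 1-year files from the same model run.
yearly_files = ["run_2008.nc", "run_2009.nc", "run_2010.nc"]
aggregation_ncml = ncml_join_existing(yearly_files)
print(aggregation_ncml)
```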

UAF Team members are helping data managers, person-to-person, to improve the data services from their projects (a gradual, but important process). In parallel …

Developing the UAF Catalog Cleaner (a ‘web crawler’). [Diagram: the UAF ‘RAW’ catalog is transformed into the UAF ‘CLEAN’ catalog. Both are trees over the same nodes: NOMADS, NOAA, NOAA Affiliated, NMFS, OAR, NWS, NESDIS, NODC, NGDC, GFDL, PMEL, AOML, OCO, PFEG, NDBC, ESRL, Coastwatch, IOOS National Partners, IOOS Regional Partners, NAVO, AOOS, NANOOS, CENCOOS, SCCOOS, PACIOOS, GLOS, NERACOOS, MACOORA, SECOORA, CARICOOS, GCOOS.]

The Catalog Cleaner: 1. Crawl the raw catalog. 2. Extract metadata from the files themselves, and from THREDDS, into a relational database. 3. Process the database to detect aggregations, etc. 4. Create new THREDDS XML that is aggregated and metadata-cleaned.
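The slides do not show how the Catalog Cleaner works internally, so the following is only a toy sketch of steps 1 and 3 under simplifying assumptions: it walks an inline THREDDS catalog (a made-up example, not a real UAF catalog) and guesses at aggregation candidates by grouping dataset paths that differ only in a four-digit year. The real tool extracts metadata from the files into a relational database, which this sketch does not attempt.

```python
import re
import xml.etree.ElementTree as ET
from collections import defaultdict

THREDDS_NS = "http://www.unidata.ucar.edu/namespaces/thredds/InvCatalog/v1.0"

# Hypothetical raw catalog, inlined so the example needs no network access.
RAW_CATALOG = """<catalog
    xmlns="http://www.unidata.ucar.edu/namespaces/thredds/InvCatalog/v1.0"
    name="hypothetical raw catalog">
  <dataset name="model_run">
    <dataset name="run_2008.nc" urlPath="model/run_2008.nc"/>
    <dataset name="run_2009.nc" urlPath="model/run_2009.nc"/>
    <dataset name="run_2010.nc" urlPath="model/run_2010.nc"/>
  </dataset>
  <dataset name="bathymetry.nc" urlPath="static/bathymetry.nc"/>
</catalog>"""


def crawl(catalog_xml):
    """Step 1 (simplified): list the urlPath of every accessible dataset."""
    root = ET.fromstring(catalog_xml)
    return [
        ds.get("urlPath")
        for ds in root.iter(f"{{{THREDDS_NS}}}dataset")
        if ds.get("urlPath")
    ]


def detect_aggregations(url_paths):
    """Step 3 (simplified, assumed heuristic): group paths that are
    identical once any four-digit run of digits is masked out."""
    groups = defaultdict(list)
    for path in url_paths:
        groups[re.sub(r"\d{4}", "YYYY", path)].append(path)
    return {key: sorted(paths) for key, paths in groups.items() if len(paths) > 1}


paths = crawl(RAW_CATALOG)
candidates = detect_aggregations(paths)
print(candidates)
```

A name-based heuristic like this would miss or mis-group many real datasets; it is shown only to make the crawl-then-group idea concrete.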

‘09 Carbon Tracker files from ESRL: ‘raw catalog’ snippet. Optional documentation (beyond what is inside the file). Services: usually OPeNDAP, often WMS, sometimes more. Optional viewers.

‘clean catalog’ -- same data, but … augmented with uniform services, uniform viewers, and improved metadata.

Uniform services: ISO-standard metadata; metadata quality assessment; simple interface to get a subset.

Godiva2 uses the WMS map service (as do other GIS apps)

Google Earth; property-property plots; sections and Hovmöllers; vector plots; segue to desktop tools; line plots; animations; differencing; analyses.

… and ERDDAP provides …

REST URL access to data subsets in several formats, e.g. .kml, .mat, .nc (accessible through home-grown scripting of many types).
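To make the “home-grown scripting” point concrete, here is a minimal sketch that builds an ERDDAP griddap subset URL. The server address, dataset ID, and variable name are hypothetical, and the sketch assumes the variable's dimensions are ordered (time, latitude, longitude); a real dataset may differ. Changing the file extension in the URL is what selects the output format.

```python
def griddap_url(base, dataset_id, file_type, variable, time, lat_range, lon_range):
    """Build an ERDDAP griddap request for one time step and a lat/lon box.

    Values in parentheses are coordinate values rather than array indices.
    Assumes the variable is dimensioned (time, latitude, longitude).
    """
    query = (
        f"{variable}"
        f"[({time})]"
        f"[({lat_range[0]}):({lat_range[1]})]"
        f"[({lon_range[0]}):({lon_range[1]})]"
    )
    return f"{base}/griddap/{dataset_id}.{file_type}?{query}"


# Hypothetical server, dataset, and variable; the same query in two formats.
for file_type in ("nc", "mat"):
    print(
        griddap_url(
            "https://example.org/erddap",
            "hypothetical_sst",
            file_type,
            "sst",
            "2011-11-01T00:00:00Z",
            (30, 50),
            (-140, -120),
        )
    )
```

Some HTTP clients require the square brackets to be percent-encoded before the request is sent.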

Data Discovery, ISO-standard metadata: crawl the clean catalog. Create ISO metadata.

Agile principle: “Maximize the work not done”. Who has already built reasonably mature discovery portals (preferably free)? Unidata’s RAMADDA; GI-CAT (able to crawl THREDDS catalog); Geoportal (able to harvest ISO metadata).

Data Discovery using ESRI Geoportal: NODC is leading the UAF Geoportal investigations

Data Discovery using ESRI Geoportal: enter search terms

Data Discovery using ESRI Geoportal: search results

Data Discovery using ESRI Geoportal: refined search, new results

Data Discovery using ESRI Geoportal: expand selection

Data Discovery using ESRI Geoportal: available services

Available services: view metadata details about the dataset

Available services: direct link into the THREDDS Data Server

Available services: this segue to LAS demonstrates how Geoportal can be trained to recognize a uniform set of UAF viewers

Full disclosure: Geoportal is very powerful, but … difficult to set up; immature user interface.

In situ observations: are characterized by their ‘feature type’ (a time series, a track line, a vertical profile, …); individual feature instances are degenerate (1D) grids; the UAF framework just described already handles individual 1D features; many data providers are creating such files (Argo, OceanSites, …), and growth is robust; but users want to interact with collections of features; standards and practices are in flux.

Collections of in situ observations. Candidates: 1. CF ‘Discrete Geometries’ (Chapter 9 of CF 1.6, May 2011): the ‘natural’ (?) extension for UAF, but … a new API yet to be released (TDS 4.3); Java support only; few tools available.

Collections of in situ observations. Candidates: 2. ‘DAPPER’ (OPeNDAP Sequences): solid technology; good prototype tools and examples already years ago, but … poor standardization of semantics (conventions for lat, long, time, …); flagging momentum …

Collections of in situ observations. Candidates: 3. OGC Web Feature Service (WFS): much momentum in the GIS community, but … by design little standardization of semantics; communities must develop ‘application schemas’; success rates poor, momentum flagging …

Collections of in situ observations. Candidates: 4. OGC Sensor Observation Service (SOS): significant momentum in the earth science community, but … only the request protocol is standardized; many choices for semantics: “SWE common”, KML, custom ASCII/JSON/etc.; few tools, interoperability concerns.

Collections of in situ observations. What to do? Agile philosophy: maximize the work not done; don’t let perfect be the enemy of good. UAF isn’t resourced to undertake R&D ($ + risk).

Collections of in situ observations. UAF approach: 1. Explore the potentials of ERDDAP. It provides IOOS-compatible SOS; ‘TableDAP’ and other REST outputs; commitment to support many protocols. 2. Where data are in databases: install ERDDAP, e.g. OSMC (in situ ocean obs). 3. Prototype TDS 4.3 / CF Discrete Geometries. Utilize from ERDDAP and LAS (ArcGIS soon after?).

Collections of in situ observations. UAF approach: 4. Vigorously promote 1D CF features. Here’s why: available today, understood by data providers, momentum exists; compatible with UAF tools and practices; provides immediate value to users.

Collections of in situ observations. UAF approach: 4. Vigorously promote 1D CF features → efforts will pay off (low risk): gets the data ‘out there’ (open, available); metadata-rich; discoverable through standard frameworks. Future TDS versions will offer aggregation → miracles will happen for free: SOS, improved discovery, etc.

To increase data integration, what question should we be asking? Is it: “How do I build a system that will […]?” Or is it: “How do we most rapidly narrow the gap between what we know technology is capable of delivering, and what our actual practices are today?” What is the right balance of strategies: ‘Design and build’ vs. ‘Agile’?

Discussion