ESIP Air Quality Jan Air Quality Cluster Air Quality Cluster Technology Track Earth Science Information Partners Partners NASA NOAA EPA (?) USGS DOE NSF Industry… Data Flow & Interoperability in DataFed Service-based AQ Analysis System R. B. Husar, S. R. Falke and K. Höijärvi Washington University, St. Louis, MO ESIP Federation Winter Meeting 2006 Washington, DC, January 4, 2006
ESIP Air Quality Jan DataFed in a Nutshell DataFed Vision Aid air quality analysis by effective use of relevant data DataFed Goals Facilitate access and flow of AQ data from provider to users Support user-driven data processing value chains Participate in specific application projects Approach: Mediation Between Users and Data Providers DataFed assumes spontaneous, autonomous data providers Non-intrusively wraps datasets for access by web services Mediates, homogenizes data views. e.g. geo-spatial, time... Applications Browsers and analysis tools for distributed monitoring data Serve as data gateway for user programs; GIS, science tools DataFed is focused on the mediation of air quality data
ESIP Air Quality Jan Typical DataFed AQ Analysis Tools ConsolesConsoles: Data from diverse sources are displayed to create a rich context for exploration and analysis CATTCATT: Combined Aerosol Trajectory Tool for the browsing backtrajectories for specified chemical conditions Tool Viewer: Viewer: General purpose spatio-temporal data browser and view editor applicable for all DataFed datasets
ESIP Air Quality Jan Web Services: Building Blocks of DataFed Programming Access, Process, Render Data by Service Chaining [Station] [Monitor] Clickable monitor locations layer Map boundary layer
ESIP Air Quality Jan Single Data Model for All AQ Data Most Views are slices through a cube of data organized by lat, lon, altitude, and time (X,Y,Z,T) Multidimensional Data Cube
ESIP Air Quality Jan OGC Abstract Specification – Coverage (00-106)Coverage (00-106) Abstract Spec.: Coverages model and visualize spatial relationships between, and the spatial distribution of, earth phenomena Coverage subtypes of Air Quality interest are Image, Grid and Discrete Point Has a property (Coverage_Function) that has a spatial domain, and a value set. For Image and Grid, the Coverage_Function is an array…Discrete Point coverage is described by a PointC_Function where the spatial domain consists of a collection of points Observation: The Abstract Spec. foresaw the need for Point Coverage, i.e. fixed monitoring Stations
ESIP Air Quality Jan OGC Web Coverage Service (WCS) Specification HTTP GET/POST based interfaces Services have XML service descriptions (“Capabilities”, “Description”) Filter parameters allow selection of subsets of source data Output formats advertised by each service instance OGC WCS getCoverage Schema Suitable for wrapping with SOAP envelope, WSDL access, loose coupling WCS is for "coverages" – information representing space-time-varying phenomena WCS describes, requests and delivers coverages in spatio-temporal domain WCS version 1.1 is limited to grids/"simple” coverages with homogeneous range sets
ESIP Air Quality Jan through Data Access through Adapters DataFed SOAP,HTTP Get OGC WCS HTTP Get, Post OGC WMS HTTP Get Station-Point SQL Server, Files… Sequence Image, file nDim Grid OpenDAP NetCDF, … Other Traject., Event, Pic Sources Diverse formats Many data models Data Wrapper Data into geo-cubes Queries to views Virtual Data Cube Global geo-cube data model Makes queries data-neutral Others? e.g. OpenDAP Output Protocol dependent User specified GeoTable CSV,XLS,GML GeoGrid GML,NetCDF.. GeoImage GeoTIFF, PNG.. Other MS Dataset.. Query Adapter Maps query to protocol User selects protocols
ESIP Air Quality Jan OGC WCS Demonstration: AirNOW Station DatasetAirNOW Map View Services WCS Query s&REQUEST=GetCoverage&VERSION=1.0.0&CRS=EPSG:432 6&COVERAGE=AIRNOW.pmfine&FORMAT=CSV&BBOX= , , , ,0,0&TIME= T15:00:00Z&WIDTH=999&HEIGHT=999&DEPTH=999 Formats: CSV, NetCDF-table, etc Time View Services =wcs&REQUEST=GetCoverage&VERSION=1.0.0&CRS=EP SG:4326&COVERAGE=AIRNOW.pmfine&FORMAT=CSV&B BOX= , , , ,0,0&TIME= T00:00:00Z/ T00:00:00Z&WIDTH=999&HEIGHT=999&DEPTH=999 Formats: CSV, NetCDF-table, etc
ESIP Air Quality Jan Data Access Services 1. Click Services Button 4. Set Output Format 3. Access Settings 2. Select Data Access Protocol 4. Execute WCS Query in Browser or in Program 5. Consume data by program, service or viewer
ESIP Air Quality Jan GALEON Interoperability Experiment GALEON Geo-interface for Atmosphere, Land, Earth, and Ocean netCDF Lead by UNIDATA & OGCUNIDATAOGC Participants: academia, industry, international Unify Earth Science & GIS Data Flows B. Domenico GALEON UNIDATA
ESIP Air Quality Jan GALEON: Earth Science – GIS Interoperability Shared Data Model
ESIP Air Quality Jan OGC WCS Demonstration: THREDDS_GFS 4Dim DatasetTHREDDS_GFS Lat/Lon Box Elev Range Time Range Map: BBOX=-180,-90,180,90, 1350,1350& TIME= / /PT3H Time: BBOX=-34,49.05,-34,49.05, 1350,1350& TIME= / /PT3H Elev: BBOX=-34,49.05,-34,49.05, 0,18000 & TIME= / /PT3H The form of the WCS query is the same for all slices through the data cube (views) The only difference in the views is the thickness of the slices in each dimension Return grid is in multiple formats (NetCDF, CSV, GML, PNG, … ) Map View Services WCS Query Time View Services WCS Query Elevation View Services WCS Query WCS Query
ESIP Air Quality Jan OGC WCS Demonstration: Grid, Image, Station Data Types Coverage=THEEDDS.T& BBOX=-126,24,-65,52,0,0 &TIME= / &FORMAT=NetCDF Coverage=SURF.Bext& BBOX=-126,24,-65,52,0,0 &TIME= / &FORMAT=NetCDF-table Coverage=SEAW.Refl& BBOX=-126,24,-65,52,0,0 &TIME= / &FORMAT=GeoTIFF COVERAGE=sst& BBOX=-126,24,-65,52,0,0 &TIME= , &FORMAT=NetCDF UNIDATA – THREDDS/GALEON WCS THREDDS/GALEON WCS DataFed GALEON WCS U Florence, It GALEON WCS DataFed GALEON WCS Grid Image Station Services WCS Query Services WCS Query Services WCS Query Services WCS Query
ESIP Air Quality Jan Benefits of WMS Compatibility: Geospatial One-Stop Provides Access to DataFed spatial layers through WMS Problem: No Time Dim! DataFed/VIEWS Asian Dust Concentration (circles) S. Falke
ESIP Air Quality Jan Summary OGC WCS protocol is suitable for delivering most air quality data: Suitable for most data view queries BBOX, TIME Station-Point SQL Server, Files… Sequence Image, file nDim Grid OpenDAP NetCDF, … Applicable to most data types Station, Image, Grid For Air Quality data needs: WCS: Add Point coverage to WCS (in addition to grid, polygon) WCS: Strongly typed (XML Schema-verifiable) return data types Testing: Much fixing and testing for server-client compatibility