AVHRR 1 km Data Set Consolidation Consolidating 30 years of EO data hosted by multiple data holders 28 th LTDP WG Meeting Frascati (ESA/ESRIN) 04–05 November.

Slides:



Advertisements
Similar presentations
ESA Data Integration Application Open Grid Services for Earth Observation Luigi Fusco, Pedro Gonçalves.
Advertisements

Slide: 1 ROSA GRAS Meeting February 2009 Matera, Italy User Services EUMETSAT EUMETSAT Data Access & User Support.
Integrating NOAA’s Unified Access Framework in GEOSS: Making Earth Observation data easier to access and use Matt Austin NOAA Technology Planning and Integration.
Metadata and DRInet Jake Carlson Purdue University Libraries November 5, 2009.
BADC Workshop 2: BADC Services to Data Suppliers Royal Met. Soc. Conference – 14 September 2005 Ag Stephens et al.
PV2013 Summary Results Data Stewardship Interest Group WGISS-37 Meeting Cocoa Beach (Florida-US) - April 14-18, 2014.
NOAA Metadata Update Ted Habermann. NOAA EDMC Documentation Directive This Procedural Directive establishes 1) a metadata content standard (International.
Obtaining MISR Data and Information Jeff Walter Atmospheric Science Data Center April 17, 2009.
Status of upgrading CDI service (user interface, harvesting via GeoNetwork, CDI interoperability options following SeaDataNet D8.7) By Dick M.A. Schaap.
Coordinated Energy and water-cycle Observations Peroject A Well Organized Data Archive System Data Integrating/Archiving Center at University of Tokyo.
ESA UNCLASSIFIED – For Official Use Data Stewardship Interest Group WGISS-39 Meeting Data Purge Alert Procedure Tsukuba, Japan – May, 2015 Mirko.
AON Data Questionnaire Results 21 Respondents Last Updated 27 March 2007 First AON PI Meeting Scot Loehrer, Jim Moore.
AERONET Web Data Access and Relational Database David Giles Science Systems and Applications, Inc. NASA Goddard Space Flight Center.
, Implementing GIS for Expanded Data Accessibility and Discoverability ASDC Introduction The Atmospheric Science Data Center (ASDC) at NASA Langley Research.
ENEON first workshop Observing Europe: Networking the Earth Observation Networks in Europe September, Paris Summary on data availability, sharing,
Unidata TDS Workshop TDS Overview – Part I XX-XX October 2014.
Mapping between SOS standard specifications and INSPIRE legislation. Relationship between SOS and D2.9 Matthes Rieke, Dr. Albert Remke (m.rieke,
ATMOSPHERIC SCIENCE DATA CENTER ‘Best’ Practices for Aggregating Subset Results from Archived Datasets Walter E. Baskin 1, Jennifer Perez 2 (1) Science.
Y.J. Meijer, GECA, II GALION WS, ESA’s Generic Environment for Calibration/validation Analysis (GECA) Yasjka Meijer et al. European Space Agency.
EARTH SCIENCE MARKUP LANGUAGE Why do you need it? How can it help you? INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
Metadata Strategy Case Study Bill Rosenblatt GiantSteps Media Technology Strategies (212)
U.S. Department of the Interior U.S. Geological Survey Access to MODIS Land Data Products Through the Land Processes DAAC John Dwyer and Carolyn Gacke,
Planetary Science Archive PSA User Group Meeting #1 PSA UG #1  July 2 - 3, 2013  ESAC PSA Archiving Standards.
Metadata Implementation Ted Habermann NOAA National Geophysical Data Center Documentation: It’s not just discovery... 50% change in global average Why?
Integrated Grid workflow for mesoscale weather modeling and visualization Zhizhin, M., A. Polyakov, D. Medvedev, A. Poyda, S. Berezin Space Research Institute.
Purpose and objectives: AVHRR LAC expert meeting Bojan R. Bojkov Head, Sensor Performance, Products and Algorithms Directorate for Earth Observation ESA/ESRIN.
Documentation from NcML to ISO Ted Habermann, NOAA NESDIS NGDC.
ESA UNCLASSIFIED – For Official Use ESA AVHRR Data Holdings and Ongoing Curation Activities Mirko Albani, Sergio Folco.
ESIP Federation 2004 : L.B.Pham S. Berrick, L. Pham, G. Leptoukh, Z. Liu, H. Rui, S. Shen, W. Teng, T. Zhu NASA Goddard Earth Sciences (GES) Data & Information.
Adoption of RDA-DFT Terminology and Data Model to the Description and Structuring of Atmospheric Data Aaron Addison, Rudolf Husar, Cynthia Hudson-Vitale.
Using the Global Change Master Directory (GCMD) to Promote and Discover ESIP Data, Services, and Climate Visualizations Presented by GCMD Staff January.
Adoption of RDA-DFT Terminology and Data Model to the Description and Structuring of Atmospheric Data Aaron Addison, Rudolf Husar, Cynthia Hudson-Vitale.
Where Should the GALION Data Reside? Centrally or Distributed? Introduction to the Discussion Fiebig, M.; Fahre Vik, A. Norwegian Institute for Air Research.
Page 1 CSISS Center for Spatial Information Science and Systems Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Application Profile.
1 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION NCEI-IOOS Project Updates Mathew Biddle May 28th, 2015 IOOS DMAC Meeting, IOOS.
NQuery: A Network-enabled Data-based Query Tool for Multi-disciplinary Earth-science Datasets John R. Osborne.
Documenting UAF Data Ted Habermann NOAA/NESDIS/National Geophysical Data Center.
s Donna J. Scott, Marilyn Kaminski, Jason Wolfe, Terry Haran NSIDC's MODIS Snow and Sea Ice Products NSIDC provides a suite.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
Interoperability = Leverage + Collaboration  Chris Lynnes  GES DISC.
TSS Database Inventory. CIRA has… Received and imported the 2002 and 2018 modeling data Decided to initially store only IMPROVE site-specific data Decided.
National Aeronautics and Space Administration Aquarius Validation Data System Overview and Status Algorithm Workshop March 21-22, 2007 John Gunn Earth.
GENESI-DR: Ground European Network for Earth Science Interoperations - Digital Repositories Grant agreement no: EGEE 4 th.
EO Dataset Preservation Workflow Data Stewardship Interest Group WGISS-37 Meeting Cocoa Beach (Florida-US) - April 14-18, 2014.
Standard Metadata in Scientific Data Formats September 19, 2007 Flash at:
ADPS Science Software Development Bryan Franz NASA Ocean Biology Processing Group Aquarius Data Processing Workshop, NASA/GSFC, March 2007.
Open Grid Services for Earth Observation Pedro Gonçalves.
IPCC WG II + III Requirements for AR5 Data Management GO-ESSP Meeting, Paris, Michael Lautenschlager, Hans Luthardt World Data Center Climate.
Zou Ziming 1 Ma Wenzhen Li Lei Zhao Hua Wang Chi 1: Center for Space Science and Applied Research Chinese Academy of Sciences Moscow ·
CLASS Metadata and Remote Sensing Extensions CLASS Data Provider’s Conference September 2005 Anna Milan, Ted.Habermann,
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
1 2.5 DISTRIBUTED DATA INTEGRATION WTF-CEOP (WGISS Test Facility for CEOP) May 2007 Yonsook Enloe (NASA/SGT) Chris Lynnes (NASA)
GHRSST Science Team Meeting st – 5 th June, 2009, Santa Rosa ESA L2P Operational Service Nigel Houghton ESA / ESRIN NRT and Archive L2P Processing.
LP DAAC Overview – Land Processes Distributed Active Archive Center Chris Doescher LP DAAC Project Manager (605) Chris Torbert.
CEOS Working Group on Information System and Services (WGISS) Data Access Infrastructure and Interoperability Standards Andrew Mitchell - NASA Goddard.
New Australian High Resolution AVHRR SST Products from the Integrated Marine Observing System Presented at the GHRSST Users Symposium, Santa Rosa, USA,
NASA Earth Science Data Stewardship
NSIDC DAAC UWG Meeting August 9-10 Boulder, CO
Synthetic Data and Data Formats for the GPM GMI Radiometer
Implementing the Data Management Principles Opportunities and Advantages Robert R. Downs, PhD Sr. Digital Archivist, CIESIN, Columbia University.
GSICS Collaboration Servers a Vehicle for International Collaboration Status 2011 Peter Miu EUMETSAT.
Data Flows in ACTRIS: Considerations for Planning the Future
WGISS Connected Data Assets
Data CeNtre Updates GEO visit 02 July 2015.
Flanders Marine Institute (VLIZ)
Analysis Ready Data ..
WGISS-41: IDN Report Michael Morahan CEOS WGISS-41 Meeting
WGISS Connected Data Assets April 9, 2018 Yonsook Enloe
Prepared by: Jennifer Saleem Arrigo, Program Manager
WGISS Connected Data Assets Oct 24, 2018 Yonsook Enloe
Presentation transcript:

AVHRR 1 km Data Set Consolidation Consolidating 30 years of EO data hosted by multiple data holders 28 th LTDP WG Meeting Frascati (ESA/ESRIN) 04–05 November 2014

Feasibility Study Inter-agency activity - Consolidate and make available AVHRR 1 km (LAC) data held by data centers in Europe and Canada Feasibility Study:

User Requirements Capture Survey distributed to selected users to capture user scenarios and user requirements for Data and metadata – format, structure, content Preservation and usability - data records, information & tools Discovery and retrieval Exploitation tools & infreastructure

User Requirements Capture Respondents Brockmann Consult, Germany Norwegian Meteorological Institute, Norway University of Berne, Switzerland Eidgenössische Technische Hochschule Zurich, Switzerland Helmholtz-Zentrum Geesthacht, Germany Remote Sensing Centre of Institute of Geodesy and Cartography, Warsaw, Poland DLR, Germany ESA CCI

User Requirements – Summary 1 Data Level L0 to L2 for user access Standard formats: NetCDF, HRPT, HDF, ASCII, ESA Beam compatible Consistent across satellite generations, quality flags Metadata Product metadata: OGC-EOP Following CF (Climate and Forecast) standard Keep inside in the hdf5/netcdf4 Collection metadata: ISO and INSPIRE compliant Controlled volcabularies

User Requirements – Summary 2 Preservation and usability Data records Preserve HRPT raw & auxiliary data Reprocess as calibration data or method changes Keep some (tbd.) generations of the previous data sets Complete datasets Consistent versioning / numbering scheme Referenced by DOI Associated knowledge Information: NOAA docs, provenance, quality - available to user Tools: processing chain, visualization, and analysis HW & OS independent, maintain libraries, open source preferred Test suite

User Requirements – Summary 3 Discovery Centralized online portal with advanced browse capability Searchable by: satellite name, time frame, spatial domain, orbit number, product version Standardized interfaces, machine-harvestable catalog (OAI-PMH, OGC CSW) Retrieval Same data format and access interfaces/protocols if data held in distributed achives Automated download using data retrieval script Request for data streams instead of download - OpENDAP preferred Time series retrieval of stacked products for a single pixel (e.g. ASCII) Direct download of bulk data Data ordering not favored Distribution option physical media

User Requirements – Summary 4 Exploitation tools & infrastructure Tools: processing chain, visualization, and analysis Exploration and exploitation capabilities provided by ESA desirable Possibility to integrate other data sources beyond EO May not be necessary if OPeNDAP access is provided

Summary There is a demand for a complete, consistent, calibrated time series Good agreement on data format: NetCDF, CF Diverse requirements for processing level Central discovery & access Procedures, provenance, etc. must be documented and available

Questions? Tyler Christensen DLR - German Remote Sensing Data Center Information Technology

Backup Slides

Analysis DLR and Dundee Inventories Number of NOAA AVHRR 1 km scenes (passes) available at DLR and at Dundee Satellite Receiving Station. Duplicate scenes have been removed for the analysis. The purple line indicates the approximate number of scenes required to ensure an adequate spatial and temporal coverage for the AOI of the TIMELINE project (6 passes x 24 hrs x 365 days). Year Number of scenes (orbits) AOI of the DLR TIMELINE project

Details of User Requirements

User Requirements - 1 User Scenarios Users will process raw data into their own interpreted products: land cover, snow extent, water body extent, NDVI, fires and burned area, etc. Some users want higher-level AVHRR products (e.g. snow, surface temperature) that they can compare to their own model outputs Consolidated data set would be used to extend existing analyses in space and/or time, sometimes using existing processing chains—so data format will be very important

User Requirements - 2 General Requirements for Consolidated Dataset All levels requested, from L0 all the way to L2 interpreted products o Some users need data as raw as possible—no atmospheric correction, individual swaths not stitched, not reprojected, etc. o Some users need fully processed data—mosaics, georeferenced, time-aggregated (daily) Must include quality flags & pixel-center location grids Must be internally consistent (i.e. corrections applied between different satellites for a smooth time series), but should be fully documented Ancillary data: ozone, water vapor, aerosol properties if available

User Requirements - 3 Data - format, structure, content Format Most data users requested NetCDF, one said HDF would also be OK. A few also required the raw and L1b data in original HRPT format. Perhaps also ASCII, especially for a time-series of an individual pixel. Readable with ESA BEAM toolbox. Structure Descriptive, searchable file names—sensor, dates, mission, etc. NetCDF files should have metadata headers—some requested that metadata follow the Climate & Forecast (CF) convention. Content Separate scientific data sets (SDS) for reflectances, brightness temperatures, acquisition angles, scanline times, and NWP data used for atmospheric correction. Requested in metadata header: long_names, grid information, standard_name

User Requirements - 4 Metadata (collection, product) - format, structure, content Format CF (Climate and Forecast standard) metadata inside NetCDF file Collection metadata: ISO and INSPIRE compliant Product metadata: OGC-EOP Keep inside in the hdf5/netcdf4 Structure NetCDF Discovery Attribute Convention Content Follow CF-conventions for netCDF data Utilize controlled vocabularies (C&F standard names, GCMD Science Keywords) Sensor, platform, temporal and spatial coverage, calibration information, applied processing chain, ancillary data and parameters used

User Requirements - 5 Preservation and Usability Needs Reprocess dataset whenever the calibration data or the calibration method changes Complete datasets Any changes to the dataset need to be carefully documented and the information needs to be made available to the user – via the metadata or an easily and permanently accessible document. Datasets should be referenced by DOI A consistent versioning / numbering scheme is required Some (tbd.) generations of the previous data sets should be kept

User Requirements - 6 Data Record Needs Original HRPT raw & auxiliary data must be kept internally, to allow for future reprocessing as improved algorithms, new data formats, or different intercalibrations emerge. Metadata and browse images should also be kept. L1b data & L2 products– possibly several generations of re-processed data Must be kept in standardized data formats

User Requirements - 7 Additional Information and Tools to be Preserved NOAA documentation (e.g. POES user guide) should be available and linked to data. Data processing documentation should be kept: procedures for aggregation, temporal averaging, inter-calibration, etc. Information on provenance should be archived: platforms, sensor characteristics of each platform, calibration updates etc. If possible all software required to generate each version of the product should be maintained: calibration coefficients, NWP data, tle information, etc. Quality assessments, validation and inter-comparison reports Data use policies Data format descriptions

User Requirements - 8 Needs for Software and Tools Software content to preserve: processing chain software, visualization tools, data readers for HRPT and other formats, and BEAM tools (i.e. data readers, visualization, processing, analysis) Also keep copies of libraries that the software uses, and/or compile the software against static libraries so it would be usable even if newer library versions are not compatible. Try not to depend on specific hardware or OS; use standard software platforms and compilers. Need a test suite. Open source preferred.

User Requirements - 9 Discovery Needs Centralized online portal with quick looks and advanced browse capability, searchable by: satellite name, time frame, spatial domain, orbit number, product version A catalogue interface with machine-harvestable metadata would be useful. One user states that OAI-PMH is preferred over OGC CSW due to the simplicity in the interface. Should be no access difference between historic and current missions Data policies should be aligned

User Requirements - 9 Data Retrieval Needs 1 Automated download using a data retrieval script should be possible Data sets can be hosted in separate archives, however, the same data format and access interfaces/protocols (FTP, WCS, other?) should be used by all providers Request for data streams which users could attach a processing chain to, instead of downloading a local copy of the dataset OpENDAP access is preferred. Direct download of bulk data using shell scripts, query specifying file name pattern, etc. Preference for selectable times/locations and 'mission-at-once'.

User Requirements - 10 Data Retrieval Needs 2 Standardized formats and interfaces/transfer protocols in case data need to be transferred to third party processing infrastructure Retrieve a time series of stacked products for a single pixel. Data ordering is acceptable but not preferred. Distribution on physical media (as opposed to online download) is also requested.

User Requirements - 11 Exploitation Infrastructure Needs (hosted processing) Most users don’t think this is necessary, especially if OPeNDAP access is provided. One user requested exploration and exploitation capabilities provided by ESA, with a possibility to integrate with other data sources beyond EO. Would require a collaborative effort to build virtual data exploitation platforms. Exploration and exploitation capabilities provided by ESA desirable

User Requirements - 12 Needs for Current (vs. historic) AVHRR Data New data are as important as the historical archive. Most users update their products regularly, and would like to have daily updates as new data become available. Some users do not need updated data, if they are doing retrospective analysis. New data should have exactly the same format as archived data.

User Requirements - 13 Time Series Needs Most users will download the data to a local copy, and then perform their own time series analysis. Request to receive a time series for a single pixel as a text file. Remote data processing and/or OpENDAP access would reduce the need for users to download large amounts of data for the whole time series.