Workshop on QC in Derived Data Products, Las Cruces, NM, 31 January 2007
ClimDB/HydroDB Objectives
Don Henshaw

Presentation transcript:

ClimDB/HydroDB Objectives
Don Henshaw

Improve access to long-term collections of climatic and hydrological data
– Long-Term Ecological Research (LTER): 26 NSF-funded sites
– U.S. Forest Service Research Experimental Forests / Experimental Watersheds
Use web technologies to facilitate synthetic research
– Maintain a current data warehouse of multi-site, multi-network, long-term climate and streamflow data
– Provide single-portal accessibility and a query interface to download and graphically display data

ClimDB/HydroDB Harvester / Database / Query Interface

[Diagram: data flow across three tiers: Data Providers, Central Site, Public User]
Diagram labels: Triggers on-demand auto-harvest; HTTP Post; USFS Data Exchange Format; Web Page (display, graph, download); Web Services (SOAP, WSDL); Access Tools (site-specific data mining); Data Warehouse (Centralized ClimDB/HydroDB Database); Harvester; NWS Data; USGS Data; LTER Data; Query interface

ClimDB Harvest File Naming Convention
Example of measurement parameter and associated quality flag names

LTER_Site                       LTER/Research Area site code (3-letter acronym)
Station                         Local site name for the weather station or gauging station
Date                            8-character field (yyyymmdd)
Daily_AirTemp_Mean_C            Mean daily air temperature
Flag_Daily_AirTemp_Mean_C       Data quality flag for mean daily air temperature
Daily_AirTemp_AbsMax_C          Daily absolute maximum air temperature
Flag_Daily_AirTemp_AbsMax_C     Data quality flag for daily absolute maximum air temperature
Daily_AirTemp_AbsMin_C          Daily absolute minimum air temperature
Flag_Daily_AirTemp_AbsMin_C     Data quality flag for daily absolute minimum air temperature
Daily_Precip_Total_mm           Daily total precipitation
Flag_Daily_Precip_Total_mm      Data quality flag for daily total precipitation
Daily_Discharge_Mean_Lps        Mean daily discharge
Flag_Daily_Discharge_Mean_Lps   Data quality flag for mean daily discharge
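The naming convention pairs every measurement column with a flag column of the same name prefixed by Flag_. A minimal Python sketch of building such a header is given here. The helper names flag_column and harvest_header are hypothetical, and both the column order (LTER_Site, Station, Date, then value/flag pairs) and the comma delimiter are assumptions read off the listing, not taken from a ClimDB specification.

```python
# Key fields that lead every harvest record (order assumed from the slide listing).
KEY_FIELDS = ["LTER_Site", "Station", "Date"]


def flag_column(variable: str) -> str:
    """Return the quality-flag column name for a measurement column."""
    return f"Flag_{variable}"


def harvest_header(variables: list[str]) -> list[str]:
    """Build a header in which each variable is directly followed by its flag."""
    header = list(KEY_FIELDS)
    for name in variables:
        header.extend([name, flag_column(name)])
    return header


header = harvest_header(["Daily_AirTemp_Mean_C", "Daily_Precip_Total_mm"])
# The comma delimiter is an assumption made for this illustration only.
print(",".join(header))
```

Generating the flag names from the variable names, instead of typing both, keeps each pair adjacent and identically spelled.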

ClimDB Data Quality Flags

G or blank   Value is a good value (blank is preferred).
E            Value is estimated.
Q            Value is questionable.
M            Value is missing. Leaving the value field null or blank with the data quality flag = “M” is preferred. Assigning the value “9999” to the data field with the data quality flag = “M” is allowed, but not preferred.
T            Trace value (for precipitation only). A value must be assigned to the data field (e.g., assign a zero or 0.1). DO NOT leave the data field null or blank.
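The flag rules can be sketched as a small normalization step applied to each value/flag pair. This is an illustration under stated assumptions, not the harvester's actual code: the function name normalize is hypothetical, raising an error for a blank value with a G, E, or Q flag is an assumption, and so is treating any value that accompanies an "M" flag as missing. Converting a trace flag with a blank value to "M" follows the harvester's WARNING(104) described in this talk.

```python
from typing import Optional, Tuple

VALID_FLAGS = {"G", "E", "Q", "M", "T"}


def normalize(value: str, flag: str, is_precip: bool = False) -> Tuple[Optional[float], str]:
    """Apply the flag rules to one raw (value, flag) pair of text fields."""
    value = value.strip()
    flag = flag.strip() or "G"  # a blank flag means a good value
    if flag not in VALID_FLAGS:
        raise ValueError(f"illegal flag character: {flag!r}")
    if flag == "T":
        if not is_precip:
            # Assumption: reject trace flags on non-precipitation variables.
            raise ValueError("trace flag is for precipitation only")
        if value == "":
            # Trace flag without a value is treated as missing (WARNING 104).
            return None, "M"
        return float(value), "T"
    if flag == "M":
        # Blank is preferred; "9999" is allowed. Assumption: any value is dropped.
        return None, "M"
    if value == "":
        # Assumption: G, E, and Q require a value.
        raise ValueError("value required unless flag is M")
    return float(value), flag


print(normalize("12.5", ""))                  # good value with blank flag
print(normalize("9999", "M"))                 # missing value using the sentinel
print(normalize("", "T", is_precip=True))     # trace flag with no value
```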

Participant Web Page

Duplicate records found

ClimDB General Harvest QA/QC

FATAL ERROR(901): Missing quality assurance flag
– Description: All variables require that a flag_variable directly follow.
FATAL ERROR(906): Duplicate found
– Description: Duplicate record by site, station, parameter, and date.
ERROR(002): Illegal flag character - [flag] not recognized
– Description: Illegal flag. Data point is ignored.
WARNING(100): Unknown Variable
– Description: Variable name is not listed as valid in the central variable database. All values listed for that variable are ignored.
WARNING(101): [variable] = [value] Failed QC test (data limits check)
– Description: Data value fails general data limits check. Data is still accepted.
WARNING(106): Failed (min < mean < max) relationship
– Description: Quality assurance failure. Data record is still accepted.
WARNING(104): Trace value error: Flag = T; data = null. Flag set to 'M'
– Description: Flag indicates trace value. Data point is considered missing.
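Three of these checks can be sketched in Python as follows. The function names, the dictionary keys used for records, and the sample values are hypothetical, and the sketch assumes the first three header columns are the key fields. It illustrates the rules as worded on the slide and is not the harvester's implementation.

```python
from collections import Counter


def missing_flag_columns(header):
    """FATAL ERROR(901): list variables not directly followed by Flag_<variable>."""
    columns = header[3:]  # assumption: LTER_Site, Station, Date come first
    problems = []
    i = 0
    while i < len(columns):
        name = columns[i]
        if name.startswith("Flag_"):
            i += 1  # stray flag column; out of scope for this check
            continue
        if i + 1 < len(columns) and columns[i + 1] == f"Flag_{name}":
            i += 2  # variable and its flag form a valid pair
        else:
            problems.append(name)
            i += 1
    return problems


def duplicate_keys(records):
    """FATAL ERROR(906): keys (site, station, parameter, date) seen more than once."""
    counts = Counter(
        (r["site"], r["station"], r["parameter"], r["date"]) for r in records
    )
    return [key for key, n in counts.items() if n > 1]


def fails_min_mean_max(minimum, mean, maximum):
    """WARNING(106): True when min < mean < max does not hold (record still accepted)."""
    return not (minimum < mean < maximum)


# Hypothetical sample data for illustration.
header = ["LTER_Site", "Station", "Date",
          "Daily_AirTemp_Mean_C", "Flag_Daily_AirTemp_Mean_C",
          "Daily_Precip_Total_mm"]
print(missing_flag_columns(header))
```

The two fatal checks return the offending items so a caller can report them all at once; the warning check only reports a condition, since the slide states that the record is still accepted.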

Data Warehouse Content

Parameter (daily values)    % by measured parameter
Stream Discharge            29
Precipitation               26
Air Temperature             22
Relative Humidity           4
Global Radiation            4
Soil Temperature            3
Resultant Wind Speed        3
Resultant Wind Direction    2
Other                       7

Legend: Primary emphasis; Secondary emphasis

Observations: Coverage of precipitation, discharge, and air temperature data is strong across sites. We encourage sites to contribute relative humidity, soil temperature, wind speed and direction, and global radiation in datasets.

ClimDB Temporal Coverage – LTER Sites, August 2006
[Figure: air temperature and precipitation record length by site. Labels: sites (85%), sites (81%), sites (54%); 20 years, 15 years, 10 years]

HydroDB Temporal Coverage – 28 Sites, August 2006
[Figure: streamflow record length. Labels: USGS, Small watersheds]

Characterization of quality flags in ClimDB
LTER only (no USFS-only and no USGS data)

Columns: Flag, # Values, % of Total, # Absent Values, % All Missing
Null or “G”ood: 1,199,440 4,141, %
“E”stimated: 145, %
“M”issing: 553, % 507, %
“Q”uestionable: 17, %
“T”race: 10, %
Total: 6,068, %

Characterization of quality flags in ClimDB
All Data: LTER, USFS, and USGS

Columns: Flag, # Values, % of Total, # Absent Values, % All Missing
Null or “G”ood: 1,781,391 4,655, %
“E”stimated: 178, %
“M”issing: 671, % 604, %
“Q”uestionable: 19, %
“T”race: 13, %
Total: 7,318, %

# Precip Values: 1,344,951
% Trace flag: 1.02%

Data Acquisition Download or Graphical Display

Data Acquisition

Metadata Reports
Detailed information for the general site, all stations, and all parameters. Metadata descriptions can also be downloaded as a PDF.

Air Temperature Instrumentation Metadata

ClimDB Improvements/Issues

Designate metadata attributes for describing QA procedures, or for describing missing or questionable data problems
Tally and list the number of records in monthly and annual aggregations. Optionally include questionable data?
Output EML specific to each data download of a derived data product
Develop web services to accommodate CUAHSI or other standard interfaces
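The proposal to tally records in monthly aggregations, with questionable data optionally included, could look like the following Python sketch. The function name monthly_summary, the row layout, and the choice to always include estimated ("E") values are assumptions made for illustration; the slide states the goal only and does not prescribe a method.

```python
from collections import defaultdict


def monthly_summary(rows, include_questionable=False):
    """Aggregate daily rows of (date 'yyyymmdd', value or None, flag) by month.

    Returns {yyyymm: {"mean": ..., "n_records": ...}} so that each monthly
    value is reported together with the number of daily records behind it.
    """
    accepted = {"G", "", "E"}  # assumption: good, blank, and estimated count
    if include_questionable:
        accepted = accepted | {"Q"}
    totals = defaultdict(lambda: [0.0, 0])  # month -> [sum, count]
    for date, value, flag in rows:
        if value is None or flag not in accepted:
            continue  # missing or excluded values do not contribute
        month = date[:6]
        totals[month][0] += value
        totals[month][1] += 1
    return {
        month: {"mean": total / n, "n_records": n}
        for month, (total, n) in totals.items()
    }


# Hypothetical daily rows for illustration.
rows = [
    ("20060801", 10.0, "G"),
    ("20060802", 20.0, ""),
    ("20060803", 30.0, "Q"),
    ("20060804", None, "M"),
    ("20060901", 5.0, "E"),
]
print(monthly_summary(rows))
print(monthly_summary(rows, include_questionable=True))
```

Reporting n_records beside each mean lets a user judge how complete a month is before using the aggregate.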