Data and meta-data exploration and data quality reporting for GCOS

Slides:



Advertisements
Similar presentations
Meteorological Observatory Lindenberg – Richard Assmann Observatory The GCOS Reference Upper Air Network.
Advertisements

The GCOS Reference Upper Air Network (GRUAN): creating an Arctic/Antarctic mirror Greg Bodeker GRUAN co-chair Antarctica New Zealand Annual Science Conference.
Global Climate Observing System (GCOS) including GRUAN Greg Bodeker Bodeker Scientific, Alexandra, New Zealand Presented at the 9 th Ozone Research Managers.
1 Chapter 2: Product Development Process and Organization Introduction Importance of human resources: Most companies have similar technology resources.
© 2014 Cogentus Consulting Ltd Smart Decisions End-to-end problem solving.
Data Mining: A Closer Look
© 2004 Keynote Systems Customer Experience Management (CEM) Bonny Brown, Ph.D. Director, Research & Public Services.
Surveys Survey Builder Survey Data Capture Web Access: e-surveys.
Section 28.2 Types, Trends, and Limitations of Marketing Research
Mantova 18/10/2002 "A Roadmap to New Product Development" Supporting Innovation Through The NPD Process and the Creation of Spin-off Companies.
Meteorological Observatory Lindenberg – Richard Assmann Observatory The GCOS Reference Upper Air Network.
4.04 Understand marketing- research activities to show command of their nature and scope.
Journal Writing. What is Journal Writing Journal writing is a learning tool based on the ideas that students write to learn. Students use the journals.
Outlook WEB How to access your Brevard County Schools Outlook from any Internet-based PC.
GOES Users’ Conference III May 10-13, 2004 Broomfield, CO Prepared by Integrated Work Strategies, LLC GOES USERS’ CONFERENCE III: Discussion Highlights.
3/30/04 16:14 1 Lessons Learned CERES Data Management Presented to GIST 21 “If the 3 laws of climate are calibrate, calibrate, calibrate, then the 3 laws.
Climate data past and future: Can we more effectively monitor and understand our changing climate? Peter Thorne.
Introduction of Geoprocessing Lecture 9. Geoprocessing  Geoprocessing is any GIS operation used to manipulate data. A typical geoprocessing operation.
Five Tips in Ten Minutes Conversion Thursday London 13 th Oct 2011.
Meteorological Observatory Lindenberg Results of the Measurement Strategy of the GCOS Reference Upper Air Network (GRUAN) Holger Vömel, GRUAN.
PROG Developing Robust Modular Software.. Objectives What do we want? Programmatic Elements in a Business System. Logic Layer. Persistence (Data)
Meteorological Observatory Lindenberg – Richard Assmann Observatory (2010) GCOS Reference Upper Air Network Holger Vömel Meteorological Observatory Lindenberg.
Applying Use Cases to Implementation (Chapters 25,26 - Requirements Text) Steve Chenoweth & Chandan Rupakheti Question 1.
Opportunities for Satellite Observations in Validation and Assessment Barbara Brown NCAR/RAL 19 November 2008.
Support to scientific research on seasonal-to-decadal climate and air quality modelling Pierre-Antoine Bretonnière Francesco Benincasa IC3-BSC - Spain.
Web Analytics – An Introduction
Data Science Interview Questions 1.What do you mean by word Data Science? Data Science is the extraction of knowledge from large.
May 9th, 2015 Market Research Describe the purpose of marketing research.
Applied Methodologies in PHI Session 5: Evaluation Kath Roberts (NOO/EMPHO)
Avenues for Communication Teresa Jolley, DEFT153 Ltd LG Technical Advisors Group Presidential Workshop 22 nd May 2015.
Business Communication.  An expert, more generally, is a person with extensive knowledge or ability based on research, experience, or occupation and.
Online survey software tool has been a popular option among many these days who want to get a better understanding of the requirements of their products.
Digital Marketing Growth Hacking Tips for B2B Brands
Introduction to Machine Learning, its potential usage in network area,
CEDEFOP Session 4: Spreading the news around the world Data visualisation as a tool for ICEs Production of Skills Supply and Demand Forecasts Alphametrics.
Gaps assessment in GAIA-CLIM
Participation, transparency, accountability: The role of freedom of information in REDD+ Key messages - May 2013.
Users Requirements The inconsistencies between the UR and GCOS-2006 identified in some of the URDs will be reduced with the new iteration of the GCOS.
Evidence Synthesis/Systematic Reviews of Eyewitness Accuracy
Concepts and Definitions Work Breakdown Structures
Links in the Chain: turning learning analytics data into actions
Computer tools for Scheduling
Big-Data Fundamentals
Big-Data Fundamentals
4.00 Understand promotion and intermediate uses of marketing-information Understand marketing-research activities to show command of their nature.
What will we do today?. What will we do today?
Welcome: How to use this presentation
QA Validation in Big Data
UNDERSTANDING YOUR PSAT/NMSQT RESULTS
Plans of the GCOS Reference Upper Air Network (GRUAN)
Marketing Information System (MIS)
Tim Hewison1 (1) EUMETSAT
Star Early Literacy PreTest Instructions
Automating Profitable Growth™
Maintaining Your Site Module 8: Web Publishing and Maintenance
The Global Observing System for Climate Carolin Richter, Director
Status of the EUMETSAT GSICS DCC product
Clear Language and Organizational Change
UNDERSTANDING YOUR PSAT/NMSQT RESULTS
Management FUNCTIONS.
Hot summary of the Fourth GSICS Users Workshop
Understand Key Features
Preparing Industry for a New Code
What are systematic reviews and why do we need them?
Big DATA.
Section 28.2 Types, Trends, and Limitations of Marketing Research
A modest attempt at measuring and communicating about quality
V. Uddameri Texas Tech University
CloudVOTE Web App Tutorial
08120: Programming 2: SoftwareTesting and Debugging
Presentation transcript:

Data and meta-data exploration and data quality reporting for GCOS Jared Lewis (jared@bodekerscientific.com) Bodeker Scientific, New Zealand Meta-data is the often overshadowed by its big brothers, the data and its uncertainty Meta-data is extremely important. This has been alluded to by other presenters, but is typically focussed on the geographic meta-data (Visualising and subsetting datasets) Also important for future reprocessing of datasets, Today I want to discuss the use of meta-data for engaging data uses and evaluating data quality This is a case study on meta-data exploration of the GRUAN data

GRUAN GCOS Reference Upper Air Network Continuous long-term records of high-quality reference data Best possible uncertainties Best estimate + Uncertainty GRUAN Measurement Uncertainty of input data Traceable sensor calibration Transparent processing algorithm Disregarded systematic effects Black box software Proprietary methods GRUAN's goal is to provide continuous long-term records of high-quality reference data. Reference data in this respect means that the data are free of instrumental effects, that all known systematic biases have been corrected for and best-possible uncertainty estimates are provided. For example in case of a long-term data record which is the composite of various measurement systems (e.g. radiosondes) you don't have to worry about homogenisation. In other words: as a climate researcher doing analyses you can simply use GRUAN data as is. The concept of reference data quality is illustrated by this picture: Black box software: you don’t know what one has done to your data Disregarded systematic effects: your data is wrong! Proprietary methods: when improved/new corrections become available in the future you can't reprocess your data.

The problem Extracting value from meta-data is difficult Large number of variables Large amounts of data Need to know the question you want to answer Discuss GRUAN and how its meta-data is underutilised.

Similar problem Websites create large volumes of logs containing meta-data Meta-data provides information about customers Large incentive to extract value for decision making ($$$) Sparked the “Big Data” revolution New industry using meta-data New tools Need to make sure that the audience knows that the web site

Visualising Meta-data Log files Adapting existing technology to solve our problem

Visualising Meta-data Flight Data Adapting existing technology to solve our problem

3 minutes

Potential Benefits Empowers users to answer their own questions Identify where the data may be falling short of satisfying user’s needs Provide quantitative feedback for sites Benchmark existing sites Identify problems early

Outlook Kibana is being actively developed to be more user friendly Help the GRUAN working group to capture useful metrics Create site specific dashboards and metrics Need to find other datasets which could use these tools Key Message: By promoting the ability for data users to explore the meta-data may increase their use of the datasets