The Viper Main Interface Layout and interpretation.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Correlation and Linear Regression.
Helper Variables Why do we want them? How do we create them? What to avoid with them? Tom Pagano
Correlation and Linear Regression
Chapter 13 Multiple Regression
Statistics for the Social Sciences Psychology 340 Spring 2005 Prediction cont.
Chapter 12 Multiple Regression
Class 6: Tuesday, Sep. 28 Section 2.4. Checking the assumptions of the simple linear regression model: –Residual plots –Normal quantile plots Outliers.
Examining Relationship of Variables  Response (dependent) variable - measures the outcome of a study.  Explanatory (Independent) variable - explains.
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Correlation and Regression Analysis
A Regression Model for Ensemble Forecasts David Unger Climate Prediction Center.
Simple Linear Regression Analysis
Example 16.3 Estimating Total Cost for Several Products.
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
Guide to Using Excel For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 6th Ed. Chapter 14: Multiple Regression.
A Brief Introduction to Statistical Forecasting Kevin Werner.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 11 Regression.
Regression and Correlation Methods Judy Zhong Ph.D.
Introduction to Linear Regression and Correlation Analysis
Great Basin Verification Task 2008 Increased Variability Review of 2008 April through July Period Forecast for 4 Selected Basins Determine what verification.
Time-Series Analysis and Forecasting – Part V To read at home.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft.
© 2004 Prentice-Hall, Inc.Chap 15-1 Basic Business Statistics (9 th Edition) Chapter 15 Multiple Regression Model Building.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Diploma in Statistics Introduction to Regression Lecture 2.21 Introduction to Regression Lecture Review of Lecture 2.1 –Homework –Multiple regression.
Statistics for the Social Sciences Psychology 340 Fall 2013 Correlation and Regression.
Where does VIPER get its data from? Tom Pagano
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 27 Time Series.
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 19 Linear Patterns.
Applications of Regression to Water Quality Analysis Unite 5: Module 18, Lecture 1.
Introduction to SPSS. Object of the class About the windows in SPSS The basics of managing data files The basic analysis in SPSS.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Summarizing Bivariate Data Non-linear Regression Example.
Time Series Analysis – Chapter 6 Odds and Ends
Copyright © 2011 Pearson Education, Inc. Time Series Chapter 27.
NRCS National Water and Climate Center Update Tom Pagano Natural Resources Conservation Service.
The Z-Score Regression Method and You Tom Pagano
Hydrologic Forecasting With Statistical Models Angus Goodbody David Garen USDA Natural Resources Conservation Service National Water and Climate Center.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
VIPER Optimization What is optimization? How does viper’s Station and Time Period optimization work? How to interpret results? What to avoid? Tom Pagano.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
Probabilistic seasonal water supply forecasting in an operational environment: the USDA-NRCS Perspective Tom Pagano
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Chapter 8 Linear Regression. Fat Versus Protein: An Example 30 items on the Burger King menu:
Using SPSS Note: The use of another statistical package such as Minitab is similar to using SPSS.
Operational Drought Monitoring and Forecasting at the USDA-NRCS Tom Pagano
Outline of Today’s Discussion 1.Seeing the big picture in MR: Prediction 2.Starting SPSS on the Different Models: Stepwise versus Hierarchical 3.Interpreting.
 Seeks to determine group membership from predictor variables ◦ Given group membership, how many people can we correctly classify?
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
NRCS National Water and Climate Center Update Tom Pagano Natural Resources Conservation Service.
Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
Chapter 15 Multiple Regression Model Building
Regression Analysis AGEC 784.
Correlation and Simple Linear Regression
(Residuals and
Introduction to Regression
Eviews Tutorial for Labor Economics Lei Lei
Regression Forecasting and Model Building
Seasonal Forecasting Using the Climate Predictability Tool
Chapter 13 Additional Topics in Regression Analysis
Chapter 7 Excel Extension: Now You Try!
Chapter 9 Excel Extension: Now You Try!
Presentation transcript:

The Viper Main Interface Layout and interpretation

The Viper Main Interface Layout and interpretation Selecting predictors and predictands Global month changes

The Viper Main Interface Layout and interpretation Selecting predictors and predictands Predictors quality, availability Global month changes Historical statistics

The Viper Main Interface Layout and interpretation Selecting predictors and predictands Predictors quality, availability Forecast vs observed time series Station availability, weights Global month changes Historical statistics

The Viper Main Interface Layout and interpretation Selecting predictors and predictands Predictors quality, availability Forecast vs observed time series Station availability, weights Fcst vs obs scatterplot Helper variable Scatterplot/ Forecast progression Global month changes Historical statistics

The Viper Main Interface Layout and interpretation Selecting predictors and predictands Predictors quality, availability Probability bounds Forecast vs observed time series Station availability, weights Fcst vs obs scatterplot Helper variable Scatterplot/ Forecast progression Settings Global month changes Historical statistics

The Viper Main Interface Layout and interpretation Selecting predictors and predictands Predictors quality, availability Probability bounds Forecast vs observed time series Station availability, weights Fcst vs obs scatterplot Helper variable Scatterplot/ Forecast progression Settings Global month changes Historical statistics There’s more if you scroll right: Relate any variable to another

Selecting the predictand: Specify a type (e.g. Forecast point, snow, precipitation…)

Selecting the predictand: Specify a type (e.g. Forecast point, snow, precipitation…) Specify target station Sorted by USGS ID, upstream to downstream If other than streamflow, ordered alphabetically

Selecting the predictand: Specify a type (e.g. Forecast point, snow, precipitation…) Specify target station Sorted by USGS ID, upstream to downstream If other than streamflow, ordered alphabetically Specify months “Jan “ : January of this year “Jan-1” : January of last year “Jan F” : 1 st half of January (e.g. Jan 1, Jan1-15) “Jan L” : 2 nd half of January (e.g. Jan 16, Jan 16-31)

Selecting the predictors: Is the station used or not? (checked = yes)

Selecting the predictors: Is the station used or not? (checked = yes) Station groups as defined on “station” sheet

Selecting the predictors: Is the station used or not? (checked = yes) Station groups as defined on “station” sheet Clicking on a carat box sends you to the data sheet for that station. Experts may learn how to edit data once there.

Predictor quality, availability

Predictor quality, availability Station number and status indicator (e.g. low skill, missing) Period of record correlation coefficient with predictand 1=good,0=bad Predictor years (overlapping with target/ total) Realtime value as z-score (+1 high,-1 low) and % period of record normal Realtime forecast as if only one predictor was being used

Predictor quality, availability Station number and status indicator (e.g. low skill, missing) Period of record correlation coefficient with predictand 1=good,0=bad Predictor years (overlapping with target/ total) Realtime value as z-score (+1 high,-1 low) and % period of record normal Realtime forecast as if only one predictor was being used

Predictor quality, availability Station number and status indicator (e.g. low skill, missing) Period of record correlation coefficient with predictand 1=good,0=bad Predictor years (overlapping with target/ total) Realtime value as z-score (+1 high,-1 low) and % period of record normal Realtime forecast as if only one predictor was being used

Some data problems Missing: Realtime data value is unavailable Low skill: The predictor is too poor to be considered (correlation too low) Short por: The predictor does not have enough years in common with the target

Group information Correlation and predictions are represented by each station group. “All” is using all stations together. The bold yellow cell is the “bottom line” of the forecast (in k-ac-ft).

Historical statistics of target POR – period of record (all years) – all available** years from (official normal available to right)

1 Analysis type: Z-Score or PCA regression 2 Probability bound information: (gray is editable) Probability of exceedence (low probability = wet) Volume in thousands of acre-feet Percent of normal 1 2

1 Analysis type: Z-Score or PCA regression 2 Probability bound information: (gray is editable) Probability of exceedence (low probability = wet) Volume in thousands of acre-feet Percent of normal 3 Skill statistics: Correlation^2, Standard Error, Standard Error Skill Score Including all years or jackknifed (leave 1 year out at a time) 4. Official normal

Forecast (blue) versus observed (red) time series and historical forecasts (green)

“Forecast” is the calibration set of forecasting equation you set up “Historical forecast” is historical published outlook, issued back in the day Leadtime of historical forecast is specified in cell O46 “Publication Date”

Station weighting time series For each year, what is the relative contribution of skill (R^2) from each station? Taller means more skillful information Example: 1 station… Contributes 100% of skill in all years R^2 =.33

Station weighting time series For each year, what is the relative contribution of skill (R^2) from each station? Taller means more skillful information Example: 1 station… Contributes 100% of skill in all years 3 stations… 1 long, medium and short record These are the weights used in Z-Score regression. Not used in PCA, but shown anyway. (Gets more complicated with multiple groups, but idea the same) R^2 = R^2 =.77

Forecast vs observed scatter plot Gray lines show exceedence probabilities Red dot shows current forecast “Toggle graph” identifies individual years

Helper predictand (x) vs original target (y) Appears blank if helper not used In this example, AWDB vs USGS data shown “Switch” brings you to a plot of forecast vs leadtime Helper Predictand Equation output Published forecasts

Do non-linear regression by transforming predictand (e.g. square root) 2 Limit the start and end year of the analysis 3 Acquire predictand flow data from AWDB or USGS 4 Publication date of forecast (changes each month)

Do non-linear regression by transforming predictand (e.g. square root) 2 Limit the start and end year of the analysis 3 Acquire predictand flow data from AWDB or USGS 4 Publication date of forecast (changes each month) 5 Buttons to view additional predictors, advanced settings, helper 6 Number of principal components retained and % variance explained

“Original” = Value stored in recent data sheet “Estimated” = What you might expect given the other variable “O-E” = Original-Estimated Scroll right on main interface