Hydrologic Statistics

Slides:



Advertisements
Similar presentations
Hydrologic Statistics Reading: Chapter 11, Sections 12-1 and 12-2 of Applied Hydrology 04/04/2006.
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Appendix A. Descriptive Statistics Statistics used to organize and summarize data in a meaningful way.
Hydrologic Statistics
Simple Linear Regression
QUANTITATIVE DATA ANALYSIS
Start Audio Lecture! FOR462: Watershed Science & Management 1 Streamflow Analysis Module 8.7.
Chapter 12 Simple Regression
Statistics: Data Analysis and Presentation Fr Clinic II.
Data Freshman Clinic II. Overview n Populations and Samples n Presentation n Tables and Figures n Central Tendency n Variability n Confidence Intervals.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Statistics: Data Presentation & Analysis Fr Clinic I.
Chapter Topics Types of Regression Models
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
CHAPTER 6 Statistical Analysis of Experimental Data
Simple Linear Regression Analysis
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Pertemua 19 Regresi Linier
Chapter 14 Introduction to Linear Regression and Correlation Analysis
CHAPTER 6 Statistical Analysis of Experimental Data
Simple Linear Regression and Correlation
Flood Frequency Analysis
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Hypothesis Testing in Linear Regression Analysis
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Graphical Analysis. Why Graph Data? Graphical methods Require very little training Easy to use Massive amounts of data can be presented more readily Can.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
CHAPTER 1 Basic Statistics Statistics in Engineering
1 DATA DESCRIPTION. 2 Units l Unit: entity we are studying, subject if human being l Each unit/subject has certain parameters, e.g., a student (subject)
Statistics & Flood Frequency Chapter 3 Dr. Philip B. Bedient Rice University 2006.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
FREQUENCY ANALYSIS.
Chapter 2 Describing Data.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
Statistical Methods Statistical Methods Descriptive Inferential
“ Building Strong “ Delivering Integrated, Sustainable, Water Resources Solutions Statistics 101 Robert C. Patev NAD Regional Technical Specialist (978)
Introductory Statistics. Learning Objectives l Distinguish between different data types l Evaluate the central tendency of realistic business data l Evaluate.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
CY1B2 Statistics1 (ii) Poisson distribution The Poisson distribution resembles the binomial distribution if the probability of an accident is very small.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-1 Chapter 2: Displaying and Summarizing Data Part 2: Descriptive Statistics.
Data Analysis, Presentation, and Statistics
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
Chapter 20 Statistical Considerations Lecture Slides The McGraw-Hill Companies © 2012.
BUSINESS MATHEMATICS & STATISTICS. Module 6 Correlation ( Lecture 28-29) Line Fitting ( Lectures 30-31) Time Series and Exponential Smoothing ( Lectures.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Stats Methods at IC Lecture 3: Regression.
Chapter 13 Simple Linear Regression
The simple linear regression model and parameter estimation
Chapter 4: Basic Estimation Techniques
Chapter 4 Basic Estimation Techniques
Analysis and Empirical Results
Regression Analysis AGEC 784.
Review 1. Describing variables.
Data Mining: Concepts and Techniques
Statistical Hydrology and Flood Frequency
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Flood Frequency Analysis
Basic Estimation Techniques
MEGN 537 – Probabilistic Biomechanics Ch.3 – Quantifying Uncertainty
Hydrologic Statistics
Presentation transcript:

Hydrologic Statistics 04/04/2006 Hydrologic Statistics Reading: Chapter 11 in Applied Hydrology Some slides by Venkatesh Merwade

Classification based on randomness. Hydrologic Models Classification based on randomness. Deterministic (eg. Rainfall runoff analysis) Analysis of hydrological processes using deterministic approaches Hydrological parameters are based on physical relations of the various components of the hydrologic cycle. Do not consider randomness; a given input produces the same output. Stochastic (eg. flood frequency analysis) Probabilistic description and modeling of hydrologic phenomena Statistical analysis of hydrologic data.

Probability A measure of how likely an event will occur A number expressing the ratio of favorable outcome to the all possible outcomes Probability is usually represented as P(.) P (getting a club from a deck of playing cards) = 13/52 = 0.25 = 25 % P (getting a 3 after rolling a dice) = 1/6

Random Variable Random variable: a quantity used to represent probabilistic uncertainty Incremental precipitation Instantaneous streamflow Wind velocity Random variable (X) is described by a probability distribution Probability distribution is a set of probabilities associated with the values in a random variable’s sample space

Sampling terminology Sample: a finite set of observations x1, x2,….., xn of the random variable A sample comes from a hypothetical infinite population possessing constant statistical properties Sample space: set of possible samples that can be drawn from a population Event: subset of a sample space Example Population: streamflow Sample space: instantaneous streamflow, annual maximum streamflow, daily average streamflow Sample: 100 observations of annual max. streamflow Event: daily average streamflow > 100 cfs

Types of sampling Random sampling: the likelihood of selection of each member of the population is equal Pick any streamflow value from a population Stratified sampling: Population is divided into groups, and then a random sampling is used Pick a streamflow value from annual maximum series. Uniform sampling: Data are selected such that the points are uniformly far apart in time or space Pick steamflow values measured on Monday midnight Convenience sampling: Data are collected according to the convenience of experimenter. Pick streamflow during summer

Summary statistics Also called descriptive statistics If x1, x2, …xn is a sample then Mean, m for continuous data Variance, s2 for continuous data s for continuous data Standard deviation, Coeff. of variation, Also included in summary statistics are median, skewness, correlation coefficient,

Graphical display Time Series plots Histograms/Frequency distribution Cumulative distribution functions Flow duration curve

Colorado River near Austin Time series plot Plot of variable versus time (bar/line/points) Example. Annual maximum flow series Colorado River near Austin

Histogram Plots of bars whose height is the number ni, or fraction (ni/N), of data falling into one of several intervals of equal width Interval = 25,000 cfs Interval = 50,000 cfs Interval = 10,000 cfs Dividing the number of occurrences with the total number of points will give Probability Mass Function

Using Excel to plot histograms 1) Make sure Analysis Tookpak is added in Tools. This will add data analysis command in Tools 2) Fill one column with the data, and another with the intervals (eg. for 50 cfs interval, fill 0,50,100,…) 3) Go to ToolsData AnalysisHistogram 4) Organize the plot in a presentable form (change fonts, scale, color, etc.)

Probability density function Continuous form of probability mass function is probability density function pdf is the first derivative of a cumulative distribution function

Cumulative distribution function Cumulate the pdf to produce a cdf Cdf describes the probability that a random variable is less than or equal to specified value of x P (Q ≤ 50000) = 0.8 P (Q ≤ 25000) = 0.4

Flow duration curve A cumulative frequency curve that shows the percentage of time that specified discharges are equaled or exceeded. Steps Arrange flows in chronological order Find the number of records (N) Sort the data from highest to lowest Rank the data (m=1 for the highest value and m=N for the lowest value) Compute exceedance probability for each value using the following formula Plot p on x axis and Q (sorted) on y axis

Flow duration curve in Excel Median flow

Statistical analysis Regression analysis Mass curve analysis Flood frequency analysis Many more which are beyond the scope of this class!

Linear Regression A technique to determine the relationship between two random variables. Relationship between discharge and velocity in a stream Relationship between discharge and water quality constituents A regression model is given by : yi = ith observation of the response (dependent variable) xi = ith observation of the explanatory (independent) variable b0 = intercept b1 = slope ei = random error or residual for the ith observation n = sample size

Least square regression We have x1, x2, …, xn and y1,y2, …, yn observations of independent and dependent variables, respectively. Define a linear model for yi, Fit the model (find b0 and b1) such at the sum of the squares of the vertical deviations is minimum Minimize Regression applet: http://www.math.csusb.edu/faculty/stanton/m262/regress/regress.html

Linear Regression in Excel Steps: Prepare a scatter plot Fit a trend line Data are for Brazos River near Highbank, TX Alternatively, one can use ToolsData AnalysisRegression

Coefficient of determination (R2) It is the proportion of observed y variation that can be explained by the simple linear regression model Total sum of squares, Ybar is the mean of yi Error sum of squares The higher the value of R2, the more successful is the model in explaining y variation. If R2 is small, search for an alternative model (non linear or multiple regression model) that can more effectively explain y variation