Lecture (9) Frequency Analysis and Probability Plotting.

Slides:



Advertisements
Similar presentations
Introduction to modelling extremes
Advertisements

Introduction to modelling extremes Marian Scott (with thanks to Clive Anderson, Trevor Hoey) NERC August 2009.
Chapter 7 Sampling and Sampling Distributions
WFM-6204: Hydrologic Statistics
WFM 6204: Hydrologic Statistics © Dr. Akm Saiful IslamDr. Akm Saiful Islam WFM-6204: Hydrologic Statistics Akm Saiful Islam Lecture-3: Probabilistic analysis:
THE CENTRAL LIMIT THEOREM
Frequency Analysis Reading: Applied Hydrology Sections 12-2 to 12-6.
Chapter 4 Inference About Process Quality
Estimation of Means and Proportions
Hydrologic Statistics Reading: Chapter 11, Sections 12-1 and 12-2 of Applied Hydrology 04/04/2006.
Applied Hydrology Regional Frequency Analysis - Example Prof. Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Sampling Distributions for Counts and Proportions
Parameters of distribution
1 McGill University Department of Civil Engineering and Applied Mechanics Montreal, Quebec, Canada.
Hydrologic Statistics
Modeling Process Quality
Surface Water Hydrology Summarized in one equation V = velocity, fps or m/s A = channel cross-sectional area, sf or m 2.
FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions.
Start Audio Lecture! FOR462: Watershed Science & Management 1 Streamflow Analysis Module 8.7.
Topics: Inferential Statistics
1 Choice of Distribution 1.Theoretical Basis e.g. CLT, Extreme value 2.Simplify calculations e.g. Normal or Log Normal 3.Based on data: - Histogram - Probability.
WFM 5201: Data Management and Statistical Analysis
Statistics and Probability Theory Prof. Dr. Michael Havbro Faber
Statistics 800: Quantitative Business Analysis for Decision Making Measures of Locations and Variability.
Analyses of Rainfall Hydrology and Water Resources RG744
Flood Frequency Analysis
Hydrologic Statistics
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
Estimation of Areal Precipitation from point measurements Most often interested in quantifying rainfall over an entire watershed. Has to be inferred from.
I am sure you have heard about the farmer in Sidell, Illinois. After that fiasco with the cheese being left standing alone taking the blame for the failure.
WFM 5201: Data Management and Statistical Analysis © Dr. Akm Saiful IslamDr. Akm Saiful Islam WFM 5201: Data Management and Statistical Analysis Akm Saiful.
CE 3354 ENGINEERING HYDROLOGY Lecture 6: Probability Estimation Modeling.
Statistics & Flood Frequency Chapter 3 Dr. Philip B. Bedient Rice University 2006.
FREQUENCY ANALYSIS.
Frequency Analysis and Data Reading: Applied Hydrology Sections
Skewness & Kurtosis: Reference
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
EAS31116/B9036: Statistics in Earth & Atmospheric Sciences Lecture 3: Probability Distributions (cont’d) Instructor: Prof. Johnny Luo
Basic Hydrology & Hydraulics: DES 601
Stracener_EMIS 7305/5305_Spr08_ Reliability Data Analysis and Model Selection Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering.
Basic Hydrology: Gauge Analysis. Gage Analysis Gage analysis is use of historical records to construct a frequency curve for a gauging station. This frequency.
© 2010 Pearson Prentice Hall. All rights reserved 7-1.
Probability distributions
BASIC STATISTICAL CONCEPTS Statistical Moments & Probability Density Functions Ocean is not “stationary” “Stationary” - statistical properties remain constant.
FREQUENCY ANALYSIS Siti Kamariah Md Sa’at PPK Bioprocess
CE 3354 ENGINEERING HYDROLOGY Lecture 6: Probability Estimation Modeling.
Hydrological Forecasting. Introduction: How to use knowledge to predict from existing data, what will happen in future?. This is a fundamental problem.
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
UNIT – III FLOODS Types of floods Following are the various types of floods: 1.Probable Maximum Flood (PMF):This is the flood resulting from the most sever.
Analyses of Rainfall Hydrology and Water Resources RG744 Institute of Space Technology October 09, 2015.
Hydrological Statistics
MECH 373 Instrumentation and Measurements
Basic Hydrology: Flood Frequency
Hydrology & Water Resources Eng.
Modeling and Simulation CS 313
Basic Hydrology & Hydraulics: DES 601
Statistical Hydrology and Flood Frequency
Flood Frequency Analysis
Hazards Planning and Risk Management Flood Frequency Analysis
Statistics & Flood Frequency Chapter 3 – Part 1
Distributions and Flood Frequency Chapter 3 – Part 2
Hydrologic Statistics
Stochastic Hydrology Hydrological Frequency Analysis (I) Fundamentals of HFA Prof. Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering.
Flood Frequency Analysis
Statistics & Flood Frequency Chapter 3
HYDROLOGY Lecture 12 Probability
Chapter Nine: Using Statistics to Answer Questions
Presentation transcript:

Lecture (9) Frequency Analysis and Probability Plotting

Flood Frequency Analysis Statistical Methods to evaluate probability exceeding a particular outcome - P (X >20,000 cfs) = 10% Used to determine return periods of rainfall or flows Used to determine specific frequency flows for floodplain mapping purposes (10, 25, 50, 100 yr) Used for datasets that have no obvious trends Used to statistically extend data sets

Basics of Probabilistic Prediction of Peak Events “What has happened and the frequency of events in a record are the best indicators of what can happen and its probability of happening in the future.” Requires a hydrological record at a station. The record is analyzed to estimate the probability of events of various sizes, as if they were independent of one another. Then it is extrapolated to larger, rarer floods. BUT most hydrologic records are short and non-stationary (i.e. conditions of climate and watershed condition change during the recording period). Magnitude of this problem varies, but needs to be checked in each case.

Return Period (Recurrence Interval) It is the average period in which a certain magnitude of a hydrological event will be once equalled or exceeded. the Weibull formula mi is the rank of the ith event in a set of n events Tr is the recurrence interval (yr). The long-term average interval between events greater than Qi

Return period (Recurrence interval) cont. E.g. if the probability of exceeding 20 m3/sec in any year at a station is 0.01, It means that: there should be on average 10 events larger than 20 m3/sec in 1000 years, if the conditions affecting events at the site do not change. The average recurrence interval between events is 100 years, so we refer to such a discharge as “the 100-year event”. The events will not occur regularly every 100 years

How to compute the Return Period 1. Arrange the data in ascending or descending order of magnitude ignoring their chronological sequence. 2. Each observation in the data has the same relative frequency of occurrence. 3. For n observations, fr=1/n. 4. For the min value of data we assign 100%. 5. For the next smallest of the data we assign 100%- (1/n)*100%. 6. Next, 100%- 2(1/n)*100%, …, etc. and end up of (1/n)*100% at the max value. Conversely, we can do the same form the max value to the min value (ascending).

Flood Frequency Curve a flood-frequency curve: It is a plot of the calculated values of Tr against Qi.

Plotting Position It is related to the the return period. It can be expressed in terms of relative frequency of occurrence or the probability that a hydrological event of a given magnitude will occur. The plotting position should satisfy some conditions (Gumbel, 1960): The probability that the magnitude of a hydrological variable, X, equals or exceeds an indicated value, a, once in a return period of Tr years is, 2. The probability should be as close as possible to the observed frequency. 3. Neither zero or one.

Methods to compute p and Tr Where, m = rank or order of X (m=1 for the largest value, 2 for the second largest value, …, n for the smallest value. n= sample size.

Methods to compute p and Tr (Cont.)

Choosing Theoretical Probability Distribution of an Event Note that this procedure involves fitting a theoretical probability distribution to an observed sample drawn from an imaginary (but not well-understood) population The “true” theoretical probability distribution of the events is not known, and we have no reason to believe it is simple or has only 1 or parameters. Plotting the data set on various types of graph paper with different scales, designed to represent various theoretical probability distributions as straight lines, yields graphs of different shapes, which when extrapolated beyond the limits of measurement predict a range of events.

Normal Probability Paper Normal Prob Paper converts the Normal CDF S curve into a straight line on a prob scale

Log-normal Probability Paper

Weibull Probability Paper

Example 1

Example 2

Graphical Plot of Richmond Data

Example 3 Year Rank Ordered cfs 1940 1 42,700 1925 2 31,100 1932 3 20,700 1966 4 19,300 1969 5 14,200 1982 6 1988 7 12,100 1995 8 10,300 2000 …… …….

Flood frequency plots of same record on different probability papers Log-extreme value paper Gumbel Type III Log-probability paper

How to Choose a Theoretical Probability Distribution ? One common approach to choice of flood frequency plotting paper is to choose one on which the observed data plot as a straight line that can be extrapolated to estimate rare, large events. A second is to choose one of the common ones, but this still leads to different predictions among analysts

Some Guidelines Take Mean and Variance (S.D.) of ranked data Take Skewness Cs of data (3rd moment about mean) If Cs near zero, assume normal dist’n If Cs large, convert Y = Log x - (Mean and Var of Y) Take Skewness of Log data - Cs(Y) If Cs near zero, then fits Lognormal If Cs not zero, fit data to Log Pearson III

Siletz River Example 75 data points Original Q Y = Log Q Mean 20,452 4.2921 Std Dev 6089 0.129 Skew 0.7889 - 0.1565 Coef of Variation 0.298 0.03

Siletz River Example - Fit Normal and LogN Normal Distribution Q = Qm + z SQ Q100 = 20452 + 2.326(6089) = 34,620 cfs Mean + z (S.D.) Where z = std normal variate - tables Log N Distribution Y = Ym + k SY Y100 = 4.29209 + 2.326(0.129) = 4.5923 k = freq factor and Q = 10Y = 39,100 cfs

Log Pearson Type III Y = Ym + k SY K is a function of Cs and Recurrence Interval Table 3.4 lists values for pos and neg skews For Cs = -0.15, thus K = 2.15 Y100 = 4.29209 + 2.15(0.129) = 4.567 Q = 10Y = 36,927 cfs for LP III Plot several points on Log Prob paper

LogN Prob Paper for CDF What is the prob that flow exceeds     some given value - 100 yr value Plot data with plotting position    formula P = m/n+1 , m = rank, n = # Log N dist’n plots as straight line

LogN Plot of Siletz R. Mean Straight Line Fits Data Well

Siletz River Flow Data Various Fits of CDFs LP3 has curvature LN is straight line

Risk Assessment (Binomial Distribution) The probability of getting x successes followed by n-x failures is the product of prob of n independent events: px (1-p)n-x This represents only one possible outcome. The number of ways of choosing x successes out of n events is the binomial coeff. The resulting distribution is the Binomial or B(n,p).

Risk and Reliability The probability of at least one success in n years, where the probability of success in any year is 1/T, is called the RISK. Prob success = p = 1/T and Prob failure = 1-p RISK = 1-P(0) = 1 - Prob(no success in n years) = 1 - (1-p) n = 1 - (1 - 1/T) n Reliability = (1 - 1/T) n

Risk Example What is the probability of at least one 50 yr flood in a 30 year advance period, where the probability of success in any year is P=1/T = 0.02 RISK = 1 - (1 - 1/T)n = 1 - (1 - 0.02)30 = 1 - (0.98)30 = 0.455 or 46% If this is too large a risk, then increase design level to the 100 year where p = 0.01 RISK = 1 - (0.99)30 = 0.26 or 26%