Presentation on theme: "Lecture (9) Frequency Analysis and Probability Plotting."— Presentation transcript:
Lecture (9) Frequency Analysis and Probability Plotting
Flood Frequency Analysis Statistical Methods to evaluate probability exceeding a particular outcome - P (X >20,000 cfs) = 10% Used to determine return periods of rainfall or flows Used to determine specific frequency flows for floodplain mapping purposes (10, 25, 50, 100 yr) Used for datasets that have no obvious trends Used to statistically extend data sets
Basics of Probabilistic Prediction of Peak Events What has happened and the frequency of events in a record are the best indicators of what can happen and its probability of happening in the future. Requires a hydrological record at a station. The record is analyzed to estimate the probability of events of various sizes, as if they were independent of one another. Then it is extrapolated to larger, rarer floods. BUT most hydrologic records are short and non- stationary (i.e. conditions of climate and watershed condition change during the recording period). Magnitude of this problem varies, but needs to be checked in each case.
Return Period (Recurrence Interval) It is the average period in which a certain magnitude of a hydrological event will be once equalled or exceeded. the Weibull formula m i is the rank of the ith event in a set of n events Tr is the recurrence interval (yr). The long-term average interval between events greater than Qi
Return period (Recurrence interval) cont. E.g. if the probability of exceeding 20 m 3 /sec in any year at a station is 0.01, It means that: there should be on average 10 events larger than 20 m 3 /sec in 1000 years, if the conditions affecting events at the site do not change. The average recurrence interval between events is 100 years, so we refer to such a discharge asthe 100-year event. The events will not occur regularly every 100 years
How to compute the Return Period 1. Arrange the data in ascending or descending order of magnitude ignoring their chronological sequence. 2. Each observation in the data has the same relative frequency of occurrence. 3. For n observations, fr=1/n. 4. For the min value of data we assign 100%. 5. For the next smallest of the data we assign 100%- (1/n)*100%. 6. Next, 100%- 2(1/n)*100%, …, etc. and end up of (1/n)*100% at the max value. Conversely, we can do the same form the max value to the min value (ascending).
Flood Frequency Curve a flood-frequency curve: It is a plot of the calculated values of Tr against Q i.
Plotting Position It is related to the the return period. It can be expressed in terms of relative frequency of occurrence or the probability that a hydrological event of a given magnitude will occur. The plotting position should satisfy some conditions (Gumbel, 1960): 1.The probability that the magnitude of a hydrological variable, X, equals or exceeds an indicated value, a, once in a return period of Tr years is, 2. The probability should be as close as possible to the observed frequency. 3. Neither zero or one.
Methods to compute p and Tr Where, m = rank or order of X (m=1 for the largest value, 2 for the second largest value, …, n for the smallest value. n= sample size.
Methods to compute p and Tr (Cont.)
Choosing Theoretical Probability Distribution of an Event Note that this procedure involves fitting a theoretical probability distribution to an observed sample drawn from an imaginary (but not well-understood) population The true theoretical probability distribution of the events is not known, and we have no reason to believe it is simple or has only 1 or parameters. Plotting the data set on various types of graph paper with different scales, designed to represent various theoretical probability distributions as straight lines, yields graphs of different shapes, which when extrapolated beyond the limits of measurement predict a range of events.
Normal Prob Paper converts the Normal CDF S curve into a straight line on a prob scale Normal Probability Paper
Flood frequency plots of same record on different probability papers Log-probability paper Log-extreme value paper Gumbel Type III
How to Choose a Theoretical Probability Distribution ? One common approach to choice of flood frequency plotting paper is to choose one on which the observed data plot as a straight line that can be extrapolated to estimate rare, large events. A second is to choose one of the common ones, but this still leads to different predictions among analysts
Some Guidelines Take Mean and Variance (S.D.) of ranked data Take Skewness C s of data (3rd moment about mean) If C s near zero, assume normal distn If C s large, convert Y = Log x - (Mean and Var of Y) Take Skewness of Log data - C s (Y) If C s near zero, then fits Lognormal If C s not zero, fit data to Log Pearson III
Siletz River Example 75 data points Mean20, Std Dev Skew Coef of Variation Original QY = Log Q
Siletz River Example - Fit Normal and LogN Normal Distribution Q = Q m + z S Q Q 100 = (6089) = 34,620 cfs Mean + z (S.D.) Where z = std normal variate - tables Log N Distribution Y = Y m + k S Y Y 100 = (0.129) = k = freq factor and Q = 10 Y = 39,100 cfs
Log Pearson Type III Log Pearson Type III Y = Y m + k S Y K is a function of Cs and Recurrence Interval Table 3.4 lists values for pos and neg skews For Cs = -0.15, thus K = 2.15 Y 100 = (0.129) = Q = 10 Y = 36,927 cfs for LP III Plot several points on Log Prob paper
What is the prob that flow exceeds some given value yr value Plot data with plotting position formula P = m/n+1, m = rank, n = # Log N distn plots as straight line LogN Prob Paper for CDF
Straight Line Fits Data Well Mean LogN Plot of Siletz R.
Various Fits of CDFs LP3 has curvature LN is straight line Siletz River Flow Data
The probability of getting x successes followed by n-x failures is the product of prob of n independent events: p x (1-p) n- x This represents only one possible outcome. The number of ways of choosing x successes out of n events is the binomial coeff. The resulting distribution is the Binomial or B(n,p). Risk Assessment (Binomial Distribution)
The probability of at least one success in n years, where the probability of success in any year is 1/T, is called the RISK. Prob success = p = 1/T and Prob failure = 1-p RISK = 1-P(0) = 1 - Prob(no success in n years) = 1 - (1-p) n = 1 - (1 - 1/T) n Reliability = (1 - 1/T) n Risk and Reliability
What is the probability of at least one 50 yr flood in a 30 year advance period, where the probability of success in any year is P=1/T = 0.02 RISK = 1 - (1 - 1/T) n = 1 - ( ) 30 = 1 - (0.98) 30 = or 46% If this is too large a risk, then increase design level to the 100 year where p = 0.01 RISK = 1 - (0.99) 30 = 0.26 or 26% Risk Example