FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions.

Slides:



Advertisements
Similar presentations
Chapter 3 Properties of Random Variables
Advertisements

Hydrologic Statistics Reading: Chapter 11, Sections 12-1 and 12-2 of Applied Hydrology 04/04/2006.
Hydrologic Statistics
Probability Densities
BCOR 1020 Business Statistics Lecture 15 – March 6, 2008.
Chapter 6 Continuous Random Variables and Probability Distributions
Analysis of Research Data
Probability and Statistics Review
Introduction to Educational Statistics
CHAPTER 6 Statistical Analysis of Experimental Data
Data observation and Descriptive Statistics
Chapter 21 Random Variables Discrete: Bernoulli, Binomial, Geometric, Poisson Continuous: Uniform, Exponential, Gamma, Normal Expectation & Variance, Joint.
 Deviation is a measure of difference for interval and ratio variables between the observed value and the mean.  The sign of deviation (positive or.
2 Textbook Shavelson, R.J. (1996). Statistical reasoning for the behavioral sciences (3 rd Ed.). Boston: Allyn & Bacon. Supplemental Material Ruiz-Primo,
Hydrologic Statistics
Chapter 4 Continuous Random Variables and Probability Distributions
Describing Data: Numerical
Statistics for Linguistics Students Michaelmas 2004 Week 1 Bettina Braun.
Chapter 3 Statistical Concepts.
EPE/EDP 557 Key Concepts / Terms –Empirical vs. Normative Questions Empirical Questions Normative Questions –Statistics Descriptive Statistics Inferential.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Statistics. Question Tell whether the following statement is true or false: Nominal measurement is the ranking of objects based on their relative standing.
Dept of Bioenvironmental Systems Engineering National Taiwan University Lab for Remote Sensing Hydrology and Spatial Modeling STATISTICS Random Variables.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Review and Preview This chapter combines the methods of descriptive statistics presented in.
Theory of Probability Statistics for Business and Economics.
Statistics & Flood Frequency Chapter 3 Dr. Philip B. Bedient Rice University 2006.
Measures of Dispersion CUMULATIVE FREQUENCIES INTER-QUARTILE RANGE RANGE MEAN DEVIATION VARIANCE and STANDARD DEVIATION STATISTICS: DESCRIBING VARIABILITY.
ENGR 610 Applied Statistics Fall Week 3 Marshall University CITE Jack Smith.
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
“ Building Strong “ Delivering Integrated, Sustainable, Water Resources Solutions Statistics 101 Robert C. Patev NAD Regional Technical Specialist (978)
Skewness & Kurtosis: Reference
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
1 Statistical Analysis – Descriptive Statistics Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
Univariate Descriptive Statistics Chapter 2. Lecture Overview Tabular and Graphical Techniques Distributions Measures of Central Tendency Measures of.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
CY1B2 Statistics1 (ii) Poisson distribution The Poisson distribution resembles the binomial distribution if the probability of an accident is very small.
Probability and Distributions. Deterministic vs. Random Processes In deterministic processes, the outcome can be predicted exactly in advance Eg. Force.
Probability distributions
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
1 Day 1 Quantitative Methods for Investment Management by Binam Ghimire.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
CHAPTER – 1 UNCERTAINTIES IN MEASUREMENTS. 1.3 PARENT AND SAMPLE DISTRIBUTIONS  If we make a measurement x i in of a quantity x, we expect our observation.
3.1 Statistical Distributions. Random Variable Observation = Variable Outcome = Random Variable Examples: – Weight/Size of animals – Animal surveys: detection.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Analysis of Financial Data Spring 2012 Lecture: Introduction Priyantha Wijayatunga Department of Statistics, Umeå University
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Lesson 99 - Continuous Random Variables HL Math - Santowski.
Central Bank of Egypt Basic statistics. Central Bank of Egypt 2 Index I.Measures of Central Tendency II.Measures of variability of distribution III.Covariance.
Biostatistics Class 3 Probability Distributions 2/15/2000.
Statistics -S1.
MECH 373 Instrumentation and Measurements
Data Mining: Concepts and Techniques
STAT 311 REVIEW (Quick & Dirty)
STATISTICS Random Variables and Distribution Functions
HYDROLOGY Lecture 12 Probability
Univariate Descriptive Statistics
Statistics & Flood Frequency Chapter 3 – Part 1
Chapter 5 Statistical Models in Simulation
Basic Statistical Terms
MEGN 537 – Probabilistic Biomechanics Ch.3 – Quantifying Uncertainty
Hydrologic Statistics
HYDROLOGY Lecture 12 Probability
Statistical analysis and its application
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
Advanced Algebra Unit 1 Vocabulary
Experiments, Outcomes, Events and Random Variables: A Revisit
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
Presentation transcript:

FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions.

FREQUENCY ANALYSIS Basic Assumptions: (a) Data analyzed are to be statistically independent & identically distributed - selection of data (Time dependence, time scale, mechanisms). (b) Change over time due to man-made (eg. urbanization) or natural processes do not alter the frequency relation - temporal trend in data (stationarity).

FREQUENCY ANALYSIS Practical Problems: (a) Selection of reasonable & simple distribution. (b) Estimation of parameters in distribution. (c) Assessment of risk with reasonable accuracy.

REVIEW OF BASIC CONCEPTS Probabilistic Outcome of a hydrologic event (e.g., rainfall amount & duration; flood peak discharge; wave height, etc.) is random and cannot be predicted with certainty. Terminologies  Population The collection of all possible outcomes relevant to the process of interest. Example: (1) Max. 2-hr rainfall depth: all non-negative real numbers; (2) No. of storm in June: all non-negative integer numbers.  Sample A measured segment (or subset) of the population.

REVIEW OF BASIC CONCEPTS Terminologies - Random Variable A variable describable by a probability distribution which specifies the chance that the variable will assume a particular value. Convention: Capital letter for random variables (say, X) whereas the lower case letter (say, x) for numerical realization that the random variable X will take. Example: X = rainfall amount in 2 hours (a random variable) ; x = mm/2hr (realization). - Random variables can be - discrete (eg., no. of rainy days in June) or - continuous (eg., max. 2-hr rainfall amount, flood discharge)..

REVIEW OF BASIC CONCEPTS Terminologies Frequency & Relative Frequency oFor discrete random variables: oFrequency is the number of occurrences of a specific event. Relative frequency is resulting from dividing frequency by the total number of events. e.g. n = no. of years having exactly 50 rainy days; N = total no. of years. Let n=10 years and N=100 years. Then, the frequency of having exactly 50 rainy days is 10 and the relative frequency of having exactly 50 rainy days in 100 years is n/N = 0.1. oFor continuous random variables: oFrequency needs to be defined for a class interval. oA plot of frequency or relative frequency versus class intervals is called histogram or probability polygon. oAs the number of sample gets infinitely large and class interval length approaches to zero, the histogram will become a smooth curve, called probability density function.

REVIEW OF BASIC CONCEPTS Terminologies Probability Density Function (PDF) – For a continuous random variable, the PDF must satisfy and f(x) ≥ 0 for all values of x. For a discrete random variable, the PDF must satisfy and 1≥ p(x) ≥ 0 for all values of x.

REVIEW OF BASIC CONCEPTS Terminologies - Cumulative Distribution Function - For a continuous random variable, For a discrete random variable, by

Statistical Properties of Random Variables Population - Synonymous to sample space, which describes the complete assemblage of all the values representative of a particular random process. Sample - Any subset of the population. Parameters - Quantities that are descriptive of the population in a statistical model. Normally, Greek letters are used to denote statistical parameters. Sample statistics (or simply statistics): Quantities calculated on the basis of sample observations.

Statistical Moments of Random Variables Descriptors commonly used to show statistical properties of a RV are those indicative (1) Central tendency; (2) Dispersion; (3) Asymmetry. Frequently used descriptors in these three categories are related to statistical moments of a RV. Two types of statistical moments are commonly used in hydrosystem engineering applications: (1) product-moments and (2) L-moments.

Product-Moments rth-order product-moment of X about any reference point X=x o is defined, for continuous case, as whereas for discrete case, where E[  ] is a statistical expectation operator. In practice, the first three moments (r=1, 2, 3) are used to describe the central tendency, variability, and asymmetry. Two types of product-moments are commonly used: –Raw moments: µ r '=E[X r ] rth-order moment about the origin; and –Central moments: µ r =E[(X  µ x ) r ] = rth-order central moment –Relations between two types of product-moments are:. where C n,x = binomial coefficient = n!/(x!(n-x)!) Main disadvantages of the product-moments are: (1) Estimation from sample observations is sensitive to the presence of outliers; and (2) Accuracy of sample product-moments deteriorates rapidly with increase in the order of the moments.

Mean, Mode, Median, and Quantiles Expectation (1 st -order moment) measures central tendency of random variable X –Mean (  ) = Expectation = 1 = location of the centroid of PDF or PMF. –Two operational properties of the expectation are useful: in which  k =E[X k ] for k = 1,2, …, K. For independent random variables, Mode (x mo ) - the value of a RV at which its PDF is peaked. The mode, x mo, can be obtained by solving Median (x md ) - value that splits the distribution into two equal halves, i.e, Quantiles - 100pth quantile of a RV X is a quantity x p that satisfies P(X  x p ) = F x (x p ) = p A PDF could be uni-modal, bimodal, or multi-modal. Generally, the mean, median, and mode of a random variable are different, unless the PDF is symmetric and uni-modal.

Uni-modal and bi-modal distributions

Variance, Standard Deviation, and Coefficient of Variation Variance is the second-order central moment measuring the spreading of a RV over its range, Standard deviation (  x ) is the positive square root of the variance. Coefficient of variation,  x =  x /  x, is a dimensionless measure; useful for comparing the degree of uncertainty of two RVs with different units. Three important properties of the variance are: –(1) Var[c] = 0 when c is a constant. –(2) Var[X] = E[X 2 ]  E 2 [X] –For multiple independent random variables, where a k =a constant and  k = standard deviation of X k, k=1,2,..., K.

Skewness Coefficient Measures asymmetry of the PDF of a random variable Skewness coefficient,  x, defined as The sign of the skewness coefficient indicates the degree of symmetry of the probability distribution function.skewness coefficient Pearson skewness coefficient – In practice, product-moments higher than 3rd-order are less used because they are unreliable and inaccurate when estimated from a small number of samples See Table for equations to compute the sample product- moments.Table

Relative locations of mean, median, and mode for positively-skewed, symmetric, and negatively-skewed distributions.

Product-moments of random variables

Kurtosis (  x ) Measure of the peakedness of a distribution. Related to the 4th central product-moment as For a normal RV, its kurtosis is equal to 3. Sometimes, coefficient of excess,  x =  x  3, is used. All feasible distribution functions, skewness coefficient and kurtosis must satisfy

Some Commonly Used Distributions NORMAL DISTRIBUTION Standardized Variable: Z has mean 0 and standard deviation 1.

Some Commonly Used Distributions STANDARD NORMAL DISTRIBUTION:

Some Commonly Used Distributions LOG-NORMAL DISTRIBUTION x f LN (x)  x =0. 6  x =1.3  x =0. 3 (a)  x = 1.0 (b)  x = 1.30

Some Commonly Used Distributions Gumbel (Extreme-Value Type I) Distribution

Some Commonly Used Distributions Log-Pearson Type 3 Distribution

Some Commonly Used Distributions Log-Pearson Type 3 Distribution with  >0, x  e  when  >0 and with  >0, x  e  when  <0