MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

Estimation of Means and Proportions
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
Likelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests
Sampling: Final and Initial Sample Size Determination
Estimation  Samples are collected to estimate characteristics of the population of particular interest. Parameter – numerical characteristic of the population.
Outline input analysis input analyzer of ARENA parameter estimation
SOLVED EXAMPLES.
MEGN 537 – Probabilistic Biomechanics Ch.4 – Common Probability Distributions Anthony J Petrella, PhD.
Probability distribution functions Normal distribution Lognormal distribution Mean, median and mode Tails Extreme value distributions.
FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions.
1 Choice of Distribution 1.Theoretical Basis e.g. CLT, Extreme value 2.Simplify calculations e.g. Normal or Log Normal 3.Based on data: - Histogram - Probability.
Statistical Inference Chapter 12/13. COMP 5340/6340 Statistical Inference2 Statistical Inference Given a sample of observations from a population, the.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Statistical inference form observational data Parameter estimation: Method of moments Use the data you have to calculate first and second moment To fit.
IEEM 3201 Two-Sample Estimation: Paired Observation, Difference.
Chapter Topics Confidence Interval Estimation for the Mean (s Known)
OMS 201 Review. Range The range of a data set is the difference between the largest and smallest data values. It is the simplest measure of dispersion.
Inference about a Mean Part II
Inferences About Process Quality
BCOR 1020 Business Statistics
Statistical Intervals for a Single Sample
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
SECTION 6.4 Confidence Intervals for Variance and Standard Deviation Larson/Farber 4th ed 1.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. A PowerPoint Presentation Package to Accompany Applied Statistics.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Statistics for Data Miners: Part I (continued) S.T. Balke.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Estimation in Sampling!? Chapter 7 – Statistical Problem Solving in Geography.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Measures of Dispersion CUMULATIVE FREQUENCIES INTER-QUARTILE RANGE RANGE MEAN DEVIATION VARIANCE and STANDARD DEVIATION STATISTICS: DESCRIBING VARIABILITY.
1 Statistical Distribution Fitting Dr. Jason Merrick.
Tests for Random Numbers Dr. Akram Ibrahim Aly Lecture (9)
CS433: Modeling and Simulation Dr. Anis Koubâa Al-Imam Mohammad bin Saud University 15 October 2010 Lecture 05: Statistical Analysis Tools.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Chi-squared Tests. We want to test the “goodness of fit” of a particular theoretical distribution to an observed distribution. The procedure is: 1. Set.
Ch9. Inferences Concerning Proportions. Outline Estimation of Proportions Hypothesis concerning one Proportion Hypothesis concerning several proportions.
Chapter 7 Point Estimation
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Determination of Sample Size: A Review of Statistical Theory
Selecting Input Probability Distribution. Simulation Machine Simulation can be considered as an Engine with input and output as follows: Simulation Engine.
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Probability = Relative Frequency. Typical Distribution for a Discrete Variable.
CHAPTER SEVEN ESTIMATION. 7.1 A Point Estimate: A point estimate of some population parameter is a single value of a statistic (parameter space). For.
ENGR 610 Applied Statistics Fall Week 4 Marshall University CITE Jack Smith.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Statistics Sampling Intervals for a Single Sample Contents, figures, and exercises come from the textbook: Applied Statistics and Probability for Engineers,
Confidence Intervals for Variance and Standard Deviation.
Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5.
Mystery 1Mystery 2Mystery 3.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
© Copyright McGraw-Hill 2004
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
BASIC STATISTICAL CONCEPTS Statistical Moments & Probability Density Functions Ocean is not “stationary” “Stationary” - statistical properties remain constant.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Chapter 9: One- and Two-Sample Estimation Problems: 9.1 Introduction: · Suppose we have a population with some unknown parameter(s). Example: Normal( ,
Chapter 20 Statistical Considerations Lecture Slides The McGraw-Hill Companies © 2012.
Section 6.4 Inferences for Variances. Chi-square probability densities.
9-1 ESTIMATION Session Factors Affecting Confidence Interval Estimates The factors that determine the width of a confidence interval are: 1.The.
MEGN 537 – Probabilistic Biomechanics Ch.4 – Common Probability Distributions Anthony J Petrella, PhD.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Point and interval estimations of parameters of the normally up-diffused sign. Concept of statistical evaluation.
CONCEPTS OF ESTIMATION
MEGN 537 – Probabilistic Biomechanics Ch
Chapter 6 Confidence Intervals.
Presentation transcript:

MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD

Determination of Distribution The underlying distribution can be established in one of the following ways: Drawing a frequency diagram Plotting the data on probability paper Conducting statistical tests known as goodness-of-fit tests for distribution

Probability Paper Gumbel (1954) N observations (X 1, X 2, X 3 …X N ) Arrange Data in increasing order ith value is plotted at the CDF of i/(N+1)

Probability Paper

Plotted versus Normal Dist

Goodness of Fit Question: Whether two independent samples come from identical continuous distributions? Dataset compared to the theoretical distribution Restated: Is the theoretical distribution an acceptable representation of the dataset? Chi Square based on PDF Kolmogorov-Smirnov based on the CDF

Based on error between the observed and assumed PDF of the distribution Methodology: Arrange N data points in increasing order Break data into m intervals Determine: n i – observed frequency of data points in interval “i” e i – theoretical Frequency of data points in interval “i” Chi-Square Test (  2 )

Methodology: Determine c 1- ,f  = Significance Level (usually between 1% and 10%) f = degrees of freedom = m – 1 – k m = # of intervals k = # of distribution parameters (= 2 for normal or lognormal) Obtain c 1- ,f from Appendix 3 The assumed distribution is acceptable at the significance level  if: Chi-Square Test (  2 ) NOTE: m should be > = 5 to obtain satisfactory results

Example (Haldar 5.2)

a) Uniform distributed random variables Ordinary graph paper can be prob. paper b)

Example (Haldar 5.2) c) f = m – 1 - k

Example (Haldar 5.5) Perform Chi-square test on the data from Problem 3.1. Can the underlying distribution be accepted as normal at a 5% significance level? f = degrees of freedom = m – 1 – k m = # of intervals k = # of distribution parameters

Solution (Haldar 5.5a)

Kolmogorov-Smirnov (K-S) Test Based on the error between the observed and assumed CDF of the distribution Methodology: Arrange data in increasing order and assign index, m to each data point where m = 1,2,…,n Determine S n (x i ) = manual CDF: S n (x i ) = 0; x < x 1 S n (x i ) = m/n; x m ≤ x ≤ x m+1 S n (x i ) = 1;x ≥ x n Determine F X (x i ) = Assumed distribution

K-S Test Methodology: Determine D n = max| F x (x i ) – S n (x i ) | Determine D n   = Significance Level D n  value found in Appendix 4 The assumed distribution is acceptable at the significance level  if the maximum difference D n is less than or equal to the tabulated value of D n 

Example (Haldar 5.8) Perform K-S test on the data from Problem 3.1. Can the underlying distribution be accepted as normal at a 5% significance level?

Solution (Haldar, 5.8)

Parameter Estimation

Method of Moments Moments are statistical parameters of a dataset 1 st moment (mean = E(X)) 2 nd central moment (Var(X)) 3 rd central moment (skewness) Distribution parameters are derived from the moments PDF forms and parameters for distributions in Table 5.6 on page 118 All are based on first two moments, E(x) and Var(X)

Method of Maximum Likelihood For a random variable X, if x 1, x 2, … x n are the n observations or sample values, the estimated value of the parameter is the value most likely to produce these observed values. L(x 1, x 2, … x n ;p) = f x (x 1 ;p) f x (x 2 ;p)…f x (x n ;p) p = distribution parameter that needs to be established Maximize likelihood L by setting dL/dp = 0 Solve for p Solve for multiple parameters with simultaneous equations Results are very similar to method of moments

Interval Estimation Differences exist between expected values of populations and samples Distribution parameters (  ) are typically Estimated from samples Applied to populations Intervals estimate the range of possible values for the parameter to a specified level of confidence

Confidence Intervals Distributions can be linked to probability – making possible predictions and evaluations of the likelihood of a particular occurrence In a normal distribution, the number of standard deviations from the mean tells us the percent distribution of the data and thus the probability of occurrence

x = Mean  = Standard Deviation n = Sample Size (1 –  ) = Confidence Interval k  /2 = value of the standard normal variate (z) =  -1 (p) (found using Appendix 1) Interval Estimation for the Mean with Known Variance Two tailed interval!

Lower Confidence Limit for  Upper Confidence Limit for  Lower and Upper Confidence Limit for the Mean with Known Variance Each is a one tailed interval!

Interval Estimation for the Mean with Unknown Variance t  /2,n-1 = value of Student’s t distribution – found using Appendix 5 Standard normal distribution valid for… Known population variance Large n ( > 30) If n is small (< 10), s ≠  use Student’s t

Student’s t distribution f = n – 1 = DOF

Interval Estimation for Variance C ,n-1 = value of Chi Square distribution – found using Appendix 3