Percentile Approximation Via Orthogonal Polynomials Hyung-Tae Ha Supervisor : Prof. Serge B. Provost.

Slides:



Advertisements
Similar presentations
Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
Advertisements

Slides prepared by Timothy I. Matis for SpringSim’06, April 4, 2006 Estimating Rare Event Probabilities Using Truncated Saddlepoint Approximations Timothy.
Chapter 6 Sampling and Sampling Distributions
CHAPTER 2 Building Empirical Model. Basic Statistical Concepts Consider this situation: The tension bond strength of portland cement mortar is an important.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
Sampling: Final and Initial Sample Size Determination
Regression Analysis Using Excel. Econometrics Econometrics is simply the statistical analysis of economic phenomena Here, we just summarize some of the.
Estimation A major purpose of statistics is to estimate some characteristics of a population. Take a sample from the population under study and Compute.
Maximum likelihood (ML) and likelihood ratio (LR) test
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Probability Densities
CF-3 Bank Hapoalim Jun-2001 Zvi Wiener Computational Finance.
Statistical Inference Chapter 12/13. COMP 5340/6340 Statistical Inference2 Statistical Inference Given a sample of observations from a population, the.
Chapter 7 Sampling and Sampling Distributions
Evaluating Hypotheses
WFM 5201: Data Management and Statistical Analysis
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Part III: Inference Topic 6 Sampling and Sampling Distributions
Horng-Chyi HorngStatistics II41 Inference on the Mean of a Population - Variance Known H 0 :  =  0 H 0 :  =  0 H 1 :    0, where  0 is a specified.
Inferences About Process Quality
Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.
Review What you have learned in QA 128 Business Statistics I.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Continuous Random Variables and Probability Distributions.
5-3 Inference on the Means of Two Populations, Variances Unknown
Maximum likelihood (ML)
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Statistical Intervals Based on a Single Sample.
Chapter 7 Inferences Regarding Population Variances.
Principles of the Global Positioning System Lecture 10 Prof. Thomas Herring Room A;
AM Recitation 2/10/11.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Random Sampling, Point Estimation and Maximum Likelihood.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 Part 4 Curve Fitting.
Applications The General Linear Model. Transformations.
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
1 SMU EMIS 7364 NTU TO-570-N Inferences About Process Quality Updated: 2/3/04 Statistical Quality Control Dr. Jerrell T. Stracener, SAE Fellow.
BINOMIALDISTRIBUTION AND ITS APPLICATION. Binomial Distribution  The binomial probability density function –f(x) = n C x p x q n-x for x=0,1,2,3…,n for.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
Chapter 5.6 From DeGroot & Schervish. Uniform Distribution.
6. Introduction to Spectral method. Finite difference method – approximate a function locally using lower order interpolating polynomials. Spectral method.
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
Fourier series, Discrete Time Fourier Transform and Characteristic functions.
Point Estimation of Parameters and Sampling Distributions Outlines:  Sampling Distributions and the central limit theorem  Point estimation  Methods.
1 Estimation of Population Mean Dr. T. T. Kachwala.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Week 111 Some facts about Power Series Consider the power series with non-negative coefficients a k. If converges for any positive value of t, say for.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 3 – Slide 1 of 27 Chapter 11 Section 3 Inference about Two Population Proportions.
Runge Kutta schemes Taylor series method Numeric solutions of ordinary differential equations.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
STATISTICS People sometimes use statistics to describe the results of an experiment or an investigation. This process is referred to as data analysis or.
1 Ka-fu Wong University of Hong Kong A Brief Review of Probability, Statistics, and Regression for Forecasting.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Biostatistics Class 3 Probability Distributions 2/15/2000.
Inference for the Mean of a Population
Part 5 - Chapter
Point and interval estimations of parameters of the normally up-diffused sign. Concept of statistical evaluation.
Parameter, Statistic and Random Samples
Statistics in Applied Science and Technology
Statistical Methods For Engineers
STAT 5372: Experimental Statistics
Elementary Statistics
Chapter 13: Inferences about Comparing Two Populations Lecture 7a
Inference on the Mean of a Population -Variance Known
Further Topics on Random Variables: 1
Berlin Chen Department of Computer Science & Information Engineering
Presentation transcript:

Percentile Approximation Via Orthogonal Polynomials Hyung-Tae Ha Supervisor : Prof. Serge B. Provost

2 u Orthogonal Polynomial Approximants u Application to Statistics on (a, b) O U T L I N E u Introduction u Application to Statistics on (0, ) u Computation and Mathematica Codes u Conclusions

Introduction

4 Research Domain: Density Approximation Density Estimate Density Approximant X i Sample Theoretical Moments Focus : Continuous distributions Unknown * We are considering the problem of approximating a density function from the theoretical moments (or cumulants) of a given distribution (for example, that of the sphericity test statistic)

5 Moment Problem 1. While it is usually possible to determine the moments of various random quantities used in statistical inference, their exact density functions are often times analytically intractable or difficult to obtain in closed forms. 2. Suppose a density function admits moments of all orders. A given moment sequence doesn’t define a density function uniquely in general. But it does when the random variable is on compact support. 3. The sufficient condition for uniqueness is that 4. The moments can be obtained from the derivatives of its moment generating function (MGF) or by making use of the recursive relationship to express moments in terms of cumulants. is absolutely convergent for some t > 0.

6 Literature Review Characteristic Paper Pearson CurveSaddlepoint Concept Daniels, H.E. (1954), “Saddlepoint Approximations in Statistics”, Annals of Mathematical Statistics Adequate Approximation A Variety of Applications Unimodal Difficult to implement Tail Approximation is good. A Variety of Applications Unimodal Using up to 4 moments Solomon and Stephens (1978), “Approximations to density functions using Pearson curves”, JASA Approximating density function using a few moments Approximating density function using cumulant generating function

7 Literature Review Characteristic Papers Cornish-Fisher ExpansionOrthogonal Series Expansion Concept Tiku (1965) Laguerre series forms 1. Expressible in terms of Hermite Polynomial 2. Gram-Charlier series 3. Edgeworth’s Expansion Cornish and Fisher (1938) Fisher and Cornish (1960) Hill and Davis (1968) Based on cumulants of a distribution Approximating the density function of noncentral Chi-squared and F random variables

Orthogonal Polynomial Approximants

9 Brief Review of Orthogonal Polynomials Suppose that w(x) is a nonnegative real function of a real variable x. Let (a, b) be a fixed interval on the x-axis and suppose further that, for n=0,1,…, the integral exists and that the integral is positive.

Then, there exists a sequence of polynomials p 0 (x), p 1 (x),…, p n (x),… that is uniquely determined by the following conditions: 1) is a polynomial of degree n and the coefficient of x n in this polynomial is positive. 2) The polynomials p 0 (x), p 1 (x),…, p n (x), … are orthogonal w.r.t. the weight function w(x) if We say that the polynomials p n (x) constitute a system of orthogonal polynomials on the interval (a, b) with the weight function w(x) and orthogonal factor. If, p n (x) is called orthonormal polynomials.

11 Orthogonal Polynomial Approximation Approximant Base Density Orthogonal Polynomial Coefficients

12 Jacobi Polynomials Base Density Jacobi Polynomial

13 Jacobi Polynomial Approximant Transformation Approximant X (-1, 1) Y (a, b)

14 Jacobi Polynomial Approximant Distribution Approximant where

15 Laguerre Polynomials Base Density Laguerre Polynomial

16 Laguerre Polynomial Approximant Transformation Approximant Y X = Given the moments of Y

Application to Statistics on Compact Support ~

18 The L vc Test Statistic * Hypothesis : All the variances and covariances are equal. * Test Statistic : by Wilks (1946) * Moments : where

19 In the case of P=3, N=11 * 4 th degree Jacobi Polynomial Density Approximant * Wilks (1946) determined that 1 st and 5 th percentiles are and , respectively. F 4 [0.1682]= F 4 [0.2802]=

20 The V test statistic * Hypothesis : Equality of variances in independent normal populations * Test Statistic : * Moments :

21 p=5, N=12 * 4 th degree Jacobi Polynomial Density Approximant * Mathai (1979) determined that 1 st and 5 th percentiles are and , respectively. F 4 [ ]= F 4 [ ]=

Application to Statistics on the Positive Half Line ~

23 Test statistic for a single covariance matrix * Hypothesis : Covariance matrix of multivariate normal population is equal to a given matrix * Test Statistic : * MGF :

24 p=5 and N=10 * 4 th degree Laguerre Polynomial Density Approximant * Korin (1968) determined that 95 st and 99 th percentiles are and 38.60, respectively. F 4 [31.40]= F 4 [ ]=

25 Generalized Test of Homoscedasticity * Hypothesis : The constancy of variance and covariance in k sets of p-variate normal samples * Test Statistic : * MGF :

26 p=2, k=5 and N=45

Computation and Mathematica Codes

28 Computational consideration 1.The symbolic computational package Mathematic was used for evaluating the approximants and plotting the graphs. 2. The code is short and simple. 3.The formula will be easier to program when orthogonal polynomials are built-in functions in the computing packages.

29 Mathematica Code : Jacobi Polynomial Approximant

30 Mathematica Code : Laguerre Polynomial Approximant

Conclusion

32 Concluding Remarks 1. The proposed density approximation methodology yields remarkably accurate percentage points while being relatively easy to program. 2.The proposed approximants can also accommodate a large number of moments, if need be. 3.For a vast array of statistics that are not widely utilized, statistical tables, when at all available or accessible, are likely to be incomplete; the proposed methodology could then prove particularly helpful in evaluating certain p-values. 4.When a table is needed for a specific combination of parameters, the proposed methodology could readily generate it.