3 pivot quantities on which to base bootstrap confidence intervals Note that the first has a t(n-1) distribution when sampling from a normal population.

Slides:



Advertisements
Similar presentations
Estimating a Population Variance
Advertisements

CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Chap 8-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 8 Estimation: Single Population Statistics for Business and Economics.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7.5 Estimation of a Population Variance This section presents methods.
BCOR 1020 Business Statistics Lecture 17 – March 18, 2008.
The standard error of the sample mean and confidence intervals
Confidence intervals. Population mean Assumption: sample from normal distribution.
Topics: Inferential Statistics
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Section 7-3 Estimating a Population Mean:  Known Created by.
The standard error of the sample mean and confidence intervals How far is the average sample mean from the population mean? In what interval around mu.
Let sample from N(μ, σ), μ unknown, σ known.
1 Confidence Intervals for Means. 2 When the sample size n< 30 case1-1. the underlying distribution is normal with known variance case1-2. the underlying.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Statistical Intervals Based on a Single Sample.
Quiz 6 Confidence intervals z Distribution t Distribution.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
STAT 572: Bootstrap Project Group Members: Cindy Bothwell Erik Barry Erhardt Nina Greenberg Casey Richardson Zachary Taylor.
Bootstrap spatobotp ttaoospbr Hesterberger & Moore, chapter 16 1.
Standard Error of the Mean
Statistical Intervals for a Single Sample
Stat13-lecture 25 regression (continued, SE, t and chi-square) Simple linear regression model: Y=  0 +  1 X +  Assumption :  is normal with mean 0.
What if the population standard deviation, , is unknown? We could estimate it by the sample standard deviation s which is the square root of the sample.
SECTION 6.4 Confidence Intervals for Variance and Standard Deviation Larson/Farber 4th ed 1.
Section 7-4 Estimating a Population Mean: σ Not Known.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Dan Piett STAT West Virginia University
1 SAMPLE MEAN and its distribution. 2 CENTRAL LIMIT THEOREM: If sufficiently large sample is taken from population with any distribution with mean  and.
© 2003 Prentice-Hall, Inc.Chap 6-1 Business Statistics: A First Course (3 rd Edition) Chapter 6 Sampling Distributions and Confidence Interval Estimation.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Estimating a Population Variance
Estimating a Population Standard Deviation. Chi-Square Distribution.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Confidence Intervals Lecture 3. Confidence Intervals for the Population Mean (or percentage) For studies with large samples, “approximately 95% of the.
Limits to Statistical Theory Bootstrap analysis ESM April 2006.
Section 9.3: Confidence Interval for a Population Mean.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-5 Estimating a Population Variance.
Lesson 9 - R Chapter 9 Review.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Section 6.3: How to See the Future Goal: To understand how sample means vary in repeated samples.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7.4 Estimation of a Population Mean  is unknown  This section presents.
Chapter 7 Estimates and Sample Sizes 7-1 Overview 7-2 Estimating a Population Proportion 7-3 Estimating a Population Mean: σ Known 7-4 Estimating a Population.
6.4 Confidence Intervals for Variance and Standard Deviation Key Concepts: –Point Estimates for the Population Variance and Standard Deviation –Chi-Square.
Section 6.4 Inferences for Variances. Chi-square probability densities.
Estimating a Population Variance
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Section 7-5 Estimating a Population Variance. MAIN OBJECTIIVES 1.Given sample values, estimate the population standard deviation σ or the population variance.
AP STATISTICS LESSON 11 – 1 (DAY 2) The t Confidence Intervals and Tests.
Confidence Intervals. Point Estimate u A specific numerical value estimate of a parameter. u The best point estimate for the population mean is the sample.
Confidence Intervals and Sample Size. Estimates Properties of Good Estimators Estimator must be an unbiased estimator. The expected value or mean of.
Applied Statistics and Probability for Engineers
Lecture Slides Elementary Statistics Twelfth Edition
Inference for the Mean of a Population
Application of the Bootstrap Estimating a Population Mean
Confidence Interval Estimation
Hypotheses and test procedures
Other confidence intervals
Elementary Statistics
Statistics in Applied Science and Technology
CONCEPTS OF ESTIMATION
Elementary Statistics
Chapter 6 Confidence Intervals.
Confidence Intervals for a Standard Deviation
Simple Linear Regression
IE 355: Quality and Applied Statistics I Confidence Intervals
Determining Which Method to use
Chapter 6 Confidence Intervals.
STATISTICS INFORMED DECISIONS USING DATA
Presentation transcript:

3 pivot quantities on which to base bootstrap confidence intervals Note that the first has a t(n-1) distribution when sampling from a normal population. The second has a chi-square(n-1) distribution when sampling from a normal pop. The third is Fisher’s transformation of the correlation coefficient (r xy ) and it is approx. normal with mean 0 and variance=1/(n-3). These pivot quantities share 3 important properties: They have known distributions They have a parameter that controls their shape This parameter varies with sample size, but the shape will not vary with the value of the parameter being estimated....

As the Student’s t distribution “takes away the mean and divides by the s.e. of the sample mean”, so we codify this process and call it Studentization for any estimator t of . We’ll use the bootstrap form of the Studentized estimate to construct various bootstrap confidence intervals...

So here’s the way to do this for the mean. –compute the sample mean and sample s.d. from the original data –obtain a bootstrap sample of size n from the original data and compute the bootstrap mean, and the bootstrap standard deviation, s b. –now compute the bootstrap t-pivot quantity: –now repeat this step at least 1000 times to get the bootstrap distribution of t b. –let t b,.025 and t b,.975 be the.025 and.975 quantiles of the bootstrap distribution of t b. Construct the 95% confidence interval for  as Now implement this in R... try with the “latch failure” data in Table 1.2.1

Bootstrap confidence intervals for the population standard deviation, , can be done with the  2 -pivot: If the sample is from a normal distribution, then the  2 -pivot has a chi-square distribution with n- 1 degrees of freedom and a 95% confidence interval for sigma can be computed as: If normality is not a reasonable assumption, then we’ll use the bootstrap as follows: –compute the sample variance s 2 for the original sample data. –bootstrap the sample of size n and compute the bootstrap sample variance and the bootstrap  2 - pivot quantity (here, s b 2 is the bootstrap sample variance) –now repeat 1000 times or more to get the bootstrap distribution of the bootstrapped  2 - pivots

–then the bootstrap 95% CI for the population variance is –take the square root if you want confidence intervals for sigma... Implement this in R... again on the Table data. Here are the problems from the textbook that are due at the final exam time. Be sure to write up the solution so I can tell that you understand the methods and the context of the problem. Include appropriate R output to illustrate your solutions... p.106: #6 p.141ff: #4,5,6 (second part), 7,9 p.189ff: #1,3,4,5,7,8,9(a),12,13 p.298ff: #1,5,7 On the last class day I will give you some data that will form the basis for the final exam the next week. See page 258 for a chart of Coverage Percentages in a simulation done by the author for various underlying distributions - make sure you understand the conclusions of that discussion…

See section 8.3 for a discussion of three additional methods for computing bootstrap confidence intervals in more generality… The three methods are: –Percentile method –Residual method –BCA method (BCA=bias corrected and accelerated) Suppose we are trying to estimate the unknown population parameter  - use based on a sample of size n from the population. The percentile method constructs the CI by bootstrapping the estimator to get its distribution - in particular its.025 and.975 quantiles. Construct the 95% CI for  as The residual method estimates  by first getting the bootstrap distribution of the residuals and then constructing the 95% CI for  as

The BCA method is more complex and a discussion of it is given in section If you’re interested in pursuing this method look at the R package called boot. More details can be found in Davison, A.C. and Hinkley, D.V (1997) Bootstrap Methods and Their Applications and Lunneborg, C.E. (2000) Data Analysis by Resampling: Concepts and Applications.