Bootstrapping and Bootstrapping Regression Models

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Lesson 10: Linear Regression and Correlation
Uncertainty in fall time surrogate Prediction variance vs. data sensitivity – Non-uniform noise – Example Uncertainty in fall time data Bootstrapping.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Ch11 Curve Fitting Dr. Deshi Ye
Simple Linear Regression
Multiple regression analysis
Chapter 10 Simple Regression.
Linear Regression with One Regression
T-test.
2008 Chingchun 1 Bootstrap Chingchun Huang ( 黃敬群 ) Vision Lab, NCTU.
Chapter 11 Multiple Regression.
Quantitative Business Analysis for Decision Making Simple Linear Regression.
Simple Linear Regression and Correlation
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-3 Regression.
5-1 Introduction 5-2 Inference on the Means of Two Populations, Variances Known Assumptions.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Case Selection and Resampling Lucila Ohno-Machado HST951.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
1/61: Topic 1.2 – Extensions of the Linear Regression Model Microeconometric Modeling William Greene Stern School of Business New York University New York.
Bias-Variance Analysis in Regression  True function is y = f(x) +  where  is normally distributed with zero mean and standard deviation .  Given a.
Lecture Slides Elementary Statistics Twelfth Edition
Inference about the slope parameter and correlation
Introduction For inference on the difference between the means of two populations, we need samples from both populations. The basic assumptions.
Linear Regression with One Regression
Chapter 14 Inference on the Least-Squares Regression Model and Multiple Regression.
ESTIMATION.
Point and interval estimations of parameters of the normally up-diffused sign. Concept of statistical evaluation.
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Regression Analysis Part D Model Building
Inference for Regression
Lecture 19.
Simple Linear Regression - Introduction
Lecture Slides Elementary Statistics Thirteenth Edition
Correlation and Regression
CHAPTER 29: Multiple Regression*
Simulation: Sensitivity, Bootstrap, and Power
CHAPTER 26: Inference for Regression
Chapter 12 Inference on the Least-squares Regression Line; ANOVA
Chapter 12 Regression.
CONCEPTS OF ESTIMATION
LESSON 24: INFERENCES USING REGRESSION
Bootstrap - Example Suppose we have an estimator of a parameter and we want to express its accuracy by its standard error but its sampling distribution.
Chapter 25: Paired Samples and Blocks
Regression Models - Introduction
BOOTSTRAPPING: LEARNING FROM THE SAMPLE
Microeconometric Modeling
Interval Estimation and Hypothesis Testing
Regression Chapter 8.
Correlation and Regression
Elementary Statistics: Looking at the Big Picture
Simple Linear Regression
Copyright © Cengage Learning. All rights reserved.
Basic Practice of Statistics - 3rd Edition Inference for Regression
SIMPLE LINEAR REGRESSION
CHAPTER 12 More About Regression
Simple Linear Regression
SIMPLE LINEAR REGRESSION
Chapter 13 Additional Topics in Regression Analysis
Regression and Correlation of Data
Regression and Correlation of Data
Presentation transcript:

Bootstrapping and Bootstrapping Regression Models Tong Shan 2019.4

Outline 1. Advantages of bootstrapping 2. Bootstrapping method 3. Bootstrapping regression models 4. Concluding Marked

A number of advantages The bootstrap is quite general. Because it does not require distributional assumptions, the bootstrap can provide more accurate inferences when the data are not well behaved or when the sample size is small. It is possible to apply the bootstrap to statistics with sampling distributions that are difficult to derive. It is simple to apply the bootstrap to complex data-collection plans.

Bootstrapping Basic ------ Husbands’ and Wives’ Incomes Data

We saw first four first. We calculated the confidence interval. Too wide!!! Includes 0!!!

Picture of bootstrapping! If we use bootstrap, we can have 4 4 =256 different bootstrap samples, each selected with probability 1/256 The mean of the 256 bootstrap sample means is just the original sample mean The standard deviation of the bootstrap means is 𝑛 𝑛 rather than by 𝑛 𝑛 −1because the bootstrap sample means is known, not estimated SE =2.015 Better!!!

Bootstrapping procedure 1. Let r denote the number of bootstrap replications—that is, the number of bootstrap samples to be selected. 2. For each bootstrap sample b = 1, . . . , r, randomly draw n observations with replacement from among the n sample values, and calculate the bootstrap sample mean 3. From the r bootstrap samples, estimate the standard deviation of the bootstrap means

R code for bootstrapping I used R to solve this problem to get much better result using bootstrapping.

Bootstrapping Regression Models The most straightforward approach is to collect the response-variable value and regressors for each observation These z can be resampled an the regression estimator computed for each of the resulting bootstrap samples. producing r sets of bootstrap regression coefficients, The methods of the previous slides can be applied to compute standard errors or confidence intervals for the regression estimates. But if the data derive from an experimental design, we may want to treat the Xs as fixed!

Bootstrapping regression models procedure in the case of linear regression 1. Estimate the regression coefficients for the original sample, and calculate the fitted value and residual for each observation: 2. Select bootstrap samples of the residuals, and from these, calculate bootstrapped Y values, where 3. Regress the bootstrapped Y values on the fixed X values to obtain bootstrap regression coefficients. If estimates are calculated by least-squares regression, then where b= 1, 2, … , r. 4. The resampled can be used to construct bootstrap standard errors and confidence intervals for the regression coefficients.

Examples Duncan’s regression of occupational prestige on the income and educational levels of 45 U. S. occupations Model:

Using random resampling, drew r = 2, 000 bootstrap samples, calculating the estimator for each bootstrap sample. The distributions of the bootstrapped regression coefficients for income and education are graphed It is clear that the income and education coefficients are strongly negatively correlated

Example: Survival of Rats after Radiation Doses The data measured the survival percentages of batches of rats who were given varying doses of radiation. At each of 6 doses there were two or three replications of the experiment. dose: The dose of radiation administered (rads). surv: The survival rate of the batches expressed as a percentage.

Example 2(survival data) paired bootstrapping

based on residuals

Concluding Remarks Bootstrap can also be used in Estimation of Bias, Bootstrap Confidence Intervals, Bootstrap Hypothesis Tests Although the bootstrap is so simple and of such broad application, it has some disadvantages so that it isn’t used more in the social sciences.

Thanks!