Bootstrapping Jackknifing

Slides:



Advertisements
Similar presentations
Uncertainty in fall time surrogate Prediction variance vs. data sensitivity – Non-uniform noise – Example Uncertainty in fall time data Bootstrapping.
Advertisements

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Simple Linear Regression Estimates for single and mean responses.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Chapter 10 Simple Regression.
Confidence intervals. Population mean Assumption: sample from normal distribution.
Chapter 11 Multiple Regression.
Bootstrapping LING 572 Fei Xia 1/31/06.
Bagging LING 572 Fei Xia 1/24/06. Ensemble methods So far, we have covered several learning methods: FSA, HMM, DT, DL, TBL. Question: how to improve results?
Statistics 350 Lecture 17. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Correlation and Regression Analysis
Simple Linear Regression and Correlation
Simple Linear Regression Analysis
Standard error of estimate & Confidence interval.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.2 Estimating Differences.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Stats for Engineers Lecture 9. Summary From Last Time Confidence Intervals for the mean t-tables Q Student t-distribution.
Model Building III – Remedial Measures KNNL – Chapter 11.
Bootstrapping (And other statistical trickery). Reminder Of What We Do In Statistics Null Hypothesis Statistical Test Logic – Assume that the “no effect”
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Using Resampling Techniques to Measure the Effectiveness of Providers in Workers’ Compensation Insurance David Speights Senior Research Statistician HNC.
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
Limits to Statistical Theory Bootstrap analysis ESM April 2006.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Agresti/Franklin Statistics, 1 of 88  Section 11.4 What Do We Learn from How the Data Vary Around the Regression Line?
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Case Selection and Resampling Lucila Ohno-Machado HST951.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
BUSINESS MATHEMATICS & STATISTICS. Module 6 Correlation ( Lecture 28-29) Line Fitting ( Lectures 30-31) Time Series and Exponential Smoothing ( Lectures.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
Lecturer: Ing. Martina Hanová, PhD..  How do we evaluate a model?  How do we know if the model we are using is good?  assumptions relate to the (population)
STA302/1001 week 11 Regression Models - Introduction In regression models, two types of variables that are studied:  A dependent variable, Y, also called.
Lecture Slides Elementary Statistics Twelfth Edition
Multiple Regression.
More on Inference.
Application of the Bootstrap Estimating a Population Mean
Bootstrap in refinement
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Kaniz Rashid Lubana Mamun MS Student: CSU Hayward Dr. Eric A. Suess
Ch12.1 Simple Linear Regression
Chapter Six Normal Curves and Sampling Probability Distributions
Simultaneous equation system
Simple Linear Regression - Introduction
Ungraded quiz Unit 6.
Correlation and Simple Linear Regression
R Data Manipulation Bootstrapping
Estimates of Bias & The Jackknife
Regression Analysis Week 4.
CHAPTER 29: Multiple Regression*
More on Inference.
Data Analysis and Statistical Software I ( ) Quarter: Autumn 02/03
Linear Regression.
Multiple Regression.
Regression Models - Introduction
The Regression Model Suppose we wish to estimate the parameters of the following relationship: A common method is to choose parameters to minimise the.
Sampling Distribution
Sampling Distribution
Correlation and Simple Linear Regression
QQ Plot Quantile to Quantile Plot Quantile: QQ Plot:
Simple Linear Regression
Cross-validation Brenda Thomson/ Peter Fox Data Analytics
Checking the data and assumptions before the final analysis.
Bootstrapping and Bootstrapping Regression Models
Regression Models - Introduction
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Bootstrapping Jackknifing Resampling Bootstrapping Jackknifing 2/22/2019 AGR206

Purpose of resampling When the distributions of the estimated parameters are not known, either because The distribution of the original variable is not known, or There is no mathematical formula or rationale to relate the distribution of the estimator to the distribution of the original variables. Example: there is no “formula” for the distribution of the product of two normal distributions! 2/22/2019 AGR206

When to use resampling When distributions are unknown/not met. Robust regression. Weighted regression. When formulas are unknown/cumbersome. Unusual ecological indices. 2/22/2019 AGR206

Common resampling techniques BOOTSTRAPPING Random resampling with replacement. Gives bias, s. error and CI’s. Uses sample as proxy of population and repeats sampling process exactly the same way as for the original sample to obtain hundreds of bootstrap samples. JACKKNIFING No random component. Gives bias & s. error, but no CI. Hold out one observation at a time and recalculate statistics for each subset. 2/22/2019 AGR206

Assumptions: Observations of the original sample are independent. Sample is not small (n>30). 2/22/2019 AGR206

Bootstrapping equations: Take a statistic π(x) where x are the values of a measure in the sample. For example when π(x)=Sx/n, π is the average. Let x denote the original sample and xi* denote a bootstrap sample. Average bias=average π(xi*)-π(x). Estimated variance S2π= S[π(xi*)-avg π(xi*)]2/(B-1) where B is the number of bootstrap samples. 2/22/2019 AGR206

Potential problems Small sample size. Computationally demanding. Lack of user-friendly software for general purpose. Bootstrapping for CI requires many more bootstrap samples and has to be corrected for bias. 2/22/2019 AGR206

Example Multiple Linear regression Fixed X case. Resample from the residuals and add to predicted values. Random X case. Resample from the observations, taking each as a unit. 2/22/2019 AGR206