The Sample Variation Method A way to select sample size This slide show is a free open source document. See the last slide for copyright information.

Slides:



Advertisements
Similar presentations
STA305 Spring 2014 This started with excerpts from STA2101f13
Advertisements

Factorial ANOVA More than one categorical explanatory variable.
Introduction to Regression with Measurement Error STA431: Spring 2015.
Exploratory Factor Analysis
Sample Size Power and other methods. Non-central Chisquare.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
Logistic Regression STA302 F 2014 See last slide for copyright information 1.
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
© 2003 Prentice-Hall, Inc.Chap 14-1 Basic Business Statistics (9 th Edition) Chapter 14 Introduction to Multiple Regression.
Understanding Statistics in Research
Chapter 9 Hypothesis Testing.
Using Statistics in Research Psych 231: Research Methods in Psychology.
Introduction to Regression with Measurement Error STA431: Spring 2013.
Lecture 16 Correlation and Coefficient of Correlation
Warm-up Day of 8.1 and 8.2 Quiz and Types of Errors Notes.
Sections 8-1 and 8-2 Review and Preview and Basics of Hypothesis Testing.
STT 315 This lecture is based on Chapter 6. Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing.
Normal Linear Model STA211/442 Fall Suggested Reading Faraway’s Linear Models with R, just start reading. Davison’s Statistical Models, Chapter.
Double Measurement Regression STA431: Spring 2013.
SAS: The last of the great mainframe stats packages STA431 Winter/Spring 2015.
4.2 One Sided Tests -Before we construct a rule for rejecting H 0, we need to pick an ALTERNATE HYPOTHESIS -an example of a ONE SIDED ALTERNATIVE would.
Maximum Likelihood See Davison Ch. 4 for background and a more thorough discussion. Sometimes.
Introduction to Regression with Measurement Error STA302: Fall/Winter 2013.
Overview Basics of Hypothesis Testing
Understanding the Variability of Your Data: Dependent Variable Two "Sources" of Variability in DV (Response Variable) –Independent (Predictor/Explanatory)
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Hypothesis Testing PowerPoint Prepared by Alfred.
Hypothesis testing Chapter 9. Introduction to Statistical Tests.
Section 12.1 Scatter Plots and Correlation HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems,
Factorial ANOVA More than one categorical explanatory variable STA305 Spring 2014.
Logistic Regression STA2101/442 F 2014 See last slide for copyright information.
Author(s): Brenda Gunderson, Ph.D., 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
Section 9.3 ~ Hypothesis Tests for Population Proportions Introduction to Probability and Statistics Ms. Young.
SAS: The last of the great mainframe stats packages STA431 Winter/Spring 2013.
Introduction to sample size and power calculations Afshin Ostovar Bushehr University of Medical Sciences.
Lecture 18 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Chapter 16 Data Analysis: Testing for Associations.
AP Statistics Section 11.4 B
Confirmatory Factor Analysis Part Two STA431: Spring 2013.
Categorical Independent Variables STA302 Fall 2013.
STA302: Regression Analysis. Statistics Objective: To draw reasonable conclusions from noisy numerical data Entry point: Study relationships between variables.
Categorical Independent Variables STA302 Fall 2015.
AP Statistics Section 11.4 B. A significance test makes a Type I error when ___________________________________ P(Type 1 error ) = ___ A significance.
Tests of Significance: The Basics ESS chapter 15 © 2013 W.H. Freeman and Company.
Another way to select sample size The sample R-squared method.
STA302: Regression Analysis. Statistics Objective: To draw reasonable conclusions from noisy numerical data Entry point: Study relationships between variables.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 6 –Multiple hypothesis testing Marshall University Genomics.
Chapter 9 Minitab Recipe Cards. Contingency tests Enter the data from Example 9.1 in C1, C2 and C3.
The Idea of the Statistical Test. A statistical test evaluates the "fit" of a hypothesis to a sample.
Logistic Regression For a binary response variable: 1=Yes, 0=No This slide show is a free open source document. See the last slide for copyright information.
Confidence Intervals and Hypothesis Tests Week 5.
Scatter Plots and Correlation
SAS: The last of the great mainframe stats packages
Chapter 11 Analysis of Covariance
Exploratory Factor Analysis
Review and Preview and Basics of Hypothesis Testing
Logistic Regression STA2101/442 F 2017
Poisson Regression STA 2101/442 Fall 2017
Sample Power.
Interactions and Factorial ANOVA
Logistic Regression Part One
STA302: Regression Analysis
Multinomial Logit Models
Understanding How Hypothesis Testing Works
Inferential Statistics
Unit 9 Quiz: Review questions
Exploratory Factor Analysis. Factor Analysis: The Measurement Model D1D1 D8D8 D7D7 D6D6 D5D5 D4D4 D3D3 D2D2 F1F1 F2F2.
Statistical Power.
Presentation transcript:

The Sample Variation Method A way to select sample size This slide show is a free open source document. See the last slide for copyright information.

The sample variation method is not traditional Statisticians usually recommend that sample size be based on a power analysis. Statistical power is the probability of rejecting the null hypothesis when the null hypothesis is false. Power depends on sample size and how wrong H 0 is (effect size).

How to select sample size by power Pick an effect size you’d like to be able to detect. It should be just over the boundary of interesting and meaningful. Pick a desired power – a probability with which you’d like to be able to detect the effect by rejecting the null hypothesis. Start with a fairly small n and calculate the power. Increase the sample size until the desired power is reached.

Power analysis is a Greek letter story It’s a fairly long story too. Best left to STA305. The sample variation method is quick and easy.

F test is based upon Increase in explained variation expressed as a fraction of the variation that the reduced model does not explain.

For any given sample size, the bigger a is, the bigger F becomes. For any a ≠0, F increases as a function of n. So you can get a large F from strong results and a small sample, or from weak results and a large sample.

The sample variation method is to choose a value of a that is just large enough to be interesting, and increase n, calculating F and its p-value each time until p < 0.05; then stop. The final value of n is the smallest sample size for which an effect explaining that much of the remaining variation will be significant. With that sample size, the effect will be significant if and only if it explains a or more of the remaining variation. That's all there is to it. You tell me a proportion of remaining variation that you want to be statistically significant, and I'll tell you a sample size.

Example Suppose we are planning a 2x3x4 analysis of covariance, with two covariates, and factors named A, B and C. We are setting it up as a regression model, with one dummy variable for A, 2 dummy variables for B, and 3 for C. Interactions are represented by product terms, and there are 2 products for the AxB interaction, 3 for AxC, 6 for BxC, and 1*2*3 = 6 for AxBxC. The regression coefficients for these plus two for the covariates and one for the intercept give us p = 26. The null hypothesis is that of no BxC interaction, so r = 6. The "other effects in the model" for which we are "controlling" are represented by 2 covariates and 17 dummy variables and products of dummy variables.

Potato Example In the potato data, there are 3 potatoes per treatment combination in a Temperature (2 levels) by Bacteria type (3 levels) by Oxygen level (3 levels) design. What pro- portion of remaining variation is required for the main effect of bacteria type to be significant?

Copyright Information This slide show was prepared by Jerry Brunner, Department of Statistical Sciences, University of Toronto. It is licensed under a Creative Commons Attribution - ShareAlike 3.0 Unported License. Use any part of it as you like and share the result freely. These Powerpoint slides are available from the course website: