Power and Effect Size Cal State Northridge  320 Andrew Ainsworth PhD.

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

Power of a test. power The power of a test (against a specific alternative value) Is a tests ability to detect a false hypothesis Is the probability that.
Unlocking the Mysteries of Hypothesis Testing
Introduction to Hypothesis Testing
Hypothesis Tests: Two Related Samples AKA Dependent Samples Tests
Sampling Distributions, Hypothesis Testing and One-sample Tests.
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
 When we perform a hypothesis test, we make a decision to either Reject the Null Hypothesis or Fail to Reject the Null Hypothesis.  There is always the.
Statistics 101 Class 8. Overview Hypothesis Testing Hypothesis Testing Stating the Research Question Stating the Research Question –Null Hypothesis –Alternative.
Binomial Distribution & Hypothesis Testing: The Sign Test
Statistical Techniques I EXST7005 Lets go Power and Types of Errors.
Section 9.2: What is a Test of Significance?. Remember… H o is the Null Hypothesis ▫When you are using a mathematical statement, the null hypothesis uses.
Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Power. The Four Components to a Statistical Conclusion The number of units (e.g., people) accessible to study The salience of the program relative to.
Hypothesis Testing: Type II Error and Power.
Major Points An example Sampling distribution Hypothesis testing
PSY 307 – Statistics for the Behavioral Sciences
Major Points Formal Tests of Mean Differences Review of Concepts: Means, Standard Deviations, Standard Errors, Type I errors New Concepts: One and Two.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Chapter 7 Hypothesis Testing 7-1 Overview 7-2 Fundamentals of Hypothesis Testing.
Fundamental Statistics for the Behavioral Sciences, 5th edition David C. Howell Chapter 8 Sampling Distributions and Hypothesis Testing © 2003 Brooks/Cole.
Overview of Statistical Hypothesis Testing: The z-Test
STATISTICS ELEMENTARY MARIO F. TRIOLA Chapter 7 Hypothesis Testing
Business Statistics - QBM117 Introduction to hypothesis testing.
Hypothesis testing is used to make decisions concerning the value of a parameter.
Presented by Mohammad Adil Khan
Hypothesis Testing.
Sections 8-1 and 8-2 Review and Preview and Basics of Hypothesis Testing.
Sampling Distributions and Hypothesis Testing. 2 Major Points An example An example Sampling distribution Sampling distribution Hypothesis testing Hypothesis.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
1 Virtual COMSATS Inferential Statistics Lecture-17 Ossam Chohan Assistant Professor CIIT Abbottabad.
The Probability of a Type II Error and the Power of the Test
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Overview Basics of Hypothesis Testing
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Example 10.1 Experimenting with a New Pizza Style at the Pepperoni Pizza Restaurant Concepts in Hypothesis Testing.
1 Lecture note 4 Hypothesis Testing Significant Difference ©
Hypothesis testing and Decision Making Formal aspects of hypothesis testing.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
Statistical Techniques
Major Steps. 1.State the hypotheses.  Be sure to state both the null hypothesis and the alternative hypothesis, and identify which is the claim. H0H0.
Hypothesis Testing Steps for the Rejection Region Method State H 1 and State H 0 State the Test Statistic and its sampling distribution (normal or t) Determine.
Chapter 9: Hypothesis Tests for One Population Mean 9.2 Terms, Errors, and Hypotheses.
HYPOTHESIS TESTING E. Çiğdem Kaspar, Ph.D, Assist. Prof. Yeditepe University, Faculty of Medicine Biostatistics.
 In a Hypothesis test there are two possible mistakes Null Hypothesis TrueFalse Result of Hypothesis Test Reject Fail to Reject Correct Type I Error.
More about tests and intervals CHAPTER 21. Do not state your claim as the null hypothesis, instead make what you’re trying to prove the alternative. The.
Learning Objectives Describe the hypothesis testing process Distinguish the types of hypotheses Explain hypothesis testing errors Solve hypothesis testing.
Power of a test.
Review and Preview and Basics of Hypothesis Testing
Sampling Distributions and Hypothesis Testing
Chapter 8: Hypothesis Testing and Inferential Statistics
Hypothesis Testing: Hypotheses
MATH 2311 Section 8.2.
A Closer Look at Testing
P-value Approach for Test Conclusion
Unlocking the Mysteries of Hypothesis Testing
Chapter 9 Hypothesis Testing
Chapter 9: Hypothesis Testing
Significance and t testing
Hypothesis Testing.
Statistical Inference for Managers
More About Tests Notes from
Chapter 7: Statistical Issues in Research planning and Evaluation
Power of a test.
Sample Mean Compared to a Given Population Mean
Sample Mean Compared to a Given Population Mean
Chapter 9: Testing a Claim
Statistical Power.
Presentation transcript:

Power and Effect Size Cal State Northridge  320 Andrew Ainsworth PhD

2 Major Points Review What is power? What controls power? Effect size Power for one sample t Power for related-samples t Power for two sample t Psy Cal State Northridge

3 Important Concepts Concepts critical to hypothesis testing –Decision –Type I error –Type II error –Critical values –One- and two-tailed tests Psy Cal State Northridge

4 Decisions When we test a hypothesis we draw a conclusion; either correct or incorrect. –Type I error Reject the null hypothesis when it is actually correct. –Type II error Retain the null hypothesis when it is actually false. Psy Cal State Northridge

5 Type I Errors Null Hypothesis really is true We conclude the null is false. This is a Type I error –Probability set at alpha (  )  usually at.05 –Therefore, probability of Type I error =.05 Psy Cal State Northridge

6 Type II Errors The Alternative Hypothesis is true We conclude that the null is true This is also an error (Type II) –Probability denoted beta (  ) We can’t set beta easily. We’ll talk about this issue later. Power = (1 -  ) = probability of correctly rejecting false null hypothesis. Psy Cal State Northridge

7 Confusion Matrix Psy Cal State Northridge

8 Critical Values These represent the point at which we decide to reject null hypothesis. e.g. We might decide to reject null when (p|null) <.05. –In the null distribution there is some value with p =.05 –We reject when we exceed that value. –That value is called the critical value. Psy Cal State Northridge

9 One- and Two-Tailed Tests Two-tailed test rejects null when obtained value too extreme in either direction –Decide on this before collecting data. One-tailed test rejects null if obtained value is too low (or too high) –We only set aside one direction for rejection. Psy Cal State Northridge

10 One- & Two-Tailed Example One-tailed test –Reject null if IQPLUS group shows an increase in IQ Probably wouldn’t expect a reduction and therefore no point guarding against it. Two-tailed test –Reject null if IQPLUS group has a mean that is substantially higher or lower. Psy Cal State Northridge

11 What Is Power? Probability of rejecting a false H 0 Probability that you’ll find difference that’s really there 1 - , where  = probability of Type II error Psy Cal State Northridge

12 What Controls Power? The significance level (  ) True difference between null and alternative hypotheses  1 -  2 Sample size Population variance The particular test being used Psy Cal State Northridge

13 Distributions Under  1 and  0 Psy Cal State Northridge

14 Effect Size The degree to which the null is false –Depends on distance between    and   –Also depends on standard error (of mean or of difference between means) Psy Cal State Northridge

15 What happened to n? It doesn’t relate to how different the two population means are. It controls power, but not effect size. We will add it in later. Psy Cal State Northridge

16 Estimating Effect Size Judge your effect size by: –Past research –What you consider important –Cohen’s conventions Psy Cal State Northridge

17 Combining Effect Size and n We put them together and then evaluate power from the result. General formula for Delta –where f (n) is some function of n that will depend on the type of design Psy Cal State Northridge

18 Power for One-Sample or Related samples t First calculate delta with: –where n = size of sample, and  and  as above Look power up in table using  and significance level (  ) Psy Cal State Northridge

19 Power for Single Sample IQPLUS Study One sample z and t –Compared IQPLUS group with population mean = 100, sigma = 10 –Used 25 subjects –We got a sample mean of 106 and s = 7.78 Psy Cal State Northridge

20 IQPLUS Assuming we don’t know sigma –  = 0.77 – n = 25 – –We are testing at  =.05 –Use Appendix D.5 Psy Cal State Northridge

21 Appendix D.5 This table is severely abbreviated. Power for  = 3.85,  =.05 Psy Cal State Northridge

22 Conclusions If we can trust our estimates in the IQPLUS study then if this study were run repeatedly, 97% of the time the result would be significant. Psy Cal State Northridge

23 How Many Subjects Do I Really Need (Single/Related Sample(s))? Run calculations backward –Start with anticipated effect size (  ) –Determine  required for power =.80. Why.80? –Calculate n What if we wanted to rerun the IQPLUS study, and wanted power =.80? Psy Cal State Northridge

24 Calculating n We estimated =.77 Complete Appendix D.5 shows we need  = 2.80 Calculations on next slide Psy Cal State Northridge

25 IQPLUS n Psy Cal State Northridge

26 Power for Two Independent Groups What changes from preceding? –Effect size deals with two sample means –Take into account both values of n Effect size Psy Cal State Northridge

27 Estimating d We could calculate d  directly if we knew populations. We could estimate from previous data. Here we will calculate using Violent Video Games example Psy Cal State Northridge

28 Example: Violent Videos Games Two independent randomly selected/assigned groups –GTA (violent: 8 subjects) VS. NBA 2K7 (non- violent: 10 subjects) –We want to compare mean number of aggressive behaviors following game play –GTA had a mean of and s = –NBA 2K7 had a mean of 8.4 and s = –s 2 pooled = 2.745, so s pooled = Psy Cal State Northridge

29 Two Independent Groups Then calculate  from effect size Note: The above formula assumes that the 2 groups have equal n Psy Cal State Northridge

30 Two Independent Groups Our data do not have equal n, but let’s pretend they do for a moment (both 10) For our data Psy Cal State Northridge

31 Appendix D.5 This table is severely abbreviated. Power for  = 2.5,  =.05 Estimate =.71 Psy Cal State Northridge

32 Conclusions If we had equal n and we can trust our estimates in the violent video game study then if this study were run repeatedly, 71% of the time the result would be significant. Psy Cal State Northridge

33 Unequal Sample Sizes With unequal samples use harmonic mean of sample sizes Where k is the number of groups (i.e. 2), n i is each group size Standard arithmetic average will work well if n ’s are close. Psy Cal State Northridge

34 Two Independent Groups Our data do not have equal n, so we need to find the harmonic mean For our data Psy Cal State Northridge

35 Two Independent Groups Our data do not have equal n, so… For our data Psy Cal State Northridge

36 Appendix D.5 This table is severely abbreviated. Psy Cal State Northridge Power for  = 2.4,  =.05 Estimate =.67

37 Conclusions If we we can trust our estimates in the violent video game study then if this study were run repeatedly, 67% of the time the result would be significant. Psy Cal State Northridge

38 How Many Subjects Do I Really Need (Independent Samples)? Run calculations backward –Start with anticipated effect size (  ) –Determine  required for power =.80. Why.80? –Calculate n What if we wanted to rerun the violent video game study, and wanted power =.80? Psy Cal State Northridge

39 Calculating n We estimated  = Complete Appendix E.5 shows we need  = 2.80 Calculations on next slide Psy Cal State Northridge

40 Violent Video Games n Psy Cal State Northridge