Another Example Consider a very popular computer game played by millions of people all over the world. The average score of the game is known to be

Slides:



Advertisements
Similar presentations
Power of a test. power The power of a test (against a specific alternative value) Is a tests ability to detect a false hypothesis Is the probability that.
Advertisements

Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
 When we perform a hypothesis test, we make a decision to either Reject the Null Hypothesis or Fail to Reject the Null Hypothesis.  There is always the.
More About Type I and Type II Errors. O.J. Simpson trial: the situation O.J. is assumed innocent. Evidence collected: size 12 Bruno Magli bloody footprint,
CHAPTER 21 More About Tests: “What Can Go Wrong?”.
+ Chapter 10 Section 10.4 Part 2 – Inference as Decision.
Chapter 7(7b): Statistical Applications in Traffic Engineering Chapter objectives: By the end of these chapters the student will be able to (We spend 3.
Introduction to Hypothesis Testing
Lec 6, Ch.5, pp90-105: Statistics (Objectives) Understand basic principles of statistics through reading these pages, especially… Know well about the normal.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Power of a test. power The power of a test (against a specific alternative value) Is the probability that the test will reject the null hypothesis when.
Power of a Test Notes: P 183 and on your own paper.
Overview of Statistical Hypothesis Testing: The z-Test
Errors in Hypothesis Tests. When you perform a hypothesis test you make a decision: When you make one of these decisions, there is a possibility that.
The Probability of a Type II Error and the Power of the Test
LECTURE 19 THURSDAY, 14 April STA 291 Spring
Errors in Hypothesis Tests. When you perform a hypothesis test you make a decision: When you make one of these decisions, there is a possibility that.
Introduction to inference Use and abuse of tests; power and decision IPS chapters 6.3 and 6.4 © 2006 W.H. Freeman and Company.
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
Errors & Power. 2 Results of Significant Test 1. P-value < alpha 2. P-value > alpha Reject H o & conclude H a in context Fail to reject H o & cannot conclude.
 When we perform a hypothesis test, we make a decision to either Reject the Null Hypothesis or Fail to Reject the Null Hypothesis.  There is always the.
Errors in Hypothesis Tests. When you perform a hypothesis test you make a decision: When you make one of these decisions, there is a possibility that.
Errors in Hypothesis Tests Notes: P When you perform a hypothesis test you make a decision: When you make one of these decisions, there is a possibility.
Power of a Hypothesis test. H 0 True H 0 False Reject Fail to reject Type I Correct Type II Power   Suppose H 0 is true – what if we decide to fail.
Errors in Hypothesis Tests. When you perform a hypothesis test you make a decision: When you make one of these decisions, there is a possibility that.
Education 793 Class Notes Decisions, Error and Power Presentation 8.
AP Statistics Section 11.4 B
Type I and Type II Errors. An analogy may help us to understand two types of errors we can make with inference. Consider the judicial system in the US.
1 Hypothesis Testing A criminal trial is an example of hypothesis testing. In a trial a jury must decide between two hypotheses. The null hypothesis is.
Section 9.1(re-visited)  Making Sense of Statistical Significance  Inference as Decision.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
Goals… Define what is meant by a Type I error. Define what is meant by a Type II error. Define what is meant by the power of a test. Identify the relationship.
1 Hypothesis Tests on the Mean H 0 :  =  0 H 1 :    0.
Introduction to Inference Tests of Significance Errors in the justice system Actual truth Jury decision GuiltyNot guilty Guilty Not guilty Correct decision.
AP Statistics Section 11.4 B. A significance test makes a Type I error when ___________________________________ P(Type 1 error ) = ___ A significance.
Type I and Type II Errors. For type I and type II errors, we must know the null and alternate hypotheses. H 0 : µ = 40 The mean of the population is 40.
Aims: To understand the difference between a one-tail and two tail test. To be able to formulate a null and alternative hypothesis. To be able to carry.
Power of a test. power The power of a test (against a specific alternative value) Is In practice, we carry out the test in hope of showing that the null.
Chapter 9: Hypothesis Tests for One Population Mean 9.2 Terms, Errors, and Hypotheses.
Type I and II Errors Power.  Better Batteries a) What conclusion can you make for the significance level α = 0.05? b) What conclusion can you make for.
Lesson 2: Section 9.1 (part 2).  Interpret a Type I Error and a Type II Error in context, and give the consequences of each.  Understand the relationship.
C HAPTER 2  Hypothesis Testing -Test for one means - Test for two means -Test for one and two proportions.
Critical Appraisal Course for Emergency Medicine Trainees Module 2 Statistics.
+ Homework 9.1:1-8, 21 & 22 Reading Guide 9.2 Section 9.1 Significance Tests: The Basics.
Unit 9.1B: Testing a Claim Significance Tests: The Basics Using α-levels Type I and Type II Errors The Power of a Test 1.
Errors in Hypothesis Tests
Section Testing a Proportion
Power of a test.
Hypothesis Testing and Confidence Intervals (Part 2): Cohen’s d, Logic of Testing, and Confidence Intervals Lecture 9 Justin Kern October 17 and 19, 2017.
Example: Propellant Burn Rate
Power of a test.
Power of a test.
P-value Approach for Test Conclusion
Introduction to Inference
Power of a Test.
Introduction to Inference
Chapter 11: Introduction to Hypothesis Testing Lecture 5c
Power of a test.
Power of a test.
Power of a Hypothesis Test
Power of a test.
Hypothesis Testing and Confidence Intervals (Part 2): Cohen’s d, Logic of Testing, and Confidence Intervals Lecture 9 Justin Kern April 9, 2018.
Confidence Intervals.
Chapter 9: Testing a Claim
  Pick a card….
Power of a test.
Power Problems.
Inference as Decision Section 10.4.
Power and Error What is it?.
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

Another Example Consider a very popular computer game played by millions of people all over the world. The average score of the game is known to be 𝜇=5000 and the standard deviation is known to be σ=1000. Suppose the game developers have just created an updated version and want to know if it is more difficult than the original version. They plan to conduct a trial of the updated version on a random sample of 𝑛=100 players and perform a hypothesis test on the sample mean using 𝛼=0.05. 𝐻 0 : 𝜇=5000 𝐻 1 : 𝜇<5000 What is the probability of a Type I error (rejecting 𝐻 0 when, in fact, it is true)? P(Type I Error) =𝛼=0.05 Suppose the updated version actually is more difficult, and the true mean is 𝜇 1 =4700. What is the probability of a Type II error (accepting 𝐻 0 when, in fact, it is false)? 𝑥 𝛼 = −𝑧 𝛼 𝜎 𝑛 + 𝜇 0 → 𝑥 0.05 =−1.645 100 +5000=4835.5 𝑧 𝛽 = 𝑥 𝛼 − 𝜇 1 𝜎/ 𝑛 = 4835.5−4700 100 =1.355 𝛽=𝑃 𝑍> 𝑧 𝛽 =𝑃 𝑍>1.355 =0.0877 𝐻 1 𝐻 0 𝜇 1 =4700 𝜎/ 𝑛 =100 𝜇 0 =5000 𝜎/ 𝑛 =100 𝜶=𝟎.𝟎𝟓 𝜷=𝟎.𝟎𝟖𝟕𝟕 𝑥 0.05 =4835.5

Another Example Consider a very popular computer game played by millions of people all over the world. The average score of the game is known to be 𝜇=5000 and the standard deviation is known to be σ=1000. Suppose the game developers have just created an updated version and want to know if it is more difficult than the original version. They plan to conduct a trial of the updated version on a random sample of 𝑛=100 players and perform a hypothesis test on the sample mean using 𝛼=0.05. 𝐻 0 : 𝜇=5000 𝐻 1 : 𝜇<5000 What is the probability of a Type I error (rejecting 𝐻 0 when, in fact, it is true)? P(Type I Error) =𝛼=0.05 Suppose the updated version actually is more difficult, and the true mean is 𝜇 1 =4700. What is the probability of a Type II error (accepting 𝐻 0 when, in fact, it is false)? 𝑥 𝛼 = −𝑧 𝛼 𝜎 𝑛 + 𝜇 0 → 𝑥 0.05 =−1.645 100 +5000=4835.5 𝑧 𝛽 = 𝑥 𝛼 − 𝜇 1 𝜎/ 𝑛 = 4835.5−4700 100 =1.355 𝛽=𝑃 𝑍> 𝑧 𝛽 =𝑃 𝑍>1.355 =0.0877 Power = 1−𝛽=1−0.0877=0.9123 𝐻 1 𝐻 0 Power =𝟎.𝟗𝟏𝟐𝟑 𝜇 1 =4700 𝜎/ 𝑛 =100 𝜇 0 =5000 𝜎/ 𝑛 =100 𝜷=𝟎.𝟎𝟖𝟕𝟕 𝑥 0.05 =4835.5

Same Example, Smaller 𝛼 Consider a very popular computer game played by millions of people all over the world. The average score of the game is known to be 𝜇=5000 and the standard deviation is known to be σ=1000. Suppose the game developers have just created an updated version and want to know if it is more difficult than the original version. They plan to conduct a trial of the updated version on a random sample of 𝑛=100 players and perform a hypothesis test on the sample mean using 𝛼=0.01. 𝐻 0 : 𝜇=5000 𝐻 1 : 𝜇<5000 What is the probability of a Type I error (rejecting 𝐻 0 when, in fact, it is true)? P(Type I Error) =𝛼=0.01 Suppose the updated version actually is more difficult, and the true mean is 𝜇 1 =4700. What is the probability of a Type II error (accepting 𝐻 0 when, in fact, it is false)? 𝑥 𝛼 = −𝑧 𝛼 𝜎 𝑛 + 𝜇 0 → 𝑥 0.01 =−2.326 100 +5000=4767.4 𝑧 𝛽 = 𝑥 𝛼 − 𝜇 1 𝜎/ 𝑛 = 4767.4−4700 100 =0.674 𝛽=𝑃 𝑍> 𝑧 𝛽 =𝑃 𝑍>0.674 =0.2502 𝐻 1 𝐻 0 𝜇 1 =4700 𝜎/ 𝑛 =100 𝜇 0 =5000 𝜎/ 𝑛 =100 𝜷=𝟎.𝟐𝟓𝟎𝟐 𝜶=𝟎.𝟎𝟏 𝑥 0.01 =4767.4

Same Example, Smaller 𝛼 Consider a very popular computer game played by millions of people all over the world. The average score of the game is known to be 𝜇=5000 and the standard deviation is known to be σ=1000. Suppose the game developers have just created an updated version and want to know if it is more difficult than the original version. They plan to conduct a trial of the updated version on a random sample of 𝑛=100 players and perform a hypothesis test on the sample mean using 𝛼=0.01. 𝐻 0 : 𝜇=5000 𝐻 1 : 𝜇<5000 What is the probability of a Type I error (rejecting 𝐻 0 when, in fact, it is true)? P(Type I Error) =𝛼=0.01 Suppose the updated version actually is more difficult, and the true mean is 𝜇 1 =4700. What is the probability of a Type II error (accepting 𝐻 0 when, in fact, it is false)? 𝑥 𝛼 = −𝑧 𝛼 𝜎 𝑛 + 𝜇 0 → 𝑥 0.01 =−2.326 100 +5000=4767.4 𝑧 𝛽 = 𝑥 𝛼 − 𝜇 1 𝜎/ 𝑛 = 4767.4−4700 100 =0.674 𝛽=𝑃 𝑍> 𝑧 𝛽 =𝑃 𝑍>0.674 =0.2502 Power = 1−𝛽=1−0.2502=0.7498 𝐻 1 𝐻 0 Power =𝟎.𝟕𝟒𝟗𝟖 𝜇 1 =4700 𝜎/ 𝑛 =100 𝜇 0 =5000 𝜎/ 𝑛 =100 𝜷=𝟎.𝟐𝟓𝟎𝟐 𝑥 0.01 =4767.4

Same Example, Larger n Consider a very popular computer game played by millions of people all over the world. The average score of the game is known to be 𝜇=5000 and the standard deviation is known to be σ=1000. Suppose the game developers have just created an updated version and want to know if it is more difficult than the original version. They plan to conduct a trial of the updated version on a random sample of 𝑛=200 players and perform a hypothesis test on the sample mean using 𝛼=0.01. 𝐻 0 : 𝜇=5000 𝐻 1 : 𝜇<5000 What is the probability of a Type I error (rejecting 𝐻 0 when, in fact, it is true)? P(Type I Error) =𝛼=0.01 Suppose the updated version actually is more difficult, and the true mean is 𝜇 1 =4700. What is the probability of a Type II error (accepting 𝐻 0 when, in fact, it is false)? 𝑥 𝛼 = −𝑧 𝛼 𝜎 𝑛 + 𝜇 0 → 𝑥 0.01 =−2.326 1000 200 +5000=4835.5 𝑧 𝛽 = 𝑥 𝛼 − 𝜇 1 𝜎/ 𝑛 = 4835.5−4700 1000/ 200 =1.916 𝛽=𝑃 𝑍> 𝑧 𝛽 =𝑃 𝑍>1.916 =0.0277 𝐻 1 𝐻 0 𝜇 1 =4700 𝜎/ 𝑛 =70.71 𝜇 0 =5000 𝜎/ 𝑛 =70.71 𝜶=𝟎.𝟎𝟏 𝜷=𝟎.𝟎𝟐𝟕𝟕 𝑥 0.01 =4835.5

Same Example, Larger n Consider a very popular computer game played by millions of people all over the world. The average score of the game is known to be 𝜇=5000 and the standard deviation is known to be σ=1000. Suppose the game developers have just created an updated version and want to know if it is more difficult than the original version. They plan to conduct a trial of the updated version on a random sample of 𝑛=200 players and perform a hypothesis test on the sample mean using 𝛼=0.01. 𝐻 0 : 𝜇=5000 𝐻 1 : 𝜇<5000 What is the probability of a Type I error (rejecting 𝐻 0 when, in fact, it is true)? P(Type I Error) =𝛼=0.01 Suppose the updated version actually is more difficult, and the true mean is 𝜇 1 =4700. What is the probability of a Type II error (accepting 𝐻 0 when, in fact, it is false)? 𝑥 𝛼 = −𝑧 𝛼 𝜎 𝑛 + 𝜇 0 → 𝑥 0.01 =−2.326 1000 200 +5000=4835.5 𝑧 𝛽 = 𝑥 𝛼 − 𝜇 1 𝜎/ 𝑛 = 4835.5−4700 1000/ 200 =1.916 𝛽=𝑃 𝑍> 𝑧 𝛽 =𝑃 𝑍>1.916 =0.0277 Power = 1−𝛽=1−0.2502=0.9723 𝐻 1 𝐻 0 Power =𝟎.𝟗𝟕𝟐𝟑 𝜇 1 =4700 𝜎/ 𝑛 =70.71 𝜇 0 =5000 𝜎/ 𝑛 =70.71 𝜷=𝟎.𝟎𝟐𝟕𝟕 𝑥 0.01 =4835.5

Summary

Summary

Summary

Practical Implications Type I Error, Type II Error, and Power are crucial considerations in any hypothesis testing study For a more consequential example, consider a new medical treatment developed to cure a life threatening disease. The treatment is tested on a random sample of ill patients. Type I Error: Deciding that the treatment works when, in fact, it doesn’t. This would be a dangerous mistake to make, because patients would be given this ineffective treatment. Type II Error: Deciding that the treatment does not work when, in fact, it does. This would be a tragic mistake, since this life- saving treatment would not be given to patients. Power: Probability of deciding that the treatment works when, in fact, it does. This would be the correct decision that we want to make. Ideally, we want to minimize both Type I error and Type II error (and maximize power). However, as explained earlier, the probabilities of Type I error and Type II error are inversely related: as one goes down, the other goes up. Therefore, a compromise must be made in choosing a low enough probability of Type I error (𝛼) while keeping the probability of Type II error (𝛽) in check. For any given study, the sample size should be large enough to have adequate power (or low 𝛽). If the sample size is too small, we would most likely not be able to detect a significant effect even it does exist in reality.