Section 4.4 Creating Randomization Distributions.

Slides:



Advertisements
Similar presentations
Introducing Hypothesis Tests
Advertisements

LSU-HSC School of Public Health Biostatistics 1 Statistical Core Didactic Introduction to Biostatistics Donald E. Mercante, PhD.
Hypothesis Testing: Intervals and Tests
Bootstrap Distributions Or: How do we get a sense of a sampling distribution when we only have ONE sample?
Hypothesis Testing making decisions using sample data.
Hypothesis Testing An introduction. Big picture Use a random sample to learn something about a larger population.
Inference Sampling distributions Hypothesis testing.
Hypothesis Testing I 2/8/12 More on bootstrapping Random chance
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 250 Dr. Kari Lock Morgan SECTION 4.2 Randomization distribution p-value.
Introduction to Hypothesis Testing AP Statistics Chap 11-1.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Stat 512 – Day 8 Tests of Significance (Ch. 6). Last Time Use random sampling to eliminate sampling errors Use caution to reduce nonsampling errors Use.
Chapter 9 Hypothesis Testing.
Objective: To test claims about inferences for two sample means, under specific conditions.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 101 Dr. Kari Lock Morgan 9/25/12 SECTION 4.2 Randomization distribution.
Confidence Intervals and Hypothesis Tests
Using Bootstrap Intervals and Randomization Tests to Enhance Conceptual Understanding in Introductory Statistics Kari Lock Morgan Department of Statistical.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative.
4.1 Introducing Hypothesis Tests 4.2 Measuring significance with P-values Visit the Maths Study Centre 11am-5pm This presentation.
Randomization Tests Dr. Kari Lock Morgan PSU /5/14.
Building Conceptual Understanding of Statistical Inference Patti Frazer Lock Cummings Professor of Mathematics St. Lawrence University
Overview Definition Hypothesis
Testing Hypotheses about a Population Proportion Lecture 29 Sections 9.1 – 9.3 Tue, Oct 23, 2007.
More Randomization Distributions, Connections
Understanding the P-value… Really! Kari Lock Morgan Department of Statistical Science, Duke University with Robin Lock, Patti Frazer.
Using Simulation Methods to Introduce Inference Kari Lock Morgan Duke University In collaboration with Robin Lock, Patti Frazer Lock, Eric Lock, Dennis.
Ch 10 Comparing Two Proportions Target Goal: I can determine the significance of a two sample proportion. 10.1b h.w: pg 623: 15, 17, 21, 23.
Testing Hypotheses Tuesday, October 28. Objectives: Understand the logic of hypothesis testing and following related concepts Sidedness of a test (left-,
Chapter 9 Comparing More than Two Means. Review of Simulation-Based Tests  One proportion:  We created a null distribution by flipping a coin, rolling.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Significance STAT 101 Dr. Kari Lock Morgan 9/27/12 SECTION 4.3 Significance level Statistical.
Essential Synthesis SECTION 4.4, 4.5, ES A, ES B
Statistics: Unlocking the Power of Data Lock 5 Synthesis STAT 250 Dr. Kari Lock Morgan SECTIONS 4.4, 4.5 Connecting bootstrapping and randomization (4.4)
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Hypothesis Testing: p-value
STA Statistical Inference
Chi-square test or c2 test
Using Randomization Methods to Build Conceptual Understanding of Statistical Inference: Day 2 Lock, Lock, Lock Morgan, Lock, and Lock MAA Minicourse- Joint.
Hypotheses tests for means
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
+ Chi Square Test Homogeneity or Independence( Association)
Chapter 20 Testing Hypothesis about proportions
Chapter 11 Chi- Square Test for Homogeneity Target Goal: I can use a chi-square test to compare 3 or more proportions. I can use a chi-square test for.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
AP Statistics Section 11.1 B More on Significance Tests.
AP STATISTICS LESSON (DAY 1) INFERENCE FOR TWO – WAY TABLES.
Early Inference: Using Randomization to Introduce Hypothesis Tests Kari Lock, Harvard University Eric Lock, UNC Chapel Hill Dennis Lock, Iowa State Joint.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 250 Dr. Kari Lock Morgan SECTION 4.1 Hypothesis test Null and alternative.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 26 Chapter 11 Section 1 Inference about Two Means: Dependent Samples.
Introduction to Hypothesis Testing
P-values and statistical inference Dr. Omar Aljadaan.
Major Steps. 1.State the hypotheses.  Be sure to state both the null hypothesis and the alternative hypothesis, and identify which is the claim. H0H0.
Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
Today: Hypothesis testing. Example: Am I Cheating? If each of you pick a card from the four, and I make a guess of the card that you picked. What proportion.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Chapter 13 Section 2. Chi-Square Test 1.Null hypothesis – written in words 2.Alternative hypothesis – written in words – always “different” 3.Alpha level.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 250 Dr. Kari Lock Morgan SECTION 4.2 p-value.
Randomization Tests PSU /2/14.
Statistical Core Didactic
Introducing Hypothesis Tests
Unit 5: Hypothesis Testing
Measuring Evidence with p-values
Simulation-Based Approach for Comparing Two Means
Hypothesis Testing: Hypotheses
Introducing Hypothesis Tests
Comparing Proportions for Multiple Populations
Presentation transcript:

Section 4.4 Creating Randomization Distributions

Randomization Distributions How do we estimate P-values using randomization distributions? Today we’ll discuss ways to simulate randomization samples for a variety of situations. 1.Simulate samples, assuming H 0 is true 2.Calculate the statistic of interest for each sample 3.Find the p-value as the proportion of simulated statistics as extreme as the observed statistic

In a randomized experiment on treating cocaine addiction, 48 people were randomly assigned to take either Desipramine (a new drug), or Lithium (an existing drug), and then followed to see who relapsed Question of interest: Is Desipramine better than Lithium at treating cocaine addiction? Cocaine Addiction

What are the null and alternative hypotheses? What are the possible conclusions? Cocaine Addiction

What are the null and alternative hypotheses? What are the possible conclusions? Cocaine Addiction Reject H 0 : Desipramine is better than Lithium Do not reject H 0 : We cannot determine from these data whether Desipramine is better than Lithium Let p D, p L be the proportion of cocaine addicts who relapse after taking Desipramine or Lithium, respectively. H 0 : p D = p L H a : p D < p L

RRRRRR RRRRRR RRRRRR RRRRRR RRRRRR RRRRRR RRRRRR RRRRRR RRRR RRRRRR RRRRRR RRRRRR RRRR RRRRRR RRRRRR RRRRRR Desipramine Lithium 1. Randomly assign units to treatment groups

RRRR RRRRRR RRRRRR NNNNNN RRRRRR RRRRNN NNNNNN RR NNNNNN R = Relapse N = No Relapse RRRR RRRRRR RRRRRR NNNNNN RRRRRR RRRRRR RRNNNN RR NNNNNN 2. Conduct experiment 3. Observe relapse counts in each group Lithium Desipramine 10 relapse, 14 no relapse18 relapse, 6 no relapse 1. Randomly assign units to treatment groups

To see if a statistic provides evidence against H 0, we need to see what kind of sample statistics we would observe, just by random chance, if H 0 were true Measuring Evidence against H 0

“by random chance” means by the random assignment to the two treatment groups “if H 0 were true” means if the two drugs were equally effective at preventing relapses (equivalently: whether a person relapses or not does not depend on which drug is taken) Simulate what would happen just by random chance, if H 0 were true… Cocaine Addiction

RRRR RRRRRR RRRRRR NNNNNN RR RRRR RRRRNN NNNNNN RR NNNNNN 10 relapse, 14 no relapse18 relapse, 6 no relapse

RRRRRR RRRRNN NNNNNN NNNNNN RRRRRR RRRRRR RRRRRR NNNNNN RNRN RRRRRR RNRRRN RNNNRR NNNR NRRNNN NRNRRN RNRRRR Simulate another randomization Desipramine Lithium 16 relapse, 8 no relapse12 relapse, 12 no relapse

RRRR RRRRRR RRRRRR NNNNNN RR RRRR RNRRNN RRNRNR RR RNRNRR Simulate another randomization Desipramine Lithium 17 relapse, 7 no relapse11 relapse, 13 no relapse

Simulate Your Own Sample In the experiment, 28 people relapsed and 20 people did not relapse. Create cards or slips of paper with 28 “R” values and 20 “N” values. Pool these response values together, and randomly divide them into two groups (representing Desipramine and Lithium) Calculate your difference in proportions Plot your statistic on the class dotplot To create an entire randomization distribution, we simulate this process many more times with technology: StatKeyStatKey

p-value

Randomization Distribution Center A randomization distribution is centered at the value of the parameter given in the null hypothesis. A randomization distribution simulates samples assuming the null hypothesis is true, so

Randomization Distribution We need to generate randomization samples assuming the null hypothesis is true.

Randomization Distribution a) 10.2 b) 12 c) 45 d) 1.8 Randomization distributions are always centered around the null hypothesized value.

Randomization Distribution a) How extreme 10.2 is b) How extreme 12 is c) How extreme 45 is d) What the standard error is e) How many randomization samples we collected We want to see how extreme the observed statistic is.

Randomization Distribution In a hypothesis test for H 0 :  1 =  2, H a :  1 >  2 sample mean #1 = 26 and sample mean #2 = 21. What do we require about the method to produce the randomization samples? We need to generate randomization samples assuming the null hypothesis is true.

Randomization Distribution a) 0 b) 1 c) 21 d) 26 e) 5 The randomization distribution is centered around the null hypothesized value,  1 -  2 = 0 In a hypothesis test for H 0 :  1 =  2, H a :  1 >  2 sample mean #1 = 26 and sample mean #2 = 21. Where will the randomization distribution be centered?

Randomization Distribution a) The standard error b) The center point c) How extreme 26 is d) How extreme 21 is e) How extreme 5 is We want to see how extreme the observed difference in means is. In a hypothesis test for H 0 :  1 =  2, H a :  1 >  2 sample mean #1 = 26 and sample mean #2 = 21. What do we look for in the randomization distribution?

Randomization Distribution For a randomization distribution, each simulated sample should… be consistent with the null hypothesis use the data in the observed sample reflect the way the data were collected

In randomized experiments the “randomness” is the random allocation to treatment groups If the null hypothesis is true, the response values would be the same, regardless of treatment group assignment To simulate what would happen just by random chance, if H 0 were true: Reallocate cases to treatment groups, keeping the response values the same Randomized Experiments

Observational Studies In observational studies, the “randomness” is random sampling from the population To simulate what would happen, just by random chance, if H 0 were true: Simulate drawing samples from a population in which H 0 is true How do we simulate sampling from a population in which H 0 is true when we only have sample data? Adjust the sample to make H 0 true, then bootstrap!

Let  the average human body temperature H 0 :  = 98.6 H a :  ≠ 98.6 Adjust the sample by adding 98.6 – = 0.34 to each value. The sample mean becomes 98.6, exactly the value given by the null hypothesis. Bootstrapping the adjusted sample allows us to simulate drawing samples as if the null is true! Body Temperatures sample mean = 98.26

In StatKey, when we enter the null hypothesis, this shifting is automatically done for us StatKey Body Temperatures p-value = 0.002

Exercise and Gender 1. State null and alternative hypotheses 2. Devise a way to generate a randomization sample that Uses the observed sample data Makes the null hypothesis true Reflects the way the data were collected Do males exercise more hours per week than females? sample mean difference x m – x f = 3

Exercise and Gender 1.H 0 :  m =  f H a :  m >  f 2.Generating a randomization distribution can be done with the “shift groups” method: To make H 0 true set the sample means equal by adding 3 to every female value. Now bootstrap from this modified sample Note: There are other ways. In StatKey, the default randomization method is “Reallocate Groups”, but “Shift Groups” is also an option.

Exercise and Gender p-value = 0.095

Exercise and Gender The p-value is Using α = 0.05, we conclude…. a) Males exercise more than females, on average b) Males do not exercise more than females, on average c) Nothing Do not reject the null… we can’t conclude anything.

Blood Pressure and Heart Rate Is blood pressure negatively correlated with heart rate? 1. State null and alternative hypotheses 2. Devise a way to generate a randomization sample that Uses the observed sample data Makes the null hypothesis true Reflects the way the data were collected sample corre lation r =

Blood Pressure and Heart Rate 1.H 0 :  = 0 H a :  < 0 2.Generating a randomization distribution: Two variables have correlation 0 if they are not associated (null hypothesis). We can “break the association” by randomly shuffling one of the variables. Each time we do this, we get a sample we might observe just by random chance, if there really is no correlation

Blood Pressure and Heart Rate p-value = Even if blood pressure and heart rate are not correlated, we would see correlations this extreme about 22% of the time, just by random chance.

Randomization Distributions: Cocaine Addiction (randomized experiment) Rerandomize cases to treatment groups, keeping response values fixed Body Temperature (single mean) Shift to make H 0 true, then bootstrap Exercise and Gender (observational study) Shift to make H 0 true, then bootstrap Blood Pressure and Heart Rate (correlation) Randomly shuffle one variable

As long as the original data is used and the null hypothesis is true for the randomization samples, most methods usually give similar p-values StatKey generates the randomizations for us. We will not be concerned with the details of the process. It is enough to understand the general principles. Generating Randomization Samples

Summary Randomization samples should be generated Consistent with the null hypothesis Using the observed data Reflecting the way the data were collected The specific method varies with the situation, but the general idea is always the same