Inference: Fishers Exact p-values STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke.

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

Chapter 12: Inference for Proportions BY: Lindsey Van Cleave.
COMPUTER INTENSIVE AND RE-RANDOMIZATION TESTS IN CLINICAL TRIALS Thomas Hammerstrom, Ph.D. USFDA, Division of Biometrics The opinions expressed are those.
Department of Engineering Management, Information and Systems
StatKey Online Tools for Teaching a Modern Introductory Statistics Course Robin Lock St. Lawrence University USCOTS Breakout – May 2013 Patti Frazer Lock.
5-1 Chapter 5 Theory & Problems of Probability & Statistics Murray R. Spiegel Sampling Theory.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 22 Comparing Two Proportions.
Comparing Two Population Parameters
Statistical Inference Using Scrambles and Bootstraps Robin Lock Burry Professor of Statistics St. Lawrence University MAA Allegheny Mountain 2014 Section.
CHAPTER 15: Tests of Significance: The Basics Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
January Structure of the book Section 1 (Ch 1 – 10) Basic concepts and techniques Section 2 (Ch 11 – 15): Inference for quantitative outcomes Section.
Rerandomization in Randomized Experiments STAT 320 Duke University Kari Lock Morgan.
What Can We Do When Conditions Aren’t Met? Robin H. Lock, Burry Professor of Statistics St. Lawrence University BAPS at 2014 JSM Boston, August 2014.
Lecture 3 Outline: Thurs, Sept 11 Chapters Probability model for 2-group randomized experiment Randomization test p-value Probability model for.
Hypothesis Testing: Intervals and Tests
Inference: Neyman’s Repeated Sampling STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 250 Dr. Kari Lock Morgan SECTION 4.2 Randomization distribution p-value.
Hypothesis Testing: Hypotheses
9.2a Tests about a Population Proportion Target Goal: I can check the conditions for carrying out a test about a population proportion. I can perform a.
Chapter 9 Tests of Significance Target Goal: I can perform a significance test to support the alternative hypothesis. I can interpret P values in context.
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
Assignment Mechanisms STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
What Is a Sampling Distribution?
Probability & Statistical Inference Lecture 7 MSc in Computing (Data Analytics)
Inferences About Means of Two Independent Samples Chapter 11 Homework: 1, 2, 3, 4, 6, 7.
Inferences About Means of Two Independent Samples Chapter 11 Homework: 1, 2, 4, 6, 7.
STAT 101 Dr. Kari Lock Morgan Exam 2 Review.
Chapter 9 Hypothesis Testing.
Dr. Kari Lock Morgan Department of Statistics Penn State University Teaching the Common Core: Making Inferences and Justifying Conclusions ASA Webinar.
Weighting STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
Subclassification STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
Statistics: Unlocking the Power of Data Lock 5 Inference for Proportions STAT 250 Dr. Kari Lock Morgan Chapter 6.1, 6.2, 6.3, 6.7, 6.8, 6.9 Formulas for.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative.
Randomized Experiments STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
Overview Definition Hypothesis
© 2003 Prentice-Hall, Inc.Chap 9-1 Fundamentals of Hypothesis Testing: One-Sample Tests IE 340/440 PROCESS IMPROVEMENT THROUGH PLANNED EXPERIMENTATION.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Statistics: Unlocking the Power of Data Lock 5 Synthesis STAT 250 Dr. Kari Lock Morgan SECTIONS 4.4, 4.5 Connecting bootstrapping and randomization (4.4)
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
Chapter 10 Comparing Two Means Target Goal: I can use two-sample t procedures to compare two means. 10.2a h.w: pg. 626: 29 – 32, pg. 652: 35, 37, 57.
9 Mar 2007 EMBnet Course – Introduction to Statistics for Biologists Nonparametric tests, Bootstrapping
SUTVA, Assignment Mechanism STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
© Copyright McGraw-Hill CHAPTER 13 Nonparametric Statistics.
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
Chapter 9 Introduction to the t Statistic. 9.1 Review Hypothesis Testing with z-Scores Sample mean (M) estimates (& approximates) population mean (μ)
Comments on Midterm Comments on HW4 Final Project Regression Example Sensitivity Analysis? Quiz STA 320 Design and Analysis of Causal Studies Dr. Kari.
PSY 307 – Statistics for the Behavioral Sciences Chapter 9 – Sampling Distribution of the Mean.
Bayesian Inference, Review 4/25/12 Frequentist inference Bayesian inference Review The Bayesian Heresy (pdf)pdf Professor Kari Lock Morgan Duke University.
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
Matching STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 250 Dr. Kari Lock Morgan SECTION 4.1 Hypothesis test Null and alternative.
Nonparametric Statistics
Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
13 Nonparametric Methods Introduction So far the underlying probability distribution functions (pdf) are assumed to be known, such as SND, t-distribution,
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 250 Dr. Kari Lock Morgan SECTION 4.2 p-value.
Chapter 9 Introduction to the t Statistic
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Unit 5: Hypothesis Testing
Measuring Evidence with p-values
Statistical inference: distribution, hypothesis testing
AP Stats Check In Where we’ve been…
YOU HAVE REACHED THE FINAL OBJECTIVE OF THE COURSE
Basic Practice of Statistics - 3rd Edition Introduction to Inference
Test Drop Rules: If not:
Chapter 9: Sampling Distributions
Statistical Test A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to.
Presentation transcript:

Inference: Fishers Exact p-values STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University

Quiz 1 Grades > summary(Quiz1) Min. 1st Qu. Median Mean 3rd Qu. Max

Review of Quiz Potential outcomes: what would spring GPA be if student tents AND what would it be if student doesnt tent? Covariates must be pre-treatment Assumptions: o SUTVA no interference: Y i not affected by W j o individualistic: W i not affected by X j or Y j o unconfounded: W i not affected by Y i or Y j, conditional on X W i can depend on W j Context: dont just recite definitions

Review of Last Class Randomizing units to treatments creates balanced treatment groups Placebos and blinding are important Four types of classical randomized experiments: o Bernoulli randomized experiment o Completely randomized experiment o Stratified randomized experiment o Paired randomized experiment

Diet Cola and Calcium Does drinking diet cola leach calcium from the body? 16 healthy women aged were randomly assigned to drink 24 ounces of either diet cola or water Their urine was collected for 3 hours, and calcium excreted was measured (in mg) Is there a significant difference?

Diet Cola and Calcium DrinkCalcium Excreted Diet cola50 Diet cola62 Diet cola48 Diet cola55 Diet cola58 Diet cola61 Diet cola58 Diet cola56 Water48 Water46 Water54 Water45 Water53 Water46 Water53 Water48

Test Statistic A test statistic, T, can be any function of: o the observed outcomes, Y obs o the treatment assignment vector, W o the covariates, X The test statistic must be a scalar (one number) Examples: o Difference in means o Regression coefficients o Rank statistics o See chapter 5 for a discussion of test statistics

Diet Cola and Calcium Difference in sample means between treatment group (diet cola drinkers) and control group (water drinkers)

Key Question Is a difference of mg more extreme than we would have observed, just by random chance, if there were no difference between diet cola and water regarding calcium excretion? What types of statistics would we see, just by the random assignment to treatment groups?

p-value T: A random variable T obs : the observed test statistic computed in the actual experiment The p-value is the probability that T is as extreme as T obs, if the null is true GOAL: Compare T obs to the distribution of T under the null hypothesis, to see how extreme T obs is SO: Need distribution of T obs under the null

Sir R.A. Fisher

Randomness In Fishers framework, the only randomness is the treatment assignment: W The potential outcomes are considered fixed, it is only random which is observed The distribution of T arises from the different possibilities for W For a completely randomized experiment, N choose N T possibilities for W

Sharp Null Hypothesis Fishers sharp null hypothesis is there is no treatment effect: H 0 : Y i (0) = Y i (1) for all i Note: this null is stronger than the typical hypothesis of equality of the means Advantage of Fishers sharp null: under the null, all potential outcomes known!

Diet Cola and Calcium There is NO EFFECT of drinking diet cola (as compared to water) regarding calcium excretion So, for each person in the study, their amount of calcium excreted would be the same, whether they drank diet cola or water

Sharp Null Hypothesis Key point: under the sharp null, the vector Y obs does not change with different W Therefore we can compute T exactly under the null for each different W ! Assignment mechanism completely determines the distribution of T under the null (why is this not true without sharp null?)

Randomization Distribution The randomization distribution is the distribution of the test statistic, T, assuming the null is true, over all possible assignment vectors, W For each possible assignment vector, compute T (keeping Y obs fixed, because we are assuming the null) The randomization distribution gives us exactly the distribution of T, assuming the sharp null hypothesis is true

Diet Cola and Calcium 16 choose 8 = 12,870 different possible assignment vectors For each of these, calculate T, the difference in sample means, keeping the values for calcium excretion fixed

Diet Cola and Calcium

Exact p-value From the randomization distribution, computing the p-value is straightforward: The exact p-value is the proportion of test statistics in the randomization distribution that are as extreme as T obs This is exact because there are no distributional assumptions – we are using the exact distribution of T

Diet Cola and Calcium p-value = 0.005

Diet Cola and Calcium If there were no difference between diet cola and water regarding calcium excretion, only 5/1000 of all randomizations would lead to a difference as extreme as mg (the observed difference) Drinking diet cola probably does leach calcium from your body!

Notes This approach is completely nonparametric – no model specified in terms of a set of unknown parameters We dont model the distribution of potential outcomes (they are considered fixed) No modeling assumptions or assumptions about the distribution of the potential outcomes

Approximate p-value For larger samples, the number of possible assignment vectors (N choose N T ) gets very large Enumerating every possible assignment vector becomes computationally difficult Its often easier to simulate many (10,000? 100,000?) random assignments

Approximate p-value Repeatedly randomize units to treatments, and calculate test statistic keeping Y obs fixed If the number of simulations is large enough, this randomization distribution will look very much like the exact distribution of T Note: estimated p-values will differ slightly from simulation to simulation. This is okay! The more simulations, the closer this approximate p-value will be to the exact p-value

Diet Cola and Calcium p-value 0.004

StatKey

To Do Read Ch 5 Bring laptops to class Wednesday HW 2 due next Monday