Hypothesis Testing: Hypotheses

Slides:



Advertisements
Similar presentations
Introducing Hypothesis Tests
Advertisements

Hypothesis Tests Steps and Notation (1-Sample)
Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Describing Data: Two Variables SECTIONS 2.1, 2.4, 2.5 Two categorical (2.1)
Hypothesis Testing: Intervals and Tests
Hypothesis Testing I 2/8/12 More on bootstrapping Random chance
Inference: Neyman’s Repeated Sampling STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 250 Dr. Kari Lock Morgan SECTION 4.2 Randomization distribution p-value.
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Section 4.4 Creating Randomization Distributions.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Significance STAT 250 Dr. Kari Lock Morgan SECTION 4.3 Significance level (4.3) Statistical.
Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. Rejecting Chance – Testing Hypotheses in Research Chapter 22.
Introducing Inference with Simulation Methods; Implementation at Duke University Kari Lock Morgan Department of Statistical Science, Duke University
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 101 Dr. Kari Lock Morgan 9/25/12 SECTION 4.2 Randomization distribution.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative.
Randomization Tests Dr. Kari Lock Morgan PSU /5/14.
More Randomization Distributions, Connections
Testing Hypotheses Tuesday, October 28. Objectives: Understand the logic of hypothesis testing and following related concepts Sidedness of a test (left-,
Statistics: Unlocking the Power of Data Lock 5 Synthesis STAT 250 Dr. Kari Lock Morgan SECTIONS 4.4, 4.5 Connecting bootstrapping and randomization (4.4)
Hypothesis Testing: p-value
Introducing Inference with Simulation Methods; Implementation at Duke University Kari Lock Morgan Department of Statistical Science, Duke University
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 9/11/12 Describing Data: Two Variables SECTIONS 2.1, 2.4, 2.5 Two categorical.
Copyright © 2009 Pearson Education, Inc LEARNING GOAL Interpret and carry out hypothesis tests for independence of variables with data organized.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 9: Testing a Claim Section 9.1 Significance Tests: The Basics.
S-012 Testing statistical hypotheses The CI approach The NHST approach.
Statistics: Unlocking the Power of Data Lock 5 Bootstrap Intervals Dr. Kari Lock Morgan PSU /12/14.
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
Hypothesis Testing with One Sample Chapter 7. § 7.1 Introduction to Hypothesis Testing.
CHAPTER 9 Testing a Claim
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Conclusions STAT 250 Dr. Kari Lock Morgan SECTION 4.3 Significance level (4.3) Statistical.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 250 Dr. Kari Lock Morgan SECTION 4.1 Hypothesis test Null and alternative.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
Statistics: Unlocking the Power of Data Lock 5 Normal Distribution STAT 250 Dr. Kari Lock Morgan Chapter 5 Normal distribution (5.1) Central limit theorem.
Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.
Today: Hypothesis testing. Example: Am I Cheating? If each of you pick a card from the four, and I make a guess of the card that you picked. What proportion.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
Section 9.1 First Day The idea of a significance test What is a p-value?
Significance Tests: The Basics Textbook Section 9.1.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 250 Dr. Kari Lock Morgan SECTION 4.2 p-value.
Statistics: Unlocking the Power of Data Lock 5 Section 4.1 Introducing Hypothesis Tests.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Copyright © 2009 Pearson Education, Inc LEARNING GOAL Interpret and carry out hypothesis tests for independence of variables with data organized.
+ Testing a Claim Significance Tests: The Basics.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
Lecture #8 Thursday, September 15, 2016 Textbook: Section 4.4
What Is a Test of Significance?
Introducing Hypothesis Tests
Unit 5: Hypothesis Testing
Warm Up Check your understanding p. 541
Measuring Evidence with p-values
CHAPTER 9 Testing a Claim
Hypothesis Testing: Hypotheses
Consider This… I claim that I make 80% of my basketball free throws. To test my claim, you ask me to shoot 20 free throws. I make only 8 out of 20.
Week 11 Chapter 17. Testing Hypotheses about Proportions
Measuring Evidence with p-values
CHAPTER 9 Testing a Claim
Boxplots and Quantitative/Categorical Relationships
Introducing Hypothesis Tests
CHAPTER 9 Testing a Claim
Significance Tests: The Basics
Significance Tests: The Basics
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Section 11.1: Significance Tests: Basics
Statistical Test A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to.
CHAPTER 9 Testing a Claim
Presentation transcript:

Hypothesis Testing: Hypotheses STAT 250 Dr. Kari Lock Morgan Hypothesis Testing: Hypotheses SECTION 4.1 Hypothesis test Null and alternative hypotheses Statistical significance

Tea and the Immune System L-theanine is an amino acid found in tea Black tea: about 20mg per cup Green tea (standard): varies, as low as 5mg per cup Green tea (shade grown): varies, up to 46mg per cup (Shade grown green tea examples: Gyokuro, Matcha) Gamma delta T cells are important for helping the immune system fend off infection It is thought that L-theanine primes T cells, activating them to a state of readiness and making them better able to respond to future antigens. Does drinking tea actually boost your immunity? L-theanine is a nonpeptide alkylamine antigen, and T cell exposure to this primes them to better respond to future potentially harmful antigens Antigens in tea-Beverage Prime Human Vγ2Vδ2 T Cells in vitro and in vivo for Memory and Non-memory Antibacterial Cytokine Responses, Kamath et.al., Proceedings of the National Academy of Sciences, May 13, 2003.

Tea and the Immune System Participants were randomized to drink five or six cups of either tea (black) or coffee every day for two weeks (both drinks have caffeine but only tea has L-theanine) After two weeks, blood samples were exposed to an antigen, and production of interferon gamma (immune system response) was measured Explanatory variable: tea or coffee Response variable: measure of interferon gamma Antigens in tea-Beverage Prime Human Vγ2Vδ2 T Cells in vitro and in vivo for Memory and Non-memory Antibacterial Cytokine Responses, Kamath et.al., Proceedings of the National Academy of Sciences, May 13, 2003.

Tea and the Immune System If the tea drinkers have significantly higher levels of interferon gamma, can we conclude that drinking tea rather than coffee caused an increase in this aspect of the immune response? Yes No

Tea and Immune System The explanatory variable is tea or coffee, and the response variable is immune system response measured in amount of interferon gamma produced. How could we visualize this data? Bar chart Histogram Side-by-side boxplots Scatterplot

Tea and Immune System The explanatory variable is tea or coffee, and the response variable is immune system response measured in amount of interferon gamma produced. How might we summarize this data? Mean Proportion Difference in means Difference in proportions Correlation

Hypothesis Test One mean is higher than the other in the sample Is this difference large enough to conclude the difference is real, and holds for the true population parameters? A hypothesis test uses data from a sample to assess a claim about a population

Hypotheses Hypothesis tests are framed formally in terms of two competing hypotheses: Null Hypothesis (H0): Claim that there is no effect or difference. Alternative Hypothesis (Ha): Claim for which we seek evidence.

Tea and Immune Respose Null Hypothesis (H0): No difference between drinking tea and coffee regarding interferon gamma Alternative Hypothesis (Ha): Drinking tea increases interferon gamma production more than drinking coffee No “effect” or no “difference” Claim we seek “evidence” for

Hypotheses: parameters More formal hypotheses: µT = true mean interferon gamma response after drinking tea µC = true mean interferon gamma response after drinking coffee H0: µT = µC Ha: µT > µC

Difference in Hypotheses Note: the following two sets of hypotheses are equivalent, and can be used interchangeably: H0: 1 = 2 Ha: 1 ≠ 2 H0: 1 – 2 = 0 Ha: 1 – 2 ≠ 0

Hypothesis Helpful Hints Hypotheses are always about population parameters, not sample statistics The null hypothesis always contains an equality The alternative hypothesis always contains an inequality (<, >, ≠) The type of inequality in the alternative comes from the wording of the question of interest

Statistical Hypotheses Usually the null is a very specific statement ? Null Hypothesis Alternative Hypothesis Can we reject the null hypothesis? ALL POSSIBILITIES

Null Hypothesis http://xkcd.com/892/

Sleep versus Caffeine Students were given words to memorize, then randomly assigned to take either a 90 min nap, or a caffeine pill. 2 ½ hours later, they were tested on their recall ability. Explanatory variable: sleep or caffeine Response variable: number of words recalled Is sleep or caffeine better for memory? Mednick, Cai, Kanady, and Drummond (2008). “Comparing the benefits of caffeine, naps and placebo on verbal, motor and perceptual memory,” Behavioral Brain Research, 193, 79-86.

Sleep versus Caffeine What is the parameter of interest in the sleep versus caffeine experiment? Proportion Difference in proportions Mean Difference in means Correlation

Sleep versus Caffeine H0: s ≠ c, Ha: s = c Let s and c be the true mean number of words recalled after sleeping and after caffeine. Is there a difference in average word recall between sleep and caffeine? What are the null and alternative hypotheses? H0: s ≠ c, Ha: s = c H0: s = c, Ha: s ≠ c H0: s ≠ c, Ha: s > c H0: s = c, Ha: s > c H0: s = c, Ha: s < c

Hypotheses Define the parameter(s) and state the hypotheses. Does the proportion of people who buy organic food when possible differ between males and females? Is the average hours of sleep per night for college students less than 7? Is amount of time spent studying positively associated with numeric grade in STAT 250?

Your Own Hypotheses Come up with a situation where you want to establish a claim based on data What parameter(s) are you interested in? What would the null and alternative hypotheses be? What type of data would lead you to believe the null hypothesis is probably not true?

Two Plausible Explanations If the sample data support the alternative, there are two plausible explanations: The alternative hypothesis (Ha) is true The null hypothesis (H0) is true, and the sample results were just due to random chance Key question: Do the data provide enough evidence to rule out #2?

Two Plausible Explanations Why might the tea drinkers have higher levels of interferon gamma? Two plausible explanations: Alternative true: Tea causes increase in interferon gamma production Null true, random chance: the people who got randomly assigned to the tea group have better immune systems than those who got randomly assigned to the coffee group

Hypothesis Testing In hypothesis testing, the goal is determine whether random chance can be ruled out as a plausible explanation. Key idea: How unlikely would it be to see a difference in means this large, just by random chance?

Statistical Significance When results as extreme as the observed sample statistic are unlikely to occur by random chance alone (assuming the null hypothesis is true), we say the sample results are statistically significant If our sample is statistically significant, we have convincing evidence against H0, in favor of Ha If our sample is not statistically significant, our test is inconclusive

Statistical Significance Results are significant! Results are not significant Results would be rare, if the null were true Results would not be rare, if the null were true We have evidence against the null We do not have evidence against the null We have evidence that the alternative is true! We can make no conclusions either way

Note on Statistical Significance Statistical significance is a difficult concept, but also one of the most fundamental concepts of the course We return to this concept almost every class for the rest of the semester, so it will get easier! it’s worth thinking deeply about!

Sleep versus Caffeine s and c: mean number of words recalled after sleeping and after caffeine H0: s = c, Ha: s ≠ c If the difference is statistically significant… we have evidence against the null hypothesis, in favor of the alternative we do not have evidence against the null hypothesis

Sleep versus Caffeine s and c: mean number of words recalled after sleeping and after caffeine H0: s = c, Ha: s ≠ c If the difference is not statistically significant… we have evidence against the null hypothesis, in favor of the alternative we do not have evidence against the null hypothesis

Sleep versus Caffeine s and c: mean number of words recalled after sleeping and after caffeine H0: s = c, Ha: s ≠ c If the difference is statistically significant… we have evidence that there is a difference between sleep and caffeine for memory we do not have evidence that there is a difference between sleep and caffeine for memory

Sleep versus Caffeine s and c: mean number of words recalled after sleeping and after caffeine H0: s = c, Ha: s ≠ c If the difference is not statistically significant… we have evidence that there is a difference between sleep and caffeine for memory we do not have evidence that there is a difference between sleep and caffeine for memory

Sleep versus Caffeine s and c: mean number of words recalled after sleeping and after caffeine H0: s = c, Ha: s ≠ c If the difference is not statistically significant, we could conclude… there is a difference between sleep and caffeine for memory (and data show sleep is better) there is not a difference between sleep and caffeine for memory nothing

Hours of Sleep per Night In testing whether the mean number of hours of sleep per night, , for college students is less than 7, we have H0:  = 7 vs Ha:  < 7 If the results of the test are statistically significant, we can conclude… There is evidence that the mean is equal to 7. There is evidence that the mean is less than 7. There is evidence that the mean is greater than 7. There is no evidence of anything. College students get lots of sleep.

Hours of Sleep per Night In testing whether the mean number of hours of sleep per night, , for college students is less than 7, we have H0:  = 7 vs Ha:  < 7 If the results of the test are not statistically significant, we can conclude… There is evidence that the mean is equal to 7. There is evidence that the mean is less than 7. There is evidence that the mean is greater than 7. There is no evidence of anything. College students get lots of sleep.

Summary Statistical tests use data from a sample to assess a claim about a population Statistical tests are usually formalized with competing hypotheses: Null hypothesis (H0): no effect or no difference Alternative hypothesis (Ha): what we seek evidence for If it would be unusual to get results as extreme as that observed, just by random chance, if the null were true, then the data is statistically significant If data are statistically significant, we have convincing evidence against the null hypothesis, and in favor of the alternative

To Do Read Section 4.1 HW 4.1 due Friday, 3/6