Chapter 231 Use and Abuse of Statistical Inference.

Slides:



Advertisements
Similar presentations
In this chapter we introduce the idea of hypothesis testing in general, and then we look at the specifics for a hypothesis test for a single population.
Advertisements

MAT 1000 Mathematics in Today's World. Last Time 1.What does a sample tell us about the population? 2.Practical problems in sample surveys.
Decision Errors and Power
Owners of a chain of ice cream shops recently examined sales figures and noticed that on a randomly selected day, 21 of 103 ice cream customers form their.
Significance Tests About
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Section 9-1 Review and Preview.
BPS - 5th Ed. Chapters 14 & 151 Chapters 14 and 15 Introduction to Inference.
Chapter 11 Asking and Answering Questions About The Difference Between Two Population Proportions Created by Kathy Fritz.
Chapter 9 Hypothesis Testing.
Statistical Thinking Experiments in the Real World
Confidence Intervals and Hypothesis Tests
1. Statistics: Learning from Samples about Populations Inference 1: Confidence Intervals What does the 95% CI really mean? Inference 2: Hypothesis Tests.
Math 227 Elementary Statistics
Modular 15 Ch 10.1 to 10.2 Part I. Ch 10.1 The Language of Hypothesis Testing Objective A : Set up a Hypothesis Testing Objective B : Type I or Type II.
Warm-up Day of 8.1 and 8.2 Quiz and Types of Errors Notes.
Hypothesis Testing – Examples and Case Studies
Inference in practice BPS chapter 16 © 2006 W.H. Freeman and Company.
14. Introduction to inference
Testing Hypotheses Tuesday, October 28. Objectives: Understand the logic of hypothesis testing and following related concepts Sidedness of a test (left-,
STAT E100 Section Week 10 – Hypothesis testing, 1- Proportion, 2- Proportion – Z tests, 2- Sample T tests.
More About Significance Tests
BPS - 3rd Ed. Chapter 151 Inference in Practice. BPS - 3rd Ed. Chapter 152  If we know the standard deviation  of the population, a confidence interval.
Confidence Intervals and Hypothesis tests with Proportions.
LESSON Tests about a Population Parameter.
Agresti/Franklin Statistics, 1 of 111 Chapter 9 Comparing Two Groups Learn …. How to Compare Two Groups On a Categorical or Quantitative Outcome Using.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Significance Tests: THE BASICS Could it happen by chance alone?
Essential Statistics Chapter 131 Introduction to Inference.
1 A heart fills with loving kindness is a likeable person indeed.
INTRODUCTION TO INFERENCE BPS - 5th Ed. Chapter 14 1.
CHAPTER 14 Introduction to Inference BPS - 5TH ED.CHAPTER 14 1.
Warm-up Day of 8.1 and 8.2 Review. 8.2 P#20, 23 and 24 P#20 a. and b. c. Since the p-hat is along the line for reasonably likely events.
Hypothesis Testing Hypothesis Testing Topic 11. Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question.
Literature searching & critical appraisal Chihaya Koriyama August 15, 2011 (Lecture 2)
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Chapter 10.2 TESTS OF SIGNIFICANCE.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
AP Statistics Chapter 22 Notes “Comparing Two Proportions”
Essential Statistics Chapter 141 Thinking about Inference.
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Inference Toolbox! To test a claim about an unknown population parameter: Step 1: State Identify the parameter (in context) and state your hypotheses.
Issues concerning the interpretation of statistical significance tests.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
1 Chapter 9: Introduction to Inference. 2 Thumbtack Activity Toss your thumbtack in the air and record whether it lands either point up (U) or point down.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
Compliance Original Study Design Randomised Surgical care Medical care.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Inference for Proportions Section Starter Do dogs who are house pets have higher cholesterol than dogs who live in a research clinic? A.
Two-Sample Proportions Inference. Sampling Distributions for the difference in proportions When tossing pennies, the probability of the coin landing.
Warm Up In May 2006, the Gallup Poll asked 510 randomly sampled adults the question “Generally speaking, do you believe the death penalty is applied fairly.
Testing a Single Mean Module 16. Tests of Significance Confidence intervals are used to estimate a population parameter. Tests of Significance or Hypothesis.
Apr. 26 Statistic for the day: Chance that a college student expects to be a millionaire by the age of 40: 1 in 2 Assignment: Begin to review for final.
AP Statistics Chapter 24 Notes “Comparing Two Sample Means”
Section 9.1 First Day The idea of a significance test What is a p-value?
Review Statistical inference and test of significance.
Hypothesis Tests for 1-Proportion Presentation 9.
Chapter 10 Comparing Two Populations or Groups Sect 10.1 Comparing two proportions.
Comparing Two Proportions Chapter 21. In a two-sample problem, we want to compare two populations or the responses to two treatments based on two independent.
Tests of Significance The reasoning of significance tests
Inferences About Means from Two Groups
Essential Statistics Introduction to Inference
Essential Statistics Thinking about Inference
Essential Statistics Thinking about Inference
Presentation transcript:

Chapter 231 Use and Abuse of Statistical Inference

Chapter 232 Thought Question 1 When presenting the results of a study, would it be sufficient to only report the P-value? Why would it be a good idea to also give a confidence interval based on the results?

Chapter 233 Thought Question 2 Suppose a new study found that there was no difference in lung function, measured by average volume of air expired, for smokers and nonsmokers. What may have led to this finding? Do you think the lung function was exactly the same for both groups in the study?

Chapter 234 Thought Question 3 The results of a CNN/USA Today/Gallup public opinion poll in August of 2005 showed that a majority of Americans were pro-choice on the abortion issue. Would it be fair to claim that “significantly more than 50% of Americans were pro- choice”? Explain.

Chapter 235 Thought Question 3: Answer n= stated that they were pro- choice 95% C.I.: to 0.571

Chapter 236 Warnings about Reports on Hypothesis Tests: Data Origins For any statistical analysis to be valid, the data must come from proper samples. Complex formulas and techniques cannot fix bad (biased) data. In addition, be sure to use an analysis that is appropriate for the type of data collected.

Chapter 237 Warnings about Reports on Hypothesis Tests: P-value or C.I.? P-values provide information as to whether findings are more than just good luck, but P-values alone may be misleading or leave out valuable information (as seen later in this chapter). Confidence intervals provide both the estimated values of important parameters and how uncertain the estimates are.

Chapter 238 Warnings about Reports on Hypothesis Tests: Significance If the word significant is used to try to convince you that there is an important effect or relationship, determine if the word is being used in the usual sense or in the statistical sense only.

Chapter 239 Case Study: Patient Satisfaction Bertakis, Klea D., et. al., “The influence of gender on physician practice style”, Medical Care, Vol. 33, No. 4, 1995, pp “Women Doctors Fare Better in Patient Survey” reported in Sacramento Bee, April 26, 1995

Chapter 2310 Case Study: Patient Satisfaction u Alternative (Research) Hypothesis: The mean satisfaction rating by patients who first saw a female physician is different from the mean satisfaction rating by patients who first saw a male physician. u Null Hypothesis: There is no difference in the mean satisfaction rating by patients who first saw a female physician and the mean satisfaction rating by patients who first saw a male physician.

Chapter 2311 Case Study: Patient Satisfaction u The alternative hypothesis is two-sided. u Study was double blinded (neither patients nor physicians were told the purpose of the survey). u Survey was completed by 250 patients at the University of California at Davis Medical Center who rated medical residents on a scale 1 to 5 (very dissatisfied to very satisfied).

Chapter 2312 Case Study: Patient Satisfaction u Bee: “The female physicians received an average score of The men – a respectable, yet significantly lower score of 4.05.” u The average difference was u Medical Care: the difference was “small but statistically significant (P-value=0.02).” u Medical Care: “This difference is both statistically and clinically significant.”

Chapter 2313 Warnings about Reports on Hypothesis Tests: Large Sample If a study is based on a very large sample size, relationships found to be statistically significant may not have much practical importance.

Chapter 2314 Case Study: Drug Use in American High Schools Bogert, Carroll. “Good news on drugs from the inner city,” Newsweek, Feb , pp Alcohol Use

Chapter 2315 Case Study: Drug Use in American High Schools u Alternative Hypothesis: The percentage of high school students who used alcohol in 1993 is less than the percentage who used alcohol in u Null Hypothesis: There is no difference in the percentage of high school students who used in 1993 and in 1992.

Chapter 2316 Case Study: Drug Use in American High Schools 1993 survey was based on 17,000 seniors, 15,500 10th graders and 18,500 8th graders.

Chapter 2317 Case Study: Drug Use in American High Schools u The article suggests that the survey reveals “good news” since the differences are all negative. u The differences are significant. –statistically? –practically?

Chapter 2318 Warnings about Reports on Hypothesis Tests: Small Sample If you read “no difference” or “no relationship” has been found in a study, try to determine the sample size used. Unless the sample size was large, remember that it could be that there is indeed an important relationship in the population, but that not enough data were collected to detect it. In other words, the test could have had very low power.

Chapter 2319 Case Study: Memory Loss Levy, B. and E. Langer. “Aging free from negative stereotypes: Successful memory in China and among the American deaf,” Journal of Personality and Social Psychology, Vol. 66, pp Memory Loss in American Hearing, American Deaf and Chinese Adults

Chapter 2320 Case Study: Memory Loss u Average Memory Test Scores (higher is better) u 30 subjects were sampled from each population

Chapter 2321 Case Study: Memory Loss u Young Americans (hearing and deaf) have significantly higher mean scores. u Science News (July 2, 1994, p. 13): “Surprisingly,...memory scores for older and younger Chinese did not statistically differ.”

Chapter 2322 Case Study: Memory Loss u Since the sample sizes are very small, there is an increased chance that the test will result in a Type II error if indeed there is a difference between young and old subjects’ mean memory scores. u The “surprising” result may just be a Type II error. u The test could have very low power.

Chapter 2323 Warnings about Reports on Hypothesis Tests: 1 or 2 Sided Try to determine whether the test was one- sided or two-sided. If a test is one-sided, and details are not reported, you could be misled into thinking there was no difference, when in fact there was one in the direction opposite to that hypothesized.

Chapter 2324 Case Study: Seen a UFO? Seen a UFO? You May Be Healthier Than Your Friends Roper Organization. Unusual Personal Experiences: An Analysis of the Data from Three National Surveys, Las Vegas: Bigelow Holding Corp., 1992.

Chapter 2325 Case Study: Seen a UFO? u Research Hypothesis (Alternative): People who claim to have seen a UFO are on average more psychologically disturbed than those who make no such claim. u Null Hypothesis: People who claim to have seen a UFO are on average no more or less psychologically disturbed than those who make no such claim.

Chapter 2326 Case Study: Seen a UFO? u 49 subjects were recruited through a newspaper. –18 were UFO nonintense –31 were UFO intense (could explain details of encounter) u 127 control subjects were recruited –74 students of a psychology class (receiving credit for participation) –53 community members recruited through a newspaper

Chapter 2327 Case Study: Seen a UFO? u New York Times (1993): “Study Finds No Abnormality in Those Reporting UFOs.” u Results: UFO groups actually scored significantly better (statistically) on many of the psychological measures. u The stated one-sided alternative hypothesis was not supported. Does this mean the null hypothesis is true?

Chapter 2328 Warnings about Reports on Hypothesis Tests: Only Significant are Reported? Sometimes researchers will perform a multitude of tests, and the reports will focus on those that achieved statistical significance. Remember that if nothing interesting is happening and all of the null hypotheses tested are true, then [about] 1 in 20 (.05) tests should achieve statistical significance just by chance. Beware of reports where it is evident that many tests were conducted, but where results of only one or two are presented as “significant.”

Chapter 2329 Case Study: Spinach is Good? So You Thought Spinach Was Good for You? Norwak, R. “Beta-carotene: Helpful or harmful?” Science, Vol. 264, April 22, 1994, pp

Chapter 2330 Case Study: Spinach is Good? u Startling finding: Supplements of the antioxidant beta-carotene markedly increased the incidence of lung cancer among heavy smokers in Finland. u This is the result of a large, randomized clinical trial: 29,000 cases u But…there were multiple tests conducted.

Chapter 2331 Key Concepts u Difference between a statistically significant effect and a practically important one u Large Samples and Statistical Significance u Small Samples and Statistical Significance u Multiple Tests and Statistical Significance