Hypothesis Testing – Examples and Case Studies

Slides:



Advertisements
Similar presentations
“Students” t-test.
Advertisements

Objectives (BPS chapter 18) Inference about a Population Mean  Conditions for inference  The t distribution  The one-sample t confidence interval 
Inference for a population mean BPS chapter 18 © 2006 W. H. Freeman and Company.
Introduction to Statistics
STATISTICAL INFERENCE PART V
BHS Methods in Behavioral Sciences I
Inferences about two proportions Assumptions 1.We have proportions from two simple random samples that are independent (not paired) 2.For both samples,
T-Tests Lecture: Nov. 6, 2002.
BS704 Class 7 Hypothesis Testing Procedures
Chapter 11: Inference for Distributions
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
Inference about Population Parameters: Hypothesis Testing
CHAPTER 19: Two-Sample Problems
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
Overview Definition Hypothesis
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Ch 10 Comparing Two Proportions Target Goal: I can determine the significance of a two sample proportion. 10.1b h.w: pg 623: 15, 17, 21, 23.
Inferences Based on Two Samples
Statistical Analysis Statistical Analysis
Lesson Carrying Out Significance Tests. Vocabulary Hypothesis – a statement or claim regarding a characteristic of one or more populations Hypothesis.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
T-distribution & comparison of means Z as test statistic Use a Z-statistic only if you know the population standard deviation (σ). Z-statistic converts.
The Role of Confidence Intervals in Research 1. A study compared the serum HDL cholesterol levels in people with low-fat diets to people with diets high.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
More About Significance Tests
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
STATISTICAL INFERENCE PART VII
T tests comparing two means t tests comparing two means.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
AP Statistics Section 11.2 A Inference Toolbox for Significance Tests
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.2.
Inference for a population mean BPS chapter 18 © 2006 W.H. Freeman and Company.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Hypothesis Testing Hypothesis Testing Topic 11. Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question.
Inference We want to know how often students in a medium-size college go to the mall in a given year. We interview an SRS of n = 10. If we interviewed.
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
Inference for a population mean BPS chapter 16 © 2006 W.H. Freeman and Company.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Jeopardy Hypothesis Testing t-test Basics t for Indep. Samples Related Samples t— Didn’t cover— Skip for now Ancient History $100 $200$200 $300 $500 $400.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
MATH 2400 Ch. 15 Notes.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
T tests comparing two means t tests comparing two means.
© Copyright McGraw-Hill 2004
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
AP Statistics Section 11.2 A Inference Toolbox for Significance Tests.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Inference for a population mean BPS chapter 16 © 2006 W.H. Freeman and Company.
When  is unknown  The sample standard deviation s provides an estimate of the population standard deviation .  Larger samples give more reliable estimates.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
CHAPTER 10 Comparing Two Populations or Groups
The Practice of Statistics in the Life Sciences Fourth Edition
AP STATISTICS LESSON 10 – 2 (DAY 3)
Objectives 7.1 Inference for the mean of a population
Presentation transcript:

Hypothesis Testing – Examples and Case Studies Steps in Hypothesis Testing 1. Determine the null hypothesis and the alternative hypothesis. 2. Collect and summarize the data into a test statistic. 3. Use the test statistic to determine the p-value. The result is statistically significant if the p-value is less than or equal to the level of significance. Report your conclusions in terms of the original hypothesis

Hypothesis Testing – Examples and Case Studies Inferences about means Now that we know how to test hypotheses about proportions, it’d be nice to be able to do the same for means. Just as we did before, we will base both our confidence interval and our hypothesis test on the sampling distribution model. The Central Limit Theorem told us that the sampling distribution model for means is Normal with mean equal to the population mean and standard deviation (SD): population standard deviation sample size All we need is a random sample of quantitative data and the true population standard deviation. That’s the problem. Proportions have a link between the proportion value and the standard deviation of the sample proportion. This is not the case with means—knowing the sample mean tells us nothing about the SD of the sample mean.

Hypothesis Testing – Examples and Case Studies Inferences about means We’ll do the best we can: estimate the population standard deviation with the sample standard deviation . Our resulting standard error of sample mean (SEM) is : sample standard deviation sample size Reminder: Conditions for Rule for Sample Means Population of measurements is bell-shaped, and a random sample of any size is measured. OR Population of measurements of interest is not bell-shaped, but a large random sample is measured. Sample of size 40 is considered “large,” but if there are extreme outliers, better to have a larger sample.

Hypothesis Testing – Examples and Case Studies Example: Executives' blood pressures The National Center for Health Statistics reports that the mean systolic blood pressure for males 35 to 44 years of age is 128. The medical director of a large company looks at the medical records of 72 executives in this age group and finds that the mean systolic blood pressure in this sample mean is equal to 126.1 and that the standard deviation is = 15.2. Is this evidence that the company's executives have a different mean blood pressure from the general population? The hypotheses: The null hypothesis is "no difference“ from the national mean. The alternative is two sided. Why? So the hypotheses about the unknown population mean of the executive population are Null hypothesis: population mean equal to 128 Alternative hypothesis: population mean not equal to 128 The sampling distribution: If the null hypothesis is true, the sample mean comes from a Normal distribution with mean equal to 128. The standard error of the sample mean (SEM) is: sample standard deviation sample size = 15.2/square root of (72) = 1.79

standardized score (z-score) = observation – null value/standard error Hypothesis Testing – Examples and Case Studies Example: Executives' blood pressures The data: The sample mean is = 126.1. The standardized score (z-score) for this outcome is standardized score (z-score) = observation – null value/standard error = 126.1 – 128/1.79 = -1.06 The P-value: The figure locates the sample outcome -1.06 (in the standard scale) on the Normal curve that represents the sampling distribution if the null hypothesis is true. The two-sided P-value is the probability of an outcome at least this far out in either direction. In Table 8.1, the closest standardized score to -1.08. This is the 13th percentile of a Normal distribution. So the area to the left of-1.08 is 0.14. The area to the left of -1.08 and to the right of 1.08 is double this, equal to 0.28. This is our approximate P-value. (The exact P-value, from software, is P = 0.289.) The conclusion?

Hypothesis Testing – Examples and Case Studies Comparing Two Means This time the parameter of interest is the difference between the two population means For independent random quantities, variances add. So, the standard error(SE) or measure of variability of the difference between two sample means is SE(Sample 1 Mean - Sample 2 Mean ) = square root of [(SEM1)2 + (SEM2)2] where SEM1 = sample 1 standard deviation sample 1 size and SEM2 = sample 2 standard deviation sample 2 size Note: Rules for Sample Means apply for comparing two means. measure of variability known as Standard Error of Difference (SED) = [(SEM1)2 + (SEM2)2] Null hypothesis: Population mean difference is zero Alternative hypothesis: Population mean difference is not zero. The sampling distribution: If the null hypothesis is true, the sample mean comes from a Normal distribution with mean equal to zero and standard error equal to above value.

Hypothesis Testing – Examples and Case Studies Example: Weight Loss for Diet vs Exercise Determine the null and alternative hypotheses. Null hypothesis: No difference in average fat lost in population for two methods. Population mean difference is zero. Alternative hypothesis: There is a difference in average fat lost in population for two methods. Population mean difference is not zero. Collect and summarize data into a test statistic. The sample mean difference (Diet – Exercise) = 5.9 – 4.1 = 1.8 kg and the standard error of the difference is 0.83. standardized score = observation – null value/standard deviation = sample mean difference – null value/ standard error of the difference = 1.8 – 0/0.83 = 2.17 Determine the p-value. The alternative hypothesis was two-sided. p-value = 2  [proportion of bell-shaped curve above 2.17] Table 8.1 => proportion is about 2  0.015 = 0.03. 95% Confidence Interval : 1.6 kg to 4.8 kg

Hypothesis Testing – Examples and Case Studies Example. Cholesterol. A randomized controlled double-blind experiment was performed to demonstrate the efficacy of a drug called "cholestyramine“ in reducing blood cholesterol levels and preventing heart attacks. There were 3,806 subjects, who were all middle-aged men at high risk of heart attack; 1,906 were randomly assigned the treatment group and the remaining 1,900 to the control group. The subjects were followed for 7 years. The drug did reduce the cholesterol level in the treatment group (by about 8%). Furthermore, there were 155 heart attacks in the treatment group, and 187 in the control group: 8.1% versus 9.8%, z = -1.8, P =0.035 (one-tailed). This was called "strong evidence" that cholestyramine helps prevent heart attacks

Hypothesis Testing – Examples and Case Studies How Journals Present Tests Mozart, Relaxation, and Performance on Spatial Tasks Three listening conditions— Mozart, a relaxation tape, and silence—and all subjects participated in all three conditions. Null hypothesis: No differences in population mean spatial reasoning IQ scores after each of three listening conditions. Alternative hypothesis: Population mean spatial reasoning IQ scores do differ for at least one of the conditions compared with the others. “A one-factor (listening condition) repeated measures analysis of variance … revealed that subjects performed better on the abstract/spatial reasoning tests after listening to Mozart than after listening to either the relaxation tape or to nothing (F[2,35] = 7.08, p = 0.002).” “The music condition differed significantly from both the relaxation and silence conditions (t = 3.41, p = 0.002; t = 3.67, p = 0.0008, two-tailed, respectively). The relaxation and silence conditions did not differ (t = 0.795, p = 0.432, two-tailed).” Example: Weight Loss - Diet versus Exercise: z-test statistic given as 2.17 is actually a t-test statistic, because the sample standard deviations were used in the computation. The degrees of freedom (df) for the t-test is (42 + 47 – 2) = 87. Excel: TDIST(2.17,87,2) gives a p-value of 0.0327, very close to 0.0300 found using the standard normal curve (z-curve).

Text Questions Study: Alternations in Brain and Immune Function Produced by Mindfulness Meditation Q16. The participants were right-handed volunteers in a biotechnology company in Madison, Wisconsin in the Midwestern United States. However, for the relationship studied in this randomized experiment, there should be nothing unusual about them, so the results probably apply to adults who would volunteer for a study like this one.