Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample data Elements of a hypothesis test: Null hypothesis.

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

Comparison of 2 Population Means Goal: To compare 2 populations/treatments wrt a numeric outcome Sampling Design: Independent Samples (Parallel Groups)
CHAPTER 15: Tests of Significance: The Basics Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Significance Tests Hypothesis - Statement Regarding a Characteristic of a Variable or set of variables. Corresponds to population(s) –Majority of registered.
Decision Errors and Power
Topic 6: Introduction to Hypothesis Testing
Probability & Statistical Inference Lecture 7 MSc in Computing (Data Analytics)
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
BS704 Class 7 Hypothesis Testing Procedures
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics
©2006 Thomson/South-Western 1 Chapter 10 – Hypothesis Testing for the Mean of a Population Slides prepared by Jeff Heyl Lincoln University ©2006 Thomson/South-Western.
Chapter 5 Inferences Regarding Population Central Values.
Sample Size Determination Ziad Taib March 7, 2014.
AM Recitation 2/10/11.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Confidence Intervals and Hypothesis Testing - II
Fundamentals of Hypothesis Testing: One-Sample Tests
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Hypothesis Testing (Statistical Significance). Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample.
Adapted by Peter Au, George Brown College McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited.
Comparing Two Population Means
Chapter 6 Introduction to Statistical Inference. Introduction Goal: Make statements regarding a population (or state of nature) based on a sample of measurements.
Jan 17,  Hypothesis, Null hypothesis Research question Null is the hypothesis of “no relationship”  Normal Distribution Bell curve Standard normal.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
LECTURE 19 THURSDAY, 14 April STA 291 Spring
January 31 and February 3,  Some formulae are presented in this lecture to provide the general mathematical background to the topic or to demonstrate.
Hypothesis Testing Introduction to Statistics Chapter 8 Mar 2-4, 2010 Classes #13-14.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Large sample CI for μ Small sample CI for μ Large sample CI for p
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Chapter 20 Testing Hypothesis about proportions
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
© Copyright McGraw-Hill 2004
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Hypothesis Tests for 1-Proportion Presentation 9.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Hypothesis Testing Concepts of Hypothesis Testing Statistical hypotheses – statements about population parameters Examples mean weight of adult.
Chapter Nine Hypothesis Testing.
Chapter 8: Inferences Based on a Single Sample: Tests of Hypotheses
More on Inference.
Statistical inference: distribution, hypothesis testing
Inferences Regarding Population Central Values
More on Inference.
Chapter 9 Hypothesis Testing.
Problems: Q&A chapter 6, problems Chapter 6:
Decision Errors and Power
Significance Tests: The Basics
Statistical Test A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to.
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample data Elements of a hypothesis test: Null hypothesis - Statement regarding the value(s) of unknown parameter(s). Typically will imply no association between explanatory and response variables in our applications (will always contain an equality) Alternative hypothesis - Statement contradictory to the null hypothesis (will always contain an inequality) Test statistic - Quantity based on sample data and null hypothesis used to test between null and alternative hypotheses Rejection region - Values of the test statistic for which we reject the null in favor of the alternative hypothesis

Hypothesis Testing Goal: Keep a, b reasonably small

Example - Efficacy Test for New drug Drug company has new drug, wishes to compare it with current standard treatment Federal regulators tell company that they must demonstrate that new drug is better than current treatment to receive approval Firm runs clinical trial where some patients receive new drug, and others receive standard treatment Numeric response of therapeutic effect is obtained (higher scores are better). Parameter of interest: mNew - mStd

Example - Efficacy Test for New drug Null hypothesis - New drug is no better than standard trt Alternative hypothesis - New drug is better than standard trt Experimental (Sample) data:

Sampling Distribution of Difference in Means In large samples, the difference in two sample means is approximately normally distributed: Under the null hypothesis, m1-m2=0 and: s12 and s22 are unknown and estimated by s12 and s22

Example - Efficacy Test for New drug Type I error - Concluding that the new drug is better than the standard (HA) when in fact it is no better (H0). Ineffective drug is deemed better. Traditionally a = P(Type I error) = 0.05 Type II error - Failing to conclude that the new drug is better (HA) when in fact it is. Effective drug is deemed to be no better. Traditionally a clinically important difference (D) is assigned and sample sizes chosen so that: b = P(Type II error | m1-m2 = D)  .20

Elements of a Hypothesis Test Test Statistic - Difference between the Sample means, scaled to number of standard deviations (standard errors) from the null difference of 0 for the Population means: Rejection Region - Set of values of the test statistic that are consistent with HA, such that the probability it falls in this region when H0 is true is a (we will always set a=0.05)

P-value (aka Observed Significance Level) P-value - Measure of the strength of evidence the sample data provides against the null hypothesis: P(Evidence This strong or stronger against H0 | H0 is true)

Large-Sample Test H0:m1-m2=0 vs H0:m1-m2>0 H0: m1-m2 = 0 (No difference in population means HA: m1-m2 > 0 (Population Mean 1 > Pop Mean 2) Conclusion - Reject H0 if test statistic falls in rejection region, or equivalently the P-value is  a

Example - Botox for Cervical Dystonia Patients - Individuals suffering from cervical dystonia Response - Tsui score of severity of cervical dystonia (higher scores are more severe) at week 8 of Tx Research (alternative) hypothesis - Botox A decreases mean Tsui score more than placebo Groups - Placebo (Group 1) and Botox A (Group 2) Experimental (Sample) Results: Source: Wissel, et al (2001)

Example - Botox for Cervical Dystonia Test whether Botox A produces lower mean Tsui scores than placebo (a = 0.05) Conclusion: Botox A produces lower mean Tsui scores than placebo (since 2.82 > 1.645 and P-value < 0.05)

2-Sided Tests Many studies don’t assume a direction wrt the difference m1-m2 H0: m1-m2 = 0 HA: m1-m2  0 Test statistic is the same as before Decision Rule: Conclude m1-m2 > 0 if zobs  za/2 (a=0.05  za/2=1.96) Conclude m1-m2 < 0 if zobs  -za/2 (a=0.05  -za/2= -1.96) Do not reject m1-m2 = 0 if -za/2  zobs  za/2 P-value: 2P(Z |zobs|)

Power of a Test Power - Probability a test rejects H0 (depends on m1- m2) H0 True: Power = P(Type I error) = a H0 False: Power = 1-P(Type II error) = 1-b Example: H0: m1- m2 = 0 HA: m1- m2 > 0 s12 = s22 = 25 n1 = n2 = 25 Decision Rule: Reject H0 (at a=0.05 significance level) if:

Power of a Test Now suppose in reality that m1-m2 = 3.0 (HA is true) Power now refers to the probability we (correctly) reject the null hypothesis. Note that the sampling distribution of the difference in sample means is approximately normal, with mean 3.0 and standard deviation (standard error) 1.414. Decision Rule (from last slide): Conclude population means differ if the sample mean for group 1 is at least 2.326 higher than the sample mean for group 2 Power for this case can be computed as:

Power of a Test As sample sizes increase, power increases All else being equal: As sample sizes increase, power increases As population variances decrease, power increases As the true mean difference increases, power increases

Power of a Test Distribution (H0) Distribution (HA)

Power of a Test Power Curves for group sample sizes of 25,50,75,100 and varying true values m1-m2 with s1=s2=5. For given m1-m2 , power increases with sample size For given sample size, power increases with m1-m2

Sample Size Calculations for Fixed Power Goal - Choose sample sizes to have a favorable chance of detecting a clinically meaning difference Step 1 - Define an important difference in means: Case 1: s approximated from prior experience or pilot study - dfference can be stated in units of the data Case 2: s unknown - difference must be stated in units of standard deviations of the data Step 2 - Choose the desired power to detect the the clinically meaningful difference (1-b, typically at least .80). For 2-sided test:

Example - Rosiglitazone for HIV-1 Lipoatrophy Trts - Rosiglitazone vs Placebo Response - Change in Limb fat mass Clinically Meaningful Difference - 0.5 (std dev’s) Desired Power - 1-b = 0.80 Significance Level - a = 0.05 Source: Carr, et al (2004)

Confidence Intervals Normally Distributed data - approximately 95% of individual measurements lie within 2 standard deviations of the mean Difference between 2 sample means is approximately normally distributed in large samples (regardless of shape of distribution of individual measurements): Thus, we can expect (with 95% confidence) that our sample mean difference lies within 2 standard errors of the true difference

(1-a)100% Confidence Interval for m1-m2 Large sample Confidence Interval for m1-m2: Standard level of confidence is 95% (z.025 = 1.96  2) (1-a)100% CI’s and 2-sided tests reach the same conclusions regarding whether m1-m2= 0

Example - Viagra for ED Comparison of Viagra (Group 1) and Placebo (Group 2) for ED Data pooled from 6 double-blind trials Subjects - White males Response - Percent of succesful intercourse attempts in past 4 weeks (Each subject reports his own percentage) 95% CI for m1- m2: Source: Carson, et al (2002)