Hypothesis Testing A hypothesis is a claim or statement about the value of either a single population parameter or about the values of several population.

Slides:



Advertisements
Similar presentations
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Advertisements

Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Hypothesis Testing: One Sample Mean or Proportion
Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean.
Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson8-1 Lesson 8: One-Sample Tests of Hypothesis.
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 8-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
Chapter 8 Introduction to Hypothesis Testing
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 10 Hypothesis Testing
Overview Definition Hypothesis
Confidence Intervals and Hypothesis Testing - II
1 Dr. Jerrell T. Stracener EMIS 7370 STAT 5340 Probability and Statistics for Scientists and Engineers Department of Engineering Management, Information.
Hypothesis testing is used to make decisions concerning the value of a parameter.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
Fundamentals of Hypothesis Testing: One-Sample Tests
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th Lesson Introduction to Hypothesis Testing.
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 10 Hypothesis Testing
Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.
Statistical Inference
IE241: Introduction to Hypothesis Testing. We said before that estimation of parameters was one of the two major areas of statistics. Now let’s turn to.
Hypothesis Testing – A Primer. Null and Alternative Hypotheses in Inferential Statistics Null hypothesis: The default position that there is no relationship.
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Chap 8-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 8 Introduction to Hypothesis.
Lecture 9 Chap 9-1 Chapter 2b Fundamentals of Hypothesis Testing: One-Sample Tests.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Unit 8 Section 8-1 & : Steps in Hypothesis Testing- Traditional Method  Hypothesis Testing – a decision making process for evaluating a claim.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Revision of basic statistics Hypothesis testing Principles Testing a proportion Testing a mean Testing the difference between two means Estimation.
Hypothesis Testing. “Not Guilty” In criminal proceedings in U.S. courts the defendant is presumed innocent until proven guilty and the prosecutor must.
Introduction Suppose that a pharmaceutical company is concerned that the mean potency  of an antibiotic meet the minimum government potency standards.
Introduction to hypothesis testing Hypothesis testing is about making decisions Is a hypothesis true or false? Ex. Are women paid less, on average, than.
Statistical Inference Making decisions regarding the population base on a sample.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
6.2 Large Sample Significance Tests for a Mean “The reason students have trouble understanding hypothesis testing may be that they are trying to think.”
Hypothesis Testing. A statistical Test is defined by 1.Choosing a statistic (called the test statistic) 2.Dividing the range of possible values for the.
C HAPTER 2  Hypothesis Testing -Test for one means - Test for two means -Test for one and two proportions.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
Chapter Nine Hypothesis Testing.
Statistics for Business and Economics
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests
Module 10 Hypothesis Tests for One Population Mean
Statistics for Managers Using Microsoft® Excel 5th Edition
One-Sample Tests of Hypothesis
FINAL EXAMINATION STUDY MATERIAL III
Chapter 9 Hypothesis Testing: Single Population
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Hypothesis Testing: Hypotheses
One-Sample Tests of Hypothesis
CONCEPTS OF HYPOTHESIS TESTING
Elementary Statistics: Picturing The World
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Virtual University of Pakistan
Hypothesis Testing.
Statistical Inference
Power Section 9.7.
One-Sample Tests of Hypothesis
Confidence Intervals.
Chapter 9 Hypothesis Testing: Single Population
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

Hypothesis Testing A hypothesis is a claim or statement about the value of either a single population parameter or about the values of several population parameters. Example: Women are paid less, on average, than men. Hypothesis testing is about making decisions. It is procedure, based on sample evidence and probability theory, used to determine whether the hypothesis is a reasonable statement and should not be rejected, or is unreasonable and should be rejected.

Hypothesis testing In hypothesis testing there are two conflicting statements about the value of a population parameter The Null hypothesis (H0) The Alternative hypothesis(H1 or Ha). For example, the mean age of Level 200 students is 20 years verses mean age is not 20 years. To test the validity of this hypothesis, we must select a sample from the population, calculate sample statistics and based on certain decision rules, either accept or reject the hypothesis.

Principles of hypothesis testing The null hypothesis is initially presumed to be true The analogy of a court of law is a good one here The accused is presumed innocent (null hypothesis) unless the evidence proves otherwise Evidence is gathered, to see if it is consistent with the hypothesis. If it is, the null hypothesis continues to be considered ‘true’ (later evidence might change this). If not, the null is rejected in favour of the alternative hypothesis Innocence is rejected in favour of a guilty verdict.

Two possible types of error Decision making is never perfect and mistakes can be made Type I error: rejecting the null when true (convicting the innocent) Type II error: accepting the null when false (letting the guilty go free)

Type I and Type II errors True situation Decision H0 true H0 false Accept H0 Correct decision Type II error Reject H0 Type I error

Avoiding incorrect decisions We wish to avoid both Type I and Type II errors. We can alter the decision rule to do this. Unfortunately, reducing the chance of making a Type I error generally means increasing the chance of a Type II error. Hence a trade off. Example: Accepting a 10-2 majority from the jury to convict (rather than unanimity) reduces the risk of the guilty going free (Type II error), but increases the risk of convicting the innocent (Type I error).

How to make a decision Where do we place the decision line? Set the Type I error probability to a particular value. By convention, this is 5%. This is known as the significance level of the test and is denoted α (probability of rejecting the Null when it is in fact true). It is complementary to the confidence level of estimation. 5% significance level  95% confidence level.

How to make a decision Test statistic: A value, determined from sample information, used to determine whether or not to reject the null hypothesis. Critical value: The dividing point between the region where the null hypothesis is rejected and the region where it is not rejected.

Example: How long do CFLs last? A manufacturer of compact fluorescent lamps claims its product lasts at least 5,000 hours, on average. A sample of 80 bulbs is tested. The average time before failure is 4,900 hours, with standard deviation 500 hours. Should the manufacturer’s claim be accepted or rejected?

The hypotheses to be tested H0: m = 5,000 H1: m < 5,000 Note This is a one tailed test, since the rejection region occupies only one side of the distribution (more on this soon). The null hypothesis is always a precise statement (with the equality sign in it). Choose significance level of 5% (α = .05, meaning critical value (Zc) is 1.64) Reject Null if Test Statistic is less than -1.64 (since rejection region is in the left tail of normal curve).

Rejection region 5% Z= -1.64 Reject H0

Should the null hypothesis be rejected? Is 4,900 far enough below 5,000? Is it more than 1.64 standard errors below 5,000? (1.64 standard errors below the mean cuts off the bottom 5% of the Normal distribution) The question we want to ask is: Is the mean indeed less than 5000, or the sample value of 4900 obtained was due to chance (sampling variability?) Test statistic 12

Example cont’d 4,900 is 1.79 standard errors below 5,000, so falls into the rejection region (bottom 5% of the distribution) Hence, we can reject H0 at the 5% significance level or, equivalently, with 95% confidence. If the true mean were 5,000, there is less than a 5% (3.67%) chance of obtaining sample evidence such as from a sample of n = 80.

Formal layout of a problem State the hypotheses H0: m = 5,000 H1: m < 5,000 Choose significance level (probability of rejecting H0 when true or committing type I error): 5% Look up critical value and state decision rule: zc = 1.64; reject if z>zc or –z<-zc [or reject if |z|>zc] Calculate the test statistic: z = -1.79 Decision: reject H0 since -1.79 < -1.64 and falls into the rejection region

One verses two tailed tests Should you use a one tailed (H1: m < 5,000) or two tailed (H1: m  5,000) test? If you are only concerned about falling one side of the hypothesized value (as here: we would not worry if the bulbs lasted longer than 5,000 hours) use the one tailed test. You would not want to reject H0 if the sample mean were anywhere above 5,000. If for another reason, you know one side is impossible (e.g. demand curves cannot slope upwards), use a one tailed test. Otherwise, use a two tailed test.

One vs two tailed tests If unsure, choose a two tailed test. Never choose between a one or two tailed test on the basis of the sample evidence (i.e. do not choose a one tailed test because you notice that 4,900 < 5,000). The hypothesis should be chosen before looking at the evidence!

Two tailed test example It is claimed that an average child spends 15 hours per week watching television. A survey of 100 children finds an average of 14.5 hours per week, with standard deviation 8 hours. Is the claim justified? The claim would be wrong if children spend either more or less than 15 hours watching TV. The rejection region is split across the two tails of the distribution. This is a two tailed test.

A two tailed test – diagram 2.5% 2.5% Reject H0 Reject H0

Solution to the problem H0: m = 15 H1: m  15 Choose significance level: 5% or α = 0.05 Look up critical value: zc = 1.96; reject H0 if z>zc=1.96 Calculate the test statistic: Decision: we do not reject H0 since 0.625 < 1.96 and does not fall into the rejection region

The choice of significance level Why 5%? Like its complement, the 95% confidence level, it is a convention. A different value can be chosen, but it does set a benchmark. If the cost of making a Type I error is especially high, then set a lower significance level, e.g. 1%. The significance level is the probability of making a Type I error.

Practice It is necessary for an automobile producer to test the hypothesis that the mean number of miles per gallon achieved by its cars is 28 against the alternative hypothesis that it is not 28. The standard deviation of the number of miles per gallon achieved by the company’s cars is 6. Suppose that the mean number of miles per gallon for a sample of 100 cars is 26.2. On the basis of this result, should the company reject the hypothesis that the population mean is 28? Why, or why not? Use α = 0.05.

The p-value approach There is an alternative way of making the decision. Returning to the CFL problem, the test statistic z = -1.79 cuts off 3.67% in the lower tail of the distribution [i.e. P(Z<-1.79)=0.0367] 3.67% is the p-value for this example Since 3.67% < 5% the test statistic must fall into the rejection region for the test The p-value measures the probability of obtaining a sample statistic as extreme as 4900 were the null hypothesis true The level of significance (α = 0.05) is the risk level we are willing to tolerate If the p-value is less than 0.05, we reject H0 and we do not reject when the p-value is greater than 0.05

Two ways to reject Reject H0 if |z| > zc i.e. |-1.79| > 1.64 or the p-value < the significance level (3.67% < 5%)

Testing a proportion The sample proportion is denoted by p where Proportion: A fraction or percentage that indicates the part of the population or sample having a particular trait of interest. The sample proportion is denoted by p where x is the number of successes in the sample n is the number sampled

Testing a proportion Same principles: reject H0 if the test statistic falls into the rejection region To test H0:  = 0.5 verses H1:   0.5 (e.g. a coin is fair or not) the test statistic is π is the population proportion.

Testing a proportion If the sample evidence were 60 heads from 100 tosses (p = 0.6) we would have so we would (just) reject H0 since 2 > 1.96.

Testing the difference of two means To test whether two samples are drawn from populations with the same mean H0: m1 = m2 or H0: m1 - m2 = 0 H1: m1  m2 or H0: m1 - m2  0 The test statistic is

Testing the difference of two proportions To test whether two sample proportions are equal H0: p1 = p2 or H0: p1 - p2 = 0 H1: p1  p2 or H0: p1 - p2  0 The test statistic is