Introduction to inference Tests of significance IPS chapter 6.2 © 2006 W.H. Freeman and Company.

Slides:



Advertisements
Similar presentations
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Advertisements

Objectives (BPS chapter 15)
Chapter 10: Hypothesis Testing
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample
Introduction to Inference Estimating with Confidence Chapter 6.1.
Introduction to Inference Estimating with Confidence IPS Chapter 6.1 © 2009 W.H. Freeman and Company.
IPS Chapter 6 © 2012 W.H. Freeman and Company  6.1: Estimating with Confidence  6.2: Tests of Significance  6.3: Use and Abuse of Tests  6.4: Power.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 8 Introduction to Hypothesis Testing.
7/2/2015Basics of Significance Testing1 Chapter 15 Tests of Significance: The Basics.
Chapter 9 Hypothesis Testing.
Hypothesis Testing, part II
Probability Population:
Tests of significance: The basics BPS chapter 15 © 2006 W.H. Freeman and Company.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Overview Definition Hypothesis
Confidence Intervals and Hypothesis Testing - II
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
Tests of significance: The basics BPS chapter 15 © 2006 W.H. Freeman and Company.
Objectives 6.2 Tests of significance
14. Introduction to inference
Adapted by Peter Au, George Brown College McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited.
Chapter 8 Introduction to Hypothesis Testing
Significance Tests: THE BASICS Could it happen by chance alone?
Introduction to inference Tests of significance IPS chapter 6.2 © 2006 W.H. Freeman and Company.
Essential Statistics Chapter 131 Introduction to Inference.
Lesson Significance Tests: The Basics. Vocabulary Hypothesis – a statement or claim regarding a characteristic of one or more populations Hypothesis.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
Inference We want to know how often students in a medium-size college go to the mall in a given year. We interview an SRS of n = 10. If we interviewed.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
Inference for a population mean BPS chapter 16 © 2006 W.H. Freeman and Company.
Chapter 10 AP Statistics St. Francis High School Fr. Chris.
Chapter 9 Tests of Hypothesis Single Sample Tests The Beginnings – concepts and techniques Chapter 9A.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Statistics 101 Chapter 10 Section 2. How to run a significance test Step 1: Identify the population of interest and the parameter you want to draw conclusions.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Confidence intervals: The basics BPS chapter 14 © 2006 W.H. Freeman and Company.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
BPS - 3rd Ed. Chapter 141 Tests of significance: the basics.
Chapter 6.1 Estimating with Confidence 1. Point estimation  Sample mean is the natural estimator of the unknown population mean.  Is the point estimation.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Tests of significance: The basics BPS chapter 14 © 2006 W.H. Freeman and Company.
Chapter 9: Hypothesis Tests Based on a Single Sample 1.
IPS Chapter 6 © 2012 W.H. Freeman and Company  6.1: Estimating with Confidence  6.2: Tests of Significance  6.3: Use and Abuse of Tests  6.4: Power.
IPS Chapter 6 DAL-AC FALL 2015  6.1: Estimating with Confidence  6.2: Tests of Significance  6.3: Use and Abuse of Tests  6.4: Power and Inference.
Introduction to Inference Estimating with Confidence PBS Chapter 6.1 © 2009 W.H. Freeman and Company.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
A significance test or hypothesis test is a procedure for comparing our data with a hypothesis whose truth we want to assess. The hypothesis is usually.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
Tests of significance: The basics BPS chapter 14 © 2006 W.H. Freeman and Company.
Introduction to inference Estimating with confidence IPS chapter 6.1 © 2006 W.H. Freeman and Company.
Sampling and Inference September 23, Web Address
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Statistics for Business and Economics Module 1:Probability Theory and Statistical Inference Spring 2010 Lecture 6: Tests of significance / tests of hypothesis.
Copyright © 2009 Pearson Education, Inc. 9.2 Hypothesis Tests for Population Means LEARNING GOAL Understand and interpret one- and two-tailed hypothesis.
Uncertainty and confidence If you picked different samples from a population, you would probably get different sample means ( x ̅ ) and virtually none.
16/23/2016Inference about µ1 Chapter 17 Inference about a Population Mean.
4-1 Statistical Inference Statistical inference is to make decisions or draw conclusions about a population using the information contained in a sample.
14. Introduction to inference
Introduction to Inference
Tests of significance: The basics
Objectives 6.2 Tests of significance
The Practice of Statistics in the Life Sciences Fourth Edition
Lecture 10/24/ Tests of Significance
Presentation transcript:

Introduction to inference Tests of significance IPS chapter 6.2 © 2006 W.H. Freeman and Company

Objectives (IPS chapter 6.2) Tests of significance  Null and alternative hypotheses  One-sided and two-sided tests  The P-value  Tests for a population mean  The significance level,   Confidence intervals to test hypotheses

We have seen that the properties of the sampling distribution of x-bar help us estimate a range of likely values for population mean . We can also rely on the properties of the sample distribution to test hypotheses. Example: You are in charge of quality control in your food company. You sample randomly four packs of cherry tomatoes, each labeled 1/2 lb. (227 g). The average weight from your four boxes is 222 g. Obviously, we cannot expect boxes filled with whole tomatoes to all weigh exactly half a pound. Thus,  Is the somewhat smaller weight simply due to chance variation?  Is it evidence that the calibrating machine that sorts cherry tomatoes into packs needs revision?

Null and alternative hypotheses A test of statistical significance tests a specific hypothesis using sample data to determine the validity of the hypothesis. In statistics, a hypothesis is a claim about the characteristics of one or more parameters in (a) population(s). What you want to know: Does the calibrating machine that sorts cherry tomatoes into packs need revision? The same question reframed statistically: Is the population mean µ for the distribution of weights of cherry tomato packages equal to 227 g (i.e., half a pound)?

The null hypothesis is a specific statement about the value of (a) parameter of the population(s). It is labeled H 0. The alternative hypothesis is usually a more general statement about a parameter of the population(s) that contradicts the null hypothesis. It is labeled H a. Weight of cherry tomato packs: H 0 : µ = 227 g (µ is the average weight of the population of packs) H a : µ ≠ 227 g (µ is either larger or smaller)

One-sided and two-sided tests  A two-tail or two-sided test of the population mean has these null and alternative hypotheses: H 0 : µ = [a specific number] H a : µ  [the specific number]  A one-tail or one-sided test of a population mean has either one of these null and alternative hypotheses: H 0 : µ = [a specific number] H a : µ < [a specific number] OR H 0 : µ = [a specific number] H a : µ > [a specific number] The FDA tests whether a generic drug has an absorption extent similar to the known absorption extent of the brand-name drug it is copying. Higher or lower absorption would both be less desirable, so we test: H 0 : µ generic = µ brand H a : µ generic  µ brand two-sided

How to choose? What determines the choice of a one-sided versus a two-sided test is what it is that we want to show. That needs to be determined before we perform a test of statistical significance. A health advocacy group tests whether the mean nicotine content of a brand of cigarettes is greater than the advertised value of 1.4 mg. Here, the health advocacy group suspects that cigarette manufacturers sell cigarettes with a nicotine content higher than what they advertise in order to better addict consumers to their products and maintain revenues. Thus, this is a one-sided test:H 0 : µ = 1.4 mgH a : µ > 1.4 mg It is important to make that choice before performing the test in order to avoid ethical issues.

The P-value The packaging process has a known standard deviation  = 5 g. H 0 : µ = 227 g versus H a : µ ≠ 227 g The average weight from the four randomly chosen boxes is 222 g. What is the probability the sample mean would as small as 222 g if H 0 is true? We call this probability a P-value. Large P-values mean that the experimental results are reasonably consistent with the H o while small P- values mean the experimental results are not. This is a way of assessing the “believability” of the null hypothesis given the evidence provided by a random sample.

Interpreting a P-value Could random variation alone account for the difference between the null hypothesis and our observations?  A small P-value implies that random variation is not likely to be the reason for the observed difference.  For a sufficiently small p-value we reject H 0. We say the observed sample statistic is significantly different from what is stated in H 0. Thus, small P-values are strong evidence AGAINST H 0. But how small is small…?

P = P = P = P = P = 0.01 P = 0.05 When the shaded area becomes very small, the probability of drawing such a sample at random is very slim. Often, a P-value of 0.05 or less is considered significant: The phenomenon observed is unlikely to be entirely due to chance event from the random sampling. Significant P-value ???

Tests for a population mean µ defined by H 0 Sampling distribution σ/√n To test the hypothesis H 0 : µ = µ 0 based on an SRS of size n from a Normal population with unknown mean µ and known standard deviation σ, we rely on the properties of the sampling distribution N(µ, σ√n). For a one-sided alternative hypothesis the P-value is the area under the sampling distribution for values at least as extreme, in the direction of H a, as that of our random sample. Again, we first calculate a z-value and then use Table A.

P-value in one-sided and two-sided tests To calculate the P-value for a two-sided test, use the symmetry of the normal curve. Find the P-value for a one-sided test, and double it. One-sided (one-tailed) test Two-sided (two-tailed) test

Does the packaging machine need revision?  H 0 : µ = 227 g versus H a : µ ≠ 227 g  What is the probability of drawing a random sample such as yours if H 0 is true? The area under the standard normal curve < Thus, P-value = 2* = 4.56% which means we are unlikely to get such a small value if H o is true. 2.28% Sampling distribution σ/√n = 2.5 g µ ( H 0 ) The probability of getting a random sample average so different from µ is so low that we reject H 0.  The machine does need recalibration.

The significance level  The significance level, α, is the largest P-value for which we reject a true null hypothesis (how much evidence against H 0 we require). This value is decided upon before conducting the test.  If the P-value is equal to or less than α then we reject H 0. If the P-value is greater than α then we fail to reject H 0. Does the packaging machine need revision? Two-sided test. The P - value is 4.56%. * If α had been set to 5%, then the P-value would be significant. * If α had been set to 1%, then the P-value would not be significant.

When the z score falls within the rejection region (shaded area on the tail-side), the p-value is smaller than α and you have shown statistical significance. z = Z One-sided test, α = 5% Two-sided test, α = 1%

Rejection region for a two-tail test of µ with α = 0.05 (5%) A two-sided test means that α is spread between both tails of the curve, thus: -A middle area C of 1 − α  = 95%, and -An upper tail area of α  /2 = Table C 0.025

Confidence intervals to test hypotheses You can also use a 2-sided confidence interval to test a two-sided hypothesis. α / 2 In a two-sided test, C = 1 – α. C = confidence level α = significance level Packs of cherry tomatoes (σ  = 5 g): H 0 : µ = 227 g versus H a : µ ≠ 227 g Sample average 222 g. 95% CI for µ = 222 ± 1.96*5/√4 = 222 g ± 4.9 g 227 g does not belong to the 95% CI (217.1 to g). Thus, we reject H 0.

Ex: Your sample gives a 99% confidence interval of. With 99% confidence, could samples be from populations with µ = 0.86? µ = 0.85? 99% C.I. Logic of confidence interval test Cannot reject H 0 :  = 0.85 Reject H 0 :  = 0.86 A confidence interval gives a black and white answer: Reject or don't reject H 0. But it also estimates a range of likely values for the true population mean µ. A P-value quantifies how strong the evidence is against the H 0. But if you reject H 0, it doesn’t provide any information about the true population mean µ.