STAT 111 Introductory Statistics Lecture 10: Confidence Intervals and Hypothesis Tests June 8, 2004.

Slides:



Advertisements
Similar presentations
11.1 – Significance Tests: The Basics
Advertisements

Copyright © Cengage Learning. All rights reserved. 7 Statistical Intervals Based on a Single Sample
1 Introduction to Estimation Chapter Introduction Statistical inference is the process by which we acquire information about populations from.
Chapter 9 Chapter 10 Chapter 11 Chapter 12
Business Statistics for Managerial Decision
7-2 Estimating a Population Proportion
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
Quantitative Business Methods for Decision Making Estimation and Testing of Hypotheses.
1 Economics 173 Business Statistics Lectures 3 & 4 Summer, 2001 Professor J. Petry.
June 18, 2008Stat Lecture 11 - Confidence Intervals 1 Introduction to Inference Sampling Distributions, Confidence Intervals and Hypothesis Testing.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Chapter 11: Estimation Estimation Defined Confidence Levels
Lecture 14 Sections 7.1 – 7.2 Objectives:
STAT 111 Introductory Statistics Lecture 9: Inference and Estimation June 2, 2004.
Estimates and Sample Sizes Lecture – 7.4
Chapter 7. Statistical Intervals Based on a Single Sample Weiqi Luo ( 骆伟祺 ) School of Software Sun Yat-Sen University : Office.
PARAMETRIC STATISTICAL INFERENCE
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Significance Tests: THE BASICS Could it happen by chance alone?
AP Statistics Section 11.1 A Basics of Significance Tests
Statistical Methods Introduction to Estimation noha hussein elkhidir16/04/35.
Stat 1510 Statistical Inference: Confidence Intervals & Test of Significance.
Essential Statistics Chapter 131 Introduction to Inference.
INTRODUCTION TO INFERENCE BPS - 5th Ed. Chapter 14 1.
CHAPTER 14 Introduction to Inference BPS - 5TH ED.CHAPTER 14 1.
Topic 7 - Hypothesis tests based on a single sample Sampling distribution of the sample mean - pages Basics of hypothesis testing -
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
AP STATISTICS LESSON 10 – 1 ( DAY 3 ) CHOOSING THE SAMPLE SIZE.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
Applied Business Forecasting and Regression Analysis
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
STA Lecture 251 STA 291 Lecture 25 Testing the hypothesis about Population Mean Inference about a Population Mean, or compare two population means.
Essential Statistics Chapter 141 Thinking about Inference.
Section 10.1 Confidence Intervals
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
BPS - 3rd Ed. Chapter 131 Confidence Intervals: The Basics.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
The z test statistic & two-sided tests Section
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
MATH 2400 Ch. 15 Notes.
Ch 10 – Intro To Inference 10.1: Estimating with Confidence 10.2 Tests of Significance 10.3 Making Sense of Statistical Significance 10.4 Inference as.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
CHAPTER 9 Testing a Claim
BPS - 3rd Ed. Chapter 141 Tests of significance: the basics.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Economics 173 Business Statistics Lecture 3 Fall, 2001 Professor J. Petry
Chapter 8: Confidence Intervals based on a Single Sample
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
1 Chapter 9: Introduction to Inference. 2 Thumbtack Activity Toss your thumbtack in the air and record whether it lands either point up (U) or point down.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
One Sample Mean Inference (Chapter 5)
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Tests of Significance: Stating Hypothesis; Testing Population Mean.
Testing a Single Mean Module 16. Tests of Significance Confidence intervals are used to estimate a population parameter. Tests of Significance or Hypothesis.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
Estimating a Population Proportion ADM 2304 – Winter 2012 ©Tony Quon.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
4-1 Statistical Inference Statistical inference is to make decisions or draw conclusions about a population using the information contained in a sample.
CHAPTER 9 Testing a Claim
Confidence Interval Estimation and Statistical Inference
Essential Statistics Introduction to Inference
CHAPTER 9 Testing a Claim
Basic Practice of Statistics - 3rd Edition Introduction to Inference
STAT 111 Introductory Statistics
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Presentation transcript:

STAT 111 Introductory Statistics Lecture 10: Confidence Intervals and Hypothesis Tests June 8, 2004

Today’s Topics Confidence intervals revisited Margin of error for confidence intervals Introduction to hypothesis testing

Confidence Intervals Revisited A level C confidence interval for some population parameter θ is an interval [L, U] computed from sample data by a method that has probability C of producing an interval containing the true value of the parameter In other words, P(L ≤ θ ≤ U) = C, C can be 90%, 95%, 99%, etc.

Confidence Intervals The general form of a confidence interval is given by estimate ± margin of error The estimate is our guess for the value of the unknown population parameter θ. The margin of error shows how accurate we believe our guess is, based on the variability of the estimate.

Confidence Interval for a Population Mean Suppose we choose a simple random sample of size n from a population with unknown mean µ and known standard deviation σ. Then a level C confidence interval for µ is z * satisfies –P(-z * ≤ Z ≤ z * ) = C –P(Z z * ) = (1 – C)/2

Confidence Interval for a Parameter Z 0

Confidence Interval for a Population Mean Recall the Central Limit Theorem. Suppose we have any population whose distribution has mean µ and standard deviation σ. If we draw a large enough SRS from this population, then This is true regardless of what the actual population distribution is.

Confidence Interval for a Population Mean Hence, if the population follows a normal distribution, or the sample size is sufficiently large, we have This leads to

Confidence Intervals for a Population Mean For any confidence interval, there are two possibilities: –The interval contains the true value of the parameter (in this case, µ). –Our SRS was one of the few samples for which µ is not contained in the interval. It is incorrect to say that there is probability C that the unknown population parameter (µ) lies within our particular confidence interval.

It means that if we repeatedly sample from the population, then the true population mean µ will be covered by the constructed confidence intervals (100C)% of the time. Remember! It is incorrect to say that the probability that the true population mean µ lies within the confidence interval is C. JAVA Applet for demonstrating confidence intervalsJAVA Applet Confidence Interval for a Population Mean

Upper Confidence LimitLower Confidence Limit Width on each side(Margin of error)

Commonly Used Confidence Levels Confidence level(C) 1-C(1-C)/2z* (z (1-C)/2 ) 99% % % % %

Example 1 The number and the types of television programs and commercials targeted at children is affected by the amount of time children watch TV. A survey was conducted among 100 American children, in which they were asked to record the number of hours they watched TV per week. The sample mean is The known population standard deviation is 8. Estimate the average watch time at a 95% confidence level.

Example 2 A study of preferred height for an experimental keyboard with large forearm-wrist support was conducted. 31 trained typists were selected, and the preferred keyboard height was determined for each of them. The resulting sample average height was 80 cm. Assume the preferred height is normally distributed with σ = 2 cm. Calculate a 90% confidence interval for µ, the true average preferred height for the population.

Example 3 Suppose we desire a confidence interval for the true average stray-load loss µ (in watts) for a certain type of induction motor when the line current is held at 10 amps for a speed of 1500 rpm. Assume that stray-load loss is normally distributed with σ = 3.0 If the a sample of size 100 produces a mean stray- load loss of 58.3, compute a 99% confidence interval for µ.

Example 4 The yield point of a particular type of mild steel- reinforcing bar is known to be normally distributed with σ = 100. The composition of the bar has been slightly modified without affecting either the normality or the value of σ. If a sample of 25 modified bars results in a sample average yield point of 8439 lb, compute a 92% confidence interval for the true average yield point of the modified bar.

Confidence Intervals (cont.) Confidence intervals for other parameters in a population can also be constructed. In particular, confidence intervals can be constructed on the standard deviation/variance of a population whose distribution has known mean µ. Also on populations in which some event occurs with proportion p. (More on this one later on.)

Margin of Error of a Confidence Interval The margin of error m is Margin of error measures precision of our estimate, but covers only random sampling errors. The size of the margin of error depends on –Confidence level –Sample size –Population standard deviation

Confidence Interval The length (width) of a confidence interval is The length (width) of a confidence interval increases if the margin of error increases. The width of a confidence interval increases if –Confidence level increases –Sample size decreases –Population standard deviation increases

Choosing the Sample Size Fixing the confidence level, a confidence interval for a population mean will have a specified margin of error m when the sample size is By achieving a specified margin of error, we can estimate the mean to within that margin of error units.

Example 1 To estimate the amount of lumber that can be harvested in a tract of land, the mean diameter of trees in the tract must be estimated to within one inch with 99% confidence. What sample size should be taken? (Assume diameters are normally distributed with σ = 6 inches.)

Example 2 Suppose that the standard deviation of the salaries of a population of individuals is 30K, how many individuals do we need to sample so that the 90% CI has a margin of error no more than 5K?

Example 3 Monitoring of a computer time-sharing system has suggested that response time to a particular command is normally distributed with σ = 25 ms. A new operating system is installed, and we wish to estimate the true average response time µ for the new environment. Assuming that response times are still normally distributed with σ = 25, what sample size is necessary to ensure that the resulting 95% confidence interval has a width of at most 10?

Cautions on CI for Population Mean The data must be an SRS from the population. Formula is incorrect for more complex probability sampling designs. Formula requires carefully produced data. Confidence interval is not resistant to outliers. When sample size is small, examine data for skewness and other signs of non-normality. Formula requires standard deviation of population to be known, which is not realistic in practice.

Introduction: Hypothesis Testing Confidence intervals are one of the two most common types of formal statistical inference. We prefer confidence intervals when our goal is to estimate a population parameter. Second common type of inference is used when we want to assess the evidence provided by the data in favor of some claim (hypothesis) about the population.

Hypothesis Testing Examples of claims to which hypothesis testing can be applied: –Are less than 10% of all circuit boards produced by a particular manufacturer defective? –Is the true average inside diameter of a certain type of pipe 0.75 cm? –Does one type of twine have a higher average breaking strength than a second type of twine? –For a pharmaceutical company, is a new drug effective for a certain disease?

Hypothesis Testing The hypothesis is a statement about the parameters in a population or model. The results of a test are expressed in terms of a probability that measures how well the data and the hypothesis agree. In hypothesis testing, we need to set up two hypotheses: –The null hypothesis H 0 –The alternative hypothesis H a (sometimes denoted H 1 )

Hypothesis Testing The null hypothesis is the claim which is initially favored or believed to be true. The null hypothesis is also the claim that we will try to find evidence against. Usually the null hypothesis is a statement of “no effect” or “no difference.” The test of significance is designed to assess the strength of the evidence against the null hypothesis.

Hypothesis Testing The alternative hypothesis is the claim that we hope or suspect is true instead of H 0. We often begin with the alternative hypothesis H a and then set up H 0 as the statement that the hoped-for effect is not present. Stating H a is often a difficult task. Hypotheses in general refer to some population or model and not to any particular outcome.

Hypothesis Testing The alternative hypothesis H a can be either one- sided or two-sided. One-sided alternative hypotheses: –μ > 0 –p ≤ 0.5 –σ < 2 Two-sided alternative hypotheses: –μ ≠ 0 –p ≠ 0.5 –σ ≠ 2

Example Experiments on learning in animals sometimes measure how long it takes a mouse to find its way through a maze. The mean time is 18 second for one particular maze. A researcher thinks that a loud noise will cause the mice to complete the maze faster. She measures how long each of 10 mice takes with a noise as stimulus. Let μ be the mean time of mice to find their way through a particular maze when noise is presented as a stimulus. –H 0 : μ = 18 –H a : μ < 18 One-sided H a

Example One-sided H a Does more than half of the American population have faith in the economy? 100,000 Americans are sampled. Let p be the population proportion of people who have faith in the economy. –H 0 : p ≤ 0.5 –H a : p > 0.5

Example The Census Bureau reports that households spend an average of 31% of their total spending on housing. A homebuilders association in Cleveland wonders if the national finding applies in their area. They interview a sample of 40 households in the Cleveland metropolitan area to learn what percent of their spending goes toward housing. Let μ be the mean percent of spending of households in Cleveland on housing. –H 0 : μ = 0.31 –H a : μ ≠ 0.31 Two-sided H a

Example Does one type of twine have a higher average breaking strength than a second type of twine? Let μ 1 be the average breaking strength of the first type of twine, and let μ 2 be the average breaking strength of the second type. –H 0 : μ 1 = μ 2 –H a : μ 1 ≠ μ 2 Two-sided H a

Hypothesis Testing The alternative hypothesis in general should express the hopes or suspicions we bring to the data. We should not, however, look first at the data and then frame H a to fit what the data show. Use a two-sided alternative unless you have a specific direction firmly in mind beforehand. In some circles, it is argued that the two-sided alternative should always be used in testing.