Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.1 Chapter 12 Inference About A Population.

Slides:

Advertisements

Similar presentations

Tests of Hypotheses Based on a Single Sample

Advertisements

1 Chapter 12 Inference About One Population Introduction In this chapter we utilize the approach developed before to describe a population.In.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 14 Statistical Inference: Review of Chapters 12 & 13.

Copyright © 2009 Cengage Learning 9.1 Chapter 9 Sampling Distributions.

EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.

Lecture 3 Miscellaneous details about hypothesis testing Type II error

Chapter 9 Chapter 10 Chapter 11 Chapter 12

Lecture 4 Chapter 11 wrap-up

Announcements Homework 2 will be posted on the web by tonight.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 13 Inference About Comparing Two Populations.

Lecture Inference for a population mean when the stdev is unknown; one more example 12.3 Testing a population variance 12.4 Testing a population.

1 Chapter 12 Inference About a Population 2 Introduction In this chapter we utilize the approach developed before to describe a population.In this chapter.

Inference about a Mean Part II

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.

Inferences About Process Quality

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 10 Introduction to Estimation.

Chapter 9 Hypothesis Testing.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.

Business Statistics: Communicating with Numbers

Economics 173 Business Statistics Lecture 8 Fall, 2001 Professor J. Petry

Statistical Inference for Two Samples

1 Economics 173 Business Statistics Lectures 3 & 4 Summer, 2001 Professor J. Petry.

Statistics for Managers Using Microsoft Excel

Copyright © 2009 Cengage Learning 12.1 Chapter 12 Inference About A Population.

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.

More About Significance Tests

Chapter 9 Large-Sample Tests of Hypotheses

McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.

HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 10.2.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 10 Introduction to Estimation.

Copyright © 2009 Cengage Learning Chapter 10 Introduction to Estimation ( 추 정 )

CHAPTER 18: Inference about a Population Mean

Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.

8 - 1 © 2000 Prentice-Hall, Inc. Statistics for Business and Economics Inferences Based on a Single Sample: Tests of Hypothesis Chapter 8.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 21 Nonparametric Statistics.

Economics 173 Business Statistics Lecture 6 Fall, 2001 Professor J. Petry

Statistical Methods Introduction to Estimation noha hussein elkhidir16/04/35.

Economics 173 Business Statistics Lecture 7 Fall, 2001 Professor J. Petry

Copyright © 2009 Cengage Learning 15.1 Chapter 16 Chi-Squared Tests.

Chapter 13 Inference About Comparing Two Populations.

Chapter 10 Introduction to Estimation Sir Naseer Shahzada.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 12 Inference About A Population.

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.

Chapter 13 Inference About Comparing Two Populations.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 16 Chi-Squared Tests.

1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 10 Introduction to Estimation.

Week 8 October Three Mini-Lectures QMM 510 Fall 2014.

© Copyright McGraw-Hill 2004

Ch 12 實習. Jia-Ying Chen2 We shall develop techniques to estimate and test three population parameters. Population mean  Population variance  2 Population.

Copyright © 2009 Pearson Education, Inc. 8.1 Sampling Distributions LEARNING GOAL Understand the fundamental ideas of sampling distributions and how the.

Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.

Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Assumptions 1) Sample is large (n > 30) a) Central limit theorem applies b) Can.

Chapter 12 Inference About One Population. We shall develop techniques to estimate and test three population parameters.  Population mean   Population.

Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.

Daniel S. Yates The Practice of Statistics Third Edition Chapter 12: Significance Tests in Practice Copyright © 2008 by W. H. Freeman & Company.

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.

Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.

Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.

4-1 Statistical Inference Statistical inference is to make decisions or draw conclusions about a population using the information contained in a sample.

Chapter 9 Introduction to the t Statistic

Economics 173 Business Statistics

Review of Power of a Test

Slides by JOHN LOUCKS St. Edward’s University.

Two-Sample Hypothesis Testing

Towson University - J. Jung

Confidence Interval Estimation and Statistical Inference

Keller: Stats for Mgmt & Econ, 7th Ed Sampling Distributions

Presentation transcript:

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 12 Inference About A Population

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Inference About A Population… Identify the parameter to be estimated or tested. Specify the parameter’s estimator and its sampling distribution. Derive the interval estimator and test statistic.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Inference About A Population… We will develop techniques to estimate and test three population parameters: Population Mean Population Variance Population Proportion p

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Inference With Variance Unknown… Previously, we looked at estimating and testing the population mean when the population standard deviation ( ) was known or given: But how often do we know the actual population variance? Instead, we use the Student t-statistic, given by:

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Inference With Variance Unknown… When is unknown, we use its point estimator s and the z-statistic is replaced by the the t-statistic, where the number of “degrees of freedom”, is n–1.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Testing when is unknown… When the population standard deviation is unknown and the population is normal, the test statistic for testing hypotheses about is: which is Student t distributed with = n–1 degrees of freedom. The confidence interval estimator of is given by:

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.1… Will new workers achieve 90% of the level of experienced workers within one week of being hired and trained? Experienced workers can process 500 packages/hour, thus if our conjecture is correct, we expect new workers to be able to process.90(500) = 450 packages per hour. Given the data, is this the case?data

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.1… Our objective is to describe the population of the numbers of packages processed in 1 hour by new workers, that is we want to know whether the new workers’ productivity is more than 90% of that of experienced workers. Thus we have: H 1 : > 450 Therefore we set our usual null hypothesis to: H 0 : = 450 IDENTIFY

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.1… Our test statistic is: With n=50 data points, we have n–1=49 degrees of freedom. Our hypothesis under question is: H 1 : > 450 Our rejection region becomes: Thus we will reject the null hypothesis in favor of the alternative if our calculated test static falls in this region. COMPUTE

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.1… From the data, we calculate = , s =38.83 and thus: Since we reject H 0 in favor of H 1, that is, there is sufficient evidence to conclude that the new workers are producing at more than 90% of the average of experienced workers. COMPUTE

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.1… Alternatively, we can use t-test:Mean from Tools > Data Analysis Plus in Excel… COMPUTE :::: rejection region

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.1… In addition to looking at the computed t-statistic and the critical value of t (one tail), we could look at the p-value (0.0323) and see that it is “small” (~3%), so again, we reject the null hypothesis in favor of the alternative… COMPUTE p-value

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.2… Can we estimate the return on investment for companies that won quality awards? We have are given a random sample of n = 83 such companies. We want to construct a 95% confidence interval for the mean return, i.e. what is: ??random sample IDENTIFY

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.2… From the data, we calculate: For this term and so: COMPUTE

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.2… We are 95% confident that the population mean,, i.e. the mean return of all publicly traded companies that win quality awards, lies between 13.20% and 16.84% Tools > Data Analysis Plus > t-Estimate: Mean is an alternative to the manual calculation… INTERPRET

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Check Requisite Conditions… The Student t distribution is robust, which means that if the population is nonnormal, the results of the t-test and confidence interval estimate are still valid provided that the population is “not extremely nonnormal”. To check this requirement, draw a histogram of the data and see how “bell shaped” the resulting figure is. If a histogram is extremely skewed (say in the case of an exponential distribution), that could be considered “extremely nonnormal” and hence t-statistics would be not be valid in this case.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Estimating Totals of Finite Populations… Large populations are defined as “populations that are at least 20 times the sample size” We can use the confidence interval estimator of a mean to produce a confidence interval estimator of the population total: Where N is the size of the finite population.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Estimating Totals of Finite Populations… For example, a sample of 500 households (in a city of 1 million households) reveals a 95% confidence interval estimate that the household mean spent on Halloween candy lies between $20 & $30. We can estimate the total amount spent in the city by multiplying these lower and upper confidence limits by the total population: Thus we estimate that the total amount spent on Halloween in the city lies between $20 million and $30 million.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Identifying Factors… Factors that identify the t-test and estimator of :

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Inference About Population Variance… If we are interested in drawing inferences about a population’s variability, the parameter we need to investigate is the population variance: The sample variance (s 2 ) is an unbiased, consistent and efficient point estimator for. Moreover, the statistic,, has a chi-squared distribution, with n–1 degrees of freedom.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Testing & Estimating Population Variance The test statistic used to test hypotheses about is: (which is chi-squared with = n–1 degrees of freedom).

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Testing & Estimating Population Variance Combining this statistic: With the probability statement: Yields the confidence interval estimator for : lower confidence limitupper confidence limit

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.3… Consider a container filling machine. Management wants a machine to fill 1 liter (1,000 cc’s) so that that variance of the fills is less than 1 cc 2. A random sample of n=25 1 liter fills were taken. Does the machine perform as it should at the 5% significance level?random sample We want to show that: H 1 : < 1 (so our null hypothesis becomes: H 0 : = 1). We will use this test statistic: Variance is less than 1 cc 2 IDENTIFY

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.3… Since our alternative hypothesis is phrased as: H 1 : < 1 We will reject H 0 in favor of H 1 if our test statistic falls into this rejection region: We computer the sample variance to be: s 2 =.8088 And thus our test statistic takes on this value… COMPUTE compare

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.3… Since: There is not enough evidence to infer that the claim is true. Excel output can also be used for this test… INTERPRET compare

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.4… As we saw, we cannot reject the null hypothesis in favor of the alternative. That is, there is not enough evidence to infer that the claim is true. Note: the result does not say that the variance is greater than 1, rather it merely states that we are unable to show that the variance is less than 1. We could estimate (at 99% confidence say) the variance of the fills…

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.4… In order to create a confidence interval estimate of the variance, we need these formulae: we know (n–1)s 2 = from our previous calculation, and we have from Table 5 in Appendix B: COMPUTE lower confidence limitupper confidence limit

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.4… Thus the 99% confidence interval estimate is: That is, the variance of fills lies between.426 and cc 2. COMPUTE

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Identifying Factors… Factors that identify the chi-squared test and estimator of :

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Inference: Population Proportion… When data are nominal, we count the number of occurrences of each value and calculate proportions. Thus, the parameter of interest in describing a population of nominal data is the population proportion p. This parameter was based on the binomial experiment. Recall the use of this statistic: where p-hat ( ) is the sample proportion: x successes in a sample size of n items.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Inference: Population Proportion… When np and n(1–p) are both greater than 5, the sampling distribution of is approximately normal with mean: standard deviation: Hence:

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Inference: Population Proportion… Test statistic for p : The confidence interval estimator for p is given by: (both of which require that np>5 and n(1–p)>5)

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.5… At an exit poll, voters are asked by a certain network if they voted Democrat (code=1) or Republican (code=2). Based on their small sample, can the network conclude that the Republican candidate will win the vote?exit poll That is: H 1 : p >.50 And hence our null hypothesis becomes: H 0 : p =.50 IDENTIFY

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.5… Since our research hypothesis is: H 1 : p >.50 our rejection region becomes: Looking at the data, we count 407 (of 765) votes for code=2. Hence, we calculate our test statistic as follows… COMPUTE

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Example 12.5… Since: …we reject H 0 in favor of H 1, that is, there is enough evidence to believe that the Republicans win the vote. Likewise from Excel: INTERPRET compare these… …or look at p-value

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Selecting the Sample Size… The confidence interval estimator for a population proportion is: Thus the (half) width of the interval is: Solving for n, we have:

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Selecting the Sample Size… For example, we want to know how many customers to survey in order to estimate the proportion of customers who prefer our brand to within.03 (with 95% confidence). I.e. our confidence interval after surveying will be ±.03, that means W=.03 Substituting into the equation… Uh Oh. Since we haven’t taken a sample yet, we don’t have this sample proportion…

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Selecting the Sample Size… Two methods – in each case we choose a value for then solve the equation for n. Method 1 : no knowledge of even a rough value of. This is a ‘worst case scenario’ so we substitute =.50 Method 2 : we have some idea about the value of. This is a better scenario and we substitute in our estimated value.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Selecting the Sample Size… Method 1 : no knowledge of value of, use 50%: Method 2 : some idea about a possible value, say 20%: Thus, we can sample fewer people if we already have a reasonable estimate of the population proportion before starting.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Estimating Totals for Large Populations… In much the same way as we saw earlier, when a population is large and finite we can estimate the total number of successes in the population by taking the product of the size of the population (N) and the confidence interval estimator: The Nielsen Ratings (used to measure TV audiences) uses this technique. Results from a small sample audience (say 2,000 viewers) is extrapolated to the total number of TV sets (say 100 million)…

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Nielsen Ratings Example… Problem: describe the population of television shows watched by viewers across the country (population), by examining the results from 2,000 viewers (sample).results We take these values and multiply them by N=100 million to estimate that between 9.9 million and 12.7 million viewers are watching the “Tonight Show”. COMPUTE

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Identifying Factors… Factors that identify the z-test and interval estimator of p :

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Flowchart of Techniques… Describe a Population Data Type? NominalInterval z test & estimator of p Type of descriptive measurement? Central LocationVariability t test & estimator of u. X 2 test & estimator ofs 2