Presentation is loading. Please wait.

Presentation is loading. Please wait.

Economics 173 Business Statistics Lecture 6 Fall, 2001 Professor J. Petry

Similar presentations


Presentation on theme: "Economics 173 Business Statistics Lecture 6 Fall, 2001 Professor J. Petry"— Presentation transcript:

1 Economics 173 Business Statistics Lecture 6 Fall, 2001 Professor J. Petry http://www.cba.uiuc.edu/jpetry/Econ_173_fa01/

2 2 General Announcements For efficiency reasons, questions regarding the lecture materials will have to wait until after class, lab session, office hours or web- board. We will no longer require using the Z-table, nor any other table at the back of the book. –The counterintuitive nature of the tables and our ability to use excel in its place make this choice optimal. –We will provide the critical value to which you can compare a test statistic value, a p-value to which you can compare a significance level, or some means to find the necessary values without using the tables in your book. We will do our best to keep lectures as close to the schedule as possible. Regardless of the lecture pace, our schedule as printed in the Course Outline should dictate your reading schedule.

3 Chapter 11 Inference About the Description of a Single Population

4 4 11.1 Introduction In this chapter we utilize the approach developed before for making statistical inference about populations. –Identify the parameter to be estimated or tested. –Specify the parameter’s estimator and its sampling distribution. –Construct an interval estimator or perform a test.

5 5 We will develop techniques to estimate and test three population parameters. –The expected value  –The variance  2 –The population proportion p (for qualitative data) Examples –A bank conducts a survey to estimate the number of times customers will actually use ATM machines. –A random sample of processing times is taken to test the mean production time and the variance of production time on a production line.

6 6 Recall that when  is known is normally distributed – If the sample is drawn from a normal population, or if – the population is not normal but the sample is sufficiently large. When  is unknown, we use its point estimator s, and the Z statistic is replaced then by the t-statistic 11.2 Inference About a Population Mean When the Population Standard Deviation Is Unknown

7 7 s Z t s s s s When the sampled population is normally distributed, the statistic t is Student t distributed. 0 The t distribution is mound-shaped, and symmetrical around zero. The “degrees of freedom”, a function of the sample size determines how spread the distribution is (compared to the normal distribution) d.f. = n 2 d.f. = n 1 n 1 < n 2 Z Z Z ZZt t t t t t t t ttt s s s s s t

8 8 Testing the population mean when the population standard deviation is unknown If the population is normally distributed, the test statistic for  when  is unknown is t. This statistic is Student t distributed with n-1 degrees of freedom.

9 9 Example 11.1 Trainees productivity –In order to determine the number of workers required to meet demand, the productivity of newly hired trainees is studied. –It is believed that trainees can process and distribute more than 450 packages per hour within one week of hiring. –Can we conclude that this belief is correct, based on productivity observation of 50 trainees?

10 10 Solution –The problem objective is to describe the population of the number of packages processed in one hour. –The data are quantitative. H 0 :  = 450 H 1 :  > 450 –The t statistic d.f. = n - 1 = 49

11 11 –Solving by hand The rejection region is t > t ,n - 1 t ,n - 1 = t.05,49 = approximately to 1.676. (=critical value) From the data we have

12 12 The test statistic is Since 1.89 > 1.676 we reject the null hypothesis in favor of the alternative. There is sufficient evidence to infer that the mean productivity of trainees one week after being hired is greater than 450 packages at.05 significance level. 1.676 1.89 Rejection region

13 13 Since.0323 <.05, we reject the null hypothesis in favor of the alternative. Or, as in last slide 1.89 (the test statistic) is more extreme than 1.68 (the critical value). You would be given the critical value in this example. There is sufficient evidence to infer that the mean productivity of trainees one week after being hired is greater than 450 packages at.05 significance level..05.0323 1.68 1.89

14 14 Estimating the population mean when the population standard deviation is unknown Confidence interval estimator of  when  is unknown

15 15 11.3 Inference About a Population Variance Some times we are interested in making inference about the variability of processes. Examples: –The consistency of a production process for quality control purposes. –Investors use variance as a measure of risk. To draw inference about variability, the parameter of interest is  2.

16 16 The sample variance s 2 is an unbiased, consistent and efficient point estimator for  2. The statistic has a distribution called Chi-squared, if the population is normally distributed. d.f. = 1 d.f. = 5 d.f. = 10

17 17 Example 11.3 (operation management application) –A container-filling machine is believed to fill 1 liter containers so consistently, that the variance of the filling will be less than 1 cc (.001 liter). –To test this belief a random sample of 25 1-liter fills was taken, and the results recorded. –Do these data support the belief that the variance is less than 1cc at 5% significance level?

18 18 Solution –The problem objective is to describe the population of 1-liter fills from a filling machine. –The data are quantitative, and we are interested in the variability of the fills. –The complete test is: H 0 :  2 = 1 H 1 :  2 <1 We want to prove that the process is consistent

19 19 –Solving by hand Note that (n - 1)s 2 =  (x i - x) 2 =  x i 2 -  x i /n From the sample (data is presented in units of cc-1000 to avoid rounding) we can calculate  x i = -3.6, and  x i 2 = 21.3. Then (n - 1)s 2 = 21.3 - (-3.6) 2 /25 = 20.8. The complete test is shown next There is insufficient evidence to reject the hypothesis that the variance is equal to 1cc, in favor of the hypothesis that it is smaller.

20 20 13.848420.8 Rejection region  =.05 Do not reject the null hypothesis P-value =.3485

21 21 11.4 Inference About a Population Proportion When the population consists of qualitative or categorical data, the only inference we can make is about the proportion of occurrence of a certain value. The parameter “p” was used before to calculate probabilities using the binomial distribution.

22 22 Statistic and sampling distribution –the statistic employed is – Under certain conditions, [np > 5 and n(1-p) > 5], is approximately normally distributed, with  = p and  2 = p(1 - p)/n.

23 23 Test statistic for p Interval estimator for p (1-  confidence level)

24 24 Example 11.5 (marketing application) –For a new newspaper to be financially viable, it has to capture at least 12% of the Toronto market. –In a survey conducted among 400 randomly selected prospective readers, 58 participants indicated they would subscribe to the newspaper if its cost did not exceed $20 a month. –Can the publisher conclude that the proposed newspaper will be financially viable at 10% significance level?

25 25 Solution –The problem objective is to describe the population of newspaper readers in Toronto. –The responses to the survey are qualitative. –The parameter to be tested is “p”. –The hypotheses are: H 0 : p =.12 H 1 : p >.12 We want to prove that the newspaper is financially viable

26 26 –Solving by hand The rejection region is z > z  = z.10 = 1.28. (= critical value) The sample proportion is The value of the test statistic is The p-value is = P(Z>1.54) =.0618; alpha =0.10 There is sufficient evidence to reject the null hypothesis in favor of the alternative hypothesis. At 10% significance level we can argue that at least 12% of Toronto’s readers will subscribe to the new newspaper.

27 27

28 28 Example 11.6 (marketing application) –In a survey of 2000 TV viewers at 11.40 p.m. on a certain night, 226 indicated they watched “The Tonight Show”. –Estimate the number of TVs tuned to the Tonight Show in a typical night, if there are 100 million potential television sets. Use a 95% confidence level. –Solution

29 29 Selecting the Sample Size to Estimate the Proportion The interval estimator for the proportion is Thus, if we wish to estimate the proportion to within W, we can write The required sample size is

30 30 Example –Suppose we want to estimate the proportion of customers who prefer our company’s brand to within.03 with 95% confidence. –Find the sample size needed to guarantee that this requirement is met. –Solution W =.03; 1 -  =.95, therefore  /2 =.025, so z.025 = 1.96 Since the sample has not yet been taken, the sample proportion is still unknown. We proceed using either one of the following two methods:

31 31 Method 1: –There is no knowledge about the value of Let, which results in the largest possible n needed for a 1-  confidence interval. If the sample proportion does not equal.5, the actual W will be narrower than.03. Method 2: –There is some idea about the value of Use the value of to calculate the sample size


Download ppt "Economics 173 Business Statistics Lecture 6 Fall, 2001 Professor J. Petry"

Similar presentations


Ads by Google