Copyright © Cengage Learning. All rights reserved. 9 Inferences Involving One Population.

Slides:



Advertisements
Similar presentations
Tests of Hypotheses Based on a Single Sample
Advertisements

Hypothesis: It is an assumption of population parameter ( mean, proportion, variance) There are two types of hypothesis : 1) Simple hypothesis :A statistical.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses Type I and Type II Errors Type I and Type II Errors.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
Significance Testing Chapter 13 Victor Katch Kinesiology.
Hypothesis Testing Using a Single Sample
Business Statistics - QBM117
8-5 Testing a Claim About a Standard Deviation or Variance This section introduces methods for testing a claim made about a population standard deviation.
Chapter 9 Hypothesis Testing.
Hypothesis Testing Using The One-Sample t-Test
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Statistical Inference for Two Samples
Copyright © Cengage Learning. All rights reserved. 11 Applications of Chi-Square.
Hypothesis Testing.
Chapter 8 Inferences Based on a Single Sample: Tests of Hypothesis.
Introduction to Statistical Inferences
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 8-6 Testing a Claim About a Standard Deviation or Variance.
T-distribution & comparison of means Z as test statistic Use a Z-statistic only if you know the population standard deviation (σ). Z-statistic converts.
14 Elements of Nonparametric Statistics
More About Significance Tests
Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Involving One Population.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Procedure for Hypothesis Testing 1. Establish the null hypothesis, H 0. 2.Establish.
14 Elements of Nonparametric Statistics
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Chapter 9: Inferences Involving One Population Student’s t, df = 5 Student’s t, df = 15 Student’s t, df = 25.
Chapter 11: Applications of Chi-Square. Count or Frequency Data Many problems for which the data is categorized and the results shown by way of counts.
Chapter 11: Applications of Chi-Square. Chapter Goals Investigate two tests: multinomial experiment, and the contingency table. Compare experimental results.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #7 Jose M. Cruz Assistant Professor.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Copyright © Cengage Learning. All rights reserved. 14 Elements of Nonparametric Statistics.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Testing a Claim about a Standard Deviation or Variance Section 7-6 M A R I O F.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
Copyright © Cengage Learning. All rights reserved. 14 Elements of Nonparametric Statistics.
Sampling Distribution and the Central Limit Theorem.
Chapter 7 Inferences Based on a Single Sample: Tests of Hypotheses.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Copyright © Cengage Learning. All rights reserved. Chi-Square and F Distributions 10.
© Copyright McGraw-Hill 2004
Sec 8.5 Test for a Variance or a Standard Deviation Bluman, Chapter 81.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Chapter 10 Section 5 Chi-squared Test for a Variance or Standard Deviation.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 10.5.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
4-1 Statistical Inference Statistical inference is to make decisions or draw conclusions about a population using the information contained in a sample.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
Chapter 9 Introduction to the t Statistic
Copyright © Cengage Learning. All rights reserved. Hypothesis Testing 9.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
Hypothesis Testing: One-Sample Inference
Virtual University of Pakistan
Chapter 9: Inferences Involving One Population
Chapter 9 Hypothesis Testing.
Hypothesis Tests for a Standard Deviation
Presentation transcript:

Copyright © Cengage Learning. All rights reserved. 9 Inferences Involving One Population

Copyright © Cengage Learning. All rights reserved. 9.3 Inferences about the Variance and Standard Deviation

3 Problems often arise that require us to make inferences about variability. For example, a soft drink bottling company has a machine that fills 16-oz bottles. The company needs to control the standard deviation  (or variance  2 ) in the amount of soft drink, x, put into each bottle. The mean amount placed into each bottle is important, but a correct mean amount does not ensure that the filling machine is working correctly.

4 If the variance is too large, many bottles will be overfilled and many underfilled. Thus, the bottling company wants to maintain as small a standard deviation (or variance) as possible. When discussing inferences about the spread of data, we usually talk about variance instead of standard deviation because the techniques (the formulas used) employ the sample variance rather than the standard deviation. However, remember that the standard deviation is the positive square root of the variance; thus, talking about the variance of a population is comparable to talking about the standard deviation. Inferences about the Variance and Standard Deviation

5 Inferences about the variance of a normally distributed population use the chi-square, χ 2, distributions (“ki-square”: that’s “ki” as in “kite,” and χ is the Greek lowercase letter chi). The chi-square distributions, like Student’s t-distributions, are a family of probability distributions, each of which is identified by the parameter number of degrees of freedom. Inferences about the Variance and Standard Deviation

6 To use the chi-square distribution, we must be aware of its properties (also see Figure 9.7). Do I Use the z-Statistic or the t-Statistic? Figure 9.1 Inferences about the Variance and Standard Deviation

7 Properties of the chi-square distribution 1. χ 2 is nonnegative in value; it is zero or positively valued. 2. χ 2 is not symmetrical; it is skewed to the right. 3. χ 2 is distributed so as to form a family of distributions, a separate distribution for each different number of degrees of freedom. Note When df = 2, the mean value of the chi-square distribution is df. The mean is located to the right of the mode (the value where the curve reaches its high point) and just to the right of the median (the value that splits the distribution, 50% on each side). Inferences about the Variance and Standard Deviation

8 By locating zero at the left extreme and the value of df on your sketch of the χ 2 distribution, you will establish an approximate scale so that other values can be located in their respective positions. See Figure 9.8. Location of Mean, Median, and Mode for χ 2 Distribution Figure 9.8 Inferences about the Variance and Standard Deviation

9 For values of χ 2 on the left side of the median, the area to the right will be greater than The critical values for chi-square are obtained from Table 8 in Appendix B. Each critical value is identified by two pieces of information: df and area under the curve to the right of the critical value being sought. Inferences about the Variance and Standard Deviation

10 Thus, χ 2 (df,  ) (read “chi-square of df, alpha”) is the symbol used to identify the critical value of chi-square with df degrees of freedom and with  area to the right, as shown in Figure 9.9. Since the chi-square distribution is not symmetrical, the critical values associated with the right and left tails are given separately in Table 8. Chi-Square Distribution Showing χ 2 (df,  ) Figure 9.9 Inferences about the Variance and Standard Deviation

11 Example 16 – χ 2 Associated with the Left Tail Find χ 2 (14, 0.90). Solution: See the figure that follows. Use Table 8 in Appendix B to find the value of χ 2 (14, 0.90) at the intersection of row df = 14 and the column for an area of 0.90 to the right, as shown in the portion of the table that follows:

12 The accompanying figure shows the relationship between the cumulative probability and a specific χ 2 -value for a χ 2 -distribution with df degrees of freedom. We are now ready to use chi-square to make inferences about the population variance or standard deviation. Inferences about the Variance and Standard Deviation

13 The assumptions for inferences about the variance  2 or standard deviation  The sampled population is normally distributed. The t procedures for inferences about the mean were based on the assumption of normality, but the t procedures are generally useful even when the sampled population is nonnormal, especially for larger samples. However, the same is not true about the inference procedures for the standard deviation. Inferences about the Variance and Standard Deviation

14 The statistical procedures for the standard deviation are very sensitive to nonnormal distributions (skewness, in particular), and this makes it difficult to determine whether an apparent significant result is the result of the sample evidence or a violation of the assumptions. Therefore, the only inference procedure to be presented here is the hypothesis test for the standard deviation of a normal population. Inferences about the Variance and Standard Deviation

15 The test statistic that will be used in testing hypotheses about the population variance or standard deviation is obtained by using the following formula: Inferences about the Variance and Standard Deviation

16 When random samples are drawn from a normal population with a known variance  2, the quantity possesses a probability distribution that is known as the chi-square distribution with n – 1 degrees of freedom. Inferences about the Variance and Standard Deviation

17 Hypothesis-Testing Procedure

18 Hypothesis-Testing Procedure Let’s return to the example about the bottling company that wishes to detect when the variability in the amount of soft drink placed into each bottle gets out of control. A variance of is considered acceptable, and the company wants to adjust the bottle-filling machine when the variance,  2, becomes larger than this value. The decision will be made using the hypothesis-testing procedure.

19 Example 9 – One-tailed Hypothesis Test for Variance,  2 The soft drink bottling company wants to control the variability in the amount of fill by not allowing the variance to exceed Does a sample of size 28 with a variance of indicate that the bottling process is out of control (with regard to variance) at the 0.05 level of significance?

20 Example 17 – Solution Step 1 The Set-Up: a. Describe the population parameter of interest.  2, the variance in the amount of fill of a soft drink during a bottling process b. State the null hypothesis (H o ) and the alternative hypothesis (H a ). H o :  2 = (  ) (variance is not larger than ) H a :  2  (variance is larger than ) cont’d

21 Step 2 The Hypothesis Test Criteria: a. Check the assumptions. The amount of fill put into a bottle is generally normally distributed. By checking the distribution of the sample, we could verify this. b. Identify the probability distribution and the test statistic to be used. The chi-square distribution and formula (9.10), with df = n – 1 = 28 – 1 = 27, will be used. c. Determine the level of significance:  = cont’d Example 17 – Solution

22 Step 3 The Sample Evidence: a. Collect the sample information: n = 28 and s 2 = b. Calculate the value of the test statistic. Use formula (9.10): cont’d Example 17 – Solution

23 Step 4 The Probability Distribution: Using the p-value procedure: a. Calculate the p-value for the test statistic. Use the right-hand tail because H a expresses concern for values related to “larger than.” P = P( χ 2 > , with df = 27) as shown in the figure. cont’d Example 17 – Solution

24 To find the p-value, use one of two methods: 1. Use Table 8 in Appendix B to place bounds on the p-value:  P  Use a computer or calculator to calculate the p-value: P = b. Determine whether or not the p-value is smaller than . The p-value is smaller than the level of significance,  (0.05). cont’d Example 17 – Solution

25 Using the classical procedure: a. Determine the critical region and critical value(s). The critical region is the right-hand tail because H a expresses concern for values related to “larger than.” The critical value is obtained from Table 8, at the intersection of row df = 27 and column  = 0.05 : χ 2 (27, 0.05) = cont’d Example 17 – Solution

26 Step 5 The Results: a. State the decision about H o : Reject H o. b. State the conclusion about H a. At the 0.05 level of significance, we conclude that the bottling process is out of control with regard to the variance. cont’d Example 17 – Solution

27 Example 19 – Two-tailed Hypothesis Test For Standard Deviation,  A manufacturer claims that a photographic chemical has a shelf life that is normally distributed about a mean of 180 days with a standard deviation of no more than 10 days. As a user of this chemical, Fast Photo is concerned that the standard deviation might be different from 10 days; otherwise, it will buy a larger quantity while the chemical is part of a special promotion. Twelve random samples were selected and tested, with a standard deviation of 14 days resulting. At the 0.05 level of significance, does this sample present sufficient evidence to show that the standard deviation is different from 10 days?

28 Example 19 – Solution Step 1 The Set-Up: a. Describe the population parameter of interest. , the standard deviation for the shelf life of the chemical b. State the null hypothesis (H o ) and the alternative hypothesis (H a ). H a :  = 10 (standard deviation is 10 days) H o :  ≠ 10 (standard deviation is different from 10 days).

29 Step 2 The Hypothesis Test Criteria: a. Check the assumptions. The manufacturer claims shelf life is normally distributed; this could be verified by checking the distribution of the sample. b. Identify the probability distribution and the test statistic to be used. The chi-square distribution and formula (9.10), with df = n – 1 = 12 – 1 = 11, will be used. c. Determine the level of significance:  = cont’d Example 19 – Solution

30 Step 3 The Sample Evidence: a. Collect the sample information: n = 12 and s = 14. b. Calculate the value of the test statistic. Use formula (9.10): cont’d Example 19 – Solution

31 Step 4 The Probability Distribution: Using the p-value procedure: a. Calculate the p-value for the test statistic. Since the concern is for values “different from” 10, the p-value is the area of both tails. cont’d Example 19 – Solution

32 The area of each tail will represent ½ P. Since = is in the right tail, the area of the right tail is ½ P: ½ P = P( χ 2  21.56, with df = 11), as shown in the figure. cont’d Example 19 – Solution

33 To find ½ P, use one of two methods: 1. Use Table 8 in Appendix B to place bounds on ½ P:  ½ P  Double both bounds to find the bounds for P: 2  (0.025  ½ P  0.05) becomes 0.05  P  Use a computer or calculator to find ½ P:½ P = ; therefore, P = b. Determine whether or not the p-value is smaller than . The p-value is not smaller than the level of significance,  (0.05). cont’d Example 19 – Solution

34 Using the classical procedure: a. Determine the critical region and critical value(s). The critical region is split into two equal parts because H a expresses concern for values related to “different from.” The critical values are obtained from Table 8 at the intersections of row df = 11 with columns and for the area to right: and χ 2 (11, ) = 3.82 and χ 2 (11, 0.025) = cont’d Example 19 – Solution

35 b. Determine whether or not the calculated test statistic is in the critical region. is not in the critical region; see the accompanying figure. cont’d Example 19 – Solution

36 Step 5 The Results: a. State the decision about H o : Fail to reject H o. b. State the conclusion about H a. There is not sufficient evidence at the 0.05 significance level to conclude that the shelf life of this chemical has a standard deviation different from 10 days. Therefore, Fast Photo should purchase the chemical accordingly. cont’d Example 19 – Solution

37 Ceramic floor tiles come in all sorts of colors, finishes, and textures. One reason for making the surface textured is to create a natural stone look. In nature, the layers within stone vary greatly. For ceramic tiles there must be enough variation that the tiles resemble real stone, yet not so much as to create a safety problem. Applied Example 20 – Ceramic Floor Tile

38 This variation can be measured as surface height, x, the distance between the surface and the plane of the “highest” points of the surface. See the figure below. Applied Example 20 – Ceramic Floor Tile cont’d

39 The manufacturing specification calls for the mean surface height to be no greater than inch. The manufacturing process is under control when the standard deviation is no greater than 0.01 inch. Twenty-six randomly located points were measured and the following data resulted. Applied Example 20 – Ceramic Floor Tile cont’d