1 Chapter 11 Analysis of Variance 1. 2 11.1 Introduction 11.2 One-Factor Analysis of Variance 11.3 Two-Factor Analysis of Variance: Introduction and Parameter.

Slides:



Advertisements
Similar presentations
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
Advertisements

1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Chapter 11 Analysis of Variance
Chapter 3 Analysis of Variance
PSY 307 – Statistics for the Behavioral Sciences
Overview of Lecture Parametric Analysis is used for
Chapter 17 Analysis of Variance
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 15 Analysis of Variance.
Sampling Distributions
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
Chapter 9: Introduction to the t statistic
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.
Chap 10-1 Analysis of Variance. Chap 10-2 Overview Analysis of Variance (ANOVA) F-test Tukey- Kramer test One-Way ANOVA Two-Way ANOVA Interaction Effects.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Statistical Inference for Two Samples
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
Analysis of Variance or ANOVA. In ANOVA, we are interested in comparing the means of different populations (usually more than 2 populations). Since this.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
INFERENTIAL STATISTICS: Analysis Of Variance ANOVA
© 2003 Prentice-Hall, Inc.Chap 11-1 Analysis of Variance IE 340/440 PROCESS IMPROVEMENT THROUGH PLANNED EXPERIMENTATION Dr. Xueping Li University of Tennessee.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
© 2002 Prentice-Hall, Inc.Chap 9-1 Statistics for Managers Using Microsoft Excel 3 rd Edition Chapter 9 Analysis of Variance.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
Testing Hypotheses about Differences among Several Means.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Chapter 14 – 1 Chapter 14: Analysis of Variance Understanding Analysis of Variance The Structure of Hypothesis Testing with ANOVA Decomposition of SST.
Chapter 12: Analysis of Variance. Chapter Goals Test a hypothesis about several means. Consider the analysis of variance technique (ANOVA). Restrict the.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
IE241: Introduction to Design of Experiments. Last term we talked about testing the difference between two independent means. For means from a normal.
Other Types of t-tests Recapitulation Recapitulation 1. Still dealing with random samples. 2. However, they are partitioned into two subsamples. 3. Interest.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
While you wait: Enter the following in your calculator. Find the mean and sample variation of each group. Bluman, Chapter 121.
Chapter 10 Section 5 Chi-squared Test for a Variance or Standard Deviation.
Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every.
1/54 Statistics Analysis of Variance. 2/54 Statistics in practice Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Virtual University of Pakistan
Chapter 13 f distribution and 0ne-way anova
Sampling Distributions
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
CHAPTER 3 Analysis of Variance (ANOVA)
Statistics Analysis of Variance.
John Loucks St. Edward’s University . SLIDES . BY.
Statistics for Business and Economics (13e)
Econ 3790: Business and Economic Statistics
Chapter 11 Inferences About Population Variances
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
One-Way Analysis of Variance
Hypothesis Tests for Two Population Standard Deviations
Hypothesis Tests for a Standard Deviation
Goodness of Fit.
Chapter 10 – Part II Analysis of Variance
Presentation transcript:

1 Chapter 11 Analysis of Variance 1

Introduction 11.2 One-Factor Analysis of Variance 11.3 Two-Factor Analysis of Variance: Introduction and Parameter Estimation 11.4 Two-Factor Analysis of Variance: Testing Hypotheses 11.5 Final Comments 2

3 Introduction When producing computer chips, the manufacturer ( 製造業者 ) needs to decide upon the raw materials ( 原料 ) to be used, the temperature at which to fuse the parts, the shape and the size of the chip, and other factors. For a given set of choices of these factors, the manufacturer wants to know the mean quality value of the resulting chip. This will enable her or him to determine the choices of the factors of production that would be most appropriate for obtaining a quality product. In this chapter we introduce the statistical technique used for analyzing the foregoing type of problem. The statistical technique was invented by R. A. Fisher and is known as the analysis of variance (ANOVA). In all the models considered in this chapter, we assume that the data are normally distributed with the same (though unknown) variance σ 2.

One-Factor Analysis of Variance Consider m samples, each of size n. Suppose that these samples are independent and that sample i comes from a population that is normally distributed with mean μ i and variance σ 2, i = 1,...,m. We will be interested in testing the null hypothesis H 0 : μ 1 = μ 2 = · · · = μ m against H 1 : not all the means are equal We will be testing the null hypothesis that all the population means are equal against the alternative that at least two population means differ. 4

One-Factor Analysis of Variance Note that we assume that these samples have a common variance σ 2. How to calculate the common variance σ 2 ? Method 1 5

One-Factor Analysis of Variance Method 2 6

One-Factor Analysis of Variance Conclusion: we can estimate σ 2 by 7

One-Factor Analysis of Variance Fact: when H 0 is true, TS will have what is known as an F distribution with m − 1 numerator ( 分子 ) and m(n − 1) denominator ( 分母 ) degrees of freedom. Let F m−1,m(n−1),α denote the α critical value of this distribution. That is, the probability that an F random variable having numerator and denominator degrees of freedom m − 1 and m(n − 1), respectively, will exceed F m−1,m(n−1),α is equal to α (see Fig. 11.1). The significance-level- α test of H 0 is as follows: 8

One-Factor Analysis of Variance Values of F r,s,0.05 for various values of r and s are presented in App. Table D.4. Part of this table is presented in Table From Table 11.1, we see that there is a 5 percent chance that an F random variable having 3 numerator and 10 denominator degrees of freedom will exceed

One-Factor Analysis of Variance A Remark on the Degrees of Freedom The numerator degrees of freedom of the F random variable are determined by the numerator ( 分子 ) estimator. ◦ Since is the sample variance from a sample of size m, it follows that it has m − 1 degrees of freedom. Similarly, the denominator ( 分母 ) estimator is based on the statistic. ◦ Since each of the sample variances is based on a sample of size n, it follows that they each have n − 1 degrees of freedom. ◦ Summing the m sample variances then results in a statistic with m(n − 1) degrees of freedom. 10

Example 11.1 An investigator ( 研究者 ) for a consumer cooperative ( 消費合作社 ) organized a study of the mileages obtainable from three different brands of gasoline ( 汽油品牌 ). Using 15 identical motors set to run at the same speed, the investigator randomly assigned each brand of gasoline to 5 of the motors. Each of the motors was then run on 10 gallons of gasoline, with the total mileages obtained as follows. Test the hypothesis that the average mileage obtained is the same for all three types of gasoline. Use the 5 percent level of significance. 11

Example 11.1 Solution Since m = 3 and n = 5, the sample means are 12

Example 11.1 Since m − 1 = 2 and m(n − 1) = 12, we must compare the value of the TS with the value of F 2,12,0.05. Now, from App. Table D.4, we see that F 2,12,0.05 = Since the value of the test statistic (TS = 2.60) does not exceed 3.89, at the 5 percent level of significance we cannot reject the null hypothesis. If the value of the test statistic TS is v, then the p value will be given by p value = P{F m−1,m(n−1) ≥ v} where F m−1,m(n−1) is an F random variable with m − 1 numerator and m(n − 1) denominator degrees of freedom. p value = (Program 11-1) 13

One-Factor Analysis of Variance 14

One-Factor Analysis of Variance Remark When m = 2, the preceding is a test of the null hypothesis that two independent samples, having a common population variance, have the same mean. The reader might wonder how this compares with the one presented in Chap. 10. It turns out that the tests are exactly the same. That is, assuming the same data are used, they always give rise to exactly the same p value. 15

Exercise (P. 510, 1&2) 16

Exercise (P. 513, 10) 17

Two-Factor Analysis of Variance: Introduction and Parameter Estimation Example 11.3 Four different standardized reading achievement tests were administered ( 實施 ) to each of five students. Their scores were as follows: Each value in this set of 20 data points is affected by two factors: the examination and the student whose score on that examination is being recorded. The examination factor has four possible values, or levels, and the student factor has five possible levels. 18

Two-Factor Analysis of Variance: Introduction and Parameter Estimation Let us suppose that there are m possible levels of the first factor and n possible levels of the second. Let X ij denote the value of the data obtained when the first factor is at level i and the second factor is at level j. Recall Sec Let X ij represent the value of the j th member of sample i. where α i equal to the deviation of μ i from the average of the means μ. 19

Two-Factor Analysis of Variance: Introduction and Parameter Estimation In the case of two factors, we write our model in terms of row and column deviations. ◦ The value μ is referred to as the grand mean, α i is the deviation from the grand mean due to row i, and β j is the deviation from the grand mean due to column j. “Dot” notation 20

Two-Factor Analysis of Variance: Introduction and Parameter Estimation To determine estimators for parameters μ, α i, and β j, i = 1,...,m, j = 1,..., n. (“Dot” notation) 21

Example 11.4 The following data from Example 11.3 give the scores obtained when four different reading tests were given to each of five students. Use it to estimate the parameters of the model. 22

Example 11.4 Therefore, for instance, if one of the students is randomly chosen and then given a randomly chosen examination, then our estimate of the mean score that will be obtained is. If we were told that examination i was taken, then this would increase our estimate of the mean score by the amount. If we were told that the student chosen was number j, then this would increase our estimate of the mean score by the amount. Thus, for instance, we would estimate that the score obtained on examination 1 by student 2 is the value of a random variable whose mean is 23

Exercise (P. 519, 4) 24

Exercise (P. 519, 3) 25

Two-Factor Analysis of Variance: Testing Hypotheses Consider the two-factor model in which one has data values X ij, i = 1,...,m and j = 1,..., n. These data are assumed to be independent normal random variables with a common variance σ 2 and with mean values satisfying where In this section, we will test Here we will apply the analysis of variance approach in which two different estimators are derived for the variance σ 2. ◦ The first will always be a valid estimator, whereas the second will be a valid estimator only when the null hypothesis is true. 26

Two-Factor Analysis of Variance: Testing Hypotheses Method 1 (estimate variance σ 2 ) The sum of the squares of N standard normal random variables is a chi-squared random variable with N degrees of freedom. 27

28 Ch. 7 Distribution of the Sample Variance of a Normal Population

29 Ch. 7 Distribution of the Sample Variance of a Normal Population

Two-Factor Analysis of Variance: Testing Hypotheses Since 30

Two-Factor Analysis of Variance: Testing Hypotheses 31

Two-Factor Analysis of Variance: Testing Hypotheses Method 2 (estimate variance σ 2 ) Suppose we want to test the null hypothesis that there is no row effect Note that when H 0 is true, each α i is equal to 0, and Since each X i. is the average of n random variables with variance σ 2 When H 0 is true, will be chi squared with m degrees of freedom. 32

Two-Factor Analysis of Variance: Testing Hypotheses When H 0 is true, It can be shown that SS r /(m − 1) will tend to be larger than σ 2 when H 0 is not true. 33

Two-Factor Analysis of Variance: Testing Hypotheses The test of the null hypothesis H 0 that there is no row effect involves comparing the two estimators just given, and it calls for rejection when the second is significantly larger than the first. The test statistic The significance-level- α test is If the value of the test statistic is v, then the p value where F m−1,N is an F random variable with m−1 numerator and N denominator degrees of freedom. 34

Two-Factor Analysis of Variance: Testing Hypotheses 35

Example 11.5 The following are the numbers of defective items produced by four workers using, in turn, three different machines. Test whether there are significant differences between the machines and the workers. Solution Here m = 3 and n = 4, computing the row and column averages: 36

Example 11.5 The calculation of SS e is more involved because we must add the sum of the squares of the terms X ij −X i. −X.j +X.. as i ranges from 1 to 3 and j from 1 to 4. For example, Adding all 12 terms gives 37

Example 11.5 Since m−1=2 and N=2·3=6, the test statistic for the hypothesis that there is no row effect is From App. Table D.4 we see that F 2,6,0.05 = 5.14, and so the hypothesis that the mean number of defective items is unaffected by which machine is used is not rejected at the 5 percent level of significance. The test statistic for the hypothesis that there is no column effect is From App. Table D.4 we see that F 3,6,0.05 = 4.76, and so the hypothesis that the mean number of defective items is unaffected by which worker is used is also not rejected at the 5 percent level of significance. 38

Example 11.5 Running Result of Program 11-2 ◦ The value of the F -statistic for testing that there is no row effect is ◦ The p -value for testing that there is no row effect is ◦ The value of the F -statistic for testing that there is no column effect is ◦ The p -value for testing that there is no column effect is

Exercise (P527, 1) 40 發射裝置 燃料 飛彈

41 KEY TERMS One-factor analysis of variance: A model concerning a collection of normal random variables. It supposes that the variances of these random variables are equal and that their mean values depend on only a single factor, namely, the sample to which the random variable belongs. F statistic: A test statistic that is, when the null hypothesis is true, a ratio of two estimators of a common variance. Two-factor analysis of variance: A model in which a set of normal random variables having a common variance is arranged in an array of rows and columns. The mean value of any of them depends on two factors, namely, the row and the column in which the variable lies. 41