Copyright © Cengage Learning. All rights reserved. 11 Multifactor Analysis of Variance.

Slides:



Advertisements
Similar presentations
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 10 The Analysis of Variance.
Advertisements

11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
Chapter 4 Randomized Blocks, Latin Squares, and Related Designs
Analysis and Interpretation Inferential Statistics ANOVA
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
Chapter 5 Introduction to Factorial Designs
Statistics for Managers Using Microsoft® Excel 5th Edition
1 Multifactor ANOVA. 2 What We Will Learn Two-factor ANOVA K ij =1 Two-factor ANOVA K ij =1 –Interaction –Tukey’s with multiple comparisons –Concept of.
Chapter 11 Analysis of Variance
Statistics for Business and Economics
Statistics for Managers Using Microsoft® Excel 5th Edition
1 Chapter 5 Introduction to Factorial Designs Basic Definitions and Principles Study the effects of two or more factors. Factorial designs Crossed:
8. ANALYSIS OF VARIANCE 8.1 Elements of a Designed Experiment
Copyright © Cengage Learning. All rights reserved. 10 The Analysis of Variance.
The Analysis of Variance
Inferences About Process Quality
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Outline Single-factor ANOVA Two-factor ANOVA Three-factor ANOVA
Copyright ©2011 Pearson Education 11-1 Chapter 11 Analysis of Variance Statistics for Managers using Microsoft Excel 6 th Global Edition.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 11 Multifactor Analysis of Variance.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.
Chap 10-1 Analysis of Variance. Chap 10-2 Overview Analysis of Variance (ANOVA) F-test Tukey- Kramer test One-Way ANOVA Two-Way ANOVA Interaction Effects.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 11-1 Chapter 11 Analysis of Variance Statistics for Managers using Microsoft Excel.
Chapter 12: Analysis of Variance
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
Analysis of Variance Nutan S. Mishra Department of Mathematics and Statistics University of South Alabama.
QNT 531 Advanced Problems in Statistics and Research Methods
Analysis of Variance ( ANOVA )
Chapter 10 Analysis of Variance.
Chapter 11 Multifactor Analysis of Variance.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
TOPIC 11 Analysis of Variance. Draw Sample Populations μ 1 = μ 2 = μ 3 = μ 4 = ….. μ n Evidence to accept/reject our claim Sample mean each group, grand.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
Psych 5500/6500 Other ANOVA’s Fall, Factorial Designs Factorial Designs have one dependent variable and more than one independent variable (i.e.
Design Of Experiments With Several Factors
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
1 Overview of Experimental Design. 2 3 Examples of Experimental Designs.
Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 18 Random Effects.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Multiple Regression I 1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Multiple Regression Analysis (Part 1) Terry Dielman.
Copyright © Cengage Learning. All rights reserved. Chi-Square and F Distributions 10.
Chapter 4 Analysis of Variance
Copyright © Cengage Learning. All rights reserved. 11 Multifactor Analysis of Variance.
Copyright © Cengage Learning. All rights reserved. 15 Distribution-Free Procedures.
Copyright © Cengage Learning. All rights reserved. 11 Multifactor Analysis of Variance.
Copyright © Cengage Learning. All rights reserved. 11 Multifactor Analysis of Variance.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
The Mixed Effects Model - Introduction In many situations, one of the factors of interest will have its levels chosen because they are of specific interest.
1 Mixed and Random Effects Models 1-way ANOVA - Random Effects Model 1-way ANOVA - Random Effects Model 2-way ANOVA - Mixed Effects Model 2-way ANOVA -
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Three Factor ANOVA I,J,K = Levels of the factors A,B, C L ijk = The number of obeservations of factor A at level i, factor B at level j, and factor C at.
Copyright © Cengage Learning. All rights reserved. 14 Goodness-of-Fit Tests and Categorical Data Analysis.
DSCI 346 Yamasaki Lecture 4 ANalysis Of Variance.
Chapter 15 Analysis of Variance. The article “Could Mean Platelet Volume be a Predictive Marker for Acute Myocardial Infarction?” (Medical Science Monitor,
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
Virtual University of Pakistan
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Two-Way Analysis of Variance Chapter 11.
CHAPTER 13 Design and Analysis of Single-Factor Experiments:
Applied Business Statistics, 7th ed. by Ken Black
Chapter 11 Analysis of Variance
Multifactor Analysis of Variance
Multifactor Analysis of Variance
Chapter 10 – Part II Analysis of Variance
Presentation transcript:

Copyright © Cengage Learning. All rights reserved. 11 Multifactor Analysis of Variance

Copyright © Cengage Learning. All rights reserved Three-Factor ANOVA

3 To indicate the nature of models and analyses when ANOVA experiments involve more than two factors, we will focus here on the case of three fixed factors—A, B, and C. The numbers of levels of these factors will be denoted by I, J, and K, respectively, and L ijk = the number of observations made with factor A at level i, factor B at level j, and factor C at level k. The analysis is quite complicated when the L ijk ’s are not all equal, so we further specialize to L ijk = L.

4 Three-Factor ANOVA Then X ijkl and x ijkl denote the observed value, before and after the experiment is performed, of the lth replication (l = 1, 2,…, L) when the three factors are fixed at levels i, j, and k. To understand the parameters that will appear in the three- factor ANOVA model, first recall that in two-factor ANOVA with replications, E(X ijk ) =  ij =  +  i +  j + γ ij, where the restrictions  i  i =  j  j = 0,  i γ ij = 0 for every j, and  i γ ij = 0 for every i were necessary to obtain a unique set of parameters.

5 Three-Factor ANOVA If we use dot subscripts on the  ij ’s to denote averaging (rather than summation), then is the effect of factor A at level i averaged over levels of factor B, whereas is the effect of factor A at level i specific to factor B at level j.

6 Three-Factor ANOVA When the effect of A at level i depends on the level of B, there is interaction between the factors, and the γ ij ’s are not all zero. In particular, (11.11)

7 The Fixed Effects Model and Test Procedures

8 The fixed effects model for three-factor ANOVA with L ijk = L is where the  ijkl ’s are normally distributed with mean 0 and variance  2, and The restrictions necessary to obtain uniquely defined parameters are that the sum over any subscript of any parameter on the right-hand side of (11.13) equal 0. (11.12) (11.13)

9 The Fixed Effects Model and Test Procedures The parameters, and are called two-factor interactions, and is called a three-factor interaction; the  i ’s,  j ’s, and  k ’s are the main effects parameters. For any fixed level k of the third factor, analogous to (11.11), is the interaction of the ith level of A with the jth level of B specific to the kth level of C, whereas

10 The Fixed Effects Model and Test Procedures Is the interaction between A at level i and B at level j averaged over levels of C. If the interaction of A at level i and B at level j does not depend on k, then all ’s equal 0. Thus nonzero ’s represent nonadditivity of the two-factor ’s over the various levels of the third factor C. If the experiment included more than three factors, there would be corresponding higher-order interaction terms with analogous interpretations.

11 The Fixed Effects Model and Test Procedures Note that in the previous argument, if we had considered fixing the level of either A or B (rather than C, as was done) and examining the ’s, their interpretation would be the same; if any of the interactions of two factors depend on the level of the third factor, then there are nonzero ’s. When L > 1, there is a sum of squares for each main effect, each two-factor interaction, and the three-factor interaction.

12 The Fixed Effects Model and Test Procedures To write these in a way that indicates how sums of squares are defined when there are more than three factors, note that any of the model parameters in (11.13) can be estimated unbiasedly by averaging X ijkl over appropriate subscripts and taking differences. Thus with other main effects and interaction estimators obtained by symmetry.

13 The Fixed Effects Model and Test Procedures Definition Relevant sums of squares are df = IJKL – 1 df = I – 1 df = (I – 1)(J – 1)

14 The Fixed Effects Model and Test Procedures df = (I – 1)(J – 1)(k – 1) df = IJK(L – 1) with the remaining main effect and two-factor interaction sums of squares obtained by symmetry. SST is the sum of the other eight SSs.

15 The Fixed Effects Model and Test Procedures Each sum of squares (excepting SST) when divided by its df gives a mean square. Expected mean squares are E(MSE) =  2 with similar expressions for the other expected mean squares. Main effect and interaction hypotheses are tested by forming F ratios with MSE in each denominator.

16 The Fixed Effects Model and Test Procedures Null Hypothesis Test Statistic Value Rejection Region Usually the main effect hypotheses are tested only if all interactions are judged not significant. This analysis assumes that L ijk = L > 1. if L = 1, then as in the two-factor case, the highest-order interactions must be assumed absent to obtain an MSE that estimates  2.

17 The Fixed Effects Model and Test Procedures Setting L = 1 and disregarding the fourth subscript summation over l, the foregoing formulas for sums of squares are still valid, and error sum of squares is SSE = with = X ijk in the expression for.

18 Example 10 The following observations (body temperature –100°F) were reported in an experiment to study heat tolerance of cattle (“The Significance of the Coat in Heat Tolerance of Cattle,” Australian J. Agric. Res., 1959: 744–748).

19 Example 10 Measurements were made at four different periods (factor A, with I = 4) on two different strains of cattle (factor B, with J = 2) having four different types of coat (factor C, with K = 4); L = 3 observations were made for each of the 4  2  4 = 32 combinations of levels of the three factors. The table of cell totals (x ijk.’s) for all combinations of the three factors is cont’d

20 Example 10 Figure 11.8 displays plots of the corresponding cell means. We will return to these plots after considering tests of various hypotheses. Figure 11.8 Plots of x ijk. for Example 10 cont’d

21 Example 10 The basis for these tests is the ANOVA table given in Table Table 11.8 ANOVA Table for Example 10 cont’d

22 Example 10 Since F.01,9,64  2.70 and f ABC = MSABC/MSE =.704 does not exceed 2.70, we conclude that three-factor interactions are not significant, However, although the AB interactions are also not significant, both AC and BC interactions as well as all main effects seem to be necessary in the model. When there are no ABC or AB interactions, a plot of the separately for each level of C should reveal no substantial interactions (if only the ABC interactions are zero, plots are more difficult to interpret.) cont’d

23 Latin Square Designs

24 Latin Square Designs When several factors are to be studied simultaneously, an experiment in which there is at least one observation for every possible combination of levels is referred to as a complete layout. If the factors are A, B, and C with I, J, and K levels, respectively, a complete layout requires at least IJK observations. Frequently an experiment of this size is either impracticable because of cost, time, or space constraints or literally impossible.

25 Latin Square Designs For example, if the response variable is sales of a certain product and the factors are different display configurations, different stores, and different time periods, then only one display configuration can realistically be used in a given store during a given time period. A three-factor experiment in which fewer than IJK observations are made is called an incomplete layout. There are some incomplete layouts in which the pattern of combinations of factors is such that the analysis is straightforward.

26 Latin Square Designs One such three factor design is called a Latin square. It is appropriate when I = J = K (e.g., four display configurations, four stores, and four time periods) and all two- and three-factor interaction effects are assumed absent. If the levels of factor A are identified with the rows of a two- way table and the levels of B with the columns of the table, then the defining characteristic of a Latin square design is that every level of factor C appears exactly once in each row and exactly once in each column.

27 Latin Square Designs Figure 11.9 shows examples of 3  3, 4  4, and 5  5 Latin squares. There are 12 different 3  3 Latin squares, and the number of different Latin squares increases rapidly with the number of levels. (e.g., every permutation of rows of a given Latin square yields a Latin square, and similarly for column permutations). Figure 11.9 Examples of Latin squares

28 Latin Square Designs It is recommended that the square used in a an actual experiment be chosen at random from the set of all possible squares of the desired dimension; for further details, consult one of the chapter references. The letter N will denote the common value of I, J, and K. Then a complete layout with one observation per combination would require N 3 observations, whereas a Latin square requires only N 2 observations.

29 Latin Square Designs Once a particular square has been chosen, the value of k (the level of factor C) is completely determined by the values of i and j. To emphasize this, we use x ij(k) to denote the observed value when the three factors are at levels i, j, and k, respectively, with k taking on only one value for each i, j pair. The model equation for a Latin square design is X ij(k) =  +  i +  j +  k +  ij(k) i, j, k = 1,…, N where  i =  j =  k = 0 and  ij(k) ’s the are independent and normally distributed with mean 0 and variance  2.

30 Latin Square Designs We employ the following notation for totals and averages: Note that although X i.. previously suggested a double summation, now it corresponds to a single sum over all j (and the associated values of k).

31 Latin Square Designs Definition Sums of squares for a Latin square experiment are df = N 2 – 1 df = N = 1 df = N – 1

32 Latin Square Designs df = N – 1 df = (N – 1)(N – 2) SST = SSA + SSB + SSC + SSE Each mean square is, of course, the ratio SS/df. For testing H 0C :  1 =  2 =    =  N = 0, the test statistic value is f C = MSC/MSE, with H 0C rejected if f c  F ,N – 1,(N – 1)(N – 2).

33 Latin Square Designs The other two main effect null hypotheses are also rejected if the corresponding F ratio is at least F ,N – 1,(N – 1)(N – 2). If any of the null hypotheses is rejected, significant differences can be identified by using Tukey’s procedure. After computing pairs of sample means (the x i.. ’s, x. j. ’s, or x.. k ’s ) differing by more than w correspond to significant differences between associated factor effects (the  i ’s,  j ’s, or  k ’s ). The hypothesis H 0C is frequently the one of central interest.

34 Latin Square Designs A Latin square design is used to control for extraneous variation in the A and B factors, as was done by a randomized block design for the case of a single extraneous factor. Thus in the product sales example mentioned previously, variation due to both stores and time periods is controlled by a Latin square design, enabling an investigator to test for the presence of effects due to different product-display configurations.

35 Example 11 In an experiment to investigate the effect of relative humidity on abrasion resistance of leather cut from a rectangular pattern (“The Abrasion of Leather,” J. Inter. Soc. Leather Trades’ Chemists, 1946: 287), a 6  6 Latin square was used to control for possible variability due to row and column position in the pattern.

36 Example 11 The six levels of relative humidity studied were 1 = 25%, 2 = 37%, 3 = 50%, 4 = 62%, 5 = 75%, and 6 = 87%, with the following results: Also, x.. 1 = 46.10, x.. 2 = 40.59, x.. 3 = 39.56, x.. 4 = 35.86, x.. 5 = 32.23, x.. 6 = 32.64, x … = cont’d

37 Example 11 Further computations are summarized in Table Since F.05,5,20 = 2.71 and  2.71, H 0C is rejected in favor of the hypothesis that relative humidity does on average affect abrasion resistance. Table 11.9 ANOVA Table for Example 11 cont’d

38 Example 11 To apply Tukey’s procedure, Ordering the x   k ’s and underscoring yields In particular, the lowest relative humidity appears to result in a true average abrasion resistance significantly higher than for any other relative humidity studied. cont’d