1 Chapter 4 Experiments with Blocking Factors
2 4.1 The Randomized Complete Block Design. Nuisance factor: a design factor that probably has an effect on the response, but whose effect is not of interest to the experimenter. Typical nuisance factors include batches of raw material, operators, pieces of test equipment, time (shifts, days, etc.), and different experimental units.
3 If the nuisance factor is known and controllable, we use blocking. If the nuisance factor is known but uncontrollable, we can sometimes use the analysis of covariance (see Chapter 14) to remove its effect from the analysis.
4 If the nuisance factor is unknown and uncontrollable (a “lurking” variable), we hope that randomization balances out its impact across the experiment. Sometimes several sources of variability are combined in a block, so the block becomes an aggregate variable.
5 We wish to determine whether 4 different tips produce different (mean) hardness readings on a Rockwell hardness tester. Each tip is assigned to an experimental unit, that is, a test coupon. In a completely randomized experiment, the test coupons are a source of nuisance variability. Alternatively, the experimenter may want to test the tips across coupons of various hardness levels. This is the need for blocking: the randomized complete block design (RCBD).
6 To conduct this experiment as an RCBD, assign all 4 tips to each coupon. Each coupon is called a “block”; that is, it’s a more homogeneous experimental unit on which to test the tips. Variability between blocks can be large, but variability within a block should be relatively small. In general, a block is a specific level of the nuisance factor. A complete replicate of the basic experiment is conducted in each block. A block represents a restriction on randomization. All runs within a block are randomized.
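The restriction on randomization described above can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed procedure: the function name `rcbd_run_order`, the labels, and the seed are all assumptions made for the example. Every treatment appears once in every block, and the run order is shuffled separately inside each block.

```python
import random

def rcbd_run_order(treatments, blocks, seed=None):
    """Return {block: randomly ordered list containing every treatment once}."""
    rng = random.Random(seed)
    plan = {}
    for block in blocks:
        order = list(treatments)   # a complete replicate goes into each block
        rng.shuffle(order)         # randomize the run order within this block only
        plan[block] = order
    return plan

# Hypothetical labels for the tip/coupon experiment (a = 4 tips, b = 4 coupons).
tips = ["tip 1", "tip 2", "tip 3", "tip 4"]
coupons = ["coupon 1", "coupon 2", "coupon 3", "coupon 4"]
plan = rcbd_run_order(tips, coupons, seed=1)
for coupon, order in plan.items():
    print(coupon, order)
```

Shuffling inside each block, rather than shuffling all 16 runs together, is what distinguishes this layout from a completely randomized design.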
7 Suppose that we use b = 4 blocks: Once again, we are interested in testing the equality of treatment means, but now we have to remove the variability associated with the nuisance factor (the blocks)
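9 Statistical Analysis of the RCBD. Suppose that there are a treatments (factor levels) and b blocks. A statistical model (effects model) for the RCBD is $y_{ij} = \mu + \tau_i + \beta_j + \epsilon_{ij}$, $i = 1, \ldots, a$, $j = 1, \ldots, b$. –$\mu$ is an overall mean, $\tau_i$ is the effect of the ith treatment, and $\beta_j$ is the effect of the jth block –$\epsilon_{ij} \sim \mathrm{NID}(0, \sigma^2)$ –$\sum_{i=1}^{a} \tau_i = 0$ and $\sum_{j=1}^{b} \beta_j = 0$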
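10 Means model for the RCBD: $y_{ij} = \mu_{ij} + \epsilon_{ij}$, with $\mu_{ij} = \mu + \tau_i + \beta_j$. The relevant (fixed effects) hypotheses are $H_0: \mu_1 = \mu_2 = \cdots = \mu_a$ versus $H_1$: at least one $\mu_i \neq \mu_j$, where $\mu_i = \frac{1}{b}\sum_{j=1}^{b}(\mu + \tau_i + \beta_j) = \mu + \tau_i$ because $\sum_j \beta_j = 0$. An equivalent way to write the above hypotheses is $H_0: \tau_1 = \tau_2 = \cdots = \tau_a = 0$ versus $H_1: \tau_i \neq 0$ for at least one $i$. Notation: $y_{i.} = \sum_{j=1}^{b} y_{ij}$ (treatment total), $y_{.j} = \sum_{i=1}^{a} y_{ij}$ (block total), $y_{..} = \sum_{i=1}^{a}\sum_{j=1}^{b} y_{ij}$ (grand total), with averages $\bar{y}_{i.} = y_{i.}/b$, $\bar{y}_{.j} = y_{.j}/a$, $\bar{y}_{..} = y_{..}/(ab)$.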
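11 ANOVA partitioning of total variability: $\sum_{i=1}^{a}\sum_{j=1}^{b}(y_{ij} - \bar{y}_{..})^2 = b\sum_{i=1}^{a}(\bar{y}_{i.} - \bar{y}_{..})^2 + a\sum_{j=1}^{b}(\bar{y}_{.j} - \bar{y}_{..})^2 + \sum_{i=1}^{a}\sum_{j=1}^{b}(y_{ij} - \bar{y}_{i.} - \bar{y}_{.j} + \bar{y}_{..})^2$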
12 $SS_T = SS_{Treatments} + SS_{Blocks} + SS_E$. There are $N = ab$ observations in total, so $SS_T$ has $N - 1$ degrees of freedom. With a treatments and b blocks, $SS_{Treatments}$ and $SS_{Blocks}$ have $a - 1$ and $b - 1$ degrees of freedom, respectively. $SS_E$ has $ab - 1 - (a - 1) - (b - 1) = (a - 1)(b - 1)$ degrees of freedom. From Theorem 3.1, $SS_{Treatments}/\sigma^2$, $SS_{Blocks}/\sigma^2$ and $SS_E/\sigma^2$ are independently distributed chi-square random variables.
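13 The expected values of mean squares: $E(MS_{Treatments}) = \sigma^2 + \frac{b\sum_{i=1}^{a}\tau_i^2}{a - 1}$, $E(MS_{Blocks}) = \sigma^2 + \frac{a\sum_{j=1}^{b}\beta_j^2}{b - 1}$, $E(MS_E) = \sigma^2$. For testing the equality of treatment means, use $F_0 = MS_{Treatments}/MS_E$, which follows an $F_{a-1,(a-1)(b-1)}$ distribution under $H_0$; reject $H_0$ if $F_0 > F_{\alpha, a-1, (a-1)(b-1)}$.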
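14 The ANOVA table:
–Treatments: $SS_{Treatments}$, $a - 1$ degrees of freedom, $MS_{Treatments} = SS_{Treatments}/(a - 1)$, $F_0 = MS_{Treatments}/MS_E$
–Blocks: $SS_{Blocks}$, $b - 1$ degrees of freedom, $MS_{Blocks} = SS_{Blocks}/(b - 1)$
–Error: $SS_E$, $(a - 1)(b - 1)$ degrees of freedom, $MS_E = SS_E/[(a - 1)(b - 1)]$
–Total: $SS_T$, $N - 1$ degrees of freedom
Computing formulas: $SS_T = \sum_{i=1}^{a}\sum_{j=1}^{b} y_{ij}^2 - \frac{y_{..}^2}{N}$, $SS_{Treatments} = \frac{1}{b}\sum_{i=1}^{a} y_{i.}^2 - \frac{y_{..}^2}{N}$, $SS_{Blocks} = \frac{1}{a}\sum_{j=1}^{b} y_{.j}^2 - \frac{y_{..}^2}{N}$, $SS_E = SS_T - SS_{Treatments} - SS_{Blocks}$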
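The usual computing formulas for the RCBD (sums of squares from treatment totals, block totals and the grand total, with the error sum of squares obtained by subtraction) translate directly into code. The sketch below is one possible pure-Python version; the function name `rcbd_anova` and the small table of coded responses are assumptions for illustration and are not taken from the slides.

```python
def rcbd_anova(y):
    """RCBD ANOVA quantities; y[i][j] is treatment i observed in block j."""
    a, b = len(y), len(y[0])
    n = a * b
    grand_total = sum(sum(row) for row in y)                    # y..
    correction = grand_total ** 2 / n                           # y..^2 / N
    trt_totals = [sum(row) for row in y]                        # y_i.
    blk_totals = [sum(row[j] for row in y) for j in range(b)]   # y_.j

    ss_total = sum(v * v for row in y for v in row) - correction
    ss_trt = sum(t * t for t in trt_totals) / b - correction
    ss_blk = sum(t * t for t in blk_totals) / a - correction
    ss_err = ss_total - ss_trt - ss_blk                         # by subtraction

    df_trt, df_blk, df_err = a - 1, b - 1, (a - 1) * (b - 1)
    ms_trt, ms_err = ss_trt / df_trt, ss_err / df_err
    return {
        "ss_total": ss_total, "ss_trt": ss_trt, "ss_blk": ss_blk, "ss_err": ss_err,
        "df_trt": df_trt, "df_blk": df_blk, "df_err": df_err,
        "f0": ms_trt / ms_err,
    }

# Illustrative coded responses (assumed): 4 treatments (rows) x 4 blocks (columns).
y = [
    [-2, -1, 1, 5],
    [-1, -2, 3, 4],
    [-3, -1, 0, 2],
    [2, 1, 5, 7],
]
result = rcbd_anova(y)
print(result)
```

For these illustrative numbers the partition is 129 = 38.5 + 82.5 + 8 with 9 error degrees of freedom, giving F0 of about 14.44. Comparing F0 with a tabulated F percentage point is left out so that the sketch needs no external library.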
16 To conduct this experiment as an RCBD, assign all 4 pressures to each of the 6 batches of resin. Each batch of resin is called a “block”; that is, it’s a more homogeneous experimental unit on which to test the extrusion pressures.
Model Adequacy Checking. Residual Analysis. Residual: $e_{ij} = y_{ij} - \hat{y}_{ij} = y_{ij} - \bar{y}_{i.} - \bar{y}_{.j} + \bar{y}_{..}$. Basic residual plots indicate that the normality and constant variance assumptions are satisfied. No obvious problems with randomization.
22 Multiple Comparisons (Fisher LSD)
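For reference, the Fisher LSD comparison in an RCBD can be written as follows. This is a sketch under the usual assumptions: each treatment mean is an average of b observations (one per block), and the error degrees of freedom are $(a-1)(b-1)$ rather than the value used in a completely randomized design.

```latex
\mathrm{LSD} = t_{\alpha/2,\,(a-1)(b-1)} \sqrt{\frac{2\,MS_E}{b}},
\qquad
\text{declare } \mu_i \neq \mu_j \text{ if } \left|\bar{y}_{i.} - \bar{y}_{j.}\right| > \mathrm{LSD}.
```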
23 Can also plot residuals versus the type of tip (residuals by factor) and versus the blocks. Also plot residuals versus the fitted values. These plots provide more information about the constant variance assumption and possible outliers. Some Other Aspects of the Randomized Complete Block Design: The model for the RCBD is completely additive.
24 Interactions? For example: The treatments and blocks are random. Choice of sample size: –As the number of blocks increases, the number of replicates and the number of error degrees of freedom also increase. –
25 Estimating missing values: –Approximate analysis: estimate the missing value and then do the ANOVA. –Assume the missing value is x. Minimize $SS_E$ with respect to x to find it. –The error degrees of freedom are reduced by 1 for each missing value estimated.
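Minimizing $SS_E$ over the single missing value x has a closed form: setting the derivative to zero gives $x = \frac{a\,y'_{i.} + b\,y'_{.j} - y'_{..}}{(a-1)(b-1)}$, where the primed totals are computed without the missing observation. The sketch below evaluates that formula and checks numerically that it does give a smaller $SS_E$ than nearby values; the helper names and the coded table are assumptions for illustration only.

```python
def estimate_missing(y, i, j):
    """Estimate the missing cell y[i][j] (stored as None) of an a x b RCBD table."""
    a, b = len(y), len(y[0])
    trt_total = sum(v for v in y[i] if v is not None)                # y'_i.
    blk_total = sum(row[j] for row in y if row[j] is not None)       # y'_.j
    grand_total = sum(v for row in y for v in row if v is not None)  # y'_..
    return (a * trt_total + b * blk_total - grand_total) / ((a - 1) * (b - 1))

def ss_error(y):
    """Error sum of squares of a complete a x b RCBD table."""
    a, b = len(y), len(y[0])
    correction = sum(sum(row) for row in y) ** 2 / (a * b)
    ss_total = sum(v * v for row in y for v in row) - correction
    ss_trt = sum(sum(row) ** 2 for row in y) / b - correction
    ss_blk = sum(sum(row[j] for row in y) ** 2 for j in range(b)) / a - correction
    return ss_total - ss_trt - ss_blk

def filled(y, i, j, value):
    """Copy of y with cell (i, j) replaced by value."""
    return [[value if (r, c) == (i, j) else v for c, v in enumerate(row)]
            for r, row in enumerate(y)]

# Illustrative coded responses (assumed); None marks the missing observation.
y = [
    [-2, -1, 1, 5],
    [-1, -2, None, 4],
    [-3, -1, 0, 2],
    [2, 1, 5, 7],
]
x = estimate_missing(y, 1, 2)
print(x, ss_error(filled(y, 1, 2, x)))
```

After filling in x, the ANOVA proceeds as usual except that the error degrees of freedom drop by one, here from 9 to 8.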
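Estimating Model Parameters and the General Regression Significance Test. The linear statistical model: $y_{ij} = \mu + \tau_i + \beta_j + \epsilon_{ij}$. The normal equations: $ab\hat{\mu} + b\sum_{i=1}^{a}\hat{\tau}_i + a\sum_{j=1}^{b}\hat{\beta}_j = y_{..}$; $b\hat{\mu} + b\hat{\tau}_i + \sum_{j=1}^{b}\hat{\beta}_j = y_{i.}$, $i = 1, \ldots, a$; $a\hat{\mu} + \sum_{i=1}^{a}\hat{\tau}_i + a\hat{\beta}_j = y_{.j}$, $j = 1, \ldots, b$.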
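28 Under the constraints $\sum_{i}\hat{\tau}_i = 0$ and $\sum_{j}\hat{\beta}_j = 0$, the solution is $\hat{\mu} = \bar{y}_{..}$, $\hat{\tau}_i = \bar{y}_{i.} - \bar{y}_{..}$, $\hat{\beta}_j = \bar{y}_{.j} - \bar{y}_{..}$, and the fitted values are $\hat{y}_{ij} = \hat{\mu} + \hat{\tau}_i + \hat{\beta}_j = \bar{y}_{i.} + \bar{y}_{.j} - \bar{y}_{..}$. The sum of squares for fitting the full model: $R(\mu, \tau, \beta) = \hat{\mu}y_{..} + \sum_{i}\hat{\tau}_i y_{i.} + \sum_{j}\hat{\beta}_j y_{.j} = \sum_{i}\frac{y_{i.}^2}{b} + \sum_{j}\frac{y_{.j}^2}{a} - \frac{y_{..}^2}{ab}$. The error sum of squares: $SS_E = \sum_{i}\sum_{j} y_{ij}^2 - R(\mu, \tau, \beta)$.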
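29 The sum of squares due to treatments: $R(\tau \mid \mu, \beta) = R(\mu, \tau, \beta) - R(\mu, \beta) = \sum_{i}\frac{y_{i.}^2}{b} - \frac{y_{..}^2}{ab}$, where $R(\mu, \beta) = \sum_{j}\frac{y_{.j}^2}{a}$ is the sum of squares for fitting the reduced model $y_{ij} = \mu + \beta_j + \epsilon_{ij}$.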
The Latin Square Design. The RCBD removes a known and controllable nuisance variable. Example: the effects of five different formulations of a rocket propellant used in aircrew escape systems on the observed burning rate. –Remove two nuisance factors: batches of raw material and operators. Latin square design: rows and columns are orthogonal to treatments.
31 The Latin square design is used to eliminate two nuisance sources of variability, and allows blocking in two directions (rows and columns). Usually a Latin square is a $p \times p$ square in which each cell contains one of the p letters that correspond to the treatments, and each letter occurs once and only once in each row and column. See Page 139.
32 The statistical (effects) model is $y_{ijk} = \mu + \alpha_i + \tau_j + \beta_k + \epsilon_{ijk}$, $i, j, k = 1, \ldots, p$. –$y_{ijk}$ is the observation in the ith row and kth column for the jth treatment, $\mu$ is the overall mean, $\alpha_i$ is the ith row effect, $\tau_j$ is the jth treatment effect, $\beta_k$ is the kth column effect and $\epsilon_{ijk}$ is the random error. –This model is completely additive. –Only two of the three subscripts are needed to denote a particular observation.
33 Sum of squares: $SS_T = SS_{Rows} + SS_{Columns} + SS_{Treatments} + SS_E$. The degrees of freedom: $p^2 - 1 = (p - 1) + (p - 1) + (p - 1) + (p - 2)(p - 1)$. The appropriate statistic for testing for no differences in treatment means is $F_0 = MS_{Treatments}/MS_E$, which follows an $F_{p-1,(p-2)(p-1)}$ distribution under $H_0$. ANOVA table
34 Least squares estimates of the model parameters $\mu$, $\alpha_i$, $\tau_j$, $\beta_k$
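35 Under the constraints $\sum_{i}\hat{\alpha}_i = \sum_{j}\hat{\tau}_j = \sum_{k}\hat{\beta}_k = 0$, the estimates are $\hat{\mu} = \bar{y}_{...}$, $\hat{\alpha}_i = \bar{y}_{i..} - \bar{y}_{...}$, $\hat{\tau}_j = \bar{y}_{.j.} - \bar{y}_{...}$, $\hat{\beta}_k = \bar{y}_{..k} - \bar{y}_{...}$.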
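38 The residuals: $e_{ijk} = y_{ijk} - \hat{y}_{ijk} = y_{ijk} - \bar{y}_{i..} - \bar{y}_{.j.} - \bar{y}_{..k} + 2\bar{y}_{...}$. If one observation is missing, it is estimated by $y_{ijk} = \frac{p(y'_{i..} + y'_{.j.} + y'_{..k}) - 2y'_{...}}{(p - 2)(p - 1)}$, where the primes denote totals computed with the missing value excluded.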
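39 Standard Latin square: the first row and first column are written in alphabetical order. Random order: arrange the rows, columns and letters at random.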
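One way to picture a standard Latin square and its randomization is the sketch below. It builds the cyclic standard square (first row and first column in alphabetical order), then permutes rows, columns and letters at random and confirms that the result is still a Latin square. The function names and seed are assumptions for illustration; note also that shuffling one cyclic square does not sample uniformly from all Latin squares of that size.

```python
import random

def standard_latin_square(p):
    """Cyclic p x p square whose first row and first column are alphabetical."""
    letters = [chr(ord("A") + t) for t in range(p)]
    return [[letters[(i + j) % p] for j in range(p)] for i in range(p)]

def randomize_latin_square(square, seed=None):
    """Randomly permute the rows, the columns and the letter labels."""
    rng = random.Random(seed)
    p = len(square)
    rows = list(range(p))
    cols = list(range(p))
    rng.shuffle(rows)
    rng.shuffle(cols)
    letters = sorted({x for row in square for x in row})
    shuffled = letters[:]
    rng.shuffle(shuffled)
    relabel = dict(zip(letters, shuffled))
    return [[relabel[square[r][c]] for c in cols] for r in rows]

def is_latin_square(square):
    """True if every symbol occurs exactly once in each row and each column."""
    p = len(square)
    symbols = set(square[0])
    rows_ok = all(len(row) == p and set(row) == symbols for row in square)
    cols_ok = all({row[j] for row in square} == symbols for j in range(p))
    return len(symbols) == p and rows_ok and cols_ok

standard = standard_latin_square(5)      # p = 5, as in the propellant example
design = randomize_latin_square(standard, seed=0)
for row in design:
    print(" ".join(row))
print(is_latin_square(design))
```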
40 Replication of Latin Squares: The same batches and the same operators
41 Replication of Latin Squares: The same batches and different operators
42 Replication of Latin Squares: Different batches and different operators
The Graeco-Latin Square Design. Graeco-Latin square: –Two Latin squares are superimposed. –One uses Greek letters and the other uses Latin letters. –The two Latin squares are orthogonal. –Table 4.17 –Blocking in three directions –Four factors (row, column, Latin letter and Greek letter) –Each factor has p levels. Total of $p^2$ runs.
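Orthogonality of the two squares means that every (Latin letter, Greek letter) pair occurs exactly once among the $p^2$ cells. A minimal sketch for odd p: take the Latin letter in cell (i, j) as $(i + j) \bmod p$ and the Greek letter as $(2i + j) \bmod p$. This particular construction, the function names and the spelled-out Greek names are assumptions chosen for illustration; it is not necessarily the square shown in Table 4.17.

```python
LATIN = "ABCDEFG"
GREEK = ["alpha", "beta", "gamma", "delta", "epsilon", "zeta", "eta"]

def graeco_latin_square(p):
    """p x p Graeco-Latin square for odd p <= 7; cell = (Latin letter, Greek letter)."""
    if p % 2 == 0 or not 3 <= p <= len(LATIN):
        raise ValueError("this simple construction assumes odd p between 3 and 7")
    return [[(LATIN[(i + j) % p], GREEK[(2 * i + j) % p]) for j in range(p)]
            for i in range(p)]

def is_graeco_latin(square):
    """Check both Latin-square properties and the orthogonality of the pair."""
    p = len(square)
    pairs = {cell for row in square for cell in row}
    if len(pairs) != p * p:          # orthogonal: every pair occurs exactly once
        return False
    for part in (0, 1):              # 0 = Latin letters, 1 = Greek letters
        for i in range(p):
            if len({square[i][j][part] for j in range(p)}) != p:   # row i
                return False
            if len({square[j][i][part] for j in range(p)}) != p:   # column i
                return False
    return True

square = graeco_latin_square(5)
for row in square:
    print(row)
print(is_graeco_latin(square))
```

The construction fails for even p because $2i \bmod p$ then repeats within a column, which is why the sketch rejects those sizes rather than returning an invalid square.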
45 The statistical model: $y_{ijkl} = \mu + \theta_i + \tau_j + \omega_k + \Psi_l + \epsilon_{ijkl}$ –$y_{ijkl}$ is the observation in the ith row and lth column for Latin letter j and Greek letter k. –$\mu$ is the overall mean, $\theta_i$ is the ith row effect, $\tau_j$ is the effect of Latin letter treatment j, $\omega_k$ is the effect of Greek letter treatment k, $\Psi_l$ is the effect of column l. –ANOVA table (Table 4.18) –Under $H_0$, the test statistic follows an $F_{p-1,(p-3)(p-1)}$ distribution.
Example 4.4 –Add a block factor: 5 test assemblies 47
Balanced Incomplete Block Designs. We may not be able to run all the treatment combinations in each block. Randomized incomplete block designs: in a balanced incomplete block design (BIBD), any two treatments appear together an equal number of times. There are a treatments and each block can hold exactly k (k < a) treatments. For example: a chemical process is a function of the type of catalyst employed.
Statistical Analysis of the BIBD. There are a treatments and b blocks. Each block contains k treatments, and each treatment occurs r times. There are N = ar = bk total observations. The number of times each pair of treatments appears in the same block is $\lambda = \frac{r(k - 1)}{a - 1}$. The statistical model for the BIBD is $y_{ij} = \mu + \tau_i + \beta_j + \epsilon_{ij}$.
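The counting relations for a BIBD, N = ar = bk and $\lambda(a - 1) = r(k - 1)$, can be checked mechanically for a candidate design. The sketch below uses a hypothetical design made of all 3-treatment subsets of 4 treatments; the function name `bibd_parameters` and the design itself are assumptions for illustration.

```python
from itertools import combinations

def bibd_parameters(blocks, treatments):
    """Return a, b, k, r, lambda if the design is balanced, otherwise None."""
    a, b = len(treatments), len(blocks)
    k_values = {len(block) for block in blocks}
    r_values = {sum(t in block for block in blocks) for t in treatments}
    pair_counts = {
        sum(s in block and t in block for block in blocks)
        for s, t in combinations(treatments, 2)
    }
    # Balanced: one block size, one replication count, one pair count.
    if len(k_values) != 1 or len(r_values) != 1 or len(pair_counts) != 1:
        return None
    return {"a": a, "b": b, "k": k_values.pop(),
            "r": r_values.pop(), "lambda": pair_counts.pop()}

# Hypothetical design: every 3-subset of 4 treatments forms one block.
treatments = [1, 2, 3, 4]
blocks = [set(c) for c in combinations(treatments, 3)]
params = bibd_parameters(blocks, treatments)
print(params)
```

For this design a = b = 4, k = r = 3 and each pair of treatments shares 2 blocks, so $\lambda(a - 1) = 2 \cdot 3 = r(k - 1) = 3 \cdot 2$ and N = ar = bk = 12.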
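51 The sum of squares: $SS_T = SS_{Treatments(adjusted)} + SS_{Blocks} + SS_E$, with $SS_T = \sum_{i}\sum_{j} y_{ij}^2 - \frac{y_{..}^2}{N}$, $SS_{Blocks} = \frac{1}{k}\sum_{j=1}^{b} y_{.j}^2 - \frac{y_{..}^2}{N}$, and $SS_{Treatments(adjusted)} = \frac{k\sum_{i=1}^{a} Q_i^2}{\lambda a}$, where $Q_i = y_{i.} - \frac{1}{k}\sum_{j=1}^{b} n_{ij}y_{.j}$ is the adjusted total for treatment i and $n_{ij} = 1$ if treatment i appears in block j and 0 otherwise.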
52 The degrees of freedom: –Treatments (adjusted): $a - 1$ –Error: $N - a - b + 1$. The test statistic for testing equality of the treatment effects: $F_0 = MS_{Treatments(adjusted)}/MS_E$. ANOVA table
Example 4.5 The contrast sum of squares 54
Least Squares Estimation of the Parameters
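56 The least squares normal equations: $N\hat{\mu} + r\sum_{i=1}^{a}\hat{\tau}_i + k\sum_{j=1}^{b}\hat{\beta}_j = y_{..}$; $r\hat{\mu} + r\hat{\tau}_i + \sum_{j=1}^{b} n_{ij}\hat{\beta}_j = y_{i.}$, $i = 1, \ldots, a$; $k\hat{\mu} + \sum_{i=1}^{a} n_{ij}\hat{\tau}_i + k\hat{\beta}_j = y_{.j}$, $j = 1, \ldots, b$. Under the constraints $\sum_{i}\hat{\tau}_i = \sum_{j}\hat{\beta}_j = 0$, we have $\hat{\mu} = \bar{y}_{..}$ and $\hat{\tau}_i = \frac{kQ_i}{\lambda a}$, $i = 1, \ldots, a$, where $Q_i = y_{i.} - \frac{1}{k}\sum_{j=1}^{b} n_{ij}y_{.j}$ and $n_{ij} = 1$ if treatment i appears in block j and 0 otherwise.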