Principles in the design of multiphase experiments with a later laboratory phase: orthogonal designs Chris Brien 1, Bronwyn Harch 2, Ray Correll 2 & Rosemary.

Slides:



Advertisements
Similar presentations
Multistage Sampling.
Advertisements

Advanced Piloting Cruise Plot.
Introductory Mathematics & Statistics for Business
STATISTICS Linear Statistical Models
STATISTICS INTERVAL ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
By: Saad Rais, Statistics Canada Zdenek Patak, Statistics Canada
SMA 6304 / MIT / MIT Manufacturing Systems Lecture 11: Forecasting Lecturer: Prof. Duane S. Boning Copyright 2003 © Duane S. Boning. 1.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Exit a Customer Chapter 8. Exit a Customer 8-2 Objectives Perform exit summary process consisting of the following steps: Review service records Close.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
Year 6 mental test 5 second questions
Analysis by design Statistics is involved in the analysis of data generated from an experiment. It is essential to spend time and effort in advance to.
Overview of Lecture Partitioning Evaluating the Null Hypothesis ANOVA
C82MST Statistical Methods 2 - Lecture 2 1 Overview of Lecture Variability and Averages The Normal Distribution Comparing Population Variances Experimental.
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
1 Correlation and Simple Regression. 2 Introduction Interested in the relationships between variables. What will happen to one variable if another is.
Multistage Sampling Module 3 Session 9.
SADC Course in Statistics Sampling design using the Paddy game (Sessions 15&16)
SADC Course in Statistics Objectives and analysis Module B2, Session 14.
Robust microarrary experiments by design: a multiphase framework Chris Brien Phenomics & Bioinformatics Research Centre, University of South Australia.
Experimental and Quasiexperimental Designs Chapter 10 Copyright © 2009 Elsevier Canada, a division of Reed Elsevier Canada, Ltd.
3/2003 Rev 1 I – slide 1 of 33 Session I Part I Review of Fundamentals Module 2Basic Physics and Mathematics Used in Radiation Protection.
Chapter 11: Models of Computation
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
Chapter 18 Methodology – Monitoring and Tuning the Operational System Transparencies © Pearson Education Limited 1995, 2005.
(This presentation may be used for instructional purposes)
ABC Technology Project
Multiphase experiments in the biological sciences Chris Brien Phenomics and Bioinformatics Research Centre, University of South Australia Joint work with:
1 Panel Data Analysis – Advantages and Challenges Cheng Hsiao.
Addition 1’s to 20.
25 seconds left…...
Determining How Costs Behave
Week 1.
We will resume in: 25 Minutes.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 12 View Design and Integration.
Chapter Thirteen The One-Way Analysis of Variance.
Formulating mixed models for experiments, including longitudinal experiments (accepted for publication in JABES) Chris Brien 1 & Clarice Demétrio 2 1 University.
Chapter 11: The t Test for Two Related Samples
Experimental Design and Analysis of Variance
1 Chapter 20: Statistical Tests for Ordinal Data.
Simple Linear Regression Analysis
How Cells Obtain Energy from Food
Multiple Regression and Model Building
Copyright © 2010 Pearson Addison-Wesley. All rights reserved. Chapter 14 Factorial Experiments (Two or More Factors)
Robust microarray experiments by design: a multiphase framework Chris Brien Phenomics & Bioinformatics Research Centre, University of South Australia
A tale of randomization Chris Brien Phenomics & Bioinformatics Research Centre, University of South Australia. The Australian Centre for Plant Functional.
1 Review Lecture: Guide to the SSSII Assignment Gwilym Pryce 5 th March 2006.
Factor-allocation in gene- expression microarray experiments Chris Brien Phenomics and Bioinformatics Research Centre University of South Australia
Multiple Comparisons in Factorial Experiments
SPH 247 Statistical Analysis of Laboratory Data 1April 2, 2013SPH 247 Statistical Analysis of Laboratory Data.
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
Experiments with both nested and “crossed” or factorial factors
Principles in the design of multiphase experiments with a later laboratory phase: orthogonal designs Chris Brien 1, Bronwyn Harch 2, Ray Correll 2 & Rosemary.
Nested and Split Plot Designs. Nested and Split-Plot Designs These are multifactor experiments that address common economic and practical constraints.
Formulating mixed models for experiments, including longitudinal experiments [JABES (2009) 14, ] Chris Brien 1 & Clarice Demétrio 2 1 University.
Biostatistics-Lecture 9 Experimental designs Ruibin Xi Peking University School of Mathematical Sciences.
Fixed vs. Random Effects
Experimental Design An Experimental Design is a plan for the assignment of the treatments to the plots in the experiment Designs differ primarily in the.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
Lesson Observational Studies, Experiments, and Simple Random Sampling.
Chris Brien University of South Australia Bronwyn Harch Ray Correll CSIRO Mathematical and Information Sciences Design and analysis.
The Mixed Effects Model - Introduction In many situations, one of the factors of interest will have its levels chosen because they are of specific interest.
ANOVA Overview of Major Designs. Between or Within Subjects Between-subjects (completely randomized) designs –Subjects are nested within treatment conditions.
Designing Experiments
Repeated Measures Designs
Randomizing and checking standard and multiphase designs using the R package dae Chris Brien Phenomics and Bioinformatics Research Centre, University.
Comparing Three or More Means
Chapter 10 – Part II Analysis of Variance
Presentation transcript:

Principles in the design of multiphase experiments with a later laboratory phase: orthogonal designs Chris Brien 1, Bronwyn Harch 2, Ray Correll 2 & Rosemary Bailey 3 1 University of South Australia, 2 CSIRO Mathematics, Informatics & Statistics, 3 Queen Mary University of London

Outline 1.Primary experimental design principles 2.Factor-allocation description for standard designs. 3.Principles for simple multiphase experiments. 4.Principles leading to complications, even with orthogonality. 5.Summary 2

1)Primary experimental design principles Principle 1 (Evaluate designs with skeleton ANOVA tables) Use whether or not data to be analyzed by ANOVA. Principle 2 (Fundamentals): Use randomization, replication and blocking or local control. Principle 3 (Minimize variance): Block entities to form new entities, within new entities being more homogeneous; assign treatments to least variable entity-type. Principle 4 (Split units): confound some treatment sources with more variable sources if some treatment factors: i. require larger units than others, ii. are expected to have a larger effect, or iii. are of less interest than others. 3

A standard agricultural example 8 treatments combinations of 4 harvest dates and 2 different fertilizers – are to be investigated for a plant crop. During harvest the yield is to be measured. Assume differences between harvest dates expected to be larger than between fertilizers. A split-plot design is to be used: It will have 3 blocks each containing 4 main plots, with each main plot split into 2 subplots; Randomize 4 harvest dates to 4 main plots in a block and 2 fertilizers to 2 subplots in a main plot. This design employs Principles 2, 3 and 4(ii). 4

2) Factor-allocation description for standard designs Standard designs involve a single allocation in which a set of treatments is assigned to a set of units: treatments are whatever are allocated; units are what treatments are allocated to; treatments and units each referred to as a set of objects; Often do by randomization using a permutation of the units. More generally treatments are allocated to units e.g. using a spatial design or systematically Each set of objects is indexed by a set of factors: Unit or unallocated factors (indexing units); Treatment or allocated factors (indexing treatments). Represent the allocation using factor-allocation diagrams that have a panel for each set of objects with: a list of the factors; their numbers of levels; their nesting relationships. 5 (Nelder, 1965; Brien, 1983; Brien & Bailey, 2006)

Cf. single set of factors uniquely indexing observations: {Blocks, Harvests, Fertilizers}. A crucial feature of diagram is that it shows EU and restrictions on randomization/allocation. Factor-allocation diagram for the standard agricultural experiment 6 One allocation (randomization): a set of treatments to a set of subplots. 4Harvests 2Fertilizers 8 treatments 3Blocks 4Mainplots in B 2Subplots in B, M 24 subplots The set of factors belonging to a set of objects forms a tier: {Harvests, Fertilizers} or {Blocks, Mainplots, Subplots}; they have the same status in the allocation (randomization); Textbook experiments are two-tiered. (e.g. Searle et al., 1992)

Some derived items Sets of generalized factors (terms in the mixed model): Blocks, Blocks Mainplots, Blocks Mainplots Subplots; Harvests, Fertilizers, Harvests Fertilizers. Corresponding types of entities (groupings of objects): block, main plot, subplot (last two are EUs); harvest, fertilizer, treatment (harvest-fertilizer combination). Corresponding sources (in an ANOVA): Blocks, Mainplots[B], Subplots[B M]; Harvests, Fertilizers, Harvests#Fertilizers. 7 4Harvests 2Fertilizers 8 treatments 3Blocks 4Mainplots in B 2Subplots in B, M 24 subplots

Skeleton ANOVA Harvests is confounded with the more-variable Mainplots[B] & Fertilizers with Subplots[B^M]. 8 4Harvests 2Fertilizers 8 treatments 3Blocks 4Mainplots in B 2Subplots in B, M 24 subplots

3)Principles for simple multiphase experiments Suppose in the agricultural experiment: in addition to yield measured during harvest, the amount of Na in the plant is to be measured for each subplot, and the harvested material is transported to the laboratory for drying and analysis. The experiment is two phase: field and laboratory phases. The outcome of the field phase is yield and batches of plant material. The outcome of the laboratory phase is the amount of Na. How to process the specimens from the first phase in the laboratory phase? 9

Some principles Principle 5 (Simplicity desirable): assign first-phase units to laboratory units so that each first-phase source is confounded with a single laboratory source. Use composed randomizations with an orthogonal design. Principle 6 (Preplan all): if possible. Principle 7 (Allocate all and randomize in laboratory): always allocate all treatment and unit factors and randomize first-phase units and lab treatments. Principle 8 (Big with big): Confound big first-phase sources with big laboratory sources, provided no confounding of treatment with first-phase sources. 10

A simple two-phase agricultural experiment Simplest is to randomize batches from a subplot to locations (in time or space) during the laboratory phase. 11 4Harvests 2Fertilizers 8 treatments 3Blocks 4Mainplots in B 2Subplots in B, M 24 subplots 24Locations 24 locations Composed randomizations (Brien & Bailey, 2006)

A simple two-phase agricultural experiment (contd) No. subplots = no. locations = 24 and so subplots sources exhaust locations sources. Cannot separately estimate locations and subplots variability, but can estimate their sum. However want to block the laboratory phase. 12

A simple two-phase agricultural experiment (contd) Simplest is to align lab-phase and first-phase blocking. 13 4Harvests 2Fertilizers 8 treatments 3Blocks 4Mainplots in B 2Subplots in B, M 24 subplots 3Intervals 8Locations in I 24 locations Note Blocks confounded with Intervals (i.e. Big with Big). Composed randomizations

The multiphase law DF for sources from a previous phase can never be increased as a result of the laboratory-phase design. However, it is possible that first-phase sources are split into two or more sources, each with fewer degrees of freedom than the original source. 14 DF for first phase sources unaffected.

4)Principles leading to complications, even with orthogonality Principle 9 (Use pseudofactors): An elegant way to split sources (as opposed to introducing grouping factors unconnected to real sources of variability). Principle 10 (Compensating across phases): Sometimes, if something is confounded with more variable first- phase source, can confound with less variable lab source. Principle 11 (Laboratory replication): Replicate laboratory analysis of first-phase units if lab variability much greater than 1 st -phase variation; Often involves splitting product from the first phase into portions (e.g. batches of harvested crop, wines, blood specimens into aliquots, drops, lots, samples and fractions). Principle 12 (Laboratory treatments): Sometimes treatments are introduced in the laboratory phase and this involves extra randomization. 15

A three-phase agricultural experiment with laboratory treatments Suppose that material from a harvest date cannot be stored and must be analysed before the next harvest. Also, 3 different methods are to be used to measure Na. For this, the material from each subplot is divided into 3 portions and the Methods randomized to them. For each harvest, there are 18 portions to be dried and analysed. There are three ovens to be used simultaneously for the drying: they differ in speed of drying and can only fit 6 portions for drying blocks will be randomized to ovens. Piepho et al. (2003)

A three-phase agricultural experiment with laboratory treatments – drying phase Problem: 12 main plots to assign to 4 dates: Use M 1 to group the 4 main plots to be dried on the same date. An alternative is to introduce a grouping factor, say MainGroups, o but, while in the analysis, not an anticipated variability source. Dashed arrow indicates systematic allocation. The blocks are randomized to ovens (Big with Big). The 2 subplots x 3 portions in a main plot are randomized to the 6 locations in an oven on a date. The methods are randomized to portions. 17 4Harvests 2Fertilizers 8 field treatments 4Mainplots in B 3Blocks 2Subplots in B, M 3Portions in B, M, S 72 portions 4Dates 3Ovens in D 6Locations in D, O 72 locations 4M14M1 3Methods 3 lab treatments

A three-phase agricultural experiment with laboratory treatments – lab phase After drying, the analysis to measure Na must be done. In what order will this be done? All 18 dried portions for a harvest will be done together. Then, could randomize the order of analysis for these or could group according to ovens. The latter, so that analyses are performed in batches of Harvests 2Fertilizers 8 field treatments 4Mainplots in B 3Blocks 2Subplots in B, M 3Portions in B, M, S 72 portions 4Dates 3Ovens in D 6Locations in D, O 72 locations 4M14M1 3Methods 3 lab treatments 4Times 3Intervals in T 6Analyses in T, I 72 analyses

A three-phase agricultural experiment with laboratory treatments (contd) 19 Note split source 4Harvests 2Fertilizers 8 field treatments 4Mainplots in B 3Blocks 2Subplots in B, M 3Portions in B, M, S 72 portions 4Dates 3Ovens in D 6Locations in D, O 72 locations 4M14M1 3Methods 3 lab treatments 4Times 3Intervals in T 6Analyses in T, I 72 analyses

A three-phase agricultural experiment with laboratory treatments (contd) 20 Harvests confounded with Times & Dates. No. analyses = no. locations = no. portions = 72: so portions sources exhaust locations sources exhaust analyses sources. Consequently, not all terms are estimable.

Mixed models Mixed models can be easily obtained from the factor- allocation description using Brien & Demétrios (2009) method: In each panel, form terms as the set of generalized factors for that panel all combinations of the factors, subject to nesting restrictions; For each term from each panel, add to either fixed or random model. 21

Mixed model for 3-phase experiment Full mixed model is: Harvests + Fertilizers + Harvests Fertilizers + Methods | Blocks + Blocks Mainplots + Blocks Mainplots Subplots + Blocks Mainplots Subplots Portions + Dates + Dates Ovens + Dates Ovens Locations + Times + Times Intervals + Times Intervals Analyses. Pseudofactors not needed in mixed model (cf. grouping factors). 22 4Harvests 2Fertilizers 8 field treatments 4Mainplots in B 3Blocks 2Subplots in B, M 3Portions in B, M, S 72 portions 4Dates 3Ovens in D 6Locations in D, O 72 locations 4M14M1 3Methods 3 lab treatments 4Times 3Intervals in T 6Analyses in T, I 72 analyses

Mixed model of convenience (for fitting) Full mixed model is: Harvests + Fertilizers + Harvests Fertilizers + Methods | Blocks + Blocks Mainplots + Blocks Mainplots Subplots + Blocks Mainplots Subplots Portions + Dates + Dates Ovens + Dates Ovens Locations + Times + Times Intervals + Times Intervals Analyses. 23 Model will not fit because of confounding (see ANOVA). Mixed model of convenience (a term for each line in the ANOVA): Harvests + Fertilizers + Harvests Fertilizers + Methods | Blocks Mainplots + Blocks Mainplots Subplots + Blocks Mainplots Subplots Portions. This model of convenience does not reflect all the sources of variability in the experiment. For example, the Residual is not just portions variability, but has locations and analyses variability contributing to it also.

24 5)Summary Have provided 4 standard principles and 8 principles specific to orthogonal, multiphase designs. Extension to nonorthogonal designs is coming.

25 References Brien, C. J. (1983). Analysis of variance tables based on experimental structure. Biometrics, 39, 53–59. Brien, C.J., and Bailey, R.A. (2006) Multiple randomizations (with discussion). J. Roy. Statist. Soc., Ser. B, 68, 571–609. Brien, C.J. and Demétrio, C.G.B. (2009) Formulating mixed models for experiments, including longitudinal experiments. Journal of Agricultural, Biological and Environmental Statistics, 14, 253–80. Brien, C.J., Harch, B.D., Correll, R.L. and Bailey, R.A. (2011) Multiphase experiments with laboratory phases subsequent to the initial phase. I. Orthogonal designs. Journal of Agricultural, Biological and Environmental Statistics, 16, 422–450. Nelder, J. A. (1965). The analysis of randomized experiments with orthogonal block structure. Proceedings of the Royal Society of London, Series A, 283(1393), , 163–178. Piepho, H.P., Büchse, A. and Emrich, K. (2003) A hitchhikers guide to mixed models for randomized experiments. Journal of Agronomy and Crop Science, 189, 310–322. Searle, S. R., Casella, G. & Mcculloch, C. E. (1992) Variance components. New York, Wiley. Web address for link to Multitiered experiments site: