Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 Design & Analysis of Multistratum Randomized Experiments Ching-Shui Cheng Nov. 30, 2006 National Tsing Hua University.

Similar presentations


Presentation on theme: "1 Design & Analysis of Multistratum Randomized Experiments Ching-Shui Cheng Nov. 30, 2006 National Tsing Hua University."— Presentation transcript:

1 1 Design & Analysis of Multistratum Randomized Experiments Ching-Shui Cheng Nov. 30, 2006 National Tsing Hua University

2 2 Schedule Nov. 30 Introduction, treatment and block structures, examples Dec. 1 Randomization models, null ANOVA, orthogonal designs Dec. 7 More on orthogonal designs, non-orthogonal designs Dec. 8 More complex treatment and block structures, factorial experiments

3 3 Nelder (1965a, b) The analysis of randomized experiments with orthogonal block structure, Proceedings of the Royal Society of London, Series A Fundamental work on the analysis of randomized experiments with orthogonal block structures

4 4 Bailey (1981) JRSS, Ser. A “Although Nelder (1965a, b) gave a unified treatment of what he called ‘simple’ block structures over ten years ago, his ideas do not seem to have gained wide acceptance. It is a pity, because they are useful and, I believe, simplifying. However, there seems to be a widespread belief that his ideas are too difficult to be understood or used by practical statisticians or students.”

5 5 Experimental Design Planning of experiments to produce valid information as efficiently as possible

6 6 Comparative Experiments Treatments 處理 Varieties of grain, fertilizers, drugs, …. Experimental Units Plots, patients, ….

7 7 Design: How to assign the treatments to the experimental units Fundamental difficulty: variability among the units; no two units are exactly the same. Different responses may be observed even if the same treatment is assigned to the units. Systematic assignments may lead to bias.

8 8 Suppose is an observation on the th unit, and is the treatment assigned to that unit. Assume treatment-unit additivity:

9 9 R. A. Fisher worked at the Rothamsted Experimental Station in the United Kingdom to evaluate the success of various fertilizer treatments.

10 10 Fisher found the data from experiments going on for decades to be basically worthless because of poor experimental design.  Fertilizer had been applied to a field one year and not in another in order to compare the yield of grain produced in the two years. BUT It may have rained more, or been sunnier, in different years. The seeds used may have differed between years as well.  Or fertilizer was applied to one field and not to a nearby field in the same year. BUT The fields might have different soil, water, drainage, and history of previous use.  Too many factors affecting the results were “uncontrolled.”

11 11 Fisher’s solution: Randomization 隨機化 In the same field and same year, apply fertilizer to randomly spaced plots within the field. This averages out the effect of variation within the field in drainage and soil composition on yield, as well as controlling for weather, etc. FFFFFF FFFFFFFF FFFFF FFFFFFFF FFFFF FFFF

12 12 Randomization prevents any particular treatment from receiving more than its fair share of better units, thereby eliminating potential systematic bias. Some treatments may still get lucky, but if we assign many units to each treatment, then the effects of chance will average out. In addition to guarding against potential systematic biases, randomization also provides a basis for doing statistical inference. (Randomization model)

13 13 FFFFFFFFFFFF FFFFFFFFFFFF FFFFFFFFFFFF Start with an initial design Randomly permute (labels of) the experimental units Complete randomization: Pick one of the 72! Permutations randomly

14 14 111111111111 111111222222 222222222222 333333333333 333333444444 444444444444 Pick one of the 72! Permutations randomly 4 treatments Completely randomized design

15 15 Assume treatment-unit additivity

16 16 Randomization model for a completely randomized design The ’s are identically distributed is a constant for all

17 17

18 18 Blocking: an effective method for improving precision Randomized complete block design After randomization: 完全區集設計 區集化

19 19 Incomplete block design 7 treatments

20 20 Incomplete block design Balanced incomplete block design Optimality was shown by Kiefer (1958) Randomization is performed independently within each block, and the block labels are also randomly permuted. 平衡不完全區集設計

21 21 Incomplete block design Randomize by randomly choosing one out of the (7!)(3!) 7 permutations that preserve the block structure. These permutations form a subgroup of the group of all 21! permutations of the 21 unit labels.

22 22 Block what you can and randomize what you cannot. The purpose of randomization is to average out those nuisance factors that we cannot predict or cannot control, not to destroy the relevant information we have. Choose a permutation group that preserves any known relevant structure on the units. Usually take the group for randomization to be the largest possible group that preserves the structure to give the greatest possible simplification of the model.

23 23 Two simple block (unit) structures Nesting block/unit Crossing row * column

24 24 Two simple block structures Nesting block/unit Crossing row * column Latin square

25 25

26 26 Treatment structures No structure Treatments vs. control Factorial structure

27 27 Unstructured treatments (Treatment contrast) The set of all treatment contrasts form a dimensional space (generated by all the pairwise comparisons. Might be interested in estimating pairwise comparisons or

28 28 Treatments vs control

29 29 Factorial structure Each treatment is a combination of several factors

30 30 S=2, n=3:

31 31 Interested in contrasts representing main effects and interactions of the factors

32 32 Here each is coded by 1 and -1.

33 33

34 34

35 35

36 36 Treatment structure Block structure (unit structure) Design Randomization Analysis

37 37 Choice of design Efficiency Combinatorial considerations Practical considerations

38 38 Simple orthogonal block structures Iterated crossing and nesting cover most, but not all block structures encountered in practice

39 39 Consumer testing A consumer organization wishes to compare 8 brands of vacuum cleaner. There is one sample for each brand. Each of four housewives tests two cleaners in her home for a week. To allow for housewife effects, each housewife tests each cleaner and therefore takes part in the trial for 4 weeks. 8 unstructured treatments Block structure:

40 40 A αB βC γD δ B γA δD αC β C δD γA βB α D βC αB δA γ Trojan square Optimality of Trojan squares was shown by Cheng and Bailey (1991)

41 41 t = 18

42 42 McLeod and Brewster (2004) Technometrics Blocked split plots (Split-split plots) Chrome-plating process Block structure: 4 weeks/4 days/2 runs block/wholeplot/subplot Treatment structure: A * B * C * p * q * r Each of the six factors has two levels

43 43 Hard-to-vary treatment factors A: chrome concentration B: Chrome to sulfate ratio C: bath temperature Easy-to-vary treatment factors p: etching current density q: plating current density r: part geometry

44 44 Miller (1997) Technometrics Strip-Plots Experimental objective: Investigate methods of reducing the wrinkling of clothes being laundered

45 45 Miller (1997) The experiment is run in 2 blocks and employs 4 washers and 4 driers. Sets of cloth samples are run through the washers and the samples are divided into groups such that each group contains exactly one sample from each washer. Each group of samples is then assigned to one of the driers. Once dried, the extent of wrinkling on each sample is evaluated.

46 46 Treatment structure: A, B, C, D, E, F: configurations of washers a,b,c,d: configurations of dryers

47 47 Block structure: 2 blocks/(4 washers * 4 dryers)

48 48 Block 1 Block 2 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 0 1 1 0 0 0 0 0 0 0 0 0 1 10 0 0 1 1 1 0 1 0 1 0 0 0 0 0 0 1 1 0 0 0 0 0 1 1 1 1 0 1 0 0 0 0 0 0 0 1 1 1 10 0 0 1 1 1 1 0 0 1 0 1 1 0 1 1 0 0 0 00 1 1 1 0 0 0 1 1 0 0 1 1 0 1 1 0 0 1 10 1 1 1 0 0 0 1 0 1 0 1 1 0 1 1 1 1 0 0 0 1 1 1 0 0 1 0 1 0 0 1 1 0 1 1 1 1 1 1 0 1 1 1 0 0 1 0 0 1 1 0 1 1 0 1 0 0 0 01 0 1 0 1 0 0 1 1 0 1 0 1 1 0 1 0 0 1 11 0 1 0 1 0 0 1 0 1 1 0 1 1 0 1 1 1 0 01 0 1 0 1 0 1 0 1 0 1 0 1 1 0 1 1 1 1 1 1 0 1 0 1 0 1 0 0 1 1 1 0 1 1 0 0 0 0 01 1 0 0 0 1 0 1 1 0 1 1 0 1 1 0 0 0 1 1 1 1 0 0 0 1 0 1 0 1 1 1 0 1 1 0 1 1 0 0 1 1 0 0 0 1 1 0 1 0 1 1 0 1 1 0 1 1 1 1 1 1 0 0 0 1 1 0 0 1

49 49 GenStat code factor [nvalue=32;levels=2] block,A,B,C,D,E,F,a,b,c,d & [levels=4] wash, dryer generate block,wash,dryer blockstructure block/(wash*dryer) treatmentstructure (A+B+C+D+E+F)*(A+B+C+D+E+F) +(a+b+c+d)*(a+b+c+d) +(A+B+C+D+E+F)*(a+b+c+d)

50 50 matrix [rows=10; columns=5; values=“ b r1 r2 c1 c2" 0, 0, 1, 0, 0, 0, 1, 0, 0, 0, 0, 1, 1, 0, 0, 1, 0, 1, 0, 0, 1, 1, 0, 0, 0, 1, 1, 1, 0, 0, 0, 0, 0, 0, 1, 1, 0, 0, 0, 1, 1, 0, 0, 1, 0, 0, 0, 0, 1, 0] Mkey

51 51 Akey [blockfactors=block,wash,dryer; Key=Mkey; rowprimes=!(10(2));colprimes=!(5(2)); colmappings=!(1,2,2,3,3)] Pdesign Arandom [blocks=block/(wash*dryer);seed=12345] PDESIGN ANOVA

52 52 Source of variationd.f. block stratum AD=BE=CF=ab=cd1 block.wash stratum A=BC=EF1 B=AC=DF 1 C=AB=DF1 D=BF=CE1 E=AF=CD1 F=BD=AE1

53 53 block.dryer stratum a1 b1 c1 d1 ac=bd1 bc=ad1

54 54 block.wash.dryer stratum Aa=Db1 Ba=Eb1 Ca=Fb1 Da=Ab1 Ea=Bb1 Fa=Cb1 Ac=Dd1 Bc=Ed1 Cc=Fd1 Dc=Ad1 Ec=Bd1 Fc=Cd1 Residual6 Total 31

55 55 Seven Error Terms!! Are you kidding?? T. M. Loughin et al.

56 56

57 57

58 58

59 59

60 60

61 61

62 62

63 63

64 64

65 65 Treatment structure: 3*2*3*7 Block structure: 4/((3/2/3)*7)

66 66 Factor [nvalues=504;levels=4] Block & [levels=3] Sv, Sr, Var, Rate & [levels=2] St, Time & [levels=7] Sw, Weed Generate Block, Sv, St, Sr, Sw Matrix [rows=4;columns=6; \ values="b1 b2 Col St Sr Row"\ 1, 0, 1, 0, 0, 0,\ 0, 0, 1, 1, 0, 0,\ 0, 0, 1, 1, 1, 0,\ 1, 1, 0, 0, 0, 1] Ckey Akey [blockfactor=Block,Sv,St,Sr,Sw; \ Colprimes=!(2,2,3,2,3,7);Colmappings=!(1,1,2,3,4,5);Key=Ckey] Var, Time, Rate, Weed Blocks Block/((Sv/St/Sr)*Sw) Treatments Var*Time*Rate*Weed ANOVA

67 67 Block stratum 3 Block.Sv stratum Var 2 Residual 6 Block.Sw stratum Weed 6 Residual 18 Block.Sv.St stratum Time 1 Var.Time 2 Residual 9

68 68 Block.Sv.Sw stratum Var.Weed 12 Residual 36 Block.Sv.St.Sr stratum Rate 2 Var.Rate 4 Time.Rate 2 Var.Time.Rate 4 Residual 36

69 69 Block.Sv.St.Sw stratum Time.Weed 6 Var.Time.Weed 12 Residual 54 Block.Sv.St.Sr.Sw stratum Rate.Weed 12 Var.Rate.Weed 24 Time.Rate.Weed 12 Var.Time.Rate. Weed 24 Residual 216 Total 503


Download ppt "1 Design & Analysis of Multistratum Randomized Experiments Ching-Shui Cheng Nov. 30, 2006 National Tsing Hua University."

Similar presentations


Ads by Google