Experimental design.

Slides:



Advertisements
Similar presentations
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
Advertisements

Two and more factors in analysis of variance
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Statistics in Science  Role of Statistics in Research.
LSU-HSC School of Public Health Biostatistics 1 Statistical Core Didactic Introduction to Biostatistics Donald E. Mercante, PhD.
Experimental design in environmental assessment  Environmental sampling and analysis (Quinn & Keough, 2003)
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
Chapter 4 Randomized Blocks, Latin Squares, and Related Designs
N-way ANOVA. Two-factor ANOVA with equal replications Experimental design: 2  2 (or 2 2 ) factorial with n = 5 replicate Total number of observations:
Designs with Randomization Restrictions RCBD with a complete factorial in each block RCBD with a complete factorial in each block –A: Cooling Method –B:
The art and science of measuring people l Reliability l Validity l Operationalizing.
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr, November 20, 2008 Analysis of Variance.
Chapter 3 Analysis of Variance
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 17: Repeated-Measures ANOVA.
Spotting pseudoreplication 1.Inspect spatial (temporal) layout of the experiment 2.Examine degrees of freedom in analysis.
Experimental Design Terminology  An Experimental Unit is the entity on which measurement or an observation is made. For example, subjects are experimental.
PSYC512: Research Methods PSYC512: Research Methods Lecture 11 Brian P. Dyre University of Idaho.
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
Magister of Electrical Engineering Udayana University September 2011
Choosing and using statistics to test ecological hypotheses
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
Experimental design. Experiments vs. observational studies Manipulative experiments: The only way to prove the causal relationships BUT Spatial and temporal.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
Research & Experimental Design Why do we do research History of wildlife research Descriptive v. experimental research Scientific Method Research considerations.
Psych 5500/6500 Other ANOVA’s Fall, Factorial Designs Factorial Designs have one dependent variable and more than one independent variable (i.e.
Blocks and pseudoreplication
Intermediate Applied Statistics STAT 460 Lecture 17, 11/10/2004 Instructor: Aleksandra (Seša) Slavković TA: Wang Yu
Experimental design.
Randomized block designs  Environmental sampling and analysis (Quinn & Keough, 2002)
Intermediate Applied Statistics STAT 460 Lecture 18, 11/10/2004 Instructor: Aleksandra (Seša) Slavković TA: Wang Yu
1 Module One: Measurements and Uncertainties No measurement can perfectly determine the value of the quantity being measured. The uncertainty of a measurement.
Smith/Davis (c) 2005 Prentice Hall Chapter Fifteen Inferential Tests of Significance III: Analyzing and Interpreting Experiments with Multiple Independent.
ANOVA Overview of Major Designs. Between or Within Subjects Between-subjects (completely randomized) designs –Subjects are nested within treatment conditions.
1 G Lect 13b G Lecture 13b Mixed models Special case: one entry per cell Equal vs. unequal cell n's.
Single Season Study Design. 2 Points for consideration Don’t forget; why, what and how. A well designed study will:  highlight gaps in current knowledge.
Stats Methods at IC Lecture 3: Regression.
Chapter 11 Analysis of Variance
Comparing Multiple Factors:
Dependent-Samples t-Test
Repeated Measures Designs
Statistics for the Social Sciences
Factorial Experiments
Statistical Core Didactic
Applied Business Statistics, 7th ed. by Ken Black
Comparing Three or More Means
PCB 3043L - General Ecology Data Analysis.
ANOVA lecture Fixed, random, mixed-model ANOVAs
Nested Designs Study vs Control Site.
Analysis of Covariance (ANCOVA)
12 Inferential Analysis.
Lecture 2: Replication and pseudoreplication
Chapter 1 – Ecological Data
Random Effects & Repeated Measures
2 independent Groups Graziano & Raulin (1997).
Chapter 11 Analysis of Variance
Statistics review Basic concepts: Variability measures Distributions
Nested Designs and Repeated Measures with Treatment and Time Effects
Main Effects and Interaction Effects
I. Statistical Tests: Why do we use them? What do they involve?
The Randomized Complete Block Design
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
Independent variables correlate with each other
Ch. 7: Randomized Experiments and Causal Inference
12 Inferential Analysis.
Product moment correlation
Null models in community ecology
A protocol for data exploration to avoid common statistical problems
One way Analysis of Variance (ANOVA)
Presentation transcript:

Experimental design

Experiments vs. observational studies Manipulative experiments: The only way to prove the causal relationships BUT Spatial and temporal limitation of manipulations Side effects of manipulations

Example of side effects – exclosures for grazing

Exclosures have significantly higher density of small rodents ????????????

The poles of fencing are perfect perching sites for birds of pray

Laboratory, field, natural trajectory (NTE), and natural snapshot experiments (Diamond 1986) NTE/NSE - Natural Trajectory/Snapshot Experiment

Observational studies (e. g Observational studies (e.g. for correlation between environment and species, or estimates of plot characteristics) Random vs. regular sampling plan

Take care Even if the plots are located randomly, some of them are (in a finite area) close to each other, and so they might be “auto-correlated” Regular pattern maximizes the distance between neighbouring plots

Regular design - biased results, when there is some regular structure in the plot (e.g. regular furrows), with the same period as is the distance in the grid - otherwise, better design providing better coverage of the area, and also enables use of special permutation tests.

Manipulative experiments frequent trade-off between feasibility and requirements of correct statistical design and power of the tests To maximize power of the test, you need to maximize number of independent experimental units For the feasibility and realism, you need plots of some size, to avoid the edge effect

Important - treatments randomly assigned to plots Completely randomized design Typical analysis: One way ANOVA

Regular patterns of individual treatment type location are often used, they usually maximize possible distance and so minimize the spatial dependence of plots getting the same treatment Similar danger as for regular sampling pattern - i.e., when there is inherent periodicity in the environment – usually very unlikely

When randomizing, your treatment allocation could be also e.g.: Regular pattern helps to avoid possible “clumping” of the same treatment plots

Randomized complete blocks For repeated measurements - adjust the blocks (and even the randomization) after the baseline measurement

ANOVA, TREAT x BLOCK interaction is the error term

If the block has a strong explanatory power, the RCB design is stronger than completely randomized one

If the block has no explanatory power, the RCB design is weak

Reminder – Covariates Use of covariates (covariables) and Analysis of Covariance – another possibility how to filter out „noise“ and decrease the unexplained variability

Latin square design In most cases rather weak test if analyzed as Latin square (i.e. column and row taken as factors in incomplete three way ANOVA) Again, useful to avoid clumping of the same treatment

Most frequent errors - pseudoreplications

Cited 4000+ times

Note, B. is in fact not a pseudoreplication, if the analysis reflects correctly the hierarchical design of the data

Logic of experiments in ecology: is pseudoreplication a pseudoissue? Oksanen, L Logic of experiments in ecology: is pseudoreplication a pseudoissue? OIKOS 94 : 27-38 Hurlbert divides experimental ecologist into 'those who do not see any need for dispersion (of replicated treatments and controls) and those who do recognize its importance and take whatever measures are necessary to achieve a good dose of it'. Experimental ecologists could also be divided into those who do not see any problems with sacrificing spatial and temporal scales in order to obtain replication, and those who understand that appropriate scale must always have priority over replication.

Reminder Type I and Type III SS If the design is balanced, you don’t need to care In non-balanced designs – Type I – sequential – the order of predictors IS important Type III, Type VI

What is the interaction?

Log transformation and the interaction If the interaction = 0 - we expect the pure additivity Effect of A+B = Effect of A + Effect of B Null model for interaction: Xijk = m + ai + bj + eijk (fertilization increases height by 5cm) Model with interaction: Xijk = m + ai + bj + gij + eijk Often biologically more feasible null model: pure multiplicativity (fertilization increases height by 20%) Null model for interaction: Xijk = m . ai . bj . eijk Then: log(Xijk) = log(m . ai . bj . eijk) = log m + log ai + log bj + log eijk

If you log-transform in a factorial ANOVA Think more about the meaning of the interaction, the distributional properties of response variable are (usually) less important

Fixed and random factors

Fertilization experiment in three countries Difference of meaning of the test, depending on whether the country is factor with fixed or random effect COUNTRY FERTIL NOSPEC 1 CZ 0.000 9.000 2 CZ 0.000 8.000 3 CZ 0.000 6.000 4 CZ 1.000 4.000 5 CZ 1.000 5.000 6 CZ 1.000 4.000 7 UK 0.000 11.000 8 UK 0.000 12.000 9 UK 0.000 10.000 10 UK 1.000 3.000 11 UK 1.000 4.000 12 UK 1.000 3.000 13 NL 0.000 5.000 14 NL 0.000 6.000 15 NL 0.000 7.000 16 NL 1.000 6.000 17 NL 1.000 6.000 18 NL 1.000 8.000

Country is a fixed factor (i. e Country is a fixed factor (i.e., we are interested in the three plots only) Summary of all Effects; design: (new.sta) 1-COUNTRY, 2-FERTIL df MS df MS Effect Effect Error Error F p-level 1 2 2.16667 12 1.055556 2.05263 .171112 2 1 53.38889 12 1.055556 50.57895 .000012 12 2 26.05556 12 1.055556 24.68421 .000056 Country is a random factor (i.e., the three plots are considered as a random selection of all plots of this type in Europe - [to make Brussels happy]) Summary of all Effects; design: (new.sta) 1-COUNTRY, 2-FERTIL df MS df MS Effect Effect Error Error F p-level 1 2 2.16667 12 1.05556 2.05263 .171112 2 1 53.38889 2 26.05556 2.04904 .288624 12 2 26.05556 12 1.05556 24.68421 .000056

Nested design („split-plot“)

Two explanatory variables, Treatment and Plot, Plot is random factor nested in Treatment. Accordingly, there are two error terms, effect of Treatment is tested against Plot, effect of Plot against residual variability: F(Treat)=MS(Treat)/MS(Plot) F(Plot)=MS(Plot)/MS(Resid) [often not of interest]

Split plot (main plots and split plots - two error levels)

ROCK is the MAIN PLOT factor, PLOT is random factor nested in ROCK, TREATMENT is the within plot (split-plot) factor. Two error levels: F(ROCK)=MS(ROCK)/MS(PLOT) F(TREA)=MS(TREA)/MS(PLOT*TREA)

Following changes in time Non-replicated BACI (Before-after-control-impact)

Analysed by two-way ANOVA factors: Time (before/after) and Location (control/impact) Of the main interest: Time*Location interaction (i.e., the temporal change is different in control and impact locations)

In fact, in non-replicated BACI, the test is based on pseudoreplications. Should NOT be used in experimental setups In impact assessments, often the best possibility (The best need not be always good enough.)

Replicated BACI - repeated measurements Usually analysed by “univariate repeated measures ANOVA”. This is in fact split-plot, where TREATment is the main-plot effect, time is the within-plot effect, individuals (or experimental units) are nested within a treatment. Of the main interest is interaction TIME*TREAT