Blocks and pseudoreplication

Slides:



Advertisements
Similar presentations
Multiple Analysis of Variance – MANOVA
Advertisements

Multiple Comparisons in Factorial Experiments
Lecture 28 Categorical variables: –Review of slides from lecture 27 (reprint of lecture 27 categorical variables slides with typos corrected) –Practice.
Experiments with both nested and “crossed” or factorial factors
Chapter 14Design and Analysis of Experiments 8E 2012 Montgomery 1.
Statistics review Basic concepts: Variability measures Distributions Hypotheses Types of error Common analyses T-tests One-way ANOVA Randomized block ANOVA.
N-way ANOVA. Two-factor ANOVA with equal replications Experimental design: 2  2 (or 2 2 ) factorial with n = 5 replicate Total number of observations:
© 2010 Pearson Prentice Hall. All rights reserved The Complete Randomized Block Design.
Design of Engineering Experiments - Experiments with Random Factors
Part I – MULTIVARIATE ANALYSIS
Lack of independent replicates: A common pitfall in experimental design.
1 BA 275 Quantitative Business Methods Residual Analysis Multiple Linear Regression Adjusted R-squared Prediction Dummy Variables Agenda.
Analysis of Covariance Goals: 1)Reduce error variance. 2)Remove sources of bias from experiment. 3)Obtain adjusted estimates of population means.
PSY 307 – Statistics for the Behavioral Sciences
Spotting pseudoreplication 1.Inspect spatial (temporal) layout of the experiment 2.Examine degrees of freedom in analysis.
Experimental Design Terminology  An Experimental Unit is the entity on which measurement or an observation is made. For example, subjects are experimental.
Intro to Statistics for the Behavioral Sciences PSYC 1900
Lecture 9: One Way ANOVA Between Subjects
Chapter 9 - Lecture 2 Computing the analysis of variance for simple experiments (single factor, unrelated groups experiments).
Statistics 350 Lecture 17. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Repeated Measures ANOVA Used when the research design contains one factor on which participants are measured more than twice (dependent, or within- groups.
Basic Analysis of Variance and the General Linear Model Psy 420 Andrew Ainsworth.
Some Notes on the Design and Analysis of Experiments.
Biostatistics-Lecture 9 Experimental designs Ruibin Xi Peking University School of Mathematical Sciences.
Analysis of Variance (ANOVA) Quantitative Methods in HPELS 440:210.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Selecting the Correct Statistical Test
Repeated Measures ANOVA
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 23, Slide 1 Chapter 23 Comparing Means.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Comparing Three or More Means 13.
 Combines linear regression and ANOVA  Can be used to compare g treatments, after controlling for quantitative factor believed to be related to response.
Chapter 13Design & Analysis of Experiments 8E 2012 Montgomery 1.
Statistical Modeling and Analysis of MOFEP Chong He ( with John Kabrick, Xiaoqian Sun, Mike Wallendorf) Department of Statistics University of Missouri-Columbia.
PSY 307 – Statistics for the Behavioral Sciences Chapter 16 – One-Factor Analysis of Variance (ANOVA)
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
Experimental design. Experiments vs. observational studies Manipulative experiments: The only way to prove the causal relationships BUT Spatial and temporal.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
Inferential Statistics
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
DOX 6E Montgomery1 Design of Engineering Experiments Part 9 – Experiments with Random Factors Text reference, Chapter 13, Pg. 484 Previous chapters have.
ANALYSIS OF VARIANCE (ANOVA) BCT 2053 CHAPTER 5. CONTENT 5.1 Introduction to ANOVA 5.2 One-Way ANOVA 5.3 Two-Way ANOVA.
Intermediate Applied Statistics STAT 460 Lecture 17, 11/10/2004 Instructor: Aleksandra (Seša) Slavković TA: Wang Yu
1 G Lect 11a G Lecture 11a Example: Comparing variances ANOVA table ANOVA linear model ANOVA assumptions Data transformations Effect sizes.
ANOVA Assumptions 1.Normality (sampling distribution of the mean) 2.Homogeneity of Variance 3.Independence of Observations - reason for random assignment.
ANOVA: Analysis of Variance.
STA 2023 Module 11 Inferences for Two Population Means.
1 The Two-Factor Mixed Model Two factors, factorial experiment, factor A fixed, factor B random (Section 13-3, pg. 495) The model parameters are NID random.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
One-Way Analysis of Covariance (ANCOVA)
Experimental design.
ETM U 1 Analysis of Variance (ANOVA) Suppose we want to compare more than two means? For example, suppose a manufacturer of paper used for grocery.
Chapter 13 Design of Experiments. Introduction “Listening” or passive statistical tools: control charts. “Conversational” or active tools: Experimental.
IE241: Introduction to Design of Experiments. Last term we talked about testing the difference between two independent means. For means from a normal.
ANOVA Overview of Major Designs. Between or Within Subjects Between-subjects (completely randomized) designs –Subjects are nested within treatment conditions.
ANOVA EDL 714, Fall Analysis of variance  ANOVA  An omninbus procedure that performs the same task as running multiple t-tests between all groups.
ANOVA and Multiple Comparison Tests
1 Topic 14 – Experimental Design Crossover Nested Factors Repeated Measures.
Factorial BG ANOVA Psy 420 Ainsworth. Topics in Factorial Designs Factorial? Crossing and Nesting Assumptions Analysis Traditional and Regression Approaches.
Comparing Three or More Means
Lecture 2: Replication and pseudoreplication
Chapter 5 Introduction to Factorial Designs
The Analysis of Variance
Experimental design.
A protocol for data exploration to avoid common statistical problems
Chapter 10 – Part II Analysis of Variance
Presentation transcript:

Blocks and pseudoreplication

This lecture will cover: Blocks Experimental units (replicates) Pseudoreplication Degrees of freedom

Good options for increasing sample size: More replicates More blocks False options for increasing sample size: More “repeated measurements” Pseudoreplication

Ecological rule #1: the world is not uniform! Good patch Medium patch Poor patch

3 options in assigning treatments: Randomly assign Systematic Randomized block Good patch Medium patch Poor patch

1. Randomly assign Statistically robust Pros? Cons? Good patch Medium patch Poor patch Pros? Cons? Statistically robust With small n, chance of all in a bad patch

1. Randomly assign Good patch Medium patch Poor patch What’s the chance of total spatial segregation of treatments? Pros? Cons?

2. Systematic No clumping possible Pros? Cons? Good patch Medium patch Poor patch Pros? Cons? No clumping possible Violates random assumption of statistics…but is this so bad?

3. Randomized block BLOCK A BLOCK B BLOCK C Good patch Medium patch Poor patch BLOCK A BLOCK B BLOCK C

3. Randomized block Note: BLOCK A BLOCK B BLOCK C Note: Do not have to know if patches differ in quality Must have all treatment combinations represented in each block If WANT to test treatment x block interaction, need replication within blocks

How to analyze a blocked design in JMP (Method 1) Basic stats> Oneway. Add response variable, treatment (“grouping”) and block. Click OK

How to analyze a blocked design in JMP (Method 2) Open fit model tab. Enter y-variable. Add treatment, block and –if desired- treatment x block to “effects”. Click on block in effects box and change attributes to random. 4. Change Method option to EMS (not REML)

Good options for increasing sample size: More replicates More blocks False options for increasing sample size: More “repeated measurements” Pseudoreplication

Experimental unit Scale at which independent applications of the same treatment occur Also called “replicate”, represented by “n” in statistics

Experimental unit Example: Effect of fertilization on caterpillar growth

What is our per treatment sample size? What is our treatment n? Experimental unit ? + F - F + F - F What is our per treatment sample size? What is our treatment n? n=2

Experimental unit ? + F - F n=1

Pseudoreplication Misidentifying the scale of the experimental unit; Assuming there are more experimental units (replicates, “n”) than there actually are

Why is this a pseudoreplicated design? + F - F

Example 1. Hypothesis: Insect abundance is higher in shallow lakes

Example 1. Experiment: Sample insect abundance every 100 m along the shoreline of a shallow and a deep lake

Example 2. What’s the problem ? Spatial autocorrelation

Example 2. Hypothesis: Two species of plants have different growth rates

Example 2. Experiment: Mark 10 individuals of sp. A and 10 of sp. B in a field. Follow growth rate over time If the researcher declares n=10, could this still be pseudoreplicated?

Example 2.

Example 2. time

Temporal pseudoreplication: Multiple measurements on SAME individual, treated as independent data points time time

Spotting pseudoreplication Inspect spatial (temporal) layout of the experiment Examine degrees of freedom in analysis

Degrees of freedom (df) Number of independent terms used to estimate the parameter = Total number of datapoints – number of parameters estimated from data

Example: Variance If we have 3 data points with a mean value of 10, what’s the df for the variance estimate? Independent term method: Can the first data point be any number? Yes, say 8 Can the second data point be any number? Yes, say 12 Can the third data point be any number? No – as mean is fixed ! Variance is  (y – mean)2 / (n-1)

Example: Variance If we have 3 data points with a mean value of 10, what’s the df for the variance estimate? Independent term method: Therefore 2 independent terms (df = 2)

Example: Variance If we have 3 data points with a mean value of 10, what’s the df for the variance estimate? Subtraction method Total number of data points? 3 Number of estimates from the data? 1 df= 3-1 = 2

Therefore 2 parameters estimated simultaneously Example: Linear regression Y = mx + b Therefore 2 parameters estimated simultaneously (df = n-2)

Example: Analysis of variance (ANOVA) A B C a1 b1 c1 a2 b2 c2 a3 b3 c3 a4 b4 c4 What is n for each level?

Example: Analysis of variance (ANOVA) A B C a1 b1 c1 a2 b2 c2 a3 b3 c3 a4 b4 c4 df = 3 df = 3 df = 3 n = 4 How many df for each variance estimate?

Example: Analysis of variance (ANOVA) A B C a1 b1 c1 a2 b2 c2 a3 b3 c3 a4 b4 c4 df = 3 df = 3 df = 3 What’s the within-treatment df for an ANOVA? Within-treatment df = 3 + 3 + 3 = 9

Example: Analysis of variance (ANOVA) A B C a1 b1 c1 a2 b2 c2 a3 b3 c3 a4 b4 c4 If an ANOVA has k levels and n data points per level, what’s a simple formula for within-treatment df? df = k(n-1)

Spotting pseudoreplication An experiment has 10 fertilized and 10 unfertilized plots, with 5 plants per plot. The researcher reports df=98 for the ANOVA (within-treatment MS). Is there pseudoreplication?

Spotting pseudoreplication An experiment has 10 fertilized and 10 unfertilized plots, with 5 plants per plot. The researcher reports df=98 for the ANOVA. Yes! As k=2, n=10, then df = 2(10-1) = 18

Spotting pseudoreplication An experiment has 10 fertilized and 10 unfertilized plots, with 5 plants per plot. The researcher reports df=98 for the ANOVA. What mistake did the researcher make?

Spotting pseudoreplication An experiment has 10 fertilized and 10 unfertilized plots, with 5 plants per plot. The researcher reports df=98 for the ANOVA. Assumed n=50: 2(50-1)=98

Why is pseudoreplication a problem? Hint: think about what we use df for!

How prevalent? Hurlbert (1984): 48% of papers Heffner et al. (1996): 12 to 14% of papers