Presentation on theme: "Repeated Measures ANOVA"— Presentation transcript:
1Repeated Measures ANOVA Starting with One-Way RMMore fascinating than a bowl of porridge1.
2One-Way Repeated Measures ANOVA Data considerationsOne interval/ratio dependent variableOne categorical independent variable of > 2 levelsAnalogous to dependent t-test, but for more than 2 levels of the independent variableSo here people are measured more than two timesAll participants participate in all levels of the IVE.g. midterm, final, quiz score for this class in SP 20091.
3One-Way Repeated Measures ANOVA Advantages of repeated measuresAgain, as per paired t-test...SensitivityReduction in error variance (subjects serve as own controls)So, more sensitive to experimental effectsEconomyNeed less participantsWith many levels, this might be even more important for ANOVA than t-test(need to be careful of fatigue effects, though)1.
4One-Way Repeated Measures ANOVA Possible serious disadvantage of RMOrder effects and treatment carry-over effects (goes for paired t-test too)E.g. three chocolates…one’s bad, the others goodShould counterbalance (randomly assign to treatment order)E.g. (for 2 levels of RM: A & B)1.2.3.Order of administration12% of subjects50AB4.
5One-Way Repeated Measures ANOVA Possible serious disadvantage of RMOrder effects (goes for paired t-test too)E.g. (for 3 levels of RM: A, B & C)Order of administration123% of subjects33ABC1. This type of control for order effects is known as a Latin Square design
6One-Way Repeated Measures ANOVA Possible serious disadvantage of RMTreatment carry-over effects (goes for paired t-test too)Even if order effects are controlled for, there must be sufficient time between treatments so that you can be sure that the score on each level of the RM is due to only one treatment (not a combination of two or more)Note – order & treatment carry-over effects are design rather than statistical issues, but very important nevertheless1.
7One-Way Repeated Measures ANOVA 1.Example, with chat about variance partitioning and assumptions...Remember the one-way between subjects ANOVA?Data looked like this in SPSSAnd the trick was to make variance due to treatments bigger than variance due to everything else (& everything else included variance due to individual differences)Well, what if you could take out variance due to individual differences?2.3.4.
8One-Way Repeated Measures ANOVA That’s what the one-way RM ANOVA doesData now looks like this, as each person is measured on all levels of the IVVariation due to individual differences can then be separated from variation due to chance, as the same people are present within each condition.1.2.
9One-Way Repeated Measures ANOVA Now a pause before we consider variance partitioning in RM ANOVA, as we see how to conduct the test.Here’s the first step1. Choose this...
10One-Way Repeated Measures ANOVA 2. Then click “add” and proceed by clicking “define”1. Type in the variable name (“drug”) where it says “factor1”, and the number of levels it has (4)
11One-Way Repeated Measures ANOVA 1. ...next you choose all the levels of the repeated measures factor (i.v.)...3.2. And slide them over to the “within subjects variables” box – just another name for repeated measures variable...or factor
12One-Way Repeated Measures ANOVA Output1.2. This first bit is from the multivariate (more than one dependent variable – 4 here) approach to repeated measures. It has some potential advantages (essentially that one does not have to meet the sphericity assumption...see next slide)
13One-Way Repeated Measures ANOVA And...more output...1.This bit is important. It’s a test of one of the more important assumptions of RM ANOVA – sphericity. It’s kind of like the homogeneity of variance test, but it’s the variance of the difference scores between the levels of the independent variable that are being tested…you really have to adjust for it, & we see how on the next slide (if this is NOT significant, it’s good)3. sphericity…2. What to do…Another important bit…the Huynh-Feldt Epsilon...see next slide
14One-Way Repeated Measures ANOVA And...still more output...Finally, the bit that counts. Note there are FOUR (count ‘em) separate versions of each effect. Here’s the rule (Schutz & Gessaroli, 1993): If Huynh-Feldt Epsilon (see previous slide) is > .7, use Huynh-Feldt adjusted F (third line). If it is less than .7, use G-G (second line)2.1.
15One-Way Repeated Measures ANOVA Same bit once again...1. Here, you can see that, as the epsilon is 1, there is no correction, and the F statistic stays the same throughout.2.
16One-Way Repeated Measures ANOVA One last bit (that you can ignore)...1.Let’s just look at this first. In the top box, you can see a bunch of stuff like “linear”, “quadratic”, & “cubic” – that’s to do with the shape of the difference that the change in scores might take as they progress from drug 1 to drug 4, and only really makes sense in trend analysis, which is again beyond our scope.Finally, down here you see “between subjects effects”. There are none here (just one I.V., and it’s RM). The error variance here is essentially a measure of individual differences, as we’ll see in a minute...2.
17One-Way Repeated Measures ANOVA So, how does the variance thing work?Let’s compare the two methods (“between subjects” and “repeated measures”) directly, bearing in mind where the variances in the output tables come fromIn this way, my goal is simply to indicate the benefit of taking out variation due to individual differencesWe’ll start with the between subjects method...(see next slide)1.2. Simply put…3.
18One-Way Repeated Measures ANOVA 5.4.Here the between groups variance is – this is just variation of mean scores on the different treatments about the overall mean...so this is the bit that is essentially the treatment effect184.108.40.206.615.6And here is the within subjects variation...it is calculated from the sum of the variation within each of the treatments about each of the treatment means323.
19One-Way Repeated Measures ANOVA 1.Now for the repeated measures version:Note that the average score for each subject across the four treatments is different. This is due to individual differences...and is the “between subjects” error variance2716233424.53.2.The thing that makes repeated measures powerful is that this variation is taken out of the within subjects error term...see next slideSum of squares = 680.8(= sum of squared deviations from the mean of these 5 scores, which is 24.9, multiplied by the # levels of the I. V.))4.
20One-Way Repeated Measures ANOVA 220.127.116.11.Error SS in non-RM ANOVA = 793.6Error SS in RM ANOVA = – = 112.8
21One-Way Repeated Measures ANOVA Now what you have to see is that the SS for the denominator in the F test in RM is now 112.8, which is derived from793.6 – = 112.81.4.individual differences2.Error variation in between subjects ANOVA3.
22One-Way Repeated Measures ANOVA 1.And finally, as a direct consequence of all this, the numerator in the F-test is unchanged (698.2), but the denominator has been reduced from 49.6 to 9.4, resulting in an increase in F from 4.69 to 24.76!2.which of course means...more significance, more power
23One-Way Repeated Measures ANOVA So, to summarizeBecause of the way RM ANOVA partitions variance for the RM factors, we have a far more powerful test for the RM factorsBut you have to make sure you control for spurious effects by controlling for order effects and carryover effectsAlso crucial that you adjust for violations of sphericity1.
24Example of interpretation of results Note partial η2 is reported tooInterpretation:A one-way repeated measures ANOVA was conducted to student’s confidence in statistics prior to the class, immediately following the class, and three months after the class. Due to a mild violation of the sphericity assumption ( = .82), the Huynh-Feldt adjusted F was used. There was a significant difference in confidence levels across time, F (1.421, ) = , p < .001, partial η2 = Dependent t-tests were used as post-hoc tests for significant differences with Bonferroni-adjusted = Confidence levels after three months (M = 25.03, SD = 5.20) were significantly higher than immediately following the class (M = 21.87, SD = 5.59), which in turn were significantly higher than pre-test levels (M = 19.00, SD = 5.37).1.2.
25ANOVA/Inferential Statistics Wrap-up Inferential tests to compare differences in groups:Independent t-tests Dependent t-tests One-way ANOVA Factorial ANOVA One-way repeated measures ANOVA Factorial repeated measures ANOVAMixed between-within groups ANOVA (split-plot)Analysis of covariance (ANCOVA)Multivariate analysis of variance (MANOVA)Nonparametric tests (next)1.2.
26Factorial RM ANOVASame notions as for factorial ANOVA – main effects, interactions and so onData set up a bit tricky
27Two-way ANOVA with repeated measures on one factor Sometimes referred to as a split plot or Lindquist type 1 or (most commonly in my experience) a “Two-way ANOVA with repeated measures on one factor.”Research question: Which diet (traditional, low carb, exercise only) is more effective in weight loss across three time periods (before diet, three months later, six months later)? Is there a weight loss across time?
28Two-way ANOVA with repeated measures on one factor Diet RQ (continued)Looks like a 3x3 two-factor ANOVA, except that one of the factors is a repeated measure (one group of subjects tested three times)As such, a two-factor between-groups ANOVA is not appropriate; rather, we have one factor that is between diet types and another that is within a single group of subjects
29Two-way ANOVA with repeated measures on one factor Use a mixed design ANOVA when:A nominal between-subjects IV with 2+ levelsA nominal within-subjects IV with 2+ levelsA continuous interval/ratio DVNote: you can add additional IV’s to this test, but just as with Factorial ANOVA, when you get 3+ IV’s, interpreting findings gets really nasty due to all of the interactions
30Two-way ANOVA with repeated measures on one factor Interactions: like a two-factor between-subjects ANOVA, there may be both main effects for each of the two IV’s plus an interaction between the two IV’s
31Analysis of Covariance An extension of ANOVA that allows you to explore differences between groups while statistically controlling for an additional continuous variableCan be used with a nonequivalent groups pre-test/post-test design to control for differences in pre-test scores with pre-existing groupsYou could use a mixed design ANOVA here, but with small sample size, ANCOVA may be a better alternative due to increased statistical powerBe careful of regression towards the mean as a cause of post-test differences (after using the covariate to adjust pre-test scores)
32Multivariate Analysis of Variance An extension of ANOVA for use with multiple dependent variablesWith multiple DV’s, you could simply use multiple ANOVA’s (one per DV), but risk inflated Type 1 errorSame reason we didn’t conduct multiple t-tests instead of an ANOVAEx. Do differences exist in GRE scores and grad school GPA based on race?There are such things as Factorial MANOVA’s, RM MANOVA’s, and even MANCOVA’s