Presentation on theme: "Planned Comparisons & Post Hoc Tests"— Presentation transcript:
1Planned Comparisons & Post Hoc Tests Comparing Cell MeansPlanned Comparisons & Post Hoc Tests
2QuestionsWhat is the main difference between planned comparisons and post hoc tests?Generate numbers (like 0 1, -1 or 1 –1/2, -1/2) to create a contrast appropriate for a given problem.How many independent comparisons can be made in a given design?What is the difference between a per comparison and a familywise error rate?How does Bonferroni deal with familywise error rate problems?What is the studentized range statistic? How is it used?
3Questions (2)What is the difference between the Tukey HSD and the Newman-Keuls?What are the considerations when choosing a post hoc test (what do you need to trade-off)?Describe (make up) a concrete example where you would use planned comparisons instead of an overall F test. Explain why the planned comparison is the proper analysis.Describe (make up) a concrete example where you would use a post hoc test. Explain why the post hoc test is needed (not the specific choice of post hoc test, but rather why post hoc test at all).
4Planned vs. Post Hoc Planned Comparisons or Contrasts Use instead of overall F test. Planned before the study.Post Hoc or Incidental tests.Use after significant overall F test to investigate specific means. No specific plan before study.ControlComp TutorComp Tutor+LabComp Tutor + lab + quiz
5Planned Comparisons (1) Population Comparison:Weights are real numbers not all zero. Sum of weights must equal zero.Sample Comparison:
6Planned Comparison (2) A1 A2 A3 A4 Source SS df MS F (Data)(3 possible comparisons)A1A2A3A422262821152731172418232025ComparisonA1A2A3A411/2-1/22-13SourceSSdfMSFCells (A1-A4)21937312.17Error72126Total29115(Summary Table)
7Sampling Variance of Planned Comparisons The sample comparison is an unbiased estimate of the population comparison.The variance of the sampling distribution of the comparison:Sampling variance will be large when within cells variance is large, the weights are large, and the number of people in each cell is small. Estimated by:We substitutefor
8Significance Test A1 A2 A3 A4 Source SS df MS F df=N-J; 16-4=12=dfe. 22262821152731172418232025SourceSSdfMSFCells (A1-A4)21937312.17Error72126Total29115df=N-J; 16-4=12=dfe.t(12) =-2.86, p < .05
9Significance Test A1 A2 A3 A4 Source SS df MS F df=N-J= 22 26 28 21 15 2731172418232025SourceSSdfMSFCells (A1-A4)21937312.17Error72126Total29115df=N-J=
10ReviewWhat is the main difference between planned comparisons and post hoc tests?Suppose I do a blind orange juice taste test and discover that my means are:TropicanaFlorida FreshPulpmaster220.127.116.11If my hypothesis is that Tropicana is better than all others, what are my contrast weights?
11Independence of Planned Comparisons You can make several planned comparisons on the same data.Some of these comparisons are independent; some are dependent. We want them independent. Two comparisons from a normal population with equal sample sizes in each cell are independent if the sum of the products of weights is zero.With unequal sample sizes, it’s:
12Independence (2) One and two are orthogonal; one and three are not. ComparisonA1A2A3A41-1/32-1/231/2One and two are orthogonal; one and three are not.There are J-1 orthogonal comparisons. Use only what you need.
13Choosing ComparisonsUsually done on basis of theory. But there are methods to generate all possible orthogonal comparisons.Group12345Comparison 1-1
14Error Rates With 1 test, we set alpha = Type I error rate. With multiple tests, original (nominal) alpha is called the per comparison error rate ( ).With comparisons, we have a family of tests on the same data. Want to know the probability of at least 1 Type I error in the family of tests. Such a probability is called familywise error rate ( ).For independent tests,E.g., 10 tests:
15Bonferroni TestsFamilywise error depends on the number of tests (K) and the nominal alpha,Bonferroni’s solution is to set:Suppose we want FW error to be .05 and we will have 4 comparisons. ThenWhere is an aspiration level.We use the adjusted alpha (.0125) for each of the 4 tests.
16Bonferroni Test (2)Use the adjusted alpha (e.g., .0125) for each comparison.Look at the p value on the printout (use instead of .05).Use a statistical function (e.g., Excel, SAS) if you want to find the critical value.E.g., Excel function TINV says with p=.0125 and df=12, t is 2.93.
17Review How many independent comparisons can be made in a given design? What is the difference between a per comparison and a familywise error rate?How does Bonferroni deal with familywise error rate problems?
18Post Hoc Tests Given a significant F, where are the mean differences? Often do not have planned comparisons.Usually compare pairs of means.There are many methods of post hoc (after the fact) tests.
19SchefféCan use for any contrast. Follows same calculations, but uses different critical values.Instead of comparing the test statistic to a critical value of t, use:Where the F comes from the overall F test (J-1 and N-J df).
20Scheffé (2) Source SS df MS F (Data from earlier problem.) Cells (A1-A4)21937312.17Error72126Total29115(Data from earlier problem.)The comparison is not significant because |-2.86|<3.24.
21Paired comparisonsNewman Keuls and Tukey HSD are two (of many) choices. Both depend on q, the studentized range statistic. Suppose we have J independent sample means and we find the largest and the smallest.MSerror comes from the ANOVA we did to get the J means. The n refers to sample size per cell. If two cells are unequal, use 2n1n2/(n1+n2).The sampling distribution of q depends on k, the number of means covered by the range (max-min), and on v, the degrees of freedom for MSerror.
22Tukey HSDHSD = honestly significant difference. For HSD, use k = J, the number of groups in the study. Choose alpha, and find the df for error. Look up the value qα. Then find the value:Compare HSD to the absolute value of the difference between all pairs of means. Any difference larger than HSD is significant.
23HSD 2Grp ->12345M ->6382807770SourceSSdfMSFpGrps2942.4725.64.13<.05Error9801.055178.2K = 5 groups; n=12 per group, v has 55 df. Tabled value of q with alpha =.05 is 3.98.Group154326371417*19*701012778082
24Newman-Keuls Layer refers to how many means apart. Layer 4 Layer 3 Group123451 63714*17*19*2 7010123 774 805 82Layer 4Layer 3Layer 2Layer 1Same as HSD except the value of q changes with layers. For layer k-1 (here 4), use HSD. For each layer down, subtract 1 from the value of k for the tabled value of q.
25Comparing Post Hoc Tests The Newman-Keuls found 3 significant differences in our example. The HSD found 2 differences. If we had used the Bonferroni approach,we would have found an interval of required for significance (and therefore the same two significant as HSD). Thus, power descends from the Newman-Keuls to the HSD to the Bonferroni. The type I error rates go just the opposite, the lowest to Bonferroni, then HSD and finally Newman-Keuls. Do you want to be liberal or conservative in your choice of tests? Type I error vs Power.
26Review What is the studentized range statistic? How is it used? What is the difference between the Tukey HSD and the Newman-Keuls?What are the considerations when choosing a post hoc test (what do you need to trade-off)?Describe (make up) a concrete example where you would use planned comparisons instead of an overall F test. Explain why the planned comparison is the proper analysis.Describe (make up) a concrete example where you would use a post hoc test. Explain why the post hoc test is needed (not the specific choice of post hoc test, but rather why post hoc test at all).